Analyzing Type I Error Rates in Imbalanced Scenarios with Varying Sample Sizes
This study investigates the Type I error rates in imbalanced scenarios characterized by frequent and rare events. We analyze different sample sizes—from small (1000) to large (5000)—and assess their impacts on error rates using the IVDL and KHDL metrics. The analysis reveals significant differences in error rates across varying conditions, highlighting the challenges in detecting effects in imbalanced datasets. Our findings contribute to the understanding of statistical power in implications for future research and applications.
Analyzing Type I Error Rates in Imbalanced Scenarios with Varying Sample Sizes
E N D
Presentation Transcript
Imbalanced Scenario () a. frequent events with b. frequent eventswith 0.4 0.4 IVDL 0.3 0.3 KHDL Small sample size 0.2 0.2 Moderate sample size Large sample size 0.1 0.1 0.0 0.0 1000 2000 3000 4000 5000 1000 2000 3000 4000 5000 Type I error c. rare events with d. rare events with 0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0.0 0.0 1000 2000 3000 4000 1000 2000 3000 4000 • Total number of individuals in the loop Appendix Figure 1