I'm currently working with a dataset of about 1,500 samples in which each label/response variable is a pair: the number of attempts made for that sample and the number of those attempts that succeeded.
In other words, Sample 1 might be "attempted 25 times, succeeded 5 times", while Sample 2 is "attempted 12 times, succeeded once".
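
For concreteness, here's a minimal sketch of the data layout (the column names are just illustrative, not from any particular library):

```python
import pandas as pd

# Toy version of the dataset: one row per sample, recording how many
# times that sample was attempted and how many attempts succeeded.
df = pd.DataFrame({
    "sample_id": [1, 2, 3, 4],
    "attempts":  [25, 12, 300, 1],
    "successes": [5, 1, 120, 1],
})

# The observed success rate is what I'd like the model to learn, but
# its reliability varies wildly with the number of attempts.
df["success_rate"] = df["successes"] / df["attempts"]
```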
Unfortunately, the number of attempts varies greatly across samples: some have as many as 300 attempts, while others have only a single attempt. For the high-attempt samples we have reasonable confidence in the observed success rate, but for the single-attempt samples we have almost none.
When I try to use this data to build a model, the model becomes heavily biased by single-attempt samples that happen to succeed (one success out of one attempt looks like a 100% success rate). So far my solution has just been to drop any sample with fewer than a minimum number of attempts (I arbitrarily chose 10), but this costs a significant fraction of the data (around 30% of samples).
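
Roughly, my current workaround looks like this (continuing from the toy DataFrame above):

```python
# Drop samples with fewer than a minimum number of attempts;
# the threshold of 10 is arbitrary.
MIN_ATTEMPTS = 10

filtered = df[df["attempts"] >= MIN_ATTEMPTS]
dropped = 1 - len(filtered) / len(df)
print(f"dropped {dropped:.0%} of samples")  # ~30% on the real dataset
```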
Any ideas on how I can better handle this problem?