should i sample different size data variables, until they all have the same size?

28 Views Asked by At

my dataset has a column 'sales_method' with 3 values A B C but they have different counts of 7400 4900 2500... should i sample for the smaller categories until they all have 7400 entries? or should i sample the big ones for only 2500 rows ? i have other numbers to compare about like 'revenue'. and maybe it can be inclined to bigger size = bigger revenue. (so sample whole rows without permutation I'm assuming). this could be just to make numpy functions work, when they need input of the same sizes.

0

There are 0 best solutions below