I am new here so please let me know if I can improve myself to be clearer.
I would like to predict absenteeism of employees, so I have to make a factor of this numerical variable. The data is skewed right, so I would like to transfer the observations equal over every category, so that my prediction is more accurate. I would like to have a new variable "Group" that divides all observation equal in to 1, 2 or 3.
The problem is that I have a issue with making this factor with equal n. I tried many possibilities from this topic: splitting a continuous variable into equal sized groups, such as cut, cut2 and Hmisc. All option seem straightforward, but if I apply them, the categorisch are now equal divided.
I hope someone can help me, I am really curious why the above methods are not working for me. I would like an answer with from a basic library. I understand that it would be easier if I display the dataset here as well, but I do not yet know how I can do that.
Thanks in advance!