📜  结合稀疏类 - Python 代码示例

📅  最后修改于: 2022-03-11 14:45:06.933000             🧑  作者: Mango

代码示例1
categorical_features=[feature for feature in dataset.columns if dataset[feature].dtype=='O']
for feature in categorical_features:   
    temp=dataset[feature].value_counts(normalize=True)
    temp_df=temp[temp>0.01].index                     
    
    dataset[feature]=np.where(dataset[feature].isin(temp_df),dataset[feature],'Rare_var')
                                   # condition satisfies then   'X'            else'Y'
                                   #condition---------->   ,take this values else,'Rare_var'