📜  pyspark random sampling - Python code example

📅  Last modified: 2022-03-11 14:47:07.049000             🧑  Author: Mango

Code example 1
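# df is assumed to be an existing pyspark.sql.DataFrame.
# withReplacement=True allows the same row to appear more than once in the sample; False does not.
# fraction=0.5 is the expected share of rows to keep; the resulting sample size is approximate, not exact.
# seed=5 makes the sample reproducible.
df.sample(withReplacement=True, fraction=0.5, seed=5)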
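
The fraction argument is a probability rather than an exact row count, which can be surprising. The block below is a minimal pure-Python sketch of sampling without replacement, where each row is kept independently with probability `fraction`. It is an illustration only, not Spark's implementation; the helper name `bernoulli_sample` and the `rows` list are hypothetical and do not exist in PySpark.

```python
import random


def bernoulli_sample(rows, fraction, seed=None):
    """Keep each row independently with probability `fraction` (no duplicates).

    Hypothetical helper for illustration; it only mimics the idea behind
    sampling without replacement and is not part of PySpark.
    """
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be in [0.0, 1.0]")
    rng = random.Random(seed)  # a fixed seed gives a reproducible sample
    return [row for row in rows if rng.random() < fraction]


rows = list(range(100))
first = bernoulli_sample(rows, 0.5, seed=5)
second = bernoulli_sample(rows, 0.5, seed=5)

print(len(first))       # near 50, but not guaranteed to be exactly 50
print(first == second)  # the same seed reproduces the same sample
```

If an exact number of rows is needed, a fraction-based sample is not enough on its own; the fraction only controls the expected size.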