sklearn.utils.random.sample_without_replacement?

sklearn.utils.random.sample_without_replacement()

采樣整數而不進行替換。

從集合[0,n個總體]中選擇n個樣本整數，不進行替換。

參數	說明
n_population	int, 樣本集的大小。
n_samples	int, 要采樣的整數。
random_state	int, RandomState instance or None, optional (default=None) 如果為int，則random_state是隨機數生成器使用的種子；否則為false。如果是RandomState實例，則random_state是隨機數生成器；如果為None，則隨機數生成器是np.random使用的RandomState實例。
method	“auto”, “tracking_selection”, “reservoir_sampling” or “pool” 如果method 為“ auto”，則使用n_samples / n_population的比率來確定要使用的算法：如果比率在0到0.01之間，則使用跟蹤選擇。如果比率在0.01到0.99之間，則使用numpy.random.permutation。如果比率大于0.99，則使用儲層采樣。所選整數的順序不確定。如果需要隨機順序，則應將選定的子集混洗。如果方法為“ tracking_selection”，則使用基于集合的實現，適用于n_samples <<< n_population。如果方法為“ reservoir_sampling”，則使用適合于高內存約束或當O（n_samples）?O（n_population）時使用的存儲庫采樣算法。所選整數的順序不確定。如果需要隨機順序，則應將選定的子集混洗。如果method為“ pool”，則基于池的算法特別快，甚至比跟蹤選擇方法快。但是，必須初始化包含整個總體的向量。如果n_samples?n_population，則儲層采樣方法更快。

返回值	說明
out	array of size (n_samples, ) 整數的采樣子集。所選整數的子集可能不會被隨機化，請參見method參數。