Train Test Split
Label rows as part of the train or test set based off of percentage split you want to apply to the data.
If you want a row-wise random sample applied, do not pass an order_by column. If you want an ordered split, then pass the order_by column.
Parameters
Name | Type | Description | Is Optional |
---|---|---|---|
order_by | column_list | Optional argument that affects the train/test split method applied. if needed, pass the names of column(s) you want to order by when applying the split. | True |
train_percent | int | Percent of the data you want in the train set, expressed as a decimal (i.e. .8). The rest of the rows will be included in the test set. |
Example
Source Code
Last updated