Train Test Split
Last updated
Last updated
Label rows as part of the train or test set based off of percentage split you want to apply to the data.
If you want a row-wise random sample applied, do not pass an order_by column. If you want an ordered split, then pass the order_by column.
Name | Type | Description | Is Optional |
---|---|---|---|
order_by | column_list | Optional argument that affects the train/test split method applied. if needed, pass the names of column(s) you want to order by when applying the split. | True |
train_percent | int | Percent of the data you want in the train set, expressed as a decimal (i.e. .8). The rest of the rows will be included in the test set. |