Train Test Split
Label rows as part of the train or test set based off of percentage split you want to apply to the data.
If you want a row-wise random sample applied, do not pass an order_by column. If you want an ordered split, then pass the order_by column.
Name | Type | Description | Is Optional |
---|---|---|---|
order_by | column_list | Optional argument that affects the train/test split method applied. if needed, pass the names of column(s) you want to order by when applying the split. | True |
train_percent | int | Percent of the data you want in the train set, expressed as a decimal (i.e. .8). The rest of the rows will be included in the test set. | |
ds = rasgo.get.dataset(id)
ds2 = ds.train_test_split(order_by = ['DATE'],
train_percent = 0.8)
ds2.preview()
ds2b = ds.train_test_split(train_percent = 0.8)
ds2b.preview()
Last modified 1yr ago