Links

Train Test Split

Label rows as part of the train or test set based off of percentage split you want to apply to the data.
If you want a row-wise random sample applied, do not pass an order_by column. If you want an ordered split, then pass the order_by column.

Parameters

Name
Type
Description
Is Optional
order_by
column_list
Optional argument that affects the train/test split method applied. if needed, pass the names of column(s) you want to order by when applying the split.
True
train_percent
int
Percent of the data you want in the train set, expressed as a decimal (i.e. .8). The rest of the rows will be included in the test set.

Example

ds = rasgo.get.dataset(id)
ds2 = ds.train_test_split(order_by = ['DATE'],
train_percent = 0.8)
ds2.preview()
ds2b = ds.train_test_split(train_percent = 0.8)
ds2b.preview()

Source Code

Last modified 1yr ago