Train Test Split

Label rows as part of the train or test set based off of percentage split you want to apply to the data.

If you want a row-wise random sample applied, do not pass an order_by column. If you want an ordered split, then pass the order_by column.

Parameters

Example

ds = rasgo.get.dataset(id)

ds2 = ds.train_test_split(order_by = ['DATE'],
    train_percent = 0.8)
ds2.preview()

ds2b = ds.train_test_split(train_percent = 0.8)
ds2b.preview()

Source Code

Last updated