publish.features_from_source()

Create features using a Rasgo DataSource table

Parameters

data_source_id:int:ID of a Rasgo DataSource

features:List[str]: (Optional) A list of column names in the DataSource table that should be registered as Features. If no value is passed, every column in the source that is not listed in the `dimensions` parameter is registered as a Feature.

dimensions:List[str]:A list of column names in the DataSource table that should be registered as Dimensions

granularity:List[str]:A list of strings that describes the grain of the dimensions

feature_set_name:str: (Optional) Name for this set of Features

sandbox:bool: (Optional) True = mark these features as Sandbox (not Production-ready) | False = mark these features as Production-ready (default is True)

if_exists:str: (Optional) What to do when a FeatureSet already exists against this table: fail - returns an error message | return - returns the existing FeatureSet without operating on it | edit - edits the existing FeatureSet | new - creates a new FeatureSet
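Because `if_exists` is a free-form string, a typo is easy to make. The sketch below checks a value against the four documented options before a publish call is made. The helper `check_if_exists` and the `IF_EXISTS_MODES` table are hypothetical names introduced for illustration; they are not part of pyrasgo, and the descriptions are copied from the parameter list above.

```python
# Hypothetical helper, not part of pyrasgo: validates an `if_exists` value
# against the four options documented for this method.
IF_EXISTS_MODES = {
    "fail": "return an error message if a FeatureSet already exists against this table",
    "return": "return the existing FeatureSet without operating on it",
    "edit": "edit the existing FeatureSet",
    "new": "create a new FeatureSet",
}


def check_if_exists(mode: str) -> str:
    """Return `mode` unchanged if it is a documented option, else raise ValueError."""
    if mode not in IF_EXISTS_MODES:
        raise ValueError(
            f"if_exists must be one of {sorted(IF_EXISTS_MODES)}, got {mode!r}"
        )
    return mode


if __name__ == "__main__":
    mode = check_if_exists("return")
    print(f"if_exists={mode!r}: {IF_EXISTS_MODES[mode]}")
```

The validated value can then be passed as the `if_exists` argument.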

Return Object

Rasgo FeatureSet

Sample Usage

Create features from an existing source

# Assumes `rasgo` is an authenticated pyrasgo connection
dimensions = ['DATE']
features = ['WEEKFROMTODAY', 'TEMPINCELCIUS']

featureset = rasgo.publish.features_from_source(
    data_source_id=100,
    dimensions=dimensions,
    features=features,
    granularity=['day'],
    feature_set_name='My Sandbox Features',
    sandbox=True
)
print('FeatureSet:', featureset)

When a "features" list is passed, columns in your DataSource table that are not referenced in either the "dimensions" or "features" list are ignored.
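The column-selection rule can be sketched in plain Python as follows. This is an illustration of the documented behavior, not Rasgo's implementation: `split_columns` is a hypothetical helper, and the `HUMIDITY` column is invented for the example.

```python
from typing import List, Optional, Tuple


def split_columns(
    source_columns: List[str],
    dimensions: List[str],
    features: Optional[List[str]] = None,
) -> Tuple[List[str], List[str]]:
    """Return (registered_features, ignored_columns) under the documented rule."""
    if features is None:
        # No explicit list: every non-dimension column is registered as a feature.
        registered = [c for c in source_columns if c not in dimensions]
    else:
        # Explicit list: only the named columns are registered.
        registered = list(features)
    # Anything that is neither a dimension nor a registered feature is ignored.
    ignored = [
        c for c in source_columns
        if c not in dimensions and c not in registered
    ]
    return registered, ignored


if __name__ == "__main__":
    # HUMIDITY is a made-up extra column in the source table.
    columns = ["DATE", "WEEKFROMTODAY", "TEMPINCELCIUS", "HUMIDITY"]
    print(split_columns(columns, ["DATE"], ["WEEKFROMTODAY", "TEMPINCELCIUS"]))
    print(split_columns(columns, ["DATE"]))
```

With an explicit `features` list, `HUMIDITY` lands in the ignored group; with `features` omitted, it is registered alongside the other non-dimension columns.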

Best Practices / Tips
