publish.features_from_source_code()

Parameters

data_source_id:int:ID of a Rasgo DataSource

source_code_type:str:Valid values are "sql" or "python"

sql_definition:str: (Optional)Valid SQL select statement that will be used to make a view in the DataWarehouse. Mandatory param if source_code_type = "sql"

python_function:FunctionType: (Optional)Valid python function that accepts a df as a param and returns a df of features. Mandatory param if source_code_type = "python"

features:List[str]: (Optional)A list of column names in the DataSource table that should be registered as Features. If no value is passed, all columns in the source that are not listed in the `dimensions` parameter will be registered as features.

dimensions:List[str]:A list of column names in the DataSource table that should be registered as Dimensions

granularity:List[str]:A list of strings that describes the grain of the dimensions

feature_set_name:str: (Optional)Name for this set of Features

sandbox:bool: (Optional)True = mark these features are Sandbox (not Production-ready) | False = mark these features are Production-ready (default is True)

if_exists:str: (Optional) fail - returns an error message if a featureset already exists against this table | return - returns the featureset without operating on it | edit - edits the existing featureset | new - creates a new featureset

Return Object

Rasgo FeatureSet

Sample Usage

Create features from an existing source

Columns in your DataSource table that are not referenced in either the "dimensions" or "features" list will be ignored

Best Practices / Tips

Last updated

Was this helpful?