publish.features_from_source_code()

Parameters

data_source_id:int:ID of a Rasgo DataSource

source_code_type:str:Valid values are "sql" or "python"

sql_definition:str: (Optional)Valid SQL select statement that will be used to make a view in the DataWarehouse. Mandatory param if source_code_type = "sql"

python_function:FunctionType: (Optional)Valid python function that accepts a df as a param and returns a df of features. Mandatory param if source_code_type = "python"

features:List[str]: (Optional)A list of column names in the DataSource table that should be registered as Features. If no value is passed, all columns in the source that are not listed in the `dimensions` parameter will be registered as features.

dimensions:List[str]:A list of column names in the DataSource table that should be registered as Dimensions

granularity:List[str]:A list of strings that describes the grain of the dimensions

feature_set_name:str: (Optional)Name for this set of Features

sandbox:bool: (Optional)True = mark these features are Sandbox (not Production-ready) | False = mark these features are Production-ready (default is True)

if_exists:str: (Optional) fail - returns an error message if a featureset already exists against this table | return - returns the featureset without operating on it | edit - edits the existing featureset | new - creates a new featureset

Return Object

Rasgo FeatureSetarrow-up-right

Sample Usage

Create features from an existing source

circle-info

Columns in your DataSource table that are not referenced in either the "dimensions" or "features" list will be ignored

Best Practices / Tips

circle-exclamation

Last updated