Join

Join a dataset with another dataset, by matching on one or more columns between the two tables.

If you pass a join_prefix, all column names in the join table will be named "{join_prefix}_{columnname}". If you don't pass a join_prefix, columns that share the same name in both tables will be only have the column from the base table included in the final output.

Parameters

NameTypeDescriptionIs Optional

join_table

table

Dataset object to join with the source dataset.

join_type

join_type

LEFT, RIGHT, or INNER

join_columns

join_dict

Columns to use for the join. Keys are columns in the source_table and values are on columns in the join_table.

join_prefix

value

Prefix all columns in the join_table with a string to differentiate them

True

filters

filter_list

Filter logic on one or more columns. Can choose between a simple comparison filter or advanced filter using free text.

True

Example

internet_sales = rasgo.get.dataset(74)
product = rasgo.get.dataset(75)

ds2 = internet_sales.join(
  join_table=product,
  join_columns={'PRODUCTKEY':'PRODUCTKEY'},
  join_type='LEFT',
  join_prefix='product',
  filters=['CUSTOMERKEY IS NOT NULL', 'ORDERDATE < CURRENT_DATE()'])

ds2.preview()

Source Code

Last updated