Replace Missing

Replace missing values in one or more columns with the column's mean, median, or mode, or with a fixed value.

Parameters

| Name | Type | Description | Is Optional |
| --- | --- | --- | --- |
| replacements | imputation_dict | Dictionary whose keys are the names of the columns to fill and whose values are either a replacement strategy ('mean', 'median', or 'mode') or a literal value to fill with. | |
| flag_missing_vals | boolean | Use True to create an indicator column for when a value was replaced. This column will be named like '<col_name>_missing_flag'. | True |
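
To make the parameter semantics concrete, here is a minimal plain-Python sketch of the behavior described above. It is an illustration only, not Rasgo's implementation. The function `replace_missing`, the list-of-dicts row format, the use of `None` for a missing value, and the 1/0 encoding of the flag column are all assumptions made for this example.

```python
from statistics import mean, median, mode


def replace_missing(rows, replacements, flag_missing_vals=False):
    """Return copies of `rows` with None values filled per `replacements`.

    Hypothetical helper for illustration; not part of the Rasgo API.
    """
    # Resolve each strategy to one fill value, computed from the
    # non-missing values of that column.
    fill_values = {}
    for col, strategy in replacements.items():
        present = [row[col] for row in rows if row[col] is not None]
        if strategy == "mean":
            fill_values[col] = mean(present)
        elif strategy == "median":
            fill_values[col] = median(present)
        elif strategy == "mode":
            fill_values[col] = mode(present)
        else:
            # Anything else is treated as a literal replacement value.
            fill_values[col] = strategy

    result = []
    for row in rows:
        new_row = dict(row)
        for col, fill in fill_values.items():
            was_missing = row[col] is None
            if was_missing:
                new_row[col] = fill
            if flag_missing_vals:
                # Assumed encoding: 1 where a value was replaced, else 0.
                new_row[f"{col}_missing_flag"] = 1 if was_missing else 0
        result.append(new_row)
    return result


rows = [
    {"MONTH": 1, "YEAR": "2020"},
    {"MONTH": None, "YEAR": None},
    {"MONTH": 3, "YEAR": "2020"},
]
filled = replace_missing(
    rows, {"MONTH": "mean", "YEAR": "2021"}, flag_missing_vals=True
)
```

In this sketch the second row ends up with MONTH set to the mean of 1 and 3 and YEAR set to the string '2021', and every row gains a MONTH_missing_flag and a YEAR_missing_flag entry.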

Example

# `rasgo` is assumed to be an already-connected Rasgo client,
# and `id` the ID of an existing dataset.
ds = rasgo.get.dataset(id)

ds2 = ds.replace_missing(
  replacements={
      'MONTH': 'mean',            # Replace with mean
      'FIPS': 'median',           # Replace with median
      'COVID_NEW_CASES': 'mode',  # Replace with mode
      'YEAR': '2021',             # Replace with the string '2021'
      'COVID_DEATHS': 2.45        # Replace with the number 2.45
  },
  flag_missing_vals=True)

ds2.preview()
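
Because flag_missing_vals is True in this example, each column listed in replacements should get a companion indicator column. The short sketch below derives the expected names from the '<col_name>_missing_flag' pattern given in the parameter description. It is plain Python and does not call Rasgo. The exact casing of the resulting column names in the warehouse is an assumption, since some databases normalize identifier case.

```python
# Same replacements dictionary as in the example above.
replacements = {
    "MONTH": "mean",
    "FIPS": "median",
    "COVID_NEW_CASES": "mode",
    "YEAR": "2021",
    "COVID_DEATHS": 2.45,
}

# One indicator column per key, following '<col_name>_missing_flag'.
flag_columns = [f"{col}_missing_flag" for col in replacements]
print(flag_columns)
```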

Source Code