Impute Transform
The impute transform performs imputation of missing data objects.
Transform Parameters
| Property | Type | Description |
|---|---|---|
| field | Field | Required. The data field for which missing values should be imputed. |
| key | Field | Required. A key field that uniquely identifies data objects within a group. Missing key values (those occurring in the data but not in the current group) will be imputed. |
| keyvals | Any[ ] | An optional array of key values that should be considered for imputation. If provided, this array will be used in addition to the key values observed within the input data. |
| method | String | The imputation method to use for the field value of imputed data objects. One of value (default), mean, median, max, or min. For example, the mean method will calculate the mean of all existing values within a group, and replace the missing values with the calculated mean. |
| groupby | Field[ ] | An optional array of fields by which to group the values. Imputation will then be performed on a per-group basis. For example, missing values may be imputed using the group mean rather than the global mean. |
| value | Any | The field value to use when the imputation method is value. |
Usage
{
"data": [
{
"name": "table",
"values": [
{"x": 0, "y": 28, "c": 0}, {"x": 0, "y": 55, "c": 1},
{"x": 1, "y": 43, "c": 0}, {"x": 1, "y": 91, "c": 1},
{"x": 2, "y": 81, "c": 0}, {"x": 2, "y": 53, "c": 1},
{"x": 3, "y": 19, "c": 0}
],
"transform": [
{
"type": "impute",
"groupby": ["c"],
"key": "x",
"field": "y",
"method": "value",
"value": 500
}
]
}
]
}
In this example, the transform imputes the tuple
{"x": 3, "c": 1, "y": 500}