Term | Definition |
---|---|
Data cleaning | Correcting or removing incorrect or incomplete data in a dataset. |
One-hot encoding | Representing a categorical variable as a set of binary (0/1) columns, one per category. |
Train/test split | Splitting a dataset into one portion used to train a model and a held-out portion used to test it. |
Single Factor Analysis (SFA) | The process of determining how well a single factor, taken on its own, predicts a target. |
Goodness of fit metric (e.g., R squared) | Metric measuring how well a regression model fits the data. |
Linear regression | Modelling technique that fits a linear relationship between a dependent variable and one or more independent variables. |
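
The terms above can be tied together in a small sketch. What follows is a minimal illustration in plain Python, not a reference implementation: the helper names (`one_hot`, `train_test_split`, `fit_line`, `r_squared`) and the toy data are hypothetical and chosen only for this example. The data is assumed to follow y = 2x + 1 exactly, with two deliberately incomplete rows to clean out.

```python
import random


def one_hot(values):
    """Encode categorical values as 0/1 indicator lists, one slot per category."""
    categories = sorted(set(values))
    encoded = [[1 if v == c else 0 for c in categories] for v in values]
    return encoded, categories


def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and hold out a fraction of them for testing."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x (one factor)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def r_squared(ys, preds):
    """R squared: 1 minus residual sum of squares over total sum of squares."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Toy data (assumption): y = 2x + 1, plus two incomplete rows.
raw = [(1.0, 3.0), (2.0, 5.0), (3.0, None), (4.0, 9.0), (5.0, 11.0),
       (6.0, 13.0), (None, 4.0), (7.0, 15.0), (8.0, 17.0), (9.0, 19.0)]

# Data cleaning: drop rows with a missing factor or target.
clean = [(x, y) for x, y in raw if x is not None and y is not None]

# Train/test split, then linear regression fitted on the training rows only.
train, test = train_test_split(clean, test_fraction=0.25, seed=0)
intercept, slope = fit_line([x for x, _ in train], [y for _, y in train])

# Goodness of fit, measured on the held-out test rows.
preds = [intercept + slope * x for x, _ in test]
score = r_squared([y for _, y in test], preds)

# One-hot encoding of a small categorical column.
encoded, categories = one_hot(["red", "green", "red"])
```

Fitting one factor at a time against the target and comparing the resulting scores is one simple way to approach single factor analysis. In practice a library such as scikit-learn or pandas would usually replace these hand-written helpers.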