Feature Engineering Reference

Common Techniques

Technique	When to Use
Zero-variance removal	Always check for predictors with single unique value
Dummy variables	Convert categorical predictors to binary 0/1 indicators
Target encoding	Categorical predictors with many levels → single numeric column
Centering & scaling	Models using distance metrics or dot products
Symmetric transformations	Skewed numeric predictors (Yeo-Johnson or orderNorm)
Imputation	Missing predictor values—estimate from other columns
Correlation reduction	Feature extraction (e.g., PCA) or unsupervised correlation filter
Spline terms	Nonlinear relationships between single predictors and outcome
Interaction terms	Joint effects of two or more predictors

For methodology and code examples:

Ordinary linear/logistic/multinomial regression:

Mandatory: indicator variables, zero-variance removal, complete data
Helpful: interaction terms, spline terms, reducing correlation, symmetric distributions

Regularized linear/logistic/multinomial regression:

Mandatory: indicator variables, zero-variance removal, standardized scale, complete data
Helpful: interaction terms, spline terms, reducing correlation, symmetric distributions

K-nearest neighbors:

Mandatory: indicator variables, zero-variance removal, standardized scale, complete data
Helpful: symmetric distributions

Support Vector Machines:

Mandatory: indicator variables, zero-variance removal, standardized scale, complete data
Helpful: symmetric distributions

Generalized Additive Models:

Multivariate Adaptive Regression Splines (MARS):

Naive Bayes:

Neural networks:

Mandatory: indicator variables, zero-variance removal, reducing correlation, standardized scale, complete data
Helpful: symmetric distributions

Single tree models:

Tree ensemble models (random forest, boosting):

RuleFit:

Mandatory: indicator variables, zero-variance removal, standardized scale, complete data
Helpful: none

Cubist: