When computers learn from data, they can pick up unfair rules. For example, if a hiring tool sees that past hires were mostly men, it might favor male applicants. This is called bias. Bias can hurt people and break trust in AI.
Fairness in machine learning means reducing or removing bias so that all groups are treated equally. There are three main ways to add fairness to your models:
- Pre-processing: Change the data before training.
- In-processing: Adjust the training method itself.
- Post-processing: Fix model outputs after training.
In this post, we will explain each method in simple English, show examples, and give you clear steps to follow.
What Is Bias in Machine Learning?
Bias means a model systematically makes decisions that are unfair to certain groups. It usually comes from the data or from the training algorithm:
- Data bias: If the data has more examples from one group, the model may favor that group.
- Algorithm bias: If the training process rewards only overall accuracy, it can ignore fairness for smaller groups.
Example: A loan approval model uses past loan data. If people from a certain neighborhood got fewer loans before, the model may continue to reject their applications.
Why Fairness Matters
- Legal and Ethical Reasons: Unfair models can break laws and harm people.
- Trust: Customers and users trust AI more when it is fair.
- Better Decisions: Fair models often generalize better and avoid costly mistakes.
Imagine a health app that gives worse advice to one group. Patients could lose trust and stop using the app.
1. Pre-processing Techniques
Pre-processing means fixing data before training. You clean or change the data to remove bias.
1.1 Re-sampling
- What it is: You add or remove data points so groups are balanced.
- Example: If you have 80 male applicants and 20 female applicants in your data, you can duplicate female data or remove some male data so both groups have 50 each.
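Here is a minimal sketch of re-sampling with pandas and scikit-learn. The data, the gender column, and the target of 50 rows per group are made up for illustration:

```python
import pandas as pd
from sklearn.utils import resample

# Hypothetical applicant data: 80 male rows, 20 female rows.
df = pd.DataFrame({
    "gender": ["male"] * 80 + ["female"] * 20,
    "approved": [1, 0] * 40 + [1, 0] * 10,
})

male = df[df["gender"] == "male"]
female = df[df["gender"] == "female"]

# Downsample the majority group and upsample the minority group to 50 rows each.
male_down = resample(male, replace=False, n_samples=50, random_state=0)
female_up = resample(female, replace=True, n_samples=50, random_state=0)

balanced = pd.concat([male_down, female_up]).sample(frac=1, random_state=0)
print(balanced["gender"].value_counts())  # 50 male, 50 female
```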
1.2 Re-weighting
- What it is: You give more weight to examples from underrepresented groups.
- Example: You weight each example by the inverse of its group's frequency. With the 80/20 split above, each female example counts four times as much as a male example, so the model pays extra attention to female cases.
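A rough sketch of re-weighting with scikit-learn, using weights inversely proportional to group size; the features, labels, and 80/20 gender split are synthetic stand-ins for your own data:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))                       # hypothetical features
y = rng.integers(0, 2, size=100)                    # hypothetical labels
gender = np.array(["male"] * 80 + ["female"] * 20)  # 80/20 split as above

# Weight each example by the inverse of its group's frequency,
# so both groups contribute equally to the loss overall.
counts = {g: (gender == g).sum() for g in np.unique(gender)}
weights = np.array([len(gender) / (2 * counts[g]) for g in gender])

model = LogisticRegression()
model.fit(X, y, sample_weight=weights)  # female weight 2.5 vs. male weight 0.625
```

Most scikit-learn estimators accept a sample_weight argument in fit, so this approach needs no change to the model itself.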
1.3 Data Transformation
- What it is: You change sensitive features so the model sees less bias.
- Example: Replace ZIP codes with broader regions. This hides information that might let the model guess someone’s race.
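A small pandas sketch of the ZIP-code idea; the column names and the "first three digits become a region" rule are illustrative assumptions, not a standard:

```python
import pandas as pd

df = pd.DataFrame({"zip_code": ["94110", "94132", "10001", "10003"]})  # hypothetical

# Keep only the ZIP prefix as a coarse region, then drop the exact ZIP code.
df["region"] = df["zip_code"].str[:3]
df = df.drop(columns=["zip_code"])
print(df)
```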
Steps for Pre-processing:
- Identify sensitive features (e.g., gender, race).
- Check group sizes and distribution.
- Apply re-sampling or re-weighting.
- Retrain the model on the new data.
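Steps 1 and 2 come down to a quick audit before you change anything. A minimal sketch with made-up data and column names:

```python
import pandas as pd

# Hypothetical applicant data with the label and two sensitive features.
df = pd.DataFrame({
    "gender": ["male"] * 80 + ["female"] * 20,
    "race": ["a"] * 60 + ["b"] * 40,
    "approved": [1] * 70 + [0] * 30,
})

sensitive = ["gender", "race"]  # step 1: identify sensitive features

# Step 2: check group sizes and approval rates per group.
for col in sensitive:
    print(df[col].value_counts(normalize=True))
    print(df.groupby(col)["approved"].mean())
```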
2. In-processing Techniques
In-processing means changing the training method to include fairness.
2.1 Fairness Constraints
- What it is: You add rules to the model that it must follow.
- Example: A classifier must have similar error rates for all groups. If men get 5% errors, women must also have about 5% errors.
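One way to enforce a rule like this is the reductions API in Fairlearn (listed in the tools section below). A hedged sketch with synthetic data, assuming a binary label and a binary gender indicator as the sensitive feature:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from fairlearn.reductions import ExponentiatedGradient, EqualizedOdds

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 4))
gender = rng.integers(0, 2, size=500)
y = (X[:, 0] + 0.5 * gender + rng.normal(size=500) > 0).astype(int)

# Train a classifier subject to an equalized-odds constraint
# (error rates must be similar across gender groups).
mitigator = ExponentiatedGradient(
    LogisticRegression(),
    constraints=EqualizedOdds(),
)
mitigator.fit(X, y, sensitive_features=gender)
y_pred = mitigator.predict(X)
```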
2.2 Adversarial Debiasing
- What it is: You train two models together: one predicts the target, and the other (the adversary) tries to predict the sensitive feature from the first model's outputs. You reward the first model for doing the main task well while making the adversary fail.
- Example: A loan model is trained to predict repayment. A second model tries to predict gender from the first model’s output. The first model learns to hide gender information.
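A compact sketch of this idea in PyTorch. The synthetic loan data, network sizes, learning rates, and the weight lam are all assumptions chosen only to keep the example short and runnable:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
X = torch.randn(200, 10)                # hypothetical applicant features
s = (torch.rand(200, 1) > 0.5).float()  # sensitive feature (e.g., gender)
y = ((X[:, :1] + s) > 0.5).float()      # repayment label, correlated with s

predictor = nn.Sequential(nn.Linear(10, 16), nn.ReLU(), nn.Linear(16, 1))
adversary = nn.Sequential(nn.Linear(1, 8), nn.ReLU(), nn.Linear(8, 1))

opt_pred = torch.optim.Adam(predictor.parameters(), lr=1e-3)
opt_adv = torch.optim.Adam(adversary.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()
lam = 1.0  # how strongly the predictor is punished for leaking s

for epoch in range(200):
    # 1) Train the adversary to guess the sensitive feature from the predictor's output.
    with torch.no_grad():
        scores = predictor(X)
    adv_loss = bce(adversary(scores), s)
    opt_adv.zero_grad()
    adv_loss.backward()
    opt_adv.step()

    # 2) Train the predictor to predict repayment well AND to fool the adversary.
    scores = predictor(X)
    task_loss = bce(scores, y)
    leak_loss = bce(adversary(scores), s)
    opt_pred.zero_grad()
    (task_loss - lam * leak_loss).backward()
    opt_pred.step()
```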
2.3 Regularization for Fairness
- What it is: You add a penalty in the loss function for unfair behavior.
- Example: If the model’s predictions differ too much between groups, the loss increases, pushing the model to be fair.
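A minimal PyTorch sketch of such a penalty. Here the extra term is the squared gap between the two groups' average predicted probabilities, and the data and weight lam are made up:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
X = torch.randn(200, 10)
s = torch.rand(200) > 0.5                # boolean mask for one group
y = (X[:, 0] > 0).float().unsqueeze(1)

model = nn.Linear(10, 1)
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
bce = nn.BCEWithLogitsLoss()
lam = 2.0                                # strength of the fairness penalty

for epoch in range(200):
    logits = model(X)
    probs = torch.sigmoid(logits).squeeze(1)

    # Penalize any gap between the groups' average predicted probability.
    gap = probs[s].mean() - probs[~s].mean()
    loss = bce(logits, y) + lam * gap ** 2

    opt.zero_grad()
    loss.backward()
    opt.step()
```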
Steps for In-processing:
- Choose a fairness metric (e.g., equal error rates).
- Select an in-processing method (constraints, adversarial, or regularization).
- Modify the training code to include fairness rules.
- Train and check both accuracy and fairness metrics.
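For the last step, Fairlearn's MetricFrame reports accuracy and a fairness metric side by side, overall and per group. A sketch with synthetic labels and predictions standing in for your own test set:

```python
import numpy as np
from sklearn.metrics import accuracy_score
from fairlearn.metrics import MetricFrame, selection_rate

# Synthetic stand-ins for your real test labels, predictions, and sensitive feature.
rng = np.random.default_rng(0)
y_test = rng.integers(0, 2, size=100)
y_pred = rng.integers(0, 2, size=100)
gender_test = rng.choice(["male", "female"], size=100)

frame = MetricFrame(
    metrics={"accuracy": accuracy_score, "selection_rate": selection_rate},
    y_true=y_test,
    y_pred=y_pred,
    sensitive_features=gender_test,
)
print(frame.overall)   # metrics on the whole test set
print(frame.by_group)  # the same metrics broken down by group
```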
3. Post-processing Techniques
Post-processing means fixing model outputs after training.
3.1 Threshold Adjustment
- What it is: You set different decision thresholds for each group.
- Example: A credit model gives scores from 0 to 1. You might require a score of 0.6 for men and 0.5 for women to get the same approval rate.
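A tiny numpy sketch of group-specific thresholds. The 0.6 and 0.5 cutoffs mirror the example above and are not a recommendation; in practice you would pick them on validation data:

```python
import numpy as np

rng = np.random.default_rng(0)
scores = rng.random(10)                         # model scores in [0, 1]
group = rng.choice(["men", "women"], size=10)

# Different cutoffs per group, chosen to equalize approval rates.
thresholds = {"men": 0.6, "women": 0.5}
approved = scores >= np.array([thresholds[g] for g in group])
print(list(zip(group, scores.round(2), approved)))
```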
3.2 Calibration
- What it is: You adjust predicted probabilities so they reflect true chances equally for all groups.
- Example: If the model’s 70% repayment prediction is right 70% of the time for men but only 60% for women, you recalibrate the scores for women.
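One simple way to do this is to fit a separate calibrator per group on held-out data, for example isotonic regression from scikit-learn. The scores and outcomes below are synthetic, with one group's raw scores made too optimistic on purpose:

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression

rng = np.random.default_rng(0)
scores = rng.random(300)                        # raw model scores on validation data
group = rng.choice(["men", "women"], size=300)
# Synthetic outcomes: raw scores overstate repayment for the "women" group.
repaid = (rng.random(300) < np.where(group == "women", scores - 0.1, scores)).astype(int)

# Fit one calibrator per group, mapping raw scores to observed repayment rates.
calibrators = {}
for g in ["men", "women"]:
    mask = group == g
    calibrators[g] = IsotonicRegression(out_of_bounds="clip").fit(scores[mask], repaid[mask])

# At prediction time, route each score through its group's calibrator.
calibrated = np.array([calibrators[g].predict([s])[0] for g, s in zip(group, scores)])
```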
3.3 Reject Option Classification
- What it is: For cases near the decision boundary, you choose the option that favors the disadvantaged group.
- Example: If someone’s score is 0.49 (just below approval) and they belong to an underrepresented group, you approve them.
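A short numpy sketch of this rule. The 0.5 threshold, the 0.05 margin, and the group labels are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(0)
scores = rng.random(10)
group = rng.choice(["majority", "minority"], size=10)

threshold, margin = 0.5, 0.05
approved = scores >= threshold

# For borderline cases (within the margin of the threshold), approve
# applicants from the disadvantaged group even if they fall just short.
borderline = np.abs(scores - threshold) <= margin
approved = approved | (borderline & (group == "minority"))
```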
Steps for Post-processing:
- Evaluate model fairness on validation data.
- Choose a post-processing method (thresholds, calibration, or reject option).
- Apply adjustments to predictions.
- Test final performance and fairness.
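For steps 1 and 4, comparing per-group approval rates before and after the adjustment shows whether it worked. A sketch on synthetic predictions using Fairlearn's demographic_parity_difference, which returns the largest gap in approval rate between groups:

```python
import numpy as np
from fairlearn.metrics import demographic_parity_difference

rng = np.random.default_rng(0)
y_true = rng.integers(0, 2, size=200)
group = rng.choice(["a", "b"], size=200)
scores = np.clip(rng.random(200) + np.where(group == "a", 0.1, 0.0), 0, 1)

before = (scores >= 0.5).astype(int)
after = (scores >= np.where(group == "a", 0.55, 0.5)).astype(int)  # adjusted thresholds

# A gap of 0 means both groups are approved at the same rate.
for name, pred in [("before", before), ("after", after)]:
    gap = demographic_parity_difference(y_true, pred, sensitive_features=group)
    print(name, round(gap, 3))
```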
Putting It All Together: A Loan Approval Example
Scenario: A bank builds a loan approval model. Past data shows fewer approvals for one ZIP code.
- Pre-processing: The team re-samples data to balance applications from all ZIP codes.
- In-processing: They add a fairness constraint so that approval rates differ by no more than 2% between groups.
- Post-processing: After training, they adjust thresholds so both groups have equal approval rates.
The final model approves loans fairly, while still predicting repayment accurately.
Tools and Libraries
- AI Fairness 360 (AIF360) by IBM: Offers pre-, in-, and post-processing algorithms.
- Fairlearn by Microsoft: Provides fairness metrics and mitigation algorithms.
- Themis-ML: A Python library for fairness in machine learning.
- Scikit-learn: Has no built-in fairness tools, but its estimators accept sample weights and plug into libraries like Fairlearn and AIF360.
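As a rough sketch of what the AIF360 API looks like for the re-weighting idea from section 1.2 (the column names and group encodings are assumptions; check the AIF360 documentation for the exact details):

```python
import pandas as pd
from aif360.datasets import BinaryLabelDataset
from aif360.algorithms.preprocessing import Reweighing

# Hypothetical data: 'gender' encoded as 1 = male, 0 = female.
df = pd.DataFrame({
    "gender": [1] * 8 + [0] * 2,
    "income": [50, 60, 55, 40, 70, 65, 45, 52, 48, 58],
    "approved": [1, 1, 0, 1, 1, 0, 1, 1, 0, 1],
})

dataset = BinaryLabelDataset(
    df=df,
    label_names=["approved"],
    protected_attribute_names=["gender"],
)
rw = Reweighing(
    unprivileged_groups=[{"gender": 0}],
    privileged_groups=[{"gender": 1}],
)
reweighted = rw.fit_transform(dataset)
print(reweighted.instance_weights)  # per-example weights you can pass to training
```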
Challenges and Tips
- Trade-off Between Accuracy and Fairness: Adding fairness can reduce accuracy. Decide on a balance that fits your needs.
- Choosing Metrics: Different metrics matter in different cases (e.g., equal opportunity vs. demographic parity).
- Data Quality: Fairness starts with good data. Check for errors and missing values.
- Continuous Monitoring: Models can become unfair over time as data changes.
Tip: Start with simple methods and add complexity as needed.
Best Practices for Fairness
- Define Fairness Goals Early: Know which groups and metrics matter for your use case.
- Document Everything: Keep records of data sources, methods, and fairness tests.
- Involve Stakeholders: Include legal, ethical, and domain experts.
- Test on Real Data: Use hold-out sets that reflect the diversity of your population.
- Iterate and Improve: Fairness is not a one-time fix. Review and update models regularly.
Conclusion
Fairness in machine learning is essential to build trustworthy AI. By using pre-processing, in-processing, and post-processing techniques, you can reduce bias and treat all groups equally. Remember to pick the right methods for your problem, use available tools, and involve diverse stakeholders. With these steps, you can move from bias to balance and create AI systems that are both accurate and fair.