Addressing the Gender & Racial Wage Gaps with Data Science

BY GINA MANTICA

The overall gender wage gap hasn’t changed in the Boston area over the past two years, according to a recent report from the Boston Women’s Workforce Council (BWWC). But identifying a problem is often the first step towards finding a solution, and data science provides the quantitative rigor and objectivity that can enable effective change.

Over the past year software engineers and statisticians at Boston University collaborated with the BWWC, a public-private social impact initiative housed at the Hariri Institute, to securely collect and scrutinize payroll data that Greater Boston employers submit anonymously as part of a pledge made with the Boston Mayor’s office and the BWWC to close the gender and racial wage gaps. This year the collected data represented 156,000 employees, covering 14% of the Greater Boston area workforce. Boston University is a signer of the pledge and, as such, contributed their salary data. The BWWC published the results of their recent wage gap analysis online on December 9.

The Hariri Institute’s Software & Application Innovation Lab (SAIL) developed a secure data collection and analysis platform about five years ago that the BWWC continues to use to this day for their report.  The software uses multi-party computation, or MPC, to collect and analyze private wage information from the BWWC’s 100% Talent Compact Signers. “MPC is the idea of multiple parties pooling their data to give you an aggregate result without any one individual’s data being exposed,” explained Arezoo Sadeghi, a software engineer at SAIL, “MPC lets you work with sensitive data and you’ll never be able to identify people.”

Anna Cook created visualizations of the wage gaps to help the BWWC make sense of the data.

After companies submit their information, statisticians process the data to support the BWWC’s analysis. This year, Anna Cook, a Master’s student in the Statistical Practice (MSSP) Program, and Masanao Yajima, the director of MSSP Consulting and an Associate Professor of the Practice for MSSP, led these efforts.  Cook, under Yajima’s supervision, crunched numbers to calculate the average gender and racial wage gaps and created visualizations to help the BWWC make sense of the data.  Working on this report combined Cook’s skillfulness in data science that she gained in MSSP with her interest in social justice.

Masanao Yajima, Associate Professor of the Practice for MSSP, supervised Cook.

Cook and Yajima found that Boston-area women, on average, were paid 70¢ for every dollar earned by a man in 2021. Although the average gender wage gap hasn’t shrunk over the past two years in Boston, the racial wage gap was slightly smaller in 2021 than in 2019 according to the BWWC. Asian and American Indian/Alaskan Native women were paid two and three cents more for every dollar earned by a man in 2021 compared to 2019. “I think of Boston as being a city that is highly educated and has a ton of job opportunities. It was interesting to see that some of the gaps that we saw were comparable to what we would see on a national scale,” Cook reflected.

It is important to note, however, that these numbers are averages across all companies that submitted their data this time around and do not reflect changes over time within a single, individual company. Researchers at the Hariri Institute hope to refine methods at the interface of privacy and statistics so that these individual-level changes can be better understood while still keeping confidentiality in place.

Benchmarking can help drive progress towards equal pay, even without historical wage information from individual companies. Both Mastercard and Starbucks reported their gender wage gaps in the past few years, and committed to transparency and a push for equity in the workforce. But without a standardized method of doing the math, those claims feel insignificant to some. In Boston, the BWWC gives employers both the equations for internal use and a standardized benchmark for comparison. “Working in a way that is data-centered is good. If we have data and analysis to back up claims they are more likely to be believed,” said Vidya Akavoor, a software engineer at SAIL.

The computational tools used to create the BWWC’s report could be applied to a variety of other contexts where equity needs to be measured without compromising trust or privacy. For example, the analysis of college admissions data, financial credit ratings, environmental justice practices, and even healthcare operations could benefit from data-centric thinking but often involve sensitive or proprietary data. “In the modern era where everything is so data driven, one of the key elements in addressing such grand challenges should be data, numbers, and quantifying things,” said Eric Kolaczyk, Director of the Hariri Institute and Professor of Statistics, “If you can quantify a problem, then you have some hope for sitting down together objectively to discuss policies and procedures to achieve a goal.  The success of the BWWC’s work demonstrates that this can be done in a way that respects yet overcomes barriers arising around trust, governance, and the like.”


Interested in learning more about the research happening at the Hariri Institute? Sign up for our newsletter here.