IIT Madras Robert Bosch Centre launches project to reduce gender data gap in Wikipedia

Anupama Mehra
Assistant Manager – Content
New Delhi, Updated on Mar 25, 2022 12:44 IST

The initiative's founders aim to auto-generate biographies of several notable women by the next International Women’s Day, March 8, 2023.

Indian Institute of Technology Madras’ Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI) and SuperBloom Studios, a business consultancy firm, are launching an initiative called ‘Hidden Voices’ to reduce the gender data gap in digital sources. In partnership with the IITM Alumni Association, the initiative is starting with Wikipedia.

The initiative's founders have set a goal of auto-generating biographies of several notable women by the next International Women’s Day, March 8, 2023, and thereby making a positive impact on gender representation in digital sources. The Hidden Voices team notes that SuperBloom Studios was co-founded by three IIT Madras alumnae who are building this practice across three continents.

Those interested in volunteering for this initiative can register at http://hiddenvoices.xyz/

Natural language models increasingly form the basis of various consumer interaction services, and these models depend on open web datasets, including Wikipedia. Separately, most people turn first to digital tools such as Wikipedia when forming their views on many subjects.

While achieving equitable representation across all digital platforms involves multiple layers of complexity, there is significant value in increasing the representation of women on Wikipedia.

Explaining how this initiative would be implemented, Prof. Balaraman Ravindran, Head, RBCDSAI-IIT Madras, said, “The project will be an instance of a human-allied AI execution. While the state-of-the-art of Automated Language Processing has significantly advanced there are situations when the AI will make errors. This is especially so when processing documents about underrepresented populations, the very fact that this project is trying to address. Hence, we will take advantage of AI solutions where possible, and judiciously use human oversight and verification to produce high quality outputs.”

The gender data gap is considered a major barrier to more equitable solutions across domains. Spoken and written content on the web (text, audio and video) is vastly outpacing any other form of data. Curated online content is also the foundational data source for many AI/ML solutions, such as automated speech recognition and language models, that form the basis of many products and services.

But gender-diverse voices are measurably underrepresented in these core digital data sources. The Hidden Voices initiative sets out to tackle this issue.

Elaborating further on this project, Dr. Raji Baskaran, Founding Partner, SuperBloom Studios, said, “The lack of availability of proper information often creates and cements unintended biases. This is nowhere more prominent than in the ever-widening digital gender data gap. Hidden Voices addresses a critical data gap and builds tools to systematically reduce this gap at scale. Building products and services that are inclusive is at the core of our business strategy.”    

Major barriers to addressing the data gap include not only editors' gender and interests but also contributions from external sources. Hence, the project aims to develop information-theoretic approaches, ML-assisted auto-identification and validation of external sources, and textual analysis methods to auto-generate a first draft of a Wikipedia-style biography. The models developed will employ this approach to generate Wikipedia articles for notable women in STEMM (Science, Technology, Engineering, Medicine and Management).
