On the Interpretability and Significance of Bias Metrics in Texts: a PMI-based Approach

Authors: Francisco Valentini, Germán Rosati, Damián Blasi, Diego Fernandez Slezak, Edgar Altszyler.

Abstract:
In recent years, word embeddings have been widely used to measure biases in texts. Even if they have proven to be effective in detecting a wide variety of biases, metrics based on word embeddings lack transparency and interpretability. We analyze an alternative PMI-based metric to quantify biases in texts. It can be expressed as a function of conditional probabilities, which provides a simple interpretation in terms of word co-occurrences. We also prove that it can be approximated by an odds ratio, which allows estimating confidence intervals and statistical significance of textual biases. This approach produces similar results to metrics based on word embeddings when capturing gender gaps of the real world embedded in large corpora.
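As a rough illustration of the two quantities the abstract mentions, the sketch below computes a PMI-based bias as the log ratio of conditional probabilities, log P(x|A) / P(x|B), and a log odds ratio with a Wald-type 95% confidence interval. The function names, the count-based inputs, and the example numbers are all hypothetical, and the standard-error formula is the usual one for a 2x2 contingency table; it is an assumption here, not necessarily the exact procedure used in the paper.

```python
import math

def pmi_bias(c_xa, c_a, c_xb, c_b):
    """PMI(x, A) - PMI(x, B), which reduces to log[P(x|A) / P(x|B)].

    c_xa, c_xb: co-occurrence counts of word x with context groups A and B.
    c_a, c_b:   total counts of words co-occurring with A and with B.
    (Hypothetical inputs chosen for illustration.)
    """
    return math.log((c_xa / c_a) / (c_xb / c_b))

def log_odds_ratio_ci(c_xa, c_a, c_xb, c_b, z=1.96):
    """Log odds ratio of the 2x2 table and a Wald confidence interval.

    Assumes all four cells are positive; z=1.96 gives roughly a 95% interval.
    """
    not_xa = c_a - c_xa  # co-occurrences with A that are not x
    not_xb = c_b - c_xb  # co-occurrences with B that are not x
    log_or = math.log((c_xa / not_xa) / (c_xb / not_xb))
    se = math.sqrt(1 / c_xa + 1 / not_xa + 1 / c_xb + 1 / not_xb)
    return log_or, log_or - z * se, log_or + z * se

# Made-up counts: x appears 30 times near A words and 10 times near B words,
# out of 1000 co-occurrences for each group.
bias = pmi_bias(30, 1000, 10, 1000)
log_or, lo, hi = log_odds_ratio_ci(30, 1000, 10, 1000)
print(f"PMI bias: {bias:.3f}")
print(f"log odds ratio: {log_or:.3f}, interval: [{lo:.3f}, {hi:.3f}]")
```

When x is rare relative to both totals, the two values are close, which is the sense in which the odds ratio approximates the PMI-based bias. An interval that excludes zero suggests the bias is statistically distinguishable from no association under these assumptions.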

More information:
https://aclanthology.org/2023.acl-short.44/

Andres Juarez | 21 December 2023 | Papers
