Updated Oct 2020 The paper describing the approach used to discover and categorise the biases presented in this tool can be accessed here.

Discovering And Interpreting Conceptual Biases (DAICB)

The DAICB is a tool to interactively compare the discovered biases between two attribute concepts inherent in large textual datasets taken from the internet, as captured by Word Embeddings models. It is an extension over previous work presented at ICWSM, which can be accessed here.

Language carries implicit human biases, functioning both as a reflection and a perpetuation of stereotypes that people carry with them. Recently, ML-based NLP methods such as word embeddings have been shown to learn such language biases strikingly accurately. This capability of word embeddings has been successfully exploited as a tool to quantify and study human biases. However, previous studies only consider a predefined set of conceptual biases to attest (e.g., whether gender is more or less associated with particular jobs), or just discover biased words without helping to understand their meaning at the conceptual level. As such, these approaches are either unable to find conceptual biases that have not been defined in advance, or the biases they find are difficult to interpret and study. This makes existing approaches unsuitable to discover and interpret biases in online communities, as online communities may have different biases from mainstream culture which need to be discovered and properly interpreted. Here we propose a general, data-driven approach to automatically discover and help interpret conceptual biases encoded in word embeddings. We apply this approach to study the conceptual biases present in the language used in online communities and experimentally show the validity and stability of our method.

+ show more

Dataset Selection

Find a link to the online community below:
https://reddit.com/r/dating_advice

Explore the data for

Click on any of the cards and explore different interactive approaches to discover stereotypes and biases found in the selected dataset.

Most Salient Biased Words

Explore the most salient words discovered in the dataset. Most biased words are those who are often found in similar contexts as the and concepts.

Click to jump to the section

Detailed Dataset Word Biases

Explore the details of the most frequent salient words. This section shows the salience, biases, frequency, polarity (sentiment) and part-of-speech (POS) of the most salient words biased towards gender or religion.

Click to jump to the section

Word Distributions of Biases

Explore the distribution of all salient words in a bar graph, for and , ordered by bias or frequency

Click to jump to the section

Bias Polarity

Explore the sentiment of the most salient words, classified in 7 categories ranging from positive to negative.

Click to jump to the section

Word Embedding Space

Explore the distribution of the most salient words in the embedding space, represented in the two principal t-SNE dimensions.

Click to jump to the section

Concept Embedding Space

Compare the concepts obtained from the embedding representations of the most salient and biased sets of words in the two principal t-SNE dimensions.

Click to jump to the section

Semantic Categorisation of Conceptual Biases

Presents the semantic tags assigned the conceptual biases of the community, with the aim to facilitate the comparison between the discovered biases for each attribute concept.

Click to jump to the section

Detailed Ranking of Conceptual Biases

Presents a more detailed analysis of the conceptual biases found, focusing on establishing different methods to sort (rank) each of the clusters discovered.

Click to jump to the section

Semantic Categorisation of Conceptual Biases

The figures below show an overview of the conceptual biases found in the dataset, obtained after tagging every cluster in the partition of conceptual biases with the most frequent semantic ﬁelds (domains) among its words. This allows us to get a general idea of the nature of the conceptual biases discovered for (left pie) and (right pie) in this community.

+ show more

Detailed Dataset Conceptual Biases

This section shows the details of the conceptual biases for (top table) and (bottom table) discovered in the dataset. By ordering the clusters by the different properties, we are effectively ranking the clusters based on the different startegies presented in the paper Rf, Re and Rs.

The table shows various details of the conceptual biases, including:

Words: Words clustered together as a conceptual bias. Only the first 8 words are shown for each cluster. Characters ... indicate that the cluster has more than 8 words.
Total Frequency: The total frequency of the words in the cluster, considering the whole the dataset.
Average Salience: The average salience of all words in the cluster, ranging from 1 (strong) to 0.
Average Bias: The average bias of all words in the cluster towards each attribute concept, ranging from 1 (strong bias) to 0 (non-biased).
Average Sentiment: The average sentiment (polarity) of all words in the cluster, ranging from 1 (very positive) to -1 (very negative), with 0 being neutral, as determined by the NLTK Vader Sentiment Analysis library.

Order the lists of biased words by the different properties, by clicking on a column header. By ordering the clusters by the different properties, we are effectively ranking the clusters based on the different startegies presented in the paper Rf, Re and Rs. For instance, ordering the clusters in by frequency (by clicking on top of the 'Total Frequency' cell) is equivalent to ranking the clusters using the Frequency ranking (Rf) presented in the paper, and should return the same results presented in Figure 2 and Figure 4 (ignoring words with the same stem!). Also, the tables allow to Search for specific words or values.

The tables below show the resulting clusters (conceptual biases) obtained by selecting the k-means partition with highest silhouette score over various runs that tested different values of k, as described in Section 4 of the paper.

+ show more

Rankings reported in the paper for and

#	Words	Total Frequency	Avg. Salience	Avg. Bias	Avg. Sentiment

#	Words	Total Frequency	Avg. Salience	Avg. Bias	Avg. Sentiment

Most Frequent Salient Words

The word clouds presented below show the most salient words biased towards and (left and right, respectively) in the selected dataset, that is, these words more often found in and related contexts. The size and color of each word corresponds with its frequency, bigger means more frequent. For details about each word, see section Detailed Dataset Word Biases.

+ show more

Detailed Dataset Word Biases

This section shows the details of the most salient and biased words for and (left and right, respectively) in the dataset.

+ show more

Nouns

Adjectives

Verbs

#	Word	Sal	Bias	Freq	Sent	POS

#	Word	Sal	Bias	Freq	Sent	POS

Word Distributions of Biases

Explore the bias and frequency distributions of all biased words in the dataset in two bar plots; -biased words are shown on the top bar plot and -biased words on the bottom. By comparing the distributions, one could observe the differences between genders/religions in the dataset (depending on the commnuity explored). For instance, in The Red Pill, although men-biased words are more frequent, women-biased words hold stronger biases.

+ show more

Salience

Bias

Frequency

100

All

Bias Polarity

Explore the sentiment of the most salient and biased words for and (left and right, respectively), classified in 7 categories ranging from positive to negative.

+ show more

Salience

Bias

Frequency

Ignore neutrals

100

200

All

Word Embedding Space

Explore the distribution of salient and biased words in the embedding space as learned by a machine learning algorithm, represented in the two principal t-SNE dimensions.

+ show more

Concept Embedding Space

The figure below shows the distribution of and concepts on the embedding space for the selected dataset, presented in the two most informative t-SNE dimensions.

A concept is formed by the aggregation of semantically similar words, based on the embedding representations learnt by the machine learning model. Concepts add another layer of abstraction between biases and text, allowing a better understanding of the motives that drive these biases.

Similarly as before, semantically related concepts care found close in the figure, while semantically unrelated concepts are placed apart. The similarity between concepts is estimated with concept centroids. The size of each concept corresponds with its relative frequency in the dataset, bigger means more frequent. -biased concepts are shown in pink while -biased concepts are presented in blue. By clicking on a concept, you can explore all words clustered in it together with their relative and absolute frequency. Notice how and -biased concepts are clearly separated in the embedding space, showing a clear separation between the set of most biased words identified with our methodology and the embedding representations learnt by the machine learning system.

Select the concept you want to explore, either by clicking on it or by selecting it using the buttons on the top right of the plot. After selecting a concept, all words clustered in it will appear in red on the right of the plot, showing the absolute and relative frequencies. Hovering on top of a concept will show the average bias, sentiment, and absolute frequency of all words clustered in the concept. This plot is based on LDAvis implementation.

The plot shows the most salient and biased towards and , clustered based on word similarity in the embedding space.

+ show more