Publication

A Full Probabilistic Model for Yes/No Type Crowdsourcing in Multi-Class Classification

May 1, 2019

People

Belen Saldias Fuentes

Former Graduate Student

Saldias-Fuentes, B., Protopapas, P., & Pichara, K. (2019, May). A Full Probabilistic Model for Yes/No Type Crowdsourcing in Multi-Class Classification. Proceedings of the 2019 SIAM International Conference on Data Mining (SDM) (pp. 756-764). SIAM.

Abstract

Crowdsourcing has become widely used in supervised scenarios where training sets are scarce and difficult to obtain. Most crowdsourcing models in the literature assume labelers can provide answers to full questions. In classification contexts, full questions require a labeler to discern among all possible classes. Unfortunately, discernment is not always easy in realistic scenarios. Labelers may not be experts in differentiating all classes. In this work, we provide a full probabilistic model for a shorter type of queries. Our shorter queries only require “yes” or “no” responses. Our model estimates a joint posterior distribution of matrices related to labelers' confusions and the posterior probability of the class of every object. We developed an approximate inference approach, using Monte Carlo Sampling and Black Box Variational Inference, which provides the derivation of the necessary gradients. We built two realistic crowdsourcing scenarios to test our model. The first scenario queries for irregular astronomical time-series. The second scenario relies on the image classification of animals. We achieved results that are comparable with those of full query crowdsourcing. Furthermore, we show that modeling labelers' failures plays an important role in estimating true classes. Finally, we provide the community with two real datasets obtained from our crowdsourcing experiments.
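To make the yes/no query idea concrete, the following is a minimal sketch of how yes/no answers could update the posterior over an object's class. It is not the paper's model: it assumes each labeler's "yes" probabilities are already known, whereas the paper infers labelers' confusion matrices jointly with the class posteriors using Monte Carlo sampling and Black Box Variational Inference. The function name `class_posterior` and the `yes_prob` layout are hypothetical, chosen only for illustration.

```python
import math


def class_posterior(prior, yes_prob, answers):
    """Posterior over an object's true class given yes/no answers.

    Assumptions (illustrative, not taken from the paper):
      prior    -- list of K strictly positive prior class probabilities.
      yes_prob -- yes_prob[j][z][k] is the probability, strictly between
                  0 and 1, that labeler j answers "yes" when the true
                  class is z and the question asks "is it class k?".
      answers  -- list of (labeler j, asked class k, said_yes) tuples,
                  treated as conditionally independent given the class.
    """
    # Work in log space so many answers do not underflow.
    log_post = [math.log(p) for p in prior]
    for j, k, said_yes in answers:
        for z in range(len(prior)):
            p_yes = yes_prob[j][z][k]
            log_post[z] += math.log(p_yes if said_yes else 1.0 - p_yes)

    # Normalize with the max-subtraction trick for numerical stability.
    m = max(log_post)
    weights = [math.exp(lp - m) for lp in log_post]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical labeler: says "yes" with probability 0.9 when the asked
# class matches the true class, and 0.1 otherwise.
K = 3
labeler = [[0.9 if z == k else 0.1 for k in range(K)] for z in range(K)]
posterior = class_posterior([1.0 / K] * K, [labeler], [(0, 0, True)])
```

With a uniform prior, a single "yes" to "is it class 0?" from this hypothetical labeler moves most of the posterior mass to class 0, while a "no" would spread mass over the remaining classes. A labeler with "yes" probabilities closer to 0.5 would shift the posterior less, which is one way to see why modeling labelers' failures matters.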

via SIAM

15th Women in Machine Learning Workshop (WiML 2020)

Organized by Xinyi Chen, Erin Grant, Kristy Choi, Krystal Maughan, Xenia Miscouridou, Judy Hanwen Shen, Raquel Aoki, Belén Saldías…

Event Events

Towards Bridging and Governing Decentralized Communities – Belén Saldías Dissertation Defense

Towards Bridging and Governing Decentralized Communities: Insights and Tools for Constructive Discourse

Article Research

Does AI hold the same values as we do?

During TEDxBentleyU, Belén C. Saldías Fuentes invited the audience to consider the cultural biases built into tools like ChatGPT.

Publication Research

Exploring aspects of similarity between spoken personal narratives by disentangling them into narrative clause types

Saldias, B., & Roy, D. (2020, July). Exploring aspects of similarity between spoken personal narratives by disentangling them into narrative clause types. Proceedings of the 2020 ACL Workshop on Narrative Understanding, Storylines, and Events (NUSE). ACL.