Are Neural Topic Models Broken? (2024)

Alexander Miserlis Hoyle,Rupak Sarkar,Pranav Goel,Philip Resnik

Abstract

Recently, the relationship between automated and human evaluation of topic models has been called into question. Method developers have staked the efficacy of new topic model variants on automated measures, and their failure to approximate human preferences places these models on uncertain ground. Moreover, existing evaluation paradigms are often divorced from real-world use.Motivated by content analysis as a dominant real-world use case for topic modeling, we analyze two related aspects of topic models that affect their effectiveness and trustworthiness in practice for that purpose: the stability of their estimates and the extent to which the model’s discovered categories align with human-determined categories in the data. We find that neural topic models fare worse in both respects compared to an established classical method. We take a step toward addressing both issues in tandem by demonstrating that a straightforward ensembling method can reliably outperform the members of the ensemble.

Anthology ID:
2022.findings-emnlp.390
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2022
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates
Editors:
Yoav Goldberg,Zornitsa Kozareva,Yue Zhang
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5321–5344
Language:
URL:
https://aclanthology.org/2022.findings-emnlp.390
DOI:
10.18653/v1/2022.findings-emnlp.390
Bibkey:
Cite (ACL):
Alexander Miserlis Hoyle, Rupak Sarkar, Pranav Goel, and Philip Resnik. 2022. Are Neural Topic Models Broken?. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 5321–5344, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):
Are Neural Topic Models Broken? (Hoyle et al., Findings 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.findings-emnlp.390.pdf
Video:
https://aclanthology.org/2022.findings-emnlp.390.mp4

PDFCiteSearchVideo

Export citation
  • BibTeX
  • MODS XML
  • Endnote
  • Preformatted
@inproceedings{hoyle-etal-2022-neural, title = "Are Neural Topic Models Broken?", author = "Hoyle, Alexander Miserlis and Sarkar, Rupak and Goel, Pranav and Resnik, Philip", editor = "Goldberg, Yoav and Kozareva, Zornitsa and Zhang, Yue", booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2022", month = dec, year = "2022", address = "Abu Dhabi, United Arab Emirates", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2022.findings-emnlp.390", doi = "10.18653/v1/2022.findings-emnlp.390", pages = "5321--5344", abstract = "Recently, the relationship between automated and human evaluation of topic models has been called into question. Method developers have staked the efficacy of new topic model variants on automated measures, and their failure to approximate human preferences places these models on uncertain ground. Moreover, existing evaluation paradigms are often divorced from real-world use.Motivated by content analysis as a dominant real-world use case for topic modeling, we analyze two related aspects of topic models that affect their effectiveness and trustworthiness in practice for that purpose: the stability of their estimates and the extent to which the model{'}s discovered categories align with human-determined categories in the data. We find that neural topic models fare worse in both respects compared to an established classical method. We take a step toward addressing both issues in tandem by demonstrating that a straightforward ensembling method can reliably outperform the members of the ensemble.",}

Download as File

<?xml version="1.0" encoding="UTF-8"?><modsCollection xmlns="http://www.loc.gov/mods/v3"><mods ID="hoyle-etal-2022-neural"> <titleInfo> <title>Are Neural Topic Models Broken?</title> </titleInfo> <name type="personal"> <namePart type="given">Alexander</namePart> <namePart type="given">Miserlis</namePart> <namePart type="family">Hoyle</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Rupak</namePart> <namePart type="family">Sarkar</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Pranav</namePart> <namePart type="family">Goel</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Philip</namePart> <namePart type="family">Resnik</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <originInfo> <dateIssued>2022-12</dateIssued> </originInfo> <typeOfResource>text</typeOfResource> <relatedItem type="host"> <titleInfo> <title>Findings of the Association for Computational Linguistics: EMNLP 2022</title> </titleInfo> <name type="personal"> <namePart type="given">Yoav</namePart> <namePart type="family">Goldberg</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Zornitsa</namePart> <namePart type="family">Kozareva</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Yue</namePart> <namePart type="family">Zhang</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <originInfo> <publisher>Association for Computational Linguistics</publisher> <place> <placeTerm type="text">Abu Dhabi, United Arab Emirates</placeTerm> </place> </originInfo> <genre authority="marcgt">conference publication</genre> </relatedItem> <abstract>Recently, the relationship between automated and human evaluation of topic models has been called into question. Method developers have staked the efficacy of new topic model variants on automated measures, and their failure to approximate human preferences places these models on uncertain ground. Moreover, existing evaluation paradigms are often divorced from real-world use.Motivated by content analysis as a dominant real-world use case for topic modeling, we analyze two related aspects of topic models that affect their effectiveness and trustworthiness in practice for that purpose: the stability of their estimates and the extent to which the model’s discovered categories align with human-determined categories in the data. We find that neural topic models fare worse in both respects compared to an established classical method. We take a step toward addressing both issues in tandem by demonstrating that a straightforward ensembling method can reliably outperform the members of the ensemble.</abstract> <identifier type="citekey">hoyle-etal-2022-neural</identifier> <identifier type="doi">10.18653/v1/2022.findings-emnlp.390</identifier> <location> <url>https://aclanthology.org/2022.findings-emnlp.390</url> </location> <part> <date>2022-12</date> <extent unit="page"> <start>5321</start> <end>5344</end> </extent> </part></mods></modsCollection>

Download as File

%0 Conference Proceedings%T Are Neural Topic Models Broken?%A Hoyle, Alexander Miserlis%A Sarkar, Rupak%A Goel, Pranav%A Resnik, Philip%Y Goldberg, Yoav%Y Kozareva, Zornitsa%Y Zhang, Yue%S Findings of the Association for Computational Linguistics: EMNLP 2022%D 2022%8 December%I Association for Computational Linguistics%C Abu Dhabi, United Arab Emirates%F hoyle-etal-2022-neural%X Recently, the relationship between automated and human evaluation of topic models has been called into question. Method developers have staked the efficacy of new topic model variants on automated measures, and their failure to approximate human preferences places these models on uncertain ground. Moreover, existing evaluation paradigms are often divorced from real-world use.Motivated by content analysis as a dominant real-world use case for topic modeling, we analyze two related aspects of topic models that affect their effectiveness and trustworthiness in practice for that purpose: the stability of their estimates and the extent to which the model’s discovered categories align with human-determined categories in the data. We find that neural topic models fare worse in both respects compared to an established classical method. We take a step toward addressing both issues in tandem by demonstrating that a straightforward ensembling method can reliably outperform the members of the ensemble.%R 10.18653/v1/2022.findings-emnlp.390%U https://aclanthology.org/2022.findings-emnlp.390%U https://doi.org/10.18653/v1/2022.findings-emnlp.390%P 5321-5344

Download as File

Markdown (Informal)

[Are Neural Topic Models Broken?](https://aclanthology.org/2022.findings-emnlp.390) (Hoyle et al., Findings 2022)

  • Are Neural Topic Models Broken? (Hoyle et al., Findings 2022)
ACL
  • Alexander Miserlis Hoyle, Rupak Sarkar, Pranav Goel, and Philip Resnik. 2022. Are Neural Topic Models Broken?. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 5321–5344, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
See Also
Coherence
Are Neural Topic Models Broken? (2024)
Top Articles
5 Ways to Save Money when you're Broke | My Debt Epiphany
How To Build An AI Stock Prediction Software In 2024?
Terramia Brick Oven Pizza & Trattoria Menu
How to Become an Occupational Therapist (2024) • OT Potential
Gomovies Spiderman
Rescare Training Online
Noaa Marine Point Forecast
Binghamton Legacy Obits
P.o. Box 3002 Phoenixville Pa 19460
Devotion Showtimes Near The Grand 16 - Pier Park
Pubblicare Annunci Gratuiti - comprare e vendere usato in Italia | CLASF
Aes Salt Lake City Showdown
Vados X Male Reader
Studentvue Calexico
Loreal Smith Sarkisian Age
Paisanos Duncan Sc Menu
The Express from Lock Haven, Pennsylvania
I Gave 3 Designers $50 to Spend at T.J. Maxx — Here’s What They Bought
Rent A Stump Grinder Menards
Boyfriend (2018) | KDrama Recaps on Dramabeans
Mr Biggs Soul Sonic Force Net Worth
Home Depot Shopping On Line
Mendoza Clinic Pharmacy
What's My Wells Fargo Routing Number?
The Creator Showtimes Near Regal La Live
Green Light Auto Sales Dallas Photos
DNS server, what is it and why is it needed
Yalelightingconcepts
2009 Acura Tsx Serpentine Belt Diagram
Eli Lilly Clarifies It’s Not Offering Free Insulin After Tweet From Fake Verified Account—As Chaos Unfolds On Twitter
Metro 72 Hour Extension 2022
Pronounce Oneirology
Mayas Mexican Pell City
Blue Beetle Showtimes Near Regal Independence Plaza & Rpx
St. John’s Co-Cathedral: Visiting the gem of Valletta
Orange Door 8000 Price
Williamson Funeral Home Staunton Obituaries
Eddie Scozzare Salary
Shiny Flowers Belinda
Brenda89 Camsoda
Citymd West 104Th Urgent Care - Nyc Photos
Ups Locations Massachusetts
How To Add Friends On Regal App
Catholic Health Ambulatory Care At Commack
Black Gelato Strain Allbud
Cambria Dafont
Pmrank 2022
Gigamonster Outage
Miami Valley Harness Picks
Studio apartments for rent in Marseille, France - Rentberry
Latest Posts
Article information

Author: Neely Ledner

Last Updated:

Views: 6336

Rating: 4.1 / 5 (42 voted)

Reviews: 81% of readers found this page helpful

Author information

Name: Neely Ledner

Birthday: 1998-06-09

Address: 443 Barrows Terrace, New Jodyberg, CO 57462-5329

Phone: +2433516856029

Job: Central Legal Facilitator

Hobby: Backpacking, Jogging, Magic, Driving, Macrame, Embroidery, Foraging

Introduction: My name is Neely Ledner, I am a bright, determined, beautiful, adventurous, adventurous, spotless, calm person who loves writing and wants to share my knowledge and understanding with you.