ACL 2024

Annual Meeting of the Association for Computational Linguistics | Aug 11 – 16, 2024

The Association for Computational Linguistics (ACL)—a scientific and professional society for those working on computational problems involving human language—convenes an annual gathering with the latest research in the field. Computational linguistics and natural language processing (NLP) explore the development of computational models of various kinds of linguistic phenomena.

Discover Georgia Tech’s experts and their solutions in advancing NLP in the age of large language models and other rapidly evolving technologies.

Natural Language Processing (NLP) research helps computers understand and use human language. This allows AI systems to interact with people more naturally, like answering questions and translating language. Meet the Georgia Tech experts who are charting a path forward. #ACL2024

Opening Plenary

Georgia Tech at ACL 2024

Explore Georgia Tech’s experts and the organizations they are working with at ACL.

Includes Main and Findings Papers at ACL | Interact

Partner Organizations

Allen Institute for Artificial Intelligence • Amazon • Bloomberg • California Institute of Technology • Carnegie Mellon University • Cisco • Cornell University • Dartmouth College • East China Normal University • Emory University • Georgia Tech • Google • Harvard University • Heinrich-Heine University Düsseldorf • IBM • Inspir.ai • LG Corporation • Massachusetts Institute of Technology • Meta • Microsoft • Monash University • NAVER • Northeastern University • Ohio State University • Phillips-Universität Marburg • Portland State University • Rajiv Gandhi Institute of Technology • Renmin University of China • Seoul National University • Stanford University • The Chinese University of Hong Kong • Toyota Technological Institute at Chicago • Universidad de Vigo • University Hospital Essen • University of Arizona • University of California, Riverside • University of California, San Diego • University of Illinois at Urbana-Champaign • University of Mannheim • University of Texas at Austin • University of Texas at Dallas • University of Texas Southwestern Medical Center • University of Virginia • University of Washington • University of Wisconsin-Madison • Wesleyan University • West Virginia University • Yale University

Faculty with number of papers 🔗

Global Program

Explore ACL in one single view. More than 7.4K authors contribute to nearly 2000 papers in the Main and Findings Programs.

Search for your organization in the chart (multiselect from search/drop down menu to capture a complete view of a single organization (e.g. Google, Google Deepmind, Google Research).

Also type in part of the organization name to see 1st authors from the organization.

See Georgia Tech’s work here.


 FEATURE 

Limiting Privacy Risks in the Age of AI

By Nathan Deen

A new large-language model (LLM) developed by Georgia Tech researchers detects content that could risk the privacy of social media users and offers alternative phrasing that keeps the context of their posts intact.

Researchers set out to study user awareness of self-disclosure privacy risks on Reddit. It led to users learning just how much personal information they revealed, and that’s when they asked the team to help them strike a balance between sharing and safeguarding.

Wei Xu and Alan Ritter, faculty in the School of Interactive Computing and investigators on the new study. Photo: Kevin Beasley/College of Computing

 FEATURE 

Limiting Privacy Risks in the Age of AI

A new large-language model (LLM) developed by Georgia Tech researchers detects content that could risk the privacy of social media users and offers alternative phrasing that keeps the context of their posts intact.

Researchers set out to study user awareness of self-disclosure privacy risks on Reddit. It led to users learning just how much personal information they revealed, and that’s when they asked the team to help them strike a balance between sharing and safeguarding.

Pictured: Wei Xu and Alan Ritter, faculty in the School of Interactive Computing

Story by Nathan Deen
Photos by Kevin Beasley


 Gaurav Verma, Georgia Tech Ph.D. candidate who led the study

NEWS

Study Highlights Challenges in Detecting Violent Speech Aimed at Asian Communities

By Bryant Wine

A research group is calling for internet and social media moderators to strengthen their detection and intervention protocols for violent speech. 

Their study of language detection software found that algorithms struggle to differentiate anti-Asian violence-provoking speech from general hate speech. Left unchecked, threats of violence online can go unnoticed and turn into real-world attacks. 

Researchers from Georgia Tech and the Anti-Defamation League (ADL) teamed together in the study. They made their discovery while testing natural language processing (NLP) models trained on data they crowdsourced from Asian communities. 

Main Papers

A Community-Centric Perspective for Characterizing and Detecting Anti-Asian Violence-Provoking Speech
Gaurav Verma; Rynaa Grover; Jiawei Zhou; Binny Mathew; Jordan Kraemer; Munmun De Choudhury; Srijan Kumar

ARL2: Aligning Retrievers with Black-box Large Language Models via Self-guided Adaptive Relevance Labeling       
LingXi Zhang; Yue Yu; Kuan Wang; Chao Zhang

Cross-Modal Projection in Multimodal LLMs Doesn’t Really Project Visual Attributes to Textual Space            
Gaurav Verma; Minje Choi; Kartik Sharma; Jamelle Watson-Daniels; Sejoon Oh; Srijan Kumar

Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning         
Yue Yu; Jiaming Shen; Tianqi Liu; Zhen Qin; Jing Nathan Yan; Jialu Liu; Chao Zhang; Michael Bendersky

FactPICO: Factuality Evaluation for Plain Language Summarization of Medical Evidence       
Sebastian Antony Joseph; Lily Chen; Jan Trienes; Hannah Louisa Göke; Monika Coers; Wei Xu; Byron C Wallace; Junyi Jessy Li

Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation            
Yuan Yang; Siheng Xiong; Ali Payani; Ehsan Shareghi; Faramarz Fekri

Having Beer after Prayer? Measuring Cultural Bias in Large Language Models          
Tarek Naous; Michael J Ryan; Alan Ritter; Wei Xu

InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification         
Jan Trienes; Sebastian Antony Joseph; Jörg Schlötterer; Christin Seifert; Kyle Lo; Wei Xu; Byron C Wallace; Junyi Jessy Li

Large Language Models Can Learn Temporal Reasoning  
Siheng Xiong; Ali Payani; Ramana Rao Kompella; Faramarz Fekri

Leveraging Codebook Knowledge with NLI and ChatGPT for Zero-Shot Political Relation Classification             
Yibo Hu; Erick Skorupa Parolin; Latifur Khan; Patrick Brandt; Javier Osorio; Vito D’Orazio

Machine Unlearning of Pre-trained Large Language Models             
Jin Yao; Eli Chien; Minxin Du; Xinyao Niu; Tianhao Wang; Zezhou Cheng; Xiang Yue

MAP’s not dead yet: Uncovering true language model modes by conditioning away degeneracy           
Davis Yoshida; Kartik Goyal; Kevin Gimpel

Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding      
Ruohao Guo; Wei Xu; Alan Ritter

Multi-Level Feedback Generation with Large Language Models for Empowering Novice Peer Counselors
Alicja Chaszczewicz; Raj Sanjay Shah; Ryan Louie; Bruce A Arnow; Robert Kraut; Diyi Yang

NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms            
Jonathan Zheng; Alan Ritter; Wei Xu

Predicting Text Preference Via Structured Comparative Reasoning         
Jing Nathan Yan; Tianqi Liu; Justin T Chiu; Jiaming Shen; Zhen Qin; Yue Yu; Charumathi Lakshmanan; Yair Kurzion; Alexander M Rush; Jialu Liu; Michael Bendersky

Prototypical Reward Network for Data-Efficient Model Alignment         
Jinghan Zhang; Xiting Wang; Yiqiao Jin; Changyu Chen; Xinhao Zhang; Kunpeng Liu

RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records     
Ran Xu; Wenqi Shi; Yue Yu; Yuchen Zhuang; Bowen Jin; May Dongmei Wang; Joyce C. Ho; Carl Yang

Reducing Privacy Risks in Online Self-Disclosures with Language Models
Yao Dou; Isadora Krsek; Tarek Naous; Anubha Kabra; Sauvik Das; Alan Ritter; Wei Xu

Silent Signals, Loud Impact: LLMs for Word-Sense Disambiguation of Coded Dog Whistles 
Julia Kruk; Michela Marchini; Rijul Magu; Caleb Ziems; David Muchlinski; Diyi Yang

Unintended Impacts of LLM Alignment on Global Representation
Michael J Ryan; William Barr Held; Diyi Yang

Who Wrote this Code? Watermarking for Code Generation             
Taehyun Lee; Seokhee Hong; Jaewoo Ahn; Ilgee Hong; Hwaran Lee; Sangdoo Yun; Jamin Shin; Gunhee Kim

Findings Papers

A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task         
Jannik Brinkmann; Abhay Sheshadri; Victor Levoso; Paul Swoboda; Christian Bartelt

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models             
Gantavya Bhatt; Yifang Chen; Arnav Mohanty Das; Jifan Zhang; Sang T. Truong; Stephen Mussmann; Yinglun Zhu; Jeff Bilmes; Simon Shaolei Du; Kevin Jamieson; Jordan T. Ash; Robert D Nowak

Better Late Than Never: Model-Agnostic Hallucination Post-Processing Framework Towards Clinical Text Summarization             
Songda Li; Yunqi Zhang; Chunyuan Deng; Yake Niu; Hui Zhao

Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation      
Ruomeng Ding; Chaoyun Zhang; Lu Wang; Yong Xu; Minghua Ma; Wei Zhang; Si Qin; Saravan Rajmohan; Qingwei Lin; Dongmei Zhang

Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models             
Ran Xu; Hejie Cui; Yue Yu; Xuan Kan; Wenqi Shi; Yuchen Zhuang; May Dongmei Wang; Wei Jin; Joyce C. Ho; Carl Yang

LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting         
Haoxin Liu; Zhiyuan Zhao; Jindong Wang; Harshavardhan Kamarthi; B. Aditya Prakash

Measuring and Addressing Indexical Bias in Information Retrieval            
Caleb Ziems; William Barr Held; Jane Dwivedi-Yu; Diyi Yang

MM-SOC: Benchmarking Multimodal Large Language Models in Social Media Platforms 
Yiqiao Jin; Minje Choi; Gaurav Verma; Jindong Wang; Srijan Kumar

Perceptions of Language Technology Failures from South Asian English Speakers            
Faye Holt; William Barr Held; Diyi Yang

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs             
Rongzhi Zhang; Jiaming Shen; Tianqi Liu; Haorui Wang; Zhen Qin; feng han; Jialu Liu; Simon Baumgartner; Michael Bendersky; Chao Zhang

ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models          
Yuzhao Heng; Chunyuan Deng; Yitong Li; Yue Yu; Yinghao Li; Rongzhi Zhang; Chao Zhang

Self-Specialization: Uncovering Latent Expertise within Large Language Models          
Junmo Kang; Hongyin Luo; Yada Zhu; Jacob A Hansen; James R. Glass; David Daniel Cox; Alan Ritter; Rogerio Feris; Leonid Karlinsky

Simulated Misinformation Susceptibility (SMISTS): Enhancing Misinformation Research with Large Language Model Simulations       
Weicheng Ma; Chunyuan Deng; Aram Moossavi; Lili Wang; Soroush Vosoughi; Diyi Yang

Token Alignment via Character Matching for Subword Completion      
Ben Athiwaratkun; Shiqi Wang; Mingyue Shang; YUCHEN TIAN; Zijian Wang; Sujan Kumar Gonugondla; Sanjay Krishna Gouda; Robert Kwiatkowski; Ramesh Nallapati; Parminder Bhatia; Bing Xiang

Unveiling the Spectrum of Data Contamination in Language Model: A Survey from Detection to Remediation
Chunyuan Deng; Yilun Zhao; Yuzhao Heng; Yitong Li; Jiannan Cao; Xiangru Tang; Arman Cohan

Demo Papers

Wordflow: Social Prompt Engineering for Large Language Models
Zijie (Jay) Wang; Aishwarya Chakravarthy; David Munechika; Polo Chau

Tutorials

Automatic and Human-AI Interactive Text Generation (with a focus on Text Simplification and Revision)
Yao Dou; Philippe Laban; Claire Gardent; Wei Xu

See you in Bangkok!

Development: College of Computing
Project Lead/Data Graphics: Joshua Preston
News: Nathan Deen, Joshua Preston, Bryant Wine
Select Photos: Kevin Beasley
Data Management: Joni Isbell