Skip to content

A dataset for the ReNoVi paper (NAACL 2024 findings)

Notifications You must be signed in to change notification settings

zhanhl316/ReNoVi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 

Repository files navigation

RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations (NAACL'24 findings)

RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations (NAACL'24 findings)

Abstract: Norm violations occur when individuals fail to conform to culturally accepted behaviors, which may lead to potential conflicts. Remediating norm violations requires social awareness and cultural sensitivity of the nuances at play. To equip interactive AI systems with a remediation ability, we offer ReNoVi - a large-scale corpus of 9,258 multi-turn dialogues annotated with social norms, as well as define a sequence of tasks to help understand and remediate norm violations step by step. ReNoVi consists of two parts: 512 human-authored dialogues (real data), and 8,746 synthetic conversations generated by ChatGPT through prompt learning. While collecting sufficient human-authored data is costly, synthetic conversations provide suitable amounts of data to help mitigate the scarcity of training data, as well as the chance to assess the alignment between LLMs and humans in the awareness of social norms. We thus harness the power of ChatGPT to generate synthetic training data for our task. To ensure the quality of both human-authored and synthetic data, we follow a quality control protocol during data collection. Our experimental results demonstrate the importance of remediating norm violations in socio-cultural conversations, as well as the improvement in performance obtained from synthetic data.

Dataset Download

Coming soon...

Have any questions?

Please contact Haolan Zhan through haolan.zhan@monash.edu

About

A dataset for the ReNoVi paper (NAACL 2024 findings)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published