Zero-shot Factual Consistency Evaluation Across Domains

Code, data, and models for the paper "Zero-shot Factual Consistency Evaluation Across Domains" (arXiv:2408.04114).

Abstract: This work addresses the challenge of factual consistency in text generation systems. We unify the tasks of Natural Language Inference, Summarization Evaluation, Factuality Verification and Factual Consistency Evaluation to train models capable of evaluating the factual consistency of source-target pairs across diverse domains. We rigorously evaluate these models against eight baselines on a comprehensive benchmark suite comprising 22 datasets that span various tasks, domains, and document lengths. Results demonstrate that our method achieves state-of-the-art performance on this heterogeneous benchmark while addressing efficiency concerns and attaining cross-domain generalization.
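
To make the idea of unifying several tasks into source-target pairs more concrete, here is a minimal sketch of what a shared schema could look like. It is an illustration only, not the preprocessing code from this repository: the `ConsistencyExample` class, the converter functions, the binary label convention (1 = consistent, 0 = inconsistent), and the FEVER-style verdict strings are all assumptions made for this example.

```python
from dataclasses import dataclass

# Hypothetical unified record: every task is reduced to a (source, target)
# pair with a binary consistency label. Field names are assumptions.
@dataclass
class ConsistencyExample:
    source: str   # grounding text: premise, document, or evidence
    target: str   # text to verify: hypothesis, summary, or claim
    label: int    # 1 = consistent with the source, 0 = inconsistent
    task: str     # originating task, kept for per-task analysis

# Assumed mapping: only entailment counts as consistent.
NLI_TO_BINARY = {"entailment": 1, "neutral": 0, "contradiction": 0}

# Assumed FEVER-style verdict strings for fact verification.
VERDICT_TO_BINARY = {"SUPPORTS": 1, "REFUTES": 0}


def from_nli(premise: str, hypothesis: str, label: str) -> ConsistencyExample:
    """Map an NLI triple onto the unified schema."""
    return ConsistencyExample(premise, hypothesis, NLI_TO_BINARY[label], "nli")


def from_summary(document: str, summary: str, is_consistent: bool) -> ConsistencyExample:
    """Map a summarization-evaluation example onto the unified schema."""
    return ConsistencyExample(document, summary, int(is_consistent), "summarization")


def from_fact_verification(evidence: str, claim: str, verdict: str) -> ConsistencyExample:
    """Map a fact-verification example onto the unified schema."""
    return ConsistencyExample(evidence, claim, VERDICT_TO_BINARY[verdict], "fact_verification")


if __name__ == "__main__":
    examples = [
        from_nli("A man is playing a guitar.", "A person is making music.", "entailment"),
        from_summary("The meeting was moved to Friday.", "The meeting was cancelled.", False),
        from_fact_verification("Paris is the capital of France.", "Paris is in France.", "SUPPORTS"),
    ]
    for ex in examples:
        print(ex.task, ex.label)
```

Under this assumed schema, one model can be trained and evaluated on all converted examples, and the `task` field lets scores be broken down by originating task afterwards.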

Models:

Data:

Results:

Overall Results available here
Dataset-specific Results available here

Cite this work as follows:

@misc{agarwal2024zeroshotfactualconsistencyevaluation,
      title={Zero-shot Factual Consistency Evaluation Across Domains}, 
      author={Raunak Agarwal},
      year={2024},
      eprint={2408.04114},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2408.04114}, 
}

Repository contents:

axolotl-ft
docs
evaluate
ft-t5
preprocess
.gitignore
LICENSE
README.md
requirements.txt

Repository: raunak-agarwal/factual-consistency-eval