Reproducing WMT14 #3175

MaxHahnbueck · 2024-11-19T13:28:27Z

I want to reproduce the WMT2014 benchmark and then adjust its data. As far as I understand, that means I can't use HELM directly, so I want to use the dataset and the way the prompt is constructed in my own pipeline. (The same applies to the evaluation code.)

Unfortunately, I can't find my way through the code.

I believe the PromptRunExpander is responsible for creating the prompt, but I can't figure out exactly what it does, which values it uses, and where they come from.

I would be happy for any explanation of how the information flows and how the final prompt is constructed.

yifanmai · 2024-12-07T00:41:39Z

Hi @MaxHahnbueck, here are some pointers to the code:

The main flow for the prompt generation happens in Runner.run_one here. Specifically, this calls WMT14Scenario.get_instances (link) to download and load instances into memory, and it calls GenerationAdapter.generate_requests (link) to turn them into prompt strings, which are placed in RequestState.request.input.text.

You may also want to check out the documentation if you haven't already. Hope this helps.
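The flow described above (instances are loaded, then turned into prompt strings) can be illustrated with a minimal sketch of few-shot prompt assembly. This is not HELM's implementation: `Example`, `build_prompt`, the `German: ` / `English: ` prefixes, and the blank-line separator are all hypothetical names and values chosen for illustration. The actual instruction text, prefixes, and separators should be read from the code paths mentioned above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Example:
    """One source/target sentence pair (hypothetical helper type)."""
    source: str
    target: str


def build_prompt(
    instructions: str,
    train_examples: List[Example],
    eval_source: str,
    input_prefix: str = "German: ",    # assumed prefix, not taken from HELM
    output_prefix: str = "English: ",  # assumed prefix, not taken from HELM
) -> str:
    """Assemble instructions, in-context examples, and the open-ended eval instance."""
    blocks = []
    if instructions:
        blocks.append(instructions)
    # Each in-context example shows both the input and its reference output.
    for ex in train_examples:
        blocks.append(f"{input_prefix}{ex.source}\n{output_prefix}{ex.target}")
    # The evaluated instance ends at the output prefix so the model completes it.
    blocks.append(f"{input_prefix}{eval_source}\n{output_prefix}".rstrip())
    return "\n\n".join(blocks)
```

In a custom pipeline, the returned string would play the role of the text that ends up in RequestState.request.input.text.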

MaxHahnbueck · 2025-01-08T14:13:56Z

Thanks. I think I have figured out most of it now.

If I understand correctly, the annotators in scenario_state here describe how postprocessing is done. WMT14 (and others such as med_qa) do not have these annotators.

But if I want to use BLEU-4 as the metric, I believe I need some kind of preprocessing, as my output looks like this:

It seems like there are two sentences to translate. Here are the translations:

1. German: Wiederaufnahme der Sitzungsperiode
   English: Resumption of the session

2. German: Sie stehen keine 100 Meter voneinander entfernt: Am Dienstag ist in Gutach die neue B 33-Fußgängerampel am Dorfparkplatz in Betrieb genommen worden - in Sichtweite der älteren Rathausampel.
   English: They are not 100 meters apart from each other: On Dienstag, in Gutach, the new B 33 pedestrian traffic light at Dorfparkplatz has been put into operation - in sight of the older town hall traffic light.

Maybe that's just unclean output from the model, but I think I only need the last part after "English:".

Where can I find the postprocessing steps?
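One way to do the extraction described above in a custom pipeline is a small regex pass over the completion before scoring. This is a sketch under assumptions, not a HELM postprocessing step: `extract_translations` and `last_translation` are hypothetical helper names, and the pattern assumes each translation sits on its own line starting with "English:", as in the output shown.

```python
import re
from typing import List

# Matches a line of the form "   English: <translation>" and captures the translation.
_ENGLISH_LINE = re.compile(
    r"^[ \t]*English:[ \t]*(.+?)[ \t]*$",
    re.IGNORECASE | re.MULTILINE,
)


def extract_translations(completion: str) -> List[str]:
    """Return every translation found after an 'English:' label, in order."""
    return [match.group(1) for match in _ENGLISH_LINE.finditer(completion)]


def last_translation(completion: str) -> str:
    """Return the last labelled translation, or the stripped completion if none is labelled."""
    found = extract_translations(completion)
    return found[-1] if found else completion.strip()
```

Falling back to the raw completion keeps already clean outputs unchanged, so the same function can be applied to every prediction before BLEU is computed against the reference.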

yifanmai added the user question label Dec 7, 2024
