
Renormalize LMNextToken.sample() probs to fix floating point errors #20

Merged · 1 commit · Jan 4, 2025

Conversation

gabegrand (Collaborator)

Due to numerical imprecision, the exponentiated next_token_logprobs occasionally do not sum to 1. When the discrepancy is too large, this causes np.random.choice to throw the following error:

      token_id = np.random.choice(len(probs), p=(probs))
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
      File "numpy/random/mtrand.pyx", line 975, in numpy.random.mtrand.RandomState.choice
    ValueError: probabilities do not sum to 1

This is observed empirically when using token masking and appears to occur more frequently for smaller models (e.g., llama-3.2-1B).

This PR fixes the issue by renormalizing next_token_logprobs after exponentiation. It is intended as a stopgap pending future work on the numerical stability of the token masking code.
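The fix can be sketched as follows. This is a minimal illustration, not the repository's actual `LMNextToken.sample()` implementation; the function and argument names are made up for the example:

```python
import numpy as np

def sample_token(next_token_logprobs, rng=None):
    """Sample a token id from next-token log-probabilities.

    Renormalizing after exponentiation guards against tiny
    floating-point drift tripping the sum-to-1 check inside
    np.random.choice / Generator.choice.
    """
    rng = rng or np.random.default_rng()
    probs = np.exp(next_token_logprobs)
    probs = probs / probs.sum()  # renormalize: the sum may deviate slightly from 1.0
    return int(rng.choice(len(probs), p=probs))
```

Without the renormalization line, a sum that deviates from 1 by more than NumPy's internal tolerance raises the `ValueError` shown above.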

@gabegrand gabegrand requested a review from alex-lew January 3, 2025 21:37
@alex-lew (Contributor) commented Jan 4, 2025

Looks good, as a stopgap! We probably want to have some way of emitting a warning when this happens, so that we catch future bugs relating to logprob computation.

@alex-lew alex-lew merged commit a191fca into main Jan 4, 2025
1 check passed
@gabegrand gabegrand deleted the gg/renormalize branch January 6, 2025 18:19
@gabegrand (Collaborator, Author)

Thanks! I agree about the warning. I didn't add one for now because I didn't want to introduce a ton of spam in the logs. In the future, we could consider making better use of the logging / warnings modules, which allow suppression of repeated warnings through filters.
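For illustration, the stdlib warnings filters mentioned here can deduplicate such messages. This is a hypothetical sketch, not code from this repository; `check_probs` and the tolerance are invented for the example:

```python
import warnings

# The "once" action reports each distinct warning message at most once
# per run, so repeated renormalizations don't flood the logs.
warnings.filterwarnings("once", category=RuntimeWarning)

def check_probs(probs, tol=1e-6):
    """Renormalize probs, warning if the sum drifts from 1 by more than tol."""
    total = probs.sum()
    if abs(total - 1.0) > tol:
        warnings.warn("renormalizing next-token probabilities", RuntimeWarning)
    return probs / total
```

Because the filter matches on message text and category, keeping the message constant (rather than interpolating the observed sum) is what makes the deduplication effective.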
