241107_nupa | Yi Hu (胡逸)

Why 9.9 > 9.11 is so hard for LLMs? 😫 Check out our new paper: Number Cookbook: Number Understanding of Language Models and How to Improve It, where we introduce a comprehensive benchmark covering four common numerical representations and 17 distinct numerical tasks and investigate the numerical understanding and processing ability (NUPA) of LLMs.