Explore
Show HN: Symbolic AI at Silver Medal, Boosts AlphaGeometry to Beat IMO Geo Gold
Proving geometric theorems constitutes a hallmark of visual...
Evaluating faithfulness and content selection of LLMs in book-length summaries
While long-context large language models (LLMs) can technically...
Apple Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Recent advancements in multimodal large language models (MLLMs) have been...
Direct Nash Optimization: Teaching Language Models to Self-Improve
This paper studies post-training large language models (LLMs)...