Explore

arxiv.org
  • Show HN: Symbolic AI at Silver Medal, Boosts AlphaGeometry to Beat IMO Geo Gold

    Abstract: Proving geometric theorems constitutes a hallmark of visual...

  • Evaluating faithfulness and content selection of LLMs in book-length summaries

    Abstract: While long-context large language models (LLMs) can technically...

  • Apple Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

    Abstract: Recent advancements in multimodal large language models (MLLMs) have been...

  • Direct Nash Optimization: Teaching Language Models to Self-Improve

    Abstract: This paper studies post-training large language models (LLMs)...