From Bandit-Prompt Tuning to VTBench Visual Tokenizers and Greedy Decoding: Precision Tools for Scaling LLM Performance
Latest research summaries in ML, Robotics, CV, NLP and AI
🧠 Stay Focused, One Task at a Time — 30% Off Forget.work (Forever)
At State of AI, we’re always testing tools that make deep work easier. That’s why we’re excited to partner with Forget.work the #1 productivity tool designed for focused, distraction-free work.
Forget.work is purpose-built for single-tasking. It helps you beat time blindness and stay anchored to what you’re doing right now. Always visible on your screen, it gently reminds you to stay on task, no context switching, no endless tab juggling.
Whether you’re deep in a research paper, debugging an experiment, or trying to write without flipping to Slack or Twitter, Forget keeps your mind tethered to the task at hand.
As a State of AI reader, you get 30% off for life.
👉 Use the promo code STATEAI when checking out
10,000+ professionals already use it to get things done, ADHD or not. It’s like a subtle co-pilot for your workday, and we think you’ll love it.
Welcome to the 62 new subscribers since our last edition! Let’s jump straight into our AI research roundup!
Contents
AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database
Robin: A multi-agent system for automating scientific discovery
VTBench: Evaluating Visual Tokenizers for Autoregressive Image Generation
Granary: Speech Recognition and Translation Dataset in 25 European Languages
CIE: Controlling Language Model Text Generations Using Continuous Signals
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots
GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation
AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database
Authors: Rong Bian, Yu Geng, Zijian Yang, Bing Cheng
Source and references: https://arxiv.org/abs/2505.13406v1
Introduction
This paper proposes AutoMathKG, a high-quality, wide-coverage, and multi-dimensional knowledge graph (KG) for the field of mathematics, written in natural language and capable of automatic updates.
Keep reading with a 7-day free trial
Subscribe to State of AI to keep reading this post and get 7 days of free access to the full post archives.