Research2 min read
Alibaba's Qwen Team Builds HopChain to Fix How Vision Models Fall Apart During Multi-Step Reasoning
Alibaba's Qwen team and Tsinghua University have released HopChain, a training framework that forces vision-language models to verify intermediate reasoning steps before proceeding. The result: improvements on 20 of 24 benchmarks tested, with some scores more than doubling on hard visual reasoning tasks.