shipfeed · AI news, curated daily

§ agents · storyline

Hedge-Bench benchmarks financial reasoning agents on hard tasks

Jun 2 · 19:11:56 · primary fetch · 1 source · updated Jun 2 · 19:11:56

This storyline groups 2 articles from 1 source. The originating feed didn’t ship an excerpt — open any link below to read the piece.

read full article on arxiv.org ↗

§ sources · 2 publications · timeline below

arxiv.org · Hedge-Bench: Benchmarking Agents on Hard, Realistic Tasks Pertaining to Financial Reasoning · primary · 19:11:56
arxiv.org · BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents · 18:12:34

§ how this story moved

18:12:34 · primary · arXiv cs.AI publishes the launch post.
19:11:56 · arXiv cs.AI picks up coverage.