shipfeedAI news, curated daily

01:18:59 CET

pull to refreshlast sync 00:00:23

Just in — 30 new

§ agents · storyline

LongMemEval-V2: Evaluating Long-Term Agent Memory Toward Experienced Colleagues

LongMemEval-V2: Evaluating Long-Term Agent Memory Toward Experienced Colleagues

May 12 · 19:59:34 · primary fetch2 sourcesupdated May 12 · 19:59:34

This storyline groups 3 articles from 2 sources. The originating feed didn’t ship an excerpt — open any link below to read the piece.

read full article on arxiv.org ↗

§ sources3 publications · timeline below

arxiv.orgLongMemEval-V2: Evaluating Long-Term Agent Memory Toward Experienced Colleaguesprimary19:59:34
arxiv.orgMEME: Multi-entity & Evolving Memory Evaluation19:55:10
reddit.comVibe coding agents with evals on traces02:00:00

§ how this story moved

02:00:00primary — Reddit — AI Communities publishes the launch post.
19:55:10arXiv — cs.CL picks up coverage.
19:59:34arXiv — cs.CL picks up coverage.