shipfeedAI news, curated daily

01:18:59 CET
21 MAY01:18:59shipfeed
pull to refreshlast sync
Just in — 30 new
§ agents · storyline

LongMemEval-V2: Evaluating Long-Term Agent Memory Toward Experienced Colleagues

LongMemEval-V2: Evaluating Long-Term Agent Memory Toward Experienced Colleagues

May 12 · · primary fetch2 sourcesupdated May 12 ·

This storyline groups 3 articles from 2 sources. The originating feed didn’t ship an excerpt — open any link below to read the piece.

read full article on arxiv.org
§ sources3 publications · timeline below
  1. arxiv.orgLongMemEval-V2: Evaluating Long-Term Agent Memory Toward Experienced Colleaguesprimary
  2. arxiv.orgMEME: Multi-entity & Evolving Memory Evaluation
  3. reddit.comVibe coding agents with evals on traces

§ how this story moved

  1. primaryReddit — AI Communities publishes the launch post.
  2. arXiv — cs.CL picks up coverage.
  3. arXiv — cs.CL picks up coverage.