Shipfeed. AI News Channel

50 latest items

▶ ai·19:56

A Durability and Cross-Language Transfer Benchmark for a Validated Teaching-Feedback Classification Protocol

arXiv — cs.CL

▶ ai·19:38

AdvancedMathBench: A Benchmark Suite for Advanced Mathematical Proof Generation and Verification

arXiv — cs.CL

▶ ai·18:37

Forgetting Our Way to Shared Meaning: Effects of Forgetting on Conceptual Alignment in a Non-Partnership Coordination Game

arXiv — cs.CL

▶ ai·18:36

How Temperature Shapes Ideological Discourse in Retrieval-Augmented Generation?

arXiv — cs.CL

▶ ai·18:22

From Expressivity to Sample Complexity: Narrow Teachers for Transformers via C-RASP

arXiv — cs.CL

▶ ai·17:59

MET: Theory-Grounded and Culture-Aware Multilingual Moral Reasoning

arXiv — cs.CL

▶ ai·17:48

STEP: Career-Path Recommendation via Temporal and Educational Trajectory Modeling

arXiv — cs.CL

▶ ai·17:42

JobHop v2: A Large-Scale Career Trajectory Dataset from Unstructured Resumes

arXiv — cs.CL

▶ ai·17:33

Production and Perception in LLMs: A Token Probability Approach

arXiv — cs.CL

▶ ai·17:11

Losing My Composure: Predicting Compositionality Over Time

arXiv — cs.CL

▶ ai·16:28

Globally Consistent Coloring Schemes for Language Identification

arXiv — cs.CL

▶ ai·16:19

Beyond Benchmarks: Exposing the Hidden Crisis in Bangla Hate Speech Detection

arXiv — cs.CL

▶ ai·15:47

PaperRouter-Agent: A Content-Grounded LLM Agent for Personalized Hierarchical Paper Routing

arXiv — cs.CL

▶ ai·15:03

Dzongkha Next Word Prediction System

arXiv — cs.CL

▶ ai·14:56

SCOPE-RL: Optimizing Reasoning Paths Before and After Success

arXiv — cs.CL

▶ ai·14:54

GEIS: A Generation-Evaluation-Improvement Loop of Agent Skills for Long-Form Article Generation

arXiv — cs.CL

▶ ai·14:40

Communicating Chess Strategies in Natural Language

arXiv — cs.CL

▶ ai·19:11

Toward Real-Time Sentence-Level Sign Language Translation

arXiv — cs.CL

▶ ai·18:54

Tokenizer Transplantation: Mitigating Autoregressive Collapse in Edge-Efficient Bengali ASR

arXiv — cs.CL

▶ ai·17:36

FreyaTTS Technical Report

arXiv — cs.CL

▶ ai·17:16

Normalisation-Based Likelihood Ratio Estimation for Forensic Authorship Verification

arXiv — cs.CL

▶ ai·17:04

Neural Collapse Is Forbidden: Information Floors in Language Models

arXiv — cs.CL

▶ ai·14:57

Mach-Mind-4-Flash Technical Report

arXiv — cs.CL

▶ ai·14:28

DKCD: Domain Knowledge-Enhanced Causal Discovery from Unstructured Data

arXiv — cs.CL

▶ ai·14:19

Towards Detecting Inconsistencies in End-to-end Generated TODs

arXiv — cs.CL

▶ ai·13:07

Letter Lemmatization: One-to-one and Banded RNNs for Reversing Character-Set Simplification and Abbreviation in Medieval Text

arXiv — cs.CL

▶ ai·12:55

Super-Tuning: From Activation-Aware Pruning to Sparse Fine-Tuning

arXiv — cs.CL

▶ ai·11:16

Git-Assistant: Planning-Based Support for Updating Git Repositories

arXiv — cs.CL

▶ ai·10:49

Complexity-Guided Component-wise Initialization for Language Model Pretraining

arXiv — cs.CL

▶ ai·10:10

Scoped Verification for Reliable Long-Horizon Agentic Context Evolution under Distribution Shift

arXiv — cs.CL

▶ ai·08:52

MedRealMM: A Real-World Multimodal Benchmark for Chinese Online Medical Consultation

arXiv — cs.CL

▶ ai·19:59

UniClawBench: A Universal Benchmark for Proactive Agents on Real-World Tasks

arXiv — cs.CL

▶ ai·19:01

Do You Need a Frontier Model as a Citation Verifier? Benchmarking Rubric LLMs for Deep-Research Source Attribution

arXiv — cs.CL

▶ ai·18:16

DominoTree: Conditional Tree-Structured Drafting with Domino for Speculative Decoding

arXiv — cs.CL

▶ ai·17:32

It Takes a MAESTRO To Prune Bad Experts

arXiv — cs.CL

▶ ai·16:35

Improving Ad-hoc Search Effectiveness for Conversational Information Retrieval via Model Merging

arXiv — cs.CL

▶ ai·15:59

Cross-seed explainability using Procrustes-conditioned Joint End-to-end Top-K Sparse Autoencoders

arXiv — cs.CL

▶ ai·15:53

Ensemble Diversity Optimization for Subjective Supervision

arXiv — cs.CL

▶ ai·14:42

Detecting Ladder Logic Bombs in IEC 61131-3 PLC Programs using ESBMC-PLC+: A Formal Verification Approach with Trigger Synthesis

arXiv — cs.CL

▶ ai·14:21

Prompt Compression via Activation Aggregation

arXiv — cs.CL

▶ ai·14:18

Token-Flow Firewall: Semantic Runtime Auditing for Persistent AI Agents

arXiv — cs.CL

▶ ai·13:20

Echoes Across Vietnam's Highlands, Delta, and Coast: A Multilingual Corpus for Cham, Khmer, and Tay-Nung

arXiv — cs.CL

▶ ai·12:50

Grounded Event Extraction from SEC 8-K Filings with a Fine-Grained Taxonomy

arXiv — cs.CL

▶ ai·12:29

TypeProbe: Recovering Type Representations from Hidden States of Pre-trained Code Models

arXiv — cs.CL

▶ ai·12:17

XALPHA: A Memory-Driven AI Quant Researcher for Hypothesis-to-Code Alpha Discovery

arXiv — cs.CL

▶ ai·19:57

From Noisy Traces to Root Causes: Structural Trajectory Analysis and Causal Extraction for Agent Optimization

arXiv — cs.CL

▶ ai·19:32

Max Out GRPO Signal: Adaptive Trace Prefix Control for Hard Reasoning Problems

arXiv — cs.CL

▶ ai·19:24

Does Bielik Know What It Doesn't Know? Activation Dispersion Separates Entity Familiarity from Factual Reliability Across Model Scale

arXiv — cs.CL

▶ ai·17:51

PALS: Percentile-Aware Layerwise Sparsity for LLM Pruning

arXiv — cs.CL

▶ ai·17:46

Think Big, Search Small: Where Capacity Matters in Hierarchical Search Agents?

arXiv — cs.CL