▶ ai·
ad slot opena single understated line lives here — sponsor wordmark + a short line.advertise on shipfeed →
items50 latest
▶ ai·
Task-Adaptive Embedding Refinement via Test-time LLM Guidance
▶ ai·
MEME: Multi-entity & Evolving Memory Evaluation
▶ ai·
Routers Learn the Geometry of Their Experts: Geometric Coupling in Sparse Mixture-of-Experts
▶ ai·
Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs
▶ ai·
TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection
▶ ai·
ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models
▶ ai·
Geometric Factual Recall in Transformers
▶ ai·
Predicting Disagreement with Human Raters in LLM-as-a-Judge Difficulty Assessment without Using Generation-Time Probability Signals
▶ ai·
ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging
▶ ai·
Question Difficulty Estimation for Large Language Models via Answer Plausibility Scoring
▶ ai·
A Comparative Study of Controlled Text Generation Systems Using Level-Playing-Field Evaluation Principles
▶ ai·
Pretraining Exposure Explains Popularity Judgments in Large Language Models
▶ ai·
Context Convergence Improves Answering Inferential Questions
▶ ai·
Output Composability of QLoRA PEFT Modules for Plug-and-Play Attribute-Controlled Text Generation
▶ ai·
A categorical error sensitivity index (ISEC): A preventive ordinal decision-support measure for irrecoverable errors in manual data entry systems
▶ ai·
Overview of the MedHopQA track at BioCreative IX: track description, participation and evaluation of systems for multi-hop medical question answering
▶ ai·
On Predicting the Post-training Potential of Pre-trained LLMs
▶ ai·
Enhancing Target-Guided Proactive Dialogue Systems via Conversational Scenario Modeling and Intent-Keyword Bridging
▶ ai·
Multimodal Abstractive Summarization of Instructional Videos with Vision-Language Models
▶ ai·
StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning
▶ ai·
YFPO: A Preliminary Study of Yoked Feature Preference Optimization with Neuron-Guided Rewards for Mathematical Reasoning
▶ qwen·
Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models
▶ ai·
Concordance Comparison as a Means of Assembling Local Grammars
▶ ai·
UniVLR: Unifying Text and Vision in Visual Latent Reasoning for Multimodal LLMs
▶ ai·
Self-Distilled Trajectory-Aware Boltzmann Modeling: Bridging the Training-Inference Discrepancy in Diffusion Language Models
▶ ai·
Probabilistic Calibration Is a Trainable Capability in Language Models
▶ ai·
More Edits, More Stable: Understanding the Lifelong Normalization in Sequential Model Editing
▶ ai·
ROMER: Expert Replacement and Router Calibration for Robust MoE LLMs on Analog Compute-in-Memory Systems
▶ ai·
Choosing features for classifying multiword expressions
▶ ai·
Entropy Polarity in Reinforcement Fine-Tuning: Direction, Asymmetry, and Control
▶ ai·
From Token to Token Pair: Efficient Prompt Compression for Large Language Models in Clinical Prediction
▶ ai·
Safety-Oriented Evaluation of Language Understanding Systems for Air Traffic Control
▶ ai·
Training-Inference Consistent Segmented Execution for Long-Context LLMs
▶ ai·
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation
▶ ai·
AgentDisCo: Towards Disentanglement and Collaboration in Open-ended Deep Research Agents
▶ ai·
Slicing and Dicing: Configuring Optimal Mixtures of Experts
▶ ai·
Robust LLM Unlearning Against Relearning Attacks: The Minor Components in Representations Matter
▶ ai·
Human-Grounded Multimodal Benchmark with 900K-Scale Aggregated Student Response Distributions from Japan's National Assessment of Academic Ability
▶ ai·
ELF: Embedded Language Flows
▶ ai·
DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices
▶ ai·
Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning
▶ ai·
WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation
▶ ai·
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards
▶ ai·
Grounded or Guessing? LVLM Confidence Estimation via Blind-Image Contrastive Ranking
▶ ai·
Neural at ArchEHR-QA 2026: One Method Fits All: Unified Prompt Optimization for Clinical QA over EHRs
▶ ai·
Compute Where it Counts: Self Optimizing Language Models
▶ ai·
DGPO: Beyond Pairwise Preferences with Directional Consistent Groupwise Optimization
▶ ai·
RUBEN: Rule-Based Explanations for Retrieval-Augmented LLM Systems
▶ ai·