Cs-Cv

DroneKey++: A Size Prior-free Method and New Benchmark for Drone 3D Pose Estimation from Sequential Images

Computer Vision 5 JAN, 2026

DroneKey++: A Size Prior-free Method and New Benchmark for Drone 3D Pose Estimation from Sequential Images

By Seo-Bin Hwang

MGP-KAD: Multimodal Geometric Priors and Kolmogorov-Arnold Decoder for Single-View 3D Reconstruction in Complex Scenes

Computer Vision 5 JAN, 2026

MGP-KAD: Multimodal Geometric Priors and Kolmogorov-Arnold Decoder for Single-View 3D Reconstruction in Complex Scenes

By Luoxi Zhang

FloorplanVLM: A Vision-Language Model for Floorplan Vectorization

Computer Vision 6 JAN, 2026

FloorplanVLM: A Vision-Language Model for Floorplan Vectorization

By Yuanqing Liu

Rebenchmarking Unsupervised Monocular 3D Occupancy Prediction

Computer Vision 6 JAN, 2026

Rebenchmarking Unsupervised Monocular 3D Occupancy Prediction

By Zizhan Guo

Efficient-LVSM: Faster, Cheaper, and Better Large View Synthesis Model via Decoupled Co-Refinement Attention

Artificial Intelligence 6 JAN, 2026

Efficient-LVSM: Faster, Cheaper, and Better Large View Synthesis Model via Decoupled Co-Refinement Attention

By Xiaosong Jia

MetaSSP: Enhancing Semi-supervised Implicit 3D Reconstruction through Meta-adaptive EMA and SDF-aware Pseudo-label Evaluation

Computer Vision 5 JAN, 2026

MetaSSP: Enhancing Semi-supervised Implicit 3D Reconstruction through Meta-adaptive EMA and SDF-aware Pseudo-label Evaluation

By Luoxi Zhang

MultiGraspNet: A Multitask 3D Vision Model for Multi-gripper Robotic Grasping

Robotics 6 JAN, 2026

MultiGraspNet: A Multitask 3D Vision Model for Multi-gripper Robotic Grasping

By Stephany Ortuno-Chanelo

Same Answer, Different Representations: Hidden instability in VLMs

Artificial Intelligence 6 JAN, 2026

Same Answer, Different Representations: Hidden instability in VLMs

By Farooq Ahmad Wani

Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models

Machine Learning 12 JAN, 2026

Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models

By Wenda Li

Efficient Mixture-of-Expert for Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving

Artificial Intelligence 29 JAN, 2026

Efficient Mixture-of-Expert for Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving

By Jiyao Wang

Robust Detection of Retinal Neovascularization in Widefield Optical Coherence Tomography

Eess Iv 6 JAN, 2026

Robust Detection of Retinal Neovascularization in Widefield Optical Coherence Tomography

By Jinyi Hao

CompEvent: Complex-valued Event-RGB Fusion for Low-light Video Enhancement and Deblurring

Computer Vision 6 JAN, 2026

CompEvent: Complex-valued Event-RGB Fusion for Low-light Video Enhancement and Deblurring

By Mingchen Zhong

SPIDER: Scalable Physics-Informed Dexterous Retargeting

Robotics 5 JAN, 2026

SPIDER: Scalable Physics-Informed Dexterous Retargeting

By Chaoyi Pan

LL-ViT: Edge Deployable Vision Transformers with Look Up Table Neurons

Machine Learning 2 JAN, 2025

LL-ViT: Edge Deployable Vision Transformers with Look Up Table Neurons

By Shashank Nag

SyncAnyone: Implicit Disentanglement via Progressive Self-Correction for Lip-Syncing in the wild

Computer Vision 6 JAN, 2026

SyncAnyone: Implicit Disentanglement via Progressive Self-Correction for Lip-Syncing in the wild

By Xindi Zhang

Preserving Spectral Structure and Statistics in Diffusion Models

Computer Vision 6 JAN, 2026

Preserving Spectral Structure and Statistics in Diffusion Models

By Baohua Yan

Multi-Sensor Attention Networks for Automated Subsurface Delamination Detection in Concrete Bridge Decks

Eess Iv 6 JAN, 2026

Multi-Sensor Attention Networks for Automated Subsurface Delamination Detection in Concrete Bridge Decks

By Alireza Moayedikia

FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation

Computer Vision 6 JAN, 2026

FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation

By Yiyi Cai

A neuromorphic model of the insect visual system for natural image processing

Neural and Evolutionary Computing 6 JAN, 2026

A neuromorphic model of the insect visual system for natural image processing

By Adam D. Hines

Learning Human Visual Attention on 3D Surfaces through Geometry-Queried Semantic Priors

Computer Vision 6 JAN, 2026

Learning Human Visual Attention on 3D Surfaces through Geometry-Queried Semantic Priors

By Soham Pahari

POINTS-GUI-G: GUI-Grounding Journey

Computer Vision 6 JAN, 2026

POINTS-GUI-G: GUI-Grounding Journey

By Zhongyin Zhao

TFusionOcc: Student's t-Distribution Based Object-Centric Multi-Sensor Fusion Framework for 3D Occupancy Prediction

Artificial Intelligence 6 JAN, 2026

TFusionOcc: Student's t-Distribution Based Object-Centric Multi-Sensor Fusion Framework for 3D Occupancy Prediction

By Zhenxing Ming

Revisiting Salient Object Detection from an Observer-Centric Perspective

Artificial Intelligence 6 JAN, 2026

Revisiting Salient Object Detection from an Observer-Centric Perspective

By Fuxi Zhang

ChatUMM: Robust Context Tracking for Conversational Interleaved Generation

Computer Vision 6 JAN, 2026

ChatUMM: Robust Context Tracking for Conversational Interleaved Generation

By Wenxun Dai