CV Arxiv Daily

Updated on 2025.12.11

This page is maintained by Leheng Li that contains papers he interested in. Source code of this web is at here.

3D
Diffusion
Industry
Autonomous Driving
Traffic Simulation

3D

Publish Date	Title	Authors	PDF	Code
2025-12-09	Selfi: Self Improving Reconstruction Engine via 3D Geometric Feature Alignment	Youming Deng et.al.	2512.08930	null
2025-12-09	Efficiently Reconstructing Dynamic Scenes One D4RT at a Time	Chuhan Zhang et.al.	2512.08924	null
2025-12-09	Self-Evolving 3D Scene Generation from a Single Image	Kaizhi Zheng et.al.	2512.08905	null
2025-12-09	Tri-Bench: Stress-Testing VLM Reliability on Spatial Reasoning under Camera Tilt and Object Interference	Amit Bendkhale et.al.	2512.08860	null
2025-12-09	A Scalable Pipeline Combining Procedural 3D Graphics and Guided Diffusion for Photorealistic Synthetic Training Data Generation in White Button Mushroom Segmentation	Artúr I. Károly et.al.	2512.08747	null
2025-12-09	Dual-Branch Center-Surrounding Contrast: Rethinking Contrastive Learning for 3D Point Clouds	Shaofeng Zhang et.al.	2512.08673	null
2025-12-09	Ergodic Trajectory Planning with Dynamic Sensor Footprints	Ziyue Zheng et.al.	2512.08661	null
2025-12-09	OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics	Jisang Yoo et.al.	2512.08625	null
2025-12-09	SSCATeR: Sparse Scatter-Based Convolution Algorithm with Temporal Data Recycling for Real-Time 3D Object Detection in LiDAR Point Clouds	Alexander Dow et.al.	2512.08557	null
2025-12-09	Photo3D: Advancing Photorealistic 3D Generation through Structure-Aligned Detail Enhancement	Xinyue Liang et.al.	2512.08535	null
2025-12-09	OCCDiff: Occupancy Diffusion Model for High-Fidelity 3D Building Reconstruction from Noisy Point Clouds	Jialu Sui et.al.	2512.08506	null
2025-12-09	Learning to Control Physically-simulated 3D Characters via Generating and Mimicking 2D Motions	Jianan Li et.al.	2512.08500	null
2025-12-09	On-the-fly Large-scale 3D Reconstruction from Multi-Camera Rigs	Yijia Guo et.al.	2512.08498	null
2025-12-09	Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform	Yuning Gong et.al.	2512.08478	null
2025-12-09	A Multi-Agent LLM Framework for Design Space Exploration in Autonomous Driving Systems	Po-An Shih et.al.	2512.08476	null
2025-12-09	SDT-6D: Fully Sparse Depth-Transformer for Staged End-to-End 6D Pose Estimation in Industrial Multi-View Bin Picking	Nico Leuze et.al.	2512.08430	null
2025-12-09	SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos	Mingqi Gao et.al.	2512.08406	null
2025-12-09	TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels	Jiahao Lu et.al.	2512.08358	null
2025-12-09	HybridSplat: Fast Reflection-baked Gaussian Tracing using Hybrid Splatting	Chang Liu et.al.	2512.08334	null
2025-12-09	PointDico: Contrastive 3D Representation Learning Guided by Diffusion Models	Pengbo Li et.al.	2512.08330	null
2025-12-09	Detecting Dental Landmarks from Intraoral 3D Scans: the 3DTeethLand challenge	Achraf Ben-Hamadou et.al.	2512.08323	null
2025-12-09	PAVAS: Physics-Aware Video-to-Audio Synthesis	Oh Hyun-Bin et.al.	2512.08282	null
2025-12-09	Zero-Splat TeleAssist: A Zero-Shot Pose Estimation Framework for Semantic Teleoperation	Srijan Dokania et.al.	2512.08271	null
2025-12-09	Query-aware Hub Prototype Learning for Few-Shot 3D Point Cloud Semantic Segmentation	YiLin Zhou et.al.	2512.08253	null
2025-12-09	Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection	Haowen Zheng et.al.	2512.08247	null
2025-12-09	Semantic-Metric Bayesian Risk Fields: Learning Robot Safety from Human Videos with a VLM Prior	Timothy Chen et.al.	2512.08233	null
2025-12-09	SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection	Ching-Hung Cheng et.al.	2512.08223	null
2025-12-09	Tumor-anchored deep feature random forests for out-of-distribution detection in lung cancer segmentation	Aneesh Rangnekar et.al.	2512.08216	null
2025-12-09	Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement	Chia-Hern Lai et.al.	2512.08215	null
2025-12-09	RAVES-Calib: Robust, Accurate and Versatile Extrinsic Self Calibration Using Optimal Geometric Features	Haoxin Zhang et.al.	2512.08170	null
2025-12-09	CVP: Central-Peripheral Vision-Inspired Multimodal Model for Spatial Reasoning	Zeyuan Chen et.al.	2512.08135	null
2025-12-08	Generalizations of the Normalized Radon Cumulative Distribution Transform for Limited Data Recognition	Matthias Beckmann et.al.	2512.08099	null
2025-12-08	Voxify3D: Pixel Art Meets Volumetric Rendering	Yi-Chuan Huang et.al.	2512.07834	null
2025-12-08	WorldReel: 4D Video Generation with Consistent Geometry and Motion Modeling	Shaoheng Fang et.al.	2512.07821	null
2025-12-08	Inchworm-Inspired Soft Robot with Groove-Guided Locomotion	Hari Prakash Thanabalan et.al.	2512.07813	null
2025-12-08	Lang3D-XL: Language Embedded 3D Gaussians for Large-scale Scenes	Shai Krakovsky et.al.	2512.07807	null
2025-12-08	Multi-view Pyramid Transformer: Look Coarser to See Broader	Gyeongjin Kang et.al.	2512.07806	null
2025-12-08	UltrasODM: A Dual Stream Optical Flow Mamba Network for 3D Freehand Ultrasound Reconstruction	Mayank Anand et.al.	2512.07756	null
2025-12-09	ViSA: 3D-Aware Video Shading for Real-Time Upper-Body Avatar Creation	Fan Yang et.al.	2512.07720	null
2025-12-08	MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation	Zhiqi Li et.al.	2512.07628	null
2025-12-08	Online Segment Any 3D Thing as Instance Tracking	Hanshi Wang et.al.	2512.07599	null
2025-12-08	More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery	Wenzhen Dong et.al.	2512.07596	null
2025-12-08	Precise Liver Tumor Segmentation in CT Using a Hybrid Deep Learning-Radiomics Framework	Xuecheng Li et.al.	2512.07574	null
2025-12-09	From Orbit to Ground: Generative City Photogrammetry from Extreme Off-Nadir Satellite Images	Fei Yu et.al.	2512.07527	null
2025-12-09	MeshRipple: Structured Autoregressive Generation of Artist-Meshes	Junkai Lin et.al.	2512.07514	null
2025-12-08	ControlVP: Interactive Geometric Refinement of AI-Generated Images with Consistent Vanishing Points	Ryota Okumura et.al.	2512.07504	null
2025-12-08	Affordance Field Intervention: Enabling VLAs to Escape Memory Traps in Robotic Manipulation	Siyu Xu et.al.	2512.07472	null
2025-12-08	Human Geometry Distribution for 3D Animation Generation	Xiangjun Tang et.al.	2512.07459	null
2025-12-08	Reconstructing Objects along Hand Interaction Timelines in Egocentric Video	Zhifan Zhu et.al.	2512.07394	null
2025-12-08	Tessellation GS: Neural Mesh Gaussians for Robust Monocular Reconstruction of Dynamic Objects	Shuohan Tao et.al.	2512.07381	null
2025-12-08	ESPADA: Execution Speedup via Semantics Aware Demonstration Data Downsampling for Imitation Learning	Byungju Kim et.al.	2512.07371	null
2025-12-08	Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting	Shilong Jin et.al.	2512.07345	null
2025-12-08	Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery	Mai Tsujimoto et.al.	2512.07276	null
2025-12-08	A graph generation pipeline for critical infrastructures based on heuristics, images and depth data	Mike Diessner et.al.	2512.07269	null
2025-12-08	AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing	Ziming Hong et.al.	2512.07247	null
2025-12-08	Unified Camera Positional Encoding for Controlled Video Generation	Cheng Zhang et.al.	2512.07237	null
2025-12-08	STRinGS: Selective Text Refinement in Gaussian Splatting	Abhinav Raundhal et.al.	2512.07230	null
2025-12-09	VFM-VLM: Vision Foundation Model and Vision Language Model based Visual Comparison for 3D Pose Estimation	Md Selim Sarowar et.al.	2512.07215	null
2025-12-08	Object Pose Distribution Estimation for Determining Revolution and Reflection Uncertainty in Point Clouds	Frederik Hagelskjær et.al.	2512.07211	null
2025-12-08	AutoLugano: A Deep Learning Framework for Fully Automated Lymphoma Segmentation and Lugano Staging on FDG-PET/CT	Boyang Pan et.al.	2512.07206	null
2025-12-08	SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting	Seokhyun Youn et.al.	2512.07197	null
2025-12-08	MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale Adaptation	Muyu Xu et.al.	2512.07165	null
2025-12-09	COREA: Coarse-to-Fine 3D Representation Alignment Between Relightable 3D Gaussians and SDF via Bidirectional 3D-to-3D Supervision	Jaeyoon Lee et.al.	2512.07107	null
2025-12-07	RAVE: Rate-Adaptive Visual Encoding for 3D Gaussian Splatting	Hoang-Nhat Tran et.al.	2512.07052	null
2025-12-07	A Hetero-Associative Sequential Memory Model Utilizing Neuromorphic Signals: Validated on a Mobile Manipulator	Runcong Wang et.al.	2512.07032	null
2025-12-07	Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion	Yu Zhu et.al.	2512.06882	null
2025-12-07	Dynamic Visual SLAM using a General 3D Prior	Xingguang Zhong et.al.	2512.06868	null
2025-12-07	SparseCoop: Cooperative Perception with Kinematic-Grounded Queries	Jiahao Wang et.al.	2512.06838	null
2025-12-07	MeshSplatting: Differentiable Rendering with Opaque Meshes	Jan Held et.al.	2512.06818	null
2025-12-09	db-LaCAM: Fast and Scalable Multi-Robot Kinodynamic Motion Planning with Discontinuity-Bounded Search and Lightweight MAPF	Akmaral Moldagalieva et.al.	2512.06796	null
2025-12-07	Physics Informed Human Posture Estimation Based on 3D Landmarks from Monocular RGB-Videos	Tobias Leuthold et.al.	2512.06783	null
2025-12-07	RDSplat: Robust Watermarking Against Diffusion Editing for 3D Gaussian Splatting	Longjie Zhao et.al.	2512.06774	null
2025-12-07	EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy	Yumeng He et.al.	2512.06684	null
2025-12-07	A New Trajectory-Oriented Approach to Enhancing Comprehensive Crowd Navigation Performance	Xinyu Zhou et.al.	2512.06608	null
2025-12-06	GNC-Pose: Geometry-Aware GNC-PnP for Accurate 6D Pose Estimation	Xiujin Liu et.al.	2512.06565	null
2025-12-06	SUGAR: A Sweeter Spot for Generative Unlearning of Many Identities	Dung Thuy Nguyen et.al.	2512.06562	null
2025-12-06	AGORA: Adversarial Generation Of Real-time Animatable 3D Gaussian Head Avatars	Ramazan Fazylov et.al.	2512.06438	null
2025-12-06	Automated Deep Learning Estimation of Anthropometric Measurements for Preparticipation Cardiovascular Screening	Lucas R. Mareque et.al.	2512.06434	null
2025-12-06	DragMesh: Interactive 3D Generation Made Easy	Tianshan Zhang et.al.	2512.06424	null
2025-12-09	HuPrior3R: Incorporating Human Priors for Better 3D Dynamic Reconstruction from Monocular Videos	Weitao Xiong et.al.	2512.06368	null
2025-12-06	CryoHype: Reconstructing a thousand cryo-EM structures with transformer-based hypernetworks	Jeffrey Gu et.al.	2512.06332	null
2025-12-06	TriaGS: Differentiable Triangulation-Guided Geometric Consistency for 3D Gaussian Splatting	Quan Tran et.al.	2512.06269	null
2025-12-05	Physics-Grounded Attached Shadow Detection Using Approximate 3D Geometry and Light Direction	Shilin Hu et.al.	2512.06179	null
2025-12-05	Physics-Grounded Shadow Generation from Monocular 3D Geometry Priors and Approximate Light Direction	Shilin Hu et.al.	2512.06174	null
2025-12-05	Tracking-Guided 4D Generation: Foundation-Tracker Motion Priors for 3D Model Animation	Su Sun et.al.	2512.06158	null
2025-12-05	Shoot-Bounce-3D: Single-Shot Occlusion-Aware 3D from Lidar by Decomposing Two-Bounce Light	Tzofi Klinghoffer et.al.	2512.06080	null
2025-12-05	Representation Learning for Point Cloud Understanding	Siming Yan et.al.	2512.06058	null
2025-12-04	Neural reconstruction of 3D ocean wave hydrodynamics from camera sensing	Jiabin Liu et.al.	2512.06024	null
2025-12-05	Correspondence-Oriented Imitation Learning: Flexible Visuomotor Control with 3D Conditioning	Yunhao Cao et.al.	2512.05953	null
2025-12-05	A Comparative Study on Synthetic Facial Data Generation Techniques for Face Recognition	Pedro Vidal et.al.	2512.05928	null
2025-12-05	SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations	Wenhao Yan et.al.	2512.05905	null
2025-12-05	Optimal Safety-Aware Scheduling for Multi-Agent Aerial 3D Printing with Utility Maximization under Dependency Constraints	Marios-Nektarios Stamatopoulos et.al.	2512.05815	null
2025-12-05	3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering	Blanca Inigo et.al.	2512.05803	null
2025-12-05	Curvature-Regularized Variational Autoencoder for 3D Scene Reconstruction from Sparse Depth	Maryam Yousefi et.al.	2512.05783	null
2025-12-05	Label-Efficient Point Cloud Segmentation with Active Learning	Johannes Meyer et.al.	2512.05759	null
2025-12-05	Manifold-Aware Point Cloud Completion via Geodesic-Attentive Hierarchical Feature Learning	Jianan Sun et.al.	2512.05710	null
2025-12-05	OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors Reasoning	Xusheng Guo et.al.	2512.05698	null
2025-12-05	Hyperspectral Unmixing with 3D Convolutional Sparse Coding and Projected Simplex Volume Maximization	Gargi Panda et.al.	2512.05674	null
2025-12-05	LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection	Johannes Meier et.al.	2512.05663	null
2025-12-05	Fast SceneScript: Accurate and Efficient Structured Language Model via Multi-Token Prediction	Ruihong Yin et.al.	2512.05597	null
2025-12-05	Learning High-Fidelity Cloth Animation via Skinning-Free Image Transfer	Rong Wang et.al.	2512.05593	null
2025-12-05	MedDIFT: Multi-Scale Diffusion-Based Correspondence in 3D Medical Imaging	Xingyu Zhang et.al.	2512.05571	null
2025-12-05	Concept-based Explainable Data Mining with VLM for 3D Detection	Mai Tsujimoto et.al.	2512.05482	null
2025-12-05	TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression	Cheng-Yuan Ho et.al.	2512.05446	null
2025-12-05	EXR: An Interactive Immersive EHR Visualization in Extended Reality	Benoit Marteau et.al.	2512.05438	null
2025-12-05	The Dynamic Prior: Understanding 3D Structures for Casual Dynamic Videos	Zhuoyuan Wu et.al.	2512.05398	null
2025-12-05	PoolNet: Deep Learning for 2D to 3D Video Process Validation	Sanchit Kaul et.al.	2512.05362	null
2025-12-05	SplatPainter: Interactive Authoring of 3D Gaussians from 2D Edits via Test-Time Training	Yang Zheng et.al.	2512.05354	null
2025-12-05	SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling	Elisabetta Fedele et.al.	2512.05343	null
2025-12-04	Seabed-to-Sky Mapping of Maritime Environments with a Dual Orthogonal SONAR and LiDAR Sensor Suite	Christian Westerdahl et.al.	2512.05303	null
2025-12-04	ARCAS: An Augmented Reality Collision Avoidance System with SLAM-Based Tracking for Enhancing VRU Safety	Ahmad Yehia et.al.	2512.05299	null
2025-12-04	Inferring Compositional 4D Scenes without Ever Seeing One	Ahmet Berke Gokmen et.al.	2512.05272	null
2025-12-04	Age-Inclusive 3D Human Mesh Recovery for Action-Preserving Data Anonymization	Georgios Chatzichristodoulou et.al.	2512.05259	null
2025-12-04	Two-Stage Camera Calibration Method for Multi-Camera Systems Using Scene Geometry	Aleksandr Abramov et.al.	2512.05171	null
2025-12-08	Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting	Hao-Jen Chien et.al.	2512.05113	null
2025-12-04	ShadowDraw: From Any Object to Shadow-Drawing Compositional Art	Rundong Luo et.al.	2512.05110	null
2025-12-04	From Generated Human Videos to Physically Plausible Robot Trajectories	James Ni et.al.	2512.05094	null
2025-12-04	Object Reconstruction under Occlusion with Generative Priors and Contact-induced Constraints	Minghan Zhu et.al.	2512.05079	null
2025-12-04	Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image	Yanran Zhang et.al.	2512.05044	null
2025-12-04	Stable Single-Pixel Contrastive Learning for Semantic and Geometric Tasks	Leonid Pogorelyuk et.al.	2512.04970	null
2025-12-04	GeoPE:A Unified Geometric Positional Embedding for Structured Tensors	Yupu Yao et.al.	2512.04963	null
2025-12-04	LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging	Zhijian Shu et.al.	2512.04939	null
2025-12-04	Equivariant Symmetry-Aware Head Pose Estimation for Fetal MRI	Ramya Muthukrishnan et.al.	2512.04890	null
2025-12-04	Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing	Maria-Paola Forte et.al.	2512.04862	null
2025-12-04	RobustSplat++: Decoupling Densification, Dynamics, and Illumination for In-the-Wild 3DGS	Chuanyu Fu et.al.	2512.04815	null
2025-12-04	SIMA 2: A Generalist Embodied Agent for Virtual Worlds	SIMA team et.al.	2512.04797	null
2025-12-04	LaFiTe: A Generative Latent Field for 3D Native Texturing	Chia-Hao Chen et.al.	2512.04786	null
2025-12-04	Order Matters: 3D Shape Generation from Sequential VR Sketches	Yizi Chen et.al.	2512.04761	null
2025-12-04	MT-Depth: Multi-task Instance feature analysis for the Depth Completion	Abdul Haseeb Nizamani et.al.	2512.04734	null
2025-12-04	Bridging Simulation and Reality: Cross-Domain Transfer with Semantic 2D Gaussian Splatting	Jian Tang et.al.	2512.04731	null
2025-12-04	When Robots Should Say “I Don’t Know”: Benchmarking Abstention in Embodied Question Answering	Tao Wu et.al.	2512.04597	null
2025-12-05	COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence	Zefeng Zhang et.al.	2512.04563	null
2025-12-04	Gaussian Entropy Fields: Driving Adaptive Sparsity in 3D Gaussian Optimization	Hong Kuang et.al.	2512.04542	null
2025-12-04	Detection of Intoxicated Individuals from Facial Video Sequences via a Recurrent Fusion Model	Bita Baroutian et.al.	2512.04536	null
2025-12-04	Refaçade: Editing Object with Given Reference Texture	Youze Huang et.al.	2512.04534	null
2025-12-04	Auto3R: Automated 3D Reconstruction and Scanning via Data-driven Uncertainty Quantification	Chentao Shen et.al.	2512.04528	null
2025-12-04	SPLICE: Part-Level 3D Shape Editing from Local Semantic Extraction to Global Neural Mixing	Jin Zhou et.al.	2512.04514	null
2025-12-04	MARL Warehouse Robots	Price Allman et.al.	2512.04463	null
2025-12-04	UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3D Scenes	Changhe Liu et.al.	2512.04421	null
2025-12-04	RoboBPP: Benchmarking Robotic Online Bin Packing with Physics-based Simulation	Zhoufeng Wang et.al.	2512.04415	null
2025-12-04	MAFNet:Multi-frequency Adaptive Fusion Network for Real-time Stereo Matching	Ao Xu et.al.	2512.04358	null
2025-12-03	SyncTrack4D: Cross-Video Motion Alignment and Video Synchronization for Multi-Video 4D Gaussian Splatting	Yonghan Lee et.al.	2512.04315	null
2025-12-03	Mind-to-Face: Neural-Driven Photorealistic Avatar Synthesis via EEG Decoding	Haolin Xiong et.al.	2512.04313	null
2025-12-03	Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications	Gasser Elazab et.al.	2512.04303	null
2025-12-03	MVRoom: Controllable 3D Indoor Scene Generation with Multi-View Diffusion Models	Shaoheng Fang et.al.	2512.04248	null
2025-12-03	Look Around and Pay Attention: Multi-camera Point Tracking Reimagined with Transformers	Bishoy Galoaa et.al.	2512.04213	null
2025-12-03	Radiance Meshes for Volumetric Reconstruction	Alexander Mai et.al.	2512.04076	null
2025-12-03	RELIC: Interactive Video World Model with Long-Horizon Memory	Yicong Hong et.al.	2512.04040	null
2025-12-03	C3G: Learning Compact 3D Representations with 2K Gaussians	Honggyu An et.al.	2512.04021	null
2025-12-03	Learning Group Actions In Disentangled Latent Image Representations	Farhana Hossain Swarnali et.al.	2512.04015	null
2025-12-03	Emergent Outlier View Rejection in Visual Geometry Grounded Transformers	Jisang Han et.al.	2512.04012	null
2025-12-03	Artificial Microsaccade Compensation: Stable Vision for an Ornithopter	Levi Burner et.al.	2512.03995	null
2025-12-03	Tada-DIP: Input-adaptive Deep Image Prior for One-shot 3D Image Reconstruction	Evan Bell et.al.	2512.03962	null
2025-12-03	MUT3R: Motion-aware Updating Transformer for Dynamic 3D Reconstruction	Guole Shen et.al.	2512.03939	null
2025-12-03	UniMo: Unifying 2D Video and 3D Human Motion with an Autoregressive Framework	Youxin Pang et.al.	2512.03918	null
2025-12-03	An Automated Framework for Large-Scale Graph-Based Cerebrovascular Analysis	Daniele Falcetta et.al.	2512.03869	null
2025-12-03	DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D Reconstruction	Kaichen Zhang et.al.	2512.03715	null
2025-12-03	GaussianBlender: Instant Stylization of 3D Gaussians with Disentangled Latent Spaces	Melis Ocal et.al.	2512.03683	null
2025-12-03	LAMP: Language-Assisted Motion Planning for Controllable Video Generation	Muhammed Burak Kizil et.al.	2512.03619	null
2025-12-03	Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding	Haoran Zhou et.al.	2512.03601	null
2025-12-03	Harnessing Hypergraphs in Geometric Deep Learning for 3D RNA Inverse Folding	Guang Yang et.al.	2512.03592	null
2025-12-03	GAOT: Generating Articulated Objects Through Text-Guided Diffusion Models	Hao Sun et.al.	2512.03566	null
2025-12-03	OpenTrack3D: Towards Accurate and Generalizable Open-Vocabulary 3D Instance Segmentation	Zhishan Zhou et.al.	2512.03532	null
2025-12-03	Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles	Haicheng Liao et.al.	2512.03454	null
2025-12-03	GeoVideo: Introducing Geometric Regularization into Video Generation Model	Yunpeng Bai et.al.	2512.03453	null
2025-12-03	KeyPointDiffuser: Unsupervised 3D Keypoint Learning via Latent Diffusion Models	Rhys Newbury et.al.	2512.03450	null
2025-12-05	LM-CartSeg: Automated Segmentation of Lateral and Medial Cartilage and Subchondral Bone for Radiomics Analysis	Tongxu Zhang et.al.	2512.03449	null
2025-12-03	What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models	Tianchen Deng et.al.	2512.03422	null
2025-12-03	ShelfGaussian: Shelf-Supervised Open-Vocabulary Gaussian-based 3D Scene Understanding	Lingjun Zhao et.al.	2512.03370	null
2025-12-03	SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation	Yu Yuan et.al.	2512.03350	null
2025-12-03	Hierarchical Attention for Sparse Volumetric Anomaly Detection in Subclinical Keratoconus	Lynn Kandakji et.al.	2512.03346	null
2025-12-02	SpatialReasoner: Active Perception for Large-Scale 3D Scene Understanding	Hongpei Zheng et.al.	2512.03284	null
2025-12-02	LLM-Guided Material Inference for 3D Point Clouds	Nafiseh Izadyar et.al.	2512.03237	null
2025-12-02	Kaleidoscopic Scintillation Event Imaging	Alex Bocchieri et.al.	2512.03216	null
2025-12-02	Flux4D: Flow-based Unsupervised 4D Reconstruction	Jingkang Wang et.al.	2512.03210	null
2025-12-02	Does Head Pose Correction Improve Biometric Facial Recognition?	Justin Norman et.al.	2512.03199	null
2025-12-02	Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation	Zeqi Xiao et.al.	2512.03040	null
2025-12-02	SurfFill: Completion of LiDAR Point Clouds via Gaussian Surfel Splatting	Svenja Strobel et.al.	2512.03010	null
2025-12-02	DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images	Xiaoxue Chen et.al.	2512.03004	null
2025-12-03	DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling	Kairun Wen et.al.	2512.03000	null
2025-12-02	TEXTRIX: Latent Attribute Grid for Native Texture Generation and Beyond	Yifei Zeng et.al.	2512.02993	null
2025-12-02	GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection	Md Sohag Mia et.al.	2512.02991	null
2025-12-02	U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences	Xiang Xu et.al.	2512.02982	null
2025-12-02	BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection	Guowen Zhang et.al.	2512.02972	null
2025-12-02	Layout Anything: One Transformer for Universal Room Layout Estimation	Md Sohag Mia et.al.	2512.02952	null
2025-12-02	EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis	Yancheng Zhang et.al.	2512.02932	null
2025-12-02	Taming Camera-Controlled Video Generation with Verifiable Geometry Reward	Zhaoqing Wang et.al.	2512.02870	null
2025-12-02	DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions	Yifan Zhou et.al.	2512.02727	null
2025-12-02	PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes	Derui Shan et.al.	2512.02664	null
2025-12-02	PoreTrack3D: A Benchmark for Dynamic 3D Gaussian Splatting in Pore-Scale Facial Trajectory Tracking	Dong Li et.al.	2512.02648	null
2025-12-02	Content-Aware Texturing for Gaussian Splatting	Panagiotis Papantonakis et.al.	2512.02621	null
2025-12-02	AVGGT: Rethinking Global Attention for Accelerating VGGT	Xianbing Sun et.al.	2512.02541	null
2025-12-02	On the Problem of Consistent Anomalies in Zero-Shot Anomaly Detection	Tai Le-Gia et.al.	2512.02520	null
2025-12-02	Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Understanding	Yerim Jeon et.al.	2512.02487	null
2025-12-02	G-SHARP: Gaussian Surgical Hardware Accelerated Real-time Pipeline	Vishwesh Nath et.al.	2512.02482	null
2025-12-02	Vision to Geometry: 3D Spatial Memory for Sequential Embodied MLLM Reasoning and Exploration	Zhongyi Cai et.al.	2512.02458	null
2025-12-02	HouseLayout3D: A Benchmark and Training-Free Baseline for 3D Layout Estimation in the Wild	Valentin Bieri et.al.	2512.02450	null
2025-12-02	MitUNet: Enhancing Floor Plan Recognition using a Hybrid Mix-Transformer and U-Net Architecture	Dmitriy Parashchuk et.al.	2512.02413	null
2025-12-02	On-the-fly Feedback SfM: Online Explore-and-Exploit UAV Photogrammetry with Incremental Mesh Quality-Aware Indicator and Predictive Path Planning	Liyuan Lou et.al.	2512.02375	null
2025-12-02	TALO: Pushing 3D Vision Foundation Models Towards Globally Consistent Online Reconstruction	Fengyi Zhang et.al.	2512.02341	null
2025-12-02	Reasoning Path and Latent State Analysis for Multi-view Visual Spatial Reasoning: A Cognitive Science Perspective	Qiyao Xue et.al.	2512.02340	null
2025-12-02	VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM	Zihan Zhu et.al.	2512.02293	null
2025-12-01	DepthScape: Authoring 2.5D Designs via Depth Estimation, Semantic Understanding, and Geometry Extraction	Xia Su et.al.	2512.02263	null
2025-12-01	SplatSuRe: Selective Super-Resolution for Multi-view Consistent 3D Gaussian Splatting	Pranav Asthana et.al.	2512.02172	null
2025-12-01	CoatFusion: Controllable Material Coating in Images	Sagie Levy et.al.	2512.02143	null
2025-12-01	Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion	Shaowei Liu et.al.	2512.02017	null
2025-12-01	Generative Video Motion Editing with 3D Point Tracks	Yao-Chih Lee et.al.	2512.02015	null
2025-12-01	ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation	Chenyang Gu et.al.	2512.02013	null
2025-12-02	Is Image-based Object Pose Estimation Ready to Support Grasping?	Eric C. Joyce et.al.	2512.01856	null
2025-12-01	Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching	Yue Pan et.al.	2512.01850	null
2025-12-01	Seeing through Imagination: Learning Scene Geometry via Implicit Spatial World Modeling	Meng Cao et.al.	2512.01821	null
2025-12-01	IGen: Scalable Data Generation for Robot Learning from Open-World Images	Chenghao Gu et.al.	2512.01773	null
2025-12-01	AgriLiRa4D: A Multi-Sensor UAV Dataset for Robust SLAM in Challenging Agricultural Fields	Zhihao Zhan et.al.	2512.01753	null
2025-12-01	Open-world Hand-Object Interaction Video Generation Based on Structure and Contact-aware Representation	Haodong Yan et.al.	2512.01677	null
2025-12-02	SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge	Yumeng He et.al.	2512.01629	null
2025-12-01	Integrated YOLOP Perception and Lyapunov-based Control for Autonomous Mobile Robot Navigation on Track	Mo Chen et.al.	2512.01608	null
2025-12-01	FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention	Zipeng Wang et.al.	2512.01540	null
2025-12-01	ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling	Qisen Wang et.al.	2512.01481	null
2025-12-01	FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar Animation	Jian Shu et.al.	2512.01444	null
2025-12-01	PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications	Yunze Liu et.al.	2512.01383	null
2025-12-01	Textured Geometry Evaluation: Perceptual 3D Textured Shape Metric via 3D Latent-Geometry Network	Tianyu Luan et.al.	2512.01380	null
2025-12-01	SRAM: Shape-Realism Alignment Metric for No Reference 3D Shape Evaluation	Sheng Liu et.al.	2512.01373	null
2025-12-01	BlinkBud: Detecting Hazards from Behind via Sampled Monocular 3D Detection on a Single Earbud	Yunzhe Li et.al.	2512.01366	null
2025-12-01	OpenBox: Annotate Any Bounding Boxes in 3D	In-Jae Lee et.al.	2512.01352	null
2025-12-01	InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision	Chenting Wang et.al.	2512.01342	null
2025-12-01	TagSplat: Topology-Aware Gaussian Splatting for Dynamic Mesh Modeling and Tracking	Hanzhi Guo et.al.	2512.01329	null
2025-12-01	Panda: Self-distillation of Reusable Sensor-level Representations for High Energy Physics	Samuel Young et.al.	2512.01324	null
2025-12-01	Rethinking Intracranial Aneurysm Vessel Segmentation: A Perspective from Computational Fluid Dynamics Applications	Feiyang Xiao et.al.	2512.01319	null
2025-12-01	Gaussian Swaying: Surface-Based Framework for Aerodynamic Simulation with 3D Gaussians	Hongru Yan et.al.	2512.01306	null
2025-12-01	EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly	Xiaokun Pan et.al.	2512.01296	null
2025-12-01	S $^2$ -MLLM: Boosting Spatial Reasoning Capability of MLLMs for 3D Visual Grounding with Structural Guidance	Beining Xu et.al.	2512.01223	null
2025-12-03	TabletopGen: Instance-Level Interactive 3D Tabletop Scene Generation from Text or Single Image	Ziqian Wang et.al.	2512.01204	null
2025-12-01	VSRD++: Autolabeling for 3D Object Detection via Instance-Aware Volumetric Silhouette Rendering	Zihua Liu et.al.	2512.01178	null
2025-11-30	Learning Eigenstructures of Unstructured Data Manifolds	Roy Velich et.al.	2512.01103	null
2025-11-30	Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model	Jing He et.al.	2512.01030	null
2025-11-30	LISA-3D: Lifting Language-Image Segmentation to 3D via Multi-View Consistency	Zhongbin Guo et.al.	2512.01008	null
2025-11-30	S2AM3D: Scale-controllable Part Segmentation of 3D Point Cloud	Han Su et.al.	2512.00995	null
2025-11-30	Efficient and Scalable Monocular Human-Object Interaction Motion Reconstruction	Boran Wen et.al.	2512.00960	null
2025-11-30	Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation	An Yang et.al.	2512.00944	null
2025-11-30	ForamDeepSlice: A High-Accuracy Deep Learning Framework for Foraminifera Species Classification from 2D Micro-CT Slices	Abdelghafour Halimi et.al.	2512.00912	null
2025-11-30	SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead	Chaojun Ni et.al.	2512.00903	null
2025-11-30	Feed-Forward 3D Gaussian Splatting Compression with Long-Context Modeling	Zhening Liu et.al.	2512.00877	null
2025-11-30	TAP-CT: 3D Task-Agnostic Pretraining of Computed Tomography Foundation Models	Tim Veenboer et.al.	2512.00872	null
2025-11-30	Smol-GS: Compact Representations for Abstract 3D Gaussian Splatting	Haishan Wang et.al.	2512.00850	null
2025-11-30	PolarGS: Polarimetric Cues for Ambiguity-Free Gaussian Splatting with Accurate Geometry Recovery	Bo Guo et.al.	2512.00794	null
2025-11-30	EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes	Xiaoshan Wu et.al.	2512.00771	null
2025-11-30	REM: Evaluating LLM Embodied Spatial Reasoning through Multi-Frame Trajectories	Jacob Thompson et.al.	2512.00736	null
2025-11-30	Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer	Dong In Lee et.al.	2512.00677	null
2025-11-29	HAVEN: Hierarchical Adversary-aware Visibility-Enabled Navigation with Cover Utilization using Deep Transformer Q-Networks	Mihir Chauhan et.al.	2512.00592	null
2025-11-29	Describe Anything Anywhere At Any Moment	Nicolas Gorlo et.al.	2512.00565	null
2025-11-29	Asset-Driven Sematic Reconstruction of Dynamic Scene with Multi-Human-Object Interactions	Sandika Biswas et.al.	2512.00547	null
2025-11-29	Cross-Temporal 3D Gaussian Splatting for Sparse-View Guided Scene Update	Zeyuan An et.al.	2512.00534	null
2025-11-29	CC-FMO: Camera-Conditioned Zero-Shot Single Image to 3D Scene Generation with Foundation Model Orchestration	Boshi Tang et.al.	2512.00493	null
2025-11-29	PhysGen: Physically Grounded 3D Shape Generation for Industrial Design	Yingxuan You et.al.	2512.00422	null
2025-11-29	SplatFont3D: Structure-Aware Text-to-3D Artistic Font Generation with Part-Level Style Control	Ji Gan et.al.	2512.00413	null
2025-11-29	EZ-SP: Fast and Lightweight Superpoint-Based 3D Segmentation	Louis Geist et.al.	2512.00385	null
2025-11-29	Pore-scale Image Patch Dataset and A Comparative Evaluation of Pore-scale Facial Features	Dong Li et.al.	2512.00381	null
2025-11-29	Odometry Without Correspondence from Inertially Constrained Ruled Surfaces	Chenqi Zhu et.al.	2512.00327	null
2025-11-29	TGSFormer: Scalable Temporal Gaussian Splatting for Embodied Semantic Scene Completion	Rui Qian et.al.	2512.00300	null
2025-11-29	Words into World: A Task-Adaptive Agent for Language-Guided Spatial Retrieval in AR	Lixing Guo et.al.	2512.00294	null
2025-11-29	HeartFormer: Semantic-Aware Dual-Structure Transformers for 3D Four-Chamber Cardiac Point Cloud Reconstruction	Zhengda Ma et.al.	2512.00264	null
2025-11-29	“Why the face?”: Exploring Robot Error Detection Using Instrumented Bystander Reactions	Maria Teresa Parreira et.al.	2512.00262	null
2025-11-29	Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views	Kunwar Maheep Singh et.al.	2512.00255	null
2025-11-28	DenseScan: Advancing 3D Scene Understanding with 2D Dense Annotation	Zirui Wang et.al.	2512.00226	null
2025-11-28	ReactionMamba: Generating Short &Long Human Reaction Sequences	Hajra Anwar Beg et.al.	2512.00208	null
2025-11-28	Object-Centric Data Synthesis for Category-level Object Detection	Vikhyat Agarwal et.al.	2511.23450	null
2025-11-28	Machine Learning for Scientific Visualization: Ensemble Data Analysis	Hamid Gadirov et.al.	2511.23290	null
2025-11-28	Learning to Predict Aboveground Biomass from RGB Images with 3D Synthetic Scenes	Silvia Zuffi et.al.	2511.23249	null
2025-11-28	Language-guided 3D scene synthesis for fine-grained functionality understanding	Jaime Corsetti et.al.	2511.23230	null
2025-12-02	PointCNN++: Performant Convolution on Native Points	Lihan Li et.al.	2511.23227	null
2025-11-28	Zero-Shot Multi-Criteria Visual Quality Inspection for Semi-Controlled Industrial Environments via Real-Time 3D Digital Twin Simulation	Jose Moises Araya-Martinez et.al.	2511.23214	null
2025-11-28	GeoWorld: Unlocking the Potential of Geometry Models to Facilitate High-Fidelity 3D Scene Generation	Yuhao Wan et.al.	2511.23191	null
2025-12-01	Fast Multi-view Consistent 3D Editing with Video Priors	Liyi Chen et.al.	2511.23172	null
2025-11-28	SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models	Ruosen Zhao et.al.	2511.23075	null
2025-11-28	Image Valuation in NeRF-based 3D reconstruction	Grigorios Aris Cheimariotis et.al.	2511.23052	null
2025-11-28	GOATex: Geometry & Occlusion-Aware Texturing	Hyunjin Kim et.al.	2511.23051	null
2025-11-28	DiskChunGS: Large-Scale 3D Gaussian SLAM Through Chunk-Based Memory Management	Casimir Feldmann et.al.	2511.23030	null
2025-11-28	MrGS: Multi-modal Radiance Fields with 3D Gaussian Splatting for RGB-Thermal Novel View Synthesis	Minseong Kweon et.al.	2511.22997	null
2025-11-28	Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM	Shouhe Zhang et.al.	2511.22968	null
2025-11-28	HMR3D: Hierarchical Multimodal Representation for 3D Scene Understanding with Large Vision-Language Model	Chen Li et.al.	2511.22961	null
2025-12-01	DenoiseGS: Gaussian Reconstruction Model for Burst Denoising	Yongsen Cheng et.al.	2511.22939	null
2025-11-28	MICCAI STS 2024 Challenge: Semi-Supervised Instance-Level Tooth Segmentation in Panoramic X-ray and CBCT Images	Yaqi Wang et.al.	2511.22911	null
2025-11-28	ViGG: Robust RGB-D Point Cloud Registration using Visual-Geometric Mutual Guidance	Congjia Chen et.al.	2511.22908	null
2025-11-28	Scalable Diffusion Transformer for Conditional 4D fMRI Synthesis	Jungwoo Seo et.al.	2511.22870	null
2025-11-28	GLOW: Global Illumination-Aware Inverse Rendering of Indoor Scenes Captured with Dynamic Co-Located Light & Camera	Jiaye Wu et.al.	2511.22857	null
2025-11-28	Captain Safari: A World Engine	Yu-Cheng Chou et.al.	2511.22815	null
2025-11-27	Switching control of underactuated multi-channel systems with input constraints for cooperative manipulation	Dongjae Lee et.al.	2511.22810	null
2025-11-27	Beyond Egocentric Limits: Multi-View Depth-Based Learning for Robust Quadrupedal Locomotion	Rémy Rahem et.al.	2511.22744	null
2025-11-27	Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map Reconstruction	Boyao Zhou et.al.	2511.22704	null
2025-11-27	Emergent Extreme-View Geometry in 3D Foundation Models	Yiwen Zhang et.al.	2511.22686	null
2025-11-27	MG-Nav: Dual-Scale Visual Navigation via Sparse Spatial Memory	Bo Wang et.al.	2511.22609	null
2025-11-27	Bringing Your Portrait to 3D Presence	Jiawei Zhang et.al.	2511.22553	null
2025-11-27	Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration	Mengyu Yang et.al.	2511.22533	null
2025-11-27	AI killed the video star. Audio-driven diffusion model for expressive talking head generation	Baptiste Chopin et.al.	2511.22488	null
2025-11-27	Gaussians on Fire: High-Frequency Reconstruction of Flames	Jakob Nazarenus et.al.	2511.22459	null
2025-11-27	ITS3D: Inference-Time Scaling for Text-Guided 3D Diffusion Models	Zhenglin Zhou et.al.	2511.22456	null
2025-11-27	Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation	Weining Ren et.al.	2511.22429	null
2025-11-27	Wukong’s 72 Transformations: High-fidelity Textured 3D Morphing via Flow Models	Minghao Yin et.al.	2511.22425	null
2025-11-27	DiffStyle360: Diffusion-Based 360° Head Stylization via Style Fusion Attention	Furkan Guzelant et.al.	2511.22411	null
2025-11-27	UAV-MM3D: A Large-Scale Synthetic Benchmark for 3D Perception of Unmanned Aerial Vehicles with Multi-Modal Data	Longkun Zou et.al.	2511.22404	null
2025-11-27	BINDER: Instantly Adaptive Mobile Manipulation with Open-Vocabulary Commands	Seongwon Cho et.al.	2511.22364	null
2025-11-27	AnchorFlow: Training-Free 3D Editing via Latent Anchor-Aligned Flows	Zhenglin Zhou et.al.	2511.22357	null
2025-11-27	Can Protective Watermarking Safeguard the Copyright of 3D Gaussian Splatting?	Wenkai Huang et.al.	2511.22262	null
2025-11-27	ColonAdapter: Geometry Estimation Through Foundation Model Adaptation for Colonoscopy	Zhiyi Jiang et.al.	2511.22250	null
2025-11-27	MLATC: Fast Hierarchical Topological Mapping from 3D LiDAR Point Clouds Based on Adaptive Resonance Theory	Ryosuke Ofuchi et.al.	2511.22238	null
2025-11-27	Bridging 3D Deep Learning and Curation for Analysis and High-Quality Segmentation in Practice	Simon Püttmann et.al.	2511.22236	null
2025-11-27	IE-SRGS: An Internal-External Knowledge Fusion Framework for High-Fidelity 3D Gaussian Splatting Super-Resolution	Xiang Feng et.al.	2511.22233	null
2025-11-27	3D-Consistent Multi-View Editing by Diffusion Guidance	Josef Bengtson et.al.	2511.22228	null
2025-11-27	3D Affordance Keypoint Detection for Robotic Manipulation	Zhiyang Liu et.al.	2511.22195	null
2025-11-27	Controllable 3D Object Generation with Single Image Prompt	Jaeseok Lee et.al.	2511.22194	null
2025-11-27	RemedyGS: Defend 3D Gaussian Splatting against Computation Cost Attacks	Yanping Li et.al.	2511.22147	null
2025-11-27	Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation	Xiang Li et.al.	2511.22121	null
2025-11-27	MoE3D: Mixture of Experts meets Multi-Modal 3D Understanding	Yu Li et.al.	2511.22103	null
2025-11-27	SoftNash: Entropy-Regularized Nash Games for Non-Fighting Virtual Fixtures	Tai Inui et.al.	2511.22087	null
2025-11-27	SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model	Jiayuan Du et.al.	2511.22039	null
2025-11-26	PAT3D: Physics-Augmented Text-to-3D Scene Generation	Guying Lin et.al.	2511.21978	null
2025-11-26	TAPVid-360: Tracking Any Point in 360 from Narrow Field of View Video	Finlay G. C. Hudson et.al.	2511.21946	null
2025-11-26	AmodalGen3D: Generative Amodal 3D Object Reconstruction from Sparse Unposed Views	Junwei Zhou et.al.	2511.21945	null
2025-11-26	Comparing SAM 2 and SAM 3 for Zero-Shot Segmentation of 3D Medical Data	Satrajit Chakrabarty et.al.	2511.21926	null
2025-11-26	OpenTwinMap: An Open-Source Digital Twin Generator for Urban Autonomous Driving	Alex Richardson et.al.	2511.21925	null
2025-11-26	UniArt: Unified 3D Representation for Generating 3D Articulated Objects with Open-Set Articulation	Bu Jin et.al.	2511.21887	null
2025-11-25	LAYER: A Quantitative Explainable AI Framework for Decoding Tissue-Layer Drivers of Myofascial Low Back Pain	Zixue Zeng et.al.	2511.21767	null
2025-11-26	TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos	Seungjae Lee et.al.	2511.21690	null
2025-11-27	G $^2$ VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning	Wenbo Hu et.al.	2511.21688	null
2025-11-26	Revolutionizing Glioma Segmentation & Grading Using 3D MRI - Guided Hybrid Deep Learning Models	Pandiyaraju V et.al.	2511.21673	null
2025-11-26	Enhanced Landmark Detection Model in Pelvic Fluoroscopy using 2D/3D Registration Loss	Chou Mo et.al.	2511.21575	null
2025-11-26	Multimodal Robust Prompt Distillation for 3D Point Cloud Models	Xiang Gu et.al.	2511.21574	null
2025-11-26	UAVLight: A Benchmark for Illumination-Robust 3D Reconstruction in Unmanned Aerial Vehicle (UAV) Scenes	Kang Du et.al.	2511.21565	null
2025-11-26	Resolution Where It Counts: Hash-based GPU-Accelerated 3D Reconstruction via Variance-Adaptive Voxel Grids	Lorenzo De Rebotti et.al.	2511.21459	null
2025-11-26	E-M3RF: An Equivariant Multimodal 3D Re-assembly Framework	Adeela Islam et.al.	2511.21422	null
2025-11-26	HTTM: Head-wise Temporal Token Merging for Faster VGGT	Weitian Wang et.al.	2511.21317	null
2025-11-26	CaliTex: Geometry-Calibrated Attention for View-Coherent 3D Texture Generation	Chenyu Liu et.al.	2511.21309	null
2025-11-26	Unlocking Zero-shot Potential of Semi-dense Image Matching via Gaussian Splatting	Juncheng Chen et.al.	2511.21265	null
2025-11-28	FIELDS: Face reconstruction with accurate Inference of Expression using Learning with Direct Supervision	Chen Ling et.al.	2511.21245	null
2025-11-26	Scenes as Tokens: Multi-Scale Normal Distributions Transform Tokenizer for General 3D Vision-Language Understanding	Yutao Tang et.al.	2511.21191	null
2025-11-26	MarketGen: A Scalable Simulation Platform with Auto-Generated Embodied Supermarket Environments	Xu Hu et.al.	2511.21161	null
2025-11-26	Maglev-Pentabot: Magnetic Levitation System for Non-Contact Manipulation using Deep Reinforcement Learning	Guoming Huang et.al.	2511.21149	null
2025-11-26	FaithFusion: Harmonizing Reconstruction and Generation via Pixel-wise Information Gain	YuAn Wang et.al.	2511.21113	null
2025-11-26	Pygmalion Effect in Vision: Image-to-Clay Translation for Reflective Geometry Reconstruction	Gayoung Lee et.al.	2511.21098	null
2025-11-26	CLRecogEye : Curriculum Learning towards exploiting convolution features for Dynamic Iris Recognition	Geetanjali Sharma et.al.	2511.21097	null
2025-11-26	FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation	Kaixing Yang et.al.	2511.21029	null
2025-11-25	MODEST: Multi-Optics Depth-of-Field Stereo Dataset	Nisarg K. Trivedi et.al.	2511.20853	null
2025-11-25	RefTr: Recurrent Refinement of Confluent Trajectories for 3D Vascular Tree Centerline Graphs	Roman Naeem et.al.	2511.20823	null
2025-11-25	$Δ$ -NeRF: Incremental Refinement of Neural Radiance Fields through Residual Control and Knowledge Transfer	Kriti Ghosh et.al.	2511.20804	null
2025-11-25	Foundry: Distilling 3D Foundation Models for the Edge	Guillaume Letellier et.al.	2511.20721	null
2025-11-25	Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout	Hidir Yesiltepe et.al.	2511.20649	null
2025-11-25	LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight	Yunze Man et.al.	2511.20648	null
2025-11-25	3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding	Xiaoye Wang et.al.	2511.20646	null
2025-11-25	Vision-Language Memory for Spatial Reasoning	Zuntao Liu et.al.	2511.20644	null
2025-11-25	ShapeGen: Towards High-Quality 3D Shape Synthesis	Yangguang Li et.al.	2511.20624	null
2025-11-25	Wanderland: Geometrically Grounded Simulation for Open-World Embodied AI	Xinhao Liu et.al.	2511.20620	null
2025-11-25	Evaluating the Performance of Deep Learning Models in Whole-body Dynamic 3D Posture Prediction During Load-reaching Activities	Seyede Niloofar Hosseini et.al.	2511.20615	null
2025-11-25	Safe and Stable Neural Network Dynamical Systems for Robot Motion Planning	Allen Emmanuel Binny et.al.	2511.20593	null
2025-11-25	VibraVerse: A Large-Scale Geometry-Acoustics Alignment Dataset for Physically-Consistent Multimodal Learning	Bo Pang et.al.	2511.20422	null
2025-11-25	MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts	Zilong Huang et.al.	2511.20415	null
2025-11-26	VGGTFace: Topologically Consistent Facial Geometry Reconstruction in the Wild	Xin Ming et.al.	2511.20366	null
2025-11-25	GS-Checker: Tampering Localization for 3D Gaussian Splatting	Haoliang Han et.al.	2511.20354	null
2025-11-28	Quality-guided UAV Surface Exploration for 3D Reconstruction	Benjamin Sportich et.al.	2511.20353	null
2025-11-26	Thinking in 360°: Humanoid Visual Search in the Wild	Heyang Yu et.al.	2511.20351	null
2025-11-28	Material-informed Gaussian Splatting for 3D World Reconstruction in a Digital Twin	Andy Huynh et.al.	2511.20348	null
2025-11-25	AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend	Hengyi Wang et.al.	2511.20343	null
2025-11-25	3D Motion Perception of Binocular Vision Target with PID-CNN	Shi Jiazhao et.al.	2511.20332	null
2025-11-25	DAPointMamba: Domain Adaptive Point Mamba for Point Cloud Completion	Yinghui Li et.al.	2511.20278	null
2025-11-25	Zoo3D: Zero-Shot 3D Object Detection at Scene Level	Andrey Lemeshko et.al.	2511.20253	null
2025-11-25	Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation	Daniel Kienzle et.al.	2511.20250	null
2025-11-25	Robust 3D Brain MRI Inpainting with Random Masking Augmentation	Juexin Zhang et.al.	2511.20202	null
2025-11-26	SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery	Da Li et.al.	2511.20157	null
2025-11-25	Vision-Language Models for Automated 3D PET/CT Report Generation	Wenpei Jiao et.al.	2511.20145	null
2025-11-25	FLaTEC: Frequency-Disentangled Latent Triplanes for Efficient Compression of LiDAR Point Clouds	Xiaoge Zhang et.al.	2511.20065	null
2025-11-25	Active3D: Active High-Fidelity 3D Reconstruction via Hierarchical Uncertainty Quantification	Yan Li et.al.	2511.20050	null
2025-11-25	MFM-point: Multi-scale Flow Matching for Point Cloud Generation	Petr Molodyk et.al.	2511.20041	null
2025-11-25	VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction	Yu Hu et.al.	2511.19971	null
2025-11-26	GFT-GCN: Privacy-Preserving 3D Face Mesh Recognition with Spectral Diffusion	Hichem Felouat et.al.	2511.19958	null
2025-11-25	Collaborate sim and real: Robot Bin Packing Learning in Real-world and Physical Engine	Lidi Zhang et.al.	2511.19932	null
2025-11-25	Coupled Physics-Gated Adaptation: Spatially Decoding Volumetric Photochemical Conversion in Complex 3D-Printed Objects	Maryam Eftekharifar et.al.	2511.19913	null
2025-11-25	Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance	Haoxuan Wang et.al.	2511.19909	null
2025-11-25	MHB: Multimodal Handshape-aware Boundary Detection for Continuous Sign Language Recognition	Mingyu Zhao et.al.	2511.19907	null
2025-11-25	GigaWorld-0: World Models as Data Engine to Empower Embodied AI	GigaWorld Team et.al.	2511.19861	null
2025-11-25	STAvatar: Soft Binding and Temporal Density Control for Monocular 3D Head Avatars Reconstruction	Jiankuo Zhao et.al.	2511.19854	null
2025-11-25	4DWorldBench: A Comprehensive Evaluation Framework for 3D/4D World Generation Models	Yiting Lu et.al.	2511.19836	null
2025-11-24	Prune-Then-Plan: Step-Level Calibration for Stable Frontier Exploration in Embodied Question Answering	Noah Frahm et.al.	2511.19768	null
2025-11-24	A Storage-Efficient Feature for 3D Concrete Defect Segmentation to Replace Normal Vector	Linxin Hua et.al.	2511.19760	null
2025-11-24	Multi-Agent gatekeeper: Safe Flight Planning and Formation Control for Urban Air Mobility	Thomas Marshall Vielmetti et.al.	2511.19691	null
2025-11-24	Proxy-Free Gaussian Splats Deformation with Splat-Based Surface Estimation	Jaeyeong Kim et.al.	2511.19542	null
2025-11-24	MapRF: Weakly Supervised Online HD Map Construction via NeRF-Guided Self-Training	Hongyu Lyu et.al.	2511.19527	null
2025-11-24	Blinking Beyond EAR: A Stable Eyelid Angle Metric for Driver Drowsiness Detection and Data Augmentation	Mathis Wolter et.al.	2511.19519	null
2025-11-24	Single Image to High-Quality 3D Object via Latent Features	Huanning Dong et.al.	2511.19512	null
2025-11-24	The Determinant Ratio Matrix Approach to Solving 3D Matching and 2D Orthographic Projection Alignment Tasks	Andrew J. Hanson et.al.	2511.19511	null
2025-11-25	Cloud4D: Estimating Cloud Properties at a High Spatial and Temporal Resolution	Jacob Lin et.al.	2511.19431	null
2025-11-24	Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution	Dingkang Liang et.al.	2511.19430	null
2025-11-24	Ref-SAM3D: Bridging SAM3D with Text for Reference 3D Reconstruction	Yun Zhou et.al.	2511.19426	null
2025-11-24	Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens	Yiming Qin et.al.	2511.19418	null
2025-11-24	Real-Time Object Tracking with On-Device Deep Learning for Adaptive Beamforming in Dynamic Acoustic Environments	Jorge Ortigoso-Narro et.al.	2511.19396	null
2025-11-24	MonoMSK: Monocular 3D Musculoskeletal Dynamics Estimation	Farnoosh Koleini et.al.	2511.19326	null
2025-11-24	SyncMV4D: Synchronized Multi-view Joint Diffusion of Appearance and Motion for Hand-Object Interaction Synthesis	Lingwei Dang et.al.	2511.19319	null
2025-11-24	IDEAL-M3D: Instance Diversity-Enriched Active Learning for Monocular 3D Detection	Johannes Meier et.al.	2511.19301	null
2025-11-24	DensifyBeforehand: LiDAR-assisted Content-aware Densification for Efficient and Quality 3D Gaussian Splatting	Phurtivilai Patt et.al.	2511.19294	null
2025-11-24	LAST: LeArning to Think in Space and Time for Generalist Vision-Language Models	Shuai Wang et.al.	2511.19261	null
2025-11-24	Adversarial Patch Attacks on Vision-Based Cargo Occupancy Estimation via Differentiable 3D Simulation	Mohamed Rissal Hedna et.al.	2511.19254	null
2025-11-24	IDSplat: Instance-Decomposed 3D Gaussian Splatting for Driving Scenes	Carl Lindström et.al.	2511.19235	null
2025-11-24	Learning Plug-and-play Memory for Guiding Video Diffusion Models	Selena Song et.al.	2511.19229	null
2025-11-24	Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving	Jianhua Han et.al.	2511.19221	null
2025-11-24	ReAlign: Text-to-Motion Generation via Step-Aware Reward-Guided Alignment	Wanjiang Weng et.al.	2511.19217	null
2025-11-24	Soft pneumatic grippers: Topology optimization, 3D-printing and experimental validation	Prabhat Kumar et.al.	2511.19211	null
2025-11-24	NVGS: Neural Visibility for Occlusion Culling in 3D Gaussian Splatting	Brent Zoomers et.al.	2511.19202	null
2025-11-24	Efficient Optimization of a Permanent Magnet Array for a Stable 2D Trap	Ann-Sophia Müller et.al.	2511.19201	null
2025-11-24	Three-Dimensional Anatomical Data Generation Based on Artificial Neural Networks	Ann-Sophia Müller et.al.	2511.19198	null
2025-11-24	nnActive: A Framework for Evaluation of Active Learning in 3D Biomedical Segmentation	Carsten T. Lüth et.al.	2511.19183	null
2025-11-24	MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes	Kehua Chen et.al.	2511.19172	null
2025-11-24	FilmSceneDesigner: Chaining Set Design for Procedural Film Scene Generation	Zhifeng Xie et.al.	2511.19137	null
2025-11-24	MonoSR: Open-Vocabulary Spatial Reasoning from Monocular Images	Qirui Wang et.al.	2511.19119	null
2025-11-24	Graph-based 3D Human Pose Estimation using WiFi Signals	Jichao Chen et.al.	2511.19105	null
2025-11-24	DEAP-3DSAM: Decoder Enhanced and Auto Prompt SAM for 3D Medical Image Segmentation	Fangda Chen et.al.	2511.19071	null
2025-11-24	LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space	Hai Wu et.al.	2511.19057	null
2025-11-26	Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors	Yuchen Zhou et.al.	2511.19031	null
2025-11-24	A Self-Conditioned Representation Guided Diffusion Model for Realistic Text-to-LiDAR Scene Generation	Wentao Qu et.al.	2511.19004	null
2025-11-24	View-Consistent Diffusion Representations for 3D-Consistent Video Generation	Duolikun Danier et.al.	2511.18991	null
2025-11-24	FineXtrol: Controllable Motion Generation via Fine-Grained Text	Keming Shen et.al.	2511.18927	null
2025-11-24	MatMart: Material Reconstruction of 3D Objects via Diffusion	Xiuchao Wu et.al.	2511.18900	null
2025-11-24	MagicWorld: Interactive Geometry-driven Video World Exploration	Guangyuan Li et.al.	2511.18886	null
2025-11-24	Neural Texture Splatting: Expressive 3D Gaussian Splatting for View Synthesis, Geometry, and Dynamic Reconstruction	Yiming Wang et.al.	2511.18873	null
2025-11-24	Robust Long-term Test-Time Adaptation for 3D Human Pose Estimation through Motion Discretization	Yilin Wen et.al.	2511.18851	null
2025-11-25	Disc3D: Automatic Curation of High-Quality 3D Dialog Data via Discriminative Object Referring	Siyuan Wei et.al.	2511.18817	null
2025-11-24	DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video	Jiawei Hou et.al.	2511.18814	null
2025-11-24	TPG-INR: Target Prior-Guided Implicit 3D CT Reconstruction for Enhanced Sparse-view Imaging	Qinglei Cao et.al.	2511.18806	null
2025-11-24	PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion	Yichen Yang et.al.	2511.18801	null
2025-11-24	StereoDETR: Stereo-based Transformer for 3D Object Detection	Shiyi Mu et.al.	2511.18788	null
2025-11-24	NI-Tex: Non-isometric Image-based Garment Texture Generation	Hui Shan et.al.	2511.18765	null
2025-11-24	SP-VINS: A Hybrid Stereo Visual Inertial Navigation System based on Implicit Environmental Map	Xueyu Du et.al.	2511.18756	null
2025-11-24	Yo’City: Personalized and Boundless 3D Realistic City Scene Generation via Self-Critic Expansion	Keyang Lu et.al.	2511.18734	null
2025-11-24	DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving	Hongbin Lin et.al.	2511.18713	null
2025-11-24	Asynchronous Distributed Multi-Robot Motion Planning Under Imperfect Communication	Ardalan Tajbakhsh et.al.	2511.18703	null
2025-11-24	Exploring Surround-View Fisheye Camera 3D Object Detection	Changcai Li et.al.	2511.18695	null
2025-11-24	Hierarchical GraphCut Phase Unwrapping based on Invariance of Diffeomorphisms Framework	Xiang Gao et.al.	2511.18682	null
2025-11-24	Inverse Rendering for High-Genus Surface Meshes from Multi-View Images	Xiang Gao et.al.	2511.18680	null
2025-11-24	Neural Geometry Image-Based Representations with Optimal Transport (OT)	Xiang Gao et.al.	2511.18679	null
2025-11-23	From Healthy Scans to Annotated Tumors: A Tumor Fabrication Framework for 3D Brain MRI Synthesis	Nayu Dong et.al.	2511.18654	null
2025-11-23	RigAnyFace: Scaling Neural Facial Mesh Auto-Rigging with Unlabeled Data	Wenchao Ma et.al.	2511.18601	null
2025-11-23	NeAR: Coupled Neural Asset-Renderer Stack	Hong Li et.al.	2511.18600	null
2025-11-23	PhysGS: Bayesian-Inferred Gaussian Splatting for Physical Property Estimation	Samarth Chopra et.al.	2511.18570	null
2025-11-23	C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction	Kuan Wei Huang et.al.	2511.18559	null
2025-11-23	LRDUN: A Low-Rank Deep Unfolding Network for Efficient Spectral Compressive Imaging	He Huang et.al.	2511.18513	null
2025-11-23	Expanding the Workspace of Electromagnetic Navigation Systems Using Dynamic Feedback for Single- and Multi-agent Control	Jasan Zughaibi et.al.	2511.18486	null
2025-11-23	Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span	Heeseung Yun et.al.	2511.18470	null
2025-11-23	EventBench: Towards Comprehensive Benchmarking of Event-based MLLMs	Shaoyu Liu et.al.	2511.18448	null
2025-11-23	ReCoGS: Real-time ReColoring for Gaussian Splatting scenes	Lorenzo Rutayisire et.al.	2511.18441	null
2025-11-23	CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images	Avishka Perera et.al.	2511.18424	null
2025-11-23	NeuroVascU-Net: A Unified Multi-Scale and Cross-Domain Adaptive Feature Fusion U-Net for Precise 3D Segmentation of Brain Vessels in Contrast-Enhanced T1 MRI	Mohammad Jafari Vayeghan et.al.	2511.18422	null
2025-11-23	SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation	Peter Siegel et.al.	2511.18386	null
2025-11-23	MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models	Xiyang Wu et.al.	2511.18373	null
2025-11-23	MimiCAT: Mimic with Correspondence-Aware Cascade-Transformer for Category-Free 3D Pose Transfer	Zenghao Chai et.al.	2511.18370	null
2025-11-23	Optimal Pose Guidance for Stereo Calibration in 3D Deformation Measurement	Dongcai Tan et.al.	2511.18317	null
2025-11-23	MicCheck: Repurposing Off-the-Shelf Pin Microphones for Easy and Low-Cost Contact Sensing	Steven Oh et.al.	2511.18299	null
2025-11-23	AIA-UltraNeRF:Acoustic-Impedance-Aware Neural Radiance Field with Hash Encodings for Robotic Ultrasound Reconstruction and Localization	Shuai Zhang et.al.	2511.18293	null
2025-11-23	SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes	Jungho Lee et.al.	2511.18290	null
2025-11-23	UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization	Siyi Li et.al.	2511.18254	null
2025-11-22	AFT: Appearance-Based Feature Tracking for Markerless and Training-Free Shape Reconstruction of Soft Robots	Shangyuan Yuan et.al.	2511.18215	null
2025-11-22	MotionDuet: Dual-Conditioned 3D Human Motion Generation with Video-Regularized Text Learning	Yi-Yang Zhang et.al.	2511.18209	null
2025-11-22	InfiniBench: Infinite Benchmarking for Visual Spatial Reasoning with Customizable Scene Complexity	Haoming Wang et.al.	2511.18200	null
2025-11-22	Unified Spherical Frontend: Learning Rotation-Equivariant Representations of Spherical Images from Any Camera	Mukai Yu et.al.	2511.18174	null
2025-11-22	EgoControl: Controllable Egocentric Video Generation via 3D Full-Body Poses	Enrico Pallotta et.al.	2511.18173	null
2025-11-22	scipy.spatial.transform: Differentiable Framework-Agnostic 3D Transformations in Python	Martin Schuck et.al.	2511.18157	null
2025-11-22	Observer Actor: Active Vision Imitation Learning with Sparse View Gaussian Splatting	Yilong Wang et.al.	2511.18140	null
2025-11-22	SFHand: A Streaming Framework for Language-guided 3D Hand Forecasting and Embodied Manipulation	Ruicong Liu et.al.	2511.18127	null
2025-11-22	Muskie: Multi-view Masked Image Modeling for 3D Vision Pre-training	Wenyu Li et.al.	2511.18115	null
2025-11-22	Is Complete Labeling Necessary? Understanding Active Learning in Longitudinal Medical Imaging	Siteng Ma et.al.	2511.18007	null
2025-11-22	RAISECity: A Multimodal Agent Framework for Reality-Aligned 3D World Generation at City-Scale	Shengyuan Wang et.al.	2511.18005	null
2025-11-22	Signal: Selective Interaction and Global-local Alignment for Multi-Modal Object Re-Identification	Yangyang Liu et.al.	2511.17965	null
2025-11-22	RoboArmGS: High-Quality Robotic Arm Splatting via Bézier Curve Refinement	Hao Wang et.al.	2511.17961	null
2025-11-22	Novel View Synthesis from A Few Glimpses via Test-Time Natural Video Completion	Yan Xu et.al.	2511.17932	null
2025-11-22	Frequency-Adaptive Sharpness Regularization for Improving 3D Gaussian Splatting Generalization	Youngsik Yun et.al.	2511.17918	null
2025-11-22	CUS-GS: A Compact Unified Structured Gaussian Splatting Framework for Multimodal Scene Representation	Yuhang Ming et.al.	2511.17904	null
2025-11-22	ArticFlow: Generative Simulation of Articulated Mechanisms	Jiong Lin et.al.	2511.17883	null
2025-11-21	QAL: A Loss for Recall Precision Balance in 3D Reconstruction	Pranay Meshram et.al.	2511.17824	null
2025-11-21	REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion	Ryoma Yataka et.al.	2511.17806	null
2025-11-21	SPIDER: Spatial Image CorresponDence Estimator for Robust Calibration	Zhimin Shao et.al.	2511.17750	null
2025-11-21	AEGIS: Preserving privacy of 3D Facial Avatars with Adversarial Perturbations	Dawid Wolkiewicz et.al.	2511.17747	null
2025-11-21	VisReason: A Large-Scale Dataset for Visual Chain-of-Thought Reasoning	Lingxiao Li et.al.	2511.17731	null
2025-11-21	Native 3D Editing with Full Attention	Weiwei Cai et.al.	2511.17501	null
2025-11-21	HALO: High-Altitude Language-Conditioned Monocular Aerial Exploration and Navigation	Yuezhan Tao et.al.	2511.17497	null
2025-11-21	Radar2Shape: 3D Shape Reconstruction from High-Frequency Radar using Multiresolution Signed Distance Functions	Neel Sortur et.al.	2511.17484	null
2025-11-21	Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift	Björn Michele et.al.	2511.17455	null
2025-11-21	Illustrator’s Depth: Monocular Layer Index Prediction for Image Decomposition	Nissim Maruani et.al.	2511.17454	null
2025-11-24	Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers	Christopher Boland et.al.	2511.17421	null
2025-11-21	SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding	Nikolay Nikolov et.al.	2511.17411	null
2025-11-21	MorphSeek: Fine-grained Latent Representation-Level Policy Optimization for Deformable Image Registration	Runxun Zhang et.al.	2511.17392	null
2025-11-21	SVRecon: Sparse Voxel Rasterization for Surface Reconstruction	Seunghun Oh et.al.	2511.17364	null
2025-11-21	NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior	Dongbo Shi et.al.	2511.17322	null
2025-11-21	MuM: Multi-View Masked Image Modeling for 3D Vision	David Nordström et.al.	2511.17309	null
2025-11-21	MonoSpheres: Large-Scale Monocular SLAM-Based UAV Exploration through Perception-Coupled Mapping and Planning	Tomáš Musil et.al.	2511.17299	null
2025-11-21	Range-Edit: Semantic Mask Guided Outdoor LiDAR Scene Editing	Suchetan G. Uppur et.al.	2511.17269	null
2025-11-21	TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making	Shanshan Li et.al.	2511.17225	null
2025-11-21	QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy	Adam Lilja et.al.	2511.17221	null
2025-11-21	FisheyeGaussianLift: BEV Feature Lifting for Surround-View Fisheye Camera Perception	Shubham Sonarghare et.al.	2511.17210	null
2025-11-21	Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers	Cris Claessens et.al.	2511.17209	null
2025-11-21	SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors	Kunyi Li et.al.	2511.17207	null
2025-11-21	VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation	Hanyu Zhou et.al.	2511.17199	null
2025-11-21	PEGS: Physics-Event Enhanced Large Spatiotemporal Motion Reconstruction via 3D Gaussian Splatting	Yijun Xu et.al.	2511.17116	null
2025-11-21	SPAGS: Sparse-View Articulated Object Reconstruction from Single State via Planar Gaussian Splatting	Di Wu et.al.	2511.17092	null
2025-11-21	ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion	Junming Liu et.al.	2511.17068	null
2025-11-21	RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation	Wenzhuo Sun et.al.	2511.17048	null
2025-11-21	Parameter-Free Neural Lens Blur Rendering for High-Fidelity Composites	Lingyan Ruan et.al.	2511.17014	null
2025-11-21	Stable Offline Hand-Eye Calibration for any Robot with Just One Mark	Sicheng Xie et.al.	2511.17001	null
2025-11-21	DepthFocus: Controllable Depth Estimation for See-Through Scenes	Junhong Min et.al.	2511.16993	null
2025-11-21	PhysMorph-GS: Differentiable Shape Morphing via Joint Optimization of Physics and Rendering Objectives	Chang-Yong Song et.al.	2511.16988	null
2025-11-21	Gradient-Driven Natural Selection for Compact 3D Gaussian Splatting	Xiaobin Deng et.al.	2511.16980	null
2025-11-21	MobileOcc: A Human-Aware Semantic Occupancy Dataset for Mobile Robots	Junseo Kim et.al.	2511.16949	null
2025-11-20	BOP-ASK: Object-Interaction Reasoning for Vision-Language Models	Vineet Bhat et.al.	2511.16857	null
2025-11-20	Vorion: A RISC-V GPU with Hardware-Accelerated 3D Gaussian Rendering and Training	Yipeng Wang et.al.	2511.16831	null
2025-11-20	WorldGen: From Text to Traversable and Interactive 3D Worlds	Dilin Wang et.al.	2511.16825	null
2025-11-20	Mesh RAG: Retrieval Augmentation for Autoregressive Mesh Generation	Xiatao Sun et.al.	2511.16807	null
2025-11-20	SVG360: Multi-View SVG Generation with Geometric and Color Consistency from a Single SVG	Mengnan Jiang et.al.	2511.16766	null
2025-11-20	A Machine Learning-Driven Solution for Denoising Inertial Confinement Fusion Images	Asya Y. Akkus et.al.	2511.16717	null
2025-11-20	NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses	Jing Wen et.al.	2511.16673	null
2025-11-20	TriDiff-4D: Fast 4D Generation through Diffusion-based Triplane Re-posing	Eddie Pokming Sheung et.al.	2511.16662	null
2025-11-20	Dexterity from Smart Lenses: Multi-Fingered Robot Manipulation with In-the-Wild Human Demonstrations	Irmak Guzey et.al.	2511.16661	null
2025-11-20	PartUV: Part-Based UV Unwrapping of 3D Meshes	Zhaoning Wang et.al.	2511.16659	null
2025-11-20	Late-decoupled 3D Hierarchical Semantic Segmentation with Semantic Prototype Discrimination based Bi-branch Supervision	Shuyu Cao et.al.	2511.16650	null
2025-11-20	TRIM: Scalable 3D Gaussian Diffusion Inference with Temporal and Spatial Trimming	Zeyuan Yin et.al.	2511.16642	null
2025-11-20	SAM 3D: 3Dfy Anything in Images	SAM 3D Team et.al.	2511.16624	null
2025-11-21	POMA-3D: The Point Map Way to 3D Scene Understanding	Ye Mao et.al.	2511.16567	null
2025-11-20	EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering	Pierrick Bournez et.al.	2511.16542	null
2025-11-20	Enhancing Multi-Camera Gymnast Tracking Through Domain Knowledge Integration	Fan Yang et.al.	2511.16532	null
2025-11-20	LLaVA $^3$ : Representing 3D Scenes like a Cubist Painter to Boost 3D Scene Understanding of VLMs	Doriand Petit et.al.	2511.16454	null
2025-11-20	From Prompts to Printable Models: Support-Effective 3D Generation via Offset Direct Preference Optimization	Chenming Wu et.al.	2511.16434	null
2025-11-20	CylinderDepth: Cylindrical Spatial Attention for Multi-View Consistent Self-Supervised Surround Depth Estimation	Samer Abualhanud et.al.	2511.16428	null
2025-11-20	CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering	Joni Vanherck et.al.	2511.16349	null
2025-11-20	NaTex: Seamless Texture Generation as Latent Color Diffusion	Zeqiang Lai et.al.	2511.16317	null
2025-11-20	Optimizing 3D Gaussian Splattering for Mobile GPUs	Md Musfiqur Rahman Sanim et.al.	2511.16298	null
2025-11-20	Building temporally coherent 3D maps with VGGT for memory-efficient Semantic SLAM	Gergely Dinya et.al.	2511.16282	null
2025-11-20	Mem-MLP: Real-Time 3D Human Motion Generation from Sparse Inputs	Sinan Mutlu et.al.	2511.16264	null
2025-11-20	How Robot Dogs See the Unseeable	Oliver Bimber et.al.	2511.16262	null
2025-11-20	PrIntMesh: Precise Intersection Surfaces for 3D Organ Mesh Reconstruction	Deniz Sayin Mercadier et.al.	2511.16186	null
2025-11-20	Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation Diffusion	Lirui Zhang et.al.	2511.16161	null
2025-11-20	LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM	Sibaek Lee et.al.	2511.16144	null
2025-11-20	Real-Time 3D Object Detection with Inference-Aligned Learning	Chenyu Zhao et.al.	2511.16140	null
2025-11-20	Clustered Error Correction with Grouped 4D Gaussian Splatting	Taeho Kang et.al.	2511.16112	null
2025-11-20	Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2511.16091	null
2025-11-20	Semantic Glitch: Agency and Artistry in an Autonomous Pixel Cloud	Qing Zhang et.al.	2511.16048	null
2025-11-20	CuriGS: Curriculum-Guided Gaussian Splatting for Sparse View Synthesis	Zijian Wu et.al.	2511.16030	null
2025-11-21	Automated Interpretable 2D Video Extraction from 3D Echocardiography	Milos Vukadinovic et.al.	2511.15946	null
2025-11-20	RoMa v2: Harder Better Faster Denser Feature Matching	Johan Edstedt et.al.	2511.15706	null
2025-11-19	Hyperspectral Image Classification using Spectral-Spatial Mixer Network	Mohammed Q. Alkhatib et.al.	2511.15692	null
2025-11-19	FlashMesh: Faster and Better Autoregressive Mesh Synthesis via Structured Speculation	Tingrui Shen et.al.	2511.15618	null
2025-11-19	US-X Complete: A Multi-Modal Approach to Anatomical 3D Shape Recovery	Miruna-Alexandra Gafencu et.al.	2511.15600	null
2025-11-20	CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking	Sifan Zhou et.al.	2511.15580	null
2025-11-19	NTK-Guided Implicit Neural Teaching	Chen Zhang et.al.	2511.15487	null
2025-11-19	ShelfOcc: Native 3D Supervision beyond LiDAR for Vision-Based Occupancy Estimation	Simon Boeder et.al.	2511.15396	null
2025-11-20	Adapt-As-You-Walk Through the Clouds: Training-Free Online Test-Time Adaptation of 3D Vision-Language Foundation Models	Mehran Tamjidi et.al.	2511.15311	null
2025-11-19	Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language	Yan Xia et.al.	2511.15308	null
2025-11-19	Edge-Centric Relational Reasoning for 3D Scene Graph Prediction	Yanni Ma et.al.	2511.15288	null
2025-11-19	Graph Query Networks for Object Detection with Automotive Radar	Loveneet Saini et.al.	2511.15271	null
2025-11-19	Fluid Control with Localized Spacetime Windows	Yixin Chen et.al.	2511.15189	null
2025-11-20	BrainRotViT: Transformer-ResNet Hybrid for Explainable Modeling of Brain Aging from 3D sMRI	Wasif Jalal et.al.	2511.15188	null
2025-11-19	SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection	Chun-Jung Lin et.al.	2511.15153	null
2025-11-19	Gaussian Blending: Rethinking Alpha Blending in 3D Gaussian Splatting	Junseo Koo et.al.	2511.15102	null
2025-11-19	MambaTrack3D: A State Space Model Framework for LiDAR-Based Object Tracking under High Temporal Variation	Shengjing Tian et.al.	2511.15077	null
2025-11-18	RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems	Jaro Meyer et.al.	2511.14948	null
2025-11-18	CPSL: Representing Volumetric Video via Content-Promoted Scene Layers	Kaiyuan Hu et.al.	2511.14927	null
2025-11-18	X-WIN: Building Chest Radiograph World Model via Predictive Sensing	Zefan Yang et.al.	2511.14918	null
2025-11-18	InstructMix2Mix: Consistent Sparse-View Editing Through Multi-View Model Personalization	Daniel Gilo et.al.	2511.14899	null
2025-11-18	GeoSceneGraph: Geometric Scene Graph Diffusion Model for Text-guided 3D Indoor Scene Synthesis	Antonio Ruiz et.al.	2511.14884	null
2025-11-18	B-Rep Distance Functions (BR-DF): How to Represent a B-Rep Model by Volumetric Distance Functions?	Fuyang Zhang et.al.	2511.14870	null
2025-11-18	Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video	Yarin Bekor et.al.	2511.14848	null
2025-11-18	Co-Me: Confidence-Guided Token Merging for Visual Geometric Transformers	Yutian Chen et.al.	2511.14751	null
2025-11-18	A Neural Field-Based Approach for View Computation & Data Exploration in 3D Urban Environments	Stefan Cobeli et.al.	2511.14742	null
2025-11-18	FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation	Yunfeng Wu et.al.	2511.14712	null
2025-11-18	RepAir: A Framework for Airway Segmentation and Discontinuity Correction in CT	John M. Oyer et.al.	2511.14649	null
2025-11-18	SparseSurf: Sparse-View 3D Gaussian Splatting for Surface Reconstruction	Meiying Gu et.al.	2511.14633	null
2025-11-18	Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3D Constrained Terrains	Qingwei Ben et.al.	2511.14625	null
2025-11-18	3D-Guided Scalable Flow Matching for Generating Volumetric Tissue Spatial Transcriptomics from Serial Histology	Mohammad Vali Sanian et.al.	2511.14613	null
2025-11-18	MRI Embeddings Complement Clinical Predictors for Cognitive Decline Modeling in Alzheimer’s Disease Cohorts	Nathaniel Putera et.al.	2511.14601	null
2025-11-18	Interaction-Aware 4D Gaussian Splatting for Dynamic Hand-Object Interaction Reconstruction	Hao Tian et.al.	2511.14540	null
2025-11-18	Learning Compact Latent Space for Representing Neural Signed Distance Functions with High-fidelity Geometry Details	Qiang Bai et.al.	2511.14539	null
2025-11-18	DeCo-VAE: Learning Compact Latents for Video Reconstruction via Decoupled Representation	Xiangchen Yin et.al.	2511.14530	null
2025-11-18	BEDLAM2.0: Synthetic Humans and Cameras in Motion	Joachim Tesch et.al.	2511.14394	null
2025-11-19	Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving	Kangqiao Zhao et.al.	2511.14386	null
2025-11-18	A Quantitative Method for Shoulder Presentation Evaluation in Biometric Identity Documents	Alfonso Pedro Ridao et.al.	2511.14376	null
2025-11-18	IBGS: Image-Based Gaussian Splatting	Hoang Chuong Nguyen et.al.	2511.14357	null
2025-11-18	Silhouette-to-Contour Registration: Aligning Intraoral Scan Models with Cephalometric Radiographs	Yiyi Miao et.al.	2511.14343	null
2025-11-18	ArchMap: Arch-Flattening and Knowledge-Guided Vision Language Model for Tooth Counting and Structured Dental Understanding	Bohan Zhang et.al.	2511.14336	null
2025-11-18	Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors	Jeryes Danial et.al.	2511.14335	null
2025-11-18	Dental3R: Geometry-Aware Pairing for Intraoral 3D Reconstruction from Sparse-View Photographs	Yiyi Miao et.al.	2511.14315	null
2025-11-18	Iterative Diffusion-Refined Neural Attenuation Fields for Multi-Source Stationary CT Reconstruction: NAF Meets Diffusion Model	Jiancheng Fang et.al.	2511.14310	null
2025-11-18	GEN3D: Generating Domain-Free 3D Scenes from a Single Image	Yuxin Zhang et.al.	2511.14291	null
2025-11-18	NeuralBoneReg: A Novel Self-Supervised Method for Robust and Accurate Multi-Modal Bone Surface Registration	Luohong Wu et.al.	2511.14286	null
2025-11-18	NeuralSSD: A Neural Solver for Signed Distance Surface Reconstruction	Zi-Chen Xi et.al.	2511.14283	null
2025-11-18	Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation	Weimin Bai et.al.	2511.14271	null
2025-11-18	Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion Prediction	Juncheng Hu et.al.	2511.14237	null
2025-11-19	StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model	Yifan Yang et.al.	2511.14223	null
2025-11-18	Hierarchical Semantic Learning for Multi-Class Aorta Segmentation	Pengcheng Shi et.al.	2511.14187	null
2025-11-19	RoboTidy : A 3D Gaussian Splatting Household Tidying Benchmark for Embodied Navigation and Action	Xiaoquan Sun et.al.	2511.14161	null
2025-11-19	Wave-Former: Through-Occlusion 3D Reconstruction via Wireless Shape Completion	Laura Dodds et.al.	2511.14152	null
2025-11-18	iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion	Hao Wang et.al.	2511.14149	null
2025-11-18	Error-Driven Scene Editing for 3D Grounding in Large Language Models	Yue Zhang et.al.	2511.14086	null
2025-11-18	MRI Plane Orientation Detection using a Context-Aware 2.5D Model	SangHyuk Kim et.al.	2511.14021	null
2025-11-17	PoCGM: Poisson-Conditioned Generative Model for Sparse-View CT Reconstruction	Changsheng Fang et.al.	2511.13967	null
2025-11-17	GRLoc: Geometric Representation Regression for Visual Localization	Changyang Li et.al.	2511.13864	null
2025-11-17	KANGURA: Kolmogorov-Arnold Network-Based Geometry-Aware Learning with Unified Representation Attention for 3D Modeling of Complex Structures	Mohammad Reza Shafie et.al.	2511.13798	null
2025-11-17	Free-Form Scene Editor: Enabling Multi-Round Object Manipulation like in a 3D Engine	Xincheng Shuai et.al.	2511.13713	null
2025-11-17	Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting	Jiangnan Ye et.al.	2511.13684	null
2025-11-17	PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image	Ziang Cao et.al.	2511.13648	null
2025-11-17	Part-X-MLLM: Part-aware 3D Multimodal Large Language Model	Chunshi Wang et.al.	2511.13647	null
2025-11-17	AtlasMorph: Learning conditional deformable templates for brain MRI	Marianne Rakic et.al.	2511.13609	null
2025-11-17	Opt3DGS: Optimizing 3D Gaussian Splatting with Adaptive Exploration and Curvature-Aware Exploitation	Ziyang Huang et.al.	2511.13571	null
2025-11-17	TSE-Net: Semi-supervised Monocular Height Estimation from Single Remote Sensing Images	Sining Chen et.al.	2511.13552	null
2025-11-17	InterMoE: Individual-Specific 3D Human Interaction Generation via Dynamic Temporal-Selective MoE	Lipeng Wang et.al.	2511.13488	null
2025-11-17	Contact-Safe Reinforcement Learning with ProMP Reparameterization and Energy Awareness	Bingkun Huang et.al.	2511.13459	null
2025-11-17	FUSE: A Flow-based Mapping Between Shapes	Lorenzo Olearo et.al.	2511.13431	null
2025-11-17	EL3DD: Extended Latent 3D Diffusion for Language Conditioned Multitask Manipulation	Jonas Bode et.al.	2511.13312	null
2025-11-17	DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous Driving	Kaiwen Cai et.al.	2511.13309	null
2025-11-17	CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving	Enhui Ma et.al.	2511.13297	null
2025-11-17	SF-Recon: Simplification-Free Lightweight Building Reconstruction via 3D Gaussian Splatting	Zihan Li et.al.	2511.13278	null
2025-11-19	SymGS : Leveraging Local Symmetries for 3D Gaussian Splatting Compression	Keshav Gupta et.al.	2511.13264	null
2025-11-17	Force-Aware 3D Contact Modeling for Stable Grasp Generation	Zhuo Chen et.al.	2511.13247	null
2025-11-17	MRIQT: Physics-Aware Diffusion Model for Image Quality Transfer in Neonatal Ultra-Low-Field MRI	Malek Al Abed et.al.	2511.13232	null
2025-11-17	Hybrid-Domain Adaptative Representation Learning for Gaze Estimation	Qida Tan et.al.	2511.13222	null
2025-11-17	3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale	Yijia Fan et.al.	2511.13211	null
2025-11-17	Difficulty-Aware Label-Guided Denoising for Monocular 3D Object Detection	Soyul Lee et.al.	2511.13195	null
2025-11-17	Video Spatial Reasoning with Object-Centric 3D Rollout	Haoran Tang et.al.	2511.13190	null
2025-11-17	WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object Detection	Longhui Zheng et.al.	2511.13138	null
2025-11-17	CloseUpShot: Close-up Novel View Synthesis from Sparse-views via Point-conditioned Diffusion Model	Yuqi Zhang et.al.	2511.13121	null
2025-11-17	A Lightweight 3D Anomaly Detection Method with Rotationally Invariant Features	Hanzhe Liang et.al.	2511.13115	null
2025-11-17	FGNet: Leveraging Feature-Guided Attention to Refine SAM2 for 3D EM Neuron Segmentation	Zhenghua Li et.al.	2511.13063	null
2025-11-17	Monocular 3D Lane Detection via Structure Uncertainty-Aware Network with Curve-Point Queries	Ruixin Liu et.al.	2511.13055	null
2025-11-17	Uni-Inter: Unifying 3D Human Motion Synthesis Across Diverse Interaction Contexts	Sheng Liu et.al.	2511.13032	null
2025-11-18	Towards 3D Object-Centric Feature Learning for Semantic Scene Completion	Weihua Wang et.al.	2511.13031	null
2025-11-18	Geometry Meets Light: Leveraging Geometric Priors for Universal Photometric Stereo under Limited Multi-Illumination Cues	King-Man Tam et.al.	2511.13015	null
2025-11-17	Beyond Darkness: Thermal-Supervised 3D Gaussian Splatting for Low-Light Novel View Synthesis	Qingsen Ma et.al.	2511.13011	null
2025-11-17	TR-Gaussians: High-fidelity Real-time Rendering of Planar Transmission and Reflection with 3D Gaussian Splatting	Yong Liu et.al.	2511.13009	null
2025-11-17	Medal S: Spatio-Textual Prompt Model for Medical Segmentation	Pengcheng Shi et.al.	2511.13001	null
2025-11-18	ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes	Yixuan Yang et.al.	2511.12977	null
2025-11-17	SplatSearch: Instance Image Goal Navigation for Mobile Robots using 3D Gaussian Splatting and Diffusion Models	Siddarth Narasimhan et.al.	2511.12972	null
2025-11-19	HiFusion: Hierarchical Intra-Spot Alignment and Regional Context Fusion for Spatial Gene Expression Prediction from Histopathology	Ziqiao Weng et.al.	2511.12969	null
2025-11-17	Inertia-Informed Orientation Priors for Event-Based Optical Flow Estimation	Pritam P. Karmokar et.al.	2511.12961	null
2025-11-17	GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving	Chunyong Hu et.al.	2511.12941	null
2025-11-18	PFAvatar: Pose-Fusion 3D Personalized Avatar Reconstruction from Real-World Outfit-of-the-Day Photos	Dianbing Xi et.al.	2511.12935	null
2025-11-17	Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration	Changhun Oh et.al.	2511.12930	null
2025-11-17	CoordAR: One-Reference 6D Pose Estimation of Novel Objects via Autoregressive Coordinate Map Generation	Dexin Zuo et.al.	2511.12919	null
2025-11-17	CASL: Curvature-Augmented Self-supervised Learning for 3D Anomaly Detection	Yaohua Zha et.al.	2511.12909	null
2025-11-17	Functional Mean Flow in Hilbert Space	Zhiqi Li et.al.	2511.12898	null
2025-11-17	Reconstructing 3D Scenes in Native High Dynamic Range	Kaixuan Zhang et.al.	2511.12895	null
2025-11-17	Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos	Taiyi Su et.al.	2511.12882	null
2025-11-18	Uni-Hand: Universal Hand Motion Forecasting in Egocentric Views	Junyi Ma et.al.	2511.12878	null
2025-11-16	Deep Imbalanced Multi-Target Regression: 3D Point Cloud Voxel Content Estimation in Simulated Forests	Amirhossein Hassanzadeh et.al.	2511.12740	null
2025-11-16	DensePercept-NCSSD: Vision Mamba towards Real-time Dense Visual Perception with Non-Causal State Space Duality	Tushar Anand et.al.	2511.12671	null
2025-11-16	Hi-Reco: High-Fidelity Real-Time Conversational Digital Humans	Hongbin Huang et.al.	2511.12662	null
2025-11-16	EcoFlight: Finding Low-Energy Paths Through Obstacles for Autonomous Sensing Drones	Jordan Leyva et.al.	2511.12618	null
2025-11-16	OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding	Artem Moroz et.al.	2511.12614	null
2025-11-16	Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data	Yunxin Li et.al.	2511.12609	null
2025-11-16	Visible Structure Retrieval for Lightweight Image-Based Relocalisation	Fereidoon Zangeneh et.al.	2511.12503	null
2025-11-16	Towards Temporal Fusion Beyond the Field of View for Camera-based Semantic Scene Completion	Jongseong Bae et.al.	2511.12498	null
2025-11-16	ClutterNav: Gradient-Guided Search for Efficient 3D Clutter Removal with Learned Costmaps	Navin Sriram Ravie et.al.	2511.12479	null
2025-11-16	DenseAnnotate: Enabling Scalable Dense Caption Collection for Images and 3D Scenes via Spoken Descriptions	Xiaoyu Lin et.al.	2511.12452	null
2025-11-16	Towards Rotation-only Imaging Geometry: Rotation Estimation	Xinrui Li et.al.	2511.12415	null
2025-11-16	DEMIST: \underline{DE}coupled \underline{M}ulti-stream latent d\underline{I}ffusion for Quantitative Myelin Map \underline{S}yn\underline{T}hesis	Jiacheng Wang et.al.	2511.12396	null
2025-11-15	MTMed3D: A Multi-Task Transformer-Based Model for 3D Medical Imaging	Fan Li et.al.	2511.12373	null
2025-11-15	Changes in Real Time: Online Scene Change Detection with Multi-View Fusion	Chamuditha Jayanga Galappaththige et.al.	2511.12370	null
2025-11-15	Ground Plane Projection for Improved Traffic Analytics at Intersections	Sajjad Pakdamansavoji et.al.	2511.12342	null
2025-11-15	LiDAR-GS++:Improving LiDAR Gaussian Reconstruction via Diffusion Priors	Qifeng Chen et.al.	2511.12304	null
2025-11-15	One target to align them all: LiDAR, RGB and event cameras extrinsic calibration for Autonomous Driving	Andrea Bertogalli et.al.	2511.12291	null
2025-11-15	Deep Unfolded BM3D: Unrolling Non-local Collaborative Filtering into a Trainable Neural Network	Kerem Basim et.al.	2511.12248	null
2025-11-18	GeoMVD: Geometry-Enhanced Multi-View Generation Model Based on Geometric Information Extraction	Jiaqi Wu et.al.	2511.12204	null
2025-11-15	LSS3D: Learnable Spatial Shifting for Consistent and High-Quality 3D Generation from Single-Image	Zhuojiang Cai et.al.	2511.12202	null
2025-11-15	MMRINet: Efficient Mamba-Based Segmentation with Dual-Path Refinement for Low-Resource MRI Analysis	Abdelrahman Elsayed et.al.	2511.12193	null
2025-11-15	Rethinking Multimodal Point Cloud Completion: A Completion-by-Correction Perspective	Wang Luo et.al.	2511.12170	null
2025-11-15	Game-Theoretic Safe Multi-Agent Motion Planning with Reachability Analysis for Dynamic and Uncertain Environments (Extended Version)	Wenbin Mai et.al.	2511.12160	null
2025-11-15	RadarMP: Motion Perception for 4D mmWave Radar in Autonomous Driving	Ruiqi Cheng et.al.	2511.12117	null
2025-11-15	Point Cloud Quantization through Multimodal Prompting for 3D Understanding	Hongxuan Li et.al.	2511.12079	null
2025-11-15	SRSplat: Feed-Forward Super-Resolution Gaussian Splatting from Sparse Multi-View Images	Xinyuan Hu et.al.	2511.12040	null
2025-11-15	VPHO: Joint Visual-Physical Cue Learning and Aggregation for Hand-Object Pose Estimation	Jun Zhou et.al.	2511.12030	null
2025-11-14	Advancing Annotat3D with Harpia: A CUDA-Accelerated Library For Large-Scale Volumetric Data Segmentation	Camila Machado de Araujo et.al.	2511.11890	null
2025-11-14	Learning Conjugate Direction Fields for Planar Quadrilateral Mesh Generation	Jiong Tao et.al.	2511.11865	null
2025-11-14	MP-GFormer: A 3D-Geometry-Aware Dynamic Graph Transformer Approach for Machining Process Planning	Fatemeh Elhambakhsh et.al.	2511.11837	null
2025-11-14	Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy: A Review	Vinit Mehta et.al.	2511.11777	null
2025-11-14	LARM: A Large Articulated-Object Reconstruction Model	Sylvia Yuan et.al.	2511.11563	null
2025-11-14	Sat2RealCity: Geometry-Aware and Appearance-Controllable 3D Urban Generation from Satellite Imagery	Yijie Kang et.al.	2511.11470	null
2025-11-14	VoxTell: Free-Text Promptable Universal 3D Medical Image Segmentation	Maximilian Rokuss et.al.	2511.11450	null
2025-11-14	RadAround: A Field-Expedient Direction Finder for Contested IoT Sensing & EM Situational Awareness	Owen A. Maute et.al.	2511.11392	null
2025-11-14	Free3D: 3D Human Motion Emerges from Single-View 2D Supervision	Sheng Liu et.al.	2511.11368	null
2025-11-14	6D Strawberry Pose Estimation: Real-time and Edge AI Solutions Using Purely Synthetic Training Data	Saptarshi Neil Sinha et.al.	2511.11307	null
2025-11-14	RTGaze: Real-Time 3D-Aware Gaze Redirection from a Single Image	Hengfei Wang et.al.	2511.11289	null
2025-11-14	Beyond Flatlands: Unlocking Spatial Intelligence by Decoupling 3D Reasoning from Numerical Regression	Zhongbin Guo et.al.	2511.11239	null
2025-11-14	DoReMi: A Domain-Representation Mixture Framework for Generalizable 3D Understanding	Mingwei Xing et.al.	2511.11232	null
2025-11-14	3D Gaussian and Diffusion-Based Gaze Redirection	Abiram Panchalingam et.al.	2511.11231	null
2025-11-14	RealisticDreamer: Guidance Score Distillation for Few-shot Gaussian Splatting	Ruocheng Wu et.al.	2511.11213	null
2025-11-14	One-to-N Backdoor Attack in 3D Point Cloud via Spherical Trigger	Dongmei Shan et.al.	2511.11210	null
2025-11-14	Computationally-efficient deep learning models for nowcasting of precipitation: A solution for the Weather4cast 2025 challenge	Anushree Bhuskute et.al.	2511.11197	null
2025-11-14	CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios	Hangyu Li et.al.	2511.11168	null
2025-11-14	SplineSplat: 3D Ray Tracing for Higher-Quality Tomography	Youssef Haouchat et.al.	2511.11078	null
2025-11-14	Phys-Liquid: A Physics-Informed Dataset for Estimating 3D Geometry and Volume of Transparent Deformable Liquids	Ke Ma et.al.	2511.11077	null
2025-11-14	Evaluating Latent Generative Paradigms for High-Fidelity 3D Shape Completion from a Single Depth Image	Matthias Humt et.al.	2511.11074	null
2025-11-14	PINGS-X: Physics-Informed Normalized Gaussian Splatting with Axes Alignment for Efficient Super-Resolution of 4D Flow MRI	Sun Jo et.al.	2511.11048	null
2025-11-14	Hyperbolic Hierarchical Alignment Reasoning Network for Text-3D Retrieval	Wenrui Li et.al.	2511.11045	null
2025-11-14	EmbryoDiff: A Conditional Diffusion Framework with Multi-Focal Feature Fusion for Fine-Grained Embryo Developmental Stage Recognition	Yong Sun et.al.	2511.11027	null
2025-11-14	ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization	Anzhe Cheng et.al.	2511.10971	null
2025-11-14	Abstract 3D Perception for Spatial Intelligence in Vision-Language Models	Yifan Liu et.al.	2511.10946	null
2025-11-14	DINOv3 as a Frozen Encoder for CRPS-Oriented Probabilistic Rainfall Nowcasting	Luciano Araujo Dourado Filho et.al.	2511.10894	null
2025-11-13	Decentralized Swarm Control via SO(3) Embeddings for 3D Trajectories	Dimitria Silveria et.al.	2511.10858	null
2025-11-13	From 2D to 3D Without Extra Baggage: Data-Efficient Cancer Detection in Digital Breast Tomosynthesis	Yen Nhi Truong Vu et.al.	2511.10597	null
2025-11-14	OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer	Haosong Peng et.al.	2511.10560	null
2025-11-13	Learnable Total Variation with Lambda Mapping for Low-Dose CT Denoising	Yusuf Talha Basak et.al.	2511.10500	null
2025-11-13	3DFETUS: Standardizing Fetal Facial Planes in 3D Ultrasound	Alomar Antonia et.al.	2511.10412	null
2025-11-14	MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation	Xun Huang et.al.	2511.10376	null
2025-11-13	Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision	Yu Deng et.al.	2511.10316	null
2025-11-13	LiNeXt: Revisiting LiDAR Completion with Efficient Non-Diffusion Architectures	Wenzhe He et.al.	2511.10209	null
2025-11-13	Split-Layer: Enhancing Implicit Neural Representation by Maximizing the Dimensionality of Feature Space	Zhicheng Cai et.al.	2511.10142	null
2025-11-13	Multivariate Gaussian Representation Learning for Medical Action Evaluation	Luming Yang et.al.	2511.10060	null
2025-11-13	MuSc-V2: Zero-Shot Multimodal Industrial Anomaly Classification and Segmentation with Mutual Scoring of Unlabeled Samples	Xurui Li et.al.	2511.10047	null
2025-11-13	LoG3D: Ultra-High-Resolution 3D Shape Modeling via Local-to-Global Partitioning	Xinran Yang et.al.	2511.10040	null
2025-11-13	DGFusion: Dual-guided Fusion for Robust Multi-Modal 3D Object Detection	Feiyang Jia et.al.	2511.10035	null
2025-11-13	AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models	Xinyi Wang et.al.	2511.10017	null
2025-11-13	DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Instance Segmentation	Xuexun Liu et.al.	2511.10003	null
2025-11-13	MOBA: A Material-Oriented Backdoor Attack against LiDAR-based 3D Object Detection Systems	Saket S. Chaturvedi et.al.	2511.09999	null
2025-11-13	TSPE-GS: Probabilistic Depth Extraction for Semi-Transparent Surface Reconstruction via 3D Gaussian Splatting	Zhiyuan Xu et.al.	2511.09944	null
2025-11-13	HCC-3D: Hierarchical Compensatory Compression for 98% 3D Token Reduction in Vision-Language Models	Liheng Zhang et.al.	2511.09883	null
2025-11-13	RWKV-PCSSC: Exploring RWKV Model for Point Cloud Semantic Scene Completion	Wenzhe He et.al.	2511.09878	null
2025-11-13	AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting	Aymen Mir et.al.	2511.09827	null
2025-11-12	Lumos3D: A Single-Forward Framework for Low-Light 3D Scene Restoration	Hanzhou Liu et.al.	2511.09818	null
2025-11-12	STORM: Segment, Track, and Object Re-Localization from a Single 3D Model	Yu Deng et.al.	2511.09771	null
2025-11-12	PALMS+: Modular Image-Based Floor Plan Localization Leveraging Depth Foundation Model	Yunqian Cheng et.al.	2511.09724	null
2025-11-12	A Shared-Autonomy Construction Robotic System for Overhead Works	David Minkwan Kim et.al.	2511.09695	null
2025-11-12	ScaleADFG: Affordance-based Dexterous Functional Grasping via Scalable Dataset	Sizhe Wang et.al.	2511.09602	null
2025-11-11	VEDA: 3D Molecular Generation via Variance-Exploding Diffusion with Annealing	Peining Zhang et.al.	2511.09568	null
2025-11-12	IFG: Internet-Scale Guidance for Functional Grasping Generation	Ray Muxin Liu et.al.	2511.09558	null
2025-11-12	SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation	Hao Shi et.al.	2511.09555	null
2025-11-12	DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation	Jerrin Bright et.al.	2511.09502	null
2025-11-12	BronchOpt : Vision-Based Pose Optimization with Fine-Tuned Foundation Models for Accurate Bronchoscopy Navigation	Hongchao Shu et.al.	2511.09443	null
2025-11-12	OUGS: Active View Selection via Object-aware Uncertainty Estimation in 3DGS	Haiyi Li et.al.	2511.09397	null
2025-11-12	Augment to Augment: Diverse Augmentations Enable Competitive Ultra-Low-Field MRI Enhancement	Felix F Zimmermann et.al.	2511.09366	null
2025-11-11	Spatio-Temporal Context Learning with Temporal Difference Convolution for Moving Infrared Small Target Detection	Houzhang Fang et.al.	2511.09352	null
2025-11-14	FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection	Jiangyong Yu et.al.	2511.09347	null
2025-11-12	UMIGen: A Unified Framework for Egocentric Point Cloud Generation and Cross-Embodiment Robotic Imitation Learning	Yan Huang et.al.	2511.09302	null
2025-11-12	DensiCrafter: Physically-Constrained Generation and Fabrication of Self-Supporting Hollow Structures	Shengqi Dang et.al.	2511.09298	null
2025-11-12	SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields	Sangheon Yang et.al.	2511.09072	null
2025-11-13	PAN: A World Model for General, Interactable, and Long-Horizon World Simulation	PAN Team et.al.	2511.09057	null
2025-11-12	4KDehazeFlow: Ultra-High-Definition Image Dehazing via Flow Matching	Xingchi Chen et.al.	2511.09055	null
2025-11-12	Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation	Sicheng Yang et.al.	2511.08971	null
2025-11-12	OG-PCL: Efficient Sparse Point Cloud Processing for Human Activity Recognition	Jiuqi Yan et.al.	2511.08910	null
2025-11-12	SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation	Hu Cui et.al.	2511.08872	null
2025-11-11	Enhancing Rotation-Invariant 3D Learning with Global Pose Awareness and Attention Mechanisms	Jiaxun Guo et.al.	2511.08833	null
2025-11-11	DT-NVS: Diffusion Transformers for Novel View Synthesis	Wonbong Jang et.al.	2511.08823	null
2025-11-11	Low-cost Multi-agent Fleet for Acoustic Cooperative Localization Research	Nelson Durrant et.al.	2511.08822	null
2025-11-11	Adaptive graph Kolmogorov-Arnold network for 3D human pose estimation	Abu Taib Mohammed Shahjahan et.al.	2511.08809	null
2025-11-11	3D-TDA – Topological feature extraction from 3D images for Alzheimer’s disease classification	Faisal Ahmed et.al.	2511.08663	null
2025-11-10	Fluence Map Prediction with Deep Learning: A Transformer-based Approach	Ujunwa Mgboh et.al.	2511.08645	null
2025-11-11	RePose-NeRF: Robust Radiance Fields for Mesh Reconstruction under Noisy Camera Poses	Sriram Srinivasan et.al.	2511.08545	null
2025-11-11	3D4D: An Interactive, Editable, 4D World Model via 3D Video Generation	Yunhong He et.al.	2511.08536	null
2025-11-11	Large Sign Language Models: Toward 3D American Sign Language Translation	Sen Zhang et.al.	2511.08535	null
2025-11-11	Fast Multi-Organ Fine Segmentation in CT Images with Hierarchical Sparse Sampling and Residual Transformer	Xueqi Guo et.al.	2511.08509	null
2025-11-11	RAPTR: Radar-based 3D Pose Estimation using Transformer	Sorachi Kato et.al.	2511.08387	null
2025-11-11	SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering	Laura Bragagnolo et.al.	2511.08294	null
2025-11-11	Top2Ground: A Height-Aware Dual Conditioning Diffusion Model for Robust Aerial-to-Ground View Generation	Jae Joong Lee et.al.	2511.08258	null
2025-11-11	Hierarchical Direction Perception via Atomic Dot-Product Operators for Rotation-Invariant Point Clouds Learning	Chenyu Hu et.al.	2511.08240	null
2025-11-11	2D Representation for Unguided Single-View 3D Super-Resolution in Real-Time	Ignasi Mas et.al.	2511.08224	null
2025-11-11	Twist and Compute: The Cost of Pose in 3D Generative Diffusion	Kyle Fogarty et.al.	2511.08203	null
2025-11-11	WarpGAN: Warping-Guided 3D GAN Inversion with Style-Based Novel View Inpainting	Kaitao Huang et.al.	2511.08178	null
2025-11-11	Introducing Nylon Face Mask Attacks: A Dataset for Evaluating Generalised Face Presentation Attack Detection	Manasa et.al.	2511.08114	null
2025-11-11	WEDepth: Efficient Adaptation of World Knowledge for Monocular Depth Estimation	Gongshu Wang et.al.	2511.08036	null
2025-11-11	Perceptual Quality Assessment of 3D Gaussian Splatting: A Subjective Dataset and Prediction Metric	Zhaolin Wan et.al.	2511.08032	null
2025-11-14	Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous Driving	Jian Wang et.al.	2511.08015	null
2025-11-12	EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision	Yifei Cao et.al.	2511.08007	null
2025-11-11	DANCE: Density-agnostic and Class-aware Network for Point Cloud Completion	Da-Yeong Kim et.al.	2511.07978	null
2025-11-11	Multi-Modal Assistance for Unsupervised Domain Adaptation on Point Cloud 3D Object Detection	Shenao Zhao et.al.	2511.07966	null
2025-11-11	USV Obstacles Detection and Tracking in Marine Environments	Yara AlaaEldin et.al.	2511.07950	null
2025-11-11	Is It Truly Necessary to Process and Fit Minutes-Long Reference Videos for Personalized Talking Face Generation?	Rui-Qing Sun et.al.	2511.07940	null
2025-11-13	HD $^2$ -SSC: High-Dimension High-Density Semantic Scene Completion for Autonomous Driving	Zhiwen Yang et.al.	2511.07925	null
2025-11-11	MonoCLUE : Object-Aware Clustering Enhances Monocular 3D Object Detection	Sunghun Yang et.al.	2511.07862	null
2025-11-11	Human Motion Synthesis in 3D Scenes via Unified Scene Semantic Occupancy	Gong Jingyu et.al.	2511.07819	null
2025-11-11	Sparse3DPR: Training-Free 3D Hierarchical Scene Parsing and Task-Adaptive Subgraph Reasoning from Sparse RGB Views	Haida Feng et.al.	2511.07813	null
2025-11-11	RoboTAG: End-to-end Robot Configuration Estimation via Topological Alignment Graph	Yifan Liu et.al.	2511.07717	null
2025-11-10	TrackStudio: An Integrated Toolkit for Markerless Tracking	Hristo Dimitrov et.al.	2511.07624	null
2025-11-10	CAVER: Curious Audiovisual Exploring Robot	Luca Macesanu et.al.	2511.07619	null
2025-11-10	LiveNeRF: Efficient Face Replacement Through Neural Radiance Fields Integration	Tung Vu et.al.	2511.07552	null
2025-11-10	TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research	Han Zhang et.al.	2511.07412	null
2025-11-10	DIMO: Diverse 3D Motion Generation for Arbitrary Objects	Linzhan Mou et.al.	2511.07409	null
2025-11-10	SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards	Hunar Batra et.al.	2511.07403	null
2025-11-10	Real-Time LiDAR Super-Resolution via Frequency-Aware Multi-Scale Fusion	June Moh Goo et.al.	2511.07377	null
2025-11-10	YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting	Botao Ye et.al.	2511.07321	null
2025-11-10	Segmentation of Ischemic Stroke Lesions using Transfer Learning on Multi-sequence MRI	R. P. Chowdhury et.al.	2511.07281	null
2025-11-10	4DSTR: Advancing Generative 4D Gaussians with Spatial-Temporal Rectification for High-Quality and Consistent 4D Generation	Mengmeng Liu et.al.	2511.07241	null
2025-11-10	Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images	JiaKui Hu et.al.	2511.07222	null
2025-11-10	Geometric implicit neural representations for signed distance functions	Luiz Schirmer et.al.	2511.07206	null
2025-11-10	Federated Learning for Video Violence Detection: Complementary Roles of Lightweight CNNs and Vision-Language Models for Energy-Efficient Use	Sébastien Thuau et.al.	2511.07171	null
2025-11-10	ProcGen3D: Learning Neural Procedural Graph Representations for Image-to-3D Reconstruction	Xinyi Zhang et.al.	2511.07142	null
2025-11-10	Sparse4DGS: 4D Gaussian Splatting for Sparse-Frame Dynamic Scene Reconstruction	Changyue Shi et.al.	2511.07122	null
2025-11-10	HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving	Zhongyu Xia et.al.	2511.07106	null
2025-11-10	RaLD: Generating High-Resolution 3D Radar Point Clouds with Latent Diffusion	Ruijie Zhang et.al.	2511.07067	null
2025-11-10	3D-ANC: Adaptive Neural Collapse for Robust 3D Point Cloud Recognition	Yuanmin Huang et.al.	2511.07040	null
2025-11-10	Certified L2-Norm Robustness of 3D Point Cloud Recognition in the Frequency Domain	Liang Zhou et.al.	2511.07029	null
2025-11-10	TrueCity: Real and Simulated Urban Data for Cross-Domain 3D Scene Understanding	Duc Nguyen et.al.	2511.07007	null
2025-11-10	GFix: Perceptually Enhanced Gaussian Splatting Video Compression	Siyue Teng et.al.	2511.06953	null
2025-11-10	Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding	Yuzhen Li et.al.	2511.06908	null
2025-11-10	MUGSQA: Novel Multi-Uncertainty-Based Gaussian Splatting Quality Assessment Method, Dataset, and Benchmarks	Tianang Chen et.al.	2511.06830	null
2025-11-10	ConeGS: Error-Guided Densification Using Pixel Cones for Improved Reconstruction with Fewer Primitives	Bartłomiej Baranowski et.al.	2511.06810	null
2025-11-10	Robust and High-Fidelity 3D Gaussian Splatting: Fusing Pose Priors and Geometry Constraints for Texture-Deficient Outdoor Scenes	Meijun Guo et.al.	2511.06765	null
2025-11-10	Med-SORA: Symptom to Organ Reasoning in Abdomen CT Images	You-Kyoung Na et.al.	2511.06752	null
2025-11-10	PointCubeNet: 3D Part-level Reasoning with 3x3x3 Point Cloud Blocks	Da-Yeong Kim et.al.	2511.06744	null
2025-11-10	Rethinking Rainy 3D Scene Reconstruction via Perspective Transforming and Brightness Tuning	Qianfeng Yang et.al.	2511.06734	null
2025-11-07	How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need?	Tuan Anh Tran et.al.	2511.05449	null
2025-11-07	PALM: A Dataset and Baseline for Learning Multi-subject Hand Prior	Zicong Fan et.al.	2511.05403	null
2025-11-07	Dense Motion Captioning	Shiyao Xu et.al.	2511.05369	null
2025-11-07	Rethinking Metrics and Diffusion Architecture for 3D Point Cloud Generation	Matteo Bastico et.al.	2511.05308	null
2025-11-07	Automatic segmentation of colorectal liver metastases for ultrasound-based navigated resection	Tiziano Natali et.al.	2511.05253	null
2025-11-07	Accurate online action and gesture recognition system using detectors and Deep SPD Siamese Networks	Mohamed Sanim Akremi et.al.	2511.05250	null
2025-11-07	4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos	Mengqi Guo et.al.	2511.05229	null
2025-11-07	Efficient representation of 3D spatial data for defense-related applications	Benjamin Kahl et.al.	2511.05109	null
2025-11-07	No Pose Estimation? No Problem: Pose-Agnostic and Instance-Aware Test-Time Adaptation for Monocular Depth Estimation	Mingyu Sung et.al.	2511.05055	null
2025-11-07	Challenges in 3D Data Synthesis for Training Neural Networks on Topological Features	Dylan Peek et.al.	2511.04972	null
2025-11-07	Pattern-Aware Diffusion Synthesis of fMRI/dMRI with Tissue and Microstructural Refinement	Xiongri Shen et.al.	2511.04963	null
2025-11-07	CLM: Removing the GPU Memory Barrier for 3D Gaussian Splatting	Hexu Zhao et.al.	2511.04951	null
2025-11-06	3D Gaussian Point Encoders	Jim James et.al.	2511.04797	null
2025-11-06	Global 3D Reconstruction of Clouds & Tropical Cyclones	Shirin Ermis et.al.	2511.04773	null
2025-11-06	Cambrian-S: Towards Spatial Supersensing in Video	Shusheng Yang et.al.	2511.04670	null
2025-11-06	SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding	Ellis Brown et.al.	2511.04668	null
2025-11-06	Real-to-Sim Robot Policy Evaluation with Gaussian Splatting Simulation of Soft-Body Interactions	Kaifeng Zhang et.al.	2511.04665	null
2025-11-06	UniSplat: Unified Spatio-Temporal Fusion via 3D Latent Scaffolds for Dynamic Driving Scene Reconstruction	Chen Shi et.al.	2511.04595	null
2025-11-06	$μ$ NeuFMT: Optical-Property-Adaptive Fluorescence Molecular Tomography via Implicit Neural Representation	Shihan Zhao et.al.	2511.04510	null
2025-11-06	BoRe-Depth: Self-supervised Monocular Depth Estimation with Boundary Refinement for Embedded Systems	Chang Liu et.al.	2511.04388	null
2025-11-06	ForeRobo: Unlocking Infinite Simulation Data for 3D Goal-driven Robotic Manipulation	Dexin wang et.al.	2511.04381	null
2025-11-06	Evaluating the Impact of Weather-Induced Sensor Occlusion on BEVFusion for 3D Object Detection	Sanjay Kumar et.al.	2511.04347	null
2025-11-06	Submanifold Sparse Convolutional Networks for Automated 3D Segmentation of Kidneys and Kidney Tumours in Computed Tomography	Saúl Alonso-Monsalve et.al.	2511.04334	null
2025-11-06	FastGS: Training 3D Gaussian Splatting in 100 Seconds	Shiwei Ren et.al.	2511.04283	null
2025-11-06	GraspView: Active Perception Scoring and Best-View Optimization for Robotic Grasping in Cluttered Environments	Shenglin Wang et.al.	2511.04199	null
2025-11-06	When Swin Transformer Meets KANs: An Improved Transformer Architecture for Medical Image Segmentation	Nishchal Sapkota et.al.	2511.04084	null
2025-11-07	Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface	Yihao Luo et.al.	2511.04029	null
2025-11-06	CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation	Yuwen Tao et.al.	2511.03992	null
2025-11-06	Simple 3D Pose Features Support Human and Machine Social Scene Understanding	Wenshuo Qin et.al.	2511.03988	null
2025-11-06	Room Envelopes: A Synthetic Dataset for Indoor Layout Reconstruction from Images	Sam Bahrami et.al.	2511.03970	null
2025-11-06	A Linear Fractional Transformation Model and Calibration Method for Light Field Camera	Zhong Chen et.al.	2511.03962	null
2025-11-06	Improving Multi-View Reconstruction via Texture-Guided Gaussian-Mesh Joint Optimization	Zhejia Cai et.al.	2511.03950	null
2025-11-05	Shape Deformation Networks for Automated Aortic Valve Finite Element Meshing from 3D CT Images	Linchen Qian et.al.	2511.03890	null
2025-11-05	A Lightweight 3D-CNN for Event-Based Human Action Recognition with Privacy-Preserving Potential	Mehdi Sefidgar Dilmaghani et.al.	2511.03665	null
2025-11-05	Human Mesh Modeling for Anny Body	Romain Brégier et.al.	2511.03589	null
2025-11-05	OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera	Hao Shi et.al.	2511.03571	null
2025-11-05	Generalizing Shape-from-Template to Topological Changes	Kevin Manogue et.al.	2511.03459	null
2025-11-05	Robust Alignment of the Human Embryo in 3D Ultrasound using PCA and an Ensemble of Heuristic, Atlas-based and Learning-based Classifiers Evaluated on the Rotterdam Periconceptional Cohort	Nikolai Herrmann et.al.	2511.03416	null
2025-11-05	IEC3D-AD: A 3D Dataset of Industrial Equipment Components for Unsupervised Point Cloud Anomaly Detection	Bingyang Guo et.al.	2511.03267	null
2025-11-05	MvBody: Multi-View-Based Hybrid Transformer Using Optical 3D Body Scan for Explainable Cesarean Section Prediction	Ruting Cheng et.al.	2511.03212	null
2025-11-05	Accelerating Physical Property Reasoning for Augmented Visual Cognition	Hongbo Lan et.al.	2511.03126	null
2025-11-05	DentalSplat: Dental Occlusion Novel View Synthesis from Sparse Intra-Oral Photographs	Yiyi Miao et.al.	2511.03099	null
2025-11-04	3D Cal: An Open-Source Software Library for Calibrating Tactile Sensors	Rohan Kota et.al.	2511.03078	null
2025-11-04	From Propagation to Prediction: Point-level Uncertainty Evaluation of MLS Point Clouds under Limited Ground Truth	Ziyang Xu et.al.	2511.03053	null
2025-11-04	Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks	Dmitrii Pozdeev et.al.	2511.02830	null
2025-11-04	VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation	Kevin Qinghong Lin et.al.	2511.02778	null
2025-11-04	PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing	Antonio Oroz et.al.	2511.02777	null
2025-11-04	Non-Contact Manipulation of Induced Magnetic Dipoles	Seth Stewart et.al.	2511.02761	null
2025-11-04	LiteVoxel: Low-memory Intelligent Thresholding for Efficient Voxel Rasterization	Jee Won Lee et.al.	2511.02510	null
2025-11-04	A Novel Grouping-Based Hybrid Color Correction Algorithm for Color Point Clouds	Kuo-Liang Chung et.al.	2511.02397	null
2025-11-04	3D Point Cloud Object Detection on Edge Devices for Split Computing	Taisuke Noguchi et.al.	2511.02293	null
2025-11-04	Are Euler angles a useful rotation parameterisation for pose estimation with Normalizing Flows?	Giorgos Sfikas et.al.	2511.02277	null
2025-11-04	Can Foundation Models Revolutionize Mobile AR Sparse Sensing?	Yiqin Zhao et.al.	2511.02215	null
2025-11-05	Object-Centric 3D Gaussian Splatting for Strawberry Plant Reconstruction and Phenotyping	Jiajia Li et.al.	2511.02207	null
2025-11-06	Text to Robotic Assembly of Multi Component Objects using 3D Generative AI and Vision Language Models	Alexander Htet Kyaw et.al.	2511.02162	null
2025-11-04	From Instance Segmentation to 3D Growth Trajectory Reconstruction in Planktonic Foraminifera	Huahua Lin et.al.	2511.02142	null
2025-11-01	iFlyBot-VLA Technical Report	Yuan Zhang et.al.	2511.01914	null
2025-11-03	UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs	Zhe Liu et.al.	2511.01768	null
2025-11-03	Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image	Yuxiao Yang et.al.	2511.01767	null
2025-11-03	HGFreNet: Hop-hybrid GraphFomer for 3D Human Pose Estimation with Trajectory Consistency in Frequency Domain	Kai Zhai et.al.	2511.01756	null
2025-11-03	3EED: Ground Everything Everywhere in 3D	Rong Li et.al.	2511.01755	null
2025-11-03	Actial: Activate Spatial Reasoning Ability of Multimodal Large Language Models	Xiaoyu Zhan et.al.	2511.01618	null
2025-11-03	Benchmark-Ready 3D Anatomical Shape Classification	Tomáš Krsička et.al.	2511.01613	null
2025-11-03	Lite ENSAM: a lightweight cancer segmentation model for 3D Computed Tomography	Agnar Martin Bjørnstad et.al.	2511.01600	null
2025-11-03	Discriminately Treating Motion Components Evolves Joint Depth and Ego-Motion Learning	Mengtan Zhang et.al.	2511.01502	null
2025-11-03	HMVLM: Human Motion-Vision-Lanuage Model via MoE LoRA	Lei Hu et.al.	2511.01463	null
2025-11-03	FoldPath: End-to-End Object-Centric Motion Generation via Modulated Implicit Paths	Paolo Rabino et.al.	2511.01407	null
2025-11-03	Semantic BIM enrichment for firefighting assets: Fire-ART dataset and panoramic image-based 3D reconstruction	Ya Wen et.al.	2511.01399	null
2025-11-03	CaRLi-V: Camera-RADAR-LiDAR Point-Wise 3D Velocity Estimation	Landson Guo et.al.	2511.01383	null
2025-11-03	Model to Model: Understanding the Venus Flytrap Snapping Mechanism and Transferring it to a 3D-printed Bistable Soft Robotic Demonstrator	Maartje H. M. Wermelink et.al.	2511.01350	null
2025-11-03	MIQ-SAM3D: From Single-Point Prompt to Multi-Instance Segmentation via Competitive Query Refinement	Jierui Qu et.al.	2511.01345	null
2025-11-03	Gesture Generation (Still) Needs Improved Human Evaluation Practices: Insights from a Community-Driven State-of-the-Art Benchmark	Rajmund Nagy et.al.	2511.01233	null
2025-11-03	MoSa: Motion Generation with Scalable Autoregressive Modeling	Mengyuan Liu et.al.	2511.01200	null
2025-11-03	LiDAR-VGGT: Cross-Modal Coarse-to-Fine Fusion for Globally Consistent and Metric-Scale Dense Mapping	Lijie Wang et.al.	2511.01186	null
2025-11-03	Scaling Cross-Embodiment World Models for Dexterous Manipulation	Zihao He et.al.	2511.01177	null
2025-11-03	Web-Scale Collection of Video Data for 4D Animal Reconstruction	Brian Nlong Zhao et.al.	2511.01169	null
2025-11-02	GauDP: Reinventing Multi-Agent Collaboration through Gaussian-Image Synergy in Diffusion Policies	Ziye Wang et.al.	2511.00998	null
2025-11-02	Breaking the Latency Barrier: Synergistic Perception and Control for High-Frequency 3D Ultrasound Servoing	Yizhao Qian et.al.	2511.00983	null
2025-11-02	URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model	Zhe Li et.al.	2511.00940	null
2025-11-02	Fleming-VL: Towards Universal Medical Visual Reasoning with Multimodal LLMs	Yan Shu et.al.	2511.00916	null
2025-11-02	Class-agnostic 3D Segmentation by Granularity-Consistent Automatic 2D Mask Tracking	Juan Wang et.al.	2511.00785	null
2025-11-01	Metadata-Aligned 3D MRI Representations for Contrast Understanding and Quality Control	Mehmet Yigit Avci et.al.	2511.00681	null
2025-11-01	Been There, Scanned That: Nostalgia-Driven LiDAR Compression for Self-Driving Cars	Ali Khalid et.al.	2511.00652	null
2025-11-01	Multi-Mapcher: Loop Closure Detection-Free Heterogeneous LiDAR Multi-Session SLAM Leveraging Outlier-Robust Registration for Autonomous Vehicles	Hyungtae Lim et.al.	2511.00635	null
2025-11-01	4D Neural Voxel Splatting: Dynamic Scene Rendering with Voxelized Guassian Splatting	Chun-Tin Wu et.al.	2511.00560	null
2025-11-01	Image-based ground distance detection for crop-residue-covered soil	Baochao Wang et.al.	2511.00548	null
2025-11-01	Three-dimensional narrow volume reconstruction method with unconditional stability based on a phase-field Lagrange multiplier approach	Renjun Gao et.al.	2511.00508	null
2025-11-01	Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models	Panwang Pan et.al.	2511.00503	null
2025-11-01	Design and Development of a Modular Bucket Drum Excavator for Lunar ISRU	Simon Giel et.al.	2511.00492	null
2025-11-01	HumanCrafter: Synergizing Generalizable Human Reconstruction and Semantic 3D Segmentation	Panwang Pan et.al.	2511.00468	null
2025-11-01	Towards Reliable Pediatric Brain Tumor Segmentation: Task-Specific nnU-Net Enhancements	Xiaolong Li et.al.	2511.00449	null
2025-11-01	SonarSweep: Fusing Sonar and Vision for Robust 3D Reconstruction via Plane Sweeping	Lingpeng Chen et.al.	2511.00392	null
2025-11-01	Oitijjo-3D: Generative AI Framework for Rapid 3D Heritage Reconstruction from Street View Imagery	Momen Khandoker Ope et.al.	2511.00362	null
2025-10-31	MambaNetLK: Enhancing Colonoscopy Point Cloud Registration with Mamba	Linzhe Jiang et.al.	2511.00260	null
2025-10-31	Object-Aware 4D Human Motion Generation	Shurui Gui et.al.	2511.00248	null
2025-10-31	VLM6D: VLM based 6Dof Pose Estimation based on RGB-D Images	Md Selim Sarowar et.al.	2511.00120	null
2025-10-31	PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting	Danyal Maqbool et.al.	2510.27680	null
2025-10-31	Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning	Yuhong Liu et.al.	2510.27606	null
2025-10-31	Deep Neural Watermarking for Robust Copyright Protection in 3D Point Clouds	Khandoker Ashik Uz Zaman et.al.	2510.27533	null
2025-10-31	SAGS: Self-Adaptive Alias-Free Gaussian Splatting for Dynamic Surgical Endoscopic Reconstruction	Wenfeng Huang et.al.	2510.27318	null
2025-10-31	MoRE: 3D Visual Geometry Reconstruction Meets Mixture-of-Experts	Jingnan Gao et.al.	2510.27234	null
2025-10-31	Mask-to-Height: A YOLOv11-Based Architecture for Joint Building Instance Segmentation and Height Classification from Satellite Imagery	Mahmoud El Hussieni et.al.	2510.27224	null
2025-10-31	M^3Detection: Multi-Frame Multi-Level Feature Fusion for Multi-Modal 3D Object Detection with Camera and 4D Imaging Radar	Xiaozhi Li et.al.	2510.27166	null
2025-10-31	HiGS: Hierarchical Generative Scene Framework for Multi-Step Associative Semantic Spatial Composition	Jiacheng Hong et.al.	2510.27148	null
2025-10-31	WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond	Zhicong Sun et.al.	2510.27133	null
2025-10-31	Hierarchical Transformers for Unsupervised 3D Shape Abstraction	Aditya Vora et.al.	2510.27088	null
2025-10-30	A Multi-Modal Neuro-Symbolic Approach for Spatial Reasoning-Based Visual Grounding in Robotics	Simindokht Jahangard et.al.	2510.27033	null
2025-10-30	DC4GS: Directional Consistency-Driven Adaptive Density Control for 3D Gaussian Splatting	Moonsoo Jeong et.al.	2510.26921	null
2025-10-30	PF-DAformer: Proximal Femur Segmentation via Domain Adaptive Transformer for Dual-Center QCT	Rochak Dhakal et.al.	2510.26903	null
2025-10-30	OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes	Yukun Huang et.al.	2510.26800	null
2025-10-30	SEE4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting	Dongyue Lu et.al.	2510.26796	null
2025-10-30	The Quest for Generalizable Motion Generation: Data, Model, and Evaluation	Jing Lin et.al.	2510.26794	null
2025-10-30	HEIR: Learning Graph-Based Motion Hierarchies	Cheng Zheng et.al.	2510.26786	null
2025-10-30	Clone Deterministic 3D Worlds with Geometrically-Regularized World Models	Zaishuo Xia et.al.	2510.26782	null
2025-10-30	The Impact and Outlook of 3D Gaussian Splatting	Bernhard Kerbl et.al.	2510.26694	null
2025-10-30	All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles	Sayed Pedram Haeri Boroujeni et.al.	2510.26641	null
2025-10-30	PointSt3R: Point Tracking through 3D Grounded Correspondence	Rhodri Guerrier et.al.	2510.26443	null
2025-10-30	AgriGS-SLAM: Orchard Mapping Across Seasons via Multi-View Gaussian Splatting SLAM	Mirko Usuelli et.al.	2510.26358	null
2025-10-30	Sketch2PoseNet: Efficient and Generalized Sketch to 3D Human Pose Prediction	Li Wang et.al.	2510.26196	null
2025-10-30	Self-localization on a 3D map by fusing global and local features from a monocular camera	Satoshi Kikuch et.al.	2510.26170	null
2025-10-30	FullPart: Generating each 3D Part at Full Resolution	Lihe Ding et.al.	2510.26140	null
2025-10-30	Kinodynamic Task and Motion Planning using VLM-guided and Interleaved Sampling	Minseo Kwon et.al.	2510.26139	null
2025-10-30	JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting	Yuxuan Li et.al.	2510.26117	null
2025-10-29	BikeScenes: Online LiDAR Semantic Segmentation for Bicycles	Denniz Goren et.al.	2510.25901	null
2025-10-29	STITCH 2.0: Extending Augmented Suturing with EKF Needle Estimation and Thread Management	Kush Hari et.al.	2510.25768	null
2025-11-03	FreeArt3D: Training-Free Articulated Object Generation using 3D Diffusion	Chuhao Chen et.al.	2510.25765	null
2025-11-02	Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks	Xu Zheng et.al.	2510.25760	null
2025-10-29	Modeling Collapse of Steered Vine Robots Under Their Own Weight	Ciera McFarland et.al.	2510.25727	null
2025-10-29	3D CT-Based Coronary Calcium Assessment: A Feature-Driven Machine Learning Framework	Ayman Abaid et.al.	2510.25347	null
2025-10-29	Informative Sample Selection Model for Skeleton-based Action Recognition with Limited Training Samples	Zhigang Tu et.al.	2510.25345	null
2025-10-29	4-Doodle: Text to 3D Sketches that Move!	Hao Chen et.al.	2510.25319	null
2025-10-29	Learning Disentangled Speech- and Expression-Driven Blendshapes for 3D Talking Face Animation	Yuxiang Mao et.al.	2510.25234	null
2025-10-29	U-CAN: Unsupervised Point Cloud Denoising with Consistency-Aware Noise2Noise Matching	Junsheng Zhou et.al.	2510.25210	null
2025-10-29	SoraNav: Adaptive UAV Task-Centric Navigation via Zeroshot VLM Reasoning	Hongyu Song et.al.	2510.25191	null
2025-10-29	EA3D: Online Open-World 3D Object Extraction from Streaming Videos	Xiaoyu Zhou et.al.	2510.25146	null
2025-10-29	AtlasGS: Atlanta-world Guided Surface Reconstruction with Implicit Structured Gaussians	Xiyu Zhang et.al.	2510.25129	null
2025-10-29	Auto3DSeg for Brain Tumor Segmentation from 3D MRI in BraTS 2023 Challenge	Andriy Myronenko et.al.	2510.25058	null
2025-10-28	Understanding Multi-View Transformers	Michal Stary et.al.	2510.24907	null
2025-10-28	VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos	Qiucheng Wu et.al.	2510.24904	null
2025-10-24	Point-level Uncertainty Evaluation of Mobile Laser Scanning Point Clouds	Ziyang Xu et.al.	2510.24773	null
2025-10-28	MIC-BEV: Multi-Infrastructure Camera Bird’s-Eye-View Transformer with Relation-Aware Fusion for 3D Object Detection	Yun Zhang et.al.	2510.24688	null
2025-11-03	Kineo: Calibration-Free Metric Motion Capture From Sparse RGB Cameras	Charles Javerliat et.al.	2510.24464	null
2025-10-28	Flatness-based trajectory planning for 3D overhead cranes with friction compensation and collision avoidance	Jorge Vicente-Martinez et.al.	2510.24457	null
2025-10-28	Adaptive Knowledge Transferring with Switching Dual-Student Framework for Semi-Supervised Medical Image Segmentation	Thanh-Huy Nguyen et.al.	2510.24366	null
2025-10-28	NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation	Mingyu Jeong et.al.	2510.24335	null
2025-10-28	Sound Source Localization for Spatial Mapping of Surgical Actions in Dynamic Scenes	Jonas Hein et.al.	2510.24332	null
2025-10-28	DynaRend: Learning 3D Dynamics via Masked Future Rendering for Robotic Manipulation	Jingyi Tian et.al.	2510.24261	null
2025-10-28	LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation	Haotian Zhou et.al.	2510.24118	null
2025-10-28	DogMo: A Large-Scale Multi-View RGB-D Dataset for 4D Canine Motion Recovery	Zan Wang et.al.	2510.24117	null
2025-10-28	ZTRS: Zero-Imitation End-to-end Autonomous Driving with Trajectory Scoring	Zhenxin Li et.al.	2510.24108	null
2025-10-28	Improved Accuracy of Robot Localization Using 3-D LiDAR in a Hippocampus-Inspired Model	Andrew Gerstenslager et.al.	2510.24029	null
2025-10-28	Towards the Automatic Segmentation, Modeling and Meshing of the Aortic Vessel Tree from Multicenter Acquisitions: An Overview of the SEG.A. 2023 Segmentation of the Aorta Challenge	Yuan Jin et.al.	2510.24009	null
2025-10-28	A Survey on Collaborative SLAM with 3D Gaussian Splatting	Phuc Nguyen Xuan et.al.	2510.23988	null
2025-10-27	PlanarGS: High-Fidelity Indoor 3D Gaussian Splatting Guided by Vision-Language Planar Priors	Xirui Jin et.al.	2510.23930	null
2025-10-27	TurboPortrait3D: Single-step diffusion-based fast portrait novel-view synthesis	Emily Kim et.al.	2510.23929	null
2025-10-27	Adaptive Keyframe Selection for Scalable 3D Scene Reconstruction in Dynamic Environments	Raman Jha et.al.	2510.23928	null
2025-10-27	TRELLISWorld: Training-Free World Generation from Object Generators	Hanke Chen et.al.	2510.23880	null
2025-10-27	Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations	Yujia Zhang et.al.	2510.23607	null
2025-10-27	Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling	Shuhong Zheng et.al.	2510.23605	null
2025-10-27	InFlux: A Benchmark for Self-Calibration of Dynamic Intrinsics of Video Cameras	Erich Liang et.al.	2510.23589	null
2025-10-27	RobotArena $\infty$ : Scalable Robot Benchmarking via Real-to-Sim Translation	Yash Jangir et.al.	2510.23571	null
2025-10-27	DPGLA: Bridging the Gap between Synthetic and Real Data for Unsupervised Domain Adaptation in 3D LiDAR Semantic Segmentation	Wanmeng Li et.al.	2510.23525	null
2025-10-27	Explicit Memory through Online 3D Gaussian Splatting Improves Class-Agnostic Video Segmentation	Anthony Opipari et.al.	2510.23521	null
2025-10-27	UrbanIng-V2X: A Large-Scale Multi-Vehicle, Multi-Infrastructure Dataset Across Multiple Intersections for Cooperative Perception	Karthikeyan Chandra Sekaran et.al.	2510.23478	null
2025-10-27	Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences	Zhuoran Jin et.al.	2510.23451	null
2025-10-27	MiCADangelo: Fine-Grained Reconstruction of Constrained CAD Models from 3D Scans	Ahmet Serdar Karadeniz et.al.	2510.23429	null
2025-10-27	Quality-controlled registration of urban MLS point clouds reducing drift effects by adaptive fragmentation	Marco Antonio Ortiz Rincon et.al.	2510.23416	null
2025-10-27	Towards Generalisable Foundation Models for 3D Brain MRI	Moona Mazher et.al.	2510.23415	null
2025-10-27	Transferable Deep Reinforcement Learning for Cross-Domain Navigation: from Farmland to the Moon	Shreya Santra et.al.	2510.23329	null
2025-10-27	Multitask Multimodal Self-Supervised Learning for Medical Images	Cristian Simionescu et.al.	2510.23325	null
2025-10-27	ReconViaGen: Towards Accurate Multi-view 3D Object Reconstruction via Generation	Jiahao Chang et.al.	2510.23306	null
2025-10-27	Progressive Growing of Patch Size: Curriculum Learning for Accelerated and Improved Medical Image Segmentation	Stefan M. Fischer et.al.	2510.23241	null
2025-10-27	VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting	Hoonhee Cho et.al.	2510.23205	null
2025-10-27	DecoDINO: 3D Human-Scene Contact Prediction with Semantic Classification	Lukas Bierling et.al.	2510.23203	null
2025-10-27	Evaluation of Vision-LLMs in Surveillance Video	Pascal Benschop et.al.	2510.23190	null
2025-10-27	Finding 3D Scene Analogies with Multimodal Foundation Models	Junho Kim et.al.	2510.23184	null
2025-10-27	AG-Fusion: adaptive gated multimodal fusion for 3d object detection in complex scenes	Sixian Liu et.al.	2510.23151	null
2025-10-27	DQ3D: Depth-guided Query for Transformer-Based 3D Object Detection in Traffic Scenarios	Ziyu Wang et.al.	2510.23144	null
2025-10-27	EndoWave: Rational-Wavelet 4D Gaussian Splatting for Endoscopic Reconstruction	Taoyu Wu et.al.	2510.23087	null
2025-10-27	USF-MAE: Ultrasound Self-Supervised Foundation Model with Masked Autoencoding	Youssef Megahed et.al.	2510.22990	null
2025-10-27	Exploring Semantic-constrained Adversarial Example with Instruction Uncertainty Reduction	Jin Hu et.al.	2510.22981	null
2025-10-27	VoMP: Predicting Volumetric Mechanical Property Fields	Rishit Dagli et.al.	2510.22975	null
2025-10-27	End-to-End Design and Validation of a Low-Cost Stewart Platform with Nonlinear Estimation and Control	Benedictus C. G. Cinun et.al.	2510.22949	null
2025-10-27	Positional Preservation Embedding for Multimodal Large Language Models	Mouxiao Huang et.al.	2510.22936	null
2025-10-27	Gen-LangSplat: Generalized Language Gaussian Splatting with Pre-Trained Feature Compression	Pranav Saxena et.al.	2510.22930	null
2025-10-26	SCAL for Pinch-Lifting: Complementary Rotational and Linear Prototypes for Environment-Adaptive Grasping	Wentao Guo et.al.	2510.22738	null
2025-10-31	IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction	Hao Li et.al.	2510.22706	null
2025-10-26	RL-AVIST: Reinforcement Learning for Autonomous Visual Inspection of Space Targets	Matteo El-Hariry et.al.	2510.22699	null
2025-10-28	Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views	Anna Deichler et.al.	2510.22672	null
2025-10-26	LVD-GS: Gaussian Splatting SLAM for Dynamic Scenes via Hierarchical Explicit-Implicit Representation Collaboration Rendering	Wenkai Zhu et.al.	2510.22669	null
2025-10-26	RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience	Huilin Yin et.al.	2510.22600	null
2025-10-26	From Pixels to Views: Learning Angular-Aware and Physics-Consistent Representations for Light Field Microscopy	Feng He et.al.	2510.22577	null
2025-10-26	GateFuseNet: An Adaptive 3D Multimodal Neuroimaging Fusion Network for Parkinson’s Disease Diagnosis	Rui Jin et.al.	2510.22507	null
2025-10-26	LAMP: Data-Efficient Linear Affine Weight-Space Models for Parameter-Controlled 3D Shape Generation and Extrapolation	Ghadi Nehme et.al.	2510.22491	null
2025-10-26	DynaPose4D: High-Quality 4D Dynamic Content Generation via Pose Alignment Loss	Jing Yang et.al.	2510.22473	null
2025-10-25	SemiETPicker: Fast and Label-Efficient Particle Picking for CryoET Tomography Using Semi-Supervised Learning	Linhan Wang et.al.	2510.22454	null
2025-10-25	3D Roadway Scene Object Detection with LIDARs in Snowfall Conditions	Ghazal Farhani et.al.	2510.22436	null
2025-10-25	EndoSfM3D: Learning to 3D Reconstruct Any Endoscopic Surgery Scene using Self-supervised Foundation Model	Changhao Zhang et.al.	2510.22359	null
2025-10-25	Estimating Continuum Robot Shape under External Loading using Spatiotemporal Neural Networks	Enyi Wang et.al.	2510.22339	null
2025-10-25	GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation	Phillip Mueller et.al.	2510.22337	null
2025-10-25	DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum	Yaokun Li et.al.	2510.22213	null
2025-10-25	MOGRAS: Human Motion with Grasping in 3D Scenes	Kunal Bhosikar et.al.	2510.22199	null
2025-10-25	I2-NeRF: Learning Neural Radiance Fields Under Physically-Grounded Media Interactions	Shuhong Liu et.al.	2510.22161	null
2025-10-25	LOC: A General Language-Guided Framework for Open-Set 3D Occupancy Prediction	Yuhang Gao et.al.	2510.22141	null
2025-10-25	STG-Avatar: Animatable Human Avatars via Spacetime Gaussian	Guangan Jiang et.al.	2510.22140	null
2025-10-28	GRAID: Enhancing Spatial Reasoning of VLMs Through High-Fidelity Data Generation	Karim Elmaaroufi et.al.	2510.22118	null
2025-10-24	Scanner-Agnostic MRI Harmonization via SSIM-Guided Disentanglement	Luca Caldera et.al.	2510.22073	null
2025-10-23	LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representation	Xin Lu et.al.	2510.21864	null
2025-10-22	A Literature Review On Stewart-Gough Platform Calibrations A Literature Review On Stewart-Gough Platform Calibrations	Sourabh Karmakar et.al.	2510.21854	null
2025-10-24	WorldGrow: Generating Infinite 3D World	Sikuang Li et.al.	2510.21682	null
2025-10-24	Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging	Ying Xue et.al.	2510.21654	null
2025-10-24	DAP-MAE: Domain-Adaptive Point Cloud Masked Autoencoder for Effective Cross-Domain Learning	Ziqi Gao et.al.	2510.21635	null
2025-10-24	Epipolar Geometry Improves Video Generation Models	Orest Kupyn et.al.	2510.21615	null
2025-10-24	Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos	Qixiu Li et.al.	2510.21571	null
2025-10-24	OpenHype: Hyperbolic Embeddings for Hierarchical Open-Vocabulary Radiance Fields	Lisa Weijler et.al.	2510.21441	null
2025-10-24	ArtiLatent: Realistic Articulated 3D Object Generation via Structured Latents	Honghua Chen et.al.	2510.21432	null
2025-10-24	Remote Autonomy for Multiple Small Lowcost UAVs in GNSS-denied Search and Rescue Operations	Daniel Schleich et.al.	2510.21357	null
2025-10-24	Morphologically Intelligent Perturbation Prediction with FORM	Reed Naidoo et.al.	2510.21337	null
2025-10-24	Towards Physically Executable 3D Gaussian for Embodied Navigation	Bingchen Miao et.al.	2510.21307	null
2025-10-27	Topology Sculptor, Shape Refiner: Discrete Diffusion Model for High-Fidelity 3D Meshes Generation	Kaiyu Song et.al.	2510.21264	null
2025-10-24	Urban 3D Change Detection Using LiDAR Sensor for HD Map Maintenance and Smart Mobility	Hezam Albagami et.al.	2510.21112	null
2025-10-24	ZING-3D: Zero-shot Incremental 3D Scene Graphs via Vision-Language Models	Pranav Saxena et.al.	2510.21069	null
2025-10-23	HRT1: One-Shot Human-to-Robot Trajectory Transfer for Mobile Manipulation	Sai Haneesh Allu et.al.	2510.21026	null
2025-10-23	Thermal Polarimetric Multi-view Stereo	Takahiro Kushida et.al.	2510.20972	null
2025-10-23	3DReasonKnee: Advancing Grounded Reasoning in Medical Vision Language Models	Sraavya Sambara et.al.	2510.20967	null
2025-10-26	Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge	Nimrod Berman et.al.	2510.20819	null
2025-10-23	GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation	Guangqi Jiang et.al.	2510.20813	null
2025-10-23	Radar-Camera Fused Multi-Object Tracking: Online Calibration and Common Feature	Lei Cheng et.al.	2510.20794	null
2025-10-23	CUPID: Pose-Grounded Generative 3D Reconstruction from a Single Image	Binbin Huang et.al.	2510.20776	null
2025-10-23	ALICE-LRI: A General Method for Lossless Range Image Generation for Spinning LiDAR Sensors without Calibration Metadata	Samuel Soutullo et.al.	2510.20708	null
2025-10-23	Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging	Ibrahim Ethem Hamamci et.al.	2510.20639	null
2025-10-23	OnlineSplatter: Pose-Free Online 3D Reconstruction for Free-Moving Objects	Mark He Huang et.al.	2510.20605	null
2025-10-23	From Far and Near: Perceptual Evaluation of Crowd Representations Across Levels of Detail	Xiaohan Sun et.al.	2510.20558	null
2025-10-23	Blur2seq: Blind Deblurring and Camera Trajectory Estimation from a Single Camera Motion-blurred Image	Guillermo Carbajal et.al.	2510.20539	null
2025-10-23	Degradation-Aware Cooperative Multi-Modal GNSS-Denied Localization Leveraging LiDAR-Based Robot Detections	Václav Pritzl et.al.	2510.20480	null
2025-10-23	PointMapPolicy: Structured Point Cloud Processing for Multi-Modal Imitation Learning	Xiaogang Jia et.al.	2510.20406	null
2025-10-23	Positional Encoding Field	Yunpeng Bai et.al.	2510.20385	null
2025-10-23	Dino-Diffusion Modular Designs Bridge the Cross-Domain Gap in Autonomous Parking	Zixuan Wu et.al.	2510.20335	null
2025-10-23	COS3D: Collaborative Open-Vocabulary 3D Segmentation	Runsong Zhu et.al.	2510.20238	null
2025-10-23	A Structured Review and Quantitative Profiling of Public Brain MRI Datasets for Foundation Model Development	Minh Sao Khue Luu et.al.	2510.20196	null
2025-10-23	IB-GAN: Disentangled Representation Learning with Information Bottleneck Generative Adversarial Networks	Insu Jeon et.al.	2510.20165	null
2025-10-23	PathFormer: A Transformer with 3D Grid Constraints for Digital Twin Robot-Arm Trajectory Generation	Ahmed Alanazi et.al.	2510.20161	null
2025-10-23	Monocular Visual 8D Pose Estimation for Articulated Bicycles and Cyclists	Eduardo R. Corral-Soto et.al.	2510.20158	null
2025-10-23	PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding	Penghao Wang et.al.	2510.20155	null
2025-10-23	Inverse Image-Based Rendering for Light Field Generation from Single Images	Hyunjun Jung et.al.	2510.20132	null
2025-10-23	Physics-Guided Fusion for Robust 3D Tracking of Fast Moving Small Objects	Prithvi Raj Singh et.al.	2510.20126	null
2025-10-22	Design of a Bed Rotation Mechanism to Facilitate In-Situ Photogrammetric Reconstruction of Printed Parts	Travis A. Roberts et.al.	2510.20079	null
2025-10-22	Calibration of Parallel Kinematic Machine Based on Stewart Platform-A Literature Review	Sourabh Karmakar et.al.	2510.20070	null
2025-10-22	Extreme Views: 3DGS Filter for Novel View Synthesis from Out-of-Distribution Camera Poses	Damian Bowness et.al.	2510.20027	null
2025-10-22	FutrTrack: A Camera-LiDAR Fusion Transformer for 3D Multiple Object Tracking	Martha Teiko Teye et.al.	2510.19981	null
2025-10-22	Transformed Multi-view 3D Shape Features with Contrastive Learning	Márcus Vinícius Lobo Costa et.al.	2510.19955	null
2025-10-22	Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets	Jiashi Feng et.al.	2510.19944	null
2025-10-21	Re-Activating Frozen Primitives for 3D Gaussian Splatting	Yuxin Cheng et.al.	2510.19653	null
2025-10-22	PoseCrafter: Extreme Pose Estimation with Hybrid Video Synthesis	Qing Mao et.al.	2510.19527	null
2025-10-22	PRGCN: A Graph Memory Network for Cross-Sequence Pattern Reuse in 3D Human Pose Estimation	Zhuoyang Xie et.al.	2510.19475	null
2025-10-22	AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields	Woo Jae Kim et.al.	2510.19371	null
2025-10-22	ProTerrain: Probabilistic Physics-Informed Rough Terrain World Modeling	Golnaz Raja et.al.	2510.19364	null
2025-10-22	Advances in 4D Representation: Geometry, Motion, and Interaction	Mingrui Zhao et.al.	2510.19255	null
2025-10-22	SFGFusion: Surface Fitting Guided 3D Object Detection with 4D Radar and Camera Fusion	Xiaozhi Li et.al.	2510.19215	null
2025-10-22	MoE-GS: Mixture of Experts for Dynamic Gaussian Splatting	In-Hwan Jin et.al.	2510.19210	null
2025-10-22	GRASPLAT: Enabling dexterous grasping through novel view synthesis	Matteo Bortolon et.al.	2510.19200	null
2025-10-22	Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks	Kai Zeng et.al.	2510.19195	null
2025-10-22	X-Ego: Acquiring Team-Level Tactical Situational Awareness via Cross-Egocentric Contrastive Video Representation Learning	Yunzhe Wang et.al.	2510.19150	null
2025-10-21	Advancing Brain Tumor Segmentation via Attention-based 3D U-Net Architecture and Digital Image Processing	Eyad Gad et.al.	2510.19109	null
2025-10-21	UniHPR: Unified Human Pose Representation via Singular Value Contrastive Learning	Zhongyu Jiang et.al.	2510.19078	null
2025-10-21	$Δ$ t-Mamba3D: A Time-Aware Spatio-Temporal State-Space Model for Breast Cancer Risk Prediction	Zhengbo Zhou et.al.	2510.19003	null
2025-10-21	SHRUMS: Sensor Hallucination for Real-time Underwater Motion Planning with a Compact 3D Sonar	Susheel Vadakkekuruppath et.al.	2510.18996	null
2025-10-21	Underwater Dense Mapping with the First Compact 3D Sonar	Chinmay Burgul et.al.	2510.18991	null
2025-10-21	DSI-Bench: A Benchmark for Dynamic Spatial Intelligence	Ziang Zhang et.al.	2510.18873	null
2025-10-21	Online Object-Level Semantic Mapping for Quadrupeds in Real-World Environments	Emad Razavi et.al.	2510.18776	null
2025-10-21	Moving Light Adaptive Colonoscopy Reconstruction via Illumination-Attenuation-Aware 3D Gaussian Splatting	Hao Wang et.al.	2510.18739	null
2025-10-21	PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splatting	Changkun Liu et.al.	2510.18714	null
2025-10-23	A Renaissance of Explicit Motion Information Mining from Transformers for Action Recognition	Peiqin Zhuang et.al.	2510.18705	null
2025-10-21	Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views	Zhangquan Chen et.al.	2510.18632	null
2025-10-21	GBlobs: Local LiDAR Geometry for Improved Sensor Placement Generalization	Dušan Malić et.al.	2510.18539	null
2025-10-21	LAND: Lung and Nodule Diffusion for 3D Chest CT Synthesis with Anatomical Guidance	Anna Oliveras et.al.	2510.18446	null
2025-10-21	Entropy-Enhanced Conformal Features from Ricci Flow for Robust Alzheimer’s Disease Classification	F. Ahmadi et.al.	2510.18396	null
2025-10-21	Coverage-Recon: Coordinated Multi-Drone Image Sampling with Online Map Feedback	Muhammad Hanif et.al.	2510.18347	null
2025-10-22	OmniNWM: Omniscient Driving Navigation World Models	Bohan Li et.al.	2510.18313	null
2025-10-21	Efficient Few-shot Identity Preserving Attribute Editing for 3D-aware Deep Generative Models	Vishal Vinod et.al.	2510.18287	null
2025-10-21	Latent-Info and Low-Dimensional Learning for Human Mesh Recovery and Parallel Optimization	Xiang Zhang et.al.	2510.18267	null
2025-10-21	Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery	Xiang Zhang et.al.	2510.18256	null
2025-10-21	OpenInsGaussian: Open-vocabulary Instance Gaussian Segmentation with Context-aware Cross-view Fusion	Tianyu Huang et.al.	2510.18253	null
2025-10-21	BlendCLIP: Bridging Synthetic and Real Domains for Zero-Shot 3D Object Classification with Multimodal Pretraining	Ajinkya Khoche et.al.	2510.18244	null
2025-10-21	A Generalizable Light Transport 3D Embedding for Global Illumination	Bing Xu et.al.	2510.18189	null
2025-10-20	Adapting Stereo Vision From Objects To 3D Lunar Surface Reconstruction with the StereoLunar Dataset	Clementine Grethen et.al.	2510.18172	null
2025-10-20	ANGEL: A Novel Gripper for Versatile and Light-touch Fruit Harvesting	Dharmik Patel et.al.	2510.18127	null
2025-10-20	From Volume Rendering to 3D Gaussian Splatting: Theory and Applications	Vitor Pereira Matias et.al.	2510.18101	null
2025-10-20	HouseTour: A Virtual Real Estate A(I)gent	Ata Çelen et.al.	2510.18054	null
2025-10-19	Conformal Lesion Segmentation for 3D Medical Images	Binyu Tan et.al.	2510.17897	null
2025-10-17	3D Weakly Supervised Semantic Segmentation via Class-Aware and Geometry-Guided Pseudo-Label Refinement	Xiaoxu Xu et.al.	2510.17875	null
2025-10-20	Botany-Bot: Digital Twin Monitoring of Occluded and Underleaf Plant Structures with Gaussian Splats	Simeon Adebola et.al.	2510.17783	null
2025-10-20	Raindrop GS: A Benchmark for 3D Gaussian Splatting under Raindrop Conditions	Zhiqiang Teng et.al.	2510.17719	null
2025-10-20	Towards 3D Objectness Learning in an Open World	Taichi Liu et.al.	2510.17686	null
2025-10-20	4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads	Ling Liu et.al.	2510.17664	null
2025-10-20	Frugal Federated Learning for Violence Detection: A Comparison of LoRA-Tuned VLMs and Personalized CNNs	Sébastien Thuau et.al.	2510.17651	null
2025-10-20	One Dinomaly2 Detect Them All: A Unified Framework for Full-Spectrum Unsupervised Anomaly Detection	Jia Guo et.al.	2510.17611	null
2025-10-20	Integrating BIM and UAV-based photogrammetry for Automated 3D Structure Model Segmentation	Siqi Chen et.al.	2510.17609	null
2025-10-20	ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling	Shuyuan Zhang et.al.	2510.17603	null
2025-10-21	PAGE-4D: Disentangled Pose and Geometry Estimation for 4D Perception	Kaichen Zhou et.al.	2510.17568	null
2025-10-20	MambaX-Net: Dual-Input Mamba-Enhanced Cross-Attention Network for Longitudinal MRI Segmentation	Yovin Yahathugoda et.al.	2510.17529	null
2025-10-20	HumanMPC - Safe and Efficient MAV Navigation among Humans	Simon Schaefer et.al.	2510.17525	null
2025-10-22	MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models	Yongshun Zhang et.al.	2510.17519	null
2025-10-20	Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS	Feng Zhou et.al.	2510.17479	null
2025-10-20	From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors	Zhengshen Zhang et.al.	2510.17439	null
2025-10-21	DeepDetect: Learning All-in-One Dense Keypoints	Shaharyar Ahmed Khan Tareen et.al.	2510.17422	null
2025-10-20	M2H: Multi-Task Learning with Efficient Window-Based Cross-Task Attention for Monocular Spatial Perception	U. V. B. L Udugama et.al.	2510.17363	null
2025-10-20	Pole-Image: A Self-Supervised Pole-Anchored Descriptor for Long-Term LiDAR Localization and Map Maintenance	Wuhao Xie et.al.	2510.17237	null
2025-10-20	Capturing Head Avatar with Hand Contacts from a Monocular Video	Haonan He et.al.	2510.17181	null
2025-10-20	GACO-CAD: Geometry-Augmented and Conciseness-Optimized CAD Model Generation from Single Image	Yinghui Wang et.al.	2510.17157	null
2025-10-21	DiffVLA++: Bridging Cognitive Reasoning and End-to-End Driving through Metric-Guided Alignment	Yu Gao et.al.	2510.17148	null
2025-10-20	KineDiff3D: Kinematic-Aware Diffusion for Category-Level Articulated Object Shape Reconstruction and Generation	WenBo Xu et.al.	2510.17137	null
2025-10-20	GSPlane: Concise and Accurate Planar Reconstruction via Structured Representation	Ruitong Gan et.al.	2510.17095	null
2025-10-20	Learning to Design Soft Hands using Reward Models	Xueqian Bai et.al.	2510.17086	null
2025-10-20	ProDAT: Progressive Density-Aware Tail-Drop for Point Cloud Coding	Zhe Luo et.al.	2510.17068	null
2025-10-19	Click, Predict, Trust: Clinician-in-the-Loop AI Segmentation for Lung Cancer CT-Based Prognosis within the Knowledge-to-Action Framework	Mohammad R. Salmanpour et.al.	2510.17039	null
2025-10-19	Where, Not What: Compelling Video LLMs to Learn Geometric Causality for 3D-Grounding	Yutong Zhong et.al.	2510.17034	null
2025-10-19	A Scalable In Transit Solution for Comprehensive Exploration of Simulation Data	Paascal Grosset et.al.	2510.16966	null
2025-10-21	RAPID Hand Prototype: Design of an Affordable, Fully-Actuated Biomimetic Hand for Dexterous Teleoperation	Zhaoliang Wan et.al.	2510.16931	null
2025-10-19	Registration is a Powerful Rotation-Invariance Learner for 3D Anomaly Detection	Yuyang Yu et.al.	2510.16865	null
2025-10-19	2DGS-R: Revisiting the Normal Consistency Regularization in 2D Gaussian Splatting	Haofan Ren et.al.	2510.16837	null
2025-10-19	GS2POSE: Marry Gaussian Splatting to 6D Object Pose Estimation	Junbo Li et.al.	2510.16777	null
2025-10-21	SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes	Xiongkun Linghu et.al.	2510.16714	null
2025-10-19	Pursuing Minimal Sufficiency in Spatial Reasoning	Yejie Guo et.al.	2510.16688	null
2025-10-18	Structured Interfaces for Automated Reasoning with 3D Scene Graphs	Aaron Ray et.al.	2510.16643	null
2025-10-18	Advancing Off-Road Autonomous Driving: The Large-Scale ORAD-3D Dataset and Comprehensive Benchmarks	Chen Min et.al.	2510.16500	null
2025-10-18	HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars	Haocheng Tang et.al.	2510.16463	null
2025-10-18	REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting	Changyue Shi et.al.	2510.16410	null
2025-10-23	SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation	Yeh Keng Hao et.al.	2510.16396	null
2025-10-18	Demeter: A Parametric Model of Crop Plant Morphology from the Real World	Tianhang Cheng et.al.	2510.16377	null
2025-10-17	Embody 3D: A Large-scale Multimodal Motion and Behavior Dataset	Claire McLean et.al.	2510.16258	null
2025-10-17	Automated C-Arm Positioning via Conformal Landmark Localization	Ahmad Arrabi et.al.	2510.16160	null
2025-10-17	Procedural Scene Programs for Open-Universe Scene Generation: LLM-Free Error Correction via Program Search	Maxim Gumin et.al.	2510.16147	null
2025-10-17	GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer	Sayan Deb Sarkar et.al.	2510.16136	null
2025-10-17	Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery	Jie-Ying Lee et.al.	2510.15869	null
2025-10-17	3DPR: Single Image 3D Portrait Relight using Generative Priors	Pramod Rao et.al.	2510.15846	null
2025-10-17	Dynamic Recalibration in LiDAR SLAM: Integrating AI and Geometric Methods with Real-Time Feedback Using INAF Fusion	Zahra Arjmandi et.al.	2510.15803	null
2025-10-17	SANR: Scene-Aware Neural Representation for Light Field Image Compression with Rate-Distortion Optimization	Gai Zhang et.al.	2510.15775	null
2025-10-17	Fix False Transparency by Noise Guided Splatting	Aly El Hakie et.al.	2510.15736	null
2025-10-17	Valeo Near-Field: a novel dataset for pedestrian intent detection	Antonyo Musabini et.al.	2510.15673	null
2025-10-17	Freehand 3D Ultrasound Imaging: Sim-in-the-Loop Probe Pose Optimization via Visual Servoing	Yameng Zhang et.al.	2510.15668	null
2025-10-17	Imaginarium: Vision-guided High-Quality 3D Scene Layout Generation	Xiaoming Zhu et.al.	2510.15564	null
2025-10-17	Diffusion Bridge Networks Simulate Clinical-grade PET from MRI for Dementia Diagnostics	Yitong Li et.al.	2510.15556	null
2025-10-17	Iterative Motion Compensation for Canonical 3D Reconstruction from UAV Plant Images Captured in Windy Conditions	Andre Rochow et.al.	2510.15491	null
2025-10-17	PFGS: Pose-Fused 3D Gaussian Splatting for Complete Multi-Pose Object Reconstruction	Ting-Yu Yen et.al.	2510.15386	null
2025-10-17	FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers	Haisheng Su et.al.	2510.15385	null
2025-10-17	GaussGym: An open-source real-to-sim framework for learning locomotion from pixels	Alejandro Escontrela et.al.	2510.15352	null
2025-10-17	SHARE: Scene-Human Aligned Reconstruction	Joshua Li et.al.	2510.15342	null
2025-10-17	Traversability-aware Consistent Situational Graphs for Indoor Localization and Mapping	Jeewon Kim et.al.	2510.15319	null
2025-10-17	DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion	Weijie Wang et.al.	2510.15264	null
2025-10-16	Deep generative priors for 3D brain analysis	Ana Lawry Aguila et.al.	2510.15119	null
2025-10-16	SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images	Jiaxin Guo et.al.	2510.15072	null
2025-10-16	Comprehensive language-image pre-training for 3D medical image understanding	Tassilo Wald et.al.	2510.15042	null
2025-10-16	NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks	Junliang Ye et.al.	2510.15019	null
2025-10-16	UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos	Mingxuan Liu et.al.	2510.15018	null
2025-10-16	Coupled Diffusion Sampling for Training-Free Multi-View Image Editing	Hadi Alzayer et.al.	2510.14981	null
2025-10-16	Terra: Explorable Native 3D World Model with Point Latents	Yuanhui Huang et.al.	2510.14977	null
2025-10-16	ChangingGrounding: 3D Visual Grounding in Changing Scenes	Miao Hu et.al.	2510.14965	null
2025-10-16	C4D: 4D Made from 3D through Dual Correspondences	Shizun Wang et.al.	2510.14960	null
2025-10-16	3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation	JoungBin Lee et.al.	2510.14945	null
2025-10-16	TOUCH: Text-guided Controllable Generation of Free-Form Hand-Object Interactions	Guangyi Han et.al.	2510.14874	null
2025-10-16	QDepth-VLA: Quantized Depth Prediction as Auxiliary Supervision for Vision-Language-Action Models	Yixuan Li et.al.	2510.14836	null
2025-10-16	RL-100: Performant Robotic Manipulation with Real-World Reinforcement Learning	Kun Lei et.al.	2510.14830	null
2025-10-16	Inpainting the Red Planet: Diffusion Models for the Reconstruction of Martian Environments in Virtual Reality	Giuseppe Lorenzo Catalano et.al.	2510.14765	null
2025-10-16	Leveraging Learned Image Prior for 3D Gaussian Compression	Seungjoo Shin et.al.	2510.14705	null
2025-10-16	GOPLA: Generalizable Object Placement Learning via Synthetic Augmentation of Human Arrangement	Yao Zhong et.al.	2510.14627	null
2025-10-16	BalanceGS: Algorithm-System Co-design for Efficient 3D Gaussian Splatting Training on GPU	Junyi Wu et.al.	2510.14564	null
2025-10-16	Towards Generalist Intelligence in Dentistry: Vision Foundation Models for Oral and Maxillofacial Radiology	Xinrui Huang et.al.	2510.14532	null
2025-10-16	DRBD-Mamba for Robust and Efficient Brain Tumor Segmentation with Analytical Insights	Danish Ali et.al.	2510.14383	null
2025-10-16	SUM-AgriVLN: Spatial Understanding Memory for Agricultural Vision-and-Language Navigation	Xiaobei Zhao et.al.	2510.14357	null
2025-11-11	GauSSmart: Enhanced 3D Reconstruction through 2D Foundation Models and Geometric Filtering	Alexander Valverde et.al.	2510.14270	null
2025-10-16	Prescribed Performance Control of Deformable Object Manipulation in Spatial Latent Space	Ning Han et.al.	2510.14234	null
2025-10-16	Virtually Being: Customizing Camera-Controllable Video Diffusion Models with Multi-View Performance Captures	Yuancheng Xu et.al.	2510.14179	null
2025-10-15	cubic: CUDA-accelerated 3D Bioimage Computing	Alexandr A. Kalinin et.al.	2510.14143	null
2025-10-17	Capture, Canonicalize, Splat: Zero-Shot 3D Gaussian Avatars from Unstructured Phone Images	Emanuel Garbin et.al.	2510.14081	null
2025-10-15	Trace Anything: Representing Any Video in 4D via Trajectory Fields	Xinhang Liu et.al.	2510.13802	null
2025-10-16	Reasoning in Space via Grounding in the World	Yiming Chen et.al.	2510.13800	null
2025-10-15	LiFMCR: Dataset and Benchmark for Light Field Multi-Camera Registration	Aymeric Fleith et.al.	2510.13729	null
2025-10-15	FlashWorld: High-quality 3D Scene Generation within Seconds	Xinyang Li et.al.	2510.13678	null
2025-10-16	OmniGaze: Reward-inspired Generalizable Gaze Estimation In The Wild	Hongyu Qu et.al.	2510.13660	null
2025-10-11	EditCast3D: Single-Frame-Guided 3D Editing with Video Propagation and View Selection	Huaizhi Qu et.al.	2510.13652	null
2025-10-15	PlanarMesh: Building Compact 3D Meshes from LiDAR using Incremental Adaptive Resolution Reconstruction	Jiahao Wang et.al.	2510.13599	null
2025-10-16	Hoecken-D Hand: A Novel Robotic Hand for Linear Parallel Pinching and Self-Adaptive Grasping	Wentao Guo et.al.	2510.13553	null
2025-10-15	Learning Neural Parametric 3D Breast Shape Models for Metrical Surface Reconstruction From Monocular RGB Videos	Maximilian Weiherer et.al.	2510.13540	null
2025-10-15	A Novel Robot Hand with Hoeckens Linkages and Soft Phalanges for Scooping and Self-Adaptive Grasping in Environmental Constraints	Wentao Guo et.al.	2510.13535	null
2025-10-15	VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator	Hyojun Go et.al.	2510.13454	null
2025-10-15	Beyond Pixels: A Differentiable Pipeline for Probing Neuronal Selectivity in 3D	Pavithra Elumalai et.al.	2510.13433	null
2025-10-15	Leveraging 2D Priors and SDF Guidance for Dynamic Urban Scene Rendering	Siddharth Tourani et.al.	2510.13381	null
2025-10-15	DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning	Tianyuan Yuan et.al.	2510.13375	null
2025-10-15	No-Reference Rendered Video Quality Assessment: Dataset and Metrics	Sipeng Yang et.al.	2510.13349	null
2025-10-15	InstantSfM: Fully Sparse and Parallel Structure-from-Motion	Jiankun Zhong et.al.	2510.13310	null
2025-10-15	Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning	Yang Li et.al.	2510.13307	null
2025-10-16	CymbaDiff: Structured Spatial Diffusion for Sketch-based 3D Semantic Urban Scene Generation	Li Liang et.al.	2510.13245	null
2025-10-15	FlyAwareV2: A Multimodal Cross-Domain UAV Dataset for Urban Scene Understanding	Francesco Barbato et.al.	2510.13243	null
2025-10-15	Prompt-based Adaptation in Large-scale Vision Models: A Survey	Xi Xiao et.al.	2510.13219	null
2025-10-15	MimicParts: Part-aware Style Injection for Speech-Driven 3D Motion Generation	Lianlian Liu et.al.	2510.13208	null
2025-10-15	Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion	Rongtao Xu et.al.	2510.13198	null
2025-10-15	Real-Time Sign Language to text Translation using Deep Learning: A Comparative study of LSTM and 3D CNN	Madhumati Pol et.al.	2510.13137	null
2025-10-15	True Self-Supervised Novel View Synthesis is Transferable	Thomas W. Mitchel et.al.	2510.13063	null
2025-10-14	Enhancing Sampling-based Planning with a Library of Paths	Michal Minařík et.al.	2510.12962	null
2025-10-14	Gaussian Process Implicit Surfaces as Control Barrier Functions for Safe Robot Navigation	Mouhyemen Khan et.al.	2510.12919	null
2025-10-16	SimULi: Real-Time LiDAR and Camera Simulation with Unscented Transforms	Haithem Turki et.al.	2510.12901	null
2025-10-14	MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars	Felix Taubner et.al.	2510.12785	null
2025-10-14	Uncertainty Matters in Dynamic Gaussian Splatting for Monocular 4D Reconstruction	Fengzhi Guo et.al.	2510.12768	null
2025-10-14	PET Head Motion Estimation Using Supervised Deep Learning with Attention	Zhuotong Cai et.al.	2510.12758	null
2025-10-14	E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization	Wenpu Li et.al.	2510.12753	null
2025-10-15	MCOP: Multi-UAV Collaborative Occupancy Prediction	Zefu Lin et.al.	2510.12679	null
2025-10-14	MMOT: The First Challenging Benchmark for Drone-based Multispectral Multi-Object Tracking	Tianhao Li et.al.	2510.12565	null
2025-10-14	Voronoi-Assisted Diffusion for Computing Unsigned Distance Fields from Unoriented Points	Jiayi Kong et.al.	2510.12524	null
2025-10-17	BSGS: Bi-stage 3D Gaussian Splatting for Camera Motion Deblurring	An Zhao et.al.	2510.12493	null
2025-10-14	M3D-skin: Multi-material 3D-printed Tactile Sensor with Hierarchical Infill Structures for Pressure Sensing	Shunnosuke Yoshimura et.al.	2510.12419	null
2025-10-14	Scene Coordinate Reconstruction Priors	Wenjing Bian et.al.	2510.12387	null
2025-10-14	Deep Attention-guided Adaptive Subsampling	Sharath M Shankaranarayana et.al.	2510.12376	null
2025-10-14	Controlling Intent Expressiveness in Robot Motion with Diffusion Models	Wenli Shi et.al.	2510.12370	null
2025-10-14	CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion	Jinzhou Lin et.al.	2510.12362	null
2025-10-14	Hybrid Gaussian Splatting for Novel Urban View Synthesis	Mohamed Omran et.al.	2510.12308	null
2025-10-14	PAGS: Priority-Adaptive Gaussian Splatting for Dynamic Driving Scenes	Ying A et.al.	2510.12282	null
2025-10-17	Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model	Fuhao Li et.al.	2510.12276	null
2025-10-14	BEEP3D: Box-Supervised End-to-End Pseudo-Mask Generation for 3D Instance Segmentation	Youngju Yoo et.al.	2510.12182	null
2025-10-14	UniGS: Unified Geometry-Aware Gaussian Splatting for Multimodal Rendering	Yusen Xie et.al.	2510.12174	null
2025-10-14	Hardware-aware Coding Function Design for Compressive Single-Photon 3D Cameras	David Parra et.al.	2510.12123	null
2025-10-14	Gaussian Semantic Field for One-shot LiDAR Global Localization	Pengyu Yin et.al.	2510.12101	null
2025-10-14	G4Splat: Geometry-Guided Gaussian Splatting with Generative Prior	Junfeng Ni et.al.	2510.12099	null
2025-10-14	IL3D: A Large-Scale Indoor Layout Dataset for LLM-Driven 3D Scene Generation	Wenxu Zhou et.al.	2510.12095	null
2025-10-13	Prompt-Guided Spatial Understanding with RGB-D Transformers for Fine-Grained Object Relation Reasoning	Tanner Muturi et.al.	2510.11996	null
2025-10-13	PanoTPS-Net: Panoramic Room Layout Estimation via Thin Plate Spline Transformation	Hatem Ibrahem et.al.	2510.11992	null
2025-10-13	MammoDINO: Anatomically Aware Self-Supervision for Mammographic Images	Sicheng Zhou et.al.	2510.11883	null
2025-10-13	GS-Verse: Mesh-based Gaussian Splatting for Physics-aware Interaction in Virtual Reality	Anastasiya Pechko et.al.	2510.11878	null
2025-10-13	Enhancing the Quality of 3D Lunar Maps Using JAXA’s Kaguya Imagery	Yumi Iwashita et.al.	2510.11817	null
2025-10-13	Audio-Guided Visual Perception for Audio-Visual Navigation	Yi Wang et.al.	2510.11760	null
2025-10-13	Ev4DGS: Novel-view Rendering of Non-Rigid Objects from Monocular Event Streams	Takuya Nakabayashi et.al.	2510.11717	null
2025-10-13	Phys2Real: Fusing VLM Priors with Interactive Online Adaptation for Uncertainty-Aware Sim-to-Real Manipulation	Maggie Wang et.al.	2510.11689	null
2025-10-13	Beyond ‘Templates’: Category-Agnostic Object Pose, Size, and Shape Estimation from a Single View	Jinyu Zhang et.al.	2510.11687	null
2025-10-13	InfiniHuman: Infinite 3D Human Creation with Precise Control	Yuxuan Xue et.al.	2510.11650	null
2025-10-13	PhySIC: Physically Plausible 3D Human-Scene Interaction and Contact from a Single Image	Pradyumna Yalandur Muralidhar et.al.	2510.11649	null
2025-10-13	NV3D: Leveraging Spatial Shape Through Normal Vector-based 3D Object Detection	Krittin Chaowakarn et.al.	2510.11632	null
2025-10-13	EvoCAD: Evolutionary CAD Code Generation with Vision Language Models	Tobias Preintner et.al.	2510.11631	null
2025-10-13	High-resolution Photo Enhancement in Real-time: A Laplacian Pyramid Network	Feng Zhang et.al.	2510.11613	null
2025-10-13	A Framework for Low-Effort Training Data Generation for Urban Semantic Segmentation	Denis Zavadski et.al.	2510.11567	null
2025-10-13	SNAP: Towards Segmenting Anything in Any Point Cloud	Aniket Gupta et.al.	2510.11565	null
2025-10-13	Situat3DChange: Situated 3D Change Understanding Dataset for Multimodal Large Language Model	Ruiping Liu et.al.	2510.11509	null
2025-10-13	Coordinated Strategies in Realistic Air Combat by Hierarchical Multi-Agent Reinforcement Learning	Ardian Selmonaj et.al.	2510.11474	null
2025-10-13	VA-GS: Enhancing the Geometric Representation of Gaussian Splatting via View Alignment	Qing Li et.al.	2510.11473	null
2025-10-14	REACT3D: Recovering Articulations for Interactive Physical 3D Scenes	Zhao Huang et.al.	2510.11340	null
2025-10-13	sketch2symm: Symmetry-aware sketch-to-shape generation via semantic bridging	Yan Zhou et.al.	2510.11303	null
2025-10-13	Investigating Identity Signals in Conversational Facial Dynamics via Disentangled Expression Features	Masoumeh Chapariniya et.al.	2510.11223	null
2025-10-13	MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps	Jiahui Lei et.al.	2510.11107	null
2025-10-13	Benchmarking Deep Learning Models for Laryngeal Cancer Staging Using the LaryngealCT Dataset	Nivea Roy et.al.	2510.11047	null
2025-10-13	Into the Unknown: Towards using Generative Models for Sampling Priors of Environment Uncertainty for Planning in Configuration Spaces	Subhransu S. Bhattacharjee et.al.	2510.11014	null
2025-10-13	Perspective-aware 3D Gaussian Inpainting with Multi-view Consistency	Yuxin Cheng et.al.	2510.10993	null
2025-10-13	Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey	Shuanghao Bai et.al.	2510.10903	null
2025-10-13	FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding	Soroush Mehraban et.al.	2510.10868	null
2025-10-12	Full segmentation annotations of 3D time-lapse microscopy images of MDA231 cells	Aleksandra Melnikova et.al.	2510.10797	null
2025-10-12	ImHead: A Large-scale Implicit Morphable Model for Localized Head Modeling	Rolandos Alexandros Potamias et.al.	2510.10793	null
2025-10-12	Structured Spectral Graph Learning for Multi-label Abnormality Classification in 3D Chest CT Scans	Theo Di Piazza et.al.	2510.10779	null
2025-10-12	Real2USD: Scene Representations in Universal Scene Description Language	Christopher D. Hsu et.al.	2510.10778	null
2025-10-12	MATStruct: High-Quality Medial Mesh Computation via Structure-aware Variational Optimization	Ningna Wang et.al.	2510.10751	null
2025-10-12	WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting	Yifan Liu et.al.	2510.10726	null
2025-10-12	High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting	Haoyu Zhao et.al.	2510.10637	null
2025-10-12	SpikeGrasp: A Benchmark for 6-DoF Grasp Pose Detection from Stereo Spike Streams	Zhuoheng Gao et.al.	2510.10602	null
2025-10-12	SuperEx: Enhancing Indoor Mapping and Exploration using Non-Line-of-Sight Perception	Kush Garg et.al.	2510.10506	null
2025-10-12	Jigsaw3D: Disentangled 3D Style Transfer via Patch Shuffling and Masking	Yuteng Ye et.al.	2510.10497	null
2025-10-12	Towards Efficient 3D Gaussian Human Avatar Compression: A Prior-Guided Framework	Shanzhi Yin et.al.	2510.10492	null
2025-10-12	DAGLFNet:Deep Attention-Guided Global-Local Feature Fusion for Pseudo-Image Point Cloud Segmentation	Chuang Chen et.al.	2510.10471	null
2025-10-12	Combo-Gait: Unified Transformer Framework for Multi-Modal Gait Recognition and Attribute Analysis	Zhao-Yang Wang et.al.	2510.10417	null
2025-10-12	Mesh-Gait: A Unified Framework for Gait Recognition Through Multi-Modal Representation Learning from 2D Silhouettes	Zhao-Yang Wang et.al.	2510.10406	null
2025-10-11	PointMAC: Meta-Learned Adaptation for Robust Test-Time Point Cloud Completion	Linlian Jiang et.al.	2510.10365	null
2025-10-11	sqrtVINS: Robust and Ultrafast Square-Root Filter-based 3D Motion Tracking	Yuxiang Peng et.al.	2510.10346	null
2025-10-11	From Programs to Poses: Factored Real-World Scene Generation via Learned Program Libraries	Joy Hsu et.al.	2510.10292	null
2025-10-11	Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking	Markus Käppeler et.al.	2510.10287	null
2025-10-11	Opacity-Gradient Driven Density Control for Compact and Efficient Few-Shot 3D Gaussian Splatting	Abdelrhman Elrawy et.al.	2510.10257	null
2025-10-11	Are Video Models Emerging as Zero-Shot Learners and Reasoners in Medical Imaging?	Yuxiang Lai et.al.	2510.10254	null
2025-10-11	B2N3D: Progressive Learning from Binary to N-ary Relationships for 3D Object Grounding	Feng Xiao et.al.	2510.10194	null
2025-10-14	HccePose(BF): Predicting Front & Back Surfaces to Construct Ultra-Dense 2D-3D Correspondences for Pose Estimation	Yulin Wang et.al.	2510.10177	null
2025-10-11	Color3D: Controllable and Consistent 3D Colorization with Personalized Colorizer	Yecong Wan et.al.	2510.10152	null
2025-10-28	Gesplat: Robust Pose-Free 3D Reconstruction via Geometry-Guided Gaussian Splatting	Jiahui Lu et.al.	2510.10097	null
2025-10-11	P-4DGS: Predictive 4D Gaussian Splatting with 90 $\times$ Compression	Henan Wang et.al.	2510.10030	null
2025-10-11	Hybrid Robotic Meta-gripper for Tomato Harvesting: Analysis of Auxetic Structures with Lattice Orientation Variations	Shahid Ansari et.al.	2510.10016	null
2025-10-11	CLoD-GS: Continuous Level-of-Detail via 3D Gaussian Splatting	Zhigang Cheng et.al.	2510.09997	null
2025-10-11	FlareX: A Physics-Informed Dataset for Lens Flare Removal via 2D Synthesis and 3D Rendering	Lishen Qu et.al.	2510.09995	null
2025-10-11	VG-Mapping: Variation-Aware 3D Gaussians for Online Semi-static Scene Mapping	Yicheng He et.al.	2510.09962	null
2025-10-11	Semi-disentangled spatiotemporal implicit neural representations of longitudinal neuroimaging data for trajectory classification	Agampreet Aulakh et.al.	2510.09936	null
2025-10-10	SpectralCA: Bi-Directional Cross-Attention for Next-Generation UAV Hyperspectral Vision	D. V. Brovko et.al.	2510.09912	null
2025-10-10	An uncertainty-aware framework for data-efficient multi-view animal pose estimation	Lenny Aharon et.al.	2510.09903	null
2025-10-10	LTGS: Long-Term Gaussian Scene Chronology From Sparse View Updates	Minkwan Kim et.al.	2510.09881	null
2025-10-10	Geometry-Aware Scene Configurations for Novel View Synthesis	Minkwan Kim et.al.	2510.09880	null
2025-10-10	SpaceVista: All-Scale Visual Spatial Reasoning from mm to km	Peiwen Sun et.al.	2510.09606	null
2025-10-10	Vision Language Models: A Survey of 26K Papers	Fengming Lin et.al.	2510.09586	null
2025-10-10	FLOWING: Implicit Neural Flows for Structure-Preserving Morphing	Arthur Bizzi et.al.	2510.09537	null
2025-10-10	Toggling stiffness via multistability	Hugo de Souza Oliveira et.al.	2510.09511	null
2025-10-10	A methodology for clinically driven interactive segmentation evaluation	Parhom Esmaeili et.al.	2510.09499	null
2025-10-10	Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians	Jin-Chuan Shi et.al.	2510.09438	null
2025-10-10	Minkowski-MambaNet: A Point Cloud Framework with Selective State Space Models for Forest Biomass Quantification	Jinxiang Tu et.al.	2510.09367	null
2025-10-10	A Biophysically-Conditioned Generative Framework for 3D Brain Tumor MRI Synthesis	Valentin Biller et.al.	2510.09365	null
2025-10-10	Visibility-Aware Densification for 3D Gaussian Splatting in Dynamic Urban Scenes	Yikang Zhang et.al.	2510.09364	null
2025-10-10	Obstacle Avoidance using Dynamic Movement Primitives and Reinforcement Learning	Dominik Urbaniak et.al.	2510.09254	null
2025-10-10	3D Reconstruction from Transient Measurements with Time-Resolved Transformer	Yue Li et.al.	2510.09205	null
2025-10-14	Online Topological Localization for Navigation Assistance in Bronchoscopy	Clara Tomasini et.al.	2510.09144	null
2025-10-10	iMoWM: Taming Interactive Multi-Modal World Model for Robotic Manipulation	Chuanrui Zhang et.al.	2510.09036	null
2025-10-10	Exploring Single Domain Generalization of LiDAR-based Semantic Segmentation under Imperfect Labels	Weitong Kong et.al.	2510.09035	null
2025-10-10	mmJoints: Expanding Joint Representations Beyond (x,y,z) in mmWave-Based 3D Pose Estimation	Zhenyu Wang et.al.	2510.08970	null
2025-10-10	SAM2-3dMed: Empowering SAM2 for 3D Medical Image Segmentation	Yeqing Yang et.al.	2510.08967	null
2025-10-10	Direct Data-Driven Predictive Control for a Three-dimensional Cable-Driven Soft Robotic Arm	Cheng Ouyang et.al.	2510.08953	null
2025-10-09	FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation	Hongrui Wu et.al.	2510.08849	null
2025-10-09	Reinforcement Learning-Driven Edge Management for Reliable Multi-view 3D Reconstruction	Motahare Mounesan et.al.	2510.08839	null
2025-10-09	BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities	Yu Qi et.al.	2510.08759	null
2025-10-09	Hulu-Med: A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding	Songtao Jiang et.al.	2510.08668	null
2025-10-09	A 3D Generation Framework from Cross Modality to Parameterized Primitive	Yiming Liang et.al.	2510.08656	null
2025-10-09	ReSplat: Learning Recurrent Gaussian Splats	Haofei Xu et.al.	2510.08575	null
2025-10-09	NovaFlow: Zero-Shot Manipulation via Actionable Flow from Generated Videos	Hongyu Li et.al.	2510.08568	null
2025-10-09	D $^2$ GS: Depth-and-Density Guided Gaussian Splatting for Stable and Accurate Sparse-View Reconstruction	Meixi Song et.al.	2510.08566	null
2025-10-09	ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation	Guanghao Li et.al.	2510.08551	null
2025-10-09	R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation	Xiuwei Xu et.al.	2510.08547	null
2025-10-09	Have We Scene It All? Scene Graph-Aware Deep Point Cloud Compression	Nikolaos Stathoulopoulos et.al.	2510.08512	null
2025-10-09	Splat the Net: Radiance Fields with Splattable Neural Primitives	Xilong Zhou et.al.	2510.08491	null
2025-10-09	DexMan: Learning Bimanual Dexterous Manipulation from Human and Generated Videos	Jhen Hsieh et.al.	2510.08475	null
2025-10-09	Biology-driven assessment of deep learning super-resolution imaging of the porosity network in dentin	Lauren Anderson et.al.	2510.08407	null
2025-10-09	Unlocking 3D Affordance Segmentation with 2D Semantic Knowledge	Yu Huang et.al.	2510.08316	null
2025-10-10	Learning Neural Exposure Fields for View Synthesis	Michael Niemeyer et.al.	2510.08279	null
2025-10-09	SViM3D: Stable Video Material Diffusion for Single Image 3D Generation	Andreas Engelhardt et.al.	2510.08271	null
2025-10-09	Efficient Label Refinement for Face Parsing Under Extreme Poses Using 3D Gaussian Splatting	Ankit Gahlawat et.al.	2510.08096	null
2025-10-09	RayFusion: Ray Fusion Enhanced Collaborative Visual Perception	Shaohong Wang et.al.	2510.08017	null
2025-10-09	CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving	Tianrui Zhang et.al.	2510.07944	null
2025-10-09	MMM: Quantum-Chemical Molecular Representation Learning for Combinatorial Drug Recommendation	Chongmyung Kwon et.al.	2510.07910	null
2025-10-09	XYZCylinder: Feedforward Reconstruction for Driving Scenes Based on A Unified Cylinder Lifting Method	Haochen Yu et.al.	2510.07856	null
2025-10-09	AlignGS: Aligning Geometry and Semantics for Robust Indoor Reconstruction from Sparse Views	Yijie Gao et.al.	2510.07839	null
2025-10-09	PrismGS: Physically-Grounded Anti-Aliasing for High-Fidelity Large-Scale 3D Gaussian Splatting	Houqiang Zhong et.al.	2510.07830	null
2025-10-11	MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions	Kaen Kogashi et.al.	2510.07828	null
2025-10-09	Demystifying Deep Learning-based Brain Tumor Segmentation with 3D UNets and Explainable AI (XAI): A Comparative Analysis	Ming Jie Ong et.al.	2510.07785	null
2025-10-09	DEGS: Deformable Event-based 3D Gaussian Splatting from RGB and Event Stream	Junhao He et.al.	2510.07752	null
2025-10-09	ComGS: Efficient 3D Object-Scene Composition via Surface Octahedral Probes	Jian Gao et.al.	2510.07729	null
2025-10-09	SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction	Wenyue Chen et.al.	2510.07723	null
2025-10-09	EB-MBD: Emerging-Barrier Model-Based Diffusion for Safe Trajectory Optimization in Highly Constrained Environments	Raghav Mishra et.al.	2510.07700	null
2025-10-09	Controllable Video Synthesis via Variational Inference	Haoyi Duan et.al.	2510.07670	null
2025-10-09	PIT-QMM: A Large Multimodal Model For No-Reference Point Cloud Quality Assessment	Shashank Gupta et.al.	2510.07636	null
2025-10-08	IGUANA: Immersive Guidance, Navigation, and Control for Consumer UAV	Victor Victor et.al.	2510.07609	null
2025-10-08	TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation	Jiaben Chen et.al.	2510.07249	null
2025-10-08	COMPAct: Computational Optimization and Automated Modular design of Planetary Actuators	Aman Singh et.al.	2510.07197	null
2025-10-08	Validation of Various Normalization Methods for Brain Tumor Segmentation: Can Federated Learning Overcome This Heterogeneity?	Jan Fiszer et.al.	2510.07126	null
2025-10-08	MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency	Dongki Jung et.al.	2510.07119	null
2025-10-08	Temporal-Prior-Guided View Planning for Periodic 3D Plant Reconstruction	Sicong Pan et.al.	2510.07028	null
2025-10-08	Generating Surface for Text-to-3D using 2D Gaussian Splatting	Huanning Dong et.al.	2510.06967	null
2025-10-08	OBJVanish: Physically Realizable Text-to-3D Adv. Generation of LiDAR-Invisible Objects	Bing Li et.al.	2510.06952	null
2025-10-08	HARP-NeXt: High-Speed and Accurate Range-Point Fusion Network for 3D LiDAR Semantic Segmentation	Samir Abou Haidar et.al.	2510.06876	null
2025-10-08	Distributed 3D Source Seeking via SO(3) Geometric Control of Robot Swarms	Jesús Bautista et.al.	2510.06836	null
2025-10-08	Capture and Interact: Rapid 3D Object Acquisition and Rendering with Gaussian Splatting in Unity	Islomjon Shukhratov et.al.	2510.06802	null
2025-10-08	UniFField: A Generalizable Unified Neural Feature Field for Visual, Semantic, and Spatial Uncertainties in Any Scene	Christian Maurer et.al.	2510.06754	null
2025-10-08	SCas4D: Structural Cascaded Optimization for Boosting Persistent 4D Novel View Synthesis	Jipeng Lyu et.al.	2510.06694	null
2025-10-08	FEAorta: A Fully Automated Framework for Finite Element Analysis of the Aorta From 3D CT Images	Jiasong Chen et.al.	2510.06621	null
2025-10-08	A Review of 10 Years of ProtoSpace: Spacecraft CAD Visualization in Collaborative Augmented Reality	Benjamin Nuernberger et.al.	2510.06608	null
2025-10-09	Through the Perspective of LiDAR: A Feature-Enriched and Uncertainty-Aware Annotation Pipeline for Terrestrial Point Cloud Segmentation	Fei Zhang et.al.	2510.06582	null
2025-10-07	Limited-Angle Tomography Reconstruction via Projector Guided 3D Diffusion	Zhantao Deng et.al.	2510.06516	null
2025-10-07	Active Next-Best-View Optimization for Risk-Averse Path Planning	Amirhossein Mollaei Khass et.al.	2510.06481	null
2025-10-07	SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation	Oindrila Saha et.al.	2510.06469	null
2025-10-06	General and Efficient Visual Goal-Conditioned Reinforcement Learning using Object-Agnostic Masks	Fahim Shahriar et.al.	2510.06277	null
2025-10-06	A Total Variation Regularized Framework for Epilepsy-Related MRI Image Segmentation	Mehdi Rabiee et.al.	2510.06276	null
2025-10-03	multimodars: A Rust-powered toolkit for multi-modality cardiac image fusion and registration	Anselm W. Stark et.al.	2510.06241	null
2025-10-07	Human3R: Everyone Everywhere All at Once	Yue Chen et.al.	2510.06219	null
2025-10-07	Dropping the D: RGB-D SLAM Without the Depth Sensor	Mert Kiray et.al.	2510.06216	null
2025-10-07	ShapeGen4D: Towards High Quality 4D Shape Generation from Videos	Jiraphon Yenphraphai et.al.	2510.06208	null
2025-10-07	DYMO-Hair: Generalizable Volumetric Dynamics Modeling for Robot Hair Manipulation	Chengyang Zhao et.al.	2510.06199	null
2025-10-07	Vision-Guided Targeted Grasping and Vibration for Robotic Pollination in Controlled Environments	Jaehwan Jeong et.al.	2510.06146	null
2025-10-07	Bimanual 3D Hand Motion and Articulation Forecasting in Everyday Images	Aditya Prakash et.al.	2510.06145	null
2025-10-07	GLVD: Guided Learned Vertex Descent	Pol Caselles Rico et.al.	2510.06046	null
2025-10-07	Coordinate-Consistent Localization via Continuous-Time Calibration and Fusion of UWB and SLAM Observations	Tien-Dat Nguyen et.al.	2510.05992	null
2025-10-10	ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving	Yongxuan Lyu et.al.	2510.05752	null
2025-10-07	PointNSP: Autoregressive 3D Point Cloud Generation with Next-Scale Level-of-Detail Prediction	Ziqiao Meng et.al.	2510.05613	null
2025-10-07	HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video	Hongchi Xia et.al.	2510.05560	null
2025-10-07	GO-Flock: Goal-Oriented Flocking in 3D Unknown Environments with Depth Maps	Yan Rui Tan et.al.	2510.05553	null
2025-10-09	Human Action Recognition from Point Clouds over Time	James Dickens et.al.	2510.05506	null
2025-10-07	ArchitectHead: Continuous Level of Detail Control for 3D Gaussian Head Avatars	Peizhi Yan et.al.	2510.05488	null
2025-10-06	AD-NODE: Adaptive Dynamics Learning with Neural ODEs for Mobile Robots Control	Shao-Yi Yu et.al.	2510.05443	null
2025-10-06	Active Semantic Perception	Huayi Tang et.al.	2510.05430	null
2025-10-06	SegMASt3R: Geometry Grounded Segment Matching	Rohit Jayanti et.al.	2510.05051	null
2025-10-11	Efficient Navigation in Unknown Indoor Environments with Vision-Language Models	D. Schwartz et.al.	2510.04991	null
2025-10-06	Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion	Xin Li et.al.	2510.04947	null
2025-10-06	From Actions to Kinesics: Extracting Human Psychological States through Bodily Movements	Cheyu Lin et.al.	2510.04844	null
2025-10-06	Detailed Aerial Mapping of Photovoltaic Power Plants Through Semantically Significant Keypoints	Viktor Kozák et.al.	2510.04840	null
2025-10-06	Flow Matching for Conditional MRI-CT and CBCT-CT Image Synthesis	Arnela Hadzic et.al.	2510.04823	null
2025-10-06	Did you just see that? Arbitrary view synthesis for egocentric replay of operating room workflows from ambient sensors	Han Zhang et.al.	2510.04802	null
2025-10-06	A Comparative Study of Vision Transformers and CNNs for Few-Shot Rigid Transformation and Fundamental Matrix Estimation	Alon Kaya et.al.	2510.04794	null
2025-10-06	Hands-Free Heritage: Automated 3D Scanning for Cultural Heritage Digitization	Javed Ahmad et.al.	2510.04781	null
2025-10-08	Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabulary Occupancy Prediction	Chi Yan et.al.	2510.04759	null
2025-10-06	Object-Centric Representation Learning for Enhanced 3D Scene Graph Prediction	KunHo Heo et.al.	2510.04714	null
2025-10-06	Label-Efficient Cross-Modality Generalization for Liver Segmentation in Multi-Phase MRI	Quang-Khai Bui-Tran et.al.	2510.04705	null
2025-10-06	Bio-Inspired Robotic Houbara: From Development to Field Deployment for Behavioral Studies	Lyes Saad Saoud et.al.	2510.04692	null
2025-10-06	C3Editor: Achieving Controllable Consistency in 2D Model for 3D Editing	Zeng Tao et.al.	2510.04539	null
2025-10-06	3Dify: a Framework for Procedural 3D-CG Generation Assisted by LLMs Using MCP and RAG	Shun-ichiro Hayashi et.al.	2510.04536	null
2025-10-06	VaseVQA-3D: Benchmarking 3D VLMs on Ancient Greek Pottery	Nonghai Zhang et.al.	2510.04479	null
2025-10-05	RAP: 3D Rasterization Augmented End-to-End Planning	Lan Feng et.al.	2510.04333	null
2025-10-05	CARE-PD: A Multi-Site Anonymized Clinical Dataset for Parkinson’s Disease Gait Assessment	Vida Adeli et.al.	2510.04312	null
2025-10-05	Scaling Sequence-to-Sequence Generative Neural Rendering	Shikun Liu et.al.	2510.04236	null
2025-10-05	Diffusion Low Rank Hybrid Reconstruction for Sparse View Medical Imaging	Zongyin Deng et.al.	2510.04069	null
2025-10-05	MetaFind: Scene-Aware 3D Asset Retrieval for Coherent Metaverse Scene Generation	Zhenyu Pan et.al.	2510.04057	null
2025-10-05	Fit Pixels, Get Labels: Meta-learned Implicit Networks for Image Segmentation	Kushal Vyas et.al.	2510.04021	null
2025-10-04	Sliding Window Attention for Learned Video Compression	Alexander Kopte et.al.	2510.03926	null
2025-10-04	Talking Tennis: Language Feedback from 3D Biomechanical Action Recognition	Arushi Dashore et.al.	2510.03921	null
2025-10-04	OpenFLAME: Federated Visual Positioning System to Enable Large-Scale Augmented Reality Applications	Sagar Bharadwaj et.al.	2510.03915	null
2025-10-04	Bridge Thinking and Acting: Unleashing Physical Potential of VLM with Generalizable Action Expert	Mingyu Liu et.al.	2510.03896	null
2025-10-04	Seeing the Bigger Picture: 3D Latent Mapping for Mobile Manipulation Policy Learning	Sunghwan Kim et.al.	2510.03885	null
2025-10-04	DHQA-4D: Perceptual Quality Assessment of Dynamic 4D Digital Human	Yunhao Li et.al.	2510.03874	null
2025-10-04	PoseGaze-AHP: A Knowledge-Based 3D Dataset for AI-Driven Ocular and Postural Diagnosis	Saja Al-Dabet et.al.	2510.03873	null
2025-10-04	Efficiency vs. Efficacy: Assessing the Compression Ratio-Dice Score Relationship through a Simple Benchmarking Framework for Cerebrovascular 3D Segmentation	Shimaa Elbana et.al.	2510.03769	null
2025-10-03	SketchPlan: Diffusion Based Drone Planning From Human Sketches	Sixten Norelius et.al.	2510.03545	null
2025-10-08	Platonic Transformers: A Solid Choice For Equivariance	Mohammad Mohaiminul Islam et.al.	2510.03511	null
2025-10-03	Digital-Twin Evaluation for Proactive Human-Robot Collision Avoidance via Prediction-Guided A-RRT*	Vadivelan Murugesan et.al.	2510.03496	null
2025-10-03	Spatial-ViLT: Enhancing Visual Spatial Reasoning through Multi-Task Learning	Chashi Mahiul Islam et.al.	2510.03441	null
2025-10-03	Style Brush: Guided Style Transfer for 3D Objects	Áron Samuel Kovács et.al.	2510.03433	null
2025-10-03	Real-time nonlinear inversion of magnetic resonance elastography with operator learning	Juampablo E. Heras Rivera et.al.	2510.03372	null
2025-10-08	Unified Unsupervised Anomaly Detection via Matching Cost Filtering	Zhe Zhang et.al.	2510.03363	null
2025-10-02	Sonar Image Datasets: A Comprehensive Survey of Resources, Challenges, and Applications	Larissa S. Gomes et.al.	2510.03353	null
2025-10-02	Visual Odometry with Transformers	Vlardimir Yugay et.al.	2510.03348	null
2025-09-30	Universal Beta Splatting	Rong Liu et.al.	2510.03312	null
2025-10-03	Memory Forcing: Spatio-Temporal Memory for Consistent Scene Generation on Minecraft	Junchao Huang et.al.	2510.03198	null
2025-10-03	Dynamic Prompt Generation for Interactive 3D Medical Image Segmentation Training	Tidiane Camaret Ndir et.al.	2510.03189	null
2025-10-03	ROGR: Relightable 3D Objects using Generative Relighting	Jiapeng Tang et.al.	2510.03163	null
2025-10-03	GeoComplete: Geometry-Aware Diffusion for Reference-Driven Image Completion	Beibei Lin et.al.	2510.03110	null
2025-10-03	Geometry Meets Vision: Revisiting Pretrained Semantics in Distilled Fields	Zhiting Mei et.al.	2510.03104	null
2025-10-03	3D-CovDiffusion: 3D-Aware Diffusion Policy for Coverage Path Planning	Chenyuan Chen et.al.	2510.03011	null
2025-10-03	Towards Scalable and Consistent 3D Editing	Ruihao Xia et.al.	2510.02994	null
2025-10-03	PyRadiomics-cuda: a GPU-accelerated 3D features extraction from medical images within PyRadiomics	Jakub Lisowski et.al.	2510.02894	null
2025-10-03	GS-Share: Enabling High-fidelity Map Sharing with Incremental Gaussian Splatting	Xinran Zhang et.al.	2510.02884	null
2025-10-03	Flow with the Force Field: Learning 3D Compliant Flow Matching Policies from Force and Demonstration-Guided Simulation Data	Tianyu Li et.al.	2510.02738	null
2025-10-03	From Tokens to Nodes: Semantic-Guided Motion Control for Dynamic 3D Gaussian Splatting	Jianing Chen et.al.	2510.02732	null
2025-10-03	Visualizing Spatial Point Clouds: A Task-Oriented Taxonomy	Mahsa Partovi et.al.	2510.02651	null
2025-10-02	Ego-Exo 3D Hand Tracking in the Wild with a Mobile Multi-Camera Rig	Patrick Rim et.al.	2510.02601	null
2025-10-02	PhysHMR: Learning Humanoid Control Policies from Vision for Physically Plausible Human Motion Reconstruction	Qiao Feng et.al.	2510.02566	null
2025-10-02	StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions	Bo-Hsu Ke et.al.	2510.02314	null
2025-10-02	Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities	Mario Medrano-Paredes et.al.	2510.02264	null
2025-10-02	GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation	Weijia Dou et.al.	2510.02186	null
2025-10-02	DisCo-Layout: Disentangling and Coordinating Semantic and Physical Refinement in a Multi-Agent Framework for 3D Indoor Layout Synthesis	Jialin Gao et.al.	2510.02178	null
2025-10-02	EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction	Lingxiang Hu et.al.	2510.02080	null
2025-10-02	GaussianMorphing: Mesh-Guided 3D Gaussians for Semantic-Aware Object Morphing	Mengtian Li et.al.	2510.02034	null
2025-10-02	LiLa-Net: Lightweight Latent LiDAR Autoencoder for 3D Point Cloud Reconstruction	Mario Resino et.al.	2510.02028	null
2025-10-02	ROI-GS: Interest-based Local Quality 3D Gaussian Splatting	Quoc-Anh Bui et.al.	2510.01978	null
2025-10-02	Calibrating the Full Predictive Class Distribution of 3D Object Detectors for Autonomous Driving	Cornelius Schröder et.al.	2510.01829	null
2025-10-02	An Anytime, Scalable and Complete Algorithm for Embedding a Manufacturing Procedure in a Smart Factory	Christopher Leet et.al.	2510.01770	null
2025-10-02	LOBE-GS: Load-Balanced and Efficient 3D Gaussian Splatting for Large-Scale Scene Reconstruction	Sheng-Hsiang Hung et.al.	2510.01767	null
2025-10-03	UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction	Jin Cao et.al.	2510.01669	null
2025-10-02	Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale	Yongbo Chen et.al.	2510.01665	null
2025-10-02	Discrete Facial Encoding: : A Framework for Data-driven Facial Display Discovery	Minh Tran et.al.	2510.01662	null
2025-10-02	Joint Deblurring and 3D Reconstruction for Macrophotography	Yifan Zhao et.al.	2510.01640	null
2025-10-02	MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics	Changmin Lee et.al.	2510.01619	null
2025-10-02	ActiveUMI: Robotic Manipulation with Active Perception from Robot-Free Human Demonstrations	Qiyuan Zeng et.al.	2510.01607	null
2025-10-02	Real-time Multi-Plane Segmentation Based on GPU Accelerated High-Resolution 3D Voxel Mapping for Legged Robot Locomotion	Shun Niijima et.al.	2510.01592	null
2025-10-01	From 2D to 3D, Deep Learning-based Shape Reconstruction in Magnetic Resonance Imaging: A Review	Emma McMillian et.al.	2510.01296	null
2025-10-01	EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory	Jiahao Wang et.al.	2510.01183	null
2025-10-01	Audio Driven Real-Time Facial Animation for Social Telepresence	Jiye Lee et.al.	2510.01176	null
2025-10-01	KeySG: Hierarchical Keyframe-Based 3D Scene Graphs	Abdelrhman Werby et.al.	2510.01049	null
2025-10-01	A Scene is Worth a Thousand Features: Feed-Forward Camera Localization from a Collection of Image Features	Axel Barroso-Laguna et.al.	2510.00978	null
2025-10-01	PAL-Net: A Point-Wise CNN with Patch-Attention for 3D Facial Landmark Localization	Ali Shadman Yazdi et.al.	2510.00910	null
2025-10-01	AI-CNet3D: An Anatomically-Informed Cross-Attention Network with Multi-Task Consistency Fine-tuning for 3D Glaucoma Classification	Roshan Kenia et.al.	2510.00882	null
2025-10-01	PhraseStereo: The First Open-Vocabulary Stereo Image Segmentation Dataset	Thomas Campagnolo et.al.	2510.00818	null
2025-10-01	Beyond one-hot encoding? Journey into compact encoding for large multi-class segmentation	Aaron Kujawa et.al.	2510.00667	null
2025-10-01	Enabling High-Frequency Cross-Modality Visual Positioning Service for Accurate Drone Landing	Haoyang Wang et.al.	2510.00646	null
2025-10-01	Multi-level Dynamic Style Transfer for NeRFs	Zesheng Li et.al.	2510.00592	null
2025-10-01	Cascaded Diffusion Framework for Probabilistic Coarse-to-Fine Hand Pose Estimation	Taeyun Woo et.al.	2510.00527	null
2025-10-01	Affordance-Guided Diffusion Prior for 3D Hand Reconstruction	Naru Suzuki et.al.	2510.00506	null
2025-10-01	A Fast and Precise Method for Searching Rectangular Tumor Regions in Brain MR Images	Hidenori Takeshima et.al.	2510.00505	null
2025-10-01	From Human Hands to Robot Arms: Manipulation Skills Transfer via Trajectory Alignment	Han Zhou et.al.	2510.00491	null
2025-10-01	Domain-Specialized Interactive Segmentation Framework for Meningioma Radiotherapy Planning	Junhyeok Lee et.al.	2510.00416	null
2025-09-30	TTT3R: 3D Reconstruction as Test-Time Training	Xingyu Chen et.al.	2509.26645	null
2025-09-30	MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation	Zhuoyang Liu et.al.	2509.26642	null
2025-09-30	Learning Generalizable Shape Completion with SIM(3) Equivariance	Yuqing Wang et.al.	2509.26631	null
2025-09-30	HART: Human Aligned Reconstruction Transformer	Xiyi Chen et.al.	2509.26621	null
2025-09-30	DA $^2$ : Depth Anything in Any Direction	Haodong Li et.al.	2509.26618	null
2025-09-30	Memory-Efficient 2D/3D Shape Assembly of Robot Swarms	Shuoyu Yue et.al.	2509.26518	null
2025-09-30	DEPTHOR++: Robust Depth Enhancement from a Real-World Lightweight dToF and RGB Guidance	Jijun Xiang et.al.	2509.26498	null
2025-09-30	Stylos: Multi-View 3D Stylization with Single-Forward Gaussian Splatting	Hanzhou Liu et.al.	2509.26455	null
2025-09-30	Continuous Space-Time Video Super-Resolution with 3D Fourier Fields	Alexander Becker et.al.	2509.26325	null
2025-09-30	ISyHand: A Dexterous Multi-finger Robot Hand with an Articulated Palm	Benjamin A. Richardson et.al.	2509.26236	null
2025-09-30	3DiFACE: Synthesizing and Editing Holistic 3D Facial Animation	Balamurugan Thambiraja et.al.	2509.26233	null
2025-09-30	Text-to-Scene with Large Reasoning Models	Frédéric Berdoz et.al.	2509.26091	null
2025-09-30	EasyOcc: 3D Pseudo-Label Supervision for Fully Self-Supervised Semantic Occupancy Prediction Models	Seamie Hayes et.al.	2509.26087	null
2025-09-30	GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts	Zhenyu Shu et.al.	2509.26055	null
2025-09-30	PFDepth: Heterogeneous Pinhole-Fisheye Joint Depth Estimation via Distortion-aware Gaussian-Splatted Volumetric Fusion	Zhiwei Zhang et.al.	2509.26008	null
2025-09-30	Towards Human Engagement with Realistic AI Combat Pilots	Ardian Selmonaj et.al.	2509.26002	null
2025-09-30	PinPoint3D: Fine-Grained 3D Part Segmentation from a Few Clicks	Bojun Zhang et.al.	2509.25970	null
2025-10-01	A Multimodal LLM Approach for Visual Question Answering on Multiparametric 3D Brain MRI	Arvind Murari Vepa et.al.	2509.25889	null
2025-09-30	Vector sketch animation generation with differentialable motion trajectories	Xinding Zhu et.al.	2509.25857	null
2025-09-30	IPDRecon: Image-Plane Geometric Decoding for View-Invariant Indoor Scene Reconstruction	Mingyang Li et.al.	2509.25744	null
2025-09-30	Dragging with Geometry: From Pixels to Geometry-Guided Image Editing	Xinyu Pu et.al.	2509.25740	null
2025-09-30	LieHMR: Autoregressive Human Mesh Recovery with $SO(3)$ Diffusion	Donghwan Kim et.al.	2509.25739	null
2025-09-30	Using Images from a Video Game to Improve the Detection of Truck Axles	Leandro Arab Marcomini et.al.	2509.25644	null
2025-09-29	GaussianLens: Localized High-Resolution Reconstruction via On-Demand Gaussian Densification	Yijia Weng et.al.	2509.25603	null
2025-09-29	Online Mapping for Autonomous Driving: Addressing Sensor Generalization and Dynamic Map Updates in Campus Environments	Zihan Zhang et.al.	2509.25542	null
2025-09-29	LLM-RG: Referential Grounding in Outdoor Scenarios using Large Language Models	Pranav Saxena et.al.	2509.25528	null
2025-09-29	Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity	Tu-Hoa Pham et.al.	2509.25520	null
2025-10-01	DepthLM: Metric Depth From Vision Language Models	Zhipeng Cai et.al.	2509.25413	null
2025-09-29	Computational Design and Single-Wire Sensing of 3D Printed Objects with Integrated Capacitive Touchpoints	S. Sandra Bae et.al.	2509.25387	null
2025-10-01	Editing Physiological Signals in Videos Using Latent Representations	Tianwen Zhou et.al.	2509.25348	null
2025-09-29	VGGT-X: When VGGT Meets Dense Novel View Synthesis	Yang Liu et.al.	2509.25191	null
2025-09-29	Visual Jigsaw Post-Training Improves MLLMs	Penghao Wu et.al.	2509.25190	null
2025-09-29	PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos	Ting-Hsuan Liao et.al.	2509.25183	null
2025-09-29	Triangle Splatting+: Differentiable Rendering with Opaque Triangles	Jan Held et.al.	2509.25122	null
2025-09-29	Unsupervised Representation Learning for 3D Mesh Parameterization with Semantic and Visibility Objectives	AmirHossein Zamani et.al.	2509.25094	null
2025-09-29	UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation	Guanjun Wu et.al.	2509.25079	null
2025-10-03	GEM: 3D Gaussian Splatting for Efficient and Accurate Cryo-EM Reconstruction	Huaizhi Qu et.al.	2509.25075	null
2025-09-29	LVT: Large-Scale Scene Reconstruction via Local View Transformers	Tooba Imtiaz et.al.	2509.25001	null
2025-09-29	PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion	Yuyang Yin et.al.	2509.24997	null
2025-09-29	Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes	Yuhan Wang et.al.	2509.24986	null
2025-09-29	On-the-Fly Data Augmentation for Brain Tumor Segmentation	Ishika Jain et.al.	2509.24973	null
2025-09-29	Social 3D Scene Graphs: Modeling Human Actions and Relations for Interactive Service Robots	Ermanno Bartoli et.al.	2509.24966	null
2025-09-29	Real-time Recognition of Human Interactions from a Single RGB-D Camera for Socially-Aware Robot Navigation	Thanh Long Nguyen et.al.	2509.24907	null
2025-09-29	DWGS: Enhancing Sparse-View Gaussian Splatting with Hybrid-Loss Depth Estimation and Bidirectional Warping	Yu Ma et.al.	2509.24893	null
2025-09-29	Finding an Initial Probe Pose in Teleoperated Robotic Echocardiography via 2D LiDAR-Based 3D Reconstruction	Mariadas Capsran Roshan et.al.	2509.24867	null
2025-09-29	UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections	Zeyu Cai et.al.	2509.24817	null
2025-09-29	TACO-Net: Topological Signatures Triumph in 3D Object Classification	Anirban Ghosh et.al.	2509.24802	null
2025-09-29	SkyLink: Unifying Street-Satellite Geo-Localization via UAV-Mediated 3D Scene Alignment	Hongyang Zhang et.al.	2509.24783	null
2025-10-03	ExGS: Extreme 3D Gaussian Compression with Diffusion Priors	Jiaqi Chen et.al.	2509.24758	null
2025-09-29	NeuralPVS: Learned Estimation of Potentially Visible Sets	Xiangyu Wang et.al.	2509.24677	null
2025-09-29	PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control	Haozhuo Zhang et.al.	2509.24591	null
2025-09-29	BFSM: 3D Bidirectional Face-Skull Morphable Model	Zidu Wang et.al.	2509.24577	null
2025-09-29	CORE-3D: Context-aware Open-vocabulary Retrieval by Embeddings in 3D	Mohamad Amin Mirzaei et.al.	2509.24528	null
2025-09-29	NeoWorld: Neural Simulation of Explorable Virtual Worlds via Progressive 3D Unfolding	Yanpeng Zhao et.al.	2509.24441	null
2025-10-01	Proxy-GS: Efficient 3D Gaussian Splatting via Proxy Mesh	Yuanyuan Gao et.al.	2509.24421	null
2025-09-29	RapidMV: Leveraging Spatio-Angular Representations for Efficient and Consistent Text-to-Multi-View Synthesis	Seungwook Kim et.al.	2509.24410	null
2025-09-29	Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy	Haijier Chen et.al.	2509.24385	null
2025-09-29	DINOReg: Strong Point Cloud Registration with Vision Foundation Model	Congjia Chen et.al.	2509.24370	null
2025-09-29	SONAR: Semantic-Object Navigation with Aggregated Reasoning through a Cross-Modal Inference Paradigm	Yao Wang et.al.	2509.24321	null
2025-09-29	ASIA: Adaptive 3D Segmentation using Few Image Annotations	Sai Raj Kishore Perla et.al.	2509.24288	null
2025-09-29	Robust Partial 3D Point Cloud Registration via Confidence Estimation under Global Context	Yongqiang Wang et.al.	2509.24275	null
2025-09-29	Skeleton-based Robust Registration Framework for Corrupted 3D Point Clouds	Yongqiang Wang et.al.	2509.24273	null
2025-09-29	Cycle Diffusion Model for Counterfactual Image Generation	Fangrui Huang et.al.	2509.24267	null
2025-09-29	Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse-view Videos	Yingdong Hu et.al.	2509.24209	null
2025-09-29	An Efficient 3D Latent Diffusion Model for T1-contrast Enhanced MRI Generation	Zach Eidex et.al.	2509.24194	null
2025-09-29	Tumor Synthesis conditioned on Radiomics	Jonghun Kim et.al.	2509.24182	null
2025-09-29	LatXGen: Towards Radiation-Free and Accurate Quantitative Analysis of Sagittal Spinal Alignment Via Cross-Modal Radiographic View Synthesis	Moxin Zhao et.al.	2509.24165	null
2025-09-29	Neural Visibility of Point Sets	Jun-Hao Wang et.al.	2509.24150	null
2025-09-29	A Novel Model for 3D Motion Planning for a Generalized Dubins Vehicle with Pitch and Yaw Rate Constraints	Deepak Prakash Kumar et.al.	2509.24143	null
2025-09-28	BOSfM: A View Planning Framework for Optimal 3D Reconstruction of Agricultural Scenes	Athanasios Bacharis et.al.	2509.24126	null
2025-09-28	Unified Multi-Modal Interactive & Reactive 3D Motion Generation via Rectified Flow	Prerit Gupta et.al.	2509.24099	null
2025-09-28	WireBend-kit: A Computational Design and Fabrication Toolkit for Wirebending Custom 3D Wireframe Structures	Faraz Faruqi et.al.	2509.24083	null
2025-09-28	SIE3D: Single-image Expressive 3D Avatar generation via Semantic Embedding and Perceptual Expression Loss	Zhiqi Huang et.al.	2509.24004	null
2025-09-28	RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization	Dongki Jung et.al.	2509.23991	null
2025-09-28	CrashSplat: 2D to 3D Vehicle Damage Segmentation in Gaussian Splatting	Dragoş-Andrei Chileban et.al.	2509.23947	null
2025-09-28	AssemblyHands-X: Modeling 3D Hand-Body Coordination for Understanding Bimanual Human Activities	Tatsuro Banno et.al.	2509.23888	null
2025-09-28	Learning Adaptive Pseudo-Label Selection for Semi-Supervised 3D Object Detection	Taehun Kong et.al.	2509.23880	null
2025-09-28	Towards Fine-Grained Text-to-3D Quality Assessment: A Benchmark and A Two-Stage Rank-Learning Metric	Bingyang Cui et.al.	2509.23841	null
2025-09-28	Uni4D-LLM: A Unified SpatioTemporal-Aware VLM for 4D Understanding and Generation	Hanyu Zhou et.al.	2509.23828	null
2025-09-28	Controllable Generation of Large-Scale 3D Urban Layouts with Semantic and Structural Guidance	Mengyuan Niu et.al.	2509.23804	null
2025-09-28	GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State	Guole Shen et.al.	2509.23737	null
2025-09-28	M3DLayout: A Multi-Source Dataset of 3D Indoor Layouts and Structured Descriptions for 3D Generation	Yiheng Zhang et.al.	2509.23728	null
2025-09-28	Diff-3DCap: Shape Captioning with Diffusion Models	Zhenyu Shu et.al.	2509.23718	null
2025-09-28	StrucADT: Generating Structure-controlled 3D Point Clouds with Adjacency Diffusion Transformer	Zhenyu Shu et.al.	2509.23709	null
2025-09-28	MSD-KMamba: Bidirectional Spatial-Aware Multi-Modal 3D Brain Segmentation via Multi-scale Self-Distilled Fusion Strategy	Dayu Tan et.al.	2509.23677	null
2025-09-28	Color-Pair Guided Robust Zero-Shot 6D Pose Estimation and Tracking of Cluttered Objects on Edge Devices	Xingjian Yang et.al.	2509.23647	null
2025-09-28	Sparse-Up: Learnable Sparse Upsampling for 3D Generation with High-Fidelity Textures	Lu Xiao et.al.	2509.23646	null
2025-09-28	BioVessel-Net and RetinaMix: Unsupervised Retinal Vessel Segmentation from OCTA Images	Cheng Huang et.al.	2509.23617	null
2025-09-28	InteractMove: Text-Controlled Human-Object Interaction Generation in 3D Scenes with Movable Objects	Xinhao Cai et.al.	2509.23612	null
2025-09-28	FlowLUT: Efficient Image Enhancement via Differentiable LUTs and Iterative Flow Matching	Liubing Hu et.al.	2509.23608	null
2025-09-28	ZeroScene: A Zero-Shot Framework for 3D Scene Generation from a Single Image and Controllable Texture Editing	Xiang Tang et.al.	2509.23607	null
2025-09-28	Generalizable Coarse-to-Fine Robot Manipulation via Language-Aligned 3D Keypoints	Jianshu Hu et.al.	2509.23575	null
2025-09-28	RAVEN: Resilient Aerial Navigation via Open-Set Semantic Memory and Behavior Adaptation	Seungchan Kim et.al.	2509.23563	null
2025-09-28	From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations	Javed Ahmad et.al.	2509.23555	null
2025-09-28	OVSeg3R: Learn Open-vocabulary Instance Segmentation from 2D via 3D Reconstruction	Hongyang Li et.al.	2509.23541	null
2025-09-27	Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos	Junyi Wu et.al.	2509.23492	null
2025-09-27	3DPCNet: Pose Canonicalization for Robust Viewpoint-Invariant 3D Kinematic Analysis from Monocular RGB cameras	Tharindu Ekanayake et.al.	2509.23455	null
2025-09-30	FM-SIREN & FM-FINER: Nyquist-Informed Frequency Multiplier for Implicit Neural Representation with Periodic Activation	Mohammed Alsakabi et.al.	2509.23438	null
2025-09-27	WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving	Ziyue Zhu et.al.	2509.23402	null
2025-09-27	UniPose: Unified Cross-modality Pose Prior Propagation towards RGB-D data for Weakly Supervised 3D Human Pose Estimation	Jinghong Zheng et.al.	2509.23376	null
2025-09-27	Code Arcades: 3d Visualization of Classes, Dependencies and Software Metrics	Anthony Savidis et.al.	2509.23297	null
2025-09-27	OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting	Atakan Topaloglu et.al.	2509.23258	null
2025-09-27	Unsupervised Online 3D Instance Segmentation with Synthetic Sequences and Dynamic Loss	Yifan Zhang et.al.	2509.23194	null
2025-09-27	Confidence-Calibrating Regularization for Robust Brain MRI Segmentation Under Domain Shift	Behraj Khan et.al.	2509.23176	null
2025-09-27	Sparse2Dense: A Keypoint-driven Generative Framework for Human Video Compression and Vertex Prediction	Bolin Chen et.al.	2509.23169	null
2025-09-27	Open-Vocabulary Spatio-Temporal Scene Graph for Robot Perception and Teleoperation Planning	Yi Wang et.al.	2509.23107	null
2025-09-27	GeLoc3r: Enhancing Relative Camera Pose Regression with Geometric Consistency Regularization	Jingxing Li et.al.	2509.23038	null
2025-09-27	Desensitizing for Improving Corruption Robustness in Point Cloud Classification through Adversarial Training	Zhiqiang Tian et.al.	2509.23010	null
2025-09-27	ARSS: Taming Decoder-only Autoregressive Visual Generation for View Synthesis From Single View	Wenbin Teng et.al.	2509.23008	null
2025-09-30	Learning Unified Representation of 3D Gaussian Splatting	Yuelin Xin et.al.	2509.22917	null
2025-09-26	Convolutional Set Transformer	Federico Chinello et.al.	2509.22889	null
2025-09-26	ControlEvents: Controllable Synthesis of Event Camera Datawith Foundational Prior from Image Diffusion Models	Yixuan Hu et.al.	2509.22864	null
2025-09-26	Empart: Interactive Convex Decomposition for Converting Meshes to Parts	Brandon Vu et.al.	2509.22847	null
2025-09-26	See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation	Chih Yao Hu et.al.	2509.22653	null
2025-09-26	JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation	Shuang Zeng et.al.	2509.22548	null
2025-09-26	EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model	Andrii Litvynchuk et.al.	2509.22527	null
2025-09-26	HELIOS: Hierarchical Exploration for Language-grounded Interaction in Open Scenes	Katrina Ashton et.al.	2509.22498	null
2025-09-26	EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer	Zhehao Dong et.al.	2509.22407	null
2025-09-26	Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss	Javier Sequeiro González et.al.	2509.22394	null
2025-09-26	Johnson-Lindenstrauss Lemma Guided Network for Efficient 3D Medical Segmentation	Jinpeng Lu et.al.	2509.22307	null
2025-09-26	MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning	Jinkun Hao et.al.	2509.22281	null
2025-09-26	GS-2M: Gaussian Splatting for Joint Mesh Reconstruction and Material Decomposition	Dinh Minh Nguyen et.al.	2509.22276	null
2025-09-26	Polysemous Language Gaussian Splatting via Matching-based Mask Lifting	Jiayu Ding et.al.	2509.22225	null
2025-09-26	Rigidity-Aware 3D Gaussian Deformation from a Single Image	Jinhyeok Kim et.al.	2509.22222	null
2025-09-26	MultiMat: Multimodal Program Synthesis for Procedural Materials using Large Multimodal Models	Jonas Belouadi et.al.	2509.22151	null
2025-09-26	Joint graph entropy knowledge distillation for point cloud classification and robustness against corruptions	Zhiqiang Tian et.al.	2509.22150	null
2025-09-26	Large Material Gaussian Model for Relightable 3D Generation	Jingrui Ye et.al.	2509.22112	null
2025-09-26	Comparative Analysis of GAN and Diffusion for MRI-to-CT translation	Emily Honey et.al.	2509.22049	null
2025-09-26	Rate-Distortion Optimized Communication for Collaborative Perception	Genjia Liu et.al.	2509.21994	null
2025-09-29	PartSAM: A Scalable Promptable Part Segmentation Model Trained on Native 3D Data	Zhe Zhu et.al.	2509.21965	null
2025-09-26	TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation	Qihang Wang et.al.	2509.21905	null
2025-09-26	Drag4D: Align Your Motion with Text-Driven 3D Scene Generation	Minjun Kang et.al.	2509.21888	null
2025-09-26	SRHand: Super-Resolving Hand Images and 3D Shapes via View/Pose-aware Neural Image Representations and Explicit 3D Meshes	Minje Kim et.al.	2509.21859	null
2025-09-26	Dynamic Novel View Synthesis in High Dynamic Range	Kaixuan Zhang et.al.	2509.21853	null
2025-09-26	DiTraj: training-free trajectory control for video diffusion transformer	Cheng Lei et.al.	2509.21839	null
2025-09-25	PowerGS: Display-Rendering Power Co-Optimization for Neural Rendering in Power-Constrained XR Systems	Weikai Lin et.al.	2509.21702	null
2025-09-25	MORPH: Shape-agnostic PDE Foundation Models	Mahindra Singh Rautela et.al.	2509.21670	null
2025-09-25	FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction	Yixiang Dai et.al.	2509.21657	null
2025-09-25	QuadGPT: Native Quadrilateral Mesh Generation with Autoregressive Models	Jian Liu et.al.	2509.21420	null
2025-09-23	TUN3D: Towards Real-World Scene Understanding from Unposed Images	Anton Konushin et.al.	2509.21388	null
2025-09-25	Quantized Visual Geometry Grounded Transformer	Weilun Feng et.al.	2509.21302	null
2025-09-25	\LARGE GMP $^{3}$ : Learning-Driven, Bellman-Guided Trajectory Planning for UAVs in Real-Time on SE(3)	Babak Salamat et.al.	2509.21264	null
2025-09-25	Dense Semantic Matching with VGGT Prior	Songlin Yang et.al.	2509.21263	null
2025-09-25	Decipher-MR: A Vision-Language Foundation Model for 3D MRI Representations	Zhijian Yang et.al.	2509.21249	null
2025-09-25	Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets	Team Hunyuan3D et.al.	2509.21245	null
2025-09-25	CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling	Yuze He et.al.	2509.21114	null
2025-09-25	Cross-Modal Instructions for Robot Motion Generation	William Barron et.al.	2509.21107	null
2025-09-25	OmniPlantSeg: Species Agnostic 3D Point Cloud Organ Segmentation for High-Resolution Plant Phenotyping Across Modalities	Andreas Gilson et.al.	2509.21038	null
2025-09-25	Multi-Robot Vision-Based Task and Motion Planning for EV Battery Disassembly and Sorting	Abdelaziz Shaarawy et.al.	2509.21020	null
2025-09-25	Marching Neurons: Accurate Surface Extraction for Neural Implicit Shapes	Christian Stippel et.al.	2509.21007	null
2025-09-25	BactoBot: A Low-Cost, Bacteria-Inspired Soft Underwater Robot for Marine Exploration	Rubaiyat Tasnim Chowdhury et.al.	2509.20964	null
2025-09-25	Finding 3D Positions of Distant Objects from Noisy Camera Movement and Semantic Segmentation Sequences	Julius Pesonen et.al.	2509.20906	null
2025-09-25	ArchGPT: Understanding the World’s Architectures with Large Multimodal Models	Yuze Wang et.al.	2509.20858	null
2025-09-25	ARMesh: Autoregressive Mesh Generation via Next-Level-of-Detail Prediction	Jiabao Lei et.al.	2509.20824	null
2025-09-25	MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM	Yuxuan Zhou et.al.	2509.20757	null
2025-09-25	FreeInsert: Personalized Object Insertion with Geometric and Style Control	Yuhong Zhang et.al.	2509.20756	null
2025-09-26	SeamCrafter: Enhancing Mesh Seam Generation for Artist UV Unwrapping via Reinforcement Learning	Duoteng Xu et.al.	2509.20725	null
2025-09-24	Reflect3r: Single-View 3D Stereo Reconstruction Aided by Mirror Reflections	Jing Wu et.al.	2509.20607	null
2025-09-24	Large Pre-Trained Models for Bimanual Manipulation in 3D	Hanna Yurchyk et.al.	2509.20579	null
2025-09-24	MELEGROS: Monolithic Elephant-inspired Gripper with Optical Sensors	Petr Trunin et.al.	2509.20510	null
2025-09-24	SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent	Yandan Yang et.al.	2509.20414	null
2025-09-23	SGAligner++: Cross-Modal Language-Aided 3D Scene Graph Alignment	Binod Singh et.al.	2509.20401	null
2025-09-23	SeHDR: Single-Exposure HDR Novel View Synthesis via 3D Gaussian Bracketing	Yiyu Li et.al.	2509.20400	null
2025-09-24	PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation	Chen Wang et.al.	2509.20358	null
2025-09-26	mindmap: Spatial Memory in Deep Feature Maps for 3D Action Policies	Remo Steiner et.al.	2509.20297	null
2025-09-24	4D Driving Scene Generation With Stereo Forcing	Hao Lu et.al.	2509.20251	null
2025-09-24	An Anisotropic Cross-View Texture Transfer with Multi-Reference Non-Local Attention for CT Slice Interpolation	Kwang-Hyun Uhm et.al.	2509.20242	null
2025-09-24	PU-Gaussian: Point Cloud Upsampling using 3D Gaussian Representation	Mahmoud Khater et.al.	2509.20207	null
2025-09-24	C-3TO: Continuous 3D Trajectory Optimization on Neural Euclidean Signed Distance Fields	Guillermo Gil et.al.	2509.20084	null
2025-09-24	DB-TSDF: Directional Bitmask-based Truncated Signed Distance Fields for Efficient Volumetric Mapping	Jose E. Maese et.al.	2509.20081	null
2025-09-24	Queryable 3D Scene Representation: A Multi-Modal Framework for Semantic Reasoning and Robotic Task Planning	Xun Li et.al.	2509.20077	null
2025-09-25	OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving	Pei Liu et.al.	2509.19973	null
2025-09-24	Generalist Robot Manipulation beyond Action Labeled Data	Alexander Spiridonov et.al.	2509.19958	null
2025-09-24	AJAHR: Amputated Joint Aware 3D Human Mesh Recovery	Hyunjin Cho et.al.	2509.19939	null
2025-09-24	GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes	Guo Chen et.al.	2509.19937	null
2025-09-24	Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering	Jiangxue Yu et.al.	2509.19898	null
2025-09-24	Generalized Shortest Path-based Superpixels for 3D Spherical Image Segmentation	Rémi Giraud et.al.	2509.19895	null
2025-09-25	StrCGAN: A Generative Framework for Stellar Image Restoration	Shantanusinh Parmar et.al.	2509.19805	null
2025-09-24	BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting	Yixun Zhang et.al.	2509.19793	null
2025-09-24	PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction	Yufei Han et.al.	2509.19726	null
2025-09-24	VIMD: Monocular Visual-Inertial Motion and Depth Estimation	Saimouli Katragadda et.al.	2509.19713	null
2025-09-23	The Impact of 2D Segmentation Backbones on Point Cloud Predictions Using 4D Radar	William L. Muckelroy III et.al.	2509.19644	null
2025-09-23	Terra: Hierarchical Terrain-Aware 3D Scene Graph for Task-Agnostic Outdoor Mapping	Chad R. Samuelson et.al.	2509.19579	null
2025-09-23	Autonomous Elemental Characterization Enabled by a Low Cost Robotic Platform Built Upon a Generalized Software Architecture	Xuan Cao et.al.	2509.19541	null
2025-09-23	Real-Time Reinforcement Learning for Dynamic Tasks with a Parallel Soft Robot	James Avtges et.al.	2509.19525	null
2025-09-23	VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction	Weijie Wang et.al.	2509.19297	null
2025-09-23	Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation	Sherwin Bahmani et.al.	2509.19296	null
2025-09-24	MOIS-SAM2: Exemplar-based Segment Anything Model 2 for multilesion interactive segmentation of neurofibromas in whole-body MRI	Georgii Kolokolnikov et.al.	2509.19277	null
2025-09-23	Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps	Gabriel Maldonado et.al.	2509.19252	null
2025-09-23	HyKid: An Open MRI Dataset with Expert-Annotated Multi-Structure and Choroid Plexus in Pediatric Hydrocephalus	Yunzhi Xu et.al.	2509.19218	null
2025-09-23	SlicerROS2: A Research and Development Module for Image-Guided Robotic Interventions	Laura Connolly et.al.	2509.19076	null
2025-09-23	WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction	Hung Nguyen et.al.	2509.19073	null
2025-09-24	Seeing Through Reflections: Advancing 3D Scene Reconstruction in Mirror-Containing Environments with Gaussian Splatting	Zijing Guo et.al.	2509.18956	null
2025-09-23	Eva-VLA: Evaluating Vision-Language-Action Models’ Robustness Under Real-World Physical Variations	Hanqing Liu et.al.	2509.18953	null
2025-09-23	Lang2Morph: Language-Driven Morphological Design of Robotic Hands	Yanyuan Qiao et.al.	2509.18937	null
2025-09-23	SynapFlow: A Modular Framework Towards Large-Scale Analysis of Dendritic Spines	Pamela Osuna-Vargas et.al.	2509.18926	null
2025-09-23	LiDAR Point Cloud Image-based Generation Using Denoising Diffusion Probabilistic Models	Amirhesam Aghanouri et.al.	2509.18917	null
2025-09-23	DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring	Pengteng Li et.al.	2509.18898	null
2025-09-23	RS3DBench: A Comprehensive Benchmark for 3D Spatial Perception in Remote Sensing	Jiayu Wang et.al.	2509.18897	null
2025-09-23	VGGT-DP: Generalizable Robot Control via Vision Foundation Models	Shijia Ge et.al.	2509.18778	null
2025-09-23	FixingGS: Enhancing 3D Gaussian Splatting via Training-Free Score Distillation	Zhaorui Wang et.al.	2509.18759	null
2025-09-23	3D Flow Diffusion Policy: Visuomotor Policy Learning via Generating Flow in 3D Space	Sangjun Noh et.al.	2509.18676	null
2025-09-23	MLF-4DRCNet: Multi-Level Fusion with 4D Radar and Camera for 3D Object Detection in Autonomous Driving	Yuzhi Wu et.al.	2509.18613	null
2025-09-23	End-to-End Crop Row Navigation via LiDAR-Based Deep Reinforcement Learning	Ana Luiza Mineiro et.al.	2509.18608	null
2025-09-23	Event-guided 3D Gaussian Splatting for Dynamic Human and Scene Reconstruction	Xiaoting Yin et.al.	2509.18566	null
2025-09-23	GeoRemover: Removing Objects and Their Causal Visual Artifacts	Zixin Zhu et.al.	2509.18538	null
2025-09-23	BridgeSplat: Bidirectionally Coupled CT and Non-Rigid Gaussian Splatting for Deformable Intraoperative Surgical Navigation	Maximilian Fehrentz et.al.	2509.18501	null
2025-09-22	CPT-4DMR: Continuous sPatial-Temporal Representation for 4D-MRI Reconstruction	Xinyang Wu et.al.	2509.18427	null
2025-09-22	TinyBEV: Cross Modal Knowledge Distillation for Efficient Multi Task Bird’s Eye View Perception and Planning	Reeshad Khan et.al.	2509.18372	null
2025-09-22	OrthoLoC: UAV 6-DoF Localization and Calibration Using Orthographic Geodata	Oussema Dhaouadi et.al.	2509.18350	null
2025-09-22	The Landform Contextual Mesh: Automatically Fusing Surface and Orbital Terrain for Mars 2020	Marsette Vona et.al.	2509.18330	null
2025-09-24	Rethinking Pulmonary Embolism Segmentation: A Study of Current Approaches and Challenges with an Open Weight Model	Yixin Zhang et.al.	2509.18308	null
2025-09-22	PEEK: Guiding and Minimal Image Representations for Zero-Shot Generalization of Robot Manipulation Policies	Jesse Zhang et.al.	2509.18282	null
2025-09-22	VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models	Geonung Kim et.al.	2509.17985	null
2025-09-22	Multi-needle Localization for Pelvic Seed Implant Brachytherapy based on Tip-handle Detection and Matching	Zhuo Xiao et.al.	2509.17931	null
2025-09-22	ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos	Shi Chen et.al.	2509.17864	null
2025-09-22	Selecting Optimal Camera Views for Gait Analysis: A Multi-Metric Assessment of 2D Projections	Dong Chen et.al.	2509.17805	null
2025-09-22	Effect of Appearance and Animation Realism on the Perception of Emotionally Expressive Virtual Humans	Nabila Amadou et.al.	2509.17803	null
2025-09-23	From Restoration to Reconstruction: Rethinking 3D Gaussian Splatting for Underwater Scenes	Guoxi Huang et.al.	2509.17789	null
2025-09-23	RoboSeek: You Need to Interact with Your Objects	Yibo Peng et.al.	2509.17783	null
2025-09-22	Automated Labeling of Intracranial Arteries with Uncertainty Quantification Using Deep Learning	Javier Bisbal et.al.	2509.17726	null
2025-09-22	RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion	Geonho Bang et.al.	2509.17712	null
2025-09-22	SD-VLM: Spatial Measuring and Understanding with Depth-Encoded Vision-Language Models	Pingyi Chen et.al.	2509.17664	null
2025-09-22	Evict3R: Training-Free Token Eviction for Memory-Bounded Streaming Visual Geometry Transformers	Soroush Mahdi et.al.	2509.17650	null
2025-09-22	VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video	Yu Liu et.al.	2509.17647	null
2025-09-22	MRN: Harnessing 2D Vision Foundation Models for Diagnosing Parkinson’s Disease with Limited 3D MR Data	Ding Shaodong et.al.	2509.17566	null
2025-09-22	Unified Multimodal Coherent Field: Synchronous Semantic-Spatial-Vision Fusion for Brain Tumor Segmentation	Mingda Zhang et.al.	2509.17520	null
2025-09-22	Stable Video-Driven Portraits	Mallikarjun B. R. et.al.	2509.17476	null
2025-09-22	MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception	Changwon Kang et.al.	2509.17462	null
2025-09-23	Hierarchical Neural Semantic Representation for 3D Semantic Correspondence	Keyu Du et.al.	2509.17431	null
2025-09-23	EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device	Gunjan Chhablani et.al.	2509.17430	null
2025-09-22	FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR	Junzhe Wu et.al.	2509.17390	null
2025-09-22	3D Printable Soft Liquid Metal Sensors for Delicate Manipulation Tasks	Lois Liow et.al.	2509.17389	null
2025-09-22	AERO-MPPI: Anchor-Guided Ensemble Trajectory Optimization for Agile Mapless Drone Navigation	Xin Chen et.al.	2509.17340	null
2025-09-22	SmokeSeer: 3D Gaussian Splatting for Smoke Removal and Scene Reconstruction	Neham Jain et.al.	2509.17329	null
2025-09-21	Task-Oriented Communications for 3D Scene Representation: Balancing Timeliness and Fidelity	Xiangmin Xu et.al.	2509.17282	null
2025-09-21	Learning and Optimization with 3D Orientations	Alexandros Ntagkas et.al.	2509.17274	null
2025-09-21	SPFSplatV2: Efficient Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views	Ranran Huang et.al.	2509.17246	null
2025-09-23	DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction	Bo Liu et.al.	2509.17232	null
2025-09-21	High Resolution UDF Meshing via Iterative Networks	Federico Stella et.al.	2509.17212	null
2025-09-21	Point-RTD: Replaced Token Denoising for Pretraining Transformer Models on Point Clouds	Gunner Stone et.al.	2509.17207	null
2025-09-21	Guided and Unguided Conditional Diffusion Mechanisms for Structured and Semantically-Aware 3D Point Cloud Generation	Gunner Stone et.al.	2509.17206	null
2025-09-21	Certifiably Optimal Doppler Positioning using Opportunistic LEO Satellites	Baoshan Song et.al.	2509.17198	null
2025-09-21	Beat on Gaze: Learning Stylized Generation of Gaze and Head Dynamics	Chengwei Shi et.al.	2509.17168	null
2025-09-21	Imagine2Act: Leveraging Object-Action Motion Consistency from Imagined Goals for Robotic Manipulation	Liang Heng et.al.	2509.17125	null
2025-09-21	CoBEVMoE: Heterogeneity-aware Feature Fusion with Dynamic Mixture-of-Experts for Collaborative Perception	Lingzhao Kong et.al.	2509.17107	null
2025-09-23	HyRF: Hybrid Radiance Fields for Memory-efficient and High-quality Novel View Synthesis	Zipeng Wang et.al.	2509.17083	null
2025-09-21	Efficient 3D Scene Reconstruction and Simulation from Sparse Endoscopic Views	Zhenya Yang et.al.	2509.17027	null
2025-09-21	SemanticGarment: Semantic-Controlled Generation and Editing of 3D Gaussian Garments	Ruiyan Wang et.al.	2509.16960	null
2025-09-21	Leveraging RGB Images for Pre-Training of Event-Based Hand Pose Estimation	Ruicong Liu et.al.	2509.16949	null
2025-09-21	ConfidentSplat: Confidence-Weighted Depth Fusion for Accurate 3D Gaussian Splatting SLAM	Amanuel T. Dufera et.al.	2509.16863	null
2025-09-23	L2M-Reg: Building-level Uncertainty-aware Registration of Outdoor LiDAR Point Clouds and Semantic 3D City Models	Ziyang Xu et.al.	2509.16832	null
2025-09-20	SMART-3D: Three-Dimensional Self-Morphing Adaptive Replanning Tree	Priyanshu Agrawal et.al.	2509.16812	null
2025-09-20	MedGS: Gaussian Splatting for Multi-Modal 3D Medical Imaging	Kacper Marzol et.al.	2509.16806	null
2025-09-20	MMPart: Harnessing Multi-Modal Large Language Models for Part-Aware 3D Generation	Omid Bonakdar et.al.	2509.16768	null
2025-09-20	HyPlaneHead: Rethinking Tri-plane-like Representations in Full-Head Image Synthesis	Heyuan Li et.al.	2509.16748	null
2025-09-23	Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment	Xin Lei Lin et.al.	2509.16727	null
2025-09-20	Text-Scene: A Scene-to-Language Parsing Framework for 3D Scene Understanding	Haoyuan Li et.al.	2509.16721	null
2025-09-20	SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving	Haiming Zhang et.al.	2509.16588	null
2025-09-20	Person Identification from Egocentric Human-Object Interactions using 3D Hand Pose	Muhammad Hamza et.al.	2509.16557	null
2025-09-20	ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting	Xiaoyang Yan et.al.	2509.16552	null
2025-09-20	No Need for Real 3D: Fusing 2D Vision with Pseudo 3D Representations for Robotic Manipulation Learning	Run Yu et.al.	2509.16532	null
2025-09-20	RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation	Tianyi Yan et.al.	2509.16500	null
2025-09-20	Octree Latent Diffusion for Semantic 3D Scene Generation and Completion	Xujia Zhang et.al.	2509.16483	null
2025-09-19	Explainable Gait Abnormality Detection Using Dual-Dataset CNN-LSTM Models	Parth Agarwal et.al.	2509.16472	null
2025-09-19	TractoTransformer: Diffusion MRI Streamline Tractography using CNN and Transformer Networks	Itzik Waizman et.al.	2509.16429	null
2025-09-23	3D Gaussian Flats: Hybrid 2D/3D Photometric Scene Reconstruction	Maria Taktasheva et.al.	2509.16423	null
2025-09-19	StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes	Zhengri Wu et.al.	2509.16415	null
2025-09-19	From Canopy to Ground via ForestGen3D: Learning Cross-Domain Generation of 3D Forest Structure from Aerial-to-Terrestrial LiDAR	Juan Castorena et.al.	2509.16346	null
2025-09-19	Neural Atlas Graphs for Dynamic Scene Decomposition and Editing	Jan Philipp Schneider et.al.	2509.16336	null
2025-09-19	Recovering Parametric Scenes from Very Few Time-of-Flight Pixels	Carter Sifferman et.al.	2509.16132	null
2025-09-19	RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars	Weiyi Xiong et.al.	2509.16119	null
2025-09-19	SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features	Jinyuan Qu et.al.	2509.16098	null
2025-09-19	DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation	Yue Su et.al.	2509.16063	null
2025-09-19	Graph-based Point Cloud Surface Reconstruction using B-Splines	Stuti Pathak et.al.	2509.16050	null
2025-09-19	Towards Sharper Object Boundaries in Self-Supervised Depth Estimation	Aurélien Cecille et.al.	2509.15987	null
2025-09-19	The Missing Piece: A Case for Pre-Training in 3D Medical Object Detection	Katharina Eckstein et.al.	2509.15947	null
2025-09-19	PAN: Pillars-Attention-Based Network for 3D Object Detection	Ruan Bispo et.al.	2509.15935	null
2025-09-19	Sparse Multiview Open-Vocabulary 3D Detection	Olivier Moliner et.al.	2509.15924	null
2025-09-19	A CARLA-based Simulation of Electrically Driven Forklifts	David Claus et.al.	2509.15909	null
2025-09-19	MoAngelo: Motion-Aware Neural Surface Reconstruction for Dynamic Scenes	Mohamed Ebbed et.al.	2509.15892	null
2025-09-19	RangeSAM: Leveraging Visual Foundation Models for Range-View repesented LiDAR segmentation	Paul Julius Kühn et.al.	2509.15886	null
2025-09-19	Self-Supervised Cross-Modal Learning for Image-to-Point Cloud Registration	Xingmei Wang et.al.	2509.15882	null
2025-09-19	Improving Robotic Manipulation with Efficient Geometry-Aware Vision Encoder	An Dinh Vuong et.al.	2509.15880	null
2025-09-19	ENSAM: an efficient foundation model for interactive segmentation of 3D medical images	Elias Stenhede et.al.	2509.15874	null
2025-09-19	Zero-Shot Visual Grounding in 3D Gaussians via View Retrieval	Liwei Liao et.al.	2509.15871	null
2025-09-19	Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation	Weimin Bai et.al.	2509.15772	null
2025-09-19	GP3: A 3D Geometry-Aware Policy with Multi-View Images for Robotic Manipulation	Quanhao Qian et.al.	2509.15733	null
2025-09-19	SGMAGNet: A Baseline Model for 3D Cloud Phase Structure Reconstruction on a New Passive Active Satellite Benchmark	Chi Yang et.al.	2509.15706	null
2025-09-19	SCENEFORGE: Enhancing 3D-text alignment with Structured Scene Compositions	Cristian Sbrolli et.al.	2509.15693	null
2025-09-19	Camera Splatting for Continuous View Optimization	Gahye Lee et.al.	2509.15677	null
2025-09-22	FingerSplat: Contactless Fingerprint 3D Reconstruction and Generation based on 3D Gaussian Splatting	Yuwei Jia et.al.	2509.15648	null
2025-09-19	GS-Scale: Unlocking Large-Scale 3D Gaussian Splatting Training via Host Offloading	Donghyun Lee et.al.	2509.15645	null
2025-09-19	Implicit Modeling for 3D-printed Multi-material Computational Object Design via Python	Charles Wade et.al.	2509.15562	null
2025-09-22	MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild	Deming Li et.al.	2509.15548	null
2025-09-19	STARC: See-Through-Wall Augmented Reality Framework for Human-Robot Collaboration in Emergency Response	Shenghai Yuan et.al.	2509.15507	null
2025-09-18	GiAnt: A Bio-Inspired Hexapod for Adaptive Terrain Navigation and Object Detection	Aasfee Mosharraf Bhuiyan et.al.	2509.15264	null
2025-09-18	Causal Reasoning Elicits Controllable 3D Scene Generation	Shen Chen et.al.	2509.15249	null
2025-09-17	GenCAD-3D: CAD Program Generation using Multimodal Latent Space Alignment and Synthetic Dataset Balancing	Nomi Yu et.al.	2509.15246	null
2025-09-17	ProFusion: 3D Reconstruction of Protein Complex Structures from Multi-view AFM Images	Jaydeep Rade et.al.	2509.15242	null
2025-09-17	ChannelFlow-Tools: A Standardized Dataset Creation Pipeline for 3D Obstructed Channel Flows	Shubham Kavane et.al.	2509.15236	null
2025-09-18	Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model	Fangjinhua Wang et.al.	2509.15220	null
2025-09-18	Semi-Supervised 3D Medical Segmentation from 2D Natural Images Pretrained Model	Pak-Hei Yeung et.al.	2509.15167	null
2025-09-19	RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes	Fang Li et.al.	2509.15123	null
2025-09-18	Semantic-LiDAR-Inertial-Wheel Odometry Fusion for Robust Localization in Large-Scale Dynamic Environments	Haoxuan Jiang et.al.	2509.14999	null
2025-09-19	SPATIALGEN: Layout-guided 3D Indoor Scene Generation	Chuan Fang et.al.	2509.14981	null
2025-09-18	Beyond Random Masking: A Dual-Stream Approach for Rotation-Invariant Point Cloud Masked Autoencoders	Xuanhua Yin et.al.	2509.14975	null
2025-09-18	RoboEye: Enhancing 2D Robotic Object Identification with Selective 3D Geometric Keypoint Matching	Xingwu Zhang et.al.	2509.14966	null
2025-09-21	Seeing 3D Through 2D Lenses: 3D Few-Shot Class-Incremental Learning via Cross-Modal Geometric Rectification	Tuo Xiang et.al.	2509.14958	null
2025-09-18	Human Interaction for Collaborative Semantic SLAM using Extended Reality	Laura Ribeiro et.al.	2509.14949	null
2025-09-18	NeRF-based Visualization of 3D Cues Supporting Data-Driven Spacecraft Pose Estimation	Antoine Legrand et.al.	2509.14890	null
2025-09-18	Radiology Report Conditional 3D CT Generation with Multi Encoder Latent diffusion Model	Sina Amirrajab et.al.	2509.14780	null
2025-09-18	FMGS-Avatar: Mesh-Guided 2D Gaussian Splatting with Foundation Model Priors for 3D Monocular Avatar Reconstruction	Jinlong Fan et.al.	2509.14739	null
2025-09-18	RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI	Cong Tai et.al.	2509.14687	null
2025-09-18	Efficient 3D Perception on Embedded Systems via Interpolation-Free Tri-Plane Lifting and Volume Fusion	Sibaek Lee et.al.	2509.14641	null
2025-09-18	HybridMamba: A Dual-domain Mamba for 3D Medical Image Segmentation	Weitong Wu et.al.	2509.14609	null
2025-09-19	AToken: A Unified Tokenizer for Vision	Jiasen Lu et.al.	2509.14476	null
2025-09-17	Perception-Integrated Safety Critical Control via Analytic Collision Cone Barrier Functions on 3D Gaussian Splatting	Dario Tscholl et.al.	2509.14421	null
2025-09-17	Investigating the Ways in Which Mobile Phone Images with Open-Source Data Can Be Used to Create an Augmented Virtual Environment (AVE)	Russell Beale et.al.	2509.14374	null
2025-09-17	MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping	Zhihao Cao et.al.	2509.14191	null
2025-09-17	BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection	Rongyu Zhang et.al.	2509.14151	null
2025-09-17	GeoAware-VLA: Implicit Geometry Aware Vision-Language-Action Model	Ali Abouzeid et.al.	2509.14117	null
2025-09-17	Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction	Yifan Mo et.al.	2509.13938	null
2025-09-17	White Aggregation and Restoration for Few-shot 3D Point Cloud Semantic Segmentation	Jiyun Im et.al.	2509.13907	null
2025-09-17	EvHand-FPV: Efficient Event-Based 3D Hand Tracking from First-Person View	Zhen Xu et.al.	2509.13883	null
2025-09-17	Consistent View Alignment Improves Foundation Models for 3D Medical Image Segmentation	Puru Vaish et.al.	2509.13846	null
2025-09-17	HGACNet: Hierarchical Graph Attention Network for Cross-Modal Point Cloud Completion	Yadan Zeng et.al.	2509.13692	null
2025-09-17	CraftMesh: High-Fidelity Generative Mesh Manipulation via Poisson Seamless Fusion	James Jincheng et.al.	2509.13688	null
2025-09-17	Gaussian Alignment for Relative Camera Pose Estimation via Single-View Reconstruction	Yumin Li et.al.	2509.13652	null
2025-09-17	SAMIR, an efficient registration framework via robust feature learning from SAM	Yue He et.al.	2509.13629	null
2025-09-17	Rest2Visual: Predicting Visually Evoked fMRI from Resting-State Scans	Chuyang Zhou et.al.	2509.13612	null
2025-09-17	A Generalization of CLAP from 3D Localization to Image Processing, A Connection With RANSAC & Hough Transforms	Ruochen Hou et.al.	2509.13605	null
2025-09-16	Object Pose Estimation through Dexterous Touch	Amir-Hossein Shahidzadeh et.al.	2509.13591	null
2025-09-16	Semantic 3D Reconstructions with SLAM for Central Airway Obstruction	Ayberk Acar et.al.	2509.13541	null
2025-09-16	MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM	Yinlong Bai et.al.	2509.13536	null
2025-09-16	ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors	Romain Hardy et.al.	2509.13525	null
2025-09-16	Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization	Hao Xu et.al.	2509.13482	null
2025-09-16	Semantic-Enhanced Cross-Modal Place Recognition for Robust Robot Localization	Yujia Lin et.al.	2509.13474	null
2025-09-18	MapAnything: Universal Feed-Forward Metric 3D Reconstruction	Nikhil Keetha et.al.	2509.13414	null
2025-09-16	Generative AI Pipeline for Interactive Prompt-driven 2D-to-3D Vascular Reconstruction for Fontan Geometries from Contrast-Enhanced X-Ray Fluoroscopy Imaging	Prahlad G Menon et.al.	2509.13372	null
2025-09-15	3D Reconstruction of Coronary Vessel Trees from Biplanar X-Ray Images Using a Geometric Approach	Ethan Koland et.al.	2509.13358	null
2025-09-13	Label-Efficient Grasp Joint Prediction with Point-JEPA	Jed Guzelkabaagac et.al.	2509.13349	null
2025-09-16	3D Aware Region Prompted Vision Language Model	An-Chieh Cheng et.al.	2509.13317	null
2025-09-16	Temporally Smooth Mesh Extraction for Procedural Scenes with Long-Range Camera Trajectories using Spacetime Octrees	Zeyu Ma et.al.	2509.13306	null
2025-09-17	StyleSculptor: Zero-Shot Style-Controllable 3D Asset Generation with Texture-Geometry Dual Guidance	Zefan Qu et.al.	2509.13301	null
2025-09-16	More performant and scalable: Rethinking contrastive vision-language pre-training of radiology in the LLM era	Yingtai Li et.al.	2509.13175	null
2025-09-16	Enhancing Dual Network Based Semi-Supervised Medical Image Segmentation with Uncertainty-Guided Pseudo-Labeling	Yunyao Lu et.al.	2509.13084	null
2025-09-16	DVDP: An End-to-End Policy for Mobile Robot Visual Docking with RGB-D Perception	Haohan Min et.al.	2509.13024	null
2025-09-16	Dream3DAvatar: Text-Controlled 3D Avatar Reconstruction from a Single Image	Gaofeng Liu et.al.	2509.13013	null
2025-09-16	Improving Accuracy and Efficiency of Implicit Neural Representations: Making SIREN a WINNER	Hemanth Chandravamsi et.al.	2509.12980	null
2025-09-16	Beyond Averages: Open-Vocabulary 3D Scene Understanding with Gaussian Splatting and Bag of Embeddings	Abdalla Arafa et.al.	2509.12938	null
2025-09-16	4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar	Xiao Tang et.al.	2509.12931	null
2025-09-16	Few to Big: Prototype Expansion Network via Diffusion Learner for Point Cloud Few-shot Semantic Segmentation	Qianguang Zhao et.al.	2509.12878	null
2025-09-16	Exploring Metric Fusion for Evaluation of NeRFs	Shreyas Shivakumara et.al.	2509.12836	null
2025-09-16	Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation	Biwen Lei et.al.	2509.12815	null
2025-09-16	SPGen: Spherical Projection as Consistent and Flexible Representation for Single Image 3D Shape Generation	Jingdong Zhang et.al.	2509.12721	null
2025-09-16	DisorientLiDAR: Physical Attacks on LiDAR-based Localization	Yizhen Lao et.al.	2509.12595	null
2025-09-15	DinoAtten3D: Slice-Level Attention Aggregation of DinoV2 for 3D Brain MRI Anomaly Classification	Fazle Rafsani et.al.	2509.12512	null
2025-09-15	Axis-Aligned 3D Stalk Diameter Estimation from RGB-D Imagery	Benjamin Vail et.al.	2509.12511	null
2025-09-15	Artist-Created Mesh Generation from Raw Observation	Yao He et.al.	2509.12501	null
2025-09-15	Towards Foundational Models for Single-Chip Radar	Tianshu Huang et.al.	2509.12482	null
2025-09-15	Neural 3D Object Reconstruction with Small-Scale Unmanned Aerial Vehicles	Àlmos Veres-Vitàlyos et.al.	2509.12458	null
2025-09-15	Deep learning for 3D point cloud processing – from approaches, tasks to its implications on urban and environmental applications	Zhenxin Zhang et.al.	2509.12452	null
2025-09-15	DYNAMO: Dependency-Aware Deep Learning Framework for Articulated Assembly Motion Prediction	Mayank Patel et.al.	2509.12430	null
2025-09-15	An integrated process for design and control of lunar robotics using AI and simulation	Daniel Lindmark et.al.	2509.12367	null
2025-09-15	3D Human Pose and Shape Estimation from LiDAR Point Clouds: A Review	Salma Galaaoui et.al.	2509.12197	null
2025-09-15	HoloGarment: 360° Novel View Synthesis of In-the-Wild Garments	Johanna Karras et.al.	2509.12187	null
2025-09-15	LoRA-fine-tuned Large Vision Models for Automated Assessment of Post-SBRT Lung Injury	M. Bolhassani et.al.	2509.12155	null
2025-09-15	3DViT-GAT: A Unified Atlas-Based 3D Vision Transformer and Graph Learning Framework for Major Depressive Disorder Detection Using Structural MRI Data	Nojod M. Alotaibi et.al.	2509.12143	null
2025-09-15	End-to-End 4D Heart Mesh Recovery Across Full-Stack and Sparse Cardiac MRI	Yihong Chen et.al.	2509.12090	null
2025-09-15	Progressive Flow-inspired Unfolding for Spectral Compressive Imaging	Xiaodong Wang et.al.	2509.12079	null
2025-09-15	U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT	Zhi Qin Tan et.al.	2509.12069	null
2025-09-15	End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data	Farahdiba Zarin et.al.	2509.12068	null
2025-09-15	Robust Fetal Pose Estimation across Gestational Ages via Cross-Population Augmentation	Sebastian Diaz et.al.	2509.12062	null
2025-09-15	E2-BKI: Evidential Ellipsoidal Bayesian Kernel Inference for Uncertainty-aware Gaussian Semantic Mapping	Junyoung Kim et.al.	2509.11964	null
2025-09-15	Learning to Generate 4D LiDAR Sequences	Ao Liang et.al.	2509.11959	null
2025-09-16	Enriched text-guided variational multimodal knowledge distillation network (VMD) for automated diagnosis of plaque vulnerability in 3D carotid artery MRI	Bo Cao et.al.	2509.11924	null
2025-09-15	Integrating Prior Observations for Incremental 3D Scene Graph Prediction	Marian Renz et.al.	2509.11895	null
2025-09-15	BREA-Depth: Bronchoscopy Realistic Airway-geometric Depth Estimation	Francis Xiatian Zhang et.al.	2509.11885	null
2025-09-15	Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting	Yi-Hsin Li et.al.	2509.11853	null
2025-09-16	MSMA: Multi-Scale Feature Fusion For Multi-Attribute 3D Face Reconstruction From Unconstrained Images	Danling Cao et.al.	2509.11763	null
2025-09-15	ParaEQsA: Parallel and Asynchronous Embodied Questions Scheduling and Answering	Haisheng Wang et.al.	2509.11663	null
2025-09-15	A Controllable 3D Deepfake Generation Framework with Gaussian Splatting	Wending Liu et.al.	2509.11624	null
2025-09-15	Inference-stage Adaptation-projection Strategy Adapts Diffusion Policy to Cross-manipulators Scenarios	Xiangtong Yao et.al.	2509.11621	null
2025-09-15	Gaussian-Plus-SDF SLAM: High-fidelity 3D Reconstruction at 150+ fps	Zhexi Peng et.al.	2509.11574	null
2025-09-14	Beyond Frame-wise Tracking: A Trajectory-based Paradigm for Efficient Point Cloud Tracking	BaiChen Fan et.al.	2509.11453	null
2025-09-14	MultiMAE for Brain MRIs: Robustness to Missing Inputs Using Multi-Modal Masked Autoencoder	Ayhan Can Erdur et.al.	2509.11442	null
2025-09-14	On the Skinning of Gaussian Avatars	Nikolaos Zioulis et.al.	2509.11411	null
2025-09-14	3De Interactive Lenses for Visualization in Virtual Environments	Roberta C. R. Mota et.al.	2509.11410	null
2025-09-14	3D Gaussian Modeling and Ray Marching of OpenVDB datasets for Scientific Visualization	Isha Sharma et.al.	2509.11377	null
2025-09-14	ROSGS: Relightable Outdoor Scenes With Gaussian Splatting	Lianjun Liao et.al.	2509.11275	null
2025-09-14	Scaling Up Forest Vision with Synthetic Data	Yihang She et.al.	2509.11201	null
2025-09-14	SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion	Zhiwen Yang et.al.	2509.11171	null
2025-09-14	Multispectral-NeRF:a multispectral modeling approach based on neural radiance fields	Hong Zhang et.al.	2509.11169	null
2025-09-14	No Mesh, No Problem: Estimating Coral Volume and Surface from Sparse Multi-View Images	Diego Eustachio Farchione et.al.	2509.11164	null
2025-09-14	ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations	Zheng Li et.al.	2509.11125	null
2025-09-14	SVR-GS: Spatially Variant Regularization for Probabilistic Masks in 3D Gaussian Splatting	Ashkan Taghipour et.al.	2509.11116	null
2025-09-14	WildSmoke: Ready-to-Use Dynamic 3D Smoke Assets from a Single Video in the Wild	Yuqiu Liu et.al.	2509.11114	null
2025-09-14	3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment	Nhut Le et.al.	2509.11097	null
2025-09-14	SH-SAS: An Implicit Neural Representation for Complex Spherical-Harmonic Scattering Fields for 3D Synthetic Aperture Sonar	Omkar Shailendra Vengurlekar et.al.	2509.11087	null
2025-09-13	AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting	Gurutva Patle et.al.	2509.11003	null
2025-09-13	Nav-R1: Reasoning and Navigation in Embodied Scenes	Qingxiang Liu et.al.	2509.10884	null
2025-09-13	OpenUrban3D: Annotation-Free Open-Vocabulary Semantic Segmentation of Large-Scale Urban Point Clouds	Chongyu Wang et.al.	2509.10842	null
2025-09-13	Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios	Simone Mosco et.al.	2509.10841	null
2025-09-13	InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts	Weipeng Zhong et.al.	2509.10813	null
2025-09-12	Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation	Hao Zhang et.al.	2509.10687	null
2025-09-12	A Comparison and Evaluation of Fine-tuned Convolutional Neural Networks to Large Language Models for Image Classification and Segmentation of Brain Tumors on MRI	Felicia Liu et.al.	2509.10683	null
2025-09-12	T2Bs: Text-to-Character Blendshapes via Video Generation	Jiahao Luo et.al.	2509.10678	null
2025-09-12	Building a General SimCLR Self-Supervised Foundation Model Across Neurological Diseases to Advance 3D Brain MRI Diagnoses	Emily Kaczmarek et.al.	2509.10620	null
2025-09-12	SSL-AD: Spatiotemporal Self-Supervised Learning for Generalizability and Adaptability Across Alzheimer’s Prediction Tasks and Datasets	Emily Kaczmarek et.al.	2509.10453	null
2025-09-12	MCL-AD: Multimodal Collaboration Learning for Zero-Shot 3D Anomaly Detection	Gang Li et.al.	2509.10282	null
2025-09-12	Robustness and Diagnostic Performance of Super-Resolution Fetal Brain MRI	Ema Masterl et.al.	2509.10257	null
2025-09-15	On the Geometric Accuracy of Implicit and Primitive-based Representations Derived from View Rendering Constraints	Elias De Smijter et.al.	2509.10241	null
2025-09-12	Leveraging Multi-View Weak Supervision for Occlusion-Aware Multi-Human Parsing	Laura Bragagnolo et.al.	2509.10093	null
2025-09-12	Design and Evaluation of Two Spherical Systems for Mobile 3D Mapping	Marawan Khalil et.al.	2509.10032	null
2025-09-16	Hierarchical MLANet: Multi-level Attention for 3D Face Reconstruction From Single Images	Danling Cao et.al.	2509.10024	null
2025-09-12	Event Camera Guided Visual Media Restoration & 3D Reconstruction: A Survey	Aupendu Kar et.al.	2509.09971	null
2025-09-12	Online 3D Multi-Camera Perception through Robust 2D Tracking and Depth-based Late Aggregation	Vu-Minh Le et.al.	2509.09946	null
2025-09-12	Segment Anything for Cell Tracking	Zhu Chen et.al.	2509.09943	null
2025-09-11	Purge-Gate: Backpropagation-Free Test-Time Adaptation for Point Clouds Classification via Token Purging	Moslem Yazdanpanah et.al.	2509.09785	null
2025-09-09	Australian Supermarket Object Set (ASOS): A Benchmark Dataset of Physical Objects and 3D Models for Robotics and Computer Vision	Akansel Cosgun et.al.	2509.09720	null
2025-09-11	SpatialVID: A Large-Scale Video Dataset with Spatial Annotations	Jiahao Wang et.al.	2509.09676	null
2025-09-11	Geometric Neural Distance Fields for Learning Human Motion Priors	Zhengdi Yu et.al.	2509.09667	null
2025-09-11	ObjectReact: Learning Object-Relative Control for Visual Navigation	Sourav Garg et.al.	2509.09594	null
2025-09-11	Invisible Attributes, Visible Biases: Exploring Demographic Shortcuts in MRI-based Alzheimer’s Disease Classification	Akshit Achara et.al.	2509.09558	null
2025-09-11	InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation	Sirui Xu et.al.	2509.09555	null
2025-09-11	DualTrack: Sensorless 3D Ultrasound needs Local and Global Context	Paul F. R. Wilson et.al.	2509.09530	null
2025-09-11	SMapper: A Multi-Modal Data Acquisition Platform for SLAM Benchmarking	Pedro Miguel Bastos Soares et.al.	2509.09509	null
2025-09-11	Resource-Efficient Glioma Segmentation on Sub-Saharan MRI	Freedmore Sidume et.al.	2509.09469	null
2025-09-12	OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning	Yuecheng Liu et.al.	2509.09332	null
2025-09-11	Unified Start, Personalized End: Progressive Pruning for Efficient 3D Medical Image Segmentation	Linhao Li et.al.	2509.09267	null
2025-09-11	Virtual staining for 3D X-ray histology of bone implants	Sarah C. Irvine et.al.	2509.09235	null
2025-09-11	Medverse: A Universal Model for Full-Resolution 3D Medical Image Segmentation, Transformation and Enhancement	Jiesi Hu et.al.	2509.09232	null
2025-09-11	CWSSNet: Hyperspectral Image Classification Enhanced by Wavelet Domain Convolution	Yulin Tong et.al.	2509.09163	null
2025-09-11	Mind Meets Space: Rethinking Agentic Spatial Intelligence from a Neuroscience-inspired Perspective	Bui Duc Manh et.al.	2509.09154	null
2025-09-11	Video Understanding by Design: How Datasets Shape Architectures and Insights	Lei Wang et.al.	2509.09151	null
2025-09-11	Objectness Similarity: Capturing Object-Level Fidelity in 3D Scene Evaluation	Yuiko Uchida et.al.	2509.09143	null
2025-09-11	AEOS: Active Environment-aware Optimal Scanning Control for UAV LiDAR-Inertial Odometry in Complex Scenes	Jianping Li et.al.	2509.09141	null
2025-09-11	KoopMotion: Learning Almost Divergence Free Koopman Flow Fields for Motion Planning	Alice Kate Li et.al.	2509.09074	null
2025-09-11	Enhancing 3D Medical Image Understanding with Pretraining Aided by 2D Multimodal Large Language Models	Qiuhui Chen et.al.	2509.09064	null
2025-09-10	Integrating Anatomical Priors into a Causal Diffusion Model	Binxu Li et.al.	2509.09054	null
2025-09-10	Rapid Manufacturing of Lightweight Drone Frames Using Single-Tow Architected Composites	Md Habib Ullah Khan et.al.	2509.09024	null
2025-09-10	UltrON: Ultrasound Occupancy Networks	Magdalena Wysocki et.al.	2509.08991	null
2025-09-10	iMatcher: Improve matching in point cloud registration via local-to-global geometric consistency learning	Karim Slimani et.al.	2509.08982	null
2025-09-10	Live(r) Die: Predicting Survival in Colorectal Liver Metastasis	Muhammad Alberb et.al.	2509.08935	null
2025-09-09	Morphology-Preserving Remeshing Approach to Particulate Microstructures via Harmonic Decomposition	Mahmoud Shaqfa et.al.	2509.08855	null
2025-09-10	SAFT: Shape and Appearance of Fabrics from Template via Differentiable Physical Simulations from Monocular Video	David Stotko et.al.	2509.08828	null
2025-09-10	Calib3R: A 3D Foundation Model for Multi-Camera to Robot Calibration and 3D Metric-Scaled Scene Reconstruction	Davide Allegro et.al.	2509.08813	null
2025-09-10	CrowdQuery: Density-Guided Query Module for Enhanced 2D and 3D Detection in Crowded Scenes	Marius Dähling et.al.	2509.08738	null
2025-09-10	TANGO: Traversability-Aware Navigation with Local Metric Control for Topological Goals	Stefan Podgorski et.al.	2509.08699	null
2025-09-10	X-Part: high fidelity and structure coherent shape decomposition	Xinhao Yan et.al.	2509.08643	null
2025-09-10	Implicit Shape-Prior for Few-Shot Assisted 3D Segmentation	Mathilde Monvoisin et.al.	2509.08580	null
2025-09-10	Semantic Causality-Aware Vision-Based 3D Occupancy Prediction	Dubing Chen et.al.	2509.08388	null
2025-09-10	InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection	Zhongyu Xia et.al.	2509.08374	null
2025-09-10	Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration	Hyeonseok Kim et.al.	2509.08280	null
2025-09-10	Symmetry Interactive Transformer with CNN Framework for Diagnosis of Alzheimer’s Disease Using Structural MRI	Zheng Yang et.al.	2509.08243	null
2025-09-09	Zero-Shot Metric Depth Estimation via Monocular Visual-Inertial Rescaling for Autonomous Aerial Navigation	Steven Yang et.al.	2509.08159	null
2025-09-09	APML: Adaptive Probabilistic Matching Loss for Robust 3D Point Cloud Reconstruction	Sasan Sharifipour et.al.	2509.08104	null
2025-09-08	CardioComposer: Flexible and Compositional Anatomical Structure Generation with Disentangled Geometric Guidance	Karim Kadry et.al.	2509.08015	null
2025-09-11	3D and 4D World Modeling: A Survey	Lingdong Kong et.al.	2509.07996	null
2025-09-09	One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation	Zheng Geng et.al.	2509.07978	null
2025-09-09	Dynamic Scene 3D Reconstruction of an Uncooperative Resident Space Object	Bala Prenith Reddy Gopu et.al.	2509.07932	null
2025-09-09	Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model	Zhuoxu Huang et.al.	2509.07825	null
2025-09-09	SplatFill: 3D Scene Inpainting via Depth-Guided Gaussian Splatting	Mahtab Dahaghin et.al.	2509.07809	null
2025-09-10	HairGS: Hair Strand Reconstruction based on 3D Gaussian Splatting	Yimin Pan et.al.	2509.07774	null
2025-09-09	XSRD-Net: EXplainable Stroke Relapse Detection	Christian Gapp et.al.	2509.07772	null
2025-09-09	Data-Efficient Fine-Tuning of Vision-Language Models for Diagnosis of Alzheimer’s Disease	Fangqi Cheng et.al.	2509.07613	null
2025-09-09	Attention Maps in 3D Shape Classification for Dental Stage Estimation with Class Node Graph Attention Networks	Barkin Buyukcakir et.al.	2509.07581	null
2025-09-09	PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image	Peng Li et.al.	2509.07552	null
2025-09-09	HU-based Foreground Masking for 3D Medical Masked Image Modeling	Jin Lee et.al.	2509.07534	null
2025-09-09	MVAT: Multi-View Aware Teacher for Weakly Supervised 3D Object Detection	Saad Lahlali et.al.	2509.07507	null
2025-09-09	OmniMap: A General Mapping Framework Integrating Optics, Geometry, and Semantics	Yinan Deng et.al.	2509.07500	null
2025-09-09	DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning	Wenzhi Guo et.al.	2509.07493	null
2025-09-09	DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation	Ze-Xin Yin et.al.	2509.07435	null
2025-09-08	Efficient Multi-Agent Coordination via Dynamic Joint-State Graph Construction	Yanlin Zhou et.al.	2509.07234	null
2025-09-08	On design, analysis, and hybrid manufacturing of microstructured blade-like geometries	Pablo Antolin et.al.	2509.07044	null
2025-09-07	MEGS $^{2}$ : Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning	Jiarui Chen et.al.	2509.07021	null
2025-09-06	Human-in-the-Loop: Quantitative Evaluation of 3D Models Generation by Large Language Models	Ahmed R. Sadik et.al.	2509.07010	null
2025-09-08	H $_{2}$ OT: Hierarchical Hourglass Tokenizer for Efficient Video Pose Transformers	Wenhao Li et.al.	2509.06956	null
2025-09-08	Intraoperative 2D/3D Registration via Spherical Similarity Learning and Inference-Time Differentiable Levenberg-Marquardt Optimization	Minheng Chen et.al.	2509.06890	null
2025-09-08	Matching Shapes Under Different Topologies: A Topology-Adaptive Deformation Guided Approach	Aymen Merrouche et.al.	2509.06862	null
2025-09-08	SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis	Zhengqing Chen et.al.	2509.06798	null
2025-09-10	P3-SAM: Native 3D Part Segmentation	Changfeng Ma et.al.	2509.06784	null
2025-09-08	UrbanTwin: High-Fidelity Synthetic Replicas of Roadside Lidar Datasets	Muhammad Shahbaz et.al.	2509.06781	null
2025-09-11	Zero-shot 3D-Aware Trajectory-Guided image-to-video generation via Test-Time Training	Ruicheng Zhang et.al.	2509.06723	null
2025-09-08	Cortex-Synth: Differentiable Topology-Aware 3D Skeleton Synthesis with Hierarchical Graph Attention	Mohamed Zayaan S et.al.	2509.06705	null
2025-09-08	Towards In-Air Ultrasonic QR Codes: Deep Learning for Classification of Passive Reflector Constellations	Wouter Jansen et.al.	2509.06615	null
2025-09-08	From Skin to Skeleton: Towards Biomechanically Accurate 3D Digital Humans	Marilyn Keller et.al.	2509.06607	null
2025-09-08	LiHRA: A LiDAR-Based HRI Dataset for Automated Risk Monitoring Methods	Frederik Plahl et.al.	2509.06597	null
2025-09-08	CausNVS: Autoregressive Multi-view Diffusion for Flexible 3D Novel View Synthesis	Xin Kong et.al.	2509.06579	null
2025-09-08	From Rigging to Waving: 3D-Guided Diffusion for Natural Animation of Hand-Drawn Characters	Jie Zhou et.al.	2509.06573	null
2025-09-08	Predicting Brain Tumor Response to Therapy using a Hybrid Deep Learning and Radiomics Approach	Daniil Tikhonov et.al.	2509.06511	null
2025-09-08	Does DINOv3 Set a New Medical Vision Standard?	Che Liu et.al.	2509.06467	null
2025-09-08	A Statistical 3D Stomach Shape Model for Anatomical Analysis	Erez Posner et.al.	2509.06464	null
2025-09-08	Cross3DReg: Towards a Large-scale Real-world Cross-source Point Cloud Registration Benchmark	Zongyi Xu et.al.	2509.06456	null
2025-09-08	Real-time Photorealistic Mapping for Situational Awareness in Robot Teleoperation	Ian Page et.al.	2509.06433	null
2025-09-11	Musculoskeletal simulation of limb movement biomechanics in Drosophila melanogaster	Pembe Gizem Özdil et.al.	2509.06426	null
2025-09-08	3DOF+Quantization: 3DGS quantization for large scenes with limited Degrees of Freedom	Matthieu Gendrin et.al.	2509.06400	null
2025-09-08	Towards scalable organ level 3D plant segmentation: Bridging the data algorithm computing gap	Ruiming Du et.al.	2509.06329	null
2025-09-08	Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes	Mohsen Gholami et.al.	2509.06266	null
2025-09-07	O $^3$ Afford: One-Shot 3D Object-to-Object Affordance Grounding for Generalizable Robotic Manipulation	Tongxuan Tian et.al.	2509.06233	null
2025-09-07	Learning in ImaginationLand: Omnidirectional Policies through 3D Generative Models (OP-Gen)	Yifei Ren et.al.	2509.06191	null
2025-09-07	SpecSwin3D: Generating Hyperspectral Imagery from Multispectral Data via Transformer Networks	Tang Sui et.al.	2509.06122	null
2025-09-07	MedSeqFT: Sequential Fine-tuning Foundation Models for 3D Medical Image Segmentation	Yiwen Ye et.al.	2509.06096	null
2025-09-07	Robotic Manipulation Framework Based on Semantic Keypoints for Packing Shoes of Different Sizes, Shapes, and Softness	Yi Dong et.al.	2509.06048	null
2025-09-07	Motion Aware ViT-based Framework for Monocular 6-DoF Spacecraft Pose Estimation	Jose Sosa et.al.	2509.06000	null
2025-09-07	S-LAM3D: Segmentation-Guided Monocular 3D Object Detection via Feature Space Fusion	Diana-Alexandra Sas et.al.	2509.05999	null
2025-09-07	Imagining Alternatives: Towards High-Resolution 3D Counterfactual Medical Image Generation via Language Guidance	Mohamed Mohamed et.al.	2509.05978	null
2025-09-07	Spatial-Aware Self-Supervision for Medical 3D Imaging with Multi-Granularity Observable Tasks	Yiqin Zhang et.al.	2509.05967	null
2025-09-07	Neural Bloom: A Deep Learning Approach to Real-Time Lighting	Rafal Karp et.al.	2509.05963	null
2025-09-07	StripDet: Strip Attention-Based Lightweight 3D Object Detection from Point Cloud	Weichao Wang et.al.	2509.05954	null
2025-09-07	Near Real-Time Dust Aerosol Detection with 3D Convolutional Neural Networks on MODIS Data	Caleb Gates et.al.	2509.05887	null
2025-09-06	Programming tension in 3D printed networks inspired by spiderwebs	Thijs Masmeijer et.al.	2509.05855	null
2025-09-06	CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation	In-Jae Lee et.al.	2509.05785	null
2025-09-06	3DPillars: Pillar-based two-stage 3D object detection	Jongyoun Noh et.al.	2509.05780	null
2025-09-06	Posterior shape models revisited: Improving 3D reconstructions from partial data using target specific models	Jonathan Aellen et.al.	2509.05776	null
2025-09-06	JRN-Geo: A Joint Perception Network based on RGB and Normal images for Cross-view Geo-localization	Hongyu Zhou et.al.	2509.05696	null
2025-09-06	MonoGlass3D: Monocular 3D Glass Detection with Plane Regression and Adaptive Feature Fusion	Kai Zhang et.al.	2509.05599	null
2025-09-06	PaMO: Parallel Mesh Optimization for Intersection-Free Low-Poly Modeling on the GPU	Seonghun Oh et.al.	2509.05595	null
2025-09-06	Reconstruction and Reenactment Separated Method for Realistic Gaussian Head	Zhiling Ye et.al.	2509.05582	null
2025-09-06	OccVLA: Vision-Language-Action Model with Implicit 3D Occupancy Supervision	Ruixun Liu et.al.	2509.05578	null
2025-09-05	Visibility-Aware Language Aggregation for Open-Vocabulary Segmentation in 3D Gaussian Splatting	Sen Wang et.al.	2509.05515	null
2025-09-05	Microrobot Vascular Parkour: Analytic Geometry-based Path Planning with Real-time Dynamic Obstacle Avoidance	Yanda Yang et.al.	2509.05500	null
2025-09-05	Veriserum: A dual-plane fluoroscopic dataset with knee implant phantoms for deep learning in medical imaging	Jinhao Wang et.al.	2509.05483	null
2025-09-02	INF-3DP: Implicit Neural Fields for Collision-Free Multi-Axis 3D Printing	Jiasheng Qu et.al.	2509.05345	null
2025-09-04	Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control	Haruo Fujiwara et.al.	2509.05285	null
2025-09-08	LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation	Yinglin Duan et.al.	2509.05263	null
2025-09-05	Enhancing 3D Point Cloud Classification with ModelNet-R and Point-SkipNet	Mohammad Saeid et.al.	2509.05198	null
2025-09-05	SGS-3D: High-Fidelity 3D Instance Segmentation via Reliable Semantic Mask Splitting and Growing	Chaolei Wang et.al.	2509.05144	null
2025-09-05	A Scalable Attention-Based Approach for Image-to-3D Texture Mapping	Arianna Rampini et.al.	2509.05131	null
2025-09-05	GeoSplat: A Deep Dive into Geometry-Constrained Gaussian Splatting	Yangming Li et.al.	2509.05075	null
2025-09-05	LUIVITON: Learned Universal Interoperable VIrtual Try-ON	Cong Cao et.al.	2509.05030	null
2025-09-05	*Ground-Aware Octree-A Hybrid Path Planning for Memory-Efficient 3D Navigation of Ground Vehicles**	Byeong-Il Ham et.al.	2509.04950	null
2025-09-05	SynGen-Vision: Synthetic Data Generation for training industrial vision models	Alpana Dubey et.al.	2509.04894	null
2025-09-05	CoRe-GS: Coarse-to-Refined Gaussian Splatting with Semantic Object Focus	Hannah Schieber et.al.	2509.04859	null
2025-09-05	Pose-Free 3D Quantitative Phase Imaging of Flowing Cellular Populations	Enze Ye et.al.	2509.04848	null
2025-09-04	Domain Adaptation for Different Sensor Configurations in 3D Object Detection	Satoshi Tanaka et.al.	2509.04711	null
2025-09-04	Planning from Point Clouds over Continuous Actions for Multi-object Rearrangement	Kallol Saha et.al.	2509.04645	null
2025-09-04	Few-step Flow for 3D Generation via Marginal-Data Transport Distillation	Zanwei Zhou et.al.	2509.04406	null
2025-09-04	SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer	Jimin Xu et.al.	2509.04379	null
2025-09-04	PAOLI: Pose-free Articulated Object Learning from Sparse-view Images	Jianning Deng et.al.	2509.04276	null
2025-09-04	TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models	Yuxin Gong et.al.	2509.04269	null
2025-09-04	TensoIS: A Step Towards Feed-Forward Tensorial Inverse Subsurface Scattering for Perlin Distributed Heterogeneous Media	Ashish Tiwari et.al.	2509.04047	null
2025-09-04	SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation	Han Huang et.al.	2509.03999	null
2025-09-04	TopoSculpt: Betti-Steered Topological Sculpting of 3D Fine-grained Tubular Shapes	Minghui Zhang et.al.	2509.03938	null
2025-09-04	LMVC: An End-to-End Learned Multiview Video Coding Framework	Xihua Sheng et.al.	2509.03922	null
2025-09-04	OccTENS: 3D Occupancy World Model via Temporal Next-Scale Prediction	Bu Jin et.al.	2509.03887	null
2025-09-04	MedVista3D: Vision-Language Modeling for Reducing Diagnostic Errors in 3D CT Disease Detection, Understanding and Reporting	Yuheng Li et.al.	2509.03800	null
2025-09-03	ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction	Sankeerth Durvasula et.al.	2509.03775	null
2025-09-03	Low-Cost Open-Source Ambidextrous Robotic Hand with 23 Direct-Drive servos for American Sign Language Alphabet	Kelvin Daniel Gonzalez Amador et.al.	2509.03690	null
2025-09-03	Reg3D: Reconstructive Geometry Instruction Tuning for 3D Scene Understanding	Hongpei Zheng et.al.	2509.03635	null
2025-09-03	treeX: Unsupervised Tree Instance Segmentation in Dense Forest Point Clouds	Josafat-Mattias Burmeister et.al.	2509.03633	null
2025-09-03	PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection	Qihang Zhou et.al.	2509.03277	null
2025-09-03	SynBT: High-quality Tumor Synthesis for Breast Tumor Segmentation by 3D Diffusion Model	Hongxu Yang et.al.	2509.03267	null
2025-09-03	PI3DETR: Parametric Instance Detection of 3D Point Cloud Edges with a Geometry-Aware 3DETR	Fabio F. Oberweger et.al.	2509.03262	null
2025-09-03	Efficient Active Training for Deep LiDAR Odometry	Beibei Zhou et.al.	2509.03211	null
2025-09-03	Preserving instance continuity and length in segmentation through connectivity-aware loss computation	Karol Szustakowski et.al.	2509.03154	null
2025-09-03	Temporally-Aware Diffusion Model for Brain Progression Modelling with Bidirectional Temporal Regularisation	Mattia Litrico et.al.	2509.03141	null
2025-09-03	TRELLIS-Enhanced Surface Features for Comprehensive Intracranial Aneurysm Analysis	Clément Hervé et.al.	2509.03095	null
2025-09-03	Isolated Bangla Handwritten Character Classification using Transfer Learning	Abdul Karim et.al.	2509.03061	null
2025-09-03	Resilient Multimodal Industrial Surface Defect Detection with Uncertain Sensors Availability	Shuai Jiang et.al.	2509.02962	null
2025-09-03	High-Fidelity Digital Twins for Bridging the Sim2Real Gap in LiDAR-Based ITS Perception	Muhammad Shahbaz et.al.	2509.02904	null
2025-09-02	Robotic 3D Flower Pose Estimation for Small-Scale Urban Farms	Harsh Muriki et.al.	2509.02870	null
2025-09-02	Improving the Resilience of Quadrotors in Underground Environments by Combining Learning-based and Safety Controllers	Isaac Ronald Ward et.al.	2509.02808	null
2025-09-02	FastVGGT: Training-Free Acceleration of Visual Geometry Transformer	You Shen et.al.	2509.02560	null
2025-09-02	Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots	Minghuan Liu et.al.	2509.02530	null
2025-09-02	Enhancing Fitness Movement Recognition with Attention Mechanism and Pre-Trained Feature Extractors	Shanjid Hasan Nishat et.al.	2509.02511	null
2025-09-02	Unifi3D: A Study on 3D Representations for Generation and Reconstruction in a Common Framework	Nina Wiedemann et.al.	2509.02474	null
2025-09-02	TeRA: Rethinking Text-guided Realistic 3D Avatar Generation	Yanwen Wang et.al.	2509.02466	null
2025-09-02	U-ARM : Ultra low-cost general teleoperation interface for robot manipulation	Yanwen Zou et.al.	2509.02437	null
2025-09-02	Decoupling Bidirectional Geometric Representations of 4D cost volume with 2D convolution	Xiaobao Wei et.al.	2509.02415	null
2025-09-02	Category-Aware 3D Object Composition with Disentangled Texture and Shape Multi-view Diffusion	Zeren Xiong et.al.	2509.02357	null
2025-09-02	OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds	Longrong Yang et.al.	2509.02322	null
2025-09-03	Sem-RaDiff: Diffusion-Based 3D Radar Semantic Perception in Cluttered Agricultural Environments	Ruibin Zhang et.al.	2509.02283	null
2025-09-02	Think2Sing: Orchestrating Structured Motion Subtitles for Singing-Driven 3D Head Animation	Zikai Huang et.al.	2509.02278	null
2025-09-02	GRMM: Real-Time High-Fidelity Gaussian Morphable Head Model with Learned Residuals	Mohit Mendiratta et.al.	2509.02141	null
2025-09-02	2D Gaussian Splatting with Semantic Alignment for Image Inpainting	Hongyu Li et.al.	2509.01964	null
2025-09-02	AI-Driven Marine Robotics: Emerging Trends in Underwater Perception and Ecosystem Monitoring	Scarlett Raine et.al.	2509.01878	null
2025-09-02	Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction	Xueyang Kang et.al.	2509.01873	null
2025-09-01	Articulated Object Estimation in the Wild	Abdelrhman Werby et.al.	2509.01708	null
2025-09-01	TransForSeg: A Multitask Stereo ViT for Joint Stereo Segmentation and 3D Force Estimation in Catheterization	Pedram Fekri et.al.	2509.01605	null
2025-09-01	ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association	Ganlin Zhang et.al.	2509.01584	null
2025-09-01	Unified Supervision For Vision-Language Modeling in 3D Computed Tomography	Hao-Chih Lee et.al.	2509.01554	null
2025-09-01	FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field	Fan Zhu et.al.	2509.01547	null
2025-09-01	A Continuous-Time Consistency Model for 3D Point Cloud Generation	Sebastian Eilermann et.al.	2509.01492	null
2025-09-01	PointSlice: Accurate and Efficient Slice-Based Representation for 3D Object Detection from Point Clouds	Liu Qifeng et.al.	2509.01487	null
2025-09-01	Im2Haircut: Single-view Strand-based Hair Reconstruction for Human Avatars	Vanessa Sklyarova et.al.	2509.01469	null
2025-09-01	RibPull: Implicit Occupancy Fields and Medial Axis Extraction for CT Ribcage Scans	Emmanouil Nikolakakis et.al.	2509.01402	null
2025-09-01	M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision	Che Liu et.al.	2509.01360	null
2025-09-01	Guided Model-based LiDAR Super-Resolution for Resource-Efficient Automotive scene Segmentation	Alexandros Gkillas et.al.	2509.01317	null
2025-09-01	Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views	Xiangdong Zhang et.al.	2509.01250	null
2025-09-01	Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation	Lee Chae-Yeon et.al.	2509.01242	null
2025-09-01	DcMatch: Unsupervised Multi-Shape Matching with Dual-Level Consistency	Tianwei Ye et.al.	2509.01204	null
2025-09-01	RealMat: Realistic Materials with Diffusion and Reinforcement Learning	Xilong Zhou et.al.	2509.01134	null
2025-09-01	Robix: A Unified Model for Robot Interaction, Reasoning and Planning	Huang Fang et.al.	2509.01106	null
2025-09-01	Bidirectional Sparse Attention for Faster Video Diffusion Training	Chenlu Zhan et.al.	2509.01085	null
2025-09-01	TARA: A Low-Cost 3D-Printed Robotic Arm for Accessible Robotics Education	Thays Leach Mitre et.al.	2509.01043	null
2025-08-31	Towards Integrating Multi-Spectral Imaging with Gaussian Splatting	Josef Grün et.al.	2509.00989	null
2025-09-03	GS-TG: 3D Gaussian Splatting Accelerator with Tile Grouping for Reducing Redundant Sorting while Preserving Rasterization Efficiency	Joongho Jo et.al.	2509.00911	null
2025-09-03	UPGS: Unified Pose-aware Gaussian Splatting for Dynamic Scene Deblurring	Zhijing Wu et.al.	2509.00831	null
2025-08-31	SWAGSplatting: Semantic-guided Water-scene Augmented Gaussian Splatting	Zhuodong Jiang et.al.	2509.00800	null
2025-08-31	OmniReason: A Temporal-Guided Vision-Language-Action Framework for Autonomous Driving	Pei Liu et.al.	2509.00789	null
2025-08-31	InterPose: Learning to Generate Human-Object Interactions from Large-Scale Web Videos	Yangsong Zhang et.al.	2509.00767	null
2025-08-31	MarkSplatter: Generalizable Watermarking for 3D Gaussian Splatting Model via Splatter Image Structure	Xiufeng Huang et.al.	2509.00757	null
2025-08-31	MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation	Aviral Chharia et.al.	2509.00649	null
2025-08-30	Embodied Spatial Intelligence: from Implicit Scene Modeling to Spatial Reasoning	Jiading Fang et.al.	2509.00465	null
2025-08-30	AGS: Accelerating 3D Gaussian Splatting SLAM via CODEC-Assisted Frame Covisibility Detection	Houshu He et.al.	2509.00433	null
2025-08-30	Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation	Jialiang Kang et.al.	2509.00379	null
2025-08-30	Adaptive Point-Prompt Tuning: Fine-Tuning Heterogeneous Foundation Models for 3D Point Cloud Analysis	Mengke Li et.al.	2509.00374	null
2025-08-30	Autonomous Aggregate Sorting in Construction and Mining via Computer Vision-Aided Robotic Arm Systems	Md. Taherul Islam Shawon et.al.	2509.00339	null
2025-08-29	3D-LATTE: Latent Space 3D Editing from Textual Instructions	Maria Parelli et.al.	2509.00269	null
2025-08-29	MicroLabVR: Interactive 3D Visualization of Simulated Spatiotemporal Microbiome Data in Virtual Reality	Simon Burbach et.al.	2508.21736	null
2025-08-29	CAD2DMD-SET: Synthetic Generation Tool of Digital Measurement Device CAD Model Datasets for fine-tuning Large Vision-Language Models	João Valente et.al.	2508.21732	null
2025-08-29	Temporal Flow Matching for Learning Spatio-Temporal Trajectories in 4D Longitudinal Medical Imaging	Nico Albert Disch et.al.	2508.21580	null
2025-08-29	Complete Gaussian Splats from a Single Image with Denoising Diffusion Models	Ziwei Liao et.al.	2508.21542	null
2025-08-29	Scale-GS: Efficient Scalable Gaussian Splatting via Redundancy-filtering Training on Streaming Content	Jiayu Yang et.al.	2508.21444	null
2025-08-29	Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image	Qingran Miao et.al.	2508.21371	null
2025-08-29	Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning	Yuquan Bi et.al.	2508.21363	null
2025-08-29	ARGS: Advanced Regularization on Aligning Gaussians over the Surface	Jeong Uk Lee et.al.	2508.21344	null
2025-08-29	Mini Autonomous Car Driving based on 3D Convolutional Neural Networks	Pablo Moraes et.al.	2508.21271	null
2025-08-28	PHD: Personalized 3D Human Body Fitting with Point Diffusion	Hsuan-I Ho et.al.	2508.21257	null
2025-08-28	SYNBUILD-3D: A large, multi-modal, and semantically rich synthetic dataset of 3D building models at Level of Detail 4	Kevin Mayer et.al.	2508.21169	null
2025-08-28	RadGS-Reg: Registering Spine CT with Biplanar X-rays via Joint 3D Radiative Gaussians Reconstruction and 3D/3D Registration	Ao Shen et.al.	2508.21154	null
2025-08-27	ScanMove: Motion Prediction and Transfer for Unregistered Body Meshes	Thomas Besnier et.al.	2508.21095	null
2025-08-28	Multi-View 3D Point Tracking	Frano Rajič et.al.	2508.21060	null
2025-08-28	ActLoc: Learning to Localize on the Move via Active Viewpoint Selection	Jiajie Li et.al.	2508.20981	null
2025-08-28	DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes	Yajiao Xiong et.al.	2508.20965	null
2025-08-28	PLUME: Procedural Layer Underground Modeling Engine	Gabriel Manuel Garcia et.al.	2508.20926	null
2025-08-28	Estimating 2D Keypoints of Surgical Tools Using Vision-Language Models with Low-Rank Adaptation	Krit Duangprom et.al.	2508.20830	null
2025-08-28	Surfel-based 3D Registration with Equivariant SE(3) Features	Xueyang Kang et.al.	2508.20789	null
2025-08-28	SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding	Jiawen Lin et.al.	2508.20758	null
2025-08-28	CardioMorphNet: Cardiac Motion Prediction Using a Shape-Guided Bayesian Recurrent Deep Network	Reza Akbari Movahed et.al.	2508.20734	null
2025-08-28	Task-Oriented Edge-Assisted Cross-System Design for Real-Time Human-Robot Interaction in Industrial Metaverse	Kan Chen et.al.	2508.20664	null
2025-08-28	AvatarBack: Back-Head Generation for Complete 3D Avatars from Front-View Images	Shiqi Xin et.al.	2508.20623	null
2025-08-28	Optimization-Based Calibration for Intravascular Ultrasound Volume Reconstruction	Karl-Philippe Beaudet et.al.	2508.20605	null
2025-08-28	Embracing Aleatoric Uncertainty: Generating Diverse 3D Human Motion	Zheng Qin et.al.	2508.20604	null
2025-08-28	GLaRE: A Graph-based Landmark Region Embedding Network for Emotion Recognition	Debasis Maji et.al.	2508.20579	null
2025-08-28	Enhancing Pseudo-Boxes via Data-Level LiDAR-Camera Fusion for Unsupervised 3D Object Detection	Mingqian Ji et.al.	2508.20530	null
2025-08-28	Adam SLAM - the last mile of camera calibration with 3DGS	Matthieu Gendrin et.al.	2508.20526	null
2025-08-28	IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection	Xuanming Cao et.al.	2508.20492	null
2025-08-28	Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts	Zixuan Hu et.al.	2508.20488	null
2025-08-28	Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation	Jiusi Li et.al.	2508.20471	null
2025-08-28	Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation	Xiaochuan Li et.al.	2508.20470	null
2025-08-28	Prediction of Distant Metastasis for Head and Neck Cancer Patients Using Multi-Modal Tumor and Peritumoral Feature Fusion Network	Zizhao Tang et.al.	2508.20469	null
2025-08-27	MedNet-PVS: A MedNeXt-Based Deep Learning Model for Automated Segmentation of Perivascular Spaces	Zhen Xuen Brandon Low et.al.	2508.20256	null
2025-08-27	Grounding Multimodal Large Language Models with Quantitative Skin Attributes: A Retrieval Study	Max Torop et.al.	2508.20188	null
2025-08-27	Is the medical image segmentation problem solved? A survey of current developments and future directions	Guoping Xu et.al.	2508.20139	null
2025-08-26	A Machine Learning Approach to Volumetric Computations of Solid Pulmonary Nodules	Yihan Zhou et.al.	2508.20127	null
2025-08-27	Seam360GS: Seamless 360° Gaussian Splatting from Real-World Omnidirectional Images	Changha Shin et.al.	2508.20080	null
2025-08-27	OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations	Peng-Hao Hsu et.al.	2508.20063	null
2025-08-27	Visio-Verbal Teleimpedance Interface: Enabling Semi-Autonomous Control of Physical Interaction via Eye Tracking and Speech	Henk H. A. Jekel et.al.	2508.20037	null
2025-08-27	Integrating SAM Supervision for 3D Weakly Supervised Point Cloud Segmentation	Lechun You et.al.	2508.19909	null
2025-08-27	Multispectral LiDAR data for extracting tree points in urban and suburban areas	Narges Takhtkeshha et.al.	2508.19881	null
2025-08-27	Multimodal Conditional MeshGAN for Personalized Aneurysm Growth Prediction	Long Chen et.al.	2508.19862	null
2025-08-27	MAPo : Motion-Aware Partitioning of Deformable 3D Gaussian Splatting for High-Fidelity Dynamic Scene Reconstruction	Han Jiao et.al.	2508.19786	null
2025-08-27	FastAvatar: Towards Unified Fast High-Fidelity 3D Avatar Reconstruction with Large Gaussian Reconstruction Transformers	Yue Wu et.al.	2508.19754	null
2025-08-28	LabelGS: Label-Aware 3D Gaussian Splatting for 3D Scene Segmentation	Yupeng Zhang et.al.	2508.19699	null
2025-08-27	SAT: Supervisor Regularization and Animation Augmentation for Two-process Monocular Texture 3D Human Reconstruction	Gangjian Zhang et.al.	2508.19688	null
2025-08-27	Beyond BEV: Optimizing Point-Level Tokens for Collaborative Perception	Yang Li et.al.	2508.19638	null
2025-08-27	Generalizing Monocular 3D Object Detection	Abhinav Kumar et.al.	2508.19593	null
2025-08-27	DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View	Tian Qiu et.al.	2508.19508	null
2025-08-25	2D Ultrasound Elasticity Imaging of Abdominal Aortic Aneurysms Using Deep Neural Networks	Utsav Ratna Tuladhar et.al.	2508.19303	null
2025-08-25	CellINR: Implicitly Overcoming Photo-induced Artifacts in 4D Live Fluorescence Microscopy	Cunmin Zhao et.al.	2508.19300	null
2025-08-25	Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation	Alexandros Gkillas et.al.	2508.19290	null
2025-08-26	VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space	Lin Li et.al.	2508.19247	null
2025-08-26	Articulate3D: Zero-Shot Text-Driven 3D Object Posing	Oishi Deb et.al.	2508.19244	null
2025-08-26	Style4D-Bench: A Benchmark Suite for 4D Stylization	Beiqi Chen et.al.	2508.19243	null
2025-08-26	LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding	Julian Ost et.al.	2508.19204	null
2025-08-26	Dual Enhancement on 3D Vision-Language Perception for Monocular 3D Visual Grounding	Yuzhen Li et.al.	2508.19165	null
2025-08-26	Random forest-based out-of-distribution detection for robust lung cancer segmentation	Aneesh Rangnekar et.al.	2508.19112	null
2025-08-26	GReAT: leveraging geometric artery data to improve wall shear stress assessment	Julian Suk et.al.	2508.19030	null
2025-08-26	RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation	Siyuan You et.al.	2508.19003	null
2025-08-26	Can we make NeRF-based visual localization privacy-preserving?	Maxime Pietrantoni et.al.	2508.18971	null
2025-08-26	PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads	Shashikant Verma et.al.	2508.18944	null
2025-08-26	ColorGS: High-fidelity Surgical Scene Reconstruction with Colored Gaussian Splatting	Qun Ji et.al.	2508.18696	null
2025-08-26	AgriChrono: A Multi-modal Dataset Capturing Crop Growth and Lighting Variability with a Field Robot	Jaehwan Jeong et.al.	2508.18694	null
2025-08-26	ROSE: Remove Objects with Side Effects in Videos	Chenxuan Miao et.al.	2508.18633	null
2025-08-26	SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis	Xiaohao Sun et.al.	2508.18597	null
2025-08-25	Real-time 3D Visualization of Radiance Fields on Light Field Displays	Jonghyun Kim et.al.	2508.18540	null
2025-08-29	Adaptive Visual Navigation Assistant in 3D RPGs	Kaijie Xu et.al.	2508.18539	null
2025-08-25	SAT-SKYLINES: 3D Building Generation from Satellite Imagery and Coarse Geometric Priors	Zhangyu Jin et.al.	2508.18531	null
2025-08-25	DoGFlow: Self-Supervised LiDAR Scene Flow via Cross-Modal Doppler Guidance	Ajinkya Khoche et.al.	2508.18506	null
2025-08-25	FastAvatar: Instant 3D Gaussian Splatting for Faces from Single Unconstrained Poses	Hao Liang et.al.	2508.18389	null
2025-08-23	SERES: Semantic-aware neural reconstruction from sparse views	Bo Xu et.al.	2508.18314	null
2025-08-22	Towards Training-Free Underwater 3D Object Detection from Sonar Point Clouds: A Comparison of Traditional and Deep Learning Approaches	M. Salman Shaukat et.al.	2508.18293	null
2025-08-25	ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models	Haitang Feng et.al.	2508.18271	null
2025-08-25	GSVisLoc: Generalizable Visual Localization for Gaussian Splatting Scene Representations	Fadi Khatib et.al.	2508.18242	null
2025-08-21	PriorFormer: A Transformer for Real-time Monocular 3D Human Pose Estimation with Versatile Geometric Priors	Mohamed Adjel et.al.	2508.18238	null
2025-08-25	Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance	Ayce Idil Aytekin et.al.	2508.18213	null
2025-09-02	EventTracer: Fast Path Tracing-based Event Stream Rendering	Zhenyang Li et.al.	2508.18071	null
2025-08-25	Topology Aware Neural Interpolation of Scalar Fields	Mohamed Kissi et.al.	2508.17995	null
2025-08-25	SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization	Junyuan Deng et.al.	2508.17972	null
2025-08-25	A holistic perception system of internal and external monitoring for ground autonomous vehicles: AutoTRUST paradigm	Alexandros Gkillas et.al.	2508.17969	null
2025-08-25	Beam Geometry and Input Dimensionality: Impact on Sparse-Sampling Artifact Correction for Clinical CT with U-Nets	Tina Dorosti et.al.	2508.17961	null
2025-08-25	EndoUFM: Utilizing Foundation Models for Monocular depth estimation of endoscopic images	Xinning Yao et.al.	2508.17916	null
2025-08-25	Camera Pose Refinement via 3D Gaussian Splatting	Lulu Hao et.al.	2508.17876	null
2025-08-25	HLG: Comprehensive 3D Room Construction via Hierarchical Layout Generation	Xiping Wang et.al.	2508.17832	null
2025-08-25	CubeDN: Real-time Drone Detection in 3D Space from Dual mmWave Radar Cubes	Yuan Fang et.al.	2508.17831	null
2025-08-25	MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting	Hanzhi Chang et.al.	2508.17811	null
2025-08-25	DroneKey: Drone 3D Pose Estimation in Image Sequences using Gated Key-representation and Pose-adaptive Learning	Seo-Bin Hwang et.al.	2508.17746	null
2025-08-25	MEVITA: Open-Source Bipedal Robot Assembled from E-Commerce Components via Sheet Metal Welding	Kento Kawaharazuka et.al.	2508.17684	null
2025-08-28	Generating Human-AI Collaborative Design Sequence for 3D Assets via Differentiable Operation Graph	Xiaoyang Huang et.al.	2508.17645	null
2025-08-25	Wound3DAssist: A Practical Framework for 3D Wound Assessment	Remi Chierchia et.al.	2508.17635	null
2025-08-25	GWM: Towards Scalable Gaussian World Models for Robotic Manipulation	Guanxing Lu et.al.	2508.17600	null
2025-08-25	TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints	Vinh-Thuan Ly et.al.	2508.17595	null
2025-08-25	IDU: Incremental Dynamic Update of Existing 3D Virtual Environments with New Imagery Data	Meida Chen et.al.	2508.17579	null
2025-08-24	Random-phase Gaussian Wave Splatting for Computer-generated Holography	Brian Chao et.al.	2508.17480	null
2025-09-01	Investigating Domain Gaps for Indoor 3D Object Detection	Zijing Zhao et.al.	2508.17439	null
2025-08-20	Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels	Long Le et.al.	2508.17437	null
2025-08-24	MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling	Haoyu Wang et.al.	2508.17404	null
2025-08-26	PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation	Xiaoyang Hao et.al.	2508.17239	null
2025-08-24	4D Visual Pre-training for Robot Learning	Chengkai Hou et.al.	2508.17230	null
2025-08-24	VROOM - Visual Reconstruction over Onboard Multiview	Yajat Yadav et.al.	2508.17172	null
2025-08-23	DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method	Qingwen Zhang et.al.	2508.17054	null
2025-08-23	PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models	Xianjing Cheng et.al.	2508.17050	null
2025-08-23	M3DMap: Object-aware Multimodal 3D Mapping for Dynamic Environments	Dmitry Yudin et.al.	2508.17044	null
2025-08-23	DualReg: Dual-Space Filtering and Reinforcement for Rigid Registration	Jiayi Li et.al.	2508.17034	null
2025-08-23	Fiducial Marker Splatting for High-Fidelity Robotics Simulations	Diram Tabaa et.al.	2508.17012	null
2025-08-23	A Survey of Deep Learning-based Point Cloud Denoising	Jinxi Wang et.al.	2508.17011	null
2025-08-23	Align 3D Representation and Text Embedding for 3D Content Personalization	Qi Song et.al.	2508.16932	null
2025-08-23	Structural Energy-Guided Sampling for View-Consistent Text-to-3D	Qing Zhang et.al.	2508.16917	null
2025-08-23	MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation	Prerit Gupta et.al.	2508.16911	null
2025-08-23	Relative Navigation and Dynamic Target Tracking for Autonomous Underwater Proximity Operations	David Baxter et.al.	2508.16901	null
2025-08-23	Generating Synthetic Contrast-Enhanced Chest CT Images from Non-Contrast Scans Using Slice-Consistent Brownian Bridge Diffusion Network	Pouya Shiri et.al.	2508.16897	null
2025-08-23	A Workflow for Map Creation in Autonomous Vehicle Simulations	Zubair Islam et.al.	2508.16856	null
2025-08-22	Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes	Xinhao Xiang et.al.	2508.16812	null
2025-08-21	BrainPath: Generating Subject-Specific Brain Aging Trajectories	Yifan Li et.al.	2508.16667	null
2025-08-22	MV-RAG: Retrieval Augmented Multiview Diffusion	Yosef Dayani et.al.	2508.16577	null
2025-08-22	Real-time 3D Light-field Viewing with Eye-tracking on Conventional Displays	Trung Hieu Pham et.al.	2508.16535	null
2025-08-26	Comparative Analysis of UAV Path Planning Algorithms for Efficient Navigation in Urban 3D Environments	Hichem Cheriet et.al.	2508.16515	null
2025-08-22	On Kinodynamic Global Planning in a Simplicial Complex Environment: A Mixed Integer Approach	Otobong Jerome et.al.	2508.16511	null
2025-08-22	Arbitrary-Scale 3D Gaussian Super-Resolution	Huimin Zeng et.al.	2508.16467	null
2025-08-25	HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images	Anilkumar Swamy et.al.	2508.16465	null
2025-08-22	HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction	Sara Rojas et.al.	2508.16433	null
2025-08-22	SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather	Edoardo Palladin et.al.	2508.16408	null
2025-08-22	Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars	NVIDIA et.al.	2508.16401	null
2025-08-22	Self-Validated Learning for Particle Separation: A Correctness-Based Self-Training Framework Without Human Labels	Philipp D. Lösel et.al.	2508.16224	null
2025-08-22	4D Virtual Imaging Platform for Dynamic Joint Assessment via Uni-Plane X-ray and 2D-3D Registration	Hao Tang et.al.	2508.16138	null
2025-08-22	Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables	Wontae Kim et.al.	2508.16121	null
2025-08-22	A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection	Qifeng Liu et.al.	2508.16069	null
2025-08-22	Advances and Trends in the 3D Reconstruction of the Shape and Motion of Animals	Ziqi Li et.al.	2508.16062	null
2025-08-22	NeuralMeshing: Complete Object Mesh Extraction from Casual Captures	Floris Erich et.al.	2508.16026	null
2025-08-21	Self-Aligning EPM Connector: A Versatile Solution for Adaptive and Multi-Modal Interfaces	Bingchao Wang et.al.	2508.16008	null
2025-08-21	GelSLAM: A Real-time, High-Fidelity, and Robust 3D Tactile SLAM System	Hung-Jui Huang et.al.	2508.15990	null
2025-08-21	UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation	Zhaodong Jiang et.al.	2508.15972	null
2025-08-21	Text-Driven 3D Hand Motion Generation from Sign Language Data	Léore Bensabath et.al.	2508.15902	null
2025-08-21	Active Prostate Phantom with Multiple Chambers	Sizhe Tian et.al.	2508.15873	null
2025-08-21	SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass	Yanxu Meng et.al.	2508.15769	null
2025-08-21	ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling	Jinhyung Park et.al.	2508.15767	null
2025-08-21	CM2LoD3: Reconstructing LoD3 Building Models Using Semantic Conflict Maps	Franz Hanke et.al.	2508.15672	null
2025-08-25	Hessian-Based Lightweight Neural Network HessNet for State-of-the-Art Brain Vessel Segmentation on a Minimal Training Dataset	Alexandra Bernadotte et.al.	2508.15660	null
2025-08-21	Towards a 3D Transfer-based Black-box Attack via Critical Feature Guidance	Shuchao Pang et.al.	2508.15650	null
2025-08-21	Fast globally optimal Truncated Least Squares point cloud registration with fixed rotation axis	Ivo Ivanov et.al.	2508.15613	null
2025-08-21	Deep Equilibrium Convolutional Sparse Coding for Hyperspectral Image Denoising	Jin Ye et.al.	2508.15553	null
2025-08-21	MExECON: Multi-view Extended Explicit Clothed humans Optimized via Normal integration	Fulden Ece Uğur et.al.	2508.15500	null
2025-08-21	Enhancing Novel View Synthesis from extremely sparse views with SfM-free 3D Gaussian Splatting Framework	Zongqi He et.al.	2508.15457	null
2025-08-25	DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians	Cong Wang et.al.	2508.15376	null
2025-08-21	Image-Conditioned 3D Gaussian Splat Quantization	Xinshuang Liu et.al.	2508.15372	null
2025-08-21	RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features	Olga Matykina et.al.	2508.15353	null
2025-08-21	Mag-Match: Magnetic Vector Field Features for Map Matching and Registration	William McDonald et.al.	2508.15300	null
2025-08-21	BasketLiDAR: The First LiDAR-Camera Multimodal Dataset for Professional Basketball MOT	Ryunosuke Hayashi et.al.	2508.15299	null
2025-08-21	Collaborative Multi-Modal Coding for High-Quality 3D Generation	Ziang Cao et.al.	2508.15228	null
2025-08-25	MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion	Xuyang Chen et.al.	2508.15169	null
2025-08-21	Reliable Multi-view 3D Reconstruction for `Just-in-time’ Edge Environments	Md. Nurul Absur et.al.	2508.15158	null
2025-08-21	Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors	Jeonghyun Noh et.al.	2508.15151	null
2025-08-20	Virtual Community: An Open World for Humans, Robots, and Society	Qinhong Zhou et.al.	2508.14893	null
2025-08-20	Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds	Jia Lu et.al.	2508.14892	null
2025-08-20	GaussianArt: Unified Modeling of Geometry and Motion for Articulated Objects	Licheng Shen et.al.	2508.14891	null
2025-08-22	MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds	Bingquan Dai et.al.	2508.14879	null
2025-08-20	Tinker: Diffusion’s Gift to 3D–Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization	Canyu Zhao et.al.	2508.14811	null
2025-08-20	Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels	Fabian Holst et.al.	2508.14767	null
2025-08-20	GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting	Jiaxin Wei et.al.	2508.14717	null
2025-08-20	GeMS: Efficient Gaussian Splatting for Extreme Motion Blur	Gopi Raju Matta et.al.	2508.14682	null
2025-08-20	UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling	Peiming Li et.al.	2508.14604	null
2025-08-20	Safety-Critical Learning for Long-Tail Events: The TUM Traffic Accident Dataset	Walter Zimmer et.al.	2508.14567	null
2025-08-20	GOGS: High-Fidelity Geometry and Relighting for Glossy Objects via Gaussian Surfels	Xingyuan Yang et.al.	2508.14563	null
2025-08-20	Making Pose Representations More Expressive and Disentangled via Residual Vector Quantization	Sukhyun Jeong et.al.	2508.14561	null
2025-08-20	From Slices to Structures: Unsupervised 3D Reconstruction of Female Pelvic Anatomy from Freehand Transvaginal Ultrasound	Max Krähenmann et.al.	2508.14552	null
2025-08-20	LookOut: Real-World Humanoid Egocentric Navigation	Boxiao Pan et.al.	2508.14466	null
2025-08-20	D^3-Talker: Dual-Branch Decoupled Deformation Fields for Few-Shot 3D Talking Head Synthesis	Yuhang Guo et.al.	2508.14449	null
2025-08-20	Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting	Gyusam Chang et.al.	2508.14443	null
2025-08-20	HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation	Bing Han et.al.	2508.14431	null
2025-08-20	Learning Point Cloud Representations with Pose Continuity for Depth-Based Category-Level 6D Object Pose Estimation	Zhujun Li et.al.	2508.14358	null
2025-08-19	Pixels to Play: A Foundation Model for 3D Gameplay	Yuguang Yue et.al.	2508.14295	null
2025-08-21	GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting	Elena Alegret et.al.	2508.14278	null
2025-08-19	Tooth-Diffusion: Guided 3D CBCT Synthesis with Fine-Grained Tooth Conditioning	Said Djafar Said et.al.	2508.14276	null
2025-08-19	SLAM-based Safe Indoor Exploration Strategy	Omar Mostafa et.al.	2508.14235	null
2025-08-19	RynnEC: Bringing MLLMs into Embodied World	Ronghao Dang et.al.	2508.14160	null
2025-08-19	Automated surgical planning with nnU-Net: delineation of the anatomy in hepatobiliary phase MRI	Karin A. Olthof et.al.	2508.14133	null
2025-08-18	3D Cardiac Anatomy Generation Using Mesh Latent Diffusion Models	Jolanta Mozyrska et.al.	2508.14122	null
2025-08-19	LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos	Chin-Yang Lin et.al.	2508.14041	null
2025-08-19	Distilled-3DGS:Distilled 3D Gaussian Splatting	Lintao Xiang et.al.	2508.14037	null
2025-08-19	GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation	Ken Deng et.al.	2508.14036	null
2025-08-19	Online 3D Gaussian Splatting Modeling with Novel View Selection	Byeonggwon Lee et.al.	2508.14014	null
2025-08-19	ResPlan: A Large-Scale Vector-Graph Dataset of 17,000 Residential Floor Plans	Mohamed Abouagour et.al.	2508.14006	null
2025-08-19	Self-Supervised Sparse Sensor Fusion for Long Range Perception	Edoardo Palladin et.al.	2508.13995	null
2025-08-19	Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment	Samuel Seligardi et.al.	2508.13989	null
2025-08-19	OmViD: Omni-supervised active learning for video action detection	Aayush Rana et.al.	2508.13983	null
2025-08-19	ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving	Xianda Guo et.al.	2508.13977	null
2025-08-19	Augmenting cobots for sheet-metal SMEs with 3D object recognition and localisation	Martijn Cramer et.al.	2508.13964	null
2025-08-19	Real-Time, Population-Based Reconstruction of 3D Bone Models via Very-Low-Dose Protocols	Yiqun Lin et.al.	2508.13947	null
2025-08-19	PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis	Chunji Lv et.al.	2508.13911	null
2025-08-21	Latent Interpolation Learning Using Diffusion Models for Cardiac Volume Reconstruction	Niklas Bubeck et.al.	2508.13826	null
2025-08-19	Is-NeRF: In-scattering Neural Radiance Field for Blurred Images	Nan Luo et.al.	2508.13808	null
2025-08-19	Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing	Feng-Lin Liu et.al.	2508.13797	null
2025-08-19	VisionLaw: Inferring Interpretable Intrinsic Dynamics from Visual Observations via Bilevel Optimization	Jiajing Lin et.al.	2508.13792	null
2025-08-19	Shape-from-Template with Generalised Camera	Agniva Sengupta et.al.	2508.13791	null
2025-08-19	Blast Hole Seeking and Dipping – The Navigation and Perception Framework in a Mine Site Inspection Robot	Liyang Liu et.al.	2508.13785	null
2025-08-19	Deep Biomechanically-Guided Interpolation for Keypoint-Based Brain Shift Registration	Tiago Assis et.al.	2508.13762	null
2025-08-19	Unleashing Semantic and Geometric Priors for 3D Scene Completion	Shiyuan Chen et.al.	2508.13601	null
2025-08-19	The 9th AI City Challenge	Zheng Tang et.al.	2508.13564	null
2025-08-19	Learnable SMPLify: A Neural Solution for Optimization-Free Human Pose Inverse Kinematics	Yuchen Yang et.al.	2508.13562	null
2025-08-22	FLAIR: Frequency and Locality-Aware Implicit Neural Representations	Sukhun Ko et.al.	2508.13544	null
2025-08-19	EAvatar: Expression-Aware Head Avatar Reconstruction with Generative Geometry Priors	Shikun Zhang et.al.	2508.13537	null
2025-08-19	FAMNet: Integrating 2D and 3D Features for Micro-expression Recognition via Multi-task Learning and Hierarchical Attention	Liangyu Fu et.al.	2508.13483	null
2025-08-18	Susceptibility Distortion Correction of Diffusion MRI with a single Phase-Encoding Direction	Sedigheh Dargahi et.al.	2508.13340	null
2025-08-18	InnerGS: Internal Scenes Rendering via Factorized 3D Gaussian Splatting	Shuxin Liang et.al.	2508.13287	null
2025-08-17	PreSem-Surf: RGB-D Surface Reconstruction with Progressive Semantic Modeling and SG-MLP Pre-Rendering Mechanism	Yuyan Ye et.al.	2508.13228	null
2025-08-18	4DNeX: Feed-Forward 4D Generative Modeling Made Easy	Zhaoxi Chen et.al.	2508.13154	null
2025-08-18	IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion	Wenhao Hu et.al.	2508.13153	null
2025-08-24	Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping	Siddharth Khandelwal et.al.	2508.13065	null
2025-08-18	IntelliCap: Intelligent Guidance for Consistent View Sampling	Ayaka Yasunaga et.al.	2508.13043	null
2025-08-18	Multi-Phase Automated Segmentation of Dental Structures in CBCT Using a Lightweight Auto3DSeg and SegResNet Implementation	Dominic LaBella et.al.	2508.12962	null
2025-08-18	MaskSem: Semantic-Guided Masking for Learning 3D Hybrid High-Order Motion Representation	Wei Wei et.al.	2508.12948	null
2025-08-18	Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models	Jianshu Zeng et.al.	2508.12945	null
2025-08-18	CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction	Zhiwei Ning et.al.	2508.12917	null
2025-08-18	CTFlow: Video-Inspired Latent Flow Matching for 3D CT Synthesis	Jiayi Wang et.al.	2508.12900	null
2025-08-18	MCTR: Midpoint Corrected Triangulation for Autonomous Racing via Digital Twin Simulation in CARLA	Junhao Ye et.al.	2508.12729	null
2025-08-18	Quantifying and Alleviating Co-Adaptation in Sparse-View 3D Gaussian Splatting	Kangjie Chen et.al.	2508.12720	null
2025-08-18	Neural Rendering for Sensor Adaptation in 3D Object Detection	Felix Embacher et.al.	2508.12695	null
2025-08-18	Refine-and-Contrast: Adaptive Instance-Aware BEV Representations for Multi-UAV Collaborative Object Detection	Zhongyao Li et.al.	2508.12684	null
2025-08-18	Stable Diffusion-Based Approach for Human De-Occlusion	Seung Young Noh et.al.	2508.12663	null
2025-08-18	DyCrowd: Towards Dynamic Crowd Reconstruction from a Large-scene Video	Hao Wen et.al.	2508.12644	null
2025-08-18	Synthesizing Accurate and Realistic T1-weighted Contrast-Enhanced MR Images using Posterior-Mean Rectified Flow	Bastian Brandstötter et.al.	2508.12640	null
2025-08-19	WIPES: Wavelet-based Visual Primitives	Wenhao Zhang et.al.	2508.12615	null
2025-08-17	Segmenting Thalamic Nuclei: T1 Maps Provide a Reliable and Efficient Solution	Anqi Feng et.al.	2508.12508	null
2025-08-17	FractMorph: A Fractional Fourier-Based Multi-Domain Transformer for Deformable Image Registration	Shayan Kebriti et.al.	2508.12445	null
2025-08-21	TiP4GEN: Text to Immersive Panorama 4D Scene Generation	Ke Xing et.al.	2508.12415	null
2025-08-19	SRMA-Mamba: Spatial Reverse Mamba Attention Network for Pathological Liver Segmentation in MRI Volumes	Jun Zeng et.al.	2508.12410	null
2025-08-17	Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR	Fatemeh Ghorbani Lohesara et.al.	2508.12336	null
2025-08-17	Semi-Infinite Programming for Collision-Avoidance in Optimal and Model Predictive Control	Yunfan Gao et.al.	2508.12335	null
2025-08-17	Improving Densification in 3D Gaussian Splatting for High-Fidelity Rendering	Xiaobin Deng et.al.	2508.12313	null
2025-08-17	In vivo 3D ultrasound computed tomography of musculoskeletal tissues with generative neural physics	Zhijun Zeng et.al.	2508.12226	null
2025-08-17	Splat Feature Solver	Butian Xiong et.al.	2508.12216	null
2025-08-16	RealTalk: Realistic Emotion-Aware Lifelike Talking-Head Synthesis	Wenqing Wang et.al.	2508.12163	null
2025-08-16	VELVET-Med: Vision and Efficient Language Pre-training for Volumetric Imaging Tasks in Medicine	Ziyang Zhang et.al.	2508.12108	null
2025-08-16	Enhancing 3D point accuracy of laser scanner through multi-stage convolutional neural network for applications in construction	Qinyuan Fan et.al.	2508.12089	null
2025-08-16	VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models	Haidong Xu et.al.	2508.12081	null
2025-08-16	OASIS: Real-Time Opti-Acoustic Sensing for Intervention Systems in Unstructured Environments	Amy Phung et.al.	2508.12071	null
2025-08-16	InstDrive: Instance-Aware 3D Gaussian Splatting for Driving Scenes	Hongyuan Liu et.al.	2508.12015	null
2025-08-16	UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding	Yueming Xu et.al.	2508.11952	null
2025-08-16	Transferable Class Statistics and Multi-scale Feature Approximation for 3D Object Detection	Hao Peng et.al.	2508.11951	null
2025-08-16	OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation	Jilei Mao et.al.	2508.11898	null
2025-08-16	ComplicitSplat: Downstream Models are Vulnerable to Blackbox Attacks by 3D Gaussian Splat Camouflages	Matthew Hull et.al.	2508.11854	null
2025-08-15	Towards Understanding 3D Vision: the Role of Gaussian Curvature	Sherlon Almeida da Silva et.al.	2508.11825	null
2025-08-15	CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion	Zhe Zhu et.al.	2508.11603	null
2025-08-15	Remove360: Benchmarking Residuals After Object Removal in 3D Gaussian Splatting	Simona Kocour et.al.	2508.11431	null
2025-08-15	RMFAT: Recurrent Multi-scale Feature Atmospheric Turbulence Mitigator	Zhiming Liu et.al.	2508.11409	null
2025-08-15	G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration	Ramil Khafizov et.al.	2508.11379	null
2025-08-15	AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis	Zonglin Wu et.al.	2508.11375	null
2025-08-15	HOID-R1: Reinforcement Learning for Open-World Human-Object Interaction Detection Reasoning with Multimodal Large Language Model	Zhenhao Zhang et.al.	2508.11350	null
2025-08-15	Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking	Haonan Zhang et.al.	2508.11323	null
2025-08-15	Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction	Muzammil Khan et.al.	2508.11282	null
2025-08-15	Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds	Pei He et.al.	2508.11265	null
2025-08-15	Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception	Junjie Wang et.al.	2508.11256	null
2025-08-15	StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation	Seungmi Lee et.al.	2508.11203	null
2025-08-15	CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector	Abhinav Kumar et.al.	2508.11185	null
2025-08-14	HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing	Xinjie Gao et.al.	2508.11106	null
2025-08-14	Data-Driven Abdominal Phenotypes of Type 2 Diabetes in Lean, Overweight, and Obese Cohorts	Lucas W. Remedios et.al.	2508.11063	null
2025-08-14	Advancing 3D Scene Understanding with MV-ScanQA Multi-View Reasoning Evaluation and TripAlign Pre-training Dataset	Wentao Mo et.al.	2508.11058	null
2025-08-20	3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation	Nikolaos Gkanatsios et.al.	2508.11002	null
2025-08-12	Vision-Only Gaussian Splatting for Collaborative Semantic Occupancy Prediction	Cheng Chen et.al.	2508.10936	null
2025-08-18	HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffision Model	Qi Liu et.al.	2508.10935	null
2025-08-12	ViPE: Video Pose Engine for 3D Geometric Perception	Jiahui Huang et.al.	2508.10934	null
2025-08-14	Quantum Visual Fields with Neural Amplitude Encoding	Shuteng Wang et.al.	2508.10900	null
2025-08-14	Puppeteer: Rig and Animate Your 3D Models	Chaoyue Song et.al.	2508.10898	null
2025-08-14	Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning	Mengyuan Liu et.al.	2508.10897	null
2025-08-14	STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer	Yushi Lan et.al.	2508.10893	null
2025-08-14	TexVerse: A Universe of 3D Objects with High-Resolution Textures	Yibo Zhang et.al.	2508.10868	null
2025-08-14	An Efficient Model-Driven Groupwise Approach for Atlas Construction	Ziwei Zou et.al.	2508.10743	null
2025-08-14	Novel View Synthesis using DDIM Inversion	Sehajdeep SIngh et.al.	2508.10688	null
2025-08-14	Physics-Informed Joint Multi-TE Super-Resolution with Implicit Neural Representation for Robust Fetal T2 Mapping	Busra Bulut et.al.	2508.10680	null
2025-08-14	DIVA-VQA: Detecting Inter-frame Variations in UGC Video Quality	Xinyi Wang et.al.	2508.10605	null
2025-08-14	SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving	Philipp Wolters et.al.	2508.10567	null
2025-08-15	PTQAT: A Hybrid Parameter-Efficient Quantization Algorithm for 3D Perception Tasks	Xinhao Wang et.al.	2508.10557	null
2025-08-14	Multi-Sample Anti-Aliasing and Constrained Optimization for 3D Gaussian Splatting	Zheng Zhou et.al.	2508.10507	null
2025-08-14	STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes	Keishi Ishihara et.al.	2508.10427	null
2025-08-14	SC-Lane: Slope-aware and Consistent Road Height Estimation Framework for 3D Lane Detection	Chaesong Park et.al.	2508.10411	null
2025-08-14	Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models	Hyundo Lee et.al.	2508.10382	null
2025-08-14	VIFSS: View-Invariant and Figure Skating-Specific Pose Representation Learning for Temporal Action Segmentation	Ryota Tanaka et.al.	2508.10281	null
2025-08-14	Deep Learning for Crack Detection: A Review of Learning Paradigms, Generalizability, and Datasets	Xinan Zhang et.al.	2508.10256	null
2025-08-13	EntropyGS: An Efficient Entropy Coding on 3D Gaussian Splatting	Yuning Huang et.al.	2508.10227	null
2025-08-13	B-repLer: Semantic B-rep Latent Editor using Large Language Models	Yilin Liu et.al.	2508.10201	null
2025-08-18	From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation	Ke Niu et.al.	2508.10118	null
2025-08-13	A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation	Shuting He et.al.	2508.09977	null
2025-08-13	PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image	Geonhee Sim et.al.	2508.09973	null
2025-08-13	LIA-X: Interpretable Latent Portrait Animator	Yaohui Wang et.al.	2508.09959	null
2025-08-13	E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras	Chaoran Feng et.al.	2508.09912	null
2025-08-13	HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics	Weiqi Li et.al.	2508.09858	null
2025-08-13	Toward Human-Robot Teaming: Learning Handover Behaviors from 3D Scenes	Yuekun Wu et.al.	2508.09855	null
2025-08-13	ARI3D: A Software for Interactive Quantification of Regions in X-Ray CT 3D Images	Jan Phillipp Albrecht et.al.	2508.09849	null
2025-08-13	RayletDF: Raylet Distance Fields for Generalizable 3D Surface Reconstruction from Point Clouds or Gaussians	Shenxing Wei et.al.	2508.09830	null
2025-08-13	TRACE: Learning 3D Gaussian Physical Dynamics from Multi-view Videos	Jinxi Li et.al.	2508.09811	null
2025-08-13	Automated Segmentation of Coronal Brain Tissue Slabs for 3D Neuropathology	Jonathan Williams Ramirez et.al.	2508.09805	null
2025-08-13	MUJICA: Reforming SISR Models for PBR Material Super-Resolution via Cross-Map Attention	Xin Du et.al.	2508.09802	null
2025-08-13	Surg-InvNeRF: Invertible NeRF for 3D tracking and reconstruction in surgical vision	Gerardo Loza et.al.	2508.09681	null
2025-08-13	GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors	Xingyilang Yin et.al.	2508.09667	null
2025-08-13	Noise-adapted Neural Operator for Robust Non-Line-of-Sight Imaging	Lianfang Wang et.al.	2508.09655	null
2025-08-13	TOTNet: Occlusion-Aware Temporal Tracking for Robust Ball Detection in Sports Videos	Hao Xu et.al.	2508.09650	null
2025-08-13	The Brain Resection Multimodal Image Registration (ReMIND2Reg) 2025 Challenge	Reuben Dorent et.al.	2508.09649	null
2025-08-13	Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors	Giorgos Karvounas et.al.	2508.09629	null
2025-08-14	Semantic-aware DropSplat: Adaptive Pruning of Redundant Gaussians for 3D Aerial-View Segmentation	Xu Tang et.al.	2508.09626	null
2025-08-13	MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography	Daniel Barco et.al.	2508.09616	null
2025-08-13	DualPhys-GS: Dual Physically-Guided 3D Gaussian Splatting for Underwater Scene Reconstruction	Jiachen Li et.al.	2508.09610	null
2025-08-15	SVG-Head: Hybrid Surface-Volumetric Gaussians for High-Fidelity Head Reconstruction and Real-Time Editing	Heyi Sun et.al.	2508.09597	null
2025-08-13	CaRoBio: 3D Cable Routing with a Bio-inspired Gripper Fingernail	Jiahui Zuo et.al.	2508.09558	null
2025-08-14	Iterative Volume Fusion for Asymmetric Stereo Matching	Yuanting Gao et.al.	2508.09543	null
2025-08-13	SkySplat: Generalizable 3D Gaussian Splatting from Multi-Temporal Sparse Satellite Images	Xuejun Huang et.al.	2508.09479	null
2025-08-13	CitySeg: A 3D Open Vocabulary Semantic Segmentation Foundation Model in City-scale Scenarios	Jialei Xu et.al.	2508.09470	null
2025-08-13	DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation	Haoxiang Shi et.al.	2508.09444	null
2025-08-13	Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving	Guangxun Zhu et.al.	2508.09404	null
2025-08-12	X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents	Guoxian Song et.al.	2508.09383	null
2025-08-12	Gradient-Direction-Aware Density Control for 3D Gaussian Splatting	Zheng Zhou et.al.	2508.09239	null
2025-08-12	Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices	Ya Zou et.al.	2508.09136	null
2025-08-13	GeoVLA: Empowering 3D Representations in Vision-Language-Action Models	Lin Sun et.al.	2508.09071	null
2025-08-12	A new dataset and comparison for multi-camera frame synthesis	Conall Daly et.al.	2508.09068	null
2025-08-12	VLM-3D:End-to-End Vision-Language Models for Open-World 3D Perception	Fuhao Chang et.al.	2508.09061	null
2025-08-12	DASC: Depth-of-Field Aware Scene Complexity Metric for 3D Visualization on Light Field Display	Kamran Akbar et.al.	2508.08928	null
2025-08-12	Masked Clustering Prediction for Unsupervised Point Cloud Pre-training	Bin Ren et.al.	2508.08910	null
2025-08-12	GaussianUpdate: Continual 3D Gaussian Splatting Update for Changing Environments	Lin Zeng et.al.	2508.08867	null
2025-08-12	DiffPhysCam: Differentiable Physics-Based Camera Simulation for Inverse Rendering and Embodied AI	Bo-Hsun Chen et.al.	2508.08831	null
2025-08-12	3DFroMLLM: 3D Prototype Generation only from Pretrained Multimodal LLMs	Noor Ahmed et.al.	2508.08821	null
2025-08-12	MonoPartNeRF:Human Reconstruction from Monocular Video via Part-Based Neural Radiance Fields	Yao Lu et.al.	2508.08798	null
2025-08-12	SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA)	Trong-Thuan Nguyen et.al.	2508.08781	null
2025-08-12	ROD: RGB-Only Fast and Efficient Off-road Freespace Detection	Tong Sun et.al.	2508.08697	null
2025-08-14	Yan: Foundational Interactive Video Generation	Deheng Ye et.al.	2508.08601	null
2025-08-12	RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space	Jingyun Liang et.al.	2508.08588	null
2025-08-12	Bio-Generative Design Morphology with Radiolaria: An application of a Nature-Based Generative Shape Grammar for Geometrical Design of Space Frames	Michael Kleiss et.al.	2508.08572	null
2025-08-12	Revisiting the City Tower Project: Geometric Principles and Structural Morphology in the Works of Louis I. Kahn and Anne Tyng	Aysan Mokhtarimousavi et.al.	2508.08561	null
2025-08-11	Empowering Children to Create AI-Enabled Augmented Reality Experiences	Lei Zhang et.al.	2508.08467	null
2025-08-11	Enhanced Liver Tumor Detection in CT Images Using 3D U-Net and Bat Algorithm for Hyperparameter Optimization	Nastaran Ghorbani et.al.	2508.08452	null
2025-08-11	ImageDDI: Image-enhanced Molecular Motif Sequence Representation for Drug-Drug Interaction Prediction	Yuqin He et.al.	2508.08338	null
2025-08-11	Learning an Implicit Physics Model for Image-based Fluid Simulation	Emily Yue-Ting Jia et.al.	2508.08254	null
2025-08-11	ReferSplat: Referring Segmentation in 3D Gaussian Splatting	Shuting He et.al.	2508.08252	null
2025-08-11	LL3M: Large Language 3D Modelers	Sining Lu et.al.	2508.08228	null
2025-08-11	SAGOnline: Segment Any Gaussians Online	Wentao Sun et.al.	2508.08219	null
2025-08-11	Spatial-ORMLLM: Improve Spatial Relation Understanding in the Operating Room with Multimodal Large Language Model	Peiqi He et.al.	2508.08199	null
2025-08-11	Emergent morphogenesis via planar fabrication enabled by a reduced model of composites	Yupeng Zhang et.al.	2508.08198	null
2025-08-12	3D Human Mesh Estimation from Single View RGBD	Ozhan Suat et.al.	2508.08178	null
2025-08-13	CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data	Chongke Bi et.al.	2508.08173	null
2025-08-11	FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting	Yitong Yang et.al.	2508.08136	null
2025-08-11	GRASPTrack: Geometry-Reasoned Association via Segmentation and Projection for Multi-Object Tracking	Xudong Han et.al.	2508.08117	null
2025-08-11	3D Plant Root Skeleton Detection and Extraction	Jiakai Lin et.al.	2508.08094	null
2025-08-11	Matrix-3D: Omnidirectional Explorable 3D World Generation	Zhongqi Yang et.al.	2508.08086	null
2025-08-11	S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix	Peng Dai et.al.	2508.08048	null
2025-08-11	Aerial Target Encirclement and Interception with Noisy Range Observations	Fen Liu et.al.	2508.08046	null
2025-08-11	TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation	Huawei Sun et.al.	2508.08038	null
2025-08-11	Mitigating Biases in Surgical Operating Rooms with Geometry	Tony Danjun Wang et.al.	2508.08028	null
2025-08-11	TrackOR: Towards Personalized Intelligent Operating Rooms Through Robust Tracking	Tony Danjun Wang et.al.	2508.07968	null
2025-08-11	Safeguarding Generative AI Applications in Preclinical Imaging through Hybrid Anomaly Detection	Jakub Binda et.al.	2508.07923	null
2025-08-11	Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models	Johanna P. Müller et.al.	2508.07903	null
2025-08-11	NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction	Tianle Zeng et.al.	2508.07897	null
2025-08-11	Autonomous Navigation of Cloud-Controlled Quadcopters in Confined Spaces Using Multi-Modal Perception and LLM-Driven High Semantic Reasoning	Shoaib Ahmmad et.al.	2508.07885	null
2025-08-11	Vertex Features for Neural Global Illumination	Rui Su et.al.	2508.07852	null
2025-08-11	Tracking Any Point Methods for Markerless 3D Tissue Tracking in Endoscopic Stereo Images	Konrad Reuter et.al.	2508.07851	null
2025-08-11	CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving	Qi Xiang et.al.	2508.07838	null
2025-08-11	DiTVR: Zero-Shot Diffusion Transformer for Video Restoration	Sicheng Gao et.al.	2508.07811	null
2025-08-11	Pose-RFT: Enhancing MLLMs for 3D Pose Generation via Hybrid Action Reinforcement Fine-Tuning	Bao Li et.al.	2508.07804	null
2025-08-11	MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks	Yushen Xu et.al.	2508.07803	null
2025-08-11	Forecasting Continuous Non-Conservative Dynamical Systems in SO(3)	Lennart Bastian et.al.	2508.07775	null
2025-08-13	Multi-view Normal and Distance Guidance Gaussian Splatting for Surface Reconstruction	Bo Jia et.al.	2508.07701	null
2025-08-11	Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing	Weitao Wang et.al.	2508.07700	null
2025-08-11	GraphCoT-VLA: A 3D Spatial-Aware Reasoning Vision-Language-Action Model for Robotic Manipulation with Ambiguous Instructions	Helong Huang et.al.	2508.07650	null
2025-08-11	Breaking Down and Building Up: Mixture of Skill-Based Vision-and-Language Navigation Agents	Tianyi Ma et.al.	2508.07642	null
2025-08-11	End-to-End Humanoid Robot Safe and Comfortable Locomotion Policy	Zifan Wang et.al.	2508.07611	null
2025-08-12	Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring	Ludan Zhang et.al.	2508.07552	null
2025-08-11	CoT-Pose: Chain-of-Thought Reasoning for 3D Pose Generation from Abstract Prompts	Junuk Cha et.al.	2508.07540	null
2025-08-10	Novel View Synthesis with Gaussian Splatting: Impact on Photogrammetry Model Accuracy and Resolution	Pranav Chougule et.al.	2508.07483	null
2025-08-10	CharacterShot: Controllable and Consistent 4D Character Animation	Junyao Gao et.al.	2508.07409	null
2025-08-10	DIP-GS: Deep Image Prior For Gaussian Splatting Sparse View Recovery	Rajaei Khatib et.al.	2508.07372	null
2025-08-12	GS4Buildings: Prior-Guided Gaussian Splatting for 3D Building Reconstruction	Qilin Zhang et.al.	2508.07355	null
2025-08-10	Navigation and Exploration with Active Inference: from Biology to Industry	Daria de Tinguy et.al.	2508.07269	null
2025-08-10	Fading the Digital Ink: A Universal Black-Box Attack Framework for 3DGS Watermarking Systems	Qingyuan Zeng et.al.	2508.07263	null
2025-08-12	Understanding Dynamic Scenes in Ego Centric 4D Point Clouds	Junsheng Huang et.al.	2508.07251	null
2025-08-12	3D Gaussian Representations with Motion Trajectory Field for Dynamic Scene Reconstruction	Xuesong Li et.al.	2508.07182	null
2025-08-10	CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion	Xiaotong Lin et.al.	2508.07162	null
2025-08-09	DexFruit: Dexterous Manipulation and Gaussian Splatting Inspection of Fruit	Aiden Swann et.al.	2508.07118	null
2025-08-09	AugLift: Boosting Generalization in Lifting-based 3D Human Pose Estimation	Nikolai Warner et.al.	2508.07112	null
2025-08-09	Communication-Efficient Multi-Agent 3D Detection via Hybrid Collaboration	Yue Hu et.al.	2508.07092	null
2025-08-09	ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting	Sandro Papais et.al.	2508.07089	null
2025-08-09	TeSO: Representing and Compressing 3D Point Cloud Scenes with Textured Surfel Octree	Yueyu Hu et.al.	2508.07083	null
2025-08-09	SAGCNet: Spatial-Aware Graph Completion Network for Missing Slice Imputation in Population CMR Imaging	Junkai Liu et.al.	2508.07041	null
2025-08-09	3DGS-VBench: A Comprehensive Video Quality Evaluation Benchmark for 3DGS Compression	Yuke Xing et.al.	2508.07038	null
2025-08-12	HiMat: DiT-based Ultra-High Resolution SVBRDF Generation	Zixiong Wang et.al.	2508.07011	null
2025-08-09	Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments	Gian Mario Favero et.al.	2508.07006	null
2025-08-09	EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events	Siyu Chen et.al.	2508.07003	null
2025-08-09	Evaluating Fisheye-Compatible 3D Gaussian Splatting Methods on Real Images Beyond 180 Degree Field of View	Ulas Gunes et.al.	2508.06968	null
2025-08-09	Hybrid Machine Learning Framework for Predicting Geometric Deviations from 3D Surface Metrology	Hamidreza Samadi et.al.	2508.06845	null
2025-08-09	Edge Detection for Organ Boundaries via Top Down Refinement and SubPixel Upsampling	Aarav Mehta et.al.	2508.06805	null
2025-08-09	DiffUS: Differentiable Ultrasound Rendering from Volumetric Imaging	Noe Bertramo et.al.	2508.06768	null
2025-08-09	VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions	Yash Garg et.al.	2508.06757	null
2025-08-08	Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video	Jixuan He et.al.	2508.06715	null
2025-08-08	Fourier Optics and Deep Learning Methods for Fast 3D Reconstruction in Digital Holography	Justin London et.al.	2508.06703	null
2025-08-08	CoDe-NeRF: Neural Rendering via Dynamic Coefficient Decomposition	Wenpeng Xing et.al.	2508.06632	null
2025-08-08	LightSwitch: Multi-view Relighting with Material-guided Diffusion	Yehonathan Litman et.al.	2508.06494	null
2025-08-08	MotionSwap	Om Patil et.al.	2508.06430	null
2025-08-08	FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation	Wenbin Teng et.al.	2508.06392	null
2025-08-08	ViPro-2: Unsupervised State Estimation via Integrated Dynamics for Guiding Video Prediction	Patrick Takenaka et.al.	2508.06335	null
2025-08-08	L2Calib: $SE(3)$ -Manifold Reinforcement Learning for Robust Extrinsic Calibration with Degenerate Motion Resilience	Baorun Li et.al.	2508.06330	null
2025-08-08	Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging?	Xin Ci Wong et.al.	2508.06327	null
2025-08-08	Real-Time 3D Vision-Language Embedding Mapping	Christian Rauch et.al.	2508.06291	null
2025-08-08	Situationally-aware Path Planning Exploiting 3D Scene Graphs	Saad Ejaz et.al.	2508.06283	null
2025-08-08	XAG-Net: A Cross-Slice Attention and Skip Gating Network for 2.5D Femur MRI Segmentation	Byunghyun Ko et.al.	2508.06258	null
2025-08-08	PA-HOI: A Physics-Aware Human and Object Interaction Dataset	Ruiyan Wang et.al.	2508.06205	null
2025-08-08	AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection	Zhaopeng Gu et.al.	2508.06203	null
2025-08-11	UW-3DGS: Underwater 3D Reconstruction with Physics-Aware Gaussian Splatting	Wenpeng Xing et.al.	2508.06169	null
2025-08-08	Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation	YoungChan Choi et.al.	2508.06136	null
2025-08-12	GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving	Jian Wang et.al.	2508.06113	null
2025-08-08	MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment	Gui Zou et.al.	2508.06104	null
2025-08-08	Towards MR-Based Trochleoplasty Planning	Michael Wehrli et.al.	2508.06076	null
2025-08-08	LV-Net: Anatomy-aware lateral ventricle shape modeling with a case study on Alzheimer’s disease, the Australian Imaging Biomarkers and Lifestyle flagship study of ageing	Wonjung Park et.al.	2508.06055	null
2025-08-08	Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body Parts	Kiran Chhatre et.al.	2508.06032	null
2025-08-08	ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors	Minsu Kim et.al.	2508.06014	null
2025-08-08	AnimateScene: Camera-controllable Animation in Any Scene	Qingyang Liu et.al.	2508.05982	null
2025-08-08	A 3DGS-Diffusion Self-Supervised Framework for Normal Estimation from a Single Image	Yanxing Liang et.al.	2508.05950	null
2025-08-08	Enhancing Construction Site Analysis and Understanding with 3D Segmentation	Sri Ramana Saketh Vasanthawada et.al.	2508.05922	null
2025-08-07	HOLODECK 2.0: Vision-Language-Guided 3D World Generation with Editing	Zixuan Bian et.al.	2508.05899	null
2025-08-07	MZEN: Multi-Zoom Enhanced NeRF for 3-D Reconstruction with Unknown Camera Poses	Jong-Ik Park et.al.	2508.05819	null
2025-08-07	Optimization-Free Style Transfer for 3D Gaussian Splats	Raphael Du Sablon et.al.	2508.05813	null
2025-08-07	MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss	Can Zhao et.al.	2508.05772	null
2025-08-07	GAP: Gaussianize Any Point Clouds with Text Guidance	Weiqi Zhang et.al.	2508.05631	null
2025-08-07	Physically Controllable Relighting of Photographs	Chris Careaga et.al.	2508.05626	null
2025-08-07	Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity	Yuhan Zhang et.al.	2508.05609	null
2025-08-07	Robust adaptive fuzzy sliding mode control for trajectory tracking for of cylindrical manipulator	Van Cuong Pham et.al.	2508.05584	null
2025-08-07	Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis	Kunyu Feng et.al.	2508.05580	null
2025-08-07	Point cloud segmentation for 3D Clothed Human Layering	Davide Garavaso et.al.	2508.05531	null
2025-08-07	Head Anchor Enhanced Detection and Association for Crowded Pedestrian Tracking	Zewei Wu et.al.	2508.05514	null
2025-08-07	MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips	Shibo Wang et.al.	2508.05506	null
2025-08-07	Symmetry Understanding of 3D Shapes via Chirality Disentanglement	Weikang Wang et.al.	2508.05505	null
2025-08-07	Computational Design and Fabrication of Modular Robots with Untethered Control	Manas Bhargava et.al.	2508.05410	null
2025-08-07	CT-GRAPH: Hierarchical Graph Attention Network for Anatomy-Guided CT Report Generation	Hamza Kalisch et.al.	2508.05375	null
2025-08-07	3DGabSplat: 3D Gabor Splatting for Frequency-adaptive Radiance Field Rendering	Junyu Zhou et.al.	2508.05343	null
2025-08-08	CF3: Compact and Fast 3D Feature Fields	Hyunjoon Lee et.al.	2508.05254	null
2025-08-07	Coarse-to-Fine Joint Registration of MR and Ultrasound Images via Imaging Style Transfer	Junyi Wang et.al.	2508.05240	null
2025-08-07	EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery	Bingyu Yang et.al.	2508.05205	null
2025-08-07	Refining Gaussian Splatting: A Volumetric Densification Approach	Mohamed Abdul Gafoor et.al.	2508.05187	null
2025-08-07	Learning to See and Act: Task-Aware View Planning for Robotic Manipulation	Yongjie Bai et.al.	2508.05186	null
2025-08-07	FCBV-Net: Category-Level Robotic Garment Smoothing via Feature-Conditioned Bimanual Value Prediction	Mohammed Daba et.al.	2508.05153	null
2025-08-07	FedGIN: Federated Learning with Dynamic Global Intensity Non-linear Augmentation for Organ Segmentation using Multi-modal Images	Sachin Dudda Nagaraju et.al.	2508.05137	null
2025-08-07	A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding	Mahmoud Chick Zaouali et.al.	2508.05064	null
2025-08-07	DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion	Yifeng Huang et.al.	2508.05060	null
2025-08-07	MAG-Nav: Language-Driven Object Navigation Leveraging Memory-Reserved Active Grounding	Weifan Zhang et.al.	2508.05021	null
2025-08-07	Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion	Shenglun Chen et.al.	2508.04984	null
2025-08-07	UGOD: Uncertainty-Guided Differentiable Opacity and Soft Dropout for Enhanced Sparse-View 3DGS	Zhihao Guo et.al.	2508.04968	null
2025-08-07	Laplacian Analysis Meets Dynamics Modelling: Gaussian Splatting for 4D Reconstruction	Yifan Zhou et.al.	2508.04966	null
2025-08-07	Perceive-Sample-Compress: Towards Real-Time 3D Gaussian Splatting	Zijian Wang et.al.	2508.04965	null
2025-08-06	CryoGS: Gaussian Splatting for Cryo-EM Homogeneous Reconstruction	Suyi Chen et.al.	2508.04929	null
2025-08-06	LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction	Md Zahidul Hasan et.al.	2508.04847	null
2025-08-06	Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models	Mehrdad Moradi et.al.	2508.04818	null
2025-08-05	Neural Field-Based 3D Surface Reconstruction of Microstructures from Multi-Detector Signals in Scanning Electron Microscopy	Shuo Chen et.al.	2508.04728	null
2025-08-06	Occupancy Learning with Spatiotemporal Memory	Ziyang Leng et.al.	2508.04705	null
2025-08-06	BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning	Ziyang Leng et.al.	2508.04702	null
2025-08-06	MienCap: Realtime Performance-Based Facial Animation with Live Mood Dynamics	Ye Pan et.al.	2508.04687	null
2025-08-06	PixCuboid: Room Layout Estimation from Multi-view Featuremetric Alignment	Gustav Hanning et.al.	2508.04659	null
2025-08-06	OmniDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment	Tongfan Guan et.al.	2508.04611	null
2025-08-06	$NavA^3$ : Understanding Any Instruction, Navigating Anywhere, Finding Anything	Lingfeng Zhang et.al.	2508.04598	null
2025-08-06	Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline	Linqing Zhao et.al.	2508.04597	null
2025-08-06	LA-CaRe-CNN: Cascading Refinement CNN for Left Atrial Scar Segmentation	Franz Thaler et.al.	2508.04553	null
2025-08-06	Surf3R: Rapid Surface Reconstruction from Sparse RGB Views in Seconds	Haodong Zhu et.al.	2508.04508	null
2025-08-06	MonoCloth: Reconstruction and Animation of Cloth-Decoupled Human Avatars from Monocular Videos	Daisheng Jin et.al.	2508.04505	null
2025-08-06	4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation	Shuzhou Yang et.al.	2508.04467	null
2025-08-06	Deep Learning-based Scalable Image-to-3D Facade Parser for Generating Thermal 3D Building Models	Yinan Yu et.al.	2508.04406	null
2025-08-06	RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph Optimization	Yanyan Li et.al.	2508.04335	null
2025-08-07	Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research	Ke Li et.al.	2508.04326	null
2025-08-06	MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction	Yaopeng Lou et.al.	2508.04297	null
2025-08-06	PKSS-Align: Robust Point Cloud Registration on Pre-Kendall Shape Space	Chenlei Lv et.al.	2508.04286	null
2025-08-06	PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction	Muhua Zhu et.al.	2508.04236	null
2025-08-06	SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition	Jiahui Li et.al.	2508.04224	null
2025-08-06	Small Lesions-aware Bidirectional Multimodal Multiscale Fusion Network for Lung Disease Classification	Jianxun Yu et.al.	2508.04205	null
2025-08-06	IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control	Lijuan Liu et.al.	2508.04147	null
2025-08-06	DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting	Zexu Huang et.al.	2508.04099	null
2025-08-06	Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework	Yi-Ting Chen et.al.	2508.04090	null
2025-08-06	RLGS: Reinforcement Learning-Based Adaptive Hyperparameter Tuning for Gaussian Splatting	Zhan Li et.al.	2508.04078	null
2025-08-06	Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation	Jiayi He et.al.	2508.04049	null
2025-08-06	JanusNet: Hierarchical Slice-Block Shuffle and Displacement for Semi-Supervised 3D Multi-Organ Segmentation	Zheng Zhang et.al.	2508.03997	null
2025-08-05	La La LiDAR: Large-Scale Layout Generation from LiDAR Data	Youquan Liu et.al.	2508.03691	null
2025-08-05	Veila: Panoramic LiDAR Generation from a Monocular RGB Image	Youquan Liu et.al.	2508.03690	null
2025-08-05	Inland-LOAM: Voxel-Based Structural Semantic Mapping for Inland Waterways	Zhongbi Luo et.al.	2508.03672	null
2025-08-05	OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World	Katherine Liu et.al.	2508.03669	null
2025-08-06	Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images	Xiangyu Sun et.al.	2508.03643	null
2025-08-05	FPG-NAS: FLOPs-Aware Gated Differentiable Neural Architecture Search for Efficient 6DoF Pose Estimation	Nassim Ali Ousalah et.al.	2508.03618	null
2025-08-05	CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models	Ana Lawry Aguila et.al.	2508.03594	null
2025-08-05	Spatial Imputation Drives Cross-Domain Alignment for EEG Classification	Hongjun Liu et.al.	2508.03437	null
2025-08-05	WaMo: Wavelet-Enhanced Multi-Frequency Trajectory Analysis for Fine-Grained Text-Motion Retrieval	Junlong Ren et.al.	2508.03343	null
2025-08-05	Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion	Wentao Qu et.al.	2508.03252	null
2025-08-05	Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing	Hongyu Shen et.al.	2508.03227	null
2025-08-05	Monocular Depth Estimation with Global-Aware Discretization and Local Context Modeling	Heng Wu et.al.	2508.03186	null
2025-08-05	Duplex-GS: Proxy-Guided Weighted Blending for Real-Time Order-Independent Gaussian Splatting	Weihang Liu et.al.	2508.03180	null
2025-08-05	H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction	Heng Jia et.al.	2508.03118	null
2025-08-05	Point2Act: Efficient 3D Distillation of Multimodal LLMs for Zero-Shot Context-Aware Grasping	Sang Min Kim et.al.	2508.03099	null
2025-08-05	RobustGS: Unified Boosting of Feedforward 3D Gaussian Splatting under Low-Quality Conditions	Anran Wu et.al.	2508.03077	null
2025-08-05	SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation	Bo Zhang et.al.	2508.03069	null
2025-08-05	A Survey of Medical Point Cloud Shape Learning: Registration, Reconstruction and Variation	Tongxu Zhang et.al.	2508.03057	null
2025-08-05	SA-3DGS: A Self-Adaptive Compression Method for 3D Gaussian Splatting	Liheng Zhang et.al.	2508.03017	null
2025-08-05	ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion	Meng Zhou et.al.	2508.03008	null
2025-08-05	GACL: Grounded Adaptive Curriculum Learning with Active Task and Performance Monitoring	Linji Wang et.al.	2508.02988	null
2025-08-04	Evaluation of 3D Counterfactual Brain MRI Generation	Pengwei Sun et.al.	2508.02880	null
2025-08-04	MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model	Tianheng Zhu et.al.	2508.02858	null
2025-08-04	GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing	Mikołaj Zieliński et.al.	2508.02831	null
2025-08-04	PyCAT4: A Hierarchical Vision Transformer-based Framework for 3D Human Pose Estimation	Zongyou Yang et.al.	2508.02806	null
2025-08-04	PMGS: Reconstruction of Projectile Motion across Large Spatiotemporal Spans via 3D Gaussian Splatting	Yijun Xu et.al.	2508.02660	null
2025-08-04	RL-U $^2$ Net: A Dual-Branch UNet with Reinforcement Learning-Assisted Multimodal Feature Fusion for Accurate 3D Whole-Heart Segmentation	Jierui Qu et.al.	2508.02557	null
2025-08-04	Uncertainty-Aware Perception-Based Control for Autonomous Racing	Jelena Trisovic et.al.	2508.02494	null
2025-08-05	Low-Frequency First: Eliminating Floating Artifacts in 3D Gaussian Splatting	Jianchao Wang et.al.	2508.02493	null
2025-08-06	GR-Gaussian: Graph-Based Radiative Gaussian Splatting for Sparse-View CT Reconstruction	Yikuang Yuluo et.al.	2508.02408	null
2025-08-04	Correspondence-Free Fast and Robust Spherical Point Pattern Registration	Anik Sarker et.al.	2508.02339	null
2025-08-04	Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images	Philipp Wulff et.al.	2508.02323	null
2025-08-04	ASDR: Exploiting Adaptive Sampling and Data Reuse for CIM-based Instant Neural Rendering	Fangxin Liu et.al.	2508.02304	null
2025-08-04	Unleashing the Temporal Potential of Stereo Event Cameras for Continuous-Time 3D Object Detection	Jae-Young Kang et.al.	2508.02288	null
2025-08-04	SplatSSC: Decoupled Depth-Guided Gaussian Splatting for Semantic Scene Completion	Rui Qian et.al.	2508.02261	null
2025-08-04	GaussianCross: Cross-modal Self-supervised 3D Representation Learning via Gaussian Splatting	Lei Yao et.al.	2508.02172	null
2025-08-04	Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes	Tom Fischer et.al.	2508.02157	null
2025-08-04	ScrewSplat: An End-to-End Method for Articulated Object Recognition	Seungyeon Kim et.al.	2508.02146	null
2025-08-04	VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling	Yuru Xiao et.al.	2508.02129	null
2025-08-04	REACT-KD: Region-Aware Cross-modal Topological Knowledge Distillation for Interpretable Medical Image Classification	Hongzhao Chen et.al.	2508.02104	null
2025-08-04	StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion	Haoxin Yang et.al.	2508.02056	null
2025-08-04	Protego: User-Centric Pose-Invariant Privacy Protection Against Face Recognition-Induced Digital Footprint Exposure	Ziling Wang et.al.	2508.02034	null
2025-08-04	On-the-Fly Object-aware Representative Point Selection in Point Cloud	Xiaoyu Zhang et.al.	2508.01980	null
2025-08-04	From Photons to Physics: Autonomous Indoor Drones and the Future of Objective Property Assessment	Petteri Teikari et.al.	2508.01965	null
2025-08-03	Less is More: AMBER-AFNO – a New Benchmark for Lightweight 3D Medical Image Segmentation	Andrea Dosi et.al.	2508.01941	null
2025-08-03	MUTE-DSS: A Digital-Twin-Based Decision Support System for Minimizing Underwater Radiated Noise in Ship Voyage Planning	Akash Venkateshwaran et.al.	2508.01907	null
2025-08-03	Beyond Vulnerabilities: A Survey of Adversarial Attacks as Both Threats and Defenses in Computer Vision Systems	Zhongliang Guo et.al.	2508.01845	null
2025-08-03	OmniEvent: Unified Event Representation Learning	Weiqi Yan et.al.	2508.01842	null
2025-08-03	Diffusion-based 3D Hand Motion Recovery with Intuitive Physics	Yufei Zhang et.al.	2508.01835	null
2025-08-03	Skip priors and add graph-based anatomical information, for point-based Couinaud segmentation	Xiaotong Zhang et.al.	2508.01785	null
2025-08-05	VPN: Visual Prompt Navigation	Shuo Feng et.al.	2508.01766	null
2025-08-03	AG $^2$ aussian: Anchor-Graph Structured Gaussian Splatting for Instance-Level 3D Scene Understanding and Editing	Zhaonan Wang et.al.	2508.01740	null
2025-08-03	OpenMap: Instruction Grounding via Open-Vocabulary Visual-Language Mapping	Danyang Li et.al.	2508.01723	null
2025-08-03	LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving	Luqi Cheng et.al.	2508.01704	null
2025-08-03	Register Anything: Estimating “Corresponding Prompts” for Segment Anything Model	Shiqi Huang et.al.	2508.01697	null
2025-08-03	DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing	Yufeng Chi et.al.	2508.01684	null
2025-08-03	DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding	Hanqing Wang et.al.	2508.01651	null
2025-08-03	StrandDesigner: Towards Practical Strand Generation with Sketch Guidance	Na Zhang et.al.	2508.01650	null
2025-08-03	Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection	Hanxi Li et.al.	2508.01591	null
2025-08-03	A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction	Hua Yu et.al.	2508.01585	null
2025-08-03	Deeply Supervised Multi-Task Autoencoder for Biological Brain Age estimation using three dimensional T $_1$ -weighted magnetic resonance imaging	Mehreen Kanwal et.al.	2508.01565	null
2025-08-03	Adaptive LiDAR Scanning: Harnessing Temporal Cues for Efficient 3D Object Detection via Multi-Modal Fusion	Sara Shoouri et.al.	2508.01562	null
2025-08-02	Decentralized Aerial Manipulation of a Cable-Suspended Load using Multi-Agent Reinforcement Learning	Jack Zeng et.al.	2508.01522	null
2025-08-02	EfficientGFormer: Multimodal Brain Tumor Segmentation via Pruned Graph-Augmented Transformer	Fatemeh Ziaeetabar et.al.	2508.01465	null
2025-08-02	Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians	Quankai Gao et.al.	2508.01464	null
2025-08-02	Uncertainty-Aware Segmentation Quality Prediction via Deep Learning Bayesian Modeling: Comprehensive Evaluation and Interpretation on Skin Cancer and Liver Segmentation	Sikha O K et.al.	2508.01460	null
2025-08-05	3DRot: 3D Rotation Augmentation for RGB-Based 3D Tasks	Shitian Yang et.al.	2508.01423	null
2025-08-02	ReMu: Reconstructing Multi-layer 3D Clothed Human from Image Layers	Onat Vuran et.al.	2508.01381	null
2025-08-02	P3P Made Easy	Seong Hun Lee et.al.	2508.01312	null
2025-08-02	C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor	Haoquan Lu et.al.	2508.01311	null
2025-08-02	CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis	Alec Sargood et.al.	2508.01292	null
2025-08-02	Integrating Disparity Confidence Estimation into Relative Depth Prior-Guided Unsupervised Stereo Matching	Chuang-Wei Liu et.al.	2508.01275	null
2025-08-05	MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh	Shuangkang Fang et.al.	2508.01242	null
2025-08-02	OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS	Han Ling et.al.	2508.01239	null
2025-08-02	Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system	Jiyong Kim et.al.	2508.01230	null
2025-08-02	MoGaFace: Momentum-Guided and Texture-Aware Gaussian Avatars for Consistent Facial Geometry	Yujian Liu et.al.	2508.01218	null
2025-08-02	Perspective from a Broader Context: Can Room Style Knowledge Help Visual Floorplan Localization?	Bolei Chen et.al.	2508.01216	null
2025-08-02	A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding	Zhan Shi et.al.	2508.01197	null
2025-08-02	Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning	Xinhang Wan et.al.	2508.01184	null
2025-08-02	No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views	Ranran Huang et.al.	2508.01171	null
2025-08-02	DELTAv2: Accelerating Dense 3D Tracking	Tuan Duc Ngo et.al.	2508.01170	null
2025-08-02	OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding	Dianyi Yang et.al.	2508.01150	null
2025-08-02	Design of Q8bot: A Miniature, Low-Cost, Dynamic Quadruped Built with Zero Wires	Yufeng Wu et.al.	2508.01149	null
2025-08-02	UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation	Chaitanya Patel et.al.	2508.01126	null
2025-08-01	DreamSat-2.0: Towards a General Single-View Asteroid 3D Reconstruction	Santiago Diaz et.al.	2508.01079	null
2025-08-01	Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation	Fenghe Tang et.al.	2508.01064	null
2025-08-01	Structured Spectral Graph Learning for Anomaly Classification in 3D Chest CT Scans	Theo Di Piazza et.al.	2508.01045	null
2025-08-01	3D Reconstruction via Incremental Structure From Motion	Muhammad Zeeshan et.al.	2508.01019	null
2025-08-01	Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection	Cheng-You Lu et.al.	2508.01014	null
2025-08-01	Cooperative Perception: A Resource-Efficient Framework for Multi-Drone 3D Scene Reconstruction Using Federated Diffusion and NeRF	Massoud Pourmandi et.al.	2508.00967	null
2025-07-31	Investigating Crossing Perception in 3D Graph Visualisation	Ying Zhang et.al.	2508.00950	null
2025-08-01	IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation	Wenxuan Guo et.al.	2508.00823	null
2025-08-01	Cross-Dataset Semantic Segmentation Performance Analysis: Unifying NIST Point Cloud City Datasets for 3D Deep Learning	Alexander Nikitas Dimopoulos et.al.	2508.00822	null
2025-08-01	GECO: Geometrically Consistent Embedding with Lightspeed Inference	Regine Hartwig et.al.	2508.00746	null
2025-08-01	Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR	Adwait Chandorkar et.al.	2508.00744	null
2025-08-04	DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior	Junzhe Lu et.al.	2508.00599	null
2025-08-01	OmniUnet: A Multimodal Network for Unstructured Terrain Segmentation on Planetary Rovers Using RGB, Depth, and Thermal Imagery	Raul Castilla-Arquillo et.al.	2508.00580	null
2025-08-04	LesiOnTime – Joint Temporal and Clinical Modeling for Small Breast Lesion Segmentation in Longitudinal DCE-MRI	Mohammed Kamran et.al.	2508.00496	null
2025-08-01	HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection	Jiaping Cao et.al.	2508.00473	null
2025-08-01	Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation	Nan Xiang et.al.	2508.00428	null
2025-08-01	Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting	Seunggeun Chi et.al.	2508.00427	null
2025-08-01	Sari Sandbox: A Virtual Retail Store Environment for Embodied AI Agents	Janika Deborah Gajo et.al.	2508.00400	null
2025-08-01	Occlusion-robust Stylization for Drawing-based 3D Animation	Sunjae Yoon et.al.	2508.00398	null
2025-08-01	SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies	Liang Han et.al.	2508.00366	null
2025-08-01	Stable at Any Speed: Speed-Driven Multi-Object Tracking with Learnable Kalman Filtering	Yan Gong et.al.	2508.00358	null
2025-08-01	Omni-Scan: Creating Visually-Accurate Digital Twin Object Models Using a Bimanual Robot with Handover and Gaussian Splat Merging	Tianshuang Qiu et.al.	2508.00354	null
2025-08-01	AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer	Jin Lyu et.al.	2508.00298	null
2025-08-01	Towards Robust Semantic Correspondence: A Benchmark and Insights	Wenyue Chong et.al.	2508.00272	null
2025-08-05	Multimodal Referring Segmentation: A Survey	Henghui Ding et.al.	2508.00265	null
2025-08-01	PointGauss: Point Cloud-Guided Multi-Object Segmentation for Gaussian Splatting	Wentao Sun et.al.	2508.00259	null
2025-08-01	Weakly Supervised Intracranial Aneurysm Detection and Segmentation in MR angiography via Multi-task UNet with Vesselness Prior	Erin Rainville et.al.	2508.00235	null
2025-07-31	Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs	Bhavya Goyal et.al.	2508.00169	null
2025-07-31	GEPAR3D: Geometry Prior-Assisted Learning for 3D Tooth Segmentation	Tomasz Szczepański et.al.	2508.00155	null
2025-07-31	Stress-Aware Resilient Neural Training	Ashkan Shakarami et.al.	2508.00098	null
2025-07-31	Punching Bag vs. Punching Person: Motion Transferability in Videos	Raiyaan Abdullah et.al.	2508.00085	null
2025-07-31	Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis	Bowen Zhang et.al.	2507.23785	null
2025-07-31	Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions	Li Siyao et.al.	2507.23778	null
2025-07-31	SeqAffordSplat: Scene-level Sequential Affordance Reasoning on 3D Gaussian Splatting	Di Li et.al.	2507.23772	null
2025-08-05	Topology Optimization in Medical Image Segmentation with Fast Euler Characteristic	Liu Li et.al.	2507.23763	null
2025-07-31	Enhanced Velocity Field Modeling for Gaussian Video Reconstruction	Zhenyang Li et.al.	2507.23704	null
2025-07-31	Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents	Shaofei Cai et.al.	2507.23698	null
2025-07-31	High-resolution eikonal imaging and uncertainty quantification of the Kilauea caldera	Angela F. Gao et.al.	2507.23692	null
2025-07-31	I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation	Jialei Chen et.al.	2507.23683	null
2025-07-31	Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes	Xiaohan Li et.al.	2507.23677	null
2025-07-31	DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation	Yuchen Zhou et.al.	2507.23599	null
2025-08-02	MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction	Zijian Dong et.al.	2507.23597	null
2025-07-31	Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization	Maxime Pietrantoni et.al.	2507.23569	null
2025-07-31	3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection	Yung-Hsu Yang et.al.	2507.23567	null
2025-08-01	H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation	Hongzhe Bi et.al.	2507.23523	null
2025-07-31	Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion	Mutian Xu et.al.	2507.23483	null
2025-07-31	FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction	Donghyun Lee et.al.	2507.23480	null
2025-07-31	3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding	Ting Huang et.al.	2507.23478	null
2025-08-01	NeRF Is a Valuable Assistant for 3D Gaussian Splatting	Shuangkang Fang et.al.	2507.23374	null
2025-07-31	MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting	Xingyue Peng et.al.	2507.23340	null
2025-08-01	Training-free Geometric Image Editing on Diffusion Models	Hanshen Zhu et.al.	2507.23300	null
2025-07-31	iLRM: An Iterative Large 3D Reconstruction Model	Gyeongjin Kang et.al.	2507.23277	null
2025-07-31	GSFusion:Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting	Jaeseok Park et.al.	2507.23273	null
2025-07-31	Towards Affordable Tumor Segmentation and Visualization for 3D Breast MRI Using SAM2	Solha Kang et.al.	2507.23272	null
2025-07-30	Details Matter for Indoor Open-vocabulary 3D Instance Segmentation	Sanghun Jung et.al.	2507.23134	null
2025-07-30	Rethink Domain Generalization in Heterogeneous Sequence MRI Segmentation	Zheyuan Zhang et.al.	2507.23110	null
2025-07-30	Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation	Alexandru Buburuzan et.al.	2507.23058	null
2025-07-30	Adaptive Time-step Training for Enhancing Spike-Based Neural Radiance Fields	Ranxi Lin et.al.	2507.23033	null
2025-07-30	Learning to Prune Branches in Modern Tree-Fruit Orchards	Abhinav Jain et.al.	2507.23015	null
2025-07-30	Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction	Zhensheng Yuan et.al.	2507.23006	null
2025-07-30	Viser: Imperative, Web-based 3D Visualization in Python	Brent Yi et.al.	2507.22885	null
2025-07-30	DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion	Qingcheng Zhao et.al.	2507.22825	null
2025-07-30	Wall Shear Stress Estimation in Abdominal Aortic Aneurysms: Towards Generalisable Neural Surrogate Models	Patryk Rygiel et.al.	2507.22817	null
2025-07-30	Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques	Weide Liu et.al.	2507.22791	null
2025-07-30	Social-Pose: Enhancing Trajectory Prediction with Human Body Pose	Yang Gao et.al.	2507.22742	null
2025-07-30	A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks	Hang Su et.al.	2507.22733	null
2025-07-30	Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints	Thuy Tran et.al.	2507.22699	null
2025-07-30	Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation	Hongbin Lin et.al.	2507.22668	null
2025-07-30	trAIce3D: A Prompt-Driven Transformer Based U-Net for Semantic Segmentation of Microglial Cells from Large-Scale 3D Microscopy Images	MohammadAmin Alamalhoda et.al.	2507.22635	null
2025-07-30	Estimating 2D Camera Motion with Hybrid Motion Basis	Haipeng Li et.al.	2507.22480	null
2025-07-30	UAVScenes: A Multi-Modal Dataset for UAVs	Sijie Wang et.al.	2507.22412	null
2025-07-30	UFV-Splatter: Pose-Free Feed-Forward 3D Gaussian Splatting Adapted to Unfavorable Views	Yuki Fujimura et.al.	2507.22342	null
2025-07-30	A Segmentation Framework for Accurate Diagnosis of Amyloid Positivity without Structural Images	Penghan Zhu et.al.	2507.22336	null
2025-07-29	Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception	Christian Ellis et.al.	2507.22194	null
2025-07-29	Enhancing efficiency in paediatric brain tumour segmentation using a pathologically diverse single-center clinical dataset	A. Piffer et.al.	2507.22152	null
2025-07-29	Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos	Ziren Gong et.al.	2507.22052	null
2025-07-29	ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports	Mohammed Baharoon et.al.	2507.22030	null
2025-07-29	Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images	Yutao Hu et.al.	2507.22024	null
2025-07-29	XAI for Point Cloud Data using Perturbations based on Meaningful Segmentation	Raju Ningappa Mulawade et.al.	2507.22020	null
2025-07-29	DISCOVERSE: Efficient Robot Simulation in Complex High-Fidelity Environments	Yufei Jia et.al.	2507.21981	null
2025-07-29	PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction	Jiahui Ren et.al.	2507.21960	null
2025-07-31	MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors	Shouyi Lu et.al.	2507.21872	null
2025-07-29	VidFuncta: Towards Generalizable Neural Representations for Ultrasound Videos	Julia Wolleb et.al.	2507.21863	null
2025-07-29	HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels	HunyuanWorld Team et.al.	2507.21809	null
2025-07-29	AU-LLM: Micro-Expression Action Unit Detection via Enhanced LLM-Based Feature Fusion	Zhishu Liu et.al.	2507.21778	null
2025-07-29	Multi-UAV Deployment in Obstacle-Cluttered Environments with LOS Connectivity	Yuda Chen et.al.	2507.21772	null
2025-07-30	No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering	Linye Wei et.al.	2507.21572	null
2025-07-29	Multi-View Reconstruction with Global Context for 3D Anomaly Detection	Yihan Sun et.al.	2507.21555	null
2025-07-29	LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments	Junhao Chen et.al.	2507.21517	null
2025-07-29	ST-DAI: Single-shot 2.5D Spatial Transcriptomics with Intra-Sample Domain Adaptive Imputation for Cost-efficient 3D Reconstruction	Jiahe Qian et.al.	2507.21516	null
2025-07-29	BANG: Dividing 3D Assets via Generative Exploded Dynamics	Longwen Zhang et.al.	2507.21493	null
2025-07-29	Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval	Zhichuan Wang et.al.	2507.21489	null
2025-07-28	Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View	Zitong Zhang et.al.	2507.21371	null
2025-08-03	Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy	Jicheng Yuan et.al.	2507.21358	null
2025-07-28	DEM-NeRF: A Neuro-Symbolic Method for Scientific Discovery through Physics-Informed Simulation	Wenkai Tan et.al.	2507.21350	null
2025-07-28	GLCP: Global-to-Local Connectivity Preservation for Tubular Structure Segmentation	Feixiang Zhou et.al.	2507.21328	null
2025-07-28	VoluMe – Authentic 3D Video Calls from Live Gaussian Splat Prediction	Martin de La Gorce et.al.	2507.21311	null
2025-07-28	Fluidically Innervated Lattices Make Versatile and Durable Tactile Sensors	Annan Zhang et.al.	2507.21225	null
2025-08-03	Reconstructing 4D Spatial Intelligence: A Survey	Yukang Cao et.al.	2507.21045	null
2025-07-28	GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction	Tianhao Li et.al.	2507.20963	null
2025-07-28	$S^3$ LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping	Ruoyu Fan et.al.	2507.20854	null
2025-07-28	An Efficient Machine Learning Framework for Forest Height Estimation from Multi-Polarimetric Multi-Baseline SAR data	Francesca Razzano et.al.	2507.20798	null
2025-07-28	KASportsFormer: Kinematic Anatomy Enhanced Transformer for 3D Human Pose Estimation on Short Sports Scene Video	Zhuoer Yin et.al.	2507.20763	null
2025-07-28	Methods for the Segmentation of Reticular Structures Using 3D LiDAR Data: A Comparative Evaluation	Francisco J. Soler Mora et.al.	2507.20589	null
2025-07-28	M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast	Jiacheng Lu et.al.	2507.20582	null
2025-07-28	Learning Phonetic Context-Dependent Viseme for Enhancing Speech-Driven 3D Facial Animation	Hyung Kyu Kim et.al.	2507.20568	null
2025-07-28	MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization	Hyung Kyu Kim et.al.	2507.20562	null
2025-07-28	Uni-Mapper: Unified Mapping Framework for Multi-modal LiDARs in Complex and Dynamic Environments	Gilhwan Kang et.al.	2507.20538	null
2025-07-28	Enhancing Spatial Reasoning through Visual and Textual Thinking	Xun Liang et.al.	2507.20529	null
2025-07-28	GaRe: Relightable 3D Gaussian Splatting for Outdoor Scenes from Unconstrained Photo Collections	Haiyang Bai et.al.	2507.20512	null
2025-07-28	Automated 3D-GS Registration and Fusion via Skeleton Alignment and Gaussian-Adaptive Features	Shiyang Liu et.al.	2507.20480	null
2025-07-29	From Gallery to Wrist: Realistic 3D Bracelet Insertion in Videos	Chenjian Gao et.al.	2507.20331	null
2025-07-27	Decomposing Densification in Gaussian Splatting for Faster 3D Scene Reconstruction	Binxiao Huang et.al.	2507.20239	null
2025-07-27	NeuroVoxel-LM: Language-Aligned 3D Perception via Dynamic Voxelization and Meta-Embedding	Shiyu Liu et.al.	2507.20110	null
2025-07-26	High-Speed Event Vision-Based Tactile Roller Sensor for Large Surface Measurements	Akram Khairi et.al.	2507.19914	null
2025-07-30	RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection	Xiaokai Bai et.al.	2507.19856	null
2025-07-26	Taking Language Embedded 3D Gaussian Splatting into the Wild	Yuze Wang et.al.	2507.19830	null
2025-07-25	GSCache: Real-Time Radiance Caching for Volume Path Tracing using 3D Gaussian Splatting	David Bauer et.al.	2507.19718	null
2025-07-25	DINO-SLAM: DINO-informed RGB-D SLAM for Neural Implicit and Explicit Representations	Ziren Gong et.al.	2507.19474	null
2025-07-25	Fast Learning of Non-Cooperative Spacecraft 3D Models through Primitive Initialization	Pol Francesch Huc et.al.	2507.19459	null
2025-07-25	NerT-CA: Efficient Dynamic Reconstruction from Sparse-view X-ray Coronary Angiography	Kirsten W. H. Maas et.al.	2507.19328	null
2025-07-25	3DGauCIM: Accelerating Static/Dynamic 3D Gaussian Splatting via Digital CIM for High Frame Rate Real-Time Edge Rendering	Wei-Hsing Huang et.al.	2507.19133	null
2025-07-25	Gaussian Set Surface Reconstruction through Per-Gaussian Optimization	Zhentao Huang et.al.	2507.18923	null
2025-07-24	SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time	Yun Chen et.al.	2507.18713	null
2025-07-24	Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping	Chong Cheng et.al.	2507.18541	null
2025-07-24	G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM	Gyuhyeon Pak et.al.	2507.18344	null
2025-07-24	LONG3R: Long Sequence Streaming 3D Reconstruction	Zhuoguang Chen et.al.	2507.18255	null
2025-07-24	PS-GS: Gaussian Splatting for Multi-View Photometric Stereo	Yixiao Chen et.al.	2507.18231	null
2025-07-24	High-fidelity 3D Gaussian Inpainting: preserving multi-view consistency and photorealistic details	Jun Zhou et.al.	2507.18023	null
2025-07-24	Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners	Kostas Karakontis et.al.	2507.17519	null
2025-07-23	Exploring Active Learning for Label-Efficient Training of Semantic Neural Radiance Field	Yuzhe Zhu et.al.	2507.17351	null
2025-07-23	Temporal Smoothness-Aware Rate-Distortion Optimized 4D Gaussian Splatting	Hyeongmin Lee et.al.	2507.17336	null
2025-07-24	PolarAnything: Diffusion-based Polarimetric Image Synthesis	Kailong Zhang et.al.	2507.17268	null
2025-07-22	StreamME: Simplify 3D Gaussian Avatar within Live Stream	Luchuan Song et.al.	2507.17029	null
2025-07-22	VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences	Kai Deng et.al.	2507.16443	null
2025-07-22	Sparse-View 3D Reconstruction: Recent Advances and Open Challenges	Tanveer Younis et.al.	2507.16406	null
2025-07-22	Dens3R: A Foundation Model for 3D Geometry Prediction	Xianze Fang et.al.	2507.16290	null
2025-07-22	LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence Images	Guichen Huang et.al.	2507.16144	null
2025-07-21	Appearance Harmonization via Bilateral Grid Prediction with Transformers for 3DGS	Jisu Shin et.al.	2507.15748	null
2025-07-21	DWTGS: Rethinking Frequency Regularization for Sparse-view 3D Gaussian Splatting	Hung Nguyen et.al.	2507.15690	null
2025-07-21	Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing	Boni Hu et.al.	2507.15683	null
2025-07-21	Gaussian Splatting with Discretized SDF for Relightable Assets	Zuo-Liang Zhu et.al.	2507.15629	null
2025-07-28	SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting	Zihui Gao et.al.	2507.15602	null
2025-07-21	ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting	Ruijie Zhu et.al.	2507.15454	null
2025-07-25	GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing	Minnan Pei et.al.	2507.15300	null
2025-07-20	3-Dimensional CryoEM Pose Estimation and Shift Correction Pipeline	Kaishva Chintan Shah et.al.	2507.14924	null
2025-07-20	Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction	Xiufeng Huang et.al.	2507.14921	null
2025-07-20	An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks	Xinyi Wu et.al.	2507.14798	null
2025-11-05	Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey	Jiahui Zhang et.al.	2507.14501	null
2025-07-19	Adaptive 3D Gaussian Splatting Video Streaming: Visual Saliency-Aware Tiling and Meta-Learning-Based Bitrate Adaptation	Han Gong et.al.	2507.14454	null
2025-07-19	Adaptive 3D Gaussian Splatting Video Streaming	Han Gong et.al.	2507.14432	null
2025-08-01	C-DOG: Multi-View Multi-instance Feature Association Using Connected δ-Overlap Graphs	Yung-Hong Sun et.al.	2507.14095	null
2025-07-18	TimeNeRF: Building Generalizable Neural Radiance Fields across Time from Few-Shot Input Views	Hsiang-Hui Hung et.al.	2507.13929	null
2025-07-18	Neural-GASh: A CGA-based neural radiance prediction pipeline for real-time shading	Efstratios Geronikolakis et.al.	2507.13917	null
2025-07-21	PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations	Yu Wei et.al.	2507.13891	null
2025-07-18	EPSilon: Efficient Point Sampling for Lightening of Hybrid-based 3D Avatar Generation	Seungjun Moon et.al.	2507.13648	null
2025-07-18	Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation	Masahiro Ogawa et.al.	2507.13628	null
2025-07-19	AutoPartGen: Autogressive 3D Part Generation and Discovery	Minghao Chen et.al.	2507.13346	null
2025-07-16	VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians	Siyuan Yao et.al.	2507.12667	null
2025-07-16	NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting	Kuangshi Ai et.al.	2507.12621	null
2025-07-21	Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition	Beizhen Zhao et.al.	2507.12498	null
2025-07-19	SpatialTrackerV2: 3D Point Tracking Made Easy	Yuxi Xiao et.al.	2507.12462	null
2025-07-16	Revealing the Ancient Beauty: Digital Reconstruction of Temple Tiles using Computer Vision	Arkaprabha Basu et.al.	2507.12195	null
2025-07-16	DoRF: Doppler Radiance Fields for Robust Human Activity Recognition Using Wi-Fi	Navid Hasanzadeh et.al.	2507.12132	null
2025-07-16	BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images	Davide Di Nucci et.al.	2507.12095	null
2025-07-16	SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation	Beining Xu et.al.	2507.12027	null
2025-07-16	HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing	Tielong Wang et.al.	2507.11971	null
2025-07-16	Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark	Jingqian Wu et.al.	2507.11931	null
2025-07-16	CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning	Peiwen Xia et.al.	2507.11834	null
2025-07-15	Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation	Zhen Xu et.al.	2507.11540	null
2025-07-21	Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling	Hayeon Kim et.al.	2507.11061	null
2025-07-14	ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions	Shivangi Aneja et.al.	2507.10542	null
2025-07-14	Binomial Self-Compensation: Mechanism and Suppression of Motion Error in Phase-Shifting Profilometry	Geyou Zhang et.al.	2507.10009	null
2025-07-19	3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving	Yixun Zhang et.al.	2507.09993	null
2025-07-14	VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling	Zihang Zeng et.al.	2507.09987	null
2025-07-11	From images to properties: a NeRF-driven framework for granular material parameter inversion	Cheng-Hsi Hsiao et.al.	2507.09005	null
2025-07-11	An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan	Mengyuan Liu et.al.	2507.08690	null
2025-07-11	Adaptive Framework for Ambient Intelligence in Rehabilitation Assistance	Gábor Baranyi et.al.	2507.08624	null
2025-07-11	Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT	Wei Zhang et.al.	2507.08448	null
2025-07-11	RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting	Ji Hyun Seo et.al.	2507.08434	null
2025-07-11	CL3R: 3D Reconstruction and Contrastive Learning for Enhanced Robotic Manipulation Representations	Wenbo Cui et.al.	2507.08262	null
2025-07-10	Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction	Hyungjun Doh et.al.	2507.08137	null
2025-07-18	RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration	Chong Cheng et.al.	2507.08136	null
2025-07-10	Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions	Longfei Li et.al.	2507.07978	null
2025-07-10	RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection	Yongyang Zhou et.al.	2507.07733	null
2025-08-01	Tile and Slide : A New Framework for Scaling NeRF from Local to Global 3D Earth Observation	Camille Billouard et.al.	2507.01631	null
2025-07-01	Puzzles: Unbounded Video-Depth Augmentation for Scalable End-to-End 3D Reconstruction	Jiahao Ma et.al.	2506.23863	null
2025-07-01	AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention	Ziao Liu et.al.	2506.23611	null
2025-07-23	Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction	Zhirui Gao et.al.	2506.21401	null
2025-09-03	Reconstructing Tornadoes in 3D with Gaussian Splatting	Adam Yang et.al.	2506.18677	null
2025-06-24	Limitations of NERF with pre-trained Vision Features for Few-Shot 3D Reconstruction	Ankit Sanjyal et.al.	2506.18208	null
2025-06-24	R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision	Weeyoung Kwon et.al.	2506.16262	null
2025-05-20	Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation	Seungjun Oh et.al.	2505.13215	null
2025-05-05	A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond	Jiajia Li et.al.	2505.00737	null
2025-05-01	GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction	Yuhan Xie et.al.	2504.21067	null
2025-04-10	Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting	Daiwei Zhang et.al.	2504.06978	null
2025-03-24	DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery	Jiadong Tang et.al.	2503.16964	null
2025-02-25	Graph-Guided Scene Reconstruction from Images with 3D Gaussian Splatting	Chong Cheng et.al.	2502.17377	null
2025-02-18	E-3DGS: Event-Based Novel View Rendering of Large-Scale Scenes Using 3D Gaussian Splatting	Sohaib Zahid et.al.	2502.10827	null
2025-05-27	Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting	Yansong Qu et.al.	2501.18672	null
2025-04-15	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015	null
2025-01-08	ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting	Yifeng Yang et.al.	2501.03605	null
2025-02-13	WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian Splatting	Chenghao Qian et.al.	2412.18862	null
2025-04-10	DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting	Luis Wiedmann et.al.	2412.10972	null
2025-08-15	DGNS: Deformable Gaussian Splatting and Dynamic Neural Surface for Monocular Dynamic 3D Reconstruction	Xuesong Li et.al.	2412.03910	null
2024-12-05	Splats in Splats: Embedding Invisible 3D Watermark within Gaussian Splatting	Yijia Guo et.al.	2412.03121	null
2024-12-16	LineGS : 3D Line Segment Representation on 3D Gaussian Splatting	Chenggang Yang et.al.	2412.00477	null
2025-03-25	TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting	Bojun Xiong et.al.	2411.19654	null
2024-12-23	GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision	Baixin Xu et.al.	2411.15723	null
2025-03-11	NexusSplats: Efficient 3D Gaussian Splatting in the Wild	Yuzhou Tang et.al.	2411.14514	null
2025-10-14	Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction	Yuanhao Cai et.al.	2411.14384	null
2025-08-11	MBA-SLAM: Motion Blur Aware Gaussian Splatting SLAM	Peng Wang et.al.	2411.08279	null
2024-10-29	ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings	Suyoung Lee et.al.	2410.20686	null
2024-10-15	SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction	Jialei Chen et.al.	2410.09292	null
2024-10-10	HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction	Shengji Tang et.al.	2410.06245	null
2025-04-23	ThermalGaussian: Thermal 3D Gaussian Splatting	Rongfeng Lu et.al.	2409.07200	null
2024-09-11	Sources of Uncertainty in 3D Scene Reconstruction	Marcus Klasson et.al.	2409.06407	null
2024-09-06	Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction	Shen Chen et.al.	2409.03213	null
2024-09-06	EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting	Yuchen Weng et.al.	2407.13520	null
2025-03-06	3DGS.zip: A survey on 3D Gaussian Splatting Compression Methods	Milena T. Bagdasarian et.al.	2407.09510	null
2024-07-12	Survey on Fundamental Deep Learning 3D Reconstruction Techniques	Yonge Bai et.al.	2407.08137	null
2024-10-30	GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction	Yuxuan Mu et.al.	2407.04237	null
2024-06-27	GS-Octree: Octree-based 3D Gaussian Splatting for Robust Object-level 3D Reconstruction Under Strong Lighting	Jiaze Li et.al.	2406.18199	null
2024-06-25	Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction	Yangdi Lu et.al.	2406.15982	null
2024-12-10	Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting	Junha Hyung et.al.	2406.11672	null
2025-02-28	Generative Gaussian Splatting for Unbounded 3D City Generation	Haozhe Xie et.al.	2406.06526	null
2025-05-07	3D-HGS: 3D Half-Gaussian Splatting	Haolin Li et.al.	2406.02720	null
2024-11-25	MaGS: Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting	Shaojie Ma et.al.	2406.01593	null
2024-05-29	A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction	Bin Zhang et.al.	2405.17891	null
2024-10-29	HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting	Yuanhao Cai et.al.	2405.15125	null
2024-06-04	Enhanced 3D Urban Scene Reconstruction and Point Cloud Densification using Gaussian Splatting and Google Earth Imagery	Kyle Gao et.al.	2405.11021	null
2024-09-02	Gaussian Splatting: 3D Reconstruction and Novel View Synthesis, a Review	Anurag Dalal et.al.	2405.03417	null
2024-04-16	3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis	Zhicheng Lu et.al.	2404.06270	null
2024-07-19	NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields	Muhammad Zubair Irshad et.al.	2404.01300	null
2025-01-24	3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting	Xiaoyang Lyu et.al.	2404.00409	null
2024-05-28	Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction	Qiuhong Shen et.al.	2403.18795	null
2024-03-19	Creating Seamless 3D Maps Using Radiance Fields	Sai Tarun Sathyan et.al.	2403.11364	null
2024-04-16	Recent Advances in 3D Gaussian Splatting	Tong Wu et.al.	2403.11134	null
2024-05-13	GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time	Hao Li et.al.	2403.10147	null
2024-10-29	Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis	Yuanhao Cai et.al.	2403.04116	null
2024-09-26	Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting	Joongho Jo et.al.	2402.13827	null
2024-08-07	Evaluating Neural Radiance Fields (NeRFs) for 3D Plant Geometry Reconstruction in Field Conditions	Muhammad Arbab Arshad et.al.	2402.10344	null
2024-02-02	ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields	Jiahua Dong et.al.	2402.00864	null
2024-01-30	GaussianBody: Clothed Human Reconstruction via 3d Gaussian Splatting	Mengtian Li et.al.	2401.09720	null
2025-10-07	A Survey on 3D Gaussian Splatting	Guikun Chen et.al.	2401.03890	null
2024-09-25	Deblurring 3D Gaussian Splatting	Byeonghyeon Lee et.al.	2401.00834	null
2024-04-08	pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction	David Charatan et.al.	2312.12337	null
2024-07-31	COLMAP-Free 3D Gaussian Splatting	Yang Fu et.al.	2312.07504	null
2024-04-09	Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields	Shijie Zhou et.al.	2312.03203	null
2025-03-18	NeuSG: Neural Implicit Surface Reconstruction with 3D Gaussian Splatting Guidance	Hanlin Chen et.al.	2312.00846	null
2024-07-09	Gaussian Grouping: Segment and Edit Anything in 3D Scenes	Mingqiao Ye et.al.	2312.00732	null
2023-11-29	Mip-Splatting: Alias-free 3D Gaussian Splatting	Zehao Yu et.al.	2311.16493	null
2023-12-21	GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting	Yiwen Chen et.al.	2311.14521	null
2024-11-27	GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise	Xinhai Li et.al.	2311.11221	null
2024-03-26	Structure-Aware Sparse-View X-ray 3D Reconstruction	Yuanhao Cai et.al.	2311.10959	null
2023-11-09	UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields	Injae Kim et.al.	2311.03784	null
2023-09-27	3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction	Miriam Jäger et.al.	2309.14800	null
2023-08-22	Strata-NeRF : Neural Radiance Fields for Stratified Scenes	Ankit Dhiman et.al.	2308.10337	null
2023-08-09	3D Gaussian Splatting for Real-Time Radiance Field Rendering	Bernhard Kerbl et.al.	2308.04079	null
2023-08-29	Seal-3D: Interactive Pixel-Level Editing for Neural Radiance Fields	Xiangyu Wang et.al.	2307.15131	null
2023-04-24	A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion	Miriam Jäger et.al.	2304.10664	null
2023-07-17	LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields	Tang Tao et.al.	2304.10406	null
2023-04-07	DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model	Hoigi Seo et.al.	2304.02827	null
2023-06-27	BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields	Peng Wang et.al.	2211.12853	null
2023-03-10	DP-NeRF: Deblurred Neural Radiance Field with Physical Scene Priors	Dogyoon Lee et.al.	2211.12046	null
2025-08-12	NeRF: Neural Radiance Field in 3D Vision: A Comprehensive Review (Updated Post-Gaussian Splatting)	Kyle Gao et.al.	2210.00379	null
2021-10-22	Style Agnostic 3D Reconstruction via Adversarial Style Transfer	Felix Petersen et.al.	2110.10784	null
2021-03-15	Do 2D GANs Know 3D Shape? Unsupervised 3D shape reconstruction from 2D Image GANs	Xingang Pan et.al.	2011.00844	null
2018-04-02	Extreme 3D Face Reconstruction: Seeing Through Occlusions	Anh Tuan Tran et.al.	1712.05083	null

Diffusion

Publish Date	Title	Authors	PDF	Code
2025-12-09	Astra: General Interactive World Model with Autoregressive Denoising	Yixuan Zhu et.al.	2512.08931	null
2025-12-09	On a cross-diffusion hybrid model: Cancer Invasion Tissue with Normal Cell Involved	Guanjun Pan et.al.	2512.08929	null
2025-12-09	Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs	Angela van Sprang et.al.	2512.08923	null
2025-12-09	Unified Diffusion Transformer for High-fidelity Text-Aware Image Restoration	Jin Hyeon Kim et.al.	2512.08922	null
2025-12-09	Self-Evolving 3D Scene Generation from a Single Image	Kaizhi Zheng et.al.	2512.08905	null
2025-12-09	UniLayDiff: A Unified Diffusion Transformer for Content-Aware Layout Generation	Zeyang Liu et.al.	2512.08897	null
2025-12-09	Differentially Private Synthetic Data Generation Using Context-Aware GANs	Anantaa Kotal et.al.	2512.08869	null
2025-12-09	Refining Diffusion Models for Motion Synthesis with an Acceleration Loss to Generate Realistic IMU Data	Lars Ole Häusler et.al.	2512.08859	null
2025-12-09	CARLoS: Retrieval via Concise Assessment Representation of LoRAs at Scale	Shahar Sarfaty et.al.	2512.08826	null
2025-12-09	Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps	Seoyeon Lee et.al.	2512.08774	null
2025-12-09	Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance	Ruihang Chu et.al.	2512.08765	null
2025-12-09	A Scalable Pipeline Combining Procedural 3D Graphics and Guided Diffusion for Photorealistic Synthetic Training Data Generation in White Button Mushroom Segmentation	Artúr I. Károly et.al.	2512.08747	null
2025-12-09	Pose-Based Sign Language Spotting via an End-to-End Encoder Architecture	Samuel Ebimobowei Johnny et.al.	2512.08738	null
2025-12-09	Exposing Hidden Biases in Text-to-Image Models via Automated Prompt Search	Manos Plitsis et.al.	2512.08724	null
2025-12-09	Diffusion and relaxation of topological excitations in layered spin liquids	Aprem P. Joy et.al.	2512.08712	null
2025-12-09	Gradient-Informed Monte Carlo Fine-Tuning of Diffusion Models for Low-Thrust Trajectory Design	Jannik Graebner et.al.	2512.08705	null
2025-12-09	Centrifugal instability of Taylor-Couette flow in stratified and diffusive fluids	Junho Park et.al.	2512.08664	null
2025-12-09	Flow-Based Modelling of Population Dynamics with Consecutive Continuous Mutations	Alexander Bratus et.al.	2512.08660	null
2025-12-09	Global Weak Solutions for the High–Friction Quantum Navier–Stokes–Poisson Model	Giada Cianfarani Carnevale et.al.	2512.08655	null
2025-12-09	Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank	Shaofeng Zhang et.al.	2512.08648	null
2025-12-09	Chain-of-Image Generation: Toward Monitorable and Controllable Image Generation	Young Kyung Kim et.al.	2512.08645	null
2025-12-09	MBE obtained n-CdO:Eu p-Si heterojunctions – electron beam induced profiling, electrical and structural properties	E Przezdziecka et.al.	2512.08587	null
2025-12-09	Disturbance-Free Surgical Video Generation from Multi-Camera Shadowless Lamps for Open Surgery	Yuna Kato et.al.	2512.08577	null
2025-12-09	A journey to ITACA	J. J. Gómez-Cadenas et.al.	2512.08549	null
2025-12-09	An Iteration-Free Fixed-Point Estimator for Diffusion Inversion	Yifei Chen et.al.	2512.08547	null
2025-12-09	A Novel Wasserstein Quaternion Generative Adversarial Network for Color Image Generation	Zhigang Jia et.al.	2512.08542	null
2025-12-09	Fast-ARDiff: An Entropy-informed Acceleration Framework for Continuous Space Autoregressive Generation	Zhen Zou et.al.	2512.08537	null
2025-12-09	Measuring the diffuse Galactic synchrotron spectral index and curvature between 45 and 2300 MHz	Melis O. Irfan et.al.	2512.08522	null
2025-12-09	Data-Efficient Learning of Anomalous Diffusion with Wavelet Representations: Enabling Direct Learning from Experimental Trajectories	Gongyi Wang et.al.	2512.08510	null
2025-12-09	OCCDiff: Occupancy Diffusion Model for High-Fidelity 3D Building Reconstruction from Noisy Point Clouds	Jialu Sui et.al.	2512.08506	null
2025-12-09	Beyond the Noise: Aligning Prompts with Latent Representations in Diffusion Models	Vasco Ramos et.al.	2512.08505	null
2025-12-09	Fractional Homogenization of Parabolic Equations with Long-Range Random Potentials	Atef Lechiheb et.al.	2512.08496	null
2025-12-09	LLM-based Vulnerable Code Augmentation: Generate or Refactor?	Dyna Soumhane Ouchebara et.al.	2512.08493	null
2025-12-09	Temporal Concept Dynamics in Diffusion Models via Prompt-Conditioned Interventions	Ada Gorgun et.al.	2512.08486	null
2025-12-09	Construction and Performance of Kinetic Schemes for Linear Systems of Conservation Laws	Emmanuel Audusse et.al.	2512.08479	null
2025-12-09	Core@Shell AgBr@CsPbBr3 Nanocrystals as Precursors to Hollow Lead Halide Perovskite Nanocubes	Zhanzhao Li et.al.	2512.08474	null
2025-12-09	Globular Cluster Systems in Dwarf Galaxies: Catalogs and Comparisons	Veronika Dornan et.al.	2512.08453	null
2025-12-09	A Grover-compatible manifold optimization algorithm for quantum search	Zhijian Lai et.al.	2512.08432	null
2025-12-09	Kick & spin: new probes for multi-messenger black-hole mergers in AGNs	Samson H. W. Leong et.al.	2512.08382	null
2025-12-09	Reflecting with Two Voices: A Co-Adaptive Dual-Strategy Framework for LLM-Based Agent Decision Making	Wentao Zhang et.al.	2512.08366	null
2025-12-09	Magneton: Optimizing Energy Efficiency of ML Systems via Differential Energy Debugging	Yi Pan et.al.	2512.08365	null
2025-12-09	SCU-CGAN: Enhancing Fire Detection through Synthetic Fire Image Generation and Dataset Augmentation	Ju-Young Kim et.al.	2512.08362	null
2025-12-09	Conditional Morphogenesis: Emergent Generation of Structural Digits via Neural Cellular Automata	Ali Sakour et.al.	2512.08360	null
2025-12-09	Hydrodynamic limit of the Vlasov-Poisson-Fokker-Planck system in low-field regime	Zhendong Fang et.al.	2512.08346	null
2025-12-09	DINO-BOLDNet: A DINOv3-Guided Multi-Slice Attention Network for T1-to-BOLD Generation	Jianwei Wang et.al.	2512.08337	null
2025-12-09	PointDico: Contrastive 3D Representation Learning Guided by Diffusion Models	Pengbo Li et.al.	2512.08330	null
2025-12-09	Interpreting Structured Perturbations in Image Protection Methods for Diffusion Models	Michael R. Martin et.al.	2512.08329	null
2025-12-09	GeoDiffMM: Geometry-Guided Conditional Diffusion for Motion Magnification	Xuedeng Liu et.al.	2512.08325	null
2025-12-09	Terrain Diffusion: A Diffusion-Based Successor to Perlin Noise in Infinite, Real-Time Terrain Generation	Alexander Goslin et.al.	2512.08309	null
2025-12-09	Triality and adjoint lifting for GL(3)	Wee Teck Gan et.al.	2512.08307	null
2025-12-09	OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation	Yexin Liu et.al.	2512.08294	null
2025-12-09	PAVAS: Physics-Aware Video-to-Audio Synthesis	Oh Hyun-Bin et.al.	2512.08282	null
2025-12-09	Model-Based Diffusion Sampling for Predictive Control in Offline Decision Making	Haldun Balim et.al.	2512.08280	null
2025-12-09	A Transcorrelated Wave-Function Framework for Solids: An Application to Bulk and Defected Silicon	Kristoffer Simula et.al.	2512.08276	null
2025-12-09	EgoX: Egocentric Video Generation from a Single Exocentric Video	Taewoong Kang et.al.	2512.08269	null
2025-12-09	Geometric-Stochastic Multimodal Deep Learning for Predictive Modeling of SUDEP and Stroke Vulnerability	Preksha Girish et.al.	2512.08257	null
2025-12-09	Geometry-Aware Sparse Depth Sampling for High-Fidelity RGB-D Depth Completion in Robotic Systems	Tony Salloom et.al.	2512.08229	null
2025-12-09	VisKnow: Constructing Visual Knowledge Base for Object Understanding	Ziwei Yao et.al.	2512.08221	null
2025-12-09	Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement	Chia-Hern Lai et.al.	2512.08215	null
2025-12-09	Supernovae Shock Breakout from Red Supergiants in Two Dimensions	Wun-Yi Chen et.al.	2512.08212	null
2025-12-09	Embodied Tree of Thoughts: Deliberate Manipulation Planning with Embodied World Model	Wenjiang Xu et.al.	2512.08188	null
2025-12-09	Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation	Meng Wei et.al.	2512.08186	null
2025-12-09	TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models	Zheng Ding et.al.	2512.08153	null
2025-12-09	FlowSteer: Conditioning Flow Field for Consistent Image Restoration	Tharindu Wickremasinghe et.al.	2512.08125	null
2025-12-08	Segment, Embed, and Align: A Universal Recipe for Aligning Subtitles to Signing	Zifan Jiang et.al.	2512.08094	null
2025-12-08	Bayesian Co-Navigation of a Computational Physical Model and AFM Experiment to Autonomously Survey a Combinatorial Materials Library	Boris N. Slautin et.al.	2512.08084	null
2025-12-08	Towards Sustainable Universal Deepfake Detection with Frequency-Domain Masking	Chandler Timm C. Doloriel et.al.	2512.08042	null
2025-12-08	Lost in Translation, Found in Embeddings: Sign Language Translation and Alignment	Youngjoon Jang et.al.	2512.08040	null
2025-12-08	CLARITY: Medical World Model for Guiding Treatment Decisions by Modeling Context-Aware Disease Trajectories in Latent Space	Tianxingjian Ding et.al.	2512.08029	null
2025-12-08	Provable Diffusion Posterior Sampling for Bayesian Inversion	Jinyuan Chang et.al.	2512.08022	null
2025-12-08	On Schauder Estimates for Fractional Hamilton-Jacobi Equations	Espen Robstad Jakobsen et.al.	2512.07999	null
2025-12-08	VLD: Visual Language Goal Distance for Reinforcement Learning Navigation	Lazar Milikic et.al.	2512.07976	null
2025-12-08	How is cold, star-forming gas in galaxies affected by magnetic fields?	Kamran R. J. Bogue et.al.	2512.07948	null
2025-12-08	Detecting Neutrino Emission from Supernova Remnants: A Theoretically Motivated Target Catalog	Emily Simon et.al.	2512.07940	null
2025-12-08	UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation	Jiehui Huang et.al.	2512.07831	null
2025-12-08	One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation	Yuan Gao et.al.	2512.07829	null
2025-12-08	The Adoption and Usage of AI Agents: Early Evidence from Perplexity	Jeremy Yang et.al.	2512.07828	null
2025-12-08	WorldReel: 4D Video Generation with Consistent Geometry and Motion Modeling	Shaoheng Fang et.al.	2512.07821	null
2025-12-08	OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory	Zhaochong An et.al.	2512.07802	null
2025-12-08	Distribution Matching Variational AutoEncoder	Sen Ye et.al.	2512.07778	null
2025-12-08	Physics-Informed Neural Networks for Source Inversion and Parameters Estimation in Atmospheric Dispersion	Brenda Anague et.al.	2512.07755	null
2025-12-08	Unison: A Fully Automatic, Task-Universal, and Low-Cost Framework for Unified Understanding and Generation	Shihao Zhao et.al.	2512.07747	null
2025-12-08	DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving	Jialv Zou et.al.	2512.07745	null
2025-12-08	Anomalous coarsening and nonlinear diffusion of kinks in an one-dimensional quasi-classical Holstein model	Ho Jang et.al.	2512.07744	null
2025-12-09	ViSA: 3D-Aware Video Shading for Real-Time Upper-Body Avatar Creation	Fan Yang et.al.	2512.07720	null
2025-12-08	Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment	Sangha Park et.al.	2512.07702	null
2025-12-08	Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks	Aileen Liao et.al.	2512.07697	null
2025-12-08	A winning approach to the intersections of twisted non-recurrent sets with fractals	Junjie Huang et.al.	2512.07686	null
2025-12-08	Optimization-Guided Diffusion for Interactive Scene Generation	Shiaho Li et.al.	2512.07661	null
2025-12-09	Entropy-Smooth Structures on Topological Manifolds	Amandip Sangha et.al.	2512.07660	null
2025-12-08	Computational Studies on O2-P2 Phase-Transition Dynamics in Layered-Oxide Sodium-Ion Cathode Materials	Konstantin Köster et.al.	2512.07642	null
2025-12-08	LongCat-Image Technical Report	Meituan LongCat Team et.al.	2512.07584	null
2025-12-08	On the structure of increasing profits in a 1D general diffusion market with interest rates	Alexis Anagnostakis et.al.	2512.07555	null
2025-12-09	FRWKV:Frequency-Domain Linear Attention for Long-Term Time Series Forecasting	Qingyuan Yang et.al.	2512.07539	null
2025-12-08	Compressible Euler equations with time-dependent damping in the critical regularity setting: global well-posedness and strong relaxation limit	Timothée Crin-Barat et.al.	2512.07516	null
2025-12-08	ControlVP: Interactive Geometric Refinement of AI-Generated Images with Consistent Vanishing Points	Ryota Okumura et.al.	2512.07504	null
2025-12-08	SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation	Yao Teng et.al.	2512.07503	null
2025-12-08	MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer	Penghui Liu et.al.	2512.07500	null
2025-12-08	Materium: An Autoregressive Approach for Material Generation	Niklas Dobberstein et.al.	2512.07486	null
2025-12-08	Single-step Diffusion-based Video Coding with Semantic-Temporal Guidance	Naifu Xue et.al.	2512.07480	null
2025-12-08	Unified Video Editing with Temporal Reasoner	Xiangpeng Yang et.al.	2512.07469	null
2025-12-08	Simple models for the trapping of charged particles and macromolecules by diffusiophoresis in salt gradients	Richard P. Sear et.al.	2512.07442	null
2025-12-08	InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs	Bin Li et.al.	2512.07410	null
2025-12-08	Traveling Wave Solutions For A Singular Diffusive Prey-Predator Model With Nonlocal Dispersal	Jong-Shenq Guo et.al.	2512.07362	null
2025-12-08	Communication-Efficient Serving for Video Diffusion Models with Latent Parallelism	Zhiyuan Wu et.al.	2512.07350	null
2025-12-08	MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition	Xinyu Wei et.al.	2512.07348	null
2025-12-08	Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting	Shilong Jin et.al.	2512.07345	null
2025-12-08	PrivORL: Differentially Private Synthetic Dataset for Offline Reinforcement Learning	Chen Gong et.al.	2512.07342	null
2025-12-08	ContextAnyone: Context-Aware Diffusion for Character-Consistent Text-to-Video Generation	Ziyang Mai et.al.	2512.07328	null
2025-12-08	DBMC-aNOMAly: Asynchronous NOMA with Pilot-Symbol Optimization Protocol for Diffusion-Based Molecular Communication Networks	Alexander Wietfeld et.al.	2512.07317	null
2025-12-08	M-STAR: Multi-Scale Spatiotemporal Autoregression for Human Mobility Modeling	Yuxiao Luo et.al.	2512.07314	null
2025-12-08	Estimation of the elasticity for CKLS model from high-frequency observations	Boyuan Ning et.al.	2512.07301	null
2025-12-08	Diagnosing Interstellar Magnetic Turbulence with TeV Pulsar Halos	Chao-Ming Li et.al.	2512.07290	null
2025-12-08	Equivariant Diffusion for Crystal Structure Prediction	Peijia Lin et.al.	2512.07289	null
2025-12-08	Pauli Master Equation numerical analysis of coherent and incoherent dressed fermions in triplet unconventional superconductors	Pedro L. Contreras E et.al.	2512.07274	null
2025-12-08	See More, Change Less: Anatomy-Aware Diffusion for Contrast Enhancement	Junqi Liu et.al.	2512.07251	null
2025-12-08	AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing	Ziming Hong et.al.	2512.07247	null
2025-12-08	Interplay between Escaping Cosmic Rays and Interstellar Medium: Driving of Galactic Winds and Shaping the Local Proton Spectrum	Jiro Shimoda et.al.	2512.07239	null
2025-12-08	Unified Camera Positional Encoding for Controlled Video Generation	Cheng Zhang et.al.	2512.07237	null
2025-12-08	Unsupervised Single-Channel Audio Separation with Diffusion Source Priors	Runwu Shi et.al.	2512.07226	null
2025-12-08	Simulation Study of Binary Mergers of Galaxy Clusters I: Properties of Merger Shocks and Radio Emission	Hyesung Kang et.al.	2512.07214	null
2025-12-08	Sample from What You See: Visuomotor Policy Learning via Diffusion Bridge with Observation-Embedded Stochastic Differential Equation	Zhaoyang Liu et.al.	2512.07212	null
2025-12-08	Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits	Masato Ishii et.al.	2512.07209	null
2025-12-08	Understanding Diffusion Models via Code Execution	Cheng Yu et.al.	2512.07201	null
2025-12-08	Generating Storytelling Images with Rich Chains-of-Reasoning	Xiujie Song et.al.	2512.07198	null
2025-12-08	MASim: Multilingual Agent-Based Simulation for Social Science	Xuan Zhang et.al.	2512.07195	null
2025-12-08	HVQ-CGIC: Enabling Hyperprior Entropy Modeling for VQ-Based Controllable Generative Image Compression	Niu Yi et.al.	2512.07192	null
2025-12-08	RefLSM: Linearized Structural-Prior Reflectance Model for Medical Image Segmentation and Bias-Field Correction	Wenqi Zhao et.al.	2512.07191	null
2025-12-08	UniDiff: A Unified Diffusion Framework for Multimodal Time Series Forecasting	Da Zhang et.al.	2512.07184	null
2025-12-08	Improving the Throughput of Diffusion-based Large Language Models via a Training-Free Confidence-Aware Calibration	Jucheng Shen et.al.	2512.07173	null
2025-12-08	Towards Unified Semantic and Controllable Image Fusion: A Diffusion Transformer Approach	Jiayang Li et.al.	2512.07170	null
2025-12-08	Bayesian Inference of Heavy-Quark Dissipation and Jet Transport Parameters from D-Meson observables in heavy-ion collisions at the LHC energies	Xu-Fei Xue et.al.	2512.07169	null
2025-12-08	JEPA as a Neural Tokenizer: Learning Robust Speech Representations with Density Adaptive Attention	Georgios Ioannides et.al.	2512.07168	null
2025-12-08	CHIMERA: Adaptive Cache Injection and Semantic Anchor Prompting for Zero-shot Image Morphing with Morphing-oriented Metrics	Dahyeon Kye et.al.	2512.07155	null
2025-12-08	A Theoretical Framework of Student Agency in AI- Assisted Learning: A Grounded Theory Approach	Yun Dai et.al.	2512.07143	null
2025-12-08	Mimir: Hierarchical Goal-Driven Diffusion with Uncertainty Propagation for End-to-End Autonomous Driving	Zebin Xing et.al.	2512.07130	null
2025-12-08	$\mathrm{D}^{\mathrm{3}}$ -Predictor: Noise-Free Deterministic Diffusion for Dense Prediction	Changliang Xia et.al.	2512.07062	null
2025-12-07	Wage Dispersion, On-the-Job Search, and Stochastic Match Productivity: A Mean Field Game Approach	I. Sebastian Buhai et.al.	2512.07024	null
2025-12-07	Utilizing Multi-Agent Reinforcement Learning with Encoder-Decoder Architecture Agents to Identify Optimal Resection Location in Glioblastoma Multiforme Patients	Krishna Arun et.al.	2512.06990	null
2025-12-07	OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction	Emily Jin et.al.	2512.06987	null
2025-12-07	Physics-Guided Diffusion Priors for Multi-Slice Reconstruction in Scientific Imaging	Laurentius Valdy et.al.	2512.06977	null
2025-12-07	Anisotropic Diffusion Modeling of Cosmic-Ray Lepton Propagation	V. D. Borisov et.al.	2512.06972	null
2025-12-07	Neuro-Vesicles: Neuromodulation Should Be a Dynamical System, Not a Tensor Decoration	Zilin Li et.al.	2512.06966	null
2025-12-07	VideoVLA: Video Generators Can Be Generalizable Robot Manipulators	Yichao Shen et.al.	2512.06963	null
2025-12-07	Confinement-Driven Exciton Behavior in 2D Halide Perovskites from Dielectric-Dependent Hybrid Methods	Rafael B. Araujo et.al.	2512.06913	null
2025-12-07	Surface-directed spinodal decomposition in binary fluid mixtures on an amorphous wall: A molecular dynamics study	Syed Shuja Hasan Zaidi et.al.	2512.06911	null
2025-12-07	Scaling Zero-Shot Reference-to-Video Generation	Zijian Zhou et.al.	2512.06905	null
2025-12-07	Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training	Kaixuan Lu et.al.	2512.06864	null
2025-12-07	Hide-and-Seek Attribution: Weakly Supervised Segmentation of Vertebral Metastases in CT	Matan Atad et.al.	2512.06849	null
2025-12-07	FGE: A Fast Free-Boundary Grad-Shafranov Evolutive Solver	Cosmas Heiß et.al.	2512.06847	null
2025-12-07	Pseudo Anomalies Are All You Need: Diffusion-Based Generation for Weakly-Supervised Video Anomaly Detection	Satoshi Hashimoto et.al.	2512.06845	null
2025-12-07	Free energy dissipation and a decomposition of general jump diffusions on $\mathbb{R}^n$ without detailed balance	Shuyuan Fan et.al.	2512.06839	null
2025-12-07	MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning	Yueqian Wang et.al.	2512.06810	null
2025-12-07	Optimal and Diffusion Transports in Machine Learning	Gabriel Peyré et.al.	2512.06797	null
2025-12-07	Measuring Over-smoothing beyond Dirichlet energy	Weiqi Guan et.al.	2512.06782	null
2025-12-07	From Next-Token to Next-Block: A Principled Adaptation Path for Diffusion LLMs	Yuchuan Tian et.al.	2512.06776	null
2025-12-07	RDSplat: Robust Watermarking Against Diffusion Editing for 3D Gaussian Splatting	Longjie Zhao et.al.	2512.06774	null
2025-12-07	Revisiting the atmosphere of HAT-P-70b with CARMENES high-resolution transmission spectroscopy	Tianjun Gan et.al.	2512.06731	null
2025-12-07	Mitigating Barren plateaus in quantum denoising diffusion probabilistic models	Haipeng Cao et.al.	2512.06695	null
2025-12-07	Multi-Functional Programmable Metasurfaces for 6G and Beyond	Xu Gan et.al.	2512.06693	null
2025-12-07	EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy	Yumeng He et.al.	2512.06684	null
2025-12-07	RunawayEvil: Jailbreaking the Image-to-Video Generative Models	Songping Wang et.al.	2512.06674	null
2025-12-07	1 + 1 > 2: Detector-Empowered Video Large Language Model for Spatio-Temporal Grounding and Reasoning	Shida Gao et.al.	2512.06673	null
2025-12-07	MIND-V: Hierarchical Video Generation for Long-Horizon Robotic Manipulation with RL-based Physical Alignment	Ruicheng Zhang et.al.	2512.06628	null
2025-12-07	Compression-driven jamming in porous cohesive aggregates	Sota Arakawa et.al.	2512.06624	null
2025-12-07	Monotone data augmentation algorithm for longitudinal continuous, binary and ordinal outcomes: a unifying approach	Yongqiang Tang et.al.	2512.06621	null
2025-12-06	Generic visuality of war? How image-generative AI models (mis)represent Russia’s war against Ukraine	Mykola Makhortykh et.al.	2512.06570	null
2025-12-06	Localization, transport, flux induced extended modes and mobility edge in a self-similar corral substrate	Sayan Bhattacharya et.al.	2512.06569	null
2025-12-06	Chemical Vapor Deposition of Nitrides by Carbon-free Brominated Precursors	Stefano Leone et.al.	2512.06566	null
2025-12-06	Convective Viscous Cahn-Hilliard/Allen-Cahn Equation with memory effects	P. O. Mchedlov-Petrosyan et.al.	2512.06508	null
2025-12-06	Optical Study of TRAPUM Pulsars and Modelling of the Redbacks: PSR J1036 $-$4353 and PSR J1803$-$ 6707	A. Phosrisom et.al.	2512.06503	null
2025-12-06	Model of incompressible turbulent flows via a kinetic theory	Ziyang Xin et.al.	2512.06433	null
2025-12-06	Modelling dust coagulation, dynamical drag and turbulent mixing during star and disc formation	Matthew R. Bate et.al.	2512.06409	null
2025-12-06	Innovation, Spillovers and Economic Geography	José M. Gaspar et.al.	2512.06402	null
2025-12-05	EditThinker: Unlocking Iterative Reasoning for Any Image Editor	Hongyu Li et.al.	2512.05965	null
2025-12-05	Training-Time Action Conditioning for Efficient Real-Time Chunking	Kevin Black et.al.	2512.05964	null
2025-12-05	AQUA-Net: Adaptive Frequency Fusion and Illumination Aware Network for Underwater Image Enhancement	Munsif Ali et.al.	2512.05960	null
2025-12-05	M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG	David Anugraha et.al.	2512.05959	null
2025-12-05	Correspondence-Oriented Imitation Learning: Flexible Visuomotor Control with 3D Conditioning	Yunhao Cao et.al.	2512.05953	null
2025-12-05	Measuring the Effect of Background on Classification and Feature Importance in Deep Learning for AV Perception	Anne Sielemann et.al.	2512.05937	null
2025-12-05	Synset Signset Germany: a Synthetic Dataset for German Traffic Sign Recognition	Anne Sielemann et.al.	2512.05936	null
2025-12-05	A Comparative Study on Synthetic Facial Data Generation Techniques for Face Recognition	Pedro Vidal et.al.	2512.05928	null
2025-12-05	World Models That Know When They Don’t Know: Controllable Video Generation with Calibrated Uncertainty	Zhiting Mei et.al.	2512.05927	null
2025-12-05	A Discontinuous Galerkin Consistent Splitting Method for the Incompressible Navier-Stokes Equations	Dominik Still et.al.	2512.05919	null
2025-12-05	LDLT $\mathcal{L}$ -Lipschitz Network: Generalized Deep End-To-End Lipschitz Network Construction	Marius F. R. Juston et.al.	2512.05915	null
2025-12-05	SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations	Wenhao Yan et.al.	2512.05905	null
2025-12-05	Quantitatively mapping the Eady model onto a two-layer quasi-geostrophic model	Julie Meunier et.al.	2512.05902	null
2025-12-05	Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using Language Models	Sairam Vaidya et.al.	2512.05887	null
2025-12-05	Continuous operations on non-Markovian processes	Fabio Costa et.al.	2512.05884	null
2025-12-05	Functional dual-slope frequency-domain near-infrared spectroscopy data interpreted with two- and three-layer models	Jodee Frias et.al.	2512.05877	null
2025-12-05	InstructMPC: A Human-LLM-in-the-Loop Framework for Context-Aware Power Grid Control	Ruixiang Wu et.al.	2512.05876	null
2025-12-05	Underwater Image Reconstruction Using a Swin Transformer-Based Generator and PatchGAN Discriminator	Md. Mahbub Hasan Akash et.al.	2512.05866	null
2025-12-05	Edit-aware RAW Reconstruction	Abhijith Punnappurath et.al.	2512.05859	null
2025-12-05	Non-equilibrium formulation for inertial particles in turbulent swirling flows	Bernardo L. Español et.al.	2512.05855	null
2025-12-05	VRSA: Jailbreaking Multimodal Large Language Models through Visual Reasoning Sequential Attack	Shiji Zhao et.al.	2512.05853	null
2025-12-05	A decomposition theorem for topological branched coverings	Shahryar Ghaed Sharaf et.al.	2512.05848	null
2025-12-05	Design of a High-Power and High-Efficiency GaN-HEMT VCO Based on an Inverse Class-F Amplifier	Junlin Mi et.al.	2512.05846	null
2025-12-05	NEAT: Neighborhood-Guided, Efficient, Autoregressive Set Transformer for 3D Molecular Generation	Daniel Rose et.al.	2512.05844	null
2025-12-05	Hadronic Emissions from the Microquasar V4641 Sgr, SS433, and its implications in the Diffuse Galactic Emission	Basanti Paul et.al.	2512.05839	null
2025-12-05	Boltzmann transport theory of magnon-exciton drag	Zakhar A. Iakovlev et.al.	2512.05835	null
2025-12-05	Higher-order diffusion and Cahn-Hilliard-type models revisited on the half-line	A. Chatziafratis et.al.	2512.05829	null
2025-12-05	Machine-learning-enabled interpretation of tribological deformation patterns in large-scale MD data	Hendrik J. Ehrich et.al.	2512.05818	null
2025-12-05	Optimal Safety-Aware Scheduling for Multi-Agent Aerial 3D Printing with Utility Maximization under Dependency Constraints	Marios-Nektarios Stamatopoulos et.al.	2512.05815	null
2025-12-05	3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering	Blanca Inigo et.al.	2512.05803	null
2025-12-05	Bring Your Dreams to Life: Continual Text-to-Video Customization	Jiahua Dong et.al.	2512.05802	null
2025-12-05	Mechanistic Interpretability of Antibody Language Models Using SAEs	Rebonto Haque et.al.	2512.05794	null
2025-12-05	Symmetry-driven phonon confinement in 2D halide perovskites	Mustafa Mahmoud Aboulsaad et.al.	2512.05792	null
2025-12-05	Fast and Robust Diffusion Posterior Sampling for MR Image Reconstruction Using the Preconditioned Unadjusted Langevin Algorithm	Moritz Blumenthal et.al.	2512.05791	null
2025-12-05	USV: Unified Sparsification for Accelerating Video Diffusion Models	Xinjian Wu et.al.	2512.05754	null
2025-12-05	Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning	Jinlong Liu et.al.	2512.05747	null
2025-12-05	HQ-DM: Single Hadamard Transformation-Based Quantization-Aware Training for Low-Bit Diffusion Models	Shizhuo Mao et.al.	2512.05746	null
2025-12-05	ARGUS: Defending Against Multimodal Indirect Prompt Injection via Steering Instruction-Following Behavior	Weikai Lu et.al.	2512.05745	null
2025-12-05	Distilling Expert Surgical Knowledge: How to train local surgical VLMs for anatomy explanation in Complete Mesocolic Excision	Lennart Maack et.al.	2512.05740	null
2025-12-05	The tube transducer as a novel source for power ultrasound: A case study in delamination of graphite coating from lithium-ion battery anode	Shida Li et.al.	2512.05726	null
2025-12-05	Taylor Approximation Variance Reduction for Approximation Errors in PDE-constrained Bayesian Inverse Problems	Ruanui Nicholson et.al.	2512.05723	null
2025-12-05	Evaluating Concept Filtering Defenses against Child Sexual Abuse Material Generation by Text-to-Image Models	Ana-Maria Cretu et.al.	2512.05707	null
2025-12-05	Exploiting Spatial Multiplexing Based on Pixel Antennas: An Antenna Coding Approach	Zixiang Han et.al.	2512.05706	null
2025-12-05	HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies	Zhiying Du et.al.	2512.05693	null
2025-12-05	IMMPC: An Internal Model Based MPC for Rejecting Unknown Disturbances	Felix Brändle et.al.	2512.05692	null
2025-12-05	LA-RL: Language Action-guided Reinforcement Learning with Safety Guarantees for Autonomous Highway Driving	Yiming Shu et.al.	2512.05686	null
2025-12-05	An output scaling layer boosts deep neural networks for multiscale ODE systems	Yuxiao Yi et.al.	2512.05685	null
2025-12-05	InverseCrafter: Efficient Video ReCapture as a Latent Domain Inverse Problem	Yeobin Hong et.al.	2512.05672	null
2025-12-05	MedTutor-R1: Socratic Personalized Medical Teaching with Multi-Agent Simulation	Zhitao He et.al.	2512.05671	null
2025-12-05	Interleaved Latent Visual Reasoning with Selective Perceptual Modeling	Shuai Dong et.al.	2512.05665	null
2025-12-05	Self-Supervised AI-Generated Image Detection: A Camera Metadata Perspective	Nan Zhong et.al.	2512.05651	null
2025-12-05	Geometric control of boundary-catalytic branching processes	Denis S. Grebenkov et.al.	2512.05637	null
2025-12-05	Design-marginal calibration of Gaussian process predictive distributions: Bayesian and conformal approaches	Aurélien Pion et.al.	2512.05611	null
2025-12-05	Learning High-Fidelity Cloth Animation via Skinning-Free Image Transfer	Rong Wang et.al.	2512.05593	null
2025-12-05	General and Domain-Specific Zero-shot Detection of Generated Images via Conditional Likelihood	Roy Betser et.al.	2512.05590	null
2025-12-05	Stellar feedback drives the baryon deficiency in low-mass galaxies	Haoran Yu et.al.	2512.05584	null
2025-12-05	A Hyperspectral Imaging Guided Robotic Grasping System	Zheng Sun et.al.	2512.05578	null
2025-12-05	MedDIFT: Multi-Scale Diffusion-Based Correspondence in 3D Medical Imaging	Xingyu Zhang et.al.	2512.05571	null
2025-12-05	ProPhy: Progressive Physical Alignment for Dynamic World Simulation	Zijun Wang et.al.	2512.05564	null
2025-12-05	*A Hierarchy of Entanglement Cones via Rank-Constrained $C^$ -Convex Hulls**	Mohsen Kian et.al.	2512.05560	null
2025-12-05	2K-Characters-10K-Stories: A Quality-Gated Stylized Narrative Dataset with Disentangled Control and Sequence Consistency	Xingxi Yin et.al.	2512.05557	null
2025-12-05	Conscious Gaze: Adaptive Attention Mechanisms for Hallucination Mitigation in Vision-Language Models	Weijue Bu et.al.	2512.05546	null
2025-12-05	Ideal Observer for Segmentation of Dead Leaves Images	Swantje Mahncke et.al.	2512.05539	null
2025-12-05	VOST-SGG: VLM-Aided One-Stage Spatio-Temporal Scene Graph Generation	Chinthani Sugandhika et.al.	2512.05524	null
2025-12-05	Mode-resolved logarithmic quasiballistic heat transport in thin silicon layers: Semianalytic and Boltzmann transport analysis	Jae Sik Jin et.al.	2512.05522	null
2025-12-05	User Negotiations of Authenticity, Ownership, and Governance on AI-Generated Video Platforms: Evidence from Sora	Bohui Shen et.al.	2512.05519	null
2025-12-05	DashFusion: Dual-stream Alignment with Hierarchical Bottleneck Fusion for Multimodal Sentiment Analysis	Yuhua Wen et.al.	2512.05515	null
2025-12-05	Solving Multiparametric Generalized Nash Equilibrium Problems and Explicit Game-Theoretic Model Predictive Control	Sophie Hall et.al.	2512.05505	null
2025-12-05	Decoding with Structured Awareness: Integrating Directional, Frequency-Spatial, and Structural Attention for Medical Image Segmentation	Fan Zhang et.al.	2512.05494	null
2025-12-05	WaterWave: Bridging Underwater Image Enhancement into Video Streams via Wavelet-based Temporal Consistency Field	Qi Zhu et.al.	2512.05492	null
2025-12-05	A new class of general linear method with inherent quadratic stability for solving stiff differential systems	Sakshi Gautam et.al.	2512.05486	null
2025-12-05	EmoStyle: Emotion-Driven Image Stylization	Jingyuan Yang et.al.	2512.05478	null
2025-12-05	Observation-Time-Induced Crossover from Fluctuating Diffusivity	Masahiro Shirataki et.al.	2512.05471	null
2025-12-05	Everything is Context: Agentic File System Abstraction for Context Engineering	Xiwei Xu et.al.	2512.05470	null
2025-12-05	How Ensemble Learning Balances Accuracy and Overfitting: A Bias-Variance Perspective on Tabular Data	Zubair Ahmed Mohammad et.al.	2512.05469	null
2025-12-05	Dynamic hysteresis and transitions induced by potential asymmetry	Samudro Ghosh et.al.	2512.05465	null
2025-12-05	Model Gateway: Model Management Platform for Model-Driven Drug Discovery	Yan-Shiun Wu et.al.	2512.05462	null
2025-12-05	EXR: An Interactive Immersive EHR Visualization in Extended Reality	Benoit Marteau et.al.	2512.05438	null
2025-12-05	Zero-field superconducting diode effect induced by magnetic flux in a van der Waals superconductor trigonal PtBi $_2$	Nan Jiang et.al.	2512.05427	null
2025-12-05	Convergence rate of $\ell^p$-relaxation on a graph to a $p$ -harmonic function with given boundary values	Chenyu Gan et.al.	2512.05424	null
2025-12-05	ParaUni: Enhance Generation in Unified Multimodal Model with Reinforcement-driven Hierarchical Parallel Information Interaction	Jiangtong Tan et.al.	2512.05422	null
2025-12-05	Genetic Algorithms For Parameter Optimization for Disparity Map Generation of Radiata Pine Branch Images	Yida Lin et.al.	2512.05410	null
2025-12-05	Ruminations Upon the Modeling of X-ray Foregrounds, Backgrounds and Faint Sources	Adam B. Mantz et.al.	2512.05405	null
2025-12-05	Hybrid modeling approach for better identification of building thermal network model and improved prediction	Sang woo Ham et.al.	2512.05400	null
2025-12-05	Simulating Life Paths with Digital Twins: AI-Generated Future Selves Influence Decision-Making and Expand Human Choice	Rachel Poonsiriwong et.al.	2512.05397	null
2025-12-05	Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability	Shizhan Liu et.al.	2512.05394	null
2025-12-05	Fuzzing the brain: Automated stress testing for the safety of ML-driven neurostimulation	Mara Downing et.al.	2512.05383	null
2025-12-05	China Regional 3km Downscaling Based on Residual Corrective Diffusion Model	Honglu Sun et.al.	2512.05377	null
2025-12-05	SplatPainter: Interactive Authoring of 3D Gaussians from 2D Edits via Test-Time Training	Yang Zheng et.al.	2512.05354	null
2025-12-05	SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling	Elisabetta Fedele et.al.	2512.05343	null
2025-12-05	On-Orbit Calibration of Danuri/PolCam. I. Geometric Calibration	Kilho Baek et.al.	2512.05330	null
2025-12-05	CATNUS: Coordinate-Aware Thalamic Nuclei Segmentation Using T1-Weighted MRI	Anqi Feng et.al.	2512.05329	null
2025-12-05	LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning	Ömer Faruk Akgül et.al.	2512.05325	null
2025-12-04	Uncertainty Quantification for Scientific Machine Learning using Sparse Variational Gaussian Process Kolmogorov-Arnold Networks (SVGP KAN)	Y. Sungtaek Ju et.al.	2512.05306	null
2025-12-04	CFO: Learning Continuous-Time PDE Dynamics via Flow-Matched Neural Operators	Xianglong Hou et.al.	2512.05297	null
2025-12-04	Mapping vacancy and bonding electron distributions around aluminium nanovoids	Philip N. H. Nakashima et.al.	2512.05296	null
2025-12-04	Free quasi-Banach lattices	Alberto Salguero-Alarcón et.al.	2512.05273	null
2025-12-04	XR-DT: Extended Reality-Enhanced Digital Twin for Agentic Mobile Robots	Tianyi Wang et.al.	2512.05270	null
2025-12-04	CARD: Correlation Aware Restoration with Diffusion	Niki Nezakati et.al.	2512.05268	null
2025-12-04	Uncertainty-Aware Data-Efficient AI: An Information-Theoretic Perspective	Osvaldo Simeone et.al.	2512.05267	null
2025-12-04	One-Step Diffusion Samplers via Self-Distillation and Deterministic Flow	Pascal Jutras-Dube et.al.	2512.05251	null
2025-12-04	IE2Video: Adapting Pretrained Diffusion Models for Event-Based Video Reconstruction	Dmitrii Torbunov et.al.	2512.05240	null
2025-12-04	Invariance Co-training for Robot Visual Generalization	Jonathan Yang et.al.	2512.05230	null
2025-12-04	Your Latent Mask is Wrong: Pixel-Equivalent Latent Compositing for Diffusion Models	Rowan Bradbury et.al.	2512.05198	null
2025-12-04	Value Gradient Guidance for Flow Matching Alignment	Zhen Liu et.al.	2512.05116	null
2025-12-04	Light-X: Generative 4D Video Rendering with Camera and Illumination Control	Tianqi Liu et.al.	2512.05115	null
2025-12-04	DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation	Dongzhi Jiang et.al.	2512.05112	null
2025-12-04	ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning	Shengyuan Ding et.al.	2512.05111	null
2025-12-04	ShadowDraw: From Any Object to Shadow-Drawing Compositional Art	Rundong Luo et.al.	2512.05110	null
2025-12-04	NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation	Yu Zeng et.al.	2512.05106	null
2025-12-04	EvoIR: Towards All-in-One Image Restoration via Evolutionary Frequency Modulation	Jiaqi Ma et.al.	2512.05104	null
2025-12-04	TV2TV: A Unified Framework for Interleaved Language and Video Generation	Xiaochuang Han et.al.	2512.05103	null
2025-12-04	SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards	Yuan Gao et.al.	2512.05098	null
2025-12-04	From Generated Human Videos to Physically Plausible Robot Trajectories	James Ni et.al.	2512.05094	null
2025-12-04	Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction	Vincent Pauline et.al.	2512.05092	null
2025-12-04	Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression	Jung Yi et.al.	2512.05081	null
2025-12-04	Object Reconstruction under Occlusion with Generative Priors and Contact-induced Constraints	Minghan Zhu et.al.	2512.05079	null
2025-12-04	BulletTime: Decoupled Control of Time and Camera Pose for Video Generation	Yiming Wang et.al.	2512.05076	null
2025-12-04	The Evolving Landscape of Interactive Surface Sensing Technologies	David Wang et.al.	2512.05071	null
2025-12-04	Control Consistency Losses for Diffusion Bridges	Samuel Howard et.al.	2512.05070	null
2025-12-04	Axionic tunneling from a topological Kondo insulator	Saikat Banerjee et.al.	2512.05057	null
2025-12-04	Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image	Yanran Zhang et.al.	2512.05044	null
2025-12-04	Semantic-Guided Two-Stage GAN for Face Inpainting with Hybrid Perceptual Encoding	Abhigyan Bhattacharya et.al.	2512.05039	null
2025-12-04	SuperActivators: Only the Tail of the Distribution Contains Reliable Concept Signals	Cassandra Goldberg et.al.	2512.05038	null
2025-12-04	RAMEN: Resolution-Adjustable Multimodal Encoder for Earth Observation	Nicolas Houdré et.al.	2512.05025	null
2025-12-04	Generative Neural Video Compression via Video Diffusion Prior	Qi Mao et.al.	2512.05016	null
2025-12-04	Plug-and-Play Homeostatic Spark: Zero-Cost Acceleration for SNN Training Across Paradigms	Rui Chen et.al.	2512.05015	null
2025-12-04	Hall-like response from anisotropic Fermi surfaces	Abhiram Soori et.al.	2512.05014	null
2025-12-04	Reflection Removal through Efficient Adaptation of Diffusion Transformers	Daniyar Zakarin et.al.	2512.05000	null
2025-12-04	Influence of Object Affordance on Action Language Understanding: Evidence from Dynamic Causal Modeling Analysis	Supriya Bordoloi et.al.	2512.04989	null
2025-12-04	Towards a unified framework for guided diffusion models	Yuchen Jiao et.al.	2512.04985	null
2025-12-04	Operator Formalism for Laser-Plasma Wakefield Acceleration	Mostafa Behtouei et.al.	2512.04982	null
2025-12-04	Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models	NaHyeon Park et.al.	2512.04981	null
2025-12-04	Generalized Pinching-Antenna Systems: A Leaky-Coaxial-Cable Perspective	Kaidi Wang et.al.	2512.04979	null
2025-12-04	Exploring YouTube’s Political Communication Networks during the 2024 French Elections	Caroline Violot et.al.	2512.04971	null
2025-12-04	Rethinking the Use of Vision Transformers for AI-Generated Image Detection	NaHyeon Park et.al.	2512.04969	null
2025-12-04	Balanced Few-Shot Episodic Learning for Accurate Retinal Disease Diagnosis	Jasmaine Khale et.al.	2512.04967	null
2025-12-04	Environment-Aware Channel Inference via Cross-Modal Flow: From Multimodal Sensing to Wireless Channels	Guangming Liang et.al.	2512.04966	null
2025-12-04	Hybrid-Diffusion Models: Combining Open-loop Routines with Visuomotor Diffusion Policies	Jonne Van Haastregt et.al.	2512.04960	null
2025-12-04	FASTer: Toward Efficient Autoregressive Vision Language Action Modeling via neural Action Tokenization	Yicheng Liu et.al.	2512.04952	null
2025-12-05	Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion	Yueming Pan et.al.	2512.04926	null
2025-12-04	Distributed Riemannian Optimization in Geodesically Non-convex Environments	Xiuheng Wang et.al.	2512.04915	null
2025-12-04	Local mixing length theory with compositional effects:\ First application to asymptotic giant branch evolution	M. M. Ocampo et.al.	2512.04900	null
2025-12-04	Small-Signal Stability Oriented Real-Time Operation of Power Systems with a High Penetration of Inverter-Based Resources	Francesca Rossi et.al.	2512.04892	null
2025-12-04	On hyperbolic approximations for a class of dispersive and diffusive-dispersive equations	Rahul Barthwal et.al.	2512.04882	null
2025-12-04	Isostructural phase transition and equation of state of type-I and type-VIII metallic sodium borosilicide clathrates	M. Demoucron et.al.	2512.04878	null
2025-12-04	The Single Differential Cross Sections (SDCS) for H(3s) Ionization in the First-Born Approximation by Electron and Positron Impact	Fahadul Islam et.al.	2512.04870	null
2025-12-04	Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing	Maria-Paola Forte et.al.	2512.04862	null
2025-12-04	Autoregressive Image Generation Needs Only a Few Lines of Cached Tokens	Ziran Qin et.al.	2512.04857	null
2025-12-04	aim-resolve: Automatic Identification and Modeling for Bayesian Radio Interferometric Imaging	Richard Fuchs et.al.	2512.04840	null
2025-12-04	A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World	Jikang Cheng et.al.	2512.04837	null
2025-12-04	FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis	Shijie Chen et.al.	2512.04830	null
2025-12-04	Interfacial Synergy in Ag-Doped CuO-AgCl-g-C3N4 Composites for Efficient Charge Separation and Low-power Methylene Blue Degradation	Suresh Chandra Baral et.al.	2512.04825	null
2025-12-04	LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation	Huynh Trinh Ngoc et.al.	2512.04821	null
2025-12-04	Interplay between Superconductivity and Altermagnetism in Disordered Materials and Heterostructures	Rodrigo de las Heras et.al.	2512.04819	null
2025-12-04	EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture	Xin He et.al.	2512.04810	null
2025-12-04	Setting up for failure: automatic discovery of the neural mechanisms of cognitive errors	Puria Radmard et.al.	2512.04808	null
2025-12-04	Existence and uniqueness of the canonical Brownian motion in non-simple conformal loop ensemble gaskets	Jason Miller et.al.	2512.04807	null
2025-12-04	Unveiling gravitational waves from core-collapse supernovae with MUSE	Alessandro Veutro et.al.	2512.04804	null
2025-12-04	SIMA 2: A Generalist Embodied Agent for Virtual Worlds	SIMA team et.al.	2512.04797	null
2025-12-04	Terahertz Fourier Ptychographic Imaging	Pitambar Mukherjee et.al.	2512.04783	null
2025-12-04	Homogenized limits of Stokes flow and advective transport in thin perforated domains	Markus Gahn et.al.	2512.04782	null
2025-12-04	YingMusic-Singer: Zero-shot Singing Voice Synthesis and Editing with Annotation-free Melody Guidance	Junjie Zheng et.al.	2512.04779	null
2025-12-04	TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards	Mauro Martini et.al.	2512.04772	null
2025-12-04	Complementary Characterization of Agent-Based Models via Computational Mechanics and Diffusion Models	Roberto Garrone et.al.	2512.04771	null
2025-12-04	Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challenges	Yuxing Wang et.al.	2512.04770	null
2025-12-04	Human Cognitive Biases in Explanation-Based Interaction: The Case of Within and Between Session Order Effect	Dario Pesenti et.al.	2512.04764	null
2025-12-04	Order Matters: 3D Shape Generation from Sequential VR Sketches	Yizi Chen et.al.	2512.04761	null
2025-12-04	EtCon: Edit-then-Consolidate for Reliable Knowledge Editing	Ruilin Li et.al.	2512.04753	null
2025-12-04	UnwrapDiff: Conditional Diffusion for Robust InSAR Phase Unwrapping	Yijia Song et.al.	2512.04749	null
2025-12-04	MT-Depth: Multi-task Instance feature analysis for the Depth Completion	Abdul Haseeb Nizamani et.al.	2512.04734	null
2025-12-04	Bridging Simulation and Reality: Cross-Domain Transfer with Semantic 2D Gaussian Splatting	Jian Tang et.al.	2512.04731	null
2025-12-04	M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis	Xiaopeng Wang et.al.	2512.04720	null
2025-12-04	Heat transport in superionic materials via machine-learned molecular dynamics	Wenjiang Zhou et.al.	2512.04718	null
2025-12-04	Towards an AI Fluid Scientist: LLM-Powered Scientific Discovery in Experimental Fluid Mechanics	Haodong Feng et.al.	2512.04716	null
2025-12-04	Large Speech Model Enabled Semantic Communication	Yun Tian et.al.	2512.04711	null
2025-12-04	Multi Task Denoiser Training for Solving Linear Inverse Problems	Clément Bled et.al.	2512.04709	null
2025-12-04	Coordinated Mean-Field Control for Systemic Risk	Toshiaki Yamanaka et.al.	2512.04704	null
2025-12-04	POLARIS: Is Multi-Agentic Reasoning the Next Wave in Engineering Self-Adaptive Systems?	Divyansh Pandey et.al.	2512.04702	null
2025-12-04	The ejection velocities of interstellar objects signpost their progenitor system architectures	Leah Albrow et.al.	2512.04700	null
2025-12-04	OmniScaleSR: Unleashing Scale-Controlled Diffusion Prior for Faithful and Realistic Arbitrary-Scale Image Super-Resolution	Xinning Chai et.al.	2512.04699	null
2025-12-04	Provable FDR Control for Deep Feature Selection: Deep MLPs and Beyond	Kazuma Sawaya et.al.	2512.04696	null
2025-12-04	TimesNet-Gen: Deep Learning-based Site Specific Strong Motion Generation	Baris Yilmaz et.al.	2512.04694	null
2025-12-04	Diffusive geodesics wandering in networks of rigid chains	Ulysse Marquis et.al.	2512.04682	null
2025-12-04	Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation	Yunhong Lu et.al.	2512.04678	null
2025-12-05	Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length	Yubo Huang et.al.	2512.04677	null
2025-12-04	Cryptanalysis of Gleeok-128	Siwei Chen et.al.	2512.04675	null
2025-12-04	Topology Matters: Measuring Memory Leakage in Multi-Agent LLMs	Jinbo Liu et.al.	2512.04668	null
2025-12-04	Watt-level coherent microwave emission from dissipation engineered solid-state quantum batteries	Yuanjin Wang et.al.	2512.04666	null
2025-12-04	I2I-Bench: A Comprehensive Benchmark Suite for Image-to-Image Editing Models	Juntong Wang et.al.	2512.04660	null
2025-12-04	The $Λ$ -Set and Its Role in Local Controllability and Necessary Conditions for Free-Time Optimal Control	Mohammad H. M. Rashid et.al.	2512.04651	null
2025-12-04	Persson’s Theory of Purely Normal Elastic Rough Surface Contact: A Tutorial Based on Stochastic Process Theory	Yang Xu et.al.	2512.04648	null
2025-12-04	SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding	Chang-Hsun Wu et.al.	2512.04643	null
2025-12-04	When GenAI Meets Fake News: Understanding Image Cascade Dynamics on Reddit	Saumya Chauhan et.al.	2512.04639	null
2025-12-04	Hole to Electron Crossover in a (Cd,Mn)Te Quantum Well through Surface Metallization	Amadeusz Dydniański et.al.	2512.04631	null
2025-12-04	Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence	Tianyu Yuan et.al.	2512.04619	null
2025-12-04	Mode interactions in scalar field cosmology	Spiros Cotsakis et.al.	2512.04607	null
2025-12-04	Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot	Sheng Hang et.al.	2512.04599	null
2025-12-04	QoSDiff: An Implicit Topological Embedding Learning Framework Leveraging Denoising Diffusion and Adversarial Attention for Robust QoS Prediction	Guanchen Du et.al.	2512.04596	null
2025-12-04	Stable self-adaptive timestepping for Reduced Order Models for incompressible flows	Josep Plana-Riu et.al.	2512.04592	null
2025-12-04	Structure-Aware Adaptive Kernel MPPCA Denoising for Diffusion MRI	Ananya Singhal et.al.	2512.04586	null
2025-12-04	Infrared UAV Target Tracking with Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation	Houzhang Fang et.al.	2512.04581	null
2025-12-04	Gauss-Newton accelerated MPPI Control	Hannes Homburger et.al.	2512.04579	null
2025-12-04	Gaussian Fluctuations for the Stochastic Landau-Lifshitz Navier-Stokes Equation in Dimension $D\geq2$	Sotiris Kotitsas et.al.	2512.04567	null
2025-12-03	SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows	Qinyu Zhao et.al.	2512.04084	null
2025-12-03	PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design	Jiazhe Wei et.al.	2512.04082	null
2025-12-03	Multimessenger Constraints on Supermassive Dark Stars and Their Black Hole Remnants	Marco Manno et.al.	2512.04061	null
2025-12-03	Stable Signer: Hierarchical Sign Language Generative Model	Sen Fang et.al.	2512.04048	null
2025-12-03	Machine Learning Pipeline for Denoising Low Signal-To-Noise Ratio and Out-of-Distribution Transmission Electron Microscopy Datasets	Brian Lee et.al.	2512.04045	null
2025-12-03	Quantum theory of nonlinear phononics	Francesco Libbi et.al.	2512.04041	null
2025-12-03	RELIC: Interactive Video World Model with Long-Horizon Memory	Yicong Hong et.al.	2512.04040	null
2025-12-03	Fast & Efficient Normalizing Flows and Applications of Image Generative Models	Sandeep Nagar et.al.	2512.04039	null
2025-12-03	Jina-VLM: Small Multilingual Vision Language Model	Andreas Koukounas et.al.	2512.04032	null
2025-12-03	PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation	Xiaolong Li et.al.	2512.04025	null
2025-12-03	Emergent Outlier View Rejection in Visual Geometry Grounded Transformers	Jisang Han et.al.	2512.04012	null
2025-12-03	Freeze-out and spectral running of primordial gravitational waves in viscous cosmology	Giuseppe Fanizza et.al.	2512.04011	null
2025-12-03	A Strict Comparison Principle for Integro-Differential Hamilton-Jacobi-Bellman Equations on Domains with Boundary	Serena Della Corte et.al.	2512.04005	null
2025-12-03	Mixed finite element approximation for non-divergence form elliptic equations with random input data	Amireh Mousavi et.al.	2512.04003	null
2025-12-03	Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation	Hang Xu et.al.	2512.03996	null
2025-12-03	Needle beams and structured space-time wavepackets	Ruediger Grunwald et.al.	2512.03993	null
2025-12-03	DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation	Zexin Lin et.al.	2512.03992	null
2025-12-03	Applied Neural Network-Based Active Control for Vortex-Induced Vibrations Suppression in a Two-Degree-of-Freedom Cylinder	Soha Ilbeigi et.al.	2512.03990	null
2025-12-03	DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment	Sheng-Hao Liao et.al.	2512.03981	null
2025-12-03	BlurDM: A Blur Diffusion Model for Image Deblurring	Jin-Ting He et.al.	2512.03979	null
2025-12-03	Refining Machine Learning Potentials through Thermodynamic Theory of Phase Transitions	Paul Fuchs et.al.	2512.03974	null
2025-12-03	Bounded-degree graphs of non-negative Ollivier-Ricci curvature have subexponential growth and diffusive random walk	Tom Hutchcroft et.al.	2512.03968	null
2025-12-03	Technical Report on Text Dataset Distillation	Keith Ando Ogawa et.al.	2512.03967	null
2025-12-03	Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization	Lianyu Pang et.al.	2512.03964	null
2025-12-03	TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning	Tao Wu et.al.	2512.03963	null
2025-12-03	MDE-AgriVLN: Agricultural Vision-and-Language Navigation with Monocular Depth Estimation	Xiaobei Zhao et.al.	2512.03958	null
2025-12-03	Collective dynamics of trail-interacting particles	Paul Pineau et.al.	2512.03950	null
2025-12-03	When are novel methods for analyzing complex chemical mixtures in epidemiology beneficial?	Nate Wiecha et.al.	2512.03946	null
2025-12-03	DSP: A Statistically-Principled Structural Polarization Measure	Giulia Preti et.al.	2512.03937	null
2025-12-03	Beyond the Ground Truth: Enhanced Supervision for Image Restoration	Donghun Ryou et.al.	2512.03932	null
2025-12-03	Quantum-Classical Physics-Informed Neural Networks for Solving Reservoir Seepage Equations	Xiang Rao et.al.	2512.03923	null
2025-12-03	UniMo: Unifying 2D Video and 3D Human Motion with an Autoregressive Framework	Youxin Pang et.al.	2512.03918	null
2025-12-03	A microscopic theory of Anderson localization of electrons in random lattices	Václav Janiš et.al.	2512.03917	null
2025-12-03	Parsimonious Clustering of Covariance Matrices	Yixi Xu et.al.	2512.03912	null
2025-12-03	Zero-Shot Video Translation and Editing with Frame Spatial-Temporal Correspondence	Shuai Yang et.al.	2512.03905	null
2025-12-03	Digital Twin-based Control Co-Design of Full Vehicle Active Suspensions via Deep Reinforcement Learning	Ying-Kuan Tsai et.al.	2512.03891	null
2025-12-03	A Hierarchical Tree-based approach for creating Configurable and Static Deep Research Agent (Static-DRA)	Saurav Prateek et.al.	2512.03887	null
2025-12-03	A Modular Architecture Design for Autonomous Driving Racing in Controlled Environments	Brais Fontan-Costas et.al.	2512.03886	null
2025-12-03	OmniDexVLG: Learning Dexterous Grasp Generation from Vision Language Model-Guided Grasp Semantics, Taxonomy and Functional Affordance	Lei Zhang et.al.	2512.03874	null
2025-12-03	Adaptive Parameter Control Using AAN for Lower Limb Rehabilitation Exoskeletons	Zheng Sun et.al.	2512.03871	null
2025-12-03	Generating a Contact Matrix for Aged Care Settings in Australia: an agent-based model study	Haley Stone et.al.	2512.03866	null
2025-12-03	SUP: An Inferable Private Multiple Testing Framework with Super Uniformity	Kehan Wang et.al.	2512.03859	null
2025-12-03	Classification of diffusion processes in dimension $d$ via the Carleman approach with applications to models involving additive, multiplicative or square-root noises	Cecile Monthus et.al.	2512.03857	null
2025-12-03	PULSE: A Unified Multi-Task Architecture for Cardiac Segmentation, Diagnosis, and Few-Shot Cross-Modality Clinical Adaptation	Hania Ghouse et.al.	2512.03848	null
2025-12-03	Fault-Tolerant Control of Steam Temperature in HRSG Superheater under Actuator Fault Using a Sliding Mode Observer and PINN	Mojtaba Fanoodi et.al.	2512.03846	null
2025-12-03	CoDA: From Text-to-Image Diffusion Models to Training-Free Dataset Distillation	Letian Zhou et.al.	2512.03844	null
2025-12-03	Heatmap Pooling Network for Action Recognition from RGB Videos	Mengyuan Liu et.al.	2512.03837	null
2025-12-03	Multi-Agent Deep Reinforcement Learning for UAV-Assisted 5G Network Slicing: A Comparative Study of MAPPO, MADDPG, and MADQN	Ghoshana Bista et.al.	2512.03835	null
2025-12-03	Lean Unet: A Compact Model for Image Segmentation	Ture Hassler et.al.	2512.03834	null
2025-12-03	First Experimental Demonstration of Machine Learning-Based Tuning on the PSI Injector 2 Cyclotron	M. Haj Tahar et.al.	2512.03829	null
2025-12-03	Hopf bifurcations in a reaction-diffusion model with a general advection term and delay effect	Jingxiao Song et.al.	2512.03813	null
2025-12-03	LSRS: Latent Scale Rejection Sampling for Visual Autoregressive Modeling	Hong-Kai Zheng et.al.	2512.03796	null
2025-12-03	AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition	Zichuan Lin et.al.	2512.03794	null
2025-12-03	Deep Unfolding: Recent Developments, Theory, and Design Guidelines	Nir Shlezinger et.al.	2512.03768	null
2025-12-03	Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective	Jingyang Ou et.al.	2512.03759	null
2025-12-03	Fully Unsupervised Self-debiasing of Text-to-Image Diffusion Models	Korada Sri Vardhana et.al.	2512.03749	null
2025-12-03	Widefield Quantum Sensor for Vector Magnetic Field Imaging of Micromagnetic Structures	Orlando D. Cunha et.al.	2512.03748	null
2025-12-03	Thinking with Programming Vision: Towards a Unified View for Thinking with Images	Zirun Guo et.al.	2512.03746	null
2025-12-03	Cross-embodied Co-design for Dexterous Hands	Kehlani Fay et.al.	2512.03743	null
2025-12-03	An arbitrary Lagrangian-Eulerian semi-implicit hybrid method for continuum mechanics with GLM cleaning	Saray Busto et.al.	2512.03741	null
2025-12-03	Out-of-the-box: Black-box Causal Attacks on Object Detectors	Melane Navaratnarajah et.al.	2512.03730	null
2025-12-03	AI/ML in 3GPP 5G Advanced - Services and Architecture	Pradnya Taksande et.al.	2512.03728	null
2025-12-03	A theory-agnostic hierarchical Bayesian framework for black-hole spectroscopy: a case study on GW250114 in Einstein-dilaton-Gauss-Bonnet gravity	Shitong Guo et.al.	2512.03713	null
2025-12-03	Structured Uncertainty Similarity Score (SUSS): Learning a Probabilistic, Interpretable, Perceptual Metric Between Images	Paula Seidler et.al.	2512.03701	null
2025-12-03	Lightweight design and analysis of optical cover plate for exoplanet imaging coronagraph	Lingyi Kong et.al.	2512.03700	null
2025-12-03	Long-term calibration and validation of stability of the Auger Engineering Radio Array using the diffuse Galactic radio emission	The Pierre Auger Collaboration et.al.	2512.03692	null
2025-12-03	GaussianBlender: Instant Stylization of 3D Gaussians with Disentangled Latent Spaces	Melis Ocal et.al.	2512.03683	null
2025-12-03	ConvRot: Rotation-Based Plug-and-Play 4-bit Quantization for Diffusion Transformers	Feice Huang et.al.	2512.03673	null
2025-12-03	ToG-Bench: Task-Oriented Spatio-Temporal Grounding in Egocentric Videos	Qi’ao Xu et.al.	2512.03666	null
2025-12-03	Multi-Scale Visual Prompting for Lightweight Small-Image Classification	Salim Khazem et.al.	2512.03663	null
2025-12-03	Dynamically Scaled Activation Steering	Alex Ferrando et.al.	2512.03661	null
2025-12-03	V-Reactor Dynamics: Dual Chaotic System and Rewriting the Antiviral Response History	Yong-Shou Chen et.al.	2512.03655	null
2025-12-03	Evaluation of Foundational Machine Learned Interatomic Potentials for Migration Barrier Predictions	Achinthya Krishna Bheemaguli et.al.	2512.03642	null
2025-12-03	AlignCheck: a Semantic Open-Domain Metric for Factual Consistency Assessment	Ahmad Aghaebrahimian et.al.	2512.03634	null
2025-12-03	FeatureLens: A Highly Generalizable and Interpretable Framework for Detecting Adversarial Examples Based on Image Features	Zhigang Yang et.al.	2512.03625	null
2025-12-03	The promising potential of vision language models for the generation of textual weather forecasts	Edward C. C. Steele et.al.	2512.03623	null
2025-12-03	ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation	Yaokun Li et.al.	2512.03621	null
2025-12-03	LAMP: Language-Assisted Motion Planning for Controllable Video Generation	Muhammed Burak Kizil et.al.	2512.03619	null
2025-12-03	Drag reduction via separation control using plasma actuators on a truck cabin side	Lucas Schneeberger et.al.	2512.03613	null
2025-12-03	A Perception-feedback position-tracking control for quadrotors	Eduardo Espindola et.al.	2512.03605	null
2025-12-03	Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding	Haoran Zhou et.al.	2512.03601	null
2025-12-03	CloseUpAvatar: High-Fidelity Animatable Full-Body Avatars with Mixture of Multi-Scale Textures	David Svitov et.al.	2512.03593	null
2025-12-03	Beyond Boundary Frames: Audio-Visual Semantic Guidance for Context-Aware Video Interpolation	Yuchen Deng et.al.	2512.03590	null
2025-12-03	Dynamic Optical Test for Bot Identification (DOT-BI): A simple check to identify bots in surveys and online processes	Malte Bleeker et.al.	2512.03580	null
2025-12-03	Global-Local Aware Scene Text Editing	Fuxiang Yang et.al.	2512.03574	null
2025-12-03	GAOT: Generating Articulated Objects Through Text-Guided Diffusion Models	Hao Sun et.al.	2512.03566	null
2025-12-03	Towards Irreversible Machine Unlearning for Diffusion Models	Xun Yuan et.al.	2512.03564	null
2025-12-03	Dynamic Content Moderation in Livestreams: Combining Supervised Classification with MLLM-Boosted Similarity Matching	Wei Chee Yew et.al.	2512.03553	null
2025-12-03	V-ITI: Mitigating Hallucinations in Multimodal Large Language Models via Visual Inference-Time Intervention	Nan Sun et.al.	2512.03542	null
2025-12-03	CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation	Ruoxuan Zhang et.al.	2512.03540	null
2025-12-03	AdaPower: Specializing World Foundation Models for Predictive Manipulation	Yuhang Huang et.al.	2512.03538	null
2025-12-03	Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation	Subin Kim et.al.	2512.03534	null
2025-12-03	Resolving the Multiple Component Outflows in PG 1211+143: I. The Fe-K Absorption Structure and UFO Forest	Misaki Mizumoto et.al.	2512.03533	null
2025-12-03	Multi-probe analysis of strong-field effects in $f(Q)$ gravity	Mohsen Khodadi et.al.	2512.03529	null
2025-12-03	Market share maximizing strategies of CAV fleet operators may cause chaos in our cities	Grzegorz Jamróz et.al.	2512.03524	null
2025-12-03	FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation	Yiyi Cai et.al.	2512.03520	null
2025-12-03	M3DR: Towards Universal Multilingual Multimodal Document Retrieval	Adithya S Kolavi et.al.	2512.03514	null
2025-12-03	CSMapping: Scalable Crowdsourced Semantic Mapping and Topology Inference for Autonomous Driving	Zhijian Qiao et.al.	2512.03510	null
2025-12-03	Exploiting Domain Properties in Language-Driven Domain Generalization for Semantic Segmentation	Seogkyu Jeon et.al.	2512.03508	null
2025-12-03	Complex Wigner entropy and Fisher control of negativity in an oval quantum billiard	Kyu-Won Park et.al.	2512.03505	null
2025-12-02	MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues	Zichen Liu et.al.	2512.03046	null
2025-12-02	CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models	Minkyung Kwon et.al.	2512.03045	null
2025-12-02	Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling	Yueru Jia et.al.	2512.03044	null
2025-12-02	OneThinker: All-in-one Reasoning Model for Image and Video	Kaituo Feng et.al.	2512.03043	null
2025-12-02	PPTArena: A Benchmark for Agentic PowerPoint Editing	Michael Ofengenden et.al.	2512.03042	null
2025-12-02	MultiShotMaster: A Controllable Multi-Shot Video Generation Framework	Qinghe Wang et.al.	2512.03041	null
2025-12-02	Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation	Zeqi Xiao et.al.	2512.03040	null
2025-12-02	ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation	Mengchen Zhang et.al.	2512.03036	null
2025-12-02	MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation	Youxin Pang et.al.	2512.03034	null
2025-12-02	SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control	Yuxuan Mu et.al.	2512.03028	null
2025-12-02	Unrolled Networks are Conditional Probability Flows in MRI Reconstruction	Kehan Qi et.al.	2512.03020	null
2025-12-02	AutoBrep: Autoregressive B-Rep Generation with Unified Topology and Geometry	Xiang Xu et.al.	2512.03018	null
2025-12-02	Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks	Matthew Dutson et.al.	2512.03014	null
2025-12-02	In-Context Sync-LoRA for Portrait Video Editing	Sagi Polaczek et.al.	2512.03013	null
2025-12-02	DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images	Xiaoxue Chen et.al.	2512.03004	null
2025-12-02	Invasive Context Engineering to Control Large Language Models	Thomas Rivasseau et.al.	2512.03001	null
2025-12-02	TEXTRIX: Latent Attribute Grid for Native Texture Generation and Beyond	Yifei Zeng et.al.	2512.02993	null
2025-12-02	U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences	Xiang Xu et.al.	2512.02982	null
2025-12-02	Tunable polarization-entangled near-infrared photons from orthogonal GaAs nanowires	Elise Bailly-Rioufreyt et.al.	2512.02980	null
2025-12-02	Stochastic parallel transport on the Wasserstein space and equivariant diffusions on the group of diffeomorphisms over a closed Riemannian manifold	Aymeric Martin et.al.	2512.02975	null
2025-12-02	Altermagnetoelectric Spin Field Effect Transistor	Ziye Zhu et.al.	2512.02974	null
2025-12-02	Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities	Yuan Xiong et.al.	2512.02973	null
2025-12-02	BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection	Guowen Zhang et.al.	2512.02972	null
2025-12-02	The Convex Matching Distance in Multiparameter Persistence	Patrizio Frosini et.al.	2512.02944	null
2025-12-02	Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench	Lanxiang Hu et.al.	2512.02942	null
2025-12-02	LoVoRA: Text-guided and Mask-free Video Object Removal and Addition with Learnable Object-aware Localization	Zhihan Xiao et.al.	2512.02933	null
2025-12-02	DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation	Ying Yang et.al.	2512.02931	null
2025-12-02	Asymptotics for additive functionals of particle systems via Stein’s method	Arturo Jaramillo et.al.	2512.02922	null
2025-12-02	Maintaining SUV Accuracy in Low-Count PET with PETfectior: A Deep Learning Denoising Solution	Yamila Rotstein Habarnau et.al.	2512.02917	null
2025-12-02	MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding	Fan Yang et.al.	2512.02906	null
2025-12-02	Glance: Accelerating Diffusion Models with 1 Sample	Zhuobai Dong et.al.	2512.02899	null
2025-12-02	Ridged geometries induce axial flow vortices in Couette systems	Akankshya Majhi et.al.	2512.02896	null
2025-12-02	Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules	Amr Mohamed et.al.	2512.02892	null
2025-12-02	Perceptual evaluation of Acoustic Level of Detail in Virtual Acoustic Environments	Stefan Fichna et.al.	2512.02891	null
2025-12-02	Taming Camera-Controlled Video Generation with Verifiable Geometry Reward	Zhaoqing Wang et.al.	2512.02870	null
2025-12-02	Leveraging generative adversarial networks with spatially adaptive denormalization for multivariate stochastic seismic data inversion	Roberto Miele et.al.	2512.02863	null
2025-12-02	PAC-Bayesian Optimal Control with Stability and Generalization Guarantees	Mahrokh Ghoddousi Boroujeni et.al.	2512.02858	null
2025-12-02	SwarmDiffusion: End-To-End Traversability-Guided Diffusion for Embodiment-Agnostic Navigation of Heterogeneous Robots	Iana Zhura et.al.	2512.02851	null
2025-12-02	Are Detectors Fair to Indian IP-AIGC? A Cross-Generator Study	Vishal Dubey et.al.	2512.02850	null
2025-12-02	Action Anticipation at a Glimpse: To What Extent Can Multimodal Cues Replace Video?	Manuel Benavent-Lledo et.al.	2512.02846	null
2025-12-02	VLM as Strategist: Adaptive Generation of Safety-critical Testing Scenarios via Guided Diffusion	Xinzheng Wu et.al.	2512.02844	null
2025-12-02	Experimental Blueprint for Distinguishing Decoherence from Objective Collapse	Ridha Horchani et.al.	2512.02838	null
2025-12-02	ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning	Yifan Li et.al.	2512.02835	null
2025-12-02	Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach	Siyuan Yang et.al.	2512.02834	null
2025-12-02	A Comparative Study on How Data Normalization Affects Zero-Shot Generalization in Time Series Foundation Models	Ihab Ahmed et.al.	2512.02833	null
2025-12-02	From Navigation to Refinement: Revealing the Two-Stage Nature of Flow-based Diffusion Models through Oracle Velocity	Haoming Liu et.al.	2512.02826	null
2025-12-02	BOOM: Beyond Only One Modality KIT’s Multimodal Multilingual Lecture Companion	Sai Koneru et.al.	2512.02817	null
2025-12-02	Radiologist Copilot: An Agentic Assistant with Orchestrated Tools for Radiology Reporting with Quality Control	Yongrui Yu et.al.	2512.02814	null
2025-12-02	Vessel Network Topology in Molecular Communication: Insights from Experiments and Theory	Timo Jakumeit et.al.	2512.02811	null
2025-12-02	Swarming by curvature control in arbitrary dimension	Pierre Degond et.al.	2512.02800	null
2025-12-02	Towards Observation Lakehouses: Living, Interactive Archives of Software Behavior	Marcus Kessel et.al.	2512.02795	null
2025-12-02	HUD: Hierarchical Uncertainty-Aware Disambiguation Network for Composed Video Retrieval	Zhiwei Chen et.al.	2512.02792	null
2025-12-02	Making Dialogue Grounding Data Rich: A Three-Tier Data Synthesis Framework for Generalized Referring Expression Comprehension	Juexi Shao et.al.	2512.02791	null
2025-12-02	FiMMIA: scaling semantic perturbation-based membership inference across modalities	Anton Emelyanov et.al.	2512.02786	null
2025-12-02	Martingales, laminates and minimal Korn inequalities	Gabriele Cassese et.al.	2512.02784	null
2025-12-02	Universality and Falsifiability of Quantum Spacetime Decoherence: A Gauge-Invariant Framework for Gravitational-Wave Phase Diffusion	Hu Cang et.al.	2512.02782	null
2025-12-02	LumiX: Structured and Coherent Text-to-Intrinsic Generation	Xu Han et.al.	2512.02781	null
2025-12-02	Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and Dataset	Qifan Liang et.al.	2512.02780	null
2025-12-02	Enhanced particle diffusion in fluctuating binary environments	Fivos Perakis et.al.	2512.02776	null
2025-12-02	Diffusion-Prior Split Gibbs Sampling for Synthetic Aperture Radar Imaging under Incomplete Measurements	Hefei Gao et.al.	2512.02768	null
2025-12-02	Structural Properties of Entropic Vectors and Stability of the Ingleton Inequality	Rostislav Matveev et.al.	2512.02767	null
2025-12-02	PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models	Robert Belanec et.al.	2512.02764	null
2025-12-02	Channel Knowledge Map Construction via Physics-Inspired Diffusion Model Without Prior Observations	Yunzhe Zhu et.al.	2512.02757	null
2025-12-02	Reasoning-Aware Multimodal Fusion for Hateful Video Detection	Shuonan Yang et.al.	2512.02743	null
2025-12-02	Protein Diffusion and Stokes-Einstein Deviation in Supercooled Cryoprotectant Solutions	Maddalena Bin et.al.	2512.02742	null
2025-12-02	Beyond Paired Data: Self-Supervised UAV Geo-Localization from Reference Imagery Alone	Tristan Amadei et.al.	2512.02737	null
2025-12-02	Self-Improving AI Agents through Self-Play	Przemyslaw Chojecki et.al.	2512.02731	null
2025-12-02	Emergent Bayesian Behaviour and Optimal Cue Combination in LLMs	Julian Ma et.al.	2512.02719	null
2025-12-02	GeoViS: Geospatially Rewarded Visual Search for Remote Sensing Visual Grounding	Peirong Zhang et.al.	2512.02715	null
2025-12-02	Training Data Attribution for Image Generation using Ontology-Aligned Knowledge Graphs	Theodoros Aivalis et.al.	2512.02713	null
2025-12-02	Adaptive hydrogels with spatiotemporal stiffening using pH-modulating enzymes	Natascha Gray et.al.	2512.02698	null
2025-12-02	GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization	Zixuan Song et.al.	2512.02697	null
2025-12-02	ALDI-ray: Adapting the ALDI Framework for Security X-ray Object Detection	Omid Reza Heidari et.al.	2512.02696	null
2025-12-02	Double-Helix based Real-Time Single Particle Tracking	Md Faysal Hossain et.al.	2512.02695	null
2025-12-02	ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data	Yuxing Liu et.al.	2512.02686	null
2025-12-02	PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution	Zhongbao Yang et.al.	2512.02681	null
2025-12-02	Exploring Depth Generalization in Large Language Models for Solving Recursive Logic Tasks	Zhiyuan He et.al.	2512.02677	null
2025-12-02	Graph VQ-Transformer (GVT): Fast and Accurate Molecular Generation via High-Fidelity Discrete Latents	Haozhuo Zheng et.al.	2512.02667	null
2025-12-02	Modal Analysis of Core Inertial Dynamics: Re-evaluating Grid-Forming Control Design Principles	Gerardo Medrano et.al.	2512.02662	null
2025-12-02	Spatially-Grounded Document Retrieval via Patch-to-Region Relevance Propagation	Agathoklis Georgiou et.al.	2512.02660	null
2025-12-02	Distill, Forget, Repeat: A Framework for Continual Unlearning in Text-to-Image Diffusion Models	Naveen George et.al.	2512.02657	null
2025-12-02	Hear What Matters! Text-conditioned Selective Video-to-Audio Generation	Junwon Lee et.al.	2512.02650	null
2025-12-02	Leveraging Large-Scale Pretrained Spatial-Spectral Priors for General Zero-Shot Pansharpening	Yongchuan Cui et.al.	2512.02643	null
2025-12-02	Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models	Xinyue Ai et.al.	2512.02636	null
2025-12-02	GaN mid-IR plasmonics: low-loss epsilon-near-zero modes	Julia Inglés-Cerrillo et.al.	2512.02632	null
2025-12-02	RULER-Bench: Probing Rule-based Reasoning Abilities of Next-level Video Generation Models for Vision Foundation Intelligence	Xuming He et.al.	2512.02622	null
2025-12-02	Modeling and Inverse Identification of Interfacial Heat Conduction in Finite Layer and Semi-Infinite Substrate Systems via a Physics-Guided Neural Framework	Wenhao Sha et.al.	2512.02618	null
2025-12-02	Interface Correlators in Symmetric Product Orbifolds	Sebastian Harris et.al.	2512.02616	null
2025-12-02	A unified optical platform for non-Gaussian and fault-tolerant Gottesman-Kitaev-Preskill states	Ozlem Erkilic et.al.	2512.02607	null
2025-12-02	Radio, X-ray, and EUV signatures of internal and external reconnection of an erupting flux rope	Jana Kašparová et.al.	2512.02594	null
2025-12-02	GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies	Chubin Zhang et.al.	2512.02581	null
2025-12-02	Co-speech Gesture Video Generation via Motion-Based Graph Retrieval	Yafei Song et.al.	2512.02576	null
2025-12-02	Predictive Beamforming in Low-Altitude Wireless Networks: A Cross-Attention Approach	Xiaotong Zhao et.al.	2512.02563	null
2025-12-02	EZYer: A simulacrum of high school with generative agent	Jinming Yang et.al.	2512.02561	null
2025-12-02	Empathy Level Prediction in Multi-Modal Scenario with Supervisory Documentation Assistance	Yufei Xiao et.al.	2512.02558	null
2025-12-02	OmniPerson: Unified Identity-Preserving Pedestrian Generation	Changxiao Ma et.al.	2512.02554	null
2025-12-02	Size control guidelines for chemically active droplets	Guido Kusters et.al.	2512.02542	null
2025-12-02	On the spectral geometry of Liouville quantum gravity	Nathanaël Berestycki et.al.	2512.02538	null
2025-12-02	WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens	Jian Yang et.al.	2512.02536	null
2025-12-02	AID: Agent Intent from Diffusion for Multi-Agent Informative Path Planning	Jeric Lew et.al.	2512.02535	null
2025-12-01	EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI	Jianlei Chang et.al.	2512.02020	null
2025-12-01	A Diffusion Model Framework for Maximum Entropy Reinforcement Learning	Sebastian Sanokowski et.al.	2512.02019	null
2025-12-01	Data-Centric Visual Development for Self-Driving Labs	Anbang Liu et.al.	2512.02018	null
2025-12-01	Objects in Generated Videos Are Slower Than They Appear: Models Suffer Sub-Earth Gravity and Don’t Know Galileo’s Principle…for now	Varun Varma Thozhiyoor et.al.	2512.02016	null
2025-12-01	Generative Video Motion Editing with 3D Point Tracks	Yao-Chih Lee et.al.	2512.02015	null
2025-12-01	TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models	Zhiheng Liu et.al.	2512.02014	null
2025-12-01	ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation	Chenyang Gu et.al.	2512.02013	null
2025-12-01	Learning Dexterous Manipulation Skills from Imperfect Simulations	Elvis Hsieh et.al.	2512.02011	null
2025-12-01	Learning Visual Affordance from Audio	Lidong Lu et.al.	2512.02005	null
2025-12-01	AlignSAE: Concept-Aligned Sparse Autoencoders	Minglai Yang et.al.	2512.02004	null
2025-12-01	JWST & the Waz Arc I: Spatially Resolving the Physical Conditions within a Post-Starburst Galaxy at Redshift 5 with NIRSpec IFS	Taylor A. Hutchison et.al.	2512.02000	null
2025-12-01	Neural steering vectors reveal dose and exposure-dependent impacts of human-AI relationships	Hannah Rose Kirk et.al.	2512.01991	null
2025-12-01	PAI-Bench: A Comprehensive Benchmark For Physical AI	Fengzhe Zhou et.al.	2512.01989	null
2025-12-01	Forecasting in Offline Reinforcement Learning for Non-stationary Environments	Suzan Ece Ada et.al.	2512.01987	null
2025-12-01	ECO: Energy-Constrained Operator Learning for Chaotic Dynamics with Boundedness Guarantees	Andrea Goertzen et.al.	2512.01984	null
2025-12-01	Orientational lineage memory and mechanical ordering during diffusion-limited growth	Ilias-Marios Sarris et.al.	2512.01981	null
2025-12-01	Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback	Aiden Yiliu Li et.al.	2512.01979	null
2025-12-01	SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning	Xu Zhang et.al.	2512.01975	null
2025-12-01	From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning	Sitao Cheng et.al.	2512.01970	null
2025-12-01	SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation	Zisu Li et.al.	2512.01960	null
2025-12-01	Admittance and critical current of nonreciprocal Josephson junctions	Tony Liu et.al.	2512.01955	null
2025-12-01	GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment	Haoyang He et.al.	2512.01952	null
2025-12-01	Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models	Zhongyu Yang et.al.	2512.01949	null
2025-12-01	Guardian: Detecting Robotic Planning and Execution Errors with Vision-Language Models	Paul Pacaud et.al.	2512.01946	null
2025-12-01	Med-VCD: Mitigating Hallucination for Medical Large Vision Language Models through Visual Contrastive Decoding	Zahra Mahdavi et.al.	2512.01922	null
2025-12-01	Disentangling Progress in Medical Image Registration: Beyond Trend-Driven Architectures towards Domain-Specific Strategies	Bailiang Jian et.al.	2512.01913	null
2025-12-01	*First-principles band alignment engineering in polar and nonpolar orientations for wurtzite AlN, GaN, and B $x$Al${1-x}$ N alloys*	Cody L Milne et.al.	2512.01907	null
2025-12-01	StyleYourSmile: Cross-Domain Face Retargeting Without Paired Multi-Style Data	Avirup Dey et.al.	2512.01895	null
2025-12-01	Improving Phishing Resilience with AI-Generated Training: Evidence on Prompting, Personalization, and Duration	Francesco Greco et.al.	2512.01893	null
2025-12-01	COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis	Tsz-To Wong et.al.	2512.01853	null
2025-12-01	Dispersion-Mediated Space-Time States	Klaas De Kinder et.al.	2512.01849	null
2025-12-01	JPEGs Just Got Snipped: Croppable Signatures Against Deepfake Images	Pericle Perazzo et.al.	2512.01845	null
2025-12-01	PhyDetEx: Detecting and Explaining the Physical Plausibility of T2V Models	Zeqing Wang et.al.	2512.01843	null
2025-12-01	Uniform Norm Error Estimates for 2D Turning Point Problem	Shallu et.al.	2512.01841	null
2025-12-01	A Fluctuation-Dissipation Structure of Quantum Dynamical Semigroups Reveals a Unique Internal Hamiltonian	Fabricio Toscano et.al.	2512.01840	null
2025-12-01	Deconstructing Generative Diversity: An Information Bottleneck Analysis of Discrete Latent Generative Models	Yudi Wu et.al.	2512.01831	null
2025-12-01	Heterogeneous diffusion process with power-law nonlinearity	Jorge E. Cardona et.al.	2512.01828	null
2025-12-01	Seeing through Imagination: Learning Scene Geometry via Implicit Spatial World Modeling	Meng Cao et.al.	2512.01821	null
2025-12-01	Dimension-free error estimate for diffusion model and optimal scheduling	Valentin de Bortoli et.al.	2512.01820	null
2025-12-01	Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights	Juanxi Tian et.al.	2512.01816	null
2025-12-01	Much Ado About Noising: Dispelling the Myths of Generative Robotic Control	Chaoyi Pan et.al.	2512.01809	null
2025-12-01	Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos	Xavier Thomas et.al.	2512.01803	null
2025-12-01	GR-RL: Going Dexterous and Precise for Long-Horizon Robotic Manipulation	Yunfei Li et.al.	2512.01801	null
2025-12-01	H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons	Cheng Gao et.al.	2512.01797	null
2025-12-01	Evaluating SAM2 for Video Semantic Segmentation	Syed Hesham Syed Ariff et.al.	2512.01774	null
2025-12-01	IGen: Scalable Data Generation for Robot Learning from Open-World Images	Chenghao Gu et.al.	2512.01773	null
2025-12-01	VideoScoop: A Non-Traditional Domain-Independent Framework For Video Analysis	Hafsa Billah et.al.	2512.01769	null
2025-12-01	Weight Space Representation Learning with Neural Fields	Zhuoqian Yang et.al.	2512.01759	null
2025-12-01	Mofasa: A Step Change in Metal-Organic Framework Generation	Vaidotas Simkus et.al.	2512.01756	null
2025-12-01	Theory And Applications Of One-Sided Coupled Operator Matrices	Marjeta Kramar et.al.	2512.01719	null
2025-12-01	StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos	Daeun Lee et.al.	2512.01707	null
2025-12-01	Self-Organized Freeform Waveguiding	Fadhila Chehami et.al.	2512.01699	null
2025-12-01	Transient laser-induced periodic surface structures revealed by time-resolved EUV diffuse scattering	D. Ksenzov et.al.	2512.01696	null
2025-12-01	Gravitational lensing inside and outside of a marginally unstable photon sphere in a general, static, spherically symmetric, and asymptotically-flat spacetime in strong deflection limits	Naoki Tsukamoto et.al.	2512.01688	null
2025-12-01	DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models	Patrick Kwon et.al.	2512.01686	null
2025-12-01	An innovative circuit for testing hot carrier and trap generation in GaN Devices	Moshe Azoulay et.al.	2512.01683	null
2025-12-01	Open-world Hand-Object Interaction Video Generation Based on Structure and Contact-aware Representation	Haodong Yan et.al.	2512.01677	null
2025-12-01	GRASP: Guided Residual Adapters with Sample-wise Partitioning	Felix Nützel et.al.	2512.01675	null
2025-12-01	Dynamic Log-Gaussian Process Control Barrier Function for Safe Robotic Navigation in Dynamic Environments	Xin Yin et.al.	2512.01668	null
2025-12-01	HalluGraph: Auditable Hallucination Detection for Legal RAG Systems via Knowledge Graph Alignment	Valentin Noël et.al.	2512.01659	null
2025-12-01	Textured Word-As-Image illustration	Mohammad Javadian Farzaneh et.al.	2512.01648	null
2025-12-01	ViT $^3$ : Unlocking Test-Time Training in Vision	Dongchen Han et.al.	2512.01643	null
2025-12-01	Searching for EeV photons with Telescope Array Surface Detector and neural networks	Telescope Array Collaboration et.al.	2512.01638	null
2025-12-01	Exploring Scavenging Strategies and Cognitive Problem-Solving in Indian Free-Ranging Dogs	Tuhin Subhra Pal et.al.	2512.01637	null
2025-12-01	Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval	Xin Wang et.al.	2512.01636	null
2025-12-01	Distinguish the Orientation of Sliding Ferroelectricity by Second-Harmonic Generation	Fengfeng Ye et.al.	2512.01635	null
2025-12-01	SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge	Yumeng He et.al.	2512.01629	null
2025-12-01	Temperature Dependence of Charge and Exciton Transport in One-Dimensional Systems Subject to Static and Dynamic Disorder	William Barford et.al.	2512.01625	null
2025-12-01	Unsupervised and Supervised Algorithms for Identification of Sample Pixels in FTIR Images	Xiangyu Zhao et.al.	2512.01585	null
2025-12-01	Combined Effects of Transient Ionizing and Electromagnetic Pulse on Vertical NPN Bipolar Transistor	Meiqing Zhong et.al.	2512.01573	null
2025-12-01	Reconstructing Multi-Scale Physical Fields from Extremely Sparse Measurements with an Autoencoder-Diffusion Cascade	Letian Yi et.al.	2512.01572	null
2025-12-01	Deep FlexQP: Accelerated Nonlinear Programming via Deep Unfolding	Alex Oshin et.al.	2512.01565	null
2025-12-01	LLM2Fx-Tools: Tool Calling For Music Post-Production	Seungheon Doh et.al.	2512.01559	null
2025-12-01	LEC: Linear Expectation Constraints for False-Discovery Control in Selective Prediction and Routing Systems	Zhiyuan Wang et.al.	2512.01556	null
2025-12-01	RoMe: Row Granularity Access Memory System for Large Language Models	Hwayong Nam et.al.	2512.01541	null
2025-12-01	FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention	Zipeng Wang et.al.	2512.01540	null
2025-12-01	Deep Unsupervised Anomaly Detection in Brain Imaging: Large-Scale Benchmarking and Bias Analysis	Alexander Frotscher et.al.	2512.01534	null
2025-12-01	Diffusion Fuzzy System: Fuzzy Rule Guided Latent Multi-Path Diffusion Modeling	Hailong Yang et.al.	2512.01533	null
2025-12-01	Functional-Analytic Justification of the Time-Domain Foldy-Lax Approximation for Dispersive Acoustic Media: A Feynman-Diagram Viewpoint	Arpan Mukherjee et.al.	2512.01532	null
2025-12-01	Near-infrared polarimetric imaging with nonlinear flat-optics	Evgenii Menshikov et.al.	2512.01525	null
2025-12-01	Massart iron oxide nanoparticles in mechanobiology	Myriam Reffay et.al.	2512.01524	null
2025-12-01	Investigation of Al-Si-Cu alloys as phase change materials for high temperature thermal energy storage	Laura Teodorescu et.al.	2512.01521	null
2025-12-01	QuantumCanvas: A Multimodal Benchmark for Visual Learning of Atomic Interactions	Can Polat et.al.	2512.01519	null
2025-12-01	Neural Networks for Predicting Permeability Tensors of 2D Porous Media: Comparison of Convolution- and Transformer-based Architectures	Sigurd Vargdal et.al.	2512.01517	null
2025-12-01	Enhanced detection limits in the SHINE F150 survey through the Regime Switching Model Optimizing thresholds and investigating environmental noise	Mariam Sabalbal et.al.	2512.01511	null
2025-12-01	Semantic-aware Random Convolution and Source Matching for Domain Generalization in Medical Image Segmentation	Franz Thaler et.al.	2512.01510	null
2025-12-01	Multi-view diffusion geometry using intertwined diffusion trajectories	Gwendal Debaussart-Joniec et.al.	2512.01484	null
2025-12-01	ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling	Qisen Wang et.al.	2512.01481	null
2025-12-01	All-optical directional switching of non-thermal photocurrents in plasmonic nanocircuits	Roméo Zapata et.al.	2512.01480	null
2025-12-01	A Unified Bayesian Framework for Stochastic Data-Driven Smoothing, Prediction, and Control	Mingzhou Yin et.al.	2512.01475	null
2025-12-01	Does Flatness imply Generalization for Logistic Loss in Univariate Two-Layer ReLU Network?	Dan Qiao et.al.	2512.01473	null
2025-12-01	Automated Risk-of-Bias Assessment of Randomized Controlled Trials: A First Look at a GEPA-trained Programmatic Prompting Framework	Lingbo Li et.al.	2512.01452	null
2025-12-01	Hawkes process with a diffusion-driven baseline: long-run behavior, inference, statistical tests	Maya Sadeler Perrin et.al.	2512.01447	null
2025-12-01	Statistical Properties of the Rooted-Tree Encoding of $\mathbb{N}$	Pierluigi Contucci et.al.	2512.01436	null
2025-12-01	Existence of two thresholds in a bistable equation with nonlocal competition	Matthieu Alfaro et.al.	2512.01435	null
2025-12-01	A Flexible Multi-Agent LLM-Human Framework for Fast Human Validated Tool Building	Daull Xavier et.al.	2512.01434	null
2025-12-01	Language-Guided Open-World Anomaly Segmentation	Klara Reichard et.al.	2512.01427	null
2025-12-01	ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers	Yiyang Ma et.al.	2512.01426	null
2025-12-01	An Investigation of Thermal Properties of Cu-Au Janus Nanoparticles	Mehmet Akif Cebeci et.al.	2512.01425	null
2025-12-01	\textit{ViRectify}: A Challenging Benchmark for Video Reasoning Correction with Multimodal Large Language Models	Xusen Hei et.al.	2512.01424	null
2025-11-28	Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models	Muhammad Maaz et.al.	2511.23478	null
2025-11-28	Video-CoM: Interactive Video Reasoning via Chain of Manipulations	Hanoona Rasheed et.al.	2511.23477	null
2025-11-28	AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement	Zhizhou Zhong et.al.	2511.23475	null
2025-11-28	Visual Generation Tuning	Jiahao Guo et.al.	2511.23469	null
2025-11-28	SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments	Xinyi Li et.al.	2511.23465	null
2025-11-28	Arbitrary control of the temporal waveform of photons during spontaneous emission	Carl Thomas et.al.	2511.23462	null
2025-11-28	Convergence and front position for an FKPP-type free boundary problem	Julien Berestycki et.al.	2511.23457	null
2025-11-28	Toric structure of the moduli space of points in projective space	Marwan Bit et.al.	2511.23456	null
2025-11-28	Convergence rates of self-repellent random walks, their local time and Event Chain Monte Carlo	Andreas Eberle et.al.	2511.23453	null
2025-11-28	Object-Centric Data Synthesis for Category-level Object Detection	Vikhyat Agarwal et.al.	2511.23450	null
2025-11-28	Physics-Informed Neural Networks for Thermophysical Property Retrieval	Ali Waseem et.al.	2511.23449	null
2025-11-28	Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent	Jianzhe Lin et.al.	2511.23436	null
2025-11-28	Consensus Tree Estimation with False Discovery Rate Control via Partially Ordered Sets	Maria Alejandra Valdez Cabrera et.al.	2511.23433	null
2025-11-28	Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model	Junshu Tang et.al.	2511.23429	null
2025-11-28	DisMo: Disentangled Motion Representations for Open-World Motion Transfer	Thomas Ressler-Antal et.al.	2511.23428	null
2025-11-28	Global well-posedness for hyperbolic SPDEs with non-Lipschitz coefficients driven by space-time Lévy white noise	Raluca M. Balan et.al.	2511.23420	null
2025-11-28	Analytical Fresnel Treatment of Double-Slit Diffraction with Multiple Coherent Waves	J. Sumaya-Martinez et.al.	2511.23394	null
2025-11-28	VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction	Sinan Du et.al.	2511.23386	null
2025-11-28	Identifying bars in galaxies using machine learning	Rajit Shrivastava et.al.	2511.23383	null
2025-11-28	DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline	Rui Zhang et.al.	2511.23377	null
2025-11-28	Optimizing Multimodal Language Models through Attention-based Interpretability	Alexander Sergeev et.al.	2511.23375	null
2025-11-28	Design, modelling and experimental validation of bipenniform shape memory alloy-based linear actuator integrable with hydraulic stroke amplification mechanism	Kanhaiya Lal Chaurasiya et.al.	2511.23372	null
2025-11-28	SimScale: Learning to Drive via Real-World Simulation at Scale	Haochen Tian et.al.	2511.23369	null
2025-11-28	A Hierarchical Computer Vision Pipeline for Physiological Data Extraction from Bedside Monitors	Vinh Chau et.al.	2511.23355	null
2025-11-28	Convergence rates of self-repelling diffusions on Riemannian manifolds	Francis Lörler et.al.	2511.23333	null
2025-11-28	UniGeoSeg: Towards Unified Open-World Segmentation for Geospatial Scenes	Shuo Ni et.al.	2511.23332	null
2025-11-28	A Perceptually Inspired Variational Framework for Color Enhancement	Rodrigo Palma-Amestoy et.al.	2511.23329	null
2025-11-28	Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach	Haruki Sakajo et.al.	2511.23311	null
2025-11-28	SafeHumanoid: VLM-RAG-driven Control of Upper Body Impedance for Humanoid Robot	Yara Mahmoud et.al.	2511.23300	null
2025-11-28	Signature approach for pricing and hedging path-dependent options with frictions	Eduardo Abi Jaber et.al.	2511.23295	null
2025-11-28	MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report)	Aaron Steiner et.al.	2511.23281	null
2025-11-28	Deep Learning for Restoring MPI System Matrices Using Simulated Training Data	Artyom Tsanda et.al.	2511.23251	null
2025-11-28	Existence of solutions and uniform bounds for the stationary semiconductor equations with generation and ionic carriers	Dilara Abdel et.al.	2511.23250	null
2025-11-28	Synthetic Industrial Object Detection: GenAI vs. Feature-Based Methods	Jose Moises Araya-Martinez et.al.	2511.23241	null
2025-11-28	Robust 3DGS-based SLAM via Adaptive Kernel Smoothing	Shouhe Zhang et.al.	2511.23221	null
2025-11-28	Compact localized currents in flat bands with broken time-reversal symmetry	Rohit Kishan Ray et.al.	2511.23218	null
2025-11-28	Field-programmable dynamics in a soft magnetic actuator enabling true random number generation and reservoir computing	Eduardo Sergio Oliveros-Mata et.al.	2511.23215	null
2025-11-28	Vision Bridge Transformer at Scale	Zhenxiong Tan et.al.	2511.23199	null
2025-11-28	Diffusion through complex confining environments: fluctuating triply periodic minimal surfaces	Jakob Mihatsch et.al.	2511.23192	null
2025-11-28	GeoWorld: Unlocking the Potential of Geometry Models to Facilitate High-Fidelity 3D Scene Generation	Yuhao Wan et.al.	2511.23191	null
2025-11-28	Fast Multi-view Consistent 3D Editing with Video Priors	Liyi Chen et.al.	2511.23172	null
2025-11-28	PowerCLIP: Powerset Alignment for Contrastive Pre-Training	Masaki Kawamura et.al.	2511.23170	null
2025-11-28	Large-amplitude Variability Driven by Giant Dust Storms on a Planetary-mass Companion	Xianyu Tan et.al.	2511.23163	null
2025-11-28	REVEAL: Reasoning-enhanced Forensic Evidence Analysis for Explainable AI-generated Image Detection	Huangsen Cao et.al.	2511.23158	null
2025-11-28	InstanceV: Instance-Level Video Generation	Yuheng Chen et.al.	2511.23146	null
2025-11-28	DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation	Hongfei Zhang et.al.	2511.23127	null
2025-11-28	Evolutionary Discovery of Heuristic Policies for Traffic Signal Control	Ruibing Wang et.al.	2511.23122	null
2025-11-28	Freeze, Diffuse, Decode: Geometry-Aware Adaptation of Pretrained Transformer Embeddings for Antimicrobial Peptide Design	Pankhil Gawade et.al.	2511.23120	null
2025-11-28	Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM	Mengjie Liu et.al.	2511.23119	null
2025-11-28	Analyzing Image Beyond Visual Aspect: Image Emotion Classification via Multiple-Affective Captioning	Zibo Zhou et.al.	2511.23115	null
2025-11-28	A Spectral Koopman Approximation Framework for Stochastic Reaction Networks	Ankit Gupta et.al.	2511.23114	null
2025-11-28	db-SP: Accelerating Sparse Attention for Visual Generative Models with Dual-Balanced Sequence Parallelism	Siqi Chen et.al.	2511.23113	null
2025-11-28	MathSight: A Benchmark Exploring Have Vision-Language Models Really Seen in University-Level Mathematical Reasoning?	Yuandong Wang et.al.	2511.23112	null
2025-11-28	NumeriKontrol: Adding Numeric Control to Diffusion Transformers for Instruction-based Image Editing	Zhenyu Xu et.al.	2511.23105	null
2025-11-28	On Computational Aspects of Ordered Matching Problems	Michal Čertík et.al.	2511.23093	null
2025-11-28	Bharat Scene Text: A Novel Comprehensive Dataset and Benchmark for Indian Language Scene Text Understanding	Anik De et.al.	2511.23071	null
2025-11-28	Evaluating the Clinical Impact of Generative Inpainting on Bone Age Estimation	Felipe Akio Matsuoka et.al.	2511.23066	null
2025-11-28	A cascade model for the defect-driven etching of porous GaN distributed Bragg reflectors	Ben Thornley et.al.	2511.23065	null
2025-11-28	Gate-tunable spin-resolved subbands in multilayer WSe2 probed by quantum point contact spectroscopy	Min-Gue Kim et.al.	2511.23063	null
2025-11-28	The impact of anticonformity on the diffusion of innovation – insights from the q-voter model	Angelika Abramiuk-Szurlej et.al.	2511.23061	null
2025-11-28	Infinite-dimensional nonlinear stationary Fokker-Planck-Kolmogorov equations	Vladimir I. Bogachev et.al.	2511.23058	null
2025-11-28	GOATex: Geometry & Occlusion-Aware Texturing	Hyunjin Kim et.al.	2511.23051	null
2025-11-28	Nonequilibrium dynamics of magnetic hopfions driven by spin-orbit torque	Shoya Kasai et.al.	2511.23045	null
2025-11-28	Time Extrapolation with Graph Convolutional Autoencoder and Tensor Train Decomposition	Yuanhong Chen et.al.	2511.23037	null
2025-11-28	LatBot: Distilling Universal Latent Actions for Vision-Language-Action Models	Zuolei Li et.al.	2511.23034	null
2025-11-28	Geodiffussr: Generative Terrain Texturing with Elevation Fidelity	Tai Inui et.al.	2511.23029	null
2025-11-28	A limsup fast dynamo on $\mathbb{T}^3$	Massimo Sorella et.al.	2511.23024	null
2025-11-28	Control Barrier Function for Unknown Systems: An Approximation-free Approach	Shubham Sawarkar et.al.	2511.23022	null
2025-11-28	Masked Diffusion for Generative Recommendation	Kulin Shah et.al.	2511.23021	null
2025-11-28	JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization	Yunlong Lin et.al.	2511.23002	null
2025-11-28	Observing the spatial and temporal evolution of exciton wave functions in organic semiconductors	Marcel Theilen et.al.	2511.23001	null
2025-11-28	Guiding Visual Autoregressive Models through Spectrum Weakening	Chaoyang Wang et.al.	2511.22991	null
2025-11-28	MIMM-X: Disentangling Spurious Correlations for Medical Image Analysis	Louisa Fay et.al.	2511.22990	null
2025-11-28	MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation	Yuta Oshima et.al.	2511.22989	null
2025-11-28	The Battle of the Water Futures	Dennis Zanutto et.al.	2511.22986	null
2025-11-28	Secret Entanglement, Public Geometry. Quantum Cryptography from a Geometric Perspective	Loris Di Cairano et.al.	2511.22984	null
2025-11-28	Ovis-Image Technical Report	Guo-Hua Wang et.al.	2511.22982	null
2025-11-28	McSc: Motion-Corrective Preference Alignment for Video Generation with Self-Critic Hierarchical Reasoning	Qiushi Yang et.al.	2511.22974	null
2025-11-28	BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation	Zeyu Zhang et.al.	2511.22973	null
2025-11-28	Technical Report: Towards Unified Diffusion Models for Multi-Model Climate Emulation at Scale	Francesco Immorlano et.al.	2511.22970	null
2025-11-28	Commanding Humanoid by Free-form Language: A Large Language Action Model with Unified Motion Vocabulary	Zhirui Liu et.al.	2511.22963	null
2025-11-28	HMR3D: Hierarchical Multimodal Representation for 3D Scene Understanding with Large Vision-Language Model	Chen Li et.al.	2511.22961	null
2025-11-28	A Trainable Centrality Framework for Modern Data	Minh Duc Vu et.al.	2511.22959	null
2025-11-28	Contrastive Heliophysical Image Pretraining for Solar Dynamics Observatory Records	Shiyu Shen et.al.	2511.22958	null
2025-11-28	Extended Serial Safety Net: A Refined Serializability Criterion for Multiversion Concurrency Control	Atsushi Kitazawa et.al.	2511.22956	null
2025-11-28	MDcraft – a modern molecular dynamics simulation package with machine learning potentials support	I. S. Galtsov et.al.	2511.22951	null
2025-11-28	RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video	Haiyang Mei et.al.	2511.22950	null
2025-11-28	Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation	Taeyeong Kim et.al.	2511.22948	null
2025-11-28	Visual Puns from Idioms: An Iterative LLM-T2IM-MLLM Framework	Kelaiti Xiao et.al.	2511.22943	null
2025-11-28	One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfe	Shijun Shi et.al.	2511.22940	null
2025-11-28	DenoiseGS: Gaussian Reconstruction Model for Burst Denoising	Yongsen Cheng et.al.	2511.22939	null
2025-11-28	Robust Image Self-Recovery against Tampering using Watermark Generation with Pixel Shuffling	Minyoung Kim et.al.	2511.22936	null
2025-11-28	RAG-Empowered LLM-Driven Dynamic Radio Resource Management in Open 6G RAN	Onur Salan et.al.	2511.22933	null
2025-11-28	Fisher-KPP waves and the minimal speed on hexagonal lattice	Jian Fang et.al.	2511.22932	null
2025-11-28	Visual Orientalism in the AI Era: From West-East Binaries to English-Language Centrism	Zhilong Zhao et.al.	2511.22931	null
2025-11-28	Evidence for unexpectedly low quasiparticle generation rates across Josephson junctions of driven superconducting qubits	Byoung-moo Ann et.al.	2511.22930	null
2025-11-28	Artwork Interpretation with Vision Language Models: A Case Study on Emotions and Emotion Symbols	Sebastian Padó et.al.	2511.22929	null
2025-11-28	Generation of Ultra-Broadband Frequency Comb in Strongly Bistable Nonlinear Magnonic Resonator	Yu Jiang et.al.	2511.22915	null
2025-11-28	See, Rank, and Filter: Important Word-Aware Clip Filtering via Scene Understanding for Moment Retrieval and Highlight Detection	YuEun Lee et.al.	2511.22906	null
2025-11-28	Leveraging Textual Compositional Reasoning for Robust Change Captioning	Kyu Ri Park et.al.	2511.22903	null
2025-11-26	Canvas-to-Image: Compositional Image Generation with Multimodal Controls	Yusuf Dalva et.al.	2511.21691	null
2025-11-26	TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos	Seungjae Lee et.al.	2511.21690	null
2025-11-26	Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework	Dong Wang et.al.	2511.21686	null
2025-11-26	DSD: A Distributed Speculative Decoding Solution for Edge-Cloud Agile Large Model Serving	Fengze Yu et.al.	2511.21669	null
2025-11-26	Uncertainty Quantification for Visual Object Pose Estimation	Lorenzo Shaikewitz et.al.	2511.21666	null
2025-11-26	Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models	Naifu Zhang et.al.	2511.21663	null
2025-11-26	Model-free practical PI-Lead control design by ultimate sensitivity principle	Michael Ruderman et.al.	2511.21641	null
2025-11-26	Mechanisms of Non-Monotonic Scaling in Vision Transformers	Anantha Padmanaban Krishna Kumar et.al.	2511.21635	null
2025-11-26	Bang-Bang Evasion: Its Stochastic Optimality and a Terminal-Set-Based Implementation	Liraz Mudrik et.al.	2511.21633	null
2025-11-26	Qwen3-VL Technical Report	Shuai Bai et.al.	2511.21631	null
2025-11-26	Two behavioural pseudometrics for continuous-time Markov processes	Linan Chen et.al.	2511.21621	null
2025-11-26	A Super-Eddington, Lensing-Magnified Quasar at $z=5.07$ observed with JWST	Katherine Panebianco et.al.	2511.21618	null
2025-11-26	ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images	M. Naseer Subhani et.al.	2511.21606	null
2025-11-26	TAB-DRW: A DFT-based Robust Watermark for Generative Tabular Data	Yizhou Zhao et.al.	2511.21600	null
2025-11-26	MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training	Haotian Xue et.al.	2511.21592	null
2025-11-26	Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving	Haohong Lin et.al.	2511.21584	null
2025-11-26	Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy	Teng Hu et.al.	2511.21579	null
2025-11-26	A Generalized Control Function Approach to Production Function Estimation	Ulrich Doraszelski et.al.	2511.21578	null
2025-11-26	HarmonicAttack: An Adaptive Cross-Domain Audio Watermark Removal	Kexin Li et.al.	2511.21577	null
2025-11-26	VacuumVLA: Boosting VLA Capabilities via a Unified Suction and Gripping Tool for Complex Robotic Manipulation	Hui Zhou et.al.	2511.21557	null
2025-11-26	Formation of Light-Emitting Defects in Ag-based Memristors	Diana Singh et.al.	2511.21555	null
2025-11-26	Metastability of diffusion processes in narrow tubes	Wen-Tai Hsu et.al.	2511.21548	null
2025-11-26	Seeing Twice: How Side-by-Side T2I Comparison Changes Auditing Strategies	Matheus Kunzler Maldaner et.al.	2511.21547	null
2025-11-26	$\mathcal{E}_0$ : Enhancing Generalization and Fine-Grained Control in VLA Models via Continuized Discrete Diffusion	Zhihao Zhan et.al.	2511.21542	null
2025-11-26	Video Generation Models Are Good Latent Reward Models	Xiaoyue Mi et.al.	2511.21541	null
2025-11-26	Context-Specific Causal Graph Discovery with Unobserved Contexts: Non-Stationarity, Regimes and Spatio-Temporal Patterns	Martin Rabel et.al.	2511.21537	null
2025-11-26	The Age-specific Alzheimer ‘s Disease Prediction with Characteristic Constraints in Nonuniform Time Span	Xin Hong et.al.	2511.21530	null
2025-11-26	Singular extremals of optimal control problems with $L^1$ cost	Andrei Agrachev et.al.	2511.21527	null
2025-11-26	Unified interface dipole theory for Fermi level pinning effect at metal-semiconductor contacts	Ziying Xiang et.al.	2511.21494	null
2025-11-26	MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices	Shuai Zhang et.al.	2511.21475	null
2025-11-26	A Hamilton-Jacobi Framework in a Field-Road System with Unidirectional Advection under Wentzell-Type Boundary Condition	Xinye Xiao et.al.	2511.21469	null
2025-11-26	Hierarchical Besov-Laplace priors for spatially inhomogeneous binary classification	Patric Dolmeta et.al.	2511.21441	null
2025-11-26	Conversational no-code and multi-agentic disease module identification and drug repurposing prediction with ChatDRex	Simon Süwer et.al.	2511.21438	null
2025-11-26	SAM Guided Semantic and Motion Changed Region Mining for Remote Sensing Change Captioning	Futian Wang et.al.	2511.21420	null
2025-11-26	Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning	Kaifeng Hong et.al.	2511.21416	null
2025-11-26	DiverseVAR: Balancing Diversity and Quality of Next-Scale Visual Autoregressive Models	Mingue Park et.al.	2511.21415	null
2025-11-26	Lattice-Distortion-Mediated Proton Pairing and Trapping in Solid State Oxides	Hang Ma et.al.	2511.21410	null
2025-11-26	Controlled nucleation in methylamine-treated perovskite films by artificial seeding and phase-field simulations	Emilia R. Schütz et.al.	2511.21407	null
2025-11-26	Decentralized Shepherding of Non-Cohesive Swarms Through Cluttered Environments via Deep Reinforcement Learning	Cristiana Punzo et.al.	2511.21405	null
2025-11-26	Revealing Fast Ionic Conduction in Solid Electrolytes through Machine Learning Accelerated Raman Calculations	Manuel Grumet et.al.	2511.21404	null
2025-11-26	Monet: Reasoning in Latent Visual Space Beyond Images and Language	Qixun Wang et.al.	2511.21395	null
2025-11-26	Stationary equation of the relativistic heat diffusion in transparent media having $L^1$ –data	Francesco Balducci et.al.	2511.21390	null
2025-11-26	Thinking With Bounding Boxes: Enhancing Spatio-Temporal Video Grounding via Reinforcement Fine-Tuning	Xin Gu et.al.	2511.21375	null
2025-11-26	Differentiable Physics-Neural Models enable Learning of Non-Markovian Closures for Accelerated Coarse-Grained Physics Simulations	Tingkai Xue et.al.	2511.21369	null
2025-11-26	BanglaMM-Disaster: A Multimodal Transformer-Based Deep Learning Framework for Multiclass Disaster Classification in Bangla	Ariful Islam et.al.	2511.21364	null
2025-11-26	Diffusion-controlled reaction rate to an active site in a spherical cavity: Extension of Berg’s theory	Sergey D. Traytak et.al.	2511.21357	null
2025-11-26	Generating Separated Singing Vocals Using a Diffusion Model Conditioned on Music Mixtures	Genís Plaja-Roglans et.al.	2511.21342	null
2025-11-26	Masks Can Be Distracting: On Context Comprehension in Diffusion Language Models	Julianna Piskorz et.al.	2511.21338	null
2025-11-26	TSGM: Regular and Irregular Time-series Generation using Score-based Generative Models	Haksoo Lim et.al.	2511.21335	null
2025-11-26	Sawtooth Sampling for Time Series Denoising Diffusion Implicit Models	Heiko Oppel et.al.	2511.21320	null
2025-11-26	CaliTex: Geometry-Calibrated Attention for View-Coherent 3D Texture Generation	Chenyu Liu et.al.	2511.21309	null
2025-11-26	Co-Training Vision Language Models for Remote Sensing Multi-task Learning	Qingyun Li et.al.	2511.21272	null
2025-11-26	Unlocking Zero-shot Potential of Semi-dense Image Matching via Gaussian Splatting	Juncheng Chen et.al.	2511.21265	null
2025-11-26	Sampling-Based Optimization with Parallelized Physics Simulator for Bimanual Manipulation	Iryna Hurova et.al.	2511.21264	null
2025-11-26	Bifurcation Logic: Separation Through Ordering	Didier Galmiche et.al.	2511.21263	null
2025-11-26	LaGen: Towards Autoregressive LiDAR Scene Generation	Sizhuo Zhou et.al.	2511.21256	null
2025-11-26	AVFakeBench: A Comprehensive Audio-Video Forgery Detection Benchmark for AV-LMMs	Shuhan Xia et.al.	2511.21251	null
2025-11-26	Stability of data-driven Koopman MPC with terminal conditions	Irene Schimperna et.al.	2511.21248	null
2025-11-26	Curvature-driven pattern formation in biomembranes: A gradient flow approach	Patrik Knopf et.al.	2511.21230	null
2025-11-26	Conditional Generative Modeling of Stochastic LTI Systems: A Behavioral Approach	Jiayun Li et.al.	2511.21219	null
2025-11-26	AuthenLoRA: Entangling Stylization with Imperceptible Watermarks for Copyright-Secure LoRA Adapters	Fangming Shi et.al.	2511.21216	null
2025-11-26	From Diffusion to One-Step Generation: A Comparative Study of Flow-Based Models with Application to Image Inpainting	Umang Agarwal et.al.	2511.21215	null
2025-11-26	The effect of tip-speed ratio and free-stream turbulence on the coupled wind turbine blade/wake dynamics	Francisco J. G. de Oliveira et.al.	2511.21206	null
2025-11-26	Transformer Driven Visual Servoing and Dual Arm Impedance Control for Fabric Texture Matching	Fuyuki Tokuda et.al.	2511.21203	null
2025-11-26	Towards an Effective Action-Region Tracking Framework for Fine-grained Video Action Recognition	Baoli Sun et.al.	2511.21202	null
2025-11-26	Optimal preconditioning techniques for finite volume approximation of three-dimensional conservative space-fractional diffusion equations	Wei Qu et.al.	2511.21198	null
2025-11-26	BotaCLIP: Contrastive Learning for Botany-Aware Representation of Earth Observation Data	Selene Cerna et.al.	2511.21194	null
2025-11-26	When Robots Obey the Patch: Universal Transferable Patch Attacks on Vision-Language-Action Models	Hui Lu et.al.	2511.21192	null
2025-11-26	Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation	Joonhyung Park et.al.	2511.21185	null
2025-11-26	Observational appearance and photon rings of non-singular black holes from anisotropic fluids	David Díaz-Guerra et.al.	2511.21183	null
2025-11-26	CAHS-Attack: CLIP-Aware Heuristic Search Attack Method for Stable Diffusion	Shuhan Xia et.al.	2511.21180	null
2025-11-26	MarketGen: A Scalable Simulation Platform with Auto-Generated Embodied Supermarket Environments	Xu Hu et.al.	2511.21161	null
2025-11-26	Maglev-Pentabot: Magnetic Levitation System for Non-Contact Manipulation using Deep Reinforcement Learning	Guoming Huang et.al.	2511.21149	null
2025-11-26	AV-Edit: Multimodal Generative Sound Effect Editing via Audio-Visual Semantic Joint Control	Xinyue Guo et.al.	2511.21146	null
2025-11-26	TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models	Jiaming He et.al.	2511.21145	null
2025-11-26	Referring Video Object Segmentation with Cross-Modality Proxy Queries	Baoli Sun et.al.	2511.21139	null
2025-11-26	All-Optical Varifocal Switching in a Polarization-Insensitive Si–GST Metalens	Dipika Rani Nath et.al.	2511.21138	null
2025-11-26	Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning	Changlin Li et.al.	2511.21136	null
2025-11-26	SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation	Ziyi Chen et.al.	2511.21135	null
2025-11-26	CtrlVDiff: Controllable Video Generation via Unified Multimodal Video Diffusion	Dianbing Xi et.al.	2511.21129	null
2025-11-26	Which Layer Causes Distribution Deviation? Entropy-Guided Adaptive Pruning for Diffusion and Flow Models	Changlin Li et.al.	2511.21122	null
2025-11-26	Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval	Anup Roy et.al.	2511.21121	null
2025-11-26	Deformation-aware Temporal Generation for Early Prediction of Alzheimers Disease	Xin Honga et.al.	2511.21114	null
2025-11-26	FaithFusion: Harmonizing Reconstruction and Generation via Pixel-wise Information Gain	YuAn Wang et.al.	2511.21113	null
2025-11-26	Half-Vortex Polariton Condensate in a Topological BIC Metasurface	Andrea Zacheo et.al.	2511.21111	null
2025-11-26	From Bits to Rounds: Parallel Decoding with Exploration for Diffusion Language Models	Hengyu Fu et.al.	2511.21103	null
2025-11-26	CLRecogEye : Curriculum Learning towards exploiting convolution features for Dynamic Iris Recognition	Geetanjali Sharma et.al.	2511.21097	null
2025-11-26	MNM : Multi-level Neuroimaging Meta-analysis with Hyperbolic Brain-Text Representations	Seunghun Baek et.al.	2511.21092	null
2025-11-26	MIRA: Multimodal Iterative Reasoning Agent for Image Editing	Ziyun Zeng et.al.	2511.21087	null
2025-11-26	Orthographic Constraint Satisfaction and Human Difficulty Alignment in Large Language Models	Bryan E. Tuck et.al.	2511.21086	null
2025-11-26	A universal framework for nonlinear frequency combs under electro-optic modulation	Yanyun Xue et.al.	2511.21059	null
2025-11-26	Long-Term Alzheimers Disease Prediction: A Novel Image Generation Method Using Temporal Parameter Estimation with Normal Inverse Gamma Distribution on Uneven Time Series	Xin Hong et.al.	2511.21057	null
2025-11-26	Efficient Diffusion Planning with Temporal Diffusion	Jiaming Guo et.al.	2511.21054	null
2025-11-26	Multi-path vector entanglement engineering via dark mode control in optomechanics	P. Djorwé et.al.	2511.21052	null
2025-11-26	MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization	Yingjie Xia et.al.	2511.21051	null
2025-11-26	CartoonSing: Unifying Human and Nonhuman Timbres in Singing Generation	Jionghao Han et.al.	2511.21045	null
2025-11-26	PG-ControlNet: A Physics-Guided ControlNet for Generative Spatially Varying Image Deblurring	Hakki Motorcu et.al.	2511.21043	null
2025-11-26	LungNoduleAgent: A Collaborative Multi-Agent System for Precision Diagnosis of Lung Nodules	Cheng Yang et.al.	2511.21042	null
2025-11-26	LOOM: Personalized Learning Informed by Daily LLM Conversations Toward Long-Term Mastery via a Dynamic Learner Memory Graph	Justin Cui et.al.	2511.21037	null
2025-11-26	Deep Parameter Interpolation for Scalar Conditioning	Chicago Y. Park et.al.	2511.21028	null
2025-11-25	RubricRL: Simple Generalizable Rewards for Text-to-Image Generation	Xuelu Feng et.al.	2511.20651	null
2025-11-25	MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities	Tooba Tehreem Sheikh et.al.	2511.20650	null
2025-11-25	Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout	Hidir Yesiltepe et.al.	2511.20649	null
2025-11-25	LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight	Yunze Man et.al.	2511.20648	null
2025-11-25	Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization	Tahira Kazimi et.al.	2511.20647	null
2025-11-25	PixelDiT: Pixel Diffusion Transformers for Image Generation	Yongsheng Yu et.al.	2511.20645	null
2025-11-25	Concept-Aware Batch Sampling Improves Language-Image Pretraining	Adhiraj Ghosh et.al.	2511.20643	null
2025-11-25	Unleashing the Power of Vision-Language Models for Long-Tailed Multi-Label Visual Recognition	Wei Tang et.al.	2511.20641	null
2025-11-25	MotionV2V: Editing Motion in a Video	Ryan Burgert et.al.	2511.20640	null
2025-11-25	Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model	Ziyue Wang et.al.	2511.20636	null
2025-11-25	iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation	Zhoujie Fu et.al.	2511.20635	null
2025-11-25	MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models	Chieh-Yun Chen et.al.	2511.20629	null
2025-11-25	ShapeGen: Towards High-Quality 3D Shape Synthesis	Yangguang Li et.al.	2511.20624	null
2025-11-25	The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment	Ziheng Ouyang et.al.	2511.20614	null
2025-11-25	Adaptive Hopfield Network: Rethinking Similarities in Associative Memory	Shurong Wang et.al.	2511.20609	null
2025-11-25	Latent Diffusion Inversion Requires Understanding the Latent Space	Mingxing Rao et.al.	2511.20592	null
2025-11-25	Morse index stability for p-Yang-Mills connections	Mario Gauvrit et.al.	2511.20588	null
2025-11-25	Anatomica: Localized Control over Geometric and Topological Properties for Anatomical Diffusion Models	Karim Kadry et.al.	2511.20587	null
2025-11-25	A User-customized and Untethered Electro-haptic Device for Immersive Human-Machine Interaction	Ziang Cui et.al.	2511.20578	null
2025-11-25	VQ-VA World: Towards High-Quality Visual Question-Visual Answering	Chenhui Gou et.al.	2511.20573	null
2025-11-25	E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems	Rui Xue et.al.	2511.20564	null
2025-11-25	A Reason-then-Describe Instruction Interpreter for Controllable Video Generation	Shengqiong Wu et.al.	2511.20563	null
2025-11-25	PhysChoreo: Physics-Controllable Video Generation with Part-Aware Semantic Grounding	Haoze Zhang et.al.	2511.20562	null
2025-11-25	Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward	Yuwei Niu et.al.	2511.20561	null
2025-11-25	Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning	Guanjie Chen et.al.	2511.20549	null
2025-11-25	Modelling the Spread of Toxicity and Exploring its Mitigation on Online Social Networks	Aatman Vaidya et.al.	2511.20546	null
2025-11-25	Tuning entanglement phases and topological memory in the measurement-only Kitaev model with single and multi-qubit checks	Tushya Kalpada et.al.	2511.20545	null
2025-11-25	Automated Monitoring of Cultural Heritage Artifacts Using Semantic Segmentation	Andrea Ranieri et.al.	2511.20541	null
2025-11-25	Extreme Ultraviolet Spectroscopy of Highly Charged Lu and Yb Ions for Nuclear Charge Radius Determination	Hunter Staiger et.al.	2511.20537	null
2025-11-25	MIMIC-MJX: Neuromechanical Emulation of Animal Behavior	Charles Y. Zhang et.al.	2511.20532	null
2025-11-25	Beyond Generation: Multi-Hop Reasoning for Factual Accuracy in Vision-Language Models	Shamima Hossain et.al.	2511.20531	null
2025-11-25	Assessing LLMs’ Performance: Insights from the Chinese Pharmacist Exam	Xinran Wang et.al.	2511.20526	null
2025-11-25	Mistake Attribution: Fine-Grained Mistake Understanding in Egocentric Videos	Yayuan Li et.al.	2511.20525	null
2025-11-25	HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation	Xiang Wang et.al.	2511.20520	null
2025-11-25	AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs	Kuniaki Saito et.al.	2511.20515	null
2025-11-25	Real-time 3D Ultrasonic Needle Tracking with a Photoacoustic Beacon	Christian Baker et.al.	2511.20514	null
2025-11-25	DesignPref: Capturing Personal Preferences in Visual Design Generation	Yi-Hao Peng et.al.	2511.20513	null
2025-11-25	Ultralow noise microwaves with free-running frequency combs and electrical feedforward	Takuma Nakamura et.al.	2511.20504	null
2025-11-25	Adversarial Confusion Attack: Disrupting Multimodal Large Language Models	Jakub Hoscilowicz et.al.	2511.20494	null
2025-11-25	Wide Area Surface Dosimetry with Conformal Scintillator Array for External Beam Radiotherapy	Roman Vasyltsiv et.al.	2511.20472	null
2025-11-25	Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model	Genís Plaja-Roglans et.al.	2511.20470	null
2025-11-25	STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow	Jiatao Gu et.al.	2511.20462	null
2025-11-25	Look Where It Matters: Training-Free Ultra-HR Remote Sensing VQA via Adaptive Zoom Search	Yunqi Zhou et.al.	2511.20460	null
2025-11-25	A Self-Consistent Model of the Ultra High-Energy Gamma-Ray Emission of Pulsar Wind Nebulae: Insights from LHAASO and ATNF Catalogs	Samy Kaci et.al.	2511.20452	null
2025-11-25	Learning to Generate Human-Human-Object Interactions from Textual Descriptions	Jeonghyeon Na et.al.	2511.20446	null
2025-11-25	Diffusion for Fusion: Designing Stellarators with Generative AI	Misha Padidar et.al.	2511.20445	null
2025-11-25	Jet reorientation revealed by intermittent jet activity in radio galaxy 0954+556	Ai-Ling Zeng et.al.	2511.20434	null
2025-11-25	BRIC: Bridging Kinematic Plans and Physical Control at Test Time	Dohun Lim et.al.	2511.20431	null
2025-11-25	Block Cascading: Training Free Acceleration of Block-Causal Video Models	Hmrishav Bandyopadhyay et.al.	2511.20426	null
2025-11-25	Planar Josephson junctions for sensors and electronics:Different geometry, new functionality	Vladimir M. Krasnov et.al.	2511.20424	null
2025-11-25	VibraVerse: A Large-Scale Geometry-Acoustics Alignment Dataset for Physically-Consistent Multimodal Learning	Bo Pang et.al.	2511.20422	null
2025-11-25	Nonuniform-Grid Markov Chain Approximation of Continuous Processes with Time-Linear Moments	Do Hyun Kim et.al.	2511.20416	null
2025-11-25	MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts	Zilong Huang et.al.	2511.20415	null
2025-11-25	Self-Identifying Internal Model-Based Online Optimization	Wouter J. A. van Weerelt et.al.	2511.20411	null
2025-11-25	Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs	Bao Tang et.al.	2511.20410	null
2025-11-25	A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control	Jiawei Lin et.al.	2511.20401	null
2025-11-25	Real-Space Imaging of Moiré-Confined Excitons in Twisted Bilayer MoS $_2$	Laurens J. M. Westenberg et.al.	2511.20398	null
2025-11-25	Mechano-chemical modeling of glia initiated secondary injury of neurons under mechanical load	Debabrata Auddya et.al.	2511.20392	null
2025-11-25	FREE: Uncertainty-Aware Autoregression for Parallel Diffusion Transformers	Xinwan Wen et.al.	2511.20390	null
2025-11-25	Unusual sign-changing Faraday effect in nanometer-thick magnetic films	A. V. Belkova et.al.	2511.20373	null
2025-11-25	Nonlinearly preconditioned gradient flows	Konstantinos Oikonomidis et.al.	2511.20370	null
2025-11-25	VGGTFace: Topologically Consistent Facial Geometry Reconstruction in the Wild	Xin Ming et.al.	2511.20366	null
2025-11-25	From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations	Zhiqing Guo et.al.	2511.20359	null
2025-11-25	Influence of temperature, initial grain-boundary bubble density and grain structure on fission gas behaviour in UO $_2$ : a 3D hybrid multiscale study	Sourav Chatterjee et.al.	2511.20352	null
2025-11-25	ShelfRectNet: Single View Shelf Image Rectification with Homography Estimation	Onur Berk Tore et.al.	2511.20335	null
2025-11-25	A note on ideals in derived geometries	Zachary Gardner et.al.	2511.20331	null
2025-11-25	ArtiBench and ArtiBrain: Benchmarking Generalizable Vision-Language Articulated Object Manipulation	Yuhan Wu et.al.	2511.20330	null
2025-11-25	Modified Equations for Stochastic Optimization	Stefan Perko et.al.	2511.20322	null
2025-11-25	IrisNet: Infrared Image Status Awareness Meta Decoder for Infrared Small Targets Detection	Xuelin Qian et.al.	2511.20319	null
2025-11-25	TReFT: Taming Rectified Flow Models For One-Step Image Translation	Shengqian Li et.al.	2511.20307	null
2025-11-25	TaCo: Capturing Spatio-Temporal Semantic Consistency in Remote Sensing Change Detection	Han Guo et.al.	2511.20306	null
2025-11-25	Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations	Chao Wang et.al.	2511.20295	null
2025-11-25	Stochastic Dynamics of Skyrmions on a Racetrack: Impact of Equilibrium and Nonequilibrium Noise	Anton V. Hlushchenko et.al.	2511.20287	null
2025-11-25	Can LLMs Make (Personalized) Access Control Decisions?	Friederike Groschupp et.al.	2511.20284	null
2025-11-25	Bootstrapping Physics-Grounded Video Generation through VLM-Guided Iterative Self-Refinement	Yang Liu et.al.	2511.20280	null
2025-11-25	HVAdam: A Full-Dimension Adaptive Optimizer	Yiheng Zhang et.al.	2511.20277	null
2025-11-25	HAFO: Humanoid Force-Adaptive Control for Intense External Force Interaction Environments	Chenhui Dong et.al.	2511.20275	null
2025-11-25	ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis	Advik Sinha et.al.	2511.20274	null
2025-11-25	Magnetic Order Unlocks Optical Access to Dark Excitons in CrSBr	Sophie Bork et.al.	2511.20268	null
2025-11-25	Advancing Image Classification with Discrete Diffusion Classification Modeling	Omer Belhasin et.al.	2511.20263	null
2025-11-25	The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation	Weijia Mao et.al.	2511.20256	null
2025-11-25	Escaping AB caging via Floquet engineering: photo-induced long-range interference in an all-band-flat model	Aamna Ahmed et.al.	2511.20255	null
2025-11-25	Zoo3D: Zero-Shot 3D Object Detection at Scene Level	Andrey Lemeshko et.al.	2511.20253	null
2025-11-25	PromptMoG: Enhancing Diversity in Long-Prompt Image Generation via Prompt Embedding Mixture-of-Gaussian Sampling	Bo-Kai Ruan et.al.	2511.20251	null
2025-11-25	Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation	Daniel Kienzle et.al.	2511.20250	null
2025-11-25	Improving the Identification of Real-world Malware’s DNS Covert Channels Using Locality Sensitive Hashing	Pascal Ruffing et.al.	2511.20229	null
2025-11-25	Numerical Simulation of the Cleaning Process of Microchannel by an External Flow	Boris S. Maryshev et.al.	2511.20228	null
2025-11-25	Toward generic control for soft robotic systems	Yu Sun et.al.	2511.20226	null
2025-11-25	DUO-TOK: Dual-Track Semantic Music Tokenizer for Vocal-Accompaniment Generation	Rui Lin et.al.	2511.20224	null
2025-11-25	V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs	Sen Nie et.al.	2511.20223	null
2025-11-25	Text-guided Controllable Diffusion for Realistic Camouflage Images Generation	Yuhang Qian et.al.	2511.20218	null
2025-11-25	Implications of the Four-Color Theorem on the Dynamics of N-Component Phase Separation	Michael Rennick et.al.	2511.20215	null
2025-11-25	OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation	Hao Yu et.al.	2511.20211	null
2025-11-25	Towards Benign Memory Forgetting for Selective Multimodal Large Language Model Unlearning	Zhen Zeng et.al.	2511.20196	null
2025-11-25	SFA: Scan, Focus, and Amplify toward Guidance-aware Answering for Video TextVQA	Haibin He et.al.	2511.20190	null
2025-11-25	Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis	Mohammad Mahdi et.al.	2511.20186	null
2025-11-25	Bipotentiostatic Control Unlocks Flashing Ratchet Features in Ion Pumps	Eden Grossman et.al.	2511.20174	null
2025-11-25	ADNet: A Large-Scale and Extensible Multi-Domain Benchmark for Anomaly Detection Across 380 Real-World Categories	Hai Ling et.al.	2511.20169	null
2025-11-25	Noninvasive rheological inference from stable flows in confined tissues	Marc Karnat et.al.	2511.20155	null
2025-11-25	Restora-Flow: Mask-Guided Image Restoration with Flow Matching	Arnela Hadzic et.al.	2511.20152	null
2025-11-24	LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context	Jingzhi Bao et.al.	2511.19437	null
2025-11-24	VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection	Qiang Wang et.al.	2511.19436	null
2025-11-24	Are Image-to-Video Models Good Zero-Shot Image Editors?	Zechuan Zhang et.al.	2511.19435	null
2025-11-24	Breaking the Likelihood-Quality Trade-off in Diffusion Models by Merging Pretrained Experts	Yasin Esfandiari et.al.	2511.19434	null
2025-11-24	Ref-SAM3D: Bridging SAM3D with Text for Reference 3D Reconstruction	Yun Zhou et.al.	2511.19426	null
2025-11-24	SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation	Tianrun Chen et.al.	2511.19425	null
2025-11-24	On the Fujita Phenomenon for a Forced Spatio-Temporal Fractional Diffusion Equation	Rihab Ben Belgacem et.al.	2511.19424	null
2025-11-24	In-Video Instructions: Visual Signals as Generative Control	Gongfan Fang et.al.	2511.19401	null
2025-11-24	Wigner and Gabor phase-space analysis of propagators for evolution equations	Elena Cordero et.al.	2511.19400	null
2025-11-24	BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation	Rachit Saluja et.al.	2511.19394	null
2025-11-24	Predicting partially observable dynamical systems via diffusion models with a multiscale inference scheme	Rudy Morel et.al.	2511.19390	null
2025-11-24	Efficiency vs. Fidelity: A Comparative Analysis of Diffusion Probabilistic Models and Flow Matching on Low-Resource Hardware	Srishti Gupta et.al.	2511.19379	null
2025-11-24	DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation	Zehong Ma et.al.	2511.19365	null
2025-11-24	Numerical solution of the nonlinear boson diffusion equation for gluons	J. Rössler et.al.	2511.19363	null
2025-11-24	Growing with the Generator: Self-paced GRPO for Video Generation	Rui Li et.al.	2511.19356	null
2025-11-24	Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning	Qihan Huang et.al.	2511.19343	null
2025-11-24	Explicit Tonal Tension Conditioning via Dual-Level Beam Search for Symbolic Music Generation	Maral Ebrahimzadeh et.al.	2511.19342	null
2025-11-24	Targeted Manipulation: Slope-Based Attacks on Financial Time-Series Data	Dominik Luszczynski et.al.	2511.19330	null
2025-11-24	SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation	Jiaming Zhang et.al.	2511.19320	null
2025-11-24	SyncMV4D: Synchronized Multi-view Joint Diffusion of Appearance and Motion for Hand-Object Interaction Synthesis	Lingwei Dang et.al.	2511.19319	null
2025-11-24	Evaluating Dataset Watermarking for Fine-tuning Traceability of Customized Diffusion Models: A Comprehensive Benchmark and Removal Approach	Xincheng Wang et.al.	2511.19316	null
2025-11-24	Dual-Granularity Semantic Prompting for Language Guidance Infrared Small Target Detection	Zixuan Wang et.al.	2511.19306	null
2025-11-24	AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning	Jiayi Zhang et.al.	2511.19304	null
2025-11-24	Multiphoton ionization with three-dimensional light fields	Darius Köhnke et.al.	2511.19290	null
2025-11-24	What is the signature of a trion in photoemission?	Jinyuan Wu et.al.	2511.19280	null
2025-11-24	Dynamic Multi-Species Bird Soundscape Generation with Acoustic Patterning and 3D Spatialization	Ellie L. Zhang et.al.	2511.19275	null
2025-11-24	Diffusion Reconstruction-based Data Likelihood Estimation for Core-Set Selection	Mingyang Chen et.al.	2511.19274	null
2025-11-24	CDLM: Consistency Diffusion Language Models For Faster Sampling	Minseo Kim et.al.	2511.19269	null
2025-11-24	BideDPO: Conditional Image Generation with Simultaneous Text and Condition Alignment	Dewei Zhou et.al.	2511.19268	null
2025-11-24	Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry	Amirtha Varshini A S et.al.	2511.19264	null
2025-11-24	LAST: LeArning to Think in Space and Time for Generalist Vision-Language Models	Shuai Wang et.al.	2511.19261	null
2025-11-24	SimDiff: Simpler Yet Better Diffusion Model for Time Series Point Forecasting	Hang Ding et.al.	2511.19256	null
2025-11-24	MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization	Boyuan Wu et.al.	2511.19253	null
2025-11-24	Spin-Flux Skyrmions: Anomalous Electron Dynamics and Spin-Hall Currents	Sandip Bera et.al.	2511.19239	null
2025-11-24	SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control	Yuxuan Wang et.al.	2511.19236	null
2025-11-24	IDSplat: Instance-Decomposed 3D Gaussian Splatting for Driving Scenes	Carl Lindström et.al.	2511.19235	null
2025-11-24	An O-RAN Framework for AI/ML-Based Localization with OpenAirInterface and FlexRIC	Nada Bouknana et.al.	2511.19233	null
2025-11-24	Learning Plug-and-play Memory for Guiding Video Diffusion Models	Selena Song et.al.	2511.19229	null
2025-11-24	Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving	Jianhua Han et.al.	2511.19221	null
2025-11-24	Are Large Vision Language Models Truly Grounded in Medical Images? Evidence from Italian Clinical Visual Question Answering	Federico Felizzi et.al.	2511.19220	null
2025-11-24	ReAlign: Text-to-Motion Generation via Step-Aware Reward-Guided Alignment	Wanjiang Weng et.al.	2511.19217	null
2025-11-24	Inclinations and Position Angles for Disc Galaxies in the SGA sample	Megan H. Martinez et.al.	2511.19207	null
2025-11-24	Spherical Einstein-Friedberg-Lee-Sirlin boson stars: Self-interacting solutions and their astrophysical appearance	Pedro L. Brito de Sá et.al.	2511.19206	null
2025-11-24	Can Modern Vision Models Understand the Difference Between an Object and a Look-alike?	Itay Cohen et.al.	2511.19200	null
2025-11-24	Three-Dimensional Anatomical Data Generation Based on Artificial Neural Networks	Ann-Sophia Müller et.al.	2511.19198	null
2025-11-24	Insights into neutron transport in a new multipurpose nuclear reactor	Luiz Paulo de Oliveira et.al.	2511.19195	null
2025-11-24	AvatarBrush: Monocular Reconstruction of Gaussian Avatars with Intuitive Local Editing	Mengtian Li et.al.	2511.19189	null
2025-11-24	SpectraNet: FFT-assisted Deep Learning Classifier for Deepfake Face Detection	Nithira Jayarathne et.al.	2511.19187	null
2025-11-24	Torsion-Space Diffusion for Protein Backbone Generation with Geometric Refinement	Lakshaditya Singh et.al.	2511.19184	null
2025-11-24	Large Deviation Principle for Neutral Type Mckean-Vlasov Stochastic Differential Equations	Zhaohang Wang et.al.	2511.19181	null
2025-11-24	Test-Time Preference Optimization for Image Restoration	Bingchen Li et.al.	2511.19169	null
2025-11-24	RAVEN++: Pinpointing Fine-Grained Violations in Advertisement Videos with Active Reinforcement Reasoning	Deyi Ji et.al.	2511.19168	null
2025-11-24	Masked Diffusion Models are Secretly Learned-Order Autoregressive Models	Prateek Garg et.al.	2511.19152	null
2025-11-24	Optimal policy design for innovation diffusion: shaping today’s incentives for transforming the future	Lisa Piccinin et.al.	2511.19143	null
2025-11-24	When Semantics Regulate: Rethinking Patch Shuffle and Internal Bias for Generated Image Detection with CLIP	Beilin Chu et.al.	2511.19126	null
2025-11-24	3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion	Minchong Chen et.al.	2511.19117	null
2025-11-24	Physics-informed Neural Operator Learning for Nonlinear Grad-Shafranov Equation	Siqi Ding et.al.	2511.19114	null
2025-11-24	DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection	Hai Ci et.al.	2511.19111	null
2025-11-24	Fate of diffusion under integrability breaking of classical integrable magnets	Jiaozi Wang et.al.	2511.19110	null
2025-11-24	HABIT: Human Action Benchmark for Interactive Traffic in CARLA	Mohan Ramesh et.al.	2511.19109	null
2025-11-24	Phase Diagrams of the YK Surface-Reaction Model on 2D lattices with Exchange Diffusion	Henrique A. Fernandes et.al.	2511.19082	null
2025-11-24	On the Tail Transition of First Arrival Position Channels: From Cauchy to Exponential Decay	Yen-Chi Lee et.al.	2511.19074	null
2025-11-24	Experimental insights into data augmentation techniques for deep learning-based multimode fiber imaging: limitations and success	Jawaria Maqbool et.al.	2511.19072	null
2025-11-24	Granular Computing-driven SAM: From Coarse-to-Fine Guidance for Prompt-Free Segmentation	Qiyang Yu et.al.	2511.19062	null
2025-11-24	LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space	Hai Wu et.al.	2511.19057	null
2025-11-24	Fostering Innovation: Streamlining Magnetocaloric Materials Research by Digitalization	Simon Bekemeier et.al.	2511.19053	null
2025-11-24	Beyond Reward Margin: Rethinking and Resolving Likelihood Displacement in Diffusion Models via Video Generation	Ruojun Xu et.al.	2511.19049	null
2025-11-24	MedSAM3: Delving into Segment Anything with Medical Concepts	Anglin Liu et.al.	2511.19046	null
2025-11-24	Diffusion Model-Enhanced Environment Reconstruction in ISAC	Nguyen Duc Minh Quang et.al.	2511.19044	null
2025-11-24	Deterministic Mean Field Games on Networks and Related Optimal Control Problems	Yves Achdou et.al.	2511.19038	null
2025-11-24	Resolving Node Identifiability in Graph Neural Processes via Laplacian Spectral Encodings	Zimo Yan et.al.	2511.19037	null
2025-11-24	Introducing Visual Scenes and Reasoning: A More Realistic Benchmark for Spoken Language Understanding	Di Wu et.al.	2511.19005	null
2025-11-24	A Self-Conditioned Representation Guided Diffusion Model for Realistic Text-to-LiDAR Scene Generation	Wentao Qu et.al.	2511.19004	null
2025-11-24	View-Consistent Diffusion Representations for 3D-Consistent Video Generation	Duolikun Danier et.al.	2511.18991	null
2025-11-24	Rethinking Plant Disease Diagnosis: Bridging the Academic-Practical Gap with Vision Transformers and Zero-Shot Learning	Wassim Benabbas et.al.	2511.18989	null
2025-11-24	Zero-shot segmentation of skin tumors in whole-slide images with vision-language foundation models	Santiago Moreno et.al.	2511.18978	null
2025-11-24	Peregrine: One-Shot Fine-Tuning for FHE Inference of General Deep CNNs	Huaming Ling et.al.	2511.18976	null
2025-11-24	Eevee: Towards Close-up High-resolution Video-based Virtual Try-on	Jianhao Zeng et.al.	2511.18957	null
2025-11-24	Superconducting spintronics with electron symmetry filtering and interfacial spin-orbit coupling	Pablo Tuero et.al.	2511.18951	null
2025-11-24	Leveraging Adversarial Learning for Pathological Fidelity in Virtual Staining	José Teixeira et.al.	2511.18946	null
2025-11-24	VeCoR - Velocity Contrastive Regularization for Flow Matching	Zong-Wei Hong et.al.	2511.18942	null
2025-11-24	Giant Domain Walls and Intrinsic Heterogeneity in 214 Cuprate Superconductors	Mark S. Senn et.al.	2511.18938	null
2025-11-24	FineXtrol: Controllable Motion Generation via Fine-Grained Text	Keming Shen et.al.	2511.18927	null
2025-11-24	One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control	Zhenxing Mi et.al.	2511.18922	null
2025-11-24	BackdoorVLM: A Benchmark for Backdoor Attacks on Vision-Language Models	Juncheng Li et.al.	2511.18921	null
2025-11-24	EventSTU: Event-Guided Efficient Spatio-Temporal Understanding for Video Large Language Models	Wenhao Xu et.al.	2511.18920	null
2025-11-24	Learning What to Trust: Bayesian Prior-Guided Optimization for Visual Generation	Ruiying Liu et.al.	2511.18919	null
2025-11-24	Adaptive Probabilistic Constellation Shaping based on Enumerative Sphere Shaping for FSO Channel with Turbulence and Pointing Errors	Jingtian Liu et.al.	2511.18911	null
2025-11-24	MatMart: Material Reconstruction of 3D Objects via Diffusion	Xiuchao Wu et.al.	2511.18900	null
2025-11-24	MagicWorld: Interactive Geometry-driven Video World Exploration	Guangyuan Li et.al.	2511.18886	null
2025-11-24	H{ö}lder regularity of parabolic equations with Dirichlet boundary conditions and application to reaction-diffusion and reaction-cross-diffusion systems	Hector Bouton et.al.	2511.18872	null
2025-11-24	HunyuanVideo 1.5 Technical Report	Bing Wu et.al.	2511.18870	null
2025-11-24	Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling	Xiao Cui et.al.	2511.18858	null
2025-11-24	UNeMo: Collaborative Visual-Language Reasoning and Navigation via a Multimodal World Model	Changxin Huang et.al.	2511.18845	null
2025-11-24	FlowSteer: Guiding Few-Step Image Synthesis with Authentic Trajectories	Lei Ke et.al.	2511.18834	null
2025-11-24	PrismAudio: Decomposed Chain-of-Thoughts and Multi-dimensional Rewards for Video-to-Audio Generation	Huadai Liu et.al.	2511.18833	null
2025-11-24	Q-Save: Towards Scoring and Attribution for Generated Video Evaluation	Xiele Wu et.al.	2511.18825	null
2025-11-24	VideoPerceiver: Enhancing Fine-Grained Temporal Perception in Video Multimodal Large Language Models	Fufangchen Zhao et.al.	2511.18823	null
2025-11-24	DiP: Taming Diffusion Models in Pixel Space	Zhennan Chen et.al.	2511.18822	null
2025-11-24	Disc3D: Automatic Curation of High-Quality 3D Dialog Data via Discriminative Object Referring	Siyuan Wei et.al.	2511.18817	null
2025-11-21	RynnVLA-002: A Unified Vision-Language-Action and World Model	Jun Cen et.al.	2511.17502	null
2025-11-21	MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments	Zhiyu Huang et.al.	2511.17496	null
2025-11-21	EvDiff: High Quality Video with an Event Camera	Weilun Li et.al.	2511.17492	null
2025-11-21	Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination	Yolo Yunlong Tang et.al.	2511.17490	null
2025-11-21	Radar2Shape: 3D Shape Reconstruction from High-Frequency Radar using Multiresolution Signed Distance Functions	Neel Sortur et.al.	2511.17484	null
2025-11-21	Counterfactual World Models via Digital Twin-conditioned Video Diffusion	Yiqing Shen et.al.	2511.17481	null
2025-11-21	Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift	Björn Michele et.al.	2511.17455	null
2025-11-21	Illustrator’s Depth: Monocular Layer Index Prediction for Image Decomposition	Nissim Maruani et.al.	2511.17454	null
2025-11-21	Planning with Sketch-Guided Verification for Physics-Aware Video Generation	Yidong Huang et.al.	2511.17450	null
2025-11-21	REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing	Binger Chen et.al.	2511.17442	null
2025-11-21	SMILE: A Composite Lexical-Semantic Metric for Question-Answering Evaluation	Shrikant Kendre et.al.	2511.17432	null
2025-11-21	Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers	Christopher Boland et.al.	2511.17421	null
2025-11-21	SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding	Nikolay Nikolov et.al.	2511.17411	null
2025-11-21	Accelerating the CLEAN algorithm of radio interferometry with convex optimization	Hendrik Müller et.al.	2511.17410	null
2025-11-21	A Unified Causal Framework for Nonlinear Electrodynamics Black Hole from Courant-Hilbert Approach: Thermodynamics and Singularity	H. Babaei-Aghbolagh et.al.	2511.17407	null
2025-11-21	Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required?	Sukwon Yun et.al.	2511.17400	null
2025-11-21	Designing and Generating Diverse, Equitable Face Image Datasets for Face Verification Tasks	Georgia Baltsou et.al.	2511.17393	null
2025-11-21	Human Imitated Bipedal Locomotion with Frequency Based Gait Generator Network	Yusuf Baran Ates et.al.	2511.17387	null
2025-11-21	Agility Meets Stability: Versatile Humanoid Control with Heterogeneous Data	Yixuan Pan et.al.	2511.17373	null
2025-11-21	ATAC: Augmentation-Based Test-Time Adversarial Correction for CLIP	Linxiang Su et.al.	2511.17362	null
2025-11-21	Don’t Learn, Ground: A Case for Natural Language Inference with Visual Grounding	Daniil Ignatev et.al.	2511.17358	null
2025-11-21	Optimal Thermalization under Indefinite Causal Order with Identical and Asymmetric Baths	Neeraj Sharma et.al.	2511.17357	null
2025-11-21	Learning Latent Transmission and Glare Maps for Lens Veiling Glare Removal	Xiaolong Qian et.al.	2511.17353	null
2025-11-21	Loomis Painter: Reconstructing the Painting Process	Markus Pobitzer et.al.	2511.17344	null
2025-11-21	Refracting Reality: Generating Images with Realistic Transparent Objects	Yue Yin et.al.	2511.17340	null
2025-11-21	A new kid on the block: Distributional semantics predicts the word-specific tone signatures of monosyllabic words in conversational Taiwan Mandarin	Xiaoyun Jin et.al.	2511.17337	null
2025-11-21	Robot Confirmation Generation and Action Planning Using Long-context Q-Former Integrated with Multimodal LLM	Chiori Hori et.al.	2511.17335	null
2025-11-21	MusicAIR: A Multimodal AI Music Generation Framework Powered by an Algorithm-Driven Core	Callie C. Liao et.al.	2511.17323	null
2025-11-21	FORWARD: Dataset of a forwarder operating in rough terrain	Mikael Lundbäck et.al.	2511.17318	null
2025-11-21	SpatialGeo:Boosting Spatial Reasoning in Multimodal LLMs via Geometry-Semantics Fusion	Jiajie Guo et.al.	2511.17308	null
2025-11-21	Deep Investigation of Neutral Gas Origins (DINGO): Options for robust Deep Spectral Line Imaging in the SKA-Era	Jonghwan Rhee et.al.	2511.17307	null
2025-11-21	Framework Matters: Energy Efficiency of UI Automation Testing Frameworks	Timmie M. R. Lagermann et.al.	2511.17303	null
2025-11-21	Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation	Chuancheng Shi et.al.	2511.17282	null
2025-11-21	Range-Edit: Semantic Mask Guided Outdoor LiDAR Scene Editing	Suchetan G. Uppur et.al.	2511.17269	null
2025-11-21	A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback	Bulat Khaertdinov et.al.	2511.17255	null
2025-11-21	Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats	Jiaye Qian et.al.	2511.17254	null
2025-11-21	FlexiFlow: decomposable flow matching for generation of flexible molecular ensemble	Riccardo Tedoldi et.al.	2511.17249	null
2025-11-21	TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making	Shanshan Li et.al.	2511.17225	null
2025-11-21	Dual-domain Adaptation Networks for Realistic Image Super-resolution	Chaowei Fang et.al.	2511.17217	null
2025-11-21	Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers	Cris Claessens et.al.	2511.17209	null
2025-11-21	VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation	Hanyu Zhou et.al.	2511.17199	null
2025-11-21	Real Noise Decoupling for Hyperspectral Image Denoising	Yingkai Zhang et.al.	2511.17196	null
2025-11-21	Steering in the Shadows: Causal Amplification for Activation Space Attacks in Large Language Models	Zhiyuan Xu et.al.	2511.17194	null
2025-11-21	Ferroelectric Switchable Topological Magnon Hall Effect in Type-I Multiferroics	Quanchao Du et.al.	2511.17189	null
2025-11-21	PostCam: Camera-Controllable Novel-View Video Generation with Query-Shared Cross-Attention	Yipeng Chen et.al.	2511.17185	null
2025-11-21	Investigating self-supervised representations for audio-visual deepfake detection	Dragos-Alexandru Boldisor et.al.	2511.17181	null
2025-11-21	Proposal of an AI-Based Support Assistant for the ALICE-FIT Detector Setup at CERN	Ignacy Mermer et.al.	2511.17154	null
2025-11-21	Enhanced Efficiency of Intermediate-Band Semiconductor Solar Cells Embedded with Quantum Dot Superlattices	Naira Petrosyan et.al.	2511.17151	null
2025-11-21	DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous Driving	Liuhan Yin et.al.	2511.17150	null
2025-11-21	Learning to Look Closer: A New Instance-Wise Loss for Small Cerebral Lesion Segmentation	Luc Bouteille et.al.	2511.17146	null
2025-11-21	One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution	Yushun Fang et.al.	2511.17138	null
2025-11-21	A Multi-Stage Optimization Framework for Deploying Learned Image Compression on FPGAs	Jiaxun Fang et.al.	2511.17135	null
2025-11-21	Four decades of circumpolar super-resolved satellite land surface temperature data	Sonia Dupuis et.al.	2511.17134	null
2025-11-21	Asymptotics of motion planning complexity for control-affine systems	Michele Motta et.al.	2511.17130	null
2025-11-21	Robustness of optimal control for controlled regime-switching diffusions with incorrect models	Somnath Pradhan et.al.	2511.17121	null
2025-11-21	Transport and removal of a passive tracer in porous media employing surface washing	Georgia Ioannou et.al.	2511.17115	null
2025-11-21	Modeling memory in time-respecting paths on temporal networks	Silvia Guerrini et.al.	2511.17108	null
2025-11-21	Power Flow Solution in Unbalanced 3-Wire MV and 4-Wire LV Networks Using Symmetrical and Eigen-basis Coordinates	Abduljalil S. Aljadani et.al.	2511.17104	null
2025-11-21	Bridging Visual Affective Gap: Borrowing Textual Knowledge by Learning from Noisy Image-Text Pairs	Daiqing Wu et.al.	2511.17103	null
2025-11-21	Sparse Reasoning is Enough: Biological-Inspired Framework for Video Anomaly Detection with Large Pre-trained Models	He Huang et.al.	2511.17094	null
2025-11-21	SPAGS: Sparse-View Articulated Object Reconstruction from Single State via Planar Gaussian Splatting	Di Wu et.al.	2511.17092	null
2025-11-21	Spanning Tree Autoregressive Visual Generation	Sangkyu Lee et.al.	2511.17089	null
2025-11-21	H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Refinement for Robotic Manipulation	Yijie Zhu et.al.	2511.17079	null
2025-11-21	Diversity Has Always Been There in Your Visual Autoregressive Models	Tong Wang et.al.	2511.17074	null
2025-11-21	ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion	Junming Liu et.al.	2511.17068	null
2025-11-21	Stability and bifurcation of 2D viscous primitive equations with full diffusion	Song Jiang et.al.	2511.17055	null
2025-11-21	OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding	Teng Fu et.al.	2511.17053	null
2025-11-21	PathAgent: Toward Interpretable Analysis of Whole-slide Pathology Images via Large Language Model-based Agentic Reasoning	Jingyun Chen et.al.	2511.17052	null
2025-11-21	RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation	Wenzhuo Sun et.al.	2511.17048	null
2025-11-21	DAPS++: Rethinking Diffusion Inverse Problems with Decoupled Posterior Annealing	Hao Chen et.al.	2511.17038	null
2025-11-21	Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation	Aniketh Iyengar et.al.	2511.17031	null
2025-11-21	Infinite Horizon Linear Quadratic Mean Field Problems with Common Noise and Regime Switching via Conditional McKean-Vlasov FBSDEs	Qingmeng Wei et.al.	2511.17023	null
2025-11-21	Continuous Resilience in Cyber-Physical Systems of Systems: Extending Architectural Models through Adaptive Coordination and Learning	Elisabeth Vogel et.al.	2511.17017	null
2025-11-21	Generative MIMO Beam Map Construction for Location Recovery and Beam Tracking	Wangqian Chen et.al.	2511.17007	null
2025-11-21	FLUID: Training-Free Face De-identification via Latent Identity Substitution	Jinhyeong Park et.al.	2511.17005	null
2025-11-21	Vision Language Models are Confused Tourists	Patrick Amadeus Irawan et.al.	2511.17004	null
2025-11-21	VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions	Qianyi Shao et.al.	2511.16998	null
2025-11-21	DepthFocus: Controllable Depth Estimation for See-Through Scenes	Junhong Min et.al.	2511.16993	null
2025-11-21	DReX: Pure Vision Fusion of Self-Supervised and Convolutional Representations for Image Complexity Prediction	Jonathan Skaza et.al.	2511.16991	null
2025-11-21	The Wireless Charger as a Gesture Sensor: A Novel Approach to Ubiquitous Interaction	Weiyi Wang et.al.	2511.16989	null
2025-11-21	The Finer the Better: Towards Granular-aware Open-set Domain Generalization	Yunyun Wang et.al.	2511.16979	null
2025-11-21	ToC: Tree-of-Claims Search with Multi-Agent Language Models	Shuyang Yu et.al.	2511.16972	null
2025-11-21	Probing Dark Matter Substructure with Image Number Anomaly in Strong Lensing Systems	Wenlin Hou et.al.	2511.16971	null
2025-11-21	A novel double-rim forebaffle design for centimeter to sub-millimeter astrophysical observations	Jacques Delabrouille et.al.	2511.16970	null
2025-11-21	Fast far-sidelobe modeling for centimeter to sub-millimeter astrophysical observations	Oliver Jeong et.al.	2511.16967	null
2025-11-21	Real-Time Cooked Food Image Synthesis and Visual Cooking Progress Monitoring on Edge Devices	Jigyasa Gupta et.al.	2511.16965	null
2025-11-21	On Solving Chance-Constrained Models with Gaussian Mixture Distribution	Shibshankar Dey et.al.	2511.16960	null
2025-11-21	Real Option AI: Reversibility, Silence, and the Release Ladder	I. Sebastian Buhai et.al.	2511.16958	null
2025-11-21	MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis	Di Luo et.al.	2511.16957	null
2025-11-21	Distortion of charge distribution due to internal electric fields described by the drift-diffusion semiconductor model	Masakazu Yamamoto et.al.	2511.16956	null
2025-11-21	Neighbor GRPO: Contrastive ODE Policy Optimization Aligns Flow Models	Dailan He et.al.	2511.16955	null
2025-11-21	Point-Supervised Facial Expression Spotting with Gaussian-Based Instance-Adaptive Intensity Modeling	Yicheng Deng et.al.	2511.16952	null
2025-11-21	FingerCap: Fine-grained Finger-level Hand Motion Captioning	Xin Shen et.al.	2511.16951	null
2025-11-21	Rethinking Diffusion Model-Based Video Super-Resolution: Leveraging Dense Guidance from Aligned Features	Jingyi Xu et.al.	2511.16928	null
2025-11-21	Diffusion-Inversion-Net (DIN): An End-to-End Direct Probabilistic Framework for Characterizing Hydraulic Conductivities and Quantifying Uncertainty	Xun Zhang et.al.	2511.16926	null
2025-11-21	DeltaDeno: Zero-Shot Anomaly Generation via Delta-Denoising Attribution	Chaoran Xu et.al.	2511.16920	null
2025-11-21	UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation	Chi Zhang et.al.	2511.16917	null
2025-11-21	Q-REAL: Towards Realism and Plausibility Evaluation for AI-Generated Content	Shushi Wang et.al.	2511.16908	null
2025-11-21	Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models	Hao-Chien Hsueh et.al.	2511.16904	null
2025-11-21	R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios	Lu Zhu et.al.	2511.16901	null
2025-11-20	Dataset Distillation for Pre-Trained Self-Supervised Vision Models	George Cazenavette et.al.	2511.16674	null
2025-11-20	EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards	Omkat Thawakar et.al.	2511.16672	null
2025-11-20	Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO	Junhao Cheng et.al.	2511.16669	null
2025-11-20	V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models	Yang Luo et.al.	2511.16668	null
2025-11-20	SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation	Zhenyuan Qin et.al.	2511.16666	null
2025-11-20	Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter	Qinghao Hu et.al.	2511.16665	null
2025-11-20	Worldline Localization	Changha Choi et.al.	2511.16663	null
2025-11-20	TriDiff-4D: Fast 4D Generation through Diffusion-based Triplane Re-posing	Eddie Pokming Sheung et.al.	2511.16662	null
2025-11-20	Prospects for Neutrino Observation and Mass Measurement from Binary Neutron Star Mergers	Vedran Brdar et.al.	2511.16658	null
2025-11-20	Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems	Elias Lumer et.al.	2511.16654	null
2025-11-20	Measurement incompatibility in Bayesian multiparameter quantum estimation	Francesco Albarelli et.al.	2511.16645	null
2025-11-20	TRIM: Scalable 3D Gaussian Diffusion Inference with Temporal and Spatial Trimming	Zeyuan Yin et.al.	2511.16642	null
2025-11-20	SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction	Guolin Huang et.al.	2511.16635	null
2025-11-20	A Core-Collapse Supernova Neutrino Parameterization with Enhanced Physical Interpretability	Haihao Shi et.al.	2511.16631	null
2025-11-20	Stabilizing Policy Gradient Methods via Reward Profiling	Shihab Ahmed et.al.	2511.16629	null
2025-11-20	TFCDiff: Robust ECG Denoising via Time-Frequency Complementary Diffusion	Pengxin Li et.al.	2511.16627	null
2025-11-20	SAM 3D: 3Dfy Anything in Images	SAM 3D Team et.al.	2511.16624	null
2025-11-20	SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking	Haofeng Liu et.al.	2511.16618	null
2025-11-20	An Information-Theoretic Reconstruction of Curvature	Amandip Sangha et.al.	2511.16601	null
2025-11-20	Time dependent loss reweighting for flow matching and diffusion models is theoretically justified	Lukas Billera et.al.	2511.16599	null
2025-11-20	TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding	Boshen Xu et.al.	2511.16595	null
2025-11-20	Formal Abductive Latent Explanations for Prototype-Based Networks	Jules Soria et.al.	2511.16588	null
2025-11-20	Synthesis of Safety Specifications for Probabilistic Systems	Gaspard Ohlmann et.al.	2511.16579	null
2025-11-20	PolyMinHash: Efficient Area-Based MinHashing of Polygons for Approximate Nearest Neighbor Search	Alima Subedi et.al.	2511.16576	null
2025-11-20	Erase to Retain: Low Rank Adaptation Guided Selective Unlearning in Medical Segmentation Networks	Nirjhor Datta et.al.	2511.16574	null
2025-11-20	Boosting Predictive Performance on Tabular Data through Data Augmentation with Latent-Space Flow-Based Diffusion	Md. Tawfique Ihsan et.al.	2511.16571	null
2025-11-20	Generalized Three-Family Supersymmetric Pati-Salam Models from Type IIA Intersecting D6-Branes	Tianjun Li et.al.	2511.16565	null
2025-11-20	Interfacial and bulk switching MoS2 memristors for an all-2D reservoir computing framework	Asmita S. Thool et.al.	2511.16557	null
2025-11-20	Bayesian polarization calibration and imaging in very long baseline interferometry	Jong-Seo Kim et.al.	2511.16556	null
2025-11-20	Toward Valid Generative Clinical Trial Data with Survival Endpoints	Perrine Chassat et.al.	2511.16551	null
2025-11-20	Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution	Jaime Álvarez Urueña et.al.	2511.16541	null
2025-11-20	Contrastive vision-language learning with paraphrasing and negation	Kwun Ho Ngan et.al.	2511.16527	null
2025-11-20	YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras	Fan Yang et.al.	2511.16521	null
2025-11-20	Nonsmooth Newton methods with effective subspaces for polyhedral regularization	Tran T. A. Nghia et.al.	2511.16514	null
2025-11-20	A Butterfly’s Eye Camera for Intensity Interferometry with Cherenkov Telescopes	Juan Cortina et.al.	2511.16505	null
2025-11-20	Acquisition Time-Informed Breast Tumor Segmentation from Dynamic Contrast-Enhanced MRI	Rui Wang et.al.	2511.16498	null
2025-11-20	An analytical and experimental study of the energy transition discourse on YouTube	Aleix Bassolas et.al.	2511.16497	null
2025-11-20	Physics-Informed Machine Learning for Efficient Sim-to-Real Data Augmentation in Micro-Object Pose Estimation	Zongcai Tan et.al.	2511.16494	null
2025-11-20	WIMP Freeze-out dynamics under Tsallis statistics	Matias P. Gonzalez et.al.	2511.16487	null
2025-11-20	Flow and Depth Assisted Video Prediction with Latent Transformer	Eliyas Suleyman et.al.	2511.16484	null
2025-11-20	Correlation-Aware Feature Attribution Based Explainable AI	Poushali Sengupta et.al.	2511.16482	null
2025-11-20	A Comparison Between Decision Transformers and Traditional Offline Reinforcement Learning Algorithms	Ali Murtaza Caunhye et.al.	2511.16475	null
2025-11-20	FastSurfer-CC: A robust, accurate, and comprehensive framework for corpus callosum morphometry	Clemens Pollak et.al.	2511.16471	null
2025-11-20	Horizontal and Vertical Regularity of Elastic Wave Geometry	Joonas Ilmavirta et.al.	2511.16466	null
2025-11-20	A physics-inspired momentum-based gradient method	Jianing Zhang et.al.	2511.16441	null
2025-11-20	Beyond Visual Cues: Leveraging General Semantics as Support for Few-Shot Segmentation	Jin Wang et.al.	2511.16435	null
2025-11-20	Generative Modeling of Clinical Time Series via Latent Stochastic Differential Equations	Muhammad Aslanimoghanloo et.al.	2511.16427	null
2025-11-20	FreqFlow: Long-term forecasting using lightweight flow matching	Seyed Mohamad Moghadas et.al.	2511.16426	null
2025-11-20	Second-Order MPC-Based Distributed Q-Learning	Samuel Mallick et.al.	2511.16424	null
2025-11-20	Linear magneto-birefringence as a probe of altermagnetism	V. Sunko et.al.	2511.16421	null
2025-11-20	Denoising weak lensing mass maps with diffusion model and generative adversarial network	Shohei D. Aoyama et.al.	2511.16415	null
2025-11-20	Homogeneous Proportional-Integral-Derivative Controller in Mobile Robotic Manipulators	Luis Luna et.al.	2511.16406	null
2025-11-20	A Comprehensive Study on Cyber Attack Vectors in EV Traction Power Electronics	Siddhesh Pimpale et.al.	2511.16399	null
2025-11-20	Quantifying Phase Transformations in Alloying Anodes via In-Situ Liquid Cell Hard X-ray Spectroscopy and Cryogenic Microscopy	Neil Mulcahy et.al.	2511.16382	null
2025-11-20	CAMS: Towards Compositional Zero-Shot Learning via Gated Cross-Attention and Multi-Space Disentanglement	Pan Yang et.al.	2511.16378	null
2025-11-20	Flow-Aided Flight Through Dynamic Clutters From Point To Motion	Bowen Xu et.al.	2511.16372	null
2025-11-20	Modeling adsorption processes on the core-shell-like polymer structures: star and comb topologies	V. Blavatska et.al.	2511.16371	null
2025-11-20	DetailSemNet: Elevating Signature Verification through Detail-Semantic Integration	Meng-Cheng Shih et.al.	2511.16364	null
2025-11-20	iFCTN: Folding-Free Fully-Connected Tensor Network Decomposition for Tensor Completion	Ziyi Gan et.al.	2511.16358	null
2025-11-20	Tripartite Entanglement Generation in Atom-Coupled Dual Microresonators System	Abhishek Mandal et.al.	2511.16351	null
2025-11-20	Beyond Generative AI: World Models for Clinical Prediction, Counterfactuals, and Planning	Mohammad Areeb Qazi et.al.	2511.16333	null
2025-11-20	Non-squeezing and other global rigidity results in locally conformal symplectic geometry	Mélanie Bertelson et.al.	2511.16329	null
2025-11-20	Revealing computation-communication trade-off in Segmented Pinching Antenna System (PASS)	Deqiao Gan et.al.	2511.16327	null
2025-11-20	SDA: Steering-Driven Distribution Alignment for Open LLMs without Fine-Tuning	Wei Xia et.al.	2511.16324	null
2025-11-20	NaTex: Seamless Texture Generation as Latent Color Diffusion	Zeqiang Lai et.al.	2511.16317	null
2025-11-20	Sparse Autoencoders are Topic Models	Leander Girrbach et.al.	2511.16309	null
2025-11-20	Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling	Minseok Seo et.al.	2511.16301	null
2025-11-20	Asymptotic behavior and sharp estimates for spreading fronts in a cooperative system with free boundaries	Qian Qin et.al.	2511.16300	null
2025-11-20	Optimizing Operation Recipes with Reinforcement Learning for Safe and Interpretable Control of Chemical Processes	Dean Brandner et.al.	2511.16297	null
2025-11-20	Explainable AI for Diabetic Retinopathy Detection Using Deep Learning with Attention Mechanisms and Fuzzy Logic-Based Interpretability	Abishek Karthik et.al.	2511.16294	null
2025-11-20	Spectral Identifiability for Interpretable Probe Geometry	William Hao-Cheng Huang et.al.	2511.16288	null
2025-11-20	Graph Diffusion Counterfactual Explanation	David Bechtoldt et.al.	2511.16287	null
2025-11-20	Building temporally coherent 3D maps with VGGT for memory-efficient Semantic SLAM	Gergely Dinya et.al.	2511.16282	null
2025-11-20	Spatially Dependent Sampling of Component Failures for Power System Preventive Control Against Hurricane	Ziyue Li et.al.	2511.16279	null
2025-11-20	Universal features of non-analytical energy storage in quantum critical quantum batteries	Riccardo Grazi et.al.	2511.16274	null
2025-11-20	Mem-MLP: Real-Time 3D Human Motion Generation from Sparse Inputs	Sinan Mutlu et.al.	2511.16264	null
2025-11-20	A case study in ensemble optimal control for Bayesian input design	Ludovic Sacchelli et.al.	2511.16251	null
2025-11-20	Controllable Layer Decomposition for Reversible Multi-Layer Image Generation	Zihao Liu et.al.	2511.16249	null
2025-11-20	Describing Functions and Phase Response Curves of Excitable Systems	Robin Wroblowski et.al.	2511.16235	null
2025-11-20	Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model Security	Wei Zhao et.al.	2511.16229	null
2025-11-20	Difficulty-Controlled Simplification of Piano Scores with Synthetic Data for Inclusive Music Education	Pedro Ramoneda et.al.	2511.16228	null
2025-11-20	Can MLLMs Read the Room? A Multimodal Benchmark for Assessing Deception in Multi-Party Social Interactions	Caixin Kang et.al.	2511.16221	null
2025-11-20	Diffuse Laser Cooling Based on the $6\mathrm{P}_{3/2}$ Excited State of Rubidium Atoms via 420 nm Blue Light	Jia Zhang et.al.	2511.16220	null
2025-11-20	Geometrical properties of strained and twisted moiré heterostructures	Federico Escudero et.al.	2511.16219	null
2025-11-20	Towards Overcoming Data Scarcity in Nuclear Energy: A Study on Critical Heat Flux with Physics-consistent Conditional Diffusion Model	Farah Alsafadi et.al.	2511.16207	null
2025-11-20	Causal Synthetic Data Generation in Recruitment	Andrea Iommi et.al.	2511.16204	null
2025-11-20	PIPHEN: Physical Interaction Prediction with Hamiltonian Energy Networks	Kewei Chen et.al.	2511.16200	null
2025-11-20	SemanticCite: Citation Verification with AI-Powered Full-Text Analysis and Evidence-Based Reasoning	Sebastian Haan et.al.	2511.16198	null
2025-11-20	Random Attractors for McKean-Vlasov SDEs	Mengyu Cheng et.al.	2511.16190	null
2025-11-20	FOOTPASS: A Multi-Modal Multi-Agent Tactical Context Dataset for Play-by-Play Action Spotting in Soccer Broadcast Videos	Jeremie Ochin et.al.	2511.16183	null
2025-11-20	Green Distributed AI Training: Orchestrating Compute Across Renewable-Powered Micro Datacenters	Giuseppe Tomei et.al.	2511.16182	null
2025-11-20	Mitigating Shared Storage Congestion Using Control Theory	Thomas Collignon et.al.	2511.16177	null
2025-11-20	Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight	Yi Yang et.al.	2511.16175	null
2025-11-20	An Image Is Worth Ten Thousand Words: Verbose-Text Induction Attacks on VLMs	Zhi Luo et.al.	2511.16163	null
2025-11-20	Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation Diffusion	Lirui Zhang et.al.	2511.16161	null
2025-11-20	Video2Layout: Recall and Reconstruct Metric-Grounded Cognitive Map for Spatial Reasoning	Yibin Huang et.al.	2511.16160	null
2025-11-20	MagBotSim: Physics-Based Simulation and Reinforcement Learning Environments for Magnetic Robotics	Lara Bergmann et.al.	2511.16158	null
2025-11-20	Spreading Properties of a City-Road Reaction-Diffusion Model on One-Dimensional Lattice	Grégory Faye et.al.	2511.16157	null
2025-11-20	Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers	Jian Ma et.al.	2511.16156	null
2025-11-20	Synthetic Spatiotemporal Plasmonic Vortices On Chip	Qian Chen et.al.	2511.16155	null
2025-11-19	GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization	Yikun Wang et.al.	2511.15705	null
2025-11-19	Think Visually, Reason Textually: Vision-Language Synergy in ARC	Beichen Zhang et.al.	2511.15703	null
2025-11-19	First Frame Is the Place to Go for Video Content Customization	Jingxi Chen et.al.	2511.15700	null
2025-11-19	Joint Semantic-Channel Coding and Modulation for Token Communications	Jingkai Ying et.al.	2511.15699	null
2025-11-19	Assessing Power Flow Controllability via Variable Line Reactance	Eric Haag et.al.	2511.15685	null
2025-11-19	Quantum-Guided Test Case Minimization for LLM-Based Code Generation	Huixiang Zhang et.al.	2511.15665	null
2025-11-19	VisPlay: Self-Evolving Vision-Language Models from Images	Yicheng He et.al.	2511.15661	null
2025-11-19	Spatial scale separation and emergent patterns in coupled diffusive-nondiffusive systems	Théo André et.al.	2511.15648	null
2025-11-19	Economic Linear Quadratic MPC With Non-Unique Optimal Solutions	Mario Zanon et.al.	2511.15630	null
2025-11-19	The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification	Dante Francisco Wasmuht et.al.	2511.15622	null
2025-11-19	When to Think and When to Look: Uncertainty-Guided Lookback	Jing Bi et.al.	2511.15613	null
2025-11-19	MaskMed: Decoupled Mask and Class Prediction for Medical Image Segmentation	Bin Xie et.al.	2511.15603	null
2025-11-19	US-X Complete: A Multi-Modal Approach to Anatomical 3D Shape Recovery	Miruna-Alexandra Gafencu et.al.	2511.15600	null
2025-11-19	Real-Time Optimal Control via Transformer Networks and Bernstein Polynomials	Gage MacLin et.al.	2511.15588	null
2025-11-19	Transferable Dual-Domain Feature Importance Attack against AI-Generated Image Detector	Weiheng Zhu et.al.	2511.15571	null
2025-11-19	Excess of diffuse gamma-ray emission detected from the galaxy cluster Abell 119 from 14-year Fermi-LAT Data	Gajanan D Harale et.al.	2511.15559	null
2025-11-19	Multimodal Evaluation of Russian-language Architectures	Artem Chervyakov et.al.	2511.15552	null
2025-11-19	Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA	Yukun Du et.al.	2511.15551	null
2025-11-19	UltraDP: Generalizable Carotid Ultrasound Scanning with Force-Aware Diffusion Policy	Ruoqu Chen et.al.	2511.15550	null
2025-11-19	Infinite Anticipation Backward Stochastic Differential Equations	Guanwei Cheng et.al.	2511.15548	null
2025-11-19	A Physics Informed Machine Learning Framework for Optimal Sensor Placement and Parameter Estimation	Georgios Venianakis et.al.	2511.15543	null
2025-11-19	A Hybrid CNN-ViT-GNN Framework with GAN-Based Augmentation for Intelligent Weed Detection in Precision Agriculture	Pandiyaraju V et.al.	2511.15535	null
2025-11-19	PCARNN-DCBF: Minimal-Intervention Geofence Enforcement for Ground Vehicles	Yinan Yu et.al.	2511.15522	null
2025-11-19	Theoretical Closed-loop Stability Bounds for Dynamical System Coupled with Diffusion Policies	Gabriel Lauzier et.al.	2511.15520	null
2025-11-19	Multi-Text Guided Few-Shot Semantic Segmentation	Qiang Jiao et.al.	2511.15515	null
2025-11-19	Learning to Expand Images for Efficient Visual Autoregressive Modeling	Ruiqing Yang et.al.	2511.15499	null
2025-11-19	Asymptotic stability of planar entropy wave for 3-d Navier-Stokes equations in Eulerian coordinates	Ren-Jun Duan et.al.	2511.15498	null
2025-11-19	FunnyNodules: A Customizable Medical Dataset Tailored for Evaluating Explainable AI	Luisa Gallée et.al.	2511.15481	null
2025-11-19	A Critical Drift-Diffusion Equation: Intermittent Behavior via Geometric Brownian Motion on $\textbf{SL}(n)$	Peter S. Morfe et.al.	2511.15473	null
2025-11-19	Deep Learning for Accurate Vision-based Catch Composition in Tropical Tuna Purse Seiners	Xabier Lekunberri et.al.	2511.15468	null
2025-11-19	Generalized differentiation in Wasserstein space and application to multiagent control problem	Rossana Capuani et.al.	2511.15455	null
2025-11-19	Testing Conditional Independence via the Spectral Generalized Covariance Measure: Beyond Euclidean Data	Ryunosuke Miyazaki et.al.	2511.15453	null
2025-11-19	A Dataset and Baseline for Deep Learning-Based Visual Quality Inspection in Remanufacturing	Johannes C. Bauer et.al.	2511.15440	null
2025-11-19	Self-dual instantons and gravitating dyons in non-Abelian ModMax theory	Fabrizio Canfora et.al.	2511.15437	null
2025-11-19	HV-Attack: Hierarchical Visual Attack for Multimodal Retrieval Augmented Generation	Linyin Luo et.al.	2511.15435	null
2025-11-19	Optimizing Resource Distribution in a One-Dimensional Logistic Diffusion Model	Junyoung Heo et.al.	2511.15428	null
2025-11-19	Enabling NLOS Imaging Capabilities at the Initial Access of 6G Base Stations	Davide Tornielli Bellini et.al.	2511.15416	null
2025-11-19	D4C: Data-free Quantization for Contrastive Language-Image Pre-training Models	Wenlun Zhang et.al.	2511.15411	null
2025-11-19	ShelfOcc: Native 3D Supervision beyond LiDAR for Vision-Based Occupancy Estimation	Simon Boeder et.al.	2511.15396	null
2025-11-19	Simple relations from complex outflows: How the $M-σ$ relation emerges in a multi-phase environment	Matas Tartėnas et.al.	2511.15380	null
2025-11-19	The Empowerment of Science of Science by Large Language Models: New Tools and Methods	Guoqiang Liang et.al.	2511.15370	null
2025-11-19	Covariant Measures of Non-Markovianity in Curved Spacetime	Tushar Waghmare et.al.	2511.15365	null
2025-11-19	An Information-Theoretic Route to Isoperimetric Inequalities via Heat Flow and Entropy Dissipation	Amandip Sangha et.al.	2511.15356	null
2025-11-19	Adaptive thresholding pattern for fingerprint forgery detection	Zahra Farzadpour et.al.	2511.15322	null
2025-11-19	Thermalizing channel states for rapid qubit heating	Ziyang You et.al.	2511.15314	null
2025-11-19	The Rotation Dip in the Envelope-Disk Transition of HH 111: Evidence for Magnetic Braking	Jyun-Heng Lin et.al.	2511.15309	null
2025-11-19	Taming Generative Synthetic Data for X-ray Prohibited Item Detection	Jialong Sun et.al.	2511.15299	null
2025-11-19	Adversarial Attack on Black-Box Multi-Agent by Adaptive Perturbation	Jianming Chen et.al.	2511.15292	null
2025-11-19	Normalized Solutions for the $(2,q)$ -Laplacian Operator Between Mass-Critical Exponents	Laura Baldelli et.al.	2511.15285	null
2025-11-19	Photoluminescence Mapping of Mobile and Fixed Defects in Halide Perovskite Films	Sarah C. Gillespie et.al.	2511.15281	null
2025-11-19	Photoinduced topological phase transition in monolayer 1T $^\prime$-MoS$_2$	Mohammad Mortezaei Nobahari et.al.	2511.15268	null
2025-11-19	ChartEditor: A Reinforcement Learning Framework for Robust Chart Editing	Liangyu Chen et.al.	2511.15266	null
2025-11-19	SplitFlux: Learning to Decouple Content and Style from a Single Image	Yitong Yang et.al.	2511.15258	null
2025-11-19	PresentCoach: Dual-Agent Presentation Coaching through Exemplars and Interactive Feedback	Sirui Chen et.al.	2511.15253	null
2025-11-19	Symmetry-Breaking in Multi-Agent Navigation: Winding Number-Aware MPC with a Learned Topological Strategy	Tomoki Nakao et.al.	2511.15239	null
2025-11-19	Note on Logical Gates by Gauge Field Formalism of Quantum Error Correction	Junichi Haruna et.al.	2511.15224	null
2025-11-19	Why Physics Still Matters: Improving Machine Learning Prediction of Material Properties with Phonon-Informed Datasets	Pol Benítez et.al.	2511.15222	null
2025-11-19	Nonholonomic Robot Parking by Feedback – Part II: Nonmodular, Inverse Optimal, Adaptive, Prescribed/Fixed-Time and Safe Designs	Kwang Hak Kim et.al.	2511.15219	null
2025-11-19	Magnetic signal scan imaging system based on giant magnetoimpedance (GMI) differential sensor	Tao Yang et.al.	2511.15209	null
2025-11-19	Reasoning in Diffusion Large Language Models is Concentrated in Dynamic Confusion Zones	Ranfei Chen et.al.	2511.15208	null
2025-11-19	Trustworthy GenAI over 6G: Integrated Applications and Security Frameworks	Bui Duc Son et.al.	2511.15206	null
2025-11-19	Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval	Qing Wang et.al.	2511.15201	null
2025-11-19	VIRAL: Visual Sim-to-Real at Scale for Humanoid Loco-Manipulation	Tairan He et.al.	2511.15200	null
2025-11-19	Particle deformability stabilizes hexatic order and suppresses crystallization	Jatin Kumar et.al.	2511.15195	null
2025-11-19	Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning	Yuxuan Gu et.al.	2511.15190	null
2025-11-19	Theoretical Bounds on Parallel Imaging Implicit Data Crimes in an MRI Reproducing Kernel Hilbert Space	Evan Frenklak et.al.	2511.15187	null
2025-11-19	Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset	Geon Choi et.al.	2511.15186	null
2025-11-19	FaultDiffusion: Few-Shot Fault Time Series Generation with Diffusion Model	Yi Xu et.al.	2511.15174	null
2025-11-19	Generating Natural-Language Surgical Feedback: From Structured Representation to Domain-Grounded Evaluation	Firdavs Nasriddinov et.al.	2511.15159	null
2025-11-19	SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection	Chun-Jung Lin et.al.	2511.15153	null
2025-11-19	Pseudo-magnetic Fields and Effective Dynamics in Strained Honeycomb Structures	Chengyu Zhang et.al.	2511.15152	null
2025-11-19	Radial Fast Entangling Gates Under Micromotion in Trapped-Ion Quantum Computers	Phoebe Grosser et.al.	2511.15148	null
2025-11-19	Effects of Interactions and Defect Motion on Ramp Reversal Memory in Locally Phase Separated Materials	Y. Sun et.al.	2511.15147	null
2025-11-19	From Solving to Verifying: A Unified Objective for Robust Reasoning in LLMs	Xiaoxuan Wang et.al.	2511.15137	null
2025-11-19	Beyond Trotterization: Variational Product Formulas for Quantum Simulation	Ibsal Assi et.al.	2511.15124	null
2025-11-19	Nonholonomic Robot Parking by Feedback – Part I: Modular Strict CLF Designs	Velimir Todorovski et.al.	2511.15119	null
2025-11-19	Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation	Jin Wang et.al.	2511.15118	null
2025-11-19	Microscopic Investigation of rf Vortex Nucleation in Nb3Sn Films Using a Near-Field Magnetic Microwave Microscope	Chung-Yang Wang et.al.	2511.15116	null
2025-11-19	A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models	Duo Li et.al.	2511.15098	null
2025-11-19	Jointly Conditioned Diffusion Model for Multi-View Pose-Guided Person Image Synthesis	Chengyu Xie et.al.	2511.15092	null
2025-11-19	Chromatographic Peak Shape from Stochastic Model: Analytic Time-Domain Expression in Terms of Physical Parameters and Conditions under which Heterogeneity Reduces Tailing	Hernán R. Sánchez et.al.	2511.15088	null
2025-11-19	Cement2: Temporal Hardware Transactions for High-Level and Efficient FPGA Programming	Youwei Xiao et.al.	2511.15073	null
2025-11-19	BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow Matching	Yachuan Huang et.al.	2511.15066	null
2025-11-19	Reasoning via Video: The First Evaluation of Video Models’ Reasoning Abilities through Maze-Solving Tasks	Cheng Yang et.al.	2511.15065	null
2025-11-19	Evaluating Multimodal Large Language Models on Vertically Written Japanese Text	Keito Sasagawa et.al.	2511.15059	null
2025-11-19	CellGenNet: A Knowledge-Distilled Framework for Robust Cell Segmentation in Cancer Tissues	Srijan Ray et.al.	2511.15054	null
2025-11-19	Global Gevrey solution of 3D anisotropic Navier-Stokes system in a strip domain	Wei-Xi Li et.al.	2511.15050	null
2025-11-19	UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space	Panqi Yang et.al.	2511.15046	null
2025-11-19	Integrating Atomic Scale Catalyst Design with Transport Engineering for Stable and Efficient CO2 Electrolysis to CO in a Membrane Electrode Assembly	Zahra Teimouri et.al.	2511.15042	null
2025-11-19	The existence and instability of blowing-up steady states for the Shigesada-Kawasaki-Teramoto competition model with cross-diffusion	Kousuke Kuto et.al.	2511.15039	null
2025-11-19	Aligning Generative Music AI with Human Preferences: Methods and Challenges	Dorien Herremans et.al.	2511.15038	null
2025-11-19	WiCo-PG: Wireless Channel Foundation Model for Pathloss Map Generation via Synesthesia of Machines	Mingran Sun et.al.	2511.15030	null
2025-11-19	WiCo-MG: Wireless Channel Foundation Model for Multipath Generation via Synesthesia of Machines	Zengrui Han et.al.	2511.15026	null
2025-11-19	Task Specific Sharpness Aware O-RAN Resource Management using Multi Agent Reinforcement Learning	Fatemeh Lotfi et.al.	2511.15002	null
2025-11-19	FinCriticalED: A Visual Benchmark for Financial Fact-Level OCR Evaluation	Yueru He et.al.	2511.14998	null
2025-11-19	Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation	Vladimir Arkhipkin et.al.	2511.14993	null
2025-11-18	SVBRD-LLM: Self-Verifying Behavioral Rule Discovery for Autonomous Vehicle Identification	Xiangyu Li et.al.	2511.14977	null
2025-11-18	Quality-Controlled Multimodal Emotion Recognition in Conversations with Identity-Based Transfer Learning and MAMBA Fusion	Zanxu Wang et.al.	2511.14969	null
2025-11-18	LFreeDA: Label-Free Drift Adaptation for Windows Malware Detection	Adrian Shuai Li et.al.	2511.14963	null
2025-11-18	Reconstruction of three-dimensional shapes of normal and disease-related erythrocytes from partial observations using multi-fidelity neural networks	Haizhou Wen et.al.	2511.14962	null
2025-11-18	ARC Is a Vision Problem!	Keya Hu et.al.	2511.14761	null
2025-11-18	UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in Reinforcement Learning	Rui Tian et.al.	2511.14760	null
2025-11-18	Robust Verification of Controllers under State Uncertainty via Hamilton-Jacobi Reachability Analysis	Albert Lin et.al.	2511.14755	null
2025-11-18	Squeezing-Enhanced Photon-Number Measurements for GKP State Generation	Paul Renault et.al.	2511.14737	null
2025-11-18	Starlight-driven flared-staircase geometry in radiation hydrodynamic models of protoplanetary disks	Prakruti Sudarshan et.al.	2511.14733	null
2025-11-18	Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration	Parya Dolatyabi et.al.	2511.14730	null
2025-11-18	Zero-shot Synthetic Video Realism Enhancement via Structure-aware Denoising	Yifan Wang et.al.	2511.14719	null
2025-11-18	Diffusion As Self-Distillation: End-to-End Latent Diffusion In One Model	Xiyuan Wang et.al.	2511.14716	null
2025-11-18	FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation	Yunfeng Wu et.al.	2511.14712	null
2025-11-18	Systematic Study of the Self-Renormalized Nucleon Gluon PDF in Large-Momentum Effective Theory	Alex NieMiera et.al.	2511.14708	null
2025-11-18	Cell Shape Emerges from Motion	Gautham Gopinath et.al.	2511.14707	null
2025-11-18	Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances	Rishu Kumar Singh et.al.	2511.14693	null
2025-11-18	NERD: Network-Regularized Diffusion Sampling For 3D Computed Tomography	Shijun Liang et.al.	2511.14680	null
2025-11-18	Giant enhancement of attosecond tunnel ionization competes with disorder-driven decoherence in silicon	D. N. Purschke et.al.	2511.14678	null
2025-11-18	High-resolution weak lensing mass mapping from DES-Y3 data using diffusion-based prior	Supranta S. Boruah et.al.	2511.14667	null
2025-11-18	Optimal ${L^2}$ error estimates of fully discrete finite element methods for the 2D/3D diffuse interface two-phase MHD flows	Ke Zhang et.al.	2511.14656	null
2025-11-18	Search by Return: Stochastic Resetting in Fluctuating Harmonic Potentials	Derek Frydel et.al.	2511.14646	null
2025-11-18	A Specialized Large Language Model for Clinical Reasoning and Diagnosis in Rare Diseases	Tao Yang et.al.	2511.14638	null
2025-11-18	PAC global optimization for VQE in low-curvature geometric regimes	Benjamin Asch et.al.	2511.14628	null
2025-11-18	Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3D Constrained Terrains	Qingwei Ben et.al.	2511.14625	null
2025-11-18	3D-Guided Scalable Flow Matching for Generating Volumetric Tissue Spatial Transcriptomics from Serial Histology	Mohammad Vali Sanian et.al.	2511.14613	null
2025-11-18	XAttn-BMD: Multimodal Deep Learning with Cross-Attention for Femoral Neck Bone Mineral Density Estimation	Yilin Zhang et.al.	2511.14604	null
2025-11-18	A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder	Dengyun Huang et.al.	2511.14600	null
2025-11-18	First measurement of reactor neutrino oscillations at JUNO	Angel Abusleme et.al.	2511.14593	null
2025-11-18	Initial performance results of the JUNO detector	Angel Abusleme et.al.	2511.14590	null
2025-11-18	Task Addition and Weight Disentanglement in Closed-Vocabulary Models	Adam Hazimeh et.al.	2511.14569	null
2025-11-18	Nonlinearity-induced transition in heat conduction through a topological metamaterial of rotors	T. R. Vishnu et.al.	2511.14560	null
2025-11-18	Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion Models	Xinzhe Zheng et.al.	2511.14559	null
2025-11-18	ForensicFlow: A Tri-Modal Adaptive Network for Robust Deepfake Detection	Mohammad Romani et.al.	2511.14554	null
2025-11-18	MissHDD: Hybrid Deterministic Diffusion for Hetrogeneous Incomplete Data Imputation	Youran Zhou et.al.	2511.14543	null
2025-11-18	Fractal Polariton Topological Insulator	Khalil Sabour et.al.	2511.14542	null
2025-11-18	DeCo-VAE: Learning Compact Latents for Video Reconstruction via Decoupled Representation	Xiangchen Yin et.al.	2511.14530	null
2025-11-18	A Generative Data Framework with Authentic Supervision for Underwater Image Restoration and Enhancement	Yufeng Tian et.al.	2511.14521	null
2025-11-18	Full Atom Peptide Design via Riemannian Euclidean Bayesian Flow Networks	Hao Qian et.al.	2511.14516	null
2025-11-18	Exponential Lower Bounds for the Advection-Diffusion Equation with Shear Flows	Yupei Huang et.al.	2511.14512	null
2025-11-18	Enhancing End-to-End Autonomous Driving with Risk Semantic Distillaion from VLM	Jack Qin et.al.	2511.14499	null
2025-11-18	Covariance-based Imaging and Multi-View Fusion for Networked Sensing	Junyuan Gao et.al.	2511.14490	null
2025-11-18	Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation	Aditi Agarwal et.al.	2511.14481	null
2025-11-18	Emergent Geometry Governs Optimal Control in Driven Stokes Flows	Kyle McKee et.al.	2511.14479	null
2025-11-18	Multi-network Topology Underlying Individual Language Learning Success	Peilun Song et.al.	2511.14453	null
2025-11-18	DIR-TIR: Dialog-Iterative Refinement for Text-to-Image Retrieval	Zongwei Zhen et.al.	2511.14449	null
2025-11-18	Analyzing Many Simulations of Hybrid Programs in Lince	Reydel Arrieta et.al.	2511.14436	null
2025-11-18	Achieving Safe Control Online through Integration of Harmonic Control Lyapunov-Barrier Functions with Unsafe Object-Centric Action Policies	Marlow Fawn et.al.	2511.14434	null
2025-11-18	Mutation Testing for Industrial Robotic Systems	Marcela Gonçalves dos Santos et.al.	2511.14432	null
2025-11-18	MiAD: Mirage Atom Diffusion for De Novo Crystal Generation	Andrey Okhotin et.al.	2511.14426	null
2025-11-18	FlowRoI A Fast Optical Flow Driven Region of Interest Extraction Framework for High-Throughput Image Compression in Immune Cell Migration Analysis	Xiaowei Xu et.al.	2511.14419	null
2025-11-18	Inertial active particles in a Poiseuille flow: negative mobility and particle separation	Ankit Gupta et.al.	2511.14412	null
2025-11-18	Continuous Vision-Language-Action Co-Learning with Semantic-Physical Alignment for Behavioral Cloning	Xiuxiu Qi et.al.	2511.14396	null
2025-11-18	Blur-Robust Detection via Feature Restoration: An End-to-End Framework for Prior-Guided Infrared UAV Target Detection	Xiaolin Wang et.al.	2511.14371	null
2025-11-18	Oscillation Quenching Induced By Time-Varying Coupling Functions	Dushko Stavrov et.al.	2511.14370	null
2025-11-18	O3SLM: Open Weight, Open Data, and Open Vocabulary Sketch-Language Model	Rishi Gupta et.al.	2511.14368	null
2025-11-18	ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning	Hongwei Liu et.al.	2511.14366	null
2025-11-18	Deep Andromeda JCMT-SCUBA2 Observations. The Submillimeter Maps and Giant Molecular Clouds	Sihan Jiao et.al.	2511.14360	null
2025-11-18	A PDE-constrained Optimization Approach to Optimal Trajectory Planning under Uncertainty via Reflected Schrödinger Bridges	Dante Kalise et.al.	2511.14355	null
2025-11-18	ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries	Junfu Pu et.al.	2511.14349	null
2025-11-18	Silhouette-to-Contour Registration: Aligning Intraoral Scan Models with Cephalometric Radiographs	Yiyi Miao et.al.	2511.14343	null
2025-11-18	Compiler design for hardware specific decomposition optimizations, tailored to diamond NV centers	Folkert de Ronde et.al.	2511.14339	null
2025-11-18	ArchMap: Arch-Flattening and Knowledge-Guided Vision Language Model for Tooth Counting and Structured Dental Understanding	Bohan Zhang et.al.	2511.14336	null
2025-11-18	Akaike-type information criterion of SEM for jump-diffusion processes based on high-frequency data	Shogo Kusano et.al.	2511.14333	null
2025-11-18	Step by Step Network	Dongchen Han et.al.	2511.14329	null
2025-11-18	H-LDM: Hierarchical Latent Diffusion Models for Controllable and Interpretable PCG Synthesis from Clinical Metadata	Chenyang Xu et.al.	2511.14312	null
2025-11-18	Iterative Diffusion-Refined Neural Attenuation Fields for Multi-Source Stationary CT Reconstruction: NAF Meets Diffusion Model	Jiancheng Fang et.al.	2511.14310	null
2025-11-18	GEN3D: Generating Domain-Free 3D Scenes from a Single Image	Yuxin Zhang et.al.	2511.14291	null
2025-11-18	NeuralBoneReg: A Novel Self-Supervised Method for Robust and Accurate Multi-Modal Bone Surface Registration	Luohong Wu et.al.	2511.14286	null
2025-11-18	Generating spatially separated correlated multiphoton states in nonlinear waveguide quantum electrodynamics	Jia-Qi Li et.al.	2511.14281	null
2025-11-18	Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery	Yiming Zeng et.al.	2511.14270	null
2025-11-18	Statistically controllable microstructure reconstruction framework for heterogeneous materials using sliced-Wasserstein metric and neural networks	Zhenchuan Ma et.al.	2511.14268	null
2025-11-18	ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation	Zitong Xu et.al.	2511.14259	null
2025-11-18	Towards Authentic Movie Dubbing with Retrieve-Augmented Director-Actor Interaction Learning	Rui Liu et.al.	2511.14249	null
2025-11-18	MuCPT: Music-related Natural Language Model Continued Pretraining	Kai Tian et.al.	2511.14245	null
2025-11-18	EBind: a practical approach to space binding	Jim Broadbent et.al.	2511.14229	null
2025-11-18	Integrating electronic structure into generative modeling of inorganic materials	Junkil Park et.al.	2511.14228	null
2025-11-18	DevPiolt: Operation Recommendation for IoT Devices at Xiaomi Home	Yuxiang Wang et.al.	2511.14227	null
2025-11-18	StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model	Yifan Yang et.al.	2511.14223	null
2025-11-18	Bridging the Gap Between Bayesian Deep Learning and Ensemble Weather Forecasts	Xinlei Xiong et.al.	2511.14218	null
2025-11-18	Measurement-Constrained Sampling for Text-Prompted Blind Face Restoration	Wenjie Li et.al.	2511.14213	null
2025-11-18	InstantViR: Real-Time Video Inverse Problem Solver with Distilled Diffusion Prior	Weimin Bai et.al.	2511.14208	null
2025-11-18	FreeMusco: Motion-Free Learning of Latent Control for Morphology-Adaptive Locomotion in Musculoskeletal Characters	Minkwan Kim et.al.	2511.14205	null
2025-11-18	Online Data Curation for Object Detection via Marginal Contributions to Dataset-level Average Precision	Zitang Sun et.al.	2511.14197	null
2025-11-18	GloTok: Global Perspective Tokenizer for Image Reconstruction and Generation	Xuan Zhao et.al.	2511.14184	null
2025-11-18	UniSER: A Foundation Model for Unified Soft Effects Removal	Jingdong Zhang et.al.	2511.14183	null
2025-11-18	Towards Deploying VLA without Fine-Tuning: Plug-and-Play Inference-Time VLA Policy Steering via Embodied Evolutionary Diffusion	Zhuo Li et.al.	2511.14178	null
2025-11-18	AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs	Xinliang Zhang et.al.	2511.14169	null
2025-11-18	FlexiCup: Wireless Multimodal Suction Cup with Dual-Zone Vision-Tactile Sensing	Junhao Gong et.al.	2511.14139	null
2025-11-18	Synthetic Survival Control: Extending Synthetic Controls for “When-If” Decision	Jessy Xinyi Han et.al.	2511.14133	null
2025-11-18	Run, Ruminate, and Regulate: A Dual-process Thinking System for Vision-and-Language Navigation	Yu Zhong et.al.	2511.14131	null
2025-11-18	Multi-view Phase-aware Pedestrian-Vehicle Incident Reasoning Framework with Vision-Language Models	Hao Zhen et.al.	2511.14120	null
2025-11-18	Real-Time Mobile Video Analytics for Pre-arrival Emergency Medical Services	Liuyi Jin et.al.	2511.14119	null
2025-11-18	Coffee: Controllable Diffusion Fine-tuning	Ziyao Zeng et.al.	2511.14113	null
2025-11-18	A Patient-Independent Neonatal Seizure Prediction Model Using Reduced Montage EEG and ECG	Sithmini Ranasingha et.al.	2511.14110	null
2025-11-18	Lightweight Multi-task CNN for ECG Diagnosis with GRU-Diffusion	Lehuai Xu et.al.	2511.14104	null
2025-11-18	APD-Agents: A Large Language Model-Driven Multi-Agents Collaborative Framework for Automated Page Design	Xinpeng Chen et.al.	2511.14101	null
2025-11-18	Text-Driven Reasoning Video Editing via Reinforcement Learning on Digital Twin Representations	Yiqing Shen et.al.	2511.14100	null
2025-11-18	FAPE-IR: Frequency-Aware Planning and Execution Framework for All-in-One Image Restoration	Jingren Liu et.al.	2511.14099	null
2025-11-18	Collaborative QA using Interacting LLMs. Impact of Network Structure, Node Capability and Distributed Data	Adit Jain et.al.	2511.14098	null
2025-11-18	Zero-Training Task-Specific Model Synthesis for Few-Shot Medical Image Classification	Yao Qin et.al.	2511.14082	null
2025-11-18	Meta-SimGNN: Adaptive and Robust WiFi Localization Across Dynamic Configurations and Diverse Scenarios	Qiqi Xiao et.al.	2511.14076	null
2025-11-18	CFG-EC: Error Correction Classifier-Free Guidance	Nakkyu Yang et.al.	2511.14075	null
2025-11-18	CORE: Compact Object-centric REpresentations as a New Paradigm for Token Merging in LVLMs	Jingyu Lei et.al.	2511.14072	null
2025-11-18	Semantic Context Matters: Improving Conditioning for Autoregressive Models	Dongyang Jin et.al.	2511.14063	null
2025-11-17	Back to Basics: Let Denoising Generative Models Denoise	Tianhong Li et.al.	2511.13720	null
2025-11-17	Segment Anything Across Shots: A Method and Benchmark	Hengrui Hu et.al.	2511.13715	null
2025-11-17	UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity	Junwei Yu et.al.	2511.13714	null
2025-11-17	Free-Form Scene Editor: Enabling Multi-Round Object Manipulation like in a 3D Engine	Xincheng Shuai et.al.	2511.13713	null
2025-11-17	TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models	Harold Haodong Chen et.al.	2511.13704	null
2025-11-17	Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation	Sofia Jamil et.al.	2511.13689	null
2025-11-17	Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting	Jiangnan Ye et.al.	2511.13684	null
2025-11-17	Cross-Learning from Scarce Data via Multi-Task Constrained Optimization	Leopoldo Agorio et.al.	2511.13680	null
2025-11-17	Ontology-Driven Model-to-Model Transformation of Workflow Specifications	Francisco Abreu et.al.	2511.13661	null
2025-11-17	OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation	Henry Herzog et.al.	2511.13655	null
2025-11-17	JWST observes the assembly of a massive galaxy at z~4	Aayush Saxena et.al.	2511.13650	null
2025-11-17	Distribution Matching Distillation Meets Reinforcement Learning	Dengyang Jiang et.al.	2511.13649	null
2025-11-17	PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image	Ziang Cao et.al.	2511.13648	null
2025-11-17	Part-X-MLLM: Part-aware 3D Multimodal Large Language Model	Chunshi Wang et.al.	2511.13647	null
2025-11-17	CacheFlow: Compressive Streaming Memory for Efficient Long-Form Video Understanding	Shrenik Patel et.al.	2511.13644	null
2025-11-17	It’s a Feature, Not a Bug: Secure and Auditable State Rollback for Confidential Cloud Applications	Quinn Burke et.al.	2511.13641	null
2025-11-17	Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures	Haohui Wang et.al.	2511.13640	null
2025-11-17	The Bottom-Up Approach for Powerful Testing with FWER Control	Rajesh Karmakar et.al.	2511.13624	null
2025-11-17	VVS: Accelerating Speculative Decoding for Visual Autoregressive Generation via Partial Verification Skipping	Haotian Dong et.al.	2511.13587	null
2025-11-17	Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification	Linhan Zhou et.al.	2511.13575	null
2025-11-17	Infinite-Horizon Optimal Control of Jump-Diffusion Models for Pollution-Dependent Disasters	Daria Sakhanda et.al.	2511.13568	null
2025-11-17	TSE-Net: Semi-supervised Monocular Height Estimation from Single Remote Sensing Images	Sining Chen et.al.	2511.13552	null
2025-11-17	Robust Defense Strategies for Multimodal Contrastive Learning: Efficient Fine-tuning Against Backdoor Attacks	Md. Iqbal Hossain et.al.	2511.13545	null
2025-11-17	Quantum Machine Learning via Contrastive Training	Liudmila A. Zhukas et.al.	2511.13497	null
2025-11-17	Language-Guided Invariance Probing of Vision-Language Models	Jae Joong Lee et.al.	2511.13494	null
2025-11-17	Semantic Document Derendering: SVG Reconstruction via Vision-Language Modeling	Adam Hazimeh et.al.	2511.13478	null
2025-11-17	Spin Accumulation based deep MOKE Microscopy	Jean Rodriguez et.al.	2511.13468	null
2025-11-17	Contact-Safe Reinforcement Learning with ProMP Reparameterization and Energy Awareness	Bingkun Huang et.al.	2511.13459	null
2025-11-17	Trust in Vision-Language Models: Insights from a Participatory User Workshop	Agnese Chiatti et.al.	2511.13458	null
2025-11-17	Discovering Operational Patterns Using Image-Based Convolutional Clustering and Composite Evaluation: A Case Study in Foundry Melting Processes	Zhipeng Ma et.al.	2511.13444	null
2025-11-17	Unlocking the Forgery Detection Potential of Vanilla MLLMs: A Novel Training-Free Pipeline	Rui Zuo et.al.	2511.13442	null
2025-11-17	Emergent spectral geometry in the Coherence–Curvature Model	Jorge Lamas et.al.	2511.13423	null
2025-11-17	VOPE: Revisiting Hallucination of Vision-Language Models in Voluntary Imagination Task	Xingming Long et.al.	2511.13420	null
2025-11-17	Delineate Anything Flow: Fast, Country-Level Field Boundary Detection from Any Source	Mykola Lavreniuk et.al.	2511.13417	null
2025-11-17	Microwave-acoustic-driven power electronics	Liyang Jin et.al.	2511.13412	null
2025-11-17	Robustness of Morse decomposition for non-local perturbations of the Chafee-Infante equation	Rubén Caballero et.al.	2511.13406	null
2025-11-17	TripleFDS: Triple Feature Disentanglement and Synthesis for Scene Text Editing	Yuchen Bao et.al.	2511.13399	null
2025-11-17	Descriptor: Distance-Annotated Traffic Perception Question Answering (DTPQA)	Nikos Theodoridis et.al.	2511.13397	null
2025-11-17	Generalized Denoising Diffusion Codebook Models (gDDCM): Tokenizing images using a pre-trained diffusion model	Fei Kong et.al.	2511.13387	null
2025-11-17	A space-time hybrid parareal method for kinetic equations in the diffusive scaling	Tino Laidin et.al.	2511.13386	null
2025-11-17	Donors and Recipients: On Asymmetric Transfer Across Tasks and Languages with Parameter-Efficient Fine-Tuning	Kajetan Dymkiewicz et.al.	2511.13368	null
2025-11-17	Local asymptotic normality for discretely observed McKean-Vlasov diffusions	Akram Heidari et.al.	2511.13366	null
2025-11-17	Semi-Supervised Multi-Task Learning for Interpretable Quality As- sessment of Fundus Images	Lucas Gabriel Telesco et.al.	2511.13353	null
2025-11-17	Coliseum project: Correlating climate change data with the behavior of heritage materials	A Cormier et.al.	2511.13343	null
2025-11-17	Role of partial stable stratification on the onset of rotating magnetoconvection with a uniform vertical field	Tirtharaj Barman et.al.	2511.13340	null
2025-11-17	Role of partial stable stratification on the onset of rotating magnetoconvection with a uniform horizontal field	Tirtharaj Barman et.al.	2511.13331	null
2025-11-17	TacEleven: generative tactic discovery for football open play	Siyao Zhao et.al.	2511.13326	null
2025-11-17	Computer Vision based group activity detection and action spotting	Narthana Sivalingam et.al.	2511.13315	null
2025-11-17	EL3DD: Extended Latent 3D Diffusion for Language Conditioned Multitask Manipulation	Jonas Bode et.al.	2511.13312	null
2025-11-17	DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous Driving	Kaiwen Cai et.al.	2511.13309	null
2025-11-17	On the deep commuting graph of a finite group	Sumana Hatui et.al.	2511.13303	null
2025-11-17	CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving	Enhui Ma et.al.	2511.13297	null
2025-11-17	SkyReels-Text: Fine-grained Font-Controllable Text Editing for Poster Design	Yunjie Yu et.al.	2511.13285	null
2025-11-17	TabFlash: Efficient Table Understanding with Progressive Question Conditioning and Token Focusing	Jongha Kim et.al.	2511.13283	null
2025-11-17	Towards Metric-Aware Multi-Person Mesh Recovery by Jointly Optimizing Human Crowd in Camera Space	Kaiwen Wang et.al.	2511.13282	null
2025-11-17	SF-Recon: Simplification-Free Lightweight Building Reconstruction via 3D Gaussian Splatting	Zihan Li et.al.	2511.13278	null
2025-11-17	Pinching-Antenna-Enabled Cognitive Radio Networks	Zeyang Sun et.al.	2511.13272	null
2025-11-17	Examining the Usage of Generative AI Models in Student Learning Activities for Software Programming	Rufeng Chen et.al.	2511.13271	null
2025-11-17	Crossover dynamics and non-Gaussian fluctuations in inertial active chains	Manish Patel et.al.	2511.13270	null
2025-11-17	TransFit-CSM: A Fast, Physically Consistent Framework for Interaction-Powered Transients	Yu-Hao Zhang et.al.	2511.13265	null
2025-11-17	Referring Camouflaged Object Detection With Multi-Context Overlapped Windows Cross-Attention	Yu Wen et.al.	2511.13249	null
2025-11-17	Seek and You Shall Fold	Nadav Bojan Sellam et.al.	2511.13244	null
2025-11-17	Uncovering and Mitigating Transient Blindness in Multimodal Model Editing	Xiaoqi Han et.al.	2511.13243	null
2025-11-17	MMD-Thinker: Adaptive Multi-Dimensional Thinking for Multimodal Misinformation Detection	Junjie Wu et.al.	2511.13242	null
2025-11-17	Dynamical Networking of Polymer Networks with Dedicated Cross-linker Particles	Nadine du Toit et.al.	2511.13241	null
2025-11-17	MRIQT: Physics-Aware Diffusion Model for Image Quality Transfer in Neonatal Ultra-Low-Field MRI	Malek Al Abed et.al.	2511.13232	null
2025-11-17	FoleyBench: A Benchmark For Video-to-Audio Models	Satvik Dixit et.al.	2511.13219	null
2025-11-17	Numerical renormalization group integrated Hamiltonian truncation: Toward generic deformation of integrable lattice models	Xiaodong He et.al.	2511.13218	null
2025-11-17	Spectral component imaging of solar X-ray flares	Muriel Zoë Stiefel et.al.	2511.13213	null
2025-11-17	Cog-RAG: Cognitive-Inspired Dual-Hypergraph with Theme Alignment Retrieval-Augmented Generation	Hao Hu et.al.	2511.13201	null
2025-11-17	Birth of a Painting: Differentiable Brushstroke Reconstruction	Ying Jiang et.al.	2511.13191	null
2025-11-17	Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework	Diego Ortego et.al.	2511.13189	null
2025-11-17	Collision-Free Navigation of Mobile Robots via Quadtree-Based Model Predictive Control	Osama Al Sheikh Ali et.al.	2511.13188	null
2025-11-17	The Geometry of Hidden Modes in Distance-Based Formation Control	Solomon Goldgraber Casspi et.al.	2511.13187	null
2025-11-17	DiffFP: Learning Behaviors from Scratch via Diffusion-based Fictitious Play	Akash Karthikeyan et.al.	2511.13186	null
2025-11-17	GenTract: Generative Global Tractography	Alec Sargood et.al.	2511.13183	null
2025-11-17	Real-time distortion prediction in metallic additive manufacturing via a physics-informed neural operator approach	Mingxuan Tian et.al.	2511.13178	null
2025-11-17	HDW-SR: High-Frequency Guided Diffusion Model based on Wavelet Decomposition for Image Super-Resolution	Chao Yang et.al.	2511.13175	null
2025-11-17	Warm-starting active-set solvers using graph neural networks	Ella J. Schmidtobreick et.al.	2511.13174	null
2025-11-17	Autonomous Sensing UAV for Accurate Multi-User Identification and Localization in Cellular Networks	Niccolò Paglierani et.al.	2511.13171	null
2025-11-17	SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical Registration	Haodong Wang et.al.	2511.13168	null
2025-11-17	An inverse design method for generalized zero-étendue sources and two targets	Pieter Braam et.al.	2511.13165	null
2025-11-17	Observational properties of a Schwarzschild black hole surrounded by a Dehnen-type dark matter halo	Zhi Li et.al.	2511.13156	null
2025-11-17	Skeletons Speak Louder than Text: A Motion-Aware Pretraining Paradigm for Video-Based Person Re-Identification	Rifen Lin et.al.	2511.13150	null
2025-11-17	Automated Road Distress Detection Using Vision Transformersand Generative Adversarial Networks	Cesar Portocarrero Rodriguez et.al.	2511.13145	null
2025-11-17	Bridging the genotype-phenotype gap with generative artificial intelligence	Yangfan Liu et.al.	2511.13141	null
2025-11-17	Think with Self-Decoupling and Self-Verification: Automated RTL Design with Backtrack-ToT	Zhiteng Chao et.al.	2511.13139	null
2025-11-17	Conditional Diffusion Model for Multi-Agent Dynamic Task Decomposition	Yanda Zhu et.al.	2511.13137	null
2025-11-17	Topological phase transitions by time-dependent electromagnetic fields in frustrated magnets: Role of dynamical and static magnetic fields	Tatsuya Shirato et.al.	2511.13136	null
2025-11-17	MedGEN-Bench: Contextually entangled benchmark for open-ended multimodal medical generation	Junjie Yang et.al.	2511.13135	null
2025-11-17	MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications	Gagan Raj Gupta et.al.	2511.13131	null
2025-11-17	VEIL: Jailbreaking Text-to-Video Models via Visual Exploitation from Implicit Language	Zonghao Ying et.al.	2511.13127	null
2025-11-17	Departures: Distributional Transport for Single-Cell Perturbation Prediction with Neural Schrödinger Bridges	Changxi Chi et.al.	2511.13124	null
2025-11-17	CloseUpShot: Close-up Novel View Synthesis from Sparse-views via Point-conditioned Diffusion Model	Yuqi Zhang et.al.	2511.13121	null
2025-11-17	Synthetic Forgetting without Access: A Few-shot Zero-glance Framework for Machine Unlearning	Qipeng Song et.al.	2511.13116	null
2025-11-17	Orthogonal Attosecond Control of Solid-State Harmonics by Optical Waveforms and Quantum Geometry Engineering	Zhenjiang Zhao et.al.	2511.13114	null
2025-11-17	Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining	Zhaocheng Yu et.al.	2511.13113	null
2025-11-17	DGS-Net: Distillation-Guided Gradient Surgery for CLIP Fine-Tuning in AI-Generated Image Detection	Jiazhen Yan et.al.	2511.13108	null
2025-11-17	Low-Level Dataset Distillation for Medical Image Enhancement	Fengzhi Xu et.al.	2511.13106	null
2025-11-17	Transformer-Based Scalable Multi-Agent Reinforcement Learning for Networked Systems with Long-Range Interactions	Vidur Sinha et.al.	2511.13103	null
2025-11-14	LARM: A Large Articulated-Object Reconstruction Model	Sylvia Yuan et.al.	2511.11563	null
2025-11-14	Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping	Dena Mujtaba et.al.	2511.11551	null
2025-11-14	STEM EBIC as a Quantitative Probe of Semiconductor Devices	Sebastian Schneider et.al.	2511.11528	null
2025-11-14	Bridging Hidden States in Vision-Language Models	Benjamin Fein-Ashley et.al.	2511.11526	null
2025-11-14	CVChess: A Deep Learning Framework for Converting Chessboard Images to Forsyth-Edwards Notation	Luthira Abeykoon et.al.	2511.11522	null
2025-11-14	Scalable Policy Evaluation with Video World Models	Wei-Cheng Tseng et.al.	2511.11520	null
2025-11-14	Experience-Guided Adaptation of Inference-Time Reasoning Strategies	Adam Stein et.al.	2511.11519	null
2025-11-14	W2S-AlignTree: Weak-to-Strong Inference-Time Alignment for Large Language Models via Monte Carlo Tree Search	Zhenyu Ding et.al.	2511.11518	null
2025-11-14	Scalable Coverage Trajectory Synthesis on GPUs as Statistical Inference	Max M. Sun et.al.	2511.11514	null
2025-11-14	SynthSoM-Twin: A Multi-Modal Sensing-Communication Digital-Twin Dataset for Sim2Real Transfer via Synesthesia of Machines	Junlong Chen et.al.	2511.11503	null
2025-11-14	PAS : Prelim Attention Score for Detecting Object Hallucinations in Large Vision–Language Models	Nhat Hoang-Xuan et.al.	2511.11502	null
2025-11-14	Visible and Terahertz Nonlinear Responses in the Topological Noble Metal Dichalcogenide PdTe2	George J. de Coster et.al.	2511.11493	null
2025-11-14	Intrinsic Dimension Estimation for Radio Galaxy Zoo using Diffusion Models	Joan Font-Quer Roset et.al.	2511.11490	null
2025-11-14	Data-efficient U-Net for Segmentation of Carbide Microstructures in SEM Images of Steel Alloys	Alinda Ezgi Gerçek et.al.	2511.11485	null
2025-11-14	ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation	Kaishen Wang et.al.	2511.11483	null
2025-11-14	Inferring response times of perceptual decisions with Poisson variational autoencoders	Hayden R. Johnson et.al.	2511.11480	null
2025-11-14	Planetary nebulae as tracers of stellar population properties: a pilot study with MUSE	Ana Inés Ennis et.al.	2511.11479	null
2025-11-14	Sat2RealCity: Geometry-Aware and Appearance-Controllable 3D Urban Generation from Satellite Imagery	Yijie Kang et.al.	2511.11470	null
2025-11-14	Enabling Wireless Power Transfer (WPT) in Pinching Antenna Systems (PASS)	Deqiao Gan et.al.	2511.11465	null
2025-11-14	Rethinking Efficient Mixture-of-Experts for Remote Sensing Modality-Missing Classification	Qinghao Gao et.al.	2511.11460	null
2025-11-14	SimTac: A Physics-Based Simulator for Vision-Based Tactile Sensing with Biomorphic Structures	Xuyang Zhang et.al.	2511.11456	null
2025-11-14	VoxTell: Free-Text Promptable Universal 3D Medical Image Segmentation	Maximilian Rokuss et.al.	2511.11450	null
2025-11-14	DiffPro: Joint Timestep and Layer-Wise Precision Optimization for Efficient Diffusion Inference	Farhana Amin et.al.	2511.11446	null
2025-11-14	Influence of Rotation on Fingering Convection in Planetary Cores	Martin Gray et.al.	2511.11442	null
2025-11-14	From Synthetic Scenes to Real Performance: Enhancing Spatial Reasoning in VLMs	Massimo Rizzoli et.al.	2511.11440	null
2025-11-14	Hi-DREAM: Brain Inspired Hierarchical Diffusion for fMRI Reconstruction via ROI Encoder and visuAl Mapping	Guowei Zhang et.al.	2511.11437	null
2025-11-14	The Persistence of Cultural Memory: Investigating Multimodal Iconicity in Diffusion Models	Maria-Teresa De Rosa Palmini et.al.	2511.11435	null
2025-11-14	WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation	Wei Chow et.al.	2511.11434	null
2025-11-14	Photon correlation Fourier spectroscopy of a B center in hBN	Aymeric Delteil et.al.	2511.11428	null
2025-11-14	Comprehension of Multilingual Expressions Referring to Target Objects in Visual Inputs	Francisco Nogueira et.al.	2511.11427	null
2025-11-14	Shrinking the Teacher: An Adaptive Teaching Paradigm for Asymmetric EEG-Vision Alignment	Lukun Wu et.al.	2511.11422	null
2025-11-14	MicroVQA++: High-Quality Microscopy Reasoning Dataset with Weakly Supervised Graphs for Multimodal Large Language Model	Manyu Li et.al.	2511.11407	null
2025-11-14	Disentangling Emotional Bases and Transient Fluctuations: A Low-Rank Sparse Decomposition Approach for Video Affective Analysis	Feng-Qi Cui et.al.	2511.11406	null
2025-11-14	Bidimensional measurements of photon statistics within a multimodal temporal framework	C. Hainaut et.al.	2511.11403	null
2025-11-14	Multi-Phase Spacecraft Trajectory Optimization via Transformer-Based Reinforcement Learning	Amit Jain et.al.	2511.11402	null
2025-11-14	GRANITE: High-Resolution Imaging and Electrical Qualification of Large-Area TPC Electrodes	Shumit A. Mitra et.al.	2511.11401	null
2025-11-14	GRANITE: Mechanical Characterization and Optical Inspection of Large-Area TPC Electrodes	Alexander Deisting et.al.	2511.11400	null
2025-11-14	Universal Safety Controllers with Learned Prophecies	Bernd Finkbeiner et.al.	2511.11390	null
2025-11-14	Robust inverse material design with physical guarantees using the Voigt-Reuss Net	Sanath Keshav et.al.	2511.11388	null
2025-11-14	Optimal Dividend, Reinsurance and Capital Injection Strategies for Collaborating Business Lines: The Case of Excess-of-Loss Reinsurance	Tim J. Boonen et.al.	2511.11383	null
2025-11-14	Interlinking helical spin textures in nanopatterned chiral magnets	Luke Alexander Turnbull et.al.	2511.11372	null
2025-11-14	Global attractor for a Cahn-Hilliard-chemotaxis model with logistic degradation	Giulio Schimperna et.al.	2511.11363	null
2025-11-14	YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation	Pavel Rojtberg et.al.	2511.11344	null
2025-11-14	Close-in compact super-Earth systems emerging from resonant chains: slow destabilization by unseen remnants of formation	Max Goldberg et.al.	2511.11329	null
2025-11-14	Mathematical and numerical methods for accurate aorta segmentation from non-enhanced CT Data yielding reliable identification and evaluation of large vessel vasculitis	Konan A. Allaly et.al.	2511.11303	null
2025-11-14	SimuFreeMark: A Noise-Simulation-Free Robust Watermarking Against Image Editing	Yichao Tang et.al.	2511.11295	null
2025-11-14	RTGaze: Real-Time 3D-Aware Gaze Redirection from a Single Image	Hengfei Wang et.al.	2511.11289	null
2025-11-14	D-GAP: Improving Out-of-Domain Robustness via Dataset-Agnostic and Gradient-Guided Augmentation in Amplitude and Pixel Spaces	Ruoqi Wang et.al.	2511.11286	null
2025-11-14	Discovering Meaningful Units with Visually Grounded Semantics from Image Captions	Melika Behjati et.al.	2511.11262	null
2025-11-14	CountSteer: Steering Attention for Object Counting in Diffusion Models	Hyemin Boo et.al.	2511.11253	null
2025-11-14	Evidence for the Keplerian orbit of a close companion around a giant star	Mats Esseldeurs et.al.	2511.11247	null
2025-11-14	Arcee: Differentiable Recurrent State Chain for Generative Vision Modeling with Mamba SSMs	Jitesh Chavan et.al.	2511.11243	null
2025-11-14	Controlled accelerations for Rayleigh-Taylor instability	J. T. Horne-Jones et.al.	2511.11241	null
2025-11-14	Beyond Flatlands: Unlocking Spatial Intelligence by Decoupling 3D Reasoning from Numerical Regression	Zhongbin Guo et.al.	2511.11239	null
2025-11-14	Parameter-Efficient MoE LoRA for Few-Shot Multi-Style Editing	Cong Cao et.al.	2511.11236	null
2025-11-14	DoReMi: A Domain-Representation Mixture Framework for Generalizable 3D Understanding	Mingwei Xing et.al.	2511.11232	null
2025-11-14	3D Gaussian and Diffusion-Based Gaze Redirection	Abiram Panchalingam et.al.	2511.11231	null
2025-11-14	3D Stokes polarimetric imaging at nanoscales	Isael Herrera et.al.	2511.11222	null
2025-11-14	Humanoid Whole-Body Badminton via Multi-Stage Reinforcement Learning	Chenhao Liu et.al.	2511.11218	null
2025-11-14	Positional Bias in Multimodal Embedding Models: Do They Favor the Beginning, the Middle, or the End?	Kebin Wu et.al.	2511.11216	null
2025-11-14	RealisticDreamer: Guidance Score Distillation for Few-shot Gaussian Splatting	Ruocheng Wu et.al.	2511.11213	null
2025-11-14	MAFM^3: Modular Adaptation of Foundation Models for Multi-Modal Medical AI	Mohammad Areeb Qazi et.al.	2511.11212	null
2025-11-14	Inverse modeling of porous flow through deep neural networks: the case of coffee percolation	Antoniorenee Barletta et.al.	2511.11194	null
2025-11-14	Integrating Aggregated Electric Vehicle Flexibilities in Unit Commitment Models using Submodular Optimization	Hélène Arvis et.al.	2511.11191	null
2025-11-14	ReTrace: Interactive Visualizations for Reasoning Traces of Large Reasoning Models	Ludwig Felder et.al.	2511.11187	null
2025-11-14	Numerical Discretization Schemes that Preserve Flatness	Ashutosh Jindal et.al.	2511.11183	null
2025-11-14	Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation	Quoc-Huy Trinh et.al.	2511.11177	null
2025-11-14	Reverberation: Learning the Latencies Before Forecasting Trajectories	Conghao Wong et.al.	2511.11164	null
2025-11-14	OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation	Zhanpeng Wang et.al.	2511.11162	null
2025-11-14	Drift Estimation for Diffusion Processes Using Neural Networks Based on Discretely Observed Independent Paths	Yuzhen Zhao et.al.	2511.11161	null
2025-11-14	Adaptive Symmetrization of the KL Divergence	Omri Ben-Dov et.al.	2511.11159	null
2025-11-14	Galactic foreground residual biases in CMB lensing convergence reconstruction and delensing of B-mode maps	Kishan Deka et.al.	2511.11147	null
2025-11-14	PRSM: A Measure to Evaluate CLIP’s Robustness Against Paraphrases	Udo Schlegel et.al.	2511.11141	null
2025-11-14	Generalizing Lattice Structures to Hypergraphs: Spectra of Clique and Hyperedge-based Laplacians	Eleonora Andreotti et.al.	2511.11138	null
2025-11-14	GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models	Jingxuan Wei et.al.	2511.11134	null
2025-11-14	Impact of Brain Anisotropy on Transcranial Temporal Interference Stimulation: Numerical Analysis Toward Reliable Montage Optimization	Kanata Yatsuda et.al.	2511.11129	null
2025-11-14	Enhancing Meme Emotion Understanding with Multi-Level Modality Enhancement and Dual-Stage Modal Fusion	Yi Shi et.al.	2511.11126	null
2025-11-14	Non-Convex Global Optimization as an Optimal Stabilization Problem: Convergence Rates	Yuyang Huang et.al.	2511.11122	null
2025-11-14	Non-Gaussianity-induced enhanced target-finding dynamics of confined colloids	Guirec de Tournemire et.al.	2511.11117	null
2025-11-14	Toward Generalized Detection of Synthetic Media: Limitations, Challenges, and the Path to Multimodal Solutions	Redwan Hussain et.al.	2511.11116	null
2025-11-14	VIDEOP2R: Video Understanding from Perception to Reasoning	Yifan Jiang et.al.	2511.11113	null
2025-11-14	Ligand Engineering for Precise Control of Strongly-Confined CsPbI3 Nanoplatelet Superlattices for Efficient Light-Emitting Diodes	Jongbeom Kim et.al.	2511.11107	null
2025-11-14	Ergodic properties of occupation times in heterogeneous media	Vicenç Méndez et.al.	2511.11099	null
2025-11-14	On the accuracy of the model predictive control method	Georgi Angelov et.al.	2511.11098	null
2025-11-14	Machine-Learning Based Detection of Coronary Artery Calcification Using Synthetic Chest X-Rays	Dylan Saeed et.al.	2511.11093	null
2025-11-14	Sheaf Cohomology of Linear Predictive Coding Networks	Jeffrey Seely et.al.	2511.11092	null
2025-11-14	Numerical approximation of Caputo-type advection-diffusion equations in one and multiple spatial dimensions via shifted Chebyshev polynomials	Francisco de la Hoz et.al.	2511.11082	null
2025-11-14	ARCTraj: A Dataset and Benchmark of Human Reasoning Trajectories for Abstract Problem Solving	Sejin Kim et.al.	2511.11079	null
2025-11-14	Phys-Liquid: A Physics-Informed Dataset for Estimating 3D Geometry and Volume of Transparent Deformable Liquids	Ke Ma et.al.	2511.11077	null
2025-11-14	Explosion and implosion of birth-and-death continuous-time random walks	Andrey Pilipenko et.al.	2511.11076	null
2025-11-14	Evaluating Latent Generative Paradigms for High-Fidelity 3D Shape Completion from a Single Depth Image	Matthias Humt et.al.	2511.11074	null
2025-11-14	Optimising Density Computations in Probabilistic Programs via Automatic Loop Vectorisation	Sangho Lim et.al.	2511.11070	null
2025-11-14	S2D-ALIGN: Shallow-to-Deep Auxiliary Learning for Anatomically-Grounded Radiology Report Generation	Jiechao Gao et.al.	2511.11066	null
2025-11-14	From Retinal Pixels to Patients: Evolution of Deep Learning Research in Diabetic Retinopathy Screening	Muskaan Chopra et.al.	2511.11065	null
2025-11-14	LiteAttention: A Temporal Sparse Attention for Diffusion Transformers	Dor Shmilovich et.al.	2511.11062	null
2025-11-14	CareCom: Generative Image Composition with Calibrated Reference Features	Jiaxuan Chen et.al.	2511.11060	null
2025-11-14	Modeling and Control of Sustainable Transitions through Opinion-Behavior Coupling in Heterogeneous Networks	Martina Alutto et.al.	2511.11053	null
2025-11-14	AdaptPNP: Integrating Prehensile and Non-Prehensile Skills for Adaptive Robotic Manipulation	Jinxuan Zhu et.al.	2511.11052	null
2025-11-14	NP-LoRA: Null Space Projection Unifies Subject and Style in LoRA Fusion	Chuheng Chen et.al.	2511.11051	null
2025-11-14	CrossMed: A Multimodal Cross-Task Benchmark for Compositional Generalization in Medical Imaging	Pooja Singh et.al.	2511.11034	null
2025-11-13	Ordinary lattice defects as probes of topology	Aiden J. Mains et.al.	2511.10646	null
2025-11-13	One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models	Aleksandr Razin et.al.	2511.10629	null
2025-11-13	Global Solutions to Non-Convex Functional Constrained Problems with Hidden Convexity	Ilyas Fatkhullin et.al.	2511.10626	null
2025-11-13	Verification of Sequential Convex Programming for Parametric Non-convex Optimization	Rajiv Sambharya et.al.	2511.10622	null
2025-11-13	Towards Blind and Low-Vision Accessibility of Lightweight VLMs and Custom LLM-Evals	Shruti Singh Baghel et.al.	2511.10615	null
2025-11-13	Mined Prompting and Metadata-Guided Generation for Wound Care Visual Question Answering	Bavana Durgapraveen et.al.	2511.10591	null
2025-11-13	Safe Planning in Interactive Environments via Iterative Policy Updates and Adversarially Robust Conformal Prediction	Omid Mirzaeedodangeh et.al.	2511.10586	null
2025-11-13	Central Quasi-Morphicity, Central Morphicity, and Strongly $π$ -Regularity	Theophilus Gera et.al.	2511.10569	null
2025-11-13	A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space	Huijie Liu et.al.	2511.10555	null
2025-11-13	Bowditch representations in Gromov-hyperbolic spaces : characterizations, dynamics of $\mathrm{Out}(\mathbb{F}_2)$ and recognition	Suzanne Schlich et.al.	2511.10551	null
2025-11-13	Benchmarking Diversity in Image Generation via Attribute-Conditional Human Evaluation	Isabela Albuquerque et.al.	2511.10547	null
2025-11-13	Eigenvalues of Brownian Motions on $\mathrm{GL}(N,\mathbb{C})$	Tatiana Brailovskaya et.al.	2511.10535	null
2025-11-13	Self-similar scaling of variable-density Rayleigh-Taylor turbulence	Chian Yeh Goh et.al.	2511.10512	null
2025-11-13	Parallel and GPU accelerated code for phase-field and reaction-diffusion simulations	Steven A. Silber et.al.	2511.10508	null
2025-11-13	Panda: Test-Time Adaptation with Negative Data Augmentation	Ruxi Deng et.al.	2511.10481	null
2025-11-13	OpenSR-SRGAN: A Flexible Super-Resolution Framework for Multispectral Earth Observation Data	Simon Donike et.al.	2511.10461	null
2025-11-13	Wafer-scale conformal metasurface optics	Louis Martin-Monier et.al.	2511.10447	null
2025-11-13	Continuum Dropout for Neural Differential Equations	Jonghun Lee et.al.	2511.10446	null
2025-11-13	Extending the Frontier of Spatially-Resolved Supermassive Black Hole Mass Measurements to at $1\lesssim z\lesssim2$ : Simulations with ELT/MICADO High-Resolution Mass Models and HARMONI Integral-Field Stellar Kinematics	Dieu D. Nguyen et.al.	2511.10427	null
2025-11-13	Enhanced Privacy Leakage from Noise-Perturbed Gradients via Gradient-Guided Conditional Diffusion Models	Jiayang Meng et.al.	2511.10423	null
2025-11-13	Estimating the true number of principal components under the random design	Yasuyuki Matsumura et.al.	2511.10419	null
2025-11-13	LongComp: Long-Tail Compositional Zero-Shot Generalization for Robust Trajectory Prediction	Benjamin Stoler et.al.	2511.10411	null
2025-11-13	Diffusion annealed Langevin dynamics: a theoretical study	Patrick Cattiaux et.al.	2511.10406	null
2025-11-13	nuPlan-R: A Closed-Loop Planning Benchmark for Autonomous Driving via Reactive Multi-Agent Simulation	Mingxing Peng et.al.	2511.10403	null
2025-11-13	Stability Analysis of a Nonlinear Distributed Control Framework for Current Sharing and Voltage Containment in DC Microgrids: The Fast Communication Scenario	Cornelia Skaga et.al.	2511.10401	null
2025-11-13	The Configuration Wall: Characterization and Elimination of Accelerator Configuration Overhead	Josse Van Delm et.al.	2511.10397	null
2025-11-13	GrounDiff: Diffusion-Based Ground Surface Generation from Digital Surface Models	Oussema Dhaouadi et.al.	2511.10391	null
2025-11-13	MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns	Jiarui Zhang et.al.	2511.10390	null
2025-11-13	Simulating Misinformation Propagation in Social Networks using Large Language Models	Raj Gaurav Maurya et.al.	2511.10384	null
2025-11-13	Operator Models for Continuous-Time Offline Reinforcement Learning	Nicolas Hoischen et.al.	2511.10383	null
2025-11-13	Fragile by Design: On the Limits of Adversarial Defenses in Personalized Generation	Zhen Chen et.al.	2511.10382	null
2025-11-13	MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation	Xun Huang et.al.	2511.10376	null
2025-11-13	Sub-diffusive Black-Scholes model and Girsanov transform for sub-diffusions	Shuaiqi Zhang et.al.	2511.10371	null
2025-11-13	DermAI: Clinical dermatology acquisition through quality-driven image collection for AI classification in mobile	Thales Bezerra et.al.	2511.10367	null
2025-11-13	Equivariant Denoisers for Plug and Play Image Restoration	Marien Renaud et.al.	2511.10340	null
2025-11-13	BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages	Guduru Manoj et.al.	2511.10338	null
2025-11-13	TMDC: A Two-Stage Modality Denoising and Complementation Framework for Multimodal Sentiment Analysis with Missing and Noisy Modalities	Yan Zhuang et.al.	2511.10325	null
2025-11-13	Optomechanical Cooling without Residual Heating	Surangana Sengupta et.al.	2511.10318	null
2025-11-13	Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision	Yu Deng et.al.	2511.10316	null
2025-11-13	CLIP4VI-ReID: Learning Modality-shared Representations via CLIP Semantic Bridge for Visible-Infrared Person Re-identification	Xiaomei Yang et.al.	2511.10309	null
2025-11-13	Causal Model-Based Reinforcement Learning for Sample-Efficient IoT Channel Access	Aswin Arun et.al.	2511.10291	null
2025-11-13	OutSafe-Bench: A Benchmark for Multimodal Offensive Content Detection in Large Language Models	Yuping Yan et.al.	2511.10287	null
2025-11-13	Causal-HalBench: Uncovering LVLMs Object Hallucinations Through Causal Intervention	Zhe Xu et.al.	2511.10268	null
2025-11-13	Ancilla-Free Fast-Forwarding Lindbladian Simulation Algorithms by Hamiltonian Twirling	Minbo Gao et.al.	2511.10253	null
2025-11-13	P4-TAS: P4-Based Time-Aware Shaper for Time-Sensitive Networking	Fabian Ihle et.al.	2511.10249	null
2025-11-13	Systematic dispersion engineering of crystalline microresonators for broadband and coherent frequency comb generation	Liu Yang et.al.	2511.10247	null
2025-11-13	Robustness and Imperceptibility Analysis of Hybrid Spatial-Frequency Domain Image Watermarking	Rizal Khoirul Anam et.al.	2511.10245	null
2025-11-13	TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding	Jinxuan Li et.al.	2511.10241	null
2025-11-13	MTP: Exploring Multimodal Urban Traffic Profiling with Modality Augmentation and Spectrum Fusion	Haolong Xiang et.al.	2511.10218	null
2025-11-13	Out-of-Context Misinformation Detection via Variational Domain-Invariant Learning with Test-Time Training	Xi Yang et.al.	2511.10213	null
2025-11-13	Next-Frame Feature Prediction for Multimodal Deepfake Detection and Temporal Localization	Ashutosh Anshul et.al.	2511.10212	null
2025-11-13	LiNeXt: Revisiting LiDAR Completion with Efficient Non-Diffusion Architectures	Wenzhe He et.al.	2511.10209	null
2025-11-13	Fractional neural attention for efficient multiscale sequence processing	Cheng Kevin Qu et.al.	2511.10208	null
2025-11-13	Kinetic Theory with Fluctuations: Strong Well-Posedness of the Vlasov-Fokker-Planck-Dean-Kawasaki System	Zimo Hao et.al.	2511.10194	null
2025-11-13	M3Scope a 3D multimode multiplane microscope for imaging nanoscale dynamics in soft matter	Steven Huysecom et.al.	2511.10174	null
2025-11-13	GEA: Generation-Enhanced Alignment for Text-to-Image Person Retrieval	Hao Zou et.al.	2511.10154	null
2025-11-13	Right Looks, Wrong Reasons: Compositional Fidelity in Text-to-Image Generation	Mayank Vatsa et.al.	2511.10136	null
2025-11-13	Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction	Mingda Jia et.al.	2511.10134	null
2025-11-13	RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation	Qinfeng Li et.al.	2511.10128	null
2025-11-13	Explicit pulsating fronts and minimal speeds in periodic Fisher-KPP equations	Lionel Roques et.al.	2511.10104	null
2025-11-13	Balancing Centralized Learning and Distributed Self-Organization: A Hybrid Model for Embodied Morphogenesis	Takehiro Ishikawa et.al.	2511.10101	null
2025-11-13	MTAttack: Multi-Target Backdoor Attacks against Large Vision-Language Models	Zihan Wang et.al.	2511.10098	null
2025-11-13	SUGAR: Learning Skeleton Representation with Visual-Motion Knowledge for Action Recognition	Qilang Ye et.al.	2511.10091	null
2025-11-13	T2IBias: Uncovering Societal Bias Encoded in the Latent Space of Text-to-Image Generative Models	Abu Sufian et.al.	2511.10089	null
2025-11-13	eXIAA: eXplainable Injections for Adversarial Attack	Leonardo Pesce et.al.	2511.10088	null
2025-11-13	Opinion: Towards Unified Expressive Policy Optimization for Robust Robot Learning	Haidong Huang et.al.	2511.10087	null
2025-11-13	Physics-informed Machine Learning for Static Friction Modeling in Robotic Manipulators Based on Kolmogorov-Arnold Networks	Yizheng Wang et.al.	2511.10079	null
2025-11-13	Mitigating Error Accumulation in Co-Speech Motion Generation via Global Rotation Diffusion and Multi-Level Constraints	Xiangyue Zhang et.al.	2511.10076	null
2025-11-13	VLF-MSC: Vision-Language Feature-Based Multimodal Semantic Communication System	Gwangyeon Ahn et.al.	2511.10074	null
2025-11-13	Perceive, Act and Correct: Confidence Is Not Enough for Hyperspectral Classification	Muzhou Yang et.al.	2511.10068	null
2025-11-13	When Eyes and Ears Disagree: Can MLLMs Discern Audio-Visual Confusion?	Qilang Ye et.al.	2511.10059	null
2025-11-13	An inexact semismooth Newton-Krylov method for semilinear elliptic optimal control problem	Shiqi Chen et.al.	2511.10058	null
2025-11-13	Image Aesthetic Reasoning via HCM-GRPO: Empowering Compact Model for Superior Performance	Zhiyuan Hu et.al.	2511.10055	null
2025-11-13	DGFusion: Dual-guided Fusion for Robust Multi-Modal 3D Object Detection	Feiyang Jia et.al.	2511.10035	null
2025-11-13	Locally uniform ellipticity of the fractional Hessian operators	Ziyu Gan et.al.	2511.10034	null
2025-11-13	GraphSB: Boosting Imbalanced Node Classification on Graphs through Structural Balance	Chaofan Zhu et.al.	2511.10022	null
2025-11-13	Anomagic: Crossmodal Prompt-driven Zero-shot Anomaly Generation	Yuxin Jiang et.al.	2511.10020	null
2025-11-13	AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models	Xinyi Wang et.al.	2511.10017	null
2025-11-13	MIRNet: Integrating Constrained Graph-Based Reasoning with Pre-training for Diagnostic Medical Imaging	Shufeng Kong et.al.	2511.10013	null
2025-11-13	Reinforcing Trustworthiness in Multimodal Emotional Support Systems	Huy M. Le et.al.	2511.10011	null
2025-11-13	DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Instance Segmentation	Xuexun Liu et.al.	2511.10003	null
2025-11-13	MOBA: A Material-Oriented Backdoor Attack against LiDAR-based 3D Object Detection Systems	Saket S. Chaturvedi et.al.	2511.09999	null
2025-11-13	Language Drift in Multilingual Retrieval-Augmented Generation: Characterization and Decoding-Time Mitigation	Bo Li et.al.	2511.09984	null
2025-11-13	STELLAR: Scene Text Editor for Low-Resource Languages and Real-World Data	Yongdeuk Seo et.al.	2511.09977	null
2025-11-13	Difference Vector Equalization for Robust Fine-tuning of Vision-Language Models	Satoshi Suzuki et.al.	2511.09973	null
2025-11-13	NumPert: Numerical Perturbations to Probe Language Models for Veracity Prediction	Peter Røysland Aarnes et.al.	2511.09971	null
2025-11-13	Equivariant Sampling for Improving Diffusion Model-based Image Restoration	Chenxu Wu et.al.	2511.09965	null
2025-11-13	EnvTrace: Simulation-Based Semantic Evaluation of LLM Code via Execution Trace Alignment – Demonstrated at Synchrotron Beamlines	Noah van der Vleuten et.al.	2511.09964	null
2025-11-13	AI-Integrated Decision Support System for Real-Time Market Growth Forecasting and Multi-Source Content Diffusion Analytics	Ziqing Yin et.al.	2511.09962	null
2025-11-13	Robust Object Detection with Pseudo Labels from VLMs using Per-Object Co-teaching	Uday Bhaskar et.al.	2511.09955	null
2025-11-13	New ASKAP radio-continuum surveys of the Small Magellanic Cloud	O. K. Khattab et.al.	2511.09954	null
2025-11-13	Learning phase diversity for solving ill-posed inverse problems in imaging	Jasleen Birdi et.al.	2511.09952	null
2025-11-13	$π$ -PIC: a framework for modular particle-in-cell developments and simulations	Frida Brogren et.al.	2511.09950	null
2025-11-13	GPDM: Generation-Prior Diffusion Model for Accelerated Direct Attenuation and Scatter Correction of Whole-body 18F-FDG PET	Min Jeong Cho et.al.	2511.09941	null
2025-11-13	Provably Efficient Quantum Algorithms for Solving Nonlinear Differential Equations Using Multiple Bosonic Modes Coupled with Qubits	Yu Gan et.al.	2511.09939	null
2025-11-13	Debiased Dual-Invariant Defense for Adversarially Robust Person Re-Identification	Yuhang Zhou et.al.	2511.09933	null
2025-11-13	Martingale dimensions for a class of metric measure spaces	Masanori Hino et.al.	2511.09930	null
2025-11-13	Quantum Phase Gradient Imaging Using a Nonlocal Metasurface System	Jinliang Ren et.al.	2511.09922	null
2025-11-13	MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding	Ketong Chen et.al.	2511.09919	null
2025-11-13	HI-TransPA: Hearing Impairments Translation Personal Assistant	Zhiming Ma et.al.	2511.09915	null
2025-11-07	DGTN: Graph-Enhanced Transformer with Diffusive Attention Gating Mechanism for Enzyme DDG Prediction	Abigail Lin et.al.	2511.05483	null
2025-11-07	On Flow Matching KL Divergence	Maojiang Su et.al.	2511.05480	null
2025-11-07	Coarse-graining nonequilibrium diffusions with Markov chains	Ramón Nartallo-Kaluarachchi et.al.	2511.05366	null
2025-11-07	Diffusion-Based Electromagnetic Inverse Design of Scattering Structured Media	Mikhail Tsukerman et.al.	2511.05357	null
2025-11-07	Antisolvent-Assisted Growth of Centimeter-Scale CsPbBr $_3$ Perovskite Single Crystals: A Theory-Guided Approach	I. O. Simonenko et.al.	2511.05354	null
2025-11-07	Perceptually Aligning Representations of Music via Noise-Augmented Autoencoders	Mathias Rose Bjare et.al.	2511.05350	null
2025-11-07	A time-fractional Fisher-KPP equation for tumor growth: Analysis and numerical simulation	Marvin Fritz et.al.	2511.05312	null
2025-11-07	Rethinking Metrics and Diffusion Architecture for 3D Point Cloud Generation	Matteo Bastico et.al.	2511.05308	null
2025-11-07	On partial diffusion and mixing without hypoellipticity	Xu’an Dou et.al.	2511.05280	null
2025-11-07	Integrating Score-Based Diffusion Models with Machine Learning-Enhanced Localization for Advanced Data Assimilation in Geological Carbon Storage	Gabriel Serrão Seabra et.al.	2511.05266	null
2025-11-07	The Causal Round Trip: Generating Authentic Counterfactuals by Eliminating Information Loss	Rui Wu et.al.	2511.05236	null
2025-11-07	Multitime fields and hard rod scaling limits	Pablo A. Ferrari et.al.	2511.05230	null
2025-11-07	FreeControl: Efficient, Training-Free Structural Control via One-Step Attention Extraction	Jiang Lin et.al.	2511.05219	null
2025-11-07	Associative Poisoning to Generative Machine Learning	Mathias Lundteigen Mohus et.al.	2511.05177	null
2025-11-07	Implicit reconstruction from point cloud: an adaptive level-set-based semi-Lagrangian method	Silvia Preda et.al.	2511.05145	null
2025-11-07	A Dual-stage Prompt-driven Privacy-preserving Paradigm for Person Re-Identification	Ruolin Li et.al.	2511.05092	null
2025-11-07	kV-Class Lateral NiOx/GaN Super-Heterojunction Diode via Ammonia Molecular Beam Epitaxy (NH3-MBE)	Yizheng Liu et.al.	2511.05060	null
2025-11-07	Pressure2Motion: Hierarchical Motion Synthesis from Ground Pressure with Text Guidance	Zhengxuan Li et.al.	2511.05038	null
2025-11-07	Inverse problem of determining a time-dependent coefficient in the time-fractional subdiffusion equation	Ravshan Ashurov et.al.	2511.05011	null
2025-11-07	MoE-DP: An MoE-Enhanced Diffusion Policy for Robust Long-Horizon Robotic Manipulation with Skill Decomposition and Failure Recovery	Baiye Cheng et.al.	2511.05007	null
2025-11-07	Multi-agent Coordination via Flow Matching	Dongsu Lee et.al.	2511.05005	null
2025-11-07	Peptide2Mol: A Diffusion Model for Generating Small Molecules as Peptide Mimics for Targeted Protein Binding	Xinheng He et.al.	2511.04984	null
2025-11-07	Less Is More: Generating Time Series with LLaMA-Style Autoregression in Simple Factorized Latent Spaces	Siyuan Li et.al.	2511.04973	null
2025-11-07	Pattern-Aware Diffusion Synthesis of fMRI/dMRI with Tissue and Microstructural Refinement	Xiongri Shen et.al.	2511.04963	null
2025-11-07	Nuclear Ptychoscopy: A Ptychographic Framework for Nuclear Spectroscopy	Ziyang Yuan et.al.	2511.04924	null
2025-11-07	Three-dimensional imaging of threading dislocations in GaN by multimodal stimulated Raman scattering microscopy	Shun Takahashi et.al.	2511.04915	null
2025-11-07	Representation formula, regularity, and decay of solutions for sub-diffusion equations	Sandro Coriasco et.al.	2511.04885	null
2025-11-06	Clinical-ComBAT: a diffusion-weighted MRI harmonization method for clinical applications	Gabriel Girard et.al.	2511.04871	null
2025-11-06	SigmaDock: Untwisting Molecular Docking With Fragment-Based SE(3) Diffusion	Alvaro Prat et.al.	2511.04854	null
2025-11-06	Persuading Stable Matching	Jonathan Shaki et.al.	2511.04846	null
2025-11-06	Sublinear iterations can suffice even for DDPMs	Matthew S. Zhang et.al.	2511.04844	null
2025-11-06	Prompt-Based Safety Guidance Is Ineffective for Unlearned Text-to-Image Diffusion Models	Jiwoo Shin et.al.	2511.04834	null
2025-11-06	The strongly nonlocal Allen-Cahn problem	Erisa Hasani et.al.	2511.04818	null
2025-11-06	Unified Multimodal Diffusion Forcing for Forceful Manipulation	Zixuan Huang et.al.	2511.04812	null
2025-11-06	Asymptotic stability proof and port-Hamiltonian physics-informed neural network approach to chaotic synchronization in Hindmarsh-Rose neurons	Behnam Babaeian et.al.	2511.04809	null
2025-11-06	Quantifying the Climate Risk of Generative AI: Region-Aware Carbon Accounting with G-TRACE and the AI Sustainability Pyramid	Zahida Kausar et.al.	2511.04776	null
2025-11-06	ReGen: Generative Robot Simulation via Inverse Design	Phat Nguyen et.al.	2511.04769	null
2025-11-06	CPO: Condition Preference Optimization for Controllable Image Generation	Zonglin Lyu et.al.	2511.04753	null
2025-11-06	The essential elements of dust evolution: a-C(:H) nanoparticle sub-structures and photo-fragmentation	A. P. Jones et.al.	2511.04750	null
2025-11-06	InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation	Jinlai Liu et.al.	2511.04675	null
2025-11-06	X-Diffusion: Training Diffusion Policies on Cross-Embodiment Human Demonstrations	Maximus A. Pace et.al.	2511.04671	null
2025-11-06	Intermittency in Collisionless Large-Amplitude Turbulence	Ryan Golant et.al.	2511.04663	null
2025-11-06	Nowcast3D: Reliable precipitation nowcasting via gray-box learning	Huaguan Chen et.al.	2511.04659	null
2025-11-06	Optimal Inference Schedules for Masked Diffusion Models	Sitan Chen et.al.	2511.04647	null
2025-11-06	Efficient probabilistic surrogate modeling techniques for partially-observed large-scale dynamical systems	Hans Harder et.al.	2511.04641	null
2025-11-06	PromptSep: Generative Audio Separation via Multimodal Prompting	Yutong Wen et.al.	2511.04623	null
2025-11-06	Building Trust in Virtual Immunohistochemistry: Automated Assessment of Image Quality	Tushar Kataria et.al.	2511.04615	null
2025-11-06	Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm	Jingqi Tong et.al.	2511.04570	null
2025-11-06	Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment	Tao Lin et.al.	2511.04555	null
2025-11-06	Microservices Is Dying, A New Method for Module Division Based on Universal Interfaces	Qing Wang et.al.	2511.04548	null
2025-11-06	Unified Generative Latent Representation for Functional Brain Graphs	Subati Abulikemu et.al.	2511.04539	null
2025-11-07	THEval. Evaluation Framework for Talking Head Video Generation	Nabyl Quignon et.al.	2511.04520	null
2025-11-06	Launch-Day Diffusion: Tracking Hacker News Impact on GitHub Stars for AI Tools	Obada Kraishan et.al.	2511.04453	null
2025-11-06	First order statistic of afterpulsing and crosstalk events in SiPMs	Sergey Vinogradov et.al.	2511.04443	null
2025-11-06	A Natural Stochastic SIS Model, Analysis of Moments and Comparison of Different Perturbation Techniques	Berk Tan Perçin et.al.	2511.04415	null
2025-11-06	Lecture notes on Quantum Diffusion and Random Matrix Theory	Felipe Hernández et.al.	2511.04380	null
2025-11-06	MusRec: Zero-Shot Text-to-Music Editing via Rectified Flow and Diffusion Transformers	Ali Boudaghi et.al.	2511.04376	null
2025-11-06	RISE-T2V: Rephrasing and Injecting Semantics with LLM for Expansive Text-to-Video Generation	Xiangjun Zhang et.al.	2511.04317	null
2025-11-06	Novel Numerical Methods for Accurate Space Thermal Analysis: Enforcing View Factors and Modeling Diffuse Reflectivity	Bernat Frangi et.al.	2511.04277	null
2025-11-06	Quantum time-marching algorithms for solving linear transport problems including boundary conditions	Sergio Bengoechea et.al.	2511.04271	null
2025-11-06	Proto-LeakNet: Towards Signal-Leak Aware Attribution in Synthetic Human Face Imagery	Claudio Giusti et.al.	2511.04260	null
2025-11-06	Learning to Land Anywhere: Transferable Generative Models for Aircraft Trajectories	Olav Finne Praesteng Larsen et.al.	2511.04155	null
2025-11-06	Text to Sketch Generation with Multi-Styles	Tengjie Li et.al.	2511.04123	null
2025-11-06	Tortoise and Hare Guidance: Accelerating Diffusion Model Inference with Multirate Integration	Yunghee Lee et.al.	2511.04117	null
2025-11-06	SpatialLock: Precise Spatial Control in Text-to-Image Synthesis	Biao Liu et.al.	2511.04112	null
2025-11-06	Sub-exponential Growth in Online Word Usage: A Piecewise Power-Law Model	Hayafumi Watanabe et.al.	2511.04106	null
2025-11-06	A simulation that recapitulates the dynamics of PER-directed colloidal assembly	Cheng-Hung Chou et.al.	2511.04102	null
2025-11-06	Ultra-Diffuse, Ultra-Different: Observed vs. Simulated Ultra-Diffuse Galaxies Live in Fundamentally Different Halos	Jonah S. Gannon et.al.	2511.04006	null
2025-11-06	PhysCorr: Dual-Reward DPO for Physics-Constrained Text-to-Video Generation with Automated Preference Selection	Peiyao Wang et.al.	2511.03997	null
2025-11-06	Multiscale Astrocyte Network Calcium Dynamics for Biologically Plausible Intelligence in Anomaly Detection	Berk Iskar et.al.	2511.03993	null
2025-11-05	Upgrade of Super-Kamiokande with Gadolinium	Yusuke Koshio et.al.	2511.03921	null
2025-11-05	Evolutionary Optimization Trumps Adam Optimization on Embedding Space Exploration	Domício Pereira Neto et.al.	2511.03913	null
2025-11-05	Energy-dependent SEP Fe/O abundances during the May 2024 superstorm	G. D. Muro et.al.	2511.03905	null
2025-11-05	DeepFixel: Crossing white matter fiber identification through spherical convolutional neural networks	Adam M. Saunders et.al.	2511.03893	null
2025-11-05	Diffusion Dynamics in Biofilms with Time-Varying Channels	Yanahan Paramalingam et.al.	2511.03856	null
2025-11-05	From Static to Dynamic: Enhancing Offline-to-Online Reinforcement Learning via Energy-Guided Diffusion Stratification	Lipeng Zu et.al.	2511.03828	null
2025-11-05	Adaptive Geometric Regression for High-Dimensional Structured Data	Pawel Gajer et.al.	2511.03817	null
2025-11-05	Bifidelity Karhunen-Loève Expansion Surrogate with Active Learning for Random Fields	Aniket Jivani et.al.	2511.03756	null
2025-11-04	Attention-based ROI Discovery in 3D Tissue Images	Hossein Fathollahian et.al.	2511.03751	null
2025-11-05	SHIELD: Securing Healthcare IoT with Efficient Machine Learning Techniques for Anomaly Detection	Mahek Desai et.al.	2511.03661	null
2025-11-05	Exchange controls coarsening of surface condensates	Riccardo Rossetto et.al.	2511.03619	null
2025-11-05	The Converse Madelung Question	Jonathan R Dunkley et.al.	2511.03552	null
2025-11-05	HJB equations driven by the Dirichlet-Ferguson Laplacian in Wasserstein-Sobolev spaces	François Delarue et.al.	2511.03522	null
2025-11-05	Parametric resonance, chaos and spatial structure in the Lotka-Volterra model	Mohamed Swailem et.al.	2511.03521	null
2025-11-05	On a Stationarity Theory for Stochastic Volterra Integral Equations	Emmanuel Gnabeyeu et.al.	2511.03474	null
2025-11-05	Spatiotemporal statistics of the dissipation rate at the boundary of a turbulent flow using Diffusing-Wave Spectroscopy	Enzo Francisco et.al.	2511.03462	null
2025-11-05	Universal first-passage time statistics for quantum diffusion	Guido Ladenburger et.al.	2511.03455	null
2025-11-05	Discovery of Slot Plasma Excitations in a AlGaN/GaN Plasmonic Crystal	A. R. Khisameeva et.al.	2511.03450	null
2025-11-05	Rolling carpet strategy to reduce mosquito populations in two-dimensional space	Luís Almeida et.al.	2511.03447	null
2025-11-05	QMeCha: quantum Monte Carlo package for fermions in embedding environments	Matteo Barborini et.al.	2511.03439	null
2025-11-05	Seeing What You Say: Expressive Image Generation from Speech	Jiyoung Lee et.al.	2511.03423	null
2025-11-05	Beyond Citations: Measuring Idea-level Knowledge Diffusion from Research to Journalism and Policy-making	Yangliu Fan et.al.	2511.03378	null
2025-11-05	Noise induced Stability of a Mean-Field model of Systemic Risk with uncertain robustness	Alexander Alecio et.al.	2511.03358	null
2025-11-05	Stellar-like Galactic center excess challenges particle dark matter	Silvia Manconi et.al.	2511.03350	null
2025-11-05	Reversibility, covariance and coarse-graining for Langevin dynamics: On the choice of multiplicative noise	Mario Ayala et.al.	2511.03347	null
2025-11-05	UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions	Guozhen Zhang et.al.	2511.03334	null
2025-11-05	Far-UVC Field Emission Device at 226 nm and its Sub-Nanometer thick GaN/AlN Quantum Well Anode	D. L. Boiko et.al.	2511.03321	null
2025-11-05	Diffusion-SDPO: Safeguarded Direct Preference Optimization for Diffusion Models	Minghao Fu et.al.	2511.03317	null
2025-11-05	Diffusion-Driven Terahertz Air-Ground Communications under Dynamic Atmospheric Turbulence	Jinhao Yi et.al.	2511.03290	null
2025-11-05	Diffusion Language Models are Super Data Learners	Jinjie Ni et.al.	2511.03276	null
2025-11-05	Unified Long Video Inpainting and Outpainting via Overlapping High-Order Co-Denoising	Shuangquan Lyu et.al.	2511.03272	null
2025-11-05	Enhancing Medical Image Segmentation via Heat Conduction Equation	Rong Wu et.al.	2511.03260	null
2025-11-05	Diffusion-Guided Mask-Consistent Paired Mixing for Endoscopic Image Segmentation	Pengyu Jie et.al.	2511.03219	null
2025-11-05	Provable Separations between Memorization and Generalization in Diffusion Models	Zeqi Ye et.al.	2511.03202	null
2025-11-05	Exploring the mechanisms of transverse relaxation of copper(II)-phthalocyanine spin qubits	Boning Li et.al.	2511.03199	null
2025-11-05	Analysis and Patterns of Nonlocal Klausmeier Model	Md Shah Alam et.al.	2511.03188	null
2025-11-05	Finetuning-Free Personalization of Text to Image Generation via Hypernetworks	Sagar Shrestha et.al.	2511.03156	null
2025-11-05	Optimal Boundary Control of Diffusion on Graphs via Linear Programming	Harbir Antil et.al.	2511.03129	null
2025-11-05	EGMOF: Efficient Generation of Metal-Organic Frameworks Using a Hybrid Diffusion-Transformer Architecture	Seunghee Han et.al.	2511.03122	null
2025-11-05	FP-AbDiff: Improving Score-based Antibody Design by Capturing Nonequilibrium Dynamics through the Underlying Fokker-Planck Equation	Jiameng Chen et.al.	2511.03113	null
2025-11-05	Accelerating inverse materials design using generative diffusion models with reinforcement learning	Junwu Chen et.al.	2511.03112	null
2025-11-05	Scaling Multi-Agent Environment Co-Design with Diffusion Models	Hao Xiang Li et.al.	2511.03100	null
2025-11-05	Novel reaction-diffusion PDE model for fingerprint-like pattern emergence via the Schnakenberg mechanism	Fabián Sepúlveda-Soto et.al.	2511.03096	null
2025-11-04	WorldPlanner: Monte Carlo Tree Search and MPC with Action-Conditioned Visual World Models	R. Khorrambakht et.al.	2511.03077	null
2025-11-04	Last Hitting Time Distributions for Solvable Diffusions	Giuseppe Campolieti et.al.	2511.03037	null
2025-11-04	Robust optimal consumption, investment and reinsurance for recursive preferences	Elizabeth Dadzie et.al.	2511.03031	null
2025-11-04	Discrete Bayesian Sample Inference for Graph Generation	Ole Petersen et.al.	2511.03015	null
2025-11-04	Simultaneous evaporation and imbibition of a droplet on a flooded porous substrate	David Craig et.al.	2511.03006	null
2025-11-04	Scalable Single-Cell Gene Expression Generation with Latent Diffusion Models	Giovanni Palla et.al.	2511.02986	null
2025-11-04	An Atomistically Informed Device Engineering (AIDE) Method Realized: A case study in GaAs	Leopoldo Diaz et.al.	2511.02976	null
2025-11-04	Inference-Time Personalized Alignment with a Few User Preference Queries	Victor-Alexandru Pădurean et.al.	2511.02966	null
2025-11-04	A Conditional Diffusion Model for Building Energy Modeling Workflows	Saumya Sinha et.al.	2511.02930	null
2025-11-04	Dynamical evolution of stellar binaries in galactic centers	Mark Dodici et.al.	2511.02905	null
2025-11-04	A model for positron annihilation in multi-layer systems by solving the diffusion equation using different positron affinities	Lucian Mathes et.al.	2511.02889	null
2025-11-04	Academics and Generative AI: Empirical and Epistemic Indicators of Policy-Practice Voids	R. Yamamoto Ravenor et.al.	2511.02875	null
2025-11-04	Diffusion Models are Robust Pretrainers	Mika Yagoda et.al.	2511.02793	null
2025-11-04	AI-Generated Image Detection: An Empirical Study and Future Research Directions	Nusrat Tasnim et.al.	2511.02791	null
2025-11-04	Measuring AI Diffusion: A Population-Normalized Metric for Tracking Global AI Usage	Amit Misra et.al.	2511.02781	null
2025-11-04	DANIEL: A Distributed and Scalable Approach for Global Representation Learning with EHR Applications	Zebin Wang et.al.	2511.02754	null
2025-11-04	AI Diffusion in Low Resource Language Countries	Amit Misra et.al.	2511.02752	null
2025-11-04	From Densities to Potentials: Benchmarking Local Exchange-Correlation Approximations	Visagan Ravindran et.al.	2511.02744	null
2025-11-04	Numerical valuation of European options under two-asset infinite-activity exponential Lévy models	Massimiliano Moda et.al.	2511.02700	null
2025-11-04	Error Estimates of Generic Discretisation of Reaction-Diffusion System with Constraints	Yahya Alnashri et.al.	2511.02654	null
2025-11-04	Stochastic Redistribution of Indistinguishable Items in Shared Habitation: A Multi-Agent Simulation Framework	Syed Haseeb Shah et.al.	2511.02648	null
2025-11-04	Natural-gas storage modelling by deep reinforcement learning	Tiziano Balaconi et.al.	2511.02646	null
2025-11-04	Generalizable super-resolution turbulence reconstruction from minimal training data	Wu Haokai et.al.	2511.02604	null
2025-11-04	TAUE: Training-free Noise Transplant and Cultivation Diffusion Model	Daichi Nagai et.al.	2511.02580	null
2025-11-04	First-principles Prediction of Carrier Mobility in Semiconductor Nanowires Based on the Spatially Dependent Boltzmann Transport Equation	Zirui He et.al.	2511.02561	null
2025-11-04	Sparse Source Identification in Transient Advection-Diffusion Problems with a Primal-Dual-Active-Point Strategy	Marco Mattuschka et.al.	2511.02552	null
2025-11-04	Implementation and Evaluation of Stable Diffusion on a General-Purpose CGLA Accelerator	Takuto Ando et.al.	2511.02530	null
2025-11-04	Polarization-controlled pattern formation in antiparallel dipolar binary condensates	Zhijun Zhang et.al.	2511.02516	null
2025-11-04	Self-similar blow-up solutions for the supercritical parabolic Hardy-Hénon equation	Razvan Gabriel Iagar et.al.	2511.02511	null
2025-11-04	DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding	Zixuan Liu et.al.	2511.02495	null
2025-11-05	OLATverse: A Large-scale Real-world Object Dataset with Precise Lighting Control	Xilong Zhou et.al.	2511.02483	null
2025-11-04	Generalized Swing Control Framework for Inverter-based Resources	Rodrigo Bernal et.al.	2511.02482	null
2025-11-04	Wireless Video Semantic Communication with Decoupled Diffusion Multi-frame Compensation	Bingyan Xie et.al.	2511.02478	null
2025-11-04	HAGI++: Head-Assisted Gaze Imputation and Generation	Chuhan Jiao et.al.	2511.02468	null
2025-11-04	KAO: Kernel-Adaptive Optimization in Diffusion for Satellite Image	Teerapong Panboonyuen et.al.	2511.02462	null
2025-11-04	Wasserstein Convergence of Critically Damped Langevin Diffusions	Stanislas Strasman et.al.	2511.02419	null
2025-11-04	Synthetic Crop-Weed Image Generation and its Impact on Model Generalization	Garen Boyadjian et.al.	2511.02417	null
2025-11-04	Characterization Of Cyclic Hygrothermal Swelling And Shrinkage Behavior Of Balsa Wood And Gfrp-Balsa Sandwich Structures	Yuan Wu et.al.	2511.02412	null
2025-11-04	Acoustic orbital Hall effect and orbital pumping in light-metal-ferromagnet bilayers	Mingxing Wu et.al.	2511.02388	null
2025-11-04	Charge glass from supercooling topological-ordered liquid	Kouki Kimata et.al.	2511.02380	null
2025-11-04	ELAIS-N1 Deep Field uGMRT Band-2: Constraints on Diffuse Galactic Synchrotron Emission Power Spectrum	Rashmi Sagar et.al.	2511.02375	null
2025-11-04	LUMA-RAG: Lifelong Multimodal Agents with Provably Stable Streaming Alignment	Rohan Wandre et.al.	2511.02371	null
2025-11-04	LiveSecBench: A Dynamic and Culturally-Relevant AI Safety Benchmark for LLMs in Chinese Context	Yudong Li et.al.	2511.02366	null
2025-11-04	CoCoVa: Chain of Continuous Vision-Language Thought for Latent Space Reasoning	Jizheng Ma et.al.	2511.02360	null
2025-11-04	Global Well-Posedness for the 2D and 3D Prandtl-Shercliff Model	Wei-Xi Li et.al.	2511.02338	null
2025-11-04	Non Asymptotic Mixing Time Analysis of Non-Reversible Markov Chains	Muhammad Abdullah Naeem et.al.	2511.02265	null
2025-11-04	Limit Theorems for Stochastic Gradient Descent in High-Dimensional Single-Layer Networks	Parsa Rangriz et.al.	2511.02258	null
2025-11-04	Wavelet-Optimized Motion Artifact Correction in 3D MRI Using Pre-trained 2D Score Priors	Genyuan Zhang et.al.	2511.02256	null
2025-11-04	Signal attenuation and phase evolution evaluation under the influence of nonlinear gradient	Chenghao Xua et.al.	2511.02242	null
2025-11-04	Diffusion Index Forecast with Tensor Data	Bin Chen et.al.	2511.02235	null
2025-11-04	Search for Diffuse Supernova Neutrino Background with 956.2 days of Super-Kamiokande Gadolinium Dataset	K. Abe et.al.	2511.02222	null
2025-11-04	Null control of heat equations with analytic memory kernels	Qi Lü et.al.	2511.02170	null
2025-11-04	Geometric Solution of Turbulence as Diffusion in Loop Space	Alexander Migdal et.al.	2511.02165	null
2025-11-03	AI Spillover is Different: Flat and Lean Firms as Engines of AI Diffusion and Productivity Gain	Xiaoning Wang et.al.	2511.02099	null
2025-11-03	Watermarking Discrete Diffusion Language Models	Avi Bagchi et.al.	2511.02083	null
2025-11-03	Beyond Static Cutoffs: One-Shot Dynamic Thresholding for Diffusion Language Models	Jucheng Shen et.al.	2511.02077	null
2025-11-03	Human-AI Co-Embodied Intelligence for Scientific Experimentation and Manufacturing	Xinyi Lin et.al.	2511.02071	null
2025-11-03	Quantum-Enhanced Generative Models for Rare Event Prediction	M. Z. Haider et.al.	2511.02042	null
2025-11-03	Stability of mixed-state phases under weak decoherence	Yifan F. Zhang et.al.	2511.01976	null
2025-11-03	Dust back-reaction on gas around planets modifies the cold thermal torque	Raúl O. Chametla et.al.	2511.01973	null
2025-11-03	Quantum Acoustics Demystifies the Strange Metals	Eric J. Heller et.al.	2511.01853	null
2025-11-03	Fractional Diffusion Bridge Models	Gabriel Nobis et.al.	2511.01795	null
2025-11-03	How Far Are Surgeons from Surgical World Models? A Pilot Study on Zero-shot Surgical Video Generation with Expert Assessment	Zhen Chen et.al.	2511.01775	null
2025-11-03	Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image	Yuxiao Yang et.al.	2511.01767	null
2025-11-03	Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process	Jiayi Chen et.al.	2511.01718	null
2025-11-03	Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and Beyond	Xin Qiao et.al.	2511.01704	null
2025-10-31	Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals	Xiangyu Fan et.al.	2510.27684	null
2025-10-31	Social learning moderates the tradeoffs between efficiency, stability, and equity in group foraging	Ze-Xu Li et.al.	2510.27683	null
2025-10-31	MolChord: Structure-Sequence Alignment for Protein-Guided Drug Design	Wei Zhang et.al.	2510.27671	null
2025-10-31	Bayesian model selection and misspecification testing in imaging inverse problems only from noisy and partial measurements	Tom Sprunck et.al.	2510.27663	null
2025-10-31	A Primal-dual Forward-backward Splitting Method for Cross-diffusion Gradient Flows with General Mobility Matrices	Yunhong Deng et.al.	2510.27660	null
2025-10-31	A stochastic branching particle method for solving non-conservative reaction-diffusion equations	Liyao Lyu et.al.	2510.27615	null
2025-10-31	Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model	John Won et.al.	2510.27607	null
2025-10-31	Who Made This? Fake Detection and Source Attribution with Diffusion Features	Simone Bonechi et.al.	2510.27602	null
2025-10-31	Kinematical and dynamical contrast of dislocations in thick GaN substrates observed by synchrotron-radiation X-ray topography under six-beam diffraction conditions	Yongzhao Yao et.al.	2510.27597	null
2025-10-31	Optimal Convergence Analysis of DDPM for General Distributions	Yuchen Jiao et.al.	2510.27562	null
2025-10-31	On the global existence and uniform-in-time bounds for three-component reaction-diffusion systems with mass control and polynomial growth	Redouane Douaifia et.al.	2510.27555	null
2025-10-31	EBT-Policy: Energy Unlocks Emergent Physical Reasoning Capabilities	Travis Davies et.al.	2510.27545	null
2025-10-31	Diffusion velocity modulus of self-propelled spherical and circular particles in the generalized Langevin approach	Pedro J. Colmenares et.al.	2510.27536	null
2025-10-31	On chip plasmonic slit cavity platform for room temperature strong coupling with deterministically positioned colloidal quantum dots	Jin Qin et.al.	2510.27531	null
2025-10-31	InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames	Haorui Li et.al.	2510.27497	null
2025-10-31	Spectral Neural Graph Sparsification	Angelica Liguori et.al.	2510.27474	null
2025-10-31	Diffuse Thinking: Exploring Diffusion Language Models as Efficient Thought Proposers for Reasoning	Chenyang Shao et.al.	2510.27469	null
2025-10-31	Size-dependent transformation patterns in NiTi tubes under tension and bending: Stereo digital image correlation experiments and modeling	Aslan Ahadi et.al.	2510.27464	null
2025-10-31	Ultra-diffuse Galaxy Analogues in the Subaru Hyper-Suprime Cam Wide-field Clusters	N. A. Makda et.al.	2510.27459	null
2025-10-31	From Pixels to Paths: A Multi-Agent Framework for Editable Scientific Illustration	Jianwen Sun et.al.	2510.27452	null
2025-10-31	Novel bidomain partitioned strategies for the simulation of ventricular fibrillation dynamics	Gopika P B et.al.	2510.27447	null
2025-10-31	DeblurSDI: Blind Image Deblurring Using Self-diffusion	Yanlong Yang et.al.	2510.27439	null
2025-10-31	Magnetically Assisted Separation of Weakly Magnetic Metal Ions in Porous Media. Part 2: Numerical Simulations	Muhammad Garba et.al.	2510.27438	null
2025-10-31	Density functional investigations on 2D-Be2C as an anode for alkali Metal-ion batteries	Hetvi Jadav et.al.	2510.27433	null
2025-10-31	From Shock to Synchrotron: a mini-review on magnetic turbulence in Supernova Remnants	Emanuele Greco et.al.	2510.27431	null
2025-10-31	Hexagonal BeX (X: S, Te) monolayer as potential electrode material for alkali metal-ion batteries: A DFT perspective	Hetvi Jadav et.al.	2510.27429	null
2025-10-31	A Hybrid Deep Learning and Forensic Approach for Robust Deepfake Detection	Sales Aribe Jr et.al.	2510.27392	null
2025-10-31	Fine-Tuning Open Video Generators for Cinematic Scene Synthesis: A Small-Data Pipeline with LoRA and Wan2.1 I2V	Meftun Akarsu et.al.	2510.27364	null
2025-10-31	Back to the Communities: A Mixed-Methods and Community-Driven Evaluation of Cultural Sensitivity in Text-to-Image Models	Sarah Kiden et.al.	2510.27361	null
2025-10-31	Understanding the Implicit User Intention via Reasoning with Large Language Model for Image Editing	Yijia Wang et.al.	2510.27335	null
2025-10-31	Instantaneous Total Enhanced Dissipation For Very Rough Shear Flows	Marco Romito et.al.	2510.27331	null
2025-10-31	Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis	Weiming Chen et.al.	2510.27324	null
2025-10-31	The memory-dependent FPK equation for fractional Gaussian noise	Lifang Feng et.al.	2510.27303	null
2025-10-31	Rethinking Robust Adversarial Concept Erasure in Diffusion Models	Qinghong Yin et.al.	2510.27285	null
2025-10-31	Sample Path Moderate Deviation Principle for Queues with Waiting-time Dependent Interarrival and Service Times	Chang Feng et.al.	2510.27226	null
2025-10-31	MDAS-GNN: Multi-Dimensional Spatiotemporal GNN with Spatial Diffusion for Urban Traffic Risk Forecasting	Ziyuan Gao et.al.	2510.27197	null
2025-10-31	Dual-Scale Antenna Deployment for Pinching Antenna Systems	Xu Gan et.al.	2510.27185	null
2025-10-31	H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models	Mingyu Sung et.al.	2510.27171	null
2025-10-31	DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model	Yucheng Xing et.al.	2510.27169	null
2025-11-03	A monotone finite element method for an elliptic distributed optimal control problem with a convection-dominated state equation	SeongHee Jeong et.al.	2510.27167	null
2025-10-31	Structure-Aware Optimal Intervention for Rumor Dynamics on Networks: Node-Level, Time-Varying, and Resource-Constrained	Yan Zhu et.al.	2510.27165	null
2025-10-31	A Survey on Generative Recommendation: Data, Model, and Tasks	Min Hou et.al.	2510.27157	null
2025-10-31	E-MMDiT: Revisiting Multimodal Diffusion Transformer Design for Fast Image Synthesis under Limited Resources	Tong Shen et.al.	2510.27135	null
2025-10-31	On the Exact Distribution of the Sum of Two CIR Processes	Bilgi Yilmaz et.al.	2510.27081	null
2025-10-30	Blind MIMO Semantic Communication via Parallel Variational Diffusion: A Completely Pilot-Free Approach	Hao Jiang et.al.	2510.27043	null
2025-10-30	Atomistic Simulations of H-Cu Vacancy Cosegregation and H Diffusion in Cu Grain Boundary	Vasileios Fotopoulos et.al.	2510.26991	null
2025-10-30	Limited Memory LRSGA Optimizer to competitive optimization	Katherine Rossella Foglia et.al.	2510.26983	null
2025-10-30	Zeeman Doppler mapping deconstructed	M. J. Stift et.al.	2510.26973	null
2025-10-30	A Critical Examination of the PAH Hypothesis	Alan T. Tokunaga et.al.	2510.26970	null
2025-10-30	Ultra-High Dose-Rates, the FLASH Effect, and Hydrogen Peroxide Yields: Do Experiments and Simulations Really Disagree?	Marc Benjamin Hahn et.al.	2510.26928	null
2025-10-30	Can galactic magnetic fields diffuse into the voids?	Oindrila Ghosh et.al.	2510.26918	null
2025-11-03	Generative diffusion modeling protocols for improving the Kikuchi pattern indexing in electron back-scatter diffraction	Meghraj Prajapat et.al.	2510.26907	null
2025-10-30	Enhancing Neural Network Backflow	Kieran Loehr et.al.	2510.26906	null
2025-10-30	Superdiffusion and anomalous fluctuations in chiral integrable dynamics	Cristiano Muzzi et.al.	2510.26897	null
2025-10-30	BI-DCGAN: A Theoretically Grounded Bayesian Framework for Efficient and Diverse GANs	Mahsa Valizadeh et.al.	2510.26892	null
2025-10-30	Baryon anti-Baryon Photoproduction Cross Sections off the Proton	F. Afzal et.al.	2510.26890	null
2025-10-30	Galaxy Luminosity Function of the Coma Cluster from Deep $u’-g’-r’$ Wendelstein Imaging Data	Raphael Zöller et.al.	2510.26889	null
2025-11-03	Evaluating Perspectival Biases in Cross-Modal Retrieval	Teerapol Saengsukhiran et.al.	2510.26861	null
2025-10-30	Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark	Ziyu Guo et.al.	2510.26802	null
2025-10-30	Masked Diffusion Captioning for Visual Feature Learning	Chao Feng et.al.	2510.26799	null
2025-10-30	SEE4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting	Dongyue Lu et.al.	2510.26796	null
2025-10-30	The Quest for Generalizable Motion Generation: Data, Model, and Evaluation	Jing Lin et.al.	2510.26794	null
2025-10-30	Orbital Optimization and Neural-Network-Assisted Configuration Interaction Calculations of Rydberg States	Gianluca Levi et.al.	2510.26751	null
2025-10-30	Hybrid Consistency Policy: Decoupling Multi-Modal Diversity and Real-Time Efficiency in Robotic Manipulation	Qianyou Zhao et.al.	2510.26670	null
2025-10-30	Enzyme Active Bath Affects Protein Condensation	Kevin Ching et.al.	2510.26659	null
2025-10-30	Stabilization of Metallic, Excitonic Insulator, and Superionic Phases in Helium-Rare Gas Compounds at Sub-Terapascal Pressures	Cong Liu et.al.	2510.26626	null
2025-10-30	Optimal Bidding and Coordinated Dispatch of Hybrid Energy Systems in Regulation Markets	Tanmay Mishra et.al.	2510.26602	null
2025-10-30	ResMatching: Noise-Resilient Computational Super-Resolution via Guided Conditional Flow Matching	Anirban Ray et.al.	2510.26601	null
2025-10-30	Emu3.5: Native Multimodal Models are World Learners	Yufeng Cui et.al.	2510.26583	null
2025-10-30	Enhancing ECG Classification Robustness with Lightweight Unsupervised Anomaly Detection Filters	Mustafa Fuad Rifet Ibrahim et.al.	2510.26501	null
2025-10-30	Quantum Gated Recurrent GAN with Gaussian Uncertainty for Network Anomaly Detection	Wajdi Hammami et.al.	2510.26487	null
2025-10-30	Neon is an inhibitor of CO hydrogenation in pre-stellar core conditions	Basile Husquinet et.al.	2510.26445	null
2025-10-30	Diffusion-Aided Bandwidth-Efficient Semantic Communication with Adaptive Requests	Xuesong Wang et.al.	2510.26442	null
2025-11-02	The evolving surface morphochemical reaction-diffusion system for battery modeling	Benedetto Bozzini et.al.	2510.26437	null
2025-10-30	Co-Evolving Latent Action World Models	Yucen Wang et.al.	2510.26433	null
2025-10-30	LoCoT2V-Bench: A Benchmark for Long-Form and Complex Text-to-Video Generation	Xiangqing Zheng et.al.	2510.26412	null
2025-10-30	EEG-Driven Image Reconstruction with Saliency-Guided Diffusion Models	Igor Abramov et.al.	2510.26391	null
2025-10-30	Capillarity Reveals the Role of Capsid Geometry in HIV Nuclear Translocation	Alex W. Brown et.al.	2510.26357	null
2025-10-30	GLYPH-SR: Can We Achieve Both High-Quality Image Super-Resolution and High-Fidelity Text Recovery via VLM-guided Latent Diffusion Model?	Mingyu Sung et.al.	2510.26339	null
2025-10-30	Tracing the evolution of brightest galaxies and diffuse light in galaxy groups	B. Bilata-Woldeyes et.al.	2510.26329	null
2025-10-30	Posterior Sampling by Combining Diffusion Models with Annealed Langevin Dynamics	Zhiyang Xun et.al.	2510.26324	null
2025-10-30	Generative Artificial Intelligence for Air Shower Simulation	C. Bozza et.al.	2510.26316	null
2025-10-30	Apsidal Motion in O-Star Binaries: GENEC rotating binary models put to the k2-test	Sophie Rosu et.al.	2510.26306	null
2025-10-30	Distributional Multi-objective Black-box Optimization for Diffusion-model Inference-time Multi-Target Generation	Kim Yong Tan et.al.	2510.26278	null
2025-10-30	Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws	Lin Guo et.al.	2510.26268	null
2025-10-30	Likely Interpolants of Generative Models	Frederik Möbius Rygaard et.al.	2510.26266	null
2025-10-30	Impact of AlN buffer thickness on electrical and thermal characteristics of AlGaN/GaN/AlN HEMTs	Minho Kim et.al.	2510.26244	null
2025-10-30	Which Way Does Time Flow? A Psychophysics-Grounded Evaluation for Vision-Language Models	Shiho Matta et.al.	2510.26241	null
2025-10-30	DiSE: A diffusion probabilistic model for automatic structure elucidation of organic compounds	Haochen Chen et.al.	2510.26231	null
2025-10-30	Don’t Let It Fade: Preserving Edits in Diffusion Language Models via Token Timestep Allocation	Woojin Kim et.al.	2510.26200	null
2025-10-30	Sketch2PoseNet: Efficient and Generalized Sketch to 3D Human Pose Prediction	Li Wang et.al.	2510.26196	null
2025-10-30	MoTDiff: High-resolution Motion Trajectory estimation from a single blurred image using Diffusion models	Wontae Choi et.al.	2510.26173	null
2025-10-30	Linking Heterogeneous Data with Coordinated Agent Flows for Social Media Analysis	Shifu Chen et.al.	2510.26172	null
2025-10-30	Learning to Manage Investment Portfolios beyond Simple Utility Functions	Maarten P. Scholl et.al.	2510.26165	null
2025-10-30	A two-dimensional fractional-order element-free Galerkin method for nonlocal elasticity and complex domain problems	Shubham Desai et.al.	2510.26161	null
2025-10-30	FullPart: Generating each 3D Part at Full Resolution	Lihe Ding et.al.	2510.26140	null
2025-10-30	Diffusive interface approach to oxygen transport and metabolism under cellular flow dynamics in microcirculations	Naoki Takeishi et.al.	2510.26138	null
2025-10-30	Security Risk of Misalignment between Text and Image in Multi-modal Model	Xiaosen Wang et.al.	2510.26105	null
2025-10-30	Robust Super-Capacity SRS Channel Inpainting via Diffusion Models	Usman Akram et.al.	2510.26097	null
2025-10-30	Group-Equivariant Diffusion Models for Lattice Field Theory	Octavio Vega et.al.	2510.26081	null
2025-10-30	New Money: A Systematic Review of Synthetic Data Generation for Finance	James Meldrum et.al.	2510.26076	null
2025-10-30	Dynamic VLM-Guided Negative Prompting for Diffusion Models	Hoyeon Chang et.al.	2510.26052	null
2025-10-29	Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation	Feichen Gan et.al.	2510.26026	null
2025-10-29	Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer	Roman Beliy et.al.	2510.25976	null
2025-10-29	SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing	Sung-Hoon Yoon et.al.	2510.25970	null
2025-10-29	Generative Image Restoration and Super-Resolution using Physics-Informed Synthetic Data for Scanning Tunneling Microscopy	Nikola L. Kolev et.al.	2510.25921	null
2025-10-29	*Evaluation of Structural Properties and Defect Energetics in Al $x$Ga${1-x}$ N Alloys*	Farshid Reza et.al.	2510.25912	null
2025-10-29	MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency	Nicolas Dufour et.al.	2510.25897	null
2025-10-29	Figuring Out Gas & Galaxies In Enzo (FOGGIE) XI: Circumgalactic O VI Emission Traces Clumpy Inflowing Recycled Gas	Cassandra Lochhaas et.al.	2510.25844	null
2025-10-29	Demystifying flux eruptions: Magnetic flux transport in magnetically arrested disks	Jonatan Jacquemin-Ide et.al.	2510.25842	null
2025-10-29	$λφ^4$ as an Effective Theory in de Sitter	Sebastian Cespedes et.al.	2510.25826	null
2025-10-29	ScaleDiff: Higher-Resolution Image Synthesis via Efficient and Model-Agnostic Diffusion	Sungho Koh et.al.	2510.25818	null
2025-10-29	VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning	Baolu Li et.al.	2510.25772	null
2025-10-29	FreeArt3D: Training-Free Articulated Object Generation using 3D Diffusion	Chuhao Chen et.al.	2510.25765	null
2025-10-29	DiagramEval: Evaluating LLM-Generated Diagrams via Graphs	Chumeng Liang et.al.	2510.25761	null
2025-10-29	Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation	Zhi-Kai Chen et.al.	2510.25739	null
2025-10-29	Physics-Guided Conditional Diffusion Networks for Microwave Image Reconstruction	Shirin Chehelgami et.al.	2510.25729	null
2025-10-29	Binaspect – A Python Library for Binaural Audio Analysis, Visualization & Feature Generation	Dan Barry et.al.	2510.25714	null
2025-10-29	Dissipative structure and decay rate for an inviscid non-equilibrium radiation hydrodynamics system	Corrado Lattanzio et.al.	2510.25663	null
2025-10-29	BOLT-GAN: Bayes-Optimal Loss for Stable GAN Training	Mohammadreza Tavasoli Naeini et.al.	2510.25609	null
2025-10-29	Out-of-equilibrium contributions to charm hadrons in a fluid-dynamic approach	Rossana Facen et.al.	2510.25601	null
2025-10-29	Error Bounds and Optimal Schedules for Masked Diffusions with Factorized Approximations	Hugo Lavenant et.al.	2510.25544	null
2025-10-29	Off-policy Reinforcement Learning with Model-based Exploration Augmentation	Likun Wang et.al.	2510.25529	null
2025-10-29	Nonparametric estimation of homogenized invariant measures from multiscale data via Hermite expansion	Jaroslav I. Borodavka et.al.	2510.25521	null
2025-10-29	Dynamics of entanglement fluctuations and quantum Mpemba effect in the $ν=1$ QSSEP model	Angelo Russotto et.al.	2510.25519	null
2025-10-29	Stochastic Control of Dividends with a Drawdown Penalty	Kira Dudziak et.al.	2510.25494	null
2025-10-29	Echo-Conditioned Denoising Diffusion Probabilistic Models for Multi-Target Tracking in RF Sensing	Amirhossein Azarbahram et.al.	2510.25464	null
2025-10-29	The impact of fluctuations on particle systems described by Dean-Kawasaki-type equations	Nathan O. Silvano et.al.	2510.25454	null
2025-10-29	Improving Temporal Consistency and Fidelity at Inference-time in Perceptual Video Restoration by Zero-shot Image-based Diffusion Models	Nasrin Rahimi et.al.	2510.25420	null
2025-10-29	Instance-Level Composed Image Retrieval	Bill Psomas et.al.	2510.25387	null
2025-10-29	Beyond Leakage and Complexity: Towards Realistic and Efficient Information Cascade Prediction	Jie Peng et.al.	2510.25348	null
2025-10-29	Two Orders of Magnitude Enhancement in Oxide Ion Conductivity in Cu2P2O7 via Vanadium Substitution: A Pathway Toward SOFC Electrolytes	Bibhas Ghanta et.al.	2510.25325	null
2025-10-29	4-Doodle: Text to 3D Sketches that Move!	Hao Chen et.al.	2510.25319	null
2025-10-29	A virtual element approximation for the modified transmission eigenvalues for natural materials	Liangkun Xu et.al.	2510.25298	null
2025-10-29	Reactive capacitance of flat patches of arbitrary shape	Denis S. Grebenkov et.al.	2510.25288	null
2025-10-29	Diffusion-Driven Progressive Target Manipulation for Source-Free Domain Adaptation	Yuyang Huang et.al.	2510.25279	null
2025-10-29	Spectral analysis of the stiffness matrix sequence in the approximated Stokes equation	Samuele Ferri et.al.	2510.25252	null
2025-10-29	Balanced conic rectified flow	Kim Shin Seong et.al.	2510.25229	null
2025-10-29	$D^2GS$ : Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction	Kejing Xia et.al.	2510.25173	null
2025-10-29	Error Analysis of Third-Order in Time and Fourth-Order Linear Finite Difference Scheme for Landau-Lifshitz-Gilbert Equation under Large Damping Parameters	Changjian Xie et.al.	2510.25172	null
2025-10-29	Target-Guided Bayesian Flow Networks for Quantitatively Constrained CAD Generation	Wenhao Zheng et.al.	2510.25163	null
2025-10-30	Model-Document Protocol for AI Search	Hongjin Qian et.al.	2510.25160	null
2025-10-30	Energy Approach from $\varepsilon$ -Graph to Continuum Diffusion Model with Connectivity Functional	Yahong Yang et.al.	2510.25114	null
2025-10-29	The Neural Differential Manifold: An Architecture with Explicit Geometric Structure	Di Zhang et.al.	2510.25113	null
2025-10-29	Percolating Corrosion Pathways of Chemically Ordered NiCr Alloys in Molten Salts	Hamdy Arkoub et.al.	2510.25098	null
2025-10-29	Learning Fair Graph Representations with Multi-view Information Bottleneck	Chuxun Liu et.al.	2510.25096	null
2025-10-29	PSTF-AttControl: Per-Subject-Tuning-Free Personalized Image Generation with Controllable Face Attributes	Xiang liu et.al.	2510.25084	null
2025-10-29	Training Across Reservoirs: Using Numerical Differentiation To Couple Trainable Networks With Black-Box Reservoirs	Andrew Clark et.al.	2510.25074	null
2025-10-28	Cluster Formation in Diffusive Systems	Benedict Leimkuhler et.al.	2510.25034	null
2025-10-28	Numerical Studies on the Radio Afterglows in TDE: Bow Shock	Guobin Mou et.al.	2510.25033	null
2025-10-28	Preliminary Demonstration of Diamond-GaN pn Diodes via Grafting	Jie Zhou et.al.	2510.25028	null
2025-10-28	LRT-Diffusion: Calibrated Risk-Aware Guidance for Diffusion Policies	Ximan Sun et.al.	2510.24983	null
2025-10-28	Breaking the Timescale Barrier: Generative Discovery of Conformational Free-Energy Landscapes and Transition Pathways	Chenyu Tang et.al.	2510.24979	null
2025-10-28	Flow-Induced Phase Separation for Active Brownian Particles in Four-Roll-Mill Flow	Soni D. Prajapati et.al.	2510.24960	null
2025-10-28	Energy-Conserving Contact Dynamics of Nonspherical Rigid-Body Particles	Haoyuan Shi et.al.	2510.24945	null
2025-10-28	Interpolated Discrepancy Data Assimilation for PDEs with Sparse Observations	Tong Wu et.al.	2510.24944	null
2025-10-28	Solute dispersion boosts the phoretic removal of colloids from dead-end pores	Yiran Li et.al.	2510.24938	null
2025-10-28	Large-Time Analysis of the Langevin Dynamics for Energies Fulfilling Polyak-Łojasiewicz Conditions	Massimo Fornasier et.al.	2510.24925	null
2025-10-28	VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos	Qiucheng Wu et.al.	2510.24904	null
2025-10-28	Emergence of Chimeras States in One-dimensional Ising model with Long-Range Diffusion	Alejandro de Haro García et.al.	2510.24903	null
2025-10-28	General Microstructure Factor Analysis of Diffusion MRI in Gray-Matter Predicts Cognitive Scores	Lucas Z. Brito et.al.	2510.24879	null
2025-10-28	Extracting Spectral Diffusion in Two-Dimensional Coherent Spectra via the Projection Slice Theorem	Cesar Perez et.al.	2510.24865	null
2025-10-28	Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation	Inclusion AI et.al.	2510.24821	null
2025-10-28	SafeEditor: Unified MLLM for Efficient Post-hoc T2I Safety Editing	Ruiyang Zhang et.al.	2510.24820	null
2025-10-28	CT-Less Attenuation Correction Using Multiview Ensemble Conditional Diffusion Model on High-Resolution Uncorrected PET Images	Alexandre St-Georges et.al.	2510.24805	null
2025-10-28	Generative View Stitching	Chonghyuk Song et.al.	2510.24718	null
2025-10-28	Uniform Discrete Diffusion with Metric Path for Video Generation	Haoge Deng et.al.	2510.24717	null
2025-10-28	Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance	Yujie Wei et.al.	2510.24711	null
2025-10-29	Pearl: A Foundation Model for Placing Every Atom in the Right Location	Genesis Research Team et.al.	2510.24670	null
2025-10-28	Group Relative Attention Guidance for Image Editing	Xuanpu Zhang et.al.	2510.24657	null
2025-10-28	FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling	Zengzhuang Xu et.al.	2510.24645	null
2025-10-28	A Dual-Branch CNN for Robust Detection of AI-Generated Facial Forgeries	Xin Zhang et.al.	2510.24640	null
2025-10-28	Causal Ordering for Structure Learning From Time Series	Pedro P. Sanchez et.al.	2510.24639	null
2025-10-28	Reduced Basis Approach for Convection-Diffusion Equations with Non-Linear Boundary Reaction Conditions	Sebastian Matera et.al.	2510.24632	null
2025-10-28	The Impact of Cosmic Ray Transport on the $γ$ -Ray Luminosity of Diffuse Gas	Roark Habegger et.al.	2510.24622	null
2025-10-28	Diffusion LLM with Native Variable Generation Lengths: Let [EOS] Lead the Way	Yicun Yang et.al.	2510.24605	null
2025-10-28	A Novel XAI-Enhanced Quantum Adversarial Networks for Velocity Dispersion Modeling in MaNGA Galaxies	Sathwik Narkedimilli et.al.	2510.24598	null
2025-10-28	Multifunctional Wideband Digital Metasurface for Secure Electromagnetic Manipulation in S-Band	Longpan Wang et.al.	2510.24597	null
2025-10-29	Leveraging Scale Separation and Stochastic Closure for Data-Driven Prediction of Chaotic Dynamics	Ismaël Zighed et.al.	2510.24583	null
2025-10-28	JWST observations of photodissociation regions III. Dust modelling at the illuminated edge of the Horsehead PDR	M. Elyajouri et.al.	2510.24573	null
2025-10-28	Unbiased likelihood estimation of the Langevin diffusion for animal movement modelling	Ron Ronald Togunov et.al.	2510.24539	null
2025-10-28	Multi-messenger constraints on transient accelerators of ultra-high energy cosmic rays	Antonio Condorelli et.al.	2510.24516	null
2025-10-28	Diffusion Models for Wireless Transceivers: From Pilot-Efficient Channel Estimation to AI-Native 6G Receivers	Yuzhi Yang et.al.	2510.24495	null
2025-10-28	Decoupled MeanFlow: Turning Flow Models into Flow Maps for Accelerated Sampling	Kyungmin Lee et.al.	2510.24474	null
2025-10-28	Rethinking Visual Intelligence: Insights from Video Pretraining	Pablo Acuaviva et.al.	2510.24448	null
2025-10-28	Pair Approximation Meets Reality: Diffusion of Innovation in Organizational Networks within the biased-independence q-Voter Model	Angelika Abramiuk-Szurlej et.al.	2510.24447	null
2025-10-28	Optimising Underwater Neutrino Telescopes for All-Flavour Point Source Sensitivity	Iwan Morton-Blake et.al.	2510.24395	null
2025-10-28	Global stability and asymptotic behavior for the incompressible MHD equations without viscosity or magnetic diffusion	Qunyi Bie et.al.	2510.24338	null
2025-10-28	Training-free Source Attribution of AI-generated Images via Resynthesis	Pietro Bongini et.al.	2510.24278	null
2025-10-29	Bistability, Oscillations, and Multistability on Hycean Planets	Yichen Gao et.al.	2510.24224	null
2025-10-28	Beyond Inference Intervention: Identity-Decoupled Diffusion for Face Anonymization	Haoxin Yang et.al.	2510.24213	null
2025-10-28	MC-SJD : Maximal Coupling Speculative Jacobi Decoding for Autoregressive Visual Generation Acceleration	Junhyuk So et.al.	2510.24211	null
2025-10-28	When bubbles matter: hydrogen transport governs apparent kinetics in 4-nitrophenol hydrogenation reaction	Tatiana Nizkaia et.al.	2510.24176	null
2025-10-28	Interplay between Cu diffusion and bonding anisotropy on the thermoelectric performance of double cation chalcohalides $CuBiSeX_{2} (X = Cl, Br)$	Manivannan Saminathan et.al.	2510.24147	null
2025-10-28	Assessment of modern shock capturing schemes for all-speed flows in the OpenFOAM framework	Anurag Adityanarayan Ray et.al.	2510.24146	null
2025-10-29	VC4VG: Optimizing Video Captions for Text-to-Video Generation	Yang Du et.al.	2510.24134	null
2025-10-28	Compositional Image Synthesis with Inference-Time Scaling	Minsuk Ji et.al.	2510.24133	null
2025-10-28	ETC: training-free diffusion models acceleration with Error-aware Trend Consistency	Jiajian Xie et.al.	2510.24129	null
2025-10-29	Effect of flow-aligned external magnetic fields on mushroom instability	Y. Guo et.al.	2510.24121	null
2025-10-28	OmniText: A Training-Free Generalist for Controllable Text-Image Manipulation	Agus Gunawan et.al.	2510.24093	null
2025-10-28	Information-Theoretic Discrete Diffusion	Moongyu Jeon et.al.	2510.24088	null
2025-10-28	A Novel Virus Diffusion Optimization (VDO) Algorithm for Global Optimization	Zhaoqi Sun et.al.	2510.24083	null
2025-10-28	Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification	William Yang et.al.	2510.24078	null
2025-10-28	Language-Conditioned Representations and Mixture-of-Experts Policy for Robust Multi-Task Robotic Manipulation	Xiucheng Zhang et.al.	2510.24055	null
2025-10-28	Graph conductance, synchronization, and a new bottleneck measure	C. Tyler Diggans et.al.	2510.24048	null
2025-10-28	Causal-Aware Generative Adversarial Networks with Reinforcement Learning	Tu Anh Hoang Nguyen et.al.	2510.24046	null
2025-10-28	AutoPrompt: Automated Red-Teaming of Text-to-Image Models via LLM-Driven Adversarial Prompts	Yufan Liu et.al.	2510.24034	null
2025-10-28	OneCast: Structured Decomposition and Modular Generation for Cross-Domain Time Series Forecasting	Tingyue Pan et.al.	2510.24028	null
2025-10-28	Training-Free Safe Text Embedding Guidance for Text-to-Image Diffusion Models	Byeonghu Na et.al.	2510.24012	null
2025-10-28	Score-based constrained generative modeling via Langevin diffusions with boundary conditions	Adam Nordenhög et.al.	2510.23985	null
2025-10-28	Synergistic Neural Forecasting of Air Pollution with Stochastic Sampling	Yohan Abeysinghe et.al.	2510.23977	null
2025-10-28	Diffusion Adaptive Text Embedding for Text-to-Image Diffusion Models	Byeonghu Na et.al.	2510.23974	null
2025-10-28	An efficient probabilistic hardware architecture for diffusion-like models	Andraž Jelinčič et.al.	2510.23972	null
2025-10-28	Neural USD: An object-centric framework for iterative editing and control	Alejandro Escontrela et.al.	2510.23956	null
2025-10-27	Modeling Biological Multifunctionality with Echo State Networks	Anastasia-Maria Leventi-Peetz et.al.	2510.23940	null
2025-10-27	Intrinsic Scalings with Non-standard Growth	M. D. Amaral et.al.	2510.23939	null
2025-10-27	TurboPortrait3D: Single-step diffusion-based fast portrait novel-view synthesis	Emily Kim et.al.	2510.23929	null
2025-10-27	TRELLISWorld: Training-Free World Generation from Object Generators	Hanke Chen et.al.	2510.23880	null
2025-10-27	A PDE-Informed Latent Diffusion Model for 2-m Temperature Downscaling	Paul Rosu et.al.	2510.23866	null
2025-10-27	Low-Dose CT Imaging Using a Regularization-Enhanced Efficient Diffusion Probabilistic Model	Qiang Li et.al.	2510.23859	null
2025-10-27	RareFlow: Physics-Aware Flow-Matching for Cross-Sensor Super-Resolution of Rare-Earth Features	Forouzan Fallah et.al.	2510.23816	null
2025-10-27	Stress in chromium thin films deposited by DC magnetron sputtering on grounded cupper and stainless-steel substrate holders	M. D. Medina et.al.	2510.23801	null
2025-10-27	Galactic Alchemy: Deep Learning Map-to-Map Translation in Hydrodynamical Simulations	Philipp Denzel et.al.	2510.23768	null
2025-10-27	Switching Network System Identification via Convex Optimizations	Kaito Iwasaki et.al.	2510.23721	null
2025-10-27	Shock Acceleration in the Intracluster Medium: Implications of Micromirror Confinement	Rebecca Diesing et.al.	2510.23700	null
2025-10-27	Quantum Kinetic Modeling of KEEN waves in a Warm-Dense Regime	F. Alejandro Padilla-Gomez et.al.	2510.23690	null
2025-10-27	Variational Masked Diffusion Models	Yichi Zhang et.al.	2510.23606	null
2025-10-27	Coupling-induced universal dynamics in bilayer two-dimensional Bose gases	En Chang et.al.	2510.23600	null
2025-10-27	Think Twice: Branch-and-Rethink Reasoning Reward Model	Yizhu Jiao et.al.	2510.23596	null
2025-10-28	PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection	Yusu Qian et.al.	2510.23594	null
2025-10-27	FARMER: Flow AutoRegressive Transformer over Pixels	Guangting Zheng et.al.	2510.23588	null
2025-10-27	More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models	Hongkai Lin et.al.	2510.23574	null
2025-10-27	Cosmic Vine: High abundance of massive galaxies and dark matter halos in a forming cluster at z=3.44	Nikolaj B. Sillassen et.al.	2510.23549	null
2025-10-27	FreeFuse: Multi-Subject LoRA Fusion via Auto Masking at Test Time	Yaoli Liu et.al.	2510.23515	null
2025-10-27	Towards Deep Physics-Informed Kolmogorov-Arnold Networks	Spyros Rigas et.al.	2510.23501	null
2025-10-27	Yesnt: Are Diffusion Relighting Models Ready for Capture Stage Compositing? A Hybrid Alternative to Bridge the Gap	Elisabeth Jüttner et.al.	2510.23494	null
2025-10-27	A Finite Element framework for bulk-surface coupled PDEs to solve moving boundary problems in biophysics	Alessandro Contri et.al.	2510.23459	null
2025-10-27	An Efficient Remote Sensing Super Resolution Method Exploring Diffusion Priors and Multi-Modal Constraints for Crop Type Mapping	Songxi Yang et.al.	2510.23382	null
2025-10-27	Elastic modeling and total energy calculations of the structural characteristics of “free-standing”,periodic, pseudomorphic GaN/AlN superlattices	Th. Karakostas et.al.	2510.23344	null
2025-10-27	Genetic interfaces at the frontier of expanding microbial colonies	Jonathan Bauermann et.al.	2510.23307	null
2025-10-27	ReconViaGen: Towards Accurate Multi-view 3D Object Reconstruction via Generation	Jiahao Chang et.al.	2510.23306	null
2025-10-27	Can KM3-230213A be compatible with a cosmogenic origin?	The KM3NeT collaboration et.al.	2510.23287	null
2025-10-27	Adaptive Stochastic Coefficients for Accelerating Diffusion Sampling	Ruoyu Wang et.al.	2510.23285	null
2025-10-27	DCMM-SQL: Automated Data-Centric Pipeline and Multi-Model Collaboration Training for Text-to-SQL Model	Yuanzhen Xie et.al.	2510.23284	null
2025-10-27	Privacy-Preserving Semantic Communication over Wiretap Channels with Learnable Differential Privacy	Weixuan Chen et.al.	2510.23274	null
2025-10-27	A Novel Framework for Multi-Modal Protein Representation Learning	Runjie Zheng et.al.	2510.23273	null
2025-10-27	Analysis of Hematocrit-Plasma Separation in a Trifurcated Microchannel by a Diffusive Flux Model	Rishi Kumar et.al.	2510.23270	null
2025-10-27	Deep Active Inference with Diffusion Policy and Multiple Timescale World Model for Real-World Exploration and Navigation	Riko Yokozawa et.al.	2510.23258	null
2025-10-27	Autoregressive Styled Text Image Generation, but Make it Reliable	Carmine Zaccagnino et.al.	2510.23240	null
2025-10-27	Fluctuations, Clustering, and Interaction-Driven Dynamics in Sedimenting Particles at Low Galileo Numbers: A Neural Network Approach	Nejc Vovk et.al.	2510.23232	null
2025-10-27	Model-free filtering in high dimensions via projection and score-based diffusions	Sören Christensen et.al.	2510.23197	null
2025-10-27	Evaluation of Vision-LLMs in Surveillance Video	Pascal Benschop et.al.	2510.23190	null
2025-10-27	Physics-informed diffusion models for extrapolating crystal structures beyond known motifs	Andrij Vasylenko et.al.	2510.23181	null
2025-10-27	Residual Diffusion Bridge Model for Image Restoration	Hebaixu Wang et.al.	2510.23116	null
2025-10-27	Sampling from Energy distributions with Target Concrete Score Identity	Sergei Kholkin et.al.	2510.23106	null
2025-10-27	Exotic B-series representation of the Feller semigroup for Itô diffusions and the MSR path integral	Alberto Bonicelli et.al.	2510.23102	null
2025-10-27	Mind the Gap – Imaging Buried Interfaces in Twisted Oxide Moirés	Harikrishnan KP et.al.	2510.23042	null
2025-10-27	LLM Meets Diffusion: A Hybrid Framework for Crystal Material Generation	Subhojyoti Khastagir et.al.	2510.23040	null
2025-10-27	Nested AutoRegressive Models	Hongyu Wu et.al.	2510.23028	null
2025-10-27	Mixed Density Diffuser: Efficient Planning with Non-uniform Temporal Resolution	Crimson Stambaugh et.al.	2510.23026	null
2025-10-27	UniAIDet: A Unified and Universal Benchmark for AI-Generated Image Content Detection and Localization	Huixuan Zhang et.al.	2510.23023	null
2025-10-27	M $^{3}$ T2IBench: A Large-Scale Multi-Category, Multi-Instance, Multi-Relation Text-to-Image Benchmark	Huixuan Zhang et.al.	2510.23020	null
2025-10-27	ManiDP: Manipulability-Aware Diffusion Policy for Posture-Dependent Bimanual Manipulation	Zhuo Li et.al.	2510.23016	null
2025-10-27	CoMo: Compositional Motion Customization for Text-to-Video Generation	Youcan Xu et.al.	2510.23007	null
2025-10-27	SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency	Quanjian Song et.al.	2510.22994	null
2025-10-27	Exploring Semantic-constrained Adversarial Example with Instruction Uncertainty Reduction	Jin Hu et.al.	2510.22981	null
2025-10-27	Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method	Bohan Li et.al.	2510.22973	null
2025-10-27	VALA: Learning Latent Anchors for Training-Free and Temporally Consistent	Zhangkai Wu et.al.	2510.22970	null
2025-10-27	Multi-Agent Conditional Diffusion Model with Mean Field Communication as Wireless Resource Allocation Planner	Kechen Meng et.al.	2510.22969	null
2025-10-27	LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation	Zeyu Wang et.al.	2510.22946	null
2025-10-27	Diffuse to Detect: A Generalizable Framework for Anomaly Detection with Diffusion Models Applications to UAVs and Beyond	Mingze Gong et.al.	2510.22928	null
2025-10-27	Simple Denoising Diffusion Language Models	Huaisheng Zhu et.al.	2510.22926	null
2025-10-27	Constraint on the physical origin of GRB prompt emission via its non-detected diffuse neutrino emission	Yang-Dong-Jun Ou et.al.	2510.22914	null
2025-10-27	Machine-Learning-Guided Insights into Solid-Electrolyte Interphase Conductivity: Are Amorphous Lithium Fluorophosphates the Key?	Peichen Zhong et.al.	2510.22912	null
2025-10-27	Radiation enhanced diffusion in cartilages as a physical mechanism underlying radiation treatments of osteoarthritis and related disorders	Diana Shvydka et.al.	2510.22903	null
2025-10-26	Encoder-Decoder Diffusion Language Models for Efficient Training and Inference	Marianne Arriola et.al.	2510.22852	null
2025-10-26	Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models	Lexiang Xiong et.al.	2510.22851	null
2025-10-26	Clustering by Denoising: Latent plug-and-play diffusion for single-cell data	Dominik Meier et.al.	2510.22835	null
2025-10-26	FairJudge: MLLM Judging for Social Attributes and Prompt Image Alignment	Zahraa Al Sahili et.al.	2510.22827	null
2025-10-26	Logical GANs: Adversarial Learning through Ehrenfeucht Fraisse Games	Mirco A. Mannucci et.al.	2510.22824	null
2025-10-26	Analytical Swarm Chemistry: Characterization and Analysis of Emergent Swarm Behaviors	Ricardo Vega et.al.	2510.22821	null
2025-10-26	MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control	Fatemeh Nazarieh et.al.	2510.22810	null
2025-10-26	A Free Probabilistic Framework for Denoising Diffusion Models: Entropy, Transport, and Reverse Processes	Swagatam Das et.al.	2510.22778	null
2025-10-26	Distributionally Robust Optimization via Diffusion Ambiguity Modeling	Jiaqi Wen et.al.	2510.22757	null
2025-10-26	Cross-view Localization and Synthesis – Datasets, Challenges and Opportunities	Ningli Xu et.al.	2510.22736	null
2025-10-26	A Unified Numerical Framework for Turbulent Convection and Phase-Change Dynamics in Coupled Fluid-Porous Systems	Rongfu Guo et.al.	2510.22730	null
2025-10-26	TABL-ABM: A Hybrid Framework for Synthetic LOB Generation	Ollie Olby et.al.	2510.22685	null
2025-10-26	Conjugate Relation Modeling for Few-Shot Knowledge Graph Completion	Zilong Wang et.al.	2510.22656	null
2025-10-26	Self-Attention Decomposition For Training Free Diffusion Editing	Tharun Anand et.al.	2510.22650	null
2025-10-26	Directionality-induced jamming in multiplex networks	Mateo Bouchet et.al.	2510.22634	null
2025-10-26	Projection Embedded Diffusion Bridge for CT Reconstruction from Incomplete Data	Yuang Wang et.al.	2510.22605	null
2025-10-26	Diffusion operators on $p$ -adic analytic manifolds	Patrick Erik Bradley et.al.	2510.22563	null
2025-10-26	LO-SDA: Latent Optimization for Score-based Atmospheric Data Assimilation	Jing-An Sun et.al.	2510.22562	null
2025-10-26	DDTR: Diffusion Denoising Trace Recovery	Maximilian Matyash et.al.	2510.22553	null
2025-10-26	SRSR: Enhancing Semantic Accuracy in Real-World Image Super-Resolution with Spatially Re-Focused Text-Conditioning	Chen Chen et.al.	2510.22534	null
2025-10-26	Open Multimodal Retrieval-Augmented Factual Image Generation	Yang Tian et.al.	2510.22521	null
2025-10-26	An End-to-End Generative Diffusion Model for Heavy-Ion Collisions	Jing-An Sun et.al.	2510.22515	null
2025-10-26	CANDI: Hybrid Discrete-Continuous Diffusion Models	Patrick Pynadath et.al.	2510.22510	null
2025-10-26	A Novel Discrete-time Model of Information Diffusion on Social Networks Considering Users Behavior	Tran Van Khanh et.al.	2510.22501	null
2025-10-25	Suppression of quantized heat flow by the dielectric response of a compressible strip at the quantum Hall edge	Eugene V. Sukhorukov et.al.	2510.22459	null
2025-10-25	Introducing the Seyfert-LINER Index (SLI): High Resolution (<100pc) Ionization Structures in the ISM of AGN ESO~137-G34	D. Ł. Król et.al.	2510.22447	null
2025-10-25	PromptReverb: Multimodal Room Impulse Response Generation Through Latent Rectified Flow Matching	Ali Vosoughi et.al.	2510.22439	null
2025-10-25	Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration	Zheng Wei et.al.	2510.22431	null
2025-10-25	T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models	Jindong Yang et.al.	2510.22366	null
2025-10-25	New cost terms through the homogenization of an optimal control problem under dynamic boundary conditions on the microscopic particles	J. I. Díaz et.al.	2510.22357	null
2025-10-25	Traveling waves in nonclassical diffusion equations	William Barker et.al.	2510.22349	null
2025-10-25	GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation	Phillip Mueller et.al.	2510.22337	null
2025-10-25	Moving Beyond Diffusion: Hierarchy-to-Hierarchy Autoregression for fMRI-to-Image Reconstruction	Xu Zhang et.al.	2510.22335	null
2025-10-25	GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping	Jing Wang et.al.	2510.22319	null
2025-10-25	T2I-RiskyPrompt: A Benchmark for Safety Evaluation, Attack, and Defense on Text-to-Image Model	Chenyu Zhang et.al.	2510.22300	null
2025-10-25	Glymphatic Clearance in the Optic Nerve: A Multidomain Electro-osmostic Model	Shanfeng Xiao et.al.	2510.22271	null
2025-10-25	DiffusionLane: Diffusion Model for Lane Detection	Kunyang Zhou et.al.	2510.22236	null
2025-10-25	Robust MIMO Channel Estimation Using Energy-Based Generative Diffusion Models	Ziqi Diao et.al.	2510.22230	null
2025-10-25	Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation	Jeongin Kim et.al.	2510.22229	null
2025-10-25	ACG: Action Coherence Guidance for Flow-based VLA models	Minho Park et.al.	2510.22201	null
2025-10-28	LongCat-Video Technical Report	Meituan LongCat Team et.al.	2510.22200	null
2025-10-25	Scaling Non-Parametric Sampling with Representation	Vincent Lu et.al.	2510.22196	null
2025-10-25	Suppression of Thin-Film Thermal Conductivity due to Surface Roughness	Michimasa Morita et.al.	2510.22185	null
2025-10-28	A Unified Framework for Direction and Diffuseness Estimation Using Tight-Frame Microphone Arrays	Akira Omoto et.al.	2510.22183	null
2025-10-25	Electrokinetic Effects on Flow and Ion Transport in Charge-Patterned Corrugated Nanochannels	Thomas Petersen et.al.	2510.22182	null
2025-10-25	Expert Validation of Synthetic Cervical Spine Radiographs Generated with a Denoising Diffusion Probabilistic Model	Austin A. Barr et.al.	2510.22166	null
2025-10-25	On hypoellipticity of degenerate operators in testing and detection problems	Erhan Bayraktar et.al.	2510.22150	null
2025-10-25	Well-posedness and finite-time extinction of a PDE-ODE spatial-network model with anisotropic diffusion	Xiao Meng et.al.	2510.22147	null
2025-10-25	The oscillation properties of the Blue Large Amplitude Pulsators (BLAPs): relative change rate of periods, excitations, and period relations	Tao Wu et.al.	2510.22144	null
2025-10-25	Discovering Latent Graphs with GFlowNets for Diverse Conditional Image Generation	Bailey Trang et.al.	2510.22107	null
2025-10-24	Compositional Bias Control in Large Language Models: Preference Learning Fails, Supervision Succeeds	Atij Mahesh et.al.	2510.22084	null
2025-10-23	LayerComposer: Interactive Personalized T2I via Spatially-Aware Layered Canvas	Guocheng Gordon Qian et.al.	2510.20820	null
2025-10-23	Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge	Nimrod Berman et.al.	2510.20819	null
2025-10-23	Generative Reasoning Recommendation via LLMs	Minjie Hong et.al.	2510.20815	null
2025-10-23	Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal Transformers	Dean L Slack et.al.	2510.20807	null
2025-10-23	ARGenSeg: Image Segmentation with Autoregressive Image Generation Model	Xiaolong Wang et.al.	2510.20803	null
2025-10-23	BadGraph: A Backdoor Attack Against Latent Diffusion Model for Text-Guided Graph Generation	Liang Ye et.al.	2510.20792	null
2025-10-23	DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion	Noam Issachar et.al.	2510.20766	null
2025-10-23	Consumption-Investment Problem in Rank-Based Models	David Itkin et.al.	2510.20763	null
2025-10-23	AutoScape: Geometry-Consistent Long-Horizon Scene Generation	Jiacheng Chen et.al.	2510.20726	null
2025-10-23	ALICE-LRI: A General Method for Lossless Range Image Generation for Spinning LiDAR Sensors without Calibration Metadata	Samuel Soutullo et.al.	2510.20708	null
2025-10-23	Trust, But Verify: An Empirical Evaluation of AI-Generated Code for SDN Controllers	Felipe Avencourt Soares et.al.	2510.20703	null
2025-10-23	Search for neutrino emission from LHAASO observed Microquasar with IceCube 10-year data	Rong-Lan Li et.al.	2510.20687	null
2025-10-23	Downsizing Diffusion Models for Cardinality Estimation	Xinhe Mu et.al.	2510.20681	null
2025-10-23	Niebla: an open-source code for modelling the extragalactic background light	Sara Porras-Bedmar et.al.	2510.20664	null
2025-10-23	UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset	Chen Zhao et.al.	2510.20661	null
2025-10-23	MS-BART: Unified Modeling of Mass Spectra and Molecules for Structure Elucidation	Yang Han et.al.	2510.20615	null
2025-10-23	Diffusion Autoencoders with Perceivers for Long, Irregular and Multimodal Astronomical Sequences	Yunyi Shen et.al.	2510.20595	null
2025-10-23	GenColorBench: A Color Evaluation Benchmark for Text-to-Image Generation Models	Muhammad Atif Butt et.al.	2510.20586	null
2025-10-23	Dynamic principles of concentration buffering through liquid-liquid phase separation	Logan de Monchaux-Irons et.al.	2510.20553	null
2025-10-23	Beneath the kinetic interpretation of noise	Carlos Escudero et.al.	2510.20552	null
2025-10-23	EchoDistill: Bidirectional Concept Distillation for One-Step Diffusion Personalization	Yixiong Yang et.al.	2510.20512	null
2025-10-23	Stochastic evolution equations with nonlinear diffusivity, recent progress and critical cases	Ioana Ciotir et.al.	2510.20471	null
2025-10-23	Vacancy diffusion on a brominated Si(100) surface: Critical effect of the dangling bond charge state	T. V. Pavlova et.al.	2510.20426	null
2025-10-23	A Stochastic Parameterization of Non-Orographic Gravity Waves Induced Mixing for Mars Planetary Climate Model	Jiandong Liu et.al.	2510.20410	null
2025-10-23	PointMapPolicy: Structured Point Cloud Processing for Multi-Modal Imitation Learning	Xiaogang Jia et.al.	2510.20406	null
2025-10-23	Positional Encoding Field	Yunpeng Bai et.al.	2510.20385	null
2025-10-23	Self-diffusion in confined systems	Manuel Mayo et.al.	2510.20357	null
2025-10-23	What do AI-Generated Images Want?	Amanda Wasielewski et.al.	2510.20350	null
2025-10-23	Synthetic Data for Robust Runway Detection	Estelle Chigot et.al.	2510.20349	null
2025-10-23	AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models	Seunghoon Lee et.al.	2510.20348	null
2025-10-23	Nonergodic extended phase for waves in three dimensions	Marcus Prado et.al.	2510.20346	null
2025-10-23	Dino-Diffusion Modular Designs Bridge the Cross-Domain Gap in Autonomous Parking	Zixuan Wu et.al.	2510.20335	null
2025-10-23	From diffusion optics to photocatalytic rates in multiply scattering porous slabs: finite-slab Green’s function, optical-to-kinetic mapping, and application to core-shell aerogels with embedded anatase nanoparticles	Renaud A. L. Vallée et.al.	2510.20315	null
2025-10-23	Soft Phonon Charge-Density Wave Formation in the Kagome Metal KV $_3$Sb$_5$	Yifan Wang et.al.	2510.20230	null
2025-10-23	EditInfinity: Image Editing with Binary-Quantized Generative Models	Jiahuan Wang et.al.	2510.20217	null
2025-10-23	FlowCycle: Pursuing Cycle-Consistent Flows for Text-based Editing	Yanghao Wang et.al.	2510.20212	null
2025-10-23	Vox-Evaluator: Enhancing Stability and Fidelity for Zero-shot TTS with A Multi-Level Evaluator	Hualei Wang et.al.	2510.20210	null
2025-10-23	RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling	Bingjie Gao et.al.	2510.20206	null
2025-10-23	Evaluating Video Models as Simulators of Multi-Person Pedestrian Trajectories	Aaron Appelle et.al.	2510.20182	null
2025-10-23	IB-GAN: Disentangled Representation Learning with Information Bottleneck Generative Adversarial Networks	Insu Jeon et.al.	2510.20165	null
2025-10-23	IEnSF: Iterative Ensemble Score Filter for Reducing Error in Posterior Score Estimation in Nonlinear Data Assimilation	Zezhong Zhang et.al.	2510.20159	null
2025-10-23	Understanding Mechanistic Role of Structural and Functional Connectivity in Tau Propagation Through Multi-Layer Modeling	Tingting Dan et.al.	2510.20148	null
2025-10-23	Compositional Generation for Long-Horizon Coupled PDEs	Somayajulu L. N. Dhulipala et.al.	2510.20141	null
2025-10-23	Memory-Dependent FPK Equations for Nonlinear SDOF Oscillators Under Fractional Gaussian Noise Excitation	Lifang Feng et.al.	2510.20124	null
2025-10-23	StableSketcher: Enhancing Diffusion Model for Pixel-based Sketch Generation via Visual Question Answering Feedback	Jiho Park et.al.	2510.20093	null
2025-10-23	Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency	Hao Yu et.al.	2510.20092	null
2025-10-22	Exposing Blindspots: Cultural Bias Evaluation in Generative Image Models	Huichan Seo et.al.	2510.20042	null
2025-10-22	A Framework for the Adoption and Integration of Generative AI in Midsize Organizations and Enterprises (FAIGMOE)	Abraham Itzhak Weinberg et.al.	2510.19997	null
2025-10-22	Geometric Interpretation of Brownian Motion on Riemannian Manifolds	Taeyoung Lee et.al.	2510.19991	null
2025-10-22	No Compute Left Behind: Rethinking Reasoning and Sampling with Masked Diffusion Models	Zachary Horvitz et.al.	2510.19990	null
2025-10-22	Guiding diffusion models to reconstruct flow fields from sparse data	Marc Amorós-Trepat et.al.	2510.19971	null
2025-10-22	A new wave of vehicle insurance fraud fueled by generative AI	Amir Hever et.al.	2510.19957	null
2025-10-22	Modelling multiscale architecture of biofilm extracellular matrix and its role in oxygen transport	Raghu K. Moorthy et.al.	2510.19947	null
2025-10-22	The Sagittarius C Complex in the Mid-Infrared with SOFIA/FORCAST	Roy J. Zhao et.al.	2510.19908	null
2025-10-22	Compressing Biology: Evaluating the Stable Diffusion VAE for Phenotypic Drug Discovery	Télio Cropsal et.al.	2510.19887	null
2025-10-22	From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model	Yatai Ji et.al.	2510.19871	null
2025-10-22	Transforming Multi-Omics Integration with GANs: Applications in Alzheimer’s and Cancer	Md Selim Reza et.al.	2510.19870	null
2025-10-21	Two Quantum Algorithms for Nonlinear Reaction-Diffusion Equation using Chebyshev Approximation Method	Manish Kumar et.al.	2510.19855	null
2025-10-22	Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing	Yusu Qian et.al.	2510.19808	null
2025-10-22	OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation	Guowei Xu et.al.	2510.19789	null
2025-10-23	A Survey on Cache Methods in Diffusion Models: Toward Efficient Multi-Modal Generation	Jiacheng Liu et.al.	2510.19755	null
2025-10-22	Cultural Dimensions of Artificial Intelligence Adoption: Empirical Insights for Wave 1 from a Multinational Longitudinal Pilot Study	Michelle J. Cummings-Koether et.al.	2510.19743	null
2025-10-22	Enabling Granular Subgroup Level Model Evaluations by Generating Synthetic Medical Time Series	Mahmoud Ibrahim et.al.	2510.19728	null
2025-10-22	High Uniformity GaN Micro-pyramids and Platelets by Selective Area Growth	Changhao Li et.al.	2510.19697	null
2025-10-22	Style Attack Disguise: When Fonts Become a Camouflage for Adversarial Intent	Yangshijie Zhang et.al.	2510.19641	null
2025-10-22	DFT-informed Design of Radiation-Resistant Dilute Ternary Cu Alloys	Vaibhav Vasudevan et.al.	2510.19638	null
2025-10-23	Network Contagion Dynamics in European Banking: A Navier-Stokes Framework for Systemic Risk Assessment	Tatsuru Kikuchi et.al.	2510.19630	null
2025-10-22	Learning and Simulating Building Evacuation Patterns for Enhanced Safety Design Using Generative Models	Jin Han et.al.	2510.19623	null
2025-10-22	Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism	Junfei Zhou et.al.	2510.19618	null
2025-10-22	XBench: A Comprehensive Benchmark for Visual-Language Explanations in Chest Radiography	Haozhe Luo et.al.	2510.19599	null
2025-10-23	CBDiff:Conditional Bernoulli Diffusion Models for Image Forgery Localization	Zhou Lei et.al.	2510.19597	null
2025-10-22	Accretion with two-phase gas supply and its application in black hole X-ray binaries	Yilong Wang et.al.	2510.19584	null
2025-10-22	On an adjoint-based numerical approach for time-dependent optimal control problems of biomedical interest	Zahra Mirzaiyan et.al.	2510.19576	null
2025-10-22	The Intricate Dance of Prompt Complexity, Quality, Diversity, and Consistency in T2I Models	Xiaofeng Zhang et.al.	2510.19557	null
2025-10-22	Observation of counterion binding in the inner Helmholtz layer at the ionic surfactant-water interface	Yuyang Peng et.al.	2510.19554	null
2025-10-22	Quantum Monte Carlo study of low-dimensional Fermi fluids of dipolar atoms	Clio Johnson et.al.	2510.19533	null
2025-10-22	PoseCrafter: Extreme Pose Estimation with Hybrid Video Synthesis	Qing Mao et.al.	2510.19527	null
2025-10-22	WALLABY: an untargeted search for H I-bearing ultra-diffuse galaxies uncovers the first known ultra-diffuse galaxy pair	T. O’Beirne et.al.	2510.19466	null
2025-10-22	A Reduced-Dimensional Model for the Interhemispheric Geostrophic Meridional Overturning Circulation	Elian Vanderborght et.al.	2510.19454	null
2025-10-22	Evolution of Conditional Entropy for Diffusion Dynamics on Graphs	Samuel Koovely et.al.	2510.19441	null
2025-10-22	GigaBrain-0: A World Model-Powered Vision-Language-Action Model	GigaBrain Team et.al.	2510.19430	null
2025-10-22	Optimization Benchmark for Diffusion Models on Dynamical Systems	Fabian Schaipp et.al.	2510.19376	null
2025-10-22	Imitation Learning Policy based on Multi-Step Consistent Integration Shortcut Model	Yu Fang et.al.	2510.19356	null
2025-10-22	Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall	Mingyu Jo et.al.	2510.19304	null
2025-10-22	D2D: Detector-to-Differentiable Critic for Improved Numeracy in Text-to-Image Generation	Nobline Yoo et.al.	2510.19278	null
2025-10-22	SCEESR: Semantic-Control Edge Enhancement for Diffusion-Based Super-Resolution	Yun Kai Zhuang et.al.	2510.19272	null
2025-10-22	From Newborn to Impact: Bias-Aware Citation Prediction	Mingfei Lu et.al.	2510.19246	null
2025-10-22	Particle system approximation of Nash equilibria in large games	Ludovic Tangpi et.al.	2510.19211	null
2025-10-22	Stability and slow dynamics of an interior spiky pattern in a one-dimensional spatial Solow model with capital-induced labor migration	Fanze Kong et.al.	2510.19204	null
2025-10-22	An Active Diffusion Neural Network for Graphs	Mengying Jiang et.al.	2510.19202	null
2025-10-22	Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks	Kai Zeng et.al.	2510.19195	null
2025-10-23	Video Consistency Distance: Enhancing Temporal Consistency for Image-to-Video Generation via Reward-Based Fine-Tuning	Takehiro Aoshima et.al.	2510.19193	null
2025-10-22	Topology optimization for microfluidic mixers by a phase field method	Zongyuan Liu et.al.	2510.19192	null
2025-10-22	Step-Aware Residual-Guided Diffusion for EEG Spatial Super-Resolution	Hongjun Liu et.al.	2510.19166	null
2025-10-21	A Cross-Environment and Cross-Embodiment Path Planning Framework via a Conditional Diffusion Model	Mehran Ghafarian Tamizi et.al.	2510.19128	null
2025-10-21	Learning Peer Influence Probabilities with Linear Contextual Bandits	Ahmed Sayeed Faruk et.al.	2510.19119	null
2025-10-21	MoAlign: Motion-Centric Representation Alignment for Video Diffusion Models	Aritra Bhowmik et.al.	2510.19022	null
2025-10-21	DP $^2$ O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution	Rongyuan Wu et.al.	2510.18851	null
2025-10-21	Limit Profile for the high-temperature Curie-Weiss model	Lazaros Karageorgiou et.al.	2510.18793	null
2025-10-21	Protein generation with embedding learning for motif diversification	Kevin Michalewicz et.al.	2510.18790	null
2025-10-21	A Frequentist Statistical Introduction to Variational Inference, Autoencoders, and Diffusion Models	Yen-Chi Chen et.al.	2510.18777	null
2025-10-21	UltraGen: High-Resolution Video Generation with Hierarchical Attention	Teng Hu et.al.	2510.18775	null
2025-10-21	The minimal wave speed of time-periodic traveling waves arising from a diffusive Kermack-McKendrick model with seasonality and nonlocal delayed interactions	Shuang-Ming Wang et.al.	2510.18767	null
2025-10-21	Particle acceleration at radiative supernova remnant shocks	Pierre Cristofari et.al.	2510.18763	null
2025-10-21	Diffusion Buffer for Online Generative Speech Enhancement	Bunlong Lay et.al.	2510.18744	null
2025-10-21	SSD: Spatial-Semantic Head Decoupling for Efficient Autoregressive Image Generation	Siyong Jian et.al.	2510.18716	null
2025-10-20	OmniCast: A Masked Latent Diffusion Model for Weather Forecasting Across Time Scales	Tung Nguyen et.al.	2510.18707	null
2025-10-23	A Renaissance of Explicit Motion Information Mining from Transformers for Action Recognition	Peiqin Zhuang et.al.	2510.18705	null
2025-10-21	UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation	Yibin Wang et.al.	2510.18701	null
2025-10-21	MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation	Weinan Jia et.al.	2510.18692	null
2025-10-21	A diffuse-interface method for the containerless freezing of three-phase flows in complex geometries	Jiangxu Huang et.al.	2510.18688	null
2025-10-21	Hydrogen redistribution in Zr-base cladding under gradients in temperature and stress	Lars O. Jernkvist et.al.	2510.18685	null
2025-10-21	Defect Landscape of Orthorhombic Ba $_2$In$_2$O$_5$ from First-Principles Calculations: The Role of Oxygen Interstitials	Rachele Sciotto et.al.	2510.18602	null
2025-10-21	Stability Criteria and Optoelectronic Properties of Mg3ZBr3 (Z = As, Sb, Bi) Perovskites for Evaluating the Performance in PIN Photo Diode	Md Mohiuddin et.al.	2510.18579	null
2025-10-21	Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model	Zhenxing Zhang et.al.	2510.18573	null
2025-10-21	RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation	Junwen Huang et.al.	2510.18521	null
2025-10-21	Quantum Reversibility Meets Classical Reverse Diffusion	Ryota Nasu et.al.	2510.18512	null
2025-10-21	GaN-based Resonant Cavity LEDs Fabricated by Photo-Electrochemical Etching and Micro-Transfer Printing	Huanqing Chen et.al.	2510.18507	null
2025-10-21	How Efficient Are Diffusion Language Models? A Critical Examination of Efficiency Evaluation Practices	Han Peng et.al.	2510.18480	null
2025-10-21	Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation	Daniel Bethell et.al.	2510.18478	null
2025-10-21	Smoothed Dissipative Particle Dynamics for Mesoscale Advection-Diffusion-Reaction Problems	Marina Echeverria Ferrero et.al.	2510.18458	null
2025-10-21	Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models	Tianci Bi et.al.	2510.18457	null
2025-10-21	LAND: Lung and Nodule Diffusion for 3D Chest CT Synthesis with Anatomical Guidance	Anna Oliveras et.al.	2510.18446	null
2025-10-21	ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization	Yuanhe Guo et.al.	2510.18433	null
2025-10-21	Technomolecular Materials: 3D Printed 2D Nanosheets with Self Patterned Electrodes	Hicham Hamoudi et.al.	2510.18389	null
2025-10-21	Aqueous Preparation of CsPbBr3 Perovskite Nanocrystals Under Ambient Conditio	Zhaoyi Du et.al.	2510.18366	null
2025-10-22	FeatureFool: Zero-Query Fooling of Video Models via Feature Map	Duoxun Tang et.al.	2510.18362	null
2025-10-21	Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback	Yi-Lun Wu et.al.	2510.18353	null
2025-10-21	Scalable, Explainable and Provably Robust Anomaly Detection with One-Step Flow Matching	Zhong Li et.al.	2510.18328	null
2025-10-22	OmniNWM: Omniscient Driving Navigation World Models	Bohan Li et.al.	2510.18313	null
2025-10-21	Fluctuations in first passage times and utility of resetting protocol in biochemical systems with two-state toggling	Hillol Kumar Barman et.al.	2510.18309	null
2025-10-21	GeoDiff: Geometry-Guided Diffusion for Metric Depth Estimation	Tuan Pham et.al.	2510.18291	null
2025-10-21	Efficient Few-shot Identity Preserving Attribute Editing for 3D-aware Deep Generative Models	Vishal Vinod et.al.	2510.18287	null
2025-10-21	Multiplex Networks Provide Structural Pathways for Social Contagion in Rural Social Networks	Yongren Shi et.al.	2510.18280	null
2025-10-21	From Competition to Synergy: Unlocking Reinforcement Learning for Subject-Driven Image Generation	Ziwei Huang et.al.	2510.18263	null
2025-10-21	SKYSURF-11: A New Zodiacal Light Model Optimized for Optical Wavelengths	Rosalia O’Brien et.al.	2510.18231	null
2025-10-21	The origins of the leakage currents of p-n junction and Schottky diodes in all kinds of materials: A novel explanation based on impurity-photovoltaic-effect due to the self-absorption of the room-temperature infrared emission from materials	Jianming Li et.al.	2510.18226	null
2025-10-21	An Explicit Euler-type Scheme for Lévy-driven SDEs with Superlinear and Time-Irregular Coefficients	Sani Biswas et.al.	2510.18222	null
2025-10-21	Eddy thermal diffusivity model and mean temperature profiles in turbulent vertical convection	Ho Yin Ng et.al.	2510.18220	null
2025-10-21	Estimation of a Gas Diffusion Coefficient by Fitting Molecular Dynamics Trajectories to Finite-Difference Simulations	Isaac Viviano et.al.	2510.18191	null
2025-10-21	A Generalizable Light Transport 3D Embedding for Global Illumination	Bing Xu et.al.	2510.18189	null
2025-10-20	Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model	Yihong Dong et.al.	2510.18165	null
2025-10-20	World-in-World: World Models in a Closed-Loop World	Jiahan Zhang et.al.	2510.18135	null
2025-10-20	HyperDiffusionFields (HyDiF): Diffusion-Guided Hypernetworks for Learning Implicit Molecular Neural Fields	Sudarshan Babu et.al.	2510.18122	null
2025-10-20	Constraints on the Correlation of IceCube Neutrinos with Tracers of Large-Scale Structure	R. Abbasi et.al.	2510.18119	null
2025-10-20	Latent Discrete Diffusion Models	Dario Shariatian et.al.	2510.18114	null
2025-10-20	Extraplanar emission in isolated edge-on late-type galaxies.II. The H $α$ kinematics	Minerva M. Sardaneta et.al.	2510.18110	null
2025-10-20	Fokas method for linear convection-diffusion equation with time-dependent coefficients and its extension to other evolution equations	Konstantinos Kalimeris et.al.	2510.18100	null
2025-10-20	Planned Diffusion	Daniel Israel et.al.	2510.18087	null
2025-10-22	Chimera: Compositional Image Generation using Part-based Concepting	Shivam Singh et.al.	2510.18083	null
2025-10-20	Fine-tuning Flow Matching Generative Models with Intermediate Feedback	Jiajun Fan et.al.	2510.18072	null
2025-10-20	SPACeR: Self-Play Anchoring with Centralized Reference Models	Wei-Jer Chang et.al.	2510.18060	null
2025-10-20	HouseTour: A Virtual Real Estate A(I)gent	Ata Çelen et.al.	2510.18054	null
2025-10-20	Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models	Jiajun Fan et.al.	2510.18053	null
2025-10-20	On the Harnack inequality for time-fractional and more general non-local in time subdiffusion equations	Katarzyna Ryszewska et.al.	2510.17992	null
2025-10-20	Demystifying Transition Matching: When and Why It Can Beat Flow Matching	Jaihoon Kim et.al.	2510.17991	null
2025-10-20	Exogeological inferences from white dwarf pollutants: the impact of stellar physics	Andrew M. Buchan et.al.	2510.17985	null
2025-10-20	Investigating the mysterious nature of 1LHAASO J1740+0948u through deep XMM-Newton observations	G. Brunelli et.al.	2510.17970	null
2025-10-20	UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action	Yuhao Yang et.al.	2510.17790	null
2025-10-20	Inference-Time Compute Scaling For Flow Matching	Adam Stecklov et.al.	2510.17786	null
2025-10-20	VERA-V: Variational Inference Framework for Jailbreaking Vision-Language Models	Qilin Liao et.al.	2510.17759	null
2025-10-20	Can Image-To-Video Models Simulate Pedestrian Dynamics?	Aaron Appelle et.al.	2510.17731	null
2025-10-20	MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues	Yaning Pan et.al.	2510.17722	null
2025-10-20	GAS: Improving Discretization of Diffusion ODEs via Generalized Adversarial Solver	Aleksandr Oganov et.al.	2510.17699	null
2025-10-20	Quantum Synthetic Data Generation for Industrial Bioprocess Monitoring	Shawn M. Gibford et.al.	2510.17688	null
2025-10-20	Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning	Min Cao et.al.	2510.17685	null
2025-10-20	Handling Extreme Class Imbalance: Using GANs in Data Augmentation for Suicide Prediction	Vaishnavi Visweswaraiah et.al.	2510.17661	null
2025-10-21	PDE-Free Mass-Constrained Learning of Complex Systems with Hidden States: The crowd dynamics case	Gianmaria Viola et.al.	2510.17657	null
2025-10-20	Wild regenerative block bootstrap for Harris recurrent Markov chains	Kyuseong Choi et.al.	2510.17648	null
2025-10-20	Formation of clusters and coarsening in weakly interacting diffusions	Nicolai Gerber et.al.	2510.17629	null
2025-10-20	CaMiT: A Time-Aware Car Model Dataset for Classification and Generation	Frédéric LIN et.al.	2510.17626	null
2025-10-20	GUIDE: Enhancing Gradient Inversion Attacks in Federated Learning with Denoising Models	Vincenzo Carletti et.al.	2510.17621	null
2025-10-20	Non-asymptotic error bounds for probability flow ODEs under weak log-concavity	Gitte Kremling et.al.	2510.17608	null
2025-10-20	Macroscopic fluctuation-response theory and its use for gene regulatory networks	Timur Aslyamov et.al.	2510.17587	null
2025-10-20	Starspots as the origin of ultrafast drifting radio bursts from an active M dwarf	Jiale Zhang et.al.	2510.17547	null
2025-10-20	Highlights from the IceCube Neutrino Observatory	Alexander Kappes et.al.	2510.17523	null
2025-10-20	MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models	Yongshun Zhang et.al.	2510.17519	null
2025-10-20	Towards geological inference with process-based and deep generative modeling, part 2: inversion of fluvial deposits and latent-space disentanglement	Guillaume Rongier et.al.	2510.17478	null
2025-10-20	A unified relative entropy framework for macroscopic limits of Vlasov–Fokker–Planck equations	Young-Pil Choi et.al.	2510.17455	null
2025-10-20	Active Inference for an Intelligent Agent in Autonomous Reconnaissance Missions	Johan Schubert et.al.	2510.17450	null
2025-10-20	Ionic current rectification under concentration gradients and its application in evaluating surface charge properties of micropores	Long Ma et.al.	2510.17443	null
2025-10-20	Electrical properties of PbS films doped with iodine by chemical bath deposition	T. B. Charikova et.al.	2510.17441	null
2025-10-20	Diffusion Models as Dataset Distillation Priors	Duo Su et.al.	2510.17421	null
2025-10-20	A Conditional Diffusion Model for Probabilistic Prediction of Battery Capacity Degradation	Hequn Li et.al.	2510.17414	null
2025-10-20	Collective dynamics in holographic fractonic solids	Ling-Zheng Xia et.al.	2510.17404	null
2025-10-20	Latent Spaces Beyond Synthesis: From GANs to Diffusion Models	Ludovica Schaerf et.al.	2510.17383	null
2025-10-20	Beyond Binary Out-of-Distribution Detection: Characterizing Distributional Shifts with Multi-Statistic Diffusion Trajectories	Achref Jaziri et.al.	2510.17381	null
2025-10-20	CharDiff: A Diffusion Model with Character-Level Guidance for License Plate Image Restoration	Gyuhwan Park et.al.	2510.17330	null
2025-10-20	Optimal error estimates of the diffuse domain method for semilinear parabolic equations	Yuejin Xu et.al.	2510.17319	null
2025-10-20	A Tractography Analysis Framework Using Diffusion Maps to Study Thalamic Connectivity in Traumatic Brain Injury	Akul Sharma et.al.	2510.17273	null
2025-10-20	From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models	Zefan Cai et.al.	2510.17247	null
2025-10-21	On Efficiency-Effectiveness Trade-off of Diffusion-based Recommenders	Wenyu Mao et.al.	2510.17245	null
2025-10-20	Soft-Masked Diffusion Language Models	Michael Hersche et.al.	2510.17206	null
2025-10-20	HIDISC: A Hyperbolic Framework for Domain Generalization with Generalized Category Discovery	Vaibhav Rathore et.al.	2510.17188	null
2025-10-20	Generalized Group Selection Strategies for Self-sustainable RIS-aided Communication	Lakshmikanta Sau et.al.	2510.17176	null
2025-10-20	Generation then Reconstruction: Accelerating Masked Autoregressive Models via Two-Stage Sampling	Feihong Yan et.al.	2510.17171	null
2025-10-20	KineDiff3D: Kinematic-Aware Diffusion for Category-Level Articulated Object Shape Reconstruction and Generation	WenBo Xu et.al.	2510.17137	null
2025-10-20	In-situ Autoguidance: Eliciting Self-Correction in Diffusion Models	Enhao Gu et.al.	2510.17136	null
2025-10-20	PorousGen: An Efficient Algorithm for Generating Porous Structures with Accurate Porosity and Uniform Density Distribution	Shota Arai et.al.	2510.17133	null
2025-10-20	Do LLMs Recognize Your Latent Preferences? A Benchmark for Latent Information Discovery in Personalized Interaction	Ioannis Tsaknakis et.al.	2510.17132	null
2025-10-20	GOOD: Training-Free Guided Diffusion Sampling for Out-of-Distribution Detection	Xin Gao et.al.	2510.17131	null
2025-10-20	Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control	Chengxiu Hua et.al.	2510.17122	null
2025-10-20	Boosting Fidelity for Pre-Trained-Diffusion-Based Low-Light Image Enhancement via Condition Refinement	Xiaogang Xu et.al.	2510.17105	null
2025-10-20	GSPlane: Concise and Accurate Planar Reconstruction via Structured Representation	Ruitong Gan et.al.	2510.17095	null
2025-10-20	Compressible subalgebras in II $_1$ factors	Sorin Popa et.al.	2510.17076	null
2025-10-19	Turbulent transport mechanisms in long-lived stable Ekman layers	K. Chand et.al.	2510.17046	null
2025-10-19	HelioFill: Diffusion-Based Model for EUV Reconstruction of the Solar Farside	Firas Ben Ameur et.al.	2510.17012	null
2025-10-19	An empirical study of the effect of video encoders on Temporal Video Grounding	Ignacio M. De la Jara et.al.	2510.17007	null
2025-10-19	The subtlety of the outermost stellarator magnetic surface	Alkesh Punjabi et.al.	2510.16999	null
2025-10-19	Adaptive Deterministic Flow Matching for Target Speaker Extraction	Tsun-An Hsieh et.al.	2510.16995	null
2025-10-19	Graph4MM: Weaving Multimodal Learning with Structural Information	Xuying Ning et.al.	2510.16990	null
2025-10-19	Integrating Metaverse Technologies in Medical Education: Examining Acceptance Factors Among Current and Future Healthcare Providers	Seckin Damar et.al.	2510.16984	null
2025-10-19	One-step Diffusion Models with Bregman Density Ratio Matching	Yuanzhi Zhu et.al.	2510.16983	null
2025-10-19	A first-principles investigation of the diffusivities of oxygen and oxygen defects in ThO $_2$	Maniesha Singh et.al.	2510.16982	null
2025-10-19	Quantile Regression, Variational Autoencoders, and Diffusion Models for Uncertainty Quantification: A Spatial Analysis of Sub-seasonal Wind Speed Prediction	Ganglin Tian et.al.	2510.16958	null
2025-10-19	On a repulsion model with Coulomb interaction and nonlinear mobility	Antonin Chodron de Courcel et.al.	2510.16894	null
2025-10-21	Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback	Zongjian Li et.al.	2510.16888	null
2025-10-19	Class-N-Diff: Classification-Induced Diffusion Model Can Make Fair Skin Cancer Diagnosis	Nusrat Munia et.al.	2510.16887	null
2025-10-19	From Mannequin to Human: A Pose-Aware and Identity-Preserving Video Generation Framework for Lifelike Clothing Display	Xiangyu Mu et.al.	2510.16833	null
2025-10-19	Strong error analysis and first-order convergence of Milstein-type schemes for McKean-Vlasov SDEs with superlinear coefficients	Jingtao Zhu et.al.	2510.16801	null
2025-10-19	Personalized Image Filter: Mastering Your Photographic Style	Chengxuan Zhu et.al.	2510.16791	null
2025-10-19	Estimating Flux Densities of Diffuse Cosmological Radio Sources Exploiting Vision Transformers	Nicoletta Sanvitale et.al.	2510.16758	null
2025-10-19	Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling	Erik Riise et.al.	2510.16751	null
2025-10-19	HumanCM: One Step Human Motion Prediction	Liu Haojie et.al.	2510.16709	null
2025-10-19	High-Dimensional Privacy-Utility Dynamics of Noisy Stochastic Gradient Descent on Least Squares	Shurong Lin et.al.	2510.16687	null
2025-10-19	Active Target Discovery under Uninformative Prior: The Power of Permanent and Transient Memory	Anindya Sarkar et.al.	2510.16676	null
2025-10-18	A class of singular control problems with tipping points	Jean-Paul Décamps et.al.	2510.16599	null
2025-10-18	Patronus: Safeguarding Text-to-Image Models against White-Box Adversaries	Xinfeng Li et.al.	2510.16581	null
2025-10-18	Fit for Purpose? Deepfake Detection in the Real World	Guangyu Lin et.al.	2510.16556	null
2025-10-18	Free energy Wasserstein gradient flow and their particle counterparts: toy model, (degenerate) PL inequalities and exit times	Pierre Monmarché et.al.	2510.16506	null
2025-10-18	High-order temporal parametric finite element methods for simulating solid-state dewetting	Xiaowen Gan et.al.	2510.16493	null
2025-10-18	Single-Step Digital Backpropagation for O-band Coherent Transmission Systems	Romulo Aparecido et.al.	2510.16482	null
2025-10-18	Vacancy-concentration-dependent thermal stability of fcc-(Ti,Al)Nx predicted via chemical-environment-sensitive diffusion activation energies	Ganesh Kumar Nayak et.al.	2510.16467	null
2025-10-18	The hard membrane process and transport barriers of turbulent flows	Olga Aryasova et.al.	2510.16456	null
2025-10-18	Tamed Euler approximation for fully superlinear growth McKean-Vlasov SDE and their particle systems: sharp rates for strong propagation of chaos, convergence and ergodicity	Simran Soni et.al.	2510.16427	null
2025-10-18	Determining the space dependent coefficients in space-time fractional diffusion equations via Krylov preconditioning	Asim Ilyas et.al.	2510.16425	null
2025-10-18	Iterative solvers for partial differential equations with dissipative structure: Operator preconditioning and optimal control	Volker Mehrmann et.al.	2510.16399	null
2025-10-18	Integrating LLM and Diffusion-Based Agents for Social Simulation	Xinyi Li et.al.	2510.16366	null
2025-10-18	Beyond Fixed Anchors: Precisely Erasing Concepts with Sibling Exclusive Counterparts	Tong Zhang et.al.	2510.16342	null
2025-10-18	Hele-Shaw flow with surface tension and kinetic undercooling as a sharp interface limit of a fully parabolic Patlak-Keller-Segel system with nonlinear diffusion	Michael Rozowski et.al.	2510.16339	null
2025-10-18	TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement	Haiyue Sun et.al.	2510.16332	null
2025-10-18	DiffusionX: Efficient Edge-Cloud Collaborative Image Generation with Multi-Round Prompt Evolution	Yi Wei et.al.	2510.16326	null
2025-10-18	Scale-DiT: Ultra-High-Resolution Image Generation with Hierarchical Local Attention	Yuyao Zhang et.al.	2510.16325	null
2025-10-18	Time-Embedded Algorithm Unrolling for Computational MRI	Junno Yun et.al.	2510.16321	null
2025-10-18	Scaling Laws for Deepfake Detection	Wenhao Wang et.al.	2510.16320	null
2025-10-18	Scaffold-Aware Generative Augmentation and Reranking for Enhanced Virtual Screening	Xin Wang et.al.	2510.16306	null
2025-10-18	Parameter Identifiability of RNA Dynamics in PDE Transport Models of Fluorescence Recovery After Photobleaching	Qinyu Xu et.al.	2510.16304	null
2025-10-18	Multiwavelength spectroscopic observations of a quiescent prominence	Jianchao Xue et.al.	2510.16288	null
2025-10-18	A study of general reaction-advection-diffusion equations describing dynamics between target, partaker, and guardian	Madi Yerlanov et.al.	2510.16286	null
2025-10-17	Functional Spectral Imaging by Ultrasound (FSIU): A Spectral-Theoretic Basis for Functional Ultrasound	Cesar Mello Fernando Medina da Cunha et.al.	2510.16256	null
2025-10-17	DNA Nanostructures Characterized via Dual Nanopore Resensing	Wangwei Dong et.al.	2510.16238	null
2025-10-17	Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI	Zheng Huang et.al.	2510.16196	null
2025-10-17	Alignment is Localized: A Causal Probe into Preference Layers	Archie Chaudhury et.al.	2510.16167	null
2025-10-17	AtomBench: A Benchmark for Generative Atomic Structure Models using GPT, Diffusion, and Flow Architectures	Charles Rhys Campbell et.al.	2510.16165	null
2025-10-17	GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer	Sayan Deb Sarkar et.al.	2510.16136	null
2025-10-17	Neutron Star-Main Sequence Collisions Robustly Form Dynamically Stable Thorne-Żytkow Objects	Lauryn E. Williams et.al.	2510.16129	null
2025-10-17	Learning density ratios in causal inference using Bregman-Riesz regression	Oliver J. Hines et.al.	2510.16127	null
2025-10-17	Effective cosmic ray diffusion in multiphase galactic environments	Timon Thomas et.al.	2510.16125	null
2025-10-17	Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery	Jie-Ying Lee et.al.	2510.15869	null
2025-10-17	LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal	Shr-Ruei Tsai et.al.	2510.15868	null
2025-10-17	Quantum Monte Carlo Calculations of Light Nuclei with Fully Propagated Theoretical Uncertainties	Ryan Curry et.al.	2510.15860	null
2025-10-17	BLIP3o-NEXT: Next Frontier of Native Image Generation	Jiuhai Chen et.al.	2510.15857	null
2025-10-17	VISTA: A Test-Time Self-Improving Video Generation Agent	Do Xuan Long et.al.	2510.15831	null
2025-10-17	Error analysis of a compositional score-based algorithm for simulation-based inference	Camille Touron et.al.	2510.15817	null
2025-10-17	ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection	Haowei Zhu et.al.	2510.15783	null
2025-10-17	Controlling the image generation process with parametric activation functions	Ilia Pavlov et.al.	2510.15778	null
2025-10-17	QSilk: Micrograin Stabilization and Adaptive Quantile Clipping for Detail-Friendly Latent Diffusion	Denis Rychkovskiy et.al.	2510.15761	null
2025-10-17	NDM: A Noise-driven Detection and Mitigation Framework against Implicit Sexual Intentions in Text-to-Image Generation	Yitong Sun et.al.	2510.15752	null
2025-10-17	Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset	Qingyan Bai et.al.	2510.15742	null
2025-10-17	Attention Sinks in Diffusion Language Models	Maximo Eduardo Rulli et.al.	2510.15731	null
2025-10-17	Real-Time Modeling of Skyrmion Dynamics in Arbitrary 2D Spatially Dependent Pinning Potential Landscapes	Simon M. Fröhlich et.al.	2510.15713	null
2025-10-17	Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis	Junzhi Ning et.al.	2510.15710	null
2025-10-17	All-Dielectric Photo-thermo-optical Metasurfaces for Thermal Landscaping at the Nanoscale	Gopal Narmada Naidu et.al.	2510.15697	null
2025-10-17	Deep Neural ODE Operator Networks for PDEs	Ziqian Li et.al.	2510.15651	null
2025-10-17	Derivation and quasi-invariant asymptotics of phenotype-structured integro-differential models	Emanuele Bernardi et.al.	2510.15646	null
2025-10-17	Simulating the LOcal Web (SLOW) - VI: $γ$ -ray Emission in the Local Universe	Ludwig M. Böss et.al.	2510.15634	null
2025-10-17	Time evolution of the Husimi and Glauber-Sudarshan functions in terms of complementary Hamiltonian symbols	Mritunjay Tyagi et.al.	2510.15628	null
2025-10-17	High order Tensor-Train-Based Schemes for High-Dimensional Mean Field Games	Elisabetta Carlini et.al.	2510.15603	null
2025-10-17	Surface diffusion of phosphorus on Si(100) after PBr3 adsorption	T. V. Pavlova et.al.	2510.15599	null
2025-10-17	Visualizing anomalous exciton diffusion dynamics in TMDCs using transient scattering microscopy: the role of trap states and Auger recombination	Enrique Arévalo Rodríguez et.al.	2510.15587	null
2025-10-17	Lightweight CycleGAN Models for Cross-Modality Image Transformation and Experimental Quality Assessment in Fluorescence Microscopy	Mohammad Soltaninezhad et.al.	2510.15579	null
2025-10-17	Imaginarium: Vision-guided High-Quality 3D Scene Layout Generation	Xiaoming Zhu et.al.	2510.15564	null
2025-10-17	Diffusion Bridge Networks Simulate Clinical-grade PET from MRI for Dementia Diagnostics	Yitong Li et.al.	2510.15556	null
2025-10-17	VO-DP: Semantic-Geometric Adaptive Diffusion Policy for Vision-Only Robotic Manipulation	Zehao Ni et.al.	2510.15530	null
2025-10-17	Exploring Conditions for Diffusion models in Robotic Control	Heeseong Shin et.al.	2510.15510	null
2025-10-17	VDRive: Leveraging Reinforced VLA and Diffusion Policy for End-to-end Autonomous Driving	Ziang Guo et.al.	2510.15446	null
2025-10-17	Safe, Efficient, and Robust Reinforcement Learning for Ranking and Diffusion Models	Shashank Gupta et.al.	2510.15429	null
2025-10-17	Robust High-Resolution Multi-Organ Diffusion MRI Using Synthetic-Data-Tuned Prompt Learning	Chen Qian et.al.	2510.15400	null
2025-10-17	LILAC: Long-sequence Incremental Low-latency Arbitrary Motion Stylization via Streaming VAE-Diffusion with Causal Decoding	Peng Ren et.al.	2510.15392	null
2025-10-17	Iterative Refinement of Flow Policies in Probability Space for Online Reinforcement Learning	Mingyang Sun et.al.	2510.15388	null
2025-10-17	Towards Robust Zero-Shot Reinforcement Learning	Kexin Zheng et.al.	2510.15382	null
2025-10-17	Friction-controlled reentrant aging and fluidization in granular materials	Ye Yuan et.al.	2510.15360	null
2025-10-17	Dynamic Spatial Treatment Effects as Continuous Functionals: Theory and Evidence from Healthcare Access	Tatsuru Kikuchi et.al.	2510.15324	null
2025-10-17	Latent Diffusion Model without Variational Autoencoder	Minglei Shi et.al.	2510.15301	null
2025-10-17	Random walk models of anisotropic diffusion on rectangular and hexagonal lattices	Luke P. Filippini et.al.	2510.15291	null
2025-10-17	Global existence and stability in a class of chemotaxis systems with lethal interactions, nonlinear diffusion and production	Gnanasekaran Shanmugasundaram et.al.	2510.15276	null
2025-10-20	DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion	Weijie Wang et.al.	2510.15264	null
2025-10-17	Planner and Executor: Collaboration between Discrete Diffusion And Autoregressive Models in Reasoning	Lina Berrayana et.al.	2510.15244	null
2025-10-17	The Face of Persuasion: Analyzing Bias and Generating Culture-Aware Ads	Aysan Aghazadeh et.al.	2510.15240	null
2025-10-17	Toward Black Scholes for Prediction Markets: A Unified Kernel and Market Maker’s Handbook	Shaw Dalen et.al.	2510.15205	null
2025-10-17	Conditional GLMMs for reaction times in choice tasks	Mauricio Tejo et.al.	2510.15203	null
2025-10-16	Salient Concept-Aware Generative Data Augmentation	Tianchen Zhao et.al.	2510.15194	null
2025-10-16	Policy Transfer Ensures Fast Learning for Continuous-Time LQR with Entropy Regularization	Xin Guo et.al.	2510.15165	null
2025-10-16	Diffusion method in field theories with fakeons	Gianluca Calcagni et.al.	2510.15157	null
2025-10-16	HugAgent: Evaluating LLMs in Simulating Human-Like Individual Reasoning on Open-Ended Tasks	Chance Jiajie Li et.al.	2510.15144	null
2025-10-16	Deep generative priors for 3D brain analysis	Ana Lawry Aguila et.al.	2510.15119	null
2025-10-16	TGT: Text-Grounded Trajectories for Locally Controlled Video Generation	Guofeng Zhang et.al.	2510.15104	null
2025-10-16	Operator Flow Matching for Timeseries Forecasting	Yolanne Yi Ran Lee et.al.	2510.15101	null
2025-10-16	Active Ionic Fluxes Induce Symmetry Breaking in Charge-Patterned Nanochannels	Sergi G. Leyva et.al.	2510.15092	null
2025-10-16	Sequential Comics for Jailbreaking Multimodal Large Language Models via Structured Visual Storytelling	Deyue Zhang et.al.	2510.15068	null
2025-10-16	LoRAverse: A Submodular Framework to Retrieve Diverse Adapters for Diffusion Models	Mert Sonmezer et.al.	2510.15022	null
2025-10-16	Constantly Improving Image Models Need Constantly Improving Benchmarks	Jiaxin Ge et.al.	2510.15021	null
2025-10-16	DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models	Mor Ventura et.al.	2510.15015	null
2025-10-16	Coupled Diffusion Sampling for Training-Free Multi-View Image Editing	Hadi Alzayer et.al.	2510.14981	null
2025-10-16	Learning an Image Editing Model without Image Editing Pairs	Nupur Kumari et.al.	2510.14978	null
2025-10-16	Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation	Shaowei Liu et.al.	2510.14976	null
2025-10-16	WithAnyone: Towards Controllable and ID Consistent Image Generation	Hengyuan Xu et.al.	2510.14975	null
2025-10-16	pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation	Hansheng Chen et.al.	2510.14974	null
2025-10-16	Attention Is All You Need for KV Cache in Diffusion LLMs	Quan Nguyen-Tri et.al.	2510.14973	null
2025-10-16	Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents	Guoqing Wang et.al.	2510.14967	null
2025-10-16	RainDiff: End-to-end Precipitation Nowcasting Via Token-wise Attention Diffusion	Thao Nguyen et.al.	2510.14962	null
2025-10-16	Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models	Jonas Geiping et.al.	2510.14961	null
2025-10-16	RealDPO: Real or Not Real, that is the Preference	Guo Cheng et.al.	2510.14955	null
2025-10-16	OmniMotion: Multimodal Motion Generation with Continuous Masked Autoregression	Zhe Li et.al.	2510.14954	null
2025-10-17	From Language to Locomotion: Retargeting-free Humanoid Control via Motion Latent Guidance	Zhe Li et.al.	2510.14952	null
2025-10-16	DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation	Yu Zhou et.al.	2510.14949	null
2025-10-16	3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation	JoungBin Lee et.al.	2510.14945	null
2025-10-16	X-ray panorama of the SS433/W50 complex by SRG/eROSITA	Rashid Sunyaev et.al.	2510.14938	null
2025-10-18	VT-Refine: Learning Bimanual Assembly with Visuo-Tactile Feedback via Simulation Fine-Tuning	Binghao Huang et.al.	2510.14930	null
2025-10-16	Finite element methods for electroneutral multicomponent electrolyte flows	Aaron Baier-Reinio et.al.	2510.14923	null
2025-10-16	ScaleWeaver: Weaving Efficient Controllable T2I Generation with Multi-Scale Reference Attention	Keli Liu et.al.	2510.14882	null
2025-10-16	TOUCH: Text-guided Controllable Generation of Free-Form Hand-Object Interactions	Guangyi Han et.al.	2510.14874	null
2025-10-16	ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints	Meiqi Wu et.al.	2510.14847	null
2025-10-16	RL-100: Performant Robotic Manipulation with Real-World Reinforcement Learning	Kun Lei et.al.	2510.14830	null
2025-10-16	FraQAT: Quantization Aware Training with Fractional bits	Luca Morreale et.al.	2510.14823	null
2025-10-16	Generalized Reduced Jacobian Method	M. El Maghri et.al.	2510.14785	null
2025-10-16	Inpainting the Red Planet: Diffusion Models for the Reconstruction of Martian Environments in Virtual Reality	Giuseppe Lorenzo Catalano et.al.	2510.14765	null
2025-10-16	Transport with noise in dilute gases: Effect of Langevin thermostat on transport coefficients	Alejandro Alés et.al.	2510.14745	null
2025-10-16	DEXTER: Diffusion-Guided EXplanations with TExtual Reasoning for Vision Models	Simone Carnemolla et.al.	2510.14741	null
2025-10-16	Numerical Studies on the Radio Afterglows in TDE (I): Forward Shock	Guobin Mou et.al.	2510.14715	null
2025-10-16	A Well-Balanced Space-Time ALE Compact Gas-Kinetic Scheme for the Shallow Water Equations on Unstructured Meshes	Fengxiang Zhao et.al.	2510.14673	null
2025-10-16	Diffusion-Free Dynamics in Rotating Spherical Shell Convection Driven By Internal Heating and Cooling	Neil T. Lewis et.al.	2510.14671	null
2025-10-16	Local Particle Acceleration in an ICME-in-Sheath Structure Observed by Solar Orbiter	Xiaomin Chen et.al.	2510.14652	null
2025-10-16	In-Context Learning with Unpaired Clips for Instruction-based Video Editing	Xinyao Liao et.al.	2510.14648	null
2025-10-16	SteeringTTA: Guiding Diffusion Trajectories for Robust Test-Time-Adaptation	Jihyun Yu et.al.	2510.14634	null
2025-10-16	Adapting Self-Supervised Representations as a Latent Space for Efficient Generation	Ming Gui et.al.	2510.14630	null
2025-10-16	GOPLA: Generalizable Object Placement Learning via Synthetic Augmentation of Human Arrangement	Yao Zhong et.al.	2510.14627	null
2025-10-16	Accelerated Multi-Modal Motion Planning Using Context-Conditioned Diffusion Models	Edward Sandra et.al.	2510.14615	null
2025-10-16	Modeling Diffusion and Permeation Across the Stratum Corneum Lipid Barrier	Rinto Thomas et.al.	2510.14606	null
2025-10-19	STANCE: Motion Coherent Video Generation Via Sparse-to-Dense Anchored Encoding	Zhifei Chen et.al.	2510.14588	null
2025-10-16	Consistent text-to-image generation via scene de-contextualization	Song Tang et.al.	2510.14553	null
2025-10-16	Revisiting electron-capture decay for Galactic cosmic-ray data	M. Borchiellini et.al.	2510.14544	null
2025-10-16	Exploring Image Representation with Decoupled Classical Visual Descriptors	Chenyuan Qu et.al.	2510.14536	null
2025-10-16	Noise Projection: Closing the Prompt-Agnostic Gap Behind Text-to-Image Misalignment in Diffusion Models	Yunze Tong et.al.	2510.14526	null
2025-10-16	Structured Random Phase Retrieval using Optical Diffusers	Zhiyuan Hu et.al.	2510.14490	null
2025-10-16	From Guess2Graph: When and How Can Unreliable Experts Safely Boost Causal Discovery in Finite Samples?	Sujai Hiremath et.al.	2510.14488	null
2025-10-16	ALP couplings to muons and electrons: a comprehensive analysis of supernova bounds	Ricardo Z. Ferreira et.al.	2510.14469	null
2025-10-16	Restoring Noisy Demonstration for Imitation Learning With Diffusion Models	Shang-Fu Chen et.al.	2510.14467	null
2025-10-16	Unsupervised Deep Generative Models for Anomaly Detection in Neuroimaging: A Systematic Scoping Review	Youwan Mahé et.al.	2510.14462	null
2025-10-16	Cryogenic temperature dependence and hysteresis of surface-trap-induced gate leakage in GaN high-electron-mobility transistors	Ching-Yang Pan et.al.	2510.14456	null
2025-10-16	Towards geological inference with process-based and deep generative modeling, part 1: training on fluvial deposits	Guillaume Rongier et.al.	2510.14445	null
2025-10-16	Accounting for the absence of anomalous microwave emission in the M 31 halo	Francesco De Paolis et.al.	2510.14441	null
2025-10-16	The Tracy-Widom distribution at large Dyson index	Alain Comtet et.al.	2510.14433	null
2025-10-16	Deep Compositional Phase Diffusion for Long Motion Sequence Generation	Ho Yin Au et.al.	2510.14427	null
2025-10-16	Dynamic Spatial Treatment Effect Boundaries: A Continuous Functional Framework from Navier-Stokes Equations	Tatsuru Kikuchi et.al.	2510.14409	null
2025-10-16	Multiscale Models For Perovskite Optimisation	Philippe Baranek et.al.	2510.14396	null
2025-10-16	DOS: Directional Object Separation in Text Embeddings for Multi-Object Image Generation	Dongnam Byun et.al.	2510.14376	null
2025-10-16	A Multi-domain Image Translative Diffusion StyleGAN for Iris Presentation Attack Detection	Shivangi Yadav et.al.	2510.14314	null
2025-10-16	Propagation speed of traveling waves for diffusive Lotka-Volterra system with strong competition	Ken-Ichi Nakamura et.al.	2510.14311	null
2025-10-16	Nonparametric Data Attribution for Diffusion Models	Yutian Zhao et.al.	2510.14269	null
2025-10-16	Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning	Xiangyu Meng et.al.	2510.14256	null
2025-10-16	Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization	Liao Shen et.al.	2510.14255	null
2025-10-16	PIA: Deepfake Detection Using Phoneme-Temporal and Identity-Dynamic Analysis	Soumyya Kanti Datta et.al.	2510.14241	null
2025-10-16	LOTA: Bit-Planes Guided AI-Generated Image Detection	Hongsong Wang et.al.	2510.14230	null
2025-10-16	Contrastive Diffusion Alignment: Learning Structured Latents for Controllable Generation	Ruchi Sandilya et.al.	2510.14190	null
2025-10-16	Virtually Being: Customizing Camera-Controllable Video Diffusion Models with Multi-View Performance Captures	Yuancheng Xu et.al.	2510.14179	null
2025-10-15	Briding Diffusion Posterior Sampling and Monte Carlo methods: a survey	Yazid Janati et.al.	2510.14114	null
2025-10-15	DiffLoc: Diffusion Model-Based High-Precision Positioning for 6G Networks	Taekyun Lee et.al.	2510.14111	null
2025-10-15	TENDE: Transfer Entropy Neural Diffusion Estimation	Simon Pedro Galeano Munoz et.al.	2510.14096	null
2025-10-15	Neural Network approximation power on homogeneous and heterogeneous reaction-diffusion equations	Haotian Feng et.al.	2510.14094	null
2025-10-15	DiffOPF: Diffusion Solver for Optimal Power Flow	Milad Hoseinpour et.al.	2510.14075	null
2025-10-15	CausalVerse: Benchmarking Causal Representation Learning with Configurable High-Fidelity Simulations	Guangyi Chen et.al.	2510.14049	null
2025-10-15	Quantum Damping of Cosmological Shear: A New Prediction from Loop Quantum Cosmologies	Wen-Cong Gan et.al.	2510.14021	null
2025-10-15	A Diffusion-Refined Planner with Reinforcement Learning Priors for Confined-Space Parking	Mingyang Jiang et.al.	2510.14000	null
2025-10-15	Sequential Quantum Measurements and the Instrumental Group Algebra	Christopher S. Jackson et.al.	2510.13980	null
2025-10-15	Modelling the nebular emission of galaxies across cosmic time with COLT	William McClymont et.al.	2510.13952	null
2025-10-15	Bell Instability and Cosmic-Ray Acceleration in AGN Ultrafast Outflow Shocks	Rei Nishiura et.al.	2510.13946	null
2025-10-15	Probing Quadratically Coupled Ultralight Dark Matter with Pulsar Timing Arrays	Xucheng Gan et.al.	2510.13945	null
2025-10-15	PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning	Sihui Ji et.al.	2510.13809	null
2025-10-15	Generative Universal Verifier as Multimodal Meta-Reasoner	Xinchen Zhang et.al.	2510.13804	null
2025-10-15	NoisePrints: Distortion-Free Watermarks for Authorship in Private Diffusion Models	Nir Goren et.al.	2510.13793	null
2025-10-15	Adaptive Visual Conditioning for Semantic Consistency in Diffusion-Based Story Continuation	Seyed Mohammad Mousavi et.al.	2510.13787	null
2025-10-15	PriorGuide: Test-Time Prior Adaptation for Simulation-Based Inference	Yang Yang et.al.	2510.13763	null
2025-10-15	Strong solution for polymeric fluid-structure interaction with small initial acceleration	Prince Romeo Mensah et.al.	2510.13753	null
2025-10-15	UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy	Tianshuo Xu et.al.	2510.13745	null
2025-10-15	Comparing subgrid models for cosmic ray diffusion in a magnetized isolated galaxy simulation	Sarah Thiele et.al.	2510.13737	null
2025-10-15	Cyclic Self-Supervised Diffusion for Ultra Low-field to High-field MRI Synthesis	Zhenxuan Zhang et.al.	2510.13735	null
2025-10-15	Internal Diffusion Limited Aggregation with Critical Branching Random Walks	Amine Asselah et.al.	2510.13733	null
2025-10-15	MVCustom: Multi-View Customized Diffusion via Geometric Latent Rendering and Completion	Minjung Shin et.al.	2510.13702	null
2025-10-15	Generating healthy counterfactuals with denoising diffusion bridge models	Ana Lawry Aguila et.al.	2510.13684	null
2025-10-15	FlashWorld: High-quality 3D Scene Generation within Seconds	Xinyang Li et.al.	2510.13678	null
2025-10-15	CanvasMAR: Improving Masked Autoregressive Video Generation With Canvas	Zian Li et.al.	2510.13669	null
2025-10-15	On the energy image density conjecture of Bouleau and Hirsch	Sylvester Eriksson-Bique et.al.	2510.13659	null
2025-10-15	Manifold Decoders: A Framework for Generative Modeling from Nonlinear Embeddings	Riddhish Thakare et.al.	2510.13622	null
2025-10-15	Stochastic Burgers Equation from Non-Product Stationary Measures via a Generalised Second-Order Boltzmann-Gibbs Principle	Patrícia Gonçalves et.al.	2510.13549	null
2025-10-15	Magnetomechanical Coupling in Ferronematic Phases: Influence of Spindle-Shaped Nanodopants on Liquid Crystalline Order	Karin Koch et.al.	2510.13513	null
2025-10-15	Eddy viscosity by Lévy transport noises	Dejun Luo et.al.	2510.13463	null
2025-10-15	VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator	Hyojun Go et.al.	2510.13454	null
2025-10-15	GO-Diff: Data-free and amortized global structure optimization	Nikolaj Rønne et.al.	2510.13448	null
2025-10-15	Steerable Conditional Diffusion for Domain Adaptation in PET Image Reconstruction	George Webber et.al.	2510.13441	null
2025-10-15	Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter	Jianhui Zhang et.al.	2510.13419	null
2025-10-15	Reinforcement Learning Meets Masked Generative Models: Mask-GRPO for Text-to-Image Generation	Yifu Luo et.al.	2510.13418	null
2025-10-15	Wettability from Diffusion: A Universal Molecular Scaling Law	Lorenzo Agosta et.al.	2510.13338	null
2025-10-15	Tactile-Conditioned Diffusion Policy for Force-Aware Robotic Manipulation	Erik Helmut et.al.	2510.13324	null
2025-10-15	Km-scale dynamical downscaling through conformalized latent diffusion models	Alessandro Brusaferri et.al.	2510.13301	null
2025-10-15	Federated Conditional Conformal Prediction via Generative Models	Rui Xu et.al.	2510.13297	null
2025-10-15	Higher Satisfaction, Lower Cost: A Technical Report on How LLMs Revolutionize Meituan’s Intelligent Interaction Systems	Xuxin Cheng et.al.	2510.13291	null
2025-10-15	Curvature penalization of strongly anisotropic interfaces models and their phase-field approximation	Jean-François Babadjian et.al.	2510.13275	null
2025-10-15	The Boltzmann equation on smooth and cylindrical domains with Maxwell boundary conditions	Richard Medina Rodriguez et.al.	2510.13260	null
2025-10-15	End-to-End Multi-Modal Diffusion Mamba	Chunhao Lu et.al.	2510.13253	null
2025-10-16	CymbaDiff: Structured Spatial Diffusion for Sketch-based 3D Semantic Urban Scene Generation	Li Liang et.al.	2510.13245	null
2025-10-15	OS-HGAdapter: Open Semantic Hypergraph Adapter for Large Language Models Assisted Entropy-Enhanced Image-Text Alignment	Rongjun Chen et.al.	2510.13131	null
2025-10-15	Thermal and Electrical Properties of (Cr,Mo,Ta,V,W)C High-Entropy Carbide Ceramics	Ali Sarikhani et.al.	2510.13130	null
2025-10-15	Numerical Cosmology	Romain Teyssier et.al.	2510.13129	null
2025-10-15	On the Reasoning Abilities of Masked Diffusion Language Models	Anej Svete et.al.	2510.13117	null
2025-10-15	Edit-Your-Interest: Efficient Video Editing via Feature Most-Similar Propagation	Yi Zuo et.al.	2510.13084	null
2025-10-15	Counting Hallucinations in Diffusion Models	Shuai Fu et.al.	2510.13080	null
2025-10-14	One Dimensional CNN ECG Mamba for Multilabel Abnormality Classification in 12 Lead ECG	Huawei Jiang et.al.	2510.13046	null
2025-10-14	SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion	Jungbin Cho et.al.	2510.13044	null
2025-10-14	SeqBench: Benchmarking Sequential Narrative Generation in Text-to-Video Models	Zhengxu Tang et.al.	2510.13042	null
2025-10-14	Machine Learning-Based Ultrasonic Weld Characterization Using Hierarchical Wave Modeling and Diffusion-Driven Distribution Alignment	Joshua R. Tempelman et.al.	2510.13023	null
2025-10-14	Continuous-Token Diffusion for Speaker-Referenced TTS in Multimodal LLMs	Xinlu He et.al.	2510.12995	null
2025-10-14	Probing the magneto-ionic medium of the Milky Way using pulsars	Saakshi Dhakal et.al.	2510.12991	null
2025-10-14	Reference-Specific Unlearning Metrics Can Hide the Truth: A Reality Check	Sungjun Cho et.al.	2510.12981	null
2025-10-14	A Connection Between Score Matching and Local Intrinsic Dimension	Eric Yeats et.al.	2510.12975	null
2025-10-14	Cosmic Ray Transport and Gamma-Ray Signatures in the Interstellar Medium	Lucas Barreto-Mota et.al.	2510.12965	null
2025-10-14	CADE 2.5 - ZeResFDG: Frequency-Decoupled, Rescaled and Zero-Projected Guidance for SD/SDXL Latent Diffusion Models	Denis Rychkovskiy et.al.	2510.12954	null
2025-10-14	On the Hierarchy of Scales in Modeling of Weakly Interacting Chains of Atoms	Dmitry Golovaty et.al.	2510.12945	null
2025-10-14	Nonlinear fluctuations for a chain of weakly anharmonic oscillators with stochastic perturbation	Kohei Hayashi et.al.	2510.12922	null
2025-10-14	Dark Matter-Electron Interactions Alter the Luminosity and Spectral Index of M87	Abdelaziz Hussein et.al.	2510.12877	null
2025-10-16	Diffusion models for polarimetric reconstruction of circumstellar environments	Quentin Villegas et.al.	2510.12853	null
2025-10-14	DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search	Kartik Narayan et.al.	2510.12801	null
2025-10-14	DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving	Yingyan Li et.al.	2510.12796	null
2025-10-14	UniFusion: Vision-Language Model as Unified Encoder in Image Generation	Kevin Li et.al.	2510.12789	null
2025-10-14	MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars	Felix Taubner et.al.	2510.12785	null
2025-10-14	FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution	Junhao Zhuang et.al.	2510.12747	null
2025-10-14	Oxygen-vacancy-induced Raman softening in the catalyst Fe $_2$(MoO$_4$)$_3$	Young-Joon Song et.al.	2510.12746	null
2025-10-14	T(R,O) Grasp: Efficient Graph Diffusion of Robot-Object Spatial Transformation for Cross-Embodiment Dexterous Grasping	Xin Fei et.al.	2510.12724	null
2025-10-14	DiffEM: Learning from Corrupted Data with Diffusion Models via Expectation Maximization	Danial Hosseintabar et.al.	2510.12691	null
2025-10-14	Moment-based Posterior Sampling for Multi-reference Alignment	Axel Janson et.al.	2510.12651	null
2025-10-14	Contraction and entropy production in continuous-time Sinkhorn dynamics	Anand Srinivasan et.al.	2510.12639	null
2025-10-14	Adapting Noise to Data: Generative Flows from 1D Processes	Jannis Chemseddine et.al.	2510.12636	null
2025-10-14	Formation of protostars and the launching of stellar core outflows with moving-mesh radiation non-ideal magnetohydrodynamics	Alexander C. Mayer et.al.	2510.12620	null
2025-10-14	Towards Fast Coarse-graining and Equation Discovery with Foundation Inference Models	Manuel Hinz et.al.	2510.12618	null
2025-10-14	Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training	Jiachen Lei et.al.	2510.12586	null
2025-10-14	LayerSync: Self-aligning Intermediate Layers	Yasaman Haghighi et.al.	2510.12581	null
2025-10-14	ECMSim: A high-performance web simulation of cardiac ECM remodeling through integrated ODE-based signaling and diffusion	Hasi Hays et.al.	2510.12577	null
2025-10-14	Modeling gamma-ray signatures of particle acceleration in stellar clusters from GeV to PeV	A. Inventar et.al.	2510.12562	null
2025-10-14	Unconditional Human Motion and Shape Generation via Balanced Score-Based Diffusion	David Björkstrand et.al.	2510.12537	null
2025-10-14	Voronoi-Assisted Diffusion for Computing Unsigned Distance Fields from Unoriented Points	Jiayi Kong et.al.	2510.12524	null
2025-10-14	Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance	Jincheng Zhong et.al.	2510.12497	null
2025-10-14	Time-Correlated Video Bridge Matching	Viacheslav Vasilev et.al.	2510.12453	null
2025-10-15	The value of storage in electricity distribution: The role of markets	Dirk Lauinger et.al.	2510.12435	null
2025-10-14	Targeted Pooled Latent-Space Steganalysis Applied to Generative Steganography, with a Fix	Etienne Levecque et.al.	2510.12414	null
2025-10-14	Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking	Junhyuk So et.al.	2510.12392	null
2025-10-14	Scene Coordinate Reconstruction Priors	Wenjing Bian et.al.	2510.12387	null
2025-10-14	Controlling Intent Expressiveness in Robot Motion with Diffusion Models	Wenli Shi et.al.	2510.12370	null
2025-10-14	The probe limit in MHD and its implications for magnetic transport	Giorgio Frangi et.al.	2510.12352	null
2025-10-14	Generative Diffusion Model DiffCrysGen Discovers Rare Earth-Free Magnetic Materials	Sourav Mal et.al.	2510.12329	null
2025-10-14	Causal Inspired Multi Modal Recommendation	Jie Yang et.al.	2510.12325	null
2025-10-14	Hybrid Gaussian Splatting for Novel Urban View Synthesis	Mohamed Omran et.al.	2510.12308	null
2025-10-14	Fully mixed virtual element schemes for steady-state poroelastic stress-assisted diffusion	Isaac Bermudez et.al.	2510.12307	null
2025-10-14	Local Background Features Matter in Out-of-Distribution Detection	Jinlun Ye et.al.	2510.12259	null
2025-10-14	FedMMKT:Co-Enhancing a Server Text-to-Image Model and Client Task Models in Multi-Modal Federated Learning	Ningxin He et.al.	2510.12254	null
2025-10-14	Diffusion Models for Reinforcement Learning: Foundations, Taxonomy, and Development	Changfu Xu et.al.	2510.12253	null
2025-10-14	A Gradient Guided Diffusion Framework for Chance Constrained Programming	Boyang Zhang et.al.	2510.12238	null
2025-10-14	BIGFix: Bidirectional Image Generation with Token Fixing	Victor Besnier et.al.	2510.12231	null
2025-10-14	Hierarchical Koopman Diffusion: Fast Generation with Interpretable Diffusion Trajectory	Hanru Bai et.al.	2510.12220	null
2025-10-15	DiSTAR: Diffusion over a Scalable Token Autoregressive Representation for Speech Generation	Yakun Song et.al.	2510.12210	null
2025-10-14	Computational advances and challenges in simulations of turbulence and star formation	Christoph Federrath et.al.	2510.12203	null
2025-10-14	Spatial two-grid compact difference scheme for two-dimensional nonlinear diffusion-wave equations with variable exponent	Hao Zhang et.al.	2510.12188	null
2025-10-14	Audio Palette: A Diffusion Transformer with Multi-Signal Conditioning for Controllable Foley Synthesis	Junnuo Wang et.al.	2510.12175	null
2025-10-14	MatSciBench: Benchmarking the Reasoning Ability of Large Language Models in Materials Science	Junkai Zhang et.al.	2510.12171	null
2025-10-14	DPL: Spatial-Conditioned Diffusion Prototype Enhancement for One-Shot Medical Segmentation	Ziyuan Gao et.al.	2510.12159	null
2025-10-14	Probabilistic Super-Resolution for Urban Micrometeorology via a Schrödinger Bridge	Yuki Yasuda et.al.	2510.12148	null
2025-10-14	Credal Transformer: A Principled Approach for Quantifying and Mitigating Hallucinations in Large Language Models	Shihao Ji et.al.	2510.12137	null
2025-10-14	Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing	Rongzhi Zhang et.al.	2510.12121	null
2025-10-14	ImageSentinel: Protecting Visual Datasets from Unauthorized Retrieval-Augmented Image Generation	Ziyuan Luo et.al.	2510.12119	null
2025-10-14	Self-Supervised Selective-Guided Diffusion Model for Old-Photo Face Restoration	Wenjie Li et.al.	2510.12114	null
2025-10-14	Optimal run-tumble navigation in disordered landscapes	Yang Bai et.al.	2510.12106	null
2025-10-14	New Classes of Non-monotone Variational Inequality Problems Solvable via Proximal Gradient on Smooth Gap Functions	Lei Zhao et.al.	2510.12105	null
2025-10-14	G4Splat: Geometry-Guided Gaussian Splatting with Generative Prior	Junfeng Ni et.al.	2510.12099	null
2025-10-14	H4G: Unlocking Faithful Inference for Zero-Shot Graph Learning in Hyperbolic Space	Heng Zhang et.al.	2510.12094	null
2025-10-14	Very-Long Baseline Interferometry Imaging with Closure Invariants using Conditional Image Diffusion	Samuel Lai et.al.	2510.12093	null
2025-10-14	Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback	Xingpei Ma et.al.	2510.12089	null
2025-10-14	Elevating Medical Image Security: A Cryptographic Framework Integrating Hyperchaotic Map and GRU	Weixuan Li et.al.	2510.12084	null
2025-10-14	A Review on Domain Adaption and Generative Adversarial Networks(GANs)	Aashish Dhawan et.al.	2510.12075	null
2025-10-14	Metalorganic Chemical Vapor Deposition of AlScN Thin Films and AlScN/AlN/GaN Heterostructures	Vijay Gopal Thirupakuzi Vangipuram et.al.	2510.12074	null
2025-10-14	VIDMP3: Video Editing by Representing Motion with Pose and Position Priors	Sandeep Mishra et.al.	2510.12069	null
2025-10-14	Your VAR Model is Secretly an Efficient and Explainable Generative Classifier	Yi-Chung Chen et.al.	2510.12060	null
2025-10-14	Quantification of Electrolyte Degradation in Lithium-ion Batteries with Neutron Imaging Techniques	Yonggang Hu et.al.	2510.12055	null
2025-10-15	Improving Text-to-Image Generation with Input-Side Inference-Time Scaling	Ruibo Chen et.al.	2510.12041	null
2025-10-13	Asking Clarifying Questions for Preference Elicitation With Large Language Models	Ali Montazeralghaem et.al.	2510.12015	null
2025-10-13	UALM: Unified Audio Language Model for Understanding, Generation and Reasoning	Jinchuan Tian et.al.	2510.12000	null
2025-10-13	Impact of Cosmic Ray Acceleration on the Early Evolution of Bow Shocks around Massive Runaway Stars	Keito Watanabe et.al.	2510.11988	null
2025-10-13	Normalization-equivariant Diffusion Models: Learning Posterior Samplers From Noisy And Partial Measurements	Brett Levac et.al.	2510.11964	null
2025-10-13	MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics	Bowei Guo et.al.	2510.11962	null
2025-10-13	Thermal transport in GaN/AlN HEMTs on 4H-SiC: Role of layer thickness and hetero-interfaces	Dat Q. Tran et.al.	2510.11936	null
2025-10-13	Enhancing Diffusion-Based Sampling with Molecular Collective Variables	Juno Nam et.al.	2510.11923	null
2025-10-13	Exosome-mediated chemotaxis optimizes leader-follower cell migration	Louis González et.al.	2510.11909	null
2025-10-13	Long-time contractivity estimates for kinetic Kolmogorov-Fokker-Planck equations	Nicolò Forcillo et.al.	2510.11901	null
2025-10-13	A Closed-form Expression of the Gaussian Noise Model Supporting O-Band Transmission	Zelin Gan et.al.	2510.11867	null
2025-10-13	Learning interpretable closures for thermal radiation transport in optically-thin media using WSINDy	Daniel Messenger et.al.	2510.11840	null
2025-10-13	WaveletDiff: Multilevel Wavelet Diffusion For Time Series Generation	Yu-Hsiang Wang et.al.	2510.11839	null
2025-10-13	The PHANGS-MUSE/HST-Halpha Nebulae Catalogue	A. T. Barnes et.al.	2510.11778	null
2025-10-13	Point Prompting: Counterfactual Tracking with Video Diffusion Models	Ayush Shrivastava et.al.	2510.11715	null
2025-10-13	DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training	Haoran Feng et.al.	2510.11712	null
2025-10-13	TOI-3288 b and TOI-4666 b: two gas giants transiting low-mass stars characterised by NIRPS	Yolanda G. C. Frensch et.al.	2510.11703	null
2025-10-13	Diffusion Transformers with Representation Autoencoders	Boyang Zheng et.al.	2510.11690	null
2025-10-14	Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models	Nianyi Lin et.al.	2510.11683	null
2025-10-13	InfiniHuman: Infinite 3D Human Creation with Precise Control	Yuxuan Xue et.al.	2510.11650	null
2025-10-13	Diffusion-DFL: Decision-focused Diffusion Models for Stochastic Optimization	Zihao Zhao et.al.	2510.11590	null
2025-10-13	A Framework for Low-Effort Training Data Generation for Urban Semantic Segmentation	Denis Zavadski et.al.	2510.11567	null
2025-10-13	SCOOP’D: Learning Mixed-Liquid-Solid Scooping via Sim2Real Generative Policy	Kuanning Wang et.al.	2510.11566	null
2025-10-14	Massive Activations are the Key to Local Detail Synthesis in Diffusion Transformers	Chaofan Gan et.al.	2510.11538	null
2025-10-13	LikePhys: Evaluating Intuitive Physics Understanding in Video Diffusion Models via Likelihood Preference	Jianhao Yuan et.al.	2510.11512	null
2025-10-13	Offline Reinforcement Learning with Generative Trajectory Policies	Xinsong Feng et.al.	2510.11499	null
2025-10-13	Introduction to quantitative De Giorgi methods	Giovanni Brigati et.al.	2510.11481	null
2025-10-13	Unifying Deductive and Abductive Reasoning in Knowledge Graphs with Masked Diffusion Model	Yisen Gao et.al.	2510.11462	null
2025-10-13	Uncertainty-Aware ControlNet: Bridging Domain Gaps with Synthetic Image Generation	Joshua Niemeijer et.al.	2510.11346	null
2025-10-13	DiffStyleTS: Diffusion Model for Style Transfer in Time Series	Mayank Nagda et.al.	2510.11335	null
2025-10-13	Diffusion-Link: Diffusion Probabilistic Model for Bridging the Audio-Text Modality Gap	KiHyun Nam et.al.	2510.11330	null
2025-10-13	A model reduction method based on nonlinear optimization for multiscale stochastic optimal control problems	Jingyi Zhang et.al.	2510.11325	null
2025-10-13	Template-Based Text-to-Image Alignment for Language Accessibility: A Study on Visualizing Text Simplifications	Belkiss Souayed et.al.	2510.11314	null
2025-10-13	TDADL-IE: A Deep Learning-Driven Cryptographic Architecture for Medical Image Security	Junhua Zhou et.al.	2510.11301	null
2025-10-13	From Prompts to Packets: A View from the Network on ChatGPT, Copilot, and Gemini	Antonio Montieri et.al.	2510.11269	null
2025-10-13	Learning the Structure of Connection Graphs	Leonardo Di Nino et.al.	2510.11245	null
2025-10-13	CSI Prediction Using Diffusion Models	Mehdi Sattari et.al.	2510.11214	null
2025-10-13	The evolution of CH in Planck Galactic Cold Clumps	Gan Luo et.al.	2510.11146	null
2025-10-13	Demystifying Numerosity in Diffusion Models – Limitations and Remedies	Yaqi Zhao et.al.	2510.11117	null
2025-10-13	MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps	Jiahui Lei et.al.	2510.11107	null
2025-10-13	CoDefend: Cross-Modal Collaborative Defense via Diffusion Purification and Prompt Optimization	Fengling Zhu et.al.	2510.11096	null
2025-10-13	Temporal Alignment Guidance: On-Manifold Sampling in Diffusion Models	Youngrok Park et.al.	2510.11057	null
2025-10-13	Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States	Qinglin Zhu et.al.	2510.11052	null
2025-10-13	Zero-shot Face Editing via ID-Attribute Decoupled Inversion	Yang Hou et.al.	2510.11050	null
2025-10-13	Microscopic description of the proton halo in $^{12}$ N	K. Y. Zhang et.al.	2510.11038	null
2025-10-13	Resonant W and Z Boson Production in FSRQ Jets: Implications for Diffuse Neutrino Fluxes	J. -H. Ha et.al.	2510.11030	null
2025-10-13	GIR-Bench: Versatile Benchmark for Generating Images with Reasoning	Hongxiang Li et.al.	2510.11026	null
2025-10-13	Parareal in time and spectral in space fast L1 quasilinear subdiffusion solver	Josefa Caballero et.al.	2510.11023	null
2025-10-13	Fast radio bursts shed light on direct gravity test on cosmological scales	Shuren Zhou et.al.	2510.11022	null
2025-10-13	Spatial and Temporal Boundaries in Difference-in-Differences: A Framework from Navier-Stokes Equation	Tatsuru Kikuchi et.al.	2510.11013	null
2025-10-13	ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation	Ruihang Xu et.al.	2510.11000	null
2025-10-13	Perspective-aware 3D Gaussian Inpainting with Multi-view Consistency	Yuxin Cheng et.al.	2510.10993	null
2025-10-13	Blade: A Derivative-free Bayesian Inversion Method using Diffusion Priors	Hongkai Zheng et.al.	2510.10968	null
2025-10-13	DreamMakeup: Face Makeup Customization using Latent Diffusion Models	Geon Yeong Park et.al.	2510.10918	null
2025-10-13	SceneTextStylizer: A Training-Free Scene Text Style Transfer Framework with Diffusion Model	Honghui Yuan et.al.	2510.10910	null
2025-10-13	Multiscale Graph Reduction for Heterogeneous and Anisotropic Discrete Diffusion Processes	Maria Vasilyeva et.al.	2510.10894	null
2025-10-13	Structural encoding with classical codes for computational-basis bit-flip correction in the early fault-tolerant regime	IlKwon Sohn et.al.	2510.10888	null
2025-10-13	FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding	Soroush Mehraban et.al.	2510.10868	null
2025-10-12	A model for transport of soluble surfactants in two-phase flows	Suhas S. Jain et.al.	2510.10857	null
2025-10-12	Discrete State Diffusion Models: A Sample Complexity Perspective	Aadithya Srikanth et.al.	2510.10854	null
2025-10-12	Stochastic and deterministic reaction-diffusion equations	Davide A. Bignamini et.al.	2510.10842	null
2025-10-12	Crisis-Aware Regime-Conditioned Diffusion with CVaR Allocation	Ali Atiah Alzahrani et.al.	2510.10807	null
2025-10-12	DISC-GAN: Disentangling Style and Content for Cluster-Specific Synthetic Underwater Image Generation	Sneha Varur et.al.	2510.10782	null
2025-10-12	Structure Over Signal: A Globalized Approach to Multi-relational GNNs for Stock Prediction	Amber Li et.al.	2510.10775	null
2025-10-12	Understanding Sampler Stochasticity in Training Diffusion Models for RLHF	Jiayuan Sheng et.al.	2510.10767	null
2025-10-12	A Stochastic Differential Equation Framework for Multi-Objective LLM Interactions: Dynamical Systems Analysis with Code Generation Applications	Shivani Shukla et.al.	2510.10739	null
2025-10-12	Controllable Generative Trajectory Prediction via Weak Preference Alignment	Yongxi Cao et.al.	2510.10731	null
2025-10-12	VLM-Guided Adaptive Negative Prompting for Creative Generation	Shelly Golan et.al.	2510.10715	null
2025-10-12	AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes	Yu Li et.al.	2510.10670	null
2025-10-12	Novel superconvergence and ultraconvergence structures for the finite volume element method	Xiang Wang et.al.	2510.10668	null
2025-10-12	Scalable Face Security Vision Foundation Model for Deepfake, Diffusion, and Spoofing Detection	Gaojian Wang et.al.	2510.10663	null
2025-10-12	DEMO: Disentangled Motion Latent Flow Matching for Fine-Grained Controllable Talking Portrait Synthesis	Peiyin Chen et.al.	2510.10650	null
2025-10-12	ProteinAE: Protein Diffusion Autoencoders for Structure Encoding	Shaoning Li et.al.	2510.10634	null
2025-10-12	Collaborative Text-to-Image Generation via Multi-Agent Reinforcement Learning and Semantic Fusion	Jiabao Shi et.al.	2510.10633	null
2025-10-12	Encoder Decoder Generative Adversarial Network Model for Stock Market Prediction	Bahadur Yadav et.al.	2510.10617	null
2025-10-12	HyperAgent: Leveraging Hypergraphs for Topology Optimization in Multi-Agent Communication	Heng Zhang et.al.	2510.10611	null
2025-10-12	D3MAS: Decompose, Deduce, and Distribute for Enhanced Knowledge Sharing in Multi-Agent Systems	Heng Zhang et.al.	2510.10585	null
2025-10-12	GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search	Heng Zhang et.al.	2510.10581	null
2025-10-12	Injecting Frame-Event Complementary Fusion into Diffusion for Optical Flow in Challenging Scenes	Haonan Wang et.al.	2510.10577	null
2025-10-12	Determining nonlinear balance laws in product-type domains by a single local passive boundary observation	Chaohua Duan et.al.	2510.10571	null
2025-10-12	Jigsaw3D: Disentangled 3D Style Transfer via Patch Shuffling and Masking	Yuteng Ye et.al.	2510.10497	null
2025-10-12	Head-wise Adaptive Rotary Positional Encoding for Fine-Grained Image Generation	Jiaye Li et.al.	2510.10489	null
2025-10-12	Gradient Enhanced Self-Training Physics-Informed Neural Network (gST-PINN) for Solving Nonlinear Partial Differential Equations	Narayan S Iyer et.al.	2510.10483	null
2025-10-12	UltraLLaDA: Scaling the Context Length to 128K for Diffusion Large Language Models	Guangxin He et.al.	2510.10481	null
2025-10-12	Latent Retrieval Augmented Generation of Cross-Domain Protein Binders	Zishen Zhang et.al.	2510.10480	null
2025-10-12	Towards Dynamic Quadrupedal Gaits: A Symmetry-Guided RL Hierarchy Enables Free Gait Transitions at Varying Speeds	Jiayu Ding et.al.	2510.10455	null
2025-10-12	Proof of the exact diffusion constant via first passage time in quasi-periodic potentials	Ming Gong et.al.	2510.10435	null
2025-10-12	MonoSE(3)-Diffusion: A Monocular SE(3) Diffusion Framework for Robust Camera-to-Robot Pose Estimation	Kangjian Zhu et.al.	2510.10434	null
2025-10-12	Controllable Graph Generation with Diffusion Models via Inference-Time Tree Search Guidance	Jiachi Zhao et.al.	2510.10402	null
2025-10-12	Breakdown of the Wiedemann-Franz law in an interacting quantum Hall metamaterial	Patrice Roche et.al.	2510.10391	null
2025-10-11	Staggered time discretization in finitely-strained heterogeneous visco-elastodynamics with damage or diffusion in the Eulerian frame	Tomáš Roubíček et.al.	2510.10355	null
2025-10-11	Osmotic forces modify lipid membrane fluctuations	Amaresh Sahu et.al.	2510.10352	null
2025-10-11	Roles of Electrically Excited Magnons in Unidirectional Magnetoresistance of Metallic Magnetic Bilayers	Shashank Gupta et.al.	2510.10309	null
2025-10-11	Defect-driven incoherent skin localization	Emmanouil T. Kokkinakis et.al.	2510.10298	null
2025-10-11	Perturbative and non-perturbative properties of heavy quark transport in a thermal QCD medium	Jiazhen Peng et.al.	2510.10294	null
2025-10-11	ProGress: Structured Music Generation via Graph Diffusion and Hierarchical Music Analysis	Stephen Ni-Hahn et.al.	2510.10249	null
2025-10-11	Efficient Mining of Low-Utility Sequential Patterns	Jian Zhu et.al.	2510.10243	null
2025-10-11	First Passage Problem: Asymptotic Corrections due to Discrete Sampling	Lars Fritz et.al.	2510.10226	null
2025-10-11	You only need 4 extra tokens: Synergistic Test-time Adaptation for LLMs	Yijie Xu et.al.	2510.10223	null
2025-10-11	Hierarchical Bayesian Flow Networks for Molecular Graph Generation	Yida Xiong et.al.	2510.10211	null
2025-10-11	Finite element analysis of a nonlinear heat Equation with damping and pumping effects	Rishabh Shukla et.al.	2510.10210	null
2025-10-11	PIXEL: Adaptive Steering Via Position-wise Injection with eXact Estimated Levels under Subspace Calibration	Manjiang Yu et.al.	2510.10205	null
2025-10-11	Don’t Just Fine-tune the Agent, Tune the Environment	Siyuan Lu et.al.	2510.10197	null
2025-10-11	Chord Colourizer: A Near Real-Time System for Visualizing Musical Key	Paul Haimes et.al.	2510.10173	null
2025-10-11	Multi-Scale Diffusion Transformer for Jointly Simulating User Mobility and Mobile Traffic Pattern	Ziyi Liu et.al.	2510.10158	null
2025-10-11	ReMix: Towards a Unified View of Consistent Character Generation and Editing	Benjia Zhou et.al.	2510.10156	null
2025-10-11	Robust Learning of Diffusion Models with Extremely Noisy Conditions	Xin Chen et.al.	2510.10149	null
2025-10-11	CharCom: Composable Identity Control for Multi-Character Story Illustration	Zhongsheng Wang et.al.	2510.10135	null
2025-10-11	Ctrl-World: A Controllable Generative World Model for Robot Manipulation	Yanjiang Guo et.al.	2510.10125	null
2025-10-11	DeepFusionNet: Autoencoder-Based Low-Light Image Enhancement and Super-Resolution	Halil Hüseyin Çalışkan et.al.	2510.10122	null
2025-10-11	Targeted Sequential Pattern Mining with High Average Utility	Kai Cao et.al.	2510.10115	null
2025-10-11	On the Profile of Singularity Formation for the Incompressible Hydrostatic Boussinesq system	Slim Ibrahim et.al.	2510.10090	null
2025-10-11	SecureWebArena: A Holistic Security Evaluation Benchmark for LVLM-based Web Agents	Zonghao Ying et.al.	2510.10073	null
2025-10-11	Waves of Imagination: Unconditional Spectrogram Generation using Diffusion Architectures	Rahul Vanukuri et.al.	2510.10044	null
2025-10-10	Vision Language Models: A Survey of 26K Papers	Fengming Lin et.al.	2510.09586	null
2025-10-10	TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control	Minkyoung Cho et.al.	2510.09561	null
2025-10-10	Anomalous Diffusion in a Percolating Disordered Dipolar Spin Ensemble	Andrew Stasiuk et.al.	2510.09549	null
2025-10-10	Beyond Surface Reasoning: Unveiling the True Long Chain-of-Thought Capacity of Diffusion Large Language Models	Qiguang Chen et.al.	2510.09544	null
2025-10-10	SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models	Chengyu Wang et.al.	2510.09541	null
2025-10-10	Conditional Flow Matching for Bayesian Posterior Inference	So Won Jeong et.al.	2510.09534	null
2025-10-10	Is Platinum a Proton Blocking Catalyst?	Aparna Saksena et.al.	2510.09522	null
2025-10-10	CRPS-LAM: Regional ensemble weather forecasting from matching marginals	Erik Larsson et.al.	2510.09484	null
2025-10-10	Modeling Protein Diffusion Across ER-Nuclear Envelope Junctions Reveals Efficient Transport via Simple Diffusion	Sara Merino-Aceituno et.al.	2510.09479	null
2025-10-10	Few-shot multi-token DreamBooth with LoRa for style-consistent character generation	Ruben Pascual et.al.	2510.09475	null
2025-10-10	Cross-Platform Narrative Prediction: Leveraging Platform-Invariant Discourse Networks	Patrick Gerard et.al.	2510.09464	null
2025-10-10	Failure Prediction at Runtime for Generative Robot Policies	Ralf Römer et.al.	2510.09459	null
2025-10-10	A posteriori analysis for nonlinear convection-diffusion systems	Andreas Dedner et.al.	2510.09449	null
2025-10-10	Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians	Jin-Chuan Shi et.al.	2510.09438	null
2025-10-10	Are diffusion models ready for materials discovery in unexplored chemical space?	Sanghyun Kim et.al.	2510.09406	null
2025-10-10	Complex Gaussianity and spatio-frequential memory effect of random wave processes	Guillaume Bal et.al.	2510.09402	null
2025-10-10	Sub-Diffraction Chromatin Domains: Architecture, Regulation, and Functional Roles in Nuclear Organization	Vinayak Vinayak et.al.	2510.09375	null
2025-10-10	A Biophysically-Conditioned Generative Framework for 3D Brain Tumor MRI Synthesis	Valentin Biller et.al.	2510.09365	null
2025-10-10	RadioFlow: Efficient Radio Map Construction Framework with Flow Matching	Haozhe Jia et.al.	2510.09314	null
2025-10-10	Mask Tokens as Prophet: Fine-Grained Cache Eviction for Efficient dLLM Inference	Jianuo Huang et.al.	2510.09309	null
2025-10-10	Clear Roads, Clear Vision: Advancements in Multi-Weather Restoration for Smart Transportation	Vijay M. Galshetwar et.al.	2510.09228	null
2025-10-10	Stable Video Infinity: Infinite-Length Video Generation with Error Recycling	Wuyang Li et.al.	2510.09212	null
2025-10-10	Flow-Opt: Scalable Centralized Multi-Robot Trajectory Optimization with Flow Matching and Differentiable Optimization	Simon Idoko et.al.	2510.09204	null
2025-10-10	An exactly solvable asymmetric simple inclusion process	Arvind Ayyer et.al.	2510.09191	null
2025-10-10	Robust Adaptive Boundary Control of a Thermal Process with Thermoelectric Actuators: Theory and Experimental Validation	Paul Mayr et.al.	2510.09169	null
2025-10-10	Enhanced Breakdown and RF Performance in Field-Plated AlGaN/GaN HEMT for High-Power Applications	Tanjim Rahman et.al.	2510.09154	null
2025-10-10	Score-Based Density Estimation from Pairwise Comparisons	Petrus Mikkola et.al.	2510.09146	null
2025-10-10	MSDM: Generating Task-Specific Pathology Images with a Multimodal Conditioned Diffusion Model for Cell and Nuclei Segmentation	Dominik Winter et.al.	2510.09121	null
2025-10-10	Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation	Youwei Zheng et.al.	2510.09094	null
2025-10-10	MCMC: Bridging Rendering, Optimization and Generative AI	Gurprit Singh et.al.	2510.09078	null
2025-10-10	OSCAR: Orthogonal Stochastic Control for Alignment-Respecting Diversity in Flow Matching	Jingxuan Wu et.al.	2510.09060	null
2025-10-10	Lesion-Aware Post-Training of Latent Diffusion Models for Synthesizing Diffusion MRI from CT Perfusion	Junhyeok Lee et.al.	2510.09056	null
2025-10-10	Imaging of Gate-Controlled Suppression of Superconductivity via the Meissner Effect	P. J. Scheidegger et.al.	2510.09044	null
2025-10-10	Drift estimation for rough processes under small noise asymptotic : QMLE approach	Arnaud Gloter et.al.	2510.09028	null
2025-10-10	DiTSinger: Scaling Singing Voice Synthesis with Diffusion Transformer and Implicit Alignment	Zongcai Du et.al.	2510.09016	null
2025-10-10	Towards Better & Faster Autoregressive Image Generation: From the Perspective of Entropy	Xiaoxiao Ma et.al.	2510.09012	null
2025-10-10	Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation	Yao Teng et.al.	2510.08994	null
2025-10-10	HandEval: Taking the First Step Towards Hand Quality Evaluation in Generated Images	Zichuan Wang et.al.	2510.08978	null
2025-10-10	DualResearch: Entropy-Gated Dual-Graph Retrieval for Answer Reconstruction	Jinxin Shi et.al.	2510.08959	null
2025-10-10	Denoised Diffusion for Object-Focused Image Augmentation	Nisha Pillai et.al.	2510.08955	null
2025-10-10	Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection	Siyuan Chen et.al.	2510.08946	null
2025-10-10	Personalize Before Retrieve: LLM-based Personalized Query Expansion for User-Centric Retrieval	Yingyi Zhang et.al.	2510.08935	null
2025-10-10	Passivation-Free Ga-Polar AlGaN/GaN Recessed-Gate HEMTs on Sapphire with 2.8 W/mm POUT and 26.8% PAE at 94 GHz	Ruixin Bai et.al.	2510.08933	null
2025-10-10	ControlAudio: Tackling Text-Guided, Timing-Indicated and Intelligible Audio Generation via Progressive Diffusion Modeling	Yuxuan Jiang et.al.	2510.08878	null
2025-10-09	Reinforcement Learning-Driven Edge Management for Reliable Multi-view 3D Reconstruction	Motahare Mounesan et.al.	2510.08839	null
2025-10-09	SkipSR: Faster Super Resolution with Token Skipping	Rohan Choudhury et.al.	2510.08799	null
2025-10-09	PO-CKAN:Physics Informed Deep Operator Kolmogorov Arnold Networks with Chunk Rational Structure	Junyi Wu et.al.	2510.08795	null
2025-10-09	Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization	Shuo Xing et.al.	2510.08789	null
2025-10-09	Geometry-aware Policy Imitation	Yiming Li et.al.	2510.08787	null
2025-10-09	LinearSR: Unlocking Linear Attention for Stable and Efficient Image Super-Resolution	Xiaohui Li et.al.	2510.08771	null
2025-10-09	Graph Diffusion Transformers are In-Context Molecular Designers	Gang Liu et.al.	2510.08744	null
2025-10-09	Emergence of advection-diffusion transport structure and nonlinear amplitude evolution of strongly driven instabilities	Emma G. Devin et.al.	2510.08735	null
2025-10-09	The polar debris disc around 99 Herculis: A potential signpost for polar circumbinary planets	Jeremy L. Smallwood et.al.	2510.08698	null
2025-10-09	Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation	Kang Liao et.al.	2510.08673	null
2025-10-09	FreqCa: Accelerating Diffusion Models via Frequency-Aware Caching	Jiacheng Liu et.al.	2510.08669	null
2025-10-09	dInfer: An Efficient Inference Framework for Diffusion Language Models	Yuxin Ma et.al.	2510.08666	null
2025-10-09	When Truth Does Not Take on Its Shoes: How Misinformation Spreads in Chatrooms	Shuige Liu et.al.	2510.08658	null
2025-10-09	Fragmentation-limited dust filtration in 2D simulations of planet-disk systems with dust coagulation. Parameter study and implications for the inner disk’s dust mass budget and composition	Thomas Pfeil et.al.	2510.08574	null
2025-10-09	Who Said Neural Networks Aren’t Linear?	Nimrod Berman et.al.	2510.08570	null
2025-10-09	NovaFlow: Zero-Shot Manipulation via Actionable Flow from Generated Videos	Hongyu Li et.al.	2510.08568	null
2025-10-09	ResAD: Normalized Residual Trajectory Modeling for End-to-End Autonomous Driving	Zhiyu Zheng et.al.	2510.08562	null
2025-10-11	MultiCOIN: Multi-Modal COntrollable Video INbetweening	Maham Tanveer et.al.	2510.08561	null
2025-10-09	Classical to Quantum Diffusive Transport in Atomically Thin Semiconductors Capped with High-k Dielectric	Jaroslaw Pawlowski et.al.	2510.08557	null
2025-10-09	VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning	Minghong Cai et.al.	2510.08555	null
2025-10-09	Improving Reasoning for Diffusion Language Models via Group Diffusion Policy Optimization	Kevin Rojas et.al.	2510.08554	null
2025-10-09	Permutation-Invariant Spectral Learning via Dyson Diffusion	Tassilo Schwarz et.al.	2510.08535	null
2025-10-09	X2Video: Adapting Diffusion Models for Multimodal Controllable Neural Video Rendering	Zhitong Huang et.al.	2510.08530	null
2025-10-09	FlexTraj: Image-to-Video Generation with Flexible Point Trajectory Control	Zhiyuan Zhang et.al.	2510.08527	null
2025-10-09	Kinetic description of one-dimensional stochastic dynamics with small inertia	Denis S. Goldobin et.al.	2510.08502	null
2025-10-09	Diffusion-Based Probabilistic Modeling for Hourly Streamflow Prediction and Assimilation	Wencong Yang et.al.	2510.08488	null
2025-10-09	InstructX: Towards Unified Visual Editing with MLLM Guidance	Chong Mou et.al.	2510.08485	null
2025-10-09	Anomalous Diffusion in Driven Electrolytes due to Hydrodynamic Fluctuations	Ramin Golestanian et.al.	2510.08478	null
2025-10-09	Wavefunction Flows: Efficient Quantum Simulation of Continuous Flow Models	David Layden et.al.	2510.08462	null
2025-10-09	SummDiff: Generative Modeling of Video Summarization with Diffusion	Kwanseok Kim et.al.	2510.08458	null
2025-10-09	Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency	Kaiwen Zheng et.al.	2510.08431	null
2025-10-09	Reinforcing Diffusion Models by Direct Group Preference Optimization	Yihong Luo et.al.	2510.08425	null
2025-10-09	Optimal Stopping in Latent Diffusion Models	Yu-Han Wu et.al.	2510.08409	null
2025-10-09	Biology-driven assessment of deep learning super-resolution imaging of the porosity network in dentin	Lauren Anderson et.al.	2510.08407	null
2025-10-09	MeanVC: Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows	Guobin Ma et.al.	2510.08392	null
2025-10-09	UniVideo: Unified Understanding, Generation, and Editing for Videos	Cong Wei et.al.	2510.08377	null
2025-10-09	Guided Star-Shaped Masked Diffusion	Viacheslav Meshchaninov et.al.	2510.08369	null
2025-10-09	Hyperspectral data augmentation with transformer-based diffusion models	Mattia Ferrari et.al.	2510.08363	null
2025-10-09	Impact of protein corona morphology on nanoparticle diffusion in biological fluids: insights from a mesoscale approach	Beatrice Cipriani et.al.	2510.08340	null
2025-10-09	A Simultaneous Synergistic Protection Mechanism in Hybrid Perovskite-Organic Multi-junctions Enables Long-Term Stable and Efficient Tandem Solar Cells	Chao Liu et.al.	2510.08330	null
2025-10-09	LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation	Yushi Huang et.al.	2510.08318	null
2025-10-09	On the Cahn-Hilliard equation with nonlinear diffusion: the non-convex case	Monica Conti et.al.	2510.08287	null
2025-10-10	One Stone with Two Birds: A Null-Text-Null Frequency-Aware Diffusion Models for Text-Guided Image Inpainting	Haipeng Liu et.al.	2510.08273	null
2025-10-09	SViM3D: Stable Video Material Diffusion for Single Image 3D Generation	Andreas Engelhardt et.al.	2510.08271	null
2025-10-09	Multi-Agent Analysis of Off-Exchange Public Information for Cryptocurrency Market Trend Prediction	Kairan Hong et.al.	2510.08268	null
2025-10-09	Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization	Yuchen Zhu et.al.	2510.08233	null
2025-10-09	Expressive Value Learning for Scalable Offline Reinforcement Learning	Nicolas Espinosa-Dice et.al.	2510.08218	null
2025-10-09	InstructUDrag: Joint Text Instructions and Object Dragging for Interactive Image Editing	Haoran Yu et.al.	2510.08181	null
2025-10-09	Prepared mind, fast response: A temporal decoupling framework for adaptive knowledge orchestration in open-domain dialogue	Jinling Gan et.al.	2510.08175	null
2025-10-09	UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution	Shian Du et.al.	2510.08143	null
2025-10-09	Real-Time Motion-Controllable Autoregressive Video Diffusion	Kesen Zhao et.al.	2510.08131	null
2025-10-09	General formulation of an analytic, Lipschitz continuous control allocation for thrust-vectored controlled rigid-bodies	Frank Mukwege et.al.	2510.08119	null
2025-10-09	Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection	Shuhai Zhang et.al.	2510.08073	null
2025-10-09	Integrated Localization, Mapping, and Communication through VCSEL-Based Light-emitting RIS (LeRIS)	Rashid Iqbal et.al.	2510.08071	null
2025-10-09	Acceleration of Ultrahigh Energy Particles from Fast Radio Bursts	Lin Yu et.al.	2510.08037	null
2025-10-09	Gradient regularity for widely degenerate parabolic equations	Michael Strunk et.al.	2510.07999	null
2025-10-09	SimCast: Enhancing Precipitation Nowcasting with Short-to-Long Term Knowledge Distillation	Yifang Yin et.al.	2510.07953	null
2025-10-09	CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving	Tianrui Zhang et.al.	2510.07944	null
2025-10-09	TTOM: Test-Time Optimization and Memorization for Compositional Video Generation	Leigang Qu et.al.	2510.07940	null
2025-10-09	A cross-diffusion system with independent drifts and fast diffusion	Charles Elbar et.al.	2510.07937	null
2025-10-09	Scaling crossover of the generalized Jeffreys-type law	Fugui Ma et.al.	2510.07930	null
2025-10-09	Guitar Tone Morphing by Diffusion-based Model	Kuan-Yu Chen et.al.	2510.07908	null
2025-10-09	Adaptive Optimizable Gaussian Process Regression Linear Least Squares Regression Filtering Method for SEM Images	D. Chee Yong Ong et.al.	2510.07895	null
2025-10-09	Signal-to-Noise Ratio in Scanning Electron Microscopy: A Comprehensive Review	K. S. Sim et.al.	2510.07886	null
2025-10-09	FlowLensing: Simulating Gravitational Lensing with Flow Matching	Hamees Sayed et.al.	2510.07878	null
2025-10-09	DM1: MeanFlow with Dispersive Regularization for 1-Step Robotic Manipulation	Guowei Zou et.al.	2510.07865	null
2025-10-09	Dynamic Generation of Multi-LLM Agents Communication Topologies with Graph Diffusion Models	Eric Hanchen Jiang et.al.	2510.07799	null
2025-10-09	From Noisy to Native: LLM-driven Graph Restoration for Test-Time Graph Domain Adaptation	Xiangwei Lv et.al.	2510.07762	null
2025-10-09	Elucidation of the Correlation between Molecular Conformation and Shear Viscosity of Polymer Melts under Steady-State Shear Flow	Yuhi Sakamaki et.al.	2510.07738	null
2025-10-09	GeoGen: A Two-stage Coarse-to-Fine Framework for Fine-grained Synthetic Location-based Social Network Trajectory Generation	Rongchao Xu et.al.	2510.07735	null
2025-10-09	ComGS: Efficient 3D Object-Scene Composition via Surface Octahedral Probes	Jian Gao et.al.	2510.07729	null
2025-10-09	SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction	Wenyue Chen et.al.	2510.07723	null
2025-10-09	RePainter: Empowering E-commerce Object Removal via Spatial-matting Reinforcement Learning	Zipeng Guo et.al.	2510.07721	null
2025-10-09	Multimodal Safety Evaluation in Generative Agent Social Simulations	Alhim Vera et.al.	2510.07709	null
2025-10-09	EB-MBD: Emerging-Barrier Model-Based Diffusion for Safe Trajectory Optimization in Highly Constrained Environments	Raghav Mishra et.al.	2510.07700	null
2025-10-09	Chromium-doped uranium dioxide fuels: A review	Mack Wesley Cleveland et.al.	2510.07698	null
2025-10-09	From tug-of-war to Brownian Boost: explicit ODE solutions for player-funded stochastic-differential games	Alan Hammond et.al.	2510.07682	null
2025-10-09	Curriculum Learning with Synthetic Data for Enhanced Pulmonary Nodule Detection in Chest Radiographs	Pranav Sambhu et.al.	2510.07681	null
2025-10-09	Controllable Video Synthesis via Variational Inference	Haoyi Duan et.al.	2510.07670	null
2025-10-09	MONKEY: Masking ON KEY-Value Activation Adapter for Personalization	James Baker et.al.	2510.07656	null
2025-10-09	Once Is Enough: Lightweight DiT-Based Video Virtual Try-On via One-Time Garment Appearance Injection	Yanjie Pan et.al.	2510.07654	null
2025-10-09	Rectified-CFG++ for Flow Based Models	Shreshth Saini et.al.	2510.07631	null
2025-10-08	What is the most optimal diffusion?	Vasili Baranau et.al.	2510.07571	null
2025-10-08	Symbolic-Diffusion: Deep Learning Based Symbolic Regression with D3PM Discrete Token Diffusion	Ryan T. Tymkow et.al.	2510.07570	null
2025-10-08	Establishing strong 1-boundedness via non-microstates free entropy techniques	Benjamin Major et.al.	2510.07558	null
2025-10-08	TRAVL: A Recipe for Making Video-Language Models Better Judges of Physics Implausibility	Saman Motamed et.al.	2510.07550	null
2025-10-08	PickStyle: Video-to-Video Style Transfer with Context-Style Adapters	Soroush Mehraban et.al.	2510.07546	null
2025-10-08	First order equation on random measures as superposition of weak solutions to the McKean-Vlasov equation	Alessandro Pinzi et.al.	2510.07542	null
2025-10-08	EMPalm: Exfiltrating Palm Biometric Data via Electromagnetic Side-Channels	Haowen Xu et.al.	2510.07533	null
2025-10-08	Flexible Intelligent Metasurface for Reconfiguring Radio Environments	Hanwen Hu et.al.	2510.07466	null
2025-10-08	Effects of skewing collision cells on transport properties in multiparticle collision dynamics simulations	Jinny Cha et.al.	2510.07446	null
2025-10-08	DynamicEval: Rethinking Evaluation for Dynamic Text-to-Video Synthesis	Nithin C. Babu et.al.	2510.07441	null
2025-10-08	From two dimensions to wire networks in a dice-lattice Josephson array	J. D. Bondar et.al.	2510.07412	null
2025-10-08	Classical-quantum oscillators as diffusive processes in phase space	Emanuele Panella et.al.	2510.07402	null
2025-10-08	Neutrinos from stars in the Milky Way	Pablo Martínez-Miravé et.al.	2510.07399	null
2025-10-08	A multiscale evolutionary study of molecular gas in STARFORGE. I. Synthetic observations of SEDIGISM-like molecular clouds	K. R. Neralwar et.al.	2510.07393	null
2025-10-08	Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers	Gangwei Xu et.al.	2510.07316	null
2025-10-08	WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation	Zezhong Qian et.al.	2510.07313	null
2025-10-08	Diffuse continuum emission and large extended sources at MeV energies	Markus Ackermann et.al.	2510.07311	null
2025-10-08	MATRIX: Mask Track Alignment for Interaction-aware Video Generation	Siyoon Jin et.al.	2510.07310	null
2025-10-08	Entropy and diffusion characterize mutation accumulation and biological information loss	Stephan Baehr et.al.	2510.07265	null
2025-10-08	The cosmic web’s Lyman- $α$ glow at $z \approx 2.5$ ; varying hydrodynamic models, dust, and wide-field, narrow-band imaging detection	Oleksii Sokoliuk et.al.	2510.07259	null
2025-10-08	TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation	Jiaben Chen et.al.	2510.07249	null
2025-10-08	Security-Robustness Trade-offs in Diffusion Steganography: A Comparative Analysis of Pixel-Space and VAE-Based Architectures	Yuhua Xu et.al.	2510.07219	null
2025-10-08	GenPilot: A Multi-Agent System for Test-Time Prompt Optimization in Image Generation	Wen Ye et.al.	2510.07217	null
2025-10-08	L^p-quasicontractiveness and Kernel estimates for semigroups generated by systems of elliptic operators	L. Angiuli et.al.	2510.07216	null
2025-10-08	EigenScore: OOD Detection using Covariance in Diffusion Models	Shirin Shoushtari et.al.	2510.07206	null
2025-10-08	MV-Performer: Taming Video Diffusion Model for Faithful and Synchronized Multi-view Performer Synthesis	Yihao Zhi et.al.	2510.07190	null
2025-10-08	Renormalization of Interacting Random Graph Models	Alessio Catanzaro et.al.	2510.07186	null
2025-10-08	Diffusion Codes: Self-Correction from Small(er)-Set Expansion with Tunable Non-locality	Adithya Sriram et.al.	2510.07179	null
2025-10-08	Derivation of the fourth-order DLSS equation with nonlinear mobility via chemical reactions	Alexander Mielke et.al.	2510.07149	null
2025-10-08	Stability of non-conservative cross diffusion model and approximation by stochastic particle systems	Vincent Bansaye et.al.	2510.07138	null
2025-10-08	A Digital Twin Framework for Metamorphic Testing of Autonomous Driving Systems Using Generative Model	Tony Zhang et.al.	2510.07133	null
2025-10-08	Graph Conditioned Diffusion for Controllable Histopathology Image Generation	Sarah Cechnicka et.al.	2510.07129	null
2025-10-08	Diffusion-Augmented Reinforcement Learning for Robust Portfolio Optimization under Stress Scenarios	Himanshu Choudhary et.al.	2510.07099	null
2025-10-08	Generative World Modelling for Humanoids: 1X World Model Challenge Technical Report	Riccardo Mereu et.al.	2510.07092	null
2025-10-08	Accelerating Diffusion LLM Inference via Local Determinism Propagation	Fanheng Kong et.al.	2510.07081	null
2025-10-08	Diffusing Trajectory Optimization Problems for Recovery During Multi-Finger Manipulation	Abhinav Kumar et.al.	2510.07030	null
2025-10-08	No MoCap Needed: Post-Training Motion Diffusion Models with Reinforcement Learning using Only Textual Prompts	Girolamo Macaluso et.al.	2510.06988	null
2025-10-08	Addressing the ID-Matching Challenge in Long Video Captioning	Zhantao Yang et.al.	2510.06973	null
2025-10-08	Generating Surface for Text-to-3D using 2D Gaussian Splatting	Huanning Dong et.al.	2510.06967	null
2025-10-08	IAR2: Improving Autoregressive Visual Generation with Semantic-Detail Associated Token Prediction	Ran Yi et.al.	2510.06928	null
2025-10-08	StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance	Jaeseok Jeong et.al.	2510.06827	null
2025-10-08	Accelerated and fast magnetic reconnection through enhanced resistive dissipation for MHD equations	Gennaro Ciampa et.al.	2510.06801	null
2025-10-08	OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot	Junhan Zhu et.al.	2510.06751	null
2025-10-08	An Inertial Langevin Algorithm	Alexander Falk et.al.	2510.06723	null
2025-10-08	A Diffusion Model for Regular Time Series Generation from Irregular Data with Completion and Masking	Gal Fadlon et.al.	2510.06699	null
2025-10-08	ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory	Yunzhong Xiao et.al.	2510.06664	null
2025-10-08	Global weak solutions to nonlinear kinetic Fokker–Planck equations in bounded domains under physical initial data	Young-Pil Choi et.al.	2510.06656	null
2025-10-08	Mass-Lumped Virtual Element Method with Strong Stability-Preserving Runge-Kutta Time Stepping for Two-Dimensional Parabolic Problems	Paulo Akira F. Enabe et.al.	2510.06653	null
2025-10-08	Control-Augmented Autoregressive Diffusion for Data Assimilation	Prakhar Srivastava et.al.	2510.06637	null
2025-10-08	Conditional McKean-Vlasov control	René Carmona et.al.	2510.06543	null
2025-10-08	VUGEN: Visual Understanding priors for GENeration	Xiangyi Chen et.al.	2510.06529	null
2025-10-07	Text-to-Image Models Leave Identifiable Signatures: Implications for Leaderboard Security	Ali Naseh et.al.	2510.06525	null
2025-10-07	Limited-Angle Tomography Reconstruction via Projector Guided 3D Diffusion	Zhantao Deng et.al.	2510.06516	null
2025-10-07	SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation	Oindrila Saha et.al.	2510.06469	null
2025-10-07	TDiff: Thermal Plug-And-Play Prior with Patch-Based Diffusion	Piyush Dashpute et.al.	2510.06460	null
2025-10-07	Generalized Multi-Constraint Extremum Seeking	Alan Williams et.al.	2510.06403	null
2025-10-07	Controllable Stylistic Text Generation with Train-Time Attribute-Regularized Diffusion	Fan Zhou et.al.	2510.06386	null
2025-10-07	Diffusion-Guided Renormalization of Neural Systems via Tensor Networks	Nathan X. Kodama et.al.	2510.06361	null
2025-10-07	Finite element approximation and very weak solution existence in a two-dimensional, degenerate Keller-Segel model	Juan Vicente Gutiérrez-Santacreu et.al.	2510.06341	null
2025-10-07	Conditional Denoising Diffusion Model-Based Robust MR Image Reconstruction from Highly Undersampled Data	Mohammed Alsubaie et.al.	2510.06335	null
2025-10-07	Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding	Yi Xin et.al.	2510.06308	null
2025-10-07	SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation	Shuang Cheng et.al.	2510.06303	null
2025-10-07	RGBD Gaze Tracking Using Transformer for Feature Fusion	Tobias J. Bauer et.al.	2510.06298	null
2025-10-07	Efficient High-Resolution Image Editing with Hallucination-Aware Loss and Adaptive Tiling	Young D. Kwon et.al.	2510.06295	null
2025-10-07	BlockGPT: Spatio-Temporal Modelling of Rainfall via Frame-Level Autoregression	Cristian Meo et.al.	2510.06293	null
2025-10-07	Traj-Transformer: Diffusion Models with Transformer for GPS Trajectory Generation	Zhiyang Zhang et.al.	2510.06291	null
2025-10-07	Quantum-Theoretical Re-interpretation of Pricing Theory	Tian Xin et.al.	2510.06287	null
2025-10-07	Fine-grained Defocus Blur Control for Generative Image Models	Ayush Shrivastava et.al.	2510.06215	null
2025-10-07	Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models	Jiahao Wang et.al.	2510.06209	null
2025-10-07	On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond	Chenxiao Yang et.al.	2510.06190	null
2025-10-07	Thermodynamic Performance Limits for Score-Based Diffusion Models	Nathan X. Kodama et.al.	2510.06174	null
2025-10-07	Bimanual 3D Hand Motion and Articulation Forecasting in Everyday Images	Aditya Prakash et.al.	2510.06145	null
2025-10-07	CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits	Kangyu Wang et.al.	2510.06133	null
2025-10-07	Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation	Jiawei Mao et.al.	2510.06131	null
2025-10-07	Phase-induced switching of ferromagnetic insulators in Josephson spin valves	A. A. Mazanik et.al.	2510.06109	null
2025-10-07	Complete Synchronization and Pattern Selection through Amplitude Dynamics and Diffusion in Heterogeneous Oscillatory Media	Nicolas Thomé et.al.	2510.06083	null
2025-10-07	Mechanistic-statistical inference of mosquito dynamics from mark-release-recapture data	Nga Nguyen et.al.	2510.06080	null
2025-10-07	Controllable Audio-Visual Viewpoint Generation from 360° Spatial Information	Christian Marinoni et.al.	2510.06060	null
2025-10-08	Edit-Based Flow Matching for Temporal Point Processes	David Lüdke et.al.	2510.06050	null
2025-10-07	The gamma-ray emission from Radio Galaxies and their contribution to the Isotropic Gamma-Ray Background	A. Circiello et.al.	2510.06047	null
2025-10-07	Emergent Directedness in Social Contagion	Fabian Tschofenig et.al.	2510.06012	null
2025-10-07	ECTSpeech: Enhancing Efficient Speech Synthesis via Easy Consistency Tuning	Tao Zhu et.al.	2510.05984	null
2025-10-07	Diffusion-Based Image Editing for Breaking Robust Watermarks	Yunyi Ni et.al.	2510.05978	null
2025-10-07	Diffusion Models for Low-Light Image Enhancement: A Multi-Perspective Taxonomy and Performance Analysis	Eashan Adhikarla et.al.	2510.05976	null
2025-10-07	Quantum Lattice Boltzmann Method for Multiple Time Steps Without Reinitialization for Linear Advection-Diffusion Problems	Aaron Nagel et.al.	2510.05965	null
2025-10-07	$\bf{D^3}$ QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection	Yanran Zhang et.al.	2510.05891	null
2025-10-07	Dynamics of Choline Chloride based Deep Eutectic Solvents: Neutron Scattering Study	Rinesh T. et.al.	2510.05882	null
2025-10-07	The Safety Challenge of World Models for Embodied AI Agents: A Review	Lorenzo Baraldi et.al.	2510.05865	null
2025-10-07	FoleyGRAM: Video-to-Audio Generation with GRAM-Aligned Multimodal Encoders	Riccardo Fosco Gramaccioni et.al.	2510.05829	null
2025-10-07	StereoSync: Spatially-Aware Stereo Audio Generation from Video	Christian Marinoni et.al.	2510.05828	null
2025-10-07	First experimental measurements of biophotons from Astrocytes and Glioblastoma cell cultures	L. De Paolis et.al.	2510.05792	null
2025-10-07	Models of topological barriers and molecular motors of bacterial DNA	Marc Joyeux et.al.	2510.05790	null
2025-10-07	New Insights into Involutory and Orthogonal MDS Matrices	Yogesh Kumar et.al.	2510.05766	null
2025-10-07	RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases	Lang Qin et.al.	2510.05764	null
2025-10-07	Early Multimodal Prediction of Cross-Lingual Meme Virality on Reddit: A Time-Window Analysis	Sedat Dogan et.al.	2510.05761	null
2025-10-07	Vipera: Blending Visual and LLM-Driven Guidance for Systematic Auditing of Text-to-Image Generative AI	Yanwei Huang et.al.	2510.05742	null
2025-10-07	Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies	Chunsan Hong et.al.	2510.05725	null
2025-10-07	Data Factory with Minimal Human Effort Using VLMs	Jiaojiao Ye et.al.	2510.05722	null
2025-10-07	DiffSDA: Unsupervised Diffusion Sequential Disentanglement Across Modalities	Hedi Zisling et.al.	2510.05717	null
2025-10-07	AgeBooth: Controllable Facial Aging and Rejuvenation via Diffusion Models	Shihao Zhu et.al.	2510.05715	null
2025-10-07	Hedging of exotic options in Hawkes jump-diffusion models by Malliavin calculus	Ayub Ahmadi et.al.	2510.05689	null
2025-10-07	When and How to Cut Classical Concerts? A Multimodal Automated Video Editing Approach	Daniel Gonzálbez-Biosca et.al.	2510.05661	null
2025-10-07	Teleportraits: Training-Free People Insertion into Any Scene	Jialu Gao et.al.	2510.05660	null
2025-10-07	Beyond Spectral Peaks: Interpreting the Cues Behind Synthetic Image Detection	Sara Mandelli et.al.	2510.05633	null
2025-10-07	Generative AI-Driven Hierarchical Multi-Agent Framework for Zero-Touch Optical Networks	Yao Zhang et.al.	2510.05625	null
2025-10-07	PointNSP: Autoregressive 3D Point Cloud Generation with Next-Scale Level-of-Detail Prediction	Ziqiao Meng et.al.	2510.05613	null
2025-10-07	Efficient Conditional Generation on Scale-based Visual Autoregressive Models	Jiaqi Liu et.al.	2510.05610	null
2025-10-07	Improving Chain-of-Thought Efficiency for Autoregressive Image Generation	Zeqi Gu et.al.	2510.05593	null
2025-10-07	Probing orbital currents through inverse orbital Hall and Rashba effects	E. Santos et.al.	2510.05543	null
2025-10-07	Teamwork: Collaborative Diffusion with Low-rank Coordination and Adaptation	Sam Sartor et.al.	2510.05532	null
2025-10-07	Be Tangential to Manifold: Discovering Riemannian Metric for Diffusion Models	Shinnosuke Saito et.al.	2510.05509	null
2025-10-07	High-Fidelity Synthetic ECG Generation via Mel-Spectrogram Informed Diffusion Training	Zhuoyi Huang et.al.	2510.05492	null
2025-10-06	Surface Excess Energy Governs the Non-Monotonic Behavior of Active Diffusivity with Activity	A. Arango-Restrepo et.al.	2510.05435	null
2025-10-06	See the past: Time-Reversed Scene Reconstruction from Thermal Traces Using Visual Language Models	Kebin Contreras et.al.	2510.05408	null
2025-10-06	LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation	Yang Xiao et.al.	2510.05367	null
2025-10-06	Mitigating Diffusion Model Hallucinations with Dynamic Guidance	Kostas Triaridis et.al.	2510.05356	null
2025-10-06	Domain Decomposition-Based Coupling of High-Fidelity Finite Element and Reduced Order Operator Inference Models Using the Schwarz Alternating Method	Ian Moore et.al.	2510.05350	null
2025-10-06	A System Level Approach to LQR Control of the Diffusion Equation	Addie McCurdy et.al.	2510.05345	null
2025-10-06	Learning the detector in optical tomography	Zijian Wang et.al.	2510.05341	null
2025-10-06	Machine Learning Interatomic Potentials Enable Molecular Dynamics Simulations of Doped MoS2	Abrar Faiyad et.al.	2510.05339	null
2025-10-06	Resonance with quasinormal modes in long-range kinks’ collisions	J. G. F. Campos et.al.	2510.05311	null
2025-10-06	Scalarized Hot Neutron Stars Containing Hyperons and $Δ$ -Resonances in Different Evolution Regimes	Fahimeh Rahimi et.al.	2510.05302	null
2025-10-06	A Data-Driven Prism: Multi-View Source Separation with Diffusion Model Priors	Sebastian Wagner-Carena et.al.	2510.05205	null
2025-10-06	Paper2Video: Automatic Video Generation from Scientific Papers	Zeyu Zhu et.al.	2510.05096	null
2025-10-06	VChain: Chain-of-Visual-Thought for Reasoning in Video Generation	Ziqi Huang et.al.	2510.05094	null
2025-10-06	Character Mixing for Video Generation	Tingting Liao et.al.	2510.05093	null
2025-10-06	Factuality Matters: When Image Generation and Editing Meet Structured Visuals	Le Zhuo et.al.	2510.05091	null
2025-10-06	Finish First, Perfect Later: Test-Time Token-Level Cross-Validation for Diffusion Large Language Models	Runchu Tian et.al.	2510.05090	null
2025-10-06	SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder	Ronen Kamenetsky et.al.	2510.05081	null
2025-10-06	SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs	Dachuan Shi et.al.	2510.05069	null
2025-10-06	Spectral Properties of Anomalous Microwave Emission in 144 Galactic Clouds	Roke Cepeda-Arroita et.al.	2510.05067	null
2025-10-06	StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation	Mingyu Liu et.al.	2510.05057	null
2025-10-06	No-reference Quality Assessment of Contrast-distorted Images using Contrast-enhanced Pseudo Reference	Mohammad-Ali Mahmoudpour et.al.	2510.05053	null
2025-10-06	Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts	Jihoon Lee et.al.	2510.05040	null
2025-10-06	Graph-Aware Diffusion for Signal Generation	Sergio Rozada et.al.	2510.05036	null
2025-10-06	Comparing fine-tuning strategies of MACE machine learning force field for modeling Li-ion diffusion in LiF for batteries	Nada Alghamdi et.al.	2510.05020	null
2025-10-06	Bridging Text and Video Generation: A Survey	Nilay Kumar et.al.	2510.04999	null
2025-10-06	SSDD: Single-Step Diffusion Decoder for Efficient Image Tokenization	Théophane Vallaeys et.al.	2510.04961	null
2025-10-06	Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion	Xin Li et.al.	2510.04947	null
2025-10-06	Steady-State Spread Bounds for Graph Diffusion via Laplacian Regularisation	Ardavan Rahimian et.al.	2510.04924	null
2025-10-06	Effect of ice nucleating proteins on the structure-property relationships of ice: A molecular dynamics study	A. K. Shargh et.al.	2510.04892	null
2025-10-06	Flow-Matching Based Refiner for Molecular Conformer Generation	Xiangyang Xu et.al.	2510.04878	null
2025-10-06	Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails	Siwei Han et.al.	2510.04860	null
2025-10-06	Efficient structure-preserving scheme for chemotaxis PDEs with singular sensitivity in crime and epidemic modeling	Rui Wang et.al.	2510.04826	null
2025-10-06	Did you just see that? Arbitrary view synthesis for egocentric replay of operating room workflows from ambient sensors	Han Zhang et.al.	2510.04802	null
2025-10-06	A behavioral reinvestigation of the effect of long ties on social contagions	Luca Lazzaro et.al.	2510.04785	null
2025-10-06	ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs	Wonjun Kang et.al.	2510.04767	null
2025-10-06	Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba	Baher Mohammad et.al.	2510.04738	null
2025-10-06	Sub-Gaussian heat kernel estimates for reflected diffusion on inner uniform domains	Riku Anttila et.al.	2510.04725	null
2025-10-06	BGRem: A background noise remover for astronomical images based on a diffusion model	R. Nicolaas et.al.	2510.04718	null
2025-10-06	ReactDiff: Fundamental Multiple Appropriate Facial Reaction Diffusion Model	Luo Cheng et.al.	2510.04712	null
2025-10-06	ID-Consistent, Precise Expression Generation with Blendshape-Guided Diffusion	Foivos Paraperas Papantoniou et.al.	2510.04706	null
2025-10-06	ConceptSplit: Decoupled Multi-Concept Personalization of Diffusion Models via Token-wise Adaptation and Attention Disentanglement	Habin Lim et.al.	2510.04668	null
2025-10-06	Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents	Zeyi Zhang et.al.	2510.04637	null
2025-10-06	The Role of Acoustic Instability in Cosmic-Ray Self-Confinement	Antonio Capanema et.al.	2510.04635	null
2025-10-06	Exploring the Power of Diffusion Large Language Models for Software Engineering: An Empirical Investigation	Jingyao Zhang et.al.	2510.04605	null
2025-10-06	Investigating into mechanisms of high temperature strength of refractory high-entropy alloys	Sai Anandhi Seetharaman et.al.	2510.04589	null
2025-10-06	Improved probabilistic regression using diffusion models	Carlo Kneissl et.al.	2510.04583	null
2025-10-07	Constrained Dikin-Langevin diffusion for polyhedra	James Chok et.al.	2510.04582	null
2025-10-06	Language Model Based Text-to-Audio Generation: Anti-Causally Aligned Collaborative Residual Transformers	Juncheng Wang et.al.	2510.04577	null
2025-10-06	SONA: Learning Conditional, Unconditional, and Mismatching-Aware Discriminator	Yuhta Takida et.al.	2510.04576	null
2025-10-07	LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning	Haoqiang Kang et.al.	2510.04573	null
2025-10-06	3Dify: a Framework for Procedural 3D-CG Generation Assisted by LLMs Using MCP and RAG	Shun-ichiro Hayashi et.al.	2510.04536	null
2025-10-06	TAG:Tangential Amplifying Guidance for Hallucination-Resistant Diffusion Sampling	Hyunmin Cho et.al.	2510.04533	null
2025-10-06	Demystifying MaskGIT Sampler and Beyond: Adaptive Order Selection in Masked Diffusion	Satoshi Hayakawa et.al.	2510.04525	null
2025-10-06	Toward a Unified Geometry Understanding: Riemannian Diffusion Framework for Graph Generation and Prediction	Yisen Gao et.al.	2510.04522	null
2025-10-06	Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation	Zijing Hu et.al.	2510.04504	null
2025-10-06	Non-Monotone Traveling Waves of the Weak Competition Lotka-Volterra System	Chiun-Chuan Chen et.al.	2510.04501	null
2025-10-06	Identifying non-equilibrium fluctuations in Intracellular Motion Using Recurrent Neural Networks	Tomas Basile et.al.	2510.04485	null
2025-10-06	TBStar-Edit: From Image Editing Pattern Shifting to Consistency Enhancement	Hao Fang et.al.	2510.04483	null
2025-10-06	A Diffusion-based Generative Machine Learning Paradigm for Contingency Screening	Quan Tran et.al.	2510.04470	null
2025-10-06	REAR: Rethinking Visual Autoregressive Models via Generator-Tokenizer Consistency Regularization	Qiyuan He et.al.	2510.04450	null
2025-10-06	Fractional Heat Kernel for Semi-Supervised Graph Learning with Small Training Sample Size	Farid Bozorgnia et.al.	2510.04440	null
2025-10-06	spd-metrics-id: A Python Package for SPD-Aware Distance Metrics in Connectome Fingerprinting and Beyond	Kaosar Uddin et.al.	2510.04438	null
2025-10-06	PAD-TRO: Projection-Augmented Diffusion for Direct Trajectory Optimization	Jushan Chen et.al.	2510.04436	null
2025-10-05	On the Origin of Carrier Loss in Mg-Doped N-Polar GaN	Masahiro Kamiyama et.al.	2510.04381	null
2025-10-05	Diffusion^2: Dual Diffusion Model with Uncertainty-Aware Adaptive Noise for Momentary Trajectory Prediction	Yuhao Luo et.al.	2510.04365	null
2025-10-05	Score-based generative emulation of impact-relevant Earth system model outputs	Shahine Bouabid et.al.	2510.04358	null
2025-10-05	Reliable and Scalable Robot Policy Evaluation with Imperfect Simulators	Apurva Badithela et.al.	2510.04354	null
2025-10-05	On strong solution of a multidimensional SDE: extension of Yamada – Watanabe’s theorem	A. A. Lyappieva et.al.	2510.04329	null
2025-10-05	FoilDiff: A Hybrid Transformer Backbone for Diffusion-based Modelling of 2D Airfoil Flow Fields	Kenechukwu Ogbuagu et.al.	2510.04325	null
2025-10-05	ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation	Jay Zhangjie Wu et.al.	2510.04290	null
2025-10-05	The best performance in the CARE 2025 – Liver Task (LiSeg-Contrast): Contrast-Aware Semi-Supervised Segmentation with Domain Generalization and Test-Time Adaptation	Jincan Lou et.al.	2510.04243	null
2025-10-05	Diffusion-Assisted Distillation for Self-Supervised Graph Representation Learning with MLPs	Seong Jin Ahn et.al.	2510.04241	null
2025-10-05	Flexible Locomotion Learning with Diffusion Model Predictive Control	Runhan Huang et.al.	2510.04234	null
2025-10-05	MASC: Boosting Autoregressive Image Generation with a Manifold-Aligned Semantic Clustering	Lixuan He et.al.	2510.04220	null
2025-10-05	World-To-Image: Grounding Text-to-Image Generation with Agent-Driven World Knowledge	Moo Hyun Son et.al.	2510.04201	null
2025-10-05	Let Features Decide Their Own Solvers: Hybrid Feature Caching for Diffusion Transformers	Shikang Zheng et.al.	2510.04188	null
2025-10-05	Relief of EGFR/FOS-downregulated miR-103a by loganin alleviates NF-kappaB-triggered inflammation and gut barrier disruption in colitis	Yan Li et.al.	2510.04176	null
2025-10-05	Drax: Speech Recognition with Discrete Flow Matching	Aviv Navon et.al.	2510.04162	null
2025-10-05	GDiffuSE: Diffusion-based speech enhancement with noise model guidance	Efrayim Yanir et.al.	2510.04157	null
2025-10-05	ObCLIP: Oblivious CLoud-Device Hybrid Image Generation with Privacy Preservation	Haoqi Wu et.al.	2510.04153	null
2025-10-05	Self Speculative Decoding for Diffusion Large Language Models	Yifeng Gao et.al.	2510.04147	null
2025-10-05	Beyond Next-Token Prediction: A Performance Characterization of Diffusion versus Autoregressive Language Models	Minseo Kim et.al.	2510.04146	null
2025-10-05	Joint Learning of Pose Regression and Denoising Diffusion with Score Scaling Sampling for Category-level 6D Pose Estimation	Seunghyun Lee et.al.	2510.04125	null
2025-10-07	Harnessing LLM for Noise-Robust Cognitive Diagnosis in Web-Based Intelligent Education Systems	Guixian Zhang et.al.	2510.04093	null
2025-10-05	What Makes Diffusion Language Models Super Data Learners?	Zitian Gao et.al.	2510.04071	null
2025-10-05	Diffusion Low Rank Hybrid Reconstruction for Sparse View Medical Imaging	Zongyin Deng et.al.	2510.04069	null
2025-10-05	Approaching the scaling limit of transport through lattices with dephasing	Subhajit Sarkar et.al.	2510.04062	null
2025-10-05	Variational Diffusion Unlearning: A Variational Inference Framework for Unlearning in Diffusion Models under Data Constraints	Subhodip Panda et.al.	2510.04058	null
2025-10-05	Prompt-to-Prompt: Text-Based Image Editing Via Cross-Attention Mechanisms – The Research of Hyperparameters and Novel Mechanisms to Enhance Existing Frameworks	Linn Bieske et.al.	2510.04034	null
2025-10-05	Principled and Tractable RL for Reasoning with Diffusion Language Models	Anthony Zhan et.al.	2510.04019	null
2025-10-05	Dual Pruning and Sorting-Free Overestimation for Average-Utility Sequential Pattern Mining	Kai Cao et.al.	2510.04014	null
2025-10-05	Optimal estimation of a factorizable density using diffusion models with ReLU neural networks	Jianqing Fan et.al.	2510.03994	null
2025-10-05	Long time evolution of a pair of 2D viscous point vortices	Ping Zhang et.al.	2510.03991	null
2025-10-04	A discrete data assimilation algorithm for the reconstruction of Gray–Scott dynamics	Tsiry Avisoa Randrianasolo et.al.	2510.03972	null
2025-10-04	Global weak martingale solutions to a stochastic two-sidedly degenerate aggregation-diffusion equation issued from biology	Mostafa Bendahmane et.al.	2510.03947	null
2025-10-04	Super-resolution image projection over an extended depth of field using a diffractive decoder	Hanlong Chen et.al.	2510.03938	null
2025-10-04	Self-Speculative Masked Diffusions	Andrew Campbell et.al.	2510.03929	null
2025-10-04	High-order, Compact, and Symmetric Finite Difference Methods for a $d$ -Dimensional Hypercube	Qiwei Feng et.al.	2510.03927	null
2025-10-04	Generating Human Motion Videos using a Cascaded Text-to-Video Framework	Hyelin Nam et.al.	2510.03909	null
2025-10-04	Rare Text Semantics Were Always There in Your Diffusion Transformer	Seil Kang et.al.	2510.03886	null
2025-10-04	Adversarial Agent Collaboration for C to Rust Translation	Tianyu Li et.al.	2510.03879	null
2025-10-04	PoseGaze-AHP: A Knowledge-Based 3D Dataset for AI-Driven Ocular and Postural Diagnosis	Saja Al-Dabet et.al.	2510.03873	null
2025-10-04	SDAKD: Student Discriminator Assisted Knowledge Distillation for Super-Resolution Generative Adversarial Networks	Nikolaos Kaparinos et.al.	2510.03870	null
2025-10-04	Mirage: Unveiling Hidden Artifacts in Synthetic Images with Large Vision-Language Models	Pranav Sharma et.al.	2510.03840	null
2025-10-04	Proximal Diffusion Neural Sampler	Wei Guo et.al.	2510.03824	null
2025-10-04	Contrastive-SDE: Guiding Stochastic Differential Equations with Contrastive Learning for Unpaired Image-to-Image Translation	Venkata Narendra Kotyada et.al.	2510.03821	null
2025-10-04	Diverse Text-to-Image Generation via Contrastive Noise Optimization	Byungjun Kim et.al.	2510.03813	null
2025-10-04	A Variational Method for Conformable Fractional Equations Using Rank-One Updates	Maatank Parashar et.al.	2510.03778	null
2025-10-04	Bridging the Gap Between Multimodal Foundation Models and World Models	Xuehai He et.al.	2510.03727	null
2025-10-04	Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models	Leander Girrbach et.al.	2510.03721	null
2025-10-04	Non-negative diffusion bridge of the McKean-Vlasov type: analysis of singular diffusion and application to fish migration	Hidekazu Yoshioka et.al.	2510.03692	null
2025-10-03	Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner	Cai Zhou et.al.	2510.03206	null
2025-10-03	Memory Forcing: Spatio-Temporal Memory for Consistent Scene Generation on Minecraft	Junchao Huang et.al.	2510.03198	null
2025-10-03	Product-Quantised Image Representation for High-Quality Image Synthesis	Denis Zavadski et.al.	2510.03191	null
2025-10-03	HESS J1831 $-$ 098 – Exploring a pulsar halo scenario with H.E.S.S. data	Karim Sabri et.al.	2510.03183	null
2025-10-03	UniShield: An Adaptive Multi-Agent Framework for Unified Forgery Image Detection and Localization	Qing Huang et.al.	2510.03161	null
2025-10-03	Mask2IV: Interaction-Centric Video Generation via Mask Trajectories	Gen Li et.al.	2510.03135	null
2025-10-03	HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion	Shiyi Zhang et.al.	2510.03122	null
2025-10-03	Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction	Kaisi Guan et.al.	2510.03117	null
2025-10-03	GeoComplete: Geometry-Aware Diffusion for Reference-Driven Image Completion	Beibei Lin et.al.	2510.03110	null
2025-10-03	Deciphering the radio-star formation correlation on kpc scales. IV. Radio halos of highly-inclined Virgo cluster spiral galaxies	B. Vollmer et.al.	2510.03098	null
2025-10-03	Distilled Protein Backbone Generation	Liyang Xie et.al.	2510.03095	null
2025-10-03	Latent Diffusion Unlearning: Protecting Against Unauthorized Personalization Through Trajectory Shifted Perturbations	Naresh Kumar Devulapally et.al.	2510.03089	null
2025-10-03	What Drives Compositional Generalization in Visual Generative Models?	Karim Farid et.al.	2510.03075	null
2025-10-03	Self-consistent model of cosmic ray penetration into molecular clouds: Effect of energy losses	D. O. Chernyshov et.al.	2510.03073	null
2025-10-03	Rogue waves in extended Gross-Pitaevskii Models with a Lee-Huang-Yang correction	Sathyanarayanan Chandramouli et.al.	2510.03063	null
2025-10-03	When and Where do Events Switch in Multi-Event Video Generation?	Ruotong Liao et.al.	2510.03049	null
2025-10-03	Physics-Constrained Inc-GAN for Tunnel Propagation Modeling from Sparse Line Measurements	Yang Zhou et.al.	2510.03019	null
2025-10-03	Learning Robust Diffusion Models from Imprecise Supervision	Dong-Dong Wu et.al.	2510.03016	null
2025-10-03	3D-CovDiffusion: 3D-Aware Diffusion Policy for Coverage Path Planning	Chenyuan Chen et.al.	2510.03011	null
2025-10-03	TIT-Score: Evaluating Long-Prompt Based Text-to-Image Alignment via Text-to-Image-to-Text Consistency	Juntong Wang et.al.	2510.02987	null
2025-10-03	Multi-faceted light pollution modelling and its application to the decline of artificial illuminance in France	Rolf Buhler et.al.	2510.02977	null
2025-10-03	Long-Time Analysis of Stochastic Heavy Ball Dynamics for Convex Optimization and Monotone Equations	Radu Ioan Bot et.al.	2510.02951	null
2025-10-03	Stationarity preserving nodal Finite Element methods for multi-dimensional linear hyperbolic balance laws via a Global Flux quadrature formulation	Wasilij Barsukow et.al.	2510.02928	null
2025-10-03	Probing a theoretical framework for a Photonic Extreme Learning Machine	Vicente Rocha et.al.	2510.02918	null
2025-10-03	SALSA-V: Shortcut-Augmented Long-form Synchronized Audio from Videos	Amir Dellali et.al.	2510.02916	null
2025-10-03	DMark: Order-Agnostic Watermarking for Diffusion Large Language Models	Linyu Wu et.al.	2510.02902	null
2025-10-03	Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models	Tianren Ma et.al.	2510.02880	null
2025-10-03	Dust scattering halo of 4U 1630-47: High resolution X-ray and mm observations constrain source and molecular cloud distances	E. Kalemci et.al.	2510.02879	null
2025-10-03	Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech	Hieu-Nghia Huynh-Nguyen et.al.	2510.02848	null
2025-10-03	TridentServe: A Stage-level Serving System for Diffusion Pipelines	Yifei Xia et.al.	2510.02838	null
2025-10-03	Multi-scale Autoregressive Models are Laplacian, Discrete, and Latent Diffusion Models in Disguise	Steve Hong et.al.	2510.02826	null
2025-10-03	PromptMap: Supporting Exploratory Text-to-Image Generation	Yuhan Guo et.al.	2510.02814	null
2025-10-03	TeV Emission from PSR B1055-52 with HESS: Evidence for a Pulsar Halo	Tina Wach et.al.	2510.02802	null
2025-10-03	SongFormer: Scaling Music Structure Analysis with Heterogeneous Supervision	Chunbo Hao et.al.	2510.02797	null
2025-10-03	Periodic Event-Triggered Prescribed Time Control of Euler-Lagrange Systems under State and Input Constraints	Chidre Shravista Kashyap et.al.	2510.02769	null
2025-10-03	Neural Jump ODEs as Generative Models	Robert A. Crowell et.al.	2510.02757	null
2025-10-03	Wide-field GMRT imaging of X-shaped Radio-Galaxies: Spectral properties of 4C32.25 and 4C61.23	E. Retana-Montenegro et.al.	2510.02753	null
2025-10-03	Denoising and Augmentation: A Dual Use of Diffusion Model for Enhanced CSI Recovery	Yupeng Li et.al.	2510.02744	null
2025-10-03	Dale meets Langevin: A Multiplicative Denoising Diffusion Model	Nishanth Shetty et.al.	2510.02730	null
2025-10-03	Flow Matching for Measure Transport and Feedback Stabilization of Control-Affine Systems	Karthik Elamvazhuthi et.al.	2510.02706	null
2025-10-03	RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization	Kai Fukazawa et.al.	2510.02695	null
2025-10-03	Fine-Tuning Diffusion Models via Intermediate Distribution Shaping	Gautham Govind Anil et.al.	2510.02692	null
2025-10-03	Ohta-Kawasaki Model Reveals Patterns on Multicomponent Vesicles	Wangbo Luo et.al.	2510.02688	null
2025-10-03	Smart-GRPO: Smartly Sampling Noise for Efficient RL of Flow-Matching Models	Benjamin Yu et.al.	2510.02654	null
2025-10-03	Dispersion Relations and Pole-Skipping in a Holographic Charmonium Model with Rotating Plasma	Luiz F. Ferreira et.al.	2510.02647	null
2025-10-03	Deep Generative Continual Learning using Functional LoRA: FunLoRA	Victor Enescu et.al.	2510.02631	null
2025-10-02	Input-Aware Sparse Attention for Real-Time Co-Speech Video Generation	Beijia Lu et.al.	2510.02617	null
2025-10-02	UMI-on-Air: Embodiment-Aware Guidance for Embodiment-Agnostic Visuomotor Policies	Harsh Gupta et.al.	2510.02614	null
2025-10-02	PEO: Training-Free Aesthetic Quality Enhancement in Pre-Trained Text-to-Image Diffusion Models with Prompt Embedding Optimization	Hovhannes Margaryan et.al.	2510.02599	null
2025-10-02	Surface Wave Solutions in 1D and 2D for the Broer-Kaup-Boussinesq-Kupershmidt (BKBK) System	Darryl D. Holm et.al.	2510.02577	null
2025-10-02	How Confident are Video Models? Empowering Video Models to Express their Uncertainty	Zhiting Mei et.al.	2510.02571	null
2025-10-02	Learning Microswimmer Collision Dynamics and Predicting Diffusivities using a Neural-Network-Assisted Boltzmann Approach	Haruki Hayano et.al.	2510.02559	null
2025-10-02	Stable determination of the nonlinear parameter in the non-diffusive Westervelt equation from the Dirichlet-to-Neumann map	Mike Wendels et.al.	2510.02553	null
2025-10-02	Active-Learning Inspired Ab Initio Theory-Experiment Loop Approach for Management of Material Defects: Application to Superconducting Qubits	Sarvesh Chaudhari et.al.	2510.02544	null
2025-10-02	Self-supervised diffusion model fine-tuning for costate initialization using Markov chain Monte Carlo	Jannik Graebner et.al.	2510.02527	null
2025-10-02	Graph Generation with Spectral Geodesic Flow Matching	Xikun Huang et.al.	2510.02520	null
2025-10-02	Learning a distance measure from the information-estimation geometry of data	Guy Ohayon et.al.	2510.02514	null
2025-10-02	Beyond Linear Diffusions: Improved Representations for Rare Conditional Generative Modeling	Kulunu Dharmakeerthi et.al.	2510.02499	null
2025-10-02	The Entangled Feedback Impacts of Supernovae in Coarse- versus High-Resolution Galaxy Simulations	Eric Zhang et.al.	2510.02432	null
2025-10-02	Optimal Control Meets Flow Matching: A Principled Route to Multi-Subject Fidelity	Eric Tillmann Bill et.al.	2510.02315	null
2025-10-02	Inferring Dynamic Physical Properties from Video Foundation Models	Guanqi Zhan et.al.	2510.02311	null
2025-10-02	NoiseShift: Resolution-Aware Noise Recalibration for Better Low-Resolution Image Generation	Ruozhen He et.al.	2510.02307	null
2025-10-02	Diffusion Models and the Manifold Hypothesis: Log-Domain Smoothing is Geometry Adaptive	Tyler Farghly et.al.	2510.02305	null
2025-10-02	Knowledge Distillation Detection for Open-weights Models	Qin Shi et.al.	2510.02302	null
2025-10-02	Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models	Runqian Wang et.al.	2510.02300	null
2025-10-02	Continual Personalization for Diffusion Models	Yu-Chien Liao et.al.	2510.02296	null
2025-10-02	Test-Time Anchoring for Discrete Diffusion Posterior Sampling	Litu Rout et.al.	2510.02291	null
2025-10-02	MultiModal Action Conditioned Video Generation	Yichen Li et.al.	2510.02287	null
2025-10-02	Learning to Generate Object Interactions with Physics-Guided Video Diffusion	David Romero et.al.	2510.02284	null
2025-10-02	Self-Forcing++: Towards Minute-Scale High-Quality Video Generation	Justin Cui et.al.	2510.02283	null
2025-10-02	Diffusion^2: Turning 3D Environments into Radio Frequency Heatmaps	Kyoungjun Park et.al.	2510.02274	null
2025-10-02	Do You Know Where Your Camera Is? View-Invariant Policy Learning with Camera Conditioning	Tianchong Jiang et.al.	2510.02268	null
2025-10-02	NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes	Shiyi Zhang et.al.	2510.02266	null
2025-10-02	DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing	Zihan Zhou et.al.	2510.02253	null
2025-10-02	TempoControl: Temporal Attention Guidance for Text-to-Video Models	Shira Schiber et.al.	2510.02226	null
2025-10-02	Diffusion Transformers for Imputation: Statistical Efficiency and Uncertainty Quantification	Zeqi Ye et.al.	2510.02216	null
2025-10-02	DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning	Hanyang Zhao et.al.	2510.02212	null
2025-10-02	Measurement-Guided Consistency Model Sampling for Inverse Problems	Amirreza Tanevardi et.al.	2510.02208	null
2025-10-02	Chaotic many-body quantum dynamics, spectral correlations, and energy diffusion	J. T. Chalker et.al.	2510.02198	null
2025-10-02	Uncovering Semantic Selectivity of Latent Groups in Higher Visual Cortex with Mutual Information-Guided Diffusion	Yule Wang et.al.	2510.02182	null
2025-10-02	Policy Gradient Guidance Enables Test Time Control	Jianing Qi et.al.	2510.02148	null
2025-10-02	FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models	Karan Dua et.al.	2510.02133	null
2025-10-02	SoundReactor: Frame-level Online Video-to-Audio Generation	Koichi Saito et.al.	2510.02110	null
2025-10-02	Quantum Effects or Theoretical Artifacts? A Computational Reanalysis of Hydrogen at High-Pressure	Stefano Racioppi et.al.	2510.02098	null
2025-10-02	VGDM: Vision-Guided Diffusion Model for Brain Tumor Detection and Segmentation	Arman Behnam et.al.	2510.02086	null
2025-10-02	Fine-Tuning Flow Matching via Maximum Likelihood Estimation of Reconstructions	Zhaoyi Li et.al.	2510.02081	null
2025-10-02	Spec-Gloss Surfels and Normal-Diffuse Priors for Relightable Glossy Objects	Georgios Kouros et.al.	2510.02069	null
2025-10-02	MSRepaint: Multiple Sclerosis Repaint with Conditional Denoising Diffusion Implicit Model for Bidirectional Lesion Filling and Synthesis	Jinwei Zhang et.al.	2510.02063	null
2025-10-02	Zero-shot Human Pose Estimation using Diffusion-based Inverse solvers	Sahil Bhandary Karnoor et.al.	2510.02043	null
2025-10-02	RAD@home discovery of extragalactic radio rings and odd radio circles: clues to their origins	Ananda Hota et.al.	2510.01999	null
2025-10-02	$\text{G}^2$ RPO: Granular GRPO for Precise Reward in Flow Models	Yujie Zhou et.al.	2510.01982	null
2025-10-02	ZK-WAGON: Imperceptible Watermark for Image Generation Models using ZK-SNARKs	Aadarsh Anantha Ramakrishnan et.al.	2510.01967	null
2025-10-02	StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold	Zhizhong Li et.al.	2510.01938	null
2025-10-02	Dark characterization of Ti/Al LEKIDs for the search of axions in the W-band	Victor Rollano et.al.	2510.01913	null
2025-10-02	A probabilistic representation for the gradient in a linear parabolic PDE with Neumann boundary condition	Abdelatif Benchérif Madani et.al.	2510.01898	null
2025-10-02	Multi-marginal temporal Schrödinger Bridge Matching for video generation from unpaired data	Thomas Gravier et.al.	2510.01894	null
2025-10-02	Fisher information and trajectorial interpretation to the Itô–Langevin relative entropy dissipation	Jiaming Chen et.al.	2510.01870	null
2025-10-04	NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications	Ying-Ren Chien et.al.	2510.01850	null
2025-10-02	Non-Gaussian Rotational Diffusion and Swing Motion of Dumbbell Probes in Two Dimensional Colloids	Jeongmin Kim et.al.	2510.01847	null
2025-10-02	Leveraging Prior Knowledge of Diffusion Model for Person Search	Giyeol Kim et.al.	2510.01841	null
2025-10-02	Representation and Integration by Parts Formulas for Affine Processes	Arturo Kohatsu-Higa et.al.	2510.01839	null
2025-10-02	Intermediate diffusive-ballistic electron conduction around mesoscopic defects in graphene	Toni Markovic et.al.	2510.01821	null
2025-10-02	Mean-field theory of the Santa Fe model revisited: a systematic derivation from an exact BBGKY hierarchy for the zero-intelligence limit-order book model	Taiki Wakatsuki et.al.	2510.01814	null
2025-10-02	Efficient manifold evolution algorithm using adaptive B-Spline interpolation	Muhammad Ammad et.al.	2510.01790	null
2025-10-03	Pack and Force Your Memory: Long-form and Consistent Video Generation	Xiaofei Wu et.al.	2510.01784	null
2025-10-02	Unsupervised Dynamic Feature Selection for Robust Latent Spaces in Vision Tasks	Bruno Corcuera et.al.	2510.01758	null
2025-10-02	Towards Photonic Band Diagram Generation with Transformer-Latent Diffusion Models	Valentin Delchevalerie et.al.	2510.01749	null
2025-10-02	Edge GPU Aware Multiple AI Model Pipeline for Accelerated MRI Reconstruction and Analysis	Ashiyana Abdul Majeed et.al.	2510.01730	null
2025-10-02	First passage times to T cell activation	Tony Wong et.al.	2510.01694	null
2025-10-03	UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction	Jin Cao et.al.	2510.01669	null
2025-10-02	FideDiff: Efficient Diffusion Model for High-Fidelity Image Motion Deblurring	Xiaoyang Liu et.al.	2510.01641	null
2025-10-02	Finite isoresidual covers in strata of $k$ -differentials	Dawei Chen et.al.	2510.01630	null
2025-10-02	Local linearization for estimating the diffusion parameter of nonlinear stochastic wave equations with spatially correlated noise	Guoping Liu et.al.	2510.01627	null
2025-10-02	NPN: Non-Linear Projections of the Null-Space for Imaging Inverse Problems	Roman Jacome et.al.	2510.01608	null
2025-10-02	Securing generative artificial intelligence with parallel magnetic tunnel junction true randomness	Youwei Bao et.al.	2510.01598	null
2025-10-02	TetriServe: Efficient DiT Serving for Heterogeneous Image Generation	Runyu Lu et.al.	2510.01565	null
2025-10-02	MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models	Kevin Zhai et.al.	2510.01549	null
2025-10-02	Growing Visual Generative Capacity for Pre-Trained MLLMs	Hanyu Wang et.al.	2510.01546	null
2025-10-02	Step-Aware Policy Optimization for Reasoning in Diffusion Large Language Models	Shaoan Xie et.al.	2510.01544	null
2025-10-02	Towards Better Optimization For Listwise Preference in Diffusion Models	Jiamu Bai et.al.	2510.01540	null
2025-10-01	Correlation estimates for Brownian particles with singular interactions	Mitia Duerinckx et.al.	2510.01507	null
2025-10-01	AortaDiff: A Unified Multitask Diffusion Framework For Contrast-Free AAA Imaging	Yuxuan Ou et.al.	2510.01498	null
2025-10-01	Purrception: Variational Flow Matching for Vector-Quantized Image Generation	Răzvan-Andrei Matişan et.al.	2510.01478	null
2025-10-03	SCOPED: Score-Curvature Out-of-distribution Proximity Evaluator for Diffusion	Brett Barkley et.al.	2510.01456	null
2025-10-01	Diffusion Modeling of the Three-Dimensional Magnetic Field in the Sun’s Corona	Daniel E. da Silva et.al.	2510.01441	null
2025-10-01	DiffKnock: Diffusion-based Knockoff Statistics for Neural Networks Inference	Heng Ge et.al.	2510.01418	null
2025-10-01	How Well do Diffusion Policies Learn Kinematic Constraint Manifolds?	Lexi Foland et.al.	2510.01404	null
2025-10-01	Localized Pattern Formation and Oscillatory Instabilities in a Three-component Gierer Meinhardt Model	Chunyi Gai et.al.	2510.01401	null
2025-10-01	DisCo: Reinforcement with Diversity Constraints for Multi-Human Generation	Shubhankar Borse et.al.	2510.01399	null
2025-10-01	VENTURA: Adapting Image Diffusion Models for Unified Task Conditioned Navigation	Arthur Zhang et.al.	2510.01388	null
2025-10-01	Fine-Tuning Masked Diffusion for Provable Self-Correction	Jaeyeon Kim et.al.	2510.01384	null
2025-10-01	Selective Underfitting in Diffusion Models	Kiwhan Song et.al.	2510.01378	null
2025-10-01	Microquasars as the major contributors to Galactic cosmic rays around the “knee”	Samy Kaci et.al.	2510.01369	null
2025-10-01	Image Generation Based on Image Style Extraction	Shuochen Chang et.al.	2510.01347	null
2025-10-01	Discovery of diffuse gamma-ray emission in the vicinity of G172.8+1.5: An old supernova remnant with different turbulence properties	Yuan Li et.al.	2510.01340	null
2025-10-01	LVTINO: LAtent Video consisTency INverse sOlver for High Definition Video Restoration	Alessio Spagnoletti et.al.	2510.01339	null
2025-10-01	Dynamical Excitation as a probe of planetary origins	Brad M. S. Hansen et.al.	2510.01332	null
2025-10-01	Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling	Huangjie Zheng et.al.	2510.01329	null
2025-10-01	Combining complex Langevin dynamics with score-based and energy-based diffusion models	Gert Aarts et.al.	2510.01328	null
2025-10-01	IMAGEdit: Let Any Subject Transform	Fei Shen et.al.	2510.01186	null
2025-10-01	Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models	Yanbo Xu et.al.	2510.01184	null
2025-10-01	EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory	Jiahao Wang et.al.	2510.01183	null
2025-10-01	Vanishing Acts: Quantifying Black Hole Formation with the DSNB Signal	Tim Charissé et.al.	2510.01177	null
2025-10-01	Audio Driven Real-Time Facial Animation for Social Telepresence	Jiye Lee et.al.	2510.01176	null
2025-10-01	Code2Video: A Code-centric Paradigm for Educational Video Generation	Yanzhe Chen et.al.	2510.01174	null
2025-10-01	Multi-Marginal Flow Matching with Adversarially Learnt Interpolants	Oskar Kviman et.al.	2510.01159	null
2025-10-01	Superpositions of Quantum Gaussian Processes	Lorenzo Braccini et.al.	2510.01156	null
2025-10-01	Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition	Jiahang Cao et.al.	2510.01068	null
2025-10-01	ReSWD: ReSTIR’d, not shaken. Combining Reservoir Sampling and Sliced Wasserstein Distance for Variance Reduction	Mark Boss et.al.	2510.01061	null
2025-10-01	Authentic Discrete Diffusion Model	Xiao Li et.al.	2510.01047	null
2025-10-01	Gated X-TFC: Soft Domain Decomposition for Forward and Inverse Problems in Sharp-Gradient PDEs	Vikas Dwivedi et.al.	2510.01039	null
2025-10-01	Secure and reversible face anonymization with diffusion models	Pol Labarbarie et.al.	2510.01031	null
2025-10-01	Syntax-Guided Diffusion Language Models with User-Integrated Personalization	Ruqian Zhang et.al.	2510.01028	null
2025-10-01	Equivariant Geometric Scattering Networks via Vector Diffusion Wavelets	David R. Johnson et.al.	2510.01022	null
2025-10-01	Molecular Mobility of Extraterrestrial Ices: Surface Diffusion in Astrochemistry and Planetary Science	N. F. W. Ligterink et.al.	2510.01018	null
2025-10-01	ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning	Yuxiang Guo et.al.	2510.01010	null
2025-10-02	SoftCFG: Uncertainty-guided Stable Guidance for Visual Autoregressive Model	Dongli Xu et.al.	2510.00996	null
2025-10-01	Riemannian Consistency Model	Chaoran Cheng et.al.	2510.00983	null
2025-10-01	JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation	Siheng Wan et.al.	2510.00974	null
2025-09-30	Stitch: Training-Free Position Control in Multimodal Diffusion Transformers	Jessica Bader et.al.	2509.26644	null
2025-09-30	Query-Kontext: An Unified Multimodal Model for Image Generation and Editing	Yuxin Song et.al.	2509.26641	null
2025-09-30	Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training	Junlin Han et.al.	2509.26625	null
2025-09-30	DiffCamera: Arbitrary Refocusing on Images	Yiyang Wang et.al.	2509.26599	null
2025-09-30	Stable Cinemetrics : Structured Taxonomy and Evaluation for Professional Video Generation	Agneet Chatterjee et.al.	2509.26555	null
2025-09-30	Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents	Zhen Yang et.al.	2509.26539	null
2025-09-30	HilbertA: Hilbert Attention for Image Generation with Diffusion Models	Shaoyi Zheng et.al.	2509.26538	null
2025-09-30	Stab-QRAM: An All-Clifford Quantum Random Access Memory for Special Data	Guangyi Li et.al.	2509.26494	null
2025-09-30	Contrastive Diffusion Guidance for Spatial Inverse Problems	Sattwik Basu et.al.	2509.26489	null
2025-09-30	dParallel: Learnable Parallel Decoding for dLLMs	Zigeng Chen et.al.	2509.26488	null
2025-09-30	Closures of moment expansion of anisotropic active Brownian particles	Timothée Gautry et.al.	2509.26453	null
2025-09-30	Post-Training Quantization via Residual Truncation and Zero Suppression for Diffusion Models	Donghoon Kim et.al.	2509.26436	null
2025-10-01	AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size	Guanxi Lu et.al.	2509.26432	null
2025-09-30	MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation	Chenhui Zhu et.al.	2509.26391	null
2025-09-30	The Effective Reactivity for Capturing Brownian Motion by Partially Reactive Patches on a Spherical Surface	Denis S. Grebenkov et.al.	2509.26381	null
2025-09-30	Go with Your Gut: Scaling Confidence for Autoregressive Image Generation	Harold Haodong Chen et.al.	2509.26376	null
2025-09-30	Competition of small targets in planar domains: from Dirichlet to Robin and Steklov boundary condition	Denis S. Grebenkov et.al.	2509.26367	null
2025-09-30	Data-to-Energy Stochastic Dynamics	Kirill Tamogashev et.al.	2509.26364	null
2025-09-30	Universal critical dynamics near the chiral phase transition and the QCD critical point	Yunxin Ye et.al.	2509.26355	null
2025-09-30	Fast-dLLM v2: Efficient Block-Diffusion LLM	Chengyue Wu et.al.	2509.26328	null
2025-09-30	Anomaly detection for generic failure monitoring in robotic assembly, screwing and manipulation	Niklas Grambow et.al.	2509.26308	null
2025-09-30	Two-component diffuse Galactic gamma-ray emission revealed with Fermi-LAT	Qi-Ling Chen et.al.	2509.26290	null
2025-09-30	3DiFACE: Synthesizing and Editing Holistic 3D Facial Animation	Balamurugan Thambiraja et.al.	2509.26233	null
2025-09-30	IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance	Jiayi Guo et.al.	2509.26231	null
2025-09-30	Basic Cycle Ratio: Cost-Effective Ranking of Influential Spreaders from Local and Global Perspectives	Wenxin Zheng et.al.	2509.26220	null
2025-09-30	Exact rate of convergence for the empirical measure of a subordinated process in $p$ -Wasserstein distance	René L. Schilling et.al.	2509.26188	null
2025-09-30	BABY 1L: First Tritium Breeding Campaign Results	Rémi Delaporte-Mathurin et.al.	2509.26174	null
2025-09-30	Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models	Yuansen Liu et.al.	2509.26165	null
2025-09-30	Towards Continual Expansion of Data Coverage: Automatic Text-guided Edge-case Synthesis	Kyeongryeol Go et.al.	2509.26158	null
2025-09-30	EchoGen: Generating Visual Echoes in Any Scene via Feed-Forward Subject-Driven Auto-Regressive Model	Ruixiao Dong et.al.	2509.26127	null
2025-10-01	Tracer diffusion coefficients in a sheared granular gas. Exact results	David González Méndez et.al.	2509.26115	null
2025-09-30	EVODiff: Entropy-aware Variance Optimized Diffusion Inference	Shigui Li et.al.	2509.26096	null
2025-09-30	The diffusion-driven orthorhombic to tetragonal transition in YBa $_2$Cu$_3$O$_7$ derived with a machine learning interatomic potential	Davide Gambino et.al.	2509.26095	null
2025-09-30	Fading to Grow: Growing Preference Ratios via Preference Fading Discrete Diffusion for Recommendation	Guoqing Hu et.al.	2509.26063	null
2025-09-30	Initial traces and solvability of the fast diffusion equation with power-type nonlinearity	Kazuhiro Ishige et.al.	2509.26054	null
2025-09-30	PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution	Shian Du et.al.	2509.26025	null
2025-09-30	New Fourth-Order Grayscale Indicator-Based Telegraph Diffusion Model for Image Despeckling	Rajendra K. Ray et.al.	2509.26010	null
2025-10-02	VRWKV-Editor: Reducing quadratic complexity in transformer-based video editing	Abdelilah Aitrouga et.al.	2509.25998	null
2025-09-30	Exact Solutions to the Quantum Schrödinger Bridge Problem	Mykola Bordyuh et.al.	2509.25980	null
2025-09-30	Weak-strong uniqueness for general cross-diffusion systems with volume filling	Maria Heitzinger et.al.	2509.25978	null
2025-09-30	Data-Free Continual Learning of Server Models in Model-Heterogeneous Federated learning	Xiao Zhang et.al.	2509.25977	null
2025-09-30	CO3: Contrasting Concepts Compose Better	Debottam Dutta et.al.	2509.25940	null
2025-09-30	Bringing Emerging Architectures to Sequence Labeling in NLP	Ana Ezquerro et.al.	2509.25918	null
2025-10-01	LLaVAShield: Safeguarding Multimodal Multi-Turn Dialogues in Vision-Language Models	Guolei Huang et.al.	2509.25896	null
2025-10-01	A Multimodal LLM Approach for Visual Question Answering on Multiparametric 3D Brain MRI	Arvind Murari Vepa et.al.	2509.25889	null
2025-09-30	Kinetics of the photochromic effect in oxygen-containing rare-earth hydrides	Dmitrii Moldarev et.al.	2509.25887	null
2025-09-30	Training-Free Reward-Guided Image Editing via Trajectory Optimal Control	Jinho Chang et.al.	2509.25845	null
2025-09-30	HiStyle: Hierarchical Style Embedding Predictor for Text-Prompt-Guided Controllable Speech Synthesis	Ziyu Zhang et.al.	2509.25842	null
2025-10-01	Act to See, See to Act: Diffusion-Driven Perception-Action Interplay for Adaptive Policies	Jing Wang et.al.	2509.25822	null
2025-09-30	Pre-equilibrium charm quark dynamics and their impact on D-Meson observables	Manu Kurian et.al.	2509.25806	null
2025-09-30	Numerical approximations to invariant measures of hybrid stochastic differential equations with superlinear coefficients via the backward Euler-Maruyama method	Wei Liu et.al.	2509.25799	null
2025-09-30	PUREVQ-GAN: Defending Data Poisoning Attacks through Vector-Quantized Bottlenecks	Alexander Branch et.al.	2509.25792	null
2025-09-30	Editable Noise Map Inversion: Encoding Target-image into Noise For High-Fidelity Image Manipulation	Mingyu Kang et.al.	2509.25776	null
2025-09-30	PCPO: Proportionate Credit Policy Optimization for Aligning Image Generation Models	Jeongjae Lee et.al.	2509.25774	null
2025-09-30	Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs	Jia Jun Cheng Xian et.al.	2509.25771	null
2025-09-30	Quasi-Monte Carlo methods for uncertainty quantification of tumor growth modeled by a parametric semi-linear parabolic reaction-diffusion equation	Alexander D. Gilbert et.al.	2509.25753	null
2025-09-30	ART-VITON: Measurement-Guided Latent Diffusion for Artifact-Free Virtual Try-On	Junseo Park et.al.	2509.25749	null
2025-09-30	LieHMR: Autoregressive Human Mesh Recovery with $SO(3)$ Diffusion	Donghwan Kim et.al.	2509.25739	null
2025-09-30	LaTo: Landmark-tokenized Diffusion Transformer for Fine-grained Human Face Editing	Zhenghao Zhang et.al.	2509.25731	null
2025-09-30	Controlled Generation for Private Synthetic Text	Zihao Zhao et.al.	2509.25729	null
2025-09-30	How Diffusion Models Memorize	Juyeop Kim et.al.	2509.25705	null
2025-09-30	Radiative hydrodynamic simulations of FIP fractionation in solar flares	Jeffrey W. Reep et.al.	2509.25695	null
2025-09-30	Hierarchical Diffusion Motion Planning with Task-Conditioned Uncertainty-Aware Priors	Amelie Minji Kim et.al.	2509.25685	null
2025-09-30	dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought	Junjie Wen et.al.	2509.25681	null
2025-09-30	Swift: An Autoregressive Consistency Model for Efficient Weather Forecasting	Jason Stock et.al.	2509.25631	null
2025-09-30	Mean Field Type Control Problems Driven by Jump-diffusions	Alain Bensoussan et.al.	2509.25614	null
2025-09-29	RFG: Test-Time Scaling for Diffusion Large Language Model Reasoning with Reward-Free Guidance	Tianlang Chen et.al.	2509.25604	null
2025-09-29	MoReFlow: Motion Retargeting Learning through Unsupervised Flow Matching	Wontaek Kim et.al.	2509.25600	null
2025-09-29	Machine Learning Algorithms for Improving Black Box Optimization Solvers	Morteza Kimiaei et.al.	2509.25592	null
2025-09-29	IRIS: Intrinsic Reward Image Synthesis	Yihang Chen et.al.	2509.25562	null
2025-09-29	Spatiotemporal Forecasting of Incidents and Congestion with Implications for Sustainable Traffic Control	Tony Kinchen et.al.	2509.25515	null
2025-09-29	Non-Gaussian statistics of concentration fluctuations in free liquid diffusion	Marco Bussoletti et.al.	2509.25511	null
2025-09-29	Analysis of a Cahn–Hilliard model for viscoelastoplastic two-phase flows	Fan Cheng et.al.	2509.25508	null
2025-09-29	Kinetic Monte Carlo prediction of the morphology of pentaerythritol tetranitrate	Jacob Jeffries et.al.	2509.25490	null
2025-09-29	Noise estimation of SDE from a single data trajectory	Munawar Ali et.al.	2509.25484	null
2025-09-29	Translation from Wearable PPG to 12-Lead ECG	Hui Ji et.al.	2509.25480	null
2025-09-29	Exponential Hedging for the Ornstein-Uhlenbeck Process in the Presence of Linear Price Impact	Yan Dolinsky et.al.	2509.25472	null
2025-09-29	Generating Differentially Private Networks with a Modified Erdős-Rényi Model	Huaiyuan Rao et.al.	2509.25431	null
2025-09-29	Stochastic dynamics on evolving geometric graphs	Alexei Daletskii et.al.	2509.25427	null
2025-09-29	Electropolishing-Induced Topographic Defects in Niobium: Insights and Implications for Superconducting Radio Frequency Applications	Oleksandr Hryhorenko et.al.	2509.25423	null
2025-09-29	Emotion-Aligned Generation in Diffusion Text to Speech Models via Preference-Guided Optimization	Jiacheng Shi et.al.	2509.25416	null
2025-09-29	FlashOmni: A Unified Sparse Attention Engine for Diffusion Transformers	Liang Qiao et.al.	2509.25401	null
2025-09-29	Let Physics Guide Your Protein Flows: Topology-aware Unfolding and Generation	Yogesh Verma et.al.	2509.25379	null
2025-09-29	Safe and Stable Control via Lyapunov-Guided Diffusion Models	Xiaoyuan Cheng et.al.	2509.25375	null
2025-09-29	Diffusion with doubly stochastic resetting	Maxence Arutkin et.al.	2509.25365	null
2025-09-29	The spatially-resolved effect of mergers on the stellar mass assembly of MaNGA galaxies	Eirini Angeloudi et.al.	2509.25340	null
2025-09-29	LUMA: Low-Dimension Unified Motion Alignment with Dual-Path Anchoring for Text-to-Motion Diffusion Model	Haozhe Jia et.al.	2509.25304	null
2025-09-29	Learning to Parallel: Accelerating Diffusion Large Language Models via Adaptive Parallel Decoding	Wenrui Bao et.al.	2509.25188	null
2025-09-29	FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in Image-to-Video Generation	Yunyang Ge et.al.	2509.25187	null
2025-09-29	Guided Diffusion for the Discovery of New Superconductors	Pawan Prakash et.al.	2509.25186	null
2025-09-29	DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder	Junyu Chen et.al.	2509.25182	null
2025-09-29	A bound-preserving multinumerics scheme for steady-state convection-diffusion equations	Maurice S. Fabien et.al.	2509.25181	null
2025-10-01	DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space	Wenkun He et.al.	2509.25180	null
2025-09-29	GHOST: Hallucination-Inducing Image Generation for Multimodal LLMs	Aryan Yazdan Parast et.al.	2509.25178	null
2025-09-29	Personalized Vision via Visual In-Context Learning	Yuxin Jiang et.al.	2509.25172	null
2025-09-29	TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion	Sophia Tang et.al.	2509.25171	null
2025-09-29	GLASS Flows: Transition Sampling for Alignment of Flow and Diffusion Models	Peter Holderrieth et.al.	2509.25170	null
2025-09-29	Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models	Bowei Chen et.al.	2509.25162	null
2025-09-29	Rolling Forcing: Autoregressive Long Video Diffusion in Real Time	Kunhao Liu et.al.	2509.25161	null
2025-09-29	GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts	Fan Yuan et.al.	2509.25160	null
2025-09-29	LayerD: Decomposing Raster Graphic Designs into Layers	Tomoyuki Suzuki et.al.	2509.25134	null
2025-09-29	Score Distillation of Flow Matching Models	Mingyuan Zhou et.al.	2509.25127	null
2025-09-29	Diffuse Domain Methods with Dirichlet Boundary Conditions	Luke Benfield et.al.	2509.25115	null
2025-09-29	MANI-Pure: Magnitude-Adaptive Noise Injection for Adversarial Purification	Xiaoyi Huang et.al.	2509.25082	null
2025-09-29	Towards a Certificate of Trust: Task-Aware OOD Detection for Scientific AI	Bogdan Raonić et.al.	2509.25080	null
2025-09-29	UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation	Guanjun Wu et.al.	2509.25079	null
2025-09-29	Interstellar Dust-Catalyzed Molecular Hydrogen Formation Enabled by Nuclear Quantum Effects	Xiaolong Yang et.al.	2509.25070	null
2025-09-29	Collective transport efficiency of microswimmer swarms optimized by tactic run-tumble dynamics	Maggie Liu et.al.	2509.25068	null
2025-09-29	CharGen: Fast and Fluent Portrait Modification	Jan-Niklas Dihlmann et.al.	2509.25058	null
2025-09-29	Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models	Shuchen Xue et.al.	2509.25050	null
2025-09-29	Scaling Synthetic Task Generation for Agents via Exploration	Ram Ramrakhya et.al.	2509.25047	null
2025-09-29	Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct	Haoyang Zheng et.al.	2509.25035	null
2025-09-29	Lagrangian description and quantification of scalar mixing in fluid flows from particle tracks	Anna Klünker et.al.	2509.25030	null
2025-09-29	STAGE: Stable and Generalizable GRPO for Autoregressive Image Generation	Xiaoxiao Ma et.al.	2509.25027	null
2025-09-29	Score-based Membership Inference on Diffusion Models	Mingxing Rao et.al.	2509.25003	null
2025-09-29	PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion	Yuyang Yin et.al.	2509.24997	null
2025-09-29	Path Diffuser: Diffusion Model for Data-Driven Traffic Simulator	Da Saem Lee et.al.	2509.24995	null
2025-09-29	SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation	Shuang Liang et.al.	2509.24980	null
2025-09-30	Wan-Alpha: High-Quality Text-to-Video Generation with Alpha Channel	Haotian Dong et.al.	2509.24979	null
2025-09-29	DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern	Lekang Yang et.al.	2509.24975	null
2025-09-29	Double Descent as a Lens for Sample Efficiency in Autoregressive vs. Discrete Diffusion Models	Ahmad Fraij et.al.	2509.24974	null
2025-09-29	VIVALDy: A Hybrid Generative Reduced-Order Model for Turbulent Flows, Applied to Vortex-Induced Vibrations	Niccolò Tonioni et.al.	2509.24965	null
2025-09-29	Sharp behavior of semilinear damped wave equations driven by mixed local-nonlocal operators	Wenhui Chen et.al.	2509.24940	null
2025-09-29	Scalable GANs with Transformers	Sangeek Hyun et.al.	2509.24935	null
2025-09-29	Precision calculation of $^3$He$(α,γ)^7$ Be for solar physics	Ratna Khadka et.al.	2509.24931	null
2025-09-29	SAGA-SR: Semantically and Acoustically Guided Audio Super-Resolution	Jaekwon Im et.al.	2509.24924	null
2025-09-29	From Code to Action: Hierarchical Learning of Diffusion-VLM Policies	Markus Peschl et.al.	2509.24917	null
2025-09-29	Segmentor-Guided Counterfactual Fine-Tuning for Image Synthesis	Tian Xia et.al.	2509.24913	null
2025-09-29	When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis	Xiang Li et.al.	2509.24912	null
2025-09-29	DRCP: Diffusion on Reinforced Cooperative Perception for Perceiving Beyond Limits	Lantao Li et.al.	2509.24903	null
2025-09-29	OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing	Zhihong Chen et.al.	2509.24900	null
2025-09-29	Attention Surgery: An Efficient Recipe to Linearize Your Video Diffusion Transformer	Mohsen Ghafoorian et.al.	2509.24899	null
2025-09-29	RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark	Yang Shi et.al.	2509.24897	null
2025-09-29	VAGUEGAN: Stealthy Poisoning and Backdoor Attacks on Image Generative Pipelines	Mostafa Mohaimen Akand Faisal et.al.	2509.24891	null
2025-09-29	MMRQA: Signal-Enhanced Multimodal Large Language Models for MRI Quality Assessment	Fankai Jia et.al.	2509.24888	null
2025-09-29	Response to dynamic shape changes in suspensions of hard rectangles	Denis Dertli et.al.	2509.24885	null
2025-09-29	ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation	Jiuhong Xiao et.al.	2509.24878	null
2025-09-29	Environment-Aware Satellite Image Generation with Diffusion Models	Nikos Kostagiolas et.al.	2509.24875	null
2025-09-29	Causal-Adapter: Taming Text-to-Image Diffusion for Faithful Counterfactual Generation	Lei Tong et.al.	2509.24798	null
2025-09-29	Fidelity-Aware Data Composition for Robust Robot Generalization	Zizhao Tong et.al.	2509.24797	null
2025-09-29	Collision types and times in interacting particle systems	Sergio Andraus et.al.	2509.24790	null
2025-09-29	FESTIM v2.0: Upgraded framework for multi-species hydrogen transport and enhanced performance	James Dark et.al.	2509.24760	null
2025-09-29	ExGS: Extreme 3D Gaussian Compression with Diffusion Priors	Jiaqi Chen et.al.	2509.24758	null
2025-09-29	Fabrication of hydrogen-bonded metal inorganic-organic complex glasses by ligand-tuning approach	Tianzhao Xu et.al.	2509.24755	null
2025-09-29	Geometric structure of stationary problem for spatial 1D self-diffusion equation with logistic growth	Yu ICHIDA et.al.	2509.24752	null
2025-09-29	Direct numerical simulation of two-phase flows with surfactant-induced surface viscous effects	Debashis Panda et.al.	2509.24722	null
2025-09-29	MAD: Manifold Attracted Diffusion	Dennis Elbrächter et.al.	2509.24710	null
2025-09-29	Enhancing Physical Plausibility in Video Generation by Reasoning the Implausibility	Yutong Hao et.al.	2509.24702	null
2025-09-29	SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer	Junsong Chen et.al.	2509.24695	null
2025-09-29	The influence of solute induced memory on interface migration	Chad W. Sinclair et.al.	2509.24668	null
2025-09-29	Learning Object-Centric Representations Based on Slots in Real World Scenarios	Adil Kaan Akan et.al.	2509.24652	null
2025-09-29	VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning	Yixuan Zhou et.al.	2509.24650	null
2025-09-30	RIFLE: Removal of Image Flicker-Banding via Latent Diffusion Enhancement	Libo Zhu et.al.	2509.24644	null
2025-09-29	PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control	Haozhuo Zhang et.al.	2509.24591	null
2025-09-29	SAIP: A Plug-and-Play Scale-adaptive Module in Diffusion-based Inverse Problems	Lingyu Wang et.al.	2509.24580	null
2025-09-29	U-DiT Policy: U-shaped Diffusion Transformers for Robotic Manipulation	Linzhi Wu et.al.	2509.24579	null
2025-09-29	SCOPE: Semantic Conditioning for Sim2Real Category-Level Object Pose Estimation in Robotics	Peter Hönig et.al.	2509.24572	null
2025-09-29	Training-Free Multimodal Guidance for Video to Audio Generation	Eleonora Grassucci et.al.	2509.24550	null
2025-09-29	Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis	Kaizhen Zhu et.al.	2509.24531	null
2025-09-29	CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models	Zheyuan Hu et.al.	2509.24526	null
2025-09-29	The role of viral dynamics and infectivity in models of oncolytic virotherapy for tumours with different motility	David Morselli et.al.	2509.24522	null
2025-09-29	Flow Crossover and Parallel Outflow during Collisionless Magnetic Reconnection	Theerasarn Pianpanit et.al.	2509.24513	null
2025-09-29	A Novel Preprocessing Unit for Effective Deep Learning based Classification and Grading of Diabetic Retinopathy	Pranoti Nage et.al.	2509.24497	null
2025-09-29	LaMoGen: Laban Movement-Guided Diffusion for Text-to-Motion Generation	Heechang Kim et.al.	2509.24469	null
2025-09-29	An Agent-Based Framework for Automated Higher-Voice Harmony Generation	Nia D’Souza Ganapathy et.al.	2509.24463	null
2025-09-29	Alternatives To Next Token Prediction In Text Generation – A Survey	Charlie Wyatt et.al.	2509.24435	null
2025-09-29	UI2V-Bench: An Understanding-based Image-to-video Generation Benchmark	Ailing Zhang et.al.	2509.24427	null
2025-09-29	CLQ: Cross-Layer Guided Orthogonal-based Quantization for Diffusion Transformers	Kai Liu et.al.	2509.24416	null
2025-09-29	Unsupervised Single-Channel Speech Separation with a Diffusion Prior under Speaker-Embedding Guidance	Runwu Shi et.al.	2509.24395	null
2025-09-29	LLaDA-MoE: A Sparse MoE Diffusion Language Model	Fengqi Zhu et.al.	2509.24389	null
2025-09-29	Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning	Xin Qiu et.al.	2509.24372	null
2025-09-29	From Satellite to Street: A Hybrid Framework Integrating Stable Diffusion and PanoGAN for Consistent Cross-View Synthesis	Khawlah Bajbaa et.al.	2509.24369	null
2025-09-29	Watermarking Diffusion Language Models	Thibaud Gloaguen et.al.	2509.24368	null
2025-09-29	Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models	Jitai Hao et.al.	2509.24365	null
2025-09-29	DRIFT: Divergent Response in Filtered Transformations for Robust Adversarial Defense	Amira Guesmi et.al.	2509.24359	null
2025-09-29	NeRV-Diffusion: Diffuse Implicit Neural Representations for Video Synthesis	Yixuan Ren et.al.	2509.24353	null
2025-09-29	Hyperspherical Latents Improve Continuous-Token Autoregressive Generation	Guolin Ke et.al.	2509.24335	null
2025-09-29	Wavelet-Assisted Mamba for Satellite-Derived Sea Surface Temperature Super-Resolution	Wankun Chen et.al.	2509.24334	null
2025-09-29	3D Structure of Jet-induced Diffusion Wake	Zhong Yang et.al.	2509.24315	null
2025-09-29	A study of Universal ODE approaches to predicting soil organic carbon	Satyanarayana Raju G. V. V et.al.	2509.24306	null
2025-09-29	High-Precision Temperature Estimation Based on Magnetic Nanoparticles Dominated by Brownian Relaxation under Combined AC and DC Magnetic Fields	Zhongzhou Du et.al.	2509.24301	null
2025-09-29	DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models	Zherui Li et.al.	2509.24296	null
2025-09-29	ASIA: Adaptive 3D Segmentation using Few Image Annotations	Sai Raj Kishore Perla et.al.	2509.24288	null
2025-09-29	Collisional Baryon-Dominated Dwarf Galaxies: A New Probe of Bursty Feedback and Dark Matter Physics	Yi-Ying Wang et.al.	2509.24270	null
2025-09-29	Cycle Diffusion Model for Counterfactual Image Generation	Fangrui Huang et.al.	2509.24267	null
2025-09-29	FreeAction: Training-Free Techniques for Enhanced Fidelity of Trajectory-to-Video Generation	Seungwook Kim et.al.	2509.24241	null
2025-09-29	Geometry-induced criticality in $p$ -adic scaling limits of random walks	Rahul Rajkumar et.al.	2509.24234	null
2025-09-29	Non-Invasive Detection of PROState Cancer with Novel Time-Dependent Diffusion MRI and AI-Enhanced Quantitative Radiological Interpretation: PROS-TD-AI	Baltasar Ramos et.al.	2509.24227	null
2025-09-29	Semantic Editing with Coupled Stochastic Differential Equations	Jianxin Zhang et.al.	2509.24223	null
2025-09-29	The role of the solid-melt interface in accelerating the self-catalyzed growth kinetics of III-V semiconductors	Zhucong Xi et.al.	2509.24206	null
2025-09-30	UniVid: The Open-Source Unified Video Model	Jiabin Luo et.al.	2509.24200	null
2025-09-29	An Efficient 3D Latent Diffusion Model for T1-contrast Enhanced MRI Generation	Zach Eidex et.al.	2509.24194	null
2025-09-29	Simulating Post-Neoadjuvant Chemotherapy Breast Cancer MRI via Diffusion Model with Prompt Tuning	Jonghun Kim et.al.	2509.24185	null
2025-09-29	Tumor Synthesis conditioned on Radiomics	Jonghun Kim et.al.	2509.24182	null
2025-09-29	LatXGen: Towards Radiation-Free and Accurate Quantitative Analysis of Sagittal Spinal Alignment Via Cross-Modal Radiographic View Synthesis	Moxin Zhao et.al.	2509.24165	null
2025-09-29	Asymmetric VAE for One-Step Video Super-Resolution Acceleration	Jianze Li et.al.	2509.24142	null
2025-09-28	GANji: A Framework for Introductory AI Image Generation	Chandon Hamel et.al.	2509.24128	null
2025-09-28	Progressive Layer Stripping Analysis for HVSR Interpretation	Mersad Fathizadeh et.al.	2509.24121	null
2025-09-28	GeoFunFlow: Geometric Function Flow Matching for Inverse Operator Learning over Complex Geometries	Sifan Wang et.al.	2509.24117	null
2025-09-28	BTC-SAM: Leveraging LLMs for Generation of Bias Test Cases for Sentiment Analysis Models	Zsolt T. Kardkovács et.al.	2509.24101	null
2025-09-26	Pixel Motion Diffusion is What We Need for Robot Control	E-Ro Nguyen et.al.	2509.22652	null
2025-09-26	RefAM: Attention Magnets for Zero-Shot Referral Segmentation	Anna Kukleva et.al.	2509.22650	null
2025-09-26	Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs	Xingyu Fu et.al.	2509.22646	null
2025-09-26	Language Models Can Learn from Verbal Feedback Without Scalar Rewards	Renjie Luo et.al.	2509.22638	null
2025-09-26	Scale-Wise VAR is Secretly Discrete Diffusion	Amandeep Kumar et.al.	2509.22636	null
2025-09-26	Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance	Luc Boudier et.al.	2509.22635	null
2025-09-26	LongLive: Real-time Interactive Long Video Generation	Shuai Yang et.al.	2509.22622	null
2025-09-26	Exact solutions of open quantum Brownian motions on the real line for two-level systems	Manuel D. de la Iglesia et.al.	2509.22604	null
2025-09-26	Transport Based Mean Flows for Generative Modeling	Elaheh Akbari et.al.	2509.22592	null
2025-09-26	EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation	Yuan Xu et.al.	2509.22578	null
2025-09-26	UniMIC: Token-Based Multimodal Interactive Coding for Human-AI Collaboration	Qi Mao et.al.	2509.22570	null
2025-09-26	ConQuER: Modular Architectures for Control and Bias Mitigation in IQP Quantum Generative Models	Xiaocheng Zou et.al.	2509.22551	null
2025-09-26	EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model	Andrii Litvynchuk et.al.	2509.22527	null
2025-09-26	JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation	Guillem Capellera et.al.	2509.22522	null
2025-09-26	A phenotype-structured reaction-diffusion model of avascular glioma growth	Francesca Ballatore et.al.	2509.22519	null
2025-09-26	Group Critical-token Policy Optimization for Autoregressive Image Generation	Guohui Zhang et.al.	2509.22485	null
2025-09-26	Bézier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation	Chen Li et.al.	2509.22476	null
2025-09-26	Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)	Nikita Kornilov et.al.	2509.22459	null
2025-09-26	Overclocking Electrostatic Generative Models	Daniil Shlenskii et.al.	2509.22454	null
2025-09-26	LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer	Song Fei et.al.	2509.22414	null
2025-09-26	EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer	Zhehao Dong et.al.	2509.22407	null
2025-09-26	Closing the Safety Gap: Surgical Concept Erasure in Visual Autoregressive Models	Xinhao Zhong et.al.	2509.22400	null
2025-09-26	Gradient-based multi-focus image fusion with focus-aware saliency enhancement	Haoyu Li et.al.	2509.22392	null
2025-09-26	SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis	Marie Brockschmidt et.al.	2509.22352	null
2025-09-26	Decoding quantum low density parity check codes with diffusion	Zejun Liu et.al.	2509.22347	null
2025-09-26	RAPID^3: Tri-Level Reinforced Acceleration Policies for Diffusion Transformer	Wangbo Zhao et.al.	2509.22323	null
2025-09-26	NIFTY: a Non-Local Image Flow Matching for Texture Synthesis	Pierrick Chatillon et.al.	2509.22318	null
2025-09-26	Self-organization mechanism in Bridgman-grown MnBi2Te4/(Bi2Te3)n: influence on layer sequence and magnetic properties	Paweł Skupiński et.al.	2509.22303	null
2025-09-26	HiGS: History-Guided Sampling for Plug-and-Play Enhancement of Diffusion Models	Seyedmorteza Sadat et.al.	2509.22300	null
2025-09-26	Jailbreaking on Text-to-Video Models via Scene Splitting Strategy	Wonjun Lee et.al.	2509.22292	null
2025-09-26	Wavelength-scale noise-resistant on-chip spectrometer	Jianbo Yu et.al.	2509.22286	null
2025-09-26	Conditional Denoising Diffusion Autoencoders for Wireless Semantic Communications	Mehdi Letafati et.al.	2509.22282	null
2025-09-26	FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing	Junyi Wu et.al.	2509.22244	null
2025-09-26	The moving patch model with fractional diffusion	Sebastián Flores-Sepúlveda et.al.	2509.22234	null
2025-09-26	Question-Driven Analysis and Synthesis: Building Interpretable Thematic Trees with LLMs for Text Clustering and Controllable Generation	Tiago Fernandes Tavares et.al.	2509.22211	null
2025-09-26	MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training	Haoyun Li et.al.	2509.22199	null
2025-09-26	DragGANSpace: Latent Space Exploration and Control for GANs	Kirsten Odendaal et.al.	2509.22169	null
2025-09-26	REFINE-CONTROL: A Semi-supervised Distillation Method For Conditional Image Generation	Yicheng Jiang et.al.	2509.22139	null
2025-09-26	Guidance Watermarking for Diffusion Models	Enoal Gesny et.al.	2509.22126	null
2025-09-26	Countering adversarial evasion in regression analysis	David Benfield et.al.	2509.22113	null
2025-09-26	Large Material Gaussian Model for Relightable 3D Generation	Jingrui Ye et.al.	2509.22112	null
2025-09-26	50 mm $\times$ 50 mm Cesium Atomic Vapor Cell for Terahertz Imaging: Implementation and Application	Bin Zhang et.al.	2509.22098	null
2025-09-26	Factor-Based Conditional Diffusion Model for Portfolio Optimization	Xuefeng Gao et.al.	2509.22088	null
2025-09-26	SpecXNet: A Dual-Domain Convolutional Network for Robust Deepfake Detection	Inzamamul Alam et.al.	2509.22070	null
2025-09-26	High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling	Chao Huang et.al.	2509.22063	null
2025-09-26	Comparative Analysis of GAN and Diffusion for MRI-to-CT translation	Emily Honey et.al.	2509.22049	null
2025-09-26	Latent Diffusion : Multi-Dimension Stable Diffusion Latent Space Explorer	Zhihua Zhong et.al.	2509.22038	null
2025-09-26	Stage-wise Dynamics of Classifier-Free Guidance in Diffusion Models	Cheng Jin et.al.	2509.22007	null
2025-09-26	Exposing Hallucinations To Suppress Them: VLMs Representation Editing With Generative Anchors	Youxu Shi et.al.	2509.21997	null
2025-09-26	FailureAtlas:Mapping the Failure Landscape of T2I Models via Active Exploration	Muxi Chen et.al.	2509.21995	null
2025-09-26	Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation	Abdelrahman Eldesokey et.al.	2509.21989	null
2025-09-26	Hybrid Diffusion for Simultaneous Symbolic and Continuous Planning	Sigmund Hennum Høeg et.al.	2509.21983	null
2025-09-26	Electric-field effect on spin diffusion length in solids: An \textit{ab initio} study beyond the drift-diffusion model	Junqing Xu et.al.	2509.21962	null
2025-09-26	MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning	Tao Wu et.al.	2509.21953	null
2025-09-26	Modeling the Equilibrium Vacancy Concentration in Multi-Principal Element Alloys from First-Principles	Damien K. J. Lee et.al.	2509.21944	null
2025-09-26	Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning	Xianghua Zeng et.al.	2509.21942	null
2025-09-26	SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet	Woosung Joung et.al.	2509.21938	null
2025-09-26	EqDiff-CT: Equivariant Conditional Diffusion model for CT Image Synthesis from CBCT	Alzahra Altalib et.al.	2509.21913	null
2025-09-26	Discrete Guidance Matching: Exact Guidance for Discrete Flow Matching	Zhengyan Wan et.al.	2509.21912	null
2025-09-26	Logarithmic evolutions in solutions to the convection-diffusion equation of Burgers type	Masakazu Yamamoto et.al.	2509.21909	null
2025-09-26	Error Analysis of Discrete Flow with Generator Matching	Zhengyan Wan et.al.	2509.21906	null
2025-09-26	TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation	Qihang Wang et.al.	2509.21905	null
2025-09-26	Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers	Jibin Song et.al.	2509.21893	null
2025-09-26	Drag4D: Align Your Motion with Text-Driven 3D Scene Generation	Minjun Kang et.al.	2509.21888	null
2025-09-26	StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing	Liyang Chen et.al.	2509.21887	null
2025-09-26	Abductive Logical Rule Induction by Bridging Inductive Logic Programming and Multimodal Large Language Models	Yifei Peng et.al.	2509.21874	null
2025-09-26	Deepfakes: we need to re-think the concept of “real” images	Janis Keuper et.al.	2509.21864	null
2025-09-26	DiTraj: training-free trajectory control for video diffusion transformer	Cheng Lei et.al.	2509.21839	null
2025-09-26	On the Complexity Theory of Masked Discrete Diffusion: From $\mathrm{poly}(1/ε)$ to Nearly $ε$ -Free	Xunpeng Huang et.al.	2509.21835	null
2025-09-26	MoWM: Mixture-of-World-Models for Embodied Planning via Latent-to-Pixel Feature Modulation	Yu Shang et.al.	2509.21797	null
2025-09-26	LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE	Yu Shang et.al.	2509.21790	null
2025-09-26	DeHate: A Stable Diffusion-based Multimodal Approach to Mitigate Hate Speech in Images	Dwip Dalal et.al.	2509.21787	null
2025-09-26	UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models	Lan Chen et.al.	2509.21760	null
2025-09-26	Noise-to-Notes: Diffusion-based Generation and Refinement for Automatic Drum Transcription	Michael Yeung et.al.	2509.21739	null
2025-09-26	MESA Isochrones and Stellar Tracks (MIST) III. The White Dwarf Cooling Sequence	Evan B. Bauer et.al.	2509.21717	null
2025-09-26	MusicWeaver: Coherent Long-Range and Editable Music Generation from a Beat-Aligned Structural Plan	Xuanchen Wang et.al.	2509.21714	null
2025-09-25	Snapshot Synthetic Aperture Imaging with Boiling Speckle	Janith B. Senanayaka et.al.	2509.21682	null
2025-09-25	Generating Stable Placements via Physics-guided Diffusion Models	Philippe Nadeau et.al.	2509.21664	null
2025-09-25	RED-DiffEq: Regularization by denoising diffusion models for solving inverse PDE problems with application to full waveform inversion	Siming Shan et.al.	2509.21659	null
2025-09-25	FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction	Yixiang Dai et.al.	2509.21657	null
2025-09-25	DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models	Yinuo Ren et.al.	2509.21655	null
2025-09-25	A comprehensive equivalent circuit model for high overtone bulk acoustic resonators (HBARs)	Vikrant J. Gokhale et.al.	2509.21640	null
2025-09-25	Guiding Audio Editing with Audio Language Model	Zitong Lan et.al.	2509.21625	null
2025-09-25	Message passing for epidemiological interventions on networks with loops	Erik Weis et.al.	2509.21596	null
2025-09-25	Transabdominal Fetal Oximetry via Diffuse Optics: Principled Analysis and Demonstration in Pregnant Ovine Models	Weitai Qian et.al.	2509.21594	null
2025-09-25	What Happens Next? Anticipating Future Motion by Generating Point Trajectories	Gabrijel Boduljak et.al.	2509.21592	null
2025-09-25	X-Streamer: Unified Human World Modeling with Audiovisual Interaction	You Xie et.al.	2509.21574	null
2025-09-25	No Alignment Needed for Generation: Learning Linearly Separable Representations in Diffusion Models	Junno Yun et.al.	2509.21565	null
2025-09-25	ControlHair: Physically-based Video Diffusion for Controllable Dynamic Hair Rendering	Weikai Lin et.al.	2509.21541	null
2025-09-25	Patch-Based Diffusion for Data-Efficient, Radiologist-Preferred MRI Reconstruction	Rohan Sanda et.al.	2509.21531	null
2025-09-25	Shortcut Flow Matching for Speech Enhancement: Step-Invariant flows via single stage training	Naisong Zhou et.al.	2509.21522	null
2025-09-25	DistillKac: Few-Step Image Generation via Damped Wave Equations	Weiqiao Han et.al.	2509.21513	null
2025-09-25	Quantum algorithms for solving a drift-diffusion equation: analysing circuit depths	Ellen Devereux et.al.	2509.21509	null
2025-09-25	SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models	Arani Roy et.al.	2509.21498	null
2025-09-25	d2: Improved Techniques for Training Reasoning Diffusion Language Models	Guanghan Wang et.al.	2509.21474	null
2025-09-25	Are Hallucinations Bad Estimations?	Hude Liu et.al.	2509.21473	null
2025-09-25	Score-based Idempotent Distillation of Diffusion Models	Shehtab Zaman et.al.	2509.21470	null
2025-09-25	Gender Stereotypes in Professional Roles Among Saudis: An Analytical Study of AI-Generated Images Using Language Models	Khaloud S. AlKhalifah et.al.	2509.21466	null
2025-09-25	Viscous Growth Law in Bubble Coarsening: A Molecular Dynamics Perspective	Parameshwaran A et.al.	2509.21457	null
2025-09-25	SD3.5-Flash: Distribution-Guided Distillation of Generative Flows	Hmrishav Bandyopadhyay et.al.	2509.21318	null
2025-09-25	Two ADI compact difference methods for variable-exponent diffusion wave equations	Hao Zhang et.al.	2509.21316	null
2025-09-25	NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics	Yu Yuan et.al.	2509.21309	null
2025-09-25	Einstein@Home Searches for Gamma-ray Pulsars in the Inner Galaxy	C. J. Clark et.al.	2509.21307	null
2025-09-26	Outflow-cloud interaction as the possible origin of the peculiar radio emission in the tidal disruption event AT2018cqh	Lei Yang et.al.	2509.21299	null
2025-09-25	Does FLUX Already Know How to Perform Physically Plausible Image Composition?	Shilin Lu et.al.	2509.21278	null
2025-09-25	Dense Semantic Matching with VGGT Prior	Songlin Yang et.al.	2509.21263	null
2025-09-25	Un-Doubling Diffusion: LLM-guided Disambiguation of Homonym Duplication	Evgeny Kaskov et.al.	2509.21262	null
2025-09-25	Hallucination as an Upper Bound: A New Perspective on Text-to-Image Evaluation	Seyed Amir Kasaei et.al.	2509.21257	null
2025-09-25	Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets	Team Hunyuan3D et.al.	2509.21245	null
2025-09-25	Evaluating the Evaluators: Metrics for Compositional Text-to-Image Generation	Seyed Amir Kasaei et.al.	2509.21227	null
2025-09-25	A Unified Framework for Diffusion Model Unlearning with f-Divergence	Nicola Novello et.al.	2509.21167	null
2025-09-25	DAGDiff: Guiding Dual-Arm Grasp Diffusion to Stable and Collision-Free Grasps	Md Faizal Karim et.al.	2509.21145	null
2025-09-25	The Unwinnable Arms Race of AI Image Detection	Till Aczel et.al.	2509.21135	null
2025-09-25	MotionFlow:Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation	Guojun Lei et.al.	2509.21119	null
2025-09-25	Are Modern Speech Enhancement Systems Vulnerable to Adversarial Attacks?	Rostislav Makarov et.al.	2509.21087	null
2025-09-25	UniTransfer: Video Concept Transfer via Progressive Spatial and Timestep Decomposition	Guojun Lei et.al.	2509.21086	null
2025-09-25	Normalizing Flows are Capable Visuomotor Policy Learning Models	Simon Kristoffersson Lind et.al.	2509.21073	null
2025-09-25	SPREAD: Sampling-based Pareto front Refinement via Efficient Adaptive Diffusion	Sedjro Salomon Hotegni et.al.	2509.21058	null
2025-09-25	Actor-Critic without Actor	Donghyeon Ki et.al.	2509.21022	null
2025-09-25	Graphical Willmore Problems with Low-Regularity Boundary and Dirichlet Data	Boris Gulyak et.al.	2509.21018	null
2025-09-25	Unbiased Parameter Estimation of Partially Observed Diffusions using Diffusion Bridges	Miguel Alvarez et.al.	2509.21015	null
2025-09-25	A Single Neuron Works: Precise Concept Erasure in Text-to-Image Diffusion Models	Qinqin He et.al.	2509.21008	null
2025-09-26	TF-Restormer: Complex Spectral Prediction for Speech Restoration	Ui-Hyeop Shin et.al.	2509.21003	null
2025-09-25	High energy gammas and neutrinos from the Sun, Jupiter and Earth	Pablo de la Torre et.al.	2509.20970	null
2025-09-25	Flow Matching in the Low-Noise Regime: Pathologies and a Contrastive Remedy	Weili Zeng et.al.	2509.20952	null
2025-09-25	SMC-X: A Distributed Scalable Monte Carlo Simulation Method for Chemically Complex Alloys	Xianglin Liu et.al.	2509.20949	null
2025-09-25	Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting	Yanfeng Yang et.al.	2509.20928	null
2025-09-25	SimDiff: Simulator-constrained Diffusion Model for Physically Plausible Motion Generation	Akihisa Watanabe et.al.	2509.20927	null
2025-09-25	Deterministic Discrete Denoising	Hideyuki Suzuki et.al.	2509.20896	null
2025-09-25	AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion	Junyoung Koh et.al.	2509.20891	null
2025-09-25	FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies	Shuqiao Liang et.al.	2509.20890	null
2025-09-25	Holographic Brownian dynamics of a heavy particle in a boosted thermal plasma background	Anirban Roy Chowdhury et.al.	2509.20889	null
2025-09-25	Nuclear Diffusion Models for Low-Rank Background Suppression in Videos	Tristan S. W. Stevens et.al.	2509.20886	null
2025-09-25	Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering	Zhifei Li et.al.	2509.20884	null
2025-09-25	WeFT: Weighted Entropy-driven Fine-Tuning for dLLMs	Guowei Xu et.al.	2509.20863	null
2025-09-25	Causal Time Series Generation via Diffusion Models	Yutong Xia et.al.	2509.20846	null
2025-09-25	Topological Catenation-induced Pore Size in 2D Olympic Network	Wenbo Zhao et.al.	2509.20827	null
2025-09-25	T2I-Diff: fMRI Signal Generation via Time-Frequency Image Transform and Classifier-Free Denoising Diffusion Models	Hwa Hui Tew et.al.	2509.20822	null
2025-09-25	Diffusive Scaling limit of stochastic Box-Ball systems and PushTASEP	David Keating et.al.	2509.20779	null
2025-09-25	CusEnhancer: A Zero-Shot Scene and Controllability Enhancement Method for Photo Customization via ResInversion	Maoye Ren et.al.	2509.20775	null
2025-09-25	Measuring LLM Sensitivity in Transformer-based Tabular Data Synthesis	Maria F. Davila R et.al.	2509.20768	null
2025-09-25	FreeInsert: Personalized Object Insertion with Geometric and Style Control	Yuhong Zhang et.al.	2509.20756	null
2025-09-25	RAPTOR-GEN: RApid PosTeriOR GENerator for Bayesian Learning in Biomanufacturing	Wandi Xu et.al.	2509.20753	null
2025-09-25	Parallel Thinking, Sequential Answering: Bridging NAR and AR for Efficient Reasoning	Qihang Ai et.al.	2509.20744	null
2025-09-25	Quantum Algorithm for Subcellular Multiscale Reaction-Diffusion Systems	Margot Lockwood et.al.	2509.20668	null
2025-09-25	Atomistic Insights into Cu/amorphous-Ta $_x$ N Interfacial Adhesion via Machine Learning Interatomic Potentials: Effects of Stoichiometry and Interface Construction	Jeong Min Choi et.al.	2509.20662	null
2025-09-25	Scaling limit for Brownian motions on the $l$ -level Sierpinski gaskets: The fractal to Euclidean crossover	David A. Croydon et.al.	2509.20657	null
2025-09-25	Stray light in 3D porous nanostructures of single crystalline copper film	Yu-Seong Seo et.al.	2509.20644	null
2025-09-24	FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models	Amin Karimi Monsefi et.al.	2509.20624	null
2025-09-24	MMG: Mutual Information Estimation via the MMSE Gap in Diffusion	Longxuan Yu et.al.	2509.20609	null
2025-09-24	The X-ray Emission of NGC 5005: An Unobscured Low-Luminosity AGN with a Weakly Accreting Broad-Line Region	Anna Trindade Falcão et.al.	2509.20597	null
2025-09-24	von Kármán–Howarth Similarity of Spatial Correlations and the Distribution of Correlation Lengths in Solar Photospheric Turbulence	Rohit Chhiber et.al.	2509.20590	null
2025-09-24	Burning games on strong path products	Sally Ambrose et.al.	2509.20572	null
2025-09-24	PIRF: Physics-Informed Reward Fine-Tuning for Diffusion Models	Mingze Yuan et.al.	2509.20570	null
2025-09-24	A Hierarchical Adaptive Diffusion Model for Flexible Protein-Protein Docking	Rujie Yin et.al.	2509.20542	null
2025-09-24	Pattern Formation in Agent-Based and PDE Models for Evolutionary Games with Payoff-Driven Motion	Tianyong Yao et.al.	2509.20538	null
2025-09-24	InstructVTON: Optimal Auto-Masking and Natural-Language-Guided Interactive Style Control for Inpainting-Based Virtual Try-On	Julien Han et.al.	2509.20524	null
2025-09-24	A Recovery Theory for Diffusion Priors: Deterministic Analysis of the Implicit Prior Algorithm	Oscar Leong et.al.	2509.20511	null
2025-09-24	How two-dimensional are planet-disc interactions? II. Radiation hydrodynamics and suitable cooling prescriptions	Alexandros Ziampras et.al.	2509.20464	null
2025-09-24	On the Hydrodynamic Approximation of Quantum Integrable Models – An Illustration via the repulsive Lieb-Liniger Model	Friedrich Hübner et.al.	2509.20445	null
2025-09-24	pop-cosmos: Star formation over 12 Gyr from generative modelling of a deep infrared-selected galaxy catalogue	Sinan Deger et.al.	2509.20430	null
2025-09-24	Seedream 4.0: Toward Next-generation Multimodal Image Generation	Team Seedream et.al.	2509.20427	null
2025-09-24	Adversarial Defense in Cybersecurity: A Systematic Review of GANs for Threat Detection and Mitigation	Tharcisse Ndayipfukamiye et.al.	2509.20411	null
2025-09-25	EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning	Xuan Ju et.al.	2509.20360	null
2025-09-24	PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation	Chen Wang et.al.	2509.20358	null
2025-09-26	mindmap: Spatial Memory in Deep Feature Maps for 3D Action Policies	Remo Steiner et.al.	2509.20297	null
2025-09-26	FAST: Foreground-aware Diffusion with Accelerated Sampling Trajectory for Segmentation-oriented Anomaly Synthesis	Xichen Xu et.al.	2509.20295	null
2025-09-24	Biologically Plausible Learning via Bidirectional Spike-Based Distillation	Changze Lv et.al.	2509.20284	null
2025-09-24	On Brinkman flows with curvature-induced phase separation in binary mixtures	Pierluigi Colli et.al.	2509.20282	null
2025-09-24	Turing instability and 2-D pattern formation in reaction-diffusion systems derived from kinetic theory	Stefano Boccelli et.al.	2509.20268	null
2025-09-24	Radial Variations in Residence Time Distribution for Pipe Flows	Etienne Boulais et.al.	2509.20256	null
2025-09-24	AnchDrive: Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving	Jinhao Chai et.al.	2509.20253	null
2025-09-24	4D Driving Scene Generation With Stereo Forcing	Hao Lu et.al.	2509.20251	null
2025-09-24	Universal Camouflage Attack on Vision-Language Models for Autonomous Driving	Dehong Kong et.al.	2509.20196	null
2025-09-24	KSDiff: Keyframe-Augmented Speech-Aware Dual-Path Diffusion for Facial Animation	Tianle Lyu et.al.	2509.20128	null
2025-09-24	Experiments on geostrophic convection: the role of the Prandtl number	Hannah M. Clercx et.al.	2509.20126	null
2025-09-24	Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving	Pengxiang Li et.al.	2509.20109	null
2025-09-24	First-Extinction Law for Resampling Processes	Matteo Benati et.al.	2509.20101	null
2025-09-24	Incomplete Data, Complete Dynamics: A Diffusion Approach	Zihan Zhou et.al.	2509.20098	null
2025-09-24	Constrained Higher-Order Binary Optimization for Wireless Communications Systems Using Ising Machines	Gan Zheng et.al.	2509.20092	null
2025-09-24	Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing	Zizheng Yang et.al.	2509.20091	null
2025-09-24	Hierarchy of timescales in a disordered spin- $1/2$ XX ladder	Kadir Çeven et.al.	2509.20078	null
2025-09-25	From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training	Tianqiao Liu et.al.	2509.20072	null
2025-09-24	Resistive switching behaviors in vertically aligned MoS $_2$ films with Cu, Ag, and Au electrodes	Shuei-De Huang et.al.	2509.20061	null
2025-09-24	Discrete Diffusion for Generative Modeling of Text-Aligned Speech Tokens	Pin-Jui Ku et.al.	2509.20060	null
2025-09-25	Diffusion-Augmented Contrastive Learning: A Noise-Robust Encoder for Biosignal Representations	Rami Zewail et.al.	2509.20048	null
2025-09-24	The role of photospheric magnetic flux diffusion in initiation of solar eruptions	Xinkai Bian et.al.	2509.20040	null
2025-09-24	Development of a time calibration system for the KLM upgrade in the Belle II experiment	Ziyu Liu et.al.	2509.20029	null
2025-09-24	Generative Adversarial Networks Applied for Privacy Preservation in Biometric-Based Authentication and Identification	Lubos Mjachky et.al.	2509.20024	null
2025-09-24	CamPVG: Camera-Controlled Panoramic Video Generation with Epipolar-Aware Diffusion	Chenhao Ji et.al.	2509.19979	null
2025-09-24	Learnable Sampler Distillation for Discrete Diffusion Models	Feiyang Fu et.al.	2509.19962	null
2025-09-24	GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes	Guo Chen et.al.	2509.19937	null
2025-09-25	GUIDE: A Diffusion-Based Autonomous Robot Exploration Framework Using Global Graph Inference	Zijun Che et.al.	2509.19916	null
2025-09-24	Dynamically Optimal Unraveling Schemes for Simulating Lindblad Equations	Yu Cao et.al.	2509.19887	null
2025-09-24	Adaptive User Interest Modeling via Conditioned Denoising Diffusion For Click-Through Rate Prediction	Qihang Zhao et.al.	2509.19876	null
2025-09-24	FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models	Xin Wang et.al.	2509.19870	null
2025-09-24	Parameter Estimation for Jump-Diffusion Stochastic Master Equations	Weichao Liang et.al.	2509.19862	null
2025-09-24	Gauge invariance and hyperforce correlation theory for equilibrium fluid mixtures	Joshua Matthes et.al.	2509.19837	null
2025-09-24	Boundary effect on asymptotic behaviour of solution to the hyperbolic-parabolic chemotaxis system	Nangao Zhang et.al.	2509.19828	null
2025-09-24	An Efficient Conditional Score-based Filter for High Dimensional Nonlinear Filtering Problems	Zhijun Zeng et.al.	2509.19816	null
2025-09-25	StrCGAN: A Generative Framework for Stellar Image Restoration	Shantanusinh Parmar et.al.	2509.19805	null
2025-09-24	Colossal Effect of Nanopore Surface Ionic Charge on the Dynamics of Confined Water	Armin Mozhdehei et.al.	2509.19802	null
2025-09-24	On The Cutoff Phenomenon For Dyson-Laguerre Processes	Samuel Chan-Ashing et.al.	2509.19798	null
2025-09-24	Beyond Human Demonstrations: Diffusion-Based Reinforcement Learning to Generate Data for VLA Training	Rushuai Yang et.al.	2509.19752	null
2025-09-24	Talking Head Generation via AU-Guided Landmark Prediction	Shao-Yu Chang et.al.	2509.19749	null
2025-09-24	Controls on the ocean response to idealized Antarctic meltwater input	Rory Basinski-Ferris et.al.	2509.19730	null
2025-09-24	PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction	Yufei Han et.al.	2509.19726	null
2025-09-24	TopoCut: Learning Multi-Step Cutting with Spectral Rewards and Discrete Diffusion Policies	Liquan Wang et.al.	2509.19712	null
2025-09-24	Diffusion and Flow-based Copulas: Forgetting and Remembering Dependencies	David Huk et.al.	2509.19707	null
2025-09-24	Diffusion-Based Impedance Learning for Contact-Rich Manipulation Tasks	Noah Geiger et.al.	2509.19696	null
2025-09-24	From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition	Ling Lo et.al.	2509.19690	null
2025-09-24	Formal Safety Verification and Refinement for Generative Motion Planners via Certified Local Stabilization	Devesh Nath et.al.	2509.19688	null
2025-09-24	Selective Classifier-free Guidance for Zero-shot Text-to-speech	John Zheng et.al.	2509.19668	null
2025-09-24	Long-Range Dependence in Financial Markets: Empirical Evidence and Generative Modeling Challenges	Yifan He et.al.	2509.19663	null
2025-09-24	Statistical Parameter Calibration with the Generalized Fluctuation Dissipation Theorem and Generative Modeling	Ludovico T. Giorgini et.al.	2509.19660	null
2025-09-23	TIMED: Adversarial and Autoregressive Refinement of Diffusion-Based Time Series Generation	MohammadReza EskandariNasab et.al.	2509.19638	null
2025-09-23	Connecting cosmologically decaying dark matter to neutrino physics	Lea Fuß et.al.	2509.19596	null
2025-09-23	Synthesizing Artifact Dataset for Pixel-level Detection	Dennis Menn et.al.	2509.19589	null
2025-09-23	DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions	Zongyue Li et.al.	2509.19538	null
2025-09-23	Real-Time Reinforcement Learning for Dynamic Tasks with a Parallel Soft Robot	James Avtges et.al.	2509.19525	null
2025-09-23	Frame-based Equivariant Diffusion Models for 3D Molecular Generation	Mohan Guo et.al.	2509.19506	null
2025-09-23	Hierarchical null controllability of a degenerate parabolic equation with nonlocal coefficient	Juan Límaco et.al.	2509.19505	null
2025-09-23	Reaction/Diffusion Competition Drives Anomalous Relaxation of Vitrimers	Makayla R. Branham-Ferrari et.al.	2509.19496	null
2025-09-23	ArtiFree: Detecting and Reducing Generative Artifacts in Diffusion-based Speech Enhancement	Bhawana Chhaglani et.al.	2509.19495	null
2025-09-23	Anchored Langevin Algorithms	Mert Gurbuzbalaban et.al.	2509.19455	null
2025-09-23	ROPA: Synthetic Robot Pose Generation for RGB-D Bimanual Data Augmentation	Jason Chen et.al.	2509.19454	null
2025-09-23	Two-moment cosmic ray transport in RAMSES	Joki Rosdahl et.al.	2509.19447	null
2025-09-23	CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching	Chen Chen et.al.	2509.19300	null
2025-09-23	Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation	Sherwin Bahmani et.al.	2509.19296	null
2025-09-23	OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps	Bingnan Li et.al.	2509.19282	null
2025-09-23	A Gradient Flow Approach to Solving Inverse Problems with Latent Diffusion Models	Tim Y. J. Wang et.al.	2509.19276	null
2025-09-23	Reconstruction of a potential parameter in time-fractional diffusion problems via a Kohn–Vogelius type functional: Theoretical aspects	Hamza Kahlaoui et.al.	2509.19260	null
2025-09-23	Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps	Gabriel Maldonado et.al.	2509.19252	null
2025-09-24	Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation	Shufan Li et.al.	2509.19244	null
2025-09-23	Stability and Generalization of Adversarial Diffusion Training	Hesam Hosseini et.al.	2509.19234	null
2025-09-23	Enabling Plant Phenotyping in Weedy Environments using Multi-Modal Imagery via Synthetic and Generated Training Data	Earl Ranario et.al.	2509.19208	null
2025-09-23	Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions	Ioanna Ntinou et.al.	2509.19203	null
2025-09-23	Detachment limited interlayer transport processes during SrTiO3 pulsed laser epitaxy	Jeffrey G. Ulbrandt et.al.	2509.19181	null
2025-09-23	A noise-robust Monte Carlo method for electric field calculations in EMC3	William De Deyn et.al.	2509.19178	null
2025-09-23	2D implementation of Kinetic-diffusion Monte Carlo in Eiron	Oskar Lappi et.al.	2509.19140	null
2025-09-23	FUNCanon: Learning Pose-Aware Action Primitives via Functional Object Canonicalization for Generalizable Robotic Manipulation	Hongli Xu et.al.	2509.19102	null
2025-09-23	World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation	Zhennan Jiang et.al.	2509.19080	null
2025-09-23	Diffusion Bridge Variational Inference for Deep Gaussian Processes	Jian Xu et.al.	2509.19078	null
2025-09-23	WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction	Hung Nguyen et.al.	2509.19073	null
2025-09-23	Dwarf Galaxies in the MATLAS Survey: Hubble Space Telescope Observations of Nuclear Star Clusters	Mélina Poulain et.al.	2509.19068	null
2025-09-23	ManipForce: Force-Guided Policy Learning with Frequency-Aware Representation for Contact-Rich Manipulation	Geonhyup Lee et.al.	2509.19047	null
2025-09-23	Latent Danger Zone: Distilling Unified Attention for Cross-Architecture Black-box Attacks	Yang Li et.al.	2509.19044	null
2025-09-24	Improving Credit Card Fraud Detection through Transformer-Enhanced GAN Oversampling	Kashaf Ul Emaan et.al.	2509.19032	null
2025-09-23	OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment	Teng Xiao et.al.	2509.19018	null
2025-09-23	Pure Vision Language Action (VLA) Models: A Comprehensive Survey	Dapeng Zhang et.al.	2509.19012	null
2025-09-23	Generative data augmentation for biliary tract detection on intraoperative images	Cristina Iacono et.al.	2509.18958	null
2025-09-23	One-shot Embroidery Customization via Contrastive LoRA Modulation	Jun Ma et.al.	2509.18948	null
2025-09-23	Soret and Dufour effects in hot and dense QCD matter	Kamaljeet Singh et.al.	2509.18946	null
2025-09-23	1-bit RIS-aided Index Modulation with Quantum Annealing	Ioannis Krikidis et.al.	2509.18932	null
2025-09-23	Direct Preference Optimization for Speech Autoregressive Diffusion Models	Zhijun Liu et.al.	2509.18928	null
2025-09-23	Diffusive Stochastic Master Equation (SME) with dispersive qubit/cavity coupling	Pierre Rouchon et.al.	2509.18925	null
2025-09-23	LiDAR Point Cloud Image-based Generation Using Denoising Diffusion Probabilistic Models	Amirhesam Aghanouri et.al.	2509.18917	null
2025-09-23	RS3DBench: A Comprehensive Benchmark for 3D Spatial Perception in Remote Sensing	Jiayu Wang et.al.	2509.18897	null
2025-09-23	How special are the dynamics of deep eutectic solvents? A Look at the Prototypical Case of Ethaline	Mohammad Nadim Kamar et.al.	2509.18896	null
2025-09-23	Quantum-to-classical transition and H-theorem in surface diffusion	E. E. Torres-Miyares et.al.	2509.18844	null
2025-09-23	Validation of a Reynolds-averaged numerical simulation environment to simulate high-pressure, auto-igniting hydrogen diffusion flames	N. Diepstraten et.al.	2509.18841	null
2025-09-23	Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters	Pin-Yen Chiu et.al.	2509.18831	null
2025-09-23	Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation	Yanzuo Lu et.al.	2509.18824	null
2025-09-23	Training-Free Data Assimilation with GenCast	Thomas Savary et.al.	2509.18811	null
2025-09-23	Nonlocal degenerate parabolic hyperbolic equations on bounded domains. Part II: Existence	Jørgen Endal et.al.	2509.18797	null
2025-09-23	Towards Application Aligned Synthetic Surgical Image Synthesis	Danush Kumar Venkatesh et.al.	2509.18796	null
2025-09-23	FixingGS: Enhancing 3D Gaussian Splatting via Training-Free Score Distillation	Zhaorui Wang et.al.	2509.18759	null
2025-09-23	Complexity of Activity Patterns in a Bio-Inspired Hopfield-Type Network in Different Topologies	Marco Cafiso et.al.	2509.18758	null
2025-09-23	RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images	Ke Li et.al.	2509.18711	null
2025-09-23	AGSwap: Overcoming Category Boundaries in Object Fusion via Adaptive Group Swapping	Zedong Zhang et.al.	2509.18699	null
2025-09-23	FlowCrypt: Flow-Based Lightweight Encryption with Near-Lossless Recovery for Cloud Photo Privacy	Xiaohui Yang et.al.	2509.18696	null
2025-09-23	Advances in Large Language Models for Medicine	Zhiyu Kan et.al.	2509.18690	null
2025-09-23	Query-Centric Diffusion Policy for Generalizable Robotic Assembly	Ziyi Xu et.al.	2509.18686	null
2025-09-23	3D Flow Diffusion Policy: Visuomotor Policy Learning via Generating Flow in 3D Space	Sangjun Noh et.al.	2509.18676	null
2025-09-23	Global Existence of Solutions for A Class of Nonlocal Reaction-Diffusion Systems and Their Diffusive Limit	Md Shah Alam et.al.	2509.18645	null
2025-09-23	Well-posedness of the Electron MHD with random diffusion	Ruimeng Hu et.al.	2509.18640	null
2025-09-23	Understanding-in-Generation: Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation	Yuanhuiyi Lyu et.al.	2509.18639	null
2025-09-23	Prompt-Guided Dual Latent Steering for Inversion Problems	Yichen Wu et.al.	2509.18619	null
2025-09-23	Flow marching for a generative PDE foundation model	Zituo Chen et.al.	2509.18611	null
2025-09-23	SynSonic: Augmenting Sound Event Detection through Text-to-Audio Diffusion ControlNet and Effective Sample Filtering	Jiarui Hai et.al.	2509.18603	null
2025-09-23	Training-Free Multi-Style Fusion Through Reference-Based Adaptive Modulation	Xu Liu et.al.	2509.18602	null
2025-09-23	SSCM: A Spatial-Semantic Consistent Model for Multi-Contrast MRI Super-Resolution	Xiaoman Wu et.al.	2509.18593	null
2025-09-23	Kernel Variational Inference Flow for Nonlinear Filtering Problem	Weiye Gan et.al.	2509.18589	null
2025-09-23	DS-Diffusion: Data Style-Guided Diffusion Model for Time-Series Generation	Mingchun Sun et.al.	2509.18584	null
2025-09-23	Active Ornstein-Uhlenbeck particle under stochastic resetting	Uma Shankari et.al.	2509.18515	null
2025-09-23	Source-Free Domain Adaptive Semantic Segmentation of Remote Sensing Images with Diffusion-Guided Label Enrichment	Wenjie Liu et.al.	2509.18502	null
2025-09-23	Differentiable Light Transport with Gaussian Surfels via Adapted Radiosity for Efficient Relighting and Geometry Reconstruction	Kaiwen Jiang et.al.	2509.18497	null
2025-09-23	An Advection-Difusion Model Incorporating Investor Inertia for the Dynamics of Financial Asset Prices	Diego et.al.	2509.18488	null
2025-09-22	Discrete-time diffusion-like models for speech synthesis	Xiaozhou Tan et.al.	2509.18470	null
2025-09-22	Zero-Shot Visual Deepfake Detection: Can AI Predict and Prevent Fake Content Before It’s Created?	Ayan Sar et.al.	2509.18461	null
2025-09-22	Learning Geometry-Aware Nonprehensile Pushing and Pulling with Dexterous Hands	Yunshuang Li et.al.	2509.18455	null
2025-09-22	Diffusion Policies with Offline and Inverse Reinforcement Learning for Promoting Physical Activity in Older Adults Using Wearable Sensors	Chang Liu et.al.	2509.18433	null
2025-09-22	Measurement Score-Based MRI Reconstruction with Automatic Coil Sensitivity Estimation	Tingjun Liu et.al.	2509.18402	null
2025-09-22	Efficient Particle Acceleration in 2.5-Dimensional, Hybrid-Kinetic Simulations of Decaying, Supersonic, Plasma Turbulence	Keyan Gootkin et.al.	2509.18374	null
2025-09-22	Galactic Center Gamma-Ray Emission in MHD Galaxy Formation Simulations with Full Cosmic Ray Spectra	Isabel S. Sands et.al.	2509.18351	null
2025-09-22	Bootstrapping transport in the Drude-Kadanoff-Martin model	Subham Dutta Chowdhury et.al.	2509.18255	null
2025-09-22	Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers	Chaehyun Kim et.al.	2509.18096	null
2025-09-22	ComposeMe: Attribute-Specific Image Prompts for Controllable Human Image Generation	Guocheng Gordon Qian et.al.	2509.18092	null
2025-09-22	RnGCam: High-speed video from rolling & global shutter measurements	Kevin Tandi et.al.	2509.18087	null
2025-09-22	Spiffy: Multiplying Diffusion LLM Acceleration via Lossless Speculative Decoding	Sudhanshu Agrawal et.al.	2509.18085	null
2025-09-22	RadarSFD: Single-Frame Diffusion with Pretrained Priors for Radar Point Clouds	Bin Zhao et.al.	2509.18068	null
2025-09-22	Introduction to the relative Langlands program	Raphaël Beuzart-Plessis et.al.	2509.18062	null
2025-09-22	Density convergence on Markov diffusion chaos via Stein’s method	Thanh Dang et.al.	2509.18045	null
2025-09-22	Prepare Before You Act: Learning From Humans to Rearrange Initial States	Yinlong Dai et.al.	2509.18043	null
2025-09-22	Microsecond-Pulsed Nanocalorimetry: A Scalable Approach for Ultrasensitive Heat Capacity Measurements	Hugo Gómez-Torres et.al.	2509.18019	null
2025-09-23	StableGuard: Towards Unified Copyright Protection and Tamper Localization in Latent Diffusion Models	Haoxin Yang et.al.	2509.17993	null
2025-09-22	VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models	Geonung Kim et.al.	2509.17985	null
2025-09-22	Cosmic inventory of the background fields of relativistic particles in the Universe	Jonathan Biteau et.al.	2509.17954	null
2025-09-22	ComposableNav: Instruction-Following Navigation in Dynamic Environments via Composable Diffusion	Zichao Hu et.al.	2509.17941	null
2025-09-22	MEF: A Systematic Evaluation Framework for Text-to-Image Models	Xiaojing Dong et.al.	2509.17907	null
2025-09-23	Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark	Siu Hang Ho et.al.	2509.17894	null
2025-09-22	Invariance of finite-dimensional realisations of Heath-Jarrow-Morton models under diffusion estimation	Andreas Celary et.al.	2509.17875	null
2025-09-22	SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model	Xiao Zhou et.al.	2509.17850	null
2025-09-22	Semantic and Visual Crop-Guided Diffusion Models for Heterogeneous Tissue Synthesis in Histopathology	Saghir Alfasly et.al.	2509.17847	null
2025-09-22	The origin of the intra-cluster light in The Three Hundred simulations	A. Contreras-Santos et.al.	2509.17831	null
2025-09-22	Folding-unfolding transition of active polymer on the reconfiguration of bidirectional tangential active force	Arindam Panda et.al.	2509.17824	null
2025-09-22	ContextFlow: Training-Free Video Object Editing via Adaptive Context Enrichment	Yiyang Chen et.al.	2509.17818	null
2025-09-22	Solving time-fractional diffusion equations with Robin boundary conditions via fractional Hamiltonian boundary value methods	Qian Luo et.al.	2509.17793	null
2025-09-22	Elucidating the Design Space of FP4 training	Robert Hu et.al.	2509.17791	null
2025-09-22	Conditional Diffusion Models for CT Image Synthesis from CBCT: A Systematic Review	Alzahra Altalib et.al.	2509.17790	null
2025-09-22	I2VWM: Robust Watermarking for Image to Video Generation	Guanjie Wang et.al.	2509.17773	null
2025-09-22	Qwen3-Omni Technical Report	Jin Xu et.al.	2509.17765	null
2025-09-22	Multi-Agent Amodal Completion: Direct Synthesis with Fine-Grained Semantic Guidance	Hongxing Fan et.al.	2509.17757	null
2025-09-22	GAN-Based Multi-Microphone Spatial Target Speaker Extraction	Shrishti Saha Shetu et.al.	2509.17741	null
2025-09-22	Non-equilibrium state during proton-deuteron exchange at a liquid-liquid interface	Tillmann Buttersack et.al.	2509.17724	null
2025-09-22	DINOv3-Diffusion Policy: Self-Supervised Large Visual Model for Visuomotor Diffusion Policy Learning	ThankGod Egbe et.al.	2509.17684	null
2025-09-23	Clothing agnostic Pre-inpainting Virtual Try-ON	Sehyun Kim et.al.	2509.17654	null
2025-09-22	SISMA: Semantic Face Image Synthesis with Mamba	Filippo Botti et.al.	2509.17651	null
2025-09-22	VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video	Yu Liu et.al.	2509.17647	null
2025-09-22	OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models	Jinshu Chen et.al.	2509.17627	null
2025-09-22	Audio Super-Resolution with Latent Bridge Models	Chang Li et.al.	2509.17609	null
2025-09-22	Measurements and scaling of X-ray total scattering from single crystals	S. Gorfman et.al.	2509.17605	null
2025-09-22	Conditioning in Generative Quantum Denoising Diffusion Models	Daniel Quinn et.al.	2509.17569	null
2025-09-22	Robust spectral preconditioning for high-Péclet number convection-diffusion	Lukas Holbach et.al.	2509.17531	null
2025-09-22	Stable Video-Driven Portraits	Mallikarjun B. R. et.al.	2509.17476	null
2025-09-22	CARINOX: Inference-time Scaling with Category-Aware Reward-based Initial Noise Optimization and Exploration	Seyed Amir Kasaei et.al.	2509.17458	null
2025-09-22	Learning Dexterous Manipulation with Quantized Hand State	Ying Feng et.al.	2509.17450	null
2025-09-22	Exploring Machine Learning Models for Physical Dose Calculation in Carbon Ion Therapy Using Heterogeneous Imaging Data - A Proof of Concept Study	Miriam Schwarze et.al.	2509.17433	null
2025-09-22	Single-Image Depth from Defocus with Coded Aperture and Diffusion Posterior Sampling	Hodaka Kawachi et.al.	2509.17427	null
2025-09-22	Diff-GNSS: Diffusion-based Pseudorange Error Estimation	Jiaqi Zhu et.al.	2509.17397	null
2025-09-22	The Asymptotic Analysis of Some PDE and Steklov Eigenvalue Problems with Partially Reactive Patches in 3-D	Denis S. Grebenkov et.al.	2509.17394	null
2025-09-22	Magnetically Enhanced Thermoelectric Effect Driven by Martensitic Transformation in the Weak Itinerant Ferromagnet Co $_2$ NbSn	Takumi Kihara et.al.	2509.17378	null
2025-09-22	Volume Density Mapper: 3D Density Reconstruction Algorithm for Molecular Clouds	Guang-Xing Li et.al.	2509.17369	null
2025-09-22	SeqUDA-Rec: Sequential User Behavior Enhanced Recommendation via Global Unsupervised Data Augmentation for Personalized Content Marketing	Ruihan Luo et.al.	2509.17361	null
2025-09-22	DiffQ: Unified Parameter Initialization for Variational Quantum Algorithms via Diffusion Models	Chi Zhang et.al.	2509.17324	null
2025-09-22	GraphWeave: Interpretable and Robust Graph Generation via Random Walk Trajectories	Rahul Nandakumar et.al.	2509.17291	null
2025-09-21	Graph Signal Generative Diffusion Models	Yigit Berkay Uslu et.al.	2509.17250	null
2025-09-21	Scalable Multi Agent Diffusion Policies for Coverage Control	Frederic Vatnsdal et.al.	2509.17244	null
2025-09-21	DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction	Bo Liu et.al.	2509.17232	null
2025-09-21	Virtual Consistency for Audio Editing	Matthieu Cervera et.al.	2509.17219	null
2025-09-21	Guided and Unguided Conditional Diffusion Mechanisms for Structured and Semantically-Aware 3D Point Cloud Generation	Gunner Stone et.al.	2509.17206	null
2025-09-21	Conditional Policy Generator for Dynamic Constraint Satisfaction and Optimization	Wook Lee et.al.	2509.17205	null
2025-09-21	Echo-Path: Pathology-Conditioned Echo Video Generation	Kabir Hamzah Muhammad et.al.	2509.17190	null
2025-09-21	Towards a unified turbulence model through multi-objective learning	Zhuo-Ran Liu et.al.	2509.17189	null
2025-09-21	Ambiguous Medical Image Segmentation Using Diffusion Schrödinger Bridge	Lalith Bharadwaj Baru et.al.	2509.17187	null
2025-09-21	SynergyNet: Fusing Generative Priors and State-Space Models for Facial Beauty Prediction	Djamel Eddine Boukhari et.al.	2509.17172	null
2025-09-21	Criticality of a stochastic modern Hopfield network model with exponential interaction function	Marco Cafiso et.al.	2509.17152	null
2025-09-21	Stencil: Subject-Driven Generation with Context Guidance	Gordon Chen et.al.	2509.17120	null
2025-09-21	ScenGAN: Attention-Intensive Generative Model for Uncertainty-Aware Renewable Scenario Forecasting	Yifei Wu et.al.	2509.17119	null
2025-09-21	$\texttt{DiffSyn}$ : A Generative Diffusion Approach to Materials Synthesis Planning	Elton Pan et.al.	2509.17094	null
2025-09-21	AlignedGen: Aligning Style Across Generated Images	Jiexuan Zhang et.al.	2509.17088	null
2025-09-21	CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving	Ruiguo Zhong et.al.	2509.17080	null
2025-09-21	Global classical solutions to a two-dimensional chemotaxis-fluid system involving signal-dependent degenerate diffusion	Yansheng Ma et.al.	2509.17073	null
2025-09-21	Intention-aware Hierarchical Diffusion Model for Long-term Trajectory Anomaly Detection	Chen Wang et.al.	2509.17068	null
2025-09-21	Geodesic Prototype Matching via Diffusion Maps for Interpretable Fine-Grained Recognition	Junhao Jia et.al.	2509.17050	null
2025-09-21	Boundary Feller-Dynkin processes associated with Laguerre processes and Pickrell diffusions	Alexander I. Bufetov et.al.	2509.17045	null
2025-09-21	When Color-Space Decoupling Meets Diffusion for Adverse-Weather Image Restoration	Wenxuan Fang et.al.	2509.17024	null
2025-09-21	Multiscale solution decomposition of nonlocal-in-time problems with application in numerical computation	Mengmeng Liu et.al.	2509.17020	null
2025-09-21	DocIQ: A Benchmark Dataset and Feature Fusion Network for Document Image Quality Assessment	Zhichao Ma et.al.	2509.17012	null
2025-09-21	Generalized Momenta-Based Koopman Formalism for Robust Control of Euler-Lagrangian Systems	Rajpal Singh et.al.	2509.17010	null
2025-09-21	Radiation Mediated Shock and Planar Shock Breakout in the Presence of Atomic Transition Lines	Jonathan Morag et.al.	2509.16996	null
2025-09-21	VCE: Safe Autoregressive Image Generation via Visual Contrast Exploitation	Feng Han et.al.	2509.16986	null
2025-09-21	Ledrappier-Young entropy formula for $C^1$ diffeomorphisms with dominated splitting Part 1: Unstable entropy formula and invariance principle	Shaobo Gan et.al.	2509.16981	null
2025-09-21	Penalizing Boundary Activation for Object Completeness in Diffusion Models	Haoyang Xu et.al.	2509.16968	null
2025-09-21	SemanticGarment: Semantic-Controlled Generation and Editing of 3D Gaussian Garments	Ruiyan Wang et.al.	2509.16960	null
2025-09-21	VidCLearn: A Continual Learning Approach for Text-to-Video Generation	Luca Zanchetta et.al.	2509.16956	null
2025-09-21	Machine learning meets Singular Optics II: Single-pixel Detection of Structured Light	Purnesh Singh Badavath et.al.	2509.16946	null
2025-09-21	Discrete Heat Kernels on Simplicial Complexes and Its Application to Functional Brain Networks	Sixtus Dakurah et.al.	2509.16908	null
2025-09-21	PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion	Xuewan He et.al.	2509.16897	null
2025-09-21	A Mutil-conditional Diffusion Transformer for Versatile Seismic Wave Generation	Longfei Duan et.al.	2509.16874	null
2025-09-21	$\mathtt{M^3VIR}$ : A Large-Scale Multi-Modality Multi-View Synthesized Benchmark Dataset for Image Restoration and Content Creation	Yuanzhi Li et.al.	2509.16873	null
2025-09-21	HOGraspFlow: Exploring Vision-based Generative Grasp Synthesis with Hand-Object Priors and Taxonomy Awareness	Yitian Shi et.al.	2509.16871	null
2025-09-21	PhysHDR: When Lighting Meets Materials and Scene Geometry in HDR Reconstruction	Hrishav Bakul Barua et.al.	2509.16869	null
2025-09-20	DoubleGen: Debiased Generative Modeling of Counterfactuals	Alex Luedtke et.al.	2509.16842	null
2025-09-20	Factorizing Diffusion Policies for Observation Modality Prioritization	Omkar Patil et.al.	2509.16830	null
2025-09-20	DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images	Ozgur Kara et.al.	2509.16767	null
2025-09-20	Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees	Yuchen Liang et.al.	2509.16756	null
2025-09-20	HyPlaneHead: Rethinking Tri-plane-like Representations in Full-Head Image Synthesis	Heyuan Li et.al.	2509.16748	null
2025-09-20	Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment	Xin Lei Lin et.al.	2509.16727	null
2025-09-20	Animalbooth: multimodal feature enhancement for animal subject personalization	Chen Liu et.al.	2509.16702	null
2025-09-20	InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention	Qiang Xiang et.al.	2509.16691	null
2025-09-20	Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation	Yue Ma et.al.	2509.16630	null
2025-09-20	Investigation of the Axe-shaped Radio Galaxy J1051+5523 with uGMRT	Sudheesh T. P. et.al.	2509.16624	null
2025-09-20	Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing	Mengqi Wang et.al.	2509.16622	null
2025-09-20	An Octave-based Multi-Resolution CQT Architecture for Diffusion-based Audio Generation	Maurício do V. M. da Costa et.al.	2509.16603	null
2025-09-20	FakeChain: Exposing Shallow Cues in Multi-Step Deepfake Detection	Minji Heo et.al.	2509.16602	null
2025-09-19	MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer	Yanghao Li et.al.	2509.16197	null
2025-09-19	AcT2I: Evaluating and Improving Action Depiction in Text-to-Image Models	Vatsal Malaviya et.al.	2509.16141	null
2025-09-19	Dynamic Classifier-Free Diffusion Guidance via Online Feedback	Pinelopi Papalampidi et.al.	2509.16131	null
2025-09-19	DiffusionNFT: Online Diffusion Reinforcement with Forward Process	Kaiwen Zheng et.al.	2509.16117	null
2025-09-19	KRED: Korea Research Economic Database for Macroeconomic Research	Changryong Baek et.al.	2509.16115	null
2025-09-19	PRISM: Probabilistic and Robust Inverse Solver with Measurement-Conditioned Diffusion Prior for Blind Inverse Problems	Yuanyun Hu et.al.	2509.16106	null
2025-09-19	Blind-Spot Guided Diffusion for Self-supervised Real-World Denoising	Shen Cheng et.al.	2509.16091	null
2025-09-19	Generating Detailed Character Motion from Blocking Poses	Purvi Goel et.al.	2509.16064	null
2025-09-19	Latent Conditioned Loco-Manipulation Using Motion Priors	Maciej Stępień et.al.	2509.16061	null
2025-09-19	Compose by Focus: Scene Graph-based Atomic Skills	Han Qi et.al.	2509.16053	null
2025-09-19	A Note on the formulation of the Neumann boundary condition for a nonlocal problem	Antonio Luiz Pereira et.al.	2509.16041	null
2025-09-19	SLaM-DiMM: Shared Latent Modeling for Diffusion Based Missing Modality Synthesis in MRI	Bhavesh Sandbhor et.al.	2509.16019	null
2025-09-19	DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching	Meng Yang et.al.	2509.16017	null
2025-09-19	Going with the Flow: Solving for Symmetry-Driven PDE dynamics with Physics-informed Neural Networks	Michail Kavousanakis et.al.	2509.15963	null
2025-09-19	Structured Information for Improving Spatial Relationships in Text-to-Image Generation	Sander Schildermans et.al.	2509.15962	null
2025-09-19	Optimal Experimental Design of a Moving Sensor for Linear Bayesian Inverse Problems	Nicole Aretz et.al.	2509.15961	null
2025-09-19	Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement	Gang Yang et.al.	2509.15952	null
2025-09-19	UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation	Mingdong Wu et.al.	2509.15934	null
2025-09-19	Bayesian Physics Informed Neural Networks for Reliable Transformer Prognostics	Ibai Ramirez et.al.	2509.15933	null
2025-09-19	Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search	Zhiyu Mou et.al.	2509.15927	null
2025-09-19	An optimal-control framework for reaction diffusion systems with application to synthetic developmental biology	Mohamed Amine Ouchdiri et.al.	2509.15889	null
2025-09-19	A Multidimensional Self-Adaptive Numerical Simulation Framework for Semiconductor Boltzmann Transport Equation	Zeyu Zhang et.al.	2509.15879	null
2025-09-19	SAGE: Semantic-Aware Shared Sampling for Efficient Diffusion	Haoran Zhao et.al.	2509.15865	null
2025-09-19	Observation of the Galactic Center in the Sub-MeV Gamma-Ray Band with an Electron-Tracking Compton Camera	Tomonori Ikeda et.al.	2509.15851	null
2025-09-19	Turing Patterns in a Morphogenetic Model with Single Regulatory Function	Mohamed Amine Ouchdiri et.al.	2509.15829	null
2025-09-19	QWD-GAN: Quality-aware Wavelet-driven GAN for Unsupervised Medical Microscopy Images Denoising	Qijun Yang et.al.	2509.15814	null
2025-09-19	Polynomial approximation from diffused data: unisolvence and stability	Ludovico Bruni Bruno et.al.	2509.15813	null
2025-09-19	CIDER: A Causal Cure for Brand-Obsessed Text-to-Image Models	Fangjian Shen et.al.	2509.15803	null
2025-09-19	Monte Carlo Tree Diffusion with Multiple Experts for Protein Design	Xuefeng Liu et.al.	2509.15796	null
2025-09-19	Absence of Radio Emission Reveals an Exceptionally Weak Explosion of the Putative Historical Supernova Pa 30	Yi-xuan Shao et.al.	2509.15792	null
2025-09-19	Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation	Weimin Bai et.al.	2509.15772	null
2025-09-19	Learning to Optimize Capacity Planning in Semiconductor Manufacturing	Philipp Andelfinger et.al.	2509.15767	null
2025-09-19	Utility-based Privacy Preserving Data Mining	Qingfeng Zhou et.al.	2509.15755	null
2025-09-19	Discovering Top-k Periodic and High-Utility Patterns	Qingfeng Zhou et.al.	2509.15732	null
2025-09-19	Search for cosmic-ray induced gamma-ray emission from local galaxy clusters using Fermi-LAT data	Judit Pérez-Romero et.al.	2509.15720	null
2025-09-19	Imagination at Inference: Synthesizing In-Hand Views for Robust Visuomotor Policy Inference	Haoran Ding et.al.	2509.15717	null
2025-09-19	Weak Error Estimates of Ergodic Approximations for Monotone Jump-diffusion SODEs	Zhihui Liu et.al.	2509.15698	null
2025-09-19	Bose’s Probabilistic Interactions, Einstein’s Objections, and Their Legacy in Quantum Optics and Stochastic Mechanics	Partha Ghose et.al.	2509.15686	null
2025-09-19	Spontaneous stochasticity in the Armstrong-Vicol passive scalar	Wandrille Ruffenach et.al.	2509.15683	null
2025-09-19	Layout Stroke Imitation: A Layout Guided Handwriting Stroke Generation for Style Imitation with Diffusion Model	Sidra Hanif et.al.	2509.15678	null
2025-09-19	Diffusion of gravitactic chiral active Brownian particles in an asymmetric channel	Narender Khatri et.al.	2509.15630	null
2025-09-19	MAGENTA: Magnitude and Geometry-ENhanced Training Approach for Robust Long-Tailed Sound Event Localization and Detection	Jun-Wei Yeow et.al.	2509.15599	null
2025-09-19	Global Existence of Solutions of Nonlocal Geirer-Meinhardt Model and Effect of Nonlocal Operator in Pattern Formation	Md Shah Alam et.al.	2509.15598	null
2025-09-19	Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification	Zinan Lin et.al.	2509.15591	null
2025-09-19	Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification	Tian Lan et.al.	2509.15553	null
2025-09-19	PolyJuice Makes It Real: Black-Box, Universal Red Teaming for Synthetic Image Detectors	Sepehr Dehdashtian et.al.	2509.15551	null
2025-09-19	Global Existence and Boundedness of Gray-Scott Model with Local and Nonlocal Diffusion	Md Shah Alam et.al.	2509.15535	null
2025-09-19	Lynx: Towards High-Fidelity Personalized Video Generation	Shen Sang et.al.	2509.15496	null
2025-09-18	Full Quantum Stack: Ket Platform	Evandro Rosa et.al.	2509.15484	null
2025-09-18	OpenViGA: Video Generation for Automotive Driving Scenes by Streamlining and Fine-Tuning Open Source Models with Public Data	Björn Möller et.al.	2509.15479	null
2025-09-18	Efficient Multimodal Dataset Distillation via Generative Models	Zhenghao Zhao et.al.	2509.15472	null
2025-09-18	$ν$ SpaceSim: A Comprehensive Simulation Package for Modeling the Measurement of Cosmic Neutrinos using the Earth as the Neutrino Target and Space-based Detectors	Mary Hall Reno et.al.	2509.15469	null
2025-09-18	SERVAL: Surprisingly Effective Zero-Shot Visual Document Retrieval Powered by Large Vision and Language Models	Thong Nguyen et.al.	2509.15432	null
2025-09-18	Random Matrix Theory-guided sparse PCA for single-cell RNA-seq data	Victor Chardès et.al.	2509.15429	null
2025-09-18	Thin-film boundary-layer diffusion of non-equilibrium flow to kinetically limited reactive surfaces via Damköhler thermochemistry tables	Jeffrey D. Engerer et.al.	2509.15427	null
2025-09-18	Spectral Characterization of Wave Scattering at a Granular-Elastic Solid Interface: From Hyperbolic Wave Propagation to Near-Parabolic Diffusion	Joshua R. Tempelman et.al.	2509.15415	null
2025-09-18	Causal Fingerprints of AI Generative Models	Hui Xu et.al.	2509.15406	null
2025-09-18	Caught in the Cosmic Web: Evidence for Ram-Pressure Stripping of a Low-Mass Galaxy by the Cosmic Web	Nicholas Luber et.al.	2509.15405	null
2025-09-18	RaceGAN: A Framework for Preserving Individuality while Converting Racial Information for Image-to-Image Translation	Mst Tasnim Pervin et.al.	2509.15391	null
2025-09-18	MaskAttn-SDXL: Controllable Region-Level Text-To-Image Generation	Yu Chang et.al.	2509.15357	null
2025-09-18	LowDiff: Efficient Diffusion Sampling with Low-Resolution Condition	Jiuyi Xu et.al.	2509.15342	null
2025-09-18	WALLABY Pilot Survey: A gas-rich diffuse dwarf on the baryonic Tully Fisher relation	Rebecca Dudley et.al.	2509.15340	null
2025-09-18	Kuramoto Orientation Diffusion Models	Yue Song et.al.	2509.15328	null
2025-09-18	Anisotropic Cosmic Ray Transport resulting from Magnetic Mirroring and Resonant Curvature Scattering	Jeremiah Lübke et.al.	2509.15320	null
2025-09-18	PRISM: Phase-enhanced Radial-based Image Signature Mapping framework for fingerprinting AI-generated images	Emanuele Ricco et.al.	2509.15270	null
2025-09-18	Autoguided Online Data Curation for Diffusion Model Training	Valeria Pais et.al.	2509.15267	null
2025-09-18	Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model	Fangjinhua Wang et.al.	2509.15220	null
2025-09-18	RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation	Yuming Jiang et.al.	2509.15212	null
2025-09-18	Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning	Yeongbin Seo et.al.	2509.15188	null
2025-09-18	Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation	Xiaoyu Yue et.al.	2509.15185	null
2025-09-18	Conditional Prior-based Non-stationary Channel Estimation Using Accelerated Diffusion Models	Muhammad Ahmed Mohsin et.al.	2509.15182	null
2025-09-18	A Race Bias Free Face Aging Model for Reliable Kinship Verification	Ali Nazari et.al.	2509.15177	null
2025-09-18	Unveiling TeV halos among unidentified extended TeV sources	Michela Rigoselli et.al.	2509.15168	null
2025-09-18	AnoF-Diff: One-Step Diffusion-Based Anomaly Detection for Forceful Tool Use	Yating Lin et.al.	2509.15153	null
2025-09-18	WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance	Chenxi Song et.al.	2509.15130	null
2025-09-18	Learning Mechanistic Subtypes of Neurodegeneration with a Physics-Informed Variational Autoencoder Mixture Model	Sanduni Pinnawala et.al.	2509.15124	null
2025-09-18	LOFAR 58 MHz Legacy Survey of the 3CRR Catalog	J. M. Boxelaar et.al.	2509.15115	null
2025-09-18	Real-Time Streaming Mel Vocoding with Generative Flow Matching	Simon Welker et.al.	2509.15085	null
2025-09-18	Forecasting and Visualizing Air Quality from Sky Images with Vision-Language Models	Mohammad Saleh Vahdatpour et.al.	2509.15076	null
2025-09-19	Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue	Xingyao Lin et.al.	2509.15061	null
2025-09-18	How long does it take an Elephant Random Walk to forget its training	Zheng Fang et.al.	2509.15049	null
2025-09-18	AutoEdit: Automatic Hyperparameter Tuning for Image Editing	Chau Pham et.al.	2509.15031	null
2025-09-19	Sea-ing Through Scattered Rays: Revisiting the Image Formation Model for Realistic Underwater Image Generation	Vasiliki Ismiroglou et.al.	2509.15011	null
2025-09-19	SPATIALGEN: Layout-guided 3D Indoor Scene Generation	Chuan Fang et.al.	2509.14981	null
2025-09-18	M4Diffuser: Multi-View Diffusion Policy with Manipulability-Aware Control for Robust Mobile Manipulation	Ju Dong et.al.	2509.14980	null
2025-09-19	Stochastic Hamiltonian Type Jump Diffusion Systems with Countable Regimes: Strong Feller Property and Exponential Ergodicity	Fubao Xi et.al.	2509.14951	null
2025-09-18	A Novel Task-Driven Diffusion-Based Policy with Affordance Learning for Generalizable Manipulation of Articulated Objects	Hao Zhang et.al.	2509.14939	null
2025-09-18	Mitigating data replication in text-to-audio generative diffusion models through anti-memorization guidance	Francisco Messina et.al.	2509.14934	null
2025-09-18	Back to Ear: Perceptually Driven High Fidelity Music Reconstruction	Kangdi Wang et.al.	2509.14912	null
2025-09-18	Finite Volumes for a dissipative free boundary problem	Clément Cancès et.al.	2509.14908	null
2025-09-18	Constraining gamma-ray burst parameters with the first ultra-high energy neutrino event KM3-230213A	KM3NeT Collaboration et.al.	2509.14895	null
2025-09-18	NeRF-based Visualization of 3D Cues Supporting Data-Driven Spacecraft Pose Estimation	Antoine Legrand et.al.	2509.14890	null
2025-09-18	CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human	Nan Sun et.al.	2509.14889	null
2025-09-18	Controllable Localized Face Anonymization Via Diffusion Inpainting	Ali Salar et.al.	2509.14866	null
2025-09-19	MeanFlowSE: one-step generative speech enhancement via conditional mean flow	Duojia Li et.al.	2509.14858	null
2025-09-18	A class of flexible and efficient partitioned Runge-Kutta-Chebyshev methods for some time-dependent partial differential equations	Xiao Tang et.al.	2509.14847	null
2025-09-18	[Re] Improving Interpretation Faithfulness for Vision Transformers	Izabela Kurek et.al.	2509.14846	null
2025-09-18	Diffusion-Based Scenario Tree Generation for Multivariate Time Series Prediction and Multistage Stochastic Optimization	Stelios Zarifis et.al.	2509.14832	null
2025-09-18	Spectral survey of the diffuse gas toward BL Lac in the Q band	Maryvonne Gerin et.al.	2509.14822	null
2025-09-18	Acoustic Simulation Framework for Multi-channel Replay Speech Detection	Michael Neri et.al.	2509.14789	null
2025-09-18	MELA-TTS: Joint transformer-diffusion model with representation alignment for speech synthesis	Keyu An et.al.	2509.14784	null
2025-09-18	Radiology Report Conditional 3D CT Generation with Multi Encoder Latent diffusion Model	Sina Amirrajab et.al.	2509.14780	null
2025-09-18	Dataset Distillation for Super-Resolution without Class Labels and Pre-trained Models	Sunwoo Cho et.al.	2509.14777	null
2025-09-18	Diffuse emission from stochastic sources	Anton Stall et.al.	2509.14776	null
2025-09-18	UMind: A Unified Multitask Network for Zero-Shot M/EEG Visual Decoding	Chengjian Xu et.al.	2509.14772	null
2025-09-18	Hydrodynamic Attraction and Hindered Diffusion Govern First-passage Times of Swimming Microorganisms	Yanis Baouche et.al.	2509.14765	null
2025-09-18	Data Augmentation via Latent Diffusion Models for Detecting Smell-Related Objects in Historical Artworks	Ahmed Sheta et.al.	2509.14755	null
2025-09-18	Chain-of-Thought Re-ranking for Image Retrieval Tasks	Shangrong Wu et.al.	2509.14746	null
2025-09-18	UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets	Pengyu Wang et.al.	2509.14738	null
2025-09-18	Towards Pre-trained Graph Condensation via Optimal Transport	Yeyu Yan et.al.	2509.14722	null
2025-09-18	DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images	Kazuma Nagata et.al.	2509.14685	null
2025-09-18	Enhancing Situational Awareness in Wearable Audio Devices Using a Lightweight Sound Event Localization and Detection System	Jun-Wei Yeow et.al.	2509.14650	null
2025-09-18	On the algebraic stretching dynamics of variable-density mixing in shock-bubble interaction	Xu Han et.al.	2509.14607	null
2025-09-18	DICE: Diffusion Consensus Equilibrium for Sparse-view CT Reconstruction	Leon Suarez-Rodriguez et.al.	2509.14566	null
2025-09-18	DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising	Li Gao et.al.	2509.14565	null
2025-09-18	Adaptive and Iterative Point Cloud Denoising with Score-Based Diffusion Model	Zhaonan Wang et.al.	2509.14560	null
2025-09-18	Radiolunadiff: Estimation of wireless network signal strength in lunar terrain	Paolo Torrado et.al.	2509.14559	null
2025-09-18	Event-LAB: Towards Standardized Evaluation of Neuromorphic Localization Methods	Adam D. Hines et.al.	2509.14516	null
2025-09-18	A Time-Inconsistent Stochastic Optimal Control Problem in an Infinite Time Horizon	Qingmeng Wei et.al.	2509.14495	null
2025-09-17	Error analysis of a fully discrete structure-preserving finite element scheme for a diffuse-interface model of tumour growth	Agus L. Soenjaya et.al.	2509.14486	null
2025-09-17	AToken: A Unified Tokenizer for Vision	Jiasen Lu et.al.	2509.14476	null
2025-09-17	Keywords are not always the key: A metadata field analysis for natural language search on open data portals	Lisa-Yao Gan et.al.	2509.14457	null
2025-09-17	On the equivalence and optimality of transformations of diffusive systems	Davide Gabrielli et.al.	2509.14450	null
2025-09-17	Diffusion-Based Unsupervised Audio-Visual Speech Separation in Noisy Environments with Noise Prior	Yochai Yemini et.al.	2509.14379	null
2025-09-17	Electricity in international comparison – Future technologies in power generation	Axel Kleidon et.al.	2509.14365	null
2025-09-17	DreamControl: Human-Inspired Whole-Body Humanoid Control for Scene Interaction via Guided Diffusion	Dvij Kalaria et.al.	2509.14353	null
2025-09-17	Enhanced Radio Emission Between a Galaxy Cluster Pair	Andrea Botteon et.al.	2509.14348	null
2025-09-17	Dichotomy in Long-Lived Radio Emission from Tidal Disruption Events AT 2020zso and AT 2021sdu: Multi-Component Outflows vs. Host Contamination	Collin T. Christy et.al.	2509.14317	null
2025-09-17	FlowDrive: Energy Flow Field for End-to-End Autonomous Driving	Hao Jiang et.al.	2509.14303	null
2025-09-17	D4PM: A Dual-branch Driven Denoising Diffusion Probabilistic Model with Joint Posterior Diffusion Sampling for EEG Artifacts Removal	Feixue Shao et.al.	2509.14302	null
2025-09-17	SpeechOp: Inference-Time Task Composition for Generative Speech Processing	Justin Lovelace et.al.	2509.14298	null
2025-09-17	GenExam: A Multidisciplinary Text-to-Image Exam	Zhaokai Wang et.al.	2509.14232	null
2025-09-17	Defending Diffusion Models Against Membership Inference Attacks via Higher-Order Langevin Dynamics	Benjamin Sterling et.al.	2509.14225	null
2025-09-17	Looking into the faintEst WIth MUSE (LEWIS): Exploring the nature of ultra-diffuse galaxies in the Hydra-I cluster IV. A study of the Globular Cluster population in four UDGs	Marco Mirabile et.al.	2509.14206	null
2025-09-17	Mass Transport, Turbulent Mixing, and Inflow in Black Hole Accretion	George N. Wong et.al.	2509.14202	null
2025-09-16	\textsc{Gen2Real}: Towards Demo-Free Dexterous Manipulation by Harnessing Generated Video	Kai Ye et.al.	2509.14178	null
2025-09-17	Reaction-diffusion models of invasive tree pest spread: quantifying the spread of oak processionary moth in the UK	Jamie P. McKeown et.al.	2509.14166	null
2025-09-17	Quantum Reinforcement Learning-Guided Diffusion Model for Image Synthesis via Hybrid Quantum-Classical Generative Model Architectures	Chi-Sheng Chen et.al.	2509.14163	null
2025-09-17	MIMIC-D: Multi-modal Imitation for MultI-agent Coordination with Decentralized Diffusion Policies	Dayi Dong et.al.	2509.14159	null
2025-09-17	An Exploratory Study on Abstract Images and Visual Representations Learned from Them	Haotian Li et.al.	2509.14149	null
2025-09-17	FlightDiffusion: Revolutionising Autonomous Drone Training with Diffusion Models Generating FPV Video	Valerii Serpiva et.al.	2509.14082	null
2025-09-17	Dissipativity-Based Data-Driven Decentralized Control of Interconnected Systems	Taiki Nakano et.al.	2509.14047	null
2025-09-17	Cross-diffusion limits in multispecies kinetic models	Ansgar Jüngel et.al.	2509.14046	null
2025-09-17	A Pearl in the Shell: an ultra-compact dwarf within the tidal debris surrounding spiral galaxy NGC 7531	David Martínez-Delgado et.al.	2509.14038	null
2025-09-17	Improving cosmological reach of a gravitational wave observatory using Deep Loop Shaping	Jonas Buchli et.al.	2509.14016	null
2025-09-17	RFM-Editing: Rectified Flow Matching for Text-guided Audio Editing	Liting Gao et.al.	2509.14003	null
2025-09-17	Reconstruction of strong degeneracy region for parabolic equations and systems	Piermarco Cannarsa et.al.	2509.13962	null
2025-09-17	Noise-Level Diffusion Guidance: Well Begun is Half Done	Harvey Mannering et.al.	2509.13936	null
2025-09-17	Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification	Wenkui Yang et.al.	2509.13922	null
2025-09-17	Recovering the Coupled Treatment of Redshift-Space Distortions and the Lightcone Effect after Diffuse Foreground Removal	Jennifer Feron et.al.	2509.13920	null
2025-09-17	Inverse Design of Amorphous Materials with Targeted Properties	Jonas A. Finkler et.al.	2509.13916	null
2025-09-17	Using Deep Learning Methods to Detect for Ultra-diffuse Galaxies in KiDS	Hao Su et.al.	2509.13910	null
2025-09-17	A Tight Quantum Algorithm for Multiple Collision Search	Xavier Bonnetain et.al.	2509.13909	null
2025-09-17	PhysicalAgent: Towards General Cognitive Robotics with Foundation World Models	Artem Lykov et.al.	2509.13903	null
2025-09-17	Masked Diffusion Models as Energy Minimization	Sitong Chen et.al.	2509.13866	null
2025-09-17	EDITS: Enhancing Dataset Distillation with Implicit Textual Semantics	Qianxin Xia et.al.	2509.13858	null
2025-09-17	Surfing on chemical waves: a simple yet dynamically rich two-sphere responsive gel swimmer	Joseph J. Webber et.al.	2509.13850	null
2025-09-17	SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation	Jiayi Pan et.al.	2509.13848	null
2025-09-17	Polycyclic aromatic hydrocarbons destruction in star-forming regions across 42 nearby galaxies	Oleg V. Egorov et.al.	2509.13845	null
2025-09-18	BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching	Hanshuai Cui et.al.	2509.13789	null
2025-09-17	Generative Image Coding with Diffusion Prior	Jianhui Chang et.al.	2509.13768	null
2025-09-17	Iterative Prompt Refinement for Safer Text-to-Image Generation	Jinwoo Jeon et.al.	2509.13760	null
2025-09-17	Controllable-Continuous Color Editing in Diffusion Model via Color Mapping	Yuqi Yang et.al.	2509.13756	null
2025-09-17	Cross-modal Full-mode Fine-grained Alignment for Text-to-Image Person Retrieval	Hao Yin et.al.	2509.13754	null
2025-09-17	Heavy Traffic Diffusion Limit for a Closed Queueing Network with Single-Server and Infinite-Server Stations	Amir A. Alwan et.al.	2509.13748	null
2025-09-17	Ion-modulated structure, proton transfer, and capacitance in the Pt(111)/water electric double layer	Xiaoyu Wang et.al.	2509.13727	null
2025-09-17	StyleProtect: Safeguarding Artistic Identity in Fine-tuned Diffusion Models	Qiuyu Tang et.al.	2509.13711	null
2025-09-17	LLM-I: LLMs are Naturally Interleaved Multimodal Creators	Zirun Guo et.al.	2509.13642	null
2025-09-17	Generative Consistency Models for Estimation of Kinetic Parametric Image Posteriors in Total-Body PET	Yun Zhao et.al.	2509.13614	null
2025-09-16	Cross-Distribution Diffusion Priors-Driven Iterative Reconstruction for Sparse-View CT	Haodong Li et.al.	2509.13576	null
2025-09-16	ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors	Romain Hardy et.al.	2509.13525	null
2025-09-16	AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions	Väinö Hatanpää et.al.	2509.13523	null
2025-09-16	DEFT-VTON: Efficient Virtual Try-On with Consistent Generalised H-Transform	Xingzi Xu et.al.	2509.13506	null
2025-09-16	BiasMap: Leveraging Cross-Attentions to Discover and Mitigate Hidden Social Biases in Text-to-Image Generation	Rajatsubhra Chakraborty et.al.	2509.13496	null
2025-09-16	The effect of parameter drift in the transport of magnetized plasma particles	P. Haerter et.al.	2509.13472	null
2025-09-18	Unified Spatiotemporal Physics-Informed Learning (USPIL): A Framework for Modeling Complex Predator-Prey Dynamics	Julian Evan Chrisnanto et.al.	2509.13425	null
2025-09-16	Modeling Cosmological Evolution of Jetted Seyfert Galaxies for z<10	Julianne Goddard et.al.	2509.13418	null
2025-09-16	SOFIA Polarization Spectrum of Three Star-Forming Clouds	Erin G. Cox et.al.	2509.13416	null
2025-09-16	EdiVal-Agent: An Object-Centric Framework for Automated, Scalable, Fine-Grained Evaluation of Multi-Turn Editing	Tianyu Chen et.al.	2509.13399	null
2025-09-16	Valuation of Exotic Options and Counterparty Games Based on Conditional Diffusion	Helin Zhao et.al.	2509.13374	null
2025-09-16	Runaway electron interactions with whistler waves in tokamak plasmas: energy-dependent transport scaling	Yashika Ghai et.al.	2509.13271	null
2025-09-16	Beyond Private or Public: Large Language Models as Quasi-Public Goods in the AI Economy	Yukun Zhang et.al.	2509.13265	null
2025-09-16	Geometry, Energy and Sensitivity in Stochastic Proton Dynamics	Veronika Chronholm et.al.	2509.13223	null
2025-09-17	The Gamma Expansion of the Level Two Large Deviation Rate Functional for Reversible Diffusion Processes	Claudio Landim et.al.	2509.13222	null
2025-09-18	End4: End-to-end Denoising Diffusion for Diffusion-Based Inpainting Detection	Fei Wang et.al.	2509.13214	null
2025-09-16	Global existence and decay of small solutions in a viscous half Klein-Gordon equation	Louis Garénaux et.al.	2509.13188	null
2025-09-16	PDE-Based Bayesian Hierarchical Modeling for Event Spread, with Application to COVID-19 Infection	Mengqi Cen et.al.	2509.13174	null
2025-09-17	TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving	Jiawei Wang et.al.	2509.13164	null
2025-09-16	Enhancing Video Large Language Models with Structured Multi-Video Collaborative Reasoning (early version)	Zhihao He et.al.	2509.13161	null
2025-09-16	MSDNet: Efficient 4D Radar Super-Resolution via Multi-Stage Distillation	Minqing Huang et.al.	2509.13149	null
2025-09-16	Discovering Mathematical Equations with Diffusion Language Model	Xiaoxu Han et.al.	2509.13136	null
2025-09-16	Quantifying CO2 Distribution at the Air-Water Interface – Spatiotemporally Resolved Measurements Using Tunable Diode Laser Spectroscopy	Dongfang Zhao et.al.	2509.13113	null
2025-09-16	Quantitative 3D Morphology of Cellular H2/O2/N2 Flames on a Porous-Plug Burner: Spatially Resolved Measurements of Temperature and OH Radical	Zeyu Yan et.al.	2509.13106	null
2025-09-16	MIA-EPT: Membership Inference Attack via Error Prediction for Tabular Data	Eyal German et.al.	2509.13046	null
2025-09-16	ReTrack: Data Unlearning in Diffusion Models through Redirecting the Denoising Trajectory	Qitan Shi et.al.	2509.13007	null
2025-09-16	Difference-Based Recovery for Modulo Sampling: Tightened Bounds and Robustness Guarantees	Wenyi Yan et.al.	2509.12971	null
2025-09-16	Cosmic dust as a prerequisite for the formation of complex organic molecules in space?	Alexey Potapov et.al.	2509.12967	null
2025-09-16	Mathematical Study of Reaction-Diffusion in Congested Crowd Motion	Noureddine Igbida et.al.	2509.12935	null
2025-09-16	The Anatomy of Alignment: Decomposing Preference Optimization by Steering Sparse Features	Jeremias Ferrao et.al.	2509.12934	null
2025-09-16	Non-parametric estimation of non-linear diffusion coefficient in parabolic SPDEs	Martin Andersson et.al.	2509.12921	null
2025-09-16	Neural Network Localized Orthogonal Decomposition for Numerical Homogenization of Diffusion Operators with Random Coefficients	Fabian Kröpfl et.al.	2509.12896	null
2025-09-16	Runge-Kutta Approximation and Decoupled Attention for Rectified Flow Inversion and Semantic Editing	Weiming Chen et.al.	2509.12888	null
2025-09-16	Few to Big: Prototype Expansion Network via Diffusion Learner for Point Cloud Few-shot Semantic Segmentation	Qianguang Zhao et.al.	2509.12878	null
2025-09-16	Bayesian Signal Separation via Plug-and-Play Diffusion-Within-Gibbs Sampling	Yi Zhang et.al.	2509.12857	null
2025-09-16	Benchmarking thermostat algorithms in molecular dynamics simulations of a binary Lennard-Jones glass-former model	Kumpei Shiraishi et.al.	2509.12837	null
2025-09-16	Pressure dependent structure of neat liquid methanol, CH3OH: molecular dynamics simulations with various united atom type potentials	Imre Bakó et.al.	2509.12834	null
2025-09-16	A Lightweight Pipeline for Noisy Speech Voice Cloning and Accurate Lip Sync Synthesis	Javeria Amir et.al.	2509.12831	null
2025-09-17	DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval	Zechao Liu et.al.	2509.12824	null
2025-09-16	A Pressure-Based Diffusion Model for Influence Maximization on Social Networks	Curt Stutsman et.al.	2509.12822	null
2025-09-16	A Statistical Benchmark for Diffusion Posterior Sampling Algorithms	Martin Zach et.al.	2509.12821	null
2025-09-16	Double Helix Diffusion for Cross-Domain Anomaly Image Generation	Linchun Wu et.al.	2509.12787	null
2025-09-18	A-TDOM: Active TDOM via On-the-Fly 3DGS	Yiwei Xu et.al.	2509.12759	null
2025-09-16	What Makes a Good Generated Image? Investigating Human and Multimodal LLM Image Preference Alignment	Rishab Parthasarathy et.al.	2509.12750	null
2025-09-16	$L^2$ -solutions to stochastic reaction-diffusion equations with superlinear drifts driven by space-time white noise^	Shijie Shang et.al.	2509.12744	null
2025-09-16	Generalizable Holographic Reconstruction via Amplitude-Only Diffusion Priors	Jeongsol Kim et.al.	2509.12728	null
2025-09-16	SPGen: Spherical Projection as Consistent and Flexible Representation for Single Image 3D Shape Generation	Jingdong Zhang et.al.	2509.12721	null
2025-09-16	Joint AoI and Handover Optimization in Space-Air-Ground Integrated Network	Zifan Lang et.al.	2509.12716	null
2025-09-16	AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models	Heng Zhang et.al.	2509.12715	null
2025-09-16	Morphological and Chemical Changes in Cd-free Colloidal QD-LEDs During Operation	Ruiqi Zhang et.al.	2509.12597	null
2025-09-16	Anomalous statistics in the Langevin equation with fluctuating diffusivity: from Brownian yet non-Gaussian diffusion to anomalous diffusion and ergodicity breaking	Takuma Akimoto et.al.	2509.12571	null
2025-09-16	Adaptive Sampling Scheduler	Qi Wang et.al.	2509.12569	null
2025-09-16	Thermal Transport of GaN/Substrate Heterostructures under Non-Uniform Heat Source	Ershuai Yin et.al.	2509.12548	null
2025-09-16	Topological Phononic Crystal on the Scale of Quasi-Ballistic Phonon Transport	Keita Funayama et.al.	2509.12528	null
2025-09-15	Context-Aware Language Models for Forecasting Market Impact from Sequences of Financial News	Ross Koval et.al.	2509.12519	null
2025-09-15	Image Tokenizer Needs Post-Training	Kai Qiu et.al.	2509.12474	null
2025-09-15	Effects of temporal variations on wave speeds of bistable traveling waves for Lotka-Volterra competition systems	Weiwei Ding et.al.	2509.12472	null
2025-09-15	PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization	Dawei Xiang et.al.	2509.12446	null
2025-09-15	Diffusion-Based Generation and Imputation of Driving Scenarios from Limited Vehicle CAN Data	Julian Ripper et.al.	2509.12375	null
2025-09-15	Brown Dwarf Formation Through Gravitational Collapse: Insights From 3D Numerical Simulations	Adnan Ali Ahmad et.al.	2509.12336	null
2025-09-15	Radial Oscillations of Viscous Neutron Stars: Zero Diffusion Case	Raissa F. P. Mendes et.al.	2509.12330	null
2025-09-15	LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence	Zixin Yin et.al.	2509.12203	null
2025-09-15	OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling	Yang Zhou et.al.	2509.12201	null
2025-09-15	Homogeneous soil moisture fields suppress Sahelian MCS frequency	Ben Maybee et.al.	2509.12118	null
2025-09-15	Predicting Structural Relaxation in Supercooled Small Molecules via Molecular Dynamics Simulations and Microscopic Theory	Anh D. Phan et.al.	2509.12092	null
2025-09-15	Progressive Flow-inspired Unfolding for Spectral Compressive Imaging	Xiaodong Wang et.al.	2509.12079	null
2025-09-15	AvatarSync: Rethinking Talking-Head Animation through Autoregressive Perspective	Yuchen Deng et.al.	2509.12052	null
2025-09-15	Layout-Conditioned Autoregressive Text-to-Image Generation via Structured Masking	Zirui Zheng et.al.	2509.12046	null
2025-09-15	Robust Concept Erasure in Diffusion Models: A Theoretical Perspective on Security and Robustness	Zixuan Fu et.al.	2509.12024	null
2025-09-15	A shortcut through the macroscopic fluctuation theory: a generalised Fick law	Théotim Berlioz et.al.	2509.12017	null
2025-09-15	Data-driven Smile Design: Personalized Dental Aesthetics Outcomes Using Deep Learning	Marcus Lin et.al.	2509.12001	null
2025-09-15	Optimization for Massive 3D-RIS Deployment: A Generative Diffusion Model-Based Approach	Kaining Wang et.al.	2509.11969	null
2025-09-15	Learning to Generate 4D LiDAR Sequences	Ao Liang et.al.	2509.11959	null
2025-09-15	Adaptive least-squares space-time finite element methods for convection-diffusion problems	Christian Köthe et.al.	2509.11955	null
2025-09-15	Sphere-GAN: a GAN-based Approach for Saliency Estimation in 360° Videos	Mahmoud Z. A. Wahba et.al.	2509.11948	null
2025-09-15	The Filter Echo: A General Tool for Filter Visualisation	Daniel Gaa et.al.	2509.11932	null
2025-09-15	VH-Diffuser: Variable Horizon Diffusion Planner for Time-Aware Goal-Conditioned Trajectory Planning	Ruijia Liu et.al.	2509.11930	null
2025-09-15	A thermodynamically consistent model for bulk-surface viscous fluid mixtures: Model derivation and mathematical analysis	Patrik Knopf et.al.	2509.11925	null
2025-09-15	A nonlinear model for long-range segregation	Howen Chuah et.al.	2509.11912	null
2025-09-15	Enhanced Cosmic-Ray Cooling in AGN from Dark Matter Deep Inelastic Scattering	Linjie Li et.al.	2509.11906	null
2025-09-15	Bayesian recalibration of flux scale factors in diffuse radio maps using low-resolution absolute radiometers	Ainulnabilah Nasirudin et.al.	2509.11894	null
2025-09-15	Numerical analysis of fluid estimation for source terms in neutral particles simulation	Zhirui Tang et.al.	2509.11883	null
2025-09-15	Do It Yourself (DIY): Modifying Images for Poems in a Zero-Shot Setting Using Weighted Prompt Manipulation	Sofia Jamil et.al.	2509.11878	null
2025-09-15	Wasserstein error estimates between telegraph processes and Brownian motion	Gerardo Barrera et.al.	2509.11871	null
2025-09-15	Tenma: Robust Cross-Embodiment Robot Manipulation with Diffusion Transformer	Travis Davies et.al.	2509.11865	null
2025-09-15	Understanding variations of galactic energetic particles in the heliosphere: modelling and radiation hazard assessment	Miguel Orcinha et.al.	2509.11837	null
2025-09-15	Rough stochastic filtering	Fabio Bugini et.al.	2509.11825	null
2025-09-15	Stochastic restarting with multiple restart conditions	Johannes Aspman et.al.	2509.11809	null
2025-09-15	Modes of Mechanical Guidance of Adhesion-Independent Cell Migration	Hanna Luise Gertack et.al.	2509.11801	null
2025-09-15	Dense gas properties and star formation in M 82	Fei Li et.al.	2509.11770	null
2025-09-15	Igniting VLMs toward the Embodied Space	Andy Zhai et.al.	2509.11766	null
2025-09-17	Removal Attack and Defense on AI-generated Content Latent-based Watermarking	De Zhang Lee et.al.	2509.11745	null
2025-09-15	DRAG: Data Reconstruction Attack using Guided Diffusion	Wa-Kin Lei et.al.	2509.11724	null
2025-09-15	Controlled growth of polar altermagnets via chemical vapor transport	Hiraka Haruhiro et.al.	2509.11716	null
2025-09-15	Lie symmetry analysis and similarity reductions for the tempered-fractional Keller Segel system	Ghorbanali Haghighatdoost et.al.	2509.11690	null
2025-09-15	DTGen: Generative Diffusion-Based Few-Shot Data Augmentation for Fine-Grained Dirty Tableware Recognition	Lifei Hao et.al.	2509.11661	null
2025-09-15	IS-Diff: Improving Diffusion-Based Inpainting with Better Initial Seed	Yongzhe Lyu et.al.	2509.11638	null
2025-09-15	SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching	Jiacheng Liu et.al.	2509.11628	null
2025-09-15	Inference-stage Adaptation-projection Strategy Adapts Diffusion Policy to Cross-manipulators Scenarios	Xiangtong Yao et.al.	2509.11621	null
2025-09-15	A Phase Field Formulation of Frictional Sliding Contact for 3D Fully Eulerian Fluid Structure Interactions	Biswajeet Rath et.al.	2509.11611	null
2025-09-15	Scaling to Multimodal and Multichannel Heart Sound Classification: Fine-Tuning Wav2Vec 2.0 with Synthetic and Augmented Biosignals	Milan Marocchi et.al.	2509.11606	null
2025-09-15	MVQA-68K: A Multi-dimensional and Causally-annotated Dataset with Quality Interpretability for Video Assessment	Yanyun Pu et.al.	2509.11589	null
2025-09-15	Reconstructing High-fidelity Plasma Turbulence with Data-driven Tuning of Diffusion in Low Resolution Grids	Kunpeng Li et.al.	2509.11576	null
2025-09-15	The Dynamics of the Profit Rate in an Extended Okishio Framework	Jihyuan Liuh et.al.	2509.11538	null
2025-09-15	Learning Majority-to-Minority Transformations with MMD and Triplet Loss for Imbalanced Classification	Suman Cha et.al.	2509.11511	null
2025-09-15	Collective Recourse for Generative Urban Visualizations	Rashid Mushkani et.al.	2509.11487	null
2025-09-14	Improving LLMs’ Learning for Coreference Resolution	Yujian Gan et.al.	2509.11466	null
2025-09-14	Diffusion of $^{210}\text{Pb}$ and $^{210}\text{Po}$ in Nylon	P. Adhikari et.al.	2509.11464	null
2025-09-14	Fast Percolation Centrality Approximation with Importance Sampling	Antonio Cruciani et.al.	2509.11454	null
2025-09-14	Mechanisms of isotope exchange between aqueous solutions and barite in low-temperature geochemical systems	Chen Zhu et.al.	2509.11428	null
2025-09-14	IGA-LBM: Isogeometric lattice Boltzmann method	Ye Ji et.al.	2509.11427	null
2025-09-14	Solving ill-conditioned polynomial equations using score-based priors with application to multi-target detection	Rafi Beinhorn et.al.	2509.11397	null
2025-09-14	ActivePose: Active 6D Object Pose Estimation and Tracking for Robotic Manipulation	Sheng Liu et.al.	2509.11364	null
2025-09-14	On the Escaping Efficiency of Distributed Adversarial Training Algorithms	Ying Cao et.al.	2509.11337	null
2025-09-14	PINGS: Physics-Informed Neural Network for Fast Generative Sampling	Achmad Ardani Prasha et.al.	2509.11284	null
2025-09-14	VideoAgent: Personalized Synthesis of Scientific Videos	Xiao Liang et.al.	2509.11253	null
2025-09-14	Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation	Chengze li et.al.	2509.11252	null
2025-09-14	Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation	Yufei Tang et.al.	2509.11213	null
2025-09-14	StegOT: Trade-offs in Steganography via Optimal Transport	Chengde Lin et.al.	2509.11178	null
2025-09-14	Cryptanalysis and design for a family of plaintext non-delayed chaotic ciphers	Qianxue Wang et.al.	2509.11158	null
2025-09-14	Entropic active particle transport in pulsating 3D geometries	Rahul Sinha et.al.	2509.11147	null
2025-09-14	Neural cellular automata: applications to biology and beyond classical AI	Benedikt Hartl et.al.	2509.11131	null
2025-09-14	Filling the Gaps: A Multitask Hybrid Multiscale Generative Framework for Missing Modality in Remote Sensing Semantic Segmentation	Nhi Kieu et.al.	2509.11102	null
2025-09-14	PanoLora: Bridging Perspective and Panoramic Video Generation with LoRA Adaptation	Zeyu Dong et.al.	2509.11092	null
2025-09-14	An Advanced Convolutional Neural Network for Bearing Fault Diagnosis under Limited Data	Shengke Sun et.al.	2509.11053	null
2025-09-14	Data-Efficient Ensemble Weather Forecasting with Diffusion Models	Kevin Valencia et.al.	2509.11047	null
2025-09-13	General Decentralized Stochastic Optimal Control via Change of Measure: Applications to the Witsenhausen Counterexample	Bhagyashri Telsang et.al.	2509.11013	null
2025-09-13	Approximation in an optimal design problem governed by the heat equation	Kei Matsushima et.al.	2509.11011	null
2025-09-13	TrueSkin: Towards Fair and Accurate Skin Tone Recognition and Generation	Haoming Lu et.al.	2509.10980	null
2025-09-13	Development and Analysis of Chien-Physics-Informed Neural Networks for Singular Perturbation Problems	Gautam Singh et.al.	2509.10945	null
2025-09-13	ToMA: Token Merge with Attention for Image Generation with Diffusion Models	Wenbo Lu et.al.	2509.10918	null
2025-09-13	Robustifying Diffusion-Denoised Smoothing Against Covariate Shift	Ali Hedayatnia et.al.	2509.10913	null
2025-09-13	Real-Time Super-Resolution Imaging System Based on Zero-Shot Learning for Infrared Non-Destructive Testing	Pengfei Zhu et.al.	2509.10902	null
2025-09-13	Thermal diffusivity characterization of impacted composites using evaporative cryocooling excitation and inverse physics-informed neural networks	Pengfei Zhu et.al.	2509.10898	null
2025-09-13	A novel IR-SRGAN assisted super-resolution evaluation of photothermal coherence tomography for impact damage in toughened thermoplastic CFRP laminates under room temperature and low temperature	Pengfei Zhu et.al.	2509.10894	null
2025-09-13	Text2Sign Diffusion: A Generative Approach for Gloss-Free Sign Language Production	Liqian Feng et.al.	2509.10845	null
2025-09-13	Orbit-based structural decomposition and stellar population recovery for edge-on barred galaxies	Yunpeng Jin et.al.	2509.10832	null
2025-09-13	Multi-Task Diffusion Approach For Prediction of Glioma Tumor Progression	Aghiles Kebaili et.al.	2509.10824	null
2025-09-13	Hybrid Atomic Norm Sparse/Diffuse Channel Estimation	Lei Lyu et.al.	2509.10770	null
2025-09-12	Using Drift Diffusion Model to Analyze Cars’ Lane Change Decisions behind Heavy Vehicles	Nachuan Li et.al.	2509.10733	null
2025-09-12	The Rapid Arrival of Josiah Willard Gibbs’s Elementary Principles in Statistical Mechanics in European University Libraries	Hector Giacomini et.al.	2509.10732	null
2025-09-12	Simultaneous determination of wave speed, diffusivity and nonlinearity in the Westervelt equation using complex time-periodic solutions	Sebastian Acosta et.al.	2509.10718	null
2025-09-12	Maestro: Self-Improving Text-to-Image Generation via Agent Orchestration	Xingchen Wan et.al.	2509.10704	null
2025-09-12	Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation	Hao Zhang et.al.	2509.10687	null
2025-09-12	T2Bs: Text-to-Character Blendshapes via Video Generation	Jiahao Luo et.al.	2509.10678	null
2025-09-12	Parallel and perpendicular diffusion of energetic particles in the near-Sun solar wind observed by Parker Solar Probe	Nibuna Siranjeevi Madam Subashchandar et.al.	2509.10648	null
2025-09-12	Generalized Time-Reversal for Pulse Control in Diffusive Media	Rohin E. McIntosh et.al.	2509.10646	null
2025-09-12	Radiation GRMHD Models of Accretion onto Stellar-Mass Black Holes: II. Super-Eddington Accretion	Lizhong Zhang et.al.	2509.10638	null
2025-09-12	InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis	Tao Han et.al.	2509.10441	null
2025-09-12	Inpainting-Guided Policy Optimization for Diffusion Large Language Models	Siyan Zhao et.al.	2509.10396	null
2025-09-12	Immunizing Images from Text to Image Editing via Adversarial Cross-Attention	Matteo Trippodo et.al.	2509.10359	null
2025-09-12	GARD: Gamma-based Anatomical Restoration and Denoising for Retinal OCT	Botond Fazekas et.al.	2509.10341	null
2025-09-12	Compute Only 16 Tokens in One Timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching	Zhixin Zheng et.al.	2509.10312	null
2025-09-12	Morphogenetic mechanical metamaterials: Emerging tensor properties from self-organized structures	Thomas Fromentèze et.al.	2509.10277	null
2025-09-12	MagicMirror: A Large-Scale Dataset and Benchmark for Fine-Grained Artifacts Assessment in Text-to-Image Generation	Jia Wang et.al.	2509.10260	null
2025-09-12	Mask Consistency Regularization in Object Removal	Hua Yuan et.al.	2509.10259	null
2025-09-12	Computational modeling of diffusive dynamics in a bouncer system with an irregular surface	Luiz Antonio Barreiro et.al.	2509.10253	null
2025-09-12	Phase Transitions for Elephant Random Walks with Two memory Channels	Krishanu Maulik et.al.	2509.10225	null
2025-09-12	Ionospheric Electron Heat Flow Modulates Planetary Ambipolar Electric Fields	Liangliang Yuan et.al.	2509.10218	null
2025-09-12	Subordinators and time-space fractional diffusion equations	Mohamed Majdoub et.al.	2509.10203	null
2025-09-12	P3D: Scalable Neural Surrogates for High-Resolution 3D Physics Simulations with Global Context	Benjamin Holzschuh et.al.	2509.10186	null
2025-09-12	Convergence to equilibrium for fully discretizations of nonlocal Cahn-Hilliard equation	Danni Zhang et.al.	2509.10180	null
2025-09-12	The unified gas kinetic wave-particle method for the neutron transport equation	Guangwei Liu et.al.	2509.10178	null
2025-09-12	Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization	Yifan Chang et.al.	2509.10140	null
2025-09-12	Turing patterns on adaptive networks	Marie Dorchain et.al.	2509.10124	null
2025-09-12	Realism Control One-step Diffusion for Real-World Image Super-Resolution	Zongliang Wu et.al.	2509.10122	null
2025-09-12	Intrinsic disorder in the candidate quantum spin ice Pr $_2$Zr$_2$O$_7$	T. J. Hicken et.al.	2509.10101	null
2025-09-12	HHI-Assist: A Dataset and Benchmark of Human-Human Interaction in Physical Assistance Scenario	Saeed Saadatnejad et.al.	2509.10096	null
2025-09-12	Color Me Correctly: Bridging Perceptual Color Spaces and Text Embeddings for Improved Diffusion Generation	Sung-Lin Tsai et.al.	2509.10058	null
2025-09-12	Approximate Graph Propagation Revisited: Dynamic Parameterized Queries, Tighter Bounds and Dynamic Updates	Zhuowei Zhao et.al.	2509.10036	null
2025-09-12	Effects of harmonic magnetic field boundary conditions in mean-field solar dynamo	V. V. Pipin et.al.	2509.09985	null
2025-09-12	Normalized solutions to a Choquard equation involving mixed local and nonlocal operators	J. Giacomoni et.al.	2509.09968	null
2025-09-12	Limited Reference, Reliable Generation: A Two-Component Framework for Tabular Data Generation in Low-Data Regimes	Mingxuan Jiang et.al.	2509.09960	null
2025-09-12	Chord: Chain of Rendering Decomposition for PBR Material Estimation from Generated Texture Images	Zhi Ying et.al.	2509.09952	null
2025-09-12	Acoustic Scene Classification Using CNN-GRU Model Without Knowledge Distillation	Ee-Leng Tan et.al.	2509.09931	null
2025-09-12	A streamline upwind/Petrov-Galerkin method for the magnetic advection-diffusion problem	Haochen Li et.al.	2509.09913	null
2025-09-11	Accelerating 3D Photoacoustic Computed Tomography with End-to-End Physics-Aware Neural Operators	Jiayun Wang et.al.	2509.09894	null
2025-09-11	PeV particle acceleration and non-thermal emission in the `minimalist’ model of the extended jets in W50/SS433	A. M. Bykov et.al.	2509.09883	null
2025-09-11	Automated Tuning for Diffusion Inverse Problem Solvers without Generative Prior Retraining	Yaşar Utku Alçalar et.al.	2509.09880	null
2025-09-11	Privacy-Preserving Automated Rosacea Detection Based on Medically Inspired Region of Interest Selection	Chengyu Yang et.al.	2509.09844	null
2025-09-11	A risk-sensitive ergodic singular stochastic control problem	Justin Gwee et.al.	2509.09835	null
2025-09-11	DiTReducio: A Training-Free Acceleration for DiT-Based TTS via Progressive Calibration	Yanru Huo et.al.	2509.09748	null
2025-09-11	FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark	Rongyao Fang et.al.	2509.09680	null
2025-09-11	Locality in Image Diffusion Models Emerges from Data Statistics	Artem Lukoianov et.al.	2509.09672	null
2025-09-11	Geometric Neural Distance Fields for Learning Human Motion Priors	Zhengdi Yu et.al.	2509.09667	null
2025-09-12	DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech	Ngoc-Son Nguyen et.al.	2509.09631	null
2025-09-11	I Know Who Clones Your Code: Interpretable Smart Contract Similarity Detection	Zhenguang Liu et.al.	2509.09630	null
2025-09-11	Mechanistic Learning with Guided Diffusion Models to Predict Spatio-Temporal Brain Tumor Growth	Daria Laslo et.al.	2509.09610	null
2025-09-11	Constraints on Ultra-heavy DM from TeV-PeV gamma-ray diffuse measurements	Manuel Rocamora et.al.	2509.09609	null
2025-09-11	Iterative energy reduction Galerkin methods and variational adaptivity	Pascal Heid et.al.	2509.09600	null
2025-09-11	Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis	Yikang Ding et.al.	2509.09595	null
2025-09-11	Exactly Solvable Model of Random Walks with Stochastic Exchange	José Julian Díaz-Pérez et.al.	2509.09577	null
2025-09-11	Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders	Dohun Lee et.al.	2509.09547	null
2025-09-11	Generative Diffusion Contrastive Network for Multi-View Clustering	Jian Zhu et.al.	2509.09527	null
2025-09-11	Mapping of discrete range modulated proton radiograph to water-equivalent path length using machine learning	Atiq Ur Rahman et.al.	2509.09514	null
2025-09-11	Explainable AI for Accelerated Microstructure Imaging: A SHAP-Guided Protocol on the Connectome 2.0 scanner	Quentin Uhl et.al.	2509.09513	null
2025-09-11	Mixture of Semantics Transmission for Generative AI-Enabled Semantic Communication Systems	Junjie Ni et.al.	2509.09499	null
2025-09-11	SEDM: Scalable Self-Evolving Distributed Memory for Agents	Haoran Xu et.al.	2509.09498	null
2025-09-11	Prompt Pirates Need a Map: Stealing Seeds helps Stealing Prompts	Felix Mächtle et.al.	2509.09488	null
2025-09-11	Vorticity Packing Effects on Turbulent Transport in Decaying 2D Incompressible Navier-Stokes Fluids	Snehanshu Maiti et.al.	2509.09487	null
2025-09-11	Comprehensive Mapping of Tracer Diffusivities Across Composition Space in Ternary NiAlTi and Quinary NiCoFeAlTi High-Entropy Alloy Using Diffusion Couple Experiments and Physics Informed Neural Network Inversion	Ismail Kamil Worke et.al.	2509.09486	null
2025-09-11	Bath-induced stabilization of classical non-linear response in two dimensional infrared spectroscopy	Rajesh Dutta et.al.	2509.09476	null
2025-09-11	Axion-Photon Conversion in FLRW with Primordial Magnetic Fields: Explaining the Radio Excess	Setabuddin et.al.	2509.09472	null
2025-09-11	FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model	Yushen Xu et.al.	2509.09456	null
2025-09-11	Optimal Investment and Consumption in a Stochastic Factor Model	Florian Gutekunst et.al.	2509.09452	null
2025-09-11	Composable Score-based Graph Diffusion Model for Multi-Conditional Molecular Generation	Anjie Qiao et.al.	2509.09451	null
2025-09-11	Steady advection-diffusion in multiply-connected potential flows	Kyle McKee et.al.	2509.09444	null
2025-09-11	Plug-and-play Diffusion Models for Image Compressive Sensing with Data Consistency Projection	Xiaodong Wang et.al.	2509.09365	null
2025-09-11	Expressive Power of Deep Networks on Manifolds: Simultaneous Approximation	Hanfei Zhou et.al.	2509.09362	null
2025-09-11	Turnpike properties for zero-sum stochastic linear quadratic differential games of Markovian regime switching system	Xun Li et.al.	2509.09358	null
2025-09-11	Euler-type methods for Levy-driven McKean-Vlasov SDEs with super-linear coefficients: mean-square error analysis	Jingtao Zhu et.al.	2509.09302	null
2025-09-11	A note on quantifying the contributions of incidence functions in spatio-temporal epidemic models	Mohamed Mehdaoui et.al.	2509.09301	null
2025-09-11	Data Driven Discovery of Emergent Dynamics in Reaction Diffusion Systems from Sparse and Noisy Observations	Saumitra Dwivedi et.al.	2509.09278	null
2025-09-11	Long time strong convergence analysis of one-step methods for McKean-Vlasov SDEs with superlinear growth coefficients	Taiyuan Liu et.al.	2509.09274	null
2025-09-11	The role of communication delays in the optimal control of spatially invariant systems	Luca Ballotta et.al.	2509.09269	null
2025-09-11	A novel method and dataset for depth-guided image deblurring from smartphone Lidar	Antonio Montanaro et.al.	2509.09241	null
2025-09-11	MAPSS: Manifold-based Assessment of Perceptual Source Separation	Amir Ivry et.al.	2509.09212	null
2025-09-11	ALL-PET: A Low-resource and Low-shot PET Foundation Model in the Projection Domain	Bin Huang et.al.	2509.09130	null
2025-09-11	Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention	Junhao Xing et.al.	2509.09116	null
2025-09-10	Integrating Anatomical Priors into a Causal Diffusion Model	Binxu Li et.al.	2509.09054	null
2025-09-10	Noise-Activated Dopant Dynamics in Two-Dimensional Thermal Landscapes with Localized Cold Spots	Mesfin Taye et.al.	2509.09046	null
2025-09-10	Cosmic Ray Spatial Distribution and the Galactic/Extragalactic Transition	Paolo Lipari et.al.	2509.09028	null
2025-09-10	Complex dynamics and pattern formation in a diffusive epidemic model with an infection-dependent recovery rate	Wael El Khateeb et.al.	2509.09000	null
2025-09-10	HARD: A Performance Portable Radiation Hydrodynamics Code based on FleCSI Framework	Julien Loiseau et.al.	2509.08971	null
2025-09-10	Activity-driven clustering of jamming run-and-tumble particles: Exact three-body steady state by dynamical symmetry	Leo Hahn et.al.	2509.08945	null
2025-09-10	Discovering Divergent Representations between Text-to-Image Models	Lisa Dunlap et.al.	2509.08940	null
2025-09-10	Diffusion-Based Action Recognition Generalizes to Untrained Domains	Rogerio Guimaraes et.al.	2509.08908	null
2025-09-10	Anomalously fast transport in non-integrable lattice gauge theories	Devendra Singh Bhakuni et.al.	2509.08889	null
2025-09-10	RewardDance: Reward Scaling in Visual Generation	Jie Wu et.al.	2509.08826	null
2025-09-10	GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts	Jenna Kang et.al.	2509.08818	null
2025-09-10	Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles	Eric Slyman et.al.	2509.08777	null
2025-09-11	Joint Model-based Model-free Diffusion for Planning with Constraints	Wonsuhk Jung et.al.	2509.08775	null
2025-09-10	Sharp power concavity of two relevant free boundary problems of reaction-diffusion type	Qingyou He et.al.	2509.08768	null
2025-09-10	Learning Turbulent Flows with Generative Models: Super-resolution, Forecasting, and Sparse Flow Reconstruction	Vivek Oommen et.al.	2509.08752	null
2025-09-10	On the Lebesgue Constant of Extended-Domain Spectral Methods for Elliptic PDEs	Po-Yi Wu et.al.	2509.08745	null
2025-09-10	Finite-temperature transport in the gapped spin-1/2 XXZ chain and one-dimensional lattice spinless fermion model	J. M. P. Carmelo et.al.	2509.08741	null
2025-09-10	Data-driven generative simulation of SDEs using diffusion models	Xuefeng Gao et.al.	2509.08731	null
2025-09-10	Accelerating Diffusion Transformer-Based Text-to-Speech with Transformer Layer Caching	Siratish Sakpiboonchit et.al.	2509.08696	null
2025-09-10	The Small Magellanic Cloud through the lens of the James Webb Space Telescope : binaries and mass function within the galaxy outskirts	M. V. Legnardi et.al.	2509.08687	null
2025-09-10	X-Part: high fidelity and structure coherent shape decomposition	Xinhao Yan et.al.	2509.08643	null
2025-09-10	RoentMod: A Synthetic Chest X-Ray Modification Model to Identify and Correct Image Interpretation Model Shortcuts	Lauren H. Cooke et.al.	2509.08640	null
2025-09-10	LADB: Latent Aligned Diffusion Bridges for Semi-Supervised Domain Translation	Xuqin Wang et.al.	2509.08628	null
2025-09-10	Microstructural Control and Heat Transport Enhancement in Lanthanum Sulfate for Thermochemical Heat Storage	Kunihiko Shizume et.al.	2509.08585	null
2025-09-10	EfficientIML: Efficient High-Resolution Image Manipulation Localization	Jinhan Li et.al.	2509.08583	null
2025-09-10	Quenched and annealed heat kernel estimates for Brox’s diffusion	Xin Chen et.al.	2509.08559	null
2025-09-10	PEHRT: A Common Pipeline for Harmonizing Electronic Health Record data for Translational Research	Jessica Gronsbell et.al.	2509.08553	null
2025-09-10	System size and boundaries determine the patterning dynamics of attracting active particles	Jan Rombouts et.al.	2509.08533	null
2025-09-10	RoboMatch: A Mobile-Manipulation Teleoperation Platform with Auto-Matching Network Architecture for Long-Horizon Manipulation	Hanyu Liu et.al.	2509.08522	null
2025-09-10	HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning	Liyang Chen et.al.	2509.08519	null
2025-09-10	Search for a photon peak from keV-scale dark matter annihilation with NuSTAR: Constraints on $\langle σv \rangle$ after 11 years of observations	E. I. Zakharov et.al.	2509.08506	null
2025-09-10	Prompt-Driven Image Analysis with Multimodal Generative AI: Detection, Segmentation, Inpainting, and Interpretation	Kaleem Ahmad et.al.	2509.08489	null
2025-09-10	Audio Deepfake Verification	Li Wang et.al.	2509.08476	null
2025-09-10	Spherical Brownian Bridge Diffusion Models for Conditional Cortical Thickness Forecasting	Ivan Stoyanov et.al.	2509.08442	null
2025-09-10	PegasusFlow: Parallel Rolling-Denoising Score Sampling for Robot Diffusion Planner Flow Matching	Lei Ye et.al.	2509.08435	null
2025-09-10	One-dimensional particle clouds with elastic collisions	Mikhail Menshikov et.al.	2509.08430	null
2025-09-10	LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations	Payal Varshney et.al.	2509.08422	null
2025-09-10	The Critical 9365 Å Diffuse Interstellar Band and C $_{60}^{+}$ Association	Daniel Majaess et.al.	2509.08414	null
2025-09-10	Protoplanetary disks around magnetized young stars with large-scale magnetic fields I: Steady-state solutions	D. Steiner et.al.	2509.08393	null
2025-09-11	VRAE: Vertical Residual Autoencoder for License Plate Denoising and Deblurring	Cuong Nguyen et.al.	2509.08392	null
2025-09-10	LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models	Hirokazu Kameoka et.al.	2509.08379	null
2025-09-10	Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video	Xiao Li et.al.	2509.08376	null
2025-09-10	Stop using root-mean-square error as a precipitation target!	Kieran M. R. Hunt et.al.	2509.08369	null
2025-09-10	Physics-Guided Rectified Flow for Low-light RAW Image Enhancement	Juntai Zeng et.al.	2509.08330	null
2025-09-10	Trans-scale spin Seebeck effect in nanostructured bulk composites based on magnetic insulator	Sang J. Park et.al.	2509.08327	null
2025-09-10	Controlling GaN nucleation via O $_2$ -plasma-perforated graphene masks on c-plane sapphire	Su Young An et.al.	2509.08275	null
2025-09-10	Generative Quasi-Continuum Modeling of Confined Fluids at the Nanoscale	Bugra Yalcin et.al.	2509.08223	null
2025-09-10	Moiré excitons in generalized Wigner crystals	Jing-Yang You et.al.	2509.08211	null
2025-09-09	ArtifactGen: Benchmarking WGAN-GP vs Diffusion for Label-Aware EEG Artifact Synthesis	Hritik Arasu et.al.	2509.08188	null
2025-09-09	Modeling of convective cells, turbulence, and transport induced by a radio-frequency antenna in the tokamak boundary plasma	M. V. Umansky et.al.	2509.08178	null
2025-09-09	A Linear Pricing Mechanism for Load Management in Day-Ahead Retail Energy Markets	Phillippe K. Phanivong et.al.	2509.08166	null
2025-09-09	Diffusion-Guided Multi-Arm Motion Planning	Viraj Parimi et.al.	2509.08160	null
2025-09-09	Electronic Fluctuations and Ionic Dynamics in Molten Silver Iodide	Harender S. Dhattarwal et.al.	2509.08143	null
2025-09-09	Joint calibration of the volatility surface and variance term structure	Jiwook Yoo et.al.	2509.08096	null
2025-09-09	DDNet: A Unified Physics-Informed Deep Learning Framework for Semiconductor Device Modeling	Roberto Riganti et.al.	2509.08073	null
2025-09-09	Discovery of a $z \sim 0.8$ Ultra Steep Spectrum Radio Halo in the MeerKAT-South Pole Telescope Survey	Isaac S. Magolego et.al.	2509.08062	null
2025-09-09	Acceleration of Heavy Ions at Non-Relativistic Collisionless Shocks	Damiano Caprioli et.al.	2509.08061	null
2025-09-09	Breaking Dark: Hunting Heavy Decaying Dark Matter with Tibet AS $_γ$ and LHAASO-KM2A	Abhishek Dubey et.al.	2509.08039	null
2025-09-09	PyPAS – Python package for Positron Annihilation Spectroscopy Doppler Broadening Analysis	Achiya Yosef Amrusi et.al.	2509.08023	null
2025-09-08	CardioComposer: Flexible and Compositional Anatomical Structure Generation with Disentangled Geometric Guidance	Karim Kadry et.al.	2509.08015	null
2025-09-08	Validation of a CT-brain analysis tool for measuring global cortical atrophy in older patient cohorts	Sukhdeep Bal et.al.	2509.08012	null
2025-09-09	LHAASO Galactic Plane $γ$ -rays Strongly Constrain Heavy Dark Matter	Celine Boehm et.al.	2509.07982	null
2025-09-09	Edwards-Wilkinson limit for a stochastic advection-diffusion PDE	Sotirios Kotitsas et.al.	2509.07956	null
2025-09-09	Feature Space Analysis by Guided Diffusion Model	Kimiaki Shirahama et.al.	2509.07936	null
2025-09-09	ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion	Ao Li et.al.	2509.07920	null
2025-09-09	Measurement of ion acceleration and diffusion in a laser-driven magnetized plasma	J. T. Y. Chu et.al.	2509.07880	null
2025-09-09	Duality estimates for subdiffusion problems including time-fractional porous medium type equations	Arlúcio Viana et.al.	2509.07862	null
2025-09-09	Convergence analysis for the Barrett-Garcke-Nurnberg method of transport type	Genming Bai et.al.	2509.07834	null
2025-09-09	A Note on the failure of temporal regularity for stochastic PDEs	Antonio Agresti et.al.	2509.07803	null
2025-09-09	Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey	Minghan Li et.al.	2509.07794	null
2025-09-09	SN 2022xlp: The second-known well-observed, intermediate-luminosity Iax supernova	D. Bánhidi et.al.	2509.07717	null
2025-09-09	A Generalisable Generative Model for Multi-Detector Calorimeter Simulation	Piyush Raikwar et.al.	2509.07700	null
2025-09-09	Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity	Sung Ju Lee et.al.	2509.07647	null
2025-09-09	An all-sky 3D dust map Based on Gaia and LAMOST	Tao Wang et.al.	2509.07640	null
2025-09-10	LSMTCR: A Scalable Multi-Architecture Model for Epitope-Specific T Cell Receptor de novo Design	Ruihao Zhang et.al.	2509.07627	null
2025-09-09	AgentX: Towards Orchestrating Robust Agentic Workflow Patterns with FaaS-hosted MCP Services	Shiva Sai Krishna Anand Tokal et.al.	2509.07595	null
2025-09-09	Sorting of binary active-passive mixtures in designed microchannels	Horacio Serna et.al.	2509.07582	null
2025-09-09	Atomic Layer Etching of Aluminum Nitride: Mechanistic Insights from First-Principles Studies of Chlorine Chemistry	Sanjay Nayak et.al.	2509.07554	null
2025-09-09	PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image	Peng Li et.al.	2509.07552	null
2025-09-09	Two-dimensional fractional Brownian motion: Analysis in time and frequency domains	Michał Balcerek et.al.	2509.07537	null
2025-09-09	Universal Few-Shot Spatial Control for Diffusion Models	Kiet T. Nguyen et.al.	2509.07530	null
2025-09-09	Emergence of continuously varying critical exponents in coupled map lattice as an effect of quenched disorder	Priyanka D. Bhoyar et.al.	2509.07529	null
2025-09-09	Target matching based generative model for speech enhancement	Taihui Wang et.al.	2509.07521	null
2025-09-09	Magnetic Resonance Imaging Virtual Liver Biopsy Using Radiomics Analysis for the Assessment of Chronic Liver Disease	Jiqing Huang et.al.	2509.07516	null
2025-09-09	LINR Bridge: Vector Graphic Animation via Neural Implicits and Video Diffusion Priors	Wenshuo Gao et.al.	2509.07484	null
2025-09-09	Uncertainty in Hadronic Diffuse $γ$ -Ray Emission from the Temporal Stochasticity of Cosmic-Ray Sources	Xing-Jian Lv et.al.	2509.07481	null
2025-09-09	ANYPORTAL: Zero-Shot Consistent Video Background Replacement	Wenshuo Gao et.al.	2509.07472	null
2025-09-09	DepthVision: Robust Vision-Language Understanding through GAN-Based LiDAR-to-RGB Synthesis	Sven Kirchner et.al.	2509.07463	null
2025-09-09	Unveiling Biological Models Through Turing Patterns	Yuhan Li et.al.	2509.07458	null
2025-09-09	Node Position Estimation in Diffusion-Based Molecular Communications Using Multi-Layer Perceptron	Sangjun Hwang et.al.	2509.07441	null
2025-09-09	GRASPion: an Open-Source, Programmable Brainbot for Active Matter Research	F. Novkoski et.al.	2509.07437	null
2025-09-09	DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation	Ze-Xin Yin et.al.	2509.07435	null
2025-09-11	Blow-up for a Nonlocal Diffusion Equation with Time Regularly Varying Nonlinearity and Forcing	Rihab Ben Belgacem et.al.	2509.07405	null
2025-09-09	Time evolution of averaged limit shapes of random multiple Young diagrams	Akihito Hora et.al.	2509.07393	null
2025-09-09	On the exponential convergence to equilibrium for ultrafast diffusion equations	Yi C. Huang et.al.	2509.07382	null
2025-09-09	Knowledge Distillation Driven Semantic NOMA for Image Transmission with Diffusion Model	Qifei Wang et.al.	2509.07363	null
2025-09-09	Distributed Frequency Control for Multi-Area Power Systems Considering Transient Frequency Safety	Xiemin Mo et.al.	2509.07345	null
2025-09-09	SpecifyUI: Supporting Iterative UI Design Intent Expression through Structured Specifications and Generative AI	Yunnong Chen et.al.	2509.07334	null
2025-09-09	Data-knowledge fusion driven frequency security assessment: A robust framework for renewable-dominated power grids	Yurun Zhang et.al.	2509.07320	null
2025-09-08	Reconstruction Alignment Improves Unified Multimodal Models	Ji Xie et.al.	2509.07295	null
2025-09-08	Breast Cancer Detection in Thermographic Images via Diffusion-Based Augmentation and Nonlinear Feature Fusion	Sepehr Salem et.al.	2509.07277	null
2025-09-08	Hybrid Galam–Bass Model for Technology Innovation	Giulia Rotundo et.al.	2509.07275	null
2025-09-08	Thermodynamic Irreversibility in Underdamped Brownian Motion with Spatial Temperature Gradients	Mesfin Taye et.al.	2509.07272	null
2025-09-08	Extended Version: Market-Driven Equilibria for Distributed Solar Panel Investment	Mehdi Davoudi et.al.	2509.07203	null
2025-09-08	Realism to Deception: Investigating Deepfake Detectors Against Face Enhancement	Muhammad Saad Saeed et.al.	2509.07178	null
2025-09-08	Ultrathin oxide freestanding membranes with large-scale continuity and structural perfection	Yuhao Hong et.al.	2509.07176	null
2025-09-08	Unveiling the Impact of Cosmic Rays on the Disc Sizes and Outflows from Dwarf Scales to Galaxy Groups	Rebekka Bieri et.al.	2509.07124	null
2025-09-08	Indirect detection of boosted light scalar dark matter	Arindam Basu et.al.	2509.07110	null
2025-09-08	Constraining Baryon Fractions in Galaxy Groups and Clusters with the First CHIME/FRB Outrigger	Adam E. Lanman et.al.	2509.07097	null
2025-09-08	Automated Evaluation of Gender Bias Across 13 Large Multimodal Models	Juan Manuel Contreras et.al.	2509.07050	null
2025-09-07	The Impact of Artificial Intelligence on Traditional Art Forms: A Disruption or Enhancement	Viswa Chaitanya Marella et.al.	2509.07029	null
2025-09-10	Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models	Jisung Hwang et.al.	2509.07027	null
2025-09-08	Scaling Transformer-Based Novel View Synthesis Models with Token Disentanglement and Synthetic Data	Nithin Gopalakrishnan Nair et.al.	2509.06950	null
2025-09-08	Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models	Yinjie Wang et.al.	2509.06949	null
2025-09-09	Interleaving Reasoning for Better Text-to-Image Generation	Wenxuan Huang et.al.	2509.06945	null
2025-09-09	Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference	Xiangwei Shen et.al.	2509.06942	null
2025-09-10	LLaDA-VLA: Vision Language Diffusion Action Models	Yuqing Wen et.al.	2509.06932	null
2025-09-08	BIR-Adapter: A Low-Complexity Diffusion Model Adapter for Blind Image Restoration	Cem Eteke et.al.	2509.06904	null
2025-09-08	Nanobot Algorithms for Treatment of Diffuse Cancer	Noble Harasha et.al.	2509.06893	null
2025-09-08	Homogenisation of a Passive Scalar Transported by Locally Supported White Noise	Federico Butori et.al.	2509.06878	null
2025-09-08	Infinite Interacting Brownian Motions and EVI Gradient Flows	Kohei Suzuki et.al.	2509.06869	null
2025-09-08	A New Hybrid Model of Generative Adversarial Network and You Only Look Once Algorithm for Automatic License-Plate Recognition	Behnoud Shafiezadeh et.al.	2509.06868	null
2025-09-08	floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL	Bhavya Agrawalla et.al.	2509.06863	null
2025-09-08	Stochastic modelling of cosmic-ray sources for Galactic diffuse emissions	Anton Stall et.al.	2509.06857	null
2025-09-08	CRISP – Compliant ROS2 Controllers for Learning-Based Manipulation Policies and Teleoperation	Daniel San José Pro et.al.	2509.06819	null
2025-09-08	UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward	Yufeng Cheng et.al.	2509.06818	null
2025-09-08	Large eddy simulations in astrophysics	Wolfram Schmidt-Brückner et.al.	2509.06801	null
2025-09-08	Image Encryption Scheme Based on Hyper-Chaotic Map and Self-Adaptive Diffusion	Yiqi Tang et.al.	2509.06754	null
2025-09-08	Zero-shot 3D-Aware Trajectory-Guided image-to-video generation via Test-Time Training	Ruicheng Zhang et.al.	2509.06723	null
2025-09-08	STAGE: Segmentation-oriented Industrial Anomaly Synthesis via Graded Diffusion with Explicit Mask Alignment	Xichen Xu et.al.	2509.06693	null
2025-09-08	A Parallel Solver with Multiphysics Finite Element Method for Poroelasticity Coupled with Elasticity Model	Zhihao Ge et.al.	2509.06673	null
2025-09-08	The complementary of CTAO, direct detection and collider searches for dark matter in Effective Field Theories and Simplified models	Igor Reis et.al.	2509.06628	null
2025-09-08	Fisher entropic Fokker-Planck model of monatomic rarefied gases	Veronica Montanaro et.al.	2509.06610	null
2025-09-08	Contrastive Anatomy-Contrast Disentanglement: A Domain-General MRI Harmonization Method	Daniel Scholz et.al.	2509.06592	null
2025-09-08	CausNVS: Autoregressive Multi-view Diffusion for Flexible 3D Novel View Synthesis	Xin Kong et.al.	2509.06579	null
2025-09-08	From Rigging to Waving: 3D-Guided Diffusion for Natural Animation of Hand-Drawn Characters	Jie Zhou et.al.	2509.06573	null
2025-09-08	Interlayer Coupling and Exciton Dynamics in 2D Hybrid Structures based on an InGaN Quantum Well coupled to a MoSe2 Monolayer	D. Chen et.al.	2509.06547	null
2025-09-08	A multiscale theory for network advection-reaction-diffusion	Hadrien Oliveri et.al.	2509.06546	null
2025-09-08	Thermalization dynamics of finite-size quantum critical systems	Li Li et.al.	2509.06523	null
2025-09-08	On optimal solutions of classical and sliced Wasserstein GANs with non-Gaussian data	Yu-Jui Huang et.al.	2509.06505	null
2025-09-08	TIDE: Achieving Balanced Subject-Driven Image Generation via Target-Instructed Diffusion Enhancement	Jibai Lin et.al.	2509.06499	null
2025-09-08	Phyllotaxis in a Keller-Segel model	Michael F. Staddon et.al.	2509.06498	null
2025-09-08	Discovery of giant bubbles in the hot gaseous halo of the massive disk galaxy NGC 6286	Lin He et.al.	2509.06470	null
2025-09-08	VQualA 2025 Challenge on Image Super-Resolution Generated Content Quality Assessment: Methods and Results	Yixiao Li et.al.	2509.06413	null
2025-09-08	Diffusion-Shock PDEs for Deep Learning on Position-Orientation Space	Finn M. Sherry et.al.	2509.06405	null
2025-09-08	Non-Destructive Rail Monitoring for Defect Identification	Elissa Akiki et.al.	2509.06394	null
2025-09-08	Hydrogen-induced fast fracture in a 1.5 GPa dual-phase steel	Rama Srinivas Varanasi et.al.	2509.06323	null
2025-09-08	McKean-Vlasov limits of scaling-critical reaction-diffusion equations with random initial data	Bryan Castillo et.al.	2509.06260	null
2025-09-07	Multi-Scale Modeling and Predictive Control of Active Brownian Particles	Sadra Saremi et.al.	2509.06217	null
2025-09-07	Grasp-MPC: Closed-Loop Visual Grasping via Value-Guided Model Predictive Control	Jun Yamada et.al.	2509.06201	null
2025-09-07	Forward and inverse problems of a semilinear transport equation	Kui Ren et.al.	2509.06183	null
2025-09-07	The role of the initial distribution in population survival within a bounded habitat	Rafael de la Rosa et.al.	2509.06179	null
2025-09-07	UniVerse-1: Unified Audio-Video Generation via Stitching of Experts	Duomin Wang et.al.	2509.06155	null
2025-09-07	If generative AI is the answer, what is the question?	Ambuj Tewari et.al.	2509.06120	null
2025-09-10	The Thermodynamic Limit of Extreme First-Passage Times	Talia Baravi et.al.	2509.06098	null
2025-09-07	Home-made Diffusion Model from Scratch to Hatch	Shih-Ying Yeh et.al.	2509.06068	null
2025-09-10	BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models	Yuming Li et.al.	2509.06040	null
2025-09-07	DreamAudio: Customized Text-to-Audio Generation with Diffusion Models	Yi Yuan et.al.	2509.06027	null
2025-09-07	The Gross-Pitaewsky equation with time and space dependent coefficients	Federico Lai et.al.	2509.06001	null
2025-09-07	Multi-Strategy Guided Diffusion via Sparse Masking Temporal Reweighting Distribution Correction	Zekun Zhou et.al.	2509.05992	null
2025-09-07	Simulation of Solar Surface Flux Transport Constrained by Magnetic Power Spectra. I. Flux Transport Parameter	Yukun Luo et.al.	2509.05989	null
2025-09-07	Imagining Alternatives: Towards High-Resolution 3D Counterfactual Medical Image Generation via Language Guidance	Mohamed Mohamed et.al.	2509.05978	null
2025-09-09	Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching	Feng Wang et.al.	2509.05952	null
2025-09-06	Transformer-based Topology Optimization	Aaron Lutheran et.al.	2509.05800	null
2025-09-06	Hybrid Fourier Neural Operator-Plasma Fluid Model for Fast and Accurate Multiscale Simulations of High Power Microwave Breakdown	Kalp Pandya et.al.	2509.05799	null
2025-09-06	Discrete-Time Quantum Random Walk for Epidemiological Modeling	Sayan Manna et.al.	2509.05795	null
2025-09-06	Depth Profiling of Oxygen Migration in Ta/HfO2 Stacks During Ionic Liquid Gating	Beatrice Bednarz et.al.	2509.05748	null
2025-09-06	InterAct: A Large-Scale Dataset of Dynamic, Expressive and Interactive Activities between Two People in Daily Scenarios	Leo Ho et.al.	2509.05747	null
2025-09-06	High-friction limit for bipolar Euler-Riesz systems	Nuno J. Alves et.al.	2509.05742	null
2025-09-06	Polarization memory effect in a multimode fiber	Gauri Arora et.al.	2509.05665	null
2025-09-06	EditIDv2: Editable ID Customization with Data-Lubricated ID Feature Integration for Text-to-Image Generation	Guandong Li et.al.	2509.05659	null
2025-09-06	Well-posedness and regularity theory for the fractional diffusion-wave equation in Lebesgue spaces	Bruno de Andrade et.al.	2509.05654	null
2025-09-06	SuMa: A Subspace Mapping Approach for Robust and Effective Concept Erasure in Text-to-Image Diffusion Models	Kien Nguyen et.al.	2509.05625	null
2025-09-06	Large and moderate deviation principles for stochastic partial differential equation on graph	Jianbo Cui et.al.	2509.05622	null
2025-09-05	Perpendicular ion heating in turbulence and reconnection: magnetic moment breaking by coherent fluctuations	Alfred Mallet et.al.	2509.05518	null
2025-09-05	Chemotaxis Models with Nonlinear/Porous Medium Diffusion, Consumption, and Logistic source on $\mathbb{R}^N$ : I. Global Solvability and Boundedness	Zulaihat Hassan et.al.	2509.05494	null
2025-09-05	From Image Generation to Infrastructure Design: a Multi-agent Pipeline for Street Design Generation	Chenguang Wang et.al.	2509.05469	null
2025-09-05	Newton to Einstein: Axiom-Based Discovery via Game Design	Pingchuan Ma et.al.	2509.05448	null
2025-09-05	The MeerKAT Galaxy Cluster Legacy Survey – II. Catalogue of the diffuse radio emission in MeerKAT-GCLS clusters	Konstantinos Kolokythas et.al.	2509.05442	null
2025-09-05	Diffusioosmosis of electrolyte solutions in uniformly charged channels	Evgeny S. Asmolov et.al.	2509.05387	null
2025-09-05	Spin-transport characteristics in a Si-based spin metal-oxide-semiconductor field-effect transistor (spin MOSFET): Bias dependence of the spin polarization in Si and magnetoresistance in spin-valve signals	Shoichi Sato et.al.	2509.05384	null
2025-09-05	Extreme Negative Polarisation of New Interstellar Comet 3I/ATLAS	Zuri Gray et.al.	2509.05181	null
2025-09-05	Cheaper access to universal fluctuations in integrable spin chains from boundary effects	Sylvain Prolhac et.al.	2509.05176	null
2025-09-05	Latest results from the searches for ultra-high-energy photons at the Pierre Auger Observatory	Pierpaolo Savina et.al.	2509.05113	null
2025-09-05	Painting the market: generative diffusion models for financial limit order book simulation and forecasting	Alfred Backhouse et.al.	2509.05107	null
2025-09-05	Physical interactions enable energy-efficient Turing patterns	Cathelijne ter Burg et.al.	2509.05093	null
2025-09-05	MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading	Yang Chen et.al.	2509.05080	null
2025-09-05	Masked Diffusion Language Models with Frequency-Informed Training	Despoina Kosmopoulou et.al.	2509.05056	null
2025-09-05	Active thermodynamics of inertial chiral active gases: equation of state and edge currents	Lorenzo Caprini et.al.	2509.05053	null
2025-09-05	QCA-MolGAN: Quantum Circuit Associative Molecular GAN with Multi-Agent Reinforcement Learning	Aaron Mark Thomas et.al.	2509.05051	null
2025-09-05	LUIVITON: Learned Universal Interoperable VIrtual Try-ON	Cong Cao et.al.	2509.05030	null
2025-09-05	Synthetic Acceleration Preconditioners for Parametric Radiative Transfer Equations based on Trajectory-Aware Reduced Order Models	Ning Tang et.al.	2509.05001	null
2025-09-05	FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies	Moritz Reuss et.al.	2509.04996	null
2025-09-05	Improving Spatial Resolution of Background Oriented Schlieren Based on Directional Rays	Xiang Li et.al.	2509.04992	null
2025-09-05	Magnetorotational and convective instabilities in a thin layer of electrically conductive nanofluid under an external helical magnetic field	M. I. Kopp et.al.	2509.04968	null
2025-09-05	Efficient estimation of jump parameters for stochastic differential equations driven by L{é}vy processes	Elise Bayraktar et.al.	2509.04920	null
2025-09-05	Survey of Profile Parameters of the $6196 Å$ Diffuse Interstellar Band. From Uniform Profiles to Doppler Splitting and Blueshifts	M. Piecka et.al.	2509.04915	null
2025-09-05	Off-lattice Microscopic Monte Carlo Modeling of Molecular Hydrogen Formation on Carbonaceous Dust Grains	N. A. Satonkin et.al.	2509.04913	null
2025-09-05	Spectrum of slip dynamics, scaling & statistical laws emerge from simplified model of fault and damage zone architecture	M. Almakari et.al.	2509.04909	null
2025-09-05	Plug-and-Play Latent Diffusion for Electromagnetic Inverse Scattering with Application to Brain Imaging	Rui Guo et.al.	2509.04860	null
2025-09-05	A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing	Chengkai Xu et.al.	2509.04853	null
2025-09-05	Stable and unstable spatially-periodic canards created in singular subcritical Turing bifurcations in the Brusselator system	Robert Jencks et.al.	2509.04835	null
2025-09-05	SemSteDiff: Generative Diffusion Model-based Coverless Semantic Steganography Communication	Song Gao et.al.	2509.04803	null
2025-09-05	Stability and Self-Organized Patterns in Coupled Ecohydrological–Fire Dynamics: A Model of Vegetation–Rainfall–Bushfire Interactions	Serena Dipierro et.al.	2509.04766	null
2025-09-05	STADI: Fine-Grained Step-Patch Diffusion Parallelism for Heterogeneous GPUs	Han Liang et.al.	2509.04719	null
2025-09-04	Transforming Fashion with AI: A Comparative Study of Large Language Models in Apparel Design	Nusrat Jahan Lamia et.al.	2509.04705	null
2025-09-04	On convergence of upwinding Petrov-Galerkin methods for convection-diffusion	Constantin Bacuta et.al.	2509.04703	null
2025-09-04	DarkStream: real-time speech anonymization with low latency	Waris Quamer et.al.	2509.04667	null
2025-09-04	Mo Atom Rearrangement Drives Layer-Dependent Reactivity in Two-Dimensional MoS2	Zifan Wang et.al.	2509.04648	null
2025-09-04	Technical Developments of DA on $\mathbb{T}^3$	Hangyue Zhang et.al.	2509.04634	null
2025-09-04	$\mathcal{L}_1$ -DRAC: Distributionally Robust Adaptive Control	Aditya Gahlawat et.al.	2509.04619	null
2025-09-04	DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models	Jin Ma et.al.	2509.04597	null
2025-09-04	An S-matrix Formalism for the Nonclassical Optical Response of Plasmonic Sphere Aggregates	Xin Zheng et.al.	2509.04589	null
2025-09-04	Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model	Hongyang Wei et.al.	2509.04548	null
2025-09-04	Spatial Patterning and Selection: How the Environment Shapes Molecular Complexity	Alexandre Champagne-Ruel et.al.	2509.04547	null
2025-09-04	PromptEnhancer: A Simple Approach to Enhance Text-to-Image Models via Chain-of-Thought Prompt Rewriting	Linqing Wang et.al.	2509.04545	null
2025-09-04	In-Context Policy Adaptation via Cross-Domain Skill Diffusion	Minjong Yoo et.al.	2509.04535	null
2025-09-04	Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image – Technical Preview	Jun-Kun Chen et.al.	2509.04450	null
2025-09-04	Plot’n Polish: Zero-shot Story Visualization and Disentangled Editing with Text-to-Image Diffusion Models	Kiymet Akdemir et.al.	2509.04446	null
2025-09-04	Durian: Dual Reference-guided Portrait Animation with Attribute Transfer	Hyunsoo Cha et.al.	2509.04434	null
2025-09-04	Few-step Flow for 3D Generation via Marginal-Data Transport Distillation	Zanwei Zhou et.al.	2509.04406	null
2025-09-04	Transition Models: Rethinking the Generative Learning Objective	Zidong Wang et.al.	2509.04394	null
2025-09-04	SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer	Jimin Xu et.al.	2509.04379	null
2025-09-04	Connections between reinforcement learning with feedback,test-time scaling, and diffusion guidance: An anthology	Yuchen Jiao et.al.	2509.04372	null
2025-09-04	Sensitivities of time-dependent temperature profile predictions for NSTX with the Multi-Mode Model	J. B. Lestz et.al.	2509.04360	null
2025-09-04	From Editor to Dense Geometry Estimator	JiYuan Wang et.al.	2509.04338	null
2025-09-04	The limiting law of the Discrete Gaussian level-lines	Joseph Chen et.al.	2509.04333	null
2025-09-04	Noisy Label Refinement with Semantically Reliable Synthetic Images	Yingxuan Li et.al.	2509.04298	null
2025-09-04	TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models	Yuxin Gong et.al.	2509.04269	null
2025-09-04	Thermal diffusivity measurement based on evaporative cryocooling excitation: Theory and experiments	Pengfei Zhu et.al.	2509.04263	null
2025-09-04	Error analysis for learning the time-stepping operator of evolutionary PDEs	Ke Chen et.al.	2509.04256	null
2025-09-04	Synthetic Survival Data Generation for Heart Failure Prognosis Using Deep Generative Models	Chanon Puttanawarut et.al.	2509.04245	null
2025-09-04	Axion-Photon Conversion In Magnetized Universe: Impact On The Global 21-cm Signal	Pravin Kumar Natwariya et.al.	2509.04237	null
2025-09-04	Cosmic-Ray Boosted Diffuse Supernova Neutrinos	Alexander Sandrock et.al.	2509.04229	null
2025-09-04	Making neural networks understand internal heat transfer using Fourier-transformed thermal diffusion wave fields	Pengfei Zhu et.al.	2509.04223	null
2025-09-04	Two-dimensional magnetic tunnel p-n junctions for low-power electronics	Wenkai Zhu et.al.	2509.04206	null
2025-09-04	Laplacian Flows in Complex-valued Directed Networks: Analysis, Design, and Consensus	Aditi Saxena et.al.	2509.04196	null
2025-09-04	DUDE: Diffusion-Based Unsupervised Cross-Domain Image Retrieval	Ruohong Yang et.al.	2509.04193	null
2025-09-04	Set Block Decoding is a Language Model Inference Accelerator	Itai Gat et.al.	2509.04185	null
2025-09-04	On Riordan groups involving formal semi-Laurent series and their Lie group structure	Dariusz Bugajewski et.al.	2509.04160	null
2025-09-04	Hyper Diffusion Avatars: Dynamic Human Avatar Generation using Network Weight Space Diffusion	Dongliang Cao et.al.	2509.04145	null
2025-09-04	MEPG:Multi-Expert Planning and Generation for Compositionally-Rich Image Generation	Yuan Zhao et.al.	2509.04126	null
2025-09-04	A unified stabilized virtual element method for the generalized Oseen equation: stability and robustness	Sudheer Mishra et.al.	2509.04113	null
2025-09-04	Depletion-Induced Interactions Modulate Nanoscale Protein Diffusion in Polymeric Crowder Solutions	Michelle Dargasz et.al.	2509.04087	null
2025-09-04	Keypoint-based Diffusion for Robotic Motion Planning on the NICOL Robot	Lennart Clasmeier et.al.	2509.04076	null
2025-09-04	SMooGPT: Stylized Motion Generation using Large Language Models	Lei Zhong et.al.	2509.04058	null
2025-09-04	CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning	Zeyu Gan et.al.	2509.04027	null
2025-09-04	Electromechanical human heart modeling for predicting endocardial heart motion	Milad Hasani et.al.	2509.04024	null
2025-09-04	Divergence-Kernel method for linear responses and diffusion models	Angxiu Ni et.al.	2509.03992	null
2025-09-04	NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models	Chuhan Zhang et.al.	2509.03985	null
2025-09-05	Improving Vessel Segmentation with Multi-Task Learning and Auxiliary Data Available Only During Model Training	Daniel Sobotka et.al.	2509.03975	null
2025-09-04	ANTS: Shaping the Adaptive Negative Textual Space by MLLM for OOD Detection	Zhu Wenjie et.al.	2509.03951	null
2025-09-04	Fluid boundary conditions in kinetic-diffusion Monte Carlo	Thijs Steel et.al.	2509.03942	null
2025-09-04	Thickness-dependent magnon spin transport in antiferromagnetic insulators: Crossover from quasi-three-dimensional to quasi-two-dimensional regimes	Mathias Åsan Myhre et.al.	2509.03941	null
2025-09-04	SwinSRGAN: Swin Transformer-based Generative Adversarial Network for High-Fidelity Speech Super-Resolution	Jiajun Yuan et.al.	2509.03913	null
2025-09-04	A Generative Foundation Model for Chest Radiography	Yuanfeng Ji et.al.	2509.03903	null
2025-09-04	Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series	Zhengyi Guo et.al.	2509.03898	null
2025-09-04	Human Motion Video Generation: A Survey	Haiwei Xue et.al.	2509.03883	null
2025-09-04	Demonstrating a family of X-ray dark-field retrieval approaches on a common set of samples	Samantha J. Alloo et.al.	2509.03866	null
2025-09-04	A minimization principle behind the diffusion bridge of diurnal fish migration	H. Yoshioka et.al.	2509.03824	null
2025-09-04	Machine Learning for LiDAR-Based Indoor Surface Classification in Intelligent Wireless Environments	Parth Ashokbhai Shiroya et.al.	2509.03813	null
2025-09-04	Causality-guided Prompt Learning for Vision-language Models via Visual Granulation	Mengyu Gao et.al.	2509.03803	null
2025-09-04	Universal Structure of Turbulent Radiative Mixing Layers	Prateek Sharma et.al.	2509.03802	null
2025-09-04	A high-lying isomer in ^{92}Zr with lifetime modulated by the atomic charge states: a proposed approach for a nuclear gamma-ray laser	C. X. Jia et.al.	2509.03797	null
2025-09-04	Fitting Image Diffusion Models on Video Datasets	Juhun Lee et.al.	2509.03794	null
2025-09-03	Learning functions through Diffusion Maps	Alvaro Almeida Gomez et.al.	2509.03758	null
2025-09-03	Effects of Bethe-Heitler pair production in ultraluminous X-ray sources	Gustavo Esteban Romero et.al.	2509.03735	null
2025-09-03	LuxDiT: Lighting Estimation with Video Diffusion Transformer	Ruofan Liang et.al.	2509.03680	null
2025-09-03	Applying a Gaussian networking theory to model motor-driven transport along cytoskeletal filaments	Nadine du Toit et.al.	2509.03671	null
2025-09-06	Efficient Virtuoso: A Latent Diffusion Transformer Model for Goal-Conditioned Trajectory Planning	Antonio Guillen-Perez et.al.	2509.03658	null
2025-09-05	Noise is All You Need: rethinking the value of noise on seismic denoising via diffusion models	Donglin Zhu et.al.	2509.03629	null
2025-09-03	Statistical Analysis of PAHs as a Tracer of Anomalous Microwave Emission Using DIRBE Data	Danielle Sponseller et.al.	2509.03611	null
2025-09-03	Breaking Down the $\textsf{CosmoGEMS}$ : Toward Modeling and Understanding Globular Cluster Stellar Streams in a Fully Cosmological Context	Nondh Panithanpaisal et.al.	2509.03599	null
2025-09-02	Diffusion-RL Based Air Traffic Conflict Detection and Resolution Method	Tonghe Li et.al.	2509.03550	null
2025-09-03	Dynamically Controlled Transport of GeV Cosmic Rays in Diverse Galactic Environments	Ronan Hix et.al.	2509.03519	null
2025-09-03	Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?	Ouxiang Li et.al.	2509.03516	null
2025-09-03	OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation	Han Li et.al.	2509.03498	null
2025-09-03	From Image Denoisers to Regularizing Imaging Inverse Problems: An Overview	Hong Ye Tan et.al.	2509.03475	null
2025-09-03	Joint Training of Image Generator and Detector for Road Defect Detection	Kuan-Chuan Peng et.al.	2509.03465	null
2025-09-03	Nitrogen chemistry of hycean worlds on the example of K2-18b	Maja W. Radecka et.al.	2509.03455	null
2025-09-03	ANNIE: Be Careful of Your Robots	Yiyang Huang et.al.	2509.03383	null
2025-09-03	Dynamics of Infection Spread and Hotspot Growth in Bi-Pathogen Networks	Alyssa Yu et.al.	2509.03374	null
2025-09-03	Generative Auto-Bidding in Large-Scale Competitive Auctions via Diffusion Completer-Aligner	Yewen Li et.al.	2509.03348	null
2025-09-03	On the MIA Vulnerability Gap Between Private GANs and Diffusion Models	Ilana Sebag et.al.	2509.03341	null
2025-09-03	Dynamical interface above a hard wall and reflected SPDE on the half-line	Pierre Faugère et.al.	2509.03328	null
2025-09-03	Numerical Modeling of Galactic Cosmic Ray Modulation in the Heliosphere	D. A. Shestakov et.al.	2509.03326	null
2025-09-03	InfraDiffusion: zero-shot depth map restoration with diffusion models and prompted segmentation from sparse infrastructure point clouds	Yixiong Jing et.al.	2509.03324	null
2025-09-03	Noise resilience of two-dimensional Floquet topological phases	Balaganchi A. Bhargava et.al.	2509.03296	null
2025-09-03	SynBT: High-quality Tumor Synthesis for Breast Tumor Segmentation by 3D Diffusion Model	Hongxu Yang et.al.	2509.03267	null
2025-09-03	Estudio de la eficiencia en la escalabilidad de GPUs para el entrenamiento de Inteligencia Artificial	David Cortes et.al.	2509.03263	null
2025-09-03	Evaluation of Stress Detection as Time Series Events – A Novel Window-Based F1-Metric	Harald Vilhelm Skat-Rørdam et.al.	2509.03240	null
2025-09-03	Deep Learning for High Speed Optical Coherence Elastography with a Fiber Scanning Endoscope	Maximilian Neidhardt et.al.	2509.03193	null
2025-09-03	Dissecting the Diffuse Emission of the Galaxy with the HAWC Observatory	Georg Schwefer et.al.	2509.03189	null
2025-09-03	The slow evolution of dark matter halos from cusp to core naturally produces extended stellar core-like distributions	Jorge Sanchez Almeida et.al.	2509.03167	null
2025-09-03	Temporally-Aware Diffusion Model for Brain Progression Modelling with Bidirectional Temporal Regularisation	Mattia Litrico et.al.	2509.03141	null
2025-09-03	RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation	Sashuai Zhou et.al.	2509.03131	null
2025-09-03	On the Smart Coordination of Flexibility Scheduling in Multi-carrier Integrated Energy Systems	Christian Doh Dinga et.al.	2509.03126	null
2025-09-03	Towards Realistic Hand-Object Interaction with Gravity-Field Based Diffusion Bridge	Miao Xu et.al.	2509.03114	null
2025-09-03	Bounded imaginary powers of generalized diffusion operators	Alexandre Thorel et.al.	2509.03105	null
2025-09-03	Collision operator for electron runaway in cold weakly-ionized plasmas	Yeongsun Lee et.al.	2509.03092	null
2025-09-03	Diffusive shock acceleration: non-classical model of cosmic ray transport	A. A. Lagutin et.al.	2509.03091	null
2025-09-03	High Cursive Complex Character Recognition using GAN External Classifier	S M Rafiuddin et.al.	2509.03062	null
2025-09-03	DCDB: Dynamic Conditional Dual Diffusion Bridge for Ill-posed Multi-Tasks	Chengjie Huang et.al.	2509.03044	null
2025-09-03	Boundary layer effects induced by the fluid in a chemotaxis-Navier-Stokes system	Qianqian Hou et.al.	2509.03028	null
2025-09-03	Enhancing Robustness in Post-Processing Watermarking: An Ensemble Attack Network Using CNNs and Transformers	Tzuhsuan Huang et.al.	2509.03006	null
2025-09-03	DUViN: Diffusion-Based Underwater Visual Navigation via Knowledge-Transferred Depth Features	Jinghe Yang et.al.	2509.02983	null
2025-09-03	InstaDA: Augmenting Instance Segmentation Data with Dual-Agent System	Xianbao Hou et.al.	2509.02973	null
2025-09-03	Non-Linear and Meta-Stable Dynamics in Financial Markets: Evidence from High Frequency Crypto Currency Market Makers	Igor Halperin et.al.	2509.02941	null
2025-09-03	The Role of Far-side Magnetic Structures in Modeling 2024 Solar Eclipse	Guanglu Shi et.al.	2509.02911	null
2025-09-02	The Space Coronagraph Optical Bench (SCoOB): 8. end-to-end numerical modeling of the testbed to estimate the contrast limits	Ramya M Anche et.al.	2509.02887	null
2025-09-02	Fluid Model of Schrodinger equation and derivation of the quantum potential	Lachezar Simeonov et.al.	2509.02868	null
2025-09-02	Predicting Movie Success with Multi-Task Learning: A Hybrid Framework Combining GPT-Based Sentiment Analysis and SIR Propagation	Wenlan Xie et.al.	2509.02809	null
2025-09-02	DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off	Jusheng Zhang et.al.	2509.02785	null
2025-09-02	Synthetic generation of online social networks through homophily	Alejandro Buitrago López et.al.	2509.02762	null
2025-09-02	Spacetime Wavelet Method for Linear Boundary-Value Problems in Sylvester Matrix Equation Form	Cody D. Cochran et.al.	2509.02720	null
2025-09-02	Ultrafast anisotropic exciton transport in phosphorene	Kai-Wei Chang et.al.	2509.02682	null
2025-09-02	Explosive Dispersal Outflows as a New Class of Fermi Gamma-Ray Sources: The Case of DR21	Paarmita Pandey et.al.	2509.02679	null
2025-09-02	Double-faced white dwarfs and the magnetic inhibition of convection	Sivan Ginzburg et.al.	2509.02671	null
2025-09-02	Is RL fine-tuning harder than regression? A PDE learning approach for diffusion models	Wenlong Mou et.al.	2509.02528	null
2025-09-02	Unifi3D: A Study on 3D Representations for Generation and Reconstruction in a Common Framework	Nina Wiedemann et.al.	2509.02474	null
2025-09-02	TeRA: Rethinking Text-guided Realistic 3D Avatar Generation	Yanwen Wang et.al.	2509.02466	null
2025-09-02	Fractional differential equations: non-constant coefficients, simulation and model reduction	Ruben Aylwin et.al.	2509.02465	null
2025-09-02	GenCompositor: Generative Video Compositing with Diffusion Transformer	Shuzhou Yang et.al.	2509.02460	null
2025-09-02	Quantitative positivity of transition densities for random perturbations of Hamiltonian systems	Shimaa Elesaely et.al.	2509.02448	null
2025-09-02	Kelvin-Helmholtz instability in binary fluids with miscibility gap	Anubhav Dubey et.al.	2509.02400	null
2025-09-02	Revisiting the diffusion equation derivation in Persson’s theory of contact	Yang Xu et.al.	2509.02397	null
2025-09-02	Widely non-degenerate nonlinear frequency conversion in cryogenic titanium in-diffused lithium niobate waveguides	Nina Amelie Lange et.al.	2509.02392	null
2025-09-02	Category-Aware 3D Object Composition with Disentangled Texture and Shape Multi-view Diffusion	Zeren Xiong et.al.	2509.02357	null
2025-09-02	A recursive formula for the $n^\text{th}$ survival function and the $n^\text{th}$ first passage time distribution for jump and diffusion processes. Applications to the pricing of $n^\text{th}$ -to-default CDS	Alessio Lapolla et.al.	2509.02347	null
2025-09-02	Multi-stage PDE-based image processing techniques for noisy MRI scans	Ksenia Slepova et.al.	2509.02342	null
2025-09-02	RDIT: Residual-based Diffusion Implicit Models for Probabilistic Time Series Forecasting	Chih-Yu Lai et.al.	2509.02341	null
2025-09-02	Distribution estimation via Flow Matching with Lipschitz guarantees	Lea Kunkel et.al.	2509.02337	null
2025-09-02	Exploring Diffusion Models for Generative Forecasting of Financial Charts	Taegyeong Lee et.al.	2509.02308	null
2025-09-02	Data-Driven Loss Functions for Inference-Time Optimization in Text-to-Image Generation	Sapir Esther Yiflach et.al.	2509.02295	null
2025-09-03	Sem-RaDiff: Diffusion-Based 3D Radar Semantic Perception in Cluttered Agricultural Environments	Ruibin Zhang et.al.	2509.02283	null
2025-09-02	Think2Sing: Orchestrating Structured Motion Subtitles for Singing-Driven 3D Head Animation	Zikai Huang et.al.	2509.02278	null
2025-09-02	Ergodicity of conditional McKean-Vlasov jump diffusions	Jianhai Bao et.al.	2509.02249	null
2025-09-02	Spectrogram Patch Codec: A 2D Block-Quantized VQ-VAE and HiFi-GAN for Neural Speech Coding	Luis Felipe Chary et.al.	2509.02244	null
2025-09-02	Improving atomic force microscopy structure discovery via style-translation	Jie Huang et.al.	2509.02240	null
2025-09-02	Mechanical performance of hybrid polymer-lipid vesicles with leaflet asymmetry engineered using microfluidics	Yuting Huang et.al.	2509.02194	null
2025-09-02	Enhancing Zero-Shot Pedestrian Attribute Recognition with Synthetic Data Generation: A Comparative Study with Image-To-Image Diffusion Models	Pablo Ayuso-Albizu et.al.	2509.02161	null
2025-09-02	Nuclear fusion plasma fuelling with ice pellets using a neuromorphic controller	L. L. T. C. Jansen et.al.	2509.02147	null
2025-09-02	Differentiable Expectation-Maximisation and Applications to Gaussian Mixture Model Optimal Transport	Samuel Boïté et.al.	2509.02109	null
2025-09-02	GeoLayer: Towards Low-Latency and Cost-Efficient Geo-Distributed Graph Stores with Layered Graph	Feng Yao et.al.	2509.02106	null
2025-09-02	A Data-Centric Approach to Pedestrian Attribute Recognition: Synthetic Augmentation via Prompt-driven Diffusion Models	Alejandro Alonso et.al.	2509.02099	null
2025-09-02	Environment-Aware Channel Measurement and Modeling for Terahertz Monostatic Sensing	Yejian Lyu et.al.	2509.02088	null
2025-09-02	Superexponential dissipation enhancement on $\mathbb{T}^d$	Keefer Rowan et.al.	2509.02081	null
2025-09-02	Data-Dependent Smoothing for Protein Discovery with Walk-Jump Sampling	Srinivas Anumasa et.al.	2509.02069	null
2025-09-02	Measuring metal sulfides in interstellar dust with PRIMA	Izaskun Jiménez-Serra et.al.	2509.02067	null
2025-09-02	Enhanced Raman scattering by fast GaN phonon-polaritons	Mayssoune Mina et.al.	2509.02057	null
2025-09-02	Palette Aligned Image Diffusion	Elad Aharoni et.al.	2509.02000	null
2025-09-02	Draw-In-Mind: Learning Precise Image Editing via Chain-of-Thought Imagination	Ziyun Zeng et.al.	2509.01986	null
2025-09-03	Discrete Noise Inversion for Next-scale Autoregressive Text-based Image Editing	Quan Dao et.al.	2509.01984	null
2025-09-02	Nonmonotonic change with energy of the mean logarithmic mass of cosmic rays in the knee region: the mechanism of formation of this feature and sources of particles	A. A. Lagutin et.al.	2509.01974	null
2025-09-02	Efficient Bayesian Sampling with Langevin Birth-Death Dynamics	Alex Leviyev et.al.	2509.01942	null
2025-09-02	A Diffusion-Based Framework for Configurable and Realistic Multi-Storage Trace Generation	Seohyun Kim et.al.	2509.01919	null
2025-09-02	DroneSR: Rethinking Few-shot Thermal Image Super-Resolution from Drone-based Perspective	Zhipeng Weng et.al.	2509.01898	null
2025-09-02	Far-infrared probing with PRIMA into particle acceleration associated with relativistic jets from active galactic nuclei	Naoki Isobe et.al.	2509.01876	null
2025-09-04	RadioDiff-Loc: Diffusion Model Enhanced Scattering Congnition for NLoS Localization with Sparse Radio Map Estimation	Xiucheng Wang et.al.	2509.01875	null
2025-09-02	Latent Gene Diffusion for Spatial Transcriptomics Completion	Paula Cárdenas et.al.	2509.01864	null
2025-09-02	Does the high-energy AMS-02 positron flux originate from the dark matter density spikes around nearby black holes?	Man Ho Chan et.al.	2509.01860	null
2025-09-01	PractiLight: Practical Light Control Using Foundational Diffusion Models	Yotam Erel et.al.	2509.01837	null
2025-09-01	ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training	Ge Yan et.al.	2509.01819	null
2025-09-03	Intermittent localization and fast spatial learning by non-Markov random walks with decaying memory	Paulina R. Martín-Cornejo et.al.	2509.01806	null
2025-09-01	Mapping Magnetic Fields from Clouds to Cores with PRIMAger	Kate Pattle et.al.	2509.01796	null
2025-09-01	High-Performance Trajectory Tracking MPC for Quadcopters with Coupled Time-Varying Constraints and Stability Proofs	Maedeh Izadi et.al.	2509.01767	null
2025-09-01	Clinical Metadata Guided Limited-Angle CT Image Reconstruction	Yu Shi et.al.	2509.01752	null
2025-09-01	Controllable Generation of Implied Volatility Surfaces with Variational Autoencoders	Jing Wang et.al.	2509.01743	null
2025-09-01	Quadratic Growth Model with Discontinuity: A Link between Monostable and Bistable Traveling Waves	Wonhyung Choi et.al.	2509.01715	null
2025-09-01	The PRIMA promise of deciphering interstellar dust evolution with observations of the nearby Universe	Frédéric Galliano et.al.	2509.01692	null
2025-09-01	The Impact of Baryonic Effects on the Dynamical Masses Inferred Using Satellite Kinematics	Josephine F. W. Baggen et.al.	2509.01690	null
2025-09-01	Preconditioned Regularized Wasserstein Proximal Sampling	Hong Ye Tan et.al.	2509.01685	null
2025-09-01	Efficient Transformer-Inspired Variants of Physics-Informed Deep Operator Networks	Zhi-Feng Wei et.al.	2509.01679	null
2025-09-01	Investigating the role of magnetic fields in the formation and evolution of striations in interstellar clouds with PRIMA	Raphael Skalidis et.al.	2509.01678	null
2025-09-03	Identity-Preserving Text-to-Video Generation via Training-Free Prompt, Image, and Guidance Enhancement	Jiayi Gao et.al.	2509.01362	null
2025-08-29	Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair and Private Data Generation	Tobias Hyrup et.al.	2508.21815	null
2025-08-29	Tree-Guided Diffusion Planner	Hyeonseong Jeon et.al.	2508.21800	null
2025-08-29	OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization	Jiazheng Xing et.al.	2508.21727	null
2025-08-29	FLORA: Efficient Synthetic Data Generation for Object Detection in Low-Data Regimes via finetuning Flux LoRA	Alvaro Patricio et.al.	2508.21712	null
2025-09-01	Infinite-Dimensional Stochastic Differential Equations and Diffusion Dynamics of Coulomb Random Point Fields	Hirofumi Osada et.al.	2508.21658	null
2025-08-29	Deciphering the gamma-ray emission in the Cygnus region	L. Haerer et.al.	2508.21644	null
2025-08-29	Conforming and discontinuous discretizations of non-isothermal Darcy-Forchheimer flows	Stefano Bonetti et.al.	2508.21630	null
2025-09-02	Approximate calculation of multidimensional first passage times	James F. Lutsko et.al.	2508.21607	null
2025-08-29	Condense to Conduct and Conduct to Condense	Tomasz Kazana et.al.	2508.21602	null
2025-08-29	Fluid dynamics of charm quarks from heavy to light-ion collisions	Federica Capellino et.al.	2508.21600	null
2025-08-29	OASIS: Harnessing Diffusion Adversarial Network for Ocean Salinity Imputation using Sparse Drifter Trajectories	Bo Li et.al.	2508.21570	null
2025-08-29	ECHO: Ego-Centric modeling of Human-Object interactions	Ilya A. Petrov et.al.	2508.21556	null
2025-08-29	Complete Gaussian Splats from a Single Image with Denoising Diffusion Models	Ziwei Liao et.al.	2508.21542	null
2025-08-29	Molecular Beam Epitaxy of 2H-TaS $_2$ few-layers on GaN(0001)	Constantin Hilbrunner et.al.	2508.21537	null
2025-08-29	Adaptive generative moment matching networks for improved learning of dependence structures	Marius Hofert et.al.	2508.21531	null
2025-08-29	Few-Shot Neuro-Symbolic Imitation Learning for Long-Horizon Planning and Acting	Pierrick Lorang et.al.	2508.21501	null
2025-08-29	Controllable 3D Molecular Generation for Structure-Based Drug Design Through Bayesian Flow Networks and Gradient Integration	Seungyeon Choi et.al.	2508.21468	null
2025-08-29	Diffusion-based Multi-modal Synergy Interest Network for Click-through Rate Prediction	Xiaoxi Cui et.al.	2508.21460	null
2025-09-01	Contrarian Motives in Social Learning: Information Cascades with Nonconformist Preferences	Georgy Lukyanov et.al.	2508.21446	null
2025-08-29	Quantum enhanced ensemble GANs for anomaly detection in continuous biomanufacturing	Rajiv Kailasanathan et.al.	2508.21438	null
2025-08-29	MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation	Francisco Caetano et.al.	2508.21435	null
2025-08-29	Global Hot Gas Excess in (U)LIRGs: Replicating Galactic Nuclei Scaling Relations between Diffuse X-ray Emission and Star Formation on Galaxy-Wide Scales	Chunyi Zhang et.al.	2508.21401	null
2025-08-29	Dynamics-Compliant Trajectory Diffusion for Super-Nominal Payload Manipulation	Anuj Pasricha et.al.	2508.21375	null
2025-08-29	Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image	Qingran Miao et.al.	2508.21371	null
2025-08-29	Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning	Yuquan Bi et.al.	2508.21363	null
2025-08-29	QUAV: Quantum-Assisted Path Planning and Optimization for UAV Navigation with Obstacle Avoidance	Nouhaila Innan et.al.	2508.21361	null
2025-08-29	DLGAN : Time Series Synthesis Based on Dual-Layer Generative Adversarial Networks	Xuan Hou et.al.	2508.21340	null
2025-08-29	Quantum Monte Carlo Benchmarking of Molecular Adsorption on Graphene-Supported Single Pt Atom	Jeonghwan Ahn et.al.	2508.21339	null
2025-08-29	Stage-Diff: Stage-wise Long-Term Time Series Generation Based on Diffusion Models	Xuan Hou et.al.	2508.21330	null
2025-08-28	PHD: Personalized 3D Human Body Fitting with Point Diffusion	Hsuan-I Ho et.al.	2508.21257	null
2025-08-28	Weighted Support Points from Random Measures: An Interpretable Alternative for Generative Modeling	Peiqi Zhao et.al.	2508.21255	null
2025-08-28	Reverse Imaging for Wide-spectrum Generalization of Cardiac MRI Segmentation	Yidong Zhao et.al.	2508.21254	null
2025-08-28	Mutual Information Rate – Linear Noise Approximation and Exact Computation	Manuel Reinhardt et.al.	2508.21220	null
2025-08-28	WaveLLDM: Design and Development of a Lightweight Latent Diffusion Model for Speech Enhancement and Restoration	Kevin Putra Santoso et.al.	2508.21153	null
2025-08-28	Propagation in the Fisher-KPP equation with Mixed Operator	Begoña Barrios et.al.	2508.21151	null
2025-08-28	The COLIBRE project: cosmological hydrodynamical simulations of galaxy formation and evolution	Joop Schaye et.al.	2508.21126	null
2025-08-28	Safe-Control: A Safety Patch for Mitigating Unsafe Content in Text-to-Image Generation Models	Xiangtao Meng et.al.	2508.21099	null
2025-08-28	TrInk: Ink Generation with Transformer Network	Zezhong Jin et.al.	2508.21098	null
2025-08-28	First-Place Solution to NeurIPS 2024 Invisible Watermark Removal Challenge	Fahad Shamshad et.al.	2508.21072	null
2025-08-28	Dress&Dance: Dress up and Dance as You Like It - Technical Preview	Jun-Kun Chen et.al.	2508.21070	null
2025-08-28	OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning	Yuan Gong et.al.	2508.21066	null
2025-08-28	Mixture of Contexts for Long Video Generation	Shengqu Cai et.al.	2508.21058	null
2025-08-28	HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning	Zhi Su et.al.	2508.21043	null
2025-08-28	FW-GAN: Frequency-Driven Handwriting Synthesis with Wave-Modulated MLP Generator	Huynh Tong Dang Khoa et.al.	2508.21040	null
2025-08-28	Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets	Dale Decatur et.al.	2508.21032	null
2025-08-28	System size and event shape dependence of particle-identified balance functions in proton-proton collisions at $\sqrt{s}=13$ TeV	Subash Chandra Behera et.al.	2508.21030	null
2025-08-28	POSE: Phased One-Step Adversarial Equilibrium for Video Diffusion Models	Jiaxiang Cheng et.al.	2508.21019	null
2025-08-28	Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance	Luozhijie Jin et.al.	2508.21016	null
2025-08-28	Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees	Yaniv Hassidof et.al.	2508.21001	null
2025-08-28	RANGAN: GAN-empowered Anomaly Detection in 5G Cloud RAN	Douglas Liao et.al.	2508.20985	null
2025-08-28	Random attractors and nonergodic attractors for diffusions with degeneracies	Yuri Bakhtin et.al.	2508.20968	null
2025-08-28	Very high-energy gamma-ray and neutrino emission from hadronic interaction in compact binary millisecond pulsars	Vittoria Vecchiotti et.al.	2508.20952	null
2025-08-28	Lattice Random Walk Discretisations of Stochastic Differential Equations	Samuel Duffield et.al.	2508.20883	null
2025-08-28	Understanding and evaluating computer vision models through the lens of counterfactuals	Pushkar Shukla et.al.	2508.20881	null
2025-08-28	Leveraging Discriminative Latent Representations for Conditioning GAN-Based Speech Enhancement	Shrishti Saha Shetu et.al.	2508.20859	null
2025-08-28	Uniform error analysis of a rectangular Morley finite element method on a Shishkin mesh for a 4th-order singularly perturbed boundary value problem	Xiangyun Meng et.al.	2508.20857	null
2025-08-28	Learning Primitive Embodied World Models: Towards Scalable Robotic Learning	Qiao Sun et.al.	2508.20840	null
2025-08-28	High-Resolution Atomic Magnetometer-Based Imaging of Integrated Circuits and Batteries	Dominic Hunter et.al.	2508.20834	null
2025-08-28	Distinct Spatiotemporal Dynamics of Thermoelectric Transport Across Superconducting Transition	Rajae Malek et.al.	2508.20792	null
2025-08-28	Prediction of sulphate hazes in the lower Venus atmosphere	Peter Woitke et.al.	2508.20790	null
2025-08-28	Evaluating Compositional Generalisation in VLMs and Diffusion Models	Beth Pearson et.al.	2508.20783	null
2025-08-28	Unleashing Uncertainty: Efficient Machine Unlearning for Generative AI	Christoforos N. Spartalis et.al.	2508.20773	null
2025-08-28	Anomalous diffusion and run-and-tumble motion of a chemotactic particle in low dimensions	Jacopo Romano et.al.	2508.20756	null
2025-08-28	Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning	Yibin Wang et.al.	2508.20751	null
2025-08-29	A two-state generalisation of the strong collision model	Ola Kenji Forslund et.al.	2508.20727	null
2025-08-28	EEGDM: Learning EEG Representation with Latent Diffusion Model	Shaocong Wang et.al.	2508.20705	null
2025-08-28	Agent-based model of information diffusion in the limit order book trading	Mateusz Wilinski et.al.	2508.20672	null
2025-08-28	“Humor, Art, or Misinformation?”: A Multimodal Dataset for Intent-Aware Synthetic Image Detection	Anastasios Skoularikis et.al.	2508.20670	null
2025-08-28	Amadeus: Autoregressive Model with Bidirectional Attribute Modelling for Symbolic Music	Hongju Su et.al.	2508.20665	null
2025-08-28	VarDiU: A Variational Diffusive Upper Bound for One-Step Diffusion Distillation	Leyang Wang et.al.	2508.20646	null
2025-08-28	CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models	Ayan Banerjee et.al.	2508.20640	null
2025-08-28	EmoCAST: Emotional Talking Portrait via Emotive Text Description	Yiguo Jiang et.al.	2508.20615	null
2025-08-28	Revisiting the Privacy Risks of Split Inference: A GAN-Based Data Reconstruction Attack via Progressive Feature Optimization	Yixiang Qiu et.al.	2508.20613	null
2025-08-28	Physics Informed Generative Models for Magnetic Field Images	Aye Phyu Phyu Aung et.al.	2508.20612	null
2025-08-28	GENRE-CMR: Generalizable Deep Learning for Diverse Multi-Domain Cardiac MRI Reconstruction	Kian Anvari Hamedani et.al.	2508.20600	null
2025-08-28	Disruptive Attacks on Face Swapping via Low-Frequency Perceptual Perturbations	Mengxiao Huang et.al.	2508.20595	null
2025-08-28	FastFit: Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models	Zheng Chong et.al.	2508.20586	null
2025-08-28	Persode: Personalized Visual Journaling with Episodic Memory-Aware AI Agent	Seokho Jin et.al.	2508.20585	null
2025-08-28	SimShear: Sim-to-Real Shear-based Tactile Servoing	Kipp McAdam Freud et.al.	2508.20561	null
2025-08-28	Equilibria of aggregation-diffusion models with nonlinear potentials	Francesco Bozzola et.al.	2508.20523	null
2025-08-28	Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent	En Ci et.al.	2508.20505	null
2025-08-28	Run-and-tumble particle with diffusion: boundary local times and the zero-diffusion limit	Paul C Bressloff et.al.	2508.20473	null
2025-08-28	Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation	Jiusi Li et.al.	2508.20471	null
2025-08-28	Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models	Desen Sun et.al.	2508.20424	null
2025-09-01	AWorld: Orchestrating the Training Recipe for Agentic AI	Chengyue Yu et.al.	2508.20404	null
2025-08-28	Mean Field Game with Reflected Jump Diffusion Dynamics: A Linear Programming Approach	Zongxia Liang et.al.	2508.20388	null
2025-08-28	Do triangles matter? Replicating hypergraph disease dynamics with lower-order interactions	Eugene Tan et.al.	2508.20380	null
2025-08-28	Audio-Guided Visual Editing with Complex Multi-Modal Prompts	Hyeonyu Kim et.al.	2508.20379	null
2025-08-28	Numerical Method for Space-Time Fractional Diffusion: A Stochastic Approach	Tengteng Cui et.al.	2508.20361	null
2025-08-28	Artificial neural network solver for Fokker-Planck and Koopman eigenfunctions	Max Kreider et.al.	2508.20339	null
2025-08-27	Score-Based Diffusion Models in Infinite Dimensions: A Malliavin Calculus Perspective	Ehsan Mirafzali et.al.	2508.20316	null
2025-08-27	Efficient ion re-acceleration in laboratory-produced interpenetrating collisionless shocks	W. Yao et.al.	2508.20303	null
2025-08-27	Out-of-time-order correlators bridge classical transport and quantum dynamics	Sophia N. Fricke et.al.	2508.20235	null
2025-08-27	Velocity Spectrum Imaging using velocity encoding preparation pulses	Luis Hernandez-Garcia et.al.	2508.20218	null
2025-08-27	InfinityHuman: Towards Long-Term Audio-Driven Human	Xiaodi Li et.al.	2508.20210	null
2025-08-27	The structure of the giant radio fossil in the Ophiuchus galaxy cluster	Simona Giacintucci et.al.	2508.20190	null
2025-08-27	SDiFL: Stable Diffusion-Driven Framework for Image Forgery Localization	Yang Su et.al.	2508.20182	null
2025-08-27	Nonlinear diffusion in relativistic kinetic theory	Simone Calogero et.al.	2508.20147	null
2025-08-27	MicroLad: 2D-to-3D Microstructure Reconstruction and Generation via Latent Diffusion and Score Distillation	Kang-Hyun Lee et.al.	2508.20138	null
2025-08-27	Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning	Jinhao Liang et.al.	2508.20095	null
2025-08-27	AudioStory: Generating Long-Form Narrative Audio with Large Language Models	Yuxin Guo et.al.	2508.20088	null
2025-08-27	Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies	Zhixuan Liang et.al.	2508.20072	null
2025-08-27	A unique solution to overcome the barriers to planetesimal formation at low dust-to-gas ratio	H. Meheut et.al.	2508.20070	null
2025-08-27	Neural Conditional Simulation for Complex Spatial Processes	Julia Walchessen et.al.	2508.20067	null
2025-08-27	Joint Analysis of HI Absorption Zeeman Measurements and the Morphology of Filamentary HI Emission	Marta Nowotka et.al.	2508.20065	null
2025-08-27	Wave coarsening drives time crystallization in active solids	Jonas Veenstra et.al.	2508.20052	null
2025-08-27	GS: Generative Segmentation via Label Diffusion	Yuhao Chen et.al.	2508.20020	null
2025-08-27	Diffusion Language Models Know the Answer Before Decoding	Pengxiang Li et.al.	2508.19982	null
2025-08-27	The Information Dynamics of Generative Diffusion	Luca Ambrogioni et.al.	2508.19897	null
2025-08-27	Quantum latent distributions in deep generative models	Omar Bacarreza et.al.	2508.19857	null
2025-08-28	Ego-centric Predictive Model Conditioned on Hand Trajectories	Binjie Zhang et.al.	2508.19852	null
2025-08-27	Physics-Informed DeepONet Coupled with FEM for Convective Transport in Porous Media with Sharp Gaussian Sources	Erdi Kara et.al.	2508.19847	null
2025-08-27	Exotic rheology of materials with active rearrangements	Aondoyima Ioratim-Uba et.al.	2508.19844	null
2025-08-27	Not Every Gift Comes in Gold Paper or with a Red Ribbon: Exploring Color Perception in Text-to-Image Models	Shay Shomer Chai et.al.	2508.19791	null
2025-08-27	StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation	Xiuchao Wu et.al.	2508.19789	null
2025-08-27	Fast 3D Diffusion for Scalable Granular Media Synthesis	Muhammad Moeeze Hassan et.al.	2508.19752	null
2025-08-27	Fractal Flow: Hierarchical and Interpretable Normalizing Flow via Topic Modeling and Recursive Strategy	Binhui Zhang et.al.	2508.19750	null
2025-08-27	MC for Gastroretentive Drug Delivery	Sebastian Lotter et.al.	2508.19739	null
2025-08-27	Synthetic Image Detection via Spectral Gaps of QC-RBIM Nishimori Bethe-Hessian Operators	V. S. Usatyuk et.al.	2508.19698	null
2025-08-27	MnBr $_2$ on the graphene on Ir(110) substrate: growth, structure, and super-moiré	Affan Safeer et.al.	2508.19694	null
2025-08-27	Atomistic insights into hydrogen migration in IGZO from machine-learning interatomic potential: linking atomic diffusion to device performance	Hyunsung Cho et.al.	2508.19674	null
2025-08-27	Multi-value Probabilistic Computing with current-controlled Skyrmion Diffusion	Thomas B. Winkler et.al.	2508.19623	null
2025-08-27	IELDG: Suppressing Domain-Specific Noise with Inverse Evolution Layers for Domain Generalized Semantic Segmentation	Qizhe Fan et.al.	2508.19604	null
2025-08-27	Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction	Dat Nguyen Cong et.al.	2508.19581	null
2025-08-28	Interact-Custom: Customized Human Object Interaction Image Generation	Zhu Xu et.al.	2508.19575	null
2025-08-27	Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era	Dawei Li et.al.	2508.19570	null
2025-08-27	MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery	Yu-Wei Zhang et.al.	2508.19555	null
2025-08-27	Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding	Bowen Sun et.al.	2508.19529	null
2025-08-27	MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment	Zhiting Gao et.al.	2508.19527	null
2025-08-27	Functionally-graded drug delivery systems with binding reactions: analytical and stochastic approaches for the fraction of drug released	Obi A. Carwood et.al.	2508.19510	null
2025-08-27	DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View	Tian Qiu et.al.	2508.19508	null
2025-08-27	Sat2Flow: A Structure-Aware Diffusion Framework for Human Flow Generation from Satellite Imagery	Xiangxu Wang et.al.	2508.19499	null
2025-08-27	Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks	Muhammad Ahmed Mohsin et.al.	2508.19495	null
2025-08-26	MRExtrap: Longitudinal Aging of Brain MRIs using Linear Modeling in Latent Space	Jaivardhan Kapoor et.al.	2508.19482	null
2025-08-26	Shining light on degeneracies and uncertainties in quantifying both exchange and restriction with time-dependent diffusion MRI using Bayesian inference	Maëliss Jallais et.al.	2508.19478	null
2025-08-26	Hydrodynamic Limit of the Symmetric Zero-Range Process with Slow Boundary	Oslenne Araújo et.al.	2508.19447	null
2025-08-26	On Surjectivity of Neural Networks: Can you elicit any behavior from your model?	Haozhe Jiang et.al.	2508.19445	null
2025-08-26	Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization	Paimon Goulart et.al.	2508.19443	null
2025-08-26	Quantification of mobile ions in perovskite solar cells with thermally activated ion current measurements	Moritz C. Schmidt et.al.	2508.19403	null
2025-08-26	DETNO: A Diffusion-Enhanced Transformer Neural Operator for Long-Term Traffic Forecasting	Owais Ahmad et.al.	2508.19389	null
2025-08-26	Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in multimodal LLMs	Supratik Sarkar et.al.	2508.19366	null
2025-08-28	MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation	Ming Chen et.al.	2508.19320	null
2025-08-26	Disorder-induced proximate quantum spin ice phase in Pr $_2$Sn$_2$O$_7$	Yi Luo et.al.	2508.19248	null
2025-08-26	Articulate3D: Zero-Shot Text-Driven 3D Object Posing	Oishi Deb et.al.	2508.19244	null
2025-08-26	MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation	Hao Shi et.al.	2508.19236	null
2025-08-26	VibeVoice Technical Report	Zhiliang Peng et.al.	2508.19205	null
2025-08-26	LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding	Julian Ost et.al.	2508.19204	null
2025-08-26	Planning-Query-Guided Model Generation for Model-Based Deformable Object Manipulation	Alex LaGrassa et.al.	2508.19199	null
2025-08-26	All-in-One Slider for Attribute Manipulation in Diffusion Models	Weixin Ye et.al.	2508.19195	null
2025-08-26	MDD: a Mask Diffusion Detector to Protect Speaker Verification Systems from Adversarial Perturbations	Yibo Bai et.al.	2508.19180	null
2025-08-26	Stoch-IDENT: New Method and Mathematical Analysis for Identifying SPDEs from Data	Jianbo Cui et.al.	2508.19177	null
2025-08-26	RDDM: Practicing RAW Domain Diffusion Model for Real-world Image Restoration	Yan Chen et.al.	2508.19154	null
2025-08-26	Saddle Hierarchy in Dense Associative Memory	Robin Thériault et.al.	2508.19151	null
2025-08-26	Alloyed cementite (Fe-Ni-Cr) $_3$ C: structure and hyperfine field from DFT calculations and experimental comparison	Lyudmila V. Dobysheva et.al.	2508.19148	null
2025-08-26	Lattice vacancy migration barriers in Fe-Ni alloys, and why Ni atoms diffuse slowly: An ab initio study	Adam M. Fisher et.al.	2508.19124	null
2025-08-26	Composition and Alignment of Diffusion Models using Constrained Learning	Shervin Khalafi et.al.	2508.19104	null
2025-08-26	Evaluation of in vitro antibacterial activity and phytochemical profile of aqueous leaf extract of Asystasia variabilis	R Wijerathna et.al.	2508.19049	null
2025-08-26	In-vitro Anti-bacterial Activity of Methanol and Aqueous Crude Extracts of Horsfieldia iryaghedhi	RMHKK Rajapaksha et.al.	2508.19025	null
2025-08-28	STDiff: A State Transition Diffusion Framework for Time Series Imputation in Industrial Systems	Gary Simethy et.al.	2508.19011	null
2025-08-26	Detection of Diffuse Radio Emission inside the Supernova Remnant G338.3-0.0 associated with the Gamma-ray Source HESS J1640-465	Moaz Abdelmaguid et.al.	2508.18999	null
2025-08-26	Krylov-Veretennikov desomposition for measure-valued processes induced by SDEs with interaction on Riemannian manifolds	Andrey Dorogovtsev et.al.	2508.18995	null
2025-08-26	Junctional-Fluctuation-Mediated Fluidisation of Multi-Phase Field Epithelial Monolayers	James N. Graham et.al.	2508.18987	null
2025-08-26	Vanishing Angular Viscosity Limit For Micropolar Fluid Model In $\mathbb{R}_+^2$ : Boundary Layer And Optimal Convergence Rate	Yinghui Wang et.al.	2508.18980	null
2025-08-26	Linear approximations of large deviations: Cubic diffusion test	Pelerine Tsobgni Nyawo et.al.	2508.18977	null
2025-08-26	Generative AI in Map-Making: A Technical Exploration and Its Implications for Cartographers	Claudio Affolter et.al.	2508.18959	null
2025-08-26	Energy-Based Flow Matching for Generating 3D Molecular Structure	Wenyin Zhou et.al.	2508.18949	null
2025-08-26	Stochastic Forces Enhance Tracer Diffusion in Non-motile Active Matter	Henry Alston et.al.	2508.18882	null
2025-08-26	Experimental investigation of turbulence and turbulent thermal diffusion in strongly inhomogeneous and anisotropic forced convection	E. Zarbib et.al.	2508.18865	null
2025-08-26	Super and Weak Poincaré Inequalities for Sticky-Reflected Diffusion Processes	Feng-Yu Wang et.al.	2508.18846	null
2025-08-26	Single-Photon Detection in Few-Layer NbSe $_2$ Superconducting Nanowires	Lucio Zugliani et.al.	2508.18843	null
2025-08-26	Quantum-Circuit-Based Visual Fractal Image Generation in Qiskit and Analytics	Hillol Biswas et.al.	2508.18835	null
2025-08-26	On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation	Adrian Meise et.al.	2508.18833	null
2025-08-26	Asymptotic limit of a vector-valued Allen-Cahn equation for phase transition dynamics	Huan Dong et.al.	2508.18754	null
2025-08-26	Joint Time-Position Statistics and Fisher Information in Drift-Diffusion Molecular Channels	Yun-Feng Lo et.al.	2508.18680	null
2025-08-26	ROSE: Remove Objects with Side Effects in Videos	Chenxuan Miao et.al.	2508.18633	null
2025-08-26	Wan-S2V: Audio-Driven Cinematic Video Generation	Xin Gao et.al.	2508.18621	null
2025-08-26	SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis	Xiaohao Sun et.al.	2508.18597	null
2025-08-26	Search for the radiative decay of the cosmic neutrino background through spectral measurements of the cosmic infrared background using PRIMA	Yuji Takeuchi et.al.	2508.18590	null
2025-08-25	Controllable Single-shot Animation Blending with Temporal Conditioning	Eleni Tselepi et.al.	2508.18525	null
2025-08-25	VQualA 2025 Challenge on Face Image Quality Assessment: Methods and Results	Sizhuo Ma et.al.	2508.18445	null
2025-08-25	Phase-Field Model of Freeze Casting	Kaihua Ji et.al.	2508.18416	null
2025-08-25	Hillas meets Eddington: the case for blazars as ultra-high-energy neutrino sources	Xavier Rodrigues et.al.	2508.18345	null
2025-08-25	ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models	Haitang Feng et.al.	2508.18271	null
2025-08-25	SafeBimanual: Diffusion-based Trajectory Optimization for Safe Bimanual Manipulation	Haoyuan Deng et.al.	2508.18268	null
2025-08-25	Diffusiophoretic corner flows	Dobromir Nowak et.al.	2508.18233	null
2025-08-25	Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance	Ayce Idil Aytekin et.al.	2508.18213	null
2025-08-25	New shell-model calculations of the $δ_C$ correction to superallowed $0^+\rightarrow0^+$ nuclear $β$ decay and standard-model implications	L. Xayavong et.al.	2508.18189	null
2025-08-25	SpotEdit: Evaluating Visually-Guided Image Editing Methods	Sara Ghazanfari et.al.	2508.18159	null
2025-08-25	Learning from Few Samples: A Novel Approach for High-Quality Malcode Generation	Haijian Ma et.al.	2508.18148	null
2025-08-25	Incorporating Pre-trained Diffusion Models in Solving the Schrödinger Bridge Problem	Zhicong Tang et.al.	2508.18095	null
2025-08-26	Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation	Yaqi Li et.al.	2508.18032	null
2025-08-25	HD 28471: a near-resonant compact multiplanet system with a possible cold giant planet	A. T. Stevenson et.al.	2508.18000	null
2025-08-26	Solute dispersion in axially strained tube flows: Large-time asymptotics and Ornstein-Uhlenbeck Gaussian profiles	Prabakaran Rajamanickam et.al.	2508.17982	null
2025-08-25	Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech	Dimme de Groot et.al.	2508.17980	null
2025-08-26	Generative Feature Imputing – A Technique for Error-resilient Semantic Communication	Jianhao Huang et.al.	2508.17957	null
2025-08-25	Nodal error behind discrepancies between coupled cluster and diffusion Monte Carlo: AcOH dimer case study	S. Lambie et.al.	2508.17937	null
2025-08-25	Parallel Nodal Interior-Penalty Discontinuous Galerkin Methods for the Subsonic Compressible Navier-Stokes Equations: Applications to Vortical Flows and VIV Problems	Spiros Zafeiris et.al.	2508.17917	null
2025-08-25	Quasi-likelihood inference for SDE with mixed-effects observed at high frequency	Maud Delattre et.al.	2508.17910	null
2025-08-25	Local Well-Posedness of the Cahn-Hilliard-Biot System	Helmut Abels et.al.	2508.17893	null
2025-08-27	Vocoder-Projected Feature Discriminator	Takuhiro Kaneko et.al.	2508.17874	null
2025-08-25	FasterVoiceGrad: Faster One-step Diffusion-Based Voice Conversion with Adversarial Diffusion Conversion Distillation	Takuhiro Kaneko et.al.	2508.17868	null
2025-08-25	Diffusion-Based Data Augmentation for Medical Image Segmentation	Maham Nazir et.al.	2508.17844	null
2025-08-25	Threshold Diffusions	Lina Ji et.al.	2508.17812	null
2025-08-25	CEIDM: A Controlled Entity and Interaction Diffusion Model for Enhanced Text-to-Image Generation	Mingyue Yang et.al.	2508.17760	null
2025-08-25	SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling	Fanjiang Ye et.al.	2508.17756	null
2025-08-25	DiffusionGS: Generative Search with Query Conditioned Diffusion in Kuaishou	Qinyao Li et.al.	2508.17754	null
2025-08-25	Few-shot Human Action Anomaly Detection via a Unified Contrastive Learning Framework	Koichiro Kamide et.al.	2508.17726	null
2025-08-25	Instant Preference Alignment for Text-to-Image Diffusion Models	Yang Li et.al.	2508.17718	null
2025-08-25	CATformer: Contrastive Adversarial Transformer for Image Super-Resolution	Qinyi Tian et.al.	2508.17708	null
2025-08-25	On the Edge of Memorization in Diffusion Models	Sam Buchanan et.al.	2508.17689	null
2025-08-25	Calculating the power spectrum in stochastic inflation by Monte Carlo simulation and least squares curve fitting	Koichi Miyamoto et.al.	2508.17654	null
2025-08-27	ControlEchoSynth: Boosting Ejection Fraction Estimation Models via Controlled Video Diffusion	Nima Kondori et.al.	2508.17631	null
2025-08-25	Effects of Near-Field Hydrodynamic Interactions on Bacterial Dynamics Near a Solid Surface	Baopi Liu et.al.	2508.17626	null
2025-08-25	Steering When Necessary: Flexible Steering Large Language Models with Backtracking	Jinwei Gan et.al.	2508.17621	null
2025-08-25	Preference Trajectory Modeling via Flow Matching for Sequential Recommendation	Li Li et.al.	2508.17618	null
2025-08-25	JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on	Aowen Wang et.al.	2508.17614	null
2025-08-25	HotSpotter - Patterned Species Instance Recognition	Jonathan P. Crall et.al.	2508.17605	null
2025-08-25	GWM: Towards Scalable Gaussian World Models for Robotic Manipulation	Guanxing Lu et.al.	2508.17600	null
2025-08-25	HERO: Hierarchical Extrapolation and Refresh for Efficient World Models	Quanjian Song et.al.	2508.17588	null
2025-08-24	Controllability of a system of non-autonomous degenerate coupled parabolic equations	Alfredo S. Gamboa et.al.	2508.17546	null
2025-08-24	Universal scaling of higher-order cumulants in quantum isotropic spin chains	Shixian Jiang et.al.	2508.17535	null
2025-08-24	Learning Reaction-Diffusion Kinetics from Mechanical Information	Royal C. Ihuaenyi et.al.	2508.17523	null
2025-08-24	Variational Shape Inference for Grasp Diffusion on SE(3)	S. Talha Bukhari et.al.	2508.17482	null
2025-08-24	T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation	Kaiyue Sun et.al.	2508.17472	null
2025-08-24	A Synthetic Dataset for Manometry Recognition in Robotic Applications	Pedro Antonio Rabelo Saraiva et.al.	2508.17468	null
2025-08-24	Bias Amplification in Stable Diffusion’s Representation of Stigma Through Skin Tones and Their Homogeneity	Kyra Wilson et.al.	2508.17465	null
2025-08-24	Disentangled Geometry and Appearance for Efficient Multi-View Surface Reconstruction and Rendering	Qitong Zhang et.al.	2508.17436	null
2025-08-24	An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing	Zihan Liang et.al.	2508.17435	null
2025-08-24	TinySR: Pruning Diffusion for Real-World Image Super-Resolution	Linwei Dong et.al.	2508.17434	null
2025-08-24	Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling	Haochen You et.al.	2508.17426	null
2025-08-24	Asteroid Rotation Periods: Statistical Analysis in the Diameter-Spin Distribution	Maryam Nastaran et.al.	2508.17415	null
2025-08-24	MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling	Haoyu Wang et.al.	2508.17404	null
2025-08-24	Stability and uniqueness of bounded weak solutions to triangular degenerate cross-diffusion systems	Xiuqing Chen et.al.	2508.17379	null
2025-08-24	ShaLa: Multimodal Shared Latent Space Modelling	Jiali Cui et.al.	2508.17376	null
2025-08-24	Condition Weaving Meets Expert Modulation: Towards Universal and Controllable Image Generation	Guoqing Zhang et.al.	2508.17364	null
2025-08-24	DiCache: Let Diffusion Model Determine Its Own Cache	Jiazi Bu et.al.	2508.17356	null
2025-08-24	ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation	Yuxuan Song et.al.	2508.17345	null
2025-08-24	Semantic Diffusion Posterior Sampling for Cardiac Ultrasound Dehazing	Tristan S. W. Stevens et.al.	2508.17326	null
2025-08-24	An improved nonlocal electron heat transport model for magnetized plasmas	Z. H. Chen et.al.	2508.17309	null
2025-08-24	PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing	Peilin Xiong et.al.	2508.17302	null
2025-08-24	FoundDiff: Foundational Diffusion Model for Generalizable Low-Dose CT Denoising	Zhihao Chen et.al.	2508.17299	null
2025-08-24	4D Visual Pre-training for Robot Learning	Chengkai Hou et.al.	2508.17230	null
2025-08-24	Multi-Metric Preference Alignment for Generative Speech Restoration	Junan Zhang et.al.	2508.17229	null
2025-08-24	Effects of Geometric configuration in relativistic isobaric collisions at $\sqrt{s_{NN}}=200$ GeV	Akash Das et.al.	2508.17227	null
2025-08-24	MMCIG: Multimodal Cover Image Generation for Text-only Documents and Its Dataset Construction via Pseudo-labeling	Hyeyeon Kim et.al.	2508.17199	null
2025-08-23	Generative AI for Multimedia Communication: Recent Advances, An Information-Theoretic Framework, and Future Opportunities	Yili Jin et.al.	2508.17163	null
2025-08-23	SyncGuard: Robust Audio Watermarking Capable of Countering Desynchronization Attacks	Zhenliang Gan et.al.	2508.17121	null
2025-08-23	CP4SBI: Local Conformal Calibration of Credible Sets in Simulation-Based Inference	Luben M. C. Cabezas et.al.	2508.17077	null
2025-08-23	LaGarNet: Goal-Conditioned Recurrent State-Space Models for Pick-and-Place Garment Flattening	Halid Abdulrahim Kadi et.al.	2508.17070	null
2025-08-23	SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation	Peng Hu et.al.	2508.17062	null
2025-08-23	PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models	Xianjing Cheng et.al.	2508.17050	null
2025-08-23	Styleclone: Face Stylization with Diffusion Based Data Augmentation	Neeraj Matiyali et.al.	2508.17045	null
2025-08-23	A Novel Local Focusing Mechanism for Deepfake Detection Generalization	Mingliang Li et.al.	2508.17029	null
2025-08-23	Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation	Konstantina Nikolaidou et.al.	2508.17017	null
2025-08-23	An improved lattice Boltzmann method with a novel conservative boundary scheme for viscoelastic fluid flows	Yuan Yu et.al.	2508.16997	null
2025-08-23	Score Matching on Large Geometric Graphs for Cosmology Generation	Diana-Alexandra Onutu et.al.	2508.16990	null
2025-08-23	HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature Caching	Liang Feng et.al.	2508.16984	null
2025-08-23	Shape optimization problems with random coefficients via the penalty method	Xiaowei Pang et.al.	2508.16961	null
2025-08-23	RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze	Ruicheng Zhang et.al.	2508.16956	null
2025-08-23	Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model	Fan Ding et.al.	2508.16947	null
2025-08-23	Sig-DEG for Distillation: Making Diffusion Models Faster and Lighter	Lei Jiang et.al.	2508.16939	null
2025-08-23	HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation	Sizhe Shan et.al.	2508.16930	null
2025-08-23	Structural Energy-Guided Sampling for View-Consistent Text-to-3D	Qing Zhang et.al.	2508.16917	null
2025-08-23	Remarks on the three-dimensional Navier-Stokes equations with Lions’ exponent forced by space-time white noise	Kazuo Yamazaki et.al.	2508.16906	null
2025-08-23	Enhanced shape recovery in advection–diffusion problems via a novel ADMM-based CCBM optimization	Elmehdi Cherrat et.al.	2508.16898	null
2025-08-23	Generating Synthetic Contrast-Enhanced Chest CT Images from Non-Contrast Scans Using Slice-Consistent Brownian Bridge Diffusion Network	Pouya Shiri et.al.	2508.16897	null
2025-08-23	Delta-SVD: Efficient Compression for Personalized Text-to-Image Models	Tangyuan Zhang et.al.	2508.16863	null
2025-08-23	Subtleties of UV-crosslinking in microfluidic particle fabrication: UV dosage and intensity matter	Sabrina Marnoto et.al.	2508.16862	null
2025-08-23	Intelligent Shanghai Typhoon Model (ISTM): A generative probabilistic emulator for typhoon hybrid modeling	Zeyi Niu et.al.	2508.16851	null
2025-08-23	NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows	Denis Tarasov et.al.	2508.16845	null
2025-08-22	A Fluctuating Hydrodynamics Model for Nanoscale Surfactant-laden Interfaces	John B. Bell et.al.	2508.16820	null
2025-08-22	Two-Step Bose-Einstein Condensation of an ideal Magnetized Charged Bosonic gas under neutron star-like conditions	Amanda Castillo Ayon et.al.	2508.16799	null
2025-08-22	TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling	Yuancheng Wang et.al.	2508.16790	null
2025-08-22	Improving Performance, Robustness, and Fairness of Radiographic AI Models with Finely-Controllable Synthetic Data	Stefania L. Moroianu et.al.	2508.16783	null
2025-08-26	Characterising the short-orbital period X-ray transient Swift J1910.2-0546	J. M. Corral-Santana et.al.	2508.16775	null
2025-08-22	Spontaneous spiral patterns etched on Germanium	Yilin Wong et.al.	2508.16764	null
2025-08-22	A Framework for Benchmarking Fairness-Utility Trade-offs in Text-to-Image Models via Pareto Frontiers	Marco N. Bochernitsan et.al.	2508.16752	null
2025-08-22	Hamiltonian Simulation for Advection-Diffusion Equation with arbitrary transport field	Niladri Gomes et.al.	2508.16728	null
2025-08-22	MV-RAG: Retrieval Augmented Multiview Diffusion	Yosef Dayani et.al.	2508.16577	null
2025-08-22	Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution	Tainyi Zhang et.al.	2508.16557	null
2025-08-22	Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning	Xuan Zhang et.al.	2508.16524	null
2025-08-22	Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation	Zhijian Zhou et.al.	2508.16521	null
2025-08-22	ARSP: Automated Repair of Verilog Designs via Semantic Partitioning	Bingkun Yao et.al.	2508.16517	null
2025-08-22	Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation	Chun-Peng Chang et.al.	2508.16512	null
2025-08-22	Underdamped Langevin MCMC with third order convergence	Maximilian Scott et.al.	2508.16485	null
2025-08-22	Large-scale concentration and relaxation for mean-field Langevin particle systems	Songbo Wang et.al.	2508.16428	null
2025-08-22	Multiscale Growth Kinetics of Model Biomolecular Condensates Under Passive and Active Conditions	Tamizhmalar Sundararajan et.al.	2508.16398	null
2025-08-22	Parrondo paradox in quantum image encryption	Łukasz Pawela et.al.	2508.16382	null
2025-08-22	Observation of negative orbital torque from Vanadium	Nikhil Vijayan et.al.	2508.16339	null
2025-08-22	A Sharp KL-Convergence Analysis for Diffusion Models under Minimal Assumptions	Nishant Jain et.al.	2508.16306	null
2025-08-22	Towards Diagnostic Quality Flat-Panel Detector CT Imaging Using Diffusion Models	Hélène Corbaz et.al.	2508.16252	null
2025-08-22	Numerical solution of the time fractional nonlinear Fisher-KPP diffusion-reaction equation using the local domain boundary element method	Theodore V. Gortsas et.al.	2508.16241	null
2025-08-22	UniEM-3M: A Universal Electron Micrograph Dataset for Microstructural Segmentation and Generation	Nan wang et.al.	2508.16239	null
2025-08-22	PromptFlare: Prompt-Generalized Defense via Cross-Attention Decoy in Diffusion-Based Inpainting	Hohyun Na et.al.	2508.16217	null
2025-08-22	OmniCache: A Trajectory-Oriented Global Perspective on Training-Free Cache Reuse for Diffusion Transformer Models	Huanpeng Chu et.al.	2508.16212	null
2025-08-22	Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion Transformers	Shikang Zheng et.al.	2508.16211	null
2025-08-22	Competition and Attraction Improve Model Fusion	João Abrantes et.al.	2508.16204	null
2025-08-22	FuXi-TC: A generative framework integrating deep learning and physics-based models for improved tropical cyclone forecasts	Shan Guo et.al.	2508.16168	null
2025-08-22	Transport Properties of QGP within a Bayesian Holographic QCD Model	Bing Chen et.al.	2508.16167	null
2025-08-22	RAGSR: Regional Attention Guided Diffusion for Image Super-Resolution	Haodong He et.al.	2508.16158	null
2025-08-22	On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models	Yi Zhang et.al.	2508.16154	null
2025-08-22	Machine Learning for Medicine Must Be Interpretable, Shareable, Reproducible and Accountable by Design	Ayyüce Begüm Bektaş et.al.	2508.16097	null
2025-08-22	Two-flow Feedback Multi-scale Progressive Generative Adversarial Network	Sun Weikai et.al.	2508.16089	null
2025-08-22	A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection	Qifeng Liu et.al.	2508.16069	null
2025-08-21	Clinically-Informed Preprocessing Improves Stroke Segmentation in Low-Resource Settings	Juampablo E. Heras Rivera et.al.	2508.16004	null
2025-08-21	Multiscale Analysis of a Kinetic Model of Confined Suspensions of Self-Propelled Rods	Leonid Berlyand et.al.	2508.16003	null
2025-08-21	Universal Fluctuations in the Tail Probability for d=2 Random Walks in Space-Time Random Environments	Franscesca Ark et.al.	2508.15999	null
2025-08-21	Diverse Signer Avatars with Manual and Non-Manual Feature Modelling for Sign Language Production	Mohamed Ilyes Lakhal et.al.	2508.15988	null
2025-08-21	UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation	Zhaodong Jiang et.al.	2508.15972	null
2025-08-21	Physical blowups via buffered time change in a mean-field neural network	Nikolaos Papadopoulos et.al.	2508.15961	null
2025-08-21	Structure-Preserving Medical Image Generation from a Latent Graph Representation	Kevin Arias et.al.	2508.15920	null
2025-08-21	Text-Driven 3D Hand Motion Generation from Sign Language Data	Léore Bensabath et.al.	2508.15902	null
2025-08-21	Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning	Yijun Liu et.al.	2508.15874	null
2025-08-21	CineScale: Free Lunch in High-Resolution Cinematic Visual Generation	Haonan Qiu et.al.	2508.15774	null
2025-08-21	Scaling Group Inference for Diverse and High-Quality Generation	Gaurav Parmar et.al.	2508.15773	null
2025-08-21	Visual Autoregressive Modeling for Instruction-Guided Image Editing	Qingyang Mao et.al.	2508.15772	null
2025-08-21	Waver: Wave Your Way to Lifelike Video Generation	Yifu Zhang et.al.	2508.15761	null
2025-08-21	Skyrmion Lattice Order Controlled by Confinement Geometry	Raphael Gruber et.al.	2508.15758	null
2025-08-21	Spatial Super-Infection and Co-Infection Dynamics in Networks	Alyssa Yu et.al.	2508.15740	null
2025-08-21	Probability Density from Latent Diffusion Models for Out-of-Distribution Detection	Joonas Järve et.al.	2508.15737	null
2025-08-21	The Status of the Astrophysical Parameters of Upper Main Sequence Stars	Lukas Kueß et.al.	2508.15722	null
2025-08-21	WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception	Zhiheng Liu et.al.	2508.15720	null
2025-08-21	Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation	Nikita Kachaev et.al.	2508.15663	null
2025-08-21	When and What: Diffusion-Grounded VideoLLM with Entity Aware Segmentation for Long Video Understanding	Pengcheng Fang et.al.	2508.15641	null
2025-08-21	Are Virtual DES Images a Valid Alternative to the Real Ones?	Ana C. Perre et.al.	2508.15594	null
2025-08-21	Lattice distortions and non-sluggish diffusion in BCC refractory high entropy alloys	Jingfeng Zhang et.al.	2508.15558	null
2025-08-21	Dream 7B: Diffusion Large Language Models	Jiacheng Ye et.al.	2508.15487	null
2025-08-21	Reevaluating Anomalous Electric Fields at the Air-Water Interface: A Surface-Specific Spectroscopic Survey	Joseph C. Shirley et.al.	2508.15422	null
2025-08-21	Speckle suppression in digital in-line holographic microscopy through liquid crystal dynamic scattering	Emilia Wdowiak et.al.	2508.15419	null
2025-08-21	Numerical Analysis of Unsupervised Learning Approaches for Parameter Identification in PDEs	Siyu Cen et.al.	2508.15381	null
2025-08-21	Diffusion-driven pattern formation in an opinion dynamical network model	Tim Mauch et.al.	2508.15377	null
2025-08-21	Performance Analysis of RIS-Aided High-Mobility Wireless Systems	Hanwen Hu et.al.	2508.15375	null
2025-08-22	Analytical Theory of Chiral Active Particle Transport in a Fluctuating Density Field	Jayam Joshi et.al.	2508.15366	null
2025-08-21	The effect of multi-occupancy traps on the diffusion and retention of multiple hydrogen isotopes in irradiated tungsten and vanadium	Sanjeet Kaur et.al.	2508.15341	null
2025-08-21	Discovering correlations between metal foam thermal characteristics and non-Fourier behavior	Anna Fehér et.al.	2508.15340	null
2025-08-21	Interface fluctuations for $1$ D stochastic Allen-Cahn equation – singular regime	Weijun Xu et.al.	2508.15319	null
2025-08-21	VideoEraser: Concept Erasure in Text-to-Video Diffusion Models	Naen Xu et.al.	2508.15314	null
2025-08-21	HIP: Model-Agnostic Hypergraph Influence Prediction via Distance-Centrality Fusion and Neural ODEs	Su-Su Zhang et.al.	2508.15312	null
2025-08-21	Modeling Long-term User Behaviors with Diffusion-driven Multi-interest Network for CTR Prediction	Weijiang Lai et.al.	2508.15311	null
2025-08-21	Contribution of Globular Clusters to Diffuse Gamma-ray Emission from Galactic Plane	Jiayin He et.al.	2508.15295	null
2025-08-21	Optimizing Compilation for Distributed Quantum Computing via Clustering and Annealing	Ruilin Zhou et.al.	2508.15267	null
2025-08-21	Pathology-Informed Latent Diffusion Model for Anomaly Detection in Lymph Node Metastasis	Jiamu Wang et.al.	2508.15236	null
2025-08-21	Pretrained Diffusion Models Are Inherently Skipped-Step Samplers	Wenju Xu et.al.	2508.15233	null
2025-08-21	Collaborative Multi-Modal Coding for High-Quality 3D Generation	Ziang Cao et.al.	2508.15228	null
2025-08-21	GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design	Wen-Fan Wang et.al.	2508.15227	null
2025-08-21	*A rutile-based homologous series Na(PtO $2$)${2\it{n}+1}$ discovered by computationally assisted high-pressure synthesis*	Yasuhito Kobayashi et.al.	2508.15223	null
2025-08-21	See it. Say it. Sorted: Agentic System for Compositional Diagram Generation	Hantao Zhang et.al.	2508.15222	null
2025-08-21	Obstacle-tuned transition from chaotic to coherent vortex flows and odd diffusion in chiral active fluids	Joscha Mecke et.al.	2508.15210	null
2025-08-21	Quantum Differential Equation Solvers with Low State Preparation Cost: Eliminating the Time Dependence in Dissipative Equations	Gengzhi Yang et.al.	2508.15170	null
2025-08-21	MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion	Xuyang Chen et.al.	2508.15169	null
2025-08-21	Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors	Jeonghyun Noh et.al.	2508.15151	null
2025-08-21	Electron-Ion Equilibration in the Merging Galaxy Cluster Abell 665	Christian Norseth et.al.	2508.15138	null
2025-08-24	Side Effects of Erasing Concepts from Diffusion Models	Shaswati Saha et.al.	2508.15124	null
2025-08-20	Microstructural and preliminary optical and microwave characterization of erbium doped CaMoO $_4$ thin films	Ignas Masiulionis et.al.	2508.15122	null
2025-08-24	CurveFlow: Curvature-Guided Flow Matching for Image Generation	Yan Luo et.al.	2508.15093	null
2025-08-20	Sampling by averaging: A multiscale approach to score estimation	Paula Cordero-Encinar et.al.	2508.15069	null
2025-08-20	Asymptotic analysis on narrow tubes: narrow escape problems and diffusion processes	Wen-Tai Hsu et.al.	2508.15060	null
2025-08-20	Correlating Particle Acceleration Rates with Plasma Conditions in Colliding Wind Binaries	Gislaine B Cordeiro et.al.	2508.15059	null
2025-08-20	An MRI Atlas of the Human Fetal Brain: Reference and Segmentation Tools for Fetal Brain MRI Analysis	Mahdi Bagheri et.al.	2508.15034	null
2025-08-20	Reversible Unfolding Network for Concealed Visual Perception with Generative Refinement	Chunming He et.al.	2508.15027	null
2025-08-20	TAIGen: Training-Free Adversarial Image Generation via Diffusion Models	Susim Roy et.al.	2508.15020	null
2025-08-20	Probing Magnetic Properties of RuO $_{2}$ Heterostructures Through the Ferromagnetic Layer	Frank M. Abel et.al.	2508.15004	null
2025-08-20	LyLA-Therm: Lyapunov-based Langevin Adaptive Thermodynamic Neural Network Controller	Saiedeh Akbari et.al.	2508.14989	null
2025-08-20	Aura-CAPTCHA: A Reinforcement Learning and GAN-Enhanced Multi-Modal CAPTCHA System	Joydeep Chandra et.al.	2508.14976	null
2025-08-20	Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRI	Oliver Welin Odeback et.al.	2508.14950	null
2025-08-19	Inference Time Debiasing Concepts in Diffusion Models	Lucas S. Kupssinskü et.al.	2508.14933	null
2025-08-19	TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation	Jiacheng Xie et.al.	2508.14932	null
2025-08-20	Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs	Haokun Lin et.al.	2508.14896	null
2025-08-20	Virtual Community: An Open World for Humans, Robots, and Society	Qinhong Zhou et.al.	2508.14893	null
2025-08-20	Squeezed Diffusion Models	Jyotirmai Singh et.al.	2508.14871	null
2025-08-20	Critical trajectories in kinetic geometry	Helge Dietert et.al.	2508.14868	null
2025-08-20	Universal winding properties of chiral active motion	Ion Santra et.al.	2508.14862	null
2025-08-20	Physics-Informed ML Exploration of Structure-Transport Relationships in Hard Carbon	Nikhil Rampal et.al.	2508.14849	null
2025-08-20	TransLight: Image-Guided Customized Lighting Control with Generative Decoupling	Zongming Li et.al.	2508.14814	null
2025-08-20	Tinker: Diffusion’s Gift to 3D–Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization	Canyu Zhao et.al.	2508.14811	null
2025-08-20	Cross-Modality Controlled Molecule Generation with Diffusion Language Model	Yunzhe Zhang et.al.	2508.14748	null
2025-08-20	Modeling the impact of temperature and bird migration on the spread of West Nile virus	Pride Duve et.al.	2508.14740	null
2025-08-20	GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting	Jiaxin Wei et.al.	2508.14717	null
2025-08-20	The heating and cooling of 2D electrons at low temperatures	A. K. Jain et.al.	2508.14694	null
2025-08-20	Virtual Multiplex Staining for Histological Images using a Marker-wise Conditioned Diffusion Model	Hyun-Jic Oh et.al.	2508.14681	null
2025-08-21	Phase space transport, quasilinear diffusion and locality in phase velocity	Didier Bénisti et.al.	2508.14657	null
2025-08-20	AnchorSync: Global Consistency Optimization for Long Video Editing	Zichi Liu et.al.	2508.14609	null
2025-08-20	Call Option Price using Pearson Diffusion Processes	Tapan Kar et.al.	2508.14577	null
2025-08-20	Minimizing Task-Oriented Age of Information for Remote Monitoring with Pre-Identification	Shuying Gan et.al.	2508.14575	null
2025-08-20	EffiFusion-GAN: Efficient Fusion Generative Adversarial Network for Speech Enhancement	Bin Wen et.al.	2508.14525	null
2025-08-20	SATURN: Autoregressive Image Generation Guided by Scene Graphs	Thanh-Nhan Vo et.al.	2508.14502	null
2025-08-20	Multimode Fiber Imaging Based on Hydrogel Fiber	Lele He et.al.	2508.14501	null
2025-08-20	DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion	Moyu Zhang et.al.	2508.14500	null
2025-08-20	Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration	Haoran Bai et.al.	2508.14483	null
2025-08-20	DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing	Weitao Wang et.al.	2508.14465	null
2025-08-20	Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering	Shanlin Sun et.al.	2508.14461	null
2025-08-20	Early Evolution of the Cavity and Core of a Coronal Mass Ejection in the Inner Corona	Shuting Li et.al.	2508.14455	null
2025-08-20	FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy	Yijin Chen et.al.	2508.14441	null
2025-08-20	MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion	Fei Peng et.al.	2508.14440	null
2025-08-20	Weakly-Convex Regularization for Magnetic Resonance Image Denoising	Akash Prabakar et.al.	2508.14438	null
2025-08-20	FOCUS: Frequency-Optimized Conditioning of DiffUSion Models for mitigating catastrophic forgetting during Test-Time Adaptation	Gabriel Tjio et.al.	2508.14437	null
2025-08-20	HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation	Bing Han et.al.	2508.14431	null
2025-08-20	Disentanglement in T-space for Faster and Distributed Training of Diffusion Models with Fewer Latent-states	Samarth Gupta et.al.	2508.14413	null
2025-08-20	A Real-world Display Inverse Rendering Dataset	Seokjun Choi et.al.	2508.14411	null
2025-08-20	CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities	Yue Gong et.al.	2508.14405	null
2025-08-20	Img2ST-Net: Efficient High-Resolution Spatial Omics Prediction from Whole Slide Histology Images via Fully Convolutional Image-to-Image Learning	Junchao Zhu et.al.	2508.14393	null
2025-08-20	Physics-Constrained Diffusion Reconstruction with Posterior Correction for Quantitative and Fast PET Imaging	Yucun Hou et.al.	2508.14364	null
2025-08-20	Organ-Agents: Virtual Human Physiology Simulator via LLMs	Rihao Chang et.al.	2508.14357	null
2025-08-20	SBGD: Improving Graph Diffusion Generative Model via Stochastic Block Diffusion	Junwei Su et.al.	2508.14352	null
2025-08-20	A Non-Asymptotic Convergent Analysis for Scored-Based Graph Generative Model via a System of Stochastic Differential Equations	Junwei Su et.al.	2508.14351	null
2025-08-20	Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation	Lingkai Kong et.al.	2508.14342	null
2025-08-20	Modeling oxygen-void interactions in uranium nitride	Mohamed AbdulHameed et.al.	2508.14329	null
2025-08-20	MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation	Guile Wu et.al.	2508.14327	null
2025-08-20	Modeling of silver transport in cubic SiC: Integrating molecular dynamics, bounds averaging, and uncertainty quantification	Mohamed AbdulHameed et.al.	2508.14325	null
2025-08-19	Tooth-Diffusion: Guided 3D CBCT Synthesis with Fine-Grained Tooth Conditioning	Said Djafar Said et.al.	2508.14276	null
2025-08-19	Mean field social optimization: feedback person-by-person optimality and the dynamic programming equation	Minyi Huang et.al.	2508.14236	null
2025-08-19	CO Adsorption Sites on Interstellar Water Ices Explored with Machine Learning Potentials. Binding energy distributions and snowline	Giulia M. Bovolenta et.al.	2508.14219	null
2025-08-19	A well-balanced gas-kinetic scheme with adaptive mesh refinement for shallow water equations	Gaocheng Liu et.al.	2508.14216	null
2025-08-19	Nonadiabatic force matching for alchemical free-energy estimation	Jorge L. Rosa-Raíces et.al.	2508.14179	null
2025-08-19	DPad: Efficient Diffusion Language Models with Suffix Dropout	Xinhua Chen et.al.	2508.14148	null
2025-08-18	3D Cardiac Anatomy Generation Using Mesh Latent Diffusion Models	Jolanta Mozyrska et.al.	2508.14122	null
2025-08-19	InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing	Shaoshu Yang et.al.	2508.14033	null
2025-08-19	Electrochemical response of biological membranes to localized currents and external electric fields	Joshua B. Fernandes et.al.	2508.14001	null
2025-08-19	Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment	Samuel Seligardi et.al.	2508.13989	null
2025-08-20	Towards a general diffusion-based information quality assessment model	Anthony Lopes Temporao et.al.	2508.13927	null
2025-08-19	Learning to See Through Flare	Xiaopeng Peng et.al.	2508.13907	null
2025-08-19	Revisiting Diffusion Q-Learning: From Iterative Denoising to One-Step Action Generation	Thanh Nguyen et.al.	2508.13904	null
2025-08-19	Diffusion-Driven High-Dimensional Variable Selection	Minjie Wang et.al.	2508.13890	null
2025-08-19	Toward Deployable Multi-Robot Collaboration via a Symbolically-Guided Decision Transformer	Rathnam Vidushika Rasanji et.al.	2508.13877	null
2025-08-19	SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation	Paul Grimal et.al.	2508.13866	null
2025-08-19	Stochastic synaptic dynamics under learning	Jakob Stubenrauch et.al.	2508.13846	null
2025-08-19	UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion	Zihan Liang et.al.	2508.13843	null
2025-08-20	Latent Interpolation Learning Using Diffusion Models for Cardiac Volume Reconstruction	Niklas Bubeck et.al.	2508.13826	null
2025-08-19	COCO: Cognitive Operating System with Continuous Oversight for Multi-Agent Workflow Reliability	Churong Liang et.al.	2508.13815	null
2025-08-19	Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs	Juncheng Xie et.al.	2508.13805	null
2025-08-19	Elementary Monte Carlo model of the anisotropic recrystallization and antiripening under intensive stirring and high supersaturations	Serhii Abakumov et.al.	2508.13799	null
2025-08-19	Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing	Feng-Lin Liu et.al.	2508.13797	null
2025-08-19	DegDiT: Controllable Audio Generation with Dynamic Event Graph Guided Diffusion Transformer	Yisu Liu et.al.	2508.13786	null
2025-08-19	Comparing Conditional Diffusion Models for Synthesizing Contrast-Enhanced Breast MRI from Pre-Contrast Images	Sebastian Ibarra et.al.	2508.13776	null
2025-08-19	Eliminating Rasterization: Direct Vector Floor Plan Generation with DiffPlanner	Shidong Wang et.al.	2508.13738	null
2025-08-19	Simulation of Impact-induced seismic shaking on asteroid (25143) Itokawa to address its resurfacing process	Sunho Jin et.al.	2508.13727	null
2025-08-19	Unravelling disorder in kagome Yb $_{0.5}$Co$_3$Ge$_3$	A. Korshunov et.al.	2508.13719	null
2025-08-19	Diffuse-Layer Capacitance at the Potential of Zero Charge in Binary Mixtures	Yuki Uematsu et.al.	2508.13691	null
2025-08-19	PHECT: A lightweight computation tool for pulsar halo emission	Kun Fang et.al.	2508.13667	null
2025-08-19	Calibrated Semantic Diffusion: A p-Laplacian Synthesis with Learnable Dissipation, Quantified Constants, and Graph-Aware Calibration	Faruk Alpay et.al.	2508.13658	null
2025-08-19	Personalized Subgraph Federated Learning with Sheaf Collaboration	Wenfei Liang et.al.	2508.13642	null
2025-08-19	V2P: From Background Suppression to Center Peaking for Robust GUI Grounding Task	Jikai Chen et.al.	2508.13634	null
2025-08-19	Text2Weight: Bridging Natural Language and Neural Network Weight Spaces	Bowen Tian et.al.	2508.13633	null
2025-08-20	DiffIER: Optimizing Diffusion Models with Iterative Error Reduction	Ao Chen et.al.	2508.13628	null
2025-08-19	Bridging Clear and Adverse Driving Conditions	Yoel Shapiro et.al.	2508.13592	null
2025-08-19	Temporal-Conditional Referring Video Object Segmentation with Noise-Free Text-to-Video Diffusion Model	Ruixin Zhang et.al.	2508.13584	null
2025-08-19	Overcoming Quantum Resistivity Scaling in Nanoscale Interconnects Using Delafossite PdCoO2	Seoung-Hun Kang et.al.	2508.13573	null
2025-08-19	A stability-enhanced nonstandard finite difference framework for solving one and two-dimensional nonlocal differential equations	Shweta Kumari et.al.	2508.13542	null
2025-08-20	2D Gaussians Meet Visual Tokenizer	Yiang Shi et.al.	2508.13515	null
2025-08-19	A Monte Carlo simulation on the scattering coefficients of solar radio wave propagation	Jiazhen Gan et.al.	2508.13494	null
2025-08-19	The Lévy flight foraging hypothesis: comparison between stationary distributions and anomalous diffusion	Serena Dipierro et.al.	2508.13487	null
2025-08-19	EventTSF: Event-Aware Non-Stationary Time Series Forecasting	Yunfeng Ge et.al.	2508.13434	null
2025-08-19	Hyperactive Magnetar Eruptions: Giant Flares, Baryon Ejections, and FRBs	Ashley Bransgrove et.al.	2508.13419	null
2025-08-18	Counterfactual Probabilistic Diffusion with Expert Models	Wenhao Mu et.al.	2508.13355	null
2025-08-18	Susceptibility Distortion Correction of Diffusion MRI with a single Phase-Encoding Direction	Sedigheh Dargahi et.al.	2508.13340	null
2025-08-18	Resistive diffusion and radiative cooling effects in magnetized oblique shocks	R. Datta et.al.	2508.13310	null
2025-08-18	GaitCrafter: Diffusion Model for Biometric Preserving Gait Synthesis	Sirshapan Mitra et.al.	2508.13300	null
2025-08-18	Field-level Reconstruction from Foreground-Contaminated 21-cm Maps	Shu-Fan Chen et.al.	2508.13265	null
2025-08-18	4DNeX: Feed-Forward 4D Generative Modeling Made Easy	Zhaoxi Chen et.al.	2508.13154	null
2025-08-18	MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models	Haoyu He et.al.	2508.13148	null
2025-08-18	Some semi-decoupled algorithms with optimal convergence for a four-field linear thermo-poroelastic model	Ziliang Li et.al.	2508.13109	null
2025-08-18	Precise Action-to-Video Generation Through Visual Action Prompts	Yuang Wang et.al.	2508.13104	null
2025-08-18	Denoising diffusion models for inverse design of inflatable structures with programmable deformations	Sara Karimi et.al.	2508.13097	null
2025-08-18	DMS:Diffusion-Based Multi-Baseline Stereo Generation for Improving Self-Supervised Depth Estimation	Zihua Liu et.al.	2508.13091	null
2025-08-18	ID-Card Synthetic Generation: Toward a Simulated Bona fide Dataset	Qingwen Zeng et.al.	2508.13078	null
2025-08-18	From Transthoracic to Transesophageal: Cross-Modality Generation using LoRA Diffusion	Emmanuel Oladokun et.al.	2508.13077	null
2025-08-18	Reinforced Context Order Recovery for Adaptive Reasoning and Planning	Long Ma et.al.	2508.13070	null
2025-08-18	Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping	Siddharth Khandelwal et.al.	2508.13065	null
2025-08-19	PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models	Pengcheng Huang et.al.	2508.13021	null
2025-08-18	EgoTwin: Dreaming Body and View in First Person	Jingqiao Xiu et.al.	2508.13013	null
2025-08-18	Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model	Xianglong He et.al.	2508.13009	null
2025-08-18	Transfer Learning for Neutrino Scattering: Domain Adaptation with GANs	Jose L. Bonilla et.al.	2508.12987	null
2025-08-18	The Leibenson process	Viorel Barbu et.al.	2508.12979	null
2025-08-18	Compact Attention: Exploiting Structured Spatio-Temporal Sparsity for Fast Video Generation	Qirui Li et.al.	2508.12969	null
2025-08-18	Self-Consistent Heating of the Magnetically Closed Solar Corona: Generation of Nanoflares, Thermodynamic Response of the Plasma and Observational Signatures	Craig D. Johnston et.al.	2508.12952	null
2025-08-18	Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models	Jianshu Zeng et.al.	2508.12945	null
2025-08-19	Fully Automated Segmentation of Fiber Bundles in Anatomic Tracing Data	Kyriaki-Margarita Bintsi et.al.	2508.12942	null
2025-08-18	7Bench: a Comprehensive Benchmark for Layout-guided Text-to-image Models	Elena Izzo et.al.	2508.12919	null
2025-08-18	FoleySpace: Vision-Aligned Binaural Spatial Audio Generation	Lei Zhao et.al.	2508.12918	null
2025-08-18	S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models	Chubin Chen et.al.	2508.12880	null
2025-08-18	E3RG: Building Explicit Emotion-driven Empathetic Response Generation System with Multimodal Large Language Model	Ronghao Lin et.al.	2508.12854	null
2025-08-18	Strongly correlated stochastic systems	Marco Biroli et.al.	2508.12818	null
2025-08-18	Next Visual Granularity Generation	Yikai Wang et.al.	2508.12811	null
2025-08-18	Wavy Transformer	Satoshi Noguchi et.al.	2508.12787	null
2025-08-18	Right and Wrong Ansätze for Nonlinear Waves in Stochastic PDEs	C. H. S. Hamster et.al.	2508.12786	null
2025-08-18	Leveraging Diffusion Models for Stylization using Multiple Style Images	Dan Ruta et.al.	2508.12784	null
2025-08-18	TURB-Scalar. A large database of passive scalar fields advected by 2D Navier-Stokes in the turbulent inverse cascade regime	Chiara Calascibetta et.al.	2508.12762	null
2025-08-18	Effects of Defects on Thermal Transport across Solid/Solid Heterogeneous Interfaces	Ershuai Yin et.al.	2508.12744	null
2025-08-18	Single-Reference Text-to-Image Manipulation with Dual Contrastive Denoising Score	Syed Muhmmad Israr et.al.	2508.12718	null
2025-08-18	Hyperparameter Optimization in the Estimation of PDE and Delay-PDE models from data	Oliver Mai et.al.	2508.12715	null
2025-08-18	Asymmetric Diffusion Recommendation Model	Yongchun Zhu et.al.	2508.12706	null
2025-08-18	Deadline-Aware Bandwidth Allocation for Semantic Generative Communication with Diffusion Models	Jinhyuk Choi et.al.	2508.12701	null
2025-08-18	MixCache: Mixture-of-Cache for Video Diffusion Transformer Acceleration	Yuanxin Wei et.al.	2508.12691	null
2025-08-18	WP-CLIP: Leveraging CLIP to Predict Wölfflin’s Principles in Visual Art	Abhijay Ghildyal et.al.	2508.12668	null
2025-08-18	Stable Diffusion-Based Approach for Human De-Occlusion	Seung Young Noh et.al.	2508.12663	null
2025-08-18	Score-informed Neural Operator for Enhancing Ordering-based Causal Discovery	Jiyeon Kang et.al.	2508.12650	null
2025-08-18	Cognitive Structure Generation: From Educational Priors to Policy Optimization	Hengnian Gu et.al.	2508.12647	null
2025-08-18	ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving	Can Cui et.al.	2508.12603	null
2025-08-19	A Tale of Two Sightlines: Comparison of Hydrocarbon Dust Absorption Bands toward Cygnus OB2-12 and the Galactic Center	Yvonne J. Pendleton et.al.	2508.12601	null
2025-08-17	Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference	Denis Blessing et.al.	2508.12511	null
2025-08-17	Say It, See It: A Systematic Evaluation on Speech-Based 3D Content Generation Methods in Augmented Reality	Yanming Xiu et.al.	2508.12498	null
2025-08-19	Portable Laser-Pumped Rb Atomic Clock with Digital Circuits	Qiang Hao et.al.	2508.12437	null
2025-08-17	Spin decoherence dynamics of Er $^{3+}$ in CeO$_2$ film	Sagar Kumar Seth et.al.	2508.12429	null
2025-08-17	TiP4GEN: Text to Immersive Panorama 4D Scene Generation	Ke Xing et.al.	2508.12415	null
2025-08-17	Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position	Zhixin Xie et.al.	2508.12398	null
2025-08-17	DeCoT: Decomposing Complex Instructions for Enhanced Text-to-Image Generation with Large Language Models	Xiaochuan Lin et.al.	2508.12396	null
2025-08-17	Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models	Xun Su et.al.	2508.12361	null
2025-08-17	Topological Dissipation as the Missing Link in Multiscale Polymer Dynamics	Xu-Ze Zhang et.al.	2508.12359	null
2025-08-17	Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data	Ahmet H. Güzel et.al.	2508.12356	null
2025-08-17	Semantic Discrepancy-aware Detector for Image Forgery Identification	Ziye Wang et.al.	2508.12341	null
2025-08-17	Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR	Fatemeh Ghorbani Lohesara et.al.	2508.12336	null
2025-08-17	Sketchar: Supporting Character Design and Illustration Prototyping Using Generative AI	Long Ling et.al.	2508.12333	null
2025-08-17	Steering chiral active Brownian motion via stochastic position-orientation resetting	Amir Shee et.al.	2508.12223	null
2025-08-17	Distribution Matching via Generalized Consistency Models	Sagar Shrestha et.al.	2508.12222	null
2025-08-17	Self-Guided Action Diffusion	Rhea Malhotra et.al.	2508.12189	null
2025-08-16	Critical Importance of Grain Boundaries to the Conductivity of Polycrystalline Molecular Crystals	Shujit Chandra Paul et.al.	2508.12172	null
2025-08-16	Belief-Conditioned One-Step Diffusion: Real-Time Trajectory Planning with Just-Enough Sensing	Gokul Puthumanaillam et.al.	2508.12166	null
2025-08-16	A Systematic Particle Filter for Estimating Time-Varying Parameters in Advection-Diffusion Equations with Source Terms	Andrea Arnold et.al.	2508.12155	null
2025-08-16	Demystifying Foreground-Background Memorization in Diffusion Models	Jimmy Z. Di et.al.	2508.12148	null
2025-08-16	Relativistic quintuple-zeta basis sets for the s block	Marten L. Reitsma et.al.	2508.12144	null
2025-08-16	DualFit: A Two-Stage Virtual Try-On via Warping and Synthesis	Minh Tran et.al.	2508.12131	null
2025-08-16	Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion	Songwei Liu et.al.	2508.12094	null
2025-08-16	Strong overlap of deterministic and stochastic dynamics in a super-diffusive regime	Muhammad Tayyab et.al.	2508.12091	null
2025-08-16	Generic Event Boundary Detection via Denoising Diffusion	Jaejun Hwang et.al.	2508.12084	null
2025-08-16	Content Accuracy and Quality Aware Resource Allocation Based on LP-Guided DRL for ISAC-Driven AIGC Networks	Ningzhe Shi et.al.	2508.12079	null
2025-08-16	Load-Balanced Diffusion Monte Carlo Method with Lattice Regularization	Kousuke Nakano et.al.	2508.12033	null
2025-08-16	Bongard-RWR+: Real-World Representations of Fine-Grained Concepts in Bongard Problems	Szymon Pawlonka et.al.	2508.12026	null
2025-08-16	Virtual Trading in Multi-Settlement Electricity Markets	Agostino Capponi et.al.	2508.11979	null
2025-08-16	UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding	Yueming Xu et.al.	2508.11952	null
2025-08-19	Assessment of Using Synthetic Data in Brain Tumor Segmentation	Aditi Jahagirdar et.al.	2508.11922	null
2025-08-16	SafeCtrl: Region-Based Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress	Lingyun Zhang et.al.	2508.11904	null
2025-08-16	OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation	Jilei Mao et.al.	2508.11898	null
2025-08-16	Simulation of heavy quarkonium equilibration in the quark-gluon plasma	Shouxing Zhao et.al.	2508.11897	null
2025-08-16	SimInterview: Transforming Business Education through Large Language Model-Based Simulated Multilingual Interview Training System	Truong Thanh Hung Nguyen et.al.	2508.11873	null
2025-08-15	Serendipitous discovery of a young cluster of galaxies at $z \sim 0.5$ projected next to the nearby tadpole galaxy KUG 1138 + 327	Q. Daniel Wang et.al.	2508.11819	null
2025-08-15	FairTabGen: Unifying Counterfactual and Causal Fairness in Synthetic Tabular Data Generation	Nitish Nagesh et.al.	2508.11810	null
2025-08-15	LoRAtorio: An intrinsic approach to LoRA Skill Composition	Niki Foteinopoulou et.al.	2508.11624	null
2025-08-15	Dataset Creation for Visual Entailment using Generative AI	Rob Reijtenbach et.al.	2508.11605	null
2025-08-15	CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion	Zhe Zhu et.al.	2508.11603	null
2025-08-15	Low barrier ZrO $_x$ -based Josephson junctions	Jaehong Choi et.al.	2508.11593	null
2025-08-15	Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model	Zuo Zuo et.al.	2508.11550	null
2025-08-15	Physics-Informed Diffusion Models for Unsupervised Anomaly Detection in Multivariate Time Series	Juhi Soni et.al.	2508.11528	null
2025-08-15	CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models	Xiaoxue Wu et.al.	2508.11484	null
2025-08-15	SPG: Style-Prompting Guidance for Style-Specific Content Creation	Qian Liang et.al.	2508.11476	null
2025-08-15	DPI-SPR: A Differentiable Physical Inversion for Shadow Profile Reconstruction Framework in Forward Scatter Radar	ShuQi Lei et.al.	2508.11470	null
2025-08-15	Simulation-based inference using splitting schemes for partially observed diffusions in chemical reaction networks	Petar Jovanovski et.al.	2508.11438	null
2025-08-15	MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image Generation	Qian Liang et.al.	2508.11433	null
2025-08-15	Wavelength dependence of laser pulse filamentation around atomic resonances	Gabor Demeter et.al.	2508.11417	null
2025-08-15	The Effect of Flow Parameters and Wall Models on Gas-Surface Interactions: A Numerical Investigation of dsmcFoam	M. B. Agir et.al.	2508.11403	null
2025-08-15	Pairwise correlations of global times in one-dimensional Brownian motion under stochastic resetting	Yihao Wang et.al.	2508.11387	null
2025-08-15	AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis	Zonglin Wu et.al.	2508.11375	null
2025-08-15	GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition	Md Asgor Hossain Reaj et.al.	2508.11334	null
2025-08-15	Noise Matters: Optimizing Matching Noise for Diffusion Classifiers	Yanghao Wang et.al.	2508.11330	null
2025-08-18	TimeMachine: Fine-Grained Facial Age Editing with Identity Preservation	Yilin Mi et.al.	2508.11284	null
2025-08-15	Probing the Representational Power of Sparse Autoencoders in Vision Models	Matthew Lyle Olson et.al.	2508.11277	null
2025-08-15	Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception	Junjie Wang et.al.	2508.11256	null
2025-08-15	FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation	MengChao Wang et.al.	2508.11255	null
2025-08-15	Graph Neural Diffusion via Generalized Opinion Dynamics	Asela Hevapathige et.al.	2508.11249	null
2025-08-15	Cross-Granularity Hypergraph Retrieval-Augmented Generation for Multi-hop Question Answering	Changjian Wang et.al.	2508.11247	null
2025-08-15	Efficient Image-to-Image Schrödinger Bridge for CT Field of View Extension	Zhenhao Li et.al.	2508.11211	null
2025-08-15	StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation	Seungmi Lee et.al.	2508.11203	null
2025-08-15	NGC 2392 and NGC 4361: Spectroscopic Diagnostics of Planetary Nebula Evolution	Atul Kumar Singh et.al.	2508.11202	null
2025-08-15	Statistical Properties of Current Noise Induced by Electron-Phonon Scattering in Metallic Carbon Nanotubes	Aina Sumiyoshi et.al.	2508.11201	null
2025-08-15	Representation Quantization for Collaborative Filtering Augmentation	Yunze Luo et.al.	2508.11194	null
2025-08-15	Semi-supervised Image Dehazing via Expectation-Maximization and Bidirectional Brownian Bridge Diffusion Models	Bing Liu et.al.	2508.11165	null
2025-08-15	LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction	Maoquan Zhang et.al.	2508.11153	null
2025-08-15	Residual-based Efficient Bidirectional Diffusion Model for Image Dehazing and Haze Generation	Bing Liu et.al.	2508.11134	null
2025-08-15	SQ-A: A Collision Triggered Starburst in Intra-Group Medium of Stephan’s Quintet	C. K. Xu et.al.	2508.11124	null
2025-08-14	Diffusion is a code repair operator and generator	Mukul Singh et.al.	2508.11110	null
2025-08-14	HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing	Xinjie Gao et.al.	2508.11106	null
2025-08-14	GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning	Kelin Yu et.al.	2508.11049	null
2025-08-14	A porous medium equation with spatially inhomogeneous absorption. Part II: Large time behavior	Razvan Gabriel Iagar et.al.	2508.11046	null
2025-08-14	3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation	Nikolaos Gkanatsios et.al.	2508.11002	null
2025-08-14	Improving Text Style Transfer using Masked Diffusion Language Models with Inference-time Scaling	Tejomay Kishor Padole et.al.	2508.10995	null
2025-08-14	Match & Choose: Model Selection Framework for Fine-tuning Text-to-Image Diffusion Models	Basile Lewandowski et.al.	2508.10993	null
2025-08-14	The extended molecular gas of the Circinus galaxy and NGC 1097 as seen by APEX	Akhil Lasrado et.al.	2508.10982	null
2025-08-14	EVCtrl: Efficient Control Adapter for Visual Generation	Zixiang Yang et.al.	2508.10963	null
2025-08-13	From Promise to Practical Reality: Transforming Diffusion MRI Analysis with Fast Deep Learning Enhancement	Xinyi Wang et.al.	2508.10950	null
2025-08-14	Exchange-driven self-diffusion of nanoscale crystalline parahydrogen clusters on graphite	K. M. Kolevski et.al.	2508.10883	null
2025-08-14	A Survey on Diffusion Language Models	Tianyi Li et.al.	2508.10875	null
2025-08-14	Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation	Harold Haodong Chen et.al.	2508.10858	null
2025-08-16	Object Fidelity Diffusion for Remote Sensing Image Generation	Ziqi Ye et.al.	2508.10801	null
2025-08-14	Ultra-High-Definition Reference-Based Landmark Image Super-Resolution with Generative Diffusion Prior	Zhenning Shi et.al.	2508.10779	null
2025-08-14	Video-BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation	Youping Gu et.al.	2508.10774	null
2025-08-14	AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences	Jieyu Li et.al.	2508.10771	null
2025-08-14	Formation and protection of an Eu-Ir surface compound below hexagonal boron nitride	Alaa Mohammed Idris Bakhit et.al.	2508.10746	null
2025-08-14	A Kinetic Theory Approach to Ordered Fluids	José A. Carrillo et.al.	2508.10744	null
2025-08-14	Thinking Inside the Mask: In-Place Prompting in Diffusion LLMs	Xiangqi Jin et.al.	2508.10736	null
2025-08-14	Exploiting Discriminative Codebook Prior for Autoregressive Image Generation	Longxiang Tang et.al.	2508.10719	null
2025-08-14	NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale	NextStep Team et.al.	2508.10711	null
2025-08-14	CountCluster: Training-Free Object Quantity Guidance with Cross-Attention Map Clustering for Text-to-Image Generation	Joohyeon Lee et.al.	2508.10710	null
2025-08-14	Probabilistic Forecasting Method for Offshore Wind Farm Cluster under Typhoon Conditions: a Score-Based Conditional Diffusion Model	Jinhua He et.al.	2508.10705	null
2025-08-14	Effective permeability conditions for diffusive transport through impermeable membranes with gaps	Molly Brennan et.al.	2508.10694	null
2025-08-14	Novel View Synthesis using DDIM Inversion	Sehajdeep SIngh et.al.	2508.10688	null
2025-08-14	MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control	Yuchen Zhu et.al.	2508.10684	null
2025-08-14	Hybrid Generative Fusion for Efficient and Privacy-Preserving Face Recognition Dataset Generation	Feiran Li et.al.	2508.10672	null
2025-08-14	Geospatial Diffusion for Land Cover Imperviousness Change Forecasting	Debvrat Varshney et.al.	2508.10649	null
2025-08-14	Increasing the Utility of Synthetic Images through Chamfer Guidance	Nicola Dall’Asen et.al.	2508.10631	null
2025-08-14	A Unified Framework from Boltzmann Transport to Proton Treatment Planning	Andreas E. Kyprianou et.al.	2508.10596	null
2025-08-14	HM-Talker: Hybrid Motion Modeling for High-Fidelity Talking Head Synthesis	Shiyu Liu et.al.	2508.10566	null
2025-08-14	Projected Coupled Diffusion for Test-Time Constrained Joint Generation	Hao Luan et.al.	2508.10531	null
2025-08-14	EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba	Quang Nguyen et.al.	2508.10522	null
2025-08-15	KDPE: A Kernel Density Estimation Strategy for Diffusion Policy Trajectory Selection	Andrea Rosasco et.al.	2508.10511	null
2025-08-14	A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection	Yangjie Xiao et.al.	2508.10509	null
2025-08-14	TweezeEdit: Consistent and Efficient Image Editing with Path Regularization	Jianda Mao et.al.	2508.10498	null
2025-08-14	A Unified Multi-Agent Framework for Universal Multimodal Understanding and Generation	Jiulin Li et.al.	2508.10494	null
2025-08-14	Jamming of active particles in narrow pores: Implications for ratchet effect and diffusion coefficient	Šimon Pajger et.al.	2508.10483	null
2025-08-14	NanoControl: A Lightweight Framework for Precise and Efficient Control in Diffusion Transformer	Shanyuan Liu et.al.	2508.10424	null
2025-08-14	Extracting a stochastic model for predator-prey dynamic of turbulence and zonal flows with limited data	J. C. Huang et.al.	2508.10408	null
2025-08-14	Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models	Eunseo Koh et.al.	2508.10407	null
2025-08-14	PQ-DAF: Pose-driven Quality-controlled Data Augmentation for Data-scarce Driver Distraction Detection	Haibin Sun et.al.	2508.10397	null
2025-08-14	EDIS: A Simulation Software for Dynamic Ion Intercalation/Deintercalation Processes in Electrode Materials	Liqi Wang et.al.	2508.10384	null
2025-08-14	Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models	Hyundo Lee et.al.	2508.10382	null
2025-08-14	A Semantic-Aware Framework for Safe and Intent-Integrative Assistance in Upper-Limb Exoskeletons	Yu Chen et.al.	2508.10378	null
2025-08-14	Scalable Modeling of Nonlinear Network Dynamics in Neurodegenerative Disease	Daniel Semchin et.al.	2508.10343	null
2025-08-14	ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver	Wenxuan Song et.al.	2508.10333	null
2025-08-14	Cross-view Generalized Diffusion Model for Sparse-view CT Reconstruction	Jixiang Chen et.al.	2508.10313	null
2025-08-14	DiffAxE: Diffusion-driven Hardware Accelerator Generation and Design Space Exploration	Arkapravo Ghosh et.al.	2508.10303	null
2025-08-14	Influence Maximization in Multi-layer Social Networks Based on Differentiated Graph Embeddings	Ronghua Lin et.al.	2508.10289	null
2025-08-14	High Fidelity Text to Image Generation with Contrastive Alignment and Structural Guidance	Danyi Gao et.al.	2508.10280	null
2025-08-14	A Spectral Solver to Capture Unsteady Dynamics in the Aerospike Nozzle Wake	Zachary Pyle et.al.	2508.10275	null
2025-08-14	Non-Decaying Solutions to the 2D Dissipative Quasi-Geostrophic Equations	David M. Ambrose et.al.	2508.10254	null
2025-08-13	Run-and-tumble dynamics with non-reciprocal transitions between three velocity states	Julio C. R. Romo-Cruz et.al.	2508.10213	null
2025-08-13	Diffusive Braking of Penetrative Convection in Stably-Stratified Fluids	Bradley W. Hindman et.al.	2508.10174	null
2025-08-13	Predicting First-Passage Dynamics in Disordered Systems Exactly: Application to Sparse Networks	Daniel Marris et.al.	2508.10140	null
2025-08-13	The Perturbation Theory Approach to Stability in the Scattered Disk	Matthew Belyakov et.al.	2508.10119	null
2025-08-13	Constrained Decoding of Diffusion LLMs with Context-Free Grammars	Niels Mündler et.al.	2508.10111	null
2025-08-13	Quantum circuit simulation with a local time-dependent variational principle	Aaron Sander et.al.	2508.10096	null
2025-08-13	Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design	Yuhao Sun et.al.	2508.10065	null
2025-08-13	Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation	Junyan Ye et.al.	2508.09987	null
2025-08-13	Story2Board: A Training-Free Approach for Expressive Storyboard Generation	David Dinkevich et.al.	2508.09983	null
2025-08-13	Masquerade: Learning from In-the-wild Human Videos using Data-Editing	Marion Lepert et.al.	2508.09976	null
2025-08-13	PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image	Geonhee Sim et.al.	2508.09973	null
2025-08-13	Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models	Luca Eyring et.al.	2508.09968	null
2025-08-13	Stable Diffusion Models are Secretly Good at Visual In-Context Learning	Trevine Oorloff et.al.	2508.09949	null
2025-08-13	AST-n: A Fast Sampling Approach for Low-Dose CT Reconstruction using Diffusion Models	Tomás de la Sotta et.al.	2508.09943	null
2025-08-13	Quo Vadis Handwritten Text Generation for Handwritten Text Recognition?	Vittorio Pippi et.al.	2508.09936	null
2025-08-13	Active Particle Diffusion in Convection Roll Arrays	Pulak Kumar Ghosh et.al.	2508.09924	null
2025-08-14	Prototype-Guided Diffusion: Visual Conditioning without External Memory	Bilal Faye et.al.	2508.09922	null
2025-08-13	Hybrid Quantum-Classical Latent Diffusion Models for Medical Image Generation	Kübra Yeter-Aydeniz et.al.	2508.09903	null
2025-08-13	Binary Mixtures in Linear Convection Arrays	Pulak Kumar Ghosh et.al.	2508.09902	null
2025-08-13	Exploring the Physics of the Plasma Liner Experiment: A Multi-dimensional Study with FLASH, OSIRIS, and HELIOS	E. C. Hansen et.al.	2508.09895	null
2025-08-13	Marketron Through the Looking Glass: From Equity Dynamics to Option Pricing in Incomplete Markets	Igor Halperin et.al.	2508.09863	null
2025-08-13	HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics	Weiqi Li et.al.	2508.09858	null
2025-08-13	Enhancing Diffusion Face Generation with Contrastive Embeddings and SegFormer Guidance	Dhruvraj Singh Rawat et.al.	2508.09847	null
2025-08-13	On the Generalization Limits of Quantum Generative Adversarial Networks with Pure State Generators	Jasmin Frkatovic et.al.	2508.09844	null
2025-08-13	Speed Always Wins: A Survey on Efficient Architectures for Large Language Models	Weigao Sun et.al.	2508.09834	null
2025-08-13	Physical Autoregressive Model for Robotic Manipulation without Action Pretraining	Zijian Song et.al.	2508.09822	null
2025-08-13	Feature Impact Analysis on Top Long-Jump Performances with Quantile Random Forest and Explainable AI Techniques	Qi Gan et.al.	2508.09810	null
2025-08-13	Condition number for finite element discretisation of nonlocal PDE systems with applications to biology	Olusegun E. Adebayo et.al.	2508.09781	null
2025-08-13	Impacts of the duration and intensity of grazing cycle on vegetation population dynamics in semi-arid ecosystems with seasonal succession	Junhong Gan et.al.	2508.09760	null
2025-08-13	Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection	Zhiqiu Zhang et.al.	2508.09746	null
2025-08-13	MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers	Qianru Qiu et.al.	2508.09709	null
2025-08-13	Hydrodynamic approximations for driven dense colloidal mixtures in narrow pores	Frantisek Slanina et.al.	2508.09686	null
2025-08-13	Anomalous Transport of Elongated Particles in Oscillatory Vortical Flows	Shiyuan Hu et.al.	2508.09677	null
2025-08-13	GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors	Xingyilang Yin et.al.	2508.09667	null
2025-08-13	NegFaceDiff: The Power of Negative Context in Identity-Conditioned Diffusion for Synthetic Face Generation	Eduarda Caldeira et.al.	2508.09661	null
2025-08-13	Asymptotic-analysis-inspired boundary conditions aiming at eliminating polymer diffusive instability	Ming Dong et.al.	2508.09635	null
2025-08-15	Preacher: Paper-to-Video Agentic System	Jingwei Liu et.al.	2508.09632	null
2025-08-13	MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography	Daniel Barco et.al.	2508.09616	null
2025-08-13	Global uniform regularity for the 3D incompressible MHD equations with slip boundary condition near a background magnetic field	Jincheng Gao et.al.	2508.09609	null
2025-08-13	Images Speak Louder Than Scores: Failure Mode Escape for Enhancing Generative Quality	Jie Shao et.al.	2508.09598	null
2025-08-13	Dual Recursive Feedback on Generation and Appearance Latents for Pose-Robust Text-to-Image Diffusion	Jiwon Kim et.al.	2508.09575	null
2025-08-13	Zeolitic imidazolate framework glasses emit white light	Zhencai Li et.al.	2508.09552	null
2025-08-13	Exploring the Equivalence of Closed-Set Generative and Real Data Augmentation in Image Classification	Haowen Wang et.al.	2508.09550	null
2025-08-13	Boron Clusters for Metal-Free Water Splitting	Masaya Fujioka et.al.	2508.09538	null
2025-08-13	Ehrenfest Dynamics with Spontaneous Localization	Anderson A. Tomaz et.al.	2508.09526	null
2025-08-13	Generation of Indian Sign Language Letters, Numbers, and Words	Ajeet Kumar Yadav et.al.	2508.09522	null
2025-08-13	A hyperbolic finite difference scheme for anisotropic diffusion equations: preserving the discrete maximum principle	Tokuhiro Eto et.al.	2508.09509	null
2025-08-13	Stingrays in the radio sky: Two unusual diffuse radio relic sources in the direction of the Magellanic Stream	Zachary J Smeaton et.al.	2508.09495	null
2025-08-13	SARE: Semantic-Aware Reconstruction Error for Generalizable Diffusion-Generated Image Detection	Ju Yeon Kang et.al.	2508.09487	null
2025-08-13	CLIP-Flow: A Universal Discriminator for AI-Generated Images Inspired by Anomaly Detection	Zhipeng Yuan et.al.	2508.09477	null
2025-08-14	From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts	Yuji Wang et.al.	2508.09476	null
2025-08-13	Leveraging Failed Samples: A Few-Shot and Training-Free Framework for Generalized Deepfake Detection	Shibo Yao et.al.	2508.09475	null
2025-08-13	Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy	Hao Yu et.al.	2508.09461	null
2025-08-13	RASR: Retrieval-Augmented Super Resolution for Practical Reference-based Image Restoration	Jiaqi Yan et.al.	2508.09449	null
2025-08-13	DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation	Haoxiang Shi et.al.	2508.09444	null
2025-08-13	Scaling behaviour of rotating convection in a spherical shell with different Prandtl numbers	Wei Fan et.al.	2508.09416	null
2025-08-13	Dynamos driven by top-heavy double-diffusive convection in the strong-field regime	Wei Fan et.al.	2508.09410	null
2025-08-12	Understanding Dementia Speech Alignment with Diffusion-Based Image Generation	Mansi et.al.	2508.09385	null
2025-08-12	X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents	Guoxian Song et.al.	2508.09383	null
2025-08-12	UltraLight Med-Vision Mamba for Classification of Neoplastic Progression in Tubular Adenomas	Aqsa Sultana et.al.	2508.09339	null
2025-08-12	Lung-DDPM+: Efficient Thoracic CT Image Synthesis using Diffusion Probabilistic Model	Yifan Jiang et.al.	2508.09327	null
2025-08-12	Quantum correction to the Langevin cross section in resonant-exchange processes	I. Simbotin et.al.	2508.09302	null
2025-08-12	Evolution of a Long-Lived Deep-Seated Main-Sequence Magnetic Field During White Dwarf Cooling	Matias Castro-Tapia et.al.	2508.09268	null
2025-08-12	TFZ: Topology-Preserving Compression of 2D Symmetric and Asymmetric Second-Order Tensor Fields	Nathaniel Gorski et.al.	2508.09235	null
2025-08-12	GSMT: Graph Fusion and Spatiotemporal TaskCorrection for Multi-Bus Trajectory Prediction	Fan Ding et.al.	2508.09227	null
2025-08-12	Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models	Wen Wang et.al.	2508.09138	null
2025-08-12	Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices	Ya Zou et.al.	2508.09136	null
2025-08-13	Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer	Zixin Yin et.al.	2508.09131	null
2025-08-13	Robust quantum computational advantage with programmable 3050-photon Gaussian boson sampling	Hua-Liang Liu et.al.	2508.09092	null
2025-08-13	Direct Measurement of Electron Heating in Electron-Only Reconnection in a Laboratory Mini-Magnetosphere	Lucas Rovige et.al.	2508.09086	null
2025-08-12	Rankin-Selberg integrals for $\mathrm{GSpin}$ groups with application to the global Gan-Gross-Prasad conjecture	Pan Yan et.al.	2508.09066	null
2025-08-12	Per-Query Visual Concept Learning	Ori Malca et.al.	2508.09045	null
2025-08-12	Stochastic Decentralized Optimization of Non-Smooth Convex and Convex-Concave Problems over Time-Varying Networks	Maxim Divilkovskiy et.al.	2508.09029	null
2025-08-12	Envisioning Generative Artificial Intelligence in Cartography and Mapmaking	Yuhao Kang et.al.	2508.09028	null
2025-08-12	TaoCache: Structure-Maintained Video Generation Acceleration	Zhentao Fan et.al.	2508.08978	null
2025-08-12	Urban-STA4CLC: Urban Theory-Informed Spatio-Temporal Attention Model for Predicting Post-Disaster Commercial Land Use Change	Ziyi Guo et.al.	2508.08976	null
2025-08-12	Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation	Soo-Whan Chung et.al.	2508.08953	null
2025-08-12	Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation	Ao Ma et.al.	2508.08949	null
2025-08-12	EGGCodec: A Robust Neural Encodec Framework for EGG Reconstruction and F0 Extraction	Rui Feng et.al.	2508.08924	null
2025-08-12	When and How Ultrasound Enhances Nanoparticle Diffusion in Hydrogels: A Stick-and-Release Mechanism	Pablo M. Blanco et.al.	2508.08918	null
2025-08-12	Sound Signal Synthesis with Auxiliary Classifier GAN, COVID-19 cough as an example	Yahya Sherif Solayman Mohamed Saleh et.al.	2508.08892	null
2025-08-12	Transient Noise Removal via Diffusion-based Speech Inpainting	Mordehay Moradi et.al.	2508.08890	null
2025-08-12	DiffPhysCam: Differentiable Physics-Based Camera Simulation for Inverse Rendering and Embodied AI	Bo-Hsun Chen et.al.	2508.08831	null
2025-08-12	Geometry-Aware Global Feature Aggregation for Real-Time Indirect Illumination	Meng Gai et.al.	2508.08826	null
2025-08-12	TARA: Token-Aware LoRA for Composable Personalization in Diffusion Models	Yuqi Peng et.al.	2508.08812	null
2025-08-12	Identity-Preserving Aging and De-Aging of Faces in the StyleGAN Latent Space	Luis S. Luevano et.al.	2508.08808	null
2025-08-12	Anomalous Sodium Insertion in Highly Oriented Graphite: Thermodynamics, Kinetics and Evidence for Two-Sided Intercalation	Chuanhai Gan et.al.	2508.08806	null
2025-08-14	Measurement-Based Quantum Diffusion Models	Xinyu Liu et.al.	2508.08799	null
2025-08-12	DiffPose-Animal: A Language-Conditioned Diffusion Framework for Animal Pose Estimation	Tianyu Xiong et.al.	2508.08783	null
2025-08-12	Patient-Adaptive Focused Transmit Beamforming using Cognitive Ultrasound	Wessel L. van Nierop et.al.	2508.08782	null
2025-08-12	Exploring Palette based Color Guidance in Diffusion Models	Qianru Qiu et.al.	2508.08754	null
2025-08-12	Elucidating Rectified Flow with Deterministic Sampler: Polynomial Discretization Complexity for Multi and One-step Models	Ruofeng Yang et.al.	2508.08735	null
2025-08-13	A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models	Lingzhe Zhang et.al.	2508.08712	null
2025-08-12	Towards Safe Imitation Learning via Potential Field-Guided Flow Matching	Haoran Ding et.al.	2508.08707	null
2025-08-12	SafeFix: Targeted Model Repair via Controlled Image Generation	Ouyang Xu et.al.	2508.08701	null
2025-08-12	Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos	Qi Zheng et.al.	2508.08700	null
2025-08-12	DiffVolume: Diffusion Models for Volume Generation in Limit Order Books	Zhuohan Wang et.al.	2508.08698	null
2025-08-12	Detecting Sterile Neutrino Dark Matter at MeV Gamma-Ray Observatories	Subaru Fujisawa et.al.	2508.08695	null
2025-08-12	Expert-Guided Diffusion Planner for Auto-bidding	Yunshan Peng et.al.	2508.08687	null
2025-08-12	In-Context Learning as Nonparametric Conditional Probability Estimation: Risk Bounds and Optimality	Chenrui Liu et.al.	2508.08673	null
2025-08-12	Nonlinear dynamics of reaction-diffusion wave trains under large and fully nonlocalized modulations	Joannis Alexopoulos et.al.	2508.08637	null
2025-08-14	Yan: Foundational Interactive Video Generation	Deheng Ye et.al.	2508.08601	null
2025-08-12	RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space	Jingyun Liang et.al.	2508.08588	null
2025-08-12	Unlocking the Potential of Diffusion Priors in Blind Face Restoration	Yunqi Miao et.al.	2508.08556	null
2025-08-12	UQGNN: Uncertainty Quantification of Graph Neural Networks for Multivariate Spatiotemporal Prediction	Dahai Yu et.al.	2508.08551	null
2025-08-12	Fluorescence time profile measurement of LAB based liquid scintillator in response to medium relativistic ion particles	Xiaojie Luo et.al.	2508.08546	null
2025-08-12	Transition to Petschek Reconnection in Subrelativistic Pair Plasmas: Implications for Particle Acceleration	Adam Robbins et.al.	2508.08533	null
2025-08-11	SynLLM: A Comparative Analysis of Large Language Models for Medical Tabular Synthetic Data Generation via Prompt Engineering	Arshia Ilaty et.al.	2508.08529	null
2025-08-11	Control-affine Schrödinger Bridge and Generalized Bohm Potential	Alexis M. H. Teter et.al.	2508.08511	null
2025-08-11	CObL: Toward Zero-Shot Ordinal Layering without User Prompting	Aneel Damaraju et.al.	2508.08498	null
2025-08-11	MuGa-VTON: Multi-Garment Virtual Try-On via Diffusion Transformers with Prompt Customization	Ankan Deria et.al.	2508.08488	null
2025-08-11	MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling	Qian Wang et.al.	2508.08487	null
2025-08-11	Discrete Diffusion-Based Model-Level Explanation of Heterogeneous GNNs with Node Features	Pallabee Das et.al.	2508.08458	null
2025-08-11	Hot Jupiter formation in dense stellar clusters: A Monte Carlo model applied to 47 Tucanae	J. A. Wirth et.al.	2508.08406	null
2025-08-11	Wave Propagation Dynamics via Lattice Difference Equations	Eddy Kwessi et.al.	2508.08387	null
2025-08-11	Spatiotemporally Consistent Indoor Lighting Estimation with Diffusion Priors	Mutian Tong et.al.	2508.08384	null
2025-08-11	Exponentially Improved Constant in Quantum Solution Extraction	Gumaro Rendon et.al.	2508.08375	null
2025-08-11	StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation	Shuyuan Tu et.al.	2508.08248	null
2025-08-12	Cut2Next: Generating Next Shot via In-Context Tuning	Jingwen He et.al.	2508.08244	null
2025-08-13	BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion	Qiayuan Liao et.al.	2508.08241	null
2025-08-11	OMGSR: You Only Need One Mid-timestep Guidance for Real-World Image Super-Resolution	Zhiqiang Wu et.al.	2508.08227	null
2025-08-11	Learning User Preferences for Image Generation Model	Wenyi Mo et.al.	2508.08220	null
2025-08-11	Reinforcement Learning in Vision: A Survey	Weijia Wu et.al.	2508.08189	null
2025-08-13	CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data	Chongke Bi et.al.	2508.08173	null
2025-08-11	ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction	Chaojun Ni et.al.	2508.08170	null
2025-08-11	An effective potential for generative modelling with active matter	Adrian Baule et.al.	2508.08146	null
2025-08-11	Reproducing and Extending Brownian Motion in Optical Trap: A Computational Reimplementation of Volpe and Volpe (2013)	Eyad I. B Hamid et.al.	2508.08138	null
2025-08-11	FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting	Yitong Yang et.al.	2508.08136	null
2025-08-11	Optimal Dividend, Reinsurance, and Capital Injection Strategies for an Insurer with Two Collaborating Business Lines	Tim J. Boonen et.al.	2508.08130	null
2025-08-11	Learned Regularization for Microwave Tomography	Bowen Tong et.al.	2508.08114	null
2025-08-11	TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning	Junzhe Xu et.al.	2508.08098	null
2025-08-11	Fast and Generalizable parameter-embedded Neural Operators for Lithium-Ion Battery Simulation	Amir Ali Panahi et.al.	2508.08087	null
2025-08-11	Matrix-3D: Omnidirectional Explorable 3D World Generation	Zhongqi Yang et.al.	2508.08086	null
2025-08-12	Why Bohmian velocity might not be the only quantum velocity and the role of quantum diffusion flux is super-luminal wave packets	Charalampos Antonakos et.al.	2508.08065	null
2025-08-11	S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix	Peng Dai et.al.	2508.08048	null
2025-08-12	Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation	Fangyuan Mao et.al.	2508.07981	null
2025-08-11	Well-posedness for a fourth-order nonisothermal tumor growth model of Caginalp type	Giulia Cavalleri et.al.	2508.07979	null
2025-08-12	Adaptive Multiple Access and Service Placement for Generative Diffusion Models	Hamidreza Mazandarani et.al.	2508.07978	null
2025-08-11	Deep imaging of the galaxy Malin 2 shows new faint structures and a candidate satellite dwarf galaxy	Junais et.al.	2508.07930	null
2025-08-11	Score Augmentation for Diffusion Models	Liang Hou et.al.	2508.07926	null
2025-08-11	Generative Video Matting	Yongtao Ge et.al.	2508.07905	null
2025-08-11	Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models	Johanna P. Müller et.al.	2508.07903	null
2025-08-12	Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation	Bowen Xue et.al.	2508.07901	null
2025-08-11	NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction	Tianle Zeng et.al.	2508.07897	null
2025-08-11	Deep Learning-Based Desikan-Killiany Parcellation of the Brain Using Diffusion MRI	Yousef Sadegheih et.al.	2508.07815	null
2025-08-11	DiTVR: Zero-Shot Diffusion Transformer for Video Restoration	Sicheng Gao et.al.	2508.07811	null
2025-08-11	MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks	Yushen Xu et.al.	2508.07803	null
2025-08-11	Generative Inversion for Property-Targeted Materials Design: Application to Shape Memory Alloys	Cheng Li et.al.	2508.07798	null
2025-08-11	Feynman-Kac formula gor general time dependent stochastic parabolic equation on a bounded domain and applications	Yaozhong Hu et.al.	2508.07793	null
2025-08-13	AgentWorld: An Interactive Simulation Platform for Scene Construction and Mobile Robotic Manipulation	Yizheng Zhang et.al.	2508.07770	null
2025-08-11	Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation	Xiaoyan Liu et.al.	2508.07769	null
2025-08-11	Sea-Undistort: A Dataset for Through-Water Image Restoration in High Resolution Airborne Bathymetric Mapping	Maximilian Kromer et.al.	2508.07760	null
2025-08-11	Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild	Haoran Wang et.al.	2508.07759	null
2025-08-11	Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion	Minseo Kim et.al.	2508.07755	null
2025-08-11	Grouped Speculative Decoding for Autoregressive Image Generation	Junhyuk So et.al.	2508.07747	null
2025-08-11	Is GAN Necessary for Mel-Spectrogram-based Neural Vocoder?	Hui-Peng Du et.al.	2508.07711	null
2025-08-11	Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing	Weitao Wang et.al.	2508.07700	null
2025-08-11	DiffVC-OSD: One-Step Diffusion-based Perceptual Neural Video Compression Framework	Wenzhuo Ma et.al.	2508.07682	null
2025-08-11	LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering	Xiaohang Zhan et.al.	2508.07647	null
2025-08-11	X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning	Jian Ma et.al.	2508.07607	null
2025-08-11	LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation	Wenhui Song et.al.	2508.07603	null
2025-08-11	ShoulderShot: Generating Over-the-Shoulder Dialogue Videos	Yuang Zhang et.al.	2508.07597	null
2025-08-11	Procedural Mixture Sets	Hendrik Rommeswinkel et.al.	2508.07588	null
2025-08-12	From Platform Migration to Cultural Integration: the Ingress and Diffusion of #wlw from TikTok to RedNote in Queer Women Communities	Ziqi Pan et.al.	2508.07579	null
2025-08-11	UniFlow: Unifying Speech Front-End Tasks via Continuous Generative Modeling	Ziqian Wang et.al.	2508.07558	null
2025-08-11	Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation	Minghao Yin et.al.	2508.07557	null
2025-08-11	Physics-informed Multiresolution Wavelet Neural Network Method for Solving Partial Differential Equations	Feng Han et.al.	2508.07546	null
2025-08-11	Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing	Joonghyuk Shin et.al.	2508.07519	null
2025-08-10	Forecasting solar power output in Ibadan: A machine learning approach leveraging weather data and system specifications	Obarotu Peter Urhuerhi et.al.	2508.07462	null
2025-08-10	Unified Semiclassical Theory of Nonlinear Hall Effect:Bridging Ballistic and Diffusive Transport Regime	Xinyu Liu et.al.	2508.07445	null
2025-08-10	Robust, fast, and adaptive splitting schemes for nonlinear doubly-degenerate diffusion equations	Ayesha Javed et.al.	2508.07420	null
2025-08-10	CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization	Youqi Wang et.al.	2508.07413	null
2025-08-10	Conditional splitting probabilities for hidden-state inference in drift-diffusive processes	Emir Sezik et.al.	2508.07386	null
2025-08-10	Supercritical fluids as a distinct state of matter characterized by sub-short-range structural order	Sha Jin et.al.	2508.07385	null
2025-08-10	SODiff: Semantic-Oriented Diffusion Model for JPEG Compression Artifacts Removal	Tingyu Yang et.al.	2508.07346	null
2025-08-10	CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation	Fangtai Wu et.al.	2508.07341	null
2025-08-10	Linear-Quadratic Mean Field Games with Common Noise: A Direct Approach	Wenyu Cong et.al.	2508.07271	null
2025-08-10	Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers	Xin Ma et.al.	2508.07246	null
2025-08-10	Causal Negative Sampling via Diffusion Model for Out-of-Distribution Recommendation	Chu Zhao et.al.	2508.07243	null
2025-08-10	HaDM-ST: Histology-Assisted Differential Modeling for Spatial Transcriptomics Generation	Xuepeng Liu et.al.	2508.07225	null
2025-08-10	Neural Bridge Processes	Jian Xu et.al.	2508.07220	null
2025-08-10	Explainability-in-Action: Enabling Expressive Manipulation and Tacit Understanding by Bending Diffusion Models in ComfyUI	Ahmed M. Abuzuraiq et.al.	2508.07183	null
2025-08-10	CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion	Xiaotong Lin et.al.	2508.07162	null
2025-08-10	SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models	Ruolin Yang et.al.	2508.07149	null
2025-08-10	Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction	Yu Liu et.al.	2508.07146	null
2025-08-10	SketchConcept: Sketching-based Concept Recomposition for Product Design using Generative AI	Runlin Duan et.al.	2508.07141	null
2025-08-10	Canvas3D: Empowering Precise Spatial Control for Image Generation with Constraints from a 3D Virtual Canvas	Runlin Duan et.al.	2508.07135	null
2025-08-10	On the geometric Brownian motion with state-dependent variable exponent diffusion term	Mustafa Avci et.al.	2508.07130	null
2025-08-10	Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays	Gregory Schuit et.al.	2508.07128	null
2025-08-10	Modelling Human Skin Morphology and Simulating Transdermal Transport of 50 Chemicals	Milana Tesfamarian et.al.	2508.07123	null
2025-08-09	DexFruit: Dexterous Manipulation and Gaussian Splatting Inspection of Fruit	Aiden Swann et.al.	2508.07118	null
2025-08-09	Whisfusion: Parallel ASR Decoding via a Diffusion Transformer	Taeyoun Kwon et.al.	2508.07048	null
2025-08-09	A Stage-Aware Mixture of Experts Framework for Neurodegenerative Disease Progression Modelling	Tiantian He et.al.	2508.07032	null
2025-08-09	Trustworthy Medical Imaging with Large Language Models: A Study of Hallucinations Across Modalities	Anindya Bijoy Das et.al.	2508.07031	null
2025-08-09	Vec2Summ: Text Summarization via Probabilistic Sentence Embeddings	Mao Li et.al.	2508.07017	null
2025-08-12	HiMat: DiT-based Ultra-High Resolution SVBRDF Generation	Zixiong Wang et.al.	2508.07011	null
2025-08-09	Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments	Gian Mario Favero et.al.	2508.07006	null
2025-08-09	Mechanism of Anisotropic Crystallization and Phase Transitions under Van der Waals Squeezing	Yuxiang Gao et.al.	2508.06992	null
2025-08-09	WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering	Yixin Zhu et.al.	2508.06982	null
2025-08-09	Structure-Preserving Digital Twins via Conditional Neural Whitney Forms	Brooks Kinch et.al.	2508.06981	null
2025-08-09	CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing	Weiyan Xie et.al.	2508.06937	null
2025-08-09	Unveiling the Puzzle of Brittleness in Single Crystal Iridium	Qing Cheng et.al.	2508.06929	null
2025-08-09	AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning	Shihao Yuan et.al.	2508.06924	null
2025-08-09	Talk2Image: A Multi-Agent System for Multi-Turn Image Generation and Editing	Shichao Ma et.al.	2508.06916	null
2025-08-09	MultiRef: Controllable Image Generation with Multiple Visual References	Ruoxi Chen et.al.	2508.06905	null
2025-08-09	Text to Speech System for Meitei Mayek Script	Gangular Singh Irengbam et.al.	2508.06870	null
2025-08-09	Speech Enhancement based on cascaded two flow	Seonggyu Lee et.al.	2508.06842	null
2025-08-09	FlowSE: Flow Matching-based Speech Enhancement	Seonggyu Lee et.al.	2508.06840	null
2025-08-09	Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models	Shiqian Zhao et.al.	2508.06837	null
2025-08-09	A Score-based Diffusion Model Approach for Adaptive Learning of Stochastic Partial Differential Equation Solutions	Toan Huynh et.al.	2508.06834	null
2025-08-09	Efficient data-driven regression for reduced-order modeling of spatial pattern formation	Alessandro Alla et.al.	2508.06833	null
2025-08-09	Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation	Xiao Huang et.al.	2508.06806	null
2025-08-09	D3P: Dynamic Denoising Diffusion Policy via Reinforcement Learning	Shu-Ang Yu et.al.	2508.06804	null
2025-08-09	GaN/InN HEMT based UV photodetector on SiC with hexagonal boron nitride passivation	Mustafa Kilin et.al.	2508.06782	null
2025-08-08	Topology Generation of UAV Covert Communication Networks: A Graph Diffusion Approach with Incentive Mechanism	Xin Tang et.al.	2508.06746	null
2025-08-08	Design of high-mobility p-type GaN via the piezomobility tensor	Jie-Cheng Chen et.al.	2508.06723	null
2025-08-08	Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video	Jixuan He et.al.	2508.06715	null
2025-08-08	LightSwitch: Multi-view Relighting with Material-guided Diffusion	Yehonathan Litman et.al.	2508.06494	null
2025-08-08	Weak approximation of stochastic differential equations with sticky boundary conditions	Akash Sharma et.al.	2508.06487	null
2025-08-08	SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token Pruning	Lingkun Long et.al.	2508.06447	null
2025-08-08	SPARSE Data, Rich Results: Few-Shot Semi-Supervised Learning via Class-Conditioned Image Translation	Guido Manni et.al.	2508.06429	null
2025-08-08	4D operando X-ray nano-holo-tomography reveals multiscale chemomechanics in Silicon-Graphite anode	Victor Vanpeene et.al.	2508.06413	null
2025-08-08	FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation	Wenbin Teng et.al.	2508.06392	null
2025-08-08	Diffuse measures and nonlinear parabolic equations	Francesco Petitta et.al.	2508.06384	null
2025-08-08	ActivityDiff: A diffusion model with Positive and Negative Activity Guidance for De Novo Drug Design	Renyi Zhou et.al.	2508.06364	null
2025-08-08	Quantum Algorithm for Estimating Intrinsic Geometry	Nhat A. Nghiem et.al.	2508.06355	null
2025-08-08	Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging?	Xin Ci Wong et.al.	2508.06327	null
2025-08-08	OM2P: Offline Multi-Agent Mean-Flow Policy	Zhuoran Li et.al.	2508.06269	null
2025-08-08	ADPro: a Test-time Adaptive Diffusion Policy for Robot Manipulation via Manifold and Initial Noise Constraints	Zezeng Li et.al.	2508.06266	null
2025-08-08	Tanaka formula for SDEs driven by fractional Brownian motion	Tommi Sottinen et.al.	2508.06261	null
2025-08-08	Low dimensional dynamics of a sparse balanced synaptic network of quadratic integrate-and-fire neurons	Maria V. Ageeva et.al.	2508.06253	null
2025-08-08	Light-Addressable Smart Nanostructures via Resonant Nanoheating	Victor Tabouillot et.al.	2508.06215	null
2025-08-08	Inverse Source Problems for the Time-Fractional Evolution Equation	Rahmonov Askar Ahmadovich et.al.	2508.06209	null
2025-08-08	Clinically-guided Data Synthesis for Laryngeal Lesion Detection	Chiara Baldini et.al.	2508.06182	null
2025-08-08	Synthetic Data-Driven Multi-Architecture Framework for Automated Polyp Segmentation Through Integrated Detection and Mask Generation	Ojonugwa Oluwafemi Ejiga Peter et.al.	2508.06170	null
2025-08-08	Sharp non-existence threshold for a parabolic Hardy-H{é}non equation with quasilinear diffusion	Razvan Gabriel Iagar et.al.	2508.06164	null
2025-08-08	Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment	Zhenbang Du et.al.	2508.06160	null
2025-08-08	Revealing the Staging Structural Evolution and Li (De)Intercalation Kinetics in Graphite Anodes via Machine Learning Potential	Liqi Wang et.al.	2508.06156	null
2025-08-08	VISTAR:A User-Centric and Role-Driven Benchmark for Text-to-Image Evaluation	Kaiyuan Jiang et.al.	2508.06152	null
2025-08-08	Improving Diagnostic Accuracy for Oral Cancer with inpainting Synthesis Lesions Generated Using Diffusion Models	Yong Oh Lee et.al.	2508.06151	null
2025-08-08	DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera	Shaohua Pan et.al.	2508.06139	null
2025-08-08	GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving	Jian Wang et.al.	2508.06113	null
2025-08-08	MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment	Gui Zou et.al.	2508.06104	null
2025-08-08	UGD-IML: A Unified Generative Diffusion-based Framework for Constrained and Unconstrained Image Manipulation Localization	Yachun Mi et.al.	2508.06101	null
2025-08-08	MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows	Xiquan Li et.al.	2508.06098	null
2025-08-08	E-React: Towards Emotionally Controlled Synthesis of Human Reactions	Chen Zhu et.al.	2508.06093	null
2025-08-08	SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment	Yanxiao Sun et.al.	2508.06082	null
2025-08-08	DreamVE: Unified Instruction-based Image and Video Editing	Bin Xia et.al.	2508.06080	null
2025-08-08	Towards MR-Based Trochleoplasty Planning	Michael Wehrli et.al.	2508.06076	null
2025-08-08	Radio continuum and \HI 21-cm line observations of a nearby luminous infrared galaxy IRAS 17526+3253	Jianfeng Wu et.al.	2508.06075	null
2025-08-08	Real-time physics-informed reconstruction of transient fields using sensor guidance and higher-order time differentiation	Hong-Kyun Noh et.al.	2508.06070	null
2025-08-08	ThematicPlane: Bridging Tacit User Intent and Latent Spaces for Image Generation	Daniel Lee et.al.	2508.06065	null
2025-08-08	NEP: Autoregressive Image Editing via Next Editing Token Prediction	Huimin Wu et.al.	2508.06044	null
2025-08-08	Bayesian Radio Map Estimation: Fundamentals and Implementation via Diffusion Models	Tien Ngoc Ha et.al.	2508.06037	null
2025-08-08	InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow	Yiming Gong et.al.	2508.06033	null
2025-08-08	Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body Parts	Kiran Chhatre et.al.	2508.06032	null
2025-08-08	Improved Sub-Visible Particle Classification in Flow Imaging Microscopy via Generative AI-Based Image Synthesis	Utku Ozbulak et.al.	2508.06021	null
2025-08-08	Vacuum Dealloyed Brass as Li-Metal Battery Current Collector: Effect of Zinc and Porosity	Eric V Woods et.al.	2508.06015	null
2025-08-08	ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors	Minsu Kim et.al.	2508.06014	null
2025-08-08	KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers Training	Kai Zhang et.al.	2508.06001	null
2025-08-08	*Global solutions in $L^{p}{v}L^{\infty}{x}$ for the Boltzmann equation in bounded domains*	Dingqun Deng et.al.	2508.05985	null
2025-08-08	Revisiting $μ$ SR Studies of Ion Dynamics in the Light of Extended Kubo-Toyabe Model	Takashi U. Ito et.al.	2508.05968	null
2025-08-08	Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents	Han Lin et.al.	2508.05954	null
2025-08-08	A 3DGS-Diffusion Self-Supervised Framework for Normal Estimation from a Single Image	Yanxing Liang et.al.	2508.05950	null
2025-08-08	Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution	Zhanyi Sun et.al.	2508.05941	null
2025-08-08	Reverse Diffusion Sequential Monte Carlo Samplers	Luhuan Wu et.al.	2508.05926	null
2025-08-08	Fast, Convex and Conditioned Network for Multi-Fidelity Vectors and Stiff Univariate Differential Equations	Siddharth Rout et.al.	2508.05921	null
2025-08-07	Measurement of All Flavor PeV Neutrino Flux using Combined Datasets from IceCube	Emre Yildizci et.al.	2508.05886	null
2025-08-07	Emerging ultra-wide band gap semiconductors for future high-frequency electronics	Emily M. Garrity et.al.	2508.05823	null
2025-08-07	FineDialFact: A benchmark for Fine-grained Dialogue Fact Verification	Xiangyan Chen et.al.	2508.05782	null
2025-08-07	MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss	Can Zhao et.al.	2508.05772	null
2025-08-07	UnGuide: Learning to Forget with LoRA-Guided Diffusion Models	Agnieszka Polowczyk et.al.	2508.05755	null
2025-08-07	Quantum Reservoir GAN	Hikaru Wakaura et.al.	2508.05716	null
2025-08-07	High multiplicity and global structure of coexistence states in a predator-prey model with saturation	Kousuke Kuto et.al.	2508.05714	null
2025-08-07	Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation	Yue Liao et.al.	2508.05635	null
2025-08-07	GAP: Gaussianize Any Point Clouds with Text Guidance	Weiqi Zhang et.al.	2508.05631	null
2025-08-07	Latent Space Diffusion for Topology Optimization	Aaron Lutheran et.al.	2508.05624	null
2025-08-07	Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision	Luozheng Qin et.al.	2508.05606	null
2025-08-07	Unveiling the Lithium-Ion Transport Mechanism in Li2ZrCl6 Solid-State Electrolyte via Deep Learning-Accelerated Molecular Dynamics Simulations	Hanzeng Guo et.al.	2508.05598	null
2025-08-07	Discrepancy-Aware Contrastive Adaptation in Medical Time Series Analysis	Yifan Wang et.al.	2508.05572	null
2025-08-07	MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips	Shibo Wang et.al.	2508.05506	null
2025-08-07	Heat and super-diffusive melting fronts in unsaturated porous media	Eirik G. Flekkøy et.al.	2508.05451	null
2025-08-07	Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI	Krzysztof Janowicz et.al.	2508.05432	null
2025-08-07	MolSnap: Snap-Fast Molecular Generation with Latent Variational Mean Flow	Md Atik Ahamed et.al.	2508.05411	null
2025-08-07	UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation	Wonjun Kang et.al.	2508.05399	null
2025-08-07	Real-Time Iteration Scheme for Diffusion Policy	Yufei Duan et.al.	2508.05396	null
2025-08-09	Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms	Jie Xiao et.al.	2508.05387	null
2025-08-07	Multi-Modal Multi-Behavior Sequential Recommendation with Conditional Diffusion-Based Feature Denoising	Xiaoxi Cui et.al.	2508.05352	null
2025-08-07	Stranski-Krastanov Growth of Disordered ScNx Thin Films on MgO(100): Influence of Defect Densities on Electronic Structure and Transport Properties	Susmita Chowdhury et.al.	2508.05330	null
2025-08-07	Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting	Frank Ruis et.al.	2508.05323	null
2025-08-07	Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces	Mathias Rose Bjare et.al.	2508.05306	null
2025-08-07	SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens	Nikita Dragunov et.al.	2508.05305	null
2025-08-07	An Investigation into the Distribution of Ratios of Particle Solver-based Likelihoods	Emil Løvbak et.al.	2508.05303	null
2025-08-07	Wavelet-Guided Dual-Frequency Encoding for Remote Sensing Change Detection	Xiaoyang Zhang et.al.	2508.05271	null
2025-08-07	B4DL: A Benchmark for 4D LiDAR LLM in Spatio-Temporal Understanding	Changho Choi et.al.	2508.05269	null
2025-08-07	SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion	Xiaoyang Zhang et.al.	2508.05264	null
2025-08-07	ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models	Yatong Lan et.al.	2508.05236	null
2025-08-07	Parabolic abstract evolution equations in cylindrical domains and uniformly local Sobolev spaces	Joly Romain et.al.	2508.05220	null
2025-08-07	An asymptotic-preserving active flux scheme for the hyperbolic heat equation in the diffusive scaling	Junming Duan et.al.	2508.05166	null
2025-08-07	RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer	Fangyu Du et.al.	2508.05115	null
2025-08-07	PoseGen: In-Context LoRA Finetuning for Pose-Controllable Long Human Video Generation	Jingxuan He et.al.	2508.05091	null
2025-08-07	MetaDiT: Enabling Fine-grained Constraints in High-degree-of Freedom Metasurface Design	Hao Li et.al.	2508.05076	null
2025-08-07	Align-for-Fusion: Harmonizing Triple Preferences via Dual-oriented Diffusion for Cross-domain Sequential Recommendation	Yongfu Zha et.al.	2508.05074	null
2025-08-07	FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer	Jian Zhu et.al.	2508.05069	null
2025-08-07	DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion	Yifeng Huang et.al.	2508.05060	null
2025-08-07	Observation of Super-ballistic Brownian Motion in Liquid	Jason Boynewicz et.al.	2508.05031	null
2025-08-07	Coupled 1D Chemical Kinetic-Transport and 2D Hydrodynamic Modeling Supports a modest 1-1.5x Supersolar Oxygen Abundance in Jupiter’s Atmosphere	Jeehyun Yang et.al.	2508.05007	null
2025-08-07	Switching Diffusion Systems with Past-Dependent Switching and Countable State Space: Successful Couplings and Strong Ergodicity	Fubao Xi et.al.	2508.04997	null
2025-08-08	REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers	Yuepeng Jiang et.al.	2508.04996	null
2025-08-07	Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression	Zheng Chen et.al.	2508.04979	null
2025-08-06	Simulation of Non-Premixed, Supersonic Combustion using the Discontinuous Galerkin Method on Fully Unstructured Grids	Cal J. Rising et.al.	2508.04930	null
2025-08-06	Taxonomy of Faults in Attention-Based Neural Networks	Sigma Jahan et.al.	2508.04925	null
2025-08-08	Learning AI Auditing: A Case Study of Teenagers Auditing a Generative AI Model	Luis Morales-Navarro et.al.	2508.04902	null
2025-08-06	The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models	Leo Zhang et.al.	2508.04884	null
2025-08-06	Unified Flow Matching for Long Horizon Event Forecasting	Xiao Shou et.al.	2508.04843	null
2025-08-06	Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off	Seungyong Lee et.al.	2508.04825	null
2025-08-06	Delay-constrained re-entry governs large-scale brain seizures and other network pathologies	Paul Triebkorn et.al.	2508.04824	null
2025-08-06	Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models	Mehrdad Moradi et.al.	2508.04818	null
2025-08-06	Stochastic Optimal Control with Control-Dependent Diffusion and State Constraints: A Degenerate Elliptic Approach	Anderson O. Calixto et.al.	2508.04809	null
2025-08-06	Electrodeless Magnetohydrodynamic Local Force Generator for Aerocapture	Bernard Parent et.al.	2508.04806	null
2025-08-06	ACM Multimedia Grand Challenge on ENT Endoscopy Analysis	Trong-Thuan Nguyen et.al.	2508.04801	null
2025-08-08	Quantum-impurity sensing of altermagnetic order	V. A. S. V. Bittencourt et.al.	2508.04788	null
2025-08-06	Edge-Assisted Collaborative Fine-Tuning for Multi-User Personalized Artificial Intelligence Generated Content (AIGC)	Nan Li et.al.	2508.04745	null
2025-08-06	A colossal dielectric response of HfxZr1-xO2 nanoparticles	Oleksandr S. Pylypchuk et.al.	2508.04697	null
2025-08-06	Diffusion in a $d$ -dimensional rough potential	Jacob Jeffries et.al.	2508.04674	null
2025-08-06	HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models	Young D. Kwon et.al.	2508.04663	null
2025-08-06	Stochastic Calculus for Pathwise Observables of Markov-Jump Processes: Unification of Diffusion and Jump Dynamics	Lars Torbjørn Stutzer et.al.	2508.04647	null
2025-08-06	A unified model for linear responses of physical networks	José M. Ortiz-Tavárez et.al.	2508.04616	null
2025-08-06	Multitask Learning with Stochastic Interpolants	Hugo Negrel et.al.	2508.04605	null
2025-08-07	A Comprehensive Framework for Uncertainty Quantification of Voxel-wise Supervised Models in IVIM MRI	Nicola Casali et.al.	2508.04588	null
2025-08-06	Joint Communication and Indoor Positioning Based on Visible Light in the Presence of Dimming	A. Tarik Leblebici et.al.	2508.04570	null
2025-08-06	DDTracking: A Deep Generative Framework for Diffusion MRI Tractography with Streamline Local-Global Spatiotemporal Modeling	Yijie Li et.al.	2508.04568	null
2025-08-06	TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning	Yunbi Liu et.al.	2508.04565	null
2025-08-06	Drone Detection with Event Cameras	Gabriele Magrini et.al.	2508.04564	null
2025-08-06	One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose	Jinxi Liu et.al.	2508.04559	null
2025-08-06	Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis	Angang Zhang et.al.	2508.04551	null
2025-08-06	MSC: A Marine Wildlife Video Dataset with Grounded Segmentation and Clip-Level Captioning	Quang-Trung Truong et.al.	2508.04549	null
2025-08-06	X-ray thermal diffuse scattering as a texture-robust temperature diagnostic for dynamically compressed solids	P. G. Heighway et.al.	2508.04525	null
2025-08-06	$β$ -Irida-Graphene: A New 2D Carbon Allotrope for Sodium-Ion Battery Anodes	José A. S. Laranjeira et.al.	2508.04506	null
2025-08-06	QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution	Bowen Chai et.al.	2508.04485	null
2025-08-06	Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model	Hongxu Chen et.al.	2508.04472	null
2025-08-06	4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation	Shuzhou Yang et.al.	2508.04467	null
2025-08-06	Case Studies of Generative Machine Learning Models for Dynamical Systems	Nachiket U. Bapat et.al.	2508.04459	null
2025-08-06	Cognitive Effort in the Two-Step Task: An Active Inference Drift-Diffusion Model Approach	Alvaro Garrido Perez et.al.	2508.04435	null
2025-08-06	Unmasking Interstitial Lung Diseases: Leveraging Masked Autoencoders for Diagnosis	Ethan Dack et.al.	2508.04429	null
2025-08-06	Hydrodynamic Effects in Cryogenic Buffer Gas Cells: Design Insights from Hybrid Simulations	Nick Vogeley et.al.	2508.04364	null
2025-08-06	Derivation and Numerical Simulation of a Thermodynamically Consistent Magneto Two-Phase Flow Model for Magnetic Drug Targeting	Eberhard Bänsch et.al.	2508.04360	null
2025-08-06	From Split to Share: Private Inference with Distributed Feature Sharing	Zihan Liu et.al.	2508.04346	null
2025-08-06	Performative Market Making	Charalampos Kleitsikas et.al.	2508.04344	null
2025-08-06	TempFlow-GRPO: When Timing Matters for GRPO in Flow Models	Xiaoxuan He et.al.	2508.04324	null
2025-08-06	Wave coupling in partially ionized plasmas with shear flows I. Fast-to-Alfvén transformation	Miquel Cantallops et.al.	2508.04319	null
2025-08-06	Turbulent Injection assisted by Diffusion Models for Scale Resolving Simulations	Margaux Boxho et.al.	2508.04318	null
2025-08-06	Parameter Estimation for Weakly Interacting Hypoelliptic Diffusions	Yuga Iguchi et.al.	2508.04287	null
2025-08-06	S2M3: Split-and-Share Multi-Modal Models for Distributed Multi-Task Inference on the Edge	JinYi Yoon et.al.	2508.04271	null
2025-08-06	Sparse Narrow-Band Topology Optimization for Large-Scale Thermal-Fluid Applications	Vladislav Pimanov et.al.	2508.04261	null
2025-08-06	High-Dimensional Matrix-Variate Diffusion Index Models for Time Series Forecasting	Zhiren Ma et.al.	2508.04259	null
2025-08-06	Suspensions of small ultra-soft colloids remain liquids in overcrowded conditions	Nikolaos A. Burger et.al.	2508.04244	null
2025-08-06	PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction	Muhua Zhu et.al.	2508.04236	null
2025-08-06	DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification	Saifullah Saifullah et.al.	2508.04233	null
2025-08-06	Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction	Yu Liu et.al.	2508.04229	null
2025-08-06	LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation	Kangrui Cen et.al.	2508.04228	null
2025-08-06	DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models	Saifullah Saifullah et.al.	2508.04208	null
2025-08-06	A background-free signal of jet-induced diffusion wake in quark-gluon plasma	Zhong Yang et.al.	2508.04194	null
2025-08-06	Deeper Inside Deep ViT	Sungrae Hong et.al.	2508.04181	null
2025-08-06	Quasi-Clique Discovery via Energy Diffusion	Yu Zhang et.al.	2508.04174	null
2025-08-06	Non-Equilibrium Dynamics and First-Passage Properties of Stochastic Processes: From Brownian Motion to Active Particles	Mathis Guéneau et.al.	2508.04154	null
2025-08-06	IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control	Lijuan Liu et.al.	2508.04147	null
2025-08-06	Polynomial-time sampling despite disorder chaos	Eric Ma et.al.	2508.04133	null
2025-08-06	Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation	Maximilian Ulmer et.al.	2508.04122	null
2025-08-06	Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework	Yi-Ting Chen et.al.	2508.04090	null
2025-08-06	Long time behavior and Yaglom limit for real trait-structured Birth and Death Processes	Pierre Collet et.al.	2508.04089	null
2025-08-06	Convolutional autoencoders for the reconstruction of three-dimensional interfacial multiphase flows	Murray Cutforth et.al.	2508.04084	null
2025-08-06	POD-based reduced order modeling of global-in-time iterative decoupled algorithms for Biot’s consolidation model	Huipeng Gu et.al.	2508.04082	null
2025-08-06	Uni-DocDiff: A Unified Document Restoration Model Based on Diffusion	Fangmin Zhao et.al.	2508.04055	null
2025-08-06	Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation	Jiayi He et.al.	2508.04049	null
2025-08-06	Nonlinear stability of two-dimensional periodic waves in parabolic systems with conservation laws	L. Miguel Rodrigues et.al.	2508.04023	null
2025-08-07	S $^2$ Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation	Weilun Feng et.al.	2508.04016	null
2025-08-06	Constructing Generalized Sample Transition Probabilities with Biased Simulations	Yanbin Wang et.al.	2508.03977	null
2025-08-05	Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm	Lin Zhang et.al.	2508.03955	null
2025-08-05	Point-Based Shape Representation Generation with a Correspondence-Preserving Diffusion Model	Shen Zhu et.al.	2508.03925	null
2025-08-05	Coefficient Identification Problem with Integral Overdetermination Condition for Diffusion Equations	R. R. Ashurov et.al.	2508.03859	null
2025-08-05	VAE-DNN: Energy-Efficient Trainable-by-Parts Surrogate Model For Parametric Partial Differential Equations	Yifei Zong et.al.	2508.03839	null
2025-08-05	HPSv3: Towards Wide-Spectrum Human Preference Score	Yuhang Ma et.al.	2508.03789	null
2025-08-05	LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation	Jianxiong Gao et.al.	2508.03694	null
2025-08-05	LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences	Ao Liang et.al.	2508.03692	null
2025-08-05	La La LiDAR: Large-Scale Layout Generation from LiDAR Data	Youquan Liu et.al.	2508.03691	null
2025-08-05	Veila: Panoramic LiDAR Generation from a Monocular RGB Image	Youquan Liu et.al.	2508.03690	null
2025-08-05	OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World	Katherine Liu et.al.	2508.03669	null
2025-08-05	Rigidity for graph product von Neumann algebras	Camille Horbez et.al.	2508.03662	null
2025-08-05	DiWA: Diffusion Policy Adaptation with World Models	Akshay L Chandra et.al.	2508.03645	null
2025-08-05	Likelihood Matching for Diffusion Models	Lei Qian et.al.	2508.03636	null
2025-08-05	Radiative Nonideal MHD Simulations of Inner Protoplanetary Disks: Temperature Structures, Asymmetric Winds, and Episodic Surface Accretion	Shoji Mori et.al.	2508.03624	null
2025-08-05	Expanding the Standard Diffusion Process to Specified Non-Gaussian Marginal Distributions	Robert Richardson et.al.	2508.03617	null
2025-08-05	CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models	Ana Lawry Aguila et.al.	2508.03594	null
2025-08-05	Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection	Long Qian et.al.	2508.03539	null
2025-08-05	X-ray Halos of Early-Type Galaxies with AGN Feedback and Accretion from a Circumgalactic Medium: models and observations	Silvia Pellegrini et.al.	2508.03536	null
2025-08-05	CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation	Kaishen Yuan et.al.	2508.03535	null
2025-08-05	LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Text-to-Image Generation	Lianwei Yang et.al.	2508.03485	null
2025-08-05	When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models	Dasol Choi Jihwan Lee et.al.	2508.03483	null
2025-08-05	Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models	Hyungjin Kim et.al.	2508.03481	null
2025-08-05	VideoGuard: Protecting Video Content from Unauthorized Editing	Junjie Cao et.al.	2508.03480	null
2025-08-05	Learning to Incentivize: LLM-Empowered Contract for AIGC Offloading in Teleoperation	Zijun Zhan et.al.	2508.03464	null
2025-08-06	READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation	Haotian Wang et.al.	2508.03457	null
2025-08-05	Error Estimates of Semi-Lagrangian Schemes for Diffusive Conservation Laws	Haruki Takemura et.al.	2508.03455	null
2025-08-05	RAAG: Ratio Aware Adaptive Guidance	Shangwen Zhu et.al.	2508.03442	null
2025-08-05	Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN	Shivangi Nigam et.al.	2508.03415	null
2025-08-05	SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models	Pingchuan Ma et.al.	2508.03402	null
2025-08-05	Delay-facilitated self-assembly in compartmentalized systems	Severin Angerpointner et.al.	2508.03383	null
2025-08-05	Diffusion Once and Done: Degradation-Aware LoRA for Efficient All-in-One Image Restoration	Ni Tang et.al.	2508.03373	null
2025-08-05	A Closed-Loop Multi-Agent Framework for Aerodynamics-Aware Automotive Styling Design	Xinyu Jin et.al.	2508.03370	null
2025-08-05	GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images	Yifei Sun et.al.	2508.03357	null
2025-08-05	Quenching time and probability estimates for a stochastic reaction-diffusion system with coupled inner singular absorption terms driven by mixed noises	Nikos I. Kavallaris et.al.	2508.03354	null
2025-08-06	Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation	Xunzhi Xiang et.al.	2508.03334	null
2025-08-05	Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation	Peiyu Wang et.al.	2508.03320	null
2025-08-05	Thermal Metamaterials for Enhanced Non-Fourier Heat Transport	Harry Mclean et.al.	2508.03316	null
2025-08-05	The non-isothermal Maxwell-Stefan asymptotics of the multi-species Boltzmann equations	Xinqiu Chen et.al.	2508.03311	null
2025-08-05	Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation	Jun Luo et.al.	2508.03300	null
2025-08-05	Investigation on deep learning-based galaxy image translation models	Hengxin Ruan et.al.	2508.03291	null
2025-08-07	Well-Posedness of the Cauchy Problem for One-Dimensional Nonlinear Diffusion Equations with Dynamic and Fourth-Type Boundary Conditions in the Lp Lq Maximal Regularity Setting	Ken Furukawa et.al.	2508.03288	null
2025-08-07	Global solvability for doubly degenerate nutrient taxis system with a wide range of bacterial responses in physical dimension	Bao-Ngoc Tran et.al.	2508.03268	null
2025-08-05	Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation	Gang Dai et.al.	2508.03256	null
2025-08-05	V.I.P. : Iterative Online Preference Distillation for Efficient Video Diffusion Models	Jisoo Kim et.al.	2508.03254	null
2025-08-05	Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion	Wentao Qu et.al.	2508.03252	null
2025-08-06	FFHQ-Makeup: Paired Synthetic Makeup Dataset with Facial Consistency Across Multiple Styles	Xingchao Yang et.al.	2508.03241	null
2025-08-05	BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models	Yu Pan et.al.	2508.03221	null
2025-08-05	Timing is everything: How subtle timing changes in MRI echo planar imaging can significantly alter mechanical vibrations and sound level	Amir Seginer et.al.	2508.03220	null
2025-08-05	Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance	Eliot Beyler et.al.	2508.03210	null
2025-08-05	Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models	Muhammed Saeed et.al.	2508.03199	null
2025-08-05	An Analytic Model to Determine the Interstitial-Solute Energetics and Underlying Mechanism in Refractory High-Entropy Alloys	Qianxi Zhu et.al.	2508.03163	null
2025-08-05	SARD: Segmentation-Aware Anomaly Synthesis via Region-Constrained Diffusion with Discriminative Mask Guidance	Yanshu Wang et.al.	2508.03143	null
2025-08-05	UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying	Chengyu Bai et.al.	2508.03142	null
2025-08-05	Filtering and 1/3 Power Law for Optimal Time Discretisation in Numerical Integration of Stochastic Differential Equations	Igor G. Vladimirov et.al.	2508.03135	null
2025-08-05	Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback	Jingyi Chen et.al.	2508.03123	null
2025-08-05	Power System Voltage Stability Boundary: Computational Results and Applications	Zhenyao Li et.al.	2508.03119	null
2025-08-05	T2UE: Generating Unlearnable Examples from Text Descriptions	Xingjun Ma et.al.	2508.03091	null
2025-08-05	MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation	Youran Zhou et.al.	2508.03083	null
2025-08-05	Multi-human Interactive Talking Dataset	Zeyu Zhu et.al.	2508.03050	null
2025-08-05	Urban In-Context Learning: Bridging Pretraining and Inference through Masked Diffusion for Urban Profiling	Ruixing Zhang et.al.	2508.03042	null
2025-08-05	Sparse Identification of Nonlinear Dynamics for Stochastic Delay Differential Equations	Dimitri Breda et.al.	2508.03040	null
2025-08-05	MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention	Qi Xie et.al.	2508.03034	null
2025-08-05	LiGen: GAN-Augmented Spectral Fingerprinting for Indoor Positioning	Jie Lin et.al.	2508.03024	null
2025-08-05	Generating Light-based Fingerprints for Indoor Localization	Hsun-Yu Lee et.al.	2508.03011	null
2025-08-05	Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models	Fan Yang et.al.	2508.03006	null
2025-08-05	Diffusion Models with Adaptive Negative Sampling Without External Resources	Alakh Desai et.al.	2508.02973	null
2025-08-05	Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver	Jonathan Patsenker et.al.	2508.02964	null
2025-08-04	X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio	Chenxu Zhang et.al.	2508.02944	null
2025-08-04	Documenting Patterns of Exoticism of Marginalized Populations within Text-to-Image Generators	Sourojit Ghosh et.al.	2508.02937	null
2025-08-06	A nonstandard finite difference scheme for an SEIQR epidemiological PDE model	Achraf Zinihi et.al.	2508.02928	null
2025-08-04	Goal-Oriented Adaptive Finite Element Multilevel Quasi-{M}onte {C}arlo	Joakim Beck et.al.	2508.02925	null
2025-08-04	How Diffusion Prior Landscapes Shape the Posterior in Blind Deconvolution	Minh-Hai Nguyen et.al.	2508.02923	null
2025-08-04	RDDPM: Robust Denoising Diffusion Probabilistic Model for Unsupervised Anomaly Segmentation	Mehrdad Moradi et.al.	2508.02903	null
2025-08-04	REFLECT: Rectified Flows for Efficient Brain Anomaly Correction Transport	Farzad Beizaee et.al.	2508.02889	null
2025-08-04	Memoirs of mass accretion: probing the edges of intracluster light in simulated galaxy clusters	Tara Dacunha et.al.	2508.02837	null
2025-08-04	DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework	Tongchun Zuo et.al.	2508.02807	null
2025-08-04	NASIM: Revealing the low surface brightness Universe from legacy VISTA data	Elham Saremi et.al.	2508.02780	null
2025-08-04	D2PPO: Diffusion Policy Policy Optimization with Dispersive Loss	Guowei Zou et.al.	2508.02644	null
2025-08-04	CAK: Emergent Audio Effects from Minimal Deep Learning	Austin Rockman et.al.	2508.02643	null
2025-08-04	Anticipating Decoherence: a Predictive Framework for Enhancing Coherence in Quantum Emitters	Pranshu Maan et.al.	2508.02638	null
2025-08-04	ReMoMask: Retrieval-Augmented Masked Motion Generation	Zhengdao Li et.al.	2508.02605	null
2025-08-04	Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction	Yuerong Song et.al.	2508.02558	null
2025-08-04	From Pixels to Pathology: Restoration Diffusion for Diagnostic-Consistent Virtual IHC	Jingsong Liu et.al.	2508.02528	null
2025-08-06	xDeepServe: Model-as-a-Service on Huawei CloudMatrix384	Ao Xiao et.al.	2508.02520	null
2025-08-04	QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots	Sheng Wu et.al.	2508.02512	null
2025-08-04	Quantitative and Predictive Folding Models from Limited Single-Molecule Data Using Simulation-Based Inference	Lars Dingeldein et.al.	2508.02509	null
2025-08-04	Toward Using Machine Learning as a Shape Quality Metric for Liver Point Cloud Generation	Khoa Tuan Nguyen et.al.	2508.02482	null
2025-08-04	PoseGuard: Pose-Guided Generation with Safety Guardrails	Kongxin Wang et.al.	2508.02476	null
2025-08-04	Efficient spin-pumping and spin-to-charge conversion in epitaxial Mn $_3$ Sn(0001) noncollinear antiferromagnetic films	Surya N. Panda et.al.	2508.02415	null
2025-08-04	Hydra: Accurate Multi-Modal Leaf Wetness Sensing with mm-Wave and Camera Fusion	Yimeng Liu et.al.	2508.02409	null
2025-08-04	Inference-time Scaling for Diffusion-based Audio Super-resolution	Yizhu Jin et.al.	2508.02391	null
2025-08-04	Talking Surveys: How Photorealistic Embodied Conversational Agents Shape Response Quality, Engagement, and Satisfaction	Matus Krajcovic et.al.	2508.02376	null
2025-08-04	Transport-Guided Rectified Flow Inversion: Improved Image Editing Using Optimal Transport Theory	Marian Lupascu et.al.	2508.02363	null
2025-08-04	Qwen-Image Technical Report	Chenfei Wu et.al.	2508.02324	null
2025-08-04	Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images	Philipp Wulff et.al.	2508.02323	null
2025-08-05	LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training	Sikui Zhang et.al.	2508.02308	null
2025-08-05	Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor	Xiaoliu Guan et.al.	2508.02240	null
2025-08-04	Abstract Formulation of Mean-Field Models and Propagation of Chaos	Tau Shean Lim et.al.	2508.02224	null
2025-08-04	A theory of strange metals	Simone Fratini et.al.	2508.02221	null
2025-08-04	Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference	Yuxuan Song et.al.	2508.02193	null
2025-08-04	DreamPainter: Image Background Inpainting for E-commerce Scenarios	Sijie Zhao et.al.	2508.02155	null
2025-08-04	AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models	Die Chen et.al.	2508.02151	null
2025-08-04	VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling	Yuru Xiao et.al.	2508.02129	null
2025-08-04	AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation	Zhiwen Li et.al.	2508.02107	null
2025-08-04	Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis	Kaiyang Ji et.al.	2508.02106	null
2025-08-04	“Stack It Up!”: 3D Stable Structure Generation from 2D Hand-drawn Sketch	Yiqing Xu et.al.	2508.02093	null
2025-08-04	Unsupervised Multi-channel Speech Dereverberation via Diffusion	Yulun Wu et.al.	2508.02071	null
2025-08-04	“Set It Up”: Functional Object Arrangement with Compositional Generative Models	Yiqing Xu et.al.	2508.02068	null
2025-08-04	StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion	Haoxin Yang et.al.	2508.02056	null
2025-08-04	Why Generate When You Can Transform? Unleashing Generative Attention for Dynamic Recommendation	Yuli Liu et.al.	2508.02050	null
2025-08-04	Conditional Diffusion Model with Anatomical-Dose Dual Constraints for End-to-End Multi-Tumor Dose Prediction	Hui Xie et.al.	2508.02043	null
2025-08-04	Frequency-Domain Denoising-Based in Vivo Fluorescence Imaging	XuHao Yu et.al.	2508.02025	null
2025-08-04	Significant Mobility Enhancement in Coupled AlGaN/GaN Quantum Wells considering Inter-Well Distance and Asymmetric Widths	Le Tri Dat et.al.	2508.02024	null
2025-08-05	Asymptotic analysis of the Allen-Cahn equation with dynamic boundary conditions of Cahn-Hilliard type	Pierluigi Colli et.al.	2508.02021	null
2025-08-04	Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention	Kyungmin Jo et.al.	2508.02004	null
2025-08-04	Generative Large-Scale Pre-trained Models for Automated Ad Bidding Optimization	Yu Lei et.al.	2508.02002	null
2025-08-04	Path-Integral Formulation of Bosonic Markovian Open Quantum Dynamics with Monte Carlo stochastic trajectories using the Glauber-Sudarshan P, Wigner, and Husimi Q Functions and Hybrids	Toma Yoneya et.al.	2508.01991	null
2025-08-04	Controllable and Stealthy Shilling Attacks via Dispersive Latent Diffusion	Shutong Qiao et.al.	2508.01987	null
2025-08-04	Diffusion models for inverse problems	Hyungjin Chung et.al.	2508.01975	null
2025-08-03	Distributed games with jumps: An $α$ -potential game approach	Xin Guo et.al.	2508.01929	null
2025-08-03	On the Non-Markovian Navier-Stokes Framework for Turbulence Modeling – A Preliminary Analysis	Siamak Kazemzadeh Hannani et.al.	2508.01890	null
2025-08-03	DiffusionFF: Face Forgery Detection via Diffusion-based Artifact Localization	Siran Peng et.al.	2508.01873	null
2025-08-05	Moment Estimate and Variational Approach for Learning Generalized Diffusion with Non-gradient Structures	Fanze Kong et.al.	2508.01854	null
2025-08-03	Diffusion-based 3D Hand Motion Recovery with Intuitive Physics	Yufei Zhang et.al.	2508.01835	null
2025-08-03	Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder	Runxuan Yang et.al.	2508.01796	null
2025-08-03	Exponential mixing for the stochastic Kuramoto-Sivashinsky equation on the 1D torus	Peng Gao et.al.	2508.01794	null
2025-08-03	DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion	Zhigang Sun et.al.	2508.01778	null
2025-08-03	Semantically-Guided Inference for Conditional Diffusion Models: Enhancing Covariate Consistency in Time Series Forecasting	Rui Ding et.al.	2508.01761	null
2025-08-03	Dynamic Coupling of Infiltration-Soil Moisture Feedback:Emergent Vegetation Patterns in a Water-Vegetation Model	Juan Yan et.al.	2508.01755	null
2025-08-03	Energy-Efficient Federated Learning for Edge Real-Time Vision via Joint Data, Computation, and Communication Design	Xiangwang Hou et.al.	2508.01745	null
2025-08-05	Imbalance-Robust and Sampling-Efficient Continuous Conditional GANs via Adaptive Vicinity and Auxiliary Regularization	Xin Ding et.al.	2508.01725	null
2025-08-03	ModFus-DM: Explore the Representation in Modulated Signal Diffusion Generated Models	Haoyue Tan et.al.	2508.01719	null
2025-08-05	Versatile Transition Generation with Image-to-Video Diffusion	Zuhao Yang et.al.	2508.01698	null
2025-08-03	DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing	Yufeng Chi et.al.	2508.01684	null
2025-08-03	DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding	Hanqing Wang et.al.	2508.01651	null
2025-08-03	StrandDesigner: Towards Practical Strand Generation with Sketch Guidance	Na Zhang et.al.	2508.01650	null
2025-08-03	Hamiltonian simulation for nonlinear partial differential equation by Schrödingerization	Shoya Sasaki et.al.	2508.01640	null
2025-08-03	VFP: Variational Flow-Matching Policy for Multi-Modal Robot Manipulation	Xuanran Zhai et.al.	2508.01622	null
2025-08-03	LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding	Xuanzhao Dong et.al.	2508.01617	null
2025-08-03	TCDiff: Triplex Cascaded Diffusion for High-fidelity Multimodal EHRs Generation with Incomplete Clinical Data	Yandong Yan et.al.	2508.01615	null
2025-08-03	Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models	Haoran Dai et.al.	2508.01605	null
2025-08-03	Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment	Lubin Gan et.al.	2508.01602	null
2025-08-03	CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation	Sung-Wook Lee et.al.	2508.01600	null
2025-08-03	Why Heuristic Weighting Works: A Theoretical Analysis of Denoising Score Matching	Juyan Zhang et.al.	2508.01597	null
2025-08-03	A Plug-and-Play Multi-Criteria Guidance for Diverse In-Betweening Human Motion Generation	Hua Yu et.al.	2508.01590	null
2025-08-03	Censored Sampling for Topology Design: Guiding Diffusion with Human Preferences	Euihyun Kim et.al.	2508.01589	null
2025-08-03	Diffusion Models for Future Networks and Communications: A Comprehensive Survey	Nguyen Cong Luong et.al.	2508.01586	null
2025-08-03	Tractography-Guided Dual-Label Collaborative Learning for Multi-Modal Cranial Nerves Parcellation	Lei Xie et.al.	2508.01577	null
2025-08-03	Sub 10 nm Nanochannels Enable Directional Quasi Ballistic Exciton Transport over 5 μm at Room Temperature	Xiao-Jie Wang et.al.	2508.01567	null
2025-08-03	MGCR-Net:Multimodal Graph-Conditioned Vision-Language Reconstruction Network for Remote Sensing Change Detection	Chengming Wang et.al.	2508.01555	null
2025-08-02	A Reward-Directed Diffusion Framework for Generative Design Optimization	Hadi Keramati et.al.	2508.01509	null
2025-08-02	Instruction-based Time Series Editing	Jiaxing Qiu et.al.	2508.01504	null
2025-08-02	The role of zealots in the spread of linguistic traits	Vivian Dornelas et.al.	2508.01500	null
2025-08-02	TreeDiff: AST-Guided Code Generation with Diffusion LLMs	Yiming Zeng et.al.	2508.01473	null
2025-08-02	Regression Augmentation With Data-Driven Segmentation	Shayan Alahyari et.al.	2508.01455	null
2025-08-02	Physically-based Lighting Augmentation for Robotic Manipulation	Shutong Jin et.al.	2508.01442	null
2025-08-02	Viscosity Stabilized Plug-and-Play Reconstruction	Arghya Sinha et.al.	2508.01441	null
2025-08-02	Parabolic-elliptic and indirect-direct simplifications in chemotaxis systems driven by indirect signalling	Le Trong Thanh Bui et.al.	2508.01436	null
2025-08-02	Artificial Intelligence and Misinformation in Art: Can Vision Language Models Judge the Hand or the Machine Behind the Canvas?	Tarian Fu et.al.	2508.01408	null
2025-08-02	StyleSentinel: Reliable Artistic Copyright Verification via Stylistic Fingerprints	Lingxiao Chen et.al.	2508.01335	null
2025-08-05	Zero-shot Segmentation of Skin Conditions: Erythema with Edit-Friendly Inversion	Konstantinos Moutselos et.al.	2508.01334	null
2025-08-02	LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points	Xuemiao Zhang et.al.	2508.01317	null
2025-08-02	CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis	Alec Sargood et.al.	2508.01292	null
2025-08-02	PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation	Zonglei Jing et.al.	2508.01272	null
2025-08-02	Enhancing Diffusion-based Dataset Distillation via Adversary-Guided Curriculum Sampling	Lexiao Zou et.al.	2508.01264	null
2025-08-02	NS-Net: Decoupling CLIP Semantic Information through NULL-Space for Generalizable AI-Generated Image Detection	Jiazhen Yan et.al.	2508.01248	null
2025-08-02	Effect of protection zone on the dynamics of a diffusion-advection population-toxicant model	Jing Gao et.al.	2508.01246	null
2025-08-02	Sliding two-dimensional superconductivity and charge-density-wave state in a bulk crystal	Xiangqi Liu et.al.	2508.01241	null
2025-08-02	SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches	Cheng Tan et.al.	2508.01237	null
2025-08-02	Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system	Jiyong Kim et.al.	2508.01230	null
2025-08-02	StyDeco: Unsupervised Style Transfer with Distilling Priors and Semantic Decoupling	Yuanlin Yang et.al.	2508.01215	null
2025-08-02	Energy-dependent anisotropy of cosmic-ray muons: A twelve-year study with IceCube Neutrino Observatory	Nabin Upadhya Dhakal et.al.	2508.01194	null
2025-08-02	DELTAv2: Accelerating Dense 3D Tracking	Tuan Duc Ngo et.al.	2508.01170	null
2025-08-02	RoboLinker: A Diffusion-model-based Matching Clothing Generator Between Humans and Companion Robots	Jing Tang et.al.	2508.01165	null
2025-08-02	LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation	Xinyu Yan et.al.	2508.01152	null
2025-08-02	Personalized Safety Alignment for Text-to-Image Diffusion Models	Yu Lei et.al.	2508.01151	null
2025-08-02	Dataset Condensation with Color Compensation	Huyu Wu et.al.	2508.01139	null
2025-08-01	Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models	Jinsong Li et.al.	2508.00819	null
2025-08-01	Multibeam High Throughput Satellite: Hardware Foundation, Resource Allocation, and Precoding	Rui Chen et.al.	2508.00800	null
2025-08-01	Video Generators are Robot Policies	Junbang Liang et.al.	2508.00795	null
2025-08-01	SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation	Kien T. Pham et.al.	2508.00782	null
2025-08-01	Diffusion-Scheduled Denoising Autoencoders for Anomaly Detection in Tabular Data	Timur Sattarov et.al.	2508.00758	null
2025-08-01	LeakyCLIP: Extracting Training Data from CLIP	Yunhao Chen et.al.	2508.00756	null
2025-08-01	SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation	Prerana Ramkumar et.al.	2508.00750	null
2025-08-01	AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation	Le Wang et.al.	2508.00733	null
2025-08-01	YOLO-Count: Differentiable Object Counting for Text-to-Image Generation	Guanning Zeng et.al.	2508.00728	null
2025-08-01	Controllability of diffusive Lotka-Volterra strongly competitive systems under boundary constrained controls	Elisa Affili et.al.	2508.00713	null
2025-08-01	D3: Training-Free AI-Generated Video Detection Using Second-Order Features	Chende Zheng et.al.	2508.00701	null
2025-08-01	On-Device Diffusion Transformer Policy for Efficient Robot Manipulation	Yiming Wu et.al.	2508.00697	null
2025-08-01	Wind Power Scenario Generation based on the Generalized Dynamic Factor Model and Generative Adversarial Network	Young-ho Cho et.al.	2508.00692	null
2025-08-01	Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators	Albert Matveev et.al.	2508.00643	null
2025-08-01	Minimum Data, Maximum Impact: 20 annotated samples for explainable lung nodule classification	Luisa Gallée et.al.	2508.00639	null
2025-08-01	DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior	Junzhe Lu et.al.	2508.00599	null
2025-08-01	Wukong Framework for Not Safe For Work Detection in Text-to-Image systems	Mingrui Liu et.al.	2508.00591	null
2025-08-01	Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints	Jens U. Kreber et.al.	2508.00558	null
2025-08-01	DBLP: Noise Bridge Consistency Distillation For Efficient And Reliable Adversarial Purification	Chihan Huang et.al.	2508.00552	null
2025-08-01	Video Color Grading via Look-Up Table Generation	Seunghyun Shin et.al.	2508.00548	null
2025-08-01	HannesImitation: Grasping with the Hannes Prosthetic Hand via Imitation Learning	Carlo Alessi et.al.	2508.00491	null
2025-08-01	LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer	Yuzhuo Chen et.al.	2508.00477	null
2025-08-01	A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces	Leonidas Akritidis et.al.	2508.00472	null
2025-08-01	Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution	Yiwen Wang et.al.	2508.00471	null
2025-08-01	AutoDebias: Automated Framework for Debiasing Text-to-Image Models	Hongyi Cai et.al.	2508.00445	null
2025-08-01	SDMatte: Grafting Diffusion Models for Interactive Matting	Longfei Huang et.al.	2508.00443	null
2025-08-01	Diffusion-Based User-Guided Data Augmentation for Coronary Stenosis Detection	Sumin Seo et.al.	2508.00438	null
2025-08-01	Accurate Latent Inversion for Generative Image Steganography via Rectified Flow	Yuqi Qian et.al.	2508.00434	null
2025-08-01	Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation	Nan Xiang et.al.	2508.00428	null
2025-08-01	Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting	Seunggeun Chi et.al.	2508.00427	null
2025-08-01	Collimated QED Cascades with Curved Plasma Mirror	Xuesong Geng et.al.	2508.00417	null
2025-08-01	DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space	Junyu Chen et.al.	2508.00413	null
2025-08-01	Sortblock: Similarity-Aware Feature Reuse for Diffusion Model	Hanqi Chen et.al.	2508.00412	null
2025-08-01	Predictive information criterion for jump diffusion processes	Yuma Uehara et.al.	2508.00411	null
2025-08-01	Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency	Xi Xue et.al.	2508.00397	null
2025-08-01	Sheaf Graph Neural Networks via PAC-Bayes Spectral Optimization	Yoonhyuk Choi et.al.	2508.00357	null
2025-08-01	BOOD: Boundary-based Out-Of-Distribution Data Generation	Qilin Liao et.al.	2508.00350	null
2025-08-01	Favorable modifications of Scrape-Off Layer (SOL) heat flux width through pulsed fuelling in ADITYA-U Tokamak	SK Injamul Hoque et.al.	2508.00339	null
2025-08-01	Radially Locked Sun-Ray Patterns in Autocatalytic Reaction-Diffusion-Advection Systems	Surya Narayan Maharana et.al.	2508.00329	null
2025-08-01	Steering Guidance for Personalized Text-to-Image Diffusion Models	Sunghyun Park et.al.	2508.00319	null
2025-08-01	GV-VAD : Exploring Video Generation for Weakly-Supervised Video Anomaly Detection	Suhang Cai et.al.	2508.00312	null
2025-08-01	TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps	Zehui Xu et.al.	2508.00303	null
2025-08-01	Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence	Danzhen Fu et.al.	2508.00299	null
2025-08-01	AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer	Jin Lyu et.al.	2508.00298	null
2025-08-01	TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models	Christian Simon et.al.	2508.00289	null
2025-08-01	UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents	Jianqiang Xiao et.al.	2508.00288	null
2025-08-01	Towards Robust Semantic Correspondence: A Benchmark and Insights	Wenyue Chong et.al.	2508.00272	null
2025-08-01	Jet Image Generation in High Energy Physics Using Diffusion Models	Victor D. Martinez et.al.	2508.00250	null
2025-07-31	Reliability of 1D radiative-convective photochemical-equilibrium retrievals on transit spectra of WASP-107b	Thomas Konings et.al.	2508.00177	null
2025-07-31	DiSC-Med: Diffusion-based Semantic Communications for Robust Medical Image Transmission	Fupei Guo et.al.	2508.00172	null
2025-07-31	World Consistency Score: A Unified Metric for Video Generation Quality	Akshat Rakheja et.al.	2508.00144	null
2025-07-31	Entanglement spreading and emergent locality in Brownian SYK chains	Onkar Parrikar et.al.	2508.00060	null
2025-07-31	Predicting Large-scale Urban Network Dynamics with Energy-informed Graph Neural Diffusion	Tong Nie et.al.	2508.00037	null
2025-07-31	Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis	Bowen Zhang et.al.	2507.23785	null
2025-07-31	SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions	Jessica Bader et.al.	2507.23784	null
2025-07-31	General diffusions on metric graphs as limits of time-space Markov Chains	Alexis Anagnostakis et.al.	2507.23724	null
2025-07-31	DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching	Emery Pierson et.al.	2507.23715	null
2025-07-31	CFDagent: A Language-Guided, Zero-Shot Multi-Agent System for Complex Flow Simulation	Zhaoyue Xu et.al.	2507.23693	null
2025-07-31	UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration	Zihan Cheng et.al.	2507.23685	null
2025-07-31	I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation	Jialei Chen et.al.	2507.23683	null
2025-07-31	Analysis of a Cross-Nonlinear Porous-Medium System Modeling Pressure-Driven Cell Population Dynamics	Alexis Béjar-López et.al.	2507.23680	null
2025-07-31	DepMicroDiff: Diffusion-Based Dependency-Aware Multimodal Imputation for Microbiome Data	Rabeya Tus Sadia et.al.	2507.23676	null
2025-07-31	One-Step Flow Policy Mirror Descent	Tianyi Chen et.al.	2507.23675	null
2025-07-31	Adaptively Distilled ControlNet: Accelerated Training and Superior Sampling for Medical Image Synthesis	Kunpeng Qiu et.al.	2507.23652	null
2025-07-31	A stochastic heat equation with non-locally Lipschitz coefficients	Le Chen et.al.	2507.23637	null
2025-07-31	DivControl: Knowledge Diversion for Controllable Image Generation	Yucheng Xie et.al.	2507.23620	null
2025-08-02	MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction	Zijian Dong et.al.	2507.23597	null
2025-07-31	Theory of ultrafast conductance modulation in electrochemical protonic synapses by multiphase polarization	Michael L. Li et.al.	2507.23576	null
2025-08-01	H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation	Hongzhe Bi et.al.	2507.23523	null
2025-07-31	Conical diffraction of the synchrotron beam to probe the efficiency and morphology of blazed gratings	K. V. Nikolaev et.al.	2507.23513	null
2025-07-31	Emergence of long-range non-equilibrium correlations in free liquid diffusion	Marco Bussoletti et.al.	2507.23507	null
2025-07-31	Digital literacy interventions can boost humans in discerning deepfakes	Dominique Geissler et.al.	2507.23492	null
2025-07-31	Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion	Mutian Xu et.al.	2507.23483	null
2025-07-31	Adjoint-Based Aerodynamic Shape Optimization with a Manifold Constraint Learned by Diffusion Models	Long Chen et.al.	2507.23443	null
2025-07-31	Out-of-Distribution Detection in Medical Imaging via Diffusion Trajectories	Lemar Abdi et.al.	2507.23411	null
2025-07-31	An optimal preconditioner for high-order scheme arising from multi-dimensional Riesz space fractional diffusion equations with variable coefficients	Yuan-Yuan Huang et.al.	2507.23408	null
2025-07-31	UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries	Yijie Zhu et.al.	2507.23372	null
2025-07-31	IN45023 Neural Network Design Patterns in Computer Vision Seminar Report, Summer 2025	Radu-Andrei Bourceanu et.al.	2507.23357	null
2025-07-31	Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads	Yingjie Zhou et.al.	2507.23343	null
2025-07-31	EMU and the DRAGNs I: A Catalogue of DRAGNs	Ray P. Norris et.al.	2507.23337	null
2025-07-31	Classifying Compact Radio Emission in Nearby Galaxies: a 10GHz Study of Active Galactic Nuclei, Supernovae, Anomalous Microwave Emission and Star Forming Regions	Kristen C. Dage et.al.	2507.23332	null
2025-07-31	The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models	Alfio Ferrara et.al.	2507.23313	null
2025-07-31	PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving	Xuewei Tang et.al.	2507.23309	null
2025-08-01	Training-free Geometric Image Editing on Diffusion Models	Hanshen Zhu et.al.	2507.23300	null
2025-07-31	UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing	Hao Tang et.al.	2507.23278	null
2025-07-31	PixNerd: Pixel Neural Field Diffusion	Shuai Wang et.al.	2507.23268	null
2025-07-31	Automated Mapping the Pathways of Cranial Nerve II, III, V, and VII/VIII: A Multi-Parametric Multi-Stage Diffusion Tractography Atlas	Lei Xie et.al.	2507.23245	null
2025-07-31	BS-1-to-N: Diffusion-Based Environment-Aware Cross-BS Channel Knowledge Map Generation for Cell-Free Networks	Zhuoyin Dai et.al.	2507.23236	null
2025-07-31	Adversarial-Guided Diffusion for Multimodal LLM Attacks	Chengwei Xia et.al.	2507.23202	null
2025-07-30	X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention	Xiaochen Zhao et.al.	2507.23143	null
2025-07-30	Nonzero $\mathfrak{n}$ cohomology of Totally Degenerate Limit of Discrete Series representations	Jin Kunwoo Lee et.al.	2507.23102	null
2025-07-30	Diffusion model for gradient preconditioning in hyperspectral imaging inverse problems	Jonathan Monsalve et.al.	2507.23065	null
2025-07-30	Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation	Alexandru Buburuzan et.al.	2507.23058	null
2025-07-30	Search for Neutrinos from the Galactic 4FGL Sources with the Pion-bump Signature with IceCube	Alejandra Granados et.al.	2507.23040	null
2025-07-30	Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction	Giuseppe Cartella et.al.	2507.23021	null
2025-07-30	Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods	Siwoo Park et.al.	2507.23010	null
2025-07-30	LesionGen: A Concept-Guided Diffusion Model for Dermatology Image Synthesis	Jamil Fayyad et.al.	2507.23001	null
2025-07-29	Neural Autoregressive Modeling of Brain Aging	Ridvan Yesiloglu et.al.	2507.22954	null
2025-07-30	AUV-Fusion: Cross-Modal Adversarial Fusion of User Interactions and Visual Perturbations Against VARS	Hai Ling et.al.	2507.22880	null
2025-07-30	Robust Contract with Career Concerns	Tan Gan et.al.	2507.22852	null
2025-07-30	Morph: ChirpTransformer-based Encoder-decoder Co-design for Reliable LoRa Communication	Yidong Ren et.al.	2507.22851	null
2025-07-30	DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion	Qingcheng Zhao et.al.	2507.22825	null
2025-07-30	Design and Analysis of Plasmonic-Nanorod-Enhanced Lead-Free Inorganic Perovskite/Silicon Heterojunction Tandem Solar Cell Exceeding the Shockley-Queisser Limit	Md. Sad Abdullah Sami et.al.	2507.22803	null
2025-07-31	G-Core: A Simple, Scalable and Balanced RLHF Trainer	Junyu Wu et.al.	2507.22789	null
2025-07-30	DO-EM: Density Operator Expectation Maximization	Adit Vishnu et.al.	2507.22786	null
2025-08-01	Next Tokens Denoising for Speech Synthesis	Yanqing Liu et.al.	2507.22746	null
2025-07-30	Zero-Shot Image Anomaly Detection Using Generative Foundation Models	Lemar Abdi et.al.	2507.22692	null
2025-07-30	LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing	Federico Girella et.al.	2507.22627	null
2025-07-30	Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions	Yiting Qu et.al.	2507.22617	null
2025-07-30	Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model	Daehee Park et.al.	2507.22615	null
2025-07-30	ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning	Xiefan Guo et.al.	2507.22604	null
2025-07-30	Diffusion Models for Influence Maximization on Temporal Networks: A Guide to Make the Best Choice	Aaqib Zahoor et.al.	2507.22589	null
2025-07-30	DACA-Net: A Degradation-Aware Conditional Diffusion Network for Underwater Image Enhancement	Chang Huang et.al.	2507.22501	null
2025-07-30	LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning	Xiang Li et.al.	2507.22499	null
2025-07-30	Visual Language Models as Zero-Shot Deepfake Detectors	Viacheslav Pirogov et.al.	2507.22469	null
2025-07-30	TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation	Jiuming Liu et.al.	2507.22454	null
2025-07-30	GVD: Guiding Video Diffusion Model for Scalable Video Distillation	Kunyang Li et.al.	2507.22360	null
2025-07-29	Trade-offs in Image Generation: How Do Different Dimensions Interact?	Sicheng Zhang et.al.	2507.22100	null
2025-07-29	X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again	Zigang Geng et.al.	2507.22058	null
2025-07-30	See Different, Think Better: Visual Variations Mitigating Hallucinations in LVLMs	Ziyun Dai et.al.	2507.22003	null
2025-07-29	Enhancing Generalization in Data-free Quantization via Mixup-class Prompting	Jiwoong Park et.al.	2507.21947	null
2025-07-29	Anyone Can Jailbreak: Prompt-Based Attacks on LLMs and T2Is	Ahmed B Mustafa et.al.	2507.21820	null
2025-07-29	Control Copy-Paste: Controllable Diffusion-Based Augmentation Method for Remote Sensing Few-Shot Object Detection	Yanxing Liu et.al.	2507.21816	null
2025-07-29	MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE	Junzhe Li et.al.	2507.21802	null
2025-07-29	APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing	Sangmin Han et.al.	2507.21690	null
2025-07-29	GuidPaint: Class-Guided Image Inpainting with Diffusion Models	Qimin Wang et.al.	2507.21627	null
2025-07-29	Locally Controlled Face Aging with Latent Diffusion Models	Lais Isabelle Alves dos Santos et.al.	2507.21600	null
2025-07-29	Neural network enabled wide field-of-view imaging with hyperbolic metalenses	Joel Yeo et.al.	2507.21562	null
2025-07-29	Chain-of-Cooking:Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance	Mengling Xu et.al.	2507.21529	null
2025-07-29	BANG: Dividing 3D Assets via Generative Exploded Dynamics	Longwen Zhang et.al.	2507.21493	null
2025-07-29	Retrieve-Augmented Generation for Speeding up Diffusion Policy without Additional Training	Sodtavilan Odonchimed et.al.	2507.21452	null
2025-07-30	Multimodal LLMs as Customized Reward Models for Text-to-Image Generation	Shijie Zhou et.al.	2507.21391	null
2025-07-28	Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation	I-Hsiang Chen et.al.	2507.21367	null
2025-07-28	A Contrastive Diffusion-based Network (CDNet) for Time Series Classification	Yaoyu Zhang et.al.	2507.21357	null
2025-07-28	HDR Environment Map Estimation with Latent Diffusion Models	Jack Hilliard et.al.	2507.21261	null
2025-07-28	Adaptive Multimodal Protein Plug-and-Play with Diffusion-Based Priors	Amartya Banerjee et.al.	2507.21260	null
2025-07-28	Learning from Limited and Imperfect Data	Harsh Rangwani et.al.	2507.21205	null
2025-08-01	Flow Matching Policy Gradients	David McAllister et.al.	2507.21053	null
2025-07-29	JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1	Xinhan Di et.al.	2507.20987	null
2025-07-28	Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision	Xiao Fang et.al.	2507.20976	null
2025-07-24	Controllable Video Generation: A Survey	Yue Ma et.al.	2507.16869	null
2025-10-14	OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions	Yuanhao Cai et.al.	2506.23361	null
2025-06-24	DreamJourney: Perpetual View Generation with Video Diffusion Models	Bo Pan et.al.	2506.17705	null
2025-06-13	Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models	Sridhar S et.al.	2506.10005	null
2025-06-04	IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation	Yuanze Lin et.al.	2506.03150	null
2025-05-21	LMP: Leveraging Motion Prior in Zero-Shot Video Generation with Diffusion Transformer	Changgu Chen et.al.	2505.14167	null
2025-05-13	ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models	Ozgur Kara et.al.	2505.07652	null
2025-04-16	OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding	Dianbing Xi et.al.	2504.10825	null
2025-04-14	Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization	Jialu Li et.al.	2504.08641	null
2025-08-28	Evaluating Text-to-Image and Text-to-Video Synthesis with a Conditional Fréchet Distance	Jaywon Koo et.al.	2503.21721	null
2025-10-08	Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search	Yuta Oshima et.al.	2501.19252	null
2025-01-10	Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control	Zekai Gu et.al.	2501.03847	null
2025-03-12	TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions	Vriksha Srihari et.al.	2501.01156	null
2024-12-17	TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation	Xingrui Wang et.al.	2412.10275	null
2025-05-23	Motion by Queries: Identity-Motion Trade-offs in Text-to-Video Generation	Yuval Atzmon et.al.	2412.07750	null
2025-10-07	STIV: Scalable Text and Image Conditioned Video Generation	Zongyu Lin et.al.	2412.07730	null
2024-12-06	Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation	Yuying Ge et.al.	2412.04432	null
2025-03-21	VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement	Daeun Lee et.al.	2411.15115	null
2025-05-20	Progressive Autoregressive Video Diffusion Models	Desai Xie et.al.	2410.08151	null
2024-09-26	SurGen: Text-Guided Diffusion Model for Surgical Video Generation	Joseph Cho et.al.	2408.14028	null
2024-09-17	EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation	Cong Wang et.al.	2408.13005	null
2025-03-11	ControlNeXt: Powerful and Efficient Control for Image and Video Generation	Bohao Peng et.al.	2408.06070	null
2025-08-29	Unlearning Concepts from Text-to-Video Diffusion Models	Shiqi Liu et.al.	2407.14209	null
2024-07-02	SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix	Peng Dai et.al.	2407.00367	null
2024-06-26	Text-Animator: Controllable Visual Text Video Generation	Lin Liu et.al.	2406.17777	null
2024-06-14	Vivid-ZOO: Multi-View Video Generation with Diffusion Model	Bing Li et.al.	2406.08659	null
2024-06-12	Interactive Generation of Laparoscopic Videos with Diffusion Models	Ivan Iliash et.al.	2406.06537	null
2025-02-25	ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation	Tianchen Zhao et.al.	2406.02540	null
2024-10-04	I4VGen: Image as Free Stepping Stone for Text-to-Video Generation	Xiefan Guo et.al.	2406.02230	null
2024-07-12	MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model	Muyao Niu et.al.	2405.20222	null
2024-05-21	Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices	Nathaniel Cohen et.al.	2405.12211	null
2024-11-05	FIFO-Diffusion: Generating Infinite Videos from Text without Training	Jihwan Kim et.al.	2405.11473	null
2024-05-08	Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models	Fan Bao et.al.	2405.04233	null
2024-11-19	Video Diffusion Models: A Survey	Andrew Melnik et.al.	2405.03150	null
2024-04-26	TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models	Haomiao Ni et.al.	2404.16306	null
2024-04-22	ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model	Dingming Liu et.al.	2404.12903	null
2024-12-31	Grid Diffusion Models for Text-to-Video Generation	Taegyeong Lee et.al.	2404.00234	null
2025-04-17	StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text	Roberto Henschel et.al.	2403.14773	null
2024-03-25	S2DM: Sector-Shaped Diffusion Models for Video Generation	Haoran Lang et.al.	2403.13408	null
2024-10-01	VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models	Wenhao Wang et.al.	2403.06098	null
2024-03-11	VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models	Yabo Zhang et.al.	2403.05438	null
2024-06-10	Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation	Joseph Cho et.al.	2403.05131	null
2024-03-08	Controllable Generation with Text-to-Image Diffusion Models: A Survey	Pu Cao et.al.	2403.04279	null
2024-11-12	UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control	Tian Xia et.al.	2403.02332	null
2024-06-05	Contextualized Diffusion Models for Text-Guided Image and Video Generation	Ling Yang et.al.	2402.16627	null
2024-08-29	Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models	Yixuan Ren et.al.	2402.14780	null
2025-01-03	Neural Network Diffusion	Kai Wang et.al.	2402.13144	null
2024-06-21	Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models	Nicholas Konz et.al.	2402.05210	null
2024-02-06	Lumiere: A Space-Time Diffusion Model for Video Generation	Omer Bar-Tal et.al.	2401.12945	null
2024-01-22	Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution	Xin Yuan et.al.	2401.10404	null
2024-05-13	360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model	Qian Wang et.al.	2401.06578	null
2024-01-09	MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond	Yupei Lin et.al.	2401.03221	null
2024-01-04	Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions	David Junhao Zhang et.al.	2401.01827	null
2024-08-12	Diffusion Reward: Learning Rewards via Conditional Video Diffusion	Tao Huang et.al.	2312.14134	null
2023-12-12	Photorealistic Video Generation with Diffusion Models	Agrim Gupta et.al.	2312.06662	null
2023-12-12	Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution	Shangchen Zhou et.al.	2312.06640	null
2024-06-04	GenTron: Diffusion Transformers for Image and Video Generation	Shoufa Chen et.al.	2312.04557	null
2023-12-08	AnimateZero: Video Diffusion Models are Zero-Shot Image Animators	Jiwen Yu et.al.	2312.03793	null
2024-09-17	DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance	Cong Wang et.al.	2312.03018	null
2024-04-10	BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models	Fengyuan Shi et.al.	2312.02813	null
2023-12-05	Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models	Shengqu Cai et.al.	2312.01409	null
2023-12-05	VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models	Hyeonho Jeong et.al.	2312.00845	null
2023-12-01	ART $\boldsymbol{\cdot}$ V: Auto-Regressive Text-to-Video Generation with Diffusion Models	Wenming Weng et.al.	2311.18834	null
2024-01-01	MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation	Yanhui Wang et.al.	2311.18829	null
2023-11-28	Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets	Andreas Blattmann et.al.	2311.15127	null
2023-12-21	FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline	Vladimir Arkhipkin et.al.	2311.13073	null
2024-07-31	MoVideo: Motion-Aware Video Generation with Diffusion Models	Jingyun Liang et.al.	2311.11325	null
2024-08-06	Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning	Rohit Girdhar et.al.	2311.10709	null
2023-10-31	VideoCrafter1: Open Diffusion Models for High-Quality Video Generation	Haoxin Chen et.al.	2310.19512	null
2024-05-07	LLM-grounded Video Diffusion Models	Long Lian et.al.	2309.17444	null
2023-09-08	Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model	Sungwon Hwang et.al.	2309.03550	null
2023-09-08	Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation	Jiaxi Gu et.al.	2309.03549	null
2023-09-08	VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation	Xin Li et.al.	2309.00398	null
2023-08-17	DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory	Shengming Yin et.al.	2308.08089	null
2023-08-15	ModelScope Text-to-Video Technical Report	Jiuniu Wang et.al.	2308.06571	null
2023-08-01	MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text	Junchen Zhu et.al.	2307.16371	null
2023-08-04	VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet	Zhihao Hu et.al.	2307.14073	null
2023-06-06	Probabilistic Adaptation of Text-to-Video Models	Mengjiao Yang et.al.	2306.01872	null
2023-06-05	Video Colorization with Pre-trained Text-to-Image Diffusion Models	Hanyuan Liu et.al.	2306.01732	null
2023-06-02	Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance	Jinbo Xing et.al.	2306.00943	null
2023-05-30	Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising	Fu-Yun Wang et.al.	2305.18264	null
2023-10-31	Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models	Shihao Zhao et.al.	2305.16322	null
2024-08-13	Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning	Weifeng Chen et.al.	2305.13840	null
2023-05-23	ControlVideo: Training-free Controllable Text-to-Video Generation	Yabo Zhang et.al.	2305.13077	null
2023-04-19	Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation	Jie An et.al.	2304.08477	null
2024-01-05	Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models	Wen Wang et.al.	2303.17599	null
2023-07-11	Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos	Kun Su et.al.	2303.16897	null
2023-03-24	Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators	Levon Khachatryan et.al.	2303.13439	null
2023-03-23	Pix2Video: Video Editing using Image Diffusion	Duygu Ceylan et.al.	2303.12688	null
2023-10-16	VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation	Zhengxiong Luo et.al.	2303.08320	null
2024-11-11	Text-to-image Diffusion Models in Generative AI: A Survey	Chenshuang Zhang et.al.	2303.07909	null
2023-03-09	Video-P2P: Video Editing with Cross-attention Control	Shaoteng Liu et.al.	2303.04761	null
2023-11-01	Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition	Cindy M. Nguyen et.al.	2303.04291	null
2023-02-03	Dreamix: Video Diffusion Models are General Video Editors	Eyal Molad et.al.	2302.01329	null
2023-03-20	Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation	Jay Zhangjie Wu et.al.	2212.11565	null
2023-05-12	MagicVideo: Efficient Video Generation With Latent Diffusion Models	Daquan Zhou et.al.	2211.11018	null
2022-10-06	Imagen Video: High Definition Video Generation with Diffusion Models	Jonathan Ho et.al.	2210.02303	null

Industry

Publish Date	Title	Authors	PDF	Code
2025-12-09	Emulation of Complex Matrix Multiplication based on the Chinese Remainder Theorem	Yuki Uchino et.al.	2512.08321	null
2025-12-09	SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection	Ching-Hung Cheng et.al.	2512.08223	null
2025-12-08	Optimization-Guided Diffusion for Interactive Scene Generation	Shiaho Li et.al.	2512.07661	null
2025-12-08	Revisiting Quantum Supremacy: Simulating Sycamore-Class Circuits Using Hybrid CPU/GPU HPC Workloads	Bob Wold et.al.	2512.07311	null
2025-12-08	Characterizing Lane-Changing Behavior in Mixed Traffic	Sungyong Chung et.al.	2512.07219	null
2025-12-07	Accurate Models of NVIDIA Tensor Cores	Faizan A. Khattak et.al.	2512.07004	null
2025-12-07	KV-CAR: KV Cache Compression using Autoencoders and KV Reuse in Large Language Models	Sourjya Roy et.al.	2512.06727	null
2025-12-06	Programmable and GPU-Accelerated Edge Inference for Real-Time ISAC on NVIDIA ARC-OTA	Davide Villa et.al.	2512.06493	null
2025-12-06	FIP-TOI: Fast Imaging Pipeline for Pulsar Localisation with a Transient-Oriented Radio Astronomical Imager	X. Li et.al.	2512.06254	null
2025-12-05	GPU acceleration of optical photon propagation in low photon yield applications: Opticks for the Electron Ion Collider	Gabor Galgoczi et.al.	2512.06061	null
2025-12-03	Fast and Flexible Robustness Certificates for Semantic Segmentation	Thomas Massena et.al.	2512.06010	null
2025-12-05	Trusted AI Agents in the Cloud	Teofil Bodea et.al.	2512.05951	null
2025-12-05	OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors Reasoning	Xusheng Guo et.al.	2512.05698	null
2025-12-05	LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection	Johannes Meier et.al.	2512.05663	null
2025-12-05	An Integrated System for WEEE Sorting Employing X-ray Imaging, AI-based Object Detection and Segmentation, and Delta Robot Manipulation	Panagiotis Giannikos et.al.	2512.05599	null
2025-12-05	Compiler-supported reduced precision and AoS-SoA transformations for heterogeneous hardware	Pawel K. Radtke et.al.	2512.05516	null
2025-12-04	Robustness Test for AI Forecasting of Hurricane Florence Using FourCastNetv2 and Random Perturbations of the Initial Condition	Adam Lizerbram et.al.	2512.05323	null
2025-12-07	NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation	Yu Zeng et.al.	2512.05106	null
2025-12-04	David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?	Shashwat Shankar et.al.	2512.05073	null
2025-12-04	SDG-Track: A Heterogeneous Observer-Follower Framework for High-Resolution UAV Tracking on Embedded Platforms	Jiawen Wen et.al.	2512.04883	null
2025-12-05	GPU-Portable Real-Space Density Functional Theory Implementation on Unified-Memory Architectures	Atsushi M. Ito et.al.	2512.04447	null
2025-12-04	A Structure-Aware Irregular Blocking Method for Sparse LU Factorization	Zhen Hu et.al.	2512.04389	null
2025-12-03	From FLOPs to Footprints: The Resource Cost of Artificial Intelligence	Sophia Falk et.al.	2512.04142	null
2025-11-01	Toward Sustainability-Aware LLM Inference on Edge Clusters	Kolichala Rajashekar et.al.	2512.04088	null
2025-12-03	Autonomous Reinforcement Learning Robot Control with Intel’s Loihi 2 Neuromorphic Hardware	Kenneth Stewart et.al.	2512.03911	null
2025-12-03	Crossing the Sim2Real Gap Between Simulation and Ground Testing to Space Deployment of Autonomous Free-flyer Control	Kenneth Stewart et.al.	2512.03736	null
2025-12-03	Autonomous Planning In-space Assembly Reinforcement-learning free-flYer (APIARY) International Space Station Astrobee Testing	Samantha Chapin et.al.	2512.03729	null
2025-12-03	On the Challenges of Energy-Efficiency Analysis in HPC Systems: Evaluating Synthetic Benchmarks and Gromacs	Rafael Ravedutti Lucio Machado et.al.	2512.03697	null
2025-12-02	Flux4D: Flow-based Unsupervised 4D Reconstruction	Jingkang Wang et.al.	2512.03210	null
2025-12-02	DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images	Xiaoxue Chen et.al.	2512.03004	null
2025-12-02	CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning	Songqiao Su et.al.	2512.02551	null
2025-12-02	G-SHARP: Gaussian Surgical Hardware Accelerated Real-time Pipeline	Vishwesh Nath et.al.	2512.02482	null
2025-12-02	Pushing Tensor Accelerators Beyond MatMul in a User-Schedulable Language	Yihong Zhang et.al.	2512.02371	null
2025-12-02	Quantum Vanguard: Server Optimized Privacy Fortified Federated Intelligence for Future Vehicles	Dev Gurung et.al.	2512.02301	null
2025-12-01	Microbenchmarking NVIDIA’s Blackwell Architecture: An in-depth Architectural Analysis	Aaron Jarmusch et.al.	2512.02189	null
2025-12-03	Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling	Jack Cook et.al.	2512.02010	null
2025-12-01	SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation	Zisu Li et.al.	2512.01960	null
2025-12-01	OpenBox: Annotate Any Bounding Boxes in 3D	In-Jae Lee et.al.	2512.01352	null
2025-11-30	Light-Weight Benchmarks Reveal the Hidden Hardware Cost of Zero-Shot Tabular Foundation Models	Aayam Bansal et.al.	2512.00888	null
2025-11-29	Heimdall++: Optimizing GPU Utilization and Pipeline Parallelism for Efficient Single-Pulse Detection	Bingzheng Xia et.al.	2512.00398	null
2025-11-29	Efficient Kernel Mapping and Comprehensive System Evaluation of LLM Acceleration on a CGLA	Takuto Ando et.al.	2512.00335	null
2025-11-26	LLaMCAT: Optimizing Large Language Model Inference with Cache Arbitration and Throttling	Zhongchun Zhou et.al.	2512.00083	null
2025-11-28	Energy-Efficient Vision Transformer Inference for Edge-AI Deployment	Nursultan Amanzhol et.al.	2511.23166	null
2025-11-28	DiskChunGS: Large-Scale 3D Gaussian SLAM Through Chunk-Based Memory Management	Casimir Feldmann et.al.	2511.23030	null
2025-11-27	Accelerating mesh-based Monte Carlo simulations using contemporary graphics ray-tracing hardware	Shijie Yan et.al.	2511.22779	null
2025-11-27	Test-time scaling of diffusions with flow maps	Amirmojtaba Sabour et.al.	2511.22688	null
2025-12-09	Edge Deployment of Small Language Models, a comprehensive comparison of CPU, GPU and NPU backends	Pablo Prieto et.al.	2511.22334	null
2025-11-27	MTR-VP: Towards End-to-End Trajectory Planning through Context-Driven Image Encoding and Multiple Trajectory Prediction	Maitrayee Keskar et.al.	2511.22181	null
2025-11-27	Evaluating the Robustness of Large Language Model Safety Guardrails Against Adversarial Attacks	Richard J. Young et.al.	2511.22047	null
2025-11-27	A Safety and Security Framework for Real-World Agentic Systems	Shaona Ghosh et.al.	2511.21990	null
2025-11-26	Exploring Fusion Strategies for Multimodal Vision-Language Systems	Regan Willis et.al.	2511.21889	null
2025-11-26	FaithFusion: Harmonizing Reconstruction and Generation via Pixel-wise Information Gain	YuAn Wang et.al.	2511.21113	null
2025-11-29	Hardware-Accelerated Phase-Averaging for Cavitating Bubbly Flows	Diego Vaca-Revelo et.al.	2511.21031	null
2025-11-25	NVIDIA Nemotron Parse 1.1	Kateryna Chumachenko et.al.	2511.20478	null
2025-11-26	VGGTFace: Topologically Consistent Facial Geometry Reconstruction in the Wild	Xin Ming et.al.	2511.20366	null
2025-11-25	The PLUTO Code on GPUs: A First Look at Eulerian MHD Methods	Marco Rossazza et.al.	2511.20337	null
2025-11-24	An NLO-Matched Initial and Final State Parton Shower on a GPU	Michael H. Seymour et.al.	2511.19633	null
2025-11-24	IDSplat: Instance-Decomposed 3D Gaussian Splatting for Driving Scenes	Carl Lindström et.al.	2511.19235	null
2025-11-24	An Online Fragmentation-Aware GPU Scheduler for Multi-Tenant MIG-based Clouds	Marco Zambianco et.al.	2511.18906	null
2025-12-02	Evaluation of GPU Video Encoder for Low-Latency Real-Time 4K UHD Encoding	Kasidis Arunruangsirilert et.al.	2511.18688	null
2025-11-24	Evaluation of NVENC Split-Frame Encoding (SFE) for UHD Video Transcoding	Kasidis Arunruangsirilert et.al.	2511.18687	null
2025-11-24	Evaluation of Hardware-based Video Encoders on Modern GPUs for UHD Live-Streaming	Kasidis Arunruangsirilert et.al.	2511.18686	null
2025-11-24	Low-Rank GEMM: Efficient Matrix Multiplication via Low-Rank Approximation with FP8 Acceleration	Alfredo Metere et.al.	2511.18674	null
2025-11-23	UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization	Siyi Li et.al.	2511.18254	null
2025-11-26	Pier: Efficient Large Language Model pretraining with Relaxed Global Communication	Shuyuan Fan et.al.	2511.17849	null
2025-11-21	Entity – Hardware-agnostic Particle-in-Cell Code for Plasma Astrophysics. I: Curvilinear Special Relativistic Module	Hayk Hakobyan et.al.	2511.17710	null
2025-11-21	Lane-Frame Quantum Multimodal Driving Forecasts for the Trajectory of Autonomous Vehicles	Navneet Singh et.al.	2511.17675	null
2025-11-21	MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments	Zhiyu Huang et.al.	2511.17496	null
2025-11-21	Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation	Aniketh Iyengar et.al.	2511.17031	null
2025-11-20	Optimizing Federated Learning in the Era of LLMs: Message Quantization and Streaming	Ziyue Xu et.al.	2511.16450	null
2025-11-20	Parallelizable Complex Neural Dynamics Models for PMSM Temperature Estimation with Hardware Acceleration	Xinyuan Liao et.al.	2511.16093	null
2025-11-22	CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking	Sifan Zhou et.al.	2511.15580	null
2025-11-18	Multi-GPU Quantum Circuit Simulation and the Impact of Network Performance	W. Michael Brown et.al.	2511.14664	null
2025-11-18	Perception-aware Exploration for Consumer-grade UAVs	Svetlana Seliunina et.al.	2511.14393	null
2025-11-23	Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors	Jeryes Danial et.al.	2511.14335	null
2025-11-24	InstantViR: Real-Time Video Inverse Problem Solver with Distilled Diffusion Prior	Weimin Bai et.al.	2511.14208	null
2025-11-19	PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation	Xiangyu Li et.al.	2511.14185	null
2025-11-18	RTS-Mono: A Real-Time Self-Supervised Monocular Depth Estimation Method for Real-World Deployment	Zeyu Cheng et.al.	2511.14107	null
2025-11-16	Guaranteed DGEMM Accuracy While Using Reduced Precision Tensor Cores Through Extensions of the Ozaki Scheme	Angelika Schwarz et.al.	2511.13778	null
2025-11-17	T-SAR: A Full-Stack Co-design for CPU-Only Ternary LLM Inference via In-Place SIMD ALU Reorganization	Hyunwoo Oh et.al.	2511.13676	null
2025-11-17	KForge: Program Synthesis for Diverse AI Hardware Accelerators	Taras Sereda et.al.	2511.13274	null
2025-11-17	WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object Detection	Longhui Zheng et.al.	2511.13138	null
2025-11-15	Enhancing Road Safety Through Multi-Camera Image Segmentation with Post-Encroachment Time Analysis	Shounak Ray Chaudhuri et.al.	2511.12018	null
2025-11-15	High-Performance N-Queens Solver on GPU: Iterative DFS with Zero Bank Conflicts	Guangchao Yao et.al.	2511.12009	null
2025-11-14	Advancing Annotat3D with Harpia: A CUDA-Accelerated Library For Large-Scale Volumetric Data Segmentation	Camila Machado de Araujo et.al.	2511.11890	null
2025-11-06	AIvailable: A Software-Defined Architecture for LLM-as-a-Service on Heterogeneous and Legacy GPUs	Pedro Antunes et.al.	2511.11621	null
2025-10-30	Mind the Gap: Revealing Inconsistencies Across Heterogeneous AI Accelerators	Elliott Wen et.al.	2511.11601	null
2025-11-14	6D Strawberry Pose Estimation: Real-time and Edge AI Solutions Using Purely Synthetic Training Data	Saptarshi Neil Sinha et.al.	2511.11307	null
2025-11-14	MMA-Sim: Bit-Accurate Reference Model of Tensor Cores and Matrix Cores	Peichen Xie et.al.	2511.10909	null
2025-11-17	FCOC: A Fractal-Chaotic Co-driven Framework for Financial Volatility Forecasting	Yilong Zeng et.al.	2511.10365	null
2025-11-13	EDGC: Entropy-driven Dynamic Gradient Compression for Efficient LLM Training	Qingao Yi et.al.	2511.10333	null
2025-11-13	Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision	Yu Deng et.al.	2511.10316	null
2025-11-13	Flex-MIG: Enabling Distributed Execution on MIG	Myeongsu Kim et.al.	2511.09143	null
2025-11-12	FLAD: Federated Learning for LLM-based Autonomous Driving in Vehicle-Edge-Cloud Networks	Tianao Xiang et.al.	2511.09025	null
2025-11-12	TiDAR: Think in Diffusion, Talk in Autoregression	Jingyu Liu et.al.	2511.08923	null
2025-11-15	JobSphere: An AI-Powered Multilingual Career Copilot for Government Employment Platforms	Srihari R et.al.	2511.08343	null
2025-11-13	LOw-cOst yet High-Performant Sparse Matrix-Matrix Multiplication on Arm SME Architectures	Kelun Lei et.al.	2511.08158	null
2025-11-11	HipKittens: Fast and Furious AMD Kernels	William Hu et.al.	2511.08083	null
2025-11-11	TurboSAT: Gradient-Guided Boolean Satisfiability Accelerated on GPU-CPU Hybrid System	Steve Dai et.al.	2511.07737	null
2025-11-01	Agentic Educational Content Generation for African Languages on Edge Devices	Ravi Gupta et.al.	2511.07437	null
2025-10-15	Towards Affordable, Adaptive and Automatic GNN Training on CPU-GPU Heterogeneous Platforms	Tong Qiao et.al.	2511.07421	null
2025-11-10	YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting	Botao Ye et.al.	2511.07321	null
2025-11-09	LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs	Zifan He et.al.	2511.06174	null
2025-11-08	MT4G: A Tool for Reliable Auto-Discovery of NVIDIA and AMD GPU Compute and Memory Topologies	Stepan Vanecek et.al.	2511.05958	null
2025-10-28	AIRMap – AI-Generated Radio Maps for Wireless Digital Twins	Ali Saeizadeh et.al.	2511.05522	null
2025-10-09	Production-Grade Local LLM Inference on Apple Silicon: A Comparative Study of MLX, MLC-LLM, Ollama, llama.cpp, and PyTorch MPS	Varun Rajesh et.al.	2511.05502	null
2025-11-07	No Pose Estimation? No Problem: Pose-Agnostic and Instance-Aware Test-Time Adaptation for Monocular Depth Estimation	Mingyu Sung et.al.	2511.05055	null
2025-11-06	Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning	NVIDIA et.al.	2511.04831	null
2025-11-06	BoRe-Depth: Self-supervised Monocular Depth Estimation with Boundary Refinement for Embedded Systems	Chang Liu et.al.	2511.04388	null
2025-11-06	Block Rotation is All You Need for MXFP4 Quantization	Yuantian Shao et.al.	2511.04214	null
2025-11-06	PICNIC: Silicon Photonic Interconnected Chiplets with Computational Network and In-memory Computing for LLM Inference Acceleration	Yue Jiet Chong et.al.	2511.04036	null
2025-11-06	LogHD: Robust Compression of Hyperdimensional Classifiers via Logarithmic Class-Axis Reduction	Sanggeon Yun et.al.	2511.03938	null
2025-11-07	NVIDIA Nemotron Nano V2 VL	NVIDIA et.al.	2511.03929	null
2025-11-05	DecoHD: Decomposed Hyperdimensional Classification under Extreme Memory Budgets	Sanggeon Yun et.al.	2511.03911	null
2025-11-05	Open Source State-Of-the-Art Solution for Romanian Speech Recognition	Gabriel Pirlogeanu et.al.	2511.03361	null
2025-11-05	Modeling Headway in Heterogeneous and Mixed Traffic Flow: A Statistical Distribution Based on a General Exponential Function	Natchaphon Leungbootnak et.al.	2511.03154	null
2025-11-04	Implementing Multi-GPU Scientific Computing Miniapps Across Performance Portable Frameworks	Johansell Villalobos et.al.	2511.02655	null
2025-11-04	Energy-Efficient Hardware Acceleration of Whisper ASR on a CGLA	Takuto Ando et.al.	2511.02269	null
2025-11-04	Learning Spatial Awareness for Laparoscopic Surgery with AI Assisted Visual Feedback	Songyang Liu et.al.	2511.02233	null
2025-11-09	Investigation of Performance and Scalability of a Quantum-Inspired Evolutionary Optimizer (QIEO) on NVIDIA GPU	Aman Mittal et.al.	2511.01298	null
2025-11-02	Towards Portability at Scale: A Cross-Architecture Performance Evaluation of a GPU-enabled Shallow Water Solver	Johansell Villalobos et.al.	2511.01001	null
2025-11-01	Transfer Learning for Onboard Cloud Segmentation in Thermal Earth Observation: From Landsat to a CubeSat Constellation	Niklas Wölki et.al.	2511.00357	null
2025-10-30	Real-DRL: Teach and Learn in Reality	Yanbing Mao et.al.	2511.00112	null
2025-10-30	Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail	NVIDIA et.al.	2511.00088	null
2025-10-28	World Simulation with Video Foundation Models for Physical AI	NVIDIA et.al.	2511.00062	null
2025-10-27	Benchmarking Federated Learning Frameworks for Medical Imaging Deployment: A Comparative Study of NVIDIA FLARE, Flower, and Owkin Substra	Riya Gupta et.al.	2511.00037	null
2025-10-31	RDMA Point-to-Point Communication for LLM Systems	Nandor Licker et.al.	2510.27656	null
2025-10-31	AMD MI300X GPU Performance Analysis	Chandrish Ambati et.al.	2510.27583	null
2025-10-30	Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Improvement	Aaditya Shukla et.al.	2510.27051	null
2025-10-30	Photometric Redshifts in JWST Deep Fields: A Pixel-Based Alternative with DeepDISC	Grant Merz et.al.	2510.27032	null
2025-10-30	Towards Reinforcement Learning Based Log Loading Automation	Ilya Kurinov et.al.	2510.26363	null
2025-10-30	MossNet: Mixture of State-Space Experts is a Multi-Head Attention	Shikhar Tuli et.al.	2510.26182	null
2025-11-13	WOD-E2E: Waymo Open Dataset for End-to-End Driving in Challenging Long-tail Scenarios	Runsheng Xu et.al.	2510.26125	null
2025-10-29	Estimating cognitive biases with attention-aware inverse planning	Sounak Banerjee et.al.	2510.25951	null
2025-10-28	zFLoRA: Zero-Latency Fused Low-Rank Adapters	Dhananjaya Gowda et.al.	2510.25784	null
2025-10-29	INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats	Mengzhao Chen et.al.	2510.25602	null
2025-11-02	D $^2$ GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction	Kejing Xia et.al.	2510.25173	null
2025-10-31	DINO-YOLO: Self-Supervised Pre-training for Data-Efficient Object Detection in Civil Engineering Applications	Malaisree P et.al.	2510.25140	null
2025-10-31	A GPU-based Compressible Combustion Solver for Applications Exhibiting Disparate Space and Time Scales	Anthony Carreon et.al.	2510.23993	null
2025-10-27	Scalable GPU-Based Integrity Verification for Large Machine Learning Models	Marcin Spoczynski et.al.	2510.23938	null
2025-10-23	Speeding Up MACE: Low-Precision Tricks for Equivarient Force Fields	Alexandre Benoit et.al.	2510.23621	null
2025-10-27	The First Star-by-star $N$ -body/Hydrodynamics Simulation of Our Galaxy Coupling with a Surrogate Model	Keiya Hirashima et.al.	2510.23330	null
2025-11-05	MobileGeo: Exploring Hierarchical Knowledge Distillation for Resource-Efficient Cross-view Drone Geo-Localization	Jian Sun et.al.	2510.22582	null
2025-10-28	GRAID: Enhancing Spatial Reasoning of VLMs Through High-Fidelity Data Generation	Karim Elmaaroufi et.al.	2510.22118	null
2025-10-24	Performance Trade-offs of Optimizing Small Language Models for E-Commerce	Josip Tomo Licardo et.al.	2510.21970	null
2025-10-24	Remote Autonomy for Multiple Small Lowcost UAVs in GNSS-denied Search and Rescue Operations	Daniel Schleich et.al.	2510.21357	null
2025-10-23	AI PB: A Grounded Generative Agent for Personalized Investment Insights	Daewoo Park et.al.	2510.20099	null
2025-10-07	Low-Latency Neural Inference on an Edge Device for Real-Time Handwriting Recognition from EEG Signals	Ovishake Sen et.al.	2510.19832	null
2025-10-22	The Feasibility of Training Sovereign Language Models in the Global South: A Study of Brazil and Mexico	Sandra Malagon et.al.	2510.19801	null
2025-10-22	Unmanned Aerial Vehicles Control in a Digital Twin: Exploring the Effect of Different Points of View on User Experience in Virtual Reality	Francesco Vona et.al.	2510.19604	null
2025-11-25	GigaBrain-0: A World Model-Powered Vision-Language-Action Model	GigaBrain Team et.al.	2510.19430	null
2025-10-21	ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge	Zhilin Wang et.al.	2510.18941	null
2025-10-21	Benchmarking On-Device Machine Learning on Apple Silicon with MLX	Oluwaseun A. Ajayi et.al.	2510.18921	null
2025-10-21	sNVMe-oF: Secure and Efficient Disaggregated Storage	Marcin Chrapek et.al.	2510.18756	null
2025-10-21	Microsecond Federated SVD on Grassmann Manifold for Real-time IoT Intrusion Detection	Tung-Anh Nguyen et.al.	2510.18501	null
2025-10-20	SPACeR: Self-Play Anchoring with Centralized Reference Models	Wei-Jer Chang et.al.	2510.18060	null
2025-11-17	AoA Services in 5G Networks: A Framework for Real-World Implementation and Systematic Testing	Alberto Ceresoli et.al.	2510.17342	null
2025-10-20	Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models	Katie Luo et.al.	2510.17274	null
2025-10-19	Thermal Conductivity Estimation of Thermoelectric Materials with Uncertainty Quantification Using Bayesian Physics-Informed Neural Networks	Hyeonbin Moon et.al.	2510.16723	null
2025-10-18	Cerberus: Real-Time Video Anomaly Detection via Cascaded Vision-Language Models	Yue Zheng et.al.	2510.16290	null
2025-10-17	CuSfM: CUDA-Accelerated Structure-from-Motion	Jingrui Yu et.al.	2510.15271	null
2025-11-03	Automotive Crash Dynamics Modeling Accelerated with Machine Learning	Mohammad Amin Nabian et.al.	2510.15201	null
2025-10-16	DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning	Shih-Yang Liu et.al.	2510.15110	null
2025-10-16	Hive Hash Table: A Warp-Cooperative, Dynamically Resizable Hash Table for GPUs	Md Sabbir Hossain Polak et.al.	2510.15095	null
2025-10-16	EdgeNavMamba: Mamba Optimized Object Detection for Energy Efficient Edge Devices	Romina Aalishah et.al.	2510.14946	null
2025-10-16	A Performance Portable Matrix Free Dense MTTKRP in GenTen	Gabriel Kosmacher et.al.	2510.14891	null
2025-10-16	Tawa: Automatic Warp Specialization for Modern GPUs with Asynchronous References	Hongzheng Chen et.al.	2510.14719	null
2025-10-16	BalanceGS: Algorithm-System Co-design for Efficient 3D Gaussian Splatting Training on GPU	Junyi Wu et.al.	2510.14564	null
2025-10-15	Adaptive Obstacle-Aware Task Assignment and Planning for Heterogeneous Robot Teaming	Nan Li et.al.	2510.14063	null
2025-10-15	Anonymized Network Sensing using C++26 std::execution on GPUs	Michael Mandulak et.al.	2510.14050	null
2025-10-15	A Complete Pipeline for deploying SNNs with Synaptic Delays on Loihi 2	Balázs Mészáros et.al.	2510.13757	null
2025-10-15	Accelerated Feature Detectors for Visual SLAM: A Comparative Study of FPGA vs GPU	Ruiqi Ye et.al.	2510.13546	null
2025-10-15	Real-Time Crowd Counting for Embedded Systems with Lightweight Architecture	Zhiyuan Zhao et.al.	2510.13250	null
2025-10-14	T(R,O) Grasp: Efficient Graph Diffusion of Robot-Object Spatial Transformation for Cross-Embodiment Dexterous Grasping	Xin Fei et.al.	2510.12724	null
2025-10-14	A GPU-resident Memory-Aware Algorithm for Accelerating Bidiagonalization of Banded Matrices	Evelyne Ringoot et.al.	2510.12705	null
2025-10-14	Noisy Neighbor: Exploiting RDMA for Resource Exhaustion Attacks in Containerized Clouds	Gunwoo Kim et.al.	2510.12629	null
2025-10-14	PolygMap: A Perceptive Locomotion Framework for Humanoid Robot Stair Climbing	Bingquan Li et.al.	2510.12346	null
2025-10-14	PAGS: Priority-Adaptive Gaussian Splatting for Dynamic Driving Scenes	Ying A et.al.	2510.12282	null
2025-10-14	The Impact of Synthetic Data on Object Detection Model Performance: A Comparative Analysis with Real-World Data	Muammer Bay et.al.	2510.12208	null
2025-10-14	nuGPR: GPU-Accelerated Gaussian Process Regression with Iterative Algorithms and Low-Rank Approximations	Ziqi Zhao et.al.	2510.12128	null
2025-10-14	An AI-Based Behavioral Health Safety Filter and Dataset for Identifying Mental Health Crises in Text-Based Conversations	Benjamin W. Nelson et.al.	2510.12083	null
2025-10-13	SCOOP’D: Learning Mixed-Liquid-Solid Scooping via Sim2Real Generative Policy	Kuanning Wang et.al.	2510.11566	null
2025-10-15	A Faster and More Reliable Middleware for Autonomous Driving Systems	Yuankai He et.al.	2510.11448	null
2025-10-12	Real2USD: Scene Representations in Universal Scene Description Language	Christopher D. Hsu et.al.	2510.10778	null
2025-10-12	HYPERDOA: Robust and Efficient DoA Estimation using Hyperdimensional Computing	Rajat Bhattacharjya et.al.	2510.10718	null
2025-10-10	CuPyMag: GPU-Accelerated Finite-Element Micromagnetics with Magnetostriction	Hongyi Guan et.al.	2510.09812	null
2025-10-10	StreamingVLM: Real-Time Understanding for Infinite Video Streams	Ruyi Xu et.al.	2510.09608	null
2025-10-10	Visibility-Aware Densification for 3D Gaussian Splatting in Dynamic Urban Scenes	Yikang Zhang et.al.	2510.09364	null
2025-10-10	Online Video Depth Anything: Temporally-Consistent Depth Prediction with Low Memory Consumption	Johann-Friedrich Feiden et.al.	2510.09182	null
2025-10-09	Maple: A Multi-agent System for Portable Deep Learning across Clusters	Molang Wu et.al.	2510.08842	null
2025-10-09	Neptune: Advanced ML Operator Fusion for Locality and Parallelism on GPUs	Yifan Zhao et.al.	2510.08726	null
2025-11-05	HPQEA: A Scalable and High-Performance Quantum Emulator with High-Bandwidth Memory for Diverse Algorithms Support	Tran Van Duy et.al.	2510.07110	null
2025-10-08	GROMACS Unplugged: How Power Capping and Frequency Shapes Performance on GPUs	Ayesha Afzal et.al.	2510.06902	null
2025-10-08	AWM: Accurate Weight-Matrix Fingerprint for Large Language Models	Boyi Zeng et.al.	2510.06738	null
2025-10-05	Dual-stage and Lightweight Patient Chart Summarization for Emergency Physicians	Jiajun Wu et.al.	2510.06263	null
2025-10-07	MadNCL: A GPU Implementation of Algorithm NCL for Large-Scale, Degenerate Nonlinear Programs	Alexis Montoison et.al.	2510.05885	null
2025-10-07	TensorBLEU: Vectorized GPU-based BLEU Score Implementation for Per-Sentence In-Training Evaluation	Adam Filipek et.al.	2510.05485	null
2025-10-06	Mixed-precision ab initio tensor network state methods adapted for NVIDIA Blackwell technology via emulated FP64 arithmetic	Cole Brower et.al.	2510.04795	null
2025-10-06	Bio-Inspired Robotic Houbara: From Development to Field Deployment for Behavioral Studies	Lyes Saad Saoud et.al.	2510.04692	null
2025-10-06	Fast Witness Persistence for MRI Volumes via Hybrid Landmarking	Jorge Leonardo Ruiz Williams et.al.	2510.04553	null
2025-10-05	RAP: 3D Rasterization Augmented End-to-End Planning	Lan Feng et.al.	2510.04333	null
2025-10-16	ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation	Jay Zhangjie Wu et.al.	2510.04290	null
2025-10-23	Replacing Softmax Similarity with a Sharpened Angular Similarity: Theory and Practice of Scaling To Billion-Context Attention	Sahil Joshi et.al.	2510.04008	null
2025-10-04	Datacenter Energy Optimized Power Profiles	Sreedhar Narayanaswamy et.al.	2510.03872	null
2025-09-29	Convolutional Neural Nets vs Vision Transformers: A SpaceNet Case Study with Balanced vs Imbalanced Regimes	Akshar Gothi et.al.	2510.03297	null
2025-09-28	MACE: A Hybrid LLM Serving System with Colocated SLO-aware Continuous Retraining Alignment	Yufei Li et.al.	2510.03283	null
2025-10-03	On the energy efficiency of sparse matrix computations on multi-GPU clusters	Massimo Bernaschi et.al.	2510.02878	null
2025-10-03	Accelerating cosmological simulations on GPUs: a portable approach using OpenMP	M. D. Lepinzan et.al.	2510.02873	null
2025-10-03	Sequence-Preserving Dual-FoV Defense for Traffic Sign and Light Recognition in Autonomous Vehicles	Abhishek Joshi et.al.	2510.02642	null
2025-10-03	microJAX: A Differentiable Framework for Microlensing Modeling with GPU-Accelerated Image-Centered Ray Shooting	Shota Miyazaki et.al.	2510.02639	null
2025-10-02	SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting	Sung-Yeon Park et.al.	2510.02469	null
2025-10-02	Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities	Mario Medrano-Paredes et.al.	2510.02264	null
2025-10-10	Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving	Haibo Hu et.al.	2510.01795	null
2025-10-02	Edge GPU Aware Multiple AI Model Pipeline for Accelerated MRI Reconstruction and Analysis	Ashiyana Abdul Majeed et.al.	2510.01730	null
2025-10-02	MMGaP: Multi-User MIMO Detection and Precoding using GPU-assisted Physics-inspired Computation	Abhishek Kumar Singh et.al.	2510.01579	null
2025-10-02	NVIDIA AI Aerial: AI-Native Wireless Communications	Kobi Cohen-Arazi et.al.	2510.01533	null
2025-10-01	ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models	Akshat Ramachandran et.al.	2510.01290	null
2025-10-01	Sentry: Authenticating Machine Learning Artifacts on the Fly	Andrew Gan et.al.	2510.00554	null
2025-10-01	A Deep Learning Pipeline for Epilepsy Genomic Analysis Using GPT-2 XL and NVIDIA H100	Muhammad Omer Latif et.al.	2510.00392	null
2025-10-09	TASP: Topology-aware Sequence Parallelism	Yida Wang et.al.	2509.26541	null
2025-09-30	Benchmarking Deep Learning Convolutions on Energy-constrained CPUs	Enrique Galvez et.al.	2509.26217	null
2025-09-30	NuRisk: A Visual Question Answering Dataset for Agent-Level Risk Assessment in Autonomous Driving	Yuan Gao et.al.	2509.25944	null
2025-09-30	SAIL: SRAM-Accelerated LLM Inference System with Lookup-Table-based GEMV	Jingyao Zhang et.al.	2509.25853	null
2025-09-24	AMLA: MUL by ADD in FlashAttention Rescaling	Qichen Liao et.al.	2509.25224	null
2025-09-29	DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder	Junyu Chen et.al.	2509.25182	null
2025-10-01	DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space	Wenkun He et.al.	2509.25180	null
2025-09-30	YOLO26: Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection	Ranjan Sapkota et.al.	2509.25164	null
2025-09-29	Pretraining Large Language Models with NVFP4	NVIDIA et.al.	2509.25149	null
2025-09-29	ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation	Jiuhong Xiao et.al.	2509.24878	null
2025-10-13	SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer	Junsong Chen et.al.	2509.24695	null
2025-09-28	Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning	Muleilan Pei et.al.	2509.23993	null
2025-09-28	VFSI: Validity First Spatial Intelligence for Constraint-Guided Traffic Diffusion	Kargi Chauhan et.al.	2509.23971	null
2025-09-28	Learning Adaptive Pseudo-Label Selection for Semi-Supervised 3D Object Detection	Taehun Kong et.al.	2509.23880	null
2025-09-28	FastViDAR: Real-Time Omnidirectional Depth Estimation via Alternative Hierarchical Attention	Hangtian Zhao et.al.	2509.23733	null
2025-09-28	Performance and Numerical Aspects of Decompositional Factorizations with FP64 Floating-Point Emulation in INT8	Piotr Luszczek et.al.	2509.23565	null
2025-10-16	Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization	Vage Egiazarian et.al.	2509.23202	null
2025-09-26	Tiny-QMoE	Jack Cashman et.al.	2509.22951	null
2025-09-26	Self-driving cars: Are we there yet?	Merve Atasever et.al.	2509.22754	null
2025-09-18	VIRTUS-FPP: Virtual Sensor Modeling for Fringe Projection Profilometry in NVIDIA Isaac Sim	Adam Haroon et.al.	2509.22685	null
2025-09-17	FLAME: A Serving System Optimized for Large-Scale Generative Recommendation with Efficiency	Xianwen Guo et.al.	2509.22681	null
2025-10-13	LongLive: Real-time Interactive Long Video Generation	Shuai Yang et.al.	2509.22622	null
2025-09-26	Lightweight error mitigation strategies for post-training N:M activation sparsity in LLMs	Shirin Alanova et.al.	2509.22166	null
2025-09-25	XenoFlow: How Fast Can a SmartNIC-Based DNS Load Balancer Run?	Max Schrötter et.al.	2509.21656	null
2025-09-25	SuperOffload: Unleashing the Power of Large-Scale LLM Training on Superchips	Xinyu Lian et.al.	2509.21271	null
2025-09-25	Mojo: MLIR-Based Performance-Portable HPC Science Kernels on GPUs for the Python Ecosystem	William F. Godoy et.al.	2509.21039	null
2025-09-24	FlyTrap: Physical Distance-Pulling Attack Towards Camera-based Autonomous Target Tracking Systems	Shaoyuan Xie et.al.	2509.20362	null
2025-09-24	A Comprehensive Evaluation of YOLO-based Deer Detection Performance on Edge Devices	Bishal Adhikari et.al.	2509.20318	null
2025-09-24	Fulcrum: Optimizing Concurrent DNN Training and Inferencing on Edge Accelerators	Prashanthi S. K. et.al.	2509.20205	null
2025-09-24	Pagoda: An Energy and Time Roofline Study for DNN Workloads on Edge Accelerators	Prashanthi S. K. et.al.	2509.20189	null
2025-09-24	Characterizing the Performance of Accelerated Jetson Edge Devices for Training Deep Learning Models	Prashanthi S. K. et.al.	2509.20160	null
2025-09-24	Games Are Not Equal: Classifying Cloud Gaming Contexts for Effective User Experience Measurement	Yifan Wang et.al.	2509.19669	null
2025-09-23	Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation	Sherwin Bahmani et.al.	2509.19296	null
2025-09-23	Scheduler-Driven Job Atomization	Michal Konopa et.al.	2509.19086	null
2025-09-23	Beyond Backpropagation: Exploring Innovative Algorithms for Energy-Efficient Deep Neural Network Training	Przemysław Spyra et.al.	2509.19063	null
2025-09-23	3D Blocking for Matrix-free Smoothers in 2D Variable-Viscosity Stokes Equations with Applications to Geodynamics	Marcel Ferrari et.al.	2509.19061	null
2025-09-23	Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs	Marcin Chrapek et.al.	2509.18886	null
2025-09-26	APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation	Yuzhen Zhou et.al.	2509.18521	null
2025-09-22	RL-augmented Adaptive Model Predictive Control for Bipedal Locomotion over Challenging Terrain	Junnosuke Kamohara et.al.	2509.18466	null
2025-09-22	Robotic Skill Diversification via Active Mutation of Reward Functions in Reinforcement Learning During a Liquid Pouring Task	Jannick van Buuren et.al.	2509.18463	null
2025-09-19	TinyEcoWeedNet: Edge Efficient Real-Time Aerial Agricultural Weed Detection	Omar H. Khater et.al.	2509.18193	null
2025-09-22	AERO-MPPI: Anchor-Guided Ensemble Trajectory Optimization for Agile Mapless Drone Navigation	Xin Chen et.al.	2509.17340	null
2025-09-21	PMRT: A Training Recipe for Fast, 3D High-Resolution Aerodynamic Prediction	Sam Jacob Jacob et.al.	2509.17182	null
2025-09-19	WarpSpeed: A High-Performance Library for Concurrent GPU Hash Tables	Hunter McCoy et.al.	2509.16407	null
2025-09-19	Neural Atlas Graphs for Dynamic Scene Decomposition and Editing	Jan Philipp Schneider et.al.	2509.16336	null
2025-09-24	The Role of High-Performance GPU Resources in Large Language Model Based Radiology Imaging Diagnosis	Jyun-Ping Kao et.al.	2509.16328	null
2025-09-17	GraphMend: Code Transformations for Fixing Graph Breaks in PyTorch 2	Savini Kashmira et.al.	2509.16248	null
2025-09-19	A Memory Efficient Adjoint Method to Enable Billion Parameter Optimization on a Single GPU in Dynamic Problems	Leon Herrmann et.al.	2509.15744	null
2025-09-19	KoopCast: Trajectory Forecasting via Koopman Operators	Jungjin Lee et.al.	2509.15513	null
2025-09-18	Accelerating Garfield++ with CUDA	T. Neep et.al.	2509.15377	null
2025-09-18	Efficient 3D Perception on Embedded Systems via Interpolation-Free Tri-Plane Lifting and Volume Fusion	Sibaek Lee et.al.	2509.14641	null
2025-09-17	An RDMA-First Object Storage System with SmartNIC Offload	Yu Zhu et.al.	2509.13997	null
2025-09-17	SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation	Jiayi Pan et.al.	2509.13848	null
2025-09-16	Testing and benchmarking emerging supercomputers via the MFC flow solver	Benjamin Wilfong et.al.	2509.13575	null
2025-09-16	Real-Time Detection and Tracking of Foreign Object Intrusions in Power Systems via Feature-Based Edge Intelligence	Xinan Wang et.al.	2509.13396	null
2025-09-16	HPIM: Heterogeneous Processing-In-Memory-based Accelerator for Large Language Models Inference	Cenlin Duan et.al.	2509.12993	null
2025-09-07	Profiling LoRA/QLoRA Fine-Tuning Efficiency on Consumer GPUs: An RTX 4060 Case Study	MSR Avinash et.al.	2509.12229	null
2025-09-15	Advanced Layout Analysis Models for Docling	Nikolaos Livathinos et.al.	2509.11720	null
2025-09-15	HeLoFusion: An Efficient and Scalable Encoder for Modeling Heterogeneous and Multi-Scale Interactions in Trajectory Prediction	Bingqing Wei et.al.	2509.11719	null
2025-09-13	PHLoRA: data-free Post-hoc Low-Rank Adapter extraction from full-rank checkpoint	Bhoomit Vasani et.al.	2509.10971	null
2025-09-19	Understanding AI Evaluation Patterns: How Different GPT Models Assess Vision-Language Descriptions	Sajjad Abdoli et.al.	2509.10707	null
2025-09-12	MCBP: A Memory-Compute Efficient LLM Inference Accelerator Leveraging Bit-Slice-enabled Sparsity and Repetitiveness	Huizheng Wang et.al.	2509.10372	null
2025-09-19	Characterizing the Efficiency of Distributed Training: A Power, Performance, and Thermal Perspective	Seokjin Go et.al.	2509.10371	null
2025-09-12	Ruggedized Ultrasound Sensing in Harsh Conditions: eRTIS in the wild	Dennis Laurijssen et.al.	2509.10029	null
2025-09-10	Rapid Manufacturing of Lightweight Drone Frames Using Single-Tow Architected Composites	Md Habib Ullah Khan et.al.	2509.09024	null
2025-09-03	Silent Until Sparse: Backdoor Attacks on Semi-Structured Sparsity	Wei Guo et.al.	2509.08747	null
2025-09-10	Compressing CNN models for resource-constrained systems by channel and layer pruning	Ahmed Sadaqa et.al.	2509.08714	null
2025-09-09	Attribute-based Object Grounding and Robot Grasp Detection with Spatial Reasoning	Houjian Yu et.al.	2509.08126	null
2025-09-09	MVAT: Multi-View Aware Teacher for Weakly Supervised 3D Object Detection	Saad Lahlali et.al.	2509.07507	null
2025-09-09	Network-accelerated Active Messages	Md Ashfaqur Rahaman et.al.	2509.07431	null
2025-09-06	3DPillars: Pillar-based two-stage 3D object detection	Jongyoun Noh et.al.	2509.05780	null
2025-09-06	SpecPrune-VLA: Accelerating Vision-Language-Action Models via Action-Aware Self-Speculative Pruning	Hanzhen Wang et.al.	2509.05614	null
2025-09-05	Characterizing and Optimizing Realistic Workloads on a Commercial Compute-in-SRAM Device	Niansong Zhang et.al.	2509.05451	null
2025-09-05	SpikingBrain Technical Report: Spiking Brain-inspired Large Models	Yuqi Pan et.al.	2509.05276	null
2025-09-04	Guideline-Consistent Segmentation via Multi-Agent Refinement	Vanshika Vats et.al.	2509.04687	null
2025-09-04	A Highly Scalable TDMA for GPUs and Its Application to Flow Solver Optimization	Seungchan Kim et.al.	2509.03933	null
2025-09-04	Real-Time Buoyancy Estimation for AUV Simulations Using Convex Hull-Based Submerged Volume Calculation	Ad-Deen Mahbub et.al.	2509.03804	null
2025-09-03	LuxDiT: Lighting Estimation with Video Diffusion Transformer	Ruofan Liang et.al.	2509.03680	null
2025-09-06	Efficient Virtuoso: A Latent Diffusion Transformer Model for Goal-Conditioned Trajectory Planning	Antonio Guillen-Perez et.al.	2509.03658	null
2025-09-03	Combining Performance and Productivity: Accelerating the Network Sensing Graph Challenge with GPUs and Commodity Data Science Software	Siddharth Samsi et.al.	2509.03653	null
2025-09-03	Can the Waymo Open Motion Dataset Support Realistic Behavioral Modeling? A Validation Study with Naturalistic Trajectories	Yanlin Zhang et.al.	2509.03515	null
2025-09-03	Harnessing Batched BLAS/LAPACK Kernels on GPUs for Parallel Solutions of Block Tridiagonal Systems	David Jin et.al.	2509.03015	null
2025-09-02	Do LLM Modules Generalize? A Study on Motion Generation for Autonomous Driving	Mingyi Wang et.al.	2509.02754	null
2025-09-02	LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference	Krishna Teja Chitty-Venkata et.al.	2509.02753	null
2025-09-02	HydroGAT: Distributed Heterogeneous Graph Attention Transformer for Spatiotemporal Flood Prediction	Aishwarya Sarkar et.al.	2509.02481	null
2025-09-02	AutoDrive-R $^2$ : Incentivizing Reasoning and Self-Reflection Capacity for VLA Model in Autonomous Driving	Zhenlong Yuan et.al.	2509.01944	null
2025-09-01	PointSlice: Accurate and Efficient Slice-Based Representation for 3D Object Detection from Point Clouds	Liu Qifeng et.al.	2509.01487	null
2025-09-01	LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving	Huanqi Hu et.al.	2509.01229	null
2025-09-30	Metis: Training LLMs with FP4 Quantization	Hengjie Cao et.al.	2509.00404	null
2025-08-27	More than Carbon: Cradle-to-Grave environmental impacts of GenAI training on the Nvidia A100 GPU	Sophia Falk et.al.	2509.00093	null
2025-08-29	FLORA: Efficient Synthetic Data Generation for Object Detection in Low-Data Regimes via finetuning Flux LoRA	Alvaro Patricio et.al.	2508.21712	null
2025-09-01	$Δ$ -Motif: Subgraph Isomorphism at Scale via Data-Centric Parallelism	Yulun Wang et.al.	2508.21287	null
2025-09-21	GPU-acceleration of the Discontinuous Galerkin Shallow Water Equations Model (DG-SWEM) with OpenACC	Chayanon Wichitrnithed et.al.	2508.21208	null
2025-08-28	Leveraging Generative Models for Real-Time Query-Driven Text Summarization in Large-Scale Web Search	Zeyu Xiong et.al.	2508.20559	null
2025-08-28	Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation	Jiusi Li et.al.	2508.20471	null
2025-08-28	MedFoundationHub: A Lightweight and Secure Toolkit for Deploying Medical Vision Language Foundation Models	Xiao Li et.al.	2508.20345	null
2025-08-26	APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration	Shaobo Ma et.al.	2508.19087	null
2025-08-26	TaiBai: A fully programmable brain-inspired processor with topology-aware efficiency	Qianpeng Li et.al.	2508.18961	null
2025-08-26	ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive	Xinhao Luo et.al.	2508.18850	null
2025-08-26	Strata: Hierarchical Context Caching for Long Context Language Model Serving	Zhiqiang Xie et.al.	2508.18572	null
2025-08-25	Practical GPU Choices for Earth Observation: ResNet-50 Training Throughput on Integrated, Laptop, and Cloud Accelerators	Ritvik Chaturvedi et.al.	2508.18206	null
2025-08-24	A Synthetic Dataset for Manometry Recognition in Robotic Applications	Pedro Antonio Rabelo Saraiva et.al.	2508.17468	null
2025-08-24	MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models	Krishna Teja Chitty-Venkata et.al.	2508.17467	null
2025-08-23	DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method	Qingwen Zhang et.al.	2508.17054	null
2025-08-23	A Novel Local Focusing Mechanism for Deepfake Detection Generalization	Mingliang Li et.al.	2508.17029	null
2025-08-31	GPT-OSS-20B: A Comprehensive Deployment-Centric Analysis of OpenAI’s Open-Weight Mixture of Experts Model	Deepak Kumar et.al.	2508.16700	null
2025-08-17	GPU Acceleration for Faster Evolutionary Spatial Cyclic Game Systems	Louie Sinadjan et.al.	2508.16639	null
2025-08-22	GreenLLM: SLO-Aware Dynamic Frequency Scaling for Energy-Efficient LLM Serving	Qunyou Liu et.al.	2508.16449	null
2025-08-25	Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars	NVIDIA et.al.	2508.16401	null
2025-08-27	Hybrid Classical-Quantum Supercomputing: A demonstration of a multi-user, multi-QPU and multi-GPU environment	Mateusz Slysz et.al.	2508.16297	null
2025-08-22	Bare-Metal RISC-V + NVDLA SoC for Efficient Deep Learning Inference	Vineet Kumar et.al.	2508.16095	null
2025-08-22	A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection	Qifeng Liu et.al.	2508.16069	null
2025-08-21	graph framework: A Domain Specific Compiler for Building Physics Applications	M. Cianciosa et.al.	2508.15967	null
2025-08-17	Better Together: Leveraging Multiple Digital Twins for Deployment Optimization of Airborne Base Stations	Mauro Belgiovine et.al.	2508.15816	null
2025-09-21	DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians	Cong Wang et.al.	2508.15376	null
2025-08-20	Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds	Jia Lu et.al.	2508.14892	null
2025-08-20	Leveraging Hardware-Aware Computation in Mixed-Precision Matrix Multiply: A Tile-Centric Approach	Qiao Zhang et.al.	2508.14848	null
2025-09-10	Memory-Anchored Multimodal Reasoning for Explainable Video Forensics	Chen Chen et.al.	2508.14581	null
2025-09-03	NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model	NVIDIA et.al.	2508.14444	null
2025-08-19	The 9th AI City Challenge	Zheng Tang et.al.	2508.13564	null
2025-08-18	Optimizing Allreduce Operations for Heterogeneous Architectures with Multiple Processes per GPU	Michael Adams et.al.	2508.13397	null
2025-08-18	X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms	Yueming Yuan et.al.	2508.13337	null
2025-07-28	Sustainable AI Training via Hardware-Software Co-Design on NVIDIA, AMD, and Emerging GPU Architectures	Yashasvi Makin et.al.	2508.13163	null
2025-08-18	CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction	Zhiwei Ning et.al.	2508.12917	null
2025-08-17	CarelessWhisper: Turning Whisper into a Causal Streaming Model	Tomer Krichli et.al.	2508.12301	null
2025-08-17	TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform	Jun Liu et.al.	2508.12279	null
2025-08-17	ATLAS: AI-Native Receiver Test-and-Measurement by Leveraging AI-Guided Search	Mauro Belgiovine et.al.	2508.12204	null
2025-08-16	Load-Balanced Diffusion Monte Carlo Method with Lattice Regularization	Kousuke Nakano et.al.	2508.12033	null
2025-08-18	Visual Perception Engine: Fast and Flexible Multi-Head Inference for Robotic Vision Tasks	Jakub Łucki et.al.	2508.11584	null
2025-08-15	Efficient GPU-Centered Singular Value Decomposition Using the Divide-and-Conquer Method	Shifang Liu et.al.	2508.11467	null
2025-08-15	Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking	Haonan Zhang et.al.	2508.11323	null
2025-08-14	EMLIO: Minimizing I/O Latency and Energy Consumption for Large-Scale AI Training	Hasibul Jamil et.al.	2508.11035	null
2025-08-12	ViPE: Video Pose Engine for 3D Geometric Perception	Jiahui Huang et.al.	2508.10934	null
2025-08-13	GPU accelerated MHD in the DISPATCH framework using directive-based programming	Michael Haahr et.al.	2508.09568	null
2025-08-13	UWBa at SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval	Ladislav Lenc et.al.	2508.09517	null
2025-08-13	Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving	Guangxun Zhu et.al.	2508.09404	null
2025-08-07	Camel: Energy-Aware LLM Inference on Resource-Constrained Devices	Hao Xu et.al.	2508.09173	null
2025-08-12	Profiling Large Language Model Inference on Apple Silicon: A Quantization Perspective	Afsara Benazir et.al.	2508.08531	null
2025-08-11	Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson – Extended	Abhinaba Chakraborty et.al.	2508.08430	null
2025-08-10	Weather-Driven Agricultural Decision-Making Using Digital Twins Under Imperfect Conditions	Tamim Ahmed et.al.	2508.08326	null
2025-08-11	Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions	Bangsheng Tang et.al.	2508.08192	null
2025-08-11	TLV-HGNN: Thinking Like a Vertex for Memory-efficient HGNN Inference	Dengke Han et.al.	2508.07796	null
2025-08-10	An Experimental Exploration of In-Memory Computing for Multi-Layer Perceptrons	Pedro Carrinho et.al.	2508.07317	null
2025-09-06	The Fused Kernel Library: A C++ API to Develop Highly-Efficient GPU Libraries	Oscar Amoros et.al.	2508.07071	null
2025-08-27	From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving	Antonio Guillen-Perez et.al.	2508.07029	null
2025-08-09	A Portable Multi-GPU Solver for Collisional Plasmas with Coulombic Interactions	James Almgren-Bell et.al.	2508.06771	null
2025-08-02	PiKV: KV Cache Management System for Mixture of Experts	Dong Liu et.al.	2508.06526	null
2025-08-08	MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows	Xiquan Li et.al.	2508.06098	null
2025-08-07	CleanUpBench: Embodied Sweeping and Grasping Benchmark	Wenbo Li et.al.	2508.05543	null
2025-08-07	MedMambaLite: Hardware-Aware Mamba for Medical Image Classification	Romina Aalishah et.al.	2508.05049	null
2025-08-07	CSRAP: Enhanced Canvas Attention Scheduling for Real-Time Mission Critical Perception	Md Iftekharul Islam Sakib et.al.	2508.04976	null
2025-08-07	Real-Time Doppler and Ionospheric Dispersion Correction Techniques for Arbitrary Waveforms Utilizing GPU Compute	Daniel J. Vickers et.al.	2508.04951	null
2025-08-05	AIC CTU@FEVER 8: On-premise fact checking through long context RAG	Herbert Ullrich et.al.	2508.04390	null
2025-08-06	A Foundation Model for DAS Signal Recognition and Visual Prompt Tuning of the Pre-trained Model for Downstream Tasks	Kun Gui et.al.	2508.04316	null
2025-08-11	Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems	Luai Abuelsamen et.al.	2508.04146	null
2025-08-05	La La LiDAR: Large-Scale Layout Generation from LiDAR Data	Youquan Liu et.al.	2508.03691	null
2025-09-04	Understanding the Landscape of Ampere GPU Memory Errors	Zhu Zhu et.al.	2508.03513	null
2025-08-05	Full-History Graphs with Edge-Type Decoupled Networks for Temporal Reasoning	Osama Mohammed et.al.	2508.03251	null
2025-08-04	MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models	Wenyuan Liu et.al.	2508.02343	null
2025-08-04	Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images	Philipp Wulff et.al.	2508.02323	null
2025-08-04	CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis	Yuzhuang Xu et.al.	2508.02322	null
2025-08-04	GPU in the Blind Spot: Overlooked Security Risks in Transportation	Sefatun-Noor Puspa et.al.	2508.01995	null
2025-08-03	Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving	Hunter Schofield et.al.	2508.01922	null
2025-08-02	A Parallel Algorithm for Finding Robust Spanners in Large Social Networks	Arindam Khanda et.al.	2508.01485	null
2025-08-01	Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection	Cheng-You Lu et.al.	2508.01014	null
2025-08-01	Optimal Scheduling Algorithms for LLM Inference: Theory and Practice	Agrim Bari et.al.	2508.01002	null
2025-07-29	Forecasting LLM Inference Performance via Hardware-Agnostic Analytical Modeling	Rajeev Patwari et.al.	2508.00904	null
2025-08-12	Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving	Stefan Englmeier et.al.	2508.00589	null
2025-08-09	DGEMM without FP64 Arithmetic – Using FP64 Emulation and FP8 Tensor Cores with Ozaki Scheme	Daichi Mukunoki et.al.	2508.00441	null
2025-08-01	On Learning Closed-Loop Probabilistic Multi-Agent Simulator	Juanwu Lu et.al.	2508.00384	null
2025-08-01	Beamformed 360° Sound Maps: U-Net-Driven Acoustic Source Segmentation and Localization	Belman Jahir Rodriguez et.al.	2508.00307	null
2025-07-31	FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction	Donghyun Lee et.al.	2507.23480	null
2025-07-31	InterfO-RAN: Real-Time In-band Cellular Uplink Interference Detection with GPU-Accelerated dApps	Neagin Neasamoni Santhi et.al.	2507.23177	null
2025-07-30	On the Sustainability of AI Inferences in the Edge	Ghazal Sobhani et.al.	2507.23093	null
2025-07-30	Early Goal-Guided Multi-Scale Fusion for Real-Time Vision-Language Driving	Santosh Patapati et.al.	2507.23042	null
2025-07-28	Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery	Deepak Joshi et.al.	2507.20680	null
2025-07-27	SWIFT: A General Sensitive Weight Identification Framework for Fast Sensor-Transfer Pansharpening	Zeyu Xia et.al.	2507.20311	null
2025-07-26	Racing to Idle: Energy Efficiency of Matrix Multiplication on Heterogeneous CPU and GPU Architectures	Mufakir Qamar Ansari et.al.	2507.20063	null
2025-07-26	A Fast Parallel Median Filtering Algorithm Using Hierarchical Tiling	Louis Sugy et.al.	2507.19926	null
2025-08-02	GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting	Baijun Ye et.al.	2507.19451	null
2025-07-25	TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability	Mohammad Aflah Khan et.al.	2507.19419	null
2025-07-25	LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences	Yusuke Hirota et.al.	2507.19362	null
2025-07-25	SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding Large Language Models	Zhen Wan et.al.	2507.19361	null
2025-07-25	High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins	Lorenzo Cazzella et.al.	2507.19173	null
2025-07-24	SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time	Yun Chen et.al.	2507.18713	null
2025-07-24	Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping	Chong Cheng et.al.	2507.18541	null
2025-07-24	Building an Accelerated OpenFOAM Proof-of-Concept Application using Modern C++	Giulio Malenza et.al.	2507.18268	null
2025-07-26	MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation	Zhongzhen Wen et.al.	2507.17773	null
2025-07-23	BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems	Malsha Ashani Mahawatta Dona et.al.	2507.17722	null
2025-07-24	Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners	Kostas Karakontis et.al.	2507.17519	null
2025-07-25	HuNavSim 2.0: An Enhanced Human Navigation Simulator for Human-Aware Robot Navigation	Miguel Escudero-Jiménez et.al.	2507.17317	null
2025-07-23	GPU Benchmark through QPE Emulator with cuQuantum for Practical Quantum Applications	Takaki Akiba et.al.	2507.17175	null
2025-07-23	JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction	Fangze Lin et.al.	2507.17152	null
2025-07-23	Model Compression Engine for Wearable Devices Skin Cancer Diagnosis	Jacob M. Delgado-López et.al.	2507.17125	null
2025-07-23	Computer Vision for Real-Time Monkeypox Diagnosis on Embedded Systems	Jacob M. Delgado-López et.al.	2507.17123	null
2025-07-22	Cooling Matters: Benchmarking Large Language Models and Vision-Language Models on Liquid-Cooled Versus Air-Cooled H100 GPU Systems	Imran Latif et.al.	2507.16781	null
2025-07-22	AcceleratedKernels.jl: Cross-Architecture Parallel Algorithms from a Unified, Transpiled Codebase	Andrei-Leonard Nicusan et.al.	2507.16710	null
2025-07-22	VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences	Kai Deng et.al.	2507.16443	null
2025-07-21	MSGM: A Multi-Scale Spatiotemporal Graph Mamba for EEG Emotion Recognition	Hanwen Liu et.al.	2507.15914	null
2025-07-30	GALE: Leveraging Heterogeneous Systems for Efficient Unstructured Mesh Data Analysis	Guoxi Liu et.al.	2507.15230	null
2025-07-19	Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall	Shayan Rokhva et.al.	2507.14662	null
2025-07-16	GPU-Accelerated Interpretable Generalization for Rapid Cyberattack Detection and Forensics	Shu-Ting Huang et.al.	2507.14222	null
2025-08-12	CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning	Xiaoya Li et.al.	2507.14111	null
2025-07-23	Photonic Fabric Platform for AI Accelerators	Jing Ding et.al.	2507.14000	null
2025-07-18	Leveraging Multi-Instance GPUs through moldable task scheduling	Jorge Villarrubia et.al.	2507.13601	null
2025-07-17	Performance Portable Gradient Computations Using Source Transformation	Kim Liegeois et.al.	2507.13204	null
2025-07-16	MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding	Renjie Li et.al.	2507.12463	null
2025-07-16	HyDRA: A Hybrid Dual-Mode Network for Closed- and Open-Set RFFI with Optimized VMD	Hanwen Liu et.al.	2507.12133	null
2025-07-16	PoTPTQ: A Two-step Power-of-Two Post-training for LLMs	Xinyu Wang et.al.	2507.11959	null
2025-07-15	MIRAGE: KV Cache Optimization through Parameter Remapping for Multi-tenant LLM Serving	Ruihao Li et.al.	2507.11507	null
2025-07-15	MMStencil: Optimizing High-order Stencils on Multicore CPU using Matrix Unit	Yinuo Wang et.al.	2507.11067	null
2025-07-15	Standards-Compliant DM-RS Allocation via Temporal Channel Prediction for Massive MIMO Systems	Sehyun Ryu et.al.	2507.11064	null
2025-07-15	Modernizing CNN-based Weather Forecast Model towards Higher Computational Efficiency	Minjong Cheon et.al.	2507.10893	null
2025-07-21	Dissecting the NVIDIA Blackwell Architecture with Microbenchmarks	Aaron Jarmusch et.al.	2507.10789	null
2025-07-14	A Benchmarking Framework for AI models in Automotive Aerodynamics	Kaustubh Tangsali et.al.	2507.10747	null
2025-07-14	Quantize-then-Rectify: Efficient VQ-VAE Training	Borui Zhang et.al.	2507.10547	null
2025-07-30	Designing quantum chemistry algorithms with just-in-time compilation	Xiaojie Wu et.al.	2507.09772	null
2025-07-13	GeoWarp: An automatically differentiable and GPU-accelerated implicit MPM framework for geomechanics based on NVIDIA Warp	Yidong Zhao et.al.	2507.09435	null
2025-07-12	Factorization-free Orthogonal Projection onto the Positive Semidefinite Cone with Composite Polynomial Filtering	Shucheng Kang et.al.	2507.09165	null
2025-07-10	Vidyut3d: a GPU accelerated fluid solver for non-equilibrium plasmas on adaptive grids	Hariswaran Sitaraman et.al.	2507.08200	null
2025-07-10	GPUHammer: Rowhammer Attacks on GPU Memories are Practical	Chris S. Lin et.al.	2507.08166	null
2025-07-03	Collective Communication Profiling of Modern-day Machine Learning Workloads	Jit Gupta et.al.	2507.07117	null
2025-07-09	StixelNExT++: Lightweight Monocular Scene Segmentation and Representation for Collective Perception	Marcel Vosshans et.al.	2507.06687	null
2025-07-09	EA: An Event Autoencoder for High-Speed Vision Sensing	Riadul Islam et.al.	2507.06459	null
2025-07-08	CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation	Kushal Gajjar et.al.	2507.06013	null
2025-07-07	Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model	Mengyao Xu et.al.	2507.05513	null
2025-07-07	Robotic System with AI for Real Time Weed Detection, Canopy Aware Spraying, and Droplet Pattern Evaluation	Inayat Rasool et.al.	2507.05432	null
2025-07-23	Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms	Zhiyi Hu et.al.	2507.04786	null
2025-07-05	ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments	Guile Wu et.al.	2507.03886	null
2025-07-24	Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps	Chong Cheng et.al.	2507.03737	null
2025-07-03	NVIDIA GPU Confidential Computing Demystified	Zhongshu Gu et.al.	2507.02770	null
2025-07-03	Flotilla: A scalable, modular and resilient federated learning framework for heterogeneous resources	Roopkatha Banerjee et.al.	2507.02295	null
2025-07-02	SAKURAONE: Empowering Transparent and Open AI Platforms through Private-Sector HPC Investment in Japan	Fumikazu Konishi et.al.	2507.02124	null
2025-07-02	Deep Recommender Models Inference: Automatic Asymmetric Data Flow Optimization	Giuseppe Ruggeri et.al.	2507.01676	null
2025-06-20	PyTorch-based Geometric Learning with Non-CUDA Processing Units: Experiences from Intel Gaudi-v2 HPUs	Fanchen Bu et.al.	2507.01031	null
2025-07-01	Anatomy of High-Performance Column-Pivoted QR Decomposition	Maksim Melnichenko et.al.	2507.00976	null
2025-07-01	Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms	Zain Taufique et.al.	2507.00491	null
2025-07-01	Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs	Mohammad Firas Sada et.al.	2507.00418	null
2025-07-01	Question Decomposition for Retrieval-Augmented Generation	Paul J. L. Ammann et.al.	2507.00355	null
2025-06-24	AdaDeDup: Adaptive Hybrid Data Pruning for Efficient Large-Scale Object Detection Training	Feiyang Kang et.al.	2507.00049	null
2025-06-30	Towards Building Private LLMs: Exploring Multi-Node Expert Parallelism on Apple Silicon for Mixture-of-Experts Large Language Model	Mu-Chi Chen et.al.	2506.23635	null
2025-06-30	Risk-Based Filtering of Valuable Driving Situations in the Waymo Open Motion Dataset	Tim Puphal et.al.	2506.23433	null
2025-06-29	CMOS+X: Stacking Persistent Embedded Memories based on Oxide Transistors upon GPGPU Platforms	Faaiq Waqar et.al.	2506.23405	null
2025-06-28	FF-INT8: Efficient Forward-Forward DNN Training on Edge Devices with INT8 Precision	Jingxiao Ma et.al.	2506.22771	null
2025-06-27	Quantum-Classical Auxiliary Field Quantum Monte Carlo with Matchgate Shadows on Trapped Ion Quantum Computers	Luning Zhao et.al.	2506.22408	null
2025-06-27	MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism	Zheng Zhang et.al.	2506.22175	null
2025-06-27	MCFuser: High-Performance and Rapid Fusion of Memory-Bound Compute-Intensive Operators	Zheng Zhang et.al.	2506.22169	null
2025-07-08	BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting	Zipei Ma et.al.	2506.22099	null
2025-06-27	SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model	Shuhan Tan et.al.	2506.21976	null
2025-06-23	TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge	Zhiyuan Zhang et.al.	2506.21618	null
2025-06-26	SAM4D: Segment Anything in Camera and LiDAR Streams	Jianyun Xu et.al.	2506.21547	null
2025-06-26	Portable High-Performance Kernel Generation for a Computational Fluid Dynamics Code with DaCe	Måns I. Andersson et.al.	2506.20994	null
2025-06-25	Characterization and Mitigation of Training Instabilities in Microscaling Formats	Huangyuan Su et.al.	2506.20752	null
2025-06-24	MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models	Hoa La et.al.	2506.20686	null
2025-06-25	SuperSONIC: Cloud-Native Infrastructure for ML Inferencing	Dmitry Kondratyev et.al.	2506.20657	null
2025-06-25	Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking	Ben Kang et.al.	2506.20381	null
2025-06-24	Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification	Minghao Qin et.al.	2506.19225	null
2025-06-23	Let Your Video Listen to Your Music!	Xinyu Zhang et.al.	2506.18881	null
2025-06-23	Cross-Architecture Knowledge Distillation (KD) for Retinal Fundus Image Anomaly Detection on NVIDIA Jetson Nano	Berk Yilmaz et.al.	2506.18220	null
2025-06-22	AMD Versal Implementations of FAM and SSCA Estimators	Carol Jingyi Li et.al.	2506.18003	null
2025-06-20	Code Generation for Near-Roofline Finite Element Actions on GPUs from Symbolic Variational Forms	Kaushik Kulkarni et.al.	2506.17471	null
2025-06-19	VideoGAN-based Trajectory Proposal for Automated Vehicles	Annajoyce Mariani et.al.	2506.16209	null
2025-06-19	Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs	Xun Wang et.al.	2506.16196	null
2025-06-19	HetGPU: The pursuit of making binary compatibility towards GPUs	Yiwei Yang et.al.	2506.15993	null
2025-06-18	Early Attentive Sparsification Accelerates Neural Speech Transcription	Zifei Xu et.al.	2506.15912	null
2025-06-18	UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting	Kai He et.al.	2506.15673	null
2025-06-18	Engineering Supercomputing Platforms for Biomolecular Applications	Robert Welch et.al.	2506.15585	null
2025-07-30	Automated MRI Tumor Segmentation using hybrid U-Net with Transformer and Efficient Attention	Syed Haider Ali et.al.	2506.15562	null
2025-06-18	Align Your Flow: Scaling Continuous-Time Flow Map Distillation	Amirmojtaba Sabour et.al.	2506.14603	null
2025-06-18	Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models	Xuanchi Ren et.al.	2506.09042	null
2025-06-10	Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions	David Acuna et.al.	2506.08927	null
2025-07-18	Controllable Weather Synthesis and Removal with Video Diffusion Models	Chih-Hao Lin et.al.	2505.00704	null
2025-04-21	LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception	Yuan-Hong Liao et.al.	2504.15362	null
2025-04-15	PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond	Minghua Liu et.al.	2504.11451	null
2025-04-17	VideoPanda: Video Panoramic Diffusion with Multi-view Attention	Kevin Xie et.al.	2504.11389	null
2025-05-20	Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning	NVIDIA et.al.	2503.15558	null
2025-04-03	Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control	NVIDIA et.al.	2503.14492	null
2025-03-05	GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control	Xuanchi Ren et.al.	2503.03751	null
2025-03-03	Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models	Jay Zhangjie Wu et.al.	2503.01774	null
2025-03-22	DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models	Ruofan Liang et.al.	2501.18590	null
2025-07-11	Cosmos World Foundation Model Platform for Physical AI	NVIDIA et.al.	2501.03575	null
2025-06-26	InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models	Yifan Lu et.al.	2412.03934	null
2025-04-01	Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos	Hanxue Liang et.al.	2412.03526	null
2024-11-14	LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models	Zhengyi Wang et.al.	2411.09595	null
2025-02-28	ReMatching Dynamic Reconstruction Flow	Sara Oblak et.al.	2411.00705	null
2024-10-29	SCube: Instant Large-Scale Scene Reconstruction using VoxSplats	Xuanchi Ren et.al.	2410.20030	null
2025-02-11	SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes	Tianchang Shen et.al.	2409.20562	null
2024-09-28	G3R: Gradient Guided Generalizable Reconstruction	Yun Chen et.al.	2409.19405	null
2024-09-27	UniCal: Unified Neural Sensor Calibration	Ze Yang et.al.	2409.18953	null
2024-09-26	Learning to Drive via Asymmetric Self-Play	Chris Zhang et.al.	2409.18218	null
2024-09-17	Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models	Yuan-Hong Liao et.al.	2409.09788	null
2025-04-22	OmniRe: Omni Urban Scene Reconstruction	Ziyu Chen et.al.	2408.16760	null
2024-08-19	Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering	Ruofan Liang et.al.	2408.09702	null
2025-03-20	Wolf: Dense Video Captioning with a World Summarization Framework	Boyi Li et.al.	2407.18908	null
2024-07-23	Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation	Xiaoyang Wu et.al.	2407.15282	null
2024-07-15	SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation	Jordan Juravsky et.al.	2407.10481	null
2024-10-10	3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes	Nicolas Moenne-Loccoz et.al.	2407.07090	null
2024-07-01	fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence	Francis Williams et.al.	2407.01781	null
2024-10-31	DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features	Letian Wang et.al.	2406.12095	null
2024-08-08	Nemotron-4 340B Technical Report	Nvidia et.al.	2406.11704	null
2024-06-14	L4GM: Large 4D Gaussian Reconstruction Model	Jiawei Ren et.al.	2406.10324	null
2024-06-12	UnO: Unsupervised Occupancy Fields for Perception and Forecasting	Ben Agro et.al.	2406.08691	null
2024-06-13	Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata	Dongsu Zhang et.al.	2406.08292	null
2024-06-14	DeTra: A Unified Model for Object Detection and Trajectory Forecasting	Sergio Casas et.al.	2406.04426	null
2024-05-13	Lowering Barriers to Entry for Fully-Integrated Custom Payloads on a DJI Matrice	Joshua Springer et.al.	2405.06176	null
2024-04-24	NeRF-XL: Scaling NeRFs with Multiple GPUs	Ruilong Li et.al.	2404.16221	null
2024-04-24	Align Your Steps: Optimizing Sampling Schedules in Diffusion Models	Amirmojtaba Sabour et.al.	2404.14507	null
2024-04-16	RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting	Ashkan Mirzaei et.al.	2404.10765	null
2025-05-27	Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves?	Yuan-Hong Liao et.al.	2404.06510	null
2024-04-01	QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving	Sourav Biswas et.al.	2404.01486	null
2024-03-22	LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis	Kevin Xie et.al.	2403.15385	null
2024-03-22	Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks	Aqeel Anwar et.al.	2403.15370	null
2024-01-22	EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models	Koichi Namekata et.al.	2401.11739	null
2024-04-05	Network Anatomy and Real-Time Measurement of Nvidia GeForce NOW Cloud Gaming	Minzhao Lyu et.al.	2401.06366	null
2023-12-28	Compact Neural Graphics Primitives with Learned Hash Probing	Towaki Takikawa et.al.	2312.17241	null
2024-01-03	Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models	Huan Ling et.al.	2312.13763	null
2023-12-11	LightSim: Neural Lighting Simulation for Urban Scenes	Ava Pun et.al.	2312.06654	null
2024-04-16	Trajeglish: Traffic Modeling as Next-Token Prediction	Jonah Philion et.al.	2312.04535	null
2024-06-25	XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies	Xuanchi Ren et.al.	2312.03806	null
2024-04-12	WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space	Katja Schwarz et.al.	2311.13570	null
2023-11-16	Adaptive Shells for Efficient Neural Radiance Field Rendering	Zian Wang et.al.	2311.10091	null
2023-11-09	Real-Time Neural Rasterization for Large Scenes	Jeffrey Yunfan Liu et.al.	2311.05607	null
2023-11-09	Reconstructing Objects in-the-wild for Realistic Sensor Simulation	Ze Yang et.al.	2311.05602	null
2023-11-07	3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features	Chenfeng Xu et.al.	2311.04391	null
2023-11-03	EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision	Jiawei Yang et.al.	2311.02077	null
2023-11-03	Towards Unsupervised Object Detection From LiDAR Point Clouds	Lunjun Zhang et.al.	2311.02007	null
2023-11-06	MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory	Enxu Li et.al.	2311.01556	null
2023-11-17	4D-Former: Multimodal 4D Panoptic Segmentation	Ali Athar et.al.	2311.01520	null
2023-11-02	UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation	Yuwen Xiong et.al.	2311.01448	null
2023-11-02	CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation	Jingkang Wang et.al.	2311.01447	null
2023-11-02	Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation	Jay Sarva et.al.	2311.01446	null
2023-11-02	LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds	Anqi Joyce Yang et.al.	2311.01444	null
2023-11-02	Learning Realistic Traffic Agents in Closed-loop	Chris Zhang et.al.	2311.01394	null
2024-04-01	Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion	Lunjun Zhang et.al.	2311.01017	null
2024-01-26	ViR: Towards Efficient Vision Retention Backbones	Ali Hatamizadeh et.al.	2310.19731	null
2024-03-18	An Open, Programmable, Multi-vendor 5G O-RAN Testbed with NVIDIA ARC and OpenAirInterface	Davide Villa et.al.	2310.17062	null
2023-10-20	TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models	Tianshi Cao et.al.	2310.13772	null
2023-09-20	A Digital Forensics Case Study of the DJI Mini 3 Pro and DJI RC	Aaron Taylor et.al.	2309.10487	null
2023-09-13	Behind The Wings: The Case of Reverse Engineering and Drone Hijacking in DJI Enhanced Wi-Fi Protocol	Derry Pratama et.al.	2309.05913	null
2023-09-11	Towards Viewpoint Robustness in Bird’s Eye View Segmentation	Tzofi Klinghoffer et.al.	2309.05192	null
2023-08-10	Flexible Isosurface Extraction for Gradient-Based Mesh Optimization	Tianchang Shen et.al.	2308.05371	null
2023-08-03	UniSim: A Neural Closed-Loop Sensor Simulator	Ze Yang et.al.	2308.01898	null
2023-08-02	Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving	Ben Agro et.al.	2308.01471	null
2023-07-14	DreamTeacher: Pretraining Image Backbones with Deep Generative Models	Daiqing Li et.al.	2307.07487	null
2023-06-27	Rethinking Closed-loop Training for Autonomous Driving	Chris Zhang et.al.	2306.15713	null
2023-06-22	Multiverse Transformer: 1st Place Solution for Waymo Open Sim Agents Challenge 2023	Yu Wang et.al.	2306.11868	null
2023-06-06	ATT3D: Amortized Text-to-3D Object Synthesis	Jonathan Lorraine et.al.	2306.07349	null
2023-06-09	Neural Kernel Surface Reconstruction	Jiahui Huang et.al.	2305.19590	null
2023-08-13	Neural LiDAR Fields for Novel View Synthesis	Shengyu Huang et.al.	2305.01643	null
2023-04-19	NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models	Seung Wook Kim et.al.	2304.09787	null
2023-12-28	Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models	Andreas Blattmann et.al.	2304.08818	null
2023-04-06	Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes	Zian Wang et.al.	2304.03266	null
2023-04-04	Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion	Davis Rempe et.al.	2304.01893	null
2023-03-25	VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion	Yiming Li et.al.	2302.12251	null
2023-02-09	Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting	Viraj Prabhu et.al.	2302.04832	null
2023-02-02	Synthesizing Physical Character-Scene Interactions	Mohamed Hassan et.al.	2302.00883	null
2023-01-31	PADL: Language-Directed Physics-Based Character Control	Jordan Juravsky et.al.	2301.13868	null
2022-12-19	Collision Avoidance Testing of the Waymo Automated Driving System	Kristofer D. Kusano et.al.	2212.08148	null
2023-03-25	Magic3D: High-Resolution Text-to-3D Content Creation	Chen-Hsuan Lin et.al.	2211.10440	null
2022-11-08	GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting	Alexander Cui et.al.	2211.02545	null
2022-10-12	LION: Latent Point Diffusion Models for 3D Shape Generation	Xiaohui Zeng et.al.	2210.06978	null
2022-10-06	XDGAN: Multi-Modal 3D Shape Generation in 2D Space	Hassan Abu Alhaija et.al.	2210.03007	null
2022-10-03	Optimizing Data Collection for Machine Learning	Rafid Mahmood et.al.	2210.01234	null
2022-09-26	EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations	Ahmad Darkhalil et.al.	2209.13064	null
2022-09-22	GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images	Jun Gao et.al.	2209.11163	null
2022-09-22	MTR-A: 1st Place Solution for 2022 Waymo Open Dataset Challenge – Motion Prediction	Shaoshuai Shi et.al.	2209.10033	null
2022-08-19	Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion	Zian Wang et.al.	2208.09480	null
2022-08-18	MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation	Gopal Sharma et.al.	2208.08580	null
2022-07-25	DJI drone IDs are not encrypted	Conner Bender et.al.	2207.10795	null
2022-07-12	MT-Net Submission to the Waymo 3D Detection Leaderboard	Shaoxiang Chen et.al.	2207.04781	null
2022-07-05	Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention	Gary Leung et.al.	2207.02126	null
2022-07-13	How Much More Data Do I Need? Estimating Requirements for Downstream Tasks	Rafid Mahmood et.al.	2207.01725	null
2022-06-19	Scalable Neural Data Server: A Data Recommender for Transfer Learning	Tianshi Cao et.al.	2206.09386	null
2022-06-16	Virtual Correspondence: Humans as a Cue for Extreme-View Geometry	Wei-Chiu Ma et.al.	2206.08365	null
2022-06-15	Variable Bitrate Neural Fields	Towaki Takikawa et.al.	2206.07707	null
2022-06-06	Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps	Seung Wook Kim et.al.	2206.02903	null
2022-05-05	ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters	Xue Bin Peng et.al.	2205.01906	null
2022-04-19	M $^2$ BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation	Enze Xie et.al.	2204.05088	null
2022-04-08	AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis	Zhiqin Chen et.al.	2204.03105	null
2021-11-16	Towards Optimal Strategies for Training Self-Driving Perception Models in Simulation	David Acuna et.al.	2111.07971	null
2021-07-07	NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation	Xiaohui Zeng et.al.	2106.13435	null
2023-03-08	Low Budget Active Learning via Wasserstein Distance: An Integer Programming Approach	Rafid Mahmood et.al.	2106.02968	null
2021-05-03	DriveGAN: Towards a Controllable High-Quality Neural Simulation	Seung Wook Kim et.al.	2104.15060	null
2021-04-27	Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets	Yuan-Hong Liao et.al.	2104.12690	null
2021-04-07	gradSim: Differentiable simulation for system identification and visuomotor control	Krishna Murthy Jatavallabhula et.al.	2104.02646	null
2021-01-21	IntentNet: Learning to Predict Intention from Raw Sensor Data	Sergio Casas et.al.	2101.07907	null
2021-01-19	MP3: A Unified Model to Map, Perceive, Predict and Plan	Sergio Casas et.al.	2101.06806	null
2020-12-24	Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net	Wenjie Luo et.al.	2012.12395	null
2020-12-23	HDNET: Exploiting HD Maps for 3D Object Detection	Bin Yang et.al.	2012.11704	null
2021-01-18	End-to-End Deep Structured Models for Drawing Crosswalks	Justin Liang et.al.	2012.11585	null
2020-12-22	Deep Continuous Fusion for Multi-Sensor 3D Object Detection	Ming Liang et.al.	2012.10992	null
2020-12-15	A PAC-Bayesian Approach to Generalization Bounds for Graph Neural Networks	Renjie Liao et.al.	2012.07690	null
2020-11-03	Waymo’s Safety Methodologies and Safety Readiness Determinations	Nick Webb et.al.	2011.00054	null
2020-11-03	Waymo Public Road Safety Performance Data	Matthew Schwall et.al.	2011.00038	null
2020-08-21	Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation	Jeevan Devaranjan et.al.	2008.09092	null
2020-08-14	Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D	Jonah Philion et.al.	2008.05711	null
2020-12-01	LoCo: Local Contrastive Representation Learning	Yuwen Xiong et.al.	2008.01342	null
2020-06-30	2nd Place Solution for Waymo Open Dataset Challenge – 2D Object Detection	Sijia Chen et.al.	2006.15507	null
2020-05-26	Learning to Simulate Dynamic Environments with GameGAN	Seung Wook Kim et.al.	2005.12126	null
2020-04-21	Learning to Evaluate Perception Models Using Planner-Centric Metrics	Jonah Philion et.al.	2004.08745	null
2020-04-02	Neural Data Server: A Large-Scale Search Engine for Transfer Learning Data	Xi Yan et.al.	2001.02799	null
2019-10-28	CrevNet: Conditionally Reversible Video Prediction	Wei Yu et.al.	1910.11577	null
2020-02-14	Learning to Remember from a Multi-Task Teacher	Yuwen Xiong et.al.	1910.04650	null
2019-09-30	DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation	Xiaohui Zeng et.al.	1909.12471	null
2022-06-22	A Theoretical Analysis of the Number of Shots in Few-Shot Learning	Tianshi Cao et.al.	1909.11722	null
2019-08-13	DSIC: Deep Stereo Image Compression	Jerry Liu et.al.	1908.03631	null
2019-08-21	Video Face Clustering with Unknown Number of Clusters	Makarand Tapaswi et.al.	1908.03381	null
2019-05-16	DARNet: Deep Active Ray Network for Building Segmentation	Dominic Cheng et.al.	1905.05889	null
2020-11-13	DeepSignals: Predicting Intent of Drivers Through Visual Signals	Davi Frossard et.al.	1905.01333	null
2019-06-11	Devil is in the Edges: Learning Semantic Boundaries from Noisy Annotations	David Acuna et.al.	1904.07934	null
2019-03-05	PIXOR: Real-time 3D Object Detection from Point Clouds	Bin Yang et.al.	1902.06326	null
2018-12-05	A Face-to-Face Neural Conversation Model	Hang Chu et.al.	1812.01525	null
2018-12-05	SurfConv: Bridging 3D and 2D Convolution for RGBD Images	Hang Chu et.al.	1812.01519	null
2019-03-22	Learning to Caption Images through a Lifetime by Asking Questions	Kevin Shen et.al.	1812.00235	null
2018-10-24	A Neural Compositional Paradigm for Image Captioning	Bo Dai et.al.	1810.09630	null
2018-10-16	Pose Estimation for Objects with Rotational Symmetry	Enric Corona et.al.	1810.05780	null
2020-11-13	End-to-end Learning of Multi-sensor 3D Tracking by Detection	Davi Frossard et.al.	1806.11534	null
2017-10-23	Be Your Own Prada: Fashion Synthesis with Structural Coherence	Shizhan Zhu et.al.	1710.07346	null
2017-08-16	Situation Recognition with Graph Neural Networks	Ruiyu Li et.al.	1708.04320	null
2018-07-31	VSE++: Improving Visual-Semantic Embeddings with Hard Negatives	Fartash Faghri et.al.	1707.05612	null
2017-11-15	Few-Shot Learning Through an Information Retrieval Lens	Eleni Triantafillou et.al.	1707.02610	null
2017-06-06	Teaching Machines to Describe Images via Natural Language Feedback	Huan Ling et.al.	1706.00130	null
2017-04-20	Annotating Object Instances with a Polygon-RNN	Lluis Castrejon et.al.	1704.05548	null
2017-08-14	Towards Diverse and Natural Image Descriptions via a Conditional GAN	Bo Dai et.al.	1703.06029	null
2016-12-02	TorontoCity: Seeing the World with a Million Eyes	Shenlong Wang et.al.	1612.00423	null
2017-05-08	Deep Watershed Transform for Instance Segmentation	Min Bai et.al.	1611.08303	null
2016-11-14	Song From PI: A Musically Plausible Network for Pop Music Generation	Hang Chu et.al.	1611.03477	null
2016-11-11	Efficient Summarization with Read-Again and Copy Mechanism	Wenyuan Zeng et.al.	1611.03382	null
2017-04-26	3D Object Proposals using Stereo Imagery for Accurate Object Class Detection	Xiaozhi Chen et.al.	1608.07711	null
2016-06-24	Find your Way by Observing the Sun and Other Semantic Cues	Wei-Chiu Ma et.al.	1606.07415	null
2016-04-12	Soccer Field Localization from a Single Image	Namdar Homayounfar et.al.	1604.02715	null
2016-08-01	vDNN: Virtualized Deep Neural Networks for Scalable, Memory-Efficient Neural Network Design	Minsoo Rhu et.al.	1602.08124	null
2016-04-28	Instance-Level Segmentation for Autonomous Driving with Deep Densely Connected MRFs	Ziyu Zhang et.al.	1512.06735	null
2016-09-22	MovieQA: Understanding Stories in Movies through Question-Answering	Makarand Tapaswi et.al.	1512.02902	null
2016-03-02	Order-Embeddings of Images and Language	Ivan Vendrov et.al.	1511.06361	null
2015-06-23	Skip-Thought Vectors	Ryan Kiros et.al.	1506.06726	null
2015-06-23	Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books	Yukun Zhu et.al.	1506.06724	null
2015-12-21	Monocular Object Instance Segmentation and Depth Ordering with CNNs	Ziyu Zhang et.al.	1505.03159	null
2015-03-10	Fully Connected Deep Structured Networks	Alexander G. Schwing et.al.	1503.02351	null
2015-03-03	Generating Multi-Sentence Lingual Descriptions of Indoor Scenes	Dahua Lin et.al.	1503.00064	null
2015-02-17	segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection	Yukun Zhu et.al.	1502.04275	null
2015-02-09	A Framework for Symmetric Part Detection in Cluttered Scenes	Tom Lee et.al.	1502.01761	null
2014-08-26	Learning a Hierarchical Compositional Shape Vocabulary for Multi-class Object Representation	Sanja Fidler et.al.	1408.5516	null
2014-12-30	FollowMe: Efficient Online Min-Cost Flow Tracking with Bounded Memory and Computation	Philip Lenz et.al.	1407.6251	null
2014-06-17	Human-Machine CRFs for Identifying Bottlenecks in Holistic Scene Understanding	Roozbeh Mottaghi et.al.	1406.3906	null
2014-06-10	Detect What You Can: Detecting and Representing Objects using Holistic Models and Body Parts	Xianjie Chen et.al.	1406.2031	null
2012-07-02	Efficient Structured Prediction with Latent Variables for General Graphical Models	Alexander Schwing et.al.	1206.6436	null
2012-06-18	Multi-View Learning in the Presence of View Disagreement	C. Christoudias et.al.	1206.3242	null
2012-04-09	Continuous Markov Random Fields for Robust Stereo Estimation	Koichiro Yamaguchi et.al.	1204.1393	null
2012-07-10	Approximated Structured Prediction for Learning Large Scale Graphical Models	Tamir Hazan et.al.	1006.2899	null

Autonomous Driving

Publish Date	Title	Authors	PDF	Code
2025-12-09	Astra: General Interactive World Model with Autoregressive Denoising	Yixuan Zhu et.al.	2512.08931	null
2025-12-09	A Multi-Agent LLM Framework for Design Space Exploration in Autonomous Driving Systems	Po-An Shih et.al.	2512.08476	null
2025-12-09	Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection	Haowen Zheng et.al.	2512.08247	null
2025-12-09	Accuracy Does Not Guarantee Human-Likeness in Monocular Depth Estimators	Yuki Kubota et.al.	2512.08163	null
2025-12-08	DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving	Jialv Zou et.al.	2512.07745	null
2025-12-08	Optimization-Guided Diffusion for Interactive Scene Generation	Shiaho Li et.al.	2512.07661	null
2025-12-08	VP-AutoTest: A Virtual-Physical Fusion Autonomous Driving Testing Platform	Yiming Cui et.al.	2512.07507	null
2025-12-08	Towards Reliable Test-Time Adaptation: Style Invariance as a Correctness Likelihood	Gilhyun Nam et.al.	2512.07390	null
2025-12-08	Unified Camera Positional Encoding for Controlled Video Generation	Cheng Zhang et.al.	2512.07237	null
2025-12-09	TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning	Zebin Xing et.al.	2512.07135	null
2025-12-08	Mimir: Hierarchical Goal-Driven Diffusion with Uncertainty Propagation for End-to-End Autonomous Driving	Zebin Xing et.al.	2512.07130	null
2025-12-07	Spatial Retrieval Augmented Autonomous Driving	Xiaosong Jia et.al.	2512.06865	null
2025-12-07	SparseCoop: Cooperative Perception with Kinematic-Grounded Queries	Jiahao Wang et.al.	2512.06838	null
2025-12-07	FedDSR: Federated Deep Supervision and Regularization Towards Autonomous Driving	Wei-Bin Kou et.al.	2512.06676	null
2025-12-07	Statistic-Augmented, Decoupled MoE Routing and Aggregating in Autonomous Driving	Wei-Bin Kou et.al.	2512.06664	null
2025-12-06	Are AI-Generated Driving Videos Ready for Autonomous Driving? A Diagnostic Evaluation Framework	Xinhao Xiang et.al.	2512.06376	null
2025-12-06	Beyond Hallucinations: A Multimodal-Guided Task-Aware Generative Image Compression for Ultra-Low Bitrate	Kaile Wang et.al.	2512.06344	null
2025-12-06	NexusFlow: Unifying Disparate Tasks under Partial Supervision via Invertible Flow Networks	Fangzhou Lin et.al.	2512.06251	null
2025-12-05	Situation-Aware Interactive MPC Switching for Autonomous Driving	Shuhao Qi et.al.	2512.06182	null
2025-12-05	WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving	Yifang Xu et.al.	2512.06112	null
2025-12-05	BeLLA: End-to-End Birds Eye View Large Language Assistant for Autonomous Driving	Karthik Mohan et.al.	2512.06096	null
2025-12-05	Representation Learning for Point Cloud Understanding	Siming Yan et.al.	2512.06058	null
2025-12-04	Closed-Loop Robotic Manipulation of Transparent Substrates for Self-Driving Laboratories using Deep Learning Micro-Error Correction	Kelsey Fontenot et.al.	2512.06038	null
2025-12-05	OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors Reasoning	Xusheng Guo et.al.	2512.05698	null
2025-12-05	Scenario-aware Uncertainty Quantification for Trajectory Prediction with Statistical Guarantees	Yiming Shu et.al.	2512.05682	null
2025-12-05	Concept-based Explainable Data Mining with VLM for 3D Detection	Mai Tsujimoto et.al.	2512.05482	null
2025-12-05	State-Conditional Adversarial Learning: An Off-Policy Visual Domain Transfer Method for End-to-End Imitation Learning	Yuxiang Liu et.al.	2512.05335	null
2025-12-04	From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model	Kevin Cannons et.al.	2512.05277	null
2025-12-04	ShadowDraw: From Any Object to Shadow-Drawing Compositional Art	Rundong Luo et.al.	2512.05110	null
2025-12-08	FASTer: Toward Efficient Autoregressive Vision Language Action Modeling via Neural Action Tokenization	Yicheng Liu et.al.	2512.04952	null
2025-12-04	FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis	Shijie Chen et.al.	2512.04830	null
2025-12-09	MT-Depth: Multi-task Instance feature analysis for the Depth Completion	Abdul Haseeb Nizamani et.al.	2512.04734	null
2025-12-04	E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving	Yihong Tang et.al.	2512.04733	null
2025-12-04	dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning	Yingzi Ma et.al.	2512.04459	null
2025-12-08	MindDrive: An All-in-One Framework Bridging World Models and Vision-Language Model for End-to-End Autonomous Driving	Bin Sun et.al.	2512.04441	null
2025-12-09	RoboBPP: Benchmarking Robotic Online Bin Packing with Physics-based Simulation	Zhoufeng Wang et.al.	2512.04415	null
2025-12-03	Driving Beyond Privilege: Distilling Dense-Reward Knowledge into Sparse-Reward Policies	Feeza Khan Khanzada et.al.	2512.04279	null
2025-12-03	Fast & Efficient Normalizing Flows and Applications of Image Generative Models	Sandeep Nagar et.al.	2512.04039	null
2025-12-03	Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation	Hang Xu et.al.	2512.03996	null
2025-12-03	DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation	Zexin Lin et.al.	2512.03992	null
2025-12-03	Driving is a Game: Combining Planning and Prediction with Bayesian Iterative Best Response	Aron Distelzweig et.al.	2512.03936	null
2025-12-03	Digital Twin-based Control Co-Design of Full Vehicle Active Suspensions via Deep Reinforcement Learning	Ying-Kuan Tsai et.al.	2512.03891	null
2025-12-03	A Modular Architecture Design for Autonomous Driving Racing in Controlled Environments	Brais Fontan-Costas et.al.	2512.03886	null
2025-12-03	MPCFormer: A physics-informed data-driven approach for explainable socially-aware autonomous driving	Jia Hu et.al.	2512.03795	null
2025-12-03	Safety Reinforced Model Predictive Control (SRMPC): Improving MPC with Reinforcement Learning for Motion Planning in Autonomous Driving	Johannes Fischer et.al.	2512.03774	null
2025-12-03	Context-Triggered Contingency Games for Strategic Multi-Agent Interaction	Kilian Schweppe et.al.	2512.03639	null
2025-12-03	Multimodal Control of Manipulators: Coupling Kinematics and Vision for Self-Driving Laboratory Operations	Shifa Sulaiman et.al.	2512.03630	null
2025-12-03	CSMapping: Scalable Crowdsourced Semantic Mapping and Topology Inference for Autonomous Driving	Zhijian Qiao et.al.	2512.03510	null
2025-12-03	Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles	Haicheng Liao et.al.	2512.03454	null
2025-12-03	NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction	Thomas Monninger et.al.	2512.03317	null
2025-12-02	Flux4D: Flow-based Unsupervised 4D Reconstruction	Jingkang Wang et.al.	2512.03210	null
2025-12-02	DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images	Xiaoxue Chen et.al.	2512.03004	null
2025-12-02	U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences	Xiang Xu et.al.	2512.02982	null
2025-12-02	EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis	Yancheng Zhang et.al.	2512.02932	null
2025-12-02	VLM as Strategist: Adaptive Generation of Safety-critical Testing Scenarios via Guided Diffusion	Xinzheng Wu et.al.	2512.02844	null
2025-12-02	CogDrive: Cognition-Driven Multimodal Prediction-Planning Fusion for Safe Autonomy	Heye Huang et.al.	2512.02777	null
2025-12-02	ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data	Yuxing Liu et.al.	2512.02686	null
2025-12-02	nuScenes Revisited: Progress and Challenges in Autonomous Driving	Whye Kit Fong et.al.	2512.02448	null
2025-12-02	Vehicle Dynamics Embedded World Models for Autonomous Driving	Huiqian Li et.al.	2512.02417	null
2025-12-02	Multi-Domain Enhanced Map-Free Trajectory Prediction with Selective Attention	Wenyi Xiong et.al.	2512.02368	null
2025-12-01	Data-Centric Visual Development for Self-Driving Labs	Anbang Liu et.al.	2512.02018	null
2025-12-01	Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion	Shaowei Liu et.al.	2512.02017	null
2025-12-01	RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies	Guillermo Garcia-Cobo et.al.	2512.01993	null
2025-12-01	Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory	Chenyi Wang et.al.	2512.01934	null
2025-12-02	OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic	Songyan Zhang et.al.	2512.01830	null
2025-12-01	Integrated YOLOP Perception and Lyapunov-based Control for Autonomous Mobile Robot Navigation on Track	Mo Chen et.al.	2512.01608	null
2025-12-01	Language-Guided Open-World Anomaly Segmentation	Klara Reichard et.al.	2512.01427	null
2025-12-01	OpenBox: Annotate Any Bounding Boxes in 3D	In-Jae Lee et.al.	2512.01352	null
2025-11-30	TrajDiff: End-to-end Autonomous Driving without Perception Annotation	Xingtai Gui et.al.	2512.00723	null
2025-11-29	HAVEN: Hierarchical Adversary-aware Visibility-Enabled Navigation with Cover Utilization using Deep Transformer Q-Networks	Mihir Chauhan et.al.	2512.00592	null
2025-12-02	LAP: Fast LAtent Diffusion Planner with Fine-Grained Feature Distillation for Autonomous Driving	Jinhao Zhang et.al.	2512.00470	null
2025-11-29	FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal	Hang Xu et.al.	2512.00438	null
2025-11-29	EZ-SP: Fast and Lightweight Superpoint-Based 3D Segmentation	Louis Geist et.al.	2512.00385	null
2025-11-23	PEFT-DML: Parameter-Efficient Fine-Tuning Deep Metric Learning for Robust Multi-Modal 3D Object Detection in Autonomous Driving	Abdolazim Rezaei et.al.	2512.00060	null
2025-11-28	SimScale: Learning to Drive via Real-World Simulation at Scale	Haochen Tian et.al.	2511.23369	null
2025-11-28	Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach	Haruki Sakajo et.al.	2511.23311	null
2025-11-28	Fault-Tolerant MARL for CAVs under Observation Perturbations for Highway On-Ramp Merging	Yuchen Shi et.al.	2511.23193	null
2025-11-28	Seeing before Observable: Potential Risk Reasoning in Autonomous Driving via Vision Language Models	Jiaxin Liu et.al.	2511.22928	null
2025-11-28	DM $^3$ T: Harmonizing Modalities via Diffusion for Multi-Object Tracking	Weiran Li et.al.	2511.22896	null
2025-11-28	SUPER-AD: Semantic Uncertainty-aware Planning for End-to-End Robust Autonomous Driving	Wonjeong Ryu et.al.	2511.22865	null
2025-11-28	Safe Autonomous Lane Changing: Planning with Dynamic Risk Fields and Time-Varying Convex Space Generation	Zhen Tian et.al.	2511.22829	null
2025-11-27	CoT4AD: A Vision-Language-Action Model with Explicit Chain-of-Thought Reasoning for Autonomous Driving	Zhaohui Wang et.al.	2511.22532	null
2025-11-27	Motion-to-Motion Latency Measurement Framework for Connected and Autonomous Vehicle Teleoperation	François Provost et.al.	2511.22467	null
2025-11-27	RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding	Xiyan Liu et.al.	2511.22466	null
2025-11-27	DriveVGGT: Visual Geometry Transformer for Autonomous Driving	Xiaosong Jia et.al.	2511.22264	null
2025-12-03	HybridWorldSim: A Scalable and Controllable High-fidelity Simulator for Autonomous Driving	Qiang Li et.al.	2511.22187	null
2025-11-27	MTR-VP: Towards End-to-End Trajectory Planning through Context-Driven Image Encoding and Multiple Trajectory Prediction	Maitrayee Keskar et.al.	2511.22181	null
2025-11-27	SemOD: Semantic Enabled Object Detection Network under Various Weather Conditions	Aiyinsi Zuo et.al.	2511.22142	null
2025-11-26	OpenTwinMap: An Open-Source Digital Twin Generator for Urban Autonomous Driving	Alex Richardson et.al.	2511.21925	null
2025-11-26	Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving	Haohong Lin et.al.	2511.21584	null
2025-11-26	LaGen: Towards Autoregressive LiDAR Scene Generation	Sizhuo Zhou et.al.	2511.21256	null
2025-11-25	DeeAD: Dynamic Early Exit of Vision-Language Action for Efficient Autonomous Driving	Haibo HU et.al.	2511.20720	null
2025-11-26	Thinking in 360°: Humanoid Visual Search in the Wild	Heyang Yu et.al.	2511.20351	null
2025-11-25	AD-R1: Closed-Loop Reinforcement Learning for End-to-End Autonomous Driving with Impartial World Models	Tianyi Yan et.al.	2511.20325	null
2025-11-25	Map-World: Masked Action planning and Path-Integral World Model for Autonomous Driving	Bin Hu et.al.	2511.20156	null
2025-11-25	DeLightMono: Enhancing Self-Supervised Monocular Depth Estimation in Endoscopy by Decoupling Uneven Illumination	Mingyang Ou et.al.	2511.20058	null
2025-11-25	WaymoQA: A Multi-View Visual Question Answering Dataset for Safety-Critical Reasoning in Autonomous Driving	Seungjun Yu et.al.	2511.20022	null
2025-11-25	On-Demand Multi-Task Sparsity for Efficient Large-Model Deployment on Edge Devices	Lianming Huang et.al.	2511.19986	null
2025-11-25	CoC-VLA: Delving into Adversarial Domain Transfer for Explainable Autonomous Driving via Chain-of-Causality Visual-Language-Action Model	Dapeng Zhang et.al.	2511.19914	null
2025-11-25	Reasoning-VLA: A Fast and General Vision-Language-Action Reasoning Model for Autonomous Driving	Dapeng Zhang et.al.	2511.19912	null
2025-11-25	4DWorldBench: A Comprehensive Evaluation Framework for 3D/4D World Generation Models	Yiting Lu et.al.	2511.19836	null
2025-11-24	MapRF: Weakly Supervised Online HD Map Construction via NeRF-Guided Self-Training	Hongyu Lyu et.al.	2511.19527	null
2025-11-21	Personalized Reward Modeling for Text-to-Image Generation	Jeongeun Lee et.al.	2511.19458	null
2025-11-19	AVS: A Computational and Hierarchical Storage System for Autonomous Vehicles	Yuxin Wang et.al.	2511.19453	null
2025-11-24	IDSplat: Instance-Decomposed 3D Gaussian Splatting for Driving Scenes	Carl Lindström et.al.	2511.19235	null
2025-11-24	Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving	Jianhua Han et.al.	2511.19221	null
2025-11-24	MonoSR: Open-Vocabulary Spatial Reasoning from Monocular Images	Qirui Wang et.al.	2511.19119	null
2025-11-24	HABIT: Human Action Benchmark for Interactive Traffic in CARLA	Mohan Ramesh et.al.	2511.19109	null
2025-11-24	DEAP-3DSAM: Decoder Enhanced and Auto Prompt SAM for 3D Medical Image Segmentation	Fangda Chen et.al.	2511.19071	null
2025-11-24	End-to-end Autonomous Vehicle Following System using Monocular Fisheye Camera	Jiale Zhang et.al.	2511.19011	null
2025-11-24	SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation	Nimeshika Udayangani et.al.	2511.18816	null
2025-11-24	MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent	Yuxia Fu et.al.	2511.18810	null
2025-11-24	From Features to Reference Points: Lightweight and Adaptive Fusion for Cooperative Autonomous Driving	Yongqi Zhu et.al.	2511.18757	null
2025-11-24	Thinking Ahead: Foresight Intelligence in MLLMs and World Models	Zhantao Gong et.al.	2511.18735	null
2025-11-24	GuideFlow: Constraint-Guided Flow Matching for Planning in End-to-End Autonomous Driving	Lin Liu et.al.	2511.18729	null
2025-11-24	DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving	Hongbin Lin et.al.	2511.18713	null
2025-11-24	Data Augmentation Strategies for Robust Lane Marking Detection	Flora Lian et.al.	2511.18668	null
2025-11-22	Rectifying Soft-Label Entangled Bias in Long-Tailed Dataset Distillation	Chenyang Jiang et.al.	2511.17914	null
2025-11-22	QuickLAP: Quick Language-Action Preference Learning for Autonomous Driving Agents	Jordan Abi Nader et.al.	2511.17855	null
2025-11-21	JigsawComm: Joint Semantic Feature Encoding and Transmission for Communication-Efficient Cooperative Perception	Chenyi Wang et.al.	2511.17843	null
2025-11-21	SAFE-SMART: Safety Analysis and Formal Evaluation using STL Metrics for Autonomous RoboTs	Kristy Sakano et.al.	2511.17781	null
2025-11-18	Unified Low-Light Traffic Image Enhancement via Multi-Stage Illumination Recovery and Adaptive Noise Suppression	Siddiqua Namrah et.al.	2511.17612	null
2025-11-21	MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments	Zhiyu Huang et.al.	2511.17496	null
2025-11-21	RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation	Shihan Wu et.al.	2511.17441	null
2025-11-21	Feasibility of Embodied Dynamics Based Bayesian Learning for Continuous Pursuit Motion Control of Assistive Mobile Robots in the Built Environment	Xiaoshan Zhou et.al.	2511.17401	null
2025-11-24	Agility Meets Stability: Versatile Humanoid Control with Heterogeneous Data	Yixuan Pan et.al.	2511.17373	null
2025-11-25	SuperQuadricOcc: Multi-Layer Gaussian Approximation of Superquadrics for Real-Time Self-Supervised Occupancy Estimation	Seamie Hayes et.al.	2511.17361	null
2025-11-21	FORWARD: Dataset of a forwarder operating in rough terrain	Mikael Lundbäck et.al.	2511.17318	null
2025-11-21	Range-Edit: Semantic Mask Guided Outdoor LiDAR Scene Editing	Suchetan G. Uppur et.al.	2511.17269	null
2025-11-21	QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy	Adam Lilja et.al.	2511.17221	null
2025-11-21	Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition	Aditya Mishra et.al.	2511.17183	null
2025-11-21	DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous Driving	Liuhan Yin et.al.	2511.17150	null
2025-11-21	Sparse Reasoning is Enough: Biological-Inspired Framework for Video Anomaly Detection with Large Pre-trained Models	He Huang et.al.	2511.17094	null
2025-11-21	VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions	Qianyi Shao et.al.	2511.16998	null
2025-11-21	MobileOcc: A Human-Aware Semantic Occupancy Dataset for Mobile Robots	Junseo Kim et.al.	2511.16949	null
2025-11-20	NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses	Jing Wen et.al.	2511.16673	null
2025-11-20	MiMo-Embodied: X-Embodied Foundation Model Technical Report	Xiaoshuai Hao et.al.	2511.16518	null
2025-11-20	Flow-Aided Flight Through Dynamic Clutters From Point To Motion	Bowen Xu et.al.	2511.16372	null
2025-11-20	LiSTAR: Ray-Centric World Models for 4D LiDAR Sequences in Autonomous Driving	Pei Liu et.al.	2511.16049	null
2025-11-19	Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition	Xufei Wang et.al.	2511.15597	null
2025-11-22	CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking	Sifan Zhou et.al.	2511.15580	null
2025-11-19	Scriboora: Rethinking Human Pose Forecasting	Daniel Bermuth et.al.	2511.15565	null
2025-11-20	UltraDP: Generalizable Carotid Ultrasound Scanning with Force-Aware Diffusion Policy	Ruoqu Chen et.al.	2511.15550	null
2025-11-19	Driving in Spikes: An Entropy-Guided Object Detector for Spike Cameras	Ziyan Liu et.al.	2511.15459	null
2025-11-19	WarNav: An Autonomous Driving Benchmark for Segmentation of Navigable Zones in War Scenes	Marc-Emmanuel Coupvent des Graviers et.al.	2511.15429	null
2025-11-19	ShelfOcc: Native 3D Supervision beyond LiDAR for Vision-Based Occupancy Estimation	Simon Boeder et.al.	2511.15396	null
2025-11-19	Learning Depth from Past Selves: Self-Evolution Contrast for Robust Depth Estimation	Jing Cao et.al.	2511.15167	null
2025-11-19	SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection	Chun-Jung Lin et.al.	2511.15153	null
2025-11-18	Attacking Autonomous Driving Agents with Adversarial Machine Learning: A Holistic Evaluation with the CARLA Leaderboard	Henry Wong et.al.	2511.14876	null
2025-11-19	Is Your VLM for Autonomous Driving Safety-Ready? A Comprehensive Benchmark for Evaluating External and In-Cabin Risks	Xianhui Meng et.al.	2511.14592	null
2025-11-18	Enhancing End-to-End Autonomous Driving with Risk Semantic Distillaion from VLM	Jack Qin et.al.	2511.14499	null
2025-11-18	CompEvent: Complex-valued Event-RGB Fusion for Low-light Video Enhancement and Deblurring	Mingchen Zhong et.al.	2511.14469	null
2025-11-18	Enhancing LLM-based Autonomous Driving with Modular Traffic Light and Sign Recognition	Fabian Schmidt et.al.	2511.14391	null
2025-11-26	Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving	Kangqiao Zhao et.al.	2511.14386	null
2025-11-18	Blur-Robust Detection via Feature Restoration: An End-to-End Framework for Prior-Guided Infrared UAV Target Detection	Xiaolin Wang et.al.	2511.14371	null
2025-11-18	Multi-Timescale Model Predictive Control for Slow-Fast Systems	Lukas Schroth et.al.	2511.14311	null
2025-11-19	PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation	Xiangyu Li et.al.	2511.14185	null
2025-11-18	RTS-Mono: A Real-Time Self-Supervised Monocular Depth Estimation Method for Real-World Deployment	Zeyu Cheng et.al.	2511.14107	null
2025-11-17	VLMs Guided Interpretable Decision Making for Autonomous Driving	Xin Hu et.al.	2511.13881	null
2025-11-12	nuCarla: A nuScenes-Style Bird’s-Eye View Perception Dataset for CARLA Simulation	Zhijie Qiao et.al.	2511.13744	null
2025-11-17	DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous Driving	Kaiwen Cai et.al.	2511.13309	null
2025-11-17	DAP: A Discrete-token Autoregressive Planner for Autonomous Driving	Bowen Ye et.al.	2511.13306	null
2025-11-17	CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving	Enhui Ma et.al.	2511.13297	null
2025-11-17	GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models	Yushuo Zheng et.al.	2511.13259	null
2025-11-17	Difficulty-Aware Label-Guided Denoising for Monocular 3D Object Detection	Soyul Lee et.al.	2511.13195	null
2025-11-17	WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object Detection	Longhui Zheng et.al.	2511.13138	null
2025-11-17	Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining	Zhaocheng Yu et.al.	2511.13113	null
2025-11-18	Decoupling Scene Perception and Ego Status: A Multi-Context Fusion Approach for Enhanced Generalization in End-to-End Autonomous Driving	Jiacheng Tang et.al.	2511.13079	null
2025-11-18	Towards 3D Object-Centric Feature Learning for Semantic Scene Completion	Weihua Wang et.al.	2511.13031	null
2025-11-17	T2I-Based Physical-World Appearance Attack against Traffic Sign Recognition Systems in Autonomous Driving	Chen Ma et.al.	2511.12956	null
2025-11-17	GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving	Chunyong Hu et.al.	2511.12941	null
2025-11-28	Text2Traffic: A Text-to-Image Generation and Editing Method for Traffic Scenes	Feng Lv et.al.	2511.12932	null
2025-11-16	Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL	Aleesha Khurram et.al.	2511.12755	null
2025-11-16	Are LLMs The Way Forward? A Case Study on LLM-Guided Reinforcement Learning for Decentralized Autonomous Driving	Timur Anvar et.al.	2511.12751	null
2025-11-16	Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning	Ankita Raj et.al.	2511.12735	null
2025-11-16	FSDAM: Few-Shot Driving Attention Modeling via Vision-Language Coupling	Kaiser Hamid et.al.	2511.12708	null
2025-11-18	Fine-Grained Representation for Lane Topology Reasoning	Guoqing Xu et.al.	2511.12590	null
2025-11-16	VLA-R: Vision-Language Action Retrieval toward Open-World End-to-End Autonomous Driving	Hyunki Seong et.al.	2511.12405	null
2025-11-15	One target to align them all: LiDAR, RGB and event cameras extrinsic calibration for Autonomous Driving	Andrea Bertogalli et.al.	2511.12291	null
2025-11-15	RadarMP: Motion Perception for 4D mmWave Radar in Autonomous Driving	Ruiqi Cheng et.al.	2511.12117	null
2025-11-15	SRSplat: Feed-Forward Super-Resolution Gaussian Splatting from Sparse Multi-View Images	Xinyuan Hu et.al.	2511.12040	null
2025-11-14	LAVQA: A Latency-Aware Visual Question Answering Framework for Shared Autonomy in Self-Driving Vehicles	Shuangyu Xie et.al.	2511.11840	null
2025-11-13	ExpertAD: Enhancing Autonomous Driving Systems with Mixture of Experts	Haowen Jiang et.al.	2511.11740	null
2025-11-11	Learning with Preserving for Continual Multitask Learning	Hanchen David Wang et.al.	2511.11676	null
2025-11-14	A Comparative Evaluation of Prominent Methods in Autonomous Vehicle Certification	Mustafa Erdem Kırmızıgül et.al.	2511.11484	null
2025-11-14	Simulating an Autonomous System in CARLA using ROS 2	Joseph Abdo et.al.	2511.11310	null
2025-11-22	GraphPilot: Grounded Scene Graph Conditioning for Language-Based Autonomous Driving	Fabian Schmidt et.al.	2511.11266	null
2025-11-18	STONE: Pioneering the One-to-N Backdoor Threat in 3D Point Cloud	Dongmei Shan et.al.	2511.11210	null
2025-11-14	CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios	Hangyu Li et.al.	2511.11168	null
2025-11-24	Autonomous Vehicle Path Planning by Searching With Differentiable Simulation	Asen Nachkov et.al.	2511.11043	null
2025-11-14	Miniature Testbed for Validating Multi-Agent Cooperative Autonomous Driving	Hyunchul Bae et.al.	2511.11022	null
2025-11-18	CARScenes: Semantic VLM Dataset for Safe Autonomous Driving	Yuankai He et.al.	2511.10701	null
2025-11-13	Safe Planning in Interactive Environments via Iterative Policy Updates and Adversarially Robust Conformal Prediction	Omid Mirzaeedodangeh et.al.	2511.10586	null
2025-11-13	LongComp: Long-Tail Compositional Zero-Shot Generalization for Robust Trajectory Prediction	Benjamin Stoler et.al.	2511.10411	null
2025-11-13	nuPlan-R: A Closed-Loop Planning Benchmark for Autonomous Driving via Reactive Multi-Agent Simulation	Mingxing Peng et.al.	2511.10403	null
2025-11-13	DGFusion: Dual-guided Fusion for Robust Multi-Modal 3D Object Detection	Feiyang Jia et.al.	2511.10035	null
2025-11-13	Robust Object Detection with Pseudo Labels from VLMs using Per-Object Co-teaching	Uday Bhaskar et.al.	2511.09955	null
2025-11-12	Baby Sophia: A Developmental Approach to Self-Exploration through Self-Touch and Hand Regard	Stelios Zarifis et.al.	2511.09727	null
2025-11-14	FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection	Jiangyong Yu et.al.	2511.09347	null
2025-11-12	D-AWSIM: Distributed Autonomous Driving Simulator for Dynamic Map Generation Framework	Shunsuke Ito et.al.	2511.09080	null
2025-11-12	Argus: Resilience-Oriented Safety Assurance Framework for End-to-End ADSs	Dingji Wang et.al.	2511.09032	null
2025-11-12	UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving	Ziyi Song et.al.	2511.09013	null
2025-11-12	Expand Your SCOPE: Semantic Cognition over Potential-Based Exploration for Embodied Visual Navigation	Ningnan Wang et.al.	2511.08935	null
2025-11-11	Information-Driven Fault Detection and Identification for Multi-Agent Spacecraft Systems: Collaborative On-Orbit Inspection Mission	Akshita Gupta et.al.	2511.08752	null
2025-11-10	Predict and Resist: Long-Term Accident Anticipation under Sensor Noise	Xingcheng Liu et.al.	2511.08640	null
2025-11-11	Simulating the Visual World with Artificial Intelligence: A Roadmap	Jingtong Yue et.al.	2511.08585	null
2025-11-11	UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist	Zhengyang Liang et.al.	2511.08521	null
2025-11-11	Prioritizing Perception-Guided Self-Supervision: A New Paradigm for Causal Modeling in End-to-End Autonomous Driving	Yi Huang et.al.	2511.08214	null
2025-11-14	Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous Driving	Jian Wang et.al.	2511.08015	null
2025-11-11	Effective Game-Theoretic Motion Planning via Nested Search	Avishav Engle et.al.	2511.08001	null
2025-11-13	HD $^2$ -SSC: High-Dimension High-Density Semantic Scene Completion for Autonomous Driving	Zhiwen Yang et.al.	2511.07925	null
2025-11-11	MonoCLUE : Object-Aware Clustering Enhances Monocular 3D Object Detection	Sunghun Yang et.al.	2511.07862	null
2025-11-10	PlanT 2.0: Exposing Biases and Structural Flaws in Closed-Loop Driving	Simon Gerstenecker et.al.	2511.07292	null
2025-11-13	MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs	Tianhao Peng et.al.	2511.07250	null
2025-11-10	Leveraging Text-Driven Semantic Variation for Robust OOD Segmentation	Seungheon Song et.al.	2511.07238	null
2025-11-10	Dynamics-Decoupled Trajectory Alignment for Sim-to-Real Transfer in Reinforcement Learning for Autonomous Driving	Thomas Steinecker et.al.	2511.07155	null
2025-11-10	HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving	Zhongyu Xia et.al.	2511.07106	null
2025-11-10	Certified L2-Norm Robustness of 3D Point Cloud Recognition in the Frequency Domain	Liang Zhou et.al.	2511.07029	null
2025-11-10	ConeGS: Error-Guided Densification Using Pixel Cones for Improved Reconstruction with Fewer Primitives	Bartłomiej Baranowski et.al.	2511.06810	null
2025-11-11	Relative Energy Learning for LiDAR Out-of-Distribution Detection	Zizhao Li et.al.	2511.06720	null
2025-11-10	DIAL-GS: Dynamic Instance Aware Reconstruction for Label-free Street Scenes with 4D Gaussian Splatting	Chenpeng Su et.al.	2511.06632	null
2025-11-09	A Low-Rank Method for Vision Language Model Hallucination Mitigation in Autonomous Driving	Keke Long et.al.	2511.06496	null
2025-11-09	VDNeRF: Vision-only Dynamic Neural Radiance Field for Urban Scenes	Zhengyu Zou et.al.	2511.06408	null
2025-11-09	From Demonstrations to Safe Deployment: Path-Consistent Safety Filtering for Diffusion Policies	Ralf Römer et.al.	2511.06385	null
2025-11-09	LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation	Zijie Wang et.al.	2511.06272	null
2025-11-09	VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving	Ruifei Zhang et.al.	2511.06256	null
2025-11-09	AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving	Ruifei Zhang et.al.	2511.06253	null
2025-11-08	Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration	Umar Rashid et.al.	2511.06087	null
2025-11-08	Runtime Safety Monitoring of Deep Neural Networks for Perception: A Survey	Albert Schotschneider et.al.	2511.05982	null
2025-11-08	Polymap: generating high definition map based on rasterized polygons	Shiyu Gao et.al.	2511.05944	null
2025-11-03	Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation	Jiayuan Wang et.al.	2511.05557	null
2025-11-11	Token Is All You Need: Cognitive Planning through Belief-Intent Co-Evolution	Shiyao Sang et.al.	2511.05540	null
2025-11-07	SnowyLane: Robust Lane Detection on Snow-covered Rural Roads Using Infrastructural Elements	Jörg Gamerdinger et.al.	2511.05108	null
2025-11-06	ReGen: Generative Robot Simulation via Inverse Design	Phat Nguyen et.al.	2511.04769	null
2025-11-06	X-Diffusion: Training Diffusion Policies on Cross-Embodiment Human Demonstrations	Maximus A. Pace et.al.	2511.04671	null
2025-11-06	Cambrian-S: Towards Spatial Supersensing in Video	Shusheng Yang et.al.	2511.04670	null
2025-11-06	SAFe-Copilot: Unified Shared Autonomy Framework	Phat Nguyen et.al.	2511.04664	null
2025-11-06	UniSplat: Unified Spatio-Temporal Fusion via 3D Latent Scaffolds for Dynamic Driving Scene Reconstruction	Chen Shi et.al.	2511.04595	null
2025-11-06	ForeRobo: Unlocking Infinite Simulation Data for 3D Goal-driven Robotic Manipulation	Dexin wang et.al.	2511.04381	null
2025-11-04	Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data	Syed Mostaquim Ali et.al.	2511.02994	null
2025-11-04	EvtSlowTV – A Large and Diverse Dataset for Event-Based Depth Estimation	Sadiq Layi Macaulay et.al.	2511.02953	null
2025-11-09	Toward an Agricultural Operational Design Domain: A Framework	Mirco Felske et.al.	2511.02937	null
2025-11-04	Keeping it Local, Tiny and Real: Automated Report Generation on Edge Computing Devices for Mechatronic-Based Cognitive Systems	Nicolas Schuler et.al.	2511.02507	null
2025-11-04	Self-Supervised Moving Object Segmentation of Sparse and Noisy Radar Point Clouds	Leon Schwarzer et.al.	2511.02395	null
2025-11-04	3D Point Cloud Object Detection on Edge Devices for Split Computing	Taisuke Noguchi et.al.	2511.02293	null
2025-11-03	UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs	Zhe Liu et.al.	2511.01768	null
2025-11-03	Driving scenario generation and evaluation using a structured layer representation and foundational models	Arthur Hubert et.al.	2511.01541	null
2025-11-03	Embodied Cognition Augmented End2End Autonomous Driving	Ling Niu et.al.	2511.01334	null
2025-11-03	Saliency-Guided Domain Adaptation for Left-Hand Driving in Autonomous Steering	Zahra Mehraban et.al.	2511.01223	null
2025-11-02	Layer-Wise Modality Decomposition for Interpretable Multimodal Sensor Fusion	Jaehyun Park et.al.	2511.00859	null
2025-11-04	Towards classification-based representation learning for place recognition on LiDAR scans	Maksim Konoplia et.al.	2511.00738	null
2025-11-01	Been There, Scanned That: Nostalgia-Driven LiDAR Compression for Self-Driving Cars	Ali Khalid et.al.	2511.00652	null
2025-10-30	Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail	NVIDIA et.al.	2511.00088	null
2025-10-26	Gen AI in Automotive: Applications, Challenges, and Opportunities with a Case study on In-Vehicle Experience	Chaitanya Shinde et.al.	2511.00026	null
2025-10-31	Modified-Emergency Index (MEI): A Criticality Metric for Autonomous Driving in Lateral Conflict	Hao Cheng et.al.	2510.27333	null
2025-10-30	AD-SAM: Fine-Tuning the Segment Anything Vision Foundation Model for Autonomous Driving Perception	Mario Camarena et.al.	2510.27047	null
2025-10-30	All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles	Sayed Pedram Haeri Boroujeni et.al.	2510.26641	null
2025-10-30	Counteracting Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing	Xin Guo et.al.	2510.26474	null
2025-10-30	Beyond Imitation: Constraint-Aware Trajectory Generation with Flow Matching For End-to-End Autonomous Driving	Lin Liu et.al.	2510.26292	null
2025-10-30	Self-localization on a 3D map by fusing global and local features from a monocular camera	Satoshi Kikuch et.al.	2510.26170	null
2025-11-13	WOD-E2E: Waymo Open Dataset for End-to-End Driving in Challenging Long-tail Scenarios	Runsheng Xu et.al.	2510.26125	null
2025-10-30	Accelerating Real-World Overtaking in F1TENTH Racing Employing Reinforcement Learning Methods	Emily Steiner et.al.	2510.26040	null
2025-10-29	Integrating Legal and Logical Specifications in Perception, Prediction, and Planning for Automated Driving: A Survey of Methods	Kumar Manas et.al.	2510.25386	null
2025-10-31	MMEdge: Accelerating On-device Multimodal Inference via Pipelined Sensing and Encoding	Runxi Huang et.al.	2510.25327	null
2025-11-02	D $^2$ GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction	Kejing Xia et.al.	2510.25173	null
2025-10-28	SCOUT: A Lightweight Framework for Scenario Coverage Assessment in Autonomous Driving	Anil Yildiz et.al.	2510.24949	null
2025-10-14	DrivingScene: A Multi-Task Online Feed-Forward 3D Gaussian Splatting Method for Dynamic Driving Scenes	Qirui Hou et.al.	2510.24734	null
2025-10-28	Enhancing Vision-Language Models for Autonomous Driving through Task-Specific Prompting and Spatial Reasoning	Aodi Wu et.al.	2510.24152	null
2025-10-28	ZTRS: Zero-Imitation End-to-end Autonomous Driving with Trajectory Scoring	Zhenxin Li et.al.	2510.24108	null
2025-10-28	SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration	Jongsuk Kim et.al.	2510.24052	null
2025-10-27	Modeling and Scheduling of Fusion Patterns in Autonomous Driving Systems (Extended Version)	Hoora Sobhani et.al.	2510.23895	null
2025-10-27	VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting	Hoonhee Cho et.al.	2510.23205	null
2025-10-27	Planning Oriented Integrated Sensing and Communication	Xibin Jin et.al.	2510.23021	null
2025-10-27	Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method	Bohan Li et.al.	2510.22973	null
2025-10-26	Uncertainty-Aware Autonomous Vehicles: Predicting the Road Ahead	Shireen Kudukkil Manchingal et.al.	2510.22680	null
2025-10-26	DAMap: Distance-aware MapNet for High Quality HD Map Construction	Jinpeng Dong et.al.	2510.22675	null
2025-10-25	3D Roadway Scene Object Detection with LIDARs in Snowfall Conditions	Ghazal Farhani et.al.	2510.22436	null
2025-10-25	BLIP-FusePPO: A Vision-Language Deep Reinforcement Learning Framework for Lane Keeping in Autonomous Vehicles	Seyed Ahmad Hosseini Miangoleh et.al.	2510.22370	null
2025-10-25	Real-Time Semantic Segmentation on FPGA for Autonomous Vehicles Using LMIINet with the CGRA4ML Framework	Amir Mohammad Khadem Hosseini et.al.	2510.22243	null
2025-10-25	CGoT: A Novel Inference Mechanism for Embodied Multi-Agent Systems Using Composable Graphs of Thoughts	Yixiao Nie et.al.	2510.22235	null
2025-10-25	HARMONY: Hidden Activation Representations and Model Output-Aware Uncertainty Estimation for Vision-Language Models	Erum Mushtaq et.al.	2510.22171	null
2025-10-23	Addressing Corner Cases in Autonomous Driving: A World Model-based Approach with Mixture of Experts and LLMs	Haicheng Liao et.al.	2510.21867	null
2025-10-24	Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent	Christy Li et.al.	2510.21704	null
2025-10-24	Learning Neural Control Barrier Functions from Expert Demonstrations using Inverse Constraint Learning	Yuxuan Yang et.al.	2510.21560	null
2025-10-24	Track-to-Track Association for Collective Perception based on Stochastic Optimization	Laura M. Wolf et.al.	2510.21278	null
2025-10-24	Towards Physics-informed Spatial Intelligence with Human Priors: An Autonomous Driving Pilot Study	Guanlin Wu et.al.	2510.21160	null
2025-10-24	Urban 3D Change Detection Using LiDAR Sensor for HD Map Maintenance and Smart Mobility	Hezam Albagami et.al.	2510.21112	null
2025-10-23	From Cheap to Pro: A Learning-based Adaptive Camera Parameter Network for Professional-Style Imaging	Fuchen Li et.al.	2510.20550	null
2025-10-23	Behavior-Aware Online Prediction of Obstacle Occupancy using Zonotopes	Alvaro Carrizosa-Rendon et.al.	2510.20437	null
2025-10-23	Dino-Diffusion Modular Designs Bridge the Cross-Domain Gap in Autonomous Parking	Zixuan Wu et.al.	2510.20335	null
2025-10-23	Seeing the Unseen: Mask-Driven Positional Encoding and Strip-Convolution Context Modeling for Cross-View Object Geo-Localization	Shuhan Hu et.al.	2510.20247	null
2025-10-23	Monocular Visual 8D Pose Estimation for Articulated Bicycles and Cyclists	Eduardo R. Corral-Soto et.al.	2510.20158	null
2025-10-22	From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction	Zhida Zhao et.al.	2510.19654	null
2025-10-22	VGD: Visual Geometry Gaussian Splatting for Feed-Forward Surround-view Driving Reconstruction	Junhong Lin et.al.	2510.19578	null
2025-10-22	SFGFusion: Surface Fitting Guided 3D Object Detection with 4D Radar and Camera Fusion	Xiaozhi Li et.al.	2510.19215	null
2025-10-24	Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks	Kai Zeng et.al.	2510.19195	null
2025-10-21	Robust Driving QA through Metadata-Grounded Context and Task-Specific Prompts	Seungjun Yu et.al.	2510.19001	null
2025-10-23	Occluded nuScenes: A Multi-Sensor Dataset for Evaluating Perception Robustness in Automated Driving	Sanjay Kumar et.al.	2510.18552	null
2025-10-21	MMRHP: A Miniature Mixed-Reality HIL Platform for Auditable Closed-Loop Evaluation	Mingxin Li et.al.	2510.18371	null
2025-10-21	ViSE: A Systematic Approach to Vision-Only Street-View Extrapolation	Kaiyuan Tan et.al.	2510.18341	null
2025-10-24	OmniNWM: Omniscient Driving Navigation World Models	Bohan Li et.al.	2510.18313	null
2025-10-21	OpenInsGaussian: Open-vocabulary Instance Gaussian Segmentation with Context-aware Cross-view Fusion	Tianyu Huang et.al.	2510.18253	null
2025-10-21	BlendCLIP: Bridging Synthetic and Real Domains for Zero-Shot 3D Object Classification with Multimodal Pretraining	Ajinkya Khoche et.al.	2510.18244	null
2025-10-20	SPACeR: Self-Play Anchoring with Centralized Reference Models	Wei-Jer Chang et.al.	2510.18060	null
2025-10-20	SAVANT: Semantic Analysis with Vision-Augmented Anomaly deTection	Roberto Brusnicki et.al.	2510.18034	null
2025-10-20	4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads	Ling Liu et.al.	2510.17664	null
2025-10-20	Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models	Katie Luo et.al.	2510.17274	null
2025-10-28	SimpleVSF: VLM-Scoring Fusion for Trajectory Prediction of End-to-End Autonomous Driving	Peiru Zheng et.al.	2510.17191	null
2025-11-04	DiffVLA++: Bridging Cognitive Reasoning and End-to-End Driving through Metric-Guided Alignment	Yu Gao et.al.	2510.17148	null
2025-10-20	ProDAT: Progressive Density-Aware Tail-Drop for Point Cloud Coding	Zhe Luo et.al.	2510.17068	null
2025-10-19	Unsupervised Monocular Road Segmentation for Autonomous Driving via Scene Geometry	Sara Hatami Rostami et.al.	2510.16790	null
2025-10-19	A Comprehensive Survey on World Models for Embodied AI	Xinqing Li et.al.	2510.16732	null
2025-10-29	Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models	Jianbiao Mei et.al.	2510.16729	null
2025-10-18	Advancing Off-Road Autonomous Driving: The Large-Scale ORAD-3D Dataset and Comprehensive Benchmarks	Chen Min et.al.	2510.16500	null
2025-10-18	Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance	Chien Thai et.al.	2510.16445	null
2025-10-18	Demeter: A Parametric Model of Crop Plant Morphology from the Real World	Tianhang Cheng et.al.	2510.16377	null
2025-10-17	ObjectTransforms for Uncertainty Quantification and Reduction in Vision-Based Perception for Autonomous Vehicles	Nishad Sahu et.al.	2510.16118	null
2025-10-17	LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal	Shr-Ruei Tsai et.al.	2510.15868	null
2025-10-17	Perfect Prediction or Plenty of Proposals? What Matters Most in Planning for Autonomous Driving	Aron Distelzweig et.al.	2510.15505	null
2025-10-17	VDRive: Leveraging Reinforced VLA and Diffusion Policy for End-to-end Autonomous Driving	Ziang Guo et.al.	2510.15446	null
2025-10-17	FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers	Haisheng Su et.al.	2510.15385	null
2025-10-15	XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation	Huawei Sun et.al.	2510.13565	null
2025-10-17	CoDS: Enhancing Collaborative Perception in Heterogeneous Scenarios via Domain Separation	Yushan Han et.al.	2510.13432	null
2025-10-15	DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning	Tianyuan Yuan et.al.	2510.13375	null
2025-10-16	CymbaDiff: Structured Spatial Diffusion for Sketch-based 3D Semantic Urban Scene Generation	Li Liang et.al.	2510.13245	null
2025-10-15	Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion	Rongtao Xu et.al.	2510.13198	null
2025-10-15	Safe Driving in Occluded Environments	Zhuoyuan Wang et.al.	2510.13114	null
2025-10-15	DriveCritic: Towards Context-Aware, Human-Aligned Evaluation for Autonomous Driving with Vision-Language Models	Jingyu Song et.al.	2510.13108	null
2025-10-16	SimULi: Real-Time LiDAR and Camera Simulation with Unscented Transforms	Haithem Turki et.al.	2510.12901	null
2025-10-14	DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving	Yingyan Li et.al.	2510.12796	null
2025-10-14	CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving	Xiaoji Zheng et.al.	2510.12560	null
2025-10-14	CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion	Jinzhou Lin et.al.	2510.12362	null
2025-10-14	PAGS: Priority-Adaptive Gaussian Splatting for Dynamic Driving Scenes	Ying A et.al.	2510.12282	null
2025-10-14	AngularFuse: A Closer Look at Angle-based Perception for Spatial-Sensitive Multi-Modality Image Fusion	Xiaopeng Liu et.al.	2510.12260	null
2025-10-14	Hierarchical Reasoning with Vision-Language Models for Incident Reports from Dashcam Videos	Shingo Yokoi et.al.	2510.12190	null
2025-10-13	Context-Aware Model-Based Reinforcement Learning for Autonomous Racing	Emran Yasser Moustafa et.al.	2510.11501	null
2025-10-15	A Faster and More Reliable Middleware for Autonomous Driving Systems	Yuankai He et.al.	2510.11448	null
2025-10-13	Future-Aware End-to-End Driving: Bidirectional Modeling of Trajectory Planning and Scene Evolution	Bozhou Zhang et.al.	2510.11092	null
2025-10-13	Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling	Tianyi Tan et.al.	2510.11083	null
2025-10-13	Game-Theoretic Risk-Shaped Reinforcement Learning for Safe Autonomous Driving	Dong Hu et.al.	2510.10960	null
2025-10-13	rareboost3d: a synthetic lidar dataset with enhanced rare classes	Shutong Lin et.al.	2510.10876	null
2025-10-12	Stability Under Scrutiny: Benchmarking Representation Paradigms for Online HD Mapping	Hao Shan et.al.	2510.10660	null
2025-10-12	A Machine Learning Perspective on Automated Driving Corner Cases	Sebastian Schmidt et.al.	2510.10653	null
2025-10-12	Reinforcement Learning-based Dynamic Adaptation for Sampling-Based Motion Planning in Agile Autonomous Driving	Alexander Langmann et.al.	2510.10567	null
2025-10-12	Align2Act: Instruction-Tuned Models for Human-Aligned Autonomous Driving	Kanishkha Jaisankar et.al.	2510.10503	null
2025-10-11	Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking	Markus Käppeler et.al.	2510.10287	null
2025-10-23	A Style-Based Profiling Framework for Quantifying the Synthetic-to-Real Gap in Autonomous Driving Datasets	Dingyi Yao et.al.	2510.10203	null
2025-10-11	Beyond ADE and FDE: A Comprehensive Evaluation Framework for Safety-Critical Prediction in Multi-Agent Autonomous Driving Scenarios	Feifei Liu et.al.	2510.10086	null
2025-10-11	Probabilistic Hyper-Graphs using Multiple Randomly Masked Autoencoders for Semi-supervised Multi-modal Multi-task Learning	Pîrvu Mihai-Cristian et.al.	2510.10068	null
2025-10-11	Explainable Human-in-the-Loop Segmentation via Critic Feedback Signals	Pouya Shaeri et.al.	2510.09945	null
2025-10-10	SpaceVista: All-Scale Visual Spatial Reasoning from mm to km	Peiwen Sun et.al.	2510.09606	null
2025-10-10	Autonomous Soft Robotic Guidewire Navigation via Imitation Learning	Noah Barnes et.al.	2510.09497	null
2025-10-10	Clear Roads, Clear Vision: Advancements in Multi-Weather Restoration for Smart Transportation	Vijay M. Galshetwar et.al.	2510.09228	null
2025-10-10	Towards Safer and Understandable Driver Intention Prediction	Mukilan Karuppasamy et.al.	2510.09200	null
2025-10-10	TARO: Toward Semantically Rich Open-World Object Detection	Yuchen Zhang et.al.	2510.09173	null
2025-10-10	Exploring Single Domain Generalization of LiDAR-based Semantic Segmentation under Imperfect Labels	Weitong Kong et.al.	2510.09035	null
2025-10-09	ReSplat: Learning Recurrent Gaussian Splats	Haofei Xu et.al.	2510.08575	null
2025-10-09	Scalable Offline Metrics for Autonomous Driving	Animikh Aich et.al.	2510.08571	null
2025-10-09	ResAD: Normalized Residual Trajectory Modeling for End-to-End Autonomous Driving	Zhiyu Zheng et.al.	2510.08562	null
2025-10-09	RayFusion: Ray Fusion Enhanced Collaborative Visual Perception	Shaohong Wang et.al.	2510.08017	null
2025-10-16	CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving	Tianrui Zhang et.al.	2510.07944	null
2025-10-09	MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding	Peiran Wu et.al.	2510.07915	null
2025-10-10	GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models	Qinghongbing Xie et.al.	2510.07791	null
2025-10-08	VeMo: A Lightweight Data-Driven Approach to Model Vehicle Dynamics	Girolamo Oddo et.al.	2510.07447	null
2025-10-08	HyPlan: Hybrid Learning-Assisted Planning Under Uncertainty for Safe Autonomous Driving	Donald Pfaffmann et.al.	2510.07210	null
2025-10-08	A Digital Twin Framework for Metamorphic Testing of Autonomous Driving Systems Using Generative Model	Tony Zhang et.al.	2510.07133	null
2025-10-08	Learning Global Representation from Queries for Vectorized HD Map Construction	Shoumeng Qiu et.al.	2510.06969	null
2025-10-08	OBJVanish: Physically Realizable Text-to-3D Adv. Generation of LiDAR-Invisible Objects	Bing Li et.al.	2510.06952	null
2025-10-08	DecompGAIL: Learning Realistic Traffic Behaviors with Decomposed Multi-Agent Generative Adversarial Imitation Learning	Ke Guo et.al.	2510.06913	null
2025-10-08	Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion	Jie Luo et.al.	2510.06687	null
2025-10-08	AIM 2025 Challenge on Real-World RAW Image Denoising	Feiran Li et.al.	2510.06601	null
2025-10-07	Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models	Jiahao Wang et.al.	2510.06209	null
2025-10-07	The Safety Challenge of World Models for Embodied AI Agents: A Review	Lorenzo Baraldi et.al.	2510.05865	null
2025-10-10	ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving	Yongxuan Lyu et.al.	2510.05752	null
2025-10-07	Precise and Efficient Collision Prediction under Uncertainty in Autonomous Driving	Marc Kaufeld et.al.	2510.05729	null
2025-10-07	HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video	Hongchi Xia et.al.	2510.05560	null
2025-10-06	Comparative Analysis of YOLOv5, Faster R-CNN, SSD, and RetinaNet for Motorbike Detection in Kigali Autonomous Driving Context	Ngeyen Yinkfu et.al.	2510.04912	null
2025-10-08	Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabulary Occupancy Prediction	Chi Yan et.al.	2510.04759	null
2025-10-05	Diffusion^2: Dual Diffusion Model with Uncertainty-Aware Adaptive Noise for Momentary Trajectory Prediction	Yuhao Luo et.al.	2510.04365	null
2025-10-04	From Filters to VLMs: Benchmarking Defogging Methods through Object Detection and Segmentation Performance	Ardalan Aryashad et.al.	2510.03906	null
2025-10-04	Referring Expression Comprehension for Small Objects	Kanoko Goto et.al.	2510.03701	null
2025-10-04	Safety-Oriented Dynamic Path Planning for Automated Vehicles	Mostafa Emam et.al.	2510.03640	null
2025-10-03	Agile Tradespace Exploration for Space Rendezvous Mission Design via Transformers	Yuji Takubo et.al.	2510.03544	null
2025-10-03	Training-Free Out-Of-Distribution Segmentation With Foundation Models	Laith Nayal et.al.	2510.02909	null
2025-10-03	GS-Share: Enabling High-fidelity Map Sharing with Incremental Gaussian Splatting	Xinran Zhang et.al.	2510.02884	null
2025-10-03	Action Deviation-Aware Inference for Low-Latency Wireless Robots	Jeyoung Park et.al.	2510.02851	null
2025-10-03	Work Zones challenge VLM Trajectory Planning: Toward Mitigation and Robust Autonomous Driving	Yifan Liao et.al.	2510.02803	null
2025-10-03	A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios	Ruining Yang et.al.	2510.02627	null
2025-10-02	Calibrating the Full Predictive Class Distribution of 3D Object Detectors for Autonomous Driving	Cornelius Schröder et.al.	2510.01829	null
2025-10-10	Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving	Haibo Hu et.al.	2510.01795	null
2025-10-15	Predictive Preference Learning from Human Interventions	Haoyuan Cai et.al.	2510.01545	null
2025-10-01	Strategic Fusion of Vision Language Models: Shapley-Credited Context-Aware Dawid-Skene for Multi-Label Tasks in Autonomous Driving	Yuxiang Feng et.al.	2510.01126	null
2025-10-03	Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving	Sheng Yang et.al.	2510.00060	null
2025-10-16	TTT3R: 3D Reconstruction as Test-Time Training	Xingyu Chen et.al.	2509.26645	null
2025-09-30	PRISM: Progressive Rain removal with Integrated State-space Modeling	Pengze Xue et.al.	2509.26413	null
2025-09-30	Beyond Pixels: Efficient Dataset Distillation via Sparse Gaussian Representation	Chenyang Jiang et.al.	2509.26219	null
2025-09-30	Beyond Overall Accuracy: Pose- and Occlusion-driven Fairness Analysis in Pedestrian Detection for Autonomous Driving	Mohammad Khoshkdahan et.al.	2509.26166	null
2025-09-30	Preemptive Spatiotemporal Trajectory Adjustment for Heterogeneous Vehicles in Highway Merging Zones	Yuan Li et.al.	2509.25929	null
2025-09-30	MuSLR: Multimodal Symbolic Logical Reasoning	Jundong Xu et.al.	2509.25851	null
2025-09-29	Online Mapping for Autonomous Driving: Addressing Sensor Generalization and Dynamic Map Updates in Campus Environments	Zihan Zhang et.al.	2509.25542	null
2025-09-27	BEV-VLM: Trajectory Planning via Unified BEV Abstraction	Guancheng Chen et.al.	2509.25249	null
2025-09-29	StreamForest: Efficient Online Video Understanding with Persistent Event Memory	Xiangyu Zeng et.al.	2509.24871	null
2025-09-29	TACO-Net: Topological Signatures Triumph in 3D Object Classification	Anirban Ghosh et.al.	2509.24802	null
2025-09-29	Learning to Sample: Reinforcement Learning-Guided Sampling for Autonomous Vehicle Motion Planning	Korbinian Moller et.al.	2509.24313	null
2025-09-29	Skeleton-based Robust Registration Framework for Corrupted 3D Point Clouds	Yongqiang Wang et.al.	2509.24273	null
2025-09-28	Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning	Muleilan Pei et.al.	2509.23993	null
2025-10-05	AutoPrune: Each Complexity Deserves a Pruning Policy	Hanshi Wang et.al.	2509.23931	null
2025-09-30	DriveE2E: Closed-Loop Benchmark for End-to-End Autonomous Driving through Real-to-Simulation	Haibao Yu et.al.	2509.23922	null
2025-09-28	Preserving Cross-Modal Stability for Visual Unlearning in Multimodal Scenarios	Jinghan Xu Yuyang Zhang Qixuan Cai Jiancheng Chen Keqiu Li et.al.	2509.23895	null
2025-09-28	From Static to Dynamic: a Survey of Topology-Aware Perception in Autonomous Driving	Yixiao Chen et.al.	2509.23641	null
2025-09-28	BridgeDrive: Diffusion Bridge Policy for Closed-Loop Trajectory Planning in Autonomous Driving	Shu Liu et.al.	2509.23589	null
2025-09-28	OVSeg3R: Learn Open-vocabulary Instance Segmentation from 2D via 3D Reconstruction	Hongyang Li et.al.	2509.23541	null
2025-10-16	WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving	Ziyue Zhu et.al.	2509.23402	null
2025-09-27	Preventing Robotic Jailbreaking via Multimodal Domain Adaptation	Francesco Marchiori et.al.	2509.23281	null
2025-09-26	Persistent Autoregressive Mapping with Traffic Rules for Autonomous Driving	Shiyi Liang et.al.	2509.22756	null
2025-09-26	Self-driving cars: Are we there yet?	Merve Atasever et.al.	2509.22754	null
2025-10-07	Robust Object Detection for Autonomous Driving via Curriculum-Guided Group Relative Policy Optimization	Xu Jia et.al.	2509.22688	null
2025-10-17	An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment	Xiaoyun Qiu et.al.	2509.22550	null
2025-09-26	EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model	Andrii Litvynchuk et.al.	2509.22527	null
2025-09-29	A Multi-Modality Evaluation of the Reality Gap in Autonomous Driving Systems	Stefano Carlo Lambertenghi et.al.	2509.22379	null
2025-09-26	UniMapGen: A Generative Framework for Large-Scale Map Construction from Multi-modal Data	Yujian Yuan et.al.	2509.22262	null
2025-09-26	An Adaptive ICP LiDAR Odometry Based on Reliable Initial Pose	Qifeng Wang et.al.	2509.22058	null
2025-09-25	PL-VIWO2: A Lightweight, Fast and Robust Visual-Inertial-Wheel Odometry Using Points and Lines	Zhixin Zhang et.al.	2509.21563	null
2025-09-25	Human-like Navigation in a World Built for Humans	Bhargav Chandaka et.al.	2509.21189	null
2025-09-25	Autoregressive End-to-End Planning with Time-Invariant Spatial Alignment and Multi-Objective Policy Refinement	Jianbo Zhao et.al.	2509.20938	null
2025-09-25	MTRDrive: Memory-Tool Synergistic Reasoning for Robust Autonomous Driving in Corner Cases	Ziang Luo et.al.	2509.20843	null
2025-09-25	DAC-LoRA: Dynamic Adversarial Curriculum for Efficient and Robust Few-Shot Adaptation	Ved Umrajkar et.al.	2509.20792	null
2025-09-29	MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM	Yuxuan Zhou et.al.	2509.20757	null
2025-10-04	Cyber Racing Coach: A Haptic Shared Control Framework for Teaching Advanced Driving Skills	Congkai Shen et.al.	2509.20653	null
2025-09-26	AnchDrive: Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving	Jinhao Chai et.al.	2509.20253	null
2025-09-24	Universal Camouflage Attack on Vision-Language Models for Autonomous Driving	Dehong Kong et.al.	2509.20196	null
2025-09-25	Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving	Pengxiang Li et.al.	2509.20109	null
2025-09-25	Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models	Juana Valeria Hurtado et.al.	2509.20107	null
2025-09-25	OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving	Pei Liu et.al.	2509.19973	null
2025-09-24	BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting	Yixun Zhang et.al.	2509.19793	null
2025-09-24	RDAR: Reward-Driven Agent Relevance Estimation for Autonomous Driving	Carlo Bosio et.al.	2509.19789	null
2025-09-24	EfficienT-HDR: An Efficient Transformer-Based Framework via Multi-Exposure Fusion for HDR Reconstruction	Yu-Shen Huang et.al.	2509.19779	null
2025-10-10	The Impact of 2D Segmentation Backbones on Point Cloud Predictions Using 4D Radar	William Muckelroy III et.al.	2509.19644	null
2025-09-20	Vision-Based Perception for Autonomous Vehicles in Off-Road Environment Using Deep Learning	Nelson Alves Ferreira Neto et.al.	2509.19378	null
2025-09-23	Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation	Sherwin Bahmani et.al.	2509.19296	null
2025-09-23	TriFusion-AE: Language-Guided Depth and LiDAR Fusion for Robust Point Cloud Processing	Susmit Neogi et.al.	2509.18743	null
2025-09-23	The Case for Negative Data: From Crash Reports to Counterfactuals for Reasonable Driving	Jay Patrikar et.al.	2509.18626	null
2025-09-23	MLF-4DRCNet: Multi-Level Fusion with 4D Radar and Camera for 3D Object Detection in Autonomous Driving	Yuzhi Wu et.al.	2509.18613	null
2025-09-23	PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving	Chengran Yuan et.al.	2509.18609	null
2025-09-23	Spatial Envelope MPC: High Performance Driving without a Reference	Siyuan Yu et.al.	2509.18506	null
2025-09-22	AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback	Yunhao Yang et.al.	2509.18384	null
2025-09-19	MMCD: Multi-Modal Collaborative Decision-Making for Connected Autonomy with Knowledge Distillation	Rui Liu et.al.	2509.18198	null
2025-09-25	V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts	Hsu-kuang Chiu et.al.	2509.18053	null
2025-09-22	Towards Seeing Bones at Radio Frequency	Yiwen Song et.al.	2509.17979	null
2025-09-22	DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving	Shuyao Shang et.al.	2509.17940	null
2025-09-22	SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model	Xiao Zhou et.al.	2509.17850	null
2025-09-22	Predicting Depth Maps from Single RGB Images and Addressing Missing Information in Depth Estimation	Mohamad Mofeed Chaar et.al.	2509.17686	null
2025-09-22	Tensor-Based Self-Calibration of Cameras via the TrifocalCalib Method	Gregory Schroeder et.al.	2509.17620	null
2025-09-22	Vision-Based Driver Drowsiness Monitoring: Comparative Analysis of YOLOv5-v11 Models	Dilshara Herath et.al.	2509.17498	null
2025-09-22	FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR	Junzhe Wu et.al.	2509.17390	null
2025-09-21	CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving	Ruiguo Zhong et.al.	2509.17080	null
2025-09-21	Orchestrate, Generate, Reflect: A VLM-Based Multi-Agent Collaboration Framework for Automated Driving Policy Learning	Zengqi Peng et.al.	2509.17042	null
2025-09-21	SLAM-Former: Putting SLAM into One Transformer	Yijun Yuan et.al.	2509.16909	null
2025-09-21	End2Race: Efficient End-to-End Imitation Learning for Real-Time F1Tenth Racing	Zhijie Qiao et.al.	2509.16894	null
2025-09-20	Improve bounding box in Carla Simulator	Mohamad Mofeed Chaar et.al.	2509.16773	null
2025-09-28	Are VLMs Ready for Lane Topology Awareness in Autonomous Driving?	Xin Chen et.al.	2509.16654	null
2025-09-20	ADVEDM:Fine-grained Adversarial Attack against VLM-based Embodied Agents	Yichen Wang et.al.	2509.16645	null
2025-09-20	SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving	Haiming Zhang et.al.	2509.16588	null
2025-09-20	ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting	Xiaoyang Yan et.al.	2509.16552	null
2025-09-20	RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation	Tianyi Yan et.al.	2509.16500	null
2025-09-19	Neural Atlas Graphs for Dynamic Scene Decomposition and Editing	Jan Philipp Schneider et.al.	2509.16336	null
2025-09-18	RaFD: Flow-Guided Radar Detection for Robust Autonomous Driving	Shuocheng Yang et.al.	2509.16261	null
2025-09-19	RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars	Weiyi Xiong et.al.	2509.16119	null
2025-09-19	SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features	Jinyuan Qu et.al.	2509.16098	null
2025-09-19	CoPAD : Multi-source Trajectory Fusion and Cooperative Trajectory Prediction with Anchor-oriented Decoder in V2X Scenarios	Kangyu Wu et.al.	2509.15984	null
2025-09-19	CoReVLA: A Dual-Stage End-to-End Autonomous Driving Framework for Long-Tail Scenarios via Collect-and-Refine	Shiyu Fang et.al.	2509.15968	null
2025-09-19	RangeSAM: Leveraging Visual Foundation Models for Range-View repesented LiDAR segmentation	Paul Julius Kühn et.al.	2509.15886	null
2025-09-19	CBPNet: A Continual Backpropagation Prompt Network for Alleviating Plasticity Loss on Edge Devices	Runjie Shao et.al.	2509.15785	null
2025-09-19	Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution	Chang Soo Lim et.al.	2509.15781	null
2025-09-22	Online Slip Detection and Friction Coefficient Estimation for Autonomous Racing	Christopher Oeltjen et.al.	2509.15423	null
2025-09-18	Out-of-Sight Trajectories: Tracking, Fusion, and Prediction	Haichao Zhang et.al.	2509.15219	null
2025-09-18	Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression	Xuan Deng et.al.	2509.14591	null
2025-09-18	DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising	Li Gao et.al.	2509.14565	null
2025-09-17	FlowDrive: Energy Flow Field for End-to-End Autonomous Driving	Hao Jiang et.al.	2509.14303	null
2025-10-03	MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping	Zhihao Cao et.al.	2509.14191	null
2025-09-17	BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection	Rongyu Zhang et.al.	2509.14151	null
2025-09-17	SEG-Parking: Towards Safe, Efficient, and Generalizable Autonomous Parking via End-to-End Offline Reinforcement Learning	Zewei Yang et.al.	2509.13956	null
2025-09-17	MAP: End-to-End Autonomous Driving with Map-Assisted Planning	Huilin Yin et.al.	2509.13926	null
2025-09-17	Data-Efficient Spectral Classification of Hyperspectral Data Using MiniROCKET and HDC-MiniROCKET	Nick Theisen et.al.	2509.13809	null
2025-09-17	AdaThinkDrive: Adaptive Thinking via Reinforcement Learning for Autonomous Driving	Yuechen Luo et.al.	2509.13769	null
2025-09-17	UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry	Tae-Wook Um et.al.	2509.13713	null
2025-09-17	FishBEV: Distortion-Resilient Bird’s Eye View Segmentation with Surround-View Fisheye Cameras	Hang Li et.al.	2509.13681	null
2025-09-28	TreeIRL: Safe Urban Driving with Tree Search and Inverse Reinforcement Learning	Momchil S. Tomov et.al.	2509.13579	null
2025-09-16	Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving	Artem Savkin et.al.	2509.13507	null
2025-09-16	Road Obstacle Video Segmentation	Shyam Nandan Rai et.al.	2509.13181	null
2025-09-17	TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving	Jiawei Wang et.al.	2509.13164	null
2025-09-16	An Uncertainty-Weighted Decision Transformer for Navigation in Dense, Complex Driving Scenarios	Zhihao Zhang et.al.	2509.13132	null
2025-09-17	Weakly and Self-Supervised Class-Agnostic Motion Prediction for Autonomous Driving	Ruibo Li et.al.	2509.13116	null
2025-09-16	4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar	Xiao Tang et.al.	2509.12931	null
2025-09-16	StereoCarla: A High-Fidelity Driving Dataset for Generalizable Stereo	Xianda Guo et.al.	2509.12683	null
2025-09-16	Maps for Autonomous Driving: Full-process Survey and Frontiers	Pengxin Chen et.al.	2509.12632	null
2025-09-16	DisorientLiDAR: Physical Attacks on LiDAR-based Localization	Yizhen Lao et.al.	2509.12595	null
2025-08-26	UrgenGo: Urgency-Aware Transparent GPU Kernel Launching for Autonomous Driving	Hanqi Zhu et.al.	2509.12207	null
2025-09-16	Embodied Navigation Foundation Model	Jiazhao Zhang et.al.	2509.12129	null
2025-09-15	Probabilistic Robustness Analysis in High Dimensional Space: Application to Semantic Segmentation Network	Navid Hashemi et.al.	2509.11838	null
2025-09-14	SAMP: Spatial Anchor-based Motion Policy for Collision-Aware Robotic Manipulators	Kai Chen et.al.	2509.11185	null
2025-09-14	SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion	Zhiwen Yang et.al.	2509.11171	null
2025-09-13	Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios	Simone Mosco et.al.	2509.10841	null
2025-09-11	Large Foundation Models for Trajectory Prediction in Autonomous Driving: A Comprehensive Survey	Wei Dai et.al.	2509.10570	null
2025-09-17	DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training	Jianxin Shi et.al.	2509.10426	null
2025-09-12	Multimodal SAM-adapter for Semantic Segmentation	Iacopo Curti et.al.	2509.10408	null
2025-09-12	CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion	Santiago Montiel-Marín et.al.	2509.10139	null
2025-09-12	BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird’s-Eye View with Deformable Attention and Sparse Goal Proposals	Minsang Kong et.al.	2509.10080	null
2025-09-11	MGTraj: Multi-Granularity Goal-Guided Human Trajectory Prediction with Recursive Refinement Network	Ge Sun et.al.	2509.09200	null
2025-09-23	LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations	Payal Varshney et.al.	2509.08422	null
2025-09-10	Sparse BEV Fusion with Self-View Consistency for Multi-View Detection and Tracking	Keisuke Toida et.al.	2509.08421	null
2025-09-10	InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection	Zhongyu Xia et.al.	2509.08374	null
2025-09-10	Foundation Models for Autonomous Driving Perception: A Survey Through Core Capabilities	Rajendramayavan Sathyam et.al.	2509.08302	null
2025-09-10	A Comprehensive Review of Reinforcement Learning for Autonomous Driving in the CARLA Simulator	Elahe Delavari et.al.	2509.08221	null
2025-09-09	Mean Field Game-Based Interactive Trajectory Planning Using Physics-Inspired Unified Potential Fields	Zhen Tian et.al.	2509.08147	null
2025-09-09	TA-VLA: Elucidating the Design Space of Torque-aware Vision-Language-Action Models	Zongzheng Zhang et.al.	2509.07962	null
2025-09-09	Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation	Yusuke Hirota et.al.	2509.07596	null
2025-09-09	Bias-Aware Machine Unlearning: Towards Fairer Vision Models via Controllable Forgetting	Sai Siddhartha Chary Aylapuram et.al.	2509.07456	null
2025-09-09	Attention and Risk-Aware Decision Framework for Safe Autonomous Driving	Zhen Tian et.al.	2509.07412	null
2025-09-08	SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis	Zhengqing Chen et.al.	2509.06798	null
2025-09-08	Adaptive Evolution Factor Risk Ellipse Framework for Reliable and Safe Autonomous Driving	Fujiang Yuan et.al.	2509.06375	null
2025-09-06	Scenario-based Decision-making Using Game Theory for Interactive Autonomous Driving: A Survey	Zhihao Lin et.al.	2509.05777	null
2025-09-06	Evaluating YOLO Architectures: Implications for Real-Time Vehicle Detection in Urban Environments of Bangladesh	Ha Meem Hossain et.al.	2509.05652	null
2025-09-06	OccVLA: Vision-Language-Action Model with Implicit 3D Occupancy Supervision	Ruixun Liu et.al.	2509.05578	null
2025-09-03	Unsupervised Instance Segmentation with Superpixels	Cuong Manh Hoang et.al.	2509.05352	null
2025-09-08	LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation	Yinglin Duan et.al.	2509.05263	null
2025-09-05	Enhancing 3D Point Cloud Classification with ModelNet-R and Point-SkipNet	Mohammad Saeid et.al.	2509.05198	null
2025-09-05	A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing	Chengkai Xu et.al.	2509.04853	null
2025-09-05	Enhancing Self-Driving Segmentation in Adverse Weather Conditions: A Dual Uncertainty-Aware Training Approach to SAM Optimization	Dharsan Ravindran et.al.	2509.04735	null
2025-09-04	Bootstrapping Reinforcement Learning with Sub-optimal Policies for Autonomous Driving	Zhihao Zhang et.al.	2509.04712	null
2025-09-04	Domain Adaptation for Different Sensor Configurations in 3D Object Detection	Satoshi Tanaka et.al.	2509.04711	null
2025-09-04	In-Context Policy Adaptation via Cross-Domain Skill Diffusion	Minjong Yoo et.al.	2509.04535	null
2025-09-09	One Flight Over the Gap: A Survey from Perspective to Panoramic Vision	Xin Lin et.al.	2509.04444	null
2025-09-04	TriLiteNet: Lightweight Model for Multi-Task Visual Perception	Quang-Huy Che et.al.	2509.04092	null
2025-09-04	SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation	Han Huang et.al.	2509.03999	null
2025-09-03	sam-llm: interpretable lane change trajectoryprediction via parametric finetuning	Zhuo Cao et.al.	2509.03462	null
2025-09-03	KEPT: Knowledge-Enhanced Prediction of Trajectories from Consecutive Driving Frames with Vision-Language Models	Yujin Wang et.al.	2509.02966	null
2025-09-02	2nd Place Solution for CVPR2024 E2E Challenge: End-to-End Autonomous Driving Using Vision Language Model	Zilong Guo et.al.	2509.02659	null
2025-09-02	Omnidirectional Spatial Modeling from Correlated Panoramas	Xinshen Zhang et.al.	2509.02164	null
2025-09-02	Generalizing Unsupervised Lidar Odometry Model from Normal to Snowy Weather Conditions	Beibei Zhou et.al.	2509.02011	null
2025-09-02	Explaining What Machines See: XAI Strategies in Deep Object Detection Models	FatemehSadat Seyedmomeni et.al.	2509.01991	null
2025-09-02	AutoDrive-R $^2$ : Incentivizing Reasoning and Self-Reflection Capacity for VLA Model in Autonomous Driving	Zhenlong Yuan et.al.	2509.01944	null
2025-09-01	PointSlice: Accurate and Efficient Slice-Based Representation for 3D Object Detection from Point Clouds	Liu Qifeng et.al.	2509.01487	null
2025-09-01	Guided Model-based LiDAR Super-Resolution for Resource-Efficient Automotive scene Segmentation	Alexandros Gkillas et.al.	2509.01317	null
2025-09-01	Toward a Holistic Multi-Criteria Trajectory Evaluation Framework for Autonomous Driving in Mixed Traffic Environment	Nouhed Naidja et.al.	2509.01291	null
2025-09-04	Enhanced Mean Field Game for Interactive Decision-Making with Varied Stylish Multi-Vehicles	Liancheng Zheng et.al.	2509.00981	null
2025-08-31	OmniReason: A Temporal-Guided Vision-Language-Action Framework for Autonomous Driving	Pei Liu et.al.	2509.00789	null
2025-08-30	Vehicle-in-Virtual-Environment (VVE) Method for Developing and Evaluating VRU Safety of Connected and Autonomous Driving with Focus on Bicyclist Safety	Haochong Chen et.al.	2509.00624	null
2025-08-30	Safe and Efficient Lane-Changing for Autonomous Vehicles: An Improved Double Quintic Polynomial Approach with Time-to-Collision Evaluation	Rui Bai et.al.	2509.00582	null
2025-08-30	Galaxea Open-World Dataset and G0 Dual-System VLA Model	Tao Jiang et.al.	2509.00576	null
2025-08-30	FLUID: A Fine-Grained Lightweight Urban Signalized-Intersection Dataset of Dense Conflict Trajectories	Yiyang Chen et.al.	2509.00497	null
2025-08-30	Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation	Jialiang Kang et.al.	2509.00379	null
2025-08-29	3D-LATTE: Latent Space 3D Editing from Textual Instructions	Maria Parelli et.al.	2509.00269	null
2025-08-29	DriveQA: Passing the Driving Knowledge Test	Maolin Wei et.al.	2508.21824	null
2025-08-29	Mini Autonomous Car Driving based on 3D Convolutional Neural Networks	Pablo Moraes et.al.	2508.21271	null
2025-09-01	2COOOL: 2nd Workshop on the Challenge Of Out-Of-Label Hazards in Autonomous Driving	Ali K. AlShami et.al.	2508.21080	null
2025-08-28	DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes	Yajiao Xiong et.al.	2508.20965	null
2025-08-28	Surfel-based 3D Registration with Equivariant SE(3) Features	Xueyang Kang et.al.	2508.20789	null
2025-08-29	SKGE-SWIN: End-To-End Autonomous Vehicle Waypoint Prediction and Navigation Using Skip Stage Swin Transformer	Fachri Najm Noer Kartiman et.al.	2508.20762	null
2025-08-28	UTA-Sign: Unsupervised Thermal Video Augmentation via Event-Assisted Traffic Signage Sketching	Yuqi Han et.al.	2508.20594	null
2025-08-28	Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts	Zixuan Hu et.al.	2508.20488	null
2025-08-28	Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation	Jiusi Li et.al.	2508.20471	null
2025-08-27	Streamlining the Development of Active Learning Methods in Real-World Object Detection	Moussa Kassem Sbeyti et.al.	2508.19906	null
2025-08-27	Hyperspectral Sensors and Autonomous Driving: Technologies, Limitations, and Opportunities	Imad Ali Shah et.al.	2508.19905	null
2025-08-27	Generalizing Monocular 3D Object Detection	Abhinav Kumar et.al.	2508.19593	null
2025-08-25	Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation	Alexandros Gkillas et.al.	2508.19290	null
2025-10-22	Interpretable Decision-Making for End-to-End Autonomous Driving	Mona Mirzaie et.al.	2508.18898	null
2025-08-26	EMind: A Foundation Model for Multi-task Electromagnetic Signals Understanding	Luqing Luo et.al.	2508.18785	null
2025-08-20	GM-Skip: Metric-Guided Transformer Block Skipping for Efficient Vision-Language Models	Lianming Huang et.al.	2508.18227	null
2025-09-02	EventTracer: Fast Path Tracing-based Event Stream Rendering	Zhenyang Li et.al.	2508.18071	null
2025-09-02	Integration of Computer Vision with Adaptive Control for Autonomous Driving Using ADORE	Abu Shad Ahammed et.al.	2508.17985	null
2025-08-25	Enhanced Drift-Aware Computer Vision Architecture for Autonomous Driving	Md Shahi Amran Hossain et.al.	2508.17975	null
2025-08-25	Adaptive Output Steps: FlexiSteps Network for Dynamic Trajectory Prediction	Yunxiang Liu et.al.	2508.17797	null
2025-08-23	A Rapid Iterative Trajectory Planning Method for Automated Parking through Differential Flatness	Zhouheng Li et.al.	2508.17038	null
2025-08-23	A Survey of Deep Learning-based Point Cloud Denoising	Jinxi Wang et.al.	2508.17011	null
2025-08-23	Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model	Fan Ding et.al.	2508.16947	null
2025-08-22	Closer to Reality: Practical Semi-Supervised Federated Learning for Foundation Model Adaptation	Guangyu Sun et.al.	2508.16568	null
2025-08-22	Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation	Chun-Peng Chang et.al.	2508.16512	null
2025-08-22	SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather	Edoardo Palladin et.al.	2508.16408	null
2025-08-22	MapKD: Unlocking Prior Knowledge with Cross-Modal Distillation for Efficient Online HD Map Construction	Ziyang Yan et.al.	2508.15653	null
2025-08-23	ExtraGS: Geometric-Aware Trajectory Extrapolation with Uncertainty-Guided Generative Priors	Kaiyuan Tan et.al.	2508.15529	null
2025-08-21	RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features	Olga Matykina et.al.	2508.15353	null
2025-08-21	RATopo: Improving Lane Topology Reasoning via Redundancy Assignment	Han Li et.al.	2508.15272	null
2025-08-21	Adversarial Agent Behavior Learning in Autonomous Driving Using Deep Reinforcement Learning	Arjun Srinivasan et.al.	2508.15207	null
2025-08-25	MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion	Xuyang Chen et.al.	2508.15169	null
2025-08-28	Learning to Drive Ethically: Embedding Moral Reasoning into Autonomous Driving	Dianzhao Li et.al.	2508.14926	null
2025-08-20	Multiscale Video Transformers for Class Agnostic Segmentation in Autonomous Driving	Leila Cheshmi et.al.	2508.14729	null
2025-08-20	MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation	Guile Wu et.al.	2508.14327	null
2025-09-16	ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving	Xianda Guo et.al.	2508.13977	null
2025-08-19	Unleashing Semantic and Geometric Priors for 3D Scene Completion	Shiyuan Chen et.al.	2508.13601	null
2025-08-25	Bridging Clear and Adverse Driving Conditions	Yoel Shapiro et.al.	2508.13592	null
2025-08-19	Generative Model-Based Feature Attention Module for Video Action Analysis	Guiqin Wang et.al.	2508.13565	null
2025-08-19	CORENet: Cross-Modal 4D Radar Denoising Network with LiDAR Supervision for Autonomous Driving	Fuyang Liu et.al.	2508.13485	null
2025-08-19	Structured Prompting and Multi-Agent Knowledge Distillation for Traffic Video Interpretation and Risk Inference	Yunxiang Yang et.al.	2508.13439	null
2025-08-18	Incremental Generalized Hybrid A*	Sidharth Talia et.al.	2508.13392	null
2025-08-18	Prune2Drive: A Plug-and-Play Framework for Accelerating Vision-Language Models in Autonomous Driving	Minhao Xiong et.al.	2508.13305	null
2025-08-18	SpotVLM: Cloud-edge Collaborative Real-time VLM based on Context Transfer	Chen Qian et.al.	2508.12638	null
2025-08-18	ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving	Can Cui et.al.	2508.12603	null
2025-08-17	An Initial Study of Bird’s-Eye View Generation for Autonomous Vehicles using Cross-View Transformers	Felipe Carlos dos Santos et.al.	2508.12520	null
2025-08-17	LMAD: Integrated End-to-End Vision-Language Model for Explainable Autonomous Driving	Nan Song et.al.	2508.12404	null
2025-08-17	DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection	Yuval Haitman et.al.	2508.12330	null
2025-08-17	TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform	Jun Liu et.al.	2508.12279	null
2025-08-16	InstDrive: Instance-Aware 3D Gaussian Splatting for Driving Scenes	Hongyuan Liu et.al.	2508.12015	null
2025-08-16	Saliency-Based Attention Shifting: A Framework for Improving Driver Situational Awareness of Out-of-Label Hazards	Yousra Shleibik et.al.	2508.11887	null
2025-08-16	Data Shift of Object Detection in Autonomous Driving	Lida Xu et.al.	2508.11868	null
2025-08-15	Relative Position Matters: Trajectory Prediction and Planning with Polar Representation	Bozhou Zhang et.al.	2508.11492	null
2025-08-15	Perception in Plan: Coupled Perception and Planning for End-to-End Autonomous Driving	Bozhou Zhang et.al.	2508.11488	null
2025-08-15	EvoPSF: Online Evolution of Autonomous Driving Models via Planning-State Feedback	Jiayue Jin et.al.	2508.11453	null
2025-08-15	ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving	Jingyu Li et.al.	2508.11428	null
2025-08-15	Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking	Haonan Zhang et.al.	2508.11323	null
2025-08-15	A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving	Jialin Li et.al.	2508.11218	null
2025-08-14	CSNR and JMIM Based Spectral Band Selection for Reducing Metamerism in Urban Driving	Jiarong Li et.al.	2508.10962	null
2025-08-18	HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffision Model	Qi Liu et.al.	2508.10935	null
2025-08-14	Towards Powerful and Practical Patch Attacks for 2D Object Detection in Autonomous Driving	Yuxin Cao et.al.	2508.10600	null
2025-08-14	SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving	Philipp Wolters et.al.	2508.10567	null
2025-08-14	Reasoning in Computer Vision: Taxonomy, Models, Tasks, and Methodologies	Ayushman Sarkar et.al.	2508.10523	null
2025-08-14	STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes	Keishi Ishihara et.al.	2508.10427	null
2025-08-14	From Pixel to Mask: A Survey of Out-of-Distribution Segmentation	Wenjie Zhao et.al.	2508.10309	null
2025-08-13	BridgeTA: Bridging the Representation Gap in Knowledge Distillation via Teacher Assistant for Bird’s Eye View Map Segmentation	Beomjun Kim et.al.	2508.09599	null
2025-08-13	Offline Auto Labeling: BAAS	Stefan Haag et.al.	2508.09585	null
2025-08-13	Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving	Guangxun Zhu et.al.	2508.09404	null
2025-08-12	VLM-3D:End-to-End Vision-Language Models for Open-World 3D Perception	Fuhao Chang et.al.	2508.09061	null
2025-08-12	A Pseudo Global Fusion Paradigm-Based Cross-View Network for LiDAR-Based Place Recognition	Jintao Cheng et.al.	2508.08917	null
2025-08-21	ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction	Chaojun Ni et.al.	2508.08170	null
2025-08-18	TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation	Huawei Sun et.al.	2508.08038	null
2025-08-11	CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving	Qi Xiang et.al.	2508.07838	null
2025-08-11	Risk Map As Middleware: Towards Interpretable Cooperative End-to-end Autonomous Driving for Risk-Aware Planning	Mingyue Lei et.al.	2508.07686	null
2025-08-11	Progressive Bird’s Eye View Perception for Safety-Critical Autonomous Driving: A Comprehensive Survey	Yan Gong et.al.	2508.07560	null
2025-08-12	Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring	Ludan Zhang et.al.	2508.07552	null
2025-08-10	Noise-Aware Generative Microscopic Traffic Simulation	Vindula Jayawardana et.al.	2508.07453	null
2025-08-09	An Evolutionary Game-Theoretic Merging Decision-Making Considering Social Acceptance for Autonomous Driving	Haolin Liu et.al.	2508.07080	null
2025-08-27	From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving	Antonio Guillen-Perez et.al.	2508.07029	null
2025-08-09	WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering	Yixin Zhu et.al.	2508.06982	null
2025-08-08	Robust-Sub-Gaussian Model Predictive Control for Safe Ultrasound-Image-Guided Robotic Spinal Surgery	Yunke Ao et.al.	2508.06744	null
2025-08-15	IRL-VLA: Training an Vision-Language-Action Policy via Reward World Model	Anqing Jiang et.al.	2508.06571	null
2025-08-20	MetAdv: A Unified and Interactive Adversarial Testing Platform for Autonomous Driving	Aishan Liu et.al.	2508.06534	null
2025-08-02	RMT-PPAD: Real-time Multi-task Learning for Panoptic Perception in Autonomous Driving	Jiayuan Wang et.al.	2508.06529	null
2025-08-12	GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving	Jian Wang et.al.	2508.06113	null
2025-08-08	ME $^3$ -BEV: Mamba-Enhanced Deep Reinforcement Learning for End-to-End Autonomous Driving with BEV-Perception	Siyi Lu et.al.	2508.06074	null
2025-08-07	VISTA: Vision-Language Imitation of Situational Thinking and Attention for Human-Like Driver Focus in Dynamic Environments	Kaiser Hamid et.al.	2508.05852	null
2025-08-07	SMOL-MapSeg: Show Me One Label	Yunshuang Yuan et.al.	2508.05501	null
2025-08-07	Physical Adversarial Camouflage through Gradient Calibration and Regularization	Jiawei Liang et.al.	2508.05414	null
2025-08-07	DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model	Rui Yu et.al.	2508.05402	null
2025-08-07	ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models	Yatong Lan et.al.	2508.05236	null
2025-08-07	PhysPatch: A Physically Realizable and Transferable Adversarial Patch Attack for Multimodal Large Language Models-based Autonomous Driving Systems	Qi Guo et.al.	2508.05167	null
2025-08-07	AdvDINO: Domain-Adversarial Self-Supervised Representation Learning for Spatial Proteomics	Stella Su et.al.	2508.04955	null
2025-08-06	Occupancy Learning with Spatiotemporal Memory	Ziyang Leng et.al.	2508.04705	null
2025-08-06	BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning	Ziyang Leng et.al.	2508.04702	null
2025-08-06	RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case	Baihui Xiao et.al.	2508.04642	null
2025-08-06	Segment Any Vehicle: Semantic and Visual Context Driven SAM and A Benchmark	Xiao Wang et.al.	2508.04260	null
2025-08-06	DRIVE: Dynamic Rule Inference and Verified Evaluation for Constraint-Aware Autonomous Driving	Longling Geng et.al.	2508.04066	null
2025-08-05	LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences	Ao Liang et.al.	2508.03692	null
2025-08-05	La La LiDAR: Large-Scale Layout Generation from LiDAR Data	Youquan Liu et.al.	2508.03691	null
2025-08-05	Veila: Panoramic LiDAR Generation from a Monocular RGB Image	Youquan Liu et.al.	2508.03690	null
2025-08-13	MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention	Qi Xie et.al.	2508.03034	null
2025-08-04	Context-aware Risk Assessment and Its Application in Autonomous Driving	Boyang Tian et.al.	2508.02919	null
2025-08-04	MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model	Tianheng Zhu et.al.	2508.02858	null
2025-08-04	mmWave Radar-Based Non-Line-of-Sight Pedestrian Localization at T-Junctions Utilizing Road Layout Extraction via Camera	Byeonggyu Park et.al.	2508.02348	null
2025-08-04	Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images	Philipp Wulff et.al.	2508.02323	null
2025-08-04	Test-Time Model Adaptation for Quantized Neural Networks	Zeshuai Deng et.al.	2508.02180	null
2025-08-04	Beyond RGB and Events: Enhancing Object Detection under Adverse Lighting with Monocular Normal Maps	Mingjie Liu et.al.	2508.02127	null
2025-08-04	Mapillary Vistas Validation for Fine-Grained Traffic Signs: A Benchmark Revealing Vision-Language Model Limitations	Sparsh Garg et.al.	2508.02047	null
2025-08-20	Bench2ADVLM: A Closed-Loop Benchmark for Vision-language Models in Autonomous Driving	Tianyuan Zhang et.al.	2508.02028	null
2025-08-03	Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving	Hunter Schofield et.al.	2508.01922	null
2025-08-03	StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding	Haolin Yang et.al.	2508.01875	null
2025-08-03	DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion	Zhigang Sun et.al.	2508.01778	null
2025-08-03	LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving	Luqi Cheng et.al.	2508.01704	null
2025-08-03	Adverse Weather-Independent Framework Towards Autonomous Driving Perception through Temporal Correlation and Unfolded Regularization	Wei-Bin Kou et.al.	2508.01583	null
2025-08-02	A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding	Zhan Shi et.al.	2508.01197	null
2025-08-01	CP-FREEZER: Latency Attacks against Vehicular Cooperative Perception	Chenyi Wang et.al.	2508.01062	null
2025-08-12	Edge-Based Multimodal Sensor Data Fusion with Vision Language Models (VLMs) for Real-time Autonomous Vehicle Accident Avoidance	Fengze Yang et.al.	2508.01057	null
2025-07-31	Service Discovery-Based Hybrid Network Middleware for Efficient Communication in Distributed Robotic Systems	Shiyao Sang et.al.	2508.00947	null
2025-08-01	Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR	Adwait Chandorkar et.al.	2508.00744	null
2025-08-12	Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving	Stefan Englmeier et.al.	2508.00589	null
2025-08-01	Uncertainty-Aware Likelihood Ratio Estimation for Pixel-Wise Out-of-Distribution Detection	Marc Hölle et.al.	2508.00587	null
2025-08-01	Pro2Guard: Proactive Runtime Enforcement of LLM Agent Safety via Probabilistic Model Checking	Haoyu Wang et.al.	2508.00500	null
2025-08-01	Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence	Danzhen Fu et.al.	2508.00299	null
2025-07-21	AoI-Aware Resource Allocation with Deep Reinforcement Learning for HAPS-V2X Networks	Ahmet Melih Ince et.al.	2508.00011	null
2025-07-31	I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation	Jialei Chen et.al.	2507.23683	null
2025-07-31	DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation	Yuchen Zhou et.al.	2507.23599	null
2025-08-09	MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction	Zijian Dong et.al.	2507.23597	null
2025-07-31	A Unified Perception-Language-Action Framework for Adaptive Autonomous Driving	Yi Zhang et.al.	2507.23540	null
2025-07-31	MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting	Xingyue Peng et.al.	2507.23340	null
2025-07-31	Contrastive Learning-Driven Traffic Sign Perception: Multi-Modal Fusion of Text and Vision	Qiang Lu et.al.	2507.23331	null
2025-07-31	FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models	Yiming Yang et.al.	2507.23325	null
2025-08-02	FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning	Jiajun Cao et.al.	2507.23318	null
2025-08-04	PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving	Xuewei Tang et.al.	2507.23309	null
2025-07-30	Causal-Inspired Multi-Agent Decision-Making via Graph Reinforcement Learning	Jing Wang et.al.	2507.23080	null
2025-08-05	Vision-Language Fusion for Real-Time Autonomous Driving: Goal-Centered Cross-Attention of Camera, HD-Map, & Waypoints	Santosh Patapati et.al.	2507.23064	null
2025-07-30	Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation	Alexandru Buburuzan et.al.	2507.23058	null
2025-08-07	Bayesian Optimization applied for accelerated Virtual Validation of the Autonomous Driving Function	Satyesh Shanker Awasthi et.al.	2507.22769	null
2025-07-30	Social-Pose: Enhancing Trajectory Prediction with Human Body Pose	Yang Gao et.al.	2507.22742	null
2025-07-30	Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model	Daehee Park et.al.	2507.22615	null
2025-07-30	TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation	Jiuming Liu et.al.	2507.22454	null
2025-07-30	Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators	Kaustav Chakraborty et.al.	2507.22389	null
2025-07-29	Hierarchical Game-Based Multi-Agent Decision-Making for Autonomous Vehicles	Mushuang Liu et.al.	2507.21941	null
2025-07-31	MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors	Shouyi Lu et.al.	2507.21872	null
2025-07-29	SAMITE: Position Prompted SAM2 with Calibrated Memory for Visual Object Tracking	Qianxiong Xu et.al.	2507.21732	null
2025-08-16	Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition	Ruiyang Hao et.al.	2507.21610	null
2025-07-29	SafeDriveRAG: Towards Safe Autonomous Driving with Knowledge Graph-based Retrieval-Augmented Generation	Hao Ye et.al.	2507.21585	null
2025-07-30	No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering	Linye Wei et.al.	2507.21572	null
2025-07-29	RelMap: Enhancing Online Map Construction with Class-Aware Spatial Relation and Semantic Priors	Tianhui Cai et.al.	2507.21567	null
2025-07-29	SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity	Xingyang Li et.al.	2507.21499	null
2025-07-29	MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving	Thomas Monninger et.al.	2507.21423	null
2025-08-03	Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy	Jicheng Yuan et.al.	2507.21358	null
2025-07-25	Seeing Beyond Frames: Zero-Shot Pedestrian Intention Prediction with Raw Temporal Video and Multimodal Cues	Pallavi Zambare et.al.	2507.21161	null
2025-07-28	GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction	Tianhao Li et.al.	2507.20963	null
2025-07-25	Event-Based De-Snowing for Autonomous Driving	Manasi Muglikar et.al.	2507.20901	null
2025-07-28	DriveAgent-R1: Advancing VLM-based Autonomous Driving with Hybrid Thinking and Active Perception	Weicheng Zheng et.al.	2507.20879	null
2025-07-27	Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars	Mattia Piccinini et.al.	2507.20427	null
2025-07-27	VESPA: Towards un(Human)supervised Open-World Pointcloud Labeling for Autonomous Driving	Levente Tempfli et.al.	2507.20397	null
2025-07-27	Solving Scene Understanding for Autonomous Navigation in Unstructured Environments	Naveen Mathews Renji et.al.	2507.20389	null
2025-07-27	VLMPlanner: Integrating Visual Language Models with Motion Planning	Zhipeng Tang et.al.	2507.20342	null
2025-07-27	MambaMap: Online Vectorized HD Map Construction using State Space Model	Ruizi Yang et.al.	2507.20224	null
2025-07-27	LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks	Fei Kong et.al.	2507.20174	null
2025-07-27	Meta Fusion: A Unified Framework For Multimodality Fusion with Mutual Learning	Ziyi Liang et.al.	2507.20089	null
2025-07-26	Digital Twin Channel-Enabled Online Resource Allocation for 6G: Principle, Architecture and Application	Tongjie Li et.al.	2507.19974	null
2025-08-12	DriveIndia: An Object Detection Dataset for Diverse Indian Traffic Scenes	Rishav Kumar et.al.	2507.19912	null
2025-07-26	Bridging Simulation and Usability: A User-Friendly Framework for Scenario Generation in CARLA	Ahmed Abouelazm et.al.	2507.19883	null
2025-07-26	FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving	Tao Lian et.al.	2507.19881	null
2025-07-30	RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection	Xiaokai Bai et.al.	2507.19856	null
2025-07-26	A 4D Radar Camera Extrinsic Calibration Tool Based on 3D Uncertainty Perspective N Points	Chuan Cao et.al.	2507.19829	null
2025-07-25	PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction	Haichuan Li et.al.	2507.19701	null
2025-07-25	Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing	Haichuan Li et.al.	2507.19691	null
2025-08-02	GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting	Baijun Ye et.al.	2507.19451	null
2025-07-25	An OpenSource CI/CD Pipeline for Variant-Rich Software-Defined Vehicles	Matthias Weiß et.al.	2507.19446	null
2025-07-25	SDVDiag: A Modular Platform for the Diagnosis of Connected Vehicle Functions	Matthias Weiß et.al.	2507.19403	null
2025-07-25	BEV-LLM: Leveraging Multimodal BEV Maps for Scene Captioning in Autonomous Driving	Felix Brandstaetter et.al.	2507.19370	null
2025-07-25	LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences	Yusuke Hirota et.al.	2507.19362	null
2025-07-25	SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence	Viktar Dubovik et.al.	2507.19321	null
2025-07-25	CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception	Jiaru Zhong et.al.	2507.19239	null
2025-07-25	VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions	Haoang Lu et.al.	2507.19188	null
2025-07-25	Continual Learning-Based Unified Model for Unpaired Image Restoration Tasks	Kotha Kartheek et.al.	2507.19184	null
2025-07-25	Diverse and Adaptive Behavior Curriculum for Autonomous Driving: A Student-Teacher Framework with Multi-Agent RL	Ahmed Abouelazm et.al.	2507.19146	null
2025-07-31	PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction	Yanghong Liu et.al.	2507.19119	null
2025-07-25	Fine-Grained Traffic Inference from Road to Lane via Spatio-Temporal Graph Node Generation	Shuhao Li et.al.	2507.19089	null
2025-07-25	HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback	Elham Soltani Kazemi et.al.	2507.18921	null
2025-07-24	Diffusion-FS: Multimodal Free-Space Prediction via Diffusion for Autonomous Driving	Keshav Gupta et.al.	2507.18763	null
2025-07-24	Linear Memory SE(2) Invariant Attention	Ethan Pronovost et.al.	2507.18597	null
2025-07-24	GaussianFusionOcc: A Seamless Sensor Fusion Approach for 3D Occupancy Prediction Using 3D Gaussians	Tomislav Pavković et.al.	2507.18522	null
2025-07-24	Delving into Mapping Uncertainty for Mapless Trajectory Prediction	Zongzheng Zhang et.al.	2507.18498	null
2025-07-24	Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments	Xiao Yang et.al.	2507.18484	null
2025-07-24	CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting	Haoran Xu et.al.	2507.18473	null
2025-07-24	LONG3R: Long Sequence Streaming 3D Reconstruction	Zhuoguang Chen et.al.	2507.18255	null
2025-07-24	GenAI for Automotive Software Development: From Requirements to Wheels	Nenad Petrovic et.al.	2507.18223	null
2025-07-24	Goal-based Trajectory Prediction for improved Cross-Dataset Generalization	Daniel Grimm et.al.	2507.18196	null
2025-07-24	Policy Disruption in Reinforcement Learning:Adversarial Attack with Large Language Models and Critical State Identification	Junyong Jiang et.al.	2507.18113	null
2025-07-23	BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems	Malsha Ashani Mahawatta Dona et.al.	2507.17722	null
2025-07-23	Reusing Attention for One-stage Lane Topology Understanding	Yang Li et.al.	2507.17617	null
2025-07-23	InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling	Xiaoxue Chen et.al.	2507.17613	null
2025-07-24	PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving	Maciej K. Wozniak et.al.	2507.17596	null
2025-07-23	SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving	Chuang Chen et.al.	2507.17479	null
2025-07-23	VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization	Sania Waheed et.al.	2507.17455	null
2025-07-23	Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning	Joobin Jin et.al.	2507.17418	null
2025-08-06	DeMo++: Motion Decoupling for Autonomous Driving	Bozhou Zhang et.al.	2507.17342	null
2025-07-23	JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction	Fangze Lin et.al.	2507.17152	null
2025-07-23	HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study	Mandar Pitale et.al.	2507.17118	null
2025-07-22	SDGOCC: Semantic and Depth-Guided Bird’s-Eye View Transformation for 3D Multimodal Occupancy Prediction	Zaipeng Duan et.al.	2507.17083	null
2025-07-22	Few-Shot Learning in Video and 3D Object Detection: A Survey	Md Meftahul Ferdaus et.al.	2507.17079	null
2025-07-22	Advancing Robustness in Deep Reinforcement Learning with an Ensemble Defense Approach	Adithya Mohan et.al.	2507.17070	null
2025-07-22	Denoising-While-Completing Network (DWCNet): Robust Point Cloud Completion Under Corruption	Keneni W. Tesema et.al.	2507.16743	null
2025-07-22	Morpheus: A Neural-driven Animatronic Face with Hybrid Actuation and Diverse Emotion Control	Zongzheng Zhang et.al.	2507.16645	null
2025-07-22	A Target-based Multi-LiDAR Multi-Camera Extrinsic Calibration System	Lorenzo Gentilini et.al.	2507.16621	null
2025-07-22	VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences	Kai Deng et.al.	2507.16443	null
2025-07-22	A Sparsity-Aware Autonomous Path Planning Accelerator with HW/SW Co-Design and Multi-Level Dataflow Optimization	Yifan Zhang et.al.	2507.16177	null
2025-07-21	Optimal Batch-Size Control for Low-Latency Federated Learning with Device Heterogeneity	Huiling Yang et.al.	2507.15601	null
2025-07-21	Robots for Kiwifruit Harvesting and Pollination	Jamie Bell et.al.	2507.15484	null
2025-07-21	VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving	Haichao Liu et.al.	2507.15266	null
2025-07-20	CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning	Pan Hu et.al.	2507.14903	null
2025-07-23	GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving	Chi Wan et.al.	2507.14456	null
2025-07-18	Preference-based Multi-Objective Reinforcement Learning	Ni Mu et.al.	2507.14066	null
2025-07-18	Architecting Human-AI Cocreation for Technical Services – Interaction Modes and Contingency Factors	Jochen Wulf et.al.	2507.14034	null
2025-07-18	Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection	Yujian Mo et.al.	2507.13899	null
2025-07-18	Depth3DLane: Fusing Monocular 3D Lane Detection with Self-Supervised Monocular Depth Estimation	Max van den Hoven et.al.	2507.13857	null
2025-07-18	One Step Closer: Creating the Future to Boost Monocular Semantic Scene Completion	Haoang Lu et.al.	2507.13801	null
2025-07-18	AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework	Yu Yao et.al.	2507.13729	null
2025-07-17	CaSTFormer: Causal Spatio-Temporal Transformer for Driving Intention Prediction	Sirui Wang et.al.	2507.13425	null
2025-07-16	From Binary to Semantic: Utilizing Large-Scale Binary Occupancy Data for 3D Semantic Occupancy Prediction	Chihiro Noguchi et.al.	2507.13387	null
2025-07-17	Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models	Arian Mousakhan et.al.	2507.13162	null
2025-07-17	Channel-wise Motion Features for Efficient Motion Segmentation	Riku Inoue et.al.	2507.13082	null
2025-07-23	LaViPlan : Language-Guided Visual Path Planning with RLVR	Hayeon Oh et.al.	2507.12911	null
2025-07-17	World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving	Yanchen Guan et.al.	2507.12762	null
2025-07-17	Domain-Enhanced Dual-Branch Model for Efficient and Interpretable Accident Anticipation	Yanchen Guan et.al.	2507.12755	null
2025-07-16	ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving	Yuhang Lu et.al.	2507.12499	null
2025-07-16	MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding	Renjie Li et.al.	2507.12463	null
2025-07-16	AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models	Santosh Vasa et.al.	2507.12414	null
2025-08-06	AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving	Jiawei Xu et.al.	2507.12137	null
2025-07-16	LidarPainter: One-Step Away From Any Lidar View To Novel Guidance	Yuzhou Ji et.al.	2507.12114	null
2025-07-16	Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics	Muleilan Pei et.al.	2507.12083	null
2025-07-16	IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving	Kanghyun Ryu et.al.	2507.11940	null
2025-07-16	Towards Autonomous Riding: A Review of Perception, Planning, and Control in Intelligent Two-Wheelers	Mohammed Hassanin et.al.	2507.11852	null
2025-07-15	Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation	Zhen Xu et.al.	2507.11540	null
2025-07-15	A Survey on Interpretability in Visual Recognition	Qiyang Wan et.al.	2507.11099	null
2025-07-14	RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding	Benjamin Stoler et.al.	2507.10749	null
2025-07-14	Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance	Kyungtae Han et.al.	2507.10500	null
2025-07-08	U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration	Xiaofan Li et.al.	2507.04503	null
2025-06-12	ReSim: Reliable World Simulation for Autonomous Driving	Jiazhi Yang et.al.	2506.09981	null
2025-06-12	Reinforced Refinement with Self-Aware Expansion for End-to-End Autonomous Driving	Haochen Liu et.al.	2506.09800	null
2025-08-28	Pseudo-Simulation for Autonomous Driving	Wei Cao et.al.	2506.04218	null
2025-05-15	Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object Shapes	Nicola Marinello et.al.	2505.09562	null
2025-04-08	Data Scaling Laws for End-to-End Autonomous Driving	Alexander Naumann et.al.	2504.04338	null
2025-03-17	Centaur: Robust End-to-End Autonomous Driving with Test-Time Training	Chonghao Sima et.al.	2503.11650	null
2025-02-18	OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving	Shuo Xing et.al.	2412.15208	null
2024-12-16	Hidden Biases of End-to-End Driving Datasets	Julian Zimmerlin et.al.	2412.09602	null
2024-12-03	InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving	Xiyan Jiang et.al.	2411.18302	null
2024-11-18	Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving	Shota Yamazaki et.al.	2411.09971	null
2025-03-19	IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving	Clémence Grislain et.al.	2411.04653	null
2025-03-31	LoRD: Adapting Differentiable Driving Policies to Distribution Shifts	Christopher Diehl et.al.	2410.09681	null
2024-12-05	DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving	Dingrui Wang et.al.	2409.18053	null
2025-03-20	Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving	Sándor Kunsági-Máté et.al.	2409.12620	null
2024-11-01	NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking	Daniel Dauner et.al.	2406.15349	null
2024-06-17	CarLLaVA: Vision language models for camera-only closed-loop driving	Katrin Renz et.al.	2406.10165	null
2024-10-29	Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability	Shenyuan Gao et.al.	2405.17398	null
2024-05-10	Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving	Akshay Gopalkrishnan et.al.	2403.19838	null
2024-04-16	Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap	Carl Lindström et.al.	2403.16092	null
2024-08-09	GenAD: Generalized Predictive Model for Autonomous Driving	Jiazhi Yang et.al.	2403.09630	null
2024-06-26	DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models	Xiaoyu Tian et.al.	2402.12289	null
2024-11-26	VLP: Vision Language Planning for Autonomous Driving	Chenbin Pan et.al.	2401.05577	null
2025-01-17	DriveLM: Driving with Graph Visual Question Answering	Chonghao Sima et.al.	2312.14150	null
2023-12-12	Evaluation of Large Language Models for Decision Making in Autonomous Driving	Kotaro Tanahashi et.al.	2312.06351	null
2023-12-29	Towards Knowledge-driven Autonomous Driving	Xin Li et.al.	2312.04316	null
2024-03-25	Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future	Hongyang Li et.al.	2312.03408	null
2024-07-30	A Language Agent for Autonomous Driving	Jiageng Mao et.al.	2311.10813	null
2023-08-25	On Offline Evaluation of 3D Object Detection for Autonomous Driving	Tim Schreier et.al.	2308.12779	null
2023-07-21	Explaining Autonomous Driving Actions with Visual Question Answering	Shahin Atakishiyev et.al.	2307.10408	null
2023-07-17	Linking vision and motion for self-supervised object-centric perception	Kaylene C. Stocking et.al.	2307.07147	null
2024-08-16	End-to-end Autonomous Driving: Challenges and Frontiers	Li Chen et.al.	2306.16927	null
2023-11-13	An Overview about Emerging Technologies of Autonomous Driving	Yu Huang et.al.	2306.13302	null
2023-06-16	Sim-on-Wheels: Physical World in the Loop Simulation for Self-Driving	Yuan Shen et.al.	2306.08807	null
2023-05-31	Generating Driving Scenes with Diffusion	Ethan Pronovost et.al.	2305.18452	null
2023-05-30	Selective Communication for Cooperative Perception in End-to-End Autonomous Driving	Hsu-kuang Chiu et.al.	2305.17181	null
2023-05-29	Automatic Surround Camera Calibration Method in Road Scene for Self-driving Car	Jixiang Li et.al.	2305.16840	null
2023-05-17	Self-Aware Trajectory Prediction for Safe Autonomous Driving	Wenbo Shao et.al.	2305.09147	null
2023-04-10	EGA-Depth: Efficient Guided Attention for Self-Supervised Multi-Camera Depth Estimation	Yunxiao Shi et.al.	2304.03369	null
2023-09-14	Collaborative Perception in Autonomous Driving: Methods, Datasets and Challenges	Yushan Han et.al.	2301.06262	null
2023-03-16	Policy Pre-training for Autonomous Driving via Self-supervised Geometric Modeling	Penghao Wu et.al.	2301.01006	null
2023-03-24	Planning-oriented Autonomous Driving	Yihan Hu et.al.	2212.10156	null
2022-11-21	Rationale-aware Autonomous Driving Policy utilizing Safety Force Field implemented on CARLA Simulator	Ho Suk et.al.	2211.10237	null
2023-09-25	aiMotive Dataset: A Multimodal Dataset for Robust Autonomous Driving with Long-Range Perception	Tamás Matuszka et.al.	2211.09445	null
2022-11-02	Improving Motion Forecasting for Autonomous Driving with the Cycle Consistency Loss	Titas Chakraborty et.al.	2211.00149	null
2022-06-27	MPC-based Imitation Learning for Safe and Human-like Autonomous Driving	Flavia Sofia Acerbo et.al.	2206.12348	null
2022-06-22	Incorporating Voice Instructions in Model-Based Reinforcement Learning for Self-Driving Cars	Mingze Wang et.al.	2206.10249	null
2022-06-22	MPA: MultiPath++ Based Architecture for Motion Prediction	Stepan Konev et.al.	2206.10041	null
2023-04-05	3D Object Detection for Autonomous Driving: A Comprehensive Survey	Jiageng Mao et.al.	2206.09474	null
2022-06-28	A Human-Centric Method for Generating Causal Explanations in Natural Language for Autonomous Vehicle Motion Planning	Balint Gyevnar et.al.	2206.08783	null
2022-06-20	SafeRL-Kit: Evaluating Efficient Reinforcement Learning Methods for Safe Autonomous Driving	Linrui Zhang et.al.	2206.08528	null
2022-06-17	Virtual Correspondence: Humans as a Cue for Extreme-View Geometry	Wei-Chiu Ma et.al.	2206.08365	null
2023-07-31	Pushing the Limits of Learning-based Traversability Analysis for Autonomous Driving on CPU	Daniel Fusaro et.al.	2206.03083	null
2022-06-07	MotionCNN: A Strong Baseline for Motion Prediction in Autonomous Driving	Stepan Konev et.al.	2206.02163	null
2022-06-01	TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving	Kashyap Chitta et.al.	2205.15997	null
2022-05-31	OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving	Guohang Yan et.al.	2205.14087	null
2022-04-29	NeurMiPs: Neural Mixture of Planar Experts for View Synthesis	Zhi-Hao Lin et.al.	2204.13696	null
2022-03-31	Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data	Corentin Sautier et.al.	2203.16258	null
2022-04-19	PseudoProp: Robust Pseudo-Label Generation for Semi-Supervised Object Detection in Autonomous Driving Systems	Shu Hu et.al.	2203.05983	null
2024-12-18	Multi-modal Sensor Fusion for Auto Driving Perception: A Survey	Keli Huang et.al.	2202.02703	null
2021-12-30	Parallelized and Randomized Adversarial Imitation Learning for Safety-Critical Self-Driving Vehicles	Won Joon Yun et.al.	2112.14710	null
2021-11-03	Offline Reinforcement Learning for Autonomous Driving with Safety and Exploration Enhancement	Tianyu Shi et.al.	2110.07067	null
2021-08-10	Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge	Songyang Zhang et.al.	2108.04230	null
2021-04-23	Multi-task Learning with Attention for End-to-end Autonomous Driving	Keishi Ishihara et.al.	2104.10753	null
2021-04-20	Multi-Modal Fusion Transformer for End-to-End Autonomous Driving	Aditya Prakash et.al.	2104.09224	null
2021-04-20	Self-Supervised Pillar Motion Learning for Autonomous Driving	Chenxu Luo et.al.	2104.08683	null
2021-03-31	Multi-modal Trajectory Prediction for Autonomous Driving with Semantic Map and Dynamic Graph Attention Network	Bo Dong et.al.	2103.16273	null
2021-01-20	Deep Feedback Inverse Problem Solver	Wei-Chiu Ma et.al.	2101.07719	null
2021-01-19	Non-parametric Memory for Spatio-Temporal Segmentation of Construction Zones for Self-Driving	Min Bai et.al.	2101.06865	null
2021-01-19	Deep Parametric Continuous Convolutional Neural Networks	Shenlong Wang et.al.	2101.06742	null
2020-12-01	Trajformer: Trajectory Prediction with Local Self-Attentive Contexts for Autonomous Driving	Manoj Bhat et.al.	2011.14910	null
2023-05-16	Control Strategies for Autonomous Vehicles	Chinmay Vilas Samak et.al.	2011.08729	null
2020-10-21	Tracking from Patterns: Learning Corresponding Patterns in Point Clouds for 3D Object Tracking	Jieqi Shi et.al.	2010.10051	null
2022-03-07	RGB cameras failures and their effects in autonomous driving applications	Francesco Secci et.al.	2008.05938	null
2021-09-10	Label Efficient Visual Abstractions for Autonomous Driving	Aseem Behl et.al.	2005.10091	null
2020-03-03	3D Point Cloud Processing and Learning for Autonomous Driving	Siheng Chen et.al.	2003.00601	null
2022-11-21	Self-Driving like a Human driver instead of a Robocar: Personalized comfortable driving experience for autonomous vehicles	Il Bae et.al.	2001.03908	null
2020-05-24	Scalability in Perception for Autonomous Driving: Waymo Open Dataset	Pei Sun et.al.	1912.04838	null
2019-11-12	Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning	Praveen Palanisamy et.al.	1911.04175	null
2019-11-22	SoildNet: Soiling Degradation Detection in Autonomous Driving	Arindam Das et.al.	1911.01054	null
2019-10-25	Identifying Unknown Instances for Autonomous Driving	Kelvin Wong et.al.	1910.11296	null
2020-03-26	A Survey of Deep Learning Techniques for Autonomous Driving	Sorin Grigorescu et.al.	1910.07738	null
2019-09-18	*A3D Dataset: Towards Autonomous Driving in Challenging Environments**	Quang-Hieu Pham et.al.	1909.07541	null
2019-08-12	Exploiting Sparse Semantic HD Maps for Self-Driving Vehicle Localization	Wei-Chiu Ma et.al.	1908.03274	null
2024-09-23	Key Ingredients of Self-Driving Cars	Rui Fan et.al.	1906.02939	null
2019-05-20	LiDAR Sensor modeling and Data augmentation with GANs for Autonomous driving	Ahmad El Sallab et.al.	1905.07290	null
2019-04-19	Deep Rigid Instance Scene Flow	Wei-Chiu Ma et.al.	1904.08913	null
2019-04-15	The Sound of Motions	Hang Zhao et.al.	1904.05979	null
2019-03-07	The AI Driving Olympics at NeurIPS 2018	Julian Zilly et.al.	1903.02503	null
2018-10-11	CFENet: An Accurate and Efficient Single-Shot Object Detector for Autonomous Driving	Qijie Zhao et.al.	1806.09790	null
2018-05-21	Scene Understanding Networks for Autonomous Driving based on Around View Monitoring System	JeongYeol Baek et.al.	1805.07029	null
2017-04-11	Deep Reinforcement Learning framework for Autonomous Driving	Ahmad El Sallab et.al.	1704.02532	null
2016-06-24	Find your Way by Observing the Sun and Other Semantic Cues	Wei-Chiu Ma et.al.	1606.07415	null

Traffic Simulation

Publish Date	Title	Authors	PDF	Code
2025-12-09	Mind to Hand: Purposeful Robotic Control via Embodied Reasoning	Peijun Tang et.al.	2512.08580	null
2025-12-09	High-Performance Dual-Arm Task and Motion Planning for Tabletop Rearrangement	Duo Zhang et.al.	2512.08206	null
2025-12-07	A Hetero-Associative Sequential Memory Model Utilizing Neuromorphic Signals: Validated on a Mobile Manipulator	Runcong Wang et.al.	2512.07032	null
2025-12-09	db-LaCAM: Fast and Scalable Multi-Robot Kinodynamic Motion Planning with Discontinuity-Bounded Search and Lightweight MAPF	Akmaral Moldagalieva et.al.	2512.06796	null
2025-12-05	Multi-Modal Zero-Shot Prediction of Color Trajectories in Food Drying	Shichen Li et.al.	2512.06190	null
2025-12-05	WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving	Yifang Xu et.al.	2512.06112	null
2025-12-05	Training-Time Action Conditioning for Efficient Real-Time Chunking	Kevin Black et.al.	2512.05964	null
2025-12-05	Toward Efficient and Robust Behavior Models for Multi-Agent Driving Simulation	Fabian Konstantinidis et.al.	2512.05812	null
2025-12-05	Bayesian Active Inference for Intelligent UAV Anti-Jamming and Adaptive Trajectory Planning	Ali Krayani et.al.	2512.05711	null
2025-12-05	Scenario-aware Uncertainty Quantification for Trajectory Prediction with Statistical Guarantees	Yiming Shu et.al.	2512.05682	null
2025-12-04	XR-DT: Extended Reality-Enhanced Digital Twin for Agentic Mobile Robots	Tianyi Wang et.al.	2512.05270	null
2025-12-04	TV2TV: A Unified Framework for Interleaved Language and Video Generation	Xiaochuang Han et.al.	2512.05103	null
2025-12-04	Contact-Implicit Modeling and Simulation of a Snake Robot on Compliant and Granular Terrain	Haroon Hublikar et.al.	2512.05008	null
2025-12-04	Back to Basics: Motion Representation Matters for Human Motion Generation Using Diffusion Model	Yuduo Jin et.al.	2512.04499	null
2025-12-04	DeRA: Decoupled Representation Alignment for Video Tokenization	Pengbo Guo et.al.	2512.04483	null
2025-12-04	Vision-Language-Action Models for Selective Robotic Disassembly: A Case Study on Critical Component Extraction from Desktops	Chang Liu et.al.	2512.04446	null
2025-12-04	MindDrive: An All-in-One Framework Bridging World Models and Vision-Language Model for End-to-End Autonomous Driving	Bin Sun et.al.	2512.04441	null
2025-12-04	FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring	Geunhyuk Youk et.al.	2512.04390	null
2025-12-03	Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications	Gasser Elazab et.al.	2512.04303	null
2025-12-03	Inference-time Stochastic Refinement of GRU-Normalizing Flow for Real-time Video Motion Transfer	Tasmiah Haque et.al.	2512.04282	null
2025-12-03	Driving is a Game: Combining Planning and Prediction with Bayesian Iterative Best Response	Aron Distelzweig et.al.	2512.03936	null
2025-12-03	MPCFormer: A physics-informed data-driven approach for explainable socially-aware autonomous driving	Jia Hu et.al.	2512.03795	null
2025-12-03	Safety Reinforced Model Predictive Control (SRMPC): Improving MPC with Reinforcement Learning for Motion Planning in Autonomous Driving	Johannes Fischer et.al.	2512.03774	null
2025-12-03	Bayesian Optimization for Automatic Tuning of Torque-Level Nonlinear Model Predictive Control	Gabriele Fadini et.al.	2512.03772	null
2025-12-03	Prediction-Driven Motion Planning: Route Integration Strategies in Attention-Based Prediction Models	Marlon Steiner et.al.	2512.03756	null
2025-12-03	ContactRL: Safe Reinforcement Learning based Motion Planning for Contact based Human Robot Collaboration	Sundas Rafat Mulkana et.al.	2512.03707	null
2025-12-03	A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection	Shahid Ansari et.al.	2512.03684	null
2025-12-03	Multimodal Control of Manipulators: Coupling Kinematics and Vision for Self-Driving Laboratory Operations	Shifa Sulaiman et.al.	2512.03630	null
2025-12-08	LAMP: Language-Assisted Motion Planning for Controllable Video Generation	Muhammed Burak Kizil et.al.	2512.03619	null
2025-12-03	RoboScape-R: Unified Reward-Observation World Models for Generalizable Robotics Training via RL	Yinzhou Tang et.al.	2512.03556	null
2025-12-03	GeoVideo: Introducing Geometric Regularization into Video Generation Model	Yunpeng Bai et.al.	2512.03453	null
2025-12-03	PerFACT: Motion Policy with LLM-Powered Dataset Synthesis and Fusion Action-Chunking Transformers	Davood Soleymanzadeh et.al.	2512.03444	null
2025-12-03	ProtoEFNet: Dynamic Prototype Learning for Inherently Interpretable Ejection Fraction Estimation in Echocardiography	Yeganeh Ghamary et.al.	2512.03339	null
2025-12-02	Flux4D: Flow-based Unsupervised 4D Reconstruction	Jingkang Wang et.al.	2512.03210	null
2025-12-02	DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images	Xiaoxue Chen et.al.	2512.03004	null
2025-12-02	Experimental Characterization of Fingertip Trajectory following for a 3-DoF Series-Parallel Hybrid Robotic Finger	Nicholas Baiata et.al.	2512.02951	null
2025-12-03	SwarmDiffusion: End-To-End Traversability-Guided Diffusion for Embodiment-Agnostic Navigation of Heterogeneous Robots	Iana Zhura et.al.	2512.02851	null
2025-12-02	ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning	Yifan Li et.al.	2512.02835	null
2025-12-02	Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and Dataset	Qifan Liang et.al.	2512.02780	null
2025-12-02	CogDrive: Cognition-Driven Multimodal Prediction-Planning Fusion for Safe Autonomy	Heye Huang et.al.	2512.02777	null
2025-12-02	Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models	Xinyue Ai et.al.	2512.02636	null
2025-12-02	SAM2Grasp: Resolve Multi-modal Grasping via Prompt-conditioned Temporal Action Prediction	Shengkai Wu et.al.	2512.02609	null
2025-12-02	YingVideo-MV: Music-Driven Multi-Stage Video Generation	Jiahui Chen et.al.	2512.02492	null
2025-12-03	Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation	Jianzong Wu et.al.	2512.02457	null
2025-12-02	On-the-fly Feedback SfM: Online Explore-and-Exploit UAV Photogrammetry with Incremental Mesh Quality-Aware Indicator and Predictive Path Planning	Liyuan Lou et.al.	2512.02375	null
2025-12-02	Multi-Domain Enhanced Map-Free Trajectory Prediction with Selective Attention	Wenyi Xiong et.al.	2512.02368	null
2025-12-02	On the Convergence of Density-Based Predictive Control for Multi-Agent Non-Uniform Area Coverage	Sungjun Seo et.al.	2512.02367	null
2025-12-02	TALO: Pushing 3D Vision Foundation Models Towards Globally Consistent Online Reconstruction	Fengyi Zhang et.al.	2512.02341	null
2025-12-01	EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI	Jianlei Chang et.al.	2512.02020	null
2025-12-01	RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies	Guillermo Garcia-Cobo et.al.	2512.01993	null
2025-12-01	GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment	Haoyang He et.al.	2512.01952	null
2025-12-01	Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory	Chenyi Wang et.al.	2512.01934	null
2025-12-01	StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos	Daeun Lee et.al.	2512.01707	null
2025-12-01	Dynamic Log-Gaussian Process Control Barrier Function for Safe Robotic Navigation in Dynamic Environments	Xin Yin et.al.	2512.01668	null
2025-12-09	CourtMotion: Learning Event-Driven Motion Representations from Skeletal Data for Basketball	Omer Sela et.al.	2512.01478	null
2025-12-01	Modality-Augmented Fine-Tuning of Foundation Robot Policies for Cross-Embodiment Manipulation on GR1 and G1	Junsung Park et.al.	2512.01358	null
2025-12-01	InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision	Chenting Wang et.al.	2512.01342	null
2025-12-01	DPAC: Distribution-Preserving Adversarial Control for Diffusion Sampling	Han-Jin Lee et.al.	2512.01153	null
2025-11-30	Weakly Supervised Continuous Micro-Expression Intensity Estimation Using Temporal Deep Neural Network	Riyadh Mohammed Almushrafy et.al.	2512.01145	null
2025-11-30	Think Fast: Real-Time Kinodynamic Belief-Space Planning for Projectile Interception	Gabriel Olin et.al.	2512.01108	null
2025-11-30	Estimation of Kinematic Motion from Dashcam Footage	Evelyn Zhang et.al.	2512.01104	null
2025-11-30	FOM-Nav: Frontier-Object Maps for Object Goal Navigation	Thomas Chabal et.al.	2512.01009	null
2025-11-30	Efficient and Scalable Monocular Human-Object Interaction Motion Reconstruction	Boran Wen et.al.	2512.00960	null
2025-11-30	Constant-Time Motion Planning with Manipulation Behaviors	Nayesha Gandotra et.al.	2512.00939	null
2025-11-30	Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound	Jiahua Wang et.al.	2512.00883	null
2025-11-30	TrajDiff: End-to-end Autonomous Driving without Perception Annotation	Xingtai Gui et.al.	2512.00723	null
2025-11-30	CAR-Net: A Cascade Refinement Network for Rotational Motion Deblurring under Angle Information Uncertainty	Ka Chung Lai et.al.	2512.00700	null
2025-11-29	Sample-Efficient Expert Query Control in Active Imitation Learning via Conformal Prediction	Arad Firouzkouhi et.al.	2512.00453	null
2025-11-29	FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal	Hang Xu et.al.	2512.00438	null
2025-11-29	DPNet: Doppler LiDAR Motion Planning for Highly-Dynamic Environments	Wei Zuo et.al.	2512.00375	null
2025-11-29	Towards aligned body representations in vision models	Andrey Gizdov et.al.	2512.00365	null
2025-11-29	SMamDiff: Spatial Mamba for Stochastic Human Motion Prediction	Junqiao Fan et.al.	2512.00355	null
2025-11-29	mmPred: Radar-based Human Motion Prediction in the Dark	Junqiao Fan et.al.	2512.00345	null
2025-11-29	Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views	Kunwar Maheep Singh et.al.	2512.00255	null
2025-11-02	XFlowMP: Task-Conditioned Motion Fields for Generative Robot Planning with Schrodinger Bridges	Khang Nguyen et.al.	2512.00022	null
2025-11-28	From CAD to POMDP: Probabilistic Planning for Robotic Disassembly of End-of-Life Products	Jan Baumgärtner et.al.	2511.23407	null
2025-11-28	Incorporating Ephemeral Traffic Waves in A Data-Driven Framework for Microsimulation in CARLA	Alex Richardson et.al.	2511.23236	null
2025-11-28	Field-programmable dynamics in a soft magnetic actuator enabling true random number generation and reservoir computing	Eduardo Sergio Oliveros-Mata et.al.	2511.23215	null
2025-11-28	LatBot: Distilling Universal Latent Actions for Vision-Language-Action Models	Zuolei Li et.al.	2511.23034	null
2025-11-28	McSc: Motion-Corrective Preference Alignment for Video Generation with Self-Critic Hierarchical Reasoning	Qiushi Yang et.al.	2511.22974	null
2025-12-01	DenoiseGS: Gaussian Reconstruction Model for Burst Denoising	Yongsen Cheng et.al.	2511.22939	null
2025-11-28	Threat-Aware UAV Dodging of Human-Thrown Projectiles with an RGB-D Camera	Yuying Zhang et.al.	2511.22847	null
2025-11-28	Safe Autonomous Lane Changing: Planning with Dynamic Risk Fields and Time-Varying Convex Space Generation	Zhen Tian et.al.	2511.22829	null
2025-11-27	CoT4AD: A Vision-Language-Action Model with Explicit Chain-of-Thought Reasoning for Autonomous Driving	Zhaohui Wang et.al.	2511.22532	null
2025-11-27	UAV-MM3D: A Large-Scale Synthetic Benchmark for 3D Perception of Unmanned Aerial Vehicles with Multi-Modal Data	Longkun Zou et.al.	2511.22404	null
2025-11-27	MTR-VP: Towards End-to-End Trajectory Planning through Context-Driven Image Encoding and Multiple Trajectory Prediction	Maitrayee Keskar et.al.	2511.22181	null
2025-11-27	SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model	Jiayuan Du et.al.	2511.22039	null
2025-11-26	UniArt: Unified 3D Representation for Generating 3D Articulated Objects with Open-Set Articulation	Bu Jin et.al.	2511.21887	null
2025-11-26	TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos	Seungjae Lee et.al.	2511.21690	null
2025-11-26	Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving	Haohong Lin et.al.	2511.21584	null
2025-11-26	SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation	Ziyi Chen et.al.	2511.21135	null
2025-11-26	Inversion-Free Style Transfer with Dual Rectified Flows	Yingying Deng et.al.	2511.20986	null
2025-11-25	DeeAD: Dynamic Early Exit of Vision-Language Action for Efficient Autonomous Driving	Haibo HU et.al.	2511.20720	null
2025-11-25	Evaluating the Performance of Deep Learning Models in Whole-body Dynamic 3D Posture Prediction During Load-reaching Activities	Seyede Niloofar Hosseini et.al.	2511.20615	null
2025-11-25	Safe and Stable Neural Network Dynamical Systems for Robot Motion Planning	Allen Emmanuel Binny et.al.	2511.20593	null
2025-11-25	Metric, inertially aligned monocular state estimation via kinetodynamic priors	Jiaxin Liu et.al.	2511.20496	null
2025-11-26	BRIC: Bridging Kinematic Plans and Physical Control at Test Time	Dohun Lim et.al.	2511.20431	null
2025-11-25	FREE: Uncertainty-Aware Autoregression for Parallel Diffusion Transformers	Xinwan Wen et.al.	2511.20390	null
2025-12-01	3D Motion Perception of Binocular Vision Target with PID-CNN	Jiazhao Shi et.al.	2511.20332	null
2025-11-25	How Robot Kinematics Influence Human Performance in Virtual Robot-to-Human Handover Tasks	Róisín Keenan et.al.	2511.20299	null
2025-11-25	Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations	Chao Wang et.al.	2511.20295	null
2025-11-25	Dynamic-ICP: Doppler-Aware Iterative Closest Point Registration for Dynamic Scenes	Dong Wang et.al.	2511.20292	null
2025-11-27	SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery	Da Li et.al.	2511.20157	null
2025-11-25	Map-World: Masked Action planning and Path-Integral World Model for Autonomous Driving	Bin Hu et.al.	2511.20156	null
2025-11-25	Alzheimers Disease Progression Prediction Based on Manifold Mapping of Irregularly Sampled Longitudinal Data	Xin Hong et.al.	2511.20154	null
2025-11-25	WPT: World-to-Policy Transfer via Online World Model Distillation	Guangfeng Jiang et.al.	2511.20095	null
2025-11-25	Active3D: Active High-Fidelity 3D Reconstruction via Hierarchical Uncertainty Quantification	Yan Li et.al.	2511.20050	null
2025-11-25	ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction	Yuanzhe Li et.al.	2511.20020	null
2025-11-25	Multi-Context Fusion Transformer for Pedestrian Crossing Intention Prediction in Urban Environments	Yuanzhe Li et.al.	2511.20011	null
2025-11-25	Pedestrian Crossing Intention Prediction Using Multimodal Fusion Network	Yuanzhe Li et.al.	2511.20008	null
2025-11-25	Redefining Radar Segmentation: Simultaneous Static-Moving Segmentation and Ego-Motion Estimation using Radar Point Clouds	Simin Zhu et.al.	2511.20003	null
2025-11-25	GazeProphetV2: Head-Movement-Based Gaze Prediction Enabling Efficient Foveated Rendering on Mobile VR	Farhaan Ebadulla et.al.	2511.19988	null
2025-11-25	CoC-VLA: Delving into Adversarial Domain Transfer for Explainable Autonomous Driving via Chain-of-Causality Visual-Language-Action Model	Dapeng Zhang et.al.	2511.19914	null
2025-11-30	GigaWorld-0: World Models as Data Engine to Empower Embodied AI	GigaWorld Team et.al.	2511.19861	null
2025-11-25	Unifying Perception and Action: A Hybrid-Modality Pipeline with Implicit Visual Chain-of-Thought for Robotic Action Generation	Xiangkai Ma et.al.	2511.19859	null
2025-11-24	Whole-Body Inverse Dynamics MPC for Legged Loco-Manipulation	Lukas Molnar et.al.	2511.19709	null
2025-11-24	Fara-7B: An Efficient Agentic Model for Computer Use	Ahmed Awadallah et.al.	2511.19663	null
2025-11-26	Development of a Testbed for Autonomous Vehicles: Integrating MPC Control with Monocular Camera Lane Detection	Shantanu Rahman et.al.	2511.19655	null
2025-11-19	Strong Duality and Dual Ascent Approach to Continuous-Time Chance-Constrained Stochastic Optimal Control	Apurva Patil et.al.	2511.19451	null
2025-11-24	In-Video Instructions: Visual Signals as Generative Control	Gongfan Fang et.al.	2511.19401	null
2025-11-24	Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving	Jianhua Han et.al.	2511.19221	null
2025-11-24	Reference-Free Sampling-Based Model Predictive Control	Fabian Schramm et.al.	2511.19204	null
2025-11-24	Autonomous Docking of Multi-Rotor UAVs on Blimps under the Influence of Wind Gusts	Pascal Goldschmid et.al.	2511.19135	null
2025-11-24	HABIT: Human Action Benchmark for Interactive Traffic in CARLA	Mohan Ramesh et.al.	2511.19109	null
2025-11-24	VeCoR - Velocity Contrastive Regularization for Flow Matching	Zong-Wei Hong et.al.	2511.18942	null
2025-11-24	GContextFormer: A global context-aware hybrid multi-head attention approach with scaled additive aggregation for multimodal trajectory prediction	Yuzhi Chen et.al.	2511.18874	null
2025-11-24	AutoOdom: Learning Auto-regressive Proprioceptive Odometry for Legged Locomotion	Changsheng Luo et.al.	2511.18857	null
2025-11-24	Robust Long-term Test-Time Adaptation for 3D Human Pose Estimation through Motion Discretization	Yilin Wen et.al.	2511.18851	null
2025-11-24	Neural B-Frame Coding: Tackling Domain Shift Issues with Lightweight Online Motion Resolution Adaptation	Sang NguyenQuang et.al.	2511.18724	null
2025-11-24	AIRHILT: A Human-in-the-Loop Testbed for Multimodal Conflict Detection in Aviation	Omar Garib et.al.	2511.18718	null
2025-11-24	Asynchronous Distributed Multi-Robot Motion Planning Under Imperfect Communication	Ardalan Tajbakhsh et.al.	2511.18703	null
2025-11-23	An Analysis of Constraint-Based Multi-Agent Pathfinding Algorithms	Hannah Lee et.al.	2511.18604	null
2025-11-23	C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction	Kuan Wei Huang et.al.	2511.18559	null
2025-11-23	Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span	Heeseung Yun et.al.	2511.18470	null
2025-11-23	Coherent Multi-Agent Trajectory Forecasting in Team Sports with CausalTraj	Wei Zhen Teoh et.al.	2511.18248	null
2025-11-23	Dreaming Falcon: Physics-Informed Model-Based Reinforcement Learning for Quadcopters	Eashan Vytla et.al.	2511.18243	null
2025-11-23	EgoVITA: Learning to Plan and Verify for Egocentric Video Reasoning	Yogesh Kulkarni et.al.	2511.18242	null
2025-11-22	EgoControl: Controllable Egocentric Video Generation via 3D Full-Body Poses	Enrico Pallotta et.al.	2511.18173	null
2025-11-22	Time-aware Motion Planning in Dynamic Environments with Conformal Prediction	Kaier Liang et.al.	2511.18170	null
2025-11-22	SFHand: A Streaming Framework for Language-guided 3D Hand Forecasting and Embodied Manipulation	Ruicong Liu et.al.	2511.18127	null
2025-11-22	Anti-Jamming based on Null-Steering Antennas and Intelligent UAV Swarm Behavior	Miguel Lourenço et.al.	2511.18086	null
2025-11-22	Plan-X: Instruct Video Generation via Semantic Planning	Lun Huang et.al.	2511.17986	null
2025-11-22	V2X-RECT: An Efficient V2X Trajectory Prediction Framework via Redundant Interaction Filtering and Tracking Error Correction	Xiangyan Kong et.al.	2511.17941	null
2025-11-21	Show Me: Unifying Instructional Image and Video Generation with Diffusion Models	Yujiang Pu et.al.	2511.17839	null
2025-11-21	SM2ITH: Safe Mobile Manipulation with Interactive Human Prediction via Task-Hierarchical Bilevel Model Predictive Control	Francesco D’Orazio et.al.	2511.17798	null
2025-11-21	Target-Bench: Can World Models Achieve Mapless Path Planning with Semantic Targets?	Dingrui Wang et.al.	2511.17792	null
2025-11-21	See, Plan, Cut: MPC-Based Autonomous Volumetric Robotic Laser Surgery with OCT Guidance	Ravi Prakash et.al.	2511.17777	null
2025-11-21	The Potential and Limitations of Vision-Language Models for Human Motion Understanding: A Case Study in Data-Driven Stroke Rehabilitation	Victor Li et.al.	2511.17727	null
2025-11-21	Vision-Motion-Reference Alignment for Referring Multi-Object Tracking via Multi-Modal Large Language Models	Weiyi Lv et.al.	2511.17681	null
2025-11-15	EgoCogNav: Cognition-aware Human Egocentric Navigation	Zhiwen Qiu et.al.	2511.17581	null
2025-11-21	MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments	Zhiyu Huang et.al.	2511.17496	null
2025-11-21	Planning with Sketch-Guided Verification for Physics-Aware Video Generation	Yidong Huang et.al.	2511.17450	null
2025-11-21	Feasibility of Embodied Dynamics Based Bayesian Learning for Continuous Pursuit Motion Control of Assistive Mobile Robots in the Built Environment	Xiaoshan Zhou et.al.	2511.17401	null
2025-11-21	Vector Cost Behavioral Planning for Autonomous Robotic Systems with Contemporary Validation Strategies	Benjamin R. Toaz et.al.	2511.17375	null
2025-11-21	FORWARD: Dataset of a forwarder operating in rough terrain	Mikael Lundbäck et.al.	2511.17318	null
2025-11-21	DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous Driving	Liuhan Yin et.al.	2511.17150	null
2025-11-21	PathAgent: Toward Interpretable Analysis of Whole-slide Pathology Images via Large Language Model-based Agentic Reasoning	Jingyun Chen et.al.	2511.17052	null
2025-11-27	RacketVision: A Multiple Racket Sports Benchmark for Unified Ball and Racket Analysis	Linfeng Dong et.al.	2511.17045	null
2025-11-21	MfNeuPAN: Proactive End-to-End Navigation in Dynamic Environments via Direct Multi-Frame Point Constraints	Yiwen Ying et.al.	2511.17013	null
2025-11-20	Flow and Depth Assisted Video Prediction with Latent Transformer	Eliyas Suleyman et.al.	2511.16484	null
2025-11-20	Flow-Aided Flight Through Dynamic Clutters From Point To Motion	Bowen Xu et.al.	2511.16372	null
2025-11-20	Mem-MLP: Real-Time 3D Human Motion Generation from Sparse Inputs	Sinan Mutlu et.al.	2511.16264	null
2025-11-20	SwiTrack: Tri-State Switch for Cross-Modal Object Tracking	Boyue Xu et.al.	2511.16227	null
2025-11-20	FOOTPASS: A Multi-Modal Multi-Agent Tactical Context Dataset for Play-by-Play Action Spotting in Soccer Broadcast Videos	Jeremie Ochin et.al.	2511.16183	null
2025-11-20	Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight	Yi Yang et.al.	2511.16175	null
2025-11-20	VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame Interpolation	Chenyang Wu et.al.	2511.16124	null
2025-11-19	NMPC-based Motion Planning with Adaptive Weighting for Dynamic Object Interception	Chen Cai et.al.	2511.15532	null
2025-11-19	*RRTformer: Environment-Aware Sampling-Based Motion Planning using Transformer**	Mingyang Feng et.al.	2511.15414	null
2025-11-19	Symmetry-Breaking in Multi-Agent Navigation: Winding Number-Aware MPC with a Learned Topological Strategy	Tomoki Nakao et.al.	2511.15239	null
2025-11-19	MMCM: Multimodality-aware Metric using Clustering-based Modes for Probabilistic Human Motion Prediction	Kyotaro Tokoro et.al.	2511.15179	null
2025-11-19	SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection	Chun-Jung Lin et.al.	2511.15153	null
2025-11-24	Reasoning via Video: The First Evaluation of Video Models’ Reasoning Abilities through Maze-Solving Tasks	Cheng Yang et.al.	2511.15065	null
2025-11-19	Lie Group Control Architectures for UAVs: a Comparison of SE2(3)-Based Approaches in Simulation and Hardware	Dimitria Silveria et.al.	2511.15023	null
2025-11-25	SVBRD-LLM: Self-Verifying Behavioral Rule Discovery for Autonomous Vehicle Identification	Xiangyu Li et.al.	2511.14977	null
2025-11-18	Z-Merge: Multi-Agent Reinforcement Learning for On-Ramp Merging with Zone-Specific V2X Traffic Information	Yassine Ibork et.al.	2511.14910	null
2025-11-18	MRI Embeddings Complement Clinical Predictors for Cognitive Decline Modeling in Alzheimer’s Disease Cohorts	Nathaniel Putera et.al.	2511.14601	null
2025-11-18	Perception-aware Exploration for Consumer-grade UAVs	Svetlana Seliunina et.al.	2511.14393	null
2025-11-18	MA-SLAM: Active SLAM in Large-Scale Unknown Environment using Map Aware Deep Reinforcement Learning	Yizhen Yin et.al.	2511.14330	null
2025-11-18	Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion Prediction	Juncheng Hu et.al.	2511.14237	null
2025-11-19	PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation	Xiangyu Li et.al.	2511.14185	null
2025-11-26	Passive Dementia Screening via Facial Temporal Micro-Dynamics Analysis of In-the-Wild Talking-Head Video	Filippo Cenacchi et.al.	2511.13802	null
2025-11-17	Descriptor: Distance-Annotated Traffic Perception Question Answering (DTPQA)	Nikos Theodoridis et.al.	2511.13397	null
2025-11-17	DAP: A Discrete-token Autoregressive Planner for Autonomous Driving	Bowen Ye et.al.	2511.13306	null
2025-11-17	Collision-Free Navigation of Mobile Robots via Quadtree-Based Model Predictive Control	Osama Al Sheikh Ali et.al.	2511.13188	null
2025-11-17	PlugTrack: Multi-Perceptive Motion Analysis for Adaptive Fusion in Multi-Object Tracking	Seungjae Kim et.al.	2511.13105	null
2025-11-17	Uni-Inter: Unifying 3D Human Motion Synthesis Across Diverse Interaction Contexts	Sheng Liu et.al.	2511.13032	null
2025-11-19	Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos	Taiyi Su et.al.	2511.12882	null
2025-12-05	Uni-Hand: Universal Hand Motion Forecasting in Egocentric Views	Junyi Ma et.al.	2511.12878	null
2025-11-16	DR. Nav: Semantic-Geometric Representations for Proactive Dead-End Recovery and Navigation	Vignesh Rajagopal et.al.	2511.12778	null
2025-11-16	TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction	Yukuo Ma et.al.	2511.12578	null
2025-11-15	Learning Time in Static Classifiers	Xi Ding et.al.	2511.12321	null
2025-11-18	SocialNav-Map: Dynamic Mapping with Human Trajectory Prediction for Zero-Shot Social Navigation	Lingfeng Zhang et.al.	2511.12232	null
2025-11-15	Game-Theoretic Safe Multi-Agent Motion Planning with Reachability Analysis for Dynamic and Uncertain Environments (Extended Version)	Wenbin Mai et.al.	2511.12160	null
2025-11-15	RadarMP: Motion Perception for 4D mmWave Radar in Autonomous Driving	Ruiqi Cheng et.al.	2511.12117	null
2025-11-15	Decoupled Action Head: Confining Task Knowledge to Conditioning Layers	Jian Zhou et.al.	2511.12101	null
2025-11-15	MovSemCL: Movement-Semantics Contrastive Learning for Trajectory Similarity	Zhichen Lai et.al.	2511.12061	null
2025-11-15	SBAMP: Sampling Based Adaptive Motion Planning	Anh-Quan Pham et.al.	2511.12022	null
2025-11-14	SOTFormer: A Minimal Transformer for Unified Object Tracking and Trajectory Prediction	Zhongping Dong et.al.	2511.11824	null
2025-11-14	Who Moved My Distribution? Conformal Prediction for Interactive Multi-Agent Systems	Allen Emmanuel Binny et.al.	2511.11567	null
2025-11-14	Scalable Coverage Trajectory Synthesis on GPUs as Statistical Inference	Max M. Sun et.al.	2511.11514	null
2025-12-09	Humanoid Whole-Body Badminton via Multi-Stage Reinforcement Learning	Chenhao Liu et.al.	2511.11218	null
2025-11-14	RealisticDreamer: Guidance Score Distillation for Few-shot Gaussian Splatting	Ruocheng Wu et.al.	2511.11213	null
2025-11-28	Reverberation: Learning the Latencies Before Forecasting Trajectories	Conghao Wong et.al.	2511.11164	null
2025-11-14	Hindsight Distillation Reasoning with Knowledge Encouragement Preference for Knowledge-based Visual Question Answering	Yu Zhao et.al.	2511.11132	null
2025-11-14	AdaptPNP: Integrating Prehensile and Non-Prehensile Skills for Adaptive Robotic Manipulation	Jinxuan Zhu et.al.	2511.11052	null
2025-11-24	Autonomous Vehicle Path Planning by Searching With Differentiable Simulation	Asen Nachkov et.al.	2511.11043	null
2025-11-14	Collaborative Multi-Robot Non-Prehensile Manipulation via Flow-Matching Co-Generation	Yorai Shaoul et.al.	2511.10874	null
2025-11-14	WetExplorer: Automating Wetland Greenhouse-Gas Surveys with an Autonomous Mobile Robot	Jose Vasquez et.al.	2511.10864	null
2025-11-13	Safe Planning in Interactive Environments via Iterative Policy Updates and Adversarially Robust Conformal Prediction	Omid Mirzaeedodangeh et.al.	2511.10586	null
2025-11-13	LongComp: Long-Tail Compositional Zero-Shot Generalization for Robust Trajectory Prediction	Benjamin Stoler et.al.	2511.10411	null
2025-11-13	nuPlan-R: A Closed-Loop Planning Benchmark for Autonomous Driving via Reactive Multi-Agent Simulation	Mingxing Peng et.al.	2511.10403	null
2025-11-13	VISTA: A Vision and Intent-Aware Social Attention Framework for Multi-Agent Trajectory Prediction	Stephane Da Silva Martins et.al.	2511.10203	null
2025-11-13	Physics-informed Machine Learning for Static Friction Modeling in Robotic Manipulators Based on Kolmogorov-Arnold Networks	Yizheng Wang et.al.	2511.10079	null
2025-11-13	Mitigating Error Accumulation in Co-Speech Motion Generation via Global Rotation Diffusion and Multi-Level Constraints	Xiangyue Zhang et.al.	2511.10076	null
2025-11-13	Trapped by Their Own Light: Deployable and Stealth Retroreflective Patch Attacks on Traffic Sign Recognition Systems	Go Tsuruoka et.al.	2511.10050	null
2025-11-13	AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models	Xinyi Wang et.al.	2511.10017	null
2025-11-13	Debiased Dual-Invariant Defense for Adversarially Robust Person Re-Identification	Yuhang Zhou et.al.	2511.09933	null
2025-11-13	Provably Safe Stein Variational Clarity-Aware Informative Planning	Kaleb Ben Naveed et.al.	2511.09836	null
2025-11-12	A Robust Task-Level Control Architecture for Learned Dynamical Systems	Eshika Pathak et.al.	2511.09790	null
2025-11-12	Social LSTM with Dynamic Occupancy Modeling for Realistic Pedestrian Trajectory Prediction	Ahmed Alia et.al.	2511.09735	null
2025-11-12	A Shared-Autonomy Construction Robotic System for Overhead Works	David Minkwan Kim et.al.	2511.09695	null
2025-11-12	WMPO: World Model-based Policy Optimization for Vision-Language-Action Models	Fangqi Zhu et.al.	2511.09515	null
2025-11-12	DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation	Jerrin Bright et.al.	2511.09502	null
2025-11-12	CoRL-MPPI: Enhancing MPPI With Learnable Behaviours For Efficient And Provably-Safe Multi-Robot Collision Avoidance	Stepan Dergachev et.al.	2511.09331	null
2025-11-12	FSampler: Training Free Acceleration of Diffusion Sampling via Epsilon Extrapolation	Michael A. Vladimir et.al.	2511.09180	null
2025-11-12	D-AWSIM: Distributed Autonomous Driving Simulator for Dynamic Map Generation Framework	Shunsuke Ito et.al.	2511.09080	null
2025-11-12	USF-Net: A Unified Spatiotemporal Fusion Network for Ground-Based Remote Sensing Cloud Image Sequence Extrapolation	Penghui Niu et.al.	2511.09045	null
2025-11-12	UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving	Ziyi Song et.al.	2511.09013	null
2025-11-12	Neural B-frame Video Compression with Bi-directional Reference Harmonization	Yuxi Liu et.al.	2511.08938	null
2025-11-12	A Shared Control Framework for Mobile Robots with Planning-Level Intention Prediction	Jinyu Zhang et.al.	2511.08912	null
2025-11-11	Dual-Arm Whole-Body Motion Planning: Leveraging Overlapping Kinematic Chains	Richard Cheng et.al.	2511.08778	null
2025-11-10	Predict and Resist: Long-Term Accident Anticipation under Sensor Noise	Xingcheng Liu et.al.	2511.08640	null
2025-11-11	X-IONet: Cross-Platform Inertial Odometry Network with Dual-Stage Attention	Dehan Shen et.al.	2511.08277	null
2025-11-11	Prioritizing Perception-Guided Self-Supervision: A New Paradigm for Causal Modeling in End-to-End Autonomous Driving	Yi Huang et.al.	2511.08214	null
2025-11-11	Effective Game-Theoretic Motion Planning via Nested Search	Avishav Engle et.al.	2511.08001	null
2025-11-11	Occlusion-Aware Ground Target Search by a UAV in an Urban Environment	Collin Hague et.al.	2511.07822	null
2025-11-11	Virtual Traffic Lights for Multi-Robot Navigation: Decentralized Planning with Centralized Conflict Resolution	Sagar Gupta et.al.	2511.07811	null
2025-11-11	High-Altitude Balloon Station-Keeping with First Order Model Predictive Control	Myles Pasetsky et.al.	2511.07761	null
2025-11-11	ViPRA: Video Prediction for Robot Actions	Sandeep Routray et.al.	2511.07732	null
2025-11-11	LLM-GROP: Visually Grounded Robot Task and Motion Planning with Large Language Models	Xiaohan Zhang et.al.	2511.07727	null
2025-11-10	FlowFeat: Pixel-Dense Embedding of Motion Profiles	Nikita Araslanov et.al.	2511.07696	null
2025-11-10	Exact Smooth Reformulations for Trajectory Optimization Under Signal Temporal Logic Specifications	Shaohang Han et.al.	2511.07375	null
2025-11-10	PlanT 2.0: Exposing Biases and Structural Flaws in Closed-Loop Driving	Simon Gerstenecker et.al.	2511.07292	null
2025-11-10	Dynamics-Decoupled Trajectory Alignment for Sim-to-Real Transfer in Reinforcement Learning for Autonomous Driving	Thomas Steinecker et.al.	2511.07155	null
2025-11-11	Physics-Informed Deformable Gaussian Splatting: Towards Unified Constitutive Laws for Time-Evolving Material Field	Haoqin Hong et.al.	2511.06299	null
2025-11-08	Fair and Safe: A Real-Time Hierarchical Control Framework for Intersections	Lei Shi et.al.	2511.05886	null
2025-11-11	Token Is All You Need: Cognitive Planning through Belief-Intent Co-Evolution	Shiyao Sang et.al.	2511.05540	null
2025-11-07	EventFlow: Real-Time Neuromorphic Event-Driven Classification of Two-Phase Boiling Flow Regimes	Sanghyeon Chang et.al.	2511.05467	null
2025-11-07	Shared Latent Representation for Joint Text-to-Audio-Visual Synthesis	Dogucan Yaman et.al.	2511.05432	null
2025-11-07	EveryDayVLA: A Vision-Language-Action Model for Affordable Robotic Manipulation	Samarth Chopra et.al.	2511.05397	null
2025-11-07	Accurate online action and gesture recognition system using detectors and Deep SPD Siamese Networks	Mohamed Sanim Akremi et.al.	2511.05250	null
2025-11-07	Context-aware Learned Mesh-based Simulation via Trajectory-Level Meta-Learning	Philipp Dahlinger et.al.	2511.05234	null
2025-11-07	TAPOM: Task-Space Topology-Guided Motion Planning for Manipulating Elongated Object in Cluttered Environments	Zihao Li et.al.	2511.05052	null
2025-11-07	iFlyBot-VLM Technical Report	Xin Nie et.al.	2511.04976	null
2025-11-06	Conformalized Non-uniform Sampling Strategies for Accelerated Sampling-based Motion Planning	Shubham Natraj et.al.	2511.04835	null
2025-11-06	Unified Multimodal Diffusion Forcing for Forceful Manipulation	Zixuan Huang et.al.	2511.04812	null
2025-11-06	ScheduleStream: Temporal Planning with Samplers for GPU-Accelerated Multi-Arm Task and Motion Planning & Scheduling	Caelan Garrett et.al.	2511.04758	null
2025-11-06	X-Diffusion: Training Diffusion Policies on Cross-Embodiment Human Demonstrations	Maximus A. Pace et.al.	2511.04671	null
2025-11-06	Temporal Action Selection for Action Chunking	Yueyang Weng et.al.	2511.04421	null
2025-11-06	Integrating Ergonomics and Manipulability for Upper Limb Postural Optimization in Bimanual Human-Robot Collaboration	Chenzui Li et.al.	2511.04009	null
2025-11-06	Dynamic Shape Control of Soft Robots Enabled by Data-Driven Model Reduction	Iman Adibnazari et.al.	2511.03931	null
2025-11-05	Disentangled Concepts Speak Louder Than Words:Explainable Video Action Recognition	Jongseo Lee et.al.	2511.03725	null
2025-11-05	Motion Planning Under Temporal Logic Specifications In Semantically Unknown Environments	Azizollah Taheri et.al.	2511.03652	null
2025-11-05	Flying Robotics Art: ROS-based Drone Draws the Record-Breaking Mural	Andrei A. Korigodskii et.al.	2511.03651	null
2025-11-05	Manifold-constrained Hamilton-Jacobi Reachability Learning for Decentralized Multi-Agent Motion Planning	Qingyi Chen et.al.	2511.03591	null
2025-11-05	OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera	Hao Shi et.al.	2511.03571	null
2025-11-05	Finetuning-Free Personalization of Text to Image Generation via Hypernetworks	Sagar Shrestha et.al.	2511.03156	null
2025-11-04	Many-vs-Many Missile Guidance via Virtual Targets	Marc Schneider et.al.	2511.02526	null
2025-11-04	Self-Supervised Moving Object Segmentation of Sparse and Noisy Radar Point Clouds	Leon Schwarzer et.al.	2511.02395	null
2025-11-10	Whole-body motion planning and safety-critical control for aerial manipulation	Lin Yang et.al.	2511.02342	null
2025-11-04	Estimation of Segmental Longitudinal Strain in Transesophageal Echocardiography by Deep Learning	Anders Austlid Taskén et.al.	2511.02210	null
2025-11-03	TACO: Trajectory-Aware Controller Optimization for Quadrotors	Hersh Sanghvi et.al.	2511.02060	null
2025-11-03	Stein-based Optimization of Sampling Distributions in Model Predictive Path Integral Control	Jace Aldrich et.al.	2511.02015	null
2025-11-01	iFlyBot-VLA Technical Report	Yuan Zhang et.al.	2511.01914	null
2025-11-03	Fractional Diffusion Bridge Models	Gabriel Nobis et.al.	2511.01795	null
2025-11-03	UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs	Zhe Liu et.al.	2511.01768	null
2025-11-03	Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process	Jiayi Chen et.al.	2511.01718	null
2025-11-03	MO-SeGMan: Rearrangement Planning Framework for Multi Objective Sequential and Guided Manipulation in Constrained Environments	Cankut Bora Tuncer et.al.	2511.01476	null
2025-11-03	FoldPath: End-to-End Object-Centric Motion Generation via Modulated Implicit Paths	Paolo Rabino et.al.	2511.01407	null
2025-11-04	Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects	Jiawei Wang et.al.	2511.01294	null
2025-11-03	MoSa: Motion Generation with Scalable Autoregressive Modeling	Mengyuan Liu et.al.	2511.01200	null
2025-11-02	SLAP: Shortcut Learning for Abstract Planning	Y. Isabel Liu et.al.	2511.01107	null
2025-11-02	Occlusion-Aware Diffusion Model for Pedestrian Intention Prediction	Yu Liu et.al.	2511.00858	null
2025-11-02	Real-Time Learning of Predictive Dynamic Obstacle Models for Robotic Motion Planning	Stella Kombo et.al.	2511.00814	null
2025-11-01	Descriptive Model-based Learning and Control for Bipedal Locomotion	Suraj Kumar et.al.	2511.00512	null
2025-11-01	Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models	Panwang Pan et.al.	2511.00503	null
2025-10-31	X-TRACK: Physics-Aware xLSTM for Realistic Vehicle Trajectory Prediction	Aanchal Rajesh Chugh et.al.	2511.00266	null
2025-10-31	End-to-End Dexterous Arm-Hand VLA Policies via Shared Autonomy: VR Teleoperation Augmented by Autonomous Hand VLA Policy for Efficient Data Collection	Yu Cui et.al.	2511.00139	null
2025-10-30	Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail	NVIDIA et.al.	2511.00088	null
2025-10-31	Object-IR: Leveraging Object Consistency and Mesh Deformation for Self-Supervised Image Retargeting	Tianli Liao et.al.	2510.27236	null
2025-10-31	GUI-Rise: Structured Reasoning and History Summarization for GUI Navigation	Tao Liu et.al.	2510.27210	null
2025-10-30	SEE4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting	Dongyue Lu et.al.	2510.26796	null
2025-10-30	Clone Deterministic 3D Worlds with Geometrically-Regularized World Models	Zaishuo Xia et.al.	2510.26782	null
2025-10-30	Process Integrated Computer Vision for Real-Time Failure Prediction in Steel Rolling Mill	Vaibhav Kurrey et.al.	2510.26684	null
2025-10-30	Hybrid Consistency Policy: Decoupling Multi-Modal Diversity and Real-Time Efficiency in Robotic Manipulation	Qianyou Zhao et.al.	2510.26670	null
2025-10-30	Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments	Xiaoyi He et.al.	2510.26646	null
2025-10-30	CorVS: Person Identification via Video Trajectory-Sensor Correspondence in a Real-World Warehouse	Kazuma Kano et.al.	2510.26369	null
2025-10-30	Kinodynamic Task and Motion Planning using VLM-guided and Interleaved Sampling	Minseo Kwon et.al.	2510.26139	null
2025-11-13	WOD-E2E: Waymo Open Dataset for End-to-End Driving in Challenging Long-tail Scenarios	Runsheng Xu et.al.	2510.26125	null
2025-10-29	Robotic Assistant: Completing Collaborative Tasks with Dexterous Vision-Language-Action Models	Boshi An et.al.	2510.25713	null
2025-10-29	RegionE: Adaptive Region-Aware Generation for Efficient Image Editing	Pengtao Chen et.al.	2510.25590	null
2025-10-29	Using VLM Reasoning to Constrain Task and Motion Planning	Muyang Yan et.al.	2510.25548	null
2025-10-28	Defect Mitigation for Robot Arm-based Additive Manufacturing Utilizing Intelligent Control and IOT	Matsive Ali et.al.	2510.24994	null
2025-10-28	Global-State-Free Obstacle Avoidance for Quadrotor Control in Air-Ground Cooperation	Baozhe Zhang et.al.	2510.24315	null
2025-10-28	Balanced Collaborative Exploration via Distributed Topological Graph Voronoi Partition	Tianyi Ding et.al.	2510.24067	null
2025-10-27	Full-Dynamics Real-Time Nonlinear Model Predictive Control of Heavy-Duty Hydraulic Manipulator for Trajectory Tracking Tasks	Alvaro Paz et.al.	2510.23386	null
2025-10-27	Payload trajectory tracking control for aerial transportation systems with cable length online optimization	Hai Yu et.al.	2510.23296	null
2025-10-27	Workspace Registration and Collision Detection for Industrial Robotics Applications	Klaus Zauner et.al.	2510.23227	null
2025-10-27	Planning Oriented Integrated Sensing and Communication	Xibin Jin et.al.	2510.23021	null
2025-10-26	Learning Neural Observer-Predictor Models for Limb-level Sampling-based Locomotion Planning	Abhijeet M. Kulkarni et.al.	2510.22789	null
2025-10-25	TrajGATFormer: A Graph-Based Transformer Approach for Worker and Obstacle Trajectory Prediction in Off-site Construction Environments	Mohammed Alduais et.al.	2510.22205	null
2025-10-17	Real-Time QP Solvers: A Concise Review and Practical Guide Towards Legged Robots	Van Nam Dinh et.al.	2510.21773	null
2025-09-30	A phase-aware AI car-following model for electric vehicles with adaptive cruise control: Development and validation using real-world data	Yuhui Liu et.al.	2510.21735	null
2025-10-24	Load-bearing Assessment for Safe Locomotion of Quadruped Robots on Collapsing Terrain	Vivian S. Medeiros et.al.	2510.21369	null
2025-10-23	Dino-Diffusion Modular Designs Bridge the Cross-Domain Gap in Autonomous Parking	Zixuan Wu et.al.	2510.20335	null
2025-10-22	OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation	Guowei Xu et.al.	2510.19789	null
2025-10-22	ProTerrain: Probabilistic Physics-Informed Rough Terrain World Modeling	Golnaz Raja et.al.	2510.19364	null
2025-10-21	Motion Planning and Control of an Overactuated 4-Wheel Drive with Constrained Independent Steering	Shiyu Liu et.al.	2510.19054	null
2025-10-21	$\nabla$ -SDF: Learning Euclidean Signed Distance Functions Online with Gradient-Augmented Octree Interpolation and Neural Residual	Zhirui Dai et.al.	2510.18999	null
2025-10-21	SHRUMS: Sensor Hallucination for Real-time Underwater Motion Planning with a Compact 3D Sonar	Susheel Vadakkekuruppath et.al.	2510.18996	null
2025-10-21	MPC-based motion planning for non-holonomic systems in non-convex domains	Matthias Lorenzen et.al.	2510.18402	null
2025-10-20	SPACeR: Self-Play Anchoring with Centralized Reference Models	Wei-Jer Chang et.al.	2510.18060	null
2025-10-20	Can Image-To-Video Models Simulate Pedestrian Dynamics?	Aaron Appelle et.al.	2510.17731	null
2025-10-20	Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models	Katie Luo et.al.	2510.17274	null
2025-10-28	SimpleVSF: VLM-Scoring Fusion for Trajectory Prediction of End-to-End Autonomous Driving	Peiru Zheng et.al.	2510.17191	null
2025-10-19	T3 Planner: A Self-Correcting LLM Framework for Robotic Motion Planning with Temporal Logic	Jia Li et.al.	2510.16767	null
2025-10-23	HumanCM: One Step Human Motion Prediction	Liu Haojie et.al.	2510.16709	null
2025-10-18	Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models	Chenrui Tie et.al.	2510.16344	null
2025-10-18	SPOT: Sensing-augmented Trajectory Planning via Obstacle Threat Modeling	Chi Zhang et.al.	2510.16308	null
2025-10-16	Requirement Identification for Traffic Simulations in Driving Simulators	Sven Tarlowski et.al.	2510.14653	null
2025-10-16	Accelerated Multi-Modal Motion Planning Using Context-Conditioned Diffusion Models	Edward Sandra et.al.	2510.14615	null
2025-10-15	Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning	Gaoyuan Liu et.al.	2510.14065	null
2025-10-15	Physics-Informed Neural Network Modeling of Vehicle Collision Dynamics in Precision Immobilization Technique Maneuvers	Yangye Jiang et.al.	2510.13461	null
2025-10-23	HYPE: Hybrid Planning with Ego Proposal-Conditioned Predictions	Hang Yu et.al.	2510.12733	null
2025-10-14	A Task-Efficient Reinforcement Learning Task-Motion Planner for Safe Human-Robot Cooperation	Gaoyuan Liu et.al.	2510.12477	null
2025-10-14	PolygMap: A Perceptive Locomotion Framework for Humanoid Robot Stair Climbing	Bingquan Li et.al.	2510.12346	null
2025-10-13	NaviGait: Navigating Dynamically Feasible Gait Libraries using Deep Reinforcement Learning	Neil C. Janwani et.al.	2510.11542	null
2025-10-13	IntersectioNDE: Learning Complex Urban Traffic Dynamics based on Interaction Decoupling Strategy	Enli Lin et.al.	2510.11534	null
2025-10-13	MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps	Jiahui Lei et.al.	2510.11107	null
2025-10-13	Unveiling Uncertainty-Aware Autonomous Cooperative Learning Based Planning Strategy	Shiyao Zhang et.al.	2510.11041	null
2025-10-13	Into the Unknown: Towards using Generative Models for Sampling Priors of Environment Uncertainty for Planning in Configuration Spaces	Subhransu S. Bhattacharjee et.al.	2510.11014	null
2025-10-12	Controllable Generative Trajectory Prediction via Weak Preference Alignment	Yongxi Cao et.al.	2510.10731	null
2025-10-12	Reinforcement Learning-based Dynamic Adaptation for Sampling-Based Motion Planning in Agile Autonomous Driving	Alexander Langmann et.al.	2510.10567	null
2025-10-12	Align2Act: Instruction-Tuned Models for Human-Aligned Autonomous Driving	Kanishkha Jaisankar et.al.	2510.10503	null
2025-10-11	Are Video Models Emerging as Zero-Shot Learners and Reasoners in Medical Imaging?	Yuxiang Lai et.al.	2510.10254	null
2025-10-11	Beyond ADE and FDE: A Comprehensive Evaluation Framework for Safety-Critical Prediction in Multi-Agent Autonomous Driving Scenarios	Feifei Liu et.al.	2510.10086	null
2025-10-10	Parametrized Topological Complexity for a Multi-Robot System with Variable Tasks	Gopal Chandra Dutta et.al.	2510.09323	null
2025-10-10	Obstacle Avoidance using Dynamic Movement Primitives and Reinforcement Learning	Dominik Urbaniak et.al.	2510.09254	null
2025-10-09	Adaptive Motion Planning via Contact-Based Intent Inference for Human-Robot Collaboration	Jiurun Song et.al.	2510.08811	null
2025-10-09	Whole Body Model Predictive Control for Spin-Aware Quadrupedal Table Tennis	David Nguyen et.al.	2510.08754	null
2025-10-09	GM3: A General Physical Model for Micro-Mobility Vehicles	Grace Cai et.al.	2510.07807	null
2025-10-08	Inspection Planning Primitives with Implicit Models	Jingyang You et.al.	2510.07611	null
2025-10-08	DecompGAIL: Learning Realistic Traffic Behaviors with Decomposed Multi-Agent Generative Adversarial Imitation Learning	Ke Guo et.al.	2510.06913	null
2025-10-03	Does Physics Knowledge Emerge in Frontier Models?	Ieva Bagdonaviciute et.al.	2510.06251	null
2025-10-07	Learning to Crawl: Latent Model-Based Reinforcement Learning for Soft Robotic Adaptive Locomotion	Vaughn Gzenda et.al.	2510.05957	null
2025-10-07	Stable Robot Motions on Manifolds: Learning Lyapunov-Constrained Neural Manifold ODEs	David Boetius et.al.	2510.05707	null
2025-10-06	Efficient Probabilistic Planning with Maximum-Coverage Distributionally Robust Backward Reachable Trees	Alex Rose et.al.	2510.04807	null
2025-10-14	Hands-Free Heritage: Automated 3D Scanning for Cultural Heritage Digitization	Javed Ahmad et.al.	2510.04781	null
2025-10-06	Building Gradient by Gradient: Decentralised Energy Functions for Bimanual Robot Assembly	Alexander L. Mitchell et.al.	2510.04696	null
2025-10-06	MobRT: A Digital Twin-Based Framework for Scalable Learning in Mobile Manipulation	Yilin Mei et.al.	2510.04592	null
2025-10-05	Diffusion^2: Dual Diffusion Model with Uncertainty-Aware Adaptive Noise for Momentary Trajectory Prediction	Yuhao Luo et.al.	2510.04365	null
2025-10-05	Integrated Planning and Control on Manifolds: Factor Graph Representation and Toolkit	Peiwen Yang et.al.	2510.04278	null
2025-10-04	COVER:COverage-VErified Roadmaps for Fixed-time Motion Planning in Continuous Semi-Static Environments	Niranjan Kumar Ilampooranan et.al.	2510.03875	null
2025-10-04	Trajectory prediction for heterogeneous agents: A performance analysis on small and imbalanced datasets	Tiago Rodrigues de Almeida et.al.	2510.03776	null
2025-10-03	Shape-Space Graphs: Fast and Collision-Free Path Planning for Soft Robots	Carina Veil et.al.	2510.03547	null
2025-10-03	Distributed Connectivity Maintenance and Recovery for Quadrotor Motion Planning	Yutong Wang et.al.	2510.03504	null
2025-10-03	Warm-Starting Optimization-Based Motion Planning for Robotic Manipulators via Point Cloud-Conditioned Flow Matching	Sibo Tian et.al.	2510.03460	null
2025-09-30	A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety	Shucheng Zhang et.al.	2510.03314	null
2025-10-06	Long-Term Human Motion Prediction Using Spatio-Temporal Maps of Dynamics	Yufei Zhu et.al.	2510.03031	null
2025-10-03	Point Cloud-Based Control Barrier Functions for Model Predictive Control in Safety-Critical Navigation of Autonomous Mobile Robots	Faduo Liang et.al.	2510.02885	null
2025-10-03	A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios	Ruining Yang et.al.	2510.02627	null
2025-10-02	SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting	Sung-Yeon Park et.al.	2510.02469	null
2025-10-02	ERUPT: An Open Toolkit for Interfacing with Robot Motion Planners in Extended Reality	Isaac Ngui et.al.	2510.02464	null
2025-10-02	Symskill: Symbol and Skill Co-Invention for Data-Efficient and Real-Time Long-Horizon Manipulation	Yifei Simon Shao et.al.	2510.01661	null
2025-10-01	Safe Motion Planning and Control Using Predictive and Adaptive Barrier Methods for Autonomous Surface Vessels	Alejandro Gonzalez-Garcia et.al.	2510.01357	null
2025-10-01	From Seeing to Predicting: A Vision-Language Framework for Trajectory Forecasting and Controlled Video Generation	Fan Yang et.al.	2510.00806	null
2025-10-01	From Human Hands to Robot Arms: Manipulation Skills Transfer via Trajectory Alignment	Han Zhou et.al.	2510.00491	null
2025-10-01	Conflict-Based Search as a Protocol: A Multi-Agent Motion Planning Protocol for Heterogeneous Agents, Solvers, and Independent Tasks	Rishi Veerapaneni et.al.	2510.00425	null
2025-10-01	EgoTraj-Bench: Towards Robust Trajectory Prediction Under Ego-view Noisy Observations	Jiayi Liu et.al.	2510.00405	null
2025-09-30	A Systematic Study of Large Language Models for Task and Motion Planning With PDDLStream	Jorge Mendez-Mendez et.al.	2510.00182	null
2025-10-03	Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving	Sheng Yang et.al.	2510.00060	null
2025-09-30	The Trajectory Bundle Method: Unifying Sequential-Convex Programming and Sampling-Based Trajectory Optimization	Kevin Tracy et.al.	2509.26575	null
2025-09-30	Learning from Hallucinating Critical Points for Navigation in Dynamic Environments	Saad Abdul Ghani et.al.	2509.26513	null
2025-09-30	Kinodynamic Motion Planning for Mobile Robot Navigation across Inconsistent World Models	Eric R. Damm et.al.	2509.26339	null
2025-09-30	Hierarchical Diffusion Motion Planning with Task-Conditioned Uncertainty-Aware Priors	Amelie Minji Kim et.al.	2509.25685	null
2025-09-29	Parallel Heuristic Search as Inference for Actor-Critic Reinforcement Learning Models	Hanlan Yang et.al.	2509.25402	null
2025-09-29	SRMP: Search-Based Robot Motion Planning Library	Itamar Mishani et.al.	2509.25352	null
2025-09-29	Path Diffuser: Diffusion Model for Data-Driven Traffic Simulator	Da Saem Lee et.al.	2509.24995	null
2025-09-29	Trajectory Prediction via Bayesian Intention Inference under Unknown Goals and Kinematics	Shunan Yin et.al.	2509.24928	null
2025-09-29	Learning to Sample: Reinforcement Learning-Guided Sampling for Autonomous Vehicle Motion Planning	Korbinian Moller et.al.	2509.24313	null
2025-09-29	Towards Tighter Convex Relaxation of Mixed-integer Programs: Leveraging Logic Network Flow for Task and Motion Planning	Xuan Lin et.al.	2509.24235	null
2025-09-29	ViReSkill: Vision-Grounded Replanning with Skill Memory for LLM-Based Planning in Lifelong Robot Learning	Tomoyuki Kagaya et.al.	2509.24219	null
2025-09-29	Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse-view Videos	Yingdong Hu et.al.	2509.24209	null
2025-09-29	A Novel Model for 3D Motion Planning for a Generalized Dubins Vehicle with Pitch and Yaw Rate Constraints	Deepak Prakash Kumar et.al.	2509.24143	null
2025-09-28	Hazy Pedestrian Trajectory Prediction via Physical Priors and Graph-Mamba	Jian Chen et.al.	2509.24020	null
2025-09-28	Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning	Muleilan Pei et.al.	2509.23993	null
2025-09-28	VFSI: Validity First Spatial Intelligence for Constraint-Guided Traffic Diffusion	Kargi Chauhan et.al.	2509.23971	null
2025-09-28	DA-MMP: Learning Coordinated and Accurate Throwing with Dynamics-Aware Motion Manifold Primitives	Chi Chu et.al.	2509.23721	null
2025-09-27	Distributed Multi-Robot Multi-Target Simultaneous Search and Tracking in an Unknown Non-convex Environment	Jun Chen et.al.	2509.23308	null
2025-09-26	Empart: Interactive Convex Decomposition for Converting Meshes to Parts	Brandon Vu et.al.	2509.22847	null
2025-09-26	Towards Developing Standards and Guidelines for Robot Grasping and Manipulation Pipelines in the COMPARE Ecosystem	Huajing Zhao et.al.	2509.22801	null
2025-09-26	Self-driving cars: Are we there yet?	Merve Atasever et.al.	2509.22754	null
2025-10-17	An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment	Xiaoyun Qiu et.al.	2509.22550	null
2025-09-26	An Adaptive ICP LiDAR Odometry Based on Reliable Initial Pose	Qifeng Wang et.al.	2509.22058	null
2025-09-25	DroneFL: Federated Learning for Multi-UAV Visual Target Tracking	Xiaofan Yu et.al.	2509.21523	null
2025-09-25	Multi-Robot Vision-Based Task and Motion Planning for EV Battery Disassembly and Sorting	Abdelaziz Shaarawy et.al.	2509.21020	null
2025-09-24	BBoE: Leveraging Bundle of Edges for Kinodynamic Bidirectional Motion Planning	Srikrishna Bangalore Raghu et.al.	2509.20333	null
2025-09-24	Parse-Augment-Distill: Learning Generalizable Bimanual Visuomotor Policies from Single Human Video	Georgios Tziafas et.al.	2509.20286	null
2025-09-23	Look as You Leap: Planning Simultaneous Motion and Perception for High-DOF Robots	Qingxi Meng et.al.	2509.19610	null
2025-09-23	Agentic Scene Policies: Unifying Space, Semantics, and Affordances for Robot Action	Sacha Morin et.al.	2509.19571	null
2025-09-23	Distributionally Robust Safe Motion Planning with Contextual Information	Kaizer Rahaman et.al.	2509.18666	null
2025-09-23	PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving	Chengran Yuan et.al.	2509.18609	null
2025-09-22	BlurBall: Joint Ball and Motion Blur Estimation for Table Tennis Ball Tracking	Thomas Gossard et.al.	2509.18387	null
2025-09-22	Haptic Communication in Human-Human and Human-Robot Co-Manipulation	Katherine H. Allen et.al.	2509.18327	null
2025-09-22	SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model	Xiao Zhou et.al.	2509.17850	null
2025-09-22	Learning Dexterous Manipulation with Quantized Hand State	Ying Feng et.al.	2509.17450	null
2025-09-22	Fast Trajectory Planner with a Reinforcement Learning-based Controller for Robotic Manipulators	Yongliang Wang et.al.	2509.17381	null
2025-09-21	CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving	Ruiguo Zhong et.al.	2509.17080	null
2025-09-19	Dynamic Objects Relocalization in Changing Environments with Flow Matching	Francesco Argenziano et.al.	2509.16398	null
2025-09-19	AdaSports-Traj: Role- and Domain-Aware Adaptation for Multi-Agent Trajectory Modeling in Sports	Yi Xu et.al.	2509.16095	null
2025-09-19	CoPAD : Multi-source Trajectory Fusion and Cooperative Trajectory Prediction with Anchor-oriented Decoder in V2X Scenarios	Kangyu Wu et.al.	2509.15984	null
2025-09-19	Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution	Chang Soo Lim et.al.	2509.15781	null
2025-09-19	ORB: Operating Room Bot, Automating Operating Room Logistics through Mobile Manipulation	Jinkai Qiu et.al.	2509.15600	null
2025-09-18	Trust-Aware Embodied Bayesian Persuasion for Mixed-Autonomy	Shaoting Peng et.al.	2509.15404	null
2025-09-18	Out-of-Sight Trajectories: Tracking, Fusion, and Prediction	Haichao Zhang et.al.	2509.15219	null
2025-09-17	FlowDrive: Energy Flow Field for End-to-End Autonomous Driving	Hao Jiang et.al.	2509.14303	null
2025-09-17	Language Conditioning Improves Accuracy of Aircraft Goal Prediction in Untowered Airspace	Sundhar Vinodh Sangeetha et.al.	2509.14063	null
2025-09-17	Repulsive Trajectory Modification and Conflict Resolution for Efficient Multi-Manipulator Motion Planning	Junhwa Hong et.al.	2509.13882	null
2025-09-17	CDFlow: Generative Gradient Flows for Configuration Space Distance Fields via Neural ODEs	Mengzhu Li et.al.	2509.13771	null
2025-09-16	Dynamic Aware: Adaptive Multi-Mode Out-of-Distribution Detection for Trajectory Prediction in Autonomous Vehicles	Tongfei Guo et.al.	2509.13577	null
2025-09-16	Trajectory Tracking with Reachability-Guided Quadratic Programming and Freeze-Resume	Hossein Gholampour et.al.	2509.13501	null
2025-09-16	Weakly and Self-Supervised Class-Agnostic Motion Prediction for Autonomous Driving	Ruibo Li et.al.	2509.13116	null
2025-09-16	Bridging Perception and Planning: Towards End-to-End Planning for Signal Temporal Logic Tasks	Bowen Ye et.al.	2509.12813	null
2025-09-15	DYNAMO: Dependency-Aware Deep Learning Framework for Articulated Assembly Motion Prediction	Mayank Patel et.al.	2509.12430	null
2025-09-15	Learning Contact Dynamics for Control with Action-conditioned Face Interaction Graph Networks	Zongyao Yi et.al.	2509.12151	null
2025-09-14	Embodied Intelligence in Disassembly: Multimodal Perception Cross-validation and Continual Learning in Neuro-Symbolic TAMP	Ziwen He et.al.	2509.11270	null
2025-09-14	SAMP: Spatial Anchor-based Motion Policy for Collision-Aware Robotic Manipulators	Kai Chen et.al.	2509.11185	null
2025-09-14	End-to-End Visual Autonomous Parking via Control-Aided Attention	Chao Chen et.al.	2509.11090	null
2025-10-11	Follow-Bench: A Unified Motion Planning Benchmark for Socially-Aware Robot Person Following	Hanjing Ye et.al.	2509.10796	null
2025-09-12	STL-Based Motion Planning and Uncertainty-Aware Risk Analysis for Human-Robot Collaboration with a Multi-Rotor Aerial Vehicle	Giuseppe Silano et.al.	2509.10692	null
2025-09-11	Large Foundation Models for Trajectory Prediction in Autonomous Driving: A Comprehensive Survey	Wei Dai et.al.	2509.10570	null
2025-09-12	Coordinated Motion Planning of a Wearable Multi-Limb System for Enhanced Human-Robot Interaction	Chaerim Moon et.al.	2509.10444	null
2025-09-17	DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training	Jianxin Shi et.al.	2509.10426	null
2025-09-12	HHI-Assist: A Dataset and Benchmark of Human-Human Interaction in Physical Assistance Scenario	Saeed Saadatnejad et.al.	2509.10096	null
2025-09-12	BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird’s-Eye View with Deformable Attention and Sparse Goal Proposals	Minsang Kong et.al.	2509.10080	null
2025-09-11	BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging	Peng Zhou et.al.	2509.09484	null
2025-09-11	ProgD: Progressive Multi-scale Decoding with Dynamic Graphs for Joint Multi-agent Motion Forecasting	Xing Gao et.al.	2509.09210	null
2025-09-11	MGTraj: Multi-Granularity Goal-Guided Human Trajectory Prediction with Recursive Refinement Network	Ge Sun et.al.	2509.09200	null
2025-09-11	KoopMotion: Learning Almost Divergence Free Koopman Flow Fields for Motion Planning	Alice Kate Li et.al.	2509.09074	null
2025-09-11	Joint Model-based Model-free Diffusion for Planning with Constraints	Wonsuhk Jung et.al.	2509.08775	null
2025-09-10	Dual-Stage Safe Herding Framework for Adversarial Attacker in Dynamic Environment	Wenqing Wang et.al.	2509.08460	null
2025-09-09	Diffusion-Guided Multi-Arm Motion Planning	Viraj Parimi et.al.	2509.08160	null
2025-09-09	Decoding RobKiNet: Insights into Efficient Training of Robotic Kinematics Informed Neural Network	Yanlong Peng et.al.	2509.07646	null
2025-09-09	Safe and Non-Conservative Contingency Planning for Autonomous Vehicles via Online Learning-Based Reachable Set Barriers	Rui Yang et.al.	2509.07464	null
2025-09-08	First Plan Then Evaluate: Use a Vectorized Motion Planner for Grasping	Martin Matak et.al.	2509.07162	null
2025-09-08	Deep Reactive Policy: Learning Reactive Manipulator Motion Planning for Dynamic Environments	Jiahui Yang et.al.	2509.06953	null
2025-09-08	Safe Robust Predictive Control-based Motion Planning of Automated Surface Vessels in Inland Waterways	Sajad Ahmadi et.al.	2509.06687	null
2025-09-05	RoboBallet: Planning for Multi-Robot Reaching with Graph Neural Networks and Reinforcement Learning	Matthew Lai et.al.	2509.05397	null
2025-09-02	INF-3DP: Implicit Neural Fields for Collision-Free Multi-Axis 3D Printing	Jiasheng Qu et.al.	2509.05345	null
2025-09-01	Anticipatory Fall Detection in Humans with Hybrid Directed Graph Neural Networks and Long Short-Term Memory	Younggeol Cho et.al.	2509.05337	null
2025-09-04	SAFE–MA–RRT: Multi-Agent Motion Planning with Data-Driven Safety Certificates	Babak Esmaeili et.al.	2509.04413	null
2025-09-04	Lightweight Kinematic and Static Modeling of Cable-Driven Continuum Robots via Actuation-Space Energy Formulation	Ke Wu et.al.	2509.04119	null
2025-09-16	Keypoint-based Diffusion for Robotic Motion Planning on the NICOL Robot	Lennart Clasmeier et.al.	2509.04076	null
2025-09-04	Human Motion Video Generation: A Survey	Haiwei Xue et.al.	2509.03883	null
2025-09-03	sam-llm: interpretable lane change trajectoryprediction via parametric finetuning	Zhuo Cao et.al.	2509.03462	null
2025-09-03	KEPT: Knowledge-Enhanced Prediction of Trajectories from Consecutive Driving Frames with Vision-Language Models	Yujin Wang et.al.	2509.02966	null
2025-09-02	Systematic Evaluation of Trade-Offs in Motion Planning Algorithms for Optimal Industrial Robotic Work Cell Design	G. de Mathelin et.al.	2509.02146	null
2025-09-01	Multi-vessel Interaction-Aware Trajectory Prediction and Collision Risk Assessment	Md Mahbub Alam et.al.	2509.01836	null
2025-09-01	Articulated Object Estimation in the Wild	Abdelrhman Werby et.al.	2509.01708	null
2025-09-01	MoTo: A Zero-shot Plug-in Interaction-aware Navigation for General Mobile Manipulation	Zhenyu Wu et.al.	2509.01658	null
2025-09-01	A Hybrid Input based Deep Reinforcement Learning for Lane Change Decision-Making of Autonomous Vehicle	Ziteng Gao et.al.	2509.01611	null
2025-09-01	Metamorphic Testing of Multimodal Human Trajectory Prediction	Helge Spieker et.al.	2509.01294	null
2025-09-17	Hierarchical Reactive Grasping via Task-Space Velocity Fields and Joint-Space Quadratic Programming	Yonghyeon Lee et.al.	2509.01044	null
2025-09-17	One-Step Model Predictive Path Integral for Manipulator Motion Planning Using Configuration Space Distance Fields	Yulin Li et.al.	2509.00836	null
2025-09-06	An Effective Trajectory Planning and an Optimized Path Planning for a 6-Degree-of-Freedom Robot Manipulator	Takumu Okazaki et.al.	2509.00828	null
2025-08-30	Vehicle-in-Virtual-Environment (VVE) Method for Developing and Evaluating VRU Safety of Connected and Autonomous Driving with Focus on Bicyclist Safety	Haochong Chen et.al.	2509.00624	null
2025-08-30	NeuralSVCD for Efficient Swept Volume Collision Detection	Dongwon Son et.al.	2509.00499	null
2025-08-30	A Framework for Task and Motion Planning based on Expanding AND/OR Graphs	Fulvio Mastrogiovanni et.al.	2509.00317	null
2025-08-26	Hybrid Perception and Equivariant Diffusion for Robust Multi-Node Rebar Tying	Zhitao Wang et.al.	2509.00065	null
2025-08-29	Robust Convex Model Predictive Control with collision avoidance guarantees for robot manipulators	Bernhard Wullt et.al.	2508.21677	null
2025-08-29	Dynamics-Compliant Trajectory Diffusion for Super-Nominal Payload Manipulation	Anuj Pasricha et.al.	2508.21375	null
2025-08-29	Multi-Modal Model Predictive Path Integral Control for Collision Avoidance	Alberto Bertipaglia et.al.	2508.21364	null
2025-08-29	Learning to Assemble the Soma Cube with Legal-Action Masked DQN and Safe ZYZ Regrasp on a Doosan M0609	Jaehong Oh et.al.	2508.21272	null
2025-08-27	ScanMove: Motion Prediction and Transfer for Unregistered Body Meshes	Thomas Besnier et.al.	2508.21095	null
2025-09-04	HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning	Zhi Su et.al.	2508.21043	null
2025-09-05	Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees	Yaniv Hassidof et.al.	2508.21001	null
2025-08-28	Deep Fuzzy Optimization for Batch-Size and Nearest Neighbors in Optimal Robot Motion Planning	Liding Zhang et.al.	2508.20884	null
2025-08-28	Uncertainty Aware-Predictive Control Barrier Functions: Safer Human Robot Interaction through Probabilistic Motion Forecasting	Lorenzo Busellato et.al.	2508.20812	null
2025-08-28	CardioMorphNet: Cardiac Motion Prediction Using a Shape-Guided Bayesian Recurrent Deep Network	Reza Akbari Movahed et.al.	2508.20734	null
2025-08-27	Regulation-Aware Game-Theoretic Motion Planning for Autonomous Racing	Francesco Prignoli et.al.	2508.20203	null
2025-08-27	Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning	Jinhao Liang et.al.	2508.20095	null
2025-08-27	*APT: Asymptotically Optimal Motion Planning via Adaptively Prolated Elliptical R-Nearest Neighbors**	Liding Zhang et.al.	2508.19790	null
2025-08-27	Tree-Based Grafting Approach for Bidirectional Motion Planning with Local Subsets Optimization	Liding Zhang et.al.	2508.19776	null
2025-08-27	Elliptical K-Nearest Neighbors – Path Optimization via Coulomb’s Law and Invalid Vertices in C-space Obstacles	Liding Zhang et.al.	2508.19771	null
2025-08-27	Autonomous Aerial Manipulation at Arbitrary Pose in SE(3) with Robust Control and Whole-body Planning	Dongjae Lee et.al.	2508.19608	null
2025-09-16	Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning	Antonio Guillen-Perez et.al.	2508.18397	null
2025-08-26	FlowVLA: Thinking in Motion with a Visual Chain of Thought	Zhide Zhong et.al.	2508.18269	null
2025-08-25	Adaptive Output Steps: FlexiSteps Network for Dynamic Trajectory Prediction	Yunxiang Liu et.al.	2508.17797	null
2025-08-23	LLM-based Human-like Traffic Simulation for Self-driving Tests	Wendi Li et.al.	2508.16962	null
2025-08-23	Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model	Fan Ding et.al.	2508.16947	null
2025-08-21	Lang2Lift: A Framework for Language-Guided Pallet Detection and Pose Estimation Integrated in Autonomous Outdoor Forklift Operation	Huy Hoang Nguyen et.al.	2508.15427	null
2025-08-20	TRUST-Planner: Topology-guided Robust Trajectory Planner for AAVs with Uncertain Obstacle Spatial-temporal Avoidance	Junzhi Li et.al.	2508.14610	null
2025-08-20	FiReFly: Fair Distributed Receding Horizon Planning for Multiple UAVs	Nicole Fronda et.al.	2508.14381	null
2025-08-16	Task and Motion Planning for Humanoid Loco-manipulation	Michal Ciebielski et.al.	2508.14099	null
2025-08-20	Accelerating Signal-Temporal-Logic-Based Task and Motion Planning of Bipedal Navigation using Benders Decomposition	Jiming Ren et.al.	2508.13407	null
2025-08-18	BOW: Bayesian Optimization over Windows for Motion Planning in Complex Environments	Sourav Raxit et.al.	2508.13052	null
2025-08-28	On the complexity of constrained reconfiguration and motion planning	Nicolas Bousquet et.al.	2508.13032	null
2025-08-31	SocialTrack: Multi-Object Tracking in Complex Urban Traffic Scenes Inspired by Social Behavior	Wenguang Tao et.al.	2508.12777	null
2025-08-17	Autonomous Oil Spill Response Through Liquid Neural Trajectory Modeling and Coordinated Marine Robotics	Hadas C. Kuzmenko et.al.	2508.12456	null
2025-08-17	EgoLoc: A Generalizable Solution for Temporal Interaction Localization in Egocentric Videos	Junyi Ma et.al.	2508.12349	null
2025-08-15	A Comparative Study of Floating-Base Space Parameterizations for Agile Whole-Body Motion Planning	Evangelos Tsiatsianas et.al.	2508.11520	null
2025-08-15	Relative Position Matters: Trajectory Prediction and Planning with Polar Representation	Bozhou Zhang et.al.	2508.11492	null
2025-08-15	EvoPSF: Online Evolution of Autonomous Driving Models via Planning-State Feedback	Jiayue Jin et.al.	2508.11453	null
2025-08-15	ReachVox: Clutter-free Reachability Visualization for Robot Motion Planning in Virtual Reality	Steffen Hauck et.al.	2508.11426	null
2025-08-15	Learning Differentiable Reachability Maps for Optimization-based Humanoid Motion Generation	Masaki Murooka et.al.	2508.11275	null
2025-08-15	A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving	Jialin Li et.al.	2508.11218	null
2025-08-20	3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation	Nikolaos Gkanatsios et.al.	2508.11002	null
2025-08-14	SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving	Philipp Wolters et.al.	2508.10567	null
2025-08-14	STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes	Keishi Ishihara et.al.	2508.10427	null
2025-08-12	CLF-RL: Control Lyapunov Function Guided Reinforcement Learning	Kejun Li et.al.	2508.09354	null
2025-08-10	Whole-Body Coordination for Dynamic Object Grasping with Legged Manipulators	Qiwei Liang et.al.	2508.08328	null
2025-08-11	Learning an Implicit Physics Model for Image-based Fluid Simulation	Emily Yue-Ting Jia et.al.	2508.08254	null
2025-08-10	A Learning-Based Framework for Collision-Free Motion Planning	Mateus Salomão et.al.	2508.07502	null
2025-08-10	Noise-Aware Generative Microscopic Traffic Simulation	Vindula Jayawardana et.al.	2508.07453	null
2025-08-10	Bio-Inspired Topological Autonomous Navigation with Active Inference in Robotics	Daria de Tinguy et.al.	2508.07267	null
2025-08-12	Understanding Dynamic Scenes in Ego Centric 4D Point Clouds	Junsheng Huang et.al.	2508.07251	null
2025-08-10	CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion	Xiaotong Lin et.al.	2508.07162	null
2025-08-10	Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction	Yu Liu et.al.	2508.07146	null
2025-08-09	ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting	Sandro Papais et.al.	2508.07089	null
2025-08-09	Model Predictive Control for Crowd Navigation via Learning-Based Trajectory Prediction	Mohamed Parvez Aslam et.al.	2508.07079	null
2025-08-05	Historical Prediction Attention Mechanism based Trajectory Forecasting for Proactive Work Zone Safety in a Digital Twin Environment	Minhaj Uddin Ahmad et.al.	2508.06544	null
2025-08-04	Symbolic Learning of Interpretable Reduced-Order Models for Jumping Quadruped Robots	Gioele Buriani et.al.	2508.06538	null
2025-08-08	*V: An Efficient Motion Planning Algorithm for Autonomous Vehicles**	Abdullah Zareh Andaryan et.al.	2508.06404	null
2025-08-08	Incremental Language Understanding for Online Motion Planning of Robot Manipulators	Mitchell Abrams et.al.	2508.06095	null
2025-08-08	Dynamical Trajectory Planning of Disturbance Consciousness for Air-Land Bimodal Unmanned Aerial Vehicles	Shaoting Liu et.al.	2508.05972	null
2025-08-07	TrajEvo: Trajectory Prediction Heuristics Design via LLM-driven Evolution	Zhikai Zhao et.al.	2508.05616	null
2025-08-07	Benchmarking Shortcutting Techniques for Multi-Robot-Arm Motion Planning	Philip Huang et.al.	2508.05027	null
2025-08-06	LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction	Md Zahidul Hasan et.al.	2508.04847	null
2025-08-06	BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning	Ziyang Leng et.al.	2508.04702	null
2025-08-06	Incorporating Stochastic Models of Controller Behavior into Kinodynamic Efficiently Adaptive State Lattices for Mobile Robot Motion Planning in Off-Road Environments	Eric R. Damm et.al.	2508.04384	null
2025-08-07	Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction	Yu Liu et.al.	2508.04229	null
2025-08-11	Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems	Luai Abuelsamen et.al.	2508.04146	null
2025-08-05	Constraint-Preserving Data Generation for Visuomotor Policy Learning	Kevin Lin et.al.	2508.03944	null
2025-08-05	Vision-based Perception System for Automated Delivery Robot-Pedestrians Interactions	Ergi Tushe et.al.	2508.03541	null
2025-08-04	X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio	Chenxu Zhang et.al.	2508.02944	null
2025-08-04	MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model	Tianheng Zhu et.al.	2508.02858	null
2025-08-04	Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering	Xu Wang et.al.	2508.02362	null
2025-08-19	Adaptive Lattice-based Motion Planning	Abhishek Dhar et.al.	2508.02350	null
2025-08-04	Framework for Robust Motion Planning of Tethered Multi-Robot Systems in Marine Environments	Markus Buchholz et.al.	2508.02287	null
2025-08-04	AID4AD: Aerial Image Data for Automated Driving Perception	Daniel Lengerer et.al.	2508.02140	null
2025-08-03	Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving	Hunter Schofield et.al.	2508.01922	null
2025-08-03	DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion	Zhigang Sun et.al.	2508.01778	null
2025-08-03	A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction	Hua Yu et.al.	2508.01585	null
2025-07-29	A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles	Jiayuan Wang et.al.	2508.00917	null
2025-08-01	On Learning Closed-Loop Probabilistic Multi-Agent Simulator	Juanwu Lu et.al.	2508.00384	null
2025-08-01	TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps	Zehui Xu et.al.	2508.00303	null
2025-07-31	Data-Driven Motion Planning for Uncertain Nonlinear Systems	Babak Esmaeili et.al.	2508.00154	null
2025-07-31	OmniTraj: Pre-Training on Heterogeneous Data for Adaptive and Zero-Shot Human Trajectory Prediction	Yang Gao et.al.	2507.23657	null
2025-07-31	A Framework for Ethical Decision-Making in Automated Vehicles through Human Reasons-based Supervision	Lucas Elbert Suryana et.al.	2507.23308	null
2025-07-31	Simulation-based planning of Motion Sequences for Automated Procedure Optimization in Multi-Robot Assembly Cells	Loris Schneider et.al.	2507.23270	null
2025-08-01	Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future	Guoping Xu et.al.	2507.22792	null
2025-07-30	Social-Pose: Enhancing Trajectory Prediction with Human Body Pose	Yang Gao et.al.	2507.22742	null
2025-07-30	Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model	Daehee Park et.al.	2507.22615	null
2025-07-30	Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators	Kaustav Chakraborty et.al.	2507.22389	null
2025-07-27	Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars	Mattia Piccinini et.al.	2507.20427	null
2025-07-27	VLMPlanner: Integrating Visual Language Models with Motion Planning	Zhipeng Tang et.al.	2507.20342	null
2025-07-27	PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks	Clinton Ansun Mo et.al.	2507.20170	null
2025-07-25	PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction	Haichuan Li et.al.	2507.19701	null
2025-07-25	RAKOMO: Reachability-Aware K-Order Markov Path Optimization for Quadrupedal Loco-Manipulation	Mattia Risiglione et.al.	2507.19652	null
2025-07-25	High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins	Lorenzo Cazzella et.al.	2507.19173	null
2025-07-31	PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction	Yanghong Liu et.al.	2507.19119	null
2025-07-24	Probabilistic Collision Risk Estimation through Gauss-Legendre Cubature and Non-Homogeneous Poisson Processes	Trent Weiss et.al.	2507.18819	null
2025-07-24	Delving into Mapping Uncertainty for Mapless Trajectory Prediction	Zongzheng Zhang et.al.	2507.18498	null
2025-07-24	Goal-based Trajectory Prediction for improved Cross-Dataset Generalization	Daniel Grimm et.al.	2507.18196	null
2025-07-24	DanceGraph: A Complementary Architecture for Synchronous Dancing Online	David Sinclair et.al.	2507.18052	null
2025-07-23	Safety Assurance for Quadrotor Kinodynamic Motion Planning	Theodoros Tavoulareas et.al.	2507.17679	null
2025-07-23	IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception	Haichuan Li et.al.	2507.17445	null
2025-08-06	DeMo++: Motion Decoupling for Autonomous Driving	Bozhou Zhang et.al.	2507.17342	null
2025-07-23	JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction	Fangze Lin et.al.	2507.17152	null
2025-07-23	Falconry-like palm landing by a flapping-wing drone based on the human gesture interaction and distance-aware flight planning	Kazuki Numazato et.al.	2507.17144	null
2025-07-22	RAPTAR: Radar Radiation Pattern Acquisition through Automated Collaborative Robotics	Maaz Qureshi et.al.	2507.16988	null
2025-07-21	Resilience Evaluation of Kubernetes in Cloud-Edge Environments via Failure Injection	Zihao Chen et.al.	2507.16109	null
2025-07-21	Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction	Shiyang Li et.al.	2507.15832	null
2025-07-21	Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs	Ruochu Yang et.al.	2507.15782	null
2025-07-21	Selective Densification for Rapid Motion Planning in High Dimensions with Narrow Passages	Lu Huang et.al.	2507.15710	null
2025-07-21	A Universal Vehicle-Trailer Navigation System with Neural Kinematics and Online Residual Learning	Yanbo Chen et.al.	2507.15607	null
2025-07-21	VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving	Haichao Liu et.al.	2507.15266	null
2025-07-20	Search-Based Autonomous Vehicle Motion Planning Using Game Theory	Pouya Panahandeh et.al.	2507.15088	null
2025-07-20	CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning	Pan Hu et.al.	2507.14903	null
2025-07-18	Context-Aware Behavior Learning with Heuristic Motion Memory for Underwater Manipulation	Markus Buchholz et.al.	2507.14099	null
2025-07-18	NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning	Qingyi Chen et.al.	2507.13940	null
2025-07-18	Conformal Contraction for Robust Nonlinear Control with Distribution-Free Uncertainty Quantification	Sihang Wei et.al.	2507.13613	null
2025-08-08	Trustworthy Pedestrian Trajectory Prediction via Pattern-Aware Interaction Modeling	Kaiyuan Zhai et.al.	2507.13397	null
2025-07-25	Signal Temporal Logic Compliant Co-design of Planning and Control	Manas Sashank Juvvi et.al.	2507.13225	null
2025-07-22	Predictability-Aware Motion Prediction for Edge XR via High-Order Error-State Kalman Filtering	Ziyu Zhong et.al.	2507.13179	null
2025-07-17	Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning	Giwon Lee et.al.	2507.12977	null
2025-07-17	FFI-VTR: Lightweight and Robust Visual Teach and Repeat Navigation based on Feature Flow Indicator and Probabilistic Motion Planning	Jikai Wang et.al.	2507.12800	null
2025-07-16	MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding	Renjie Li et.al.	2507.12463	null
2025-07-16	Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios	Van-Hoang-Anh Phan et.al.	2507.12449	null
2025-07-16	Regrasp Maps for Sequential Manipulation Planning	Svetlana Levit et.al.	2507.12407	null
2025-07-17	Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics	Muleilan Pei et.al.	2507.12083	null
2025-07-16	IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving	Kanghyun Ryu et.al.	2507.11940	null
2025-07-16	A Fast Method for Planning All Optimal Homotopic Configurations for Tethered Robots and Its Extended Applications	Jinyuan Liu et.al.	2507.11880	null
2025-07-15	MPC-based Coarse-to-Fine Motion Planning for Robotic Object Transportation in Cluttered Environments	Chen Cai et.al.	2507.11211	null
2025-07-15	Enhancing Autonomous Manipulator Control with Human-in-loop for Uncertain Assembly Environments	Ashutosh Mishra et.al.	2507.11006	null
2025-07-15	OffsetCrust: Variable-Radius Offset Approximation with Power Diagrams	Zihan Zhao et.al.	2507.10924	null
2025-07-15	Mixed Discrete and Continuous Planning using Shortest Walks in Graphs of Convex Sets	Savva Morozov et.al.	2507.10878	null
2025-07-14	A Generalizable Physics-Enhanced State Space Model for Long-Term Dynamics Forecasting in Complex Environments	Yuchen Wang et.al.	2507.10792	null
2025-07-23	Leveraging RAG-LLMs for Urban Mobility Simulation and Analysis	Yue Ding et.al.	2507.10382	null
2025-07-16	TOP: Trajectory Optimization via Parallel Optimization towards Constant Time Complexity	Jiajun Yu et.al.	2507.10290	null
2025-07-14	MP-RBFN: Learning-based Vehicle Motion Primitives using Radial Basis Function Networks	Marc Kaufeld et.al.	2507.10047	null
2025-07-24	Active Probing with Multimodal Predictions for Motion Planning	Darshan Gadginmath et.al.	2507.09822	null
2025-07-13	Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions	Yuanhong Zheng et.al.	2507.09446	null
2025-07-12	Real-Time Adaptive Motion Planning via Point Cloud-Guided, Energy-Based Diffusion and Potential Fields	Wondmgezahu Teshome et.al.	2507.09383	null
2025-07-19	Informed Hybrid Zonotope-based Motion Planning Algorithm	Peng Xie et.al.	2507.09309	null
2025-07-12	Integrating Planning and Predictive Control Using the Path Feasibility Governor	Shu Zhang et.al.	2507.09134	null
2025-07-09	Next-Generation Travel Demand Modeling with a Generative Framework for Household Activity Coordination	Xishun Liao et.al.	2507.08871	null
2025-07-14	STRAP: Spatial-Temporal Risk-Attentive Vehicle Trajectory Prediction for Autonomous Driving	Xinyi Ning et.al.	2507.08563	null
2025-07-11	Prediction of Lane Change Intentions of Human Drivers using an LSTM, a CNN and a Transformer	Francesco De Cristofaro et.al.	2507.08365	null
2025-07-11	Neural Parameter-varying Data-enabled Predictive Control of Cold Atmospheric Pressure Plasma Jets	Pegah GhafGhanbari et.al.	2507.08259	null
2025-07-10	GGMotion: Group Graph Dynamics-Kinematics Networks for Human Motion Prediction	Shuaijin Wan et.al.	2507.07515	null
2025-07-10	Towards Safe Autonomous Driving: A Real-Time Safeguarding Concept for Motion Planning Algorithms	Korbinian Moller et.al.	2507.07444	null
2025-07-09	When Context Is Not Enough: Modeling Unexplained Variability in Car-Following Behavior	Chengyuan Zhang et.al.	2507.07012	null
2025-07-09	Robust signal decompositions on the circle	Aral Kose et.al.	2507.07007	null
2025-07-09	ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture	Mingjin Zeng et.al.	2507.06531	null
2025-07-08	AURA-CVC: Autonomous Ultrasound-guided Robotic Assistance for Central Venous Catheterization	Deepak Raina et.al.	2507.05979	null
2025-07-08	DRO-EDL-MPC: Evidential Deep Learning-Based Distributionally Robust Model Predictive Control for Safe Autonomous Driving	Hyeongchan Ham et.al.	2507.05710	null
2025-07-07	From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving	Fabian Konstantinidis et.al.	2507.05254	null
2025-07-07	Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance	Tobias Demmler et.al.	2507.05098	null
2025-07-07	Unifying Robot Optimization: Monte Carlo Tree Search with Tensor Factorization	Teng Xue et.al.	2507.04949	null
2025-07-25	Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning	Giwon Lee et.al.	2507.04790	null
2025-07-07	LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction	Yixin Yan et.al.	2507.04634	null
2025-07-06	Free-Space Optical Communication-Driven NMPC Framework for Multi-Rotor Aerial Vehicles in Structured Inspection Scenarios	Giuseppe Silano et.al.	2507.04443	null
2025-07-05	Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic	Jianwei Tang et.al.	2507.04062	null
2025-07-05	Temporal Continual Learning with Prior Compensation for Human Motion Prediction	Jianwei Tang et.al.	2507.04060	null
2025-07-05	DK-RRT: Deep Koopman RRT for Collision-Aware Motion Planning of Space Manipulators in Dynamic Debris Environments	Qi Chen et.al.	2507.03878	null
2025-07-05	Enhanced accuracy through ensembling of randomly initialized auto-regressive models for time-dependent PDEs	Ishan Khurjekar et.al.	2507.03863	null
2025-07-04	Label-Free Long-Horizon 3D UAV Trajectory Prediction via Motion-Aligned RGB and Event Cues	Hanfang Liang et.al.	2507.03365	null
2025-07-03	Trajectory Optimization for Differential Drive Mobile Manipulators via Topological Paths Search and Arc Length-Yaw Parameterization	Long Xu et.al.	2507.02761	null
2025-07-03	Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization	Caio Azevedo et.al.	2507.02406	null
2025-07-03	Path Planning using a One-shot-sampling Skeleton Map	Gabriel O. Flores-Aquino et.al.	2507.02328	null
2025-07-02	GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters	Wanjia Zhao et.al.	2507.02085	null
2025-07-09	Test-Time Scaling with Reflective Generative Model	Zixiao Wang et.al.	2507.01951	null
2025-07-06	AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction	Bin Rao et.al.	2507.01801	null
2025-07-02	Efficient Collision Detection for Long and Slender Robotic Links in Euclidean Distance Fields: Application to a Forestry Crane	Marc-Philip Ecker et.al.	2507.01705	null
2025-07-02	LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction	Muhammad Atta ur Rahman et.al.	2507.01308	null
2025-07-01	Search-Based Robot Motion Planning With Distance-Based Adaptive Motion Primitives	Benjamin Kraljusic et.al.	2507.01198	null
2025-07-01	ARIG: Autoregressive Interactive Head Generation for Real-time Conversations	Ying Guo et.al.	2507.00472	null
2025-06-30	Rethink 3D Object Detection from Physical World	Satoshi Tanaka et.al.	2507.00190	null
2025-06-30	Epona: Autoregressive Diffusion World Model for Autonomous Driving	Kaiwen Zhang et.al.	2506.24113	null
2025-06-30	STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems	Mingfei Cheng et.al.	2506.23995	null
2025-06-29	InfGen: Scenario Generation as Next Token Group Prediction	Zhenghao Peng et.al.	2506.23316	null
2025-06-29	Mode Collapse Happens: Evaluating Critical Interactions in Joint Trajectory Prediction Models	Maarten Hugenholtz et.al.	2506.23164	null
2025-06-28	Safe Reinforcement Learning with a Predictive Safety Filter for Motion Planning and Control: A Drifting Vehicle Example	Bei Zhou et.al.	2506.22894	null
2025-06-27	Pedestrian Intention and Trajectory Prediction in Unstructured Traffic Using IDD-PeD	Ruthvik Bokkasam et.al.	2506.22111	null
2025-06-27	A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments	Akshay Jaitly et.al.	2506.21982	null
2025-06-27	SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model	Shuhan Tan et.al.	2506.21976	null
2025-07-14	Ark: An Open-source Python-based Framework for Robot Learning	Magnus Dierking et.al.	2506.21628	null
2025-06-26	GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction	Muleilan Pei et.al.	2506.21121	null
2025-06-25	Near Time-Optimal Hybrid Motion Planning for Timber Cranes	Marc-Philip Ecker et.al.	2506.20314	null
2025-06-24	Trajectory Prediction in Dynamic Object Tracking: A Critical Study	Zhongping Dong et.al.	2506.19341	null
2025-06-25	AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation	Ziyan Zhao et.al.	2506.19269	null
2025-08-04	Faster Motion Planning via Restarts	Nancy Amato et.al.	2506.19016	null
2025-06-23	SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives	Yizhou Chen et.al.	2506.18825	null
2025-06-23	Design, fabrication and control of a cable-driven parallel robot	Dhruv Sorathiya et.al.	2506.18526	null
2025-06-23	Integrating Maneuverable Planning and Adaptive Control for Robot Cart-Pushing under Disturbances	Zhe Zhang et.al.	2506.18410	null
2025-06-23	Selective Social-Interaction via Individual Importance for Fast Human Trajectory Prediction	Yota Urano et.al.	2506.18291	null
2025-06-23	Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning	Yue Li et.al.	2506.18234	null
2025-06-20	Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation	Xiuyu Yang et.al.	2506.17213	null
2025-06-20	Judo: A User-Friendly Open-Source Package for Sampling-Based Model Predictive Control	Albert H. Li et.al.	2506.17184	null
2025-07-11	Experimental Setup and Software Pipeline to Evaluate Optimization based Autonomous Multi-Robot Search Algorithms	Aditya Bhatt et.al.	2506.16710	null
2025-10-08	Trajectory Prediction Meets Large Language Models: A Survey	Yi Xu et.al.	2506.03408	null
2025-05-13	A Framework for Joint Grasp and Motion Planning in Confined Spaces	Martin Rudorfer et.al.	2505.07259	null
2025-04-23	Dynamic Intent Queries for Motion Transformer-based Trajectory Prediction	Tobias Demmler et.al.	2504.15766	null
2025-04-08	Vision-Language Model Predictive Control for Manipulation Planning and Trajectory Generation	Jiaming Chen et.al.	2504.05225	null
2025-02-18	Prediction uncertainty-aware planning using deep ensembles and trajectory optimisation	Anshul Nayak et.al.	2502.10585	null
2025-01-23	Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning	Xiaolei Chen et.al.	2501.12799	null
2024-12-16	Adaptive Dual-Headway Unicycle Pose Control and Motion Prediction for Optimal Sampling-Based Feedback Motion Planning	Aykut İşleyen et.al.	2412.10350	null
2024-11-18	Planning by Simulation: Motion Planning with Learning-based Parallel Scenario Prediction for Autonomous Driving	Tian Niu et.al.	2411.09887	null
2024-11-05	Enhancing Social Robot Navigation with Integrated Motion Prediction and Trajectory Planning in Dynamic Human Environments	Thanh Nguyen Canh et.al.	2411.01814	null
2024-11-05	Interaction-Aware Trajectory Prediction for Safe Motion Planning in Autonomous Driving: A Transformer-Transfer Learning Approach	Jinhao Liang et.al.	2411.01475	null
2025-03-03	Stochasticity in Motion: An Information-Theoretic Approach to Trajectory Prediction	Aron Distelzweig et.al.	2410.01628	null
2024-07-09	Potential Based Diffusion Motion Planning	Yunhao Luo et.al.	2407.06169	null
2024-07-09	MSTF: Multiscale Transformer for Incomplete Trajectory Prediction	Zhanwen Liu et.al.	2407.05671	null
2024-05-22	Towards Using Fast Embedded Model Predictive Control for Human-Aware Predictive Robot Navigation	Till Hielscher et.al.	2405.12616	null
2024-05-17	Integrating Uncertainty-Aware Human Motion Prediction into Graph-Based Manipulator Motion Planning	Wansong Liu et.al.	2405.09779	null
2025-06-23	ControlMTR: Control-Guided Motion Transformer with Scene-Compliant Intention Points for Feasible Motion Prediction	Jiawei Sun et.al.	2404.10295	null
2024-04-12	Model Predictive Trajectory Planning for Human-Robot Handovers	Thies Oelerich et.al.	2404.07505	null
2024-03-21	LaCE-LHMP: Airflow Modelling-Inspired Long-Term Human Motion Prediction By Enhancing Laminar Characteristics in Human Flow	Yufei Zhu et.al.	2403.13640	null
2025-01-22	Robust Predictive Motion Planning by Learning Obstacle Uncertainty	Jian Zhou et.al.	2403.06222	null
2024-02-06	SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving	Lu Zhang et.al.	2402.02519	null
2023-12-29	A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs	Jiageng Zhong et.al.	2311.12893	null
2024-02-29	Large Trajectory Models are Scalable Motion Predictors and Planners	Qiao Sun et.al.	2310.19620	null
2023-10-17	BEVGPT: Generative Pre-trained Large Model for Autonomous Driving Prediction, Decision-Making, and Planning	Pengqin Wang et.al.	2310.10357	null
2023-10-05	Incorporating Target Vehicle Trajectories Predicted by Deep Learning Into Model Predictive Controlled Vehicles	Ni Dang et.al.	2310.02843	null
2023-09-19	Distributionally Robust CVaR-Based Safety Filtering for Motion Planning in Uncertain Environments	Sleiman Safaoui et.al.	2309.08821	null
2023-08-04	An enhanced motion planning approach by integrating driving heterogeneity and long-term trajectory prediction for automated driving systems	Ni Dong et.al.	2308.01369	null
2024-03-12	MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying	Shaoshuai Shi et.al.	2306.17770	null
2023-06-21	QCNeXt: A Next-Generation Framework For Joint Multi-Agent Trajectory Prediction	Zikang Zhou et.al.	2306.10508	null
2023-06-12	Trajectory Prediction with Observations of Variable-Length for Motion Planning in Highway Merging scenarios	Sajjad Mozaffari et.al.	2306.05478	null
2023-06-06	Situational Adaptive Motion Prediction for Firefighting Squads in Indoor Search and Rescue	Nils Mandischer et.al.	2306.02705	null
2023-07-24	Motion-Scenario Decoupling for Rat-Aware Video Position Prediction: Strategy and Benchmark	Xiaofeng Liu et.al.	2305.18310	null
2023-09-29	TrafficBots: Towards World Models for Autonomous Driving Simulation and Motion Prediction	Zhejun Zhang et.al.	2303.04116	null
2023-03-03	Predicting Motion Plans for Articulating Everyday Objects	Arjun Gupta et.al.	2303.01484	null
2023-01-18	Risk-aware Vehicle Motion Planning Using Bayesian LSTM-Based Model Predictive Control	Yufei Huang et.al.	2301.06201	null
2022-12-06	Perceive, Interact, Predict: Learning Dynamic and Static Clues for End-to-End Motion Prediction	Bo Jiang et.al.	2212.02181	null
2022-12-02	Adaptive Conformal Prediction for Motion Planning among Dynamic Agents	Anushri Dixit et.al.	2212.00278	null
2023-07-17	R-Pred: Two-Stage Motion Prediction Via Tube-Query Attention-Based Trajectory Refinement	Sehwan Choi et.al.	2211.08609	null
2022-11-04	P4P: Conflict-Aware Motion Prediction for Planning in Autonomous Driving	Qiao Sun et.al.	2211.01634	null
2023-05-23	Sequence-Based Plan Feasibility Prediction for Efficient Task and Motion Planning	Zhutian Yang et.al.	2211.01576	null
2022-10-25	Planning Coordinated Human-Robot Motions with Neural Network Full-Body Prediction Models	Philipp Kratzer et.al.	2210.13317	null
2022-10-18	Evaluating Guiding Spaces for Motion Planning	Amnon Attali et.al.	2210.08640	null
2022-10-13	Local Planner Bench: Benchmarking for Local Motion Planning	Max Spahn et.al.	2210.06033	null
2022-09-22	MTR-A: 1st Place Solution for 2022 Waymo Open Dataset Challenge – Motion Prediction	Shaoshuai Shi et.al.	2209.10033	null
2022-08-25	Robot Motion Planning as Video Prediction: A Spatio-Temporal Neural Network-based Motion Planner	Xiao Zang et.al.	2208.11287	null
2023-02-21	Differentiable Integrated Motion Prediction and Planning with Learnable Cost Function for Autonomous Driving	Zhiyu Huang et.al.	2207.10422	null
2022-06-28	ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning	Yuxiao Chen et.al.	2206.13387	null
2022-06-14	Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory Prediction	Lihuan Li et.al.	2206.05712	null
2022-06-07	MotionCNN: A Strong Baseline for Motion Prediction in Autonomous Driving	Stepan Konev et.al.	2206.02163	null
2023-03-22	Semi-supervised Semantics-guided Adversarial Training for Trajectory Prediction	Ruochen Jiao et.al.	2205.14230	null
2022-04-28	Autonomous Vehicle Parking in Dynamic Environments: An Integrated System with Prediction and Motion Planning	Jessica Leu et.al.	2204.10383	null
2022-04-06	Deep Interactive Motion Prediction and Planning: Playing Games with Motion Prediction Models	Jose L. Vazquez et.al.	2204.02392	null
2022-03-28	Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion	Tianpei Gu et.al.	2203.13777	null
2022-08-16	Flash: Fast and Light Motion Prediction for Autonomous Driving with Bayesian Inverse Planning and Learned Motion Profiles	Morris Antonello et.al.	2203.08251	null
2022-02-28	From Low to High Order Motion Planners: Safe Robot Navigation using Motion Prediction and Reference Governor	Aykut İşleyen et.al.	2202.12816	null
2022-02-04	Technical Report: A Hierarchical Deliberative-Reactive System Architecture for Task and Motion Planning in Partially Known Environments	Vasileios Vasilopoulos et.al.	2202.01385	null
2022-07-27	Motion Planning in Dynamic Environments Using Context-Aware Human Trajectory Prediction	Mark Nicholas Finean et.al.	2201.05058	null
2022-01-10	Data-Efficient Learning of High-Quality Controls for Kinodynamic Planning used in Vehicular Navigation	Seth Karten et.al.	2201.02254	null
2021-11-18	Trajectory Prediction & Path Planning for an Object Intercepting UAV with a Mounted Depth Camera	Jasper Tan et.al.	2111.09083	null
2022-05-20	PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation	Alexey Kamenev et.al.	2109.11094	null
2021-09-21	Interactive multi-modal motion planning with Branch Model Predictive Control	Yuxiao Chen et.al.	2109.05128	null
2021-12-14	CovarianceNet: Conditional Generative Model for Correct Covariance Prediction in Human Motion Prediction	Aleksey Postnikov et.al.	2109.02965	null
2022-03-17	Group-based Motion Prediction for Navigation in Crowded Environments	Allan Wang et.al.	2107.11637	null
2021-06-15	Transition Motion Planning for Multi-Limbed Vertical Climbing Robots Using Complementarity Constraints	Jingwen Zhang et.al.	2106.07127	null
2021-03-16	Neural Motion Prediction for In-flight Uneven Object Catching	Hongxiang Yu et.al.	2103.08368	null
2021-10-22	Learning to Predict Vehicle Trajectories with Model-based Planning	Haoran Song et.al.	2103.04027	null
2021-08-02	FloMo: Tractable Motion Prediction with Normalizing Flows	Christoph Schöller et.al.	2103.03614	null
2021-03-18	Motion Planning for a Pair of Tethered Robots	Reza H. Teshnizi et.al.	2102.13212	null
2021-02-25	Learning Interaction-Aware Trajectory Predictions for Decentralized Multi-Robot Motion Planning in Dynamic Environments	Hai Zhu et.al.	2102.05382	null
2020-12-14	Learning How to Trade-Off Safety with Agility Using Deep Covariance Estimation for Perception Driven UAV Motion Planning	Onur Akgun et.al.	2012.06410	null
2020-11-30	Reactive motion planning with probabilistic safety guarantees	Yuxiao Chen et.al.	2011.03590	null
2020-08-14	Perceive, Predict, and Plan: Safe Motion Planning Through Interpretable Semantic Representations	Abbas Sadat et.al.	2008.05930	null
2022-02-08	Predicted Composite Signed-Distance Fields for Real-Time Motion Planning in Dynamic Environments	Mark Nicholas Finean et.al.	2008.00969	null
2020-07-07	Probabilistic Multi-modal Trajectory Prediction with Lane Attention for Autonomous Vehicles	Chenxu Luo et.al.	2007.02574	null
2021-06-22	Long-term Pedestrian Trajectory Prediction using Mutable Intention Filter and Warp LSTM	Zhe Huang et.al.	2007.00113	null
2020-07-07	Learning Manifolds for Sequential Motion Planning	Isabel M. Rayas Fernández et.al.	2006.07746	null
2020-06-09	Robotic Motion Planning using Learned Critical Sources and Local Sampling	Rajat Kumar Jenamani et.al.	2006.04194	null
2020-06-02	HMPO: Human Motion Prediction in Occluded Environments for Safe Motion Planning	Jae Sung Park et.al.	2006.00424	null
2020-05-06	CoMoGCN: Coherent Motion Aware Trajectory Prediction with Graph Representation	Yuying Chen et.al.	2005.00754	null
2021-02-09	TPNet: Trajectory Proposal Network for Motion Prediction	Liangji Fang et.al.	2004.12255	null
2021-01-19	PiP: Planning-informed Trajectory Prediction for Autonomous Driving	Haoran Song et.al.	2003.11476	null
2020-01-24	Socially intelligent task and motion planning for human-robot interaction	Andrea Frank et.al.	2001.08398	null
2020-05-08	A Real-Time Approach for Chance-Constrained Motion Planning with Dynamic Obstacles	Manuel Castillo-Lopez et.al.	2001.08012	null
2020-05-26	CIAO $^\star$ : MPC-based Safe Motion Planning in Predictable Dynamic Environments	Tobias Schoels et.al.	2001.05449	null
2019-10-21	Map-Predictive Motion Planning in Unknown Environments	Amine Elhafsi et.al.	1910.08184	null
2020-03-19	Prediction of Human Full-Body Movements with Motion Optimization and Recurrent Neural Networks	Philipp Kratzer et.al.	1910.01843	null
2019-06-26	Planning Robot Motion using Deep Visual Prediction	Meenakshi Sarkar et.al.	1906.10182	null
2019-05-30	Scene Induced Multi-Modal Trajectory Forecasting via Planning	Nachiket Deo et.al.	1905.09949	null
2020-07-27	Human Motion Trajectory Prediction: A Survey	Andrey Rudenko et.al.	1905.06113	null
2018-07-20	Motion planning in high-dimensional spaces	Luka Petrović et.al.	1806.07457	null
2019-09-20	Transferable Pedestrian Motion Prediction Models at Intersections	Macheng Shen et.al.	1804.00495	null
2018-05-29	How would surround vehicles move? A Unified Framework for Maneuver Classification and Motion Prediction	Nachiket Deo et.al.	1801.06523	null
2017-08-24	Towards Cooperative Motion Planning for Automated Vehicles in Mixed Traffic	Maximilian Naumann et.al.	1708.06962	null
2017-11-28	I-Planner: Intention-Aware Motion Planning Using Learning Based Human Motion Prediction	Jae Sung Park et.al.	1608.04837	null
2016-06-08	Goal Set Inverse Optimal Control and Iterative Re-planning for Predicting Human Reaching Motions in Shared Workspaces	Jim Mainprice et.al.	1606.02111	null
2016-01-26	Sampling-based Algorithms for Optimal Motion Planning Using Closed-loop Prediction	Oktay Arslan et.al.	1601.06326	null