Updated on 2025.05.17
This page is maintained by Leheng Li that contains papers he interested in. Source code of this web is at here.
3D
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-15 | 3D-Fixup: Advancing Photo Editing with 3D Priors | Yen-Chi Cheng et.al. | 2505.10566 | null |
2025-05-15 | Pharmacophore-Conditioned Diffusion Model for Ligand-Based De Novo Drug Design | Amira Alakhdar et.al. | 2505.10545 | null |
2025-05-15 | AGN star dynamics under the Influence of Outflow-Ambient Interactions | Muxin Liu et.al. | 2505.10524 | null |
2025-05-15 | Consistent Quantity-Quality Control across Scenes for Deployment-Aware Gaussian Splatting | Fengdi Zhang et.al. | 2505.10473 | null |
2025-05-15 | HWA-UNETR: Hierarchical Window Aggregate UNETR for 3D Multimodal Gastric Lesion Segmentation | Jiaming Liang et.al. | 2505.10464 | link |
2025-05-15 | HandReader: Advanced Techniques for Efficient Fingerspelling Recognition | Pavel Korotaev et.al. | 2505.10267 | null |
2025-05-15 | ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization | Wenhao Shen et.al. | 2505.10250 | link |
2025-05-15 | MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation | Yanbo Ding et.al. | 2505.10238 | null |
2025-05-15 | UAV-Enabled Passive 6DMA for ISAC: Joint Location, Orientation, and Reflection Optimization | Peilan Wang et.al. | 2505.10220 | null |
2025-05-15 | UV-SERS monitoring of plasmons photodegradation of biomolecules on Aluminum platforms decorated with Rhodium nanoparticles | Yanqiu Zou et.al. | 2505.10216 | null |
2025-05-15 | VolE: A Point-cloud Framework for Food 3D Reconstruction and Volume Estimation | Umair Haroon et.al. | 2505.10205 | null |
2025-05-15 | Modeling droplet-particle interactions on solid surfaces by coupling the lattice Boltzmann and discrete element methods | Abhinav Naga et.al. | 2505.10171 | null |
2025-05-15 | VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality | Xuechang Tu et.al. | 2505.10144 | link |
2025-05-15 | IMITATE: Image Registration with Context for unknown time frame recovery | Ziad Kheil et.al. | 2505.10124 | link |
2025-05-15 | A label-free sub-diffractive technique for 3D intracellular tomography using thermally induced convection currents | Jayesh Goswami et.al. | 2505.10112 | null |
2025-05-15 | Data-driven discovery of the equations of turbulent convection | Christopher J. Wareing et.al. | 2505.10109 | null |
2025-05-15 | EmbodiedMAE: A Unified 3D Multi-Modal Representation for Robot Manipulation | Zibin Dong et.al. | 2505.10105 | null |
2025-05-15 | Longitudinal oscillations for eigenfunctions in rod like structures | Pablo Benavent-Ocejo et.al. | 2505.10084 | null |
2025-05-15 | FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation | Jun Guo et.al. | 2505.10075 | null |
2025-05-15 | ToonifyGB: StyleGAN-based Gaussian Blendshapes for 3D Stylized Head Avatars | Rui-Yang Ju et.al. | 2505.10072 | null |
2025-05-15 | Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field | Jinlong Fan et.al. | 2505.10049 | link |
2025-05-15 | From Air to Wear: Personalized 3D Digital Fashion with AR/VR Immersive 3D Sketching | Ying Zang et.al. | 2505.09998 | null |
2025-05-15 | APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds | Yuan Gao et.al. | 2505.09971 | link |
2025-05-15 | Hyper Yoshimura: How a slight tweak on a classical folding pattern unleashes meta-stability for deployable robots | Ziyang Zhou et.al. | 2505.09919 | null |
2025-05-15 | Large-Scale Gaussian Splatting SLAM | Zhe Xin et.al. | 2505.09915 | null |
2025-05-15 | SnapNCode: An Integrated Development Environment for Programming Physical Objects Interactions | Xiaoyan Wei et.al. | 2505.09882 | null |
2025-05-14 | A Framework for Identifying Non-van der Waals 2D Materials | Shota Ono et.al. | 2505.09853 | null |
2025-05-14 | Visual Feedback of Pattern Separability Improves Myoelectric Decoding Performance of Upper Limb Prostheses | Ruichen Yang et.al. | 2505.09819 | null |
2025-05-14 | Uncertainty of magnetic field in 3D NLTE inversions | Jiri Stepan et.al. | 2505.09708 | null |
2025-05-14 | An ion treatment planning framework for inclusion of nanodosimetric ionization detail through cluster dose | Simona Facchiano et.al. | 2505.09667 | null |
2025-05-14 | Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware | Justin Yu et.al. | 2505.09601 | null |
2025-05-14 | Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object Shapes | Nicola Marinello et.al. | 2505.09562 | null |
2025-05-14 | Bulk superinsulation and polar nematic order in nanopatterned NbTiN | A. Yu. Mironov et.al. | 2505.09547 | null |
2025-05-14 | Learned Free-Energy Functionals from Pair-Correlation Matching for Dynamical Density Functional Theory | Karnik Ram et.al. | 2505.09543 | null |
2025-05-14 | Uncovering the Varieties of Three-dimensional Hall-MHD Turbulence | Pratik Patel et.al. | 2505.09537 | null |
2025-05-15 | Decentralized Nonlinear Model Predictive Control-Based Flock Navigation with Real-Time Obstacle Avoidance in Unknown Obstructed Environments | Nuthasith Gerdpratoom et.al. | 2505.09434 | null |
2025-05-14 | Efficient LiDAR Reflectance Compression via Scanning Serialization | Jiahao Zhu et.al. | 2505.09433 | null |
2025-05-14 | MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection | Xiangyuan Peng et.al. | 2505.09422 | null |
2025-05-14 | Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians | Ma Changfeng et.al. | 2505.09413 | link |
2025-05-14 | UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units | Huakun Liu et.al. | 2505.09393 | link |
2025-05-14 | AfforDance: Personalized AR Dance Learning System with Visual Affordance | Hyunyoung Han et.al. | 2505.09376 | null |
2025-05-14 | APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression | Srinivas Ravuri et.al. | 2505.09356 | link |
2025-05-14 | Procedural Low-Poly Terrain Generation with Terracing for Computer Games | Richard Tivolt et.al. | 2505.09350 | link |
2025-05-14 | Neural Video Compression using 2D Gaussian Splatting | Lakshya Gupta et.al. | 2505.09324 | null |
2025-05-14 | Recent progress on electron- and magnon-mediated torques | Jia-Min Lai et.al. | 2505.09257 | null |
2025-05-15 | UniCAD: Efficient and Extendable Architecture for Multi-Task Computer-Aided Diagnosis System | Yitao Zhu et.al. | 2505.09178 | null |
2025-05-14 | TopoDiT-3D: Topology-Aware Diffusion Transformer with Bottleneck Structure for 3D Point Cloud Generation | Zechao Guan et.al. | 2505.09140 | link |
2025-05-15 | Quasi-3D beam theory based on equilibrium stress definition and mixed element model for accurate analysis of functionally graded beams | Wenxiong Li et.al. | 2505.09127 | null |
2025-05-14 | 2D-3D Attention and Entropy for Pose Robust 2D Facial Recognition | J. Brennan Peace et.al. | 2505.09073 | null |
2025-05-14 | Solving Reach- and Stabilize-Avoid Problems Using Discounted Reachability | Boyang Li et.al. | 2505.09067 | null |
2025-05-13 | Dynamic restrengthening and stress heterogeneity explain megathrust earthquake complexity | Jeremy Wing Ching Wong et.al. | 2505.08973 | null |
2025-05-13 | Multi-step manipulation task and motion planning guided by video demonstration | Kateryna Zorina et.al. | 2505.08949 | null |
2025-05-13 | Template-Guided Reconstruction of Pulmonary Segments with Neural Implicit Functions | Kangxian Xie et.al. | 2505.08919 | link |
2025-05-13 | Long timescale numerical simulations of large, super-critical accretion discs | P. Chris Fragile et.al. | 2505.08859 | null |
2025-05-12 | TUGS: Physics-based Compact Representation of Underwater Scenes by Tensorized Gaussian | Shijie Lian et.al. | 2505.08811 | null |
2025-05-13 | Full-volume aberration-space holography | Ian Christen et.al. | 2505.08777 | link |
2025-05-14 | Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology | Yatai Ji et.al. | 2505.08765 | null |
2025-05-13 | Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving | Zongchuang Zhao et.al. | 2505.08725 | link |
2025-05-13 | Quantum confinement theory of ultra-thin films: electronic, thermal and superconducting properties | Alessio Zaccone et.al. | 2505.08696 | null |
2025-05-13 | VIViT: Variable-Input Vision Transformer Framework for 3D MR Image Segmentation | Badhan Kumar Das et.al. | 2505.08693 | null |
2025-05-15 | Plastic deformation as a phase transition: a combinatorial model of plastic flow in copper single crystals | Afonso D. M. Barroso et.al. | 2505.08689 | null |
2025-05-13 | CAD-Coder:Text-Guided CAD Files Code Generation | Changqi He et.al. | 2505.08686 | null |
2025-05-13 | Topology and geometry optimization of grid-shells under self-weight loading | Helen E. Fairclough et.al. | 2505.08645 | null |
2025-05-13 | DLO-Splatting: Tracking Deformable Linear Objects Using 3D Gaussian Splatting | Holly Dinkel et.al. | 2505.08644 | null |
2025-05-13 | Learning cardiac activation and repolarization times with operator learning | Edoardo Centofanti et.al. | 2505.08631 | null |
2025-05-15 | A portable diagnosis model for Keratoconus using a smartphone | Yifan Li et.al. | 2505.08616 | null |
2025-05-13 | Probing the Universe’s Topology through a Quantum System? | Evangelos Achilleas Paraskevas et.al. | 2505.08603 | null |
2025-05-13 | MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment | Barak Pinkovich et.al. | 2505.08589 | null |
2025-05-13 | Building-Block Aware Generative Modeling for 3D Crystals of Metal Organic Frameworks | Chenru Duan et.al. | 2505.08531 | link |
2025-05-14 | Leveraging Segment Anything Model for Source-Free Domain Adaptation via Dual Feature Guided Auto-Prompting | Zheang Huai et.al. | 2505.08527 | link |
2025-05-13 | Experimental investigation of a novel liquid metal plasma facing component with pre-filled microstructures | Yi-Jun Wang et.al. | 2505.08512 | null |
2025-05-13 | FOCI: Trajectory Optimization on Gaussian Splats | Mario Gomez Andreu et.al. | 2505.08510 | null |
2025-05-13 | An MHD-based model for wind-driven disc-planet interactions | Michael Hammer et.al. | 2505.08505 | null |
2025-05-13 | Numerical Solution of Mixed-Dimensional PDEs Using a Neural Preconditioner | Nunzio Dimola et.al. | 2505.08491 | null |
2025-05-13 | A Non-planar ReBCO Test Coil with 3D-printed Aluminum Support Structure for the EPOS Stellarator | Paul Huslage et.al. | 2505.08488 | null |
2025-05-13 | A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering | Chuanzhi Xu et.al. | 2505.08438 | null |
2025-05-13 | MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM | Saqi Hussain Kalan et.al. | 2505.08388 | null |
2025-05-13 | Matched Asymptotic Expansions-Based Transferable Neural Networks for Singular Perturbation Problems | Zhequan Shen et.al. | 2505.08368 | null |
2025-05-13 | A numerically stable comoving frame solver for line radiative transfer | Thomas Ceulemans et.al. | 2505.08309 | null |
2025-05-13 | The Statistics of Gas Density, Velocity, and Magnetic Fields in Cool-Core Galaxy Clusters | Yue Hu et.al. | 2505.08275 | null |
2025-05-13 | The Evolutionary Map of the Universe: A new radio atlas for the southern hemisphere sky | A. M. Hopkins et.al. | 2505.08271 | null |
2025-05-13 | ACT-R: Adaptive Camera Trajectories for 3D Reconstruction from Single Image | Yizhi Wang et.al. | 2505.08239 | null |
2025-05-13 | Critical dynamics of three-dimensional $Z_N$ gauge models and the inverted XY universality class | Claudio Bonati et.al. | 2505.08236 | null |
2025-05-13 | CLTP: Contrastive Language-Tactile Pre-training for 3D Contact Geometry Understanding | Wenxuan Ma et.al. | 2505.08194 | null |
2025-05-15 | Insights into the 3D layered structure of nearby open clusters through N-body simulations | Kaixiang Lang et.al. | 2505.08184 | null |
2025-05-13 | Large Language Models for Computer-Aided Design: A Survey | Licheng Zhang et.al. | 2505.08137 | link |
2025-05-12 | SLAG: Scalable Language-Augmented Gaussian Splatting | Laszlo Szilagyi et.al. | 2505.08124 | null |
2025-05-12 | Recovery dynamics of a gap-engineered transmon after a quasiparticle burst | Heekun Nho et.al. | 2505.08104 | null |
2025-05-12 | Topology-Guided Knowledge Distillation for Efficient Point Cloud Processing | Luu Tung Hai et.al. | 2505.08101 | link |
2025-05-12 | RDD: Robust Feature Detector and Descriptor using Deformable Transformer | Gonglin Chen et.al. | 2505.08013 | null |
2025-05-14 | TSLFormer: A Lightweight Transformer Model for Turkish Sign Language Recognition Using Skeletal Landmarks | Kutay Ertürk et.al. | 2505.07890 | null |
2025-05-14 | Monocular Online Reconstruction with Enhanced Detail Preservation | Songyin Wu et.al. | 2505.07887 | null |
2025-05-07 | Pose Estimation for Intra-cardiac Echocardiography Catheter via AI-Based Anatomical Understanding | Jaeyoung Huh et.al. | 2505.07851 | null |
2025-05-12 | Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets | Weiyu Li et.al. | 2505.07747 | null |
2025-05-12 | Energetic consistency and heat transport in Fourier-Galerkin truncations of free slip 3D rotating convection | Jens D. M. Rademacher et.al. | 2505.07678 | null |
2025-05-12 | Flowing from the Ising Model on the Fuzzy Sphere to the 3D Lee-Yang CFT | Joan Elias Miro et.al. | 2505.07655 | null |
2025-05-12 | Kolmogorov scaling in bubble-induced turbulence | Tian Ma et.al. | 2505.07633 | null |
2025-05-12 | Higher-Order Convolution Improves Neural Predictivity in the Retina | Simone Azeglio et.al. | 2505.07620 | null |
2025-05-12 | Dynamic Object Geographic Coordinate Recognition: An Attitude-Free and Reference-Free Framework via Intrinsic Linear Algebraic Structures | Junfan Yi et.al. | 2505.07597 | null |
2025-05-12 | Fine-scale opposite-polarity magnetic fields in a solar plage revealed by integral field spectropolarimetry | G. Liu et.al. | 2505.07561 | null |
2025-05-12 | TPT-Bench: A Large-Scale, Long-Term and Robot-Egocentric Dataset for Benchmarking Target Person Tracking | Hanjing Ye et.al. | 2505.07446 | null |
2025-05-12 | Characterizing 3D Magnetic Fields and Turbulence in H I Clouds | Yue Hu et.al. | 2505.07422 | null |
2025-05-12 | Cosmic Ray Superdiffusion and Mirror Diffusion in Partially Ionized and Turbulent Medium | Yue Hu et.al. | 2505.07421 | null |
2025-05-12 | Empirical approaches to Frohlich excitonic polarons in polar semiconductors with application to 3D halide perovskites | Jacky Even et.al. | 2505.07406 | null |
2025-05-12 | DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection | Mingqian Ji et.al. | 2505.07398 | null |
2025-05-13 | TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset | Olaf Wysocki et.al. | 2505.07396 | null |
2025-05-12 | Feature Visualization in 3D Convolutional Neural Networks | Chunpeng Li et.al. | 2505.07387 | null |
2025-05-12 | Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection | Yuqi Cheng et.al. | 2505.07375 | link |
2025-05-12 | Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild | Lintao Xiang et.al. | 2505.07373 | null |
2025-05-12 | Global small-time approximate null and Lagrangian controllability of the viscous non-resistive MHD system in a $3D$ domain with Navier type boundary conditions | Jiajiang Liao et.al. | 2505.07366 | null |
2025-05-12 | Multi-Plane Vision Transformer for Hemorrhage Classification Using Axial and Sagittal MRI Data | Badhan Kumar Das et.al. | 2505.07349 | null |
2025-05-12 | Variational Quantum Monte Carlo investigations of the superconducting pairing in La $_3$Ni$_2$O$_7$ | Yi-Qun Liu et.al. | 2505.07341 | null |
2025-05-12 | Link to the Past: Temporal Propagation for Fast 3D Human Reconstruction from Monocular Video | Matthew Marchellus et.al. | 2505.07333 | null |
2025-05-12 | Enabling Privacy-Aware AI-Based Ergonomic Analysis | Sander De Coninck et.al. | 2505.07306 | null |
2025-05-13 | Human Motion Prediction via Test-domain-aware Adaptation with Easily-available Human Motions Estimated from Videos | Katsuki Shimbo et.al. | 2505.07301 | null |
2025-05-12 | Piloting Structure-Based Drug Design via Modality-Specific Optimal Schedule | Keyue Qiu et.al. | 2505.07286 | null |
2025-05-12 | Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking | Mohamed Nagy et.al. | 2505.07254 | null |
2025-05-12 | When Dance Video Archives Challenge Computer Vision | Philippe Colantoni et.al. | 2505.07249 | null |
2025-05-15 | Towards user-centered interactive medical image segmentation in VR with an assistive AI agent | Pascal Spiegler et.al. | 2505.07214 | null |
2025-05-13 | Generation of magnetic chiral solitons, skyrmions, and hedgehogs with electric fields | Teruya Nakagawara et.al. | 2505.07210 | null |
2025-05-12 | Nonuniqueness in law of stochastic 3d navierstokes equations with general multiplicative noise | Huaxiang Lv et.al. | 2505.07181 | null |
2025-05-11 | All Polyhedral Manifolds are Connected by a 2-Step Refolding | Lily Chung et.al. | 2505.07147 | null |
2025-05-11 | DriveSOTIF: Advancing Perception SOTIF Through Multimodal Large Language Models | Shucheng Huang et.al. | 2505.07084 | link |
2025-05-11 | Efficient and Robust Multidimensional Attention in Remote Physiological Sensing through Target Signal Constrained Factorization | Jitesh Joshi et.al. | 2505.07013 | null |
2025-05-11 | Hand-Shadow Poser | Hao Xu et.al. | 2505.07012 | null |
2025-05-11 | CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation | Peng Li et.al. | 2505.07003 | null |
2025-05-11 | Radio Map-Enabled 3D Trajectory and Communication Optimization for Low-Altitude Air-Ground Cooperation | Menghao Hu et.al. | 2505.06944 | null |
2025-05-11 | A proof of Onsager’s conjecture for the stochastic 3D Euler equations | Huaxiang Lü et.al. | 2505.06915 | null |
2025-05-11 | Energy-Efficient Ternary Encoding for High-Speed Data Transmission in 3D-Integrated Circuits Using Inductive Coupling Links | Abdullah Saeed Alghotmi et.al. | 2505.06908 | null |
2025-05-11 | Enhancing Monocular Height Estimation via Sparse LiDAR-Guided Correction | Jian Song et.al. | 2505.06905 | null |
2025-05-11 | Missing Data Estimation for MR Spectroscopic Imaging via Mask-Free Deep Learning Methods | Tan-Hanh Pham et.al. | 2505.06811 | null |
2025-05-10 | Self-organization of active rod suspensions on fluid membranes and thin viscous films | Arijit Mahapatra et.al. | 2505.06783 | null |
2025-05-10 | FALCON: Learning Force-Adaptive Humanoid Loco-Manipulation | Yuanhang Zhang et.al. | 2505.06776 | null |
2025-05-10 | Qualification of Bump Bonding in CMS Inner Tracker Pixel Modules for the Phase-2 Upgrade | Panagiotis Assiouras et.al. | 2505.06752 | null |
2025-05-10 | Mastering 3D-detection of Extensive Air Showers in Cherenkov Light | Elena A. Bonvech et.al. | 2505.06723 | null |
2025-05-10 | Evolving dunes under flow reversals: from an initial heap toward an inverted dune | Willian Righi Assis et.al. | 2505.06707 | null |
2025-05-13 | Detection of Moving Objects Using Self-motion Constraints on Optic Flow | Hope Lutwak et.al. | 2505.06686 | null |
2025-05-10 | 3D Characterization of Smoke Plume Dispersion Using Multi-View Drone Swarm | Nikil Krishnakumar et.al. | 2505.06638 | null |
2025-05-10 | Magnetic field morphologies in convective zones influenced by a turbulent surface layer | Anna Guseva et.al. | 2505.06618 | null |
2025-05-10 | GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images | Chengfeng Wang et.al. | 2505.06575 | null |
2025-05-10 | ElectricSight: 3D Hazard Monitoring for Power Lines Using Low-Cost Sensors | Xingchen Li et.al. | 2505.06573 | null |
2025-05-10 | Virtualized 3D Gaussians: Flexible Cluster-based Level-of-Detail System for Real-Time Rendering of Composed Scenes | Xijie Yang et.al. | 2505.06523 | null |
2025-05-10 | Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model Capabilities | Haoyang Xie et.al. | 2505.06507 | link |
2025-05-10 | FlexNeRFer: A Multi-Dataflow, Adaptive Sparsity-Aware Accelerator for On-Device NeRF Rendering | Seock-Hwan Noh et.al. | 2505.06504 | null |
2025-05-09 | LMLCC-Net: A Semi-Supervised Deep Learning Model for Lung Nodule Malignancy Prediction from CT Scans using a Novel Hounsfield Unit-Based Intensity Filtering | Adhora Madhuri et.al. | 2505.06370 | null |
2025-05-07 | IIKL: Isometric Immersion Kernel Learning with Riemannian Manifold for Geometric Preservation | Zihao Chen et.al. | 2505.06288 | null |
2025-05-09 | Anymate: A Dataset and Baselines for Learning 3D Object Rigging | Yufan Deng et.al. | 2505.06227 | null |
2025-05-09 | VIN-NBV: A View Introspection Network for Next-Best-View Selection for Resource-Efficient 3D Reconstruction | Noah Frahm et.al. | 2505.06219 | null |
2025-05-09 | Neuro-Symbolic Concepts | Jiayuan Mao et.al. | 2505.06191 | null |
2025-05-09 | Active Perception for Tactile Sensing: A Task-Agnostic Attention-Based Approach | Tim Schneider et.al. | 2505.06182 | null |
2025-05-09 | DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models | Radu Alexandru Rosu et.al. | 2505.06166 | null |
2025-05-09 | S2MNet: Speckle-To-Mesh Net for Three-Dimensional Cardiac Morphology Reconstruction via Echocardiogram | Xilin Gong et.al. | 2505.06105 | null |
2025-05-09 | Fault Diagnosis of 3D-Printed Scaled Wind Turbine Blades | Luis Miguel Esquivel-Sancho et.al. | 2505.06080 | null |
2025-05-09 | Document Image Rectification Bases on Self-Adaptive Multitask Fusion | Heng Li et.al. | 2505.06038 | null |
2025-05-09 | Why Are You Wrong? Counterfactual Explanations for Language Grounding with 3D Objects | Tobias Preintner et.al. | 2505.06030 | link |
2025-05-09 | P-CORONA: A New Tool for Calculating the Intensity and Polarization of Coronal Lines in 3D Models of the Solar Corona | Supriya Hebbur Dayananda et.al. | 2505.05962 | null |
2025-05-09 | Achieving 3D Attention via Triplet Squeeze and Excitation Block | Maan Alhazmi et.al. | 2505.05943 | null |
2025-05-09 | Human causal perception in a cube-stacking task | Nikolai Bahr et.al. | 2505.05923 | null |
2025-05-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al. | 2505.05901 | link |
2025-05-09 | A 3D pocket-aware and evolutionary conserved interaction guided diffusion model for molecular optimization | Anjie Qiao et.al. | 2505.05874 | null |
2025-05-09 | RefRef: A Synthetic Dataset and Benchmark for Reconstructing Refractive and Reflective Objects | Yue Yin et.al. | 2505.05848 | null |
2025-05-09 | Computational Homogenization in 3D Magnetostatics using E3C Hyper-Reduction | Hauke Goldbeck et.al. | 2505.05836 | null |
2025-05-09 | Artificial intelligence pioneers the double-strangeness factory | Yan He et.al. | 2505.05802 | null |
2025-05-09 | Mitigating Singlet Exciton Back-Transfer using 2D Spacer Layers for Perovskite-Sensitised Upconversion | Nicholas P. Sloane et.al. | 2505.05801 | null |
2025-05-09 | 3D CAVLA: Leveraging Depth and 3D Context to Generalize Vision Language Action Models for Unseen Tasks | Vineet Bhat et.al. | 2505.05800 | null |
2025-05-09 | Hybrid Learning: A Novel Combination of Self-Supervised and Supervised Learning for MRI Reconstruction without High-Quality Training Reference | Haoyang Pei et.al. | 2505.05703 | null |
2025-05-08 | TeGA: Texture Space Gaussian Avatars for High-Resolution Dynamic Head Modeling | Gengyan Li et.al. | 2505.05672 | null |
2025-05-08 | The Moon’s Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction | Tom Sander et.al. | 2505.05644 | null |
2025-05-08 | UltraGauss: Ultrafast Gaussian Reconstruction of 3D Ultrasound Volumes | Mark C. Eid et.al. | 2505.05643 | null |
2025-05-08 | Designing 3D Anisotropic Frame Fields with Odeco Tensors | Haikuan Zhu et.al. | 2505.05639 | null |
2025-05-08 | CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | Weichen Zhang et.al. | 2505.05622 | link |
2025-05-08 | QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization | Yueh-Cheng Liu et.al. | 2505.05591 | null |
2025-05-08 | Steepest Descent Density Control for Compact 3D Gaussian Splatting | Peihao Wang et.al. | 2505.05587 | null |
2025-05-08 | LaZagna: An Open-Source Framework for Flexible 3D FPGA Architectural Exploration | Ismael Youssef et.al. | 2505.05579 | null |
2025-05-08 | GaMNet: A Hybrid Network with Gabor Fusion and NMamba for Efficient 3D Glioma Segmentation | Chengwei Ye et.al. | 2505.05520 | null |
2025-05-13 | Web2Grasp: Learning Functional Grasps from Web Images of Hand-Object Interactions | Hongyi Chen et.al. | 2505.05517 | null |
2025-05-07 | Occupancy World Model for Robots | Zhang Zhang et.al. | 2505.05512 | null |
2025-05-07 | Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation | Yiming Qin et.al. | 2505.05505 | link |
2025-05-05 | Learning 3D Persistent Embodied World Models | Siyuan Zhou et.al. | 2505.05495 | null |
2025-05-08 | SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation | Yonwoo Choi et.al. | 2505.05475 | link |
2025-05-08 | 3D Scene Generation: A Survey | Beichen Wen et.al. | 2505.05474 | link |
2025-05-08 | DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion | Qitao Zhao et.al. | 2505.05473 | null |
2025-05-08 | Generating Physically Stable and Buildable LEGO Designs from Text | Ava Pun et.al. | 2505.05469 | link |
2025-05-08 | GesPrompt: Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality | Xiyun Hu et.al. | 2505.05441 | null |
2025-05-08 | Boundary Energy-Momentum Tensors for Asymptotically Flat Spacetimes | Jelle Hartong et.al. | 2505.05432 | null |
2025-05-08 | Representing spherical tensors with scalar-based machine-learning models | Michelangelo Domina et.al. | 2505.05404 | null |
2025-05-09 | Scheimpflug cameras for range-resolved observations of the atmospheric effects on laser propagation | Nathan Meraz et.al. | 2505.05399 | null |
2025-05-08 | PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model | Zhang Zhang et.al. | 2505.05397 | null |
2025-05-09 | GeomHair: Reconstruction of Hair Strands from Colorless 3D Scans | Rachmadio Noval Lazuardi et.al. | 2505.05376 | null |
2025-05-08 | Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields | Runfeng Li et.al. | 2505.05356 | null |
2025-05-08 | Progressive Inertial Poser: Progressive Real-Time Kinematic Chain Estimation for 3D Full-Body Pose from Three IMU Sensors | Zunjie Zhu et.al. | 2505.05336 | null |
2025-05-08 | SmartTrap: Automated Precision Experiments with Optical Tweezers | Martin Selin et.al. | 2505.05290 | null |
2025-05-08 | PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes | Ahmed Abdelreheem et.al. | 2505.05288 | null |
2025-05-08 | Three dimensional seepage analysis using a polyhedral scaled boundary finite element method | Mingjiao Yan et.al. | 2505.05244 | null |
2025-05-08 | Planar fault-tolerant circuits for non-Clifford gates on the 2D color code | Andreas Bauer et.al. | 2505.05175 | null |
2025-05-08 | Automated vision-based assistance tools in bronchoscopy: stenosis severity estimation | Clara Tomasini et.al. | 2505.05136 | null |
2025-05-08 | The negative symmetry classification problem | M. P. Kolesnikov et.al. | 2505.05096 | null |
2025-05-08 | Seismic first-arrival traveltime simulation based on reciprocity-constrained PINN | Hang Geng et.al. | 2505.05061 | null |
2025-05-08 | Uncertainty-Aware Scarf Plots | Nelusa Pathmanathan et.al. | 2505.05038 | null |
2025-05-08 | SOAP: Style-Omniscient Animatable Portraits | Tingting Liao et.al. | 2505.05022 | link |
2025-05-08 | Improving Global Motion Estimation in Sparse IMU-based Motion Capture with Physics | Xinyu Yi et.al. | 2505.05010 | null |
2025-05-08 | ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment | Wanjiang Weng et.al. | 2505.04974 | null |
2025-05-08 | DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding | Henry Zheng et.al. | 2505.04965 | null |
2025-05-08 | MoRe-3DGSMR: Motion-resolved reconstruction framework for free-breathing pulmonary MRI based on 3D Gaussian representation | Tengya Peng et.al. | 2505.04959 | null |
2025-05-08 | Advanced 3D Imaging Approach to TSV/TGV Metrology and Inspection Using Only Optical Microscopy | Gugeong Sung et.al. | 2505.04913 | null |
2025-05-08 | SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models | Shun Taguchi et.al. | 2505.04911 | null |
2025-05-08 | A Multi-Agent AI Framework for Immersive Audiobook Production through Spatial Audio and Neural Narration | Shaja Arul Selvamani et.al. | 2505.04885 | null |
2025-05-07 | Seeing Cells Clearly: Evaluating Machine Vision Strategies for Microglia Centroid Detection in 3D Images | Youjia Zhang et.al. | 2505.04838 | null |
2025-05-07 | Steerable Scene Generation with Post Training and Inference-Time Search | Nicholas Pfaff et.al. | 2505.04831 | link |
2025-05-07 | WIR3D: Visually-Informed and Geometry-Aware 3D Shape Abstraction | Richard Liu et.al. | 2505.04813 | null |
2025-05-07 | Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World | Bangyan Liao et.al. | 2505.04788 | link |
2025-05-07 | Hybrid-Field 6D Movable Antenna for Terahertz Communications: Channel Modeling and Estimation | Xiaodan Shao et.al. | 2505.04753 | null |
2025-05-07 | Towards a Vision-Language Episodic Memory Framework: Large-scale Pretrained Model-Augmented Hippocampal Attractor Dynamics | Chong Li et.al. | 2505.04752 | link |
2025-05-07 | Constrained Hamiltonian dynamics of 3D gravity-coupled topological matter | Omar Rodríguez-Tzompantzi et.al. | 2505.04745 | null |
2025-05-07 | LLM Code Customization with Visual Results: A Benchmark on TikZ | Charly Reux et.al. | 2505.04670 | null |
2025-05-07 | SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction | Xinran Yang et.al. | 2505.04668 | link |
2025-05-07 | Advancing 3D Medical Image Segmentation: Unleashing the Potential of Planarian Neural Networks in Artificial Intelligence | Ziyuan Huang et.al. | 2505.04664 | null |
2025-05-07 | Crafting Physical Adversarial Examples by Combining Differentiable and Physically Based Renders | Yuqiu Liu et.al. | 2505.04662 | null |
2025-05-07 | GSsplat: Generalizable Semantic Gaussian Splatting for Novel-view Synthesis in 3D Scenes | Feng Xiao et.al. | 2505.04659 | link |
2025-05-07 | MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation | Zilong Chen et.al. | 2505.04656 | link |
2025-05-07 | PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer | Jingwen Ye et.al. | 2505.04622 | null |
2025-05-07 | Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond | Jessie Richter-Powell et.al. | 2505.04621 | null |
2025-05-07 | FastMap: Revisiting Dense and Scalable Structure from Motion | Jiahao Li et.al. | 2505.04612 | null |
2025-05-09 | MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection | Zhihao Zhang et.al. | 2505.04594 | null |
2025-05-08 | TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh Optimization | Alexandre Binninger et.al. | 2505.04590 | link |
2025-05-07 | Registration of 3D Point Sets Using Exponential-based Similarity Matrix | Ashutosh Singandhupe et.al. | 2505.04540 | link |
2025-05-07 | Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model | Pengfei Guo et.al. | 2505.04522 | null |
2025-05-08 | FA-KPConv: Introducing Euclidean Symmetries to KPConv via Frame Averaging | Ali Alawieh et.al. | 2505.04485 | null |
2025-05-07 | CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation | Jiahao Li et.al. | 2505.04481 | null |
2025-05-08 | Spectroscopic investigations of a filament reconnecting with coronal loops during a two-ribbon solar flare | Reetika Joshi et.al. | 2505.04479 | null |
2025-05-07 | MFSeg: Efficient Multi-frame 3D Semantic Segmentation | Chengjie Huang et.al. | 2505.04408 | null |
2025-05-07 | Geometry-Aware Texture Generation for 3D Head Modeling with Artist-driven Control | Amin Fadaeinejad et.al. | 2505.04387 | null |
2025-05-07 | Label-efficient Single Photon Images Classification via Active Learning | Zili Zhang et.al. | 2505.04376 | null |
2025-05-07 | Global solutions to 3D compressible MHD equations with partial magnetic diffusion | Jiahong Wu et.al. | 2505.04351 | null |
2025-05-07 | Atmospheric loss during giant impacts: mechanisms and scaling of near- and far-field loss | Matthew J. Roche et.al. | 2505.04343 | link |
2025-05-07 | 3D-Integrated Superconducting qubits: CMOS-Compatible, Wafer-Scale Processing for Flip-Chip Architectures | T. Mayer et.al. | 2505.04337 | null |
2025-05-07 | A hybridizable discontinuous Galerkin method with transmission variables for time-harmonic electromagnetic problems | Ari E. Rappaport et.al. | 2505.04288 | null |
2025-05-07 | HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation | Yajie Fu et.al. | 2505.04276 | link |
2025-05-07 | Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting | Feng Yang et.al. | 2505.04262 | null |
2025-05-07 | Technology prediction of a 3D model using Neural Network | Grzegorz Miebs et.al. | 2505.04241 | null |
2025-05-07 | Low Resolution Next Best View for Robot Packing | Giuseppe Fabio Preziosa et.al. | 2505.04228 | null |
2025-05-12 | The stability of generalized phase retrieval problem over compact groups | Tal Amir et.al. | 2505.04190 | null |
2025-05-07 | S3D: Sketch-Driven 3D Model Generation | Hail Song et.al. | 2505.04185 | link |
2025-05-07 | A Framework to Prevent Biometric Data Leakage in the Immersive Technologies Domain | Keshav Sood et.al. | 2505.04123 | null |
2025-05-07 | GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model | Zixiang Ai et.al. | 2505.04119 | link |
2025-05-07 | One2Any: One-Reference 6D Pose Estimation for Any Object | Mengya Liu et.al. | 2505.04109 | null |
2025-05-07 | 3D Brain MRI Classification for Alzheimer Diagnosis Using CNN with Data Augmentation | Thien Nhan Vo et.al. | 2505.04097 | null |
2025-05-07 | Plexus: Taming Billion-edge Graphs with 3D Parallel GNN Training | Aditya K. Ranjan et.al. | 2505.04083 | null |
2025-05-07 | AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual Grounding | Feng Xiao et.al. | 2505.04058 | link |
2025-05-07 | Person-In-Situ: Scene-Consistent Human Image Insertion with Occlusion-Aware Pose Control | Shun Masuda et.al. | 2505.04052 | null |
2025-05-07 | TerraFusion: Joint Generation of Terrain Geometry and Texture Using Latent Diffusion Models | Kazuki Higo et.al. | 2505.04050 | null |
2025-05-06 | Space-Time Elastic Metamaterials for Zero-Frequency and Zero-Wavenumber Bandgaps | Brahim Lemkalli et.al. | 2505.04012 | null |
2025-05-06 | nuGAN: Generative Adversarial Emulator for Cosmic Web with Neutrinos | Neerav Kaushal et.al. | 2505.03936 | null |
2025-05-06 | Coronal rain formation in a two-fluid approximation | Beatrice Popescu Braileanu et.al. | 2505.03930 | null |
2025-05-06 | Transdimensional anomalous Hall effect in rhombohedral thin graphite | Qingxin Li et.al. | 2505.03891 | null |
2025-05-06 | Orthosymplectic Quivers: Indices, Hilbert Series, and Generalised Symmetries | William Harding et.al. | 2505.03875 | null |
2025-05-05 | A Deep Learning approach for Depressive Symptoms assessment in Parkinson’s disease patients using facial videos | Ioannis Kyprakis et.al. | 2505.03845 | null |
2025-05-04 | PointExplainer: Towards Transparent Parkinson’s Disease Diagnosis | Xuechao Wang et.al. | 2505.03833 | null |
2025-04-30 | Neural Co-Optimization of Structural Topology, Manufacturable Layers, and Path Orientations for Fiber-Reinforced Composites | Tao Liu et.al. | 2505.03779 | null |
2025-05-06 | Effects of transient stellar emissions on planetary climates of tidally-locked exo-earths | Howard Chen et.al. | 2505.03723 | null |
2025-05-13 | Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach | Srecharan Selvam et.al. | 2505.03702 | null |
2025-05-06 | Revolutionizing Brain Tumor Imaging: Generating Synthetic 3D FA Maps from T1-Weighted MRI using CycleGAN Models | Xin Du et.al. | 2505.03662 | null |
2025-05-06 | Stabilizing 3D EPI time series by servo navigation and phase equalization exploiting repeated shots (PEERS) | Malte Riedel et.al. | 2505.03637 | null |
2025-05-06 | Learning Knowledge-based Prompts for Robust 3D Mask Presentation Attack Detection | Fangling Jiang et.al. | 2505.03610 | null |
2025-05-06 | An Enriched Immersed Finite Element Method for 3D Interface Problems | Ruchi Guo et.al. | 2505.03598 | null |
2025-05-06 | RAIL: Region-Aware Instructive Learning for Semi-Supervised Tooth Segmentation in CBCT | Chuyu Zhao et.al. | 2505.03538 | link |
2025-05-06 | High-order exponential solver method for particle-in-cell simulations | Szilárd Majorosi et.al. | 2505.03518 | null |
2025-05-06 | UPMAD-Net: A Brain Tumor Segmentation Network with Uncertainty Guidance and Adaptive Multimodal Feature Fusion | Zhanyuan Jia et.al. | 2505.03494 | link |
2025-05-06 | Blending 3D Geometry and Machine Learning for Multi-View Stereopsis | Vibhas Vats et.al. | 2505.03470 | link |
2025-05-15 | Universal Cosmologies | Paul Marconnet et.al. | 2505.03449 | null |
2025-05-06 | AquaticVision: Benchmarking Visual SLAM in Underwater Environment with Events and Frames | Yifan Peng et.al. | 2505.03448 | null |
2025-05-06 | O(5) multicriticality in the 3D two flavor SU(2) lattice gauge Higgs model | Claudio Bonati et.al. | 2505.03446 | null |
2025-05-07 | manvr3d: A Platform for Human-in-the-loop Cell Tracking in Virtual Reality | Samuel Pantze et.al. | 2505.03440 | link |
2025-05-06 | LiftFeat: 3D Geometry-Aware Local Feature Matching | Yepeng Liu et.al. | 2505.03422 | link |
2025-05-06 | 3D Surface Reconstruction with Enhanced High-Frequency Details | Shikun Zhang et.al. | 2505.03362 | null |
2025-05-06 | Hierarchical dynamic domain decomposition for the multiscale Boltzmann equation | Domenico Caparello et.al. | 2505.03360 | null |
2025-05-06 | GUAVA: Generalizable Upper Body 3D Gaussian Avatar | Dongbin Zhang et.al. | 2505.03351 | null |
2025-05-06 | 3D Gaussian Splatting Data Compression with Mixture of Priors | Lei Liu et.al. | 2505.03310 | null |
2025-05-06 | 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation | Andrew Caunes et.al. | 2505.03300 | null |
2025-05-06 | OccCylindrical: Multi-Modal Fusion with Cylindrical Representation for 3D Semantic Occupancy Prediction | Zhenxing Ming et.al. | 2505.03284 | null |
2025-05-06 | Anomalous Effects in Single-slit Diffraction of Light at Relativistic Intensities | Longqing Yi et.al. | 2505.03199 | null |
2025-05-06 | InfoVids: Reimagining the Viewer Experience with Alternative Visualization-Presenter Relationships | Ji Won Chung et.al. | 2505.03164 | null |
2025-05-07 | Motion-compensated cardiac MRI using low-rank diffeomorphic flow (DMoCo) | Joseph Kettelkamp et.al. | 2505.03149 | null |
2025-05-06 | HCOA: Hierarchical Class-ordered A for Navigation in Semantic Environments | Evangelos Psomiadis et.al. | 2505.03128 | null |
2025-05-15 | Estimating the Diameter at Breast Height of Trees in a Forest With a Single 360 Camera | Siming He et.al. | 2505.03093 | null |
2025-05-05 | Multiscale Parallel Simulation of Malignant Pleural Mesothelioma via Adaptive Domain Partitioning – an Efficiency Analysis Study | Anton Dolganov et.al. | 2505.03067 | null |
2025-05-05 | Dual Prompting for Diverse Count-level PET Denoising | Xiaofeng Liu et.al. | 2505.03037 | null |
2025-05-05 | Revisiting Performance Models of Distal Pointing Tasks in Virtual Reality | Logan Lane et.al. | 2505.03027 | null |
2025-05-05 | Magneto-optical trap loading with an effusive oven in a large optical access experiment | M. Gaudesius et.al. | 2505.03008 | null |
2025-05-05 | Multi-channel second-order topological states in 3D Dirac semimetal Bi ${0.97}$Sb${0.03}$ | Biplab Bhattacharyya et.al. | 2505.02995 | null |
2025-05-05 | Orbital Entanglement and The Double $d$ -Shell Effect in Binary Transition Metal Molecules | Julianne S. Lampert et.al. | 2505.02930 | null |
2025-05-05 | A Chiral-Planar dualization algorithm for $3d$ $\mathcal{N}=2$ Chern-Simons-matter theories | Sergio Benvenuti et.al. | 2505.02913 | null |
2025-05-05 | Weighted FFT estimators for 1D and 3D correlations of the Lyman- $α$ forest | Martine Lokken et.al. | 2505.02904 | null |
2025-04-29 | Floating Car Observers in Intelligent Transportation Systems: Detection Modeling and Temporal Insights | Jeremias Gerner et.al. | 2505.02845 | null |
2025-05-05 | Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation | Lu Ling et.al. | 2505.02836 | null |
2025-05-06 | Towards Application-Specific Evaluation of Vision Models: Case Studies in Ecology and Biology | Alex Hoi Hang Chan et.al. | 2505.02825 | null |
2025-05-05 | Freely propagating flanks of wide coronal-mass-ejection-driven shocks: Modelling and observational insights | N. Wijsen et.al. | 2505.02794 | null |
2025-05-05 | DeepSparse: A Foundation Model for Sparse-View CBCT Reconstruction | Yiqun Lin et.al. | 2505.02628 | null |
2025-05-08 | Marker-Based Extrinsic Calibration Method for Accurate Multi-Camera 3D Reconstruction | Nahuel Garcia-D’Urso et.al. | 2505.02539 | link |
2025-05-05 | Corr2Distrib: Making Ambiguous Correspondences an Ally to Predict Reliable 6D Pose Distributions | Asma Brazi et.al. | 2505.02501 | null |
2025-05-05 | Point Cloud Recombination: Systematic Real Data Augmentation Using Robotic Targets for LiDAR Perception Validation | Hubert Padusinski et.al. | 2505.02476 | null |
2025-05-12 | FASTDASH: An Implementation of 3D Earthquake Cycle Simulation on Complex Fault Systems Using the Boundary Element Method Accelerated by H-matrices | Jinhui Cheng et.al. | 2505.02398 | null |
2025-05-05 | MetaScenes: Towards Automated Replica Creation for Real-world 3D Scans | Huangyue Yu et.al. | 2505.02388 | null |
2025-05-05 | A comparison of dust content and properties in GAMA/G10-COSMOS/3D-HST and SIMBA cosmological simulations | Trevor Butrum et.al. | 2505.02359 | null |
2025-05-05 | Sloshing suppression with a controlled elastic baffle via deep reinforcement learning and SPH simulation | Mai Ye et.al. | 2505.02354 | null |
2025-05-05 | TeDA: Boosting Vision-Lanuage Models for Zero-Shot 3D Object Retrieval via Testing-time Distribution Alignment | Zhichuan Wang et.al. | 2505.02325 | link |
2025-05-14 | Dexterous Contact-Rich Manipulation via the Contact Trust Region | H. J. Terry Suh et.al. | 2505.02291 | null |
2025-05-04 | Continuous Normalizing Flows for Uncertainty-Aware Human Pose Estimation | Shipeng Liu et.al. | 2505.02287 | null |
2025-05-04 | Analysis of a 3D Integrated Superconducting Quantum Chip Structure | James Saslow et.al. | 2505.02263 | null |
2025-05-04 | RISE: Radius of Influence based Subgraph Extraction for 3D Molecular Graph Explanation | Jingxiang Qu et.al. | 2505.02247 | link |
2025-05-04 | Topological Surface States of 3D Topological Insulator on Twisted Bilayer Graphene | Yoonkang Kim et.al. | 2505.02187 | null |
2025-05-04 | Sparfels: Fast Reconstruction from Sparse Unposed Imagery | Shubhendu Jena et.al. | 2505.02178 | null |
2025-05-04 | SparSplat: Fast Multi-View Reconstruction with Generalizable 2D Gaussian Splatting | Shubhendu Jena et.al. | 2505.02175 | null |
2025-05-04 | Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving | Alexey Nekrasov et.al. | 2505.02148 | null |
2025-05-14 | GarmentGS: Point-Cloud Guided Gaussian Splatting for High-Fidelity Non-Watertight 3D Garment Reconstruction | Zhihao Tang et.al. | 2505.02126 | null |
2025-05-04 | Simulation Based Control Architecture Using Webots and Simulink | Harun Kurt et.al. | 2505.02081 | null |
2025-05-04 | HandOcc: NeRF-based Hand Rendering with Occupancy Networks | Maksym Ivashechkin et.al. | 2505.02079 | null |
2025-05-04 | A survey of knots and quivers | Shivrat Sachdeva et.al. | 2505.02059 | null |
2025-05-04 | Holographic Radiance Cascades for 2D Global Illumination | Rouli Freeman et.al. | 2505.02041 | null |
2025-05-04 | Aokana: A GPU-Driven Voxel Rendering Framework for Open World Games | Yingrong Fang et.al. | 2505.02017 | null |
2025-05-04 | Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance Fields | Zhenxing Mi et.al. | 2505.02005 | link |
2025-05-04 | Closed-loop control of seizure activity via real-time seizure forecasting by reservoir neuromorphic computing | Maryam Sadeghi et.al. | 2505.02003 | null |
2025-05-04 | MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection | Jiayi Cheng et.al. | 2505.01969 | null |
2025-05-04 | Training Environment for High Performance Reinforcement Learning | Greg Search et.al. | 2505.01953 | null |
2025-05-04 | UNet-3D with Adaptive TverskyCE Loss for Pancreas Medical Image Segmentation | Xubei Zhang et.al. | 2505.01951 | null |
2025-05-03 | 3D neuron growth and neurodevelopmental disorder modeling based on truncated hierarchical B-splines with multi-level local refinements | Kuanren Qian et.al. | 2505.01940 | null |
2025-05-03 | HybridGS: High-Efficiency Gaussian Splatting Data Compression using Dual-Channel Sparse Representation and Point Cloud Encoder | Qi Yang et.al. | 2505.01938 | link |
2025-05-10 | OT-Talk: Animating 3D Talking Head with Optimal Transportation | Xinmu Wang et.al. | 2505.01932 | null |
2025-05-03 | GenSync: A Generalized Talking Head Framework for Audio-driven Multi-Subject Lip-Sync using 3D Gaussian Splatting | Anushka Agarwal et.al. | 2505.01928 | null |
2025-05-06 | Roughness-Limited Performance in Ultra-Low-Loss Lithium Niobate Cavities | Ali Khalatpour et.al. | 2505.01913 | null |
2025-05-03 | Rethinking Score Distilling Sampling for 3D Editing and Generation | Xingyu Miao et.al. | 2505.01888 | null |
2025-05-03 | Visual enhancement and 3D representation for underwater scenes: a review | Guoxi Huang et.al. | 2505.01869 | null |
2025-05-03 | DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Haoteng Li et.al. | 2505.01857 | null |
2025-05-03 | MVHumanNet++: A Large-scale Dataset of Multi-view Daily Dressing Human Captures with Richer Annotations for 3D Human Digitization | Chenghong Li et.al. | 2505.01838 | null |
2025-05-03 | Near-field 5D Pose Estimation using Reconfigurable Intelligent Surfaces | Srikar Sharma Sadhu et.al. | 2505.01829 | null |
2025-05-03 | 3DWG: 3D Weakly Supervised Visual Grounding via Category and Instance-Level Alignment | Xiaoqi Li et.al. | 2505.01809 | null |
2025-05-03 | Efficient 3D Full-Body Motion Generation from Sparse Tracking Inputs with Temporal Windows | Georgios Fotios Angelis et.al. | 2505.01802 | null |
2025-05-03 | AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting | Junhao Shi et.al. | 2505.01799 | null |
2025-05-03 | Co $^{3}$ Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion | Xingqun Qi et.al. | 2505.01746 | null |
2025-05-03 | Learning Multi-frame and Monocular Prior for Estimating Geometry in Dynamic Scenes | Seong Hyeon Park et.al. | 2505.01737 | null |
2025-05-03 | Probabilistic Interactive 3D Segmentation with Hierarchical Neural Processes | Jie Liu et.al. | 2505.01726 | null |
2025-05-03 | Speculative Evolution Through 3D Cellular Automata | Amir Hossein Khazaei et.al. | 2505.01692 | null |
2025-05-03 | A Novel WaveInst-based Network for Tree Trunk Structure Extraction and Pattern Analysis in Forest Inventory | Chenyang Fan et.al. | 2505.01656 | null |
2025-05-03 | T-REX: Vision-Based System for Autonomous Leaf Detection and Grasp Estimation | Srecharan Selvam et.al. | 2505.01654 | null |
2025-05-03 | Topological Quantum Statistical Mechanics and Topological Quantum Field Theories | Zhidong Zhang et.al. | 2505.01653 | null |
2025-05-02 | Triangle-Decomposable Graphs for Isoperimetric Robots | Nathan Usevitch et.al. | 2505.01624 | null |
2025-05-02 | A low-loss, 24-mode laser-written universal photonic processor in a glass-based platform | Andrea Barzaghi et.al. | 2505.01609 | null |
2025-05-13 | Aerial Path Online Planning for Urban Scene Updation | Mingfeng Tang et.al. | 2505.01486 | null |
2025-05-02 | VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models | Mohammadreza Teymoorianfard et.al. | 2505.01406 | link |
2025-05-02 | FalconWing: An Open-Source Platform for Ultra-Light Fixed-Wing Aircraft Research | Yan Miao et.al. | 2505.01383 | null |
2025-05-02 | FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors | Chenxi Li et.al. | 2505.01322 | null |
2025-05-02 | Model See Model Do: Speech-Driven Facial Animation with Style Control | Yifang Pan et.al. | 2505.01319 | null |
2025-05-05 | Direct Evidence of Metal-Ligand Redox in Li-ion Battery Cathodes | Galo J. Paez Fajardo et.al. | 2505.01251 | null |
2025-05-02 | Asymptotic Linear Convergence of ADMM for Isotropic TV Norm Compressed Sensing | Emmanuel Gil Torres et.al. | 2505.01240 | null |
2025-05-02 | Compensating Spatiotemporally Inconsistent Observations for Online Dynamic 3D Gaussian Splatting | Youngsik Yun et.al. | 2505.01235 | null |
2025-05-02 | One Target, Many Views: Multi-User Fusion for Collaborative Uplink ISAC | Sajad Daei et.al. | 2505.01223 | null |
2025-05-02 | Performance of Cell-Free Massive MIMO in Realistic Urban Propagation Environments | Yunlu Xiao et.al. | 2505.01222 | null |
2025-05-02 | High Dynamic Range Novel View Synthesis with Single Exposure | Kaixuan Zhang et.al. | 2505.01212 | null |
2025-05-02 | Efficient Vision-based Vehicle Speed Estimation | Andrej Macko et.al. | 2505.01203 | null |
2025-05-05 | TSTMotion: Training-free Scene-aware Text-to-motion Generation | Ziyan Guo et.al. | 2505.01182 | null |
2025-05-02 | Stochastic Hartree NLS in 3d coming from a Many-Body Quantum System with White Noise Potential | Francesco Carlo De Vecchi et.al. | 2505.01157 | null |
2025-05-02 | NeuroLoc: Encoding Navigation Cells for 6-DOF Camera Localization | Xun Li et.al. | 2505.01113 | null |
2025-05-02 | Enhancing MHD model accuracy and CME forecasting by constraining coronal plasma properties with Faraday rotation | Salvatore Mancuso et.al. | 2505.01080 | null |
2025-05-02 | Quasi-Static IRS: 3D Shaped Beamforming for Area Coverage Enhancement | Zhenyu Jiang et.al. | 2505.01076 | null |
2025-05-02 | Phase-shifting structured illumination with polarization-encoded metasurface | Linzhi Yu et.al. | 2505.01051 | null |
2025-05-02 | 3D Human Pose Estimation via Spatial Graph Order Attention and Temporal Body Aware Transformer | Kamel Aouaidjia et.al. | 2505.01003 | link |
2025-05-02 | Optimizing Indoor Farm Monitoring Efficiency Using UAV: Yield Estimation in a GNSS-Denied Cherry Tomato Greenhouse | Taewook Park et.al. | 2505.00995 | null |
2025-05-02 | Classification of Principle 3D Slices of Filled-in Julia Sets in Multicomplex Spaces | Quentin Charles et.al. | 2505.00957 | null |
2025-05-02 | Enhancing Realism in Holographic Augmented Reality Displays through Occlusion Handling | Woongseob Han et.al. | 2505.00942 | null |
2025-05-02 | Autonomous Embodied Agents: When Robotics Meets Deep Learning Reasoning | Roberto Bigazzi et.al. | 2505.00935 | link |
2025-05-01 | Magnetic excitons in non-magnetic CrCl3 | Georgy Ermolaev et.al. | 2505.00920 | null |
2025-05-01 | The Comparability of Model Fusion to Measured Data in Confuser Rejection | Conor Flynn et.al. | 2505.00836 | null |
2025-05-11 | SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation | Quang P. M. Pham et.al. | 2505.00831 | link |
2025-05-01 | SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models | Wufei Ma et.al. | 2505.00788 | null |
2025-05-01 | Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L | Woong-Chan Byun et.al. | 2505.00757 | null |
2025-05-01 | P2P-Insole: Human Pose Estimation Using Foot Pressure Distribution and Motion Sensors | Atsuya Watanabe et.al. | 2505.00755 | null |
2025-04-30 | A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond | Jiajia Li et.al. | 2505.00737 | null |
2025-04-29 | Unconstrained Large-scale 3D Reconstruction and Rendering across Altitudes | Neil Joshi et.al. | 2505.00734 | null |
2025-05-01 | Controllable Weather Synthesis and Removal with Video Diffusion Models | Chih-Hao Lin et.al. | 2505.00704 | null |
2025-05-01 | RayZer: A Self-supervised Large View Synthesis Model | Hanwen Jiang et.al. | 2505.00702 | null |
2025-05-06 | Robotic Visual Instruction | Yanbang Li et.al. | 2505.00693 | null |
2025-05-02 | AI-based CSI Feedback with Digital Twins: Real-World Validation and Insights | Tzu-Hao Huang et.al. | 2505.00660 | null |
2025-05-01 | Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction | Simon Giebenhain et.al. | 2505.00615 | null |
2025-05-01 | Dietary Intake Estimation via Continuous 3D Reconstruction of Food | Wallace Lee et.al. | 2505.00606 | null |
2025-05-02 | Multimodal Masked Autoencoder Pre-training for 3D MRI-Based Brain Tumor Analysis with Missing Modalities | Lucas Robinet et.al. | 2505.00568 | link |
2025-05-01 | DeCo: Task Decomposition and Skill Composition for Zero-Shot Generalization in Long-Horizon 3D Manipulation | Zixuan Chen et.al. | 2505.00527 | null |
2025-05-01 | An evaluation of unconditional 3D molecular generation methods | Martin Buttenschoen et.al. | 2505.00518 | null |
2025-05-05 | HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection | Esteban Rivera et.al. | 2505.00507 | null |
2025-05-12 | Minimal Factorization of Chern-Simons Theory – Gravitational Anyonic Edge Modes | Thomas G. Mertens et.al. | 2505.00501 | null |
2025-05-01 | ClearLines - Camera Calibration from Straight Lines | Gregory Schroeder et.al. | 2505.00452 | null |
2025-05-01 | Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly | Ruiyuan Zhang et.al. | 2505.00426 | null |
2025-05-01 | Real-Time Animatable 2DGS-Avatars with Detail Enhancement from Monocular Videos | Xia Yuan et.al. | 2505.00421 | null |
2025-05-01 | Multi-dimensional optical imaging on a chip | Liheng Bian et.al. | 2505.00408 | null |
2025-05-01 | Fast Azimuthally Anisotropic 3D Radon Transform by Generalized Fourier Slice Theorem | Ahmadreza Mokhtari et.al. | 2505.00387 | null |
2025-05-01 | Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation | Feng Xue et.al. | 2505.00378 | null |
2025-05-01 | Automated segmenta-on of pediatric neuroblastoma on multi-modal MRI: Results of the SPPIN challenge at MICCAI 2023 | M. A. D. Buser et.al. | 2505.00369 | null |
2025-05-01 | Efficient Neural Video Representation with Temporally Coherent Modulation | Seungjun Shin et.al. | 2505.00335 | null |
2025-05-01 | AI2-Active Safety: AI-enabled Interaction-aware Active Safety Analysis with Vehicle Dynamics | Keshu Wu et.al. | 2505.00322 | null |
2025-04-25 | Future Circular Collider Feasibility Study Report: Volume 3, Civil Engineering, Implementation and Sustainability | M. Benedikt et.al. | 2505.00273 | null |
2025-05-01 | Pack-PTQ: Advancing Post-training Quantization of Neural Networks by Pack-wise Reconstruction | Changjun Li et.al. | 2505.00259 | null |
2025-04-30 | Towards Robust and Generalizable Gerchberg Saxton based Physics Inspired Neural Networks for Computer Generated Holography: A Sensitivity Analysis Framework | Ankit Amrutkar et.al. | 2505.00220 | null |
2025-04-30 | GEOM-Drugs Revisited: Toward More Chemically Accurate Benchmarks for 3D Molecule Generation | Filipp Nikitin et.al. | 2505.00169 | null |
2025-04-30 | V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving | Jannik Lübberstedt et.al. | 2505.00156 | null |
2025-04-30 | Expanding Active Matter to the Third Dimension: Exploring Short and Long-Range Particle-Wall Interactions | Sandeep Ramteke et.al. | 2505.00141 | null |
2025-04-30 | Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis | Michal Geyer et.al. | 2505.00135 | null |
2025-04-30 | Efficient and robust 3D blind harmonization for large domain gaps | Hwihun Jeong et.al. | 2505.00133 | null |
2025-04-30 | Stereo X-ray tomography on deformed object tracking | Zhenduo Shang et.al. | 2505.00122 | null |
2025-04-30 | Symmetry induced pairing in dark excitonic condensate at finite temperature | Adham Alkady et.al. | 2505.00120 | null |
2025-04-30 | superB/NRPy: Scalable, Task-Based Numerical Relativity for 3G Gravitational Wave Science | Nishita Jadoo et.al. | 2505.00097 | null |
2025-04-30 | ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction | Qihao Liu et.al. | 2504.21855 | null |
2025-04-30 | 3D Stylization via Large Reconstruction Model | Ipek Oztas et.al. | 2504.21836 | null |
2025-04-30 | An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation | Yaming Ou et.al. | 2504.21826 | null |
2025-04-30 | A simple and effective approach for body part recognition on CT scans based on projection estimation | Franko Hrzic et.al. | 2504.21810 | null |
2025-04-30 | Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space | Leonhard Sommer et.al. | 2504.21749 | null |
2025-04-30 | Adaptive 3D UI Placement in Mixed Reality Using Deep Reinforcement Learning | Feiyu Lu et.al. | 2504.21731 | null |
2025-05-07 | VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive Interaction | Shiying Li et.al. | 2504.21718 | null |
2025-05-08 | REHEARSE-3D: A Multi-modal Emulated Rain Dataset for 3D Point Cloud De-raining | Abu Mohammed Raisuddin et.al. | 2504.21699 | null |
2025-04-30 | Self-Supervised Monocular Visual Drone Model Identification through Improved Occlusion Handling | Stavrow A. Bahnam et.al. | 2504.21695 | null |
2025-04-30 | Rank-two tensors and deconfinement in 3d $\mathcal{N}=2$ $SU(N)$ gauge theories | Antonio Amariti et.al. | 2504.21654 | null |
2025-04-30 | HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation | Haiyang Zhou et.al. | 2504.21650 | null |
2025-05-07 | Fast Sign Retrieval via Sub-band Convolution: An Elementary Extension of Binary Classification | Fuma Ito et.al. | 2504.21632 | null |
2025-04-30 | Path Planning on Multi-level Point Cloud with a Weighted Traversability Graph | Yujie Tang et.al. | 2504.21622 | null |
2025-05-09 | 3D Hand-Eye Calibration for Collaborative Robot Arm: Look at Robot Base Once | Leihui Li et.al. | 2504.21619 | null |
2025-04-30 | Toward Realization of Low-Altitude Economy Networks: Core Architecture, Integrated Technologies, and Future Directions | Yixian Wang et.al. | 2504.21583 | null |
2025-04-30 | SAM4EM: Efficient memory-based two stage prompt-free segment anything model adapter for complex 3D neuroscience electron microscopy stacks | Uzair Shah et.al. | 2504.21544 | link |
2025-04-30 | Enhancing Cosmological Constraints by Two-dimensional $β$ -cosmic-web Weighted Angular Correlation Functions | Fenfen Yin et.al. | 2504.21509 | null |
2025-05-10 | MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance | Mengting Wei et.al. | 2504.21497 | null |
2025-04-30 | Monolayer C $_{60}$ networks: A first-principles perspective | Bo Peng et.al. | 2504.21485 | null |
2025-05-10 | GarmentDiffusion: 3D Garment Sewing Pattern Generation with Multimodal Diffusion Transformers | Xinyu Li et.al. | 2504.21476 | null |
2025-04-30 | Multiview Point Cloud Registration via Optimization in an Autoencoder Latent Space | Luc Vedrenne et.al. | 2504.21467 | null |
2025-04-30 | Kolmogorov Cascade as the Governing Mechanism for Intervortex Spacing in Quantum Turbulence | Clément Bret et.al. | 2504.21416 | null |
2025-04-30 | Mapping the Human Brain from the Prenatal Period to Infancy Using 3D Magnetic Resonance Imaging | Arnaud Cachia et.al. | 2504.21406 | null |
2025-04-30 | ImaginateAR: AI-Assisted In-Situ Authoring in Augmented Reality | Jaewook Lee et.al. | 2504.21360 | null |
2025-04-30 | MagicCraft: Natural Language-Driven Generation of Dynamic and Interactive 3D Objects for Commercial Metaverse Platforms | Ryutaro Kurai et.al. | 2504.21332 | null |
2025-04-30 | Mamba Based Feature Extraction And Adaptive Multilevel Feature Fusion For 3D Tumor Segmentation From Multi-modal Medical Image | Zexin Ji et.al. | 2504.21281 | null |
2025-04-29 | General vector auxiliary differential equation finite-difference time-domain method for rotationally symmetric vector wave propagation in nonlinear optics | Caleb J. Grimms et.al. | 2504.21201 | null |
2025-04-29 | Slug-Mapper: Magnetic Scanner for Ultra Low-Field MRI Scanners | Jonathan W. Morris et.al. | 2504.21193 | null |
2025-04-29 | Design, analysis, and experimental validation of a stepped plate parametric array loudspeaker | Woongji Kim et.al. | 2504.21171 | null |
2025-04-29 | Dance Style Recognition Using Laban Movement Analysis | Muhammad Turab et.al. | 2504.21166 | null |
2025-04-29 | Emotion Recognition in Contemporary Dance Performances Using Laban Movement Analysis | Muhammad Turab et.al. | 2504.21154 | null |
2025-04-29 | Light-Based Fast Timing in Bulk CsPbBr3 Crystals for TOF-PET and Proton Range Verification | Nicolaus Kratohwil et.al. | 2504.21123 | null |
2025-04-29 | GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction | Yuhan Xie et.al. | 2504.21067 | null |
2025-04-29 | A 3D pocket-aware and affinity-guided diffusion model for lead optimization | Anjie Qiao et.al. | 2504.21065 | null |
2025-04-27 | Transcending Dimensions using Generative AI: Real-Time 3D Model Generation in Augmented Reality | Majid Behravan et.al. | 2504.21033 | null |
2025-04-29 | TesserAct: Learning 4D Embodied World Models | Haoyu Zhen et.al. | 2504.20995 | null |
2025-04-29 | XRISM forecast for the Coma cluster: stormy, with a steep power spectrum | XRISM Collaboration et.al. | 2504.20928 | null |
2025-04-29 | Imaging on the Edge: Mapping Object Corners and Edges with Stereo X-ray Tomography | Zhenduo Shang et.al. | 2504.20892 | null |
2025-04-29 | RadSAM: Segmenting 3D radiological images with a 2D promptable model | Julien Khlaut et.al. | 2504.20837 | null |
2025-04-29 | GaussTrap: Stealthy Poisoning Attacks on 3D Gaussian Splatting for Targeted Scene Confusion | Jiaxin Hong et.al. | 2504.20829 | null |
2025-04-29 | Semi-discrete optimal transport techniques for the compressible semi-geostrophic equations | David P. Bourne et.al. | 2504.20807 | null |
2025-04-29 | DICOM Compatible, 3D Multimodality Image Encryption using Hyperchaotic Signal | Anandik N Anand et.al. | 2504.20689 | null |
2025-04-29 | Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion | Zesheng Wang et.al. | 2504.20685 | null |
2025-04-29 | DiffLiB: High-fidelity differentiable modeling of lithium-ion batteries and efficient gradient-based parameter identification | Weipeng Xu et.al. | 2504.20674 | null |
2025-04-29 | Statistical Channel Based Low-Complexity Rotation and Position Optimization for 6D Movable Antennas Enabled Wireless Communication | Qijun Jiang et.al. | 2504.20618 | null |
2025-04-29 | EfficientHuman: Efficient Training and Reconstruction of Moving Human using Articulated 2D Gaussian | Hao Tian et.al. | 2504.20607 | null |
2025-04-29 | Quantitative X-ray Schlieren Nanotomography for Hyperspectral Phase and Absorption Imaging | Herve Hugonne et.al. | 2504.20537 | null |
2025-04-29 | Geometry-aware Temporal Aggregation Network for Monocular 3D Lane Detection | Huan Zheng et.al. | 2504.20525 | null |
2025-04-29 | PRISM: Projection-based Reward Integration for Scene-Aware Real-to-Sim-to-Real Transfer with Few Demonstrations | Haowen Sun et.al. | 2504.20520 | null |
2025-04-29 | SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation | Jia Wang et.al. | 2504.20501 | null |
2025-04-29 | Large-scale visual SLAM for in-the-wild videos | Shuo Sun et.al. | 2504.20496 | null |
2025-04-29 | Spin-orbital order and excitations in $3d^4$, $4d^4$, and $5d^4$ systems: Application to $\rm BaFeO_3$, $\rm Sr_2RuO_4$, $\rm Sr_2YIrO_6$, and $\rm K_2OsCl_64$ | Shahid Ahmad et.al. | 2504.20476 | null |
2025-05-05 | LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs | Woo Yi Yang et.al. | 2504.20466 | null |
2025-04-29 | LymphAtlas- A Unified Multimodal Lymphoma Imaging Repository Delivering AI-Enhanced Diagnostic Insight | Jiajun Ding et.al. | 2504.20454 | null |
2025-04-29 | GarmentX: Autoregressive Parametric Representations for High-Fidelity 3D Garment Generation | Jingfeng Guo et.al. | 2504.20409 | null |
2025-04-29 | SCOPE-MRI: Bankart Lesion Detection as a Case Study in Data Curation and Deep Learning for Challenging Diagnoses | Sahil Sethi et.al. | 2504.20405 | link |
2025-04-29 | Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting | Hanxi Liu et.al. | 2504.20403 | null |
2025-05-01 | GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting | Jongwon Lee et.al. | 2504.20379 | null |
2025-04-29 | Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views | Jiang Wu et.al. | 2504.20378 | null |
2025-04-29 | Ant Colony Optimization for Density Functionals in Strongly Correlated Systems | G. M. Tonin et.al. | 2504.20317 | null |
2025-04-28 | FreBIS: Frequency-Based Stratification for Neural Implicit Surface Representations | Naoko Sawada et.al. | 2504.20222 | null |
2025-04-28 | Application of the Holographic Equation of State for Numerical Modeling of the Evolution of Quark-Gluon Plasma | A. V. Anufriev et.al. | 2504.20207 | null |
2025-04-30 | Cosmos: A Cost Model for Serverless Workflows in the 3D Compute Continuum | Cynthia Marcelino et.al. | 2504.20189 | null |
2025-05-03 | Attention to Detail: Fine-Scale Feature Preservation-Oriented Geometric Pre-training for AI-Driven Surrogate Modeling | Yu-hsuan Chen et.al. | 2504.20110 | null |
2025-04-28 | Polarization of light from fast rotating Wolf-Rayet stars: A Monte Carlo simulations compared to analytical formula | Slah Abdellaoui et.al. | 2504.20037 | null |
2025-04-28 | LIRM: Large Inverse Rendering Model for Progressive Reconstruction of Shape, Materials and View-dependent Radiance Fields | Zhengqin Li et.al. | 2504.20026 | null |
2025-04-28 | SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning | Wufei Ma et.al. | 2504.20024 | null |
2025-04-28 | Interaction of Laguerre-Gaussian laser pulses with borane targets of different hydrogen-boron ratio | Lars Reichwein et.al. | 2504.20015 | null |
2025-04-28 | 3D MPSoC with On-Chip Cache Support – Design and Exploitation | Rodrigo Cataldo et.al. | 2504.19984 | null |
2025-04-28 | Tendon-Actuated Concentric Tube Endonasal Robot (TACTER) | Kent K. Yamamoto et.al. | 2504.19948 | null |
2025-04-28 | Mesh-Learner: Texturing Mesh with Spherical Harmonics | Yunfei Wan et.al. | 2504.19938 | link |
2025-04-28 | Accelerated 3D-3D rigid registration of echocardiographic images obtained from apical window using particle filter | Thanuja Uruththirakodeeswaran et.al. | 2504.19930 | null |
2025-04-28 | Modeling of Parallel Single-Pixel Imaging for 3D Reconstruction: New Insights and Opportunities | Feifei Chen et.al. | 2504.19923 | null |
2025-04-28 | Tracing the ejecta structure of SN 1987A: Insights and diagnostics from 3D MHD simulations | S. Orlando et.al. | 2504.19896 | null |
2025-04-28 | Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer | Daniel Kienzle et.al. | 2504.19863 | link |
2025-04-28 | CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback | Chenhan Jiang et.al. | 2504.19860 | null |
2025-04-28 | exoALMA. XVI. Predicting Signatures of Large-scale Turbulence in Protoplanetary Disks | Marcelo Barraza-Alfaro et.al. | 2504.19853 | null |
2025-04-28 | AnimateAnywhere: Rouse the Background in Human Image Animation | Xiaoyu Liu et.al. | 2504.19834 | null |
2025-04-28 | Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video | Hoang Chuong Nguyen et.al. | 2504.19819 | null |
2025-04-28 | Search for structural differences in spike glycoprotein variants of SARS-CoV-2: Infrared Spectroscopy, Circular Dichroism and Computational Analysis | Tiziana Mancini et.al. | 2504.19766 | null |
2025-04-28 | STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction | Zhimin Liao et.al. | 2504.19749 | null |
2025-05-04 | Pixels2Points: Fusing 2D and 3D Features for Facial Skin Segmentation | Victoria Yue Chen et.al. | 2504.19718 | null |
2025-04-28 | PhyloProfile v2 – Exploring multi-layered phylogenetic profiles at scale | Vinh Tran et.al. | 2504.19710 | link |
2025-04-28 | Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM | Leon Davies et.al. | 2504.19654 | null |
2025-04-28 | GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM | Leon Davies et.al. | 2504.19653 | null |
2025-04-28 | Non-Equilibrium Multiplet Excitations probed by the $M_{5,4}$ Branching Ratio in $3d \rightarrow 4f$ X-ray Absorption Spectroscopy | Tim Amrhein et.al. | 2504.19630 | null |
2025-04-28 | ARMOR: Adaptive Meshing with Reinforcement Optimization for Real-time 3D Monitoring in Unexposed Scenes | Yizhe Zhang et.al. | 2504.19624 | null |
2025-04-28 | DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer | Junpeng Jiang et.al. | 2504.19614 | null |
2025-04-28 | SAMBLE: Shape-Specific Point Cloud Sampling for an Optimal Trade-Off Between Local Detail and Global Uniformity | Chengzhi Wu et.al. | 2504.19581 | null |
2025-04-28 | CE-NPBG: Connectivity Enhanced Neural Point-Based Graphics for Novel View Synthesis in Autonomous Driving Scenes | Mohammad Altillawi et.al. | 2504.19557 | null |
2025-04-29 | Signatures of Hund $’s$ metal physics in single-layered 3d transition metal oxide, $\mathrm{Sr_2CoO_4}$ | Shivani Bhardwaj et.al. | 2504.19503 | null |
2025-04-28 | Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding | Yan Wang et.al. | 2504.19500 | null |
2025-04-28 | CasaGPT: Cuboid Arrangement and Scene Assembly for Interior Design | Weitao Feng et.al. | 2504.19478 | null |
2025-04-29 | Geometry-Informed Neural Operator Transformer | Qibang Liu et.al. | 2504.19452 | null |
2025-04-28 | A Real-Time Event-Based Normal Flow Estimator | Dehao Yuan et.al. | 2504.19417 | null |
2025-04-28 | GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field | Zuxing Lu et.al. | 2504.19409 | null |
2025-04-28 | Boosting 3D Liver Shape Datasets with Diffusion Models and Implicit Neural Representations | Khoa Tuan Nguyen et.al. | 2504.19402 | null |
2025-04-27 | Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users | Apurv Varshney et.al. | 2504.19345 | null |
2025-04-27 | Multiscale Roughness of Upper Mantle Discontinuities Inferred from the USArray: Dependence on Tomography Models | Yinzhi Wang et.al. | 2504.19290 | null |
2025-04-27 | High-contrast scattering microscopy in thick tissue with back-illumination interference tomography | Gregory N. McKay et.al. | 2504.19278 | null |
2025-04-27 | Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection | Athul M. Mathew et.al. | 2504.19271 | null |
2025-04-27 | VI3NR: Variance Informed Initialization for Implicit Neural Representations | Chamin Hewa Koneputugodage et.al. | 2504.19270 | null |
2025-04-27 | OpenFusion++: An Open-vocabulary Real-time Scene Understanding System | Xiaofeng Jin et.al. | 2504.19266 | null |
2025-04-30 | OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion | Shuhao Kang et.al. | 2504.19258 | null |
2025-04-27 | LM-MCVT: A Lightweight Multi-modal Multi-view Convolutional-Vision Transformer Approach for 3D Object Recognition | Songsong Xiong et.al. | 2504.19256 | null |
2025-04-27 | Unsupervised 2D-3D lifting of non-rigid objects using local constraints | Shalini Maiti et.al. | 2504.19227 | null |
2025-04-27 | FlexPara: Flexible Neural Surface Parameterization | Yuming Zhao et.al. | 2504.19210 | null |
2025-04-29 | Improving Generalization in MRI-Based Deep Learning Models for Total Knee Replacement Prediction | Ehsan Karami et.al. | 2504.19203 | null |
2025-04-27 | Leveraging Modified Ex Situ Tomography Data for Segmentation of In Situ Synchrotron X-Ray Computed Tomography | Tristan Manchester et.al. | 2504.19200 | null |
2025-04-27 | Sketch2Anim: Towards Transferring Sketch Storyboards into 3D Animation | Lei Zhong et.al. | 2504.19189 | null |
2025-04-27 | A tissue-informed deep learning-based method for positron range correction in preclinical 68Ga PET imaging | Nerea Encina-Baranda et.al. | 2504.19175 | null |
2025-05-03 | CLR-Wire: Towards Continuous Latent Representations for 3D Curve Wireframe Generation | Xueqi Ma et.al. | 2504.19174 | null |
2025-04-29 | IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos | Yuan Li et.al. | 2504.19165 | null |
2025-04-27 | Making Physical Objects with Generative AI and Robotic Assembly: Considering Fabrication Constraints, Sustainability, Time, Functionality, and Accessibility | Alexander Htet Kyaw et.al. | 2504.19131 | null |
2025-04-27 | Towards Latency-Aware 3D Streaming Perception for Autonomous Driving | Jiaqi Peng et.al. | 2504.19115 | null |
2025-04-27 | MISO: Multiresolution Submap Optimization for Efficient Globally Consistent Neural Implicit Reconstruction | Yulun Tian et.al. | 2504.19104 | null |
2025-04-27 | Dual-Branch Residual Network for Cross-Domain Few-Shot Hyperspectral Image Classification with Refined Prototype | Anyong Qin et.al. | 2504.19074 | null |
2025-04-26 | Efficient Control Allocation and 3D Trajectory Tracking of a Highly Manoeuvrable Under-actuated Bio-inspired AUV | Walid Remmas et.al. | 2504.19049 | null |
2025-04-26 | Phases of Floquet code under local decoherence | Yuchen Tang et.al. | 2504.19041 | null |
2025-05-01 | Geometry-aware Active Learning of Spatiotemporal Dynamic Systems | Xizhuo Zhang et.al. | 2504.19012 | null |
2025-04-26 | 3DPyranet Features Fusion for Spatio-temporal Feature Learning | Ihsan Ullah et.al. | 2504.18977 | null |
2025-04-26 | HeartSimSage: Attention-Enhanced Graph Neural Networks for Accelerating Cardiac Mechanics Modeling | Lei Shi et.al. | 2504.18968 | null |
2025-04-30 | 4DGS-CC: A Contextual Coding Framework for 4D Gaussian Splatting Data Compression | Zicong Chen et.al. | 2504.18925 | null |
2025-04-26 | Single power-law rheology of crowded cytoplasm in living cells | H. Ebata et.al. | 2504.18922 | null |
2025-04-26 | Exploiting Multiple Representations: 3D Face Biometrics Fusion with Application to Surveillance | Simone Maurizio La Cava et.al. | 2504.18886 | null |
2025-04-26 | Three-Dimensional Fermi Surface, Van Hove Singularity and Enhancement of Superconductivity in Infinite-Layer Nickelates | Chengliang Xia et.al. | 2504.18778 | null |
2025-05-01 | TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians | Letian Huang et.al. | 2504.18768 | null |
2025-04-26 | Global Simulations of Gravitational Instability in Protostellar Disks with Full Radiation Transport II. Locality of Gravitoturbulence, Clumpy Spirals, and Implications for Observable Substructure | Wenrui Xu et.al. | 2504.18751 | null |
2025-04-25 | A Review of 3D Object Detection with Vision-Language Models | Ranjan Sapkota et.al. | 2504.18738 | null |
2025-04-25 | Vysics: Object Reconstruction Under Occlusion by Fusing Vision and Contact-Rich Physics | Bibit Bianchini et.al. | 2504.18719 | null |
2025-04-25 | Decentralized Fusion of 3D Extended Object Tracking based on a B-Spline Shape Model | Longfei Han et.al. | 2504.18708 | null |
2025-04-25 | Robust Push Recovery on Bipedal Robots: Leveraging Multi-Domain Hybrid Systems with Reduced-Order Model Predictive Control | Min Dai et.al. | 2504.18698 | null |
2025-04-25 | SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models | Nader Zantout et.al. | 2504.18684 | link |
2025-04-30 | exoALMA VII: Benchmarking Hydrodynamics and Radiative Transfer Codes | Jaehan Bae et.al. | 2504.18643 | null |
2025-04-25 | Implications of Complexity Factor on Evolution of New Dynamical and Static Wormholes in $f(R, T)$ Gravity | M. Zubair et.al. | 2504.18607 | null |
2025-04-22 | DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment | Xiaofan Li et.al. | 2504.18576 | null |
2025-04-25 | Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation | Shivam Duggal et.al. | 2504.18509 | null |
2025-05-05 | RGS-DR: Reflective Gaussian Surfels with Deferred Rendering for Shiny Objects | Georgios Kouros et.al. | 2504.18468 | null |
2025-04-25 | LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning | Rui Li et.al. | 2504.18424 | null |
2025-04-25 | A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Carlo Sgaravatti et.al. | 2504.18419 | link |
2025-04-25 | Spatial Reasoner: A 3D Inference Pipeline for XR Applications | Steven Häsler et.al. | 2504.18380 | null |
2025-04-25 | Anisotropic Piezomagnetism in Noncollinear Antiferromagnets | Vu Thi Ngoc Huyen et.al. | 2504.18363 | null |
2025-04-25 | Interpretable Affordance Detection on 3D Point Clouds with Probabilistic Prototypes | Maximilian Xiling Li et.al. | 2504.18355 | null |
2025-04-25 | NUDF: Neural Unsigned Distance Fields for high resolution 3D medical image segmentation | Kristine Sørensen et.al. | 2504.18344 | null |
2025-04-25 | Controlling the sign of optical forces using metaoptics | Adeel Afridi et.al. | 2504.18341 | null |
2025-04-25 | Depth3DLane: Monocular 3D Lane Detection via Depth Prior Distillation | Dongxin Lyu et.al. | 2504.18325 | null |
2025-04-25 | Deep Reinforcement Learning Based Navigation with Macro Actions and Topological Maps | Simon Hakenes et.al. | 2504.18300 | null |
2025-04-25 | Translocation of Active Polymerlike Worms | Marin Vatin et.al. | 2504.18275 | null |
2025-04-25 | SecCityVR: Visualization and Collaborative Exploration of Software Vulnerabilities in Virtual Reality | Dennis Wüppelman et.al. | 2504.18238 | null |
2025-04-25 | Phenomenology of Schwarzschild-like Black Holes with a Generalized Compton Wavelength | Reggie C. Pantig et.al. | 2504.18226 | null |
2025-04-25 | Unify3D: An Augmented Holistic End-to-end Monocular 3D Human Reconstruction via Anatomy Shaping and Twins Negotiating | Nanjie Yao et.al. | 2504.18215 | null |
2025-04-25 | A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes | Nicolas Münger et.al. | 2504.18213 | null |
2025-04-25 | LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring | Raul David Dominguez Sanchez et.al. | 2504.18203 | null |
2025-04-25 | Unveiling 3D Ocean Biogeochemical Provinces: A Machine Learning Approach for Systematic Clustering and Validation | Yvonne Jenniges et.al. | 2504.18181 | null |
2025-04-25 | The structural effects of (111) growth of La $_2$CoMnO$_6$ on SrTiO$_3$ and LSAT – new insights from 3D crystallographic characterisation with 4D-STEM and Digital Dark Field imaging | Ian MacLaren et.al. | 2504.18171 | null |
2025-04-25 | PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models | Michel Gokan Khan et.al. | 2504.18165 | null |
2025-04-25 | Bayesian Quantum Orthogonal Neural Networks for Anomaly Detection | Natansh Mathur et.al. | 2504.18103 | null |
2025-04-25 | S3MOT: Monocular 3D Object Tracking with Selective State Space Model | Zhuohao Yan et.al. | 2504.18068 | null |
2025-04-25 | Cabbage: A Differential Growth Framework for Open Surfaces | Xiaoyi Liu et.al. | 2504.18040 | null |
2025-04-24 | Virtual Roads, Smarter Safety: A Digital Twin Framework for Mixed Autonomous Traffic Safety Analysis | Hao Zhang et.al. | 2504.17968 | null |
2025-04-24 | iVR-GS: Inverse Volume Rendering for Explorable Visualization via Editable 3D Gaussian Splatting | Kaiyuan Tang et.al. | 2504.17954 | link |
2025-04-24 | Formation of chromospheric fan-shaped jets through magnetic reconnection | Annu Bura et.al. | 2504.17931 | null |
2025-04-24 | FlexPINN: Modeling Fluid Dynamics and Mass Transfer in 3D Micromixer Geometries Using a Flexible Physics-Informed Neural Network | Meraj Hassanzadeh et.al. | 2504.17896 | null |
2025-04-28 | Quaternion Domain Super MDS for 3D Localization | Keigo Masuoka et.al. | 2504.17890 | null |
2025-04-24 | Set Phasers to Stun: Beaming Power and Control to Mobile Robots with Laser Light | Charles J. Carver et.al. | 2504.17865 | null |
2025-04-24 | A Nearby Dark Molecular Cloud in the Local Bubble Revealed via H $_2$ Fluorescence | Blakesley Burkhart et.al. | 2504.17843 | null |
2025-04-23 | Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning | Mingxuan Cui et.al. | 2504.17815 | link |
2025-04-22 | Object Learning and Robust 3D Reconstruction | Sara Sabour et.al. | 2504.17812 | null |
2025-04-24 | The Fourth Monocular Depth Estimation Challenge | Anton Obukhov et.al. | 2504.17787 | null |
2025-04-24 | Nearby open clusters with tidal features: golden sample selection and 3D structure | Ming Xu et.al. | 2504.17744 | null |
2025-04-24 | Fully-Mixed Virtual Element Method for the Biot Problem | Michele Botti et.al. | 2504.17729 | null |
2025-04-24 | CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos | Shucheng Gong et.al. | 2504.17728 | link |
2025-04-24 | PICO: Reconstructing 3D People In Contact with Objects | Alpár Cseke et.al. | 2504.17695 | null |
2025-04-24 | DiMeR: Disentangled Mesh Reconstruction Model | Lutao Jiang et.al. | 2504.17670 | null |
2025-04-24 | Insights from Analytical Theory of Eccentric Circumbinary Disks | Marcela Grcic et.al. | 2504.17658 | null |
2025-04-24 | polyGen: A Learning Framework for Atomic-level Polymer Structure Generation | Ayush Jain et.al. | 2504.17656 | null |
2025-04-24 | A Guide to Structureless Visual Localization | Vojtech Panek et.al. | 2504.17636 | null |
2025-04-24 | Bolt: Clothing Virtual Characters at Scale | Jonathan Leaf et.al. | 2504.17614 | null |
2025-04-24 | The NewEra model grid | Peter H. Hauschildt et.al. | 2504.17597 | null |
2025-04-24 | When Gaussian Meets Surfel: Ultra-fast High-fidelity Radiance Field Rendering | Keyang Ye et.al. | 2504.17545 | null |
2025-05-03 | Thermodynamics and Holographic RG Flow in 3D C-metric | Shaohua Xue et.al. | 2504.17456 | null |
2025-04-24 | Predict-Optimize-Distill: A Self-Improving Cycle for 4D Object Understanding | Mingxuan Wu et.al. | 2504.17441 | null |
2025-04-24 | 3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models | Min Wei et.al. | 2504.17414 | null |
2025-04-24 | Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization | Guangyang Zeng et.al. | 2504.17410 | null |
2025-04-25 | Highly Accurate and Diverse Traffic Data: The DeepScenario Open 3D Dataset | Oussema Dhaouadi et.al. | 2504.17371 | null |
2025-04-24 | Physics-based super-resolved simulation of 3D elastic wave propagation adopting scalable Diffusion Transformer | Hugo Gabrielidis et.al. | 2504.17308 | null |
2025-04-24 | Incompressible and fast rotation limits for 3D compressible rotating Euler system with general initial data | Mikihiro Fujii et.al. | 2504.17290 | null |
2025-04-24 | Some remarks on Liouville type theorems for the 3D steady tropical climate model | Yanyan Dong et.al. | 2504.17285 | null |
2025-04-24 | 3D Deep-learning-based Segmentation of Human Skin Sweat Glands and Their 3D Morphological Response to Temperature Variations | Shaoyu Pei et.al. | 2504.17255 | null |
2025-04-24 | Demonstrating Berkeley Humanoid Lite: An Open-source, Accessible, and Customizable 3D-printed Humanoid Robot | Yufeng Chi et.al. | 2504.17249 | null |
2025-04-24 | Range Image-Based Implicit Neural Compression for LiDAR Point Clouds | Akihiro Kuwabara et.al. | 2504.17229 | null |
2025-04-23 | Global stability for compressible isentropic Navier-Stokes equations in 3D bounded domains with Navier-slip boundary conditions | Yang Liu et.al. | 2504.17136 | null |
2025-04-23 | Physiological neural representation for personalised tracer kinetic parameter estimation from dynamic PET | Kartikay Tehlan et.al. | 2504.17122 | link |
2025-04-23 | Towards understanding stellar variability at the sub m/s level: Isolating granulation signals in synthetic spectral lines | Ginger Frame et.al. | 2504.17011 | null |
2025-04-23 | Zero-shot Sim-to-Real Transfer for Reinforcement Learning-based Visual Servoing of Soft Continuum Arms | Hsin-Jung Yang et.al. | 2504.16916 | null |
2025-04-23 | An Accelerated Camera 3DMA Framework for Efficient Urban GNSS Multipath Estimation | Shiyao Lv et.al. | 2504.16906 | null |
2025-04-23 | Exploring zero-shot structure-based protein fitness prediction | Arnav Sharma et.al. | 2504.16886 | null |
2025-04-23 | A Low-Cost Photogrammetry System for 3D Plant Modeling and Phenotyping | Joe Hrzich et.al. | 2504.16840 | null |
2025-04-23 | Energy Variational Modeling and Numerical Simulation of Open Membranes in Stokes Flow | Han Zhou et.al. | 2504.16823 | null |
2025-04-23 | 4D Multimodal Co-attention Fusion Network with Latent Contrastive Alignment for Alzheimer’s Diagnosis | Yuxiang Wei et.al. | 2504.16798 | null |
2025-04-23 | Graph2Nav: 3D Object-Relation Graph Generation to Robot Navigation | Tixiao Shan et.al. | 2504.16782 | null |
2025-04-23 | The Interplay of Single Ion Anisotropy and Magnetic 3d-4f Interactions in V $^{\rm III}_2$Ln$^{\rm III}_2$ Butterfly Complexes | J. Arneth et.al. | 2504.16758 | null |
2025-04-23 | Gaussian Splatting is an Effective Data Generator for 3D Object Detection | Farhad G. Zanjani et.al. | 2504.16740 | null |
2025-04-24 | DYNUS: Uncertainty-aware Trajectory Planner in Dynamic Unknown Environments | Kota Kondo et.al. | 2504.16734 | null |
2025-05-03 | PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation | Wenxuan Li et.al. | 2504.16693 | null |
2025-04-30 | Non-uniqueness of (Stochastic) Lagrangian Trajectories for Euler Equations | Huaxiang Lü et.al. | 2504.16687 | null |
2025-04-23 | Small Alfvén Number Limit for the Global-in-time Solutions of Incompressible MHD Equations with General Initial Data | Yuan Cai et.al. | 2504.16650 | null |
2025-04-23 | UAV-Mounted IRS (UMI) in the Presence of Hovering Fluctuations: 3D Pattern Characterization and Performance Analysis | Mohammad Javad Zakavi et.al. | 2504.16613 | null |
2025-04-23 | A hybrid high-order method for the biharmonic problem | Yizhou Liang et.al. | 2504.16608 | null |
2025-04-23 | HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction | Zhongtao Wang et.al. | 2504.16606 | null |
2025-04-23 | 3D-1D modelling of cranial plate heating induced by low or medium frequency magnetic fields | Alessandro Arduino et.al. | 2504.16600 | null |
2025-04-23 | HERB: Human-augmented Efficient Reinforcement learning for Bin-packing | Gojko Perovic et.al. | 2504.16595 | null |
2025-04-23 | Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks | Murat Bilgehan Ertan et.al. | 2504.16557 | null |
2025-04-23 | ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration | Andrea Conti et.al. | 2504.16545 | null |
2025-04-23 | PRaDA: Projective Radial Distortion Averaging | Daniil Sinitsyn et.al. | 2504.16499 | null |
2025-04-23 | Assessing the Feasibility of Internet-Sourced Video for Automatic Cattle Lameness Detection | Md Fahimuzzman Sohan et.al. | 2504.16404 | null |
2025-04-23 | Stability threshold of Couette flow for 3D Boussinesq system in Sobolev spaces | Shikun Cui et.al. | 2504.16401 | null |
2025-04-23 | SaENeRF: Suppressing Artifacts in Event-based Neural Radiance Fields | Yuanjian Wang et.al. | 2504.16389 | link |
2025-04-23 | DPGP: A Hybrid 2D-3D Dual Path Potential Ghost Probe Zone Prediction Framework for Safe Autonomous Driving | Weiming Qu et.al. | 2504.16374 | null |
2025-04-23 | Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection | Linhua Kong et.al. | 2504.16368 | null |
2025-04-23 | Real-time Bayesian inference at extreme scale: A digital twin for tsunami early warning applied to the Cascadia subduction zone | Stefan Henneking et.al. | 2504.16344 | null |
2025-04-23 | Helically symmetric solution of 3D Euler equations with vorticity and its free boundary | Lili Du et.al. | 2504.16340 | null |
2025-04-23 | Terahertz field effect in a two-dimensional semiconductor MoS2 | Tomoki Hiraoka et.al. | 2504.16333 | null |
2025-04-23 | PhaseT3M: 3D Imaging at 1.6 Å Resolution via Electron Cryo-Tomography with Nonlinear Phase Retrieval | Juhyeok Lee et.al. | 2504.16332 | null |
2025-04-24 | Subthreshold Jitter in VR Can Induce Visual Discomfort | Samuel J. Levulis et.al. | 2504.16295 | null |
2025-04-22 | Ultimate quantum sensitivity in the 3D relative localisation of two single-photon emitters via two-photon interference | Luca Maggio et.al. | 2504.16294 | null |
2025-04-22 | Environmental Dependence of X-Ray Emission From The Least Massive Galaxies | Marko Mićić et.al. | 2504.16285 | null |
2025-04-22 | Analytic Fourier ptychotomography for volumetric refractive index imaging | Zhenyu Dong et.al. | 2504.16247 | null |
2025-04-22 | High-throughput screening of 2D materials identifies p-type monolayer WS $_2$ as potential ultra-high mobility semiconductor | Viet-Anh Ha et.al. | 2504.16208 | null |
2025-04-22 | Measuring Uncertainty in Shape Completion to Improve Grasp Quality | Nuno Ferreira Duarte et.al. | 2504.16183 | null |
2025-04-22 | Universal giant spin Hall effect in moire metal | Ning Mao et.al. | 2504.16179 | null |
2025-04-22 | Postcarrollian gravity | Florian Ecker et.al. | 2504.16162 | null |
2025-04-24 | AI-Based Vulnerability Analysis of NFT Smart Contracts | Xin Wang et.al. | 2504.16113 | null |
2025-04-15 | Shape Your Ground: Refining Road Surfaces Beyond Planar Representations | Oussema Dhaouadi et.al. | 2504.16103 | null |
2025-04-22 | Optimal intrinsic alignment estimators in the presence of redshift-space distortions | Claire Lamman et.al. | 2504.16076 | null |
2025-04-22 | Reconstruction of source function in a parabolic equation using partial boundary measurements | T. Sharma et.al. | 2504.16070 | null |
2025-04-22 | Vision language models are unreliable at trivial spatial cognition | Sangeet Khemlani et.al. | 2504.16061 | null |
2025-04-22 | Rotational ultrasound and photoacoustic tomography of the human body | Yang Zhang et.al. | 2504.16036 | null |
2025-04-22 | LHCspin: a Polarized Gas Target for LHC | A. Accardi et.al. | 2504.16034 | null |
2025-04-22 | Wilson lines with endpoints in 3d CFT | Nabil Iqbal et.al. | 2504.16017 | null |
2025-04-22 | Approximation of Invariant Solutions to the Nonlinear Filtration Equation by Modified Pade Approximants | Sergii Skurativskyi et.al. | 2504.16001 | null |
2025-04-29 | Small-scale dynamic phenomena associated with interacting fan-spine topologies: quiet-Sun Ellerman bombs, UV brightenings, and chromospheric inverted-Y-shaped jets | Aditi Bhatnagar et.al. | 2504.15996 | null |
2025-04-15 | High order treatment of moving curved boundaries: Arbitrary-Lagrangian-Eulerian methods with a shifted boundary polynomials correction | Walter Boscheri et.al. | 2504.15963 | null |
2025-04-22 | Infrared and X-ray emission of a supernova remnant in a clumpy medium | S. Yu. Dedikov et.al. | 2504.15940 | null |
2025-04-22 | MS-Occ: Multi-Stage LiDAR-Camera Fusion for 3D Semantic Occupancy Prediction | Zhiqiang Wei et.al. | 2504.15888 | null |
2025-04-22 | 3D Printing of Invariant Manifolds in Dynamical Systems | Patrick R. Bishop et.al. | 2504.15884 | null |
2025-04-22 | DERD-Net: Learning Depth from Event-based Ray Densities | Diego de Oliveira Hitzges et.al. | 2504.15863 | null |
2025-04-22 | Text-based Animatable 3D Avatars with Morphable Model Alignment | Yiqian Wu et.al. | 2504.15835 | null |
2025-04-22 | Locating and Mitigating Gradient Conflicts in Point Cloud Domain Adaptation via Saliency Map Skewness | Jiaqi Tang et.al. | 2504.15796 | null |
2025-04-22 | Development and evaluation of a deep learning algorithm for German word recognition from lip movements | Dinh Nam Pham et.al. | 2504.15792 | null |
2025-04-22 | Model-based Metric 3D Shape and Motion Reconstruction of Wild Bottlenose Dolphins in Drone-Shot Videos | Daniele Baieri et.al. | 2504.15782 | null |
2025-04-24 | Clifford Group Equivariant Diffusion Models for 3D Molecular Generation | Cong Liu et.al. | 2504.15773 | null |
2025-04-22 | 3D Maser polarization simulation for J=1-0 SiO masers in the circumstellar envelope of an AGB star | M. Phetra et.al. | 2504.15754 | null |
2025-04-22 | Compact vacuum levitation and control platform with a single 3D-printed fiber lens | Seyed Khalil Alavi et.al. | 2504.15734 | null |
2025-04-22 | Autonomous Control of Redundant Hydraulic Manipulator Using Reinforcement Learning with Action Feedback | Rohit Dhakate et.al. | 2504.15714 | null |
2025-04-27 | A Vision-Enabled Prosthetic Hand for Children with Upper Limb Disabilities | Md Abdul Baset Sarker et.al. | 2504.15654 | null |
2025-04-22 | Enhancing Reinforcement learning in 3-Dimensional Hydrophobic-Polar Protein Folding Model with Attention-based layers | Peizheng Liu et.al. | 2504.15634 | null |
2025-04-22 | Partition laser assembling technique | Yueqiang Zhu et.al. | 2504.15554 | null |
2025-04-21 | A dual-stage constitutive modeling framework based on finite strain data-driven identification and physics-augmented neural networks | Lennart Linden et.al. | 2504.15492 | null |
2025-04-21 | Helicons in multi-Weyl semimetals | Shiv Kumar Ram et.al. | 2504.15426 | null |
2025-04-21 | Physics Driven Image Simulation from Commercial Satellite Imagery | Scott Sorensen et.al. | 2504.15378 | null |
2025-04-21 | The (Limited) Effect of Viscosity in Multiphase Turbulent Mixing | Tirso Marin-Gilabert et.al. | 2504.15345 | null |
2025-04-21 | Vision6D: 3D-to-2D Interactive Visualization and Annotation Tool for 6D Pose Estimation | Yike Zhang et.al. | 2504.15329 | link |
2025-04-21 | StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians | Cailin Zhuang et.al. | 2504.15281 | null |
2025-04-27 | Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs | Chun-Hsiao Yeh et.al. | 2504.15280 | link |
2025-04-21 | Diffusion Bridge Models for 3D Medical Image Translation | Shaorong Zhang et.al. | 2504.15267 | null |
2025-04-21 | Revealing the 3D Cosmic Web through Gravitationally Constrained Neural Fields | Brandon Zhao et.al. | 2504.15262 | null |
2025-04-21 | Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation | Yunxuan Cai et.al. | 2504.15259 | null |
2025-04-21 | Breast density in MRI: an AI-based quantification and relationship to assessment in mammography | Yaqian Chen et.al. | 2504.15192 | null |
2025-04-21 | FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image | Fei Yin et.al. | 2504.15179 | null |
2025-04-21 | Dynamic 3D KAN Convolution with Adaptive Grid Optimization for Hyperspectral Image Classification | Guandong Li et.al. | 2504.15155 | null |
2025-04-21 | Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection | Jun Zhou et.al. | 2504.15152 | null |
2025-04-30 | MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video | Minh-Quan Viet Bui et.al. | 2504.15122 | null |
2025-04-23 | Muon Imaging of Hydrotreatment Reactors | Rafael Armando Martínez-Rivero et.al. | 2504.15103 | null |
2025-04-21 | Robust Planning and Control of Omnidirectional MRAVs for Aerial Communications in Wireless Networks | Giuseppe Silano et.al. | 2504.15089 | null |
2025-04-21 | ScanEdit: Hierarchically-Guided Functional 3D Scan Editing | Mohamed el amine Boudjoghra et.al. | 2504.15049 | null |
2025-04-21 | Bayesian Sensing for Time-Varying Channels in ISAC Systems | Xueyang Wang et.al. | 2504.15042 | null |
2025-04-21 | Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization | Hongbin Xu et.al. | 2504.14975 | null |
2025-04-21 | 3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations | Yating Wang et.al. | 2504.14967 | null |
2025-04-21 | OmniAudio: Generating Spatial Audio from 360-Degree Video | Huadai Liu et.al. | 2504.14906 | null |
2025-04-21 | Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation | Chenjie Cao et.al. | 2504.14899 | link |
2025-04-21 | SuFIA-BC: Generating High Quality Demonstration Data for Visuomotor Policy Learning in Surgical Subtasks | Masoud Moghani et.al. | 2504.14857 | null |
2025-04-20 | TAPIP3D: Tracking Any Point in Persistent 3D Geometry | Bowei Zhang et.al. | 2504.14717 | link |
2025-04-20 | Black Hole Survival Guide: Searching for Stars in the Galactic Center That Endure Partial Tidal Disruption | Rewa Clark Bush et.al. | 2504.14705 | null |
2025-04-20 | IXGS-Intraoperative 3D Reconstruction from Sparse, Arbitrarily Posed Real X-rays | Sascha Jecklin et.al. | 2504.14699 | null |
2025-04-20 | Seurat: From Moving Points to Depth | Seokju Cho et.al. | 2504.14687 | link |
2025-04-23 | A Complete and Bounded-Suboptimal Algorithm for a Moving Target Traveling Salesman Problem with Obstacles in 3D | Anoop Bhat et.al. | 2504.14680 | null |
2025-04-20 | NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation | Junyuan Fang et.al. | 2504.14638 | null |
2025-04-20 | VM-BHINet:Vision Mamba Bimanual Hand Interaction Network for 3D Interacting Hand Mesh Recovery From a Single RGB Image | Han Bi et.al. | 2504.14618 | null |
2025-04-20 | MP-Mat: A 3D-and-Instance-Aware Human Matting and Editing Framework with Multiplane Representation | Siyi Jiao et.al. | 2504.14606 | null |
2025-04-20 | RoboOcc: Enhancing the Geometric and Semantic Scene Understanding for Robots | Zhang Zhang et.al. | 2504.14604 | null |
2025-04-20 | Haptic-based Complementary Filter for Rigid Body Rotations | Amit Kumar et.al. | 2504.14570 | null |
2025-04-20 | Quenched correlation decay for random splittings of some prototypical 3D flows including the ABC flow | Nianci Jiang et.al. | 2504.14564 | null |
2025-04-20 | VGNC: Reducing the Overfitting of Sparse-view 3DGS via Validation-guided Gaussian Number Control | Lifeng Lin et.al. | 2504.14548 | null |
2025-04-24 | On the development of OpenFOAM solvers for simulating MHD micropolar fluid flows with or without the effect of micromagnetorotation | Kyriaki-Evangelia Aslani et.al. | 2504.14543 | null |
2025-04-20 | Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction | Weirong Chen et.al. | 2504.14516 | null |
2025-04-20 | Metamon-GS: Enhancing Representability with Variance-Guided Densification and Light Encoding | Junyan Su et.al. | 2504.14460 | null |
2025-04-20 | WT-BCP: Wavelet Transform based Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation | Mingya Zhang et.al. | 2504.14445 | null |
2025-04-20 | Hall algebra multiplication for stable envelopes on bow varieties | Tommaso Maria Botta et.al. | 2504.14428 | null |
2025-04-23 | SEGA: Drivable 3D Gaussian Head Avatar from a Single Image | Chen Guo et.al. | 2504.14373 | null |
2025-04-19 | Efficient Spiking Point Mamba for Point Cloud Analysis | Peixi Wu et.al. | 2504.14371 | null |
2025-04-19 | The Ophiuchus DIsk Survey Employing ALMA (ODISEA): A Unified Evolutionary Sequence of Planet-Driven Substructures Explaining the Diversity of Disk Morphologies | Santiago Orcajo et.al. | 2504.14318 | null |
2025-04-19 | ProtPainter: Draw or Drag Protein via Topology-guided Diffusion | Zhengxi Lu et.al. | 2504.14274 | null |
2025-04-19 | Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation | Johannes Spoecklberger et.al. | 2504.14231 | null |
2025-04-19 | Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection | Wenbing Zhu et.al. | 2504.14221 | null |
2025-04-19 | Phase tomography with axial structured illumination | N Goyal et.al. | 2504.14210 | null |
2025-04-19 | A Physics-guided Multimodal Transformer Path to Weather and Climate Sciences | Jing Han et.al. | 2504.14174 | null |
2025-04-19 | Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D | Sergio Arnaud et.al. | 2504.14151 | null |
2025-04-19 | 3D PIC Study of Magnetic Field Effects on Hall Thruster Electron Drift Instability | KunPeng Zhong et.al. | 2504.14144 | null |
2025-04-19 | HFBRI-MAE: Handcrafted Feature Based Rotation-Invariant Masked Autoencoder for 3D Point Cloud Analysis | Xuanhua Yin et.al. | 2504.14132 | null |
2025-04-18 | AnywhereXR: On-the-fly 3D Environments as a Basis for Open Source Immersive Digital Twin Applications | Alexander Klippel et.al. | 2504.14065 | null |
2025-04-18 | Out on a Limb: The Signatures of East-West Asymmetries in Transmission Spectra from General Circulation Models | Kenneth E. Arnold et.al. | 2504.14060 | null |
2025-04-18 | From sequence to protein structure and conformational dynamics with AI/ML | Alexander M. Ille et.al. | 2504.14059 | null |
2025-04-18 | Occlusion-Ordered Semantic Instance Segmentation | Soroosh Baselizadeh et.al. | 2504.14054 | null |
2025-04-18 | Micromagnetic-atomistic hybrid modeling of defect-induced magnetization dynamics | Nastaran Salehi et.al. | 2504.14019 | null |
2025-04-18 | Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training | Andrea Amaduzzi et.al. | 2504.13995 | null |
2025-04-15 | VoxCity: A Seamless Framework for Open Geospatial Data Integration, Grid-Based Semantic 3D City Model Generation, and Urban Environment Simulation | Kunihiko Fujiwara et.al. | 2504.13934 | null |
2025-04-18 | Outlier-Robust Multi-Model Fitting on Quantum Annealers | Saurabh Pandey et.al. | 2504.13836 | null |
2025-04-18 | Strict increase in the number of normally hyperbolic limit tori in 3D polynomial vector fields | Lucas Queiroz Arakaki et.al. | 2504.13832 | null |
2025-04-18 | ChatNekoHacker: Real-Time Fan Engagement with Conversational Agents | Takuya Sera et.al. | 2504.13793 | null |
2025-04-18 | RefComp: A Reference-guided Unified Framework for Unpaired Point Cloud Completion | Yixuan Yang et.al. | 2504.13788 | null |
2025-04-18 | ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis | Andrea Rigo et.al. | 2504.13745 | null |
2025-04-18 | Effect of micromagnetorotation on a micropolar magnetohydrodynamic blood flow in a 3D stenosed artery | Kyriaki-Evangelia Aslani et.al. | 2504.13678 | null |
2025-04-18 | Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Yushen He et.al. | 2504.13647 | link |
2025-04-18 | ViG3D-UNet: Volumetric Vascular Connectivity-Aware Segmentation via 3D Vision Graph Representation | Bowen Liu et.al. | 2504.13599 | null |
2025-04-18 | LMPOcc: 3D Semantic Occupancy Prediction Utilizing Long-Term Memory Prior from Historical Traversals | Shanshuai Yuan et.al. | 2504.13596 | null |
2025-04-18 | KAN or MLP? Point Cloud Shows the Way Forward | Yan Shi et.al. | 2504.13593 | link |
2025-04-18 | HAECcity: Open-Vocabulary Scene Understanding of City-Scale Point Clouds with Superpoint Graph Clustering | Alexander Rusnak et.al. | 2504.13590 | null |
2025-04-18 | Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding | Yuchen Rao et.al. | 2504.13580 | link |
2025-04-18 | Integrated Super-resolution Sensing and Symbiotic Communication with 3D Sparse MIMO for Low-Altitude UAV Swarm | Jingran Xu et.al. | 2504.13570 | null |
2025-04-18 | WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion | Yang Wu et.al. | 2504.13561 | null |
2025-04-18 | EG-Gaussian: Epipolar Geometry and Graph Network Enhanced 3D Gaussian Splatting | Beizhen Zhao et.al. | 2504.13540 | null |
2025-04-18 | Ascribe New Dimensions to Scientific Data Visualization with VR | Daniela Ushizima et.al. | 2504.13448 | null |
2025-04-18 | Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction | Wenyu Li et.al. | 2504.13419 | null |
2025-04-18 | How Learnable Grids Recover Fine Detail in Low Dimensions: A Neural Tangent Kernel Analysis of Multigrid Parametric Encodings | Samuel Audia et.al. | 2504.13412 | null |
2025-04-18 | Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis | Radek Daněček et.al. | 2504.13386 | null |
2025-04-17 | SMPL-GPTexture: Dual-View 3D Human Texture Estimation using Text-to-Image Generation Models | Mingxiao Tu et.al. | 2504.13378 | null |
2025-04-17 | Compensation-Like Temperature and Spin-Flip Switch in Strained Thulium Iron Garnet Thin Films: Tuning Sublattice Interactions for Ferrimagnetic Spintronics | Carlos C. Soares et.al. | 2504.13369 | null |
2025-04-24 | Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art Performance | Oliver Mills et.al. | 2504.13340 | null |
2025-04-17 | Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering | Landon Dyken et.al. | 2504.13339 | null |
2025-04-17 | Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes | Andreas Lau Hansen et.al. | 2504.13297 | null |
2025-04-15 | EDGS: Eliminating Densification for Efficient Convergence of 3DGS | Dmytro Kotovenko et.al. | 2504.13204 | null |
2025-04-14 | Efficient Brain Tumor Segmentation Using a Dual-Decoder 3D U-Net with Attention Gates (DDUNet) | Mohammad Mahdi Danesh Pajouh et.al. | 2504.13200 | null |
2025-04-17 | Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation | Sizhe Yang et.al. | 2504.13175 | null |
2025-04-18 | ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos | Zetong Zhang et.al. | 2504.13167 | null |
2025-04-17 | RUKA: Rethinking the Design of Humanoid Hands with Learning | Anya Zorin et.al. | 2504.13165 | null |
2025-04-17 | Digital Twin Generation from Visual Data: A Survey | Andrew Melnik et.al. | 2504.13159 | link |
2025-04-17 | AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis | Khiem Vuong et.al. | 2504.13157 | null |
2025-04-17 | Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs | Shaohui Dai et.al. | 2504.13153 | link |
2025-04-17 | St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World | Haiwen Feng et.al. | 2504.13152 | null |
2025-04-21 | A hybrid U-Net and Fourier neural operator framework for the fast prediction of turbulent flows with mixed periodic and non-periodic boundary conditions | Yunpeng Wang et.al. | 2504.13126 | null |
2025-04-17 | HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation | Wenqi Dong et.al. | 2504.13072 | null |
2025-04-17 | RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins | Yao Mu et.al. | 2504.13059 | null |
2025-04-17 | Expert Kernel Generation Network Driven by Contextual Mapping for Hyperspectral Image Classification | Guandong Li et.al. | 2504.13045 | null |
2025-04-17 | CompGS++: Compressed Gaussian Splatting for Static and Dynamic Scene Representation | Xiangrui Liu et.al. | 2504.13022 | null |
2025-04-17 | GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration | Rendong Zhang et.al. | 2504.12999 | link |
2025-04-17 | X-ray linear dichroic orientation tomography: reconstruction of nanoscale three-dimensional orientation fields | Andreas Apseros et.al. | 2504.12978 | null |
2025-04-18 | Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction | Dubing Chen et.al. | 2504.12959 | link |
2025-04-17 | Tensor-monopole-induced topological boundary effects in four-dimensional acoustic metamaterials | Qingyang Mo et.al. | 2504.12950 | null |
2025-04-20 | Prospects for Detecting Signs of Life on Exoplanets in the JWST Era | Sara Seager et.al. | 2504.12946 | null |
2025-04-17 | Performance of the advanced gamma-ray trigger system for the High Energy Cosmic Radiation Detection (HERD) facility | Keerthana Rajan Lathika et.al. | 2504.12930 | null |
2025-04-17 | Second-order Optimization of Gaussian Splats with Importance Sampling | Hamza Pehlivan et.al. | 2504.12905 | null |
2025-04-17 | Computer-Aided Design of Personalized Occlusal Positioning Splints Using Multimodal 3D Data | Agnieszka Anna Tomaka et.al. | 2504.12868 | null |
2025-04-17 | 3D-PNAS: 3D Industrial Surface Anomaly Synthesis with Perlin Noise | Yifeng Cheng et.al. | 2504.12856 | null |
2025-04-17 | Particle-based Simulation of an Air-Breathing Electric Propulsion System | Pietro Parodi et.al. | 2504.12829 | null |
2025-04-17 | TwoSquared: 4D Generation from 2D Image Pairs | Lu Sang et.al. | 2504.12825 | null |
2025-04-17 | AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering | Michael Steiner et.al. | 2504.12811 | null |
2025-04-17 | CAGE-GS: High-fidelity Cage Based 3D Gaussian Splatting Deformation | Yifei Tong et.al. | 2504.12800 | null |
2025-04-17 | TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors | Mingwei Li et.al. | 2504.12799 | null |
2025-04-17 | Supporting Urban Low-Altitude Economy: Channel Gain Map Inference Based on 3D Conditional GAN | Yonghao Wang et.al. | 2504.12794 | null |
2025-04-17 | ARAP-GS: Drag-driven As-Rigid-As-Possible 3D Gaussian Splatting Editing with Diffusion Prior | Xiao Han et.al. | 2504.12788 | null |
2025-04-17 | Tailoring Electromagnetic Fields in RF Cavities | Laurence Wroe et.al. | 2504.12780 | null |
2025-04-24 | 3D MHD wave propagation and energy transport in a simulated solar vortex | Samuel Skirvin et.al. | 2504.12745 | null |
2025-04-17 | TimeCapsule: Solving the Jigsaw Puzzle of Long-Term Time Series Forecasting with Compressed Predictive Representations | Yihang Lu et.al. | 2504.12721 | null |
2025-04-17 | Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving | Shumin Wang et.al. | 2504.12709 | null |
2025-04-17 | Unsupervised Cross-Domain 3D Human Pose Estimation via Pseudo-Label-Guided Global Transforms | Jingjing Liu et.al. | 2504.12699 | null |
2025-04-17 | SOPHY: Generating Simulation-Ready Objects with Physical Materials | Junyi Cao et.al. | 2504.12684 | null |
2025-04-17 | Accurate Tracking of Arabidopsis Root Cortex Cell Nuclei in 3D Time-Lapse Microscopy Images Based on Genetic Algorithm | Yu Song et.al. | 2504.12676 | link |
2025-04-18 | RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding | Hang Ji et.al. | 2504.12643 | null |
2025-04-17 | 3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation | Wenxin Chen et.al. | 2504.12599 | null |
2025-04-16 | A theoretical framework for flow-compatible reconstruction of heart motion | Francesco Capuano et.al. | 2504.12531 | null |
2025-04-16 | Continual Learning Strategies for 3D Engineering Regression Problems: A Benchmarking Study | Kaira M. Samuel et.al. | 2504.12503 | null |
2025-04-16 | MobilePoser: Real-Time Full-Body Pose Estimation and 3D Human Translation from IMUs in Mobile Consumer Devices | Vasco Xu et.al. | 2504.12492 | link |
2025-04-16 | DG-MVP: 3D Domain Generalization via Multiple Views of Point Clouds for Classification | Huantao Ren et.al. | 2504.12456 | null |
2025-04-16 | One Model to Rig Them All: Diverse Skeleton Rigging with UniRig | Jia-Peng Zhang et.al. | 2504.12451 | link |
2025-04-16 | 3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap | Minmin Yang et.al. | 2504.12442 | link |
2025-04-16 | An accurate measurement of parametric array using a spurious sound filter topologically equivalent to a half-wavelength resonator | Woongji Kim et.al. | 2504.12398 | null |
2025-04-16 | Ryu-Takayanagi Formula for Multi-Boundary Black Holes from 2D Large-\textbf{ $c$ } CFT Ensemble | Ning Bao et.al. | 2504.12388 | null |
2025-04-16 | Origin of the IRAS Vela Shell: New Insights from 3D Dust Mapping | Bore Annie Gao et.al. | 2504.12381 | null |
2025-04-16 | WORLDMEM: Long-term Consistent World Simulation with Memory | Zeqi Xiao et.al. | 2504.12369 | null |
2025-04-19 | Boundary Effects and Oxygen Deficiency-Driven Pattern Transitions in Algal Bioconvection | S. Gore et.al. | 2504.12362 | null |
2025-04-16 | Regist3R: Incremental Registration with Stereo Foundation Model | Sidun Liu et.al. | 2504.12356 | null |
2025-04-23 | Deep Generative Model-Based Generation of Synthetic Individual-Specific Brain MRI Segmentations | Ruijie Wang et.al. | 2504.12352 | link |
2025-04-15 | 3D Object Reconstruction with mmWave Radars | Samah Hussein et.al. | 2504.12348 | null |
2025-04-16 | Adapting a World Model for Trajectory Following in a 3D Game | Marko Tot et.al. | 2504.12299 | null |
2025-04-16 | SHeaP: Self-Supervised Head Geometry Predictor Learned via 2D Gaussians | Liam Schoneveld et.al. | 2504.12292 | null |
2025-04-16 | How Do I Do That? Synthesizing 3D Hand Motion and Contacts for Everyday Interactions | Aditya Prakash et.al. | 2504.12284 | null |
2025-04-16 | Wormholes with Ends of the World | Diandian Wang et.al. | 2504.12278 | null |
2025-04-16 | Towards Learning to Complete Anything in Lidar | Ayca Takmaz et.al. | 2504.12264 | null |
2025-04-16 | Stereoscopic Cylindrical Screen (SCS) Projection | Lim Ngian Xin Terry et.al. | 2504.12237 | null |
2025-04-16 | Finite time blowup for Keller-Segel equation with logistic damping in three dimensions | Jiaqi Liu et.al. | 2504.12231 | null |
2025-04-16 | Magnetically driven outflows in 3D common-envelope evolution of massive stars | Marco Vetter et.al. | 2504.12213 | null |
2025-04-16 | CoMotion: Concurrent Multi-person 3D Motion | Alejandro Newell et.al. | 2504.12186 | link |
2025-04-16 | RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning | Yuan Luo et.al. | 2504.12167 | null |
2025-04-16 | The CAM Model: An in vivo Testbed for Molecular Communication Systems | Fardad Vakilipoor et.al. | 2504.12123 | null |
2025-04-16 | Resonant x-ray scattering study of charge-density wave correlations in YBa ${2}$Cu${3}$O$_{6+x}$ under uniaxial stress | S. Nakata et.al. | 2504.12050 | null |
2025-04-16 | Epstein zeta method for many-body lattice sums | Andreas A. Buchheit et.al. | 2504.11989 | null |
2025-04-16 | R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors | Haoyang Wang et.al. | 2504.11946 | null |
2025-04-18 | Mind2Matter: Creating 3D Models from EEG Signals | Xia Deng et.al. | 2504.11936 | link |
2025-04-16 | CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting | Wei Sun et.al. | 2504.11893 | null |
2025-04-16 | Detection of wave activity within a realistic 3D MHD quiet sun simulation | George Cherry et.al. | 2504.11886 | null |
2025-04-16 | Synthetic Data for Blood Vessel Network Extraction | Joël Mathys et.al. | 2504.11858 | null |
2025-04-16 | TextDiffSeg: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation | Kangbo Ma et.al. | 2504.11825 | null |
2025-04-16 | Ultra-fast and accurate multimode waveguide design based on dataset-based eigenmode expansion method | Jaesung Song et.al. | 2504.11801 | null |
2025-04-16 | Extended Short- and Long-Range Mesh Learning for Fast and Generalized Garment Simulation | Aoran Liu et.al. | 2504.11763 | null |
2025-04-16 | GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene Supervision | Zihui Zhang et.al. | 2504.11754 | link |
2025-04-16 | Recent Advance in 3D Object and Scene Generation: A Survey | Xiang Tang et.al. | 2504.11734 | null |
2025-04-16 | DM-OSVP++: One-Shot View Planning Using 3D Diffusion Models for Active RGB-Based Object Reconstruction | Sicong Pan et.al. | 2504.11674 | link |
2025-04-15 | 3D full-GR simulations of magnetorotational core-collapse supernovae on GPUs: A systematic study of rotation rates and magnetic fields | Swapnil Shankar et.al. | 2504.11537 | null |
2025-04-11 | Do Segmentation Models Understand Vascular Structure? A Blob-Based XAI Framework | Guillaume Garret et.al. | 2504.11469 | null |
2025-04-16 | Elucidating the Design Space of Multimodal Protein Language Models | Cheng-Yen Hsieh et.al. | 2504.11454 | null |
2025-04-15 | PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond | Minghua Liu et.al. | 2504.11451 | null |
2025-04-16 | Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion | An Zhao et.al. | 2504.11447 | link |
2025-04-15 | Robust Containment Queries over Collections of Trimmed NURBS Surfaces via Generalized Winding Numbers | Jacob Spainhour et.al. | 2504.11435 | null |
2025-04-15 | Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps | Panagiotis Agrafiotis et.al. | 2504.11416 | link |
2025-04-15 | Five dimensional rotating and Quintessence black hole and their shadows | Milko Estrada et.al. | 2504.11408 | null |
2025-04-15 | Explicit and Implicit Representations in AI-based 3D Reconstruction for Radiology: A systematic literature review | Yuezhe Yang et.al. | 2504.11349 | link |
2025-04-16 | DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation | Soyoung Yoo et.al. | 2504.11347 | null |
2025-04-15 | Implicit dual time-stepping positivity-preserving entropy-stable schemes for the compressible Navier-Stokes equations | Mohammed Sayyari et.al. | 2504.11333 | null |
2025-04-15 | Crystal nucleation and growth in high-entropy alloys revealed by atomic electron tomography | Yakun Yuan et.al. | 2504.11325 | null |
2025-04-13 | Intelligent driving vehicle front multi-target tracking and detection based on YOLOv5 and point cloud 3D projection | Dayong Liu et.al. | 2504.11310 | null |
2025-04-15 | UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer | Xiang Wang et.al. | 2504.11289 | link |
2025-04-16 | 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians | Zeming Wei et.al. | 2504.11218 | link |
2025-04-15 | Low-Rank SPIKE Framework for Solving Large Sparse Linear Systems with Applications | Braegan S. Spring et.al. | 2504.11167 | null |
2025-04-15 | Super time-resolved tomography | Zhe Hu et.al. | 2504.11148 | null |
2025-04-15 | AI-guided Antibiotic Discovery Pipeline from Target Selection to Compound Identification | Maximilian G. Schuh et.al. | 2504.11091 | null |
2025-04-15 | Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation | Andrea Simonelli et.al. | 2504.11024 | null |
2025-04-21 | GATE3D: Generalized Attention-based Task-synergized Estimation in 3D* | Eunsoo Im et.al. | 2504.11014 | null |
2025-04-15 | 3D Gabor Splatting: Reconstruction of High-frequency Surface Texture using Gabor Noise | Haato Watanabe et.al. | 2504.11003 | null |
2025-04-15 | Global N-body Simulation of Gap Edge Structures Created by Perturbations from a Small Satellite Embedded in Saturn’s Rings II: The Effect of Satellite’s Orbital Eccentricity and Inclination | Naoya Torii et.al. | 2504.10989 | null |
2025-04-15 | Modeling liquid-mediated interactions for close-to-substrate magnetic microparticle transport in dynamic magnetic field landscapes | Markus Gusenbauer et.al. | 2504.10963 | null |
2025-04-15 | Bringing together invertible UNets with invertible attention modules for memory-efficient diffusion models | Karan Jain et.al. | 2504.10883 | null |
2025-04-15 | Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task | Aviral Chharia et.al. | 2504.10880 | null |
2025-04-15 | Algorithmic Advances Towards a Realizable Quantum Lattice Boltzmann Method | Apurva Tiwari et.al. | 2504.10870 | null |
2025-04-15 | AdS3 axion wormholes as stable contributions to the Euclidean gravitational path integral | Andrew Loveridge et.al. | 2504.10868 | null |
2025-04-15 | ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping | Shun Iwase et.al. | 2504.10857 | null |
2025-04-15 | Stable and High-Precision 3D Positioning via Tunable Composite-Dimensional Hong-Ou-Mandel Interference | Yongqiang Li et.al. | 2504.10843 | null |
2025-04-15 | Room-Temperature Hybrid 2D-3D Quantum Spin System for Enhanced Magnetic Sensing and Many-Body Dynamics | Haoyu Sun et.al. | 2504.10815 | null |
2025-04-17 | GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR | Christophe Bolduc et.al. | 2504.10809 | null |
2025-04-15 | 3D Wavelet Convolutions with Extended Receptive Fields for Hyperspectral Image Classification | Guandong Li et.al. | 2504.10795 | null |
2025-04-15 | Scanning-free three-dimensional fluorescent dipoles imaging by polarization self-interference digital holography (pSIDH) | Tianlong Man et.al. | 2504.10772 | null |
2025-04-15 | Three-dimensional neural network driving self-interference digital holography enables high-fidelity, non-scanning volumetric fluorescence microscopy | Tianlong Man et.al. | 2504.10769 | null |
2025-04-14 | SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models | Stathis Galanakis et.al. | 2504.10716 | null |
2025-04-14 | Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE | Jesun Firoz et.al. | 2504.10700 | null |
2025-04-14 | Dust continuum radiation maps from MHD simulations of accretion-ejection systems around single and binary stars | Somayeh Sheikhnezami et.al. | 2504.10599 | null |
2025-04-13 | Imaging Transformer for MRI Denoising: a Scalable Model Architecture that enables SNR « 1 Imaging | Hui Xue et.al. | 2504.10534 | null |
2025-04-14 | Art3D: Training-Free 3D Generation from Flat-Colored Illustration | Xiaoyan Cong et.al. | 2504.10466 | null |
2025-04-14 | HybridCollab: Unifying In-Person and Remote Collaboration for Cardiovascular Surgical Planning in Mobile Augmented Reality | Pratham Darrpan Mehta et.al. | 2504.10440 | null |
2025-04-14 | Benchmarking 3D Human Pose Estimation Models Under Occlusions | Filipa Lino et.al. | 2504.10350 | null |
2025-04-14 | Improving diffusion modeling in all-solid-state lithium batteries: a novel approach for grain boundary effects | Lena Scholz et.al. | 2504.10348 | null |
2025-04-19 | LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis | Hao Sun et.al. | 2504.10331 | null |
2025-04-14 | ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting | Huiqi Wu et.al. | 2504.10316 | null |
2025-04-14 | Existence of Nonequilibrium Glasses in the Degenerate Stealthy Hyperuniform Ground-State Manifold | Salvatore Torquato et.al. | 2504.10310 | null |
2025-04-23 | Electron-Phonon Coupling Mediated by Fröhlich Interaction in Rb2SnBr6 Perovskite | C. C. S. Soares et.al. | 2504.10292 | null |
2025-04-14 | Look-to-Touch: A Vision-Enhanced Proximity and Tactile Sensor for Distance and Geometry Perception in Robotic Manipulation | Yueshi Dong et.al. | 2504.10280 | null |
2025-04-14 | Dual Theory of Turbulent Mixing | Alexander Migdal et.al. | 2504.10205 | null |
2025-04-14 | Design Optimization of Flip FET Standard Cells with Dual-sided Pins for Ultimate Scaling | Rui Gui et.al. | 2504.10122 | null |
2025-04-14 | AGO: Adaptive Grounding for Open World 3D Occupancy Prediction | Peizheng Li et.al. | 2504.10117 | null |
2025-04-14 | SoccerNet-v3D: Leveraging Sports Broadcast Replays for 3D Scene Understanding | Marc Gutiérrez-Pérez et.al. | 2504.10106 | link |
2025-04-14 | Analyzing reduced density matrices in SU(2) Chern-Simons theory | Atesh Saini et.al. | 2504.10098 | null |
2025-04-14 | Convergence Analysis of a Stochastic Interacting Particle-Field Algorithm for 3D Parabolic-Parabolic Keller-Segel Systems | Boyi Hu et.al. | 2504.10089 | null |
2025-04-14 | Mavors: Multi-granularity Video Representation for Multimodal Large Language Model | Yang Shi et.al. | 2504.10068 | null |
2025-04-14 | Multi-Object Grounding via Hierarchical Contrastive Siamese Transformers | Chengyi Du et.al. | 2504.10048 | null |
2025-04-14 | TT3D: Table Tennis 3D Reconstruction | Thomas Gossard et.al. | 2504.10035 | null |
2025-04-14 | EBAD-Gaussian: Event-driven Bundle Adjusted Deblur Gaussian Splatting | Yufei Deng et.al. | 2504.10012 | null |
2025-04-16 | GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting | Junlin Hao et.al. | 2504.10001 | null |
2025-04-15 | OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation | Si-Tong Wei et.al. | 2504.09975 | link |
2025-04-14 | Efficient 2D to Full 3D Human Pose Uplifting including Joint Rotations | Katja Ludwig et.al. | 2504.09953 | null |
2025-04-14 | Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis | Zihao Liu et.al. | 2504.09885 | null |
2025-04-14 | NeRF-Based Transparent Object Grasping Enhanced by Shape Priors | Yi Han et.al. | 2504.09868 | null |
2025-04-14 | Carbon-Efficient 3D DNN Acceleration: Optimizing Performance and Sustainability | Aikaterini Maria Panteleaki et.al. | 2504.09851 | null |
2025-04-14 | Stiffness, strength, energy dissipation and reusability in heterogeneous architected polycrystals | Seunghwan Lee et.al. | 2504.09817 | null |
2025-04-14 | EquiVDM: Equivariant Video Diffusion Models with Temporally Consistent Noise | Chao Liu et.al. | 2504.09789 | null |
2025-04-13 | Accelerating Ray Tracing-Based Wireless Channels Generation for Real-Time Network Digital Twins | Cláudio Modesto et.al. | 2504.09751 | null |
2025-04-13 | A Full Spectrum of 3D Ferroelectric Memory Architectures Shaped by Polarization Sensing | Jiahui Duan et.al. | 2504.09713 | null |
2025-04-13 | smFISH_batchRun: A smFISH image processing tool for single-molecule RNA Detection and 3D reconstruction | Nimmy S. John et.al. | 2504.09692 | null |
2025-04-13 | LightHeadEd: Relightable & Editable Head Avatars from a Smartphone | Pranav Manu et.al. | 2504.09671 | null |
2025-04-13 | 3D in-situ profiling in a laser micromachining station using dual-comb LiDAR | Hayk Soghomonyan et.al. | 2504.09659 | null |
2025-04-13 | OmniMamba4D: Spatio-temporal Mamba for longitudinal CT lesion segmentation | Justin Namuk Kim et.al. | 2504.09655 | null |
2025-04-13 | Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding | Atharv Mahesh Mane et.al. | 2504.09623 | link |
2025-04-13 | TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting | Zhicong Wu et.al. | 2504.09588 | null |
2025-04-13 | EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler | Hao Wang et.al. | 2504.09540 | null |
2025-04-13 | 3D CoCa: Contrastive Learners are 3D Captioners | Ting Huang et.al. | 2504.09518 | link |
2025-04-13 | Capturing Longitudinal Changes in Brain Morphology Using Temporally Parameterized Neural Displacement Fields | Aisha L. Shuaibu et.al. | 2504.09514 | null |
2025-04-13 | Pillar-Voxel Fusion Network for 3D Object Detection in Airborne Hyperspectral Point Clouds | Yanze Jiang et.al. | 2504.09506 | null |
2025-04-13 | DropoutGS: Dropping Out Gaussians for Better Sparse-view Rendering | Yexing Xu et.al. | 2504.09491 | null |
2025-04-13 | Some new Liouville type theorems for 3D steady tropical climate model | Yan Fang et.al. | 2504.09423 | null |
2025-04-12 | Designing Reality-Based VR Interfaces for Geological Uncertainty | Roberta Mota et.al. | 2504.09355 | null |
2025-04-12 | Text To 3D Object Generation For Scalable Room Assembly | Sonia Laguna et.al. | 2504.09328 | null |
2025-04-12 | ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking | Tzoulio Chamiti et.al. | 2504.09195 | null |
2025-04-12 | Dynamic laboratory X-ray phase-contrast microtomography with structure-based prior regularisation | Harry Allan et.al. | 2504.09193 | null |
2025-04-12 | SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow | Qingyuan Wang et.al. | 2504.09160 | null |
2025-04-12 | MASH: Masked Anchored SpHerical Distances for 3D Shape Representation and Generation | Changhao Li et.al. | 2504.09149 | null |
2025-04-12 | A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds | Jizong Peng et.al. | 2504.09129 | null |
2025-04-12 | Optimizing FDTD Solvers for Electromagnetics: A Compiler-Guided Approach with High-Level Tensor Abstractions | Yifei He et.al. | 2504.09118 | null |
2025-04-12 | Multi-modal and Multi-view Fundus Image Fusion for Retinopathy Diagnosis via Multi-scale Cross-attention and Shifted Window Self-attention | Yonghao Huang et.al. | 2504.09106 | null |
2025-04-12 | BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting | Jeongwan On et.al. | 2504.09097 | null |
2025-04-12 | Multi-Modal Brain Tumor Segmentation via 3D Multi-Scale Self-attention and Cross-attention | Yonghao Huang et.al. | 2504.09088 | null |
2025-04-12 | RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection | Yunfei Long et.al. | 2504.09086 | null |
2025-04-12 | You Need a Transition Plane: Bridging Continuous Panoramic 3D Reconstruction with Perspective Gaussian Splatting | Zhijie Shen et.al. | 2504.09062 | null |
2025-04-12 | Multimodal 3D Genome Pre-training | Minghao Yang et.al. | 2504.09060 | null |
2025-04-15 | BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via Adaptive Block-Based Gaussian Splatting | Yongchang Wu et.al. | 2504.09048 | link |
2025-04-11 | Real-scale Smoothed Particle Hydrodynamics Tsunami Runup Modelling, with application to 3-D tsunami urban flows in Cilacap, South Java, Indonesia | Jack Dignan et.al. | 2504.09005 | null |
2025-04-11 | An in silico approach to analyse the influence of carotid haemodynamics on cardiovascular events using 3D tomographic ultrasound and computational fluid dynamics | Sampad Sengupta et.al. | 2504.08969 | null |
2025-04-11 | Holographic duality from Howe duality: Chern-Simons gravity as an ensemble of code CFTs | Anatoly Dymarsky et.al. | 2504.08724 | null |
2025-04-11 | Asteroseismic predictions for a massive main-sequence merger product | J. Henneco et.al. | 2504.08683 | null |
2025-04-11 | X2BR: High-Fidelity 3D Bone Reconstruction from a Planar X-Ray Image with Hybrid Neural Implicit Methods | Gokce Guven et.al. | 2504.08675 | null |
2025-04-11 | The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation | Masashi Hatano et.al. | 2504.08654 | null |
2025-04-11 | Rational constitutive law for the viscous stress tensor in incompressible two-phase flows: Derivation and tests against a 3D benchmark experiment | Jacques Magnaudet et.al. | 2504.08648 | null |
2025-04-11 | Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation | Davide Berghi et.al. | 2504.08644 | link |
2025-04-11 | Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging | Gabriele Lozupone et.al. | 2504.08635 | link |
2025-04-21 | Tactile sensing enables vertical obstacle negotiation for elongate many-legged robots | Juntao He et.al. | 2504.08615 | null |
2025-04-11 | FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment | Sebastián Barbas Laina et.al. | 2504.08603 | null |
2025-04-14 | Hands-On: Segmenting Individual Signs from Continuous Sequences | Low Jian He et.al. | 2504.08593 | null |
2025-04-11 | FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents | Xin Tan et.al. | 2504.08581 | null |
2025-04-15 | Recovering the polyhedral geometry of fragments | Janos Torok et.al. | 2504.08563 | null |
2025-04-11 | Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset | Zhao Dong et.al. | 2504.08541 | null |
2025-04-11 | Clifford algebras and liquid crystalline fermions | N. Johnson et.al. | 2504.08519 | null |
2025-04-11 | Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Bram Vanherle et.al. | 2504.08473 | link |
2025-04-11 | Well-Posedness of Discretizations for Fractional Elasto-Plasticity | Michael Feischl et.al. | 2504.08450 | null |
2025-04-11 | CMIP-CIL: A Cross-Modal Benchmark for Image-Point Class Incremental Learning | Chao Qi et.al. | 2504.08422 | link |
2025-04-11 | GeoTexBuild: 3D Building Model Generation from Map Footprints | Ruizhe Wang et.al. | 2504.08419 | null |
2025-04-11 | Boosting the Class-Incremental Learning in 3D Point Clouds via Zero-Collection-Cost Basic Shape Pre-Training | Chao Qi et.al. | 2504.08412 | link |
2025-04-14 | PMNI: Pose-free Multi-view Normal Integration for Reflective and Textureless Surface Reconstruction | Mingzhi Pei et.al. | 2504.08410 | link |
2025-04-11 | In-2-4D: Inbetweening from Two Single-View Images to 4D Generation | Sauradip Nag et.al. | 2504.08366 | null |
2025-04-11 | Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization | Xueping Zhang et.al. | 2504.08365 | null |
2025-04-11 | Single View Garment Reconstruction Using Diffusion Mapping Via Pattern Coordinates | Ren Li et.al. | 2504.08353 | null |
2025-04-11 | DSM: Building A Diverse Semantic Map for 3D Visual Grounding | Qinghongbing Xie et.al. | 2504.08307 | null |
2025-04-11 | Generative AI for Film Creation: A Survey of Recent Advances | Ruihan Zhang et.al. | 2504.08296 | null |
2025-04-11 | Sharp norm inflation for 3D Navier-Stokes equations in supercritical spaces | Xiaoyutao Luo et.al. | 2504.08288 | null |
2025-04-14 | RAG-VR: Leveraging Retrieval-Augmented Generation for 3D Question Answering in VR Environments | Shiyi Ding et.al. | 2504.08256 | link |
2025-04-11 | Practical Implementation of an End-to-End Methodology for SPC of 3-D Part Geometry: A Case Study | Yulin An et.al. | 2504.08243 | null |
2025-04-11 | CATCH-FORM-3D: Compliance-Aware Tactile Control and Hybrid Deformation Regulation for 3D Viscoelastic Object Manipulation | Hongjun Ma et.al. | 2504.08238 | null |
2025-04-11 | A 120 lines code for isogeometric topology optimization and its extension to 3D in MATLAB | Xianda Xie et.al. | 2504.08233 | null |
2025-04-11 | CATCH-FORM-ACTer: Compliance-Aware Tactile Control and Hybrid Deformation Regulation-Based Action Transformer for Viscoelastic Object Manipulation | Hongjun Ma et.al. | 2504.08232 | null |
2025-04-11 | Determining 3D atomic coordinates of light-element quantum materials using ptychographic electron tomography | Na Yeon Kim et.al. | 2504.08228 | null |
2025-04-18 | DrivAer Transformer: A high-precision and fast prediction method for vehicle aerodynamic drag coefficient based on the DrivAerNet++ dataset | Jiaqi He et.al. | 2504.08217 | null |
2025-04-11 | Multi-person Physics-based Pose Estimation for Combat Sports | Hossein Feiz et.al. | 2504.08175 | null |
2025-04-10 | Enhanced Cooperative Perception Through Asynchronous Vehicle to Infrastructure Framework with Delay Mitigation for Connected and Automated Vehicles | Nithish Kumar Saravanan et.al. | 2504.08172 | null |
2025-04-10 | Investigating Vision-Language Model for Point Cloud-based Vehicle Classification | Yiqiao Li et.al. | 2504.08154 | null |
2025-04-10 | Gen3DEval: Using vLLMs for Automatic Evaluation of Generated 3D Objects | Shalini Maiti et.al. | 2504.08125 | null |
2025-04-14 | Two-dimensional perovskites with maximum symmetry enable exciton diffusion length exceeding 2 micrometers | Jin Hou et.al. | 2504.08121 | null |
2025-04-10 | Towards Unconstrained 2D Pose Estimation of the Human Spine | Muhammad Saif Ullah Khan et.al. | 2504.08110 | null |
2025-04-10 | ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting | Junbang Liu et.al. | 2504.08100 | link |
2025-04-10 | Compositional Flows for 3D Molecule and Synthesis Pathway Co-design | Tony Shen et.al. | 2504.08051 | null |
2025-04-10 | Bridging Quasars and Little Red Dots: Insights into Broad-Line AGNs at $z=5-8$ from the First JWST COSMOS-3D Dataset | Xiaojing Lin et.al. | 2504.08039 | null |
2025-04-10 | The Luminosity Function and Clustering of H $α$ Emitting Galaxies at $z\approx4-6$ from a Complete NIRCam Grism Redshift Survey | Xiaojing Lin et.al. | 2504.08028 | null |
2025-04-10 | Self-Bootstrapping for Versatile Test-Time Adaptation | Shuaicheng Niu et.al. | 2504.08010 | null |
2025-04-15 | SlicerNNInteractive: A 3D Slicer extension for nnInteractive | Coen de Vente et.al. | 2504.07991 | link |
2025-04-10 | Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction | Zeren Jiang et.al. | 2504.07961 | link |
2025-04-10 | Detect Anything 3D in the Wild | Hanxue Zhang et.al. | 2504.07958 | null |
2025-04-10 | BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation | Yuanhong Yu et.al. | 2504.07955 | null |
2025-04-10 | InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians | Kefan Chen et.al. | 2504.07949 | null |
2025-04-10 | HoloPart: Generative 3D Part Amodal Segmentation | Yunhan Yang et.al. | 2504.07943 | null |
2025-04-10 | V2V3D: View-to-View Denoised 3D Reconstruction for Light-Field Microscopy | Jiayin Zhao et.al. | 2504.07853 | null |
2025-04-10 | HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss | Yi Huang et.al. | 2504.07827 | null |
2025-04-10 | Focal Cortical Dysplasia Type II Detection Using Cross Modality Transfer Learning and Grad-CAM in 3D-CNNs for MRI Analysis | Lorenzo Lasagni et.al. | 2504.07775 | null |
2025-04-10 | CTSR: Cartesian tensor-based sparse regression for data-driven discovery of high-dimensional invariant governing equations | Boqian Zhang et.al. | 2504.07618 | null |
2025-04-10 | Convexity Helps Iterated Search in 3D | Peyman Afshani et.al. | 2504.07545 | null |
2025-04-10 | DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction | Xu Zhao et.al. | 2504.07524 | null |
2025-04-10 | Laboratory Three-dimensional X-ray Micro-beam Laue Diffraction | Yubin Zhang et.al. | 2504.07452 | null |
2025-04-10 | ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement | Anning Hu et.al. | 2504.07418 | null |
2025-04-10 | Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction | Junyi Ma et.al. | 2504.07375 | link |
2025-04-10 | View-Dependent Uncertainty Estimation of 3D Gaussian Splatting | Chenyu Han et.al. | 2504.07370 | null |
2025-04-10 | Ultrahigh room-temperature hole conductivity in a perovskite cuprate with vanishing electron-correlation | Meng Wang et.al. | 2504.07369 | null |
2025-04-09 | DLTPose: 6DoF Pose Estimation From Accurate Dense Surface Point Estimates | Akash Jadhav et.al. | 2504.07335 | null |
2025-04-11 | Objaverse++: Curated 3D Object Dataset with Quality Annotations | Chendi Lin et.al. | 2504.07334 | link |
2025-04-09 | Adaptive Vision-Guided Robotic Arm Control for Precision Pruning in Dynamic Orchard Environments | Dawood Ahmed et.al. | 2504.07309 | null |
2025-04-11 | The dynamical role of optical phonons and sub-lattice screening in a solid-state ion conductor | Kim H. Pham et.al. | 2504.07249 | null |
2025-04-09 | A Pointcloud Registration Framework for Relocalization in Subterranean Environments | David Akhihiero et.al. | 2504.07231 | null |
2025-04-08 | GIGA: Generalizable Sparse Image-driven Gaussian Avatars | Anton Zubekhin et.al. | 2504.07144 | null |
2025-04-09 | Spin state of iron in I-42d-type Mg2SiO4 at ultra-high pressures | Tianqi Wan et.al. | 2504.07067 | null |
2025-04-09 | UAV Position Estimation using a LiDAR-based 3D Object Detection Method | Uthman Olawoye et.al. | 2504.07028 | null |
2025-04-09 | Glossy Object Reconstruction with Cost-effective Polarized Acquisition | Bojian Wu et.al. | 2504.07025 | null |
2025-04-09 | RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration | Omar Alama et.al. | 2504.06994 | null |
2025-04-09 | SIGMAN:Scaling 3D Human Gaussian Generation with Millions of Assets | Yuhang Yang et.al. | 2504.06982 | null |
2025-04-09 | Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting | Daiwei Zhang et.al. | 2504.06978 | null |
2025-04-09 | Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation | Yu Qi et.al. | 2504.06961 | null |
2025-04-09 | Temporal dynamics of GHz acoustic waves in chipscale phononic integrated circuits | A. Fahad Malik et.al. | 2504.06959 | null |
2025-04-09 | Longitudinal Assessment of Lung Lesion Burden in CT | Tejas Sudharshan Mathai et.al. | 2504.06924 | null |
2025-04-09 | Leveraging Anatomical Priors for Automated Pancreas Segmentation on Abdominal CT | Anisa V. Prasad et.al. | 2504.06921 | null |
2025-04-09 | S-EO: A Large-Scale Dataset for Geometry-Aware Shadow Detection in Remote Sensing Applications | Masquil Elías et.al. | 2504.06920 | null |
2025-04-09 | UKBOB: One Billion MRI Labeled Masks for Generalizable 3D Medical Image Segmentation | Emmanuelle Bourigault et.al. | 2504.06908 | null |
2025-04-09 | MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs | Jiawei Mao et.al. | 2504.06897 | null |
2025-04-09 | IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments | Can Zhang et.al. | 2504.06827 | null |
2025-04-09 | SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering | Hanxiao Sun et.al. | 2504.06815 | null |
2025-04-10 | MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection | Rishubh Parihar et.al. | 2504.06801 | null |
2025-04-09 | Hybrid machine learning models based on physical patterns to accelerate CFD simulations: a short guide on autoregressive models | Arindam Sengupta et.al. | 2504.06774 | null |
2025-04-14 | Data-Driven RANS Closures Using a Relative Importance Term Analysis Based Classifier for 2D and 3D Separated Flows | Tyler Buchanan et.al. | 2504.06758 | null |
2025-04-10 | Compass Control: Multi Object Orientation Control for Text-to-Image Generation | Rishubh Parihar et.al. | 2504.06752 | null |
2025-04-09 | Visualisation of a multidimensional point cloud as a 3D swarm of avatars | Leszek Luchowski et.al. | 2504.06751 | link |
2025-04-10 | nnLandmark: A Self-Configuring Method for 3D Medical Landmark Detection | Alexandra Ertl et.al. | 2504.06742 | null |
2025-04-09 | Timing the Escape of a Caged Electron | Connor Fields et.al. | 2504.06733 | null |
2025-04-09 | Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding | Pedro Hermosilla et.al. | 2504.06719 | link |
2025-04-09 | GSta: Efficient Training Scheme with Siestaed Gaussians for Monocular 3D Scene Reconstruction | Anil Armagan et.al. | 2504.06716 | null |
2025-04-09 | Robust and Noise-resilient Long-Term Prediction of Spatiotemporal Data Using Variational Mode Graph Neural Networks with 3D Attention | Osama Ahmad et.al. | 2504.06660 | null |
2025-04-10 | Uni-PrevPredMap: Extending PrevPredMap to a Unified Framework of Prior-Informed Modeling for Online Vectorized HD Map Construction | Nan Peng et.al. | 2504.06647 | link |
2025-04-09 | HGMamba: Enhancing 3D Human Pose Estimation with a HyperGCN-Mamba Network | Hu Cui et.al. | 2504.06638 | null |
2025-04-09 | FACT: Multinomial Misalignment Classification for Point Cloud Registration | Ludvig Dillén et.al. | 2504.06627 | null |
2025-04-09 | Human-like compositional learning of visually-grounded concepts using synthetic environments | Zijun Lin et.al. | 2504.06618 | null |
2025-04-10 | Image registration of 2D optical thin sections in a 3D porous medium: Application to a Berea sandstone digital rock image | Jaehong Chung et.al. | 2504.06604 | link |
2025-04-10 | Stochastic Ray Tracing of 3D Transparent Gaussians | Xin Sun et.al. | 2504.06598 | null |
2025-04-11 | ASHiTA: Automatic Scene-grounded HIerarchical Task Analysis | Yun Chang et.al. | 2504.06553 | null |
2025-04-08 | Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring | José A. Pilartes-Congo et.al. | 2504.06464 | null |
2025-04-08 | Electric-Field-Controlled Chemical Reaction via Piezo-Chemistry Creates Programmable Material Stiffness | Jun Wang et.al. | 2504.06405 | null |
2025-04-10 | Fast Globally Optimal and Geometrically Consistent 3D Shape Matching | Paul Roetzer et.al. | 2504.06385 | null |
2025-04-08 | Automated Fabrication of Magnetic Soft Microrobots | Kaitlyn Clancy et.al. | 2504.06370 | null |
2025-04-08 | MACER3D – an upgrade of MACER2D with enhanced subgrid models and gas physics – and its application to simulating AGN feedback in a massive elliptical galaxy | Haoen Zhang et.al. | 2504.06342 | null |
2025-04-08 | The likelihood of not detecting cavity-carving companions in transition discs – A statistical approach | Enrico Ragusa et.al. | 2504.06337 | null |
2025-04-13 | Conformal Slit Mapping Based Spiral Tool Trajectory Planning for Ball-end Milling on Complex Freeform Surfaces | Changqing Shen et.al. | 2504.06310 | null |
2025-04-11 | AI-Driven Reconstruction of Large-Scale Structure from Combined Photometric and Spectroscopic Surveys | Wenying Du et.al. | 2504.06309 | null |
2025-04-08 | D^2USt3R: Enhancing 3D Reconstruction with 4D Pointmaps for Dynamic Scenes | Jisang Han et.al. | 2504.06264 | null |
2025-04-08 | HiMoR: Monocular Deformable Gaussian Reconstruction with Hierarchical Motion Representation | Yiming Liang et.al. | 2504.06210 | null |
2025-04-08 | Factorizing Defects from Generalized Pinning Fields | Fedor K. Popov et.al. | 2504.06203 | null |
2025-04-08 | Flash Sculptor: Modular 3D Worlds from Objects | Yujia Hu et.al. | 2504.06178 | null |
2025-04-08 | 3D evolution of protein networks and lipid globules in heat-treated egg yolk | Felix Wittwer et.al. | 2504.06032 | null |
2025-04-08 | CamContextI2V: Context-aware Controllable Video Generation | Luis Denninger et.al. | 2504.06022 | link |
2025-04-08 | econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians | Can Zhang et.al. | 2504.06003 | null |
2025-04-08 | Under-Sampled High-Dimensional Data Recovery via Symbiotic Multi-Prior Tensor Reconstruction | Jie Yang et.al. | 2504.05992 | null |
2025-04-08 | Modular Soft Wearable Glove for Real-Time Gesture Recognition and Dynamic 3D Shape Reconstruction | Huazhi Dong et.al. | 2504.05983 | null |
2025-04-10 | An Empirical Study of GPT-4o Image Generation Capabilities | Sixiang Chen et.al. | 2504.05979 | link |
2025-04-08 | AVP-AP: Self-supervised Automatic View Positioning in 3D cardiac CT via Atlas Prompting | Xiaolin Fan et.al. | 2504.05966 | null |
2025-04-08 | Deep RL-based Autonomous Navigation of Micro Aerial Vehicles (MAVs) in a complex GPS-denied Indoor Environment | Amit Kumar Singh et.al. | 2504.05918 | null |
2025-04-08 | PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario | Sriram Mandalika et.al. | 2504.05908 | null |
2025-04-08 | UVG-VPC: Voxelized Point Cloud Dataset for Visual Volumetric Video-based Coding | Guillaume Gautier et.al. | 2504.05888 | null |
2025-04-08 | Jointly-optimized Trajectory Generation and Camera Control for 3D Coverage Planning | Savvas Papaioannou et.al. | 2504.05887 | null |
2025-04-08 | Rolling Horizon Coverage Control with Collaborative Autonomous Agents | Savvas Papaioannou et.al. | 2504.05883 | null |
2025-04-08 | Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques | Luca Barco et.al. | 2504.05882 | null |
2025-04-08 | Fast Sphericity and Roundness approximation in 2D and 3D using Local Thickness | Pawel Tomasz Pieta et.al. | 2504.05808 | null |
2025-04-08 | Space-averaged non-equilibrium Green’s function approach for quantum transport in 3D | Vahid Mosallanejad et.al. | 2504.05788 | null |
2025-04-08 | How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM | Jirong Zha et.al. | 2504.05786 | null |
2025-04-08 | Kronecker scaling of tensors with applications to arithmetic circuits and algorithms | Andreas Björklund et.al. | 2504.05772 | null |
2025-04-08 | InvNeRF-Seg: Fine-Tuning a Pre-Trained NeRF for 3D Object Segmentation | Jiangsan Zhao et.al. | 2504.05751 | null |
2025-04-08 | Micro-splatting: Maximizing Isotropic Constraints for Refined Optimization in 3D Gaussian Splatting | Jee Won Lee et.al. | 2504.05740 | null |
2025-04-08 | SAP-CoPE: Social-Aware Planning using Cooperative Pose Estimation with Infrastructure Sensor Nodes | Minghao Ning et.al. | 2504.05727 | link |
2025-04-08 | QEMesh: Employing A Quadric Error Metrics-Based Representation for Mesh Generation | Jiaqi Li et.al. | 2504.05720 | null |
2025-04-08 | POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction | Songyan Zhang et.al. | 2504.05692 | link |
2025-04-08 | POD: Predictive Object Detection with Single-Frame FMCW LiDAR Point Cloud | Yining Shi et.al. | 2504.05649 | null |
2025-04-11 | A Multi-Modal AI System for Screening Mammography: Integrating 2D and 3D Imaging to Improve Breast Cancer Detection in a Prospective Clinical Study | Jungkyu Park et.al. | 2504.05636 | link |
2025-04-08 | Maternal and Fetal Health Status Assessment by Using Machine Learning on Optical 3D Body Scans | Ruting Cheng et.al. | 2504.05627 | null |
2025-04-08 | PyTopo3D: A Python Framework for 3D SIMP-based Topology Optimization | Jihoon Kim et.al. | 2504.05604 | link |
2025-04-14 | TAPNext: Tracking Any Point (TAP) as Next Token Prediction | Artem Zholus et.al. | 2504.05579 | null |
2025-04-07 | Improved Stochastic Texture Filtering Through Sample Reuse | Bartlomiej Wronski et.al. | 2504.05562 | null |
2025-04-07 | View-Dependent Deformation Fields for 2D Editing of 3D Models | Martin El Mqirmi et.al. | 2504.05544 | null |
2025-04-15 | Core-Excited States of Linear and Bent Uranyl Complexes: Insights from High-Energy Resolution X-ray Spectroscopy and Relativistic Quantum Chemistry | Wilken Aldair Misael et.al. | 2504.05542 | null |
2025-04-07 | L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery | Yi-Zhen Tsai et.al. | 2504.05517 | link |
2025-04-07 | SPARK-Remote: A Cost-Effective System for Remote Bimanual Robot Teleoperation | Adam Imdieke et.al. | 2504.05488 | null |
2025-04-07 | Imperative vs. Declarative Programming Paradigms for Open-Universe Scene Generation | Maxim Gumin et.al. | 2504.05482 | null |
2025-04-04 | Optimizing 4D Gaussians for Dynamic Scene Video from Single Landscape Images | In-Hwan Jin et.al. | 2504.05458 | link |
2025-04-07 | Biomechanical Constraints Assimilation in Deep-Learning Image Registration: Application to sliding and locally rigid deformations | Ziad Kheil et.al. | 2504.05444 | null |
2025-04-07 | GARF: Learning Generalizable 3D Reassembly for Real-World Fractures | Sihang Li et.al. | 2504.05400 | null |
2025-04-07 | Classifying Isolated Symplectic Singularities via 3d $\mathcal{N}=4$ Coulomb Branches | Antoine Bourget et.al. | 2504.05373 | null |
2025-04-07 | InteractVLM: 3D Interaction Reasoning from 2D Foundational Models | Sai Kumar Dwivedi et.al. | 2504.05303 | link |
2025-04-07 | Let it Snow! Animating Static Gaussian Scenes With Dynamic Weather Effects | Gal Fiebelman et.al. | 2504.05296 | null |
2025-04-13 | Non-local charges from perturbed defects via SymTFT in 2d CFT | Federico Ambrosino et.al. | 2504.05277 | null |
2025-04-07 | Texture2LoD3: Enabling LoD3 Building Reconstruction With Panoramic Images | Wenzhao Tang et.al. | 2504.05249 | null |
2025-04-07 | Hybrid machine learning data assimilation for marine biogeochemistry | Ieuan Higgs et.al. | 2504.05218 | null |
2025-04-07 | 3D Universal Lesion Detection and Tagging in CT with Self-Training | Jared Frazier et.al. | 2504.05201 | null |
2025-04-07 | Cellular Network Design for UAV Corridors via Data-driven High-dimensional Bayesian Optimization | Mohamed Benzaghta et.al. | 2504.05176 | null |
2025-04-07 | SSLFusion: Scale & Space Aligned Latent Fusion Model for Multimodal 3D Object Detection | Bonan Ding et.al. | 2504.05170 | null |
2025-04-07 | PanoDreamer: Consistent Text to 360-Degree Scene Generation | Zhexiao Xiong et.al. | 2504.05152 | null |
2025-04-07 | Oscillatory flows in three-dimensional deformable microchannels | Anxu Huang et.al. | 2504.05132 | null |
2025-04-07 | TDFANet: Encoding Sequential 4D Radar Point Clouds Using Trajectory-Guided Deformable Feature Aggregation for Place Recognition | Shouyi Lu et.al. | 2504.05103 | null |
2025-04-07 | PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition | Jie Wang et.al. | 2504.05075 | null |
2025-04-07 | MotionPRO: Exploring the Role of Pressure in Human MoCap and Beyond | Shenghao Ren et.al. | 2504.05046 | null |
2025-04-07 | Joint BS Deployment and Power Optimization for Minimum EMF Exposure with RL in Real-World Based Urban Scenario | Xueyun Long et.al. | 2504.05017 | null |
2025-04-07 | Scalable chip-based 3D ion traps | Elena Jordan et.al. | 2504.04946 | null |
2025-04-07 | IterMask3D: Unsupervised Anomaly Detection and Segmentation with Test-Time Iterative Mask Refinement in 3D Brain MR | Ziyun Liang et.al. | 2504.04911 | null |
2025-04-15 | Null geodesics around a magnetized Kiselev black hole | Vitalie Lungu et.al. | 2504.04905 | null |
2025-04-07 | SLIDE: Automated Identification and Quantification of Grain Boundary Sliding and Opening in 3D | C. J. A. Mornout et.al. | 2504.04898 | null |
2025-04-07 | Analysis and Computation of Geodesic Distances on Reductive Homogeneous Spaces | Remco Duits et.al. | 2504.04878 | null |
2025-04-16 | 3D Gaussian Particle Approximation of VDB Datasets: A Study for Scientific Visualization | Isha Sharma et.al. | 2504.04857 | null |
2025-04-07 | Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM | Zhicong Sun et.al. | 2504.04844 | link |
2025-04-07 | OCC-MLLM-CoT-Alpha: Towards Multi-stage Occlusion Recognition Based on Large Language Models via 3D-Aware Supervision and Chain-of-Thoughts Guidance | Chaoyi Wang et.al. | 2504.04781 | null |
2025-04-07 | Bidirectional Hierarchical Protein Multi-Modal Representation Learning | Xuefeng Liu et.al. | 2504.04770 | null |
2025-04-10 | CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images | Cheng Chen et.al. | 2504.04753 | null |
2025-04-07 | Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions | He Zhu et.al. | 2504.04744 | null |
2025-04-07 | Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection | Zhenxing Ming et.al. | 2504.04732 | null |
2025-04-07 | DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation | Bo-Wen Yin et.al. | 2504.04701 | link |
2025-04-07 | DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal | Wanzhou Liu et.al. | 2504.04679 | null |
2025-04-07 | 3DM-WeConvene: Learned Image Compression with 3D Multi-Level Wavelet-Domain Convolution and Entropy Model | Haisheng Fu et.al. | 2504.04658 | link |
2025-04-07 | EquiCPI: SE(3)-Equivariant Geometric Deep Learning for Structure-Aware Prediction of Compound-Protein Interactions | Ngoc-Quang Nguyen et.al. | 2504.04654 | link |
2025-04-07 | Basic Pattern of Three-dimensional Magnetic Reconnection within Strongly Turbulent Current Sheets | Yulei Wang et.al. | 2504.04648 | null |
2025-04-06 | DanceMosaic: High-Fidelity Dance Generation with Multimodal Editability | Foram Niravbhai Shah et.al. | 2504.04634 | null |
2025-04-06 | Tool-as-Interface: Learning Robot Policies from Human Tool Usage through Imitation Learning | Haonan Chen et.al. | 2504.04612 | null |
2025-04-06 | Targetless LiDAR-Camera Calibration with Anchored 3D Gaussians | Haebeom Jung et.al. | 2504.04597 | null |
2025-04-06 | DyCON: Dynamic Uncertainty-aware Consistency and Contrastive Learning for Semi-supervised Medical Image Segmentation | Maregu Assefa et.al. | 2504.04566 | null |
2025-04-10 | GPU Volume Rendering with Hierarchical Compression Using VDB | Stefan Zellmann et.al. | 2504.04564 | null |
2025-04-06 | The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models? | Weichen Zhang et.al. | 2504.04540 | null |
2025-04-06 | GAMBAS: Generalised-Hilbert Mamba for Super-resolution of Paediatric Ultra-Low-Field MRI | Levente Baljer et.al. | 2504.04523 | link |
2025-04-06 | Spatial-Geometry Enhanced 3D Dynamic Snake Convolutional Neural Network for Hyperspectral Image Classification | Guandong Li et.al. | 2504.04463 | null |
2025-04-06 | PRISM: Probabilistic Representation for Integrated Shape Modeling and Generation | Lei Cheng et.al. | 2504.04454 | null |
2025-04-06 | Prot42: a Novel Family of Protein Language Models for Target-aware Protein Binder Generation | Mohammad Amaan Sayeed et.al. | 2504.04453 | null |
2025-04-06 | Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models | Etienne Chassaing et.al. | 2504.04448 | null |
2025-04-10 | A Convex and Global Solution for the P $n$ P Problem in 2D Forward-Looking Sonar | Jiayi Su et.al. | 2504.04445 | null |
2025-04-06 | Momentum imaging and kinetic energy release measurements for various fragmentation pathways in MeV energy proton collision with $SO_2$ molecule | Sandeep Bajrangi Bari et.al. | 2504.04441 | null |
2025-04-06 | Deliberate Planning of 3D Bin Packing on Packing Configuration Trees | Hang Zhao et.al. | 2504.04421 | null |
2025-04-06 | Non-equilibrium Dynamics and Universality of 4D Quantum Vortices and Turbulence | Wei-can Yang et.al. | 2504.04409 | null |
2025-04-06 | Exact large $N$ expansion of $\mathcal{N}=4$ circular quiver Chern-Simons theories and squashing | Naotaka Kubo et.al. | 2504.04402 | null |
2025-04-06 | OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning | Shihao Wang et.al. | 2504.04348 | null |
2025-04-06 | MedM-VL: What Makes a Good Medical LVLM? | Yiming Shi et.al. | 2504.04323 | link |
2025-04-05 | 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS | Zhisheng Huang et.al. | 2504.04294 | null |
2025-04-05 | A Self-Supervised Learning Approach with Differentiable Optimization for UAV Trajectory Planning | Yufei Jiang et.al. | 2504.04289 | null |
2025-04-05 | Interpretable Single-View 3D Gaussian Splatting using Unsupervised Hierarchical Disentangled Representation Learning | Yuyang Zhang et.al. | 2504.04190 | null |
2025-04-05 | Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization | Yikai Wang et.al. | 2504.04153 | link |
2025-04-05 | A tri-static ground-based laser ranging method for precise satellite attitude determination | Peter Bartram et.al. | 2504.04140 | null |
2025-04-05 | Macroscopic ground state degeneracy of the ferro-antiferromagnetic Heisenberg model on diamond-decorated lattices | D. V. Dmitriev et.al. | 2504.04129 | null |
2025-04-05 | Multi-identity Human Image Animation with Structural Video Diffusion | Zhenzhi Wang et.al. | 2504.04126 | null |
2025-04-05 | View2CAD: Reconstructing View-Centric CAD Models from Single RGB-D Scans | James Noeckel et.al. | 2504.04000 | null |
2025-04-04 | DeepOHeat-v1: Efficient Operator Learning for Fast and Trustworthy Thermal Simulation and Optimization in 3D-IC Design | Xinling Yu et.al. | 2504.03955 | null |
2025-04-04 | Particle acceleration by turbulent-driven magnetic reconnection and the production of gamma-rays and neutrinos in AGNs | E. M. de Gouveia Dal Pino et.al. | 2504.03922 | null |
2025-04-04 | The effects of elastic and inelastic collisions in two- and three-body interactions on the stability of 3D Bose-Einstein condensates | R. Sasireka et.al. | 2504.03905 | null |
2025-04-04 | 3D Scene Understanding Through Local Random Access Sequence Modeling | Wanhee Lee et.al. | 2504.03875 | null |
2025-04-04 | A posteriori closure of turbulence models: are symmetries preserved ? | André Freitas et.al. | 2504.03870 | null |
2025-04-04 | CREASE-2D Analysis of Small Angle X-ray Scattering Data from Supramolecular Dipeptide Systems | Nitant Gupta et.al. | 2504.03869 | link |
2025-04-04 | Metal-rich stellar counterpart of the Radcliffe Wave and the 3D chemical footprints of the Milky Way spiral arms | Luis Martinez-Medina et.al. | 2504.03843 | null |
2025-04-04 | Discrete Gauging of 6d SCFTs and Wreathed 3d $\mathcal{N}=4$ Quivers | Craig Lawrie et.al. | 2504.03830 | null |
2025-04-04 | Meshing of High-Dimensional Toroidal Manifolds from Quasi-Periodic Three-Body Problem Dynamics using Parameterization via Discrete One-Forms | Dante Basile et.al. | 2504.03791 | null |
2025-04-04 | Robust Human Registration with Body Part Segmentation on Noisy Point Clouds | Kai Lascheit et.al. | 2504.03602 | null |
2025-04-04 | MedSAM2: Segment Anything in 3D Medical Images and Videos | Jun Ma et.al. | 2504.03600 | link |
2025-04-04 | AdaViT: Adaptive Vision Transformer for Flexible Pretrain and Finetune with Variable 3D Medical Image Modalities | Badhan Kumar Das et.al. | 2504.03589 | null |
2025-04-04 | PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector | Kaidong Li et.al. | 2504.03563 | null |
2025-04-04 | HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration | Boyuan Wang et.al. | 2504.03536 | null |
2025-04-04 | Electromagnetic homogenization of particulate composite materials comprising spheroids and truncated spheroids with orientational distribution | Héctor M. Iga-Buitrón et.al. | 2504.03530 | null |
2025-04-04 | Quenching through the QCD chiral phase transition | Adrien Florio et.al. | 2504.03514 | null |
2025-04-04 | D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations | Antoine Dumoulin et.al. | 2504.03468 | null |
2025-04-07 | ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving | Sheng Yang et.al. | 2504.03438 | null |
2025-04-04 | NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices | Zhe Wang et.al. | 2504.03415 | null |
2025-04-04 | Error estimates of an exponential wave integrator for the nonlinear Schrödinger equation with singular potential | Weizhu Bao et.al. | 2504.03346 | null |
2025-04-04 | Evolution of interacting coronal mass ejections driving the great geomagnetic storm on 10 May 2024 | Soumyaranjan Khuntia et.al. | 2504.03335 | null |
2025-04-04 | TQD-Track: Temporal Query Denoising for 3D Multi-Object Tracking | Shuxiao Ding et.al. | 2504.03258 | null |
2025-04-04 | Unlocking Neural Transparency: Jacobian Maps for Explainable AI in Alzheimer’s Detection | Yasmine Mustafa et.al. | 2504.03230 | null |
2025-04-04 | Enhanced hot electron generation from liquid jets in moderate intensity laser-plasma interactions | Ratul Sabui et.al. | 2504.03217 | null |
2025-04-04 | Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video | Jiaxin Guo et.al. | 2504.03198 | null |
2025-04-07 | NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving | Kexin Tian et.al. | 2504.03164 | null |
2025-04-09 | Joint Retrieval of Cloud properties using Attention-based Deep Learning Models | Zahid Hassan Tushar et.al. | 2504.03133 | null |
2025-04-04 | GraphSeg: Segmented 3D Representations via Graph Edge Addition and Contraction | Haozhan Tang et.al. | 2504.03129 | link |
2025-04-04 | Dispersion-Engineered Compact Twisted Metasurfaces Enabling 3D Frequency-Reconfigurable Holography | Cheng Pang et.al. | 2504.03115 | null |
2025-04-04 | Learning Human Perspective in Line Drawings from Single Sketches | Jinfan Yang et.al. | 2504.03099 | null |
2025-04-04 | Single-Satellite Navigation on Lunar North Pole | Tim Gong et.al. | 2504.03091 | null |
2025-04-08 | Compressing 3D Gaussian Splatting by Noise-Substituted Vector Quantization | Haishan Wang et.al. | 2504.03059 | link |
2025-04-03 | Cooperative Inference for Real-Time 3D Human Pose Estimation in Multi-Device Edge Networks | Hyun-Ho Choi et.al. | 2504.03052 | link |
2025-04-03 | A Review of Prototyping in XR: Linking Extended Reality to Digital Fabrication | Bixun Chen et.al. | 2504.02998 | null |
2025-04-03 | Elastic instability of wormlike micelle solution flow in serpentine channels | Emily Y. Chen et.al. | 2504.02951 | null |
2025-04-03 | LiDAR-based Object Detection with Real-time Voice Specifications | Anurag Kulkarni et.al. | 2504.02920 | link |
2025-04-02 | Exploring the Capabilities of LLMs for IMU-based Fine-grained Human Activity Understanding | Lilin Xu et.al. | 2504.02878 | null |
2025-03-31 | Computer Vision and Deep Learning for 4D Augmented Reality | Karthik Shivashankar et.al. | 2504.02860 | null |
2025-04-03 | From moving groups to star formation in the Solar Neighborhood | Cameren Swiggum et.al. | 2504.02825 | null |
2025-04-03 | Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization | Kangle Deng et.al. | 2504.02817 | null |
2025-04-03 | BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation | Van Nguyen Nguyen et.al. | 2504.02812 | null |
2025-04-03 | Spline-based Transformers | Prashanth Chandran et.al. | 2504.02797 | null |
2025-04-03 | A 3D view of dwarf galaxies with Gaia and VLT/FLAMES II. The Sextans dwarf spheroidal | Eline Tolstoy et.al. | 2504.02787 | null |
2025-04-03 | Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model | Shengjun Zhang et.al. | 2504.02764 | null |
2025-04-03 | MD-ProjTex: Texturing 3D Shapes with Multi-Diffusion Projection | Ahmet Burak Yildirim et.al. | 2504.02762 | null |
2025-04-03 | Stability of acoustic streaming jets | Bjarne Vincent et.al. | 2504.02756 | null |
2025-04-03 | GEOPARD: Geometric Pretraining for Articulation Prediction in 3D Shapes | Pradyumn Goyal et.al. | 2504.02747 | null |
2025-04-03 | Parity violation as enforced symmetry breaking in 3D fermionic topological order | Shang-Qiang Ning et.al. | 2504.02736 | null |
2025-04-03 | Vortex Flows in the Solar Atmosphere: Detection and Heating Mechanisms in 3D MHD Numerical Simulations | M. Koll Pistarini et.al. | 2504.02729 | null |
2025-04-03 | Anisotropy analysis of bamboo and tooth using 4-angle polarization micro-spectroscopy | Meguya Ryu et.al. | 2504.02711 | null |
2025-04-03 | Handover and SINR-Aware Path Optimization in 5G-UAV mmWave Communication using DRL | Achilles Kiwanuka Machumilane et.al. | 2504.02688 | null |
2025-04-03 | Two-Stage nnU-Net for Automatic Multi-class Bi-Atrial Segmentation from LGE-MRIs | Y. On et.al. | 2504.02668 | null |
2025-04-03 | UAV-Assisted 5G Networks: Mobility-Aware 3D Trajectory Optimization and Resource Allocation for Dynamic Environments | Asad Mahmood et.al. | 2504.02613 | null |
2025-04-03 | Time resolution limits in silicon sensors from Landau fluctuations and electronics noise | Werner Riegler et.al. | 2504.02570 | null |
2025-04-05 | A 3D-1D-0D Multiscale Model of the Neuro-Glial-Vascular Unit for Synaptic and Vascular Dynamics in the Dorsal Vagal Complex | Alexander Hermann et.al. | 2504.02540 | null |
2025-04-03 | MultiNeRF: Multiple Watermark Embedding for Neural Radiance Fields | Yash Kulthe et.al. | 2504.02517 | null |
2025-04-03 | A Memory-Augmented LLM-Driven Method for Autonomous Merging of 3D Printing Work Orders | Yuhao Liu et.al. | 2504.02509 | null |
2025-04-03 | Graph Attention-Driven Bayesian Deep Unrolling for Dual-Peak Single-Photon Lidar Imaging | Kyungmin Choi et.al. | 2504.02480 | null |
2025-04-03 | Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision | Xiaofeng Han et.al. | 2504.02477 | null |
2025-04-03 | RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects | Soumyaratna Debnath et.al. | 2504.02465 | null |
2025-04-03 | CornerPoint3D: Look at the Nearest Corner Instead of the Center | Ruixiao Zhang et.al. | 2504.02464 | null |
2025-04-03 | MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM | Renwu Li et.al. | 2504.02437 | null |
2025-04-03 | First observation of ultra-long-range azimuthal correlations in low multiplicity pp and p-Pb collisions at the LHC | ALICE Collaboration et.al. | 2504.02359 | null |
2025-04-03 | LPA3D: 3D Room-Level Scene Generation from In-the-Wild Images | Ming-Jia Yang et.al. | 2504.02337 | null |
2025-04-03 | ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation | Yuan Zhou et.al. | 2504.02316 | link |
2025-04-03 | Solving adhesive rough contact problems with Atomic Force Microscope data | Maria Rosaria Marulli et.al. | 2504.02307 | null |
2025-04-03 | MinkOcc: Towards real-time label-efficient semantic occupancy prediction | Samuel Sze et.al. | 2504.02270 | null |
2025-04-03 | WonderTurbo: Generating Interactive 3D World in 0.72 Seconds | Chaojun Ni et.al. | 2504.02261 | null |
2025-04-03 | In-situ three-dimensional strain engineering of solid-state quantum emitters in photonic structures towards scalable quantum networks | Yan Chen et.al. | 2504.02257 | null |
2025-04-02 | Preference-Driven Active 3D Scene Representation for Robotic Inspection in Nuclear Decommissioning | Zhen Meng et.al. | 2504.02161 | null |
2025-04-02 | UAVTwin: Neural Digital Twins for UAVs using Gaussian Splatting | Jaehoon Choi et.al. | 2504.02158 | null |
2025-04-02 | Three-dimensional non-LTE radiative transfer effects in Fe I lines IV. Line formation at high spatial resolution | R. Holzreuter et.al. | 2504.02092 | null |
2025-04-02 | A Chefs KISS – Utilizing semantic information in both ICP and SLAM framework | Sven Ochs et.al. | 2504.02086 | null |
2025-04-02 | Evaluation of Flight Parameters in UAV-based 3D Reconstruction for Rooftop Infrastructure Assessment | Nick Chodura et.al. | 2504.02084 | null |
2025-04-08 | Planet Earth in reflected and polarized light I. 3D radiative transfer simulations of realistic surface-atmosphere systems | Giulia Roccetti et.al. | 2504.02048 | null |
2025-04-02 | WorldPrompter: Traversable Text-to-Scene Generation | Zhaoyang Zhang et.al. | 2504.02045 | null |
2025-04-02 | Weak-lensing tunnel voids in simulated light-cones: a new pipeline to investigate modified gravity and massive neutrinos signatures | Leonardo Maggiore et.al. | 2504.02041 | null |
2025-04-01 | OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF | Jingyu Shi et.al. | 2504.02007 | null |
2025-04-01 | Real-Time Navigation for Autonomous Aerial Vehicles Using Video | Khizar Anjum et.al. | 2504.01996 | null |
2025-03-30 | Multi-Dimensional AGV Path Planning in 3D Warehouses Using Ant Colony Optimization and Advanced Neural Networks | Bo Zhang et.al. | 2504.01985 | null |
2025-04-02 | Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis | Niluthpol Chowdhury Mithun et.al. | 2504.01960 | null |
2025-04-03 | Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting | Shu-Wei Lu et.al. | 2504.01957 | null |
2025-04-03 | VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step | Hanyang Wang et.al. | 2504.01956 | null |
2025-04-02 | Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness | Haochen Wang et.al. | 2504.01901 | null |
2025-04-02 | CoMatcher: Multi-View Collaborative Feature Matching | Jintao Zhang et.al. | 2504.01872 | null |
2025-04-02 | Focal Mechanism Uncertainty Quantification In Ground Motion Simulations Of Le Teil Earthquake | Valeria Soto et.al. | 2504.01868 | null |
2025-04-02 | BOGausS: Better Optimized Gaussian Splatting | Stéphane Pateux et.al. | 2504.01844 | null |
2025-04-02 | BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing | Yunqi Gu et.al. | 2504.01786 | link |
2025-04-02 | Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation | Mingrui Ye et.al. | 2504.01764 | link |
2025-04-09 | FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking | Ulas Gunes et.al. | 2504.01732 | null |
2025-04-03 | DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance | Yuxuan Luo et.al. | 2504.01724 | null |
2025-04-12 | Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation | Junjie Chen et.al. | 2504.01668 | null |
2025-04-12 | Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks | Haosheng Li et.al. | 2504.01659 | null |
2025-04-12 | ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation | Haosheng Li et.al. | 2504.01648 | null |
2025-04-02 | FlowR: Flowing from Sparse to Dense 3D Reconstructions | Tobias Fischer et.al. | 2504.01647 | null |
2025-04-02 | 3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting | Hao Wu et.al. | 2504.01619 | null |
2025-04-02 | LL-Localizer: A Life-Long Localization System based on Dynamic i-Octree | Xinyi Li et.al. | 2504.01583 | link |
2025-04-02 | RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars | Yahui Li et.al. | 2504.01559 | null |
2025-04-02 | Rapid Muon Tomography for Border Security | Anzori Sh. Georgadze et.al. | 2504.01525 | null |
2025-04-02 | High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model | Yiyang Shen et.al. | 2504.01512 | null |
2025-04-02 | A computational framework for evaluating tire-asphalt hysteretic friction including pavement roughness | Ivana Ban et.al. | 2504.01511 | null |
2025-04-02 | Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment | Ziteng Cui et.al. | 2504.01503 | link |
2025-04-02 | Shape derivative for the Dirichlet-to-Neumann operator on a manifold and application to cellular protrusion | F Noisette et.al. | 2504.01493 | null |
2025-04-02 | GarmageNet: A Dataset and Scalable Representation for Generic Garment Modeling | Siran Li et.al. | 2504.01483 | null |
2025-04-02 | Enhanced Cross-modal 3D Retrieval via Tri-modal Reconstruction | Junlong Ren et.al. | 2504.01476 | null |
2025-04-09 | Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes | Kaiwei Zhang et.al. | 2504.01466 | link |
2025-04-02 | Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement | Zaipeng Duan et.al. | 2504.01449 | null |
2025-04-02 | MuTri: Multi-view Tri-alignment for OCT to OCTA 3D Image Translation | Zhuangzhuang Chen et.al. | 2504.01428 | link |
2025-04-02 | DF-Calib: Targetless LiDAR-Camera Calibration via Depth Flow | Shu Han et.al. | 2504.01416 | null |
2025-04-02 | 3D Gaussian Inverse Rendering with Approximated Global Illumination | Zirui Wu et.al. | 2504.01358 | null |
2025-04-02 | FlowMotion: Target-Predictive Flow Matching for Realistic Text-Driven Human Motion Generation | Manolo Canales Cuba et.al. | 2504.01338 | null |
2025-04-03 | Direction-Aware Hybrid Representation Learning for 3D Hand Pose and Shape Estimation | Shiyong Liu et.al. | 2504.01298 | null |
2025-04-02 | Nishida-Smoller type large solutions for the compressible Navier-Stokes equations with slip boundary conditions in 3D exterior domains | Minghong Xie et.al. | 2504.01288 | null |
2025-04-02 | A Retina-Inspired Pathway to Real-Time Motion Prediction inside Image Sensors for Extreme-Edge Intelligence | Subhradip Chakraborty et.al. | 2504.01275 | null |
2025-04-02 | Bayesian critical points in classical lattice models | Adam Nahum et.al. | 2504.01264 | null |
2025-04-01 | A New Approach to Motion Planning in 3D for a Dubins Vehicle: Special Case on a Sphere | Deepak Prakash Kumar et.al. | 2504.01215 | link |
2025-04-01 | Articulated Kinematics Distillation from Video Diffusion Models | Xuan Li et.al. | 2504.01204 | null |
2025-04-09 | Towards Signed Distance Function based Metamaterial Design: Neural Operator Transformer for Forward Prediction and Diffusion Model for Inverse Design | Qibang Liu et.al. | 2504.01195 | link |
2025-04-01 | 3D printing for teaching and exploration in astronomy for individuals with blindness/visual impairment: textured representations of imagery | Carol Christian et.al. | 2504.01161 | null |
2025-04-01 | Corrected Trapezoidal Rules for Near-Singular Surface Integrals Applied to 3D Interfacial Stokes Flow | Monika Nitsche et.al. | 2504.01144 | null |
2025-04-01 | Combining Extended Convolutional Autoencoders and Reservoir Computing for Accurate Reduced-Order Predictions of Atmospheric Flows | Arash Hajisharifi et.al. | 2504.01097 | null |
2025-04-01 | Geometric Programming for 3D Circuits | Rongbiao Wang et.al. | 2504.01090 | null |
2025-03-28 | Mesh Compression with Quantized Neural Displacement Fields | Sai Karthikey Pentapati et.al. | 2504.01027 | null |
2025-03-27 | Gaze-Guided 3D Hand Motion Prediction for Detecting Intent in Egocentric Grasping Tasks | Yufei He et.al. | 2504.01024 | null |
2025-03-26 | Omnidirectional Depth-Aided Occupancy Prediction based on Cylindrical Voxel for Autonomous Driving | Chaofan Wu et.al. | 2504.01023 | null |
2025-04-01 | GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors | Tian-Xing Xu et.al. | 2504.01016 | null |
2025-04-01 | SuperDec: 3D Scene Decomposition with Superquadric Primitives | Elisabetta Fedele et.al. | 2504.00992 | null |
2025-04-01 | WorldScore: A Unified Evaluation Benchmark for World Generation | Haoyi Duan et.al. | 2504.00983 | null |
2025-04-07 | Neural Pruning for 3D Scene Reconstruction: Efficient NeRF Acceleration | Tianqi Ding et.al. | 2504.00950 | null |
2025-04-01 | Graph Classification and Radiomics Signature for Identification of Tuberculous Meningitis | Snigdha Agarwal et.al. | 2504.00943 | null |
2025-04-01 | Determining the 3D Dynamics of Solar Flare Magnetic Reconnection | Joel T. Dahlin et.al. | 2504.00913 | null |
2025-04-01 | DBF-UNet: A Two-Stage Framework for Carotid Artery Segmentation with Pseudo-Label Generation | Haoxuan Li et.al. | 2504.00908 | link |
2025-04-01 | Feature-Preserving Mesh Decimation for Normal Integration | Moritz Heep et.al. | 2504.00867 | null |
2025-04-03 | The Role of Magnetic Fields in the Formation of High-Mass Star-Forming Cores | Katerina Sophia Klos et.al. | 2504.00864 | null |
2025-04-01 | Zero-Shot 4D Lidar Panoptic Segmentation | Yushan Zhang et.al. | 2504.00848 | null |
2025-04-01 | Exact Diagonalization, Matrix Product States and Conformal Perturbation Theory Study of a 3D Ising Fuzzy Sphere Model | Andreas M. Läuchli et.al. | 2504.00842 | null |
2025-04-01 | DropGaussian: Structural Regularization for Sparse-view Gaussian Splatting | Hyunwoo Park et.al. | 2504.00773 | null |
2025-04-01 | UnIRe: Unsupervised Instance Decomposition for Dynamic Urban Scene Reconstruction | Yunxuan Mao et.al. | 2504.00763 | null |
2025-04-01 | CAPE: Connectivity-Aware Path Enforcement Loss for Curvilinear Structure Delineation | Elyar Esmaeilzadeh et.al. | 2504.00753 | null |
2025-04-01 | Monocular and Generalizable Gaussian Talking Head Animation | Shengjie Gong et.al. | 2504.00665 | null |
2025-04-01 | Parametric shape optimization for the convected Helmholtz equation with a generalized Myers boundary condition | Alami Nabil et.al. | 2504.00658 | null |
2025-04-01 | Cosmography with the Double Source Plane Strong Gravitational Lens AGEL150745+052256 | Nandini Sahu et.al. | 2504.00656 | null |
2025-04-01 | A posteriori error analysis of a robust virtual element method for stress-assisted diffusion problems | Franco Dassi et.al. | 2504.00648 | null |
2025-04-01 | Coca-Splat: Collaborative Optimization for Camera Parameters and 3D Gaussians | Jiamin Wu et.al. | 2504.00639 | null |
2025-04-01 | Study of Ultra-High-Energy Gamma-Ray Source 1LHAASO J0056+6346u and Its Possible Origins | LHAASO Collaboration et.al. | 2504.00601 | null |
2025-04-01 | AttentiveGRU: Recurrent Spatio-Temporal Modeling for Advanced Radar-Based BEV Object Detection | Loveneet Saini et.al. | 2504.00559 | null |
2025-04-01 | Learning high-accuracy numerical schemes for hyperbolic equations on coarse meshes | Jinrui Zhou et.al. | 2504.00462 | null |
2025-04-03 | Distilling Multi-view Diffusion Models into 3D Generators | Hao Qin et.al. | 2504.00457 | null |
2025-04-01 | Spatiotemporal Airy rings wavepackets | Xiaolin Su et.al. | 2504.00439 | null |
2025-04-01 | Data Synthesis with Diverse Styles for Face Recognition via 3DMM-Guided Diffusion | Yuxi Mi et.al. | 2504.00430 | null |
2025-04-01 | The Timing and Polarization of PSR J0002+6216 | Yu Wei et.al. | 2504.00426 | null |
2025-04-01 | Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration | Zilong Huang et.al. | 2504.00387 | null |
2025-04-01 | Intrinsic-feature-guided 3D Object Detection | Wanjing Zhang et.al. | 2504.00382 | null |
2025-04-01 | Traversing Dual Realities: Investigating Techniques for Transitioning 3D Objects between Desktop and Augmented Reality Environments | Tobias Rau et.al. | 2504.00371 | link |
2025-04-01 | Deconver: A Deconvolutional Network for Medical Image Segmentation | Pooya Ashtari et.al. | 2504.00302 | link |
2025-03-31 | Co-design Optimization of Moving Parts for Compliance and Collision Avoidance | Amir M. Mirzendehdel et.al. | 2504.00292 | null |
2025-03-31 | Cosmic-ray propagation features in gamma-ray measurements | Julia Becker Tjus et.al. | 2504.00290 | null |
2025-03-31 | NeRF-Based defect detection | Tianqi et.al. | 2504.00270 | null |
2025-03-31 | MultiMorph: On-demand Atlas Construction | S. Mazdak Abulnaga et.al. | 2504.00247 | null |
2025-03-31 | LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors | Han Zhou et.al. | 2504.00219 | null |
2025-03-31 | Fixed-Attention Mechanism for Deep-Learning-Assisted Design of High-Degree-of-Freedom 3D Metamaterials | Huanshu Zhang et.al. | 2504.00203 | null |
2025-03-31 | Leveraging Diffusion Model and Image Foundation Model for Improved Correspondence Matching in Coronary Angiography | Lin Zhao et.al. | 2504.00191 | null |
2025-03-31 | Comparison of Entropy Stable Collocation High-Order DG Methods for Compressible Turbulent Flows | Anna Schwarz et.al. | 2504.00173 | null |
2025-03-31 | SonarSplat: Novel View Synthesis of Imaging Sonar via Gaussian Splatting | Advaith V. Sethuraman et.al. | 2504.00159 | null |
2025-03-31 | Direction-Dependent Faraday Synthesis | Victor Gustafsson et.al. | 2504.00141 | null |
2025-03-31 | The Origin of the Cluster of Local Interstellar Clouds | Catherine Zucker et.al. | 2504.00093 | null |
2025-03-25 | LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration | Yuyao Zhang et.al. | 2504.00010 | link |
2025-03-31 | Easi3R: Estimating Disentangled Motion from DUSt3R Without Training | Xingyu Chen et.al. | 2503.24391 | link |
2025-03-31 | Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views | Chong Bao et.al. | 2503.24382 | null |
2025-03-31 | StochasticSplats: Stochastic Rasterization for Sorting-Free 3D Gaussian Splatting | Shakiba Kheradmand et.al. | 2503.24366 | null |
2025-03-31 | The edge-on disk Tau042021: icy grains at high altitudes and a wind containing astronomical PAHs | E. Dartois et.al. | 2503.24309 | null |
2025-03-31 | Point Tracking in Surgery–The 2024 Surgical Tattoos in Infrared (STIR) Challenge | Adam Schmidt et.al. | 2503.24306 | link |
2025-04-01 | Visual Acoustic Fields | Yuelei Li et.al. | 2503.24270 | null |
2025-03-31 | Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes | Daichi Otsuka et.al. | 2503.24229 | null |
2025-03-31 | DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting | Seungjun Lee et.al. | 2503.24210 | null |
2025-03-31 | Non-linear saturation of gravito-inertial modes excited by tidal resonances in binary neutron stars | Alexis Reboul-Salze et.al. | 2503.24154 | null |
2025-03-31 | Dust Concentration Via Coupled Vertical Settling and Radial Migration in Substructured Non-Ideal MHD Discs and Early Planet Formation | Chun-Yen Hsu et.al. | 2503.24142 | null |
2025-04-03 | IMPACT: A Generic Semantic Loss for Multimodal Medical Image Registration | Valentin Boussot et.al. | 2503.24121 | link |
2025-03-31 | 4D mmWave Radar in Adverse Environments for Autonomous Driving: A Survey | Xiangyuan Peng et.al. | 2503.24091 | null |
2025-04-07 | Controlled Latent Diffusion Models for 3D Porous Media Reconstruction | Danilo Naiff et.al. | 2503.24083 | link |
2025-03-31 | HACTS: a Human-As-Copilot Teleoperation System for Robot Learning | Zhiyuan Xu et.al. | 2503.24070 | null |
2025-03-31 | A low cost singular value decomposition based data assimilation technique for analysis of heterogeneous combustion data | Prajith Pillai et.al. | 2503.24064 | null |
2025-03-31 | Global Well-Posedness of the 3D Navier-Stokes Equations under Multi-Level Logarithmically Improved Criteria | Rishabh Mishra et.al. | 2503.24029 | null |
2025-04-01 | HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation | Boyuan Wang et.al. | 2503.24026 | null |
2025-03-31 | Learning 3D-Gaussian Simulators from RGB Videos | Mikel Zhobro et.al. | 2503.24009 | null |
2025-03-31 | PupiNet: Seamless OCT-OCTA Interconversion Through Wavelet-Driven and Multi-Scale Attention Mechanisms | Renzhi Tian et.al. | 2503.23933 | null |
2025-03-31 | GLane3D : Detecting Lanes with Graph of 3D Keypoints | Halil İbrahim Öztürk et.al. | 2503.23882 | null |
2025-03-31 | ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image | Tianyi Gong et.al. | 2503.23881 | null |
2025-03-31 | Explodability criteria for the neutrino-driven supernova mechanism | K. Maltsev et.al. | 2503.23856 | null |
2025-03-31 | Three-dimensional Optical Reconstruction of colloidal electrokinetics via multiplane imaging | Flip de Jong et.al. | 2503.23839 | null |
2025-03-31 | A PINN Methodology for Temperature Field Reconstruction in the PIV Measurement Plane: Case of Rayleigh-Bénard Convection | Marie-Christine Volk et.al. | 2503.23801 | null |
2025-04-01 | WaveFormer: A 3D Transformer with Wavelet-Driven Feature Representation for Efficient Medical Image Segmentation | Md Mahfuz Al Hasan et.al. | 2503.23764 | null |
2025-03-31 | Exploring Temporal Dynamics in Event-based Eye Tracker | Hongwei Ren et.al. | 2503.23725 | link |
2025-03-31 | From Geometry to Culture: An Iterative VLM Layout Framework for Placing Objects in Complex 3D Scene Contexts | Yuto Asano et.al. | 2503.23707 | null |
2025-03-31 | Paramagnetic half-moon shaped diffuse scattering arising from 3D magnetic frustration | Nelly Natsch et.al. | 2503.23704 | null |
2025-03-31 | 3D Dental Model Segmentation with Geometrical Boundary Preserving | Shufan Xi et.al. | 2503.23702 | null |
2025-03-31 | Learning Bijective Surface Parameterization for Inferring Signed Distance Functions from Sparse Point Clouds with Grid Deformation | Takeshi Noda et.al. | 2503.23670 | null |
2025-03-31 | LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds | Masahiko Tsuji et.al. | 2503.23664 | null |
2025-04-01 | JAX-BTE: A GPU-Accelerated Differentiable Solver for Phonon Boltzmann Transport Equations | Wenjie Shang et.al. | 2503.23657 | null |
2025-04-07 | Construction of Hyperchaotic Maps Based on 3D-CCC and its Applications in Image Encryption | Jilei Sun et.al. | 2503.23655 | null |
2025-04-01 | Introducing the Short-Time Fourier Kolmogorov Arnold Network: A Dynamic Graph CNN Approach for Tree Species Classification in 3D Point Clouds | Said Ohamouddou et.al. | 2503.23647 | link |
2025-03-31 | Uni-Render: A Unified Accelerator for Real-Time Rendering Across Diverse Neural Renderers | Chaojian Li et.al. | 2503.23644 | null |
2025-03-30 | Gaussian Blending Unit: An Edge GPU Plug-in for Real-Time Gaussian-Based Rendering in AR/VR | Zhifan Ye et.al. | 2503.23625 | null |
2025-03-30 | 3D mirror symmetry in positive characteristic | Shaoyun Bai et.al. | 2503.23590 | null |
2025-03-30 | A first-order DirAC-based parametric Ambisonic coder for immersive communications | Guillaume Fuchs et.al. | 2503.23586 | null |
2025-03-30 | Multiview Image-Based Localization | Cameron Fiore et.al. | 2503.23577 | null |
2025-04-08 | DNA and Human Language: Epigenetic Memory and Redundancy in Linear Sequence | Li Yang et.al. | 2503.23494 | null |
2025-03-30 | Chiral symmetry and magnetism in a 3D Kagome lattice: RPt $_2$ B (R = La and Nd) prototype crystals | C. E. Ardila-Gutiérrez et.al. | 2503.23479 | null |
2025-03-30 | Efficient Dynamic Attention 3D Convolution for Hyperspectral Image Classification | Guandong Li et.al. | 2503.23472 | null |
2025-03-30 | OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model | Xingcheng Zhou et.al. | 2503.23463 | link |
2025-03-30 | Visual Acuity Consistent Foveated Rendering towards Retinal Resolution | Zhi Zhang et.al. | 2503.23410 | null |
2025-03-30 | Proprioceptive multistable mechanical metamaterial via soft capacitive sensors | Hugo de Souza Oliveira et.al. | 2503.23389 | null |
2025-03-30 | Meta-Ori: monolithic meta-origami for nonlinear inflatable soft actuators | Hugo de Souza Oliveira et.al. | 2503.23375 | null |
2025-03-30 | Improving Neonatal Care: An Active Dry-Contact Electrode-based Continuous EEG Monitoring System with Seizure Detection | Nima L. Wickramasinghe et.al. | 2503.23338 | null |
2025-03-30 | Enhancing 3D Gaussian Splatting Compression via Spatial Condition-based Prediction | Jingui Ma et.al. | 2503.23337 | null |
2025-03-30 | HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation | Hongwei Zheng et.al. | 2503.23331 | null |
2025-03-30 | AI Agents in Engineering Design: A Multi-Agent Framework for Aesthetic and Aerodynamic Car Design | Mohamed Elrefaie et.al. | 2503.23315 | null |
2025-03-30 | ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning | Zhenyang Liu et.al. | 2503.23297 | null |
2025-03-29 | Breaking a superfluid harmonic dam: Observation and theory of rarefaction flow, Riemann invariants and sonic horizon dynamics | Shashwat Sharan et.al. | 2503.23246 | null |
2025-04-02 | Geometry in Style: 3D Stylization via Surface Normal Deformation | Nam Anh Dinh et.al. | 2503.23241 | null |
2025-03-29 | Incorporating GNSS Information with LIDAR-Inertial Odometry for Accurate Land-Vehicle Localization | Jintao Cheng et.al. | 2503.23199 | null |
2025-03-29 | NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations | Zhenyu Tang et.al. | 2503.23162 | null |
2025-03-29 | Anthropomorphic tissue-mimicking phantoms for oximetry validation in multispectral optical imaging | Kris Kristoffer Dreher et.al. | 2503.23161 | null |
2025-03-29 | Improved Motion Plane Adaptive 360-Degree Video Compression Using Affine Motion Models | Marina Ritthaler et.al. | 2503.23151 | null |
2025-04-06 | A low-cost four-component relativistic coupled cluster linear response theory based on perturbation sensitive natural spinors | Sudipta Chakraborty et.al. | 2503.23144 | null |
2025-04-02 | A point cloud reconstruction method based on uncertainty feature enhancement for aerodynamic shape optimization | Junlin Li et.al. | 2503.23082 | null |
2025-03-29 | Shape and Texture Recognition in Large Vision-Language Models | Sagi Eppel et.al. | 2503.23062 | null |
2025-03-29 | CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction | Yuanyuan Gao et.al. | 2503.23044 | null |
2025-03-29 | Empowering Large Language Models with 3D Situation Awareness | Zhihao Yuan et.al. | 2503.23024 | null |
2025-03-29 | MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs | Xianglong He et.al. | 2503.23022 | null |
2025-03-29 | MSNGO: multi-species protein function annotation based on 3D protein structure and network propagation | Beibei Wang et.al. | 2503.23014 | link |
2025-03-29 | FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction | Yunsong Wang et.al. | 2503.22986 | null |
2025-04-03 | From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D | Jiahui Zhang et.al. | 2503.22976 | null |
2025-04-03 | Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey | Haoyang Wang et.al. | 2503.22943 | null |
2025-03-29 | SR-LIO++: Efficient LiDAR-Inertial Odometry and Quantized Mapping with Sweep Reconstruction | Zikang Yuan et.al. | 2503.22926 | null |
2025-03-29 | LiDAR-based Quadrotor Autonomous Inspection System in Cluttered Environments | Wenyi Liu et.al. | 2503.22921 | null |
2025-03-28 | Development of a Miniaturized, Automated, and Cost-Effective Device for Enzyme-Linked Immunosorbent Assay | Majid Aalizadeh et.al. | 2503.22911 | null |
2025-04-01 | VizFlyt: Perception-centric Pedagogical Framework For Autonomous Aerial Robots | Kushagra Srivastava et.al. | 2503.22876 | link |
2025-04-05 | SIGHT: Single-Image Conditioned Generation of Hand Trajectories for Hand-Object Interaction | Alexey Gavryushin et.al. | 2503.22869 | null |
2025-03-28 | Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study | Soumitri Chattopadhyay et.al. | 2503.22862 | null |
2025-04-05 | Structural stability, elemental ordering, and transport properties of layered ScTaN2 | Baptiste Julien et.al. | 2503.22857 | null |
2025-03-28 | On the structure of open clusters: geometric vs geomantic | Lu Li et.al. | 2503.22800 | null |
2025-03-28 | Co-design of materials, structures and stimuli for magnetic soft robots with large deformation and dynamic contacts | Liwei Wang et.al. | 2503.22767 | null |
2025-03-28 | DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness | Ruining Li et.al. | 2503.22677 | null |
2025-03-28 | TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting | Boyang et.al. | 2503.22676 | null |
2025-03-28 | Revealing the loss mechanisms of a 3D superconducting microwave cavity for use in a dark matter search | J. C. Esmenda et.al. | 2503.22637 | null |
2025-03-28 | Lagrangian multiforms and dispersionless integrable systems | Evgeny V. Ferapontov et.al. | 2503.22615 | null |
2025-03-28 | Comment on “Solvent-Induced Negative Energetic Elasticity in a Lattice Polymer Chain’‘ | L. K. R. Duarte et.al. | 2503.22614 | null |
2025-03-28 | Clouds and Hazes in GJ 1214b’s Metal-Rich Atmosphere | Isaac Malsky et.al. | 2503.22608 | null |
2025-03-28 | On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations | Rajdeep Singh Hundal et.al. | 2503.22575 | null |
2025-04-02 | 3D Heterogeneous Integration of Silicon Nitride and Aluminum Nitride on Sapphire toward Ultra-wideband Photonics Integrated Circuits | Liang Zhang et.al. | 2503.22544 | null |
2025-03-28 | LIM: Large Interpolator Model for Dynamic Reconstruction | Remy Sabathier et.al. | 2503.22537 | null |
2025-03-28 | Terahertz frequency-domain 4x4 Mueller matrix ellipsometer instrument designed for high-frequency magnetic resonance measurements | Viktor Rindert et.al. | 2503.22500 | null |
2025-03-28 | SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations | Krispin Wandel et.al. | 2503.22462 | null |
2025-03-28 | A high order multigrid-preconditioned immersed interface solver for the Poisson equation with boundary and interface conditions | James Gabbard et.al. | 2503.22455 | null |
2025-03-28 | EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting | Xu Wang et.al. | 2503.22437 | link |
2025-03-28 | NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving | Fuhao Li et.al. | 2503.22436 | null |
2025-03-28 | Collapse and Collision Aware Grasping for Cluttered Shelf Picking | Abhinav Pathak et.al. | 2503.22427 | null |
2025-03-28 | Magnetic Resonance Particle Tracking | Mathieu Suter et.al. | 2503.22425 | null |
2025-03-28 | Light Storage in Light Cages: A Scalable Platform for Multiplexed Quantum Memories | Esteban Gómez-López et.al. | 2503.22423 | null |
2025-04-01 | Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis | Jiangyong Huang et.al. | 2503.22420 | link |
2025-03-28 | Volumetric Material Decomposition Using Spectral Diffusion Posterior Sampling with a Compressed Polychromatic Forward Model | Xiao Jiang et.al. | 2503.22392 | null |
2025-03-28 | Insights on the role of the covalent Ni-O bonds in LiNiO2 positive electrodes: A combined hard X-ray spectroscopy study | Jazer Jose H. Togonon et.al. | 2503.22383 | null |
2025-03-28 | GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion | Li-Heng Chen et.al. | 2503.22349 | null |
2025-03-28 | AH-GS: Augmented 3D Gaussian Splatting for High-Frequency Detail Representation | Chenyang Xu et.al. | 2503.22324 | null |
2025-03-28 | Robust simultaneous UWB-anchor calibration and robot localization for emergency situations | Xinghua Liu et.al. | 2503.22272 | link |
2025-03-28 | Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion | Songsong Yu et.al. | 2503.22262 | null |
2025-03-28 | Data-driven modeling of fluid flow around rotating structures with graph neural networks | Rui Gao et.al. | 2503.22252 | null |
2025-03-28 | FLAM: Foundation Model-Based Body Stabilization for Humanoid Locomotion and Manipulation | Xianqi Zhang et.al. | 2503.22249 | null |
2025-03-28 | Pneumatic Multi-mode Silicone Actuator with Pressure, Vibration, and Cold Thermal Feedback | Mohammad Shadman Hashem et.al. | 2503.22247 | null |
2025-03-31 | Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging | Chongjie Ye et.al. | 2503.22236 | null |
2025-04-05 | CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving | Yishen Ji et.al. | 2503.22231 | null |
2025-03-28 | Follow Your Motion: A Generic Temporal Consistency Portrait Editing Framework with Trajectory Guidance | Haijie Yang et.al. | 2503.22225 | null |
2025-03-28 | ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting | Wenjie Liu et.al. | 2503.22218 | null |
2025-03-28 | Segment then Splat: A Unified Approach for 3D Open-Vocabulary Segmentation based on Gaussian Splatting | Yiren Lu et.al. | 2503.22204 | null |
2025-03-28 | ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation | Yunhong Min et.al. | 2503.22194 | null |
2025-03-28 | 3D Acetabular Surface Reconstruction from 2D Pre-operative X-ray Images using SRVF Elastic Registration and Deformation Graph | Shuai Zhang et.al. | 2503.22177 | null |
2025-03-31 | Disentangled 4D Gaussian Splatting: Towards Faster and More Efficient Dynamic Scene Rendering | Hao Feng et.al. | 2503.22159 | null |
2025-03-28 | Permutation-Invariant and Orientation-Aware Dataset Distillation for 3D Point Clouds | Jae-Young Yim et.al. | 2503.22154 | null |
2025-03-28 | Time-resolved dynamic CBCT reconstruction using prior-model-free spatiotemporal Gaussian representation (PMF-STGR) | Jiacheng Xie et.al. | 2503.22139 | null |
2025-03-28 | Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy Prediction | Seokha Moon et.al. | 2503.22087 | link |
2025-03-27 | Improved Tomographic Reconstruction of 3D Global Coronal Density from STEREO/COR1 Observations | Tongjiang Wang et.al. | 2503.22041 | null |
2025-03-27 | Rolled Gaussian process models for curves on manifolds | Simon Preston et.al. | 2503.21980 | null |
2025-03-27 | NeRF-based Point Cloud Reconstruction using a Stationary Camera for Agricultural Applications | Kibon Ku et.al. | 2503.21958 | null |
2025-03-27 | Dust-void evolution driven by turbulent dust flux can induce runaway migration of Earth-mass planets | R. O. Chametla et.al. | 2503.21922 | null |
2025-03-27 | StreetScape: Gamified Tactile Interactions for Collaborative Learning and Play | Areen Khalaila et.al. | 2503.21897 | null |
2025-03-27 | Refined Geometry-guided Head Avatar Reconstruction from Monocular RGB Video | Pilseo Park et.al. | 2503.21886 | null |
2025-03-27 | Decomposition and (Non-Invertible) (-1)-Form Symmetries from the Symmetry Topological Field Theory | Ling Lin et.al. | 2503.21862 | null |
2025-03-27 | Impact of Oxygen on DNA Damage Distribution in 3D Genome and Its Correlation to Oxygen Enhancement Ratio under High LET Irradiation | Ankang Hu et.al. | 2503.21837 | null |
2025-03-26 | Shape Generation via Weight Space Learning | Maximilian Plattner et.al. | 2503.21830 | null |
2025-03-26 | Learning from spatially inhomogenous data: resolution-adaptive convolutions for multiple sclerosis lesion segmentation | Ivan Diaz et.al. | 2503.21829 | null |
2025-03-27 | Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying | Hairong Yin et.al. | 2503.21767 | null |
2025-03-27 | Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence | Haolin Liu et.al. | 2503.21766 | null |
2025-03-27 | Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video | David Yifan Yao et.al. | 2503.21761 | link |
2025-03-27 | Reconstructing Humans with a Biomechanically Accurate Skeleton | Yan Xia et.al. | 2503.21751 | null |
2025-03-27 | 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models | Yuhan Zhang et.al. | 2503.21745 | null |
2025-03-27 | SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling | Xianglong He et.al. | 2503.21732 | null |
2025-03-27 | OccRobNet : Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation | Mallika Garg et.al. | 2503.21723 | null |
2025-03-27 | Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data | Zhiyuan Ma et.al. | 2503.21694 | link |
2025-03-27 | A Comprehensive Benchmark for RNA 3D Structure-Function Modeling | Luis Wyss et.al. | 2503.21681 | link |
2025-03-27 | Electronic structure dimensionality of the quantum-critical ferromagnet YbNi $_4$P$_2$ | J. Dai et.al. | 2503.21662 | null |
2025-03-27 | Numerical proof-of-concept of a photon, proton, and positron laser-driven source with nanostructured targets | Marta Galbiati et.al. | 2503.21630 | null |
2025-03-27 | AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion | Liuyue Xie et.al. | 2503.21581 | null |
2025-03-27 | uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images | Jonathan Lee et.al. | 2503.21562 | link |
2025-03-27 | Statistical learning of structure-property relationships for transport in porous media, using hybrid AI modeling | Somayeh Hosseinhashemi et.al. | 2503.21560 | null |
2025-03-27 | ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo | Yuxi Hu et.al. | 2503.21525 | null |
2025-03-27 | F-INR: Functional Tensor Decomposition for Implicit Neural Representations | Sai Karthikeya Vemuri et.al. | 2503.21507 | null |
2025-03-27 | 3D MHD simulations of runaway pulsars in core collapse supernova remnants | D. M. A. Meyer et.al. | 2503.21492 | null |
2025-03-27 | Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving | Lucas Nunes et.al. | 2503.21449 | link |
2025-03-27 | STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM | Yongxu Wang et.al. | 2503.21425 | null |
2025-03-28 | LandMarkSystem Technical Report | Zhenxiang Ma et.al. | 2503.21364 | link |
2025-03-27 | Tailoring non-collinear magnetism and 3d $-$ 4f exchange interactions in RVO$_3$ epitaxial thin films | O. Copie et.al. | 2503.21327 | null |
2025-03-27 | Surface guided analysis of breast changes during post-operative radiotherapy by using a functional map framework | Pierre Galmiche et.al. | 2503.21317 | null |
2025-03-27 | HORT: Monocular Hand-held Objects Reconstruction with Transformers | Zerui Chen et.al. | 2503.21313 | null |
2025-03-27 | ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate | Ming Yan et.al. | 2503.21268 | null |
2025-03-27 | Delayed dynamic triggering and enhanced high-frequency seismic radiation due to brittle rock damage in 3D multi-fault rupture simulations | Zihua Niu et.al. | 2503.21260 | null |
2025-03-27 | Global Stable Solutions to the Free Boundary Allen–Cahn and Bernoulli Problems in 3D are One-Dimensional | Hardy Chan et.al. | 2503.21245 | null |
2025-03-27 | The Promise and Pitfalls of WebAssembly: Perspectives from the Industry | Ningyu He et.al. | 2503.21240 | null |
2025-03-27 | Frequency-Aware Gaussian Splatting Decomposition | Yishai Lavi et.al. | 2503.21226 | null |
2025-03-29 | GenFusion: Closing the Loop between Reconstruction and Generation via Videos | Sibo Wu et.al. | 2503.21219 | null |
2025-03-27 | VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models via Voxel Representation | Alan Dao et.al. | 2503.21214 | null |
2025-03-27 | Percolation of both signs in a triangular-type 3D Ising model above $T_c$ | Jianping Jiang et.al. | 2503.21147 | null |
2025-03-27 | De Novo Functional Protein Sequence Generation: Overcoming Data Scarcity through Regeneration and Large Models | Chenyu Ren et.al. | 2503.21123 | null |
2025-03-27 | One Snapshot is All You Need: A Generalized Method for mmWave Signal Generation | Teng Huang et.al. | 2503.21122 | null |
2025-03-27 | Learning Class Prototypes for Unified Sparse Supervised 3D Object Detection | Yun Zhu et.al. | 2503.21099 | link |
2025-03-27 | Can Video Diffusion Model Reconstruct 4D Geometry? | Jinjie Mai et.al. | 2503.21082 | null |
2025-03-26 | A dynamic reconstruction and motion estimation framework for cardiorespiratory motion-resolved real-time volumetric MR imaging (DREME-MR) | Hua-Chieh Shao et.al. | 2503.21014 | null |
2025-03-26 | Pellet-based 3D Printing of Soft Thermoplastic Elastomeric Membranes for Soft Robotic Applications | Nick Willemstein et.al. | 2503.20957 | null |
2025-03-26 | LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos | Daniel Etaat et.al. | 2503.20936 | null |
2025-03-26 | A Study of Perceived Safety for Soft Robotics in Caregiving Tasks | Cosima du Pasquier et.al. | 2503.20916 | null |
2025-03-26 | TransDiffSBDD: Causality-Aware Multi-Modal Structure-Based Drug Design | Xiuyuan Hu et.al. | 2503.20913 | null |
2025-03-26 | Speculations on higher Fukaya categories | James Pascaleff et.al. | 2503.20906 | null |
2025-03-26 | 3D Simulations Demonstrate Propagating Thermohaline Convection for Polluted White Dwarfs | Imogen G. Cresswell et.al. | 2503.20885 | null |
2025-03-26 | Synthetic Video Enhances Physical Fidelity in Video Synthesis | Qi Zhao et.al. | 2503.20822 | null |
2025-03-25 | Reflections on Diversity: A Real-time Virtual Mirror for Inclusive 3D Face Transformations | Paraskevi Valergaki et.al. | 2503.20819 | null |
2025-03-26 | FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks | Jinwei Li et.al. | 2503.20784 | link |
2025-03-26 | PGC: Physics-Based Gaussian Cloth from a Single Pose | Michelle Guo et.al. | 2503.20779 | null |
2025-03-26 | Attractors for the Navier–Stokes–Voight equations and their dimension | Alexei Ilyin et.al. | 2503.20760 | null |
2025-03-26 | PhysGen3D: Crafting a Miniature Interactive World from a Single Image | Boyuan Chen et.al. | 2503.20746 | null |
2025-03-26 | A weakly-supervised deep learning model for fast localisation and delineation of the skeleton, internal organs, and spinal canal on Whole-Body Diffusion-Weighted MRI (WB-DWI) | A. Candito et.al. | 2503.20722 | null |
2025-03-26 | Analyzing Iron Dust Bunsen Flames using Numerical Simulations | Thijs Hazenberg et.al. | 2503.20692 | null |
2025-03-27 | Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound | Yuhao Huang et.al. | 2503.20685 | null |
2025-03-26 | GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection | Xingyu Peng et.al. | 2503.20682 | null |
2025-03-26 | ARMO: Autoregressive Rigging for Multi-Category Objects | Mingze Sun et.al. | 2503.20663 | null |
2025-03-27 | Imitating Radiological Scrolling: A Global-Local Attention Model for 3D Chest CT Volumes Multi-Label Anomaly Classification | Theo Di Piazza et.al. | 2503.20652 | null |
2025-03-26 | The planetary nebula NGC 3132 revisited: high definition 3D photoionization model | H. Monteiro et.al. | 2503.20640 | null |
2025-03-26 | SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments | Tanmoy Dam et.al. | 2503.20614 | link |
2025-03-26 | Exploring Robustness of Cortical Morphometry in the presence of white matter lesions, using Diffusion Models for Lesion Filling | Vinzenz Uhr et.al. | 2503.20571 | null |
2025-03-26 | Small-scale energetic phenomena in Hε: Ellerman bombs, UV bursts, and small flares | K. Krikova et.al. | 2503.20535 | null |
2025-03-27 | MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation | Jinnan Chen et.al. | 2503.20519 | null |
2025-03-26 | Common envelopes in massive stars III: The obstructive role of radiation transport in envelope ejection | Mike Y. M. Lau et.al. | 2503.20506 | null |
2025-03-26 | Hybridization of lattice and charge order excitations in a superconducting cuprate | S. M. Souliou et.al. | 2503.20503 | null |
2025-03-26 | Large-Scale, Long-Time Atomistic Simulations of Proton Transport in Polymer Electrolyte Membranes Using a Neural Network Interatomic Potential | Yuta Yoshimoto et.al. | 2503.20412 | null |
2025-03-26 | Phase-resolved modelling of wave transformation in the surf zone over idealised rough bottoms | Emile Guelard Ancilotti et.al. | 2503.20374 | null |
2025-03-26 | Phase-resolved Modeling Of Surf Zone Wave Transformation Over Idealized Rough Bottoms | Emile Guélard Ancilotti et.al. | 2503.20373 | null |
2025-03-27 | Recovering Dynamic 3D Sketches from Videos | Jaeah Lee et.al. | 2503.20321 | null |
2025-03-31 | Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics | Lee Chae-Yeon et.al. | 2503.20308 | null |
2025-03-26 | 3D Convolutional Neural Networks for Improved Detection of Intracranial bleeding in CT Imaging | Bargava Subramanian et.al. | 2503.20306 | null |
2025-03-26 | Merits of Serving UAVs via Terrestrial Networks: A Vertical Antenna Radiation Study | Nesrine Cherif et.al. | 2503.20296 | null |
2025-03-26 | CryoSAMU: Enhancing 3D Cryo-EM Density Maps of Protein Structures at Intermediate Resolution with Structure-Aware Multimodal U-Nets | Chenwei Zhang et.al. | 2503.20291 | link |
2025-03-26 | Mamba-3D as Masked Autoencoders for Accurate and Data-Efficient Analysis of Medical Ultrasound Videos | Jiaheng Zhou et.al. | 2503.20258 | link |
2025-03-27 | Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection | Ahyun Seo et.al. | 2503.20235 | null |
2025-03-26 | TC-GS: Tri-plane based compression for 3D Gaussian Splatting | Taorui Wang et.al. | 2503.20221 | link |
2025-03-26 | DINeMo: Learning Neural Mesh Models with no 3D Annotations | Weijie Guo et.al. | 2503.20220 | null |
2025-03-26 | EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis | Sheng Miao et.al. | 2503.20168 | null |
2025-03-25 | Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors | Yuke Lou et.al. | 2503.20118 | null |
2025-03-25 | Singular SPDEs with the Cauchy-Riemann operator on a torus | Zdzisław Brzeźniak et.al. | 2503.20075 | null |
2025-03-25 | Pressure tuning of Kitaev spin liquid candidate Na $_3$Co$_2$SbO$_6$ | E. H. T. Poldi et.al. | 2503.20064 | null |
2025-03-25 | Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image Analysis | Yu Xin et.al. | 2503.20047 | link |
2025-03-25 | Toward a Cognitive Data Model: Exploring a Mind-Inspired Approach to Database Design | Dhammika Pieris et.al. | 2503.20041 | null |
2025-03-25 | Gemini Robotics: Bringing AI into the Physical World | Gemini Robotics Team et.al. | 2503.20020 | null |
2025-03-25 | Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception | Luke Chen et.al. | 2503.20011 | null |
2025-03-25 | Hybrid Magnetically and Electrically Powered Metallo-Dielectric Janus Microrobots: Enhanced Motion Control and Operation Beyond Planar Limits | Ido Rachbuch et.al. | 2503.19984 | null |
2025-03-25 | Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields | Navami Kairanda et.al. | 2503.19976 | null |
2025-03-25 | Orthosymplectic Quotient Quiver Subtraction II: Framed Quivers | Sam Bennett et.al. | 2503.19954 | null |
2025-03-25 | Global Well-Posedness for the 3D Navier-Stokes Equations under Logarithmically Improved Criteria: Connections to Turbulence Theory | Rishabh Mishra et.al. | 2503.19944 | null |
2025-03-25 | Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models | Sangwon Beak et.al. | 2503.19914 | null |
2025-03-25 | PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model | Mingju Gao et.al. | 2503.19913 | null |
2025-03-25 | SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining | Xiang Xu et.al. | 2503.19912 | link |
2025-03-25 | Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better | Zihang Lai et.al. | 2503.19904 | null |
2025-03-25 | Visuo-Tactile Object Pose Estimation for a Multi-Finger Robot Hand with Low-Resolution In-Hand Tactile Sensing | Lukas Mack et.al. | 2503.19893 | null |
2025-03-25 | A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design | Jie Tian et.al. | 2503.19889 | null |
2025-03-25 | AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers | Jiazhi Guan et.al. | 2503.19824 | null |
2025-03-25 | PAVE: Patching and Adapting Video Large Language Models | Zhuoming Liu et.al. | 2503.19794 | link |
2025-03-26 | In the Blink of an Eye: Instant Game Map Editing using a Generative-AI Smart Brush | Vitaly Gnatyuk et.al. | 2503.19793 | null |
2025-03-31 | Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion | Konyul Park et.al. | 2503.19776 | null |
2025-03-25 | OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations | Christina Kassab et.al. | 2503.19764 | null |
2025-03-25 | Magic teleportation with generalized lattice surgery | Yifei Wang et.al. | 2503.19758 | null |
2025-03-26 | A Survey on Event-driven 3D Reconstruction: Development under Different Categories | Chuanzhi Xu et.al. | 2503.19753 | null |
2025-03-25 | Stellar-wind feedback and magnetic fields around young compact star clusters: 3D MHD simulations | Lucia Härer et.al. | 2503.19745 | null |
2025-03-25 | GRN+: A Simplified Generative Reinforcement Network for Tissue Layer Analysis in 3D Ultrasound Images for Chronic Low-back Pain | Zixue Zeng et.al. | 2503.19736 | null |
2025-03-25 | Decoupled Dynamics Framework with Neural Fields for 3D Spatio-temporal Prediction of Vehicle Collisions | Sanghyuk Kim et.al. | 2503.19712 | null |
2025-03-25 | Data-efficient rapid prediction of urban airflow and temperature fields for complex building geometries | Shaoxiang Qin et.al. | 2503.19708 | null |
2025-03-25 | MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities | Federico Lincetto et.al. | 2503.19673 | null |
2025-03-25 | Homogenized harmonic balance finite element method for nonlinear eddy current simulations of fast corrector magnets | Jan-Magnus Christmann et.al. | 2503.19657 | null |
2025-03-25 | SACB-Net: Spatial-awareness Convolutions for Medical Image Registration | Xinxing Cheng et.al. | 2503.19592 | link |
2025-03-25 | Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion | Haim Sawdayee et.al. | 2503.19557 | null |
2025-03-25 | RoboFlamingo-Plus: Fusion of Depth and RGB Perception with Vision-Language Models for Enhanced Robotic Manipulation | Sheng Wang et.al. | 2503.19510 | null |
2025-03-25 | The metal-poorest tail of the Galactic halo: hypothesis on its origin from precise spectral analysis | Riano E. Giribaldi et.al. | 2503.19472 | null |
2025-03-28 | GaussianUDF: Inferring Unsigned Distance Functions through 3D Gaussian Splatting | Shujuan Li et.al. | 2503.19458 | null |
2025-03-25 | SparseGS-W: Sparse-View 3D Gaussian Splatting in the Wild with Generative Priors | Yiqing Li et.al. | 2503.19452 | null |
2025-03-26 | COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting | Jiaxin Zhang et.al. | 2503.19443 | link |
2025-03-25 | Advancing atom probe tomography capabilities to understand bone microstructures at the near-atomic scale | Tim M. Schwarz et.al. | 2503.19421 | null |
2025-03-25 | Multi-modal 3D Pose and Shape Estimation with Computed Tomography | Mingxiao Tu et.al. | 2503.19405 | null |
2025-03-25 | Positronium formation and threshold behavior in positron-sodium collisions at low energies | Ning-Ning Gao et.al. | 2503.19400 | null |
2025-03-25 | DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image | Hyeongjin Nam et.al. | 2503.19373 | null |
2025-03-25 | Improved Approximation Algorithms for Three-Dimensional Knapsack | Klaus Jansen et.al. | 2503.19365 | null |
2025-03-26 | ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models | Dohwan Ko et.al. | 2503.19355 | null |
2025-03-25 | MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection | Jee Won Lee et.al. | 2503.19330 | null |
2025-03-25 | A Comprehensive Analysis of Mamba for 3D Volumetric Medical Image Segmentation | Chaohan Wang et.al. | 2503.19308 | null |
2025-03-25 | Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation | Zhuoran Zhao et.al. | 2503.19307 | null |
2025-03-25 | UniMoMo: Unified Generative Modeling of 3D Molecules for De Novo Binder Design | Xiangzhe Kong et.al. | 2503.19300 | null |
2025-03-25 | Growing 3D clouds from 2D maps via full spherization | Xunchuan Liu et.al. | 2503.19259 | null |
2025-03-25 | Limited-angle x-ray nano-tomography with machine-learning enabled iterative reconstruction engine | Chonghang Zhao et.al. | 2503.19248 | link |
2025-03-25 | HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting | Xinpeng Liu et.al. | 2503.19232 | link |
2025-03-24 | FRESA:Feedforward Reconstruction of Personalized Skinned Avatars from Few Images | Rong Wang et.al. | 2503.19207 | link |
2025-03-24 | Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces | Chenyangguang Zhang et.al. | 2503.19199 | null |
2025-03-24 | FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing | Yufan Ren et.al. | 2503.19191 | null |
2025-03-24 | HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models | Mingzhen Huang et.al. | 2503.19157 | null |
2025-03-24 | On the constitutive behavior of linear viscoelastic solids under the plane stress condition | Bojan B. Guzina et.al. | 2503.19137 | null |
2025-03-24 | Propagation of controlled frontward impulses through standing crowds | Sina Feldmann et.al. | 2503.19110 | null |
2025-03-24 | Forward propagation of a push through a row of people | Sina Feldmann et.al. | 2503.19104 | null |
2025-03-24 | 3D Structural Phenotype of the Optic Nerve Head at the Intersection of Glaucoma and Myopia – A Key to Improving Glaucoma Diagnosis in Myopic Populations | Swati Sharma et.al. | 2503.19083 | null |
2025-03-24 | Leaky Dust Traps in Planet-Embedded Protoplanetary Disks | Pinghui Huang et.al. | 2503.19026 | null |
2025-03-24 | RomanTex: Decoupling 3D-aware Rotary Positional Embedded Multi-Attention Network for Texture Synthesis | Yifei Feng et.al. | 2503.19011 | null |
2025-03-24 | Foundation Model for Whole-Heart Segmentation: Leveraging Student-Teacher Learning in Multi-Modal Medical Imaging | Abdul Qayyum et.al. | 2503.19005 | null |
2025-03-23 | Generative Data Imputation for Sparse Learner Performance Data Using Generative Adversarial Imputation Networks | Liang Zhang et.al. | 2503.18982 | null |
2025-03-22 | Topological effect on order-disorder transitions in U(1) sigma models | Ryuichi Shindou et.al. | 2503.18969 | null |
2025-03-21 | MedAgent-Pro: Towards Multi-modal Evidence-based Medical Diagnosis via Reasoning Agentic Workflow | Ziyue Wang et.al. | 2503.18968 | null |
2025-03-24 | Target-Aware Video Diffusion Models | Taeksoo Kim et.al. | 2503.18950 | null |
2025-03-24 | DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation | Karim Abou Zeid et.al. | 2503.18944 | link |
2025-03-24 | Online 3D Scene Reconstruction Using Neural Object Priors | Thomas Chabal et.al. | 2503.18897 | null |
2025-03-24 | 3DSwapping: Texture Swapping For 3D Object From Single Reference Image | Xiao Cao et.al. | 2503.18853 | null |
2025-03-24 | NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting | Yulong Zheng et.al. | 2503.18794 | null |
2025-03-27 | AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning | Alan Dao et.al. | 2503.18769 | null |
2025-03-24 | Large deformation and collapse analysis of re-entrant auxetic and hexagonal honeycomb lattice structures subjected to tension and compression | Sima Farshbaf et.al. | 2503.18736 | null |
2025-03-24 | FG $^2$ : Fine-Grained Cross-View Localization by Fine-Grained Feature Matching | Zimin Xia et.al. | 2503.18725 | link |
2025-03-24 | GS-Marker: Generalizable and Robust Watermarking for 3D Gaussian Splatting | Lijiang Li et.al. | 2503.18718 | null |
2025-03-24 | Maximum Bound Principle and Bound Preserving ETD schemes for a Phase-Field Model of Tumor Growth with Extracellular Matrix Degradation | Qiumei Huang et.al. | 2503.18699 | null |
2025-03-24 | Convergence study of ambipolar diffusion in realistic simulations of magneto-convection | E. Khomenko et.al. | 2503.18686 | null |
2025-03-24 | Hardware-Rasterized Ray-Based Gaussian Splatting | Samuel Rota Bulò et.al. | 2503.18682 | null |
2025-03-25 | Any6D: Model-free 6D Pose Estimation of Novel Objects | Taeyeop Lee et.al. | 2503.18673 | null |
2025-03-24 | Structure-Aware Correspondence Learning for Relative Pose Estimation | Yihan Chen et.al. | 2503.18671 | null |
2025-03-24 | LLGS: Unsupervised Gaussian Splatting for Image Enhancement and Reconstruction in Pure Dark Environment | Haoran Wang et.al. | 2503.18640 | null |
2025-03-24 | Reading Decisions from Gaze Direction during Graphics Turing Test of Gait Animation | Benjamin Knopp et.al. | 2503.18619 | null |
2025-03-24 | Double radio relics and radio halo in the high redshift galaxy cluster El Gordo with the Upgraded GMRT | R. Kale et.al. | 2503.18613 | null |
2025-03-24 | Electronic structure of CeCo $_{1-x}$Fe$_x$Ge$_3$ studied by X-ray photoelectron spectroscopy and first-principles calculations | P. Skokowski et.al. | 2503.18598 | null |
2025-03-24 | RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation | Xiaolong Yin et.al. | 2503.18549 | null |
2025-03-25 | AIM2PC: Aerial Image to 3D Building Point Cloud Reconstruction | Soulaimene Turki et.al. | 2503.18527 | null |
2025-03-24 | P3Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction | Yufeng Zhong et.al. | 2503.18525 | null |
2025-03-25 | LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene | Xiaoyu Zhang et.al. | 2503.18513 | null |
2025-03-25 | Global-Local Tree Search in VLMs for 3D Indoor Scene Generation | Wei Deng et.al. | 2503.18476 | link |
2025-03-24 | MetaSpatial: Reinforcing 3D Spatial Reasoning in VLMs for the Metaverse | Zhenyu Pan et.al. | 2503.18470 | link |
2025-03-24 | MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-Processing | Lingting Zhu et.al. | 2503.18461 | null |
2025-03-25 | StableGS: A Floater-Free Framework for 3D Gaussian Splatting | Luchao Wang et.al. | 2503.18458 | null |
2025-03-24 | ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation | Guosheng Zhao et.al. | 2503.18438 | null |
2025-03-24 | 4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video | Qiang Hu et.al. | 2503.18421 | null |
2025-03-26 | DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds | Youyu Chen et.al. | 2503.18402 | null |
2025-03-24 | DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation | Raquel Vidaurre et.al. | 2503.18370 | null |
2025-03-24 | MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning | Xu Han et.al. | 2503.18368 | link |
2025-03-30 | MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction | Wenyuan Zhang et.al. | 2503.18363 | null |
2025-03-24 | Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics | Zekai Deng et.al. | 2503.18349 | null |
2025-03-24 | PS-EIP: Robust Photometric Stereo Based on Event Interval Profile | Kazuma Kitazawa et.al. | 2503.18341 | null |
2025-03-24 | LAMOST YSOs. I. Spectroscopically identifying and characterizing M-type young stellar objects | Xiang-Song Fang et.al. | 2503.18318 | null |
2025-03-24 | GI-SLAM: Gaussian-Inertial SLAM | Xulang Liu et.al. | 2503.18275 | null |
2025-03-24 | Surface-Aware Distilled 3D Semantic Features | Lukas Uzolas et.al. | 2503.18254 | null |
2025-03-24 | ZECO: ZeroFusion Guided 3D MRI Conditional Generation | Feiran Wang et.al. | 2503.18246 | link |
2025-03-23 | A Tutorial on Six-Dimensional Movable Antenna Enhanced Wireless Networks: Synergizing Positionable and Rotatable Antennas | Xiaodan Shao et.al. | 2503.18240 | null |
2025-03-25 | SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity Prediction | Zhengyuan Li et.al. | 2503.18211 | link |
2025-03-23 | A Simple Weak Galerkin Finite Element Method for a Class of Fourth-Order Problems in Fluorescence Tomography | Chunmei Wang et.al. | 2503.18200 | null |
2025-03-23 | SNRAware: Improved Deep Learning MRI Denoising with SNR Unit Training and G-factor Map Augmentation | Hui Xue et.al. | 2503.18162 | null |
2025-03-23 | DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation | Peng Chen et.al. | 2503.18159 | link |
2025-03-25 | Decorum: A Language-Based Approach For Style-Conditioned Synthesis of Indoor 3D Scenes | Kelly O. Marshall et.al. | 2503.18155 | null |
2025-03-23 | AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs | Diwei Wang et.al. | 2503.18141 | null |
2025-03-23 | MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation | Jiaxin Huang et.al. | 2503.18135 | null |
2025-03-23 | Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving | Junhao Ge et.al. | 2503.18108 | link |
2025-03-23 | PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding | Hongjia Zhai et.al. | 2503.18107 | null |
2025-03-23 | M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous Driving | Xuesong Chen et.al. | 2503.18100 | link |
2025-03-23 | Unified Geometry and Color Compression Framework for Point Clouds via Generative Diffusion Priors | Tianxin Huang et.al. | 2503.18083 | null |
2025-03-23 | GenMetaLoc: Learning to Learn Environment-Aware Fingerprint Generation for Sample Efficient Wireless Localization | Jun Gao et.al. | 2503.18078 | null |
2025-03-23 | PanopticSplatting: End-to-End Panoptic Gaussian Splatting | Yuxuan Xie et.al. | 2503.18073 | null |
2025-03-23 | SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining | Yue Li et.al. | 2503.18052 | link |
2025-03-23 | Multiple-Particle Autofocusing Algorithm Using Axial Resolution and Morphological Analyses Based on Digital Holography | Wei-Na Li et.al. | 2503.18038 | null |
2025-03-23 | Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook | Xu Zheng et.al. | 2503.18016 | null |
2025-03-23 | Large $N$ Wess-Zumino model at finite temperature and large chemical potential in $3d$ | Srijan Kumar et.al. | 2503.17999 | null |
2025-03-23 | Real-time Global Illumination for Dynamic 3D Gaussian Scenes | Chenxiao Hu et.al. | 2503.17897 | null |
2025-03-22 | NVBleed: Covert and Side-Channel Attacks on NVIDIA Multi-GPU Interconnect | Yicheng Zhang et.al. | 2503.17847 | null |
2025-03-22 | 4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding | Wenxuan Zhu et.al. | 2503.17827 | link |
2025-03-22 | DVG-Diffusion: Dual-View Guided Diffusion Model for CT Reconstruction from X-Rays | Xing Xie et.al. | 2503.17804 | null |
2025-03-22 | GaussianFocus: Constrained Attention Focus for 3D Gaussian Splatting | Zexu Huang et.al. | 2503.17798 | null |
2025-03-22 | MEPNet: Medical Entity-balanced Prompting Network for Brain CT Report Generation | Xiaodan Zhang et.al. | 2503.17784 | link |
2025-03-22 | Neutron particle transport 3D method of characteristic Multi GPU platform Parallel Computing | Faguo Zhou et.al. | 2503.17743 | null |
2025-03-22 | GS-LTS: 3D Gaussian Splatting-Based Adaptive Modeling for Long-Term Service Robots | Bin Fu et.al. | 2503.17733 | null |
2025-03-22 | Volumetric density measurement in buoyant plumes using Tomographic Background Oriented Schlieren (TBOS) | Javed Mohd et.al. | 2503.17705 | null |
2025-03-22 | MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability | Paul Hill et.al. | 2503.17700 | null |
2025-03-22 | 3D Modeling: Camera Movement Estimation and path Correction for SFM Model using the Combination of Modified A-SIFT and Stereo System | Usha Kumari et.al. | 2503.17668 | null |
2025-03-22 | Multi-Modality Representation Learning for Antibody-Antigen Interactions Prediction | Peijin Guo et.al. | 2503.17666 | link |
2025-03-21 | Is there anything left? Measuring semantic residuals of objects removed from 3D Gaussian Splatting | Simona Kocour et.al. | 2503.17574 | link |
2025-03-21 | Aligning Thermal and Current Quenches with a High Density Low-Z Injection | Jason Hamilton et.al. | 2503.17557 | null |
2025-03-21 | PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning | Yan Zhang et.al. | 2503.17544 | null |
2025-03-21 | MM-UNet: Meta Mamba UNet for Medical Image Segmentation | Bin Xie et.al. | 2503.17540 | null |
2025-03-21 | The Impedance Space: A Look at Mechanical Impedance Ellipses in 3D | Leonardo F. Dos Santos et.al. | 2503.17533 | null |
2025-03-21 | NAVIUS: Navigated Augmented Reality Visualization for Ureteroscopic Surgery | Ayberk Acar et.al. | 2503.17511 | null |
2025-03-25 | ProtoGS: Efficient and High-Quality Rendering with 3D Gaussian Prototypes | Zhengqing Gao et.al. | 2503.17486 | null |
2025-03-21 | Anatomically Guided Motion Correction for Placental IVIM Parameter Estimation with Accelerated Sampling Method | Mbaimou Auxence Ngremmadji et.al. | 2503.17468 | null |
2025-03-21 | Stereological 3D modeling of nano-scale catalyst particles using TEM projections | Lukas Fuchs et.al. | 2503.17437 | null |
2025-03-21 | 3D variational autoencoder for fingerprinting microstructure volume elements | Michael D. White et.al. | 2503.17427 | null |
2025-03-20 | IRef-VLA: A Benchmark for Interactive Referential Grounding with Imperfect Language in 3D Scenes | Haochen Zhang et.al. | 2503.17406 | link |
2025-03-19 | Enhanced Vascular Flow Simulations in Aortic Aneurysm via Physics-Informed Neural Networks and Deep Operator Networks | Oscar L. Cruz-González et.al. | 2503.17402 | null |
2025-03-19 | TripNet: Learning Large-scale High-fidelity 3D Car Aerodynamics with Triplane Networks | Qian Chen et.al. | 2503.17400 | null |
2025-03-21 | Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer | Qingyu Shi et.al. | 2503.17350 | null |
2025-03-21 | Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors | Wonbong Jang et.al. | 2503.17316 | null |
2025-03-21 | Cross-Band Modulation Design for Hybrid RF-Optical Systems | Thrassos K. Oikonomou et.al. | 2503.17296 | null |
2025-03-21 | 3D Neural Operator-Based Flow Surrogates around 3D geometries: Signed Distance Functions and Derivative Constraints | Ali Rabeh et.al. | 2503.17289 | link |
2025-03-21 | Hamiltonian Chaos: From Galactic Dynamics to Plasma Physics | Henok Tenaw Moges et.al. | 2503.17208 | null |
2025-03-21 | FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy | Xingchao Yang et.al. | 2503.17197 | null |
2025-03-21 | Employing Continuous Integration inspired workflows for benchmarking of scientific software – a use case on numerical cut cell quadrature | Teoman Toprak et.al. | 2503.17192 | null |
2025-03-25 | Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection | Duanrui Yu et.al. | 2503.17175 | null |
2025-03-21 | Generative adversarial framework to calibrate excursion set models for the 3D morphology of all-solid-state battery cathodes | Orkun Furat et.al. | 2503.17171 | null |
2025-03-26 | Hi-ALPS – An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving | Alexandra Arzberger et.al. | 2503.17168 | null |
2025-03-21 | Enhancing Steering Estimation with Semantic-Aware GNNs | Fouad Makiyeh et.al. | 2503.17153 | null |
2025-03-27 | Temporal-Guided Spiking Neural Networks for Event-Based Human Action Recognition | Siyuan Yang et.al. | 2503.17132 | null |
2025-03-21 | GAA-TSO: Geometry-Aware Assisted Depth Completion for Transparent and Specular Objects | Yizhe Liu et.al. | 2503.17106 | null |
2025-03-21 | R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model | Boyuan Zheng et.al. | 2503.17097 | null |
2025-03-21 | FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields | Kwan Yun et.al. | 2503.17095 | link |
2025-03-21 | ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration | Johan Edstedt et.al. | 2503.17093 | null |
2025-03-21 | Closeby Habitable Exoplanet Survey (CHES). III. Retrieval of Planetary Masses in Binaries Using the N-body Model with RV and Astrometry Synergy | Xiumin Huang et.al. | 2503.17090 | null |
2025-03-21 | Ex vivo experiment on vertebral body with defect representing bone metastasis | W. Lokbani et.al. | 2503.17047 | null |
2025-03-21 | ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail | Chandan Yeshwanth et.al. | 2503.17044 | null |
2025-03-21 | TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting | Jianchuan Chen et.al. | 2503.17032 | null |
2025-03-21 | Targetless 6DoF Calibration of LiDAR and 2D Scanning Radar Based on Cylindrical Occupancy | Weimin Wang et.al. | 2503.17002 | null |
2025-03-21 | High Accuracy Pulmonary Vessel Segmentation for Contrast and Non-contrast CT Images and Its Clinical Evaluation | Ying Ming et.al. | 2503.16988 | null |
2025-03-21 | Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting | Jinbo Yan et.al. | 2503.16979 | link |
2025-03-21 | GeoT: Geometry-guided Instance-dependent Transition Matrix for Semi-supervised Tooth Point Cloud Segmentation | Weihao Yu et.al. | 2503.16976 | null |
2025-03-21 | DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery | Jiadong Tang et.al. | 2503.16964 | null |
2025-03-21 | Observational constraints on the origin of the elements. IX. 3D NLTE abundances of metals in the context of Galactic Chemical Evolution Models and 4MOST | Nicholas Storm et.al. | 2503.16946 | null |
2025-03-21 | External tides: an important driver of velocity dispersion in molecular clouds | J. W. Zhou et.al. | 2503.16937 | null |
2025-03-21 | Optimized Minimal 3D Gaussian Splatting | Joo Chan Lee et.al. | 2503.16924 | null |
2025-03-21 | Design of 3D Non-Cartesian Trajectories for Fast Volumetric MRI via Analytic Coordinate Discretization | Kwang Eun Jang et.al. | 2503.16918 | null |
2025-03-21 | HSM: Hierarchical Scene Motifs for Multi-Scale Indoor Scene Generation | Hou In Derek Pun et.al. | 2503.16848 | null |
2025-03-21 | SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion | Xiyue Guo et.al. | 2503.16825 | link |
2025-03-21 | Toward AI-driven Multimodal Interfaces for Industrial CAD Modeling | Jiin Choi et.al. | 2503.16824 | null |
2025-03-21 | RigGS: Rigging of 3D Gaussians for Modeling Articulated Objects in Videos | Yuxin Yao et.al. | 2503.16822 | null |
2025-03-21 | Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision | Maoji Zheng et.al. | 2503.16811 | null |
2025-03-21 | Auto-Regressive Diffusion for Generating 3D Human-Object Interactions | Zichen Geng et.al. | 2503.16801 | link |
2025-03-21 | A Pathway to Near Tissue Computing through Processing-in-CTIA Pixels for Biomedical Applications | Zihan Yin et.al. | 2503.16798 | null |
2025-03-21 | Physics-Informed Deep B-Spline Networks for Dynamical Systems | Zhuoyuan Wang et.al. | 2503.16777 | null |
2025-03-21 | OpenCity3D: What do Vision-Language Models know about Urban Environments? | Valentin Bieri et.al. | 2503.16776 | null |
2025-03-21 | Nonlinear stability of compressible vortex sheets in three-dimensional elastodynamics | Robin Ming Chen et.al. | 2503.16758 | null |
2025-03-20 | SAGE: Semantic-Driven Adaptive Gaussian Splatting in Extended Reality | Chiara Schiavo et.al. | 2503.16747 | null |
2025-03-20 | Digitally Prototype Your Eye Tracker: Simulating Hardware Performance using 3D Synthetic Data | Esther Y. H. Lin et.al. | 2503.16742 | null |
2025-03-24 | CTorch: PyTorch-Compatible GPU-Accelerated Auto-Differentiable Projector Toolbox for Computed Tomography | Xiao Jiang et.al. | 2503.16741 | null |
2025-03-20 | Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding | Jinlong Li et.al. | 2503.16707 | link |
2025-03-20 | Perturbing finite temperature multicomponent DFT 1D Kohn-Sham systems: Peierls Gap & Kohn Anomaly | Adrian D. Scheppe et.al. | 2503.16705 | null |
2025-03-20 | GauRast: Enhancing GPU Triangle Rasterizers to Accelerate 3D Gaussian Splatting | Sixu Li et.al. | 2503.16681 | null |
2025-03-24 | iFlame: Interleaving Full and Linear Attention for Efficient Mesh Generation | Hanxiao Wang et.al. | 2503.16653 | null |
2025-03-20 | Fed-NDIF: A Noise-Embedded Federated Diffusion Model For Low-Count Whole-Body PET Denoising | Yinchi Zhou et.al. | 2503.16635 | null |
2025-03-20 | TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features | Dana Cohen-Bar et.al. | 2503.16630 | null |
2025-03-20 | Utilizing Reinforcement Learning for Bottom-Up part-wise Reconstruction of 2D Wire-Frame Projections | Julian Ziegler et.al. | 2503.16629 | null |
2025-03-20 | AREPO-IDORT: Implicit Discrete Ordinates Radiation Transport for Radiation Magnetohydrodynamics on an Unstructured Moving Mesh | Jing-Ze Ma et.al. | 2503.16627 | null |
2025-03-20 | A Recipe for Generating 3D Worlds From a Single Image | Katja Schwarz et.al. | 2503.16611 | null |
2025-03-20 | UniK3D: Universal Camera Monocular 3D Estimation | Luigi Piccinelli et.al. | 2503.16591 | link |
2025-03-19 | A Comprehensive Survey on Architectural Advances in Deep CNNs: Challenges, Applications, and Emerging Research Directions | Saddam Hussain Khan et.al. | 2503.16546 | null |
2025-03-18 | Word2Minecraft: Generating 3D Game Levels through Large Language Models | Shuo Huang et.al. | 2503.16536 | null |
2025-03-18 | Vision-Language Embodiment for Monocular Depth Estimation | Jinchang Zhang et.al. | 2503.16535 | null |
2025-03-17 | Immersive Virtual Reality Environments for Embodied Learning of Engineering Students | Rafael Padilla Perez et.al. | 2503.16519 | null |
2025-03-20 | Sonata: Self-Supervised Learning of Reliable Point Representations | Xiaoyang Wu et.al. | 2503.16429 | link |
2025-03-20 | SynCity: Training-Free Generation of 3D Worlds | Paul Engstler et.al. | 2503.16420 | null |
2025-03-20 | M3: 3D-Spatial MultiModal Memory | Xueyan Zou et.al. | 2503.16413 | link |
2025-03-20 | DreamTexture: Shape from Virtual Texture with Analysis by Augmentation | Ananta R. Bhattarai et.al. | 2503.16412 | null |
2025-03-20 | SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World | Chen Chen et.al. | 2503.16399 | link |
2025-03-25 | SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation | Chun-Han Yao et.al. | 2503.16396 | null |
2025-03-20 | Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images | Shengjun Zhang et.al. | 2503.16338 | null |
2025-03-20 | Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction | Edgar Sucar et.al. | 2503.16318 | null |
2025-03-20 | Rapid patient-specific neural networks for intraoperative X-ray to volume registration | Vivek Gopalakrishnan et.al. | 2503.16309 | link |
2025-03-26 | Unleashing Vecset Diffusion Model for Fast Shape Generation | Zeqiang Lai et.al. | 2503.16302 | link |
2025-03-20 | Surface quasigeostrophic turbulence: The refined study of an active scalar | Nicolas Valade et.al. | 2503.16294 | null |
2025-03-20 | SceneMI: Motion In-betweening for Modeling Human-Scene Interactions | Inwoo Hwang et.al. | 2503.16289 | null |
2025-03-20 | Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model | Zhaochong An et.al. | 2503.16282 | link |
2025-03-20 | Evaluation of Torque Ripple and Tooth Forces of a Skewed PMSM by 2D and 3D FE Simulations | Karsten Müller et.al. | 2503.16279 | null |
2025-03-21 | Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens | Shuqi Lu et.al. | 2503.16278 | link |
2025-03-20 | From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction | Ayberk Acar et.al. | 2503.16263 | null |
2025-03-20 | 3D radio data visualisation in open science platforms for next-generation observatories | I. Labadie-García et.al. | 2503.16237 | link |
2025-03-20 | Filters reveal emergent structure in computational morphogenesis | Hazhir Aliahmadi et.al. | 2503.16211 | null |
2025-03-20 | 3D Stochastic Geometry Model for Aerial Vehicle-Relayed Ground-Air-Satellite Connectivity | Yulei Wang et.al. | 2503.16202 | null |
2025-03-20 | OccluGaussian: Occlusion-Aware Gaussian Splatting for Large Scene Reconstruction and Rendering | Shiyong Liu et.al. | 2503.16177 | null |
2025-03-20 | Asymptotically Optimal Path Planning With an Approximation of the Omniscient Set | Jonáš Kříž et.al. | 2503.16164 | link |
2025-03-20 | High-Temperature-Resilient Hyperbolicity in a Mixed-Dimensional Superlattice | Jason Lynch et.al. | 2503.16147 | null |
2025-03-20 | Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection | Jiangyi Wang et.al. | 2503.16125 | null |
2025-03-20 | PoseTraj: Pose-Aware Trajectory Control in Video Diffusion | Longbin Ji et.al. | 2503.16068 | null |
2025-03-20 | Rejecting Outliers in 2D-3D Point Correspondences from 2D Forward-Looking Sonar Observations | Jiayi Su et.al. | 2503.16066 | null |
2025-03-20 | Scattering graph method for 3D radiative transfer | Antti Mikkonen et.al. | 2503.16037 | link |
2025-03-20 | GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions | Xiaomeng Chu et.al. | 2503.16013 | null |
2025-03-20 | Automating 3D Dataset Generation with Neural Radiance Fields | P. Schulz et.al. | 2503.15997 | link |
2025-03-20 | Animating the Uncaptured: Humanoid Mesh Animation with Video Diffusion Models | Marc Benedí San Millán et.al. | 2503.15996 | null |
2025-03-20 | A framework for efficient reduced order modelling in the Julia programming language | Nicholas Mueller et.al. | 2503.15994 | link |
2025-03-20 | Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation | Kendong Liu et.al. | 2503.15975 | null |
2025-03-20 | 1-Adamantanamine implementation in surface engineering of biomimetic PVDF-based membranes for enhanced membrane distillation | Samer Al-Gharabli et.al. | 2503.15930 | null |
2025-03-20 | Topology-Driven Design of Bianisotropic Metasurfaces Through Knot-Particles | Nadav Goshen et.al. | 2503.15925 | null |
2025-03-20 | Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras | Beilei Cui et.al. | 2503.15917 | null |
2025-03-20 | Enhancing Close-up Novel View Synthesis via Pseudo-labeling | Jiatong Xia et.al. | 2503.15908 | link |
2025-03-20 | Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions | Boran Wen et.al. | 2503.15898 | null |
2025-03-20 | Learning 3D Scene Analogies with Neural Contextual Scene Maps | Junho Kim et.al. | 2503.15897 | null |
2025-03-20 | Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation | Tiange Xiang et.al. | 2503.15877 | null |
2025-03-20 | VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling | Hyojun Go et.al. | 2503.15855 | null |
2025-03-20 | BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting | Yiren Lu et.al. | 2503.15835 | null |
2025-03-23 | Computation-Efficient and Recognition-Friendly 3D Point Cloud Privacy Protection | Haotian Ma et.al. | 2503.15818 | null |
2025-03-20 | Controlling Avatar Diffusion with Learnable Gaussian Embedding | Xuan Gao et.al. | 2503.15809 | null |
2025-03-20 | Nano-3D: Metasurface-Based Neural Depth Imaging | Bingxuan Li et.al. | 2503.15770 | null |
2025-03-20 | OffsetOPT: Explicit Surface Reconstruction without Normals | Huan Lei et.al. | 2503.15763 | null |
2025-03-20 | CATCH: a Cost Analysis Tool for Co-optimization of chiplet-based Heterogeneous systems | Alexander Graening et.al. | 2503.15753 | link |
2025-03-19 | Universal fault tolerant quantum computation in 2D without getting tied in knots | Margarita Davydova et.al. | 2503.15751 | null |
2025-03-19 | Uncertainty-Aware Diffusion Guided Refinement of 3D Scenes | Sarosij Bose et.al. | 2503.15742 | null |
2025-03-19 | SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints | Weiwen Hu et.al. | 2503.15712 | null |
2025-03-19 | GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving | William Ljungbergh et.al. | 2503.15672 | null |
2025-03-19 | CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image | Arindam Dutta et.al. | 2503.15671 | null |
2025-03-19 | DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis | Yuming Gu et.al. | 2503.15667 | link |
2025-03-19 | Toward Scalable, Flexible Scene Flow for Point Clouds | Kyle Vedder et.al. | 2503.15666 | null |
2025-03-19 | Bridging Algebra and Nature: Toward a Deformable 3D Hyper-complex framework for Modeling Dynamic Systems | Abdon Atangana et.al. | 2503.15649 | null |
2025-03-19 | Light in the dark forest – I. An efficient optimal estimator for 3D Lyman-alpha forest power spectrum | N. G. Karaçaylı et.al. | 2503.15619 | null |
2025-03-19 | Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling | Yanchen Luo et.al. | 2503.15567 | null |
2025-03-19 | Shap-MeD | Nicolás Laverde et.al. | 2503.15562 | null |
2025-03-19 | Cube: A Roblox View of 3D Intelligence | Foundation AI Team et.al. | 2503.15475 | link |
2025-03-19 | EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining | Boshen Xu et.al. | 2503.15470 | null |
2025-03-19 | V2X-DG: Domain Generalization for Vehicle-to-Everything Cooperative Perception | Baolu Li et.al. | 2503.15435 | null |
2025-03-19 | Federated Continual 3D Segmentation With Single-round Communication | Can Peng et.al. | 2503.15414 | null |
2025-03-19 | On the linear structure of the interlaced Alfvén vortices in the tail of Uranus at solstice | Filippo Pantellini et.al. | 2503.15396 | null |
2025-03-19 | Simulation of current-driven magnetisation switching in nanopillars with Perpendicular Shape Anisotropy | Natalia Boscolo Meneguolo et.al. | 2503.15388 | null |
2025-03-19 | Halide Perovskites as Spin-1 Dirac Materials | Dmitry Marchenko et.al. | 2503.15343 | null |
2025-03-19 | Euclid Quick Data Release (Q1). Photometric redshifts and physical properties of galaxies through the PHZ processing function | Euclid Collaboration et.al. | 2503.15306 | null |
2025-03-21 | SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes | Weixiao Gao et.al. | 2503.15300 | null |
2025-03-19 | EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds | Yuanchao Yue et.al. | 2503.15284 | link |
2025-03-19 | DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning | Ruowen Zhao et.al. | 2503.15265 | null |
2025-03-19 | GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector | Zechuan Li et.al. | 2503.15211 | null |
2025-03-19 | 3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation | Gyeongrok Oh et.al. | 2503.15185 | null |
2025-03-20 | Distilling 3D distinctive local descriptors for 6D pose estimation | Amir Hamza et.al. | 2503.15106 | null |
2025-03-19 | Exploring the Perspectives of Social VR-Aware Non-Parent Adults and Parents on Children’s Use of Social Virtual Reality | Cristina Fiani et.al. | 2503.15100 | null |
2025-03-19 | Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs | Yao Cheng et.al. | 2503.15091 | null |
2025-03-19 | An Investigation of Beam Density on LiDAR Object Detection Performance | Christoph Griesbacher et.al. | 2503.15087 | null |
2025-03-19 | xMOD: Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D motion | Saad Lahlali et.al. | 2503.15022 | null |
2025-03-19 | Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene | Shengqiong Wu et.al. | 2503.15019 | null |
2025-03-19 | Universal Scene Graph Generation | Shengqiong Wu et.al. | 2503.15005 | null |
2025-03-26 | Fault-Tolerant Optical Quantum Computation using 3D Hybrid Cluster States | Peilin Du et.al. | 2503.14988 | null |
2025-03-19 | Depth-Aware Range Image-Based Model for Point Cloud Segmentation | Bike Chen et.al. | 2503.14955 | null |
2025-03-19 | 3D Engine-ready Photorealistic Avatars via Dynamic Textures | Yifan Wang et.al. | 2503.14943 | null |
2025-03-24 | Deep Polycuboid Fitting for Compact 3D Representation of Indoor Scenes | Gahye Lee et.al. | 2503.14912 | null |
2025-03-19 | Robust Distribution Alignment for Industrial Anomaly Detection under Distribution Shift | Jingyi Liao et.al. | 2503.14910 | null |
2025-03-19 | Temporal-Consistent Video Restoration with Pre-trained Diffusion Models | Hengkang Wang et.al. | 2503.14863 | null |
2025-03-19 | ClimateGS: Real-Time Climate Simulation with 3D Gaussian Style Transfer | Yuezhen Xie et.al. | 2503.14845 | null |
2025-03-19 | Decompositional Neural Scene Reconstruction with Generative Diffusion Prior | Junfeng Ni et.al. | 2503.14830 | null |
2025-03-19 | Global well-posedness and optimal time-decay of 3D full compressible Navier-Stokes system | Wenwen Huo et.al. | 2503.14808 | null |
2025-03-18 | SketchSplat: 3D Edge Reconstruction via Differentiable Multi-view Sketch Splatting | Haiyang Ying et.al. | 2503.14786 | null |
2025-03-18 | SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis | Hou In Ivan Tam et.al. | 2503.14756 | null |
2025-03-18 | HandSplat: Embedding-Driven Gaussian Splatting for High-Fidelity Hand Rendering | Yilan Dong et.al. | 2503.14736 | null |
2025-03-18 | SplatVoxel: History-Aware Novel View Streaming without Temporal Training | Yiming Wang et.al. | 2503.14698 | null |
2025-03-18 | Lagrangian chaos and unique ergodicity for stochastic primitive equations | Antonio Agresti et.al. | 2503.14658 | null |
2025-03-18 | Virtual reality and web browser visualization of high-intensity laser-matter interactions | Martin Matys et.al. | 2503.14632 | null |
2025-03-18 | Three-dimensional Reconstruction of the Lumbar Spine with Submillimeter Accuracy Using Biplanar X-ray Images | Wanxin Yu et.al. | 2503.14573 | null |
2025-03-21 | SuperPC: A Single Diffusion Model for Point Cloud Completion, Upsampling, Denoising, and Colorization | Yi Du et.al. | 2503.14558 | null |
2025-03-22 | Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey | Liewen Liao et.al. | 2503.14537 | null |
2025-03-14 | Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control | Hejia Chen et.al. | 2503.14517 | null |
2025-03-19 | Advances in 4D Generation: A Survey | Qiaowei Miao et.al. | 2503.14501 | link |
2025-03-18 | Tracking Meets Large Multimodal Models for Driving Scenario Understanding | Ayesha Ishaq et.al. | 2503.14498 | link |
2025-03-19 | State Space Model Meets Transformer: A New Paradigm for 3D Object Detection | Chuxin Wang et.al. | 2503.14493 | null |
2025-03-18 | Stable Virtual Camera: Generative View Synthesis with Diffusion Models | Jensen et.al. | 2503.14489 | null |
2025-03-18 | Optimized 3D Gaussian Splatting using Coarse-to-Fine Image Frequency Modulation | Umar Farooq et.al. | 2503.14475 | null |
2025-03-18 | SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model | Yucheng Mao et.al. | 2503.14463 | null |
2025-03-18 | Origin of holes and rings in the Green Monster of Cassiopeia A: Insights from 3D magnetohydrodynamic simulations | S. Orlando et.al. | 2503.14455 | null |
2025-03-18 | Bolt3D: Generating 3D Scenes in Seconds | Stanislaw Szymanowicz et.al. | 2503.14445 | null |
2025-03-19 | Rods in flows: the PDE theory of immersed elastic filaments | Dallas Albritton et.al. | 2503.14440 | null |
2025-03-24 | DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers | Mert Bulent Sariyildiz et.al. | 2503.14405 | null |
2025-03-18 | Diffusion-based Facial Aesthetics Enhancement with 3D Structure Guidance | Lisha Li et.al. | 2503.14402 | null |
2025-03-18 | Weakly Supervised Spatial Implicit Neural Representation Learning for 3D MRI-Ultrasound Deformable Image Registration in HDR Prostate Brachytherapy | Jing Wang et.al. | 2503.14395 | null |
2025-03-18 | 3D Densification for Multi-Map Monocular VSLAM in Endoscopy | X. Anadón et.al. | 2503.14346 | null |
2025-03-18 | RoMedFormer: A Rotary-Embedding Transformer Foundation Model for 3D Genito-Pelvic Structure Segmentation in MRI and CT | Yuheng Li et.al. | 2503.14304 | null |
2025-03-18 | Towards synthetic generation of realistic wooden logs | Fedor Zolotarev et.al. | 2503.14277 | link |
2025-03-18 | Improving Adaptive Density Control for 3D Gaussian Splatting | Glenn Grubert et.al. | 2503.14274 | link |
2025-03-18 | Integral modelling and Reinforcement Learning control of 3D liquid metal coating on a moving substrate | Fabio Pino et.al. | 2503.14270 | null |
2025-03-18 | A Chain-Driven, Sandwich-Legged Quadruped Robot: Design and Experimental Analysis | Aman Singh et.al. | 2503.14255 | null |
2025-03-18 | Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis | Yizhou Li et.al. | 2503.14219 | null |
2025-03-18 | RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images | Junjin Xiao et.al. | 2503.14198 | link |
2025-03-18 | Lightweight Gradient-Aware Upscaling of 3D Gaussian Splatting Images | Simon Niedermayr et.al. | 2503.14171 | null |
2025-03-18 | Concat-ID: Towards Universal Identity-Preserving Video Synthesis | Yong Zhong et.al. | 2503.14151 | null |
2025-03-18 | Reliable uncertainty quantification for 2D/3D anatomical landmark localization using multi-output conformal prediction | Jef Jonkers et.al. | 2503.14106 | link |
2025-03-18 | SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation | Weihong Chen et.al. | 2503.14097 | null |
2025-03-18 | GenPara: Enhancing the 3D Design Editing Process by Inferring Users’ Regions of Interest with Text-Conditional Shape Parameters | Jiin Choi et.al. | 2503.14096 | null |
2025-03-18 | Demonstration of a mechanical external biventricular assist device for resuscitative thoracotomy | Kristóf Sárosi et.al. | 2503.14087 | null |
2025-03-18 | Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach | Tianshu Wu et.al. | 2503.14051 | null |
2025-03-21 | A Modular Edge Device Network for Surgery Digitalization | Vincent Schorp et.al. | 2503.14049 | null |
2025-03-18 | Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Runsong Zhu et.al. | 2503.14029 | link |
2025-03-18 | MeshFleet: Filtered and Annotated 3D Vehicle Dataset for Domain Specific Generative Modeling | Damian Boborzi et.al. | 2503.14002 | link |
2025-03-19 | Multimodal Feature-Driven Deep Learning for the Prediction of Duck Body Dimensions and Weight | Yi Xiao et.al. | 2503.14001 | null |
2025-03-18 | A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios | Huy-Hoang Bui et.al. | 2503.13982 | link |
2025-03-18 | BG-Triangle: Bézier Gaussian Triangle for 3D Vectorization and Rendering | Minye Wu et.al. | 2503.13961 | null |
2025-03-18 | Light4GS: Lightweight Compact 4D Gaussian Splatting Generation via Context Model | Mufan Liu et.al. | 2503.13948 | null |
2025-03-18 | PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds | Barza Nisar et.al. | 2503.13914 | null |
2025-03-18 | MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments | Zhengsheng Guo et.al. | 2503.13882 | null |
2025-03-18 | Robust3D-CIL: Robust Class-Incremental Learning for 3D Perception | Jinge Ma et.al. | 2503.13869 | null |
2025-03-18 | MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations | Hongyu Ke et.al. | 2503.13858 | link |
2025-03-18 | Anisotropic Turbulent Flows Observed in Above the Loop-top Regions During Solar Flares | Xiaoyan Xie et.al. | 2503.13827 | null |
2025-03-18 | Multi-Harmonic Gridded 3D Deconvolution (MH3D) for Robust and Accurate Image Reconstruction in MPI for Single Axis Drive Field Scanners | Toby Sanders et.al. | 2503.13802 | null |
2025-03-17 | Using 3D reconstruction from image motion to predict total leaf area in dwarf tomato plants | Dmitrii Usenko et.al. | 2503.13778 | null |
2025-03-17 | MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models | Johannes Meier et.al. | 2503.13743 | null |
2025-03-17 | Planetesimal formation via the streaming instability in simulations of infall dominated young disks | L. -A. Hühn et.al. | 2503.13606 | null |
2025-03-17 | Next-Scale Autoregressive Models are Zero-Shot Single-Image Object View Synthesizers | Shiran Yuan et.al. | 2503.13588 | link |
2025-03-17 | ASMR: Adaptive Skeleton-Mesh Rigging and Skinning via 2D Generative Prior | Seokhyeon Hong et.al. | 2503.13579 | null |
2025-03-17 | Online Signature Verification based on the Lagrange formulation with 2D and 3D robotic models | Moises Diaz et.al. | 2503.13573 | null |
2025-03-17 | MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset | Zhaodong Wu et.al. | 2503.13560 | link |
2025-03-16 | Feasibility study for reconstruction of knee MRI from one corresponding X-ray via CNN | Zhe Wang et.al. | 2503.13555 | null |
2025-03-16 | CNCast: Leveraging 3D Swin Transformer and DiT for Enhanced Regional Weather Forecasting | Hongli Liang et.al. | 2503.13546 | null |
2025-03-17 | Measuring and unbiasing the BAO shift in the Lyman-Alpha forest with AbacusSummit | Boryana Hadzhiyska et.al. | 2503.13442 | null |
2025-03-17 | Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images | Tianhao Wu et.al. | 2503.13439 | null |
2025-03-17 | WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes | Ling Yang et.al. | 2503.13435 | link |
2025-03-17 | Escaping Plato’s Cave: Robust Conceptual Reasoning through Interpretable 3D Neural Object Volumes | Nhi Pham et.al. | 2503.13429 | null |
2025-03-17 | Spacetime Structure of Regular Accelerating Black Hole Pair in General Relativity | M. M. Akbar et.al. | 2503.13420 | null |
2025-03-17 | Agents Play Thousands of 3D Video Games | Zhongwen Xu et.al. | 2503.13356 | null |
2025-03-17 | TriDF: Triplane-Accelerated Density Fields for Few-Shot Remote Sensing Novel View Synthesis | Jiaming Kang et.al. | 2503.13347 | null |
2025-03-17 | 3D morphology formation in a mixture of three differently averse components | Emilio N. M. Cirillo et.al. | 2503.13338 | null |
2025-03-17 | UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation | Yinqiao Wang et.al. | 2503.13303 | null |
2025-03-17 | Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors | Katja Schwarz et.al. | 2503.13272 | null |
2025-03-19 | FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis | Luxi Chen et.al. | 2503.13265 | null |
2025-03-17 | Digital Beamforming Enhanced Radar Odometry | Jingqi Jiang et.al. | 2503.13252 | null |
2025-03-17 | UV $_6$Sn$_6$: a new kagome material with unusual $5f$ magnetism | S. M. Thomas et.al. | 2503.13245 | null |
2025-03-17 | MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis | Marvin Seyfarth et.al. | 2503.13211 | null |
2025-03-17 | New Liouville type theorems for 3D steady incompressible MHD equations and Hall-MHD equations | Zhibing Zhang et.al. | 2503.13202 | null |
2025-03-17 | 3D Hierarchical Panoptic Segmentation in Real Orchard Environments Across Different Sensors | Matteo Sodano et.al. | 2503.13188 | null |
2025-03-17 | 3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o | Dingning Liu et.al. | 2503.13185 | null |
2025-03-17 | DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction | Rui Wang et.al. | 2503.13176 | null |
2025-03-17 | Error analysis of the Strang splitting for the 3D semilinear wave equation with finite-energy data | Maximilian Ruff et.al. | 2503.13126 | null |
2025-03-17 | 3D Human Interaction Generation: A Survey | Siyuan Fan et.al. | 2503.13120 | null |
2025-03-17 | MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs | Erik Daxberger et.al. | 2503.13111 | null |
2025-03-17 | Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization | Yiwei Xu et.al. | 2503.13086 | null |
2025-03-20 | Beyond Role-Based Surgical Domain Modeling: Generalizable Re-Identification in the Operating Room | Tony Danjun Wang et.al. | 2503.13028 | link |
2025-03-17 | PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data | ChangHee Yang et.al. | 2503.13025 | null |
2025-03-17 | TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba | Jiaxu Liu et.al. | 2503.13004 | null |
2025-03-17 | SparseAlign: A Fully Sparse Framework for Cooperative Object Detection | Yunshuang Yuan et.al. | 2503.12982 | null |
2025-03-17 | Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning | Xueying Jiang et.al. | 2503.12974 | null |
2025-03-17 | OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering | Guanhua Ding et.al. | 2503.12968 | null |
2025-03-17 | Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait | Chaolong Yang et.al. | 2503.12963 | link |
2025-03-17 | HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding | Jiahe Zhao et.al. | 2503.12955 | null |
2025-03-17 | Open3DBench: Open-Source Benchmark for 3D-IC Backend Implementation and PPA Evaluation | Yunqi Shi et.al. | 2503.12946 | link |
2025-03-17 | GIFT: Generated Indoor video frames for Texture-less point tracking | Jianzheng Huang et.al. | 2503.12944 | null |
2025-03-17 | AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction | Xuying Zhang et.al. | 2503.12929 | null |
2025-03-17 | Efficient Multimodal 3D Object Detector via Instance-Level Contrastive Distillation | Zhuoqun Su et.al. | 2503.12914 | null |
2025-03-17 | $3d$ flat bands and coupled $4f$ moments in the kagome-honeycomb permanent magnet Sm${2}$Co${17}$ | Hao Zheng et.al. | 2503.12890 | null |
2025-03-17 | RGBAvatar: Reduced Gaussian Blendshapes for Online Modeling of Head Avatars | Linzhou Li et.al. | 2503.12886 | null |
2025-03-17 | CAT-3DGS Pro: A New Benchmark for Efficient 3DGS Compression | Yu-Ting Zhan et.al. | 2503.12862 | null |
2025-03-17 | Adaptive Transformer Attention and Multi-Scale Fusion for Spine 3D Segmentation | Yanlin Xiang et.al. | 2503.12853 | null |
2025-03-17 | In vivo validation of Wireless Power Transfer System for Magnetically Controlled Robotic Capsule Endoscopy | Alessandro Catania et.al. | 2503.12850 | null |
2025-03-17 | Introducing GPGPUs to smartphone-based digital holographic microscope for 3D imaging | Yuki Nagahama et.al. | 2503.12848 | null |
2025-03-17 | CompMarkGS: Robust Watermarking for Compression 3D Gaussian Splatting | Sumin In et.al. | 2503.12836 | null |
2025-03-17 | PASTA: Part-Aware Sketch-to-3D Shape Generation with Text-Aligned Prior | Seunggwan Lee et.al. | 2503.12834 | null |
2025-03-17 | MT-PCR: Leveraging Modality Transformation for Large-Scale Point Cloud Registration with Limited Overlap | Yilong Wu et.al. | 2503.12833 | null |
2025-03-17 | AV-Surf: Surface-Enhanced Geometry-Aware Novel-View Acoustic Synthesis | Hadam Baek et.al. | 2503.12806 | null |
2025-03-18 | A fast Fourier spectral method for wave kinetic equation | Kunlun Qi et.al. | 2503.12805 | null |
2025-03-17 | TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image | Haoxiao Wang et.al. | 2503.12779 | null |
2025-03-17 | VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis | Zhifeng Wang et.al. | 2503.12758 | null |
2025-03-17 | R3-Avatar: Record and Retrieve Temporal Codebook for Reconstructing Photorealistic Human Avatars | Yifan Zhan et.al. | 2503.12751 | null |
2025-03-17 | ProtoDepth: Unsupervised Continual Depth Completion with Prototypes | Patrick Rim et.al. | 2503.12745 | null |
2025-03-16 | AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration | Javier Tirado-Garín et.al. | 2503.12701 | null |
2025-03-16 | KISS-SLAM: A Simple, Robust, and Accurate 3D LiDAR SLAM System With Enhanced Generalization Capabilities | Tiziano Guadagnino et.al. | 2503.12660 | null |
2025-03-16 | Rubikon: Intelligent Tutoring for Rubik’s Cube Learning Through AR-enabled Physical Task Reconfiguration | Muzhe Wu et.al. | 2503.12619 | null |
2025-03-16 | Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey | Yaoting Wang et.al. | 2503.12605 | link |
2025-03-16 | Point Cloud Based Scene Segmentation: A Survey | Dan Halperin et.al. | 2503.12595 | null |
2025-03-16 | Fourier-Based 3D Multistage Transformer for Aberration Correction in Multicellular Specimens | Thayer Alshaabi et.al. | 2503.12593 | link |
2025-03-16 | MUKCa: Accurate and Affordable Cobot Calibration Without External Measurement Devices | Giovanni Franzese et.al. | 2503.12584 | null |
2025-03-16 | Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View | Xianzu Wu et.al. | 2503.12553 | link |
2025-03-16 | BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis | Weiguang Zhao et.al. | 2503.12539 | link |
2025-03-16 | SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs | Guibiao Liao et.al. | 2503.12535 | null |
2025-03-16 | Learning Contour-Guided 3D Face Reconstruction with Occlusions | Dapeng Zhao et.al. | 2503.12494 | null |
2025-03-16 | Geometry-Aware Face Reconstruction Under Occluded Scenes | Dapeng Zhao et.al. | 2503.12492 | null |
2025-03-16 | VRsketch2Gaussian: 3D VR Sketch Guided 3D Object Generation with Gaussian Splatting | Songen Gu et.al. | 2503.12383 | null |
2025-03-16 | RENO: Real-Time Neural Compression for 3D LiDAR Point Clouds | Kang You et.al. | 2503.12382 | link |
2025-03-16 | L2COcc: Lightweight Camera-Centric Semantic Scene Completion via Distillation of LiDAR Model | Ruoyu Wang et.al. | 2503.12369 | null |
2025-03-16 | TopoGaussian: Inferring Internal Topology Structures from Visual Clues | Xiaoyu Xiong et.al. | 2503.12343 | null |
2025-03-18 | GS-I $^{3}$ : Gaussian Splatting for Surface Reconstruction from Illumination-Inconsistent Images | Tengfei Wang et.al. | 2503.12335 | link |
2025-03-16 | Swift4D:Adaptive divide-and-conquer Gaussian Splatting for compact and efficient reconstruction of dynamic scene | Jiahao Wu et.al. | 2503.12307 | null |
2025-03-15 | REdiSplats: Ray Tracing for Editable Gaussian Splatting | Krzysztof Byrski et.al. | 2503.12284 | null |
2025-03-15 | How Do Microstrip Losses Impact Near-Field Beam Depth in Dynamic Metasurface Antennas? | Panagiotis Gavriilidis et.al. | 2503.12280 | null |
2025-03-15 | Decoupled Hands: An Approach for Aligning Perspectives in Collaborative Mixed Reality | Matt Gottsacker et.al. | 2503.12253 | null |
2025-03-15 | Meta-operators for all-optical image processing | Linzhi Yu et.al. | 2503.12252 | null |
2025-03-15 | Multi-slice beam propagation method for imaging multiple-scattering samples on reflective substrates | Jiabei Zhu et.al. | 2503.12246 | null |
2025-03-15 | Shadow Art Kanji: Inverse Rendering Application | William Louis Rothman et.al. | 2503.12229 | link |
2025-03-15 | D4orm: Multi-Robot Trajectories with Dynamics-aware Diffusion Denoised Deformations | Yuhao Zhang et.al. | 2503.12204 | null |
2025-03-15 | Singly occupied 4 $f$ antiferromagnetic insulators: CePO$_4$ and CeVO$_4$ | Hari Paudyal et.al. | 2503.12186 | null |
2025-03-15 | VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction | Zijian He et.al. | 2503.12165 | null |
2025-03-15 | Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis | Hongyu Sun et.al. | 2503.12150 | link |
2025-03-15 | Degenerate Fluid Polyamorphism Induced by Symmetrical Molecular Interconversion | Mikhail A. Anisimov et.al. | 2503.12138 | null |
2025-03-15 | NLTE spectral modelling for a carbon-oxygen and helium white-dwarf merger as a Ca-rich transient candidate | F. P. Callan et.al. | 2503.12105 | null |
2025-03-15 | Towards Vision Zero: The Accid3nD Dataset | Walter Zimmer et.al. | 2503.12095 | null |
2025-03-15 | SFMNet: Sparse Focal Modulation for 3D Object Detection | Oren Shrout et.al. | 2503.12093 | null |
2025-03-15 | FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields | Rui Qian et.al. | 2503.12086 | null |
2025-03-15 | Metallicity dependence of the CO-to-H $_2$ and the [CI]-to-H$_2$ conversion factors in galaxies | Thomas G. Bisbas et.al. | 2503.12073 | null |
2025-03-15 | A Comprehensive Survey on Knowledge Distillation | Amir M. Mansourian et.al. | 2503.12067 | link |
2025-03-18 | Tailor: An Integrated Text-Driven CG-Ready Human and Garment Generation System | Zhiyao Sun et.al. | 2503.12052 | null |
2025-03-15 | SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering | Byeongjun Park et.al. | 2503.12024 | link |
2025-03-18 | UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection | Xin Jin et.al. | 2503.12009 | null |
2025-03-18 | 3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene Reconstruction | Peizhen Zheng et.al. | 2503.12001 | link |
2025-03-15 | An Acoustic Inversion-Based Flow Measurement Model in 3D Hydrodynamic Systems | Jiwei Li et.al. | 2503.11986 | null |
2025-03-15 | DecompDreamer: Advancing Structured 3D Asset Generation with Multi-Object Decomposition and Gaussian Splatting | Utkarsh Nath et.al. | 2503.11981 | null |
2025-03-15 | Ultrafast space-time optical merons in momentum-energy space | Murat Yessenov et.al. | 2503.11980 | null |
2025-03-15 | DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes | Runfa Blark Li et.al. | 2503.11979 | null |
2025-03-15 | Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars | Eric M. Chen et.al. | 2503.11978 | null |
2025-03-15 | CHOrD: Generation of Collision-Free, House-Scale, and Organized Digital Twins for 3D Indoor Scenes with Controllable Floor Plans and Optimal Layouts | Chong Su et.al. | 2503.11958 | null |
2025-03-15 | Integrating Product Coefficients for Improved 3D LiDAR Data Classification | Patricia Medina et.al. | 2503.11943 | null |
2025-03-14 | Periodic phase slips and frequency comb generation at tunable microwave frequencies in superconducting diabolo structures | Axel J. M. Deenen et.al. | 2503.11925 | null |
2025-03-14 | Sketch-to-Skill: Bootstrapping Robot Learning with Human Drawn Trajectory Sketches | Peihong Yu et.al. | 2503.11918 | null |
2025-03-14 | Quantum critical point followed by Kondo-like behavior due to Cu substitution in itinerant, antiferromagnet ${\text{La}{2}\text{(Cu}{x}\text {Ni}_{1-x})_7}$ | Atreyee Das et.al. | 2503.11872 | null |
2025-03-14 | Learning-based Estimation of Forward Kinematics for an Orthotic Parallel Robotic Mechanism | Jingzong Zhou et.al. | 2503.11855 | null |
2025-03-14 | Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling | Christopher Xie et.al. | 2503.11806 | null |
2025-03-14 | StyleMorpheus: A Style-Based 3D-Aware Morphable Face Model | Peizhi Yan et.al. | 2503.11792 | null |
2025-03-14 | Safe Multi-Robotic Arm Interaction via 3D Convex Shapes | Ali Umut Kaypak et.al. | 2503.11791 | null |
2025-03-14 | ECLARE: Efficient cross-planar learning for anisotropic resolution enhancement | Samuel W. Remedios et.al. | 2503.11787 | link |
2025-03-12 | Physical knowledge improves prediction of EM Fields | Andrzej Dulny et.al. | 2503.11703 | null |
2025-03-14 | Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation | Hiroyasu Akada et.al. | 2503.11652 | null |
2025-03-14 | VGGT: Visual Geometry Grounded Transformer | Jianyuan Wang et.al. | 2503.11651 | link |
2025-03-14 | The waves-in-space Purcell effect for superconducting qubits | Param Patel et.al. | 2503.11644 | null |
2025-03-14 | Advancing 3D Gaussian Splatting Editing with Complementary and Consensus Information | Xuanqi Zhang et.al. | 2503.11601 | null |
2025-03-13 | Reparametrization of 3D CSC Dubins Paths Enabling 2D Search | Ling Xu et.al. | 2503.11560 | null |
2025-03-14 | FLASHμ: Fast Localizing And Sizing of Holographic Microparticles | Ayush Paliwal et.al. | 2503.11538 | null |
2025-03-14 | HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models | Ziqin Zhou et.al. | 2503.11513 | null |
2025-03-14 | MRS-CWC: A Weakly Constrained Multi-Robot System with Controllable Constraint Stiffness for Mobility and Navigation in Unknown 3D Rough Environments | Runze Xiao et.al. | 2503.11461 | null |
2025-03-14 | A Neural Network Architecture Based on Attention Gate Mechanism for 3D Magnetotelluric Forward Modeling | Xin Zhong et.al. | 2503.11408 | null |
2025-03-14 | Data-constrained 3D MHD Simulation of a Spiral Jet Caused by an Unstable Flux Rope Embedded in Fan-spine Configuration | Z. F. Li et.al. | 2503.11380 | null |
2025-03-17 | EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation | Zengyu Wan et.al. | 2503.11371 | null |
2025-03-14 | PBR3DGen: A VLM-guided Mesh Generation with High-quality PBR Texture | Xiaokang Wei et.al. | 2503.11368 | null |
2025-03-14 | Confinement controls bacterial spreading at all scales | Renaud Baillou et.al. | 2503.11364 | null |
2025-03-14 | Predictive study of non-axisymmetric neutral beam ion loss on the upgraded KSTAR plasma-facing components | Taeuk Moon et.al. | 2503.11353 | null |
2025-03-14 | EgoSplat: Open-Vocabulary Egocentric Scene Understanding with Language Embedded 3D Gaussian Splatting | Di Li et.al. | 2503.11345 | null |
2025-03-14 | 1D fluids with repulsive nearest-neighbour interactions: Low-temperature anomalies | Igor Travěnec et.al. | 2503.11310 | null |
2025-03-14 | L2RSI: Cross-view LiDAR-based Place Recognition for Large-scale Urban Scenes via Remote Sensing Imagery | Ziwei Shi et.al. | 2503.11245 | null |
2025-03-14 | Growth Laws and Universality in 2-TIPS: Microscopic and Coarse grained approach | Nayana Venkatareddy et.al. | 2503.11243 | null |
2025-03-14 | NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications | Li Cui et.al. | 2503.11199 | null |
2025-03-14 | Online Test-time Adaptation for 3D Human Pose Estimation: A Practical Perspective with Estimated 2D Poses | Qiuxia Lin et.al. | 2503.11194 | null |
2025-03-14 | Uncertainty-Aware Normal-Guided Gaussian Splatting for Surface Reconstruction from Sparse Image Sequences | Zhen Tan et.al. | 2503.11172 | null |
2025-03-14 | GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior | Zichen Tang et.al. | 2503.11143 | link |
2025-03-14 | DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation | Hongbin Lin et.al. | 2503.11122 | link |
2025-03-19 | Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering | Kaixuan Jiang et.al. | 2503.11117 | null |
2025-03-14 | Advancing Electronics Manufacturing Using Dynamically Programmable Micro-Transfer Printing System | Qinhua Guo et.al. | 2503.11109 | null |
2025-03-14 | Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space | Weichen Zhan et.al. | 2503.11094 | link |
2025-03-14 | OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning | Yuan Liu et.al. | 2503.11093 | null |
2025-03-14 | Aerial Vision-and-Language Navigation with Grid-based View Selection and Map Construction | Ganlong Zhao et.al. | 2503.11091 | null |
2025-03-14 | Dust Clumping in Outer Protoplanetary Disks: the Interplay Among Four Instabilities | Pinghui Huang et.al. | 2503.11076 | null |
2025-03-14 | Low-cost Real-world Implementation of the Swing-up Pendulum for Deep Reinforcement Learning Experiments | Peter Böhm et.al. | 2503.11065 | null |
2025-03-14 | Magnetoconductivity due to electron-electron interaction in a non-Galilean–invariant Fermi liquid | Tatia Kiliptari et.al. | 2503.11063 | null |
2025-03-20 | Fourier Neural Operator based surrogates for $CO_2$ storage in realistic geologies | Anirban Chandra et.al. | 2503.11031 | null |
2025-03-14 | EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models | Yixuan Zhang et.al. | 2503.11028 | null |
2025-03-14 | Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume | Reef Alturki et.al. | 2503.10982 | null |
2025-03-13 | Usable Privacy in Virtual Worlds: Design Implications for Data Collection Awareness and Control Interfaces in Virtual Reality | Viktorija Paneva et.al. | 2503.10915 | null |
2025-03-13 | Fabrication of Metal Air Bridges for Superconducting Circuits using Two-photon Lithography | Yi-Hsiang Huang et.al. | 2503.10909 | null |
2025-03-13 | Memory-Efficient 3D High-Resolution Medical Image Synthesis Using CRF-Guided GANs | Mahshid Shiri et.al. | 2503.10899 | null |
2025-03-13 | RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors | Avinash Paliwal et.al. | 2503.10860 | link |
2025-03-13 | HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer | Zhang Zhang et.al. | 2503.10777 | null |
2025-03-20 | Unifying 2D and 3D Vision-Language Understanding | Ayush Jain et.al. | 2503.10745 | null |
2025-03-13 | 3D Extended Object Tracking based on Extruded B-Spline Side View Profiles | Longfei Han et.al. | 2503.10730 | null |
2025-03-13 | Deep Learning-Based Automated Workflow for Accurate Segmentation and Measurement of Abdominal Organs in CT Scans | Praveen Shastry et.al. | 2503.10717 | null |
2025-03-13 | HiCMamba: Enhancing Hi-C Resolution and Identifying 3D Genome Structures with State Space Modeling | Minghao Yang et.al. | 2503.10713 | null |
2025-03-12 | 3D Multiphase Heterogeneous Microstructure Generation Using Conditional Latent Diffusion Models | Nirmal Baishnab et.al. | 2503.10711 | null |
2025-03-08 | Text-to-3D Generation using Jensen-Shannon Score Distillation | Khoi Do et.al. | 2503.10660 | null |
2025-03-14 | V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes | Yanming Zhang et.al. | 2503.10634 | null |
2025-03-13 | NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models | Mert Albaba et.al. | 2503.10626 | null |
2025-03-13 | LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds | Lingteng Qiu et.al. | 2503.10625 | link |
2025-03-13 | ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness | Boqian Li et.al. | 2503.10624 | link |
2025-03-13 | OCCUQ: Exploring Efficient Uncertainty Quantification for 3D Occupancy Prediction | Severin Heidrich et.al. | 2503.10605 | link |
2025-03-13 | MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction | Yingshuang Zou et.al. | 2503.10604 | null |
2025-03-13 | Long Context Tuning for Video Generation | Yuwei Guo et.al. | 2503.10589 | null |
2025-03-15 | Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection | Chaoqun Wang et.al. | 2503.10579 | null |
2025-03-13 | Lightweight Models for Emotional Analysis in Video | Quoc-Tien Nguyen et.al. | 2503.10530 | link |
2025-03-13 | PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models | Zilu Guo et.al. | 2503.10529 | null |
2025-03-18 | Beyond Atoms: Enhancing Molecular Pretrained Representations with 3D Space Modeling | Shuqi Lu et.al. | 2503.10489 | null |
2025-03-13 | 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models | Wanhua Li et.al. | 2503.10437 | link |
2025-03-13 | Protostellar Outflows at the EarliesT Stages (POETS). VII. Circumstellar gas kinematics traced by water masers inside the HC HII region NGC7538 IRS1 | Luca Moscadelli et.al. | 2503.10415 | null |
2025-03-13 | RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation | Yuwen Du et.al. | 2503.10410 | link |
2025-03-13 | Hyper3D: Efficient 3D Representation via Hybrid Triplane and Octree Feature for Enhanced 3D Shape Variational Auto-Encoders | Jingyu Guo et.al. | 2503.10403 | null |
2025-03-14 | Reexamining Circular Dichroism in Photoemission From a Topological Insulator | Ittai Sidilkover et.al. | 2503.10388 | null |
2025-03-13 | 3D non-LTE Ca II line formation in metal-poor FGK stars. I. Abundance corrections, radial velocity corrections, and synthetic spectra | Cis Lagae et.al. | 2503.10378 | null |
2025-03-13 | DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image | Qi Zhao et.al. | 2503.10342 | null |
2025-03-13 | OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions | Maxim Popov et.al. | 2503.10331 | null |
2025-03-13 | A Comparison of Calcium Sources for Ion-Trap Loading via Laser Ablation | Daisy R H Smith et.al. | 2503.10329 | null |
2025-03-13 | Pushing the Boundary of Quantum Advantage in Hard Combinatorial Optimization with Probabilistic Computers | Shuvro Chowdhury et.al. | 2503.10302 | null |
2025-03-13 | Analysis of linear Boussinesq-type models coupled with static interfaces | José Galaz et.al. | 2503.10300 | null |
2025-03-13 | MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion | Zebin He et.al. | 2503.10289 | null |
2025-03-13 | VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames | Zhiqi Li et.al. | 2503.10286 | null |
2025-03-13 | ROODI: Reconstructing Occluded Objects with Denoising Inpainters | Yeonjin Chang et.al. | 2503.10256 | null |
2025-03-19 | PRISM: Preference Refinement via Implicit Scene Modeling for 3D Vision-Language Preference-Based Reinforcement Learning | Yirong Sun et.al. | 2503.10177 | null |
2025-03-15 | 3D Student Splatting and Scooping | Jialin Zhu et.al. | 2503.10148 | link |
2025-03-13 | GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping | Jinfeng Liu et.al. | 2503.10143 | null |
2025-03-13 | Mapless Collision-Free Flight via MPC using Dual KD-Trees in Cluttered Environments | Linzuo Zhang et.al. | 2503.10141 | null |
2025-03-13 | Deep Learning-Based Direct Leaf Area Estimation using Two RGBD Datasets for Model Development | Namal Jayasuriya et.al. | 2503.10129 | null |
2025-03-13 | Mobile Food Printing in Professional Kitchens: An inquiry of potential use cases with novice chefs | Yağmur Kocaman et.al. | 2503.10116 | null |
2025-03-13 | IMPACT: Intelligent Motion Planning with Acceptable Contact Trajectories via Vision-Language Models | Yiyang Ling et.al. | 2503.10110 | null |
2025-03-13 | G $^{2}$ SF-MIAD: Geometry-Guided Score Fusion for Multimodal Industrial Anomaly Detection | Chengyu Tao et.al. | 2503.10091 | null |
2025-03-13 | AhaRobot: A Low-Cost Open-Source Bimanual Mobile Manipulator for Embodied AI | Haiqin Cui et.al. | 2503.10070 | null |
2025-03-13 | SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation | Xiangyu Shi et.al. | 2503.10069 | null |
2025-03-13 | Fourier Decomposition for Explicit Representation of 3D Point Cloud Attributes | Donghyun Kim et.al. | 2503.10055 | null |
2025-03-13 | AI-assisted 3D Preservation and Reconstruction of Temple Arts | Naai-Jung Shih et.al. | 2503.10031 | null |
2025-03-13 | Speedy MASt3R | Jingxing Li et.al. | 2503.10017 | null |
2025-03-13 | MetricGrids: Arbitrary Nonlinear Approximation with Elementary Metric Grids based Implicit Neural Representation | Shu Wang et.al. | 2503.10000 | link |
2025-03-13 | Reference-Free 3D Reconstruction of Brain Dissection Photographs with Machine Learning | Lin Tian et.al. | 2503.09963 | link |
2025-03-13 | TGP: Two-modal occupancy prediction with 3D Gaussian and sparse points for 3D Environment Awareness | Mu Chen et.al. | 2503.09941 | null |
2025-03-12 | QuickDraw: Fast Visualization, Analysis and Active Learning for Medical Image Segmentation | Daniel Syomichev et.al. | 2503.09885 | link |
2025-03-12 | CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | Hariprasath Govindarajan et.al. | 2503.09878 | null |
2025-03-12 | Demonstration of a new CLLBC-based gamma- and neutron-sensitive free-moving omnidirectional imaging detector | Jayson R. Vavrek et.al. | 2503.09862 | null |
2025-03-12 | StyleSpeaker: Audio-Enhanced Fine-Grained Style Modeling for Speech-Driven 3D Facial Animation | An Yang et.al. | 2503.09852 | null |
2025-03-18 | SE(3)-Equivariant Robot Learning and Control: A Tutorial Survey | Joohwan Seo et.al. | 2503.09829 | null |
2025-03-12 | How good are deep learning methods for automated road safety analysis using video data? An experimental study | Qingwu Liu et.al. | 2503.09807 | null |
2025-03-12 | Evaluating the Impact of Synthetic Data on Object Detection Tasks in Autonomous Driving | Enes Özeren et.al. | 2503.09803 | null |
2025-03-12 | I2V3D: Controllable image-to-video generation with 3D guidance | Zhiyuan Zhang et.al. | 2503.09733 | null |
2025-03-12 | Solving Superconformal Ward Identities in Mellin Space | Clément Virally et.al. | 2503.09703 | null |
2025-03-12 | Physics-Aware Human-Object Rendering from Sparse Views via 3D Gaussian Splatting | Weiquan Wang et.al. | 2503.09640 | null |
2025-03-11 | FPGS: Feed-Forward Semantic-aware Photorealistic Style Transfer of Large-Scale Gaussian Splatting | GeonU Kim et.al. | 2503.09635 | null |
2025-03-11 | V2M4: 4D Mesh Animation Reconstruction from a Single Monocular Video | Jianqi Chen et.al. | 2503.09631 | null |
2025-03-13 | RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling | Itay Chachy et.al. | 2503.09601 | link |
2025-03-12 | FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model | Jiahao Xia et.al. | 2503.09560 | null |
2025-03-12 | Electromyography-Informed Facial Expression Reconstruction for Physiological-Based Synthesis and Analysis | Tim Büchner et.al. | 2503.09556 | null |
2025-03-12 | Using Convolutional Neural Networks to Accelerate 3D Coherent Synchrotron Radiation Computations | Christopher Leon et.al. | 2503.09551 | null |
2025-03-12 | GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals | Shuokang Huang et.al. | 2503.09537 | null |
2025-03-12 | CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games | Peng Chen et.al. | 2503.09527 | null |
2025-03-12 | Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation | Máté Tóth et.al. | 2503.09464 | null |
2025-03-12 | Effective conductivity of conduit networks with random conductivities | I. Colecchio et.al. | 2503.09457 | null |
2025-03-12 | Online Language Splatting | Saimouli Katragadda et.al. | 2503.09447 | null |
2025-03-12 | SuperCarver: Texture-Consistent 3D Geometry Super-Resolution for High-Fidelity Surface Detail Generation | Qijian Zhang et.al. | 2503.09439 | null |
2025-03-12 | Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter | Kechun Xu et.al. | 2503.09423 | null |
2025-03-12 | Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training | Jiatong Xia et.al. | 2503.09396 | null |
2025-03-12 | GASPACHO: Gaussian Splatting for Controllable Humans and Objects | Aymen Mir et.al. | 2503.09342 | null |
2025-03-12 | Stealthy Patch-Wise Backdoor Attack in 3D Point Cloud via Curvature Awareness | Yu Feng et.al. | 2503.09336 | null |
2025-03-12 | Fourier shape parametrization in covariant density functional theory for nuclear fission | Zeyu Li et.al. | 2503.09308 | null |
2025-03-12 | Better Together: Unified Motion Capture and 3D Avatar Reconstruction | Arthur Moreau et.al. | 2503.09293 | null |
2025-03-12 | Magnetization control problem for the 2D and 3D evolutionary Landau-Lifshitz-Bloch equation | Sidhartha Patnaik et.al. | 2503.09266 | null |
2025-03-12 | Contrasting $c$-axis and in-plane uniaxial stress effects on superconductivity and stripe order in La${1.885}$Ba${0.115}$CuO$_4$ | S. S. Islam et.al. | 2503.09236 | null |
2025-03-12 | A 3d particle visualization system for temperature management | Benoit Lange et.al. | 2503.09198 | null |
2025-03-12 | Long-Term Planning Around Humans in Domestic Environments with 3D Scene Graphs | Ermanno Bartoli et.al. | 2503.09173 | null |
2025-03-15 | WonderVerse: Extendable 3D Scene Generation with Video Generative Models | Hao Feng et.al. | 2503.09160 | null |
2025-03-12 | On the impact of observation error correlations in data assimilation, with application to along-track altimeter data | Olivier Goux et.al. | 2503.09140 | null |
2025-03-12 | Extreme resilience and dissipation in heterogeneous disordered materials | Jehoon Moon et.al. | 2503.09056 | null |
2025-03-12 | The SAMI Galaxy Survey: large-scale environment affects galaxy spin amplitudes and the formation of slow rotators | Stefania Barsanti et.al. | 2503.09052 | null |
2025-03-12 | Motion Blender Gaussian Splatting for Dynamic Reconstruction | Xinyu Zhang et.al. | 2503.09040 | null |
2025-03-12 | Computational Design and Fabrication of Protective Foam | Tsukasa Fukusato et.al. | 2503.09019 | null |
2025-03-13 | HumanoidPano: Hybrid Spherical Panoramic-LiDAR Cross-Modal Perception for Humanoid Robots | Qiang Zhang et.al. | 2503.09010 | null |
2025-03-17 | Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection | Xuzhong Hu et.al. | 2503.08992 | null |
2025-03-12 | Large-scale multifractality and lack of self-similar decay for Burgers and 3D Navier-Stokes turbulence | Takeshi Matsumoto et.al. | 2503.08983 | null |
2025-03-11 | FP3: A 3D Foundation Policy for Robotic Manipulation | Rujia Yang et.al. | 2503.08950 | null |
2025-03-11 | Acoustic Neural 3D Reconstruction Under Pose Drift | Tianxiang Lin et.al. | 2503.08930 | null |
2025-03-11 | HessianForge: Scalable LiDAR reconstruction with Physics-Informed Neural Representation and Smoothness Energy Constraints | Hrishikesh Viswanath et.al. | 2503.08929 | link |
2025-03-11 | A Deep Bayesian Nonparametric Framework for Robust Mutual Information Estimation | Forough Fazeliasl et.al. | 2503.08902 | null |
2025-03-11 | Improved Approximation Algorithms for Three-Dimensional Bin Packing | Debajyoti Kar et.al. | 2503.08863 | null |
2025-03-11 | Deformable Registration Framework for Augmented Reality-based Surgical Guidance in Head and Neck Tumor Resection | Qingyun Yang et.al. | 2503.08802 | null |
2025-03-11 | 3d Mirrors and Phase Diagrams of Abelian Gauge Theories | Julius F. Grimminger et.al. | 2503.08791 | link |
2025-03-11 | Detection of magnetic fields in superclusters of galaxies | G. V. Pignataro et.al. | 2503.08765 | null |
2025-03-11 | Novel design of biplanar electrodes in a multiwell plate for transepithelial electrical resistance measurement in 3D cell cultures | Georges Dubourg et.al. | 2503.08744 | null |
2025-03-11 | Cooperative Bearing-Only Target Pursuit via Multiagent Reinforcement Learning: Design and Experiment | Jianan Li et.al. | 2503.08740 | null |
2025-03-11 | Representing 3D Shapes With 64 Latent Vectors for 3D Diffusion Models | In Cho et.al. | 2503.08737 | null |
2025-03-10 | Direct Flow Simulations with Implicit Neural Representation of Complex Geometry | Samundra Karki et.al. | 2503.08724 | null |
2025-03-10 | Versatile Multimodal Controls for Whole-Body Talking Human Animation | Zheng Qin et.al. | 2503.08714 | null |
2025-03-11 | GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing | Yuanhao Wang et.al. | 2503.08678 | null |
2025-03-11 | Language-Depth Navigated Thermal and Visible Image Fusion | Jinchang Zhang et.al. | 2503.08676 | null |
2025-03-11 | Keypoint Detection and Description for Raw Bayer Images | Jiakai Lin et.al. | 2503.08673 | null |
2025-03-11 | MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention | Yuhan Wang et.al. | 2503.08664 | link |
2025-03-11 | GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection | Dušan Malić et.al. | 2503.08639 | null |
2025-03-11 | Birth of magnetized low-mass protostars and circumstellar disks | Adnan Ali Ahmad et.al. | 2503.08637 | null |
2025-03-11 | A Grid Cell-Inspired Structured Vector Algebra for Cognitive Maps | Sven Krausse et.al. | 2503.08608 | null |
2025-03-11 | LiSu: A Dataset and Method for LiDAR Surface Normal Estimation | Dušan Malić et.al. | 2503.08601 | null |
2025-03-11 | X-Field: A Physically Grounded Representation for 3D X-ray Reconstruction | Feiran Wang et.al. | 2503.08596 | link |
2025-03-11 | 3D Point Cloud Generation via Autoregressive Up-sampling | Ziqiao Meng et.al. | 2503.08594 | null |
2025-03-18 | High-Quality 3D Head Reconstruction from Any Single Portrait Image | Jianfu Zhang et.al. | 2503.08516 | null |
2025-03-11 | SAS: Segment Any 3D Scene with Integrated 2D Priors | Zhuoyuan Li et.al. | 2503.08512 | null |
2025-03-11 | PCGS: Progressive Compression of 3D Gaussian Splatting | Yihang Chen et.al. | 2503.08511 | link |
2025-03-11 | TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting | Fengyi Zhang et.al. | 2503.08485 | null |
2025-03-11 | GAS-NeRF: Geometry-Aware Stylization of Dynamic Radiance Fields | Nhat Phuong Anh Vu et.al. | 2503.08483 | null |
2025-03-11 | Collaborative Dynamic 3D Scene Graphs for Open-Vocabulary Urban Scene Understanding | Tim Steinke et.al. | 2503.08474 | null |
2025-03-11 | TrackOcc: Camera-based 4D Panoptic Occupancy Tracking | Zhuoguang Chen et.al. | 2503.08471 | link |
2025-03-13 | JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data | Runjian Chen et.al. | 2503.08422 | null |
2025-03-13 | Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels | Qiming Xia et.al. | 2503.08421 | link |
2025-03-11 | AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models | Kwan Yun et.al. | 2503.08417 | link |
2025-03-11 | Multi-particle-collision simulation of heat transfer in low-dimensional fluids | Rongxiang Luo et.al. | 2503.08409 | null |
2025-03-17 | WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images | Yansong Guo et.al. | 2503.08407 | null |
2025-03-11 | nnInteractive: Redefining 3D Promptable Segmentation | Fabian Isensee et.al. | 2503.08373 | link |
2025-03-11 | 3D Medical Imaging Segmentation on Non-Contrast CT | Canxuan Gang et.al. | 2503.08361 | null |
2025-03-11 | Mitigating Ambiguities in 3D Classification with Gaussian Splatting | Ruiqi Zhang et.al. | 2503.08352 | null |
2025-03-11 | Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving | Runwei Guan et.al. | 2503.08336 | null |
2025-03-11 | Navier-Stokes/Allen-Cahn system with moving contact line | Yinghua Li et.al. | 2503.08334 | null |
2025-03-11 | HERO: Human Reaction Generation from Videos | Chengjun Yu et.al. | 2503.08270 | null |
2025-03-11 | HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents | Tristan Tomilin et.al. | 2503.08241 | null |
2025-03-11 | HRAvatar: High-Quality and Relightable Gaussian Head Avatar | Dongbin Zhang et.al. | 2503.08224 | null |
2025-03-11 | CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning | Kaiqiang Xiong et.al. | 2503.08219 | null |
2025-03-11 | MVD-HuGaS: Human Gaussians from a Single Image via 3D Human Multi-view Diffusion Prior | Kaiqiang Xiong et.al. | 2503.08218 | null |
2025-03-11 | S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction | Guangting Zheng et.al. | 2503.08217 | null |
2025-03-11 | Explaining Human Preferences via Metrics for Structured 3D Reconstruction | Jack Langerman et.al. | 2503.08208 | null |
2025-03-11 | Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming | Jiaxuan Zhu et.al. | 2503.08166 | null |
2025-03-11 | Multimodal Generation of Animatable 3D Human Models with AvatarForge | Xinhang Liu et.al. | 2503.08165 | null |
2025-03-11 | A Framework for Reducing the Complexity of Geometric Vision Problems and its Application to Two-View Triangulation with Approximation Bounds | Felix Rydell et.al. | 2503.08142 | null |
2025-03-11 | HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views | Ethan Griffiths et.al. | 2503.08140 | null |
2025-03-11 | ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting | Junfu Guo et.al. | 2503.08135 | null |
2025-03-11 | THz Beam Squint Mitigation via 3D Rotatable Antennas | Yike Xie et.al. | 2503.08134 | null |
2025-03-11 | MaRI: Material Retrieval Integration across Domains | Jianhui Wang et.al. | 2503.08111 | null |
2025-03-12 | Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning | Lizhen Xu et.al. | 2503.08101 | link |
2025-03-13 | MVGSR: Multi-View Consistency Gaussian Splatting for Robust Surface Reconstruction | Chenfeng Hou et.al. | 2503.08093 | null |
2025-03-11 | SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection | Hyeongseok Son et.al. | 2503.08092 | null |
2025-03-11 | Efficient Trajectory Generation Based on Traversable Planes in 3D Complex Architectural Spaces | Mengke Zhang et.al. | 2503.08076 | null |
2025-03-11 | GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats | Kai Deng et.al. | 2503.08071 | link |
2025-03-11 | The AGEL Survey Data Release 2: A Gravitational Lens Sample for Galaxy Evolution and Cosmology | Tania M. Barone et.al. | 2503.08041 | null |
2025-03-11 | A Three-Dimensional Pursuit-Evasion Game Based on Fuzzy Actor-Critic Learning Algorithm | Penglin Hu et.al. | 2503.08013 | null |
2025-03-12 | CDI3D: Cross-guided Dense-view Interpolation for 3D Reconstruction | Zhiyuan Wu et.al. | 2503.08005 | null |
2025-03-11 | Joint Semantic Transmission and Resource Allocation for Intelligent Computation Task Offloading in MEC Systems | Yuanpeng Zheng et.al. | 2503.08001 | null |
2025-03-11 | 7DGS: Unified Spatial-Temporal-Angular Gaussian Splatting | Zhongpai Gao et.al. | 2503.07946 | null |
2025-03-13 | From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans | Qinji Yu et.al. | 2503.07933 | null |
2025-03-10 | BEARCUBS: A benchmark for computer-using web agents | Yixiao Song et.al. | 2503.07919 | null |
2025-03-10 | Atom-Chip Compatible Optical Lattice | Robert Leonard et.al. | 2503.07913 | null |
2025-03-10 | FunGraph: Functionality Aware 3D Scene Graphs for Language-Prompted Scene Interaction | Dennis Rotondi et.al. | 2503.07909 | null |
2025-03-10 | Neural Radiance and Gaze Fields for Visual Attention Modeling in 3D Environments | Andrei Chubarau et.al. | 2503.07828 | null |
2025-03-10 | POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality | Joey Wilson et.al. | 2503.07819 | null |
2025-03-10 | AgriField3D: A Curated 3D Point Cloud and Procedural Model Dataset of Field-Grown Maize from a Diversity Panel | Elvis Kimara et.al. | 2503.07813 | null |
2025-03-10 | SegResMamba: An Efficient Architecture for 3D Medical Image Segmentation | Badhan Kumar Das et.al. | 2503.07766 | null |
2025-03-10 | Geometric Delocalization in Two Dimensions | Laura Shou et.al. | 2503.07705 | null |
2025-03-10 | TVNet: A Novel Time Series Analysis Method Based on Dynamic Convolution and 3D-Variation | Chenghan Li et.al. | 2503.07674 | null |
2025-03-09 | Data Foundations for Large Scale Multimodal Clinical Foundation Models | Wei Dai et.al. | 2503.07667 | link |
2025-03-08 | HIPPO-MAT: Decentralized Task Allocation Using GraphSAGE and Multi-Agent Deep Reinforcement Learning | Lavanya Ratnabala et.al. | 2503.07662 | null |
2025-03-06 | 3D Surface Reconstruction and Volume Approximation via the meshless methods | T. Li et.al. | 2503.07644 | null |
2025-03-10 | HumanMM: Global Human Motion Recovery from Multi-shot Videos | Yuhong Zhang et.al. | 2503.07597 | link |
2025-03-10 | Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | Youjun Zhao et.al. | 2503.07593 | null |
2025-03-10 | Discovery of a Highly Anisotropic Type-II Ferromagnetic Weyl State Exhibiting a 3D Quantum Hall Effect | Yingdong Guan et.al. | 2503.07564 | null |
2025-03-10 | Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression | Thibaut Loiseau et.al. | 2503.07561 | null |
2025-03-10 | Real-Time Structural Deflection Estimation in Hydraulically Actuated Systems Using 3D Flexible Multibody Simulation and DNNs | Qasim Khadim et.al. | 2503.07528 | null |
2025-03-10 | PointVLA: Injecting the 3D World into Vision-Language-Action Models | Chengmeng Li et.al. | 2503.07511 | null |
2025-03-10 | PE3R: Perception-Efficient 3D Reconstruction | Jie Hu et.al. | 2503.07507 | link |
2025-03-11 | AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements | Calvin Yeung et.al. | 2503.07499 | link |
2025-03-10 | NeAS: 3D Reconstruction from X-ray Images using Neural Attenuation Surface | Chengrui Zhu et.al. | 2503.07491 | null |
2025-03-10 | Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction | Zongzheng Zhang et.al. | 2503.07485 | link |
2025-03-10 | SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting | Jiahui Zhang et.al. | 2503.07476 | null |
2025-03-10 | A Review on Geometry and Surface Inspection in 3D Concrete Printing | K. Mawas et.al. | 2503.07472 | null |
2025-03-12 | EigenGS Representation: From Eigenspace to Gaussian Image Space | Lo-Wei Tai et.al. | 2503.07446 | null |
2025-03-10 | Analysis of 3D Urticaceae Pollen Classification Using Deep Learning Models | Tijs Konijn et.al. | 2503.07419 | null |
2025-03-10 | GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts | Minwen Liao et.al. | 2503.07417 | null |
2025-03-10 | Skelite: Compact Neural Networks for Efficient Iterative Skeletonization | Luis D. Reyes Vargas et.al. | 2503.07369 | link |
2025-03-10 | Fully Unsupervised Annotation of C. Elegans | Christoph Karg et.al. | 2503.07348 | null |
2025-03-10 | Temporal Triplane Transformers as Occupancy World Models | Haoran Xu et.al. | 2503.07338 | null |
2025-03-10 | Cool-3D: An End-to-End Thermal-Aware Framework for Early-Phase Design Space Exploration of Microfluidic-Cooled 3DICs | Runxi Wang et.al. | 2503.07297 | link |
2025-03-10 | Phase field study of the effective fracture energy increase during dynamic crack propagation in disordered heterogeneous materials | Hervé Henry et.al. | 2503.07267 | null |
2025-03-10 | X-ray and radio data obtained by XMM-Newton and VLA constrain the stellar wind of the magnetic quasi-Wolf-Rayet star in HD45166 | P. Leto et.al. | 2503.07205 | null |
2025-03-12 | Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion | Mona Sheikh Zeinoddin et.al. | 2503.07204 | null |
2025-03-10 | All That Glitters Is Not Gold: Key-Secured 3D Secrets within 3D Gaussian Splatting | Yan Ren et.al. | 2503.07191 | link |
2025-03-10 | Multi-Modal 3D Mesh Reconstruction from Images and Text | Melvin Reka et.al. | 2503.07190 | null |
2025-03-10 | The 4D Human Embryonic Brain Atlas: spatiotemporal atlas generation for rapid anatomical changes using first-trimester ultrasound from the Rotterdam Periconceptional Cohort | Wietske A. P. Bastiaansen et.al. | 2503.07177 | null |
2025-03-10 | Vortex frequency locking and Shapiro steps in superconductor open nanotubes | Igor Bogush et.al. | 2503.07162 | null |
2025-03-10 | Controllable 3D Outdoor Scene Generation via Scene Graphs | Yuheng Liu et.al. | 2503.07152 | link |
2025-03-10 | VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation | Hanzhi Chen et.al. | 2503.07135 | null |
2025-03-10 | A Light Perspective for 3D Object Detection | Marcelo Eduardo Pederiva et.al. | 2503.07133 | null |
2025-03-10 | Learning A Zero-shot Occupancy Network from Vision Foundation Models via Self-supervised Adaptation | Sihao Lin et.al. | 2503.07125 | null |
2025-03-12 | RS2V-L: Vehicle-Mounted LiDAR Data Generation from Roadside Sensor Observations | Ruidan Xing et.al. | 2503.07085 | null |
2025-03-10 | TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation | Victor Shea-Jay Huang et.al. | 2503.07050 | null |
2025-03-10 | Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera | Dong-Hee Paek et.al. | 2503.07029 | link |
2025-03-10 | HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions | Keyu Du et.al. | 2503.07019 | link |
2025-03-10 | Frequency-Aware Density Control via Reparameterization for High-Quality Rendering of 3D Gaussian Splatting | Zhaojie Zeng et.al. | 2503.07000 | link |
2025-03-10 | ConcreTizer: Model Inversion Attack via Occupancy Classification and Dispersion Control for 3D Point Cloud Restoration | Youngseok Kim et.al. | 2503.06986 | null |
2025-03-10 | Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark | Jiahao Wang et.al. | 2503.06983 | link |
2025-03-10 | Aligning Instance-Semantic Sparse Representation towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives | Jiaxin Li et.al. | 2503.06947 | null |
2025-03-10 | Handle Object Navigation as Weighted Traveling Repairman Problem | Ruimeng Liu et.al. | 2503.06937 | link |
2025-03-10 | CAFusion: Controllable Anatomical Synthesis of Perirectal Lymph Nodes via SDF-guided Diffusion | Weidong Guo et.al. | 2503.06919 | null |
2025-03-10 | DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation | Xiaoliang Ju et.al. | 2503.06900 | null |
2025-03-10 | Accessing the Effect of Phyllotaxy and Planting Density on Light Use Efficiency in Field-Grown Maize using 3D Reconstructions | Nasla Saleem et.al. | 2503.06887 | null |
2025-03-10 | HIF: Height Interval Filtering for Efficient Dynamic Points Removal | Shufang Zhang et.al. | 2503.06863 | null |
2025-03-10 | ActiveInitSplat: How Active Image Selection Helps Gaussian Splatting | Konstantinos D. Polyzos et.al. | 2503.06859 | null |
2025-03-10 | A Simple Sonic Mapping Method Verified by CT Scan Images | Jimmy Xuekai Li et.al. | 2503.06842 | null |
2025-03-10 | Sub-Image Recapture for Multi-View 3D Reconstruction | Yanwei Wang et.al. | 2503.06818 | null |
2025-03-09 | Robotic Ultrasound-Guided Femoral Artery Reconstruction of Anatomically-Representative Phantoms | Lidia Al-Zogbi et.al. | 2503.06795 | null |
2025-03-09 | Infinite Leagues Under the Sea: Photorealistic 3D Underwater Terrain Generation by Latent Fractal Diffusion Models | Tianyi Zhang et.al. | 2503.06784 | null |
2025-03-09 | Investigating Image Manifolds of 3D Objects: Learning, Shape Analysis, and Comparisons | Benjamin Beaudett et.al. | 2503.06773 | null |
2025-03-09 | Gaussian RBFNet: Gaussian Radial Basis Functions for Fast and Accurate Representation and Reconstruction of Neural Fields | Abdelaziz Bouzidi et.al. | 2503.06762 | null |
2025-03-09 | CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving | Rui Song et.al. | 2503.06744 | null |
2025-03-12 | X-GAN: A Generative AI-Powered Unsupervised Model for High-Precision Segmentation of Retinal Main Vessels toward Early Detection of Glaucoma | Cheng Huang et.al. | 2503.06743 | null |
2025-03-09 | D3DR: Lighting-Aware Object Insertion in Gaussian Splatting | Vsevolod Skorokhodov et.al. | 2503.06740 | null |
2025-03-09 | ImplicitCell: Resolution Cell Modeling of Joint Implicit Volume Reconstruction and Pose Refinement in Freehand 3D Ultrasound | Sheng Song et.al. | 2503.06686 | link |
2025-03-12 | REArtGS: Reconstructing and Generating Articulated Objects via 3D Gaussian Splatting with Geometric and Motion Constraints | Di Wu et.al. | 2503.06677 | null |
2025-03-09 | AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation | Yang Zou et.al. | 2503.06660 | null |
2025-03-09 | MultiCo3D: Multi-Label Voxel Contrast for One-Shot Incremental Segmentation of 3D Neuroimages | Hao Xu et.al. | 2503.06598 | null |
2025-03-09 | Global-Aware Monocular Semantic Scene Completion with State Space Models | Shijie Li et.al. | 2503.06569 | null |
2025-03-09 | SGA-INTERACT: A 3D Skeleton-based Benchmark for Group Activity Understanding in Modern Basketball Tactic | Yuchen Yang et.al. | 2503.06522 | link |
2025-03-09 | A Mesh Is Worth 512 Numbers: Spectral-domain Diffusion Modeling for High-dimension Shape Generation | Jiajie Fan et.al. | 2503.06485 | null |
2025-03-09 | Density-Matrix Embedding Based Multi-Configurational Perturbation Theory Approach to Single-Ion Magnets | Zhe-Bin Guan et.al. | 2503.06483 | null |
2025-03-09 | Vector Quantized Feature Fields for Fast 3D Semantic Lifting | George Tang et.al. | 2503.06469 | null |
2025-03-09 | SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts | Shijia Zhao et.al. | 2503.06467 | link |
2025-03-09 | StructGS: Adaptive Spherical Harmonics and Rendering Enhancements for Superior 3D Gaussian Splatting | Zexu Huang et.al. | 2503.06462 | null |
2025-03-12 | Wind of Change: Faraday Rotation in a Simulated Large Magellanic Cloud | Hilay Shah et.al. | 2503.06449 | null |
2025-03-09 | OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection | Adrian Chow et.al. | 2503.06435 | null |
2025-03-09 | Dynamical scaling study of three-dimensional XY spin glass toward the spin-chirality decoupling picture | Yusuke Terasawa et.al. | 2503.06386 | null |
2025-03-09 | X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second | Guofeng Zhang et.al. | 2503.06382 | null |
2025-03-08 | Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs | Qizhe Wu et.al. | 2503.06342 | link |
2025-03-08 | Observation of Two Cascading Screening Processes in an Iron-based Superconductor | Ming-Hua Chang et.al. | 2503.06314 | null |
2025-03-11 | Optimization and Benchmarking of Monolithically Stackable Gain Cell Memory for Last-Level Cache | Faaiq Waqar et.al. | 2503.06304 | null |
2025-03-08 | An inviscid limit problem for Navier-Stokes equations in 3D domains with oscillatory boundaries | Tuoc Phan et.al. | 2503.06298 | null |
2025-03-08 | From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning | Shuangzhi Li et.al. | 2503.06282 | null |
2025-03-08 | SplatTalk: 3D VQA with Gaussian Splatting | Anh Thai et.al. | 2503.06271 | null |
2025-03-08 | Get In Video: Add Anything You Want to the Video | Shaobin Zhuang et.al. | 2503.06268 | null |
2025-03-08 | Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection | Yifan Chang et.al. | 2503.06237 | null |
2025-03-08 | StreamGS: Online Generalizable Gaussian Splatting Reconstruction for Unposed Image Streams | Yang LI et.al. | 2503.06235 | null |
2025-03-15 | Integrating Chain-of-Thought for Multimodal Alignment: A Study on 3D Vision-Language Learning | Yanjun Chen et.al. | 2503.06232 | null |
2025-03-08 | Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations | Meng Wang et.al. | 2503.06222 | null |
2025-03-08 | VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion | Meng Wang et.al. | 2503.06219 | link |
2025-03-08 | ForestSplats: Deformable transient field for Gaussian Splatting in the Wild | Wongi Park et.al. | 2503.06179 | null |
2025-03-08 | Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction | Kai Li et.al. | 2503.06161 | null |
2025-03-08 | UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces | Baining Zhao et.al. | 2503.06157 | null |
2025-03-08 | SRM-Hair: Single Image Head Mesh Reconstruction via 3D Morphable Hair | Zidu Wang et.al. | 2503.06154 | link |
2025-03-08 | GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation | Ye Tao et.al. | 2503.06136 | null |
2025-03-08 | RGB-Phase Speckle: Cross-Scene Stereo 3D Reconstruction via Wrapped Pre-Normalization | Kai Yang et.al. | 2503.06125 | null |
2025-03-08 | SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography | Xuanyu Zhang et.al. | 2503.06118 | null |
2025-03-08 | NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features | Hongjia Zhai et.al. | 2503.06117 | null |
2025-03-08 | Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision | David C. Jeong et.al. | 2503.06089 | null |
2025-03-08 | Geometrically Templated Dynamic Wrinkling from Suspended Poly(vinyl alcohol) Soap Films | Yuchong Gao et.al. | 2503.06065 | null |
2025-03-08 | Towards Universal Text-driven CT Image Segmentation | Yuheng Li et.al. | 2503.06030 | null |
2025-03-08 | Zero-Shot Peg Insertion: Identifying Mating Holes and Estimating SE(2) Poses with Vision-Language Models | Masaru Yajima et.al. | 2503.06026 | null |
2025-03-08 | Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity | Xiaohao Xu et.al. | 2503.06014 | link |
2025-03-08 | End-to-End HOI Reconstruction Transformer with Graph-based Encoding | Zhenrong Wang et.al. | 2503.06012 | null |
2025-03-08 | Optimization models for needle placement in 3D-printed masks for high dose rate brachytherapy | Nasim Mirzavand Boroujeni et.al. | 2503.06000 | null |
2025-03-08 | ReJSHand: Efficient Real-Time Hand Pose Estimation and Mesh Reconstruction Using Refined Joint and Skeleton Features | Shan An et.al. | 2503.05995 | link |
2025-03-07 | MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice | Hongwei Yi et.al. | 2503.05978 | null |
2025-03-07 | Bayesian Fields: Task-driven Open-Set Semantic Gaussian Splatting | Dominic Maggio et.al. | 2503.05949 | null |
2025-03-07 | Pink-Beam Dark Field X-ray Microscopy: Expanding 3D/4D Imaging for Complex and Deformed Microstructures | Can Yildirim et.al. | 2503.05921 | null |
2025-03-07 | Global dissipative solutions of the 3D Naiver-Stokes and MHD equations | Alexey Cheskidov et.al. | 2503.05692 | null |
2025-03-07 | Joint 3D Point Cloud Segmentation using Real-Sim Loop: From Panels to Trees and Branches | Tian Qiu et.al. | 2503.05630 | null |
2025-03-07 | Ising on $\mathbb{S}^2$ – The Affine Conjecture | Richard C. Brower et.al. | 2503.05621 | null |
2025-03-07 | The shape of FIREbox galaxies and a potential tension with low-mass disks | Courtney Klein et.al. | 2503.05612 | null |
2025-03-07 | TomatoScanner: phenotyping tomato fruit based on only RGB image | Xiaobei Zhao et.al. | 2503.05568 | link |
2025-03-07 | Disconnect to Connect: A Data Augmentation Method for Improving Topology Accuracy in Image Segmentation | Juan Miguel Valverde et.al. | 2503.05541 | link |
2025-03-12 | Free Your Hands: Lightweight Relightable Turntable Capture Pipeline | Jiahui Fan et.al. | 2503.05511 | null |
2025-03-07 | DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction | Miaowei Wang et.al. | 2503.05484 | null |
2025-03-12 | PinchCatcher: Enabling Multi-selection for Gaze+Pinch | Jinwook Kim et.al. | 2503.05456 | null |
2025-03-07 | $\mathrm{O}$/$\mathrm{SO}$ Gauge Groups, $BC$ Quivers and $O3$ Planes | Sam Bennett et.al. | 2503.05443 | null |
2025-03-07 | LiDAR-enhanced 3D Gaussian Splatting Mapping | Jian Shen et.al. | 2503.05425 | null |
2025-03-07 | A skeletonization based image segmentation algorithm to isolate slender regions in 3D microstructures | Vinit Vijay Deshpande et.al. | 2503.05417 | null |
2025-03-07 | Decay of solutions of nonlinear Dirac equations | Sebastian Herr et.al. | 2503.05410 | null |
2025-03-07 | Self-Modeling Robots by Photographing | Kejun Hu et.al. | 2503.05398 | null |
2025-03-07 | CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images | Jungho Lee et.al. | 2503.05332 | link |
2025-03-07 | Escaping Plato’s Cave: Towards the Alignment of 3D and Text Latent Spaces | Souhail Hadgi et.al. | 2503.05283 | null |
2025-03-07 | Evaluation of 3D Terrestrial and Aerial Spectrum Sharing with Massive MIMO Systems | Achiel Colpaert et.al. | 2503.05279 | null |
2025-03-07 | Separability Membrane: 3D Active Contour for Point Cloud Surface Reconstruction | Gulpi Qorik Oktagalu Pratamasunu et.al. | 2503.05217 | null |
2025-03-07 | STGA: Selective-Training Gaussian Head Avatars | Hanzhi Guo et.al. | 2503.05196 | null |
2025-03-07 | MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions | Qingyuan Zhou et.al. | 2503.05182 | null |
2025-03-07 | SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting | Linqi Yang et.al. | 2503.05174 | null |
2025-03-10 | SeeLe: A Unified Acceleration Framework for Real-Time Gaussian Splatting | Xiaotong Huang et.al. | 2503.05168 | null |
2025-03-07 | EvolvingGS: High-Fidelity Streamable Volumetric Video via Evolving 3D Gaussian Representation | Chao Zhang et.al. | 2503.05162 | null |
2025-03-07 | GaussianCAD: Robust Self-Supervised CAD Reconstruction from Three Orthographic Views Using 3D Gaussian Splatting | Zheng Zhou et.al. | 2503.05161 | null |
2025-03-10 | GSplatVNM: Point-of-View Synthesis for Visual Navigation Models Using Gaussian Splatting | Kohei Honda et.al. | 2503.05152 | null |
2025-03-07 | HexPlane Representation for 3D Semantic Scene Understanding | Zeren Chen et.al. | 2503.05127 | null |
2025-03-07 | Fake It To Make It: Virtual Multiviews to Enhance Monocular Indoor Semantic Scene Completion | Anith Selvakumar et.al. | 2503.05086 | null |
2025-03-07 | Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs | Yingji Zhong et.al. | 2503.05082 | null |
2025-03-07 | Object Packing and Scheduling for Sequential 3D Printing: a Linear Arithmetic Model and a CEGAR-inspired Optimal Solver | Pavel Surynek et.al. | 2503.05071 | null |
2025-03-07 | On the continuous properties for the 3D incompressible rotating Euler equations | Jinlu Li et.al. | 2503.05069 | null |
2025-03-07 | Perceiving, Reasoning, Adapting: A Dual-Layer Framework for VLM-Guided Precision Robotic Manipulation | Qingxuan Jia et.al. | 2503.05064 | null |
2025-03-06 | ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids | Hannes Stark et.al. | 2503.05025 | link |
2025-03-06 | Magnetic Phase Transitions and Mixed Spin in Double Perovskite $Sr_{2}FeMoO_{6}$ | Said Khaireddine et.al. | 2503.05002 | null |
2025-03-06 | Probing circular polarization and magnetic field structure in AGN | Joana A Kramer et.al. | 2503.04970 | link |
2025-03-11 | Prediction of Frozen Region Growth in Kidney Cryoablation Intervention Using a 3D Flow-Matching Model | Siyeop Yoon et.al. | 2503.04966 | null |
2025-03-06 | Spectral Informed Mamba for Robust Point Cloud Processing | Ali Bahri et.al. | 2503.04953 | null |
2025-03-06 | Computation of generalised magnetic coordinates asymptotically close to the separatrix | Stuart Benjamin et.al. | 2503.04934 | null |
2025-03-06 | Metadata-free Georegistration of Ground and Airborne Imagery | Adam Bredvik et.al. | 2503.04927 | null |
2025-03-06 | FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement | Ian Huang et.al. | 2503.04919 | null |
2025-03-06 | oMEGACat. VI. Analysis of the overall kinematics of Omega Centauri in 3D: velocity dispersion, kinematic distance, anisotropy, and energy equipartition | Maximilian Häberle et.al. | 2503.04903 | null |
2025-03-06 | The Three-mm Ultimate Mopra Milky Way Survey. III. Data Release 6, An Atlas of Physical Conditions, Global Mass Conversion Laws, and 3D Physical Architecture of the Molecular ISM in the Fourth Quadrant | Peter J. Barnes et.al. | 2503.04887 | null |
2025-03-06 | The HST-Hyperion Survey: Grism Observations of a $z\sim2.5$ Proto-Supercluster | Ben Forrest et.al. | 2503.04884 | null |
2025-03-06 | Adapt3R: Adaptive 3D Scene Representation for Domain Transfer in Imitation Learning | Albert Wilcox et.al. | 2503.04877 | null |
2025-03-06 | Towards a Study of Low Energy Antiproton Annihilations on Nuclei | Viktoria Kraxberger et.al. | 2503.04868 | null |
2025-03-06 | Manboformer: Learning Gaussian Representations via Spatial-temporal Attention Mechanism | Ziyue Zhao et.al. | 2503.04863 | null |
2025-03-06 | End-to-End Human Pose Reconstruction from Wearable Sensors for 6G Extended Reality Systems | Nguyen Quang Hieu et.al. | 2503.04860 | link |
2025-03-06 | CAUSAL3D: A Comprehensive Benchmark for Causal Learning from Visual Data | Disheng Liu et.al. | 2503.04852 | null |
2025-03-05 | ZAugNet for Z-Slice Augmentation in Bio-Imaging | Alessandro Pasqui et.al. | 2503.04843 | link |
2025-03-05 | Distilling Dataset into Neural Field | Donghyeok Shin et.al. | 2503.04835 | link |
2025-03-05 | StickMotion: Generating 3D Human Motions by Drawing a Stickman | Tao Wang et.al. | 2503.04829 | null |
2025-03-05 | Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching | Haiyue Zu et.al. | 2503.04826 | null |
2025-03-04 | Invisible Strings: Revealing Latent Dancer-to-Dancer Interactions with Graph Neural Networks | Luis Vitor Zerkowski et.al. | 2503.04816 | null |
2025-03-06 | FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video | Yue Gao et.al. | 2503.04720 | null |
2025-03-06 | Implicit Neural Representation for Video and Image Super-Resolution | Mary Aiyetigbo et.al. | 2503.04665 | null |
2025-03-06 | Simulating the Real World: A Unified Survey of Multimodal Generative Models | Yuqi Hu et.al. | 2503.04641 | link |
2025-03-06 | Meshless Super-Resolution of Scattered Data via constrained RBFs and KNN-Driven Densification | Iacopo Tirelli et.al. | 2503.04630 | null |
2025-03-08 | The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation | Aoxiong Yin et.al. | 2503.04606 | link |
2025-03-06 | ExoNav II: Design of a Robotic Tool with Follow-the-Leader Motion Capability for Lateral and Ventral Spinal Cord Stimulation (SCS) | Behnam Moradkhani et.al. | 2503.04603 | null |
2025-03-06 | IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement | Zhihao Shi et.al. | 2503.04501 | null |
2025-03-06 | Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training | Adrian Chang et.al. | 2503.04496 | null |
2025-03-08 | EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images | Rohit Menon et.al. | 2503.04441 | null |
2025-03-06 | PointsToWood: A deep learning framework for complete canopy leaf-wood segmentation of TLS data across diverse European forests | Harry J. F. Owen et.al. | 2503.04420 | null |
2025-03-06 | From Idea to CAD: A Language Model-Driven Multi-Agent System for Collaborative Design | Felix Ocker et.al. | 2503.04417 | null |
2025-03-07 | Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling | Yan Li et.al. | 2503.04398 | null |
2025-03-06 | Real-Time 3D Magnetic Field Camera for a Spherical Volume | Fynn Foerger et.al. | 2503.04391 | null |
2025-03-06 | A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery | Yiheng Zhu et.al. | 2503.04362 | null |
2025-03-06 | A Modular Pipeline for 3D Object Tracking Using RGB Cameras | Lars Bredereke et.al. | 2503.04322 | link |
2025-03-06 | S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting | Yecong Wan et.al. | 2503.04314 | null |
2025-03-06 | New developments in 3D-trench electrode sensors | Jixing Ye et.al. | 2503.04272 | null |
2025-03-06 | How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects | Wonkwang Lee et.al. | 2503.04257 | null |
2025-03-06 | Numerical Study On Temperature Variations Of Superheated Steam Flowing Through A Regulation Valve | Zhe-hui Ma et.al. | 2503.04209 | null |
2025-03-06 | Learning 3D Medical Image Models From Brain Functional Connectivity Network Supervision For Mental Disorder Diagnosis | Xingcan Hu et.al. | 2503.04205 | null |
2025-03-06 | CA-W3D: Leveraging Context-Aware Knowledge for Weakly Supervised Monocular 3D Detection | Chupeng Liu et.al. | 2503.04154 | null |
2025-03-07 | Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration | Qianliang Wu et.al. | 2503.04127 | null |
2025-03-06 | Instrument-Splatting: Controllable Photorealistic Reconstruction of Surgical Instruments Using Gaussian Splatting | Shuojue Yang et.al. | 2503.04082 | null |
2025-03-06 | Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering | Idris O. Sunmola et.al. | 2503.04079 | null |
2025-03-06 | H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision | Yunxiao Shi et.al. | 2503.04059 | null |
2025-03-06 | Neural Network Surrogate Model for Junction Temperature and Hotspot Position in $3$ D Multi-Layer High Bandwidth Memory (HBM) Chiplets under Varying Thermal Conditions | Chengxin Zhang et.al. | 2503.04049 | null |
2025-03-06 | Autonomous Robotic Bone Micro-Milling System with Automatic Calibration and 3D Surface Fitting | Enduo Zhao et.al. | 2503.04038 | null |
2025-03-06 | Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details | Yifei Gao et.al. | 2503.04037 | null |
2025-03-06 | GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding | Xihan Wang et.al. | 2503.04034 | null |
2025-03-06 | Self-Supervised Large Scale Point Cloud Completion for Archaeological Site Restoration | Aocheng Li et.al. | 2503.04030 | null |
2025-03-06 | Bounds on dissipation in three-dimensional planar shear flows: reduction to two-dimensional problems | Farid Rajkotia-Zaheer et.al. | 2503.04005 | null |
2025-03-06 | Uniform Boundedness of Homogeneous Incompressible Flows in $\mathbb{R}^3$ | Ulisse Iotti et.al. | 2503.03991 | null |
2025-03-06 | Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows | Xiangxin Zhou et.al. | 2503.03989 | null |
2025-03-06 | GRaD-Nav: Efficiently Learning Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics | Qianzhong Chen et.al. | 2503.03984 | null |
2025-03-05 | All-atom Diffusion Transformers: Unified generative modelling of molecules and materials | Chaitanya K. Joshi et.al. | 2503.03965 | link |
2025-03-05 | COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation | Aurelio Noca et.al. | 2503.03947 | null |
2025-03-05 | Neural Descriptors: Self-Supervised Learning of Robust Local Surface Descriptors Using Polynomial Patches | Gal Yona et.al. | 2503.03907 | link |
2025-03-05 | Lagrangian flow statistics in experimental homogeneous isotropic turbulence | Cheng Wang et.al. | 2503.03891 | null |
2025-03-05 | LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation | Qian Feng et.al. | 2503.03890 | null |
2025-03-05 | Silica-coated admixtures of bismuth and gadolinium oxides for 3D printed concrete applications: Rheology, hydration, strength, microstructure, and radiation shielding perspective | Pawel Sikora et.al. | 2503.03864 | null |
2025-03-07 | Leaking Outside the Box: Kinetic Turbulence with Cosmic-Ray Escape | Evgeny A. Gorbunov et.al. | 2503.03820 | null |
2025-03-05 | A Time-Resolved High-Resolution Spectroscopic Analysis of Ionized Calcium and Dynamical Processes in the Ultra-Hot Jupiter HAT-P-70 b | Adam B. Langeveld et.al. | 2503.03814 | null |
2025-03-05 | DDCSR: A Novel End-to-End Deep Learning Framework for Cortical Surface Reconstruction from Diffusion MRI | Chengjin Li et.al. | 2503.03790 | link |
2025-03-07 | Generating Novel Brain Morphology by Deforming Learned Templates | Alan Q. Wang et.al. | 2503.03778 | link |
2025-03-05 | GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control | Xuanchi Ren et.al. | 2503.03751 | link |
2025-03-05 | Active 6D Pose Estimation for Textureless Objects using Multi-View RGB Frames | Jun Yang et.al. | 2503.03726 | null |
2025-03-05 | The shear Alfvén continuum of quasisymmetric stellarators. Part 1. Perturbation theory | Elizabeth J. Paul et.al. | 2503.03711 | null |
2025-03-08 | Rethinking Video Tokenization: A Conditioned Diffusion-based Approach | Nianzu Yang et.al. | 2503.03708 | link |
2025-03-05 | DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Zhao Yang et.al. | 2503.03689 | link |
2025-03-05 | A model for boundary-driven tissue morphogenesis | Daniel S. Alber et.al. | 2503.03688 | null |
2025-03-05 | A Generative Approach to High Fidelity 3D Reconstruction from Text Data | Venkat Kumar R et.al. | 2503.03664 | null |
2025-03-05 | Simulation-Based Performance Evaluation of 3D Object Detection Methods with Deep Learning for a LiDAR Point Cloud Dataset in a SOTIF-related Use Case | Milin Patel et.al. | 2503.03548 | link |
2025-03-05 | A self-supervised cyclic neural-analytic approach for novel view synthesis and 3D reconstruction | Dragos Costea et.al. | 2503.03543 | null |
2025-03-05 | Coordinated Trajectories for Non-stop Flying Carriers Holding a Cable-Suspended Load | Chiara Gabellieri et.al. | 2503.03481 | null |
2025-03-05 | Enhancing Spoken Discourse Modeling in Language Models Using Gestural Cues | Varsha Suresh et.al. | 2503.03474 | null |
2025-03-06 | DTU-Net: A Multi-Scale Dilated Transformer Network for Nonlinear Hyperspectral Unmixing | ChenTong Wang et.al. | 2503.03465 | null |
2025-03-05 | Kondo-like behavior in a mixed valent oxypnictide $\mathrm{La_{3}Cu_{4}P_{4}O_{2}}$ | Szymon Królak et.al. | 2503.03447 | null |
2025-03-05 | Spontaneous rotational symmetry breaking induced by electronic instability in the normal state of La_{1-x} Sr_{x} NiO_{2} | Qiang Zhao et.al. | 2503.03419 | null |
2025-03-05 | REACT: Real-time Efficient Attribute Clustering and Transfer for Updatable 3D Scene Graph | Phuoc Nguyen et.al. | 2503.03412 | link |
2025-03-05 | Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments | Jie Deng et.al. | 2503.03373 | link |
2025-03-05 | Top-K Maximum Intensity Projection Priors for 3D Liver Vessel Segmentation | Xiaotong Zhang et.al. | 2503.03367 | null |
2025-03-05 | Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters | Julia Hindel et.al. | 2503.03299 | null |
2025-03-05 | Interactive Segmentation and Report Generation for CT Images | Yannian Gu et.al. | 2503.03294 | null |
2025-03-05 | Well-posedness of the nonhomogeneous incompressible Navier-Stokes/Allen-Cahn system | Yinghua Li et.al. | 2503.03279 | null |
2025-03-05 | BANet: Bilateral Aggregation Network for Mobile Stereo Matching | Gangwei Xu et.al. | 2503.03259 | null |
2025-03-05 | SCORE: Saturated Consensus Relocalization in Semantic Line Maps | Haodong Jiang et.al. | 2503.03254 | link |
2025-03-06 | Mocap-2-to-3: Lifting 2D Diffusion-Based Pretrained Models for 3D Motion Capture | Zhumei Wang et.al. | 2503.03222 | null |
2025-03-06 | DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering | Jingzhou Luo et.al. | 2503.03190 | link |
2025-03-05 | Techniques in high-speed imaging and X-ray micro-computed tomography for characterisation of iron ore fragmentation | Aleese Barron et.al. | 2503.03163 | null |
2025-03-05 | Determinantal Learning for Subset Selection in Wireless Networks | Xiangliu Tu et.al. | 2503.03151 | null |
2025-03-05 | Implicit U-KAN2.0: Dynamic, Efficient and Interpretable Medical Image Segmentation | Chun-Wun Cheng et.al. | 2503.03141 | null |
2025-03-05 | Dynamic Neural Surfaces for Elastic 4D Shape Representation and Analysis | Awais Nizamani et.al. | 2503.03132 | null |
2025-03-05 | NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics | Kun Yang et.al. | 2503.03115 | null |
2025-03-05 | Selective Tweezing and Immobilization of Colloids for Dexterous Manipulation of Biological Materials | Krishangi Krishna et.al. | 2503.03102 | null |
2025-03-05 | AirExo-2: Scaling up Generalizable Robotic Imitation Learning with Low-Cost Exoskeletons | Hongjie Fang et.al. | 2503.03081 | null |
2025-03-05 | BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving | Katharina Winter et.al. | 2503.03074 | link |
2025-03-05 | Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings | Xusheng Du et.al. | 2503.03068 | null |
2025-03-04 | Uniqueness of gauge covariant renormalisation of stochastic 3D Yang-Mills-Higgs | Ilya Chevyrev et.al. | 2503.03060 | null |
2025-03-04 | ArticuBot: Learning Universal Articulated Object Manipulation Policy via Large Scale Simulation | Yufei Wang et.al. | 2503.03045 | null |
2025-03-04 | Industrialisation of spectral/hp element method for incompressible, transitional flow around Formula 1 geometries | Parv Khurana et.al. | 2503.03035 | null |
2025-03-04 | Learning Precoding in Multi-user Multi-antenna Systems: Transformer or Graph Transformer? | Yuxuan Duan et.al. | 2503.02998 | null |
2025-03-04 | Configurational Information Measures, Phase Transitions, and an Upper Bound on Complexity | Damian R Sowinski et.al. | 2503.02980 | null |
2025-03-04 | Objestures: Bimanual Interactions with Everyday Objects and Mid-Air Gestures in Mixed Reality | Zhuoyue Lyu et.al. | 2503.02973 | null |
2025-03-04 | Finite-temperature quantum topological order in three dimensions | Shu-Tong Zhou et.al. | 2503.02928 | null |
2025-03-04 | Straight-Line Diffusion Model for Efficient 3D Molecular Generation | Yuyan Ni et.al. | 2503.02918 | link |
2025-03-04 | Monocular Person Localization under Camera Ego-motion | Yu Zhan et.al. | 2503.02916 | null |
2025-03-04 | Computer-aided shape features extraction and regression models for predicting the ascending aortic aneurysm growth rate | Leonardo Geronzi et.al. | 2503.02915 | null |
2025-03-04 | Towards Robust Multi-UAV Collaboration: MARL with Noise-Resilient Communication and Attention Mechanisms | Zilin Zhao et.al. | 2503.02913 | link |
2025-03-04 | Evaluation of Architectural Synthesis Using Generative AI | Jingfei Huang et.al. | 2503.02861 | null |
2025-03-04 | Comprehensive Analysis of Relative Pressure Estimation Methods Utilizing 4D Flow MRI | Brandon Hardy et.al. | 2503.02847 | null |
2025-03-05 | ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points | Qirui Huang et.al. | 2503.02745 | null |
2025-03-04 | Vacua, Symmetries, and Higgsing of Chern-Simons Matter Theories | Fabio Marino et.al. | 2503.02744 | null |
2025-03-04 | Flat band driven itinerant magnetism in the Co-pnictides (La,Ca)Co $_2$(As,P)$_2$ | D. Subires et.al. | 2503.02728 | null |
2025-03-04 | Multi-Strategy Enhanced COA for Path Planning in Autonomous Navigation | Yifei Wang et.al. | 2503.02700 | null |
2025-03-04 | Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds? | Miao Zhang et.al. | 2503.02687 | null |
2025-03-05 | A dataset-free approach for self-supervised learning of 3D reflectional symmetries | Isaac Aguirre et.al. | 2503.02660 | null |
2025-03-04 | A Deep, High-Angular Resolution 3D Dust Map of the Southern Galactic Plane | Catherine Zucker et.al. | 2503.02657 | null |
2025-03-04 | On the hydrostatic approximation of 3D Oldroyd-B model | Marius Paicu et.al. | 2503.02638 | null |
2025-03-04 | ARC-Flow : Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields | Adam Hartshorne et.al. | 2503.02606 | null |
2025-03-04 | StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts | Zhaoxing Gan et.al. | 2503.02595 | null |
2025-03-05 | CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework | Yanlong Xu et.al. | 2503.02593 | link |
2025-03-04 | Tracking-Aware Deformation Field Estimation for Non-rigid 3D Reconstruction in Robotic Surgeries | Zeqing Wang et.al. | 2503.02558 | null |
2025-03-04 | PVTree: Realistic and Controllable Palm Vein Generation for Recognition Tasks | Sheng Shang et.al. | 2503.02547 | null |
2025-03-04 | A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection | Junyi Wang et.al. | 2503.02481 | null |
2025-03-04 | 2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting | Qipeng Yan et.al. | 2503.02452 | null |
2025-03-04 | InfoGNN: End-to-end deep learning on mesh via graph neural networks | Ling Gao et.al. | 2503.02414 | null |
2025-03-04 | Building 3D In-Context Learning Universal Model in Neuroimaging | Jiesi Hu et.al. | 2503.02410 | link |
2025-03-04 | A comparison of visual representations for real-world reinforcement learning in the context of vacuum gripping | Nico Sutter et.al. | 2503.02405 | link |
2025-03-04 | mmDEAR: mmWave Point Cloud Density Enhancement for Accurate Human Body Reconstruction | Jiarui Yang et.al. | 2503.02375 | null |
2025-03-04 | Label-Efficient LiDAR Panoptic Segmentation | Ahmet Selim Çanakçı et.al. | 2503.02372 | null |
2025-03-04 | CQ CNN: A Hybrid Classical Quantum Convolutional Neural Network for Alzheimer’s Disease Detection Using Diffusion Generated and U Net Segmented 3D MRI | Mominul Islam et.al. | 2503.02345 | link |
2025-03-04 | COMMA: Coordinate-aware Modulated Mamba Network for 3D Dispersed Vessel Segmentation | Gen Shi et.al. | 2503.02332 | null |
2025-03-04 | Volume Tells: Dual Cycle-Consistent Diffusion for 3D Fluorescence Microscopy De-noising and Super-Resolution | Zelin Li et.al. | 2503.02261 | null |
2025-03-04 | DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting | Haoyuan Li et.al. | 2503.02223 | link |
2025-03-04 | Low Complexity Frequency Domain Nonlinear Self-Interference Cancellation for Flexible Duplex | Yonghwi Kim et.al. | 2503.02203 | null |
2025-03-04 | MonoLite3D: Lightweight 3D Object Properties Estimation | Ahmed El-Dawy et.al. | 2503.02201 | null |
2025-03-04 | HyperGCT: A Dynamic Hyper-GNN-Learned Geometric Constraint for 3D Registration | Xiyu Zhang et.al. | 2503.02195 | null |
2025-03-11 | X2CT-CLIP: Enable Multi-Abnormality Detection in Computed Tomography from Chest Radiography via Tri-Modal Contrastive Learning | Jianzhong You et.al. | 2503.02162 | null |
2025-03-03 | Convective Overstability in Radially Global Protoplanetary Disks. II. Impact on planetesimal formation | Marius Lehmann et.al. | 2503.02084 | null |
2025-03-03 | RiboGen: RNA Sequence and Structure Co-Generation with Equivariant MultiFlow | Dana Rubin et.al. | 2503.02058 | null |
2025-03-03 | Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM | Marco Giberna et.al. | 2503.02050 | null |
2025-03-03 | Optimizing Robot Programming: Mixed Reality Gripper Control | Maximilian Rettinger et.al. | 2503.02042 | null |
2025-03-03 | Abn-BLIP: Abnormality-aligned Bootstrapping Language-Image Pre-training for Pulmonary Embolism Diagnosis and Report Generation from CTPA | Zhusi Zhong et.al. | 2503.02034 | null |
2025-03-10 | Reducing Frequency Bias of Fourier Neural Operators in 3D Seismic Wavefield Simulations Through Multi-Stage Training | Qingkai Kong et.al. | 2503.02023 | null |
2025-03-03 | Morpheus: Text-Driven 3D Gaussian Splat Shape and Color Stylization | Jamie Wynn et.al. | 2503.02009 | null |
2025-03-03 | TactStyle: Generating Tactile Textures with Generative AI for Digital Fabrication | Faraz Faruqi et.al. | 2503.02007 | null |
2025-03-03 | The stochastic nature of migration of disc instability protoplanets in three-dimensional hydrodynamical and MHD simulations of fragmenting discs | Noah Kubli et.al. | 2503.01973 | null |
2025-03-03 | Projection-angle effects when “observing” a turbulent magnetized collapsing molecular cloud. II. Magnetic field | A. Tritsis et.al. | 2503.01971 | null |
2025-03-03 | Projection-angle effects when “observing” a turbulent magnetized collapsing molecular cloud. I. Chemistry and line transfer | A. Tritsis et.al. | 2503.01963 | null |
2025-02-28 | FASTer: Focal Token Acquiring-and-Scaling Transformer for Long-term 3D Object Detection | Chenxu Dang et.al. | 2503.01899 | link |
2025-03-03 | Primus: Enforcing Attention Usage for 3D Medical Image Segmentation | Tassilo Wald et.al. | 2503.01835 | null |
2025-03-03 | Hilbert’s sixth problem: derivation of fluid equations via Boltzmann’s kinetic theory | Yu Deng et.al. | 2503.01800 | null |
2025-03-03 | On the behavior of the Generalized Alignment Index (GALI) method for dissipative systems | Henok Tenaw Moges et.al. | 2503.01784 | null |
2025-03-03 | vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding | Ali Tourani et.al. | 2503.01783 | null |
2025-03-03 | Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models | Jay Zhangjie Wu et.al. | 2503.01774 | null |
2025-03-03 | A COMSOL framework for predicting hydrogen embrittlement – Part II: phase field fracture | A. Díaz et.al. | 2503.01765 | null |
2025-03-07 | Gone with the wind: the outward migration of eccentric giant planets in windy disks | Gaylor Wafflard-Fernandez et.al. | 2503.01745 | null |
2025-03-03 | A mesh-free hybrid Chebyshev-Tucker tensor format with applications to multi-particle modelling | Peter Benner et.al. | 2503.01696 | null |
2025-03-03 | MUSt3R: Multi-view Network for Stereo 3D Reconstruction | Yohann Cabon et.al. | 2503.01661 | null |
2025-03-03 | The Interplay between Dust Dynamics and Turbulence Induced by the Vertical Shear Instability | Pinghui Huang et.al. | 2503.01656 | null |
2025-03-03 | OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding | Dianyi Yang et.al. | 2503.01646 | null |
2025-03-03 | M-SCAN: A Multistage Framework for Lumbar Spinal Canal Stenosis Grading Using Multi-View Cross Attention | Arnesh Batra et.al. | 2503.01634 | link |
2025-03-03 | Vid2Avatar-Pro: Authentic Avatar from Videos in the Wild via Universal Prior | Chen Guo et.al. | 2503.01610 | null |
2025-03-03 | First-principles Hubbard parameters with automated and reproducible workflows | Lorenzo Bastonero et.al. | 2503.01590 | link |
2025-03-03 | Soft Everting Prosthetic Hand and Comparison with Existing Body-Powered Terminal Devices | Gayoung Park et.al. | 2503.01585 | null |
2025-03-05 | Category-level Meta-learned NeRF Priors for Efficient Object Mapping | Saad Ejaz et.al. | 2503.01582 | null |
2025-03-03 | VF-Plan: Bridging the Art Gallery Problem and Static LiDAR Scanning with Visibility Field Optimization | Biao Xionga et.al. | 2503.01562 | null |
2025-03-03 | AI-Driven Relocation Tracking in Dynamic Kitchen Environments | Arash Nasr Esfahani et.al. | 2503.01547 | link |
2025-03-03 | Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning | Xintao Chao et.al. | 2503.01543 | null |
2025-03-03 | Origami-Inspired Soft Gripper with Tunable Constant Force Output | Zhenwei Ni et.al. | 2503.01481 | null |
2025-03-03 | Modeling the cool gas clumps in the circumgalactic medium | Hang Yang et.al. | 2503.01479 | null |
2025-03-03 | The shooting methods to solve 3D nonlinear strings assemblies | Florian Surmont et.al. | 2503.01473 | null |
2025-03-03 | Generative Human Geometry Distribution | Xiangjun Tang et.al. | 2503.01448 | null |
2025-03-03 | MeshPad: Interactive Sketch Conditioned Artistic-designed Mesh Generation and Editing | Haoxuan Li et.al. | 2503.01425 | null |
2025-03-03 | MIR: a general-relativistic resistive-magneto-hydrodynamic code to study the effect of resistivity in Neutron Star dynamics | Franceschetti Kevin et.al. | 2503.01408 | null |
2025-03-03 | Pushing the boundaries of Structure-Based Drug Design through Collaboration with Large Language Models | Bowen Gao et.al. | 2503.01376 | null |
2025-03-03 | Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation | Jiantao Lin et.al. | 2503.01370 | link |
2025-03-03 | Performance Optimization of 3D Stencil Computation on ARM Scalable Vector Extension | Hongguang Chen et.al. | 2503.01348 | null |
2025-03-03 | Core-collapse supernovae | Anders Jerkstrand et.al. | 2503.01321 | null |
2025-03-03 | Hierarchically Tunable 6DMA for Wireless Communication and Sensing: Modeling and Performance Optimization | Haocheng Hua et.al. | 2503.01317 | null |
2025-03-03 | OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging | Yijie Tang et.al. | 2503.01309 | null |
2025-03-03 | Comparative Analysis of Ray Tracing and Rayleigh Fading Models for Distributed MIMO Systems in Industrial Environments | Aymen Jaziri et.al. | 2503.01300 | null |
2025-03-03 | Investigating dusty Red Supergiant outflows in Westerlund 1 with 3D Hydrodynamic simulations | C. J. K. Larkin et.al. | 2503.01272 | null |
2025-03-03 | Effects of the three-dimensional interplanar coupling on the centrosymmetric skyrmion crystal formation in the frustrated stacked-triangular Heisenberg model | R. Osamura et.al. | 2503.01258 | null |
2025-03-03 | SVDC: Consistent Direct Time-of-Flight Video Depth Completion with Frequency Selective Fusion | Xuan Zhu et.al. | 2503.01257 | link |
2025-03-03 | Machine Learning for Airborne Electromagnetic Data Inversion: a Bootstrapped Approach | Ophir Greif et.al. | 2503.01221 | null |
2025-03-04 | Tera-MIND: Tera-scale mouse brain simulation via spatial mRNA-guided diffusion | Jiqing Wu et.al. | 2503.01220 | null |
2025-03-03 | LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training | Kaimin Liao et.al. | 2503.01199 | link |
2025-03-03 | False vacuum decay in triamond lattice gauge theory | Ali H. Z. Kavaki et.al. | 2503.01119 | null |
2025-03-03 | FGS-SLAM: Fourier-based Gaussian Splatting for Real-time SLAM with Sparse and Dense Map Fusion | Yansong Xu et.al. | 2503.01109 | null |
2025-03-03 | VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors | Juil Koo et.al. | 2503.01107 | null |
2025-03-04 | Fence Theorem: Towards Dual-Objective Semantic-Structure Isolation in Preprocessing Phase for 3D Anomaly Detection | Hanzhe Liang et.al. | 2503.01100 | null |
2025-03-02 | Direct Summation of the Madelung Constant using Axial Multipoles | Joven V. Calara et.al. | 2503.00977 | null |
2025-03-02 | Molecule Generation for Target Protein Binding with Hierarchical Consistency Diffusion Model | Guanlue Li et.al. | 2503.00975 | link |
2025-03-02 | Revisiting CAD Model Generation by Learning Raster Sketch | Pu Li et.al. | 2503.00928 | null |
2025-03-02 | Inefficiency of the orbit Hall effect on spin torque in transition metal/ferromagnet bilayers | Yizhuo Song et.al. | 2503.00910 | null |
2025-03-02 | DreamPrinting: Volumetric Printing Primitives for High-Fidelity 3D Printing | Youjia Wang et.al. | 2503.00887 | null |
2025-03-02 | Evolving High-Quality Rendering and Reconstruction in a Unified Framework with Contribution-Adaptive Regularization | You Shen et.al. | 2503.00881 | null |
2025-03-02 | Vid2Fluid: 3D Dynamic Fluid Assets from Single-View Videos with Generative Gaussian Splatting | Zhiwei Zhao et.al. | 2503.00868 | null |
2025-03-02 | MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain | Rui Yi Yong et.al. | 2503.00853 | null |
2025-03-02 | PSRGS:Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery | BoCheng Li et.al. | 2503.00848 | null |
2025-03-02 | Foundation Models Secretly Understand Neural Network Weights: Enhancing Hypernetwork Architectures with Foundation Models | Jeffrey Gu et.al. | 2503.00838 | null |
2025-03-02 | Random Walks in Self-supervised Learning for Triangular Meshes | Gal Yefet et.al. | 2503.00816 | null |
2025-03-02 | STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds | Zikuan Li et.al. | 2503.00801 | link |
2025-03-04 | An Efficient 3D Convolutional Neural Network with Channel-wise, Spatial-grouped, and Temporal Convolutions | Zhe Wang et.al. | 2503.00796 | null |
2025-03-02 | Development of a Five-Fingerd Biomimetic Soft Robotic Hand by 3D Printing the Skin and Skeleton as One Unit | Kazuhiro Miyama et.al. | 2503.00789 | null |
2025-03-02 | LLMs are everywhere: Ubiquitous Utilization of AI Models through Air Computing | Baris Yamansavascilar et.al. | 2503.00767 | null |
2025-03-02 | DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting | Liao Shen et.al. | 2503.00746 | null |
2025-03-06 | LesionDiffusion: Towards Text-controlled General Lesion Synthesis | Henrui Tian et.al. | 2503.00741 | link |
2025-03-02 | Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration | Jinjiang You et.al. | 2503.00737 | link |
2025-03-02 | LightEndoStereo: A Real-time Lightweight Stereo Matching Method for Endoscopy Images | Yang Ding et.al. | 2503.00731 | link |
2025-03-02 | Enhancing Monocular 3D Scene Completion with Diffusion Model | Changlin Song et.al. | 2503.00726 | link |
2025-03-02 | How the CME on 21 April 2023 Triggered the First Severe Geomagnetic Storm of Solar Cycle 25 | Evangelos Paouris et.al. | 2503.00705 | null |
2025-03-02 | Linking Critical Heights in Solar Active Regions with 3D CME Speeds: Insights from Automated and Manual PIL Detection Methods | Harshita Gandhi et.al. | 2503.00683 | null |
2025-03-06 | Dur360BEV: A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving | Wenke E et.al. | 2503.00675 | link |
2025-03-01 | GenVDM: Generating Vector Displacement Maps From a Single Image | Yuezhi Yang et.al. | 2503.00605 | null |
2025-03-01 | Cross-Attention Fusion of MRI and Jacobian Maps for Alzheimer’s Disease Diagnosis | Shijia Zhang et.al. | 2503.00586 | null |
2025-03-01 | GaussianSeal: Rooting Adaptive Watermarks for 3D Gaussian Generation Model | Runyi Li et.al. | 2503.00531 | null |
2025-03-01 | Periodic Materials Generation using Text-Guided Joint Diffusion Model | Kishalay Das et.al. | 2503.00522 | link |
2025-03-01 | Explainable LiDAR 3D Point Cloud Segmentation and Clustering for Detecting Airplane-Generated Wind Turbulence | Zhan Qu et.al. | 2503.00518 | null |
2025-03-01 | Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning | Hanxun Yu et.al. | 2503.00513 | link |
2025-03-01 | Flying on Point Clouds with Reinforcement Learning | Guangtong Xu et.al. | 2503.00496 | null |
2025-03-01 | Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture | Xuanchen Li et.al. | 2503.00495 | null |
2025-03-01 | Density Matrix Embedding Theory-Based Multi-Configurational Quantum Chemistry Approach to Lanthanide Single-Ion Magnets | Yuhang Ai et.al. | 2503.00487 | null |
2025-03-01 | Condensation energy of superconducting BEC of non-interacting Cooper pairs in multilayers | I. Chávez et.al. | 2503.00473 | null |
2025-03-01 | Bring Your Own Grasp Generator: Leveraging Robot Grasp Generation for Prosthetic Grasping | Giuseppe Stracquadanio et.al. | 2503.00466 | null |
2025-03-01 | Certifying Lyapunov Stability of Black-Box Nonlinear Systems via Counterexample Guided Synthesis (Extended Version) | Chiao Hsieh et.al. | 2503.00431 | link |
2025-03-01 | DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing | Jingyi Yang et.al. | 2503.00429 | null |
2025-03-01 | BGM2Pose: Active 3D Human Pose Estimation with Non-Stationary Sounds | Yuto Shibata et.al. | 2503.00389 | null |
2025-03-04 | EigenActor: Variant Body-Object Interaction Generation Evolved from Invariant Action Basis Reasoning | Xuehao Gao et.al. | 2503.00382 | null |
2025-03-07 | Jointly Understand Your Command and Intention:Reciprocal Co-Evolution between Scene-Aware 3D Human Motion Synthesis and Analysis | Xuehao Gao et.al. | 2503.00371 | null |
2025-03-07 | CAT-3DGS: A Context-Adaptive Triplane Approach to Rate-Distortion-Optimized 3DGS Compression | Yu-Ting Zhan et.al. | 2503.00357 | null |
2025-03-01 | Simulating Negative Hydrogen ion acceleration in LINAC-4 using Unity 3D | D. M. C. M. K. Dissanayake et.al. | 2503.00304 | null |
2025-03-01 | Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality | Milad Yazdani et.al. | 2503.00266 | link |
2025-03-01 | CRADMap: Applied Distributed Volumetric Mapping with 5G-Connected Multi-Robots and 4D Radar Sensing | Maaz Qureshi et.al. | 2503.00262 | link |
2025-03-01 | Seeing A 3D World in A Grain of Sand | Yufan Zhang et.al. | 2503.00260 | null |
2025-03-01 | Design of a quantum diamond microscope with efficient scanning confocal readout | Daniel G. Ang et.al. | 2503.00252 | null |
2025-02-28 | TDCOSMO XXI: Triaxiality and projection effects in time-delay cosmography | Xiang-Yu Huang et.al. | 2503.00235 | null |
2025-02-28 | A Framework for Analyzing the Scalability of Ion Trap Geometries | Le Minh Anh Nguyen et.al. | 2503.00218 | null |
2025-02-28 | Thermal Phase Curves in Hot Gas Giant Exoplanets Exhibit a Complex Dependence on Planetary Properties | Mark R Swain et.al. | 2503.00208 | null |
2025-02-28 | Unveiling sex dimorphism in the healthy cardiac anatomy: fundamental differences between male and female heart shapes | Beatrice Moscoloni et.al. | 2503.00197 | null |
2025-02-28 | Manifold Topological Deep Learning for Biomedical Data | Xiang Liu et.al. | 2503.00175 | null |
2025-02-28 | Criteria for ion acceleration in laboratory magnetized quasi-perpendicular collisionless shocks: when are 2D simulations enough? | Luca Orusa et.al. | 2503.00163 | null |
2025-02-28 | EXACT-CT: EXplainable Analysis for Crohn’s and Tuberculosis using CT | Shashwat Gupta et.al. | 2503.00159 | null |
2025-02-28 | Invariant Tokenization of Crystalline Materials for Language Model Enabled Generation | Keqiang Yan et.al. | 2503.00152 | null |
2025-02-28 | RecCrysFormer: Refined Protein Structural Prediction from 3D Patterson Maps via Recycling Training Runs | Tom Pan et.al. | 2503.00143 | link |
2025-02-28 | Filamentary Ejecta Network in Cassiopeia~A Reveals Fingerprints of the Supernova Explosion Mechanism | S. Orlando et.al. | 2503.00130 | null |
2025-02-28 | Graded Index Couplers for Next Generation Chip-to-Chip and Fiber-to-Chip Photonic Packaging | Drew Weninger et.al. | 2503.00121 | null |
2025-02-28 | Protein Structure Tokenization: Benchmarking and New Recipe | Xinyu Yuan et.al. | 2503.00089 | null |
2025-02-28 | ALICE Event Display – from the legacy ROOT-based visualization to the web-based application | Julian Wojciech Myrcha et.al. | 2503.00088 | null |
2025-02-27 | Forecasting Whole-Brain Neuronal Activity from Volumetric Video | Alexander Immer et.al. | 2503.00073 | null |
2025-02-27 | PI-HMR: Towards Robust In-bed Temporal Human Shape Reconstruction with Contact Pressure Sensing | Ziyu Wu et.al. | 2503.00068 | null |
2025-02-26 | Correspondence-Free Pose Estimation with Patterns: A Unified Approach for Multi-Dimensional Vision | Quan Quan et.al. | 2503.00051 | null |
2025-02-26 | Direct Numerical Simulations of Droplet Impact onto Heated Surfaces using the Program Free Surface 3D (FS3D) | Manish Kumar et.al. | 2503.00050 | null |
2025-02-28 | AutoComb: Automated Comb Sign Detector for 3D CTE Scans | Shashwat Gupta et.al. | 2502.21311 | null |
2025-03-08 | Back to the Future Cyclopean Stereo: a human perception approach combining deep and geometric constraints | Sherlon Almeida da Silva et.al. | 2502.21280 | null |
2025-02-28 | Anatomically-guided masked autoencoder pre-training for aneurysm detection | Alberto Mario Ceballos-Arroyo et.al. | 2502.21244 | null |
2025-03-04 | SYN-LUNGS: Towards Simulating Lung Nodules with Anatomy-Informed Digital Twins for AI Training | Fakrul Islam Tushar et.al. | 2502.21187 | null |
2025-02-28 | A Non-contrast Head CT Foundation Model for Comprehensive Neuro-Trauma Triage | Youngjin Yoo et.al. | 2502.21106 | null |
2025-03-03 | FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering | Jingqiu Zhou et.al. | 2502.21093 | null |
2025-02-28 | Some computational aspects of using Huygens-Fresnel-Kirchoff diffraction theory | Ilya A. Kudryavtsev et.al. | 2502.21082 | null |
2025-02-28 | Fast 3D point clouds retrieval for Large-scale 3D Place Recognition | Chahine-Nicolas Zede et.al. | 2502.21067 | null |
2025-02-28 | HoloMine: A Synthetic Dataset for Buried Landmines Recognition using Microwave Holographic Imaging | Emanuele Vivoli et.al. | 2502.21054 | null |
2025-02-28 | Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport | Jingru Fu et.al. | 2502.21049 | link |
2025-02-28 | Sixth-Sense: Self-Supervised Learning of Spatial Awareness of Humans from a Planar Lidar | Simone Arreghini et.al. | 2502.21029 | null |
2025-02-28 | LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body Imaging | Maximilian Rokuss et.al. | 2502.20985 | null |
2025-02-28 | Probing the interstellar medium toward GRB 221009A through X-ray dust scattering | B. Vaia et.al. | 2502.20940 | null |
2025-02-28 | MESC-3D:Mining Effective Semantic Cues for 3D Reconstruction from a Single Image | Shaoming Li et.al. | 2502.20861 | null |
2025-02-28 | Improved 3D Point-Line Mapping Regression for Camera Relocalization | Bach-Thuan Bui et.al. | 2502.20814 | link |
2025-02-28 | Towards Semantic 3D Hand-Object Interaction Generation via Functional Text Guidance | Yongqi Tian et.al. | 2502.20805 | null |
2025-02-28 | Unraveling the origin of Kondo-like behavior in the 3 $d$-electron heavy-fermion compound YFe${2}$Ge${2}$ | Bing Xu et.al. | 2502.20796 | null |
2025-02-28 | CADDreamer: CAD object Generation from Single-view Images | Yuan Li et.al. | 2502.20732 | null |
2025-02-28 | Refinement of the $L^{2}$ -decay estimate of solutions to nonlinear Schrödinger equations with attractive-dissipative nonlinearity | Naoyasu Kita et.al. | 2502.20713 | null |
2025-02-28 | EDM: Equirectangular Projection-Oriented Dense Kernelized Feature Matching | Dongki Jung et.al. | 2502.20685 | null |
2025-02-28 | EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering | John J. Han et.al. | 2502.20669 | null |
2025-02-28 | LV-DOT: LiDAR-visual dynamic obstacle detection and tracking for autonomous robot navigation | Zhefan Xu et.al. | 2502.20607 | link |
2025-03-07 | InstaFace: Identity-Preserving Facial Editing with Single Image Inference | MD Wahiduzzaman Khan et.al. | 2502.20577 | null |
2025-02-27 | Toward Fully Autonomous Flexible Chunk-Based Aerial Additive Manufacturing: Insights from Experimental Validation | Marios-Nektarios Stamatopoulos et.al. | 2502.20549 | null |
2025-02-27 | Best Foot Forward: Robust Foot Reconstruction in-the-wild | Kyle Fogarty et.al. | 2502.20511 | null |
2025-02-27 | Highly Variable Magnetic Pressure-Driven YSO Jets in the Polar Cavity from Toroidal Fields Generated by Inner Disk Accretion | Yisheng Tu et.al. | 2502.20495 | null |
2025-02-27 | The Characteristics of the Z-Boson in Chern-Simons Matter Theory | Amiya Mishra et.al. | 2502.20488 | null |
2025-02-26 | Accurate 3D Grapevine Structure Extraction from High-Resolution Point Clouds | Harry Dobbs et.al. | 2502.20417 | null |
2025-02-27 | LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding | Ang Cao et.al. | 2502.20389 | null |
2025-02-27 | InsTaG: Learning Personalized 3D Talking Head from Few-Second Video | Jiahe Li et.al. | 2502.20387 | link |
2025-02-28 | ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model | Xuangeng Chu et.al. | 2502.20323 | null |
2025-02-27 | Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds | Mohamed Abdelsamad et.al. | 2502.20316 | null |
2025-02-27 | M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging | Jinghao Feng et.al. | 2502.20301 | null |
2025-02-27 | Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels | Pierre Vuillecard et.al. | 2502.20249 | null |
2025-02-27 | Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars | Tobias Kirschstein et.al. | 2502.20220 | null |
2025-02-27 | Highly Entangled 2D Ground State: Tensor Network, Order Parameter and Correlation | Olai B. Mykland et.al. | 2502.20192 | null |
2025-02-27 | Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison | Jiageng Zhong et.al. | 2502.20154 | null |
2025-02-27 | MITracker: Multi-View Integration for Visual Object Tracking | Mengjie Xu et.al. | 2502.20111 | null |
2025-02-27 | UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler | Luigi Piccinelli et.al. | 2502.20110 | link |
2025-02-27 | Text2VDM: Text to Vector Displacement Maps for Expressive and Interactive 3D Sculpting | Hengyu Meng et.al. | 2502.20045 | null |
2025-03-04 | 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds | Hengshuo Chu et.al. | 2502.20041 | null |
2025-02-27 | A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization | Yejun Zhang et.al. | 2502.20036 | link |
2025-02-27 | Turbulence and large-scale structures in self-gravitating superfluids | Sanjay Shukla et.al. | 2502.20006 | null |
2025-02-27 | Identity-preserving Distillation Sampling by Fixed-Point Iterator | SeonHwa Kim et.al. | 2502.19930 | null |
2025-02-27 | GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors | An Li et.al. | 2502.19896 | null |
2025-02-27 | High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model | Mingtao Guo et.al. | 2502.19894 | link |
2025-02-27 | NeRFCom: Feature Transform Coding Meets Neural Radiance Field for Free-View 3D Scene Semantic Transmission | Weijie Yue et.al. | 2502.19873 | null |
2025-02-27 | No Parameters, No Problem: 3D Gaussian Splatting without Camera Intrinsics and Extrinsics | Dongbo Shi et.al. | 2502.19800 | null |
2025-02-27 | Open-Vocabulary Semantic Part Segmentation of 3D Human | Keito Suzuki et.al. | 2502.19782 | null |
2025-02-27 | Visualising Ventilation Changes following Endobronchial Valve Placement with X-ray Velocimetry Functional Lung Imaging | Ronan Smith et.al. | 2502.19780 | null |
2025-02-27 | QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects | Elkhan Ismayilzada et.al. | 2502.19769 | null |
2025-02-27 | Deep Learning-Based Approach for Automatic 2D and 3D MRI Segmentation of Gliomas | Kiranmayee Janardhan et.al. | 2502.19760 | null |
2025-02-27 | LUCAS: Layered Universal Codec Avatars | Di Liu et.al. | 2502.19739 | null |
2025-02-27 | Extracting intrinsic superconducting properties in intercalated layered superconductors using an extended 2D Tinkham model | Yue Liu et.al. | 2502.19733 | null |
2025-02-27 | Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System | Shunkun Liang et.al. | 2502.19708 | null |
2025-02-28 | You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving | Guangfeng Jiang et.al. | 2502.19698 | null |
2025-02-27 | BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance | Xin Ye et.al. | 2502.19694 | null |
2025-02-27 | 3D Trajectory Reconstruction of Moving Points Based on a Monocular Camera | Huayu Huang et.al. | 2502.19689 | null |
2025-03-06 | Noise-Injected Spiking Graph Convolution for Energy-Efficient 3D Point Cloud Denoising | Zikuan Li et.al. | 2502.19660 | link |
2025-02-26 | Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras | Hoonhee Cho et.al. | 2502.19630 | link |
2025-02-26 | 3D Nephrographic Image Synthesis in CT Urography with the Diffusion Model and Swin Transformer | Hongkun Yu et.al. | 2502.19623 | null |
2025-02-26 | Diffusion-based Planning with Learned Viability Filters | Nicholas Ioannidis et.al. | 2502.19564 | null |
2025-02-26 | Mixed Finite Element Analysis of Flexoelectric Response: Exploring Unit Cell Stacking and Strain Gradient Modulation | Arash Kazemi et.al. | 2502.19539 | null |
2025-02-26 | Numerical shape and topology optimization of regions supporting the boundary conditions of a physical problem | Eric Bonnetier et.al. | 2502.19510 | null |
2025-02-26 | Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting | Yu Liu et.al. | 2502.19459 | link |
2025-02-26 | Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions | Muhammad Salman Ali et.al. | 2502.19457 | null |
2025-02-26 | FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion mode | Lingzhou Mu et.al. | 2502.19455 | null |
2025-02-26 | ARENA: Adaptive Risk-aware and Energy-efficient NAvigation for Multi-Objective 3D Infrastructure Inspection with a UAV | David-Alexandre Poissant et.al. | 2502.19401 | null |
2025-02-26 | Efficient 4D fMRI ASD Classification using Spatial-Temporal-Omics-based Learning Framework | Ziqiao Weng et.al. | 2502.19386 | null |
2025-02-26 | LiDAR Registration with Visual Foundation Models | Niclas Vödisch et.al. | 2502.19374 | null |
2025-02-26 | Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? | Adam Celarek et.al. | 2502.19318 | link |
2025-02-26 | Differentiable Imaging Meets Adaptive Neural Dropout: An Advancing Method for Transparent Object Tomography | Delong Yang et.al. | 2502.19314 | null |
2025-02-26 | CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query | Zhe Wang et.al. | 2502.19313 | null |
2025-02-26 | Deep learning and classical computer vision techniques in medical image analysis: Case studies on brain MRI tissue segmentation, lung CT COPD registration, and skin lesion classification | Anyimadu Daniel Tweneboah et.al. | 2502.19258 | null |
2025-02-27 | ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding | Qihang Peng et.al. | 2502.19247 | null |
2025-02-26 | Black hole solutions of three dimensional E $_{6}$ -gravity | R. Sammani et.al. | 2502.19241 | null |
2025-02-26 | Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator | Xiankang He et.al. | 2502.19204 | link |
2025-02-26 | SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation | Junlong Ren et.al. | 2502.19128 | link |
2025-02-26 | The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields | Ziyuan Luo et.al. | 2502.19125 | null |
2025-02-26 | From Traditional to Deep Learning Approaches in Whole Slide Image Registration: A Methodological Review | Behnaz Elhaminia et.al. | 2502.19123 | null |
2025-02-26 | Naked Eye Three-dimensional Display System Based on Time-multiplexed Technology | Ziyang Liu et.al. | 2502.19099 | null |
2025-02-26 | Flexible Foil Mesh Generation for Spatial Focal-Body Modeling of a Spherical Mirror | Netzer Moriya et.al. | 2502.19092 | null |
2025-02-26 | Quantifying local heterogeneities in the 3D morphology of X-PVMPT battery electrodes based on FIB-SEM measurements | L. Dodell et.al. | 2502.19085 | null |
2025-02-26 | An Improved 3D Skeletons UP-Fall Dataset: Enhancing Data Quality for Efficient Impact Fall Detection | Tresor Y. Koffi et.al. | 2502.19048 | null |
2025-02-26 | The Ising model as a window on quantum gravity with matter | Romuald A. Janik et.al. | 2502.19015 | null |
2025-02-26 | 3D-TrIM: A Memory-Efficient Spatial Computing Architecture for Convolution Workloads | Cristian Sestito et.al. | 2502.18983 | null |
2025-02-26 | Unlocking Hidden Potential in Electron Holography of Non-Collinear Spin Textures | Moritz Winterott et.al. | 2502.18949 | null |
2025-02-26 | SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images | Yangfan Xu et.al. | 2502.18932 | null |
2025-02-26 | Investigation on the Spreading Behaviour of Sand Powder Used in Binder Jet 3D Printing | Yulun Xu et.al. | 2502.18899 | null |
2025-02-26 | SE(3)-Equivariant Ternary Complex Prediction Towards Target Protein Degradation | Fanglei Xue et.al. | 2502.18875 | link |
2025-02-26 | Towards Higher Order Accuracy in Self-Gravitating Hydrodynamics | Tomoyuki Hanawa et.al. | 2502.18794 | null |
2025-02-26 | Hyperspectral image reconstruction by deep learning with super-Rayleigh speckles | Ziyan Chen et.al. | 2502.18777 | null |
2025-02-26 | Subclass Classification of Gliomas Using MRI Fusion Technique | Kiranmayee Janardhan et.al. | 2502.18775 | null |
2025-02-26 | MaskPlanner: Learning-Based Object-Centric Motion Generation from 3D Point Clouds | Gabriele Tiboni et.al. | 2502.18745 | null |
2025-02-26 | QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries | Nicolas Harvey Chapman et.al. | 2502.18735 | null |
2025-03-04 | Spatial Analysis of Neuromuscular Junctions Activation in Three-Dimensional Histology-based Muscle Reconstructions | Alessandro Ascani Orsini et.al. | 2502.18646 | link |
2025-02-25 | eXplainMR: Generating Real-time Textual and Visual eXplanations to Facilitate UltraSonography Learning in MR | Jingying Wang et.al. | 2502.18640 | null |
2025-02-25 | Mechanisms and Scale-up Potential of 3D Solar Interfacial-Evaporators | James H. Zhang et.al. | 2502.18603 | null |
2025-02-25 | 3D Conformal Field Theory in Twistor Space | Aswini Bala et.al. | 2502.18562 | null |
2025-02-24 | General Relativity as an EFT with emergent gravity via principle of spatial energy potentiality: implications for the standard model of cosmology | Farrukh A. Chishtie et.al. | 2502.18524 | null |
2025-02-19 | Physical Depth-aware Early Accident Anticipation: A Multi-dimensional Visual Feature Fusion Framework | Hongpu Huang et.al. | 2502.18496 | null |
2025-02-25 | Nanoscale characterization of atomic positions in orthorhombic perovskite thin films | M. Martirosyan et.al. | 2502.18376 | null |
2025-02-25 | EgoSim: An Egocentric Multi-view Simulator and Real Dataset for Body-worn Cameras during Motion and Activity | Dominik Hollidt et.al. | 2502.18373 | null |
2025-02-25 | Near-Shore Mapping for Detection and Tracking of Vessels | Nicholas Dalhaug et.al. | 2502.18368 | null |
2025-02-25 | The Birth of a Major Coronal Mass Ejection with Intricate Magnetic Structure from Multiple Active Regions | Jinhan Guo et.al. | 2502.18367 | null |
2025-02-25 | Stretchable Capacitive and Resistive Strain Sensors: Accessible Manufacturing Using Direct Ink Writing | Lukas Cha et.al. | 2502.18363 | null |
2025-02-25 | GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music | Xinran Liu et.al. | 2502.18309 | null |
2025-03-01 | Table-top three-dimensional photoemission orbital tomography with a femtosecond extreme ultraviolet light source | Wiebke Bennecke et.al. | 2502.18269 | null |
2025-02-25 | A 3D Printed Quad-Ridged Flared Horn Antenna Feeder for Radio-Telescopes | Andreas Hofmann et.al. | 2502.18243 | null |
2025-02-25 | Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training | Botao Ye et.al. | 2502.18219 | null |
2025-02-25 | You Shall Not Pass: Warning Drivers of Unsafe Overtaking Maneuvers on Country Roads by Predicting Safe Sight Distance | Adrian Bauske et.al. | 2502.18163 | link |
2025-02-25 | Joint Reconstruction of Spatially-Coherent and Realistic Clothed Humans and Objects from a Single Image | Ayushi Dutta et.al. | 2502.18150 | null |
2025-02-25 | Electronic Structures across the Superconductor-Insulator Transition at La ${2.85}$Pr${0.15}$Ni$_2$O$_7$/SrLaAlO$_4$ Interfaces | Heng Wang et.al. | 2502.18068 | null |
2025-02-28 | S-Graphs 2.0 – A Hierarchical-Semantic Optimization and Loop Closure for SLAM | Hriday Bavle et.al. | 2502.18044 | link |
2025-02-25 | VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion | Pei Liu et.al. | 2502.18042 | null |
2025-03-04 | OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation | Yunpeng Gao et.al. | 2502.18041 | null |
2025-02-25 | A Peanut-hull-PLA based 3D printing filament with antimicrobial effect | Sabarinathan Palaniyappan et.al. | 2502.17975 | null |
2025-02-25 | Deep-JGAC: End-to-End Deep Joint Geometry and Attribute Compression for Dense Colored Point Clouds | Yun Zhang et.al. | 2502.17939 | null |
2025-02-25 | 3D Anatomical Structure-guided Deep Learning for Accurate Diffusion Microstructure Imaging | Xinrui Ma et.al. | 2502.17933 | null |
2025-02-25 | VVRec: Reconstruction Attacks on DL-based Volumetric Video Upstreaming via Latent Diffusion Model with Gamma Distribution | Rui Lu et.al. | 2502.17880 | null |
2025-02-25 | Animating Childlike Drawings with 2.5D Character Rigs | Harrison Jesse Smith et.al. | 2502.17866 | null |
2025-02-27 | UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting | Haoyuan Li et.al. | 2502.17860 | null |
2025-02-25 | Sketch-1-to-3: One Single Sketch to 3D Detailed Face Reconstruction | Liting Wen et.al. | 2502.17852 | null |
2025-02-26 | Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking | Peng Zhang et.al. | 2502.17822 | null |
2025-02-25 | Design of a Breakaway Utensil Attachment for Enhanced Safety in Robot-Assisted Feeding | Hau Wen Chang et.al. | 2502.17774 | null |
2025-02-25 | AI-driven 3D Spatial Transcriptomics | Cristina Almagro-Pérez et.al. | 2502.17761 | null |
2025-02-24 | Hole Spin in Direct Bandgap Germanium-Tin Quantum Dot | Nicolas Rotaru et.al. | 2502.17659 | null |
2025-02-24 | Evidence for Low Universal Equilibrium Black Hole Spin in Luminous Magnetically Arrested Disks | Beverly Lowell et.al. | 2502.17559 | null |
2025-02-24 | Tales of Tension: Magnetized Infalling Clouds and Cold Streams in the CGM | Ish Kaul et.al. | 2502.17549 | null |
2025-02-24 | Laplace-Beltrami Operator for Gaussian Splatting | Hongyu Zhou et.al. | 2502.17531 | null |
2025-02-18 | Low-Interference Near-Field Multi-User Communication Enabled by Spatially Converging Multi-Mode Vortex Waves | Yufei Zhao et.al. | 2502.17479 | null |
2025-02-24 | CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation | Vishal Thengane et.al. | 2502.17429 | link |
2025-02-24 | Joint Beamforming and 3D Location Optimization for Multi-User Holographic UAV Communications | Chandan Kumar Sheemar et.al. | 2502.17428 | null |
2025-02-24 | X-Dancer: Expressive Music to Human Dance Video Generation | Zeyuan Chen et.al. | 2502.17414 | null |
2025-02-24 | Graph-Guided Scene Reconstruction from Images with 3D Gaussian Splatting | Chong Cheng et.al. | 2502.17377 | null |
2025-02-24 | RELICT: A Replica Detection Framework for Medical Image Generation | Orhun Utku Aydin et.al. | 2502.17360 | link |
2025-03-01 | HybridLinker: Topology-Guided Posterior Sampling for Enhanced Diversity and Validity in 3D Molecular Linker Generation | Minyeong Hwang et.al. | 2502.17349 | null |
2025-02-24 | A Pristine-UNIONS view on the Galaxy: Kinematics of the distant spur feature of the Sagittarius stream traced by Blue Horizontal Branch stars | M. Bayer et.al. | 2502.17319 | null |
2025-02-24 | Modelling conductive thermal transport in three-dimensional fibrous media with fiber-to-fiber contacts | Clémence Gaunand et.al. | 2502.17318 | null |
2025-02-25 | GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow | Simon Boeder et.al. | 2502.17288 | null |
2025-02-24 | Automated generation of epilepsy surgery resection masks; The RAMPS pipeline | Callum Simpson et.al. | 2502.17287 | null |
2025-02-24 | Additional jamming transition in 2D bidisperse granular packings | Juan C. Petit et.al. | 2502.17266 | null |
2025-02-24 | CAR-LOAM: Color-Assisted Robust LiDAR Odometry and Mapping | Yufei Lu et.al. | 2502.17249 | null |
2025-02-25 | Particle geometry space: An integrated characterization of particle shape, surface area, volume, specific surface, and size distribution | Priya Tripathi et.al. | 2502.17243 | null |
2025-02-25 | MegaLoc: One Retrieval to Place Them All | Gabriele Berton et.al. | 2502.17237 | link |
2025-02-24 | A highly sensitive, self-adhesive, biocompatible DLP 3D printed organohydrogel for flexible sensors and wearable devices | Ze Zhang et.al. | 2502.17208 | null |
2025-02-24 | Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation | Baptiste Chopin et.al. | 2502.17198 | null |
2025-02-24 | Structural Anisotropy Stabilises Asymmetric Beating in Instability Driven Filaments | Bethany Clarke et.al. | 2502.17140 | null |
2025-02-24 | Applications of Large Models in Medicine | YunHe Su et.al. | 2502.17132 | null |
2025-02-24 | Electronic and structural properties of atomically thin metallenes | Kameyab Raza Abidi et.al. | 2502.17131 | null |
2025-02-24 | Rotatable Antenna Enabled Wireless Communication System with Visual Recognition: A Prototype Implementation | Liang Dai et.al. | 2502.17097 | null |
2025-02-24 | Conditional Diffusion-Flow models for generating 3D cosmic density fields: applications to f(R) cosmologies | Julieth Katherine Riveros et.al. | 2502.17087 | null |
2025-02-24 | VR-Pipe: Streamlining Hardware Graphics Pipeline for Volume Rendering | Junseo Lee et.al. | 2502.17078 | null |
2025-02-24 | Status of Iron Based Superconductors: characteristics and relevant properties for applications | Kazumasa Iida et.al. | 2502.17063 | null |
2025-02-26 | PointSea: Point Cloud Completion via Self-structure Augmentation | Zhe Zhu et.al. | 2502.17053 | link |
2025-02-24 | LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR | Xinxin Feng et.al. | 2502.17039 | null |
2025-02-25 | Evolution 6.0: Evolving Robotic Capabilities Through Generative Design | Muhammad Haris Khan et.al. | 2502.17034 | null |
2025-02-24 | M3DA: Benchmark for Unsupervised Domain Adaptation in 3D Medical Image Segmentation | Boris Shirokikh et.al. | 2502.17029 | null |
2025-02-24 | Gaussian Difference: Find Any Change Instance in 3D Scenes | Binbin Jiang et.al. | 2502.16941 | null |
2025-02-24 | DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learning | Zhengrong Xue et.al. | 2502.16932 | null |
2025-02-24 | Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model | Kang Fu et.al. | 2502.16915 | null |
2025-02-24 | Design of a low-cost and lightweight 6 DoF bimanual arm for dynamic and contact-rich manipulation | Jaehyung Kim et.al. | 2502.16908 | null |
2025-02-24 | MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation | Jiehao Luo et.al. | 2502.16907 | link |
2025-02-28 | Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model | Yaxuan Huang et.al. | 2502.16779 | null |
2025-02-23 | ViSNeRF: Efficient Multidimensional Neural Radiance Field Representation for Visualization Synthesis of Dynamic Volumetric Scenes | Siyuan Yao et.al. | 2502.16731 | link |
2025-02-23 | DOSE3 : Diffusion-based Out-of-distribution detection on SE(3) trajectories | Hongzhe Cheng et.al. | 2502.16725 | null |
2025-02-23 | Puzzles in 3D Off-Shell Geometries via VTQFT | Cynthia Yan et.al. | 2502.16686 | null |
2025-02-23 | Bumpy Ride? Understanding the Effects of External Forces on Spatial Interactions in Moving Vehicles | Markus Sasalovici et.al. | 2502.16656 | link |
2025-02-23 | Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration | Kim Jun-Seong et.al. | 2502.16652 | null |
2025-02-23 | Improving Monocular Visual-Inertial Initialization with Structureless Visual-Inertial Bundle Adjustment | Junlin Song et.al. | 2502.16598 | null |
2025-02-23 | Quasiperiodic Super-Alfvenic Slippage Along Flare Ribbons Observed by the Interface Region Imaging Spectrograph | Yining Zhang et.al. | 2502.16579 | null |
2025-02-23 | Efficient 4D Gaussian Stream with Low Rank Adaptation | Zhenhuan Liu et.al. | 2502.16575 | null |
2025-02-23 | Geometry-Aware 3D Salient Object Detection Network | Chen Wang et.al. | 2502.16488 | null |
2025-02-23 | Flexible Intelligent Metasurfaces for Enhanced MIMO Communications | Jiancheng An et.al. | 2502.16478 | null |
2025-02-23 | Dragen3D: Multiview Geometry Consistent 3D Gaussian Generation with Drag-Based Control | Jinbo Yan et.al. | 2502.16475 | null |
2025-02-23 | Downlink Multiuser Communications Relying on Flexible Intelligent Metasurfaces | Jiancheng An et.al. | 2502.16472 | null |
2025-02-23 | Asteroid shape inversion with light curves using deep learning | YiJun Tang et.al. | 2502.16455 | null |
2025-02-23 | DeProPose: Deficiency-Proof 3D Human Pose Estimation via Adaptive Multi-View Fusion | Jianbin Jiao et.al. | 2502.16419 | link |
2025-02-22 | Supermarket-6DoF: A Real-World Grasping Dataset and Grasp Pose Representation Analysis | Jason Toskov et.al. | 2502.16311 | null |
2025-02-22 | Towards a GPU-Native Adaptive Mesh Refinement Scheme for the Lattice Boltzmann Method in Complex Geometries | Khodr Jaber et.al. | 2502.16310 | link |
2025-02-22 | Bursty acceleration and 3D trajectories of electrons in a solar flare | Shilpi Bhunia et.al. | 2502.16307 | null |
2025-02-22 | Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field | Wenhao Hu et.al. | 2502.16303 | null |
2025-02-22 | DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation | Yuxuan Xiong et.al. | 2502.16302 | null |
2025-02-22 | MolSpectra: Pre-training 3D Molecular Representation with Multi-modal Energy Spectra | Liang Wang et.al. | 2502.16284 | link |
2025-02-22 | Schrödinger evolution on surfaces in 3D contact sub-Riemannian manifolds | Riccardo Adami et.al. | 2502.16186 | null |
2025-02-22 | Mojito: LLM-Aided Motion Instructor with Jitter-Reduced Inertial Tokens | Ziwei Shan et.al. | 2502.16175 | null |
2025-02-22 | Teardown Analysis of Samsung S20 Exynos 990 SoC | Nabeel Ahmad Khan Jadoon et.al. | 2502.16166 | null |
2025-02-21 | Exact Nonlinear Decomposition of Ideal-MHD Waves Using Eigenenergies III: Gravity, Generalized Inhomogeneous Quasi-linear PDEs, Mode Conversion, and Numerical Implementation | Abbas Raboonik et.al. | 2502.16010 | null |
2025-02-21 | Mean-Shift Distillation for Diffusion Mode Seeking | Vikas Thamizharasan et.al. | 2502.15989 | null |
2025-02-21 | Towards Autonomous Navigation of Neuroendovascular Tools for Timely Stroke Treatment via Contact-aware Path Planning | Aabha Tamhankar et.al. | 2502.15971 | null |
2025-02-21 | Human Motion Prediction, Reconstruction, and Generation | Canxuan Gang et.al. | 2502.15956 | null |
2025-02-21 | Time-resolved 3D momentum spectroscopy in continuous wave atomic photoionization | Kevin L. Romans et.al. | 2502.15911 | null |
2025-02-18 | Understanding and Evaluating Hallucinations in 3D Visual Language Models | Ruiying Peng et.al. | 2502.15888 | null |
2025-02-21 | Generative AI Framework for 3D Object Generation in Augmented Reality | Majid Behravan et.al. | 2502.15869 | null |
2025-02-21 | DiffCheck: a Scan-CAD Evaluation Tool for Digital Manufacturing and Assembly Processes in Timber Construction | Andrea Settimi et.al. | 2502.15864 | null |
2025-02-19 | Spiking Point Transformer for Point Cloud Classification | Peixi Wu et.al. | 2502.15811 | link |
2025-02-24 | Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis | Ziqian Ni et.al. | 2502.15635 | null |
2025-02-21 | RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes | Sicheng Yu et.al. | 2502.15633 | null |
2025-02-21 | Pick-and-place Manipulation Across Grippers Without Retraining: A Learning-optimization Diffusion Policy Approach | Xiangtong Yao et.al. | 2502.15613 | link |
2025-02-28 | WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents | Xinhang Liu et.al. | 2502.15601 | null |
2025-02-21 | Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection | Yue Sun et.al. | 2502.15516 | null |
2025-02-21 | Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection | Jiangyong Yu et.al. | 2502.15488 | null |
2025-02-21 | Game State and Spatio-temporal Action Detection in Soccer using Graph Neural Networks and 3D Convolutional Networks | Jeremie Ochin et.al. | 2502.15462 | null |
2025-02-21 | Robust 4D Radar-aided Inertial Navigation for Aerial Vehicles | Jinwen Zhu et.al. | 2502.15452 | null |
2025-02-21 | Anatomy-Informed Deep Learning and Radiomics for Automated Neurofibroma Segmentation in Whole-Body MRI | Georgii Kolokolnikov et.al. | 2502.15424 | link |
2025-02-21 | Enhancing Vehicle Make and Model Recognition with 3D Attention Modules | Narges Semiromizadeh et.al. | 2502.15398 | null |
2025-02-21 | Semiparametric Bernstein-von Mises Phenomenon via Isotonized Posterior in Wicksell’s problem | Francesco Gili et.al. | 2502.15352 | null |
2025-02-26 | PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments | Yueting Liu et.al. | 2502.15342 | link |
2025-02-24 | DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation | Luzhou Ge et.al. | 2502.15309 | link |
2025-02-21 | A deep learning-based noise correction method for light-field fluorescence microscopy | Bohan Qu et.al. | 2502.15259 | null |
2025-02-21 | SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training | Nie Lin et.al. | 2502.15251 | link |
2025-02-21 | Holographic Joint Communications and Sensing With Cramer-Rao Bounds | Chandan Kumar Sheemar et.al. | 2502.15248 | null |
2025-02-21 | Mass enhancement and metal-nonmetal transition driven by d-f hybridization in perovskites La1-xPrxCuO3 | H. Takahashi et.al. | 2502.15238 | null |
2025-02-21 | Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis | Yifan Jiang et.al. | 2502.15204 | link |
2025-02-21 | OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework | Junliang Chen et.al. | 2502.15180 | link |
2025-02-21 | Nonlinear Dynamical Systems for Automatic Face Annotation in Head Tracking and Pose Estimation | Thoa Thieu et.al. | 2502.15179 | null |
2025-02-21 | From BPS Spectra of Argyres-Douglas Theories to Families of 3d TFTs | Byeonggi Go et.al. | 2502.15133 | null |
2025-02-21 | 3D Simulations of Semiconvection in Spheres: Turbulent Mixing and Layer Formation | J. R. Fuentes et.al. | 2502.15111 | null |
2025-02-20 | Exploration of Helicon Plasmas for Wakefield Accelerators at the Madison AWAKE Prototype | Marcel Granetzny et.al. | 2502.15085 | null |
2025-02-20 | Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios | Richard Marcus et.al. | 2502.15076 | link |
2025-02-20 | CrossOver: 3D Scene Cross-Modal Alignment | Sayan Deb Sarkar et.al. | 2502.15011 | link |
2025-02-20 | 3D Adiabatic Simulations of Binary Black Hole Formation in AGN | Henry Whitehead et.al. | 2502.14959 | null |
2025-02-20 | Tension of toroidal magnetic field in reconnection plasmoids and relativistic jets | Krzysztof Nalewajko et.al. | 2502.14954 | null |
2025-02-20 | Stochastic interpretations of the oceanic primitive equations with relaxed hydrostatic assumptions | Arnaud Debussche et.al. | 2502.14946 | null |
2025-02-20 | FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models | Thomas Froech et.al. | 2502.14940 | link |
2025-02-20 | Online hand gesture recognition using Continual Graph Transformers | Rim Slama et.al. | 2502.14939 | null |
2025-02-20 | GS-Cache: A GS-Cache Inference Framework for Large-scale Gaussian Splatting Models | Miao Tao et.al. | 2502.14938 | null |
2025-02-20 | Denoising, segmentation and volumetric rendering of optical coherence tomography angiography (OCTA) image using deep learning techniques: a review | Kejie Chen et.al. | 2502.14935 | null |
2025-02-20 | Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2502.14931 | null |
2025-02-19 | Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning | Rui Zhao et.al. | 2502.14917 | null |
2025-02-17 | High-Dynamic Radar Sequence Prediction for Weather Nowcasting Using Spatiotemporal Coherent Gaussian Representation | Ziye Wang et.al. | 2502.14895 | null |
2025-02-17 | CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Zhe Huang et.al. | 2502.14891 | link |
2025-02-20 | Long-Term Multidimensional Models of Core-Collapse Supernovae: Progress and Challenges | H. -Thomas Janka et.al. | 2502.14836 | null |
2025-02-20 | Planning, scheduling, and execution on the Moon: the CADRE technology demonstration mission | Gregg Rabideau et.al. | 2502.14803 | null |
2025-02-20 | A Survey on Text-Driven 360-Degree Panorama Generation | Hai Wang et.al. | 2502.14799 | null |
2025-02-20 | Structurally Disentangled Feature Fields Distillation for 3D Understanding and Editing | Yoel Levy et.al. | 2502.14789 | null |
2025-02-20 | MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders | Maya Varma et.al. | 2502.14753 | link |
2025-02-20 | CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting | Qilin Zhang et.al. | 2502.14684 | link |
2025-02-21 | Lopsided and Bulging Distribution of Satellites around Paired Halos. II. 3D Analysis and Dependence on Projection and Selection Effects | Qinglin Ma et.al. | 2502.14668 | null |
2025-02-20 | 3D permutations and triangle solitaire | Juliette Schabanel et.al. | 2502.14657 | null |
2025-02-20 | ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation | Angxiao Yue et.al. | 2502.14637 | link |
2025-02-20 | Curiosity Driven Multi-agent Reinforcement Learning for 3D Game Testing | Raihana Ferdous et.al. | 2502.14606 | link |
2025-02-20 | Qualitative derivation of a density dependent incompressible Darcy law | Danica Basarić et.al. | 2502.14602 | null |
2025-02-20 | The massive BMS character in 3D quantum gravity | T. Mursheed Amith et.al. | 2502.14578 | null |
2025-02-21 | Accelerated X-Ray Fluorescence Computed Tomography via Multi-Pencil-Beam Excitation | Ryder M. Schmidt et.al. | 2502.14524 | null |
2025-02-20 | Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance | Meng Wang et.al. | 2502.14520 | null |
2025-02-20 | LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera | Weiyi Xiong et.al. | 2502.14503 | null |
2025-02-20 | Exploiting Deblurring Networks for Radiance Fields | Haeyun Choi et.al. | 2502.14454 | null |
2025-02-20 | Bootstrapping SU(3) Lattice Yang-Mills Theory | Yuanhong Guo et.al. | 2502.14421 | null |
2025-02-20 | MedFuncta: Modality-Agnostic Representations Based on Efficient Neural Fields | Paul Friedrich et.al. | 2502.14401 | link |
2025-02-27 | SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images | Yichi Zhang et.al. | 2502.14351 | link |
2025-02-20 | Textured 3D Regenerative Morphing with 3D Diffusion Prior | Songlin Yang et.al. | 2502.14316 | null |
2025-02-20 | Road to 6G Digital Twin Networks: Multi-Task Adaptive Ray-Tracing as a Key Enabler | Li Yu et.al. | 2502.14290 | null |
2025-02-20 | Understanding observational characteristics of solar flare current sheets | Zining Ren et.al. | 2502.14283 | null |
2025-02-21 | Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture Generation | Jiayu Yang et.al. | 2502.14247 | link |
2025-02-26 | SMILE: a universal tool for modulated-enhanced localization microscopy to achieve minimal three-dimensional resolution | Hongfei Zhu et.al. | 2502.14243 | null |
2025-02-20 | OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving | Yedong Shen et.al. | 2502.14235 | null |
2025-02-20 | H3DE-Net: Efficient and Accurate 3D Landmark Detection in Medical Imaging | Zhen Huang et.al. | 2502.14221 | link |
2025-02-20 | Rethinking Spiking Neural Networks from an Ensemble Learning Perspective | Yongqi Ding et.al. | 2502.14218 | null |
2025-02-20 | Stereo Image Coding for Machines with Joint Visual Feature Compression | Dengchao Jin et.al. | 2502.14190 | null |
2025-02-20 | NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis | Xiaoxing Liu et.al. | 2502.14178 | null |
2025-02-20 | “It Brought the Model to Life”: Exploring the Embodiment of Multimodal I3Ms for People who are Blind or have Low Vision | Samuel Reinders et.al. | 2502.14163 | null |
2025-02-21 | Token Adaptation via Side Graph Convolution for Efficient Fine-tuning of 3D Point Cloud Transformers | Takahiko Furuya et.al. | 2502.14142 | link |
2025-02-19 | GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian | Bang Du et.al. | 2502.14129 | null |
2025-02-19 | Point Cloud Geometry Scalable Coding Using a Resolution and Quality-conditioned Latents Probability Estimator | Daniele Mari et.al. | 2502.14099 | null |
2025-02-23 | Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging | Shansong Wang et.al. | 2502.14064 | link |
2025-02-19 | Im2SurfTex: Surface Texture Generation via Neural Backprojection of Multi-View Images | Yiangos Georgiou et.al. | 2502.14006 | null |
2025-02-19 | Inter3D: A Benchmark and Strong Baseline for Human-Interactive 3D Object Reconstruction | Gan Chen et.al. | 2502.14004 | link |
2025-02-19 | FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation | Yunpeng Zhang et.al. | 2502.13995 | link |
2025-02-19 | Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections | Seong Jong Yoo et.al. | 2502.13986 | null |
2025-02-19 | Betsu-Betsu: Multi-View Separable 3D Reconstruction of Two Interacting Objects | Suhas Gopal et.al. | 2502.13968 | null |
2025-02-19 | A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects | Arjun Gupta et.al. | 2502.13964 | null |
2025-02-19 | Vertex functions of type $D$ Nakajima quiver varieties | Hunter Dinkins et.al. | 2502.13937 | null |
2025-02-19 | The NavINST Dataset for Multi-Sensor Autonomous Navigation | Paulo Ricardo Marques de Araujo et.al. | 2502.13863 | null |
2025-02-19 | 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments | Vincent Ress et.al. | 2502.13803 | null |
2025-02-19 | User Agency and System Automation in Interactive Intelligent Systems | Thomas Langerak et.al. | 2502.13779 | null |
2025-02-19 | Intrinsic Cramér-Rao Bound based 6D Localization and Tracking for 5G/6G Systems | Xueting Xu et.al. | 2502.13733 | null |
2025-02-19 | Neural Density Functional Theory in Higher Dimensions with Convolutional Layers | Felix Glitsch et.al. | 2502.13717 | null |
2025-02-19 | Pericoronary adipose tissue attenuation as a predictor of functional severity of coronary stenosis | Marta Pillitteri et.al. | 2502.13649 | null |
2025-02-19 | Optimization of the Woodcock Particle Tracking Method Using Neural Network | Bingnan Zhang et.al. | 2502.13620 | null |
2025-02-19 | DFT+DMFT study on pressure-induced valence instability of CeCoSi | Shuai-Kang Zhang et.al. | 2502.13585 | null |
2025-02-19 | Multi-Target Radar Search and Track Using Sequence-Capable Deep Reinforcement Learning | Jan-Hendrik Ewers et.al. | 2502.13584 | null |
2025-03-01 | MobileViM: A Light-weight and Dimension-independent Vision Mamba for 3D Medical Image Analysis | Wei Dai et.al. | 2502.13524 | link |
2025-02-19 | Self-ion irradiation effects on nanoindentation-induced plasticity of crystalline iron: A joint experimental and computational study | K. Mulewska et.al. | 2502.13505 | null |
2025-02-19 | 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification | Yusuke Uchida et.al. | 2502.13484 | link |
2025-02-19 | Stochastic tamed 3D Navier-Stokes equations with locally weak monotonicity coefficients: existence, uniqueness and averaging principle | Shuaishuai Lu et.al. | 2502.13478 | null |
2025-02-19 | Anomalous Chern-Simons orbital magnetoelectric coupling of three-dimensional Chern insulators: gauge-discontinuity formalism and adiabatic pumping | Yang Xue et.al. | 2502.13405 | null |
2025-02-19 | Origin of the tiny energy gap and Dirac points in monoclinic trilayer nickelate La $4$Ni$_3$O${10}$ | Hu Zhang et.al. | 2502.13354 | null |
2025-02-18 | Geometry-Aware Diffusion Models for Multiview Scene Inpainting | Ahmad Salimi et.al. | 2502.13335 | null |
2025-02-18 | An unusual first order phase transition in a 2D superconductor | Noah J. Jabusch et.al. | 2502.13265 | null |
2025-02-18 | Learning the Universe: Learning to Optimize Cosmic Initial Conditions with Non-Differentiable Structure Formation Models | Ludvig Doeser et.al. | 2502.13243 | null |
2025-02-18 | Poisson Vertex Algebras and Three-Dimensional Gauge Theory | Ahsan Z. Khan et.al. | 2502.13227 | null |
2025-02-18 | The impact of conformer quality on learned representations of molecular conformer ensembles | Keir Adams et.al. | 2502.13220 | null |
2025-02-18 | The XMAGNET exascale MHD simulations of SMBH feedback in galaxy groups and clusters: Overview and preliminary cluster results | Philipp Grete et.al. | 2502.13213 | null |
2025-02-18 | GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis | Pedro Martin et.al. | 2502.13196 | null |
2025-02-18 | Fundus2Globe: Generative AI-Driven 3D Digital Twins for Personalized Myopia Management | Danli Shi et.al. | 2502.13182 | null |
2025-02-17 | Generative Topology Optimization: Exploring Diverse Solutions in Structural Design | Andreas Radler et.al. | 2502.13174 | link |
2025-02-16 | Noumenal Labs White Paper: How To Build A Brain | Maxwell J. D. Ramstead et.al. | 2502.13161 | null |
2025-02-18 | SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation | Zekun Qi et.al. | 2502.13143 | null |
2025-02-18 | Pre-training Auto-regressive Robotic Models with 4D Representations | Dantong Niu et.al. | 2502.13142 | null |
2025-02-18 | RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird’s Eye View for 3D Object Detection | Jingtong Yue et.al. | 2502.13071 | null |
2025-02-18 | Enhancing Power Grid Inspections with Machine Learning | Diogo Lavado et.al. | 2502.13037 | null |
2025-02-18 | SHADeS: Self-supervised Monocular Depth Estimation Through Non-Lambertian Image Decomposition | Rema Daher et.al. | 2502.12994 | null |
2025-02-18 | PartSDF: Part-Based Implicit Neural Representation for Composite 3D Shape Parametrization and Optimization | Nicolas Talabot et.al. | 2502.12985 | link |
2025-02-18 | Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression | Jaemoon Lee et.al. | 2502.12951 | null |
2025-02-18 | Fabrication and characterization of bimetallic silica-based and 3D-printed active colloidal cubes | Silvana A. Caipa Cure et.al. | 2502.12941 | null |
2025-02-18 | CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image | Kaixin Yao et.al. | 2502.12894 | null |
2025-02-18 | An Experimental Study of SOTA LiDAR Segmentation Models | Bike Chen et.al. | 2502.12860 | null |
2025-02-18 | Carotid Artery Plaque Analysis in 3D Based on Distance Encoding in Mesh Representations | Hinrich Rahlfs et.al. | 2502.12819 | null |
2025-02-18 | Learning Wall Segmentation in 3D Vessel Trees using Sparse Annotations | Hinrich Rahlfs et.al. | 2502.12801 | null |
2025-02-18 | Beyond Timesteps: A Novel Activation-wise Membrane Potential Propagation Mechanism for Spiking Neural Networks in 3D cloud | Jian Song et.al. | 2502.12791 | null |
2025-02-18 | Dynamical Constraints on the Vertical Structure of Jupiter’s Polar Cyclones | Nimrod Gavriel et.al. | 2502.12789 | null |
2025-02-18 | High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion | Xiang Zhang et.al. | 2502.12752 | null |
2025-02-18 | 3D Shape-to-Image Brownian Bridge Diffusion for Brain MRI Synthesis from Cortical Surfaces | Fabian Bongratz et.al. | 2502.12742 | null |
2025-02-18 | Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection | Zijian Cao et.al. | 2502.12735 | null |
2025-02-18 | RadSplatter: Extending 3D Gaussian Splatting to Radio Frequencies for Wireless Radiomap Extrapolation | Yiheng Wang et.al. | 2502.12686 | null |
2025-02-18 | ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition | Quoc-Anh Bui et.al. | 2502.12673 | null |
2025-02-25 | LiMo-Calib: On-Site Fast LiDAR-Motor Calibration for Quadruped Robot-Based Panoramic 3D Sensing System | Jianping Li et.al. | 2502.12655 | link |
2025-02-18 | RecDreamer: Consistent Text-to-3D Generation via Uniform Score Distillation | Chenxi Zheng et.al. | 2502.12640 | null |
2025-02-27 | NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation | Zhiyuan Liu et.al. | 2502.12638 | link |
2025-02-18 | Spatiotemporal Multi-Camera Calibration using Freely Moving People | Sang-Eun Lee et.al. | 2502.12546 | null |
2025-02-19 | IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 $^\circ$ Cameras | Dongki Jung et.al. | 2502.12545 | null |
2025-02-20 | CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space | Yong Zhao et.al. | 2502.12532 | link |
2025-02-18 | Not-So-Optimal Transport Flows for 3D Point Cloud Generation | Ka-Hei Hui et.al. | 2502.12456 | null |
2025-02-18 | Disordered ground state in the 3D face-centred frustrated spin- $\frac{5}{2}$ system MnSn(OH)$_\text{6}$ | Kaushick K. Parui et.al. | 2502.12433 | null |
2025-02-18 | Gaseous Object Detection | Kailai Zhou et.al. | 2502.12415 | null |
2025-02-18 | Sensing-based Robustness Challenges in Agricultural Robotic Harvesting | C. Beldek et.al. | 2502.12403 | null |
2025-02-17 | Vertical structure of an exoplanet’s atmospheric jet stream | Julia V. Seidel et.al. | 2502.12261 | null |
2025-02-17 | A new convection scheme for GCMs of temperate sub-Neptunes | Edouard F. L. Barrier et.al. | 2502.12234 | null |
2025-02-17 | PUGS: Zero-shot Physical Understanding with Gaussian Splatting | Yinghao Shuai et.al. | 2502.12231 | link |
2025-02-14 | 3D ReX: Causal Explanations in 3D Neuroimaging Classification | Melane Navaratnarajah et.al. | 2502.12181 | null |
2025-02-17 | VoLUT: Efficient Volumetric streaming enhanced by LUT-based super-resolution | Chendong Wang et.al. | 2502.12151 | null |
2025-02-17 | Resolving the sodiation process in hard carbon anodes with nanostructure specific X-ray imaging | Martina Olsson et.al. | 2502.12139 | null |
2025-02-19 | FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views | Shangzhan Zhang et.al. | 2502.12138 | null |
2025-02-18 | MagicArticulate: Make Your 3D Models Articulation-Ready | Chaoyue Song et.al. | 2502.12135 | null |
2025-02-17 | 3D Vortices and rotating solitons in ultralight dark matter | Ph. Brax et.al. | 2502.12100 | null |
2025-02-21 | HumanGif: Single-View Human Diffusion with Generative Prior | Shoukang Hu et.al. | 2502.12080 | link |
2025-02-17 | A versatile experimental method to measure the traction forces at interfaces | Yingwei Hou et.al. | 2502.12044 | null |
2025-02-17 | Robotic CBCT Meets Robotic Ultrasound | Feng Li et.al. | 2502.12019 | null |
2025-02-17 | Multi-mode Pulsations in AGB Stars: Insights from 3D RHD CO5BOLD Simulations | Arief Ahmad et.al. | 2502.11978 | null |
2025-02-17 | Identification of Polytypism and Their Dislocations in Bilayer MoS2 Using Correlative Transmission Electron Microscopy and Raman Spectroscopy | Xin Zhou et.al. | 2502.11977 | null |
2025-02-20 | Defining and Evaluating Visual Language Models’ Basic Spatial Abilities: A Perspective from Psychometrics | Wenrui Xu et.al. | 2502.11859 | null |
2025-02-17 | 3D Gaussian Inpainting with Depth-Guided Cross-View Consistency | Sheng-Yu Huang et.al. | 2502.11801 | null |
2025-02-17 | Improving electron tomography of mesoporous silica by Ga intrusion | Alexander Kichigin et.al. | 2502.11794 | null |
2025-02-17 | Exploring the Versal AI Engine for 3D Gaussian Splatting | Kotaro Shimamura et.al. | 2502.11782 | null |
2025-02-17 | Deep Neural Networks for Accurate Depth Estimation with Latent Space Features | Siddiqui Muhammad Yasir et.al. | 2502.11777 | null |
2025-02-21 | FUNCTO: Function-Centric One-Shot Imitation Learning for Tool Manipulation | Chao Tang et.al. | 2502.11744 | null |
2025-02-17 | No-reference geometry quality assessment for colorless point clouds via list-wise rank learning | Zheng Li et.al. | 2502.11726 | link |
2025-02-17 | MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow | Hanzhuo Huang et.al. | 2502.11697 | null |
2025-02-17 | Global Solvability for the Compressible Hookean Viscoelastic Fluids with a Free Boundary in Some Classes of Large Data | Fei Jiang et.al. | 2502.11683 | null |
2025-02-17 | Deep Subspace Learning for Surface Anomaly Classification Based on 3D Point Cloud Data | Xuanming Cao et.al. | 2502.11669 | null |
2025-02-17 | VRoPE: Rotary Position Embedding for Video Large Language Models | Zikang Liu et.al. | 2502.11664 | link |
2025-02-20 | Astrometric Measurements and Analysis of Double Star System BRT 376 | Xinyue Wang et.al. | 2502.11648 | null |
2025-02-17 | GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text | Gyumin Shim et.al. | 2502.11642 | null |
2025-02-17 | An ultra-compact deterministic source of maximally entangled photon pairs | M. Langer et.al. | 2502.11623 | null |
2025-02-18 | Cheesemap: A High-Performance Point-Indexing Data Structure for Neighbor Search in LiDAR Data | Ruben Laso et.al. | 2502.11602 | null |
2025-02-17 | Syllables to Scenes: Literary-Guided Free-Viewpoint 3D Scene Synthesis from Japanese Haiku | Chunan Yu et.al. | 2502.11586 | null |
2025-02-17 | The JCMT BISTRO Survey: Magnetic Fields Align with Orbital Structure in the Galactic Center | Janik Karoly et.al. | 2502.11552 | null |
2025-02-17 | SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking | Zijian Wu et.al. | 2502.11534 | null |
2025-02-17 | Application of Many-body Non-perturbative Theories to the Three-Dimensional Attractive Hubbard Model | Junnian Xiong et.al. | 2502.11527 | null |
2025-02-18 | AI-Assisted Thin Section Image Processing for Pore-Throat Characterization in Tight Clastic Rocks | Muhammad Risha et.al. | 2502.11523 | null |
2025-02-17 | Leveraging Labelled Data Knowledge: A Cooperative Rectification Learning Network for Semi-supervised 3D Medical Image Segmentation | Yanyan Wang et.al. | 2502.11456 | link |
2025-02-17 | MARS: Mesh AutoRegressive Model for 3D Shape Detailization | Jingnan Gao et.al. | 2502.11390 | null |
2025-02-17 | Three-dimensional imaging of biological cells using surface plasmon coupled emission | Anik Mazumder et.al. | 2502.11334 | null |
2025-02-17 | Valency, charge-transfer, and orbital-dependent correlation in bilayer nickelates Nd3Ni2O7 | Daisuke Takegami et.al. | 2502.11327 | null |
2025-02-16 | Wake transition and aerodynamics of a dragonfly-inspired airfoil | Alessandro Chiarini et.al. | 2502.11309 | null |
2025-02-16 | Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly Detection | Jiaxiang Wang et.al. | 2502.11307 | null |
2025-02-16 | MC-BEVRO: Multi-Camera Bird Eye View Road Occupancy Detection for Traffic Monitoring | Arpitsinh Vaghela et.al. | 2502.11287 | null |
2025-02-16 | Set-Based Position Ambiguity Reduction Method for Zonotope Shadow Matching in Urban Areas Using Estimated Multipath Errors | Sanghyun Kim et.al. | 2502.11283 | null |
2025-02-16 | 3D Electron Diffraction as GIWAXS Alternative for Quantitative Structural Characterization of Organic Solar Cells | Irene Kraus et.al. | 2502.11254 | null |
2025-02-19 | Large Language-Geometry Model: When LLM meets Equivariance | Zongzhao Li et.al. | 2502.11149 | null |
2025-02-16 | NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM | Zihan Wang et.al. | 2502.11142 | link |
2025-02-16 | The Holography of the 2D inhomogeneously deformed CFT | Zhehan Li et.al. | 2502.11135 | null |
2025-02-16 | Advanced 3D-Printed Multiphasic Scaffold with Optimal PRP Dosage for Chondrogenesis of BM-MSCs in Osteochondral Tissue Engineering | Faezeh Ghobadi et.al. | 2502.11130 | null |
2025-02-16 | AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning | Yuanfei Wang et.al. | 2502.11124 | null |
2025-02-16 | Text-promptable Propagation for Referring Medical Image Sequence Segmentation | Runtian Yuan et.al. | 2502.11093 | null |
2025-02-16 | OMG: Opacity Matters in Material Modeling with Gaussian Splatting | Silong Yong et.al. | 2502.10988 | null |
2025-02-18 | TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction | Yunfei Liu et.al. | 2502.10982 | null |
2025-02-16 | GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting | Zelin Zhou et.al. | 2502.10975 | null |
2025-02-15 | 3D printed human skull phantoms for transcranial photoacoustic imaging | Hannah Linde et.al. | 2502.10910 | null |
2025-02-15 | The Representation and Recall of Interwoven Structured Knowledge in LLMs: A Geometric and Layered Analysis | Ge Lei et.al. | 2502.10871 | link |
2025-02-15 | Mobile Robotic Multi-View Photometric Stereo | Suryansh Kumar et.al. | 2502.10842 | null |
2025-02-15 | E-3DGS: Event-Based Novel View Rendering of Large-Scale Scenes Using 3D Gaussian Splatting | Sohaib Zahid et.al. | 2502.10827 | null |
2025-02-18 | VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS | Ming Meng et.al. | 2502.10729 | link |
2025-02-15 | Semantics-aware Test-time Adaptation for 3D Human Pose Estimation | Qiuxia Lin et.al. | 2502.10724 | null |
2025-02-15 | Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model | Weilin Sun et.al. | 2502.10675 | null |
2025-02-15 | Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition | Khanh Nguyen et.al. | 2502.10674 | null |
2025-02-14 | Tusqh: Topological Control of Volume-Fraction Meshes Near Small Features and Dirty Geometry | Brian Shawcroft et.al. | 2502.10609 | null |
2025-02-14 | HIPPo: Harnessing Image-to-3D Priors for Model-free Zero-shot 6D Pose Estimation | Yibo Liu et.al. | 2502.10606 | null |
2025-02-14 | Accelerating Quantitative MRI using Subspace Multiscale Energy Model (SS-MuSE) | Yan Chen et.al. | 2502.10580 | null |
2025-02-14 | Solar System Elemental Abundances from the Solar Photosphere and CI-Chondrites | Katharina Lodders et.al. | 2502.10575 | null |
2025-02-14 | SAMRI-2: A Memory-based Model for Cartilage and Meniscus Segmentation in 3D MRIs of the Knee Joint | Danielle L. Ferreira et.al. | 2502.10559 | null |
2025-02-14 | Inverse design of 3D-printable metalenses with complementary dispersion for terahertz imaging | Mo Chen et.al. | 2502.10520 | null |
2025-02-14 | Multi-view 3D surface reconstruction from SAR images by inverse rendering | Emile Barbier–Renard et.al. | 2502.10492 | null |
2025-02-13 | X-SG $^2$ S: Safe and Generalizable Gaussian Splatting with X-dimensional Watermarks | Zihang Cheng et.al. | 2502.10475 | null |
2025-02-12 | Deep Reinforcement Learning-Based User Scheduling for Collaborative Perception | Yandi Liu et.al. | 2502.10456 | null |
2025-02-11 | A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision | Hao Ai et.al. | 2502.10444 | null |
2025-02-14 | Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding | Wenxuan Guo et.al. | 2502.10392 | link |
2025-02-14 | ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences | Liyuan Zhu et.al. | 2502.10377 | null |
2025-02-14 | Analysis and Prediction of Coverage and Channel Rank for UAV Networks in Rural Scenarios with Foliage | Donggu Lee et.al. | 2502.10324 | null |
2025-02-14 | Offset geometry for extended field-of-view in multi-contrast and multi-scale X-ray microtomography of lung cancer lobectomy specimens | Harry Allan et.al. | 2502.10322 | null |
2025-02-14 | Immersive virtual games: winners for deep cognitive assessment | Dom CP Marticorena et.al. | 2502.10290 | null |
2025-02-14 | MITO: Enabling Non-Line-of-Sight Perception using Millimeter-waves through Real-World Datasets and Simulation Tools | Laura Dodds et.al. | 2502.10259 | link |
2025-02-24 | Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model | Guoqing Ma et.al. | 2502.10248 | link |
2025-02-14 | Enhancing Patient Acceptance of Robotic Ultrasound through Conversational Virtual Agent and Immersive Visualizations | Tianyu Song et.al. | 2502.10088 | link |
2025-02-14 | RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control | Teng Li et.al. | 2502.10059 | null |
2025-02-14 | ManiTrend: Bridging Future Generation and Action Prediction with 3D Flow for Robotic Manipulation | Yuxin He et.al. | 2502.10028 | null |
2025-02-14 | From Site Response to Site-city Interaction: a Case Study in the Tokyo Area | Pierre-Yves Bard et.al. | 2502.09976 | null |
2025-02-14 | InteRecon: Towards Reconstructing Interactivity of Personal Memorable Items in Mixed Reality | Zisu Li et.al. | 2502.09973 | null |
2025-02-14 | Temporal Scale and Shift Invariant Automatic Event Recognition using the Mellin Transform | Xi Shen et.al. | 2502.09939 | null |
2025-02-14 | Towards personalised assessment of abdominal aortic aneurysm structural integrity | Mostafa Jamshidian et.al. | 2502.09905 | null |
2025-02-14 | Structure of gaps induced by retrograde satellites embedded in accretion discs | F. J. Sanchez-Salcedo et.al. | 2502.09876 | link |
2025-02-13 | PUGS: Perceptual Uncertainty for Grasp Selection in Underwater Environments | Onur Bagoren et.al. | 2502.09824 | null |
2025-02-13 | Towards Patient-Specific Surgical Planning for Bicuspid Aortic Valve Repair: Fully Automated Segmentation of the Aortic Valve in 4D CT | Zaiyang Guo et.al. | 2502.09805 | null |
2025-02-13 | Automated Muscle and Fat Segmentation in Computed Tomography for Comprehensive Body Composition Analysis | Yaqian Chen et.al. | 2502.09779 | link |
2025-02-13 | A Discontinuous Galerkin Method for Simulating 3D Seismic Wave Propagation in Nonlinear Rock Models: Verification and Application to the 2015 Mw 7.8 Gorkha Earthquake | Zihua Niu et.al. | 2502.09714 | link |
2025-02-13 | The bright, dusty aftermath of giant eruptions & H-rich supernovae. Late interaction of supernova shocks & dusty circumstellar shells | Diana B. Serrano-Hernández et.al. | 2502.09700 | null |
2025-02-13 | In-Silico Investigation of 3D Quantitative Angiography for Internal Carotid Aneurysms Using Biplane Imaging and 3D Vascular Geometry Constraints | Kyle A. Williams et.al. | 2502.09694 | null |
2025-02-13 | NeuralCFD: Deep Learning on High-Fidelity Automotive Aerodynamics Simulations | Maurits Bleeker et.al. | 2502.09692 | null |
2025-02-13 | IMM-MOT: A Novel 3D Multi-object Tracking Framework with Interacting Multiple Model Filter | Xiaohong Liu et.al. | 2502.09672 | null |
2025-02-17 | GraphCompNet: A Position-Aware Model for Predicting and Compensating Shape Deviations in 3D Printing | Lei et.al. | 2502.09652 | null |
2025-02-05 | Volumetric Temporal Texture Synthesis for Smoke Stylization using Neural Cellular Automata | Dongqing Wang et.al. | 2502.09631 | null |
2025-02-13 | Embed Any NeRF: Graph Meta-Networks for Neural Tasks on Arbitrary NeRF Architectures | Francesco Ballerini et.al. | 2502.09623 | null |
2025-02-13 | Exploring the Potential of Encoder-free Architectures in 3D LMMs | Yiwen Tang et.al. | 2502.09620 | link |
2025-02-13 | RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets | Isabella Liu et.al. | 2502.09615 | null |
2025-02-13 | Latent Radiance Fields with 3D-aware 2D Representations | Chaoyi Zhou et.al. | 2502.09613 | null |
2025-02-13 | Wireless and passive pressure detection using magneto-mechanical resonances in process engineering | Timo Merbach et.al. | 2502.09575 | null |
2025-02-13 | Self-Calibrating Gaussian Splatting for Large Field of View Reconstruction | Youming Deng et.al. | 2502.09563 | null |
2025-02-13 | A 3D Facial Reconstruction Evaluation Methodology: Comparing Smartphone Scans with Deep Learning Based Methods Using Geometry and Morphometry Criteria | Álvaro Heredia-Lidón et.al. | 2502.09425 | null |
2025-02-13 | Domain Overlapping Algorithm with Nonlinear Mapping for Collocation-Based Solutions of Eigenvalue Problems | Jinwei Yang et.al. | 2502.09398 | null |
2025-02-13 | A Deep Inverse-Mapping Model for a Flapping Robotic Wing | Hadar Sharvit et.al. | 2502.09378 | link |
2025-02-13 | Low-Acceleration Gravitational Anomaly from Bayesian 3D Modeling of Wide Binary Orbits: Methodology and Results with Gaia DR3 | Kyu-Hyun Chae et.al. | 2502.09373 | null |
2025-02-13 | Stability of composite Wave of Planar Viscous Shock and Rarefaction for 3D Barotropic Navier-Stokes Equations | Jiajin Shi et.al. | 2502.09321 | null |
2025-02-17 | ConsistentDreamer: View-Consistent Meshes Through Balanced Multi-View Gaussian Optimization | Onat Şahin et.al. | 2502.09278 | null |
2025-02-13 | FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation | Bin Yang et.al. | 2502.09274 | null |
2025-02-17 | Memory-based Ensemble Learning in CMR Semantic Segmentation | Yiwei Liu et.al. | 2502.09269 | link |
2025-02-13 | Multimodal HIE Lesion Segmentation in Neonates: A Comparative Study of Loss Functions | Annayah Usman et.al. | 2502.09148 | null |
2025-02-13 | Mathematical modeling and simulation of coupled aqueous humor flow and temperature distribution in a realistic 3D human eye geometry | Thomas Saigre et.al. | 2502.09119 | null |
2025-02-13 | Unsupervised Anomaly Detection on Implicit Shape representations for Sarcopenia Detection | Louise Piecuch et.al. | 2502.09088 | null |
2025-02-13 | BevSplat: Resolving Height Ambiguity via Feature-Based Gaussian Primitives for Weakly-Supervised Cross-View Localization | Qiwei Wang et.al. | 2502.09080 | null |
2025-02-13 | PTZ-Calib: Robust Pan-Tilt-Zoom Camera Calibration | Jinhui Guo et.al. | 2502.09075 | link |
2025-02-13 | Critical Motility-Induced Phase Separation in Three Dimensions is Consistent with Ising Universality | Jiechao Feng et.al. | 2502.09069 | null |
2025-02-13 | Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting | Lingting Zhu et.al. | 2502.09039 | link |
2025-02-13 | Conjugate harmonic functions in 3D with respect to a unitary gradient | Pablo Pedregal et.al. | 2502.09034 | null |
2025-02-13 | CoCreatAR: Enhancing Authoring of Outdoor Augmented Reality Experiences Through Asymmetric Collaboration | Nels Numan et.al. | 2502.08981 | null |
2025-02-13 | Text-driven 3D Human Generation via Contrastive Preference Optimization | Pengfei Zhou et.al. | 2502.08977 | null |
2025-02-13 | Utilizing 3D Fast Spin Echo Anatomical Imaging to Reduce the Number of Contrast Preparations in $T_{1ρ}$ Quantification of Knee Cartilage Using Learning-Based Methods | Junru Zhong et.al. | 2502.08973 | null |
2025-02-13 | SkyRover: A Modular Simulator for Cross-Domain Pathfinding | Wenhui Ma et.al. | 2502.08969 | null |
2025-02-13 | 3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning | Guoqin Tang et.al. | 2502.08903 | null |
2025-02-13 | CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery | Chenghao Zhang et.al. | 2502.08902 | null |
2025-02-13 | ShapeLib: designing a library of procedural 3D shape abstractions with Large Language Models | R. Kenny Jones et.al. | 2502.08884 | null |
2025-02-12 | MRUCT: Mixed Reality Assistance for Acupuncture Guided by Ultrasonic Computed Tomography | Yue Yang et.al. | 2502.08786 | null |
2025-02-12 | Exploring Test Time Adaptation for Subcortical Segmentation of the Fetal Brain in 3D Ultrasound | Joshua Omolegan et.al. | 2502.08774 | link |
2025-02-12 | Black Hole Spin-down in Collapsars in 3D Neutrino Transport GRMHD Simulations | Danat Issa et.al. | 2502.08732 | null |
2025-02-21 | Dual view of the Z $_2$ -Gauged XY Model in 3D | Piers Coleman et.al. | 2502.08708 | null |
2025-02-16 | Re $^3$ Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation | Xiaoshen Han et.al. | 2502.08645 | null |
2025-02-12 | CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation | Qinghe Wang et.al. | 2502.08639 | null |
2025-02-13 | PulseCheck457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models | Xingrui Wang et.al. | 2502.08636 | link |
2025-02-12 | AR Glulam: Accurate Augmented Reality Using Multiple Fiducial Markers for Glulam Fabrication | Alexander Htet Kyaw et.al. | 2502.08566 | null |
2025-02-12 | Brain Latent Progression: Individual-based Spatiotemporal Disease Progression on 3D Brain MRIs via Latent Diffusion | Lemuel Puglisi et.al. | 2502.08560 | link |
2025-02-12 | Human-Centric Foundation Models: Perception, Generation and Agentic Modeling | Shixiang Tang et.al. | 2502.08556 | link |
2025-02-12 | Checkerboard Target Measurement in Unordered Point Clouds with Coloured ICP | June Moh Goo et.al. | 2502.08525 | null |
2025-02-12 | Revisiting 3D LLM Benchmarks: Are We Really Testing 3D Capabilities? | Jiahe Jin et.al. | 2502.08503 | link |
2025-02-12 | CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World | Yankai Fu et.al. | 2502.08449 | null |
2025-02-12 | Augmented Journeys: Interactive Points of Interest for In-Car Augmented Reality | Robin Connor Schramm et.al. | 2502.08437 | null |
2025-02-12 | Not All Frame Features Are Equal: Video-to-4D Generation via Decoupling Dynamic-Static Features | Liying Yang et.al. | 2502.08377 | null |
2025-02-12 | Screener: Self-supervised Pathology Segmentation Model for 3D Medical Images | Mikhail Goncharov et.al. | 2502.08321 | null |
2025-02-12 | BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video | Yu Hong et.al. | 2502.08297 | null |
2025-02-12 | CRISP: A Framework for Cryo-EM Image Segmentation and Processing with Conditional Random Field | Szu-Chi Chung et.al. | 2502.08287 | link |
2025-02-12 | FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis | Wonjoon Jin et.al. | 2502.08244 | null |
2025-02-12 | Primary and secondary motions in an annular plane Couette flow | Rémi Macadré et.al. | 2502.08241 | null |
2025-02-12 | Likelihood-free Model Selection in Cosmic Reionization with Three-dimensional Tomographic 21 cm Lightcone Images | T. Binnie et.al. | 2502.08152 | null |
2025-02-12 | A Cooperative Bearing-Rate Approach for Observability-Enhanced Target Motion Estimation | Canlun Zheng et.al. | 2502.08089 | null |
2025-02-12 | Radiometric Temperature Measurement for Metal Additive Manufacturing via Temperature Emissivity Separation | Ryan W. Penny et.al. | 2502.08088 | null |
2025-02-12 | Interactive Holographic Visualization for 3D Facial Avatar | Tri Tung Nguyen Nguyen et.al. | 2502.08085 | null |
2025-02-12 | Unraveling charmonium mixing scheme for the $ψ(4220)$ and $ψ(4380)$ by a coupled-channel approach | Zi-Long Man et.al. | 2502.08072 | null |
2025-02-12 | Rapid prediction of organisation in engineered corneal, glial and fibroblast tissues using machine learning and biophysical models | Allison E. Andrews et.al. | 2502.08062 | null |
2025-02-12 | Digital Twin for Porous Electrodes in Redox Flow Batteries | Michael S. Emanuel et.al. | 2502.08034 | link |
2025-02-11 | The stellar mass composition of galaxy clusters and dependencies on dark matter halo properties | Daniel Montenegro-Taborda et.al. | 2502.07927 | null |
2025-02-11 | EventEgo3D++: 3D Human Motion Capture from a Head-Mounted Event Camera | Christen Millerdurai et.al. | 2502.07869 | null |
2025-02-11 | Memory Analysis on the Training Course of DeepSeek Models | Ping Zhang et.al. | 2502.07846 | null |
2025-02-11 | The establishment of static digital humans and the integration with spinal models | Fujiao Ju et.al. | 2502.07844 | null |
2025-02-11 | TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation | Jeongyun Kim et.al. | 2502.07840 | link |
2025-02-10 | PDM-SSD: Single-Stage Three-Dimensional Object Detector With Point Dilation | Ao Liang et.al. | 2502.07822 | null |
2025-02-11 | Pippo: High-Resolution Multi-View Humans from a Single Image | Yash Kant et.al. | 2502.07785 | null |
2025-02-11 | MatSwap: Light-aware material transfers in images | Ivan Lopes et.al. | 2502.07784 | null |
2025-02-11 | MeshSplats: Mesh-Based Rendering with Gaussian Splatting Initialization | Rafał Tobiasz et.al. | 2502.07754 | link |
2025-02-11 | HiPoNet: A Topology-Preserving Multi-View Neural Network For High Dimensional Point Cloud and Single-Cell Data | Siddharth Viswanath et.al. | 2502.07746 | link |
2025-02-11 | Matrix3D: Large Photogrammetry Model All-in-One | Yuanxun Lu et.al. | 2502.07685 | null |
2025-02-11 | Multiview Point Cloud Registration Based on Minimum Potential Energy for Free-Form Blade Measurement | Zijie Wu et.al. | 2502.07680 | null |
2025-02-11 | Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors | Lin-Zhuo Chen et.al. | 2502.07615 | null |
2025-02-11 | Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models | Jiacong Xu et.al. | 2502.07601 | null |
2025-02-16 | DSV: Exploiting Dynamic Sparsity to Accelerate Large-Scale Video DiT Training | Xin Tan et.al. | 2502.07590 | null |
2025-02-11 | Spontaneous stochasticity in a 3d Weierstrass-ABC flow | Antoine Barlet et.al. | 2502.07581 | null |
2025-02-13 | Visual-based spatial audio generation system for multi-speaker environments | Xiaojing Liu et.al. | 2502.07538 | null |
2025-02-11 | Revealing Higher-Order Topological Bulk-boundary Correspondence in Bismuth Crystal with Spin-helical Hinge State Loop and Proximity Superconductivity | D. M. Zhao et.al. | 2502.07533 | null |
2025-02-11 | Efficient Continuous Group Convolutions for Local SE(3) Equivariance in 3D Point Clouds | Lisa Weijler et.al. | 2502.07505 | link |
2025-02-11 | Automated Road Extraction and Centreline Fitting in LiDAR Point Clouds | Xinyu Wang et.al. | 2502.07486 | null |
2025-02-11 | Effect of 3d Transition Metal Doping (Mn, Fe, Co, Ni) on the Electronic and Magnetic Properties of Pd Alloys at Low Impurity Concentrations: An Ab initio Study | Irina I. Piyanzina et.al. | 2502.07485 | null |
2025-02-11 | Extended monocular 3D imaging | Zicheng Shen et.al. | 2502.07403 | null |
2025-02-11 | SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection | Jingjie Zhang et.al. | 2502.07384 | link |
2025-02-14 | Supervised contrastive learning for cell stage classification of animal embryos | Yasmine Hachani et.al. | 2502.07360 | null |
2025-02-11 | ERANet: Edge Replacement Augmentation for Semi-Supervised Meniscus Segmentation with Prototype Consistency Alignment and Conditional Self-Training | Siyue Li et.al. | 2502.07331 | null |
2025-02-11 | Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving | Xiang Li et.al. | 2502.07309 | link |
2025-02-11 | Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization | Aditya Vora et.al. | 2502.07278 | null |
2025-02-11 | The Sand Atlas | Ilija Vego et.al. | 2502.07235 | link |
2025-02-11 | Neutron star evolution by combining discontinuous Galerkin and finite volume methods | Ananya Adhikari et.al. | 2502.07204 | null |
2025-02-11 | Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion | Xingpei Ma et.al. | 2502.07203 | null |
2025-02-12 | Space-Aware Instruction Tuning: Dataset and Benchmark for Guide Dog Robots Assisting the Visually Impaired | ByungOk Han et.al. | 2502.07183 | link |
2025-02-11 | Advancing Geological Carbon Storage Monitoring With 3d Digital Shadow Technology | Abhinav Prakash Gahlot et.al. | 2502.07169 | null |
2025-02-11 | Explaining 3D Computed Tomography Classifiers with Counterfactuals | Joseph Paul Cohen et.al. | 2502.07156 | link |
2025-02-10 | Is Long Range Sequential Modeling Necessary For Colorectal Tumor Segmentation? | Abhishek Srivastava et.al. | 2502.07120 | null |
2025-02-10 | Observations and Radiative Transfer Simulations of the Carbon-rich AGB star V Oph with VLTI/MATISSE | Jon Hulberg et.al. | 2502.07092 | null |
2025-02-10 | Comprehensive Analysis of Thermal Dissipation in Lithium-Ion Battery Packs | Xuguang Zhang et.al. | 2502.07070 | null |
2025-02-10 | Uniqueness of Weak Solutions for Biot-Stokes Interactions | George Avalos et.al. | 2502.07061 | null |
2025-02-10 | PrismAvatar: Real-time animated 3D neural head avatars on edge devices | Prashant Raina et.al. | 2502.07030 | null |
2025-02-10 | Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC | Siwei Meng et.al. | 2502.07007 | null |
2025-02-10 | Properties of Turbulent Convection and Large-Scale Flows in a Rotating F-type Star Revealed by 3D Realistic Radiative Hydrodynamic Simulations | Irina N. Kitiashvili et.al. | 2502.07006 | null |
2025-02-13 | Geometry-aware RL for Manipulation of Varying Shapes and Deformable Objects | Tai Hoang et.al. | 2502.07005 | link |
2025-02-10 | Indoor Light and Heat Estimation from a Single Panorama | Guanzhou Ji et.al. | 2502.06973 | null |
2025-02-10 | GAS: Generative Avatar Synthesis from a Single Image | Yixing Lu et.al. | 2502.06957 | null |
2025-02-10 | A fast and robust recipe for modeling non-ideal MHD effects in star-formation simulations | E. Agianoglou et.al. | 2502.06933 | link |
2025-02-09 | A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation | Wang Jiangtao et.al. | 2502.06895 | null |
2025-02-07 | Transfer learning in Scalable Graph Neural Network for Improved Physical Simulation | Siqi Shen et.al. | 2502.06848 | null |
2025-02-05 | Functional 3D Scene Synthesis through Human-Scene Optimization | Yao Wei et.al. | 2502.06819 | null |
2025-02-10 | Visual Agentic AI for Spatial Reasoning with a Dynamic API | Damiano Marsili et.al. | 2502.06787 | null |
2025-02-10 | Social Media Isn’t Just Instagram: A Youth-Envisioned Platform for Meaningful Social Connections | JaeWon Kim et.al. | 2502.06696 | null |
2025-02-10 | Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene | Tai-Yu Pan et.al. | 2502.06682 | null |
2025-02-10 | TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models | Yangguang Li et.al. | 2502.06608 | link |
2025-02-10 | Physically-Based Mesh Generation for Confined 3D Point Clouds Using Flexible Foil Models | Netzer Moriya et.al. | 2502.06541 | null |
2025-02-20 | CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers | D. She et.al. | 2502.06527 | null |
2025-02-10 | SIREN: Semantic, Initialization-Free Registration of Multi-Robot Gaussian Splatting Maps | Ola Shorinwa et.al. | 2502.06519 | null |
2025-02-10 | Three-Dimensional MRI Reconstruction with Gaussian Representations: Tackling the Undersampling Problem | Tengya Peng et.al. | 2502.06510 | null |
2025-02-10 | TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints | Pengyu Long et.al. | 2502.06392 | null |
2025-02-10 | Relativistic Gas Accretion onto Supermassive Black Hole Binaries from Inspiral through Merger | Lorenzo Ennoggi et.al. | 2502.06389 | null |
2025-02-10 | Dispersion of backward-propagating waves in a surface defect on a 3D photonic band gap crystal | Timon J. Vreman et.al. | 2502.06369 | null |
2025-02-10 | FOCUS – Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences | Oliver Boyne et.al. | 2502.06367 | link |
2025-02-10 | Scattering for defocusing cubic NLS under locally damped strong trapping | David Lafontaine et.al. | 2502.06306 | null |
2025-02-14 | Occupancy-SLAM: An Efficient and Robust Algorithm for Simultaneously Optimizing Robot Poses and Occupancy Map | Yingyu Wang et.al. | 2502.06292 | link |
2025-02-17 | Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile | Hangliang Ding et.al. | 2502.06155 | null |
2025-02-10 | Quantum Turbulence Across Dimensions: Crossover from two- to three-dimension | Weican Yang et.al. | 2502.06133 | null |
2025-02-10 | Turbulence in stratified rotating topographic wakes | Jinyuan Liu et.al. | 2502.06129 | null |
2025-02-12 | A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar | Seung-Hyun Song et.al. | 2502.06114 | null |
2025-02-10 | DeepMill: Neural Accessibility Learning for Subtractive Manufacturing | Fanchao Zhong et.al. | 2502.06093 | null |
2025-02-09 | Tailoring the normal and superconducting state properties of ternary scandium tellurides, Sc $_6M$Te$_2$ ($M =$ Fe, Ru and Ir) through chemical substitution | J. N. Graham et.al. | 2502.06063 | null |
2025-02-09 | Observationally derived magnetic field strength and 3D components in the HD 142527 disk | Satoshi Ohashi et.al. | 2502.06030 | null |
2025-02-09 | Generating 3D Binding Molecules Using Shape-Conditioned Diffusion Models with Guidance | Ziqi Chen et.al. | 2502.06027 | null |
2025-02-09 | Universal point spread function engineering for 3D optical information processing | Md Sadman Sakib Rahman et.al. | 2502.06025 | null |
2025-02-09 | Mul2MAR: A Multi-Marker Mobile Augmented Reality Application for Improved Visual Perception | Murat Kurt et.al. | 2502.05953 | null |
2025-02-09 | Topology Optimization considering Shielding and Penetrating Features based on Fictitious Physical Model | Daiki Soma et.al. | 2502.05899 | null |
2025-02-09 | MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation | Zhifei Yang et.al. | 2502.05874 | link |
2025-02-09 | Acquisition through My Eyes and Steps: A Joint Predictive Agent Model in Egocentric Worlds | Lu Chen et.al. | 2502.05857 | null |
2025-02-09 | The combinatorial transverse intersection algebra | Daniel An et.al. | 2502.05856 | null |
2025-02-09 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al. | 2502.05779 | null |
2025-02-11 | Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform | Kyle Gao et.al. | 2502.05769 | null |
2025-02-09 | The role of $4S$-$3D$ mixing in explaining the $ω$-like $Y(2119)$ observed in $e^+e^-$ annihilation to $ρπ$ and $ρ(1450)π$ | Zi-Yue Bai et.al. | 2502.05754 | null |
2025-02-08 | 4D VQ-GAN: Synthesising Medical Scans at Any Time Point for Personalised Disease Progression Modelling of Idiopathic Pulmonary Fibrosis | An Zhao et.al. | 2502.05713 | null |
2025-02-08 | GWRF: A Generalizable Wireless Radiance Field for Wireless Signal Propagation Modeling | Kang Yang et.al. | 2502.05708 | null |
2025-02-08 | A Cost-Benefit Analysis of Additive Manufacturing as a Service | Igor Ivkić et.al. | 2502.05586 | null |
2025-02-08 | Anomalous Reynolds stress and dynamic mechanisms in two-dimensional elasto-inertial turbulence of viscoelastic channel flow | Haotian Cheng et.al. | 2502.05522 | null |
2025-02-08 | Test beam results on 3D pixel sensors for the CMS Tracker upgrade at the High-Luminosity LHC | Clara Lasaosa et.al. | 2502.05521 | null |
2025-02-08 | Unfitted boundary algebraic equation method based on difference potentials and lattice Green’s function in 3D | Qing Xia et.al. | 2502.05507 | null |
2025-02-14 | HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation | Yi Li et.al. | 2502.05485 | null |
2025-02-08 | Vision-in-the-loop Simulation for Deep Monocular Pose Estimation of UAV in Ocean Environment | Maneesha Wickramasuriya et.al. | 2502.05409 | null |
2025-02-08 | A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation | Canxuan Gang et.al. | 2502.05396 | null |
2025-02-07 | NextBestPath: Efficient 3D Mapping of Unseen Environments | Shiyao Li et.al. | 2502.05378 | null |
2025-02-07 | The Type Ia Supernova and AGB-Regulated Interstellar Medium of Massive Galaxies | Rajsekhar Mohapatra et.al. | 2502.05329 | null |
2025-02-07 | ParquetDB: A Lightweight Python Parquet-Based Database | Logan Lang et.al. | 2502.05311 | null |
2025-02-11 | Light curves and spectra for stellar collisions between main-sequence stars in galactic nuclei | Taeho Ryu et.al. | 2502.05265 | null |
2025-02-05 | VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning | Jayram Palamadai et.al. | 2502.05222 | null |
2025-02-07 | AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting | Chung-Ho Wu et.al. | 2502.05176 | null |
2025-02-07 | Fillerbuster: Multi-View Scene Completion for Casual Captures | Ethan Weber et.al. | 2502.05175 | null |
2025-02-07 | Relationship between 2D and 3D Galaxy Stellar Mass and Correlations with Halo Mass | Conghao Zhou et.al. | 2502.05158 | null |
2025-02-07 | 3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery | Xiuyuan Hu et.al. | 2502.05107 | link |
2025-02-07 | DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions | Gorkem Can Ates et.al. | 2502.05091 | null |
2025-02-07 | Differentiable Mobile Display Photometric Stereo | Gawoon Ban et.al. | 2502.05055 | null |
2025-02-07 | GaussRender: Learning 3D Occupancy with Gaussian Rendering | Loick Chambon et.al. | 2502.05040 | link |
2025-02-07 | Assessment of averaged 1D models for column adsorption with 3D computational experiments | Maria Aguareles et.al. | 2502.05029 | null |
2025-02-07 | A Transformation-based Consistent Estimation Framework: Analysis, Design and Applications | Ning Hao et.al. | 2502.05008 | null |
2025-02-07 | OccGS: Zero-shot 3D Occupancy Reconstruction with Semantic and Geometric-Aware Gaussian Splatting | Xiaoyu Zhou et.al. | 2502.04981 | null |
2025-02-07 | Two-Dimensional Lattice-Gas Model for Methane Clathrate Hydrates: Comparative Analysis with Experiments and Three-Dimensional Simulations | Julian Juan et.al. | 2502.04961 | null |
2025-02-07 | Spin dynamics and magnetic excitations of quasi-1D spin chain Ca $_3$ZnMnO$_6$ | Suheon Lee et.al. | 2502.04919 | null |
2025-02-11 | PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression | Feifei Li et.al. | 2502.04843 | null |
2025-02-07 | DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection | Mingxuan Yan et.al. | 2502.04804 | null |
2025-02-07 | Practical implementation of a chiral phononic crystal demonstrator with ultra-low frequency bandgap | Line Mardini et.al. | 2502.04775 | null |
2025-02-18 | Statistical Methods and Modal Decompositions for Gridded and Scattered Data: Meshless Statistics and Meshless Data Driven Modal Analysis | Miguel A. Mendez et.al. | 2502.04765 | null |
2025-02-07 | SC-OmniGS: Self-Calibrating Omnidirectional Gaussian Splatting | Huajian Huang et.al. | 2502.04734 | null |
2025-02-12 | Core to Cosmic Edge: SIMBA-C’s New Take on Abundance Profiles in the Intragroup Medium at z = 0 | Aviv Padawer-Blatt et.al. | 2502.04657 | null |
2025-02-10 | Building Rome with Convex Optimization | Haoyu Han et.al. | 2502.04640 | null |
2025-02-07 | High-Speed Dynamic 3D Imaging with Sensor Fusion Splatting | Zihao Zou et.al. | 2502.04630 | null |
2025-02-07 | Effects of Curved Superconducting Magnets on Beam Stability in a Compact Ion Therapy Synchrotron | Hannah X. Q. Norman et.al. | 2502.04617 | null |
2025-02-07 | An Airy Tale at Large $N$ | Nikolay Bobev et.al. | 2502.04606 | null |
2025-02-06 | Ego vs. Exo and Active vs. Passive: Investigating the Effects of Viewpoint and Navigation on Spatial Immersion and Understanding in Immersive Storytelling | Tao Lu et.al. | 2502.04542 | null |
2025-02-06 | Developable Ruled Surfaces Generated by the Curvature Axis of a Curve | Ferhat Taş et.al. | 2502.04523 | null |
2025-02-06 | Fast Video Generation with Sliding Tile Attention | Peiyuan Zhang et.al. | 2502.04507 | null |
2025-02-06 | Measuring Physical Plausibility of 3D Human Poses Using Physics Simulation | Nathan Louis et.al. | 2502.04483 | link |
2025-02-06 | Efficient variable-length hanging tether parameterization for marsupial robot planning in 3D environments | S. Martínez-Rozas et.al. | 2502.04467 | null |
2025-02-05 | Towards Fair Medical AI: Adversarial Debiasing of 3D CT Foundation Embeddings | Guangyao Zheng et.al. | 2502.04386 | link |
2025-02-05 | TexLiDAR: Automated Text Understanding for Panoramic LiDAR Data | Naor Cohen et.al. | 2502.04385 | link |
2025-02-05 | DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization | Zhenglin Zhou et.al. | 2502.04370 | null |
2025-02-05 | Predicting 3D Motion from 2D Video for Behavior-Based VR Biometrics | Mingjun Li et.al. | 2502.04361 | null |
2025-02-06 | sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views | Eyvaz Najafli et.al. | 2502.04318 | null |
2025-02-06 | Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction | Chris Choy et.al. | 2502.04317 | null |
2025-02-06 | MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation | Jinbo Xing et.al. | 2502.04299 | null |
2025-02-06 | GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation | Weihang Li et.al. | 2502.04293 | null |
2025-02-06 | The Effects of Kinematic MHD on the Atmospheric Circulation of Eccentric Hot Jupiters | Hayley Beltz et.al. | 2502.04169 | null |
2025-02-06 | HD-EPIC: A Highly-Detailed Egocentric Video Dataset | Toby Perrett et.al. | 2502.04144 | null |
2025-02-06 | Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation | Jiahao Lu et.al. | 2502.04139 | null |
2025-02-06 | Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation | Yang Chen et.al. | 2502.04111 | null |
2025-02-13 | VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output | Eason Chen et.al. | 2502.04103 | null |
2025-02-06 | Soft and Highly-Integrated Optical Fiber Bending Sensors for Proprioception in Multi-Material 3D Printed Fingers | Ellis Capp et.al. | 2502.04094 | null |
2025-02-06 | 3D Prior is All You Need: Cross-Task Few-shot 2D Gaze Estimation | Yihua Cheng et.al. | 2502.04074 | null |
2025-02-13 | MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation | YoonJe Kang et.al. | 2502.03966 | null |
2025-02-06 | A Flexible FBG-Based Contact Force Sensor for Robotic Gripping Systems | Wenjie Lai et.al. | 2502.03914 | null |
2025-02-06 | LeAP: Consistent multi-domain 3D labeling using Foundation Models | Simon Gebraad et.al. | 2502.03901 | null |
2025-02-06 | Position: Untrained Machine Learning for Anomaly Detection | Juan Du et.al. | 2502.03876 | null |
2025-02-06 | A microscopic model of de Sitter spacetime with an observer | Damiano Tietto et.al. | 2502.03869 | null |
2025-02-06 | Adapting Human Mesh Recovery with Vision-Language Feedback | Chongyang Xu et.al. | 2502.03836 | null |
2025-02-06 | Multiple Invertible and Partial-Equivariant Function for Latent Vector Transformation to Enhance Disentanglement in VAEs | Hee-Jun Jung et.al. | 2502.03740 | null |
2025-02-05 | Bridging high resolution sub-cellular imaging with physiologically relevant engineered tissues | Yasaman Kargar Gaz Kooh et.al. | 2502.03661 | null |
2025-02-05 | Polarons and Exciton-Polarons in Two-Dimensional Polar Materials | V. Shahnazaryan et.al. | 2502.03657 | null |
2025-02-05 | Coexistence of 3D and quasi-2D Fermi surfaces driven by orbital selective Kondo scattering in UTe $_2$ | Byungkyun Kang et.al. | 2502.03646 | null |
2025-02-05 | Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach | Yunuo Chen et.al. | 2502.03639 | null |
2025-02-05 | Inviscid limit on $L^p$ -based Sobolev conormal spaces for the 3D Navier-Stokes equations with the Navier boundary conditions | Mustafa Sencer Aydın et.al. | 2502.03599 | null |
2025-02-05 | EnVisionVR: A Scene Interpretation Tool for Visual Accessibility in Virtual Reality | Junlong Chen et.al. | 2502.03564 | null |
2025-02-05 | Realistic predictions for Gaia black hole discoveries: comparison of isolated binary and dynamical formation models | Pranav Nagarajan et.al. | 2502.03527 | null |
2025-02-05 | Mapping and Localization Using LiDAR Fiducial Markers | Yibo Liu et.al. | 2502.03510 | null |
2025-02-05 | Enhancing Free-hand 3D Photoacoustic and Ultrasound Reconstruction using Deep Learning | SiYeoul Lee et.al. | 2502.03505 | null |
2025-02-05 | Seeing World Dynamics in a Nutshell | Qiuhong Shen et.al. | 2502.03465 | link |
2025-02-05 | SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living | Arkaprava Sinha et.al. | 2502.03459 | null |
2025-02-06 | Clustering of the extreme: A theoretical description of weak lensing critical points power spectra in the mildly nonlinear regime | Zhengyangguang Gong et.al. | 2502.03457 | null |
2025-02-05 | Kineto-Dynamical Planning and Accurate Execution of Minimum-Time Maneuvers on Three-Dimensional Circuits | Mattia Piccinini et.al. | 2502.03454 | null |
2025-02-05 | Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics | Xuan Li et.al. | 2502.03449 | null |
2025-02-05 | Linearized Optimal Transport pyLOT Library: A Toolkit for Machine Learning on Point Clouds | Jun Linwu et.al. | 2502.03439 | null |
2025-02-05 | A Beam’s Eye View to Fluence Maps 3D Network for Ultra Fast VMAT Radiotherapy Planning | Simon Arberet et.al. | 2502.03360 | null |
2025-02-05 | Ferrers bar response models: a grid calculation for Galactic models | A. Silva-Castro et.al. | 2502.03344 | null |
2025-02-05 | First experiments with ultrashort, circularly polarized soft X-ray pulses at FLASH2 | S. Marotzke et.al. | 2502.03301 | null |
2025-02-05 | Practical Introduction to FEM with GMSH: A MATLAB/Octave Perspective | Victor Dominguez et.al. | 2502.03248 | null |
2025-02-05 | JAMMit! Monolithic 3D-Printing of a Bead Jamming Soft Pneumatic Arm | Yao Yao et.al. | 2502.03232 | null |
2025-02-05 | GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM | Mingrui Li et.al. | 2502.03228 | null |
2025-02-05 | iVISPAR – An Interactive Visual-Spatial Reasoning Benchmark for VLMs | Julius Mayer et.al. | 2502.03214 | link |
2025-02-05 | MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent | Xinyao Liao et.al. | 2502.03207 | null |
2025-02-10 | CreepyCoCreator? Investigating AI Representation Modes for 3D Object Co-Creation in Virtual Reality | Julian Rasch et.al. | 2502.03069 | null |
2025-02-05 | Nuclear Stellar Disk-like Nature in the Kinematics of SiO Maser Stars around Sagittarius A* | Masato Tsuboi et.al. | 2502.03024 | null |
2025-02-05 | Every Angle Is Worth A Second Glance: Mining Kinematic Skeletal Structures from Multi-view Joint Cloud | Junkun Jiang et.al. | 2502.02936 | null |
2025-02-05 | Three-dimensional simulations of accretion disks in pre-CE systems | Ana L. Juarez-Garcia et.al. | 2502.02933 | null |
2025-02-05 | AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality | Brandon Woodard et.al. | 2502.02929 | null |
2025-02-05 | Revealing the orbital origins of exotic electronic states with Ti substitution in kagome superconductor CsV3Sb5 | Zihao Huang et.al. | 2502.02923 | null |
2025-02-05 | PoleStack: Robust Pole Estimation of Irregular Objects from Silhouette Stacking | Jacopo Villa et.al. | 2502.02907 | null |
2025-02-05 | INST-Sculpt: Interactive Stroke-based Neural SDF Sculpting | Fizza Rubab et.al. | 2502.02891 | null |
2025-02-05 | OceanChat: The Effect of Virtual Conversational AI Agents on Sustainable Attitude and Behavior Change | Pat Pataranutaporn et.al. | 2502.02863 | null |
2025-02-04 | 3D Foundation AI Model for Generalizable Disease Detection in Head Computed Tomography | Weicheng Zhu et.al. | 2502.02779 | null |
2025-02-04 | Planning with affordances: Integrating learned affordance models and symbolic planning | Rajesh Mangannavar et.al. | 2502.02768 | null |
2025-02-04 | Adaptive Voxel-Weighted Loss Using L1 Norms in Deep Neural Networks for Detection and Segmentation of Prostate Cancer Lesions in PET/CT Images | Obed Korshie Dzikunu et.al. | 2502.02756 | link |
2025-02-04 | Computational and Analytical Optimization of Helicon Antennas with a Fast Full Wave Solver Exploiting Azimuthal Fourier Decomposition | Marcel Granetzny et.al. | 2502.02733 | null |
2025-02-04 | Automated split Hopkinson pressure bar for high throughput dynamic experiments | Mouliswar Ramakumaresan et.al. | 2502.02729 | null |
2025-02-04 | A Parareal in time numerical method for the collisional Vlasov equation in the hyperbolic scaling | Tino Laidin et.al. | 2502.02704 | null |
2025-02-04 | New stellar bow shocks and bubbles found around runaway stars | M. Carretero-Castrillo et.al. | 2502.02658 | null |
2025-02-04 | SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification | Yifu Tao et.al. | 2502.02657 | null |
2025-02-03 | Classical 1/3 Nusselt number scaling in highly turbulent compressible convection | Harshit Tiwari et.al. | 2502.02611 | null |
2025-02-04 | Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling | Xiaowen Qiu et.al. | 2502.02590 | null |
2025-02-04 | The Classical-to-Quantum Crossover in strain-induced ferroelectric transition in SrTiO $_3$ membranes | Jiarui Li et.al. | 2502.02586 | null |
2025-02-04 | Learning the RoPEs: Better 2D and 3D Position Encodings with STRING | Connor Schenck et.al. | 2502.02562 | null |
2025-02-09 | Particle Trajectory Representation Learning with Masked Point Modeling | Sam Young et.al. | 2502.02558 | null |
2025-02-04 | Energy field of critical Ising model and examples of singular fields in QFT | Christophe Garban et.al. | 2502.02554 | null |
2025-02-04 | Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation | Junha Lee et.al. | 2502.02548 | null |
2025-02-04 | Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation | Jian Liu et.al. | 2502.02525 | link |
2025-02-04 | High-Fidelity Human Avatars from Laptop Webcams using Edge Compute | Akash Haridas et.al. | 2502.02468 | null |
2025-02-09 | Towards Consistent and Controllable Image Synthesis for Face Editing | Mengting Wei et.al. | 2502.02465 | null |
2025-02-04 | LLMER: Crafting Interactive Extended Reality Worlds with JSON Data Generated by Large Language Models | Jiangong Chen et.al. | 2502.02441 | link |
2025-02-07 | Transolver++: An Accurate Neural Solver for PDEs on Million-Scale Geometries | Huakun Luo et.al. | 2502.02414 | null |
2025-02-04 | Extending SEEDS to a Supervoxel Algorithm for Medical Image Analysis | Chenhui Zhao et.al. | 2502.02409 | link |
2025-02-04 | Direct observation of the exciton polaron by serial femtosecond crystallography on single CsPbBr $_3$ quantum dots | Zhou Shen et.al. | 2502.02343 | null |
2025-02-04 | Geometric Neural Process Fields | Wenzhe Yin et.al. | 2502.02338 | null |
2025-02-04 | Event-aided Semantic Scene Completion | Shangwei Guo et.al. | 2502.02334 | link |
2025-02-04 | Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features | Hsin-Cheng Lu et.al. | 2502.02322 | link |
2025-02-04 | MAGNNET: Multi-Agent Graph Neural Network-based Efficient Task Allocation for Autonomous Vehicles with Deep Reinforcement Learning | Lavanya Ratnabala et.al. | 2502.02311 | null |
2025-02-05 | GP-GS: Gaussian Processes for Enhanced Gaussian Splatting | Zhihao Guo et.al. | 2502.02283 | link |
2025-02-04 | Rotation-Adaptive Point Cloud Domain Generalization via Intricate Orientation Learning | Bangzhen Liu et.al. | 2502.02247 | null |
2025-02-04 | Femtosecond charge and spin dynamics in CoPt alloys | Martin Pavelka et.al. | 2502.02240 | null |
2025-02-04 | ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion | Nissim Maruani et.al. | 2502.02187 | null |
2025-02-04 | DeepForest: Sensing Into Self-Occluding Volumes of Vegetation With Aerial Imaging | Mohamed Youssef et.al. | 2502.02171 | link |
2025-02-09 | Progressive Correspondence Regenerator for Robust 3D Registration | Guiyu Zhao et.al. | 2502.02163 | null |
2025-02-04 | DOC-Depth: A novel approach for dense depth ground truth generation | Simon de Moreau et.al. | 2502.02144 | null |
2025-02-04 | Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation | JooHyun Kwon et.al. | 2502.02091 | null |
2025-02-04 | Multimaterial topology optimization for finite strain elastoplasticity: theory, methods, and applications | Yingqi Jia et.al. | 2502.02052 | null |
2025-02-04 | ReMiDi: Reconstruction of Microstructure Using a Differentiable Diffusion MRI Simulator | Prathamesh Pradeep Khole et.al. | 2502.01988 | null |
2025-02-04 | Online Adaptive Traversability Estimation through Interaction for Unstructured, Densely Vegetated Environments | Fabio A. Ruetz et.al. | 2502.01987 | link |
2025-02-04 | DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification | Weijia Cao et.al. | 2502.01986 | null |
2025-02-04 | LAYOUTDREAMER: Physics-guided Layout for Text-to-3D Compositional Scene Generation | Yang Zhou et.al. | 2502.01949 | null |
2025-02-04 | Wake-Informed 3D Path Planning for Autonomous Underwater Vehicles Using A* and Neural Network Approximations | Zachary Cooper-Baldock et.al. | 2502.01918 | null |
2025-02-04 | INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy | Nastaran Darabi et.al. | 2502.01896 | null |
2025-02-04 | Turbulent gas-rich discs at high redshift: origin of thick stellar discs through 3D ‘baryon sloshing’ | Joss Bland-Hawthorn et.al. | 2502.01895 | null |
2025-02-04 | SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset | Goodarz Mehr et.al. | 2502.01894 | link |
2025-02-03 | Geometric Framework for 3D Cell Segmentation Correction | Peter Chen et.al. | 2502.01890 | null |
2025-02-03 | Fully discrete analysis of the Galerkin POD neural network approximation with application to 3D acoustic wave scattering | Jürgen Dölz et.al. | 2502.01859 | null |
2025-02-03 | Reliability-Driven LiDAR-Camera Fusion for Robust 3D Object Detection | Reza Sadeghian et.al. | 2502.01856 | null |
2025-02-03 | Learning Fine-to-Coarse Cuboid Shape Abstraction | Gregor Kobsik et.al. | 2502.01855 | null |
2025-02-03 | Robust virtual element methods for 3D stress-assisted diffusion problems | Andres E. Rubiano et.al. | 2502.01851 | null |
2025-02-11 | UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping | Aashish Rai et.al. | 2502.01846 | null |
2025-02-03 | Sparse Measurement Medical CT Reconstruction using Multi-Fused Block Matching Denoising Priors | Maliha Hossain et.al. | 2502.01832 | null |
2025-02-03 | Scalable 3D Gaussian Splatting-Based RF Signal Spatial Propagation Modeling | Kang Yang et.al. | 2502.01826 | null |
2025-02-03 | PolyhedronNet: Representation Learning for Polyhedra with Surface-attributed Graph | Dazhou Yu et.al. | 2502.01814 | link |
2025-02-03 | Coronal energy release by MHD avalanches III. Identification of a reconnection outflow from a nanoflare | G. Cozzo et.al. | 2502.01796 | null |
2025-02-03 | Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity | Haocheng Xi et.al. | 2502.01776 | null |
2025-02-03 | Coarse-to-Fine 3D Keyframe Transporter | Xupeng Zhu et.al. | 2502.01773 | null |
2025-02-03 | Impact of Altitude, Bandwidth, and NLOS Bias on TDOA-Based 3D UAV Localization: Experimental Results and CRLB Analysis | Cole Dickerson et.al. | 2502.01771 | null |
2025-02-03 | Generating Multi-Image Synthetic Data for Text-to-Image Customization | Nupur Kumari et.al. | 2502.01720 | null |
2025-02-02 | A Novel Real-Time Full-Color 3D Holographic (Diffractive) Video Capture, Processing, and Transmission Pipeline Using Off-The-Shelf Hardware | Ankur Samanta et.al. | 2502.01695 | null |
2025-02-06 | Fast Direct: Query-Efficient Online Black-box Guidance for Diffusion-model Target Generation | Kim Yong Tan et.al. | 2502.01692 | link |
2025-02-01 | Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding | Jingming Xia et.al. | 2502.01666 | null |
2025-02-03 | From pre-transit to post-eclipse: investigating the impact of 3D temperature, chemistry, and dynamics on high-resolution emission spectra of the ultra-hot Jupiter WASP-76b | Joost P. Wardenier et.al. | 2502.01606 | null |
2025-02-03 | FireCastNet: Earth-as-a-Graph for Seasonal Fire Prediction | Dimitrios Michail et.al. | 2502.01550 | null |
2025-02-03 | VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion | Shaoting Zhu et.al. | 2502.01536 | null |
2025-02-03 | Transformers trained on proteins can learn to attend to Euclidean distance | Isaac Ellmen et.al. | 2502.01533 | link |
2025-02-03 | Regularized interpolation in 4D neural fields enables optimization of 3D printed geometries | Christos Margadji et.al. | 2502.01517 | link |
2025-02-03 | Three-dimensional holographic imaging of incoherent objects through scattering media | YoonSeok Baek et.al. | 2502.01475 | null |
2025-02-03 | Thermodynamic nonequilibrium effects in three-dimensional high-speed compressible flows: Multiscale modeling and simulation via the discrete Boltzmann method | Qinghong Guo et.al. | 2502.01446 | null |
2025-02-13 | Evolving Symbolic 3D Visual Grounder with Weakly Supervised Reflection | Boyu Mi et.al. | 2502.01401 | link |
2025-02-03 | Bayesian Approximation-Based Trajectory Prediction and Tracking with 4D Radar | Dong-In Kim et.al. | 2502.01357 | null |
2025-02-04 | Quasi-Conformal Convolution : A Learnable Convolution for Deep Learning on Riemann Surfaces | Han Zhang et.al. | 2502.01356 | null |
2025-02-03 | Quiver Yangians as Coulomb branch algebras | Tiantai Chen et.al. | 2502.01323 | null |
2025-02-03 | Ideal MHD. Part II: Rigidity from infinity for ideal Alfvén waves in 3D thin domains | Mengni Li et.al. | 2502.01139 | null |
2025-02-03 | WonderHuman: Hallucinating Unseen Parts in Dynamic 3D Human Reconstruction | Zilong Wang et.al. | 2502.01045 | null |
2025-02-03 | Multi-Object Active Search and Tracking by Multiple Agents in Untrusted, Dynamically Changing Environments | Mingi Jeong et.al. | 2502.01041 | null |
2025-02-03 | Near-Field Integrated Sensing and Communications for Secure UAV Networks | Jingjing Zhao et.al. | 2502.01003 | null |
2025-02-03 | Multi-Resolution SAR and Optical Remote Sensing Image Registration Methods: A Review, Datasets, and Future Perspectives | Wenfei Zhang et.al. | 2502.01002 | null |
2025-02-03 | Revisiting turbulent properties of solar convection with 3D radiative hydrodynamic modeling | Irina N. Kitiashvili et.al. | 2502.00974 | null |
2025-02-02 | SAM-guided Pseudo Label Enhancement for Multi-modal 3D Semantic Segmentation | Mingyu Yang et.al. | 2502.00960 | null |
2025-02-04 | Hypo3D: Exploring Hypothetical Reasoning in 3D | Ye Mao et.al. | 2502.00954 | null |
2025-02-02 | Exploring Multiscale Navigation of Homogeneous and Dense Objects with Progressive Refinement in Virtual Reality | Leonardo Pavanatto et.al. | 2502.00941 | null |
2025-02-02 | QUOKKA-based understanding of outflows (QED) – III. Outflow loading and phase structure as a function of galactic environment | Aditi Vijayan et.al. | 2502.00929 | null |
2025-02-02 | Mathematical Cell Deployment Optimization for Capacity and Coverage of Ground and UAV Users | Saeed Karimi-Bidhendi et.al. | 2502.00928 | null |
2025-02-05 | ToddlerBot: Open-Source ML-Compatible Humanoid Platform for Loco-Manipulation | Haochen Shi et.al. | 2502.00893 | null |
2025-02-02 | Planet Purifiers: A Collaborative Immersive Experience Proposing New Modifications to HOMER and Fishing Reel Interaction Techniques | Alexander Giovannelli et.al. | 2502.00888 | null |
2025-02-02 | Investigating the Influence of Playback Interactivity during Guided Tours for Asynchronous Collaboration in Virtual Reality | Alexander Giovannelli et.al. | 2502.00880 | null |
2025-02-02 | Doped resonating valence bond states: How robust are the spin ice phases in 3D Rydberg arrays | Jingya Wang et.al. | 2502.00836 | null |
2025-02-09 | Bilinear Subspace Variational Bayesian Inference for Joint Scattering Environment Sensing and Data Recovery in ISAC Systems | An Liu et.al. | 2502.00811 | null |
2025-02-02 | Environment-Driven Online LiDAR-Camera Extrinsic Calibration | Zhiwei Huang et.al. | 2502.00801 | null |
2025-02-02 | Fabrication of Fibers with Complex Features Using Thermal Drawing of 3D-Printed Preforms | Ali Anil Demircali et.al. | 2502.00741 | null |
2025-02-02 | PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation | Qixuan Li et.al. | 2502.00708 | null |
2025-02-02 | Measurement and Analysis of Scattering From Building Surfaces at Millimeter-Wave Frequency | Yulu Guo et.al. | 2502.00699 | null |
2025-02-02 | EmoTalkingGaussian: Continuous Emotion-conditioned Talking Head Synthesis | Junuk Cha et.al. | 2502.00654 | null |
2025-02-02 | Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation | Bin Xie et.al. | 2502.00630 | null |
2025-02-02 | Distribution-aware Fairness Learning in Medical Image Segmentation From A Control-Theoretic Perspective | Yujin Oh et.al. | 2502.00619 | null |
2025-02-01 | DeepUKF-VIN: Adaptively-tuned Deep Unscented Kalman Filter for 3D Visual-Inertial Navigation based on IMU-Vision-Net | Khashayar Ghanizadegan et.al. | 2502.00575 | null |
2025-02-01 | Vision-Language Modeling in PET/CT for Visual Grounding of Positive Findings | Zachary Huemann et.al. | 2502.00528 | null |
2025-02-01 | Stabilizability of 2D and 3D Navier-Stokes equations with memory around a non-constant steady state | Wasim Akram et.al. | 2502.00517 | null |
2025-02-01 | PyMOLfold: Interactive Protein and Ligand Structure Prediction in PyMOL | Colby T. Ford et.al. | 2502.00508 | link |
2025-02-01 | Revealing Spin and Spatial Symmetry Decoupling: New Insights into Magnetic Systems with Dzyaloshinskii-Moriya Interaction | Yuxuan Mu et.al. | 2502.00457 | null |
2025-02-01 | Enhancing Highway Safety: Accident Detection on the A9 Test Stretch Using Roadside Sensors | Walter Zimmer et.al. | 2502.00402 | null |
2025-02-01 | FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps | Maximilian Leitenstern et.al. | 2502.00395 | link |
2025-02-01 | Shape from Semantics: 3D Shape Generation from Multi-View Semantics | Liangchen Li et.al. | 2502.00360 | null |
2025-02-01 | Embodied Intelligence for 3D Understanding: A Survey on 3D Scene Question Answering | Zechuan Li et.al. | 2502.00342 | null |
2025-02-01 | MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model | Jihyeok Kim et.al. | 2502.00315 | null |
2025-02-11 | Electron Acceleration in Carbon Nanotubes | Cristian Bontoiu et.al. | 2502.00183 | null |
2025-01-31 | Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation | Rohan Chacko et.al. | 2502.00173 | null |
2025-01-31 | Multimodal MRI-Ultrasound AI for Prostate Cancer Detection Outperforms Radiologist MRI Interpretation: A Multi-Center Study | Hassan Jahanandish et.al. | 2502.00146 | null |
2025-01-31 | TRAPPIST-1 d: Exo-Venus, Exo-Earth or Exo-Dead? | M. J. Way et.al. | 2502.00132 | null |
2025-01-31 | SpikingRTNH: Spiking Neural Network for 4D Radar Object Detection | Dong-Hee Paek et.al. | 2502.00074 | link |
2025-01-27 | HoloGraphs: An Interactive Physicalization for Dynamic Graphs | Daniel Pahr et.al. | 2502.00044 | null |
2025-01-31 | A topological theory for qLDPC: non-Clifford gates and magic state fountain on homological product codes with constant rate and beyond the $N^{1/3}$ distance barrier | Guanyu Zhu et.al. | 2501.19375 | null |
2025-01-31 | Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping | Yiming Huang et.al. | 2501.19319 | link |
2025-01-31 | Experimental Investigation of Vortex Shedding in Superfluid $^4$ He | Brichet Lyse et.al. | 2501.19292 | null |
2025-01-31 | Imagine with the Teacher: Complete Shape in a Multi-View Distillation Way | Zhanpeng Luo et.al. | 2501.19270 | null |
2025-01-31 | Medical Semantic Segmentation with Diffusion Pretrain | David Li et.al. | 2501.19265 | null |
2025-01-31 | Single cell resolution 3D imaging and segmentation within intact live tissues | G. Paci et.al. | 2501.19203 | link |
2025-01-31 | RaySplats: Ray Tracing based Gaussian Splatting | Krzysztof Byrski et.al. | 2501.19196 | link |
2025-01-31 | Fractons from covariant higher-rank 3D BF theory | Erica Bertolini et.al. | 2501.19154 | null |
2025-01-31 | JGHand: Joint-Driven Animatable Hand Avater via 3D Gaussian Splatting | Zhoutao Sun et.al. | 2501.19088 | null |
2025-01-31 | Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields | Xingyu Miao et.al. | 2501.19084 | link |
2025-01-31 | TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs | Yan Sun et.al. | 2501.19057 | null |
2025-01-31 | Virtual airways heatmaps to optimize point of entry location in lung biopsy planning systems | Debora Gil et.al. | 2501.19003 | null |
2025-01-31 | OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation | Yuchen Lin et.al. | 2501.18982 | null |
2025-01-31 | Three-dimensional chiral active Ornstein-Uhlenbeck model for helical motion of microorganisms | Leon Lettermann et.al. | 2501.18927 | null |
2025-01-31 | Asymptotical Behavior of Global Solutions of the Navier-Stokes-Korteweg Equations with Respect to Capillarity Number at Infinity | Fei Jiang et.al. | 2501.18902 | null |
2025-01-31 | Project-and-Fuse: Improving RGB-D Semantic Segmentation via Graph Convolution Networks | Xiaoyan Jiang et.al. | 2501.18851 | null |
2025-01-31 | Reinforcement Learning of Flexible Policies for Symbolic Instructions with Adjustable Mapping Specifications | Wataru Hatanaka et.al. | 2501.18848 | null |
2025-01-31 | An Adversarial Approach to Register Extreme Resolution Tissue Cleared 3D Brain Images | Abdullah Naziba et.al. | 2501.18815 | link |
2025-01-30 | Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion | Vitor Guizilini et.al. | 2501.18804 | null |
2025-01-30 | Multispectral 3D mapping on a Roman sculpture to study ancient polychromy | Francesca Uccheddu et.al. | 2501.18786 | null |
2025-01-30 | Vibr-eau: Emulating Fluid Behavior in Vessel Handling through Vibrotactile Actuators | Frank Wencheng Liu et.al. | 2501.18755 | null |
2025-01-30 | A Novel Method to Estimate the FUV Flux and a Catalogue for Star-Hosting Discs in Nearby Star-Forming Regions | Rossella Anania et.al. | 2501.18752 | null |
2025-01-30 | Integrating LMM Planners and 3D Skill Policies for Generalizable Manipulation | Yuelei Li et.al. | 2501.18733 | null |
2025-01-30 | Strong and Controllable 3D Motion Generation | Canxuan Gang et.al. | 2501.18726 | null |
2025-01-30 | Full-Head Segmentation of MRI with Abnormal Brain Anatomy: Model and Data Release | Andrew M Birnbaum et.al. | 2501.18716 | link |
2025-01-30 | CRexit: how different cosmic ray transport modes affect thermal instability in the circumgalactic medium | Matthias Weber et.al. | 2501.18678 | null |
2025-02-07 | Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting | Yansong Qu et.al. | 2501.18672 | null |
2025-01-29 | 3D Reconstruction of Shoes for Augmented Reality | Pratik Shrestha et.al. | 2501.18643 | null |
2025-01-27 | Deformable Beta Splatting | Rong Liu et.al. | 2501.18630 | link |
2025-01-30 | Foundational Models for 3D Point Clouds: A Survey and Outlook | Vishal Thengane et.al. | 2501.18594 | null |
2025-01-30 | DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models | Ruofan Liang et.al. | 2501.18590 | null |
2025-01-30 | Transverse spin photocurrents in ultrathin topological insulator films | Shahrzad Movafagh et.al. | 2501.18547 | null |
2025-01-30 | HSRMamba: Contextual Spatial-Spectral State Space Model for Single Hyperspectral Super-Resolution | Shi Chen et.al. | 2501.18500 | null |
2025-01-30 | Comprehensive Enumeration of Three-Dimensional Photonic Crystals Enabled through Deep Learning Assisted Fourier Synthesis | Congcong Cui et.al. | 2501.18495 | null |
2025-01-30 | Nonequilibrium friction and free energy estimates for kinetic coarse-graining – Driven particles in responsive media | Sebastian Milster et.al. | 2501.18484 | null |
2025-01-30 | Segmentation of cracks in 3d images of fiber reinforced concrete using deep learning | Anna Nowacka et.al. | 2501.18405 | null |
2025-01-30 | Cracks in concrete | Tin Barisin et.al. | 2501.18376 | null |
2025-01-30 | Transductions of Graph Classes Admitting Product Structure | Petr Hliněný et.al. | 2501.18326 | null |
2025-01-30 | Surface Defect Identification using Bayesian Filtering on a 3D Mesh | Matteo Dalle Vedove et.al. | 2501.18315 | null |
2025-01-30 | Simulation of microstructures and machine learning | Katja Schladitz et.al. | 2501.18313 | null |
2025-01-30 | The iToBoS dataset: skin region images extracted from 3D total body photographs for lesion detection | Anup Saha et.al. | 2501.18270 | link |
2025-01-30 | Ambisonics Binaural Rendering via Masked Magnitude Least Squares | Or Berebi et.al. | 2501.18224 | null |
2025-01-30 | IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain | Zhe Wang et.al. | 2501.18162 | null |
2025-01-30 | Volumetric modulated arc therapy or step-shoot IMRT? A 4D dosimetry study of motion effect in lung SBRT using a dynamic virtual patient model | Tianjun Ma et.al. | 2501.18153 | null |
2025-01-30 | StructuredField: Unifying Structured Geometry and Radiance Field | Kaiwen Song et.al. | 2501.18152 | null |
2025-01-30 | Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems | Liudi Yang et.al. | 2501.18110 | null |
2025-01-30 | High order-accurate solution of scattering integral equations with unbounded solutions at corners | Constantine Sideris et.al. | 2501.18065 | null |
2025-01-29 | Five-dimensional single-shot fluorescence imaging using a polarized Fourier light-field microscope | Oumeng Zhang et.al. | 2501.18047 | null |
2025-01-31 | VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting | Mateusz Nowak et.al. | 2501.17978 | null |
2025-01-29 | TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection | Lei Cheng et.al. | 2501.17977 | link |
2025-01-29 | A fully adaptive, high-order, fast Poisson solver for complex two-dimensional geometries | Daniel Fortunato et.al. | 2501.17967 | null |
2025-01-29 | Giant orbital Hall effect due to the bulk states of 3D topological insulators | James H. Cullen et.al. | 2501.17919 | null |
2025-01-29 | A comment on “Why is the Galactic disk so cool?”, by Hamilton et al | J A Sellwood et.al. | 2501.17907 | null |
2025-01-29 | Visualization of Organ Movements Using Automatic Region Segmentation of Swallowing CT | Yukihiro Michiwaki et.al. | 2501.17897 | null |
2025-01-28 | ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models | Ruiqi Xu et.al. | 2501.17895 | null |
2025-01-28 | Object Detection with Deep Learning for Rare Event Search in the GADGET II TPC | Tyler Wheeler et.al. | 2501.17892 | null |
2025-01-28 | VidSole: A Multimodal Dataset for Joint Kinetics Quantification and Disease Detection with Deep Learning | Archit Kambhamettu et.al. | 2501.17890 | null |
2025-01-29 | From Sparse to Dense: Toddler-inspired Reward Transition in Goal-Oriented Reinforcement Learning | Junseok Park et.al. | 2501.17842 | null |
2025-01-29 | SSF: Sparse Long-Range Scene Flow for Autonomous Driving | Ajinkya Khoche et.al. | 2501.17821 | link |
2025-01-29 | A trilinear quantum dot architecture for semiconductor spin qubits | R. Li et.al. | 2501.17814 | null |
2025-01-29 | Cell Deformation Signatures along the Apical-Basal Axis: A 3D Continuum Mechanics Shell Model | Jairo M. Rojas et.al. | 2501.17810 | null |
2025-01-29 | CrowdSplat: Exploring Gaussian Splatting For Crowd Rendering | Xiaohan Sun et.al. | 2501.17792 | link |
2025-01-29 | On the stability of viscous three-dimensional rotating Couette flow | Michele Coti Zelati et.al. | 2501.17735 | null |
2025-01-29 | A Multi-Dimensional Cathodoluminescence Detector with 3D Printed Micro-Optics on a Fiber | Paul H. Bittorf et.al. | 2501.17723 | null |
2025-01-29 | Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment | Zixue Zeng et.al. | 2501.17690 | link |
2025-01-29 | Topological insulator constrictions – Dirac particles in a magneto-chiral box | Michael Barth et.al. | 2501.17687 | null |
2025-01-29 | FeatureGS: Eigenvalue-Feature Optimization in 3D Gaussian Splatting for Geometrically Accurate and Artifact-Reduced Reconstruction | Miriam Jäger et.al. | 2501.17655 | null |
2025-01-29 | A computational loudness model for electrical stimulation with cochlear implants | Franklin Alvarez et.al. | 2501.17640 | null |
2025-01-30 | Efficient Interactive 3D Multi-Object Removal | Jingcheng Ni et.al. | 2501.17636 | null |
2025-01-29 | Tapor: 3D Hand Pose Reconstruction with Fully Passive Thermal Sensing for Around-device Interactions | Xie Zhang et.al. | 2501.17585 | link |
2025-01-29 | Towards Training-Free Open-World Classification with 3D Generative Models | Xinzhe Xia et.al. | 2501.17547 | null |
2025-01-29 | 3DSES: an indoor Lidar point cloud segmentation dataset with real and pseudo-labels from a 3D model | Maxime Mérizette et.al. | 2501.17534 | null |
2025-01-29 | The quantromon: A qubit-resonator system with orthogonal qubit and readout modes | Kishor V. Salunkhe et.al. | 2501.17439 | null |
2025-01-29 | Unfitted finite element interpolated neural networks | Wei Li et.al. | 2501.17438 | null |
2025-01-29 | ASAP: Learning Generalizable Online Bin Packing via Adaptive Selection After Pruning | Han Fang et.al. | 2501.17377 | null |
2025-01-28 | Post-Training Quantization for 3D Medical Image Segmentation: A Practical Study on Real Inference Engines | Chongyu Qu et.al. | 2501.17343 | null |
2025-01-28 | Recovering Ion Distribution Functions: I. Slepian Reconstruction of VDFs from MMS and Solar Orbiter | Srijan Bharati Das et.al. | 2501.17294 | null |
2025-01-30 | $L^2$ decay estimates of weak solutions to 3D fractional MHD equations in exterior domains | Zhi-Min Chen et.al. | 2501.17179 | null |
2025-01-28 | CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation | Nikolai Kalischek et.al. | 2501.17162 | null |
2025-01-31 | IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait | Han Yang et.al. | 2501.17159 | null |
2025-01-28 | Three-Dimensional Diffusion-Weighted Multi-Slab MRI With Slice Profile Compensation Using Deep Energy Model | Reza Ghorbani et.al. | 2501.17152 | null |
2025-01-28 | New Method for Robust Critical Analysis of Magnetic Systems | Harish Chandr Chauhan et.al. | 2501.17139 | null |
2025-01-28 | Evaluating CrowdSplat: Perceived Level of Detail for Gaussian Crowds | Xiaohan Sun et.al. | 2501.17085 | null |
2025-01-29 | Synthesizing 3D Abstractions by Inverting Procedural Buildings with Transformers | Maximilian Dax et.al. | 2501.17044 | null |
2025-01-28 | Non-uniqueness of mild solutions to supercritical heat equations | Irfan Glogić et.al. | 2501.17032 | null |
2025-01-28 | What Really Matters for Learning-based LiDAR-Camera Calibration | Shujuan Huang et.al. | 2501.16969 | null |
2025-01-28 | Correlated electron dynamics with time-dependent quantum Monte Carlo: three-dimensional helium | Ivan P. Christov et.al. | 2501.16774 | null |
2025-01-28 | DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation | Chenguo Lin et.al. | 2501.16764 | null |
2025-01-31 | Consistency Diffusion Models for Single-Image 3D Reconstruction with Priors | Chenru Jiang et.al. | 2501.16737 | null |
2025-01-28 | Point Cloud Upsampling as Statistical Shape Model for Pelvic | Tongxu Zhang et.al. | 2501.16716 | null |
2025-01-28 | Image-Space Gridding for Nonrigid Motion-Corrected MR Image Reconstruction | Kwang Eun Jang et.al. | 2501.16713 | null |
2025-01-28 | 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow | Yueen Ma et.al. | 2501.16698 | null |
2025-01-28 | SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation | Jianing Li et.al. | 2501.16684 | link |
2025-01-28 | Predicting 3D representations for Dynamic Scenes | Di Qi et.al. | 2501.16617 | null |
2025-01-28 | CascadeV: An Implementation of Wurstchen Architecture for Video Generation | Wenfeng Lin et.al. | 2501.16612 | link |
2025-01-27 | Nanostructured superlattices as a probe of fermiology in Weyl-semimetal NbP | Nathan C. Drucker et.al. | 2501.16512 | null |
2025-01-27 | Just stop doing everything for now!: Understanding security attacks in remote collaborative mixed reality | Maha Sajid et.al. | 2501.16505 | null |
2025-01-27 | Electrically tunable Floquet Weyl photon emission from Dirac semimetal Cd3As2 | Sobhan Subhra Mishra et.al. | 2501.16498 | null |
2025-01-27 | Phase-matched electron-photon interactions enabled by 3D-printed helical waveguides | Masoud Taleb et.al. | 2501.16486 | null |
2025-01-27 | Differential virial analysis: a new technique to determine the dynamical state of molecular clouds | Mark R. Krumholz et.al. | 2501.16474 | link |
2025-01-27 | Sensitivity Analysis of the Laser Power Control System to Measurement Noise in SLS 3D Printers | Hamid Toshani et.al. | 2501.16473 | null |
2025-01-27 | Higher-order chiral scalar from boundary reduction of 3d higher-spin gravity | Calvin Yi-Ren Chen et.al. | 2501.16463 | null |
2025-01-27 | Separate surface and bulk topological Anderson localization transitions in disordered axion insulators | Cormac Grindall et.al. | 2501.16413 | null |
2025-01-25 | MambaTron: Efficient Cross-Modal Point Cloud Enhancement using Aggregate Selective State Space Modeling | Sai Tarun Inaganti et.al. | 2501.16384 | null |
2025-01-27 | A MARVEL-ous study of how well galaxy shapes reflect Dark Matter halo shapes in Cold Dark Matter Simulations | Blake Keith et.al. | 2501.16317 | null |
2025-01-28 | LinPrim: Linear Primitives for Differentiable Volumetric Rendering | Nicolas von Lützow et.al. | 2501.16312 | null |
2025-01-27 | Effect of Numerical Resolution on Synthetic Observables of Simulated Coronal Loops | Cosima Alexandra Breu et.al. | 2501.16293 | null |
2025-01-27 | Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models | Jing Zhang et.al. | 2501.16282 | null |
2025-01-27 | CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation | Xiaochuan Ma et.al. | 2501.16246 | null |
2025-01-28 | Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction | Tim Flückiger et.al. | 2501.16221 | null |
2025-01-27 | Aspects of the dilute Glasma | Markus Leuthner et.al. | 2501.16216 | null |
2025-01-27 | 3D image based stochastic micro-structure modelling of foams for simulating elasticity | Anne Jung et.al. | 2501.16194 | null |
2025-01-27 | BAG: Body-Aligned 3D Wearable Asset Generation | Zhongjin Luo et.al. | 2501.16177 | null |
2025-01-27 | Toward Efficient Generalization in 3D Human Pose Estimation via a Canonical Domain Approach | Hoosang Lee et.al. | 2501.16146 | null |
2025-01-27 | 3D Reconstruction of non-visible surfaces of objects from a Single Depth View – Comparative Study | Rafał Staszak et.al. | 2501.16101 | null |
2025-01-27 | Magnetoelastic coupling in the stretched diamond lattice of TbTaO $_4$ | Xiaotian Zhang et.al. | 2501.15989 | null |
2025-01-27 | MatCLIP: Light- and Shape-Insensitive Assignment of PBR Material Models | Michael Birsak et.al. | 2501.15981 | null |
2025-01-27 | Controllable Hand Grasp Generation for HOI and Efficient Evaluation Methods | Ishant et.al. | 2501.15839 | null |
2025-01-31 | SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model | Delin Qu et.al. | 2501.15830 | null |
2025-01-27 | Reliable Density Functional Theory Predictions of Bandgaps for Materials | Chenxi Lu et.al. | 2501.15811 | null |
2025-01-27 | NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation | Jialun Cai et.al. | 2501.15763 | null |
2025-01-27 | Intuition and importance of feedback control through laboratory experiences | Aldo Jonathan Munoz-Vazquez et.al. | 2501.15742 | null |
2025-01-27 | Geometric Deep Learning for Automated Landmarking of Maxillary Arches on 3D Oral Scans from Newborns with Cleft Lip and Palate | Artur Agaronyan et.al. | 2501.15737 | null |
2025-01-27 | Leveraging Video Vision Transformer for Alzheimer’s Disease Diagnosis from 3D Brain MRI | Taymaz Akan et.al. | 2501.15733 | null |
2025-01-27 | INRet: A General Framework for Accurate Retrieval of INRs for Shapes | Yushi Guan et.al. | 2501.15722 | null |
2025-01-27 | SeqSeg: Learning Local Segments for Automatic Vascular Model Construction | Numi Sveinsson Cepero et.al. | 2501.15712 | link |
2025-01-26 | Multi-compartment diffusion-relaxation MR signal representation in the spherical 3D-SHORE basis | Fabian Bogusz et.al. | 2501.15689 | null |
2025-01-26 | Marker Track: Accurate Fiducial Marker Tracking for Evaluation of Residual Motions During Breath-Hold Radiotherapy | Aimee Guo et.al. | 2501.15660 | null |
2025-01-26 | BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation | Ali Khodabandeh Yalabadi et.al. | 2501.15631 | link |
2025-01-26 | IPVTON: Image-based 3D Virtual Try-on with Image Prompt Adapter | Xiaojing Zhong et.al. | 2501.15616 | null |
2025-01-26 | Tumor Detection, Segmentation and Classification Challenge on Automated 3D Breast Ultrasound: The TDSC-ABUS Challenge | Gongning Luo et.al. | 2501.15588 | null |
2025-01-26 | Comparative clinical evaluation of “memory-efficient” synthetic 3d generative adversarial networks (gan) head-to-head to state of art: results on computed tomography of the chest | Mahshid shiri et.al. | 2501.15572 | null |
2025-01-26 | Fiber Endoscopy Using Synthetic Wavelengths for 3D tissue imaging | Muralidhar Madabhushi Balaji et.al. | 2501.15561 | null |
2025-01-26 | Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection | Zengran Wang et.al. | 2501.15449 | null |
2025-01-26 | StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces | Kyeongmin Yeo et.al. | 2501.15445 | null |
2025-01-31 | Dfilled: Repurposing Edge-Enhancing Diffusion for Guided DSM Void Filling | Daniel Panangian et.al. | 2501.15440 | null |
2025-01-26 | Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception | Lianqing Zheng et.al. | 2501.15394 | null |
2025-01-26 | A Panoramic View of MXenes via a New Design Strategy | Noah Oyeniran et.al. | 2501.15390 | null |
2025-01-26 | MetaOcc: Surround-View 4D Radar and Camera Fusion Framework for 3D Occupancy Prediction with Dual Training Strategies | Long Yang et.al. | 2501.15384 | link |
2025-01-28 | Acquiring Submillimeter-Accurate Multi-Task Vision Datasets for Computer-Assisted Orthopedic Surgery | Emma Most et.al. | 2501.15371 | link |
2025-01-26 | Structural Symmetry, Multiplicity, and Differentiability of Eigenfrequencies | Shiyao Sun et.al. | 2501.15357 | link |
2025-01-25 | Processing the 2D and 3D Fresnel experimental databases via topological derivative methods | A. Carpio et.al. | 2501.15327 | null |
2025-01-25 | A Tale of Two Sides of Wafer: Physical Implementation and Block-Level PPA on Flip FET with Dual-sided Signals | Haoran Lu et.al. | 2501.15275 | null |
2025-01-25 | Piecewise Ruled Approximation for Freeform Mesh Surfaces | Yiling Pan et.al. | 2501.15258 | null |
2025-01-25 | End-to-end localized deep learning for Cryo-ET | Vinith Kishore et.al. | 2501.15246 | link |
2025-01-25 | Three-Dimensional Sparse Random Mode Decomposition for Mode Disentangling with Crossover Instantaneous Frequencies | Chen Luo et.al. | 2501.15184 | null |
2025-01-25 | Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos | Zhen-Hui Dong et.al. | 2501.15096 | null |
2025-01-25 | Deep Reinforcement Learning for Energy Efficiency Maximization in RSMA-IRS-Assisted ISAC System | Zhangfeng Ma et.al. | 2501.15091 | null |
2025-01-25 | Magnetic Field induced control and Multiple Magnomechanically Induced Transparency in Single Cavity | Ghaisud Din et.al. | 2501.15069 | null |
2025-01-25 | Future Cosmology: New Physics and Opportunity from the China Space Station Telescope (CSST) | Yan Gong et.al. | 2501.15023 | null |
2025-01-25 | HuGDiffusion: Generalizable Single-Image Human Rendering via 3D Gaussian Diffusion | Yingzhi Tang et.al. | 2501.15008 | null |
2025-01-24 | Qubit operations using a modular optical system engineered with PyOpticL: a code-to-CAD optical layout tool | Jacob Myers et.al. | 2501.14957 | link |
2025-01-24 | Motion-enhancement to Echocardiography Segmentation via Inserting a Temporal Attention Module: An Efficient, Adaptable, and Scalable Approach | Md. Kamrul Hasan et.al. | 2501.14929 | null |
2025-01-24 | Light3R-SfM: Towards Feed-forward Structure-from-Motion | Sven Elflein et.al. | 2501.14914 | null |
2025-01-24 | A cluster mean approach for topology optimization of natural frequencies and bandgaps with simple/multiple eigenfrequencies | Shiyao Sun et.al. | 2501.14910 | null |
2025-01-24 | Glissando-Net: Deep sinGLe vIew category level poSe eStimation ANd 3D recOnstruction | Bo Sun et.al. | 2501.14896 | null |
2025-01-24 | Integrated 3D printing of transparency-on-demand glass microstructure | Zhihan Hong et.al. | 2501.14888 | null |
2025-01-24 | Effects of sub-nucleonic fluctuations on the longitudinal structure of heavy-ion collisions | Oscar Garcia-Montero et.al. | 2501.14872 | null |
2025-01-24 | Monte Carlo post-processing for radiation hydro simulations of accreting planets in protoplanetary disks | Anton Krieger et.al. | 2501.14858 | null |
2025-01-23 | Heliometric stereo: a new frontier in surface profilometry | Aleksandar Radic et.al. | 2501.14833 | null |
2025-01-15 | An efficient GPU approach for designing 3D cultural heritage information systems | Luis López et.al. | 2501.14807 | null |
2025-01-24 | HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Xin Zhou et.al. | 2501.14729 | link |
2025-01-24 | ngVLA Synthetic Observations of Ionized Gas in Massive Protostars | Jesús M. Jáquez-Domínguez et.al. | 2501.14711 | null |
2025-01-24 | Towards Unified Structured Light Optimization | Tinglei Wan et.al. | 2501.14659 | null |
2025-01-24 | Impact of phonon lifetimes on the single-photon indistinguishability in quantum emitters based on 2D materials | Alexander Steinhoff et.al. | 2501.14656 | null |
2025-01-24 | Trick-GS: A Balanced Bag of Tricks for Efficient Gaussian Splatting | Anil Armagan et.al. | 2501.14534 | null |
2025-01-24 | CheapNVS: Real-Time On-Device Narrow-Baseline Novel View Synthesis | Konstantinos Georgiadis et.al. | 2501.14533 | null |
2025-01-24 | BILLNET: A Binarized Conv3D-LSTM Network with Logic-gated residual architecture for hardware-efficient video inference | Van Thien Nguyen et.al. | 2501.14495 | null |
2025-01-27 | Visual-Lidar Map Alignment for Infrastructure Inspections | Jake McLaughlin et.al. | 2501.14486 | link |
2025-01-24 | Euclid preparation: Extracting physical parameters from galaxies with machine learning | Euclid Collaboration et.al. | 2501.14408 | null |
2025-01-24 | Characterising the Atacama segment of the Chile subduction margin (24°S-31°S) with >165,000 earthquakes | Jannes Münchmeyer et.al. | 2501.14396 | null |
2025-01-24 | Microscopic study of 3D Potts phase transition via Fuzzy Sphere Regularization | Shuai Yang et.al. | 2501.14320 | null |
2025-01-24 | Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video | Xiaohao Xu et.al. | 2501.14319 | link |
2025-02-02 | Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation | Yuxuan Wang et.al. | 2501.14317 | null |
2025-01-24 | Additive Manufacturing Processes Protocol Prediction by Artificial Intelligence using X-ray Computed Tomography data | Sunita Khod et.al. | 2501.14306 | null |
2025-02-02 | Cylindrically confined $H$ atom in magnetic field: variational cut-off factor | A. N. Mendoza Tavera et.al. | 2501.14297 | null |
2025-01-24 | Comparative analysis of two episodes of strongly geoeffective CME events in November and December 2023 | M. Temmer et.al. | 2501.14295 | null |
2025-01-24 | Dense-SfM: Structure from Motion with Dense Consistent Matching | JongMin Lee et.al. | 2501.14277 | null |
2025-02-01 | Point-LN: A Lightweight Framework for Efficient Point Cloud Classification Using Non-Parametric Positional Encoding | Marzieh Mohammadi et.al. | 2501.14238 | link |
2025-01-24 | Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images | Yihui Li et.al. | 2501.14231 | null |
2025-01-24 | Leveraging three-dimensionality for navigation in bluff-body wakes | Vedasri Godavarthi et.al. | 2501.14160 | null |
2025-01-24 | HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting | Javier Yu et.al. | 2501.14147 | null |
2025-01-23 | Flexible 3D Cage-based Deformation via Green Coordinates on Bézier Patches | Dong Xiao et.al. | 2501.14068 | null |
2025-01-23 | Efficient 2D CT Foundation Model for Contrast Phase Classification | Benjamin Hou et.al. | 2501.14066 | null |
2025-01-23 | Probing the Limits of Habitability: A Catalog of Rocky Exoplanets in the Habitable Zone | Abigail Bohl et.al. | 2501.14054 | null |
2025-01-23 | Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models | Jakob Krogh Petersen et.al. | 2501.14051 | link |
2025-01-23 | Leveraging Multiphase CT for Quality Enhancement of Portal Venous CT: Utility for Pancreas Segmentation | Xinya Wang et.al. | 2501.14013 | null |
2025-01-23 | ME-CPT: Multi-Task Enhanced Cross-Temporal Point Transformer for Urban 3D Change Detection | Luqi Zhang et.al. | 2501.14004 | link |
2025-01-27 | 3DGS $^2$ : Near Second-order Converging 3D Gaussian Splatting | Lei Lan et.al. | 2501.13975 | null |
2025-01-22 | GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting | Junzhe Jiang et.al. | 2501.13971 | link |
2025-01-22 | InsTex: Indoor Scenes Stylized Texture Synthesis | Yunfan Zhang et.al. | 2501.13969 | null |
2025-01-21 | Procedural Generation of 3D Maize Plant Architecture from LIDAR Data | Mozhgan Hadadi et.al. | 2501.13963 | null |
2025-01-21 | A Fast, Scalable, and Robust Deep Learning-based Iterative Reconstruction Framework for Accelerated Industrial Cone-beam X-ray Computed Tomography | Aniket Pramanik et.al. | 2501.13961 | null |
2025-01-23 | Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass | Jianing Yang et.al. | 2501.13928 | link |
2025-01-23 | IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models | Jiayi Lei et.al. | 2501.13920 | null |
2025-01-23 | Three-Dimensional to Layered Halide Perovskites: A Parameter-Free Hybrid Functional Method for Predicting Electronic Band Gaps | Ibrahim Buba Garba et.al. | 2501.13852 | null |
2025-01-23 | Towards Real-World Validation of a Physics-Based Ship Motion Prediction Model | Michail Mathioudakis et.al. | 2501.13804 | null |
2025-01-23 | A hybrid Reduced Order Model to enforce outflow pressure boundary conditions in computational haemodynamics | Pierfrancesco Siena et.al. | 2501.13768 | null |
2025-01-23 | Three-dimensional multiscale discrete Radon and John transforms | José Marichal-Hernández et.al. | 2501.13664 | null |
2025-01-31 | SMILES & SELFIES has to go : Representation of Molecules via Algebraic Data Types | Oliver Goldstein et.al. | 2501.13633 | link |
2025-01-23 | Steady 3d Euler flows via a topology-preserving convex integration scheme | Alberto Enciso et.al. | 2501.13632 | null |
2025-01-23 | GoDe: Gaussians on Demand for Progressive Level of Detail and Scalable Compression | Francesco Di Sario et.al. | 2501.13558 | null |
2025-01-23 | Mode-Shell correspondence, a unifying phase space theory in topological physics – part II: Higher-dimensional spectral invariants | Lucien Jezequel et.al. | 2501.13550 | null |
2025-01-23 | A discrete adjoint method for deterministic and probabilistic eikonal-equation-based inversion of traveltime for velocity and source location | Andrea Zunino et.al. | 2501.13532 | null |
2025-01-23 | Leveraging Textual Anatomical Knowledge for Class-Imbalanced Semi-Supervised Multi-Organ Segmentation | Yuliang Gu et.al. | 2501.13470 | link |
2025-01-23 | MultiDreamer3D: Multi-concept 3D Customization with Concept-Aware Diffusion Guidance | Wooseok Song et.al. | 2501.13449 | null |
2025-01-23 | Suppression of ferromagnetism in van der Waals insulator due to pressure-induced layer stacking variation | M. Misek et.al. | 2501.13446 | null |
2025-01-23 | GeomGS: LiDAR-Guided Geometry-Aware Gaussian Splatting for Robot Localization | Jaewon Lee et.al. | 2501.13417 | null |
2025-01-23 | ROMA: ROtary and Movable Antenna | Jiayi Zhang et.al. | 2501.13403 | null |
2025-01-23 | VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM | Gyuhyeon Pak et.al. | 2501.13402 | null |
2025-01-23 | Unraveling Normal Anatomy via Fluid-Driven Anomaly Randomization | Peirong Liu et.al. | 2501.13370 | link |
2025-01-23 | CuriousBot: Interactive Mobile Exploration via Actionable 3D Relational Object Graph | Yixuan Wang et.al. | 2501.13338 | null |
2025-01-23 | Deblur-Avatar: Animatable Avatars from Motion-Blurred Monocular Videos | Xianrui Luo et.al. | 2501.13335 | null |
2025-01-23 | Tensor-Var: Variational Data Assimilation in Tensor Product Feature Space | Yiming Yang et.al. | 2501.13312 | link |
2025-01-23 | Unstable accretion in TW Hya: 3D simulations and comparisons with observations | M. M. Romanova et.al. | 2501.13294 | null |
2025-01-22 | Flying shape and aerodynamics of a full-scale flexible Olympic windsurf sail | J. Zhang et.al. | 2501.13254 | null |
2025-01-22 | High dimensional spatiotemporal toroidal light beams with arbitrary polarization and orientation through a multimode fiber | Andrew V. Komonen et.al. | 2501.13246 | null |
2025-01-22 | Accelerating Discovery of Solid-State Thin-Film Metal Dealloying for 3D Nanoarchitecture Materials Design through Laser Thermal Gradient Treatment | Cheng-Chu Chung et.al. | 2501.13245 | null |
2025-01-22 | GWEn – An Open-Source Wireless Physical-Layer Evaluation Platform | Alexander Heinrich et.al. | 2501.13144 | null |
2025-01-21 | A Learnt Half-Quadratic Splitting-Based Algorithm for Fast and High-Quality Industrial Cone-beam CT Reconstruction | Aniket Pramanik et.al. | 2501.13128 | null |
2025-01-21 | MARTApp: software for the processing and reconstruction of synchrotron radiation-based magnetic tomographies | A. Estela Herguedas-Alonso et.al. | 2501.13127 | link |
2025-01-20 | Linea alba 3D morphometric variability by CT scan exploration | P. Gueroult et.al. | 2501.13116 | null |
2025-01-22 | Neural Radiance Fields for the Real World: A Survey | Wenhui Xiao et.al. | 2501.13104 | null |
2025-01-22 | Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation | Akshay Krishnan et.al. | 2501.13087 | null |
2025-01-30 | CHaRNet: Conditioned Heatmap Regression for Robust Dental Landmark Localization | José Rodríguez-Ortega et.al. | 2501.13073 | null |
2025-01-22 | Robust Body Composition Analysis by Generating 3D CT Volumes from Limited 2D Slices | Lianrui Zuo et.al. | 2501.13071 | null |
2025-01-22 | Beyond the Lungs: Extending the Field of View in Chest CT with Latent Diffusion Models | Lianrui Zuo et.al. | 2501.13068 | null |
2025-01-22 | A polynomial formula for the perspective four points problem | David Lehavi et.al. | 2501.13058 | null |
2025-01-22 | Dimensional Crossover and Emergence of Novel Phases in Puckered PdSe $_2$ under Pressure | Tanima Kundu et.al. | 2501.13057 | null |
2025-01-22 | HH 270/110 as a jet/shear layer interaction | A. C. Raga et.al. | 2501.13048 | null |
2025-01-22 | Sketch and Patch: Efficient 3D Gaussian Representation for Man-Made Scenes | Yuang Shi et.al. | 2501.13045 | null |
2025-01-22 | MorphoSkel3D: Morphological Skeletonization of 3D Point Clouds for Informed Sampling in Object Classification and Retrieval | Pierre Onghena et.al. | 2501.12974 | link |
2025-01-22 | 3D Object Manipulation in a Single Image using Generative Models | Ruisi Zhao et.al. | 2501.12935 | null |
2025-01-22 | PreciseCam: Precise Camera Control for Text-to-Image Generation | Edurne Bernal-Berdun et.al. | 2501.12910 | null |
2025-01-22 | FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces | Zhenran Xu et.al. | 2501.12909 | null |
2025-01-22 | Machine Learning Modeling for Multi-order Human Visual Motion Processing | Zitang Sun et.al. | 2501.12810 | link |
2025-01-22 | An Implicit Adaptive Fourier Neural Operator for Long-term Predictions of Three-dimensional Turbulence | Yuchi Jiang et.al. | 2501.12740 | null |
2025-02-02 | DWTNeRF: Boosting Few-shot Neural Radiance Fields via Discrete Wavelet Transform | Hung Nguyen et.al. | 2501.12637 | null |
2025-01-22 | Adapting OpenAI’s CLIP Model for Few-Shot Image Inspection in Manufacturing Quality Control: An Expository Case Study with Multiple Application Examples | Fadel M. Megahed et.al. | 2501.12596 | null |
2025-01-21 | Optimization of passive superconductors for shaping stellarator magnetic fields | Alan A. Kaptanoglu et.al. | 2501.12468 | null |
2025-01-30 | Survey of Radiative, Two-Temperature Magnetically Arrested Simulations of the Black Hole M87* I: Turbulent Electron Heating | Andrew Chael et.al. | 2501.12448 | null |
2025-01-21 | Field-induced phase transitions in the Kitaev-Heisenberg model: A sign-problem-free quantum Monte Carlo study and possible application to $α$ -RuCl3 | Xuan Zou et.al. | 2501.12437 | null |
2025-01-21 | Enhancing Retrosynthesis with Conformer: A Template-Free Method | Jiaxi Zhuang et.al. | 2501.12434 | null |
2025-01-22 | GPS as a Control Signal for Image Generation | Chao Feng et.al. | 2501.12390 | null |
2025-01-21 | Continuous 3D Perception Model with Persistent State | Qianqian Wang et.al. | 2501.12387 | null |
2025-01-21 | DARB-Splatting: Generalizing Splatting with Decaying Anisotropic Radial Basis Functions | Vishagar Arunan et.al. | 2501.12369 | null |
2025-01-22 | HAC++: Towards 100X Compression of 3D Gaussian Splatting | Yihang Chen et.al. | 2501.12255 | link |
2025-01-21 | Experiments and modeling of dust particle heating resulting from changes in polarity switching in the PK-4 microgravity laboratory | Lori S. McCabe et.al. | 2501.12248 | null |
2025-01-21 | The Structure of the Molecular Envelope of the Ring Nebula (NGC 6720) | Joel H. Kastner et.al. | 2501.12223 | null |
2025-01-22 | Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation | Zibo Zhao et.al. | 2501.12202 | link |
2025-01-21 | High-dimensional multimodal uncertainty estimation by manifold alignment:Application to 3D right ventricular strain computations | Maxime Di Folco et.al. | 2501.12178 | null |
2025-01-21 | DNRSelect: Active Best View Selection for Deferred Neural Rendering | Dongli Wu et.al. | 2501.12150 | null |
2025-01-21 | Uniform boundedness of conformal energy for the 3D quasilinear wave equation | Jingya Zhao et.al. | 2501.12103 | null |
2025-01-21 | A glance to Luttinger liquid and its platforms | Isabelle Bouchoule et.al. | 2501.12097 | null |
2025-01-22 | Numerical Modeling of Oxygen Diffusion in Tissue Spheroids Undergoing Fusion Using Functional Representation and Finite Volumes | Katherine Vilinski-Mazur et.al. | 2501.12095 | null |
2025-01-22 | GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting | Longan Wang et.al. | 2501.12060 | null |
2025-01-22 | Unified 3D MRI Representations via Sequence-Invariant Contrastive Learning | Liam Chalcroft et.al. | 2501.12057 | link |
2025-01-21 | Low-Cost 3D printed, Biocompatible Ionic Polymer Membranes for Soft Actuators | Nils Trümpler et.al. | 2501.12025 | null |
2025-01-21 | The Dilemma of Privacy Protection for Developers in the Metaverse | Argianto Rahartomo et.al. | 2501.12006 | null |
2025-01-21 | Fabrication of Poly (ε-Caprolactone) 3D scaffolds with controllable porosity using ultrasound | Martin Weber et.al. | 2501.11995 | null |
2025-01-21 | Survey on Hand Gesture Recognition from Visual Input | Manousos Linardakis et.al. | 2501.11992 | null |
2025-01-21 | SMamba: Sparse Mamba for Event-based Object Detection | Nan Yang et.al. | 2501.11971 | link |
2025-01-21 | Freezing in flat monolayers of soft spherocylinders | Jaydeep Mandal et.al. | 2501.11952 | null |
2025-01-21 | Multi-Modal Variable-Rate CSI Reconstruction for FDD Massive MIMO Systems | Yunseo Nam et.al. | 2501.11926 | null |
2025-01-21 | DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM | Jesse Morris et.al. | 2501.11893 | link |
2025-01-21 | 3D structure and stability prediction of DNA with multi-way junctions in ionic solutions | Xunxun Wang et.al. | 2501.11891 | null |
2025-01-21 | FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients | Jiaqi Leng et.al. | 2501.11876 | link |
2025-01-21 | Saturation in Snapshot Compressive Imaging | Mengyu Zhao et.al. | 2501.11869 | null |
2025-01-21 | EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents | Zhili Cheng et.al. | 2501.11858 | link |
2025-01-21 | Survey on Monocular Metric Depth Estimation | Jiuling Zhang et.al. | 2501.11841 | null |
2025-01-21 | Automating High Quality RT Planning at Scale | Riqiang Gao et.al. | 2501.11803 | link |
2025-01-21 | Self-Calibrated Epipolar Reconstruction for Assessment of Aneurysms in the Internal Carotid Artery Using In-Silico Biplane Angiograms | Kyle A. Williams et.al. | 2501.11793 | null |
2025-01-20 | A generalizable 3D framework and model for self-supervised learning in medical imaging | Tony Xu et.al. | 2501.11755 | link |
2025-01-20 | Homoclinic orbits, Reeb chords and nice Birkhoff sections for Reeb flows in 3D | Vincent Colin et.al. | 2501.11725 | null |
2025-01-20 | The TW Hydrae Association is a cluster chain of Sco-Cen | N. Miret-Roig et.al. | 2501.11716 | null |
2025-01-20 | Ion Trap Geometry | Evgeny V Krylov et.al. | 2501.11703 | null |
2025-01-23 | SE(3)-Based Trajectory Optimization and Target Tracking in UAV-Enabled ISAC Systems | Dongxiao Xu et.al. | 2501.11687 | null |
2025-01-20 | The 3d $A$ -model and generalised symmetries, Part I: bosonic Chern-Simons theories | Cyril Closset et.al. | 2501.11665 | null |
2025-01-20 | Nonlinear analysis of gravitational instability in a 3D gaseous disc | Joshua J. Brown et.al. | 2501.11658 | null |
2025-01-20 | Wafer-scale waveguide sidewall roughness scattering loss characterization by image processing | Mohit Khurana et.al. | 2501.11590 | null |
2025-01-20 | See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization | Zongqi He et.al. | 2501.11508 | null |
2025-01-20 | Efficient Multi-Source Localization in Near-Field Using only Angular Domain MUSIC | Mehdi Haghshenas et.al. | 2501.11460 | null |
2025-01-20 | CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation | Zheng Chong et.al. | 2501.11325 | link |
2025-01-20 | Center vortices and the $\mathrm{SU}(3)$ conformal window | J. A. Mickley et.al. | 2501.11279 | null |
2025-01-20 | How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks? | Wenxuan Li et.al. | 2501.11253 | link |
2025-01-19 | LiFT: Lightweight, FPGA-tailored 3D object detection based on LiDAR data | Konrad Lis et.al. | 2501.11159 | link |
2025-02-01 | OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors | Dominik Kulmer et.al. | 2501.11111 | link |
2025-01-19 | RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering | Chenlu Zhan et.al. | 2501.11102 | null |
2025-01-19 | In Vivo Study of Bone Growth Around Additively Manufactured Implants with Ti-6Al-4V and Bioactive Glass Powder Composites | Chih-Yu Lee et.al. | 2501.11098 | null |
2025-01-19 | Data-Constrained Magnetohydrodynamics Simulation of a Confined X-class Flare in NOAA Active Region 11166 | Sanjay Kumar et.al. | 2501.11066 | null |
2025-01-19 | Superlubric-Locked Transition of Twist Grain Boundaries in 3D Crystals | Jin Wang et.al. | 2501.11061 | null |
2025-01-19 | Tracking Mouse from Incomplete Body-Part Observations and Deep-Learned Deformable-Mouse Model Motion-Track Constraint for Behavior Analysis | Olaf Hellwich et.al. | 2501.11030 | null |
2025-01-19 | Car-GS: Addressing Reflective and Transparent Surface Challenges in 3D Car Reconstruction | Congcong Li et.al. | 2501.11020 | null |
2025-01-19 | DC-PCN: Point Cloud Completion Network with Dual-Codebook Guided Quantization | Qiuxia Wu et.al. | 2501.10966 | null |
2025-01-19 | Random batch sum-of-Gaussians algorithm for molecular dynamics simulations of Yukawa systems in three dimensions | Chen Chen et.al. | 2501.10946 | null |
2025-01-19 | Generative Physical AI in Vision: A Survey | Daochang Liu et.al. | 2501.10928 | null |
2025-01-18 | EMICSS: Added-value annotations for EMDB entries | Amudha K. Duraisamy et.al. | 2501.10882 | link |
2025-01-18 | No More Sliding Window: Efficient 3D Medical Image Segmentation with Differentiable Top-k Patch Sampling | Young Seok Jeon et.al. | 2501.10814 | null |
2025-01-18 | Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting | Jiaqi Lin et.al. | 2501.10788 | null |
2025-01-18 | Enhancing Diagnostic in 3D COVID-19 Pneumonia CT-scans through Explainable Uncertainty Bayesian Quantification | Juan Manuel Liscano Fierro et.al. | 2501.10770 | null |
2025-01-18 | An Experimental Study on Joint Modeling for Sound Event Localization and Detection with Source Distance Estimation | Yuxuan Dong et.al. | 2501.10755 | null |
2025-01-18 | Incremental Model Order Reduction of Smoothed-Particle Hydrodynamic Simulations | Eduardo Di Costanzo et.al. | 2501.10748 | null |
2025-01-18 | Continuum limit of 3D fractional nonlinear Schrödinger equation | Jiajun Wang et.al. | 2501.10737 | null |
2025-01-22 | A CNN-Transformer for Classification of Longitudinal 3D MRI Images – A Case Study on Hepatocellular Carcinoma Prediction | Jakob Nolte et.al. | 2501.10733 | link |
2025-01-18 | PB-NBV: Efficient Projection-Based Next-Best-View Planning Framework for Reconstruction of Unknown Objects | Zhizhou Jia et.al. | 2501.10663 | link |
2025-01-18 | RoMu4o: A Robotic Manipulation Unit For Orchard Operations Automating Proximal Hyperspectral Leaf Sensing | Mehrad Mortazavi et.al. | 2501.10621 | link |
2025-01-26 | Hierarchical LoG Bayesian Neural Network for Enhanced Aorta Segmentation | Delin An et.al. | 2501.10615 | link |
2025-01-16 | Poxel: Voxel Reconstruction for 3D Printing | Ruixiang Cao et.al. | 2501.10474 | null |
2025-01-15 | BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation | Xiaolu Hou et.al. | 2501.10462 | link |
2025-01-24 | PhyDeformer: High-Quality Non-Rigid Garment Registration with Physics-Awareness | Boyang Yu et.al. | 2501.10455 | null |
2025-01-20 | Zero-Shot Monocular Scene Flow Estimation in the Wild | Yiqing Liang et.al. | 2501.10357 | null |
2025-01-17 | Perception of Visual Variables on Virtual Wall-Sized Tiled Displays in Immersive Environments | Dongyun Han et.al. | 2501.10338 | null |
2025-01-17 | Elucidating the high compliance mechanism by which the urinary bladder fills under low pressures | Fatemeh Azari et.al. | 2501.10312 | null |
2025-01-20 | GSTAR: Gaussian Surface Tracking and Reconstruction | Chengwei Zheng et.al. | 2501.10283 | null |
2025-01-17 | MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection | Xiangyuan Peng et.al. | 2501.10266 | null |
2025-01-17 | Monolithically 3D nano-printed mm-scale lens actuator for dynamic focus control in optical systems | Florian Lux et.al. | 2501.10254 | null |
2025-01-17 | The R-Vessel-X Project | Abir Affane et.al. | 2501.10068 | link |
2025-01-17 | Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions | Chao He et.al. | 2501.10020 | null |
2025-01-17 | Mitigating Hallucinations on Object Attributes using Multiview Images and Negative Instructions | Zhijie Tan et.al. | 2501.10011 | null |
2025-01-17 | Static Three-Dimensional Structures Determine Fast Dynamics Between Distal Loci Pairs in Interphase Chromosomes | Guang Shi et.al. | 2501.10004 | null |
2025-01-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al. | 2501.09994 | link |
2025-01-17 | GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor | Xiangyue Liu et.al. | 2501.09978 | null |
2025-01-17 | Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation | Xiaoyun Zheng et.al. | 2501.09947 | link |
2025-01-17 | Study on a Fast Solver for Combined Field Integral Equations of 3D Conducting Bodies Based on Graph Neural Networks | Tao Shan et.al. | 2501.09923 | link |
2025-01-17 | TalkingEyes: Pluralistic Speech-Driven 3D Eye Gaze Animation | Yixiang Zhuang et.al. | 2501.09921 | null |
2025-01-17 | A rigid origami elliptic-hyperbolic vertex duality | Thomas C. Hull et.al. | 2501.09908 | null |
2025-01-17 | High-Accuracy Physical Property Prediction for Organics via Molecular Representation Learning: Bridging Data to Discovery | Qi Ou et.al. | 2501.09896 | null |
2025-01-17 | Holographic Bound of Casimir Effect in General Dimensions | Rong-Xin Miao et.al. | 2501.09886 | null |
2025-01-16 | Quantitative analysis of vectorial torques in thin 3d Co ferromagnet using orbital-spin conversion | B. Bony et.al. | 2501.09864 | null |
2025-01-16 | Detection of Vascular Leukoencephalopathy in CT Images | Z. Cernekova et.al. | 2501.09863 | null |
2025-01-16 | Dust dynamics in radially convective regions of protoplanetary disks | Min-Kai Lin et.al. | 2501.09792 | null |
2025-01-16 | MUSE observations of V1425 Aql reveal an arc-shaped nova shell | L. Celedón et.al. | 2501.09780 | null |
2025-01-16 | SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces | Sumit Chaturvedi et.al. | 2501.09756 | null |
2025-01-16 | SRE-Conv: Symmetric Rotation Equivariant Convolution for Biomedical Image Classification | Yuexi Du et.al. | 2501.09753 | link |
2025-01-16 | A vertical slice frontogenesis test case for compressible nonhydrostatic dynamical cores of atmospheric models | Hiroe Yamazaki et.al. | 2501.09752 | null |
2025-01-16 | 25 years of XMM-Newton observations of the Sgr A complex: 3D distribution and internal structure of the clouds | G. Stel et.al. | 2501.09737 | null |
2025-01-16 | Weak electronic correlations in the cobalt oxychalcogenide superconductor Na2CoSe2O | Zhenchao Wu et.al. | 2501.09675 | null |
2025-01-16 | Jet-shaped filamentary ejecta in common envelope evolution | Ron Schreier et.al. | 2501.09663 | null |
2025-01-16 | Aging of colloidal gels in microgravity | Swagata S. Datta et.al. | 2501.09650 | null |
2025-01-16 | Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures | Pratyush Dhingra et.al. | 2501.09588 | null |
2025-01-16 | Critical relaxational dynamics at the continuous transitions of three-dimensional spin models with ${\mathbb Z}_2$ gauge symmetry | Claudio Bonati et.al. | 2501.09575 | null |
2025-01-16 | Resolution enhancement in quantitative phase microscopy: a review | Vicente Mico et.al. | 2501.09548 | null |
2025-01-16 | WALLABY Pilot Survey & ASymba: Comparing HI Detection Asymmetries to the SIMBA Simulation | Mathieu Perron-Cormier et.al. | 2501.09547 | null |
2025-01-16 | The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning | Wonjun Jo et.al. | 2501.09485 | null |
2025-01-16 | MonoSOWA: Scalable monocular 3D Object detector Without human Annotations | Jan Skvrna et.al. | 2501.09481 | null |
2025-01-16 | Predicting Air Temperature from Volumetric Urban Morphology with Machine Learning | Berk Kıvılcım et.al. | 2501.09469 | null |
2025-01-16 | Tunable spin and orbital torques in Cu-based magnetic heterostructures | Silvia Damerio et.al. | 2501.09458 | null |
2025-01-16 | CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation | Hwan Heo et.al. | 2501.09433 | link |
2025-01-16 | AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring | Xinyi Wang et.al. | 2501.09428 | null |
2025-01-16 | Slowly decaying strain solitons in nonlinear viscoelastic waveguides | F. E. Garbuzov et.al. | 2501.09415 | null |
2025-01-16 | UVRM: A Scalable 3D Reconstruction Model from Unposed Videos | Shiu-hong Kao et.al. | 2501.09347 | null |
2025-01-16 | Robust UAV Path Planning with Obstacle Avoidance for Emergency Rescue | Junteng Mao et.al. | 2501.09338 | null |
2025-01-16 | Automatic exposure volumetric additive manufacturing | Antony Orth et.al. | 2501.09332 | null |
2025-01-16 | Creating Virtual Environments with 3D Gaussian Splatting: A Comparative Study | Shi Qiu et.al. | 2501.09302 | null |
2025-01-16 | Machine Learning Relationships between Nanoporous Structures and Electrochemical Performance in MOF Supercapacitors | Zhenxiang Wang et.al. | 2501.09287 | null |
2025-01-17 | Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding | Kohei Torimi et.al. | 2501.09278 | null |
2025-01-16 | OpticFusion: Multi-Modal Neural Implicit 3D Reconstruction of Microstructures by Fusing White Light Interferometry and Optical Microscopy | Shuo Chen et.al. | 2501.09259 | link |
2025-01-15 | Anomalous and Planar Hall Effects in Cobalt-Holmium Thin Films Near Magnetic Sublattice Compensation | Ramesh C Budhani et.al. | 2501.09206 | null |
2025-01-15 | 3D Printed Maps and Icons for Inclusion: Testing in the Wild by People who are Blind or have Low Vision | Leona Holloway et.al. | 2501.09204 | null |
2025-01-15 | Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures | Pengru Deng et.al. | 2501.09203 | null |
2025-01-15 | Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation | Xingxin He et.al. | 2501.09138 | null |
2025-01-15 | Self Pre-training with Adaptive Mask Autoencoders for Variable-Contrast 3D Medical Imaging | Badhan Kumar Das et.al. | 2501.09096 | null |
2025-01-15 | Anthropomorphic Features for On-Line Signatures | Moises Diaz et.al. | 2501.09048 | null |
2025-01-15 | Vision Foundation Models for Computed Tomography | Suraj Pai et.al. | 2501.09001 | link |
2025-01-15 | Calculus with combinatorial differential forms for fluid flow analysis in porous and fractured media | Changhao Liu et.al. | 2501.08996 | null |
2025-01-15 | Topological Bardeen-Cooper-Schrieffer theory of superconducting quantum rings | Elena Landro’ et.al. | 2501.08986 | null |
2025-01-15 | CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities | Haozhe Xie et.al. | 2501.08983 | link |
2025-01-15 | CityLoc: 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation | Qi Ma et.al. | 2501.08982 | null |
2025-01-15 | Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving | Tengpeng Li et.al. | 2501.08861 | link |
2025-01-23 | DMCpy: A powder and single crystal neutron diffraction software for DMC | Jakob Lass et.al. | 2501.08845 | null |
2025-01-15 | CHEmical-shift selective Adiabatic Pulse (CHEAP): Fast and High Resolution Downfield 3D 1H-MRSI at 7T | Guodong Weng et.al. | 2501.08827 | null |
2025-01-15 | The 3D structure of the Nucleon in momentum space: TMD phenomenology | Marco Radici et.al. | 2501.08806 | null |
2025-01-15 | The first calibration model for bluetooth angle of arrival: Enhancing positioning accuracy in indoor environments | Ma’mon Saeed Alghananim et.al. | 2501.08805 | null |
2025-01-24 | MeshMask: Physics-Based Simulations with Masked Graph Neural Networks | Paul Garnier et.al. | 2501.08738 | link |
2025-01-16 | Holoview: Interactive 3D visualization of medical data in AR | Pankaj Kaushik et.al. | 2501.08736 | null |
2025-01-15 | GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping | Sheng Hong et.al. | 2501.08672 | null |
2025-01-15 | Joint Learning of Depth and Appearance for Portrait Image Animation | Xinya Ji et.al. | 2501.08649 | null |
2025-01-15 | Computerized Assessment of Motor Imitation for Distinguishing Autism in Video (CAMI-2DNet) | Kaleab A. Kinfu et.al. | 2501.08609 | null |
2025-01-15 | Image-to-Force Estimation for Soft Tissue Interaction in Robotic-Assisted Surgery Using Structured Light | Jiayin Wang et.al. | 2501.08593 | null |
2025-01-15 | Scalable and High-Quality Neural Implicit Representation for 3D Reconstruction | Leyuan Yang et.al. | 2501.08577 | null |
2025-01-15 | DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors | Runqi Wang et.al. | 2501.08553 | null |
2025-01-15 | Score-based 3D molecule generation with neural fields | Matthieu Kirchmeyer et.al. | 2501.08508 | link |
2025-01-14 | Automotive Elevation Mapping with Interferometric Synthetic Aperture Radar | Leyla A. Kabuli et.al. | 2501.08495 | null |
2025-01-14 | Leveraging 2D Masked Reconstruction for Domain Adaptation of 3D Pose Estimation | Hansoo Park et.al. | 2501.08408 | null |
2025-01-14 | 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering | Meenakshi Krishnan et.al. | 2501.08370 | null |
2025-01-14 | DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models | Hyeonwoo Kim et.al. | 2501.08333 | null |
2025-01-14 | Predicting 4D Hand Trajectory from Monocular Videos | Yufei Ye et.al. | 2501.08329 | null |
2025-01-14 | Efficient Deep Learning-based Forward Solvers for Brain Tumor Growth Models | Zeineb Haouari et.al. | 2501.08226 | link |
2025-01-14 | Theoretical determination of Gilbert damping in reduced dimensions | Balázs Nagyfalusi et.al. | 2501.08119 | null |
2025-01-14 | Guiding the classification of hepatocellular carcinoma on 3D CT-scans using deep and handcrafted radiological features | E. Sarfati et.al. | 2501.08097 | null |
2025-01-13 | Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes | Yuhang Zhang et.al. | 2501.08072 | null |
2025-01-14 | Role of injection parameters in jet propagation through realistic binary neutron star merger environments | Andrea Pavan et.al. | 2501.08032 | null |
2025-01-14 | GDiffRetro: Retrosynthesis Prediction with Dual Graph Enhanced Molecular Representation and Diffusion Generation | Shengyin Sun et.al. | 2501.08001 | link |
2025-01-14 | LLM-Ehnanced Holonic Architecture for Ad-Hoc Scalable SoS | Muhammad Ashfaq et.al. | 2501.07992 | null |
2025-01-14 | GAC-Net_Geometric and attention-based Network for Depth Completion | Kuang Zhu et.al. | 2501.07988 | null |
2025-01-14 | Exploring the energy spectrum of a four-terminal Josephson junction: Towards topological Andreev band structures | Tommaso Antonelli et.al. | 2501.07982 | null |
2025-01-14 | Mapping reionization bubbles in the JWST era II: inferring the position and characteristic size of individual bubbles | Ivan Nikolić et.al. | 2501.07980 | null |
2025-01-16 | Scalable freeform optimization of wide-aperture 3D metalenses by zoned discrete axisymmetry | Mengdi Sun et.al. | 2501.07979 | null |
2025-01-14 | An Open Source Validation System for Continuous Arterial Blood Pressure Measuring Sensors | Attila Répai et.al. | 2501.07973 | link |
2025-01-14 | Early prediction of the transferability of bovine embryos from videomicroscopy | Yasmine Hachani et.al. | 2501.07945 | null |
2025-01-14 | Using curved meshes to derive a priori error estimates for a linear elasticity problem with Robin boundary conditions | Joyce Ghantous et.al. | 2501.07914 | null |
2025-01-15 | Make-A-Character 2: Animatable 3D Character Generation From a Single Image | Lin Liu et.al. | 2501.07870 | null |
2025-01-14 | Origin of dimensional crossover in quasi-one-dimensional hollandite K ${2}$Ru${8}$O$_{16}$ | Asif Ali et.al. | 2501.07822 | null |
2025-01-14 | 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Haomiao Xiong et.al. | 2501.07819 | link |
2025-01-14 | Simulations of Three-dimensional Nematic Guidance of Microswimmers | Zeyang Mou et.al. | 2501.07816 | null |
2025-01-14 | BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos | Farnoosh Koleini et.al. | 2501.07800 | null |
2025-01-14 | HgPCN: A Heterogeneous Architecture for E2E Embedded Point Cloud Inference | Yiming Gao et.al. | 2501.07767 | null |
2025-01-13 | 3D MC I: X-ray Tomography Begins to Unravel the 3-D Structure of a Molecular Cloud in our Galaxy’s Center | Samantha W. Brunker et.al. | 2501.07717 | null |
2025-01-13 | Computational Geometry with Probabilistically Noisy Primitive Operations | David Eppstein et.al. | 2501.07707 | null |
2025-01-13 | Harnessing ultrafast optical pulses for 3D microfabrication by selective tweezing and immobilization of colloidal particles in an integrated system | Krishangi Krishna et.al. | 2501.07684 | null |
2025-01-13 | On the accuracy of dark matter halo merger trees and the consequences for semi-analytic models of galaxy formation | Ángel Chandro-Gómez et.al. | 2501.07677 | link |
2025-01-13 | 3D MC II: X ray echoes reveal a clumpy molecular cloud in the CMZ | Danya Alboslani et.al. | 2501.07669 | null |
2025-01-13 | BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations | Weixi Feng et.al. | 2501.07647 | null |
2025-01-13 | One-point functions for doubly-holographic BCFTs and backreacting defects | Dongming He et.al. | 2501.07630 | null |
2025-01-13 | Helical Magnetic Field in a Massive Protostellar Jet | A. Rodríguez-Kamenetzky et.al. | 2501.07622 | null |
2025-01-11 | Three-dimensional (3D) tensor-based methodology for characterizing 3D anisotropic thermal conductivity tensor | Dihui Wang et.al. | 2501.07605 | null |
2025-01-13 | UnCommon Objects in 3D | Xingchen Liu et.al. | 2501.07574 | link |
2025-01-13 | 3D-grids are not transducible from planar graphs | Jakub Gajarský et.al. | 2501.07558 | null |
2025-01-13 | Three-dimensional transport of solids in a protoplanetary disk containing a growing giant planet | Eric Van Clepper et.al. | 2501.07520 | null |
2025-01-13 | 3DGS-to-PC: Convert a 3D Gaussian Splatting Scene into a Dense Point Cloud or Mesh | Lewis A G Stuart et.al. | 2501.07478 | link |
2025-01-13 | Metal-THINGS: The Milky Way twin candidate NGC 3521 | L. S. Pilyugin et.al. | 2501.07443 | null |
2025-01-13 | Diff-Ensembler: Learning to Ensemble 2D Diffusion Models for Volume-to-Volume Medical Image Translation | Xiyue Zhu et.al. | 2501.07430 | null |
2025-01-13 | Photonic antiferromagnetic topological insulator with a single surface Dirac cone | Fujia Chen et.al. | 2501.07424 | null |
2025-01-13 | Approaching ballistic motion in 3D simulations of gamma-ray burst jets in realistic binary neutron star merger environments | Emma Dreas et.al. | 2501.07385 | null |
2025-01-13 | Non-unique self-similar blowups in Sabra models: insights from dynamical systems and machine-learning | Ciro Campolina et.al. | 2501.07377 | null |
2025-01-13 | Dynami-CAL GraphNet: A Physics-Informed Graph Neural Network Conserving Linear and Angular Momentum for Dynamical Systems | Vinay Sharma et.al. | 2501.07373 | null |
2025-01-13 | High-efficiency, high-count-rate 2D superconducting nanowire single-photon detector array | Fiona Fleming et.al. | 2501.07357 | null |
2025-01-13 | Theoretical Modelling of Gamma-Ray Burst 090510 | Joseph Saji et.al. | 2501.07283 | null |
2025-01-13 | Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion | Li Liang et.al. | 2501.07260 | link |
2025-01-16 | PO-GVINS: Tightly Coupled GNSS-Visual-Inertial Integration with Pose-Only Representation | Zhuo Xu et.al. | 2501.07259 | null |
2025-01-13 | Enhancing Interaction with Augmented Reality through Mid-Air Haptic Feedback: Architecture Design and User Feedback | Diego Vaquero-Melchor et.al. | 2501.07234 | null |
2025-01-13 | Revealing Point Group Symmetry of Rare-earth Dopants via Polarization-resolved Single-particle Microspectroscopy | Peng Li et.al. | 2501.07199 | null |
2025-01-13 | Robust Single Object Tracking in LiDAR Point Clouds under Adverse Weather Conditions | Xiantong Zhao et.al. | 2501.07133 | null |
2025-01-13 | Confinement of 3d $\mathcal{N}=2$ Gauge Theories from M-theory on CY4 | Marwan Najjar et.al. | 2501.07116 | null |
2025-01-13 | Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics | Tze Ho Elden Tse et.al. | 2501.07100 | null |
2025-01-19 | Stochastic reconstruction of multiphase composite microstructures using statistics-encoded neural network for poro/micro-mechanical modelling | Jinlong Fu et.al. | 2501.07083 | null |
2025-01-13 | Nonequilibrium Continuous Transition in a Fast Rotating Turbulence | Chandra Shekhar Lohani et.al. | 2501.07079 | null |
2025-01-13 | D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation | Zhejun Zhang et.al. | 2501.07077 | link |
2025-01-13 | Representation Learning of Point Cloud Upsampling in Global and Local Inputs | Tongxu Zhang et.al. | 2501.07076 | null |
2025-01-19 | UNetVL: Enhancing 3D Medical Image Segmentation with Chebyshev KAN Powered Vision-LSTM | Xuhui Guo et.al. | 2501.07017 | link |
2025-01-14 | SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting | Yue Hu et.al. | 2501.07015 | null |
2025-01-13 | Nishida-Smoller type large solutions and exponential growth for the compressible Navier-Stokes equations with slip boundary conditions in 3D bounded domain | Saiguo Xu et.al. | 2501.07003 | null |
2025-01-13 | Dark matter halo dynamics in 2D Vlasov Simulations: a self-similar approach | Abineet Parichha et.al. | 2501.07001 | null |
2025-01-12 | Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy | Evgeny Ugolkov et.al. | 2501.06939 | link |
2025-01-12 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
2025-01-12 | Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories | Faaiq Waqar et.al. | 2501.06921 | null |
2025-01-12 | Synthetic Prior for Few-Shot Drivable Head Avatar Inversion | Wojciech Zielonka et.al. | 2501.06903 | null |
2025-01-12 | ActiveGAMER: Active GAussian Mapping through Efficient Rendering | Liyan Chen et.al. | 2501.06897 | null |
2025-01-12 | A Flux-Tunable cavity for Dark matter detection | Fang Zhao et.al. | 2501.06882 | null |
2025-01-12 | Evaluation of post-blast damage in cut blasting with varying extra-depths: insights from 2D simulations and 3D experiments | Changda Zheng et.al. | 2501.06855 | null |
2025-01-14 | Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution | Du Chen et.al. | 2501.06838 | link |
2025-01-12 | Multilayered fluid-structure interactions: existence of weak solutions for time-periodic and initial-value problems | Claudiu Mîndrilă et.al. | 2501.06820 | null |
2025-01-12 | Conefield approach to identifying regions without flux surfaces for magnetic fields | David Martinez-del-Rio et.al. | 2501.06796 | link |
2025-01-12 | Temporal-Aware Spiking Transformer Hashing Based on 3D-DWT | Zihao Mei et.al. | 2501.06786 | null |
2025-01-12 | 3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes | Mahmoud Ahmed et.al. | 2501.06785 | link |
2025-01-12 | Neutrino Heating in 1D, 2D, and 3D core-collapse supernovae: characterizing the explosion of high-compactness stars | Luca Boccioli et.al. | 2501.06784 | null |
2025-01-14 | Cost-Effective Robotic Handwriting System with AI Integration | Tianyi Huang et.al. | 2501.06783 | null |
2025-01-12 | Compact Model of Linear Passive Integrated Photonics Device for Photon Design Automation | Zijian Zhang et.al. | 2501.06774 | null |
2025-01-17 | SuperNeRF-GAN: A Universal 3D-Consistent Super-Resolution Framework for Efficient and Enhanced 3D-Aware Image Synthesis | Peng Zheng et.al. | 2501.06770 | null |
2025-01-21 | F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian Splatting | Yuxin Wang et.al. | 2501.06714 | null |
2025-01-14 | Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation | Ziyang Xie et.al. | 2501.06693 | null |
2025-01-12 | Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving | Haoxiang Gao et.al. | 2501.06680 | null |
2025-01-11 | Fast multi-contrast MRI using joint multiscale energy model | Nima Yaghoobi et.al. | 2501.06595 | null |
2025-01-11 | CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection | Yiheng Li et.al. | 2501.06550 | link |
2025-01-11 | CeViT: Copula-Enhanced Vision Transformer in multi-task learning and bi-group image covariates with an application to myopia screening | Chong Zhong et.al. | 2501.06540 | link |
2025-01-11 | NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without References | Qiang Qu et.al. | 2501.06488 | link |
2025-01-11 | YO-CSA-T: A Real-time Badminton Tracking System Utilizing YOLO Based on Contextual and Spatial Attention | Yuan Lai et.al. | 2501.06472 | null |
2025-01-11 | Discovering an Image-Adaptive Coordinate System for Photography Processing | Ziteng Cui et.al. | 2501.06448 | null |
2025-01-11 | X-ray microcomputed tomography of 3D chaotic microcavities | Ke Tian et.al. | 2501.06393 | null |
2025-01-10 | MEt3R: Measuring Multi-View Consistency in Generated Images | Mohammad Asim et.al. | 2501.06336 | null |
2025-01-10 | Continuum Reverberation in Active Galactic Nuclei Disks Only With Sufficient X-ray Luminosity and Low Albedo | Amy Secunda et.al. | 2501.06304 | link |
2025-01-10 | A Catalog of Stellar and Dust Properties for 500,000 Stars in the Southwest Bar of the Small Magellanic Cloud | Petia Yanchulova Merica-Jones et.al. | 2501.06290 | null |
2025-01-10 | Boundary operator expansion and extraordinary phase transition in the tricritical O(N) model | Xinyu Sun et.al. | 2501.06287 | null |
2025-01-08 | Open-Source Manually Annotated Vocal Tract Database for Automatic Segmentation from 3D MRI Using Deep Learning: Benchmarking 2D and 3D Convolutional and Transformer Networks | Subin Erattakulangara et.al. | 2501.06229 | null |
2025-01-10 | NDOB-Based Control of a UAV with Delta-Arm Considering Manipulator Dynamics | Hongming Chen et.al. | 2501.06122 | null |
2025-01-10 | Gigahertz directional light modulation with electro-optic metasurfaces | Sam Lin et.al. | 2501.06102 | null |
2025-01-10 | Data-driven reduced modeling of streamer discharges in air | Jannis Teunissen et.al. | 2501.06093 | link |
2025-01-10 | Simulation and modelling of convective mixing of carbon dioxide in geological formations | Marco De Paoli et.al. | 2501.06090 | null |
2025-01-10 | Non-planar 3D Printing of Double Shells | Ioanna Mitropoulou et.al. | 2501.06088 | null |
2025-01-10 | Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction | Cecilia Curreli et.al. | 2501.06035 | null |
2025-01-10 | Thermal emission from bow shocks III: Variable diffuse X-ray emission from stellar-wind bow shocks driven by dynamical instabilities | Jonathan Mackey et.al. | 2501.06021 | null |
2025-01-10 | Pose-independent 3D Anthropometry from Sparse Data | David Bojanić et.al. | 2501.06014 | link |
2025-01-10 | CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control | Stefan Popov et.al. | 2501.06006 | null |
2025-01-10 | Full-domain POD modes from PIV asynchronous patches | Iacopo Tirelli et.al. | 2501.05988 | null |
2025-01-10 | Swin-X2S: Reconstructing 3D Shape from 2D Biplanar X-ray with Swin Transformers | Kuan Liu et.al. | 2501.05961 | link |
2025-01-10 | Self-consistent full MHD coupling of JOREK and STARWALL for advanced plasma free boundary simulation | Raffaele Sparago et.al. | 2501.05956 | null |
2025-01-13 | Inverse Design of 3D Nanophotonic Devices with Structural Integrity Using Auxiliary Thermal Solvers | Oliver Kuster et.al. | 2501.05900 | link |
2025-01-10 | UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping | Yanjie Li et.al. | 2501.05783 | null |
2025-01-10 | StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation | Shangjin Zhai et.al. | 2501.05763 | null |
2025-01-10 | Development and Comparison of Model-Based and Data-Driven Approaches for the Prediction of the Mechanical Properties of Lattice Structures | Chiara Pasini et.al. | 2501.05762 | null |
2025-01-10 | Locality-aware Gaussian Compression for Fast and High-quality Rendering | Seungjoo Shin et.al. | 2501.05757 | null |
2025-01-20 | Tailored Thin Films: Modulating Soft Photonics with Dynamically Tunable Large Area Microstructures via Controlled Thermal Processing | Srijeeta Biswas et.al. | 2501.05736 | null |
2025-01-10 | Homogenization of Inhomogeneous Incompressible Navier-Stokes Equations in Domains with Very Tiny Holes | Yong Lu et.al. | 2501.05734 | null |
2025-01-07 | Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud: A Survey | Jingyuan Tang et.al. | 2501.05473 | link |
2025-01-06 | The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge | Qing Wu et.al. | 2501.05472 | null |
2025-01-09 | Consistent Flow Distillation for Text-to-3D Generation | Runjie Yan et.al. | 2501.05445 | null |
2025-01-09 | Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation | Xuyi Meng et.al. | 2501.05427 | null |
2025-01-09 | Quantum undetected optical projection tomography | Nathan R. Gemmell et.al. | 2501.05381 | null |
2025-01-13 | Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance | Dimitrios Gerogiannis et.al. | 2501.05379 | null |
2025-01-09 | Solar wind entry into Mercury’s magnetosphere: Simulation results for the second swingby of BepiColombo | Daniel Teubenbacher et.al. | 2501.05363 | null |
2025-01-09 | Large eddy simulation of ocean mesoscale eddies | Pavel Perezhogin et.al. | 2501.05357 | null |
2025-01-09 | The Role of Atmospheric Composition in Defining the Habitable Zone Limits and Supporting E. coli Growth | Asena Kuzucan et.al. | 2501.05297 | null |
2025-01-16 | Towards Balanced Continual Multi-Modal Learning in Human Pose Estimation | Jiaxuan Peng et.al. | 2501.05264 | null |
2025-01-09 | Competition of superconducting pairing symmetries in La3Ni2O7 | Han-Xiang Xu et.al. | 2501.05254 | null |
2025-01-09 | Optimized Sampling for Non-Line-of-Sight Imaging Using Modified Fast Fourier Transforms | Talha Sultan et.al. | 2501.05244 | null |
2025-01-09 | Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping | Wen Tianci et.al. | 2501.05242 | null |
2025-01-13 | Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes | Ludwic Leonard et.al. | 2501.05226 | link |
2025-01-09 | State-Based Disassembly Planning | Chao Lei et.al. | 2501.05156 | null |
2025-01-09 | A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision | Ali Rohan et.al. | 2501.05147 | null |
2025-01-09 | Unidirectional motion of topological defects mediating continuous rotation processes | Marisel Di Pietro Martínez et.al. | 2501.05112 | null |
2025-01-09 | EquiBoost: An Equivariant Boosting Approach to Molecular Conformation Generation | Yixuan Yang et.al. | 2501.05109 | link |
2025-01-09 | Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset | Yuhong Zhang et.al. | 2501.05098 | null |
2025-01-09 | Advancing ALS Applications with Large-Scale Pre-training: Dataset Development and Downstream Assessment | Haoyi Xiu et.al. | 2501.05095 | link |
2025-01-09 | Time-Variant Vector Field Visualization for Magnetic Fields of Neutron Star Simulations | Simon J. Lieb et.al. | 2501.05084 | null |
2025-01-09 | Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation | Yingjie Chen et.al. | 2501.05020 | null |
2025-01-09 | A Fast Path-Planning Method for Continuous Harvesting of Table-Top Grown Strawberries | Zhonghua Miao et.al. | 2501.05004 | null |
2025-01-09 | IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation | Qi Chen et.al. | 2501.04995 | link |
2025-01-09 | AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data | Haoran Zhu et.al. | 2501.04969 | link |
2025-01-09 | The Catalogue of Virtual Early-Type Galaxies from IllustrisTNG: Validation and Real Observation Consistency | Pedro de Araujo Ferreira et.al. | 2501.04932 | null |
2025-01-09 | Image2CADSeq: Computer-Aided Design Sequence and Knowledge Inference from Product Images | Xingang Li et.al. | 2501.04928 | null |
2025-01-08 | A new rotation-free isogeometric thin shell formulation and a corresponding continuity constraint for patch boundaries | Thang Xuan Duong et.al. | 2501.04855 | null |
2025-01-08 | GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting | Andrew Bond et.al. | 2501.04782 | null |
2025-01-08 | Development of an Adaptive Sliding Mode Controller using Neural Networks for Trajectory Tracking of a Cylindrical Manipulator | TieuNien Le et.al. | 2501.04754 | null |
2025-01-07 | Generative Style Transfer for MRI Image Segmentation: A Case of Glioma Segmentation in Sub-Saharan Africa | Rancy Chepchirchir et.al. | 2501.04734 | link |
2025-01-08 | SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images | Zixuan Huang et.al. | 2501.04689 | null |
2025-01-08 | RadGPT: Constructing 3D Image-Text Tumor Datasets | Pedro R. A. S. Bassi et.al. | 2501.04678 | link |
2025-01-08 | Vertical structure and kinematics of the LMC disc from SDSS/Gaia | Ó. Jiménez-Arranz et.al. | 2501.04616 | null |
2025-01-08 | FrontierNet: Learning Visual Cues to Explore | Boyang Sun et.al. | 2501.04597 | link |
2025-01-08 | Instructive3D: Editing Large Reconstruction Models with Text Instructions | Kunal Kathare et.al. | 2501.04374 | null |
2025-01-08 | FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection | Guoxin Zhang et.al. | 2501.04373 | null |
2025-01-08 | A Unified Framework for Foreground and Anonymization Area Segmentation in CT and MRI Data | Michal Nohel et.al. | 2501.04361 | link |
2025-01-08 | On Domain Decomposition for Magnetostatic Problems in 3D | Mario Mally et.al. | 2501.04340 | null |
2025-01-08 | Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs | Zeyi Huang et.al. | 2501.04336 | null |
2025-01-08 | Frenet-Serret-Based Trajectory Prediction | Shashank Verma et.al. | 2501.04273 | null |
2025-01-08 | UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles | Abhishek Balasubramaniam et.al. | 2501.04213 | null |
2025-01-07 | Learning to Transfer Human Hand Skills for Robot Manipulations | Sungjae Park et.al. | 2501.04169 | null |
2025-01-16 | Five-brane webs, 3d $\mathcal{N}=2$ theories and quantum curves | Naotaka Kubo et.al. | 2501.04146 | null |
2025-01-07 | Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation | Kam Woh Ng et.al. | 2501.04144 | link |
2025-01-07 | Spatiotemporal Gaussian Optimization for 4D Cone Beam CT Reconstruction from Sparse Projections | Yabo Fu et.al. | 2501.04140 | link |
2025-01-07 | Scalable Discovery of Fundamental Physical Laws: Learning Magnetohydrodynamics from 3D Turbulence Data | Matthew Golden et.al. | 2501.04094 | null |
2025-01-07 | NeRFs are Mirror Detectors: Using Structural Similarity for Multi-View Mirror Scene Reconstruction with 3D Surface Primitives | Leif Van Holland et.al. | 2501.04074 | link |
2025-01-07 | New Liouville type theorems for the stationary MHD equations in $\mathbb{R}^3$ | Wenke Tan et.al. | 2501.04059 | null |
2025-01-03 | Three-dimensional DtN-FEM scattering analysis of Lamb and SH guided waves by a symmetric cavity defect in an isotropic infinite plate | Chen Yang et.al. | 2501.04039 | null |
2024-12-27 | Virtual element methods based on boundary triangulation:fitted and unfitted meshes | Ruchi Guo et.al. | 2501.04021 | null |
2025-01-07 | LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving | Lingdong Kong et.al. | 2501.04005 | null |
2025-01-07 | LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes | Xiang Xu et.al. | 2501.04004 | link |
2025-01-07 | Extraction Of Cumulative Blobs From Dynamic Gestures | Rishabh Naulakha et.al. | 2501.04002 | null |
2025-01-07 | MAD-BA: 3D LiDAR Bundle Adjustment – from Uncertainty Modelling to Structure Optimization | Krzysztof Ćwian et.al. | 2501.03972 | null |
2025-01-07 | Thermally Adaptive Surface Microscopy for brain functional imaging | Hadrien L. M. Robert et.al. | 2501.03965 | null |
2025-01-07 | Global well-posedness and scattering for the massive Dirac-Klein-Gordon system in two dimensions | Ioan Bejenaru et.al. | 2501.03963 | null |
2025-01-10 | Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback | Jiakang Yuan et.al. | 2501.03916 | null |
2025-01-12 | SELMA3D challenge: Self-supervised learning for 3D light-sheet microscopy image segmentation | Ying Chen et.al. | 2501.03880 | null |
2025-01-07 | CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds | Keonwoo Kim et.al. | 2501.03879 | null |
2025-01-09 | Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control | Zekai Gu et.al. | 2501.03847 | link |
2025-01-07 | OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints | Mingjie Pan et.al. | 2501.03841 | null |
2025-01-07 | MeshConv3D: Efficient convolution and pooling operators for triangular 3D meshes | Germain Bregeon et.al. | 2501.03830 | null |
2025-01-07 | An innovative mixed reality approach for Robotics Surgery | Gabriela Rus et.al. | 2501.03819 | null |
2025-01-11 | 3D Printable Gradient Lattice Design for Multi-Stiffness Robotic Fingers | Siebe J. Schouten et.al. | 2501.03763 | null |
2025-01-07 | Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein | Xiaotong Guo et.al. | 2501.03722 | null |
2025-01-07 | MoDec-GS: Global-to-Local Motion Decomposition and Temporal Interval Adjustment for Compact Dynamic 3D Gaussian Splatting | Sangwoon Kwak et.al. | 2501.03714 | null |
2025-01-07 | AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features | Ruochen Zhang et.al. | 2501.03700 | null |
2025-01-16 | DehazeGS: Seeing Through Fog with 3D Gaussian Splatting | Jinze Yu et.al. | 2501.03659 | null |
2025-01-07 | Advancing the Understanding of Fine-Grained 3D Forest Structures using Digital Cousins and Simulation-to-Reality: Methods and Datasets | Jing Liu et.al. | 2501.03637 | null |
2025-01-07 | A 3D Continuous-Space Electromagnetic Channel Model for 6G Tri-Polarized Multi-user Communications | Yue Yang et.al. | 2501.03608 | null |
2025-01-07 | ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting | Yifeng Yang et.al. | 2501.03605 | link |
2025-01-07 | Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis | Haoran Lai et.al. | 2501.03565 | null |
2025-01-07 | 3D Single-shot CEST imaging at 3T Based on True FISP Readout | Yupeng Wu et.al. | 2501.03548 | null |
2025-01-07 | TexHOI: Reconstructing Textures of 3D Unknown Objects in Monocular Hand-Object Interaction Scenes | Alakh Aggarwal et.al. | 2501.03525 | link |
2025-01-07 | The Galactic Bulge exploration IV.: RR~Lyrae stars as traces of the Galactic bar – 3D and 5D analysis, extinction variation | Z. Prudil et.al. | 2501.03497 | link |
2025-01-07 | VOILA: Complexity-Aware Universal Segmentation of CT images by Voxel Interacting with Language | Zishuo Wan et.al. | 2501.03482 | link |
2025-01-07 | Interface reconstruction of adhering droplets for distortion correction using glare points and deep learning | Maximilian Dreisbach et.al. | 2501.03453 | null |
2025-01-06 | Compression of 3D Gaussian Splatting with Optimized Feature Planes and Standard Video Codecs | Soonbin Lee et.al. | 2501.03399 | null |
2025-01-12 | DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Generative Learning on 3D Meshes | Xuyang Wang et.al. | 2501.03397 | link |
2025-01-06 | Relative Quantum Gravity: Localized Gravity and the Swampland | Edoardo Anastasi et.al. | 2501.03310 | null |
2025-01-06 | Gaussian Masked Autoencoders | Jathushan Rajasegaran et.al. | 2501.03229 | null |
2025-01-06 | RW-Net: Enhancing Few-Shot Point Cloud Classification with a Wavelet Transform Projection-based Network | Haosheng Zhang et.al. | 2501.03221 | null |
2025-01-06 | MinD the gap: Membrane proteins form 3D patterns in a suspension of liposomes | Amélie Chardac et.al. | 2501.03179 | null |
2025-01-06 | MObI: Multimodal Object Inpainting Using Diffusion Models | Alexandru Buburuzan et.al. | 2501.03173 | null |
2025-01-06 | Predictions for Bottomonium from a Relativistic Screened Potential Model | Chaitanya Anil Bokade et.al. | 2501.03147 | null |
2025-01-06 | Probing Magnetism in Self-Assembled Organometallic Complexes using Kondo Spectroscopy | Wantong Huang et.al. | 2501.03104 | null |
2025-01-07 | Finite Element Analysis of Shear Lag Effect in Long-Span Single-Box Continuous Rigid Bridges | Zhaokun Shen et.al. | 2501.03093 | null |
2025-01-06 | Spectra of standing kink waves in loops and the effects of the lower solar atmosphere | Konstantinos Karampelas et.al. | 2501.03089 | null |
2025-01-06 | SGLDBench: A Benchmark Suite for Stress-Guided Lightweight 3D Designs | Junpeng Wang et.al. | 2501.03068 | null |
2025-01-06 | Shock and SEP Modeling Study for the 5 September 2022 SEP Event | A. Kouloumvakos et.al. | 2501.03066 | null |
2025-01-06 | On the numerical evaluation of wall shear stress using the finite element method | Jana Brunátová et.al. | 2501.02987 | link |
2025-01-06 | HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos | Jinglei Zhang et.al. | 2501.02973 | null |
2025-01-06 | Towards Quantitative Interpretation of 3D Atomic Force Microscopy at Solid-Liquid Interfaces | Qian Ai et.al. | 2501.02939 | null |
2025-01-06 | FRELLED Reloaded: Multiple techniques for astronomical data visualisation in Blender | Rhys Taylor et.al. | 2501.02919 | null |
2025-01-06 | Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis | Thang-Anh-Quan Nguyen et.al. | 2501.02913 | null |
2025-01-06 | Skillful High-Resolution Ensemble Precipitation Forecasting with an Integrated Deep Learning Framework | Shuangshuang He et.al. | 2501.02905 | null |
2025-01-06 | HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation | Wentian Qu et.al. | 2501.02845 | null |
2025-01-06 | Universal Features Guided Zero-Shot Category-Level Object Pose Estimation | Wentian Qu et.al. | 2501.02831 | null |
2025-01-07 | AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene | Chaoran Feng et.al. | 2501.02807 | null |
2025-01-06 | Constructing 4D Radio Map in LEO Satellite Networks with Limited Samples | Haoxuan Yuan et.al. | 2501.02775 | null |
2025-01-06 | WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation | Tianjian Jiang et.al. | 2501.02771 | null |
2025-01-06 | An Efficient Pre-Processing Method for 6G Dynamic Ray-Tracing Channel Modeling | Songjiang Yang et.al. | 2501.02747 | null |
2025-01-06 | The 3D energy-critical inhomogeneous nonlinear Schrodinger equation with strong singularity | Yoonjung Lee et.al. | 2501.02697 | null |
2025-01-05 | GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking | Weikang Bian et.al. | 2501.02690 | null |
2025-01-05 | Neural networks meet hyperelasticity: A monotonic approach | Dominik K. Klein et.al. | 2501.02670 | null |
2025-01-05 | The quest for a stable disk | J A Sellwood et.al. | 2501.02636 | null |
2025-01-07 | Rotatable Antenna Enabled Wireless Communication: Modeling and Optimization | Beixiong Zheng et.al. | 2501.02595 | null |
2025-01-05 | Decoding fMRI Data into Captions using Prefix Language Modeling | Vyacheslav Shen et.al. | 2501.02570 | link |
2025-01-05 | AHMSA-Net: Adaptive Hierarchical Multi-Scale Attention Network for Micro-Expression Recognition | Lijun Zhang et.al. | 2501.02539 | null |
2025-01-05 | Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors | Minglin Chen et.al. | 2501.02519 | null |
2025-01-05 | VLT VIMOS Integral Field Spectroscopy of the nova remnant FH Ser | M. A. Guerrero et.al. | 2501.02501 | null |
2025-01-05 | EOG Communication Interface for Quadriplegics: Prototype & Signal Processing | Aniket Raj et.al. | 2501.02465 | null |
2025-01-05 | Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera | Yuliang Guo et.al. | 2501.02464 | link |
2025-01-05 | Blockage-Aware UAV-Assisted Wireless Data Harvesting With Building Avoidance | Gitae Park et.al. | 2501.02453 | null |
2025-01-05 | Efficient periodic density functional theory calculations of charged molecules and surfaces using Coulomb kernel truncation | Sudarshan Vijay et.al. | 2501.02435 | null |
2025-01-05 | Electron-Phonon Temperature Inversion in Nanostructures under Pulsed Photoexcitation | Qian Ye et.al. | 2501.02415 | null |
2025-01-05 | Journey into Automation: Image-Derived Pavement Texture Extraction and Evaluation | Bingjie Lu et.al. | 2501.02414 | null |
2025-01-04 | A Four-dimensional Gauge Theory Perspective on Quantum K-theory | M. Nouman Muteeb et.al. | 2501.02394 | null |
2025-01-04 | A novel 3D sampling method of geological rock-core using X-ray fluorescence | Alexandru Enciu et.al. | 2501.02366 | null |
2025-01-04 | V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection | Sichao Wang et.al. | 2501.02363 | link |
2025-01-13 | KD-MSLRT: Lightweight Sign Language Recognition Model Based on Mediapipe and 3D to 1D Knowledge Distillation | Yulong Li et.al. | 2501.02321 | null |
2025-01-04 | RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar | Liye Jia et.al. | 2501.02314 | link |
2025-01-04 | Vortices and rotating solitons in ultralight dark matter | Philippe Brax et.al. | 2501.02297 | null |
2025-01-07 | Hyperbolic Contrastive Learning for Hierarchical 3D Point Cloud Embedding | Yingjie Liu et.al. | 2501.02285 | null |
2025-01-04 | Covering Underwater Shadow Zones using Acoustic Reconfigurable Intelligent Surfaces | Longfei Zhao et.al. | 2501.02256 | null |
2025-01-04 | IMUFace: Real-Time, Low-Power, Continuous 3D Facial Reconstruction Through Earphones | Xianrong Yao et.al. | 2501.02177 | null |
2025-01-04 | Multifractal Terrain Generation for Evaluating Autonomous Off-Road Ground Vehicles | Casey D. Majhor et.al. | 2501.02172 | null |
2025-01-04 | Unveiling the gap between continuous and discrete adjoint lattice Boltzmann methods | Ji-Wang Luo et.al. | 2501.02161 | null |
2025-01-03 | SafeAug: Safety-Critical Driving Data Augmentation from Naturalistic Datasets | Zhaobin Mo et.al. | 2501.02143 | null |
2025-01-03 | 3D Cloud reconstruction through geospatially-aware Masked Autoencoders | Stella Girtsou et.al. | 2501.02035 | null |
2025-01-02 | Distribution of regularized three-body phase-volume | Yogesh Dandekar et.al. | 2501.02013 | null |
2025-01-01 | SmartSpatial: Enhancing the 3D Spatial Arrangement Capabilities of Stable Diffusion Models and Introducing a Novel 3D Spatial Evaluation Framework | Mao Xun Huang et.al. | 2501.01998 | null |
2025-01-01 | Exact solution of two-dimensional (2D) Ising model with a transverse field: a low-dimensional quantum spin system | Zhidong Zhang et.al. | 2501.01997 | null |
2025-01-03 | VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment | Wenyan Cong et.al. | 2501.01949 | link |
2025-01-03 | Quasi-topological fractons: a 3D dipolar gauge theory | Erica Bertolini et.al. | 2501.01944 | null |
2025-01-03 | GoBERT: Gene Ontology Graph Informed BERT for Universal Gene Function Prediction | Yuwei Miao et.al. | 2501.01930 | null |
2025-01-03 | High critical field superconductivity in a 3d dominated lightweight equiatomic high entropy alloy | S. Jangid et.al. | 2501.01887 | null |
2025-01-03 | On Shilnikov’s scenario in 3D: Topological chaos for vectorfields of class $C^1$ | Hans-Otto Walther et.al. | 2501.01878 | null |
2025-01-03 | JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing | Qili Wang et.al. | 2501.01798 | link |
2025-01-03 | TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation | Jiajie Liu et.al. | 2501.01770 | link |
2025-01-03 | Dynamic wetting of concentrated granular suspensions | Reza Azizmalayeri et.al. | 2501.01762 | null |
2025-01-03 | Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery | Baoru Huang et.al. | 2501.01752 | null |
2025-01-03 | Multi-modal classification of forest biodiversity potential from 2D orthophotos and 3D airborne laser scanning point clouds | Simon B. Jensen et.al. | 2501.01728 | null |
2025-01-03 | AR4D: Autoregressive 4D Generation from Monocular Videos | Hanxin Zhu et.al. | 2501.01722 | null |
2025-01-03 | KeyNode-Driven Geometry Coding for Real-World Scanned Human Dynamic Mesh Compression | Huong Hoang et.al. | 2501.01717 | null |
2025-01-03 | Cloth-Splatting: 3D Cloth State Estimation from RGB Supervision | Alberta Longhini et.al. | 2501.01715 | null |
2025-01-03 | CrossView-GS: Cross-view Gaussian Splatting For Large-scale Scene Reconstruction | Chenhao Zhang et.al. | 2501.01695 | null |
2025-01-03 | PG-SAG: Parallel Gaussian Splatting for Fine-Grained Large-Scale Urban Buildings Reconstruction via Semantic-Aware Grouping | Tengfei Wang et.al. | 2501.01677 | link |
2025-01-03 | iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings | Shuhei Tomoshige et.al. | 2501.01642 | null |
2025-01-03 | Data Parallel Visualization and Rendering on the RAMSES Supercomputer with ANARI | Stefan Zellmann et.al. | 2501.01628 | null |
2025-01-03 | Few-shot Implicit Function Generation via Equivariance | Suizhi Huang et.al. | 2501.01601 | null |
2025-01-02 | Indoor Position and Attitude Tracking with SO(3) Manifold | Hammam Salem et.al. | 2501.01555 | null |
2025-01-02 | C $_{60}$ building blocks with tuneable structures for tailored functionalities | Darius Kayley et.al. | 2501.01494 | null |
2025-01-02 | A 22-Billion Solar Mass Black Hole in Holmberg 15A with Keck KCWI Spectroscopy and Triaxial Orbit Modeling | Emily R. Liepold et.al. | 2501.01493 | null |
2024-12-31 | Tech Report: Divide and Conquer 3D Real-Time Reconstruction for Improved IGS | Yicheng Zhu et.al. | 2501.01465 | null |
2024-12-30 | LS-GAN: Human Motion Synthesis with Latent-space GANs | Avinash Amballa et.al. | 2501.01449 | null |
2025-01-09 | GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models | Zhangyang Qi et.al. | 2501.01428 | link |
2025-01-02 | R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization | Xudong Jiang et.al. | 2501.01421 | link |
2025-01-02 | On Unifying Video Generation and Camera Pose Estimation | Chun-Hao Paul Huang et.al. | 2501.01409 | null |
2025-01-02 | nnY-Net: Swin-NeXt with Cross-Attention for 3D Medical Images Segmentation | Haixu Liu et.al. | 2501.01406 | null |
2025-01-02 | Learning 3D Garment Animation from Trajectories of A Piece of Cloth | Yidi Shao et.al. | 2501.01393 | link |
2025-01-02 | ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding | Austin T. Wang et.al. | 2501.01366 | null |
2025-01-02 | HybridTrack: A Hybrid Approach for Robust Multi-Object Tracking | Leandro Di Bella et.al. | 2501.01275 | link |
2025-01-02 | Detail Matters: Mamba-Inspired Joint Unfolding Network for Snapshot Spectral Compressive Imaging | Mengjie Qin et.al. | 2501.01262 | link |
2025-01-02 | Revealing diatom-inspired materials multifunctionality | Ludovico Musenich et.al. | 2501.01229 | null |
2025-01-02 | Range-Only Localization System for Small-Scale Flapping-Wing Robots | Raul Tapia et.al. | 2501.01213 | link |
2025-01-02 | D-HAT: a Diatom-inspired structure for a Helmet concept Against Trauma | Ludovico Musenich et.al. | 2501.01211 | null |
2025-01-02 | Empirical Analysis of Nature-Inspired Algorithms for Autism Spectrum Disorder Detection Using 3D Video Dataset | Aneesh Panchal et.al. | 2501.01202 | null |
2025-01-02 | L3D-Pose: Lifting Pose for 3D Avatars from a Single Camera in the Wild | Soumyaratna Debnath et.al. | 2501.01174 | null |
2025-01-02 | 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer | Jiajun Deng et.al. | 2501.01163 | null |
2025-01-02 | Three-dimensional Helical-rotating Plasma Structures in Beam-generated Partially Magnetized E $\times$ B Plasmas | Jian Chen et.al. | 2501.01161 | null |
2025-01-02 | Leverage Cross-Attention for End-to-End Open-Vocabulary Panoptic Reconstruction | Xuan Yu et.al. | 2501.01119 | null |
2025-01-06 | On Computational Complexity of 3D Ising Spin Glass: Lessons from D-Wave Annealer | Hao Zhang et.al. | 2501.01107 | null |
2025-01-02 | Deformable Gaussian Splatting for Efficient and High-Fidelity Reconstruction of Surgical Scenes | Jiwei Shan et.al. | 2501.01101 | null |
2025-01-08 | Time Difference of Arrival Source Localization: Exact Linear Solutions for the General 3D Problem | Niraj K. Inamdar et.al. | 2501.01076 | null |
2025-01-02 | TS-SatMVSNet: Slope Aware Height Estimation for Large-Scale Earth Terrain Multi-view Stereo | Song Zhang et.al. | 2501.01049 | null |
2025-01-02 | MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception | Xiaoshuai Hao et.al. | 2501.01037 | null |
2025-01-02 | Incomplete Data Multi-Source Static Computed Tomography Reconstruction with Diffusion Priors and Implicit Neural Representation | Ziju Shen et.al. | 2501.01013 | null |
2025-01-02 | EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy | Ao Gao et.al. | 2501.01003 | null |
2025-01-02 | Photometric Objects Around Cosmic Webs (PAC). VII. Disentangling Mass and Environment Quenching with the Aid of Galaxy-halo Connection in Simulations | Yun Zheng et.al. | 2501.00986 | null |
2025-01-01 | Towards End-to-End Neuromorphic Voxel-based 3D Object Reconstruction Without Physical Priors | Chuanzhi Xu et.al. | 2501.00741 | null |
2025-01-01 | Spin Hall effect in 3d ferromagnetic metals for field-free switching of perpendicular magnetization: A first-principles investigation | Fanxing Zheng et.al. | 2501.00737 | null |
2024-12-31 | Measuring the effective stress parameter using the multiphase lattice Boltzmann method and investigating the source of its hysteresis | Reihaneh Hosseini et.al. | 2501.00661 | null |
2025-01-04 | Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models | Suttisak Wizadwongsa et.al. | 2501.00651 | null |
2024-12-31 | SoundBrush: Sound as a Brush for Visual Scene Editing | Kim Sung-Bin et.al. | 2501.00645 | null |
2025-01-07 | Gaussian Building Mesh (GBM): Extract a Building’s 3D Mesh with Google Earth and Gaussian Splatting | Kyle Gao et.al. | 2501.00625 | null |
2024-12-31 | A Study on Context Length and Efficient Transformers for Biomedical Image Analysis | Sarah M. Hooper et.al. | 2501.00619 | null |
2024-12-31 | STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes | Jiawei Yang et.al. | 2501.00602 | null |
2025-01-03 | DreamDrive: Generative 4D Scene Modeling from Street View Images | Jiageng Mao et.al. | 2501.00601 | null |
2024-12-31 | H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters | Pedram Fekri et.al. | 2501.00514 | null |
2024-12-31 | Angle-resolved photoemission of topological materials | Jaime Sánchez-Barriga et.al. | 2501.00497 | null |
2024-12-31 | Enhanced Conformal $BMS_3$ Symmetries | Oscar Fuentealba et.al. | 2501.00439 | null |
2025-01-09 | Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding | Yue Fan et.al. | 2501.00358 | null |
2024-12-31 | PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM | Runnan Chen et.al. | 2501.00352 | null |
2024-12-31 | SG-Splatting: Accelerating 3D Gaussian Splatting with Spherical Gaussians | Yiwen Wang et.al. | 2501.00342 | null |
2024-12-31 | Spin waves in magnetic nanodisks, nanorings, and 3D nanovolcanoes | Oleksandr Dobrovolskiy et.al. | 2501.00333 | null |
2024-12-31 | OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies | Runnan Chen et.al. | 2501.00326 | null |
2024-12-31 | Spatio-Temporal Multi-Subgraph GCN for 3D Human Motion Prediction | Jiexin Wang et.al. | 2501.00317 | null |
2024-12-31 | DecoratingFusion: A LiDAR-Camera Fusion Network with the Combination of Point-level and Feature-level Fusion | Zixuan Yin et.al. | 2501.00220 | null |
2024-12-31 | 3D Carrollian gravity from 2D Euclidean symmetry | Patrick Concha et.al. | 2501.00205 | null |
2024-12-30 | A Functional Human Liver Tissue Model: 3D Bioprinted Co-culture Discoids | Vignesh Subramaniam et.al. | 2501.00086 | null |
2024-12-30 | Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors | Niels Justesen et.al. | 2501.00078 | null |
2024-12-30 | PERSE: Personalized 3D Generative Avatars from A Single Portrait | Hyunsoo Cha et.al. | 2412.21206 | null |
2024-12-30 | Topological Responses of the Standard Model Gauge Group | Zheyan Wan et.al. | 2412.21196 | null |
2024-12-30 | What Makes for a Good Stereoscopic Image? | Netanel Y. Tamir et.al. | 2412.21127 | null |
2025-01-02 | Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation | Yuanbo Yang et.al. | 2412.21117 | null |
2024-12-30 | Comparative Analysis of 2D and 3D ResNet Architectures for IDH and MGMT Mutation Detection in Glioma Patients | Danial Elyassirad et.al. | 2412.21091 | null |
2024-12-30 | 3d $\mathcal{N}=4$ Mirror Symmetry, TQFTs, and ‘t Hooft Anomaly Matching | Mahesh K. N. Balasubramanian et.al. | 2412.21066 | null |
2024-12-30 | UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI | Fangwei Zhong et.al. | 2412.20977 | null |
2024-12-30 | FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI | Zhengdong Li et.al. | 2412.20974 | null |
2024-12-30 | Dimensional Resonance Theory: An Evolutionary Approach to Universal Rest | Andre Carnevali da Silva et.al. | 2412.20961 | null |
2024-12-30 | TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation | Shaoqing Xu et.al. | 2412.20911 | link |
2024-12-30 | LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training | Fardin Ayar et.al. | 2412.20881 | null |
2024-12-30 | A rational design method for the Nagoya type-III antenna | Daniele Iannarelli et.al. | 2412.20839 | null |
2024-12-30 | KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences | Keng-Wei Chang et.al. | 2412.20767 | null |
2024-12-30 | Scaling Limit and Large Deviation for 3D Globally Modified Stochastic Navier-Stokes Equations with Transport Noise | Chang Liu et.al. | 2412.20752 | null |
2024-12-30 | 4D Gaussian Splatting: Modeling Dynamic Scenes with Native 4D Primitives | Zeyu Yang et.al. | 2412.20720 | null |
2024-12-29 | Crossover of Critical Behavior in Dynamic Phase Transitions of Ferromagnetic Thin Films | Erol Vatansever et.al. | 2412.20579 | null |
2024-12-29 | Ns3 meets Sionna: Using Realistic Channels in Network Simulation | Anatolij Zubow et.al. | 2412.20524 | null |
2024-12-29 | MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks | Yifei Liu et.al. | 2412.20522 | link |
2024-12-29 | Symbiotic novae | Ulisse Munari et.al. | 2412.20499 | null |
2024-12-29 | MR-Occ: Efficient Camera-LiDAR 3D Semantic Occupancy Prediction Using Hierarchical Multi-Resolution Voxel Representation | Minjae Seong et.al. | 2412.20480 | null |
2024-12-29 | Toward Scene Graph and Layout Guided Complex 3D Scene Generation | Yu-Hsiang Huang et.al. | 2412.20473 | null |
2024-12-29 | JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling | Haorui Ji et.al. | 2412.20470 | null |
2024-12-29 | $S^1$ reduction of 4D $\mathcal{N}=4$ Schur index and 3D $\mathcal{N}=8$ mass-deformed partition function | Tomoki Nakanishi et.al. | 2412.20452 | null |
2024-12-29 | Bringing Objects to Life: 4D generation from 3D objects | Ohad Rahamim et.al. | 2412.20422 | null |
2024-12-29 | Open-Sora: Democratizing Efficient Video Production for All | Zangwei Zheng et.al. | 2412.20404 | link |
2024-12-29 | Protein Structure Prediction in the 3D HP Model Using Deep Reinforcement Learning | Giovanny Espitia et.al. | 2412.20329 | null |
2024-12-29 | Hybrid Feedback Control for Global Navigation with Locally Optimal Obstacle Avoidance in n-Dimensional Spaces | Ishak Cheniouni et.al. | 2412.20320 | null |
2024-12-29 | Narrowband parallel coherent LiDAR with frequency interleaving | Long Wang et.al. | 2412.20311 | null |
2024-12-29 | All meromorphic solutions of a 3D Lotka-Volterra system: detecting partial integrability | Techheang Meng et.al. | 2412.20304 | null |
2024-12-28 | Advances in Additive Manufacturing of 3D-segmented Plastic Scintillator Detectors for Particle Tracking and Calorimetry | Umut Kose et.al. | 2412.20267 | null |
2024-12-28 | Geo-ConvGRU: Geographically Masked Convolutional Gated Recurrent Unit for Bird-Eye View Segmentation | Guanglei Yang et.al. | 2412.20171 | null |
2024-12-28 | Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses | Xinru Wen et.al. | 2412.20154 | null |
2024-12-28 | DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis | Kaijun Deng et.al. | 2412.20148 | link |
2024-12-28 | Self-similarity on 4d cubic lattice | Igor G. Korepanov et.al. | 2412.20140 | null |
2024-12-28 | A finite strain model for fiber angle plasticity of textile fabrics based on isogeometric shell finite elements | Thang Xuan Duong et.al. | 2412.20131 | null |
2024-12-28 | Topological Gauge Theories with Sixteen Supercharges: Higher $A_\infty$ -categorification of Floer Homologies | Arif Er et.al. | 2412.20067 | null |
2024-12-28 | GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting | Atticus J. Zeller et.al. | 2412.20056 | link |
2024-12-28 | A Family of Vertex Operator Algebras from Argyres-Douglas Theory | Heeyeon Kim et.al. | 2412.20015 | null |
2024-12-28 | From Generalist to Specialist: A Survey of Large Language Models for Chemistry | Yang Han et.al. | 2412.19994 | link |
2024-12-28 | An Overview of Cellular ISAC for Low-Altitude UAV: New Opportunities and Challenges | Yuxuan Song et.al. | 2412.19973 | null |
2024-12-27 | Not all Views are Created Equal: Analyzing Viewpoint Instabilities in Vision Foundation Models | Mateusz Michalkiewicz et.al. | 2412.19920 | null |
2024-12-26 | UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control | Wenzhang Sun et.al. | 2412.19860 | null |
2024-12-25 | 3D Face Reconstruction With Geometry Details From a Single Color Image Under Occluded Scenes | Dapeng Zhao et.al. | 2412.19849 | null |
2024-12-25 | Generative Landmarks Guided Eyeglasses Removal 3D Face Reconstruction | Dapeng Zhao et.al. | 2412.19848 | null |
2024-12-27 | A local automaton for the 2D toric code | Shankar Balasubramanian et.al. | 2412.19803 | null |
2024-12-27 | Classification of Minimal Abelian Coulomb Branches | Antoine Bourget et.al. | 2412.19766 | null |
2024-12-27 | Sharpening Neural Implicit Functions with Frequency Consolidation Priors | Chao Chen et.al. | 2412.19720 | link |
2024-12-27 | ProKAN: Progressive Stacking of Kolmogorov-Arnold Networks for Efficient Liver Segmentation | Bhavesh Gyanchandani et.al. | 2412.19713 | null |
2024-12-27 | A Review on the Integration of Artificial Intelligence and Medical Imaging in IVF Ovarian Stimulation | Jana Zakall et.al. | 2412.19688 | null |
2024-12-27 | Optimizing Local-Global Dependencies for Accurate 3D Human Pose Estimation | Guangsheng Xu et.al. | 2412.19676 | link |
2024-12-27 | CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs | Siyu Wang et.al. | 2412.19663 | null |
2024-12-27 | xFLIE: Leveraging Actionable Hierarchical Scene Representations for Autonomous Semantic-Aware Inspection Missions | Vignesh Kottayam Viswanathan et.al. | 2412.19571 | link |
2024-12-27 | Safe Interval Randomized Path Planing For Manipulators | Nuraddin Kerimov et.al. | 2412.19567 | link |
2024-12-30 | Multiplicative Chern insulator | Archi Banerjee et.al. | 2412.19566 | null |
2024-12-27 | Contrast-Optimized Basis Functions for Self-Navigated Motion Correction in Quantitative MRI | Elisa Marchetto et.al. | 2412.19552 | null |
2024-12-27 | Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images | Xudong Cai et.al. | 2412.19518 | null |
2024-12-27 | Learning Radiance Fields from a Single Snapshot Compressive Image | Yunhao Li et.al. | 2412.19483 | null |
2024-12-30 | DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Yiyuan Liang et.al. | 2412.19458 | link |
2024-12-27 | Multi-scale Latent Point Consistency Models for 3D Shape Generation | Bi’an Du et.al. | 2412.19413 | null |
2025-01-05 | BeSplat: Gaussian Splatting from a Single Blurry Image and Event Stream | Gopi Raju Matta et.al. | 2412.19370 | null |
2024-12-26 | Habitability in 4-D: Predicting the Climates of Earth Analogs across Rotation and Orbital Configurations | Arthur D. Adams et.al. | 2412.19357 | link |
2024-12-26 | Flat panel laser displays enabled by large-scale visible photonic integrated circuits | Zhujun Shi et.al. | 2412.19274 | null |
2024-12-26 | Primordial Power Spectrum of Five Dimensional Uniform Inflation | Luis A. Anchordoqui et.al. | 2412.19213 | null |
2024-12-26 | Suppression of blow-up for the 3D Patlak-Keller-Segel-Navier-Stokes system via the Couette flow | Shikun Cui et.al. | 2412.19197 | null |
2024-12-26 | An End-to-End Depth-Based Pipeline for Selfie Image Rectification | Ahmed Alhawwary et.al. | 2412.19189 | null |
2024-12-26 | Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement | Qiude Zhang et.al. | 2412.19165 | null |
2024-12-26 | Generating Editable Head Avatars with 3D Gaussian GANs | Guohao Li et.al. | 2412.19149 | link |
2024-12-26 | CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting | Siyu Jiao et.al. | 2412.19142 | null |
2024-12-26 | MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo | Byeonggwon Lee et.al. | 2412.19130 | null |
2024-12-26 | Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos | Changwoon Choi et.al. | 2412.19089 | null |
2024-12-26 | Imperceptible Adversarial Attacks on Point Clouds Guided by Point-to-Surface Field | Keke Tang et.al. | 2412.19015 | null |
2024-12-25 | TravelAgent: Generative Agents in the Built Environment | Ariel Noyman et.al. | 2412.18985 | null |
2024-12-25 | Generative Face Parsing Map Guided 3D Face Reconstruction Under Occluded Scenes | Dapeng Zhao et.al. | 2412.18920 | null |
2024-12-30 | HV-BEV: Decoupling Horizontal and Vertical Feature Sampling for Multi-View 3D Object Detection | Di Wu et.al. | 2412.18884 | null |
2024-12-25 | MotionMap: Representing Multimodality in Human Pose Forecasting | Reyhaneh Hosseininejad et.al. | 2412.18883 | link |
2024-12-25 | TSceneJAL: Joint Active Learning of Traffic Scenes for 3D Object Detection | Chenyang Lei et.al. | 2412.18870 | link |
2024-12-30 | WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian Splatting | Chenghao Qian et.al. | 2412.18862 | link |
2024-12-25 | Federated Learning with Partially Labeled Data: A Conditional Distillation Approach | Pochuan Wang et.al. | 2412.18833 | null |
2024-12-25 | GSAVS: Gaussian Splatting-based Autonomous Vehicle Simulator | Rami Wilson et.al. | 2412.18816 | null |
2024-12-25 | ArtNVG: Content-Style Separated Artistic Neighboring-View Gaussian Stylization | Zixiao Gu et.al. | 2412.18783 | null |
2024-12-25 | MRI Reconstruction with Regularized 3D Diffusion Model (R3DM) | Arya Bangun et.al. | 2412.18723 | null |
2025-01-09 | STITCH: Surface reconstrucTion using Implicit neural representations with Topology Constraints and persistent Homology | Anushrut Jignasu et.al. | 2412.18696 | null |
2024-12-24 | Computational Assessment of Turbulent Eddy Impact on Hydrodynamic Mixing in a Stirred Tank Bioreactor with Vent based Impellers | Ayodele James Oyejide et.al. | 2412.18660 | null |
2024-12-21 | Tuning Nonlinear Elastic Materials under Small and Large Deformations | Huanyu Chen et.al. | 2412.18631 | null |
2024-12-29 | PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models | Minghao Chen et.al. | 2412.18608 | null |
2024-12-24 | Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models | Zehan Wang et.al. | 2412.18605 | link |
2024-12-24 | ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation | Hongjie Li et.al. | 2412.18600 | null |
2024-12-24 | DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Minghong Cai et.al. | 2412.18597 | link |
2024-12-24 | Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation | Anselm Krainovic et.al. | 2412.18584 | null |
2024-12-24 | 3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement | Yihang Luo et.al. | 2412.18565 | null |
2024-12-25 | 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding | Tatiana Zemskova et.al. | 2412.18450 | link |
2024-12-24 | Normalized field product approach: A parameter-free density evaluation method for close-to-binary solutions in topology optimization with embedded length scale | Nikhil Singh et.al. | 2412.18441 | null |
2024-12-24 | Field-free current-induced magnetization switching of a room temperature van der Waals magnet for neuromorphic computing | Chenxi Zhou et.al. | 2412.18429 | null |
2024-12-24 | Ultra-Low Complexity On-Orbit Compression for Remote Sensing Imagery via Block Modulated Imaging | Zhibin Wang et.al. | 2412.18417 | link |
2025-01-04 | Detectorless 3D terahertz imaging: achieving subwavelength resolution with reflectance confocal interferometric microscopy | Jorge Silva et.al. | 2412.18403 | link |
2024-12-24 | Agreement of Image Quality Metrics with Radiological Evaluation in the Presence of Motion Artifacts | Elisa Marchetto et.al. | 2412.18389 | null |
2024-12-24 | MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs | Qiuyi Gu et.al. | 2412.18381 | link |
2024-12-24 | RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis | Yiling Yao et.al. | 2412.18380 | null |
2024-12-24 | Point-DeepONet: A Deep Operator Network Integrating PointNet for Nonlinear Analysis of Non-Parametric 3D Geometries and Load Conditions | Jangseop Park et.al. | 2412.18362 | link |
2024-12-24 | Low-temperature mean valence of nickel ions in pressurized La $_3$Ni$_2$O$_7$ | Shu Cai et.al. | 2412.18343 | null |
2025-01-03 | Extended Near Horizon Symmetries of Extremal BTZ Black Holes in 3D Massive Gravity | Debojyoti Ballav et.al. | 2412.18286 | null |
2024-12-24 | AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction | Pufan Zou et.al. | 2412.18255 | null |
2024-12-24 | Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing | Suwesh Prasad Sah et.al. | 2412.18165 | link |
2024-12-24 | DepthLab: From Partial to Complete | Zhiheng Liu et.al. | 2412.18153 | null |
2024-12-24 | UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision | Yuru Wang et.al. | 2412.18131 | null |
2024-12-24 | A Review of 3D Particle Tracking and Flow Diagnostics Using Digital Holography | Shyam Kumar M et.al. | 2412.18094 | null |
2024-12-23 | ArchComplete: Autoregressive 3D Architectural Design Generation with Hierarchical Diffusion-Based Upsampling | S. Rasoulzadeh et.al. | 2412.17957 | link |
2024-12-23 | Adaptive Signal Analysis for Automated Subsurface Defect Detection Using Impact Echo in Concrete Slabs | Deepthi Pavurala et.al. | 2412.17953 | null |
2024-12-23 | Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning | Orson Mengara et.al. | 2412.17908 | null |
2024-12-23 | Coulomb Branches in 3d $\mathcal{N} = 4$ Revisited | Spencer Tamagni et.al. | 2412.17904 | null |
2024-12-18 | Constraint-Based Model in Multimodal Learning to Improve Ventricular Arrhythmia Prediction | Evariste Njomgue Fotso et.al. | 2412.17840 | null |
2024-12-23 | FaceLift: Single Image to 3D Head with View Generation and GS-LRM | Weijie Lyu et.al. | 2412.17812 | null |
2024-12-28 | ChatGarment: Garment Estimation, Generation and Editing via Large Language Models | Siyuan Bian et.al. | 2412.17811 | null |
2024-12-24 | Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders | Rui Chen et.al. | 2412.17808 | link |
2024-12-23 | Large Motion Video Autoencoding with Cross-modal Video VAE | Yazhou Xing et.al. | 2412.17805 | null |
2024-12-23 | Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking | Yun Liu et.al. | 2412.17730 | null |
2024-12-23 | GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance | Jingqiu Zhou et.al. | 2412.17715 | null |
2024-12-23 | Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions | Huaxu He et.al. | 2412.17654 | null |
2024-12-24 | LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding | Hao Li et.al. | 2412.17635 | null |
2024-12-23 | Mixing and Geometry in the North Atlantic Meridional Overturning Circulation | Renzo Bruera et.al. | 2412.17615 | null |
2024-12-23 | CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Yuanyuan Gao et.al. | 2412.17612 | null |
2024-12-23 | V $^2$ -SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy | Long Bai et.al. | 2412.17595 | null |
2025-01-04 | S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field | Zixi Liang et.al. | 2412.17561 | link |
2024-12-23 | Exploring Dynamic Novel View Synthesis Technologies for Cinematography | Adrian Azzarelli et.al. | 2412.17532 | null |
2024-12-23 | Assessment of Deep-Learning Methods for the Enhancement of Experimental Low Dose Dental CBCT Volumes | Louise Friot–Giroux et.al. | 2412.17423 | null |
2024-12-23 | TSformer: A Non-autoregressive Spatial-temporal Transformers for 30-day Ocean Eddy-Resolving Forecasting | Guosong Wang et.al. | 2412.17392 | null |
2024-12-23 | PointVoxelFormer – Reviving point cloud networks for 3D medical imaging | Mattias Paul Heinrich et.al. | 2412.17390 | null |
2025-01-08 | Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling | Hao Gui et.al. | 2412.17378 | null |
2024-12-23 | A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions | Youliang Zhang et.al. | 2412.17377 | null |
2024-12-23 | DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2412.17350 | link |
2024-12-23 | Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective | Kaifang Long et.al. | 2412.17297 | null |
2024-12-23 | LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation | Riku Uemura et.al. | 2412.17282 | null |
2024-12-23 | A Coalition Game for On-demand Multi-modal 3D Automated Delivery System | Farzan Moosavi et.al. | 2412.17252 | null |
2024-12-23 | OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving | Tianyi Yan et.al. | 2412.17226 | null |
2024-12-25 | Analytic 3D vector non-uniform Fourier crystal optics in arbitrary $\bar{\bar{\varepsilon}}$ dielectric | Chenzhu Xie et.al. | 2412.17224 | null |
2024-12-22 | Foundation Model for Lossy Compression of Spatiotemporal Scientific Data | Xiao Li et.al. | 2412.17184 | null |
2024-12-22 | Nonlinear stage of modulational instability in repulsive two-component Bose-Einstein condensates | S. Mossman et.al. | 2412.17083 | null |
2024-12-22 | An OpenMind for 3D medical vision self-supervised learning | Tassilo Wald et.al. | 2412.17041 | link |
2024-12-22 | HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories | Eric Hedlin et.al. | 2412.17040 | null |
2024-12-22 | Optical evidence of the band reconstruction during the charge-density wave transition in annealed Kagome magnet FeGe | A. Zhang et.al. | 2412.17020 | null |
2024-12-22 | InterDance:Reactive 3D Dance Generation with Realistic Duet Interactions | Ronghui Li et.al. | 2412.16982 | null |
2024-12-22 | GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs | Xingrui Wang et.al. | 2412.16932 | link |
2024-12-22 | 3D Radiative MHD Simulations of Starspots II: Large-scale Structure | Tanayveer Singh Bhatia et.al. | 2412.16921 | null |
2024-12-22 | TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction | Xuying Zhang et.al. | 2412.16919 | null |
2024-12-22 | Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression | Shaofei Huang et.al. | 2412.16889 | link |
2024-12-29 | SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera | Yuhang He et.al. | 2412.16861 | null |
2024-12-22 | Application of 3D U-Net Neural Networks in Extracting the Epoch of Reionization Signal from SKA-Low Observations Based on Real Observations of NCP Field from LOFAR | Li-Yang Gao et.al. | 2412.16853 | null |
2024-12-22 | Technical Report: Towards Spatial Feature Regularization in Deep-Learning-Based Array-SAR Reconstruction | Yu Ren et.al. | 2412.16828 | null |
2024-12-22 | GeoTexDensifier: Geometry-Texture-Aware Densification for High-Quality Photorealistic 3D Gaussian Splatting | Hanqing Jiang et.al. | 2412.16809 | null |
2024-12-21 | RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing | Zhipeng Huang et.al. | 2412.16778 | null |
2024-12-21 | DMesh++: An Efficient Differentiable Mesh for Complex Shapes | Sanghyun Son et.al. | 2412.16776 | null |
2024-12-21 | Janus and RG-interfaces in minimal 3d gauged supergravity | Michael Gutperle et.al. | 2412.16749 | null |
2024-12-21 | EasyVis2: A Real Time Multi-view 3D Visualization for Laparoscopic Surgery Training Enhanced by a Deep Neural Network YOLOv8-Pose | Yung-Hong Sun et.al. | 2412.16742 | null |
2024-12-21 | LUCES-MV: A Multi-View Dataset for Near-Field Point Light Source Photometric Stereo | Fotios Logothetis et.al. | 2412.16737 | null |
2024-12-21 | GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space | Souhaib Attaiki et.al. | 2412.16717 | null |
2024-12-21 | Generalizable Articulated Object Perception with Superpoints | Qiaojun Yu et.al. | 2412.16656 | null |
2024-12-21 | An explainable operator approximation framework under the guideline of Green’s function | Jianghang Gu et.al. | 2412.16644 | link |
2024-12-21 | Three-dimensional nucleation and growth of deformation twins in magnesium | Sangwon Lee et.al. | 2412.16640 | null |
2024-12-21 | Performance evaluation of mixed-precision Runge-Kutta methods for the solution of partial differential equations | Ivo Dravins et.al. | 2412.16638 | null |
2024-12-25 | Topology-Aware 3D Gaussian Splatting: Leveraging Persistent Homology for Optimized Structural Integrity | Tianqi Shen et.al. | 2412.16619 | link |
2024-12-21 | OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities | Suyoung Lee et.al. | 2412.16604 | null |
2024-12-21 | Effective and Efficient Representation Learning for Flight Trajectories | Shuo Liu et.al. | 2412.16581 | link |
2024-12-21 | A Generalizable 3D Diffusion Framework for Low-Dose and Few-View Cardiac SPECT | Huidong Xie et.al. | 2412.16573 | null |
2024-12-21 | Non-uniqueness of Leray–Hopf solutions for the $3D$ fractional Navier–Stokes equations perturbed by transport noise | Theresa Lange et.al. | 2412.16532 | null |
2024-12-21 | Context-Aware Outlier Rejection for Robust Multi-View 3D Tracking of Similar Small Birds in An Outdoor Aviary | Keon Moradi et.al. | 2412.16511 | link |
2024-12-21 | Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality | Liyan Chen et.al. | 2412.16481 | null |
2024-12-21 | Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions | Sijia Jiang et.al. | 2412.16467 | link |
2024-12-21 | Exploring Noncollinear Magnetic Energy Landscapes with Bayesian Optimization | Jakob Baumsteiger et.al. | 2412.16433 | null |
2024-12-20 | VerSe: Integrating Multiple Queries as Prompts for Versatile Cardiac MRI Segmentation | Bangwei Guo et.al. | 2412.16381 | link |
2024-12-24 | A Layered Swarm Optimization Method for Fitting Battery Thermal Runaway Models to Accelerating Rate Calorimetry Data | Saakaar Bhatnagar et.al. | 2412.16367 | null |
2024-12-20 | Toward Robust Neural Reconstruction from Sparse Point Sets | Amine Ouasfi et.al. | 2412.16361 | null |
2024-12-20 | XR for All: Understanding Developer Perspectives on Accessibility Integration in Extended Reality | Daniel Killough et.al. | 2412.16321 | null |
2024-12-20 | Atomic resolution imaging of 3D crystallography in functional oxide thin films | Ian MacLaren et.al. | 2412.16297 | null |
2024-12-20 | Electrodynamics and dissipation in the binary magnetosphere of pre-merger neutron stars | Jens F. Mahlmann et.al. | 2412.16280 | null |
2024-12-20 | MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design | Jingyuan Qi et.al. | 2412.16270 | null |
2024-12-20 | Interactive Scene Authoring with Specialized Generative Primitives | Clément Jambon et.al. | 2412.16253 | null |
2024-12-18 | TopView: Vectorising road users in a bird’s eye view from uncalibrated street-level imagery with deep learning | Mohamed R Ibrahim et.al. | 2412.16229 | null |
2024-12-18 | Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Network | Ziqing Wang et.al. | 2412.16219 | link |
2024-12-18 | AdvIRL: Reinforcement Learning-Based Adversarial Attacks on 3D NeRF Models | Tommy Nguyen et.al. | 2412.16213 | link |
2024-12-18 | ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping | Youxin Pang et.al. | 2412.16212 | null |
2024-12-16 | Robust Spectral Anomaly Detection in EELS Spectral Images via Three Dimensional Convolutional Variational Autoencoders | Seyfal Sultanov et.al. | 2412.16200 | link |
2024-12-20 | Can Generative Video Models Help Pose Estimation? | Ruojin Cai et.al. | 2412.16155 | null |
2024-12-20 | On the Impact of 3D Visualization of Repository Metrics in Software Engineering Education | Dario Di Dario et.al. | 2412.16061 | null |
2024-12-20 | Full Parity-Violating Trispectrum in Axion Inflation: Reduction to Low-D Integrals | Matthew Reinhard et.al. | 2412.16037 | null |
2024-12-20 | CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images | Jungho Lee et.al. | 2412.16028 | null |
2024-12-20 | Fuzzy-Space Engineering | Paul Schreivogl et.al. | 2412.16011 | null |
2024-12-20 | Adding interferometric lightning detection to the Pierre Auger Observatory | Melanie Joan Weitz et.al. | 2412.15972 | null |
2024-12-20 | Investigating the Interplay between Spin-Polarization and Magnetic Damping in $\mathrm{Co}{x}\mathrm{Fe}{80-x}\mathrm{B}_{20}$ for Magnonics Applications | Lorenzo Gnoatto et.al. | 2412.15954 | null |
2024-12-20 | Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation | Gautier Evennou et.al. | 2412.15939 | link |
2024-12-20 | Propagation of untwisting solar jets from the low-beta corona into the super-Alfvénic wind: Testing a solar origin scenario for switchbacks | Jade Touresse et.al. | 2412.15930 | null |
2024-12-20 | Immersive In Situ Visualizations for Monitoring Architectural-Scale Multiuser MR Experiences | Zhongyuan Yu et.al. | 2412.15918 | null |
2024-12-20 | CCNDF: Curvature Constrained Neural Distance Fields from 3D LiDAR Sequences | Akshit Singh et.al. | 2412.15909 | null |
2024-12-20 | A Digital Phantom for 3D MR Spectroscopy Data Simulation | D. M. J. van de Sande et.al. | 2412.15869 | null |
2024-12-20 | 3D non-linear non-adiabatic MHD simulations of core density collapse event in LHD plasma | A. Civit et.al. | 2412.15823 | null |
2024-12-20 | Bi-directional Mapping of Morphology Metrics and 3D City Blocks for Enhanced Characterization and Generation of Urban Form | Chenyi Cai et.al. | 2412.15801 | null |
2024-12-20 | Sparse Point Clouds Assisted Learned Image Compression | Yiheng Jiang et.al. | 2412.15752 | null |
2024-12-20 | Conformal approach to physics simulations for thin curved 3D membranes | Igor Bogush et.al. | 2412.15741 | link |
2024-12-20 | SCENIC: Scene-aware Semantic Navigation with Instruction-guided Control | Xiaohan Zhang et.al. | 2412.15664 | null |
2024-12-20 | Unified real-space construction scheme for flat bands based on symmetric compact localized states | Rui-Heng Liu et.al. | 2412.15653 | null |
2024-12-24 | 3D Shape Tokenization | Jen-Hao Rick Chang et.al. | 2412.15618 | null |
2024-12-20 | AvatarPerfect: User-Assisted 3D Gaussian Splatting Avatar Refinement with Automatic Pose Suggestion | Jotaro Sakamiya et.al. | 2412.15609 | null |
2024-12-20 | Resonant Beam Enabled Passive 3D Positioning | Yixuan Guo et.al. | 2412.15596 | null |
2024-12-20 | The Analytic Arc Cover Problem and its Applications to Contiguous Art Gallery, Polygon Separation, and Shape Carving | Eliot W. Robson et.al. | 2412.15567 | null |
2024-12-20 | Robust and Feature-Preserving Offset Meshing | Hongyi Cao et.al. | 2412.15564 | null |
2024-12-20 | EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene | Yixiong Huo et.al. | 2412.15550 | link |
2024-12-20 | GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators | Hengjia Li et.al. | 2412.15491 | null |
2024-12-26 | LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction | Pou-Chun Kung et.al. | 2412.15447 | null |
2024-12-19 | Efficient Neural Network Encoding for 3D Color Lookup Tables | Vahid Zehtab et.al. | 2412.15438 | link |
2024-12-19 | Long-range exchange coupling in a magnetic multilayer system | L. O. Souza et.al. | 2412.15432 | null |
2024-12-19 | Ground Motion Characteristics of Cascading Earthquakes in a Multiscale Fracture Network | Kadek Hendrawan Palgunadi et.al. | 2412.15416 | null |
2024-12-18 | DreaMark: Rooting Watermark in Score Distillation Sampling Generated Neural Radiance Fields | Xingyu Zhu et.al. | 2412.15278 | null |
2024-12-19 | EnvGS: Modeling View-Dependent Appearance with Environment Gaussian | Tao Xie et.al. | 2412.15215 | link |
2024-12-19 | LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Hanlin Wang et.al. | 2412.15214 | link |
2024-12-19 | Scaling 4D Representations | João Carreira et.al. | 2412.15212 | null |
2024-12-19 | Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation | Hadi Alzayer et.al. | 2412.15211 | null |
2024-12-19 | A Pathway to Decay and Fission of Orthosymplectic Quiver Theories | Craig Lawrie et.al. | 2412.15202 | null |
2024-12-19 | DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation | Wang Zhao et.al. | 2412.15200 | null |
2024-12-21 | SqueezeMe: Efficient Gaussian Avatars for VR | Shunsuke Saito et.al. | 2412.15171 | null |
2024-12-21 | Composite Dark Energy and the Cosmological Tensions | Adria Gómez-Valent et.al. | 2412.15124 | null |
2024-12-19 | Revisiting the Classics: On the Optical Colours of Novae as Standard Crayons | Peter Craig et.al. | 2412.15108 | null |
2024-12-19 | Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation | Haoran Liu et.al. | 2412.15086 | null |
2024-12-19 | A linear regression model for quantile function data applied to paired pulmonary 3d CT scans | Marie-Félicia Béclin et.al. | 2412.15049 | null |
2024-12-19 | Noise Analysis and Modeling of the PMD Flexx2 Depth Camera for Robotic Applications | Yuke Cai et.al. | 2412.15040 | null |
2024-12-19 | MitraClip Device Automated Localization in 3D Transesophageal Echocardiography via Deep Learning | Riccardo Munafò et.al. | 2412.15013 | null |
2024-12-19 | Spin-down of solar-mass protostars in magnetospheric accretion paradigm | Shinsuke Takasao et.al. | 2412.14981 | null |
2024-12-19 | Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations | Jianhua Sun et.al. | 2412.14974 | null |
2024-12-19 | Reciprocity-aware adaptive tile low-rank factorization for large-scale 3D multidimensional deconvolution | Fuqiang Chen et.al. | 2412.14973 | null |
2024-12-19 | IDOL: Instant Photorealistic 3D Human Creation from a Single Image | Yiyu Zhuang et.al. | 2412.14963 | null |
2024-12-19 | ThinCurr: An open-source 3D thin-wall eddy current modeling code for the analysis of large-scale systems of conducting structures | Christopher Hansen et.al. | 2412.14962 | link |
2024-12-20 | GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction | Zesong Yang et.al. | 2412.14939 | null |
2024-12-24 | Inversion of Dislocation-Impurity Interactions in $α$ -Fe under Magnetic State Changes | Franco Moitzi et.al. | 2412.14920 | null |
2024-12-19 | Linking disks, spinning vortices and exponential networks of augmentation curves | Kunal Gupta et.al. | 2412.14901 | null |
2024-12-19 | Diffusion priors for Bayesian 3D reconstruction from incomplete measurements | Julian L. Möbius et.al. | 2412.14897 | null |
2024-12-27 | Zero-Shot Artifact2Artifact: Self-incentive artifact removal for photoacoustic imaging without any data | Shuang Li et.al. | 2412.14873 | link |
2024-12-20 | Multiplexed Readout of Superconducting Qubits Using a 3D Re-entrant Cavity Filter | Mustafa Bakr et.al. | 2412.14853 | null |
2024-12-19 | ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects | Qihang Cao et.al. | 2412.14837 | null |
2024-12-19 | Single-mode laser guiding in non-parabolic plasma channels for high-energy electron acceleration | Zsolt Lécz et.al. | 2412.14785 | null |
2024-12-19 | AI-Enabled Rapid Assembly of Thousands of Defect-Free Neutral Atom Arrays with Constant-time-overhead | Rui Lin et.al. | 2412.14647 | null |
2024-12-19 | GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting | Qianpu Sun et.al. | 2412.14579 | null |
2024-12-19 | SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection | Ruoyu Xu et.al. | 2412.14571 | null |
2024-12-19 | Improving Geometry in Sparse-View 3DGS via Reprojection-based DoF Separation | Yongsung Kim et.al. | 2412.14568 | null |
2024-12-19 | Bright-NeRF:Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images | Min Wang et.al. | 2412.14547 | null |
2024-12-19 | Fractionalization of flux tubes in 3d and screening by emergent electric charges in 2d | Mendel Nguyen et.al. | 2412.14532 | null |
2024-12-19 | Drive-1-to-3: Enriching Diffusion Priors for Novel View Synthesis of Real Vehicles | Chuang Lin et.al. | 2412.14494 | null |
2024-12-19 | GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering | Saumya Saxena et.al. | 2412.14480 | null |
2024-12-19 | LiftRefine: Progressively Refined View Synthesis from 3D Lifting with Volume-Triplane Representations | Tung Do et.al. | 2412.14464 | null |
2024-12-19 | Color Enhancement for V-PCC Compressed Point Cloud via 2D Attribute Map Optimization | Jingwei Bao et.al. | 2412.14449 | null |
2024-12-19 | GenHMR: Generative Human Mesh Recovery | Muhammad Usama Saleem et.al. | 2412.14444 | null |
2024-12-19 | An Immersive Multi-Elevation Multi-Seasonal Dataset for 3D Reconstruction and Visualization | Xijun Liu et.al. | 2412.14418 | null |
2024-12-20 | SEREP: Semantic Facial Expression Representation for Robust In-the-Wild Capture and Retargeting | Arthur Josi et.al. | 2412.14371 | null |
2024-12-18 | Influence of Magnetic Anisotropy on the Ground State of [CH $_3$NH$_3$]Fe(HCOO)$_3$ : Insights into the Improper Modulated Magnetic Structure | Laura Cañadillas-Delgado et.al. | 2412.14365 | null |
2024-12-18 | The Casimir effect in wetting layers | Alessio Squarcini et.al. | 2412.14334 | null |
2024-12-18 | 3D Supergravity In the Batalin–Vilkovisky Formalism | Alberto S. Cattaneo et.al. | 2412.14300 | null |
2024-12-18 | GraphicsDreamer: Image to 3D Generation with Physical Consistency | Pei Chen et.al. | 2412.14214 | null |
2024-12-18 | MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data | Hanwen Jiang et.al. | 2412.14166 | null |
2024-12-18 | MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation | Shenhao Zhu et.al. | 2412.14148 | null |
2024-12-18 | Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation | Rémi Marsal et.al. | 2412.14103 | null |
2024-12-18 | Higher-derivative corrections in M-theory from precision numerical bootstrap | Shai M. Chester et.al. | 2412.14094 | null |
2024-12-18 | Sphere free energy of scalar field theories with cubic interactions | Simone Giombi et.al. | 2412.14086 | null |
2024-12-18 | Fractional Skyrmion Tubes in Chiral-Interfaced Three-Dimensional Magnetic Nanowires | John Fullerton et.al. | 2412.14069 | null |
2024-12-18 | CAD-Recode: Reverse Engineering CAD Code from Point Clouds | Danila Rukhovich et.al. | 2412.14042 | link |
2024-12-18 | Generalization of 3D-NSE Global Weak Solution with damping | Mustapha Amara et.al. | 2412.14040 | null |
2024-12-18 | Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation | Haotong Lin et.al. | 2412.14015 | link |
2024-12-18 | Turbulent solutions of the binormal flow and the 1D cubic Schrödinger equation | Valeria Banica et.al. | 2412.14013 | null |
2024-12-18 | Components and anisotropy of 3D QFP waves during the early solar eruption | Jialiang Hu et.al. | 2412.13984 | null |
2024-12-18 | GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians | Xiaobao Wei et.al. | 2412.13983 | link |
2024-12-18 | Reactor-scale stellarators with force and torque minimized dipole coils | Alan A. Kaptanoglu et.al. | 2412.13937 | null |
2024-12-18 | Memorizing SAM: 3D Medical Segment Anything Model with Memorizing Transformer | Xinyuan Shao et.al. | 2412.13908 | link |
2024-12-18 | Data-Efficient Inference of Neural Fluid Fields via SciML Foundation Model | Yuqiu Liu et.al. | 2412.13897 | null |
2024-12-18 | UA-MPC: Uncertainty-Aware Model Predictive Control for Motorized LiDAR Odometry | Jianping Li et.al. | 2412.13873 | link |
2024-12-18 | MobiFuse: A High-Precision On-device Depth Perception System with Multi-Data Fusion | Jinrui Zhang et.al. | 2412.13848 | null |
2024-12-18 | Spatial Brain Tumor Concentration Estimation for Individualized Radiotherapy Planning | Jonas Weidner et.al. | 2412.13811 | link |
2024-12-18 | An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training | Haiming Zhang et.al. | 2412.13772 | null |
2024-12-18 | Three-dimensional real space renormalization group with well-controlled approximations | Xinliang Lyu et.al. | 2412.13758 | link |
2024-12-18 | Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation | Sait Akturk et.al. | 2412.13752 | null |
2024-12-18 | Multi-Exposure Image Fusion via Distilled 3D LUT Grid with Editable Mode | Xin Su et.al. | 2412.13749 | link |
2024-12-19 | 3D Registration in 30 Years: A Survey | Jiaqi Yang et.al. | 2412.13735 | link |
2024-12-18 | GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting | Yuning Peng et.al. | 2412.13654 | null |
2024-12-18 | RelationField: Relate Anything in Radiance Fields | Sebastian Koch et.al. | 2412.13652 | link |
2024-12-18 | Learning to Control an Android Robot Head for Facial Animation | Marcel Heisler et.al. | 2412.13641 | null |
2024-12-18 | 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching | Fernando Amodeo et.al. | 2412.13639 | link |
2024-12-19 | Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production | Shengeng Tang et.al. | 2412.13609 | link |
2024-12-18 | Read Like a Radiologist: Efficient Vision-Language Model for 3D Medical Imaging Interpretation | Changsun Lee et.al. | 2412.13558 | null |
2024-12-18 | DragScene: Interactive 3D Scene Editing with Single-view Drag Instructions | Chenghao Gu et.al. | 2412.13552 | null |
2024-12-18 | Turbo-GS: Accelerating 3D Gaussian Fitting for High-Quality Radiance Fields | Tao Lu et.al. | 2412.13547 | null |
2024-12-18 | Vivar: A Generative AR System for Intuitive Multi-Modal Sensor Data Presentation | Yunqi Guo et.al. | 2412.13509 | null |
2024-12-18 | Level-Set Parameters: Novel Representation for 3D Shape Analysis | Huan Lei et.al. | 2412.13502 | null |
2024-12-18 | Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection | Hanzhe Liang et.al. | 2412.13461 | link |
2024-12-18 | Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation | Xiaoqi An et.al. | 2412.13454 | link |
2024-12-18 | MMHMR: Generative Masked Modeling for Hand Mesh Recovery | Muhammad Usama Saleem et.al. | 2412.13393 | null |
2024-12-17 | Targeted View-Invariant Adversarial Perturbations for 3D Object Recognition | Christian Green et.al. | 2412.13376 | null |
2024-12-17 | Searching for a Signature of Turnaround in Galaxy Clusters with Convolutional Neural Networks | Nikolaos Triantafyllou et.al. | 2412.13304 | null |
2024-12-17 | The clus model in SPEX: projection and resonant scattering effects on the iron abundance and temperature profiles of galaxy clusters | Lýdia Štofanová et.al. | 2412.13252 | null |
2024-12-17 | iRBSM: A Deep Implicit 3D Breast Shape Model | Maximilian Weiherer et.al. | 2412.13244 | null |
2024-12-19 | Scattering theory for the defocusing 3d NLS in the exterior of a strictly convex obstacle | Xuan Liu et.al. | 2412.13215 | null |
2024-12-17 | GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Haoyi Jiang et.al. | 2412.13193 | link |
2024-12-17 | NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment | Andrea Dunn Beltran et.al. | 2412.13176 | null |
2024-12-17 | Flight Patterns for Swarms of Drones | Shuqin Zhu et.al. | 2412.13119 | null |
2024-12-17 | Motion-2-to-3: Leveraging 2D Motion Data to Boost 3D Motion Generation | Huaijin Pi et.al. | 2412.13111 | null |
2024-12-17 | 3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation | Haoshen Wang et.al. | 2412.13059 | null |
2024-12-17 | CondiMen: Conditional Multi-Person Mesh Recovery | Brégier Romain et.al. | 2412.13058 | null |
2024-12-17 | EOGS: Gaussian Splatting for Earth Observation | Luca Savant Aira et.al. | 2412.13047 | null |
2024-12-24 | TAME: Temporal Audio-based Mamba for Enhanced Drone Trajectory Estimation and Classification | Zhenyuan Xiao et.al. | 2412.13037 | link |
2024-12-17 | A New Adversarial Perspective for LiDAR-based 3D Object Detection | Shijun Zheng et.al. | 2412.13017 | null |
2024-12-17 | Design, fabrication and initial test of a novel 3D-Trench sensor utilizing 8-inch CMOS compatible technology | Manwen Liu et.al. | 2412.13016 | null |
2024-12-17 | The IBEX Imaging Knowledge-Base: A Community Resource Enabling Adoption and Development of Immunofluoresence Imaging Methods | Ziv Yaniv et.al. | 2412.12965 | null |
2024-12-17 | A Conceptual Model of Intelligent Multimedia Data Rendered using Flying Light Specks | Nima Yazdani et.al. | 2412.12938 | null |
2024-12-17 | Generation of cosmic ray trajectories by a Diffusion Model trained on test particles in 3D magnetohydrodynamic turbulence | Johannes Martin et.al. | 2412.12923 | null |
2024-12-17 | 4DRGS: 4D Radiative Gaussian Splatting for Efficient 3D Vessel Reconstruction from Sparse-View Dynamic DSA Images | Zhentao Liu et.al. | 2412.12919 | link |
2024-12-17 | CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image | Wonseok Roh et.al. | 2412.12906 | null |
2024-12-17 | 3D Free-Form Optical Lens – Miniaturised Fibre Couplers for Astrophotonics | Haoran Mu et.al. | 2412.12896 | null |
2024-12-17 | A 3D-1D Virtual Element Method for Modeling Root Water Uptake | Stefano Berrone et.al. | 2412.12884 | null |
2024-12-18 | Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera | Zhengdi Yu et.al. | 2412.12861 | null |
2024-12-17 | HyperGS: Hyperspectral 3D Gaussian Splatting | Christopher Thirgood et.al. | 2412.12849 | null |
2024-12-17 | Confirmation of the planetary nebula nature of HaTr 5. Not the remnant of Nova Sco 1437 | M. A. Guerrero et.al. | 2412.12813 | null |
2024-12-17 | RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection | Yiheng Li et.al. | 2412.12799 | link |
2024-12-17 | Towards a Training Free Approach for 3D Scene Editing | Vivek Madhavaram et.al. | 2412.12766 | null |
2024-12-17 | Gaussian Billboards: Expressive 2D Gaussian Splatting with Textures | Sebastian Weiss et.al. | 2412.12734 | null |
2024-12-17 | Thin film flow over a spinning disc: Experiments and direct numerical simulations | Jason Stafford et.al. | 2412.12730 | null |
2024-12-17 | RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion | Xiaomeng Chu et.al. | 2412.12725 | null |
2024-12-24 | Unsupervised UAV 3D Trajectories Estimation with Sparse Point Clouds | Hanfang Liang et.al. | 2412.12716 | link |
2024-12-24 | Audio Array-Based 3D UAV Trajectory Estimation with LiDAR Pseudo-Labeling | Allen Lei et.al. | 2412.12698 | link |
2024-12-17 | SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing | Chen Chen et.al. | 2412.12685 | null |
2024-12-17 | Improving the Transferability of 3D Point Cloud Attack via Spectral-aware Admix and Optimization Designs | Shiyu Hu et.al. | 2412.12626 | null |
2024-12-17 | PO3AD: Predicting Point Offsets toward Better 3D Point Cloud Anomaly Detection | Jianan Ye et.al. | 2412.12617 | null |
2024-12-20 | Understanding Emotional Body Expressions via Large Language Models | Haifeng Lu et.al. | 2412.12581 | null |
2024-12-18 | A statistical approach for interpreting polarized dust emission of the filamentary molecular clouds toward the estimate of 3D magnetic field structure | Haruka Fukihara et.al. | 2412.12545 | null |
2024-12-17 | Stiefel Flow Matching for Moment-Constrained Structure Elucidation | Austin Cheng et.al. | 2412.12540 | null |
2024-12-17 | Multiparty Entanglement Microscopy of Quantum Ising models in 1d, 2d and 3d | Liuke Lyu et.al. | 2412.12533 | null |
2024-12-17 | 3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian Splatting | Qi Wu et.al. | 2412.12507 | link |
2024-12-17 | Echo: Simulating Distributed Training At Scale | Yicheng Feng et.al. | 2412.12487 | null |
2024-12-17 | PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts | Kun Guo et.al. | 2412.12460 | null |
2024-12-17 | Swarm Intelligence in Collision-free Formation Control for Multi-UAV Systems with 3D Obstacle Avoidance Maneuvers | Reza Ahmadvand et.al. | 2412.12437 | null |
2024-12-16 | Abstract 3D-rotation groups and recognition of icosahedral modules | Lauren McEnerney et.al. | 2412.12411 | null |
2024-12-16 | MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors | Riku Murai et.al. | 2412.12392 | null |
2024-12-19 | A flexible framework for large-scale FDTD simulations: open-source inverse design for 3D nanostructures | Yannik Mahlau et.al. | 2412.12360 | link |
2024-12-16 | Expanded Comprehensive Robotic Cholecystectomy Dataset (CRCD) | Ki-Hwan Oh et.al. | 2412.12238 | link |
2024-12-16 | SitPose: Real-Time Detection of Sitting Posture and Sedentary Behavior Using Ensemble Learning With Depth Sensor | Hang Jin et.al. | 2412.12216 | link |
2024-12-15 | AI-Driven Innovations in Volumetric Video Streaming: A Review | Erfan Entezami et.al. | 2412.12208 | null |
2024-12-13 | Accessing thermonuclear detonation with the shock front induced by the alpha particle deposition | Bohan Shen et.al. | 2412.12181 | null |
2024-12-16 | PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Cheng Zhang et.al. | 2412.12096 | link |
2024-12-16 | CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models | Felix Taubner et.al. | 2412.12093 | null |
2024-12-16 | Wonderland: Navigating 3D Scenes from a Single Image | Hanwen Liang et.al. | 2412.12091 | null |
2024-12-16 | IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations | Zhibing Li et.al. | 2412.12083 | null |
2024-12-16 | QG from SymQRG: AdS $_3$/CFT$_2$ Correspondence as Topological Symmetry-Preserving Quantum RG Flow | Ning Bao et.al. | 2412.12045 | null |
2024-12-16 | Deep-learning-based identification of individual motion characteristics from upper-limb trajectories towards disorder stage evaluation | Tim Sziburis et.al. | 2412.12016 | null |
2024-12-16 | Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data | Onur Tasar et.al. | 2412.11972 | null |
2024-12-17 | From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach | Xilin Wang et.al. | 2412.11892 | null |
2024-12-16 | Ensemble Learning and 3D Pix2Pix for Comprehensive Brain Tumor Analysis in Multimodal MRI | Ramy A. Zeineldin et.al. | 2412.11849 | null |
2024-12-17 | On orthogonality sampling method for Maxwell’s equations and its applications to experimental data | Thu Le et.al. | 2412.11825 | null |
2024-12-16 | Efficient LiDAR Bundle Adjustment for Multi-Scan Alignment Utilizing Continuous-Time Trajectories | Louis Wiesmann et.al. | 2412.11760 | null |
2024-12-16 | Deformable Radial Kernel Splatting | Yi-Hua Huang et.al. | 2412.11752 | null |
2024-12-16 | EGP3D: Edge-guided Geometric Preserving 3D Point Cloud Super-resolution for RGB-D camera | Zheng Fang et.al. | 2412.11680 | null |
2024-12-16 | 3D $^2$ -Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling | Zichen Tang et.al. | 2412.11599 | link |
2024-12-16 | MeshArt: Generating Articulated Meshes with Structure-guided Transformers | Daoyi Gao et.al. | 2412.11596 | null |
2024-12-19 | StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors | Xiaokun Sun et.al. | 2412.11586 | link |
2024-12-16 | SweepEvGS: Event-Based 3D Gaussian Splatting for Macro and Micro Radiance Field Rendering from a Single Sweep | Jingqian Wu et.al. | 2412.11579 | null |
2024-12-16 | SP $^2$ T: Sparse Proxy Attention for Dual-stream Point Transformer | Jiaxu Wan et.al. | 2412.11540 | link |
2024-12-19 | RoMeO: Robust Metric Visual Odometry | Junda Cheng et.al. | 2412.11530 | null |
2024-12-21 | Sequence Matters: Harnessing Video Models in 3D Super-Resolution | Hyun-kyu Ko et.al. | 2412.11525 | null |
2024-12-16 | EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting | Dong In Lee et.al. | 2412.11520 | null |
2024-12-16 | LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model | Xi Wang et.al. | 2412.11519 | null |
2024-12-16 | HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection | Zijian Gu et.al. | 2412.11489 | link |
2024-12-16 | HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation | Sucheng Ren et.al. | 2412.11458 | null |
2024-12-16 | MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes | Ruijie Lu et.al. | 2412.11457 | null |
2024-12-16 | View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection | Qi Zhang et.al. | 2412.11428 | link |
2024-12-16 | Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion | Adam Bethell et.al. | 2412.11420 | null |
2024-12-16 | V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations | Jin-Cheng Jhang et.al. | 2412.11412 | null |
2024-12-16 | A Rb-Cs dual-species magneto-optical trap | Shi-Yao Shao et.al. | 2412.11411 | null |
2024-12-16 | An Enhanced Classification Method Based on Adaptive Multi-Scale Fusion for Long-tailed Multispectral Point Clouds | TianZhu Liu et.al. | 2412.11407 | null |
2024-12-16 | Wilson Loop and Topological Properties in 3D Woodpile Photonic Crystal | Huyen Thanh Phan et.al. | 2412.11353 | null |
2024-12-15 | Sonicmesh: Enhancing 3D Human Mesh Reconstruction in Vision-Impaired Environments With Acoustic Signals | Xiaoxuan Liang et.al. | 2412.11325 | null |
2024-12-15 | VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping | Hao Shao et.al. | 2412.11279 | null |
2024-12-15 | GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs | Xinli Xu et.al. | 2412.11258 | null |
2024-12-15 | Volumetric Mapping with Panoptic Refinement via Kernel Density Estimation for Mobile Robots | Khang Nguyen et.al. | 2412.11241 | link |
2024-12-15 | GenLit: Reformulating Single-Image Relighting as Video Generation | Shrisha Bharadwaj et.al. | 2412.11224 | null |
2024-12-15 | ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction | Yi Feng et.al. | 2412.11210 | link |
2024-12-15 | OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation | Bohan Li et.al. | 2412.11183 | null |
2024-12-15 | Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation | Yujie Zhang et.al. | 2412.11170 | null |
2024-12-15 | UAV-Enabled Passive 6D Movable Antennas: Joint Deployment and Beamforming Optimization | Changhao Liu et.al. | 2412.11150 | null |
2024-12-15 | An Onsager-type Theorem for General 2D Active Scalar Equations | Xuanxuan Zhao et.al. | 2412.11094 | null |
2024-12-15 | EquiFlow: Equivariant Conditional Flow Matching with Optimal Transport for 3D Molecular Conformation Prediction | Qingwen Tian et.al. | 2412.11082 | null |
2024-12-18 | CFSynthesis: Controllable and Free-view 3D Human Video Synthesis | Liyuan Cui et.al. | 2412.11067 | null |
2024-12-15 | Electromagnetic Interactions of Massive Higher-Spin Fields in 3D via Chiral Theory | Alexey Sharapov et.al. | 2412.11052 | null |
2024-12-15 | Facial Surgery Preview Based on the Orthognathic Treatment Prediction | Huijun Han et.al. | 2412.11045 | null |
2024-12-15 | AURORA: Automated Unleash of 3D Room Outlines for VR Applications | Huijun Han et.al. | 2412.11033 | null |
2024-12-15 | Stability of the Couette flow for 3D Navier-Stokes equations with rotation | Wenting Huang et.al. | 2412.11005 | null |
2024-12-14 | DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting | Luis Wiedmann et.al. | 2412.10972 | link |
2024-12-18 | A Staged Deep Learning Approach to Spatial Refinement in 3D Temporal Atmospheric Transport | M. Giselle Fernández-Godino et.al. | 2412.10945 | null |
2024-12-14 | Thermofluidic non-equilibrium assembly of reconfigurable functional structures | Desmond J. Quinn et.al. | 2412.10928 | null |
2024-12-14 | Matrix-free implementation of the non-nested multigrid method | Marco Feder et.al. | 2412.10910 | link |
2024-12-17 | Do large language vision models understand 3D shapes? | Sagi Eppel et.al. | 2412.10908 | link |
2024-12-14 | Symmetries of a 3D Field-Theoretic Model | R. Kumar et.al. | 2412.10852 | null |
2024-12-14 | Error Estimates for Discontinuous Galerkin Approximations to the Vlasov-Unsteady Stokes System | Harsha Hutridurga et.al. | 2412.10828 | null |
2024-12-14 | DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions | Jingyu Zhang et.al. | 2412.10739 | link |
2024-12-14 | 6D Movable Antenna Enhanced Multi-Access Point Coordination via Position and Orientation Optimization | Xiangyu Pi et.al. | 2412.10736 | null |
2024-12-14 | OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving | Lianqing Zheng et.al. | 2412.10734 | null |
2024-12-14 | On concentrated vortices of 3D incompressible Euler equations under helical symmetry: with swirl | Guolin Qin et.al. | 2412.10725 | null |
2024-12-17 | GridShow: Omni Visual Generation | Cong Wan et.al. | 2412.10718 | link |
2024-12-17 | Virtual Trial Room with Computer Vision and Machine Learning | Tulashi Prasad Joshi et.al. | 2412.10710 | null |
2024-12-14 | Magnetic flux emergence and solar eruptions in partially ionized plasmas | Georgios Chouliaras et.al. | 2412.10633 | null |
2024-12-13 | Classification of ancient noncollapsed flows in $\mathbb{R}^4$ | Kyeongsu Choi et.al. | 2412.10581 | null |
2024-12-13 | EVLM: Self-Reflective Multimodal Reasoning for Cross-Dimensional Visual Editing | Umar Khalid et.al. | 2412.10566 | null |
2024-12-13 | Predictive Pattern Recognition Techniques Towards Spatiotemporal Representation of Plant Growth in Simulated and Controlled Environments: A Comprehensive Review | Mohamed Debbagh et.al. | 2412.10538 | null |
2024-12-13 | The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion | Changan Chen et.al. | 2412.10523 | null |
2024-12-13 | Fast 3D Partial Boundary Data EIT Reconstructions using Direct Inversion CGO-based Methods | Sarah J. Hamilton et.al. | 2412.10520 | null |
2024-12-12 | Automatic Detection, Positioning and Counting of Grape Bunches Using Robots | Xumin Gao et.al. | 2412.10464 | link |
2024-12-11 | Boundary Exploration of Next Best View Policy in 3D Robotic Scanning | Leihui Li et.al. | 2412.10444 | link |
2024-12-11 | Novel 3D Binary Indexed Tree for Volume Computation of 3D Reconstructed Models from Volumetric Data | Quoc-Bao Nguyen-Le et.al. | 2412.10441 | null |
2024-12-11 | Implicit Neural Compression of Point Clouds | Hongning Ruan et.al. | 2412.10433 | null |
2024-12-11 | CUPS: Improving Human Pose-Shape Estimators with Conformalized Deep Uncertainty | Harry Zhang et.al. | 2412.10431 | null |
2024-12-11 | Unsupervised Cross-Domain Regression for Fine-grained 3D Game Character Reconstruction | Qi Wen et.al. | 2412.10430 | null |
2024-12-08 | Exact solution of the three-dimensional (3D) Z2 lattice gauge theory | Zhidong Zhang et.al. | 2412.10412 | null |
2024-12-13 | GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction | Sicheng Zuo et.al. | 2412.10373 | link |
2024-12-13 | GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Wenzhao Zheng et.al. | 2412.10371 | link |
2024-12-13 | TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes | Yan Xia et.al. | 2412.10308 | null |
2024-12-13 | Coherent 3D Scene Diffusion From a Single RGB Image | Manuel Dahnert et.al. | 2412.10294 | null |
2024-12-13 | Probabilistic Inverse Cameras: Image to 3D via Multiview Geometry | Rishabh Kabra et.al. | 2412.10273 | null |
2024-12-13 | SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians | Siyun Liang et.al. | 2412.10231 | null |
2024-12-13 | GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion | Jiapeng Tang et.al. | 2412.10209 | null |
2024-12-16 | Beam test results of a fully 3D-printed plastic scintillator particle detector prototype | Boato Li et.al. | 2412.10174 | null |
2024-12-13 | 3D projection analysis: characterizing the morphological stability of nearby open clusters | Qingshun Hu et.al. | 2412.10158 | null |
2024-12-13 | Interface instability of two-phase flow in a three-dimensional porous medium | Joachim Falck Brodin et.al. | 2412.10127 | null |
2024-12-13 | An $O(N)$ Algorithm for Solving the Smallest Enclosing Sphere Problem in the Presence of Degeneracies | Netzer Moriya et.al. | 2412.10120 | null |
2024-12-13 | ProbeSDF: Light Field Probes for Neural Surface Reconstruction | Briac Toussaint et.al. | 2412.10084 | null |
2024-12-13 | Toy-GS: Assembling Local Gaussians for Precisely Rendering Large-Scale Free Camera Trajectories | Xiaohan Zhang et.al. | 2412.10078 | null |
2024-12-13 | Three-dimensional tearing instability of flux-tube-like magnetic fields | Vinay Kumar et.al. | 2412.10065 | null |
2024-12-13 | TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views | Liang Zhao et.al. | 2412.10051 | link |
2024-12-13 | NeRF-Texture: Synthesizing Neural Radiance Field Textures | Yi-Hua Huang et.al. | 2412.10004 | null |
2024-12-13 | GT23D-Bench: A Comprehensive General Text-to-3D Generation Benchmark | Sitong Su et.al. | 2412.09997 | null |
2024-12-13 | On the Geometry of the Near-Core Magnetic Field in Massive Stars | Rathish P. Ratnasingam et.al. | 2412.09986 | null |
2024-12-18 | SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video | Jongmin Park et.al. | 2412.09982 | null |
2024-12-13 | Generating 3D Pseudo-Healthy Knee MR Images to Support Trochleoplasty Planning | Michael Wehrli et.al. | 2412.09962 | link |
2024-12-13 | Low Mach Number Limit of Non-isentropic Inviscid Elastodynamics with General Initial Data | Jiawei Wang et.al. | 2412.09941 | null |
2024-12-13 | RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting | Lizhi Bai et.al. | 2412.09868 | null |
2024-12-13 | A 3D lattice defect and efficient computations in topological MBQC | Gabrielle Tournaire et.al. | 2412.09781 | null |
2024-12-13 | waveOrder: generalist framework for label-agnostic computational microscopy | Talon Chandler et.al. | 2412.09775 | link |
2024-12-16 | Synthetic multi-dimensional Aharonov-Bohm cages in Fock state lattices | Jiajian Zhang et.al. | 2412.09766 | null |
2024-12-12 | Low-cost mobile 3D scanning of heritage objects to facilitate long-distance research collaboration | Dirk HR Spennemann et.al. | 2412.09749 | null |
2024-12-12 | MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction | Xiaohao Xu et.al. | 2412.09723 | link |
2024-12-12 | PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields | Sean Wu et.al. | 2412.09680 | link |
2024-12-11 | Pole-based Vehicle Localization with Vector Maps: A Camera-LiDAR Comparative Study | Maxime Noizet et.al. | 2412.09649 | null |
2024-12-11 | DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models | Kevin Miao et.al. | 2412.09648 | null |
2024-12-11 | Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model | Junqi You et.al. | 2412.09647 | null |
2024-12-12 | Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors | Yue Feng et.al. | 2412.09625 | null |
2024-12-18 | GenEx: Generating an Explorable World | Taiming Lu et.al. | 2412.09624 | null |
2024-12-12 | Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos | Linyi Jin et.al. | 2412.09621 | null |
2024-12-12 | Learning Camera Movement Control from Real-World Drone Videos | Yunzhong Hou et.al. | 2412.09620 | null |
2024-12-12 | NormalFlow: Fast, Robust, and Accurate Contact-based Object 6DoF Pose Tracking with Vision-based Tactile Sensors | Hung-Jui Huang et.al. | 2412.09617 | link |
2024-12-13 | Olympus: A Universal Task Router for Computer Vision Tasks | Yuanze Lin et.al. | 2412.09612 | link |
2024-12-12 | Feat2GS: Probing Visual Foundation Models with Gaussian Splatting | Yue Chen et.al. | 2412.09606 | null |
2024-12-18 | RatBodyFormer: Rodent Body Surface from Keypoints | Ayaka Higami et.al. | 2412.09599 | null |
2024-12-12 | LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors | Yabo Chen et.al. | 2412.09597 | null |
2024-12-12 | FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction | Jiale Xu et.al. | 2412.09573 | null |
2024-12-12 | Meshtron: High-Fidelity, Artist-Like 3D Mesh Generation at Scale | Zekun Hao et.al. | 2412.09548 | null |
2024-12-12 | Nonlinear evolution of fluting oscillations in coronal flux tubes | Roberto Soler et.al. | 2412.09547 | null |
2024-12-19 | SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing | Xueting Li et.al. | 2412.09545 | null |
2024-12-12 | Kinetic simulations of the Kruskal-Schwarzchild instability in accelerating striped outflows I: Dynamics and energy dissipation | William Groger et.al. | 2412.09541 | null |
2024-12-12 | GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency | Dongyue Lu et.al. | 2412.09511 | link |
2024-12-12 | Assessing the Role of Volumetric Brain Information in Multiple Sclerosis Progression | Andy A. Shen et.al. | 2412.09497 | null |
2024-12-12 | Finite-PINN: A Physics-Informed Neural Network Architecture for Solving Solid Mechanics Problems with General Geometries | Haolin Li et.al. | 2412.09453 | link |
2024-12-12 | A Plug-and-Play Algorithm for 3D Video Super-Resolution of Single-Photon LiDAR data | Alice Ruget et.al. | 2412.09427 | null |
2024-12-12 | Mixture of neural fields for heterogeneous reconstruction in cryo-EM | Axel Levy et.al. | 2412.09420 | null |
2024-12-12 | Space-time inverse-scattering of translation-based motion | Jeongsoo Kim et.al. | 2412.09403 | null |
2024-12-19 | SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos | Yuzheng Liu et.al. | 2412.09401 | link |
2024-12-12 | Synchrotron X-Ray Multi-Projection Imaging for Multiphase Flow | Tomas Rosén et.al. | 2412.09368 | null |
2024-12-12 | T-SVG: Text-Driven Stereoscopic Video Generation | Qiao Jin et.al. | 2412.09323 | null |
2024-12-12 | Enhancing Implicit Neural Representations via Symmetric Power Transformation | Weixiang Zhang et.al. | 2412.09213 | link |
2024-12-12 | Imperceptible Gaze Guidance Through Ocularity in Virtual Reality | Virmarie Maquiling et.al. | 2412.09204 | null |
2024-12-12 | LIVE-GS: LLM Powers Interactive VR by Enhancing Gaussian Splatting | Haotian Mao et.al. | 2412.09176 | null |
2024-12-12 | Geometrically exact static 3D Cosserat rods problem solved using a shooting method | Surmont Florian et.al. | 2412.09146 | null |
2024-12-12 | LVMark: Robust Watermark for latent video diffusion models | MinHyuk Jang et.al. | 2412.09122 | null |
2024-12-12 | Vision CNNs trained to estimate spatial latents learned similar ventral-stream-aligned representations | Yudi Xie et.al. | 2412.09115 | link |
2024-12-12 | NNLL Transverse Momentum Dependent evolution in the Parton Branching method | Aleksandra Lelek et.al. | 2412.09108 | null |
2024-12-12 | Detrimental 2 $p$-3$d$ Hybridisation in Ni Nanosheets Supported on Strontium Dioxide for Catalytic H$_2$ Production, Necessitating Thickness Optimisation | Kabir S. Suraj et.al. | 2412.09071 | null |
2024-12-12 | Hyperbolic-constraint Point Cloud Reconstruction from Single RGB-D Images | Wenrui Li et.al. | 2412.09055 | null |
2024-12-12 | Supersonic Shear and Wall-Bounded Flows With Body-Fitted Meshes Using the Semi-Lagrangian Lattice Boltzmann Method: Boundary Schemes and Applications | Philipp Spelten et.al. | 2412.09051 | null |
2024-12-12 | Motif Guided Graph Transformer with Combinatorial Skeleton Prototype Learning for Skeleton-Based Person Re-Identification | Haocong Rao et.al. | 2412.09044 | link |
2024-12-12 | Preclinical Water-Mediated Ultrasound Platform using Clinical FOV for Molecular Targeted Contrast-Enhanced Ultrasound | Stavros Melemenidis et.al. | 2412.09042 | null |
2024-12-12 | Physics-Informed Neural Networks for Solving Contact Problems in Three Dimensions | Tarik Sahin et.al. | 2412.09022 | null |
2024-12-12 | MS2Mesh-XR: Multi-modal Sketch-to-Mesh Generation in XR Environments | Yuqi Tong et.al. | 2412.09008 | null |
2024-12-12 | Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation | Lianrui Mu et.al. | 2412.08976 | null |
2024-12-12 | Three-Dimensional Construction of Hyperuniform, Nonhyperuniform and Antihyperuniform Random Media via Spectral Density Functions and Their Transport Properties | Wenlong Shi et.al. | 2412.08974 | null |
2024-12-12 | Is Contrastive Distillation Enough for Learning Comprehensive 3D Representations? | Yifan Zhang et.al. | 2412.08973 | null |
2024-12-12 | Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation | Xinyue Liu et.al. | 2412.08949 | link |
2024-12-12 | WiFo: Wireless Foundation Model for Channel Prediction | Boxun Liu et.al. | 2412.08908 | null |
2024-12-11 | DALI: Domain Adaptive LiDAR Object Detection via Distribution-level and Instance-level Pseudo Label Denoising | Xiaohu Lu et.al. | 2412.08806 | link |
2024-12-11 | Reward-based Blockchain Infrastructure for 3D IC Supply Chain Provenance | Sulyab Thottungal Valapu et.al. | 2412.08777 | null |
2024-12-11 | ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder | Jungho Kim et.al. | 2412.08774 | link |
2024-12-11 | Vision-based indoor localization of nano drones in controlled environment with its applications | Simranjeet Singh et.al. | 2412.08757 | link |
2024-12-11 | Integrated modeling of RF-Induced Tungsten Erosion at ICRH Antenna Structures in the WEST Tokamak | A. Kumar et.al. | 2412.08748 | null |
2024-12-11 | DeepNose: An Equivariant Convolutional Neural Network Predictive Of Human Olfactory Percepts | Sergey Shuvaev et.al. | 2412.08747 | null |
2024-12-14 | The 3D morphology of open clusters in the solar neighborhood III: Fractal dimension | Chang Qin et.al. | 2412.08710 | null |
2024-12-11 | The Spin-Orbit Alignment of 8 Warm Gas Giant Systems | Juan I. Espinoza-Retamal et.al. | 2412.08692 | null |
2024-12-11 | Coherent3D: Coherent 3D Portrait Video Reconstruction via Triplane Fusion | Shengze Wang et.al. | 2412.08684 | null |
2024-12-11 | StreamChat: Chatting with Streaming Video | Jihao Liu et.al. | 2412.08646 | null |
2024-12-11 | 3D Mesh Editing using Masked LRMs | Will Gao et.al. | 2412.08641 | null |
2024-12-11 | BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation | Shengze Wang et.al. | 2412.08640 | null |
2024-12-11 | Dust and gas modelling in radiative transfer simulations of disc-dominated galaxies with RADMC-3D | Francesco Sinigaglia et.al. | 2412.08609 | null |
2024-12-11 | ASDnB: Merging Face with Body Cues For Robust Active Speaker Detection | Tiago Roxo et.al. | 2412.08594 | link |
2024-12-11 | RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation | Mingfei Han et.al. | 2412.08591 | null |
2024-12-11 | Learning to Decouple the Lights for 3D Face Texture Modeling | Tianxin Huang et.al. | 2412.08524 | null |
2024-12-11 | Combining Neural Fields and Deformation Models for Non-Rigid 3D Motion Reconstruction from Partial Data | Aymen Merrouche et.al. | 2412.08511 | null |
2024-12-11 | PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis | Yifan Xie et.al. | 2412.08504 | null |
2024-12-11 | High-speed scattering polarimetry for correlative nerve fiber imaging and multi-modal analysis | Franca auf der Heiden et.al. | 2412.08499 | null |
2024-12-12 | Drift-free Visual SLAM using Digital Twins | Roxane Merat et.al. | 2412.08496 | null |
2024-12-11 | ConvMesh: Reimagining Mesh Quality Through Convex Optimization | Alexander Valverde et.al. | 2412.08484 | null |
2024-12-14 | PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion | Yi Zhong et.al. | 2412.08421 | link |
2024-12-12 | Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views | Songchun Zhang et.al. | 2412.08412 | null |
2024-12-13 | Physical Informed Driving World Model | Zhuoran Yang et.al. | 2412.08410 | null |
2024-12-11 | LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba | Yubo Cui et.al. | 2412.08388 | null |
2024-12-11 | Simplex tensor network renormalization group for boundary theory of 3+1D symTFT | Kaixin Ji et.al. | 2412.08374 | null |
2024-12-11 | SLGaussian: Fast Language Gaussian Splatting in Sparse Views | Kangjie Chen et.al. | 2412.08331 | null |
2024-12-11 | Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion | Jisheng Chu et.al. | 2412.08326 | link |
2024-12-11 | Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Bingzhi Shen et.al. | 2412.08315 | null |
2024-12-11 | Nonclassical heat flow in passive chiral solids is third rank, not odd | Roderic Lakes et.al. | 2412.08309 | null |
2024-12-11 | Position-aware Guided Point Cloud Completion with CLIP Model | Feng Zhou et.al. | 2412.08271 | null |
2024-12-11 | Neural Observation Field Guided Hybrid Optimization of Camera Placement | Yihan Cao et.al. | 2412.08266 | link |
2024-12-11 | Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction | Bohan Li et.al. | 2412.08243 | null |
2024-12-11 | Unified HT-CNNs Architecture: Transfer Learning for Segmenting Diverse Brain Tumors in MRI from Gliomas to Pediatric Tumors | Ramy A. Zeineldin et.al. | 2412.08240 | null |
2024-12-16 | Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming | Ziqi Gao et.al. | 2412.08221 | link |
2024-12-18 | GN-FR:Generalizable Neural Radiance Fields for Flare Removal | Gopi Raju Matta et.al. | 2412.08200 | null |
2024-12-11 | Semantic Scene Completion Based 3D Traversability Estimation for Off-Road Terrains | Zitong Chen et.al. | 2412.08195 | null |
2024-12-11 | Textured Mesh Saliency: Bridging Geometry and Texture for Human Perception in 3D Graphics | Kaiwei Zhang et.al. | 2412.08188 | link |
2024-12-11 | ProGDF: Progressive Gaussian Differential Field for Controllable and Flexible 3D Editing | Yian Zhao et.al. | 2412.08152 | null |
2024-12-11 | Dense Depth from Event Focal Stack | Kenta Horikawa et.al. | 2412.08120 | null |
2024-12-11 | Generative Zoo | Tomasz Niewiadomski et.al. | 2412.08101 | null |
2024-12-11 | THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots | Zeshun Li et.al. | 2412.08096 | null |
2024-12-11 | Photonic torons, topological phase transition and tunable spin monopoles | Haijun Wu et.al. | 2412.08083 | null |
2024-12-11 | How to select slices for annotation to train best-performing deep learning segmentation models for cross-sectional medical images? | Yixin Zhang et.al. | 2412.08081 | null |
2024-12-11 | Error estimate for the first order energy stable scheme of Q-tensor nematic model | Jin Huang et.al. | 2412.08027 | null |
2024-12-11 | Topological columnar nano-SQUID based on a 3D topological insulator | Ella Nikodem et.al. | 2412.07993 | null |
2024-12-10 | Diffusion-Based Attention Warping for Consistent 3D Scene Editing | Eyal Gomel et.al. | 2412.07984 | null |
2024-12-10 | 3D Convective Urca Process in a Simmering White Dwarf | Brendan Boyd et.al. | 2412.07938 | link |
2024-12-10 | Graph convolutional networks enable fast hemorrhagic stroke monitoring with electrical impedance tomography | J. Toivanen et.al. | 2412.07888 | null |
2024-12-10 | Channels of Stellar-mass Black Hole Formation | Adam Burrows et.al. | 2412.07831 | null |
2024-12-10 | 3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark | Wufei Ma et.al. | 2412.07825 | null |
2024-12-05 | Mogo: RQ Hierarchical Causal Transformer for High-Quality 3D Human Motion Generation | Dongjie Fu et.al. | 2412.07797 | null |
2024-12-10 | From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos | Matthew Wallingford et.al. | 2412.07770 | link |
2024-12-11 | Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting | Zetong Yang et.al. | 2412.07768 | null |
2024-12-12 | Learning Visual Generative Priors without Text | Shuailei Ma et.al. | 2412.07767 | null |
2024-12-10 | Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds | Xiaoyu Xiang et.al. | 2412.07766 | null |
2024-12-10 | SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints | Jianhong Bai et.al. | 2412.07760 | link |
2024-12-10 | 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation | Xiao Fu et.al. | 2412.07759 | null |
2024-12-10 | SAT: Spatial Aptitude Training for Multimodal Language Models | Arijit Ray et.al. | 2412.07755 | null |
2024-12-10 | LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models | Ziqi Lu et.al. | 2412.07746 | null |
2024-12-10 | ObjCtrl-2.5D: Training-free Object Control with Camera Poses | Zhouxia Wang et.al. | 2412.07721 | null |
2024-12-10 | SimVS: Simulating World Inconsistencies for Robust View Synthesis | Alex Trevithick et.al. | 2412.07696 | null |
2024-12-10 | Proc-GS: Procedural Building Generation for City Assembly with 3D Gaussians | Yixuan Li et.al. | 2412.07660 | null |
2024-12-18 | PVP: Polar Representation Boost for 3D Semantic Occupancy Prediction | Yujing Xue et.al. | 2412.07616 | null |
2024-12-10 | Faster and Better 3D Splatting via Group Training | Chengbo Wang et.al. | 2412.07608 | null |
2024-12-10 | Optimization-Driven Design of Monolithic Soft-Rigid Grippers | Pierluigi Mansueto et.al. | 2412.07556 | null |
2024-12-10 | Planetary Dynamos in Evolving Cold Gas Giants | Albert Elias-López et.al. | 2412.07551 | null |
2024-12-10 | ReCap: Better Gaussian Relighting with Cross-Environment Captures | Jingzhi Li et.al. | 2412.07534 | link |
2024-12-14 | Stealthy and Robust Backdoor Attack against 3D Point Clouds through Additional Point Features | Xiaoyang Ning et.al. | 2412.07511 | null |
2024-12-10 | Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis | Vladislav Li et.al. | 2412.07509 | null |
2024-12-10 | ResGS: Residual Densification of 3D Gaussian for Efficient Detail Recovery | Yanzhe Lyu et.al. | 2412.07494 | null |
2024-12-10 | Stereo Hand-Object Reconstruction for Human-to-Robot Handover | Yik Lung Pang et.al. | 2412.07487 | null |
2024-12-10 | Image Reconstruction in Cone Beam Computed Tomography Using Controlled Gradient Sparsity | Alexander Meaney et.al. | 2412.07465 | null |
2024-12-10 | LOGen: Toward Lidar Object Generation by Point Diffusion | Ellington Kirby et.al. | 2412.07385 | null |
2024-12-11 | CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings | Jiazuo Mu et.al. | 2412.07377 | null |
2024-12-10 | PRM: Photometric Stereo based Large Reconstruction Model | Wenhang Ge et.al. | 2412.07371 | null |
2024-12-10 | In-poor IGZO: superior resilience to hydrogen in forming gas anneal and PBTI | A. Kruv et.al. | 2412.07362 | null |
2024-12-10 | Efficient 3D Recognition with Event-driven Spike Sparse Convolution | Xuerui Qiu et.al. | 2412.07360 | link |
2024-12-10 | High-ellipticity resonant below-threshold harmonic generation by a helium atom driven by a moderately intense elliptically polarized laser field | Mikhail Yu. Emelin et.al. | 2412.07346 | null |
2024-12-10 | CoMA: Compositional Human Motion Generation with Multi-modal Agents | Shanlin Sun et.al. | 2412.07320 | null |
2024-12-10 | Compression of Large-Scale 3D Point Clouds Based on Joint Optimization of Point Sampling and Feature Extraction | Jae-Young Yim et.al. | 2412.07302 | null |
2024-12-10 | EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering | Toshiya Yura et.al. | 2412.07293 | null |
2024-12-10 | ArtFormer: Controllable Generation of Diverse 3D Articulated Objects | Jiayi Su et.al. | 2412.07237 | link |
2024-12-10 | Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling | Hui Deng et.al. | 2412.07230 | null |
2024-12-10 | RoboMM: All-in-One Multimodal Large Model for Robotic Manipulation | Feng Yan et.al. | 2412.07215 | link |
2024-12-10 | X-ray magnetic circular dichroism and resonant inelastic X-ray scattering explained: role of many-body correlation and mixed-valence fluctuations | Beom Hyun Kim et.al. | 2412.07204 | null |
2024-12-10 | Digital Twin Assisted Beamforming Design for Integrated Sensing and Communication Systems | Shuaifeng Jiang et.al. | 2412.07180 | null |
2024-12-10 | Fast Occupancy Network | Mingjie Lu et.al. | 2412.07163 | null |
2024-12-10 | Revisiting Lesion Tracking in 3D Total Body Photography | Wei-Lun Huang et.al. | 2412.07132 | null |
2024-12-10 | Recent progress on the solid-state materials for photocatalysis | L. D. Tamang et.al. | 2412.07110 | null |
2024-12-09 | Flexible and Efficient Semi-Empirical DFTB Parameters for Electronic Structure Prediction of 3D, 2D Iodide Perovskites and Heterostructures | Junke Jiang et.al. | 2412.07016 | null |
2024-12-11 | ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models | Jieyu Zhang et.al. | 2412.07012 | link |
2024-12-11 | Correlation-weighted 23Na magnetic resonance fingerprinting in the brain | Lauren F. O’Donnell et.al. | 2412.07006 | link |
2024-12-09 | Influence of the turbulent magnetic pressure on isothermal jet emitting disks | N. Zimniak et.al. | 2412.06999 | null |
2024-12-09 | Diffusing Differentiable Representations | Yash Savani et.al. | 2412.06981 | null |
2024-12-09 | Linear bending wave propagation in laminar and turbulent discs | Callum W. Fairbairn et.al. | 2412.06955 | null |
2024-12-09 | Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation | Ruihan Gao et.al. | 2412.06785 | link |
2024-12-09 | Diverse Score Distillation | Yanbo Xu et.al. | 2412.06780 | null |
2024-12-09 | Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving | Xin Fei et.al. | 2412.06777 | link |
2024-12-09 | MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views | Antoine Guédon et.al. | 2412.06767 | null |
2024-12-09 | 3D Graph Attention Networks for High Fidelity Pediatric Glioma Segmentation | Harish Thangaraj et.al. | 2412.06743 | null |
2024-12-09 | Non-invertible twisted compactification of class $\mathcal S$ theory and $(B,B,B)$ branes | Yankun Ma et.al. | 2412.06729 | null |
2024-12-09 | On Pooling-Based Track Fusion Strategies : Harmonic Mean Density | Nikhil Sharma et.al. | 2412.06716 | null |
2024-12-14 | You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale | Baorui Ma et.al. | 2412.06699 | link |
2024-12-09 | Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy | Yuxuan Xue et.al. | 2412.06698 | null |
2024-12-09 | MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences | Weitao Wang et.al. | 2412.06614 | null |
2024-12-09 | 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation | Chun-Peng Chang et.al. | 2412.06613 | null |
2024-12-09 | Augmented reality for upper limb rehabilitation: real-time kinematic feedback with HoloLens 2 | Beatrice Luciani et.al. | 2412.06596 | null |
2024-12-09 | PrEditor3D: Fast and Precise 3D Shape Editing | Ziya Erkoç et.al. | 2412.06592 | null |
2024-12-09 | More on thermal holographic RG flows in a 3D gauged supergravity | Anastasia A. Golubtsova et.al. | 2412.06536 | null |
2024-12-09 | A Finite Volume Method for Elastic Waves in Heterogeneous, Anisotropic and Fractured Media | Ingrid Kristine Jacobsen et.al. | 2412.06514 | null |
2024-12-09 | BATseg: Boundary-aware Multiclass Spinal Cord Tumor Segmentation on 3D MRI Scans | Hongkang Song et.al. | 2412.06507 | null |
2024-12-09 | PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting | Yihong Xu et.al. | 2412.06491 | null |
2024-12-09 | An Efficient Scene Coordinate Encoding and Relocalization Method | Kuan Xu et.al. | 2412.06488 | link |
2024-12-09 | Determining the acceleration regions of in situ electrons using remote radio and X-ray observations | D. E. Morosan et.al. | 2412.06477 | null |
2024-12-09 | Can foundation models actively gather information in interactive environments to test hypotheses? | Nan Rosemary Ke et.al. | 2412.06438 | null |
2024-12-09 | Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video | Renlong Wu et.al. | 2412.06424 | link |
2024-12-09 | World-Consistent Data Generation for Vision-and-Language Navigation | Yu Zhong et.al. | 2412.06413 | null |
2024-12-09 | Cramér-Rao Bound Analysis and Beamforming Design for 3D Extended Target in ISAC: From Optimization to Learning Approaches | Yiqiu Wang et.al. | 2412.06353 | null |
2024-12-09 | TriDi: Trilateral Diffusion of 3D Humans, Objects, and Interactions | Ilya A. Petrov et.al. | 2412.06334 | null |
2024-12-09 | Robust Output Tracking for an Uncertain and Nonlinear 3D PDE-ODE System: Preventing Induced Seismicity in Underground Reservoirs | Diego Gutiérrez-Oribio et.al. | 2412.06327 | null |
2024-12-09 | 4D Gaussian Splatting with Scale-aware Residual Field and Adaptive Optimization for Real-time Rendering of Temporally Complex Dynamic Scenes | Jinbo Yan et.al. | 2412.06299 | null |
2024-12-09 | ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models | Bingchen Gong et.al. | 2412.06292 | null |
2024-12-09 | NLTE abundances of Eu for a sample of metal-poor stars in the Galactic Halo and Metal-poor Disk with 1D and <3D> models | Yanjun Guo et.al. | 2412.06277 | null |
2024-12-09 | Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction | Dongxu Wei et.al. | 2412.06273 | null |
2024-12-09 | EchoSim4D: A Proof-of-Concept Gamified XR Echocardiography Training Simulator for Neonates using 4D Ultrasound Volume | Deepthy Rose Jose et.al. | 2412.06271 | null |
2024-12-09 | Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework | Jiuyi Xu et.al. | 2412.06268 | null |
2024-12-12 | Advancing Extended Reality with 3D Gaussian Splatting: Innovations and Prospects | Shi Qiu et.al. | 2412.06257 | null |
2024-12-09 | Splatter-360: Generalizable 360 $^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images | Zheng Chen et.al. | 2412.06250 | link |
2024-12-09 | Rendering-Refined Stable Diffusion for Privacy Compliant Synthetic Data | Kartik Patwari et.al. | 2412.06248 | null |
2024-12-12 | Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction | Seungtae Nam et.al. | 2412.06234 | null |
2024-12-08 | Adaptive and Context-Aware Volumetric Printing | Sammy Florczak et.al. | 2412.06053 | null |
2024-12-08 | Morpho-plastic cellular metamaterials | Victor Charpentier et.al. | 2412.06022 | null |
2024-12-08 | Lightweight Spatial Embedding for Vision-based 3D Occupancy Prediction | Jinqing Zhang et.al. | 2412.05976 | null |
2024-12-12 | Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation | Zipeng Qi et.al. | 2412.05969 | null |
2024-12-08 | FOF-X: Towards Real-time Detailed Human Reconstruction from a Single Image | Qiao Feng et.al. | 2412.05961 | null |
2024-12-08 | Enhanced 3D Generation by 2D Editing | Haoran Li et.al. | 2412.05929 | null |
2024-12-08 | GBR: Generative Bundle Refinement for High-fidelity Gaussian Splatting and Meshing | Jianing Zhang et.al. | 2412.05908 | null |
2024-12-08 | Accelerating Video Diffusion Models via Distribution Matching | Yuanzhi Zhu et.al. | 2412.05899 | null |
2024-12-08 | 3D-Consistent Image Inpainting with Diffusion Models | Leonid Antsfeld et.al. | 2412.05881 | null |
2024-12-08 | Leveraging virtual technologies to enhance museums and art collections: insights from project CHANGES | Gianluca Genovese et.al. | 2412.05880 | null |
2024-12-08 | MG-3D: Multi-Grained Knowledge-Enhanced 3D Medical Vision-Language Pre-training | Xuefeng Ni et.al. | 2412.05876 | link |
2024-12-08 | Unsupervised Multi-Parameter Inverse Solving for Reducing Ring Artifacts in 3D X-Ray CBCT | Qing Wu et.al. | 2412.05853 | null |
2024-12-08 | Evolving Algebraic Multigrid Methods Using Grammar-Guided Genetic Programming | Dinesh Parthasarathy et.al. | 2412.05852 | null |
2024-12-08 | Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features | Yuanbo Xiangli et.al. | 2412.05826 | null |
2024-12-08 | SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization | Shuzhao Xie et.al. | 2412.05808 | null |
2024-12-08 | Weak Serrin-type blowup criterion for the 3D full compressible Navier-Stokes equations | Minghong Xie et.al. | 2412.05793 | null |
2024-12-08 | InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction | Pengzhen Ren et.al. | 2412.05789 | link |
2024-12-07 | Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes | Saqib Javed et.al. | 2412.05700 | null |
2024-12-07 | WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking | Yuqi Tan et.al. | 2412.05695 | null |
2024-12-07 | High Order Free Boundary MHD Equilibria in DESC | Rory Conlin et.al. | 2412.05680 | link |
2024-12-07 | High SNR 3D Imaging from Millimeter-scale Thick Tissues to Cellular Dynamics via Structured Illumination Microscopy | Mengrui Wang et.al. | 2412.05677 | null |
2024-12-07 | Probing massive neutrinos and modified gravity with redshift-space morphologies and anisotropies of large-scale structure | Wei Liu et.al. | 2412.05662 | null |
2024-12-07 | RefSAM3D: Adapting SAM with Cross-modal Reference for 3D Medical Image Segmentation | Xiang Gao et.al. | 2412.05605 | null |
2024-12-07 | TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances | Wenting Xu et.al. | 2412.05596 | null |
2024-12-07 | Real-Time 3D Object Detection Using InnovizOne LiDAR and Low-Power Hailo-8 AI Accelerator | Itay Krispin-Avraham et.al. | 2412.05594 | link |
2024-12-07 | UMSPU: Universal Multi-Size Phase Unwrapping via Mutual Self-Distillation and Adaptive Boosting Ensemble Segmenters | Lintong Du et.al. | 2412.05584 | null |
2024-12-07 | Self-Supervised Masked Mesh Learning for Unsupervised Anomaly Detection on 3D Cortical Surfaces | Hao-Chun Yang et.al. | 2412.05580 | null |
2024-12-07 | Rate-Distortion Optimized Skip Coding of Region Adaptive Hierarchical Transform Coefficients for MPEG G-PCC | Zehan Wang et.al. | 2412.05574 | null |
2024-12-07 | Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis | Diwen Wan et.al. | 2412.05570 | null |
2024-12-07 | SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision | Kangjie Zheng et.al. | 2412.05569 | null |
2024-12-07 | Text-to-3D Gaussian Splatting with Physics-Grounded Motion Generation | Wenqing Wang et.al. | 2412.05560 | null |
2024-12-07 | CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences | Huajian Zeng et.al. | 2412.05557 | link |
2024-12-07 | Street Gaussians without 3D Object Tracker | Ruida Zhang et.al. | 2412.05548 | null |
2024-12-07 | Radiant: Large-scale 3D Gaussian Rendering based on Hierarchical Framework | Haosong Peng et.al. | 2412.05546 | null |
2024-12-07 | Towards 3D Acceleration for low-power Mixture-of-Experts and Multi-Head Attention Spiking Transformers | Boxun Xu et.al. | 2412.05540 | null |
2024-12-07 | A Scene Representation for Online Spatial Sonification | Lan Wu et.al. | 2412.05486 | null |
2024-12-06 | What’s the Move? Hybrid Imitation Learning via Salient Points | Priya Sundaresan et.al. | 2412.05426 | null |
2024-12-06 | Is Chaotic Advection Inherent to Heterogeneous Darcy Flow? | Daniel R. Lester et.al. | 2412.05419 | null |
2024-12-06 | Facile “Pick-up” experiments and Monte Carlo simulations for the entanglement of tunable staple-like particles | Youhan Sohn et.al. | 2412.05416 | null |
2024-12-06 | Linking Dispersion and Stirring in Randomly Braiding Flows | Daniel R. Lester et.al. | 2412.05407 | null |
2024-12-06 | Giant Gravitons and Volume Minimisation | Heng-Yu Chen et.al. | 2412.05357 | null |
2024-12-03 | $ρ$ -NeRF: Leveraging Attenuation Priors in Neural Radiance Field for 3D Computed Tomography Reconstruction | Li Zhou et.al. | 2412.05322 | null |
2024-12-11 | Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model | Lening Wang et.al. | 2412.05280 | link |
2024-12-06 | Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories | Susung Hong et.al. | 2412.05279 | null |
2024-12-06 | Birth and Death of a Rose | Chen Geng et.al. | 2412.05278 | null |
2024-12-06 | Text to Blind Motion | Hee Jae Kim et.al. | 2412.05277 | null |
2024-12-06 | SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images | Jiahua Dong et.al. | 2412.05274 | null |
2024-12-06 | Interplay of Quasi-Quantum Hall Effect and Coulomb Disorder in Semimetals | Ian A. Leahy et.al. | 2412.05273 | null |
2024-12-06 | DenseMatcher: Learning 3D Semantic Correspondence for Category-Level Manipulation from a Single Demo | Junzhe Zhu et.al. | 2412.05268 | null |
2024-12-10 | Extrapolated Urban View Synthesis Benchmark | Xiangyu Han et.al. | 2412.05256 | link |
2024-12-06 | DNF: Unconditional 4D Generation with Dictionary-based Neural Fields | Xinyi Zhang et.al. | 2412.05161 | null |
2024-12-06 | Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Chaoda Zheng et.al. | 2412.05154 | link |
2024-12-10 | Non-linear magnetic buoyancy instability and galactic dynamos | Yasin Qazi et.al. | 2412.05086 | null |
2024-12-06 | Spinal ligaments detection on vertebrae meshes using registration and 3D edge detection | Ivanna Kramer et.al. | 2412.05081 | null |
2024-12-06 | BimArt: A Unified Approach for the Synthesis of 3D Bimanual Interaction with Articulated Objects | Wanyue Zhang et.al. | 2412.05066 | null |
2024-12-06 | Reconstruction of 3D lumbar spine models from incomplete segmentations using landmark detection | Lara Blomenkamp et.al. | 2412.05065 | null |
2024-12-06 | Spatial Bandwidth of Bilateral Near-Field Channels for Linear Large-Scale Antenna Array System | Zhen Wang et.al. | 2412.05058 | null |
2024-12-06 | Improving Post-Earthquake Crack Detection using Semi-Synthetic Generated Images | Piercarlo Dondi et.al. | 2412.05042 | null |
2024-12-06 | BMS $_3$ fermionic localization | Joan Simón et.al. | 2412.05038 | null |
2024-12-06 | FlowPolicy: Enabling Fast and Robust 3D Flow-based Policy via Consistency Flow Matching for Robot Manipulation | Qinglun Zhang et.al. | 2412.04987 | link |
2024-12-11 | MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting | Peng Chen et.al. | 2412.04955 | link |
2024-12-06 | Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction | Jixuan Fan et.al. | 2412.04887 | link |
2024-12-06 | Automatic Tissue Differentiation in Parotidectomy using Hyperspectral Imaging | Eric L. Wisotzky et.al. | 2412.04879 | null |
2024-12-06 | UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving | Rui Chen et.al. | 2412.04842 | link |
2024-12-06 | WRF-GS: Wireless Radiation Field Reconstruction with 3D Gaussian Splatting | Chaozheng Wen et.al. | 2412.04832 | link |
2024-12-06 | PanoDreamer: 3D Panorama Synthesis from a Single Image | Avinash Paliwal et.al. | 2412.04827 | link |
2024-12-06 | Pushing Rendering Boundaries: Hard Gaussian Splatting | Qingshan Xu et.al. | 2412.04826 | null |
2024-12-06 | A Multi-physics Model of Flow from Coronary Angiography: Insights into Microvascular Function | Haizhou Yang et.al. | 2412.04798 | null |
2024-12-06 | Passive Six-Dimensional Movable Antenna (6DMA)-Assisted Multiuser Communication | Haozhe Wang et.al. | 2412.04720 | null |
2024-12-06 | PCTreeS: 3D Point Cloud Tree Species Classification Using Airborne LiDAR Images | Hongjin Lin et.al. | 2412.04714 | null |
2024-12-06 | Raspberry Pi multispectral imaging camera system (PiMICS): a low-cost, skills-based physics educational tool | John C. Howell et.al. | 2412.04679 | null |
2024-12-05 | Exact Inversion from Space-filling Trajectories in Cone-beam Transmission Tomography | Murdock Grewar et.al. | 2412.04669 | null |
2024-12-05 | ProPLIKS: Probablistic 3D human body pose estimation | Karthik Shetty et.al. | 2412.04665 | null |
2024-12-05 | Learning for Layered Safety-Critical Control with Predictive Control Barrier Functions | William D. Compton et.al. | 2412.04658 | null |
2024-12-05 | Drift-cyclotron loss-cone instability in 3D simulations of a sloshing-ion simple mirror | Aaron Tran et.al. | 2412.04656 | link |
2024-12-05 | SDSS J100711.74+193056.2: A Candidate Common Motion Substellar Companion to the Nearest B-Type Star Regulus | Eric E. Mamajek et.al. | 2412.04599 | null |
2024-12-05 | Spin Chern number in altermagnets | Rafael Gonzalez-Hernandez et.al. | 2412.04593 | null |
2024-12-05 | MetaFormer: High-fidelity Metalens Imaging via Aberration Correcting Transformers | Byeonghyeon Lee et.al. | 2412.04591 | null |
2024-12-05 | Data-Driven, Parameterized Reduced-order Models for Predicting Distortion in Metal 3D Printing | Indu Kant Deo et.al. | 2412.04577 | null |
2024-12-05 | Going from 3D common-envelope simulations to fast 1D simulations | V. A. Bronner et.al. | 2412.04543 | null |
2024-12-05 | 2.5D Super-Resolution Approaches for X-ray Computed Tomography-based Inspection of Additively Manufactured Parts | Haley Duba-Sullivan et.al. | 2412.04525 | null |
2024-12-11 | votess: A multi-target, GPU-capable, parallel Voronoi tessellator | Samridh Dev Singh et.al. | 2412.04514 | link |
2024-12-05 | PaintScene4D: Consistent 4D Scene Generation from Text Prompts | Vinayak Gupta et.al. | 2412.04471 | null |
2024-12-05 | Turbo3D: Ultra-fast Text-to-3D Generation | Hanzhe Hu et.al. | 2412.04470 | null |
2024-12-05 | QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos | Sharath Girish et.al. | 2412.04469 | null |
2024-12-12 | DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction | Ben Kaye et.al. | 2412.04464 | null |
2024-12-05 | Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering | Cheng Sun et.al. | 2412.04459 | link |
2024-12-05 | Cubify Anything: Scaling Indoor 3D Object Detection | Justin Lazarow et.al. | 2412.04458 | null |
2024-12-06 | PBDyG: Position Based Dynamic Gaussians for Motion-Aware Clothed Human Avatars | Shota Sasaki et.al. | 2412.04433 | null |
2024-12-05 | Journey to the center of the common envelope evolution. Inner dynamics of the post-dynamical inspiral | Damien Gagnier et.al. | 2412.04419 | null |
2024-12-06 | GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction | Yuanhui Huang et.al. | 2412.04384 | link |
2024-12-05 | SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | Rong Li et.al. | 2412.04383 | null |
2024-12-06 | EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding | Yuqi Wu et.al. | 2412.04380 | link |
2024-12-05 | Small-scale dynamics and structure of free-surface turbulence | Yinghe Qi et.al. | 2412.04361 | null |
2024-12-05 | DARWEN: Data-driven Algorithm for Reduction of Wide Exoplanetary Networks | A. Lira-Barria et.al. | 2412.04359 | null |
2024-12-05 | Likelihood-Scheduled Score-Based Generative Modeling for Fully 3D PET Image Reconstruction | George Webber et.al. | 2412.04339 | null |
2024-12-05 | Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird’s-Eye-View via Uncertainty Measure | Saheli Hazra et.al. | 2412.04337 | null |
2024-12-05 | Generative-Model-Based Fully 3D PET Image Reconstruction by Conditional Diffusion Sampling | George Webber et.al. | 2412.04319 | null |
2024-12-05 | Towards Zero-shot 3D Anomaly Localization | Yizhou Wang et.al. | 2412.04304 | null |
2024-12-05 | Numerical study of the dimensionally reduced 3D Ising model | Tolga Kiel et.al. | 2412.04278 | null |
2024-12-05 | 3D Part Segmentation via Geometric Aggregation of 2D Visual Features | Marco Garosi et.al. | 2412.04247 | link |
2024-12-05 | GigaHands: A Massive Annotated Dataset of Bimanual Hand Activities | Rao Fu et.al. | 2412.04244 | null |
2024-12-05 | LMDM:Latent Molecular Diffusion Model For 3D Molecule Generation | Xiang Chen et.al. | 2412.04242 | null |
2024-12-05 | New Methods for Computer Tomography Based Ion Thruster Diagnostics and Simulation | Jörn Krenzer et.al. | 2412.04214 | null |
2024-12-05 | Bound of Casimir Effect by Holography | Rong-Xin Miao et.al. | 2412.04122 | null |
2024-12-05 | DeepFEA: Deep Learning for Prediction of Transient Finite Element Analysis Solutions | Georgios Triantafyllou et.al. | 2412.04121 | null |
2024-12-10 | CrossSDF: 3D Reconstruction of Thin Structures From Cross-Sections | Thomas Walker et.al. | 2412.04120 | null |
2024-12-06 | BodyMetric: Evaluating the Realism of Human Bodies in Text-to-Image Generation | Nefeli Andreou et.al. | 2412.04086 | null |
2024-12-05 | Modeling the astrosphere of LHS~1140 | K. Scherer et.al. | 2412.04018 | null |
2024-12-10 | IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation | Sejong Yang et.al. | 2412.04000 | null |
2024-12-05 | Performance Analysis of XL-MIMO with Rotary and Movable Antennas for High-speed Railway | Wenhui Yi et.al. | 2412.03940 | null |
2024-12-05 | InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models | Yifan Lu et.al. | 2412.03934 | null |
2024-12-11 | MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction | Mithun Parab et.al. | 2412.03928 | null |
2024-12-05 | Traffic Co-Simulation Framework Empowered by Infrastructure Camera Sensing and Reinforcement Learning | Talha Azfar et.al. | 2412.03925 | null |
2024-12-05 | Multi-View Pose-Agnostic Change Localization with Zero Labels | Chamuditha Jayanga Galappaththige et.al. | 2412.03911 | null |
2024-12-05 | DGNS: Deformable Gaussian Splatting and Dynamic Neural Surface for Monocular Dynamic 3D Reconstruction | Xuesong Li et.al. | 2412.03910 | link |
2024-12-05 | 4D SlingBAG: spatial-temporal coupled Gaussian ball for large-scale dynamic 3D photoacoustic iterative reconstruction | Shuang Li et.al. | 2412.03898 | link |
2024-12-05 | ShapeCraft: Body-Aware and Semantics-Aware 3D Object Design | Michelle Guo et.al. | 2412.03889 | null |
2024-12-11 | Winding number on 3D lattice | Okuto Morikawa et.al. | 2412.03888 | link |
2024-12-05 | DiffSign: AI-Assisted Generation of Customizable Sign Language Videos With Enhanced Realism | Sudha Krishnamurthy et.al. | 2412.03878 | link |
2024-12-10 | HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting | Jingyu Lin et.al. | 2412.03844 | link |
2024-12-05 | Coordinate In and Value Out: Training Flow Transformers in Ambient Space | Yuyang Wang et.al. | 2412.03791 | null |
2024-12-04 | Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars | John McConnell et.al. | 2412.03760 | null |
2024-12-04 | Bayesian Perspective for Orientation Estimation in Cryo-EM and Cryo-ET | Sheng Xu et.al. | 2412.03723 | null |
2024-12-04 | Mixed ‘t Hooft Anomalies and the Witten Effect for AdS Black Holes | Matthew Heydeman et.al. | 2412.03695 | null |
2024-12-04 | MV-Adapter: Multi-view Consistent Image Generation Made Easy | Zehuan Huang et.al. | 2412.03632 | null |
2024-12-04 | Style3D: Attention-guided Multi-view Style Transfer for 3D Object Generation | Bingjie Song et.al. | 2412.03571 | null |
2024-12-04 | Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis | Qitao Zhao et.al. | 2412.03570 | null |
2024-12-04 | MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation | Zehuan Huang et.al. | 2412.03558 | null |
2024-12-08 | Perception Tokens Enhance Visual Reasoning in Multimodal Language Models | Mahtab Bigverdi et.al. | 2412.03548 | null |
2024-12-04 | Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos | Hanxue Liang et.al. | 2412.03526 | null |
2024-12-04 | Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter | Hermes McGriff et.al. | 2412.03518 | null |
2024-12-04 | Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion | Shengyuan Zhang et.al. | 2412.03515 | link |
2024-12-04 | Distillation of Diffusion Features for Semantic Correspondence | Frank Fundel et.al. | 2412.03512 | null |
2024-12-04 | Stagnation points at grain contacts generate an elastic flow instability in 3D porous media | Emily Y. Chen et.al. | 2412.03510 | null |
2024-12-04 | Data Fusion of Semantic and Depth Information in the Context of Object Detection | Md Abu Yusuf et.al. | 2412.03490 | null |
2024-12-04 | Urban4D: Semantic-Guided 4D Gaussian Splatting for Urban Scene Reconstruction | Ziwen Li et.al. | 2412.03473 | null |
2024-12-04 | Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything | Yongkyu Lee et.al. | 2412.03472 | link |
2024-12-04 | Modeling of pattern formation of the ordered intermediate phases during co-deposition of binary thin film | Serhii Abakumov et.al. | 2412.03457 | null |
2024-12-04 | PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes | Bin Tan et.al. | 2412.03451 | null |
2024-12-04 | Spectral theory of effective transport for discrete uniaxial polycrystalline materials | N. Benjamin Murphy et.al. | 2412.03447 | null |
2024-12-04 | BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement | Miguel Arturo Vega Torres et.al. | 2412.03434 | link |
2024-12-04 | 2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction | Wanting Zhang et.al. | 2412.03428 | null |
2024-12-04 | Skel3D: Skeleton Guided Novel View Synthesis | Aron Fóthi et.al. | 2412.03407 | null |
2024-12-09 | MTVNet: Mapping using Transformers for Volumes – Network for Super-Resolution with Long-Range Interactions | August Leander Høeg et.al. | 2412.03379 | link |
2024-12-04 | Volumetrically Consistent 3D Gaussian Rasterization | Chinmay Talegaonkar et.al. | 2412.03378 | null |
2024-12-04 | SGSST: Scaling Gaussian Splatting StyleTransfer | Bruno Galerne et.al. | 2412.03371 | link |
2024-12-04 | MOVE: Multi-skill Omnidirectional Legged Locomotion with Limited View in 3D Environments | Songbo Li et.al. | 2412.03353 | null |
2024-12-04 | Magnetic properties and growth kinetics of Co/Gd bilayers with perpendicular magnetic anisotropy | T. J. Kools et.al. | 2412.03333 | null |
2024-12-04 | Fingering instability in dewetting capillary nanosuspensions | Lingyue Liu et.al. | 2412.03306 | null |
2024-12-07 | Magnetic Topology of quiet-Sun Ellerman bombs and associated Ultraviolet brightenings | Aditi Bhatnagar et.al. | 2412.03211 | null |
2024-12-04 | AffordDP: Generalizable Diffusion Policy with Transferable Affordance | Shijie Wu et.al. | 2412.03142 | null |
2024-12-04 | Splats in Splats: Embedding Invisible 3D Watermark within Gaussian Splatting | Yijia Guo et.al. | 2412.03121 | null |
2024-12-04 | MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction | Gangjian Zhang et.al. | 2412.03103 | null |
2024-12-04 | Lightweight Multiplane Images Network for Real-Time Stereoscopic Conversion from Planar Video | Shanding Diao et.al. | 2412.03102 | null |
2024-12-04 | RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videos | Yoonwoo Jeong et.al. | 2412.03077 | null |
2024-12-04 | CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning | Runjian Chen et.al. | 2412.03059 | null |
2024-12-07 | Point-GN: A Non-Parametric Network Using Gaussian Positional Encoding for Point Cloud Classification | Marzieh Mohammadi et.al. | 2412.03056 | link |
2024-12-04 | TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception | Runjian Chen et.al. | 2412.03054 | null |
2024-12-04 | Point-GR: Graph Residual Point Cloud Network for 3D Object Classification and Segmentation | Md Meraz et.al. | 2412.03052 | null |
2024-12-04 | Fan-Beam CT Reconstruction for Unaligned Sparse-View X-ray Baggage Dataset | Shin Kim et.al. | 2412.03036 | null |
2024-12-04 | ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics | Junchao Zhu et.al. | 2412.03026 | link |
2024-12-04 | Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations | Yu Feng et.al. | 2412.03011 | null |
2024-12-04 | gghic: A Versatile R Package for Exploring and Visualizing 3D Genome Organization | Minghao Jiang et.al. | 2412.03005 | link |
2024-12-04 | AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations? | Shouwei Ruan et.al. | 2412.03002 | null |
2024-12-04 | CLAS: A Machine Learning Enhanced Framework for Exploring Large 3D Design Datasets | XiuYu Zhang et.al. | 2412.02996 | null |
2024-12-04 | 3D Interaction Geometric Pre-training for Molecular Relational Learning | Namkyeong Lee et.al. | 2412.02957 | link |
2024-12-04 | Higher Order Transformers: Efficient Attention Mechanism for Tensor Structured Data | Soroush Omranpour et.al. | 2412.02919 | null |
2024-12-03 | ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts | Dmitry Petrov et.al. | 2412.02912 | null |
2024-12-03 | EgoCast: Forecasting Egocentric Human Pose in the Wild | Maria Escobar et.al. | 2412.02903 | null |
2024-12-03 | OriStitch: A Machine Embroidery Workflow to Turn Existing Fabrics into Self-Folding 3D Textiles | Zekun Chang et.al. | 2412.02891 | null |
2024-12-03 | A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration | Thulio Amorim et.al. | 2412.02881 | null |
2024-12-03 | Optimized CNNs for Rapid 3D Point Cloud Object Recognition | Tianyi Lyu et.al. | 2412.02855 | null |
2024-12-03 | Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects | Abdurrahman Zeybey et.al. | 2412.02803 | null |
2024-12-03 | Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks | Zijiao Yang et.al. | 2412.02795 | null |
2024-12-03 | Quaternion-based Unscented Kalman Filter for 6-DoF Vision-based Inertial Navigation in GPS-denied Regions | Khashayar Ghanizadegan et.al. | 2412.02768 | null |
2024-12-03 | The imprint of cosmic voids from the DESI Legacy Survey DR9 LRGs in the Planck 2018 lensing map through spectroscopically calibrated mocks | S. Sartori et.al. | 2412.02761 | null |
2024-12-03 | MVCTrack: Boosting 3D Point Cloud Tracking via Multimodal-Guided Virtual Cues | Zhaofeng Hu et.al. | 2412.02734 | link |
2024-12-03 | Self-Similar acoustic white hole solutions in Bose-Einstein condensates and their Borel analysis | Sachin Vaidya et.al. | 2412.02728 | null |
2024-12-03 | AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction | Lingteng Qiu et.al. | 2412.02684 | null |
2024-12-03 | Leveraging Tactile Sensing to Render both Haptic Feedback and Virtual Reality 3D Object Reconstruction in Robotic Telemanipulation | Gabriele Giudici et.al. | 2412.02644 | null |
2024-12-03 | Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation | Yiftach Edelstein et.al. | 2412.02631 | null |
2024-12-03 | Continual Learning of Personalized Generative Face Models with Experience Replay | Annie N. Wang et.al. | 2412.02627 | null |
2024-12-03 | MedTet: An Online Motion Model for 4D Heart Reconstruction | Yihong Chen et.al. | 2412.02589 | null |
2024-12-03 | LiDAR-based Registration against Georeferenced Models for Globally Consistent Allocentric Maps | Jan Quenzel et.al. | 2412.02533 | null |
2024-12-03 | Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark | Haidong Xu et.al. | 2412.02508 | link |
2024-12-03 | RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians | Qiankun Gao et.al. | 2412.02493 | link |
2024-12-03 | Controlling the interaction of tightly focused 10-PW class lasers with multicomponent plasma via target parameters: optimization of electron-positron pair and $γ$ -photon sources | A. V. Bashinov et.al. | 2412.02485 | null |
2024-12-03 | Analysis of axisymmetric necking of a circular dielectric membrane based on a one-dimensional model | Xiang Yu et.al. | 2412.02451 | null |
2024-12-03 | TimeWalker: Personalized Neural Space for Lifelong Head Avatars | Dongwei Pan et.al. | 2412.02421 | null |
2024-12-03 | 3D Face Reconstruction From Radar Images | Valentin Braeutigam et.al. | 2412.02403 | null |
2024-12-03 | RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation | Changli Wu et.al. | 2412.02402 | link |
2024-12-03 | Realistic Surgical Simulation from Monocular Videos | Kailing Wang et.al. | 2412.02359 | null |
2024-12-03 | Dual Exposure Stereo for Extended Dynamic Range 3D Imaging | Juhyung Choi et.al. | 2412.02351 | null |
2024-12-03 | Design of thermal meta-structures made of functionally graded materials using isogeometric density-based topology optimization | Chintan Jansari et.al. | 2412.02318 | null |
2024-12-03 | HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset | Zedong Chu et.al. | 2412.02317 | link |
2024-12-03 | An enhanced single Gaussian point continuum finite element formulation using automatic differentiation | Njomza Pacolli et.al. | 2412.02309 | null |
2024-12-03 | Partial Non-rigid Deformations and interpolations of Human Body Surfaces | Thomas Besnier et.al. | 2412.02306 | null |
2024-12-03 | Viewpoint Consistency in 3D Generation via Attention and CLIP Guidance | Qing Zhang et.al. | 2412.02287 | null |
2024-12-03 | GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos | Zhiyuan Chen et.al. | 2412.02267 | null |
2024-12-03 | Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance | Jing Zeng et.al. | 2412.02249 | null |
2024-12-04 | SparseLGS: Sparse View Language Embedded Gaussian Splatting | Jun Hu et.al. | 2412.02245 | null |
2024-12-03 | On Simplifying Large-Scale Spatial Vectors: Fast, Memory-Efficient, and Cost-Predictable k-means | Yushuai Ji et.al. | 2412.02244 | link |
2024-12-03 | CubeFormer: A Simple yet Effective Baseline for Lightweight Image Super-Resolution | Jikai Wang et.al. | 2412.02234 | null |
2024-12-03 | Construction of exact solutions of nonlinear PDE via dressing chain in 3D | I. T. Habibullin et.al. | 2412.02226 | null |
2024-12-03 | How to Use Diffusion Priors under Sparse Views? | Qisen Wang et.al. | 2412.02225 | link |
2024-12-03 | 3D Modular Microrobots: Micro-Origami Cubes with Integrated Si Chips Dive, Communicate, Flash Programs, and Form Collectives | Yeji Lee et.al. | 2412.02224 | null |
2024-12-03 | Higher symmetries of the lattices in 3D | I. T. Habibullin et.al. | 2412.02221 | null |
2024-12-03 | 3D representation in 512-Byte:Variational tokenizer is the key for autoregressive 3D generation | Jinzhi Zhang et.al. | 2412.02202 | null |
2024-12-03 | LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models | Fan-Yun Sun et.al. | 2412.02193 | null |
2024-12-03 | All Polyhedral Manifolds are Connected by a 2-Step Refolding | Lily Chung et.al. | 2412.02174 | null |
2024-12-03 | SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images | Junqiu Yu et.al. | 2412.02140 | null |
2024-12-03 | GSOT3D: Towards Generic 3D Single Object Tracking in the Wild | Yifan Jiao et.al. | 2412.02129 | link |
2024-11-29 | Streamlining Video Analysis for Efficient Violence Detection | Gourang Pathak et.al. | 2412.02127 | null |
2024-12-03 | Gaussian Object Carver: Object-Compositional Gaussian Splatting with surfaces completion | Liu Liu et.al. | 2412.02075 | link |
2024-12-03 | CLERF: Contrastive LEaRning for Full Range Head Pose Estimation | Ting-Ruen Wei et.al. | 2412.02066 | null |
2024-12-03 | Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable | Lizhen Xu et.al. | 2412.02054 | null |
2024-12-03 | FoveaSPAD: Exploiting Depth Priors for Adaptive and Efficient Single-Photon 3D Imaging | Justin Folden et.al. | 2412.02052 | null |
2024-12-02 | Mutli-View 3D Reconstruction using Knowledge Distillation | Aditya Dutt et.al. | 2412.02039 | link |
2024-12-02 | HybridMQA: Exploring Geometry-Texture Interactions for Colored Mesh Quality Assessment | Armin Shafiee Sarvestani et.al. | 2412.01986 | null |
2024-12-02 | MPBD-LSTM: A Predictive Model for Colorectal Liver Metastases Using Time Series Multi-phase Contrast-Enhanced CT Scans | Xueyang Li et.al. | 2412.01973 | link |
2024-12-02 | Learning a Filtered Backprojection Reconstruction Method for Photoacoustic Computed Tomography with Hemispherical Measurement Geometries | Panpan Chen et.al. | 2412.01971 | null |
2024-12-02 | Planar Gaussian Splatting | Farhad G. Zanjani et.al. | 2412.01931 | null |
2024-12-01 | Enhancing Brain Age Estimation with a Multimodal 3D CNN Approach Combining Structural MRI and AI-Synthesized Cerebral Blood Volume Data | Jordan Jomsky et.al. | 2412.01865 | null |
2024-11-29 | Volumetric Reconstruction of Prostatectomy Specimens from Histology | Tom Bisson et.al. | 2412.01855 | null |
2024-12-02 | World-consistent Video Diffusion with Explicit 3D Modeling | Qihang Zhang et.al. | 2412.01821 | null |
2024-12-02 | Occam’s LGS: A Simple Approach for Language Gaussian Splatting | Jiahuan Cheng et.al. | 2412.01807 | null |
2024-12-03 | SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation | Alexey Bokhovkin et.al. | 2412.01801 | null |
2024-12-02 | CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion | Kai He et.al. | 2412.01792 | null |
2024-12-06 | Robot Learning with Super-Linear Scaling | Marcel Torne et.al. | 2412.01770 | null |
2024-12-02 | Planning and Reasoning with 3D Deformable Objects for Hierarchical Text-to-3D Robotic Shaping | Alison Bartsch et.al. | 2412.01765 | null |
2024-12-02 | Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes | Lihan Jiang et.al. | 2412.01745 | null |
2024-12-02 | Forced 3D reconnection in an exponentially separating magnetic field | David N. Hosking et.al. | 2412.01736 | null |
2024-12-02 | HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous Driving | Hongyu Zhou et.al. | 2412.01718 | null |
2024-12-02 | Driving Scene Synthesis on Free-form Trajectories with Generative Prior | Zeyu Yang et.al. | 2412.01717 | null |
2024-12-02 | Discontinuous structural transitions in fluids with competing interactions | Ana M. Montero et.al. | 2412.01629 | null |
2024-12-02 | CRAYM: Neural Field Optimization via Camera RAY Matching | Liqiang Lin et.al. | 2412.01618 | null |
2024-12-02 | An Atlas for 3d Conformal Field Theories with a U(1) Global Symmetry | Samuel Bartlett-Tisdall et.al. | 2412.01608 | null |
2024-12-09 | 3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting | Ziyang Yan et.al. | 2412.01583 | null |
2024-12-02 | Physical Characteristics of Jupiter’s Trojan (1437) Diomedes from a Tri-chord Stellar Occultation in 2020 and Dimensionless 3D Model | H. Dutra et.al. | 2412.01568 | null |
2024-12-02 | A dynamic implicit 3D material point-to-rigid body contact approach for large deformation analysis | Robert E. Bird et.al. | 2412.01565 | null |
2024-12-02 | Tokenizing 3D Molecule Structure with Quantized Spherical Coordinates | Kaiyuan Gao et.al. | 2412.01564 | null |
2024-12-02 | SfM-Free 3D Gaussian Splatting via Hierarchical Training | Bo Ji et.al. | 2412.01553 | link |
2024-12-02 | SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model | Chunlin Yu et.al. | 2412.01550 | null |
2024-12-02 | 6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting | Yufeng Jin et.al. | 2412.01543 | null |
2024-12-02 | The Bare Necessities: Designing Simple, Effective Open-Vocabulary Scene Graphs | Christina Kassab et.al. | 2412.01539 | null |
2024-12-02 | HandOS: 3D Hand Reconstruction in One Stage | Xingyu Chen et.al. | 2412.01537 | null |
2024-12-02 | Structured 3D Latents for Scalable and Versatile 3D Generation | Jianfeng Xiang et.al. | 2412.01506 | link |
2024-12-02 | 3D Spine Shape Estimation from Single 2D DXA | Emmanuelle Bourigault et.al. | 2412.01504 | null |
2024-12-02 | Improving Object Detection by Modifying Synthetic Data with Explainable AI | Nitish Mital et.al. | 2412.01477 | null |
2024-12-02 | Semantic Scene Completion with Multi-Feature Data Balancing Network | Mona Alawadh et.al. | 2412.01431 | null |
2024-12-02 | MVImgNet2.0: A Larger-scale Dataset of Multi-view Images | Xiaoguang Han et.al. | 2412.01430 | null |
2024-12-03 | HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving | Zehuan Wu et.al. | 2412.01407 | null |
2024-12-02 | Holistic Understanding of 3D Scenes as Universal Scene Description | Anna-Maria Halacheva et.al. | 2412.01398 | null |
2024-12-02 | Enhancing multiscale simulations for spark plasma sintering with a novel Direct FE $^2$ framework | A. Kumar et.al. | 2412.01350 | null |
2024-12-02 | Physically Constrained 3D Diffusion for Inverse Design of Fiber-reinforced Polymer Composite Materials | Pei Xu et.al. | 2412.01321 | null |
2024-12-02 | Full 3D Model of Modulation Efficiency of Complementary Metal Oxide Semiconductor (CMOS) Compatible, Submicron, Interleaved Junction Optical Phase Shifters | Abdurrahman Javid Shaikh et.al. | 2412.01305 | null |
2024-12-02 | Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures | Qiyuan Shen et.al. | 2412.01299 | null |
2024-12-02 | LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences | Hongyan Zhi et.al. | 2412.01292 | link |
2024-12-02 | Global Estimation of Building-Integrated Facade and Rooftop Photovoltaic Potential by Integrating 3D Building Footprint and Spatio-Temporal Datasets | Qing Yu et.al. | 2412.01291 | link |
2024-12-02 | Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different Scenes | Xiaoqi Zhao et.al. | 2412.01240 | null |
2024-12-02 | Real-time Traffic Simulation and Management for Large-scale Urban Air Mobility: Integrating Route Guidance and Collision Avoidance | Canqiang Weng et.al. | 2412.01235 | null |
2024-12-04 | RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting | Zhenzhong Cao et.al. | 2412.01217 | link |
2024-12-02 | A Semantic Communication System for Real-time 3D Reconstruction Tasks | Jiaxing Zhang et.al. | 2412.01191 | null |
2024-12-02 | Dual-Branch Graph Transformer Network for 3D Human Mesh Reconstruction from Video | Tao Tang et.al. | 2412.01179 | link |
2024-12-02 | Rectified Flow For Structure Based Drug Design | Daiheng Zhang et.al. | 2412.01174 | null |
2024-12-02 | Object Agnostic 3D Lifting in Space and Time | Christopher Fusco et.al. | 2412.01166 | null |
2024-12-02 | Long-time 3D supernova simulations of non-rotating progenitors with magnetic fields | Bailey Sykes et.al. | 2412.01155 | null |
2024-12-02 | Dense Dispersed Structured Light for Hyperspectral 3D Imaging of Dynamic Scenes | Suhyun Shin et.al. | 2412.01140 | null |
2024-12-02 | Generating Freeform Endoskeletal Robots | Muhan Li et.al. | 2412.01036 | null |
2024-12-01 | ESCAPE: Equivariant Shape Completion via Anchor Point Encoding | Burak Bekci et.al. | 2412.00952 | null |
2024-12-01 | FIction: 4D Future Interaction Prediction from Video | Kumar Ashutosh et.al. | 2412.00932 | null |
2024-12-03 | Tomographic SAR Reconstruction for Forest Height Estimation | Grace Colverd et.al. | 2412.00903 | null |
2024-12-01 | Playable Game Generation | Mingyu Yang et.al. | 2412.00887 | link |
2024-12-01 | DynSUP: Dynamic Gaussian Splatting from An Unposed Image Pair | Weihang Li et.al. | 2412.00851 | null |
2024-12-01 | Some properties of general- $λ$ -matrix polynomials: an umbral approach | Ghazala Yasmin et.al. | 2412.00844 | null |
2024-12-01 | AniMer: Animal Pose and Shape Estimation Using Family Aware Transformer | Jin Lyu et.al. | 2412.00837 | null |
2024-12-01 | VR-Doh: Hands-on 3D Modeling in Virtual Reality | Zhaofeng Luo et.al. | 2412.00814 | null |
2024-12-01 | TSUBF-Net: Trans-Spatial UNet-like Network with Bi-direction Fusion for Segmentation of Adenoid Hypertrophy in CT | Rulin Zhou et.al. | 2412.00787 | null |
2024-12-01 | 3D-PDR Orion dataset and NeuralPDR: Neural Differential Equations for Photodissociation Regions | Gijs Vermariën et.al. | 2412.00758 | link |
2024-12-01 | CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images | Jian Liu et.al. | 2412.00754 | null |
2024-12-01 | ChatSplat: 3D Conversational Gaussian Splatting | Hanlin Chen et.al. | 2412.00734 | null |
2024-12-05 | Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks | Jiahao Cui et.al. | 2412.00733 | link |
2024-12-01 | Refine3DNet: Scaling Precision in 3D Object Reconstruction from Multi-View RGB Images using Attention | Ajith Balakrishnan et.al. | 2412.00731 | null |
2024-12-01 | SEED4D: A Synthetic Ego–Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark | Marius Kästingschäfer et.al. | 2412.00730 | link |
2024-12-01 | GenTact Toolbox: A Computational Design Pipeline to Procedurally Generate Context-Driven 3D Printed Whole-Body Tactile Skins | Carson Kohlbrenner et.al. | 2412.00711 | null |
2024-12-01 | Photoacoustic Iterative Optimization Algorithm with Shape Prior Regularization | Yu Zhang et.al. | 2412.00705 | null |
2024-12-04 | Numerical Analysis of Cavitation Dynamics on Free Ogee Spillways Using the Volume of Fluid (VOF) Method | Parvaneh Nikrou et.al. | 2412.00695 | null |
2024-12-01 | BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird’s-Eye View | Yizhou Wang et.al. | 2412.00692 | null |
2024-12-01 | A Machine Learning Approach to Contact Localization in Variable Density Three-Dimensional Tactile Artificial Skin | Carson Kohlbrenner et.al. | 2412.00689 | link |
2024-12-01 | FlashSLAM: Accelerated RGB-D SLAM for Real-Time 3D Scene Reconstruction with Gaussian Splatting | Phu Pham et.al. | 2412.00682 | null |
2024-12-01 | FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation | Yunpeng Bai et.al. | 2412.00671 | null |
2024-12-01 | A Lesson in Splats: Teacher-Guided Diffusion for 3D Gaussian Splats Generation with 2D Supervision | Chensheng Peng et.al. | 2412.00623 | null |
2024-11-30 | CAT-ORA: Collision-Aware Time-Optimal Formation Reshaping for Efficient Robot Coordination in 3D Environments | Vit Kratky et.al. | 2412.00603 | null |
2024-11-30 | Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives | Alex Hanson et.al. | 2412.00578 | link |
2024-11-30 | Multi-resolution Guided 3D GANs for Medical Image Translation | Juhyung Ha et.al. | 2412.00575 | link |
2024-11-30 | Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects | Amir Barda et.al. | 2412.00518 | null |
2024-11-30 | Energy-Based Prior Latent Space Diffusion model for Reconstruction of Lumbar Vertebrae from Thick Slice MRI | Yanke Wang et.al. | 2412.00511 | link |
2024-11-30 | How Fitts’ Fits in 3D: A Tangible Twist on Spatial Tasks | Faith Griffin et.al. | 2412.00506 | null |
2024-11-30 | Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding | Duo Zheng et.al. | 2412.00493 | link |
2024-11-30 | Density-aware Global-Local Attention Network for Point Cloud Segmentation | Chade Li et.al. | 2412.00489 | null |
2024-11-30 | LineGS : 3D Line Segment Representation on 3D Gaussian Splatting | Chenggang Yang et.al. | 2412.00477 | link |
2024-11-30 | Hard-Label Black-Box Attacks on 3D Point Clouds | Daizong Liu et.al. | 2412.00404 | null |
2024-11-30 | DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses | Yatian Pang et.al. | 2412.00397 | null |
2024-11-30 | ARMOR: Egocentric Perception for Humanoid Robot Collision Avoidance and Motion Planning | Daehwa Kim et.al. | 2412.00396 | null |
2024-11-30 | GradiSeg: Gradient-Guided Gaussian Segmentation with Enhanced 3D Boundary Precision | Zehao Li et.al. | 2412.00392 | null |
2024-12-05 | Gaussians on their Way: Wasserstein-Constrained 4D Gaussian Splatting with State-Space Modeling | Junli Deng et.al. | 2412.00333 | null |
2024-11-29 | Computing the multimodal stochastic dynamics of a nanobeam in a viscous fluid | J. Barbish et.al. | 2412.00258 | null |
2024-11-29 | Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction | Shaoxiang Wang et.al. | 2412.00242 | null |
2024-11-29 | String theory and the SymTFT of 3d orthosymplectic Chern-Simons theory | Oren Bergman et.al. | 2412.00184 | null |
2024-11-29 | SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters | Jianping Jiang et.al. | 2412.00174 | null |
2024-11-29 | AerialGo: Walking-through City View Generation from Aerial Perspectives | Fuqiang Zhao et.al. | 2412.00157 | null |
2024-11-29 | T-3DGS: Removing Transient Objects for 3D Scene Reconstruction | Vadim Pryadilshchikov et.al. | 2412.00155 | null |
2024-11-28 | Differentiable Topology Estimating from Curvatures for 3D Shapes | Yihao Luo et.al. | 2412.00140 | null |
2024-11-28 | Unleashing the Power of Data Synthesis in Visual Localization | Sihang Li et.al. | 2412.00138 | null |
2024-11-28 | Demographic Predictability in 3D CT Foundation Embeddings | Guangyao Zheng et.al. | 2412.00110 | link |
2024-11-27 | MLLM-Search: A Zero-Shot Approach to Finding People using Multimodal Large Language Models | Angus Fung et.al. | 2412.00103 | null |
2024-11-27 | Graph Canvas for Controllable 3D Scene Generation | Libin Liu et.al. | 2412.00091 | null |
2024-11-25 | Enhanced Lung Cancer Survival Prediction using Semi-Supervised Pseudo-Labeling and Learning from Diverse PET/CT Datasets | Mohammad R. Salmanpour et.al. | 2412.00068 | null |
2024-11-29 | AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos | Yuze He et.al. | 2411.19950 | null |
2024-11-29 | Spatio-Temporal Energy Cascade in Three-Dimensional Magnetohydrodynamic Turbulence | Giuseppe Arrò et.al. | 2411.19927 | null |
2024-11-29 | Traction force microscopy for linear and nonlinear elastic materials as a parameter identification inverse problem | Gesa Sarnighausen et.al. | 2411.19917 | null |
2024-11-29 | $C^{3}$ -NeRF: Modeling Multiple Scenes via Conditional-cum-Continual Neural Radiance Fields | Prajwal Singh et.al. | 2411.19903 | null |
2024-12-02 | GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting | Zixuan Chen et.al. | 2411.19895 | link |
2024-11-29 | SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Philipp Wolters et.al. | 2411.19860 | null |
2024-11-29 | Gravity’s role in taming the Tayler instability in red giant cores | Domenico G. Meduri et.al. | 2411.19849 | null |
2024-12-05 | SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens | Chi Su et.al. | 2411.19824 | null |
2024-11-29 | Deterministic many-body dynamics with multifractal response | Yusuf Kasim et.al. | 2411.19779 | null |
2024-11-29 | PerLA: Perceptive 3D Language Assistant | Guofeng Mei et.al. | 2411.19774 | null |
2024-11-29 | DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering | Yihao Wang et.al. | 2411.19756 | null |
2024-11-29 | Regularity properties of a generalized Oseen evolution operator in exterior domains, with applications to the Navier-Stokes initial value problem | Yosuke Asami et.al. | 2411.19711 | null |
2024-11-29 | Observation of a non-reciprocal skyrmion Hall effect of hybrid chiral skyrmion tubes in synthetic antiferromagnetic multilayers | Takaaki Dohi et.al. | 2411.19698 | null |
2024-11-29 | Inverse Design of Mechanical Metamaterials Using a Point-Cloud-Based Deep Generative Model | Seungwook Hong et.al. | 2411.19681 | null |
2024-11-29 | TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting | Bojun Xiong et.al. | 2411.19654 | link |
2024-11-29 | GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding | Yawen Shao et.al. | 2411.19626 | link |
2024-11-29 | Tortho-Gaussian: Splatting True Digital Orthophoto Maps | Xin Wang et.al. | 2411.19594 | null |
2024-11-29 | Self-Supervised Denoiser Framework | Emilien Valat et.al. | 2411.19593 | null |
2024-11-29 | Gaussian Splashing: Direct Volumetric Rendering Underwater | Nir Mualem et.al. | 2411.19588 | null |
2024-11-29 | Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Wenbo Zhang et.al. | 2411.19551 | null |
2024-11-29 | Haldane phase, field-induced magnetic ordering and Tomonaga-Luttinger liquid behavior in a spin-one chain compound NiC $_2$O$_4$$\cdot$2NH$_3$ | Shuo Li et.al. | 2411.19538 | null |
2024-11-29 | Subjective and Objective Quality Assessment Methods of Stereoscopic Videos with Visibility Affecting Distortions | Sria Biswas et.al. | 2411.19522 | null |
2024-11-29 | Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling | Qirui Wu et.al. | 2411.19492 | null |
2024-11-29 | Blurred LiDAR for Sharper 3D: Robust Handheld 3D Scanning with Diffuse LiDAR and RGB | Nikhil Behari et.al. | 2411.19474 | null |
2024-11-29 | Robust Bayesian Scene Reconstruction by Leveraging Retrieval-Augmented Priors | Herbert Wright et.al. | 2411.19461 | null |
2024-11-29 | Fleximo: Towards Flexible Text-to-Human Motion Video Generation | Yuhang Zhang et.al. | 2411.19459 | null |
2024-11-29 | Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning | Yang You et.al. | 2411.19458 | link |
2024-12-02 | GausSurf: Geometry-Guided 3D Gaussian Splatting for Surface Reconstruction | Jiepeng Wang et.al. | 2411.19454 | null |
2024-11-29 | Achromatic single-layer hologram | Zhi Li et.al. | 2411.19445 | null |
2024-11-29 | qlbm – A Quantum Lattice Boltzmann Software Framework | Călin Andrei Georgescu et.al. | 2411.19439 | link |
2024-11-29 | RF-3DGS: Wireless Channel Modeling with Radio Radiance Field and 3D Gaussian Splatting | Lihao Zhang et.al. | 2411.19420 | link |
2024-11-28 | Atom probe composition and in situ electronic structure of epitaxial quantum dot ensembles | Christopher Natale et.al. | 2411.19414 | null |
2024-11-28 | Fine-Tuning Magnetism in CrI $_3$ Monolayers and Bilayers: A DFT+U Approach | Diego Lauer et.al. | 2411.19357 | null |
2024-11-28 | 3D Wasserstein generative adversarial network with dense U-Net based discriminator for preclinical fMRI denoising | Sima Soltanpour et.al. | 2411.19345 | null |
2024-11-28 | Tayler-Spruit dynamo in binary neutron star merger remnants | Alexis Reboul-Salze et.al. | 2411.19328 | null |
2024-11-28 | SAMa: Material-aware 3D Selection and Segmentation | Michael Fischer et.al. | 2411.19322 | null |
2024-11-28 | UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation | Yichong Lu et.al. | 2411.19292 | null |
2024-11-28 | SADG: Segment Any Dynamic Gaussian Without Object Trackers | Yun-Jin Li et.al. | 2411.19290 | link |
2024-11-28 | Connection between Free-Fermion and Interacting Crystalline Symmetry-Protected Topological Phases | Chen-Shen Lee et.al. | 2411.19287 | null |
2024-11-28 | AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones | Xuqian Ren et.al. | 2411.19271 | null |
2024-11-28 | InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception | Haijie Li et.al. | 2411.19235 | null |
2024-11-28 | Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes | Thomas Wimmer et.al. | 2411.19233 | link |
2024-11-28 | Shock-capturing particle hydrodynamics with reproducing kernels | S. Rosswog et.al. | 2411.19228 | null |
2024-12-02 | Differentiable Voxel-based X-ray Rendering Improves Sparse-View 3D CBCT Reconstruction | Mohammadhossein Momeni et.al. | 2411.19224 | link |
2024-11-28 | Track Anything Behind Everything: Zero-Shot Amodal Video Object Segmentation | Finlay G. C. Hudson et.al. | 2411.19210 | null |
2024-11-28 | Video Depth without Video Models | Bingxin Ke et.al. | 2411.19189 | null |
2024-11-28 | HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos | Prithviraj Banerjee et.al. | 2411.19167 | null |
2024-11-28 | Lost & Found: Updating Dynamic 3D Scene Graphs from Egocentric Observations | Tjark Behrens et.al. | 2411.19162 | link |
2024-11-28 | Neural Shadow Art | Caoliwen Wang et.al. | 2411.19161 | null |
2024-11-28 | Counting Stacked Objects from Multi-View Images | Corentin Dumery et.al. | 2411.19149 | null |
2024-11-28 | On Moving Object Segmentation from Monocular Video with Transformers | Christian Homeyer et.al. | 2411.19141 | null |
2024-11-28 | 360Recon: An Accurate Reconstruction Method Based on Depth Fusion from 360 Images | Zhongmiao Yan et.al. | 2411.19102 | null |
2024-11-28 | 3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes | Tejaswini Medi et.al. | 2411.19037 | null |
2024-11-28 | Axisymmetric model for 1-color laser filament THz emission | Sean D. McGuire et.al. | 2411.18963 | null |
2024-11-28 | Designing an Optimal Scoop for Holloman High-Speed Test Track Water Braking Mechanism using Computational Fluid Dynamics | Jose A. Terrazas et.al. | 2411.18939 | null |
2024-11-28 | Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects | Weimin Qiu et.al. | 2411.18936 | null |
2024-11-28 | Planning Shorter Paths in Graphs of Convex Sets by Undistorting Parametrized Configuration Spaces | Shruti Garg et.al. | 2411.18913 | null |
2024-11-28 | Textured As-Is BIM via GIS-informed Point Cloud Segmentation | Mohamed S. H. Alabassy et.al. | 2411.18898 | null |
2024-11-28 | RIGI: Rectifying Image-to-3D Generation Inconsistency via Uncertainty-aware Learning | Jiacheng Wang et.al. | 2411.18866 | null |
2024-11-28 | CrossTracker: Robust Multi-modal 3D Multi-Object Tracking via Cross Correction | Lipeng Gu et.al. | 2411.18850 | null |
2024-11-27 | Light-induced Orbital and Spin Magnetism in $3d$, $4d$, and $5d$ Transition Metals | Theodoros Adamantopoulos et.al. | 2411.18815 | null |
2024-11-27 | Lifting Motion to the 3D World via 2D Diffusion | Jiaman Li et.al. | 2411.18808 | null |
2024-11-27 | Reconstructing Animals and the Wild | Peter Kulits et.al. | 2411.18807 | null |
2024-11-27 | MRI Breast tissue segmentation using nnU-Net for biomechanical modeling | Melika Pooyan et.al. | 2411.18784 | null |
2024-11-27 | Direct comparison of the energization of self-consistent charged particles vs test particles in a turbulent plasma | Facundo Pugliese et.al. | 2411.18771 | null |
2024-11-27 | GaussianSpeech: Audio-Driven Gaussian Avatars | Shivangi Aneja et.al. | 2411.18675 | null |
2024-12-02 | AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers | Sherwin Bahmani et.al. | 2411.18673 | null |
2024-11-27 | Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting | Hao Liu et.al. | 2411.18667 | null |
2024-11-27 | 3D Scene Graph Guided Vision-Language Pre-training | Hao Liu et.al. | 2411.18666 | null |
2024-11-27 | Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling | Junha Hyung et.al. | 2411.18664 | null |
2024-11-27 | OOD-HOI: Text-Driven 3D Whole-Body Human-Object Interactions Generation Beyond Training Domains | Yixuan Zhang et.al. | 2411.18660 | null |
2024-11-26 | Scene Co-pilot: Procedural Text to Video Generation with Human in the Loop | Zhaofang Qian et.al. | 2411.18644 | null |
2024-11-27 | Textured Gaussians for Enhanced 3D Scene Appearance Modeling | Brian Chao et.al. | 2411.18625 | null |
2024-11-27 | GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data | Wentao Wang et.al. | 2411.18624 | null |
2024-11-27 | Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation | Yueru Jia et.al. | 2411.18623 | null |
2024-11-27 | CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models | Rundi Wu et.al. | 2411.18613 | null |
2024-11-27 | Building Confidence in Deep Generative Protein Design | Tianyuan Zheng et.al. | 2411.18568 | link |
2024-11-27 | PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image | Han Yan et.al. | 2411.18548 | null |
2024-11-27 | AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans | Dillon Loh et.al. | 2411.18539 | link |
2024-11-27 | A comparison of extended object tracking with multi-modal sensors in indoor environment | Jiangtao Shuai et.al. | 2411.18476 | null |
2024-11-27 | HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression | Lei Liu et.al. | 2411.18473 | null |
2024-11-27 | Neural Image Unfolding: Flattening Sparse Anatomical Structures using Neural Fields | Leonhard Rist et.al. | 2411.18415 | null |
2024-11-27 | A new proof of nonlinear Landau damping for the 3D Vlasov-Poisson system near Poisson equilibrium | Quoc-Hung Nguyen et.al. | 2411.18408 | null |
2024-11-27 | Near-field acoustic imaging with a caged bubble | Dorian Bouchet et.al. | 2411.18386 | null |
2024-11-27 | G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation | Tianxing Chen et.al. | 2411.18369 | null |
2024-11-27 | Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation | Mehdi Zayene et.al. | 2411.18335 | link |
2024-11-27 | Neural Surface Priors for Editable Gaussian Splatting | Jakub Szymkowiak et.al. | 2411.18311 | link |
2024-11-27 | MvKeTR: Chest CT Report Generation with Multi-View Perception and Knowledge Enhancement | Xiwei Deng et.al. | 2411.18309 | null |
2024-11-27 | Leveraging Semantic Asymmetry for Precise Gross Tumor Volume Segmentation of Nasopharyngeal Carcinoma in Planning CT | Zi Li et.al. | 2411.18290 | null |
2024-11-27 | GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation | Wenbo Cui et.al. | 2411.18276 | null |
2024-11-27 | Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters | Zhiyang Guo et.al. | 2411.18197 | null |
2024-11-27 | Rediscovering the Milky Way with orbit superposition approach and APOGEE data III. Panoramic view of the bulge | Sergey Khoperskov et.al. | 2411.18182 | null |
2024-11-27 | Online Knowledge Integration for 3D Semantic Mapping: A Survey | Felix Igelbrink et.al. | 2411.18147 | null |
2024-11-27 | ModeDreamer: Mode Guiding Score Distillation for Text-to-3D Generation using Reference Image Prompts | Uy Dieu Tran et.al. | 2411.18135 | null |
2024-11-27 | Towards Cross-device and Training-free Robotic Grasping in 3D Open World | Weiguang Zhao et.al. | 2411.18133 | null |
2024-11-27 | Influence of Critical Current Distribution on Operation, Quench Detection and Protection of HTS Pancake Coils | Mariusz Wozniak et.al. | 2411.18124 | null |
2024-11-27 | Loss-driven miniaturized bound state in continuum biosensing system | Jiacheng Sun et.al. | 2411.18110 | null |
2024-11-27 | Development and experimental validation of an in-house treatment planning system with greedy energy layer optimization for fast IMPT | Aoxiang Wang et.al. | 2411.18074 | null |
2024-11-27 | SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images | Yanyan Li et.al. | 2411.18072 | null |
2024-11-27 | PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion | Gwanghyun Kim et.al. | 2411.18068 | null |
2024-11-27 | GLS: Geometry-aware 3D Language Gaussian Splatting | Jiaxiong Qiu et.al. | 2411.18066 | link |
2024-11-27 | Mortality Prediction of Pulmonary Embolism Patients with Deep Learning and XGBoost | Yalcin Tur et.al. | 2411.18063 | null |
2024-11-27 | SymTFT Approach to 2D Orbifold Groupoids: `t Hooft Anomalies, Gauging, and Partition Functions | Jin Chen et.al. | 2411.18056 | null |
2024-12-02 | Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision | Jinnyeong Kim et.al. | 2411.18025 | null |
2024-11-27 | Persistent breather and dynamical symmetry in a unitary Fermi gas | Dali Sun et.al. | 2411.18022 | null |
2024-11-27 | Manual-PA: Learning 3D Part Assembly from Instruction Diagrams | Jiahao Zhang et.al. | 2411.18011 | null |
2024-11-27 | HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction | Wei Zhang et.al. | 2411.17982 | link |
2024-11-26 | MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation | Sankalp Sinha et.al. | 2411.17945 | link |
2024-11-26 | CAMLD: Contrast-Agnostic Medical Landmark Detection with Consistency-Based Regularization | Soorena Salari et.al. | 2411.17845 | null |
2024-11-26 | Actuation of Cell Sheets in 3D | Kirsten Endresen et.al. | 2411.17834 | null |
2024-11-26 | Signs as Tokens: An Autoregressive Multilingual Sign Language Generator | Ronglai Zuo et.al. | 2411.17799 | null |
2024-11-26 | DapPep: Domain Adaptive Peptide-agnostic Learning for Universal T-cell Receptor-antigen Binding Affinity Prediction | Jiangbin Zheng et.al. | 2411.17798 | null |
2024-11-26 | Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors | Ziang Xu et.al. | 2411.17790 | null |
2024-12-01 | Geometric Point Attention Transformer for 3D Shape Reassembly | Jiahan Li et.al. | 2411.17788 | null |
2024-12-02 | MVBoost: Boost 3D Reconstruction with Multi-View Refinement | Xiangyu Liu et.al. | 2411.17772 | null |
2024-11-26 | Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation | Xiang Li et.al. | 2411.17763 | null |
2024-11-26 | OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection | Zhongyu Xia et.al. | 2411.17761 | link |
2024-11-23 | SnapMem: Snapshot-based 3D Scene Memory for Embodied Exploration and Reasoning | Yuncong Yang et.al. | 2411.17735 | null |
2024-11-29 | DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting | Christian Homeyer et.al. | 2411.17660 | link |
2024-11-26 | Quantum Wave Simulation with Sources and Loss Functions | Cyrill Bösch et.al. | 2411.17630 | link |
2024-11-26 | Distractor-free Generalizable 3D Gaussian Splatting | Yanqi Bao et.al. | 2411.17605 | link |
2024-11-26 | Dimming events of evolved stars due to clouds of molecular gas. Scenarios based on 3D radiation-hydrodynamics simulations with CO5BOLD | Bernd Freytag et.al. | 2411.17561 | null |
2024-11-26 | BESTAnP: Bi-Step Efficient and Statistically Optimal Estimator for Acoustic-n-Point Problem | Wenliang Sheng et.al. | 2411.17521 | link |
2024-11-29 | SuperMat: Physically Consistent PBR Material Estimation at Interactive Rates | Yijia Hong et.al. | 2411.17515 | null |
2024-11-26 | Puzzle Similarity: A Perceptually-guided No-Reference Metric for Artifact Detection in 3D Scene Reconstructions | Nicolai Hermann et.al. | 2411.17489 | null |
2024-11-25 | Probing the Mid-level Vision Capabilities of Self-Supervised Learning | Xuweiyi Chen et.al. | 2411.17474 | null |
2024-11-25 | Learning 3D Representations from Procedural 3D Programs | Xuweiyi Chen et.al. | 2411.17467 | null |
2024-11-26 | Spatially Visual Perception for End-to-End Robotic Learning | Travis Davies et.al. | 2411.17458 | null |
2024-11-26 | Object-centric proto-symbolic behavioural reasoning from pixels | Ruben van Bergen et.al. | 2411.17438 | link |
2024-11-26 | DRiVE: Diffusion-based Rigging Empowers Generation of Versatile and Expressive Characters | Mingze Sun et.al. | 2411.17423 | link |
2024-11-26 | Gas dynamics around a Jupiter mass planet: II. Chemical evolution of circumplanetary material | Alex J. Cridland et.al. | 2411.17408 | null |
2024-11-26 | NumGrad-Pull: Numerical Gradient Guided Tri-plane Representation for Surface Reconstruction from Point Clouds | Ruikai Cui et.al. | 2411.17392 | link |
2024-11-26 | vesselFM: A Foundation Model for Universal 3D Blood Vessel Segmentation | Bastian Wittmann et.al. | 2411.17386 | link |
2024-11-26 | SIL-RRT*: Learning Sampling Distribution through Self Imitation Learning | Xuzhe Dang et.al. | 2411.17293 | null |
2024-11-26 | MiceBoneChallenge: Micro-CT public dataset and six solutions for automatic growth plate detection in micro-CT mice bone scans | Nikolay Burlutskiy et.al. | 2411.17260 | null |
2024-11-26 | Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration | Junyuan Deng et.al. | 2411.17240 | link |
2024-11-26 | MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields | Yixiong Yang et.al. | 2411.17235 | link |
2024-11-26 | cWDM: Conditional Wavelet Diffusion Models for Cross-Modality 3D Medical Image Synthesis | Paul Friedrich et.al. | 2411.17203 | link |
2024-11-28 | SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting | Gyeongjin Kang et.al. | 2411.17190 | null |
2024-11-28 | PhysMotion: Physics-Grounded Dynamics From a Single Image | Xiyang Tan et.al. | 2411.17189 | null |
2024-11-26 | Unveiling New Mechanical Couplings in 3D Lattices: Axial-Bending and the Role of Symmetry Breaking | Dijia Zhong et.al. | 2411.17142 | null |
2024-11-26 | Geometry Field Splatting with Gaussian Surfels | Kaiwen Jiang et.al. | 2411.17067 | null |
2024-11-26 | 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction | Woong Oh Cho et.al. | 2411.17044 | null |
2024-11-26 | g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks | Zihan Wang et.al. | 2411.17030 | link |
2024-11-26 | SatVision-TOA: A Geospatial Foundation Model for Coarse-Resolution All-Sky Remote Sensing Imagery | Caleb S. Spradlin et.al. | 2411.17000 | link |
2024-11-25 | Improving Deformable Image Registration Accuracy through a Hybrid Similarity Metric and CycleGAN Based Auto-Segmentation | Keyur D. Shah et.al. | 2411.16992 | null |
2024-11-25 | G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs | Kunyi Li et.al. | 2411.16898 | null |
2024-11-25 | PreF3R: Pose-Free Feed-Forward 3D Gaussian Splatting from Variable-length Image Sequence | Zequn Chen et.al. | 2411.16877 | null |
2024-11-25 | Rediscovering the Milky Way with orbit superposition approach and APOGEE data II. Chrono-chemo-kinematics of the disc | Sergey Khoperskov et.al. | 2411.16866 | null |
2024-11-25 | Critical Condition of Core-Collapse Supernovae I: One Dimensional Models | David Pochik et.al. | 2411.16857 | null |
2024-11-27 | SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE | Yongwei Chen et.al. | 2411.16856 | null |
2024-11-25 | Open Vocabulary Monocular 3D Object Detection | Jin Yao et.al. | 2411.16833 | link |
2024-11-27 | DetailGen3D: Generative 3D Geometry Enhancement via Data-Dependent Flow | Ken Deng et.al. | 2411.16820 | null |
2024-11-27 | SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving | Georg Hess et.al. | 2411.16816 | link |
2024-11-25 | Abnormality-Driven Representation Learning for Radiology Imaging | Marta Ligero et.al. | 2411.16803 | null |
2024-11-27 | Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image | Jiajing Lin et.al. | 2411.16800 | null |
2024-11-25 | From Diffusion to Resolution: Leveraging 2D Diffusion Models for 3D Super-Resolution Task | Bohao Chen et.al. | 2411.16792 | null |
2024-11-25 | MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Vladimir Yugay et.al. | 2411.16785 | null |
2024-11-25 | UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing | Yiheng Li et.al. | 2411.16781 | link |
2024-11-25 | NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model | Jinpeng Liu et.al. | 2411.16779 | null |
2024-11-27 | MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing | Feifei Shao et.al. | 2411.16773 | null |
2024-11-25 | GAST: Sequential Gaussian Avatars with Hierarchical Spatio-temporal Context | Wangze Xu et.al. | 2411.16768 | null |
2024-11-24 | Bundle Adjusted Gaussian Avatars Deblurring | Muyao Niu et.al. | 2411.16758 | null |
2024-11-24 | PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation | Ziyao Zeng et.al. | 2411.16750 | null |
2024-11-23 | DiM-Gestor: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 | Fan Zhang et.al. | 2411.16729 | null |
2024-11-22 | TPIE: Topology-Preserved Image Editing With Text Instructions | Nivetha Jayakumar et.al. | 2411.16714 | null |
2024-11-25 | The impact of resistivity on the variability of black hole accretion flows | Antonios Nathanail et.al. | 2411.16684 | null |
2024-11-25 | Quark: Real-time, High-resolution, and General Neural View Synthesis | John Flynn et.al. | 2411.16680 | null |
2024-11-25 | Emergence of the 3D diluted Ising model universality class in a mixture of two magnets | J. J. Ruiz-Lorenzo et.al. | 2411.16659 | null |
2024-11-25 | DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation | Zun Wang et.al. | 2411.16657 | null |
2024-11-25 | Exploring Discrete Flow Matching for 3D De Novo Molecule Generation | Ian Dunn et.al. | 2411.16644 | link |
2024-11-25 | Automated Registration of 3D Neurovascular Territory Atlas to 2D DSA for Targeted Quantitative Angiography Analysis | George Dimopoulos et.al. | 2411.16637 | null |
2024-11-25 | Finite-difference compatible entropy-conserving schemes for the compressible Euler equations | Carlo De Michele et.al. | 2411.16621 | null |
2024-11-25 | RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics | Chan Hee Song et.al. | 2411.16537 | null |
2024-11-25 | Forecasting Shock-associated Energetic Particle Intensities in the Inner Heliosphere: A Proof-of-Concept Capability for the PUNCH Mission | Maher A. Dayeh et.al. | 2411.16510 | null |
2024-11-25 | Safety-Critical Controller Synthesis with Reduced-Order Models | Max H. Cohen et.al. | 2411.16479 | null |
2024-11-25 | Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency | Yutong Wang et.al. | 2411.16468 | link |
2024-11-25 | SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis | Hyojun Go et.al. | 2411.16443 | link |
2024-11-25 | Generalizable Deep Learning Approach for 3D Particle Imaging using Holographic Microscopy | Shyam Kumar et.al. | 2411.16439 | null |
2024-11-25 | Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction | Ziyu Zhang et.al. | 2411.16392 | null |
2024-11-25 | Solaris: A Foundation Model of the Sun | Harris Abdul Majid et.al. | 2411.16339 | null |
2024-11-25 | Parameter Error Analysis for the 3D Modified Leray-alpha Model: Analytical and Numerical Approaches | Débora A. F. Albanez et.al. | 2411.16324 | null |
2024-11-26 | CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation | Leon Sick et.al. | 2411.16319 | null |
2024-12-02 | Monocular Lane Detection Based on Deep Learning: A Survey | Xin He et.al. | 2411.16316 | link |
2024-11-26 | Functionality understanding and segmentation in 3D scenes | Jaime Corsetti et.al. | 2411.16310 | null |
2024-11-27 | An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models | Wentao Qu et.al. | 2411.16308 | link |
2024-11-26 | Internal motion of soft granular particles under circular shearing: Rate-dependent quaking and its spatial structure | Jr-Jun Lin et.al. | 2411.16293 | null |
2024-11-25 | Utilizing Uncertainty in 2D Pose Detectors for Probabilistic 3D Human Mesh Recovery | Tom Wehrbein et.al. | 2411.16289 | link |
2024-11-25 | Open-Vocabulary Octree-Graph for 3D Scene Understanding | Zhigang Wang et.al. | 2411.16253 | null |
2024-11-25 | Non-linear saturation and energy transport in global simulations of magneto-thermal turbulence in the stratified intracluster medium | Jean M. Kempf et.al. | 2411.16242 | null |
2024-11-25 | Fabrication of a 3D mode size converter for efficient edge coupling in photonic integrated circuits | Hyeong-Soon Jang et.al. | 2411.16221 | null |
2024-11-25 | Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation | Qiao Yu et.al. | 2411.16185 | link |
2024-11-25 | Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking | Phuc Nguyen et.al. | 2411.16183 | null |
2024-11-25 | Event-boosted Deformable 3D Gaussians for Fast Dynamic Scene Reconstruction | Wenhao Xu et.al. | 2411.16180 | null |
2024-11-26 | MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model | Chenjie Cao et.al. | 2411.16157 | link |
2024-11-25 | Revisiting Marr in Face: The Building of 2D–2.5D–3D Representations in Deep Neural Networks | Xiangyu Zhu et.al. | 2411.16148 | null |
2024-11-25 | Ro-vibrational quenching calculations of C $_2^-$ in collision with H$_2$ | Kousik Giri et.al. | 2411.16137 | null |
2024-11-25 | Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion | Jongseong Bae et.al. | 2411.16129 | null |
2024-11-25 | Collaboration in Virtual Reality: Survey and Perspectives | Ourania Koutzampasopoulou Xanthidou et.al. | 2411.16124 | null |
2024-11-25 | Forest Biomass Mapping with Terrestrial Hyperspectral Imaging for Wildfire Risk Monitoring | Nathaniel Hanson et.al. | 2411.16107 | null |
2024-11-25 | Boosting 3D Object Generation through PBR Materials | Yitong Wang et.al. | 2411.16080 | null |
2024-11-25 | Geometry Distributions | Biao Zhang et.al. | 2411.16076 | null |
2024-11-25 | Language Driven Occupancy Prediction | Zhu Yu et.al. | 2411.16072 | link |
2024-11-27 | The fate of EMRI-IMRI pairs in AGN accretion disks: hydrodynamic and three body simulations | Peng Peng et.al. | 2411.16070 | null |
2024-11-24 | Kalkayotl 2.0 Bayesian phase-space modelling of star-forming regions, stellar associations, and open clusters | J. Olivares et.al. | 2411.16012 | link |
2024-11-24 | ActiveCheerios: 3D-Printed Marangoni-Driven Active Particles at an Interface | Jackson K. Wilt et.al. | 2411.16011 | null |
2024-11-24 | Peritumoral Expansion Radiomics for Improved Lung Cancer Classification | Fakrul Islam Tushar et.al. | 2411.16008 | link |
2024-11-24 | Why is Grain Growth Not Curvature Flow? | Caihao Qiu et.al. | 2411.15983 | null |
2024-11-24 | Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors | Soumava Paul et.al. | 2411.15966 | null |
2024-11-24 | Autonomous Multi-Robot Exploration Strategies for 3D Environments with Fire Detection Capabilitie | Ankit Shaw et.al. | 2411.15953 | null |
2024-11-28 | PINNs4Drops: Convolutional feature-enhanced physics-informed neural networks for reconstructing two-phase flows | Maximilian Dreisbach et.al. | 2411.15949 | null |
2024-11-24 | Microfluidic Bioelectrical Impedance Drug Delivery Device for Patients with Acute Exacerbations of Chronic Obstructive Pulmonary Disease | Evan Carroll et.al. | 2411.15934 | null |
2024-11-24 | Mechanical stability conditions for 3D and 2D crystals under arbitrary load | Marcin Maździarz et.al. | 2411.15918 | null |
2024-11-24 | Multi-Robot Scan-n-Print for Wire Arc Additive Manufacturing | Chen-Lung Lu et.al. | 2411.15915 | null |
2024-11-24 | Bimanual Grasp Synthesis for Dexterous Robot Hands | Yanming Shao et.al. | 2411.15903 | null |
2024-11-24 | A block-acoustic preconditioner for the elastic Helmholtz equation | Rachel Yovel et.al. | 2411.15897 | null |
2024-11-24 | Optimization-Driven Statistical Models of Anatomies using Radial Basis Function Shape Representation | Hong Xu et.al. | 2411.15882 | null |
2024-11-24 | Droplet Simulations in Computer Graphics: Theories, Methods and Applications | Hossein Keshtkar et.al. | 2411.15880 | null |
2024-11-26 | Optimizing Brain Tumor Segmentation with MedNeXt: BraTS 2024 SSA and Pediatrics | Sarim Hashmi et.al. | 2411.15872 | link |
2024-11-24 | Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching | Yujing Sun et.al. | 2411.15860 | link |
2024-11-24 | A review of geometric modeling methods in microstructure design and manufacturing | Qiang Zou et.al. | 2411.15833 | null |
2024-11-24 | FieldTNN-based machine learning method for Maxwell eigenvalue problems | Jiantao Jiang et.al. | 2411.15828 | null |
2024-11-24 | Medical Slice Transformer: Improved Diagnosis and Explainability on 3D Medical Images with DINOv2 | Gustav Müller-Franzes et.al. | 2411.15802 | link |
2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
2024-11-24 | Distinctive Electronic Characteristics and Ultra-high Thermoelectric Power Factor in Be-Fe Intermetallics | Q. D. Hao et.al. | 2411.15780 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-12-02 | Enhancing the automatic segmentation and analysis of 3D liver vasculature models | Yassine Machta et.al. | 2411.15778 | link |
2024-11-24 | Integrating Deep Metric Learning with Coreset for Active Learning in 3D Segmentation | Arvind Murari Vepa et.al. | 2411.15763 | link |
2024-11-24 | DynamicAvatars: Accurate Dynamic Facial Avatars Reconstruction and Precise Editing with Diffusion Models | Yangyang Qian et.al. | 2411.15732 | null |
2024-11-28 | GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision | Baixin Xu et.al. | 2411.15723 | link |
2024-11-24 | ROOT: VLM based System for Indoor Scene Understanding and Beyond | Yonghui Wang et.al. | 2411.15714 | link |
2024-11-24 | Fixing the Perspective: A Critical Examination of Zero-1-to-3 | Jack Yu et.al. | 2411.15706 | null |
2024-11-24 | High-order Discontinuous Galerkin solver based on Jacobi polynomial expansion for compressible flows on unstructured meshes | Yu-Xiang Peng et.al. | 2411.15699 | null |
2024-11-23 | Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data | Rui Huang et.al. | 2411.15657 | null |
2024-11-23 | FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video | Jiawei Zhang et.al. | 2411.15604 | null |
2024-11-23 | An adversarial feature learning based semantic communication method for Human 3D Reconstruction | Shaojiang Liu et.al. | 2411.15595 | null |
2024-11-23 | EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting | Xiaobao Wei et.al. | 2411.15582 | null |
2024-11-23 | Improving Factuality of 3D Brain MRI Report Generation with Paired Image-domain Retrieval and Text-domain Augmentation | Junhyeok Lee et.al. | 2411.15490 | null |
2024-11-23 | SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving | Su Sun et.al. | 2411.15482 | null |
2024-11-23 | Gassidy: Gaussian Splatting SLAM in Dynamic Environments | Long Wen et.al. | 2411.15476 | null |
2024-11-23 | SplatSDF: Boosting Neural Implicit SDF via Gaussian Splatting Fusion | Runfa Blark Li et.al. | 2411.15468 | null |
2024-11-23 | ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance | Haijie Yang et.al. | 2411.15436 | null |
2024-11-23 | Semi-supervised Single-view 3D Reconstruction via Multi Shape Prior Fusion Strategy and Self-Attention | Wei Zhoua et.al. | 2411.15420 | link |
2024-11-23 | SPRINT Enables Interpretable and Ultra-Fast Virtual Screening against Thousands of Proteomes | Andrew T. McNutt et.al. | 2411.15418 | link |
2024-11-22 | Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data Processing | Hao Liu et.al. | 2411.15380 | link |
2024-11-22 | UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations | Yuan Ren et.al. | 2411.15355 | null |
2024-11-22 | Dynamic Tube MPC: Learning Tube Dynamics with Massively Parallel Simulation for Robust Safety in Practice | William D. Compton et.al. | 2411.15350 | null |
2024-11-26 | 3-Dimensional Model Based Iterative Reconstruction of Magnetisation in a Nanowire Structure Using Holographic Vector Field Electron Tomography Measurements | Aurys Silinga et.al. | 2411.15323 | link |
2024-11-22 | Bootstrapping the 3d Ising Stress Tensor | Cyuan-Han Chang et.al. | 2411.15300 | null |
2024-11-22 | Regularizing 3D conformal field theories via anyons on the fuzzy sphere | Cristian Voinea et.al. | 2411.15299 | null |
2024-11-22 | Versatile Top-Down Patterning of 3D, 2D and 0D Perovskites for On-Chip Integration | Federico Fabrizi et.al. | 2411.15286 | null |
2024-11-22 | Don’t Mesh with Me: Generating Constructive Solid Geometry Instead of Meshes by Fine-Tuning a Code-Generation LLM | Maximilian Mews et.al. | 2411.15279 | null |
2024-11-22 | Comparing the 3D morphology of solid-oxide fuel cell anodes for different manufacturing processes, operating times, and operating temperatures | Sabrina Weber et.al. | 2411.15259 | null |
2024-11-27 | J-Invariant Volume Shuffle for Self-Supervised Cryo-Electron Tomogram Denoising on Single Noisy Volume | Xiwei Liu et.al. | 2411.15248 | null |
2024-11-21 | Learning Volumetric Neural Deformable Models to Recover 3D Regional Heart Wall Motion from Multi-Planar Tagged MRI | Meng Ye et.al. | 2411.15233 | link |
2024-11-20 | S $^2$ ALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning | Mingze Yin et.al. | 2411.15215 | null |
2024-11-20 | DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh | Jingyu Zhuang et.al. | 2411.15205 | null |
2024-11-19 | Gradient-Weighted Feature Back-Projection: A Fast Alternative to Feature Distillation in 3D Gaussian Splatting | Joji Joseph et.al. | 2411.15193 | link |
2024-11-22 | Material Anything: Generating Materials for Any 3D Object via Diffusion | Xin Huang et.al. | 2411.15138 | null |
2024-11-22 | Learning to Stabilize Faces | Jan Bednarik et.al. | 2411.15074 | null |
2024-11-22 | OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping | Tomas Berriel Martins et.al. | 2411.15043 | link |
2024-11-22 | Regularity, uniqueness and the relative size of small and large scales in SQG flows | Zachary Akridge et.al. | 2411.15040 | null |
2024-11-22 | Neural 4D Evolution under Large Topological Changes from 2D Images | AmirHossein Naghi Razlighi et.al. | 2411.15018 | null |
2024-11-22 | MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving | Hongsi Liu et.al. | 2411.15016 | null |
2024-12-02 | Gauging in Parameter Space: A Top-Down Perspective | Xingyang Yu et.al. | 2411.14997 | null |
2024-11-26 | 3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes | Jan Held et.al. | 2411.14974 | link |
2024-11-22 | A boundary Harnack principle and its application to analyticity of 3D Brownian intersection exponents | Yifan Gao et.al. | 2411.14921 | null |
2024-11-27 | BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence | Xuewu Lin et.al. | 2411.14869 | link |
2024-11-22 | Dynamics-Aware Gaussian Splatting Streaming Towards Fast On-the-Fly Training for 4D Reconstruction | Zhening Liu et.al. | 2411.14847 | null |
2024-11-22 | Local Well-posedness of the Free-boundary Incompressible Elastodynamics with Surface Tension | Longhui Xu et.al. | 2411.14840 | null |
2024-11-22 | Fast High-Quality Enhanced Imaging Algorithm for Layered Dielectric Targets Based on MMW MIMO-SAR System | Xu Chen et.al. | 2411.14837 | null |
2024-11-22 | Unsupervised Multi-view UAV Image Geo-localization via Iterative Rendering | Haoyuan Li et.al. | 2411.14816 | null |
2024-11-30 | Fine-Grained Alignment in Vision-and-Language Navigation through Bayesian Optimization | Yuhang Song et.al. | 2411.14811 | null |
2024-11-22 | Style-Friendly SNR Sampler for Style-Driven Generation | Jooyoung Choi et.al. | 2411.14793 | null |
2024-11-26 | Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction | Huiwon Jang et.al. | 2411.14762 | null |
2024-11-22 | Point Cloud Understanding via Attention-Driven Contrastive Learning | Yi Wang et.al. | 2411.14744 | null |
2024-11-22 | TEXGen: a Generative Diffusion Model for Mesh Textures | Xin Yu et.al. | 2411.14740 | link |
2024-11-22 | VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving | Haiming Zhang et.al. | 2411.14716 | null |
2024-11-22 | Any-to-3D Generation via Hybrid Diffusion Supervision | Yijun Fan et.al. | 2411.14715 | null |
2024-11-22 | Personalised 3D Human Digital Twin with Soft-Body Feet for Walking Simulation | Kum Yew Loke et.al. | 2411.14701 | null |
2024-11-21 | ACE-Net: AutofoCus-Enhanced Convolutional Network for Field Imperfection Estimation with application to high b-value spiral Diffusion MRI | Mengze Gao et.al. | 2411.14630 | null |
2024-11-21 | HotSpot: Screened Poisson Equation for Signed Distance Function Optimization | Zimo Wang et.al. | 2411.14628 | null |
2024-11-21 | Fermi surface reconstruction in strained La $3$Ni$_2$O${7}$ on LaAlO$_3$(001) and SrTiO$_3$ (001) | Benjamin Geisler et.al. | 2411.14600 | null |
2024-11-21 | Solving Zero-Shot 3D Visual Grounding as Constraint Satisfaction Problems | Qihao Yuan et.al. | 2411.14594 | link |
2024-11-21 | Efficient Spatio-Temporal Signal Recognition on Edge Devices Using PointLCA-Net | Sanaz Mahmoodi Takaghaj et.al. | 2411.14585 | null |
2024-11-21 | Swift: A Multi-FPGA Framework for Scaling Up Accelerated Graph Analytics | Oluwole Jaiyeoba et.al. | 2411.14554 | null |
2024-11-21 | A Bound on 3d Mirror Pairs | Zhenghao Zhong et.al. | 2411.14531 | null |
2024-11-28 | NexusSplats: Efficient 3D Gaussian Splatting in the Wild | Yuzhou Tang et.al. | 2411.14514 | null |
2024-11-21 | U-Motion: Learned Point Cloud Video Compression with U-Structured Motion Estimation | Tingyu Fan et.al. | 2411.14501 | null |
2024-11-21 | Test-Time Adaptation of 3D Point Clouds via Denoising Diffusion Models | Hamidreza Dastmalchi et.al. | 2411.14495 | link |
2024-11-21 | Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation | Zhuoman Liu et.al. | 2411.14423 | null |
2024-11-21 | Multimodal 3D Brain Tumor Segmentation with Adversarial Training and Conditional Random Field | Lan Jiang et.al. | 2411.14418 | null |
2024-11-21 | Construction of Lie algebra weight system kernel via Vogel algebra | Dmitry Khudoteplov et.al. | 2411.14417 | null |
2024-11-26 | Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation | Yuanhao Cai et.al. | 2411.14384 | null |
2024-11-21 | InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation | Marziyeh Bamdad et.al. | 2411.14358 | link |
2024-11-21 | Distribution of plastics of various sizes and densities in the global ocean from a 3D Eulerian model | Zih-En Tseng et.al. | 2411.14335 | null |
2024-11-21 | SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching | Arjun P S et.al. | 2411.14322 | link |
2024-11-21 | EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild | Yumeng Liu et.al. | 2411.14280 | link |
2024-11-21 | Observing planetary gaps in the gas of debris disks | C. Bergez-Casalou et.al. | 2411.14241 | null |
2024-11-21 | Regularization and passivity-preserving model reduction of quasilinear magneto-quasistatic coupled problems | Johanna Kerler-Back et.al. | 2411.14226 | null |
2024-11-21 | Novel View Extrapolation with Video Diffusion Priors | Kunhao Liu et.al. | 2411.14208 | null |
2024-11-21 | CompetitorFormer: Competitor Transformer for 3D Instance Segmentation | Duanchu Wang et.al. | 2411.14179 | null |
2024-11-21 | Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting | Jingyi Xu et.al. | 2411.14169 | null |
2024-11-21 | MVANet: Multi-Stage Video Attention Network for Sound Event Localization and Detection with Source Distance Estimation | Hengyi Hong et.al. | 2411.14153 | null |
2024-11-21 | Point Cloud Resampling with Learnable Heat Diffusion | Wenqiang Xu et.al. | 2411.14120 | null |
2024-11-21 | Prandtl Equations and Related Boundary Layer Equations | Yuming Qin et.al. | 2411.14081 | null |
2024-11-22 | First Calculations of Starspot Spectra based on 3D Radiative Magnetohydrodynamics Simulations | H. N. Smitha et.al. | 2411.14056 | null |
2024-11-21 | Stereo Anything: Unifying Stereo Matching with Large-Scale Mixed Data | Xianda Guo et.al. | 2411.14053 | link |
2024-11-21 | Numerical null controllability of parabolic PDEs using Lagrangian methods | Enrique Fernandez-Cara et.al. | 2411.14031 | null |
2024-11-21 | SEMPose: A Single End-to-end Network for Multi-object Pose Estimation | Xin Liu et.al. | 2411.14002 | null |
2024-11-21 | Second derivatives of solutions to the 3D incompressible Navier-Stokes equation in Lebesgue spaces | Igor Honoré et.al. | 2411.13980 | null |
2024-11-21 | 3D Localization of FRB 20190425A for Its Potential Host Galaxy and Implications | Da-Chun Qiang et.al. | 2411.13973 | null |
2024-11-21 | Multimodal 3D Reasoning Segmentation with Complex Scenes | Xueying Jiang et.al. | 2411.13927 | null |
2024-11-21 | Sli2Vol+: Segmenting 3D Medical Images Based on an Object Estimation Guided Correspondence Flow Network | Delin An et.al. | 2411.13873 | link |
2024-11-21 | On the geometry of topological defects in glasses | Zhen Wei Wu et.al. | 2411.13853 | null |
2024-11-21 | 3D-architected gratings for polarization-sensitive, nature-inspired structural color | Moisés H. Ibarra Miranda et.al. | 2411.13803 | null |
2024-11-20 | FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting | Ola Shorinwa et.al. | 2411.13753 | null |
2024-11-20 | GA-NIFS: A galaxy-wide outflow in a Compton-thick mini-BAL quasar at z = 3.5 probed in emission and absorption | Michele Perna et.al. | 2411.13698 | null |
2024-11-20 | Soft gravitons in three dimensions | Jordan Cotler et.al. | 2411.13633 | null |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection | Tong Ning et.al. | 2411.13628 | null |
2024-11-20 | Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization | Hao Ju et.al. | 2411.13610 | null |
2024-11-25 | VioPose: Violin Performance 4D Pose Estimation by Hierarchical Audiovisual Inference | Seong Jong Yoo et.al. | 2411.13607 | link |
2024-11-20 | Find Any Part in 3D | Ziqi Ma et.al. | 2411.13550 | null |
2024-11-20 | Generating 3D-Consistent Videos from Unposed Internet Photos | Gene Chou et.al. | 2411.13549 | null |
2024-11-20 | Identity Preserving 3D Head Stylization with Multiview Score Distillation | Bahri Batuhan Bilecen et.al. | 2411.13536 | null |
2024-11-20 | A Distributed-memory Tridiagonal Solver Based on a Specialised Data Structure Optimised for CPU and GPU Architectures | Semih Akkurt et.al. | 2411.13532 | null |
2024-11-21 | Geometric Algebra Planes: Convex Implicit Neural Volumes | Irmak Sivgin et.al. | 2411.13525 | null |
2024-11-20 | Dynamically Feasible Path Planning in Cluttered Environments via Reachable Bezier Polytopes | Noel Csomay-Shanklin et.al. | 2411.13507 | null |
2024-11-20 | Neural machine translation of seismic waves for petrophysical inversion | José Cunha Teixeira et.al. | 2411.13491 | null |
2024-11-20 | Thermal Entropy, Density Disorder and Antiferromagnetism of Repulsive Fermions in 3D Optical Lattice | Yu-Feng Song et.al. | 2411.13418 | null |
2024-11-20 | Interaction force estimation for tactile sensor arrays: Toward tactile-based interaction control for robotic fingers | Elie Chelly et.al. | 2411.13335 | null |
2024-11-21 | Structure-Based Molecule Optimization via Gradient-Guided Bayesian Update | Keyue Qiu et.al. | 2411.13280 | null |
2024-11-24 | Estimating the tails of the spectrum of the Hessian of the log-likelihood for \textit{ab-initio} single-particle reconstruction in electron cryomicroscopy | Aaditya V. Rangan et.al. | 2411.13263 | null |
2024-11-20 | BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation | Umamaheswaran Raman Kumar et.al. | 2411.13251 | null |
2024-11-25 | Holography of new conformal higher spin gravities in 3d for low spins | I. Lovrekovic et.al. | 2411.13250 | null |
2024-11-20 | XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation | Ziyi Wang et.al. | 2411.13243 | link |
2024-11-20 | Building music with Lego bricks and Raspberry Pi | Ana M. Barbancho et.al. | 2411.13224 | null |
2024-11-20 | Intensity-Spatial Dual Masked Autoencoder for Multi-Scale Feature Learning in Chest CT Segmentation | Yuexing Ding et.al. | 2411.13198 | link |
2024-11-20 | VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation | Chengjie Huang et.al. | 2411.13186 | null |
2024-11-20 | Recovering Mullins damage hyperelastic behaviour with physics augmented neural networks | Martin Zlatić et.al. | 2411.13185 | null |
2024-11-20 | High-order asymptotic expansion for the nonlinear Klein-Gordon equation in the non-relativistic limit regime | Jia Shen et.al. | 2411.13132 | null |
2024-11-20 | Third-order Orbital Corner State and its Realization in Acoustic Crystals | Jiyu Wang et.al. | 2411.13128 | null |
2024-11-20 | Identifying the Galactic Substructures in 5D Space Using All-sky RR Lyrae Stars in Gaia DR3 | Shenglan Sun et.al. | 2411.13122 | null |
2024-11-26 | DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving | Xianda Guo et.al. | 2411.13112 | link |
2024-11-20 | Demonstrating the Suitability of Neuromorphic, Event-Based, Dynamic Vision Sensors for In Process Monitoring of Metallic Additive Manufacturing and Welding | David Mascareñas et.al. | 2411.13108 | null |
2024-11-25 | ESARM: 3D Emotional Speech-to-Animation via Reward Model from Automatically-Ranked Demonstrations | Xulong Zhang et.al. | 2411.13089 | null |
2024-11-20 | Strong interaction induced dimensional crossover in 1D quantum gas | Zhongchi Zhang et.al. | 2411.13088 | null |
2024-11-20 | X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation | Yuchen Yang et.al. | 2411.13026 | link |
2024-11-20 | Open-World Amodal Appearance Completion | Jiayang Ao et.al. | 2411.13019 | null |
2024-11-20 | The temporal and spatial variations of lithium abundance in the Galactic disc | Tiancheng Sun et.al. | 2411.13011 | null |
2024-11-20 | Hierarchical Diffusion Policy: manipulation trajectory generation via contact guidance | Dexin Wang et.al. | 2411.12982 | link |
2024-11-20 | GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting | Xiaobao Wei et.al. | 2411.12981 | null |
2024-11-20 | Shrinking POMCP: A Framework for Real-Time UAV Search and Rescue | Yunuo Zhang et.al. | 2411.12967 | null |
2024-11-19 | Tree Species Classification using Machine Learning and 3D Tomographic SAR – a case study in Northern Europe | Colverd Grace et.al. | 2411.12897 | null |
2024-11-19 | Inverse Faraday effect in 3d, 4d, and 5d transition metals | Shashi B. Mishra et.al. | 2411.12864 | null |
2024-11-19 | Anticipatory Planning for Performant Long-Lived Robot in Large-Scale Home-Like Environments | Md Ridwan Hossain Talukder et.al. | 2411.12837 | null |
2024-11-19 | Towards motion from video diffusion models | Paul Janson et.al. | 2411.12831 | null |
2024-11-19 | Frustrated Spin Systems: History of the Emergence of a Modern Physics | Hung T. Diep et.al. | 2411.12826 | null |
2024-11-19 | Evolution of Supernova Remnants in a Cloudy Multiphase Interstellar Medium | Minghao Guo et.al. | 2411.12809 | null |
2024-11-19 | An exceptionally simple family of Orthosymplectic 3d $\mathcal{N}=4$ rank-0 SCFTs | Zhenghao Zhong et.al. | 2411.12802 | null |
2024-11-19 | Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting | Haoyu Zhao et.al. | 2411.12789 | null |
2024-11-19 | Mini-Splatting2: Building 360 Scenes within Minutes via Aggressive Gaussian Densification | Guangchi Fang et.al. | 2411.12788 | null |
2024-11-19 | Med-2E3: A 2D-Enhanced 3D Medical Multimodal Large Language Model | Yiming Shi et.al. | 2411.12783 | null |
2024-11-19 | Probing Langmuir monolayer self-assembly in condensed and collapsed phases: grazing incidence X-ray diffraction and X-ray standing waves studies | K. V. Nikolaev et.al. | 2411.12686 | null |
2024-11-19 | IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose | Fei Ren et.al. | 2411.12676 | null |
2024-11-20 | M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction | Luoxi Zhang et.al. | 2411.12635 | link |
2024-11-19 | Leveraging Virtual Reality and AI Tutoring for Language Learning: A Case Study of a Virtual Campus Environment with OpenAI GPT Integration with Unity 3D | Adithya TG et.al. | 2411.12619 | null |
2024-11-15 | SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction | Yutao Tang et.al. | 2411.12592 | link |
2024-11-25 | Nucleon relativistic weak-neutral axial-vector four-current distributions | Yi Chen et.al. | 2411.12521 | null |
2024-11-19 | 3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality | Hanbeom Chang et.al. | 2411.12514 | null |
2024-11-19 | PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy | Joanna Kaleta et.al. | 2411.12510 | link |
2024-11-25 | SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image | Zixu Wang et.al. | 2411.12471 | null |
2024-11-19 | GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Shaoqing Xu et.al. | 2411.12452 | link |
2024-11-19 | Rapid Differentiation between Microplastic Particles Using Integrated Microwave Cytometry with 3D Electrodes | Yagmur Ceren Alatas et.al. | 2411.12447 | null |
2024-11-20 | Beyond Gaussians: Fast and High-Fidelity 3D Splatting with Linear Kernels | Haodong Chen et.al. | 2411.12440 | null |
2024-11-19 | Shape modes and jet formation on ultrasound-driven wall-attached bubbles | Marco Cattaneo et.al. | 2411.12371 | null |
2024-11-19 | Rapid response to fast viral evolution using AlphaFold 3-assisted topological deep learning | JunJie Wee et.al. | 2411.12370 | null |
2024-11-19 | The age of the Methuselah star in light of stellar evolution models with tailored abundances | C. Guillaume et.al. | 2411.12343 | null |
2024-11-19 | Target Height Estimation Using a Single Acoustic Camera for Compensation in 2D Seabed Mosaicking | Xiaoteng Zhou et.al. | 2411.12338 | null |
2024-11-20 | DGTR: Distributed Gaussian Turbo-Reconstruction for Sparse-View Vast Scenes | Hao Li et.al. | 2411.12309 | null |
2024-11-19 | SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model | Haowen Zheng et.al. | 2411.12290 | link |
2024-11-19 | GLOVER: Generalizable Open-Vocabulary Affordance Reasoning for Task-Oriented Grasping | Teli Ma et.al. | 2411.12286 | null |
2024-11-19 | Kinetic tomography of the Galactic plane within 1.25 kiloparsecs from the Sun. The interstellar flows revealed by HI and CO line emission and 3D dust | J. D. Soler et.al. | 2411.12257 | null |
2024-11-21 | Neuro-3D: Towards 3D Visual Decoding from EEG Signals | Zhanqiang Guo et.al. | 2411.12248 | null |
2024-11-19 | Invariant Shape Representation Learning For Image Classification | Tonmoy Hossain et.al. | 2411.12201 | link |
2024-11-19 | MTFusion: Reconstructing Any 3D Object from Single Image Using Multi-word Textual Inversion | Yu Liu et.al. | 2411.12197 | null |
2024-11-19 | LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2411.12185 | null |
2024-11-19 | Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation | Zhuangwei Zhuang et.al. | 2411.12177 | link |
2024-11-22 | Sketch-guided Cage-based 3D Gaussian Splatting Deformation | Tianhao Xie et.al. | 2411.12168 | null |
2024-11-21 | FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting | Fangyu Wu et.al. | 2411.12089 | null |
2024-11-18 | Machine Learning Evaluation Metric Discrepancies across Programming Languages and Their Components: Need for Standardization | Mohammad R. Salmanpour et.al. | 2411.12032 | null |
2024-11-18 | Mitigating Imaging Systematics for DESI 2024 Emission Line Galaxies and Beyond | A. J. Rosado-Marín et.al. | 2411.12024 | null |
2024-11-18 | Analyzing and Improving the Skin Tone Consistency and Bias in Implicit 3D Relightable Face Generators | Libing Zeng et.al. | 2411.12002 | null |
2024-11-18 | Domain walls from SPT-sewing | Yabo Li et.al. | 2411.11967 | null |
2024-11-18 | TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction | DaDong Jiang et.al. | 2411.11941 | null |
2024-11-18 | Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation | Hanieh Shojaei Miandashti et.al. | 2411.11935 | null |
2024-11-18 | DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes | Chensheng Peng et.al. | 2411.11921 | link |
2024-11-16 | DiHuR: Diffusion-Guided Generalizable Human Reconstruction | Jinnan Chen et.al. | 2411.11903 | null |
2024-11-18 | UniHands: Unifying Various Wild-Collected Keypoints for Personalized Hand Reconstruction | Menghe Zhang et.al. | 2411.11845 | null |
2024-11-19 | Generative World Explorer | Taiming Lu et.al. | 2411.11844 | null |
2024-11-18 | RoboGSim: A Real2Sim2Real Robotic Gaussian Splatting Simulator | Xinhai Li et.al. | 2411.11839 | null |
2024-11-18 | A Multi-Component, Multi-Physics Computational Model for Solving Coupled Cardiac Electromechanics and Vascular Haemodynamics | Sharp C. Y. Lo et.al. | 2411.11797 | null |
2024-11-18 | sMoRe: Enhancing Object Manipulation and Organization in Mixed Reality Spaces with LLMs and Generative AI | Yunhao Xing et.al. | 2411.11752 | null |
2024-11-18 | Joint-Space Control of a Structurally Elastic Humanoid Robot | Connor W. Herron et.al. | 2411.11734 | null |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-18 | The motion of catalytically active colloids approaching a surface | Julio Melio et.al. | 2411.11656 | null |
2024-11-18 | Wall laws for viscous flows in 3D randomly rough pipes: optimal convergence rates and stochastic integrability | Mitsuo Higaki et.al. | 2411.11653 | null |
2024-11-18 | How bad could it be? Modelling the 3D complexity of the polarised dust signal using moment expansion | Léo Vacher et.al. | 2411.11649 | null |
2024-11-18 | Non-LTE radiative transfer simulations: Improved agreement of the double detonation with normal Type Ia supernovae | Christine E. Collins et.al. | 2411.11643 | null |
2024-11-19 | Leveraging Computational Pathology AI for Noninvasive Optical Imaging Analysis Without Retraining | Danny Barash et.al. | 2411.11613 | null |
2024-11-18 | VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation | Bangguo Yu et.al. | 2411.11609 | null |
2024-11-18 | Analysis of solar eruptions deflecting in the low corona: influence of the magnetic environment | A. Sahade et.al. | 2411.11599 | link |
2024-11-18 | Single-cone Dirac edge states on a lattice | Alvaro Donís Vela et.al. | 2411.11564 | null |
2024-11-19 | Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation | Rüveyda Yilmaz et.al. | 2411.11515 | link |
2024-11-18 | MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion | Dongseok Shim et.al. | 2411.11475 | null |
2024-11-18 | The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather | Markus Schön et.al. | 2411.11455 | null |
2024-11-18 | Weak Simplicial Bisimilarity and Minimisation for Polyhedral Model Checking | Nick Bezhanishvili et.al. | 2411.11428 | null |
2024-11-18 | IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos | Yunong Liu et.al. | 2411.11409 | link |
2024-11-18 | LeC $^2$ O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes | Zhenxing Mi et.al. | 2411.11374 | null |
2024-11-18 | GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views | Boyao Zhou et.al. | 2411.11363 | null |
2024-11-18 | Thickness-dependent Topological Phases and Flat Bands in Rhombohedral Multilayer Graphene | H. B. Xiao et.al. | 2411.11359 | null |
2024-11-18 | Uncertainty Evaluation of the Caesium Fountain Primary Frequency Standard NIM6 | Fasong Zheng et.al. | 2411.11349 | null |
2024-11-18 | Modeling Multivariable High-resolution 3D Urban Microclimate Using Localized Fourier Neural Operator | Shaoxiang Qin et.al. | 2411.11348 | null |
2024-11-18 | Unbiased Approximations for Stationary Distributions of McKean-Vlasov SDEs | Elsiddig Awadelkarim et.al. | 2411.11270 | null |
2024-11-18 | Facilitating a 3D granular flow with an obstruction | Abhijit Sinha et.al. | 2411.11264 | null |
2024-11-18 | gDist: Efficient Distance Computation between 3D Meshes on GPU | Peng Fang et.al. | 2411.11244 | null |
2024-11-18 | DeforHMR: Vision Transformer with Deformable Cross-Attention for 3D Human Mesh Recovery | Jaewoo Heo et.al. | 2411.11214 | null |
2024-11-17 | BVI-CR: A Multi-View Human Dataset for Volumetric Video Compression | Ge Gao et.al. | 2411.11199 | link |
2024-11-17 | PickScan: Object discovery and reconstruction from handheld interactions | Vincent van der Brugge et.al. | 2411.11196 | link |
2024-11-17 | DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images | Zhen Yuan et.al. | 2411.11190 | null |
2024-11-17 | Federated Learning for UAV-Based Spectrum Sensing: Enhancing Accuracy Through SNR-Weighted Model Aggregation | Kürşat Tekbıyık et.al. | 2411.11159 | null |
2024-11-17 | Estimate Sonic Mach Number in the Interstellar Medium with Convolutional Neural Network | Tyler Schmaltz et.al. | 2411.11157 | null |
2024-11-17 | Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors | Svetlana Seliunina et.al. | 2411.11151 | link |
2024-11-17 | Investigating explosive events in a 3D quiet-Sun model: Transition region and coronal response | Yajie Chen et.al. | 2411.11068 | null |
2024-11-17 | VeGaS: Video Gaussian Splatting | Weronika Smolak-Dyżewska et.al. | 2411.11024 | link |
2024-11-17 | Direct and Explicit 3D Generation from a Single Image | Haoyu Wu et.al. | 2411.10947 | null |
2024-11-17 | A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality | Wei-Hsiang Lien et.al. | 2411.10940 | null |
2024-11-17 | Constrained Diffusion with Trust Sampling | William Huang et.al. | 2411.10932 | link |
2024-11-16 | Generating Compositional Scenes via Text-to-image RGBA Instance Generation | Alessandro Fontanella et.al. | 2411.10913 | null |
2024-11-16 | Practitioner Paper: Decoding Intellectual Property: Acoustic and Magnetic Side-channel Attack on a 3D Printer | Amirhossein Jamarani et.al. | 2411.10887 | null |
2024-11-16 | Optimal structured light waves generation in 3D volumes using communication mode optics | Vinicius S. de Angelis et.al. | 2411.10865 | link |
2024-11-16 | NeuroNURBS: Learning Efficient Surface Representations for 3D Solids | Jiajie Fan et.al. | 2411.10848 | link |
2024-11-16 | ARM: Appearance Reconstruction Model for Relightable 3D Generation | Xiang Feng et.al. | 2411.10825 | null |
2024-11-16 | GeomCLIP: Contrastive Geometry-Text Pre-training for Molecules | Teng Xiao et.al. | 2411.10821 | link |
2024-11-16 | Effect of Hubbard U corrections on the electronic and magnetic properties of 2D materials: A high-throughput study | Sahar Pakdel et.al. | 2411.10790 | null |
2024-11-16 | DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment | Mangyu Kong et.al. | 2411.10722 | link |
2024-11-19 | EVT: Efficient View Transformation for Multi-Modal 3D Object Detection | Yongjin Lee et.al. | 2411.10715 | null |
2024-11-16 | Poster: Reliable 3D Reconstruction for Ad-hoc Edge Implementations | Md Nurul Absur et.al. | 2411.10705 | null |
2024-11-16 | Deep Loss Convexification for Learning Iterative Models | Ziming Zhang et.al. | 2411.10649 | null |
2024-11-16 | MTA: Multimodal Task Alignment for BEV Perception and Captioning | Yunsheng Ma et.al. | 2411.10639 | null |
2024-11-15 | Voxel-Aggergated Feature Synthesis: Efficient Dense Mapping for Simulated 3D Reasoning | Owen Burns et.al. | 2411.10616 | null |
2024-11-15 | Motion Diffusion-Guided 3D Global HMR from a Dynamic Camera | Jaewoo Heo et.al. | 2411.10582 | null |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | TESGNN: Temporal Equivariant Scene Graph Neural Networks for Efficient and Robust Multi-View 3D Scene Understanding | Quang P. M. Pham et.al. | 2411.10509 | link |
2024-11-15 | USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting | Kang Chen et.al. | 2411.10504 | link |
2024-11-14 | MFP3D: Monocular Food Portion Estimation Leveraging 3D Point Clouds | Jinge Ma et.al. | 2411.10492 | null |
2024-11-15 | Remote-sensing based control of 3D magnetic fields using machine learning for in-operando applications | Miguel A. Cascales Sandoval et.al. | 2411.10374 | null |
2024-11-15 | Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion | Haoran Wei et.al. | 2411.10369 | null |
2024-11-15 | Secondary Grain Boundary Dislocations Alter Segregation Energy Spectra | Xinren Chen et.al. | 2411.10350 | null |
2024-11-15 | Multiscale stress dynamics in sheared liquid foams revealed by tomo-rheoscopy | Florian Schott et.al. | 2411.10338 | null |
2024-11-15 | Ghost states underlying spatial and temporal patterns: how non-existing invariant solutions control nonlinear dynamics | Zheng Zheng et.al. | 2411.10320 | null |
2024-11-15 | 4DPV: 4D Pet from Videos by Coarse-to-Fine Non-Rigid Radiance Fields | Sergio M. de Paco et.al. | 2411.10275 | link |
2024-11-15 | Simulation of Thermal Nonequilibrium Cycles in the Solar Wind | Roger B. Scott et.al. | 2411.10215 | null |
2024-11-15 | Satellite monitoring of long period ocean-induced magnetic field variations | C. C. Finlay et.al. | 2411.10205 | null |
2024-11-15 | Learning Generalizable 3D Manipulation With 10 Demonstrations | Yu Ren et.al. | 2411.10203 | link |
2024-11-15 | Efficient Density Control for 3D Gaussian Splatting | Xiaobin Deng et.al. | 2411.10133 | link |
2024-11-15 | Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning | Yushen Zuo et.al. | 2411.10130 | null |
2024-11-15 | Influence of Depth Camera Noise Models on Respiration Estimation | Maurice Rohr et.al. | 2411.10081 | null |
2024-11-15 | Long time well-posdness for the 3D Prandtl boundary layer equations without structural assumption | Yuming Qin et.al. | 2411.10052 | null |
2024-11-15 | SPLIT: SE(3)-diffusion via Local Geometry-based Score Prediction for 3D Scene-to-Pose-Set Matching Problems | Kanghyun Kim et.al. | 2411.10049 | null |
2024-11-15 | GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization | Yanhao Sun et.al. | 2411.10033 | null |
2024-11-21 | Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era | Thanh Tam Nguyen et.al. | 2411.09955 | link |
2024-11-15 | GGAvatar: Reconstructing Garment-Separated 3D Gaussian Splatting Avatars from Monocular Video | Jingxuan Chen et.al. | 2411.09952 | link |
2024-11-14 | Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting | Yian Wang et.al. | 2411.09823 | null |
2024-11-14 | A Self-Supervised Model for Multi-modal Stroke Risk Prediction | Camille Delgrange et.al. | 2411.09822 | link |
2024-11-14 | WelQrate: Defining the Gold Standard in Small Molecule Drug Discovery Benchmarking | Yunchao et.al. | 2411.09820 | null |
2024-11-14 | New Higher-Order Super-Compact Scheme for Enhanced Three-Dimensional Heat Transfer with Nanofluid and Conducting Fins | Ashwani Punia et.al. | 2411.09818 | null |
2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
2024-11-14 | CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants | Albert J. Zhai et.al. | 2411.09693 | null |
2024-11-20 | Leveraging Convolutional Neural Networks for 3D Quantitative Angiography Reconstructions from Sparse Cone Beam CT Projections Utilizing CFD Data | Ahmad Rahmatpour et.al. | 2411.09632 | null |
2024-11-19 | Vision-based Manipulation of Transparent Plastic Bags in Industrial Setups | F. Adetunji et.al. | 2411.09623 | null |
2024-11-14 | Effect of Parametric Variation of Chordae Tendineae Structure on Simulated Atrioventricular Valve Closure | Nicolas R. Mangine et.al. | 2411.09599 | null |
2024-11-14 | LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models | Zhengyi Wang et.al. | 2411.09595 | null |
2024-11-14 | VPBSD:Vessel-Pattern-Based Semi-Supervised Distillation for Efficient 3D Microscopic Cerebrovascular Segmentation | Xi Lin et.al. | 2411.09567 | null |
2024-11-14 | Magnetization process of a quasi-two-dimensional quantum magnet: Two-step symmetry restoration and dimensional reduction | Anneke Reinold et.al. | 2411.09541 | null |
2024-11-14 | Marker-free Human Gait Analysis using a Smart Edge Sensor System | Eva Katharina Bauer et.al. | 2411.09538 | null |
2024-11-14 | Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity | Sneha Ramshanker et.al. | 2411.09493 | null |
2024-11-14 | Anomalous Regularization in Kazantsev-Kraichnan Model | Marco Bagnara et.al. | 2411.09482 | null |
2024-11-15 | SINETRA: a Versatile Framework for Evaluating Single Neuron Tracking in Behaving Animals | Raphael Reme et.al. | 2411.09462 | link |
2024-11-14 | DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing | Junjie Zhou et.al. | 2411.09451 | null |
2024-11-14 | Evaluation of RIS-Enabled B5G/6G Indoor Positioning and Mapping using Ray Tracing Models | Dimitris Kompostiotis et.al. | 2411.09440 | null |
2024-11-13 | ReMP: Reusable Motion Prior for Multi-domain 3D Human Pose Estimation and Motion Inbetweening | Hojun Jang et.al. | 2411.09435 | null |
2024-11-14 | Building Height Estimation Using Shadow Length in Satellite Imagery | Mahd Qureshi et.al. | 2411.09411 | link |
2024-11-14 | Time-to-Event Pretraining for 3D Medical Imaging | Zepeng Huo et.al. | 2411.09361 | link |
2024-11-14 | Iterative tomographic reconstruction with TV prior for low-dose CBCT dental imaging | Louise Friot-Giroux et.al. | 2411.09306 | null |
2024-11-14 | Aerobars Position Effect: What is the Interaction Between Aerodynamic Drag and Power Production? | Terol Sébastien et.al. | 2411.09280 | null |
2024-11-14 | LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space | Guanwen Feng et.al. | 2411.09268 | null |
2024-11-20 | JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation | Xuyang Cao et.al. | 2411.09209 | link |
2024-11-14 | RibCageImp: A Deep Learning Framework for 3D Ribcage Implant Generation | Gyanendra Chaubey et.al. | 2411.09204 | null |
2024-11-14 | Orthogonal Linear Array based Product Beamforming for Real Time Underwater 3D Acoustical Imaging | Mimisha M Menakath et.al. | 2411.09197 | null |
2024-11-14 | DyGASR: Dynamic Generalized Exponential Splatting with Surface Alignment for Accelerated 3D Mesh Reconstruction | Shengchao Zhao et.al. | 2411.09156 | null |
2024-11-15 | UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos | Chengbo Yuan et.al. | 2411.09145 | null |
2024-11-14 | Geminga: A Window of the Role Played by Local Halo in the Cosmic Ray Propagation Process | Lin Nie et.al. | 2411.09119 | null |
2024-11-13 | A Vectorial Envelope Maxwell Formulation for Electromagnetic Waveguides with Application to Nonlinear Fiber Optics | Stefan Henneking et.al. | 2411.09090 | null |
2024-11-13 | Multimodal Object Detection using Depth and Image Data for Manufacturing Parts | Nazanin Mahjourian et.al. | 2411.09062 | null |
2024-11-13 | CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation | Xuming Zhang et.al. | 2411.09023 | null |
2024-11-13 | Challenges in constraining dust properties from starlight polarization | Raphael Skalidis et.al. | 2411.08971 | null |
2024-11-13 | Gas dynamics in an AGN-host galaxy at $z\simeq2.6$ : regular rotation, non-circular motions, and mass models | Lingrui Lin et.al. | 2411.08958 | null |
2024-11-12 | Structured Pattern Expansion with Diffusion Models | Marzia Riso et.al. | 2411.08930 | null |
2024-11-12 | DG-PPU: Dynamical Graphs based Post-processing of Point Clouds extracted from Knee Ultrasounds | Injune Hwang et.al. | 2411.08926 | null |
2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879 | null |
2024-11-13 | SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Yifei Jin et.al. | 2411.08767 | null |
2024-11-13 | Weakly-Supervised Anomaly Detection in Surveillance Videos Based on Two-Stream I3D Convolution Network | Sareh Soltani Nejad et.al. | 2411.08755 | null |
2024-11-13 | 3D Modelling to Address Pandemic Challenges: A Project-Based Learning Methodology | Tânia Rocha et.al. | 2411.08730 | null |
2024-11-16 | A Survey on Vision Autoregressive Model | Kai Jiang et.al. | 2411.08666 | null |
2024-11-13 | Toward Human Understanding with Controllable Synthesis | Hanz Cuevas-Velasquez et.al. | 2411.08663 | null |
2024-11-13 | Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models | Chengdong Dong et.al. | 2411.08642 | null |
2024-11-13 | The two alternative explosion mechanisms of core-collapse supernovae: 2024 status report | Noam Soker et.al. | 2411.08555 | null |
2024-11-13 | ACROSS: A Deformation-Based Cross-Modal Representation for Robotic Tactile Perception | Wadhah Zai El Amri et.al. | 2411.08533 | null |
2024-11-22 | BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis | David Svitov et.al. | 2411.08508 | link |
2024-11-13 | Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance | Anton Kuznietsov et.al. | 2411.08482 | null |
2024-11-13 | The Sylvester question in $\mathbb{R}^d$ : convex sets with a flat floor | Jean-François Marckert et.al. | 2411.08456 | null |
2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
2024-11-13 | 3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter | Xiaoxiang Wang et.al. | 2411.08433 | null |
2024-11-13 | Gradient Optical Diffraction Tomography | Julianna Winnik et.al. | 2411.08423 | null |
2024-11-14 | Modeling and Optimization for Rotatable Antenna Enabled Wireless Communication | Qingjie Wu et.al. | 2411.08411 | null |
2024-11-18 | V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion | Xun Huang et.al. | 2411.08402 | link |
2024-11-13 | DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization | Yueming Xu et.al. | 2411.08373 | null |
2024-11-13 | DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning | Zhimin Chen et.al. | 2411.08340 | null |
2024-11-13 | Efficient Trajectory Generation in 3D Environments with Multi-Level Map Construction | Chengkun Tian et.al. | 2411.08323 | null |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-18 | Constraints on local primordial non-Gaussianity with 3d Velocity Reconstruction from the Kinetic Sunyaev-Zeldovich Effect | Alex Laguë et.al. | 2411.08240 | null |
2024-11-12 | Virtual Steps: The Experience of Walking for a Lifelong Wheelchair User in Virtual Reality | Atieh Taheri et.al. | 2411.08229 | null |
2024-11-18 | Analysis of Quantitative Angiography using Projection Foreshortening Correction and Injection Bias Removal | Parmita Mondal et.al. | 2411.08185 | null |
2024-11-12 | Point Cloud Context Analysis for Rehabilitation Grasping Assistance | Jackson M. Steinkamp et.al. | 2411.08169 | null |
2024-11-12 | TomoGRAF: A Robust and Generalizable Reconstruction Network for Single-View Computed Tomography | Di Xu et.al. | 2411.08158 | null |
2024-11-12 | CameraHMR: Aligning People with Perspective | Priyanka Patel et.al. | 2411.08128 | link |
2024-11-09 | Online Collision Risk Estimation via Monocular Depth-Aware Object Detectors and Fuzzy Inference | Brian Hsuan-Cheng Liao et.al. | 2411.08060 | null |
2024-11-08 | Biodynamic Analysis of Alpine Skiing with a Skier-Ski-Snow Interaction Model | Nan Gao et.al. | 2411.08056 | null |
2024-11-12 | GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation | Yushi Lan et.al. | 2411.08033 | null |
2024-11-12 | Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings | Aditya Sanghi et.al. | 2411.08017 | link |
2024-11-13 | Skyrme-Hartree-Fock-Bogoliubov mass models on a 3D mesh: IV. Improved description of the isospin dependence of pairing | Guilherme Grams et.al. | 2411.08007 | null |
2024-11-12 | A computer-vision aided Compton-imaging system for radioactive waste characterization and decommissioning of nuclear power plants | Victor Babiano-Suarez et.al. | 2411.07996 | null |
2024-11-13 | MUltiplexed Survey Telescope: Perspectives for Large-Scale Structure Cosmology in the Era of Stage-V Spectroscopic Survey | Cheng Zhao et.al. | 2411.07970 | null |
2024-11-12 | Quantitative Phase-Field Modeling of Rapid Alloy Solidification | Kaihua Ji et.al. | 2411.07953 | null |
2024-11-12 | Numerical simulation of electron magnetohydrodynamics with Landau-quantized electrons in magnetar crusts | Peter B. Rau et.al. | 2411.07948 | link |
2024-11-12 | Interactions and Reconnections of Four-Dimensional Quantum Vortices | H. A. J. Middleton-Spencer et.al. | 2411.07943 | null |
2024-11-12 | DuoLift-GAN:Reconstructing CT from Single-view and Biplanar X-Rays with Generative Adversarial Networks | Zhaoxi Zhang et.al. | 2411.07941 | null |
2024-11-18 | Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer | Xiao Huo et.al. | 2411.07899 | null |
2024-11-12 | INTRABENCH: Interactive Radiological Benchmark | Constantin Ulrich et.al. | 2411.07885 | null |
2024-11-12 | From Dark Matter Minihalos to Large-Scale Radiative Feedback: A Self-Consistent 3D Simulation of the First Stars and Galaxies using Neural Networks | Colton Feathers et.al. | 2411.07875 | null |
2024-11-12 | Duality in a 3D Field-Theoretic Model | R. Kumar et.al. | 2411.07849 | null |
2024-11-12 | Horticultural Temporal Fruit Monitoring via 3D Instance Segmentation and Re-Identification using Point Clouds | Daniel Fusaro et.al. | 2411.07799 | link |
2024-11-12 | Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning | Jianhao Li et.al. | 2411.07742 | null |
2024-11-12 | 3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration | Liyuan Zhang et.al. | 2411.07740 | link |
2024-11-12 | No-Reference Point Cloud Quality Assessment via Graph Convolutional Network | Wu Chen et.al. | 2411.07728 | link |
2024-11-12 | ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction | Dubing Chen et.al. | 2411.07725 | link |
2024-11-12 | Reducing Conservativeness of Controlled-Invariant Safe Sets by Introducing a Novel Synthesis of Control Barrier Certificates | Naeim Ebrahimi Toulkani et.al. | 2411.07640 | link |
2024-11-12 | Periodic phase diagrams in micromagnetics with an eigenvalue solver | Fangzhou Ai et.al. | 2411.07629 | null |
2024-11-12 | Two-dimensional room temperature ferromagnetic semiconductors | Jia-Wen Li et.al. | 2411.07614 | null |
2024-11-14 | Projecting Gaussian Ellipsoids While Avoiding Affine Projection Approximation | Han Qi et.al. | 2411.07579 | null |
2024-11-12 | IR image databases generation under target intrinsic thermal variability constraints | Jerome Gilles et.al. | 2411.07577 | null |
2024-11-12 | Slow, Nanometer Light Confinement Observed in Atomically Thin TaS2 | Hue T. B. Do et.al. | 2411.07572 | null |
2024-11-12 | GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting | Umangi Jain et.al. | 2411.07555 | null |
2024-11-12 | SP-VIO: Robust and Efficient Filter-Based Visual Inertial Odometry with State Transformation Model and Pose-Only Visual Description | Xueyu Du et.al. | 2411.07551 | null |
2024-11-12 | HiCoM: Hierarchical Coherent Motion for Streamable Dynamic Scene with 3D Gaussian Splatting | Qiankun Gao et.al. | 2411.07541 | link |
2024-11-12 | Towards Seamless Integration of Magnetic Tracking into Fluoroscopy-guided Interventions | Shuwei Xing et.al. | 2411.07495 | null |
2024-11-12 | GUS-IR: Gaussian Splatting with Unified Shading for Inverse Rendering | Zhihao Liang et.al. | 2411.07478 | null |
2024-11-11 | Spiking Transformer Hardware Accelerators in 3D Integration | Boxun Xu et.al. | 2411.07397 | null |
2024-11-11 | $SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation | Yinshuang Xu et.al. | 2411.07326 | null |
2024-11-11 | Disk kinematics at high redshift: DysmalPy’s extension to 3D modeling and comparison with different approaches | Lilian L. Lee et.al. | 2411.07312 | null |
2024-11-11 | A non-rational Verlinde formula from Virasoro TQFT | Boris Post et.al. | 2411.07285 | null |
2024-11-11 | Full 3D nonlinear dynamics of charged and magnetized boson stars | Víctor Jaramillo et.al. | 2411.07284 | null |
2024-11-11 | Modeling of non-planar slicer for improved surface quality in material extrusion 3D printing | Shadman Tajwar Shahid et.al. | 2411.07225 | null |
2024-11-16 | SAMPart3D: Segment Any Part in 3D Objects | Yunhan Yang et.al. | 2411.07184 | link |
2024-11-08 | Acoustic-based 3D Human Pose Estimation Robust to Human Position | Yusuke Oumi et.al. | 2411.07165 | null |
2024-11-11 | Edify 3D: Scalable High-Quality 3D Asset Generation | NVIDIA et.al. | 2411.07135 | null |
2024-11-11 | Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantification | Jannik Franzen et.al. | 2411.07097 | link |
2024-11-12 | Extreme Rotation Estimation in the Wild | Hana Bezalel et.al. | 2411.07096 | null |
2024-11-11 | Automatic Contact-Based 3D Scanning Using Articulated Robotic Arm | Shadman Tajwar Shahid et.al. | 2411.07047 | null |
2024-11-11 | A Hierarchical Compression Technique for 3D Gaussian Splatting Compression | He Huang et.al. | 2411.06976 | null |
2024-11-11 | Azimuthal variation of apparent contact angles on structured surfaces featuring micrometric ramps, pyramids and staggered cubes at two different inherent wettabilities | P. Palmetshofer et.al. | 2411.06961 | null |
2024-11-11 | Octupolar Weyl Superconductivity from Electron-electron Interaction | Zhiming Pan et.al. | 2411.06932 | null |
2024-11-11 | 3D Printing of Near-Ambient Responsive Liquid Crystal Elastomers with Enhanced Nematic Order and Pluralized Transformation | Dongxiao Li et.al. | 2411.06931 | null |
2024-11-11 | 3D Magnetic Textures with Mixed Topology: Unlocking the Tunable Hopf Index | Maria Azhar et.al. | 2411.06929 | null |
2024-11-11 | Study of the muon component in the core-corona model using CONEX 3D | Ana Martina Botti et.al. | 2411.06918 | null |
2024-11-11 | AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness | Yizhuo Yang et.al. | 2411.06789 | link |
2024-11-11 | HSTrack: Bootstrap End-to-End Multi-Camera 3D Multi-object Tracking with Hybrid Supervision | Shubo Lin et.al. | 2411.06780 | null |
2024-11-11 | GTA-Net: An IoT-Integrated 3D Human Pose Estimation System for Real-Time Adolescent Sports Posture Correction | Shizhe Yuan et.al. | 2411.06725 | null |
2024-11-11 | Stationary acoustic black hole solutions in Bose-Einstein condensates and their Borel analysis | Sachin Vaidya et.al. | 2411.06678 | null |
2024-11-11 | Ab initio investigation of layered TMGeTe3 alloys for phase-change applications | Yihui Jiang et.al. | 2411.06668 | null |
2024-11-11 | LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection | Zhengyi Liu et.al. | 2411.06652 | link |
2024-11-10 | Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments | Deegan Atha et.al. | 2411.06632 | null |
2024-11-10 | Adaptive and Temporally Consistent Gaussian Surfels for Multi-view Dynamic Reconstruction | Decai Chen et.al. | 2411.06602 | null |
2024-11-10 | Design and Characterization of a Novel Scintillator Array for In Vivo Monitoring During UHDR PBS Proton Therapy | Roman Vasyltsiv et.al. | 2411.06598 | null |
2024-11-10 | Graph Neural Networks for modelling breast biomechanical compression | Hadeel Awwad et.al. | 2411.06596 | link |
2024-11-10 | New Higher-Order Super-Compact Finite Difference Scheme to Study Three-Dimensional Natural Convection and Entropy Generation in Non-Newtonian Fluids | Ashwani Punia et.al. | 2411.06563 | null |
2024-11-10 | Real-time Deformation-aware Control for Autonomous Robotic Subretinal Injection under iOCT Guidance | Demir Arikan et.al. | 2411.06557 | null |
2024-11-14 | On hydrostatic limit of Beris-Edwards system in a thin strip | Francesco De Anna et.al. | 2411.06494 | null |
2024-11-10 | Towards Voronoi Diagrams of Surface Patches | Pengfei Wang et.al. | 2411.06471 | null |
2024-11-10 | Tetratic Phase in 2D Crystals of Squares | Robert Löffler et.al. | 2411.06464 | null |
2024-11-10 | Improved Video VAE for Latent Video Diffusion Model | Pingyu Wu et.al. | 2411.06449 | null |
2024-11-10 | SuperResolution Radar Gesture Recognitio | Netanel Blumenfeld et.al. | 2411.06410 | null |
2024-11-12 | SplatFormer: Point Transformer for Robust 3D Gaussian Splatting | Yutong Chen et.al. | 2411.06390 | link |
2024-11-10 | Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field | Liuyue Xie et.al. | 2411.06365 | null |
2024-11-10 | A novel algorithm for optimizing bundle adjustment in image sequence alignment | Hailin Xu et.al. | 2411.06343 | null |
2024-11-09 | NeuReg: Domain-invariant 3D Image Registration on Human and Mouse Brains | Taha Razzaq et.al. | 2411.06315 | null |
2024-11-09 | Widespread neuronal chaos induced by slow oscillating currents | James Scully et.al. | 2411.06304 | null |
2024-11-09 | Magnetic interaction of stellar coronal mass ejections with close-in exoplanets: implication on planetary mass loss and Ly- $α$ transits | Gopal Hazra et.al. | 2411.06283 | null |
2024-11-19 | AI’s Spatial Intelligence: Evaluating AI’s Understanding of Spatial Transformations in PSVT:R and Augmented Reality | Uttamasha Monjoree et.al. | 2411.06269 | null |
2024-11-09 | Crowd3D++: Robust Monocular Crowd Reconstruction with Upright Space | Jing Huang et.al. | 2411.06232 | null |
2024-11-09 | Hydrodynamic 3D Simulation of Roche Lobe Overflow in High-mass X-Ray Binaries | David Dickson et.al. | 2411.06227 | null |
2024-11-09 | Text2CAD: Text to 3D CAD Generation via Technical Drawings | Mohsen Yavartanoo et.al. | 2411.06206 | null |
2024-11-09 | Twisted terahertz radiation generation using Laguerre-Gaussian laser pulse propagating in axially magnetized plasma | Dinkar Mishra et.al. | 2411.06189 | null |
2024-11-19 | LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Weijie Ma et.al. | 2411.06173 | link |
2024-11-09 | Non-Leray-Hopf solutions to 3D stochastic hyper-viscous Navier-stokes equations: beyond the Lions exponents | Wenping Cao et.al. | 2411.06133 | null |
2024-11-09 | Three Dimensional Topological Field Theories and Nahm Sum Formulas | Dongmin Gang et.al. | 2411.06081 | null |
2024-11-09 | AI-Driven Stylization of 3D Environments | Yuanbo Chen et.al. | 2411.06067 | null |
2024-11-09 | Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data | Xinran Liu et.al. | 2411.06055 | null |
2024-11-09 | PointCG: Self-supervised Point Cloud Learning via Joint Completion and Generation | Yun Liu et.al. | 2411.06041 | null |
2024-11-09 | GaussianSpa: An “Optimizing-Sparsifying” Simplification Framework for Compact and High-Quality 3D Gaussian Splatting | Yangming Zhang et.al. | 2411.06019 | null |
2024-11-08 | Modelling, design and control of middle-size tilt-rotor quadrotor | Theodore Nye-Matthew et.al. | 2411.05994 | null |
2024-11-08 | Utilisation of Vision Systems and Digital Twin for Maintaining Cleanliness in Public Spaces | Mateusz Wasala et.al. | 2411.05964 | null |
2024-11-08 | Assessing Foundational Medical ‘Segment Anything’ (Med-SAM1, Med-SAM2) Deep Learning Models for Left Atrial Segmentation in 3D LGE MRI | Mehri Mehrnia et.al. | 2411.05963 | null |
2024-11-08 | 3D characterization of reattached flow on an airfoil with finite-span synthetic jet flow control | Adnan Machado et.al. | 2411.05949 | null |
2024-11-08 | Autoregressive Models in Vision: A Survey | Jing Xiong et.al. | 2411.05902 | link |
2024-11-08 | Untrained Perceptual Loss for image denoising of line-like structures in MR images | Elisabeth Pfaehler et.al. | 2411.05884 | null |
2024-11-08 | Benchmarking 3D multi-coil NC-PDNet MRI reconstruction | Asma Tanabene et.al. | 2411.05883 | null |
2024-11-07 | Conditional Diffusion Model for Longitudinal Medical Image Generation | Duy-Phuong Dao et.al. | 2411.05860 | null |
2024-11-05 | A Theory of Stabilization by Skull Carving | Mathieu Lamarre et.al. | 2411.05827 | null |
2024-11-08 | Latest progress on the reduced-order particle-in-cell scheme: II. Quasi-3D implementation and verification | Maryam Reza et.al. | 2411.05759 | null |
2024-11-08 | StdGEN: Semantic-Decomposed 3D Character Generation from Single Images | Yuze He et.al. | 2411.05738 | null |
2024-11-08 | PEP-GS: Perceptually-Enhanced Precise Structured 3D Gaussians for View-Adaptive Rendering | Junxi Jin et.al. | 2411.05731 | null |
2024-11-08 | A Complete Graphic Statics for Rigid-Jointed 3D Frames. Part 1: Legendre Transforms for Moments | Allan McRobie et.al. | 2411.05719 | null |
2024-11-08 | Ultra-high-energy cosmic rays from ultra-fast outflows of active galactic nuclei | Domenik Ehlert et.al. | 2411.05667 | null |
2024-11-08 | Telecom wavelength quantum dots interfaced with silicon-nitride circuits via photonic wire bonding | Ulrich Pfister et.al. | 2411.05647 | null |
2024-11-08 | Physics-constrained coupled neural differential equations for one dimensional blood flow modeling | Hunor Csala et.al. | 2411.05631 | link |
2024-11-08 | Logarithmic corrections to entropy of 3D cosmological solutions from celestial dual | Arindam Bhattacharjee et.al. | 2411.05605 | null |
2024-11-08 | Towards Active Flow Control Strategies Through Deep Reinforcement Learning | Ricard Montalà et.al. | 2411.05536 | null |
2024-11-08 | Alignment of 3D woodblock geometrical models and 2D orthographic projection image | Minh DUc Nguyen et.al. | 2411.05524 | null |
2024-11-08 | EROAS: 3D Efficient Reactive Obstacle Avoidance System for Autonomous Underwater Vehicles using 2.5D Forward-Looking Sonar | Pruthviraj Mane et.al. | 2411.05516 | link |
2024-11-08 | Fast Stochastic Subspace Identification of Densely Instrumented Bridges Using Randomized SVD | Elisa Tomassini et.al. | 2411.05510 | null |
2024-11-08 | 3D-Printed Dual-Polarized Magneto-Electric Dipole Antenna with Wideband High Isolation for Full-Duplex Applications | Mehmet Ahad Yurtoglu et.al. | 2411.05475 | null |
2024-11-08 | Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation | Peidong Liu et.al. | 2411.05472 | link |
2024-11-08 | 2D versus 3D-like electrical behavior of MXene thin films: insights from weak localization in the role of thickness, interflake coupling and defects | Sophia Tangui et.al. | 2411.05461 | null |
2024-11-08 | Multidimensional quantum dynamics with explicitly correlated Gaussian wave packets using Rothe’s method | Simon Elias Schrader et.al. | 2411.05459 | null |
2024-11-08 | Comparative Study of Probabilistic Atlas and Deep Learning Approaches for Automatic Brain Tissue Segmentation from MRI Using N4 Bias Field Correction and Anisotropic Diffusion Pre-processing Techniques | Mohammad Imran Hossain et.al. | 2411.05456 | link |
2024-11-08 | Agile UAV landing control on moving ship in adverse conditions | James Mordaunt et.al. | 2411.05445 | null |
2024-11-08 | POC-SLT: Partial Object Completion with SDF Latent Transformers | Faezeh Zakeri et.al. | 2411.05419 | null |
2024-11-08 | Development of Underactuated Geometric Compliant (UGC) Module with Variable Radial for Robotic Applications | Mark Krysov et.al. | 2411.05418 | null |
2024-11-08 | From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS | Haoran Zhang et.al. | 2411.05362 | link |
2024-11-08 | Development of a Human-Robot Interaction Platform for Dual-Arm Robots Based on ROS and Multimodal Artificial Intelligence | Thanh Nguyen Canh et.al. | 2411.05342 | null |
2024-11-08 | Rate-aware Compression for NeRF-based Volumetric Video | Zhiyu Zhang et.al. | 2411.05322 | null |
2024-11-08 | Exploring the Alignment Landscape: LLMs and Geometric Deep Models in Protein Representation | Dong Shu et.al. | 2411.05316 | link |
2024-11-08 | ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving | Tao Ma et.al. | 2411.05311 | null |
2024-11-08 | Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet | Boxiao Yu et.al. | 2411.05302 | null |
2024-11-08 | SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection | Yun Zhao et.al. | 2411.05292 | null |
2024-11-18 | In-Silico Analysis of Curve Fitting in Angiographic Parametric Imaging in Intracranial Aneurysms | Parmita Mondal et.al. | 2411.05287 | null |
2024-11-08 | Path Planning in Complex Environments with Superquadrics and Voronoi-Based Orientation | Lin Yang et.al. | 2411.05279 | null |
2024-11-08 | Use of 3D chaos game representation to quantify DNA sequence similarity with applications for hierarchical clustering | Stephanie Young et.al. | 2411.05266 | null |
2024-11-07 | ARfy: A Pipeline for Adapting 3D Scenes to Augmented Reality | Arthur Caetano et.al. | 2411.05218 | null |
2024-11-07 | Exploring non-thermal emission from the star-forming region NGC 3603 through a realistic modelling of its environment | Manuel Rocamora et.al. | 2411.05206 | null |
2024-11-07 | Break Times: Virtual Reality Art Therapy | Yi Rou Yap et.al. | 2411.05146 | null |
2024-11-07 | Edge shape sensation presented in a noncontact manner using airborne ultrasound | Koichi Kato et.al. | 2411.05128 | null |
2024-11-07 | STEM: Soft Tactile Electromagnetic Actuator for Virtual Environment Interactions | Heeju Mun et.al. | 2411.05114 | null |
2024-11-07 | Co-Located Magnetic Levitation Haptic and Graphic Display using Iron Core Coils under Screen | Peter Berkelman et.al. | 2411.05113 | null |
2024-11-07 | Enhancing Medical Anatomy Education through Virtual Reality (VR): Design, Development, and Evaluation | Myint Zu Than et.al. | 2411.05106 | null |
2024-11-07 | Universal finite-size scaling in the extraordinary-log boundary phase of 3d $O(N)$ model | Francesco Parisen Toldin et.al. | 2411.05089 | null |
2024-11-07 | ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing | Jun-Kun Chen et.al. | 2411.05006 | null |
2024-11-07 | DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation | Peiqi Liu et.al. | 2411.04999 | link |
2024-11-07 | LoFi: Scalable Local Image Reconstruction with Implicit Neural Representation | AmirEhsan Khorashadizadeh et.al. | 2411.04995 | link |
2024-11-07 | VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes | Advaith V. Sethuraman et.al. | 2411.04963 | null |
2024-11-07 | SPGD: Steepest Perturbed Gradient Descent Optimization | Amir M. Vahedi et.al. | 2411.04946 | link |
2024-11-07 | Terahertz generation via all-optical quantum control in 2D and 3D materials | Kamalesh Jana et.al. | 2411.04943 | null |
2024-11-07 | DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | Wenqiang Sun et.al. | 2411.04928 | null |
2024-11-07 | MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views | Yuedong Chen et.al. | 2411.04924 | link |
2024-11-11 | ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset | Olaf Wysocki et.al. | 2411.04865 | link |
2024-11-07 | Differentiable Gaussian Representation for Incomplete CT Reconstruction | Shaokai Wu et.al. | 2411.04844 | null |
2024-11-07 | GANESH: Generalizable NeRF for Lensless Imaging | Rakesh Raj Madavan et.al. | 2411.04810 | null |
2024-11-07 | Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping | Sayat Ibrayev et.al. | 2411.04797 | null |
2024-11-07 | Towards Real Time Compton Imaging in Demanding Conditions | Bernardo Gameiro et.al. | 2411.04785 | null |
2024-11-07 | Intermittency of a transitional airfoil flow with laminar separation bubble solved by the lattice-Boltzmann method | Bernardo Luiz Ribeiro et.al. | 2411.04763 | null |
2024-11-07 | Equivariant Graph Attention Networks with Structural Motifs for Predicting Cell Line-Specific Synergistic Drug Combinations | Zachary Schwehr et.al. | 2411.04747 | link |
2024-11-07 | Experimental and Numerical Studies of the Collapse of Dense Clouds Induced by Herbig-Haro Stellar Jets | Marin Fontaine et.al. | 2411.04736 | null |
2024-11-07 | Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation | Benito Buchheim et.al. | 2411.04724 | null |
2024-11-12 | Evolution of the electron distribution function during gas ionization by a sub-nanosecond microwave pulse of hundreds MW power | Y. Bliokh et.al. | 2411.04720 | null |
2024-11-07 | NeuroFly: A framework for whole-brain single neuron reconstruction | Rubin Zhao et.al. | 2411.04715 | link |
2024-11-07 | DNN-based 3D Cloud Retrieval for Variable Solar Illumination and Multiview Spaceborne Imaging | Tamar Klein et.al. | 2411.04682 | null |
2024-11-07 | HypoNet Nankai: Rapid hypocenter determination tool for the Nankai Trough subduction zone using physics-informed neural networks | Ryoichiro Agata et.al. | 2411.04667 | null |
2024-11-07 | Brain Tumour Removing and Missing Modality Generation using 3D WDM | André Ferreira et.al. | 2411.04630 | link |
2024-11-07 | Population estimation using 3D city modelling and Carto2S datasets – A case study | Jai G Singla et.al. | 2411.04612 | null |
2024-11-07 | Social EgoMesh Estimation | Luca Scofano et.al. | 2411.04598 | link |
2024-11-07 | Machine learning-driven complex models for wavefront shaping through multimode fibers | Jérémy Saucourt et.al. | 2411.04531 | null |
2024-11-07 | LESnets (Large-Eddy Simulation nets): Physics-informed neural operator for large-eddy simulation of turbulence | Sunan Zhao et.al. | 2411.04502 | null |
2024-11-07 | FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments | Aoru Xue et.al. | 2411.04469 | null |
2024-11-07 | Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation | Qingyao Tian et.al. | 2411.04404 | null |
2024-11-07 | ProGraph: Temporally-alignable Probability Guided Graph Topological Modeling for 3D Human Reconstruction | Hongsheng Wang et.al. | 2411.04399 | null |
2024-11-08 | SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation | Xun Tu et.al. | 2411.04386 | null |
2024-11-07 | Perspective on recent developments and challenges in regulatory and systems genomics | Julia Zeiltinger et.al. | 2411.04363 | null |
2024-11-07 | LidaRefer: Outdoor 3D Visual Grounding for Autonomous Driving with Transformers | Yeong-Seung Baek et.al. | 2411.04351 | null |
2024-11-07 | Rapid Quadrotor Navigation in Diverse Environments using an Onboard Depth Camera | Jonathan Lee et.al. | 2411.04326 | null |
2024-11-06 | Efficient Symmetry-Aware Materials Generation via Hierarchical Generative Flow Networks | Tri Minh Nguyen et.al. | 2411.04323 | null |
2024-11-06 | Photon acceleration of high-intensity vector vortex beams into the extreme ultraviolet | Kyle G. Miller et.al. | 2411.04258 | null |
2024-11-08 | PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing | Siddharth Seth et.al. | 2411.04249 | link |
2024-11-06 | Tannakian QFT: from spark algebras to quantum groups | Tudor Dimofte et.al. | 2411.04194 | null |
2024-11-06 | Mapping reionization bubbles in the JWST era I: empirical edge detection with Lyman alpha emission from galaxies | Ting-Yi Lu et.al. | 2411.04176 | null |
2024-11-06 | BAPULM: Binding Affinity Prediction using Language Models | Radheesh Sharma Meda et.al. | 2411.04150 | link |
2024-11-06 | Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation | Ke Fan et.al. | 2411.04079 | null |
2024-11-06 | Collective Dynamics of Intelligent Active Brownian Particles with Visual Perception and Velocity Alignment in 3D: Spheres, Rods, and Worms | Zhaoxuan Liu et.al. | 2411.03975 | null |
2024-11-06 | Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC | Tyler Clark et.al. | 2411.03820 | null |
2024-11-06 | SA3DIP: Segment Any 3D Instance with Potential 3D Priors | Xi Yang et.al. | 2411.03819 | link |
2024-11-08 | GS2Pose: Two-stage 6D Object Pose Estimation Guided by Gaussian Splatting | Jilan Mei et.al. | 2411.03807 | null |
2024-11-06 | Model-independent calibration of Gamma-Ray Bursts with neural networks | Purba Mukherjee et.al. | 2411.03773 | null |
2024-11-06 | Homotopy Continuation Made Easy: Regression-based Online Simulation of Starting Problem-Solution Pairs | Xinyue Zhang et.al. | 2411.03745 | null |
2024-11-06 | Topological Dirac-vortex modes in a three-dimensional photonic topological insulator | Bei Yan et.al. | 2411.03738 | null |
2024-11-06 | Relation Learning and Aggregate-attention for Multi-person Motion Prediction | Kehua Qu et.al. | 2411.03729 | null |
2024-11-06 | PX2Tooth: Reconstructing the 3D Point Cloud Teeth from a Single Panoramic X-ray | Wen Ma et.al. | 2411.03725 | null |
2024-11-06 | These Maps Are Made by Propagation: Adapting Deep Stereo Networks to Road Scenarios with Decisive Disparity Diffusion | Chuang-Wei Liu et.al. | 2411.03717 | null |
2024-11-06 | 3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object Rearrangement | Ziqi Lu et.al. | 2411.03706 | link |
2024-11-06 | OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2411.03696 | null |
2024-11-06 | Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey | Amer Essakine et.al. | 2411.03688 | null |
2024-11-06 | Beyond Model Adaptation at Test Time: A Survey | Zehao Xiao et.al. | 2411.03687 | link |
2024-11-06 | Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model | Yansong Qu et.al. | 2411.03672 | null |
2024-11-06 | Deciphering the Evolution of Thermodynamic Properties and their Connection to the Global Kinematics of High-Speed Coronal Mass Ejections Using FRIS Model | Soumyaranjan Khuntia et.al. | 2411.03639 | null |
2024-11-06 | Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Rui Peng et.al. | 2411.03637 | link |
2024-11-05 | Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting | Michael Büttner et.al. | 2411.03555 | null |
2024-11-05 | VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation | Haochen Zhang et.al. | 2411.03540 | link |
2024-11-05 | Beyond Complete Shapes: A Quantitative Evaluation of 3D Shape Matching Algorithms | Viktoria Ehm et.al. | 2411.03511 | null |
2024-11-05 | Self Supervised Networks for Learning Latent Space Representations of Human Body Scans and Motions | Emmanuel Hartman et.al. | 2411.03475 | null |
2024-11-07 | A high resolution simulation of protoplanetary disk turbulence driven by the vertical shear instability | Karim Shariff et.al. | 2411.03467 | null |
2024-11-05 | A 3D Simulation of a Type II-P Supernova: from Core Bounce to Beyond Shock Breakout | David Vartanyan et.al. | 2411.03434 | null |
2024-11-05 | Fine-Grained Spatial and Verbal Losses for 3D Visual Grounding | Sombit Dey et.al. | 2411.03405 | null |
2024-11-08 | Neurons for Neutrons: A Transformer Model for Computation Load Estimation on Domain-Decomposed Neutron Transport Problems | Alexander Mote et.al. | 2411.03389 | null |
2024-11-05 | Using Assurance Cases to Guide Verification and Validation of Research Software | W. Spencer Smith et.al. | 2411.03291 | null |
2024-11-05 | Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features | Hanyu Meng et.al. | 2411.03172 | null |
2024-11-05 | Proposals for 3D self-correcting quantum memory | Ting-Chun Lin et.al. | 2411.03115 | null |
2024-11-05 | Comparison of Bayesian inference methods using the Loreli II database of hydro-radiative simulations of the 21-cm signal | Romain Meriot et.al. | 2411.03093 | null |
2024-11-05 | HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features | Arnab Dey et.al. | 2411.03086 | null |
2024-11-05 | Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data | Irum Mehboob et.al. | 2411.03082 | null |
2024-11-05 | GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details | Zhongjin Luo et.al. | 2411.03047 | null |
2024-11-05 | CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection | Jisong Kim et.al. | 2411.03013 | link |
2024-11-05 | Learning-based Lossless Event Data Compression | Ahmadreza Sezavar et.al. | 2411.03010 | null |
2024-11-05 | CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval | Xin Wen et.al. | 2411.02979 | null |
2024-11-05 | Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery | Liv Kåreborn et.al. | 2411.02972 | null |
2024-11-05 | Multi-Modal 3D Scene Graph Updater for Shared and Dynamic Environments | Emilio Olivastri et.al. | 2411.02938 | null |
2024-11-05 | Almost Linear Decoder for Optimal Geometrically Local Quantum Codes | Quinten Eggerickx et.al. | 2411.02928 | null |
2024-11-05 | Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey | Ao Fu et.al. | 2411.02914 | null |
2024-11-05 | A Symmetric Dynamic Learning Framework for Diffeomorphic Medical Image Registration | Jinqiu Deng et.al. | 2411.02888 | null |
2024-11-05 | Artificial Intelligence-Enhanced Couinaud Segmentation for Precision Liver Cancer Therapy | Liang Qiu et.al. | 2411.02815 | null |
2024-11-05 | NEOviz: Uncertainty-Driven Visual Analysis of Asteroid Trajectories | Fangfei Lan et.al. | 2411.02812 | null |
2024-11-05 | Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection | Yifan Wang et.al. | 2411.02747 | link |
2024-11-05 | LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting | Huibin Zhao et.al. | 2411.02703 | null |
2024-11-04 | Multi-Transmotion: Pre-trained Model for Human Motion Prediction | Yang Gao et.al. | 2411.02673 | link |
2024-11-04 | Multi-modal deformable image registration using untrained neural networks | Quang Luong Nhat Nguyen et.al. | 2411.02672 | null |
2024-11-04 | Tracking Tumors under Deformation from Partial Point Clouds using Occupancy Networks | Pit Henrich et.al. | 2411.02619 | null |
2024-11-04 | Advanced XR-Based 6-DOF Catheter Tracking System for Immersive Cardiac Intervention Training | Mohsen Annabestani et.al. | 2411.02611 | null |
2024-11-04 | Towards Context-Aware Adaptation in Extended Reality: A Design Space for XR Interfaces and an Adaptive Placement Strategy | Shakiba Davari et.al. | 2411.02607 | null |
2024-11-04 | Computing critical exponents in 3D Ising model via pattern recognition/deep learning approach | Timothy A. Burt et.al. | 2411.02604 | null |
2024-11-04 | Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing | Xinran Zhang et.al. | 2411.02553 | null |
2024-11-04 | Modeling Uncertainty in 3D Gaussian Splatting through Continuous Semantic Splatting | Joey Wilson et.al. | 2411.02547 | null |
2024-11-04 | SPACE: 3D Spatial Co-operation and Exploration Framework for Robust Mapping and Coverage with Multi-Robot Systems | Sai Krishna Ghanta et.al. | 2411.02524 | null |
2024-11-04 | Energy Extraction from a Black Hole by a Strongly Magnetized Thin Accretion Disk | Prasun Dhang et.al. | 2411.02515 | null |
2024-11-04 | Building a Synthetic Vascular Model: Evaluation in an Intracranial Aneurysms Detection Scenario | Rafic Nader et.al. | 2411.02477 | null |
2024-11-02 | Cross-D Conv: Cross-Dimensional Transferable Knowledge Base via Fourier Shifting Operation | Mehmet Can Yavuz et.al. | 2411.02441 | link |
2024-11-04 | Neural optical flow for planar and stereo PIV | Andrew I. Masker et.al. | 2411.02373 | null |
2024-11-04 | Learning General-Purpose Biomedical Volume Representations using Randomized Synthesis | Neel Dey et.al. | 2411.02372 | link |
2024-11-04 | Holographic Reconstruction of Gravitational Perturbations in AdS/CFT and Implications for Celestial Conformal Field Theory | David A. Lowe et.al. | 2411.02364 | null |
2024-11-04 | MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D | Wei Cheng et.al. | 2411.02336 | null |
2024-11-06 | SplatOverflow: Asynchronous Hardware Troubleshooting | Amritansh Kwatra et.al. | 2411.02332 | null |
2024-11-05 | GenXD: Generating Any 3D and 4D Scenes | Yuyang Zhao et.al. | 2411.02319 | null |
2024-11-05 | Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation | Xianghui Yang et.al. | 2411.02293 | null |
2024-11-04 | 3D Audio-Visual Segmentation | Artem Sokolov et.al. | 2411.02236 | null |
2024-11-05 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-11-04 | Modelling Realistic Multi-layer devices for superconducting quantum electronic circuits | Giuseppe Colletta et.al. | 2411.02178 | null |
2024-11-04 | A More General Linear Projectile Problem | Nick Lorenzo et.al. | 2411.02145 | null |
2024-11-04 | Deep Learning on 3D Semantic Segmentation: A Detailed Review | Thodoris Betsas et.al. | 2411.02104 | null |
2024-11-04 | The evolution of volumetric video: A survey of smart transcoding and compression approaches | Preetish Kakkar et.al. | 2411.02095 | null |
2024-11-04 | Helical kelvin waves for the 3D Euler equation | Daomin Cao et.al. | 2411.02055 | null |
2024-11-04 | An Immediate Update Strategy of Multi-State Constraint Kalman Filter | Qingchao Zhang et.al. | 2411.02028 | null |
2024-11-04 | Colloidal quasi-2D Methylammonium Lead Bromide Perovskite Nanostructures with Tunable Shape and High Chemical Stability | Eugen Klein et.al. | 2411.01999 | null |
2024-11-04 | Reshaping UAV-Enabled Communications with Omnidirectional Multi-Rotor Aerial Vehicles | Daniel Bonilla Licea et.al. | 2411.01985 | null |
2024-11-10 | Connection Performance Modeling and Analysis of a Radiosonde Network in a Typhoon | Hanyi Liu et.al. | 2411.01906 | null |
2024-11-04 | MBDRes-U-Net: Multi-Scale Lightweight Brain Tumor Segmentation Network | Longfeng Shen et.al. | 2411.01896 | link |
2024-11-04 | Towards the Industrial Metaverse: A Game-Based VR Application for Fire Drill and Evacuation Training for Ships and Shipbuilding | Musaab H. Hamed-Ahmed et.al. | 2411.01895 | null |
2024-11-04 | Efficient Active Imitation Learning with Random Network Distillation | Emilien Biré et.al. | 2411.01894 | null |
2024-11-04 | Exact periodic solutions of the generalized Constantin-Lax-Majda equation with dissipation | Denis A. Silantyev et.al. | 2411.01891 | null |
2024-11-04 | Mining and Transferring Feature-Geometry Coherence for Unsupervised Point Cloud Registration | Kezheng Xiong et.al. | 2411.01870 | link |
2024-11-06 | GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes | Gaochao Song et.al. | 2411.01853 | null |
2024-11-04 | Silver medal Solution for Image Matching Challenge 2024 | Yian Wang et.al. | 2411.01851 | null |
2024-11-11 | MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation | Duc Dang Trung Tran et.al. | 2411.01781 | null |
2024-11-04 | Disentangled PET Lesion Segmentation | Tanya Gatsak et.al. | 2411.01758 | null |
2024-11-04 | Multi-task Geometric Estimation of Depth and Surface Normal from Monocular 360° Images | Kun Huang et.al. | 2411.01749 | link |
2024-11-04 | Rotation Perturbation Robustness in Point Cloud Analysis: A Perspective of Manifold Distillation | Xinyu Xu et.al. | 2411.01748 | null |
2024-11-04 | Next Best View For Point-Cloud Model Acquisition: Bayesian Approximation and Uncertainty Analysis | Madalena Caldeira et.al. | 2411.01734 | null |
2024-11-04 | Atomic-scale 3D structural dynamics and functional degradation of Pt alloy nanocatalysts | Chaehwa Jeong et.al. | 2411.01727 | null |
2024-11-03 | Global self-similar solutions for the 3D Muskat equation | Jungkyoung Na et.al. | 2411.01682 | null |
2024-11-03 | The Painlevé equivalence problem for a constrained 3D system | Galina Filipuk et.al. | 2411.01657 | null |
2024-11-03 | Monolithic 3D numerical modeling of granular cargo movement on bulk carriers in waves | Wibke Düsterhöft-Wriggers et.al. | 2411.01649 | null |
2024-11-03 | Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation | Zhenbin Wang et.al. | 2411.01647 | null |
2024-11-03 | The Impact of TaS $_{2}$ -Augmented Interconnects on Circuit Performance: A Temperature-Dependent Analysis | Xinkang Chen et.al. | 2411.01632 | null |
2024-11-03 | DreamPolish: Domain Score Distillation With Progressive Geometry Generation | Yean Cheng et.al. | 2411.01602 | null |
2024-11-03 | One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection | Zhenyu Wang et.al. | 2411.01584 | null |
2024-11-03 | FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing | Jitesh Joshi et.al. | 2411.01542 | link |
2024-11-03 | InstantGeoAvatar: Effective Geometry and Appearance Modeling of Animatable Avatars from Monocular Video | Alvaro Budria et.al. | 2411.01512 | link |
2024-11-03 | FaceDig: Automated tool for placing landmarks on facial portraits for geometric morphometrics users | Karel Kleisner et.al. | 2411.01508 | null |
2024-11-03 | 3D Migration Aperture and Formula Connecting Dips of Prestack Time Migrated and Unmigrated Data | Jagmeet Singh et.al. | 2411.01449 | null |
2024-11-03 | Pre-trained Molecular Language Models with Random Functional Group Masking | Tianhao Peng et.al. | 2411.01401 | null |
2024-11-08 | New Cold Subdwarf Discoveries from Backyard Worlds and a Metallicity Classification System for T Subdwarfs | Adam J. Burgasser et.al. | 2411.01378 | null |
2024-11-02 | Guided Synthesis of Labeled Brain MRI Data Using Latent Diffusion Models for Segmentation of Enlarged Ventricles | Tim Ruschke et.al. | 2411.01351 | null |
2024-11-02 | Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks | Aarjav Kavathia et.al. | 2411.01348 | null |
2024-11-02 | PMI-DT: Leveraging Digital Twins and Machine Learning for Predictive Modeling and Inspection in Manufacturing | Chas Hamel et.al. | 2411.01299 | null |
2024-11-02 | MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction | Wang Zhao et.al. | 2411.01226 | link |
2024-11-02 | PDBBind Optimization to Create a High-Quality Protein-Ligand Binding Dataset for Binding Affinity Prediction | Yingze Wang et.al. | 2411.01223 | link |
2024-11-02 | MultiPull: Detailing Signed Distance Functions by Pulling Multi-Level Queries at Multi-Step | Takeshi Noda et.al. | 2411.01208 | null |
2024-11-02 | AquaFuse: Waterbody Fusion for Physics Guided View Synthesis of Underwater Scenes | Md Abu Bakr Siddique et.al. | 2411.01119 | null |
2024-11-02 | Test-Time Adaptation in Point Clouds: Leveraging Sampling Variation with Weight Averaging | Ali Bahri et.al. | 2411.01116 | link |
2024-11-01 | Artificial Intelligence End-to-End Workflow for Transmission Electron Microscopy: From Data Analysis Automation to Materials Knowledge Unveiling | Marc Botifoll et.al. | 2411.01024 | null |
2024-11-01 | Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers | Gavia Gray et.al. | 2411.00999 | link |
2024-11-01 | The thermal bootstrap for the critical O(N) model | Julien Barrat et.al. | 2411.00978 | null |
2024-11-08 | Lung tumor segmentation in MRI mice scans using 3D nnU-Net with minimum annotations | Piotr Kaniewski et.al. | 2411.00922 | null |
2024-10-31 | Blind Time-of-Flight Imaging: Sparse Deconvolution on the Continuum with Unknown Kernels | Ruiming Guo et.al. | 2411.00893 | null |
2024-10-30 | Deep Learning for 3D Point Cloud Enhancement: A Survey | Siwen Quan et.al. | 2411.00857 | null |
2024-10-29 | A Flight-Mechanics Solver for Aircraft Inverse Simulations and Application to 3D Mirage-III Maneuver | Osama A. Marzouk et.al. | 2411.00834 | null |
2024-11-01 | CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes | Yang Liu et.al. | 2411.00771 | null |
2024-11-01 | 3d SUSY enhancement and non-semisimple TQFTs from four dimensions | Arash Arabi Ardehali et.al. | 2411.00766 | null |
2024-11-01 | ZIM: Zero-Shot Image Matting for Anything | Beomyoung Kim et.al. | 2411.00626 | link |
2024-11-01 | A Graph Attention-Guided Diffusion Model for Liver Vessel Segmentation | Xiaotong Zhang et.al. | 2411.00617 | null |
2024-11-01 | Tumor Location-weighted MRI-Report Contrastive Learning: A Framework for Improving the Explainability of Pediatric Brain Tumor Diagnosis | Sara Ketabi et.al. | 2411.00609 | null |
2024-11-01 | On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR | Li Li et.al. | 2411.00600 | link |
2024-11-01 | A Semi-Discrete Optimal Transport Scheme for the 3D Incompressible Semi-Geostrophic Equations | Théo Lavier et.al. | 2411.00575 | null |
2024-11-01 | Differentiable Physics-based System Identification for Robotic Manipulation of Elastoplastic Materials | Xintong Yang et.al. | 2411.00554 | null |
2024-11-01 | Conditional Synthesis of 3D Molecules with Time Correction Sampler | Hojung Jung et.al. | 2411.00551 | null |
2024-11-04 | 3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction | Jongmin Lee et.al. | 2411.00543 | null |
2024-11-01 | RISTRETTO: the PIAA Nuller in the prototyping phase | N. Restori et.al. | 2411.00486 | null |
2024-11-01 | Fusing matrix-product states with quantum Monte Carlo: reducing entanglement and sign problem at the same time | Gunnar Bollmark et.al. | 2411.00480 | null |
2024-11-01 | Target-Guided Adversarial Point Cloud Transformer Towards Recognition Against Real-world Corruptions | Jie Wang et.al. | 2411.00462 | link |
2024-11-01 | ConceptFactory: Facilitate 3D Object Knowledge Annotation with Object Conceptualization | Jianhua Sun et.al. | 2411.00448 | null |
2024-11-01 | PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling | Donghyun Kim et.al. | 2411.00432 | null |
2024-11-01 | StyleTex: Style Image-Guided Texture Generation for 3D Models | Zhiyu Xie et.al. | 2411.00399 | null |
2024-11-06 | Advantages of Neural Population Coding for Deep Learning | Heiko Hoffmann et.al. | 2411.00393 | null |
2024-11-01 | Two-dimensional ASEP model to study density profiles in CVD growth | Gagan Kumar et.al. | 2411.00378 | null |
2024-11-01 | Constrained Diffusion Implicit Models | Vivek Jayaram et.al. | 2411.00359 | null |
2024-11-01 | GAFusion: Adaptive Fusing LiDAR and Camera with Multiple Guidance for 3D Object Detection | Xiaotian Li et.al. | 2411.00340 | null |
2024-11-01 | From Flip FET to Flip 3D Integration (F3D): Maximizing the Scaling Potential of Wafer Both Sides Beyond Conventional 3D Integration | Heng Wu et.al. | 2411.00309 | null |
2024-10-31 | Aquatic-GS: A Hybrid 3D Representation for Underwater Scenes | Shaohua Liu et.al. | 2411.00239 | null |
2024-10-31 | APEBench: A Benchmark for Autoregressive Neural Emulators of PDEs | Felix Koehler et.al. | 2411.00180 | link |
2024-10-31 | A Recipe for Geometry-Aware 3D Mesh Transformers | Mohammad Farazi et.al. | 2411.00164 | null |
2024-10-31 | NIMBA: Towards Robust and Principled Processing of Point Clouds With SSMs | Nursena Köprücü et.al. | 2411.00151 | null |
2024-10-31 | Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis | Chen Zhao et.al. | 2411.00144 | link |
2024-10-31 | Enhancing Brain Source Reconstruction through Physics-Informed 3D Neural Networks | Marco Morik et.al. | 2411.00143 | null |
2024-10-31 | Ill-posedness of $2\frac12$ D electron MHD | Mimi Dai et.al. | 2411.00120 | null |
2024-10-31 | Spherical bias on the 3D reconstruction of the ICM density profile in galaxy clusters | I. Veronesi et.al. | 2411.00092 | null |
2024-10-31 | URAvatar: Universal Relightable Gaussian Codec Avatars | Junxuan Li et.al. | 2410.24223 | null |
2024-10-31 | EgoMimic: Scaling Imitation Learning via Egocentric Video | Simar Kareer et.al. | 2410.24221 | link |
2024-11-01 | DELTA: Dense Efficient Long-range 3D Tracking for any video | Tuan Duc Ngo et.al. | 2410.24211 | null |
2024-10-31 | No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images | Botao Ye et.al. | 2410.24207 | link |
2024-11-01 | GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering | Kai Ye et.al. | 2410.24204 | null |
2024-10-31 | DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion | Weicai Ye et.al. | 2410.24203 | link |
2024-10-31 | invrs-gym: a toolkit for nanophotonic inverse design research | Martin F. Schubert et.al. | 2410.24132 | link |
2024-10-31 | A Practical Style Transfer Pipeline for 3D Animation: Insights from Production R&D | Hideki Todo et.al. | 2410.24123 | null |
2024-10-31 | Characterization of the optical model of the T2K 3D segmented plastic scintillator detector | S. Abe et.al. | 2410.24099 | null |
2024-10-31 | 3D-ViTac: Learning Fine-Grained Manipulation with Visuo-Tactile Sensing | Binghao Huang et.al. | 2410.24091 | null |
2024-10-31 | Impact of micromotion on the excitation of Rydberg states of ions in a Paul trap | Wilson S. Martins et.al. | 2410.24047 | null |
2024-10-31 | A Multi-Modal Approach for Face Anti-Spoofing in Non-Calibrated Systems using Disparity Maps | Ariel Larey et.al. | 2410.24031 | null |
2024-11-05 | Re-assembling the past: The RePAIR dataset and benchmark for real world 2D and 3D puzzle solving | Theodore Tsesmelis et.al. | 2410.24010 | null |
2024-10-31 | ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images | Timing Yang et.al. | 2410.24001 | link |
2024-10-31 | Stochastic Reconstruction of Gappy Lagrangian Turbulent Signals by Conditional Diffusion Models | Tianyi Li et.al. | 2410.23971 | null |
2024-10-31 | EmbodiedRAG: Dynamic 3D Scene Graph Retrieval for Efficient and Scalable Robot Task Planning | Meghan Booker et.al. | 2410.23968 | null |
2024-10-31 | Manipulating Vehicle 3D Shapes through Latent Space Editing | JiangDong Miao et.al. | 2410.23931 | null |
2024-10-31 | Uncertainty Estimation for 3D Object Detection via Evidential Learning | Nikita Durasov et.al. | 2410.23910 | null |
2024-10-30 | NeFF-BioNet: Crop Biomass Prediction from Point Cloud to Drone Imagery | Xuesong Li et.al. | 2410.23901 | null |
2024-10-31 | Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts | Xiang Deng et.al. | 2410.23836 | null |
2024-10-31 | Generative AI for Accessible and Inclusive Extended Reality | Jens Grubert et.al. | 2410.23803 | null |
2024-10-31 | Open-Set 3D object detection in LiDAR data as an Out-of-Distribution problem | Louis Soum-Fontez et.al. | 2410.23767 | null |
2024-10-31 | Scaled Inverse Graphics: Efficiently Learning Large Sets of 3D Scenes | Karim Kassab et.al. | 2410.23742 | null |
2024-10-31 | GaussianMarker: Uncertainty-Aware Copyright Protection of 3D Gaussian Splatting | Xiufeng Huang et.al. | 2410.23718 | null |
2024-10-31 | XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM | Xiaomeng Wang et.al. | 2410.23690 | link |
2024-10-31 | GS-Blur: A 3D Scene-Based Dataset for Realistic Image Deblurring | Dongwoo Lee et.al. | 2410.23658 | link |
2024-10-31 | Deep Convolutional Neural Networks on Multiclass Classification of Three-Dimensional Brain Images for Parkinson’s Disease Stage Prediction | Guan-Hua Huang et.al. | 2410.23649 | null |
2024-11-06 | SceneComplete: Open-World 3D Scene Completion in Complex Real World Environments for Robot Manipulation | Aditya Agarwal et.al. | 2410.23643 | null |
2024-10-31 | SuctionPrompt: Visual-assisted Robotic Picking with a Suction Cup Using Vision-Language Models and Facile Hardware Design | Tomohiro Motoda et.al. | 2410.23640 | null |
2024-11-01 | Posture-Informed Muscular Force Learning for Robust Hand Pressure Estimation | Kyungjin Seo et.al. | 2410.23629 | null |
2024-10-31 | Dual Agent Learning Based Aerial Trajectory Tracking | Shaswat Garg et.al. | 2410.23571 | null |
2024-10-31 | Y-AR: A Mixed Reality CAD Tool for 3D Wire Bending | Shuo Feng et.al. | 2410.23540 | null |
2024-10-31 | 3D Pore-Scale Mixing Interface Evolution | Daniel M C Hallack et.al. | 2410.23539 | null |
2024-10-31 | On-demand microfluidic droplet pinching and splitting under local confinement gradients | Margaux Kerdraon et.al. | 2410.23538 | null |
2024-11-04 | Novel View Acoustic Parameter Estimation | Ricardo Falcon-Perez et.al. | 2410.23523 | null |
2024-10-31 | LBurst: Learning-Based Robotic Burst Feature Extraction for 3D Reconstruction in Low Light | Ahalya Ravendran et.al. | 2410.23522 | null |
2024-10-30 | Fractional Voigt-regularization of the 3D Navier–Stokes and Euler equations: Global well-posedness and limiting behavior | Zdzislaw Brzeźniak et.al. | 2410.23492 | null |
2024-10-24 | VECTOR: Velocity-Enhanced GRU Neural Network for Real-Time 3D UAV Trajectory Prediction | Omer Nacar et.al. | 2410.23305 | link |
2024-11-04 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et.al. | 2410.23262 | null |
2024-10-30 | PointRecon: Online Point-based 3D Reconstruction via Ray-based 2D-3D Matching | Chen Ziwen et.al. | 2410.23245 | null |
2024-10-30 | A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment | Matteo G. Mecattaf et.al. | 2410.23242 | link |
2024-10-30 | ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting | Muhammad Salman Ali et.al. | 2410.23213 | null |
2024-10-30 | Double BFV quantisation of 3d Gravity | Giovanni Canepa et.al. | 2410.23184 | null |
2024-10-30 | Dust extinction-curve variation in the translucent interstellar medium is driven by PAH growth | Xiangyu Zhang et.al. | 2410.23171 | null |
2024-10-30 | Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe | Songyu Xu et.al. | 2410.23154 | null |
2024-10-30 | Revisiting MAE pre-training for 3D medical image segmentation | Tassilo Wald et.al. | 2410.23132 | null |
2024-10-30 | Leader-Follower 3D Formation for Underwater Robots | Di Ni et.al. | 2410.23128 | null |
2024-10-31 | NASM: Neural Anisotropic Surface Meshing | Hongbo Li et.al. | 2410.23109 | null |
2024-10-30 | Automated Image-Based Identification and Consistent Classification of Fire Patterns with Quantitative Shape Analysis and Spatial Location Identification | Pengkun Liu et.al. | 2410.23105 | null |
2024-11-04 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak et.al. | 2410.23085 | null |
2024-10-30 | Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping | Qianxu Wang et.al. | 2410.23039 | null |
2024-10-30 | Efficient End-to-End 6-Dof Grasp Detection Framework for Edge Devices with Hierarchical Heatmaps and Feature Propagation | Kaiqin Yang. Yixiang Dai et.al. | 2410.22980 | null |
2024-10-30 | Goodbye Christoffel Symbols: A Flexible and Efficient Approach for Solving Physical Problems in Curved Spaces | Miguel A. Herrada et.al. | 2410.22957 | null |
2024-10-30 | Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder | Antoine Schnepf et.al. | 2410.22936 | null |
2024-10-30 | 3D Printable Plasmonic Titanium Nitride Nanoparticles Enhanced Thermoplastic Polyurethane Composite for Improved Photothermal De-Icing and Infrared Labeling | Siyu Lu et.al. | 2410.22934 | null |
2024-10-30 | UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration | Geng Li et.al. | 2410.22909 | null |
2024-10-31 | Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis | Zhiyuan Min et.al. | 2410.22817 | null |
2024-10-30 | Origin of the charge density wave state in BaFe $_2$Al$_9$ | Yuping Li et.al. | 2410.22734 | null |
2024-10-30 | SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark | HyunJun Jung et.al. | 2410.22715 | link |
2024-10-30 | Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted Images | Qi Song et.al. | 2410.22705 | null |
2024-10-29 | On the weak Lefschetz property for ideals generated by powers of general linear forms | Matthew D. Booth et.al. | 2410.22542 | null |
2024-11-03 | Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation | Zhaochong An et.al. | 2410.22489 | link |
2024-10-29 | Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection | Gyusam Chang et.al. | 2410.22461 | null |
2024-11-04 | AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks | Ali Saeizadeh et.al. | 2410.22437 | null |
2024-10-29 | Design and control of three-dimensional topological magnetic fields using interwoven helical nanostructures | John Fullerton et.al. | 2410.22429 | null |
2024-10-29 | Gradient Distance Function | Hieu Le et.al. | 2410.22422 | null |
2024-11-01 | Fast Transients from Magnetic Disks Around Non-Spinning Collapsar Black Holes | Justin Bopp et.al. | 2410.22401 | null |
2024-10-29 | Exploiting Semantic Scene Reconstruction for Estimating Building Envelope Characteristics | Chenghao Xu et.al. | 2410.22383 | null |
2024-10-29 | Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial Attention | Haomeng Zhang et.al. | 2410.22306 | link |
2024-10-29 | NCA-Morph: Medical Image Registration with Neural Cellular Automata | Amin Ranem et.al. | 2410.22265 | link |
2024-10-29 | Optimizing and Managing Wireless Backhaul for Resilient Next-Generation Cellular Networks | Gabriele Gemmi et.al. | 2410.22246 | null |
2024-10-29 | Guide3D: A Bi-planar X-ray Dataset for 3D Shape Reconstruction | Tudor Jianu et.al. | 2410.22224 | null |
2024-10-29 | Analyzing Multimodal Interaction Strategies for LLM-Assisted Manipulation of 3D Scenes | Junlong Chen et.al. | 2410.22177 | null |
2024-10-29 | Topological surface state dominated nonlinear transverse response and microwave rectification at room temperature | Qia Shen et.al. | 2410.22156 | null |
2024-10-30 | Learning Successor Features the Simple Way | Raymond Chua et.al. | 2410.22133 | link |
2024-10-29 | PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting | Sunghwan Hong et.al. | 2410.22128 | link |
2024-11-02 | TractShapeNet: Efficient Multi-Shape Learning with 3D Tractography Point Clouds | Yui Lo et.al. | 2410.22099 | link |
2024-10-29 | DINeuro: Distilling Knowledge from 2D Natural Images via Deformable Tubular Transferring Strategy for 3D Neuron Reconstruction | Yik San Cheng et.al. | 2410.22078 | null |
2024-10-29 | FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives | Qizhi Chen et.al. | 2410.22070 | null |
2024-10-29 | Numerical Calculation of the Hopf Index for 3D Magnetic Textures | Ross Knapman et.al. | 2410.22058 | null |
2024-10-29 | Buoyancy-driven flow regimes for a melting vertical ice cylinder in saline water | Dehao Xu et.al. | 2410.22050 | null |
2024-10-29 | GaiaUnlimited: The old stellar disc of the Milky Way as traced by the Red Clump | Shourya Khanna et.al. | 2410.22036 | null |
2024-10-29 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Anomaly Detection | Yuxuan Lin et.al. | 2410.21982 | link |
2024-11-02 | PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference | Kendong Liu et.al. | 2410.21966 | null |
2024-10-29 | Spatio-temporal Transformers for Action Unit Classification with Event Cameras | Luca Cultrera et.al. | 2410.21958 | null |
2024-10-29 | Depth Dependent Dynamics Explain the Equatorial Jet Difference Between Jupiter and Saturn | Keren Duer et.al. | 2410.21929 | null |
2024-10-29 | SceneGenAgent: Precise Industrial Scene Generation with Coding Agent | Xiao Xia et.al. | 2410.21909 | link |
2024-10-29 | Thermal Finite-Element Model of an Electric Machine Cooled by a Spray | Christian Bergfried et.al. | 2410.21875 | null |
2024-10-29 | Ideal Magnetohydrodynamics Around Couette Flow: Long Time Stability and Vorticity-Current Instability | Niklas Knobel et.al. | 2410.21835 | null |
2024-10-29 | Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images | Suhyun Ahn et.al. | 2410.21826 | link |
2024-10-29 | Skin dose in breast radiation therapy: Monte Carlo calculations from deformed vector fields (DVF)-driven CT images | Nicolas Arbor et.al. | 2410.21823 | null |
2024-10-29 | DOFS: A Real-world 3D Deformable Object Dataset with Full Spatial Information for Dynamics Model Learning | Zhen Zhang et.al. | 2410.21758 | null |
2024-10-29 | Memory-Efficient Point Cloud Registration via Overlapping Region Sampling | Tomoyasu Shimada et.al. | 2410.21753 | null |
2024-10-29 | MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding | Yuan Wang et.al. | 2410.21747 | null |
2024-11-07 | SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset | Yubin Hu et.al. | 2410.21739 | null |
2024-10-29 | Magnetization-Induced Phase Transitions on the surface of 3D Topological Insulators | Yu-Hao Wan et.al. | 2410.21684 | null |
2024-10-29 | Predicting the Encoding Error of SIRENs | Jeremy Vonderfecht et.al. | 2410.21645 | null |
2024-10-29 | OFER: Occluded Face Expression Reconstruction | Pratheba Selvaraju et.al. | 2410.21629 | null |
2024-10-28 | Topological numbers and their use to characterize simple points for 2D binary images | Christophe Lohou et.al. | 2410.21588 | null |
2024-10-28 | MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Yating Xu et.al. | 2410.21566 | link |
2024-10-28 | Detection of moving objects through turbulent media. Decomposition of Oscillatory vs Non-Oscillatory spatio-temporal vector fields | Jerome Gilles et.al. | 2410.21551 | null |
2024-10-28 | Constrained Transformer-Based Porous Media Generation to Spatial Distribution of Rock Properties | Zihan Ren et.al. | 2410.21462 | null |
2024-10-28 | TALE-teller: Tendon-Actuated Linked Element Robotic Testbed for Investigating Tail Functions | Margaret J. Zhang et.al. | 2410.21445 | null |
2024-10-28 | TACO: Adversarial Camouflage Optimization on Trucks to Fool Object Detectors | Adonisz Dimitriu et.al. | 2410.21443 | null |
2024-10-26 | Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading | Avinash Maurya et.al. | 2410.21316 | link |
2024-10-25 | ArCSEM: Artistic Colorization of SEM Images via Gaussian Splatting | Takuma Nishimura et.al. | 2410.21310 | null |
2024-10-28 | Boosting HI-Galaxy Cross-Clustering Signal through Higher-Order Cross-Correlations | Eishica Chand et.al. | 2410.21225 | null |
2024-11-07 | Symmetric similarity 3D coordinate transformation based on dual quaternion algorithm | Sebahattin Bektaş et.al. | 2410.21217 | null |
2024-10-28 | Exploring contextual modeling with linear complexity for point cloud segmentation | Yong Xien Chng et.al. | 2410.21211 | null |
2024-10-28 | The VSPEC Collection: A suite of utilities to model spectroscopic phase curves of 3D exoplanet atmospheres in the presence of stellar variability | Ted M Johnson et.al. | 2410.21190 | null |
2024-10-28 | Large Language Model-assisted Speech and Pointing Benefits Multiple 3D Object Selection in Virtual Reality | Junlong Chen et.al. | 2410.21091 | null |
2024-10-29 | EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior | Xin Xiang et.al. | 2410.20981 | null |
2024-10-28 | MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis | Di Qiu et.al. | 2410.20974 | null |
2024-10-28 | Improving Detection of Person Class Using Dense Pooling | Nouman Ahmad et.al. | 2410.20966 | link |
2024-10-28 | Direct imaging of carbohydrate stereochemistry | Shuning Cai et.al. | 2410.20897 | null |
2024-10-28 | Evaluating the Robustness of LiDAR Point Cloud Tracking Against Adversarial Attack | Shengjing Tian et.al. | 2410.20893 | null |
2024-10-28 | Synthetic Light Curves and Spectra for the Photospheric Phase of a 3D Stripped-Envelope Supernova Explosion Model | Thomas Maunder et.al. | 2410.20829 | null |
2024-10-28 | Projection-based Reduced Order Modelling for Unsteady Parametrized Optimal Control Problems in 3D Cardiovascular Flows | Surabhi Rathore et.al. | 2410.20828 | null |
2024-10-28 | Intermittency of bubble deformation in turbulence | Xu Xu et.al. | 2410.20826 | null |
2024-10-28 | Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting | Jiawei Xu et.al. | 2410.20815 | null |
2024-10-30 | Transformer-Based Tooth Alignment Prediction With Occlusion And Collision Constraints | ZhenXing Dong et.al. | 2410.20806 | null |
2024-10-28 | LoDAvatar: Hierarchical Embedding and Adaptive Levels of Detail with Gaussian Splatting for Enhanced Human Avatars | Xiaonuo Dongye et.al. | 2410.20789 | null |
2024-10-28 | Bidirectional Recurrence for Cardiac Motion Tracking with Gaussian Process Latent Coding | Jiewen Yang et.al. | 2410.20752 | link |
2024-10-29 | BLAPose: Enhancing 3D Human Pose Estimation with Bone Length Adjustment | Chih-Hsiang Hsu et.al. | 2410.20731 | link |
2024-10-28 | CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians | Chongjian Ge et.al. | 2410.20723 | null |
2024-10-28 | Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design | Xiangxin Zhou et.al. | 2410.20688 | link |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | link |
2024-10-28 | TurboHopp: Accelerated Molecule Scaffold Hopping with Consistency Models | Kiwoong Yoo et.al. | 2410.20660 | link |
2024-10-27 | Three-part structure of solar coronal mass ejection observed in low coronal signatures of Solar Orbiter | Tatiana Podladchikova et.al. | 2410.20603 | null |
2024-10-27 | Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering | Meng Wei et.al. | 2410.20593 | null |
2024-10-27 | Neural rendering enables dynamic tomography | Ivan Grega et.al. | 2410.20558 | null |
2024-10-27 | SympCam: Remote Optical Measurement of Sympathetic Arousal | Björn Braun et.al. | 2410.20552 | null |
2024-10-27 | On the well-posedness of the Hall-MHD system in a critical setting of Besov-Morrey type | Lucas C. F. Ferreira et.al. | 2410.20465 | null |
2024-10-27 | BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events | Yijin Li et.al. | 2410.20451 | null |
2024-11-02 | Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Cloud Analysis | Hongyu Sun et.al. | 2410.20406 | link |
2024-10-27 | Composite running vacuum in the Universe: implications on the cosmological tensions | Joan Solà Peracaula et.al. | 2410.20382 | null |
2024-11-01 | RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior | Mingjiang Liang et.al. | 2410.20358 | null |
2024-10-27 | UTSRMorph: A Unified Transformer and Superresolution Network for Unsupervised Medical Image Registration | Runshi Zhang et.al. | 2410.20348 | link |
2024-10-27 | Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions | Rawal Khirodkar et.al. | 2410.20294 | null |
2024-10-26 | Machine Learning based Glitch Veto for inspiral binary merger signals using Linear Chirp Transform | N. Arutkeerthi et.al. | 2410.20269 | null |
2024-10-26 | Learning Approximated Maximal Safe Sets via Hypernetworks for MPC-Based Local Motion Planning | Bojan Derajić et.al. | 2410.20267 | null |
2024-10-26 | Equivariant Blurring Diffusion for Hierarchical Molecular Conformer Generation | Jiwoong Park et.al. | 2410.20255 | link |
2024-10-26 | Neural Fields in Robotics: A Survey | Muhammad Zubair Irshad et.al. | 2410.20220 | link |
2024-10-26 | GeoFUSE: A High-Efficiency Surrogate Model for Seawater Intrusion Prediction and Uncertainty Reduction | Su Jiang et.al. | 2410.20118 | null |
2024-10-26 | Anatomical 3D Style Transfer Enabling Efficient Federated Learning with Extremely Low Communication Costs | Yuto Shibata et.al. | 2410.20102 | null |
2024-10-26 | 3D Distance-color-coded Assessment of PCI Stent Apposition via Deep-learning-based Three-dimensional Multi-object Segmentation | Xiaoyang Qin et.al. | 2410.20055 | null |
2024-10-26 | SCube: Instant Large-Scale Scene Reconstruction using VoxSplats | Xuanchi Ren et.al. | 2410.20030 | null |
2024-10-25 | Unsupervised Machine Learning for Detecting and Locating Human-Made Objects in 3D Point Cloud | Hong Zhao et.al. | 2410.20006 | null |
2024-10-25 | First Principles Excitons in Periodic Systems with Gaussian Density Fitting and Ewald Potential Functions | M. A. García-Blázquez et.al. | 2410.19945 | null |
2024-10-25 | Tracking and triangulating firefly flashes in field recordings | Raphael Sarfati et.al. | 2410.19932 | link |
2024-10-21 | YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning | Ranjan Sapkota et.al. | 2410.19846 | null |
2024-10-19 | GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh Smoothing | Zhichao Wang et.al. | 2410.19834 | null |
2024-10-16 | Radon Implicit Field Transform (RIFT): Learning Scenes from Radar Signals | Daqian Bao et.al. | 2410.19801 | link |
2024-10-25 | IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Kaixian Qu et.al. | 2410.19697 | null |
2024-10-30 | DiffGS: Functional Gaussian Splatting Diffusion | Junsheng Zhou et.al. | 2410.19657 | null |
2024-10-25 | Toward Generalizable Multiple Sclerosis Lesion Segmentation Models | Liviu Badea et.al. | 2410.19623 | null |
2024-10-25 | MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors | Fanqi Pu et.al. | 2410.19590 | link |
2024-10-25 | FastPCI: Motion-Structure Guided Fast Point Cloud Frame Interpolation | Tianyu Zhang et.al. | 2410.19573 | link |
2024-10-25 | Prediction of microstructural representativity from a single image | Amir Dahari et.al. | 2410.19568 | link |
2024-10-25 | Robotic Learning in your Backyard: A Neural Simulator from Open Source Components | Liyou Zhou et.al. | 2410.19564 | link |
2024-10-25 | Interface energies of Ga2O3 phases with the sapphire substrate and the phase-locked epitaxy of metastable structures explained | Ilaria Bertoni et.al. | 2410.19530 | null |
2024-10-25 | Nutation-orbit resonances: The origin of the chaotic rotation of Hyperion and the barrel instability | Max Goldberg et.al. | 2410.19518 | null |
2024-10-25 | Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization | Weihang Liu et.al. | 2410.19483 | link |
2024-10-25 | Evaluation of strategies for efficient rate-distortion NeRF streaming | Pedro Martin et.al. | 2410.19459 | null |
2024-10-25 | Project Lx Conventos: Travelling through space and time in Lisbon’s religious buildings | Joao Gouveia et.al. | 2410.19455 | null |
2024-10-25 | Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation | Yao Wu et.al. | 2410.19446 | link |
2024-10-25 | Paint Bucket Colorization Using Anime Character Color Design Sheets | Yuekun Dai et.al. | 2410.19424 | link |
2024-10-25 | Cold day-side winds shape large leading streams in evaporating exoplanet atmospheres | F. Nail et.al. | 2410.19381 | null |
2024-10-25 | DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems | Muhammad Zaeem Shahzad et.al. | 2410.19336 | null |
2024-10-29 | Non-rigid Relative Placement through 3D Dense Diffusion | Eric Cai et.al. | 2410.19247 | null |
2024-10-24 | SoftSnap: Rapid Prototyping of Untethered Soft Robots Using Snap-Together Modules | Luyang Zhao et.al. | 2410.19169 | null |
2024-10-24 | Nanoscale magnetic ordering dynamics in a high Curie temperature ferromagnet | Yueh-Chun Wu et.al. | 2410.19158 | null |
2024-10-24 | MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision | Ruicheng Wang et.al. | 2410.19115 | null |
2024-10-24 | Bio2Token: All-atom tokenization of any biomolecular structure with Mamba | Andrew Liu et.al. | 2410.19110 | null |
2024-10-24 | Earth-like exoplanets in spin-orbit resonances: climate dynamics, 3D atmospheric chemistry, and observational signatures | Marrick Braam et.al. | 2410.19108 | link |
2024-10-28 | BIFRÖST: 3D-Aware Image compositing with Language Instructions | Lingxiao Li et.al. | 2410.19079 | link |
2024-10-24 | Parallelization of Network Dynamics Computations in Heterogeneous Distributed Environment | Oleksandr Sudakov et.al. | 2410.19075 | null |
2024-10-24 | Continuity of the solution map of some active scalar equations in Hölder and Zygmund spaces | Marc Magaña et.al. | 2410.19057 | null |
2024-10-24 | Star-triangle dualities and supersymmetric improved bifundamentals | Sergio Benvenuti et.al. | 2410.19049 | null |
2024-10-24 | Aspects of Entanglement Entropy in $3d$ $\mathcal{N}=2$ SCFTs | Pedro Vicente Marto et.al. | 2410.19044 | null |
2024-10-24 | PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views | Xin Fei et.al. | 2410.18979 | link |
2024-10-24 | 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation | Hansheng Chen et.al. | 2410.18974 | link |
2024-10-30 | Large Spatial Model: End-to-end Unposed Images to Semantic 3D | Zhiwen Fan et.al. | 2410.18956 | link |
2024-10-24 | Exact solutions for topological surface states of three-dimensional lattice models | Matias Mustonen et.al. | 2410.18934 | null |
2024-10-24 | Sort-free Gaussian Splatting via Weighted Sum Rendering | Qiqi Hou et.al. | 2410.18931 | null |
2024-10-24 | Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling | Mingtong Zhang et.al. | 2410.18912 | null |
2024-10-24 | SKATR: A Self-Supervised Summary Transformer for SKA | Ayodele Ore et.al. | 2410.18899 | link |
2024-10-27 | Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis | Liang Han et.al. | 2410.18822 | null |
2024-10-24 | Learning Geodesics of Geometric Shape Deformations From Images | Nian Wu et.al. | 2410.18797 | null |
2024-10-24 | Online path planning for kinematic-constrained UAVs in a dynamic environment based on a Differential Evolution algorithm | Elias J. R. Freitas et.al. | 2410.18777 | null |
2024-10-24 | Quantum Hall effect and current distribution in the 3D topological insulator HgTe | S. Hartl et.al. | 2410.18759 | null |
2024-10-24 | Efficient simulation of quarkonium master equation beyond the dipole approximation | Jorge M. Mtz-Vera et.al. | 2410.18709 | null |
2024-10-24 | Rigid Single-Slice-in-Volume registration via rotation-equivariant 2D/3D feature matching | Stefan Brandstätter et.al. | 2410.18683 | null |
2024-10-24 | 3D Shape Completion with Test-Time Training | Michael Schopf-Kuester et.al. | 2410.18668 | link |
2024-10-24 | Embodied Manipulation with Past and Future Morphologies through an Open Parametric Hand Design | Kieran Gilday et.al. | 2410.18633 | null |
2024-10-24 | A Cranial-Feature-Based Registration Scheme for Robotic Micromanipulation Using a Microscopic Stereo Camera System | Xiaofeng Lin et.al. | 2410.18630 | null |
2024-10-24 | Estimating early coronal mass ejection propagation direction with DIRECD during the severe May 8 and follow-up June 8, 2024 events | Shantanu Jain et.al. | 2410.18549 | null |
2024-10-24 | Anatomy of a Fall: Stationary and super-Keplerian spiral arms generated by accretion streamers in protostellar discs | Josh Calcino et.al. | 2410.18521 | link |
2024-10-24 | Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction | Hongxin Peng et.al. | 2410.18433 | null |
2024-10-24 | GPU Accelerated 3D P-wave Source Free Adaptive Wavefield Reconstruction Inversion with an application to experimental VSP physical modeling data | Zhilong Fang et.al. | 2410.18429 | null |
2024-10-24 | Scale Propagation Network for Generalizable Depth Completion | Haotian Wang et.al. | 2410.18408 | link |
2024-10-24 | Structure Language Models for Protein Conformation Generation | Jiarui Lu et.al. | 2410.18403 | null |
2024-10-24 | Irregular Tensor Low-Rank Representation for Hyperspectral Image Representation | Bo Han et.al. | 2410.18388 | link |
2024-10-24 | Integrating Canonical Neural Units and Multi-Scale Training for Handwritten Text Recognition | Zi-Rui Wang et.al. | 2410.18374 | null |
2024-10-24 | Real-time 3D-aware Portrait Video Relighting | Ziqi Cai et.al. | 2410.18355 | link |
2024-10-23 | Floquet Codes from Coupled Spin Chains | Bowen Yan et.al. | 2410.18265 | null |
2024-10-23 | A Methodology for Transformer Ratio Adjustment in Small-Size Rotary Transformers | Saeed Hajmohammadi et.al. | 2410.18217 | null |
2024-10-23 | Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments | Luca Barsellotti et.al. | 2410.18195 | link |
2024-10-23 | POSEIDON: A Multidimensional Atmospheric Retrieval Code for Exoplanet Spectra | Ryan J. MacDonald et.al. | 2410.18181 | null |
2024-10-23 | Bridging the Diagnostic Divide: Classical Computer Vision and Advanced AI methods for distinguishing ITB and CD through CTE Scans | Shashwat Gupta et.al. | 2410.18161 | link |
2024-10-22 | Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies | Shrey Vishen et.al. | 2410.18137 | link |
2024-10-23 | DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes | Hengwei Bian et.al. | 2410.18084 | null |
2024-10-23 | The physical properties of Cluster Chains | Laura Posch et.al. | 2410.18080 | null |
2024-10-23 | FreeVS: Generative View Synthesis on Free Driving Trajectory | Qitai Wang et.al. | 2410.18079 | null |
2024-10-23 | A geometrical description of untwisted Dijkgraaf-Witten TQFT with defects | João Faría Martins et.al. | 2410.18049 | null |
2024-10-23 | VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points | Linus Franke et.al. | 2410.17932 | null |
2024-10-23 | Relaxed Equivariance via Multitask Learning | Ahmed A. Elhag et.al. | 2410.17878 | null |
2024-10-23 | Gaussian Process Distance Fields Obstacle and Ground Constraints for Safe Navigation | Monisha Mushtary Uttsha et.al. | 2410.17831 | null |
2024-10-23 | Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds | Kai Liu et.al. | 2410.17823 | link |
2024-10-23 | GenUDC: High Quality 3D Mesh Generation with Unsigned Dual Contouring Representation | Ruowei Wang et.al. | 2410.17802 | link |
2024-10-23 | Facile One Pot Synthesis of Hybrid Core-Shell Silica-Based Sensors for Live Imaging of Dissolved Oxygen and Hypoxia Mapping in 3D cell models | Helena Iuele et.al. | 2410.17797 | null |
2024-10-24 | Anomalous conductance steps in 3D TI HgTe-based quantum point contacts | Elisabeth Richter et.al. | 2410.17786 | null |
2024-10-23 | Efficient Neural Implicit Representation for 3D Human Reconstruction | Zexu Huang et.al. | 2410.17741 | link |
2024-10-23 | Under the magnifying glass: A combined 3D model applied to cloudy warm Saturn type exoplanets around M-dwarfs | Sven Kiefer et.al. | 2410.17716 | null |
2024-10-23 | Constraining the CSM structure and progenitor mass-loss history of interacting supernovae through 3D hydrodynamic modeling: The case of SN 2014C | S. Orlando et.al. | 2410.17699 | null |
2024-10-23 | Deep Generative Models for 3D Medical Image Synthesis | Paul Friedrich et.al. | 2410.17664 | null |
2024-10-23 | A generalized Frenet frame for computing MHD equilibria in stellarators | Florian J. Hindenlang et.al. | 2410.17595 | null |
2024-10-23 | Energy-Optimal Planning of Waypoint-Based UAV Missions – Does Minimum Distance Mean Minimum Energy? | Nicolas Michel et.al. | 2410.17585 | null |
2024-10-23 | Ultra-reliable urban air mobility networks | Hyunsoo Kim et.al. | 2410.17572 | null |
2024-10-23 | Generalizable Motion Planning via Operator Learning | Sharath Matada et.al. | 2410.17547 | null |
2024-10-23 | Improving Connectivity of RIS-Assisted UAV Networks using RIS Partitioning and Deployment | Mohammed Saif et.al. | 2410.17541 | null |
2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
2024-10-23 | GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policy | Yixuan Wang et.al. | 2410.17488 | null |
2024-10-22 | AG-SLAM: Active Gaussian Splatting SLAM | Wen Jiang et.al. | 2410.17422 | null |
2024-10-22 | A 3D Model of the Local Bubble’s Magnetic Field: Insights from Dust and Starlight Polarization | Theo J. O’Neill et.al. | 2410.17341 | null |
2024-10-22 | SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes | Cheng-De Fan et.al. | 2410.17249 | null |
2024-10-22 | LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias | Haian Jin et.al. | 2410.17242 | null |
2024-10-22 | Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding | Yasha Ektefaie et.al. | 2410.17173 | link |
2024-10-22 | Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study | J. Jorge et.al. | 2410.17171 | null |
2024-10-22 | A Parallelized 3D Geomechanical Solver for Fluid-induced Fault Slip in Poroelastic Media | Emil Rinatovich Gallyamov et.al. | 2410.17133 | null |
2024-10-18 | GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting | Yusen Xie et.al. | 2410.17084 | null |
2024-10-23 | Global strong solution for the stochastic tamed Chemotaxis-Navier-Stokes system in $\mathbb{R}^3$ | Fan Xu et.al. | 2410.17059 | null |
2024-10-22 | SPVSoAP3D: A Second-order Average Pooling Approach to enhance 3D Place Recognition in Horticultural Environments | T. Barros et.al. | 2410.17017 | link |
2024-10-22 | Joint Point Cloud Upsampling and Cleaning with Octree-based CNNs | Jihe Li et.al. | 2410.17001 | link |
2024-10-22 | E-3DGS: Gaussian Splatting with Exposure and Motion Events | Xiaoting Yin et.al. | 2410.16995 | link |
2024-10-22 | Multi-Layer Gaussian Splatting for Immersive Anatomy Visualization | Constantin Kleinbeck et.al. | 2410.16978 | link |
2024-10-22 | IdenBAT: Disentangled Representation Learning for Identity-Preserved Brain Age Transformation | Junyeong Maeng et.al. | 2410.16945 | link |
2024-10-22 | Automatic Extraction and Compensation of P-Bit Device Variations in Large Array Utilizing Boltzmann Machine Training | Bolin Zhang et.al. | 2410.16915 | null |
2024-10-22 | VistaDream: Sampling multiview consistent images for single-view scene reconstruction | Haiping Wang et.al. | 2410.16892 | null |
2024-10-22 | Toolpath Generation for High Density Spatial Fiber Printing Guided by Principal Stresses | Tianyu Zhang et.al. | 2410.16851 | null |
2024-10-22 | Properties of magnetic null points associated with X-class flares during solar cycle 24 | R. L. Edgar et.al. | 2410.16778 | null |
2024-10-22 | The Scene Language: Representing Scenes with Programs, Words, and Embeddings | Yunzhi Zhang et.al. | 2410.16770 | null |
2024-10-22 | Universal flops of length 1 and 2 from D2-branes at surface singularities | Marina Moleti et.al. | 2410.16767 | null |
2024-10-22 | Lyman- $α$ forest power spectrum and its cross-correlation with dark matter halos in different astrophysical models | Koichiro Nakashima et.al. | 2410.16740 | null |
2024-10-22 | Wave function forms of interlayer excitons in bilayer transition metal dichalcogenides | Jianju Tang et.al. | 2410.16717 | null |
2024-10-22 | Efficient Antibody Structure Refinement Using Energy-Guided SE(3) Flow Matching | Jiying Zhang et.al. | 2410.16673 | null |
2024-10-26 | Extending the FDTD GVADE method nonlinear polarization vector to include anisotropy | Caleb J. Grimms et.al. | 2410.16622 | null |
2024-10-22 | The second order Huang-Yang formula to the 3D Fermi gas: the Gross-Pitaevskii regime | Xuwen Chen et.al. | 2410.16620 | null |
2024-10-22 | Three-Dimensional Particle-In-Cell Simulations of Two-Dimensional Bernstein-Greene-Kruskal Modes | M. T. Franciscovich et.al. | 2410.16585 | link |
2024-10-30 | SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects | Jiayi Liu et.al. | 2410.16499 | null |
2024-10-21 | Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions | Malte Prinzler et.al. | 2410.16395 | null |
2024-10-21 | Disambiguating Monocular Reconstruction of 3D Clothed Human with Spatial-Temporal Transformer | Yong Deng et.al. | 2410.16337 | null |
2024-10-16 | Navigating the Digital Chain in Concrete 3D Printing | Ali El Hage et.al. | 2410.16319 | null |
2024-10-21 | MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors | Honghua Chen et.al. | 2410.16272 | null |
2024-10-21 | FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors | Chin-Yang Lin et.al. | 2410.16271 | null |
2024-10-21 | 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | Xi Liu et.al. | 2410.16266 | null |
2024-10-21 | Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles | Fraser Douglas et.al. | 2410.16262 | null |
2024-10-21 | Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos | Gengshan Yang et.al. | 2410.16259 | null |
2024-10-21 | Comparative analysis of 3D-CNN models, GARCH-ANN, and VAR models for determining equity prices | Sydney Anuyah Mary Akinyemi et.al. | 2410.16205 | null |
2024-10-21 | Simulating quantum emitters in arbitrary photonic environments using FDTD: beyond the semi-classical regime | Qingyi Zhou et.al. | 2410.16118 | link |
2024-10-21 | Virtual Reality Games: Extending Unity Learn Games to VR | Ryan P. McMahan et.al. | 2410.16061 | null |
2024-10-21 | Amorphization-induced topological and insulator-metal transitions in bidimensional Bi $x$Sb${1-x}$ alloys | A. J. Uría-Álvarez et.al. | 2410.16034 | null |
2024-10-21 | HyperDrive: Scheduling Serverless Functions in the Edge-Cloud-Space 3D Continuum | Thomas Pusztai et.al. | 2410.16026 | link |
2024-10-21 | 3D-GANTex: 3D Face Reconstruction with StyleGAN3-based Multi-View Images and 3DDFA based Mesh Generation | Rohit Das et.al. | 2410.16009 | link |
2024-10-21 | Lossless optimal transient control for rigid bodies in 3D space | Riccardo Zanella et.al. | 2410.15984 | null |
2024-10-21 | Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly | Junsheng Zhou et.al. | 2410.15971 | null |
2024-10-21 | MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering | Jiayi Song et.al. | 2410.15941 | null |
2024-10-21 | Fully distributed and resilient source seeking for robot swarms | Jesús Bautista et.al. | 2410.15921 | null |
2024-10-21 | TexPro: Text-guided PBR Texturing with Procedural Material Modeling | Ziqiang Dang et.al. | 2410.15891 | null |
2024-10-21 | Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images | Yiming Li et.al. | 2410.15879 | null |
2024-10-21 | R2I-rPPG: A Robust Region of Interest Selection Method for Remote Photoplethysmography to Extract Heart Rate | Sandeep Nagar et.al. | 2410.15851 | null |
2024-10-21 | LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training | Thomas Kreutz et.al. | 2410.15833 | link |
2024-10-21 | Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation | Pei Liu et.al. | 2410.15814 | null |
2024-10-21 | Probing Na in giant exoplanets with ESPRESSO and 3D NLTE stellar spectra | G. Canocchi et.al. | 2410.15810 | null |
2024-10-27 | WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction | Heng Zhai et.al. | 2410.15792 | null |
2024-10-21 | Possible way to achieve anomalous valley Hall effect by tunable intrinsic piezoelectric polarization in FeO $_2$SiGeN$_2$ monolayer | Jianke Tian et.al. | 2410.15786 | null |
2024-10-23 | Improving Instance Optimization in Deformable Image Registration with Gradient Projection | Yi Zhang et.al. | 2410.15767 | null |
2024-10-21 | Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment | Yankai Jiang et.al. | 2410.15744 | null |
2024-10-21 | 3D Optofluidic Control Using Reconfigurable Thermal Barriers | Falko Schmidt et.al. | 2410.15708 | null |
2024-10-21 | RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration | Pengcheng Shi et.al. | 2410.15682 | link |
2024-10-22 | LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images | Hao He et.al. | 2410.15636 | null |
2024-10-22 | Fully Explicit Dynamic Gaussian Splatting | Junoh Lee et.al. | 2410.15629 | null |
2024-10-21 | Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding | Yang Liu et.al. | 2410.15615 | null |
2024-10-21 | ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos | Tao Tang et.al. | 2410.15582 | link |
2024-10-20 | Convolution tensor decomposition for efficient high-resolution solutions to the Allen-Cahn equation | Ye Lu et.al. | 2410.15519 | null |
2024-10-20 | Taming Mambas for Voxel Level 3D Medical Image Segmentation | Luca Lumetti et.al. | 2410.15496 | null |
2024-10-20 | AssemblyComplete: 3D Combinatorial Construction with Deep Reinforcement Learning | Alan Chen et.al. | 2410.15469 | null |
2024-10-20 | MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications | Yongrui Yu et.al. | 2410.15432 | null |
2024-10-20 | Evaluation of Human-Robot Interfaces based on 2D/3D Visual and Haptic Feedback for Aerial Manipulation | Julien Mellet et.al. | 2410.15398 | null |
2024-10-22 | EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting | Bohao Liao et.al. | 2410.15392 | null |
2024-10-20 | Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint | Junwei Zhou et.al. | 2410.15391 | null |
2024-10-20 | Neural Active Structure-from-Motion in Dark and Textureless Environment | Kazuto Ichimaru et.al. | 2410.15378 | null |
2024-10-20 | ActiveNeuS: Neural Signed Distance Fields for Active Stereo | Kazuto Ichimaru et.al. | 2410.15376 | null |
2024-10-20 | Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing | Daniya Najiha Abdul Kareem et.al. | 2410.15360 | link |
2024-10-20 | POSE: Pose estimation Of virtual Sync Exhibit system | Hao-Tang Tsui et.al. | 2410.15343 | link |
2024-10-20 | Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image | Yu Zhao et.al. | 2410.15312 | null |
2024-10-20 | Likelihood-Free Inference and Hierarchical Data Assimilation for Geological Carbon Storage | Wenchao Teng et.al. | 2410.15302 | null |
2024-10-20 | Electronic correlations and spin-charge-density stripes in double-layer La $_3$Ni$_2$O$_7$ | I. V. Leonov et.al. | 2410.15298 | null |
2024-10-20 | Fusion of Time and Angle Measurements for Digital-Twin-Aided Probabilistic 3D Positioning | Vincent Corlay et.al. | 2410.15237 | null |
2024-10-19 | CLIPtortionist: Zero-shot Text-driven Deformation for Manufactured 3D Shapes | Xianghao Xu et.al. | 2410.15199 | null |
2024-10-19 | Semantically Safe Robot Manipulation: From Semantic Scene Understanding to Motion Safeguards | Lukas Brunke et.al. | 2410.15185 | null |
2024-10-19 | Implicit neural representation for free-breathing MR fingerprinting (INR-MRF): co-registered 3D whole-liver water T1, water T2, proton density fat fraction, and R2* mapping | Chao Li et.al. | 2410.15175 | null |
2024-10-19 | EndoMetric: Near-light metric scale monocular SLAM | Raúl Iranzo et.al. | 2410.15065 | null |
2024-10-19 | SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning | Zhewei Dai et.al. | 2410.14987 | link |
2024-10-19 | 3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving | Linh Van Ma et.al. | 2410.14977 | link |
2024-10-19 | ImmerseDiffusion: A Generative Spatial Audio Latent Diffusion Model | Mojtaba Heydari et.al. | 2410.14945 | null |
2024-10-19 | Development of a Simple and Novel Digital Twin Framework for Industrial Robots in Intelligent robotics manufacturing | Tianyi Xiang et.al. | 2410.14934 | null |
2024-10-19 | A Novel Approach to Grasping Control of Soft Robotic Grippers based on Digital Twin | Tianyi Xiang et.al. | 2410.14928 | null |
2024-10-18 | Low-latitude magnetic flux emergence on rapidly rotating solar-type stars | Emre Işık et.al. | 2410.14869 | null |
2024-10-18 | Knitting Multistability | Kausalya Mahadevan et.al. | 2410.14810 | null |
2024-10-18 | Little time for oscillation: Fast disruption of the Radcliffe Wave by Galactic motions | Guang-Xing Li et.al. | 2410.14603 | null |
2024-10-18 | Graph Optimality-Aware Stochastic LiDAR Bundle Adjustment with Progressive Spatial Smoothing | Jianping Li et.al. | 2410.14565 | null |
2024-10-18 | Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior | Calvin-Khang Ta et.al. | 2410.14540 | null |
2024-10-18 | Neural Real-Time Recalibration for Infrared Multi-Camera Systems | Benyamin Mehmandar et.al. | 2410.14505 | link |
2024-10-18 | LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes | Juliette Marrie et.al. | 2410.14462 | null |
2024-10-18 | Kinematical signatures: Distinguishing between warps and radial flows | A. Zuleta et.al. | 2410.14457 | null |
2024-10-18 | Sim2real Cattle Joint Estimation in 3D point clouds | Okour Mohammad et.al. | 2410.14419 | null |
2024-10-18 | 2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization | Junan Chen et.al. | 2410.14343 | null |
2024-10-18 | A dual physics-informed neural network for topology optimization | Ajendra Singh et.al. | 2410.14342 | null |
2024-10-18 | Transferring Tactile Data Across Sensors | Wadhah Zai El Amri et.al. | 2410.14310 | null |
2024-10-18 | Equilibrium and out-of-equilibrium critical dynamics of the three-dimensional Heisenberg model with random cubic anisotropy | A. Astillero et.al. | 2410.14275 | null |
2024-10-18 | Shape Transformation Driven by Active Contour for Class-Imbalanced Semi-Supervised Medical Image Segmentation | Yuliang Gu et.al. | 2410.14210 | link |
2024-10-18 | E3D-GPT: Enhanced 3D Visual Foundation for Medical Vision-Language Model | Haoran Lai et.al. | 2410.14200 | null |
2024-10-18 | Neural Signed Distance Function Inference through Splatting 3D Gaussians Pulled on Zero-Level Set | Wenyuan Zhang et.al. | 2410.14189 | null |
2024-10-27 | Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping | Renguang Chen et.al. | 2410.14161 | null |
2024-10-17 | A multi-detector neutral helium atom microscope | Chenyang Zhao et.al. | 2410.13955 | null |
2024-10-17 | M-theory geometric engineering for rank-0 3d $\mathcal{N}=2$ theories | Andrea Sangiovanni et.al. | 2410.13943 | null |
2024-10-17 | Inference of morphology and dynamical state of nearby $Planck$ -SZ galaxy clusters with Zernike polynomials | Valentina Capalbo et.al. | 2410.13929 | null |
2024-10-17 | ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding | Guangda Ji et.al. | 2410.13924 | link |
2024-10-17 | GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction | Patrick Kwon et.al. | 2410.13911 | null |
2024-10-17 | UniDrive: Towards Universal Driving Perception Across Camera Configurations | Ye Li et.al. | 2410.13864 | link |
2024-10-17 | DepthSplat: Connecting Gaussian Splatting and Depth | Haofei Xu et.al. | 2410.13862 | link |
2024-10-17 | VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Runsen Xu et.al. | 2410.13860 | link |
2024-10-17 | DPLM-2: A Multimodal Diffusion Protein Language Model | Xinyou Wang et.al. | 2410.13782 | null |
2024-10-17 | MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes | Xinjie Zhang et.al. | 2410.13613 | null |
2024-10-24 | DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering | Jiahao Lu et.al. | 2410.13607 | link |
2024-10-17 | Measurement-free, scalable and fault-tolerant universal quantum computing | Friederike Butt et.al. | 2410.13568 | null |
2024-10-17 | L3DG: Latent 3D Gaussian Diffusion | Barbara Roessle et.al. | 2410.13530 | null |
2024-10-17 | Geometry-influenced cooling performance of lithium-ion battery | Dwijendra Dubey et.al. | 2410.13513 | null |
2024-10-17 | Non-uniform Fourier Domain Stretching method for ultra-wide-angle wave propagation | Tomasz Kozacki et.al. | 2410.13474 | null |
2024-10-17 | Dispersion of compressible rotating Euler equations with low Mach and Rossby numbers | Pengcheng Mu et.al. | 2410.13468 | null |
2024-10-17 | Fabrication of functional 3D nanoarchitectures via atomic layer deposition on DNA origami crystals | Arthur Ermatov et.al. | 2410.13393 | null |
2024-10-17 | The ESRF dark-field x-ray microscope at ID03 | H. Isern et.al. | 2410.13391 | null |
2024-10-17 | Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation | Florian Wulff et.al. | 2410.13383 | null |
2024-10-17 | Accurate Checkerboard Corner Detection under Defoucs | Zezhun Shi et.al. | 2410.13371 | link |
2024-10-17 | Self-Supervised Scene Flow Estimation with Point-Voxel Fusion and Surface Representation | Xuezhi Xiang et.al. | 2410.13355 | null |
2024-10-17 | Applying the Velocity Gradient Technique in NGC 1333: Comparison with Dust Polarization Observations | Archana Soam et.al. | 2410.13350 | null |
2024-10-17 | GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting | Shuichang Lai et.al. | 2410.13349 | null |
2024-10-17 | Enhancing 1-Second 3D SELD Performance with Filter Bank Analysis and SCConv Integration in CST-Former | Zhehui Zhang et.al. | 2410.13328 | null |
2024-10-17 | Inner ear morphology in wild versus laboratory house mice | Sabrina Renaud et.al. | 2410.13325 | null |
2024-10-17 | Curling morphology of knitted fabrics: Structure and Mechanics | Kotone Tajiri et.al. | 2410.13307 | null |
2024-10-17 | PiLocNet: Physics-informed neural network on 3D localization with rotating point spread function | Mingda Lu et.al. | 2410.13295 | null |
2024-10-26 | LESS: Label-Efficient and Single-Stage Referring 3D Segmentation | Xuexun Liu et.al. | 2410.13294 | link |
2024-10-17 | Hybrid bundle-adjusting 3D Gaussians for view consistent rendering with pose optimization | Yanan Guo et.al. | 2410.13280 | link |
2024-10-17 | TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal | Yanpeng Jia et.al. | 2410.13240 | null |
2024-10-18 | UniG: Modelling Unitary 3D Gaussians for View-consistent 3D Reconstruction | Jiamin Wu et.al. | 2410.13195 | link |
2024-10-16 | Exploring Nanoscale Photoresponse Mechanisms for Enhanced Photothermoelectric Effects in van der Waals Interfaces | Da Xu et.al. | 2410.13052 | null |
2024-10-16 | UniCoN: Universal Conditional Networks for Multi-Age Embryonic Cartilage Segmentation with Sparsely Annotated Data | Nishchal Sapkota et.al. | 2410.13043 | null |
2024-10-16 | Geometric Trajectory Diffusion Models | Jiaqi Han et.al. | 2410.13027 | link |
2024-10-16 | Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation | Anthony Opipari et.al. | 2410.12995 | null |
2024-10-16 | DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model | Jingxiang Sun et.al. | 2410.12928 | null |
2024-10-16 | Wreathing, Discrete Gauging, and Non-invertible Symmetries | Julius F. Grimminger et.al. | 2410.12906 | null |
2024-10-16 | Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats | Chen Ziwen et.al. | 2410.12781 | null |
2024-10-16 | Gravity-aligned Rotation Averaging with Circular Regression | Linfei Pan et.al. | 2410.12763 | link |
2024-10-16 | Optimizing 3D Geometry Reconstruction from Implicit Neural Representations | Shen Fan et.al. | 2410.12725 | null |
2024-10-16 | VividMed: Vision Language Model with Versatile Visual Grounding for Medicine | Lingxiao Luo et.al. | 2410.12694 | link |
2024-10-16 | MambaBEV: An efficient 3D detection model with Mamba2 | Zihan You et.al. | 2410.12673 | null |
2024-10-16 | A comparative analysis of metamodels for lumped cardiovascular models, and pipeline for sensitivity analysis, parameter estimation, and uncertainty quantification | John M. Hanna et.al. | 2410.12654 | null |
2024-10-16 | Contrasting results of surface metrology techniques for three-dimensional human fingerprints | Brian Lee Beatty et.al. | 2410.12648 | null |
2024-10-16 | Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans | Luca Marsilio et.al. | 2410.12641 | null |
2024-10-16 | Sparse flow reconstruction methods to reduce the costs of analyzing large unsteady datasets | Spencer L. Stahl et.al. | 2410.12627 | null |
2024-10-16 | Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion | Minkyoung Cho et.al. | 2410.12592 | null |
2024-10-22 | A finite difference method with symmetry properties for the high-dimensional Bratu equation | Muhammad Luthfi Shahab et.al. | 2410.12553 | link |
2024-10-16 | Real-time Stereo-based 3D Object Detection for Streaming Perception | Changcai Li et.al. | 2410.12394 | link |
2024-10-16 | LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment | Juelin Zhu et.al. | 2410.12269 | link |
2024-10-16 | 3D Gaussian Splatting in Robotics: A Survey | Siting Zhu et.al. | 2410.12262 | link |
2024-10-16 | Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization | Nanda Febri Istighfarin et.al. | 2410.12240 | null |
2024-10-17 | SAM-Guided Masked Token Prediction for 3D Scene Understanding | Zhimin Chen et.al. | 2410.12158 | null |
2024-10-16 | Intrinsic grain boundary mobility tensor from three-dimensional interface random walk | Xinyuan Song et.al. | 2410.12133 | null |
2024-10-15 | SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection | Yizhe Liu et.al. | 2410.12080 | link |
2024-10-15 | V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting | Tuan Dang et.al. | 2410.12068 | link |
2024-10-15 | SOE: SO(3)-Equivariant 3D MRI Encoding | Shizhe He et.al. | 2410.12053 | link |
2024-10-15 | Enabling Data-Driven and Empathetic Interactions: A Context-Aware 3D Virtual Agent in Mixed Reality for Enhanced Financial Customer Experience | Cindy Xu et.al. | 2410.12051 | null |
2024-10-15 | Global Simulations of Gravitational Instability in Protostellar Disks with Full Radiation Transport. I. Stochastic Fragmentation with Optical-depth-dependent Rate and Universal Fragment Mass | Wenrui Xu et.al. | 2410.12042 | null |
2024-10-15 | Learned Neural Physics Simulation for Articulated 3D Human Pose Reconstruction | Mykhaylo Andriluka et.al. | 2410.12023 | null |
2024-10-15 | Stochastic 3D reconstruction of cracked polycrystalline NMC particles using 2D SEM data | Philipp Rieder et.al. | 2410.12020 | null |
2024-10-22 | Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation | Zhijie Yan et.al. | 2410.11989 | null |
2024-10-15 | Integrating Artificial Intelligence Models and Synthetic Image Data for Enhanced Asset Inspection and Defect Identification | Reddy Mandati et.al. | 2410.11967 | null |
2024-10-15 | Dual-frame Fluid Motion Estimation with Test-time Optimization and Zero-divergence Loss | Yifei Zhang et.al. | 2410.11934 | link |
2024-10-15 | Beyond Sequence: Impact of Geometric Context for RNA Property Prediction | Junjie Xu et.al. | 2410.11933 | null |
2024-10-11 | Global strong solution of the 3D inhomogeneous liquid crystal flows with density-dependent viscosity and large velocity | Jiaxu Li et.al. | 2410.11881 | null |
2024-10-15 | Jigsaw++: Imagining Complete Shape Priors for Object Reassembly | Jiaxin Lu et.al. | 2410.11816 | null |
2024-10-16 | Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices | Zhiyuan Ma et.al. | 2410.11795 | null |
2024-10-15 | Latent BKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty | Joey Wilson et.al. | 2410.11783 | link |
2024-10-15 | Robotic Arm Platform for Multi-View Image Acquisition and 3D Reconstruction in Minimally Invasive Surgery | Alexander Saikia et.al. | 2410.11703 | null |
2024-10-15 | SurFhead: Affine Rig Blending for Geometrically Accurate 2D Gaussian Surfel Head Avatars | Jaeseong Lee et.al. | 2410.11682 | null |
2024-10-15 | Fast and Robust Hexahedral Mesh Optimization via Augmented Lagrangian, L-BFGS, and Line Search | Hua Tong et.al. | 2410.11656 | link |
2024-10-15 | 3D printing by two-photon polymerization of hollow microneedles for interstitial fluid extraction | Tiago Elias Abi-Ramia Silva et.al. | 2410.11631 | null |
2024-10-15 | Simultaneous Diffusion Sampling for Conditional LiDAR Generation | Ryan Faulkner et.al. | 2410.11628 | null |
2024-10-16 | Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture | Dabbrata Das et.al. | 2410.11610 | link |
2024-10-15 | DeformPAM: Data-Efficient Learning for Long-horizon Deformable Object Manipulation via Preference-based Action Alignment | Wendi Chen et.al. | 2410.11584 | link |
2024-10-15 | PAVLM: Advancing Point Cloud based Affordance Understanding Via Vision-Language Model | Shang-Ching Liu et.al. | 2410.11564 | null |
2024-10-15 | Electrical Transport in Tunably-Disordered Metamaterials | Caitlyn Obrero et.al. | 2410.11525 | link |
2024-10-15 | Dual-Teacher Ensemble Models with Double-Copy-Paste for 3D Semi-Supervised Medical Image Segmentation | Zhan Fa et.al. | 2410.11509 | link |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-15 | M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes | Sixu Yan et.al. | 2410.11402 | null |
2024-10-15 | MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields | Yuru Xiao et.al. | 2410.11394 | null |
2024-10-15 | GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information | Wancai Zheng et.al. | 2410.11356 | null |
2024-10-15 | An exploration of temporal coherence of light through holography | Alexandre Escarguel et.al. | 2410.11351 | null |
2024-10-16 | Azimuthal imaging of rock fractures by incorporating single borehole radar and optical data | Jian Shen et.al. | 2410.11350 | null |
2024-10-15 | Evolutionary Retrofitting | Mathurin Videau et.al. | 2410.11330 | null |
2024-10-15 | Searching for various melting scenarios of 2D crystals | Peng Hua et.al. | 2410.11286 | null |
2024-10-15 | Scalable Indoor Novel-View Synthesis using Drone-Captured 360 Imagery with 3D Gaussian Splatting | Yuanbo Chen et.al. | 2410.11285 | null |
2024-10-15 | Rethinking the Role of Infrastructure in Collaborative Perception | Hyunchul Bae et.al. | 2410.11259 | null |
2024-10-15 | TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Zhiwei Lin et.al. | 2410.11228 | link |
2024-10-16 | CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction | Pranav Gupta et.al. | 2410.11211 | link |
2024-10-15 | Cross-Dataset Generalization in Deep Learning | Xuyu Zhang et.al. | 2410.11207 | null |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
2024-10-14 | 3D-Prover: Diversity Driven Theorem Proving With Determinantal Point Processes | Sean Lamont et.al. | 2410.11133 | null |
2024-10-17 | UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles | Hui Ye et.al. | 2410.11125 | null |
2024-10-14 | HoloSpot: Intuitive Object Manipulation via Mixed Reality Drag-and-Drop | Pablo Soler Garcia et.al. | 2410.11110 | null |
2024-10-14 | Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting | Raja Kumar et.al. | 2410.11080 | link |
2024-10-14 | Beyond Fixed Topologies: Unregistered Training and Comprehensive Evaluation Metrics for 3D Talking Heads | Federico Nocentini et.al. | 2410.11041 | null |
2024-10-14 | ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera | Jing Liang et.al. | 2410.11019 | null |
2024-10-14 | Optimizing Radio Access Technology Selection and Precoding in CV-Aided ISAC Systems | Yulan Gao et.al. | 2410.11002 | null |
2024-10-14 | Stationary Velocity Fields on Matrix Groups for Deformable Image Registration | Johannes Bostelmann et.al. | 2410.10997 | null |
2024-10-14 | Expansion properties of the young supernova type Iax remnant Pa 30 revealed | Tim Cunningham et.al. | 2410.10940 | null |
2024-10-14 | Cultural Heritage 3D Reconstruction with Diffusion Networks | Pablo Jaramillo et.al. | 2410.10927 | link |
2024-10-17 | Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale | Andrew Jesson et.al. | 2410.10905 | link |
2024-10-13 | 3DS: Decomposed Difficulty Data Selection’s Case Study on LLM Medical Domain Adaptation | Hongxin Ding et.al. | 2410.10901 | null |
2024-10-13 | Oogway: Designing, Implementing, and Testing an AUV for RoboSub 2023 | Will Denton et.al. | 2410.10900 | null |
2024-10-14 | Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models | Jingzhi Bao et.al. | 2410.10821 | link |
2024-10-14 | When Does Perceptual Alignment Benefit Vision Representations? | Shobhita Sundaram et.al. | 2410.10817 | null |
2024-10-14 | Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies | Yanjie Ze et.al. | 2410.10803 | link |
2024-10-14 | Towards Foundation Models for 3D Vision: How Close Are We? | Yiming Zuo et.al. | 2410.10799 | link |
2024-10-14 | Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes | Jianqi Chen et.al. | 2410.10790 | link |
2024-10-14 | 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications | Eduardo R. Corral-Soto et.al. | 2410.10782 | null |
2024-10-14 | Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention | Dejia Xu et.al. | 2410.10774 | null |
2024-10-14 | DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships | Zhang Wan et.al. | 2410.10751 | null |
2024-10-14 | Quantum enhanced electric field mapping within semiconductor devices | D. Scheller et.al. | 2410.10750 | null |
2024-10-14 | FlexGen: Flexible Multi-View Generation from Text and Image Inputs | Xinli Xu et.al. | 2410.10745 | null |
2024-10-15 | 4-LEGS: 4D Language Embedded Gaussian Splatting | Gal Fiebelman et.al. | 2410.10719 | null |
2024-10-14 | The pulsar magnetosphere with machine learning: preliminary results in 3D | Ioannis Dimitropoulos et.al. | 2410.10716 | null |
2024-10-14 | TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model | Jiazhi Guan et.al. | 2410.10696 | null |
2024-10-14 | Tracking solid oxide cell electrode microstructural evolution during annealing by scanning 3D X-ray diffraction microscopy | A. Shukla et.al. | 2410.10671 | null |
2024-10-14 | PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion | Runsong Zhu et.al. | 2410.10659 | link |
2024-10-14 | Fully Asynchronous Neuromorphic Perception for Mobile Robot Dodging with Loihi Chips | Junjie Jiang et.al. | 2410.10601 | null |
2024-10-14 | Development of a 3D virtual world tool for sustainable energy education | Marta Guerra-Mota et.al. | 2410.10586 | null |
2024-10-17 | Preserving Cardiac Integrity: A Topology-Infused Approach to Whole Heart Segmentation | Chenyu Zhang et.al. | 2410.10551 | null |
2024-10-14 | Differential reflectivity columns and hail – linking C-band radar-based estimated column characteristics to crowdsourced hail observations in Switzerland | Martin Aregger et.al. | 2410.10499 | link |
2024-10-17 | Commuting Local Hamiltonians Beyond 2D | John Bostanci et.al. | 2410.10495 | null |
2024-10-14 | Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world | Han Ling et.al. | 2410.10453 | link |
2024-10-14 | DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model | Songen Gu et.al. | 2410.10429 | null |
2024-10-14 | 4DStyleGaussian: Zero-shot 4D Style Transfer with Gaussian Splatting | Wanlin Liang et.al. | 2410.10412 | null |
2024-10-14 | SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments | Khaled Gabr et.al. | 2410.10409 | link |
2024-10-15 | Parameterize Structure with Differentiable Template for 3D Shape Generation | Changfeng Ma et.al. | 2410.10399 | null |
2024-10-14 | Ultraviolet extinction correlation with 3D dust maps using white dwarfs | Snehalata Sahu et.al. | 2410.10358 | null |
2024-10-15 | On Representation of 3D Rotation in the Context of Deep Learning | Viktória Pravdová et.al. | 2410.10350 | null |
2024-10-14 | Performance of a Threshold-based WDM and ACM for FSO Communication between Mobile Platforms in Maritime Environments | Jae-Eun Han et.al. | 2410.10335 | null |
2024-10-14 | Enhanced TM-Mode 3D Coupled Wave Theory for Photonic Crystal Surface-Emitting Terahertz Quantum Cascade Lasers | Mingxi Chen et.al. | 2410.10331 | null |
2024-10-14 | ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object | Jiwei Chen et.al. | 2410.10298 | null |
2024-10-14 | Manifold-Aware Local Feature Modeling for Semi-Supervised Medical Image Segmentation | Sicheng Shen et.al. | 2410.10287 | link |
2024-10-14 | Kinematic-ICP: Enhancing LiDAR Odometry with Kinematic Constraints for Wheeled Mobile Robots Moving on Planar Surfaces | Tiziano Guadagnino et.al. | 2410.10277 | null |
2024-10-14 | Two-Stage Approach for Brain MR Image Synthesis: 2D Image Synthesis and 3D Refinement | Jihoon Cho et.al. | 2410.10269 | null |
2024-10-14 | A Surface Adaptive First-Look Inspection Planner for Autonomous Remote Sensing of Open-Pit Mines | Vignesh Kottayam Viswanathan et.al. | 2410.10256 | null |
2024-10-14 | GUISE: Graph GaUssIan Shading watErmark | Renyi Yang et.al. | 2410.10178 | null |
2024-10-16 | Tensor-involved peridynamics: A unified framework for isotropic and anisotropic materials | Hao Tian et.al. | 2410.10175 | null |
2024-10-14 | Signage-Aware Exploration in Open World using Venue Maps | Chang Chen et.al. | 2410.10143 | null |
2024-10-14 | REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation | Zhiyun Song et.al. | 2410.10097 | null |
2024-10-14 | PointNet with KAN versus PointNet with MLP for 3D Classification and Segmentation of Point Sets | Ali Kashefi et.al. | 2410.10084 | link |
2024-10-14 | Numerical Simulation of the Time-Dependent Schrodinger Equation Using the Crank-Nicolson Method | Adib Kabir et.al. | 2410.10060 | null |
2024-10-13 | GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation | Dingdong Yang et.al. | 2410.10037 | null |
2024-10-16 | InterMask: 3D Human Interaction Generation via Collaborative Masked Modelling | Muhammad Gohar Javed et.al. | 2410.10010 | link |
2024-10-13 | Improving 3D Few-Shot Segmentation with Inference-Time Pseudo-Labeling | Mohammad Mozafari et.al. | 2410.09967 | null |
2024-10-13 | Messaging-based Intelligent Processing Unit (m-IPU) for next generation AI computing | Md. Rownak Hossain Chowdhury et.al. | 2410.09961 | null |
2024-10-13 | AGN feeding along a one-armed spiral in NGC 4593: A study using ALMA CO(2-1) observations | K. Kianfar et.al. | 2410.09941 | null |
2024-10-13 | Large-Scale 3D Medical Image Pre-training with Geometric Context Priors | Linshan Wu et.al. | 2410.09890 | link |
2024-10-13 | Block-to-Scene Pre-training for Point Cloud Hybrid-Domain Masked Autoencoders | Yaohua Zha et.al. | 2410.09886 | null |
2024-10-13 | Conditioning 3D Diffusion Models with 2D Images: Towards Standardized OCT Volumes through En Face-Informed Super-Resolution | Coen de Vente et.al. | 2410.09862 | null |
2024-10-13 | Point Cloud Novelty Detection Based on Latent Representations of a General Feature Extractor | Shizuka Akahori et.al. | 2410.09861 | null |
2024-10-13 | DAS3D: Dual-modality Anomaly Synthesis for 3D Anomaly Detection | Kecen Li et.al. | 2410.09821 | null |
2024-10-13 | Predicting Molecular Ground-State Conformation via Conformation Optimization | Fanmeng Wang et.al. | 2410.09795 | null |
2024-10-13 | Magnituder Layers for Implicit Neural Representations in 3D | Sang Min Kim et.al. | 2410.09771 | null |
2024-10-13 | Data Adaptive Few-shot Multi Label Segmentation with Foundation Model | Gurunath Reddy et.al. | 2410.09759 | null |
2024-10-13 | Gaussian Splatting Visual MPC for Granular Media Manipulation | Wei-Cheng Tseng et.al. | 2410.09740 | null |
2024-10-13 | LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models | Junyan Ye et.al. | 2410.09732 | null |
2024-10-19 | Robust 3D Point Clouds Classification based on Declarative Defenders | Kaidong Li et.al. | 2410.09691 | link |
2024-10-13 | FAMOUS: High-Fidelity Monocular 3D Human Digitization Using View Synthesis | Vishnu Mani Hema et.al. | 2410.09690 | null |
2024-10-12 | Many-body Expansion Based Machine Learning Models for Octahedral Transition Metal Complexes | Ralf Meyer et.al. | 2410.09659 | link |
2024-10-12 | EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment | Chen Gao et.al. | 2410.09604 | null |
2024-10-12 | ControLRM: Fast and Controllable 3D Generation via Large Reconstruction Model | Hongbin Xu et.al. | 2410.09592 | null |
2024-10-12 | Improving 3D Finger Traits Recognition via Generalizable Neural Rendering | Hongbin Xu et.al. | 2410.09582 | null |
2024-10-12 | Pic@Point: Cross-Modal Learning by Local and Global Point-Picture Correspondence | Vencia Herzog et.al. | 2410.09519 | null |
2024-10-12 | Lower order mixed elements for the linear elasticity problem in 2D and 3D | Jun Hu et.al. | 2410.09517 | null |
2024-10-12 | Towards Design and Development of a Low-Cost Unmanned Surface Vehicle for Aquaculture Water Quality Monitoring in Shallow Water Environments | Aiyelari Temilolorun et.al. | 2410.09513 | null |
2024-10-12 | Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks | Sungkyung Kim et.al. | 2410.09489 | link |
2024-10-12 | Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors | Hritam Basak et.al. | 2410.09467 | null |
2024-10-12 | An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation | Wei Liang et.al. | 2410.09443 | null |
2024-10-12 | Neurally Integrated Finite Elements for Differentiable Elasticity on Evolving Domains | Gilles Daviet et.al. | 2410.09417 | null |
2024-10-12 | Global well-posedness and uniform-in-time vanishing damping limit for the inviscid Oldroyd-B model | Xinyu Cheng et.al. | 2410.09340 | null |
2024-10-12 | Towards Multi-Modal Animal Pose Estimation: An In-Depth Analysis | Qianyi Deng et.al. | 2410.09312 | link |
2024-10-11 | Probing Three-Dimensional Magnetic Fields: IV – Synchrotron Polarization Derivative and Vision Transformer | Yue Hu et.al. | 2410.09294 | null |
2024-10-11 | SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction | Jialei Chen et.al. | 2410.09292 | null |
2024-10-11 | nach0-pc: Multi-task Language Model with Molecular Point Cloud Encoder | Maksim Kuznetsov et.al. | 2410.09240 | null |
2024-10-11 | Foundation Model-Powered 3D Few-Shot Class Incremental Learning via Training-free Adaptor | Sahar Ahmadi et.al. | 2410.09237 | link |
2024-10-11 | Cross-Domain Distribution Alignment for Segmentation of Private Unannotated 3D Medical Images | Ruitong Sun et.al. | 2410.09210 | link |
2024-10-11 | MFIT: Multi-Fidelity Thermal Modeling for 2.5D and 3D Multi-Chiplet Architectures | Lukas Pfromm et.al. | 2410.09188 | link |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | Gyromorphs: a new class of functional disordered materials | Mathias Casiulis et.al. | 2410.09023 | null |
2024-10-11 | CVAM-Pose: Conditional Variational Autoencoder for Multi-Object Monocular Pose Estimation | Jianyu Zhao et.al. | 2410.09010 | link |
2024-10-11 | Semantic Score Distillation Sampling for Compositional Text-to-3D Generation | Ling Yang et.al. | 2410.09009 | link |
2024-10-11 | DEL: Discrete Element Learner for Learning 3D Particle Dynamics with Neural Rendering | Jiaxu Wang et.al. | 2410.08983 | null |
2024-10-11 | Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation | Varduhi Yeghiazaryan et.al. | 2410.08946 | null |
2024-10-11 | MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering | Jaehoon Choi et.al. | 2410.08941 | null |
2024-10-11 | Low-Temperature Heat Transport under Phonon Confinement in Nanostructures | M. Sidorova et.al. | 2410.08878 | null |
2024-10-11 | Adaptive optimization of wave energy conversion in oscillatory wave surge converters via SPH simulation and deep reinforcement learning | Mai Ye et.al. | 2410.08871 | link |
2024-10-11 | Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars | Xuan Huang et.al. | 2410.08840 | link |
2024-10-11 | One-shot Generative Domain Adaptation in 3D GANs | Ziqiang Li et.al. | 2410.08824 | link |
2024-10-11 | Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization | Christian Schmidt et.al. | 2410.08743 | link |
2024-10-14 | Gait Sequence Upsampling using Diffusion Models for Single LiDAR Sensors | Jeongho Ahn et.al. | 2410.08680 | null |
2024-10-11 | Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation | Gleb Radchenko et.al. | 2410.08651 | null |
2024-10-11 | ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation | Siyou Li et.al. | 2410.08588 | null |
2024-10-11 | Diffusion-Based Depth Inpainting for Transparent and Reflective Objects | Tianyu Sun et.al. | 2410.08567 | null |
2024-10-11 | Enhanced Robot Planning and Perception through Environment Prediction | Vishnu Dutt Sharma et.al. | 2410.08560 | null |
2024-10-11 | Integrated adaptive coherent LiDAR for 4D bionic vision | Ruixuan Chen et.al. | 2410.08554 | null |
2024-10-11 | Ego3DT: Tracking Every 3D Object in Ego-centric Videos | Shengyu Hao et.al. | 2410.08530 | null |
2024-10-11 | Ab initio study on heavy-fermion behavior in LiV $_2$O$_4$ : Role of Hund’s coupling and stability | Steffen Backes et.al. | 2410.08515 | null |
2024-10-11 | HorGait: Advancing Gait Recognition with Efficient High-Order Spatial Interactions in LiDAR Point Clouds | Jiaxing Hao et.al. | 2410.08454 | null |
2024-10-10 | VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis | Andrew Hoopes et.al. | 2410.08397 | link |
2024-10-10 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Samir Abou Haidar et.al. | 2410.08365 | null |
2024-10-10 | DTactive: A Vision-Based Tactile Sensor with Active Surface | Jikai Xu et.al. | 2410.08337 | null |
2024-10-10 | FusionSense: Bridging Common Sense, Vision, and Touch for Robust Sparse-View Reconstruction | Irving Fang et.al. | 2410.08282 | null |
2024-10-10 | Chaotic magnetic disconnections trigger flux eruptions in accretion flows channeled onto magnetically saturated Kerr black holes | Krzysztof Nalewajko et.al. | 2410.08280 | null |
2024-10-10 | Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics | Junyi Cao et.al. | 2410.08257 | null |
2024-10-11 | SPA: 3D Spatial-Awareness Enables Effective Embodied Representation | Haoyi Zhu et.al. | 2410.08208 | link |
2024-10-10 | Poison-splat: Computation Cost Attack on 3D Gaussian Splatting | Jiahao Lu et.al. | 2410.08190 | link |
2024-10-10 | SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation | Hang Yin et.al. | 2410.08189 | null |
2024-10-10 | DifFRelight: Diffusion-Based Facial Performance Relighting | Mingming He et.al. | 2410.08188 | null |
2024-10-10 | RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image | Xiaoxue Chen et.al. | 2410.08181 | null |
2024-10-10 | ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion | Zitian Zhang et.al. | 2410.08168 | link |
2024-10-10 | RayEmb: Arbitrary Landmark Detection in X-Ray Images Using Ray Embedding Subspace | Pragyan Shrestha et.al. | 2410.08152 | link |
2024-10-10 | Efficient Perspective-Correct 3D Gaussian Splatting Using Hybrid Transparency | Florian Hahlbohm et.al. | 2410.08129 | null |
2024-10-18 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
2024-10-10 | UW-SDF: Exploiting Hybrid Geometric Priors for Neural SDF Reconstruction from Underwater Multi-view Monocular Images | Zeyu Chen et.al. | 2410.08092 | null |
2024-10-10 | Color-Guided Flying Pixel Correction in Depth Images | Ekamresh Vasudevan et.al. | 2410.08084 | link |
2024-10-10 | ToMiE: Towards Modular Growth in Enhanced SMPL Skeleton for 3D Human with Animatable Garments | Yifan Zhan et.al. | 2410.08082 | link |
2024-10-10 | {\varphi}-FD : A well-conditioned finite difference method inspired by {\varphi}-FEM for general geometries on elliptic PDEs | Michel Duprez et.al. | 2410.08042 | null |
2024-10-11 | Fast Feedforward 3D Gaussian Splatting Compression | Yihang Chen et.al. | 2410.08017 | link |
2024-10-10 | RegionGrasp: A Novel Task for Contact Region Controllable Hand Grasp Generation | Yilin Wang et.al. | 2410.07995 | null |
2024-10-10 | A transition towards virtual representations of visual scenes | Américo Pereira et.al. | 2410.07987 | null |
2024-10-10 | MolMix: A Simple Yet Effective Baseline for Multimodal Molecular Representation Learning | Andrei Manolache et.al. | 2410.07981 | link |
2024-10-10 | Generalizable and Animatable Gaussian Head Avatar | Xuangeng Chu et.al. | 2410.07971 | link |
2024-10-10 | Understanding Spatio-Temporal Relations in Human-Object Interaction using Pyramid Graph Convolutional Network | Hao Xing et.al. | 2410.07912 | null |
2024-10-11 | ONCOPILOT: A Promptable CT Foundation Model For Solid Tumor Evaluation | Léo Machado et.al. | 2410.07908 | null |
2024-10-10 | L-VITeX: Light-weight Visual Intuition for Terrain Exploration | Antar Mazumder et.al. | 2410.07872 | null |
2024-10-10 | Meissner effect in non-Hermitian superconductors | Shun Tamura et.al. | 2410.07853 | null |
2024-10-10 | From spherical stars to disk-like structures: 3D common-envelope evolution of massive binaries beyond inspiral | M. Vetter et.al. | 2410.07841 | null |
2024-10-10 | Neural Semantic Map-Learning for Autonomous Vehicles | Markus Herb et.al. | 2410.07780 | null |
2024-10-10 | Reverse Aperiodic Resonance in Low- to High-Dimensional Bistable Systems: A Complement to Stochastic Resonance Studies in Logic Circuits | Mengen Shen et.al. | 2410.07775 | null |
2024-10-10 | HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Pei Liu et.al. | 2410.07758 | null |
2024-10-10 | MMHead: Towards Fine-grained Multi-modal 3D Facial Animation | Sijing Wu et.al. | 2410.07757 | null |
2024-10-10 | All-optical in vivo photoacoustic tomography by adaptive multilayer temporal backpropagation | Taeil Yoon et.al. | 2410.07714 | null |
2024-10-10 | MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting | Ruijie Zhu et.al. | 2410.07707 | link |
2024-10-10 | A Visual Cooperative Localization Method for Airborne Magnetic Surveying Based on a Manifold Sensor Fusion Algorithm Using Lie Groups | Liang Liu et.al. | 2410.07700 | null |
2024-10-10 | PokeFlex: A Real-World Dataset of Deformable Objects for Robotics | Jan Obrist et.al. | 2410.07688 | null |
2024-10-10 | Patterned Structure Muscle : Arbitrary Shaped Wire-driven Artificial Muscle Utilizing Anisotropic Flexible Structure for Musculoskeletal Robots | Shunnosuke Yoshimura et.al. | 2410.07682 | null |
2024-10-10 | MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion | Onkar Susladkar et.al. | 2410.07659 | link |
2024-10-10 | SeMv-3D: Towards Semantic and Mutil-view Consistency simultaneously for General Text-to-3D Generation with Triplane Priors | Xiao Cai et.al. | 2410.07658 | null |
2024-10-10 | A regularisation technique to precisely infer limb darkening using transit measurements: can we estimate stellar surface magnetic fields? | Kuldeep Verma et.al. | 2410.07636 | null |
2024-10-10 | Fine-detailed Neural Indoor Scene Reconstruction using multi-level importance sampling and multi-view consistency | Xinghui Li et.al. | 2410.07597 | null |
2024-10-10 | 3D Vision-Language Gaussian Splatting | Qucheng Peng et.al. | 2410.07577 | null |
2024-10-10 | Calibration of 3D Single-pixel Imaging Systems with a Calibration Field | Xinyue Ma et.al. | 2410.07545 | null |
2024-10-10 | A 3D-Printed Table for Hybrid X-ray CT and Optical Imaging of a Live Mouse | Wenxuan Xue et.al. | 2410.07517 | null |
2024-10-10 | Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels | Zhizheng Liu et.al. | 2410.07500 | null |
2024-10-09 | Progressive Multi-Modal Fusion for Robust 3D Object Detection | Rohit Mohan et.al. | 2410.07475 | null |
2024-10-09 | 3D2M Dataset: A 3-Dimension diverse Mesh Dataset | Sankarshan Dasgupta et.al. | 2410.07415 | link |
2024-10-09 | Enhancing Soccer Camera Calibration Through Keypoint Exploitation | Nikolay S. Falaleev et.al. | 2410.07401 | link |
2024-10-09 | Structured Spatial Reasoning with Open Vocabulary Object Detectors | Negar Nejatishahidin et.al. | 2410.07394 | null |
2024-10-09 | En masse scanning and automated surfacing of small objects using Micro-CT | Riley C. W. O’Neill et.al. | 2410.07385 | link |
2024-10-18 | Code switching revisited: low-overhead magic state preparation using color codes | Lucas Daguerre et.al. | 2410.07327 | null |
2024-10-09 | On the impact of AGN feedback modes onto the turbulent properties of the multiphase ICM | Stefano Sotira et.al. | 2410.07314 | null |
2024-10-09 | A high-performance nitrogen-rich ZIF-8-derived Fe-Co-NC electrocatalyst for the oxygen reduction reaction | Yuqin Wang et.al. | 2410.07300 | null |
2024-10-17 | Spiking GS: Towards High-Accuracy and Low-Cost Surface Reconstruction via Spiking Neuron-based Gaussian Splatting | Weixing Zhang et.al. | 2410.07266 | link |
2024-10-08 | Reconstruction of Particle Flow Energy Distribution Using Deep Learning Algorithms | Han Zhang et.al. | 2410.07250 | link |
2024-10-09 | Thing2Reality: Transforming 2D Content into Conditioned Multiviews and 3D Gaussian Objects for XR Communication | Erzhen Hu et.al. | 2410.07119 | null |
2024-10-11 | Effect of different stickiness between icy and silicate particles on carbon depletion in protoplanetary disks | Tamami Okamoto et.al. | 2410.07047 | null |
2024-10-09 | Z-upscaling: Optical Flow Guided Frame Interpolation for Isotropic Reconstruction of 3D EM Volumes | Fisseha A. Ferede et.al. | 2410.07043 | link |
2024-10-09 | The Energy Sharing Timescale in an Analytic Framework for Common Envelope Hydrodynamics | Rosa Wallace Everson et.al. | 2410.07036 | null |
2024-10-09 | Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation | Runze Chen et.al. | 2410.06982 | null |
2024-10-14 | Focal Surface Holographic Light Transport using Learned Spatially Adaptive Convolutions | Chuanjun Zheng et.al. | 2410.06854 | null |
2024-10-09 | An Improved Approach for Cardiac MRI Segmentation based on 3D UNet Combined with Papillary Muscle Exclusion | Narjes Benameur et.al. | 2410.06818 | null |
2024-10-09 | Crystallinity in Niobium oxides: A pathway to mitigate Two-Level System Defects in Niobium 3D Resonator for quantum applications | Y. Kalboussi et.al. | 2410.06805 | null |
2024-10-09 | DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation | Zhiqi Li et.al. | 2410.06756 | null |
2024-10-16 | A network of cooler white dwarfs as infrared standards for flux calibration | Abbigail K. Elms et.al. | 2410.06754 | null |
2024-10-15 | MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Zhenhui Ye et.al. | 2410.06734 | null |
2024-10-18 | Perceptual Quality Assessment of Octree-RAHT Encoded 3D Point Clouds | Dongshuai Duan et.al. | 2410.06729 | link |
2024-10-09 | Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy | Qinfeng Zhu et.al. | 2410.06725 | null |
2024-10-19 | Perceptual Quality Assessment of Trisoup-Lifting Encoded 3D Point Clouds | Juncheng Long et.al. | 2410.06689 | link |
2024-10-10 | Effect of dynamical electron correlations on the tunnelling magnetoresistance of Fe/MgO/Fe(001) junctions | Declan Nell et.al. | 2410.06679 | null |
2024-10-15 | M3Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes | Zeyu Zhang et.al. | 2410.06678 | null |
2024-10-09 | ES-Gaussian: Gaussian Splatting Mapping via Error Space-Based Gaussian Completion | Lu Chen et.al. | 2410.06613 | null |
2024-10-09 | MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging | Noel C. F. Codella et.al. | 2410.06542 | null |
2024-10-09 | QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird’s-Eye-View Representation | Yuxin Li et.al. | 2410.06516 | null |
2024-10-09 | TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training | Wanchao Liang et.al. | 2410.06511 | link |
2024-10-09 | BiC-MPPI: Goal-Pursuing, Sampling-Based Bidirectional Rollout Clustering Path Integral for Trajectory Optimization | Minchan Jung et.al. | 2410.06493 | link |
2024-10-09 | 3D Representation Methods: A Survey | Zhengren Wang et.al. | 2410.06475 | null |
2024-10-09 | Anisotropic Thermal Conductivity of 3D Printed Graphene Enhanced Thermoplastic Polyurethanes Structure toward Photothermal Conversion | Zihao Kang et.al. | 2410.06470 | null |
2024-10-08 | MIRACLE 3D: Memory-efficient Integrated Robust Approach for Continual Learning on Point Clouds via Shape Model construction | Hossein Resani et.al. | 2410.06418 | null |
2024-10-08 | Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching | Gongxin Yao et.al. | 2410.06285 | null |
2024-10-08 | New developments on the Ingot WFS laboratory testing | Tânia Gomes Machado et.al. | 2410.06260 | null |
2024-10-08 | Pupil plane WFSs for LGS systems of giant telescopes: the case of Ingot | Elisa Portaluri et.al. | 2410.06259 | null |
2024-10-08 | HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction | Shengji Tang et.al. | 2410.06245 | null |
2024-10-10 | RelitLRM: Generative Relightable Radiance for Large Reconstruction Models | Tianyuan Zhang et.al. | 2410.06231 | null |
2024-10-08 | Investigation of rotational augmentation mechanisms on wind turbine blade sections based on Quasi-3D simulations | Pedro Rodrigues et.al. | 2410.06228 | null |
2024-10-08 | Linear and nonlinear optical response based on many-body GW-Bethe-Salpeter and Kadanoff-Baym approaches for two-dimensional layered semiconductors | Dmitry Skachkov et.al. | 2410.06218 | null |
2024-10-08 | GSLoc: Visual Localization with 3D Gaussian Splatting | Kazii Botashev et.al. | 2410.06165 | null |
2024-10-08 | Control of Cu morphology on TaN barrier and combined Ru-TaN barrier/liner substrates for nanoscale interconnects from atomistic kinetic Monte Carlo simulations | Samuel Aldana et.al. | 2410.06133 | null |
2024-10-08 | RealityCraft: An In-Situ CAD+CAM Interface for Novices via Scene-Aware Augmented Reality | Oğuz Arslan et.al. | 2410.06113 | null |
2024-10-08 | Corrections to “Computer Vision Aided mmWave Beam Alignment in V2X Communications” | Weihua Xu et.al. | 2410.06004 | null |
2024-10-08 | Reconfigurable Topological Dissipative Light Bullets | Qian Tang et.al. | 2410.05981 | null |
2024-10-08 | MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal Model | Yiwen Ye et.al. | 2410.05905 | link |
2024-10-08 | Unobserved Object Detection using Generative Models | Subhransu S. Bhattacharjee et.al. | 2410.05869 | link |
2024-10-08 | Long-Range Reading of Multiple Chipless Sensors from the Isoline Processing of 3D Radar Images | A. Hadj Djilani et.al. | 2410.05866 | null |
2024-10-08 | FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance | Ruocheng Wang et.al. | 2410.05791 | null |
2024-10-08 | Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters | Guoji Tian et.al. | 2410.05772 | null |
2024-10-08 | 3D UAV Trajectory Planning for IoT Data Collection via Matrix-Based Evolutionary Computation | Pei-Fa Sun et.al. | 2410.05759 | null |
2024-10-08 | Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration | Xueyang Kang et.al. | 2410.05729 | link |
2024-10-08 | Whole-Body Dynamic Throwing with Legged Manipulators | Humphrey Munn et.al. | 2410.05681 | null |
2024-10-08 | Single picture single photon single pixel 3D imaging through unknown thick scattering medium | Long Pan et.al. | 2410.05607 | null |
2024-10-07 | Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Ziwei Liao et.al. | 2410.05514 | link |
2024-10-07 | Mind the kinematics simulation of planet-disk interactions: time evolution and numerical resolution | Kan Chen et.al. | 2410.05482 | null |
2024-10-11 | PH-Dropout: Practical Epistemic Uncertainty Quantification for View Synthesis | Chuanhao Sun et.al. | 2410.05468 | link |
2024-10-07 | SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones | Denis Davletshin et.al. | 2410.05405 | null |
2024-10-07 | End of the World Brane Dynamics in Holographic 4d $\mathcal{N}=4$ SU(N) with 3d $\mathcal{N}=2$ Boundary Conditions | Jesús Huertas et.al. | 2410.05368 | null |
2024-10-07 | Generating CAD Code with Vision-Language Models for 3D Designs | Kamel Alrashedy et.al. | 2410.05340 | null |
2024-10-04 | Topology-Informed Machine Learning for Efficient Prediction of Solid Oxide Fuel Cell Electrode Polarization | Maksym Szemer et.al. | 2410.05307 | link |
2024-10-03 | Global well-posedness of the Navier-Stokes equations and the Keller-Segel system in variable Fourier-Besov spaces | Gastón Vergara-Hermosilla et.al. | 2410.05293 | null |
2024-10-07 | Generalization of Modular Spread Complexity for Non-Hermitian Density Matrices | Aneek Jana et.al. | 2410.05264 | null |
2024-10-07 | DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control | Kaifeng Zhao et.al. | 2410.05260 | null |
2024-10-07 | GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting | Yukang Cao et.al. | 2410.05259 | null |
2024-10-07 | Rosette spectroscopic imaging for whole-brain metabolite mapping at 7T: acceleration potential and reproducibility | Zhiwei Huang et.al. | 2410.05245 | null |
2024-10-07 | Path planning for multi-quadrotor 3D boundary surveillance using non-autonomous discrete memristor hyperchaotic system | Harisankar R et.al. | 2410.05215 | null |
2024-10-07 | Polar alignment of a dusty circumbinary disc – II. Application to 99 Herculis | Jeremy L. Smallwood et.al. | 2410.05208 | null |
2024-10-08 | Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality | Ge Ya Luo et.al. | 2410.05203 | link |
2024-10-07 | Provably Positivity-Preserving Constrained Transport (PPCT) Second-Order Scheme for Ideal Magnetohydrodynamics | Dongwen Pang et.al. | 2410.05173 | null |
2024-10-07 | Real-Time Truly-Coupled Lidar-Inertial Motion Correction and Spatiotemporal Dynamic Object Detection | Cedric Le Gentil et.al. | 2410.05152 | null |
2024-10-07 | DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects | Nidhi Mathihalli et.al. | 2410.05097 | link |
2024-10-07 | HE-Nav: A High-Performance and Efficient Navigation System for Aerial-Ground Robots in Cluttered Environments | Junming Wang et.al. | 2410.05079 | null |
2024-10-07 | HE-Drive: Human-Like End-to-End Driving with Vision Language Models | Junming Wang et.al. | 2410.05051 | null |
2024-10-08 | FreSh: Frequency Shifting for Accelerated Neural Representation Learning | Adam Kania et.al. | 2410.05050 | link |
2024-10-07 | PhotoReg: Photometrically Registering 3D Gaussian Splatting Models | Ziwen Yuan et.al. | 2410.05044 | null |
2024-10-07 | CUDA-based focused Gaussian beams second-harmonic generation efficiency calculator | A. D. Sanchez et.al. | 2410.04994 | null |
2024-10-10 | 6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering | Zhongpai Gao et.al. | 2410.04974 | null |
2024-10-07 | Revealing Directions for Text-guided 3D Face Editing | Zhuo Chen et.al. | 2410.04965 | null |
2024-10-07 | Real-time Ship Recognition and Georeferencing for the Improvement of Maritime Situational Awareness | Borja Carrillo Perez et.al. | 2410.04946 | null |
2024-10-07 | Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning | Chen Zhang et.al. | 2410.04936 | null |
2024-10-07 | Large time behavior for solutions to the anisotropic Navier-Stokes equations in a 3D half-space | Mikihiro Fujii et.al. | 2410.04904 | null |
2024-10-07 | D-PoSE: Depth as an Intermediate Representation for 3D Human Pose and Shape Estimation | Nikolaos Vasilikopoulos et.al. | 2410.04889 | link |
2024-10-07 | TeX-NeRF: Neural Radiance Fields from Pseudo-TeX Vision | Chonghao Zhong et.al. | 2410.04873 | null |
2024-10-15 | Diffusion Models in 3D Vision: A Survey | Zhen Wang et.al. | 2410.04738 | null |
2024-10-07 | PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners | Yujin Tang et.al. | 2410.04733 | link |
2024-10-07 | Coverage Analysis for 3D Indoor Terahertz Communication System Over Fluctuating Two-Ray Fading Channels | Zhifeng Tang et.al. | 2410.04681 | null |
2024-10-07 | Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting | Matthew Strong et.al. | 2410.04680 | link |
2024-10-06 | Multimodal 3D Fusion and In-Situ Learning for Spatially Aware AI | Chengyuan Xu et.al. | 2410.04652 | link |
2024-10-06 | Mode-GS: Monocular Depth Guided Anchored 3D Gaussian Splatting for Robust Ground-View Scene Rendering | Yonghan Lee et.al. | 2410.04646 | null |
2024-10-06 | Thickness-Driven Transitions Between Novel Magnetic States in Ferromagnetic Films | Jacob Mankenberg et.al. | 2410.04600 | null |
2024-10-06 | Mg ii h&k spectra of an enhanced network region simulated with the MURaM-ChE code. Results using 1.5D synthesis | P. Ondratschek et.al. | 2410.04594 | null |
2024-10-06 | Enhancing 3D Human Pose Estimation Amidst Severe Occlusion with Dual Transformer Fusion | Mehwish Ghafoor et.al. | 2410.04574 | link |
2024-10-06 | 3D printed mesoporous superconductors with periodic order on three length scales and enhanced properties via block copolymer directed self-assembly | Fei Yu et.al. | 2410.04569 | null |
2024-10-06 | Multi-Time Version of the Landau-Peierls Formulation of Quantum Electrodynamics | Matthias Lienert et.al. | 2410.04535 | null |
2024-10-06 | In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding | Shenghao Li et.al. | 2410.04529 | null |
2024-10-06 | Block Vecchia Approximation for Scalable and Efficient Gaussian Process Computations | Qilong Pan et.al. | 2410.04477 | null |
2024-10-06 | Liquid-Droplet Coalescence: CNN-based Reconstruction of Flow Fields from Concentration Fields | Vasanth Kumar Babu et.al. | 2410.04451 | null |
2024-10-06 | LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation | Jianhao Jiao et.al. | 2410.04419 | null |
2024-10-06 | Global well-posedness for the defocusing 3D quadratic NLS in the sharp critical space | Jia Shen et.al. | 2410.04337 | null |
2024-10-05 | Vehicle-in-Virtual-Environment Method for ADAS and Connected and Automated Driving Function Development/Demonstration/Evaluation | Xincheng Cao et.al. | 2410.04313 | null |
2024-10-05 | ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments | Lorenzo Terenzi et.al. | 2410.04250 | null |
2024-10-05 | Distinguishing Electronic Band Structure of Single-layer and Bilayer Ruddlesden-Popper Nickelates Probed by in-situ High Pressure X-ray Absorption Near-edge Spectroscopy | Mingtao Li et.al. | 2410.04230 | null |
2024-10-05 | DB-SAM: Delving into High Quality Universal Medical Image Segmentation | Chao Qin et.al. | 2410.04172 | link |
2024-10-05 | IceCloudNet: 3D reconstruction of cloud ice from Meteosat SEVIRI | Kai Jeggle et.al. | 2410.04135 | link |
2024-10-05 | TV-based Deep 3D Self Super-Resolution for fMRI | Fernando Pérez-Bueno et.al. | 2410.04097 | null |
2024-10-05 | Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback | Fatemeh Pesaran Zadeh et.al. | 2410.04064 | link |
2024-10-10 | Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy | Pengcheng Chen et.al. | 2410.04041 | null |
2024-10-05 | Flatbands from Bound States in the Continuum for Orbital Angular Momentum Localization | Weiwei Zhu et.al. | 2410.04040 | null |
2024-10-05 | A tensor-based approach to solving linear systems involving Kronecker sum of matrices | Ahmad Y. Al-Dweik et.al. | 2410.04026 | null |
2024-10-04 | STONE: A Submodular Optimization Framework for Active 3D Object Detection | Ruiyu Mao et.al. | 2410.03918 | link |
2024-10-04 | SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models | Yue Zhang et.al. | 2410.03878 | null |
2024-10-04 | Fusions and Dualities for 3d Theories $T[M_3]$ | Shi Cheng et.al. | 2410.03852 | null |
2024-10-04 | Text-guided Diffusion Model for 3D Molecule Generation | Yanchen Luo et.al. | 2410.03803 | null |
2024-10-04 | M2AR: A Web-based Modeling Environment for the Augmented Reality Workflow Modeling Language | Fabian Muff et.al. | 2410.03800 | null |
2024-10-04 | Estimating Body and Hand Motion in an Ego-sensed World | Brent Yi et.al. | 2410.03665 | null |
2024-10-04 | Unlearnable 3D Point Clouds: Class-wise Transformation Is All You Need | Xianlong Wang et.al. | 2410.03644 | link |
2024-10-04 | A mixed-dimensional model for the electrostatic problem on coupled domains | Beatrice Crippa et.al. | 2410.03622 | null |
2024-10-04 | 3d Mirror Symmetry is Mirror Symmetry | Ki Fung Chan et.al. | 2410.03611 | null |
2024-10-04 | Crystallography, Group Cohomology, and Lieb-Schultz-Mattis Constraints | Chunxiao Liu et.al. | 2410.03607 | null |
2024-10-04 | Variational Bayes Gaussian Splatting | Toon Van de Maele et.al. | 2410.03592 | link |
2024-10-04 | Loading Ceramics: Visualising Possibilities of Robotics in Ceramics | Varvara Guljajeva et.al. | 2410.03550 | null |
2024-10-04 | Dessie: Disentanglement for Articulated 3D Horse Shape and Pose Estimation from Images | Ci Li et.al. | 2410.03438 | null |
2024-10-08 | Towards Real-time Intrahepatic Vessel Identification in Intraoperative Ultrasound-Guided Liver Surgery | Karl-Philippe Beaudet et.al. | 2410.03420 | null |
2024-10-04 | Img2CAD: Conditioned 3D CAD Model Generation from Single Image with Structured Visual Geometry | Tianrun Chen et.al. | 2410.03417 | null |
2024-10-04 | The Haar measure in solid mechanics | Clément Ecker et.al. | 2410.03371 | null |
2024-10-04 | Collision-Aware Traversability Analysis for Autonomous Vehicles in the Context of Agricultural Robotics | Florian Philippe et.al. | 2410.03370 | null |
2024-10-04 | Accurate 3D Nanoscale Electromechanical Imaging with a Metrological Atomic Force Microscope | Roger Proksch et.al. | 2410.03340 | null |
2024-10-04 | Superconducting strings in the two-Higgs doublet model | Steven Cotterill et.al. | 2410.03300 | null |
2024-10-04 | Performance assessment of the HERD calorimeter with a photo-diode read-out system for high-energy electron beams | O. Adriani et.al. | 2410.03274 | null |
2024-10-04 | Probing 3D Velocity Distributions Insights from a Vibrated Dual-Species Granular System | Rameez Farooq Shah et.al. | 2410.03273 | null |
2024-10-04 | 3D Segmentation of Neuronal Nuclei and Cell-Type Identification using Multi-channel Information | Antonio LaTorre et.al. | 2410.03248 | null |
2024-10-08 | Autonomous Character-Scene Interaction Synthesis from Text Instruction | Nan Jiang et.al. | 2410.03187 | null |
2024-10-04 | Design and Fabrication of a Low-cost Liquid Optical Waveguide for Augmented Reality | Dechuan Sun et.al. | 2410.03157 | null |
2024-10-12 | ECHOPulse: ECG controlled echocardio-grams video generation | Yiwei Li et.al. | 2410.03143 | link |
2024-10-04 | Shrinking: Reconstruction of Parameterized Surfaces from Signed Distance Fields | Haotian Yin et.al. | 2410.03123 | null |
2024-10-04 | MBDS: A Multi-Body Dynamics Simulation Dataset for Graph Networks Simulators | Sheng Yang et.al. | 2410.03107 | link |
2024-10-04 | Calibration of NYURay for Ray Tracing using 28, 73, and 142 GHz Channel Measurements conducted in Indoor, Outdoor, and Factory Scenarios | O. Kanhere et.al. | 2410.03104 | null |
2024-10-04 | Partial-to-Full Registration based on Gradient-SDF for Computer-Assisted Orthopedic Surgery | Tiancheng Li et.al. | 2410.03078 | null |
2024-10-03 | Vehicle Suspension Recommendation System: Multi-Fidelity Neural Network-based Mechanism Design Optimization | Sumin Lee et.al. | 2410.03045 | null |
2024-10-03 | Single-Shot 6DoF Pose and 3D Size Estimation for Robotic Strawberry Harvesting | Lun Li et.al. | 2410.03031 | null |
2024-10-03 | Novel electronic state of honeycomb iridate Cu $_2$IrO$_3$ at high pressure | G. Fabbris et.al. | 2410.02934 | null |
2024-10-03 | First-principles measurement of ion and electron energization in collisionless accretion flow | Evgeny A. Gorbunov et.al. | 2410.02872 | null |
2024-10-03 | Individuation of 3D perceptual units from neurogeometry of binocular cells | Maria Virginia Bolelli et.al. | 2410.02870 | null |
2024-10-03 | On Resonance Enhancement of $E1-E2$ Nondipole Photoelectron Asymmetries in Low-Energy Ne $2p$ -Photoionization | Valeriy K. Dolmatov et.al. | 2410.02869 | null |
2024-10-03 | On the origin of transition disk cavities: Pebble-accreting protoplanets vs Super-Jupiters | Shuo Huang et.al. | 2410.02856 | null |
2024-10-03 | The Kinematics of 30 Milky Way Globular Clusters and the Multiple Stellar Populations within | Ellen Leitinger et.al. | 2410.02855 | null |
2024-10-03 | Magnetically-Driven Neutron-Rich Ejecta Unleashed: Global 3D Neutrino-GRMHD Simulations of Collapsars Reveal the Conditions for r-process Nucleosynthesis | Danat Issa et.al. | 2410.02852 | null |
2024-10-03 | Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats | Mingyang Xie et.al. | 2410.02764 | null |
2024-10-03 | The helicity distribution for the incompressible Euler equations on bounded domains | Marco Inversi et.al. | 2410.02728 | null |
2024-10-03 | SynthFormer: Equivariant Pharmacophore-based Generation of Molecules for Ligand-Based Drug Design | Zygimantas Jocys et.al. | 2410.02718 | null |
2024-10-03 | AlzhiNet: Traversing from 2DCNN to 3DCNN, Towards Early Detection and Diagnosis of Alzheimer’s Disease | Romoke Grace Akindele et.al. | 2410.02714 | null |
2024-10-04 | Learning 3D Perception from Others’ Predictions | Jinsu Yoo et.al. | 2410.02646 | null |
2024-10-03 | Why Sample Space Matters: Keyframe Sampling Optimization for LiDAR-based Place Recognition | Nikolaos Stathoulopoulos et.al. | 2410.02643 | link |
2024-10-03 | GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering | Hongze Chen et.al. | 2410.02619 | null |
2024-10-03 | Exact boundary controllability of the 3D incompressible ideal MHD system | Igor Kukavica et.al. | 2410.02588 | null |
2024-10-03 | Deep Regression 2D-3D Ultrasound Registration for Liver Motion Correction in Focal Tumor Thermal Ablation | Shuwei Xing et.al. | 2410.02579 | link |
2024-10-07 | SuperGS: Super-Resolution 3D Gaussian Splatting via Latent Feature Field and Gradient-guided Splitting | Shiyun Xie et.al. | 2410.02571 | link |
2024-10-03 | QED calculations of the E1 transition amplitude in neon-like iron and nickel | M. G. Kozlov et.al. | 2410.02489 | null |
2024-10-03 | LoGDesc: Local geometric features aggregation for robust point cloud registration | Karim Slimani et.al. | 2410.02420 | link |
2024-10-03 | ProtoSeg: A Prototype-Based Point Cloud Instance Segmentation Method | Remco Royen et.al. | 2410.02352 | null |
2024-10-03 | RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds | Remco Royen et.al. | 2410.02323 | link |
2024-10-03 | Impact of beam asymmetries at the Future Circular Collider e+e- | Peter Kicsiny et.al. | 2410.02302 | null |
2024-10-03 | Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features | Chengkai Hou et.al. | 2410.02237 | null |
2024-10-02 | MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis | Xiaobiao Du et.al. | 2410.02103 | link |
2024-10-02 | Orient Anything | Christopher Scarvelis et.al. | 2410.02101 | null |
2024-10-02 | Learning from the Giants: A Practical Approach to Underwater Depth and Surface Normals Estimation | Alzayat Saleh et.al. | 2410.02072 | null |
2024-10-02 | Emo3D: Metric and Benchmarking Dataset for 3D Facial Expression Generation from Emotion Description | Mahshid Dehghani et.al. | 2410.02049 | null |
2024-10-02 | FeelAnyForce: Estimating Contact Force Feedback from Tactile Sensation for Vision-Based Tactile Sensors | Amir-Hossein Shahidzadeh et.al. | 2410.02048 | null |
2024-10-02 | Scene Flow as a Partial Differential Equation | Kyle Vedder et.al. | 2410.02031 | null |
2024-10-02 | SkyAI Sim: An Open-Source Simulation of UAV Aerial Imaging from Satellite Data | S. Parisa Dajkhosh et.al. | 2410.02003 | link |
2024-10-02 | Gapless dispersive continuum in a breathing kagome antiferromagnet | Asiri Thennakoon et.al. | 2410.01931 | null |
2024-10-02 | High-order regularization dealing with ill-conditioned robot localization problems | Xinghua Liu et.al. | 2410.01919 | null |
2024-10-02 | Latency Reduction in CloudVR: Cloud Prediction, Edge Correction | Ali Majlesi Kopaee et.al. | 2410.01898 | null |
2024-10-05 | H-AMR FORGE’d in FIRE I: Magnetic state transitions, jet launching and radiative emission in super-Eddington, highly magnetized quasar disks formed from cosmological initial conditions | Nicholas Kaaz et.al. | 2410.01877 | null |
2024-10-02 | OCC-MLLM-Alpha:Empowering Multi-modal Large Language Model for the Understanding of Occluded Objects with Self-Supervised Test-Time Learning | Shuxin Yang et.al. | 2410.01861 | null |
2024-09-26 | Target Pose Guided Whole-body Grasping Motion Generation for Digital Humans | Quanquan Shao et.al. | 2410.01840 | null |
2024-10-09 | EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis | Alexander Mai et.al. | 2410.01804 | null |
2024-10-02 | FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images | Cheng Zhang et.al. | 2410.01801 | null |
2024-10-02 | RADAR: Robust Two-stage Modality-incomplete Industrial Anomaly Detection | Bingchen Miao et.al. | 2410.01737 | null |
2024-10-02 | Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking | Ayesha Ishaq et.al. | 2410.01678 | link |
2024-10-02 | 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Yang Cao et.al. | 2410.01647 | link |
2024-10-02 | Gaussian Splatting in Mirrors: Reflection-Aware Rendering via Virtual Camera Optimization | Zihan Wang et.al. | 2410.01614 | link |
2024-10-02 | Measurement of ionization delays in atomic REMPI using photoelectron vortices | D. Köhnke et.al. | 2410.01601 | null |
2024-10-02 | Coordinate-Based Neural Representation Enabling Zero-Shot Learning for 3D Multiparametric Quantitative MRI | Guoyan Lao et.al. | 2410.01577 | null |
2024-10-06 | GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians | Shuyi Jiang et.al. | 2410.01535 | null |
2024-10-02 | Toward a Holistic Evaluation of Robustness in CLIP Models | Weijie Tu et.al. | 2410.01534 | null |
2024-10-02 | MiraGe: Editable 2D Images using Gaussian Splatting | Joanna Waczyńska et.al. | 2410.01521 | link |
2024-10-02 | UW-GS: Distractor-Aware 3D Gaussian Splatting for Enhanced Underwater Scene Reconstruction | Haoran Wang et.al. | 2410.01517 | link |
2024-10-02 | Topological entanglement and number theory | Siddharth Dwivedi et.al. | 2410.01492 | null |
2024-10-02 | One Wave to Explain Them All: A Unifying Perspective on Post-hoc Explainability | Gabriel Kasmi et.al. | 2410.01482 | null |
2024-10-02 | A Fast Optimization Approach For A Complex Real-Life 3D Multiple Bin Size Bin Packing Problem | Katrin Heßler et.al. | 2410.01445 | null |
2024-10-03 | SurgPointTransformer: Vertebrae Shape Completion with RGB-D Data | Aidana Massalimova et.al. | 2410.01443 | null |
2024-10-02 | EVA-Gaussian: 3D Gaussian-based Real-time Human Novel View Synthesis under Diverse Camera Settings | Yingdong Hu et.al. | 2410.01425 | null |
2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et.al. | 2410.01404 | null |
2024-10-02 | Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy | Ricardo Garcia et.al. | 2410.01345 | link |
2024-10-02 | Bending, breaking, and reconnecting of the electrical double layers at heterogeneous electrodes | Qian Ai et.al. | 2410.01339 | null |
2024-10-02 | Finetuning Pre-trained Model with Limited Data for LiDAR-based 3D Object Detection by Bridging Domain Gaps | Jiyun Jang et.al. | 2410.01319 | null |
2024-10-02 | LaGeM: A Large Geometry Model for 3D Representation Learning and Diffusion | Biao Zhang et.al. | 2410.01295 | null |
2024-10-02 | SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network | Ahmed Tawfik Aboukhadra et.al. | 2410.01293 | null |
2024-10-02 | Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices | Jeho Lee et.al. | 2410.01270 | null |
2024-10-02 | Effect of span size in Scale Resolving Simulations of airfoil stall and post-stall | Francesco Mario D’Afiero et.al. | 2410.01254 | null |
2024-10-02 | High and Low Resolution Tradeoffs in Roadside Multimodal Sensing | Shaozu Ding et.al. | 2410.01250 | link |
2024-10-02 | Towards Native Generative Model for 3D Head Avatar | Yiyu Zhuang et.al. | 2410.01226 | null |
2024-10-02 | AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction | Jingnan Gao et.al. | 2410.01202 | null |
2024-10-02 | Adaptive Finite Element Method for Phase Field Fracture Models Based on Recovery Error Estimates | Tian Tian et.al. | 2410.01177 | null |
2024-10-02 | StraightTrack: Towards Mixed Reality Navigation System for Percutaneous K-wire Insertion | Han Zhang et.al. | 2410.01143 | null |
2024-10-01 | Synthetic imagery for fuzzy object detection: A comparative study | Siavash H. Khajavi et.al. | 2410.01124 | null |
2024-10-01 | Pose Estimation of Buried Deep-Sea Objects using 3D Vision Deep Learning Models | Jerry Yan et.al. | 2410.01061 | null |
2024-10-01 | Graph-based Scalable Sampling of 3D Point Cloud Attributes | Shashank N. Sridhara et.al. | 2410.01027 | null |
2024-10-01 | Y-CA-Net: A Convolutional Attention Based Network for Volumetric Medical Image Segmentation | Muhammad Hamza Sharif et.al. | 2410.01003 | null |
2024-10-01 | Extreme scale height variations and nozzle shocks in warped disks | Nicholas Kaaz et.al. | 2410.00961 | null |
2024-10-01 | Growth of Stress-Responsive Bacteria in 3D Colonies under Confining Pressure | Samaneh Rahbar et.al. | 2410.00898 | null |
2024-10-02 | Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation | Junlin Han et.al. | 2410.00890 | null |
2024-10-01 | MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone’s Potential with Masked Autoregressive Pretraining | Yunze Liu et.al. | 2410.00871 | null |
2024-10-01 | On the conservation of helicity by weak solutions of the 3D Euler and inviscid MHD equations | Daniel W. Boutros et.al. | 2410.00813 | null |
2024-10-01 | Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency | William Dubois et.al. | 2410.00758 | null |
2024-10-01 | Non-Simply Laced Class-S Vertex Operator Algebras | Grant Elliot et.al. | 2410.00735 | null |
2024-10-01 | RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations | Kaichen Zhou et.al. | 2410.00713 | link |
2024-10-01 | BioFace3D: A fully automatic pipeline for facial biomarkers extraction of 3D face reconstructions segmented from MRI | Álvaro Heredia-Lidón et.al. | 2410.00711 | null |
2024-10-01 | A Low-Cost, High-Speed, and Robust Bin Picking System for Factory Automation Enabled by a Non-Stop, Multi-View, and Active Vision Scheme | Xingdou Fu et.al. | 2410.00706 | null |
2024-10-01 | Supercomputer 3D Digital Twin for User Focused Real-Time Monitoring | William Bergeron et.al. | 2410.00688 | null |
2024-10-01 | Photo-induced phase transition on black samarium monosulfide | Hiroshi Watanabe et.al. | 2410.00674 | null |
2024-10-01 | Cafca: High-quality Novel View Synthesis of Expressive Faces from Casual Few-shot Captures | Marcel C. Bühler et.al. | 2410.00630 | null |
2024-10-01 | An Illumination-Robust Feature Extractor Augmented by Relightable 3D Reconstruction | Shunyi Zhao et.al. | 2410.00629 | null |
2024-10-01 | GERA: Geometric Embedding for Efficient Point Registration Analysis | Geng Li et.al. | 2410.00589 | null |
2024-10-01 | Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection | Pengxi Zeng et.al. | 2410.00582 | null |
2024-10-01 | Theory of Nonlinear Hall Effect Induced by Electric Field and Temperature Gradient in 3D Chiral Magnetic Textures | Terufumi Yamaguchi et.al. | 2410.00563 | null |
2024-10-02 | CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM | Dapeng Feng et.al. | 2410.00486 | link |
2024-10-01 | RobotGraffiti: An AR tool for semi-automated construction of workcell models to optimize robot deployment | Krzysztof Zieliński et.al. | 2410.00484 | null |
2024-10-01 | Precise Workcell Sketching from Point Clouds Using an AR Toolbox | Krzysztof Zieliński et.al. | 2410.00479 | null |
2024-10-01 | ReXplain: Translating Radiology into Patient-Friendly Video Reports | Luyang Luo et.al. | 2410.00441 | null |
2024-10-01 | Domain Aware Multi-Task Pretraining of 3D Swin Transformer for T1-weighted Brain MRI | Jonghun Kim et.al. | 2410.00410 | null |
2024-10-01 | 3DGR-CAR: Coronary artery reconstruction from ultra-sparse 2D X-ray views with a 3D Gaussians representation | Xueming Fu et.al. | 2410.00404 | link |
2024-10-01 | Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance | Hongchao Shu et.al. | 2410.00386 | null |
2024-10-01 | Data Augmentation for 3DMM-based Arousal-Valence Prediction for HRI | Christian Arzate Cruz et.al. | 2410.00349 | null |
2024-10-01 | Revisiting the Role of Texture in 3D Person Re-identification | Huy Nguyen et.al. | 2410.00348 | null |
2024-10-01 | SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs | Leheng Li et.al. | 2410.00337 | null |
2024-10-01 | PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al. | 2410.00320 | link |
2024-10-01 | GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving | Zhangshuo Qi et.al. | 2410.00299 | link |
2024-09-30 | Embodied Visuomotor Representation | Levi Burner et.al. | 2410.00287 | null |
2024-10-02 | Social Conjuring: Multi-User Runtime Collaboration with AI in Building Virtual 3D Worlds | Amina Kobenova et.al. | 2410.00274 | null |
2024-09-30 | Comprehensive Performance Modeling and System Design Insights for Foundation Models | Shashank Subramanian et.al. | 2410.00273 | null |
2024-09-30 | Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning | Weitai Kang et.al. | 2410.00255 | link |
2024-09-30 | MM-Conv: A Multi-modal Conversational Dataset for Virtual Humans | Anna Deichler et.al. | 2410.00253 | null |
2024-09-30 | Volumetric Conditional Score-based Residual Diffusion Model for PET/MR Denoising | Siyeop Yoon et.al. | 2410.00184 | link |
2024-09-30 | A new series of 3D CFTs with $\mathrm{Sp}(N)$ global symmetry on fuzzy sphere | Zheng Zhou et.al. | 2410.00087 | null |
2024-09-29 | Global well-posedness of the fractional dissipative system in the framework of variable Fourier–Besov spaces | Gastón Vergara-Hermosilla et.al. | 2410.00060 | null |
2024-09-27 | A Comparison of Micromegas with x/y Strip Charge Readouts for Directional Recoil Detection | Majd Ghrear et.al. | 2410.00048 | null |
2024-10-08 | DressRecon: Freeform 4D Human Reconstruction from Monocular Video | Jeff Tan et.al. | 2409.20563 | null |
2024-09-30 | Uni $^2$ Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection | Yubin Wang et.al. | 2409.20558 | null |
2024-09-30 | UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models | Qiaojun Yu et.al. | 2409.20551 | null |
2024-09-30 | Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single Images | Bahri Batuhan Bilecen et.al. | 2409.20530 | null |
2024-09-30 | Impact of Device Caching and Handovers on the Performance of 3D UAV Networks with Blockages | Neetu R R et.al. | 2409.20433 | null |
2024-09-30 | Navigating Threats: A Survey of Physical Adversarial Attacks on LiDAR Perception Systems in Autonomous Vehicles | Amira Guesmi et.al. | 2409.20426 | null |
2024-09-30 | 3D TFTs from 4d ${\cal N}=2$ BPS Particles | Davide Gaiotto et.al. | 2409.20393 | null |
2024-09-30 | Design, manufacturing, and inverse dynamic modeling of soft parallel robots actuated by dielectric elastomer actuators | Jung-Che Chang et.al. | 2409.20344 | null |
2024-09-30 | Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation | Yuran Wang et.al. | 2409.20332 | link |
2024-09-30 | Dissipation induced transition between extension and localization in the three-dimensional Anderson model | Xuanpu Yang et.al. | 2409.20319 | null |
2024-09-30 | RL-GSBridge: 3D Gaussian Splatting Based Real2Sim2Real Method for Robotic Manipulation Learning | Yuxuan Wu et.al. | 2409.20291 | null |
2024-09-30 | Poor-man’s Majorana edge mode enabled by specular Andreev reflection | C. W. J. Beenakker et.al. | 2409.20285 | null |
2024-09-30 | Controlling sharpness, SNR and SAR for 3D FSE at 7T by end-to-end learning | Peter Dawood et.al. | 2409.20251 | null |
2024-09-30 | Nuclear fuel imaging using position-sensitive detectors | Santeri Saariokari et.al. | 2409.20214 | null |
2024-09-30 | Annotation-Free Curb Detection Leveraging Altitude Difference Image | Fulong Ma et.al. | 2409.20171 | null |
2024-10-05 | GravMAD: Grounded Spatial Value Maps Guided Action Diffusion for Generalized 3D Manipulation | Yangtao Chen et.al. | 2409.20154 | null |
2024-09-30 | Experimental Measurement of Transverse Spin Dynamics in the Nonparaxial Focal Region | Nitish Kumar et.al. | 2409.20145 | null |
2024-09-30 | Robust Gaussian Splatting SLAM by Leveraging Loop Closure | Zunjie Zhu et.al. | 2409.20111 | null |
2024-09-30 | Individually Addressable Nanoscale OLEDs | Cheng Zhang et.al. | 2409.20080 | null |
2024-10-10 | OPONeRF: One-Point-One NeRF for Robust Neural Rendering | Yu Zheng et.al. | 2409.20043 | link |
2024-09-30 | Multimethod geophysical modelling for granite-related tungsten exploration: example of the Puy-les-Vignes/ Saint-Goussaud district (Limousin, France) | Geoffrey Dubreuil et.al. | 2409.20035 | null |
2024-09-30 | Camera Calibration using a Collimator System | Shunkun Liang et.al. | 2409.20034 | link |
2024-09-30 | Autonomous tip-induced chemical reactions in scanning probe microscopy | Nian Wu et.al. | 2409.20014 | null |
2024-09-30 | Single-shot reconstruction of three-dimensional morphology of biological cells in digital holographic microscopy using a physics-driven neural network | Jihwan Kim et.al. | 2409.20013 | null |
2024-10-01 | OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity | Junming Wang et.al. | 2409.19987 | null |
2024-09-30 | DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Zhen Yang et.al. | 2409.19972 | link |
2024-09-30 | NeDF: neural deflection fields for sparse-view tomographic background oriented Schlieren | Jiawei Li et.al. | 2409.19971 | null |
2024-09-30 | Detonation propagation in three-dimensional continuous curved ducts | Lisong Shi et.al. | 2409.19944 | null |
2024-09-30 | Study of Evolution and Geo-effectiveness of CME-CME Interactions using MHD Simulations with SWASTi framework | Prateek Mayank et.al. | 2409.19943 | null |
2024-09-30 | High frame rate characterization of interaction between twin-nozzle jet in crossflow | Xunchen Liu et.al. | 2409.19927 | null |
2024-09-30 | WildFusion: Multimodal Implicit 3D Reconstructions in the Wild | Yanbaihui Liu et.al. | 2409.19904 | null |
2024-09-29 | Robust Incremental Structure-from-Motion with Hybrid Features | Shaohui Liu et.al. | 2409.19811 | null |
2024-09-29 | 4D Metric-Semantic Mapping for Persistent Orchard Monitoring: Method and Dataset | Jiuzhou Lei et.al. | 2409.19786 | null |
2024-09-29 | Learning Wheelchair Tennis Navigation from Broadcast Videos with Domain Knowledge Transfer and Diffusion Motion Planning | Zixuan Wu et.al. | 2409.19771 | null |
2024-10-01 | RNG: Relightable Neural Gaussians | Jiahui Fan et.al. | 2409.19702 | null |
2024-09-29 | Development of a 3D-printed canine head phantom for veterinary radiotherapy | Sandhya Rottoo et.al. | 2409.19694 | null |
2024-10-01 | See Detail Say Clear: Towards Brain CT Report Generation via Pathological Clue-driven Representation Learning | Chengxin Zheng et.al. | 2409.19676 | link |
2024-09-29 | Dual-Attention Frequency Fusion at Multi-Scale for Joint Segmentation and Deformable Medical Image Registration | Hongchao Zhou et.al. | 2409.19658 | null |
2024-09-29 | Grounding 3D Scene Affordance From Egocentric Interactions | Cuiyu Liu et.al. | 2409.19650 | null |
2024-09-29 | FlexSBDD: Structure-Based Drug Design with Flexible Protein Modeling | Zaixi Zhang et.al. | 2409.19645 | null |
2024-09-29 | fCOP: Focal Length Estimation from Category-level Object Priors | Xinyue Zhang et.al. | 2409.19641 | null |
2024-09-29 | Octupole topological insulating phase in Brillouin three-dimensional real projective space | Sichang Qiu et.al. | 2409.19553 | null |
2024-09-29 | A Universal Deep Learning Framework for Materials X-ray Absorption Spectra | Shubha R. Kharel et.al. | 2409.19552 | link |
2024-09-29 | Impact of Imprecision of the Time Delay on Imaging Result in Confocal Algorithm | Wenyi Shao et.al. | 2409.19517 | null |
2024-09-28 | Spatial Reasoning and Planning for Deep Embodied Agents | Shu Ishida et.al. | 2409.19479 | null |
2024-09-28 | G3R: Gradient Guided Generalizable Reconstruction | Yun Chen et.al. | 2409.19405 | null |
2024-09-28 | 3D-CT-GPT: Generating 3D Radiology Reports through Integration of Large Vision-Language Models | Hao Chen et.al. | 2409.19330 | null |
2024-09-28 | Scalable Cloud-Native Pipeline for Efficient 3D Model Reconstruction from Monocular Smartphone Images | Potito Aghilar et.al. | 2409.19322 | null |
2024-10-02 | HybridFlow: A Flexible and Efficient RLHF Framework | Guangming Sheng et.al. | 2409.19256 | null |
2024-10-07 | 1st Place Solution to the 8th HANDS Workshop Challenge – ARCTIC Track: 3DGS-based Bimanual Category-agnostic Interaction Reconstruction | Jeongwan On et.al. | 2409.19215 | null |
2024-09-28 | Anisotropic multi-orbital Hubbard model simulated with impurity approximation | Yan Peng et.al. | 2409.19199 | null |
2024-09-27 | Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models | Shihua Qin et.al. | 2409.19185 | null |
2024-09-27 | Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping | Anthony A. Song et.al. | 2409.19153 | null |
2024-09-27 | MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion | Bardienus Duisterhof et.al. | 2409.19152 | null |
2024-09-27 | Diverse Code Query Learning for Speech-Driven Facial Animation | Chunzhi Gu et.al. | 2409.19143 | null |
2024-09-27 | ADEPT: A Noninvasive Method for Determining Elastic Properties of Valve Tissue | Wensi Wu et.al. | 2409.19081 | null |
2024-09-27 | Voxel-CIM: An Efficient Compute-in-Memory Accelerator for Voxel-based Point Cloud Neural Networks | Xipeng Lin et.al. | 2409.19077 | null |
2024-09-27 | Inhomogeneous Dust Biases Photometric Redshifts and Stellar Masses for LSST | ChangHoon Hahn et.al. | 2409.19054 | null |
2024-09-27 | Gaussian Heritage: 3D Digitization of Cultural Heritage with Integrated Object Segmentation | Mahtab Dahaghin et.al. | 2409.19039 | null |
2024-09-27 | S2O: Static to Openable Enhancement for Articulated 3D Objects | Denys Iliash et.al. | 2409.18896 | null |
2024-09-27 | Euclid preparation: 6x2 pt analysis of Euclid’s spectroscopic and photometric data sets | Euclid Collaboration et.al. | 2409.18882 | null |
2024-09-27 | Non-conventional Approach for Deriving the Radial Sizes of Coronal Mass Ejections at Different Instances: Discrepancies in the Estimates Between Remote and In Situ Observations | Anjali Agarwal et.al. | 2409.18851 | null |
2024-09-27 | A Simple A-Priory Estimate for 3D Stationary Navier-Stokes System Via Interpolation | Sergey P. Degtyarev et.al. | 2409.18808 | null |
2024-09-27 | Path Following Model Predictive Control of a Coupled Autonomous Underwater Vehicle | Isah A. Jimoh et.al. | 2409.18806 | null |
2024-09-27 | Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Yanyuan Qiao et.al. | 2409.18794 | null |
2024-09-27 | A POMDP-based hierarchical planning framework for manipulation under pose uncertainty | Muhammad Suhail Saleem et.al. | 2409.18775 | null |
2024-09-27 | Geometric deep learning for galaxy-halo connection: a case study for galaxy intrinsic alignments | Yesukhei Jagvaral et.al. | 2409.18761 | null |
2024-09-27 | 3DPX: Single Panoramic X-ray Analysis Guided by 3D Oral Structure Reconstruction | Xiaoshuang Li et.al. | 2409.18701 | null |
2024-09-27 | Defect detection and size classification in CdTe samples in 3D | M. Väänänen et.al. | 2409.18555 | null |
2024-09-27 | DynaWeightPnP: Toward global real-time 3D-2D solver in PnP without correspondences | Jingwei Song et.al. | 2409.18457 | null |
2024-09-27 | Search3D: Hierarchical Open-Vocabulary 3D Segmentation | Ayca Takmaz et.al. | 2409.18431 | null |
2024-09-27 | Energy equality of the weak solutions to the non-Newtonian fluids equations | Yi Feng et.al. | 2409.18406 | null |
2024-09-27 | GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation | Jiawei Lu et.al. | 2409.18401 | null |
2024-09-27 | An Augmented Reality Interface for Teleoperating Robot Manipulators: Reducing Demonstrator Task Load through Digital Twin Control | Aliyah Smith et.al. | 2409.18394 | null |
2024-09-27 | Speech to Reality: On-Demand Production using Natural Language, 3D Generative AI, and Discrete Robotic Assembly | Alexander Htet Kyaw et.al. | 2409.18390 | null |
2024-09-27 | AquaMILR+: Design of an untethered limbless robot for complex aquatic terrain navigation | Matthew Fernandez et.al. | 2409.18383 | null |
2024-10-04 | Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded Images | Donghwan Kim et.al. | 2409.18364 | link |
2024-09-27 | Energy Efficient Beamforming Training in Terahertz Communication Systems | Li-Hsiang Shen et.al. | 2409.18353 | null |
2024-09-26 | DeBaRA: Denoising-Based 3D Room Arrangement Generation | Léopold Maillard et.al. | 2409.18336 | null |
2024-10-01 | Synthesizing beta-amyloid PET images from T1-weighted Structural MRI: A Preliminary Study | Qing Lyu et.al. | 2409.18282 | null |
2024-09-26 | Galerkin Method of Regularized Stokeslets for Procedural Fluid Flow with Control Curves | Ryusuke Sugimoto et.al. | 2409.18276 | null |
2024-09-26 | Spin-lattice couplings in $3d$ ferromagnets: analysis from first-principles | I. P. Miranda et.al. | 2409.18274 | null |
2024-09-30 | Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation | Mengchen Zhang et.al. | 2409.18261 | link |
2024-09-26 | Capping effects on spin and charge excitations in parent and superconducting Nd1-xSrxNiO2 | S. Fan et.al. | 2409.18258 | null |
2024-10-01 | Spatial Visibility and Temporal Dynamics: Revolutionizing Field of View Prediction in Adaptive Point Cloud Video Streaming | Chen Li et.al. | 2409.18236 | null |
2024-09-26 | PNR: Physics-informed Neural Representation for high-resolution LFM reconstruction | Jiayin Zhao et.al. | 2409.18223 | null |
2024-09-26 | 3D Modeling of Moist Convective Inhibition in Hydrogen-Dominated Atmospheres | Namrah Habib et.al. | 2409.18217 | null |
2024-09-26 | Exploring Intrinsic Bond Orbitals in Solids | Benjamin Wöckinger et.al. | 2409.18212 | link |
2024-09-26 | Entanglement renormalization of fractonic anisotropic $\mathbb{Z}_N$ Laplacian models | Yuan Xue et.al. | 2409.18206 | null |
2024-09-26 | Long-lived neutron-star remnants from asymmetric binary neutron star mergers: element formation, kilonova signals and gravitational waves | Sebastiano Bernuzzi et.al. | 2409.18185 | null |
2024-09-26 | Bridging 4D QFTs and 2D VOAs via 3D high-temperature EFTs | Arash Arabi Ardehali et.al. | 2409.18130 | null |
2024-09-26 | LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness | Chenming Zhu et.al. | 2409.18125 | null |
2024-10-02 | Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Jing He et.al. | 2409.18124 | null |
2024-09-26 | Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D Reconstruction | Justin Kerr et.al. | 2409.18121 | link |
2024-09-26 | EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation | Jiaxiang Tang et.al. | 2409.18114 | null |
2024-09-26 | Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot | Justin Yu et.al. | 2409.18108 | null |
2024-09-26 | Self-supervised Pretraining for Cardiovascular Magnetic Resonance Cine Segmentation | Rob A. J. de Mooij et.al. | 2409.18100 | link |
2024-09-26 | A Sim-to-Real Vision-based Lane Keeping System for a 1:10-scale Autonomous Vehicle | Antonio Gallina et.al. | 2409.18097 | null |
2024-09-30 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-09-26 | Stable Video Portraits | Mirela Ostrek et.al. | 2409.18083 | null |
2024-09-26 | Magnetohydrodynamic simulations of A-type stars: Long-term evolution of core dynamo cycles | J. P. Hidalgo et.al. | 2409.18066 | null |
2024-09-26 | FlowBench: A Large Scale Benchmark for Flow Simulation over Complex Geometries | Ronak Tali et.al. | 2409.18032 | null |
2024-09-26 | Distributed Invariant Unscented Kalman Filter based on Inverse Covariance Intersection with Intermittent Measurements | Zhian Ruan et.al. | 2409.17997 | null |
2024-09-26 | CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors | Linye Lyu et.al. | 2409.17963 | link |
2024-09-26 | Impact of solar wind turbulence on a planetary bow shock | E. Behar et.al. | 2409.17942 | null |
2024-09-26 | WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians | Dmytro Kotovenko et.al. | 2409.17917 | null |
2024-09-26 | PhantomLiDAR: Cross-modality Signal Injection Attacks against LiDAR | Zizhi Jin et.al. | 2409.17907 | null |
2024-09-26 | Upper-Body Pose-based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection | Andrea Toaiari et.al. | 2409.17886 | link |
2024-09-26 | Multi-UAV Enabled MEC Networks: Optimizing Delay through Intelligent 3D Trajectory Planning and Resource Allocation | Zhiying Wang et.al. | 2409.17882 | null |
2024-09-26 | Red giant - jet collisions in galactic nuclei I: 3D hydrodynamical model of a few stellar orbits | Petr Kurfürst et.al. | 2409.17773 | null |
2024-09-26 | Reconstructing solar magnetic fields from historical observations X. Effect of magnetic field inclination and boundary structure on AIA 1600 Å emission | Ismo Tähtinen et.al. | 2409.17771 | null |
2024-09-26 | Excited States Band Mapping and Ultrafast Nonequilibrium Dynamics in Topological Dirac Semimetal 1T-ZrTe $_2$ | Sotirios Fragkos et.al. | 2409.17761 | null |
2024-09-26 | LGFN: Lightweight Light Field Image Super-Resolution using Local Convolution Modulation and Global Attention Feature Extraction | Zhongxin Yu et.al. | 2409.17759 | null |
2024-09-26 | TCAD Simulation of Stitching for Passive CMOS Strip Detectors | Marta Baselga et.al. | 2409.17749 | null |
2024-09-26 | TADAR: Thermal Array-based Detection and Ranging for Privacy-Preserving Human Sensing | Xie Zhang et.al. | 2409.17742 | link |
2024-09-26 | Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry | Qi Zhang et.al. | 2409.17729 | null |
2024-09-26 | Recent advances in interpretable machine learning using structure-based protein representations | Luiz Felipe Vecchietti et.al. | 2409.17726 | null |
2024-09-26 | Three-dimensional nanoscale control of magnetism in crystalline Yttrium Iron Garnet | Valerio Levati et.al. | 2409.17722 | null |
2024-09-26 | Event-based Stereo Depth Estimation: A Survey | Suman Ghosh et.al. | 2409.17680 | null |
2024-09-26 | EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation | Ao Chang et.al. | 2409.17675 | link |
2024-09-27 | Leveraging Anthropometric Measurements to Improve Human Mesh Estimation and Ensure Consistent Body Shapes | Katja Ludwig et.al. | 2409.17671 | null |
2024-09-26 | AP-VLM: Active Perception Enabled by Vision-Language Models | Venkatesh Sripada et.al. | 2409.17641 | null |
2024-09-26 | HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian Splatting | Zijun Xu et.al. | 2409.17624 | null |
2024-09-26 | Canonical Representation and Force-Based Pretraining of 3D Tactile for Dexterous Visuo-Tactile Policy Learning | Tianhao Wu et.al. | 2409.17549 | null |
2024-09-26 | Triple Point Masking | Jiaming Liu et.al. | 2409.17547 | link |
2024-09-26 | CAMOT: Camera Angle-aware Multi-Object Tracking | Felix Limanta et.al. | 2409.17533 | null |
2024-09-26 | Global axisymmetric solutions for Navier-Stokes equation with rotation uniformly in the inviscid limit | Haram Ko et.al. | 2409.17528 | null |
2024-09-26 | Reconfigurable Manipulation of Sound with a Multi-material 3D-Printed Origami Metasurface | Dinh Hai Le et.al. | 2409.17522 | null |
2024-09-26 | TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene | Sandika Biswas et.al. | 2409.17459 | link |
2024-09-26 | Preparation and Characterization of High Quality Bi1-xSbx Thin Films: A Sputtering Deposition Approach | G. G. de Almeida et.al. | 2409.17444 | null |
2024-09-25 | Transient Adversarial 3D Projection Attacks on Object Detection in Autonomous Driving | Ce Zhou et.al. | 2409.17403 | null |
2024-09-25 | Observation of spin squeezing with contact interactions in one- and three-dimensional easy-plane magnets | Yoo Kyung Lee et.al. | 2409.17398 | null |
2024-09-25 | An Anatomy-Aware Shared Control Approach for Assisted Teleoperation of Lung Ultrasound Examinations | Davide Nardi et.al. | 2409.17395 | null |
2024-09-25 | A vision-based framework for human behavior understanding in industrial assembly lines | Konstantinos Papoutsakis et.al. | 2409.17356 | null |
2024-09-25 | Multi-Tier Preservation of Discrete Morse Smale Complexes in Error-Bounded Lossy Compression | Yuxiao Li et.al. | 2409.17346 | null |
2024-09-25 | SeaSplat: Representing Underwater Scenes with 3D Gaussian Splatting and a Physically Grounded Image Formation Model | Daniel Yang et.al. | 2409.17345 | null |
2024-09-25 | Small metal artifact detection and inpainting in cardiac CT images | Trevor McKeown et.al. | 2409.17342 | null |
2024-09-25 | ChatCam: Empowering Camera Control through Conversational AI | Xinhang Liu et.al. | 2409.17331 | null |
2024-09-25 | Disco4D: Disentangled 4D Human Generation and Animation from a Single Image | Hui En Pang et.al. | 2409.17280 | null |
2024-09-25 | Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations | Amey Agrawal et.al. | 2409.17264 | null |
2024-09-25 | Atomic Higgsings of 6D SCFTs | Jiakang Bao et.al. | 2409.17224 | null |
2024-09-25 | DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion | Yukun Huang et.al. | 2409.17145 | link |
2024-09-25 | Blox-Net: Generative Design-for-Robot-Assembly Using VLM Supervision, Physics Simulation, and a Robot with Reset | Andrew Goldberg et.al. | 2409.17126 | null |
2024-09-25 | PokeFlex: Towards a Real-World Dataset of Deformable Objects for Robotic Manipulation | Jan Obrist et.al. | 2409.17124 | null |
2024-09-25 | An Active Electromagnetic 3D Surface Cloak | Paris Ang et.al. | 2409.17007 | null |
2024-09-25 | Single Image, Any Face: Generalisable 3D Face Generation | Wenqing Wang et.al. | 2409.16990 | null |
2024-10-01 | Multi-Robot Informative Path Planning for Efficient Target Mapping using Deep Reinforcement Learning | Apoorva Vashisth et.al. | 2409.16967 | link |
2024-09-25 | Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM | Phu Pham et.al. | 2409.16944 | null |
2024-09-25 | Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model | Hongliang Zhong et.al. | 2409.16938 | link |
2024-09-25 | The diverse star formation histories of early massive, quenched galaxies in modern galaxy formation simulations | Claudia del P. Lagos et.al. | 2409.16916 | link |
2024-09-25 | Tailored 3D microphantoms: an essential tool for quantitative phase tomography analysis of organoids | Michal Ziemczonok et.al. | 2409.16888 | null |
2024-09-25 | Towards Unified 3D Hair Reconstruction from Single-View Portraits | Yujian Zheng et.al. | 2409.16863 | null |
2024-09-25 | Limitations of (Procrustes) Alignment in Assessing Multi-Person Human Pose and Shape Estimation | Drazic Martin et.al. | 2409.16861 | null |
2024-09-25 | ESO 137-001 – a jellyfish galaxy model | B. Vollmer et.al. | 2409.16846 | null |
2024-09-25 | Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation | Yulin Wang et.al. | 2409.16818 | link |
2024-09-25 | Performance Boundary Analyses for Statistical Multi-QoS Framework Over 6G SAGINs | Jingqing Wang et.al. | 2409.16811 | null |
2024-09-25 | Rapid Prototyping of 3D Microstructures: A Simplified Grayscale Lithography Encoding Method Using Blender | Fabricio Frizera Borghi et.al. | 2409.16749 | null |
2024-09-25 | The Effect of Lossy Compression on 3D Medical Images Segmentation with Deep Learning | Anvar Kurmukov et.al. | 2409.16733 | null |
2024-10-04 | SDCL: Students Discrepancy-Informed Correction Learning for Semi-supervised Medical Image Segmentation | Bentao Song et.al. | 2409.16728 | link |
2024-09-25 | 3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation | Yi Gu et.al. | 2409.16702 | link |
2024-09-25 | Skyeyes: Ground Roaming using Aerial View Images | Zhiyuan Gao et.al. | 2409.16685 | null |
2024-09-25 | Morphological-consistent Diffusion Network for Ultrasound Coronal Image Enhancement | Yihao Zhou et.al. | 2409.16661 | null |
2024-09-25 | DeformStream: Deformation-based Adaptive Volumetric Video Streaming | Boyan Li et.al. | 2409.16615 | null |
2024-09-25 | Omni 3D: BEOL-Compatible 3D Logic with Omnipresent Power, Signal, and Clock | Suhyeong Choi et.al. | 2409.16608 | null |
2024-09-25 | FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation | Jingyi Tang et.al. | 2409.16600 | null |
2024-09-25 | Device for detection of activity-dependent changes in neural spheroids at MHz and GHz frequencies | Saeed Omidi et.al. | 2409.16552 | null |
2024-09-24 | Distributed Channel Estimation for 6D Movable Antenna: Unveiling Directional Sparsity | Xiaodan Shao et.al. | 2409.16510 | null |
2024-09-24 | Low Latency Point Cloud Rendering with Learned Splatting | Yueyu Hu et.al. | 2409.16504 | link |
2024-09-24 | GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Gennady Sidorov et.al. | 2409.16502 | link |
2024-09-24 | Frequency-based View Selection in Gaussian Splatting Reconstruction | Monica M. Q. Li et.al. | 2409.16470 | null |
2024-09-24 | Partial Elastic Shape Registration of 3D Surfaces using Dynamic Programming | Javier Bernal et.al. | 2409.16462 | null |
2024-09-24 | Underground Mapping and Localization Based on Ground-Penetrating Radar | Jinchang Zhang et.al. | 2409.16446 | null |
2024-09-24 | Hand Gesture Classification Based on Forearm Ultrasound Video Snippets Using 3D Convolutional Neural Networks | Keshav Bimbraw et.al. | 2409.16431 | null |
2024-09-24 | FastTalker: Jointly Generating Speech and Conversational Gestures from Text | Zixin Guo et.al. | 2409.16404 | null |
2024-09-24 | Towards Synthetic Data Generation for Improved Pain Recognition in Videos under Patient Constraints | Jonas Nasimzada et.al. | 2409.16382 | link |
2024-09-24 | Predicting Distance matrix with large language models | Jiaxing Yang et.al. | 2409.16333 | null |
2024-09-17 | Thermo-mechanical Properties of Hierarchical Biocomposite Materials from Photosynthetic Microorganisms | Israel Kellersztein et.al. | 2409.16318 | null |
2024-09-24 | Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking | Xi Wang et.al. | 2409.16287 | null |
2024-09-24 | Extended one-dimensional reduced model for blood flow within a stenotic artery | Suncica Canic et.al. | 2409.16262 | link |
2024-09-24 | Equilibrium expectations for non-Gaussian fluctuations near a QCD critical point | Jamie M. Karthein et.al. | 2409.16249 | null |
2024-09-24 | Material Transport in Protoplanetary Discs with Massive Embedded Planets | Hannah J. Petrovic et.al. | 2409.16245 | null |
2024-09-24 | Upper-body free-breathing Magnetic Resonance Fingerprinting applied to the quantification of water T1 and fat fraction | Constantin Slioussarenko et.al. | 2409.16200 | null |
2024-09-24 | Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation | Xiaohong Liu et.al. | 2409.16183 | null |
2024-09-24 | SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image | Dimitrije Antić et.al. | 2409.16178 | null |
2024-09-24 | MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling | Yifang Men et.al. | 2409.16160 | null |
2024-09-23 | MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving | Xiyang Wang et.al. | 2409.16149 | link |
2024-09-26 | Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities | Peizhi Yan et.al. | 2409.16147 | link |
2024-09-24 | Multi-Model Ensemble Approach for Accurate Bi-Atrial Segmentation in LGE-MRI of Atrial Fibrillation Patients | Lucas Beveridge et.al. | 2409.16083 | null |
2024-09-24 | Generative 3D Cardiac Shape Modelling for In-Silico Trials | Andrei Gasparovici et.al. | 2409.16058 | null |
2024-09-24 | AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model | Zhenghao Qi et.al. | 2409.16019 | link |
2024-09-24 | OpenFMR: A low-cost open-source broadband ferromagnetic resonance spectrometer | Markus Meinert et.al. | 2409.15976 | null |
2024-09-24 | Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality | Hannah Schieber et.al. | 2409.15959 | null |
2024-09-24 | Self-supervised Shape Completion via Involution and Implicit Correspondences | Mengya Liu et.al. | 2409.15939 | link |
2024-09-24 | Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research | Vandita Shukla et.al. | 2409.15914 | null |
2024-09-30 | Unimotion: Unifying 3D Human Motion Synthesis and Understanding | Chuqiao Li et.al. | 2409.15904 | null |
2024-09-24 | FSF-Net: Enhance 4D Occupancy Forecasting with Coarse BEV Scene Flow for Autonomous Driving | Erxin Guo et.al. | 2409.15841 | null |
2024-09-24 | Hyperbolic Image-and-Pointcloud Contrastive Learning for 3D Classification | Naiwen Hu et.al. | 2409.15810 | null |
2024-09-24 | 3D-JEPA: A Joint Embedding Predictive Architecture for 3D Self-Supervised Representation Learning | Naiwen Hu et.al. | 2409.15803 | null |
2024-09-24 | MGNN: Moment Graph Neural Network for Universal Molecular Potentials | Jian Chang et.al. | 2409.15800 | null |
2024-09-24 | LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation | Ruida Zhang et.al. | 2409.15727 | link |
2024-09-24 | Rapid 3D imaging at cellular resolution for digital cytopathology with a multi-camera array scanner (MCAS) | Kanghyun Kim et.al. | 2409.15722 | null |
2024-09-24 | Disentangled Generation and Aggregation for Robust Radiance Fields | Shihe Shen et.al. | 2409.15715 | null |
2024-09-24 | Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB | Jae Yong Lee et.al. | 2409.15689 | null |
2024-09-24 | SYNERGAI: Perception Alignment for Human-Robot Collaboration | Yixin Chen et.al. | 2409.15684 | null |
2024-09-24 | Vortex wall phase in fractonic XY-plaquette model on square lattice | A. M. Begun et.al. | 2409.15638 | null |
2024-09-23 | Three-dimensional large deformation frictional contact treatment using varying-order NURBS discretization in IGA | Vishal Agrawal et.al. | 2409.15621 | null |
2024-09-23 | A Fully Parallelizable Loosely Coupled Scheme for Fluid-Poroelastic Structure Interaction Problems | Shihan Guo et.al. | 2409.15618 | null |
2024-09-23 | Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments | Francisco Roza de Moraes et.al. | 2409.15602 | null |
2024-09-27 | Quantum K-Rings of Partial Flag Varieties, Coulomb Branches, and the Bethe Ansatz | Irit Huq-Kuruvilla et.al. | 2409.15575 | null |
2024-09-23 | SOFI: Multi-Scale Deformable Transformer for Camera Calibration with Enhanced Line Queries | Sebastian Janampa et.al. | 2409.15553 | link |
2024-09-23 | AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions | Samarth Chopra et.al. | 2409.15487 | null |
2024-09-23 | VLMine: Long-Tail Data Mining with Vision Language Models | Mao Ye et.al. | 2409.15486 | null |
2024-09-23 | Framework for Robust Localization of UUVs and Mapping of Net Pens | David Botta et.al. | 2409.15475 | null |
2024-09-23 | Matérn Kernels for Tunable Implicit Surface Reconstruction | Maximilian Weiherer et.al. | 2409.15466 | link |
2024-09-23 | Orthosymplectic Quotient Quiver Subtraction | Sam Bennett et.al. | 2409.15419 | null |
2024-09-23 | Coherence of Multi-Dimensional Pair Production Discharges in Polar Caps of Pulsars | Alexander Chernoglazov et.al. | 2409.15409 | null |
2024-09-23 | Bridging Simulations of Kink Instability in Relativistic Magnetized Jets with Radio Emission and Polarisation | Nikita Upreti et.al. | 2409.15406 | null |
2024-09-23 | MaterialFusion: Enhancing Inverse Rendering with Material Diffusion Priors | Yehonathan Litman et.al. | 2409.15273 | null |
2024-09-28 | ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild | Chen Guo et.al. | 2409.15269 | null |
2024-09-23 | TacPalm: A Soft Gripper with a Biomimetic Optical Tactile Palm for Stable Precise Grasping | Xuyang Zhang et.al. | 2409.15239 | null |
2024-09-23 | Effect of valence electrons on the core level x-ray photoelectron spectra of niobium oxide thin films prepared by molecular beam epitaxy | Jasnamol Palakkal et.al. | 2409.15237 | null |
2024-09-23 | Deep Learning-Based Automated Post-Operative Gross Tumor Volume Segmentation in Glioblastoma Patients | Rajarajeswari Muthusivarajan et.al. | 2409.15177 | null |
2024-09-30 | SpikeGS: Learning 3D Gaussian Fields from Continuous Spike Stream | Jinze Yu et.al. | 2409.15176 | link |
2024-09-23 | Hybrid Drawing Solutions in AR Bitmap-to-Vector Techniques on 3D Surfaces | Pengcheng Ding et.al. | 2409.15171 | null |
2024-09-23 | The geometric phase transition of the three-dimensional $\mathbb{Z}_2$ lattice gauge model | Ramgopal Agrawal et.al. | 2409.15123 | null |
2024-09-23 | FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera | Guoyang Zhao et.al. | 2409.15054 | link |
2024-09-23 | AIM 2024 Sparse Neural Rendering Challenge: Dataset and Benchmark | Michal Nazarczuk et.al. | 2409.15041 | null |
2024-09-23 | Immersed in my Ideas: Using Virtual Reality and Multimodal Interactions to Visualize Users’ Ideas and Thoughts | Yunhao Xing et.al. | 2409.15033 | null |
2024-09-25 | Efficient Nearest Neighbor Search Using Dynamic Programming | Pengfei Wang et.al. | 2409.15023 | null |
2024-09-24 | Sparse-to-Dense LiDAR Point Generation by LiDAR-Camera Fusion for 3D Object Detection | Minseung Lee et.al. | 2409.14985 | null |
2024-09-23 | Improving Adversarial Robustness for 3D Point Cloud Recognition at Test-Time through Purified Self-Training | Jinpeng Lin et.al. | 2409.14940 | null |
2024-09-23 | DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis | Zixuan Wang et.al. | 2409.14925 | link |
2024-09-23 | KARMA: Augmenting Embodied AI Agents with Long-and-short Term Memory Systems | Zixuan Wang et.al. | 2409.14908 | null |
2024-09-23 | Site-site interaction model for alcohol models in two-dimensions | Aurélien Perera et.al. | 2409.14871 | null |
2024-09-23 | Human Hair Reconstruction with Strand-Aligned 3D Gaussians | Egor Zakharov et.al. | 2409.14778 | null |
2024-09-23 | Evaluating electrophysiological and behavioral measures of neural health in cochlear implant users: a computational simulation study | Yixuan Zhang et.al. | 2409.14767 | null |
2024-09-23 | Giant and Flexible Toroidal Circular Dichroism from Planar Chiral Metasurface | Shijie Kang et.al. | 2409.14757 | null |
2024-09-23 | UniBEVFusion: Unified Radar-Vision BEVFusion for 3D Object Detection | Haocheng Zhao et.al. | 2409.14751 | null |
2024-09-23 | ERPoT: Effective and Reliable Pose Tracking for Mobile Robots Based on Lightweight and Compact Polygon Maps | Haiming Gao et.al. | 2409.14723 | null |
2024-09-23 | MEVIUS: A Quadruped Robot Easily Constructed through E-Commerce with Sheet Metal Welding and Machining | Kento Kawaharazuka et.al. | 2409.14721 | null |
2024-09-23 | Reflecting Reality: Enabling Diffusion Models to Produce Faithful Mirror Reflections | Ankit Dhiman et.al. | 2409.14677 | link |
2024-09-23 | Data-driven Viscosity Solver for Fluid Simulation | Wonjung Park et.al. | 2409.14653 | link |
2024-09-22 | AR Overlay: Training Image Pose Estimation on Curved Surface in a Synthetic Way | Sining Huang et.al. | 2409.14577 | null |
2024-09-22 | RobotFingerPrint: Unified Gripper Coordinate Space for Multi-Gripper Grasp Synthesis | Ninad Khargonkar et.al. | 2409.14519 | null |
2024-09-22 | SynBench: A Synthetic Benchmark for Non-rigid 3D Point Cloud Registration | Sara Monji-Azad et.al. | 2409.14474 | null |
2024-09-22 | Pomo3D: 3D-Aware Portrait Accessorizing and More | Tzu-Chieh Liu et.al. | 2409.14430 | null |
2024-09-25 | D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation | Songlin Wei et.al. | 2409.14365 | null |
2024-09-27 | In-place Switch: Reprogramming based SLC Cache Design for Hybrid 3D SSDs | Xufeng Yang et.al. | 2409.14360 | null |
2024-09-22 | MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Wangze Xu et.al. | 2409.14316 | null |
2024-09-22 | HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal Navigation | Naoki Yokoyama et.al. | 2409.14296 | null |
2024-09-21 | Combining Absolute and Semi-Generalized Relative Poses for Visual Localization | Vojtech Panek et.al. | 2409.14269 | null |
2024-09-26 | GND: Global Navigation Dataset with Multi-Modal Perception and Multi-Category Traversability in Outdoor Campus Environments | Jing Liang et.al. | 2409.14262 | null |
2024-09-21 | A prospectus on the surface metrology of seborrheic keratoses | Nicole Werpachowski et.al. | 2409.14250 | null |
2024-09-21 | End to End Face Reconstruction via Differentiable PnP | Yiren Lu et.al. | 2409.14249 | null |
2024-09-21 | An Example of Microwave Diagnosis for Knee Osteophyte by 3D Parallel FD-FDTD Approach | Wenyi Shao et.al. | 2409.14236 | null |
2024-09-21 | An Efficient Modified MUSIC Algorithm for RIS-Assisted Near-Field Localization | Parisa Ramezani et.al. | 2409.14152 | null |
2024-09-21 | A Hands-on Experience with a Novel Scintillation Detector for Particle Physics | Anja Bitar et.al. | 2409.14140 | null |
2024-09-21 | ExFMan: Rendering 3D Dynamic Humans with Hybrid Monocular Blurry Frames and Events | Kanghao Chen et.al. | 2409.14103 | null |
2024-09-21 | The well-posedness and regularity of the Non-stationary Stokes and Navier-Stokes equations with the friction-type interface condition | Qi Wang et.al. | 2409.14098 | null |
2024-09-21 | BRep Boundary and Junction Detection for CAD Reverse Engineering | Sk Aziz Ali et.al. | 2409.14087 | link |
2024-09-21 | SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality | Hongjia Zhai et.al. | 2409.14067 | null |
2024-09-21 | Quantitative convergence for mean field control with common noise and degenerate idiosyncratic noise | Alekos Cecchin et.al. | 2409.14053 | null |
2024-09-21 | Vortex Interference Enables optimal 3D Interferometric Nanoscopy | Wei Wang et.al. | 2409.14033 | null |
2024-09-21 | Convexification for the 3D Problem of Travel Time Tomography | Michael V. Klibanov et.al. | 2409.14025 | null |
2024-09-21 | Point Cloud Structural Similarity-based Underwater Sonar Loop Detection | Donghwi Jung et.al. | 2409.14020 | link |
2024-09-21 | MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors | Zhenhua Du et.al. | 2409.14019 | null |
2024-09-21 | CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise | Fuyang Yu et.al. | 2409.13982 | null |
2024-09-21 | Improving 3D Semi-supervised Learning by Effectively Utilizing All Unlabelled Data | Sneha Paul et.al. | 2409.13977 | link |
2024-09-21 | Detecting Inpainted Video with Frequency Domain Insights | Quanhui Tang et.al. | 2409.13976 | null |
2024-09-21 | Pseudo-3D visualization of Faraday structure in polarized radio sources: methods, science use cases, and development priorities | Lawrence Rudnick et.al. | 2409.13973 | link |
2024-09-21 | Description of the first order phase transition region of an equation of state for QCD with a critical point | Jamie M. Karthein et.al. | 2409.13961 | null |
2024-09-21 | Periodic micromagnetic finite element method | Fangzhou Ai et.al. | 2409.13958 | null |
2024-09-20 | SpaceBlender: Creating Context-Rich Collaborative Spaces Through Generative 3D Scene Blending | Nels Numan et.al. | 2409.13926 | null |
2024-09-20 | Tactile Neural De-rendering | Jose A. Eyzaguirre et.al. | 2409.13923 | null |
2024-09-20 | PanoCoach: Enhancing Tactical Coaching and Communication in Soccer with Mixed-Reality Telepresence | Andrew Kang et.al. | 2409.13859 | null |
2024-09-20 | Comparative Planetology of Magnetic Effects in Ultrahot Jupiters: Trends in High Resolution Spectroscopy | Hayley Beltz et.al. | 2409.13840 | null |
2024-09-20 | Transfer Learning and Double U-Net Empowered Wave Propagation Model in Complex Indoor Environment | Ziheng Fu et.al. | 2409.13833 | null |
2024-09-20 | Kinematic analysis of $\mathbf{z = 4.3}$ galaxies in the SPT2349$-$ 56 protocluster core | Aparna Venkateshwaran et.al. | 2409.13823 | null |
2024-09-19 | AutoPET III Challenge: Tumor Lesion Segmentation using ResEnc-Model Ensemble | Tanya Chutani et.al. | 2409.13779 | null |
2024-09-20 | Portrait Video Editing Empowered by Multimodal Generative Priors | Xuan Gao et.al. | 2409.13591 | null |
2024-09-20 | Asymptotic properties of discretely self-similar Navier-Stokes solutions with rough data | Zachary Bradshaw et.al. | 2409.13586 | null |
2024-09-20 | Tackling fluffy clouds: field boundaries detection using time series of S2 and/or S1 imagery | Foivos I. Diakogiannis et.al. | 2409.13568 | link |
2024-09-20 | Formula-Supervised Visual-Geometric Pre-training | Ryosuke Yamada et.al. | 2409.13535 | null |
2024-09-20 | Closed-loop shape control of deformable linear objects based on Cosserat model | Azad Artinian et.al. | 2409.13522 | null |
2024-09-20 | Dense cell-by-cell systems of PDEs: approximation, spectral analysis, and preconditioning | Pietro Benedusi et.al. | 2409.13432 | null |
2024-09-25 | CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction | Zhangchen Ye et.al. | 2409.13430 | link |
2024-09-20 | Occupancy-Based Dual Contouring | Jisung Hwang et.al. | 2409.13418 | link |
2024-09-20 | Micro-transfer printing of GaSb optoelectronics chips for mid-infrared silicon photonics integrated circuits | Heidi Tuorila et.al. | 2409.13413 | null |
2024-09-20 | Validation & Exploration of Multimodal Deep-Learning Camera-Lidar Calibration models | Venkat Karramreddy et.al. | 2409.13402 | null |
2024-09-20 | Elite-EvGS: Learning Event-based 3D Gaussian Splatting by Distilling Event-to-Video Priors | Zixin Zhang et.al. | 2409.13392 | null |
2024-09-20 | Feature-Centered First Order Structure Tensor Scale-Space in 2D and 3D | Pawel Tomasz Pieta et.al. | 2409.13389 | link |
2024-09-20 | V-Hands: Touchscreen-based Hand Tracking for Remote Whiteboard Interaction | Xinshuang Liu et.al. | 2409.13347 | null |
2024-09-20 | Towards Semi-supervised Dual-modal Semantic Segmentation | Qiulei Dong et.al. | 2409.13325 | null |
2024-09-20 | Distributed Control for 3D Inspection using Multi-UAV Systems | Angelos Zacharia et.al. | 2409.13302 | null |
2024-09-20 | 3D-GSW: 3D Gaussian Splatting Watermark for Protecting Copyrights in Radiance Fields | Youngdong Jang et.al. | 2409.13222 | null |
2024-09-20 | A solution for co-locating 2D histology images in 3D for histology-to-CT and MR image registration: closing the loop for bone sarcoma treatment planning | Robert Phillips et.al. | 2409.13217 | null |
2024-09-20 | How we simulate DNA origami | Sarah Haggenmueller et.al. | 2409.13206 | null |
2024-09-20 | FreeAvatar: Robust 3D Facial Animation Transfer by Learning an Expression Foundation Model | Feng Qiu et.al. | 2409.13180 | null |
2024-09-20 | Deep Learning based Optical Image Super-Resolution via Generative Diffusion Models for Layerwise in-situ LPBF Monitoring | Francis Ogoke et.al. | 2409.13171 | null |
2024-09-20 | Towards Zero-shot Point Cloud Anomaly Detection: A Multi-View Projection Framework | Yuqi Cheng et.al. | 2409.13162 | link |
2024-09-20 | Beyond Skip Connection: Pooling and Unpooling Design for Elimination Singularities | Chengkun Sun et.al. | 2409.13154 | null |
2024-09-20 | Learning Visual Information Utility with PIXER | Yash Turkar et.al. | 2409.13151 | null |
2024-09-20 | GASA-UNet: Global Axial Self-Attention U-Net for 3D Medical Image Segmentation | Chengkun Sun et.al. | 2409.13146 | null |
2024-09-20 | Score-Based Multibeam Point Cloud Denoising | Li Ling et.al. | 2409.13143 | null |
2024-09-19 | Interpretable Action Recognition on Hard to Classify Actions | Anastasia Anichenko et.al. | 2409.13091 | null |
2024-09-19 | MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting | Yan Song Hu et.al. | 2409.13055 | null |
2024-09-19 | Semi-overcomplete convolutional auto-encoder embedding as shape priors for deep vessel segmentation | Amine Sadikine et.al. | 2409.13001 | null |
2024-09-19 | Improving generalisability of 3D binding affinity models in low data regimes | Julia Buhmann et.al. | 2409.12995 | null |
2024-09-19 | Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution | Zuyan Liu et.al. | 2409.12961 | link |
2024-09-19 | 3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion | Zhaoxi Chen et.al. | 2409.12957 | link |
2024-09-19 | GStex: Per-Primitive Texturing of 2D Gaussian Splatting for Decoupled Appearance and Geometry Modeling | Victor Rong et.al. | 2409.12954 | link |
2024-09-19 | Asymptotic stability for the 3D Navier-Stokes equations in $L^3$ and nearby spaces | Zachary Bradshaw et.al. | 2409.12918 | null |
2024-09-19 | LI-GS: Gaussian Splatting with LiDAR Incorporated for Accurate Large-Scale Reconstruction | Changjian Jiang et.al. | 2409.12899 | null |
2024-09-19 | 3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt | Lukas Höllein et.al. | 2409.12892 | link |
2024-09-19 | EdgeGaussians – 3D Edge Mapping via Gaussian Splatting | Kunal Chelani et.al. | 2409.12886 | link |
2024-09-19 | Physics aware machine learning for micromagnetic energy minimization: recent algorithmic developments | Sebastian Schaffer et.al. | 2409.12877 | null |
2024-09-19 | Matter-coupled higher spin gravities in 3d: no- and yes-go results | Alexey Sharapov et.al. | 2409.12830 | null |
2024-09-19 | Angular Divergent Component of Motion: A step towards planning Spatial DCM Objectives for Legged Robots | Connor W. Herron et.al. | 2409.12796 | null |
2024-09-19 | TEAM PILOT – Learned Feasible Extendable Set of Dynamic MRI Acquisition Trajectories | Tamir Shor et.al. | 2409.12777 | null |
2024-09-24 | GaRField++: Reinforced Gaussian Radiance Fields for Large-Scale 3D Scene Reconstruction | Hanyue Zhang et.al. | 2409.12774 | null |
2024-09-19 | Spectral-GS: Taming 3D Gaussian Splatting with Spectral Entropy | Letian Huang et.al. | 2409.12771 | null |
2024-09-19 | DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input | Qijian Tian et.al. | 2409.12753 | link |
2024-09-19 | Optimal Cosserat-based deformation control for robotic manipulation of linear objects | Azad Artinian et.al. | 2409.12723 | null |
2024-09-18 | Physics-Informed Neural Networks can accurately model cardiac electrophysiology in 3D geometries and fibrillatory conditions | Ching-En Chiu et.al. | 2409.12712 | null |
2024-09-19 | Classical and Quantum mechanics on 3D contact manifolds | Yves Colin de Verdìère et.al. | 2409.12665 | null |
2024-09-19 | Stiff and Deformable Quasicrystalline Architected Materials | Matheus I. N. Rosa et.al. | 2409.12652 | null |
2024-09-19 | From C-Band to mmWave-Band: Ray-Tracing-Assisted 5G-Based Indoor Positioning in Industrial Scenario | Karthik Muthineni et.al. | 2409.12624 | null |
2024-09-23 | Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving | Sándor Kunsági-Máté et.al. | 2409.12620 | link |
2024-09-19 | CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications | Vladimir Frolov et.al. | 2409.12617 | null |
2024-09-19 | Enhancing Agricultural Environment Perception via Active Vision and Zero-Shot Learning | Michele Carlo La Greca et.al. | 2409.12602 | link |
2024-09-19 | Increased resistance to photooxidation in Dion-Jacobson lead halide perovskites – implication for perovskite device stability | Zhilin Ren et.al. | 2409.12556 | null |
2024-09-19 | MambaClinix: Hierarchical Gated Convolution and Mamba-Based U-Net for Enhanced 3D Medical Image Segmentation | Chenyuan Bian et.al. | 2409.12533 | link |
2024-09-19 | Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2409.12518 | link |
2024-09-19 | Methodology for 3D sound synthesis of directional acoustic sources by higher-order ambisonics | Philippe Thorner et.al. | 2409.12506 | null |
2024-09-19 | Accurately Tracking Relative Positions of Moving Trackers based on UWB Ranging and Inertial Sensing without Anchors | Rayan Armani et.al. | 2409.12505 | null |
2024-09-19 | Arena 4.0: A Comprehensive ROS2 Development and Benchmarking Platform for Human-centric Navigation Using Generative-Model-based Environment Generation | Volodymyr Shcherbyna1 et.al. | 2409.12471 | null |
2024-09-19 | Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction | Sibo Tian et.al. | 2409.12456 | null |
2024-09-19 | MuxHand: A Cable-driven Dexterous Robotic Hand Using Time-division Multiplexing Motors | Jianle Xu et.al. | 2409.12455 | null |
2024-09-19 | Enhancing 3D Robotic Vision Robustness by Minimizing Adversarial Mutual Information through a Curriculum Training Approach | Nastaran Darabi et.al. | 2409.12379 | link |
2024-09-19 | Asymptotic Stability of 3D Out-flowing Compressible Viscous Fluid under Non-Spherical Perturbation | Yucong Huang et.al. | 2409.12373 | null |
2024-09-18 | A Learning-based Controller for Multi-Contact Grasps on Unknown Objects with a Dexterous Hand | Dominik Winkelbauer et.al. | 2409.12339 | null |
2024-09-18 | Deep vessel segmentation with joint multi-prior encoding | Amine Sadikine et.al. | 2409.12334 | null |
2024-09-18 | Scale-specific auxiliary multi-task contrastive learning for deep liver vessel segmentation | Amine Sadikine et.al. | 2409.12333 | null |
2024-09-18 | Towards a response function for the COSI anticoincidence system: preliminary results from Geant4 simulations | Alex Ciabattoni et.al. | 2409.12327 | null |
2024-09-18 | ReFu: Recursive Fusion for Exemplar-Free 3D Class-Incremental Learning | Yi Yang et.al. | 2409.12326 | null |
2024-09-18 | Depth Estimation Based on 3D Gaussian Splatting Siamese Defocus | Jinchang Zhang et.al. | 2409.12323 | null |
2024-09-18 | WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild | Rolandos Alexandros Potamias et.al. | 2409.12259 | link |
2024-09-16 | ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Han Ling et.al. | 2409.12202 | link |
2024-09-18 | Vista3D: Unravel the 3D Darkside of a Single Image | Qiuhong Shen et.al. | 2409.12193 | link |
2024-09-18 | Bundle Adjustment in the Eager Mode | Zitong Zhan et.al. | 2409.12190 | null |
2024-09-18 | Massively Multi-Person 3D Human Motion Forecasting with Scene Context | Felix B Mueller et.al. | 2409.12189 | link |
2024-09-25 | BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling | Lulin Zhang et.al. | 2409.12014 | link |
2024-09-18 | Panoptic-Depth Forecasting | Juana Valeria Hurtado et.al. | 2409.12008 | null |
2024-09-18 | Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments | Gang Chen et.al. | 2409.11975 | link |
2024-09-18 | MitoSeg: Mitochondria Segmentation Tool | Faris Serdar Taşel et.al. | 2409.11974 | link |
2024-09-18 | GaussianHeads: End-to-End Learning of Drivable Gaussian Head Avatars from Coarse-to-fine Representations | Kartik Teotia et.al. | 2409.11951 | null |
2024-09-18 | GauTOAO: Gaussian-based Task-Oriented Affordance of Objects | Jiawen Wang et.al. | 2409.11941 | null |
2024-09-18 | Differentiable Collision-Supervised Tooth Arrangement Network with a Decoupling Perspective | Zhihui He et.al. | 2409.11937 | null |
2024-09-18 | Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models | Lorenzo Mandelli et.al. | 2409.11920 | null |
2024-09-18 | Borophane as substrate for adsorption of He-4: A journey across dimensionality | Stefania De Palo et.al. | 2409.11913 | null |
2024-09-18 | Tumor aware recurrent inter-patient deformable image registration of computed tomography scans with lung cancer | Jue Jiang et.al. | 2409.11910 | null |
2024-09-18 | SpheriGait: Enriching Spatial Representation via Spherical Projection for LiDAR-based Gait Recognition | Yanxi Wang et.al. | 2409.11869 | null |
2024-09-18 | Electric field control for experiments with atoms in Rydberg states | Aishik Panja et.al. | 2409.11865 | null |
2024-09-18 | Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments | Lei Cheng et.al. | 2409.11854 | null |
2024-09-18 | World of Forms: Deformable Geometric Templates for One-Shot Surface Meshing in Coronary CT Angiography | Rudolf L. M. van Herten et.al. | 2409.11837 | null |
2024-09-18 | Smart Data-Driven GRU Predictor for SnO $_2$ Thin films Characteristics | Faiza Bouamra et.al. | 2409.11782 | null |
2024-09-18 | RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework | Xiaoyu Li et.al. | 2409.11749 | null |
2024-09-18 | Three-dimensional topological valley photonics | Wenhao Li et.al. | 2409.11715 | null |
2024-09-18 | Three-dimensional valley-contrasting sound | Haoran Xue et.al. | 2409.11714 | null |
2024-09-18 | Revolutionizing Pharmaceutical Manufacturing: Advances and Challenges of 3D Printing System and Control | Rahul Kumar et.al. | 2409.11712 | null |
2024-09-18 | LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution | Shiyu Feng et.al. | 2409.11711 | null |
2024-09-18 | SLAM assisted 3D tracking system for laparoscopic surgery | Jingwei Song et.al. | 2409.11688 | null |
2024-09-18 | SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation | Mingze Sun et.al. | 2409.11682 | link |
2024-09-18 | Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks | Joji Joseph et.al. | 2409.11681 | link |
2024-09-19 | WALLABY Pilot Survey: HI source-finding with a machine learning framework | Li Wang et.al. | 2409.11668 | null |
2024-09-17 | DiffESM: Conditional Emulation of Temperature and Precipitation in Earth System Models with 3D Diffusion Models | Seth Bassetti et.al. | 2409.11601 | null |
2024-09-17 | 3D Water Quality Mapping using Invariant Extended Kalman Filtering for Underwater Robot Localization | Kaustubh Joshi et.al. | 2409.11578 | null |
2024-09-17 | VALO: A Versatile Anytime Framework for LiDAR-based Object Detection Deep Neural Networks | Ahmet Soyyigit et.al. | 2409.11542 | link |
2024-09-17 | Using Physics Informed Generative Adversarial Networks to Model 3D porous media | Zihan Ren et.al. | 2409.11541 | null |
2024-09-17 | Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information | Kunal Chelani et.al. | 2409.11536 | null |
2024-09-17 | Rigid Body Path Planning using Mixed-Integer Linear Programming | Mingxin Yu et.al. | 2409.11520 | null |
2024-09-17 | Instability and warping in vertically oscillating accretion discs | Loren E. Held et.al. | 2409.11490 | null |
2024-09-11 | Next-generation Probabilistic Computing Hardware with 3D MOSAICs, Illusion Scale-up, and Co-design | Tathagata Srimani et.al. | 2409.11422 | null |
2024-09-17 | Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion | Zhenwei Wang et.al. | 2409.11406 | null |
2024-09-17 | Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements | Jipeng Yan et.al. | 2409.11391 | null |
2024-09-17 | Learning Spatially-Aware Language and Audio Embedding | Bhavika Devnani et.al. | 2409.11369 | null |
2024-09-17 | Ping! Your Food is Ready: Comparing Different Notification Techniques in 3D AR Cooking Environment | Aditya Raikwar et.al. | 2409.11357 | null |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-17 | fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction | Jianxiong Gao et.al. | 2409.11315 | null |
2024-09-17 | GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module | Yichen Zhang et.al. | 2409.11307 | null |
2024-09-18 | TTT-Unet: Enhancing U-Net with Test-Time Training Layers for Biomedical Image Segmentation | Rong Zhou et.al. | 2409.11299 | link |
2024-09-17 | SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction | Marko Mihajlovic et.al. | 2409.11211 | null |
2024-09-17 | Air-FAR: Fast and Adaptable Routing for Aerial Navigation in Large-scale Complex Unknown Environments | Botao He et.al. | 2409.11188 | null |
2024-09-13 | MAISI: Medical AI for Synthetic Imaging | Pengfei Guo et.al. | 2409.11169 | link |
2024-09-17 | UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height | Zichen Yu et.al. | 2409.11160 | null |
2024-09-17 | The Impact of Icy Cometary ‘Impacts’ on Exoplanetary Atmospheres I: Tidally-Locked Terrestrial Exoplanets | Felix Sainsbury-Martinez et.al. | 2409.11151 | null |
2024-09-17 | Use the Force, Bot! – Force-Aware ProDMP with Event-Based Replanning | Paul Werner Lödige et.al. | 2409.11144 | null |
2024-09-23 | Geometric Formula for 2d Ising Zeros: Examples & Numerics | Iñaki Garay et.al. | 2409.11109 | null |
2024-09-17 | Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB | Alessandro Simoni et.al. | 2409.11104 | null |
2024-09-17 | Data-driven stochastic 3D modeling of the nanoporous binder-conductive additive phase in battery cathodes | Phillip Gräfensteiner et.al. | 2409.11080 | null |
2024-09-17 | Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | Rui Yu et.al. | 2409.11018 | null |
2024-09-17 | Virtual Reality for Immersive Education in Orthopedic Surgery Digital Twins | Jonas Hein et.al. | 2409.11014 | null |
2024-09-17 | Enhanced segmentation of femoral bone metastasis in CT scans of patients using synthetic data generation with 3D diffusion models | Emile Saillard et.al. | 2409.11011 | null |
2024-09-17 | GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure | Ziheng Xu et.al. | 2409.10982 | null |
2024-09-17 | Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells | Ankit Butola et.al. | 2409.10971 | null |
2024-09-17 | Gestalt driven augmented collimator widget for precise 5 dof dental drill tool positioning in 3d space | Mine Dastan et.al. | 2409.10960 | null |
2024-09-21 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
2024-09-17 | Multi-Floor Zero-Shot Object Navigation Policy | Lingfeng Zhang et.al. | 2409.10906 | null |
2024-09-17 | TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection | Philip Jacobson et.al. | 2409.10901 | null |
2024-09-17 | 3DFacePolicy: Speech-Driven 3D Facial Animation with Diffusion Policy | Xuanmeng Sha et.al. | 2409.10848 | null |
2024-09-18 | Single-Layer Learnable Activation for Implicit Neural Representation (SL $^{2}$ A-INR) | Moein Heidari et.al. | 2409.10836 | null |
2024-09-18 | Context-Dependent Interactable Graphical User Interface Element Detection for Spatial Computing Applications | Shuqing Li et.al. | 2409.10811 | null |
2024-09-17 | Global solutions to 3D quadratic nonlinear Schrödinger-type equation | Zihua Guo et.al. | 2409.10804 | null |
2024-09-16 | Ideal flat and resolved SU(3) Landau levels in three dimensions | Mian Peng et.al. | 2409.10785 | null |
2024-09-16 | Short-Lived Gravitational Instability in Isolated Irradiated Discs | Sahl Rowther et.al. | 2409.10765 | null |
2024-09-16 | The Spin Zone: Synchronously and Asynchronously Rotating Exoplanets Have Spectral Differences in Transmission | Nicholas Scarsdale et.al. | 2409.10752 | null |
2024-09-16 | Depth from Coupled Optical Differentiation | Junjie Luo et.al. | 2409.10725 | link |
2024-09-20 | CoMamba: Real-time Cooperative Perception Unlocked with State Space Models | Jinlong Li et.al. | 2409.10699 | null |
2024-09-16 | Online Diffusion-Based 3D Occupancy Prediction at the Frontier with Probabilistic Map Reconciliation | Alec Reed et.al. | 2409.10681 | null |
2024-09-16 | The VIRUS-dE Survey I: Stars in dwarf elliptical galaxies - 3D dynamics and radially resolved stellar initial mass functions | Mathias Lipka et.al. | 2409.10518 | null |
2024-09-16 | Partial Distribution Matching via Partial Wasserstein Adversarial Networks | Zi-Ming Wang et.al. | 2409.10499 | null |
2024-09-16 | Radar Teach and Repeat: Architecture and Initial Field Testing | Xinyuan Qiao et.al. | 2409.10491 | link |
2024-09-16 | Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance | Simone Maurizio La Cava et.al. | 2409.10481 | null |
2024-09-16 | Magnetic metamaterials by ion-implantation | Christina Vantaraki et.al. | 2409.10433 | null |
2024-09-16 | 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation? | Téo Guichoux et.al. | 2409.10357 | null |
2024-09-16 | Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation | Yifan Xu et.al. | 2409.10350 | null |
2024-09-16 | MEGS: Morphological Evaluation of Galactic Structure | Ufuk Çakır et.al. | 2409.10346 | link |
2024-09-16 | Phys3DGS: Physically-based 3D Gaussian Splatting for Inverse Rendering | Euntae Choi et.al. | 2409.10335 | null |
2024-09-16 | Sensitivity analysis with a 3D mixed-dimensional code for DC geoelectrical investigations of landfills: synthetic tests | Lorenzo Panzeri et.al. | 2409.10326 | null |
2024-09-22 | Haralick texture feature analysis for Monte Carlo dose distributions of permanent implant prostate brachytherapy | Iymad R. Mansour et.al. | 2409.10324 | null |
2024-09-16 | 3D ISM structure challenges the Serkowski relation | Nikolaos Mandarakas et.al. | 2409.10317 | null |
2024-09-16 | Anatomical Positional Embeddings | Mikhail Goncharov et.al. | 2409.10291 | link |
2024-09-16 | Co-Designing Dynamic Mixed Reality Drill Positioning Widgets: A Collaborative Approach with Dentists in a Realistic Setup | Mine Dastan et.al. | 2409.10258 | null |
2024-09-16 | Precise Tool to Target Positioning Widgets (TOTTA) in Spatial Environments: A Systematic Review | Mine Dastan et.al. | 2409.10239 | null |
2024-09-16 | BEINGS: Bayesian Embodied Image-goal Navigation with Gaussian Splatting | Wugang Meng et.al. | 2409.10216 | link |
2024-09-16 | Neuromorphic Facial Analysis with Cross-Modal Supervision | Federico Becattini et.al. | 2409.10213 | null |
2024-09-16 | NEUSIS: A Compositional Neuro-Symbolic Framework for Autonomous Perception, Reasoning, and Planning in Complex UAV Search Missions | Zhixi Cai et.al. | 2409.10196 | null |
2024-09-16 | RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models | Başak Melis Öcal et.al. | 2409.10180 | null |
2024-09-16 | AutoPET Challenge III: Testing the Robustness of Generalized Dice Focal Loss trained 3D Residual UNet for FDG and PSMA Lesion Segmentation from Whole-Body PET/CT Images | Shadab Ahamed et.al. | 2409.10151 | link |
2024-09-16 | PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion | Peng Li et.al. | 2409.10141 | null |
2024-09-16 | The i-TED Compton Camera Array for real-time boron imaging and determination during treatments in Boron Neutron Capture Therapy | Pablo Torres-Sánchez et.al. | 2409.10107 | null |
2024-09-16 | Industry 6.0: New Generation of Industry driven by Generative AI and Swarm of Heterogeneous Robots | Artem Lykov et.al. | 2409.10106 | null |
2024-09-16 | Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | Huy-Dung Nguyen et.al. | 2409.10095 | null |
2024-09-19 | IRIS: Interactive Responsive Intelligent Segmentation for 3D Affordance Analysis | Meng Chu et.al. | 2409.10078 | null |
2024-09-16 | DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments | Mahmud A. Mohamad et.al. | 2409.10041 | link |
2024-09-20 | Dynamics of the quintic wave equation with nonlocal weak damping | Feng Zhou et.al. | 2409.10035 | null |
2024-09-16 | Embodiment-Agnostic Action Planning via Object-Part Scene Flow | Weiliang Tang et.al. | 2409.10032 | null |
2024-09-16 | Integrating Experiment with Theory to Determine the Structure of Electrode-Electrolyte Interfaces | Lalith Krishna Samanth Bonagiri et.al. | 2409.10008 | null |
2024-09-18 | ViewActive: Active viewpoint optimization from a single image | Jiayi Wu et.al. | 2409.09997 | link |
2024-09-15 | Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation | Zhanteng Xie et.al. | 2409.09899 | null |
2024-09-15 | GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion | Vitor Guizilini et.al. | 2409.09896 | null |
2024-09-15 | Materials Matter: Investigating Functional Advantages of Bio-Inspired Materials via Simulated Robotic Hopping | Andrew K. Schulz et.al. | 2409.09895 | link |
2024-09-15 | The shape of convection in 2D and 3D global simulations of stellar interiors | M. -G. Dethero et.al. | 2409.09815 | null |
2024-09-15 | MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation | Shuzhao Xie et.al. | 2409.09756 | null |
2024-09-15 | Efficient 3D Bayesian Full Waveform Inversion and Analysis of Prior Hypotheses | Xuebin Zhao et.al. | 2409.09746 | null |
2024-09-17 | VGG-Tex: A Vivid Geometry-Guided Facial Texture Estimation Model for High Fidelity Monocular 3D Face Reconstruction | Haoyu Wu et.al. | 2409.09740 | null |
2024-09-15 | Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild | Nie Lin et.al. | 2409.09714 | null |
2024-09-15 | Structure and magnetic properties of a family of two-leg spin ladder compounds Ba2RE2Ge4O13 (RE = Pr, Nd, and Gd-Ho) with strong rung interaction | Jin Zhou et.al. | 2409.09686 | null |
2024-09-14 | Liquid crystal torons in Poiseuille-like flows | Guilherme N. C. Amaral et.al. | 2409.09486 | null |
2024-09-14 | Innovative schemes for Correlation Plenoptic Imaging | Gianlorenzo Massaro et.al. | 2409.09459 | null |
2024-09-14 | Plenoptic microscopy and photography from intensity correlations | Francesco V. Pepe et.al. | 2409.09456 | null |
2024-09-17 | The Properties of Glass Fiber Reinforced Polypropylene Filaments Recycled from Fishing Gear | Garrett Russell et.al. | 2409.09445 | null |
2024-09-14 | KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition | Zhaoyu Chen et.al. | 2409.09444 | null |
2024-09-14 | A note for double Hölder regularity of the hydrodynamic pressure for weak solutions of Euler equations | Siran Li et.al. | 2409.09433 | null |
2024-09-14 | Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval | Amirreza Mahbod et.al. | 2409.09430 | link |
2024-09-14 | Local-in-time analytic solutions for an inviscid model of superfluidity in 3D | Pranava Chaitanya Jayanti et.al. | 2409.09404 | null |
2024-09-14 | MotionTTT: 2D Test-Time-Training Motion Estimation for 3D Motion Corrected MRI | Tobit Klug et.al. | 2409.09370 | null |
2024-09-14 | OPUS: Occupancy Prediction Using a Sparse Set | Jiabao Wang et.al. | 2409.09350 | link |
2024-09-20 | Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks | Ali Mehrabian et.al. | 2409.09323 | link |
2024-09-14 | Real-Time Stochastic Terrain Mapping and Processing for Autonomous Safe Landing | Kento Tomita et.al. | 2409.09309 | null |
2024-09-14 | ManiDext: Hand-Object Manipulation Synthesis via Continuous Correspondence Embeddings and Residual-Guided Diffusion | Jiajun Zhang et.al. | 2409.09300 | null |
2024-09-14 | GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians | Dasong Gao et.al. | 2409.09295 | link |
2024-09-14 | StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads | Suzhen Wang et.al. | 2409.09292 | null |
2024-09-14 | SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2 | Xinrun Chen et.al. | 2409.09286 | link |
2024-09-14 | VSFormer: Mining Correlations in Flexible View Set for Multi-view 3D Shape Understanding | Hongyu Sun et.al. | 2409.09254 | link |
2024-09-13 | FiAt-Net: Detecting Fibroatheroma Plaque Cap in 3D Intravascular OCT Images | Yaopeng Peng et.al. | 2409.09188 | null |
2024-09-06 | 3D System Design: A Case for Building Customized Modular Systems in 3D | Philip Emma et.al. | 2409.09068 | null |
2024-09-13 | FreeMHD: validation and verification of the open-source, multi-domain, multi-phase solver for electrically conductive flows | Brian Wynne et.al. | 2409.08950 | null |
2024-09-17 | A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis | Yohan Poirier-Ginter et.al. | 2409.08947 | null |
2024-09-13 | ClearDepth: Enhanced Stereo Perception of Transparent Objects for Robotic Manipulation | Kaixin Bai et.al. | 2409.08926 | null |
2024-09-13 | Characterization of M51 supernova remnants with the imaging spectrometer SITELLE | Billy Gamache et.al. | 2409.08888 | null |
2024-09-13 | DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s) | Yun Su Jeong et.al. | 2409.08850 | null |
2024-09-13 | Kinect Calibration and Data Optimization For Anthropometric Parameters | M. S. Gokmen et.al. | 2409.08847 | null |
2024-09-13 | Direct-CP: Directed Collaborative Perception for Connected and Autonomous Vehicles via Proactive Attention | Yihang Tao et.al. | 2409.08840 | null |
2024-09-13 | Evolution and the quasistationary state of collective fast neutrino flavor conversion in three dimensions without axisymmetry | Manu George et.al. | 2409.08833 | null |
2024-09-13 | Towards Precise 3D Quantum Control of a Levitated Dipolar Scatterer using Spatial Mode Decomposition | Thomas Dinter et.al. | 2409.08827 | null |
2024-09-13 | Contactless Fingerprint Recognition Using 3D Graph Matching | Zhe Cui et.al. | 2409.08782 | null |
2024-09-17 | Simulation of a (3+1)D glasma in Milne coordinates: Topological charge, eccentricity, and angular momentum | Hidefumi Matsuda et.al. | 2409.08742 | null |
2024-09-13 | Autoregressive Sequence Modeling for 3D Medical Image Representation | Siwen Wang et.al. | 2409.08691 | null |
2024-09-13 | AdR-Gaussian: Accelerating Gaussian Splatting with Adaptive Radius | Xinzhe Wang et.al. | 2409.08669 | null |
2024-09-13 | SkinFormer: Learning Statistical Texture Representation with Transformer for Skin Lesion Segmentation | Rongtao Xu et.al. | 2409.08652 | link |
2024-09-13 | SynthAorta: A 3D Mesh Dataset of Parametrized Physiological Healthy Aortas | Domagoj Bošnjak et.al. | 2409.08635 | link |
2024-09-13 | DrawingSpinUp: 3D Animation from Single Character Drawings | Jie Zhou et.al. | 2409.08615 | null |
2024-09-13 | Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints | Shan Chen et.al. | 2409.08613 | null |
2024-09-13 | Second-order difference subspace | Kazuhiro Fukui et.al. | 2409.08563 | null |
2024-09-13 | CSS: Overcoming Pose and Scene Challenges in Crowd-Sourced 3D Gaussian Splatting | Runze Chen et.al. | 2409.08562 | null |
2024-09-13 | Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images | Hualiang Wang et.al. | 2409.08492 | null |
2024-09-13 | An Intent Modeling and Inference Framework for Autonomous and Remotely Piloted Aerial Systems | Kesav Kaza et.al. | 2409.08472 | null |
2024-09-13 | CF-PRNet: Coarse-to-Fine Prototype Refining Network for Point Cloud Completion and Reconstruction | Zhi Chen et.al. | 2409.08443 | link |
2024-09-12 | Continual Learning in 3D Point Clouds: Employing Spectral Techniques for Exemplar Selection | Hossein Resani et.al. | 2409.08388 | null |
2024-09-12 | 3D Radiation-Hydrodynamical Simulations of Shadows on Transition Disks | Shangjia Zhang et.al. | 2409.08373 | null |
2024-09-12 | Digital Volumetric Biopsy Cores Improve Gleason Grading of Prostate Cancer Using Deep Learning | Ekaterina Redekop et.al. | 2409.08331 | null |
2024-09-12 | MedSegMamba: 3D CNN-Mamba Hybrid Architecture for Brain Segmentation | Aaron Cao et.al. | 2409.08307 | link |
2024-09-10 | Gaussian Differentially Private Human Faces Under a Face Radial Curve Representation | Carlos Soto et.al. | 2409.08301 | null |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2024-09-12 | Hand-Object Interaction Pretraining from Videos | Himanshu Gaurav Singh et.al. | 2409.08273 | null |
2024-09-12 | DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer | Runjia Li et.al. | 2409.08271 | null |
2024-09-12 | FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally | Qiuhong Shen et.al. | 2409.08270 | link |
2024-09-12 | LT3SD: Latent Trees for 3D Scene Diffusion | Quan Meng et.al. | 2409.08215 | null |
2024-09-12 | VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis | Hao Chen et.al. | 2409.08207 | null |
2024-09-12 | Gaussian Garments: Reconstructing Simulation-Ready Clothing with Photorealistic Appearance from Multi-View Video | Boxiang Rong et.al. | 2409.08189 | null |
2024-09-12 | Collaborating for Success: Optimizing System Efficiency and Resilience Under Agile Industrial Settings | Sunny Katyara et.al. | 2409.08166 | null |
2024-09-12 | The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | André F. R. Guarda et.al. | 2409.08130 | null |
2024-09-12 | Magnetic Field Evolution of the Solar Active Region 13664 | Robert Jarolim et.al. | 2409.08124 | link |
2024-09-12 | Bayesian Self-Training for Semi-Supervised 3D Segmentation | Ozan Unal et.al. | 2409.08102 | null |
2024-09-12 | Expansive Supervision for Neural Radiance Field | Weixiang Zhang et.al. | 2409.08056 | null |
2024-09-12 | Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis | Qian Chen et.al. | 2409.08042 | link |
2024-09-12 | A three-dimensional force estimation method for the cable-driven soft robot based on monocular images | Xiaohan Zhu et.al. | 2409.08033 | null |
2024-09-12 | SPARK: Self-supervised Personalized Real-time Monocular Face Capture | Kelian Baert et.al. | 2409.07984 | null |
2024-09-12 | Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction | Yuan Wu et.al. | 2409.07972 | link |
2024-09-12 | ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE | Sichun Wu et.al. | 2409.07966 | link |
2024-09-12 | Evidence for field induced quantum spin liquid behavior in a spin-1/2 honeycomb magnet | Gaoting Lin et.al. | 2409.07959 | null |
2024-09-12 | DNN-based workflow for attenuating seismic interference noise and its application to marine towed streamer data from the Northern Viking Graben | Jing Sun et.al. | 2409.07890 | null |
2024-09-12 | UNIT: Unsupervised Online Instance Segmentation through Time | Corentin Sautier et.al. | 2409.07887 | null |
2024-09-12 | SwinGS: Sliding Window Gaussian Splatting for Volumetric Video Streaming with Arbitrary Length | Bangya Liu et.al. | 2409.07759 | null |
2024-09-12 | Learning Brain Tumor Representation in 3D High-Resolution MR Images via Interpretable State Space Models | Qingqiao Hu et.al. | 2409.07746 | link |
2024-09-12 | First-principles study of electronic and magnetic properties of Fe atoms on Cu2N/Cu(100) | Jiale Chen et.al. | 2409.07739 | null |
2024-09-12 | Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy | Bojian Li et.al. | 2409.07723 | null |
2024-09-12 | Ultrafast Laser-Fabricated Fluoride Glass Waveguides with Exceptionally High Refractive Index Change for Mid-Infrared Integrated Optics | T Toney Fernandez et.al. | 2409.07674 | null |
2024-09-11 | Rapid Assessment of Stable Crystal Structures in Single Phase High Entropy Alloys Via Graph Neural Network Based Surrogate Modelling | Nicholas Beaver et.al. | 2409.07664 | link |
2024-09-11 | Sensitivity of Multislice Electron Ptychography to Point Defects: A Case Study in SiC | Aaditya Bhat et.al. | 2409.07663 | null |
2024-09-11 | In-situ tunable interaction with an invertible sign between a fluxonium and a post cavity | Desislava G. Atanasova et.al. | 2409.07612 | null |
2024-09-11 | On-chip twisted hollow-core light cages: enhancing planar photonics with 3D nanoprinting | Johannes Bürger et.al. | 2409.07602 | null |
2024-09-11 | DS-ViT: Dual-Stream Vision Transformer for Cross-Task Distillation in Alzheimer’s Early Diagnosis | Ke Chen et.al. | 2409.07584 | null |
2024-09-11 | FaVoR: Features via Voxel Rendering for Camera Relocalization | Vincenzo Polizzi et.al. | 2409.07571 | link |
2024-09-11 | TabMixer: Noninvasive Estimation of the Mean Pulmonary Artery Pressure via Imaging and Tabular Data Mixing | Michal K. Grzeszczyk et.al. | 2409.07564 | link |
2024-09-02 | Dynamical phase transitions in the non-reciprocal Ising model | Yael Avni et.al. | 2409.07481 | null |
2024-09-11 | Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs | Sadra Safadoust et.al. | 2409.07456 | null |
2024-09-11 | DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Haibo Yang et.al. | 2409.07454 | null |
2024-09-11 | Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models | Haibo Yang et.al. | 2409.07452 | link |
2024-09-11 | StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos | Sijie Zhao et.al. | 2409.07447 | null |
2024-09-11 | Ionization energy: sd transfer error and Perdew-Zunger self-interaction correction energy penalty in 3d atoms | Rohan Maniar et.al. | 2409.07438 | null |
2024-09-11 | Some effects of limited wall-sensor availability on flow estimation with 3D-GANs | Antonio Cuéllar et.al. | 2409.07348 | null |
2024-09-11 | Current Symmetry Group Equivariant Convolution Frameworks for Representation Learning | Ramzan Basheer et.al. | 2409.07327 | null |
2024-09-11 | Three-Dimensional, Multimodal Synchrotron Data for Machine Learning Applications | Calum Green et.al. | 2409.07322 | link |
2024-09-11 | Detectability Simulations of a NIR Surface Biosignature on Proxima Centauri b with Future Space Observatories | Connor O. Metz et.al. | 2409.07289 | null |
2024-09-11 | EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion | Jian Zhang et.al. | 2409.07255 | link |
2024-09-11 | Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks | Ruihan Xu et.al. | 2409.07245 | null |
2024-09-12 | 3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents | Yingjie Zhou et.al. | 2409.07236 | link |
2024-09-11 | ThermalGaussian: Thermal 3D Gaussian Splatting | Rongfeng Lu et.al. | 2409.07200 | link |
2024-09-11 | A Perspective on AI-Guided Molecular Simulations in VR: Exploring Strategies for Imitation Learning in Hyperdimensional Molecular Systems | Mohamed Dhouioui et.al. | 2409.07189 | null |
2024-09-11 | Phy124: Fast Physics-Driven 4D Content Generation from a Single Image | Jiajing Lin et.al. | 2409.07179 | null |
2024-09-11 | Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models | Jiahang Cao et.al. | 2409.07163 | null |
2024-09-11 | Dual channel CW nnU-Net for 3D PET-CT Lesion Segmentation in 2024 autoPET III Challenge | Ching-Wei Wang et.al. | 2409.07144 | link |
2024-09-11 | The radiation condition for Helmholtz equations above locally perturbed periodic surfaces | Ruming Zhang et.al. | 2409.07141 | null |
2024-09-11 | An Improved Height Difference Based Model of Height Profile for Drop-on-Demand 3D Printing With UV Curable Ink | Yumeng Wu et.al. | 2409.07021 | null |
2024-09-11 | Symmetries of Toda type 3D lattices | I. T. Habibullin et.al. | 2409.07017 | null |
2024-09-11 | Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation | Li Yu et.al. | 2409.06956 | null |
2024-09-11 | FSMDet: Vision-guided feature diffusion for fully sparse 3D detector | Tianran Liu et.al. | 2409.06945 | null |
2024-09-20 | Existence and Regularity Results for a Nonlinear Fluid-Structure Interaction Problem with Three-Dimensional Structural Displacement | Sunčica Čanić et.al. | 2409.06939 | null |
2024-09-11 | Cascade of strongly correlated quantum states in a partially filled kagome flat band | Caiyun Chen et.al. | 2409.06933 | null |
2024-09-11 | Rethinking Directional Parameterization in Neural Implicit Surface Reconstruction | Zijie Jiang et.al. | 2409.06923 | null |
2024-09-10 | Attosecond Inner-Shell Lasing at Angstrom Wavelengths | Thomas M. Linker et.al. | 2409.06914 | null |
2024-09-10 | Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds | Mu Cai et.al. | 2409.06827 | link |
2024-09-10 | NLO in the large charge sector of the critical $O(N)$ model at large $N$ | Nicola Andrea Dondi et.al. | 2409.06781 | null |
2024-09-10 | Geometric Effects in Large Scale Intracellular Flows | Olenka Jain et.al. | 2409.06763 | null |
2024-09-10 | ProteinBench: A Holistic Evaluation of Protein Foundation Models | Fei Ye et.al. | 2409.06744 | null |
2024-09-10 | GeoCalib: Learning Single-image Calibration with Geometric Optimization | Alexander Veicht et.al. | 2409.06704 | link |
2024-09-10 | LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation | Archana Swaminathan et.al. | 2409.06703 | null |
2024-09-10 | Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving | Kairui Ding et.al. | 2409.06702 | null |
2024-09-10 | Technical Report of Mobile Manipulator Robot for Industrial Environments | Erfan Amoozad Khalili et.al. | 2409.06693 | null |
2024-09-10 | GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction | Junyi Chen et.al. | 2409.06685 | null |
2024-09-10 | Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data | Ali Tourani et.al. | 2409.06625 | null |
2024-09-10 | MVGaussian: High-Fidelity text-to-3D Content Generation with Multi-View Guidance and Surface Densification | Phu Pham et.al. | 2409.06620 | null |
2024-09-10 | Interactive 3D Segmentation for Primary Gross Tumor Volume in Oropharyngeal Cancer | Mikko Saukkoriipi et.al. | 2409.06605 | null |
2024-09-10 | Semi-Supervised 3D Object Detection with Chanel Augmentation using Transformation Equivariance | Minju Kang et.al. | 2409.06583 | null |
2024-09-10 | PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation | Ginger Delmas et.al. | 2409.06535 | null |
2024-09-10 | Unsupervised stratification of patients with myocardial infarction based on imaging and in-silico biomarkers | Dolors Serra et.al. | 2409.06526 | null |
2024-09-10 | Neural Laplacian Operator for 3D Point Clouds | Bo Pang et.al. | 2409.06506 | link |
2024-09-10 | Sources of Uncertainty in 3D Scene Reconstruction | Marcus Klasson et.al. | 2409.06407 | link |
2024-09-10 | Soft Acoustic Curvature Sensor: Design and Development | Mohammad Sheikh Sofla et.al. | 2409.06395 | null |
2024-09-10 | Fiber-level Woven Fabric Capture from a Single Photo | Zixuan Li et.al. | 2409.06368 | null |
2024-09-10 | G3PT: Unleash the power of Autoregressive Modeling in 3D Generation via Cross-scale Querying Transformer | Jinzhi Zhang et.al. | 2409.06322 | null |
2024-09-10 | PharmacoMatch: Efficient 3D Pharmacophore Screening through Neural Subgraph Matching | Daniel Rose et.al. | 2409.06316 | null |
2024-09-10 | A Latent Implicit 3D Shape Model for Multiple Levels of Detail | Benoit Guillard et.al. | 2409.06231 | null |
2024-09-10 | Design and Implementation of Online Live Streaming System Using A 3D Engine | Aizierjiang Aiersilan et.al. | 2409.06207 | null |
2024-09-10 | RealisDance: Equip controllable character animation with realistic hands | Jingkai Zhou et.al. | 2409.06202 | link |
2024-09-10 | VQCrystal: Leveraging Vector Quantization for Discovery of Stable Crystal Structures | ZiJie Qiu et.al. | 2409.06191 | null |
2024-09-10 | Loss Distillation via Gradient Matching for Point Cloud Completion with Weighted Chamfer Distance | Fangzhou Lin et.al. | 2409.06171 | link |
2024-09-10 | DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement | Qimin Chen et.al. | 2409.06129 | null |
2024-09-10 | Robust Agility via Learned Zero Dynamics Policies | Noel Csomay-Shanklin et.al. | 2409.06125 | null |
2024-09-09 | LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo | Wei Zhi Tang et.al. | 2409.06104 | link |
2024-09-09 | Reduced-order modeling for complex 3D seismic wave propagation | John M. Rekoske et.al. | 2409.06102 | null |
2024-09-15 | MemoVis: A GenAI-Powered Tool for Creating Companion Reference Images for 3D Design Feedback | Chen Chen et.al. | 2409.06082 | null |
2024-09-09 | Logarithmic delocalization of low temperature 3D Ising and Potts interfaces above a hard floor | Joseph Chen et.al. | 2409.06079 | null |
2024-09-09 | A robust fourth-order finite-difference discretization for the strongly anisotropic transport equation in magnetized plasmas | L. Chacon et.al. | 2409.06070 | null |
2024-09-09 | Online 3D reconstruction and dense tracking in endoscopic videos | Michel Hayoz et.al. | 2409.06037 | link |
2024-09-09 | NESI: Shape Representation via Neural Explicit Surface Intersection | Congyi Zhang et.al. | 2409.06030 | null |
2024-09-09 | Voronoi-based Multi-Robot Formations for 3D Source Seeking via Cooperative Gradient Estimation | Lara Briñón-Arranz et.al. | 2409.05995 | null |
2024-09-09 | Transmon qubit modeling and characterization for Dark Matter search | R. Moretti et.al. | 2409.05988 | null |
2024-09-09 | RUBIES: a complete census of the bright and red distant Universe with JWST/NIRSpec | Anna de Graaff et.al. | 2409.05948 | null |
2024-09-09 | Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering | Benjamin Attal et.al. | 2409.05867 | null |
2024-09-10 | Evaluating Multiview Object Consistency in Humans and Image Models | Tyler Bonnen et.al. | 2409.05862 | link |
2024-09-10 | Finite-size topological phases from semimetals | Adipta Pal et.al. | 2409.05842 | null |
2024-09-09 | Faraday shield dissipation in the drivers of SPIDER based on electromagnetic 3D calculations | D. López-Bruna et.al. | 2409.05821 | null |
2024-09-09 | GASP: Gaussian Splatting for Physic-Based Simulations | Piotr Borycki et.al. | 2409.05819 | link |
2024-09-09 | Robust Estimation of Structural Orientation Parameters and 2D/3D Local Anisotropic Tikhonov Regularization | Ali Gholami et.al. | 2409.05754 | null |
2024-09-09 | Exclusive vector-quarkonium photoproduction at NLO in alpha_s in collinear factorisation with evolution of the generalised parton distributions and high-energy resummation | C. A. Flett et.al. | 2409.05738 | null |
2024-09-09 | LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow | Hongyu Wen et.al. | 2409.05688 | null |
2024-09-09 | Constraints, Conserved Charges and Extended BRST Algebra for a 3D Field-Theoretic Example for Hodge Theory | Bhagya. R et.al. | 2409.05684 | null |
2024-09-09 | ForestFlow: cosmological emulation of Lyman- $α$ forest clustering from linear to nonlinear scales | J. Chaves-Montero et.al. | 2409.05682 | link |
2024-09-09 | 3D-SAR Tomography and Machine Learning for High-Resolution Tree Height Estimation | Grace Colverd et.al. | 2409.05636 | null |
2024-09-09 | Nature vs Nurture: Three Dimensional MHD Simulations of Misaligned Embedded Circum-Single Disks within an AGN Disk | Bhupendra Mishra et.al. | 2409.05614 | null |
2024-09-09 | Latent 3D Brain MRI Counterfactual | Wei Peng et.al. | 2409.05585 | null |
2024-09-09 | LEROjD: Lidar Extended Radar-Only Object Detection | Patrick Palmer et.al. | 2409.05564 | link |
2024-09-09 | Weighted Squared Volume Minimization (WSVM) for Generating Uniform Tetrahedral Meshes | Kaixin Yu et.al. | 2409.05525 | null |
2024-09-09 | Mesoscopic light transport in nonlinear disordered media | Alfonso Nardi et.al. | 2409.05488 | null |
2024-09-12 | DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation | Wei Wu et.al. | 2409.05463 | null |
2024-09-09 | Holonomy: A Virtual Reality Exploration of Hyperbolic Geometry | Martin Skrodzki et.al. | 2409.05460 | null |
2024-09-11 | Distribution Discrepancy and Feature Heterogeneity for Active 3D Object Detection | Huang-Yu Chen et.al. | 2409.05425 | link |
2024-09-09 | DWA-3D: A Reactive Planner for Robust and Efficient Autonomous UAV Navigation | Jorge Bes et.al. | 2409.05421 | null |
2024-09-09 | KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction | Davide Di Nucci et.al. | 2409.05407 | null |
2024-09-09 | Leveraging Computation of Expectation Models for Commonsense Affordance Estimation on 3D Scene Graphs | Mario Alberto Valdes Saucedo et.al. | 2409.05392 | null |
2024-09-09 | Prim2Room: Layout-Controllable Room Mesh Generation from Primitives | Chengzeng Feng et.al. | 2409.05380 | null |
2024-09-09 | Memoryless Multimodal Anomaly Detection via Student-Teacher Network and Signed Distance Learning | Zhongbin Sun et.al. | 2409.05378 | null |
2024-09-09 | CAS-Canglong: A skillful 3D Transformer model for sub-seasonal to seasonal global sea surface temperature prediction | Longhao Wang et.al. | 2409.05369 | link |
2024-09-09 | Towards Determining Mechanical Properties of Brain-Skull Interface Under Tension and Compression | Sajjad Arzemanzadeh et.al. | 2409.05365 | null |
2024-09-12 | GOPT: Generalizable Online 3D Bin Packing via Transformer-based Deep Reinforcement Learning | Heng Xiong et.al. | 2409.05344 | link |
2024-09-09 | Lagrangian Hashing for Compressed Neural Field Representations | Shrisudhan Govindarajan et.al. | 2409.05334 | null |
2024-09-09 | Integrating Novel Stellarator Single-Stage Optimization Algorithms to Design the Columbia Stellarator Experiment | A. Baillod et.al. | 2409.05261 | null |
2024-09-10 | 3D hybrid fluid-particle jet simulations and the importance of synchrotron radiative losses | Joana A. Kramer et.al. | 2409.05256 | null |
2024-09-09 | MRStyle: A Unified Framework for Color Style Transfer with Multi-Modality Reference | Jiancheng Huang et.al. | 2409.05250 | null |
2024-09-08 | CD-NGP: A Fast Scalable Continual Representation for Dynamic Scenes | Zhenhuan Liu et.al. | 2409.05166 | null |
2024-09-08 | Single-distance nano-holotomography with coded apertures | Viktor Nikitin et.al. | 2409.05163 | null |
2024-09-08 | Image color consistency in datasets: the Smooth-TPS3D method | Ismael Benito-Altamirano et.al. | 2409.05159 | null |
2024-09-08 | Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation | Haichao Zhu et.al. | 2409.05151 | null |
2024-09-08 | Inverse cascade in zonal flows | Siddhant Mishra et.al. | 2409.05127 | null |
2024-09-08 | On the Sobolev stability threshold for 3D Navier-Stokes equations with rotation near the Couette flow | Wenting Huang et.al. | 2409.05104 | null |
2024-09-19 | DreamMapping: High-Fidelity Text-to-3D Generation via Variational Distribution Mapping | Zeyu Cai et.al. | 2409.05099 | null |
2024-09-19 | Ion Trapping with a Laser-written 3D Miniaturized Monolithic Linear Paul Trap for Microcavity Integration | Soon Teh et.al. | 2409.05075 | null |
2024-09-08 | Unsupervised Multimodal 3D Medical Image Registration with Multilevel Correlation Balanced Optimization | Jiazheng Wang et.al. | 2409.05040 | link |
2024-09-08 | Room-temperature superconductivity in 1D | Carlo A. Trugenberger et.al. | 2409.05031 | null |
2024-09-08 | Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception | Rongsong Li et.al. | 2409.04980 | link |
2024-09-08 | RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network | Zhiwei Lin et.al. | 2409.04979 | null |
2024-09-08 | GS-PT: Exploiting 3D Gaussian Splatting for Comprehensive Point Cloud Understanding via Self-supervised Learning | Keyi Liu et.al. | 2409.04963 | null |
2024-09-14 | A foundation model enpowered by a multi-modal prompt engine for universal seismic geobody interpretation across surveys | Hang Gao et.al. | 2409.04962 | null |
2024-09-10 | Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios | Zhiqiang Chen et.al. | 2409.04961 | link |
2024-09-07 | Light-Activated Motion, Geometry- and Confinement-Induced Optical Effects of 2D Platelets in a Nematic Liquid Crystal | Antonio Tavera-Vázquez et.al. | 2409.04912 | null |
2024-09-07 | Dielectric and optical markers originated from quantum geometry | Wei Chen et.al. | 2409.04893 | null |
2024-09-07 | A Quantitative Approach for Evaluating Disease Focus and Interpretability of Deep Learning Models for Alzheimer’s Disease Classification | Thomas Yu Chow Tam et.al. | 2409.04888 | null |
2024-09-07 | IPN-V: The Interplanetary Network Visualizer | Alice Le Bihan et.al. | 2409.04857 | null |
2024-09-07 | AdaptiveFusion: Adaptive Multi-Modal Multi-View Fusion for 3D Human Body Reconstruction | Anjun Chen et.al. | 2409.04851 | null |
2024-09-07 | Medical Image Segmentation via Single-Source Domain Generalization with Random Amplitude Spectrum Synthesis | Qiang Qiao et.al. | 2409.04768 | link |
2024-09-11 | Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras | Zimu Liao et.al. | 2409.04751 | link |
2024-09-07 | Modeling Drivers’ Risk Perception via Attention to Improve Driving Assistance | Abhijat Biswas et.al. | 2409.04738 | null |
2024-09-07 | A Hybrid Discrete Exterior Calculus Discretization and Fourier Transform of the Incompressible Navier-Stokes Equations in 3D | Abdullah Abukhwejah et.al. | 2409.04731 | null |
2024-09-06 | Real-time CBCT Imaging and Motion Tracking via a Single Arbitrarily-angled X-ray Projection by a Joint Dynamic Reconstruction and Motion Estimation (DREME) Framework (DREME) Framework | Hua-Chieh Shao et.al. | 2409.04614 | null |
2024-09-06 | Colloidoscope: Detecting Dense Colloids in 3d with Deep Learning | Abdelwahab Kawafi et.al. | 2409.04603 | link |
2024-09-06 | Multi-scale Feature Fusion with Point Pyramid for 3D Object Detection | Weihao Lu et.al. | 2409.04601 | null |
2024-09-06 | NeCA: 3D Coronary Artery Tree Reconstruction from Two 2D Projections by Neural Implicit Representation | Yiying Wang et.al. | 2409.04596 | link |
2024-09-06 | Developing a Modular Toolkit for Rapid Prototyping of Wearable Vibrotactile Haptic Harness | Sandeep Kollannur et.al. | 2409.04579 | null |
2024-09-06 | Multi-Modal Diffusion for Hand-Object Grasp Generation | Jinkun Cao et.al. | 2409.04560 | null |
2024-09-06 | Solve paint color effect prediction problem in trajectory optimization of spray painting robot using artificial neural network inspired by the Kubelka Munk model | Hexiang Wang et.al. | 2409.04558 | null |
2024-09-06 | The Intrinsic Distribution of Lyman- $α$ Halos | John Pharo et.al. | 2409.04537 | null |
2024-09-06 | Nonperturbative Nonlinear Transport in a Floquet-Weyl Semimetal | Matthew W. Day et.al. | 2409.04531 | null |
2024-09-06 | 3D Data Long-Term Preservation in Cultural Heritage | Nicola Amico et.al. | 2409.04507 | null |
2024-09-06 | SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields | Yuze Wang et.al. | 2409.04482 | null |
2024-09-04 | Comparative Analysis of Gradient-Based Optimization Techniques Using Multidimensional Surface 3D Visualizations and Initial Point Sensitivity | Saeed Asadi et.al. | 2409.04470 | null |
2024-09-06 | Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation | Björn Michele et.al. | 2409.04409 | link |
2024-09-06 | Casper DPM: Cascaded Perceptual Dynamic Projection Mapping onto Hands | Yotam Erel et.al. | 2409.04397 | null |
2024-09-06 | Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences | Rui Yu et.al. | 2409.04390 | null |
2024-09-06 | Equivariant Machine Learning Decoder for 3D Toric Codes | Oliver Weissl et.al. | 2409.04300 | link |
2024-09-06 | Hybrid Cost Volume for Memory-Efficient Optical Flow | Yang Zhao et.al. | 2409.04243 | link |
2024-09-06 | Efficient Analysis and Visualization of High-Resolution Computed Tomography Data for the Exploration of Enclosed Cuneiform Tablets | Stephan Olbrich et.al. | 2409.04236 | null |
2024-09-06 | UniDet3D: Multi-dataset Indoor 3D Object Detection | Maksim Kolodiazhnyi et.al. | 2409.04234 | link |
2024-09-06 | A Method of Fundamental Solutions for Large-Scale 3D Elastance and Mobility Problems | Anna Broms et.al. | 2409.04215 | null |
2024-09-06 | Topological Quantum Materials with Kagome Lattice | Qi Wang et.al. | 2409.04211 | null |
2024-09-06 | GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers | Lorenza Prospero et.al. | 2409.04196 | link |
2024-09-06 | Reprojection Errors as Prompts for Efficient Scene Coordinate Regression | Ting-Ru Liu et.al. | 2409.04178 | null |
2024-09-06 | Feature Compression for Cloud-Edge Multimodal 3D Object Detection | Chongzhen Tian et.al. | 2409.04123 | null |
2024-09-06 | A New Channel Model for OAM Wireless Communication at 5.8 and 28 GHz | Runyu Lyu et.al. | 2409.04113 | null |
2024-09-06 | Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics | Woojin Cho et.al. | 2409.04033 | null |
2024-09-06 | BFA-YOLO: Balanced multiscale object detection network for multi-view building facade attachments detection | Yangguang Chen et.al. | 2409.04025 | null |
2024-09-06 | 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors | Yujun Huang et.al. | 2409.04013 | link |
2024-09-06 | DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes | Jianbiao Mei et.al. | 2409.04003 | link |
2024-09-05 | Evidence of Charge Multiplication in Thin $25 \mathrm{μm} \times 25 \mathrm{μm}$ Pitch 3D Silicon Sensors | Andrew Gentry et.al. | 2409.03909 | null |
2024-09-12 | Microscopic entropy of static black holes in 3d Lovelock gravities | Gokhan Alkac et.al. | 2409.03865 | null |
2024-09-05 | RUBIES Reveals a Massive Quiescent Galaxy at z=7.3 | Andrea Weibel et.al. | 2409.03829 | null |
2024-09-05 | A 3D view of multiple populations kinematics in Galactic globular clusters | E. Dalessandro et.al. | 2409.03827 | null |
2024-09-05 | Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Yunze Man et.al. | 2409.03757 | link |
2024-09-05 | Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation | Slava Elizarov et.al. | 2409.03718 | null |
2024-09-05 | View-Invariant Policy Learning via Zero-Shot Novel View Synthesis | Stephen Tian et.al. | 2409.03685 | null |
2024-09-05 | 1 Modular Parallel Manipulator for Long-Term Soft Robotic Data Collection | Kiyn Chin et.al. | 2409.03614 | null |
2024-09-05 | TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces | Bernardo Biesseck et.al. | 2409.03600 | link |
2024-09-05 | Charged critical behavior and nonperturbative continuum limit of three-dimensional lattice SU( $N_c$ ) gauge Higgs models | Claudio Bonati et.al. | 2409.03595 | null |
2024-09-05 | Meshless quadrature formulas arising from numerical differentiation | Oleg Davydov et.al. | 2409.03567 | null |
2024-09-05 | Surface Magnetism in Fe $_3$GeTe$_2$ Crystals | T. A. Tyson et.al. | 2409.03565 | null |
2024-09-05 | Exploring the magnetic and thermal evolution of a coronal jet | Sushree S Nayak et.al. | 2409.03484 | null |
2024-09-05 | Physical Modelling of Piano Sound | Haifan Xie et.al. | 2409.03481 | null |
2024-09-05 | LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors | Hanyang Yu et.al. | 2409.03456 | null |
2024-09-05 | Ageing and dynamics of the tailed radio galaxies in Abell 2142 | L. Bruno et.al. | 2409.03453 | null |
2024-09-05 | Automatic occlusion removal from 3D maps for maritime situational awareness | Felix Sattler et.al. | 2409.03451 | null |
2024-09-05 | Weight Conditioning for Smooth Optimization of Neural Networks | Hemanth Saratchandran et.al. | 2409.03424 | null |
2024-09-05 | Retrieving stellar parameters and dynamics of AGB stars with Gaia parallax measurements and CO5BOLD RHD simulations | E. Béguin et.al. | 2409.03422 | null |
2024-09-05 | F3T: A soft tactile unit with 3D force and temperature mathematical decoupling ability for robots | Xiong Yang et.al. | 2409.03421 | null |
2024-09-08 | Estimating Indoor Scene Depth Maps from Ultrasonic Echoes | Junpei Honma et.al. | 2409.03336 | null |
2024-09-05 | Semantic Communication for Efficient Point Cloud Transmission | Shangzhuo Xie et.al. | 2409.03319 | null |
2024-09-05 | OccLLaMA: An Occupancy-Language-Action Generative World Model for Autonomous Driving | Julong Wei et.al. | 2409.03272 | null |
2024-09-05 | Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints | Keisuke Toida et.al. | 2409.03252 | null |
2024-09-05 | A priori and a posteriori error bounds for the fully mixed FEM formulation of poroelasticity with stress-dependent permeability | Arbaz Khan et.al. | 2409.03246 | null |
2024-09-05 | Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction | Shen Chen et.al. | 2409.03213 | null |
2024-08-31 | Mastoidectomy Multi-View Synthesis from a Single Microscopy Image | Yike Zhang et.al. | 2409.03190 | null |
2024-09-05 | Large Étendue 3D Holographic Display with Content-adpative Dynamic Fourier Modulation | Brian Chao et.al. | 2409.03143 | null |
2024-09-04 | Up, Up, and Away: Winds and Dynamical Structure as a Function of Altitude in the Ultra-Hot Jupiter WASP-76b | Aurora Y. Kesseli et.al. | 2409.03124 | null |
2024-09-04 | Real-time operator evolution in two and three dimensions via sparse Pauli dynamics | Tomislav Begušić et.al. | 2409.03097 | null |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering | Shuang Song et.al. | 2409.03032 | link |
2024-09-04 | Machine learning of phases and structures for model systems in physics | Djenabou Bayo et.al. | 2409.03023 | null |
2024-09-04 | Boundless: Generating Photorealistic Synthetic Data for Object Detection in Urban Streetscapes | Mehmet Kerem Turkcan et.al. | 2409.03022 | link |
2024-09-04 | Disruption of exo-asteroids around white dwarfs and the release of dust particles in debris rings in co-orbital motion | Kyriaki I. Antoniadou et.al. | 2409.03002 | null |
2024-09-04 | X-ray polarisation in AGN circumnuclear media. Polarisation framework and 2D torus models | Bert Vander Meulen et.al. | 2409.02986 | null |
2024-09-04 | RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version) | Yao Mu et.al. | 2409.02920 | null |
2024-09-04 | Toward 2D Dynamo Models Calibrated by Global 3D Relativistic Accretion Disk Simulations | Matthew D. Duez et.al. | 2409.02899 | null |
2024-09-04 | RISTRETTO: reflected-light exoplanet spectroscopy at the diffraction limit of the VLT | Christophe Lovis et.al. | 2409.02875 | null |
2024-09-04 | Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu et.al. | 2409.02851 | link |
2024-09-04 | Diffraction Aided Wireless Positioning | Gaurav Duggal et.al. | 2409.02832 | null |
2024-09-04 | Automatic facial axes standardization of 3D fetal ultrasound images | Antonia Alomar et.al. | 2409.02826 | null |
2024-09-04 | Physics Perspectives with the ePIC Far-Forward and Far-Backward detectors | Michael Pitt et.al. | 2409.02811 | null |
2024-09-04 | Orientational properties of the HGO system in a slit geometry in two-dimensional and three-dimensional case from Monte Carlo simulations and Onsager theory revisited | Agnieszka Chrzanowska et.al. | 2409.02796 | null |
2024-09-04 | Complete and Efficient Covariants for 3D Point Configurations with Application to Learning Molecular Quantum Properties | Hartmut Maennel et.al. | 2409.02730 | null |
2024-09-08 | GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling | Huawei Sun et.al. | 2409.02720 | link |
2024-09-04 | Horseshoes and spiral waves: capturing the 3D flow induced by a low-mass planet analytically | Joshua J. Brown et.al. | 2409.02687 | null |
2024-09-04 | Compression of high-power laser pulse leads to increase of electron acceleration efficiency | O. E. Vais et.al. | 2409.02671 | null |
2024-09-04 | Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects | Kyungmin Jo et.al. | 2409.02653 | null |
2024-09-04 | A Medical Multimodal Large Language Model for Pediatric Pneumonia | Weiwei Tian et.al. | 2409.02608 | null |
2024-09-04 | Real-time design of architectural structures with differentiable simulators and neural networks | Rafael Pastrana et.al. | 2409.02606 | null |
2024-09-04 | SurgTrack: CAD-Free 3D Tracking of Real-world Surgical Instruments | Wenwu Guo et.al. | 2409.02598 | link |
2024-09-04 | Object Gaussian for Monocular 6D Pose Estimation from Sparse Views | Luqing Luo et.al. | 2409.02581 | null |
2024-09-04 | Learnable Wireless Digital Twins: Reconstructing Electromagnetic Field with Neural Representations | Shuaifeng Jiang et.al. | 2409.02564 | null |
2024-09-04 | Vision-Language Navigation with Continual Learning | Zhiyuan Li et.al. | 2409.02561 | null |
2024-09-04 | Plasticity-induced crack closure identification during fatigue crack growth in AA2024-T3 by using high-resolution digital image correlation | Florian Paysan et.al. | 2409.02560 | null |
2024-09-04 | Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments | Zhiyuan Li et.al. | 2409.02522 | null |
2024-09-04 | Energy and helicity evolution in a flux emergence simulation | K. Moraitis et.al. | 2409.02445 | null |
2024-09-04 | Accelerating Large Language Model Training with Hybrid GPU-based Compression | Lang Xu et.al. | 2409.02423 | null |
2024-09-04 | MOSMOS: Multi-organ segmentation facilitated by medical report supervision | Weiwei Tian et.al. | 2409.02418 | null |
2024-09-04 | Multi-modal Situated Reasoning in 3D Scenes | Xiongkun Linghu et.al. | 2409.02389 | null |
2024-09-04 | GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving | Huasong Han et.al. | 2409.02382 | null |
2024-09-04 | Coral Model Generation from Single Images for Virtual Reality Applications | Jie Fu et.al. | 2409.02376 | null |
2024-09-03 | How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model? | Saeid Asgari Taghanaki et.al. | 2409.02253 | link |
2024-09-03 | Metal line emission around z<1 galaxies | Rajeshwari Dutta et.al. | 2409.02182 | null |
2024-09-01 | Detecting Homeomorphic 3-manifolds via Graph Neural Networks | Craig Lawrie et.al. | 2409.02126 | link |
2024-09-05 | Deep Neural Implicit Representation of Accessibility for Multi-Axis Manufacturing | George P. Harabin et.al. | 2409.02115 | null |
2024-09-03 | DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction | Jenny Seidenschwarz et.al. | 2409.02104 | null |
2024-09-03 | Full-field Brillouin microscopy based on an imaging Fourier transform spectrometer | Carlo Bevilacqua et.al. | 2409.02092 | null |
2024-09-03 | Storms and convection on Uranus and Neptune: impact of methane abundance revealed by a 3D cloud-resolving model | Noé Clément et.al. | 2409.02091 | null |
2024-09-03 | GraspSplats: Efficient Manipulation with 3D Feature Splatting | Mazeyu Ji et.al. | 2409.02084 | null |
2024-09-03 | Explicit Differentiable Slicing and Global Deformation for Cardiac Mesh Reconstruction | Yihao Luo et.al. | 2409.02070 | link |
2024-09-03 | ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis | Wangbo Yu et.al. | 2409.02048 | null |
2024-09-03 | PMT-MAE: Dual-Branch Self-Supervised Learning with Distillation for Efficient Point Cloud Classification | Qiang Zheng et.al. | 2409.02007 | null |
2024-09-03 | MetaFood3D: Large 3D Food Object Dataset with Nutrition Values | Yuhao Chen et.al. | 2409.01966 | null |
2024-09-08 | Exploiting Six-Dimensional Movable Antenna (6DMA) for Wireless Sensing | Xiaodan Shao et.al. | 2409.01965 | null |
2024-09-03 | 3D-LEX v1.0: 3D Lexicons for American Sign Language and Sign Language of the Netherlands | Oline Ranum et.al. | 2409.01901 | link |
2024-09-03 | SPiKE: 3D Human Pose from Point Cloud Sequences | Irene Ballester et.al. | 2409.01879 | link |
2024-09-03 | Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric | Tingchen Ma et.al. | 2409.01856 | null |
2024-09-03 | GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection | Jinqing Zhang et.al. | 2409.01816 | link |
2024-09-03 | Optimal SSB Beam Planning and UAV Cell Selection for 5G Connectivity on Aerial Highways | Matteo Bernabe et.al. | 2409.01812 | null |
2024-09-03 | EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular Video | Zhen Zhou et.al. | 2409.01807 | link |
2024-09-03 | Perverse-Hodge octahedron | Mirko Mauri et.al. | 2409.01800 | null |
2024-09-03 | Frugal RIS-aided 3D Localization with CFO under LoS and NLoS Conditions | Yasaman Ettefagh et.al. | 2409.01797 | null |
2024-09-03 | Diffuse interstellar bands as dust indicators: the contribution from 3D maps | R. Lallement et.al. | 2409.01777 | null |
2024-09-03 | Mapping Safe Zones for Co-located Human-UAV Interaction | Ayodeji O. Abioye et.al. | 2409.01768 | null |
2024-09-03 | PRoGS: Progressive Rendering of Gaussian Splats | Brent Zoomers et.al. | 2409.01761 | null |
2024-09-03 | Dynamic Wall Shear Stress Measurement using Event-based 3D Particle Tracking | Christian E. Willert et.al. | 2409.01757 | null |
2024-09-03 | Correlating grain boundary character and composition in 3-dimensions using 4D-scanning precession electron diffraction and atom probe tomography | Saurabh M. Das et.al. | 2409.01753 | null |
2024-09-03 | When 3D Partial Points Meets SAM: Tooth Point Cloud Segmentation with Sparse Labels | Yifan Liu et.al. | 2409.01691 | null |
2024-09-03 | 3D Morphology and Motions of the Canis Major Region from Gaia DR3 | Yiwei Dong et.al. | 2409.01670 | null |
2024-09-03 | Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation | Haodong Wang et.al. | 2409.01662 | null |
2024-09-03 | $S^2$ NeRF: Privacy-preserving Training Framework for NeRF | Bokang Zhang et.al. | 2409.01661 | link |
2024-09-03 | ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation | Wenlong Huang et.al. | 2409.01652 | null |
2024-09-03 | MSA-3D: Metallicity Gradients in Galaxies at $z\sim1$ with JWST/NIRSpec Slit-stepping Spectroscopy | Mengting Ju et.al. | 2409.01616 | null |
2024-09-03 | Data-driven topology design based on principal component analysis for 3D structural design problems | Jun Yang et.al. | 2409.01607 | null |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-09-03 | Exploring Hannan Limitation for 3D Antenna Array | Ran Ji et.al. | 2409.01566 | null |
2024-09-03 | EA-RAS: Towards Efficient and Accurate End-to-End Reconstruction of Anatomical Skeleton | Zhiheng Peng et.al. | 2409.01555 | null |
2024-09-02 | AMG: Avatar Motion Guided Video Generation | Zhangsihao Yang et.al. | 2409.01502 | link |
2024-09-02 | Three dimensional stationary solutions of the Electron MHD equations | Qirui Peng et.al. | 2409.01494 | null |
2024-09-07 | EarthGen: Generating the World from Top-Down Views | Ansh Sharma et.al. | 2409.01491 | link |
2024-09-02 | A novel 3D food printing technique: achieving tunable porosity and fracture properties via liquid rope coiling | Aref Ghorbani et.al. | 2409.01487 | null |
2024-09-02 | On Thermal Conduction in the Solar Atmosphere: An Analytical Solution for Nonlinear Diffusivity without Compact Support | Sondre Vik Furuseth et.al. | 2409.01467 | null |
2024-09-02 | 3D-LSPTM: An Automatic Framework with 3D-Large-Scale Pretrained Model for Laryngeal Cancer Detection Using Laryngoscopic Videos | Meiyu Qiu et.al. | 2409.01459 | null |
2024-09-02 | Computing virtual dark-field X-ray microscopy images of complex discrete dislocation structures from large-scale molecular dynamics simulations | Yifan Wang et.al. | 2409.01439 | null |
2024-09-02 | Dynamic Jahn-Teller effect in the strong spin-orbit coupling regime | Ivica Zivkovic et.al. | 2409.01436 | null |
2024-09-09 | DiffCSG: Differentiable CSG via Rasterization | Haocheng Yuan et.al. | 2409.01421 | null |
2024-09-02 | Correlations in interacting electron liquids: Many-body statistics and hyperuniformity | Haina Wang et.al. | 2409.01381 | null |
2024-09-02 | Rapidly yawing spheroids in viscous shear flow: Emergent loss of symmetry | Mohit P. Dalwadi et.al. | 2409.01273 | null |
2024-09-02 | Including the vacuum field energy in stellarator coil design | S. Guinchard et.al. | 2409.01268 | null |
2024-09-02 | Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling | Haicheng Liao et.al. | 2409.01256 | null |
2024-09-02 | Computer-generated holography enables high-uniformity, high-efficiency depth-of-focus extension in endoscopic OCT | Chengfu Gu et.al. | 2409.01252 | null |
2024-09-02 | Development and Validation of a Modular Sensor-Based System for Gait Analysis and Control in Lower-Limb Exoskeletons | Giorgos Marinou et.al. | 2409.01174 | null |
2024-09-02 | Variation of Camera Parameters due to Common Physical Changes in Focal Length and Camera Pose | Hsin-Yi Chen et.al. | 2409.01171 | null |
2024-09-02 | Two-stage initial-value iterative physics-informed neural networks for simulating solitary waves of nonlinear wave equations | Jin Song et.al. | 2409.01124 | null |
2024-09-02 | KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding | Zhihao Xu et.al. | 2409.01113 | link |
2024-09-02 | Fourth-order compact finite difference schemes for solving biharmonic equations with Dirichlet boundary conditions | Kejia Pan et.al. | 2409.01064 | null |
2024-09-02 | Free-DyGS: Camera-Pose-Free Scene Reconstruction based on Gaussian Splatting for Dynamic Surgical Videos | Qian Li et.al. | 2409.01003 | null |
2024-09-02 | Physics-informed DeepONet with stiffness-based loss functions for structural response prediction | Bilal Ahmed et.al. | 2409.00994 | null |
2024-09-12 | 3D Priors-Guided Diffusion for Blind Face Restoration | Xiaobin Lu et.al. | 2409.00991 | link |
2024-09-02 | XNet v2: Fewer Limitations, Better Results and Greater Universality | Yanfeng Zhou et.al. | 2409.00947 | link |
2024-09-01 | Fisher Information guided Purification against Backdoor Attacks | Nazmul Karim et.al. | 2409.00863 | link |
2024-09-01 | Image-to-Lidar Relational Distillation for Autonomous Driving Data | Anas Mahmoud et.al. | 2409.00845 | null |
2024-09-01 | Entropy Loss: An Interpretability Amplifier of 3D Object Detection Network for Intelligent Driving | Haobo Yang et.al. | 2409.00839 | link |
2024-09-01 | GroomCap: High-Fidelity Prior-Free Hair Capture | Yuxiao Zhou et.al. | 2409.00831 | null |
2024-09-01 | SonoHaptics: An Audio-Haptic Cursor for Gaze-Based Object Selection in XR | Hyunsung Cho et.al. | 2409.00784 | null |
2024-09-01 | DSLO: Deep Sequence LiDAR Odometry Based on Inconsistent Spatio-temporal Propagation | Huixin Zhang et.al. | 2409.00744 | link |
2024-09-01 | MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds | Ziqiang Dang et.al. | 2409.00736 | null |
2024-09-01 | Percolation in semicontinuum geometries | Jasna C. K et.al. | 2409.00699 | null |
2024-09-01 | Decoupled and Interactive Regression Modeling for High-performance One-stage 3D Object Detection | Weiping Xiao et.al. | 2409.00690 | null |
2024-09-01 | Study of Dropout in PointPillars with 3D Object Detection | Xiaoxiang Sun et.al. | 2409.00673 | null |
2024-09-01 | Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression | Dingyuan Zhang et.al. | 2409.00633 | link |
2024-09-01 | YOLOO: You Only Learn from Others Once | Lipeng Gu et.al. | 2409.00618 | null |
2024-09-01 | COMOGen: A Controllable Text-to-3D Multi-object Generation Framework | Shaorong Sun et.al. | 2409.00590 | null |
2024-08-31 | Programmable refractive functions | Md Sadman Sakib Rahman et.al. | 2409.00567 | null |
2024-08-31 | Compositional 3D-aware Video Generation with LLM Director | Hanxin Zhu et.al. | 2409.00558 | null |
2024-08-31 | ActionPose: Pretraining 3D Human Pose Estimation with the Dark Knowledge of Action | Longyun Liao et.al. | 2409.00449 | null |
2024-09-09 | Separation of Body and Background in Radiological Images. A Practical Python Code | Seyedeh Fahimeh Hosseini et.al. | 2409.00442 | link |
2024-09-06 | 3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images | YuanZheng Wu et.al. | 2409.00381 | null |
2024-09-05 | EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System | Bonan Liu et.al. | 2409.00343 | null |
2024-08-31 | Towards Secure and Usable 3D Assets: A Novel Framework for Automatic Visible Watermarking | Gursimran Singh et.al. | 2409.00314 | null |
2024-08-31 | The 3D kinetic Couette flow via the Boltzmann equation in the diffusive limit | Renjun Duan et.al. | 2409.00311 | null |
2024-08-30 | TorchDA: A Python package for performing data assimilation with deep learning forward and transformation functions | Sibo Cheng et.al. | 2409.00244 | null |
2024-08-30 | Social MediARverse Investigating Users Social Media Content Sharing and Consuming Intentions with Location-Based AR | Linda Hirsch et.al. | 2409.00211 | null |
2024-08-30 | Difference Equations: from Berry Connections to the Coulomb Branch | Andrea E. V. Ferrari et.al. | 2409.00173 | null |
2024-08-26 | A Lightweight Human Pose Estimation Approach for Edge Computing-Enabled Metaverse with Compressive Sensing | Nguyen Quang Hieu et.al. | 2409.00087 | null |
2024-08-30 | DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model | Mona Sheikh Zeinoddin et.al. | 2408.17433 | link |
2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
2024-08-30 | A Runge-type approximation theorem for the 3D unsteady Stokes system | Mitsuo Higaki et.al. | 2408.17228 | null |
2024-08-30 | OG-Mapping: Octree-based Structured 3D Gaussians for Online Dense Mapping | Meng Wang et.al. | 2408.17223 | null |
2024-08-30 | Modelling Growth, Remodelling and Damage of a Thick-walled Fibre-reinforced Artery with Active Response: Application to Cerebral Vasospasm and Treatment | Giulia Pederzani et.al. | 2408.17206 | null |
2024-08-30 | GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring | Emanuele Santellani et.al. | 2408.17149 | null |
2024-08-30 | Microscopic Structural Study on the Growth History of Granular Heaps Prepared by the Raining Method | Hanyu Li et.al. | 2408.17147 | null |
2024-08-30 | Joining simplified physics models with coarse grids to speed-up intractable 3D time-domain simulations | Wouter Deleersnyder et.al. | 2408.17137 | null |
2024-08-30 | Leveraging Digital Twin Technologies for Public Space Protection and Vulnerability Assessment | Artemis Stefanidou et.al. | 2408.17136 | null |
2024-08-30 | Multi-centric AI Model for Unruptured Intracranial Aneurysm Detection and Volumetric Segmentation in 3D TOF-MRI | Ashraya K. Indrakanti et.al. | 2408.17115 | null |
2024-08-30 | Reasoning AI Performance Degradation in 6G Networks with Large Language Models | Liming Huang et.al. | 2408.17097 | null |
2024-08-30 | Machine learning for predicting control landscape maps of quantum molecular dynamics: Laser-induced three-dimensional alignment of asymmetric top molecules | Tomotaro Namba et.al. | 2408.17089 | null |
2024-08-30 | Generalizing Deepfake Video Detection with Plug-and-Play: Video-Level Blending and Spatiotemporal Adapter Tuning | Zhiyuan Yan et.al. | 2408.17065 | link |
2024-08-30 | CP-VoteNet: Contrastive Prototypical VoteNet for Few-Shot Point Cloud Object Detection | Xuejing Li et.al. | 2408.17036 | null |
2024-08-30 | MakeWay: Object-Aware Costmaps for Proactive Indoor Navigation Using LiDAR | Binbin Xu et.al. | 2408.17034 | null |
2024-08-30 | ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images | Xiaoshuai Zhang et.al. | 2408.17027 | null |
2024-08-30 | 2DGH: 2D Gaussian-Hermite Splatting for High-quality Rendering and Better Geometry Reconstruction | Ruihan Yu et.al. | 2408.16982 | null |
2024-08-30 | Synthetic Lunar Terrain: A Multimodal Open Dataset for Training and Evaluating Neuromorphic Vision Algorithms | Marcus Märtens et.al. | 2408.16971 | null |
2024-08-29 | A Computational Framework for Modeling Emergence of Color Vision in the Human Brain | Atsunobu Kotani et.al. | 2408.16916 | link |
2024-08-29 | Ig3D: Integrating 3D Face Representations in Facial Expression Inference | Lu Dong et.al. | 2408.16907 | null |
2024-08-29 | A Spintronic Nano-Antenna Activated by Spin Injection from a Three-Dimensional Topological Insulator | Raisa Fabiha et.al. | 2408.16854 | null |
2024-08-29 | Deep learning approach for identification of HII regions during reionization in 21-cm observations – III. image recovery | Michele Bianco et.al. | 2408.16814 | null |
2024-08-29 | 3D Whole-body Grasp Synthesis with Directional Controllability | Georgios Paschalidis et.al. | 2408.16770 | null |
2024-08-29 | SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners | Ziyu Guo et.al. | 2408.16768 | link |
2024-08-29 | ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model | Fangfu Liu et.al. | 2408.16767 | null |
2024-08-29 | UV-free Texture Generation with Denoising and Geodesic Heat Diffusions | Simone Foti et.al. | 2408.16762 | link |
2024-09-01 | Generic Objects as Pose Probes for Few-Shot View Synthesis | Zhirui Gao et.al. | 2408.16690 | null |
2024-09-10 | Space3D-Bench: Spatial 3D Question Answering Benchmark | Emilia Szymanska et.al. | 2408.16662 | null |
2024-08-29 | 3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach | Ryota Tanaka et.al. | 2408.16638 | link |
2024-08-29 | SPH modelling of AGB wind morphology in hierarchical triple systems \& comparison to observation of R Aql | Jolien Malfait et.al. | 2408.16565 | link |
2024-08-30 | Beyond MR Image Harmonization: Resolution Matters Too | Savannah P. Hays et.al. | 2408.16562 | null |
2024-08-29 | Solitons in 4d Wess-Zumino-Witten models – Towards unification of integrable systems – | Masashi Hamanaka et.al. | 2408.16554 | null |
2024-08-29 | Spurfies: Sparse Surface Reconstruction using Local Geometry Priors | Kevin Raj et.al. | 2408.16544 | null |
2024-08-28 | Are Pose Estimators Ready for the Open World? STAGE: Synthetic Data Generation Toolkit for Auditing 3D Human Pose Estimators | Nikita Kister et.al. | 2408.16536 | null |
2024-08-28 | A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions | Yu Wang et.al. | 2408.16530 | link |
2024-08-29 | Towards Modality-agnostic Label-efficient Segmentation with Entropy-Regularized Distribution Alignment | Liyao Tang et.al. | 2408.16520 | link |
2024-08-29 | Improving 3D deep learning segmentation with biophysically motivated cell synthesis | Roman Bruch et.al. | 2408.16471 | null |
2024-08-29 | Can gap-edge illumination excite spirals in protoplanetary disks? Three-temperature radiation hydrodynamics and NIR image modelling | Dhruv Muley et.al. | 2408.16461 | null |
2024-08-29 | Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Sierra Bonilla et.al. | 2408.16445 | link |
2024-08-29 | Time-Optimized Trajectory Planning for Non-Prehensile Object Transportation in 3D | Lingyun Chen et.al. | 2408.16420 | null |
2024-08-29 | 3D Topological Modeling and Multi-Agent Movement Simulation for Viral Infection Risk Analysis | Wassim Jabi et.al. | 2408.16417 | null |
2024-08-29 | Virtual Fieldwork in Immersive Environments using Game Engines | Armin Bernstetter et.al. | 2408.16346 | null |
2024-08-29 | Ultranarrow-linewidth Wavelength-Vortex Metasurface Holography | Weijia Meng et.al. | 2408.16342 | null |
2024-08-29 | From layered 2D carbon to 3D tetrahedral original allotropes C12 and C18 with physical properties related to diamond: Crystal chemistry and DFT investigations | Samir F. Matar et.al. | 2408.16341 | null |
2024-08-29 | Steady Compressible 3D Euler Flows in Toroidal Volumes without Continuous Euclidean Isometries | Naoki Sato et.al. | 2408.16339 | null |
2024-08-29 | P2P-Bridge: Diffusion Bridges for 3D Point Cloud Denoising | Mathias Vogel et.al. | 2408.16325 | null |
2024-08-29 | Simulating the electrostatic patch force in sphere-plate and plate-plate geometries | Matthijs H. J. de Jong et.al. | 2408.16323 | null |
2024-08-29 | Iterated Energy-based Flow Matching for Sampling from Boltzmann Densities | Dongyeop Woo et.al. | 2408.16249 | null |
2024-08-30 | Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective | Zhijie Shen et.al. | 2408.16227 | null |
2024-08-29 | Uni-3DAD: GAN-Inversion Aided Universal 3D Anomaly Detection on Model-free Products | Jiayu Liu et.al. | 2408.16201 | null |
2024-08-29 | PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird’s-Eye-View | Zichen Yu et.al. | 2408.16200 | link |
2024-08-29 | Conformal Coordinates for Molecular Geometry: from 3D to 5D | Jesus Camargo et.al. | 2408.16188 | null |
2024-08-28 | Single-Photon 3D Imaging with Equi-Depth Photon Histograms | Kaustubh Sadekar et.al. | 2408.16150 | null |
2024-08-28 | Orbital magnetoelectric coupling of three dimensional Chern insulators | Xin Lu et.al. | 2408.16103 | null |
2024-08-28 | Influence of gauges in the numerical simulation of the time-dependent Ginzburg-Landau model | Cyril Tain et.al. | 2408.16086 | null |
2024-08-28 | Benchmarking with Supernovae: A Performance Study of the FLASH Code | Joshua Martin et.al. | 2408.16084 | null |
2024-08-28 | 3D Reconstruction with Spatial Memory | Hengyi Wang et.al. | 2408.16061 | null |
2024-09-04 | Extremal rotating BTZ black holes cannot be dressed in (anti-)self-dual Maxwell field | Hideki Maeda et.al. | 2408.16056 | null |
2024-08-28 | Electron Scattering at the Intensity Frontier with SoLID | Zein-Eddine Meziani et.al. | 2408.16037 | null |
2024-08-28 | TEDRA: Text-based Editing of Dynamic and Photoreal Actors | Basavaraj Sunagad et.al. | 2408.15995 | null |
2024-09-05 | More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding | Yuan Tang et.al. | 2408.15966 | link |
2024-08-28 | Efficient Slice Anomaly Detection Network for 3D Brain MRI Volume | Zeduo Zhang et.al. | 2408.15958 | null |
2024-08-28 | SLAM2REF: Advancing Long-Term Mapping with 3D LiDAR and Reference Map Integration for Precise 6-DoF Trajectory Estimation and Map Extension | Miguel Arturo Vega Torres et.al. | 2408.15948 | link |
2024-08-28 | DiffAge3D: Diffusion-based 3D-aware Face Aging | Junaid Wahid et.al. | 2408.15922 | null |
2024-08-28 | Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones | Carlos Plou et.al. | 2408.15899 | null |
2024-08-28 | SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors | Zhiqing Zhang et.al. | 2408.15887 | null |
2024-08-28 | BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR | Miguel Arturo Vega Torres et.al. | 2408.15870 | link |
2024-08-28 | Vertex characterization via second-order topological derivatives | Peter Gangl et.al. | 2408.15847 | null |
2024-08-28 | DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries | Yu Yang et.al. | 2408.15813 | null |
2024-08-28 | Multi-view Pose Fusion for Occlusion-Aware 3D Human Pose Estimation | Laura Bragagnolo et.al. | 2408.15810 | link |
2024-08-28 | wav2pos: Sound Source Localization using Masked Autoencoders | Axel Berg et.al. | 2408.15771 | link |
2024-08-28 | A Survey on Evaluation of Multimodal Large Language Models | Jiaxing Huang et.al. | 2408.15769 | null |
2024-08-28 | Emergent scalar-chirality \& colossal transverse-magnetoresponse in strongly correlated nodal-line half-metal | Jyotirmoy Sau et.al. | 2408.15754 | null |
2024-08-28 | MambaPlace:Text-to-Point-Cloud Cross-Modal Place Recognition with Attention Mamba Mechanisms | Tianyi Shang et.al. | 2408.15740 | link |
2024-08-28 | Towards Realistic Example-based Modeling via 3D Gaussian Stitching | Xinyu Gao et.al. | 2408.15708 | null |
2024-09-05 | G-Style: Stylized Gaussian Splatting | Áron Samuel Kovács et.al. | 2408.15695 | link |
2024-08-28 | DEAR: Depth-Enhanced Action Recognition | Sadegh Rahmaniboldaji et.al. | 2408.15679 | link |
2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et.al. | 2408.15657 | link |
2024-08-29 | RIDE: Boosting 3D Object Detection for LiDAR Point Clouds via Rotation-Invariant Analysis | Zhaoxuan Wang et.al. | 2408.15643 | null |
2024-08-28 | Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection | Sondos Mohamed et.al. | 2408.15637 | null |
2024-08-28 | Geometry-guided Feature Learning and Fusion for Indoor Scene Reconstruction | Ruihong Yin et.al. | 2408.15608 | null |
2024-08-28 | Computing optimal partition problems via Lagrange multiplier approach | Qing Cheng et.al. | 2408.15534 | null |
2024-08-28 | Topological string as massive spinning particle in three dimensions | I. Yu. Karataeva et.al. | 2408.15526 | null |
2024-08-28 | Ray-Distance Volume Rendering for Neural Scene Reconstruction | Ruihong Yin et.al. | 2408.15524 | null |
2024-08-28 | RoboSense: Large-scale Dataset and Benchmark for Multi-sensor Low-speed Autonomous Driving | Haisheng Su et.al. | 2408.15503 | link |
2024-08-28 | Feelit: Combining Compliant Shape Displays with Vision-Based Tactile Sensors for Real-Time Teletaction | Oscar Yu et.al. | 2408.15480 | null |
2024-08-27 | HEAD: A Bandwidth-Efficient Cooperative Perception Approach for Heterogeneous Connected and Autonomous Vehicles | Deyuan Qu et.al. | 2408.15428 | null |
2024-08-27 | Seamless 5G Automotive Connectivity with Integrated Satellite Terrestrial Networks in C-Band | Hung Nguyen-Kha et.al. | 2408.15394 | null |
2024-08-27 | Application of hybrid classical-quantum annealing technology to the 3D Bin-Packing Problem | Mohsen Rahmani et.al. | 2408.15365 | null |
2024-08-27 | Holographic Foliations: Self-Similar Quasicrystals from Hyperbolic Honeycombs | Latham Boyle et.al. | 2408.15316 | null |
2024-08-27 | A reaction network model of microscale liquid-liquid phase separation reveals effects of spatial dimension | Jinyoung Kim et.al. | 2408.15303 | null |
2024-08-22 | 3D Photon Counting CT Image Super-Resolution Using Conditional Diffusion Model | Chuang Niu et.al. | 2408.15283 | null |
2024-08-27 | Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Saining Zhang et.al. | 2408.15242 | link |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-08-27 | SAM & SAM 2 in 3D Slicer: SegmentWithSAM Extension for Annotating Medical Images | Zafer Yildiz et.al. | 2408.15224 | link |
2024-08-27 | Halo mass functions at high redshift | Hannah O’Brennan et.al. | 2408.15194 | null |
2024-08-27 | Turbulence and far-from-equilibrium equation of state of Bogoliubov waves in Bose-Einstein Condensates | Ying Zhu et.al. | 2408.15163 | null |
2024-08-27 | Warm Jupiters around M-dwarfs are great opportunities for extensive chemical, cloud and haze characterisation with JWST | Lucas Teinturier et.al. | 2408.15137 | null |
2024-08-27 | DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays | Yiran Sun et.al. | 2408.15118 | link |
2024-08-27 | A novel numerical framework for three-dimensional fully resolved simulation of freely falling particles of arbitrary shape | Taraprasad Bhowmick et.al. | 2408.15115 | null |
2024-08-27 | Few-Shot Unsupervised Implicit Neural Shape Representation Learning with Spatial Adversaries | Amine Ouasfi et.al. | 2408.15114 | null |
2024-08-27 | Data-Driven Nonlinear Deformation Design of 3D-Printable Shells | Samuel Silverman et.al. | 2408.15097 | link |
2024-08-28 | MMASD+: A Novel Dataset for Privacy-Preserving Behavior Analysis of Children with Autism Spectrum Disorder | Pavan Uttej Ravva et.al. | 2408.15077 | link |
2024-08-27 | Torus and hyperchaos in 3D Lotka-Volterra map | Sishu Shankar Muni et.al. | 2408.15054 | null |
2024-08-27 | Interactive Occlusion Boundary Estimation through Exploitation of Synthetic Data | Lintao Xu et.al. | 2408.15038 | null |
2024-08-26 | Magnetization patterns in $\text{GaAs}$-$\text{Fe}{\text{33}}\text{Co}{\text{67}}$ core-shell nanorods | Anastasiia Korniienko et.al. | 2408.15036 | null |
2024-08-27 | Sequence-aware Pre-training for Echocardiography Probe Guidance | Haojun Jiang et.al. | 2408.15026 | null |
2024-08-27 | A High Altitude Platform-Based 3D Geometrical Channel Model for Beamforming Characterization in Future 6G Flying Ad-Hoc Networks | Muhammet Kirik et.al. | 2408.14986 | null |
2024-08-27 | Probing coronal mass ejections inclination effects with EUHFORIA | Karmen Martinić et.al. | 2408.14971 | null |
2024-08-27 | BOX3D: Lightweight Camera-LiDAR Fusion for 3D Object Detection and Localization | Mario A. V. Saucedo et.al. | 2408.14941 | null |
2024-08-27 | MeshUp: Multi-Target Mesh Deformation via Blended Score Distillation | Hyunwoo Kim et.al. | 2408.14899 | null |
2024-08-27 | Robo-GS: A Physics Consistent Spatial-Temporal Model for Robotic Arm with Hybrid Representation | Haozhe Lou et.al. | 2408.14873 | null |
2024-08-27 | Phase behavior of symmetric diblock copolymers under 3D soft confinement | Zhijuan He et.al. | 2408.14863 | null |
2024-08-27 | DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose | Yusuke Yoshiyasu et.al. | 2408.14860 | null |
2024-08-27 | In situ fully vectorial tomography and pupil function retrieval of tightly focused fields | Xin Liu et.al. | 2408.14852 | null |
2024-08-27 | Diffusion-Occ: 3D Point Cloud Completion via Occupancy Diffusion | Guoqing Zhang et.al. | 2408.14846 | null |
2024-08-27 | LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming | Yuang Shi et.al. | 2408.14823 | link |
2024-08-27 | Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation | Abdelrahman Eldesokey et.al. | 2408.14819 | null |
2024-08-27 | OctFusion: Octree-based Diffusion Models for 3D Shape Generation | Bojun Xiong et.al. | 2408.14732 | link |
2024-08-27 | GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning | Shubhendu Jena et.al. | 2408.14724 | null |
2024-08-26 | 3D Point Cloud Network Pruning: When Some Weights Do not Matter | Amrijit Biswas et.al. | 2408.14601 | link |
2024-08-26 | PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection | Yidi Li et.al. | 2408.14600 | null |
2024-08-26 | Efficient fine-tuning of 37-level GraphCast with the Canadian global deterministic analysis | Christopher Subich et.al. | 2408.14587 | link |
2024-08-26 | A sparsity-aware distributed-memory algorithm for sparse-sparse matrix multiplication | Yuxi Hong et.al. | 2408.14558 | null |
2024-08-26 | $U(1)$ $R$ -Symmetry Topological Operators from Branes in Holography | Thomas Waddleton et.al. | 2408.14542 | null |
2024-08-26 | Obstruction to Broken Symmetries in Topological Flat Bands | Penghao Zhu et.al. | 2408.14533 | null |
2024-08-28 | On the Effects of Modeling on the Sim-to-Real Transfer Gap in Twinning the POWDER Platform | Maxwell McManus et.al. | 2408.14465 | null |
2024-08-26 | Few-Shot 3D Volumetric Segmentation with Multi-Surrogate Fusion | Meng Zheng et.al. | 2408.14427 | null |
2024-08-26 | LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation | Trung Dinh Quoc Dang et.al. | 2408.14415 | link |
2024-08-26 | From irregular to regular eutectic growth in the Al-Al3Ni system: in situ observations during directional solidification | Paul Chao et.al. | 2408.14346 | null |
2024-08-26 | LLM-3D Print: Large Language Models To Monitor and Control 3D Printing | Yayati Jadhav et.al. | 2408.14307 | null |
2024-09-04 | Learning Local Pattern Modularization for Point Cloud Reconstruction from Unseen Classes | Chao Chen et.al. | 2408.14279 | null |
2024-08-27 | Text3DAug – Prompted Instance Augmentation for LiDAR Perception | Laurenz Reichardt et.al. | 2408.14253 | link |
2024-08-26 | Exploiting ray tracing technology through OptiX to compute particle interactions with cutoff in a 3D environment on GPU | Bérenger Bramas et.al. | 2408.14247 | null |
2024-08-26 | Visuo-Tactile Exploration of Unknown Rigid 3D Curvatures by Vision-Augmented Unified Force-Impedance Control | Kübra Karacan et.al. | 2408.14219 | null |
2024-08-26 | MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement | Xu He et.al. | 2408.14211 | null |
2024-08-26 | Radial 3D Focusing Energy Critical INLS equation with a sub-critical perturbation: Ground states, Blow-up, and scattering | Tianxiang Gou et.al. | 2408.14161 | null |
2024-08-26 | Multi-Faceted Evaluation of Modeling Languages for Augmented Reality Applications – The Case of ARWFML | Fabian Muff et.al. | 2408.14137 | null |
2024-08-26 | Global uniform regularity for the 3D incompressible MHD equations with slip boundary condition near an equilibrium | Jincheng Gao et.al. | 2408.14123 | null |
2024-08-26 | ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation | Ruohua Shi et.al. | 2408.14114 | null |
2024-08-26 | Morphology of molecular clouds at kiloparsec scale in the Milky Way: Shear-induced alignment and vertical confinement | Yi-Heng Xie et.al. | 2408.14095 | null |
2024-08-26 | Variable offsets and processing of implicit forms toward the adaptive synthesis and analysis of heterogeneous conforming microstructure | Q. Y. Hong et.al. | 2408.14068 | null |
2024-09-02 | Active Search for Low-altitude UAV Sensing and Communication for Users at Unknown Locations | Yuanshuai Zheng et.al. | 2408.14067 | null |
2024-08-28 | FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2408.14035 | link |
2024-08-26 | Pixel-Aligned Multi-View Generation with Depth Guided Decoder | Zhenggang Tang et.al. | 2408.14016 | null |
2024-09-03 | A Multiscale Gradient Fusion Method for Edge Detection in Color Images Utilizing the CBM3D Filter | Zhuoyue Wang et.al. | 2408.14013 | null |
2024-08-26 | Avatar Concept Slider: Manipulate Concepts In Your Human Avatar With Fine-grained Control | Yixuan He et.al. | 2408.13995 | null |
2024-08-26 | ARANet: Attention-based Residual Adversarial Network with Deep Supervision for Radiotherapy Dose Prediction of Cervical Cancer | Lu Wen et.al. | 2408.13981 | null |
2024-08-26 | DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian Splatting | Weiwei Cai et.al. | 2408.13972 | link |
2024-08-25 | InterTrack: Tracking Human Object Interaction without Object Templates | Xianghui Xie et.al. | 2408.13953 | null |
2024-08-25 | Personalized Topology-Informed 12-Lead ECG Electrode Localization from Incomplete Cardiac MRIs for Efficient Cardiac Digital Twins | Lei Li et.al. | 2408.13945 | link |
2024-08-25 | Electronic correlations and long-range magnetic ordering in NiO tuned by pressure | G. M. Gaifutdinov et.al. | 2408.13937 | null |
2024-08-25 | OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation | Muhammad Rameez ur Rahman et.al. | 2408.13936 | link |
2024-08-27 | Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs | Brandon Smart et.al. | 2408.13912 | null |
2024-08-25 | Inverse Problem Regularization for 3D Multi-Species Tumor Growth Models | Ali Ghafouri et.al. | 2408.13903 | null |
2024-08-25 | TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training | Li Li et.al. | 2408.13902 | null |
2024-08-25 | DESI Peculiar Velocity Survey – Fundamental Plane | Khaled Said et.al. | 2408.13842 | null |
2024-08-25 | PropSAM: A Propagation-Based Model for Segmenting Any 3D Objects in Multi-Modal Medical Images | Zifan Chen et.al. | 2408.13836 | null |
2024-08-25 | TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Xiongwei Zhao et.al. | 2408.13802 | link |
2024-08-25 | Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-based Embedded 3D Object Detection | Seongmin Park et.al. | 2408.13798 | null |
2024-08-25 | High Order Smoothness for Stochastic Navier-Stokes Equations with Transport and Stretching Noise on Bounded Domains | Daniel Goodair et.al. | 2408.13791 | null |
2024-08-25 | 3D-VirtFusion: Synthetic 3D Data Augmentation through Generative Diffusion Models and Controllable Editing | Shichao Dong et.al. | 2408.13788 | null |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-08-25 | Self-Parameterization Based Multi-Resolution Mesh Convolution Networks | Shi Hezi et.al. | 2408.13762 | null |
2024-08-25 | 3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification | Haizhao Jing et.al. | 2408.13728 | link |
2024-08-28 | PhysPart: Physically Plausible Part Completion for Interactable Objects | Rundong Luo et.al. | 2408.13724 | null |
2024-08-25 | Riemann-based Multi-scale Attention Reasoning Network for Text-3D Retrieval | Wenrui Li et.al. | 2408.13712 | link |
2024-08-25 | SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting | Wenrui Li et.al. | 2408.13711 | link |
2024-08-25 | SeeBelow: Sub-dermal 3D Reconstruction of Tumors with Surgical Robotic Palpation and Tactile Exploration | Raghava Uppuluri et.al. | 2408.13699 | null |
2024-08-24 | Segment Any Mesh: Zero-shot Mesh Part Segmentation via Lifting Segment Anything 2 to 3D | George Tang et.al. | 2408.13679 | link |
2024-08-24 | GenCA: A Text-conditioned Generative Model for Realistic and Drivable Codec Avatars | Keqiang Sun et.al. | 2408.13674 | null |
2024-08-24 | Evaluating the Robustness of LiDAR-based 3D Obstacles Detection and Its Impacts on Autonomous Driving Systems | Tri Minh Triet Pham et.al. | 2408.13653 | null |
2024-08-24 | Temporally-consistent 3D Reconstruction of Birds | Johannes Hägerlind et.al. | 2408.13629 | null |
2024-08-24 | STAResNet: a Network in Spacetime Algebra to solve Maxwell’s PDEs | Alberto Pepe et.al. | 2408.13619 | link |
2024-08-24 | The Boltzmann equation in the homogeneous critical regularity framework | Jing Liu et.al. | 2408.13610 | null |
2024-08-24 | PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model | Hao Yang et.al. | 2408.13574 | link |
2024-08-24 | Plug-and-Play Drag Sail Module for LEO Satellites: Implementation and Early Testing of AirDragMod (ADM) | Anshuman Shukla et.al. | 2408.13562 | null |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-24 | R2G: Reasoning to Ground in 3D Scenes | Yixuan Li et.al. | 2408.13499 | null |
2024-08-24 | AdaOcc: Adaptive-Resolution Occupancy Prediction | Chao Chen et.al. | 2408.13454 | null |
2024-08-23 | Towards Robust Perception for Assistive Robotics: An RGB-Event-LiDAR Dataset and Multi-Modal Detection Pipeline | Adam Scicluna et.al. | 2408.13394 | null |
2024-08-23 | BiGS: Bidirectional Gaussian Primitives for Relightable 3D Gaussian Splatting | Zhenyuan Liu et.al. | 2408.13370 | null |
2024-08-23 | SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting | Jiseung Hong et.al. | 2408.13285 | link |
2024-08-23 | LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation | Shuai Yang et.al. | 2408.13252 | null |
2024-08-23 | Re-evaluation of Face Anti-spoofing Algorithm in Post COVID-19 Era Using Mask Based Occlusion Attack | Vaibhav Sundharam et.al. | 2408.13251 | null |
2024-08-23 | Impact of HI cooling and study of accretion disks in AGB wind-companion smoothed particle hydrodynamic simulations | Jolien Malfait et.al. | 2408.13158 | link |
2024-08-26 | Focus on Neighbors and Know the Whole: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation | Bonan Li et.al. | 2408.13149 | null |
2024-08-23 | Quantum-critical and dynamical properties of the XXZ bilayer with long-range interactions | Patrick Adelhardt et.al. | 2408.13145 | null |
2024-08-23 | Deep Learning at the Intersection: Certified Robustness as a Tool for 3D Vision | Gabriel Pérez S et.al. | 2408.13135 | null |
2024-08-23 | Extremal Structures with Embedded Pre-Failure Indicators | Christoffer Fyllgraf Christensen et.al. | 2408.13113 | null |
2024-08-23 | The solar beryllium abundance revisited with 3D non-LTE models | A. M. Amarsi et.al. | 2408.13105 | null |
2024-08-23 | Functional Tensor Decompositions for Physics-Informed Neural Networks | Sai Karthikeya Vemuri et.al. | 2408.13101 | link |
2024-08-23 | Magnetic correlations and Griffith-like phase in Co $2$TiSi${0.5}$Al$_{0.5}$ Heusler alloy | Priyanka Yadav et.al. | 2408.13081 | null |
2024-08-23 | Reconstruction of partially occluded objects with a physics-driven self-training neural network | Mingjun Xiang et.al. | 2408.13066 | null |
2024-08-23 | SIMPLE: Simultaneous Multi-Plane Self-Supervised Learning for Isotropic MRI Restoration from Anisotropic Data | Rotem Benisty et.al. | 2408.13065 | null |
2024-08-23 | Atlas Gaussians Diffusion for 3D Generation with Infinite Number of Points | Haitao Yang et.al. | 2408.13055 | null |
2024-08-23 | G3FA: Geometry-guided GAN for Face Animation | Alireza Javanmardi et.al. | 2408.13049 | null |
2024-08-23 | Identification and validation of the dynamic model of a tendon-driven anthropomorphic finger | Junnan Li et.al. | 2408.13044 | null |
2024-08-23 | S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points | Bing He et.al. | 2408.13036 | link |
2024-08-23 | Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding | Xianqiang Gao et.al. | 2408.13024 | null |
2024-08-23 | A plug-and-play framework for curvilinear structure segmentation based on a learned reconnecting regularization | Sophie Carneiro-Esteves et.al. | 2408.12943 | null |
2024-08-23 | Identifying band structure changes of FePS3 across the antiferromagnetic phase transition | Benjamin Pestka et.al. | 2408.12896 | null |
2024-08-23 | FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering | Yunji Seo et.al. | 2408.12894 | null |
2024-08-23 | T3M: Text Guided 3D Human Motion Synthesis from Speech | Wenshuo Peng et.al. | 2408.12885 | link |
2024-08-23 | Stable 3D vortex solitons of high topological charge in a Rydberg-dressed Bose-Einstein condensate with spin-orbit coupling | Yanchao Zhang et.al. | 2408.12878 | null |
2024-08-23 | S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image Analysis | Kamal Basha S et.al. | 2408.12833 | link |
2024-08-22 | CatFree3D: Category-agnostic 3D Object Detection with Diffusion | Wenjing Bian et.al. | 2408.12747 | null |
2024-08-22 | Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection | Ruixiao Zhang et.al. | 2408.12708 | null |
2024-08-22 | Free-breathing 3D cardiac extracellular volume (ECV) mapping using a linear tangent space alignment (LTSA) model | Wonil Lee et.al. | 2408.12706 | null |
2024-08-26 | GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion | Jiaxin Wei et.al. | 2408.12677 | link |
2024-08-22 | BMN-like sectors in 4d $\mathcal N=4$ SYM with boundaries and interfaces | Andrea Chaney et.al. | 2408.12651 | null |
2024-08-22 | DreamCinema: Cinematic Transfer with Free Camera and 3D Character | Weiliang Chen et.al. | 2408.12601 | null |
2024-08-22 | ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction | Ziyu Tang et.al. | 2408.12598 | link |
2024-08-22 | Core formation by binary scouring and gravitational wave recoil in massive elliptical galaxies | Nader Khonji et.al. | 2408.12537 | null |
2024-08-22 | UMAD: University of Macau Anomaly Detection Benchmark Dataset | Dong Li et.al. | 2408.12527 | link |
2024-08-22 | Advanced atom-level representations for protein flexibility prediction utilizing graph neural networks | Sina Sarparast et.al. | 2408.12519 | null |
2024-08-22 | Unveiling the Physics of Core-Collapse Supernovae with the Line Emission Mapper: Observing Cassiopeia A | S. Orlando et.al. | 2408.12462 | null |
2024-08-22 | A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures | Tahmina Khanam et.al. | 2408.12443 | link |
2024-08-22 | 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment | Kaihui Cheng et.al. | 2408.12419 | null |
2024-09-04 | Dynamic PDB: A New Dataset and a SE(3) Model Extension by Integrating Dynamic Behaviors and Physical Properties in Protein Structures | Ce Liu et.al. | 2408.12413 | null |
2024-08-22 | Multi-Style Facial Sketch Synthesis through Masked Generative Modeling | Bowen Sun et.al. | 2408.12400 | null |
2024-08-22 | Multimodal Foundational Models for Unsupervised 3D General Obstacle Detection | Tamás Matuszka et.al. | 2408.12322 | null |
2024-08-22 | Subsurface Scattering for 3D Gaussian Splatting | Jan-Niklas Dihlmann et.al. | 2408.12282 | null |
2024-08-22 | Gas dynamics around a Jupiter-mass planet I. Influence of protoplanetary disk properties | E. Lega et.al. | 2408.12233 | null |
2024-08-23 | Transientangelo: Few-Viewpoint Surface Reconstruction Using Single-Photon Lidar | Weihan Luo et.al. | 2408.12191 | null |
2024-08-22 | Mitigation of Gilbert Damping in the CoFe/CuOx Orbital Torque System | Shilei Ding et.al. | 2408.12165 | null |
2024-08-22 | Enhancing Sampling Protocol for Robust Point Cloud Classification | Chongshou Li et.al. | 2408.12062 | null |
2024-08-22 | Correlation Effects in a Simplified Bilayer Two-Orbital Hubbard Model at Half Filling | Jian-Jian Yang et.al. | 2408.12042 | null |
2024-08-21 | FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization | Son Tung Nguyen et.al. | 2408.12037 | link |
2024-08-21 | Radiation Hydrodynamic Simulations of Massive Stars in Gas-rich Environments: Accretion of AGN Stars Suppressed By Thermal Feedback | Yi-Xian Chen et.al. | 2408.12017 | null |
2024-08-21 | Distributed-Memory Parallel Algorithms for Sparse Matrix and Sparse Tall-and-Skinny Matrix Multiplication | Isuru Ranawaka et.al. | 2408.11988 | link |
2024-09-03 | Chemical Reaction Neural Networks for Fitting Accelerating Rate Calorimetry Data | Saakaar Bhatnagar et.al. | 2408.11984 | null |
2024-08-21 | Protostellar Outflows at the EarliesT Stages (POETS) V. The launching mechanism of protostellar winds via water masers | Luca Moscadelli et.al. | 2408.11968 | null |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-09-04 | CT-AGRG: Automated Abnormality-Guided Report Generation from 3D Chest CT Volumes | Theo Di Piazza et.al. | 2408.11965 | null |
2024-08-21 | CARLA Drone: Monocular 3D Object Detection from a Different Perspective | Johannes Meier et.al. | 2408.11958 | null |
2024-08-21 | Magnetization Plateaus in the Two-dimensional S = 1/2 Heisenberg Model with a 3 $\times$ 3 Checkerboard Structure | Xuyang Liang et.al. | 2408.11899 | null |
2024-08-21 | EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Xiuwei Xu et.al. | 2408.11811 | null |
2024-08-21 | ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation | Shiqi Yang et.al. | 2408.11805 | null |
2024-08-21 | Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models | Yuzhou Huang et.al. | 2408.11801 | null |
2024-08-30 | GeoMeter: Probing Depth and Height Perception of Large Visual-Language Models | Shehreen Azad et.al. | 2408.11748 | link |
2024-08-21 | Open-Ended 3D Point Cloud Instance Segmentation | Phuc D. A. Nguyen et.al. | 2408.11747 | null |
2024-08-21 | Cultural Windows: Towards a Workflow for Immersive Journeys into Global Living Spaces | Hessam Djavaherpour et.al. | 2408.11723 | null |
2024-08-21 | Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of Distractors | Paul Ungermann et.al. | 2408.11697 | link |
2024-08-21 | Collaborative Robot Arm Inserting Nasopharyngeal Swabs with Admittance Control | Peter Q. Lee et.al. | 2408.11688 | null |
2024-08-21 | LiFCal: Online Light Field Camera Calibration via Bundle Adjustment | Aymeric Fleith et.al. | 2408.11682 | null |
2024-08-21 | Improved Blow-Up Criterion in a Variational Framework for Nonlinear SPDEs | Daniel Goodair et.al. | 2408.11678 | null |
2024-08-21 | Systematic study of confinement induced effects on atomic electronic structure | Hugo Åström et.al. | 2408.11595 | null |
2024-08-21 | Achieving specific yet transient bonds between anisotropic colloids | Muraleedharapai Mayarani et.al. | 2408.11569 | null |
2024-08-21 | Positional Prompt Tuning for Efficient 3D Representation Learning | Shaochen Zhang et.al. | 2408.11567 | link |
2024-08-21 | Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance | Duc-Hai Pham et.al. | 2408.11559 | null |
2024-08-22 | DeRainGS: Gaussian Splatting for Enhanced Scene Reconstruction in Rainy Environments | Shuhong Liu et.al. | 2408.11540 | null |
2024-08-21 | Classification of Mitral Regurgitation from Cardiac Cine MRI using Clinically-Interpretable Morphological Features | Y. On et.al. | 2408.11532 | link |
2024-08-21 | EmoFace: Emotion-Content Disentangled Speech-Driven 3D Talking Face with Mesh Attention | Yihong Lin et.al. | 2408.11518 | null |
2024-08-21 | Re-evaluation of the cosmic-ray ionization rate in diffuse clouds | M. Obolentseva et.al. | 2408.11511 | null |
2024-08-21 | MeTTA: Single-View to 3D Textured Mesh Reconstruction with Test-Time Adaptation | Kim Yu-Ji et.al. | 2408.11465 | null |
2024-08-21 | MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering | Yonglin Tian et.al. | 2408.11464 | null |
2024-08-21 | GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting | Wanshui Gan et.al. | 2408.11447 | link |
2024-08-27 | Pano2Room: Novel View Synthesis from a Single Indoor Panorama | Guo Pu et.al. | 2408.11413 | link |
2024-08-21 | Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection | Jingwei Sun et.al. | 2408.11408 | link |
2024-08-21 | HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model | Yi Wang et.al. | 2408.11357 | null |
2024-08-21 | Multimodal Datasets and Benchmarks for Reasoning about Dynamic Spatio-Temporality in Everyday Environments | Takanori Ugai et.al. | 2408.11347 | null |
2024-08-21 | Three-dimensional bond-order formation in kagome metals AV $_3$Sb$_5$ (A=Cs, Rb, K) analyzed by the density-wave equation method | Seiichiro Onari et.al. | 2408.11337 | null |
2024-08-21 | FATE: Focal-modulated Attention Encoder for Temperature Prediction | Tajamul Ashraf et.al. | 2408.11336 | link |
2024-08-21 | Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation | Chuandong Liu et.al. | 2408.11280 | link |
2024-08-21 | Irregularity Inspection using Neural Radiance Field | Tianqi Ding et.al. | 2408.11251 | null |
2024-08-20 | Coherent radio emission from ‘Main-sequence Radio Pulse emitters’: a new stellar diagnostic to probe 3D magnetospheric structures | Barnali Das et.al. | 2408.11242 | null |
2024-08-20 | CooPre: Cooperative Pretraining for V2X Cooperative Perception | Seth Z. Zhao et.al. | 2408.11241 | null |
2024-08-20 | Structural Morphing Metasurface for Electromagnetic Beam Manipulation | Aakash Bansal et.al. | 2408.11231 | null |
2024-08-20 | OCTCube: A 3D foundation model for optical coherence tomography that improves cross-dataset, cross-disease, cross-device and cross-modality analysis | Zixuan Liu et.al. | 2408.11227 | null |
2024-08-20 | Generalized Path Integral Energy and Heat Capacity Estimators of Quantum Oscillators and Crystals using Harmonic Mapping | Sabry G. Moustafa et.al. | 2408.11214 | null |
2024-08-20 | A Short Review and Evaluation of SAM2’s Performance in 3D CT Image Segmentation | Yufan He et.al. | 2408.11210 | link |
2024-08-20 | Quantum Inverse Contextual Vision Transformers (Q-ICVT): A New Frontier in 3D Object Detection for AVs | Sanjay Bhargav Dharavath et.al. | 2408.11207 | link |
2024-08-20 | Ophthalmic Biomarker Detection: Highlights from the IEEE Video and Image Processing Cup 2023 Student Competition | Ghassan AlRegib et.al. | 2408.11170 | null |
2024-08-20 | Target-Oriented Object Grasping via Multimodal Human Guidance | Pengwei Xie et.al. | 2408.11138 | null |
2024-09-01 | The densities in diffuse and translucent molecular clouds: estimates from observations of C $_2$ and from 3-dimensional extinction maps | David A. Neufeld et.al. | 2408.11108 | null |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | link |
2024-08-20 | Atmospheric Transport Modeling of CO $_2$ with Neural Networks | Vitus Benson et.al. | 2408.11032 | link |
2024-08-20 | OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding | Youjun Zhao et.al. | 2408.11030 | link |
2024-08-20 | Large Point-to-Gaussian Model for Image-to-3D Generation | Longfei Lu et.al. | 2408.10935 | null |
2024-08-20 | ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining | Qi Ma et.al. | 2408.10906 | null |
2024-08-20 | Open 3D World in Autonomous Driving | Xinlong Cheng et.al. | 2408.10880 | null |
2024-08-20 | Evolution of Semi-convective Staircases in Rotating Flows: Consequences for Fuzzy Cores in Giant Planets | J. R. Fuentes et.al. | 2408.10833 | null |
2024-08-20 | ZebraPose: Zebra Detection and Pose Estimation using only Synthetic Data | Elia Bonetto et.al. | 2408.10831 | null |
2024-08-20 | MPL: Lifting 3D Human Pose from Multi-view 2D Poses | Seyed Abolfazl Ghasemzadeh et.al. | 2408.10805 | link |
2024-08-20 | Learning Part-aware 3D Representations by Fusing 2D Gaussians and Superquadrics | Zhirui Gao et.al. | 2408.10789 | null |
2024-08-20 | Detection of Intracranial Hemorrhage for Trauma Patients | Antoine P. Sanner et.al. | 2408.10768 | link |
2024-08-20 | TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai et.al. | 2408.10739 | null |
2024-08-20 | Coarse-to-Fine Detection of Multiple Seams for Robotic Welding | Pengkun Wei et.al. | 2408.10710 | null |
2024-08-20 | Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant | Guofeng Mei et.al. | 2408.10652 | null |
2024-08-20 | OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robot in Dynamic Environments via State Space Model | Junming Wang et.al. | 2408.10618 | null |
2024-08-21 | MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration | Yanbo Ding et.al. | 2408.10605 | link |
2024-08-20 | MV-MOS: Multi-View Feature Fusion for 3D Moving Object Segmentation | Jintao Cheng et.al. | 2408.10602 | link |
2024-08-20 | DEGAS: Detailed Expressions on Full-Body Gaussian Avatars | Zhijing Shao et.al. | 2408.10588 | link |
2024-08-20 | Multi-view Hand Reconstruction with a Point-Embedded Transformer | Lixin Yang et.al. | 2408.10581 | link |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-20 | Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation | Jiawei Han et.al. | 2408.10537 | link |
2024-08-20 | Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception | Jiaru Zhong et.al. | 2408.10531 | null |
2024-08-20 | Correlation between dust continuum and CN line emissions in high-mass star-forming regions | Jihye Hwang et.al. | 2408.10506 | null |
2024-08-20 | GPT-based Textile Pilling Classification Using 3D Point Cloud Data | Yu Lu et.al. | 2408.10496 | null |
2024-08-19 | Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | Liu He et.al. | 2408.10453 | null |
2024-08-19 | Random dynamics of solutions for three-dimensional stochastic globally modified Navier-Stokes equations on unbounded Poincaré domains | Kush Kinra et.al. | 2408.10426 | null |
2024-08-19 | Chiral edge mode for single-cone Dirac fermions | C. W. J. Beenakker et.al. | 2408.10415 | null |
2024-08-28 | Stacking-Dependent Van Hove Singularity Shifts in Three-Dimensional Charge Density Waves of Kagome Metals AV $_3$Sb$_5$ (A = K, Rb, Cs) | Chanchal K. Barman et.al. | 2408.10402 | null |
2024-08-19 | Intensity and Dimensionality-Dependent Dynamics of Laser-Proton Acceleration in 1D, 2D, and 3D Particle-in-Cell Simulations | Lillian A. Daneshmand et.al. | 2408.10386 | null |
2024-08-19 | Galaxy cluster profiles: A Gaussian mixture model approach to halo miscentering | Matthew Currie et.al. | 2408.10371 | link |
2024-08-19 | $\text{AdS}_4$ Holography and the Hilbert Scheme | Samuel Crew et.al. | 2408.10313 | null |
2024-08-21 | NeRF-US: Removing Ultrasound Imaging Artifacts from Neural Radiance Fields in the Wild | Rishit Dagli et.al. | 2408.10258 | null |
2024-08-19 | MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model | Minghua Liu et.al. | 2408.10198 | null |
2024-08-19 | SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views | Chao Xu et.al. | 2408.10195 | null |
2024-08-19 | Physics-Aware Combinatorial Assembly Planning using Deep Reinforcement Learning | Ruixuan Liu et.al. | 2408.10162 | link |
2024-08-20 | LoopSplat: Loop Closure by Registering 3D Gaussian Splats | Liyuan Zhu et.al. | 2408.10154 | link |
2024-08-19 | Perceptual Depth Quality Assessment of Stereoscopic Omnidirectional Images | Wei Zhou et.al. | 2408.10134 | null |
2024-08-19 | Learning Precise Affordances from Egocentric Videos for Robotic Manipulation | Gen Li et.al. | 2408.10123 | null |
2024-08-19 | Geometry Informed Tokenization of Molecules for Language Model Generation | Xiner Li et.al. | 2408.10120 | null |
2024-08-19 | LNQ 2023 challenge: Benchmark of weakly-supervised techniques for mediastinal lymph node quantification | Reuben Dorent et.al. | 2408.10069 | null |
2024-08-19 | SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition | Wiktor Mucha et.al. | 2408.10037 | link |
2024-08-19 | P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders | Xuechao Chen et.al. | 2408.10007 | null |
2024-08-19 | Pose-GuideNet: Automatic Scanning Guidance for Fetal Head Ultrasound from Pose Estimation | Qianhui Men et.al. | 2408.09931 | null |
2024-08-19 | DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery | Corentin Dumery et.al. | 2408.09928 | null |
2024-08-19 | Crystallite size and microstrain in the structure of SrTiO3 formed by magnetron deposition with and without O2 flow through the deposition chambre | Zdeněk Jansa et.al. | 2408.09913 | null |
2024-08-19 | 3D-Aware Instance Segmentation and Tracking in Egocentric Videos | Yash Bhalgat et.al. | 2408.09860 | null |
2024-08-19 | OccMamba: Semantic Occupancy Prediction with State Space Models | Heng Li et.al. | 2408.09859 | link |
2024-08-19 | Coarse-Fine View Attention Alignment-Based GAN for CT Reconstruction from Biplanar X-Rays | Zhi Qiao et.al. | 2408.09736 | null |
2024-08-21 | Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning | Zhi Qiao et.al. | 2408.09731 | null |
2024-08-19 | Fragment and Geometry Aware Tokenization of Molecules for Structure-Based Drug Design Using Language Models | Cong Fu et.al. | 2408.09730 | null |
2024-08-19 | Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM | Sanghyun Hahn et.al. | 2408.09727 | link |
2024-08-19 | Double-Precision Floating-Point Data Visualizations Using Vulkan API | Nezihe Sozen et.al. | 2408.09699 | null |
2024-08-19 | An Efficient Deep Reinforcement Learning Model for Online 3D Bin Packing Combining Object Rearrangement and Stable Placement | Peiwen Zhou et.al. | 2408.09694 | null |
2024-08-19 | SG-GS: Photo-realistic Animatable Human Avatars with Semantically-Guided Gaussian Splatting | Haoyu Zhao et.al. | 2408.09665 | null |
2024-08-19 | 3D-printed terahertz subwavelength dual-core fibers with dense channel-integration | Haiyuan Ge et.al. | 2408.09664 | null |
2024-08-20 | CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning | Haoyu Zhao et.al. | 2408.09663 | null |
2024-08-18 | Imaging ferroelectric domains with soft X-ray ptychography at the oxygen K-edge | Tim A. Butcher et.al. | 2408.09608 | null |
2024-08-18 | Giant magnetic anisotropy of Pb atoms in 3d-based magnets | Weiyi Xia et.al. | 2408.09580 | null |
2024-08-18 | Tunable topological edge modes in Su-Schrieffer-Heeger arrays | G. J. Chaplain et.al. | 2408.09575 | null |
2024-08-18 | Ferroelectric Smectic C Liquid Crystal Phase with Spontaneous Polarization in the Direction of the Director | Hirotsugu Kikuchi et.al. | 2408.09520 | null |
2024-08-18 | SYZ Mirrors in non-Abelian 3d Mirror Symmetry | Ki Fung Chan et.al. | 2408.09479 | null |
2024-08-18 | G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors | Haoxin Yang et.al. | 2408.09458 | link |
2024-08-18 | Combo: Co-speech holistic 3D human motion generation and efficient customizable adaptation in harmony | Chao Xu et.al. | 2408.09397 | null |
2024-08-18 | VRCopilot: Authoring 3D Layouts with Generative AI Models in VR | Lei Zhang et.al. | 2408.09382 | null |
2024-08-18 | Flemme: A Flexible and Modular Learning Platform for Medical Images | Guoqing Zhang et.al. | 2408.09369 | link |
2024-08-18 | Improving Lung Cancer Diagnosis and Survival Prediction with Deep Learning and CT Imaging | Xiawei Wang et.al. | 2408.09367 | null |
2024-08-18 | Meta-Learning Empowered Meta-Face: Personalized Speaking Style Adaptation for Audio-Driven 3D Talking Face Animation | Xukun Zhou et.al. | 2408.09357 | null |
2024-08-18 | Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration | Hao Ai et.al. | 2408.09336 | null |
2024-08-18 | Unpaired Volumetric Harmonization of Brain MRI with Conditional Latent Diffusion | Mengqi Wu et.al. | 2408.09315 | null |
2024-08-17 | Flatten: Video Action Recognition is an Image Classification task | Junlin Chen et.al. | 2408.09220 | null |
2024-08-22 | FQGA-single: Towards Fewer Training Epochs and Fewer Model Parameters for Image-to-Image Translation Tasks | Cho Yang et.al. | 2408.09218 | null |
2024-08-17 | Learning Based Toolpath Planner on Diverse Graphs for 3D Printing | Yuming Huang et.al. | 2408.09198 | null |
2024-08-17 | GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System | Shuo Wang et.al. | 2408.09191 | null |
2024-08-20 | Gaussian in the Dark: Real-Time View Synthesis From Inconsistent Dark Images Using Gaussian Splatting | Sheng Ye et.al. | 2408.09130 | link |
2024-08-27 | Barbie: Text to Barbie-Style 3D Avatars | Xiaokun Sun et.al. | 2408.09126 | link |
2024-08-17 | MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation | Xiao Zhao et.al. | 2408.09122 | null |
2024-08-17 | Temporal Reversed Training for Spiking Neural Networks with Generalized Spatio-Temporal Representation | Lin Zuo et.al. | 2408.09108 | null |
2024-08-17 | HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction | Xiao Zhao et.al. | 2408.09104 | null |
2024-08-17 | Depth-guided Texture Diffusion for Image Semantic Segmentation | Wei Sun et.al. | 2408.09097 | null |
2024-08-17 | Composite solitary vortices of three-wave mixing in quasi-phase-matched photonic crystals | Chao Kong et.al. | 2408.09086 | null |
2024-08-17 | Isotope-Selective Strong Field Ionization of Semi-Heavy Water | Andrew J. Howard et.al. | 2408.09056 | null |
2024-08-16 | Ambient air plasma acceleration in tightly-focused ultrashort infrared laser beams | Marianna Lytova et.al. | 2408.09052 | null |
2024-08-16 | ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation | Hao Tang et.al. | 2408.09042 | null |
2024-08-16 | Multiband polarimetric imaging of HD 34700 with SCExAO/CHARIS | Minghan Chen et.al. | 2408.09038 | null |
2024-08-16 | Application of mesh refinement to relativistic magnetic reconnection | Revathi Jambunathan et.al. | 2408.08960 | null |
2024-08-16 | Height-function-based 4D reference metrics for hyperboloidal evolution | Alex Vañó-Viñuales et.al. | 2408.08952 | null |
2024-08-14 | GeneticPrism: Multifaceted Visualization of Scientific Impact Evolutions | Ye Sun et.al. | 2408.08912 | null |
2024-08-16 | Multi-task Learning Approach for Intracranial Hemorrhage Prognosis | Miriam Cobo et.al. | 2408.08784 | link |
2024-08-16 | Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS | Wei Sun et.al. | 2408.08723 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-16 | Modeling the Neonatal Brain Development Using Implicit Neural Representations | Florentin Bieder et.al. | 2408.08647 | link |
2024-08-16 | Reference-free Axial Super-resolution of 3D Microscopy Images using Implicit Neural Representation with a 2D Diffusion Prior | Kyungryun Lee et.al. | 2408.08616 | link |
2024-08-16 | Magnetic fields in the outskirts of PSZ2 G096.88+24.18 from depolarization analysis of radio relics | E. De Rubeis et.al. | 2408.08603 | null |
2024-08-16 | Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation | Tri Ton et.al. | 2408.08591 | null |
2024-08-16 | Movable Antenna for Wireless Communications:Prototyping and Experimental Results | Zhenjun Dong et.al. | 2408.08588 | null |
2024-08-16 | Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster Scheduling | Xinyi Zhang et.al. | 2408.08586 | null |
2024-08-16 | Detection and tracking of MAVs using a LiDAR with rosette scanning pattern | Sándor Gazdag et.al. | 2408.08555 | null |
2024-08-15 | Differentiating Three-Dimensional Molecular Structures using Laser-induced Coulomb Explosion Imaging | Huynh Van Sa Lam et.al. | 2408.08389 | null |
2024-08-15 | MSA-3D: dissecting galaxies at z~1 with high spatial and spectral resolution | Ivana Barišić et.al. | 2408.08350 | link |
2024-08-15 | Graph representations of 3D data for machine learning | Tomasz Prytuła et.al. | 2408.08336 | null |
2024-08-15 | HeightLane: BEV Heightmap guided 3D Lane Detection | Chaesong Park et.al. | 2408.08270 | null |
2024-08-15 | Coarsening and parallelism with reduction multigrids for hyperbolic Boltzmann transport | S. Dargaville et.al. | 2408.08262 | null |
2024-08-15 | Comparative Evaluation of 3D Reconstruction Methods for Object Pose Estimation | Varun Burde et.al. | 2408.08234 | link |
2024-08-15 | Learned Multimodal Compression for Autonomous Driving | Hadi Hadizadeh et.al. | 2408.08211 | null |
2024-08-15 | WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting | Huapeng Li et.al. | 2408.08206 | null |
2024-08-15 | Towards Practical Human Motion Prediction with LiDAR Point Clouds | Xiao Han et.al. | 2408.08202 | null |
2024-08-24 | Your Turn: At Home Turning Angle Estimation for Parkinson’s Disease Severity Assessment | Qiushuo Cheng et.al. | 2408.08182 | null |
2024-08-15 | Chemical complexity and dust formation around evolved stars | Marie Van de Sande et.al. | 2408.08153 | null |
2024-08-15 | Study of non-diffusive thermal behaviors in nanoscale transistors under different heating strategies | Chuang Zhang et.al. | 2408.08120 | null |
2024-08-15 | Exploring Uncertainty Visualization for Degenerate Tensors in 3D Symmetric Second-Order Tensor Field Ensembles | Tadea Schmitz et.al. | 2408.08099 | link |
2024-08-16 | OC3D: Weakly Supervised Outdoor 3D Object Detection with Only Coarse Click Annotation | Qiming Xia et.al. | 2408.08092 | null |
2024-08-15 | Single-image coherent reconstruction of objects and humans | Sarthak Batra et.al. | 2408.08086 | null |
2024-08-15 | MambaMIM: Pre-training Mamba with State Space Token-interpolation | Fenghe Tang et.al. | 2408.08070 | link |
2024-08-15 | Text2BIM: Generating Building Models Using a Large Language Model-based Multi-Agent Framework | Changyu Du et.al. | 2408.08054 | link |
2024-08-15 | MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing | Chenjie Cao et.al. | 2408.08000 | null |
2024-08-15 | Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Wenxuan Li et.al. | 2408.07999 | link |
2024-08-19 | FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering | Guofeng Feng et.al. | 2408.07967 | link |
2024-08-15 | A new blowup criterion for the 3D barotropic compressible Navier-Stokes equations with vacuum | Saiguo Xu et.al. | 2408.07935 | null |
2024-08-15 | GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Yutong Wang et.al. | 2408.07917 | link |
2024-08-15 | Persistence Image from 3D Medical Image: Superpixel and Optimized Gaussian Coefficient | Yanfan Zhu et.al. | 2408.07905 | link |
2024-08-15 | Chern-Simons theories with defects, Rogers-Ramanujan type functions and eta-products | Tadashi Okazaki et.al. | 2408.07893 | null |
2024-08-18 | Complementarity-Free Multi-Contact Modeling and Optimization for Dexterous Manipulation | Wanxin Jin et.al. | 2408.07855 | link |
2024-08-14 | Laboratory confirmation and improved Accuracy of 4f and 5d energy levels of Fe II previously identified from stellar spectra | M. Ding et.al. | 2408.07833 | null |
2024-08-14 | Spectroscopy of excited quarkonium states in the light-front quark model | Ritwik Acharyya et.al. | 2408.07715 | null |
2024-08-14 | RSD-DOG : A New Image Descriptor based on Second Order Derivatives | Darshan Venkatrayappa et.al. | 2408.07687 | null |
2024-08-14 | See It All: Contextualized Late Aggregation for 3D Dense Captioning | Minjung Kim et.al. | 2408.07648 | null |
2024-08-14 | Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks | Liting Jiang et.al. | 2408.07613 | null |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-14 | Optimizing UAV Trajectory for Emergency Response Operations under Real 3D Environments: Integrating Priority Levels and LoS Constraints | Mohammad Taghi Dabiri et.al. | 2408.07589 | null |
2024-08-14 | Graph polyhedral divisions in growing cell aggregates | Urban Železnik et.al. | 2408.07551 | null |
2024-08-14 | 3D Gaussian Editing with A Single Image | Guan Luo et.al. | 2408.07540 | null |
2024-08-14 | Improved 3D Whole Heart Geometry from Sparse CMR Slices | Yiyang Xu et.al. | 2408.07532 | link |
2024-08-14 | Attention-Guided Perturbation for Unsupervised Image Anomaly Detection | Tingfeng Huang et.al. | 2408.07490 | null |
2024-08-14 | LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image | Fan Yang et.al. | 2408.07422 | null |
2024-08-18 | Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space | Hyunjee Lee et.al. | 2408.07416 | null |
2024-08-14 | RoCoSDF: Row-Column Scanned Neural Signed Distance Fields for Freehand 3D Ultrasound Imaging Shape Reconstruction | Hongbo Chen et.al. | 2408.07325 | link |
2024-08-14 | Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling | Ruofeng Wei et.al. | 2408.07266 | null |
2024-08-14 | Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training | Ethan Kou et.al. | 2408.07239 | link |
2024-08-13 | Near-Field Localization with Antenna Arrays in the Presence of Direction-Dependent Mutual Coupling | Zohreh Ebadi et.al. | 2408.07202 | null |
2024-08-13 | Micro-integrated crossed-beam optical dipole trap system with long-term alignment stability for mobile atomic quantum technologies | Marc Christ et.al. | 2408.07187 | null |
2024-08-13 | Flexible 3D Lane Detection by Hierarchical Shape MatchingFlexible 3D Lane Detection by Hierarchical Shape Matching | Zhihao Guan et.al. | 2408.07163 | link |
2024-08-13 | Universal non-thermal power-law distribution functions from the self-consistent evolution of collisionless electrostatic plasmas | Uddipan Banik et.al. | 2408.07127 | null |
2024-08-13 | Physics-informed graph neural networks for flow field estimation in carotid arteries | Julian Suk et.al. | 2408.07110 | null |
2024-08-24 | A new non-parametric method to infer galaxy cluster masses from weak lensing | Tobias Mistele et.al. | 2408.07026 | link |
2024-08-14 | Content and Style Aware Audio-Driven Facial Animation | Qingju Liu et.al. | 2408.07005 | null |
2024-08-13 | SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis | Saptarshi Neil Sinha et.al. | 2408.06975 | null |
2024-08-13 | Mesh Simplification For Unfolding | Manas Bhargava et.al. | 2408.06944 | null |
2024-08-13 | Global well-posedness of the 3D primitive equations with horizontal viscosity and vertical diffusivity II: close to $H^1$ initial data | Chongsheng Cao et.al. | 2408.06932 | null |
2024-08-13 | SceneGPT: A Language Model for 3D Scene Understanding | Shivam Chandhok et.al. | 2408.06926 | null |
2024-08-13 | Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries | Qi Song et.al. | 2408.06901 | null |
2024-08-19 | EEPPR: Event-based Estimation of Periodic Phenomena Rate using Correlation in 3D | Jakub Kolář et.al. | 2408.06899 | link |
2024-08-13 | PBIR-NIE: Glossy Object Capture under Non-Distant Lighting | Guangyan Cai et.al. | 2408.06878 | null |
2024-08-20 | Spectrum Prediction With Deep 3D Pyramid Vision Transformer Learning | Guangliang Pan et.al. | 2408.06870 | link |
2024-08-13 | FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving | Yutao Zhu et.al. | 2408.06832 | null |
2024-08-13 | Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions | Miao Zhang et.al. | 2408.06772 | null |
2024-08-13 | MAIR++: Improving Multi-view Attention Inverse Rendering with Implicit Lighting Representation | JunYong Choi et.al. | 2408.06707 | null |
2024-08-13 | SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields | Yu Liu et.al. | 2408.06697 | null |
2024-08-13 | DC3DO: Diffusion Classifier for 3D Objects | Nursena Koprucu et.al. | 2408.06693 | link |
2024-08-13 | Bi-directional Contextual Attention for 3D Dense Captioning | Minjung Kim et.al. | 2408.06662 | null |
2024-08-13 | Gromov-Witten invariants in family and quantum cohomology | Indranil Biswas et.al. | 2408.06616 | null |
2024-08-13 | ViMo: Generating Motions from Casual Videos | Liangdong Qiu et.al. | 2408.06614 | null |
2024-08-13 | MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers | Zichao Dong et.al. | 2408.06604 | null |
2024-08-13 | GeoFormer: Learning Point Cloud Completion with Tri-Plane Integrated Transformer | Jinpeng Yu et.al. | 2408.06596 | link |
2024-08-13 | Recent advances in solar data-driven MHD simulations of the formation and evolution of CME flux ropes | Brigitte Schmieder et.al. | 2408.06595 | null |
2024-08-13 | ActiveNeRF: Learning Accurate 3D Geometry by Active Pattern Projection | Jianyu Tao et.al. | 2408.06592 | link |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | The Kinetic Analogue of the Pressure-Strain Interaction | Sarah A. Conley et.al. | 2408.06508 | null |
2024-08-12 | Benchmarking tree species classification from proximally-sensed laser scanning data: introducing the FOR-species20K dataset | Stefano Puliti et.al. | 2408.06507 | link |
2024-08-12 | Implicit Neural Representation For Accurate CFD Flow Field Prediction | Laurent de Vito et.al. | 2408.06486 | null |
2024-08-12 | UniT: Unified Tactile Representation for Robot Learning | Zhengtong Xu et.al. | 2408.06481 | link |
2024-08-11 | Autoregressive Enzyme Function Prediction with Multi-scale Multi-modality Fusion | Dingyi Rong et.al. | 2408.06391 | null |
2024-08-12 | HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors | Hyungtae Lim et.al. | 2408.06328 | null |
2024-08-12 | Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering | Jiameng Li et.al. | 2408.06286 | link |
2024-08-12 | Sparsity Based Multi-Source Robust 3D Localization Using a Moving Receiver | Amir Mansourian et.al. | 2408.06274 | null |
2024-08-12 | 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) | Jaydeep Rade et.al. | 2408.06244 | null |
2024-08-12 | FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Lukas Meyer et.al. | 2408.06190 | link |
2024-08-12 | Emergent superconductivity and pair density wave at antiphase boundaries of charge density wave order in kagome metals | Xianghe Han et.al. | 2408.06174 | null |
2024-08-12 | Zero-shot 3D Segmentation of Abdominal Organs in CT Scans Using Segment Anything Model 2: Adapting Video Tracking Capabilities for 3D Medical Imaging | Yosuke Yamagishi et.al. | 2408.06170 | null |
2024-08-12 | Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance | Taewon Kang et.al. | 2408.06157 | null |
2024-08-12 | Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models | Ioannis Romanelis et.al. | 2408.06145 | link |
2024-08-12 | MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception | Sven Teufel et.al. | 2408.06137 | link |
2024-08-12 | RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation | Zhiyuan Zhang et.al. | 2408.06110 | link |
2024-08-12 | Sliding van der Waals Polytypes | Maayan Vizner Stern et.al. | 2408.06088 | null |
2024-08-12 | CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer | Zhuoyi Yang et.al. | 2408.06072 | link |
2024-08-12 | Multi-winged Lorenz attractors due to bifurcations of a periodic orbit with multipliers $(-1,i,-i)$ | Efrosiniia Karatetskaia et.al. | 2408.06052 | null |
2024-08-12 | Developing Smart MAVs for Autonomous Inspection in GPS-denied Constructions | Paoqiang Pan et.al. | 2408.06030 | null |
2024-08-12 | HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors | Xiaozheng Zheng et.al. | 2408.06019 | null |
2024-08-12 | DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation | Jisoo Kim et.al. | 2408.06010 | null |
2024-08-12 | Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models | Haifan Gong et.al. | 2408.05985 | null |
2024-08-12 | MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection | Zitian Wang et.al. | 2408.05945 | null |
2024-08-12 | Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation | Utkarsh Nath et.al. | 2408.05938 | null |
2024-08-21 | CMAB: A First National-Scale Multi-Attribute Building Dataset in China Derived from Open Source Data and GeoAI | Yecheng Zhang et.al. | 2408.05891 | null |
2024-08-12 | Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning | Xinrong Hu et.al. | 2408.05889 | link |
2024-08-11 | HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training | Fenghe Tang et.al. | 2408.05815 | link |
2024-08-11 | Prototype Learning Guided Hybrid Network for Breast Tumor Segmentation in DCE-MRI | Lei Zhou et.al. | 2408.05803 | link |
2024-08-11 | A Comparative Study of Convolutional and Recurrent Neural Networks for Storm Surge Prediction in Tampa Bay | Mandana Farhang Ghahfarokhi et.al. | 2408.05797 | null |
2024-08-11 | Modified Trento initial condition and its impact on collective flows and global polarization in Cu+Au collisions | Ze-Fang Jiang et.al. | 2408.05774 | null |
2024-08-11 | Contrastive masked auto-encoders based self-supervised hashing for 2D image and 3D point cloud cross-modal retrieval | Rukai Wei et.al. | 2408.05711 | null |
2024-08-11 | Evaluating BM3D and NBNet: A Comprehensive Study of Image Denoising Across Multiple Datasets | Ghazal Kaviani et.al. | 2408.05697 | null |
2024-08-10 | BeyondCT: A deep learning model for predicting pulmonary function from chest CT scans | Kaiwen Geng et.al. | 2408.05645 | null |
2024-08-21 | Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis | Zhongche Qu et.al. | 2408.05635 | null |
2024-08-10 | PRTGaussian: Efficient Relighting Using 3D Gaussians with Precomputed Radiance Transfer | Libo Zhang et.al. | 2408.05631 | link |
2024-08-10 | Evolutionary Neural Architecture Search for 3D Point Cloud Analysis | Yisheng Yang et.al. | 2408.05556 | null |
2024-08-10 | CryoBench: Diverse and challenging datasets for the heterogeneity problem in cryo-EM | Minkyu Jeon et.al. | 2408.05526 | link |
2024-08-10 | PointMT: Efficient Point Cloud Analysis with Hybrid MLP-Transformer Architecture | Qiang Zheng et.al. | 2408.05508 | null |
2024-08-10 | 3D Stress Tensor for Gravity in 4D Flat Spacetime | Arjun Bagchi et.al. | 2408.05494 | null |
2024-08-10 | Observation of Kolmogorov turbulence due to multiscale vortices in dusty plasma experiments | Sachin Sharma et.al. | 2408.05480 | null |
2024-08-20 | Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE | Yiying Yang et.al. | 2408.05477 | null |
2024-08-10 | Existence and non-uniqueness of probabilistically strong solutions to 3D stochastic magnetohydrodynamic equations | Wenping Cao et.al. | 2408.05450 | null |
2024-08-10 | Mesh deformation-based single-view 3D reconstruction of thin eyeglasses frames with differentiable rendering | Fan Zhang et.al. | 2408.05402 | null |
2024-08-09 | Expected $1.x$ -Makespan-Optimal MAPF on Grids in Low-Poly Time | Teng Guo et.al. | 2408.05385 | link |
2024-08-09 | PRISM Lite: A lightweight model for interactive 3D placenta segmentation in ultrasound | Hao Li et.al. | 2408.05372 | link |
2024-07-31 | Enabling Quick, Accurate Crowdsourced Annotation for Elevation-Aware Flood Extent Mapping | Landon Dyken et.al. | 2408.05350 | null |
2024-08-09 | Large-scale cosmic ray anisotropies with 19 years of data from the Pierre Auger Observatory | The Pierre Auger Collaboration et.al. | 2408.05292 | null |
2024-08-09 | Gapped Phases in (2+1)d with Non-Invertible Symmetries: Part I | Lakshya Bhardwaj et.al. | 2408.05266 | null |
2024-08-09 | Design and Fabrication of Soft Locomotion Robots based on Spatial Compliant Mechanisms | Andrija Milojevic et.al. | 2408.05207 | null |
2024-08-09 | Global Existence of Large Strong Solutions to the 3D Full Compressible Navier-Stokes Equations with Density-dependent Viscosity | Yachun Li et.al. | 2408.05138 | null |
2024-08-09 | The LATIN-PGD methodology to nonlinear dynamics and quasi-brittle materials for future earthquake engineering applications | Sebastian Rodriguez et.al. | 2408.05108 | null |
2024-08-09 | Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection | Xincheng Pang et.al. | 2408.05107 | null |
2024-08-09 | Evaluating Layout Dimensionalities in PC+VR Asymmetric Collaborative Decision Making | Daniel Enriquez et.al. | 2408.05105 | null |
2024-08-15 | DeepInteraction++: Multi-Modality Interaction for Autonomous Driving | Zeyu Yang et.al. | 2408.05075 | link |
2024-08-09 | RadarPillars: Efficient Object Detection from 4D Radar Point Clouds | Alexander Musiat et.al. | 2408.05020 | null |
2024-08-09 | DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-09 | Improving 3D Cellular Positioning Integrity with Bayesian RAIM | Liqin Ding et.al. | 2408.04994 | null |
2024-08-09 | Mesh-based Object Tracking for Dynamic Semantic 3D Scene Graphs via Ray Tracing | Lennart Niecksch et.al. | 2408.04979 | null |
2024-08-09 | Cosmic Anisotropy and Bianchi Characterization: Killing vector fields and the implied finding of their metric frame | Robbert W. Scholtens et.al. | 2408.04938 | null |
2024-08-09 | GuidedNet: Semi-Supervised Multi-Organ Segmentation via Labeled Data Guide Unlabeled Data | Haochen Zhao et.al. | 2408.04914 | link |
2024-08-09 | Parity Violating Marginal Deformation of the 3D Gross-Neveu-Thirring Model | Gordon W. Semenoff et.al. | 2408.04855 | null |
2024-08-14 | Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction | Lingbei Meng et.al. | 2408.04831 | null |
2024-08-09 | FewShotNeRF: Meta-Learning-based Novel View Synthesis for Rapid Scene-Specific Adaptation | Piraveen Sivakumar et.al. | 2408.04803 | null |
2024-08-09 | A High-Temperature Thermocouple Development by Additive Manufacturing: Tungsten-Nickel (W-Ni) and Molybdenum (Mo) Integration with Ceramic Structures | Azizul Islam et.al. | 2408.04800 | null |
2024-08-08 | Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2 | Andrew Seohwan Yu et.al. | 2408.04762 | null |
2024-08-08 | Climatic Effects of Ocean Salinity on M Dwarf Exoplanets | Kyle Batra et.al. | 2408.04754 | null |
2024-08-08 | Noise-augmented Chaotic Ising Machines for Combinatorial Optimization and Sampling | Kyle Lee et.al. | 2408.04744 | null |
2024-08-08 | DiPGrasp: Parallel Local Searching for Efficient Differentiable Grasp Planning | Wenqiang Xu et.al. | 2408.04738 | null |
2024-08-08 | Extreme heating of minor ions in imbalanced solar-wind turbulence | Michael F. Zhang et.al. | 2408.04703 | null |
2024-08-08 | Open-Source Software Architecture for Multi-Robot Wire Arc Additive Manufacturing (WAAM) | Honglu He et.al. | 2408.04677 | null |
2024-08-08 | Towards High-resolution 3D Anomaly Detection via Group-Level Feature Contrastive Learning | Hongze Zhu et.al. | 2408.04604 | null |
2024-08-08 | Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond | Ravi Ramamoorthi et.al. | 2408.04586 | null |
2024-08-08 | Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User’s Casual Sketches | Yongzhi Xu et.al. | 2408.04567 | null |
2024-08-09 | Development of the cadmium zinc TElluride Radiation Imager (TERI) | Daniel Shy et.al. | 2408.04559 | null |
2024-08-11 | Synchronous Multi-modal Semantic Communication System with Packet-level Coding | Yun Tian et.al. | 2408.04535 | null |
2024-08-08 | Towards Synergistic Deep Learning Models for Volumetric Cirrhotic Liver Segmentation in MRIs | Vandan Gorade et.al. | 2408.04491 | null |
2024-08-06 | LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting | Joanna Kaleta et.al. | 2408.04474 | link |
2024-08-08 | A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery | Mengya Xu et.al. | 2408.04426 | link |
2024-08-08 | MultiViPerFrOG: A Globally Optimized Multi-Viewpoint Perception Framework for Camera Motion and Tissue Deformation | Guido Caccianiga et.al. | 2408.04367 | null |
2024-08-08 | An Explainable Non-local Network for COVID-19 Diagnosis | Jingfu Yang et.al. | 2408.04300 | null |
2024-08-08 | Spin polarization of $Λ$ hyperons along beam direction in p+Pb collisions at $\sqrt{s_{NN}}=8.16$ TeV using hydrodynamic approaches | Cong Yi et.al. | 2408.04296 | null |
2024-08-08 | Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods | Yiming Zhou et.al. | 2408.04268 | null |
2024-08-08 | InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting | Xin-Yi Yu et.al. | 2408.04249 | null |
2024-08-08 | Contact angle hysteresis and static friction for 2-dimensional droplets | Jong-In Yang et.al. | 2408.04240 | null |
2024-08-08 | High-Efficiency Urban 3D Radio Map Estimation Based on Sparse Measurements | Xinwei Chen et.al. | 2408.04205 | null |
2024-08-07 | The Quest for Early Detection of Retinal Disease: 3D CycleGAN-based Translation of Optical Coherence Tomography into Confocal Microscopy | Xin Tian et.al. | 2408.04091 | null |
2024-08-07 | Task-oriented Sequential Grounding in 3D Scenes | Zhuofan Zhang et.al. | 2408.04034 | null |
2024-08-07 | Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM | Yan Song Hu et.al. | 2408.03825 | null |
2024-08-07 | Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Joo Chan Lee et.al. | 2408.03822 | null |
2024-08-07 | Interactive Visual Analysis of Spatial Sensitivities | Marina Evers et.al. | 2408.03817 | link |
2024-08-07 | Improved Tangential Interpolation-based Multi-input Multi-output Modal Analysis of a Full Aircraft | Gabriele Dessena et.al. | 2408.03810 | null |
2024-08-07 | Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Christian Fruhwirth-Reisinger et.al. | 2408.03790 | link |
2024-08-07 | Counterfactuals and Uncertainty-Based Explainable Paradigm for the Automated Detection and Segmentation of Renal Cysts in Computed Tomography Images: A Multi-Center Study | Zohaib Salahuddin et.al. | 2408.03789 | null |
2024-08-07 | Magnetic Field Controlled Surface Localization of Spin-Wave Ferromagnetic Resonance Modes in 3D Nanostructures | Mateusz Gołębiewski et.al. | 2408.03782 | null |
2024-08-07 | 3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting | Zhe Jun Tang et.al. | 2408.03753 | link |
2024-08-09 | L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection | Xun Huang et.al. | 2408.03677 | link |
2024-08-07 | 2D Embeddings of Multi-dimensional Partitionings | Marina Evers et.al. | 2408.03641 | link |
2024-08-07 | The Navier-Stokes Cauchy problem in a class of weighted function spaces | Paolo Maremonti et.al. | 2408.03604 | null |
2024-08-07 | A deeper investigation of the Primordial Binary Cluster | Qingshun Hu et.al. | 2408.03552 | null |
2024-08-07 | VPOcc: Exploiting Vanishing Point for Monocular 3D Semantic Occupancy Prediction | Junsu Kim et.al. | 2408.03551 | null |
2024-08-07 | CLIP-based Point Cloud Classification via Point Cloud to Image Translation | Shuvozit Ghose et.al. | 2408.03545 | null |
2024-08-07 | PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model | Yunlong Huang et.al. | 2408.03540 | link |
2024-08-07 | PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting | Yijia Guo et.al. | 2408.03538 | null |
2024-08-07 | Leveraging LLMs for Enhanced Open-Vocabulary 3D Scene Understanding in Autonomous Driving | Amirhosein Chahe et.al. | 2408.03516 | link |
2024-08-07 | Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation | Weiqi Feng et.al. | 2408.03505 | null |
2024-08-07 | Opening the Black Box of 3D Reconstruction Error Analysis with VECTOR | Racquel Fygenson et.al. | 2408.03503 | link |
2024-08-06 | Next-order balanced model captures submesoscale physics and statistics | Ryan Shìjié Dù et.al. | 2408.03422 | link |
2024-08-06 | HeTraX: Energy Efficient 3D Heterogeneous Manycore Architecture for Transformer Acceleration | Pratyush Dhingra et.al. | 2408.03397 | null |
2024-08-06 | The ALMA-CRISTAL Survey: Spatial extent of [CII] line emission in star-forming galaxies at $z=4-6$ | Ryota Ikeda et.al. | 2408.03374 | null |
2024-08-06 | Intrinsic line profiles for X-ray fluorescent lines in SKIRT | Bert Vander Meulen et.al. | 2408.03367 | null |
2024-08-06 | Segment Anything in Medical Images and Videos: Benchmark and Deployment | Jun Ma et.al. | 2408.03322 | link |
2024-08-06 | MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation | Xiaofeng Mao et.al. | 2408.03312 | null |
2024-08-06 | Two-color Ytterbium MOT in a compact dual-chamber setup | Xin Wang et.al. | 2408.03310 | null |
2024-08-17 | Biomedical SAM 2: Segment Anything in Biomedical Images and Videos | Zhiling Yan et.al. | 2408.03286 | link |
2024-08-06 | ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer | Jiazhi Guan et.al. | 2408.03284 | null |
2024-08-19 | Kinetic scales dominated by magnetic helicity in space plasmas | A. Bershadskii et.al. | 2408.03267 | null |
2024-08-08 | Multi-User Mobile Augmented Reality for Cardiovascular Surgical Planning | Pratham Mehta et.al. | 2408.03249 | link |
2024-08-06 | Line-based 6-DoF Object Pose Estimation and Tracking With an Event Camera | Zibin Liu et.al. | 2408.03225 | link |
2024-08-06 | Microfluidic 3D Cell Culture: Potential Application of Collagen Hydrogels with an Optimal Dose of Bioactive Glasses | Faezeh Ghobadi et.al. | 2408.03196 | null |
2024-08-06 | Efficient NeRF Optimization – Not All Samples Remain Equally Hard | Juuso Korhonen et.al. | 2408.03193 | null |
2024-08-06 | An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion | Xingguang Yan et.al. | 2408.03178 | null |
2024-08-03 | Characterization of 3D printed micro-blades for cutting tissue-embedding material | Saisneha Koppaka et.al. | 2408.03155 | null |
2024-08-06 | High-dimensional quantum XYZ product codes for biased noise | Zhipeng Liang et.al. | 2408.03123 | null |
2024-08-06 | BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications | G. Manni et.al. | 2408.03078 | link |
2024-08-06 | MGFs: Masked Gaussian Fields for Meshing Building based on Multi-View Images | Tengfei Wang et.al. | 2408.03060 | null |
2024-08-06 | Training-Free Condition Video Diffusion Models for single frame Spatial-Semantic Echocardiogram Synthesis | Van Phi Nguyen et.al. | 2408.03035 | link |
2024-08-09 | DreamLCM: Towards High-Quality Text-to-3D Generation via Latent Consistency Model | Yiming Zhong et.al. | 2408.02993 | link |
2024-08-06 | Coupling of magnetism and transport properties to the lattice degrees of freedom in NdBaCo_2O_{5+δ} (δ ~ 0.65) | Himanshu Pant et.al. | 2408.02974 | null |
2024-08-06 | Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement | Hao Xu et.al. | 2408.02966 | null |
2024-08-07 | Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network | Xinyi Zhang et.al. | 2408.02922 | null |
2024-08-06 | VirtualNexus: Enhancing 360-Degree Video AR/VR Collaboration with Environment Cutouts and Virtual Replicas | Xincheng Huang et.al. | 2408.02914 | null |
2024-08-05 | Analyzing Data Efficiency and Performance of Machine Learning Algorithms for Assessing Low Back Pain Physical Rehabilitation Exercises | Aleksa Marusic et.al. | 2408.02855 | null |
2024-08-05 | Phase Transitions in Anisotropic Turbulence | Adrian van Kan et.al. | 2408.02844 | null |
2024-08-05 | DaCapo: a modular deep learning framework for scalable 3D image segmentation | William Patton et.al. | 2408.02834 | link |
2024-08-07 | Self-calibrating Intelligent OCT-SLO System | Mayank Goswami et.al. | 2408.02703 | null |
2024-08-05 | Interactive 3D Medical Image Segmentation with SAM 2 | Chuyun Shen et.al. | 2408.02635 | link |
2024-08-05 | T-duality of a bosonic string in a weakly curved space-time | Sonja Dedić et.al. | 2408.02626 | null |
2024-08-05 | MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization | Yiwen Chen et.al. | 2408.02555 | link |
2024-08-05 | RIs-Calib: An Open-Source Spatiotemporal Calibrator for Multiple 3D Radars and IMUs Based on Continuous-Time Estimation | Shuolong Chen et.al. | 2408.02444 | link |
2024-08-05 | Tensorial template matching for fast cross-correlation with rotations and its application for tomography | Antonio Martinez-Sanchez et.al. | 2408.02398 | null |
2024-08-05 | CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration | Gongxin Yao et.al. | 2408.02394 | null |
2024-08-05 | MaFreeI2P: A Matching-Free Image-to-Point Cloud Registration Paradigm with Active Camera Pose Retrieval | Gongxin Yao et.al. | 2408.02392 | null |
2024-08-05 | StoDIP: Efficient 3D MRF image reconstruction with deep image priors and stochastic iterations | Perla Mayo et.al. | 2408.02367 | null |
2024-08-05 | Self-centering 3-DOF feet controller for hands-free locomotion control in telepresence and virtual reality | Raphael Memmesheimer et.al. | 2408.02319 | null |
2024-08-05 | SelfGeo: Self-supervised and Geodesic-consistent Estimation of Keypoints on Deformable Shapes | Mohammad Zohaib et.al. | 2408.02291 | null |
2024-08-05 | Geometric Algebra Meets Large Language Models: Instruction-Based Transformations of Separate Meshes in 3D, Interactive and Controllable Scenes | Dimitris Angelis et.al. | 2408.02275 | null |
2024-08-05 | Accelerated 3D Maxwell Integral Equation Solver using the Interpolated Factored Green Function Method | Jagabandhu Paul et.al. | 2408.02274 | null |
2024-08-05 | VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking | Yuxuan Lu et.al. | 2408.02263 | null |
2024-08-07 | CompositingVis: Exploring Interactions for Creating Composite Visualizations in Immersive Environments | Qian Zhu et.al. | 2408.02240 | null |
2024-08-05 | REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models | Agneet Chatterjee et.al. | 2408.02231 | null |
2024-08-05 | SceneMotifCoder: Example-driven Visual Program Learning for Generating 3D Object Arrangements | Hou In Ivan Tam et.al. | 2408.02211 | null |
2024-08-05 | Synergistic Learning with Multi-Task DeepONet for Efficient PDE Problem Solving | Varun Kumar et.al. | 2408.02198 | link |
2024-08-05 | 3D hard sphere Boltzmann equation: explicit structure and the transition process from polynomial tail to Gaussian tail | Yu-Chu Lin et.al. | 2408.02183 | null |
2024-08-04 | An efficient strategy for path planning with a tethered marsupial robotics system | Jesús Capitán et.al. | 2408.02141 | link |
2024-08-20 | AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos | Feichi Lu et.al. | 2408.02110 | null |
2024-08-04 | View-consistent Object Removal in Radiance Fields | Yiren Lu et.al. | 2408.02100 | null |
2024-08-04 | Past Movements-Guided Motion Representation Learning for Human Motion Prediction | Junyu Shi et.al. | 2408.02091 | null |
2024-08-13 | KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Zhihao Lai et.al. | 2408.02088 | null |
2024-08-04 | PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone | Xin Yang et.al. | 2408.02053 | null |
2024-08-04 | 3D Single-object Tracking in Point Clouds with High Temporal Variation | Qiao Wu et.al. | 2408.02049 | null |
2024-08-04 | Characterizing the Performance of the Implicit Massively Parallel Particle-in-Cell iPIC3D Code | Jeremy J. Williams et.al. | 2408.01983 | null |
2024-08-04 | Computational Trichromacy Reconstruction: Empowering the Color-Vision Deficient to Recognize Colors Using Augmented Reality | Yuhao Zhu et.al. | 2408.01895 | null |
2024-08-03 | FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields | Yifan Wu et.al. | 2408.01878 | null |
2024-08-03 | Effect of Uniform and Non-uniform wall heating on Three-Dimensional Magneto-Hydrodynamics Natural Convection and Entropy Generation: A computational study using New Higher Order Super Compact Scheme | Ashwani Punia et.al. | 2408.01853 | null |
2024-08-03 | MotionTrace: IMU-based Field of View Prediction for Smartphone AR Interactions | Rahul Islam et.al. | 2408.01850 | null |
2024-08-03 | E $^3$ NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images | Yunshan Qi et.al. | 2408.01840 | null |
2024-08-16 | GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer | Yihong Lin et.al. | 2408.01826 | null |
2024-08-03 | Introducing Bidirectional Programming in Constructive Solid Geometry-Based CAD | J. Felipe Gonzalez et.al. | 2408.01801 | null |
2024-08-03 | Understanding the Challenges of OpenSCAD Users for 3D Printing | J. Felipe Gonzalez et.al. | 2408.01796 | null |
2024-08-03 | 3DStoryline: Immersive Visual Storytelling | Haonan Yao et.al. | 2408.01775 | null |
2024-08-03 | LAM3D: Leveraging Attention for Monocular 3D Object Detection | Diana-Alexandra Sas et.al. | 2408.01739 | null |
2024-08-03 | Real-time Localization and Mapping in Architectural Plans with Deviations | Muhammad Shaheer et.al. | 2408.01737 | null |
2024-08-03 | Contribution of the Cygnus bubble to the Galactic cosmic ray spectrum and diffuse $γ$ -ray emissions | Lin Nie et.al. | 2408.01693 | null |
2024-08-03 | SiamMo: Siamese Motion-Centric 3D Object Tracking | Yuxiang Yang et.al. | 2408.01688 | link |
2024-08-03 | iControl3D: An Interactive System for Controllable 3D Scene Generation | Xingyi Li et.al. | 2408.01678 | link |
2024-08-03 | HIVE: HIerarchical Volume Encoding for Neural Implicit Surface Reconstruction | Xiaodong Gu et.al. | 2408.01677 | null |
2024-08-03 | SAT3D: Image-driven Semantic Attribute Transfer in 3D | Zhijun Zhai et.al. | 2408.01664 | null |
2024-08-03 | Stimulating Imagination: Towards General-purpose Object Rearrangement | Jianyang Wu et.al. | 2408.01655 | null |
2024-08-03 | JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Model | Farzaneh Jafari et.al. | 2408.01627 | null |
2024-08-15 | Three-dimensional Morphological Reconstruction of Millimeter-Scale Soft Continuum Robots based on Dual-Stereo-Vision | Tian-Ao Ren et.al. | 2408.01615 | null |
2024-08-02 | THOR2: Leveraging Topological Soft Clustering of Color Space for Human-Inspired Object Recognition in Unseen Environments | Ekta U. Samani et.al. | 2408.01579 | link |
2024-08-02 | Enhanced Knee Kinematics: Leveraging Deep Learning and Morphing Algorithms for 3D Implant Modeling | Viet-Dung Nguyen et.al. | 2408.01557 | null |
2024-08-02 | 3D $\mathcal{N}=1$ supergravity from Virasoro TQFT: Gravitational partition function and Out-of-time-order correlator | Arpan Bhattacharyya et.al. | 2408.01538 | null |
2024-08-02 | Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans | Lukas Kratochvila et.al. | 2408.01526 | null |
2024-08-06 | Gibbs Sampling gives Quantum Advantage at Constant Temperatures with $O(1)$ -Local Hamiltonians | Joel Rajakumar et.al. | 2408.01516 | null |
2024-08-02 | 3D-Radial galaxy correlation function | Francesco Spezzati et.al. | 2408.01495 | null |
2024-08-02 | Type II orientifold flux vacua in 3D | Álvaro Arboleya et.al. | 2408.01403 | null |
2024-08-02 | PLIC-Net: A Machine Learning Approach for 3D Interface Reconstruction in Volume of Fluid Methods | Andrew Cahaly et.al. | 2408.01383 | null |
2024-08-02 | Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation | Yuanzhi Su et.al. | 2408.01356 | null |
2024-08-02 | 3DPX: Progressive 2D-to-3D Oral Image Reconstruction with Hybrid MLP-CNN Networks | Xiaoshuang Li et.al. | 2408.01292 | null |
2024-08-02 | TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling | Dong Huo et.al. | 2408.01291 | null |
2024-08-02 | Chirality and dimensionality in the ultrastrong light-matter coupling regime | R. Avriller et.al. | 2408.01275 | null |
2024-08-02 | A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness | Lutao Jiang et.al. | 2408.01269 | null |
2024-08-02 | NeRFoot: Robot-Footprint Estimation for Image-Based Visual Servoing | Daoxin Zhong et.al. | 2408.01251 | null |
2024-08-02 | Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion | Ke Li et.al. | 2408.01225 | link |
2024-08-02 | S2TD-Face: Reconstruct a Detailed 3D Face with Controllable Texture from a Single Sketch | Zidu Wang et.al. | 2408.01218 | link |
2024-08-02 | From Problem to Solution: Bio-inspired 3D Printing for Bonding Soft and Rigid Materials via Underextrusions | Arman Goshtasbi et.al. | 2408.01210 | null |
2024-08-02 | Dipole orientation reveals single-molecule interactions and dynamics on 2D crystals | Wei Guo et.al. | 2408.01207 | null |
2024-08-02 | 3D Genetic Metamaterials for Scattering Maximization | Dmitry Dobrykh et.al. | 2408.01170 | null |
2024-08-07 | IG-SLAM: Instant Gaussian SLAM | F. Aykut Sarikamis et.al. | 2408.01126 | null |
2024-08-02 | Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions | Ajinkya Shinde et.al. | 2408.01085 | null |
2024-08-02 | FCDFusion: a Fast, Low Color Deviation Method for Fusing Visible and Infrared Image Pairs | Hesong Li et.al. | 2408.01080 | null |
2024-08-02 | Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris | Kentaro Uno et.al. | 2408.01035 | null |
2024-08-02 | A Numerical Technique for Coupling the Momentum and the Continuity Equations for Semi-Implicit 3D Ocean Models | Ali Shahabi et.al. | 2408.00990 | null |
2024-08-01 | On the fate of the secondary white dwarf in double-degenerate double-detonation Type Ia supernovae – II. 3D synthetic observables | J. M. Pollin et.al. | 2408.00917 | null |
2024-08-01 | Demonstrating the Potential of Adaptive LMS Filtering on FPGA-Based Qubit Control Platforms for Improved Qubit Readout in 2D and 3D Quantum Processing Units | Hans Johnson et.al. | 2408.00904 | null |
2024-08-01 | Medical SAM 2: Segment medical images as video via Segment Anything Model 2 | Jiayuan Zhu et.al. | 2408.00874 | link |
2024-08-05 | UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization | Ziwen Guo et.al. | 2408.00860 | null |
2024-08-01 | AI-Enabled sensor fusion of time of flight imaging and mmwave for concealed metal detection | Chaitanya Kaul et.al. | 2408.00816 | null |
2024-08-01 | UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model | Xiangyu Fan et.al. | 2408.00762 | null |
2024-08-06 | Segment anything model 2: an application to 2D and 3D medical images | Haoyu Dong et.al. | 2408.00756 | link |
2024-08-01 | Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model | Benlin Liu et.al. | 2408.00754 | null |
2024-08-01 | MotionFix: Text-Driven 3D Human Motion Editing | Nikos Athanasiou et.al. | 2408.00712 | null |
2024-08-01 | Attosecond Probing of Coherent Vibrational Dynamics in CBr $_4$ | Jen-Hao Ou et.al. | 2408.00696 | null |
2024-08-01 | ExpertAF: Expert Actionable Feedback from Video | Kumar Ashutosh et.al. | 2408.00672 | null |
2024-08-01 | Droplet-confined electroplating for nanoscale additive manufacturing: current control of the initial stages of the growth of copper nanowires | Mirco Nydegger et.al. | 2408.00660 | null |
2024-08-01 | SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement | Mark Boss et.al. | 2408.00653 | null |
2024-08-15 | AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation | Asbjørn Munk et.al. | 2408.00640 | link |
2024-08-01 | CrystalTac: 3D-Printed Vision-Based Tactile Sensor Family through Rapid Monolithic Manufacturing Technique | Wen Fan et.al. | 2408.00638 | null |
2024-08-01 | Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection | Ruiyang Zhang et.al. | 2408.00619 | link |
2024-08-05 | U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight | Tongtong Feng et.al. | 2408.00606 | link |
2024-08-01 | Ground-to-UAV and RIS-assisted UAV-to-Ground Communication Under Channel Aging: Statistical Characterization and Outage Performance | Thanh Luan Nguyen et.al. | 2408.00600 | null |
2024-08-01 | Contrastive Learning with Dynamic Localized Repulsion for Brain Age Prediction on 3D Stiffness Maps | Jakob Träuble et.al. | 2408.00527 | null |
2024-08-01 | Relative Helicity and Tiling Twist | Boris Khesin et.al. | 2408.00522 | null |
2024-08-01 | SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation | Shengbo Tan et.al. | 2408.00496 | link |
2024-08-01 | HBot: A Chatbot for Healthcare Applications in Traditional Chinese Medicine Based on Human Body 3D Visualization | Bolin Zhang et.al. | 2408.00481 | null |
2024-08-01 | MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection | Youjia Fu et.al. | 2408.00438 | null |
2024-08-01 | DiM-Gesture: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 framework | Fan Zhang et.al. | 2408.00370 | null |
2024-08-01 | Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos | Subin Jeon et.al. | 2408.00351 | null |
2024-08-01 | Global large strong solution of the 3D inhomogeneous Navier-Stokes equations with density-dependent viscosity | Xiangdi Huang et.al. | 2408.00333 | null |
2024-08-01 | EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head | Qianyun He et.al. | 2408.00297 | null |
2024-08-01 | Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360° | Yuxiao He et.al. | 2408.00296 | null |
2024-08-01 | Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection | Jiacheng Deng et.al. | 2408.00286 | null |
2024-08-01 | 3D U-KAN Implementation for Multi-modal MRI Brain Tumor Segmentation | Tianze Tang et.al. | 2408.00273 | null |
2024-08-01 | LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting | Zhenyu Bao et.al. | 2408.00254 | null |
2024-08-01 | Signature of point nodal superconductivity in the Dirac semimetal PdTe | C. S. Yadav et.al. | 2408.00216 | null |
2024-07-31 | S-SYNTH: Knowledge-Based, Synthetic Generation of Skin Images | Andrea Kim et.al. | 2408.00191 | link |
2024-07-31 | Adapting Skills to Novel Grasps: A Self-Supervised Approach | Georgios Papagiannis et.al. | 2408.00178 | null |
2024-07-31 | StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | Kaiyuan Tang et.al. | 2408.00150 | null |
2024-07-31 | Localized Gaussian Splatting Editing with Contextual Awareness | Hanyuan Xiao et.al. | 2408.00083 | null |
2024-07-31 | Chemical abundance gradients of organic molecules within a protostellar disk | Levi G. Walls et.al. | 2408.00070 | null |
2024-07-31 | Weak $\mathbb{Z}_2$ Supertopology | Kirill Parshukov et.al. | 2408.00042 | null |
2024-07-31 | Ge-based Clinopyroxene series: first principles and experimental local probe study | Ricardo P. Moreira et.al. | 2407.21749 | null |
2024-07-31 | A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation | Mothilal Asokan et.al. | 2407.21739 | null |
2024-07-31 | Tree-Cotree-Based Tearing and Interconnecting for 3D Magnetostatics: A Dual-Primal Approach | Mario Mally et.al. | 2407.21707 | link |
2024-07-31 | Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Zhenghao Zhang et.al. | 2407.21705 | link |
2024-07-31 | Expressive Whole-Body 3D Gaussian Avatar | Gyeongsik Moon et.al. | 2407.21686 | null |
2024-07-31 | Stable Sparse Operator Inference for Nonlinear Structural Dynamics | Pascal den Boef et.al. | 2407.21672 | null |
2024-07-31 | Adaptive Mix for Semi-Supervised Medical Image Segmentation | Zhiqiang Shen et.al. | 2407.21586 | null |
2024-07-31 | InScope: A New Real-world 3D Infrastructure-side Collaborative Perception Dataset for Open Traffic Scenarios | Xiaofei Zhang et.al. | 2407.21581 | null |
2024-07-31 | Voxel Scene Graph for Intracranial Hemorrhage | Antoine P. Sanner et.al. | 2407.21580 | link |
2024-08-11 | Monotonicity, bounds and extrapolation of Block-Gauss and Gauss-Radau quadrature for computing $B^T φ(A) B$ | Jörn Zimmerling et.al. | 2407.21505 | null |
2024-07-31 | Turbulent features near the X point of a DTT-like tokamak plasma | F. Cianfrani et.al. | 2407.21493 | null |
2024-07-31 | Theoretical study on the possibility of high $T_c$ s$\pm$ -wave superconductivity in the heavily hole-doped infinite layer nickelates | Hirofumi Sakakibara et.al. | 2407.21461 | null |
2024-07-31 | Floquet Engineering of Relativistic Electrons using Propagating Waves | Takashi Oka et.al. | 2407.21458 | null |
2024-08-02 | Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation | Sudhir Yarram et.al. | 2407.21450 | null |
2024-08-09 | Enriching thermal point clouds of buildings using semantic 3D building models | Jingwei Zhu et.al. | 2407.21436 | link |
2024-07-31 | Analyzing the impact of semantic LoD3 building models on image-based vehicle localization | Antonia Bieringer et.al. | 2407.21432 | link |
2024-07-31 | Deformable 3D Shape Diffusion Model | Dengsheng Chen et.al. | 2407.21428 | null |
2024-07-31 | 3D Variational Inference-Based Double-Difference Seismic Tomography Method and Application to the SAFOD Site, California | Hao Yang et.al. | 2407.21405 | null |
2024-07-31 | DD-rPPGNet: De-interfering and Descriptive Feature Learning for Unsupervised rPPG Estimation | Pei-Kai Huang et.al. | 2407.21402 | null |
2024-08-01 | A Testbed for Tidal Migration: the 3D Architecture of an Eccentric Hot Jupiter HD 118203 b Accompanied by a Possibly Aligned Outer Giant Planet | Jingwen Zhang et.al. | 2407.21377 | null |
2024-07-31 | ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images | Xilei Zhu et.al. | 2407.21363 | null |
2024-07-31 | MIST: A Simple and Scalable End-To-End 3D Medical Imaging Segmentation Framework | Adrian Celaya et.al. | 2407.21343 | link |
2024-07-31 | High-throughput 3D shape completion of potato tubers on a harvester | Pieter M. Blok et.al. | 2407.21341 | link |
2024-07-31 | Diffractive Waveguides | Yuntian Wang et.al. | 2407.21340 | null |
2024-07-31 | Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM | Can Wang et.al. | 2407.21333 | null |
2024-07-31 | CAMAv2: A Vision-Centric Approach for Static Map Element Annotation | Shiyuan Chen et.al. | 2407.21331 | link |
2024-08-07 | MetaOpenFOAM: an LLM-based multi-agent framework for CFD | Yuxuan Chen et.al. | 2407.21320 | link |
2024-07-31 | Automated Quantification of Hyperreflective Foci in SD-OCT With Diabetic Retinopathy | Idowu Paul Okuwobi et.al. | 2407.21272 | null |
2024-07-31 | DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations | Dongwon Son et.al. | 2407.21267 | null |
2024-07-31 | DESI Massive Post-Starburst Galaxies at $\mathbf{z\sim1.2}$ have compact structures and dense cores | Yunchong Zhang et.al. | 2407.21257 | null |
2024-07-30 | Hopf and Bautin bifurcations in a 3D model for pest leafhopper with stage structure and generalist predatory mite | Martha Alvarez Ramírez et.al. | 2407.21205 | null |
2024-07-30 | KPF Confirms a Polar Orbit for KELT-18 b | Ryan A. Rubenzahl et.al. | 2407.21196 | null |
2024-07-30 | PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds | Kerem Mertoğlu et.al. | 2407.21150 | null |
2024-07-30 | Radio-Frequency Spectroscopy and the Dimensional Crossover in Interacting Spin-Polarized Fermi Gases | Jeff Maki et.al. | 2407.21106 | null |
2024-07-30 | A Generative Modeling Approach to Reconstructing 21-cm Tomographic Data | Nashwan Sabti et.al. | 2407.21097 | link |
2024-07-30 | Design and Predict Tetragonal van der Waals Layered Quantum Materials of MPd $_5$I$_2$ (M=Ga, In and 3d Transition Metals) | Niraj K. Nepal et.al. | 2407.20938 | null |
2024-08-01 | Magnetic field diagnostics of prominences with the Mg II k line: 3D Stokes inversions versus traditional methods | Jiří Štěpán et.al. | 2407.20926 | null |
2024-07-30 | Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering | Yanpeng Zhao et.al. | 2407.20908 | link |
2024-07-30 | A Comparative Study of Neural Surface Reconstruction for Scientific Visualization | Siyuan Yao et.al. | 2407.20868 | null |
2024-07-30 | DeTurb: Atmospheric Turbulence Mitigation with Deformable 3D Convolutions and 3D Swin Transformers | Zhicheng Zou et.al. | 2407.20855 | null |
2024-07-30 | NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding | Hongjia Zhai et.al. | 2407.20853 | null |
2024-07-30 | Approximating electromagnetic fields in discontinuous media using a single physics-informed neural network | Michel Nohra et.al. | 2407.20833 | null |
2024-07-30 | WARM-3D: A Weakly-Supervised Sim2Real Domain Adaptation Framework for Roadside Monocular 3D Object Detection | Xingcheng Zhou et.al. | 2407.20818 | null |
2024-07-30 | AhmedML: High-Fidelity Computational Fluid Dynamics Dataset for Incompressible, Low-Speed Bluff Body Aerodynamics | Neil Ashton et.al. | 2407.20801 | null |
2024-07-31 | libEMMI_MGFD: A program of marine controlled-source electromagnetic modelling and inversion using frequency-domain multigrid solver | Pengliang Yang et.al. | 2407.20795 | null |
2024-07-30 | Enhancement of the vortex ratchet effect in superconductor open nanotubes and nanopetals | Rodrigo H. de Bragança et.al. | 2407.20780 | null |
2024-07-30 | OmniBal: Towards Fast Instruct-tuning for Vision-Language Models via Omniverse Computation Balance | Yongqiang Yao et.al. | 2407.20761 | link |
2024-07-30 | Neural Fields for Continuous Periodic Motion Estimation in 4D Cardiovascular Imaging | Simone Garzia et.al. | 2407.20728 | null |
2024-07-30 | SceneTeller: Language-to-3D Scene Generation | Başak Melis Öcal et.al. | 2407.20727 | null |
2024-07-30 | TactIcons: Designing 3D Printed Map Icons for People who are Blind or have Low Vision | Leona Holloway et.al. | 2407.20674 | null |
2024-07-31 | 3D-GRES: Generalized 3D Referring Expression Segmentation | Changli Wu et.al. | 2407.20664 | link |
2024-07-30 | Hedgehog topological defects in 3D amorphous solids | Arabinda Bera et.al. | 2407.20631 | null |
2024-07-30 | Electronic structure and resonant inelastic x-ray scattering in Ta2NiSe5 | D. A. Kukusta et.al. | 2407.20626 | null |
2024-07-30 | ATI-CTLO:Adaptive Temporal Interval-based Continuous-Time LiDAR-Only Odometry | Bo Zhou et.al. | 2407.20619 | null |
2024-07-30 | Coupling 3D geodynamics and dynamic earthquake rupture: fault geometry, rheology and stresses across timescales | Anthony Jourdon et.al. | 2407.20609 | null |
2024-07-31 | Monocular Human-Object Reconstruction in the Wild | Chaofan Huo et.al. | 2407.20566 | link |
2024-07-30 | Robust CNN Multi-Nested-LSTM Framework with Compound Loss for Patch-based Multi-Push Ultrasound Shear Wave Imaging and Segmentation | Md. Jahin Alam et.al. | 2407.20558 | null |
2024-07-30 | StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset | Chaofan Huo et.al. | 2407.20545 | link |
2024-07-30 | HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation | Wencan Cheng et.al. | 2407.20542 | link |
2024-07-30 | Linear-Quadratic GUP and Thermodynamic Dimensional Reduction | H. Ramezani et.al. | 2407.20497 | null |
2024-07-30 | Relaxed Equivariant Graph Neural Networks | Elyssa Hofgard et.al. | 2407.20471 | link |
2024-07-29 | A flexible framework for accurate LiDAR odometry, map manipulation, and localization | José Luis Blanco-Claraco et.al. | 2407.20465 | link |
2024-07-29 | Late Jets, Early Sparks: Illuminating the Pre-Maximum Bumps in Superluminous Supernovae | Ore Gottlieb et.al. | 2407.20348 | null |
2024-07-29 | Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception | Konstantinos Tzevelekakis et.al. | 2407.20336 | null |
2024-07-29 | X-ray nano-holotomography reconstruction with simultaneous probe retrieval | Viktor Nikitin et.al. | 2407.20304 | null |
2024-07-29 | Polarization Saturation in Multi-layered Interfacial Ferroelectrics | Wei Cao et.al. | 2407.20303 | null |
2024-07-29 | Improving 2D Feature Representations by 3D-Aware Fine-Tuning | Yuanwen Yue et.al. | 2407.20229 | null |
2024-07-29 | Global Structure-from-Motion Revisited | Linfei Pan et.al. | 2407.20219 | link |
2024-07-29 | Tetrahedral grids in Monte Carlo radiative transfer | Arno Lauwers et.al. | 2407.20216 | null |
2024-07-29 | Radiance Fields for Robotic Teleoperation | Maximum Wilder-Smith et.al. | 2407.20194 | link |
2024-07-29 | Homogenization of Non-homogeneous Incompressible Navier-Stokes System in Critically Perforated Domains | Jiaojiao Pan et.al. | 2407.20153 | null |
2024-07-29 | Extreme time extrapolation capabilities and thermodynamic consistency of physics-inspired Neural Networks for the 3D microstructure evolution of materials | Daniele Lanzoni et.al. | 2407.20126 | link |
2024-07-29 | Integrable and superintegrable quantum mechanical systems with position dependent masses invariant with respect to one parametric Lie groups. 2. Systems with dilatation and shift symmetries | A. G. Nikitin et.al. | 2407.20112 | null |
2024-07-29 | Visual Support for the Loop Grafting Workflow on Proteins | Filip Opálený et.al. | 2407.20054 | null |
2024-07-29 | Physically-based Path Tracer using WebGPU and OpenPBR | Simon Stucki et.al. | 2407.19977 | link |
2024-07-29 | From Flat to Spatial: Comparison of 4 methods constructing 3D, 2 and 1/2D Models from 2D Plans with neural networks | Jacob Sam et.al. | 2407.19970 | null |
2024-07-29 | Robust Conformal Volume Estimation in 3D Medical Images | Benjamin Lambert et.al. | 2407.19938 | link |
2024-07-29 | Aero-Nef: Neural Fields for Rapid Aircraft Aerodynamics Simulations | Giovanni Catalani et.al. | 2407.19916 | link |
2024-07-29 | Phase transitions in rolling of irregular cylinders and spheres | Daoyuan Qian et.al. | 2407.19861 | null |
2024-07-29 | RNACG: A Universal RNA Sequence Conditional Generation model based on Flow-Matching | Letian Gao et.al. | 2407.19838 | null |
2024-07-29 | VortSDF: 3D Modeling with Centroidal Voronoi Tesselation on Signed Distance Field | Diego Thomas et.al. | 2407.19837 | null |
2024-07-29 | TeleOR: Real-time Telemedicine System for Full-Scene Operating Room | Yixuan Wu et.al. | 2407.19763 | null |
2024-07-29 | Hölder continuous solutions to stochastic 3D Euler equations via stochastic convex integration | Lin Lü et.al. | 2407.19671 | null |
2024-07-29 | Take A Step Back: Rethinking the Two Stages in Visual Reasoning | Mingyu Zhang et.al. | 2407.19666 | null |
2024-07-29 | SALVE: A 3D Reconstruction Benchmark of Wounds from Consumer-grade Videos | Remi Chierchia et.al. | 2407.19652 | null |
2024-07-29 | Closing of the Mott gap near step edges in NiS2 | Yuuki Yasui et.al. | 2407.19636 | null |
2024-07-30 | Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture | ShahRukh Athar et.al. | 2407.19593 | null |
2024-07-28 | Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle | Zhenyu Tang et.al. | 2407.19548 | null |
2024-07-28 | FreeShell: A Context-Free 4D Printing Technique for Fabricating Complex 3D Triangle Mesh Shells | Chao Yuan et.al. | 2407.19533 | null |
2024-07-30 | Bistability in spatiotemporal mode-locking with dynamic multimode gain | Zhijin Xiong et.al. | 2407.19482 | null |
2024-07-28 | Crater-shaped Enrichment of $\mathrm{V}_\mathrm{Si}$ Color Centers in $4H$ -SiC using Single-Pulse Near-Infrared Femtosecond Laser Processing | Mengzhi Yan et.al. | 2407.19470 | null |
2024-07-31 | BEMTrace: Visualization-driven approach for deriving Building Energy Models from BIM | Andreas Walch et.al. | 2407.19464 | null |
2024-07-28 | HD-maps as Prior Information for Globally Consistent Mapping in GPS-denied Environments | Waqas Ali et.al. | 2407.19463 | null |
2024-08-08 | Perm: A Parametric Representation for Multi-Style 3D Hair Modeling | Chengan He et.al. | 2407.19451 | link |
2024-07-28 | FINER++: Building a Family of Variable-periodic Functions for Activating Implicit Neural Representation | Hao Zhu et.al. | 2407.19434 | null |
2024-07-28 | Three-dimensional solitons supported by the spin-orbit coupling and Rydberg-Rydberg interactions in PT-symmetric potentials | Yuan Zhao et.al. | 2407.19432 | null |
2024-07-28 | Near-Isotropic Sub-Ångstrom 3D Resolution Phase Contrast Imaging Achieved by End-to-End Ptychographic Electron Tomography | Shengboy You et.al. | 2407.19407 | null |
2024-07-28 | Innovative RIS Prototyping Enhancing Wireless Communication with Real-Time Spot Beam Tracking and OAM Wavefront Manipulation | Yufei Zhao et.al. | 2407.19379 | null |
2024-07-30 | WindsorML: High-Fidelity Computational Fluid Dynamics Dataset For Automotive Aerodynamics | Neil Ashton et.al. | 2407.19320 | null |
2024-07-27 | A Bayesian Approach Toward Robust Multidimensional Ellipsoid-Specific Fitting | Zhao Mingyang et.al. | 2407.19269 | link |
2024-07-27 | Magic3DSketch: Create Colorful 3D Models From Sketch-Based 3D Modeling Guided by Text and Language-Image Pre-Training | Ying Zang et.al. | 2407.19225 | null |
2024-07-27 | WindPoly: Polygonal Mesh Reconstruction via Winding Numbers | Xin He et.al. | 2407.19208 | null |
2024-07-27 | Guiding Wireless Signals with Arrays of Metallic Linear Fresnel Reflectors: A Low-cost, Frequency-versatile, and Practical Approach | Hieu Le et.al. | 2407.19179 | null |
2024-07-27 | Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble | Juhan Cha et.al. | 2407.19156 | link |
2024-07-27 | RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry | Shengjie Zhu et.al. | 2407.19154 | null |
2024-07-26 | A Novel Gaussian filter-based Pressure Correction Technique with Super Compact Scheme for Unsteady 3D Incompressible, Viscous Flows | Ashwani Punia et.al. | 2407.19116 | null |
2024-07-26 | ObjectCarver: Semi-automatic segmentation, reconstruction and separation of 3D objects | Gemmechu Hassena et.al. | 2407.19108 | null |
2024-07-26 | A New Higher-Order Super Compact Finite Difference Scheme to Study Three-Dimensional Non-Newtonian Flows | Ashwani Punia et.al. | 2407.19100 | null |
2024-07-26 | Flexible graph convolutional network for 3D human pose estimation | Abu Taib Mohammed Shahjahan et.al. | 2407.19077 | link |
2024-07-26 | ScalingGaussian: Enhancing 3D Content Creation with Generative Gaussian Splatting | Shen Chen et.al. | 2407.19035 | null |
2024-07-26 | Floating No More: Object-Ground Reconstruction from a Single Image | Yunze Man et.al. | 2407.18914 | null |
2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907 | null |
2024-07-26 | Generative Adversarial Networks for Imputing Sparse Learning Performance | Liang Zhang et.al. | 2407.18875 | null |
2024-07-26 | Learning a Shape-Conditioned Agent for Purely Tactile In-Hand Manipulation of Various Objects | Johannes Pitz et.al. | 2407.18834 | null |
2024-07-26 | Superfluid Spin-up: 3D Simulations of Postglitch Dynamics in Neutron Stars Cores | J. R. Fuentes et.al. | 2407.18810 | null |
2024-07-26 | Vector Magnetometry Using Shallow Implanted NV Centers in Diamond with Waveguide-Assisted Dipole Excitation and Readout | Sajedeh Shahbazi et.al. | 2407.18711 | null |
2024-07-26 | 3D Orbital Angular Momentum Nonlinear Holography | Feiyang Shen et.al. | 2407.18696 | null |
2024-07-26 | Towards unveiling the large-scale nature of gravity with the wavelet scattering transform | Georgios Valogiannis et.al. | 2407.18647 | null |
2024-07-29 | GERry: A Code to Optimise the Hunt for the Electromagnetic Counter-parts to Gravitational Wave Events | David O’Neill et.al. | 2407.18642 | null |
2024-07-26 | IOVS4NeRF:Incremental Optimal View Selection for Large-Scale NeRFs | Jingpeng Xie et.al. | 2407.18611 | null |
2024-07-26 | From 2D to 3D: AISG-SLA Visual Localization Challenge | Jialin Gao et.al. | 2407.18590 | null |
2024-07-26 | PANDORA: The Open-Source, Structurally Elastic Humanoid Robot | Connor W. Herron et.al. | 2407.18558 | null |
2024-07-26 | How To Segment in 3D Using 2D Models: Automated 3D Segmentation of Prostate Cancer Metastatic Lesions on PET Volumes Using Multi-Angle Maximum Intensity Projections and Diffusion Models | Amirhosein Toosi et.al. | 2407.18555 | link |
2024-07-26 | ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments | Taewoong Kim et.al. | 2407.18550 | link |
2024-08-05 | Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers | Longkun Zou et.al. | 2407.18534 | link |
2024-07-26 | A variational front-tracking method for multiphase flow with triple junctions | Harald Garcke et.al. | 2407.18529 | null |
2024-07-26 | Answerability Fields: Answerable Location Estimation via Diffusion Models | Daichi Azuma et.al. | 2407.18497 | null |
2024-07-26 | Fast and Parallelizable Logical Computation with Homological Product Codes | Qian Xu et.al. | 2407.18490 | null |
2024-07-29 | A Reference-Based 3D Semantic-Aware Framework for Accurate Local Facial Attribute Editing | Yu-Kai Huang et.al. | 2407.18392 | null |
2024-07-25 | VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads | Orest Kupyn et.al. | 2407.18245 | link |
2024-07-25 | RefMask3D: Language-Guided Transformer for 3D Referring Segmentation | Shuting He et.al. | 2407.18244 | link |
2024-07-25 | LION: Linear Group RNN for 3D Object Detection in Point Clouds | Zhe Liu et.al. | 2407.18232 | link |
2024-07-26 | Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation | Razieh Azizi et.al. | 2407.18195 | null |
2024-07-30 | Evolution of reconnection flux during eruption of magnetic flux ropes | Samriddhi Sankar Maity et.al. | 2407.18188 | null |
2024-07-31 | Experimental and Numerical Study of Microcavity Filling Regimes for Lab-on-a-Chip Applications | Luise Nagel et.al. | 2407.18068 | null |
2024-07-25 | Signatures of Low Mass Black Hole-Neutron Star Mergers | Rahime Matur et.al. | 2407.18045 | null |
2024-07-25 | AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild | Junho Park et.al. | 2407.18034 | link |
2024-07-25 | The operationally ready full three-dimensional magnetohydrodynamic (3D MHD) model from the Sun to Earth: COCONUT+Icarus | Tinatin Baratashvili et.al. | 2407.17903 | null |
2024-07-25 | 3D Hole Filling using Deep Learning Inpainting | Marina Hernández-Bautista et.al. | 2407.17896 | null |
2024-07-25 | Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey | Shahab Saquib Sohail et.al. | 2407.17877 | null |
2024-07-25 | 3D Adaptive VEM with stabilization-free a posteriori error bounds | Stefano Berrone et.al. | 2407.17858 | null |
2024-07-25 | 3D-Ising-type Magnetic Interactions Stabilized by the Extremely Large Uniaxial Magnetocrystalline Anisotropy in Layered Ferromagnetic Cr $_2$Te$_3$ | Shubham Purwar et.al. | 2407.17845 | null |
2024-07-25 | UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation | Jian Wang et.al. | 2407.17838 | null |
2024-07-25 | DAC: 2D-3D Retrieval with Noisy Labels via Divide-and-Conquer Alignment and Correction | Chaofan Gan et.al. | 2407.17779 | link |
2024-07-25 | KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models | Eunice Yiu et.al. | 2407.17773 | link |
2024-08-04 | Universal clusters in quasi-two-dimensional ultracold Fermi mixtures | Ruijin Liu et.al. | 2407.17702 | null |
2024-07-24 | Synthetic High-resolution Cryo-EM Density Maps with Generative Adversarial Networks | Chenwei Zhang et.al. | 2407.17674 | link |
2024-07-24 | Climate Transition to Temperate Nightside at High Atmosphere Mass | Evelyn Macdonald et.al. | 2407.17600 | null |
2024-08-06 | CityX: Controllable Procedural Content Generation for Unbounded 3D Cities | Shougao Zhang et.al. | 2407.17572 | null |
2024-07-24 | SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency | Yiming Xie et.al. | 2407.17470 | null |
2024-07-24 | Existence and non-uniqueness of weak solutions with continuous energy to the 3D deterministic and stochastic Navier-Stokes equations | Alexey Cheskidov et.al. | 2407.17463 | null |
2024-07-24 | A soft-hard framework with exact four momentum conservation for small systems | I. Soudi et.al. | 2407.17443 | null |
2024-07-24 | Long-time behavior to the 3D isentropic compressible Navier-Stokes equations | Guochun Wu et.al. | 2407.17439 | null |
2024-07-28 | HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation | Zhenzhi Wang et.al. | 2407.17438 | link |
2024-07-24 | 3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities | Yanqi Bao et.al. | 2407.17418 | link |
2024-07-24 | 3D Question Answering for City Scene Understanding | Penglei Sun et.al. | 2407.17398 | null |
2024-07-24 | 2D and 3D Deep Learning Models for MRI-based Parkinson’s Disease Classification: A Comparative Analysis of Convolutional Kolmogorov-Arnold Networks, Convolutional Neural Networks, and Graph Convolutional Networks | Salil B Patel et.al. | 2407.17380 | null |
2024-07-24 | Accurate Inverse Process Optimization Framework in Laser Directed Energy Deposition | Xiao Shang et.al. | 2407.17338 | null |
2024-07-25 | Enhanced Deep Learning Methodologies and MRI Selection Techniques for Dementia Diagnosis in the Elderly Population | Nikolaos Ntampakis et.al. | 2407.17324 | null |
2024-07-24 | Asymmetries in asymptotic giant branch stars and their winds. I. From 3D RHD models to synthetic observables | Joachim Wiegert et.al. | 2407.17317 | link |
2024-07-25 | LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume Rendering | Simon Boeder et.al. | 2407.17310 | null |
2024-07-24 | The impact of differences in facial features between real speakers and 3D face models on synthesized lip motions | Rabab Algadhy et.al. | 2407.17253 | null |
2024-07-24 | Near-Field Integrated Sensing and Communication with Extremely Large-Scale Antenna Array | Haocheng Hua et.al. | 2407.17237 | null |
2024-07-24 | Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D Medical Image Classification? | Johannes Kiechle et.al. | 2407.17219 | link |
2024-07-24 | The Sketchfab 3D Creative Commons Collection (S3D3C) | Florian Spiess et.al. | 2407.17205 | null |
2024-07-24 | ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using 2D Labels Only | Saad Lahlali et.al. | 2407.17197 | link |
2024-07-24 | DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting | Linus Härenstam-Nielsen et.al. | 2407.17058 | link |
2024-07-30 | DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction | Xiaobiao Du et.al. | 2407.16988 | link |
2024-07-24 | Understanding the Ising zigzag antiferromagnetism of FePS3 and FePSe3 monolayers | Ke Yang et.al. | 2407.16978 | null |
2024-07-24 | 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution | Congrui Fu et.al. | 2407.16965 | link |
2024-07-24 | DVPE: Divided View Position Embedding for Multi-View 3D Object Detection | Jiasen Wang et.al. | 2407.16955 | link |
2024-07-24 | EUFormer: Learning Driven 3D Spine Deformity Assessment with Orthogonal Optical Images | Nan Meng et.al. | 2407.16942 | null |
2024-07-23 | Vision-Based Adaptive Robotics for Autonomous Surface Crack Repair | Joshua Genova et.al. | 2407.16874 | null |
2024-07-23 | SE3ET: SE(3)-Equivariant Transformer for Low-Overlap Point Cloud Registration | Chien Erh Lin et.al. | 2407.16823 | link |
2024-07-23 | Chern-Simons-Like Formulation of Exotic Massive 3D Gravity Models | Büşra Dedeoğlu et.al. | 2407.16799 | null |
2024-07-25 | What Matters in Range View 3D Object Detection | Benjamin Wilson et.al. | 2407.16789 | link |
2024-07-23 | Occlusion-Aware 3D Motion Interpretation for Abnormal Behavior Detection | Su Li et.al. | 2407.16788 | null |
2024-07-23 | She’s Got Her Mother’s Hair: End-to-End Collapsar Simulations Unveil the Origin of Black Holes’ Magnetic Field | Ore Gottlieb et.al. | 2407.16745 | null |
2024-07-23 | Simulation of ultracold Bose gases with the complex Langevin method | Philipp Heinen et.al. | 2407.16730 | null |
2024-07-23 | Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Fabio Tosi et.al. | 2407.16698 | link |
2024-07-23 | AbdomenAtlas: A Large-Scale, Detailed-Annotated, & Multi-Center Dataset for Efficient Transfer Learning and Open Algorithmic Benchmarking | Wenxuan Li et.al. | 2407.16697 | link |
2024-07-23 | Unveiling galaxy pair alignment in cosmic filaments: A 3D exploration using EAGLE simulation | Suman Sarkar et.al. | 2407.16675 | null |
2024-07-23 | Fluorescence Diffraction Tomography using Explicit Neural Fields | Renzhi He et.al. | 2407.16657 | null |
2024-07-24 | Aggregated Attributions for Explanatory Analysis of 3D Segmentation Models | Maciej Chrabaszcz et.al. | 2407.16653 | link |
2024-07-23 | Nonlinear screening in periodically doped Graphene | K. A. Baryshnikov et.al. | 2407.16579 | null |
2024-07-23 | Timeliness-Fidelity Tradeoff in 3D Scene Representations | Xiangmin Xu et.al. | 2407.16575 | null |
2024-07-23 | Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis? | Habib Hajimolahoseini et.al. | 2407.16514 | null |
2024-07-23 | DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models | Zhenyu Xie et.al. | 2407.16511 | null |
2024-07-23 | HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images | Shreyas Singh et.al. | 2407.16503 | link |
2024-07-23 | MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Youngmin Oh et.al. | 2407.16448 | link |
2024-07-23 | Cross Anything: General Quadruped Robot Navigation through Complex Terrains | Shaoting Zhu et.al. | 2407.16412 | null |
2024-07-23 | On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models | Deniz Daum et.al. | 2407.16405 | link |
2024-07-23 | Learning Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors | Wenyuan Zhang et.al. | 2407.16396 | null |
2024-07-23 | DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors | Zizheng Yan et.al. | 2407.16260 | null |
2024-07-23 | LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera | Yukai Ma et.al. | 2407.16197 | null |
2024-07-23 | CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation | Hajin Shim et.al. | 2407.16193 | null |
2024-07-23 | Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance | Jiyeop Kim et.al. | 2407.16173 | null |
2024-07-23 | Advanced AI Framework for Enhanced Detection and Assessment of Abdominal Trauma: Integrating 3D Segmentation with 2D CNN and RNN Models | Liheng Jiang et.al. | 2407.16165 | null |
2024-07-23 | 3D-UGCN: A Unified Graph Convolutional Network for Robust 3D Human Pose Estimation from Monocular RGB Images | Jie Zhao et.al. | 2407.16137 | null |
2024-07-23 | Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision | Aditya Krishnan et.al. | 2407.16102 | null |
2024-07-27 | On Flange-based 3D Hand-Eye Calibration for Soft Robotic Tactile Welding | Xudong Han et.al. | 2407.16041 | null |
2024-07-22 | Miura operators as R-matrices from M-brane intersections | Nathan Haouzi et.al. | 2407.15990 | null |
2024-07-22 | Carving Polytopes with Saws in 3D | Eliot W. Robson et.al. | 2407.15981 | null |
2024-07-22 | Ising BCFT from Fuzzy Hemisphere | Mykola Dedushenko et.al. | 2407.15948 | null |
2024-07-22 | The Effect of Donor Star Rejuvenation on Common Envelope Evolution | C. Landri et.al. | 2407.15932 | null |
2024-07-24 | Studying the 3d Ising surface CFTs on the fuzzy sphere | Zheng Zhou et.al. | 2407.15914 | null |
2024-07-22 | Local existence of classical solutions to the 3D isentropic compressible Navier-Stokes-Poisson equations with degenerate viscosities and vacuum | Peng Lu et.al. | 2407.15897 | null |
2024-07-21 | A Novel Method to Improve Quality Surface Coverage in Multi-View Capture | Wei-Lun Huang et.al. | 2407.15883 | null |
2024-07-19 | Gaussian Process Model with Tensorial Inputs and Its Application to the Design of 3D Printed Antennas | Xi Chen et.al. | 2407.15877 | null |
2024-07-22 | HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning | Eugene Valassakis et.al. | 2407.15844 | null |
2024-07-22 | Unsupervised Mastoidectomy for Cochlear CT Mesh Reconstruction Using Highly Noisy Data | Yike Zhang et.al. | 2407.15787 | null |
2024-07-22 | Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection | Kangqi Ma et.al. | 2407.15771 | null |
2024-07-23 | SAM2CLIP2SAM: Vision Language Model for Segmentation of 3D CT Scans for Covid-19 Detection | Dimitrios Kollias et.al. | 2407.15728 | null |
2024-07-22 | Differentiable Convex Polyhedra Optimization from Multi-view Images | Daxuan Ren et.al. | 2407.15686 | link |
2024-07-22 | TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly | Mengqi Guo et.al. | 2407.15648 | link |
2024-07-22 | Decomposition of Neural Discrete Representations for Large-Scale 3D Mapping | Minseong Park et.al. | 2407.15554 | link |
2024-07-22 | Shell mergers in the late stages of massive star evolution: new insight from 3D hydrodynamic simulations | Federico Rizzuti et.al. | 2407.15544 | null |
2024-07-23 | Differentiable Product Quantization for Memory Efficient Camera Relocalization | Zakaria Laskar et.al. | 2407.15540 | link |
2024-07-22 | 6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model | Matteo Bortolon et.al. | 2407.15484 | null |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-22 | Spatial-Division Augmented Occupancy Field for Bone Shape Reconstruction from Biplanar X-Rays | Jixiang Chen et.al. | 2407.15433 | link |
2024-07-22 | Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models | Kent Fujiwara et.al. | 2407.15408 | null |
2024-07-22 | Optical alignment of contamination-sensitive Far-Ultraviolet spectrographs for Aspera SmallSat mission | Aafaque R. Khan et.al. | 2407.15391 | null |
2024-07-22 | Structure-Aware Residual-Center Representation for Self-Supervised Open-Set 3D Cross-Modal Retrieval | Yang Xu et.al. | 2407.15376 | null |
2024-07-26 | avaTTAR: Table Tennis Stroke Training with On-body and Detached Visualization in Augmented Reality | Dizhi Ma et.al. | 2407.15373 | null |
2024-07-22 | X-Recon: Learning-based Patient-specific High-Resolution CT Reconstruction from Orthogonal X-Ray Images | Yunpeng Wang et.al. | 2407.15356 | link |
2024-07-22 | Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection | Zhili Chen et.al. | 2407.15354 | link |
2024-07-22 | WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding | Quan Kong et.al. | 2407.15350 | null |
2024-07-22 | ThermalNeRF: Thermal Radiance Fields | Yvette Y. Lin et.al. | 2407.15337 | null |
2024-07-22 | Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection | Yiran Yang et.al. | 2407.15334 | link |
2024-07-21 | 3D Reconstruction of the Human Colon from Capsule Endoscope Video | Pål Anders Floor et.al. | 2407.15228 | null |
2024-07-21 | Secure Web Objects: Building Blocks for Metaverse Interoperability and Decentralization | Tianyuan Yu et.al. | 2407.15221 | null |
2024-07-21 | Unraveling Picophotonics of Crystalline Materials | Sathwik Bharadwaj et.al. | 2407.15189 | null |
2024-07-21 | HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions | Haiyang Zhou et.al. | 2407.15187 | null |
2024-07-21 | Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu et.al. | 2407.15171 | link |
2024-07-21 | Navigation Instruction Generation with BEV Perception and Large Language Models | Sheng Fan et.al. | 2407.15087 | link |
2024-07-21 | 3D Gaussian Parametric Head Model | Yuelang Xu et.al. | 2407.15070 | null |
2024-07-21 | VoxDepth: Rectification of Depth Images on Edge Devices | Yashashwee Chakrabarty et.al. | 2407.15067 | null |
2024-07-20 | Non-Reference Quality Assessment for Medical Imaging: Application to Synthetic Brain MRIs | Karl Van Eeden Risager et.al. | 2407.14994 | null |
2024-07-20 | RGB2Point: 3D Point Cloud Generation from Single RGB Images | Jae Joong Lee et.al. | 2407.14979 | null |
2024-07-20 | Temporal Residual Jacobians For Rig-free Motion Transfer | Sanjeev Muralikrishnan et.al. | 2407.14958 | null |
2024-07-27 | RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies | Xiaomeng Chu et.al. | 2407.14923 | null |
2024-07-20 | Automated Patient Positioning with Learned 3D Hand Gestures | Zhongpai Gao et.al. | 2407.14903 | null |
2024-07-20 | Probing 3D magnetic fields using starlight polarization and grain alignment theory | Bao Truong et.al. | 2407.14896 | null |
2024-07-20 | Latent Pollution Model: The Hidden Carbon Footprint in 3D Image Synthesis | Marvin Seyfarth et.al. | 2407.14892 | null |
2024-07-20 | An asymptotically consistent morphoelastic shell model for compressible biological structures with finite-strain deformations | Xiang Yu et.al. | 2407.14881 | null |
2024-07-20 | Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting | Tianle Zeng et.al. | 2407.14846 | null |
2024-07-20 | SpatialTouch: Exploring Spatial Data Visualizations in Cross-reality | Lixiang Zhao et.al. | 2407.14833 | link |
2024-07-20 | 3D-printed axicon enables extended depth-of-focus intravascular optical coherence tomography | Pavel Ruchka et.al. | 2407.14825 | null |
2024-07-19 | Charge Density Waves in the 2.5-Dimensional Quantum Heterostructure | F. Z. Yang et.al. | 2407.14661 | null |
2024-07-26 | Improving Representation of High-frequency Components for Medical Foundation Models | Yuetan Chu et.al. | 2407.14651 | link |
2024-07-19 | ELEQTRONeX: A GPU-Accelerated Exascale Framework for Non-Equilibrium Quantum Transport in Nanomaterials | Saurabh Sawant et.al. | 2407.14633 | null |
2024-07-19 | Dynamical Transition of Quantum Vortex-Pair Annihilation in a Bose-Einstein Condensate | Toshiaki Kanai et.al. | 2407.14627 | null |
2024-07-19 | Deep Learning-based 3D Coronary Tree Reconstruction from Two 2D Non-simultaneous X-ray Angiography Projections | Yiying Wang et.al. | 2407.14616 | link |
2024-07-19 | ESCAPE: Energy-based Selective Adaptive Correction for Out-of-distribution 3D Human Pose Estimation | Luke Bidulka et.al. | 2407.14605 | null |
2024-07-19 | Unveiling the Milky Way dust extinction curve in 3D | Xiangyu Zhang et.al. | 2407.14594 | null |
2024-07-19 | Echo Location: Distances to Galactic Supernovae From ASAS-SN Light Echoes and 3D Dust Maps | Kyle D. Neumann et.al. | 2407.14584 | null |
2024-07-18 | A Novel Skiagraphic Method of Casting Shade of a Torus | Tanvir Morshed et.al. | 2407.14557 | null |
2024-07-19 | PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding | Chenshu Hou et.al. | 2407.14491 | null |
2024-07-19 | MLMT-CNN for Object Detection and Segmentation in Multi-layer and Multi-spectral Images | Majedaldein Almahasneh et.al. | 2407.14473 | null |
2024-07-19 | AttentNet: Fully Convolutional 3D Attention for Lung Nodule Detection | Majedaldein Almahasneh et.al. | 2407.14464 | null |
2024-07-19 | HOTS3D: Hyper-Spherical Optimal Transport for Semantic Alignment of Text-to-3D Generation | Zezeng Li et.al. | 2407.14419 | null |
2024-07-19 | Search for Very-Short-Baseline Oscillations of Reactor Antineutrinos with the SoLid Detector | Y. Abreu et.al. | 2407.14382 | null |
2024-07-19 | OpenSU3D: Open World 3D Scene Understanding using Foundation Models | Rafay Mohiuddin et.al. | 2407.14279 | null |
2024-07-22 | Patch-based Intuitive Multimodal Prototypes Network (PIMPNet) for Alzheimer’s Disease classification | Lisa Anita De Santi et.al. | 2407.14277 | link |
2024-07-19 | SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization | Mae Younes et.al. | 2407.14257 | null |
2024-07-19 | Fate of transient order parameter domain walls in ultrafast experiments | Lingxian Kong et.al. | 2407.14250 | null |
2024-07-19 | Double-Shot 3D Shape Measurement with a Dual-Branch Network | Mingyang Lei et.al. | 2407.14198 | null |
2024-07-19 | On the tomographic cluster clustering as a cosmological probe | Massimiliano Romanello et.al. | 2407.14144 | null |
2024-07-19 | I Know About “Up”! Enhancing Spatial Reasoning in Visual Language Models Through 3D Reconstruction | Zaiqiao Meng et.al. | 2407.14133 | null |
2024-07-19 | Seismic Fault SAM: Adapting SAM with Lightweight Modules and 2.5D Strategy for Fault Detection | Ran Chen et.al. | 2407.14121 | null |
2024-07-19 | GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation | Florian Chabot et.al. | 2407.14108 | null |
2024-07-19 | Signatures of Massive Black Hole Merger Host Galaxies from Cosmological Simulations II: Unique Stellar Kinematics in Integral Field Unit Spectroscopy | Jaeden Bardati et.al. | 2407.14061 | null |
2024-07-19 | PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training | Suyi Chen et.al. | 2407.14054 | link |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Kinematics-based 3D Human-Object Interaction Reconstruction from Single View | Yuhang Chen et.al. | 2407.14043 | null |
2024-07-19 | Segmentation of Brain Metastases in MRI: A Two-Stage Deep Learning Approach with Modality Impact Study | Yousef Sadegheih et.al. | 2407.14011 | link |
2024-07-19 | Scale Disparity of Instances in Interactive Point Cloud Segmentation | Chenrui Han et.al. | 2407.14009 | null |
2024-07-19 | Multi-modal Relation Distillation for Unified 3D Representation Learning | Huiqun Wang et.al. | 2407.14007 | null |
2024-07-19 | Semantic Communications for 3D Human Face Transmission with Neural Radiance Fields | Guanlin Wu et.al. | 2407.13992 | null |
2024-07-19 | PlacidDreamer: Advancing Harmony in Text-to-3D Generation | Shuo Huang et.al. | 2407.13976 | link |
2024-07-18 | Boosting Online 3D Multi-Object Tracking through Camera-Radar Cross Check | Sheng-Yao Kuan et.al. | 2407.13937 | null |
2024-07-18 | RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark | Yuan-Hao Ho et.al. | 2407.13930 | null |
2024-07-18 | Simultaneous Localization and Affordance Prediction for Tasks in Egocentric Video | Zachary Chavis et.al. | 2407.13856 | null |
2024-07-25 | Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance | Toan Nguyen et.al. | 2407.13842 | null |
2024-07-18 | Shape of Motion: 4D Reconstruction from a Single Video | Qianqian Wang et.al. | 2407.13764 | null |
2024-07-18 | SegPoint: Segment Any Point Cloud via Large Language Model | Shuting He et.al. | 2407.13761 | null |
2024-07-25 | Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion | Boyang Deng et.al. | 2407.13759 | null |
2024-07-18 | General Geometry-aware Weakly Supervised 3D Object Detection | Guowen Zhang et.al. | 2407.13748 | link |
2024-07-18 | MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References | Lukas Bösiger et.al. | 2407.13745 | link |
2024-07-18 | Imaging the jet of MWC 349A with resolved Radio Recombination Line emission from ALMA | Antonio Martínez-Henares et.al. | 2407.13681 | null |
2024-07-18 | PASTA: Controllable Part-Aware Shape Generation with Autoregressive Transformers | Songlin Li et.al. | 2407.13677 | link |
2024-07-25 | MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis | Ziming Zhong et.al. | 2407.13675 | link |
2024-07-18 | Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models | Xiaoyu Zhu et.al. | 2407.13642 | null |
2024-07-18 | Droplet impact and splitting behaviour on superhydrophobic wedges | Gudlavalleti V V S Vara Prasad et.al. | 2407.13635 | null |
2024-07-20 | Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation | Zongrui Li et.al. | 2407.13584 | link |
2024-07-18 | 3d Carrollian Chern-Simons theory and 2d Yang-Mills | Arjun Bagchi et.al. | 2407.13574 | null |
2024-07-18 | DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays | Xuhui Liu et.al. | 2407.13545 | null |
2024-07-19 | GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation | Bangyan Liao et.al. | 2407.13537 | link |
2024-07-18 | Pushing the Limits of Reactive Planning: Learning to Escape Local Minima | Isar Meijer et.al. | 2407.13530 | null |
2024-07-18 | EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting | Yuchen Weng et.al. | 2407.13520 | null |
2024-07-18 | WiNet: Wavelet-based Incremental Learning for Efficient Medical Image Registration | Xinxing Cheng et.al. | 2407.13426 | link |
2024-07-18 | Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation | Pengfei Wang et.al. | 2407.13362 | null |
2024-07-18 | Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds | Shengtao Li et.al. | 2407.13342 | null |
2024-07-18 | Long-Term 3D Point Tracking By Cost Volume Fusion | Hung Nguyen et.al. | 2407.13337 | null |
2024-07-18 | A Dataset and Benchmark for Shape Completion of Fruits for Agricultural Robotics | Federico Magistri et.al. | 2407.13304 | link |
2024-07-18 | Patient-specific coronary angioplasty simulations – a mixed-dimensional finite element modeling approach | Janina C. Datz et.al. | 2407.13276 | null |
2024-07-18 | STS MICCAI 2023 Challenge: Grand challenge on 2D and 3D semi-supervised tooth segmentation | Yaqi Wang et.al. | 2407.13246 | null |
2024-07-18 | NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations | Hao Bai et.al. | 2407.13241 | link |
2024-07-18 | Adapt PointFormer: 3D Point Cloud Analysis via Adapting 2D Visual Transformers | Mengke Li et.al. | 2407.13200 | null |
2024-07-18 | Superconformal Indices of 3d $\mathcal{N}=2$ SCFTs and Holography | Nikolay Bobev et.al. | 2407.13177 | null |
2024-07-21 | Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement | Yulin He et.al. | 2407.13155 | null |
2024-07-18 | OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation | Jian Sun et.al. | 2407.13137 | null |
2024-07-20 | Modeling and Driving Human Body Soundfields through Acoustic Primitives | Chao Huang et.al. | 2407.13083 | null |
2024-07-17 | Planning and Perception for Unmanned Aerial Vehicles in Object and Environmental Monitoring | Harnaik Dhami et.al. | 2407.13003 | null |
2024-07-17 | $p$ -Chords, Wee-Chords, and de Sitter Space | Adel A. Rahman et.al. | 2407.12988 | null |
2024-07-17 | Transition to turbulence in the wide-gap spherical Couette system | Ankit Barik et.al. | 2407.12981 | null |
2024-07-17 | Edge Projection-Based Adaptive View Selection for Cone-Beam CT | Jingsong Lin et.al. | 2407.12963 | null |
2024-07-17 | Denoising Diffusions in Latent Space for Medical Image Segmentation | Fahim Ahmed Zaman et.al. | 2407.12952 | link |
2024-07-19 | GenRC: Generative 3D Room Completion from Sparse Image Collections | Ming-Feng Li et.al. | 2407.12939 | link |
2024-07-17 | Isolated steady solutions of the 3D Euler equations | Alberto Enciso et.al. | 2407.12938 | null |
2024-07-20 | VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control | Sherwin Bahmani et.al. | 2407.12781 | null |
2024-07-17 | Generalizable Human Gaussians for Sparse View Synthesis | Youngjoong Kwon et.al. | 2407.12777 | link |
2024-07-17 | GroundUp: Rapid Sketch-Based 3D City Massing | Gizem Esra Unlu et.al. | 2407.12739 | null |
2024-07-17 | NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Zhongqun Zhang et.al. | 2407.12727 | null |
2024-07-18 | TransCAD: A Hierarchical Transformer for CAD Sequence Inference from Point Clouds | Elona Dupont et.al. | 2407.12702 | null |
2024-07-17 | 4Dynamic: Text-to-4D Generation with Hybrid Priors | Yu-Jie Yuan et.al. | 2407.12684 | null |
2024-07-17 | In-Situ Infrared Camera Monitoring for Defect and Anomaly Detection in Laser Powder Bed Fusion: Calibration, Data Mapping, and Feature Extraction | Shawn Hinnebusch et.al. | 2407.12682 | null |
2024-07-17 | SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization | Yiyang Chen et.al. | 2407.12667 | link |
2024-07-17 | InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction | Xulong Wang et.al. | 2407.12661 | link |
2024-07-17 | FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification | Yiqing Shen et.al. | 2407.12658 | link |
2024-07-17 | Subequivariant Reinforcement Learning in 3D Multi-Entity Physical Environments | Runfa Chen et.al. | 2407.12505 | null |
2024-07-17 | EmoFace: Audio-driven Emotional 3D Face Animation | Chang Liu et.al. | 2407.12501 | link |
2024-07-17 | Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation | Kaixin Bai et.al. | 2407.12449 | null |
2024-07-17 | F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions | Jie Yang et.al. | 2407.12435 | null |
2024-07-18 | Direct Nanopatterning of Complex 3D Surfaces and Self-Aligned Superlattices via Molecular-Beam Holographic Lithography | Shuangshuang Zeng et.al. | 2407.12420 | null |
2024-07-17 | Efficient Depth-Guided Urban View Synthesis | Sheng Miao et.al. | 2407.12395 | null |
2024-07-17 | HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation | Tianpei Zou et.al. | 2407.12387 | link |
2024-07-17 | HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects | Xintao Lv et.al. | 2407.12371 | null |
2024-07-17 | Generalisation of the Navier-slip boundary condition to arbitrary directions: Application to 3D oblique geodynamic simulations | Anthony Jourdon et.al. | 2407.12361 | null |
2024-07-17 | Label-Efficient 3D Brain Segmentation via Complementary 2D Diffusion Models with Orthogonal Views | Jihoon Cho et.al. | 2407.12329 | null |
2024-07-17 | Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model | Tao Wang et.al. | 2407.12319 | null |
2024-07-17 | Weakly-Supervised 3D Hand Reconstruction with Knowledge Prior and Uncertainty Guidance | Yufei Zhang et.al. | 2407.12307 | null |
2024-07-17 | Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections | Congrong Xu et.al. | 2407.12306 | null |
2024-07-17 | VEON: Vocabulary-Enhanced Occupancy Prediction | Jilai Zheng et.al. | 2407.12294 | null |
2024-07-17 | JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation | Chenhan Jiang et.al. | 2407.12291 | null |
2024-07-17 | Generating 3D House Wireframes with Semantics | Xueqi Ma et.al. | 2407.12267 | link |
2024-07-16 | Monocular pose estimation of articulated surgical instruments in open surgery | Robert Spektor et.al. | 2407.12138 | null |
2024-07-16 | Impact of spatially varying transport coefficients in EMC3-Eirene simulations of W7-X and assessment of drifts | David Bold et.al. | 2407.12072 | null |
2024-07-16 | UrbanWorld: An Urban World Model for 3D City Generation | Yu Shang et.al. | 2407.11965 | link |
2024-07-18 | Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling | Jaehyeok Kim et.al. | 2407.11962 | null |
2024-07-16 | OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces | Zehan Wang et.al. | 2407.11895 | null |
2024-07-16 | DepGAN: Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition | Amr Ghoneim et.al. | 2407.11890 | null |
2024-07-16 | MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification | Zhuoxiao Li et.al. | 2407.11840 | null |
2024-07-17 | Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors | Julien Hauret et.al. | 2407.11828 | link |
2024-07-16 | Speckle-based 3D sub-diffraction imaging through a multimode fiber | Zhouping Lyu et.al. | 2407.11796 | null |
2024-07-16 | Click-Gaussian: Interactive Segmentation to Any 3D Gaussians | Seokhun Choi et.al. | 2407.11793 | null |
2024-07-16 | SlingBAG: Sliding ball adaptive growth algorithm with differentiable radiation enables super-efficient iterative 3D photoacoustic image reconstruction | Shuang Li et.al. | 2407.11781 | link |
2024-07-16 | First-principles investigation of magnetic exchange force microscopy on adatoms adsorbed on an antiferromagnetic surface | Soumyajyoti Haldar et.al. | 2407.11732 | null |
2024-07-17 | Monocular Occupancy Prediction for Scalable Indoor Scenes | Hongxiao Yu et.al. | 2407.11730 | link |
2024-07-16 | Superintegrable families of magnetic monopoles with non-radial potential in curved background | Antonella Marchesiello et.al. | 2407.11709 | null |
2024-07-22 | Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems | Jianzhu Huai et.al. | 2407.11705 | null |
2024-07-16 | Global atmospheric data assimilation with multi-modal masked autoencoders | Thomas J. Vandal et.al. | 2407.11696 | null |
2024-07-16 | Perception Helps Planning: Facilitating Multi-Stage Lane-Level Integration via Double-Edge Structures | Guoliang You et.al. | 2407.11644 | null |
2024-07-16 | Magnetic dissipation in short gamma-ray burst jets. I. Resistive relativistic MHD evolution in a model environment | Giancarlo Mattia et.al. | 2407.11581 | null |
2024-07-16 | SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation | Lei Yao et.al. | 2407.11564 | link |
2024-07-16 | Length-Aware Motion Synthesis via Latent Diffusion | Alessio Sampieri et.al. | 2407.11532 | link |
2024-07-16 | MRIo3DS-Net: A Mutually Reinforcing Images to 3D Surface RNN-like framework for model-adaptation indoor 3D reconstruction | Chang Li et.al. | 2407.11431 | null |
2024-07-16 | TeethDreamer: 3D Teeth Reconstruction from Five Intra-oral Photographs | Chenfan Xu et.al. | 2407.11419 | link |
2024-07-16 | Animate3D: Animating Any 3D Model with Multi-view Video Diffusion | Yanqin Jiang et.al. | 2407.11398 | null |
2024-07-16 | DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation | Jiwook Kim et.al. | 2407.11394 | link |
2024-07-17 | Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts | Jianhao Li et.al. | 2407.11382 | null |
2024-07-16 | I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM | Gwangtak Bae et.al. | 2407.11347 | null |
2024-07-16 | Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering | Jingqian Wu et.al. | 2407.11343 | null |
2024-07-19 | HEROS: Hierarchical Exploration with Online Subregion Updating for 3D Environment Coverage | Shijun Long et.al. | 2407.11326 | link |
2024-07-16 | Gaussian Splatting LK | Liuyue Xie et.al. | 2407.11309 | null |
2024-07-16 | PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | Pierre-David Letourneau et.al. | 2407.11306 | null |
2024-07-19 | LoRA-PT: Low-Rank Adapting UNETR for Hippocampus Segmentation Using Principal Tensor Singular Values and Vectors | Guanghua He et.al. | 2407.11292 | link |
2024-07-15 | Differentiable Voxelization and Mesh Morphing | Yihao Luo et.al. | 2407.11272 | link |
2024-07-15 | Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation | Rong Wang et.al. | 2407.11266 | link |
2024-07-25 | Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method | Adam Korycki et.al. | 2407.11238 | null |
2024-07-15 | Wind Tunnel Testing and Modeling Implications of an Advanced Turbine Cascade | Sharath Sathish et.al. | 2407.11210 | null |
2024-07-15 | Stationary CT Imaging of Intracranial Hemorrhage with Diffusion Posterior Sampling Reconstruction | Alejandro Lopez-Montes et.al. | 2407.11196 | null |
2024-07-15 | Axion-Photon Mixing in 3D: Classical Equations and Geometric Optics | J. I. McDonald et.al. | 2407.11192 | null |
2024-07-15 | iHuman: Instant Animatable Digital Humans From Monocular Videos | Pramish Paudel et.al. | 2407.11174 | link |
2024-07-15 | Deconfinements, Kutasov-Schwimmer dualities and $D_p[SU(N)]$ theories | Sergio Benvenuti et.al. | 2407.11134 | null |
2024-07-15 | S-confinement of 3d Argyres-Douglas theories and the Seiberg-like duality with an adjoint matter | Chiung Hwang et.al. | 2407.11129 | null |
2024-07-15 | CFD-based Shape Optimization of Structured Packings for Enhancing Separation Efficiency in Distillation | Sebastian Blauth et.al. | 2407.11099 | null |
2024-07-12 | Quantum-inverse scattering for the 20-vertex model up to Dynkin automorphism: crossing probabilities, 3D Poisson structure, triangular height functions, weak integrability | Pete Rigas et.al. | 2407.11066 | null |
2024-07-15 | GRUtopia: Dream General Robots in a City at Scale | Hanqing Wang et.al. | 2407.10943 | link |
2024-07-15 | STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences | Soroush Mehraban et.al. | 2407.10935 | null |
2024-07-20 | RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Chunliang Li et.al. | 2407.10876 | link |
2024-07-15 | AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems | Alexey Kotcov et.al. | 2407.10865 | null |
2024-07-15 | R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection | Zheyuan Zhou et.al. | 2407.10862 | null |
2024-07-15 | Trajectory Tracking for Unmanned Aerial Vehicles in 3D Spaces under Motion Constraints | Saurabh Kumar et.al. | 2407.10837 | null |
2024-07-15 | Temporal Event Stereo via Joint Learning with Stereoscopic Flow | Hoonhee Cho et.al. | 2407.10831 | link |
2024-07-15 | Enhancing Robustness to Noise Corruption for Point Cloud Model via Spatial Sorting and Set-Mixing Aggregation Module | Dingxin Zhang et.al. | 2407.10806 | null |
2024-07-15 | LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning | Zhuozhu Jian et.al. | 2407.10782 | null |
2024-07-15 | OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection | Jinghua Hou et.al. | 2407.10753 | link |
2024-07-15 | On Green’s function of the vorticity formulation for the 3D Navier-Stokes equations | Igor Kukavica et.al. | 2407.10751 | null |
2024-07-15 | SEED: A Simple and Effective 3D DETR in Point Clouds | Zhe Liu et.al. | 2407.10749 | link |
2024-07-15 | Scaling 3D Reasoning with LMMs to Large Robot Mission Environments Using Datagraphs | W. J. Meijer et.al. | 2407.10743 | null |
2024-07-15 | Single-cell 3D genome reconstruction in the haploid setting using rigidity theory | Sean Dewar et.al. | 2407.10700 | null |
2024-07-15 | FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation | Honghao Xu et.al. | 2407.10687 | null |
2024-07-15 | Deep Diffusion Image Prior for Efficient OOD Adaptation in 3D Inverse Problems | Hyungjin Chung et.al. | 2407.10641 | link |
2024-07-15 | Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model | Zhening Liu et.al. | 2407.10632 | link |
2024-07-15 | COSMU: Complete 3D human shape from monocular unconstrained images | Marco Pesavento et.al. | 2407.10586 | null |
2024-07-15 | Structure preserving nodal continuous Finite Elements via Global Flux quadrature | Wasilij Barsukow et.al. | 2407.10579 | null |
2024-07-15 | Pathformer3D: A 3D Scanpath Transformer for 360° Images | Rong Quan et.al. | 2407.10563 | link |
2024-07-15 | ConTEXTure: Consistent Multiview Images to Texture | Jaehoon Ahn et.al. | 2407.10558 | null |
2024-07-15 | 3D Geometric Shape Assembly via Efficient Point Cloud Matching | Nahyuk Lee et.al. | 2407.10542 | link |
2024-07-15 | 3D structure of hadrons and energy-momentum tensor | Cédric Lorcé et.al. | 2407.10496 | null |
2024-07-15 | Lite2Relight: 3D-aware Single Image Portrait Relighting | Pramod Rao et.al. | 2407.10487 | null |
2024-07-15 | GraphPrint: Extracting Features from 3D Protein Structure for Drug Target Affinity Prediction | Amritpal Singh et.al. | 2407.10452 | null |
2024-07-15 | A Multi-Stage Framework for 3D Individual Tooth Segmentation in Dental CBCT | Chunshi Wang et.al. | 2407.10433 | null |
2024-07-15 | Assessing the Impact of Network Quality-of-Service on Metaverse Virtual Reality User Experience | Rahul Dev Tripathi et.al. | 2407.10423 | null |
2024-07-15 | Effect of microstructure on fatigue properties of hyperelastic materials | Anna Stepashkina et.al. | 2407.10410 | null |
2024-07-14 | Signature of Orbital Driven Finite Momentum Pairing in a 3D Ising Superconductor | F. Z. Yang et.al. | 2407.10352 | null |
2024-07-14 | GLIM: 3D Range-Inertial Localization and Mapping with GPU-Accelerated Scan Matching Factors | Kenji Koide et.al. | 2407.10344 | link |
2024-07-14 | 3D Foundation Models Enable Simultaneous Geometry and Pose Estimation of Grasped Objects | Weiming Zhi et.al. | 2407.10331 | null |
2024-07-14 | Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors | Jae Joong Lee et.al. | 2407.10330 | null |
2024-07-14 | Complexity of 2D Snake Cube Puzzles | MIT Hardness Group et.al. | 2407.10323 | null |
2024-07-16 | RecGS: Removing Water Caustic with Recurrent Gaussian Splatting | Tianyi Zhang et.al. | 2407.10318 | null |
2024-07-14 | Quantized Inverse Design for Photonic Integrated Circuits | Frederik Schubert et.al. | 2407.10273 | null |
2024-07-14 | PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation | Nermin Samet et.al. | 2407.10220 | link |
2024-07-14 | Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data | Tuo Feng et.al. | 2407.10200 | link |
2024-07-14 | GRAPE: Generalizable and Robust Multi-view Facial Capture | Jing Li et.al. | 2407.10193 | null |
2024-07-14 | LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection | Sanmin Kim et.al. | 2407.10164 | link |
2024-07-14 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation | Li Li et.al. | 2407.10159 | link |
2024-07-14 | SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation | Lin Zhang et.al. | 2407.10157 | null |
2024-07-14 | FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Zheng Jiang et.al. | 2407.10135 | link |
2024-07-14 | 3DEgo: 3D Editing on the Go! | Umar Khalid et.al. | 2407.10102 | null |
2024-07-14 | STGFormer: Spatio-Temporal GraphFormer for 3D Human Pose Estimation in Video | Yang Liu et.al. | 2407.10099 | null |
2024-07-14 | Part2Object: Hierarchical Unsupervised 3D Instance Segmentation | Cheng Shi et.al. | 2407.10084 | link |
2024-07-14 | Transferable 3D Adversarial Shape Completion using Diffusion Models | Xuelong Dai et.al. | 2407.10077 | link |
2024-07-14 | SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion | Jiyuan Zhang et.al. | 2407.10062 | null |
2024-07-17 | Augmented Neural Fine-Tuning for Efficient Backdoor Purification | Nazmul Karim et.al. | 2407.10052 | link |
2024-07-13 | Curriculum Is More Influential Than Haptic Information During Reinforcement Learning of Object Manipulation Against Gravity | Pegah Ojaghi et.al. | 2407.09986 | link |
2024-07-13 | Free-form Grid Structure Form Finding based on Machine Learning and Multi-objective Optimisation | Yiping Meng et.al. | 2407.09852 | null |
2024-07-13 | Computationally Efficient Nanophotonic Design through Data-Driven Eigenmode Expansion | Mehmet Can Oktay et.al. | 2407.09847 | null |
2024-07-13 | 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance | Xiaoxu Xu et.al. | 2407.09826 | link |
2024-07-17 | VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation | Wenjie Zhuo et.al. | 2407.09822 | null |
2024-07-13 | 3D-consistency of negative flows | V. E. Adler et.al. | 2407.09813 | null |
2024-07-13 | ScaleRAFT: Cross-Scale Recurrent All-Pairs Field Transforms for 3D Motion Estimation | Han Ling et.al. | 2407.09797 | link |
2024-07-13 | Semi-supervised 3D Object Detection with PatchTeacher and PillarMix | Xiaopei Wu et.al. | 2407.09787 | link |
2024-07-13 | Self-supervised 3D Point Cloud Completion via Multi-view Adversarial Learning | Lintai Wu et.al. | 2407.09786 | link |
2024-07-13 | Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding | Ruihuang Li et.al. | 2407.09781 | null |
2024-07-13 | Coronal magnetic field and emission properties of small-scale bright and faint loops in the quiet Sun | Maria S. Madjarska et.al. | 2407.09769 | null |
2024-07-12 | Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion | Shiqi Tan et.al. | 2407.09697 | null |
2024-07-12 | 3x2: 3D Object Part Segmentation by 2D Semantic Correspondences | Anh Thai et.al. | 2407.09648 | null |
2024-07-12 | Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba | Haoye Dong et.al. | 2407.09646 | link |
2024-07-12 | FEBio FINESSE: An open-source finite element simulation approach to estimate in vivo heart valve strains using shape enforcement | Devin W. Laurence et.al. | 2407.09629 | null |
2024-07-12 | Acceleration of Tensor-Product Operations with Tensor Cores | Cu Cui et.al. | 2407.09621 | null |
2024-07-12 | Shadows Wreak Havocs in Transition Disks | Yansong Qian et.al. | 2407.09613 | null |
2024-07-12 | Spinning up the spool: Massive spinning fields in 3d quantum gravity | Robert Bourne et.al. | 2407.09608 | null |
2024-07-12 | StyleSplat: 3D Object Style Transfer with Gaussian Splatting | Sahil Jain et.al. | 2407.09473 | null |
2024-07-12 | Let Me DeCode You: Decoder Conditioning with Tabular Data | Tomasz Szczepański et.al. | 2407.09437 | link |
2024-07-12 | A novel direct Helmholtz solver in inhomogeneous media based on the operator Fourier transform functional calculus | Max Cubillos et.al. | 2407.09436 | null |
2024-07-12 | A Benchmark Environment for Offline Reinforcement Learning in Racing Games | Girolamo Macaluso et.al. | 2407.09415 | link |
2024-07-12 | Ground-state properties of the double trillium lattice antiferromagnet KBaCr $_2$(PO$_4$)$_3$ | R. Kolay et.al. | 2407.09376 | null |
2024-07-17 | Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding | Chuanhao Sun et.al. | 2407.09370 | link |
2024-07-12 | Pre-training Point Cloud Compact Model with Partial-aware Reconstruction | Yaohua Zha et.al. | 2407.09344 | null |
2024-07-12 | SS-SfP:Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization | Ashish Tiwari et.al. | 2407.09294 | null |
2024-07-12 | MetaFood CVPR 2024 Challenge on Physically Informed 3D Food Reconstruction: Methods and Results | Jiangpeng He et.al. | 2407.09285 | link |
2024-07-12 | Semantic UV mapping to improve texture inpainting for indoor scenes | Jelle Vermandere et.al. | 2407.09248 | null |
2024-07-12 | Belief Propagation-based Rotation and Translation Estimation for Rigid Body Localization | Volodymyr Vizitiv et.al. | 2407.09232 | null |
2024-07-12 | HUP-3D: A 3D multi-view synthetic dataset for assisted-egocentric hand-ultrasound pose estimation | Manuel Birlo et.al. | 2407.09215 | null |
2024-07-12 | On the Problem of Defining Charge Operators for the Dirac Quantum Field | Pablo Costa Rico et.al. | 2407.09126 | null |
2024-07-12 | Numerical Analysis on the Spatiotemporal Characteristics of the Portevin-Le Chatelier Effect in Ti-12Mo Alloy | Shiyuan Luo et.al. | 2407.09054 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-12 | OVExp: Open Vocabulary Exploration for Object-Oriented Navigation | Meng Wei et.al. | 2407.09016 | null |
2024-07-12 | Dynamic neural network with memristive CIM and CAM for 2D and 3D vision | Yue Zhang et.al. | 2407.08990 | null |
2024-07-12 | Symmetry Awareness Encoded Deep Learning Framework for Brain Imaging Analysis | Yang Ma et.al. | 2407.08948 | link |
2024-07-12 | Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Xingyu Peng et.al. | 2407.08931 | link |
2024-07-12 | KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting | Andrew Jeong et.al. | 2407.08909 | null |
2024-07-11 | Manipulating a Tetris-Inspired 3D Video Representation | Mihir Godbole et.al. | 2407.08885 | null |
2024-07-11 | Direct Measurement of Microwave Loss in Nb Films for Superconducting Qubits | B. Abdisatarov et.al. | 2407.08856 | null |
2024-07-11 | Breakdown of order-fractionalization in the CPT model | Aaditya Panigrahi et.al. | 2407.08784 | null |
2024-07-11 | Unifying 3D Representation and Control of Diverse Robots with a Single Camera | Sizhe Lester Li et.al. | 2407.08722 | link |
2024-07-11 | OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects | Akshay Krishnan et.al. | 2407.08711 | null |
2024-07-11 | SPOCKMIP: Segmentation of Vessels in MRAs with Enhanced Continuity using Maximum Intensity Projection as Loss | Chethan Radhakrishna et.al. | 2407.08655 | link |
2024-07-11 | RTMW: Real-Time Multi-Person 2D and 3D Whole-body Pose Estimation | Tao Jiang et.al. | 2407.08634 | link |
2024-07-16 | Vision and Tactile Robotic System to Grasp Litter in Outdoor Environments | Ignacio de Loyola Páez-Ubieta et.al. | 2407.08575 | null |
2024-07-11 | Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene | Ruiyang Zhang et.al. | 2407.08569 | link |
2024-07-11 | Enhancing 3D Planetary Atmosphere Simulations with a Surrogate Radiative Transfer Model | Tara P. A. Tahseen et.al. | 2407.08556 | link |
2024-07-11 | Learning Localization of Body and Finger Animation Skeleton Joints on Three-Dimensional Models of Human Bodies | Stefan Novaković et.al. | 2407.08484 | link |
2024-07-11 | Inverse-designed 3D laser nanoprinted phase masks to extend the depth of field of by systems | T. J. Sturges et.al. | 2407.08482 | null |
2024-07-11 | Brain Tumor Segmentation in MRI Images with 3D U-Net and Contextual Transformer | Thien-Qua T. Nguyen et.al. | 2407.08470 | null |
2024-07-12 | Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending | Delong Wu et.al. | 2407.08457 | null |
2024-07-11 | WildGaussians: 3D Gaussian Splatting in the Wild | Jonas Kulhanek et.al. | 2407.08447 | link |
2024-07-11 | Vortices on Cylinders and Warped Exponential Networks | Kunal Gupta et.al. | 2407.08445 | null |
2024-07-11 | Construction of blow-up solutions for the focusing critical-energy nonlinear wave equation in $\mathbb{R}^4$ and $\mathbb{R}^5$ | Dylan Samuelian et.al. | 2407.08444 | null |
2024-07-11 | The minimum neutron star mass in neutrino-driven supernova explosions | Bernhard Müller et.al. | 2407.08407 | null |
2024-07-11 | Accurate Cooperative Localization Utilizing LiDAR-equipped Roadside Infrastructure for Autonomous Driving | Yuze Jiang et.al. | 2407.08384 | null |
2024-07-11 | Digital twins to alleviate the need for real field data in vision-based vehicle speed detection systems | Antonio Hernández Martínez et.al. | 2407.08380 | null |
2024-07-11 | Magnetograms underestimate even unipolar magnetic flux nearly everywhere on the solar disk | Jonas Sinjan et.al. | 2407.08368 | null |
2024-07-23 | Scalar Function Topology Divergence: Comparing Topology of 3D Objects | Ilya Trofimov et.al. | 2407.08364 | link |
2024-07-11 | GUI-based Pedicle Screw Planning on Fluoroscopic Images Utilizing Vertebral Segmentation | Vivek Maik et.al. | 2407.08347 | null |
2024-07-11 | Improving Molecular Modeling with Geometric GNNs: an Empirical Study | Ali Ramlaoui et.al. | 2407.08313 | null |
2024-07-11 | Gap Completion in Point Cloud Scene occluded by Vehicles using SGC-Net | Yu Feng et.al. | 2407.08290 | null |
2024-07-11 | Determination of five-parameter grain boundary characteristics in nanocrystalline Ni-W by Scanning Precession Electron Diffraction Tomography | E. F. Rauch et.al. | 2407.08251 | null |
2024-07-11 | Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching | Dongliang Cao et.al. | 2407.08244 | null |
2024-07-11 | GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views | Vinayak Gupta et.al. | 2407.08221 | link |
2024-07-18 | Explicit-NeRF-QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-11 | Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields | Haojie Lian et.al. | 2407.08154 | null |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2024-07-10 | RoCap: A Robotic Data Collection Pipeline for the Pose Estimation of Appearance-Changing Objects | Jiahao Nick Li et.al. | 2407.08081 | null |
2024-07-10 | Smooth Like Butter: Evaluating Multi-Lattice Transitions in Property-Augmented Latent Spaces | Martha Baldwin et.al. | 2407.08074 | null |
2024-07-10 | Rossby Wave Instability and Substructure Formation in 3D Non-Ideal MHD Wind-Launching Disks | Chun-Yen Hsu et.al. | 2407.08032 | null |
2024-07-10 | Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization | Jinjie Mai et.al. | 2407.08023 | link |
2024-07-10 | Interactive Segmentation Model for Placenta Segmentation from 3D Ultrasound images | Hao Li et.al. | 2407.08020 | link |
2024-07-10 | Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation | Jaeyeul Kim et.al. | 2407.07995 | link |
2024-07-10 | 3D E-textile for Exercise Physiology and Clinical Maternal Health Monitoring | Junyi Zhao et.al. | 2407.07954 | null |
2024-07-10 | Analytic framework for self-dual criticality in $\mathbb{Z}_k$ gauge theory with matter | Zhengyan Darius Shi et.al. | 2407.07941 | null |
2024-07-10 | Token-Mol 1.0: Tokenized drug design with large language model | Jike Wang et.al. | 2407.07930 | link |
2024-07-08 | A Trustworthy AIoT-enabled Localization System via Federated Learning and Blockchain | Junfei Wang et.al. | 2407.07921 | null |
2024-07-10 | LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models | Feng Li et.al. | 2407.07895 | link |
2024-07-10 | Generative Image as Action Models | Mohit Shridhar et.al. | 2407.07875 | link |
2024-07-10 | Controlling Space and Time with Diffusion Models | Daniel Watson et.al. | 2407.07860 | null |
2024-07-10 | RoBus: A Multimodal Dataset for Controllable Road Networks and Building Layouts Generation | Tao Li et.al. | 2407.07835 | link |
2024-07-10 | The positioning of stress fibers in contractile cells minimizes internal mechanical stress | Lukas Riedel et.al. | 2407.07797 | link |
2024-07-10 | Multiparameter admittance spectroscopy for investigating defects in MoS ${_2}$ thin film MOSFETs | Eros Reato et.al. | 2407.07783 | null |
2024-07-10 | HSTPROMO Internal Proper Motion Kinematics of Dwarf Spheroidal Galaxies: I. Velocity Anisotropy and Dark Matter Cusp Slope of Draco | Eduardo Vitral et.al. | 2407.07769 | link |
2024-07-10 | Protecting NeRFs’ Copyright via Plug-And-Play Watermarking Base Model | Qi Song et.al. | 2407.07735 | null |
2024-07-10 | Motion simulation of radio-labeled cells in whole-body positron emission tomography | Nils Marquardt et.al. | 2407.07709 | null |
2024-07-10 | Localizing axial dense emitters based onsingle-helix point spread function andcompressed sensing | Hanzhe Wu et.al. | 2407.07681 | null |
2024-07-10 | Synthetic to Authentic: Transferring Realism to 3D Face Renderings for Boosting Face Recognition | Parsa Rahimi et.al. | 2407.07627 | null |
2024-07-18 | Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction | Yili Liu et.al. | 2407.07587 | null |
2024-07-11 | InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior | Chenguo Lin et.al. | 2407.07580 | null |
2024-07-10 | Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation | Zikai Huang et.al. | 2407.07554 | null |
2024-07-10 | Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation | István Sárándi et.al. | 2407.07532 | link |
2024-07-10 | Swin SMT: Global Sequential Modeling in 3D Medical Image Segmentation | Szymon Płotka et.al. | 2407.07514 | link |
2024-07-10 | On a 3D Stokes eigenvalue problem under Navier slip-with-friction boundary conditions and applications to Navier-Stokes equations | Luigi C. Berselli et.al. | 2407.07496 | null |
2024-07-17 | Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining | Tianfang Sun et.al. | 2407.07465 | null |
2024-07-10 | MAN TruckScenes: A multimodal dataset for autonomous trucking in diverse conditions | Felix Fent et.al. | 2407.07462 | null |
2024-07-10 | Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field | Ganlin Yang et.al. | 2407.07461 | null |
2024-07-10 | Aging-Resistant Wideband Precoding in 5G and Beyond Using 3D Convolutional Neural Networks | Alejandro Villena-Rodriguez et.al. | 2407.07434 | null |
2024-07-10 | Targeting low micro-roughness for 3D printed aluminium mirrors using a hot isostatic press | Carolyn Atkins et.al. | 2407.07405 | null |
2024-07-10 | Hole Statistics of Equilibrium 2D and 3D Hard-Sphere Crystals | Haina Wang et.al. | 2407.07390 | null |
2024-07-10 | Asymmetric Fluid Flow in Helical Pipes Inspired by Shark Intestines | Ido Levin et.al. | 2407.07354 | null |
2024-07-21 | Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis | Jian-Qing Zheng et.al. | 2407.07295 | link |
2024-07-17 | MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition | Aggelina Chatziagapi et.al. | 2407.07284 | null |
2024-07-09 | Exploring Camera Encoder Designs for Autonomous Driving Perception | Barath Lakshmanan et.al. | 2407.07276 | null |
2024-07-09 | HAMIL-QA: Hierarchical Approach to Multiple Instance Learning for Atrial LGE MRI Quality Assessment | K M Arefeen Sultan et.al. | 2407.07254 | null |
2024-07-09 | Reference-based Controllable Scene Stylization with Gaussian Splatting | Yiqun Mei et.al. | 2407.07220 | null |
2024-07-17 | Hunting 3d $\mathcal{N}=1$ SQED in the $ε$ -expansion | Yacov-Nir Breitstein et.al. | 2407.07148 | null |
2024-07-09 | A 3D Pancreatic Cancer Model with Integrated Optical Sensors for Noninvasive Metabolism Monitoring and Drug Screening | Anna Chiara Siciliano et.al. | 2407.07126 | null |
2024-07-09 | V-VIPE: Variational View Invariant Pose Embedding | Mara Levy et.al. | 2407.07092 | null |
2024-07-10 | 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes | Nicolas Moenne-Loccoz et.al. | 2407.07090 | null |
2024-07-12 | Bow-shock structure of Sgr B molecular-cloud complex in the Galactic Centre inferred from 3D CO-line kinematics | Yoshiaki Sofue et.al. | 2407.07013 | null |
2024-07-09 | Improved Block Merging for 3D Point Cloud Instance Segmentation | Leon Denis et.al. | 2407.06991 | null |
2024-07-17 | Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images | Chuanrui Zhang et.al. | 2407.06984 | null |
2024-07-09 | INTERACT: An authoring tool that facilitates the creation of human centric interaction with 3d objects in virtual reality | Rama Krishnan Gopal Ramasamy Thandapani et.al. | 2407.06967 | null |
2024-07-09 | Joint prototype and coefficient prediction for 3D instance segmentation | Remco Royen et.al. | 2407.06958 | null |
2024-07-11 | RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models | Bowen Zhang et.al. | 2407.06938 | null |
2024-07-09 | A Unified Approach to Multi-task Legged Navigation: Temporal Logic Meets Reinforcement Learning | Jesse Jiang et.al. | 2407.06931 | null |
2024-07-10 | Identity-enabled CDMA LiDAR for massively parallel ranging with a single-element receiver | Yixiu Shen et.al. | 2407.06918 | null |
2024-07-10 | Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts | Shuangkang Fang et.al. | 2407.06842 | link |
2024-07-09 | Regularization by Nonlinear Noise for PDEs: Well-posedness and Finite Time Extinction | Wei Hong et.al. | 2407.06840 | null |
2024-07-08 | Training-free CryoET Tomogram Segmentation | Yizhou Zhao et.al. | 2407.06833 | link |
2024-07-09 | 3D Imaging of directional multi-scale cellulose nanostructures through multi-directional dark-field neutron tomography | Matteo Busi et.al. | 2407.06728 | null |
2024-07-09 | HERMES: Holographic Equivariant neuRal network model for Mutational Effect and Stability prediction | Gian Marco Visani et.al. | 2407.06703 | link |
2024-07-09 | Ultra-stable 3D-printed precision voltage divider for calibrations and experiments | Stephan Passon et.al. | 2407.06700 | null |
2024-07-09 | Deep-Motion-Net: GNN-based volumetric organ shape reconstruction from single-view 2D projections | Isuru Wijesinghe et.al. | 2407.06692 | null |
2024-07-09 | Universal Multi-view Black-box Attack against Object Detectors via Layout Optimization | Donghua Wang et.al. | 2407.06688 | null |
2024-07-09 | MRI Volume-Based Robust Brain Age Estimation Using Weight-Shared Spatial Attention in 3D CNNs | Vamshi Krishna Kancharla et.al. | 2407.06686 | null |
2024-07-12 | Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task | Yiran Yang et.al. | 2407.06617 | link |
2024-07-09 | UAV Formation and Resource Allocation Optimization for Communication-Assisted 3D InSAR Sensing | Mohamed-Amine Lahmeri et.al. | 2407.06607 | null |
2024-07-09 | Non-uniqueness of Leray weak solutions of the forced MHD equations | Jun Wang et.al. | 2407.06565 | null |
2024-07-16 | Decomposition Betters Tracking Everything Everywhere | Rui Li et.al. | 2407.06531 | link |
2024-07-10 | VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Yibo Liu et.al. | 2407.06516 | null |
2024-07-09 | Computer vision tasks for intelligent aerospace missions: An overview | Huilin Chen et.al. | 2407.06513 | null |
2024-07-09 | LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration | Jiayi Liu et.al. | 2407.06512 | link |
2024-07-09 | An optimal upper bound on the the determining wavenumber for 3D Navier-Stokes Equations | Alexey Cheskidov et.al. | 2407.06474 | null |
2024-07-16 | AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking | Yuheng Li et.al. | 2407.06468 | link |
2024-07-08 | Swin UNETR segmentation with automated geometry filtering for biomechanical modeling of knee joint cartilage | Reza Kakavand et.al. | 2407.06403 | null |
2024-07-08 | Stochastic Traveling Salesperson Problem with Neighborhoods for Object Detection | Cheng Peng et.al. | 2407.06366 | null |
2024-07-08 | MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Xuan Ju et.al. | 2407.06358 | null |
2024-07-08 | Reconciling M/L Ratios Across Cosmic Time: a Concordance IMF for Massive Galaxies | Pieter van Dokkum et.al. | 2407.06281 | null |
2024-07-08 | FairDiff: Fair Segmentation with Point-Image Diffusion | Wenyi Li et.al. | 2407.06250 | link |
2024-07-08 | Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images | Zhangyang Qi et.al. | 2407.06191 | null |
2024-07-10 | 4D Contrastive Superflows are Dense 3D Representation Learners | Xiang Xu et.al. | 2407.06190 | link |
2024-07-08 | Laser-scanning of induction-melted Al alloys: are they representative of additively manufactured ones? | Zhaoxuan Ge et.al. | 2407.06138 | null |
2024-07-09 | Enhancing the Prediction of Glass Dynamics by Incorporating the Direction of Deviation from Equilibrium Positions | Xiao Jiang et.al. | 2407.06111 | null |
2024-07-16 | PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jinhua Zhang et.al. | 2407.06109 | link |
2024-07-08 | 3D Vision and Language Pretraining with Large-Scale Synthetic Data | Dejie Yang et.al. | 2407.06084 | link |
2024-07-08 | Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots | Siva Krishna Ravipati et.al. | 2407.06077 | link |
2024-07-08 | Superconductivity up to 14.2 K in MnB $_4$ under pressure | Zhe-Ning Xiang et.al. | 2407.06061 | null |
2024-07-16 | Learning local equivariant representations for quantum operators | Zhanghao Zhouyin et.al. | 2407.06053 | link |
2024-07-08 | Foams with 3D Spatially Programmed Mechanics Enabled by Autonomous Active Learning on Viscous Thread Printing | Brett Emery et.al. | 2407.06051 | null |
2024-07-08 | Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts | Puzuo Wang et.al. | 2407.06043 | null |
2024-07-08 | TAPVid-3D: A Benchmark for Tracking Any Point in 3D | Skanda Koppula et.al. | 2407.05921 | link |
2024-07-10 | An efficient method to automate tooth identification and 3D bounding box extraction from Cone Beam CT Images | Ignacio Garrido Botella et.al. | 2407.05892 | null |
2024-07-08 | Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation | Jiaqi Chen et.al. | 2407.05890 | null |
2024-07-08 | A coarse Erdős-Pósa theorem | Jungho Ahn et.al. | 2407.05883 | null |
2024-07-08 | Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised Learning | Bin Ren et.al. | 2407.05862 | link |
2024-07-08 | Gyroid ferromagnetic nanostructures in 3D magnonics | Mateusz Gołębiewski et.al. | 2407.05851 | null |
2024-07-08 | 3D Vessel Graph Generation Using Denoising Diffusion | Chinmay Prabhakar et.al. | 2407.05842 | link |
2024-07-08 | A novel metric for assessing climatological surface habitability | Hannah L. Woodward et.al. | 2407.05838 | link |
2024-07-08 | Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction | Tengjie Zhu et.al. | 2407.05771 | null |
2024-07-08 | Boosting 3D Object Detection with Semantic-Aware Multi-Branch Framework | Hao Jing et.al. | 2407.05769 | null |
2024-07-14 | Nonrigid Reconstruction of Freehand Ultrasound without a Tracker | Qi Li et.al. | 2407.05767 | link |
2024-07-08 | Cellular diffusion processes in singularly perturbed domains | Paul C Bressloff et.al. | 2407.05747 | null |
2024-07-08 | TransMA: an explainable multi-modal deep learning model for predicting properties of ionizable lipid nanoparticles in mRNA delivery | Kun Wu et.al. | 2407.05736 | link |
2024-07-08 | Low velocity streams inside the planetary nebula H 2-18. A 3D photoionization and kinematical reconstruction | K. Gesicki et.al. | 2407.05727 | null |
2024-07-08 | On a new 3D generalized Hunter-Saxton equation | Sergei Sakovich et.al. | 2407.05723 | null |
2024-07-08 | Sensor response and radiation damage effects for 3D pixels in the ATLAS IBL Detector | ATLAS Collaboration et.al. | 2407.05716 | null |
2024-07-09 | PCAC-GAN:ASparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression | Xiaolong Mao et.al. | 2407.05677 | null |
2024-07-08 | A Stochastic Interacting Particle-Field Algorithm for a Haptotaxis Advection-Diffusion System Modeling Cancer Cell Invasion | Boyi Hu et.al. | 2407.05626 | null |
2024-07-08 | OSN: Infinite Representations of Dynamic 3D Scenes from Monocular Videos | Ziyang Song et.al. | 2407.05615 | link |
2024-07-08 | A consistent, volume preserving, and adaptive mesh refinement-based framework for modeling non-isothermal gas-liquid-solid flows with phase change | Ramakrishnan Thirumalaisamy et.al. | 2407.05588 | link |
2024-07-08 | FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance | Jiedong Zhuang et.al. | 2407.05578 | null |
2024-07-08 | Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder | Jia Liu et.al. | 2407.05552 | null |
2024-07-07 | Reflectance measurements of mm-wave absorbers using frequency-domain continuous wave THz spectroscopy | Gaganpreet Singh et.al. | 2407.05512 | null |
2024-07-07 | Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness | Idris Hamoud et.al. | 2407.05448 | null |
2024-07-07 | Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis | Qi Sun et.al. | 2407.05388 | null |
2024-07-10 | Multi-branch Collaborative Learning Network for 3D Visual Grounding | Zhipeng Qian et.al. | 2407.05363 | link |
2024-07-10 | On the power of data augmentation for head pose estimation | Michael Welter et.al. | 2407.05357 | link |
2024-07-07 | Three-dimensional solitons in fractional nonlinear Schrödinger equation with exponential saturating nonlinearity | Volodymyr M. Lashkin et.al. | 2407.05354 | null |
2024-07-07 | Generating multi-scale NMC particles with radial grain architectures using spatial stochastics and GANs | Lukas Fuchs et.al. | 2407.05333 | null |
2024-07-07 | PICA: Physics-Integrated Clothed Avatar | Bo Peng et.al. | 2407.05324 | null |
2024-07-07 | Additive manufacturing in ceramics: targeting lightweight mirror applications in the visible, ultraviolet and X-ray | Carolyn Atkins et.al. | 2407.05314 | null |
2024-07-07 | SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning | Yi Feng et.al. | 2407.05283 | link |
2024-07-07 | HyperKAN: Kolmogorov-Arnold Networks make Hyperspectral Image Classificators Smarter | Valeriy Lobanov et.al. | 2407.05278 | link |
2024-07-07 | Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image | Pengkun Jiao et.al. | 2407.05256 | null |
2024-07-07 | GaussReg: Fast 3D Registration with Gaussian Splatting | Jiahao Chang et.al. | 2407.05254 | null |
2024-07-09 | P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds | Jiahao Nie et.al. | 2407.05238 | link |
2024-07-06 | Leveraging Task-Specific Knowledge from LLM for Semi-Supervised 3D Medical Image Segmentation | Suruchi Kumari et.al. | 2407.05088 | null |
2024-07-06 | Slice-Consistent 3D Volumetric Brain CT-to-MRI Translation with 2D Brownian Bridge Diffusion Model | Kyobin Choo et.al. | 2407.05059 | link |
2024-07-06 | SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction | Weixing Xie et.al. | 2407.05023 | link |
2024-07-06 | Incremental Multiview Point Cloud Registration | Xiaoya Cheng et.al. | 2407.05021 | link |
2024-07-06 | T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy | Fan Duan et.al. | 2407.05008 | link |
2024-07-06 | SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation | Guoan Wang et.al. | 2407.04938 | null |
2024-07-15 | JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention | Brian Cheong et.al. | 2407.04926 | link |
2024-07-06 | Aortic root landmark localization with optimal transport loss for heatmap regression | Tsuyoshi Ishizone et.al. | 2407.04921 | link |
2024-07-06 | Bridging-Induced Phase Separation and Loop Extrusion Drive Noise in Chromatin Transcription | Michael Chiang et.al. | 2407.04907 | null |
2024-07-05 | An embedding-aware continuum thin shell formulation | Abhishek Ghosh et.al. | 2407.04894 | null |
2024-07-05 | Neural varifolds: an aggregate representation for quantifying the geometry of point clouds | Juheon Lee et.al. | 2407.04844 | null |
2024-07-05 | 3D Adaptive Structural Convolution Network for Domain-Invariant Point Cloud Recognition | Younggun Kim et.al. | 2407.04833 | null |
2024-07-15 | LaRa: Efficient Large-Baseline Radiance Fields | Anpei Chen et.al. | 2407.04699 | null |
2024-07-05 | RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation | Yuxuan Kuang et.al. | 2407.04689 | link |
2024-07-05 | Efficient Betti Matching Enables Topology-Aware 3D Segmentation via Persistent Homology | Nico Stucki et.al. | 2407.04683 | null |
2024-07-05 | Surface-Functionalization of Oleate-Capped Nano-Emitters for Stable Dispersion in 3D-Printable Polymers | Akhilesh Kumar Pathak et.al. | 2407.04636 | null |
2024-07-05 | Unbalanced optimal transport for stochastic particle tracking | Kairui Hao et.al. | 2407.04583 | null |
2024-07-05 | Gaussian Eigen Models for Human Heads | Wojciech Zielonka et.al. | 2407.04545 | null |
2024-07-05 | Neutral atomic and molecular gas dynamics in the nearby spiral galaxies NGC 1512, NGC 4535, and NGC 7496 | Sebastian Laudage et.al. | 2407.04531 | null |
2024-07-05 | Universal Scaling Laws for a Generic Swimmer Model | Bruno Ventéjou et.al. | 2407.04511 | null |
2024-07-12 | Segment Any 4D Gaussians | Shengxiang Ji et.al. | 2407.04504 | null |
2024-07-05 | Rethinking Data Input for Point Cloud Upsampling | Tongxu Zhang et.al. | 2407.04476 | null |
2024-07-05 | The ACCEL $^2$ project: simulating Lyman-$α$ forest in large-volume hydrodynamical simulations | Solène Chabanier et.al. | 2407.04473 | null |
2024-07-05 | VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing | Shang Liu et.al. | 2407.04461 | null |
2024-07-05 | Benchmarking structure-based three-dimensional molecular generative models using GenBench3D: ligand conformation quality matters | Benoit Baillif et.al. | 2407.04424 | link |
2024-07-05 | High-throughput magnetic co-doping and design of exchange interactions in a topological insulator | Rubel Mozumder et.al. | 2407.04413 | null |
2024-07-05 | Multi-Antenna Technology for 6G Integrated Sensing and Communication | Yong Zeng et.al. | 2407.04404 | null |
2024-07-05 | A Tree-based Next-best-trajectory Method for 3D UAV Exploration | Björn Lindqvist et.al. | 2407.04386 | link |
2024-07-05 | Unsupervised Learning of Category-Level 3D Pose from Object-Centric Videos | Leonhard Sommer et.al. | 2407.04384 | link |
2024-07-05 | Data-Driven Tissue- and Subject-Specific Elastic Regularization for Medical Image Registration | Anna Reithmeir et.al. | 2407.04355 | link |
2024-07-15 | CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images | Jisu Shin et.al. | 2407.04345 | link |
2024-07-10 | LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes | Zexian Huang et.al. | 2407.04326 | null |
2024-07-05 | 2D BAO vs 3D BAO: solving the Hubble tension with alternative cosmological models | Sowmaydeep Dwivedi et.al. | 2407.04322 | null |
2024-07-05 | Towards Stable 3D Object Detection | Jiabao Wang et.al. | 2407.04305 | null |
2024-07-05 | TSC-PCAC: Voxel Transformer and Sparse Convolution Based Point Cloud Attribute Compression for 3D Broadcasting | Zixi Guo et.al. | 2407.04284 | link |
2024-07-05 | Fine-grained Context and Multi-modal Alignment for Freehand 3D Ultrasound Reconstruction | Zhongnuo Yan et.al. | 2407.04242 | null |
2024-07-10 | GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction | Yuxuan Mu et.al. | 2407.04237 | null |
2024-07-11 | Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing | Anushrut Jignasu et.al. | 2407.04180 | null |
2024-07-12 | Detect Closer Surfaces that can be Seen: New Modeling and Evaluation in Cross-domain 3D Object Detection | Ruixiao Zhang et.al. | 2407.04061 | link |
2024-07-04 | Occupancy as Set of Points | Yiang Shi et.al. | 2407.04049 | link |
2024-07-04 | Craftium: An Extensible Framework for Creating Reinforcement Learning Environments | Mikel Malagón et.al. | 2407.03969 | link |
2024-07-04 | Runaway electron beam formation, vertical motion, termination and wall loads in EU-DEMO | F. Vannini et.al. | 2407.03940 | null |
2024-07-15 | SfM on-the-fly: Get better 3D from What You Capture | Zongqian Zhan et.al. | 2407.03939 | null |
2024-07-04 | CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images | Junghe Lee et.al. | 2407.03923 | null |
2024-07-04 | Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy | Yujie Zhang et.al. | 2407.03885 | link |
2024-07-04 | Unsupervised Analysis of Alzheimer’s Disease Signatures using 3D Deformable Autoencoders | Mehmet Yigit Avci et.al. | 2407.03863 | link |
2024-07-04 | PFGS: High Fidelity Point Cloud Rendering via Feature Splatting | Jiaxu Wang et.al. | 2407.03857 | link |
2024-07-04 | Beyond Viewpoint: Robust 3D Object Recognition under Arbitrary Views through Joint Multi-Part Representation | Linlong Fan et.al. | 2407.03842 | null |
2024-07-04 | Markerless Multi-view 3D Human Pose Estimation: a survey | Ana Filipa Rodrigues Nogueira et.al. | 2407.03817 | null |
2024-07-04 | CardioSpectrum: Comprehensive Myocardium Motion Analysis with 3D Deep Learning and Geometric Insights | Shahar Zuler et.al. | 2407.03794 | link |
2024-07-04 | SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors | Yijia Guo et.al. | 2407.03771 | null |
2024-07-04 | UniPlane: Unified Plane Detection and Reconstruction from Posed Monocular Videos | Yuzhong Huang et.al. | 2407.03594 | null |
2024-07-04 | A Fast Dynamic Point Detection Method for LiDAR-Inertial Odometry in Driving Scenarios | Zikang Yuan et.al. | 2407.03590 | link |
2024-07-03 | NEBULA: Neural Empirical Bayes Under Latent Representations for Efficient and Controllable Design of Molecular Libraries | Ewa M. Nowara et.al. | 2407.03428 | link |
2024-07-03 | Observation of Co-propagating Chiral Zero Modes in Magnetic Photonic Crystals | Zhongfu Li et.al. | 2407.03390 | null |
2024-07-03 | HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization | Yucheng Tang et.al. | 2407.03307 | null |
2024-07-03 | A Unified Framework for 3D Scene Understanding | Wei Xu et.al. | 2407.03263 | link |
2024-07-03 | Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-View 3D Detection and Tracking | Mingzhe Guo et.al. | 2407.03240 | null |
2024-07-03 | Expressive Gaussian Human Avatars from Monocular RGB Video | Hezhen Hu et.al. | 2407.03204 | null |
2024-07-03 | IMC 2024 Methods & Solutions Review | Shyam Gupta et.al. | 2407.03172 | null |
2024-07-03 | Design of a UE5-based digital twin platform | Shaoqiu Lyu et.al. | 2407.03107 | null |
2024-07-03 | Electromagnetic Property Sensing Based on Diffusion Model in ISAC System | Yuhua Jiang et.al. | 2407.03075 | null |
2024-07-03 | Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation | Mengmeng Cui et.al. | 2407.02990 | null |
2024-07-03 | Numerical analysis of a porous natural convection system with vorticity and viscous dissipation | Russel Demos et.al. | 2407.02986 | null |
2024-07-03 | 3D Multimodal Image Registration for Plant Phenotyping | Eric Stumpe et.al. | 2407.02946 | link |
2024-07-13 | VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors | Sungwon Hwang et.al. | 2407.02945 | link |
2024-07-03 | Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jiaxin Guo et.al. | 2407.02918 | link |
2024-07-04 | Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion | Hang Xu et.al. | 2407.02887 | link |
2024-07-03 | Spatially Coherent 3D Distributions of HI and CO in the Milky Way | Laurin Söding et.al. | 2407.02859 | null |
2024-07-05 | Multi-Task Domain Adaptation for Language Grounding with 3D Objects | Penglei Sun et.al. | 2407.02846 | null |
2024-07-03 | A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes | Li Fang et.al. | 2407.02830 | link |
2024-07-02 | Parametric Modeling and Estimation of Photon Registrations for 3D Imaging | Weijian Zhang et.al. | 2407.02712 | null |
2024-07-02 | Generating tailored high frequency features in core collapse supernova gravitational wave signals applicable in LIGO interferometric studies | César Tiznado et.al. | 2407.02696 | null |
2024-07-02 | Depth-Aware Endoscopic Video Inpainting | Francis Xiatian Zhang et.al. | 2407.02675 | link |
2024-07-02 | MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering | Ahmad AlMughrabi et.al. | 2407.02668 | null |
2024-07-02 | Pairing interaction from Demons in Sr $_2$RuO$_4$ | Young Woo Choi et.al. | 2407.02654 | null |
2024-07-02 | 3d Gravity as a random ensemble | Daniel L. Jafferis et.al. | 2407.02649 | null |
2024-07-02 | HOIMotion: Forecasting Human Motion During Human-Object Interactions Using Egocentric 3D Object Bounding Boxes | Zhiming Hu et.al. | 2407.02633 | null |
2024-07-02 | Meta 3D Gen | Raphael Bensadoun et.al. | 2407.02599 | null |
2024-07-04 | AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction | Mustafa Khan et.al. | 2407.02598 | null |
2024-07-02 | Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR Materials | Yawar Siddiqui et.al. | 2407.02445 | null |
2024-07-02 | Meta 3D TextureGen: Fast and Consistent Texture Generation for 3D Objects | Raphael Bensadoun et.al. | 2407.02430 | null |
2024-07-02 | AXIAL: Attention-based eXplainability for Interpretable Alzheimer’s Localized Diagnosis using 2D CNNs on 3D MRI brain scans | Gabriele Lozupone et.al. | 2407.02418 | link |
2024-07-02 | The influence of the 3D Galactic gas structure on cosmic-ray transport and gamma-ray emission | Andrés Ramírez et.al. | 2407.02410 | null |
2024-07-02 | Real Time Collision Avoidance with GPU-Computed Distance Maps | Wendwosen Bellete Bedada et.al. | 2407.02363 | null |
2024-07-02 | Implementation of reflection matrix microscopy: An algorithm perspective | Sungsam Kang et.al. | 2407.02321 | null |
2024-07-02 | Turbulent Diffuse Molecular Media with Non-ideal Magnetohydrodynamics and Consistent Thermochemistry: Numerical Simulations and Dynamic Characteristics | Nannan Yue et.al. | 2407.02306 | null |
2024-07-02 | On the multicomponent reactive flows in moving domains | Kuntal Bhandari et.al. | 2407.02303 | null |
2024-07-02 | Some properties of a non-hydrostatic stochastic oceanic primitive equations model | Arnaud Debussche et.al. | 2407.02289 | null |
2024-07-02 | White-Box 3D-OMP-Transformer for ISAC | Bowen Zhang et.al. | 2407.02251 | null |
2024-07-02 | Hypermultiplexed off-chip hologram by on-chip integrated metasurface | Xianjin Liu et.al. | 2407.02247 | null |
2024-07-02 | Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation | Cheng-Yi Li et.al. | 2407.02235 | link |
2024-07-03 | BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Wenpu Li et.al. | 2407.02174 | link |
2024-07-10 | WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation | Zihao Huang et.al. | 2407.02165 | link |
2024-07-03 | SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images | Jintu Zheng et.al. | 2407.02159 | link |
2024-07-02 | VRBiom: A New Periocular Dataset for Biometric Applications of HMD | Ketan Kotwal et.al. | 2407.02150 | null |
2024-07-02 | Magnetic critical phenomena and low temperature re-entrant spin-glass features of Al $_2$ MnFe Heusler alloy | Abhinav Kumar Khorwal et.al. | 2407.02149 | null |
2024-07-04 | Estimating Inverse Scattering Potentials for n-p System Using Variational Monte Carlo & Neural Networks | Anil Khachi et.al. | 2407.02137 | null |
2024-07-02 | Joint-Dataset Learning and Cross-Consistent Regularization for Text-to-Motion Retrieval | Nicola Messina et.al. | 2407.02104 | null |
2024-07-02 | DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection | Kaixin Xu et.al. | 2407.02098 | null |
2024-07-07 | Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion | Bohan Li et.al. | 2407.02077 | link |
2024-07-02 | CountFormer: Multi-View Crowd Counting Transformer | Hong Mo et.al. | 2407.02047 | link |
2024-07-02 | ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation | Zhiyuan Ma et.al. | 2407.02040 | link |
2024-07-04 | Camera-LiDAR Cross-modality Gait Recognition | Wenxuan Guo et.al. | 2407.02038 | null |
2024-07-02 | TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation | Chaofan Luo et.al. | 2407.02034 | null |
2024-07-02 | A Proposal for a FAIR Management of 3D Data in Cultural Heritage: The Aldrovandi Digital Twin Case | Sebastian Barzaghi et.al. | 2407.02018 | null |
2024-07-02 | AHMsys: An Automated HVAC Modeling System for BIM Project | Long Hoang Dang et.al. | 2407.01987 | null |
2024-07-02 | FlowTrack: Point-level Flow Network for 3D Single Object Tracking | Shuo Li et.al. | 2407.01959 | null |
2024-07-02 | Indoor 3D Reconstruction with an Unknown Camera-Projector Pair | Zhaoshuai Qi et.al. | 2407.01945 | null |
2024-07-02 | Probabilistic 3D Correspondence Prediction from Sparse Unsegmented Images | Krithika Iyer et.al. | 2407.01931 | null |
2024-07-02 | PO-MSCKF: An Efficient Visual-Inertial Odometry by Reconstructing the Multi-State Constrained Kalman Filter with the Pose-only Theory | Du Xueyu et.al. | 2407.01888 | null |
2024-07-01 | Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval | Aneeshan Sain et.al. | 2407.01810 | null |
2024-07-01 | Optimising robotic operation speed with edge computing over 5G networks: Insights from selective harvesting robots | Usman A. Zahidi et.al. | 2407.01792 | null |
2024-07-01 | fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence | Francis Williams et.al. | 2407.01781 | null |
2024-07-01 | Tayler-Spruit dynamo in stably stratified rotating fluids: Application to proto-magnetars | Paul Barrère et.al. | 2407.01775 | null |
2024-07-01 | DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction | Yujin Ham et.al. | 2407.01761 | null |
2024-07-01 | VolETA: One- and Few-shot Food Volume Estimation | Ahmad AlMughrabi et.al. | 2407.01717 | link |
2024-07-01 | SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving | Qingwen Zhang et.al. | 2407.01702 | link |
2024-07-02 | xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart | Tianrun Chen et.al. | 2407.01530 | link |
2024-07-02 | Empowering 3D Visual Grounding with Reasoning Capabilities | Chenming Zhu et.al. | 2407.01525 | null |
2024-07-01 | Centerline Boundary Dice Loss for Vascular Segmentation | Pengcheng Shi et.al. | 2407.01517 | link |
2024-07-02 | How Land-Mass Distribution Influences the Atmospheric Dynamics of Tidally Locked Terrestrial Exoplanets | F. Sainsbury-Martinez et.al. | 2407.01480 | null |
2024-07-01 | AdaOcc: Adaptive Forward View Transformation and Flow Modeling for 3D Occupancy and Flow Prediction | Dubing Chen et.al. | 2407.01436 | null |
2024-07-01 | StyleShot: A Snapshot on Any Style | Junyao Gao et.al. | 2407.01414 | link |
2024-07-01 | 3D MHD modelling of plasmoid drift following massive material injection in a tokamak | M. Kong et.al. | 2407.01399 | null |
2024-07-01 | PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction | Xuan Yu et.al. | 2407.01349 | null |
2024-07-01 | Shape Synthesis and 3D Ceramic Printing of Non-canonical MIMO Dielectric Resonator Antennas | Binbin Yang et.al. | 2407.01340 | null |
2024-07-01 | Deep Reinforcement Learning for Adverse Garage Scenario Generation | Kai Li et.al. | 2407.01333 | null |
2024-07-01 | Learning Unsigned Distance Fields from Local Shape Functions for 3D Surface Reconstruction | Jiangbei Hu et.al. | 2407.01330 | link |
2024-07-01 | GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting | Chenxin Li et.al. | 2407.01301 | null |
2024-07-01 | CLHOP: Combined Audio-Video Learning for Horse 3D Pose and Shape Estimation | Ci Li et.al. | 2407.01244 | null |
2024-07-01 | SGCCNet: Single-Stage 3D Object Detector With Saliency-Guided Data Augmentation and Confidence Correction Mechanism | Ao Liang et.al. | 2407.01239 | null |
2024-07-01 | Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation | Zihan Gao et.al. | 2407.01220 | link |
2024-07-01 | Cross-Slice Attention and Evidential Critical Loss for Uncertainty-Aware Prostate Cancer Detection | Alex Ling Yu Hung et.al. | 2407.01146 | link |
2024-07-01 | Machine Learning-Assisted 3D Printing of Thermoelectric Materials of Ultrahigh Performances at Room Temperature | Kaidong Song et.al. | 2407.01145 | null |
2024-07-07 | Learning 3D Gaussians for Extremely Sparse-View Cone-Beam CT Reconstruction | Yiqun Lin et.al. | 2407.01090 | link |
2024-07-01 | Multimodal Conditional 3D Face Geometry Generation | Christopher Otto et.al. | 2407.01074 | null |
2024-07-01 | No More Potentially Dynamic Objects: Static Point Cloud Map Generation based on 3D Object Detection and Ground Projection | Soojin Woo et.al. | 2407.01073 | link |
2024-07-01 | Evolutionary Morphology Towards Overconstrained Locomotion via Large-Scale, Multi-Terrain Deep Reinforcement Learning | Yenan Chen et.al. | 2407.01050 | null |
2024-07-01 | Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert | Han EunGi et.al. | 2407.01034 | null |
2024-07-01 | EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting | Chenxin Li et.al. | 2407.01029 | null |
2024-07-01 | Blind Inversion using Latent Diffusion Priors | Weimin Bai et.al. | 2407.01027 | null |
2024-07-01 | PointViG: A Lightweight GNN-based Model for Efficient Point Cloud Analysis | Qiang Zheng et.al. | 2407.00921 | null |
2024-07-01 | Learning Robust 3D Representation from CLIP via Dual Denoising | Shuqing Luo et.al. | 2407.00905 | null |
2024-06-30 | Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation | Adnan Abdullah et.al. | 2407.00848 | null |
2024-07-07 | CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation | Huawei Sun et.al. | 2407.00697 | link |
2024-06-30 | ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding | Quang P. M. Pham et.al. | 2407.00609 | null |
2024-06-30 | Frequency-resolved Raman Thermometry Analysis via a Multi-layer Heat Transfer Model for Bulk and Low-dimensional Materials | Taocheng Yu et.al. | 2407.00602 | null |
2024-06-30 | DCI: An Accurate Quality Assessment Criteria for Protein Complex Structure Models | Wenda Wang et.al. | 2407.00560 | null |
2024-06-29 | Solving combinatorial optimization problems through stochastic Landau-Lifshitz-Gilbert dynamical systems | Dairong Chen et.al. | 2407.00530 | null |
2024-06-29 | Darboux Soft Hair in 3D Asymptotically Flat Spacetimes | Vahid Taghiloo et.al. | 2407.00525 | null |
2024-06-29 | A Medical Low-Back Pain Physical Rehabilitation Dataset for Human Body Movement Analysis | Sao Mai Nguyen et.al. | 2407.00521 | link |
2024-06-29 | Intrinsic PAPR for Point-level 3D Scene Albedo and Shading Editing | Alireza Moazeni et.al. | 2407.00500 | null |
2024-06-29 | ParaPIF: A Parareal Approach for Parallel-in-Time Integration of Particle-in-Fourier schemes | Sriramkrishnan Muralikrishnan et.al. | 2407.00485 | link |
2024-06-29 | Existence of attracting periodic orbits in 3-dimensional strongly 2-cooperative systems | Rami Katz et.al. | 2407.00461 | null |
2024-06-29 | KHNNs: hypercomplex neural networks computations via Keras using TensorFlow and PyTorch | Agnieszka Niemczynowicz et.al. | 2407.00452 | null |
2024-07-04 | Language-Guided Object-Centric Diffusion Policy for Collision-Aware Robotic Manipulation | Hang Li et.al. | 2407.00451 | null |
2024-06-29 | Three-dimensional non-reciprocal transport in photonic topological heterostructure of arbitrary shape | Mudi Wang et.al. | 2407.00440 | null |
2024-07-02 | RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering | Weikai Lin et.al. | 2407.00435 | link |
2024-06-29 | Screening of half-Heuslers with temperature-induced band convergence and enhanced thermoelectric properties | Jinyang Xi et.al. | 2407.00433 | null |
2024-06-29 | Mechanical stresses in pouch cells: a reduced order model | Andrea Giudici et.al. | 2407.00373 | null |
2024-06-29 | SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix | Peng Dai et.al. | 2407.00367 | null |
2024-06-29 | OccFusion: Rendering Occluded Humans with Generative Diffusion Priors | Adam Sun et.al. | 2407.00316 | null |
2024-06-29 | UADSN: Uncertainty-Aware Dual-Stream Network for Facial Nerve Segmentation | Guanghao Zhu et.al. | 2407.00297 | null |
2024-06-28 | SPITE: Simple Polyhedral Intersection Techniques for modified Environments | Stav Ashur et.al. | 2407.00259 | null |
2024-06-28 | Inverting airborne electromagnetic data with machine learning | Michael S. McMillan et.al. | 2407.00257 | null |
2024-06-28 | SemUV: Deep Learning based semantic manipulation over UV texture map of virtual human heads | Anirban Mukherjee et.al. | 2407.00229 | null |
2024-06-28 | Ferromagnetic resonance in 3D-tilted square artificial spin ices | Ghanem Alatteili et.al. | 2407.00202 | null |
2024-06-28 | DCSM 2.0: Deep Conditional Shape Models for Data Efficient Segmentation | Athira J Jacob et.al. | 2407.00186 | null |
2024-06-27 | From Efficient Multimodal Models to World Models: A Survey | Xinji Mai et.al. | 2407.00118 | null |
2024-06-28 | Odd-One-Out: Anomaly Detection by Comparing with Neighbors | Ankan Bhunia et.al. | 2406.20099 | link |
2024-06-28 | HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model | Hieu T. Nguyen et.al. | 2406.20077 | null |
2024-06-28 | ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction | Ding-Jiun Huang et.al. | 2406.20066 | null |
2024-06-28 | Reionization Parameter Inference from 3D Minkowski Functionals of the 21 cm Signals | Kangning Diao et.al. | 2406.20058 | null |
2024-06-28 | SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting | Sara Sabour et.al. | 2406.20055 | null |
2024-06-28 | A fast-filament eruption observed in the H $α$ spectral line. I. Imaging spectroscopy diagnostic | Denis P. Cabezas et.al. | 2406.20020 | null |
2024-07-01 | Text2Robot: Evolutionary Robot Design from Text Descriptions | Ryan P. Ringel et.al. | 2406.19963 | link |
2024-06-28 | 3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints | Yoonkyu Yoo et.al. | 2406.19848 | null |
2024-06-28 | StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction | Jiaheng Zhuang et.al. | 2406.19844 | null |
2024-06-28 | LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation | Xianda Guo et.al. | 2406.19833 | link |
2024-06-28 | EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting | Daiwei Zhang et.al. | 2406.19811 | null |
2024-06-28 | Novel electronic structures from anomalous stackings in NbS $_2$ and MoS$_2$ | Matthew D. Watson et.al. | 2406.19793 | null |
2024-07-01 | Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding | Yifan Tang et.al. | 2406.19791 | null |
2024-07-09 | Duality constraints on thermal spectra of 3d CFTs and 4d quasinormal modes | Sašo Grozdanov et.al. | 2406.19790 | null |
2024-06-28 | Structure-aware World Model for Probe Guidance via Large-scale Self-supervised Pre-train | Haojun Jiang et.al. | 2406.19756 | null |
2024-06-28 | High Precision Microscale 3D Manufacturing of Ultra Low Expansion Glass by Femtosecond Selective Laser Etching | Enrico Casamenti et.al. | 2406.19745 | null |
2024-06-28 | EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans | Nicola Garau et.al. | 2406.19726 | null |
2024-06-28 | Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey | Uchitha Rajapaksha et.al. | 2406.19675 | null |
2024-06-28 | Unstable Retention Behavior in MIFIS FEFET: Accurate Analysis of the Origin by Absolute Polarization Measurement | Song-Hyeon Kuk et.al. | 2406.19618 | null |
2024-07-04 | Deep Temporal Sequence Classification and Mathematical Modeling for Cell Tracking in Dense 3D Microscopy Videos of Bacterial Biofilms | Tanjin Taher Toma et.al. | 2406.19574 | null |
2024-06-27 | What Matters in Detecting AI-Generated Videos like Sora? | Chirui Chang et.al. | 2406.19568 | null |
2024-06-27 | Stereo Vision Based Robot for Remote Monitoring with VR Support | Mohamed Fazil M. S. et.al. | 2406.19498 | null |
2024-06-27 | Propagating Kink Waves in an Open Coronal Magnetic Flux Tube with Gravitational Stratification: Magnetohydrodynamic Simulation and Forward Modelling | Yuhang Gao et.al. | 2406.19474 | null |
2024-06-27 | Efficient and Distributed Large-Scale 3D Map Registration using Tomographic Features | Halil Utku Unlu et.al. | 2406.19461 | link |
2024-06-27 | In LIGO’s Sight? Vigorous Coherent Gravitational Waves from Cooled Collapsar Disks | Ore Gottlieb et.al. | 2406.19452 | null |
2024-06-27 | Shoulder of Dust Rings Formed by Planet-disk Interactions | Jiaqing Bi et.al. | 2406.19438 | null |
2024-06-27 | Lightweight Predictive 3D Gaussian Splats | Junli Cao et.al. | 2406.19434 | link |
2024-06-27 | Looking 3D: Anomaly Detection with 2D-3D Alignment | Ankan Bhunia et.al. | 2406.19393 | link |
2024-06-27 | STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning | Yanan Zhang et.al. | 2406.19362 | null |
2024-06-27 | CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement | Chengwen Zhang et.al. | 2406.19353 | link |
2024-06-28 | LiverUSRecon: Automatic 3D Reconstruction and Volumetry of the Liver with a Few Partial Ultrasound Scans | Kaushalya Sivayogaraj et.al. | 2406.19336 | null |
2024-06-27 | Human Modelling and Pose Estimation Overview | Pawel Knap et.al. | 2406.19290 | null |
2024-07-04 | Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions | Minghan Li et.al. | 2406.19236 | link |
2024-06-27 | Spikes and spines in 3D Lorentzian simplicial quantum gravity | Johanna Borissova et.al. | 2406.19169 | null |
2024-06-27 | Super-resolution imaging using super-oscillatory diffractive neural networks | Hang Chen et.al. | 2406.19126 | null |
2024-06-28 | FAGhead: Fully Animate Gaussian Head from Monocular Videos | Yixin Xuan et.al. | 2406.19070 | null |
2024-06-27 | BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection | Yang Song et.al. | 2406.19048 | null |
2024-06-27 | CLIP3D-AD: Extending CLIP for 3D Few-Shot Anomaly Detection with Multi-View Images Generation | Zuo Zuo et.al. | 2406.18941 | null |
2024-06-27 | RAVE: A Framework for Radar Ego-Velocity Estimation | Vlaho-Josip Štironja et.al. | 2406.18850 | link |
2024-06-27 | Eccentricity and Inclination of Massive Planets Inside Low-density Cavities: Results of 3D Simulations | M. M. Romanova et.al. | 2406.18834 | null |
2024-06-27 | Evolution of Interfacial Hydration Structure Induced by Ion Condensation and Correlation Effects | Han Li et.al. | 2406.18827 | null |
2024-06-26 | Numerical simulations for the Ising model on three dimensional lattices with coordination number equal 5: static and dynamic critical phenomena | Lourdes Bibiana Merino-Solís et.al. | 2406.18782 | null |
2024-07-02 | Computational Fluid Dynamics on Quantum Computers | Madhava Syamlal et.al. | 2406.18749 | null |
2024-07-01 | 3D Feature Distillation with Object-Centric Priors | Georgios Tziafas et.al. | 2406.18742 | null |
2024-06-26 | Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos | Colton Stearns et.al. | 2406.18717 | link |
2024-06-30 | Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo Labeling | Haoran Li et.al. | 2406.18610 | null |
2024-06-26 | On Scaling Up 3D Gaussian Splatting Training | Hexu Zhao et.al. | 2406.18533 | link |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524 | null |
2024-06-26 | Bayesian inverse Navier-Stokes problems: joint flow field reconstruction and parameter learning | Alexandros Kontogiannis et.al. | 2406.18464 | null |
2024-06-26 | GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality | Taoran Yi et.al. | 2406.18462 | null |
2024-06-26 | Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference | Yuan Gao et.al. | 2406.18453 | link |
2024-06-26 | Fast 3D 31P B1+ mapping with a weighted stack of spiral trajectory at 7 Tesla | Mark Widmaier et.al. | 2406.18426 | null |
2024-06-26 | Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling | Abril Corona-Figueroa et.al. | 2406.18422 | link |
2024-06-26 | BiTrack: Bidirectional Offline 3D Multi-Object Tracking Using Camera-LiDAR Data | Kemiao Huang et.al. | 2406.18414 | link |
2024-06-26 | DoubleTake: Geometry Guided Depth Estimation | Mohamed Sayed et.al. | 2406.18387 | null |
2024-06-27 | XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis | Hao Li et.al. | 2406.18360 | null |
2024-06-26 | On Shilnikov’s scenario with a homoclinic orbit in 3D | Hans-Otto Walther et.al. | 2406.18289 | null |
2024-06-26 | RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network | Xiaozhong Ji et.al. | 2406.18284 | null |
2024-06-26 | PlaMo: Plan and Move in Rich 3D Physical Environments | Assaf Hallak et.al. | 2406.18237 | null |
2024-06-26 | Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning | Muhammad Salman Ali et.al. | 2406.18214 | link |
2024-06-26 | GlobalTomo: A global dataset for physics-ML seismic wavefield modeling and FWI | Shiqian Li et.al. | 2406.18202 | link |
2024-06-26 | GS-Octree: Octree-based 3D Gaussian Splatting for Robust Object-level 3D Reconstruction Under Strong Lighting | Jiaze Li et.al. | 2406.18199 | null |
2024-06-26 | VDG: Vision-Only Dynamic Gaussian for Driving Simulation | Hao Li et.al. | 2406.18198 | null |
2024-06-26 | Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models | Xiaolin Hong et.al. | 2406.18159 | null |
2024-06-26 | 3D-MVP: 3D Multiview Pretraining for Robotic Manipulation | Shengyi Qian et.al. | 2406.18158 | null |
2024-06-26 | Photosensitive PEEK Ink Enables Digital Light Processing 3D Printed High-performance Small Architected-Plastics | Ze Zhang et.al. | 2406.18157 | null |
2024-06-26 | SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery | Jian Song et.al. | 2406.18151 | link |
2024-06-26 | B-TMS: Bayesian Traversable Terrain Modeling and Segmentation Across 3D LiDAR Scans and Maps for Enhanced Off-Road Navigation | Minho Oh et.al. | 2406.18138 | null |
2024-06-26 | CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection | Meiying Zhang et.al. | 2406.18129 | null |
2024-06-26 | Open-vocabulary Mobile Manipulation in Unseen Dynamic Environments with 3D Semantic Maps | Dicong Qiu et.al. | 2406.18115 | null |
2024-06-26 | Generation of spatiotemporal acoustic vortices with arbitrarily oriented orbital angular momentum | Shuai Liu et.al. | 2406.18084 | null |
2024-06-26 | Real-time Structure Flow | Juan David Adarve et.al. | 2406.18031 | null |
2024-06-26 | DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image | Qingxuan Wu et.al. | 2406.17988 | null |
2024-06-25 | SonicSense: Object Perception from In-Hand Acoustic Vibration | Jiaxun Liu et.al. | 2406.17932 | null |
2024-06-25 | DeepSense-V2V: A Vehicle-to-Vehicle Multi-Modal Sensing, Localization, and Communications Dataset | Joao Morais et.al. | 2406.17908 | null |
2024-06-25 | On the mechanics of inhaled bronchial transmission of pathogenic microdroplets generated from the upper respiratory tract, with implications for infection onset | Saikat Basu et.al. | 2406.17895 | null |
2024-06-25 | Quantification of Cyclic Topology in Polymer Networks Using 3D Nets | Devosmita Sen et.al. | 2406.17883 | null |
2024-06-27 | Depth-Driven Geometric Prompt Learning for Laparoscopic Liver Landmark Detection | Jialun Pei et.al. | 2406.17858 | link |
2024-06-25 | Point-SAM: Promptable 3D Segmentation Model for Point Clouds | Yuchen Zhou et.al. | 2406.17741 | link |
2024-06-25 | Mask-Guided Attention U-Net for Enhanced Neonatal Brain Extraction and Image Preprocessing | Bahram Jafrasteh et.al. | 2406.17709 | link |
2024-06-25 | A sharp quantitative Alexandrov inequality and applications to volume preserving geometric flows in 3D | Vesa Julin et.al. | 2406.17691 | null |
2024-06-25 | End-to-End Autonomous Driving without Costly Modularization and 3D Manual Annotation | Mingzhe Guo et.al. | 2406.17680 | null |
2024-06-25 | Perovskite nanocrystal self-assemblies in 3D hollow templates | Etsuki Kobiyama et.al. | 2406.17665 | null |
2024-06-25 | MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection | Michelle Adeline et.al. | 2406.17654 | link |
2024-06-25 | Time-varying Extremum Graphs | Somenath Das et.al. | 2406.17652 | null |
2024-06-25 | Video Inpainting Localization with Contrastive Learning | Zijie Lou et.al. | 2406.17628 | link |
2024-06-25 | Cluster-glass behaviour and large magnetocaloric effect in frustrated hyperkagome ferromagnet Li $_2$MgMn$_3$O$_8$ | R. Kolay et.al. | 2406.17623 | null |
2024-06-25 | Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text | Xinyang Li et.al. | 2406.17601 | link |
2024-06-25 | Toward Universal Medical Image Registration via Sharpness-Aware Meta-Continual Learning | Bomin Wang et.al. | 2406.17575 | link |
2024-06-25 | Retrieval-Augmented Code Generation for Situated Action Generation: A Case Study on Minecraft | Chalamalasetti Kranti et.al. | 2406.17553 | null |
2024-06-25 | Synthesis pathways to thin films of stable layered nitrides | Andriy Zakutayev et.al. | 2406.17492 | null |
2024-06-25 | Medical Image Segmentation Using Directional Window Attention | Daniya Najiha Abdul Kareem et.al. | 2406.17471 | link |
2024-06-25 | Additively manufacturable high-strength aluminum alloys with thermally stable microstructures enabled by hybrid machine learning-based design | S. Mohadeseh Taheri-Mousavi et.al. | 2406.17457 | null |
2024-06-25 | Quantization of Carrollian conformal scalar theories | Bin Chen et.al. | 2406.17451 | null |
2024-06-25 | Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model | Zhuoyuan Li et.al. | 2406.17442 | null |
2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | link |
2024-06-25 | Real-Time Remote Control via VR over Limited Wireless Connectivity | H. P. Madushanka et.al. | 2406.17420 | link |
2024-06-25 | SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing | Ruihuang Li et.al. | 2406.17396 | null |
2024-06-25 | NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods | Jonas Kulhanek et.al. | 2406.17345 | null |
2024-06-25 | Masked Generative Extractor for Synergistic Representation and 3D Generation of Point Clouds | Hongliang Zeng et.al. | 2406.17342 | null |
2024-06-27 | Towards Open-set Camera 3D Object Detection | Zhuolin He et.al. | 2406.17297 | null |
2024-06-25 | A refined uniqueness result of Leray’s problem in an infinite-long pipe with the Navier-slip boundary condition | Zijin Li et.al. | 2406.17264 | null |
2024-07-02 | SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation | Xu Liu et.al. | 2406.17249 | link |
2024-06-26 | Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks | Zihao Jin et.al. | 2406.17173 | null |
2024-06-24 | Toward Ubiquitous 3D Object Digitization: A Wearable Computing Framework for Non-Invasive Physical Property Acquisition | Yunxiang Zhang et.al. | 2406.17156 | null |
2024-06-24 | Multi-Aperture Fusion of Transformer-Convolutional Network (MFTC-Net) for 3D Medical Image Segmentation and Visualization | Siyavash Shabani et.al. | 2406.17080 | link |
2024-06-24 | Reducing the Memory Footprint of 3D Gaussian Splatting | Panagiotis Papantonakis et.al. | 2406.17074 | null |
2024-06-24 | The Relation Between Variances of a 3D Density and Its 2D Column Density Revisited | Heesun Yoon et.al. | 2406.17022 | null |
2024-06-24 | Deep Learning for Prediction and Classifying the Dynamical behaviour of Piecewise Smooth Maps | Vismaya V S et.al. | 2406.17001 | null |
2024-06-24 | Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation? | Pallabi Dutta et.al. | 2406.16993 | link |
2024-06-23 | Disruption of the mitochondrial network in a mouse model of Huntington’s disease visualized by in tissue multiscale 3D electron microscopy | Eva Martin Solana et.al. | 2406.16977 | null |
2024-06-19 | The influence of flame-pressure waves collisions on the development and evolution of tulip flames | Chengeng Qian et.al. | 2406.16950 | null |
2024-06-19 | Networked ISAC for Low-Altitude Economy: Coordinated Transmit Beamforming and UAV Trajectory Design | Gaoyuan Cheng et.al. | 2406.16946 | null |
2024-06-24 | A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking | Lorenzo Shaikewitz et.al. | 2406.16837 | link |
2024-06-24 | Experimental and Computational Insights Into the Magnetic Anisotropy and Magnetic Behaviour of Layered Room-Temperature Ferromagnet Cr $_{1.38}$Te$_2$ | Shubham Purwar et.al. | 2406.16831 | null |
2024-06-24 | General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design | Yue Jian et.al. | 2406.16821 | null |
2024-06-24 | ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians | Yufei Liu et.al. | 2406.16815 | null |
2024-06-24 | 3D distortion-free, reduced field of view diffusion-prepared GRE at 3T | Sarah McElroy et.al. | 2406.16809 | null |
2024-06-24 | Deep Learning and Chaos: A combined Approach To Image Encryption and Decryption | Bharath V Nair et.al. | 2406.16792 | null |
2024-06-24 | Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation | Yizheng Wu et.al. | 2406.16776 | link |
2024-06-24 | Lone Pair Induced 1D Character and Weak Cation-anion Interactions: Two Ingredients for Low Thermal Conductivity in Mixed-anion Metal Chalcohalides | Xingchen Shen et.al. | 2406.16744 | null |
2024-06-24 | Convolutional neural network for Lyman break galaxies classification and redshift regression in DESI (Dark Energy Spectroscopic Instrument) | Julien Taran et.al. | 2406.16730 | null |
2024-06-24 | The host of GRB 171205A in 3D – A resolved multiwavelength study of a rare grand-design spiral GRB host | C. C. Thöne et.al. | 2406.16725 | null |
2024-06-24 | μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation | Pierangela Bruno et.al. | 2406.16724 | null |
2024-06-24 | Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image | Jinkun Hao et.al. | 2406.16710 | null |
2024-07-01 | Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling | Min-Seop Kwak et.al. | 2406.16695 | null |
2024-06-24 | Measuring the matter fluctuations in the Local Universe with the ALFALFA catalog | Camila Franco et.al. | 2406.16693 | null |
2024-06-24 | Linac_Gen: integrating machine learning and particle-in-cell methods for enhanced beam dynamics at Fermilab | Abhishek Pathak et.al. | 2406.16630 | null |
2024-06-24 | GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection | Harnaik Dhami et.al. | 2406.16625 | link |
2024-06-24 | Modelling the connection between propagating disturbances and solar spicules | Samuel Skirvin et.al. | 2406.16577 | null |
2024-06-24 | FASTC: A Fast Attentional Framework for Semantic Traversability Classification Using Point Cloud | Yirui Chen et.al. | 2406.16564 | link |
2024-06-24 | PRODIGE – Planet-forming disks in Taurus with NOEMA | R. Franceschi et.al. | 2406.16498 | null |
2024-06-24 | Inferring the scrape-off layer heat flux width in a divertor with a low degree of axisymmetry | C. Marsden et.al. | 2406.16471 | null |
2024-06-24 | Exploration of the deep geothermal potential of Petite-Terre Island in Mayotte | Chrystel Dezayes et.al. | 2406.16454 | null |
2024-06-24 | Uniform Sampling and Visualization of 3D Reluctant Walks | Benjamin Buckley et.al. | 2406.16397 | null |
2024-06-24 | Lesion-Aware Cross-Phase Attention Network for Renal Tumor Subtype Classification on Multi-Phase CT Scans | Kwang-Hyun Uhm et.al. | 2406.16322 | null |
2024-06-24 | Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction | Tong Qin et.al. | 2406.16289 | null |
2024-06-24 | Isolated singularities of 3-dimensional Yang-Mills-Higgs fields | Bo Chen et.al. | 2406.16276 | null |
2024-06-24 | YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals | Sandeep Mishra et.al. | 2406.16273 | null |
2024-06-23 | Composite Material Design for Optimized Fracture Toughness Using Machine Learning | Mohammad Naqizadeh Jahromi et.al. | 2406.16166 | null |
2024-06-23 | Flux-Rope Mediated Turbulent Magnetic Reconnection | Alexander J. B. Russell et.al. | 2406.16149 | null |
2024-06-23 | MLPHand: Real Time Multi-View 3D Hand Mesh Reconstruction via MLP Modeling | Jian Yang et.al. | 2406.16137 | null |
2024-06-25 | X-ray2CTPA: Generating 3D CTPA scans from 2D X-ray conditioning | Noa Cahan et.al. | 2406.16109 | link |
2024-06-23 | LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction | Hengyu Liu et.al. | 2406.16073 | link |
2024-06-23 | DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation | Yueru Luo et.al. | 2406.16072 | link |
2024-06-23 | Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study | Zhe Wang et.al. | 2406.16068 | null |
2024-06-23 | Sub-Riemannian geodesics on the Heisenberg 3D nil-manifold | A. Glutsyuk et.al. | 2406.16065 | null |
2024-06-23 | Logics of polyhedral reachability | Nick Bezhanishvili et.al. | 2406.16056 | null |
2024-06-23 | Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | Yangdi Lu et.al. | 2406.15982 | null |
2024-06-22 | ConnectVR: A Trigger-Action Interface for Creating Agent-based Interactive VR Stories | Mengyu Chen et.al. | 2406.15889 | null |
2024-06-22 | Shape2.5D: A Dataset of Texture-less Surfaces for Depth and Normals Estimation | Muhammad Saif Ullah Khan et.al. | 2406.15831 | link |
2024-06-22 | PointDreamer: Zero-shot 3D Textured Mesh Reconstruction from Colored Point Cloud by 2D Inpainting | Qiao Yu et.al. | 2406.15811 | link |
2024-06-22 | Smart Feature is What You Need | Zhaoxin Hu et.al. | 2406.15805 | link |
2024-06-22 | How to Learn More? Exploring Kolmogorov-Arnold Networks for Hyperspectral Image Classification | Ali Jamali et.al. | 2406.15719 | link |
2024-06-22 | psPRF:Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery | Tongtong Zhang et.al. | 2406.15707 | null |
2024-06-22 | Self-Supervised Alignment Learning for Medical Image Segmentation | Haofeng Li et.al. | 2406.15699 | null |
2024-06-21 | Taming 3DGS: High-Quality Radiance Fields with Limited Resources | Saswat Subhajyoti Mallick et.al. | 2406.15643 | link |
2024-06-21 | Compaction during fragmentation and bouncing produces realistic dust grain porosities in protoplanetary discs | Stéphane Michoulier et.al. | 2406.15622 | null |
2024-06-21 | Optimization of Trajectories for Machine Learning Training in Robot Accuracy Modeling | Blake Hannaford et.al. | 2406.15620 | link |
2024-06-21 | A Topology Scavenger Hunt to Introduce Topological Data Analysis | Lori Ziegelmeier et.al. | 2406.15580 | null |
2024-06-21 | Elucidating Galaxy Population Properties Using a Model-Free Analysis of Quadruply Imaged Quasar Lenses From Large Surveys | John Miller Jr et.al. | 2406.15344 | null |
2024-06-21 | GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation | Chubin Zhang et.al. | 2406.15333 | link |
2024-06-21 | Additive Manufacturing of functionalised atomic vapour cells for next-generation quantum technologies | Feiran Wang et.al. | 2406.15255 | null |
2024-06-21 | Gaussian Splatting to Real World Flight Navigation Transfer with Liquid Networks | Alex Quach et.al. | 2406.15149 | null |
2024-06-21 | Quantum geometrical properties of topological materials | Wei Chen et.al. | 2406.15145 | null |
2024-06-24 | Investigating the impact of 2D gesture representation on co-speech gesture generation | Teo Guichoux et.al. | 2406.15111 | null |
2024-06-21 | Hybrid Intelligent Routing with Optimized Learning (HIROL) for Adaptive Routing Topology management in FANETs | Ch. Naveen Kumar Reddy et.al. | 2406.15105 | null |
2024-06-21 | Balancing The Perception of Cheating Detection, Privacy and Fairness: A Mixed-Methods Study of Visual Data Obfuscation in Remote Proctoring | Suvadeep Mukherjee et.al. | 2406.15074 | null |
2024-06-21 | 3D-Localization of Single Point-Like Gamma Sources with a Coded Aperture Camera | Tobias Meißner et.al. | 2406.15048 | null |
2024-06-21 | SVFormer: A Direct Training Spiking Transformer for Efficient Video Action Recognition | Liutao Yu et.al. | 2406.15034 | null |
2024-06-21 | A3D: Does Diffusion Dream about 3D Alignment? | Savva Ignatyev et.al. | 2406.15020 | null |
2024-06-21 | SO(3) attitude controllers and the alignment of robots with non-constant 3D vector fields | Jesus Bautista et.al. | 2406.14998 | link |
2024-06-21 | Probabilistic and Differentiable Wireless Simulation with Geometric Transformers | Thomas Hehn et.al. | 2406.14995 | null |
2024-06-21 | E2GS: Event Enhanced Gaussian Splatting | Hiroyuki Deguchi et.al. | 2406.14978 | link |
2024-06-21 | CoCPF: Coordinate-based Continuous Projection Field for Ill-Posed Inverse Problem in Imaging | Zixuan Chen et.al. | 2406.14976 | null |
2024-06-21 | VividDreamer: Towards High-Fidelity and Efficient Text-to-3D Generation | Zixuan Chen et.al. | 2406.14964 | null |
2024-06-21 | Gaussian-Informed Continuum for Physical Property Identification and Simulation | Junhao Cai et.al. | 2406.14927 | null |
2024-06-21 | Extraction of 3D trajectories of mandibular condyles from 2D real-time MRI | Karyna Isaieva et.al. | 2406.14925 | null |
2024-06-21 | LLM2FEA: Discover Novel Designs with Generative Evolutionary Multitasking | Melvin Wong et.al. | 2406.14917 | null |
2024-06-21 | MOS: Model Synergy for Test-Time Adaptation on LiDAR-Based 3D Object Detection | Zhuoxiao Chen et.al. | 2406.14878 | null |
2024-06-20 | Symmetries in 3D photoelectron momentum spectroscopy as precursory methods for dichroic and enantiosensitive measurements | Michael Davino et.al. | 2406.14705 | null |
2024-06-19 | 3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data | Siddiqui Muhammad Yasir et.al. | 2406.14581 | null |
2024-06-28 | Epicardium Prompt-guided Real-time Cardiac Ultrasound Frame-to-volume Registration | Long Lei et.al. | 2406.14534 | link |
2024-06-20 | High Bulk Modulus Pentamodes: the Three-Dimensional Metal Water | Giacomo Brambilla et.al. | 2406.14502 | null |
2024-06-20 | Spin Statistics and Surgeries of Topological Solitons in QCD Matter in Magnetic Field | Yuki Amari et.al. | 2406.14419 | null |
2024-06-20 | Benchmarking Monocular 3D Dog Pose Estimation Using In-The-Wild Motion Capture Data | Moira Shooter et.al. | 2406.14412 | null |
2024-06-20 | Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment | Yunshan Qi et.al. | 2406.14360 | null |
2024-06-20 | A tensor model for calibration and imaging with air-coupled ultrasonic sensor arrays | Raphael Müller et.al. | 2406.14355 | null |
2024-06-20 | MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset | Kim Sung-Bin et.al. | 2406.14272 | null |
2024-06-20 | CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information | Jungdae Lee et.al. | 2406.14240 | null |
2024-06-20 | Uncertainty and Self-Supervision in Single-View Depth | Javier Rodriguez-Puigvert et.al. | 2406.14226 | null |
2024-06-20 | Self-Supervised Pretext Tasks for Alzheimer’s Disease Classification using 3D Convolutional Neural Networks on Large-Scale Synthetic Neuroimaging Dataset | Chen Zheng et.al. | 2406.14210 | null |
2024-06-20 | Iterative Sizing Field Prediction for Adaptive Mesh Generation From Expert Demonstrations | Niklas Freymuth et.al. | 2406.14161 | link |
2024-06-20 | Geometric Self-Supervised Pretraining on 3D Protein Structures using Subgraphs | Michail Chatzianastasis et.al. | 2406.14142 | null |
2024-06-20 | Comparing the Effects of Visual, Haptic, and Visuohaptic Encoding on Memory Retention of Digital Objects in Virtual Reality | Lucas Siqueira Rodrigues et.al. | 2406.14139 | link |
2024-06-20 | ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning | Zhongjie Duan et.al. | 2406.14130 | link |
2024-06-20 | Peculiar Velocity Reconstruction From Simulations and Observations Using Deep Learning Algorithms | Yuyu Wang et.al. | 2406.14101 | null |
2024-06-20 | Towards Multi-modality Fusion and Prototype-based Feature Refinement for Clinically Significant Prostate Cancer Classification in Transrectal Ultrasound | Hong Wu et.al. | 2406.14069 | link |
2024-06-20 | Feature Fusion Based on Mutual-Cross-Attention Mechanism for EEG Emotion Recognition | Yimin Zhao et.al. | 2406.14014 | link |
2024-06-20 | A general Liouville-type theorem for the 3D steady-state Magnetic-Bénard system | Oscar Jarrin et.al. | 2406.13952 | null |
2024-06-19 | INFusion: Diffusion Regularized Implicit Neural Representations for 2D and 3D accelerated MRI reconstruction | Yamin Arefeen et.al. | 2406.13895 | null |
2024-06-19 | DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection | Zhuoxiao Chen et.al. | 2406.13891 | link |
2024-06-26 | Splatter a Video: Video Gaussian Representation for Versatile Processing | Yang-Tian Sun et.al. | 2406.13870 | null |
2024-06-19 | RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone Design | Rishabh Anand et.al. | 2406.13839 | link |
2024-06-19 | U-shaped disks in Stokes flow: Chiral sedimentation of a non-chiral particle | Christian Vaquero-Stainer et.al. | 2406.13837 | null |
2024-06-19 | NeRF-Feat: 6D Object Pose Estimation using Feature Rendering | Shishir Reddy Vutukur et.al. | 2406.13796 | null |
2024-06-19 | Insights into the Production of $^{44}$ Ti and Nickel Isotopes in Core-Collapse Supernovae | Tianshu Wang et.al. | 2406.13746 | null |
2024-06-19 | Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases | Xiangde Luo et.al. | 2406.13674 | link |
2024-06-19 | Topological Equivalence Theorem and Double-Copy for Chern-Simons Scattering Amplitudes | Yan-Feng Hang et.al. | 2406.13671 | null |
2024-06-19 | CLAMP: Majorized Plug-and-Play for Coherent 3D LIDAR Imaging | Tony G. Allen et.al. | 2406.13651 | null |
2024-06-19 | 3D Visualization Reveals the Cooling Rate Dependent Crystallization near a Wall in Dense Microgel Systems | M. P. M. Schelling et.al. | 2406.13609 | null |
2024-06-19 | Trusted Video Inpainting Localization via Deep Attentive Noise Learning | Zijie Lou et.al. | 2406.13576 | link |
2024-06-19 | MVSBoost: An Efficient Point Cloud-based 3D Reconstruction | Umair Haroon et.al. | 2406.13515 | null |
2024-06-19 | Assessing the 3D resolution of refocused correlation plenoptic images using a general-purpose image quality estimator | Gianlorenzo Massaro et.al. | 2406.13501 | null |
2024-06-19 | An Efficient yet High-Performance Method for Precise Radar-Based Imaging of Human Hand Poses | Johanna Bräunig et.al. | 2406.13464 | null |
2024-06-19 | Lagrangian multiform structure of discrete and semi-discrete KP systems | Frank W Nijhoff et.al. | 2406.13423 | null |
2024-06-24 | Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images | Haruo Fujiwara et.al. | 2406.13393 | null |
2024-06-19 | Learning the Approach During the Short-loading Cycle Using Reinforcement Learning | Carl Borngrund et.al. | 2406.13366 | null |
2024-06-19 | Deep Learning-Based 3D Instance and Semantic Segmentation: A Review | Siddiqui Muhammad Yasir et.al. | 2406.13308 | null |
2024-06-19 | Situational Instructions Database: Task Guidance in Dynamic Environments | Muhammad Saif Ullah Khan et.al. | 2406.13302 | link |
2024-06-19 | Self-Supervised Diffusion Model for 3-D Seismic Data Reconstruction | Xinyang Wang et.al. | 2406.13252 | null |
2024-06-19 | Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields | Youngin Park et.al. | 2406.13251 | link |
2024-06-19 | MagicItem: Dynamic Behavior Design of Virtual Objects with Large Language Models in a Consumer Metaverse Platform | Ryutaro Kurai et.al. | 2406.13242 | null |
2024-06-19 | Lipid membrane domains control actin network viscoelasticity | Daniel P. Arnold et.al. | 2406.13218 | null |
2024-06-19 | Application of Computer Deep Learning Model in Diagnosis of Pulmonary Nodules | Yutian Yang et.al. | 2406.13205 | null |
2024-06-19 | Dynamical phase-field model of cavity electromagnonic systems | Shihao Zhuang et.al. | 2406.13203 | link |
2024-06-19 | Super-resolution 3D tomography of vector near-fields in dielectric resonators | Bingbing Zhu et.al. | 2406.13171 | null |
2024-06-19 | AntibodyFlow: Normalizing Flow Model for Designing Antibody Complementarity-Determining Regions | Bohao Xu et.al. | 2406.13162 | null |
2024-06-19 | High-Fidelity Facial Albedo Estimation via Texture Quantization | Zimin Ran et.al. | 2406.13149 | null |
2024-06-18 | On the Origin of Solar Hemispheric Helicity Rules: Rise of 3D Magnetic Flux Concentrations through a Background Magnetic Field | Bhishek Manek et.al. | 2406.13104 | null |
2024-06-18 | Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models | Paul Henderson et.al. | 2406.13099 | null |
2024-06-18 | Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings | Ruijie Tang et.al. | 2406.13048 | null |
2024-06-18 | Modeling and Controls of Fluid-Structure Interactions (FSI) in Dynamic Morphing Flight | Bibek Gupta et.al. | 2406.13039 | null |
2024-06-18 | Bounding irrelevant operators in the 3d Gross-Neveu-Yukawa CFTs | Matthew S. Mitchell et.al. | 2406.12974 | null |
2024-06-18 | Low-mass planets falling into gaps with cyclonic vortices | Raúl O. Chametla et.al. | 2406.12813 | null |
2024-06-18 | Probabilistic Temporal Prediction of Continuous Disease Trajectories and Treatment Effects Using Neural SDEs | Joshua Durso-Finley et.al. | 2406.12807 | null |
2024-06-18 | Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video | Xiangming Zhu et.al. | 2406.12769 | null |
2024-06-18 | Tactile SoftHand-A: 3D-Printed, Tactile, Highly-underactuated, Anthropomorphic Robot Hand with an Antagonistic Tendon Mechanism | Haoran Li et.al. | 2406.12731 | null |
2024-06-18 | Concurrent Accretion and Migration of Giant Planets in their Natal Disks with Consistent Accretion Torque | Ya-Ping Li et.al. | 2406.12716 | null |
2024-06-18 | Coarse-Fine Spectral-Aware Deformable Convolution For Hyperspectral Image Reconstruction | Jincheng Yang et.al. | 2406.12703 | null |
2024-06-18 | SUPER: Selfie Undistortion and Head Pose Editing with Identity Preservation | Polina Karpikova et.al. | 2406.12700 | null |
2024-06-18 | Dielectric relaxation in the quantum multiferroics Rb $2$Cu$_2$Mo$_3$O${12}$ and Cs$2$Cu$_2$Mo$_3$O${12}$ | D. Flavián et.al. | 2406.12690 | null |
2024-06-18 | Dynamical traceback age of the Octans young stellar association | P. A. B. Galli et.al. | 2406.12686 | null |
2024-06-18 | An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation | Qin Li et.al. | 2406.12646 | null |
2024-06-18 | Cyclic 2.5D Perceptual Loss for Cross-Modal 3D Image Synthesis: T1 MRI to Tau-PET | Symac Kim et.al. | 2406.12632 | null |
2024-06-18 | Restorer: Solving Multiple Image Restoration Tasks with One Set of Parameters | Jiawei Mao et.al. | 2406.12587 | link |
2024-06-18 | Method for detector description conversion from DD4hep to Filmbox | Zhaoyang Yuan et.al. | 2406.12495 | null |
2024-06-18 | HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors | Panwang Pan et.al. | 2406.12459 | link |
2024-06-18 | Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation | Sophie Loizillon et.al. | 2406.12448 | link |
2024-06-18 | Deep self-supervised learning with visualisation for automatic gesture recognition | Fabien Allemand et.al. | 2406.12440 | link |
2024-06-18 | LOOC: Localizing Organs using Occupancy Networks and Body Surface Depth Images | Pit Henrich et.al. | 2406.12407 | null |
2024-06-18 | Volume enclosed by a flux surface | Robert S. MacKay et.al. | 2406.12372 | null |
2024-06-18 | VIRL: Volume-Informed Representation Learning towards Few-shot Manufacturability Estimation | Yu-hsuan Chen et.al. | 2406.12286 | link |
2024-06-30 | Slot State Space Models | Jindong Jiang et.al. | 2406.12272 | link |
2024-06-18 | Enhancing Single-Slice Segmentation with 3D-to-2D Unpaired Scan Distillation | Xin Yu et.al. | 2406.12254 | null |
2024-06-18 | PCIE_EgoHandPose Solution for EgoExo4D Hand Pose Challenge | Feng Chen et.al. | 2406.12219 | link |
2024-06-18 | Fast Global Localization on Neural Radiance Field | Mangyu Kong et.al. | 2406.12202 | link |
2024-06-20 | TutteNet: Injective 3D Deformations by Composition of 2D Mesh Deformations | Bo Sun et.al. | 2406.12121 | null |
2024-06-17 | 4D Printing of Programmable Digital Metamaterials | Ido Levin et.al. | 2406.12113 | null |
2024-06-17 | Scaling limit of the ground state Bethe roots for the inhomogeneous XXZ spin- $\frac{1}{2}$ chain | Sascha Gehrmann et.al. | 2406.12102 | null |
2024-06-17 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets | Bernhard Kerbl et.al. | 2406.12080 | null |
2024-06-17 | Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint | Xinglong Sun et.al. | 2406.12079 | null |
2024-06-17 | FAWN: Floor-And-Walls Normal Regularization for Direct Neural TSDF Reconstruction | Anna Sokolova et.al. | 2406.12054 | null |
2024-06-17 | Inevitable First Order Phase Transitions in 3D Quantum Hall Systems | Kaiyuan Gu et.al. | 2406.11976 | null |
2024-06-17 | LLaNA: Large Language and NeRF Assistant | Andrea Amaduzzi et.al. | 2406.11840 | null |
2024-06-22 | RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians | Bingling Li et.al. | 2406.11836 | null |
2024-06-17 | Flow Shadowing: A Method to Detect Multiple Flow Headings using an Array of Densely Packed Whisker-inspired Sensors | Teresa A. Kent et.al. | 2406.11829 | null |
2024-06-17 | Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation | Alexander Raistrick et.al. | 2406.11824 | null |
2024-06-17 | MegaScenes: Scene-Level View Synthesis at Scale | Joseph Tung et.al. | 2406.11819 | link |
2024-06-17 | Task Me Anything | Jieyu Zhang et.al. | 2406.11775 | link |
2024-06-17 | Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization | Huaiji Zhou et.al. | 2406.11766 | null |
2024-06-17 | Coronal energy release by MHD avalanches II. EUV line emission from a multi-threaded coronal loop | G. Cozzo et.al. | 2406.11701 | null |
2024-06-17 | Kelvin-Helmholtz instability and heating in oscillating loops perturbed by power-law transverse wave drivers | Konstantinos Karampelas et.al. | 2406.11700 | null |
2024-06-18 | Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting | Junha Hyung et.al. | 2406.11672 | null |
2024-06-17 | AsterX: a new open-source GPU-accelerated GRMHD code for dynamical spacetimes | Jay V. Kalinani et.al. | 2406.11669 | link |
2024-06-17 | Discriminative Hamiltonian Variational Autoencoder for Accurate Tumor Segmentation in Data-Scarce Regimes | Aghiles Kebaili et.al. | 2406.11659 | null |
2024-06-17 | SeamPose: Repurposing Seams as Capacitive Sensors in a Shirt for Upper-Body Pose Tracking | Tianhong Catherine Yu et.al. | 2406.11645 | null |
2024-06-17 | Duoduo CLIP: Efficient 3D Understanding with Multi-View Images | Han-Hung Lee et.al. | 2406.11579 | link |
2024-06-17 | Projecting Radiance Fields to Mesh Surfaces | Adrian Xuan Wei Lim et.al. | 2406.11570 | null |
2024-06-17 | Quiver Polymerisation | Amihay Hanany et.al. | 2406.11561 | null |
2024-06-23 | GA-Unity: A Production-Ready Unity Package for Seamless Integration of Geometric Algebra in Networked Collaborative Applications | Manos Kamarianakis et.al. | 2406.11560 | link |
2024-06-18 | ESI-GAL: EEG Source Imaging-based Kinematics Parameter Estimation for Grasp and Lift Task | Anant Jain et.al. | 2406.11500 | null |
2024-06-17 | Unfolding Time: Generative Modeling for Turbulent Flows in 4D | Abdullah Saydemir et.al. | 2406.11390 | null |
2024-06-17 | Non-LTE abundances of nitrogen in the Sun and reference A-F type stars | L. I. Mashonkina et.al. | 2406.11367 | null |
2024-06-17 | Quantized orbital angular momentums of dipolar magnons and magnetoelectric cavity polaritons | E. O. Kamenetskii et.al. | 2406.11359 | null |
2024-06-17 | Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection | Yecheol Kim et.al. | 2406.11313 | link |
2024-06-17 | Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection | Yunsong Wang et.al. | 2406.11311 | link |
2024-06-17 | Parameter effects on the total intensity of H I Lyα line for a modelled coronal mass ejection and its driven shock | Beili Ying et.al. | 2406.11297 | null |
2024-06-17 | Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding | Yunsong Wang et.al. | 2406.11283 | null |
2024-06-17 | DRIP: Discriminative Rotation-Invariant Pole Landmark Descriptor for 3D LiDAR Localization | Dingrui Li et.al. | 2406.11266 | null |
2024-06-17 | NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | Niu Guanchen et.al. | 2406.11259 | null |
2024-06-17 | Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space | Yuan Wang et.al. | 2406.11253 | null |
2024-06-17 | Long-time behavior toward composite wave of shocks for 3D barotropic navier-stokes system | Moon-Jin Kang et.al. | 2406.11215 | null |
2024-06-17 | Privacy-preserving Pseudonym Schemes for Personalized 3D Avatars in Mobile Social Metaverses | Cheng Su et.al. | 2406.11208 | null |
2024-06-17 | Consistency^2: Consistent and Fast 3D Painting with Latent Consistency Models | Tianfu Wang et.al. | 2406.11202 | link |
2024-06-17 | Vid3D: Synthesis of Dynamic 3D Scenes using 2D Video Diffusion | Rishab Parthasarathy et.al. | 2406.11196 | link |
2024-06-21 | $α$ -SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion | Sanbao Su et.al. | 2406.11021 | null |
2024-06-16 | Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry | Boris Chidlovskii et.al. | 2406.11019 | null |
2024-06-16 | SPEAR: Receiver-to-Receiver Acoustic Neural Warping Field | Yuhang He et.al. | 2406.11006 | link |
2024-06-16 | 3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments | Eduardo Davalos et.al. | 2406.11003 | null |
2024-06-16 | SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection | Lin Liu et.al. | 2406.10907 | null |
2024-06-16 | MV2Cyl: Reconstructing 3D Extrusion Cylinders from Multi-View Images | Eunji Hong et.al. | 2406.10853 | null |
2024-06-16 | CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph | Haitao Lin et.al. | 2406.10840 | link |
2024-06-16 | Physically Embodied Gaussian Splatting: A Realtime Correctable World Model for Robotics | Jad Abou-Chakra et.al. | 2406.10788 | null |
2024-06-15 | Character Animation in AR: Character Animation in AR: a mobile application development study | Sukanya Bhattacharjee et.al. | 2406.10732 | null |
2024-06-15 | Breaking the Memory Wall: A Study of I/O Patterns and GPU Memory Utilization for Hybrid CPU-GPU Offloaded Optimizers | Avinash Maurya et.al. | 2406.10728 | null |
2024-06-15 | Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft | Ian Vyse et.al. | 2406.10724 | link |
2024-06-15 | GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR | Bharat Singh et.al. | 2406.10722 | null |
2024-06-18 | Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection | Guowen Zhang et.al. | 2406.10700 | link |
2024-06-15 | fNeRF: High Quality Radiance Fields from Practical Cameras | Yi Hua et.al. | 2406.10633 | null |
2024-06-15 | NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows | Zhenggang Tang et.al. | 2406.10543 | link |
2024-06-21 | Large Reasoning Models for 3D Floorplanning in EDA: Learning from Imperfections | Fin Amin et.al. | 2406.10538 | null |
2024-06-15 | An hp-Adaptive Sampling Algorithm for Dispersion Relation Reconstruction of 3D Photonic Crystals | Yueqi Wang et.al. | 2406.10523 | null |
2024-06-15 | Full reference point cloud quality assessment using support vector regression | Ryosuke Watanabe et.al. | 2406.10520 | link |
2024-06-15 | Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation | Pengfei Gu et.al. | 2406.10519 | null |
2024-06-15 | Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space | Mohamed Amine Ketata et.al. | 2406.10513 | null |
2024-06-15 | Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM | Yinjie Li et.al. | 2406.10494 | link |
2024-06-15 | Improving Ab-Initio Cryo-EM Reconstruction with Semi-Amortized Pose Inference | Shayan Shekarforoush et.al. | 2406.10455 | null |
2024-06-15 | Radiative Properties of Plasmoids and Plasmoid Mergers in Magnetic Reconnection | Haocheng Zhang et.al. | 2406.10452 | null |
2024-06-14 | 3D correlation imaging for localized phase disturbance mitigation | Francesco V. Pepe et.al. | 2406.10377 | null |
2024-06-14 | Random Close Packing of Semi-Flexible Polymers in Two Dimensions: Emergence of Local and Global Order | Daniel Martinez-Fernandez et.al. | 2406.10376 | null |
2024-06-14 | Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections | Jiacong Xu et.al. | 2406.10373 | null |
2024-06-14 | L4GM: Large 4D Gaussian Reconstruction Model | Jiawei Ren et.al. | 2406.10324 | null |
2024-06-14 | LieRE: Generalizing Rotary Position Encodings | Sophie Ostmeier et.al. | 2406.10322 | link |
2024-06-14 | EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models | Julian Straub et.al. | 2406.10224 | link |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219 | link |
2024-06-24 | NeST: Neural Stress Tensor Tomography by leveraging 3D Photoelasticity | Akshat Dave et.al. | 2406.10212 | null |
2024-06-14 | DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography Reconstruction | Bowen Song et.al. | 2406.10211 | null |
2024-06-14 | Three-dimensional quantum Griffiths singularity in bulk iron-pnictide superconductors | Shao-Bo Liu et.al. | 2406.10193 | null |
2024-06-14 | Impurities with a cusp: general theory and 3d Ising | Gabriel Cuomo et.al. | 2406.10186 | null |
2024-06-14 | MeshPose: Unifying DensePose and 3D Body Mesh reconstruction | Eric-Tuan Lê et.al. | 2406.10180 | link |
2024-06-14 | 4DRecons: 4D Neural Implicit Deformable Objects Reconstruction from a single RGB-D Camera with Geometrical and Topological Regularizations | Xiaoyan Cong et.al. | 2406.10167 | null |
2024-06-14 | MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers | Yiwen Chen et.al. | 2406.10163 | link |
2024-06-14 | Bifurcations of periodic orbits in the 3D secular planetary 3-Body problem: an approach through an integrable Hamiltonian system | Rita Mastroianni et.al. | 2406.10134 | null |
2024-06-14 | Training-free Camera Control for Video Generation | Chen Hou et.al. | 2406.10126 | null |
2024-06-14 | Shelf-Supervised Multi-Modal Pre-Training for 3D Object Detection | Mehar Khurana et.al. | 2406.10115 | link |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications | Li Li et.al. | 2406.10068 | link |
2024-06-14 | Quantitative phase imaging verification in large field-of-view lensless holographic microscopy via two-photon 3D printing | Emilia Wdowiak et.al. | 2406.10020 | null |
2024-06-14 | Real-time, accurate, and open source upper-limb musculoskeletal analysis using a single RGBD camera | Amedeo Ceglia et.al. | 2406.10007 | null |
2024-06-14 | OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control | Yuzhong Huang et.al. | 2406.10000 | null |
2024-06-14 | SemanticSpray++: A Multimodal Dataset for Autonomous Driving in Wet Surface Conditions | Aldi Piroli et.al. | 2406.09945 | null |
2024-06-14 | Nonlinear two-component system of time-fractional PDEs in (2+1)-dimensions: Invariant subspace method combined with variable transformation | P. Prakash et.al. | 2406.09917 | null |
2024-06-23 | OpenECAD: An Efficient Visual Language Model for Computer-Aided Design | Zhe Yuan et.al. | 2406.09913 | null |
2024-06-14 | Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild | Lingni Ma et.al. | 2406.09905 | null |
2024-06-14 | 3D-RPE: Enhancing Long-Context Modeling Through 3D Rotary Position Encoding | Xindian Ma et.al. | 2406.09897 | null |
2024-06-14 | The Chorioallantoic Membrane Model: A 3D in vivo Testbed for Design and Analysis of MC Systems | Maximilian Schäfer et.al. | 2406.09875 | null |
2024-06-14 | GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion | Trapoom Ukarapol et.al. | 2406.09850 | link |
2024-06-14 | Dynamic Decentralized 3D Urban Coverage and Patrol with UAVs | Wai Lun Leong et.al. | 2406.09828 | null |
2024-06-14 | RaNeuS: Ray-adaptive Neural Surface Reconstruction | Yida Wang et.al. | 2406.09801 | link |
2024-06-20 | Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation | Zihan Wang et.al. | 2406.09798 | link |
2024-06-14 | A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion | Kailai Sun et.al. | 2406.09792 | link |
2024-06-14 | Language-Guided Manipulation with Diffusion Policies and Constrained Inpainting | Ce Hao et.al. | 2406.09767 | null |
2024-06-14 | Full-reference Point Cloud Quality Assessment Using Spectral Graph Wavelets | Ryosuke Watanabe et.al. | 2406.09762 | null |
2024-06-14 | Grounding Image Matching in 3D with MASt3R | Vincent Leroy et.al. | 2406.09756 | link |
2024-06-14 | Unified Gaussian Primitives for Scene Representation and Rendering | Yang Zhou et.al. | 2406.09733 | null |
2024-06-14 | Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses | Seungwoo Yoo et.al. | 2406.09728 | null |
2024-06-14 | Jointed Tails Enhance Control of Three-dimensional Body Rotation | Xun Fu et.al. | 2406.09700 | null |
2024-06-14 | The gravitational-wave emission from the explosion of a 15 solar mass star with rotation and magnetic fields | Jade Powell et.al. | 2406.09691 | null |
2024-06-14 | An Intrinsic Vector Heat Network | Alexander Gao et.al. | 2406.09648 | null |
2024-06-19 | Phase-resolving the absorption signatures of water and carbon monoxide in the atmosphere of the ultra-hot Jupiter WASP-121b with GEMINI-S/IGRINS | Joost P. Wardenier et.al. | 2406.09641 | null |
2024-06-13 | Complex Symplectic Contractions and 3d Mirrors | Andrew Dancer et.al. | 2406.09626 | null |
2024-06-13 | DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks | Mohamed Elrefaie et.al. | 2406.09624 | link |
2024-06-13 | Shape optimization for maximizing ionic concentration constrained by steady-state Poisson-Nernst-Planck system | Jiajie Li et.al. | 2406.09616 | null |
2024-06-13 | ImageNet3D: Towards General-Purpose Object-Level 3D Understanding | Wufei Ma et.al. | 2406.09613 | link |
2024-06-13 | Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking | Prithviraj Banerjee et.al. | 2406.09598 | null |
2024-06-13 | Color Equivariant Network | Felix O’Mahony et.al. | 2406.09588 | link |
2024-06-13 | The full-sky Spherical Fourier-Bessel power spectrum in general relativity | Federico Semenzato et.al. | 2406.09545 | null |
2024-06-24 | Test particles in Kaluza-Klein models | Joao Baptista et.al. | 2406.09503 | null |
2024-06-13 | ELF-UA: Efficient Label-Free User Adaptation in Gaze Estimation | Yong Wu et.al. | 2406.09481 | null |
2024-06-13 | Stability of monodomain III-V crystals and antiphase boundaries over a Si monoatomic step | D. Gupta et.al. | 2406.09476 | null |
2024-06-13 | Rethinking Score Distillation as a Bridge Between Image Distributions | David McAllister et.al. | 2406.09417 | null |
2024-06-13 | CodedEvents: Optimal Point-Spread-Function Engineering for 3D-Tracking with Event Cameras | Sachin Shah et.al. | 2406.09409 | null |
2024-06-13 | ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing | Jun-Kun Chen et.al. | 2406.09404 | null |
2024-06-13 | Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion | Linzhan Mou et.al. | 2406.09402 | null |
2024-06-13 | MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations | Ruiyuan Lyu et.al. | 2406.09401 | link |
2024-06-13 | Modeling Ambient Scene Dynamics for Free-view Synthesis | Meng-Li Shih et.al. | 2406.09395 | null |
2024-06-14 | WonderWorld: Interactive 3D Scene Generation from a Single Image | Hong-Xing Yu et.al. | 2406.09394 | null |
2024-06-13 | LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living | Rajatsubhra Chakraborty et.al. | 2406.09390 | null |
2024-06-13 | SimGen: Simulator-conditioned Driving Scene Generation | Yunsong Zhou et.al. | 2406.09386 | null |
2024-06-13 | Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset | Yiming Li et.al. | 2406.09383 | null |
2024-06-13 | GGHead: Fast and Generalizable 3D Gaussian Heads | Tobias Kirschstein et.al. | 2406.09377 | null |
2024-06-13 | LRM-Zero: Training Large Reconstruction Models with Synthesized Data | Desai Xie et.al. | 2406.09371 | link |
2024-06-13 | Perfectly hidden order and Z2 confinement transition in a fully packed monopole liquid | Attila Szabo et.al. | 2406.09336 | null |
2024-06-13 | Instance-level quantitative saliency in multiple sclerosis lesion segmentation | Federico Spagnolo et.al. | 2406.09335 | link |
2024-06-13 | Towards AI Lesion Tracking in PET/CT Imaging: A Siamese-based CNN Pipeline applied on PSMA PET/CT Scans | Stefan P. Hein et.al. | 2406.09327 | null |
2024-06-13 | Less Cybersickness, Please: Demystifying and Detecting Stereoscopic Visual Inconsistencies in VR Apps | Shuqing Li et.al. | 2406.09313 | null |
2024-06-13 | Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models | Ziyi Wu et.al. | 2406.09292 | null |
2024-06-13 | Global smooth solutions by transport noise of 3D Navier-Stokes equations with small hyperviscosity | Antonio Agresti et.al. | 2406.09267 | null |
2024-06-13 | SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution | Soufiane Belharbi et.al. | 2406.09168 | link |
2024-06-13 | ALPHAGMUT: A Rationale-Guided Alpha Shape Graph Neural Network to Evaluate Mutation Effects | Boshen Wang et.al. | 2406.09159 | null |
2024-06-13 | Higher spin swampland conjecture for massive AdS $_{3}$ gravity | R. Sammani et.al. | 2406.09151 | null |
2024-06-14 | Generative AI-based Prompt Evolution Engineering Design Optimization With Vision-Language Model | Melvin Wong et.al. | 2406.09143 | null |
2024-06-13 | The Milky Way as seen by Classical Cepheids I: distances based on mid-infrared photometry | Dorota M. Skowron et.al. | 2406.09113 | null |
2024-06-12 | DeepJEB: 3D Deep Learning-based Synthetic Jet Engine Bracket Dataset | Seongjun Hong et.al. | 2406.09047 | null |
2024-06-13 | Cross-Modal Learning for Anomaly Detection in Fused Magnesium Smelting Process: Methodology and Benchmark | Gaochang Wu et.al. | 2406.09016 | link |
2024-06-13 | AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings | Jamie Watson et.al. | 2406.08960 | null |
2024-06-13 | Preserving Identity with Variational Score for General-purpose 3D Editing | Duong H. Le et.al. | 2406.08953 | null |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-14 | AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis | Swapnil Bhosale et.al. | 2406.08920 | null |
2024-06-13 | Dual Attribute-Spatial Relation Alignment for 3D Visual Grounding | Yue Xu et.al. | 2406.08907 | null |
2024-06-13 | OpenMaterial: A Comprehensive Dataset of Complex Materials for 3D Reconstruction | Zheng Dang et.al. | 2406.08894 | null |
2024-06-13 | Low-Overhead Channel Estimation via 3D Extrapolation for TDD mmWave Massive MIMO Systems Under High-Mobility Scenarios | Binggui Zhou et.al. | 2406.08887 | null |
2024-06-18 | Self-supervised Graph Neural Network for Mechanical CAD Retrieval | Yuhan Quan et.al. | 2406.08863 | null |
2024-06-13 | NeRF Director: Revisiting View Selection in Neural Volume Rendering | Wenhui Xiao et.al. | 2406.08839 | link |
2024-06-13 | BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection | Wenjie Wang et.al. | 2406.08785 | link |
2024-06-13 | Gaussian-Forest: Hierarchical-Hybrid 3D Gaussian Splatting for Compressed Scene Modeling | Fengyi Zhang et.al. | 2406.08759 | null |
2024-06-13 | 3D Building Generation in Minecraft via Large Language Models | Shiying Hu et.al. | 2406.08751 | link |
2024-06-13 | Field investigation of 3D snow settling dynamics under weak atmospheric turbulence | Jiaqi Li et.al. | 2406.08737 | null |
2024-06-13 | AGFA-Net: Attention-Guided and Feature-Aggregated Network for Coronary Artery Segmentation using Computed Tomography Angiography | Xinyun Liu et.al. | 2406.08724 | null |
2024-06-16 | Expressing turbulent kinetic energy as coarse-grained enstrophy or strain deformations | Damiano Capocci et.al. | 2406.08672 | null |
2024-06-12 | Vivid-ZOO: Multi-View Video Generation with Diffusion Model | Bing Li et.al. | 2406.08659 | null |
2024-06-12 | RVT-2: Learning Precise Manipulation from Few Demonstrations | Ankit Goyal et.al. | 2406.08545 | link |
2024-06-07 | Diffusion Models in $\textit{De Novo}$ Drug Design | Amira Alakhdar et.al. | 2406.08511 | null |
2024-06-12 | ICE-G: Image Conditional Editing of 3D Gaussian Splats | Vishnu Jaganathan et.al. | 2406.08488 | null |
2024-06-12 | Real3D: Scaling Up Large Reconstruction Models with Real-World Images | Hanwen Jiang et.al. | 2406.08479 | null |
2024-06-12 | Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models | Yuxuan Xue et.al. | 2406.08475 | null |
2024-06-12 | Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement | Maxime Pietrantoni et.al. | 2406.08463 | null |
2024-06-12 | Visualization of atomic structures on faceted and non-flat surfaces by difference-of-gaussians approach | A. Yu. Aladyshkin et.al. | 2406.08436 | null |
2024-06-12 | Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning | Yuhui Wang et.al. | 2406.08404 | null |
2024-06-12 | LaneCPP: Continuous 3D Lane Detection using Physical Priors | Maximilian Pittner et.al. | 2406.08381 | null |
2024-06-15 | 2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction | Tianqi Chen et.al. | 2406.08374 | null |
2024-06-12 | $\mathbb{Z}_2$ gauge field and topological chirality from Umklapp scattering in twisted graphite | Cong Chen et.al. | 2406.08355 | null |
2024-06-12 | On the application of components manufactured with stereolithographic 3D printing in high vacuum systems | Aleksandar Radic et.al. | 2406.08326 | null |
2024-06-12 | FSH: 3D Representation via Fibonacci Spherical Harmonics | Zikuan Li et.al. | 2406.08308 | link |
2024-06-12 | From Chaos to Clarity: 3DGS in the Dark | Zhihao Li et.al. | 2406.08300 | null |
2024-06-12 | Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata | Dongsu Zhang et.al. | 2406.08292 | null |
2024-06-12 | Runtime Freezing: Dynamic Class Loss for Multi-Organ 3D Segmentation | James Willoughby et.al. | 2406.08217 | null |
2024-06-12 | Category-level Neural Field for Reconstruction of Partially Observed Objects in Indoor Environment | Taekbeom Lee et.al. | 2406.08176 | link |
2024-06-12 | CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer | Hualian Sheng et.al. | 2406.08152 | link |
2024-06-12 | Adversarial Patch for 3D Local Feature Extractor | Yu Wen Pao et.al. | 2406.08102 | link |
2024-06-12 | 3D CBCT Challenge 2024: Improved Cone Beam CT Reconstruction using SwinIR-Based Sinogram and Image Enhancement | Sasidhar Alavala et.al. | 2406.08048 | null |
2024-06-12 | OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Yinan Deng et.al. | 2406.08009 | link |
2024-06-14 | Can Large Language Models Understand Spatial Audio? | Changli Tang et.al. | 2406.07914 | null |
2024-06-12 | Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation | Jiadong Liang et.al. | 2406.07895 | null |
2024-06-12 | Robust 3D Face Alignment with Multi-Path Neural Architecture Search | Zhichao Jiang et.al. | 2406.07873 | null |
2024-06-12 | SynthForge: Synthesizing High-Quality Face Dataset with Controllable 3D Generative Models | Abhay Rawat et.al. | 2406.07840 | null |
2024-06-12 | Sense Less, Generate More: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing | Sina Tayebati et.al. | 2406.07833 | link |
2024-06-13 | Personalized Product Assortment with Real-time 3D Perception and Bayesian Payoff Estimation | Porter Jenkins et.al. | 2406.07769 | null |
2024-06-11 | C3DAG: Controlled 3D Animal Generation using 3D pose guidance | Sandeep Mishra et.al. | 2406.07742 | null |
2024-06-11 | Object-level Scene Deocclusion | Zhengzhe Liu et.al. | 2406.07706 | null |
2024-06-11 | AI Radiologist: Revolutionizing Liver Tissue Segmentation with Convolutional Neural Networks and a Clinician-Friendly GUI | Ayman Al-Kababji et.al. | 2406.07688 | null |
2024-06-11 | Stabilization of a Quadrotor via Energy Shaping | M. Reza J. Harandi et.al. | 2406.07682 | null |
2024-06-11 | Broadband MEMS Microphone Arrays with Reduced Aperture Through 3D-Printed Waveguides | Dennis Laurijssen et.al. | 2406.07663 | null |
2024-06-11 | M-LRM: Multi-view Large Reconstruction Model | Mengfei Li et.al. | 2406.07648 | null |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544 | link |
2024-06-11 | Hearing Anything Anywhere | Mason Wang et.al. | 2406.07532 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516 | null |
2024-06-11 | Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System | SBND Collaboration et.al. | 2406.07514 | null |
2024-06-14 | COMAP Pathfinder – Season 2 results II. Updated constraints on the CO(1-0) power spectrum | N. -O. Stutzer et.al. | 2406.07511 | null |
2024-06-12 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500 | link |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499 | null |
2024-06-11 | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Heng Yu et.al. | 2406.07472 | null |
2024-06-13 | Existence and asymptotic autonomous robustness of random attractors for three-dimensional stochastic globally modified Navier-Stokes equations on unbounded domains | Kush Kinra et.al. | 2406.07460 | null |
2024-06-21 | Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field | Chao Wang et.al. | 2406.07329 | null |
2024-06-11 | Realistic Data Generation for 6D Pose Estimation of Surgical Instruments | Juan Antonio Barragan et.al. | 2406.07328 | link |
2024-06-11 | Experimental Modeling of Chiral Active Robots and a Minimal Model of Non-Gaussian Displacements | Yuxuan Zhou et.al. | 2406.07313 | null |
2024-06-11 | 3D Voxel Maps to 2D Occupancy Maps for Efficient Path Planning for Aerial and Ground Robots | Scott Fredriksson et.al. | 2406.07270 | link |
2024-06-11 | Efficient 3D Molecular Generation with Flow Matching and Scale Optimal Transport | Ross Irwin et.al. | 2406.07266 | null |
2024-06-11 | Haptic Repurposing with GenAI | Haoyu Wang et.al. | 2406.07228 | null |
2024-06-11 | Pushing an Altermagnet to the Ultimate 2D Limit: Evidence of Symmetry Breaking in Monolayers of GdAlSi | Oleg E. Parfenov et.al. | 2406.07172 | null |
2024-06-11 | VoxNeuS: Enhancing Voxel-Based Neural Surface Reconstruction via Gradient Interpolation | Sidun Liu et.al. | 2406.07170 | null |
2024-06-11 | FaceGPT: Self-supervised Learning to Chat about 3D Human Faces | Haoran Wang et.al. | 2406.07163 | null |
2024-06-12 | Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images | Che Liu et.al. | 2406.07146 | null |
2024-06-17 | Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph | Sergey Linok et.al. | 2406.07113 | null |
2024-06-11 | NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images | Yufei Han et.al. | 2406.07111 | null |
2024-06-11 | CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation | Zhongzhen Huang et.al. | 2406.07085 | link |
2024-06-11 | Triage of 3D pathology data via 2.5D multiple-instance learning to guide pathologist assessments | Gan Gao et.al. | 2406.07061 | link |
2024-06-11 | The evolution of coronal shock wave properties and their relation with solar energetic particles | Manon Jarry et.al. | 2406.07058 | null |
2024-06-11 | EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Yining Shi et.al. | 2406.07042 | link |
2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037 | null |
2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032 | null |
2024-06-12 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023 | null |
2024-06-11 | Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion | Xin Yuan et.al. | 2406.06972 | null |
2024-06-11 | Polar alignment of a dusty circumbinary disc – I. Dust ring formation | Jeremy L. Smallwood et.al. | 2406.06971 | null |
2024-06-11 | Dynamical properties of mildly relativistic ejecta produced by the mass-loading of gamma-ray burst jets in dense ambient media | Akihiro Suzuki et.al. | 2406.06939 | null |
2024-06-11 | A Subjective Quality Evaluation of 3D Mesh with Dynamic Level of Detail in Virtual Reality | Duc Nguyen et.al. | 2406.06888 | null |
2024-06-16 | HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction | Jikai Wang et.al. | 2406.06843 | link |
2024-06-10 | All-sky three-dimensional dust density and extinction Maps of the Milky Way out to 2.8 kpc | T. E. Dharmawardena et.al. | 2406.06740 | link |
2024-06-10 | The Imaging Database for Epilepsy And Surgery (IDEAS) | Peter N. Taylor et.al. | 2406.06731 | null |
2024-06-10 | Full transmission of vectorial waves through 3D multiple-scattering media | Ho-Chun Lin et.al. | 2406.06727 | null |
2024-06-10 | PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation | Zhenyu Li et.al. | 2406.06679 | null |
2024-06-10 | Instanton Density Operator in Lattice QCD from Higher Category Theory | Jing-Yuan Chen et.al. | 2406.06673 | null |
2024-06-09 | Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation | Abdul Qayyum et.al. | 2406.06643 | null |
2024-06-06 | SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound | Rishit Dagli et.al. | 2406.06612 | link |
2024-06-10 | IllumiNeRF: 3D Relighting without Inverse Rendering | Xiaoming Zhao et.al. | 2406.06527 | null |
2024-06-10 | GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Haozhe Xie et.al. | 2406.06526 | link |
2024-06-10 | PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction | Danpeng Chen et.al. | 2406.06521 | null |
2024-06-10 | Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Louis Blankemeier et.al. | 2406.06512 | null |
2024-06-10 | QSSEP describes the fluctuations of quantum coherences in the Anderson model | Ludwig Hruza et.al. | 2406.06444 | null |
2024-06-10 | SYM3D: Learning Symmetric Triplanes for Better 3D-Awareness of GANs | Jing Yang et.al. | 2406.06432 | null |
2024-06-10 | MVGamba: Unify 3D Content Generation as State Space Sequence Modeling | Xuanyu Yi et.al. | 2406.06367 | link |
2024-06-14 | A Field-Theoretic Example for Hodge Theory in 3D | A. K. Rao et.al. | 2406.06358 | null |
2024-06-10 | System- and Sample-agnostic Isotropic 3D Microscopy by Weakly Physics-informed, Domain-shift-resistant Axial Deblurring | Jiashu Han et.al. | 2406.06337 | null |
2024-06-10 | Exploring the generation and annihilation of three dimensional nulls through MHD simulations in initially chaotic magnetic field devoid of nulls | Yogesh Kumar Maurya et.al. | 2406.06328 | null |
2024-06-11 | Tuning-Free Visual Customization via View Iterative Self-Attention Control | Xiaojie Li et.al. | 2406.06258 | link |
2024-06-10 | Stabilized Adaptive Steering for 3D Sonar Microphone Arrays with IMU Sensor Fusion | Wouter Jansen et.al. | 2406.06255 | null |
2024-06-10 | Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Xin Jin et.al. | 2406.06216 | link |
2024-06-10 | Revisiting 3D Cartesian Scatterplots with a Novel Plotting Framework and a Survey | Philippos Papaphilippou et.al. | 2406.06146 | null |
2024-06-10 | Space-Time Hopfion Crystals | Wenbo Lin et.al. | 2406.06096 | null |
2024-06-10 | PointABM:Integrating Bidirectional State Space Model with Multi-Head Self-Attention for Point Cloud Analysis | Jia-wei Chen et.al. | 2406.06069 | null |
2024-06-11 | 6DMA Enhanced Wireless Network with Flexible Antenna Position and Rotation: Opportunities and Challenges | Xiaodan Shao et.al. | 2406.06064 | null |
2024-06-10 | Generalizable Human Gaussians from Single-View Image | Jinnan Chen et.al. | 2406.06050 | link |
2024-06-10 | Navigation and 3D Surface Reconstruction from Passive Whisker Sensing | Michael A. Lin et.al. | 2406.06038 | null |
2024-06-10 | From yield stress to elastic instabilities: Tuning the extensional behavior of elastoviscoplastic fluid | Mohamed S. Abdelgawad et.al. | 2406.06001 | null |
2024-06-11 | LOP-Field: Brain-inspired Layout-Object-Position Fields for Robotic Scene Understanding | Jiawei Hou et.al. | 2406.05985 | null |
2024-06-10 | Inter-slice Super-resolution of Magnetic Resonance Images by Pre-training and Self-supervised Fine-tuning | Xin Wang et.al. | 2406.05974 | null |
2024-06-09 | Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering | Yueyu Hu et.al. | 2406.05915 | null |
2024-06-09 | InfoGaussian: Structure-Aware Dynamic Gaussians through Lightweight Information Shaping | Yunchao Zhang et.al. | 2406.05897 | null |
2024-06-09 | RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering | Rui Zhang et.al. | 2406.05852 | null |
2024-06-09 | MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps | Jianhao Zheng et.al. | 2406.05849 | null |
2024-06-09 | 3D-MolT5: Towards Unified 3D Molecule-Text Modeling with 3D Molecular Tokenization | Qizhi Pei et.al. | 2406.05797 | link |
2024-06-09 | A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions | Daizong Liu et.al. | 2406.05785 | link |
2024-06-09 | VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction | Hanlin Chen et.al. | 2406.05774 | null |
2024-06-11 | Large data global existence for coupled massive-massless wave-type systems | Yuan Cai et.al. | 2406.05762 | null |
2024-06-09 | Vision Mamba: Cutting-Edge Classification of Alzheimer’s Disease with 3D MRI Scans | Muthukumar K A et.al. | 2406.05757 | null |
2024-06-09 | Diverse 3D Human Pose Generation in Scenes based on Decoupled Structure | Bowen Dang et.al. | 2406.05691 | null |
2024-06-09 | FlightBench: A Comprehensive Benchmark of Spatial Planning Methods for Quadrotors | Shu-Ang Yu et.al. | 2406.05687 | link |
2024-06-13 | GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement | Peiye Zhuang et.al. | 2406.05649 | null |
2024-06-09 | Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion | Ge Ya Luo et.al. | 2406.05630 | link |
2024-06-09 | Statistical Delay and Error-Rate Bounded QoS Provisioning for AoI-Driven 6G Satellite-Terrestrial Integrated Networks Using FBC | Jingqing Wang et.al. | 2406.05610 | null |
2024-06-08 | Trust the PRoC3S: Solving Long-Horizon Robotics Problems with LLMs and Constraint Satisfaction | Aidan Curtis et.al. | 2406.05572 | null |
2024-06-08 | VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification | Jianmeng Liu et.al. | 2406.05543 | null |
2024-06-11 | EUV polarimetric diagnostics of the solar corona: the Hanle effect of Ne VIII 770 Å | Raveena Khan et.al. | 2406.05539 | null |
2024-06-08 | PAPR in Motion: Seamless Point-level 3D Scene Interpolation | Shichong Peng et.al. | 2406.05533 | null |
2024-06-08 | Survival probability, particle imbalance, and their relationship in quadratic models | Miroslav Hopjan et.al. | 2406.05500 | null |
2024-06-08 | Beatnik: A Novel Global Communication Mini-Application | Jason A. Stewart et.al. | 2406.05490 | null |
2024-06-08 | Field-Based Formalism for Calculating Multi-Qubit Exchange Coupling Rates for Transmon Qubits | Ghazi Khan et.al. | 2406.05473 | null |
2024-06-08 | 2D BAO vs 3D BAO: Hints for new physics? | Ruchika et.al. | 2406.05453 | null |
2024-06-08 | A Generalized Pointing Error Model for FSO Links with Fixed-Wing UAVs for 6G: Analysis and Trajectory Optimization | Hyung-Joo Moon et.al. | 2406.05444 | null |
2024-06-08 | 3D MRI Synthesis with Slice-Based Latent Diffusion Models: Improving Tumor Segmentation Tasks in Data-Scarce Regimes | Aghiles Kebaili et.al. | 2406.05421 | link |
2024-06-08 | Probing Ferroelectric Phase Transitions in Barium Titanate Single Crystals via $\it{in-situ}$ Second Harmonic Generation Microscopy | Benjamin Kirbus et.al. | 2406.05420 | null |
2024-06-08 | Mean-field Chaos Diffusion Models | Sungwoo Park et.al. | 2406.05396 | null |
2024-06-08 | The near horizon dynamics of three-dimensional Einstein gravity | Hamid Reza Afshar et.al. | 2406.05386 | null |
2024-06-08 | Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid | Thanh-Huy Nguyen et.al. | 2406.05349 | null |
2024-06-08 | Deep convolutional demosaicking network for multispectral polarization filter array | Tomoharu Ishiuchi et.al. | 2406.05312 | null |
2024-06-07 | VISTA3D: Versatile Imaging SegmenTation and Annotation model for 3D Computed Tomography | Yufan He et.al. | 2406.05285 | link |
2024-06-07 | Split-and-Fit: Learning B-Reps via Structure-Aware Voronoi Partitioning | Yilin Liu et.al. | 2406.05261 | link |
2024-06-07 | A thermo-hygro computational model to determine the factors dictating cold joint formation in 3D printed concrete | Michal Hlobil et.al. | 2406.05238 | null |
2024-06-07 | The ULS23 Challenge: a Baseline Model and Benchmark Dataset for 3D Universal Lesion Segmentation in Computed Tomography | M. J. J. de Grauw et.al. | 2406.05231 | link |
2024-06-07 | SPARC: Shared Perspective with Avatar Distortion for Remote Collaboration in VR | João Simões et.al. | 2406.05209 | null |
2024-06-07 | Statistical AoI, Delay, and Error-Rate Bounded QoS Provisioning for Satellite-Terrestrial Integrated Networks | Jingqing Wang et.al. | 2406.05165 | null |
2024-06-04 | Fight Scene Detection for Movie Highlight Generation System | Aryan Mathur et.al. | 2406.05152 | null |
2024-06-12 | 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination | Jianing Yang et.al. | 2406.05132 | link |
2024-06-15 | GenHeld: Generating and Editing Handheld Objects | Chaerin Min et.al. | 2406.05059 | link |
2024-06-10 | The formation and survival of cold gas in a magnetized cool-core galaxy cluster | Martin Fournier et.al. | 2406.05044 | null |
2024-06-07 | Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs | Shentong Mo et.al. | 2406.05038 | null |
2024-06-07 | ProMotion: Prototypes As Motion Learners | Yawen Lu et.al. | 2406.04999 | null |
2024-06-07 | CityCraft: A Real Crafter for 3D City Generation | Jie Deng et.al. | 2406.04983 | null |
2024-06-07 | The lens was fabricated by fluidic shaping | Chuanzhu Cheng et.al. | 2406.04937 | null |
2024-06-07 | The 3d-index of the 3d-skein module via the quantum trace map | Stavros Garoufalidis et.al. | 2406.04918 | null |
2024-06-07 | Proton 3D reconstruction with T-odd TMD gluon densities | Alessandro Bacchetta et.al. | 2406.04893 | null |
2024-06-07 | 3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views | Xiaobiao Du et.al. | 2406.04875 | null |
2024-06-07 | Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction | Aarya Patel et.al. | 2406.04861 | null |
2024-06-07 | Predicting Polymer Properties Based on Multimodal Multitask Pretraining | Fanmeng Wang et.al. | 2406.04727 | link |
2024-06-07 | Statistical QoS Provisioning Architecture for 6G Satellite-Terrestrial Integrated Networks | Jingqing Wang et.al. | 2406.04685 | null |
2024-06-07 | Xenon plasma-focused ion beam milling as a method to deterministically fabricate bright and high-purity single-photon sources operating at C-band | Maciej Jaworski et.al. | 2406.04682 | null |
2024-06-07 | MTS-Net: Dual-Enhanced Positional Multi-Head Self-Attention for 3D CT Diagnosis of May-Thurner Syndrome | Yixin Huang et.al. | 2406.04680 | link |
2024-06-14 | XctDiff: Reconstruction of CT Images with Consistent Anatomical Structures from a Single Radiographic Projection Image | Qingze Bai et.al. | 2406.04679 | link |
2024-06-07 | UCDNet: Multi-UAV Collaborative 3D Object Detection Network by Reliable Feature Mapping | Pengju Tian et.al. | 2406.04648 | null |
2024-06-07 | UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection | Yuchao Wang et.al. | 2406.04647 | null |
2024-06-07 | Preparation of high precision aspherical lenses based on micro stereolithography technology | Xiaoying Lu et.al. | 2406.04641 | null |
2024-06-07 | STAR: Skeleton-aware Text-based 4D Avatar Generation with In-Network Motion Retargeting | Zenghao Chai et.al. | 2406.04629 | link |
2024-06-06 | The Onset of Magnetic Reconnection in Dynamically Evolving Current Sheets | James Leake et.al. | 2406.04486 | null |
2024-06-06 | Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning | Amandeep Kumar et.al. | 2406.04413 | link |
2024-05-31 | Physics-enhanced Neural Operator for Simulating Turbulent Transport | Shengyu Chen et.al. | 2406.04367 | null |
2024-06-06 | Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image | Stanislaw Szymanowicz et.al. | 2406.04343 | link |
2024-06-06 | GLACE: Global Local Accelerated Coordinate Encoding | Fangjinhua Wang et.al. | 2406.04340 | link |
2024-06-11 | Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion | Fangfu Liu et.al. | 2406.04338 | null |
2024-06-06 | Coarse-To-Fine Tensor Trains for Compact Visual Representations | Sebastian Loeschcke et.al. | 2406.04332 | link |
2024-06-07 | DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data | Qihao Liu et.al. | 2406.04322 | link |
2024-06-14 | GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions | Salvatore Esposito et.al. | 2406.04254 | null |
2024-06-06 | A Survey on 3D Human Avatar Modeling – From Reconstruction to Generation | Ruihe Wang et.al. | 2406.04253 | null |
2024-06-13 | Gaussian Splatting with Localized Points Management | Haosen Yang et.al. | 2406.04251 | null |
2024-06-06 | Solving Inverse Problems in Protein Space Using Diffusion-Based Priors | Axel Levy et.al. | 2406.04239 | null |
2024-06-06 | Early-time resonances in the three-dimensional wall-bounded axisymmetric Euler and related equations | Sai Swetha Venkata Kolluru et.al. | 2406.04228 | null |
2024-06-06 | Aligning Agents like Large Language Models | Adam Jelley et.al. | 2406.04208 | null |
2024-06-06 | Unified Rapid Mass Transfer | Natalia Ivanova et.al. | 2406.04195 | null |
2024-06-06 | Digital Twin Aided RIS Communication: Robust Beamforming and Interference Management | Sadjad Alikhani et.al. | 2406.04188 | link |
2024-06-06 | A Voxel-based Approach for Simulating Microbial Decomposition in Soil: Comparison with LBM and Improvement of Morphological Models | Mouad Klai et.al. | 2406.04177 | link |
2024-06-06 | Stability of equilibria and bifurcations for a fluid-solid interaction problem | Denis Bonheure et.al. | 2406.04162 | null |
2024-06-06 | Sparse Multi-baseline SAR Cross-modal 3D Reconstruction of Vehicle Targets | Da Li et.al. | 2406.04158 | null |
2024-06-06 | The 3D-PC: a benchmark for visual perspective taking in humans and machines | Drew Linsley et.al. | 2406.04138 | link |
2024-06-06 | Global Parameterization-based Texture Space Optimization | Wei Chen et.al. | 2406.04115 | null |
2024-06-06 | From Tissue Plane to Organ World: A Benchmark Dataset for Multimodal Biomedical Image Registration using Deep Co-Attention Networks | Yifeng Wang et.al. | 2406.04105 | link |
2024-06-06 | How Far Can We Compress Instant-NGP-Based NeRF? | Yihang Chen et.al. | 2406.04101 | link |
2024-06-06 | Floquet Theory in an Irradiated Nodal Surface Semimetal | Bhaskar Pandit et.al. | 2406.04053 | null |
2024-06-13 | Interactive zoom display in smartphone-based digital holographic microscope for 3D imaging | Yuki Nagahama et.al. | 2406.04014 | null |
2024-06-06 | QuickCurve: revisiting slightly non-planar 3D printing | Emilio Ottonello et.al. | 2406.03966 | null |
2024-06-06 | More Bang For Your Buck(et): Fast and Space-efficient Hardware-accelerated Coarse-granular Indexing on GPUs | Justus Henneberg et.al. | 2406.03965 | null |
2024-06-06 | 3D Ultrasound Shear Wave Elastography for Musculoskeletal Tissue Assessment Under Compressive Load: A Feasibility Study | Bryan J. Ranger et.al. | 2406.03962 | null |
2024-06-06 | Vectorized Conditional Neural Fields: A Framework for Solving Time-dependent Parametric Partial Differential Equations | Jan Hagnberger et.al. | 2406.03919 | link |
2024-06-06 | C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction | Yiqun Lin et.al. | 2406.03902 | link |
2024-06-06 | LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model | Yixuan Yang et.al. | 2406.03866 | null |
2024-06-06 | Correlated Electronic Structure and Incipient Flat Bands of the Kagome Superconductor CsCr3Sb5 | Yidian Li et.al. | 2406.03740 | null |
2024-06-06 | Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Diwen Wan et.al. | 2406.03697 | link |
2024-06-06 | Untrained Neural Nets for Snapshot Compressive Imaging: Theory and Algorithms | Mengyu Zhao et.al. | 2406.03694 | link |
2024-06-06 | BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning | Artem Zholus et.al. | 2406.03686 | null |
2024-06-05 | Degrees of Freedom Matter: Inferring Dynamics from Point Trajectories | Yan Zhang et.al. | 2406.03625 | null |
2024-06-05 | Hi5: 2D Hand Pose Estimation with Zero Human Annotation | Masum Hasan et.al. | 2406.03599 | null |
2024-06-05 | BVE + EKF: A viewpoint estimator for the estimation of the object’s position in the 3D task space using Extended Kalman Filters | Sandro Costa Magalhães et.al. | 2406.03591 | null |
2024-06-11 | Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts | Dominik Scheuble et.al. | 2406.03461 | null |
2024-06-05 | CoFie: Learning Compact Neural Surface Representations with Coordinate Fields | Hanwen Jiang et.al. | 2406.03417 | null |
2024-06-04 | Structure-based Drug Design Benchmark: Do 3D Methods Really Dominate? | Kangyu Zheng et.al. | 2406.03403 | link |
2024-06-05 | Gaussian Representation for Deformable Image Registration | Jihe Li et.al. | 2406.03394 | null |
2024-06-05 | SuperFormer: Volumetric Transformer Architectures for MRI Super-Resolution | Cristhian Forigua et.al. | 2406.03359 | link |
2024-06-07 | Strength of Kitaev Interaction in Na $_3$Co$_2$SbO$_6$ and Na$_3$Ni$_2$BiO$_6$ | Zefeng Chen et.al. | 2406.03338 | null |
2024-06-05 | Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation | Maximilian Zenk et.al. | 2406.03323 | null |
2024-06-05 | L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration | Yibo Liu et.al. | 2406.03298 | link |
2024-06-05 | Text-to-Image Rectified Flow as Plug-and-Play Priors | Xiaofeng Yang et.al. | 2406.03293 | link |
2024-06-05 | Matter coupled to 3d Quantum Gravity: One-loop Unitarity | Etera R. Livine et.al. | 2406.03190 | null |
2024-06-05 | Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion | Hao Wen et.al. | 2406.03184 | link |
2024-06-05 | Dynamic 3D Gaussian Fields for Urban Areas | Tobias Fischer et.al. | 2406.03175 | null |
2024-06-05 | Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors | Han Li et.al. | 2406.03105 | link |
2024-06-05 | Modelling the propagation of slow magneto-acoustic waves in a multi-stranded coronal loop | S. Krishna Prasad et.al. | 2406.03093 | null |
2024-06-06 | Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision | Minglei Li et.al. | 2406.03051 | null |
2024-06-05 | Balancing Performance and Efficiency in Zero-shot Robotic Navigation | Dmytro Kuzmenko et.al. | 2406.03015 | null |
2024-06-10 | Event3DGS: Event-Based 3D Gaussian Splatting for High-Speed Robot Egomotion | Tianyi Xiong et.al. | 2406.02972 | null |
2024-06-05 | Adversarial Generation of Hierarchical Gaussians for 3D Generative Model | Sangeek Hyun et.al. | 2406.02968 | link |
2024-06-05 | CAMEL. II. A 3D Coronal Mass Ejection Catalog Based on Coronal Mass Ejection Automatic Detection with Deep Learning | Jiahui Shan et.al. | 2406.02946 | link |
2024-06-05 | Unveiling a Family of Dimerized Quantum Magnets in Ternary Metal Borides | Zhen Zhang et.al. | 2406.02928 | null |
2024-06-04 | Collision-Affording Point Trees: SIMD-Amenable Nearest Neighbors for Fast Collision Checking | Clayton W. Ramsey et.al. | 2406.02807 | link |
2024-06-04 | MeshVPR: Citywide Visual Place Recognition Using 3D Meshes | Gabriele Berton et.al. | 2406.02776 | null |
2024-06-04 | Warped Disk Evolution in Grid-Based Simulations | C. N. Kimmig et.al. | 2406.02754 | null |
2024-06-13 | 3D-HGS: 3D Half-Gaussian Splatting | Haolin Li et.al. | 2406.02720 | link |
2024-06-07 | The Aharony-Bergman-Jafferis-Maldacena theory on a circle | Yi-Xiao Tao et.al. | 2406.02680 | null |
2024-06-04 | The Higgs Branch of 6d (1,0) SCFTs & LSTs with DE-type SUSY Enhancement | Craig Lawrie et.al. | 2406.02670 | null |
2024-05-31 | Production planning in 3DPrinting factories | Juan De Anton et.al. | 2406.02588 | null |
2024-06-04 | Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Mohamed El Amine Boudjoghra et.al. | 2406.02548 | link |
2024-06-06 | Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting | Inkyu Shin et.al. | 2406.02541 | null |
2024-06-04 | Enhancing 2D Representation Learning with a 3D Prior | Mehmet Aygün et.al. | 2406.02535 | null |
2024-06-04 | SatSplatYOLO: 3D Gaussian Splatting-based Virtual Object Detection Ensembles for Satellite Feature Recognition | Van Minh Nguyen et.al. | 2406.02533 | null |
2024-06-04 | RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots | Soroush Nasiriany et.al. | 2406.02523 | null |
2024-06-04 | DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering | Zhongpai Gao et.al. | 2406.02518 | null |
2024-06-04 | CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation | Dejia Xu et.al. | 2406.02509 | null |
2024-06-04 | GenS: Generalizable Neural Surface Reconstruction from Multi-View Images | Rui Peng et.al. | 2406.02495 | link |
2024-06-04 | Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems | Jason Hu et.al. | 2406.02462 | link |
2024-06-04 | RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting | Qi Wang et.al. | 2406.02461 | null |
2024-06-04 | CoNav: A Benchmark for Human-Centered Collaborative Navigation | Changhao Li et.al. | 2406.02425 | link |
2024-06-04 | WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections | Yuze Wang et.al. | 2406.02407 | null |
2024-06-09 | Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning | Jiaxu Wang et.al. | 2406.02370 | null |
2024-06-04 | M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising | Chengjie Wang et.al. | 2406.02263 | null |
2024-06-04 | Can CLIP help CLIP in learning 3D? | Cristian Sbrolli et.al. | 2406.02202 | null |
2024-06-04 | Electromechanical response of saddle points in twisted hBN moiré superlattices | Stefano Chiodini et.al. | 2406.02195 | null |
2024-06-04 | Analysis and Simulation of a Coupled Fluid-Heat System in a Thin, Rough Layer | Tom Freudenberg et.al. | 2406.02150 | link |
2024-06-04 | UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking | Lijun Zhou et.al. | 2406.02147 | null |
2024-06-04 | FaceCom: Towards High-fidelity 3D Facial Shape Completion via Optimization and Inpainting Guidance | Yinglong Li et.al. | 2406.02074 | link |
2024-06-04 | OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding | Yanmin Wu et.al. | 2406.02058 | null |
2024-06-04 | First close-coupling study of the excitation of a large cyclic molecule: collision of c-C5H6 with He | Sándor Demes et.al. | 2406.02036 | null |
2024-06-04 | Bayesian Mesh Optimization for Graph Neural Networks to Enhance Engineering Performance Prediction | Jangseop Park et.al. | 2406.01996 | null |
2024-06-04 | 3D Imaging of Complex Specular Surfaces by Fusing Polarimetric and Deflectometric Information | Jiazhang Wang et.al. | 2406.01994 | null |
2024-06-04 | FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping | Yuzhou Ji et.al. | 2406.01916 | null |
2024-06-04 | HPE-CogVLM: New Head Pose Grounding Task Exploration on Vision Language Model | Yu Tian et.al. | 2406.01914 | null |
2024-06-03 | L-MAGIC: Language Model Assisted Generation of Images with Coherence | Zhipeng Cai et.al. | 2406.01843 | link |
2024-06-03 | A Robust Filter for Marker-less Multi-person Tracking in Human-Robot Interaction Scenarios | Enrico Martini et.al. | 2406.01832 | link |
2024-06-03 | The clustering of Lyman Alpha Emitting galaxies at z=2-3 | M. White et.al. | 2406.01803 | null |
2024-06-03 | 3D transcranial Dynamic Ultrasound Localization Microscopy in the mouse brain using a Row-Column Array | Alice Wu et.al. | 2406.01746 | null |
2024-06-03 | A General 3D Road Model for Motorcycle Racing | Thomas Fork et.al. | 2406.01726 | null |
2024-06-03 | Repeating Partial Tidal Encounters of Sun-like Stars Leading to their Complete Disruption | Chang Liu et.al. | 2406.01670 | null |
2024-06-03 | TAGMol: Target-Aware Gradient-guided Molecule Generation | Vineeth Dorna et.al. | 2406.01650 | link |
2024-06-01 | Equivariant amortized inference of poses for cryo-EM | Larissa de Ruijter et.al. | 2406.01630 | null |
2024-05-28 | Graph structural complexity | A. A. Snarskii et.al. | 2406.01610 | null |
2024-06-03 | MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild | Zeren Jiang et.al. | 2406.01595 | null |
2024-06-03 | Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting | Shaojie Ma et.al. | 2406.01593 | null |
2024-06-03 | Text-guided Controllable Mesh Refinement for Interactive 3D Modeling | Yun-Chun Chen et.al. | 2406.01592 | null |
2024-06-03 | ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation | Guanxing Lu et.al. | 2406.01586 | null |
2024-06-03 | SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model | An-Chieh Cheng et.al. | 2406.01584 | null |
2024-06-03 | Tetrahedron Splatting for 3D Generation | Chun Gu et.al. | 2406.01579 | link |
2024-06-03 | Towards Automating the Retrospective Generation of BIM Models: A Unified Framework for 3D Semantic Reconstruction of the Built Environment | Ka Lung Cheung et.al. | 2406.01480 | link |
2024-06-03 | DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors | Tianyu Huang et.al. | 2406.01476 | link |
2024-06-03 | RaDe-GS: Rasterizing Depth in Gaussian Splatting | Baowen Zhang et.al. | 2406.01467 | link |
2024-06-03 | TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation | Antonio Santo et.al. | 2406.01395 | link |
2024-06-03 | ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds | Ka Lung Cheung et.al. | 2406.01337 | link |
2024-06-03 | 3D Trajectory Design for Energy-constrained Aerial CRNs Under Probabilistic LoS Channel | Hongjiang Lei et.al. | 2406.01313 | null |
2024-06-03 | High-fidelity study of three-dimensional turbulent transonic buffet on wide-span infinite wings | David J. Lusher et.al. | 2406.01232 | null |
2024-06-03 | Report on Methods and Applications for Crafting 3D Humans | Lei Liu et.al. | 2406.01223 | null |
2024-06-04 | GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer | Ding Jia et.al. | 2406.01210 | link |
2024-06-03 | 3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information | Sihan Wen et.al. | 2406.01196 | null |
2024-06-03 | Current and future applications of PVDF-carbon nanomaterials in energy and sensing | Joanna Kujawa et.al. | 2406.01169 | null |
2024-06-03 | Improved Three-Dimensional Reconstructions in Electron Ptychography through Defocus Series Measurements | Marcel Schloz et.al. | 2406.01141 | null |
2024-06-03 | Configuration Space Distance Fields for Manipulation Planning | Yiming Li et.al. | 2406.01137 | null |
2024-06-04 | Towards Practical Single-shot Motion Synthesis | Konstantinos Roditakis et.al. | 2406.01136 | null |
2024-06-03 | Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting | Fang Li et.al. | 2406.01042 | link |
2024-06-03 | Synthetic Data Generation for 3D Myocardium Deformation Analysis | Shahar Zuler et.al. | 2406.01040 | link |
2024-06-03 | Multi-Object Tracking based on Imaging Radar 3D Object Detection | Patrick Palmer et.al. | 2406.01011 | null |
2024-06-03 | Cross-Dimensional Medical Self-Supervised Representation Learning Based on a Pseudo-3D Transformation | Fei Gao et.al. | 2406.00947 | null |
2024-06-11 | LanEvil: Benchmarking the Robustness of Lane Detection to Environmental Illusions | Tianyuan Zhang et.al. | 2406.00934 | null |
2024-06-02 | Using 3-D LiDAR Data for Safe Physical Human-Robot Interaction | Sarthak Arora et.al. | 2406.00869 | null |
2024-06-02 | Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | Yang Cao et.al. | 2406.00830 | link |
2024-06-08 | Covariance-Adaptive Sequential Black-box Optimization for Diffusion Targeted Generation | Yueming Lyu et.al. | 2406.00812 | null |
2024-06-02 | Volume density maps of the 862nm DIB carrier and interstellar dust: a hint for the role of carbon-rich ejecta from AGB stars? | N. L. J. Cox et.al. | 2406.00807 | null |
2024-06-02 | PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency | Yeonsung Jung et.al. | 2406.00798 | null |
2024-06-02 | Hybrid Photoelectron Momentum Microscope at the Soft X-ray Beamline I09 of the Diamond Light Source | Matthias Schmitt et.al. | 2406.00771 | null |
2024-06-02 | Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models | Wenqiang Sun et.al. | 2406.00750 | null |
2024-06-02 | A Survey of Deep Learning Based Radar and Vision Fusion for 3D Object Detection in Autonomous Driving | Di Wu et.al. | 2406.00714 | null |
2024-06-02 | MINER-RRT*: A Hierarchical and Fast Trajectory Planning Framework in 3D Cluttered Environments | Pengyu Wang et.al. | 2406.00706 | null |
2024-06-04 | Lay-A-Scene: Personalized 3D Object Arrangement Using Text-to-Image Priors | Ohad Rahamim et.al. | 2406.00687 | link |
2024-06-02 | Representing Animatable Avatar via Factorized Neural Fields | Chunjin Song et.al. | 2406.00637 | null |
2024-06-02 | T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences | Taeryung Lee et.al. | 2406.00636 | null |
2024-06-02 | Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering | Xingrui Wang et.al. | 2406.00622 | link |
2024-06-04 | SuperGaussian: Repurposing Video Models for 3D Super Resolution | Yuan Shen et.al. | 2406.00609 | null |
2024-06-01 | TacShade A New 3D-printed Soft Optical Tactile Sensor Based on Light, Shadow and Greyscale for Shape Reconstruction | Zhenyu Lu et.al. | 2406.00485 | null |
2024-06-01 | Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging | Jiahua Dong et.al. | 2406.00449 | link |
2024-06-01 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-06-01 | Generating 3D Terrain with 2D Cellular Automata | Nuno Fachada et.al. | 2406.00443 | null |
2024-06-01 | Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture | X. Li et.al. | 2406.00440 | null |
2024-06-01 | MoDGS: Dynamic Gaussian Splatting from Causually-captured Monocular Videos | Qingming Liu et.al. | 2406.00434 | null |
2024-06-01 | ADHM Wilson line defect indices | Hirotaka Hayashi et.al. | 2406.00413 | null |
2024-06-01 | A Broadband 3-D Numerical FEM Study on the Characterization of Dielectric Relaxation Processes in Soils | Norman Wagner et.al. | 2406.00385 | null |
2024-06-01 | Impact disruption of Bjurböle porous chondritic projectile | Tomas Kohout et.al. | 2406.00379 | null |
2024-06-01 | E $^3$ -Net: Efficient E(3)-Equivariant Normal Estimation Network | Hanxiao Wang et.al. | 2406.00347 | null |
2024-06-01 | Details Enhancement in Unsigned Distance Field Learning for High-fidelity 3D Surface Reconstruction | Cheng Xu et.al. | 2406.00346 | null |
2024-06-01 | Adversarial 3D Virtual Patches using Integrated Gradients | Chengzeng You et.al. | 2406.00282 | null |
2024-06-01 | PuzzleFusion++: Auto-agglomerative 3D Fracture Assembly by Denoise and Verify | Zhengqing Wang et.al. | 2406.00259 | link |
2024-05-31 | Mirror Symmetry and Level-rank Duality for 3d $\mathcal{N} = 4$ Rank 0 SCFTs | Thomas Creutzig et.al. | 2406.00138 | null |
2024-05-31 | Bootstrap3D: Improving 3D Content Creation with Synthetic Data | Zeyi Sun et.al. | 2406.00093 | null |
2024-05-31 | Mixed Diffusion for 3D Indoor Scene Synthesis | Siyi Hu et.al. | 2405.21066 | link |
2024-05-31 | Single-beam grating-chip 3D and 1D optical lattices | Alan Bregazzi et.al. | 2405.21065 | null |
2024-05-31 | Quantum state preparation for multivariate functions | Matthias Rosenkranz et.al. | 2405.21058 | null |
2024-05-31 | 3D simulations of convective shell Neon-burning in a massive star | C. Georgy et.al. | 2405.21033 | null |
2024-05-31 | Optimized reinitialization based level-set method within industrial context | Paulin Ferro et.al. | 2405.20958 | null |
2024-05-31 | Direct Laser Acceleration of Bethe-Heitler positrons in laser-channel interactions | Bertrand Martinez et.al. | 2405.20930 | null |
2024-05-31 | MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | Sijin Chen et.al. | 2405.20853 | link |
2024-05-31 | GS-Phong: Meta-Learned 3D Gaussians for Relightable Novel View Synthesis | Yumeng He et.al. | 2405.20791 | link |
2024-06-03 | Stratified Avatar Generation from Sparse Observations | Han Feng et.al. | 2405.20786 | null |
2024-05-31 | A transportable hyperspectral imaging setup based on fast, high-density spectral scanning for in situ quantitative biochemical mapping of fresh tissue biopsies | Luca Giannoni et.al. | 2405.20765 | null |
2024-05-31 | ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model | Yufei Wang et.al. | 2405.20721 | link |
2024-05-31 | Power of Cooperative Supervision: Multiple Teachers Framework for Enhanced 3D Semi-Supervised Object Detection | Jin-Hee Lee et.al. | 2405.20720 | link |
2024-05-31 | R $^2$ -Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction | Ruyi Zha et.al. | 2405.20693 | link |
2024-05-31 | 4Diffusion: Multi-view Video Diffusion Model for 4D Generation | Haiyu Zhang et.al. | 2405.20674 | null |
2024-05-31 | Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation | Shuzhou Yang et.al. | 2405.20669 | link |
2024-05-31 | Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning | Yang Chen et.al. | 2405.20606 | link |
2024-06-03 | Physically Compatible 3D Object Modeling from a Single Image | Minghao Guo et.al. | 2405.20510 | null |
2024-05-30 | What makes a cosmic filament? The dynamical origin and identity of filaments I. fundamentals in 2D | Job Feldbrugge et.al. | 2405.20475 | null |
2024-05-30 | Geometric Characterization of Rat Urinary Bladder Wall During Ex-Vivo Filling Using Micro-Computed Tomography (Micro-CT) | Fatemeh Azari et.al. | 2405.20454 | null |
2024-05-30 | Bulk derivation of TQFT gravity | Anatoly Dymarsky et.al. | 2405.20366 | null |
2024-05-30 | Learning 3D Robotics Perception using Inductive Priors | Muhammad Zubair Irshad et.al. | 2405.20364 | null |
2024-05-30 | Medication Recommendation via Dual Molecular Modalities and Multi-Substructure Distillation | Shi Mu et.al. | 2405.20358 | link |
2024-05-30 | Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image | Kailu Wu et.al. | 2405.20343 | link |
2024-05-30 | OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | Lening Wang et.al. | 2405.20337 | link |
2024-05-30 | RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text | Jiaben Chen et.al. | 2405.20336 | null |
2024-05-30 | VividDream: Generating 3D Scene with Ambient Dynamics | Yao-Chih Lee et.al. | 2405.20334 | null |
2024-05-31 | 4DHands: Reconstructing Interactive Hands in 4D with Transformers | Dixuan Lin et.al. | 2405.20330 | null |
2024-05-30 | GECO: Generative Image-to-3D within a SECOnd | Chen Wang et.al. | 2405.20327 | null |
2024-05-30 | $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323 | link |
2024-05-31 | ParSEL: Parameterized Shape Editing with Language | Aditya Ganeshan et.al. | 2405.20319 | null |
2024-05-30 | Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation | Guillaume Huguet et.al. | 2405.20313 | link |
2024-06-03 | A Pixel Is Worth More Than One 3D Gaussians in Single-View 3D Reconstruction | Jianghao Shen et.al. | 2405.20310 | null |
2024-05-30 | TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes | Minghao Guo et.al. | 2405.20283 | null |
2024-05-30 | CV-VAE: A Compatible Video VAE for Latent Generative Video Models | Sijie Zhao et.al. | 2405.20279 | link |
2024-05-30 | Bridging electronic and classical density-functional theory using universal machine-learned functional approximations | Michelle M. Kelley et.al. | 2405.20270 | null |
2024-05-30 | Chiral $Λ$-$\mathfrak{bms}_4$ symmetry of 3d conformal gravity | Nishant Gupta et.al. | 2405.20244 | null |
2024-05-29 | EvaGaussians: Event Stream Assisted Gaussian Splatting from Blurry Images | Wangbo Yu et.al. | 2405.20224 | null |
2024-05-30 | MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models | Lukas Uzolas et.al. | 2405.20155 | null |
2024-05-30 | Infinite 3D Landmarks: Improving Continuous 2D Facial Landmark Detection | Prashanth Chandran et.al. | 2405.20117 | null |
2024-05-30 | Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting | Kuldeep R Barad et.al. | 2405.20104 | null |
2024-05-31 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-05-31 | N-Dimensional Gaussians for Fitting of High Dimensional Functions | Stavros Diolatzis et.al. | 2405.20067 | link |
2024-05-30 | Chaotic advection in a steady three-dimensional MHD flow | Julien Fontchastagner et.al. | 2405.20021 | null |
2024-05-30 | OpenTM: An Open-source, Single-GPU, Large-scale Thermal Microstructure Design Framework | Yuchen Quan et.al. | 2405.19991 | link |
2024-06-05 | PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting | Qiaowei Miao et.al. | 2405.19957 | link |
2024-05-30 | The backreaction of stellar wobbling on accretion discs of massive protostars | D. M. -A. Meyer et.al. | 2405.19905 | null |
2024-06-10 | IReNe: Instant Recoloring of Neural Radiance Fields | Alessio Mazzucchelli et.al. | 2405.19876 | null |
2024-05-30 | Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar | Wouter Jansen et.al. | 2405.19869 | null |
2024-05-30 | KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation | Fengyuan Yang et.al. | 2405.19833 | link |
2024-05-30 | Gated Fields: Learning Scene Reconstruction from Gated Videos | Andrea Ramazzina et.al. | 2405.19819 | null |
2024-05-30 | GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis | Boming Zhao et.al. | 2405.19745 | null |
2024-05-30 | Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes | Yong-Qiang Mao et.al. | 2405.19735 | null |
2024-05-30 | HINT: Learning Complete Human Neural Representations from Limited Viewpoints | Alessandro Sanvito et.al. | 2405.19712 | null |
2024-05-30 | DNPM: A Neural Parametric Model for the Synthesis of Facial Geometric Details | Haitao Cao et.al. | 2405.19688 | null |
2024-05-30 | Fully Test-Time Adaptation for Monocular 3D Object Detection | Hongbin Lin et.al. | 2405.19682 | null |
2024-05-30 | View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields | Haodi He et.al. | 2405.19678 | link |
2024-05-30 | GaussianRoom: Improving 3D Gaussian Splatting with SDF Guidance and Monocular Cues for Indoor Scene Reconstruction | Haodong Xiang et.al. | 2405.19671 | null |
2024-05-30 | CSANet: Channel Spatial Attention Network for Robust 3D Face Alignment and Reconstruction | Yilin Liu et.al. | 2405.19659 | null |
2024-05-30 | Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian | Wei Sun et.al. | 2405.19657 | null |
2024-05-30 | FaceLift: Semi-supervised 3D Facial Landmark Localization | David Ferman et.al. | 2405.19646 | null |
2024-05-30 | TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM | Peifeng Jiang et.al. | 2405.19614 | null |
2024-05-30 | SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations | Yujiao Jiang et.al. | 2405.19609 | null |
2024-05-30 | SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation | Junjie Zhang et.al. | 2405.19586 | null |
2024-05-29 | Real-Time Dynamic Robot-Assisted Hand-Object Interaction via Motion Primitives | Mingqi Yuan et.al. | 2405.19531 | null |
2024-05-29 | Enabling Visual Recognition at Radio Frequency | Haowen Lai et.al. | 2405.19516 | null |
2024-05-29 | Data-Efficient Discovery of Hyperelastic TPMS Metamaterials with Extreme Energy Dissipation | Maxine Perroni-Scharf et.al. | 2405.19507 | null |
2024-05-29 | Caustics of a Paraboloid and Apollonius Problem | Yagub N. Aliyev et.al. | 2405.19484 | null |
2024-05-29 | Leveraging Generative AI for Smart City Digital Twins: A Survey on the Autonomous Generation of Data, Scenarios, 3D City Models, and Urban Designs | Haowen Xu et.al. | 2405.19464 | null |
2024-05-29 | Understanding Grasp Synergies during Reach-to-grasp using an Instrumented Data Glove | Subhash Pratap et.al. | 2405.19430 | null |
2024-06-09 | LLMs Meet Multimodal Generation and Editing: A Survey | Yingqing He et.al. | 2405.19334 | link |
2024-05-29 | NPGA: Neural Parametric Gaussian Avatars | Simon Giebenhain et.al. | 2405.19331 | null |
2024-05-29 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models | Tianrun Chen et.al. | 2405.19326 | null |
2024-05-29 | DGD: Dynamic 3D Gaussians Distillation | Isaac Labe et.al. | 2405.19321 | null |
2024-05-29 | Neural Isometries: Taming Transformations for Equivariant ML | Thomas W. Mitchel et.al. | 2405.19296 | link |
2024-05-29 | 3D Neural Edge Reconstruction | Lei Li et.al. | 2405.19295 | null |
2024-05-29 | Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification | Michail Mamalakis et.al. | 2405.19204 | null |
2024-05-30 | $E^{3}$ Gen: Efficient, Expressive and Editable Avatars Generation | Weitian Zhang et.al. | 2405.19203 | null |
2024-05-29 | Strong solution of the three-dimensional $(3D)$ incompressible magneto-hydrodynamic $(MHD)$ equationss with a modified damping | Maroua Ltifi et.al. | 2405.19174 | null |
2024-05-29 | Greedy Kernel Methods for Approximating Breakthrough Curves for Reactive Flow from 3D Porous Geometry Data | Robin Herkert et.al. | 2405.19170 | null |
2024-05-29 | Dress Anyone : Automatic Physically-Based Garment Pattern Refitting | Hsiao-yu Chen et.al. | 2405.19148 | null |
2024-05-29 | MHD simulations of the space weather in Proxima b: Habitability conditions and radio emission | Luis Peña-Moñino et.al. | 2405.19116 | null |
2024-05-29 | Diagrammatic Representations of Higher-Dimensional Topological Orders | Yizhou Huang et.al. | 2405.19077 | null |
2024-06-02 | Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design | Markus J. Buehler et.al. | 2405.19076 | link |
2024-05-29 | Fracture metamaterials with on-demand crack paths enabled by bending | Lucie Domino et.al. | 2405.19061 | null |
2024-05-29 | PointNetPGAP-SLC: A 3D LiDAR-based Place Recognition Approach with Segment-level Consistency Training for Mobile Robots in Horticulture | T. Barros et.al. | 2405.19038 | link |
2024-05-30 | An implementation of tensor product patch smoothers on GPU | Cu Cui et.al. | 2405.19004 | null |
2024-05-29 | A structure-preserving scheme for computing effective diffusivity and anomalous diffusion phenomena of random flows | Tan Zhang et.al. | 2405.19003 | null |
2024-05-29 | EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture | Jiaqi Xu et.al. | 2405.18991 | link |
2024-05-29 | UniIF: Unified Molecule Inverse Folding | Zhangyang Gao et.al. | 2405.18968 | null |
2024-05-29 | NbSe ${2}$’s charge density wave collapse in the (LaSe)${1.14}$(NbSe${2}$)${2}$ misfit layer compound | Ludovica Zullo et.al. | 2405.18939 | null |
2024-05-29 | Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding | Junjie Fei et.al. | 2405.18937 | null |
2024-05-29 | Stochastic Continuation of Trajectories in the Circular Restricted Three-Body Problem via Differential Algebra | Giacomo Acciarini et.al. | 2405.18909 | null |
2024-05-29 | 4Doodle: Two-handed Gestures for Immersive Sketching of Architectural Models | Fernando Fonseca et.al. | 2405.18887 | null |
2024-05-29 | Learning Mixture-of-Experts for General-Purpose Black-Box Discrete Optimization | Shengcai Liu et.al. | 2405.18884 | link |
2024-05-29 | Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy | Zijie Jiang et.al. | 2405.18863 | null |
2024-05-29 | Electron acceleration and transport in the 2023-03-06 solar flare | Alexey Kuznetsov et.al. | 2405.18850 | null |
2024-05-31 | MEGA: Masked Generative Autoencoder for Human Mesh Recovery | Guénolé Fiche et.al. | 2405.18839 | null |
2024-05-29 | Evaluating Zero-Shot GPT-4V Performance on 3D Visual Question Answering Benchmarks | Simranjit Singh et.al. | 2405.18831 | null |
2024-05-29 | Visual Servoing Based on 3D Features: Design and Implementation for Robotic Insertion Tasks | Antonio Rosales et.al. | 2405.18830 | null |
2024-05-29 | LP-3DGS: Learning to Prune 3D Gaussian Splatting | Zhaoliang Zhang et.al. | 2405.18784 | link |
2024-05-29 | PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram | Sifan Zhou et.al. | 2405.18734 | null |
2024-05-29 | Efficient Learning in Chinese Checkers: Comparing Parameter Sharing in Multi-Agent Reinforcement Learning | Noah Adhikari et.al. | 2405.18733 | link |
2024-05-29 | Development of a Novel Impedance-Controlled Quasi-Direct-Drive Robotic Hand | Jay Best et.al. | 2405.18730 | null |
2024-05-29 | Towards an exact electronic quantum many-body treatment of Kondo correlation in magnetic impurities | Tianyu Zhu et.al. | 2405.18709 | link |
2024-05-30 | Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction | Xuehao Gao et.al. | 2405.18700 | null |
2024-05-29 | Learning Diffeomorphism for Image Registration with Time-Continuous Networks using Semigroup Regularization | Mohammadjavad Matinkia et.al. | 2405.18684 | link |
2024-05-29 | Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering | Ido Sobol et.al. | 2405.18677 | null |
2024-05-29 | Exploring Automated Contouring Across Institutional Boundaries: A Deep Learning Approach with Mouse Micro-CT Datasets | Lu Jiang et.al. | 2405.18676 | null |
2024-05-28 | Track Initialization and Re-Identification for~3D Multi-View Multi-Object Tracking | Linh Van Ma et.al. | 2405.18606 | link |
2024-05-28 | Video2MR: Automatically Generating Mixed Reality 3D Instructions by Augmenting Extracted Motion from 2D Videos | Keiichi Ihara et.al. | 2405.18565 | null |
2024-05-28 | REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment | Haonan Han et.al. | 2405.18525 | link |
2024-05-28 | TripletMix: Triplet Data Augmentation for 3D Understanding | Jiaze Wang et.al. | 2405.18523 | null |
2024-05-28 | Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication | Yunuo Chen et.al. | 2405.18515 | null |
2024-05-27 | Past activity of Sgr A* is unlikely to affect the local cosmic-ray spectrum up to the TeV regime | Martin Fournier et.al. | 2405.18447 | null |
2024-05-28 | GFlow: Recovering 4D World from Monocular Video | Shizun Wang et.al. | 2405.18426 | null |
2024-05-28 | 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting | Qihang Zhang et.al. | 2405.18424 | null |
2024-05-30 | 3D StreetUnveiler with Semantic-Aware 2DGS | Jingwei Xu et.al. | 2405.18416 | null |
2024-05-28 | Global solutions to the Euler-Coriolis system | Xiao Ren et.al. | 2405.18390 | null |
2024-05-28 | Brain Tumor Segmentation (BraTS) Challenge 2024: Meningioma Radiotherapy Planning Automated Segmentation | Dominic LaBella et.al. | 2405.18383 | null |
2024-05-28 | Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | Yifan Bai et.al. | 2405.18361 | null |
2024-05-28 | Low-power Rapid Planar Superconducting Logic Devices | Nikolay Gusarov et.al. | 2405.18309 | null |
2024-05-28 | Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention | Weitai Kang et.al. | 2405.18295 | null |
2024-05-28 | Continuous Transition between Bosonic Fractional Chern Insulator and Superfluid | Hongyu Lu et.al. | 2405.18269 | null |
2024-05-28 | Population III star formation in the presence of turbulence, magnetic fields and ionizing radiation feedback | Piyush Sharda et.al. | 2405.18265 | null |
2024-05-28 | SubDLe: identification of substructures in cosmological simulations with deep learning | Michela Esposito et.al. | 2405.18257 | null |
2024-05-28 | NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields | Amandine Brunetto et.al. | 2405.18213 | null |
2024-05-28 | Render and Diffuse: Aligning Image and Action Spaces for Diffusion-based Behaviour Cloning | Vitalis Vosylius et.al. | 2405.18196 | null |
2024-05-30 | NegGS: Negative Gaussian Splatting | Artur Kasymov et.al. | 2405.18163 | link |
2024-05-28 | One-form symmetries and the 3d $\mathcal{N}=2$ $A$-model: Topologically twisted indices for any $G$ | Cyril Closset et.al. | 2405.18141 | null |
2024-05-28 | A Grid-Free Fluid Solver based on Gaussian Spatial Representation | Jingrui Xing et.al. | 2405.18133 | null |
2024-05-28 | EG4D: Explicit Generation of 4D Object without Score Distillation | Qi Sun et.al. | 2405.18132 | link |
2024-05-28 | Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters | Jinkyu Yim et.al. | 2405.18093 | link |
2024-05-28 | Modular functors from non-semisimple 3d TFTs | Aaron Hofer et.al. | 2405.18038 | null |
2024-05-28 | RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields | Mihnea-Bogdan Jurca et.al. | 2405.18033 | link |
2024-05-28 | A Calibration Tool for Refractive Underwater Vision | Felix Seegräber et.al. | 2405.18018 | null |
2024-05-28 | Diagnostics of magnetohydrodynamic modes in the ISM through synchrotron polarization statistics | Parth Pavaskar et.al. | 2405.17985 | null |
2024-05-28 | FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | Yunsong Wang et.al. | 2405.17958 | link |
2024-05-28 | Self-supervised Pre-training for Transferable Multi-modal Perception | Xiaohao Xu et.al. | 2405.17942 | link |
2024-05-28 | ToonCrafter: Generative Cartoon Interpolation | Jinbo Xing et.al. | 2405.17933 | null |
2024-05-28 | A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction | Bin Zhang et.al. | 2405.17891 | null |
2024-05-29 | HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction | Haoyu Zhao et.al. | 2405.17872 | link |
2024-05-28 | Towards Video Codec Performance Evaluation: A Rate-Energy-Distortion Perspective | Geetha Ramasubbu et.al. | 2405.17866 | null |
2024-05-28 | Ferromagnetic ferroelectricity due to the Kugel-Khomskii mechanism of the orbital ordering assisted by atomic Hund’s second rule effects | I. V. Solovyev et.al. | 2405.17864 | null |
2024-05-30 | Deform3DGS: Flexible Deformation for Fast Surgical Scene Reconstruction with Gaussian Splatting | Shuojue Yang et.al. | 2405.17835 | link |
2024-05-28 | Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh | Xiangjun Gao et.al. | 2405.17811 | null |
2024-05-28 | SafeguardGS: 3D Gaussian Primitive Pruning While Avoiding Catastrophic Scene Destruction | Yongjae Lee et.al. | 2405.17793 | link |
2024-05-28 | RealityEffects: Augmenting 3D Volumetric Videos with Object-Centric Annotation and Dynamic Visual Effects | Jian Liao et.al. | 2405.17711 | link |
2024-05-29 | DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos | Linhan Wang et.al. | 2405.17705 | link |
2024-05-27 | High fidelity simulations of unstart phenomena in a scramjet inlet due to angle of attack | Jeremy Redding et.al. | 2405.17671 | null |
2024-05-27 | GarmentCodeData: A Dataset of 3D Made-to-Measure Garments With Sewing Patterns | Maria Korosteleva et.al. | 2405.17609 | link |
2024-05-27 | GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane | Yansong Qu et.al. | 2405.17596 | null |
2024-05-27 | Driving asymmetric red supergiants winds with binary interactions | Camille Landri et.al. | 2405.17563 | null |
2024-05-27 | Assessment of Left Atrium Motion Deformation Through Full Cardiac Cycle | Abdul Qayyum et.al. | 2405.17518 | null |
2024-05-27 | GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang et.al. | 2405.17429 | link |
2024-05-27 | Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model | Kuan-Chih Huang et.al. | 2405.17427 | link |
2024-05-27 | Benchmarking and Improving Bird’s Eye View Perception Robustness in Autonomous Driving | Shaoyuan Xie et.al. | 2405.17426 | link |
2024-05-27 | Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection | Shuai Zeng et.al. | 2405.17422 | link |
2024-05-27 | Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control | Zhengfei Kuang et.al. | 2405.17414 | null |
2024-05-27 | Occlusion Handling in 3D Human Pose Estimation with Perturbed Positional Encoding | Niloofar Azizi et.al. | 2405.17397 | null |
2024-05-27 | EASI-Tex: Edge-Aware Mesh Texturing from Single Image | Sai Raj Kishore Perla et.al. | 2405.17393 | null |
2024-05-27 | Predict joint angle of body parts based on sequence pattern recognition | Amin Ahmadi Kasani et.al. | 2405.17369 | null |
2024-05-27 | EM-GANSim: Real-time and Accurate EM Simulation Using Conditional GANs for 3D Indoor Scenes | Ruichen Wang et.al. | 2405.17366 | null |
2024-05-27 | DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Refocusing,Defocus Rendering and Blur Removal | Yujie Wang et.al. | 2405.17351 | null |
2024-05-27 | All-day Depth Completion | Vadim Ezhov et.al. | 2405.17315 | null |
2024-05-27 | Non-Abelian Hopf-Euler insulators | Wojciech J. Jankowski et.al. | 2405.17305 | null |
2024-05-27 | Surface reconstruction of sampled textiles via Morse theory | Franco Coltraro et.al. | 2405.17257 | null |
2024-05-27 | GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping | Junyoung Seo et.al. | 2405.17251 | link |
2024-05-27 | The Three Hundred project: Estimating the dependence of gas filaments on the mass of galaxy clusters | Sara Santoni et.al. | 2405.17239 | null |
2024-05-27 | Flow control of three-dimensional cylinders transitioning to turbulence via multi-agent reinforcement learning | P. Suárez et.al. | 2405.17210 | null |
2024-05-29 | Memorize What Matters: Emergent Scene Decomposition from Multitraverse | Yiming Li et.al. | 2405.17187 | link |
2024-05-27 | Closing the net on transient sources of ultra-high-energy cosmic rays | Sullivan Marafico et.al. | 2405.17179 | null |
2024-05-27 | Physical and chemical modifications of polymeric surface for enhanced epithelial cells adhesion | Laura M. S. dos Santos et.al. | 2405.17160 | null |
2024-05-27 | SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing | Yong-Qiang Mao et.al. | 2405.17140 | null |
2024-05-27 | PanoTree: Autonomous Photo-Spot Explorer in Virtual Reality Scenes | Tomohiro Hayase et.al. | 2405.17136 | null |
2024-05-27 | DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge | Yifan Mao et.al. | 2405.17102 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-27 | BDC-Occ: Binarized Deep Convolution Unit For Binarized Occupancy Network | Zongkai Zhang et.al. | 2405.17037 | link |
2024-05-27 | SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving | Avinash Nittur Ramesh et.al. | 2405.17030 | null |
2024-05-27 | Structural cohesive element for the modelling of delamination in composite laminates without the cohesive zone limit | Xiaopeng Ai et.al. | 2405.17018 | null |
2024-05-27 | $\text{Di}^2\text{Pose}$ : Discrete Diffusion Model for Occluded 3D Human Pose Estimation | Weiquan Wang et.al. | 2405.17016 | null |
2024-05-28 | MotionLLM: Multimodal Motion-Language Learning with Large Language Models | Qi Wu et.al. | 2405.17013 | link |
2024-05-27 | Collective Perception Datasets for Autonomous Driving: A Comprehensive Review | Sven Teufel et.al. | 2405.16973 | null |
2024-05-27 | PASTA: Pathology-Aware MRI to PET Cross-Modal Translation with Diffusion Models | Yitong Li et.al. | 2405.16942 | link |
2024-05-27 | Construction of birational trilinear volumes via tensor rank criteria | Laurent Busé et.al. | 2405.16936 | null |
2024-05-28 | SA-GS: Semantic-Aware Gaussian Splatting for Large Scene Reconstruction with Geometry Constrain | Butian Xiong et.al. | 2405.16923 | null |
2024-05-27 | A Cross-Dataset Study for Text-based 3D Human Motion Retrieval | Léore Bensabath et.al. | 2405.16909 | null |
2024-05-27 | PivotMesh: Generic 3D Mesh Generation via Pivot Vertices Guidance | Haohan Weng et.al. | 2405.16890 | null |
2024-05-27 | Part123: Part-aware 3D Reconstruction from a Single-view Image | Anran Liu et.al. | 2405.16888 | null |
2024-05-27 | CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild | Xingqun Qi et.al. | 2405.16874 | null |
2024-05-27 | ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2405.16873 | null |
2024-05-27 | RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-based 3D Neural Modeling | Tianhang Wang et.al. | 2405.16868 | null |
2024-05-27 | Clustering-based Learning for UAV Tracking and Pose Estimation | Jiaping Xiao et.al. | 2405.16867 | null |
2024-05-27 | NCIDiff: Non-covalent Interaction-generative Diffusion Model for Improving Reliability of 3D Molecule Generation Inside Protein Pocket | Joongwon Lee et.al. | 2405.16861 | null |
2024-05-27 | Sync4D: Video Guided Controllable Dynamics for Physics-Based 4D Generation | Zhoujie Fu et.al. | 2405.16849 | null |
2024-05-29 | PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting | Zipeng Wang et.al. | 2405.16829 | null |
2024-05-27 | Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection | Gihyun Kwon et.al. | 2405.16823 | null |
2024-05-27 | Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels | Yikai Wang et.al. | 2405.16822 | null |
2024-06-04 | Extreme Compression of Adaptive Neural Images | Leo Hoshikawa et.al. | 2405.16807 | null |
2024-05-27 | DualContrast: Unsupervised Disentangling of Content and Transformations with Implicit Parameterization | Mostofa Rafid Uddin et.al. | 2405.16796 | null |
2024-05-29 | 3D Reconstruction with Fast Dipole Sums | Hanyu Chen et.al. | 2405.16788 | null |
2024-05-27 | Probabilistic Height Grid Terrain Mapping for Mining Shovels using LiDAR | Vedant Bhandari et.al. | 2405.16774 | null |
2024-05-27 | Transport of Algebraic Structure to Latent Embeddings | Samuel Pfrommer et.al. | 2405.16763 | link |
2024-05-27 | LLM-Based Cooperative Agents using Information Relevance and Plan Validation | SeungWon Seo et.al. | 2405.16751 | null |
2024-05-28 | CARL: A Framework for Equivariant Image Registration | Hastings Greer et.al. | 2405.16738 | link |
2024-05-26 | ELG Spectroscopic Systematics Analysis of the DESI Data Release 1 | Jiaxi Yu et.al. | 2405.16657 | link |
2024-05-26 | Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models | Hanwen Liang et.al. | 2405.16645 | null |
2024-05-26 | A Survey of Multimodal Large Language Model from A Data-centric Perspective | Tianyi Bai et.al. | 2405.16640 | link |
2024-05-26 | Content and Salient Semantics Collaboration for Cloth-Changing Person Re-Identification | Qizao Wang et.al. | 2405.16597 | link |
2024-05-28 | ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling | Francesca Babiloni et.al. | 2405.16570 | null |
2024-05-26 | Map-based Modular Approach for Zero-shot Embodied Question Answering | Koya Sakamoto et.al. | 2405.16559 | link |
2024-05-26 | Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians | Erik Sandström et.al. | 2405.16544 | link |
2024-06-02 | Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors | Soumava Paul et.al. | 2405.16517 | null |
2024-05-26 | Multi-Modal UAV Detection, Classification and Tracking Algorithm – Technical Report for CVPR 2024 UG2 Challenge | Tianchen Deng et.al. | 2405.16464 | link |
2024-05-26 | 3D View Optimization for Improving Image Aesthetics | Taichi Uchida et.al. | 2405.16443 | null |
2024-05-26 | Variational Offline Multi-agent Skill Discovery | Jiayu Chen et.al. | 2405.16386 | null |
2024-05-25 | Video Prediction Models as General Visual Encoders | James Maier et.al. | 2405.16382 | null |
2024-06-03 | Non-hyperbolic 3-manifolds and 3D field theories for 2D Virasoro minimal models | Dongmin Gang et.al. | 2405.16377 | null |
2024-05-25 | Classical dynamics of the antiferromagnetic Heisenberg $S=1/2$ spin ladder | David A. Dahlbom et.al. | 2405.16315 | link |
2024-05-25 | Neural Network-Based Tracking and 3D Reconstruction of Baseball Pitch Trajectories from Single-View 2D Video | Jhen Hsieh et.al. | 2405.16296 | null |
2024-05-25 | N-BVH: Neural ray queries with bounding volume hierarchies | Philippe Weier et.al. | 2405.16237 | link |
2024-05-28 | VOODOO XP: Expressive One-Shot Head Reenactment for VR Telepresence | Phong Tran et.al. | 2405.16204 | null |
2024-05-25 | Dynamic Scattering Arrays for Simultaneous Electromagnetic Processing and Radiation in Holographic MIMO Systems | Davide Dardari et.al. | 2405.16174 | null |
2024-05-25 | Efficient Quantum Circuit Encoding of Object Information in 2D Ray Casting | Seungjae Lee et.al. | 2405.16132 | null |
2024-05-25 | Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation | Huizhou Chen et.al. | 2405.16099 | null |
2024-05-25 | Unveiling the 3D Morphology of Epitaxial GaAs/AlGaAs Quantum Dots | Yiteng Zhang et.al. | 2405.16073 | null |
2024-05-25 | DiffuBox: Refining 3D Object Detection with Point Diffusion | Xiangyu Chen et.al. | 2405.16034 | link |
2024-05-25 | 3D reconstruction of a million atoms by multiple-section local-orbital tomography | Liangze Mao et.al. | 2405.16007 | null |
2024-05-24 | ExactDreamer: High-Fidelity Text-to-3D Content Creation via Exact Score Matching | Yumin Zhang et.al. | 2405.15914 | link |
2024-05-24 | Investigating Turbulence Effects on Magnetic Reconnection Rates Through High-Resolution Three-Dimensional Resistive Magnetohydrodynamical Simulations | Giovani H. Vicentin et.al. | 2405.15909 | link |
2024-05-24 | Score Distillation via Reparametrized DDIM | Artem Lukoianov et.al. | 2405.15891 | link |
2024-05-24 | SpotNet: An Image Centric, Lidar Anchored Approach To Long Range Perception | Louis Foucard et.al. | 2405.15843 | null |
2024-05-23 | Efficient Point Transformer with Dynamic Token Aggregating for Point Cloud Processing | Dening Lu et.al. | 2405.15827 | null |
2024-05-23 | 3D Learnable Supertoken Transformer for LiDAR Point Cloud Scene Segmentation | Dening Lu et.al. | 2405.15826 | null |
2024-05-27 | First-principles studies of fermiology in topological phases of bulk ZrTe $_5$ | Chao Chen Ye et.al. | 2405.15698 | null |
2024-05-24 | UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes | Ted Lentsch et.al. | 2405.15688 | link |
2024-05-24 | LAM3D: Large Image-Point-Cloud Alignment Model for 3D Reconstruction from Single Image | Ruikai Cui et.al. | 2405.15622 | null |
2024-05-24 | DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation | Xiankang He et.al. | 2405.15619 | null |
2024-05-24 | Modified 3D Massive Abelian 2-From Theory with a Single Pseudo-Scalar Field: BRST Approach | S. K. Panja et.al. | 2405.15588 | null |
2024-05-24 | Open-Vocabulary SAM3D: Understand Any 3D Scene | Hanchen Tai et.al. | 2405.15580 | null |
2024-05-24 | Feature Splatting for Better Novel View Synthesis with Low Overlap | T. Berriel Martins et.al. | 2405.15518 | link |
2024-05-24 | Learning to Discretize Denoising Diffusion ODEs | Vinh Tong et.al. | 2405.15506 | link |
2024-05-24 | GSDeformer: Direct Cage-based Deformation for 3D Gaussian Splatting | Jiajun Huang et.al. | 2405.15491 | null |
2024-05-24 | Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer | Zichen Geng et.al. | 2405.15439 | null |
2024-05-28 | Dipolar Droplets at 3D-1D Crossover | Maciej Pylak et.al. | 2405.15433 | null |
2024-05-24 | Throughput Requirements for RAN Functional Splits in 3D-Networks | MohammadAmin Vakilifard et.al. | 2405.15432 | null |
2024-05-24 | Volumetric Primitives for Modeling and Rendering Scattering and Emissive Media | Jorge Condor et.al. | 2405.15425 | null |
2024-05-24 | Luban: Building Open-Ended Creative Agents via Autonomous Embodied Verification | Yuxuan Guo et.al. | 2405.15414 | null |
2024-05-24 | Exploring Baryon Resonances with Transition Generalized Parton Distributions: Status and Perspectives | Stefan Diehl et.al. | 2405.15386 | null |
2024-05-24 | CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation | Xia Li et.al. | 2405.15385 | null |
2024-05-24 | Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features | Lichuan Ji et.al. | 2405.15343 | null |
2024-05-24 | An iterative closest point algorithm for marker-free 3D shape registration of continuum robots | Matthias K. Hoffmann et.al. | 2405.15336 | link |
2024-05-24 | Challenges and Opportunities in 3D Content Generation | Ke Zhao et.al. | 2405.15335 | null |
2024-05-24 | Diff3DS: Generating View-Consistent 3D Sketch via Differentiable Curve Rendering | Yibo Zhang et.al. | 2405.15305 | null |
2024-05-30 | High-field magnetoelectric coupling and successive magnetic transitions in Mn-doped polar antiferromagnet Ni3TeO6 | J. H. Zhang et.al. | 2405.15297 | null |
2024-05-24 | 3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving | Boyi Sun et.al. | 2405.15286 | link |
2024-05-24 | Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding | Yuhang Liu et.al. | 2405.15274 | null |
2024-05-24 | Fast 3D Molecule Generation via Unified Geometric Optimal Transport | Haokai Hong et.al. | 2405.15252 | null |
2024-05-24 | Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving | Jia He et.al. | 2405.15241 | null |
2024-05-24 | Automating the Diagnosis of Human Vision Disorders by Cross-modal 3D Generation | Li Zhang et.al. | 2405.15239 | link |
2024-05-24 | PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning | Qingdong He et.al. | 2405.15214 | null |
2024-05-24 | Meta-meshing and triangulating lattice structures at a large scale | Qiang Zou et.al. | 2405.15197 | null |
2024-05-24 | DisC-GS: Discontinuity-aware Gaussian Splatting | Haoxuan Qu et.al. | 2405.15196 | null |
2024-05-24 | MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method | Pan Liao et.al. | 2405.15176 | null |
2024-05-24 | Label-efficient Semantic Scene Completion with Scribble Annotations | Song Wang et.al. | 2405.15170 | link |
2024-05-24 | Optimal Reference Nodes Deployment for Positioning Seafloor Anchor Nodes | Wei Huang et.al. | 2405.15153 | null |
2024-05-27 | HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting | Yuanhao Cai et.al. | 2405.15125 | link |
2024-05-24 | GS-Hider: Hiding Messages into 3D Gaussian Splatting | Xuanyu Zhang et.al. | 2405.15118 | null |
2024-05-23 | Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning | Fangjun Li et.al. | 2405.15064 | link |
2024-05-23 | Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training | Xianzhi Du et.al. | 2405.15052 | link |
2024-05-23 | NeCGS: Neural Compression for 3D Geometry Sets | Siyu Ren et.al. | 2405.15034 | link |
2024-05-23 | CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner | Weiyu Li et.al. | 2405.14979 | link |
2024-06-03 | EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting | Jiaxu Wang et.al. | 2405.14959 | link |
2024-05-23 | PILOT: Equivariant diffusion for pocket conditioned de novo ligand generation with multi-objective guidance via importance sampling | Julian Cremer et.al. | 2405.14925 | null |
2024-05-30 | An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models | Jiahao Sun et.al. | 2405.14870 | link |
2024-05-23 | PuzzleAvatar: Assembling 3D Avatars from Personal Albums | Yuliang Xiu et.al. | 2405.14869 | link |
2024-05-23 | Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis | Basile Van Hoorick et.al. | 2405.14868 | null |
2024-06-01 | Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer | Shuang Wu et.al. | 2405.14832 | null |
2024-05-23 | Evaluating Vulnerability of Chiplet-Based Systems to Contactless Probing Techniques | Aleksa Deric et.al. | 2405.14821 | null |
2024-06-03 | Interacting phase diagram of twisted bilayer MoTe $_2$ in magnetic field | Minxuan Wang et.al. | 2405.14811 | null |
2024-05-24 | Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation | Hongxu Jiang et.al. | 2405.14802 | link |
2024-05-23 | Convolutional Neural Network Model Observers Discount Signal-like Anatomical Structures During Search in Virtual Digital Breast Tomosynthesis Phantoms | Aditya Jonnalagadda et.al. | 2405.14720 | null |
2024-05-23 | Necessity of Quantizable Geometry for Quantum Gravity | Abhishek Kumar Mehta et.al. | 2405.14692 | null |
2024-05-23 | Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond | Zhechao Wang et.al. | 2405.14674 | link |
2024-05-23 | Flatten Anything: Unsupervised Neural Surface Parameterization | Qijian Zhang et.al. | 2405.14633 | link |
2024-05-23 | SE3D: A Framework For Saliency Method Evaluation In 3D Imaging | Mariusz Wiśniewski et.al. | 2405.14584 | link |
2024-05-23 | LDM: Large Tensorial SDF Model for Textured Mesh Generation | Rengan Xie et.al. | 2405.14580 | link |
2024-05-23 | Multistable Shape from Shading Emerges from Patch Diffusion | Xinran Nicole Han et.al. | 2405.14530 | null |
2024-05-23 | ArchesWeather: An efficient AI weather forecasting model at 1.5° resolution | Guillaume Couairon et.al. | 2405.14527 | link |
2024-05-23 | Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks | Xingguang Jiang et.al. | 2405.14520 | null |
2024-05-23 | A Brisk Estimator for the Angular Multipoles (BEAM) of the redshift space bispectrum | Sukhdeep Singh Gill et.al. | 2405.14513 | null |
2024-05-23 | MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes | Ruiyuan Gao et.al. | 2405.14475 | null |
2024-06-01 | TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing | Teng Xu et.al. | 2405.14455 | null |
2024-05-24 | RoGS: Large Scale Road Surface Reconstruction based on 2D Gaussian Splatting | Zhiheng Feng et.al. | 2405.14342 | link |
2024-05-23 | MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | Jiuming Liu et.al. | 2405.14338 | null |
2024-05-24 | Autoregressive Image Diffusion: Generation of Image Sequence and Application in MRI | Guanxiong Luo et.al. | 2405.14327 | link |
2024-05-24 | D-MiSo: Editing Dynamic 3D Scenes using Multi-Gaussians Soup | Joanna Waczyńska et.al. | 2405.14276 | link |
2024-05-23 | Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models | Yifan Zhang et.al. | 2405.14271 | link |
2024-05-23 | NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation | Chaokang Jiang et.al. | 2405.14241 | link |
2024-05-23 | Boosting Medical Image-based Cancer Detection via Text-guided Supervision from Reports | Guangyu Guo et.al. | 2405.14230 | null |
2024-05-23 | Eidos: Efficient, Imperceptible Adversarial 3D Point Clouds | Hanwei Zhang et.al. | 2405.14210 | null |
2024-05-23 | Multi-view Remote Sensing Image Segmentation With SAM priors | Zipeng Qi et.al. | 2405.14171 | null |
2024-05-23 | Non-unique solutions for electron MHD | Mimi Dai et.al. | 2405.14127 | null |
2024-05-22 | Tough Cortical Bone-Inspired Tubular Architected Cement-based Material | Shashank Gupta et.al. | 2405.14035 | null |
2024-05-24 | BrainMorph: A Foundational Keypoint Model for Robust and Flexible Brain MRI Registration | Alan Q. Wang et.al. | 2405.14019 | link |
2024-05-22 | MagicPose4D: Crafting Articulated Models with Appearance and Motion Control | Hao Zhang et.al. | 2405.14017 | null |
2024-05-26 | RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar | Fangqiang Ding et.al. | 2405.14014 | link |
2024-05-22 | TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System | Diogo Lavado et.al. | 2405.13989 | null |
2024-05-22 | Numerical Simulations of 3D Ion Crystal Dynamics in a Penning Trap using the Fast Multipole Method | John Zaris et.al. | 2405.13973 | null |
2024-05-22 | DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus | Yu Chen et.al. | 2405.13943 | link |
2024-05-22 | Multi-Zone Modeling of Black Hole Accretion and Feedback in 3D GRMHD: Bridging Vast Spatial and Temporal Scales | Hyerin Cho et.al. | 2405.13887 | null |
2024-05-22 | Enhancing lattice kinetic schemes for fluid dynamics with Lattice-Equivariant Neural Networks | Giulio Ortali et.al. | 2405.13850 | null |
2024-05-22 | Diffusing Winding Gradients (DWG): A Parallel and Scalable Method for 3D Reconstruction from Unoriented Point Clouds | Weizhou Liu et.al. | 2405.13839 | null |
2024-05-22 | Multi-Type Point Cloud Autoencoder: A Complete Equivariant Embedding for Molecule Conformation and Pose | Michael Kilgour et.al. | 2405.13791 | link |
2024-05-22 | Monocular Gaussian SLAM with Language Extended Loop Closure | Tian Lan et.al. | 2405.13748 | null |
2024-05-24 | ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models | Rui Xu et.al. | 2405.13729 | null |
2024-05-22 | Metabook: An Automatically Generated Augmented Reality Storybook Interaction System to Improve Children’s Engagement in Storytelling | Yibo Wang et.al. | 2405.13701 | null |
2024-05-30 | The metamorphosis of semi-classical mechanisms of confinement: From monopoles on ${\mathbb R}^3 \times S^1$ to center-vortices on ${\mathbb R}^2 \times T^2$ | Canberk Güvendik et.al. | 2405.13696 | null |
2024-05-22 | Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances | Licheng Shen et.al. | 2405.13694 | null |
2024-05-22 | Context and Geometry Aware Voxel Transformer for Semantic Scene Completion | Zhu Yu et.al. | 2405.13675 | link |
2024-05-22 | EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views | Yuhang Yang et.al. | 2405.13659 | null |
2024-05-22 | Fully automated construction of three-dimensional finite element simulations from Optical Coherence Tomography | Ross Straughan et.al. | 2405.13643 | null |
2024-05-22 | Spectral Analysis and Asymptotic Decay of the Solutions to Multilayered Structure-Stokes Fluid Interaction PDE System | Pelin Guven Geredeli et.al. | 2405.13612 | null |
2024-05-22 | Euclid. I. Overview of the Euclid mission | Euclid Collaboration et.al. | 2405.13491 | null |
2024-05-22 | A distinct halo population revealed from 3D non-LTE magnesium abundances | T. Matsuno et.al. | 2405.13486 | null |
2024-05-22 | Uncovering gauge-dependent critical order-parameter correlations by a stochastic gauge fixing at O( $N$)$^$ and Ising$^$ continuous transitions | Claudio Bonati et.al. | 2405.13485 | null |
2024-05-22 | Machine learning for exoplanet detection in high-contrast spectroscopy Combining cross correlation maps and deep learning on medium-resolution integral-field spectra | Rakesh Nath-Ranga et.al. | 2405.13468 | link |
2024-05-22 | Kinematics of Abdominal Aortic Aneurysms | Mostafa Jamshidian et.al. | 2405.13377 | null |
2024-05-22 | SIGGesture: Generalized Co-Speech Gesture Synthesis via Semantic Injection with Large-Scale Pre-Training Diffusion Models | Qingrong Cheng et.al. | 2405.13336 | null |
2024-05-22 | Deuterium fractionation of the starless core L 1498 | Sheng-Jun Lin et.al. | 2405.13317 | null |
2024-05-22 | Hybrid Multihead Attentive Unet-3D for Brain Tumor Segmentation | Muhammad Ansab Butt et.al. | 2405.13304 | null |
2024-05-21 | Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos | Jayroop Ramesh et.al. | 2405.13235 | link |
2024-05-21 | Deep operator learning-based surrogate models for aerothermodynamic analysis of AEDC hypersonic waverider | Khemraj Shukla et.al. | 2405.13234 | null |
2024-05-21 | Empowering Urban Traffic Management: Elevated 3D LiDAR for Data Collection and Advanced Object Detection Analysis | Nawfal Guefrachi et.al. | 2405.13202 | null |
2024-05-21 | CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers | Andrew Marmon et.al. | 2405.13195 | null |
2024-05-21 | Adaptive coupling of 3D and 2D fluid flow models | Pratik Suchde et.al. | 2405.13165 | null |
2024-05-21 | Local gravitational instability of two-component thick discs in three dimensions | Carlo Nipoti et.al. | 2405.13123 | null |
2024-05-21 | Multiboundary wormholes and OPE statistics | Jan de Boer et.al. | 2405.13111 | null |
2024-05-21 | A Survey of Artificial Intelligence in Gait-Based Neurodegenerative Disease Diagnosis | Haocong Rao et.al. | 2405.13082 | link |
2024-05-21 | Comprehensive Multimodal Deep Learning Survival Prediction Enabled by a Transformer Architecture: A Multicenter Study in Glioblastoma | Ahmed Gomaa et.al. | 2405.12963 | null |
2024-05-21 | Exact predicates, exact constructions and combinatorics for mesh CSG | Bruno Lévy et.al. | 2405.12949 | null |
2024-05-21 | Enabling Additive Manufacturing Part Inspection of Digital Twins via Collaborative Virtual Reality | Vuthea Chheang et.al. | 2405.12931 | null |
2024-05-21 | Implicit-ARAP: Efficient Handle-Guided Deformation of High-Resolution Meshes and Neural Fields via Local Patch Meshing | Daniele Baieri et.al. | 2405.12895 | null |
2024-05-21 | Epitaxial RuO $_2$ and IrO$_2$ films by pulsed laser deposition on TiO$_2$ (110) | Philipp Keßler et.al. | 2405.12878 | null |
2024-05-21 | Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension | Runwei Guan et.al. | 2405.12821 | link |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | Dark-Field X-Ray Microscopy with Structured Illumination for Three-Dimensional Imaging | Doğa Gürsoy et.al. | 2405.12799 | null |
2024-05-21 | A Novel Methodology for Autonomous Planetary Exploration Using Multi-Robot Teams | Sarah Swinton et.al. | 2405.12790 | null |
2024-05-21 | Self-Supervised Modality-Agnostic Pre-Training of Swin Transformers | Abhiroop Talasila et.al. | 2405.12781 | link |
2024-05-21 | Neural Operator for Accelerating Coronal Magnetic Field Model | Yutao Du et.al. | 2405.12754 | link |
2024-05-21 | RemoCap: Disentangled Representation Learning for Motion Capture | Hongsheng Wang et.al. | 2405.12724 | null |
2024-05-21 | Constraints on the (re-)orientation of star-disk systems through infall | M. Kuffmeier et.al. | 2405.12670 | null |
2024-05-21 | LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting | Jia Gong et.al. | 2405.12663 | null |
2024-05-21 | S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video | Hao Zhang et.al. | 2405.12607 | link |
2024-05-21 | FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors | Shuai Liu et.al. | 2405.12601 | link |
2024-05-21 | A Weeding Robot for Seedling Removal | Jarkko Kotaniemi et.al. | 2405.12596 | null |
2024-05-21 | NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction | Hongsheng Wang et.al. | 2405.12505 | null |
2024-05-21 | 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification | Yan He et.al. | 2405.12487 | null |
2024-05-21 | Gaussian Control with Hierarchical Semantic Graphs in 3D Human Recovery | Hongsheng Wang et.al. | 2405.12477 | null |
2024-05-21 | Optimizing Generative AI Networking: A Dual Perspective with Multi-Agent Systems and Mixture of Experts | Ruichen Zhang et.al. | 2405.12472 | null |
2024-05-21 | Physics-based Scene Layout Generation from Human Motion | Jianan Li et.al. | 2405.12460 | null |
2024-05-21 | Mutual Information Analysis in Multimodal Learning Systems | Hadi Hadizadeh et.al. | 2405.12456 | null |
2024-05-20 | GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details | Boqian Li et.al. | 2405.12420 | link |
2024-05-20 | GeoMask3D: Geometrically Informed Mask Selection for Self-Supervised Point Cloud Learning in 3D | Ali Bahri et.al. | 2405.12419 | null |
2024-05-20 | Large scale scattering using fast solvers based on neural operators | Zongren Zou et.al. | 2405.12380 | null |
2024-05-22 | AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field | Rong Liu et.al. | 2405.12369 | link |
2024-05-24 | Hypergraph: A Unified and Uniform Definition with Application to Chemical Hypergraph | Daniel T. Chang et.al. | 2405.12235 | null |
2024-05-20 | Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo | Tianqi Liu et.al. | 2405.12218 | link |
2024-05-20 | Multi-View Attentive Contextualization for Multi-View 3D Object Detection | Xianpeng Liu et.al. | 2405.12200 | null |
2024-05-20 | Histotripsy of blood clots within a hollow cylindrical transducer for aspiration thrombectomy applications | Li Gong et.al. | 2405.12194 | null |
2024-05-20 | State of the Practice for Medical Imaging Software | W. Spencer Smith et.al. | 2405.12171 | null |
2024-05-20 | Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents | Guanlin Wu et.al. | 2405.12155 | null |
2024-05-20 | Cosmic Ray Diffusion in the Turbulent Interstellar Medium: Effects of Mirror Diffusion and Pitch Angle Scattering | Lucas Barreto-Mota et.al. | 2405.12146 | null |
2024-05-20 | 2D vs. 3D BAO: quantification of their tension and test of the Etherington relation | Arianna Favale et.al. | 2405.12142 | null |
2024-05-20 | Neutron-superfluid vortices and proton-superconductor flux tubes: Development of a minimal model for pulsar glitches | Sanjay Shukla et.al. | 2405.12127 | null |
2024-05-20 | CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization | Jiawei Zhang et.al. | 2405.12110 | link |
2024-05-20 | Real topological phonons in 3D carbon allotropes | Xiaotian Wang et.al. | 2405.12072 | null |
2024-05-20 | The Projective Wave Theory of Consciousness | Robert Worden et.al. | 2405.12071 | null |
2024-05-20 | AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements | Calvin Yeung et.al. | 2405.12070 | link |
2024-05-21 | Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping | Tianhao Wu et.al. | 2405.12069 | null |
2024-05-20 | NPLMV-PS: Neural Point-Light Multi-View Photometric Stereo | Fotios Logothetis et.al. | 2405.12057 | null |
2024-05-20 | Depth Reconstruction with Neural Signed Distance Fields in Structured Light Systems | Rukun Qiao et.al. | 2405.12006 | null |
2024-05-20 | GGAvatar: Geometric Adjustment of Gaussian Head Avatar | Xinyang Li et.al. | 2405.11993 | null |
2024-05-20 | GuidedRec: Guiding Ill-Posed Unsupervised Volumetric Recovery | Alexandre Cafaro et.al. | 2405.11977 | null |
2024-05-20 | Confinement for 3d $\mathcal{N}=2$ $SU(N)$ with a Symmetric tensor | Antonio Amariti et.al. | 2405.11972 | null |
2024-05-20 | MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections | Jiayue Liu et.al. | 2405.11921 | null |
2024-05-20 | PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images | Yiheng Xiong et.al. | 2405.11914 | link |
2024-05-21 | 3D Reconfigurable Intelligent Surfaces for Satellite-Terrestrial Networks | Islam M. Tanash et.al. | 2405.11909 | null |
2024-05-20 | A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation | Sushmita Sarker et.al. | 2405.11903 | null |
2024-05-20 | Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction | Aryan Garg et.al. | 2405.11823 | null |
2024-05-20 | Can we improve the energy efficiency of EUV lithography? | Tsumoru Shintake et.al. | 2405.11717 | null |
2024-05-19 | FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention | Ziang Guo et.al. | 2405.11682 | link |
2024-05-21 | Microstructure and Stress Mapping in 3D at Industrially Relevant Degrees of Plastic Deformation | Axel Henningsson et.al. | 2405.11644 | null |
2024-05-19 | Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention | Peng Li et.al. | 2405.11616 | null |
2024-05-19 | RobMOT: Robust 3D Multi-Object Tracking by Observational Noise and State Estimation Drift Mitigation on LiDAR PointCloud | Mohamed Nagy et.al. | 2405.11536 | null |
2024-05-19 | Point Cloud Compression with Implicit Neural Representations: A Unified Framework | Hongning Ruan et.al. | 2405.11493 | null |
2024-05-19 | Unifying 3D Vision-Language Understanding via Promptable Queries | Ziyu Zhu et.al. | 2405.11442 | null |
2024-05-18 | Second-harmonic optical diffraction tomography | Amirhossein Saba et.al. | 2405.11398 | null |
2024-05-18 | Cooperative Multi-agent Approach for Automated Computer Game Testing | Samira Shirzadeh-hajimahmood et.al. | 2405.11347 | null |
2024-05-18 | Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion | Zeyu Zhang et.al. | 2405.11286 | link |
2024-05-18 | Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching | Xingyu Miao et.al. | 2405.11252 | link |
2024-05-18 | Learning-based Block-wise Planar Channel Estimation for Time-Varying MIMO OFDM | Chenchen Liu et.al. | 2405.11218 | null |
2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129 | link |
2024-05-17 | A Comparative Study of Garment Draping Techniques | Prerana Achar et.al. | 2405.11056 | null |
2024-05-17 | Photorealistic 3D Urban Scene Reconstruction and Point Cloud Extraction using Google Earth Imagery and Gaussian Splatting | Kyle Gao et.al. | 2405.11021 | null |
2024-05-16 | Flow Score Distillation for Diverse Text-to-3D Generation | Runjie Yan et.al. | 2405.10988 | null |
2024-05-17 | Reconstruction of Manipulated Garment with Guided Deformation Prior | Ren Li et.al. | 2405.10934 | null |
2024-05-17 | Nearly self-similar blowup of generalized axisymmetric Navier-Stokes and Boussinesq equations | Thomas Y. Hou et.al. | 2405.10916 | null |
2024-05-15 | Emergent magnetic field and vector potential of the toroidal magnetic hopfions | Konstantin Y. Guslienko et.al. | 2405.10811 | null |
2024-05-17 | Flux rope modeling of the 2022 Sep 5 CME observed by Parker Solar Probe and Solar Orbiter from 0.07 to 0.69 au | Emma E. Davies et.al. | 2405.10810 | null |
2024-05-17 | Scanning Acoustic Microscopy for Quantifying Two-phase Transfer in Operando Alkaline Water Electrolyzer | Zehua Dou et.al. | 2405.10716 | null |
2024-05-17 | Quantum Phase Transitions in Many-Dipole Light-Matter Systems | Daniele Lamberto et.al. | 2405.10711 | null |
2024-05-17 | 3D Vessel Reconstruction from Sparse-View Dynamic DSA Images via Vessel Probability Guided Attenuation Learning | Zhentao Liu et.al. | 2405.10705 | link |
2024-05-17 | LoCI-DiffCom: Longitudinal Consistency-Informed Diffusion Model for 3D Infant Brain Image Completion | Zihao Zhu et.al. | 2405.10691 | null |
2024-05-17 | Stellar wind impact on early atmospheres around unmagnetized Earth-like planets | Ada Canet et.al. | 2405.10641 | null |
2024-05-17 | Cannibals in PARADISE: The effect of merging interplanetary shocks on solar energetic particle events | Antonio Niemela et.al. | 2405.10615 | null |
2024-05-17 | GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision | Xin Tan et.al. | 2405.10591 | null |
2024-05-17 | DuoSpaceNet: Leveraging Both Bird’s-Eye-View and Perspective View Representations for 3D Object Detection | Zhe Huang et.al. | 2405.10577 | null |
2024-05-17 | Resolving Symmetry Ambiguity in Correspondence-based Methods for Instance-level Object Pose Estimation | Yongliang Lin et.al. | 2405.10557 | null |
2024-05-17 | You Can’t Solve These Super Mario Bros. Levels: Undecidable Mario Games | MIT Hardness Group et.al. | 2405.10546 | null |
2024-05-17 | Radar Positioning for Accurate Sensing of Pulse Waves at Multiple Sites Using a 3D Human Model | Takehito Koshisaka et.al. | 2405.10540 | null |
2024-05-17 | ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation | Pengzhi Li et.al. | 2405.10508 | null |
2024-05-16 | Single-shot volumetric fluorescence imaging with neural fields | Oumeng Zhang et.al. | 2405.10463 | null |
2024-05-16 | Energetic particles transport in constants of motion space due to collisions in tokamak plasmas | Guo Meng et.al. | 2405.10428 | null |
2024-05-23 | Describing heat dissipation in the resistive state of three-dimensional superconductors | Leonardo Rodrigues Cadorim et.al. | 2405.10415 | null |
2024-05-16 | Grounded 3D-LLM with Referent Tokens | Yilun Chen et.al. | 2405.10370 | link |
2024-05-15 | UniCorn: A Unified Contrastive Learning Approach for Multi-view Molecular Representation Learning | Shikun Feng et.al. | 2405.10343 | null |
2024-05-14 | Suppression of blow-up in Patlak-Keller-Segel system coupled with linearized Navier-Stokes equations via the 3D Couette flow | Shikun Cui et.al. | 2405.10337 | null |
2024-05-17 | Toon3D: Seeing Cartoons from a New Perspective | Ethan Weber et.al. | 2405.10320 | null |
2024-05-16 | CAT3D: Create Anything in 3D with Multi-View Diffusion Models | Ruiqi Gao et.al. | 2405.10314 | null |
2024-05-16 | When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models | Xianzheng Ma et.al. | 2405.10255 | link |
2024-05-16 | A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts | Xinru Zhang et.al. | 2405.10246 | link |
2024-05-16 | Phase behavior of metastable water from large-scale simulations of a quantitative accurate model: The liquid-liquid critical point | Luis Enrique Coronas et.al. | 2405.10181 | null |
2024-05-16 | 3D-2D crossover and phase shift of beats of quantum oscillations of interlayer magnetoresistance in quasi-2D metals | Taras Mogilyuk et.al. | 2405.10174 | null |
2024-05-24 | GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction | Rui Jin et.al. | 2405.10142 | null |
2024-05-16 | Spatial Cognition: a Wave Hypothesis | Robert Worden et.al. | 2405.10112 | null |
2024-05-16 | One-step Pulsed Laser Deposition of Metal oxynitride/Carbon Composites for Supercapacitor Application | Subrata Ghosh et.al. | 2405.10103 | null |
2024-05-16 | MrRegNet: Multi-resolution Mask Guided Convolutional Neural Network for Medical Image Registration with Large Deformations | Ruizhe Li et.al. | 2405.10068 | link |
2024-05-16 | Discussing Risks and Benefits in the Future of Hybrid Rehabilitation and Fitness in Mixed Reality | Jana Franceska Funke et.al. | 2405.10059 | null |
2024-05-16 | A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance | Andrea Matteazzi et.al. | 2405.10046 | null |
2024-05-16 | Solving the enigma: Deriving optimal explanations of deep networks | Michail Mamalakis et.al. | 2405.10008 | null |
2024-05-16 | Learning BPS Spectra and the Gap Conjecture | Sergei Gukov et.al. | 2405.09993 | null |
2024-05-17 | Dual-band feature selection for maturity classification of specialty crops by hyperspectral imaging | Usman A. Zahidi et.al. | 2405.09955 | null |
2024-05-16 | Infrared Adversarial Car Stickers | Xiaopei Zhu et.al. | 2405.09924 | null |
2024-05-20 | RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception | Xiaosu Zhu et.al. | 2405.09883 | link |
2024-05-16 | Deep Learning-Based Quasi-Conformal Surface Registration for Partial 3D Faces Applied to Facial Recognition | Yuchen Guo et.al. | 2405.09880 | null |
2024-05-16 | Electron delocalization in a 2D Mott insulator | Cosme G. Ayani et.al. | 2405.09877 | null |
2024-05-16 | Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion | Xinyang Li et.al. | 2405.09874 | null |
2024-05-27 | Electrically switchable $2^N$ -channel wave-front control with N cascaded polarization-dependent metasurfaces | Zhiyao Ma et.al. | 2405.09844 | null |
2024-05-16 | On the edge turbulence in a DTT-like tokamak plasma | F. Cianfrani et.al. | 2405.09837 | null |
2024-05-19 | PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features | Xusheng Li et.al. | 2405.09828 | null |
2024-05-16 | MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis | Joseph Cho et.al. | 2405.09806 | null |
2024-05-16 | The metallicity and carbon-to-oxygen ratio of the ultra-hot Jupiter WASP-76b from Gemini-S/IGRINS | Megan Weiner Mansfield et.al. | 2405.09769 | null |
2024-05-16 | Collision Avoidance Metric for 3D Camera Evaluation | Vage Taamazyan et.al. | 2405.09755 | link |
2024-05-15 | The proper way to spatially decompose the gravitational-wave origin in stellar collapse simulations | Shuai Zha et.al. | 2405.09729 | null |
2024-05-15 | 3D structure of the Milky Way out to 10 kpc from the Sun. Catalogue of large molecular clouds in the Galactic Plane | Sara Rezaei Kh. et.al. | 2405.09634 | null |
2024-05-15 | Three-dimensional quantum Hall states as a chiral electromagnetic filter | Nandagopal Manoj et.al. | 2405.09617 | null |
2024-05-15 | BxC Toolkit: Generating Tailored Turbulent 3D Magnetic Fields | Daniela Maci et.al. | 2405.09587 | null |
2024-05-15 | Energy-Efficient Sleep Mode Optimization of 5G mmWave Networks Using Deep Contextual MAB | Saad Masrur et.al. | 2405.09528 | null |
2024-05-15 | Some 1d (Supersymmetric) Quantum Field Theories Reduced from Chern-Simons Gauge Theories | Burak Oğuz et.al. | 2405.09473 | null |
2024-05-15 | A Survey On Text-to-3D Contents Generation In The Wild | Chenhan Jiang et.al. | 2405.09431 | null |
2024-05-15 | Three Dimensional Spatial Cognition: Bees and Bats | Robert Worden et.al. | 2405.09413 | null |
2024-05-15 | VascularPilot3D: Toward a 3D fully autonomous navigation for endovascular robotics | Song Jingwei et.al. | 2405.09375 | null |
2024-05-15 | 3D-DASH: The Evolution of Size, Shape, and Intrinsic Scatter in Populations of Young and Old Quiescent Galaxies at 0.5 < z < 3 | Maike Clausen et.al. | 2405.09354 | null |
2024-05-15 | Optimal constants of smoothing estimates for the 3D Dirac equation | Makoto Ikoma et.al. | 2405.09349 | null |
2024-05-15 | Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study | Farnaz Khun Jush et.al. | 2405.09334 | null |
2024-05-15 | Energy conservation for 3D Euler and Navier-Stokes equations in a bounded domain. Applications to Beltrami flows | Luigi C. Berselli et.al. | 2405.09316 | null |
2024-05-15 | Three-Dimensional Path Planning: Navigating through Rough Mereology | Aleksandra Szpakowska et.al. | 2405.09282 | null |
2024-05-15 | Anchor Layout Optimization for Ultrasonic Indoor Positioning Using Swarm Intelligence | Daan Delabie et.al. | 2405.09222 | null |
2024-05-23 | Complex-valued 3D atomic spectroscopy with Gaussian-assisted inline holography | Xing Huang et.al. | 2405.09117 | null |
2024-05-15 | Dim Small Target Detection and Tracking: A Novel Method Based on Temporal Energy Selective Scaling and Trajectory Association | Weihua Gao et.al. | 2405.09054 | null |
2024-05-15 | 3D Shape Augmentation with Content-Aware Shape Resizing | Mingxiang Chen et.al. | 2405.09050 | null |
2024-05-24 | Theoretical Analysis for Expectation-Maximization-Based Multi-Model 3D Registration | David Jin et.al. | 2405.08991 | null |
2024-05-16 | Comparing bulge RR Lyrae stars with bulge giants – Insight from 3D kinematics | J. Olivares Carvajal et.al. | 2405.08990 | null |
2024-05-14 | A practical guide to light-sheet microscopy for nanoscale imaging: Looking beyond the cell | Stephanie N. Kramer et.al. | 2405.08987 | null |
2024-05-14 | Tunable moiré materials for probing Berry physics and topology | Pratap Chandra Adak et.al. | 2405.08959 | null |
2024-05-14 | ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association | Shuxiao Ding et.al. | 2405.08909 | link |
2024-05-14 | The Impact of 2D and 3D Gamified VR on Learning American Sign Language | Jindi Wang et.al. | 2405.08908 | null |
2024-05-14 | Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring | Tiantian Zhang et.al. | 2405.08786 | link |
2024-05-14 | CATEcor: an Open Science, Shaded-Truss, Externally-Occulted Coronagraph | Craig E. DeForest et.al. | 2405.08739 | null |
2024-05-14 | An Analytic Solution to the 3D CSC Dubins Path Problem | Victor M. Baez et.al. | 2405.08710 | link |
2024-05-14 | A Distributed Approach to Autonomous Intersection Management via Multi-Agent Reinforcement Learning | Matteo Cederle et.al. | 2405.08655 | link |
2024-05-14 | Unconventional surface phase transitions in a (1+1)D $SU(2)_1$ CFT edge coupled to a (2+1)D $Z_2$ bulk | Zhe Wang et.al. | 2405.08612 | null |
2024-05-14 | Dynamic NeRF: A Review | Jinwei Lin et.al. | 2405.08609 | null |
2024-05-14 | The Requirement for Cognition, in an Equation | Robert Worden et.al. | 2405.08601 | null |
2024-05-14 | RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images | Zong-Wei Hong et.al. | 2405.08483 | link |
2024-05-14 | Velocity-vorticity geometric constraints for the energy conservation of 3D ideal incompressible fluids | Luigi C. Berselli et.al. | 2405.08461 | null |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-14 | No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding | Yingjie Zhai et.al. | 2405.08344 | link |
2024-05-17 | Perivascular space Identification Nnunet for Generalised Usage (PINGU) | Benjamin Sinclair et.al. | 2405.08337 | link |
2024-05-14 | StraightPCF: Straight Point Cloud Filtering | Dasith de Silva Edirimuni et.al. | 2405.08322 | link |
2024-05-13 | Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis | Yifan Wang et.al. | 2405.08210 | null |
2024-05-13 | Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates | Zhenqiao Song et.al. | 2405.08205 | link |
2024-05-13 | Non-local twist sequences in floppy kagome chains | Pegah Azizi et.al. | 2405.08182 | null |
2024-05-13 | Constructing nested coordinates inside strongly shaped toroids using an action principle | Zeno Tecchiolli et.al. | 2405.08173 | null |
2024-05-13 | Towards the reproducible fabrication of conductive ferroelectric domain walls into lithium niobate bulk single crystals | Julius Ratzenberger et.al. | 2405.08156 | null |
2024-05-21 | 5d 2-Chern-Simons theory and 3d integrable field theories | Alexander Schenkel et.al. | 2405.08083 | null |
2024-05-13 | Random Bond perturbations of the $O(2)$ vector model | Maria Nocchi et.al. | 2405.08072 | null |
2024-05-13 | DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation | Ziang Cao et.al. | 2405.08055 | link |
2024-05-13 | Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning | Wenqi Dong et.al. | 2405.08054 | null |
2024-05-13 | Layout Generation Agents with Large Language Models | Yuichi Sasazawa et.al. | 2405.08037 | link |
2024-05-13 | SignAvatar: Sign Language 3D Motion Reconstruction and Generation | Lu Dong et.al. | 2405.07974 | null |
2024-05-13 | Scene Action Maps: Behavioural Maps for Navigation without Metric Information | Joel Loo et.al. | 2405.07948 | null |
2024-05-13 | Authentic Hand Avatar from a Phone Scan via Universal Hand Model | Gyeongsik Moon et.al. | 2405.07933 | null |
2024-05-13 | Unfolding via Progressive Mesh Approximation | Lars Zawallich et.al. | 2405.07922 | null |
2024-05-13 | An extended and refined grid of 3D STAGGER model atmospheres. Processed snapshots for stellar spectroscopy | Luisa F. Rodríguez Díaz et.al. | 2405.07872 | null |
2024-05-13 | SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling | Yijun Yuan et.al. | 2405.07847 | null |
2024-05-13 | Why Decussate? Topological Constraints on 3D Wiring | Troy Shinbrot et.al. | 2405.07837 | null |
2024-05-13 | Integrating Multi-Physics Simulations and Machine Learning to Define the Spatter Mechanism and Process Window in Laser Powder Bed Fusion | Olabode T. Ajenifujah et.al. | 2405.07823 | null |
2024-05-13 | Generating Human Motion in 3D Scenes from Text Descriptions | Zhi Cen et.al. | 2405.07784 | null |
2024-05-13 | Significant improvement in sensitivity of an anomalous Nernst heat flux sensor by composite structure | Hiroto Imaeda et.al. | 2405.07758 | null |
2024-05-13 | Searching for evidence of subchromospheric magnetic reconnection on the Sun | D. Baker et.al. | 2405.07755 | null |
2024-05-13 | TOPress3D: 3D topology optimization with design-dependent pressure loads in MATLAB | Prabhat Kumar et.al. | 2405.07733 | link |
2024-05-13 | Shell structure and shape transition in odd- $Z$ superheavy nuclei with proton numbers $Z=117, 119$ : insights from deformed relativistic Hartree-Bogoliubov in continuum | Y. X. Zhang et.al. | 2405.07704 | null |
2024-05-13 | oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving | Abdul Hannan Khan et.al. | 2405.07698 | null |
2024-05-13 | MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders | Xueying Jiang et.al. | 2405.07696 | null |
2024-05-13 | Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying | Thomas Pöllabauer et.al. | 2405.07653 | null |
2024-05-13 | A Hessian-Based Field Deformer for Real-Time Topology-Aware Shape Editing | Yunxiao Zhang et.al. | 2405.07644 | null |
2024-05-13 | Unveiling the Magmatic Architecture Beneath Oceanus Procellarum: Insights from GRAIL Mission Data | Meixia Geng et.al. | 2405.07639 | null |
2024-05-13 | InAs on Insulator: A New Platform for Cryogenic Hybrid Superconducting Electronics | Alessandro Paghi et.al. | 2405.07630 | null |
2024-05-13 | Direct electron beam writing of silver using a $β$ -diketonate precursor: first insights | Katja Höflich et.al. | 2405.07617 | null |
2024-05-13 | Integrity Monitoring of 3D Object Detection in Automated Driving Systems using Raw Activation Patterns and Spatial Filtering | Hakan Yekta Yatbaz et.al. | 2405.07600 | null |
2024-05-13 | A study of layered holographic superconductor | Chi-Hsien Tai et.al. | 2405.07535 | null |
2024-05-13 | Marginal Fairness Sliced Wasserstein Barycenter | Khai Nguyen et.al. | 2405.07482 | null |
2024-05-13 | Enhancing 3D Object Detection by Using Neural Network with Self-adaptive Thresholding | Houze Liu et.al. | 2405.07479 | null |
2024-05-23 | GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting | Haodong Chen et.al. | 2405.07472 | null |
2024-05-13 | Towards improved software visualisation of parameterised REE patterns: Introducing REEkit for geological analysis | Jaxon Kneipp et.al. | 2405.07438 | null |
2024-05-13 | PitcherNet: Powering the Moneyball Evolution in Baseball Video Analytics | Jerrin Bright et.al. | 2405.07407 | null |
2024-05-12 | LayGA: Layered Gaussian Avatars for Animatable Clothing Transfer | Siyou Lin et.al. | 2405.07319 | null |
2024-05-12 | Point Resampling and Ray Transformation Aid to Editable NeRF Models | Zhenyang Li et.al. | 2405.07306 | null |
2024-05-12 | Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception | Haoming Chen et.al. | 2405.07201 | link |
2024-05-12 | Hologram: Realtime Holographic Overlays via LiDAR Augmented Reconstruction | Ekansh Agrawal et.al. | 2405.07178 | null |
2024-05-12 | Capacity Maximization for Base Station with Hybrid Fixed and Movable Antennas | Xiaoming Shi et.al. | 2405.07176 | null |
2024-05-12 | planetMagFields: A Python package for analyzing and plotting planetary magnetic field data | Ankit Barik et.al. | 2405.07168 | null |
2024-05-12 | 3D Hand Mesh Recovery from Monocular RGB in Camera Space | Haonan Li et.al. | 2405.07167 | null |
2024-05-12 | Vertex Shader Domain Warping with Automatic Differentiation | Dave Pagurek van Mossel et.al. | 2405.07124 | null |
2024-05-12 | Quasiparticle and Excitonic Structures of Few-layer and Bulk GaSe: Interlayer Coupling, Self-energy, and Electron-hole Interaction | Fanhao Jia et.al. | 2405.07120 | null |
2024-05-11 | TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization | Zhen Tan et.al. | 2405.07027 | link |
2024-05-11 | Demystifying the Hypercomplex: Inductive Biases in Hypercomplex Deep Learning | Danilo Comminiello et.al. | 2405.07024 | null |
2024-05-11 | PIPE: Process Informed Parameter Estimation, a learning based approach to task generalized system identification | Constantin Schempp et.al. | 2405.06991 | null |
2024-05-11 | Direct Learning of Mesh and Appearance via 3D Gaussian Splatting | Ancheng Lin et.al. | 2405.06945 | null |
2024-05-11 | PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition | Shenglin He et.al. | 2405.06929 | null |
2024-05-10 | CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras | James Tang et.al. | 2405.06845 | link |
2024-05-10 | G-FARS: Gradient-Field-based Auto-Regressive Sampling for 3D Part Grouping | Junfeng Cheng et.al. | 2405.06828 | link |
2024-05-15 | Simulating Light Propagation through Biological Media Using Monte-Carlo Method | Maryam Ghahremani et.al. | 2405.06810 | null |
2024-05-10 | SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model | Trevor J. Chan et.al. | 2405.06786 | null |
2024-05-10 | GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs | Mingyu Liu et.al. | 2405.06782 | null |
2024-05-10 | OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | Jinwei Lin et.al. | 2405.06547 | link |
2024-05-10 | Holographic RG flows in a 3d gauged supergravity at finite temperature | Anastasia Golubtsova et.al. | 2405.06515 | null |
2024-05-14 | SketchDream: Sketch-based Text-to-3D Generation and Editing | Feng-Lin Liu et.al. | 2405.06461 | null |
2024-05-10 | I3DGS: Improve 3D Gaussian Splatting from Multiple Dimensions | Jinwei Lin et.al. | 2405.06408 | null |
2024-05-10 | Transmission of a Pressure Signal through a Confined Bubble Array | Edgar Ortega-Roano et.al. | 2405.06406 | null |
2024-05-22 | M3DIS – A grid of 3D radiation-hydrodynamics stellar atmosphere models for stellar surveys | Philipp Eitner et.al. | 2405.06338 | null |
2024-05-10 | Automated Cell Structure Extraction for 3D Electron Microscopy by Deep Learning | Jin Kousaka et.al. | 2405.06303 | null |
2024-05-10 | Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration | Li Ling et.al. | 2405.06279 | link |
2024-05-10 | Comparative Analysis of Advanced Feature Matching Algorithms in Challenging High Spatial Resolution Optical Satellite Stereo Scenarios | Qiyan Luo et.al. | 2405.06246 | null |
2024-05-10 | Fire in SRRN: Next-Gen 3D Temperature Field Reconstruction Technology | Shenxiang Feng et.al. | 2405.06230 | link |
2024-05-10 | Event-based Structure-from-Orbit | Ethan Elms et.al. | 2405.06216 | null |
2024-05-10 | Lowering Barriers to Entry for Fully-Integrated Custom Payloads on a DJI Matrice | Joshua Springer et.al. | 2405.06176 | link |
2024-05-09 | Detecting the spread of valence band Wannier functions by optical sum rules | Luis F. Cárdenas-Castillo et.al. | 2405.06146 | null |
2024-05-09 | Perceptual Crack Detection for Rendered 3D Textured Meshes | Armin Shafiee Sarvestani et.al. | 2405.06143 | link |
2024-05-09 | Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba | Hongwei Ren et.al. | 2405.06116 | link |
2024-05-09 | A Mixture of Experts Approach to 3D Human Motion Prediction | Edmund Shieh et.al. | 2405.06088 | link |
2024-05-09 | Solving the Einstein Equations Numerically | David Hilditch et.al. | 2405.06035 | null |
2024-05-22 | Supernova Explosions of the Lowest-Mass Massive Star Progenitors | Tianshu Wang et.al. | 2405.06024 | null |
2024-05-09 | Effect of the Large Magellanic Cloud on the kinematics of Milky Way satellites and virial mass estimate | Andrey Kravtsov et.al. | 2405.06017 | null |
2024-05-17 | Single-antenna super-resolution positioning with nonseparable toroidal pulses | Ren Wang et.al. | 2405.05979 | null |
2024-04-30 | Symbolic construction of the chemical Jacobian of quasi-steady state (QSS) chemistries for Exascale computing platforms | Malik Hassanaly et.al. | 2405.05974 | link |
2024-05-15 | Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers | Peng Gao et.al. | 2405.05945 | link |
2024-05-09 | MRISegmentator-Abdomen: A Fully Automated Multi-Organ and Structure Segmentation Tool for T1-weighted Abdominal MRI | Yan Zhuang et.al. | 2405.05944 | link |
2024-05-09 | Role of Vanadium-Oxide Layer in Electronic State of Sr $2$VFeAsO${3-δ}$ with Oxygen Deficiency | Masamichi Nakajima et.al. | 2405.05893 | null |
2024-05-09 | 3D Positioning using a New Diffraction Path Model | Gaurav Duggal et.al. | 2405.05801 | null |
2024-05-09 | DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation | Sitian Shen et.al. | 2405.05800 | null |
2024-05-09 | RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation | Sourav Garg et.al. | 2405.05792 | null |
2024-05-09 | Sequential Amodal Segmentation via Cumulative Occlusion Learning | Jiayang Ao et.al. | 2405.05791 | null |
2024-05-09 | Autonomous Robotic Ultrasound System for Liver Follow-up Diagnosis: Pilot Phantom Study | Tianpeng Zhang et.al. | 2405.05787 | null |
2024-05-11 | Exploration of morphological coherence in open clusters with a “core-shell’’ structure | Qingshun Hu et.al. | 2405.05771 | null |
2024-05-09 | FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting | Yikun Ma et.al. | 2405.05768 | null |
2024-05-10 | NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior | Gihoon Kim et.al. | 2405.05749 | null |
2024-05-09 | 3D bulk field theories for 2D non-unitary N=1 supersymmetric minimal models | Seungjoo Baek et.al. | 2405.05746 | null |
2024-05-09 | Neural Network Approach for Predicting Infrared Spectra from 3D Molecular Structure | Saleh Abdul Al et.al. | 2405.05737 | link |
2024-05-18 | NGM-SLAM: Gaussian Splatting SLAM with Radiance Field Submap | Mingrui Li et.al. | 2405.05702 | null |
2024-05-09 | SubGDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning | Jiying Zhang et.al. | 2405.05665 | link |
2024-05-09 | AI in Your Toolbox: A Plugin for Generating Renderings from 3D Models | Mingming Wang et.al. | 2405.05627 | null |
2024-05-09 | Depth Awakens: A Depth-perceptual Attention Fusion Network for RGB-D Camouflaged Object Detection | Xinran Liua et.al. | 2405.05614 | null |
2024-05-09 | Minimal Perspective Autocalibration | Andrea Porfiri Dal Cin et.al. | 2405.05605 | link |
2024-05-09 | Homogenization in 3D thin domains with oscillating boundaries of different orders | José M. Arrieta et.al. | 2405.05599 | null |
2024-05-09 | A Survey on Backbones for Deep Video Action Recognition | Zixuan Tang et.al. | 2405.05584 | null |
2024-05-09 | Array SAR 3D Sparse Imaging Based on Regularization by Denoising Under Few Observed Data | Yangyang Wang et.al. | 2405.05565 | null |
2024-05-09 | Conformal to confining SQFTs from holography | Dimitrios Chatzis et.al. | 2405.05563 | null |
2024-05-09 | Improved electrochemical performance of NASICON type Na ${3}$V${2-x}$Co$x$(PO${4}$)$_{3}$/C ($x=$ 0–0.15) cathode for high rate and stable sodium-ion batteries | Simranjot K. Sapra et.al. | 2405.05559 | null |
2024-05-09 | Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview | Yuhang Ming et.al. | 2405.05526 | null |
2024-05-09 | Continuous max-flow augmentation of self-supervised few-shot learning on SPECT left ventricles | Ádám István Szűcs et.al. | 2405.05520 | link |
2024-05-09 | HPPS: A Hierarchical Progressive Perception System for Luggage Trolley Detection and Localization at Airports | Zhirui Sun et.al. | 2405.05514 | null |
2024-05-09 | Banking Turn of High-DOF Dynamic Morphing Wing Flight by Shifting Structure Response Using Optimization | Bibek Gupta et.al. | 2405.05490 | null |
2024-05-08 | Highest Fusion Performance without Harmful Edge Energy Bursts in Tokamak | SangKyeun Kim et.al. | 2405.05452 | null |
2024-05-08 | GDGS: Gradient Domain Gaussian Splatting for Sparse Representation of Radiance Fields | Yuanhao Gong et.al. | 2405.05446 | null |
2024-05-08 | 2D ferroelectrics and ferroelectrics with 2D: materials and device prospects | Chloe Leblanc et.al. | 2405.05432 | null |
2024-05-08 | Identifying stable communities in Hi-C data using a multifractal null model | Lucas Hedström et.al. | 2405.05425 | null |
2024-05-08 | Coupling of the Finite Element Method with Physics Informed Neural Networks for the Multi-Fluid Flow Problem | Michel Nohra et.al. | 2405.05371 | null |
2024-05-08 | Joint semi-supervised and contrastive learning enables zero-shot domain-adaptation and multi-domain segmentation | Alvaro Gomariz et.al. | 2405.05336 | null |
2024-05-08 | AGN flares as counterparts to the mergers detected by LIGO and Virgo: a novel spatial correlation analysis | Niccolò Veronesi et.al. | 2405.05318 | null |
2024-05-08 | Convective overstability in radially global protoplanetary disks I – Pure gas dynamics | Marius Lehmann et.al. | 2405.05314 | null |
2024-05-08 | Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving | Lingdong Kong et.al. | 2405.05258 | link |
2024-05-08 | FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Jinglin Xu et.al. | 2405.05216 | link |
2024-05-18 | A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective | Huaiyuan Xu et.al. | 2405.05173 | link |
2024-05-08 | Crystal structure identification with 3D convolutional neural networks with application to high-pressure phase transitions in SiO $_2$ | Linus C. Erhard et.al. | 2405.05156 | link |
2024-05-08 | DenserRadar: A 4D millimeter-wave radar point cloud detector based on dense LiDAR point clouds | Zeyu Han et.al. | 2405.05131 | null |
2024-05-08 | Energy stable gradient flow schemes for shape and topology optimization in Navier-Stokes flows | Jiajie Li et.al. | 2405.05098 | null |
2024-05-08 | Rapid Co-design of Task-Specialized Whegged Robots for Ad-Hoc Needs | Varun Madabushi et.al. | 2405.05096 | null |
2024-05-09 | Understanding solid nitrogen through machine learning simulation | Marcin Kirsz et.al. | 2405.05092 | null |
2024-05-08 | ${M^2D}$ NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields | Ning Wang et.al. | 2405.05010 | null |
2024-05-08 | Audio Matters Too! Enhancing Markerless Motion Capture with Audio Signals for String Performance Capture | Yitong Jin et.al. | 2405.04963 | link |
2024-05-08 | The many colors of the TNG100 simulation | Andrea Gebek et.al. | 2405.04925 | link |
2024-05-08 | Fast LiDAR Upsampling using Conditional Diffusion Models | Sander Elias Magnussen Helgesen et.al. | 2405.04889 | link |
2024-05-08 | EMISSA: Exploring millimetre indicators of solar-stellar activity III. Comparison of Ca II indices and millimetre continua in a 3D model atmosphere | Sneha Pandit et.al. | 2405.04871 | null |
2024-05-08 | Information Geometric Framework For Point Cloud Data | Amit Vishwakarma et.al. | 2405.04864 | null |
2024-05-08 | Three-dimensional higher-order saddle points induced flat bands in Co-based kagome metals | Hengxin Tan et.al. | 2405.04863 | null |
2024-05-08 | GoalGrasp: Grasping Goals in Partially Occluded Scenarios without Grasp Training | Shun Gui et.al. | 2405.04783 | null |
2024-05-08 | Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches | Qing Yu et.al. | 2405.04771 | null |
2024-05-07 | Impact of Dimensionality on the Magnetocaloric Effect in Two-dimensional Magnets | Lokanath Patra et.al. | 2405.04639 | null |
2024-05-13 | FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes | Charles Gaydon et.al. | 2405.04634 | link |
2024-05-07 | First Constraints on the ISM Conditions of a Low Mass, Highly Obscured z=4.27 Main Sequence Galaxy | Andrew Mizener et.al. | 2405.04582 | null |
2024-05-07 | Tactile-Augmented Radiance Fields | Yiming Dou et.al. | 2405.04534 | link |
2024-05-07 | ChatHuman: Language-driven 3D Human Understanding with Retrieval-Augmented Tool Reasoning | Jing Lin et.al. | 2405.04533 | null |
2024-05-07 | UQ state-dependent framework for seismic fragility assessment of industrial components | C. Nardin et.al. | 2405.04487 | null |
2024-05-07 | DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving | Chen Min et.al. | 2405.04390 | null |
2024-05-14 | Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting | Ola Shorinwa et.al. | 2405.04378 | link |
2024-05-07 | ASKAP reveals the radio tail structure of the Corkscrew Galaxy shaped by its passage through the Abell 3627 cluster | Bärbel S. Koribalski et.al. | 2405.04374 | null |
2024-05-07 | Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation | Jihyun Kim et.al. | 2405.04356 | link |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-07 | Molecular Identification via Molecular Fingerprint extraction from Atomic Force Microscopy images | Manuel González Lastre et.al. | 2405.04321 | null |
2024-05-07 | Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling | Jiawei Shi et.al. | 2405.04309 | null |
2024-05-07 | ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers | Jinke Li et.al. | 2405.04299 | link |
2024-05-07 | Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map | Yuxuan Xia et.al. | 2405.04290 | null |
2024-05-07 | Morphological Evidence for the eROSITA Bubbles Being Giant and Distant Structures | Teng Liu et.al. | 2405.04264 | null |
2024-05-07 | COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval | Hao Wu et.al. | 2405.04103 | null |
2024-05-07 | Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using AI | Rikathi Pal et.al. | 2405.04023 | null |
2024-05-07 | A hydrodynamic approach to reproduce multiple spinning vortices in horizontally rotating three-dimensional liquid helium-4 | Satori Tsuzuki et.al. | 2405.03980 | null |
2024-05-07 | Structure-based drug design by denoising voxel grids | Pedro O. Pinheiro et.al. | 2405.03961 | link |
2024-05-07 | Robust Optimization for Spot Scanning Proton Therapy based on Dose-Linear Energy Transfer (LET) Volume Constraints | Jingyuan Chen et.al. | 2405.03916 | null |
2024-05-06 | MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View | Emmanuelle Bourigault et.al. | 2405.03894 | null |
2024-05-06 | BadFusion: 2D-Oriented Backdoor Attacks against 3D Object Detection | Saket S. Chaturvedi et.al. | 2405.03884 | null |
2024-05-12 | Twisted circle compactification of $\mathcal{N}=4$ SYM and its Holographic Dual | S. Prem Kumar et.al. | 2405.03739 | null |
2024-05-06 | Pose Priors from Language Models | Sanjay Subramanian et.al. | 2405.03689 | null |
2024-05-06 | Language-Image Models with 3D Understanding | Jang Hyun Cho et.al. | 2405.03685 | null |
2024-05-06 | A Construct-Optimize Approach to Sparse View Synthesis without Camera Pose | Kaiwen Jiang et.al. | 2405.03659 | null |
2024-05-06 | Development of Ultra-Portable 3D Mapping Systems for Emergency Services | Charles Hamesse et.al. | 2405.03514 | null |
2024-05-06 | Spin-Wave Voices: Sonification of Nanoscale Spin Waves as an Engagement and Research Tool | Santa Pile et.al. | 2405.03506 | null |
2024-05-06 | Gaussian Splatting: 3D Reconstruction and Novel View Synthesis, a Review | Anurag Dalal et.al. | 2405.03417 | null |
2024-05-06 | Non-Perturbative Corrections to 3d BPS Indices and Topological Strings | Hans Jockers et.al. | 2405.03398 | null |
2024-05-06 | 3D LiDAR Mapping in Dynamic Environments Using a 4D Implicit Neural Representation | Xingguang Zhong et.al. | 2405.03388 | link |
2024-05-06 | Fully Reversing the Shoebox Image Source Method: From Impulse Responses to Room Parameters | Tom Sprunck et.al. | 2405.03385 | link |
2024-05-06 | Statistical Edge Detection And UDF Learning For Shape Representation | Virgile Foy et.al. | 2405.03381 | null |
2024-05-06 | Three-temperature radiation hydrodynamics with PLUTO: Thermal and kinematic signatures of accreting protoplanets | Dhruv Muley et.al. | 2405.03375 | null |
2024-05-06 | Enhancing Spatiotemporal Disease Progression Models via Latent Diffusion and Prior Knowledge | Lemuel Puglisi et.al. | 2405.03328 | link |
2024-05-06 | Deep Learning-based Point Cloud Registration for Augmented Reality-guided Surgery | Maximilian Weber et.al. | 2405.03314 | null |
2024-05-06 | MACE: A Machine learning Approach to Chemistry Emulation | S. Maes et.al. | 2405.03274 | link |
2024-05-06 | Understanding the effects of spacecraft trajectories through solar coronal mass ejection flux ropes using 3DCOREweb | Hannah Theresa Rüdisser et.al. | 2405.03271 | null |
2024-05-06 | Global existence and scattering of small data smooth solutions to a class of quasilinear wave systems on $\mathbb{R}^2\times\mathbb{T}$ | Fei Hou et.al. | 2405.03242 | null |
2024-05-06 | POPDG: Popular 3D Dance Generation with PopDanceSet | Zhenye Luo et.al. | 2405.03178 | link |
2024-05-06 | Advancing Multimodal Medical Capabilities of Gemini | Lin Yang et.al. | 2405.03162 | null |
2024-05-07 | Automatic Ultrasound Curve Angle Measurement via Affinity Clustering for Adolescent Idiopathic Scoliosis Evaluation | Yihao Zhou et.al. | 2405.03141 | null |
2024-05-05 | Multi-hop graph transformer network for 3D human pose estimation | Zaedul Islam et.al. | 2405.03055 | null |
2024-05-05 | Morphokinematical study of the planetary nebula Me2-1: Unveiling its point-symmetric and unusual physical structure | L. F. Miranda et.al. | 2405.02938 | null |
2024-05-05 | Multimodal Sense-Informed Prediction of 3D Human Motions | Zhenyu Lou et.al. | 2405.02911 | null |
2024-05-05 | Target Localization with Macro and Micro Base Stations Cooperative Sensing | Haotian Liu et.al. | 2405.02873 | null |
2024-05-05 | MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior | Honghua Chen et.al. | 2405.02859 | null |
2024-05-05 | On Enhancing Brain Tumor Segmentation Across Diverse Populations with Convolutional Neural Networks | Fadillah Maani et.al. | 2405.02852 | link |
2024-05-05 | PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection | Zhaoqi Leng et.al. | 2405.02811 | null |
2024-05-05 | MR-Transformer: Vision Transformer for Total Knee Replacement Prediction Using Magnetic Resonance Imaging | Chaojie Zhang et.al. | 2405.02784 | null |
2024-05-05 | Instantaneous Perception of Moving Objects in 3D | Di Liu et.al. | 2405.02781 | null |
2024-05-04 | Magnetar Eruptions and Electromagnetic Fireworks | J. F. Mahlmann et.al. | 2405.02773 | null |
2024-05-04 | Boosting 3D Neuron Segmentation with 2D Vision Transformer Pre-trained on Natural Images | Yik San Cheng et.al. | 2405.02686 | null |
2024-05-04 | Tailored Fabrication of 3D Nanopores with Dielectric Oxides for Multiple Nanoscale Applications | German Lanzavecchia et.al. | 2405.02632 | null |
2024-05-04 | Vision-based 3D occupancy prediction in autonomous driving: a review and outlook | Yanan Zhang et.al. | 2405.02595 | link |
2024-05-04 | ActiveNeuS: Active 3D Reconstruction using Neural Implicit Surface Uncertainty | Hyunseo Kim et.al. | 2405.02568 | null |
2024-05-03 | Spatio-Temporal SwinMAE: A Swin Transformer based Multiscale Representation Learner for Temporal Satellite Imagery | Yohei Nakayama et.al. | 2405.02512 | null |
2024-05-08 | Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI | Minhui Yu et.al. | 2405.02504 | link |
2024-05-03 | Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids | Junchen Liu et.al. | 2405.02386 | link |
2024-05-03 | DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos | Wen-Hsuan Chu et.al. | 2405.02280 | link |
2024-05-03 | Emergent Magnetic Field and Nonzero Gyrovector of the Toroidal Magnetic Hopfion | Dariia Popadiuk et.al. | 2405.02262 | null |
2024-05-03 | Electron Drag Effect on Thermal Conductivity in Two-dimensional Semiconductors | Yujie Quan et.al. | 2405.02257 | null |
2024-05-03 | WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD | Xuxin Cheng et.al. | 2405.02241 | link |
2024-05-03 | Accurate Pose Prediction on Signed Distance Fields for Mobile Ground Robots in Rough Terrain | Martin Oehler et.al. | 2405.02121 | link |
2024-05-03 | Probablistic Restoration with Adaptive Noise Sampling for 3D Human Pose Estimation | Xianzhou Zeng et.al. | 2405.02114 | link |
2024-05-03 | Three-Dimensional Amyloid-Beta PET Synthesis from Structural MRI with Conditional Generative Adversarial Networks | Fernando Vega et.al. | 2405.02109 | null |
2024-05-09 | WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights | Youngdong Jang et.al. | 2405.02066 | null |
2024-05-03 | HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 | Miriam Jäger et.al. | 2405.02005 | null |
2024-05-03 | Cooperation and Federation in Distributed Radar Point Cloud Processing | S. Savazzi et.al. | 2405.01995 | null |
2024-05-03 | Creation of Novel Soft Robot Designs using Generative AI | Wee Kiat Chan et.al. | 2405.01824 | null |
2024-05-03 | Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics | Rucha Deshpande et.al. | 2405.01822 | null |
2024-05-02 | OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning | Shihao Wang et.al. | 2405.01533 | link |
2024-05-02 | Grand Design vs. Multi-Armed Spiral Galaxies: Dependence on Galaxy Structure | Beverly J. Smith et.al. | 2405.01516 | null |
2024-05-02 | Revisiting the Concordance $Λ$ CDM model using Gamma-Ray Bursts together with Supernovae Ia and Planck data | Shahnawaz A. Adil et.al. | 2405.01452 | null |
2024-05-02 | Convection and the Core $g$ -mode in Proto-Compact Stars – A detailed analysis | Pia Jakobus et.al. | 2405.01449 | null |
2024-05-02 | MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors | Yuan Tang et.al. | 2405.01413 | link |
2024-05-02 | MUSE observations of small-scale heating events | C. A. Breu et.al. | 2405.01384 | null |
2024-05-02 | NeRF in Robotics: A Survey | Guangming Wang et.al. | 2405.01333 | null |
2024-05-02 | Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning | Liu Qiyuan et.al. | 2405.01284 | null |
2024-05-02 | Satellite lines from autoionizing states of Fe XVI and the problems with the X-ray Fe XVII lines | G. Del Zanna et.al. | 2405.01274 | null |
2024-05-02 | 2d Ising Critical Couplings from Quantum Gravity | Valentin Bonzom et.al. | 2405.01253 | null |
2024-05-02 | Domain-Transferred Synthetic Data Generation for Improving Monocular Depth Estimation | Seungyeop Lee et.al. | 2405.01113 | null |
2024-05-02 | Sports Analysis and VR Viewing System Based on Player Tracking and Pose Estimation with Multimodal and Multiview Sensors | Wenxuan Guo et.al. | 2405.01112 | null |
2024-05-02 | Image segmentation of treated and untreated tumor spheroids by Fully Convolutional Networks | Matthias Streller et.al. | 2405.01105 | null |
2024-05-02 | Transformers Fusion across Disjoint Samples for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2405.01095 | null |
2024-05-14 | HandS3C: 3D Hand Mesh Reconstruction with State Space Spatial Channel Attention from RGB images | Zixun Jiao et.al. | 2405.01066 | null |
2024-05-02 | A text-based, generative deep learning model for soil reflectance spectrum simulation in the VIS-NIR (400-2499 nm) bands | Tong Lei et.al. | 2405.01060 | link |
2024-05-02 | Evolution of multiple closed knotted curves in space | Miroslav Kolar et.al. | 2405.01038 | null |
2024-05-09 | Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields | Yuhang Huang et.al. | 2405.00998 | null |
2024-05-02 | Efficient Data-driven Scene Simulation using Robotic Surgery Videos via Physics-embedded 3D Gaussians | Zhenya Yang et.al. | 2405.00956 | null |
2024-05-02 | X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation | Yiwei Ma et.al. | 2405.00954 | null |
2024-05-02 | Hyperspectral Band Selection based on Generalized 3DTV and Tensor CUR Decomposition | Katherine Henneberger et.al. | 2405.00951 | null |
2024-05-02 | Virtual Psychedelia | Jacob Yenney et.al. | 2405.00938 | null |
2024-05-03 | Identifying Halos in Cosmological Simulations with Continuous Wavelet Analysis: The 2D Case | Minxing Li et.al. | 2405.00920 | null |
2024-05-02 | EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion | Guangyao Zhai et.al. | 2405.00915 | null |
2024-05-04 | LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun et.al. | 2405.00900 | null |
2024-05-01 | Stepwise ionization of Mo $^{14+}$ ions in EBIT: The importance of the metastable level | Cunqiang Wu et.al. | 2405.00893 | null |
2024-05-01 | Implementation of a Mesh refinement algorithm into the quasi-static PIC code QuickPIC | Q. Su et.al. | 2405.00886 | null |
2024-05-01 | Evidence of 1+1D photorefractive stripe solitons deep in the Kerr limit | Ludovica Falsi et.al. | 2405.00883 | null |
2024-05-01 | Subleading analysis for $S^3$ partition functions of $\mathcal{N}=2$ holographic SCFTs | Seppe Geukens et.al. | 2405.00845 | null |
2024-05-01 | Numerical investigation of three-dimensional effects of cavitating flow in a venturi-type hydrodynamic cavitation reactor | Dhruv Apte et.al. | 2405.00831 | null |
2024-05-01 | Explosively driven Richtmyer–Meshkov instability jet suppression and enhancement via coupling machine learning and additive manufacturing | Dane M. Sterbentz et.al. | 2405.00812 | null |
2024-05-01 | Coherent 3D Portrait Video Reconstruction via Triplane Fusion | Shengze Wang et.al. | 2405.00794 | null |
2024-05-01 | Spectrally Pruned Gaussian Fields with Neural Compensation | Runyi Yang et.al. | 2405.00676 | link |
2024-05-01 | TexSliders: Diffusion-Based Texture Editing in CLIP Space | Julia Guerrero-Viu et.al. | 2405.00672 | null |
2024-05-01 | Towards quantum gravity with neural networks: Solving quantum Hamilton constraints of 3d Euclidean gravity in the weak coupling limit | Hanno Sahlmann et.al. | 2405.00661 | null |
2024-05-01 | Interplay between domain walls and magnetization curling induced by chemical modulations in cylindrical nanowires | L. Alvaro-Gómez et.al. | 2405.00652 | null |
2024-05-13 | Depth Priors in Removal Neural Radiance Fields | Zhihao Guo et.al. | 2405.00630 | null |
2024-05-01 | Resolution analysis of magnetically arrested disk simulations | Leon Sosapanta Salas et.al. | 2405.00564 | null |
2024-05-01 | Long-Term Human Trajectory Prediction using 3D Dynamic Scene Graphs | Nicolas Gorlo et.al. | 2405.00552 | link |
2024-05-01 | 3D MR Fingerprinting for Dynamic Contrast-Enhanced Imaging of Whole Mouse Brain | Yuran Zhu et.al. | 2405.00513 | null |
2024-05-01 | Beyond the random phase approximation for calculating Curie temperatures in ferromagnets: application to Fe, Ni, Co and monolayer CrI3 | Varun Rajeev Pavizhakumari et.al. | 2405.00477 | null |
2024-05-01 | Continuous sPatial-Temporal Deformable Image Registration (CPT-DIR) for motion modelling in radiotherapy: beyond classic voxel-based methods | Xia Li et.al. | 2405.00430 | null |
2024-05-01 | Planar Hall Effect in Quasi-Two-Dimensional Materials | Koushik Ghorai et.al. | 2405.00379 | null |
2024-05-01 | NC-SDF: Enhancing Indoor Scene Reconstruction Using Neural SDFs with View-Dependent Normal Compensation | Ziyi Chen et.al. | 2405.00340 | null |
2024-05-01 | Canonized then Minimized RMSD for Three-Dimensional Structures | Jie Li et.al. | 2405.00339 | null |
2024-05-01 | FPGA Digital Dice using Pseudo Random Number Generator | Michael Lim Kee Hian et.al. | 2405.00308 | null |
2024-04-30 | GMC-PINNs: A new general Monte Carlo PINNs method for solving fractional partial differential equations on irregular domains | Shupeng Wang et.al. | 2405.00217 | null |
2024-04-30 | Using sunRunner3D to interpret the global structure of the heliosphere from in situ measurements | José Juan González-Avilés et.al. | 2405.00174 | null |
2024-04-30 | Planetary Nebula NGC 2818: Revealing its complex 3D morphology | Sophia Derlopa et.al. | 2405.00169 | null |
2024-04-30 | Gravitational Stress Tensor and Current at Null Infinity in Three Dimensions | H. Adami et.al. | 2405.00149 | null |
2024-04-30 | Greater benefits of deep learning-based computer-aided detection systems for finding small signals in 3D volumetric medical images | Devi Klein et.al. | 2405.00144 | null |
2024-05-02 | Utilizing Machine Learning and 3D Neuroimaging to Predict Hearing Loss: A Comparative Analysis of Dimensionality Reduction and Regression Techniques | Trinath Sai Subhash Reddy Pittala et.al. | 2405.00142 | null |
2024-04-30 | Unveiling the Physics of Neutron Stars: A 3D expedition into MAgneto-Thermal evolution in Isolated Neutron Stars with MATINS | Clara Dehman et.al. | 2405.00133 | null |
2024-04-30 | A Flexible 2.5D Medical Image Segmentation Approach with In-Slice and Cross-Slice Attention | Amarjeet Kumar et.al. | 2405.00130 | link |
2024-04-30 | A 3D view on the local gravitational instability of cold gas discs in star-forming galaxies at $0 \lesssim \mathrm{z} \lesssim 5$ | C. Bacchini et.al. | 2405.00103 | null |
2024-04-25 | Microstructural and Transport Characteristics of Triply Periodic Bicontinuous Materials | Salvatore Torquato et.al. | 2405.00057 | null |
2024-04-30 | Generalized Symmetries in 2D from String Theory: SymTFTs, Intrinsic Relativeness, and Anomalies of Non-invertible Symmetries | Sebastian Franco et.al. | 2404.19761 | null |
2024-04-30 | Lightplane: Highly-Scalable Components for Neural 3D Fields | Ang Cao et.al. | 2404.19760 | link |
2024-04-30 | Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting | Paul Engstler et.al. | 2404.19758 | null |
2024-04-30 | Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation | Yunhao Ge et.al. | 2404.19752 | null |
2024-04-30 | Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation | Ian Dunn et.al. | 2404.19739 | link |
2024-05-09 | RTG-SLAM: Real-time 3D Reconstruction at Scale using Gaussian Splatting | Zhexi Peng et.al. | 2404.19706 | null |
2024-04-30 | GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting | Kai Zhang et.al. | 2404.19702 | null |
2024-04-30 | Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners | Chun Feng et.al. | 2404.19696 | null |
2024-04-27 | Fast and label-free 3D virtual H&E histology via active modulation-assisted dynamic full-field OCT | Zichen Yin et.al. | 2404.19641 | null |
2024-04-30 | ESP-Zero: Unsupervised enhancement of zero-shot classification for Extremely Sparse Point cloud | Jiayi Han et.al. | 2404.19639 | null |
2024-04-30 | SpComm3D: A Framework for Enabling Sparse Communication in 3D Sparse Kernels | Nabil Abubaker et.al. | 2404.19638 | link |
2024-04-30 | Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis | Shivam Mehta et.al. | 2404.19622 | null |
2024-04-30 | X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models | Emmanuelle Bourigault et.al. | 2404.19604 | null |
2024-04-30 | Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging | Rayan Armani et.al. | 2404.19541 | link |
2024-04-30 | MicroDreamer: Zero-shot 3D Generation in $\sim$ 20 Seconds by Score-based Iterative Reconstruction | Luxi Chen et.al. | 2404.19525 | link |
2024-04-30 | Sensorized Soft Skin for Dexterous Robotic Hands | Jana Egli et.al. | 2404.19448 | null |
2024-05-01 | Neuro-Vision to Language: Image Reconstruction and Language enabled Interaction via Brain Recordings | Guobin Shen et.al. | 2404.19438 | null |
2024-05-02 | 3D Gaussian Blendshapes for Head Avatar Animation | Shengjie Ma et.al. | 2404.19398 | null |
2024-04-30 | Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection | Zhanwei Zhang et.al. | 2404.19384 | null |
2024-04-30 | A coupled fluid-dynamics-heat transfer model for 3D simulations of the aqueous humor flow in the human eye | Thomas Saigre et.al. | 2404.19353 | null |
2024-04-30 | Assessment of physical schemes for WRF model in convection-permitting mode over southern Iberian Peninsula | Feliciano Solano-Farías et.al. | 2404.19327 | null |
2024-04-30 | Three-dimensional plasmoid-mediated reconnection and turbulence in Hall magnetohydrodynamics | Yi-Min Huang et.al. | 2404.19285 | null |
2024-04-30 | Quater-GCN: Enhancing 3D Human Pose Estimation with Orientation and Semi-supervised Training | Xingyu Song et.al. | 2404.19279 | link |
2024-04-30 | Transcrib3D: 3D Referring Expression Resolution through Large Language Models | Jiading Fang et.al. | 2404.19221 | null |
2024-04-30 | NeRF-Insert: 3D Local Editing with Multimodal Control Signals | Benet Oriol Sabat et.al. | 2404.19204 | null |
2024-04-30 | PEVA-Net: Prompt-Enhanced View Aggregation Network for Zero/Few-Shot Multi-View 3D Shape Recognition | Dongyun Lin et.al. | 2404.19168 | null |
2024-04-29 | Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks | Javier Antoran et.al. | 2404.19157 | null |
2024-05-01 | Room temperature realization of artificial chiral magnets with reprogrammable magnon nonreciprocity at zero field | Mingran Xu et.al. | 2404.19153 | null |
2024-04-29 | SAGS: Structure-Aware 3D Gaussian Splatting | Evangelos Ververas et.al. | 2404.19149 | null |
2024-04-29 | Evaluating Deep Clustering Algorithms on Non-Categorical 3D CAD Models | Siyuan Xiang et.al. | 2404.19134 | null |
2024-05-03 | On the Pair-Instability Supernova origin of J1010+2358 | Ása Skúladóttir et.al. | 2404.19086 | null |
2024-04-29 | Faraday tomography of LoTSS-DR2 data: II. Multi-tracer analysis in the high-latitude outer Galaxy | Ana Erceg et.al. | 2404.19068 | null |
2024-04-29 | GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting | Bo Chen et.al. | 2404.19040 | null |
2024-04-29 | Embedded Representation Learning Network for Animating Styled Video Portrait | Tianyong Wang et.al. | 2404.19038 | null |
2024-04-29 | MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing | Cong Wang et.al. | 2404.19026 | null |
2024-04-29 | DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing | Minghao Chen et.al. | 2404.18929 | null |
2024-04-29 | Point Cloud Models Improve Visual Robustness in Robotic Learners | Skand Peri et.al. | 2404.18926 | null |
2024-04-29 | 3D Mapping of Glacier Moulins: Challenges and lessons learned | William Dubois et.al. | 2404.18790 | null |
2024-04-29 | Risk-Aware Coverage Path Planning for Lunar Micro-Rovers Leveraging Global and Local Environmental Data | Shreya Santra et.al. | 2404.18721 | null |
2024-05-12 | Bootstrap 3D Reconstructed Scenes from 3D Gaussian Splatting | Yifei Gao et.al. | 2404.18669 | null |
2024-04-29 | Leveraging PointNet and PointNet++ for Lyft Point Cloud Classification Challenge | Rajat K. Doshi et.al. | 2404.18665 | null |
2024-04-29 | A comprehensive study of nonlinear perturbations in the dynamics of planar crack fronts | Itamar Kolvin et.al. | 2404.18633 | null |
2024-04-29 | Self-Avatar Animation in Virtual Reality: Impact of Motion Signals Artifacts on the Full-Body Pose Reconstruction | Antoine Maiorca et.al. | 2404.18628 | null |
2024-04-29 | Patterning of 2D second harmonic generation active arrays in ferroelectric nematic fluids | M. Lovšin et.al. | 2404.18619 | null |
2024-04-29 | CSTalk: Correlation Supervised Speech-driven 3D Emotional Facial Animation Generation | Xiangyu Liang et.al. | 2404.18604 | null |
2024-04-29 | Self-supervised learning for classifying paranasal anomalies in the maxillary sinus | Debayan Bhattacharya et.al. | 2404.18599 | link |
2024-04-29 | Non-convex Pose Graph Optimization in SLAM via Proximal Linearized Riemannian ADMM | Xin Chen et.al. | 2404.18560 | null |
2024-04-29 | Towards Long-term Robotics in the Wild | Stephen Hausler et.al. | 2404.18477 | null |
2024-04-29 | Direct observation of anisotropic Cooper pairing in kagome superconductor CsV3Sb5 | Akifumi Mine et.al. | 2404.18472 | null |
2024-04-29 | Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild | Donggyun Kim et.al. | 2404.18459 | link |
2024-04-29 | 3D Gaussian Splatting with Deferred Reflection | Keyang Ye et.al. | 2404.18454 | link |
2024-04-29 | $ν$ -DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction | Yunxuan Mao et.al. | 2404.18439 | null |
2024-04-29 | Uncovering an Interfacial Band Resulting from Orbital Hybridization in Nickelate Heterostructures | Mingyao Chen et.al. | 2404.18412 | null |
2024-04-29 | Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle | Jungwoo Lee et.al. | 2404.18395 | null |
2024-04-29 | Reconstructing Satellites in 3D from Amateur Telescope Images | Zhiming Chang et.al. | 2404.18394 | null |
2024-05-03 | Object Registration in Neural Fields | David Hall et.al. | 2404.18381 | null |
2024-04-29 | Helical Phononic Modes Induced by a Screw Dislocation | Yun Zhou et.al. | 2404.18347 | null |
2024-04-28 | Panoptic Segmentation and Labelling of Lumbar Spine Vertebrae using Modified Attention Unet | Rikathi Pal et.al. | 2404.18291 | null |
2024-04-28 | The Gravitational Lensing Imprints of DES Y3 Superstructures on the CMB: A Matched Filtering Approach | Umut Demirbozan et.al. | 2404.18278 | link |
2024-04-28 | LEGENT: Open Platform for Embodied Agents | Zhili Cheng et.al. | 2404.18243 | null |
2024-04-30 | Quadruped robot traversing 3D complex environments with limited perception | Yi Cheng et.al. | 2404.18225 | null |
2024-04-28 | Enhancing Action Recognition from Low-Quality Skeleton Data via Part-Level Knowledge Distillation | Cuiwei Liu et.al. | 2404.18206 | null |
2024-04-28 | LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM | Zicheng Zhang et.al. | 2404.18203 | link |
2024-04-28 | Block-Map-Based Localization in Large-Scale Environment | Yixiao Feng et.al. | 2404.18192 | null |
2024-04-28 | Compressed Deepfake Video Detection Based on 3D Spatiotemporal Trajectories | Zongmei Chen et.al. | 2404.18149 | null |
2024-05-04 | MMAC-Copilot: Multi-modal Agent Collaboration Operating System Copilot | Zirui Song et.al. | 2404.18074 | null |
2024-04-28 | Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model | Xiaolong Li et.al. | 2404.18065 | null |
2024-04-28 | Liouville type theorems for the 3D stationary MHD and Hall-MHD equations with non-zero constant vectors at infinity | Wendong Wang et.al. | 2404.18051 | null |
2024-04-28 | Pose-aware 3D Beamwidth Adaptation for Mobile Extended Reality | Alperen Duru et.al. | 2404.18042 | null |
2024-04-27 | Reduced-order modeling of neutron transport separated in axial and radial space by Proper Generalized Decomposition with applications to nuclear reactor physics | Kurt A. Dominesey et.al. | 2404.18016 | null |
2024-04-27 | FRAME: A Modular Framework for Autonomous Map-merging: Advancements in the Field | Nikolaos Stathoulopoulos et.al. | 2404.18006 | null |
2024-04-27 | MinBackProp – Backpropagating through Minimal Solvers | Diana Sungatullina et.al. | 2404.17993 | link |
2024-04-27 | HVOFusion: Incremental Mesh Reconstruction Using Hybrid Voxel Octree | Shaofan Liu et.al. | 2404.17974 | link |
2024-04-27 | Over-the-Air Fusion of Sparse Spatial Features for Integrated Sensing and Edge AI over Broadband Channels | Zhiyan Liu et.al. | 2404.17973 | null |
2024-04-27 | Open-Set 3D Semantic Instance Maps for Vision Language Navigation – O3D-SIM | Laksh Nanwani et.al. | 2404.17922 | link |
2024-04-27 | Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection | Farzad Nozarian et.al. | 2404.17910 | link |
2024-04-27 | 3D Extended Object Tracking by Fusing Roadside Sparse Radar Point Clouds and Pixel Keypoints | Jiayin Deng et.al. | 2404.17903 | link |
2024-04-27 | Vision-based Discovery of Nonlinear Dynamics for 3D Moving Target | Zitong Zhang et.al. | 2404.17865 | null |
2024-04-27 | Spatial, Temporal, and Geometric Fusion for Remote Sensing Images | Hessah Albanwan et.al. | 2404.17851 | null |
2024-04-27 | Instance-free Text to Point Cloud Localization with Relative Position Awareness | Lichao Wang et.al. | 2404.17845 | null |
2024-04-27 | Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs | Yiming Bao et.al. | 2404.17837 | null |
2024-04-27 | Efficient Bi-manipulation using RGBD Multi-model Fusion based on Attention Mechanism | Jian Shen et.al. | 2404.17811 | null |
2024-04-30 | High-quality Surface Reconstruction using Gaussian Surfels | Pinxuan Dai et.al. | 2404.17774 | null |
2024-04-27 | Development of an Estimation Method for the Seismic Motion Reproducibility of a Three-dimensional Ground Structure Model by combining Surface-observed Seismic Motion and Three-dimensional Seismic Motion Analysis | Tsuyoshi Ichimura et.al. | 2404.17754 | null |
2024-04-26 | Learning Manipulation Tasks in Dynamic and Shared 3D Spaces | Hariharan Arunachalam et.al. | 2404.17673 | link |
2024-04-26 | BlenderAlchemy: Editing 3D Graphics with Vision-Language Models | Ian Huang et.al. | 2404.17672 | null |
2024-04-26 | A First Look at Spatially Resolved Star Formation at $4.8<z<6.5$ with JWST FRESCO NIRCam Slitless Spectroscopy | Jasleen Matharu et.al. | 2404.17629 | null |
2024-04-25 | Synthesizing Audio from Silent Video using Sequence to Sequence Modeling | Hugo Garrido-Lestache Belinchon et.al. | 2404.17608 | link |
2024-04-26 | MaPa: Text-driven Photorealistic Material Painting for 3D Shapes | Shangzhan Zhang et.al. | 2404.17569 | null |
2024-04-26 | Integrating UAV-Enabled Base Stations in 3D Networks: QoS-Aware Joint Fronthaul and Backhaul Design | Salim Janji et.al. | 2404.17547 | null |
2024-04-26 | Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields | Tianqi Liu et.al. | 2404.17528 | link |
2024-04-26 | TextGaze: Gaze-Controllable Face Generation with Natural Language | Hengfei Wang et.al. | 2404.17486 | null |
2024-04-26 | Holographic $\frac{1}{2}$ -BPS surface defects in ABJM | Yolanda Lozano et.al. | 2404.17469 | null |
2024-04-26 | Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation | Seungwook Kim et.al. | 2404.17419 | null |
2024-04-26 | Part-Guided 3D RL for Sim2Real Articulated Object Manipulation | Pengwei Xie et.al. | 2404.17302 | link |
2024-04-26 | Automatic Target-Less Camera-LiDAR Calibration From Motion and Deep Point Correspondences | Kürsat Petek et.al. | 2404.17298 | link |
2024-04-26 | Camera Motion Estimation from RGB-D-Inertial Scene Flow | Samuel Cerezo et.al. | 2404.17251 | link |
2024-04-26 | Broken time reversal symmetry vestigial state for a two-component superconductor in two spatial dimensions | P. T. How et.al. | 2404.17239 | null |
2024-04-26 | Enhancing mmWave Radar Point Cloud via Visual-inertial Supervision | Cong Fan et.al. | 2404.17229 | link |
2024-04-26 | Finite-time blowup for Keller-Segel-Navier-Stokes system in three dimensions | Zexing Li et.al. | 2404.17228 | null |
2024-04-26 | SLAM for Indoor Mapping of Wide Area Construction Environments | Vincent Ress et.al. | 2404.17215 | null |
2024-04-26 | Rytov Approximation of Vectorial Waves by Modifying Scattering Matrixes: Precise Reconstruction of Dielectric Tensor Tomography | ChulMin Oh et.al. | 2404.17206 | null |
2024-04-26 | Construction of a new (3 + 1)-dimensional KdV equation and its closed-form solutions with solitary wave behaviour and conserved vectors | Nardjess Benoudina et.al. | 2404.17156 | null |
2024-04-26 | Pose-Specific 3D Fingerprint Unfolding | Xiongjun Guan et.al. | 2404.17149 | null |
2024-04-26 | Finite volume simulation of a semi-linear Neumann problem (Keller-Segel model) on rectangular domains | Nardjess Benoudina et.al. | 2404.17145 | null |
2024-04-26 | Simple Network Mechanism Leads to Quasi-Real Brain Activation Patterns with Drosophila Connectome | Xiaoyu Zhang et.al. | 2404.17128 | null |
2024-04-26 | Localization of Pallets on Shelves Using Horizontal Plane Projection of a 360-degree Image | Yasuyo Kita et.al. | 2404.17118 | null |
2024-04-25 | Towards edge engineering of two-dimensional layered transition-metal dichalcogenides by chemical vapor deposition | Wei Fu et.al. | 2404.17074 | null |
2024-04-25 | Bootstrapping the Abelian Lattice Gauge Theories | Zhijin Li et.al. | 2404.17071 | null |
2024-04-25 | Frozen-field Modeling of Coronal Condensations with MPI-AMRVAC I: Demonstration in two-dimensional models | Yuhao Zhou et.al. | 2404.17056 | null |
2024-04-25 | Generative AI in Color-Changing Systems: Re-Programmable 3D Object Textures with Material and Design Constraints | Yunyi Zhu et.al. | 2404.17028 | null |
2024-04-25 | Defect Localization Using Region of Interest and Histogram-Based Enhancement Approaches in 3D-Printing | Md Manjurul Ahsan et.al. | 2404.17015 | null |
2024-04-25 | Higgs Phases and Boundary Criticality | Kristian Tyn Kai Chung et.al. | 2404.17001 | null |
2024-04-25 | Partial absence of cosine problem in 3d Lorentzian spin foams | Alexander F. Jercher et.al. | 2404.16943 | null |
2024-04-27 | The Third Monocular Depth Estimation Challenge | Jaime Spencer et.al. | 2404.16831 | null |
2024-04-25 | Double Copy of 3D Chern-Simons Theory and 6D Kodaira-Spencer Gravity | Roberto Bonezzi et.al. | 2404.16830 | null |
2024-04-29 | Make-it-Real: Unleashing Large Multimodal Model’s Ability for Painting 3D Objects with Realistic Materials | Ye Fang et.al. | 2404.16829 | null |
2024-04-25 | Transformer-Based Local Feature Matching for Multimodal Image Registration | Remi Delaunay et.al. | 2404.16802 | null |
2024-04-25 | RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis | Xiaoman Zhang et.al. | 2404.16754 | link |
2024-04-25 | TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation | Sai Kumar Dwivedi et.al. | 2404.16752 | link |
2024-04-25 | TELA: Text to Layer-wise 3D Clothed Human Generation | Junting Dong et.al. | 2404.16748 | null |
2024-04-25 | Log-normal glide and the formation of misfit dislocation networks in heteroepitaxial ZnS on GaP | Alexandra Fonseca Montenegro et.al. | 2404.16714 | null |
2024-04-25 | Multi-view Cardiac Image Segmentation via Trans-Dimensional Priors | Abbas Khan et.al. | 2404.16708 | null |
2024-04-25 | Dimensional Crossover of Microscopic Magnetic Metasurfaces for Magnetic Field Amplification | N. Lejeune et.al. | 2404.16700 | null |
2024-04-28 | PhyRecon: Physically Plausible Neural Scene Reconstruction | Junfeng Ni et.al. | 2404.16666 | null |
2024-04-25 | Design optimization of advanced tow-steered composites with manufacturing constraints | Chuan Luo et.al. | 2404.16650 | null |
2024-04-25 | Efficient Solution of Point-Line Absolute Pose | Petr Hruby et.al. | 2404.16552 | link |
2024-04-25 | Cross-Domain Spatial Matching for Camera and Radar Sensor Data Fusion in Autonomous Vehicle Perception System | Daniel Dworak et.al. | 2404.16548 | null |
2024-04-25 | Implementation of matrix compression in the coupling of JOREK to realistic 3D conducting wall structures | Federico Cipolletta et.al. | 2404.16546 | null |
2024-04-25 | OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images | Ye Mao et.al. | 2404.16538 | link |
2024-04-25 | 3D Face Modeling via Weakly-supervised Disentanglement Network joint Identity-consistency Prior | Guohao Li et.al. | 2404.16536 | link |
2024-04-25 | 3D deep learning for enhanced atom probe tomography analysis of nanoscale microstructures | Jiwei Yu et.al. | 2404.16524 | null |
2024-04-25 | Interactive3D: Create What You Want by Interactive 3D Generation | Shaocong Dong et.al. | 2404.16510 | null |
2024-04-25 | Commonsense Prototype for Outdoor Unsupervised 3D Object Detection | Hai Wu et.al. | 2404.16493 | link |
2024-04-25 | Depth Supervised Neural Surface Reconstruction from Airborne Imagery | Vincent Hackstein et.al. | 2404.16429 | null |
2024-04-25 | Neural Assembler: Learning to Generate Fine-Grained Robotic Assembly Instructions from Multi-View Images | Hongyu Yan et.al. | 2404.16423 | null |
2024-04-25 | Robust Fine-tuning for Pre-trained 3D Point Cloud Models | Zhibo Zhang et.al. | 2404.16422 | null |
2024-04-25 | DIG3D: Marrying Gaussian Splatting with Deformable Transformer for Single Image 3D Reconstruction | Jiamin Wu et.al. | 2404.16323 | link |
2024-04-25 | 3D Guidance Law for Maximal Coverage and Target Enclosing with Inherent Safety | Praveen Kumar Ranjan et.al. | 2404.16312 | null |
2024-04-25 | BezierFormer: A Unified Architecture for 2D and 3D Lane Detection | Zhiwei Dong et.al. | 2404.16304 | null |
2024-04-24 | 3D Human Pose Estimation with Occlusions: Introducing BlendMimic3D Dataset and GCN Refinement | Filipa Lino et.al. | 2404.16136 | link |
2024-04-24 | Implementation of Immersed Boundaries through Volume Penalization in the Industrial Aeronautical Solver CODA | Jonatan Nunez et.al. | 2404.16132 | null |
2024-04-25 | GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting | Kyusun Cho et.al. | 2404.16012 | link |
2024-04-24 | On the Fourier analysis in the SO(3) space : EquiLoPO Network | Dmitrii Zhemchuzhnikov et.al. | 2404.15979 | null |
2024-04-24 | The State of the Art in Visual Analytics for 3D Urban Data | Fabio Miranda et.al. | 2404.15976 | null |
2024-04-26 | A Survey on Visual Mamba | Hanwei Zhang et.al. | 2404.15956 | null |
2024-04-25 | OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation | Lizhi Wang et.al. | 2404.15891 | link |
2024-04-24 | Revisiting Out-of-Distribution Detection in LiDAR-based 3D Object Detection | Michael Kösel et.al. | 2404.15879 | link |
2024-04-24 | 3D Freehand Ultrasound using Visual Inertial and Deep Inertial Odometry for Measuring Patellar Tracking | Russell Buchanan et.al. | 2404.15847 | null |
2024-04-24 | Partial Renormalization of Quasiparticle Interactions | Kun Chen et.al. | 2404.15844 | null |
2024-04-24 | Signatures of Conformal Symmetry in the Dynamics of Quantum Gases: A Cyclic Quantum State and Entanglement Entropy | Jeff Maki et.al. | 2404.15827 | null |
2024-04-24 | Three-dimensional thermodynamic structures of the intracluster medium across edges in the X-ray surface brightness of massive, bright, dynamically-active galaxy clusters | Shutaro Ueda et.al. | 2404.15824 | null |
2024-04-24 | 3D Face Morphing Attack Generation using Non-Rigid Registration | Jag Mohan Singh et.al. | 2404.15765 | null |
2024-04-25 | HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition | Jinfu Liu et.al. | 2404.15719 | link |
2024-04-24 | Mitigating False Predictions In Unreasonable Body Regions | Constantin Ulrich et.al. | 2404.15718 | null |
2024-04-24 | A Hard Energy Spectrum in 3D Guide-Field Magnetic Reconnection | Masahiro Hoshino et.al. | 2404.15662 | null |
2024-04-24 | Building-PCC: Building Point Cloud Completion Benchmarks | Weixiao Gao et.al. | 2404.15644 | link |
2024-04-24 | Self-generated magnetic field in three-dimensional ablative Rayleigh-Taylor instability | Dehua Zhang et.al. | 2404.15642 | null |
2024-04-24 | MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis | Jiaxin Zhuang et.al. | 2404.15580 | null |
2024-04-24 | Jitter Characterization of the HyTI Satellite | Chase Urasaki et.al. | 2404.15575 | link |
2024-04-23 | Self-gravitating solutions in Yang-Mills-Chern-Simons theory coupled to 3D massive gravity | Cristóbal Corral et.al. | 2404.15569 | null |
2024-04-23 | DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft | Sam Earle et.al. | 2404.15538 | null |
2024-04-23 | The Ability of Virtual Reality Technologies to Improve Comprehension of Speech Therapy Device Training | Daniel E. Killough et.al. | 2404.15534 | null |
2024-04-23 | Numerical study of transitions in lid-driven flows in semicircular cavities | Tsorng-Whay Pan et.al. | 2404.15514 | null |
2024-04-23 | OffRAMPS: An FPGA-based Intermediary for Analysis and Modification of Additive Manufacturing Control Systems | Jason Blocklove et.al. | 2404.15446 | null |
2024-04-23 | WANDR: Intention-guided Human Motion Generation | Markos Diomataris et.al. | 2404.15383 | null |
2024-04-30 | Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint Distributions | Khai Nguyen et.al. | 2404.15378 | link |
2024-04-18 | High-accurate and efficient numerical algorithms for the self-consistent field theory of liquid-crystalline polymers | Liwei Tan et.al. | 2404.15363 | null |
2024-04-23 | SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation | Xiangyu Xu et.al. | 2404.15276 | link |
2024-04-29 | CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios | Jingyang Lin et.al. | 2404.15272 | null |
2024-04-23 | TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting | Jiahe Li et.al. | 2404.15264 | link |
2024-05-07 | Towards field theory of multiple D0-branes. Hamiltonian mechanics and quantization of simplest 3D prototype of multiple D0-brane system | Igor Bandos et.al. | 2404.15233 | null |
2024-04-23 | Re-Thinking Inverse Graphics With Large Language Models | Peter Kulits et.al. | 2404.15228 | null |
2024-04-23 | Deep Models for Multi-View 3D Object Recognition: A Review | Mona Alzahrani et.al. | 2404.15224 | null |
2024-04-21 | Socratic Planner: Inquiry-Based Zero-Shot Planning for Embodied Instruction Following | Suyeon Shin et.al. | 2404.15190 | null |
2024-04-23 | Statistics of three-dimensional black holes from Liouville line defects | Jeevan Chandra et.al. | 2404.15183 | null |
2024-04-23 | Neural Slicer for Multi-Axis 3D Printing | Tao Liu et.al. | 2404.15061 | null |
2024-04-23 | PRISM: A Promptable and Robust Interactive Segmentation Model with Visual Prompts | Hao Li et.al. | 2404.15028 | link |
2024-04-23 | OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving | Guoqing Wang et.al. | 2404.15014 | null |
2024-04-23 | X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition | Shuofeng Sun et.al. | 2404.15010 | link |
2024-05-04 | Unknown Object Grasping for Assistive Robotics | Elle Miller et.al. | 2404.15001 | null |
2024-04-23 | CenterArt: Joint Shape Reconstruction and 6-DoF Grasp Estimation of Articulated Objects | Sassan Mokhtar et.al. | 2404.14968 | link |
2024-04-23 | CoARF: Controllable 3D Artistic Style Transfer for Radiance Fields | Deheng Zhang et.al. | 2404.14967 | null |
2024-04-23 | Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model | Xu Han et.al. | 2404.14966 | link |
2024-04-23 | Device-Free 3D Drone Localization in RIS-Assisted mmWave MIMO Networks | Jiguang He et.al. | 2404.14879 | null |
2024-04-23 | One-Pass Randomized Algorithm with Practical Rangefinder for Low-Rank Approximation to Quaternion Matrices | Chao Chang et.al. | 2404.14783 | link |
2024-04-23 | ContextualFusion: Context-Based Multi-Sensor Fusion for 3D Object Detection in Adverse Operating Conditions | Shounak Sural et.al. | 2404.14780 | null |
2024-04-23 | Think-Program-reCtify: 3D Situated Reasoning with Large Language Models | Qingrong He et.al. | 2404.14705 | null |
2024-04-23 | 3DBench: A Scalable 3D Benchmark and Instruction-Tuning Dataset | Junjie Zhang et.al. | 2404.14678 | link |
2024-04-23 | DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance | Linxuan Xin et.al. | 2404.14676 | null |
2024-04-23 | LaneCorrect: Self-supervised Lane Detection | Ming Nie et.al. | 2404.14671 | null |
2024-04-23 | 3DFlowRenderer: One-shot Face Re-enactment via Dense 3D Facial Flow Estimation | Siddharth Nijhawan et.al. | 2404.14667 | null |
2024-04-23 | UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues | Vandad Davoodnia et.al. | 2404.14634 | null |
2024-04-22 | A Blueprint for the Milky Way’s Stellar Populations. V. 3D Local Dust Extinction | Deokkeun An et.al. | 2404.14626 | link |
2024-04-22 | Lifetime-Limited Linewidth Measurements of the 3C and 3D Soft X-ray Transitions in Ni XIX | Chintan Shah et.al. | 2404.14589 | null |
2024-04-22 | UVMap-ID: A Controllable and Personalized UV Map Generative Model | Weijie Wang et.al. | 2404.14568 | link |
2024-04-22 | “Where am I?” Scene Retrieval with Language | Jiaqi Chen et.al. | 2404.14565 | null |
2024-04-22 | Exploring the Potential of Data-Driven Spatial Audio Enhancement Using a Single-Channel Model | Arthur N. dos Santos et.al. | 2404.14564 | null |
2024-04-22 | Effect of biquadratic magnetic exchange interaction in the 2D antiferromagnets MPS_3 (M = Mn, Fe, Co, Ni) | Mohammad Amirabbasi et.al. | 2404.14553 | null |
2024-04-22 | Formation of low mass protostars and their circumstellar disks | Adnan Ali Ahmad et.al. | 2404.14496 | null |
2024-04-22 | Localisation without supersymmetry: towards exact results from Dirac structures in 3D $N = 0$ gauge theory | Alex S. Arvanitakis et.al. | 2404.14472 | null |
2024-04-19 | FreSeg: Frenet-Frame-based Part Segmentation for 3D Curvilinear Structures | Shixuan Gu et.al. | 2404.14435 | link |
2024-04-22 | Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses | Inhee Lee et.al. | 2404.14410 | null |
2024-04-22 | GeoDiffuser: Geometry-Based Image Editing with Diffusion Models | Rahul Sajnani et.al. | 2404.14403 | null |
2024-04-22 | Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Eric Brachmann et.al. | 2404.14351 | null |
2024-04-22 | Laser-written micro-channel atomic magnetometer | Andrea Zanoni et.al. | 2404.14345 | null |
2024-04-22 | X-Ray: A Sequential 3D Representation for Generation | Tao Hu et.al. | 2404.14329 | link |
2024-04-22 | LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots | Dongge Han et.al. | 2404.14285 | null |
2024-04-22 | RESFM: Robust Equivariant Multiview Structure from Motion | Fadi Khatib et.al. | 2404.14280 | null |
2024-04-22 | Electrical spin manipulation in double SrTiO $_3$/LaAlO$_3$ quantum dots | B. Szafran et.al. | 2404.14272 | null |
2024-04-22 | Quantum-Enhanced Neural Exchange-Correlation Functionals | Igor O. Sokolov et.al. | 2404.14258 | null |
2024-04-22 | CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Understanding | Guibiao Liao et.al. | 2404.14249 | link |
2024-04-22 | Optimal Multiparameter Metrology: The Quantum Compass Solution | Denis V. Vasilyev et.al. | 2404.14194 | null |
2024-04-22 | Bayesian Windkessel calibration using optimized 0D surrogate models | Jakob Richter et.al. | 2404.14187 | null |
2024-04-22 | Immersive Rover Control and Obstacle Detection based on Extended Reality and Artificial Intelligence | Sofía Coloma et.al. | 2404.14095 | null |
2024-04-22 | Microscale Fiber-Integrated Vector Magnetometer with On-Tip Field Biasing using NV Ensembles in Diamond Microcystals | Jonas Homrighausen et.al. | 2404.14089 | null |
2024-04-22 | CloudFort: Enhancing Robustness of 3D Point Cloud Classification Against Backdoor Attacks via Spatial Partitioning and Ensemble Prediction | Wenhao Lan et.al. | 2404.14042 | null |
2024-04-28 | GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Hongyun Yu et.al. | 2404.14037 | null |
2024-04-22 | OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks | Sophia Sirko-Galouchenko et.al. | 2404.14027 | link |
2024-04-22 | Finite element analysis of a spectral problem on curved meshes occurring in diffusion with high order boundary conditions | Fabien Caubet et.al. | 2404.13994 | null |
2024-04-22 | Importance of the semimetallic state for the quantum Hall effect in HfTe $_{5}$ | M. M. Piva et.al. | 2404.13969 | null |
2024-04-22 | Exploring Kinetic Curves Features for the Classification of Benign and Malignant Breast Lesions in DCE-MRI | Zixian Li et.al. | 2404.13929 | link |
2024-04-24 | MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets | Zeyu Li et.al. | 2404.13923 | null |
2024-04-22 | NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation | Chi Huang et.al. | 2404.13921 | null |
2024-04-22 | Angle-Aware Coverage with Camera Rotational Motion Control | Zhiyuan Lu et.al. | 2404.13915 | null |
2024-04-23 | CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory | Yunlong Ran et.al. | 2404.13896 | null |
2024-04-22 | PGAHum: Prior-Guided Geometry and Appearance Learning for High-Fidelity Animatable Human Reconstruction | Hao Wang et.al. | 2404.13862 | null |
2024-04-22 | Toward Robust LiDAR based 3D Object Detection via Density-Aware Adaptive Thresholding | Eunho Lee et.al. | 2404.13852 | null |
2024-04-22 | Co-evolution of dust grains and protoplanetary disks II: structure and evolution of protoplanetary disks; an analytical approach | Yusuke Tsukamoto et.al. | 2404.13843 | null |
2024-04-22 | On Support Relations Inference and Scene Hierarchy Graph Construction from Point Cloud in Clustered Environments | Gang Ma et.al. | 2404.13842 | null |
2024-04-26 | Neural Radiance Field in Autonomous Driving: A Survey | Lei He et.al. | 2404.13816 | null |
2024-04-22 | FaceFolds: Meshed Radiance Manifolds for Efficient Volumetric Rendering of Dynamic Faces | Safa C. Medin et.al. | 2404.13807 | null |
2024-04-21 | Mapping Phonon Polaritons with Visible Light | Kiernan E. Arledge et.al. | 2404.13759 | null |
2024-04-21 | Empirical stability criteria for 3D hierarchical triple systems I: Circumbinary planets | Nikolaos Georgakarakos et.al. | 2404.13746 | null |
2024-04-26 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711 | link |
2024-04-29 | Clio: Real-time Task-Driven Open-Set 3D Scene Graphs | Dominic Maggio et.al. | 2404.13696 | link |
2024-04-21 | A Complete System for Automated 3D Semantic-Geometric Mapping of Corrosion in Industrial Environments | Rui Pimentel de Figueiredo et.al. | 2404.13691 | null |
2024-04-21 | GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal | Yuxin Wang et.al. | 2404.13679 | null |
2024-04-21 | MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions | Sheng Yan et.al. | 2404.13657 | link |
2024-04-21 | The Lockman–SpReSO project. Main properties of infrared selected star-forming galaxies | Mauro González-Otero et.al. | 2404.13629 | null |
2024-04-21 | Towards Unified Representation of Multi-Modal Pre-training for 3D Understanding via Differentiable Rendering | Ben Fei et.al. | 2404.13619 | null |
2024-04-21 | Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments | Zirui Wang et.al. | 2404.13600 | null |
2024-04-21 | FSGe: A fast and strongly-coupled 3D fluid-solid-growth interaction method | Martin R. Pfaller et.al. | 2404.13523 | link |
2024-04-21 | Galactic Superbubbles in 3D: Wind Formation and Cloud Shielding | Osmer Suárez-López et.al. | 2404.13498 | null |
2024-04-20 | DMesh: A Differentiable Representation for General Meshes | Sanghyun Son et.al. | 2404.13445 | null |
2024-04-20 | High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | Baoru Huang et.al. | 2404.13437 | null |
2024-04-20 | Exploring Bi-Manual Teleportation in Virtual Reality | Siddhanth Raja Sindhupathiraja et.al. | 2404.13431 | link |
2024-04-20 | Magnetic properties of layered hybrid organic-inorganic metal-halide perovskites: transition metal, organic cation and perovskite phase effects | Yaiza Asensio et.al. | 2404.13403 | null |
2024-04-20 | 3D characterization of kinematic fields and poroelastic swelling near the tip of a propagating crack in a hydrogel | Chenzhuo Li et.al. | 2404.13331 | null |
2024-04-20 | 3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification | Shyam Varahagiri et.al. | 2404.13252 | link |
2024-04-20 | Shock waves in Interstellar Cloud-Cloud and Wind-Cloud Collisions | Sebastián Navarrete et.al. | 2404.13250 | null |
2024-04-20 | Mild solutions to the 3D-Boussinesq system with weakened initial temperature | Pedro Gabriel Fernández Dalgo et.al. | 2404.13243 | null |
2024-04-19 | Generic low-atmosphere signatures of swirled-anemone jets | Reetika Joshi et.al. | 2404.13171 | null |
2024-04-19 | ToNNO: Tomographic Reconstruction of a Neural Network’s Output for Weakly Supervised Segmentation of 3D Medical Images | Marius Schmidt-Mengin et.al. | 2404.13103 | null |
2024-04-19 | Unified Scene Representation and Reconstruction for 3D Large Language Models | Tao Chu et.al. | 2404.13044 | null |
2024-04-19 | PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation | Tianyuan Zhang et.al. | 2404.13026 | null |
2024-04-19 | RadRotator: 3D Rotation of Radiographs with Diffusion Models | Pouria Rouzrokh et.al. | 2404.13000 | null |
2024-04-19 | Ring-a-Pose: A Ring for Continuous Hand Pose Tracking | Tianhong Catherine Yu et.al. | 2404.12980 | null |
2024-04-19 | FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction | Maria Dronova et.al. | 2404.12970 | null |
2024-04-19 | A comparison between single-stage and two-stage 3D tracking algorithms for greenhouse robotics | David Rapado-Rincon et.al. | 2404.12963 | null |
2024-04-19 | Purposer: Putting Human Motion Generation in Context | Nicolas Ugrinovic et.al. | 2404.12942 | null |
2024-04-19 | Learn2Talk: 3D Talking Face Learns from 2D Talking Face | Yixiang Zhuang et.al. | 2404.12888 | null |
2024-04-19 | 3D Multi-frame Fusion for Video Stabilization | Zhan Peng et.al. | 2404.12887 | null |
2024-04-19 | Wrinkling instability of 3D auxetic bilayers in tension | Sairam Pamulaparthi Venkata et.al. | 2404.12873 | null |
2024-04-19 | Language-Driven Active Learning for Diverse Open-Set 3D Object Detection | Ross Greer et.al. | 2404.12856 | link |
2024-04-19 | A Point-Based Approach to Efficient LiDAR Multi-Task Perception | Christopher Lang et.al. | 2404.12798 | null |
2024-04-19 | MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model | Kang Zeng et.al. | 2404.12794 | link |
2024-04-19 | Contrastive Gaussian Clustering: Weakly Supervised 3D Scene Segmentation | Myrna C. Silva et.al. | 2404.12784 | null |
2024-04-19 | EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation | Wenkai Liu et.al. | 2404.12777 | null |
2024-04-19 | 2D synthetic ferrimagnets by magnetic proximity coupling | Paul Rosenberger et.al. | 2404.12749 | null |
2024-04-19 | Leading Giant graviton expansion of Schur correlators in large representations | Matteo Beccaria et.al. | 2404.12690 | null |
2024-04-19 | VoxAtnNet: A 3D Point Clouds Convolutional Neural Network for Generalizable Face Presentation Attack Detection | Raghavendra Ramachandra et.al. | 2404.12680 | null |
2024-04-19 | SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers | Vandad Davoodnia et.al. | 2404.12625 | null |
2024-04-22 | Does Gaussian Splatting need SFM Initialization? | Yalda Foroutan et.al. | 2404.12547 | null |
2024-04-18 | Three-dimensional Interaction between a Planet and an Isothermal Gaseous Disk. III. Locally Isothermal Cases | Hidekazu Tanaka et.al. | 2404.12521 | null |
2024-04-18 | Advancing Applications of Satellite Photogrammetry: Novel Approaches for Built-up Area Modeling and Natural Environment Monitoring using Stereo/Multi-view Satellite Image-derived 3D Data | Shengxi Gui et.al. | 2404.12487 | null |
2024-04-18 | Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds | Oliver Lemke et.al. | 2404.12440 | null |
2024-04-18 | MeshLRM: Large Reconstruction Model for High-Quality Mesh | Xinyue Wei et.al. | 2404.12385 | null |
2024-04-18 | G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis | Yufei Ye et.al. | 2404.12383 | null |
2024-04-22 | Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Monocular Videos | Isabella Liu et.al. | 2404.12379 | null |
2024-04-18 | 6Img-to-3D: Few-Image Large-Scale Outdoor Driving Scene Reconstruction | Théo Gieruc et.al. | 2404.12378 | link |
2024-04-18 | CRIRES $^+$ transmission spectroscopy of WASP-127b. Detection of the resolved signatures of a supersonic equatorial jet and cool poles in a hot planet | L. Nortmann et.al. | 2404.12363 | null |
2024-04-18 | Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models | Trevor J. Chan et.al. | 2404.12361 | null |
2024-04-18 | Inverse Neural Rendering for Explainable Multi-Object Tracking | Julian Ost et.al. | 2404.12359 | null |
2024-04-18 | Point-In-Context: Understanding Point Cloud via In-Context Learning | Mengyuan Liu et.al. | 2404.12352 | link |
2024-04-18 | Customizing Text-to-Image Diffusion with Camera Viewpoint Control | Nupur Kumari et.al. | 2404.12333 | null |
2024-04-21 | RISE: 3D Perception Makes Real-World Robot Imitation Simple and Effective | Chenxi Wang et.al. | 2404.12281 | null |
2024-04-18 | Food Portion Estimation via 3D Object Scaling | Gautham Vinod et.al. | 2404.12257 | link |
2024-04-18 | Omnidirectional 3D printing of PEDOT:PSS aerogels with tunable electromechanical performance for unconventional stretchable interconnects and thermoelectrics | Hasan Emre Baysal et.al. | 2404.12254 | null |
2024-04-18 | Traveling strings of active dipolar colloids | Xichen Chao et.al. | 2404.12218 | null |
2024-04-18 | Partial-to-Partial Shape Matching with Geometric Consistency | Viktoria Ehm et.al. | 2404.12209 | link |
2024-04-18 | Vortex motion in reconfigurable three-dimensional superconducting nanoarchitectures | Elina Zhakina et.al. | 2404.12151 | null |
2024-04-17 | Mushroom Segmentation and 3D Pose Estimation from Point Clouds using Fully Convolutional Geometric Features and Implicit Pose Encoding | George Retsinas et.al. | 2404.12144 | link |
2024-04-23 | MolCRAFT: Structure-Based Drug Design in Continuous Parameter Space | Yanru Qu et.al. | 2404.12141 | link |
2024-04-18 | Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models | Shouwei Ruan et.al. | 2404.12139 | null |
2024-04-18 | Developing Application Profiles for Enhancing Data and Workflows in Cultural Heritage Digitisation Processes | Sebastian Barzaghi et.al. | 2404.12069 | link |
2024-04-18 | PureForest: A Large-scale Aerial Lidar and Aerial Imagery Dataset for Tree Species Classification in Monospecific Forests | Charles Gaydon et.al. | 2404.12064 | link |
2024-04-18 | MIDGET: Music Conditioned 3D Dance Generation | Jinwu Wang et.al. | 2404.12062 | null |
2024-04-18 | Systematic search for islets of stability in the standard map for large parameter values | Alexandre R. Nieto et.al. | 2404.12027 | null |
2024-04-18 | Automated Real-Time Inspection in Indoor and Outdoor 3D Environments with Cooperative Aerial Robots | Andreas Anastasiou et.al. | 2404.12018 | null |
2024-04-18 | Comparing the three-dimensional morphological asymmetries in the ejecta of Kepler and Tycho in X-rays | Adrien Picquenot et.al. | 2404.12002 | null |
2024-04-18 | MultiPhys: Multi-Person Physics-aware 3D Motion Estimation | Nicolas Ugrinovic et.al. | 2404.11987 | null |
2024-04-18 | Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation | Song Wang et.al. | 2404.11958 | link |
2024-04-18 | Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans | Lixing Tan et.al. | 2404.11889 | null |
2024-04-18 | General non-linear fragmentation with discontinuous Galerkin methods | Maxime Lombart et.al. | 2404.11851 | null |
2024-04-18 | Aerodynamic Design and Performance Evaluation of Pipe Diffuser for Centrifugal Compressor of Micro Gas Turbine | Sujal Bhavsar et.al. | 2404.11828 | null |
2024-04-18 | Holographic Parallax Improves 3D Perceptual Realism | Dongyeon Kim et.al. | 2404.11810 | null |
2024-04-17 | TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation | Thomas Monninger et.al. | 2404.11803 | null |
2024-04-17 | 3D object quality prediction for Metal Jet Printer with Multimodal thermal encoder | Rachel et.al. | 2404.11776 | null |
2024-04-17 | Multimodal 3D Object Detection on Unseen Domains | Deepti Hegde et.al. | 2404.11764 | null |
2024-04-17 | Virtual Foundry Graphnet for Metal Sintering Deformation Prediction | Rachel et.al. | 2404.11753 | link |
2024-04-17 | Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection | Deepti Hegde et.al. | 2404.11737 | null |
2024-04-17 | Learning with 3D rotations, a hitchhiker’s guide to SO(3) | A. René Geist et.al. | 2404.11735 | link |
2024-04-17 | Unifying Scene Representation and Hand-Eye Calibration with 3D Foundation Models | Weiming Zhi et.al. | 2404.11683 | null |
2024-04-24 | Factorized Motion Fields for Fast Sparse Input Dynamic View Synthesis | Nagabhushan Somraj et.al. | 2404.11669 | null |
2024-04-17 | InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior | Zhiheng Liu et.al. | 2404.11613 | null |
2024-04-22 | IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination | Xi Chen et.al. | 2404.11593 | null |
2024-04-17 | A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion | Feng Yu et.al. | 2404.11590 | link |
2024-04-17 | Matern Correlation: A Panoramic Primer | Xiaoqing Chen et.al. | 2404.11427 | null |
2024-04-17 | Multi-layer continuous carbon fiber pattern optimization and a spline based path planning interpretation | Fabian Wein et.al. | 2404.11404 | null |
2024-04-17 | RainyScape: Unsupervised Rainy Scene Reconstruction using Decoupled Neural Rendering | Xianqiang Lyu et.al. | 2404.11401 | null |
2024-04-18 | DeblurGS: Gaussian Splatting for Camera Motion Blur | Jeongtaek Oh et.al. | 2404.11358 | null |
2024-04-17 | Best Practices for a Handwritten Text Recognition System | George Retsinas et.al. | 2404.11339 | link |
2024-04-17 | VBR: A Vision Benchmark in Rome | Leonardo Brizi et.al. | 2404.11322 | link |
2024-04-17 | Asymptotic, second-order homogenization of linear elastic beam networks | Yang Ye et.al. | 2404.11316 | null |
2024-04-17 | Novel View Synthesis for Cinematic Anatomy on Mobile and Immersive Displays | Simon Niedermayr et.al. | 2404.11285 | null |
2024-04-17 | MMCBE: Multi-modality Dataset for Crop Biomass Estimation and Beyond | Xuesong Li et.al. | 2404.11256 | link |
2024-04-17 | Revisiting Noise Resilience Strategies in Gesture Recognition: Short-Term Enhancement in Surface Electromyographic Signal Analysis | Weiyu Guo et.al. | 2404.11213 | null |
2024-04-17 | RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models | Han Huang et.al. | 2404.11199 | link |
2024-04-17 | Uniform Regularity for Incompressible MHD Equations in a Bounded Domain with Curved Boundary in 3D | Yingzhi Du et.al. | 2404.11197 | null |
2024-04-17 | Texture tomography, a versatile framework to study crystalline texture in 3D | M. P. K. Frewein et.al. | 2404.11195 | null |
2024-04-20 | Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform | Chunghyun Park et.al. | 2404.11156 | null |
2024-04-17 | REACTO: Reconstructing Articulated Objects from a Single Video | Chaoyue Song et.al. | 2404.11151 | null |
2024-04-17 | Ion acceleration from micrometric targets immersed in an intense laser field | Michal Elkind et.al. | 2404.11135 | null |
2024-04-17 | D-Aug: Enhancing Data Augmentation for Dynamic LiDAR Scenes | Jiaxing Zhao et.al. | 2404.11127 | null |
2024-04-17 | Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization | Yongdong Luo et.al. | 2404.11064 | link |
2024-04-17 | Imprints of the Local Bubble and Dust Complexity on Polarized Dust Emission | George Halal et.al. | 2404.11009 | null |
2024-04-17 | Machine-Learning-Enhanced Soft Robotic System Inspired by Rectal Functions for Investigating Fecal incontinence | Zebing Mao et.al. | 2404.10999 | null |
2024-04-17 | Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images | Junbiao Pang et.al. | 2404.10985 | null |
2024-04-17 | The Relationship Between Simulated Sub-Millimeter and Near-Infrared Images of Sagittarius A* from a Magnetically Arrested Black Hole Accretion Flow | Arpiar Avetis Grigorian et.al. | 2404.10982 | null |
2024-04-17 | Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection | Nawfal Guefrachi et.al. | 2404.10978 | null |
2024-04-16 | Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network | Yusra Alkendi et.al. | 2404.10940 | null |
2024-04-16 | Warp Factory: A Numerical Toolkit for the Analysis and Optimization of Warp Drive Geometries | Christopher Helmerich et.al. | 2404.10855 | null |
2024-04-16 | The first degree-scale starlight-polarization-based tomography map of the magnetized interstellar medium | V. Pelgrims et.al. | 2404.10821 | null |
2024-04-14 | Constraining on the non-standard cosmological models combining the observations of high-redshift quasars and BAO | Ziqiang Liu et.al. | 2404.10794 | null |
2024-04-16 | Gaussian Opacity Fields: Efficient and Compact Surface Reconstruction in Unbounded Scenes | Zehao Yu et.al. | 2404.10772 | null |
2024-04-16 | RapidVol: Rapid Reconstruction of 3D Ultrasound Volumes from Sensorless 2D Scans | Mark C. Eid et.al. | 2404.10766 | null |
2024-04-16 | RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting | Ashkan Mirzaei et.al. | 2404.10765 | null |
2024-04-16 | Maser Flares Driven by Isothermal Shock Waves | M. D. Gray et.al. | 2404.10741 | null |
2024-04-16 | Attention-Aware Visualization: Tracking and Responding to User Perception Over Time | Arvind Srinivasan et.al. | 2404.10732 | null |
2024-04-16 | A Plausibility Study of Using Augmented Reality in the Ventriculoperitoneal Shunt Operations | Tandin Dorji et.al. | 2404.10713 | null |
2024-04-16 | ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation | Iaroslav Melekhov et.al. | 2404.10699 | link |
2024-04-16 | Swarm-Based Trajectory Generation and Optimization for Stress-Aligned 3D Printing | Xavier Guidetti et.al. | 2404.10686 | null |
2024-04-16 | StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization | Yingshu Chen et.al. | 2404.10681 | null |
2024-04-16 | Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2404.10625 | null |
2024-04-16 | PyTorchGeoNodes: Enabling Differentiable Shape Programs for 3D Shape Reconstruction | Sinisa Stekovic et.al. | 2404.10620 | null |
2024-04-16 | Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences | Seungwook Kim et.al. | 2404.10603 | null |
2024-04-16 | Classification of Prostate Cancer in 3D Magnetic Resonance Imaging Data based on Convolutional Neural Networks | Malte Rippa et.al. | 2404.10548 | null |
2024-04-16 | Measuring bipartite spin correlations of lattice-trapped dipolar atoms | Youssef Aziz Alaoui et.al. | 2404.10531 | null |
2024-04-16 | SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments | Niklas Gard et.al. | 2404.10527 | link |
2024-04-16 | Variabilities in the polar field and solar cycle due to irregular properties of Bipolar Magnetic Regions | Pawan Kumar et.al. | 2404.10526 | null |
2024-04-16 | Restoring Connectivity in Vascular Segmentation using a Learned Post-Processing Model | Sophie Carneiro-Esteves et.al. | 2404.10506 | link |
2024-04-16 | Teaching Chinese Sign Language with Feedback in Mixed Reality | Hongli Wen et.al. | 2404.10490 | null |
2024-04-16 | AbsGS: Recovering Fine Details for 3D Gaussian Splatting | Zongxin Ye et.al. | 2404.10484 | null |
2024-04-16 | In-depth analysis of solar models with high-metallicity abundances and updated opacity tables | G. Buldgen et.al. | 2404.10478 | null |
2024-04-17 | High-resolution atmospheric retrievals of WASP-76b transmission spectroscopy with ESPRESSO: Monitoring limb asymmetries across multiple transits | Cathal Maguire et.al. | 2404.10463 | null |
2024-04-16 | Revealing data leakage in protein interaction benchmarks | Anton Bushuiev et.al. | 2404.10457 | link |
2024-04-16 | Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior | Yiqian Wu et.al. | 2404.10394 | link |
2024-04-16 | Generating 6-D Trajectories for Omnidirectional Multirotor Aerial Vehicles in Cluttered Environments | Peiyan Liu et.al. | 2404.10392 | null |
2024-04-16 | 3DGen: AI-Assisted Generation of Provably Correct Binary Format Parsers | Sarah Fakhoury et.al. | 2404.10362 | null |
2024-04-16 | AERO: Adaptive Erase Operation for Improving Lifetime and Performance of Modern NAND Flash-Based SSDs | Sungjun Cho et.al. | 2404.10355 | null |
2024-04-16 | SRGS: Super-Resolution 3D Gaussian Splatting | Xiang Feng et.al. | 2404.10318 | link |
2024-04-16 | Non-perturbative correction to thermodynamics of conformally dressed 3D black hole | Saheb Soroushfar et.al. | 2404.10309 | null |
2024-04-16 | EucliDreamer: Fast and High-Quality Texturing for 3D Models with Depth-Conditioned Stable Diffusion | Cindy Le et.al. | 2404.10279 | null |
2024-04-16 | Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration | Jing Zeng et.al. | 2404.10218 | null |
2024-04-16 | GaitPoint+: A Gait Recognition Network Incorporating Point Cloud Analysis and Recycling | Huantao Ren et.al. | 2404.10213 | null |
2024-04-15 | CryoMAE: Few-Shot Cryo-EM Particle Picking with Masked Autoencoders | Chentianye Xu et.al. | 2404.10178 | null |
2024-04-23 | SegFormer3D: an Efficient Transformer for 3D Medical Image Segmentation | Shehan Perera et.al. | 2404.10156 | link |
2024-04-15 | Cross-Modal Self-Training: Aligning Images and Pointclouds to Learn Classification without Labels | Amaya Dharmasiri et.al. | 2404.10146 | link |
2024-04-17 | Shaping Realities: Enhancing 3D Generative AI with Fabrication Constraints | Faraz Faruqi et.al. | 2404.10142 | null |
2024-04-15 | WB LUTs: Contrastive Learning for White Balancing Lookup Tables | Sai Kumar Reddy Manne et.al. | 2404.10133 | link |
2024-04-15 | Multiple-Input Fourier Neural Operator (MIFNO) for source-dependent 3D elastodynamics | Fanny Lehmann et.al. | 2404.10115 | link |
2024-04-15 | Synchronous PIV measurements of a self-powered blood turbine and pump couple for right ventricle support | Kagan Ucak et.al. | 2404.10081 | null |
2024-04-15 | AIGeN: An Adversarial Approach for Instruction Generation in VLN | Niyati Rawal et.al. | 2404.10054 | null |
2024-04-27 | JT gravity from non-Abelian T-duality | Daniele Bielli et.al. | 2404.10041 | null |
2024-04-15 | Taming Latent Diffusion Model for Neural Radiance Field Inpainting | Chieh Hubert Lin et.al. | 2404.09995 | null |
2024-04-15 | Boosting Determinant Quantum Monte Carlo with Submatrix Updates: Unveiling the Phase Diagram of the 3D Hubbard Model | Fanjie Sun et.al. | 2404.09989 | null |
2024-04-15 | One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing | Yueyu Hu et.al. | 2404.09979 | null |
2024-04-15 | Reconstructing classes of 3D FRI signals from sampled tomographic projections at unknown angles | Renke Wang et.al. | 2404.09969 | null |
2024-04-15 | Zero-shot detection of buildings in mobile LiDAR using Language Vision Model | June Moh Goo et.al. | 2404.09931 | null |
2024-04-15 | Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Fangwei Zhong et.al. | 2404.09857 | null |
2024-04-15 | STMixer: A One-Stage Sparse Action Detector | Tao Wu et.al. | 2404.09842 | null |
2024-04-15 | 3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow | Felix Taubner et.al. | 2404.09819 | null |
2024-04-15 | A Universal Protocol to Benchmark Camera Calibration for Sports | Floriane Magera et.al. | 2404.09807 | null |
2024-04-16 | Calculating radio emissions of positive streamer phenomena using 3D simulations | Hemaditya Malla et.al. | 2404.09772 | null |
2024-04-17 | Transforming a Non-Differentiable Rasterizer into a Differentiable One with Stochastic Gradient Estimation | Thomas Deliot et.al. | 2404.09758 | null |
2024-04-15 | LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives | Jiadi Cui et.al. | 2404.09748 | null |
2024-04-26 | Active string fluids and gels formed by dipolar active Brownian particles in 3D | Maria Kelidou et.al. | 2404.09693 | null |
2024-04-18 | Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition | Tobias Weber et.al. | 2404.09683 | link |
2024-04-15 | Object Instance Retrieval in Assistive Robotics: Leveraging Fine-Tuned SimSiam with Multi-View Images Based on 3D Semantic Map | Taichi Sakaguchi et.al. | 2404.09647 | null |
2024-04-15 | Hot Jupiter Diversity and the Onset of TiO/VO Revealed by a Large Grid of Non-Grey Global Circulation Models | Alexander Roth et.al. | 2404.09626 | null |
2024-04-15 | DIDLM:A Comprehensive Multi-Sensor Dataset with Infrared Cameras, Depth Cameras, LiDAR, and 4D Millimeter-Wave Radar in Challenging Scenarios for 3D Mapping | WeiSheng Gong et.al. | 2404.09622 | null |
2024-04-15 | Efficient and accurate neural field reconstruction using resistive memory | Yifei Yu et.al. | 2404.09613 | null |
2024-04-15 | Well-Posedness for Quintic Energy Critical Wave in 3D Cylindrical Convex Domains | Meas Len et.al. | 2404.09611 | null |
2024-04-15 | The “C”: The large Chameleon-Musca-Coalsack cloud | Gordian Edenhofer et.al. | 2404.09592 | null |
2024-04-15 | 3D Gaussian Splatting as Markov Chain Monte Carlo | Shakiba Kheradmand et.al. | 2404.09591 | null |
2024-04-15 | nnU-Net Revisited: A Call for Rigorous Validation in 3D Medical Image Segmentation | Fabian Isensee et.al. | 2404.09556 | link |
2024-04-15 | Text-Driven Diverse Facial Texture Generation via Progressive Latent-Space Refinement | Chi Wang et.al. | 2404.09540 | null |
2024-04-15 | Oblique-MERF: Revisiting and Improving MERF for Oblique Photography | Xiaoyi Zeng et.al. | 2404.09531 | null |
2024-04-15 | SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction | Pin Tang et.al. | 2404.09502 | null |
2024-04-15 | Learning Human Motion from Monocular Videos via Cross-Modal Manifold Alignment | Shuaiying Hou et.al. | 2404.09499 | null |
2024-04-15 | On anomalous dimension in 3D ABJM model | A. V. Kotikov et.al. | 2404.09478 | null |
2024-04-15 | Virtually Enriched NYU Depth V2 Dataset for Monocular Depth Estimation: Do We Need Artificial Augmentation? | Dmitry Ignatov et.al. | 2404.09469 | link |
2024-04-15 | PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI | Yandan Yang et.al. | 2404.09465 | null |
2024-04-15 | CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting | Xiangrui Liu et.al. | 2404.09458 | null |
2024-04-15 | The 8th AI City Challenge | Shuo Wang et.al. | 2404.09432 | null |
2024-04-15 | VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection | Bonan Ding et.al. | 2404.09431 | null |
2024-04-15 | ViFu: Multiple 360 $^\circ$ Objects Reconstruction with Clean Background via Visible Part Fusion | Tianhan Xu et.al. | 2404.09426 | null |
2024-04-15 | Super-resolution of biomedical volumes with 2D supervision | Cheng Jiang et.al. | 2404.09425 | null |
2024-04-15 | DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading | Tong Wu et.al. | 2404.09412 | null |
2024-04-16 | Orientation-conditioned Facial Texture Mapping for Video-based Facial Remote Photoplethysmography Estimation | Sam Cantrill et.al. | 2404.09378 | null |
2024-04-14 | \textit{sweet} – An Open Source Modular Platform for Contactless Hand Vascular Biometric Experiments | David Geissbühler et.al. | 2404.09376 | null |
2024-04-24 | Exploring Feedback Generation in Automated Skeletal Movement Assessment: A Comprehensive Overview | Tal Hakim et.al. | 2404.09359 | null |
2024-04-14 | In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition | Wiktor Mucha et.al. | 2404.09308 | link |
2024-04-16 | A Simple Strategy for Body Estimation from Partial-View Images | Yafei Mao et.al. | 2404.09301 | null |
2024-04-14 | RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion | Kyle Shih-Huang Lo et.al. | 2404.09290 | link |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-04-14 | On the dynamical evolution of the asteroid belt in a massive star-neutron star binary | Chen Deng et.al. | 2404.09258 | null |
2024-04-14 | Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation in Operating Rooms | Diandian Guo et.al. | 2404.09231 | null |
2024-04-14 | DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling | Xuening Yuan et.al. | 2404.09227 | null |
2024-04-14 | Design and Fabrication of String-driven Origami Robots | Peiwen Yang et.al. | 2404.09222 | null |
2024-04-14 | Robust spin order and fragile charge order in Na0.5CoO2 as revealed by time-resolved terahertz spectroscopy | X. Y. Zhou et.al. | 2404.09185 | null |
2024-04-14 | Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration | Yanhao Zhang et.al. | 2404.09169 | link |
2024-04-23 | StreakNet-Arch: An Anti-scattering Network-based Architecture for Underwater Carrier LiDAR-Radar Imaging | Xuelong Li et.al. | 2404.09158 | link |
2024-04-22 | EGGS: Edge Guided Gaussian Splatting for Radiance Fields | Yuanhao Gong et.al. | 2404.09105 | null |
2024-04-13 | Probabilistic Directed Distance Fields for Ray-Based Shape Representations | Tristan Aumentado-Armstrong et.al. | 2404.09081 | null |
2024-04-13 | Numerical Aspects of Hyperbolic Geometry | Dorota Celinska-Kopczynska et.al. | 2404.09039 | null |
2024-04-13 | Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households | Zhihao Cao et.al. | 2404.09001 | null |
2024-04-13 | A Fourier-enhanced multi-modal 3D small object optical mark recognition and positioning method for percutaneous abdominal puncture surgical navigation | Zezhao Guo et.al. | 2404.08990 | null |
2024-04-16 | LoopGaussian: Creating 3D Cinemagraph with Multi-view Images via Eulerian Motion Field | Jiyang Li et.al. | 2404.08966 | link |
2024-04-13 | Global dynamics of Kato’s solutions for the 3D incompressible micropolar system | Zihao Song et.al. | 2404.08920 | null |
2024-04-13 | MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification | Binghua Li et.al. | 2404.08917 | link |
2024-04-12 | Single-image driven 3d viewpoint training data augmentation for effective wine label recognition | Yueh-Cheng Huang et.al. | 2404.08820 | null |
2024-04-12 | Plasma Dynamics and Nonthermal Particle Acceleration in 3D Nonrelativistic Magnetic Reconnection | Qile Zhang et.al. | 2404.08807 | null |
2024-04-12 | Eccentric binaries: Periastron events and tidal heating | Gloria Koenigsberger et.al. | 2404.08774 | null |
2024-04-12 | Extended Metal-Insulator Crossover with Strong Antiferromagnetic Spin Correlation in Half-Filled 3D Hubbard Model | Yu-Feng Song et.al. | 2404.08745 | null |
2024-04-12 | EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams | Christen Millerdurai et.al. | 2404.08640 | link |
2024-04-12 | Probing the 3D Awareness of Visual Foundation Models | Mohamed El Banani et.al. | 2404.08636 | link |
2024-04-16 | 3D Human Scan With A Moving Event Camera | Kai Kohyama et.al. | 2404.08504 | null |
2024-04-12 | A stable decoupled perfectly matched layer for the 3D wave equation using the nodal discontinuous Galerkin method | Sophia Julia Feriani et.al. | 2404.08464 | null |
2024-04-15 | OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering | Jingrui Ye et.al. | 2404.08449 | null |
2024-04-12 | Global Large Solution to Navier-Stokes and SQG Equations with Time Oscillation | Yiran Xu et.al. | 2404.08420 | null |
2024-04-15 | Direct May Not Be the Best: An Incremental Evolution View of Pose Generation | Yuelong Li et.al. | 2404.08419 | link |
2024-04-12 | Seismic First Break Picking in a Higher Dimension Using Deep Graph Learning | Hongtao Wang et.al. | 2404.08408 | null |
2024-04-12 | Wild solutions of the 3D axisymmetric Euler equations | Patrick Brkic et.al. | 2404.08407 | null |
2024-04-12 | No Bells, Just Whistles: Sports Field Registration by Leveraging Geometric Properties | Marc Gutiérrez-Pérez et.al. | 2404.08401 | link |
2024-04-12 | Estimation and Inference for Three-Dimensional Panel Data Models | Guohua Feng et.al. | 2404.08365 | null |
2024-04-12 | Let It Flow: Simultaneous Optimization of 3D Flow and Object Clustering | Patrik Vacek et.al. | 2404.08363 | link |
2024-04-12 | GPN: Generative Point-based NeRF | Haipeng Wang et.al. | 2404.08312 | link |
2024-04-12 | SNAKE-fMRI: A modular fMRI data simulator from the space-time domain to k-space and back | Pierre-Antoine Comby et.al. | 2404.08282 | link |
2024-04-12 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
2024-04-11 | An extension theorem for weak solutions of the 3d incompressible Euler equations and applications to singular flows | Alberto Enciso et.al. | 2404.08115 | null |
2024-04-11 | SimpliCity: Reconstructing Buildings with Simple Regularized 3D Models | Jean-Philippe Bauchet et.al. | 2404.08104 | null |
2024-04-11 | Can repeller dynamics explain dominant pebble axis ratios? | Balázs Havasi-Tóth et.al. | 2404.08097 | null |
2024-04-11 | Ephemeral Myographic Motion: Repurposing the Myo Armband to Control Disposable Pneumatic Sculptures | Celia Chen et.al. | 2404.08065 | null |
2024-04-11 | The Evolution of Binaries Embedded Within Common Envelopes | Alejandra Rosselli-Calderon et.al. | 2404.08037 | null |
2024-04-11 | Connecting NeRFs, Images, and Text | Francesco Ballerini et.al. | 2404.07993 | link |
2024-04-11 | GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh | Jing Wen et.al. | 2404.07991 | null |
2024-04-11 | Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding | Yiwen Tang et.al. | 2404.07989 | link |
2024-04-11 | View Selection for 3D Captioning via Diffusion Ranking | Tiange Luo et.al. | 2404.07984 | null |
2024-04-11 | Gaga: Group Any Gaussians via 3D-aware Memory Bank | Weijie Lyu et.al. | 2404.07977 | null |
2024-04-11 | Artificial Chemotaxis under Electrodiffusiophoresis | Carlos A. Silvera Batista et.al. | 2404.07874 | null |
2024-04-11 | Illposedness of incompressible fluids in supercritical Sobolev spaces | Xiaoyutao Luo et.al. | 2404.07813 | null |
2024-04-11 | Probing Three-Dimensional Magnetic Fields: III – Synchrotron Emission and Machine Learning | Yue Hu et.al. | 2404.07806 | null |
2024-04-11 | PRAM: Place Recognition Anywhere Model for Efficient Visual Localization | Fei Xue et.al. | 2404.07785 | null |
2024-04-11 | Active particles knead three-dimensional gels into open crumbs | Martin Cramer Pedersen et.al. | 2404.07767 | null |
2024-04-11 | 3D-CSAD: Untrained 3D Anomaly Detection for Complex Manufacturing Surfaces | Xuanming Cao et.al. | 2404.07748 | null |
2024-04-11 | Point cloud obstacle detection with the map filtration | Lukas Kratochvila et.al. | 2404.07730 | null |
2024-04-11 | OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities | Lasse H. Hansen et.al. | 2404.07711 | link |
2024-04-11 | Run-time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns | Hakan Yekta Yatbaz et.al. | 2404.07685 | null |
2024-04-11 | Shape Completion in the Dark: Completing Vertebrae Morphology from 3D Ultrasound | Miruna-Alexandra Gafencu et.al. | 2404.07668 | link |
2024-04-23 | 2DLIW-SLAM:2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure | Bin Zhang et.al. | 2404.07644 | link |
2024-04-11 | The morphology of cell spheroids in simple shear flow | Rosalia Ferraro et.al. | 2404.07528 | null |
2024-04-11 | PillarTrack: Redesigning Pillar-based Transformer Network for Single Object Tracking on Point Clouds | Weisheng Xu et.al. | 2404.07495 | link |
2024-04-11 | G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images | Zixiong Huang et.al. | 2404.07474 | link |
2024-04-10 | Synthetic Spectra from Particle-in-cell Simulations of Relativistic Jets containing an initial Toroidal Magnetic Field | Ioana Dutan et.al. | 2404.07392 | null |
2024-04-10 | sCWatter: Open source coupled wave scattering simulation for spectroscopy and microscopy | Ruijiao Sun et.al. | 2404.07293 | null |
2024-04-10 | Topological entropy of Turing complete dynamics | Renzo Bruera et.al. | 2404.07288 | null |
2024-04-10 | Magnetically Driven Turbulence in the Inner Regions of Protoplanetary Disks | David G. Rea et.al. | 2404.07265 | null |
2024-04-10 | RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion | Jaidev Shriram et.al. | 2404.07199 | null |
2024-04-10 | Laser driven melt pool resonances through dynamically oscillating energy inputs | Marco Rupp et.al. | 2404.07195 | null |
2024-04-10 | VN-EGNN: E(3)-Equivariant Graph Neural Networks with Virtual Nodes Enhance Protein Binding Site Identification | Florian Sestak et.al. | 2404.07194 | link |
2024-04-14 | InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models | Jiale Xu et.al. | 2404.07191 | link |
2024-04-10 | Measuring proximity to standard planes during fetal brain ultrasound scanning | Chiara Di Vece et.al. | 2404.07124 | null |
2024-04-11 | Driver Attention Tracking and Analysis | Dat Viet Thanh Nguyen et.al. | 2404.07122 | null |
2024-04-10 | 3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion | Yixuan Li et.al. | 2404.07106 | null |
2024-04-10 | Fabrication Tolerant Multi-Layer Integrated Photonic Topology Optimization | Michael J. Probst et.al. | 2404.07104 | null |
2024-04-10 | Learning Priors for Non Rigid SfM from Casual Videos | Yoni Kasten et.al. | 2404.07097 | null |
2024-04-10 | MoCap-to-Visual Domain Adaptation for Efficient Human Mesh Estimation from 2D Keypoints | Bedirhan Uguz et.al. | 2404.07094 | null |
2024-04-10 | Analytical Formula for Calculations of Armour Losses in Three-Core Power Cables | Marius Hatlo et.al. | 2404.06998 | null |
2024-04-10 | Gaussian-LIC: Photo-realistic LiDAR-Inertial-Camera SLAM with 3D Gaussian Splatting | Xiaolei Lang et.al. | 2404.06926 | null |
2024-04-10 | DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting | Shijie Zhou et.al. | 2404.06903 | null |
2024-04-10 | RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds | Remco Royen et.al. | 2404.06863 | null |
2024-04-19 | Monocular 3D lane detection for Autonomous Driving: Recent Achievements, Challenges, and Outlooks | Fulong Ma et.al. | 2404.06860 | null |
2024-04-10 | UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion | Junsheng Zhou et.al. | 2404.06851 | null |
2024-04-10 | O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation | Muer Tie et.al. | 2404.06836 | null |
2024-04-10 | SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection | Mathis Kruse et.al. | 2404.06832 | link |
2024-04-10 | Zero-shot Point Cloud Completion Via 2D Priors | Tianxin Huang et.al. | 2404.06814 | link |
2024-04-10 | Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior | Fan Lu et.al. | 2404.06780 | null |
2024-04-10 | MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views | Runfa Li et.al. | 2404.06753 | null |
2024-04-10 | Designing Fluid-Exuding Cartilage for Biomimetic Robots Mimicking Human Joint Lubrication Function | Akihiro Miki et.al. | 2404.06740 | null |
2024-04-10 | Bayesian NeRF: Quantifying Uncertainty with Volume Density in Neural Radiance Fields | Sibeak Lee et.al. | 2404.06727 | link |
2024-04-10 | Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data | Aakash Kumar et.al. | 2404.06715 | null |
2024-04-12 | SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera | Gaole Dai et.al. | 2404.06710 | null |
2024-04-10 | Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting | Hao Lu et.al. | 2404.06700 | link |
2024-04-10 | Binomial Self-compensation for Motion Error in Dynamic 3D Scanning | Geyou Zhang et.al. | 2404.06693 | link |
2024-04-10 | Fast and Accurate Relative Motion Tracking for Two Industrial Robots | Honglu He et.al. | 2404.06687 | null |
2024-04-09 | Res-U2Net: Untrained Deep Learning for Phase Retrieval and Image Reconstruction | Carlos Osorio Quero et.al. | 2404.06657 | null |
2024-04-09 | GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation | Mukul Khanna et.al. | 2404.06609 | link |
2024-04-09 | Chiral two-dimensional MoS2 by molecular functionalization as ultra-sensitive detectors for circularly polarized light | Ye Wang et.al. | 2404.06555 | null |
2024-04-10 | Reconstructing Hand-Held Objects in 3D | Jane Wu et.al. | 2404.06507 | null |
2024-04-10 | Flying with Photons: Rendering Novel Views of Propagating Light | Anagh Malik et.al. | 2404.06493 | null |
2024-04-09 | The Central Spanning Tree Problem | Enrique Fita Sanmartín et.al. | 2404.06447 | link |
2024-04-09 | QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding | Yash Mehan et.al. | 2404.06442 | null |
2024-04-09 | Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion | Fan Yang et.al. | 2404.06429 | link |
2024-04-09 | Helium Reionization from Empirical Quasar Luminosity Functions before and after JWST | Arghyadeep Basu et.al. | 2404.06409 | null |
2024-04-09 | DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird’s Eye View Segmentation with Occlusion Reasoning | Senthil Yogamani et.al. | 2404.06352 | null |
2024-04-09 | Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences | Axel Barroso-Laguna et.al. | 2404.06337 | link |
2024-04-09 | Compensating slice emittance growth in high brightness photoinjectors using sacrificial charge | W. H. Li et.al. | 2404.06312 | null |
2024-04-09 | Constraining the Coronal Properties of AB Dor in the Radio Regime | C. E. Brasseur et.al. | 2404.06304 | null |
2024-04-09 | Size selection of crack front defects: Multiple fracture-plane interactions and intrinsic lengthscales | Meng Wang et.al. | 2404.06289 | null |
2024-04-14 | 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis | Zhicheng Lu et.al. | 2404.06270 | null |
2024-04-09 | Playing to Vision Foundation Model’s Strengths in Stereo Matching | Chuang-Wei Liu et.al. | 2404.06261 | null |
2024-04-09 | Label-Efficient 3D Object Detection For Road-Side Units | Minh-Quan Dao et.al. | 2404.06256 | null |
2024-04-09 | GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields | Arnab Dey et.al. | 2404.06246 | null |
2024-04-09 | ActNetFormer: Transformer-ResNet Hybrid Method for Semi-Supervised Action Recognition in Videos | Sharana Dharshikgan Suresh Dass et.al. | 2404.06243 | link |
2024-04-09 | Enhanced Radar Perception via Multi-Task Learning: Towards Refined Data for Sensor Fusion Applications | Huawei Sun et.al. | 2404.06165 | null |
2024-04-09 | Efficient and Robust Point Cloud Registration via Heuristics-guided Parameter Search | Tianyu Huang et.al. | 2404.06155 | link |
2024-04-09 | HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields | Arnab Dey et.al. | 2404.06152 | null |
2024-04-09 | Gaussian Pancakes: Geometrically-Regularized 3D Gaussian Splatting for Realistic Endoscopic Reconstruction | Sierra Bonilla et.al. | 2404.06128 | link |
2024-04-09 | Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation | Mariella Dreissig et.al. | 2404.06124 | null |
2024-04-09 | DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation | Junkai Yan et.al. | 2404.06119 | link |
2024-04-09 | Revising Densification in Gaussian Splatting | Samuel Rota Bulò et.al. | 2404.06109 | null |
2024-04-09 | Estimating the lateral speed of a fast shock driven by a coronal mass ejection at the location of solar radio emissions | S. Normo et.al. | 2404.06102 | null |
2024-04-09 | Hash3D: Training-free Acceleration for 3D Generation | Xingyi Yang et.al. | 2404.06091 | link |
2024-04-09 | Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes | Tianchen Deng et.al. | 2404.06050 | null |
2024-04-09 | Object Dynamics Modeling with Hierarchical Point Cloud-based Representations | Chanho Kim et.al. | 2404.06044 | null |
2024-04-09 | Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data | Kai Luan et.al. | 2404.06012 | null |
2024-04-09 | Polynomial-time derivation of optimal k-tree topology from Markov networks | Fereshteh R. Dastjerdi et.al. | 2404.05991 | null |
2024-04-12 | EasyTrack: Efficient and Compact One-stream 3D Point Clouds Tracker | Baojie Fan et.al. | 2404.05960 | null |
2024-04-09 | 3D Branch Point Cloud Completion for Robotic Pruning in Apple Orchards | Tian Qiu et.al. | 2404.05953 | null |
2024-04-09 | LATUP-Net: A Lightweight 3D Attention U-Net with Parallel Convolutions for Brain Tumor Segmentation | Ebtihal J. Alwadee et.al. | 2404.05911 | null |
2024-04-08 | On the Fly Robotic-Assisted Medical Instrument Planning and Execution Using Mixed Reality | Letian Ai et.al. | 2404.05887 | null |
2024-04-06 | Deep Learning-Based Brain Image Segmentation for Automated Tumour Detection | Suman Sourabh et.al. | 2404.05763 | null |
2024-04-08 | Learning 3D-Aware GANs from Unposed Images with Template Feature Field | Xinya Chen et.al. | 2404.05705 | null |
2024-04-08 | SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation | Heyuan Li et.al. | 2404.05680 | null |
2024-04-08 | Normalizing Flows on the Product Space of SO(3) Manifolds for Probabilistic Human Pose Modeling | Olaf Dünkel et.al. | 2404.05675 | link |
2024-04-08 | Realization of a three-dimensional photonic higher-order topological insulator | Ziyao Wang et.al. | 2404.05649 | null |
2024-04-08 | 3D-COCO: extension of MS-COCO dataset for image detection and 3D reconstruction modules | Maxence Bideaux et.al. | 2404.05641 | null |
2024-04-08 | Learning a Category-level Object Pose Estimator without Pose Annotations | Fengrui Tian et.al. | 2404.05626 | null |
2024-04-08 | Learning Topology Uniformed Face Mesh by Volume Rendering for Multi-view Reconstruction | Yating Wang et.al. | 2404.05606 | null |
2024-04-08 | Horizontally and vertically polarized kink oscillations in curved solar coronal loops | Mingzhe Guo et.al. | 2404.05586 | null |
2024-04-08 | Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning | Mahsa Ehsanpour et.al. | 2404.05578 | null |
2024-04-08 | Differential reddening in 48 globular clusters: An end to the quest for the intracluster medium | E. Pancino et.al. | 2404.05548 | null |
2024-04-08 | ALMA Spectroscopy of Europa: A Search for Active Plumes | M. A. Cordiner et.al. | 2404.05525 | null |
2024-04-08 | 3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering | Qingyuan Zhou et.al. | 2404.05522 | link |
2024-04-08 | DepthMOT: Depth Cues Lead to a Strong Multi-Object Tracker | Jiapeng Wu et.al. | 2404.05518 | link |
2024-04-08 | Mapping finite-fault slip with spatial correlation between seismicity and point-source Coulomb failure stress change | Anthony Lomax et.al. | 2404.05437 | null |
2024-04-08 | Accretion Funnel Reconfiguration during an Outburst in a Young Stellar Object: EX Lupi | Koshvendra Singh et.al. | 2404.05420 | null |
2024-04-08 | Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks | Maksym Ivashechkin et.al. | 2404.05414 | null |
2024-04-08 | MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues | Xiahan Chen et.al. | 2404.05280 | null |
2024-04-08 | T3DRIS: Advancing Conformal RIS Design through In-depth Analysis of Mutual Coupling Effects | Placido Mursia et.al. | 2404.05261 | null |
2024-04-08 | Collision-Free Trajectory Optimization in Cluttered Environments with Sums-of-Squares Programming | Yulin Li et.al. | 2404.05242 | link |
2024-04-08 | Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation | Y. Wang et.al. | 2404.05236 | null |
2024-04-08 | StylizedGS: Controllable Stylization for 3D Gaussian Splatting | Dingxi Zhang et.al. | 2404.05220 | null |
2024-04-08 | Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning | Jaewoo Jeong et.al. | 2404.05218 | link |
2024-04-08 | Proximity-Induced Exchange Interaction: a New Pathway for Quantum Sensing using Spin Centers in Hexagonal Boron Nitride | Lingnan Shen et.al. | 2404.05208 | null |
2024-04-08 | Rendering-Enhanced Automatic Image-to-Point Cloud Registration for Roadside Scenes | Yu Sheng et.al. | 2404.05164 | null |
2024-04-08 | Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos | Fengrui Tian et.al. | 2404.05163 | link |
2024-04-09 | Better Monocular 3D Detectors with LiDAR from the Past | Yurong You et.al. | 2404.05139 | link |
2024-04-07 | Stop Stealing My Data: Sanitizing Stego Channels in 3D Printing Design Files | Aleksandr Dolgavin et.al. | 2404.05106 | null |
2024-04-14 | VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module | Ziyang Wang et.al. | 2404.05105 | link |
2024-04-07 | Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind | Chiara Plizzari et.al. | 2404.05072 | null |
2024-04-09 | Magic Boundaries of 3D Color Codes | Zijian Song et.al. | 2404.05033 | null |
2024-04-07 | FPL+: Filtered Pseudo Label-based Unsupervised Cross-Modality Adaptation for 3D Medical Image Segmentation | Jianghao Wu et.al. | 2404.04971 | link |
2024-04-07 | Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models | Weiwei Cao et.al. | 2404.04936 | null |
2024-04-07 | CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis | Gyeongjin Kang et.al. | 2404.04913 | null |
2024-04-07 | MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection | Hou-I Liu et.al. | 2404.04910 | link |
2024-04-07 | Dual-Camera Smooth Zoom on Mobile Phones | Renlong Wu et.al. | 2404.04908 | link |
2024-04-07 | A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals | Jiangnan Tang et.al. | 2404.04890 | link |
2024-04-13 | GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF | Butian Xiong et.al. | 2404.04880 | null |
2024-04-07 | CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data | Wei Fang et.al. | 2404.04878 | null |
2024-04-07 | HiLo: Detailed and Robust 3D Clothed Human Reconstruction with High-and Low-Frequency Information of Parametric Models | Yifan Yang et.al. | 2404.04876 | link |
2024-04-07 | Site-ordering/disordering-induced magnetic textures in a vdW ferromagnet by competing global and broken inversion-symmetry | Haoyan Zhang et.al. | 2404.04851 | null |
2024-04-07 | 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions | Weijia Li et.al. | 2404.04823 | link |
2024-04-07 | Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer | Hyeongjin Nam et.al. | 2404.04819 | link |
2024-04-07 | AlphaCrystal-II: Distance matrix based crystal structure prediction using deep learning | Yuqi Song et.al. | 2404.04810 | null |
2024-04-07 | Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving | Jinlong Li et.al. | 2404.04804 | null |
2024-04-07 | Fourier Transform-based Wavenumber Domain 3D Imaging in RIS-aided Communication Systems | Yixuan Huang et.al. | 2404.04783 | null |
2024-04-06 | A free boundary problem for an immersed filament in 3D Stokes flow | Laurel Ohm et.al. | 2404.04737 | null |
2024-04-06 | On Exploring PDE Modeling for Point Cloud Video Representation Learning | Zhuoxu Huang et.al. | 2404.04720 | link |
2024-04-06 | OmniColor: A Global Camera Pose Optimization Approach of LiDAR-360Camera Fusion for Colorizing Point Clouds | Bonan Liu et.al. | 2404.04693 | link |
2024-04-06 | Z-Splat: Z-Axis Gaussian Splatting for Camera-Sonar Fusion | Ziyuan Qu et.al. | 2404.04687 | link |
2024-04-06 | Collective charge excitations studied by electron energy-loss spectroscopy | Peter Abbamonte et.al. | 2404.04670 | null |
2024-04-06 | DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation | Duy-Tho Le et.al. | 2404.04629 | null |
2024-04-11 | Diffusion Time-step Curriculum for One Image to 3D Generation | Xuanyu Yi et.al. | 2404.04562 | link |
2024-04-09 | Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Prediction | Jingyi Pan et.al. | 2404.04561 | null |
2024-04-06 | A self-attention model for robust rigid slice-to-volume registration of functional MRI | Samah Khawaled et.al. | 2404.04546 | null |
2024-04-06 | DATENeRF: Depth-Aware Text-based Editing of NeRFs | Sara Rojas et.al. | 2404.04526 | null |
2024-04-06 | Irrational-window-filter projection method and application to quasiperiodic Schrödinger eigenproblems | Kai Jiang et.al. | 2404.04507 | null |
2024-04-06 | Galaxy 3D Shape Recovery using Mixture Density Network | Suk Yee Yong et.al. | 2404.04491 | link |
2024-04-05 | Magnetoroton in a two-dimensional Bose-Bose mixture | O. I. Utesov et.al. | 2404.04440 | null |
2024-04-05 | PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos | Yufei Zhang et.al. | 2404.04430 | null |
2024-04-09 | PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations | Yang Zheng et.al. | 2404.04421 | null |
2024-04-05 | A Ground Mobile Robot for Autonomous Terrestrial Laser Scanning-Based Field Phenotyping | Javier Rodriguez-Sanchez et.al. | 2404.04404 | null |
2024-04-05 | Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs | Junhao Chen et.al. | 2404.04363 | link |
2024-04-05 | SpatialTracker: Tracking Any 2D Pixels in 3D Space | Yuxi Xiao et.al. | 2404.04319 | link |
2024-04-05 | Dissipative Euler flows originating from circular vortex filaments | Francisco Gancedo et.al. | 2404.04250 | null |
2024-04-05 | Physical Property Understanding from Language-Embedded Feature Fields | Albert J. Zhai et.al. | 2404.04242 | null |
2024-04-05 | Robust Gaussian Splatting | François Darmon et.al. | 2404.04211 | null |
2024-04-05 | Emergent photons and fractionalized excitations in a quantum spin liquid | Bin Gao et.al. | 2404.04207 | null |
2024-04-05 | H3DFact: Heterogeneous 3D Integrated CIM for Factorization with Holographic Perceptual Representations | Zishen Wan et.al. | 2404.04173 | null |
2024-04-05 | Into the thick of it: ALMA 0.45 mm observations of HL Tau at 2 au resolution | Osmar M. Guerra-Alvarado et.al. | 2404.04164 | null |
2024-04-05 | Explorations in Precision Holography and Higher-derivative Supergravity | Robert J. Saskowski et.al. | 2404.04134 | null |
2024-04-05 | 3D Facial Expressions through Analysis-by-Neural-Synthesis | George Retsinas et.al. | 2404.04104 | null |
2024-04-05 | A first passage model of intravitreal drug delivery and residence time, in relation to ocular geometry, individual variability, and injection location | Patricia Lamirande et.al. | 2404.04086 | null |
2024-04-05 | No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation | Xiangyang Zhu et.al. | 2404.04050 | link |
2024-04-05 | InstructHumans: Editing Animated 3D Human Textures with Instructions | Jiayin Zhu et.al. | 2404.04037 | null |
2024-04-05 | MM-Gaussian: 3D Gaussian-based Multi-modal Fusion for Localization and Reconstruction in Unbounded Scenes | Chenyang Wu et.al. | 2404.04026 | null |
2024-04-05 | LightOctree: Lightweight 3D Spatially-Coherent Indoor Lighting Estimation | Xuecan Wang et.al. | 2404.03925 | null |
2024-04-05 | Under-Canopy Navigation using Aerial Lidar Maps | Lucas Carvalho de Lima et.al. | 2404.03911 | null |
2024-04-04 | PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model | Amrin Kareem et.al. | 2404.03836 | link |
2024-04-04 | Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer | Qinji Yu et.al. | 2404.03819 | null |
2024-04-04 | Test Time Training for Industrial Anomaly Segmentation | Alex Costanzino et.al. | 2404.03743 | null |
2024-04-04 | SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Zijie Wu et.al. | 2404.03736 | link |
2024-04-04 | Mitigating analytical variability in fMRI results with style transfer | Elodie Germani et.al. | 2404.03703 | null |
2024-04-04 | Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning | Rui Li et.al. | 2404.03658 | link |
2024-04-04 | MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation | Hanzhe Hu et.al. | 2404.03656 | null |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | The More You See in 2D, the More You Perceive in 3D | Xinyang Han et.al. | 2404.03652 | null |
2024-04-04 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views | Francis Engelmann et.al. | 2404.03650 | null |
2024-04-05 | WorDepth: Variational Language Prior for Monocular Depth Estimation | Ziyao Zeng et.al. | 2404.03635 | link |
2024-04-04 | Reference-Based 3D-Aware Image Editing with Triplane | Bahri Batuhan Bilecen et.al. | 2404.03632 | null |
2024-04-04 | Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting | Jeongmin Bae et.al. | 2404.03613 | null |
2024-04-04 | The influence of substantial intragranular orientation gradients on the micromechanical response of heavily-worked material | Karthik Shankar et.al. | 2404.03579 | null |
2024-04-04 | DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling | Haoran Li et.al. | 2404.03575 | null |
2024-04-04 | Terrain Point Cloud Inpainting via Signal Decomposition | Yizhou Xie et.al. | 2404.03572 | null |
2024-04-05 | High redshift LBGs from deep broadband imaging for future spectroscopic surveys | Vanina Ruhlmann-Kleider et.al. | 2404.03569 | null |
2024-04-04 | Towards Transcranial 3D Ultrasound Localization Microscopy of the Nonhuman Primate Brain | Paul Xing et.al. | 2404.03547 | null |
2024-04-04 | COMO: Compact Mapping and Odometry | Eric Dexheimer et.al. | 2404.03531 | null |
2024-04-04 | Wilson Loops and Random Matrices | Georg Bergner et.al. | 2404.03503 | null |
2024-04-04 | Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View | Andreea Dogaru et.al. | 2404.03421 | null |
2024-04-04 | 3D scaling laws and projection effects in The300-NIKA2 Sunyaev-Zeldovich Large Program Twin Samples | A. Paliwal et.al. | 2404.03376 | null |
2024-04-04 | VF-NeRF: Viewshed Fields for Rigid NeRF Registration | Leo Segre et.al. | 2404.03349 | null |
2024-04-04 | Significantly Enhanced Vacancy Diffusion in Mn-containing Alloys | Huaqing Guan et.al. | 2404.03339 | null |
2024-04-04 | 3D Growth and Remodeling Theory Supports the Hypothesis of Staphyloma Formation from Local Scleral Weakening under Normal Intraocular Pressure | Fabian A. Braeu et.al. | 2404.03330 | link |
2024-04-04 | $\texttt{globin}$ : A spectropolarimetric inversion code for the coupled inference of atomic line parameters | D. Vukadinović et.al. | 2404.03291 | null |
2024-04-04 | iSeg: Interactive 3D Segmentation via Interactive Attention | Itai Lang et.al. | 2404.03219 | null |
2024-04-08 | OmniGS: Omnidirectional Gaussian Splatting for Fast Radiance Field Reconstruction using Omnidirectional Images | Longwei Li et.al. | 2404.03202 | link |
2024-04-04 | CORP: A Multi-Modal Dataset for Campus-Oriented Roadside Perception Tasks | Beibei Wang et.al. | 2404.03191 | null |
2024-04-04 | BodyMAP – Jointly Predicting Body Mesh and 3D Applied Pressure Map for People in Bed | Abhishek Tandon et.al. | 2404.03183 | link |
2024-04-04 | MonoCD: Monocular 3D Object Detection with Complementary Depths | Longfei Yan et.al. | 2404.03181 | link |
2024-04-04 | A comparison between the deflection angles of massive and massless particles in the Shchwarzschild space-time and their consequences on black hole shadows | Sergio Mendoza et.al. | 2404.03174 | null |
2024-04-04 | HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud | Wencan Cheng et.al. | 2404.03159 | link |
2024-04-04 | Design and Evaluation of a Compact 3D End-effector Assistive Robot for Adaptive Arm Support | Sibo Yang et.al. | 2404.03149 | null |
2024-04-04 | A first extraction of the weak magnetism form factor and Fierz interference term from the $^{114}$In $\rightarrow$ $^{114}$ Sn Gamow-Teller transition | L. De Keukeleere et.al. | 2404.03140 | null |
2024-04-04 | GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis | Emmanouil Nikolakakis et.al. | 2404.03126 | null |
2024-04-10 | Analyzing Warp Drive Spacetimes with Warp Factory | Christopher Helmerich et.al. | 2404.03095 | link |
2024-04-03 | Behind the Veil: Enhanced Indoor 3D Scene Reconstruction with Occluded Surfaces Completion | Su Sun et.al. | 2404.03070 | null |
2024-04-07 | Linear Anchored Gaussian Mixture Model for Location and Width Computation of Objects in Thick Line Shape | Nafaa Nacereddine et.al. | 2404.03043 | null |
2024-04-03 | AWOL: Analysis WithOut synthesis using Language | Silvia Zuffi et.al. | 2404.03042 | null |
2024-04-03 | Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures | Yannick Kirchhoff et.al. | 2404.03010 | link |
2024-04-03 | Orbital obliquity of the young planet TOI-5398 b and the evolutionary history of the system | G. Mantovan et.al. | 2404.02969 | link |
2024-04-03 | Unraveling the Mn $L_3$-edge RIXS spectrum of lightly manganese doped Sr${3}$Ru${2}$O$_{7}$ | Wei-Yang Chen et.al. | 2404.02963 | null |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903 | link |
2024-04-03 | MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment | Duygu Ceylan et.al. | 2404.02899 | null |
2024-04-03 | I-Design: Personalized LLM Interior Designer | Ata Çelen et.al. | 2404.02838 | null |
2024-04-03 | GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation | Meher Niger et.al. | 2404.02813 | null |
2024-04-03 | GenN2N: Generative NeRF2NeRF Translation | Xiangyue Liu et.al. | 2404.02788 | link |
2024-04-03 | Unsupervised Occupancy Learning from Sparse Point Cloud | Amine Ouasfi et.al. | 2404.02759 | null |
2024-04-03 | Design2Cloth: 3D Cloth Generation from 2D Masks | Jiali Zheng et.al. | 2404.02686 | null |
2024-04-03 | 3DStyleGLIP: Part-Tailored Text-Guided 3D Neural Stylization | SeungJeh Chung et.al. | 2404.02634 | link |
2024-04-03 | Neural Radiance Fields with Torch Units | Bingnan Ni et.al. | 2404.02617 | null |
2024-04-03 | Weakly-Supervised 3D Scene Graph Generation via Visual-Linguistic Assisted Pseudo-labeling | Xu Wang et.al. | 2404.02527 | link |
2024-04-03 | HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Zhongyu Xia et.al. | 2404.02517 | link |
2024-04-03 | On-the-Go Tree Detection and Geometric Traits Estimation with Ground Mobile Robots in Fruit Tree Groves | Dimitrios Chatziparaschis et.al. | 2404.02516 | null |
2024-04-03 | Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition | Yisheng He et.al. | 2404.02514 | null |
2024-04-04 | DiffFit: Visually-Guided Differentiable Fitting of Molecule Structures to Cryo-EM Map | Deng Luo et.al. | 2404.02465 | link |
2024-04-03 | A fast cosine transformation accelerated method for predicting effective thermal conductivity | Changqing Ye et.al. | 2404.02433 | link |
2024-04-03 | High quality Fe1+yTe synthesized by chemical vapor deposition with conspicuous vortex flow | Lu Lv et.al. | 2404.02420 | null |
2024-04-03 | TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes | Cheng Zhao et.al. | 2404.02410 | null |
2024-04-03 | APC2Mesh: Bridging the gap from occluded building façades to full 3D models | Perpetual Hope Akwensi et.al. | 2404.02391 | null |
2024-04-03 | Imaging transformer for MRI denoising with the SNR unit training: enabling generalization across field-strengths, imaging contrasts, and anatomy | Hui Xue et.al. | 2404.02382 | null |
2024-04-02 | One Noise to Rule Them All: Multi-View Adversarial Attacks with Universal Perturbation | Mehmet Ergezer et.al. | 2404.02287 | link |
2024-04-02 | Neural network reconstruction of density and velocity fields from the 2MASS Redshift Survey | Robert Lilow et.al. | 2404.02278 | link |
2024-04-02 | On diffusion and transport acting on parameterized moving closed curves in space | Michal Benes et.al. | 2404.02260 | null |
2024-04-02 | A recipe for eccentricity and inclination damping for partial gap opening planets in 3D disks | Gabriele Pichierri et.al. | 2404.02247 | null |
2024-04-02 | Towards Robust 3D Pose Transfer with Adversarial Learning | Haoyu Chen et.al. | 2404.02242 | null |
2024-04-02 | Deep Neural Networks with 3D Point Clouds for Empirical Friction Measurements in Hydrodynamic Flood Models | Francisco Haces-Garcia et.al. | 2404.02234 | link |
2024-04-02 | Normal weak eigenstate thermalization | Patrycja Łydżba et.al. | 2404.02199 | null |
2024-04-02 | Black Hole-Disk Interactions in Magnetically Arrested Active Galactic Nuclei: General Relativistic Magnetohydrodynamic Simulations Using A Time-Dependent, Binary Metric | Sean M. Ressler et.al. | 2404.02193 | null |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-01 | Versatile Navigation under Partial Observability via Value-guided Diffusion Policy | Gengyu Zhang et.al. | 2404.02176 | null |
2024-03-28 | Non-Destructive, High-Resolution, Chemically Specific, 3D Nanostructure Characterization using Phase-Sensitive EUV Imaging Reflectometry | Michael Tanksalvala et.al. | 2404.02170 | null |
2024-04-02 | Segment Any 3D Object with Language | Seungjun Lee et.al. | 2404.02157 | null |
2024-04-02 | Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields | Joshua Ahn et.al. | 2404.02155 | null |
2024-04-02 | GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image | Chong Bao et.al. | 2404.02152 | null |
2024-04-02 | Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models | Zeyu Yang et.al. | 2404.02148 | link |
2024-04-02 | 3D Congealing: 3D-Aware Image Alignment in the Wild | Yunzhi Zhang et.al. | 2404.02125 | null |
2024-04-02 | Neural Ordinary Differential Equation based Sequential Image Registration for Dynamic Characterization | Yifan Wu et.al. | 2404.02106 | null |
2024-04-02 | SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation | Vinkle Srivastav et.al. | 2404.02041 | link |
2024-04-02 | Ram pressure stripping in clusters: Gravity can bind the ISM but not the CGM | Ritali Ghosh et.al. | 2404.02035 | link |
2024-04-02 | A discussion about violin reduction: geometric analysis of contour lines and channel of minima | Philémon Beghin et.al. | 2404.01995 | null |
2024-04-02 | Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation | Zihan Wang et.al. | 2404.01943 | link |
2024-04-08 | LPSNet: End-to-End Human Pose and Shape Estimation with Lensless Imaging | Haoyang Ge et.al. | 2404.01941 | null |
2024-04-04 | 3D scene generation from scene graphs and self-attention | Pietro Bonazzi et.al. | 2404.01887 | link |
2024-04-02 | Multidimensional deconvolution with shared bases | Daria Sushnikova et.al. | 2404.01870 | null |
2024-04-02 | Identification and characterization of three-dimensional crack propagation mechanism in the Aluminium alloy AA2024-T3 using high-resolution Digital Image Correlation | Vanessa Schöne et.al. | 2404.01852 | null |
2024-04-07 | Sketch3D: Style-Consistent Guidance for Sketch-to-3D Generation | Wangguandong Zheng et.al. | 2404.01843 | null |
2024-04-02 | Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions | Saptarshi Dasgupta et.al. | 2404.01812 | null |
2024-04-02 | Surface Reconstruction from Gaussian Splatting via Novel Stereo Views | Yaniv Wolf et.al. | 2404.01810 | link |
2024-04-02 | Diagnostics of 3D explosion asymmetries of stripped-envelope supernovae by nebular line profiles | Bart van Baal et.al. | 2404.01763 | null |
2024-04-02 | Contextual Embedding Learning to Enhance 2D Networks for Volumetric Image Segmentation | Zhuoyuan Wang et.al. | 2404.01723 | link |
2024-04-02 | HeMeNet: Heterogeneous Multichannel Equivariant Network for Protein Multitask Learning | Rong Han et.al. | 2404.01693 | null |
2024-04-02 | JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments | Duy-Tho Le et.al. | 2404.01686 | null |
2024-04-05 | FashionEngine: Interactive Generation and Editing of 3D Clothed Humans | Tao Hu et.al. | 2404.01655 | null |
2024-04-02 | Efficient Computation of Mean field Control based Barycenters from Reaction-Diffusion Systems | Arjun Vijaywargiya et.al. | 2404.01586 | link |
2024-04-02 | Learning Temporal Cues by Predicting Objects Move for Multi-camera 3D Object Detection | Seokha Moon et.al. | 2404.01580 | null |
2024-04-02 | Leveraging Digital Perceptual Technologies for Remote Perception and Analysis of Human Biomechanical Processes: A Contactless Approach for Workload and Joint Force Assessment | Jesudara Omidokun et.al. | 2404.01576 | null |
2024-04-02 | Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes | Ziqian Bai et.al. | 2404.01543 | null |
2024-04-01 | SUGAR: Pre-training 3D Visual Representations for Robotics | Shizhe Chen et.al. | 2404.01491 | null |
2024-04-01 | Approach and rotation of reconnecting topological defect lines in liquid crystal | Yohei Zushi et.al. | 2404.01480 | null |
2024-04-01 | Hybrid GRMHD and Force-Free Simulations of Black Hole Accretion | Andrew Chael et.al. | 2404.01471 | null |
2024-04-01 | Data-Efficient Unsupervised Interpolation Without Any Intermediate Frame for 4D Medical Images | JungEun Kim et.al. | 2404.01464 | link |
2024-04-01 | Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects | Yijia Weng et.al. | 2404.01440 | link |
2024-04-01 | DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery | Yixuan Zhu et.al. | 2404.01424 | link |
2024-04-01 | Electronic structure and thermoelectric properties of epitaxial Sc1-xVxNy thin films grown on MgO(001) | Susmita Chowdhury et.al. | 2404.01417 | null |
2024-04-01 | ContactHandover: Contact-Guided Robot-to-Human Object Handover | Zixi Wang et.al. | 2404.01402 | null |
2024-04-01 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400 | null |
2024-04-01 | Galaxy shapes in Magneticum. I. Connecting stellar and dark matter shapes to dynamical and morphological galaxy properties and the large-scale structure | Lucas M. Valenzuela et.al. | 2404.01368 | null |
2024-03-30 | Generative AI for Architectural Design: A Literature Review | Chengyuan Li et.al. | 2404.01335 | null |
2024-04-01 | NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields | Muhammad Zubair Irshad et.al. | 2404.01300 | link |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-01 | Evaluating Text-to-Visual Generation with Image-to-Text Generation | Zhiqiu Lin et.al. | 2404.01291 | link |
2024-04-01 | Scalable Scene Modeling from Perspective Imaging: Physics-based Appearance and Geometry Inference | Shuang Song et.al. | 2404.01248 | null |
2024-04-02 | StructLDM: Structured Latent Diffusion for 3D Human Generation | Tao Hu et.al. | 2404.01241 | null |
2024-04-01 | FPGA-Accelerated Correspondence-free Point Cloud Registration with PointNet Features | Keisuke Sugiura et.al. | 2404.01237 | null |
2024-04-01 | Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing | Ri-Zhao Qiu et.al. | 2404.01223 | null |
2024-04-01 | Robust Trajectory and Resource Optimization for Communication-assisted UAV SAR Sensing | Mohamed-Amine Lahmeri et.al. | 2404.01195 | null |
2024-04-01 | Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting | Jiarui Meng et.al. | 2404.01168 | null |
2024-04-07 | CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians | Yang Liu et.al. | 2404.01133 | link |
2024-04-04 | Few-shot point cloud reconstruction and denoising via learned Guassian splats renderings and fine-tuned diffusion features | Pietro Bonazzi et.al. | 2404.01112 | null |
2024-04-01 | T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation | Jing Hao et.al. | 2404.01065 | link |
2024-04-04 | Roadside Monocular 3D Detection via 2D Detection Prompting | Yechi Ma et.al. | 2404.01064 | null |
2024-04-01 | Chat Modeling: Natural Language-based Procedural Modeling of Biological Structures without Training | Donggang Jia et.al. | 2404.01063 | null |
2024-04-01 | FlexiDreamer: Single Image-to-3D Generation with FlexiCubes | Ruowen Zhao et.al. | 2404.00987 | link |
2024-04-01 | PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation | Jinfeng Xu et.al. | 2404.00979 | link |
2024-04-01 | Diffusion-Driven Domain Adaptation for Generating 3D Molecules | Haokai Hong et.al. | 2404.00962 | null |
2024-04-01 | Orchestrating UAVs for Prioritized Data Harvesting: A Cross-Layer Optimization Perspective | Bharath Keshavamurthy et.al. | 2404.00961 | null |
2024-04-01 | Equivariant Local Reference Frames for Unsupervised Non-rigid Point Cloud Shape Correspondence | Ling Wang et.al. | 2404.00959 | null |
2024-04-01 | GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields | Yunsong Wang et.al. | 2404.00931 | link |
2024-04-01 | MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements | Lisong C. Sun et.al. | 2404.00923 | null |
2024-04-09 | Scalable 3D Registration via Truncated Entry-wise Absolute Residuals | Tianyu Huang et.al. | 2404.00915 | link |
2024-04-01 | Marrying NeRF with Feature Matching for One-step Pose Estimation | Ronghan Chen et.al. | 2404.00891 | null |
2024-04-02 | DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly | Fenggen Yu et.al. | 2404.00875 | null |
2024-04-01 | DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF | Jie Long Lee et.al. | 2404.00874 | link |
2024-04-01 | Meta Episodic learning with Dynamic Task Sampling for CLIP-based Point Cloud Classification | Shuvozit Ghose et.al. | 2404.00857 | null |
2024-04-01 | Transfer Learning with Point Transformers | Kartik Gupta et.al. | 2404.00846 | null |
2024-03-31 | Towards Realistic Scene Generation with LiDAR Diffusion Models | Haoxi Ran et.al. | 2404.00815 | link |
2024-03-31 | Off-the-grid regularisation for Poisson inverse problems | Marta Lazzaretti et.al. | 2404.00810 | link |
2024-03-31 | Disentangling Hippocampal Shape Variations: A Study of Neurological Disorders Using Graph Variational Autoencoder with Contrastive Learning | Jakaria Rabbi et.al. | 2404.00785 | link |
2024-03-31 | An Active Perception Game for Robust Autonomous Exploration | Siming He et.al. | 2404.00769 | null |
2024-03-31 | Intensity-based 3D motion correction for cardiac MR images | Nil Stolt-Ansó et.al. | 2404.00767 | link |
2024-03-31 | Neural Radiance Field-based Visual Rendering: A Comprehensive Review | Mingyuan Yao et.al. | 2404.00714 | null |
2024-03-31 | The biharmonic optimal support problem | Antoine Lemenant et.al. | 2404.00689 | null |
2024-03-31 | Weak-to-Strong 3D Object Detection with X-Ray Distillation | Alexander Gambashidze et.al. | 2404.00679 | link |
2024-04-07 | Knowledge NeRF: Few-shot Novel View Synthesis for Dynamic Articulated Objects | Wenxiao Cai et.al. | 2404.00674 | link |
2024-04-02 | KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation | Jihua Peng et.al. | 2404.00658 | link |
2024-04-02 | Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation | Taekyung Ki et.al. | 2404.00636 | null |
2024-03-31 | LAESI: Leaf Area Estimation with Synthetic Imagery | Jacek Kałużny et.al. | 2404.00593 | null |
2024-03-31 | M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models | Fan Bai et.al. | 2404.00578 | link |
2024-04-02 | Text2HOI: Text-guided 3D Motion Generation for Hand-Object Interaction | Junuk Cha et.al. | 2404.00562 | link |
2024-03-31 | Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches | Lingxuan Wu et.al. | 2404.00540 | null |
2024-04-02 | Functional Bethe Ansatz for a $\sinh$-Gordon model with real $q$ | Sergey Sergeev et.al. | 2404.00503 | null |
2024-03-30 | DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans | Akash Sengupta et.al. | 2404.00485 | null |
2024-03-30 | SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs | Yang Miao et.al. | 2404.00469 | null |
2024-03-30 | Multiway Point Cloud Mosaicking with Diffusion and Global Optimization | Shengze Jin et.al. | 2404.00429 | null |
2024-03-30 | 3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting | Xiaoyang Lyu et.al. | 2404.00409 | null |
2024-03-30 | Towards Variable and Coordinated Holistic Co-Speech Motion Generation | Yifei Liu et.al. | 2404.00368 | null |
2024-03-30 | Accurate Cutting-point Estimation for Robotic Lychee Harvesting through Geometry-aware Learning | Gengming Zhang et.al. | 2404.00364 | null |
2024-03-30 | MaGRITTe: Manipulative and Generative 3D Realization from Image, Topview and Text | Takayuki Hara et.al. | 2404.00345 | null |
2024-03-30 | YNetr: Dual-Encoder architecture on Plain Scan Liver Tumors (PSLT) | Wen Sheng et.al. | 2404.00327 | null |
2024-03-30 | Exploring Unseen Environments with Robots using Large Language and Vision Models through a Procedurally Generated 3D Scene Representation | Arjun P S et.al. | 2404.00318 | null |
2024-03-30 | Monocular Identity-Conditioned Facial Reflectance Reconstruction | Xingyu Ren et.al. | 2404.00301 | null |
2024-04-02 | HOI-M3:Capture Multiple Humans and Objects Interaction within Contextual Environment | Juze Zhang et.al. | 2404.00299 | null |
2024-03-30 | IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images | Yushuang Wu et.al. | 2404.00269 | null |
2024-03-30 | Clustering for Protein Representation Learning | Ruijie Quan et.al. | 2404.00254 | link |
2024-03-30 | Grid Diffusion Models for Text-to-Video Generation | Taegyeong Lee et.al. | 2404.00234 | null |
2024-03-29 | Universal Bovine Identification via Depth Data and Deep Metric Learning | Asheesh Sharma et.al. | 2404.00172 | null |
2024-03-29 | VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection | Zihua Liu et.al. | 2404.00149 | link |
2024-03-29 | FetalDiffusion: Pose-Controllable 3D Fetal MRI Synthesis with Conditional Diffusion Model | Molin Zhang et.al. | 2404.00132 | null |
2024-03-29 | Non-thermal broadening of coronal lines in a 3D MHD loop model | C. A. Breu et.al. | 2404.00127 | null |
2024-03-29 | Accurate PRD modeling of the forward-scattering Hanle effect in the chromospheric CaI 4227 Å line | Luca Belluzzi et.al. | 2404.00104 | null |
2024-03-29 | Sparse Views, Near Light: A Practical Paradigm for Uncalibrated Point-light Photometric Stereo | Mohammed Brahimi et.al. | 2404.00098 | null |
2024-03-26 | Choreographing the Digital Canvas: A Machine Learning Approach to Artistic Performance | Siyuan Peng et.al. | 2404.00054 | null |
2024-03-29 | SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects | Abhinav Kumar et.al. | 2403.20318 | link |
2024-03-29 | InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds | Zhiwen Fan et.al. | 2403.20309 | link |
2024-03-29 | Snap-it, Tap-it, Splat-it: Tactile-Informed 3D Gaussian Splatting for Reconstructing Challenging Surfaces | Mauro Comi et.al. | 2403.20275 | null |
2024-03-29 | Sketch-to-Architecture: Generative AI-aided Architectural Design | Pengzhi Li et.al. | 2403.20186 | null |
2024-03-29 | HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes | Ke Wu et.al. | 2403.20159 | null |
2024-03-29 | Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior | Jaehoon Ko et.al. | 2403.20153 | link |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising | Tianchen Deng et.al. | 2403.20034 | link |
2024-03-29 | HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Zhuopeng Li et.al. | 2403.20032 | null |
2024-03-29 | A Unified Framework for Human-centric Point Cloud Video Understanding | Yiteng Xu et.al. | 2403.20031 | null |
2024-03-29 | SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image | Yunhao Li et.al. | 2403.20018 | link |
2024-03-29 | DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal | Yunhao Li et.al. | 2403.20013 | link |
2024-04-06 | Grounding and Enhancing Grid-based Models for Neural Fields | Zelin Zhao et.al. | 2403.20002 | null |
2024-03-29 | 3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization | Yafeng Chen et.al. | 2403.19971 | link |
2024-03-29 | Synthesis and Characterization of Superparamagnetic Iron Oxide Nanoparticles: A Series of Laboratory Experiments | Armando D. Urbina et.al. | 2403.19970 | null |
2024-03-29 | A Semiparametric Gaussian Mixture Model for Chest CT-based 3D Blood Vessel Reconstruction | Qianhan Zeng et.al. | 2403.19929 | link |
2024-03-29 | SceneTracker: Long-term Scene Flow Estimation Network | Bo Wang et.al. | 2403.19924 | link |
2024-03-29 | Diff-Reg v1: Diffusion Matching Model for Registration Problem | Qianliang Wu et.al. | 2403.19919 | link |
2024-03-29 | Automated Identification and Segmentation of Hi Sources in CRAFTS Using Deep Learning Method | Zihao Song et.al. | 2403.19912 | link |
2024-03-29 | Fully Geometric Panoramic Localization | Junho Kim et.al. | 2403.19904 | null |
2024-03-28 | Localization and Offline Mapping of High-Voltage Substations in Rough Terrain Using a Ground Vehicle | Ioannis Alamanos et.al. | 2403.19875 | link |
2024-04-02 | Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization | Yuhang Li et.al. | 2403.19866 | null |
2024-03-28 | An Ultra-high-speed Reproducing Kernel Particle Method | Siavash Jafarzadeh et.al. | 2403.19854 | null |
2024-04-01 | Efficient 3D Instance Mapping and Localization with Neural Fields | George Tang et.al. | 2403.19797 | null |
2024-04-04 | ShapeFusion: A 3D diffusion model for localized shape editing | Rolandos Alexandros Potamias et.al. | 2403.19773 | null |
2024-03-28 | Bayesian Multi-line Intensity Mapping | Yun-Ting Cheng et.al. | 2403.19740 | null |
2024-03-28 | Towards Reverse-Engineering the Brain: Brain-Derived Neuromorphic Computing Approach with Photonic, Electronic, and Ionic Dynamicity in 3D integrated circuits | S. J. Ben Yoo et.al. | 2403.19724 | null |
2024-04-05 | GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling | Bowen Zhang et.al. | 2403.19655 | null |
2024-03-28 | InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Sirui Xu et.al. | 2403.19652 | null |
2024-03-28 | GraspXL: Generating Grasping Motions for Diverse Objects at Scale | Hui Zhang et.al. | 2403.19649 | null |
2024-03-28 | GauStudio: A Modular Framework for 3D Gaussian Splatting and Beyond | Chongjie Ye et.al. | 2403.19632 | link |
2024-03-28 | SA-GS: Scale-Adaptive Gaussian Splatting for Training-Free Anti-Aliasing | Xiaowei Song et.al. | 2403.19615 | link |
2024-04-04 | ILPO-NET: Network for the invariant recognition of arbitrary volumetric patterns in 3D | Dmitrii Zhemchuzhnikov et.al. | 2403.19612 | null |
2024-03-28 | SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects | Avinash Ummadisingu et.al. | 2403.19607 | null |
2024-03-28 | TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes | Bu Jin et.al. | 2403.19589 | link |
2024-03-28 | OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation | Zhenyu Wang et.al. | 2403.19580 | link |
2024-03-28 | CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians | Avinash Paliwal et.al. | 2403.19495 | link |
2024-03-28 | SG-PGM: Partial Graph Matching Network with Semantic Geometric Fusion for 3D Scene Graph Alignment and Its Downstream Tasks | Yaxu Xie et.al. | 2403.19474 | link |
2024-03-28 | Beyond Talking – Generating Holistic 3D Human Dyadic Motion for Communication | Mingze Sun et.al. | 2403.19467 | null |
2024-04-01 | BAMM: Bidirectional Autoregressive Motion Model | Ekkasit Pinyoanuntapong et.al. | 2403.19435 | link |
2024-03-28 | Brain-Shift: Unsupervised Pseudo-Healthy Brain Synthesis for Novel Biomarker Extraction in Chronic Subdural Hematoma | Baris Imre et.al. | 2403.19415 | link |
2024-03-28 | Cell Electropermeabilization Modeling via Multiple Traces Formulation and Time Semi-Implicit Coupling | Isabel A. Martínez Ávila et.al. | 2403.19371 | null |
2024-03-28 | Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation | Yujin Chen et.al. | 2403.19319 | null |
2024-03-30 | Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction | Xiaoyang Lyu et.al. | 2403.19314 | link |
2024-03-28 | Titanium abundances in late-type stars, II. Grid of departure coefficients and application to a sample of $70\,000$ stars | J. W. E. Mallinson et.al. | 2403.19304 | null |
2024-03-28 | Neural Fields for 3D Tracking of Anatomy and Surgical Instruments in Monocular Laparoscopic Video Clips | Beerend G. A. Gerats et.al. | 2403.19265 | null |
2024-03-28 | Sine Activated Low-Rank Matrices for Parameter Efficient Learning | Yiping Ji et.al. | 2403.19243 | null |
2024-03-28 | GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds | Shengjun Zhang et.al. | 2403.19220 | link |
2024-03-28 | Convolutional network learning of self-consistent electron density via grid-projected atomic fingerprints | Ryong-Gyu Lee et.al. | 2403.19214 | null |
2024-03-28 | The Lorentz force at work: multi-phase magnetohydrodynamics throughout a flare lifespan | Wenzhi Ruan et.al. | 2403.19204 | null |
2024-03-28 | Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence | Yutong Chen et.al. | 2403.19160 | null |
2024-03-28 | Compression and acceleration of ions by ultra-short ultra-intense azimuthally-polarized light | Da-Chao Deng et.al. | 2403.19133 | null |
2024-03-28 | CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation | Lingjun Zhao et.al. | 2403.19104 | null |
2024-03-28 | Automatic Fingerpad Customization for Precise and Stable Grasping of 3D-Print Parts | Joyce Xin-Yan Lim et.al. | 2403.19102 | null |
2024-04-02 | MMCert: Provable Defense against Adversarial Attacks to Multi-modal Models | Yanting Wang et.al. | 2403.19080 | link |
2024-03-28 | Dataflow-Aware PIM-Enabled Manycore Architecture for Deep Learning Workloads | Harsh Sharma et.al. | 2403.19073 | null |
2024-03-27 | Low-Complexity Estimation Algorithm and Decoupling Scheme for FRaC System | Mengjiang Sun et.al. | 2403.19044 | null |
2024-03-27 | Tessellation and interactive visualization of four-dimensional spacetime geometries | Philip Claude Caplan et.al. | 2403.19036 | null |
2024-04-01 | WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects under Occlusion | Khiem Vuong et.al. | 2403.19022 | null |
2024-03-30 | Cross-domain Fiber Cluster Shape Analysis for Language Performance Cognitive Score Prediction | Yui Lo et.al. | 2403.19001 | null |
2024-03-27 | Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning | Soumyendu Sarkar et.al. | 2403.18985 | null |
2024-03-27 | Supernova Simulations | Bernhard Müller et.al. | 2403.18952 | null |
2024-03-27 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D | Mukund Varma T et.al. | 2403.18922 | null |
2024-03-27 | Sliced Online Model Checking for Optimizing the Beam Scheduling Problem in Robotic Radiation Therapy | Lars Beckers et.al. | 2403.18918 | null |
2024-03-27 | UniDepth: Universal Monocular Metric Depth Estimation | Luigi Piccinelli et.al. | 2403.18913 | link |
2024-03-27 | Towards a Cost-Benefit Analysis of Additive Manufacturing as a Service | Igor Ivkić et.al. | 2403.18882 | null |
2024-03-26 | Predicting risk of cardiovascular disease using retinal OCT imaging | Cynthia Maldonado-Garcia et.al. | 2403.18873 | link |
2024-03-27 | Garment3DGen: 3D Garment Stylization and Texture Generation | Nikolaos Sarafianos et.al. | 2403.18816 | null |
2024-03-27 | Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment | Li Siyao et.al. | 2403.18811 | null |
2024-03-27 | SolderlessPCB: Reusing Electronic Components in PCB Prototyping through Detachable 3D Printed Housings | Zeyu Yan et.al. | 2403.18797 | null |
2024-03-29 | Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | Qiuhong Shen et.al. | 2403.18795 | link |
2024-03-27 | Object Pose Estimation via the Aggregation of Diffusion Features | Tianfu Wang et.al. | 2403.18791 | link |
2024-03-29 | SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface | Jiahao Luo et.al. | 2403.18784 | null |
2024-03-27 | Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction | Yiyao Zhang et.al. | 2403.18776 | link |
2024-03-27 | ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition | Weidong Xie et.al. | 2403.18762 | link |
2024-03-27 | MATTopo: Topology-preserving Medial Axis Transform with Restricted Power Diagram | Ningna Wang et.al. | 2403.18761 | null |
2024-03-27 | A vascular synthetic model for improved aneurysm segmentation and detection via Deep Neural Networks | Rafic Nader et.al. | 2403.18734 | null |
2024-03-27 | SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery | Camille Billouard et.al. | 2403.18711 | link |
2024-03-27 | Addressing Data Annotation Challenges in Multiple Sensors: A Solution for Scania Collected Datasets | Ajinkya Khoche et.al. | 2403.18649 | null |
2024-03-27 | Can the splashback radius be an observable boundary of galaxy clusters? | Théo Lebeau et.al. | 2403.18648 | null |
2024-03-27 | HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions | Hao Xu et.al. | 2403.18575 | link |
2024-03-27 | Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning – A Review | Mohammadreza Amirian et.al. | 2403.18565 | null |
2024-03-27 | Attention Calibration for Disentangled Text-to-Image Personalization | Yanbing Zhang et.al. | 2403.18551 | link |
2024-03-27 | CT-3DFlow : Leveraging 3D Normalizing Flows for Unsupervised Detection of Pathological Pulmonary CT scans | Aissam Djahnine et.al. | 2403.18514 | null |
2024-03-27 | Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds | Zhimin Yuan et.al. | 2403.18469 | link |
2024-03-27 | Backpropagation-free Network for 3D Test-time Adaptation | Yanshuo Wang et.al. | 2403.18442 | link |
2024-03-27 | MonoHair: High-Fidelity Hair Modeling from a Monocular Video | Keyu Wu et.al. | 2403.18356 | null |
2024-03-27 | Learning Inclusion Matching for Animation Paint Bucket Colorization | Yuekun Dai et.al. | 2403.18342 | link |
2024-03-28 | 3D Gap Opening in Non-Ideal MHD Protoplanetary Disks: Asymmetric Accretion, Meridional Vortices, and Observational Signatures | Xiao Hu et.al. | 2403.18292 | null |
2024-03-27 | AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation | Changkun Liu et.al. | 2403.18281 | null |
2024-03-27 | RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation | Yang Tian et.al. | 2403.18259 | null |
2024-03-27 | NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation | Ruikai Cui et.al. | 2403.18241 | null |
2024-03-27 | From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries | Ergon Cugler de Moraes Silva et.al. | 2403.18219 | link |
2024-03-27 | SCANet: Correcting LEGO Assembly Errors with Self-Correct Assembly Network | Yuxuan Wan et.al. | 2403.18195 | link |
2024-03-27 | Topology Optimization for the Full-Cell Design of Porous Electrodes in Electrochemical Energy Storage Devices | Hanyu Li et.al. | 2403.18184 | null |
2024-03-27 | Online Embedding Multi-Scale CLIP Features into 3D Maps | Shun Taguchi et.al. | 2403.18178 | null |
2024-03-26 | EgoLifter: Open-world 3D Segmentation for Egocentric Perception | Qiao Gu et.al. | 2403.18118 | null |
2024-03-26 | Scrolly2Reel: Turning News Graphics into TikToks by Adjusting Narrative Beats and Pacing | Duy K. Nguyen et.al. | 2403.18111 | null |
2024-03-26 | EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation | Chenhongyi Yang et.al. | 2403.18080 | link |
2024-03-26 | Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance | Zan Wang et.al. | 2403.18036 | link |
2024-03-26 | Signatures of magnetic braking in Class 0 protostars ? Exploring the gas kinematics in magnetized models of low-mass star formation | N. Añez-Lopez et.al. | 2403.18009 | null |
2024-03-26 | Black hole thermodynamics in natural variables: the BTZ case | Kiril Hristov et.al. | 2403.18008 | null |
2024-03-25 | Electro-optic properties from ab initio calculations in two-dimensional materials | Zhijun Jiang et.al. | 2403.17987 | null |
2024-03-26 | AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation | Qingping Sun et.al. | 2403.17934 | link |
2024-03-26 | TC4D: Trajectory-Conditioned Text-to-4D Generation | Sherwin Bahmani et.al. | 2403.17920 | null |
2024-03-26 | Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians | Kerui Ren et.al. | 2403.17898 | link |
2024-03-26 | A Survey on 3D Egocentric Human Pose Estimation | Md Mushfiqur Azam et.al. | 2403.17893 | link |
2024-03-26 | 2D Gaussian Splatting for Geometrically Accurate Radiance Fields | Binbin Huang et.al. | 2403.17888 | link |
2024-03-26 | To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of 3D Transfer Learning | Souhail Hadgi et.al. | 2403.17869 | null |
2024-03-26 | Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation | Abdelrhman Werby et.al. | 2403.17846 | null |
2024-03-26 | GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction | Hrishav Bakul Barua et.al. | 2403.17837 | link |
2024-03-26 | Implementing photometric stereo for scanning helium microscopy (SHeM) to reconstruct true-to-size 3D surfaces | Aleksandar Radic et.al. | 2403.17835 | null |
2024-03-26 | A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities | Ibrahim Ethem Hamamci et.al. | 2403.17834 | link |
2024-03-26 | DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions | Sammy Christen et.al. | 2403.17827 | null |
2024-03-26 | DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing | Matias Turkulainen et.al. | 2403.17822 | link |
2024-03-29 | Towards 3D Vision with Low-Cost Single-Photon Cameras | Fangzhou Mu et.al. | 2403.17801 | null |
2024-03-26 | System Calibration of a Field Phenotyping Robot with Multiple High-Precision Profile Laser Scanners | Felix Esser et.al. | 2403.17788 | null |
2024-03-26 | GenesisTex: Adapting Image Denoising Diffusion to Texture Space | Chenjian Gao et.al. | 2403.17782 | null |
2024-03-26 | Facet formation in slow three-dimensional fracture | Yuri Lubomirsky et.al. | 2403.17781 | null |
2024-03-26 | Makeup Prior Models for 3D Facial Makeup Estimation and Applications | Xingchao Yang et.al. | 2403.17761 | null |
2024-03-26 | AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation | Huawei Wei et.al. | 2403.17694 | link |
2024-03-26 | DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation | Qilin Wang et.al. | 2403.17664 | null |
2024-03-27 | Rapid non-destructive inspection of sub-surface defects in 3D printed alumina through 30 layers with 7 μm depth resolution | C. Lapre et.al. | 2403.17662 | null |
2024-03-28 | UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain Gaps | Maciej K Wozniak et.al. | 2403.17633 | link |
2024-03-26 | AniArtAvatar: Animatable 3D Art Avatar from a Single Image | Shaoxu Li et.al. | 2403.17631 | null |
2024-03-26 | Online Tree Reconstruction and Forest Inventory on a Mobile Robotic System | Leonard Freißmuth et.al. | 2403.17622 | null |
2024-03-26 | Grad-CAMO: Learning Interpretable Single-Cell Morphological Profiles from 3D Cell Painting Images | Vivek Gopalakrishnan et.al. | 2403.17615 | link |
2024-03-26 | DeepMIF: Deep Monotonic Implicit Fields for Large-Scale LiDAR 3D Mapping | Kutay Yılmaz et.al. | 2403.17550 | link |
2024-03-26 | WordRobe: Text-Guided Generation of Textured 3D Garments | Astitva Srivastava et.al. | 2403.17541 | null |
2024-03-26 | NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation | Jiahao Chen et.al. | 2403.17537 | null |
2024-03-29 | Dr.Hair: Reconstructing Scalp-Connected Hair Strands without Pre-training via Differentiable Rendering of Line Segments | Yusuke Takimoto et.al. | 2403.17496 | null |
2024-03-26 | Numerical analysis of a FE/SAV scheme for a Caginalp phase field model with mechanical effects in stereolithography | Xingguang Jin et.al. | 2403.17434 | link |
2024-03-29 | IQMDose3D: a software tool for reconstructing the dose in patient using patient planning CT images and the signals measured by IQM detector | Aitang Xing et.al. | 2403.17394 | null |
2024-03-26 | SSF3D: Strict Semi-Supervised 3D Object Detection with Switching Filter | Songbur Wong et.al. | 2403.17390 | null |
2024-03-26 | Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection | Jiacheng Zhang et.al. | 2403.17387 | null |
2024-03-26 | TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos | Yufu Wang et.al. | 2403.17346 | link |
2024-03-28 | Residual-based Language Models are Free Boosters for Biomedical Imaging | Zhixin Lai et.al. | 2403.17343 | link |
2024-03-27 | Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving | Junhao Zheng et.al. | 2403.17301 | link |
2024-03-26 | Maximum A Posteriori Ly-alpha Estimator (MAPLE): Band-power and covariance estimation of the 3D Ly-alpha forest power spectrum | Benjamin Horowitz et.al. | 2403.17294 | null |
2024-03-26 | Mesoscale Polymer Arrays: High Aspect Ratio Surface Structures and Their Digital Reconstruction | Demi E. Moed et.al. | 2403.17283 | null |
2024-03-25 | DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion | Yuanze Lin et.al. | 2403.17237 | null |
2024-03-25 | PROSPECT: Precision Robot Spectroscopy Exploration and Characterization Tool | Nathaniel Hanson et.al. | 2403.17232 | null |
2024-03-25 | WIN-PDQ: A Wiener-estimator-based projection-domain quantitative SPECT method that accounts for intra-regional uptake heterogeneity | Zekun Li et.al. | 2403.17226 | null |
2024-03-25 | AnimateMe: 4D Facial Expressions via Diffusion Models | Dimitrios Gerogiannis et.al. | 2403.17213 | null |
2024-03-25 | 6D Movable Antenna Enhanced Wireless Network Via Discrete Position and Rotation Optimization | Xiaodan Shao et.al. | 2403.17122 | null |
2024-03-25 | Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos | Remy Sabathier et.al. | 2403.17103 | link |
2024-03-25 | 3D printing of hierarchical structures made of inorganic silicon-rich glass featuring self-forming nanogratings | Po-Han Huang et.al. | 2403.17102 | null |
2024-03-25 | Understanding the Multi-wavelength Thermal Dust Polarization from the Orion Molecular Cloud in Light of the Radiative Torque Paradigm | Le Ngoc Tram et.al. | 2403.17088 | link |
2024-03-25 | Quantum Liquids: Emergent higher-rank gauge theory and fractons | Yizhi You et.al. | 2403.17074 | null |
2024-03-25 | Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding | Lingdong Kong et.al. | 2403.17010 | link |
2024-03-25 | Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions | Ye Li et.al. | 2403.17009 | link |
2024-03-25 | TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models | Zhongwei Zhang et.al. | 2403.17005 | null |
2024-03-25 | VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation | Yang Chen et.al. | 2403.17001 | null |
2024-03-25 | Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution | Zhikai Chen et.al. | 2403.17000 | null |
2024-03-25 | Comp4D: LLM-Guided Compositional 4D Scene Generation | Dejia Xu et.al. | 2403.16993 | null |
2024-03-25 | GSDF: 3DGS Meets SDF for Improved Rendering and Reconstruction | Mulin Yu et.al. | 2403.16964 | null |
2024-03-25 | Modes of the Dark Ages 21cm field accessible to a lunar radio interferometer | Philip Bull et.al. | 2403.16955 | null |
2024-03-25 | Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text | Junshu Tang et.al. | 2403.16897 | null |
2024-03-25 | Towards Balanced RGB-TSDF Fusion for Consistent Semantic Scene Completion by 3D RGB Feature Completion and a Classwise Entropy Loss Function | Laiyan Ding et.al. | 2403.16888 | null |
2024-03-25 | CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs | Yingji Zhong et.al. | 2403.16885 | null |
2024-03-25 | TAIL: A Terrain-Aware Multi-Modal SLAM Dataset for Robot Locomotion in Deformable Granular Environments | Chen Yao et.al. | 2403.16875 | null |
2024-03-25 | Holographic Gaussian Boson Sampling with Matrix Product States on 3D cQED Processors | Ningyi Lyu et.al. | 2403.16810 | null |
2024-03-25 | Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning | Sicong Pan et.al. | 2403.16803 | link |
2024-03-25 | CurbNet: Curb Detection Framework Based on LiDAR Point Cloud Segmentation | Guoyang Zhao et.al. | 2403.16794 | link |
2024-03-25 | C-arm inverse geometry CT for 3D cardiac chamber mapping | Jordan M. Slagowski et.al. | 2403.16779 | null |
2024-03-25 | Modeling the secular evolution of embedded protoplanetary discs | J. Mauxion et.al. | 2403.16753 | null |
2024-03-25 | Creating a Digital Twin of Spinal Surgery: A Proof of Concept | Jonas Hein et.al. | 2403.16736 | null |
2024-03-25 | Enabling Uncertainty Estimation in Iterative Neural Networks | Nikita Durasov et.al. | 2403.16732 | link |
2024-03-25 | Anti-de Sitter Momentum Space in 3D and 4D Quantum Gravity | Giovanni Amelino-Camelia et.al. | 2403.16721 | null |
2024-03-25 | Selective laser etching of displays: Closing the gap between optical simulations and fabrication | Martin Wimmer et.al. | 2403.16692 | null |
2024-03-25 | Clustering Propagation for Universal Medical Image Segmentation | Yuhang Ding et.al. | 2403.16646 | link |
2024-03-25 | Self-duel solution of 3D incompressible Navier-Stokes equations | Ning-An Lai et.al. | 2403.16642 | null |
2024-03-25 | Linearised Calderón problem: Reconstruction of unbounded perturbations in 3D | Henrik Garde et.al. | 2403.16588 | null |
2024-03-25 | DOrA: 3D Visual Grounding with Order-Aware Referring | Tung-Yu Wu et.al. | 2403.16539 | null |
2024-03-25 | Employing High-Dimensional RIS Information for RIS-aided Localization Systems | Tuo Wu et.al. | 2403.16521 | null |
2024-03-25 | CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification | Guangqian Yang et.al. | 2403.16520 | null |
2024-03-25 | Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework | Ziyao Huang et.al. | 2403.16510 | link |
2024-03-25 | Data-Driven Extrusion Force Control Tuning for 3D Printing | Xavier Guidetti et.al. | 2403.16470 | null |
2024-03-25 | RCBEVDet: Radar-camera Fusion in Bird’s Eye View for 3D Object Detection | Zhiwei Lin et.al. | 2403.16440 | link |
2024-03-25 | Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects | Zicong Fan et.al. | 2403.16428 | link |
2024-03-25 | Spike-NeRF: Neural Radiance Field Based On Spike Camera | Yijia Guo et.al. | 2403.16410 | null |
2024-03-25 | Elite360D: Towards Efficient 360 Depth Estimation via Semantic- and Distance-Aware Bi-Projection Fusion | Hao Ai et.al. | 2403.16376 | null |
2024-03-25 | Learning Action-based Representations Using Invariance | Max Rudolph et.al. | 2403.16369 | null |
2024-03-25 | 3D-EffiViTCaps: 3D Efficient Vision Transformer with Capsule for Medical Image Segmentation | Dongwei Gan et.al. | 2403.16350 | link |
2024-03-25 | Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks | Madhumitha Sakthi et.al. | 2403.16338 | null |
2024-03-24 | AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans | Cedric Perauer et.al. | 2403.16318 | link |
2024-03-24 | latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction | Christopher Wewer et.al. | 2403.16292 | null |
2024-03-24 | Thermal Analysis for NVIDIA GTX480 Fermi GPU Architecture | Savinay Nagendra et.al. | 2403.16239 | null |
2024-03-24 | A poroelastic plate model obtained by simultaneous homogenization and dimension reduction | Marin Bužančić et.al. | 2403.16220 | null |
2024-03-24 | Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane | Han Yan et.al. | 2403.16210 | null |
2024-03-24 | Skull-to-Face: Anatomy-Guided 3D Facial Reconstruction and Editing | Yongqing Liang et.al. | 2403.16207 | null |
2024-03-24 | FH-SSTNet: Forehead Creases based User Verification using Spatio-Spatial Temporal Network | Geetanjali Sharma et.al. | 2403.16202 | null |
2024-03-24 | Diffusion Model is a Good Pose Estimator from 3D RF-Vision | Junqiao Fan et.al. | 2403.16198 | null |
2024-03-24 | Enhancing MRI-Based Classification of Alzheimer’s Disease with Explainable 3D Hybrid Compact Convolutional Transformers | Arindam Majee et.al. | 2403.16175 | link |
2024-03-24 | On properties of a semi-explicit in time fourth-order vector compact scheme for the multidimensional acoustic wave equation | Alexander Zlotnik et.al. | 2403.16174 | null |
2024-03-28 | Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method | Jie Tian et.al. | 2403.16169 | null |
2024-03-24 | CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Jiarui Hu et.al. | 2403.16095 | null |
2024-03-24 | Semantic Is Enough: Only Semantic Information For NeRF Reconstruction | Ruibo Wang et.al. | 2403.16043 | null |
2024-03-24 | A transformer-based neural operator for large-eddy simulation of turbulence | Zhijie Li et.al. | 2403.16026 | null |
2024-03-24 | Dimensionally Reducing Generalized Symmetries from (3+1)-Dimensions | Emily Nardoni et.al. | 2403.15995 | null |
2024-03-24 | BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval | Yinda Chen et.al. | 2403.15992 | null |
2024-03-28 | Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields | Junhong Zhao et.al. | 2403.15981 | null |
2024-03-24 | PSHop: A Lightweight Feed-Forward Method for 3D Prostate Gland Segmentation | Yijing Yang et.al. | 2403.15971 | null |
2024-03-23 | Deep Probabilistic Direction Prediction in 3D with Applications to Directional Dark Matter Detectors | Majd Ghrear et.al. | 2403.15949 | link |
2024-03-23 | Team Coordination on Graphs: Problem, Analysis, and Algorithms | Manshi Limbu et.al. | 2403.15946 | null |
2024-03-23 | Three-dimensional clustering characteristics of large-stokes number sprays interacting with turbulent swirling co-flows | Ali Rostami et.al. | 2403.15945 | null |
2024-03-23 | Explore until Confident: Efficient Exploration for Embodied Question Answering | Allen Z. Ren et.al. | 2403.15941 | null |
2024-03-23 | Spatio-Temporal Bi-directional Cross-frame Memory for Distractor Filtering Point Cloud Single Object Tracking | Shaoyu Sun et.al. | 2403.15831 | null |
2024-03-23 | DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation | Mu-Yi Shen et.al. | 2403.15791 | link |
2024-03-23 | 3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge | Siwei Yang et.al. | 2403.15735 | null |
2024-03-23 | Contact-aware Human Motion Generation from Textual Descriptions | Sihan Ma et.al. | 2403.15709 | null |
2024-03-23 | UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation | Yuliang Guo et.al. | 2403.15705 | link |
2024-03-23 | Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections | Dongbin Zhang et.al. | 2403.15704 | null |
2024-03-23 | SceneX:Procedural Controllable Large-scale Scene Generation via Large-language Models | Mengqi Zhou et.al. | 2403.15698 | null |
2024-03-23 | Maximal Number of Skew Lines in the Fermat Surface | Sally Andria et.al. | 2403.15666 | null |
2024-03-22 | Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting | Jun Guo et.al. | 2403.15624 | null |
2024-03-22 | InterFusion: Text-Driven Generation of 3D Human-Object Interaction | Sisi Dai et.al. | 2403.15612 | link |
2024-03-22 | An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models | Zhengyi Zhao et.al. | 2403.15559 | null |
2024-03-22 | Language-Based Depth Hints for Monocular Depth Estimation | Dylan Auty et.al. | 2403.15551 | null |
2024-03-22 | Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting | Zheng Zhang et.al. | 2403.15530 | null |
2024-03-20 | Learning to Infer Generative Template Programs for Visual Concepts | R. Kenny Jones et.al. | 2403.15476 | link |
2024-03-20 | Human Detection in Realistic Through-the-Wall Environments using Raw Radar ADC Data and Parametric Neural Networks | Wei Wang et.al. | 2403.15468 | null |
2024-03-17 | Unified Generative Modeling of 3D Molecules via Bayesian Flow Networks | Yuxuan Song et.al. | 2403.15441 | link |
2024-03-22 | LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis | Kevin Xie et.al. | 2403.15385 | null |
2024-03-22 | ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars | Zhenwei Wang et.al. | 2403.15383 | link |
2024-03-22 | Time-efficient, high-resolution 3T whole-brain relaxometry using Cartesian 3D MR-STAT with CSF suppression | Hongyan Liu et.al. | 2403.15379 | link |
2024-03-22 | Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks | Aqeel Anwar et.al. | 2403.15370 | null |
2024-03-22 | Topological analysis and experimental control of transformations of domain walls in magnetic cylindrical nanowires | L. Álvaro-Gómez et.al. | 2403.15343 | null |
2024-03-25 | Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection | Hongzhi Gao et.al. | 2403.15317 | null |
2024-03-22 | Global Control for Local SO(3)-Equivariant Scale-Invariant Vessel Segmentation | Patryk Rygiel et.al. | 2403.15314 | link |
2024-03-22 | CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking | Nicolas Baumann et.al. | 2403.15313 | link |
2024-03-22 | Polarization Holes as an Indicator of Magnetic Field-Angular Momentum Alignment I. Initial Tests | Lijun Wang et.al. | 2403.15280 | null |
2024-03-22 | IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Junbo Yin et.al. | 2403.15241 | link |
2024-03-22 | LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example | Soyeon Yoon et.al. | 2403.15227 | link |
2024-03-22 | TriHelper: Zero-Shot Object Navigation with Dynamic Assistance | Lingfeng Zhang et.al. | 2403.15223 | null |
2024-03-22 | Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations | Pranav Kulkarni et.al. | 2403.15218 | link |
2024-03-22 | Multiphysics Numerical Method for Modeling Josephson Traveling-Wave Parametric Amplifiers | Samuel T. Elkin et.al. | 2403.15217 | null |
2024-03-22 | CRPlace: Camera-Radar Fusion with BEV Representation for Place Recognition | Shaowei Fu et.al. | 2403.15183 | null |
2024-03-22 | LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels | Tuo Feng et.al. | 2403.15173 | link |
2024-03-22 | FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos | Florian Langer et.al. | 2403.15161 | null |
2024-03-22 | RHINO-VR Experience: Teaching Mobile Robotics Concepts in an Interactive Museum Exhibit | Erik Schlachhoff et.al. | 2403.15151 | null |
2024-03-22 | Dynamic Interface Printing | Callum Vidler et.al. | 2403.15144 | null |
2024-03-22 | Modular Deep Active Learning Framework for Image Annotation: A Technical Report for the Ophthalmo-AI Project | Md Abdul Kadir et.al. | 2403.15143 | null |
2024-03-22 | EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries using Gaussian Splatting | Kailing Wang et.al. | 2403.15124 | null |
2024-03-22 | Allspark: Workload Orchestration for Visual Transformers on Processing In-Memory Systems | Mengke Ge et.al. | 2403.15069 | null |
2024-03-22 | Recent Trends in 3D Reconstruction of General Non-Rigid Scenes | Raza Yunus et.al. | 2403.15064 | null |
2024-03-22 | Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans | Heng Guo et.al. | 2403.15063 | link |
2024-03-22 | VRSO: Visual-Centric Reconstruction for Static Object Annotation | Chenyao Yu et.al. | 2403.15026 | link |
2024-03-22 | BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation | Jiahao Lu et.al. | 2403.15019 | link |
2024-03-22 | TexRO: Generating Delicate Textures of 3D Models by Recursive Optimization | Jinbo Wu et.al. | 2403.15009 | null |
2024-03-22 | Tri-Perspective View Decomposition for Geometry-Aware Depth Completion | Zhiqiang Yan et.al. | 2403.15008 | null |
2024-03-22 | Learning Neural Free-Energy Functionals with Pair-Correlation Matching | Jacobus Dijkman et.al. | 2403.15007 | null |
2024-03-22 | Vanishing of Resistivity upon Freezing of Vortex Liquid in Clean Superconductors | Naratip Nunchot et.al. | 2403.14992 | null |
2024-03-22 | DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow | Kyungmin Lee et.al. | 2403.14966 | null |
2024-03-22 | GPT-Connect: Interaction between Text-Driven Human Motion Generator and 3D Scenes in a Training-free Manner | Haoxuan Qu et.al. | 2403.14947 | null |
2024-03-22 | An investigation on electronic and magnetic properties of Cr substituted MoS $_2$ monolayer and multilayers-Hybrid functional calculations | Aloka Ranjan Sahoo et.al. | 2403.14945 | null |
2024-03-22 | STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians | Yifei Zeng et.al. | 2403.14939 | null |
2024-03-22 | Survey on Modeling of Articulated Objects | Jiayi Liu et.al. | 2403.14937 | null |
2024-03-25 | 3d Modularity Revisited | Miranda C. N. Cheng et.al. | 2403.14920 | null |
2024-03-21 | Black hole outflows initiated by a large-scale magnetic field | Bestin James et.al. | 2403.14882 | null |
2024-03-21 | Hyperspectral Neural Radiance Fields | Gerry Chen et.al. | 2403.14839 | null |
2024-03-21 | Evaluating Panoramic 3D Estimation in Indoor Lighting Analysis | Zining Cheng et.al. | 2403.14836 | null |
2024-03-26 | Biogenic sulfur gases as biosignatures on temperate sub-Neptune waterworlds | Shang-Min Tsai et.al. | 2403.14805 | null |
2024-03-21 | Geom-DeepONet: A Point-cloud-based Deep Operator Network for Field Predictions on 3D Parameterized Geometries | Junyan He et.al. | 2403.14788 | null |
2024-03-21 | Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance | Shenhao Zhu et.al. | 2403.14781 | link |
2024-03-21 | Universality of mean-field antiferromagnetic order in an anisotropic 3D Hubbard model at half-filling | E. Langmann et.al. | 2403.14768 | null |
2024-03-28 | Can 3D Vision-Language Models Truly Understand Natural Language? | Weipeng Deng et.al. | 2403.14760 | link |
2024-03-21 | Transforming from Kitaev to Disguised Ising Chain: Application to CoNb $_2$O$_6$ | Derek Churchill et.al. | 2403.14754 | null |
2024-03-21 | Zero-Shot Multi-Object Shape Completion | Shun Iwase et.al. | 2403.14628 | null |
2024-03-21 | MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images | Yuedong Chen et.al. | 2403.14627 | link |
2024-03-21 | ODTFormer: Efficient Obstacle Detection and Tracking with Stereo Cameras Based on Transformer | Tianye Ding et.al. | 2403.14626 | link |
2024-03-21 | GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation | Yinghao Xu et.al. | 2403.14621 | link |
2024-03-21 | ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition | Tianhao Wu et.al. | 2403.14619 | null |
2024-03-21 | DreamReward: Text-to-3D Generation with Human Preference | Junliang Ye et.al. | 2403.14613 | null |
2024-03-21 | Explorative Inbetweening of Time and Space | Haiwen Feng et.al. | 2403.14611 | null |
2024-03-21 | VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition | Yun-Jin Li et.al. | 2403.14594 | link |
2024-03-21 | Global Solutions to the 3D Half-Wave Maps Equation with Angular Regularity | Katie Marsden et.al. | 2403.14567 | null |
2024-03-21 | Visibility-Aware Keypoint Localization for 6DoF Object Pose Estimation | Ruyi Lian et.al. | 2403.14559 | null |
2024-03-21 | Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering | Antoine Guédon et.al. | 2403.14554 | null |
2024-03-21 | Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild | Junhyeong Cho et.al. | 2403.14539 | null |
2024-03-21 | HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression | Yihang Chen et.al. | 2403.14530 | link |
2024-03-21 | Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting | Alicia Durrer et.al. | 2403.14499 | link |
2024-03-21 | Modeling of high-pressure transient gas-liquid flow in M-shaped jumpers of subsea gas production systems | Alexander Yurishchev et.al. | 2403.14463 | null |
2024-03-23 | Exploring 3D Human Pose Estimation and Forecasting from the Robot’s Perspective: The HARPER Dataset | Andrea Avogaro et.al. | 2403.14447 | null |
2024-03-21 | OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation | Bohao Peng et.al. | 2403.14418 | link |
2024-03-21 | CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis | Matteo Bonotto et.al. | 2403.14412 | link |
2024-03-21 | InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity | Jiabin Liang et.al. | 2403.14376 | null |
2024-03-21 | SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field | Lizhe Liu et.al. | 2403.14366 | null |
2024-03-21 | Learning-to-Learn the Wave Angle Estimation | Eray Guven et.al. | 2403.14306 | null |
2024-03-21 | Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation | Francesco Di Felice et.al. | 2403.14279 | null |
2024-03-21 | Isotropic Gaussian Splatting for Real-Time Radiance Field Rendering | Yuanhao Gong et.al. | 2403.14244 | null |
2024-03-21 | On unifying control barrier and Lyapunov functions using QP and Sontag’s formula with an application to tumor dynamics | Jarne J. H. van Gemert et.al. | 2403.14226 | null |
2024-03-21 | Solvent-Free Silsesquioxane Self-Welding for 3D Printing Multi-Refractive Index Glass Objects | Piaoran Ye et.al. | 2403.14205 | null |
2024-03-21 | HCTO: Optimality-Aware LiDAR Inertial Odometry with Hybrid Continuous Time Optimization for Compact Wearable Mapping System | Jianping Li et.al. | 2403.14173 | link |
2024-03-21 | Extrinsic Calibration of Multiple LiDARs for a Mobile Robot based on Floor Plane And Object Segmentation | Shun Niijima et.al. | 2403.14161 | null |
2024-03-21 | Volumetric Environment Representation for Vision-Language Navigation | Rui Liu et.al. | 2403.14158 | link |
2024-03-21 | 3D Object Detection from Point Cloud via Voting Step Diffusion | Haoran Hou et.al. | 2403.14133 | null |
2024-03-21 | External Knowledge Enhanced 3D Scene Generation from Sketch | Zijie Wu et.al. | 2403.14121 | null |
2024-03-21 | Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method | Kyuwon Choi et.al. | 2403.14110 | null |
2024-03-21 | Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration | Zhihao Wang et.al. | 2403.14104 | null |
2024-03-21 | MaskSAM: Towards Auto-prompt SAM with Mask Classification for Medical Image Segmentation | Bin Xie et.al. | 2403.14103 | null |
2024-03-21 | Magnetization and exchange-stiffness constants of Fe-Al-Si alloys at finite-temperatures: A first-principles study | Shogo Yamashita et.al. | 2403.14096 | null |
2024-03-21 | QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping | Zhuang Xiong et.al. | 2403.14070 | null |
2024-03-21 | Exploring the role of the halo mass function for inferring astrophysical parameters during reionisation | Bradley Greig et.al. | 2403.14061 | null |
2024-03-21 | Adaptive Finite Element Interpolated Neural Networks | Santiago Badia et.al. | 2403.14054 | null |
2024-03-21 | Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions | Jiacong Xu et.al. | 2403.14053 | link |
2024-03-20 | Impact of regularization on achieved resolution in 3D tunable structured illumination microscopy (TSIM) | Arash Atibi et.al. | 2403.14035 | null |
2024-03-20 | LFS-Aware Surface Reconstruction from Unoriented 3D Point Clouds | Rao Fu et.al. | 2403.13924 | link |
2024-03-20 | CoMo: Controllable Motion Generation through Language Guided Pose Code Editing | Yiming Huang et.al. | 2403.13900 | null |
2024-03-20 | A scattering theory construction of dynamical solitons in 3d | Istvan Kadar et.al. | 2403.13891 | null |
2024-03-22 | A Noisy Approach to Intrinsically Mixed-State Topological Order | Ramanjit Sohal et.al. | 2403.13879 | null |
2024-03-20 | Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a Compact Representation | Hugues Thomas et.al. | 2403.13777 | null |
2024-03-20 | Heavy States in 3d Gravity and 2d CFT | David Grabovsky et.al. | 2403.13757 | null |
2024-03-22 | Unraveling the Optical Signatures of Polymeric Carbon Nitrides: Insights into Stacking-Induced Excitonic Transitions | Changbin Im et.al. | 2403.13685 | null |
2024-03-20 | DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses | Chen Zhao et.al. | 2403.13683 | link |
2024-03-20 | DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance | Zixuan Wang et.al. | 2403.13667 | link |
2024-03-20 | T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image | Shijie Zhang et.al. | 2403.13663 | null |
2024-03-24 | 3D Directed Formation Control with Global Shape Convergence using Bispherical Coordinates | Omid Mirzaeedodangeh et.al. | 2403.13609 | null |
2024-03-20 | Encoding the Subsurface in 3D with Seismic | Ben Lasscock et.al. | 2403.13593 | null |
2024-03-20 | Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer | Yu Deng et.al. | 2403.13570 | null |
2024-03-20 | Find n’ Propagate: Open-Vocabulary 3D Object Detection in Urban Environments | Djamahl Etchegaray et.al. | 2403.13556 | link |
2024-03-20 | Compress3D: a Compressed Latent Space for 3D Generation from a Single Image | Bowen Zhang et.al. | 2403.13524 | null |
2024-03-20 | High-confidence pseudo-labels for domain adaptation in COVID-19 detection | Robert Turnbull et.al. | 2403.13509 | null |
2024-03-20 | Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion | Lucas Nunes et.al. | 2403.13470 | link |
2024-03-20 | Fast-Poly: A Fast Polyhedral Framework For 3D Multi-Object Tracking | Xiaoyu Li et.al. | 2403.13443 | link |
2024-03-24 | See, Imagine, Plan: Discovering and Hallucinating Tasks from a Single Image | Chenyang Ma et.al. | 2403.13438 | null |
2024-03-20 | Advancing 6D Pose Estimation in Augmented Reality – Overcoming Projection Ambiguity with Uncontrolled Imagery | Mayura Manawadu et.al. | 2403.13434 | null |
2024-03-20 | Automatic Navigation Map Generation for Mobile Robots in Urban Environments | Luca Mozzarelli et.al. | 2403.13431 | null |
2024-03-20 | Acceptable solutions of the Schrodinger radial equation for a particle in a two-dimensional central potential | Jesus Etxebarria et.al. | 2403.13422 | null |
2024-03-20 | Cell Tracking in C. elegans with Cell Position Heatmap-Based Alignment and Pairwise Detection | Kaito Shiku et.al. | 2403.13412 | null |
2024-03-20 | DOR3D-Net: Dense Ordinal Regression Network for 3D Hand Pose Estimation | Yamin Mao et.al. | 2403.13405 | null |
2024-03-20 | MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination | Weiying Wang et.al. | 2403.13348 | null |
2024-03-21 | LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment | Peishan Cong et.al. | 2403.13307 | link |
2024-03-25 | Interaction-Induced Dimensional Crossover through Full 3D to 1D | Tao Yu et.al. | 2403.13295 | null |
2024-03-20 | Map-Aware Human Pose Prediction for Robot Follow-Ahead | Qingyuan Jiang et.al. | 2403.13294 | null |
2024-03-20 | Text-to-3D Shape Generation | Han-Hung Lee et.al. | 2403.13289 | null |
2024-03-20 | Beyond Skeletons: Integrative Latent Mapping for Coherent 4D Sequence Generation | Qitong Yang et.al. | 2403.13238 | null |
2024-03-19 | DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images | Zaid Tasneem et.al. | 2403.13199 | null |
2024-03-19 | 3D Semantic MapNet: Building Maps for Multi-Object Re-Identification in 3D | Vincent Cartillier et.al. | 2403.13190 | null |
2024-03-19 | Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation | Kasi Viswanath et.al. | 2403.13188 | link |
2024-03-21 | Towards complexity in de Sitter space from the double-scaled Sachdev-Ye-Kitaev model | Sergio E. Aguilar-Gutierrez et.al. | 2403.13186 | null |
2024-03-19 | SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification | Yuexi Du et.al. | 2403.13148 | link |
2024-03-19 | Better Call SAL: Towards Learning to Segment Anything in Lidar | Aljoša Ošep et.al. | 2403.13129 | link |
2024-03-19 | Trustworthiness of Pretrained Transformers for Lung Cancer Segmentation | Aneesh Rangnekar et.al. | 2403.13113 | null |
2024-03-19 | SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model | Armen Avetisyan et.al. | 2403.13064 | null |
2024-03-17 | General Line Coordinates in 3D | Joshua Martinez et.al. | 2403.13014 | null |
2024-03-19 | GVGEN: Text-to-3D Generation with Volumetric Representation | Xianglong He et.al. | 2403.12957 | null |
2024-03-19 | The physics of Core-Collapse Supernovae: explosion mechanism and explosive nucleosynthesis | Luca Boccioli et.al. | 2403.12942 | null |
2024-03-19 | Semantic Layering in Room Segmentation via LLMs | Taehyeon Kim et.al. | 2403.12920 | null |
2024-03-19 | TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation | Yufei Liu et.al. | 2403.12906 | null |
2024-03-19 | EmoVOCA: Speech-Driven Emotional 3D Talking Heads | Federico Nocentini et.al. | 2403.12886 | link |
2024-03-19 | PoNQ: a Neural QEM-based Mesh Representation | Nissim Maruani et.al. | 2403.12870 | null |
2024-03-19 | Generative Enhancement for 3D Medical Images | Lingting Zhu et.al. | 2403.12852 | link |
2024-03-19 | 3d Quantum Trace Map | Samuel Panitch et.al. | 2403.12850 | null |
2024-03-19 | Compositional 3D Scene Synthesis with Scene Graph Guided Layout-Shape Generation | Yao Wei et.al. | 2403.12848 | null |
2024-03-19 | Embarrassingly Simple Scribble Supervision for 3D Medical Segmentation | Karol Gotkowski et.al. | 2403.12834 | null |
2024-03-19 | Oriented and Non-oriented Cubical Surfaces in The Penteract | Manuel Estevez et.al. | 2403.12825 | null |
2024-03-27 | A Physics-embedded Deep Learning Framework for Cloth Simulation | Zhiwei Zhao et.al. | 2403.12820 | link |
2024-03-19 | Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation | Jingtao Sun et.al. | 2403.12728 | link |
2024-03-19 | HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting | Hongyu Zhou et.al. | 2403.12722 | null |
2024-03-20 | ICE: Interactive 3D Game Character Editing via Dialogue | Haoqian Wu et.al. | 2403.12667 | null |
2024-03-19 | PointGrasp: Point Cloud-based Grasping for Tendon-driven Soft Robotic Glove Applications | Chen Hu et.al. | 2403.12631 | null |
2024-03-19 | Lifting Multi-View Detection and Tracking to the Bird’s Eye View | Torben Teepe et.al. | 2403.12573 | link |
2024-03-19 | Recovering Composition Algebras from 3D Geometric Algebras | Daniele Corradetti et.al. | 2403.12569 | null |
2024-03-19 | Non-negative tensor factorization for vibration-based local damage detection | Mateusz Gabor et.al. | 2403.12554 | null |
2024-03-22 | RGBD GS-ICP SLAM | Seongbo Ha et.al. | 2403.12550 | link |
2024-03-19 | A program for 3D nuclear static and time-dependent density-functional theory with full Skyrme energy density functional: HIT3D | Yue Shi et.al. | 2403.12539 | null |
2024-03-19 | Multi-View Active Sensing for Human-Robot Interaction via Hierarchically Connected Tree | Yuanjiong Ying et.al. | 2403.12538 | null |
2024-03-19 | High-Fidelity SLAM Using Gaussian Splatting with Rendering-Guided Densification and Regularized Optimization | Shuo Sun et.al. | 2403.12535 | link |
2024-03-19 | Formation of Polar Crown Filaments Magnetic Fields by Supergranular Helicity Injection | Huanxin Chen et.al. | 2403.12497 | null |
2024-03-19 | PostoMETRO: Pose Token Enhanced Mesh Transformer for Robust 3D Human Mesh Recovery | Wendi Yang et.al. | 2403.12473 | null |
2024-03-19 | SC-Diff: 3D Shape Completion with Latent Diffusion Models | Juan D. Galvis et.al. | 2403.12470 | null |
2024-03-25 | Diagrammatic Instructions to Specify Spatial Objectives and Constraints with Applications to Mobile Base Placement | Qilin Sun et.al. | 2403.12465 | link |
2024-03-19 | Self-learning Canonical Space for Multi-view 3D Human Pose Estimation | Xiaoben Li et.al. | 2403.12440 | null |
2024-03-19 | Precise-Physics Driven Text-to-3D Generation | Qingshan Xu et.al. | 2403.12438 | null |
2024-03-19 | Prototipo de video juego activo basado en una cámara 3D para motivar la actividad física en niños y adultos mayores | Benjamín Ojeda Magaña et.al. | 2403.12432 | null |
2024-03-19 | Bin Packing Optimization via Deep Reinforcement Learning | Baoying Wang et.al. | 2403.12420 | null |
2024-03-19 | ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance | Yongwei Chen et.al. | 2403.12409 | null |
2024-03-19 | A kinetic-magnetohydrodynamic model with adaptive mesh refinement for modeling heliosphere neutral-plasma interaction | Yuxi Chen et.al. | 2403.12395 | null |
2024-03-19 | VideoBadminton: A Video Dataset for Badminton Action Recognition | Qi Li et.al. | 2403.12385 | null |
2024-03-19 | GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation | Quankai Gao et.al. | 2403.12365 | null |
2024-03-19 | Multi-State, Ultra-thin, BEOL-Compatible AlScN Ferroelectric Diodes | Kwan-Ho Kim et.al. | 2403.12361 | null |
2024-03-18 | EffiPerception: an Efficient Framework for Various Perception Tasks | Xinhao Xiang et.al. | 2403.12317 | null |
2024-03-18 | Prototipo de un Contador Bidireccional Automático de Personas basado en sensores de visión 3D | Benjamín Ojeda-Magaña et.al. | 2403.12310 | null |
2024-03-18 | Semialgebraic Range Stabbing, Ray Shooting, and Intersection Counting in the Plane | Timothy M. Chan et.al. | 2403.12303 | null |
2024-03-18 | R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding | Qirui Wu et.al. | 2403.12301 | null |
2024-03-18 | Estimation and Analysis of Slice Propagation Uncertainty in 3D Anatomy Segmentation | Rachaell Nihalaani et.al. | 2403.12290 | link |
2024-03-18 | BostonTwin: the Boston Digital Twin for Ray-Tracing in 6G Networks | Paolo Testolina et.al. | 2403.12289 | link |
2024-03-18 | Interfacing Quantum Spin Hall and Quantum Anomalous Hall insulators: Bi bilayer on MnBi $_2$Te$_4$ -family materials | I. I. Klimovskikh et.al. | 2403.12287 | null |
2024-03-18 | Measurement of anisotropies in Supernova Remnant observations and their interpretation using numerical models | Soham Mandal et.al. | 2403.12264 | null |
2024-03-18 | Motion and temporal B0 shift corrections for quantitative susceptibility mapping (QSM) and R2* mapping using dual-echo spiral navigators and conjugate-phase reconstruction | Yuguang Meng et.al. | 2403.12230 | null |
2024-03-18 | DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions | Yunxiao Shi et.al. | 2403.12202 | null |
2024-03-18 | One-shot pair distribution functions of thin films using lab-based x-ray sources | Johan Bylin et.al. | 2403.12163 | null |
2024-03-18 | ThermoNeRF: Multimodal Neural Radiance Fields for Thermal Novel View Synthesis | Mariam Hassan et.al. | 2403.12154 | link |
2024-03-16 | Deep Generative Design for Mass Production | Jihoon Kim et.al. | 2403.12098 | null |
2024-03-18 | StereoNavNet: Learning to Navigate using Stereo Cameras with Auxiliary Occupancy Voxels | Hongyu Li et.al. | 2403.12039 | null |
2024-03-18 | VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models | Junlin Han et.al. | 2403.12034 | null |
2024-03-19 | Generic 3D Diffusion Adapter Using Controlled Multi-View Editing | Hansheng Chen et.al. | 2403.12032 | link |
2024-03-18 | Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail | Mingjin Chen et.al. | 2403.12028 | link |
2024-03-18 | LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation | Yushi Lan et.al. | 2403.12019 | link |
2024-03-18 | GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image | Xiao Fu et.al. | 2403.12013 | null |
2024-03-18 | HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data | Mengqi Zhang et.al. | 2403.12011 | null |
2024-03-18 | VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model | Qi Zuo et.al. | 2403.12010 | null |
2024-03-18 | SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion | Vikram Voleti et.al. | 2403.12008 | null |
2024-03-18 | GetMesh: A Controllable Model for High-quality Mesh Generation and Manipulation | Zhaoyang Lyu et.al. | 2403.11990 | null |
2024-03-18 | SceneSense: Diffusion Models for 3D Occupancy Synthesis from Partial Observation | Alec Reed et.al. | 2403.11985 | null |
2024-03-18 | Pedestrian Tracking with Monocular Camera using Unconstrained 3D Motion Model | Jan Krejčí et.al. | 2403.11978 | null |
2024-03-18 | Advancing COVID-19 Detection in 3D CT Scans | Qingqiu Li et.al. | 2403.11953 | null |
2024-03-18 | Learning Dynamical Systems Encoding Non-Linearity within Space Curvature | Bernardo Fichera et.al. | 2403.11948 | link |
2024-03-18 | RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF | Sibi Catley-Chandar et.al. | 2403.11909 | null |
2024-03-18 | Radiative loss and ion-neutral collisional effects in astrophysical plasmas | Beatrice Popescu Braileanu et.al. | 2403.11900 | null |
2024-03-18 | GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors | LI Yang et.al. | 2403.11899 | null |
2024-03-18 | InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting | Jiaxiang Tang et.al. | 2403.11878 | null |
2024-03-18 | NuGraph2: A Graph Neural Network for Neutrino Physics Event Reconstruction | V Hewes et.al. | 2403.11872 | null |
2024-03-20 | View-Consistent 3D Editing with Gaussian Splatting | Yuxuan Wang et.al. | 2403.11868 | null |
2024-03-18 | Complete and Efficient Graph Transformers for Crystal Material Property Prediction | Keqiang Yan et.al. | 2403.11857 | link |
2024-03-18 | GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2403.11848 | null |
2024-03-18 | Agent3D-Zero: An Agent for Zero-shot 3D Understanding | Sha Zhang et.al. | 2403.11835 | null |
2024-03-19 | BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting | Lingzhe Zhao et.al. | 2403.11831 | link |
2024-03-18 | A meshless and binless approach to compute statistics in 3D Ensemble PTV | Manuel Ratz et.al. | 2403.11828 | null |
2024-03-18 | Sound Event Detection and Localization with Distance Estimation | Daniel Aleksander Krause et.al. | 2403.11827 | null |
2024-03-18 | HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation | Sha Zhang et.al. | 2403.11817 | link |
2024-03-18 | Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Yuqi Zhang et.al. | 2403.11812 | link |
2024-03-18 | OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation | Haochen Jiang et.al. | 2403.11796 | null |
2024-03-18 | RIS-aided Single-frequency 3D Imaging by Exploiting Multi-view Image Correlations | Yixuan Huang et.al. | 2403.11764 | null |
2024-03-18 | PARMESAN: Parameter-Free Memory Search and Transduction for Dense Prediction Tasks | Philip Matthias Winter et.al. | 2403.11743 | null |
2024-03-19 | Urban Scene Diffusion through Semantic Occupancy Map | Junge Zhang et.al. | 2403.11697 | null |
2024-03-18 | TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction | Ali Asghar Sharifi et.al. | 2403.11695 | null |
2024-03-18 | TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models | Lisa Weijler et.al. | 2403.11691 | null |
2024-03-18 | MASSTAR: A Multi-Modal and Large-Scale Scene Dataset with a Versatile Toolchain for Surface Prediction and Completion | Guiyong Zheng et.al. | 2403.11681 | null |
2024-03-18 | NEDS-SLAM: A Novel Neural Explicit Dense Semantic SLAM Framework using 3D Gaussian Splatting | Yiming Ji et.al. | 2403.11679 | null |
2024-03-18 | Exploring 3D-aware Latent Spaces for Efficiently Learning Numerous Scenes | Antoine Schnepf et.al. | 2403.11678 | null |
2024-03-20 | Skyrmion on Magnetic Tunnel Junction: Interweaving Quantum Transport with Micro-magnetism | Aashish Chahal et.al. | 2403.11666 | null |
2024-03-18 | An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation | Zewen Xu et.al. | 2403.11639 | null |
2024-03-18 | Personalized 3D Human Pose and Shape Refinement | Tom Wehrbein et.al. | 2403.11634 | null |
2024-03-18 | QEAN: Quaternion-Enhanced Attention Network for Visual Dance Generation | Zhizhen Zhou et.al. | 2403.11626 | null |
2024-03-20 | GaussNav: Gaussian Splatting for Visual Navigation | Xiaohan Lei et.al. | 2403.11625 | link |
2024-03-18 | Frontier-Based Exploration for Multi-Robot Rendezvous in Communication-Restricted Unknown Environments | Mauro Tellaroli et.al. | 2403.11617 | null |
2024-03-18 | UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling | Yujiao Jiang et.al. | 2403.11589 | null |
2024-03-18 | DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction | Yuxin Yao et.al. | 2403.11586 | link |
2024-03-18 | 3DGS-Calib: 3D Gaussian Splatting for Multimodal SpatioTemporal Calibration | Quentin Herau et.al. | 2403.11577 | null |
2024-03-20 | Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem | Mincheol Chang et.al. | 2403.11573 | null |
2024-03-18 | Probing cold gas with Mg II and Ly $α$ radiative transfer | Seok-Jun Chang et.al. | 2403.11524 | null |
2024-03-18 | GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects | Sungphill Moon et.al. | 2403.11510 | null |
2024-03-18 | Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors | Ruicheng Wang et.al. | 2403.11503 | null |
2024-03-18 | SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction | Shuang Wang et.al. | 2403.11482 | link |
2024-03-19 | VIHE: Virtual In-Hand Eye Transformer for 3D Robotic Manipulation | Weiyao Wang et.al. | 2403.11461 | null |
2024-03-18 | Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning | Teppei Suzuki et.al. | 2403.11460 | link |
2024-03-18 | Bridging 3D Gaussian and Mesh for Freeview Video Rendering | Yuting Xiao et.al. | 2403.11453 | null |
2024-03-18 | Motion-aware 3D Gaussian Splatting for Efficient Dynamic Scene Reconstruction | Zhiyang Guo et.al. | 2403.11447 | null |
2024-03-18 | BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors | Tingyang Zhang et.al. | 2403.11427 | null |
2024-03-22 | Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning | Rao Fu et.al. | 2403.11401 | null |
2024-03-18 | Beyond Uncertainty: Risk-Aware Active View Acquisition for Safe Robot Navigation and 3D Scene Understanding with FisherRF | Guangyi Liu et.al. | 2403.11396 | null |
2024-03-21 | V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions | Baolu Li et.al. | 2403.11371 | null |
2024-03-17 | 3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization | Peng Jiang et.al. | 2403.11367 | null |
2024-03-17 | Creating Seamless 3D Maps Using Radiance Fields | Sai Tarun Sathyan et.al. | 2403.11364 | null |
2024-03-17 | Discrete Painlevé equations and pencils of quadrics in $\mathbb P^3$ | Jaume Alonso et.al. | 2403.11349 | null |
2024-03-17 | Ensembling and Test Augmentation for Covid-19 Detection and Covid-19 Domain Adaptation from 3D CT-Scans | Fares Bougourzi et.al. | 2403.11338 | null |
2024-03-17 | GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering | Yanyan Li et.al. | 2403.11324 | null |
2024-03-17 | Forging the Industrial Metaverse – Where Industry 5.0, Augmented and Mixed Reality, IIoT, Opportunistic Edge Computing and Digital Twins Meet | Tiago M. Fernández-Caramés et.al. | 2403.11312 | null |
2024-03-20 | A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation | Qucheng Peng et.al. | 2403.11310 | link |
2024-03-17 | BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis | Lutao Jiang et.al. | 2403.11273 | link |
2024-03-17 | Zutu: A Platform for Localization and Navigation of Swarm Robots Using Virtual Grids | Prateek et.al. | 2403.11252 | null |
2024-03-17 | Compact 3D Gaussian Splatting For Dense Visual SLAM | Tianchen Deng et.al. | 2403.11247 | link |
2024-03-17 | A component-level co-rotational 3D continuum finite element framework for efficient flexible multibody analysis | Ziyun Kan et.al. | 2403.11239 | null |
2024-03-17 | Concatenate, Fine-tuning, Re-training: A SAM-enabled Framework for Semi-supervised 3D Medical Image Segmentation | Shumeng Li et.al. | 2403.11229 | link |
2024-03-17 | THOR: Text to Human-Object Interaction Diffusion via Relation Intervention | Qianyang Wu et.al. | 2403.11208 | null |
2024-03-17 | The Simplex Projection: Lossless Visualization of 4D Compositional Data on a 2D Canvas | Marvin Schmitt et.al. | 2403.11141 | null |
2024-03-17 | Recent Advances in 3D Gaussian Splatting | Tong Wu et.al. | 2403.11134 | null |
2024-03-17 | Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications | Yonggan Fu et.al. | 2403.11131 | link |
2024-03-17 | 3D Human Reconstruction in the Wild with Synthetic Data Using Generative Models | Yongtao Ge et.al. | 2403.11111 | null |
2024-03-17 | PyroTrack: Belief-Based Deep Reinforcement Learning Path Planning for Aerial Wildfire Monitoring in Partially Observable Environments | Sahand Khoshdel et.al. | 2403.11095 | null |
2024-03-17 | Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration | Zhihao Liang et.al. | 2403.11056 | null |
2024-03-17 | Endora: Video Generation Models as Endoscopy Simulators | Chenxin Li et.al. | 2403.11050 | null |
2024-03-16 | Multiplane Quantitative Phase Imaging Using a Wavelength-Multiplexed Diffractive Optical Processor | Che-Yung Shen et.al. | 2403.11035 | null |
2024-03-16 | EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration | Abu Zahid Bin Aziz et.al. | 2403.11026 | link |
2024-03-16 | Fast Sparse View Guided NeRF Update for Object Reconfigurations | Ziqi Lu et.al. | 2403.11024 | null |
2024-03-16 | N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields | Yash Bhalgat et.al. | 2403.10997 | null |
2024-03-16 | Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription | Hongxiang Zhao et.al. | 2403.10953 | null |
2024-03-19 | ScanTalk: 3D Talking Heads from Unregistered Scans | Federico Nocentini et.al. | 2403.10942 | link |
2024-03-16 | MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field | Dongyu Yan et.al. | 2403.10840 | link |
2024-03-16 | SF(DA) $^2$ : Source-free Domain Adaptation Through the Lens of Data Augmentation | Uiwon Hwang et.al. | 2403.10834 | link |
2024-03-16 | DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation | Qilong Zhao et.al. | 2403.10831 | link |
2024-03-16 | MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections | Mude Hui et.al. | 2403.10815 | link |
2024-03-16 | DarkGS: Learning Neural Illumination and 3D Gaussians Relighting for Robotic Exploration in the Dark | Tianyi Zhang et.al. | 2403.10814 | link |
2024-03-16 | Speech-driven Personalized Gesture Synthetics: Harnessing Automatic Fuzzy Feature Inference | Fan Zhang et.al. | 2403.10805 | null |
2024-03-16 | “It’s Kind of Context Dependent”: Understanding Blind and Low Vision People’s Video Accessibility Preferences Across Viewing Scenarios | Lucy Jiang et.al. | 2403.10792 | null |
2024-03-15 | PyHySCO: GPU-Enabled Susceptibility Artifact Distortion Correction in Seconds | Abigail Julian et.al. | 2403.10706 | link |
2024-03-15 | Computational Study on the Impact of Gasoline-Ethanol Blending on Autoignition and Soot/NOx Emissions under Gasoline Compression Ignition Conditions | Krishna C. Kalvakala et.al. | 2403.10687 | null |
2024-03-15 | GS-Pose: Cascaded Framework for Generalizable Segmentation-based 6D Object Pose Estimation | Dingding Cai et.al. | 2403.10683 | null |
2024-03-15 | Mitigation and optimization of induced seismicity using physics-based forecasting | Ryley G Hill et.al. | 2403.10675 | null |
2024-03-15 | NeuralOCT: Airway OCT Analysis via Neural Fields | Yining Jiao et.al. | 2403.10622 | null |
2024-03-15 | Circumnuclear Multi-phase Gas in the Circinus Galaxy. VI. Detectability of Molecular Inflow and Atomic Outflow | Shunsuke Baba et.al. | 2403.10593 | null |
2024-03-12 | Two-sided Acoustic Metascreen for Broadband and Individual Reflection and Transmission Control | Ao Chen et.al. | 2403.10548 | null |
2024-03-15 | Lifelong LERF: Local 3D Semantic Inventory Monitoring Using FogROS2 | Adam Rashid et.al. | 2403.10494 | null |
2024-03-15 | Robust Shape Fitting for 3D Scene Abstraction | Florian Kluger et.al. | 2403.10452 | link |
2024-03-15 | SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians | Hiba Dahmani et.al. | 2403.10427 | null |
2024-03-15 | SculptDiff: Learning Robotic Clay Sculpting from Humans with Goal Conditioned Diffusion Policy | Alison Bartsch et.al. | 2403.10401 | null |
2024-03-15 | Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding | Pengkun Liu et.al. | 2403.10395 | link |
2024-03-18 | ANIM: Accurate Neural Implicit Model for Human Reconstruction from a single RGB-D image | Marco Pesavento et.al. | 2403.10357 | null |
2024-03-15 | SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras | Yingqi Tang et.al. | 2403.10353 | link |
2024-03-15 | ParaPoint: Learning Global Free-Boundary Surface Parameterization of 3D Point Clouds | Qijian Zhang et.al. | 2403.10349 | null |
2024-03-15 | SCILLA: SurfaCe Implicit Learning for Large Urban Area, a volumetric hybrid solution | Hala Djeghim et.al. | 2403.10344 | null |
2024-03-15 | Thermal-NeRF: Neural Radiance Fields from an Infrared Camera | Tianxiang Ye et.al. | 2403.10340 | link |
2024-03-15 | NECA: Neural Customizable Human Avatar | Junjin Xiao et.al. | 2403.10335 | link |
2024-03-20 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297 | link |
2024-03-15 | FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model | Qijun Feng et.al. | 2403.10242 | null |
2024-03-15 | Expected performance of the Pyramid wavefront sensor with a laser guide star for 40 m class telescopes | Francisco Oyarzún et.al. | 2403.10177 | null |
2024-03-15 | Topology optimization of blazed gratings under conical incidence | Simon Ans et.al. | 2403.10174 | null |
2024-03-15 | SemanticHuman-HD: High-Resolution Semantic Disentangled 3D Human Generation | Peng Zheng et.al. | 2403.10166 | null |
2024-03-19 | GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time | Hao Li et.al. | 2403.10147 | null |
2024-03-15 | A Novel Bioinspired Neuromorphic Vision-based Tactile Sensor for Fast Tactile Perception | Omar Faris et.al. | 2403.10120 | null |
2024-03-15 | URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields | Bo Xu et.al. | 2403.10119 | null |
2024-03-20 | KP-RED: Exploiting Semantic Keypoints for Joint 3D Shape Retrieval and Deformation | Ruida Zhang et.al. | 2403.10099 | link |
2024-03-15 | RangeLDM: Fast Realistic LiDAR Point Cloud Generation | Qianjiang Hu et.al. | 2403.10094 | link |
2024-03-15 | CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner | Tingbing Yan et.al. | 2403.10082 | null |
2024-03-18 | A survey of synthetic data augmentation methods in computer vision | Alhassan Mumuni et.al. | 2403.10075 | null |
2024-03-15 | Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing | Tian-Xing Xu et.al. | 2403.10050 | null |
2024-03-15 | SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception | Yiheng Li et.al. | 2403.10036 | null |
2024-03-15 | Generation of isolated attosecond electron bunches by the diffraction of a polarization-tailored intense laser beam | Ke Hu et.al. | 2403.10017 | null |
2024-03-15 | Visual Foundation Models Boost Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation | Jingyi Xu et.al. | 2403.10001 | link |
2024-03-15 | Interactive Distance Field Mapping and Planning to Enable Human-Robot Collaboration | Usama Ali et.al. | 2403.09988 | link |
2024-03-19 | Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting | Zhiqi Li et.al. | 2403.09981 | link |
2024-03-15 | Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience | Xiaohang Yu et.al. | 2403.09973 | null |
2024-03-15 | NR-Surface: NextG-ready $μ$ W-reconfigurable mmWave Metasurface | Minseok Kim et.al. | 2403.09967 | null |
2024-03-15 | Boundary Constraint-free Biomechanical Model-Based Surface Matching for Intraoperative Liver Deformation Correction | Zixin Yang et.al. | 2403.09964 | link |
2024-03-15 | RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training | Zhixiu Lu et.al. | 2403.09948 | null |
2024-03-15 | Attention-Enhanced Hybrid Feature Aggregation Network for 3D Brain Tumor Segmentation | Ziya Ata Yazıcı et.al. | 2403.09942 | link |
2024-03-18 | Touch-GS: Visual-Tactile Supervised 3D Gaussian Splatting | Aiden Swann et.al. | 2403.09875 | null |
2024-03-14 | ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Image | Fangqiang Ding et.al. | 2403.09871 | null |
2024-03-14 | MARVIS: Motion & Geometry Aware Real and Virtual Image Segmentation | Jiayi Wu et.al. | 2403.09850 | link |
2024-03-14 | Kitaev physics in the two-dimensional magnet NiPSe $_3$ | Cheng Peng et.al. | 2403.09831 | null |
2024-03-14 | FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images | Yiqing Shen et.al. | 2403.09827 | link |
2024-03-14 | On the Utility of 3D Hand Poses for Action Recognition | Md Salman Shamil et.al. | 2403.09805 | link |
2024-03-14 | BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects | Tomas Hodan et.al. | 2403.09799 | null |
2024-03-14 | GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding | Chengyao Wang et.al. | 2403.09639 | link |
2024-03-14 | GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping | Yuhang Zheng et.al. | 2403.09637 | link |
2024-03-14 | On locally symmetric polynomial metrics: Riemannian and Finslerian surfaces | Csaba Vincze et.al. | 2403.09633 | null |
2024-03-14 | Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image | Yiqun Mei et.al. | 2403.09632 | null |
2024-03-14 | 3D-VLA: A 3D Vision-Language-Action Generative World Model | Haoyu Zhen et.al. | 2403.09631 | null |
2024-03-14 | Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding | Guo Chen et.al. | 2403.09626 | link |
2024-03-14 | Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation | Fangfu Liu et.al. | 2403.09625 | null |
2024-03-14 | Score-Guided Diffusion for 3D Human Recovery | Anastasis Stathopoulos et.al. | 2403.09623 | link |
2024-03-14 | Generative reconstruction of 3D volume elements for Ti-6Al-4V basketweave microstructure by optimization of CNN-based microstructural descriptors | Vincent Blümer et.al. | 2403.09609 | null |
2024-03-14 | pARam: Leveraging Parametric Design in Extended Reality to Support the Personalization of Artifacts for Personal Fabrication | Evgeny Stemasov et.al. | 2403.09607 | null |
2024-03-14 | The NeRFect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou et.al. | 2403.09577 | null |
2024-03-14 | Generalizing Denoising to Non-Equilibrium Structures Improves Equivariant Force Fields | Yi-Lun Liao et.al. | 2403.09549 | link |
2024-03-14 | VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding | Chris Kelly et.al. | 2403.09530 | null |
2024-03-14 | 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation | Frank Zhang et.al. | 2403.09439 | null |
2024-03-14 | Improving Real-Time Omnidirectional 3D Multi-Person Human Pose Estimation with People Matching and Unsupervised 2D-3D Lifting | Pawel Knap et.al. | 2403.09437 | null |
2024-03-14 | Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians | Licheng Zhong et.al. | 2403.09434 | null |
2024-03-14 | RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes | Thang-Anh-Quan Nguyen et.al. | 2403.09419 | null |
2024-03-14 | Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting | Jaewoo Jung et.al. | 2403.09413 | link |
2024-03-14 | OpenGraph: Open-Vocabulary Hierarchical 3D Graph Representation in Large-Scale Outdoor Environments | Yinan Deng et.al. | 2403.09412 | link |
2024-03-14 | LM2D: Lyrics- and Music-Driven Dance Synthesis | Wenjie Yin et.al. | 2403.09407 | null |
2024-03-14 | DF4LCZ: A SAM-Empowered Data Fusion Framework for Scene-Level Local Climate Zone Classification | Qianqian Wu et.al. | 2403.09367 | link |
2024-03-14 | Intelligent Reflecting Surfaces vs. Full-Duplex Relays: A Comparison in the Air | Qian Ding et.al. | 2403.09353 | null |
2024-03-14 | HeadEvolver: Text to Head Avatars via Locally Learnable Mesh Deformation | Duotun Wang et.al. | 2403.09326 | null |
2024-03-14 | SD-Net: Symmetric-Aware Keypoint Prediction and Domain Adaptation for 6D Pose Estimation In Bin-picking Scenarios | Ding-Tao Huang et.al. | 2403.09317 | link |
2024-03-14 | Hyper-3DG: Text-to-3D Gaussian Generation via Hypergraph | Donglin Di et.al. | 2403.09236 | link |
2024-03-14 | Improving Distant 3D Object Detection Using 2D Box Supervision | Zetong Yang et.al. | 2403.09230 | null |
2024-03-14 | PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest | Jiajun Deng et.al. | 2403.09212 | null |
2024-03-14 | A New Split Algorithm for 3D Gaussian Splatting | Qiyuan Feng et.al. | 2403.09143 | null |
2024-03-14 | Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior | Cheng Chen et.al. | 2403.09140 | null |
2024-03-14 | Analytical Heterogeneous Die-to-Die 3D Placement with Macros | Yuxuan Zhao et.al. | 2403.09070 | null |
2024-03-14 | Dyadic Interaction Modeling for Social Behavior Generation | Minh Tran et.al. | 2403.09069 | link |
2024-03-14 | Distribution and Depth-Aware Transformers for 3D Human Mesh Recovery | Jerrin Bright et.al. | 2403.09063 | null |
2024-03-14 | CLOAF: CoLlisiOn-Aware Human Flow | Andrey Davydov et.al. | 2403.09050 | null |
2024-03-13 | Model order reduction for transient coupled diffusion-deformation of hydrogels | Gopal Agarwal et.al. | 2403.08968 | null |
2024-03-15 | On the Intersection of Two Conics | Michela Mancini et.al. | 2403.08953 | null |
2024-03-13 | A Virtual Environment for Collaborative Inspection in Additive Manufacturing | Vuthea Chheang et.al. | 2403.08940 | null |
2024-03-13 | CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow | Chenbin Pan et.al. | 2403.08919 | null |
2024-03-13 | Envision3D: One Image to 3D with Anchor Views Interpolation | Yatian Pang et.al. | 2403.08902 | link |
2024-03-13 | SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net | Helin Cao et.al. | 2403.08885 | link |
2024-03-13 | FastMAC: Stochastic Spectral Sampling of Correspondence Graph | Yifei Zhang et.al. | 2403.08770 | link |
2024-03-13 | 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface | Linyi Jin et.al. | 2403.08768 | null |
2024-03-13 | MonoOcc: Digging into Monocular Semantic Occupancy Prediction | Yupeng Zheng et.al. | 2403.08766 | link |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764 | null |
2024-03-13 | MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Jialv Zou et.al. | 2403.08760 | link |
2024-03-13 | Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution | Samuel Sze et.al. | 2403.08748 | null |
2024-03-14 | GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Jing Wu et.al. | 2403.08733 | link |
2024-03-13 | Boundary geometry controls topological defect transitions that determine lumen nucleation in embryonic development | Pamela C. Guruciaga et.al. | 2403.08710 | null |
2024-03-13 | Refractive COLMAP: Refractive Structure-from-Motion Revisited | Mengkun She et.al. | 2403.08640 | null |
2024-03-13 | Scaling Up Dynamic Human-Scene Interaction Modeling | Nan Jiang et.al. | 2403.08629 | null |
2024-03-13 | Tangential Fixpoint Iterations for Gromov-Wasserstein Barycenters | Florian Beier et.al. | 2403.08612 | link |
2024-03-13 | 3D Spectrum Mapping and Reconstruction under Multi-Radiation Source Scenarios | Wang Jie et.al. | 2403.08513 | null |
2024-03-13 | UniLiDAR: Bridge the domain gap among different LiDARs for continual learning | Zikun Xu et.al. | 2403.08512 | null |
2024-03-15 | OccFiner: Offboard Occupancy Refinement with Hybrid Propagation | Hao Shi et.al. | 2403.08504 | link |
2024-03-13 | Gaussian Splatting in Style | Abhishek Saroha et.al. | 2403.08498 | null |
2024-03-13 | Stealthy and hyperuniform isotropic photonic bandgap structure in 3D | Lukas Siedentop et.al. | 2403.08404 | null |
2024-03-13 | Geometric and electronic properties of two kinds of CrO2 magnetic monolayers: D3d and D2h phases | Yang Zhang et.al. | 2403.08357 | null |
2024-03-13 | NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation | Ran Xu et.al. | 2403.08355 | null |
2024-03-13 | STMPL: Human Soft-Tissue Simulation | Anton Agafonov et.al. | 2403.08344 | null |
2024-03-13 | Positive Lynden-Bell derivative as a ticket to the bar trap? | Viktor D. Zozulia et.al. | 2403.08326 | null |
2024-03-13 | Sparse Bayesian Learning-Based Hierarchical Construction for 3D Radio Environment Maps Incorporating Channel Shadowing | Wang Jie et.al. | 2403.08323 | null |
2024-03-13 | DrFER: Learning Disentangled Representations for 3D Facial Expression Recognition | Hebeizi Li et.al. | 2403.08318 | null |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | link |
2024-03-13 | BiTT: Bi-directional Texture Reconstruction of Interacting Two Hands from a Single Image | Minje Kim et.al. | 2403.08262 | link |
2024-03-13 | PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style Mapping | Jiafu Chen et.al. | 2403.08252 | null |
2024-03-13 | The 3D Lyman- $α$ Forest Power Spectrum from eBOSS DR16 | Roger de Belsunce et.al. | 2403.08241 | null |
2024-03-13 | SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention | Feng Xiao et.al. | 2403.08182 | null |
2024-03-13 | Effects of wave damping and finite perpendicular scale on three-dimensional Alfvén wave parametric decay in low-beta plasmas | Feiyu Li et.al. | 2403.08179 | null |
2024-03-13 | MolBind: Multimodal Alignment of Language, Molecules, and Proteins | Teng Xiao et.al. | 2403.08167 | link |
2024-03-13 | Effective Underwater Glider Path Planning in Dynamic 3D Environments Using Multi-Point Potential Fields | Hanzhi Yang et.al. | 2403.08163 | null |
2024-03-13 | Iterative Learning for Joint Image Denoising and Motion Artifact Correction of 3D Brain MRI | Lintao Zhang et.al. | 2403.08162 | link |
2024-03-12 | Q-SLAM: Quadric Representations for Monocular SLAM | Chensheng Peng et.al. | 2403.08125 | null |
2024-03-17 | 6D Movable Antenna Based on User Distribution: Modeling and Optimization | Xiaodan Shao et.al. | 2403.08123 | null |
2024-03-14 | V-PRISM: Probabilistic Mapping of Unknown Tabletop Scenes | Herbert Wright et.al. | 2403.08106 | link |
2024-03-12 | Task and Motion Planning in Hierarchical 3D Scene Graphs | Aaron Ray et.al. | 2403.08094 | null |
2024-03-12 | FluoroSAM: A Language-aligned Foundation Model for X-ray Image Segmentation | Benjamin D. Killeen et.al. | 2403.08059 | link |
2024-03-12 | DrivAerNet: A Parametric Car Dataset for Data-Driven Aerodynamic Design and Graph-Based Drag Prediction | Mohamed Elrefaie et.al. | 2403.08055 | link |
2024-03-12 | CT evaluation of 2D and 3D holistic deep learning methods for the volumetric segmentation of airway lesions | Amel Imene Hadj Bouzid et.al. | 2403.08042 | null |
2024-03-12 | Giant radio galaxies in the LOFAR deep fields | M. Simonte et.al. | 2403.08037 | null |
2024-03-15 | MRC-Net: 6-DoF Pose Estimation with MultiScale Residual Correlation | Yuelong Li et.al. | 2403.08019 | link |
2024-03-12 | Unraveling the nature of quasi van der Waals Epitaxy of magnetic topological insulators Cr: (BixSb1-x)2Te3 on a GaAs (111) substrate through coherently strained interface | Yuxing Ren et.al. | 2403.07864 | null |
2024-03-12 | A twist-grain boundary (TGB) phase in aqueous solutions of the DNA tetramer GTAC | Gregory P. Smith et.al. | 2403.07844 | null |
2024-03-12 | StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting | Kunhao Liu et.al. | 2403.07807 | null |
2024-03-12 | RobotCycle: Assessing Cycling Safety in Urban Environments | Efimia Panagiotaki et.al. | 2403.07789 | null |
2024-03-12 | DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation | Chen Wang et.al. | 2403.07788 | null |
2024-03-15 | Generative deep learning-enabled ultra-large field-of-view lens-free imaging | Ronald B. Liu et.al. | 2403.07786 | link |
2024-03-17 | SemCity: Semantic Scene Generation with Triplane Diffusion | Jumin Lee et.al. | 2403.07773 | link |
2024-03-12 | Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception | Philipp Wolters et.al. | 2403.07746 | link |
2024-03-15 | Fast and Simple Explainability for Point Cloud Networks | Meir Yossef Levi et.al. | 2403.07706 | null |
2024-03-12 | Efficient Global Navigational Planning in 3D Structures based on Point Cloud Tomography | Bowen Yang et.al. | 2403.07631 | link |
2024-03-13 | MinkUNeXt: Point Cloud-based Large-scale Place Recognition using 3D Sparse Convolutions | J. J. Cabrera et.al. | 2403.07593 | link |
2024-03-12 | Molecularity: a fast and efficient criterion for probing superconductivity | Matías E. di Mauro et.al. | 2403.07584 | null |
2024-03-14 | Unleashing Network Potentials for Semantic Scene Completion | Fengyun Wang et.al. | 2403.07560 | link |
2024-03-12 | SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields | Jungho Lee et.al. | 2403.07547 | link |
2024-03-12 | LaB-GATr: geometric algebra transformers for large biomedical surface and volume meshes | Julian Suk et.al. | 2403.07536 | link |
2024-03-12 | SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM | Siting Zhu et.al. | 2403.07494 | link |
2024-03-12 | Study of parameters affecting the cooling capacity of liquid jets by using OpenFoam as tool to solve the inverse heat transfer problem | Kaissar Nabbout et.al. | 2403.07490 | null |
2024-03-12 | A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes | Ting Yu et.al. | 2403.07469 | null |
2024-03-12 | Novel Signatures of Radiation Reaction in Electron-Laser Sidescattering | Philipp Sikorski et.al. | 2403.07455 | null |
2024-03-12 | Cuprate-like Electronic Structures in Infinite-Layer Nickelates with 3D dispersion | X. Ding et.al. | 2403.07448 | null |
2024-03-12 | Towards adiabatic-connection interpolation model with broader applicability | Lucian A. Constantin et.al. | 2403.07391 | null |
2024-03-12 | NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning | Bingqian Lin et.al. | 2403.07376 | link |
2024-03-12 | Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection | Jiahui Fu et.al. | 2403.07372 | null |
2024-03-13 | FSC: Few-point Shape Completion | Xianzu Wu et.al. | 2403.07359 | link |
2024-03-12 | Complementing Event Streams and RGB Frames for Hand Mesh Reconstruction | Jianping Jiang et.al. | 2403.07346 | null |
2024-03-12 | Electronic Structure of Superconducting Infinite-Layer Lanthanum Nickelates | Wenjie Sun et.al. | 2403.07344 | null |
2024-03-12 | Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention | Jinhong Wang et.al. | 2403.07332 | link |
2024-03-12 | Customizable Avatars with Dynamic Facial Action Coded Expressions (CADyFACE) for Improved User Engagement | Megan A. Witherow et.al. | 2403.07314 | null |
2024-03-12 | Stability and Sharp Decay Estimates for 3D MHD Equations with Only Vertical Dissipation Near a Background Magnetic Field | Suhua Lai et.al. | 2403.07293 | null |
2024-03-12 | SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection | Hongcheng Zhang et.al. | 2403.07284 | null |
2024-03-12 | Sharp one-point estimates and Minkowski content for the scaling limit of three-dimensional loop-erased random walk | Sarai Hernandez-Torres et.al. | 2403.07256 | null |
2024-03-12 | GuideGen: A Text-guided Framework for Joint CT Volume and Anatomical structure Generation | Linrui Dai et.al. | 2403.07247 | link |
2024-03-12 | 3D Uncertain Distance Field Mapping using GMM and GP | Qianqian Zou et.al. | 2403.07223 | null |
2024-03-12 | Monocular Microscope to CT Registration using Pose Estimation of the Incus for Augmented Reality Cochlear Implant Surgery | Yike Zhang et.al. | 2403.07219 | null |
2024-03-11 | Simulation-Based Segmentation of Blood Vessels in Cerebral 3D OCTA Images | Bastian Wittmann et.al. | 2403.07116 | null |
2024-03-11 | A slice classification neural network for automated classification of axial PET/CT slices from a multi-centric lymphoma dataset | Shadab Ahamed et.al. | 2403.07105 | null |
2024-03-11 | A cascaded deep network for automated tumor detection and segmentation in clinical PET imaging of diffuse large B-cell lymphoma | Shadab Ahamed et.al. | 2403.07092 | null |
2024-03-11 | Holography and Regge Phases with $U(1)$ Charge | Giulia Fardelli et.al. | 2403.07079 | null |
2024-03-11 | LISO: Lidar-only Self-Supervised 3D Object Detection | Stefan Baur et.al. | 2403.07071 | link |
2024-03-12 | VideoMamba: State Space Model for Efficient Video Understanding | Kunchang Li et.al. | 2403.06977 | link |
2024-03-11 | Memory-based Adapters for Online 3D Scene Perception | Xiuwei Xu et.al. | 2403.06974 | null |
2024-03-11 | Bayesian Diffusion Models for 3D Shape Reconstruction | Haiyang Xu et.al. | 2403.06973 | null |
2024-03-11 | 3D simulations of TRAPPIST-1e with varying CO2, CH4 and haze profiles | Mei Ting Mak et.al. | 2403.06928 | null |
2024-03-13 | DNGaussian: Optimizing Sparse-View 3D Gaussian Radiance Fields with Global-Local Depth Normalization | Jiahe Li et.al. | 2403.06912 | link |
2024-03-11 | FreGS: 3D Gaussian Splatting with Progressive Frequency Regularization | Jiahui Zhang et.al. | 2403.06908 | null |
2024-03-11 | LIBR+: Improving Intraoperative Liver Registration by Learning the Residual of Biomechanics-Based Deformable Registration | Dingrong Wang et.al. | 2403.06901 | null |
2024-03-11 | Data Cubes in Hand: A Design Space of Tangible Cubes for Visualizing 3D Spatio-Temporal Data in Mixed Reality | Shuqi He et.al. | 2403.06891 | null |
2024-03-11 | Numerical simulation of individual coil placement – A proof-of-concept study for the prediction of recurrence after aneurysm coiling | Julian Schwarting et.al. | 2403.06889 | null |
2024-03-13 | Process signature-driven high spatio-temporal resolution alignment of multimodal data | Abhishek Hanchate et.al. | 2403.06888 | null |
2024-03-11 | Surface-aware Mesh Texture Synthesis with Pre-trained 2D CNNs | Áron Samuel Kovács et.al. | 2403.06855 | null |
2024-03-11 | DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Guosheng Zhao et.al. | 2403.06845 | null |
2024-03-11 | ExoCubed: A Riemann-Solver based Cubed-Sphere Dynamic Core for Planetary Atmospheres | Sihe Chen et.al. | 2403.06844 | link |
2024-03-15 | Inverse Garment and Pattern Modeling with a Differentiable Simulator | Boyang Yu et.al. | 2403.06841 | null |
2024-03-11 | CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging | Ibrahim Ethem Hamamci et.al. | 2403.06801 | link |
2024-03-11 | Jeffery Orbits with Noise Revisited | Julian Talbot et.al. | 2403.06795 | null |
2024-03-11 | V3D: Video Diffusion Models are Effective 3D Generators | Zilong Chen et.al. | 2403.06738 | link |
2024-03-11 | Propagation of Solar Energetic Particles in 3D MHD Simulations of the Solar Wind | Houeibib Ahmed et.al. | 2403.06706 | null |
2024-03-11 | Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization | Jinlu Zhang et.al. | 2403.06702 | link |
2024-03-11 | PCLD: Point Cloud Layerwise Diffusion for Adversarial Purification | Mert Gulsen et.al. | 2403.06698 | link |
2024-03-11 | epsilon-Mesh Attack: A Surface-based Adversarial Point Cloud Attack for Facial Expression Recognition | Batuhan Cengiz et.al. | 2403.06661 | link |
2024-03-11 | Towards Zero-Shot Interpretable Human Recognition: A 2D-3D Registration Framework | Henrique Jesus et.al. | 2403.06658 | null |
2024-03-18 | Ricci flow-based brain surface covariance descriptors for diagnosing Alzheimer’s disease | Fatemeh Ahmadi et.al. | 2403.06645 | null |
2024-03-11 | Visualizing, Analyzing and Constructing L-System from Arborized 3D Model Using a Web Application | Nick van Nielen et.al. | 2403.06638 | null |
2024-03-11 | Aggregated distribution grid flexibilities in subtransmission grid operational management | Neelotpal Majumdar et.al. | 2403.06635 | null |
2024-03-11 | Feasibility study on solving the Helmholtz equation in 3D with PINNs | Stefan Schoder et.al. | 2403.06623 | null |
2024-03-11 | Cross-domain and Cross-dimension Learning for Image-to-Graph Transformers | Alexander H. Berger et.al. | 2403.06601 | link |
2024-03-11 | BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues | Fudong Ge et.al. | 2403.06600 | link |
2024-03-11 | Inhomogeneous probes for BCDI: Toward the imaging of dynamic and distorted crystals | I. Calvo-Almazán et.al. | 2403.06598 | null |
2024-03-12 | Lander.AI: Adaptive Landing Behavior Agent for Expertise in 3D Dynamic Platform Landings | Robinroy Peter et.al. | 2403.06572 | link |
2024-03-11 | Detection of Object Throwing Behavior in Surveillance Videos | Ivo P. C. Kersten et.al. | 2403.06552 | null |
2024-03-11 | 3DRef: 3D Dataset and Benchmark for Reflection Detection in RGB and Lidar Data | Xiting Zhao et.al. | 2403.06538 | null |
2024-03-16 | Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis | Zijian Chen et.al. | 2403.06529 | null |
2024-03-11 | 3D Semantic Segmentation-Driven Representations for 3D Object Detection | Hayeon O et.al. | 2403.06501 | link |
2024-03-11 | 3D-aware Image Generation and Editing with Multi-modal Conditions | Bo Li et.al. | 2403.06470 | null |
2024-03-15 | Reliable Spatial-Temporal Voxels For Multi-Modal Test-Time Adaptation | Haozhi Cao et.al. | 2403.06461 | link |
2024-03-11 | RIS-Enabled Joint Near-Field 3D Localization and Synchronization in SISO Multipath Environments | Han Yan et.al. | 2403.06460 | null |
2024-03-11 | Ensemble Quadratic Assignment Network for Graph Matching | Haoru Tan et.al. | 2403.06457 | null |
2024-03-11 | Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object Detection | Konyul Park et.al. | 2403.06433 | link |
2024-03-11 | PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models | Qingdong He et.al. | 2403.06403 | null |
2024-03-11 | A Segmentation Foundation Model for Diverse-type Tumors | Jianhao Xie et.al. | 2403.06396 | null |
2024-03-13 | FSViewFusion: Few-Shots View Generation of Novel Objects | Rukhshanda Hussain et.al. | 2403.06394 | null |
2024-03-10 | Separable Physics-informed Neural Networks for Solving the BGK Model of the Boltzmann Equation | Jaemin Oh et.al. | 2403.06342 | link |
2024-03-10 | RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation | Mathieu Labbé et.al. | 2403.06341 | null |
2024-03-10 | An End-to-End Deep Learning Generative Framework for Refinable Shape Matching and Generation | Soodeh Kalaie et.al. | 2403.06317 | null |
2024-03-10 | Hybrid-order topology with tunable chiral hinge modes and unpinned Dirac surface states in the altermagnetic insulator Eu ${3}$In${2}$As$_{4}$ | Yufei Zhao et.al. | 2403.06304 | null |
2024-03-10 | BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering | Xinmin Qiu et.al. | 2403.06243 | null |
2024-03-12 | COVID-19 Computer-aided Diagnosis through AI-assisted CT Imaging Analysis: Deploying a Medical AI System | Demetris Gerogiannis et.al. | 2403.06242 | null |
2024-03-13 | S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes | Xingyi Li et.al. | 2403.06205 | null |
2024-03-10 | HINORA, a method for detecting ring-like structures in 3D point distributions I: application to the Local Volume Galaxy catalogue | Edward Olex et.al. | 2403.06187 | null |
2024-03-10 | Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving | Zhili Chen et.al. | 2403.06166 | null |
2024-03-10 | Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation | Paweł A. Pierzchlewicz et.al. | 2403.06164 | link |
2024-03-10 | Bayesian Random Semantic Data Augmentation for Medical Image Classification | Yaoyao Zhu et.al. | 2403.06138 | link |
2024-03-10 | PSS-BA: LiDAR Bundle Adjustment with Progressive Spatial Smoothing | Jianping Li et.al. | 2403.06124 | null |
2024-03-10 | Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors | Haoxuanye Ji et.al. | 2403.06093 | link |
2024-03-09 | Multiphysics Modeling of Surface Diffusion Coupled with Large Deformation in 3D Solids | Jaemin Kim et.al. | 2403.06005 | null |
2024-03-09 | Electromagnetic Hybrid Beamforming for Holographic Communications | Ran Ji et.al. | 2403.05970 | null |
2024-03-09 | Classifying Objects in 3D Point Clouds Using Recurrent Neural Network: A GRU LSTM Hybrid Approach | Ramin Mousa et.al. | 2403.05950 | link |
2024-03-09 | Learned 3D volumetric recovery of clouds and its uncertainty for climate analysis | Roi Ronen et.al. | 2403.05932 | null |
2024-03-09 | Global solutions for stochastically controlled fluid dynamics models | Dan Crisan et.al. | 2403.05923 | null |
2024-03-09 | Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation | Hairong Shi et.al. | 2403.05912 | link |
2024-03-09 | DO3D: Self-supervised Learning of Decomposed Object-aware 3D Motion and Depth from Monocular Videos | Xiuzhe Wu et.al. | 2403.05895 | null |
2024-03-09 | SPAFormer: Sequential 3D Part Assembly with Transformers | Boshen Xu et.al. | 2403.05874 | link |
2024-03-09 | MirrorAttack: Backdoor Attack on 3D Point Cloud with a Distorting Mirror | Yuhao Bian et.al. | 2403.05847 | null |
2024-03-09 | LEO- and RIS-Empowered User Tracking: A Riemannian Manifold Approach | Pinjun Zheng et.al. | 2403.05838 | null |
2024-03-09 | SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection | Gang Zhang et.al. | 2403.05817 | link |
2024-03-09 | Large Generative Model Assisted 3D Semantic Communication | Feibo Jiang et.al. | 2403.05783 | null |
2024-03-09 | Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning | Bingqian Lin et.al. | 2403.05770 | null |
2024-03-09 | UDCR: Unsupervised Aortic DSA/CTA Rigid Registration Using Deep Reinforcement Learning and Overlap Degree Calculation | Wentao Liu et.al. | 2403.05753 | null |
2024-03-08 | Spatial-aware Transformer-GRU Framework for Enhanced Glaucoma Diagnosis from 3D OCT Imaging | Mona Ashtari-Majlan et.al. | 2403.05702 | link |
2024-03-08 | Helium in the Extended Atmosphere of the Warm Super-Puff TOI-1420b | Shreyas Vissapragada et.al. | 2403.05614 | null |
2024-03-11 | Energy function for grain boundary plane orientation fundamental zone | Wei Wan et.al. | 2403.05474 | null |
2024-03-08 | Grasping Trajectory Optimization with Point Clouds | Yu Xiang et.al. | 2403.05466 | null |
2024-03-08 | 3d-oxide molecules to tailor large magnetic anisotropy energies on MgO films | Sufyan Shehada et.al. | 2403.05432 | null |
2024-03-08 | Exponential asymptotics and Stokes surfaces in nonlinear three-dimensional flows | John A. Fitzgerald et.al. | 2403.05420 | null |
2024-03-08 | DualBEV: CNN is All You Need in View Transformation | Peidong Li et.al. | 2403.05402 | link |
2024-03-08 | A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN | Cristiana Tiago et.al. | 2403.05384 | null |
2024-03-08 | OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2403.05329 | null |
2024-03-08 | Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds | Tianrui Lou et.al. | 2403.05247 | link |
2024-03-08 | 3D Face Reconstruction Using A Spectral-Based Graph Convolution Encoder | Haoxin Xu et.al. | 2403.05218 | link |
2024-03-08 | LVIC: Multi-modality segmentation by Lifting Visual Info as Cue | Zichao Dong et.al. | 2403.05159 | null |
2024-03-08 | LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves | Jiayan Cao et.al. | 2403.05155 | null |
2024-03-08 | GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting | Francesco Palandra et.al. | 2403.05154 | null |
2024-03-08 | Motion-Guided Dual-Camera Tracker for Low-Cost Skill Evaluation of Gastric Endoscopy | Yuelin Zhang et.al. | 2403.05146 | link |
2024-03-08 | Med3DInsight: Enhancing 3D Medical Image Understanding with 2D Multi-Modal Large Language Models | Qiuhui Chen et.al. | 2403.05141 | null |
2024-03-08 | Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning | Hang Du et.al. | 2403.05117 | link |
2024-03-08 | RLPeri: Accelerating Visual Perimetry Test with Reinforcement Learning and Convolutional Feature Extraction | Tanvi Verma et.al. | 2403.05112 | null |
2024-03-08 | Enhancing Texture Generation with High-Fidelity Using Advanced Texture Priors | Kuo Xu et.al. | 2403.05102 | null |
2024-03-08 | Two novel fully decoupled schemes for the two-phase MHD system with exactly divergence-free magnetic field | Kaiwen Shi et.al. | 2403.05095 | null |
2024-03-08 | SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting | Zhijing Shao et.al. | 2403.05087 | link |
2024-03-08 | RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from LiDAR Features | Geonho Bang et.al. | 2403.05061 | link |
2024-03-08 | MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction | Yitao Zhu et.al. | 2403.05055 | link |
2024-03-08 | EgoPAT3Dv2: Predicting 3D Action Target from 2D Egocentric Vision for Human-Robot Interaction | Irving Fang et.al. | 2403.05046 | null |
2024-03-08 | CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model | Zhengyi Wang et.al. | 2403.05034 | null |
2024-03-08 | ERASOR++: Height Coding Plus Egocentric Ratio Based Dynamic Object Removal for Static Point Cloud Mapping | Jiabao Zhang et.al. | 2403.05019 | null |
2024-03-08 | DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction | Jaehyeok Shim et.al. | 2403.05005 | link |
2024-03-08 | Paving the Way for Pass Disturb Free Vertical NAND Storage via A Dedicated and String-Compatible Pass Gate | Zijian Zhao et.al. | 2403.04981 | null |
2024-03-08 | ActFormer: Scalable Collaborative Perception via Active Queries | Suozhi Huang et.al. | 2403.04968 | null |
2024-03-08 | The Local Bubble is a Local Chimney: A New Model from 3D Dust Mapping | Theo J. O’Neill et.al. | 2403.04961 | null |
2024-03-11 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
2024-03-07 | BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng et.al. | 2403.04926 | link |
2024-03-07 | Secure Information Embedding and Extraction in Forensic 3D Fingerprinting | Canran Wang et.al. | 2403.04918 | null |
2024-03-06 | Investigation of the Impact of Synthetic Training Data in the Industrial Application of Terminal Strip Object Detection | Nico Baumgart et.al. | 2403.04809 | null |
2024-03-11 | Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed | Yifan Wang et.al. | 2403.04765 | null |
2024-03-07 | DeepSee: Multidimensional Visualizations of Seabed Ecosystems | Adam Coscia et.al. | 2403.04761 | link |
2024-03-07 | That’s My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation | Georgi Pramatarov et.al. | 2403.04755 | null |
2024-03-10 | Unbiased Estimator for Distorted Conics in Camera Calibration | Chaehyeon Song et.al. | 2403.04583 | link |
2024-03-07 | Thermal structure of circumbinary discs: Circumbinary planets should be icy not rocky | Arnaud Pierens et.al. | 2403.04535 | null |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508 | null |
2024-03-07 | THz-assisted microscopy of silica matrix for biological materials encapsulation: a theoretical and experimental study | Matteo De Tullio et.al. | 2403.04470 | null |
2024-03-07 | Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser | Qingyuan Cai et.al. | 2403.04444 | link |
2024-03-07 | Direct visualization of domain wall pinning in sub-100nm 3D magnetic nanowires with cross-sectional curvature | Joseph Askey et.al. | 2403.04411 | null |
2024-03-09 | Single-to-Dual-View Adaptation for Egocentric 3D Hand Pose Estimation | Ruicong Liu et.al. | 2403.04381 | link |
2024-03-07 | Video-Driven Animation of Neural Head Avatars | Wolfgang Paier et.al. | 2403.04380 | null |
2024-03-07 | Control-Barrier-Aided Teleoperation with Visual-Inertial SLAM for Safe MAV Navigation in Complex Environments | Siqi Zhou et.al. | 2403.04331 | null |
2024-03-07 | 3DTextureTransformer: Geometry Aware Texture Generation for Arbitrary Mesh Topology | Dharma KC et.al. | 2403.04225 | null |
2024-03-07 | Partial tidal disruption events: The elixir of life | Megha Sharma et.al. | 2403.04211 | null |
2024-03-07 | CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images | Guanlin Shen et.al. | 2403.04198 | link |
2024-03-11 | Sliding into DM: Determining the local dark matter density and speed distribution using only the local circular speed of the Galaxy | Patrick G. Staudt et.al. | 2403.04122 | null |
2024-03-07 | Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis | Yuanhao Cai et.al. | 2403.04116 | link |
2024-03-08 | DNAct: Diffusion Guided Multi-Task 3D Policy Learning | Ge Yan et.al. | 2403.04115 | null |
2024-03-07 | Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs | Nikhil Mishra et.al. | 2403.04114 | link |
2024-03-06 | Multi-Object Tracking with Camera-LiDAR Fusion for Autonomous Driving | Riccardo Pieroni et.al. | 2403.04112 | null |
2024-03-06 | On the importance of separators as sites of 3D magnetic reconnection | Clare E Parnell et.al. | 2403.04076 | null |
2024-03-06 | AGN feedback in the Local Universe: multiphase outflow of the Seyfert galaxy NGC 5506 | Federico Esposito et.al. | 2403.03981 | null |
2024-03-06 | MTC $[M_3, G]$ : 3d Topological Order Labeled by Seifert Manifolds | Federico Bonetti et.al. | 2403.03973 | null |
2024-03-06 | $\mathcal{N}=5$ SCFTs and quaternionic reflection groups | Anirudh Deb et.al. | 2403.03971 | null |
2024-03-06 | Understanding Stabilizer Codes Under Local Decoherence Through a General Statistical Mechanics Mapping | Anasuya Lyons et.al. | 2403.03955 | null |
2024-03-06 | 3D Diffusion Policy | Yanjie Ze et.al. | 2403.03954 | link |
2024-03-06 | Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning | Zifan Xu et.al. | 2403.03848 | null |
2024-03-06 | An assessment of $\mathbfΥ$-states above $\mathbf{B\bar B}$ -threshold using a constituent-quark-model based meson-meson coupled-channels framework | P. G. Ortega et.al. | 2403.03770 | null |
2024-03-06 | Impact of theoretical uncertainties on model parameter reconstruction from GW signals sourced by cosmological phase transitions | Marek Lewicki et.al. | 2403.03769 | null |
2024-03-06 | Nonlinear Landau fan diagram and aperiodic magnetic oscillations in three-dimensional systems | Sunit Das et.al. | 2403.03765 | null |
2024-03-06 | Learning 3D object-centric representation through prediction | John Day et.al. | 2403.03730 | null |
2024-03-07 | CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection | Gyusam Chang et.al. | 2403.03721 | null |
2024-03-06 | 3D Object Visibility Prediction in Autonomous Driving | Chuanyu Luo et.al. | 2403.03681 | null |
2024-03-06 | Finite elements for Matérn-type random fields: Uncertainty in computational mechanics and design optimization | Tobias Duswald et.al. | 2403.03658 | null |
2024-03-06 | 3D-Printed Dielectric Image Lines towards Chip-to-Chip Interconnects for subTHz-Applications | Leonhard Hahn et.al. | 2403.03657 | null |
2024-03-06 | 3D Printed Waveguide for Augmented Reality | Dechuan Sun et.al. | 2403.03652 | null |
2024-03-06 | Online Photon Guiding with 3D Gaussians for Caustics Rendering | Jiawei Huang et.al. | 2403.03641 | null |
2024-03-06 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Zi-Ting Chou et.al. | 2403.03608 | null |
2024-03-08 | DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training | Zhongkai Hao et.al. | 2403.03542 | link |
2024-03-06 | Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension | Quan Liu et.al. | 2403.03532 | link |
2024-03-07 | Tracing Dirac points of topological surface states by ferromagnetic resonance | Laura Pietanesi et.al. | 2403.03518 | null |
2024-03-06 | METAMAT 01: A semi-analytic Solution for Benchmarking Wave Propagation Simulations of homogeneous Absorbers in 1D/3D and 2D | Stefan Schoder et.al. | 2403.03510 | null |
2024-03-06 | Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator | Wonhyeok Choi et.al. | 2403.03468 | null |
2024-03-06 | Line defect half-indices of $SU(N)$ Chern-Simons theories | Tadashi Okazaki et.al. | 2403.03439 | null |
2024-03-06 | Sculpting Molecules in 3D: A Flexible Substructure Aware Framework for Text-Oriented Molecular Optimization | Kaiwei Zhang et.al. | 2403.03425 | null |
2024-03-07 | Scene Depth Estimation from Traditional Oriental Landscape Paintings | Sungho Kang et.al. | 2403.03408 | null |
2024-03-05 | Quasiparticle effects in magnetic-field-resilient 3D transmons | J. Krause et.al. | 2403.03351 | null |
2024-03-04 | Machine and deep learning methods for predicting 3D genome organization | Brydon P. G. Wall et.al. | 2403.03231 | null |
2024-03-05 | Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion | Meng Zheng et.al. | 2403.03217 | null |
2024-03-05 | Quantum 2D Liouville Path-Integral Is a Sum over Geometries in AdS $_3$ Einstein Gravity | Lin Chen et.al. | 2403.03179 | null |
2024-03-05 | 3D- $N_{\rm H}$ -tool | Victor Doroshenko et.al. | 2403.03127 | null |
2024-03-05 | Characterizing the 3D Structure of Molecular Cloud Envelopes in the “Cloud Factory” Simulations | Elijah Mullens et.al. | 2403.03112 | null |
2024-03-11 | MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding | Chun-Peng Chang et.al. | 2403.03077 | link |
2024-03-05 | Prediction of turbulent channel flow using Fourier neural operator-based machine-learning strategy | Yunpeng Wang et.al. | 2403.03051 | null |
2024-03-05 | Superconductivity in Ca-intercalated bilayer silicene | Jisvin Sam et.al. | 2403.03036 | null |
2024-03-05 | ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving | Han Lu et.al. | 2403.02877 | null |
2024-03-05 | A pH Sensor Scaffold for Mapping Spatiotemporal Gradients in Three Dimensional In Vitro Tumour Models | Riccardo Rizzo et.al. | 2403.02838 | null |
2024-03-05 | Are Dense Labels Always Necessary for 3D Object Detection from Point Cloud? | Chenqiang Gao et.al. | 2403.02818 | null |
2024-03-05 | Depth resolution in piezoresponse force microscopy | Matthias Roeper et.al. | 2403.02797 | null |
2024-03-05 | HUNTER: Unsupervised Human-centric 3D Detection via Transferring Knowledge from Synthetic Instances to Real Scenes | Yichen Yao et.al. | 2403.02769 | null |
2024-03-05 | Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps | Timothy Chen et.al. | 2403.02751 | link |
2024-03-05 | FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View | Jiawei Hou et.al. | 2403.02710 | null |
2024-03-05 | Enhanced DareFightingICE Competitions: Sound Design and AI Competitions | Ibrahim Khan et.al. | 2403.02687 | null |
2024-03-05 | UFO: Uncertainty-aware LiDAR-image Fusion for Off-road Semantic Terrain Map Estimation | Ohn Kim et.al. | 2403.02642 | null |
2024-03-06 | HoloVIC: Large-scale Dataset and Benchmark for Multi-Sensor Holographic Intersection and Vehicle-Infrastructure Cooperative | Cong Ma et.al. | 2403.02640 | null |
2024-03-07 | False Positive Sampling-based Data Augmentation for Enhanced 3D Object Detection Accuracy | Jiyong Oh et.al. | 2403.02639 | null |
2024-03-05 | Eight-Partitioning Points in 3D, and Efficiently Too | Boris Aronov et.al. | 2403.02627 | null |
2024-03-05 | Polarization-Encoded Lenticular Nano-Printing with Single-Layer Metasurfaces | Lin Deng et.al. | 2403.02620 | null |
2024-03-14 | Pooling Image Datasets With Multiple Covariate Shift and Imbalance | Sotirios Panagiotis Chytas et.al. | 2403.02598 | null |
2024-03-05 | Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning | Zhaoxin Fan et.al. | 2403.02566 | link |
2024-03-05 | Semantic Human Mesh Reconstruction with Textures | Xiaoyu Zhan et.al. | 2403.02561 | link |
2024-03-12 | A dataset of over one thousand computed tomography scans of battery cells | Amariah Condon et.al. | 2403.02527 | null |
2024-03-04 | Exploring Standing and Reflected Slow-mode Waves in Flaring Coronal Loops: A Parametric Study Using 2.5D MHD Modeling | Tongjiang Wang et.al. | 2403.02464 | null |
2024-03-04 | Kazhdan-Lusztig Correspondence for Vertex Operator Superalgebras from Abelian Gauge Theories | Thomas Creutzig et.al. | 2403.02403 | null |
2024-03-04 | Exploring the Jet Formation in binary systems applying 3D MHD simulations | Somayeh Sheikhnezami et.al. | 2403.02390 | null |
2024-03-05 | Fractional Spins, Unfolding, and Holography: II. 4D Higher Spin Gravity and 3D Conformal Dual | Felipe Diaz et.al. | 2403.02301 | null |
2024-03-05 | Fractional Spins, Unfolding, and Holography: I. Parent field equations for dual higher-spin gravity reductions | Felipe Diaz et.al. | 2403.02283 | null |
2024-03-04 | Tightly-Coupled LiDAR-Visual-Inertial SLAM and Large-Scale Volumetric Occupancy Mapping | Simon Boche et.al. | 2403.02280 | null |
2024-03-04 | Latitude-dependent Atmospheric Waves and Long-period Modulations in Luhman 16 B from the Longest Lightcurve of an Extrasolar World | Nguyen Fuda et.al. | 2403.02260 | null |
2024-03-04 | Direct Imaging of MHD Wave Mode Conversion Near a 3D Null Point on the Sun | Pankaj Kumar et.al. | 2403.02250 | null |
2024-03-04 | 3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors | Fangzhou Hong et.al. | 2403.02234 | link |
2024-03-04 | DragTex: Generative Point-Based Texture Editing on 3D Mesh | Yudi Zhang et.al. | 2403.02217 | null |
2024-03-04 | Highly unusual, doubly-strongly-correlated, altermagnetic, 3D analogue of parent compounds of high-Tc cuprates | Harald O. Jeschke et.al. | 2403.02201 | null |
2024-03-04 | Classical dynamical $r$ -matrices for the Chern-Simons formulation of generalised 3d gravity | Juan Carlos Morales Parra et.al. | 2403.02184 | null |
2024-03-04 | Predicting large scale cosmological structure evolution with GAN-based autoencoders | Marion Ullmo et.al. | 2403.02171 | null |
2024-03-04 | TripoSR: Fast 3D Object Reconstruction from a Single Image | Dmitry Tochilkin et.al. | 2403.02151 | link |
2024-03-04 | Point2Building: Reconstructing Buildings from Airborne LiDAR Point Clouds | Yujia Liu et.al. | 2403.02136 | null |
2024-03-04 | Nucleosynthesis in the Innermost Ejecta of Magnetorotational Supernova Explosions in 3-dimensions | Shuai Zha et.al. | 2403.02072 | null |
2024-03-04 | Iterative Occlusion-Aware Light Field Depth Estimation using 4D Geometrical Cues | Rui Lourenço et.al. | 2403.02043 | null |
2024-03-04 | Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous Driving | Yuxuan Liu et.al. | 2403.02037 | link |
2024-03-04 | Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction | Noah Maul et.al. | 2403.01993 | null |
2024-03-04 | Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection | Shitao Chen et.al. | 2403.01978 | link |
2024-03-12 | Tree Counting by Bridging 3D Point Clouds with Imagery | Lei Li et.al. | 2403.01932 | null |
2024-03-04 | Grain growth and its chemical impact in the first hydrostatic core phase | D. Navarro-Almaida et.al. | 2403.01905 | null |
2024-03-04 | Map-aided annotation for pole base detection | Benjamin Missaoui et.al. | 2403.01868 | null |
2024-03-04 | AiSDF: Structure-aware Neural Signed Distance Fields in Indoor Scenes | Jaehoon Jang et.al. | 2403.01861 | null |
2024-03-04 | A Simple Baseline for Efficient Hand Mesh Reconstruction | Zhishan Zhou et.al. | 2403.01813 | null |
2024-03-04 | ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models | Lukas Höllein et.al. | 2403.01807 | link |
2024-03-04 | SAQIEL: Ultra-Light and Safe Manipulator with Passive 3D Wire Alignment Mechanism | Temma Suzuki et.al. | 2403.01803 | null |
2024-03-04 | Integrating Efficient Optimal Transport and Functional Maps For Unsupervised Shape Correspondence Learning | Tung Le et.al. | 2403.01781 | null |
2024-03-04 | DEMOS: Dynamic Environment Motion Synthesis in 3D Scenes via Local Spherical-BEV Perception | Jingyu Gong et.al. | 2403.01740 | null |
2024-03-04 | 3D Hand Reconstruction via Aggregating Intra and Inter Graphs Guided by Prior Knowledge for Hand-Object Interaction Scenario | Feng Shuang et.al. | 2403.01733 | null |
2024-03-04 | HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances | Supreeth Narasimhaswamy et.al. | 2403.01693 | null |
2024-03-04 | DD-VNB: A Depth-based Dual-Loop Framework for Real-time Visually Navigated Bronchoscopy | Qingyao Tian et.al. | 2403.01683 | null |
2024-03-13 | OccFusion: A Straightforward and Effective Multi-Sensor Fusion Framework for 3D Occupancy Prediction | Zhenxing Ming et.al. | 2403.01644 | link |
2024-03-03 | Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation | Tianyu Luan et.al. | 2403.01619 | null |
2024-03-03 | Respiratory motion forecasting with online learning of recurrent neural networks for safety enhancement in externally guided radiotherapy | Michel Pohl et.al. | 2403.01607 | null |
2024-03-05 | Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey | Qizhi Pei et.al. | 2403.01528 | link |
2024-03-03 | MatchU: Matching Unseen Objects for 6D Pose Estimation from RGB-D Images | Junwen Huang et.al. | 2403.01517 | null |
2024-03-03 | Spectral Operator Representations | Austin Zadoks et.al. | 2403.01514 | link |
2024-03-03 | An RBF partition of unity method for geometry reconstruction and PDE solution in thin structures | Elisabeth Larsson et.al. | 2403.01486 | null |
2024-03-03 | Integrable geodesic flow in 3D and webs of maximal rank | Sergey I. Agafonov et.al. | 2403.01459 | null |
2024-03-05 | 3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos | Jiakai Sun et.al. | 2403.01444 | link |
2024-03-03 | On Diffusion Process in SE(3)-invariant Space | Zihan Zhou et.al. | 2403.01430 | null |
2024-03-03 | Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes | Yujie Lu et.al. | 2403.01414 | link |
2024-03-03 | A Novel Dynamic Light-Section 3D Reconstruction Method for Wide-Range Sensing | Mengjuan Chen et.al. | 2403.01374 | null |
2024-03-02 | TUMTraf V2X Cooperative Perception Dataset | Walter Zimmer et.al. | 2403.01316 | link |
2024-03-02 | SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code | Ziniu Hu et.al. | 2403.01248 | null |
2024-03-02 | Dual Graph Attention based Disentanglement Multiple Instance Learning for Brain Age Estimation | Fanzhe Yan et.al. | 2403.01246 | null |
2024-03-02 | A Cost-Effective Cooperative Exploration and Inspection Strategy for Heterogeneous Aerial System | Xinhang Xu et.al. | 2403.01225 | link |
2024-03-07 | Mirror real Chern insulator in two and three dimensions | Yang Wang et.al. | 2403.01145 | null |
2024-03-02 | The evolution of the phase space structure along pitchfork and period-doubling bifurcations in a 3D galactic bar potential | Henok Tenaw Moges et.al. | 2403.01140 | null |
2024-03-02 | Neural radiance fields-based holography [Invited] | Minsung Kang et.al. | 2403.01137 | null |
2024-03-02 | Dynamic 3D Point Cloud Sequences as 2D Videos | Yiming Zeng et.al. | 2403.01129 | link |
2024-03-02 | Quasi-calibration method for structured light system with auxiliary camera | Seung-Jae Son et.al. | 2403.01119 | null |
2024-03-02 | Bulk-local dS $_3$ holography: the Matter with $T\bar T+Λ_2$ | Gauri Batra et.al. | 2403.01040 | null |
2024-03-02 | RISMiCal: A software package to perform fast RISM/3D-RISM calculations | Yutaka Maruyama et.al. | 2403.01039 | null |
2024-03-01 | On the origin of topotactic reduction effect for superconductivity in infinite-layer nickelates | Shengwei Zeng et.al. | 2403.00960 | null |
2024-03-08 | G3DR: Generative 3D Reconstruction in ImageNet | Pradyumna Reddy et.al. | 2403.00939 | link |
2024-03-01 | Iterative Methods for Navier–Stokes Inverse Problems | Liam O’Connor et.al. | 2403.00937 | link |
2024-03-06 | Atacama Large Aperture Submillimeter Telescope (AtLAST) Science: Solar and stellar observations | Sven Wedemeyer et.al. | 2403.00920 | null |
2024-03-01 | Entropy Driven Inductive Response of Topological Insulators | A. Mert Bozkurt et.al. | 2403.00714 | null |
2024-03-01 | Controlled creation of point defects in 3D colloidal crystals | Max P. M. Schelling et.al. | 2403.00678 | null |
2024-03-04 | Considerations on time resolution of neutron irradiated single pixel 3D structures at fuences up to $10^{17}$ n$_{eq}$/cm$^{2}$ using 120 GeV SPS pion beams | Evangelos-Leonidas Gkougkousis et.al. | 2403.00627 | null |
2024-03-01 | Rethinking Few-shot 3D Point Cloud Semantic Segmentation | Zhaochong An et.al. | 2403.00592 | link |
2024-03-01 | Lincoln’s Annotated Spatio-Temporal Strawberry Dataset (LAST-Straw) | Katherine Margaret Frances James et.al. | 2403.00566 | null |
2024-03-01 | Rational Linkages: From Poses to 3D-printed Prototypes | Daniel Huczala et.al. | 2403.00558 | null |
2024-03-01 | Global solutions of the 3D incompressible inhomogeneous viscoelastic system | Chengfei Ai et.al. | 2403.00555 | null |
2024-03-01 | Molecular unfolding formulation with enhanced quantum annealing approach | Arit Kumar Bishwas et.al. | 2403.00507 | null |
2024-03-01 | Computer-Controlled 3D Freeform Surface Weaving | Xiangjia Chen et.al. | 2403.00473 | null |
2024-03-01 | Phase retrieval beyond the homogeneous object assumption for X-ray in-line holographic imaging | Jens Lucht et.al. | 2403.00461 | null |
2024-03-01 | Configurations in the Euclidean space related to the 3D genome reconstruction problem from partially phased data | Annachiara Korchmaros et.al. | 2403.00407 | null |
2024-03-01 | HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation | Zhiying Leng et.al. | 2403.00372 | null |
2024-03-04 | Quasi-one-dimensional spin transport in antiferromagnetic $Z^3$ nodal net metals | Tingli He et.al. | 2403.00371 | null |
2024-03-01 | Small, Versatile and Mighty: A Range-View Perception Framework | Qiang Meng et.al. | 2403.00325 | null |
2024-03-01 | Niobium coaxial cavities with internal quality factors exceeding 1.5 billion for circuit quantum electrodynamics | Andrew E. Oriani et.al. | 2403.00286 | null |
2024-03-01 | Assessing Bilateral Neurovascular Bundles Function with Pulsed Wave Doppler Ultrasound: Implications for Reducing Erectile Dysfunction Following Prostate Radiotherapy | Jing Wang et.al. | 2403.00271 | null |
2024-03-01 | Diffraction and Scattering Aware Radio Map and Environment Reconstruction using Geometry Model-Assisted Deep Learning | Wangqian Chen et.al. | 2403.00229 | null |
2024-03-01 | DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots | Chunlin Li et.al. | 2403.00228 | link |
2024-03-01 | MaskLRF: Self-supervised Pretraining via Masked Autoencoding of Local Reference Frames for Rotation-invariant 3D Point Set Analysis | Takahiko Furuya et.al. | 2403.00206 | link |
2024-02-29 | Learning to walk in confined spaces using 3D representation | Takahiro Miki et.al. | 2403.00187 | link |
2024-02-29 | FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Safouane El Ghazouali et.al. | 2403.00175 | link |
2024-02-29 | Stellar Surface Magnetic Fields Impact Limb Darkening | Nadiia M. Kostogryz et.al. | 2403.00118 | null |
2024-02-29 | A spectroscopic investigation of thermal instability for cylindrical equilibria with background flow | Joris Hermans et.al. | 2403.00082 | null |
2024-03-05 | The Multi-layer Nature of Molecular Gas toward the Cygnus Region | Shiyu Zhang et.al. | 2403.00061 | null |
2024-02-29 | Learning a Generalized Physical Face Model From Data | Lingchen Yang et.al. | 2402.19477 | null |
2024-02-29 | SeMoLi: What Moves Together Belongs Together | Jenny Seidenschwarz et.al. | 2402.19463 | null |
2024-02-29 | 3D Gaussian Model for Animation and Texturing | Xiangzhi Eric Wang et.al. | 2402.19441 | null |
2024-03-01 | Digital Twin Aided Massive MIMO: CSI Compression and Feedback | Shuaifeng Jiang et.al. | 2402.19434 | null |
2024-02-29 | 3D Super-resolution Optical Fluctuation Imaging with Temporal Focusing two-photon excitation | Pawel Szczypkowski et.al. | 2402.19338 | null |
2024-02-29 | Basolateral mechanics prevents rigidity transition in epithelial monolayers | Jan Rozman et.al. | 2402.19312 | null |
2024-02-29 | DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly | Gianluca Scarpellini et.al. | 2402.19302 | link |
2024-02-29 | The impact of the explicit representation of convection on the climate of a tidally locked planet in global stretched-mesh simulations | Denis E. Sergeev et.al. | 2402.19277 | link |
2024-02-29 | T3DNet: Compressing Point Cloud Models for Lightweight 3D Recognition | Zhiyuan Yang et.al. | 2402.19264 | null |
2024-02-21 | Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting | Edgar Medina et.al. | 2402.19237 | link |
2024-02-29 | Weakly Supervised Monocular 3D Detection with a Single-View Image | Xueying Jiang et.al. | 2402.19144 | null |
2024-02-29 | Highly efficient Gauss’s law-preserving spectral algorithms for Maxwell’s double-curl source and eigenvalue problems based on eigen-decomposition | Sen Lin et.al. | 2402.19125 | null |
2024-02-29 | Symmetries and exact solutions of the diffusive Holling-Tanner prey-predator model | Roman Cherniha et.al. | 2402.19098 | null |
2024-03-01 | Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach | Sarina Thomas et.al. | 2402.19062 | null |
2024-03-05 | VEnvision3D: A Synthetic Perception Dataset for 3D Multi-Task Model Research | Jiahao Zhou et.al. | 2402.19059 | link |
2024-02-29 | WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis | Paul Friedrich et.al. | 2402.19043 | link |
2024-02-29 | Temporal segmentation of motion propagation in response to an external impulse | Sina Feldmann et.al. | 2402.19024 | null |
2024-02-29 | DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments | Ji Ma et.al. | 2402.19007 | null |
2024-02-29 | Extracting quantum-critical properties from directly evaluated enhanced perturbative continuous unitary transformations | L. Schamriß et.al. | 2402.18989 | null |
2024-02-29 | High-fidelity simulations of microramp-controlled shock wave/boundary layer interaction | Giacomo Della Posta et.al. | 2402.18971 | link |
2024-02-29 | Three-dimensional atomic interface between metal and oxide in Zr-ZrO2 nanoparticles | Yao Zhang et.al. | 2402.18943 | null |
2024-03-08 | Spectral Meets Spatial: Harmonising 3D Shape Matching and Interpolation | Dongliang Cao et.al. | 2402.18920 | null |
2024-02-29 | Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey | Yang Liu et.al. | 2402.18844 | link |
2024-02-29 | Protein Multimer Structure Prediction via Prompt Learning | Ziqi Gao et.al. | 2402.18813 | link |
2024-02-29 | A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D | Xiaohan Fei et.al. | 2402.18780 | null |
2024-03-07 | Articulated Object Manipulation with Coarse-to-fine Affordance for Mitigating the Effect of Point Cloud Noise | Suhan Ling et.al. | 2402.18699 | null |
2024-02-28 | The VOROS: Lifting ROC curves to 3D | Christopher Ratigan et.al. | 2402.18689 | link |
2024-03-06 | Autoresonant Ash Removal in Mirror Machines | Eli Gudinetsky et.al. | 2402.18687 | null |
2024-02-28 | Topologically protected emergent Fermi surface in an Abrikosov vortex lattice | Songyang Pu et.al. | 2402.18627 | null |
2024-02-28 | Unsupervised Airway Tree Clustering with Deep Learning: The Multi-Ethnic Study of Atherosclerosis (MESA) Lung Study | Sneha N. Naik et.al. | 2402.18615 | null |
2024-02-27 | Image-To-Mesh Conversion for Biomedical Simulations | Fotis Drakopoulos et.al. | 2402.18596 | null |
2024-02-28 | UniMODE: Unified Monocular 3D Object Detection | Zhuoling Li et.al. | 2402.18573 | null |
2024-02-28 | All electrical cooling of an optically levitated nanoparticle | Oscar Kremer et.al. | 2402.18532 | null |
2024-02-28 | Phase transitions beyond criticality: extending Ising universal scaling functions to describe entire phases | David Hathcock et.al. | 2402.18531 | null |
2024-02-28 | Bayesian model reconstruction based on spectral line observations | Frederik De Ceuster et.al. | 2402.18525 | link |
2024-02-28 | Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection | Xun Huang et.al. | 2402.18493 | null |
2024-02-28 | TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding | Zhihao Zhang et.al. | 2402.18490 | null |
2024-02-28 | Unraveling the complexity of the Dzyaloshinskii-Moriya interaction in layered magnets: Towards its full magnitude and chirality control | Khalil Zakeri et.al. | 2402.18466 | null |
2024-02-28 | Dissecting a miniature universe: A multi-wavelength view of galaxy quenching in the Shapley supercluster | N. Aghanim et.al. | 2402.18455 | null |
2024-03-05 | CafkNet: GNN-Empowered Forward Kinematic Modeling for Cable-Driven Parallel Robots | Zeqing Zhang et.al. | 2402.18420 | null |
2024-03-02 | LatentSwap: An Efficient Latent Code Mapping Framework for Face Swapping | Changho Choi et.al. | 2402.18351 | link |
2024-02-28 | Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | Taeho Kang et.al. | 2402.18330 | link |
2024-02-28 | A Multimodal Handover Failure Detection Dataset and Baselines | Santosh Thoduka et.al. | 2402.18319 | link |
2024-02-28 | Windowed-FourierMixer: Enhancing Clutter-Free Room Modeling with Fourier Transform | Bruno Henriques et.al. | 2402.18287 | null |
2024-02-28 | Image2Flow: A hybrid image and graph convolutional neural network for rapid patient-specific pulmonary artery segmentation and CFD flow field calculation from 3D cardiac MRI data | Tina Yao et.al. | 2402.18236 | null |
2024-02-28 | NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Jingrui Yu et.al. | 2402.18196 | link |
2024-02-28 | Plasma-induced magnetic phase in 3D $\mathrm{Mn^{II}-Nb^{IV}}$ octacyanidometalate with magnetic sponge behavior | Dominik Czernia et.al. | 2402.18195 | null |
2024-02-28 | Generation of skill-specific maps from graph world models for robotic systems | Koen de Vos et.al. | 2402.18174 | null |
2024-03-01 | 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling | Chaokang Jiang et.al. | 2402.18146 | link |
2024-02-28 | OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction | Jian Liu et.al. | 2402.18140 | null |
2024-02-28 | Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging | Bhargav Ghanekar et.al. | 2402.18102 | null |
2024-02-28 | Context-aware Talking Face Video Generation | Meidai Xuanyuan et.al. | 2402.18092 | null |
2024-02-28 | Representing 3D sparse map points and lines for camera relocalization | Bach-Thuan Bui et.al. | 2402.18011 | link |
2024-02-27 | SWTrack: Multiple Hypothesis Sliding Window 3D Multi-Object Tracking | Sandro Papais et.al. | 2402.17892 | null |
2024-02-27 | Interacting galaxies in the IllustrisTNG simulations – VI: Reconstructed orbits, close encounters and mergers | David R. Patton et.al. | 2402.17889 | null |
2024-02-27 | 3D Printing in Microfluidics: Experimental Optimization of Droplet Size and Generation Time through Flow Focusing, Phase, and Geometry Variation | Adam Britel et.al. | 2402.17876 | null |
2024-03-06 | ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Zekun Qi et.al. | 2402.17766 | link |
2024-02-27 | ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living | Marsil Zakour et.al. | 2402.17758 | null |
2024-02-27 | Analyzing Regional Organization of the Human Hippocampus in 3D-PLI Using Contrastive Learning and Geometric Unfolding | Alexander Oberstrass et.al. | 2402.17744 | null |
2024-03-08 | Approaching Periodic Systems in Ensemble Density Functional Theory via Finite One-Dimensional Models | Remi J. Leano et.al. | 2402.17742 | null |
2024-02-27 | MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation | Hanan Gani et.al. | 2402.17725 | link |
2024-02-27 | Geometric Deep Learning for Computer-Aided Design: A Survey | Negar Heidari et.al. | 2402.17695 | null |
2024-02-28 | Novel spectral methods for shock capturing and the removal of tygers in computational fluid dynamics | Sai Swetha Venkata Kolluru et.al. | 2402.17688 | null |
2024-02-27 | CAD-SIGNet: CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention | Mohammad Sadil Khan et.al. | 2402.17678 | null |
2024-02-27 | Classification of electronic nematicity in three-dimensional crystals and quasicrystals | Matthias Hecker et.al. | 2402.17657 | null |
2024-02-27 | Origin of magnetic switching cascades in tetrahedral CoFe nanostructures | Christian Schröder et.al. | 2402.17594 | null |
2024-02-25 | Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation | Xiaohan Lei et.al. | 2402.17587 | link |
2024-02-27 | An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains | George Eskandar et.al. | 2402.17562 | null |
2024-02-27 | Evaluation of block encoding for sparse matrix inversion using QSVT | Leigh Lapworth et.al. | 2402.17529 | null |
2024-02-27 | AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Point Cloud Analysis | Hongcheng Yang et.al. | 2402.17521 | link |
2024-02-27 | Collisional excitation of propyne (CH $_3$ CCH) by He atoms | M. Ben Khalifa et.al. | 2402.17491 | null |
2024-02-27 | EMO: Emote Portrait Alive – Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | Linrui Tian et.al. | 2402.17485 | null |
2024-02-27 | Generative 3D Part Assembly via Part-Whole-Hierarchy Message Passing | Bi’an Du et.al. | 2402.17464 | link |
2024-02-27 | Room Temperature Spin Filtering and Quantum Transport with Transition Metal-Doped Silicon Quantum Dot | Hemant Arora et.al. | 2402.17461 | null |
2024-02-27 | Aeolian erosion in protoplanetary discs: How impactful it is on dust evolution? | Stéphane Michoulier et.al. | 2402.17439 | null |
2024-02-27 | VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction | Jiaqi Lin et.al. | 2402.17427 | null |
2024-02-27 | Sora Generates Videos with Stunning Geometrical Consistency | Xuanyi Li et.al. | 2402.17403 | null |
2024-02-27 | Coupled Laplacian Eigenmaps for Locally-Aware 3D Rigid Point Cloud Matching | Matteo Bastico et.al. | 2402.17372 | link |
2024-02-27 | Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis | Zicheng Zhang et.al. | 2402.17364 | link |
2024-02-27 | ICP-Flow: LiDAR Scene Flow Estimation with ICP | Yancong Lin et.al. | 2402.17351 | link |
2024-02-27 | Existence and invariant measure of pullback attractors for 3D Navier-Stokes-Voigt equations with delay | Yuming Qin et.al. | 2402.17347 | null |
2024-02-27 | A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge – Multi-Task Robustness Track | Zehui Chen et.al. | 2402.17319 | null |
2024-02-27 | Denoising Diffusion Models for Inpainting of Healthy Brain Tissue | Alicia Durrer et.al. | 2402.17307 | null |
2024-02-27 | VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis | Linshan Wu et.al. | 2402.17300 | link |
2024-02-27 | DivAvatar: Diverse 3D Avatar Generation with a Single Prompt | Weijing Tao et.al. | 2402.17292 | null |
2024-02-27 | SDR-Former: A Siamese Dual-Resolution Transformer for Liver Lesion Classification Using 3D Multi-Phase Imaging | Meng Lou et.al. | 2402.17246 | null |
2024-02-28 | CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization | Hao-Yang Peng et.al. | 2402.17214 | null |
2024-02-27 | Differentiable Biomechanics Unlocks Opportunities for Markerless Motion Capture | R. James Cotton et.al. | 2402.17192 | null |
2024-02-27 | On Gaiotto’s positivity conjecture | Pavel Etingof et.al. | 2402.17174 | null |
2024-02-27 | LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment | Yiming Ren et.al. | 2402.17171 | null |
2024-02-27 | Integrated Interpolation and Block-term Tensor Decomposition for Spectrum Map Construction | Hao Sun et.al. | 2402.17138 | null |
2024-02-27 | CharNeRF: 3D Character Generation from Concept Art | Eddy Chu et.al. | 2402.17115 | null |
2024-02-26 | Parallelized Spatiotemporal Binding | Gautam Singh et.al. | 2402.17077 | null |
2024-02-26 | Asphalt Concrete Characterization Using Digital Image Correlation: A Systematic Review of Best Practices, Applications, and Future Vision | Siqi Wang et.al. | 2402.17074 | null |
2024-02-26 | HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields | Haozhe Qi et.al. | 2402.17062 | link |
2024-02-26 | A Multi-Fidelity Methodology for Reduced Order Models with High-Dimensional Inputs | Bilal Mufti et.al. | 2402.17061 | null |
2024-02-26 | GEM3D: GEnerative Medial Abstractions for 3D Shape Synthesis | Dmitry Petrov et.al. | 2402.16994 | null |
2024-02-26 | Orbital selective order and $\mathbb{Z}_3$ Potts nematicity from a non-Fermi liquid | YuZheng Xie et.al. | 2402.16952 | null |
2024-02-26 | Disentangled 3D Scene Generation with Layout Learning | Dave Epstein et.al. | 2402.16936 | null |
2024-02-23 | Topological Analysis of Mouse Brain Vasculature via 3D Light-sheet Microscopy Images | Jiachen Yao et.al. | 2402.16894 | link |
2024-02-26 | PhyGrasp: Generalizing Robotic Grasping with Physics-informed Large Multimodal Models | Dingkun Guo et.al. | 2402.16836 | null |
2024-02-27 | Weighted Monte Carlo augmented spherical Fourier-Bessel convolutional layers for 3D abdominal organ segmentation | Wenzhao Zhao et.al. | 2402.16825 | link |
2024-03-08 | One-loop quantization of Euclidean D3-branes in holographic backgrounds | Fridrik Freyr Gautason et.al. | 2402.16779 | null |
2024-02-26 | Neural Mesh Fusion: Unsupervised 3D Planar Surface Understanding | Farhad G. Zanjani et.al. | 2402.16739 | null |
2024-02-26 | Performance of high-order Godunov-type methods in simulations of astrophysical low Mach number flows | G. Leidi et.al. | 2402.16706 | null |
2024-02-26 | Deep Learning-based Cooperative LiDAR Sensing for Improved Vehicle Positioning | Luca Barbieri et.al. | 2402.16656 | null |
2024-02-26 | GEA: Reconstructing Expressive 3D Gaussian Avatar from Monocular Video | Xinqi Liu et.al. | 2402.16607 | null |
2024-02-27 | RoboGrind: Intuitive and Interactive Surface Treatment with Industrial Robots | Benjamin Alt et.al. | 2402.16542 | null |
2024-02-26 | Global well-posedness of the 3D Patlak-Keller-Segel system near a straight line | Bowei Tu et.al. | 2402.16536 | null |
2024-02-26 | Enhancement of 3D Camera Synthetic Training Data with Noise Models | Katarína Osvaldová et.al. | 2402.16514 | null |
2024-02-26 | CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency | Hanxin Zhu et.al. | 2402.16407 | null |
2024-02-27 | Development of a Generalizable Data-driven Turbulence Model: Conditioned Field Inversion and Symbolic Regression | Chenyu Wu et.al. | 2402.16355 | null |
2024-02-26 | DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer | Yizhe Wu et.al. | 2402.16308 | null |
2024-02-27 | Engineering Quantum Light Sources with Flat Optics | Jinyong Ma et.al. | 2402.16265 | null |
2024-02-26 | SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking | Yu Lin et.al. | 2402.16249 | link |
2024-02-25 | GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction | Xiao Chen et.al. | 2402.16174 | link |
2024-02-25 | 3D active nematic disclinations behave as Majorana quasiparticles | Louise C. Head et.al. | 2402.16149 | null |
2024-02-25 | Cinematographic Camera Diffusion Model | Hongda Jiang et.al. | 2402.16143 | link |
2024-02-25 | A statistical method for crack detection in 3D concrete images | Vitalii Makogin et.al. | 2402.16126 | null |
2024-02-25 | AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation | Yasheng Sun et.al. | 2402.16124 | null |
2024-02-25 | Building Flexible Machine Learning Models for Scientific Computing at Scale | Tianyu Chen et.al. | 2402.16014 | null |
2024-02-25 | Towards Mixed Reality as the Everyday Computing Paradigm: Challenges & Design Recommendations | Amir Reza Asadi et.al. | 2402.15974 | null |
2024-02-24 | Interpolation-based immersogeometric analysis methods for multi-material and multi-physics problems | Jennifer E. Fromm et.al. | 2402.15937 | null |
2024-02-24 | Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA | Wentao Mo et.al. | 2402.15933 | link |
2024-02-24 | Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting | Ziyi Yang et.al. | 2402.15870 | null |
2024-02-24 | Highly efficient interaction of a tubular-lattice hollow-core fiber and flexural acoustic waves: design, characterization and analysis | Ricardo E. da Silva et.al. | 2402.15825 | null |
2024-02-24 | Parameter-efficient Prompt Learning for 3D Point Cloud Understanding | Hongyu Sun et.al. | 2402.15823 | link |
2024-02-24 | A Generative Machine Learning Model for Material Microstructure 3D Reconstruction and Performance Evaluation | Yilin Zheng et.al. | 2402.15815 | null |
2024-02-24 | Tracing the Galactic disk from the kinematics of Gaia Cepheids | Xiaoyue Zhou et.al. | 2402.15782 | null |
2024-02-24 | PhyPlan: Compositional and Adaptive Physical Task Reasoning with Physics-Informed Skill Networks for Robot Manipulators | Harshil Vagadia et.al. | 2402.15767 | link |
2024-02-24 | CLIPose: Category-Level Object Pose Estimation with Pre-trained Vision-Language Knowledge | Xiao Lin et.al. | 2402.15726 | null |
2024-02-23 | In-beam test results of an RPC-based module for position-sensitive neutron detectors with timing readout | G. Canezin et.al. | 2402.15630 | null |
2024-02-23 | DeepSet SimCLR: Self-supervised deep sets for improved pathology representation learning | David Torpey et.al. | 2402.15598 | null |
2024-02-23 | Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving | Yichen Xie et.al. | 2402.15583 | null |
2024-02-23 | CharacterMixer: Rig-Aware Interpolation of 3D Characters | Xiao Zhan et.al. | 2402.15580 | null |
2024-02-23 | Induced moduli oscillation by radiation and space expansion in a higher-dimensional model | Hajime Otsuka et.al. | 2402.15547 | null |
2024-02-28 | A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends | Abolfazl Younesi et.al. | 2402.15490 | null |
2024-02-23 | Automatic treatment planning for radiotherapy: a cross-modality and protocol study | Gregory Szalkowski et.al. | 2402.15466 | null |
2024-03-07 | Quantum robustness of the toric code in a parallel field on the honeycomb and triangular lattice | V. Kott et.al. | 2402.15389 | null |
2024-02-23 | OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding | Francis Engelmann et.al. | 2402.15321 | null |
2024-02-23 | Stability of viscous three-dimensional stratified Couette flow via dispersion and mixing | Michele Coti Zelati et.al. | 2402.15312 | null |
2024-02-23 | When in Doubt, Think Slow: Iterative Reasoning with Latent Imagination | Martin Benfeghoul et.al. | 2402.15283 | null |
2024-02-23 | EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Zhe Wang et.al. | 2402.15272 | link |
2024-02-23 | GS-EMA: Integrating Gradient Surgery Exponential Moving Average with Boundary-Aware Contrastive Learning for Enhanced Domain Generalization in Aneurysm Segmentation | Fengming Lin et.al. | 2402.15239 | link |
2024-02-23 | Stability of a dispersion of elongated particles embedded in a viscous membrane | Harishankar Manikantan et.al. | 2402.15148 | null |
2024-02-23 | Two-Stage Block Orthogonalization to Improve Performance of $s$ -step GMRES | Ichitaro Yamazaki et.al. | 2402.15033 | null |
2024-02-22 | Laser-to-Vehicle Extrinsic Calibration in Low-Observability Scenarios for Subsea Mapping | Thomas Hitchcox et.al. | 2402.14993 | null |
2024-02-22 | Reinforcement Learning with Elastic Time Steps | Dong Wang et.al. | 2402.14961 | link |
2024-02-22 | An image-based transfer learning approach for using in situ processing data to predict laser powder bed fusion additively manufactured Ti-6Al-4V mechanical properties | Qixiang Luo et.al. | 2402.14945 | null |
2024-02-21 | CloudNine: Analyzing Meteorological Observation Impact on Weather Prediction Using Explainable Graph Neural Networks | Hyeon-Ju Jeon et.al. | 2402.14861 | null |
2024-02-22 | Cameras as Rays: Pose Estimation via Ray Diffusion | Jason Y. Zhang et.al. | 2402.14817 | null |
2024-02-22 | Consolidating Attention Features for Multi-view Image Editing | Or Patashnik et.al. | 2402.14792 | null |
2024-02-22 | Internal magnetic field structures observed by PSP/WISPR in a filament related coronal mass ejection | G. M. Cappello et.al. | 2402.14682 | null |
2024-02-22 | Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot | Fabien Baradel et.al. | 2402.14654 | link |
2024-02-22 | GaussianPro: 3D Gaussian Splatting with Progressive Propagation | Kai Cheng et.al. | 2402.14650 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642 | null |
2024-02-26 | A thermodynamic criterion for the formation of Circumplanetary Disks | Leonardo Krapp et.al. | 2402.14638 | null |
2024-02-22 | Thermal-Aware Floorplanner for 3D IC, including TSVs, Liquid Microchannels and Thermal Domains Optimization | David Cuesta et.al. | 2402.14627 | null |
2024-02-22 | Fast and Efficient Sequential Radar Parameter Estimation in MIMO-OTFS Systems | Kuranage Roche Rayan Ranasinghe et.al. | 2402.14612 | null |
2024-02-22 | Small electron polarons bound to interstitial tantalum defects in lithium tantalate | Anton Pfannstiel et.al. | 2402.14587 | null |
2024-02-22 | Quenching-driven equatorial depletion and limb asymmetries in hot Jupiter atmospheres: WASP-96b example | Maria Zamyatina et.al. | 2402.14535 | null |
2024-02-22 | NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection | Chenxi Huang et.al. | 2402.14464 | link |
2024-02-22 | S^2Former-OR: Single-Stage Bimodal Transformer for Scene Graph Generation in OR | Jialun Pei et.al. | 2402.14461 | link |
2024-02-22 | TaylorGrid: Towards Fast and High-Quality Implicit Field Learning via Direct Taylor-based Grid Optimization | Renyi Mao et.al. | 2402.14415 | null |
2024-02-22 | Modeling 3D Infant Kinetics Using Adaptive Graph Convolutional Networks | Daniel Holmberg et.al. | 2402.14400 | link |
2024-02-22 | Workspace Analysis for Laparoscopic Rectal Surgery : A Preliminary Study | Alexandra Thomieres et.al. | 2402.14386 | null |
2024-02-22 | Place Anything into Any Video | Ziling Liu et.al. | 2402.14316 | null |
2024-02-22 | Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling | Yuwei Yang et.al. | 2402.14315 | null |
2024-02-22 | MVD $^2$ : Efficient Multiview 3D Reconstruction for Multiview Diffusion | Xin-Yang Zheng et.al. | 2402.14253 | null |
2024-02-22 | Quaternion recurrent neural network with real-time recurrent learning and maximum correntropy criterion | Pauline Bourigault et.al. | 2402.14227 | null |
2024-02-22 | Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding | Yu-Qi Yang et.al. | 2402.14215 | link |
2024-02-22 | Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields | Seungtae Nam et.al. | 2402.14196 | null |
2024-02-21 | Real-time 3D-aware Portrait Editing from a Single Image | Qingyan Bai et.al. | 2402.14000 | link |
2024-02-21 | How to fault-tolerantly realize any quantum circuit with local operations | Shin Ho Choe et.al. | 2402.13863 | null |
2024-02-21 | Improving Efficiency of Iso-Surface Extraction on Implicit Neural Representations Using Uncertainty Propagation | Haoyu Li et.al. | 2402.13861 | null |
2024-02-21 | Design of a Miniature Underwater Vehicle and Data Collection System for Indoor Experimentation | Jacob Herbert et.al. | 2402.13837 | null |
2024-02-21 | Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Joongho Jo et.al. | 2402.13827 | null |
2024-02-21 | Khronos: A Unified Approach for Spatio-Temporal Metric-Semantic SLAM in Dynamic Environments | Lukas Schmid et.al. | 2402.13817 | link |
2024-02-21 | An Empirical Study on Oculus Virtual Reality Applications: Security and Privacy Perspectives | Hanyang Guo et.al. | 2402.13815 | link |
2024-02-21 | Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion | Lianghu Guo et.al. | 2402.13776 | null |
2024-02-21 | Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation | Kihong Kim et.al. | 2402.13729 | null |
2024-02-21 | Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters | Zechen Bai et.al. | 2402.13724 | link |
2024-02-21 | An empirical view of the extended atmosphere and inner envelope of the AGB star R Doradus I. Physical model based on CO lines | T. Khouri et.al. | 2402.13676 | null |
2024-02-21 | Point spread function engineering for spiral phase interferometric scattering microscopy enables robust 3D single-particle tracking | Nathan J. Brooks et.al. | 2402.13652 | null |
2024-02-21 | Obstacle crossing strategies for high-speed 4WD small-scale vehicle | Philippe Vaslin et.al. | 2402.13650 | null |
2024-02-26 | VOOM: Robust Visual Object Odometry and Mapping using Hierarchical Landmarks | Yutong Wang et.al. | 2402.13609 | link |
2024-02-21 | Flexible Physical Camouflage Generation Based on a Differential Approach | Yang Li et.al. | 2402.13575 | null |
2024-02-21 | Full-Atom Peptide Design with Geometric Latent Diffusion | Xiangzhe Kong et.al. | 2402.13555 | link |
2024-02-21 | EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization | Zhendong Xiao et.al. | 2402.13537 | null |
2024-02-21 | SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields | Zhentao Huang et.al. | 2402.13510 | null |
2024-02-21 | Field-induced electric polarization and elastic softening caused by parity-mixed $d$-$p$ hybridized states with electric multipoles in Ba$_2$CuGe$_2$O$_7$ | R. Kurihara et.al. | 2402.13504 | null |
2024-02-21 | Thermal transport in a 2D amorphous material | Yuxi Wang et.al. | 2402.13471 | null |
2024-02-20 | Coherent evolution of superexchange interaction in seconds long optical clock spectroscopy | William R. Milner et.al. | 2402.13398 | null |
2024-02-20 | New preconditioner strategy for solving block four-by-four linear systems: An application to the saddle-point problem from 3D Stokes equation | Achraf Badahmane et.al. | 2402.13373 | null |
2024-02-22 | Aria Everyday Activities Dataset | Zhaoyang Lv et.al. | 2402.13349 | link |
2024-02-08 | Development of crystal optics for Multi-Projection X-ray Imaging for synchrotron and XFEL sources | Valerio Bellucci et.al. | 2402.13262 | null |
2024-02-20 | How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey | Fabio Tosi et.al. | 2402.13255 | link |
2024-02-20 | Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields | Bo-Yu Cheng et.al. | 2402.13252 | link |
2024-02-20 | FlashTex: Fast Relightable Mesh Texturing with LightControlNet | Kangle Deng et.al. | 2402.13251 | null |
2024-02-20 | Quantized shift response in multi-gap topological phases | Wojciech J. Jankowski et.al. | 2402.13245 | null |
2024-02-20 | Stark Effects of Rydberg Excitons in a Monolayer WSe2 P-N Junction | Zhen Lian et.al. | 2402.13174 | null |
2024-03-04 | 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data | Zhi-Yi Lin et.al. | 2402.13172 | null |
2024-02-20 | Towards a new model-independent calibration of Gamma-Ray Bursts | Arianna Favale et.al. | 2402.13115 | null |
2024-02-20 | HiRIS: an Airborne Sonar Sensor with a 1024 Channel Microphone Array for In-Air Acoustic Imaging | Dennis Laurijssen et.al. | 2402.13110 | null |
2024-02-20 | 3D high-resolution imaging algorithm using 1D MIMO array for autonomous driving application | Sen Yuan et.al. | 2402.13062 | null |
2024-02-20 | Data Repository of Finite Element Models of Normal and Deformed Thoracolumbar Spine | Morteza Rasouligandomani et.al. | 2402.13041 | null |
2024-02-22 | The Santa Barbara Binary-Disk Code Comparison | Paul C. Duffell et.al. | 2402.13039 | null |
2024-02-20 | N-MPC for Deep Neural Network-Based Collision Avoidance exploiting Depth Images | Martin Jacquet et.al. | 2402.13038 | null |
2024-02-20 | Advancements in Point Cloud-Based 3D Defect Detection and Classification for Industrial Systems: A Comprehensive Survey | Anju Rani et.al. | 2402.12923 | null |
2024-02-20 | Real-time High-resolution View Synthesis of Complex Scenes with Explicit 3D Visibility Reasoning | Tiansong Zhou et.al. | 2402.12886 | null |
2024-02-20 | Autonomous Reality Modelling for Cultural Heritage Sites employing cooperative quadrupedal robots and unmanned aerial vehicles | Nikolaos Giakoumidis et.al. | 2402.12794 | null |
2024-02-20 | OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow | Simon Boeder et.al. | 2402.12792 | null |
2024-02-20 | From Movements to Metrics: Evaluating Explainable AI Methods in Skeleton-Based Human Activity Recognition | Kimji N. Pellano et.al. | 2402.12790 | null |
2024-02-20 | Equivariant Pretrained Transformer for Unified Geometric Learning on Multi-Domain 3D Molecules | Rui Jiao et.al. | 2402.12714 | null |
2024-02-20 | MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction | Shitao Tang et.al. | 2402.12712 | null |
2024-02-19 | The Radcliffe Wave is Oscillating | Ralf Konietzka et.al. | 2402.12596 | null |
2024-02-19 | An evaluation of Deep Learning based stereo dense matching dataset shift from aerial images and a large scale stereo dataset | Teng Wu et.al. | 2402.12522 | link |
2024-02-19 | Diffeomorphism Neural Operator for various domains and parameters of partial differential equations | Zhiwei Zhao et.al. | 2402.12475 | link |
2024-02-19 | Signature of the atmospheric asymmetries of hot and ultra-hot Jupiters in lightcurves | Aurélien Falco et.al. | 2402.12355 | null |
2024-02-19 | Image Super-resolution Inspired Electron Density Prediction | Chenghan Li et.al. | 2402.12335 | link |
2024-02-19 | L-QLES: Sparse Laplacian generator for evaluating Quantum Linear Equation Solvers | Leigh Lapworth et.al. | 2402.12266 | null |
2024-02-19 | Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships | Sebastian Koch et.al. | 2402.12259 | link |
2024-02-19 | Low-mass Runaways from the Orion Nebula Cluster – Kinematic Age Constraints on Star Cluster Formation | Muhammad Fajrin et.al. | 2402.12258 | null |
2024-02-19 | Stability of the coronal magnetic field around large confined and eruptive solar flares | Manu Gupta et.al. | 2402.12254 | null |
2024-02-19 | Water Vapour Transit Ambiguities for Habitable M-Earths | Evelyn Macdonald et.al. | 2402.12253 | null |
2024-02-19 | Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability | Xuelin Qian et.al. | 2402.12225 | null |
2024-02-19 | Colorizing Monochromatic Radiance Fields | Yean Cheng et.al. | 2402.12184 | null |
2024-02-19 | 3D Vascular Segmentation Supervised by 2D Annotation of Maximum Intensity Projection | Zhanqiang Guo et.al. | 2402.12128 | link |
2024-02-19 | A Spatiotemporal Illumination Model for 3D Image Fusion in Optical Coherence Tomography | Stefan Ploner et.al. | 2402.12114 | null |
2024-02-19 | Towards Explainable LiDAR Point Cloud Semantic Segmentation via Gradient Based Target Localization | Abhishek Kuriyal et.al. | 2402.12098 | link |
2024-02-19 | Photonic Chiplet Interconnection via 3D-Nanoprinted Interposer | Huiyu Huang et.al. | 2402.11988 | null |
2024-02-19 | One2Avatar: Generative Implicit Head Avatar For Few-shot User Adaptation | Zhixuan Yu et.al. | 2402.11909 | null |
2024-02-19 | Real-time 3D Semantic Scene Perception for Egocentric Robots with Binocular Vision | K. Nguyen et.al. | 2402.11872 | link |
2024-02-19 | An Endoscopic Chisel: Intraoperative Imaging Carves 3D Anatomical Models | Jan Emily Mangulabnan et.al. | 2402.11840 | null |
2024-02-19 | DIO: Dataset of 3D Mesh Models of Indoor Objects for Robotics and Computer Vision Applications | Nillan Nimal et.al. | 2402.11836 | null |
2024-02-19 | Unveiling the Depths: A Multi-Modal Fusion Framework for Challenging Scenarios | Jialei Xu et.al. | 2402.11826 | null |
2024-02-29 | SDGE: Stereo Guided Depth Estimation for 360 $^\circ$ Camera Sets | Jialei Xu et.al. | 2402.11791 | null |
2024-02-18 | Bulk and boundary entanglement transitions in the projective gauge-Higgs model | Hiroki Sukeno et.al. | 2402.11738 | null |
2024-02-18 | LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection | Jingyu Song et.al. | 2402.11735 | link |
2024-02-18 | Mixed material point method formulation, stabilization, and validation for a unified analysis of free-surface and seepage flow | Bodhinanda Chandra et.al. | 2402.11719 | null |
2024-02-18 | 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Till Beemelmanns et.al. | 2402.11680 | link |
2024-02-18 | MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection | Till Beemelmanns et.al. | 2402.11677 | link |
2024-02-18 | Modified Massive Abelian $p$-Form ($p = 1, 2, 3$ ) Gauge Theories: Existence of the Pseudo-Scalar field and Its Implications | E. Harikumar et.al. | 2402.11598 | null |
2024-02-18 | CowScape: Quantitative reconstruction of the conformational landscape of biological macromolecules from cryo-EM data | Felix Lambrecht et.al. | 2402.11589 | null |
2024-02-18 | Holographic RG flows and boundary conditions in a 3D gauged supergravity | Ksenia Arkhipova et.al. | 2402.11586 | null |
2024-02-18 | A novel Fourier neural operator framework for classification of multi-sized images: Application to 3D digital porous media | Ali Kashefi et.al. | 2402.11568 | link |
2024-02-18 | Polarization-dependent resonant phenomena in all-dielectric scatterers: inversion of magnetic inductance and electric displacement | Aleksandr Shvartsburg et.al. | 2402.11509 | null |
2024-02-25 | A Robust Error-Resistant View Selection Method for 3D Reconstruction | Shaojie Zhang et.al. | 2402.11431 | null |
2024-02-17 | Impactos da Navegação Baseada em Performance nos Tempos de Voo da Aviação Comercial | João B. T. Szenczuk et.al. | 2402.11374 | null |
2024-02-21 | Diffuse Sound Field Synthesis | Franz Zotter et.al. | 2402.11330 | null |
2024-02-17 | ICHPro: Intracerebral Hemorrhage Prognosis Classification Via Joint-attention Fusion-based 3d Cross-modal Network | Xinlei Yu et.al. | 2402.11307 | link |
2024-02-17 | Dense Matchers for Dense Tracking | Tomáš Jelínek et.al. | 2402.11287 | null |
2024-02-17 | Constructing the three-dimensional extinction density maps using V-net | Bingqiu Chen et.al. | 2402.11270 | null |
2024-02-17 | DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model | Yu Feng et.al. | 2402.11241 | null |
2024-02-17 | Hand Biometrics in Digital Forensics | Asish Bera et.al. | 2402.11206 | null |
2024-02-17 | Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review | Thang-Anh-Quan Nguyen et.al. | 2402.11141 | link |
2024-02-16 | Universal Design Methodology for Printable Microstructural Materials via a New Deep Generative Learning Model: Application to a Piezocomposite | Mohammad Saber Hashemi et.al. | 2402.11102 | null |
2024-02-16 | GIM: Learning Generalizable Image Matcher From Internet Videos | Xuelun Shen et.al. | 2402.11095 | link |
2024-02-16 | Searching the SN 1987A SETI Ellipsoid with TESS | Bárbara Cabrales et.al. | 2402.11037 | null |
2024-02-16 | Occlusion Resilient 3D Human Pose Estimation | Soumava Kumar Roy et.al. | 2402.11036 | null |
2024-02-16 | Type Ia supernova explosion models are inherently multidimensional | R. Pakmor et.al. | 2402.11010 | null |
2024-02-21 | ChemReasoner: Heuristic Search over a Large Language Model’s Knowledge Space using Quantum-Chemical Feedback | Henry W. Sprueill et.al. | 2402.10980 | link |
2024-02-12 | Roll-to-roll tomographic volumetric additive manufacturing for continuous production of microstructures on long flexible substrates | Joseph Toombs et.al. | 2402.10955 | null |
2024-02-16 | 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Tsung-Wei Ke et.al. | 2402.10885 | null |
2024-02-16 | Multi-Model 3D Registration: Finding Multiple Moving Objects in Cluttered Point Clouds | David Jin et.al. | 2402.10865 | null |
2024-02-19 | Chirality enhancement using topology-designed 3D nanophotonic antennas | Atsushi Taguchi et.al. | 2402.10742 | null |
2024-02-19 | PointMamba: A Simple State Space Model for Point Cloud Analysis | Dingkang Liang et.al. | 2402.10739 | link |
2024-02-16 | StableLego: Stability Analysis of Block Stacking Assembly | Ruixuan Liu et.al. | 2402.10711 | link |
2024-02-16 | X-ray Linear Dichroic Tomography of Crystallographic and Topological Defects | Andreas Apseros et.al. | 2402.10647 | null |
2024-02-16 | PEGASUS: Personalized Generative 3D Avatars with Composable Attributes | Hyunsoo Cha et.al. | 2402.10636 | null |
2024-02-16 | Envisioning the Future Role of 3D Wireless Networks in Preventing and Managing Disasters and Emergency Situations | Ahmed Alhammadi et.al. | 2402.10600 | null |
2024-02-16 | Localising pulsations in the hard X-ray and microwave emission of an X-class flare | Hannah Collier et.al. | 2402.10546 | null |
2024-02-16 | GaussianHair: Hair Modeling and Rendering with Light-aware Gaussians | Haimin Luo et.al. | 2402.10483 | null |
2024-02-15 | Field Line Universal relaXer (FLUX): A Fluxon Approach to Coronal Magnetic Field Modeling | Chris Lowder et.al. | 2402.10370 | null |
2024-02-15 | Deep Spectral Meshes: Multi-Frequency Facial Mesh Processing with Graph Neural Networks | Robert Kosk et.al. | 2402.10365 | null |
2024-02-15 | A 3D phase-field based Eulerian variational framework for multiphase fluid-structure interaction with contact dynamics | Xiaoyu Mao et.al. | 2402.10348 | null |
2024-02-15 | Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions | Muhammad Arbab Arshad et.al. | 2402.10344 | null |
2024-02-15 | LaserSAM: Zero-Shot Change Detection Using Visual Segmentation of Spinning LiDAR | Alexander Krawciw et.al. | 2402.10321 | null |
2024-02-27 | Stability for the 3D Riemannian Penrose inequality | Conghan Dong et.al. | 2402.10299 | null |
2024-02-15 | Extracting the current-phase-relation of a monolithic three-dimensional nano-constriction using a DC-current-tunable superconducting microwave cavity | Kevin Uhl et.al. | 2402.10276 | null |
2024-02-20 | GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting | Chen Yang et.al. | 2402.10259 | link |
2024-02-15 | Ising on the Graph: Task-specific Graph Subsampling via the Ising Model | Maria Bånkestad et.al. | 2402.10206 | null |
2024-02-15 | Mirror Chern Bands and Weyl Nodal Loops in Altermagnets | Daniil S. Antonenko et.al. | 2402.10201 | null |
2024-02-15 | A coupled VOF/embedded boundary method to model two-phase flows on arbitrary solid surfaces | Mathilde Tavares et.al. | 2402.10185 | null |
2024-02-15 | Is Continual Learning Ready for Real-world Challenges? | Theodora Kontogianni et.al. | 2402.10130 | null |
2024-02-15 | GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering | Abdullah Hamdi et.al. | 2402.10128 | link |
2024-02-15 | Uncovering the Three-Dimensional Structure of Upconverting Core-Shell Nanoparticles with Multislice Electron Ptychography | Stephanie M. Ribet et.al. | 2402.10084 | null |
2024-02-15 | Tomography of orbital vortex lines in a topological semimetal | T. Figgemeier et.al. | 2402.10031 | null |
2024-02-15 | Three-dimensional active nematic turbulence: chirality, flow alignment and elastic anisotropy | Nika Kralj et.al. | 2402.10020 | null |
2024-02-25 | MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding | Hai-Tao Yu et.al. | 2402.10002 | link |
2024-02-15 | One-shot omnidirectional pressure integration through matrix inversion | Fernando Zigunov et.al. | 2402.09988 | link |
2024-02-14 | Loopy-SLAM: Dense Neural SLAM with Loop Closures | Lorenzo Liso et.al. | 2402.09944 | null |
2024-02-15 | Effective yields as tracers of feedback effects on metallicity scaling relations in the EAGLE cosmological simulations | M. C. Zerbo et.al. | 2402.09904 | null |
2024-02-15 | Lester: rotoscope animation through video object segmentation and tracking | Ruben Tous et.al. | 2402.09883 | link |
2024-02-15 | 3D Cooperative Localization in UAV Systems: CRLB Analysis and Security Solutions | Zexin Fang et.al. | 2402.09810 | null |
2024-02-15 | Reg-NF: Efficient Registration of Implicit Surfaces within Neural Fields | Stephen Hausler et.al. | 2402.09722 | null |
2024-02-16 | Asymptotic stability for $n$ -dimensional isentropic compressible MHD equations without magnetic diffusion | Quansen Jiu et.al. | 2402.09661 | null |
2024-02-14 | DeepATLAS: One-Shot Localization for Biomedical Data | Peter D. Chang et.al. | 2402.09587 | null |
2024-02-14 | Automated Plaque Detection and Agatston Score Estimation on Non-Contrast CT Scans: A Multicenter Study | Andrew M. Nguyen et.al. | 2402.09569 | null |
2024-02-14 | A 3D Memristor Architecture for In-Memory Computing Demonstrated with SHA3 | Muayad J. Aljafar et.al. | 2402.09545 | null |
2024-02-14 | Superconducting Quantum Memory with a Suspended Coaxial Resonator | Lev Krayzman et.al. | 2402.09504 | null |
2024-02-14 | Magic-Me: Identity-Specific Video Customized Diffusion | Ze Ma et.al. | 2402.09368 | link |
2024-02-14 | Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy | Brice Rauby et.al. | 2402.09359 | link |
2024-02-14 | Investigation of Ga interstitial and vacancy diffusion in $β$-Ga$_2$O$_3$ via split defects: a direct approach via master diffusion equations | Channyung Lee et.al. | 2402.09354 | null |
2024-02-14 | Registration of Longitudinal Spine CTs for Monitoring Lesion Growth | Malika Sanhinova et.al. | 2402.09341 | null |
2024-02-14 | 3D-based RNA function prediction tools in rnaglib | Carlos Oliver et.al. | 2402.09330 | link |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325 | link |
2024-02-14 | TDViT: Temporal Dilated Video Transformer for Dense Video Tasks | Guanxiong Sun et.al. | 2402.09257 | link |
2024-02-14 | Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality | Christian Marinoni et.al. | 2402.09245 | null |
2024-02-14 | A general mechanism for enhancer-insulator pairing reveals heterogeneous dynamics in long-distant 3D gene regulation | Lucas Hedström et.al. | 2402.09209 | null |
2024-02-14 | Non-Volatile Analog Control and Reconfiguration of a Vortex Nano-Oscillator Frequency | Maksim Stebliy et.al. | 2402.09114 | null |
2024-02-14 | L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects | Yutaro Yamada et.al. | 2402.09052 | null |
2024-02-17 | Multi-modality transrectal ultrasound video classification for identification of clinically significant prostate cancer | Hong Wu et.al. | 2402.08987 | link |
2024-02-14 | HyCubE: Efficient Knowledge Hypergraph 3D Circular Convolutional Embedding | Zhao Li et.al. | 2402.08961 | null |
2024-02-26 | Depth-aware Volume Attention for Texture-less Stereo Matching | Tong Zhao et.al. | 2402.08931 | link |
2024-02-16 | Sharp decay estimates and asymptotic stability for incompressible MHD equations without viscosity or magnetic diffusion | Yaowei Xie et.al. | 2402.08913 | null |
2024-02-14 | Weakly Supervised Segmentation of Vertebral Bodies with Iterative Slice-propagation | Shiqi Peng et.al. | 2402.08892 | null |
2024-02-14 | DUDF: Differentiable Unsigned Distance Fields with Hyperbolic Scaling | Miguel Fainstein et.al. | 2402.08876 | link |
2024-02-13 | The Euler non-mixing made easy | Boris Khesin et.al. | 2402.08836 | null |
2024-02-13 | Hydrodynamic shielding in radiative multicloud outflows within multiphase galactic winds | Andrés S. Villares et.al. | 2402.08745 | null |
2024-02-12 | Weakly Supervised Detection of Pheochromocytomas and Paragangliomas in CT | David C. Oluigboa et.al. | 2402.08697 | null |
2024-02-13 | IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation | Luke Melas-Kyriazi et.al. | 2402.08682 | null |
2024-02-13 | Learning Continuous 3D Words for Text-to-Image Generation | Ta-Ying Cheng et.al. | 2402.08654 | link |
2024-02-27 | Finite density QCD equation of state: critical point and lattice-based $T’$ -expansion | Micheal Kahangirwe et.al. | 2402.08636 | null |
2024-02-13 | NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs | Michael Fischer et.al. | 2402.08622 | null |
2024-02-13 | Gaussian-Sum Filter for Range-based 3D Relative Pose Estimation in the Presence of Ambiguities | Syed S. Ahmed et.al. | 2402.08566 | null |
2024-02-13 | Deep learning enhanced cost-aware multi-fidelity uncertainty quantification of a computational model for radiotherapy | Piermario Vitullo et.al. | 2402.08494 | null |
2024-02-13 | On the notion of a quaternionic holomorphic function | Michael Parfenov et.al. | 2402.08487 | null |
2024-02-13 | Moonwalk: Advancing Gait-Based User Recognition on Wearable Devices with Metric Learning | Asaf Liberman et.al. | 2402.08451 | null |
2024-02-13 | Precise and Fast LIDAR via Electrical Asynchronous Sampling Based on a Single Femtosecond Laser | Lizong Dong et.al. | 2402.08440 | null |
2024-02-20 | Camera Calibration through Geometric Constraints from Rotation and Projection Matrices | Muhammad Waleed et.al. | 2402.08437 | link |
2024-02-13 | Asymptotic Weak Gravity Conjecture in M-theory on K3 $\times$ K3 | M. Charkaoui et.al. | 2402.08389 | null |
2024-02-13 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359 | link |
2024-02-13 | CrossGaze: A Strong Method for 3D Gaze Estimation in the Wild | Andy Cătrună et.al. | 2402.08316 | null |
2024-02-13 | One-to-many Reconstruction of 3D Geometry of cultural Artifacts using a synthetically trained Generative Model | Thomas Pöllabauer et.al. | 2402.08310 | null |
2024-02-13 | Ant Colony Optimization for Cooperative Inspection Path Planning Using Multiple Unmanned Aerial Vehicles | Duy Nam Bui et.al. | 2402.08246 | link |
2024-02-13 | Dispersive and Strichartz estimates for 3D wave equation with a class of many-electric potentials | Haoran Wang et.al. | 2402.08213 | null |
2024-02-13 | TurtleRabbit 2024 SSL Team Description Paper | Linh Trinh et.al. | 2402.08205 | null |
2024-02-13 | THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation | Wilbert Pumacay et.al. | 2402.08191 | link |
2024-02-13 | H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields | Minyoung Park et.al. | 2402.08138 | null |
2024-02-12 | Automated Classification of Body MRI Sequence Type Using Convolutional Neural Networks | Kimberly Helm et.al. | 2402.08098 | null |
2024-02-12 | Extending 3D body pose estimation for robotic-assistive therapies of autistic children | Laura Santos et.al. | 2402.08006 | null |
2024-02-11 | Correcting Projection Effects in CMEs using GCS-based Large Statistics of Multi-viewpoint Observations | Harshita Gandhi et.al. | 2402.07961 | null |
2024-02-12 | A holographic mobile-based application for practicing pronunciation of basic English vocabulary for Spanish speaking children | R. Cerezo et.al. | 2402.07897 | null |
2024-02-12 | 3D physical structure and angular expansion of the remnant of the recurrent nova T Pyx | E. Santamaría et.al. | 2402.07879 | null |
2024-02-12 | A Benchmark Grocery Dataset of Realworld Point Clouds From Single View | Shivanand Venkanna Sheshappanavar et.al. | 2402.07819 | null |
2024-02-12 | Echocardiogram-based ventricular isogeometric cardiac analysis using multi-patch fitted NURBS | Robin Willems et.al. | 2402.07728 | null |
2024-02-12 | Optimization of Sparse Convolution for 3D-Point Cloud on GPUs with CUDA | Chester Luo et.al. | 2402.07710 | null |
2024-02-12 | Signed Distance Field based Segmentation and Statistical Shape Modelling of the Left Atrial Appendage | Kristine Aavild Juhl et.al. | 2402.07708 | null |
2024-02-12 | Evaluation of a Smart Mobile Robotic System for Industrial Plant Inspection and Supervision | Georg K. J. Fischer et.al. | 2402.07691 | null |
2024-02-12 | AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer | Tanmoy Dam et.al. | 2402.07680 | link |
2024-02-12 | GBOT: Graph-Based 3D Object Tracking for Augmented Reality-Assisted Assembly Guidance | Shiyu Li et.al. | 2402.07677 | link |
2024-02-12 | A Computational Model of the Electrically or Acoustically Evoked Compound Action Potential in Cochlear Implant Users with Residual Hearing | Daniel Kipping et.al. | 2402.07673 | null |
2024-02-12 | DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation | Chenchang Li et.al. | 2402.07648 | null |
2024-02-12 | Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles | Rui Song et.al. | 2402.07635 | null |
2024-02-12 | UAV-assisted Visual SLAM Generating Reconstructed 3D Scene Graphs in GPS-denied Environments | Ahmed Radwan et.al. | 2402.07537 | null |
2024-02-12 | Remarks on variable Lebesgue spaces and fractional Navier-Stokes equations | Gastón Vergara-Hermosilla et.al. | 2402.07508 | null |
2024-02-12 | Make it more specific: A novel uncertainty based airway segmentation application on 3D U-Net and its variants | Shiyi Wang et.al. | 2402.07403 | null |
2024-02-12 | Unsupervised Discovery of Object-Centric Neural Fields | Rundong Luo et.al. | 2402.07376 | null |
2024-02-11 | BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis | Leandro A. Passos et.al. | 2402.07310 | link |
2024-02-11 | LISR: Learning Linear 3D Implicit Surface Representation Using Compactly Supported Radial Basis Functions | Atharva Pandey et.al. | 2402.07301 | null |
2024-02-11 | Virtual reassembling of 3D fragments for the data-driven analysis of fracture mechanisms in composite materials | Thomas Wilhelm et.al. | 2402.07289 | null |
2024-02-11 | PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud Compression | Jiahao Pang et.al. | 2402.07243 | null |
2024-02-11 | GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting | Xiaoyu Zhou et.al. | 2402.07207 | null |
2024-02-11 | 3D Gaussian as a New Vision Era: A Survey | Ben Fei et.al. | 2402.07181 | null |
2024-02-11 | Large-Language-Model Empowered Dose Volume Histogram Prediction for Intensity Modulated Radiotherapy | Zehao Dong et.al. | 2402.07167 | null |
2024-02-11 | Grain boundary strain localization in CdTe solar cell revealed by Scanning 3D X-ray diffraction microscopy | A. Shukla et.al. | 2402.07155 | null |
2024-02-11 | Learning by Watching: A Review of Video-based Learning Approaches for Robot Manipulation | Chrisantus Eze et.al. | 2402.07127 | null |
2024-02-11 | 3D-mapping and manipulation of photocurrent in an optoelectronic diamond device | A. A. Wood et.al. | 2402.07091 | null |
2024-02-10 | Finding safe 3D robot grasps through efficient haptic exploration with unscented Bayesian optimization and collision penalty | Joao Castanheira et.al. | 2402.07024 | null |
2024-02-10 | Space-time shape optimization of rotating electric machines | Alessio Cesarano et.al. | 2402.07017 | null |
2024-02-10 | An Optimization Framework for Processing and Transfer Learning for the Brain Tumor Segmentation | Tianyi Ren et.al. | 2402.07008 | null |
2024-02-10 | Speech motion anomaly detection via cross-modal translation of 4D motion fields from tagged MRI | Xiaofeng Liu et.al. | 2402.06984 | null |
2024-02-10 | Semantic Object-level Modeling for Robust Visual Camera Relocalization | Yifan Zhu et.al. | 2402.06951 | null |
2024-02-10 | Assessing Uncertainty Estimation Methods for 3D Image Segmentation under Distribution Shifts | Masoumeh Javanbakhat et.al. | 2402.06937 | null |
2024-02-10 | Localizing axial dense emitters based on single-helix point spread function and deep learning | Yihong Ji et.al. | 2402.06863 | null |
2024-02-09 | Neural Rendering based Urban Scene Reconstruction for Autonomous Driving | Shihao Shen et.al. | 2402.06826 | null |
2024-02-09 | Squidgets: Sketch-based Widget Design and Direct Manipulation of 3D Scene | Joonho Kim et.al. | 2402.06795 | null |
2024-02-09 | Oriented-grid Encoder for 3D Implicit Representations | Arihant Gaur et.al. | 2402.06752 | null |
2024-02-09 | Boundary controllability of incompressible Euler fluids with Boussinesq heat effects | Enrique Fernández-Cara et.al. | 2402.06709 | null |
2024-02-09 | Modeling Microstrip Antenna | Luis Alberto Rabanal Ramirez et.al. | 2402.06575 | null |
2024-02-09 | Transferring facade labels between point clouds with semantic octrees while considering change detection | Sophia Schwarz et.al. | 2402.06531 | link |
2024-02-09 | Reconstructing facade details using MLS point clouds and Bag-of-Words approach | Thomas Froech et.al. | 2402.06521 | link |
2024-02-09 | Classifying point clouds at the facade-level using geometric features and deep learning networks | Yue Tan et.al. | 2402.06506 | link |
2024-02-09 | Deep Learning-Based Auto-Segmentation of Planning Target Volume for Total Marrow and Lymph Node Irradiation | Ricardo Coimbra Brioso et.al. | 2402.06494 | null |
2024-02-09 | New Interstellar Extinction Maps Based on Gaia and Other Sky Surveys | G. A. Gontcharov et.al. | 2402.06474 | null |
2024-02-09 | Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation | Peter Hönig et.al. | 2402.06436 | null |
2024-02-09 | Weak global attractor for the $3D$ -Navier-Stokes equations via the globally modified Navier-Stokes equations | Matheus Cheque Bortolan et.al. | 2402.06435 | null |
2024-02-09 | CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention | Yifeng Bai et.al. | 2402.06423 | null |
2024-02-09 | ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting | Georgii Stanishevskii et.al. | 2402.06390 | link |
2024-02-09 | A Network for structural dense displacement based on 3D deformable mesh model and optical flow | Peimian Du et.al. | 2402.06329 | null |
2024-02-09 | A plastic correction algorithm for full-field elasto-plastic finite element simulations : critical assessment of predictive capabilities and improvement by machine learning | Abhishek Palchoudhary et.al. | 2402.06313 | null |
2024-02-09 | An integrated heart-torso electromechanical model for the simulation of electrophysiogical outputs accounting for myocardial deformation | Elena Zappon et.al. | 2402.06308 | null |
2024-02-09 | MLS2LoD3: Refining low LoDs building models with MLS point clouds to reconstruct semantic LoD3 building models | Olaf Wysocki et.al. | 2402.06288 | null |
2024-02-09 | Wave optical model for tomographic volumetric additive manufacturing | Felix Wechsler et.al. | 2402.06283 | link |
2024-02-09 | Gravitational lensing stereoscopy | Ira Rai et.al. | 2402.06217 | null |
2024-02-13 | GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data | Haoyuan Li et.al. | 2402.06198 | null |
2024-02-09 | Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain | Amin Karimi Monsefi et.al. | 2402.06190 | null |
2024-02-09 | HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting | Zhenglin Zhou et.al. | 2402.06149 | link |
2024-02-09 | Multiple Instance Learning for Cheating Detection and Localization in Online Examinations | Yemeng Liu et.al. | 2402.06107 | null |
2024-02-08 | 3D-2D Neural Nets for Phase Retrieval in Noisy Interferometric Imaging | Andrew H. Proppe et.al. | 2402.06063 | null |
2024-02-08 | A versatile robotic hand with 3D perception, force sensing for autonomous manipulation | Nikolaus Correll et.al. | 2402.06018 | link |
2024-02-08 | Balancing a 3D Inverted Pendulum using Remote Magnetic Manipulation | Jasan Zughaibi et.al. | 2402.06012 | null |
2024-02-20 | Collaborative Control for Geometry-Conditioned PBR Image Generation | Shimon Vainer et.al. | 2402.05919 | null |
2024-02-08 | CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion | Shoubin Yu et.al. | 2402.05889 | link |
2024-02-08 | Adaptive Surface Normal Constraint for Geometric Estimation from Monocular Images | Xiaoxiao Long et.al. | 2402.05869 | null |
2024-02-08 | On Experimental Emulation of Printability and Fleet Aware Generic Mesh Decomposition for Enabling Aerial 3D Printing | Marios-Nektarios Stamatopoulos et.al. | 2402.05853 | null |
2024-02-08 | AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning | Wamiq Reyaz Para et.al. | 2402.05803 | null |
2024-02-08 | Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Yuxi Wei et.al. | 2402.05746 | link |
2024-02-08 | CTGAN: Semantic-guided Conditional Texture Generator for 3D Shapes | Yi-Ting Pan et.al. | 2402.05728 | null |
2024-02-08 | DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer | Zhiyuan Ma et.al. | 2402.05712 | link |
2024-02-08 | MERP: Metaverse Extended Realtiy Portal | Anisha Ghosh et.al. | 2402.05592 | null |
2024-02-21 | Understanding electronic excited states in BiFeO $_3$ via ab initio calculations and symmetry analysis | Aseem Rajan Kshirsagar et.al. | 2402.05542 | null |
2024-02-08 | Tightly Coupled Range Inertial Localization on a 3D Prior Map Based on Sliding Window Factor Graph Optimization | Kenji Koide et.al. | 2402.05540 | null |
2024-02-09 | NCRF: Neural Contact Radiance Fields for Free-Viewpoint Rendering of Hand-Object Interaction | Zhongqun Zhang et.al. | 2402.05532 | null |
2024-02-08 | Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application | Bumsoo Kim et.al. | 2402.05448 | null |
2024-02-08 | Memory-efficient deep end-to-end posterior network (DEEPEN) for inverse problems | Jyothi Rikhab Chand et.al. | 2402.05422 | null |
2024-02-08 | A State-of-the-art Survey on Full-duplex Network Design | Yonghwi Kim et.al. | 2402.05402 | null |
2024-02-08 | Block Mott insulating state induced by next-nearest neighbor hopping in the S = 3/2 zigzag chain BaCoTe2O7 | Ling-Fang Lin et.al. | 2402.05389 | null |
2024-02-08 | Can Channels be Fully Inferred Between Two Antenna Panels? | Y. Qiu et.al. | 2402.05387 | null |
2024-02-08 | 3D ferroelectric phase field simulations of polycrystalline multi-phase hafnia and zirconia based ultra-thin films | Prabhat Kumar et.al. | 2402.05331 | null |
2024-02-07 | Carousel phase retrieval algorithm for 3D coherent X-ray diffraction imaging | Fangzhou Ai et.al. | 2402.05283 | link |
2024-02-04 | Comparative Analysis of Kinect-Based and Oculus-Based Gaze Region Estimation Methods in a Driving Simulator | David González-Ortega et.al. | 2402.05248 | null |
2024-02-07 | SPAD : Spatially Aware Multiview Diffusers | Yash Kant et.al. | 2402.05235 | null |
2024-02-07 | FLARE: field line analysis and reconstruction for 3D plasma boundary modeling | H. Frerichs et.al. | 2402.05225 | null |
2024-02-07 | Self-calibrated convolution towards glioma segmentation | Felipe C. R. Salvagnini et.al. | 2402.05218 | null |
2024-02-07 | On the evolution of the observed Mass-to-Length relationship for star-forming filaments | Jiancheng Feng et.al. | 2402.05186 | null |
2024-02-07 | Electromagnetic signatures from accreting massive black hole binaries in time domain photometric surveys | Fabiola Cocchiararo et.al. | 2402.05175 | null |
2024-02-07 | ARCollab: Towards Multi-User Interactive Cardiovascular Surgical Planning in Mobile Augmented Reality | Pratham Mehta et.al. | 2402.05075 | null |
2024-02-07 | LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation | Jiaxiang Tang et.al. | 2402.05054 | null |
2024-02-08 | PhosNetVis: a web-based tool for kinase enrichment analysis and interactive 2D/3D network visualizations of phosphoproteomics data | Osho Rawal et.al. | 2402.05016 | null |
2024-02-07 | Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning | Apoorva Vashisth et.al. | 2402.04894 | link |
2024-02-07 | Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration | Chaoqun Wang et.al. | 2402.04883 | null |
2024-02-19 | E(3)-Equivariant Mesh Neural Networks | Thuan Trang et.al. | 2402.04821 | link |
2024-02-07 | Mesh-based Gaussian Splatting for Real-time Large-scale Deformation | Lin Gao et.al. | 2402.04796 | null |
2024-02-07 | Comparing Observed with Simulated Solar Disk Center Scattering Polarization in the Sr I 4607 Å line | Franziska Zeuner et.al. | 2402.04736 | null |
2024-02-07 | InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior | Chenguo Lin et.al. | 2402.04717 | link |
2024-02-07 | Nonuniversal Equation of State of a Quasi-2D Bose Gas in Dimensional Crossover | Xiaoran Ye et.al. | 2402.04703 | null |
2024-02-07 | V2VSSC: A 3D Semantic Scene Completion Benchmark for Perception with Vehicle to Vehicle Communication | Yuanfang Zhang et.al. | 2402.04671 | null |
2024-02-07 | SPLEND1D, a reduced one-dimensional model to investigate the physics of plasma detachment | O. Février et.al. | 2402.04656 | null |
2024-02-07 | OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding | Guibiao Liao et.al. | 2402.04648 | link |
2024-02-07 | Meet JEANIE: a Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment | Lei Wang et.al. | 2402.04599 | null |
2024-02-07 | LiDAR-Forest Dataset: LiDAR Point Cloud Simulation Dataset for Forestry Application | Yawen Lu et.al. | 2402.04546 | null |
2024-02-07 | Nodal fermions in a strongly spin-orbit coupled frustrated pyrochlore superconductor | Dongjin Oh et.al. | 2402.04509 | null |
2024-02-07 | A Review on Digital Pixel Sensors | Md Rahatul Islam Udoy et.al. | 2402.04507 | null |
2024-02-07 | MIRT: a simultaneous reconstruction and affine motion compensation technique for four dimensional computed tomography (4DCT) | Anh-Tuan Nguyen et.al. | 2402.04480 | null |
2024-02-06 | ARMAN: A Reconfigurable Monolithic 3D Accelerator Architecture for Convolutional Neural Networks | Ali Sedaghatgoo et.al. | 2402.04431 | null |
2024-02-06 | SKOOTR: A SKating, Omni-Oriented, Tripedal Robot | Adam Joshua Hung et.al. | 2402.04374 | null |
2024-02-06 | 3D printer-controlled syringe pumps for dual, active, regulable and simultaneous dispensing of reagents. Manufacturing of immunochromatographic test strips | Gabriel Siano et.al. | 2402.04354 | null |
2024-02-06 | Characterization of a Transmon Qubit in a 3D Cavity for Quantum Machine Learning and Photon Counting | Alessandro D’Elia et.al. | 2402.04322 | null |
2024-02-06 | Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private Sources | Jinlong Li et.al. | 2402.04273 | link |
2024-02-06 | Instance by Instance: An Iterative Framework for Multi-instance 3D Registration | Xinyue Cao et.al. | 2402.04195 | null |
2024-02-06 | 3D Volumetric Super-Resolution in Radiology Using 3D RRDB-GAN | Juhyung Ha et.al. | 2402.04171 | null |
2024-02-06 | Human Emotions Analysis and Recognition Using EEG Signals in Response to 360 $^\circ$ Videos | Haseeb ur Rahman Abbasi et.al. | 2402.04142 | null |
2024-02-06 | Reducing two-level system dissipations in 3D superconducting Niobium resonators by atomic layer deposition and high temperature heat treatment | Yasmine Kalboussi et.al. | 2402.04137 | null |
2024-02-06 | VRMM: A Volumetric Relightable Morphable Head Model | Haotian Yang et.al. | 2402.04101 | null |
2024-02-06 | Improved Generalization of Weight Space Networks via Augmentations | Aviv Shamsian et.al. | 2402.04081 | link |
2024-02-06 | HEAM : Hashed Embedding Acceleration using Processing-In-Memory | Youngsuk Kim et.al. | 2402.04032 | null |
2024-02-06 | BioNet-XR: Biological Network Visualization Framework for Virtual Reality and Mixed Reality Environments | Busra Senderin et.al. | 2402.03946 | null |
2024-02-06 | EscherNet: A Generative Model for Scalable View Synthesis | Xin Kong et.al. | 2402.03908 | link |
2024-02-06 | Belief Scene Graphs: Expanding Partial Scenes with Objects through Computation of Expectation | Mario A. V. Saucedo et.al. | 2402.03840 | null |
2024-02-06 | Using Perspective-n-Point Algorithms for a Local Positioning System Based on LEDs and a QADA Receiver | Elena Aparicio-Esteve et.al. | 2402.03811 | null |
2024-02-09 | MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction | Heng Zhou et.al. | 2402.03762 | null |
2024-02-06 | An invariance constrained deep learning network for PDE discovery | Chao Chen et.al. | 2402.03747 | null |
2024-02-06 | Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos | Alfredo Rivero et.al. | 2402.03723 | null |
2024-02-06 | Attention-based Shape and Gait Representations Learning for Video-based Cloth-Changing Person Re-Identification | Vuong D. Nguyen et.al. | 2402.03716 | null |
2024-02-06 | ConUNETR: A Conditional Transformer Network for 3D Micro-CT Embryonic Cartilage Segmentation | Nishchal Sapkota et.al. | 2402.03695 | null |
2024-02-06 | 3Doodle: Compact Abstraction of Objects with 3D Strokes | Changwoon Choi et.al. | 2402.03690 | null |
2024-02-06 | BEAM: Beta Distribution Ray Denoising for Multi-view 3D Object Detection | Feng Liu et.al. | 2402.03634 | link |
2024-02-06 | The orbit of HD 142527 B is too compact to explain many of the disc features | M. Nowak et.al. | 2402.03595 | null |
2024-02-05 | Decoder-Only Image Registration | Xi Jia et.al. | 2402.03585 | link |
2024-02-07 | VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation | Jialu Li et.al. | 2402.03561 | null |
2024-02-05 | One-shot Neural Face Reenactment via Finding Directions in GAN’s Latent Space | Stella Bounareli et.al. | 2402.03553 | null |
2024-02-05 | nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model | Haifan Gong et.al. | 2402.03526 | link |
2024-02-05 | Curriculum reinforcement learning for quantum architecture search under hardware errors | Yash J. Patel et.al. | 2402.03500 | null |
2024-02-05 | Beyond Strong labels: Weakly-supervised Learning Based on Gaussian Pseudo Labels for The Segmentation of Ellipse-like Vascular Structures in Non-contrast CTs | Qixiang Ma et.al. | 2402.03492 | null |
2024-02-05 | Denoising Diffusion via Image-Based Rendering | Titas Anciukevicius et.al. | 2402.03445 | null |
2024-02-05 | Discrete Global Symmetries: Gauging and Twisted Compactification | Simone Giacomelli et.al. | 2402.03424 | null |
2024-02-05 | An end-to-end deep learning pipeline to derive blood input with partial volume corrections for automated parametric brain PET mapping | Rugved Chavan et.al. | 2402.03414 | null |
2024-02-05 | Perceptual Video Quality Assessment: A Survey | Xiongkuo Min et.al. | 2402.03413 | null |
2024-02-05 | AONeuS: A Neural Rendering Framework for Acoustic-Optical Sensor Fusion | Mohamad Qadri et.al. | 2402.03309 | null |
2024-02-07 | 4D Gaussian Splatting: Towards Efficient Novel View Synthesis for Dynamic Scenes | Yuanxing Duan et.al. | 2402.03307 | link |
2024-02-05 | A Lennard-Jones Layer for Distribution Normalization | Mulun Na et.al. | 2402.03287 | null |
2024-02-05 | SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM | Mingrui Li et.al. | 2402.03246 | link |
2024-02-05 | ActiveAnno3D – An Active Learning Framework for Multi-Modal 3D Object Detection | Ahmed Ghita et.al. | 2402.03235 | null |
2024-02-05 | CT-based Anatomical Segmentation for Thoracic Surgical Planning: A Benchmark Study for 3D U-shaped Deep Learning Models | Arash Harirpoush et.al. | 2402.03230 | link |
2024-02-08 | IGUANe: a 3D generalizable CycleGAN for multicenter harmonization of brain MR images | Vincent Roca et.al. | 2402.03227 | link |
2024-02-05 | Spinning $Q$ -ball Superradiance in 3+1D | Guo-Dong Zhang et.al. | 2402.03193 | null |
2024-02-05 | GPU-Accelerated 3D Polygon Visibility Volumes for Synergistic Perception and Navigation | Andrew Willis et.al. | 2402.03135 | null |
2024-02-05 | Towards multiqudit quantum processor based on a $^{171}$Yb$^{+}$ ion string: Realizing basic quantum algorithms | Ilia V. Zalivako et.al. | 2402.03121 | null |
2024-02-08 | Taylor Videos for Action Recognition | Lei Wang et.al. | 2402.03019 | link |
2024-02-05 | Tracing d-d transitions in FePS $_{3}$ on ultrafast time scales | Jonah Elias Nitschke et.al. | 2402.03018 | null |
2024-02-05 | Retrieval-Augmented Score Distillation for Text-to-3D Generation | Junyoung Seo et.al. | 2402.02972 | link |
2024-02-05 | Controlling flow patterns and topology in active emulsions | Giuseppe Negro et.al. | 2402.02960 | null |
2024-02-05 | Digital Twin for Grey Box modeling of Multistory residential building thermal dynamics | Lina Morkunaite et.al. | 2402.02909 | null |
2024-02-05 | ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis | Bernard Spiegl et.al. | 2402.02906 | link |
2024-02-05 | Improving Robustness of LiDAR-Camera Fusion Model against Weather Corruption from Fusion Strategy Perspective | Yihao Huang et.al. | 2402.02738 | null |
2024-02-05 | The SAMI Galaxy Survey: Using Tidal Streams and Shells to Trace the Dynamical Evolution of Massive Galaxies | Tomas H. Rutherford et.al. | 2402.02728 | null |
2024-02-05 | 3D NLTE Lithium abundances for late-type stars in GALAH DR3 | Ella Xi Wang et.al. | 2402.02669 | null |
2024-02-04 | A 3D joint interpretation of magnetotelluric and seismic tomographic models: the case of the volcanic island of Tenerife | A. García-Yeguas et.al. | 2402.02610 | null |
2024-02-04 | SYK Correlators from 2D Liouville-de Sitter Gravity | Herman Verlinde et.al. | 2402.02584 | null |
2024-02-04 | Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning | Haoyi Zhu et.al. | 2402.02500 | link |
2024-02-04 | On local well-posedness of 3D ideal Hall-MHD system with an azimuthal magnetic field | Zijin Li et.al. | 2402.02451 | null |
2024-02-04 | EuLagNet: Eulerian Fluid Prediction with Lagrangian Dynamics | Qilong Ma et.al. | 2402.02425 | link |
2024-02-04 | Phase field cohesive zone modeling for fatigue crack propagation in quasi-brittle materials | A. Baktheer et.al. | 2402.02421 | null |
2024-02-04 | Multiplexed all-optical permutation operations using a reconfigurable diffractive optical network | Guangdong Ma et.al. | 2402.02397 | null |
2024-02-04 | Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation | Ti Wang et.al. | 2402.02339 | null |
2024-02-04 | CNS-Edit: 3D Shape Editing via Coupled Neural Shape Optimization | Jingyu Hu et.al. | 2402.02313 | null |
2024-02-03 | Operator Dimension Parity Fractionalization | Christopher W. Murphy et.al. | 2402.02195 | null |
2024-02-03 | RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and Reconstruction | Nikolaos Stathoulopoulos et.al. | 2402.02192 | null |
2024-02-03 | Feasibility of PET-enabled dual-energy CT imaging: First physical phantom and patient results | Yansong Zhu et.al. | 2402.02091 | null |
2024-02-03 | A Compact Gas-Kinetic Scheme with Scalable Geometric Multigrid Acceleration for Steady-State Computation on 3D Unstructured Meshes | Hongyu Liu et.al. | 2402.02075 | null |
2024-02-03 | RIDERS: Radar-Infrared Depth Estimation for Robust Sensing | Han Li et.al. | 2402.02067 | link |
2024-02-03 | Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks | Duy M. H. Nguyen et.al. | 2402.01975 | link |
2024-02-06 | Calibrated Uncertainty Quantification for Operator Learning via Conformal Prediction | Ziqi Ma et.al. | 2402.01960 | null |
2024-02-02 | ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields | Xingyu Miao et.al. | 2402.01950 | link |
2024-02-02 | Robust Inverse Graphics via Probabilistic Inference | Tuan Anh Le et.al. | 2402.01915 | link |
2024-02-02 | Onset of transmon ionization in microwave single-photon detection | Yuki Nojiri et.al. | 2402.01884 | null |
2024-02-02 | HyperPlanes: Hypernetwork Approach to Rapid NeRF Adaptation | Paweł Batorski et.al. | 2402.01524 | link |
2024-02-02 | Advancing Brain Tumor Inpainting with Generative Models | Ruizhi Zhu et.al. | 2402.01509 | null |
2024-02-02 | Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses | Mahboubeh Asadi et.al. | 2402.01485 | null |
2024-02-05 | Multi-level protein pre-training with Vabs-Net | Jiale Zhao et.al. | 2402.01481 | null |
2024-02-02 | The imprint of magnetic fields on absorption spectra from circumgalactic wind-cloud systems | Benedetta Casavecchia et.al. | 2402.01475 | null |
2024-02-02 | Scaled 360 layouts: Revisiting non-central panoramas | Bruno Berenguel-Baeta et.al. | 2402.01466 | null |
2024-02-02 | 3D Vertebrae Measurements: Assessing Vertebral Dimensions in Human Spine Mesh Models Using Local Anatomical Vertebral Axes | Ivanna Kramer et.al. | 2402.01462 | null |
2024-02-06 | GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting | Joanna Waczyńska et.al. | 2402.01459 | link |
2024-02-02 | The 3D structure of disc-instability protoplanets | Adam Fenton et.al. | 2402.01432 | null |
2024-02-02 | EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation | Guanwen Feng et.al. | 2402.01422 | null |
2024-02-02 | On the stability and exponential decay of the 3D MHD system with mixed partial dissipation near a equilibrium state | Xuemin Deng et.al. | 2402.01406 | null |
2024-02-02 | SiMA-Hand: Boosting 3D Hand-Mesh Reconstruction by Single-to-Multi-View Adaptation | Yinqiao Wang et.al. | 2402.01389 | link |
2024-02-02 | Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization | Zhiyu Zhang et.al. | 2402.01380 | null |
2024-02-02 | MagicTac: A Novel High-Resolution 3D Multi-layer Grid-Based Tactile Sensor | Wen Fan et.al. | 2402.01366 | null |
2024-02-02 | Quantum Griffiths singularity in three-dimensional MoTiN superconducting films | Zi-Xiao Wang et.al. | 2402.01347 | null |
2024-02-02 | A general framework for rotation invariant point cloud analysis | Shuqing Luo et.al. | 2402.01331 | link |
2024-02-02 | Can Shape-Infused Joint Embeddings Improve Image-Conditioned 3D Diffusion? | Cristian Sbrolli et.al. | 2402.01241 | null |
2024-02-02 | N=2 supersymmetry in the twistor description of higher-spin holography | Julian Lang et.al. | 2402.01228 | null |
2024-02-02 | HW-SW Optimization of DNNs for Privacy-preserving People Counting on Low-resolution Infrared Arrays | Matteo Risso et.al. | 2402.01226 | null |
2024-02-02 | More fundamental than the fundamental metallicity relation: The effect of the stellar metallicity on the gas-phase mass-metallicity and gravitational potential-metallicity relations | Laura Sánchez-Menguiano et.al. | 2402.01222 | null |
2024-02-02 | Structured World Modeling via Semantic Vector Quantization | Yi-Fu Wu et.al. | 2402.01203 | null |
2024-02-02 | DeepBranchTracer: A Generally-Applicable Approach to Curvilinear Structure Reconstruction Using Multi-Feature Learning | Chao Liu et.al. | 2402.01187 | link |
2024-02-02 | Mechanism of ferromagnetism enhancement in a La ${2/3}$ Sr${1/3}$ MnO$_3$ membrane released from epitaxial strain | Takahito Takeda et.al. | 2402.01179 | null |
2024-02-02 | Symmetry-selective quasiparticle scattering and electric field tunability of the ZrSiS surface electronic structure | Michael S. Lodge et.al. | 2402.01177 | null |
2024-02-02 | A Comprehensive Survey on 3D Content Generation | Jian Liu et.al. | 2402.01166 | link |
2024-02-02 | DeepAAT: Deep Automated Aerial Triangulation for Fast UAV-based Mapping | Zequan Chen et.al. | 2402.01134 | link |
2024-02-02 | A Survey for Foundation Models in Autonomous Driving | Haoxiang Gao et.al. | 2402.01105 | null |
2024-02-01 | Unconditional Latent Diffusion Models Memorize Patient Imaging Data | Salman Ul Hassan Dar et.al. | 2402.01054 | link |
2024-02-01 | Maximum energy achievable in supernova remnants: self-consistent simulations | Emily Simon et.al. | 2402.01048 | null |
2024-02-01 | Exoplanet Analog Observations of Earth from Galileo Disk-integrated Photometry | Ryder H. Strauss et.al. | 2402.00984 | null |
2024-02-01 | Enhanced fringe-to-phase framework using deep learning | Won-Hoe Kim et.al. | 2402.00977 | null |
2024-02-01 | On the behaviour of eccentric sub-pc massive black hole binaries embedded in massive discs | Alessia Franchini et.al. | 2402.00938 | null |
2024-02-01 | AToM: Amortized Text-to-Mesh using 2D Diffusion | Guocheng Qian et.al. | 2402.00867 | null |
2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Jiahua Dong et.al. | 2402.00864 | link |
2024-02-02 | Geometry Transfer for Stylizing Radiance Fields | Hyunyoung Jung et.al. | 2402.00863 | null |
2024-02-01 | 360-GS: Layout-guided Panoramic Gaussian Splatting For Indoor Roaming | Jiayang Bai et.al. | 2402.00763 | null |
2024-02-02 | Optimal Projection for 3D Gaussian Splatting | Letian Huang et.al. | 2402.00752 | null |
2024-02-01 | DRSM: efficient neural 4d decomposition for dynamic reconstruction in stationary monocular cameras | Weixing Xie et.al. | 2402.00740 | null |
2024-02-01 | Automatic Segmentation of the Spinal Cord Nerve Rootlets | Jan Valosek et.al. | 2402.00724 | link |
2024-02-01 | Polycube Layouts via Iterative Dual Loops | Maxim Snoep et.al. | 2402.00652 | link |
2024-02-01 | Double-scaled SYK, Chords and de Sitter Gravity | Herman Verlinde et.al. | 2402.00635 | null |
2024-02-01 | CapHuman: Capture Your Moments in Parallel Universes | Chao Liang et.al. | 2402.00627 | link |
2024-02-01 | A double scaling for the 4d/3d reduction of $\mathcal{N}=1$ dualities | Antonio Amariti et.al. | 2402.00613 | null |
2024-02-01 | Diffusion-based Light Field Synthesis | Ruisheng Gao et.al. | 2402.00575 | null |
2024-02-01 | Quasi-perpendicular shocks of galaxy clusters in hybrid kinetic simulations: The structure of the shocks | S. S. Boula et.al. | 2402.00571 | null |
2024-02-01 | StopThePop: Sorted Gaussian Splatting for View-Consistent Real-time Rendering | Lukas Radl et.al. | 2402.00525 | link |
2024-02-01 | EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models | Xuchen Pan et.al. | 2402.00518 | link |
2024-02-01 | Can you see me now? Blind spot estimation for autonomous vehicles using scenario-based simulation with random reference sensors | Marc Uecker et.al. | 2402.00467 | link |
2024-02-01 | The GREENBOT dataset: Multimodal mobile robotic dataset for a typical Mediterranean greenhouse | Fernando Cañadas-Aránega et.al. | 2402.00438 | null |
2024-02-01 | Image2Points:A 3D Point-based Context Clusters GAN for High-Quality PET Image Reconstruction | Jiaqi Cui et.al. | 2402.00376 | link |
2024-02-01 | Reimagining TaxiVis through an Immersive Space-Time Cube metaphor and reflecting on potential benefits of Immersive Analytics for urban data exploration | Jorge Wagner et.al. | 2402.00344 | link |
2024-02-02 | DARCS: Memory-Efficient Deep Compressed Sensing Reconstruction for Acceleration of 3D Whole-Heart Coronary MR Angiography | Zhihao Xue et.al. | 2402.00320 | null |
2024-02-02 | Geometry aware 3D generation from in-the-wild images in ImageNet | Qijia Shen et.al. | 2402.00225 | null |
2024-01-31 | Learning Based Dynamic Cluster Reconfiguration for UAV Mobility Management with 3D Beamforming | Irshad A. Meer et.al. | 2402.00224 | link |
2024-01-31 | Signatures of convection in the atmospheres of cool evolved stars | Andrea Chiavassa et.al. | 2402.00187 | null |
2024-01-31 | Distance and Collision Probability Estimation from Gaussian Surface Models | Kshitij Goel et.al. | 2402.00186 | null |
2024-01-31 | Weakly-Supervised Detection of Bone Lesions in CT | Tao Sheng et.al. | 2402.00175 | null |
2024-01-31 | Accretion of Galaxies around Supermassive Black Holes and a Theoretical Model of the Tully-Fisher and M-Sigma Relations | Nick Gorkavyi et.al. | 2402.00142 | null |
2024-01-31 | Improved Scene Landmark Detection for Camera Localization | Tien Do et.al. | 2401.18083 | link |
2024-01-31 | Strong Bow Shocks: Turbulence and An Exact Self-Similar Asymptotic | Marcus DuPont et.al. | 2401.18080 | null |
2024-01-31 | CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting | Jiezhi Yang et.al. | 2401.18075 | null |
2024-01-31 | GODMAX: Modeling gas thermodynamics and matter distribution using JAX | Shivam Pandey et.al. | 2401.18072 | link |
2024-01-31 | Radiatively Cooled Magnetic Reconnection Experiments Driven by Pulsed Power | R Datta et.al. | 2401.17923 | null |
2024-01-31 | ReplaceAnything3D:Text-Guided 3D Scene Editing with Compositional Neural Radiance Fields | Edward Bartrum et.al. | 2401.17895 | null |
2024-02-02 | VR-based generation of photorealistic synthetic data for training hand-object tracking models | Chengyan Zhang et.al. | 2401.17874 | null |
2024-02-01 | Segment Anything in 3D Gaussians | Xu Hu et.al. | 2401.17857 | link |
2024-01-31 | On the evaluation of the suitability of the materials used to 3D print holographic acoustic lenses to correct transcranial focused ultrasound aberrations | Marcelino Ferri et.al. | 2401.17818 | null |
2024-01-31 | On melting for the 3D radial Stefan problem | Chencheng Zhang et.al. | 2401.17811 | null |
2024-01-31 | Advances in 3D Generation: A Survey | Xiaoyu Li et.al. | 2401.17807 | null |
2024-01-31 | Vision-Assisted Digital Twin Creation for mmWave Beam Management | Maximilian Arnold et.al. | 2401.17781 | null |
2024-01-31 | Three-Dimensional Electrode Integration with Microwave Sensors for Precise Microparticle Detection in Microfluidics | Yagmur Ceren Alatas et.al. | 2401.17774 | null |
2024-01-31 | Efficient Shape Formation by 3D Hybrid Programmable Matter: An Algorithm for Low Diameter Intermediate Structures | Kristian Hinnenthal et.al. | 2401.17734 | null |
2024-01-31 | 3D-Plotting Algorithm for Insects using YOLOv5 | Daisuke Mori et.al. | 2401.17714 | null |
2024-01-31 | Printed Sensing: Assessing 3D-Printed Electrodes for Measuring Electrodermal Activity | Martin Schmitz et.al. | 2401.17709 | null |
2024-01-31 | Towards the implementation of Industry 4.0: A methodology-based approach oriented to the customer life cycle | Víctor Julio Ramírez-Durán et.al. | 2401.17661 | null |
2024-01-31 | Topology-Aware Latent Diffusion for 3D Shape Generation | Jiangbei Hu et.al. | 2401.17603 | null |
2024-01-31 | Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model | Yafei Dong et.al. | 2401.17593 | null |
2024-01-31 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-31 | Formation Mechanism of Laser-Driven Magnetized “Pillars of Creation” | Zhu Lei et.al. | 2401.17561 | null |
2024-01-30 | AdvGPS: Adversarial GPS for Multi-Agent Perception Attack | Jinlong Li et.al. | 2401.17499 | link |
2024-01-30 | CLAIRE: Scalable GPU-Accelerated Algorithms for Diffeomorphic Image Registration in 3D | Andreas Mang et.al. | 2401.17493 | null |
2024-01-30 | Pixel to Elevation: Learning to Predict Elevation Maps at Long Range using Images for Autonomous Offroad Navigation | Chanyoung Chung et.al. | 2401.17484 | null |
2024-01-30 | EchoWrist: Continuous Hand Pose Tracking and Hand-Object Interaction Recognition Using Low-Power Active Acoustic Sensing On a Wristband | Chi-Jung Lee et.al. | 2401.17409 | null |
2024-01-30 | ATPPNet: Attention based Temporal Point cloud Prediction Network | Kaustab Pal et.al. | 2401.17399 | null |
2024-01-30 | Entropic $F$ -function of 3D Ising conformal field theory via the fuzzy sphere regularization | Liangdong Hu et.al. | 2401.17362 | null |
2024-01-30 | In situ investigation of growth modes during plasma-assisted molecular beam epitaxy of (0001)GaN | G. Koblmüller et.al. | 2401.17341 | null |
2024-01-30 | SRG/eROSITA 3D mapping of the ISM using X-ray absorption spectroscopy | E. Gatuzz et.al. | 2401.17284 | null |
2024-01-30 | ContactGen: Contact-Guided Interactive 3D Human Generation for Partners | Dongjun Gu et.al. | 2401.17212 | null |
2024-01-30 | Self-Supervised Representation Learning for Nerve Fiber Distribution Patterns in 3D-PLI | Alexander Oberstrass et.al. | 2401.17207 | null |
2024-01-30 | Euler transformation for multiple $q$-hypergeometric series from wall-crossing formula of $K$ -theoretic vortex partition function | Yutaka Yoshida et.al. | 2401.17198 | null |
2024-01-30 | Multi-Camera Asynchronous Ball Localization and Trajectory Prediction with Factor Graphs and Human Poses | Qingyu Xiao et.al. | 2401.17185 | null |
2024-01-30 | Optical Tactile Sensing for Aerial Multi-Contact Interaction: Design, Integration, and Evaluation | Emanuele Aucone et.al. | 2401.17149 | null |
2024-01-30 | Physical Priors Augmented Event-Based 3D Reconstruction | Jiaxu Wang et.al. | 2401.17121 | link |
2024-01-30 | Momentum Matching for 2D-3D Heterogeneous Ohmic van der Waals Contact | Tara Jabegu et.al. | 2401.17114 | null |
2024-01-30 | Non-central panorama indoor dataset | Bruno Berenguel-Baeta et.al. | 2401.17075 | link |
2024-01-30 | OmniSCV: An Omnidirectional Synthetic Image Generator for Computer Vision | Bruno Berenguel-Baeta et.al. | 2401.17061 | link |
2024-01-30 | Atlanta Scaled layouts from non-central panoramas | Bruno Berenguel-Baeta et.al. | 2401.17058 | link |
2024-01-31 | BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation | Zhennan Wu et.al. | 2401.17053 | link |
2024-01-31 | Prediction of ambient pressure superconductivity in cubic ternary hydrides with MH $_6$ octahedra | Feng Zheng et.al. | 2401.17024 | null |
2024-01-30 | Near-Field Fading Channel Modeling for ELAAs: From Communication to ISAC | Jiuyu Liu et.al. | 2401.17014 | null |
2024-01-30 | Deep 3D World Models for Multi-Image Super-Resolution Beyond Optical Flow | Luca Savant Aira et.al. | 2401.16972 | null |
2024-01-30 | Linear stability analysis of compressible boundary layer over an insulated wall: Existence of multiple new unstable modes for Mach number beyond 3 | Neha Chaturvedi et.al. | 2401.16939 | null |
2024-01-30 | 3D-Printed Hydraulic Fluidic Logic Circuitry for Soft Robots | Yuxin Lin et.al. | 2401.16827 | null |
2024-01-30 | An Embeddable Implicit IUVD Representation for Part-based 3D Human Surface Reconstruction | Baoxing Li et.al. | 2401.16810 | null |
2024-01-30 | Kinesthetic-based In-Hand Object Recognition with an Underactuated Robotic Hand | Julius Arolovitch et.al. | 2401.16802 | null |
2024-01-30 | A Literature Review on Fetus Brain Motion Correction in MRI | Haoran Zhang et.al. | 2401.16782 | null |
2024-01-30 | All-optical complex field imaging using diffractive processors | Jingxi Li et.al. | 2401.16779 | null |
2024-01-30 | BoostDream: Efficient Refining for High-Quality Text-to-3D Generation from Multi-View Diffusion | Yonghao Yu et.al. | 2401.16764 | null |
2024-01-30 | Evidence of electron correlation and unusual spectral evolution in an exotic superconductor, PdTe | Ram Prakash Pandeya et.al. | 2401.16724 | null |
2024-01-30 | Layer group classification of two-dimensional materials | Jingheng Fu et.al. | 2401.16705 | null |
2024-01-30 | Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers | Jianbin Jiao et.al. | 2401.16700 | link |
2024-01-30 | VR-GS: A Physical Dynamics-Aware Interactive Gaussian Splatting System in Virtual Reality | Ying Jiang et.al. | 2401.16663 | null |
2024-01-30 | The Why, When, and How to Use Active Learning in Large-Data-Driven 3D Object Detection for Safe Autonomous Driving: An Empirical Exploration | Ross Greer et.al. | 2401.16634 | null |
2024-01-29 | ReLoki: Infrastructure-free Distributed Relative Localization using On-board UWB Antenna Arrays | Joseph Prince Mathew et.al. | 2401.16599 | null |
2024-01-29 | Discrete and semi-discrete multidimensional solitons and vortices: Established results and novel findings | Boris A. Malomed et.al. | 2401.16550 | null |
2024-01-29 | DressCode: Autoregressively Sewing and Generating Garments from Text Guidance | Kai He et.al. | 2401.16465 | null |
2024-01-29 | Print-N-Grip: A Disposable, Compliant, Scalable and One-Shot 3D-Printed Multi-Fingered Robotic Hand | Alon Laron et.al. | 2401.16463 | null |
2024-01-31 | Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting | Yiming Huang et.al. | 2401.16416 | link |
2024-01-29 | SuNeRF: 3D reconstruction of the solar EUV corona using Neural Radiance Fields | Robert Jarolim et.al. | 2401.16388 | null |
2024-01-29 | A new numerical method for scalar eigenvalue problems in heterogeneous, dispersive, sign-changing materials | Martin Halla et.al. | 2401.16368 | null |
2024-01-29 | Evaluation of pseudo-healthy image reconstruction for anomaly detection with deep generative models: Application to brain FDG PET | Ravi Hassanaly et.al. | 2401.16363 | link |
2024-01-29 | Synthesis of 3D on-air signatures with the Sigma-Lognormal model | Miguel A. Ferrer et.al. | 2401.16329 | null |
2024-01-29 | MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection | Yuxue Yang et.al. | 2401.16305 | link |
2024-01-29 | Leveraging Positional Encoding for Robust Multi-Reference-Based Object 6D Pose Estimation | Jaewoo Park et.al. | 2401.16284 | null |
2024-01-29 | Viscous Mechano-Electric Response of Ferroelectric Nematic Liquid | Peter Medle Rupnik et.al. | 2401.16272 | null |
2024-01-29 | Reconstructing Close Human Interactions from Multiple Views | Qing Shuai et.al. | 2401.16173 | link |
2024-01-29 | Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields | Rongkai Ma et.al. | 2401.16144 | null |
2024-01-29 | Light-field imaging from position-momentum correlations | Davide Giannella et.al. | 2401.16129 | null |
2024-01-29 | DeFlow: Decoder of Scene Flow Network in Autonomous Driving | Qingwen Zhang et.al. | 2401.16122 | link |
2024-01-29 | Towards Scenario Generalization for Vision-based Roadside 3D Object Detection | Lei Yang et.al. | 2401.16110 | link |
2024-01-29 | Flexible Parallel Neural Network Architecture Model for Early Prediction of Lithium Battery Life | Lidang Jiang et.al. | 2401.16102 | null |
2024-01-29 | Circular-ribbon flares and the related activities | Qingmin Zhang et.al. | 2401.16101 | null |
2024-01-29 | Domain adaptation strategies for 3D reconstruction of the lumbar spine using real fluoroscopy data | Sascha Jecklin et.al. | 2401.16027 | null |
2024-01-29 | Combined track finding with GNN & CKF | Lukas Heinrich et.al. | 2401.16016 | null |
2024-01-29 | Alpha Centauri: Disc Dynamics, Planet Stability, Detectability | Nicolás Cuello et.al. | 2401.16003 | null |
2024-01-29 | The phase-space distribution of the M81 satellite system | Oliver Müller et.al. | 2401.16002 | null |
2024-01-29 | AccessLens: Auto-detecting Inaccessibility of Everyday Objects | Nahyun Kwon et.al. | 2401.15996 | null |
2024-01-29 | Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling | Yuze Hao et.al. | 2401.15987 | link |
2024-01-29 | Suppression of blow-up in the 3D Patlak-Keller-Segel-Navier-Stokes system via non-parallel shear flows | Shikun Cui et.al. | 2401.15982 | null |
2024-01-29 | StableIdentity: Inserting Anybody into Anywhere at First Sight | Qinghe Wang et.al. | 2401.15975 | link |
2024-01-29 | Motion-induced error reduction for high-speed dynamic digital fringe projection system | Sanghoon Jeon et.al. | 2401.15938 | null |
2024-01-29 | MV2MAE: Multi-View Video Masked Autoencoders | Ketul Shah et.al. | 2401.15900 | null |
2024-01-29 | 3DPFIX: Improving Remote Novices’ 3D Printing Troubleshooting through Human-AI Collaboration | Nahyun Kwon et.al. | 2401.15877 | null |
2024-01-29 | LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection | Sifan Zhou et.al. | 2401.15865 | link |
2024-01-29 | 2L3: Lifting Imperfect Generated 2D Images into Accurate 3D | Yizheng Chen et.al. | 2401.15841 | null |
2024-01-29 | Efficient and high-performance routing of lattice-surgery paths on three-dimensional lattice | Kou Hamada et.al. | 2401.15829 | null |
2024-01-28 | An objective comparison of methods for augmented reality in laparoscopic liver resection by preoperative-to-intraoperative image fusion | Sharib Ali et.al. | 2401.15753 | null |
2024-01-28 | SegmentAnyTree: A sensor and platform agnostic deep learning model for tree segmentation using laser scanning data | Maciej Wielgosz et.al. | 2401.15739 | null |
2024-01-28 | 3D code for MAgneto-Thermal evolution in Isolated Neutron Stars, MATINS: thermal evolution and lightcurves | Stefano Ascenzi et.al. | 2401.15711 | null |
2024-01-28 | Spatial profile of plasma temperature generated by discharge in 3D printed capillary | Niv Barkai et.al. | 2401.15705 | null |
2024-01-28 | On the Itô-Stratonovich Diffusion Limit for the Magnetic Field in a 3D Thin Domain | Federico Butori et.al. | 2401.15701 | null |
2024-01-28 | Multidimensional localized states in externally driven Kerr cavities with a parabolic spatiotemporal potential: a dimensional connection | Yifan Sun et.al. | 2401.15689 | null |
2024-01-30 | Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance | Qingcheng Zhao et.al. | 2401.15687 | null |
2024-01-28 | Cooperative Receding Horizon 3D Coverage Control with a Team of Networked Aerial Agents | Savvas Papaioannou et.al. | 2401.15674 | null |
2024-01-28 | Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras | Yu-Jhe Li et.al. | 2401.15616 | null |
2024-01-28 | Discharge quenching mechanism and RPWELL performance with tunable 3D printed resistive plates | Abhik Jash et.al. | 2401.15611 | null |
2024-01-28 | Multi-beam phase mask optimization for holographic volumetric additive manufacturing | Chi Chung Li et.al. | 2401.15590 | null |
2024-01-28 | A High-Throughput Dark-Field Full-Field OCT System for Measuring Objects with Different Scattered Light Intensities | Youlong Fan et.al. | 2401.15575 | null |
2024-01-28 | Magnetic interactions and excitations in SrMnSb $_2$ | Zhenhua Ning et.al. | 2401.15572 | null |
2024-01-28 | The Second Order 2D Behaviors of a 3D Bose Gases in the Gross-Pitaevskii Regime | Xuwen Chen et.al. | 2401.15540 | null |
2024-01-27 | An Implicit Physical Face Model Driven by Expression and Style | Lingchen Yang et.al. | 2401.15414 | null |
2024-01-27 | Soft spots of net negative topological charge directly cause the plasticity of 3D glasses | Arabinda Bera et.al. | 2401.15359 | null |
2024-01-27 | Observation of an Abrupt 3D-2D Morphological Transition in Thin Al Layers Grown by MBE on InGaAs surface | A. Elbaroudy et.al. | 2401.15341 | null |
2024-01-27 | You Only Look Bottom-Up for Monocular 3D Object Detection | Kaixin Xiong et.al. | 2401.15319 | null |
2024-01-27 | Gaussian Splashing: Dynamic Fluid Synthesis with Gaussian Splatting | Yutao Feng et.al. | 2401.15318 | null |
2024-01-27 | A Survey on 3D Skeleton Based Person Re-Identification: Approaches, Designs, Challenges, and Future Directions | Haocong Rao et.al. | 2401.15296 | link |
2024-01-26 | GenPluSSS: A Genetic Algorithm Based Plugin for Measured Subsurface Scattering Representation | Barış Yıldırım et.al. | 2401.15245 | null |
2024-01-26 | Harnessing Deep Learning of Point Clouds for Inverse Control of 3D Shape Morphing | Jue Wang et.al. | 2401.15219 | null |
2024-01-26 | Entropy-calibrated stellar modeling: Testing and improving the use of prescriptions for entropy of adiabatic convection | L. Manchon et.al. | 2401.15172 | null |
2024-01-26 | Learning Neural Radiance Fields of Forest Structure for Scalable and Fine Monitoring | Juan Castorena et.al. | 2401.15029 | null |
2024-01-26 | Straight versus Spongy – Effect of Tortuosity on Polymer Imbibition into Nanoporous Matrices Assessed by Segmentation-Free Analysis of 3D Sample Reconstructions | Fernando Vazquez Luna et.al. | 2401.14950 | null |
2024-01-26 | DAM: Diffusion Activation Maximization for 3D Global Explanations | Hanxiao Tan et.al. | 2401.14938 | link |
2024-01-26 | PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus | Florian Kluger et.al. | 2401.14919 | link |
2024-01-26 | Thermal and kinetic coronal rain diagnostics with MgII h & k lines | M. Kriginsky et.al. | 2401.14859 | null |
2024-01-26 | LIV-GaussMap: LiDAR-Inertial-Visual Fusion for Real-time 3D Radiance Field Map Rendering | Sheng Hong et.al. | 2401.14857 | link |
2024-01-26 | Adaptive Point Transformer | Alessandro Baiocchi et.al. | 2401.14845 | null |
2024-01-26 | TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts | Jingyu Zhuang et.al. | 2401.14828 | null |
2024-01-26 | SimpleEgo: Predicting Probabilistic Body Pose from Egocentric Cameras | Hanz Cuevas-Velasquez et.al. | 2401.14785 | null |
2024-01-26 | Gradient descent optimization of acoustic holograms for transcranial focused ultrasound | Ahmed Sallam et.al. | 2401.14756 | null |
2024-01-26 | Probing the position-dependent optical energy fluence rate in 3D scattering samples | Ozan Akdemir et.al. | 2401.14748 | null |
2024-01-26 | Synthetic Multimodal Dataset for Empowering Safety and Well-being in Home Environments | Takanori Ugai et.al. | 2401.14743 | null |
2024-01-26 | 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field | Zhenyu Bao et.al. | 2401.14726 | link |
2024-01-26 | Effects of Magnetic Helicity on 3D Equilibira and Self-Organized States in KTX Reversed Field Pinch | Ke Liu et.al. | 2401.14604 | null |
2024-01-25 | TIFu: Tri-directional Implicit Function for High-Fidelity 3D Character Reconstruction | Byoungsung Lim et.al. | 2401.14565 | null |
2024-01-25 | CaRiNG: Learning Temporal Causal Representation under Non-Invertible Generation Process | Guangyi Chen et.al. | 2401.14535 | link |
2024-01-25 | RPNR: Robust-Perception Neural Reshading | Fouad Afiouni et.al. | 2401.14510 | null |
2024-01-25 | Magnetochronology of solar-type star dynamos | Quentin Noraz et.al. | 2401.14460 | null |
2024-01-25 | The Mass of the Large Magellanic Cloud from the Three-Dimensional Kinematics of its Globular Clusters | Laura L. Watkins et.al. | 2401.14458 | null |
2024-01-25 | Nucleosynthesis in magnetorotational supernovae: impact of the magnetic field configuration | M. Reichert et.al. | 2401.14402 | null |
2024-01-25 | Range-Agnostic Multi-View Depth Estimation With Keyframe Selection | Andrea Conti et.al. | 2401.14401 | link |
2024-01-25 | pix2gestalt: Amodal Segmentation by Synthesizing Wholes | Ege Ozguroglu et.al. | 2401.14398 | link |
2024-01-25 | UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models | Timo Kapsalis et.al. | 2401.14379 | null |
2024-01-25 | Spectral Gaps of 2D and 3D Many-body Quantum Systems in the Thermodynamic Limit | Illya V. Lukin et.al. | 2401.14368 | null |
2024-01-25 | Collapsing Domain Wall Networks: Impact on Pulsar Timing Arrays and Primordial Black Holes | Ricardo Z. Ferreira et.al. | 2401.14331 | null |
2024-01-25 | Viscoelasticty with physics-augmented neural networks: Model formulation and training methods without prescribed internal variables | Max Rosenkranz et.al. | 2401.14270 | null |
2024-01-27 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Minglin Chen et.al. | 2401.14257 | null |
2024-01-25 | Health Digital Twins Supported by Artificial Intelligence-based Algorithms and Extended Reality in Cardiology | Zofia Rudnicka et.al. | 2401.14208 | null |
2024-01-25 | Dynamic image reconstruction in MPI with RESESOP-Kaczmarz | Marius Nitzsche et.al. | 2401.14202 | null |
2024-01-25 | Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks | Tianhe Ren et.al. | 2401.14159 | link |
2024-01-25 | Micro and Nano 3D investigation of complex gut alterations-dementia interplay | F. Palermo et.al. | 2401.14139 | null |
2024-01-25 | Attention-based Efficient Classification for 3D MRI Image of Alzheimer’s Disease | Yihao Lin et.al. | 2401.14130 | null |
2024-01-25 | MIFI: MultI-camera Feature Integration for Roust 3D Distracted Driver Activity Recognition | Jian Kuang et.al. | 2401.14115 | link |
2024-01-25 | The radiative and dynamical impact of clouds in the atmosphere of the hot Jupiter WASP-43 b | Lucas Teinturier et.al. | 2401.14083 | null |
2024-01-25 | A real-time rendering method for high albedo anisotropic materials with multiple scattering | Shun Fang et.al. | 2401.14051 | null |
2024-01-25 | A promising candidate for ising ferromagnetism of two-dimensional kagome V $_2$O$_3$ honeycomb monolayer | Fazle Subhan et.al. | 2401.14035 | null |
2024-01-25 | GauU-Scene: A Scene Reconstruction Benchmark on Large Scale 3D Reconstruction Dataset Using Gaussian Splatting | Butian Xiong et.al. | 2401.14032 | null |
2024-01-25 | An inf-sup Approach to $C_0$ -Semigroup Generation for An Interactive Composite Structure-Stokes PDE Dynamics | Pelin G. Geredeli et.al. | 2401.13962 | null |
2024-01-25 | TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images | Jia Wan et.al. | 2401.13961 | null |
2024-01-25 | Effects of Chemical Short Range Order on Percolation in Binary Alloys | Abhinav Roy et.al. | 2401.13954 | null |
2024-01-25 | Towards 3D Molecule-Text Interpretation in Language Models | Sihang Li et.al. | 2401.13923 | link |
2024-01-25 | Spatiotemporal optical vortices with controllable radial and azimuthal quantum numbers | Xin Liu et.al. | 2401.13910 | null |
2024-01-25 | 3d gravity from Virasoro TQFT: Holography, wormholes and knots | Scott Collier et.al. | 2401.13900 | null |
2024-01-25 | Two results on the differential energy equality in viscous incompressible fluids | M. -C. Lee et.al. | 2401.13899 | null |
2024-01-25 | AscDAMs: Advanced SLAM-based channel detection and mapping system | Tengfei Wang et.al. | 2401.13877 | null |
2024-01-25 | Shell topology optimization based on level set method | Hiroki Kobayashi et.al. | 2401.13868 | null |
2024-01-24 | Interplay Between Neutrino Kicks and Hydrodynamic Kicks of Neutron Stars and Black Holes | H. -Thomas Janka et.al. | 2401.13817 | null |
2024-01-24 | Synthetic Waveform Generation for Satellite, HAPS, and 5G Base Station Positioning Reference Signal Using QuaDRiGa | Hongzhao Zheng et.al. | 2401.13791 | null |
2024-01-24 | S2TPVFormer: Spatio-Temporal Tri-Perspective View for temporally coherent 3D Semantic Occupancy Prediction | Sathira Silva et.al. | 2401.13785 | null |
2024-01-24 | Non-Hermitian Linear Electrooptic Effect in 3D materials | Tiago A. Morgado et.al. | 2401.13764 | null |
2024-01-24 | Experimental validation of ultra-shortened 3D finite element electromagnetic modeling of three-core armored cables at power frequency | Juan Carlos del-Pino-López et.al. | 2401.13761 | null |
2024-01-24 | Holographic Volumetric Additive Manufacturing | Maria I. Álvarez-Castaño et.al. | 2401.13755 | null |
2024-01-24 | One-dimensional model potentials optimized for the calculation of the HHG spectrum | Krisztina Sallai et.al. | 2401.13724 | null |
2024-01-24 | Hamiltonian, Geometric Momentum and Force Operators for a Spin Zero Particle on a Curve: Physical Approach | M. S. Shikakhwa et.al. | 2401.13664 | null |
2024-01-24 | Considerations and findings on beam vorticity dynamics | L. Groening et.al. | 2401.13644 | null |
2024-01-24 | Winding Clearness for Differentiable Point Cloud Optimization | Dong Xiao et.al. | 2401.13639 | null |
2024-01-24 | Inorganic/inorganic composites through emulsion templating | Tianhui Jiang et.al. | 2401.13638 | null |
2024-01-25 | SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation | Zhaohu Xing et.al. | 2401.13560 | link |
2024-01-24 | Tuning of Charge Order by Uniaxial Stress in a Cuprate Superconductor | Laure Thomarat et.al. | 2401.13526 | null |
2024-01-24 | Experimental validation of ultra-shortened 3D finite element models for frequency-domain analyses of three-core armored cables | Juan Carlos del-Pino-López et.al. | 2401.13451 | null |
2024-01-24 | 3D NLTE modelling of Y and Eu. Centre-to-limb variation and solar abundances | N. Storm et.al. | 2401.13450 | null |
2024-01-24 | Magnetism and spin dynamics of an S=3/2 frustrated trillium lattice antiferromagnet K2CrTi(PO4)3 | J. Khatua et.al. | 2401.13445 | null |
2024-01-24 | GTAutoAct: An Automatic Datasets Generation Framework Based on Game Engine Redevelopment for Action Recognition | Xingyu Song et.al. | 2401.13414 | null |
2024-01-24 | Tunable circular dichroism through absorption in coupled optical modes of twisted triskelia nanostructures | Javier Rodriguez Alvarez et.al. | 2401.13378 | null |
2024-01-24 | EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction | Yangsen Chen et.al. | 2401.13352 | null |
2024-01-24 | Evaluation of the power frequency magnetic field generated by three-core armored cables through 3D finite element simulations | Juan Carlos del-Pino-López et.al. | 2401.13312 | null |
2024-01-24 | Loss Allocation in Submarine Armored Three-core HVAC Power Cables | Juan Carlos del-Pino-López et.al. | 2401.13268 | null |
2024-01-24 | Global well-posedness of 3D inhomogenous incompressible Navier-Stokes equations with density-dependent viscosity | Dongjuan Niu et.al. | 2401.13265 | null |
2024-01-24 | Novel 3D Reciprocal Space Visualization of Strain Relaxation in InSb on GaAs Substrates | T. Blaikie et.al. | 2401.13258 | null |
2024-01-24 | Style-Consistent 3D Indoor Scene Synthesis with Decoupled Objects | Yunfan Zhang et.al. | 2401.13203 | null |
2024-01-23 | Deep Spatiotemporal Clutter Filtering of Transthoracic Echocardiographic Images Using a 3D Convolutional Auto-Encoder | Mahdi Tabassian et.al. | 2401.13147 | link |
2024-01-23 | CIS-UNet: Multi-Class Segmentation of the Aorta in Computed Tomography Angiography via Context-Aware Shifted Window Self-Attention | Muhammad Imran et.al. | 2401.13049 | link |
2024-01-23 | GALA: Generating Animatable Layered Assets from a Single Scan | Taeksoo Kim et.al. | 2401.12979 | null |
2024-01-24 | Zero-Shot Learning for the Primitives of 3D Affordance in General Objects | Hyeonwoo Kim et.al. | 2401.12978 | link |
2024-01-23 | IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images | Zhi-Hao Lin et.al. | 2401.12977 | null |
2024-01-24 | Coverage Axis++: Efficient Inner Point Selection for 3D Shape Skeletonization | Zimeng Wang et.al. | 2401.12946 | link |
2024-01-23 | On Simplified 3D Finite Element Simulations of Three-core Armored Power Cables | Juan Carlos del-Pino-López et.al. | 2401.12943 | null |
2024-01-24 | PSAvatar: A Point-based Morphable Shape Model for Real-Time Head Avatar Creation with 3D Gaussian Splatting | Zhongyuan Zhao et.al. | 2401.12900 | link |
2024-01-23 | FocusFlow: 3D Gaze-Depth Interaction in Virtual Reality Leveraging Active Visual Depth Manipulation | Chenyang Zhang et.al. | 2401.12872 | null |
2024-01-23 | A database of physical therapy exercises with variability of execution collected by wearable sensors | Sara García-de-Villa et.al. | 2401.12868 | null |
2024-01-23 | Well-posedness of low regularity solutions for the 3D relativistic Euler equations | Huali Zhang et.al. | 2401.12796 | null |
2024-01-23 | Tuning Electronic and Optical Properties of 2D/3D Construction based on Hybrid Perovskites through Interfacial Charge Transfer: Towards Higher Efficiency Solar Cells | Hrishit Banerjee et.al. | 2401.12788 | null |
2024-01-23 | Two-View Topogram-Based Anatomy-Guided CT Reconstruction for Prospective Risk Minimization | Chang Liu et.al. | 2401.12725 | null |
2024-01-23 | Gas trap prediction from 3D seismic and well test data using machine learning | Dmitry Ivlev et.al. | 2401.12717 | null |
2024-01-23 | Pragmatic Communication in Multi-Agent Collaborative Perception | Yue Hu et.al. | 2401.12694 | null |
2024-01-23 | How do ionic superdiscs self-assemble in nanopores? | Zhuoqing Li et.al. | 2401.12663 | null |
2024-01-23 | Space-time unfitted finite elements on moving explicit geometry representations | Santiago Badia et.al. | 2401.12649 | null |
2024-01-24 | RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos | Hongchi Xia et.al. | 2401.12592 | null |
2024-01-23 | NeRF-AD: Neural Radiance Field with Attention-based Disentanglement for Talking Face Synthesis | Chongke Bi et.al. | 2401.12568 | null |
2024-01-23 | EndoGaussian: Gaussian Splatting for Deformable Surgical Scene Reconstruction | Yifan Liu et.al. | 2401.12561 | link |
2024-01-23 | Deep reinforcement transfer learning for active flow control of a 3D square cylinder under state dimension mismatch | Lei Yan et.al. | 2401.12543 | null |
2024-01-23 | Motion-enhanced Holography | Zhenxing Dong et.al. | 2401.12537 | link |
2024-01-23 | DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations | Dogyun Park et.al. | 2401.12517 | link |
2024-01-23 | Exploration and Improvement of Nerf-based 3D Scene Editing Techniques | Shun Fang et.al. | 2401.12456 | null |
2024-01-23 | Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration | Yifan Zhang et.al. | 2401.12452 | link |
2024-01-23 | Methods and strategies for improving the novel view synthesis quality of neural radiation field | Shun Fang et.al. | 2401.12451 | null |
2024-01-23 | InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction | Zhenxing Ming et.al. | 2401.12422 | link |
2024-01-22 | A model comparison of 2D Cartesian and 2D axisymmetric models for positive streamer discharges in air | Zhen Wang et.al. | 2401.12353 | null |
2024-01-22 | Knots of Darkness in Atmospheric Turbulence | D. G. Pires et.al. | 2401.12306 | null |
2024-01-22 | Connecting the Dots: Leveraging Spatio-Temporal Graph Neural Networks for Accurate Bangla Sign Language Recognition | Haz Sameen Shahgir et.al. | 2401.12210 | null |
2024-01-22 | Single-View 3D Human Digitalization with Large Reconstruction Models | Zhenzhen Weng et.al. | 2401.12175 | null |
2024-01-22 | SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities | Boyuan Chen et.al. | 2401.12168 | null |
2024-01-22 | Semi-supervised segmentation of land cover images using nonlinear canonical correlation analysis with multiple features and t-SNE | Hong Wei et.al. | 2401.12164 | null |
2024-01-22 | The accuracy of ALMA estimates of young disk radii and masses. Predicted observations from numerical simulations | Ngo-Duy Tung et.al. | 2401.12142 | null |
2024-01-22 | Matching biomolecular structures by registration of point clouds | Michael Habeck et.al. | 2401.12082 | null |
2024-01-22 | Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Trajectory Design for 3D UAV Tracking | Yujiao Zhu et.al. | 2401.12079 | null |
2024-01-22 | CloSe: A 3D Clothing Segmentation Dataset and Model | Dimitrije Antić et.al. | 2401.12051 | link |
2024-01-22 | Fourier Transporter: Bi-Equivariant Robotic Manipulation in 3D | Haojie Huang et.al. | 2401.12046 | null |
2024-01-22 | pH modulates friction memory effects in protein folding | Benjamin A. Dalton et.al. | 2401.12027 | null |
2024-01-22 | Experimental investigation and scale analysis on melting of salty ice in a 3D-printed cavity filled with porous media | Xiaotian Liand Yuming Wang et.al. | 2401.12009 | null |
2024-01-22 | Scaling Face Interaction Graph Networks to Real World Scenes | Tatiana Lopez-Guevara et.al. | 2401.11985 | null |
2024-01-22 | Single-Photon-Assisted Two-Photon Polymerization | Buse Unlu et.al. | 2401.11942 | null |
2024-01-22 | Mysterious non-detection of HeI (23S) transit absorption of GJ436b | M. S. Rumenskikh et.al. | 2401.11938 | null |
2024-01-22 | Large receptive field strategy and important feature extraction strategy in 3D object detection | Leichao Cui et.al. | 2401.11913 | null |
2024-01-22 | 3D Space Trajectories and beyond: Abstract Art Creation with 3D Printing | Thierry Dana-Picard et.al. | 2401.11909 | null |
2024-01-22 | PolySilicate Porous Organic Polymers (PSiPOPs), a new family of porous, ordered 3D reticular materials with polysilicate nodes and organic linkers | Jelle Jamoul et.al. | 2401.11893 | null |
2024-01-22 | Viscous Dissipation and Dynamics in Simulations of Rotating, Stratified Plane-layer Convection | Simon R. W. Lance et.al. | 2401.11883 | null |
2024-01-22 | First-principles Based 3D Virtual Simulation Testing for Discovering SOTIF Corner Cases of Autonomous Driving | Lehang Li et.al. | 2401.11876 | null |
2024-01-22 | MOSformer: Momentum encoder-based inter-slice fusion transformer for medical image segmentation | De-Xing Huang et.al. | 2401.11856 | null |
2024-01-22 | ExtruOnt: An ontology for describing a type of manufacturing machine for Industry 4.0 systems | Víctor Julio Ramírez-Durán et.al. | 2401.11848 | null |
2024-01-22 | Quasi-Two-Dimensional Drops | Tytti Kärki et.al. | 2401.11845 | null |
2024-01-22 | Two New Members of the Covalent Organic Frameworks Family: Crystalline 2D-Oxocarbon and 3D-Borocarbon Structures | N. Hassani et.al. | 2401.11843 | null |
2024-01-22 | Local Agnostic Video Explanations: a Study on the Applicability of Removal-Based Explanations to Video | F. Xavier Gaya-Morey et.al. | 2401.11796 | link |
2024-01-22 | Full-Body Motion Reconstruction with Sparse Sensing from Graph Perspective | Feiyu Yao et.al. | 2401.11783 | link |
2024-01-22 | All Inkjet-printed Organic Solar Cells on 3D Objects | Marc Steinberger et.al. | 2401.11778 | null |
2024-01-22 | Numerical Solutions for Stochastic Continuous-time Algebraic Riccati Equations | Tsung-Ming Huang et.al. | 2401.11774 | null |
2024-01-22 | MsSVT++: Mixed-scale Sparse Voxel Transformer with Center Voting for 3D Object Detection | Jianan Li et.al. | 2401.11718 | null |
2024-01-22 | Integrating 3D Slicer with a Dynamic Simulator for Situational Aware Robotic Interventions | Manish Sahu et.al. | 2401.11715 | null |
2024-01-22 | MVSFormer++: Revealing the Devil in Transformer’s Details for Multi-View Stereo | Chenjie Cao et.al. | 2401.11673 | link |
2024-01-22 | PointGL: A Simple Global-Local Framework for Efficient Point Cloud Analysis | Jianan Li et.al. | 2401.11650 | link |
2024-01-21 | A Survey on African Computer Vision Datasets, Topics and Researchers | Abdul-Hakeem Omotayo et.al. | 2401.11617 | link |
2024-01-21 | Multi-View Neural 3D Reconstruction of Micro-/Nanostructures with Atomic Force Microscopy | Shuo Chen et.al. | 2401.11541 | link |
2024-01-21 | Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting | Lingting Zhu et.al. | 2401.11535 | link |
2024-01-21 | Ti substitution on Fe sites significantly changes the electronic properties of orthorhombic LaFeO3 perovskites (A first-principles study) | Jesaya Situmeang et.al. | 2401.11530 | null |
2024-01-21 | 3D Imaging of Magnetic Domains in Nd2Fe14B using Scanning Hard X-Ray Nanotomography | Srutarshi Banerjee et.al. | 2401.11523 | null |
2024-01-21 | General Flow as Foundation Affordance for Scalable Robot Learning | Chengbo Yuan et.al. | 2401.11439 | link |
2024-01-21 | S $^3$ M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving | Zhiyuan Wu et.al. | 2401.11414 | null |
2024-01-21 | UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation | Qingdong He et.al. | 2401.11395 | link |
2024-01-20 | A versatile apparatus for simultaneous trapping of multiple species of ultracold atoms and ions to enable studies of low energy collisions and cold chemistry | Bubai Rahaman et.al. | 2401.11233 | null |
2024-01-20 | 3D Receiver for Molecular Communications in Internet of Organoids | Shaojie Zhang et.al. | 2401.11214 | null |
2024-01-20 | Towards Category Unification of 3D Single Object Tracking on Point Clouds | Jiahao Nie et.al. | 2401.11204 | null |
2024-01-24 | MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation | Nhat M. Hoang et.al. | 2401.11115 | link |
2024-01-20 | A family of rare-earth Quasi-One-Dimensional spin-chain compounds K2RENb5O15 (RE=Ce,Pr,Nd,Sm,Gd-Ho) with large interchain distance | Qingyuan Zeng et.al. | 2401.11091 | null |
2024-01-20 | UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures | Mingyuan Zhou et.al. | 2401.11078 | null |
2024-01-20 | On a spectral method for $β$ -particle bound excitation collisions in kilonovae | Ryan T. Wollaeger et.al. | 2401.11069 | null |
2024-01-20 | Make-A-Shape: a Ten-Million-scale 3D Shape Model | Ka-Hei Hui et.al. | 2401.11067 | link |
2024-01-19 | Equivariant Graph Neural Operator for Modeling 3D Dynamics | Minkai Xu et.al. | 2401.11037 | link |
2024-01-19 | Anti-Jahn-Teller disproportionation and prospects for spin-triplet superconductivity in d-element compounds | A. S. Moskvin et.al. | 2401.11028 | null |
2024-01-19 | Lattice dynamics of quasi-2D perovskites from first-principles | Emily Y. Chen et.al. | 2401.10994 | link |
2024-01-19 | How well does surface magnetism represent deep Sun-like star dynamo action? | Adam J. Finley et.al. | 2401.10984 | null |
2024-01-19 | Synthesizing Moving People with 3D Control | Boyi Li et.al. | 2401.10889 | null |
2024-01-19 | SCENES: Subpixel Correspondence Estimation With Epipolar Supervision | Dominik A. Kloepfer et.al. | 2401.10886 | null |
2024-01-19 | Source-Free and Image-Only Unsupervised Domain Adaptation for Category Level Object Pose Estimation | Prakhar Kaushik et.al. | 2401.10848 | null |
2024-01-19 | TDC-less Direct Time-of-Flight Imaging Using Spiking Neural Networks | Jack MacLean et.al. | 2401.10793 | null |
2024-01-19 | Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion | Zuoyue Li et.al. | 2401.10786 | null |
2024-01-19 | Dense 3D Reconstruction Through Lidar: A Comparative Study on Ex-vivo Porcine Tissue | Guido Caccianiga et.al. | 2401.10709 | null |
2024-01-19 | On the penetration of large-scale flows into stellar radiative zones | Lydia Korre et.al. | 2401.10675 | null |
2024-01-19 | Assessment of the Axial Resolution of a Compact Gamma Camera with Coded Aperture Collimator | Tobias Meißner et.al. | 2401.10633 | null |
2024-01-19 | 3d TQFTs and 3-manifold invariants | Kursat Sozer et.al. | 2401.10587 | null |
2024-01-19 | 3D Shape Completion on Unseen Categories:A Weakly-supervised Approach | Lintai Wu et.al. | 2401.10578 | link |
2024-01-19 | 3D Room Geometry Inference from Multichannel Room Impulse Response using Deep Neural Network | Inmo Yeon et.al. | 2401.10453 | null |
2024-01-18 | Treatment and Aging Studies of GaAs(111)B Substrates for van der Waals Chalcogenide Film Growth | Mingyu Yu et.al. | 2401.10425 | null |
2024-01-18 | DataViz3D: An Novel Method Leveraging Online Holographic Modeling for Extensive Dataset Preprocessing and Visualization | Jinli Duan et.al. | 2401.10416 | link |
2024-01-18 | TIPSY: Trajectory of Infalling Particles in Streamers around Young stars. Dynamical analysis of the streamers around S CrA and HL Tau | Aashish Gupta et.al. | 2401.10403 | null |
2024-01-18 | Line defects in conformal field theory: from weak to strong coupling | Julien Barrat et.al. | 2401.10336 | null |
2024-01-18 | Emission signatures from sub-pc Post-Newtonian binaries embedded in circumbinary discs | Alessia Franchini et.al. | 2401.10331 | null |
2024-01-18 | ParaHome: Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions | Jeonghwan Kim et.al. | 2401.10232 | null |
2024-01-18 | Enabling Efficient Equivariant Operations in the Fourier Basis via Gaunt Tensor Products | Shengjie Luo et.al. | 2401.10216 | link |
2024-01-18 | GPAvatar: Generalizable and Precise Head Avatar from Image(s) | Xuangeng Chu et.al. | 2401.10215 | link |
2024-01-18 | SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild | Andreas Engelhardt et.al. | 2401.10171 | null |
2024-01-18 | Residual Based Error Estimator for Chemical-Mechanically Coupled Battery Active Particles | Raphael Schoof et.al. | 2401.10135 | null |
2024-01-18 | Long time regularity for 3d gravity waves with vorticity | Daniel Ginsberg et.al. | 2401.10096 | null |
2024-01-18 | Depth Over RGB: Automatic Evaluation of Open Surgery Skills Using Depth Camera | Ido Zuckerman et.al. | 2401.10037 | null |
2024-01-18 | Discrete differential geometry-based model for nonlinear analysis of axisymmetric shells | Weicheng Huang et.al. | 2401.09954 | null |
2024-01-18 | Real-time and On-site Aerodynamics using Stereoscopic PIV and Deep Optical Flow Learning | Mohamed Elrefaie et.al. | 2401.09932 | null |
2024-01-18 | Stationary solutions to stochastic 3D Euler equations in Hölder space | Lin Lü et.al. | 2401.09894 | null |
2024-01-18 | Artificial Intelligence-based algorithms in medical image scan seg-mentation and intelligent visual-content generation – a concise overview | Zofia Rudnicka et.al. | 2401.09857 | null |
2024-01-18 | On the global well-posedness of 3D inhomogeneous incompressible Navier-Stokes system with density-dependent viscosity | Dongjuan Niu et.al. | 2401.09850 | null |
2024-01-18 | Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework | Junkun Jiang et.al. | 2401.09836 | link |
2024-01-18 | Measuring the Discrepancy between 3D Geometric Models using Directional Distance Fields | Siyu Ren et.al. | 2401.09736 | link |
2024-01-19 | Fast graph-based denoising for point cloud color information | Ryosuke Watanabe et.al. | 2401.09721 | null |
2024-01-18 | GaussianBody: Clothed Human Reconstruction via 3d Gaussian Splatting | Mengtian Li et.al. | 2401.09720 | null |
2024-01-18 | Spreading of Low-viscosity Ink Filaments Driven by Bath Viscoelasticity in Embedded Printing | Jae Hyung Cho et.al. | 2401.09684 | null |
2024-01-18 | Eye Motion Matters for 3D Face Reconstruction | Xuan Wang et.al. | 2401.09677 | link |
2024-01-18 | QoS-Aware 3D Coverage Deployment of UAVs for Internet of Vehicles in Intelligent Transportation | engfei Du et.al. | 2401.09674 | null |
2024-01-17 | Automatic 3D Multi-modal Ultrasound Segmentation of Human Placenta using Fusion Strategies and Deep Learning | Sonit Singh et.al. | 2401.09638 | null |
2024-01-17 | Rotationally symmetric transverse magnetic vector wave propagation for nonlinear optics | Caleb J. Grimms et.al. | 2401.09616 | null |
2024-01-17 | Implications of Vertical Stability Control on the SPARC Tokamak | A. O. Nelson et.al. | 2401.09613 | null |
2024-01-17 | LoS Coverage Analysis for UAV-based THz Communication Networks: Towards 3D Visualization of Wireless Networks | Mohammad Taghi Dabiri et.al. | 2401.09590 | null |
2024-01-17 | Enhancing Surveillance Camera FOV Quality via Semantic Line Detection and Classification with Deep Hough Transform | Andrew C. Freeman et.al. | 2401.09515 | null |
2024-01-17 | 4D-ONIX: A deep learning approach for reconstructing 3D movies from sparse X-ray projections | Yuhe Zhang et.al. | 2401.09508 | link |
2024-01-17 | Identifying Three-Dimensional Radiative Patterns Associated with Early Tropical Cyclone Intensification | Frederick Iat-Hin Tam et.al. | 2401.09493 | link |
2024-01-15 | 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data | Mathilde Letard et.al. | 2401.09481 | link |
2024-01-17 | GARField: Group Anything with Radiance Fields | Chung Min Kim et.al. | 2401.09419 | link |
2024-01-17 | TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion | Yu-Ying Yeh et.al. | 2401.09416 | null |
2024-01-17 | POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images | Antonin Vobecky et.al. | 2401.09413 | null |
2024-01-17 | Diverse Part Synthesis for 3D Shape Creation | Yanran Guan et.al. | 2401.09384 | link |
2024-01-17 | POE: Acoustic Soft Robotic Proprioception for Omnidirectional End-effectors | Uksang Yoo et.al. | 2401.09382 | null |
2024-01-17 | SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding | Baoxiong Jia et.al. | 2401.09340 | null |
2024-01-17 | FIT-SLAM – Fisher Information and Traversability estimation-based Active SLAM for exploration in 3D environments | Suchetan Saravanan et.al. | 2401.09322 | null |
2024-01-17 | 3D Scene Geometry Estimation from 360 $^\circ$ Imagery: A Survey | Thiago Lopes Trugillo da Silveira et.al. | 2401.09252 | null |
2024-01-18 | A regularity criterion for the 3D Boussinesq equations in homogeneous Besov spaces with negative indices | Mianlu Zou et.al. | 2401.09219 | null |
2024-01-17 | SM $^3$ : Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects | Haowen Wang et.al. | 2401.09133 | null |
2024-01-17 | Admittance Controller Complemented with Real-time Singularity Avoidance for Rehabilitation Parallel Robots | Jose L. Pulloquinga et.al. | 2401.09132 | null |
2024-01-17 | Numerical simulations of turbulence in prominence threads induced by torsional oscillations | Sergio Díaz-Suárez et.al. | 2401.09122 | null |
2024-01-17 | Dark-Bright Exciton Splitting Dominates Low-Temperature Diffusion in Halide Perovskite Nanocrystal Assemblies | Andreas J. Bornschlegl et.al. | 2401.09103 | null |
2024-01-17 | 3D orientation super-resolution spatial-frequency-shift microscopy | Xiaowei Liu et.al. | 2401.09085 | null |
2024-01-17 | A five field formulation for flow simulations in porous media with fractures and barriers via an optimization based domain decomposition method | Stefano Scialò et.al. | 2401.09072 | null |
2024-01-17 | Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior | Zike Wu et.al. | 2401.09050 | link |
2024-01-17 | Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis | Jonghyun Lee et.al. | 2401.09048 | link |
2024-01-17 | Visual Robotic Manipulation with Depth-Aware Pretraining | Wanying Wang et.al. | 2401.09038 | null |
2024-01-17 | Robot Tape Manipulation for 3D Printing | Nahid Tushar et.al. | 2401.08982 | null |
2024-01-17 | ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization | Weiyao Wang et.al. | 2401.08937 | null |
2024-01-17 | 3D Human Pose Analysis via Diffusion Synthesis | Haorui Ji et.al. | 2401.08930 | null |
2024-01-17 | Uncertainty-aware No-Reference Point Cloud Quality Assessment | Songlin Fan et.al. | 2401.08926 | null |
2024-01-17 | Electromagnetic Information Theory: Fundamentals and Applications for 6G Wireless Communication Systems | Cheng-Xiang Wang et.al. | 2401.08921 | null |
2024-01-16 | Benchmarking Particle Filter Algorithms for Efficient Velodyne-Based Vehicle Localization | Jose Luis Blanco-Claraco et.al. | 2401.08870 | null |
2024-01-16 | Learning Implicit Representation for Reconstructing Articulated Objects | Hao Zhang et.al. | 2401.08809 | link |
2024-01-16 | Binary fraction in Galactic star clusters: FSR 866, NGC 1960, and STOCK 2 | Lidia Yalyalieva et.al. | 2401.08797 | null |
2024-01-16 | Supersymmetric Virasoro Minimal Strings | Clifford V. Johnson et.al. | 2401.08786 | null |
2024-01-16 | Percolation as a confinement order parameter in $\mathbb{Z}_2$ lattice gauge theories | Simon M. Linsel et.al. | 2401.08770 | null |
2024-01-16 | Fast Dynamic 3D Object Generation from a Single-view Video | Zijie Pan et.al. | 2401.08742 | link |
2024-01-16 | EgoGen: An Egocentric Synthetic Data Generator | Gen Li et.al. | 2401.08739 | null |
2024-01-16 | Unsupervised Pre-Training for 3D Leaf Instance Segmentation | Gianmarco Roggiolani et.al. | 2401.08720 | null |
2024-01-13 | DA-BEV: Unsupervised Domain Adaptation for Bird’s Eye View Perception | Kai Jiang et.al. | 2401.08687 | null |
2024-01-16 | MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World | Yining Hong et.al. | 2401.08577 | null |
2024-01-17 | COOL-LAMPS. VII. Quantifying Strong-lens Scaling Relations with 177 Cluster-scale Gravitational Lenses in DECaLS | Simon D. Mork et.al. | 2401.08575 | null |
2024-01-16 | RoHM: Robust Human Motion Reconstruction via Diffusion | Siwei Zhang et.al. | 2401.08570 | null |
2024-01-16 | The density of the Milky Way’s corona at $z\approx 1.6$ through ram pressure stripping of the Draco dSph galaxy | Asger Grønnow et.al. | 2401.08563 | link |
2024-01-16 | Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation | Mathis Petrovich et.al. | 2401.08559 | null |
2024-01-16 | FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning | Jianlan Luo et.al. | 2401.08553 | null |
2024-01-16 | PPSURF: Combining Patches and Point Convolutions for Detailed Surface Reconstruction | Philipp Erler et.al. | 2401.08518 | link |
2024-01-16 | Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis | Zhenhui Ye et.al. | 2401.08503 | link |
2024-01-16 | Multiple-mode Quantum-anomalous-Hall and Chern-switchable States in Germanene (Stanene, Silicene)/MBi2Te4 Heterostructures | Zhe Li et.al. | 2401.08490 | null |
2024-01-16 | Generation mechanism and beaming of Jovian nKOM from 3D numerical modeling of Juno/Waves observations | Adam Boudouma et.al. | 2401.08471 | null |
2024-01-16 | Training and Comparison of nnU-Net and DeepMedic Methods for Autosegmentation of Pediatric Brain Tumors | Arastoo Vossough et.al. | 2401.08404 | link |
2024-01-16 | TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding | Yun Liu et.al. | 2401.08399 | null |
2024-01-16 | Investigating Mixing Efficiency in Droplets: A Comprehensive Study of Numerical Modeling and Experimental Testing in 3D-Printed Microfluidic Devices | Ali Kheirkhah Barzoki et.al. | 2401.08354 | null |
2024-01-16 | Stochastic 3D microstructure modeling of twinned polycrystals for investigating the mechanical behavior of $γ$ -TiAl intermetallics | Philipp Rieder et.al. | 2401.08349 | null |
2024-01-16 | Insights into Polycrystalline Microstructure of Blood Films with 3D Mueller Matrix Imaging Approach | Volodimyr A. Ushenko et.al. | 2401.08340 | null |
2024-01-16 | ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification | Zhongbin Fang et.al. | 2401.08210 | link |
2024-01-18 | ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Process | Kiyohiro Nakayama et.al. | 2401.08140 | null |
2024-01-16 | S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera | Thanh Nguyen Canh et.al. | 2401.08134 | null |
2024-01-16 | No-Clean-Reference Image Super-Resolution: Application to Electron Microscopy | Mohammad Khateri et.al. | 2401.08115 | null |
2024-01-16 | Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities | Xu Yan et.al. | 2401.08045 | link |
2024-01-16 | Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions | Yi-Fan Zuo et.al. | 2401.08043 | link |
2024-01-16 | 3D Lane Detection from Front or Surround-View using Joint-Modeling & Matching | Haibin Zhou et.al. | 2401.08036 | null |
2024-01-16 | Spatial Channel State Information Prediction with Generative AI: Towards Holographic Communication and Digital Radio Twin | Lihao Zhang et.al. | 2401.08023 | null |
2024-01-15 | Noncollinear electric dipoles in a polar, chiral phase of CsSnBr $_3$ : Existence and limits of bulk Rashba effects in perovskite halides | Douglas H. Fabini et.al. | 2401.07978 | null |
2024-01-15 | GD-CAF: Graph Dual-stream Convolutional Attention Fusion for Precipitation Nowcasting | Lorand Vatamany et.al. | 2401.07958 | link |
2024-01-15 | Transformer-based Video Saliency Prediction with High Temporal Dimension Decoding | Morteza Moradi et.al. | 2401.07942 | null |
2024-01-15 | From Pure Mathematics to Macroscale Applications: The Genesis of Schwarzites | Levi C. Felix et.al. | 2401.07884 | null |
2024-01-15 | $M^{2}$ Fusion: Bayesian-based Multimodal Multi-level Fusion on Colorectal Cancer Microsatellite Instability Prediction | Quan Liu et.al. | 2401.07854 | null |
2024-01-15 | MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation | Mi Yan et.al. | 2401.07745 | null |
2024-01-15 | HexaGen3D: StableDiffusion is just one step away from Fast and Diverse Text-to-3D Generation | Antoine Mercier et.al. | 2401.07727 | null |
2024-01-15 | The Largest Empty Sphere Problem in 3D Hollowed Point Clouds | Netzer Moriya et.al. | 2401.07593 | null |
2024-01-15 | PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation | Jiahui Zhong et.al. | 2401.07579 | link |
2024-01-15 | Exploiting GPT-4 Vision for Zero-shot Point Cloud Understanding | Qi Sun et.al. | 2401.07572 | null |
2024-01-15 | CascadeV-Det: Cascade Point Voting for 3D Object Detection | Yingping Liang et.al. | 2401.07477 | link |
2024-01-14 | Ultra-broadband Optical Diffraction Tomography | Martin Hörmann et.al. | 2401.07391 | null |
2024-01-14 | Electronic structure and magnetic correlations in trilayer nickelate superconductor La $4$Ni$_3$O${10}$ under pressure | I. V. Leonov et.al. | 2401.07350 | null |
2024-01-14 | SpineCLUE: Automatic Vertebrae Identification Using Contrastive Learning and Uncertainty Estimation | Sheng Zhang et.al. | 2401.07271 | null |
2024-01-14 | 3D Landmark Detection on Human Point Clouds: A Benchmark and A Dual Cascade Point Transformer Framework | Fan Zhang et.al. | 2401.07251 | null |
2024-01-14 | DCDet: Dynamic Cross-based 3D Object Detector | Shuai Liu et.al. | 2401.07240 | link |
2024-01-13 | Acoustic Three-dimensional Chern Insulators with Arbitrary Chern Vectors | Yang Linyun et.al. | 2401.07040 | null |
2024-01-13 | UniVision: A Unified Framework for Vision-Centric 3D Perception | Yu Hong et.al. | 2401.06994 | null |
2024-01-13 | Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization | Mengtian Li et.al. | 2401.06975 | null |
2024-01-13 | EVOKE: Emotion Enabled Virtual Avatar Mapping Using Optimized Knowledge Distillation | Maryam Nadeem et.al. | 2401.06957 | null |
2024-01-13 | 3D Object Detection and High-Resolution Traffic Parameters Extraction Using Low-Resolution LiDAR Data | Linlin Zhang et.al. | 2401.06946 | null |
2024-01-13 | The 3D Geometry of Reflection Nebulae IC 59 and IC 63 with their illuminating Star Gamma Cas | Jacob M. Eiermann et.al. | 2401.06941 | null |
2024-01-12 | Off-Shell Fields and Conserved Currents | E. O. Spirin et.al. | 2401.06933 | null |
2024-01-12 | Low-Rank Tensor Decomposition over Finite Fields | Jason Yang et.al. | 2401.06857 | null |
2024-01-12 | Physical Correlations and Predictions Emerging from Modern Core-Collapse Supernova Theory | Adam Burrows et.al. | 2401.06840 | null |
2024-01-12 | Solving the Discretised Multiphase Flow Equations with Interface Capturing on Structured Grids Using Machine Learning Libraries | Boyang Chen et.al. | 2401.06755 | link |
2024-01-12 | Scalable 3D Panoptic Segmentation With Superpoint Graph Clustering | Damien Robert et.al. | 2401.06704 | link |
2024-01-12 | A 3D picture of moist-convection inhibition in hydrogen-rich atmospheres: Implications for K2-18 b | Jérémy Leconte et.al. | 2401.06608 | null |
2024-01-12 | Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook | Ziying Song et.al. | 2401.06542 | null |
2024-01-12 | Ordering-Flexible Multi-Robot Coordination for MovingTarget Convoying Using Long-TermTask Execution | Bin-Bin Hu et.al. | 2401.06439 | null |
2024-01-12 | 3D-PreMise: Can Large Language Models Generate 3D Shapes with Sharp Features and Parametric Control? | Zeqing Yuan et.al. | 2401.06437 | null |
2024-01-12 | Joint Mechanical and Electrical Adjustment of IRS-aided LEO Satellite MIMO Communications | Doyoung Kim et.al. | 2401.06422 | null |
2024-01-12 | 3D Reconstruction of Interacting Multi-Person in Clothing from a Single Image | Junuk Cha et.al. | 2401.06415 | null |
2024-01-16 | Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model | Taehee Kim et.al. | 2401.06400 | link |
2024-01-12 | SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM optimization | Zhenlong Yuan et.al. | 2401.06385 | null |
2024-01-12 | Design and Nonlinear Modeling of a Modular Cable Driven Soft Robotic Arm | Xinda Qi et.al. | 2401.06377 | null |
2024-01-12 | FeS2 monolayer: a high valence and high- $T_{\rm C}$ Ising ferromagnet | Ke Yang et.al. | 2401.06357 | null |
2024-01-12 | MedTransformer: Accurate AD Diagnosis for 3D MRI Images through 2D Vision Transformers | Yifeng Wang et.al. | 2401.06349 | null |
2024-01-12 | AffordanceLLM: Grounding Affordance from Vision Language Models | Shengyi Qian et.al. | 2401.06341 | null |
2024-01-11 | Magnetic control of Weyl nodes and wave packets in three-dimensional warped semimetals | Bruno Focassio et.al. | 2401.06282 | null |
2024-01-11 | Segmentation of Mediastinal Lymph Nodes in CT with Anatomical Priors | Tejas Sudharshan Mathai et.al. | 2401.06272 | null |
2024-01-11 | Leveraging Frequency Domain Learning in 3D Vessel Segmentation | Xinyuan Wang et.al. | 2401.06224 | null |
2024-01-11 | xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein | Bo Chen et.al. | 2401.06199 | null |
2024-01-11 | TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation | Rajaei Khatib et.al. | 2401.06191 | null |
2024-01-11 | Prediction of Cellular Identities from Trajectory and Cell Fate Information | Baiyang Dai et.al. | 2401.06182 | link |
2024-01-10 | Machine Learning Applications in Spine Biomechanics | Farshid Ghezelbash et.al. | 2401.06174 | null |
2024-01-11 | A Wireless Ear EEG Drowsiness Monitor | Ryan Kaveh et.al. | 2401.06076 | null |
2024-01-11 | Fast High Dynamic Range Radiance Fields for Dynamic Scenes | Guanjun Wu et.al. | 2401.06052 | null |
2024-01-11 | RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks | Partha Ghosh et.al. | 2401.06035 | null |
2024-01-12 | Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery | Beilei Cui et.al. | 2401.06013 | link |
2024-01-11 | An Application of HEP Track Seeding to Astrophysical Data | Mine Gökçen et.al. | 2401.06011 | null |
2024-01-11 | TRIPS: Trilinear Point Splatting for Real-Time Radiance Field Rendering | Linus Franke et.al. | 2401.06003 | link |
2024-01-11 | A 3D Diffusive and Advective Model of Electron Transport Applied to the Pulsar Wind Nebula HESS J1825-137 | Tiffany Collins et.al. | 2401.06002 | null |
2024-01-11 | The multi-spacecraft high-energy solar particle event of 28 October 2021 | A. Kouloumvakos et.al. | 2401.05991 | null |
2024-01-14 | HybridOctree_Hex: Hybrid Octree-Based Adaptive All-Hexahedral Mesh Generation with Jacobian Control | Hua Tong et.al. | 2401.05984 | link |
2024-01-11 | UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization | Rouwan Wu et.al. | 2401.05971 | link |
2024-01-11 | CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians | Bin Dou et.al. | 2401.05925 | null |
2024-01-13 | Neural Implicit Surface Reconstruction for Freehand 3D Ultrasound Volumetric Point Clouds with Geometric Constraints | Hongbo Chen et.al. | 2401.05915 | null |
2024-01-11 | PartSTAD: 2D-to-3D Part Segmentation Task Adaptation | Hyunjin Kim et.al. | 2401.05906 | link |
2024-01-11 | LiDAR data acquisition and processing for ecology applications | Ion Ciobotari et.al. | 2401.05891 | null |
2024-01-11 | Self-navigated 3D diffusion MRI using an optimized CAIPI sampling and structured low-rank reconstruction | Ziyu Li et.al. | 2401.05844 | null |
2024-01-11 | GO-NeRF: Generating Virtual Objects in Neural Radiance Fields | Peng Dai et.al. | 2401.05750 | null |
2024-01-11 | Object-Centric Diffusion for Efficient Video Editing | Kumara Kahatapitiya et.al. | 2401.05735 | null |
2024-01-11 | Target search in the CRISPR/Cas9 system: Facilitated diffusion with target cues | Qiao Lu et.al. | 2401.05714 | null |
2024-01-11 | Probability-based Distance Estimation Model for 3D DV-Hop Localization in WSNs | Penghong Wang et.al. | 2401.05709 | null |
2024-01-11 | Orbital Hanle Magnetoresistance in a 3d Transition Metal | Giacomo Sala et.al. | 2401.05703 | null |
2024-01-10 | Reverse Projection: Real-Time Local Space Texture Mapping | Adrian Xuan Wei Lim et.al. | 2401.05593 | null |
2024-01-10 | Current Effect-eliminated Optimal Target Assignment and Motion Planning for a Multi-UUV System | Danjie Zhu et.al. | 2401.05521 | null |
2024-01-10 | FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields | GeonU Kim et.al. | 2401.05516 | null |
2024-01-10 | CADgpt: Harnessing Natural Language Processing for 3D Modelling to Enhance Computer-Aided Design Workflows | Timo Kapsalis et.al. | 2401.05476 | null |
2024-01-10 | Pyramidal Clustering Algorithms in ISO-3D Project | Oldemar Rodriguez et.al. | 2401.05473 | null |
2024-01-08 | A Survey of Designs for Combined 2D+3D Visual Representations | Jiayi Hong et.al. | 2401.05438 | null |
2024-01-10 | InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes | Mohamad Shahbazi et.al. | 2401.05335 | null |
2024-01-10 | Improved modelling of SEP event onset within the WSA-Enlil-SEPMOD framework | Erika Palmerio et.al. | 2401.05309 | null |
2024-01-10 | Score Distillation Sampling with Learned Manifold Corrective | Thiemo Alldieck et.al. | 2401.05293 | null |
2024-01-10 | 3D model for surface accumulation of chiral and non-chiral microswimmers | Danne M. van Roon et.al. | 2401.05237 | null |
2024-01-10 | Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects | Tianhang Cheng et.al. | 2401.05236 | link |
2024-01-12 | The magnetic field in colliding filaments G202.3+2.5 | Qi-Lao Gu et.al. | 2401.05079 | null |
2024-01-10 | Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection | Yucheng Han et.al. | 2401.05011 | link |
2024-01-10 | AdaFed: Fair Federated Learning via Adaptive Common Descent Direction | Shayan Mohajer Hamidi et.al. | 2401.04993 | null |
2024-01-10 | A Three-dimensional tumor growth model and its boundary instability | Jian-Guo Liu et.al. | 2401.04954 | link |
2024-01-10 | Diffusion-based Pose Refinement and Muti-hypothesis Generation for 3D Human Pose Estimaiton | Hongbo Kang et.al. | 2401.04921 | link |
2024-01-10 | Superconductivity in Ternary Zirconium Telluride Zr6MTe2 with 3d Transition Metals | Haruka Matsumoto et.al. | 2401.04870 | null |
2024-01-09 | 2024 Roadmap on Magnetic Microscopy Techniques and Their Applications in Materials Science | D. V. Christensen et.al. | 2401.04793 | null |
2024-01-09 | DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation | Junming Chen et.al. | 2401.04747 | null |
2024-01-09 | A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars | Ronglai Zuo et.al. | 2401.04730 | link |
2024-01-09 | Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation | Xiyi Chen et.al. | 2401.04728 | link |
2024-01-09 | U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation | Jun Ma et.al. | 2401.04722 | null |
2024-01-09 | Protected Weyl semimetals within 2D chiral classes | Faruk Abdulla et.al. | 2401.04656 | null |
2024-01-09 | Plasmoid formation and strong radiative cooling in a driven magnetic reconnection experiment | R. Datta et.al. | 2401.04643 | null |
2024-01-10 | Translational eigenstates of He@C $_{60}$ from four-dimensional \textit{ab initio} Potential Energy Surfaces interpolated using Gaussian Process Regression | K. Panchagnula et.al. | 2401.04625 | null |
2024-01-09 | A Multi-Modal Approach Based on Large Vision Model for Close-Range Underwater Target Localization | Mingyang Yang et.al. | 2401.04595 | null |
2024-01-09 | An Automatic Cascaded Model for Hemorrhagic Stroke Segmentation and Hemorrhagic Volume Estimation | Weijin Xu et.al. | 2401.04570 | null |
2024-01-09 | Invariant measures for a class of stochastic third grade fluid equations in $2D$ and $3D$ bounded domains | Yassine Tahraoui et.al. | 2401.04566 | null |
2024-01-09 | Observation of Higher Order Nodal Line Semimetal in Phononic Crystals | Qiyun Ma et.al. | 2401.04502 | null |
2024-01-09 | RomniStereo: Recurrent Omnidirectional Stereo Matching | Hualie Jiang et.al. | 2401.04345 | link |
2024-01-09 | SiN-on-SOI Optical Phased Array LiDAR for Ultra-Wide Field of View and 4D Sensing | Baisong Chen et.al. | 2401.04335 | null |
2024-01-08 | SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling | Chengjie Huang et.al. | 2401.04230 | null |
2024-01-08 | 3D Tensor Renormalisation Group at High Temperatures | Nikolay Ebel et.al. | 2401.04229 | null |
2024-01-08 | The TNG50-SKIRT Atlas: post-processing methodology and first data release | Maarten Baes et.al. | 2401.04224 | null |
2024-01-07 | RHOBIN Challenge: Reconstruction of Human Object Interaction | Xianghui Xie et.al. | 2401.04143 | null |
2024-01-08 | AGG: Amortized Generative 3D Gaussians for Single Image to 3D | Dejia Xu et.al. | 2401.04099 | null |
2024-01-09 | GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation | Tong Wu et.al. | 2401.04092 | link |
2024-01-08 | The enigmatic dance of the HD 189733A system: Does the planet accrete onto the star? | Salvatore Colombo et.al. | 2401.03962 | null |
2024-01-08 | Recovering the 3D UUV Position using UAV Imagery in Shallow-Water Environments | Antun Đuraš et.al. | 2401.03938 | null |
2024-01-08 | D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement | Danqi Yan et.al. | 2401.03914 | null |
2024-01-08 | RoboFusion: Towards Robust Multi-Modal 3D obiect Detection via SAM | Ziying Song et.al. | 2401.03907 | link |
2024-01-08 | Weak and Strong Solutions for A Fluid-Poroelastic-Structure Interaction via A Semigroup Approach | George Avalos et.al. | 2401.03897 | null |
2024-01-08 | A Survey on 3D Gaussian Splatting | Guikun Chen et.al. | 2401.03890 | link |
2024-01-08 | Theory of x-ray absorption spectroscopy for ferrites | Felix Sorgenfrei et.al. | 2401.03858 | link |
2024-01-08 | Joint 3D User and 6D Hybrid Reconfigurable Intelligent Surface Localization | Reza Ghazalian et.al. | 2401.03852 | null |
2024-01-08 | UFO: Unidentified Foreground Object Detection in 3D Point Cloud | Hyunjun Choi et.al. | 2401.03846 | null |
2024-01-10 | WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Chenhongyi Yang et.al. | 2401.03836 | link |
2024-01-08 | ARES VI: Viability of one-dimensional retrieval models for transmission spectroscopy characterization of exo-atmospheres in the era of JWST and Ariel | Adam Yassin Jaziri et.al. | 2401.03809 | null |
2024-01-08 | InvariantOODG: Learning Invariant Features of Point Clouds for Out-of-Distribution Generalization | Zhimin Zhang et.al. | 2401.03765 | null |
2024-01-08 | 3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis | Ruiqi Liu et.al. | 2401.03764 | null |
2024-01-08 | Sur2f: A Hybrid Representation for High-Quality and Efficient Surface Reconstruction from Multi-view Images | Zhangjin Huang et.al. | 2401.03704 | null |
2024-01-08 | Primitive Geometry Segment Pre-training for 3D Medical Image Segmentation | Ryu Tadokoro et.al. | 2401.03665 | link |
2024-01-08 | Partial Regularity for the Three-dimensional Stochastic Ericksen–Leslie equations | Hengrong Du et.al. | 2401.03662 | null |
2024-01-08 | GrainGNN: A dynamic graph neural network for predicting 3D grain microstructure | Yigong Qin et.al. | 2401.03661 | link |
2024-01-08 | Dust formation in common envelope binary interactions – II: 3D simulations with self-consistent dust formation | Luis C. Bermúdez-Bustamante et.al. | 2401.03644 | null |
2024-01-08 | DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving | Wencheng Han et.al. | 2401.03641 | null |
2024-01-07 | Quantifying T cell morphodynamics and migration in 3D collagen matrices | Yeeren I. Low et.al. | 2401.03595 | link |
2024-01-07 | A New Dataflow Implementation to Improve Energy Efficiency of Monolithic 3D Systolic Arrays | Prachi Shukla et.al. | 2401.03585 | null |
2024-01-07 | Limiting behavior of minimizing $p$-harmonic maps in 3d as $p$ goes to $2$ with finite fundamental group | Bohdan Bulanyi et.al. | 2401.03583 | null |
2024-01-07 | FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes | Genghao Zhang et.al. | 2401.03470 | null |
2024-01-07 | A Classification of Critical Configurations for any Number of Projective Views | Martin Bråtelund et.al. | 2401.03450 | link |
2024-01-07 | See360: Novel Panoramic View Interpolation | Zhi-Song Liu et.al. | 2401.03431 | link |
2024-01-07 | N $^{3}$ -Mapping: Normal Guided Neural Non-Projective Signed Distance Fields for Large-scale 3D Mapping | Shuangfu Song et.al. | 2401.03412 | link |
2024-01-09 | Predicting the Skies: A Novel Model for Flight-Level Passenger Traffic Forecasting | Sina Ehsani et.al. | 2401.03397 | null |
2024-01-07 | Low Bend Loss, High Index, Composite Morphology Ultra-fast Laser Written Waveguides for Photonic Integrated Circuits | Andrew J. Ross-Adams et.al. | 2401.03382 | null |
2024-01-06 | Freeform terahertz structures fabricated by multi-photon lithography and metal coating | Pascal Maier et.al. | 2401.03316 | null |
2024-01-06 | CAVIAR: Co-simulation of 6G Communications, 3D Scenarios and AI for Digital Twins | João Borges et.al. | 2401.03310 | link |
2024-01-06 | Multi-View 3D Instance Segmentation of Structural Anomalies for Enhanced Structural Inspection of Concrete Bridges | Christian Benz et.al. | 2401.03298 | link |
2024-01-06 | RustNeRF: Robust Neural Radiance Field with Low-Quality Images | Mengfei Li et.al. | 2401.03257 | null |
2024-01-06 | 3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding | Zeju Li et.al. | 2401.03201 | link |
2024-01-06 | PosDiffNet: Positional Neural Diffusion for Point Cloud Registration in a Large Field of View with Perturbations | Rui She et.al. | 2401.03167 | null |
2024-01-06 | An Event-Oriented Diffusion-Refinement Method for Sparse Events Completion | Bo Zhang et.al. | 2401.03153 | null |
2024-01-06 | Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection | Yuanpeng Tu et.al. | 2401.03145 | null |
2024-01-06 | Vision Transformers and Bi-LSTM for Alzheimer’s Disease Diagnosis from 3D MRI | Taymaz Akan et.al. | 2401.03132 | null |
2024-01-06 | Dress-Me-Up: A Dataset & Method for Self-Supervised 3D Garment Retargeting | Shanthika Naik et.al. | 2401.03108 | null |
2024-01-05 | Learning Multimodal Volumetric Features for Large-Scale Neuron Tracing | Qihua Chen et.al. | 2401.03043 | link |
2024-01-05 | Development of a central-moment phase-field lattice Boltzmann model for thermocapillary flows: Droplet capture and computational performance | Markus Holzer et.al. | 2401.03041 | null |
2024-01-05 | Analytical Quantum Full-Wave Solutions for a 3D Circuit Quantum Electrodynamics System | Soomin Moon et.al. | 2401.03033 | null |
2024-01-05 | A GPU-Accelerated Modern Fortran Version of the ECHO Code for Relativistic Magnetohydrodynamics | Luca Del Zanna et.al. | 2401.03008 | null |
2024-01-05 | Locally Adaptive Neural 3D Morphable Models | Michail Tarasiou et.al. | 2401.02937 | link |
2024-01-05 | Lift-Connected Surface Codes | Josias Old et.al. | 2401.02911 | null |
2024-01-08 | DiffBody: Diffusion-based Pose and Shape Editing of Human Images | Yuta Okuyama et.al. | 2401.02804 | link |
2024-01-05 | VoxelNextFusion: A Simple, Unified and Effective Voxel Fusion Framework for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2401.02702 | null |
2024-01-05 | Scaffolding fundamentals and recent advances in sustainable scaffolding techniques for cultured meat development | AMM Nurul Alam et.al. | 2401.02691 | null |
2024-01-05 | Scaling Laws Governing the Elastic Properties of 3D-Graphenes | Ming Li et.al. | 2401.02689 | null |
2024-01-05 | Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation | Can Xu et.al. | 2401.02683 | link |
2024-01-05 | Adaptive Discounting of Training Time Attacks | Ridhima Bector et.al. | 2401.02652 | null |
2024-01-05 | Enhancing 3D-Air Signature by Pen Tip Tail Trajectory Awareness: Dataset and Featuring by Novel Spatio-temporal CNN | Saurabh Atreya et.al. | 2401.02649 | link |
2024-01-05 | Recent Advancement in 3D Biometrics using Monocular Camera | Aritra Mukherjee et.al. | 2401.02646 | null |
2024-01-05 | Signatures of room-temperature superconductivity emerging in two-dimensional domains within the new Bi/Pb-based ceramic cuprate superconductors at ambient pressure | S. Dzhumanov et.al. | 2401.02642 | null |
2024-01-05 | Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human | Song Bai et.al. | 2401.02620 | null |
2024-01-05 | FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF | Hao Zhang et.al. | 2401.02616 | link |
2024-01-05 | Partition-based Nonrigid Registration for 3D Face Model | Yuping Ye et.al. | 2401.02607 | null |
2024-01-05 | Characterizing Satellite Geometry via Accelerated 3D Gaussian Splatting | Van Minh Nguyen et.al. | 2401.02588 | null |
2024-01-04 | Bundling by volume exclusion in non-equilibrium spaghetti | I. Bonamassa et.al. | 2401.02579 | null |
2024-01-04 | Underestimation of the tidal force and apsidal motion in close binary systems by the perturbative approach: Comparisons with non-perturbative models | L. Fellay et.al. | 2401.02573 | null |
2024-01-04 | Strings, branes and twistons: topological analysis of phase defects in excitable media such as the heart | Louise Arno et.al. | 2401.02571 | null |
2024-01-04 | OptFlow: Fast Optimization-based Scene Flow Estimation without Supervision | Rahul Ahuja et.al. | 2401.02550 | null |
2024-01-04 | Stiffer alginate gels deposit more efficiently in microchannel flows | Barrett T Smith et.al. | 2401.02530 | null |
2024-01-04 | ODIN: A Single Model for 2D and 3D Perception | Ayush Jain et.al. | 2401.02416 | link |
2024-01-04 | What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs | Alex Trevithick et.al. | 2401.02411 | null |
2024-01-04 | 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation | Zihao Xiao et.al. | 2401.02402 | null |
2024-01-04 | Learning the 3D Fauna of the Web | Zizhang Li et.al. | 2401.02400 | null |
2024-01-04 | Survey of 3D Human Body Pose and Shape Estimation Methods for Contemporary Dance Applications | Darshan Venkatrayappa et.al. | 2401.02383 | null |
2024-01-04 | Fit-NGP: Fitting Object Models to Neural Graphics Primitives | Marwan Taher et.al. | 2401.02357 | null |
2024-01-04 | GridFormer: Point-Grid Transformer for Surface Reconstruction | Shengtao Li et.al. | 2401.02292 | link |
2024-01-04 | PEGASUS: Physically Enhanced Gaussian Splatting Simulation System for 6DOF Object Pose Dataset Generation | Lukas Meyer et.al. | 2401.02281 | link |
2024-01-04 | Slot-guided Volumetric Object Radiance Fields | Di Qi et.al. | 2401.02241 | null |
2024-01-04 | Mode conversion and energy flux absorption in the structured solar atmosphere | Samuel Skirvin et.al. | 2401.02238 | null |
2024-01-04 | Enabling Digitalization in Modular Robotic Systems Integration | Daniella Tola et.al. | 2401.02227 | null |
2024-01-04 | Compositing with 2D Vector Fields by using Shape Maps that can represent Inconsistent, Impossible, and Incoherent Shapes | Ergun Akleman et.al. | 2401.02200 | null |
2024-01-04 | Real-and-Present: Investigating the Use of Life-Size 2D Video Avatars in HMD-Based AR Teleconferencing | Xuanyu Wang et.al. | 2401.02171 | null |
2024-01-03 | FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding | Xingxing Zuo et.al. | 2401.01970 | null |
2024-01-03 | Quasi-two-dimensionality of three-dimensional, magnetically dominated, decaying turbulence | Shreya Dwivedi et.al. | 2401.01965 | null |
2024-01-08 | Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection | Haowen Zheng et.al. | 2401.01918 | null |
2024-01-02 | Development of the CMS Magnetic Field Map | Nicola Amapane et.al. | 2401.01913 | null |
2024-01-03 | On the Mesoscale Structure of CMEs at Mercury’s Orbit: BepiColombo and Parker Solar Probe Observations | Erika Palmerio et.al. | 2401.01875 | null |
2024-01-03 | Immersive Serious Games for Learning Physics Concepts: The Case of Density | Iuliia Zhurakovskaia et.al. | 2401.01831 | null |
2024-01-04 | HawkRover: An Autonomous mmWave Vehicular Communication Testbed with Multi-sensor Fusion and Deep Learning | Ethan Zhu et.al. | 2401.01822 | null |
2024-01-03 | Many-Objective-Optimized Semi-Automated Robotic Disassembly Sequences | Takuya Kiyokawa et.al. | 2401.01817 | null |
2024-01-03 | A quatum inspired neural network for geometric modeling | Weitao Du et.al. | 2401.01801 | null |
2024-01-03 | Simulations of Radiatively Cooled Magnetic Reconnection Driven by Pulsed Power | Rishabh Datta et.al. | 2401.01795 | null |
2024-01-08 | Does the Hamiltonian determine the tensor product structure and the 3d space? | Ovidiu Cristinel Stoica et.al. | 2401.01793 | null |
2024-01-03 | Necessary conditions for the formation of filaments and star clusters in the cold neutral medium | Rachel Pillsworth et.al. | 2401.01737 | null |
2024-01-03 | STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion | Wei Yao et.al. | 2401.01730 | link |
2024-01-02 | Image Sculpting: Precise Object Editing with 3D Geometry Control | Jiraphon Yenphraphai et.al. | 2401.01702 | null |
2024-01-03 | SIGNeRF: Scene Integrated Generation for Neural Radiance Fields | Jan-Niklas Dihlmann et.al. | 2401.01647 | null |
2024-01-03 | S3Net: Innovating Stereo Matching and Semantic Segmentation with a Single-Branch Semantic Stereo Network in Satellite Epipolar Imagery | Qingyuan Yang et.al. | 2401.01643 | link |
2024-01-03 | Novel analytical solutions to a new formed model of the (2+1)-dimensional BKP equation using a novel expansion technique | Rajib Mia et.al. | 2401.01594 | null |
2024-01-03 | On the Mutuality between Localization and Channel Modeling in sub-THz | Eray Guven et.al. | 2401.01504 | null |
2024-01-02 | Indoor Obstacle Discovery on Reflective Ground via Monocular Camera | Feng Xue et.al. | 2401.01445 | link |
2024-01-02 | Off-Road LiDAR Intensity Based Semantic Segmentation | Kasi Viswanath et.al. | 2401.01439 | link |
2024-01-02 | Evolution of the pseudogap temperature dependence in YBa $2$Cu$_3$O${7-δ}$ films under the influence of a magnetic field | E. V. Petrenko et.al. | 2401.01413 | null |
2024-01-02 | Design, Manufacturing and Open-Loop Control of a Soft Pneumatic Arm | Jorge Francisco García-Samartín et.al. | 2401.01409 | null |
2024-01-02 | Collisionless Magnetorotational Turbulence in Pair Plasmas: Steady-state Dynamics, Particle Acceleration, and Radiative Cooling | Fabio Bacchini et.al. | 2401.01399 | null |
2024-01-02 | On Optimal Sampling for Learning SDF Using MLPs Equipped with Positional Encoding | Guying Lin et.al. | 2401.01391 | null |
2024-01-01 | Exploring Multi-Modal Control in Music-Driven Dance Generation | Ronghui Li et.al. | 2401.01382 | null |
2023-12-31 | Using Terrestrial Laser Scanning, Unmanned Aerial Vehicles and Mixed Reality Methodologies for Digital Survey, 3D Modelling and Historical Recreation of Religious Heritage Monuments | Aristeidis Zachos et.al. | 2401.01380 | null |
2024-01-02 | Street Gaussians for Modeling Dynamic Urban Scenes | Yunzhi Yan et.al. | 2401.01339 | link |
2024-01-02 | PDRs4All. V. Modelling the dust evolution across the illuminated edge of the Orion Bar | M. Elyajouri et.al. | 2401.01221 | null |
2024-01-02 | Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise | Qinglong Huang et.al. | 2401.01216 | null |
2024-01-02 | En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data | Yifang Men et.al. | 2401.01173 | null |
2024-01-02 | Decayless oscillations in 3D coronal loops excited by a power-law driver | Konstantinos Karampelas et.al. | 2401.01095 | null |
2024-01-02 | Depth-discriminative Metric Learning for Monocular 3D Object Detection | Wonhyeok Choi et.al. | 2401.01075 | null |
2024-01-02 | 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands | Xuan Huang et.al. | 2401.00979 | link |
2024-01-01 | GenH2R: Learning Generalizable Human-to-Robot Handover via Scalable Simulation, Demonstration, and Imitation | Zifan Wang et.al. | 2401.00929 | null |
2024-01-01 | Free-form Shape Modeling in XR: A Systematic Review | Shounak Chatterjee et.al. | 2401.00924 | null |
2024-01-01 | Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton Sequence | Ruizhuo Xu et.al. | 2401.00921 | link |
2023-12-31 | Taming Mode Collapse in Score Distillation for Text-to-3D Generation | Peihao Wang et.al. | 2401.00909 | null |
2023-12-30 | 3D Human Pose Perception from Egocentric Stereo Videos | Hiroyasu Akada et.al. | 2401.00889 | null |
2023-12-30 | PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields | Zheng Chen et.al. | 2401.00871 | null |
2024-01-01 | Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera | Jiye Lee et.al. | 2401.00847 | null |
2024-01-01 | Deblurring 3D Gaussian Splatting | Byeonghyeon Lee et.al. | 2401.00834 | null |
2024-01-01 | 3D Beamforming Through Joint Phase-Time Arrays | Ozlem Yildiz et.al. | 2401.00819 | null |
2024-01-01 | GLIMPSE: Generalized Local Imaging with MLPs | AmirEhsan Khorashadizadeh et.al. | 2401.00816 | link |
2024-01-03 | Ultraspherical/Gegenbauer polynomials to unify 2D/3D Ambisonic directivity designs | Franz Zotter et.al. | 2401.00813 | null |
2024-01-01 | Plug-and-Play regularized 3D seismic inversion with 2D pre-trained denoisers | Nick Luiken et.al. | 2401.00753 | null |
2024-01-03 | Harmonizing Covariance and Expressiveness for Deep Hamiltonian Regression in Crystalline Material Research: a Hybrid Cascaded Regression Framework | Shi Yin et.al. | 2401.00744 | null |
2024-01-01 | Depth Map Denoising Network and Lightweight Fusion Network for Enhanced 3D Face Recognition | Ruizhuo Xu et.al. | 2401.00719 | null |
2024-01-01 | Steering of vortices by magnetic-field tilting in superconductor nanotubes | Igor Bogush et.al. | 2401.00712 | null |
2024-01-01 | Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute | Chaoqun Gong et.al. | 2401.00711 | null |
2024-01-01 | 3D non-LTE abundance analyses of late-type stars | Karin Lind et.al. | 2401.00697 | null |
2024-01-01 | Point Cloud in the Air | Yulin Shao et.al. | 2401.00658 | null |
2024-01-01 | Geometry Depth Consistency in RGBD Relative Pose Estimation | Sourav Kumar et.al. | 2401.00639 | null |
2024-01-02 | GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields | Xiao Pan et.al. | 2401.00616 | null |
2023-12-31 | SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity | Peihao Wang et.al. | 2401.00604 | null |
2023-12-31 | Proximal quantum control of spin and spin ensemble with highly localized control field from skyrmions | Md Fahim F Chowdhury et.al. | 2401.00573 | null |
2023-12-31 | Ruhr Hand Motion Catalog of Human Center-Out Transport Trajectories in 3D Task-Space Captured by a Redundant Measurement System | Tim Sziburis et.al. | 2401.00562 | null |
2023-12-31 | AllSpark: a multimodal spatiotemporal general model | Run Shao et.al. | 2401.00546 | link |
2023-12-31 | Wild2Avatar: Rendering Humans Behind Occlusions | Tiange Xiang et.al. | 2401.00431 | null |
2023-12-31 | Geometric BV for twisted Courant sigma models and the BRST power finesse | Athanasios Chatzistavrakidis et.al. | 2401.00425 | null |
2023-12-31 | A Two-stream Hybrid CNN-Transformer Network for Skeleton-based Human Interaction Recognition | Ruoqi Yin et.al. | 2401.00409 | null |
2023-12-31 | Low-cost Geometry-based Eye Gaze Detection using Facial Landmarks Generated through Deep Learning | Esther Enhui Ye et.al. | 2401.00406 | null |
2023-12-31 | Generalizing Single-View 3D Shape Retrieval to Occlusions and Unseen Objects | Qirui Wu et.al. | 2401.00405 | link |
2023-12-31 | 3D Multi-system Bayesian Calibration with Energy Conservation to Study Rapidity-dependent Dynamics of Nuclear Collisions | Andi Mankolli et.al. | 2401.00402 | null |
2024-01-02 | EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Masked Audio Gesture Modeling | Haiyang Liu et.al. | 2401.00374 | link |
2023-12-30 | SHARE: Single-view Human Adversarial REconstruction | Shreelekha Revankar et.al. | 2401.00343 | null |
2023-12-30 | Almost sure global well-posedness for 3D Euler equation and other fluid dynamics models | Juraj Foldes et.al. | 2401.00332 | null |
2023-12-30 | Asymptotically proved numerical coupling of a 2D flexural porous plate with the 3D Stokes fluid | Maxime Krier et.al. | 2401.00331 | null |
2023-12-30 | A self-assembled periodic nanoporous framework in aqueous solutions of the DNA tetramer GCCG | Gregory P. Smith et.al. | 2401.00318 | null |
2023-12-30 | ASL Champ!: A Virtual Reality Game with Deep-Learning Driven Sign Recognition | Md Shahinur Alam et.al. | 2401.00289 | null |
2023-12-30 | An $\ell^1$ -Plug-and-Play Approach for Magnetic Particle Imaging Using a Zero Shot Denoiser with Validation on the 3D Open MPI Dataset | Vladyslav Gapyak et.al. | 2401.00275 | null |
2023-12-30 | HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations | Yilan Dong et.al. | 2401.00271 | link |
2023-12-30 | Robust fluctuation-based super-resolution microscopy in a confocal architecture | Alexander Krupinski-Ptaszek et.al. | 2401.00261 | null |
2023-12-30 | Phase diagram and critical behavior of Hubbard model on the square-hexagon-octagon lattice | Xinwei Jia et.al. | 2401.00258 | null |
2023-12-30 | Probing the Limits and Capabilities of Diffusion Models for the Anatomic Editing of Digital Twins | Karim Kadry et.al. | 2401.00247 | null |
2023-12-30 | Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models | Han Jiang et.al. | 2401.00208 | null |
2023-12-29 | Accelerating Process Development for 3D Printing of New Metal Alloys | David Guirguis et.al. | 2401.00065 | null |
2023-12-29 | The $g$ -function and Defect Changing Operators from Wavefunction Overlap on a Fuzzy Sphere | Zheng Zhou et.al. | 2401.00039 | null |
2024-01-02 | 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation | Li Xu et.al. | 2401.00029 | null |
2023-12-29 | On the 4d/3d/2d view of the SCFT/VOA correspondence | Mykola Dedushenko et.al. | 2312.17747 | null |
2023-12-29 | MURP: Multi-Agent Ultra-Wideband Relative Pose Estimation with Constrained Communications in 3D Environments | Andrew Fishberg et.al. | 2312.17731 | link |
2023-12-29 | Visual Point Cloud Forecasting enables Scalable Autonomous Driving | Zetong Yang et.al. | 2312.17655 | link |
2023-12-29 | Grasping, Part Identification, and Pose Refinement in One Shot with a Tactile Gripper | Joyce Xin-Yan Lim et.al. | 2312.17650 | null |
2023-12-29 | Developing Flying Explorer for Autonomous Digital Modelling in Wild Unknowns | Naizhong Zhang. Yaoqiang Pan et.al. | 2312.17634 | null |
2023-12-29 | P2M2-Net: Part-Aware Prompt-Guided Multimodal Point Cloud Completion | Linlian Jiang et.al. | 2312.17611 | null |
2023-12-29 | Enhancing the Performance of DeepReach on High-Dimensional Systems through Optimizing Activation Functions | Qian Wang et.al. | 2312.17583 | null |
2023-12-29 | CAD-compatible structural shape optimization with a movable Bézier tetrahedral mesh | Jorge López et.al. | 2312.17575 | null |
2023-12-29 | Thermodynamics of the five-vertex model with scalar-product boundary conditions | Ivan N. Burenev et.al. | 2312.17565 | null |
2023-12-29 | Informative Rays Selection for Few-Shot Neural Radiance Fields | Marco Orsingher et.al. | 2312.17561 | null |
2023-12-29 | LiDAR Odometry Survey: Recent Advancements and Remaining Challenges | Dongjae Lee et.al. | 2312.17487 | null |
2023-12-29 | An improved Liouville-type theorem for the stationary tropical climate model | Youseung Cho et.al. | 2312.17441 | null |
2023-12-28 | Calmed 3D Navier-Stokes Equations: Global Well-Posedness, Energy Identities, Global Attractors, and Convergence | Matthew Enlow et.al. | 2312.17371 | null |
2023-12-28 | iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views | Chin-Hsuan Wu et.al. | 2312.17250 | link |
2023-12-28 | Amodal Ground Truth and Completion in the Wild | Guanqi Zhan et.al. | 2312.17247 | link |
2023-12-28 | Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels | Rui Huang et.al. | 2312.17232 | null |
2023-12-28 | 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency | Yuyang Yin et.al. | 2312.17225 | null |
2023-12-28 | HISR: Hybrid Implicit Surface Representation for Photorealistic 3D Human Reconstruction | Angtian Wang et.al. | 2312.17192 | null |
2023-12-28 | One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts | Ziheng Zhao et.al. | 2312.17183 | link |
2023-12-29 | DreamGaussian4D: Generative 4D Gaussian Splatting | Jiawei Ren et.al. | 2312.17142 | link |
2023-12-29 | Fully Sparse 3D Panoptic Occupancy Prediction | Haisong Liu et.al. | 2312.17118 | link |
2023-12-28 | Toward Semantic Scene Understanding for Fine-Grained 3D Modeling of Plants | Mohamad Qadri et.al. | 2312.17110 | null |
2023-12-28 | Geometry-Biased Transformer for Robust Multi-View 3D Human Pose Reconstruction | Olivier Moliner et.al. | 2312.17106 | null |
2023-12-28 | Multidimensional Soliton Systems | Boris A. Malomed et.al. | 2312.17096 | null |
2023-12-28 | 3d $N=2$ theories from M-theory on CY4 and IIB brane box | Marwan Najjar et.al. | 2312.17082 | null |
2023-12-28 | FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with Pre-trained Vision-Language Models | Wan Xu et.al. | 2312.17051 | link |
2024-01-01 | Representing and Modeling Inconsistent, Impossible, and Incoherent Shapes and Scenes with 2D Non-Conservative Vector Fields mapped on 2-Complexes | Ergun Akleman et.al. | 2312.17046 | null |
2023-12-28 | On Density Functional Theory models for one-dimensional homogeneous materials | Bouchra Bensiali et.al. | 2312.17036 | null |
2023-12-28 | Learning Spatially Collaged Fourier Bases for Implicit Neural Representation | Jason Chun Lok Li et.al. | 2312.17018 | null |
2023-12-28 | 3D observations discover a new paradigm in rubber elasticity | Zifan Wang et.al. | 2312.16994 | null |
2023-12-28 | 3DTINC: Time-Equivariant Non-Contrastive Learning for Predicting Disease Progression from Longitudinal OCTs | Taha Emre et.al. | 2312.16980 | null |
2023-12-29 | FFCA-Net: Stereo Image Compression via Fast Cascade Alignment of Side Information | Yichong Xia et.al. | 2312.16963 | null |
2023-12-28 | Efficient Physics-Based Learned Reconstruction Methods for Real-Time 3D Near-Field MIMO Radar Imaging | Irfan Manisali et.al. | 2312.16959 | link |
2023-12-28 | EvPlug: Learn a Plug-and-Play Module for Event and Image Fusion | Jianping Jiang et.al. | 2312.16933 | null |
2023-12-28 | Three-dimensional atmospheric dynamics of Jupiter from ground-based Doppler imaging spectroscopy in the visible | François-Xavier Schmider et.al. | 2312.16888 | null |
2023-12-28 | Binaural recording methods with analysis on inter-aural time, level, and phase differences | Johann Kay Ann Tan et.al. | 2312.16884 | null |
2023-12-28 | Exploring 3D-aware Lifespan Face Aging via Disentangled Shape-Texture Representations | Qianrui Teng et.al. | 2312.16881 | null |
2023-12-28 | DualFluidNet: an Attention-based Dual-pipeline Network for Accurate and Generalizable Fluid-solid Coupled Simulation | Yu Chen et.al. | 2312.16867 | link |
2023-12-28 | Dynamic Appearance Modeling of Clothed 3D Human Avatars using a Single Camera | Hansol Lee et.al. | 2312.16842 | null |
2023-12-29 | DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors | Biwen Lei et.al. | 2312.16837 | null |
2023-12-28 | Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis | Zhan Li et.al. | 2312.16812 | link |
2023-12-27 | Correlated Quantum Phenomena of Spin-Orbit Coupled Perovskite Oxide Heterostructures: Cases of SrRuO3 and SrIrO3-Based Artificial Superlattices | Seung Gyo Jeong et.al. | 2312.16748 | null |
2023-12-27 | HMP: Hand Motion Priors for Pose and Shape Estimation from Video | Enes Duran et.al. | 2312.16737 | null |
2023-12-27 | TetraScatt model: Born approximation for the estimation of acoustic dispersion of fluid-like objects of arbitrary geometries | Edmundo F. Lavia et.al. | 2312.16721 | null |
2023-12-27 | Measurement of multidifferential cross sections for dijet production in proton-proton collisions at $\sqrt{s}$ = 13 TeV | CMS Collaboration et.al. | 2312.16669 | null |
2023-12-27 | LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization | Sai Shubodh Puligilla et.al. | 2312.16648 | null |
2023-12-27 | Learnable Chamfer Distance for Point Cloud Reconstruction | Tianxin Huang et.al. | 2312.16582 | link |
2023-12-29 | Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation | Xiawei Li et.al. | 2312.16578 | link |
2023-12-27 | Discontinuous Galerkin methods for 3D-1D systems | Rami Masri et.al. | 2312.16565 | null |
2023-12-27 | Analysis of a nonconforming finite element method for vector-valued Laplacians on the surface | Carolin Mehlmann et.al. | 2312.16541 | null |
2023-12-30 | Group Multi-View Transformer for 3D Shape Analysis with Spatial Encoding | Lixiang Xu et.al. | 2312.16477 | link |
2023-12-27 | City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web | Kaiwen Song et.al. | 2312.16457 | link |
2023-12-27 | In-Hand 3D Object Reconstruction from a Monocular RGB Video | Shijian Jiang et.al. | 2312.16425 | null |
2023-12-26 | Coordination and Machine Learning in Multi-Robot Systems: Applications in Robotic Soccer | Luis Paulo Reis et.al. | 2312.16273 | null |
2023-12-26 | SPnet: Estimating Garment Sewing Patterns from a Single Image | Seungchan Lim et.al. | 2312.16264 | null |
2023-12-29 | DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision | Lu Ling et.al. | 2312.16256 | null |
2023-12-24 | TEMP3D: Temporally Continuous 3D Human Pose Estimation Under Occlusions | Rohit Lal et.al. | 2312.16221 | link |
2023-12-24 | Hyper-VolTran: Fast and Generalizable One-Shot Image to 3D Object Structure via HyperNetworks | Christian Simon et.al. | 2312.16218 | null |
2023-12-24 | SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition | Nikhil Behari et.al. | 2312.16215 | null |
2023-12-22 | Multimodal machine learning for 3-dimensional characterization of hidden groundwater and geothermal resources | Michael J. Friedel et.al. | 2312.16194 | null |
2023-12-26 | EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI | Tai Wang et.al. | 2312.16170 | link |
2023-12-26 | Social-Transmotion: Promptable Human Trajectory Prediction | Saeed Saadatnejad et.al. | 2312.16168 | link |
2023-12-26 | VirtualPainting: Addressing Sparsity with Virtual Points and Distance-Aware Data Augmentation for 3D Object Detection | Sudip Dhakal et.al. | 2312.16141 | null |
2023-12-26 | Quantum-Hybrid Stereo Matching With Nonlinear Regularization and Spatial Pyramids | Cameron Braunstein et.al. | 2312.16118 | null |
2023-12-26 | The nature of low-temperature spin-freezing in frustrated Kitaev magnets | U. Jena et.al. | 2312.16096 | null |
2023-12-26 | LangSplat: 3D Language Gaussian Splatting | Minghan Qin et.al. | 2312.16084 | link |
2023-12-26 | Anisotropic Generalized Polytropic Spheres: Regular 3D Black Holes | Seyed Naseh Sajadi et.al. | 2312.16081 | null |
2023-12-26 | 2D-Guided 3D Gaussian Segmentation | Kun Lan et.al. | 2312.16047 | null |
2023-12-26 | Plug-and-Play Regularization on Magnitude with Deep Priors for 3D Near-Field MIMO Imaging | Okyanus Oral et.al. | 2312.16024 | link |
2023-12-26 | HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D | Sangmin Woo et.al. | 2312.15980 | link |
2023-12-26 | Monocular 3D Hand Mesh Recovery via Dual Noise Estimation | Hanhui Li et.al. | 2312.15916 | link |
2023-12-26 | Chain of Generation: Multi-Modal Gesture Synthesis via Cascaded Conditional Control | Zunnan Xu et.al. | 2312.15900 | null |
2023-12-26 | SERF: Fine-Grained Interactive 3D Segmentation and Editing with Radiance Fields | Kaichen Zhou et.al. | 2312.15856 | null |
2023-12-27 | Electro-optic frequency comb-enabled precise distance measurement with megahertz acquisition rate | Yifan Qi et.al. | 2312.15743 | null |
2023-12-25 | DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection | Li Xiang et.al. | 2312.15742 | link |
2023-12-25 | Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos | Zhifan Zhu et.al. | 2312.15719 | null |
2023-12-25 | Exploiting dynamic bifurcation in elastic ribbons for mode skipping and selection | Weicheng Huang et.al. | 2312.15699 | null |
2023-12-25 | BDIS-SLAM: A lightweight CPU-based dense stereo SLAM for surgery | Jingwei Song et.al. | 2312.15679 | link |
2023-12-25 | Sparse-view CT Reconstruction with 3D Gaussian Volumetric Representation | Yingtai Li et.al. | 2312.15676 | null |
2023-12-25 | Lifting by Image – Leveraging Image Cues for Accurate 3D Human Pose Estimation | Feng Zhou et.al. | 2312.15636 | null |
2023-12-25 | Signature of BKT-like spin transport in a quasi-2D antiferromagnet BaNi $_2$V$_2$O$_8$ | Kurea Nakagawa et.al. | 2312.15615 | null |
2023-12-25 | A Method for Determining the Locations and Configurations of Magnetic Reconnection within 3D Turbulent Plasmas | Yulei Wang et.al. | 2312.15589 | link |
2023-12-25 | A numerical study on the oscillatory dynamics of tip vortex cavitation | Saman Lak et.al. | 2312.15579 | null |
2023-12-24 | Construct 3D Hand Skeleton with Commercial WiFi | Sijie Ji et.al. | 2312.15507 | link |
2023-12-24 | iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds | Dongmin Choi et.al. | 2312.15449 | null |
2023-12-24 | Make-A-Character: High Quality Text-to-3D Character Generation within Minutes | Jianqiang Ren et.al. | 2312.15430 | null |
2023-12-24 | A theory of volumetric representations for opaque solids | Bailey Miller et.al. | 2312.15406 | null |
2023-12-24 | End-to-End 3D Object Detection using LiDAR Point Cloud | Gaurav Raut et.al. | 2312.15377 | null |
2023-12-23 | WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments | Kavisha Vidanapathirana et.al. | 2312.15364 | link |
2023-12-23 | Scout-Net: Prospective Personalized Estimation of CT Organ Doses from Scout Views | Abdullah-Al-Zubaer Imran et.al. | 2312.15354 | null |
2023-12-23 | Benefit from public unlabeled data: A Frangi filtering-based pretraining network for 3D cerebrovascular segmentation | Gen Shi et.al. | 2312.15273 | link |
2023-12-23 | Self-Supervised Depth Completion Guided by 3D Perception and Geometry Consistency | Yu Cai et.al. | 2312.15263 | null |
2023-12-23 | Human101: Training 100+FPS Human Gaussians in 100s from 1 View | Mingwei Li et.al. | 2312.15258 | link |
2023-12-23 | CaLDiff: Camera Localization in NeRF via Pose Diffusion | Rashik Shrestha et.al. | 2312.15242 | null |
2023-12-23 | NoPose-NeuS: Jointly Optimizing Camera Poses with Neural Implicit Surfaces for Multi-view Reconstruction | Mohamed Shawky Sabae et.al. | 2312.15238 | null |
2023-12-23 | Sample selection with noise rate estimation in noise learning of medical image analysis | Maolin Li et.al. | 2312.15233 | null |
2023-12-23 | Impact of anisotropic cosmic-ray transport on the gamma-ray signatures in the Galactic Center | J. Dörner et.al. | 2312.15206 | null |
2023-12-23 | Helmholtz decomposition based windowed Green function methods for elastic scattering problems on a half-space | Tao Yin et.al. | 2312.15189 | null |
2023-12-23 | Physics-informed neural network for modeling dynamic linear elasticity | Vijay Kag et.al. | 2312.15175 | null |
2023-12-23 | Pre-trained Trojan Attacks for Visual Recognition | Aishan Liu et.al. | 2312.15172 | null |
2023-12-23 | Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud Upsampling | Shujuan Li et.al. | 2312.15133 | null |
2023-12-23 | Stable Higher-Order Topological Dirac Semimetals with $\mathbb{Z}_2$ Monopole Charge in Alternating-twisted Multilayer Graphenes and beyond | Shifeng Qian et.al. | 2312.15131 | null |
2023-12-22 | Automated forest inventory: analysis of high-density airborne LiDAR point clouds with 3D deep learning | Binbin Xiang et.al. | 2312.15084 | link |
2023-12-22 | Deformable 3D Gaussian Splatting for Animatable Human Avatars | HyunJun Jung et.al. | 2312.15059 | null |
2023-12-22 | MACS: Mass Conditioned 3D Hand and Object Motion Synthesis | Soshi Shimada et.al. | 2312.14929 | null |
2023-12-26 | Lift-Attend-Splat: Bird’s-eye-view camera-lidar fusion using transformers | James Gunn et.al. | 2312.14919 | null |
2023-12-22 | PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF | Mohsen Gholami et.al. | 2312.14915 | link |
2023-12-22 | The ALMaQUEST Survey XIV: do radial molecular gas flows affect the star-forming ability of barred galaxies? | Lucy M. Hogarth et.al. | 2312.14702 | null |
2023-12-22 | Pola4All: survey of polarimetric applications and an open-source toolkit to analyze polarization | Joaquin Rodriguez et.al. | 2312.14697 | link |
2023-12-22 | Density Uncertainty Quantification with NeRF-Ensembles: Impact of Data and Scene Constraints | Miriam Jäger et.al. | 2312.14664 | null |
2023-12-22 | A Language-based solution to enable Metaverse Retrieval | Ali Abdari et.al. | 2312.14630 | link |
2023-12-22 | Explainable Multi-Camera 3D Object Detection with Transformer-Based Saliency Maps | Till Beemelmanns et.al. | 2312.14606 | null |
2023-12-22 | 3D Programming of Patterned Heterogeneous Interface for 4D Smart Robotics | Kewei Song et.al. | 2312.14511 | null |
2023-12-22 | Digital twin-assisted three-dimensional electrical capacitance tomography for multiphase flow imaging | Shengnan Wang et.al. | 2312.14496 | null |
2023-12-22 | Beam Foreseeing in Millimeter-Wave Systems with Situational Awareness: Fundamental Limits via Cramér-Rao Lower Bound | Wan-Ting Shih et.al. | 2312.14495 | null |
2023-12-22 | MonoLSS: Learnable Sample Selection For Monocular 3D Detection | Zhenjia Li et.al. | 2312.14474 | link |
2023-12-22 | Towards Assessing Compliant Robotic Grasping from First-Object Perspective via Instrumented Objects | Maceon Knopke et.al. | 2312.14466 | link |
2023-12-22 | FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection | Dongmei Zhang et.al. | 2312.14465 | null |
2023-12-22 | Scalable 3D Reconstruction From Single Particle X-Ray Diffraction Images Based on Online Machine Learning | Jay Shenoy et.al. | 2312.14432 | null |
2023-12-22 | Disc Novae: Thermodynamics of Gas Assisted Binary Black Hole Formation in AGN Discs | Henry Whitehead et.al. | 2312.14431 | null |
2023-12-22 | 3D Anderson localization of light in disordered systems of dielectric particles | Yevgen Grynko et.al. | 2312.14393 | null |
2023-12-22 | Generative AI Beyond LLMs: System Implications of Multi-Modal Generation | Alicia Golden et.al. | 2312.14385 | null |
2023-12-22 | Designing a Skilled Soccer Team for RoboCup: Exploring Skill-Set-Primitives through Reinforcement Learning | Miguel Abreu et.al. | 2312.14360 | link |
2023-12-22 | Interactive simulation and visualization of point spread functions in single molecule imaging | Magdalena C. Schneider et.al. | 2312.14356 | link |
2023-12-22 | Identifying topologically associating domains using differential kernels | Luka Maisuradze et.al. | 2312.14342 | null |
2023-12-21 | Geo2SigMap: High-Fidelity RF Signal Mapping Using Geographic Databases | Yiming Li et.al. | 2312.14303 | link |
2023-12-21 | Scalable nanoimprint manufacturing of multi-layer hybrid metasurface device | Shinhyuk Choi et.al. | 2312.14297 | null |
2023-12-21 | Inertial Waves in a Nonlinear Simulation of the Sun’s Convection Zone and Radiative Interior | Catherine C. Blume et.al. | 2312.14270 | null |
2023-12-21 | PlatoNeRF: 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar | Tzofi Klinghoffer et.al. | 2312.14239 | null |
2023-12-21 | Neural Spline Fields for Burst Image Fusion and Layer Separation | Ilya Chugunov et.al. | 2312.14235 | null |
2023-12-21 | DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models | Brian Nlong Zhao et.al. | 2312.14216 | null |
2023-12-21 | ZeroShape: Regression-based Zero-shot Shape Reconstruction | Zixuan Huang et.al. | 2312.14198 | link |
2023-12-21 | 3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera | Christen Millerdurai et.al. | 2312.14157 | null |
2023-12-21 | Virtual Pets: Animatable Animal Generation in 3D Scenes | Yen-Chi Cheng et.al. | 2312.14154 | null |
2023-12-21 | HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs | Artem Sevastopolsky et.al. | 2312.14140 | null |
2023-12-21 | DUSt3R: Geometric 3D Vision Made Easy | Shuzhe Wang et.al. | 2312.14132 | link |
2023-12-21 | Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation | Philipp Schröppel et.al. | 2312.14124 | link |
2023-12-21 | Axionic defects in the CMB: birefringence and gravitational waves | Ricardo Z. Ferreira et.al. | 2312.14104 | null |
2023-12-21 | Solar Eruptions Triggered by Flux Emergence Below or Near a Coronal Flux Rope | T. Török et.al. | 2312.14092 | null |
2023-12-21 | LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding | Senqiao Yang et.al. | 2312.14074 | null |
2023-12-21 | Geometric Awareness in Neural Fields for 3D Human Registration | Riccardo Marin et.al. | 2312.14024 | link |
2023-12-21 | Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning | Desai Xie et.al. | 2312.13980 | null |
2023-12-21 | Anatomical basis of sex differences in human post-myocardial infarction ECG phenotypes identified by novel automated torso-cardiac 3D reconstruction | Hannah J. Smith et.al. | 2312.13976 | null |
2023-12-21 | Laminar flow synthesis of submicron CaCO $_3$ particles in 3D printed microfluidic chips | I. A. Reznik et.al. | 2312.13974 | null |
2023-12-21 | Controllable 3D Face Generation with Conditional Style Code Diffusion | Xiaolong Shen et.al. | 2312.13941 | link |
2023-12-22 | Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models | Xianfang Zeng et.al. | 2312.13913 | link |
2023-12-21 | Electric dipole spin resonance in single and two electron quantum dot defined in two-dimensional electron gas at the SrTiO $_3$/LaAlO$_3$ interface | B. Szafran et.al. | 2312.13862 | null |
2023-12-21 | SyncDreamer for 3D Reconstruction of Endangered Animal Species with NeRF and NeuS | Ahmet Haydar Ornek et.al. | 2312.13832 | null |
2023-12-21 | 3D Points Splatting for Real-Time Dynamic Hand Reconstruction | Zheheng Jiang et.al. | 2312.13770 | null |
2023-12-21 | Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Huan Ling et.al. | 2312.13763 | null |
2023-12-22 | Gaussian Splatting with NeRF-based Color and Opacity | Dawid Malarz et.al. | 2312.13729 | link |
2023-12-21 | Blind Localization of Room Reflections with Application to Spatial Audio | Yogev Hadadi et.al. | 2312.13707 | null |
2023-12-21 | Free-Editor: Zero-shot Text-driven 3D Scene Editing | Nazmul Karim et.al. | 2312.13663 | link |
2023-12-21 | SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection | Yun Zhu et.al. | 2312.13641 | link |
2023-12-21 | Ponymation: Learning 3D Animal Motions from Unlabeled Online Videos | Keqiang Sun et.al. | 2312.13604 | null |
2023-12-21 | DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation | Chenxu Zhang et.al. | 2312.13578 | null |
2023-12-21 | Contribution of Graphene Molecules C ${53}$ C${52}$ C$_{51}$ on Astronomical Diffuse Interstellar Bands (DIB) | Norio Ota et.al. | 2312.13550 | null |
2023-12-21 | SE(3)-Equivariant and Noise-Invariant 3D Motion Tracking in Medical Images | Benjamin Billot et.al. | 2312.13534 | link |
2023-12-21 | DyBluRF: Dynamic Deblurring Neural Radiance Fields for Blurry Monocular Video | Minh-Quan Viet Bui et.al. | 2312.13528 | null |
2023-12-21 | High-resolution myelin-water fraction and quantitative relaxation mapping using 3D ViSTa-MR fingerprinting | Congyu Liao et.al. | 2312.13523 | null |
2023-12-21 | MR-STGN: Multi-Residual Spatio Temporal Graph Network Using Attention Fusion for Patient Action Assessment | Youssef Mourchid et.al. | 2312.13509 | null |
2023-12-21 | Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects | David Nakath et.al. | 2312.13494 | null |
2023-12-20 | Designing 3D multicomponent self-assembling systems with signal-passing building blocks | Joshua Evans et.al. | 2312.13479 | null |
2023-12-20 | MGAug: Multimodal Geometric Augmentation in Latent Spaces of Image Deformations | Tonmoy Hossain et.al. | 2312.13440 | link |
2023-12-20 | Review and experimental benchmarking of machine learning algorithms for efficient optimization of cold atom experiments | Oliver Anton et.al. | 2312.13397 | null |
2023-12-20 | The Conformal Manifold of S-folds in String Theory | Nikolay Bobev et.al. | 2312.13370 | null |
2023-12-20 | Affine $\mathcal{W}$-algebras and Miura maps from 3d $\mathcal N=4$ non-Abelian quiver gauge theories | Ioana Coman et.al. | 2312.13363 | null |
2023-12-20 | Improving the five-point bootstrap | David Poland et.al. | 2312.13344 | null |
2023-12-20 | Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM | Junru Lin et.al. | 2312.13332 | null |
2023-12-20 | NeLF-Pro: Neural Light Field Probes | Zinuo You et.al. | 2312.13328 | null |
2023-12-20 | ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors | Weijia Mao et.al. | 2312.13324 | null |
2023-12-20 | In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging | Xin Wang et.al. | 2312.13319 | link |
2023-12-20 | SWAGS: Sampling Windows Adaptively for Dynamic 3D Gaussian Splatting | Richard Shaw et.al. | 2312.13308 | null |
2023-12-19 | Compact 3D Scene Representation via Self-Organizing Gaussian Grids | Wieland Morgenstern et.al. | 2312.13299 | link |
2023-12-20 | UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections | Fangjinhua Wang et.al. | 2312.13285 | null |
2023-12-20 | Deep Learning on 3D Neural Fields | Pierluigi Zama Ramirez et.al. | 2312.13277 | null |
2023-12-21 | Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting | Junwu Zhang et.al. | 2312.13271 | link |
2023-12-20 | Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps | Octave Mariotti et.al. | 2312.13216 | null |
2023-12-20 | A 3D super-resolution of wind fields via physics-informed pixel-wise self-attention generative adversarial network | Takuya Kurihana et.al. | 2312.13212 | null |
2023-12-20 | Splatter Image: Ultra-Fast Single-View 3D Reconstruction | Stanislaw Szymanowicz et.al. | 2312.13150 | link |
2023-12-21 | Molecular Hypergraph Neural Networks | Junwu Chen et.al. | 2312.13136 | link |
2023-12-20 | Screwon spectral statistics and dispersion relation in the quantum Rajeev-Ranken model | Govind S. Krishnaswami et.al. | 2312.13122 | null |
2023-12-20 | Pre-training of Molecular GNNs as Conditional Boltzmann Generator | Daiki Koge et.al. | 2312.13110 | null |
2023-12-20 | SpecNeRF: Gaussian Directional Encoding for Specular Reflections | Li Ma et.al. | 2312.13102 | null |
2023-12-22 | MoSAR: Monocular Semi-Supervised Model for Avatar Reconstruction using Differentiable Shading | Abdallah Dib et.al. | 2312.13091 | null |
2023-12-20 | Shedding light on the ejection history of molecular outflows: Multiple velocity modes and precession | Veronica Lora et.al. | 2312.13087 | null |
2023-12-20 | Langlands Dualities through Bethe/Gauge Correspondence for 3d Gauge Theories | Xiang-Mao Ding et.al. | 2312.13080 | null |
2023-12-22 | DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis | Yuming Gu et.al. | 2312.13016 | link |
2023-12-20 | A mesh-free framework for high-order simulations of viscoelastic flows in complex geometries | Jack R. C. King et.al. | 2312.12996 | null |
2023-12-20 | Radar Fields: An Extension of Radiance Fields to SAR | Thibaud Ehret et.al. | 2312.12961 | null |
2023-12-20 | Order-by-disorder in the antiferromagnetic long-range transverse-field Ising model on the ruby lattice | A. Duft et.al. | 2312.12941 | null |
2023-12-20 | Sign Language Production with Latent Motion Transformer | Pan Xie et.al. | 2312.12917 | null |
2023-12-20 | Relightable and Animatable Neural Avatars from Videos | Wenbin Lin et.al. | 2312.12877 | null |
2023-12-20 | Learning Exhaustive Correlation for Spectral Super-Resolution: Where Unified Spatial-Spectral Attention Meets Mutual Linear Dependence | Hongyuan Wang et.al. | 2312.12833 | null |
2023-12-20 | 3D-CLMI: A Motor Imagery EEG Classification Model via Fusion of 3D-CNN and LSTM with Attention | Shiwei Cheng et.al. | 2312.12744 | null |
2023-12-20 | PointeNet: A Lightweight Framework for Effective and Efficient Point Cloud Analysis | Lipeng Gu et.al. | 2312.12743 | null |
2023-12-20 | Reducing Shape-Radiance Ambiguity in Radiance Fields with a Closed-Form Color Estimation Method | Qihang Fang et.al. | 2312.12726 | link |
2023-12-19 | MotionScript: Natural Language Descriptions for Expressive 3D Human Motions | Payam Jome Yazdian et.al. | 2312.12634 | null |
2023-12-19 | Structural maturation of myofilaments in engineered 3D cardiac microtissues characterized using small angle X-ray scattering | Geoffrey van Dover et.al. | 2312.12628 | null |
2023-12-19 | Streaming Instability and Turbulence: Conditions for Planetesimal Formation | Jeonghoon Lim et.al. | 2312.12508 | null |
2023-12-19 | Scene-Conditional 3D Object Stylization and Composition | Jinghao Zhou et.al. | 2312.12419 | null |
2023-12-19 | LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset | Haolin Liu et.al. | 2312.12418 | null |
2023-12-19 | Gravitational waves from supercooled phase transitions: dimensional transmutation meets dimensional reduction | Maciej Kierkla et.al. | 2312.12413 | null |
2023-12-20 | Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers | Ruiyuan Zhang et.al. | 2312.12340 | link |
2023-12-21 | pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction | David Charatan et.al. | 2312.12337 | link |
2023-12-19 | Holography of New Conformal Higher Spin Gravities in 3d | I. Lovrekovic et.al. | 2312.12301 | null |
2023-12-19 | Sketch Vision: Artificial Intelligence with Sight for Imagination | Demircan Tas et.al. | 2312.12270 | null |
2023-12-19 | Topological spectra and entropy of chromatin loop networks | Andrea Bonato et.al. | 2312.12159 | null |
2023-12-19 | Combinatorics and topological weights of chromatin loop networks | Andrea Bonato et.al. | 2312.12154 | null |
2023-12-19 | M-BEV: Masked BEV Perception for Robust Autonomous Driving | Siran Chen et.al. | 2312.12144 | link |
2023-12-19 | The distribution of impactor core material during large impacts on Earth-like planets | Jonathan P. Itcovitz et.al. | 2312.12132 | null |
2023-12-19 | ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields | Xiang Feng et.al. | 2312.12122 | null |
2023-12-19 | Domain Generalization in LiDAR Semantic Segmentation Leveraged by Density Discriminative Feature Embedding | Jaeyeul Kim et.al. | 2312.12098 | null |
2023-12-20 | DLCA-Recon: Dynamic Loose Clothing Avatar Reconstruction from Monocular Videos | Chunjie Luo et.al. | 2312.12096 | null |
2023-12-20 | CrossBind: Collaborative Cross-Modal Identification of Protein Nucleic-Acid-Binding Residues | Linglin Jing et.al. | 2312.12094 | link |
2023-12-19 | Automatic bony structure segmentation and curvature estimation on ultrasound cervical spine images – a feasibility study | Songhan Ge et.al. | 2312.12066 | null |
2023-12-19 | Expressive Forecasting of 3D Whole-body Human Motions | Pengxiang Ding et.al. | 2312.11972 | link |
2023-12-19 | EVI-SAM: Robust, Real-time, Tightly-coupled Event-Visual-Inertial State Estimation and 3D Dense Mapping | Weipeng Guan et.al. | 2312.11911 | link |
2023-12-19 | MeV Astrophysical Spectroscopic Surveyor (MASS): A Compton Telescope Mission Concept | Jiahuan Zhu et.al. | 2312.11900 | null |
2023-12-19 | 3D-LFM: Lifting Foundation Model | Mosam Dabhi et.al. | 2312.11894 | link |
2023-12-19 | Point Cloud Segmentation Using Transfer Learning with RandLA-Net: A Case Study on Urban Areas | Alperen Enes Bayar et.al. | 2312.11880 | null |
2023-12-19 | Self-supervised Learning for Enhancing Geometrical Modeling in 3D-Aware Generative Adversarial Network | Jiarong Guo et.al. | 2312.11856 | null |
2023-12-19 | Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving | Junkai Xu et.al. | 2312.11837 | link |
2023-12-19 | RadOcc: Learning Cross-Modality Occupancy Knowledge through Rendering Assisted Distillation | Haiming Zhang et.al. | 2312.11829 | null |
2023-12-19 | Thermodiffusively unstable laminar hydrogen flame in a sufficiently large 3D computational domain – Part I: Characteristic patterns | Wen Xu et.al. | 2312.11810 | null |
2023-12-19 | Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation | Yuze He et.al. | 2312.11774 | null |
2023-12-18 | RenderCore – a new WebGPU-based rendering engine for ROOT-EVE | Ciril Bohak et.al. | 2312.11729 | null |
2023-12-18 | Indoor and Outdoor 3D Scene Graph Generation via Language-Enabled Spatial Ontologies | Jared Strader et.al. | 2312.11713 | null |
2023-12-18 | Unified framework for diffusion generative models in SO(3): applications in computer vision and astrophysics | Yesukhei Jagvaral et.al. | 2312.11707 | null |
2023-12-18 | HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles | Vanessa Sklyarova et.al. | 2312.11666 | link |
2023-12-18 | Tensor renormalization group study of 3D principal chiral model | Shinichiro Akiyama et.al. | 2312.11649 | null |
2023-12-18 | Matrix models from black hole geometries | Andrea Boido et.al. | 2312.11640 | null |
2023-12-18 | Moiré Fractional Chern Insulators III: Hartree-Fock Phase Diagram, Magic Angle Regime for Chern Insulator States, the Role of the Moiré Potential and Goldstone Gaps in Rhombohedral Graphene Superlattices | Yves H. Kwan et.al. | 2312.11617 | null |
2023-12-18 | Towards Establishing Dense Correspondence on Multiview Coronary Angiography: From Point-to-Point to Curve-to-Curve Query Matching | Yifan Wu et.al. | 2312.11593 | null |
2023-12-18 | Relightable Neural Actor with Intrinsic Decomposition and Pose Control | Diogo Luvizon et.al. | 2312.11587 | null |
2023-12-18 | Diffusion-Based Particle-DETR for BEV Perception | Asen Nachkov et.al. | 2312.11578 | null |
2023-12-17 | SAI3D: Segment Any Instance in 3D Scenes | Yingda Yin et.al. | 2312.11557 | null |
2023-12-15 | Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior | Nan Huang et.al. | 2312.11535 | null |
2023-12-18 | GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning | Ye Yuan et.al. | 2312.11461 | null |
2023-12-18 | VolumeDiffusion: Flexible Text-to-3D Generation with Efficient Volumetric Encoder | Zhicong Tang et.al. | 2312.11459 | link |
2023-12-18 | GauFRe: Gaussian Deformation Fields for Real-time Dynamic Novel View Synthesis | Yiqing Liang et.al. | 2312.11458 | null |
2023-12-18 | Language-Assisted 3D Scene Understanding | Yanmin Wu et.al. | 2312.11451 | link |
2023-12-18 | Cosmic Recombination in the Presence of Primordial Magnetic Fields | Karsten Jedamzik et.al. | 2312.11448 | null |
2023-12-18 | Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations | Zilin Wang et.al. | 2312.11442 | null |
2023-12-18 | 3D exploration-based search for multiple targets using a UAV | Bilal Yousuf et.al. | 2312.11424 | null |
2023-12-18 | PolyDiff: Generating 3D Polygonal Meshes with Diffusion Models | Antonio Alliegro et.al. | 2312.11417 | null |
2023-12-18 | Active search and coverage using point-cloud reinforcement learning | Matthias Rosynski et.al. | 2312.11410 | null |
2023-12-18 | Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering | Kim Youwang et.al. | 2312.11360 | null |
2023-12-19 | CaRe-CNN: Cascading Refinement CNN for Myocardial Infarct Segmentation with Microvascular Obstructions | Franz Thaler et.al. | 2312.11315 | null |
2023-12-18 | Long range 3D magnetic structures of the spin $S$=1 hexamer cluster fedotovite-like A${2}$Cu${3}$O(SO$_4$)$_3$ (A$_2$=K$_2$, NaK, Na$_2$ ): a neutron diffraction study | V. Yu. Pomjakushin et.al. | 2312.11277 | null |
2023-12-18 | Modelling the 3D spatiotemporal organisation of chromatin replication | G. Forte et.al. | 2312.11275 | null |
2023-12-18 | Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation | Sangyun Shin et.al. | 2312.11269 | link |
2023-12-18 | Lattice-based equation of state with 3D Ising critical point | Micheal Kahangirwe et.al. | 2312.11265 | null |
2023-12-18 | WiSegRT: Dataset for Site-Specific Indoor Radio Propagation Modeling with 3D Segmentation and Differentiable Ray-Tracing | Lihao Zhang et.al. | 2312.11245 | link |
2023-12-18 | Programmed Internal Reconfigurations in a 3D-Printed Auxetic Metamaterial Enable Fluidic Control for a Vertically Stacked Valve Array | Tinku Supakar et.al. | 2312.11228 | null |
2023-12-18 | Hausdorff measure for the singularity set of the 3D chemotaxis-Navier-Stokes equations | Xiaomeng Chen et.al. | 2312.11224 | null |
2023-12-18 | Comparative simulations of Kelvin-Helmholtz induced magnetic reconnection at the Earth’s magnetospheric flanks | Silvia Ferro et.al. | 2312.11161 | null |
2023-12-18 | 3D surface profilometry using neutral helium atoms | Aleksandar Radic et.al. | 2312.11114 | null |
2023-12-18 | ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding | Lunhao Duan et.al. | 2312.11112 | link |
2023-12-18 | Toward Low Earth Orbit (LEO) Applications: the Scientific Journey of the ‘‘Space Pulsating Heat Pipe’’ Experiments | Marco Marengo et.al. | 2312.11055 | null |
2023-12-18 | Multi-Correlation Siamese Transformer Network with Dense Connection for 3D Single Object Tracking | Shihao Feng et.al. | 2312.11051 | link |
2023-12-18 | Experimental 3D super-localization with Laguerre-Gaussian modes | Chenyu Hu et.al. | 2312.11044 | null |
2023-12-18 | SinMPI: Novel View Synthesis from a Single Image with Expanded Multiplane Images | Guo Pu et.al. | 2312.11037 | link |
2023-12-18 | Realistic Human Motion Generation with Cross-Diffusion Models | Zeping Ren et.al. | 2312.10993 | null |
2023-12-18 | Long-Tailed 3D Detection via 2D Late Fusion | Yechi Ma et.al. | 2312.10986 | link |
2023-12-18 | Collaborative Learning for Annotation-Efficient Volumetric MR Image Segmentation | Yousuf Babiker M. Osman et.al. | 2312.10978 | null |
2023-12-18 | Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model | Zhenyu Xie et.al. | 2312.10960 | null |
2023-12-18 | Opto-twistronic Hall effect in a three-dimensional spiral lattice | Zhurun Ji et.al. | 2312.10954 | null |
2023-12-18 | A Shape Detection Framework for Deformation Objects Using Clustering Algorithms | Fangqing Chen et.al. | 2312.10932 | null |
2023-12-18 | Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation | Hui Fu et.al. | 2312.10877 | null |
2023-12-18 | Electronic and optical properties of ternary kagome Rb ${2}$Ni${3}$S$_4$ | Gang Bahadur Acharya et.al. | 2312.10874 | null |
2023-12-17 | M3DBench: Let’s Instruct Large Models with Multi-modal 3D Prompts | Mingsheng Li et.al. | 2312.10763 | link |
2023-12-17 | Automated object detection for muon tomography data analysis | A. Georgadze et.al. | 2312.10733 | null |
2023-12-17 | Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders | Yaohua Zha et.al. | 2312.10726 | link |
2023-12-17 | Primitive-based 3D Human-Object Interaction Modelling and Programming | Siqi Liu et.al. | 2312.10714 | null |
2023-12-17 | Bilayer crystals of trapped ions for quantum information processing | Samarth Hawaldar et.al. | 2312.10681 | null |
2023-12-17 | Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance | Phuc D. A. Nguyen et.al. | 2312.10671 | link |
2023-12-17 | Stability Properties of Multi-Order Fractional Differential Systems in 3D | Kai Diethelm et.al. | 2312.10653 | null |
2023-12-17 | Gas Giant Simulations of Eddy-Driven Jets Accompanied by Deep Meridional Circulation | Keren Duer et.al. | 2312.10651 | null |
2023-12-17 | PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields | Boming Zhao et.al. | 2312.10649 | null |
2023-12-17 | T2M-HiFiGPT: Generating High Quality Human Motion from Textual Descriptions with Residual Discrete Representations | Congyi Wang et.al. | 2312.10628 | null |
2023-12-17 | Robust 3D Tracking with Quality-Aware Shape Completion | Jingwen Zhang et.al. | 2312.10608 | null |
2023-12-17 | IntraSeismic: a coordinate-based learning approach to seismic inversion | Juan Romero et.al. | 2312.10568 | null |
2023-12-16 | Dynamics of Meniscus-Bound Particle Clusters in Extensional Flow | Sagar Chaudhary et.al. | 2312.10562 | null |
2023-12-16 | Transformers in Unsupervised Structure-from-Motion | Hemang Chawla et.al. | 2312.10529 | link |
2023-12-16 | Interpretable Online Network Dictionary Learning for Inferring Long-Range Chromatin Interactions | Vishal Rana et.al. | 2312.10519 | link |
2023-12-16 | IRS-Aided Sectorized Base Station Design and 3D Coverage Performance Analysis | Xintong Chen et.al. | 2312.10475 | null |
2023-12-16 | Exploring the effect of strong electronic correlations in Seebeck Coefficient of the NdCoO3 compound : Using experimental and DFT+U approach | Abhishek Pandey et.al. | 2312.10449 | null |
2023-12-16 | A suitable nonlinear Stratonovich noise prevents blow-up in the Euler equations and other SPDEs | Marco Bagnara et.al. | 2312.10446 | null |
2023-12-19 | Learning Dense Correspondence for NeRF-Based Face Reenactment | Songlin Yang et.al. | 2312.10422 | null |
2023-12-16 | Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object Detection | ChuXin Wang et.al. | 2312.10390 | link |
2023-12-16 | Bubble-induced convection and flow-instability in a soft reactor | Ron Shnapp et.al. | 2312.10363 | null |
2023-12-16 | Material Point Methods on Unstructured Tessellations: A Stable Kernel Approach With Continuous Gradient Reconstruction | Yadi Cao et.al. | 2312.10338 | null |
2023-12-16 | Enabling Mammography with Co-Robotic Ultrasound | Yuxin Chen et.al. | 2312.10309 | null |
2023-12-16 | Differential operators on the base affine space of $SL_n$ and quantized Coulomb branches | Tom Gannon et.al. | 2312.10278 | null |
2023-12-15 | Implicit Modeling of Non-rigid Objects with Cross-Category Signals | Yuchun Liu et.al. | 2312.10246 | null |
2023-12-15 | SoloPose: One-Shot Kinematic 3D Human Pose Estimation with Video Data Augmentation | David C. Jeong et.al. | 2312.10195 | link |
2023-12-15 | MVHuman: Tailoring 2D Diffusion with Multi-view Sampling For Realistic 3D Human Generation | Suyi Jiang et.al. | 2312.10120 | null |
2023-12-15 | Plasticine3D: Non-rigid 3D editting with text guidance | Yige Chen et.al. | 2312.10111 | null |
2023-12-15 | Point Transformer V3: Simpler, Faster, Stronger | Xiaoyang Wu et.al. | 2312.10035 | link |
2023-12-15 | SlimmeRF: Slimmable Radiance Fields | Shiran Yuan et.al. | 2312.10034 | link |
2023-12-15 | Plasma-enhanced atomic layer deposition of titanium nitride for superconducting devices | John Femi-Oyetoro et.al. | 2312.09984 | null |
2023-12-15 | Quasi-geostrophic convection-driven dynamos in a thick spherical shell | Olivier Barrois et.al. | 2312.09946 | null |
2023-12-15 | CNC-Net: Self-Supervised Learning for CNC Machining Operations | Mohsen Yavartanoo et.al. | 2312.09925 | null |
2023-12-15 | A Unifying Tensor View for Lightweight CNNs | Jason Chun Lok Li et.al. | 2312.09922 | null |
2023-12-15 | LAENeRF: Local Appearance Editing for Neural Radiance Fields | Lukas Radl et.al. | 2312.09913 | null |
2023-12-15 | Comparison of Quasi-Geostrophic, Hybrid and 3D models of planetary core convection | Olivier Barrois et.al. | 2312.09826 | null |
2023-12-15 | Drones Guiding Drones: Cooperative Navigation of a Less-Equipped Micro Aerial Vehicle in Cluttered Environments | Václav Pritzl et.al. | 2312.09786 | null |
2023-12-15 | RANRAC: Robust Neural Scene Representations via Random Ray Consensus | Benno Buschmann et.al. | 2312.09780 | null |
2023-12-15 | The VISCACHA survey – IX. The SMC Southern Bridge in 8D | M. C. Parisi et.al. | 2312.09756 | null |
2023-12-15 | SLS4D: Sparse Latent Space for 4D Novel View Synthesis | Qi-Yuan Feng et.al. | 2312.09743 | null |
2023-12-15 | 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V | Dingning Liu et.al. | 2312.09738 | null |
2023-12-15 | Shaping and Being Shaped by Drones: Supporting Perception-Action Loops | Mousa Sondoqah et.al. | 2312.09688 | null |
2023-12-15 | Exploring the Feasibility of Generating Realistic 3D Models of Endangered Species Using DreamGaussian: An Analysis of Elevation Angle’s Impact on Model Generation | Selcuk Anil Karatopak et.al. | 2312.09682 | null |
2023-12-15 | Ins-HOI: Instance Aware Human-Object Interactions Recovery | Jiajun Zhang et.al. | 2312.09641 | link |
2023-12-15 | Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment | Xiaoxu Xu et.al. | 2312.09625 | null |
2023-12-15 | Global Solutions of Multispeed Semilinear Klein-Gordon Systems in Space Dimension Two | Xilu Zhu et.al. | 2312.09596 | null |
2023-12-15 | CAGE: Controllable Articulation GEneration | Jiayi Liu et.al. | 2312.09570 | null |
2023-12-15 | Towards Transferable Targeted 3D Adversarial Attack in the Physical World | Yao Huang et.al. | 2312.09558 | link |
2023-12-14 | Relightable Neural Assets | Krishna Mullia et.al. | 2312.09398 | null |
2023-12-14 | High-Resolution Maps of Left Atrial Displacements and Strains Estimated with 3D CINE MRI and Unsupervised Neural Networks | Christoforos Galazis et.al. | 2312.09387 | link |
2023-12-14 | A bubble VEM-fully discrete polytopal scheme for mixed-dimensional poromechanics with frictional contact at matrix fracture interfaces | Jérôme Droniou et.al. | 2312.09319 | null |
2023-12-14 | A parallelized cellular Potts model that enables simulations at tissue scale | Shabaz Sultan et.al. | 2312.09317 | link |
2023-12-14 | LatentEditor: Text Driven Local Editing of 3D Scenes | Umar Khalid et.al. | 2312.09313 | link |
2023-12-14 | Stable Score Distillation for High-Quality 3D Generation | Boshi Tang et.al. | 2312.09305 | null |
2023-12-14 | Random resistive memory-based deep extreme point learning machine for unified visual processing | Shaocong Wang et.al. | 2312.09262 | null |
2023-12-14 | Single Mesh Diffusion Models with Field Latents for Texture Generation | Thomas W. Mitchel et.al. | 2312.09250 | null |
2023-12-14 | ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining | Ruoxi Shi et.al. | 2312.09249 | null |
2023-12-14 | SHAP-EDITOR: Instruction-guided Latent 3D Editing in Seconds | Minghao Chen et.al. | 2312.09246 | null |
2023-12-14 | OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields | Chubin Zhang et.al. | 2312.09243 | link |
2023-12-14 | Text2Immersion: Generative Immersive Scene with 3D Gaussians | Hao Ouyang et.al. | 2312.09242 | null |
2023-12-15 | 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting | Zhiyin Qian et.al. | 2312.09228 | null |
2023-12-14 | Mosaic-SDF for 3D Generative Models | Lior Yariv et.al. | 2312.09222 | null |
2023-12-14 | A colossal advantage: 3D-local noisy shallow quantum circuits defeat unbounded fan-in classical circuits | Libor Caha et.al. | 2312.09209 | null |
2023-12-14 | Properties of 3D HI Filaments in the Smith High Velocity Cloud | Colin Holm-Hansen et.al. | 2312.09164 | null |
2023-12-14 | Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers | Zi-Xin Zou et.al. | 2312.09147 | null |
2023-12-14 | Deterministic dynamics of overactive Brownian particle in 2D and 3D potential wells | Denis S. Goldobin et.al. | 2312.09141 | null |
2023-12-14 | Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments | Liyuan Zhu et.al. | 2312.09138 | link |
2023-12-15 | Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption | Ziteng Cui et.al. | 2312.09093 | link |
2023-12-14 | Learned Fusion: 3D Object Detection using Calibration-Free Transformer Feature Fusion | Michael Fürst et.al. | 2312.09082 | null |
2023-12-14 | PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion | Ying-Tian Liu et.al. | 2312.09069 | null |
2023-12-14 | Holodeck: Language Guided Generation of 3D Embodied AI Environments | Yue Yang et.al. | 2312.09067 | link |
2023-12-14 | Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection | Davide Berghi et.al. | 2312.09034 | link |
2023-12-14 | iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching | Yuan Sun et.al. | 2312.09031 | null |
2023-12-14 | Scene 3-D Reconstruction System in Scattering Medium | Zhuoyifan Zhang et.al. | 2312.09005 | null |
2023-12-14 | LEMON: Learning 3D Human-Object Interaction Relation from 2D Images | Yuhang Yang et.al. | 2312.08963 | null |
2023-12-14 | VaLID: Variable-Length Input Diffusion for Novel View Synthesis | Shijie Li et.al. | 2312.08892 | null |
2023-12-13 | SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance | Yuanyou Xu et.al. | 2312.08889 | null |
2023-12-13 | SceneWiz3D: Towards Text-guided 3D Scene Composition | Qihang Zhang et.al. | 2312.08885 | null |
2023-12-12 | Regularizing Self-supervised 3D Scene Flows with Surface Awareness and Cyclic Consistency | Patrik Vacek et.al. | 2312.08879 | link |
2023-12-12 | OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection | Hu Zhang et.al. | 2312.08876 | null |
2023-12-14 | HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video | Xueying Wang et.al. | 2312.08863 | null |
2023-12-14 | FR0 jets and recollimation-induced instabilities | A. Costa et.al. | 2312.08767 | null |
2023-12-14 | CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning | Qingsong Yan et.al. | 2312.08760 | null |
2023-12-14 | UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation | Zexiang Liu et.al. | 2312.08754 | null |
2023-12-14 | GOEnFusion: Gradient Origin Encodings for 3D Forward Diffusion Models | Animesh Karnewar et.al. | 2312.08744 | null |
2023-12-14 | Bayes3D: fast learning and inference in structured generative models of 3D objects and scenes | Nishad Gothoskar et.al. | 2312.08715 | null |
2023-12-14 | A Local Appearance Model for Volumetric Capture of Diverse Hairstyle | Ziyan Wang et.al. | 2312.08679 | null |
2023-12-14 | SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration | Kezheng Xiong et.al. | 2312.08664 | null |
2023-12-14 | Joint2Human: High-quality 3D Human Generation via Compact Spherical Embedding of 3D Joints | Muxin Zhang et.al. | 2312.08591 | null |
2023-12-14 | Quasisymmetric high-beta 3D MHD equilibria near axisymmetry | W. Sengupta et.al. | 2312.08572 | null |
2023-12-13 | NViST: In the Wild New View Synthesis from a Single Image with Transformers | Wonbong Jang et.al. | 2312.08568 | null |
2023-12-13 | Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models | Liangchen Song et.al. | 2312.08563 | null |
2023-12-13 | Crystalline finite-size topology | Michał J. Pacholski et.al. | 2312.08552 | null |
2023-12-13 | PnP for Two-Dimensional Pose Estimation | Joshua Wang et.al. | 2312.08488 | link |
2023-12-13 | The CMB lensing imprint of cosmic voids detected in the WISE-Pan-STARRS luminous red galaxy catalog | G. Camacho-Ciurana et.al. | 2312.08483 | null |
2023-12-13 | FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models | Shivangi Aneja et.al. | 2312.08459 | link |
2023-12-13 | Pose and shear-based tactile servoing | John Lloyd et.al. | 2312.08411 | null |
2023-12-13 | Towards Safe and Collaborative Robotic Ultrasound Tissue Scanning in Neurosurgery | Michael Dyck et.al. | 2312.08409 | null |
2023-12-12 | PerfactTailor: Scale-Preserving 2D Pattern Adjustment Driven by 3D Garment Editing | Anran Qi et.al. | 2312.08386 | null |
2023-12-13 | SAM-guided Graph Cut for 3D Instance Segmentation | Haoyu Guo et.al. | 2312.08372 | null |
2023-12-13 | PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection | Kuan-Chih Huang et.al. | 2312.08371 | link |
2023-12-13 | VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space | Guénolé Fiche et.al. | 2312.08291 | null |
2023-12-13 | TABSurfer: a Hybrid Deep Learning Architecture for Subcortical Segmentation | Aaron Cao et.al. | 2312.08267 | link |
2023-12-13 | The CHARA Array interferometric program on the multiplicity of classical Be stars: new detections and orbits of stripped subdwarf companions | Robert Klement et.al. | 2312.08252 | null |
2023-12-13 | Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation | Yujun Chen et.al. | 2312.08234 | null |
2023-12-13 | Partial Symmetry Detection for 3D Geometry using Contrastive Learning with Geodesic Point Cloud Patches | Gregor Kobsik et.al. | 2312.08230 | null |
2023-12-13 | Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers | Haifeng Huang et.al. | 2312.08168 | link |
2023-12-13 | $ρ$ -Diffusion: A diffusion-based density estimation framework for computational physics | Maxwell X. Cai et.al. | 2312.08153 | link |
2023-12-13 | Design and synthesis of three-dimensional hybrid Ruddlesden-Popper nickelate single crystals | Feiyu Li et.al. | 2312.08116 | null |
2023-12-13 | Machine Learning for the Multi-Dimensional Bin Packing Problem: Literature Review and Empirical Evaluation | Wenjie Wu et.al. | 2312.08103 | null |
2023-12-13 | 3DGEN: A GAN-based approach for generating novel 3D models from image data | Antoine Schnepf et.al. | 2312.08094 | null |
2023-12-13 | Mono3DVG: 3D Visual Grounding in Monocular Images | Yang Zhan et.al. | 2312.08022 | link |
2023-12-13 | Instance-aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning | Yang Jiao et.al. | 2312.08004 | null |
2023-12-13 | BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics | Wenqian Zhang et.al. | 2312.07937 | link |
2023-12-13 | DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes | Xiaoyu Zhou et.al. | 2312.07920 | link |
2023-12-13 | 2-mm-Thick Large-Area CdTe Double-sided Strip Detectors for High-Resolution Spectroscopic Imaging of X-ray and Gamma-ray with Depth-Of-Interaction Sensing | Takahiro Minami et.al. | 2312.07915 | null |
2023-12-13 | Projective Parallel Single-Pixel Imaging: 3D Structured Light Scanning Under Global Illumination | Yuxi Li et.al. | 2312.07911 | null |
2023-12-13 | Ideas of lattice-basis reduction theory for error-stable Bravais lattice determination and ab-initio indexing | R. Oishi-Tomiyasu et.al. | 2312.07909 | null |
2023-12-13 | Easy bootstrap for the 3D Ising model | Wenliang Li et.al. | 2312.07866 | null |
2023-12-13 | Denoising diffusion-based synthetic generation of three-dimensional (3D) anisotropic microstructures from two-dimensional (2D) micrographs | Kang-Hyun Lee et.al. | 2312.07832 | null |
2023-12-12 | Abundances of iron-peak elements in accreted and in situ born Galactic halo stars | P. E. Nissen et.al. | 2312.07768 | null |
2023-12-12 | Mirror dualities with four supercharges | Sergio Benvenuti et.al. | 2312.07667 | null |
2023-12-12 | Multi Armed Bandit based Resource Allocation in Near Memory Processing Architectures | Shubhang Pandey et.al. | 2312.07640 | null |
2023-12-12 | Teaching Unknown Objects by Leveraging Human Gaze and Augmented Reality in Human-Robot Interaction | Daniel Weber et.al. | 2312.07638 | null |
2023-12-12 | Pre-trained Universal Medical Image Transformer | Lingxiao Luo et.al. | 2312.07630 | link |
2023-12-11 | Spatiotemporal Event Graphs for Dynamic Scene Understanding | Salman Khan et.al. | 2312.07621 | null |
2023-12-12 | HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation | Hongyu Liu et.al. | 2312.07539 | null |
2023-12-12 | WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion | Soyong Shin et.al. | 2312.07531 | null |
2023-12-12 | Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance | Kuan-Chih Huang et.al. | 2312.07530 | link |
2023-12-12 | A Hitchhiker’s Guide to Geometric GNNs for 3D Atomic Systems | Alexandre Duval et.al. | 2312.07511 | link |
2023-12-12 | COLMAP-Free 3D Gaussian Splatting | Yang Fu et.al. | 2312.07504 | link |
2023-12-12 | MinD-3D: Reconstruct High-quality 3D objects in Human Brain | Jianxiong Gao et.al. | 2312.07485 | null |
2023-12-12 | Holoported Characters: Real-time Free-viewpoint Rendering of Humans from Sparse RGB Cameras | Ashwath Shetty et.al. | 2312.07423 | null |
2023-12-12 | GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance | Haiming Zhang et.al. | 2312.07385 | null |
2023-12-12 | X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer | Linglin Jing et.al. | 2312.07378 | link |
2023-12-12 | Automatic coral reef fish identification and 3D measurement in the wild | Cyril Barrelet et.al. | 2312.07357 | null |
2023-12-12 | MRCN: Enhanced Coherence Mechanism for Near Memory Processing Architectures | Amit Kumar Kabat et.al. | 2312.07355 | null |
2023-12-12 | Accurate Fourier-space statistics for line intensity mapping: Cartesian grid sampling without aliased power | Steven Cunnington et.al. | 2312.07289 | link |
2023-12-12 | Benchmarking Pretrained Vision Embeddings for Near- and Duplicate Detection in Medical Images | Tuan Truong et.al. | 2312.07273 | null |
2023-12-12 | Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs | Sunghwan Hong et.al. | 2312.07246 | link |
2023-12-12 | Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation | Shentong Mo et.al. | 2312.07231 | null |
2023-12-12 | Transferring CLIP’s Knowledge into Zero-Shot Point Cloud Semantic Segmentation | Yuanbin Wang et.al. | 2312.07221 | null |
2023-12-12 | Equivariant Flow Matching with Hybrid Probability Transport | Yuxuan Song et.al. | 2312.07168 | link |
2023-12-12 | Connecting remote and in situ observations of shock-accelerated electrons associated with a coronal mass ejection | D. E. Morosan et.al. | 2312.07166 | null |
2023-12-12 | CompdVision: Combining Near-Field 3D Visual and Tactile Sensing Using a Compact Compound-Eye Imaging System | Lifan Luo et.al. | 2312.07146 | null |
2023-12-12 | The complex heavy quark potential in an anisotropic quark-gluon plasma | Ajaharul Islam et.al. | 2312.07073 | null |
2023-12-12 | Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation | Xianghui Xie et.al. | 2312.07063 | null |
2023-12-12 | Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation | Yuchen Yang et.al. | 2312.07051 | link |
2023-12-12 | Diff-OP3D: Bridging 2D Diffusion for Open Pose 3D Zero-Shot Classification | Weiguang Zhao et.al. | 2312.07039 | link |
2023-12-12 | Beyond 1D and oversimplified kinematics: A generic analytical framework for surrogate safety measures | Sixu Li et.al. | 2312.07019 | null |
2023-12-14 | MWSIS: Multimodal Weakly Supervised Instance Segmentation with 2D Box Annotations for Autonomous Driving | Guangfeng Jiang et.al. | 2312.06988 | link |
2023-12-12 | Residual Stress-Driven Non-Euclidean Morphing in Origami Structures | Zihe Liang et.al. | 2312.06982 | null |
2023-12-12 | PatchMorph: A Stochastic Deep Learning Approach for Unsupervised 3D Brain Image Registration with Small Patches | Henrik Skibbe et.al. | 2312.06958 | null |
2023-12-12 | MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing | Kangneng Zhou et.al. | 2312.06947 | link |
2023-12-12 | Nonlinear Expectation Inference for Direct Uncertainty Quantification of Nonlinear Inverse Problems | Zhao Zhang et.al. | 2312.06923 | link |
2023-12-12 | w2v-SELD: A Sound Event Localization and Detection Framework for Self-Supervised Spatial Audio Pre-Training | Orlem Lima dos Santos et.al. | 2312.06907 | link |
2023-12-11 | Vertical shear instability in two-moment radiation-hydrodynamical simulations of irradiated protoplanetary disks I. Angular momentum transport and turbulent heating | Julio David Melon Fuksman et.al. | 2312.06882 | null |
2023-12-11 | DYAD: A Descriptive Yet Abjuring Density efficient approximation to linear neural network layers | Sarin Chandy et.al. | 2312.06881 | link |
2023-12-11 | Scalable Decentralized Cooperative Platoon using Multi-Agent Deep Reinforcement Learning | Ahmed Abdelrahman et.al. | 2312.06858 | null |
2023-12-11 | A local study of dynamo action driven by precession | V. Kumar et.al. | 2312.06835 | null |
2023-12-11 | Advancing solar magnetic field extrapolations through multi-height magnetic field measurements | Robert Jarolim et.al. | 2312.06823 | null |
2023-12-11 | Cryogenic RPWELL: a novel charge-readout element for dual-phase argon TPCs | A. Tesi et.al. | 2312.06809 | null |
2023-12-11 | Improving the Robustness of 3D Human Pose Estimation: A Benchmark and Learning from Noisy Input | Trung-Hieu Hoang et.al. | 2312.06797 | null |
2023-12-11 | A generalized Selberg zeta function for flat space cosmologies | Arjun Bagchi et.al. | 2312.06770 | null |
2023-12-11 | Gaussian Splatting SLAM | Hidenobu Matsuki et.al. | 2312.06741 | null |
2023-12-11 | MonoNPHM: Dynamic Head Reconstruction from Monocular Videos | Simon Giebenhain et.al. | 2312.06740 | null |
2023-12-14 | TULIP: Transformer for Upsampling of LiDAR Point Cloud | Bin Yang et.al. | 2312.06733 | link |
2023-12-11 | EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion | Zehuan Huang et.al. | 2312.06725 | link |
2023-12-10 | UNeR3D: Versatile and Scalable 3D RGB Point Cloud Generation from 2D Images in Unsupervised Reconstruction | Hongbin Lin et.al. | 2312.06706 | null |
2023-12-10 | SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction | Zechuan Zhang et.al. | 2312.06704 | link |
2023-12-09 | Evolving Reservoirs for Meta Reinforcement Learning | Corentin Léger et.al. | 2312.06695 | link |
2023-12-09 | Robo360: A 3D Omnispective Multi-Material Robotic Manipulation Dataset | Litian Liang et.al. | 2312.06686 | null |
2023-12-11 | CAD: Photorealistic 3D Generation via Adversarial Distillation | Ziyu Wan et.al. | 2312.06663 | null |
2023-12-11 | UpFusion: Novel View Diffusion from Unposed Sparse View Observations | Bharath Raj Nagoor Kani et.al. | 2312.06661 | null |
2023-12-11 | Learning Naturally Aggregated Appearance for Efficient 3D Editing | Ka Leong Cheng et.al. | 2312.06657 | link |
2023-12-11 | Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior | Fangfu Liu et.al. | 2312.06655 | link |
2023-12-11 | AnyHome: Open-Vocabulary Generation of Structured and Textured 3D Homes | Zehao Wen et.al. | 2312.06644 | null |
2023-12-11 | Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism | Georgios Milis et.al. | 2312.06613 | link |
2023-12-11 | Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops | Aditya Prakash et.al. | 2312.06594 | null |
2023-12-11 | 3D Hand Pose Estimation in Egocentric Images in the Wild | Aditya Prakash et.al. | 2312.06583 | null |
2023-12-11 | EasyVolcap: Accelerating Neural Volumetric Video Research | Zhen Xu et.al. | 2312.06575 | link |
2023-12-11 | Inferring Hybrid Neural Fluid Fields from Videos | Hong-Xing Yu et.al. | 2312.06561 | null |
2023-12-11 | HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models | Xiaogang Peng et.al. | 2312.06553 | null |
2023-12-12 | Open Data-Driven Automation of Residential Distribution Grid Modeling with Minimal Data Requirements | Moritz Weber et.al. | 2312.06552 | null |
2023-12-11 | On One Dimensional Advection – Diffusion Equation with Variable Diffusivity | Eeshwar Prasad Poudel et.al. | 2312.06493 | null |
2023-12-11 | DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior | Tianyu Huang et.al. | 2312.06439 | link |
2023-12-12 | PointVoxel: A Simple and Effective Pipeline for Multi-View Multi-Modal 3D Human Pose Estimation | Zhiyu Pan et.al. | 2312.06409 | null |
2023-12-11 | NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos | Jinxi Li et.al. | 2312.06398 | link |
2023-12-11 | ManiPose: Manifold-Constrained Multi-Hypothesis 3D Human Pose Estimation | Cédric Rommel et.al. | 2312.06386 | link |
2023-12-11 | Intraoperative 2D/3D Image Registration via Differentiable X-ray Rendering | Vivek Gopalakrishnan et.al. | 2312.06358 | link |
2023-12-11 | Noise-based Correction for Electrical Impedance Tomography | Kai Mason et.al. | 2312.06320 | null |
2023-12-11 | Invariants of magnetic lines for Yang-Milles solutions | Petr Akhmet’ev et.al. | 2312.06301 | null |
2023-12-11 | Quantum physics at your fingertips – from paper strips to zippers | Franziska Greinert et.al. | 2312.06269 | null |
2023-12-11 | NutritionVerse-Synth: An Open Access Synthetically Generated 2D Food Scene Dataset for Dietary Intake Estimation | Saeejith Nair et.al. | 2312.06192 | null |
2023-12-11 | M3SOT: Multi-frame, Multi-field, Multi-space 3D Single Object Tracking | Jiaming Liu et.al. | 2312.06117 | link |
2023-12-11 | SimMining-3D: Altitude-Aware 3D Object Detection in Complex Mining Environments: A Novel Dataset and ROS-Based Automatic Annotation Pipeline | Mehala Balamurali et.al. | 2312.06113 | null |
2023-12-11 | Robust Geometry and Reflectance Disentanglement for 3D Face Reconstruction from Sparse-view Images | Daisheng Jin et.al. | 2312.06085 | null |
2023-12-11 | A dynamic interactive learning framework for automated 3D medical image segmentation | Mu Tian et.al. | 2312.06072 | null |
2023-12-11 | Superconductivity in Ternary Germanide ScPdGe and Silicide ScPdSi | Yusaku Shinoda et.al. | 2312.06045 | null |
2023-12-10 | The PHANGS-AstroSat Atlas of Nearby Star Forming Galaxies | Hamid Hassani et.al. | 2312.06031 | null |
2023-12-10 | GAMMA: Galactic Attributes of Mass, Metallicity, and Age Dataset | Ufuk Çakır et.al. | 2312.06016 | link |
2023-12-10 | From Correspondences to Pose: Non-minimal Certifiably Optimal Relative Pose without Disambiguation | Javier Tirado-Garín et.al. | 2312.05995 | link |
2023-12-10 | Activating Frequency and ViT for 3D Point Cloud Quality Assessment without Reference | Oussama Messai et.al. | 2312.05972 | link |
2023-12-10 | ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering | Haokai Pang et.al. | 2312.05941 | link |
2023-12-10 | SuperPrimitive: Scene Reconstruction at a Primitive Level | Kirill Mazur et.al. | 2312.05889 | null |
2023-12-10 | Wild Motion Unleashed: Markerless 3D Kinematics and Force Estimation in Cheetahs | Zico da Silva et.al. | 2312.05879 | null |
2023-12-10 | R2Human: Real-Time 3D Human Appearance Rendering from a Single Image | Qiao Feng et.al. | 2312.05826 | null |
2023-12-10 | HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model | Yi Wang et.al. | 2312.05804 | null |
2023-12-10 | Camera-based 3D Semantic Scene Completion with Sparse Guidance Network | Jianbiao Mei et.al. | 2312.05752 | link |
2023-12-09 | On the Ground and in the Sky: A Tutorial on Radio Localization in Ground-Air-Space Networks | Hazem Sallouha et.al. | 2312.05704 | null |
2023-12-09 | Light detection and Cosmic Rejection in the ICARUS LArTPC at Fermilab | Anna Heggestuen et.al. | 2312.05684 | null |
2023-12-09 | CoGS: Controllable Gaussian Splatting | Heng Yu et.al. | 2312.05664 | null |
2023-12-09 | EipFormer: Emphasizing Instance Positions in 3D Instance Segmentation | Mengnan Zhao et.al. | 2312.05602 | null |
2023-12-09 | R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning | Zhiling Ye et.al. | 2312.05572 | null |
2023-12-09 | A Unified Multi-Phase CT Synthesis and Classification Framework for Kidney Cancer Diagnosis with Incomplete Data | Kwang-Hyun Uhm et.al. | 2312.05548 | null |
2023-12-09 | DPoser: Diffusion Model as Robust 3D Human Pose Prior | Junzhe Lu et.al. | 2312.05541 | link |
2023-12-09 | Exploring 3D U-Net Training Configurations and Post-Processing Strategies for the MICCAI 2023 Kidney and Tumor Segmentation Challenge | Kwang-Hyun Uhm et.al. | 2312.05528 | null |
2023-12-12 | Flexible Cross-Modal Steganography via Implicit Representations | Seoyun Yang et.al. | 2312.05496 | null |
2023-12-09 | Spectroscopy-Guided Discovery of Three-Dimensional Structures of Disordered Materials with Diffusion Models | Hyuna Kwon et.al. | 2312.05472 | link |
2023-12-08 | Learning 3D Particle-based Simulators from RGB-D Videos | William F. Whitney et.al. | 2312.05359 | null |
2023-12-08 | Transition Path Sampling with Boltzmann Generator-based MCMC Moves | Michael Plainer et.al. | 2312.05340 | link |
2023-12-08 | Multi-view Inversion for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2312.05330 | link |
2023-12-08 | The Higgs Branch of Heterotic LSTs: Hasse Diagrams and Generalized Symmetries | Craig Lawrie et.al. | 2312.05306 | null |
2023-12-08 | Orbits and Dynamical Masses for the Active Hyades Multiple System HD 284163 | Guillermo Torres et.al. | 2312.05301 | null |
2023-12-08 | Disentangled Clothed Avatar Generation from Text Descriptions | Jionghao Wang et.al. | 2312.05295 | null |
2023-12-11 | Nuvo: Neural UV Mapping for Unruly 3D Representations | Pratul P. Srinivasan et.al. | 2312.05283 | null |
2023-12-08 | 3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection | Yunhao Ge et.al. | 2312.05277 | link |
2023-12-08 | Reconstructing Hands in 3D with Transformers | Georgios Pavlakos et.al. | 2312.05251 | null |
2023-12-08 | SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation | Thuan Hoang Nguyen et.al. | 2312.05239 | link |
2023-12-08 | Enhancing Facial Classification and Recognition using 3D Facial Models and Deep Learning | Houting Li et.al. | 2312.05219 | null |
2023-12-08 | ControlRoom3D: Room Generation using Semantic Proxy Rooms | Jonas Schult et.al. | 2312.05208 | null |
2023-12-08 | Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation | Bruno Lecouat et.al. | 2312.05190 | null |
2023-12-08 | GIR: 3D Gaussian Inverse Rendering for Relightable Scene Factorization | Yahao Shi et.al. | 2312.05133 | link |
2023-12-08 | Relativistic Faddeev 3D Equations for Three-Body Bound States Without Two-Body $t-$ Matrices | M. Mohammadzadeh et.al. | 2312.05132 | null |
2023-12-08 | The splashback radius for dark matter, gas and observables in the FLAMINGO simulations | Imogen Towler et.al. | 2312.05126 | link |
2023-12-08 | Theoretical Prediction of the Effective Dynamic Dielectric Constant of Disordered Hyperuniform Anisotropic Composites Beyond the Long-Wavelength Regime | Jaeuk Kim et.al. | 2312.05095 | null |
2023-12-08 | Potentials for solenoidal fields using the three-dimensional ${\varphi}$ -harmonic cyclic algebra | Homero G. Díaz-Marín et.al. | 2312.05093 | null |
2023-12-08 | 3D non-LTE modeling of the stellar center-to-limb variation for transmission spectroscopy studies | G. Canocchi et.al. | 2312.05078 | null |
2023-12-08 | New results on 3d $\mathcal{N}=2$ SQCD and its 3d GLSM interpretation | Cyril Closset et.al. | 2312.05076 | null |
2023-12-08 | Robotic Control of the Deformation of Soft Linear Objects Using Deep Reinforcement Learning | Mélodie Hani Daniel Zakaria et.al. | 2312.05056 | link |
2023-12-08 | Exploring the ex-situ components within $Gaia$ DR3 | Zhuohan Li et.al. | 2312.05027 | null |
2023-12-08 | Vision-based Learning for Drones: A Survey | Jiaping Xiao et.al. | 2312.05019 | null |
2023-12-07 | Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors | Lihe Ding et.al. | 2312.04963 | null |
2023-12-07 | Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds | Yujia Liu et.al. | 2312.04962 | null |
2023-12-08 | EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism | Yanxi Chen et.al. | 2312.04916 | link |
2023-12-08 | Pilot tone-guided focused navigation for free-breathing whole-liver fat-water and T2* quantification | Adèle LC Mackowiak et.al. | 2312.04908 | null |
2023-12-08 | Cross-BERT for Point Cloud Pretraining | Xin Li et.al. | 2312.04891 | null |
2023-12-08 | MVDD: Multi-View Depth Diffusion Models | Zhen Wang et.al. | 2312.04875 | null |
2023-12-08 | Understanding oscillating features of the time-like nucleon electromagnetic form factors within the extending vector meson dominance model | Bing Yan et.al. | 2312.04866 | null |
2023-12-08 | SiCP: Simultaneous Individual and Cooperative Perception for 3D Object Detection in Connected and Automated Vehicles | Deyuan Qu et.al. | 2312.04822 | link |
2023-12-08 | Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting | Xiaofeng Yang et.al. | 2312.04820 | null |
2023-12-08 | RL Dreams: Policy Gradient Optimization for Score Distillation based 3D Generation | Aradhya N. Mathur et.al. | 2312.04806 | null |
2023-12-08 | SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration | Xu Cao et.al. | 2312.04803 | link |
2023-12-08 | Segmentation of Kidney Tumors on Non-Contrast CT Images using Protuberance Detection Network | Taro Hatsutani et.al. | 2312.04796 | null |
2023-12-08 | Visual Grounding of Whole Radiology Reports for 3D CT Images | Akimichi Ichinose et.al. | 2312.04794 | null |
2023-12-08 | Reality’s Canvas, Language’s Brush: Crafting 3D Avatars from Monocular Video | Yuchen Rao et.al. | 2312.04784 | null |
2023-12-07 | The 3D Kinematics of the Orion Nebula Cluster II: Mass-dependent Kinematics of the Inner Cluster | Lingfeng Wei et.al. | 2312.04751 | null |
2023-12-07 | E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D Medical Image Segmentation | Boqian Wu et.al. | 2312.04727 | link |
2023-12-07 | The effects of planetary day-night temperature gradients on He 1083 nm transit spectra | Fabienne Nail et.al. | 2312.04682 | null |
2023-12-07 | Additive manufacturing of a 3D-segmented plastic scintillator detector for tracking and calorimetry of elementary particles | Tim Weber et.al. | 2312.04672 | null |
2023-12-07 | NeuSD: Surface Completion with Multi-View Text-to-Image Diffusion | Savva Ignatyev et.al. | 2312.04654 | null |
2023-12-07 | VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment | Phong Tran et.al. | 2312.04651 | null |
2023-12-07 | MuRF: Multi-Baseline Radiance Fields | Haofei Xu et.al. | 2312.04565 | link |
2023-12-07 | EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS | Sharath Girish et.al. | 2312.04564 | link |
2023-12-07 | Visual Geometry Grounded Deep Structure From Motion | Jianyuan Wang et.al. | 2312.04563 | null |
2023-12-07 | NeRFiller: Completing Scenes via Generative 3D Inpainting | Ethan Weber et.al. | 2312.04560 | null |
2023-12-07 | PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation | Zhaoxi Chen et.al. | 2312.04559 | link |
2023-12-07 | MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar | Yufan Chen et.al. | 2312.04558 | null |
2023-12-07 | Free3D: Consistent Novel View Synthesis without 3D Representation | Chuanxia Zheng et.al. | 2312.04551 | link |
2023-12-07 | Digital Life Project: Autonomous 3D Characters with Social Intelligence | Zhongang Cai et.al. | 2312.04547 | null |
2023-12-07 | HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image | Tong Wu et.al. | 2312.04543 | null |
2023-12-07 | Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models | Ivan Kapelyukh et.al. | 2312.04533 | null |
2023-12-07 | Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection | Kohei Yamashita et.al. | 2312.04527 | null |
2023-12-07 | Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping | Alex Costanzino et.al. | 2312.04521 | null |
2023-12-07 | Conjectural criteria for the most singular points of the Hilbert schemes of points | Fatemeh Rezaee et.al. | 2312.04520 | null |
2023-12-07 | Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion | Kiran Chhatre et.al. | 2312.04466 | link |
2023-12-07 | FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models | Stathis Galanakis et.al. | 2312.04465 | null |
2023-12-07 | Deep3DSketch: 3D modeling from Free-hand Sketches with View- and Structural-Aware Adversarial Training | Tianrun Chen et.al. | 2312.04435 | null |
2023-12-07 | Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views | Yabo Chen et.al. | 2312.04424 | null |
2023-12-07 | Competing d ${xy}$ and s${\pm }$ Pairing Symmetries in Superconducting La${3}$Ni${2}$O$_{7}$ emerge from LDA+FLEX Calculations | Griffin Heier et.al. | 2312.04401 | null |
2023-12-07 | AniRes2D: Anisotropic Residual-enhanced Diffusion for 2D MR Super-Resolution | Zejun Wu et.al. | 2312.04385 | null |
2023-12-07 | Features of magnetization and spin reorientation in weak ferrimagnets of the YFe $_{1-x}$Cr$_x$O$_3$ type | Alexander Moskvin et.al. | 2312.04381 | null |
2023-12-08 | SingingHead: A Large-scale 4D Dataset for Singing Head Animation | Sijing Wu et.al. | 2312.04369 | null |
2023-12-07 | Learning to sample in Cartesian MRI | Thomas Sanchez et.al. | 2312.04327 | null |
2023-12-08 | Multi Actor-Critic DDPG for Robot Action Space Decomposition: A Framework to Control Large 3D Deformation of Soft Linear Objects | Mélodie Daniel et.al. | 2312.04308 | link |
2023-12-07 | The physical properties of T Pyx as measured by MUSE I. The geometrical distribution of the ejecta and the distance to the remnant | L. Izzo et.al. | 2312.04277 | null |
2023-12-07 | Proxima: Near-storage Acceleration for Graph-based Approximate Nearest Neighbor Search in 3D NAND | Weihong Xu et.al. | 2312.04257 | null |
2023-12-07 | Extending Answer Set Programming with Rational Numbers | Francesco Pacenza et.al. | 2312.04249 | null |
2023-12-07 | TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes | Xuying Zhang et.al. | 2312.04248 | null |
2023-12-07 | Zooming on the emerging ionized regions of pPNe with ALMA | C. Sánchez Contreras et.al. | 2312.04188 | null |
2023-12-07 | MAD UFOs: Magnetically Arrested Discs with persistent Ultra-Fast Outflows | Petra Suková et.al. | 2312.04149 | null |
2023-12-07 | Towards 4D Human Video Stylization | Tiantian Wang et.al. | 2312.04143 | link |
2023-12-07 | Polarimetric Light Transport Analysis for Specular Inter-reflection | Ryota Maeda et.al. | 2312.04140 | link |
2023-12-07 | Instance Tracking in 3D Scenes from Egocentric Videos | Yunhan Zhao et.al. | 2312.04117 | link |
2023-12-07 | Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction | Jiayi Kong et.al. | 2312.04106 | null |
2023-12-07 | Mass Ratio Dependence of Three-Body Resonance Lifetimes in 1D and 3D | Lucas Happ et.al. | 2312.04080 | null |
2023-12-07 | Differentiable Registration of Images and LiDAR Point Clouds with VoxelPoint-to-Pixel Matching | Junsheng Zhou et.al. | 2312.04060 | link |
2023-12-07 | Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes | Hmrishav Bandyopadhyay et.al. | 2312.04043 | link |
2023-12-07 | ImFace++: A Sophisticated Nonlinear 3D Morphable Face Model with Implicit Neural Representations | Mingwu Zheng et.al. | 2312.04028 | link |
2023-12-07 | PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation | Ardian Umam et.al. | 2312.04016 | link |
2023-12-07 | Rapid detection of rare events from in situ X-ray diffraction data using machine learning | Weijian Zheng et.al. | 2312.03989 | null |
2023-12-07 | Flux tunable graphene-based superconducting quantum circuits coupled to 3D cavity | Kuei-Lin Chiu et.al. | 2312.03985 | null |
2023-12-07 | PerSival: Neural-network-based visualisation for pervasive continuum-mechanical simulations in musculoskeletal biomechanics | David Rosin et.al. | 2312.03957 | null |
2023-12-06 | Controllable Human-Object Interaction Synthesis | Jiaman Li et.al. | 2312.03913 | null |
2023-12-06 | WonderJourney: Going from Anywhere to Everywhere | Hong-Xing Yu et.al. | 2312.03884 | null |
2023-12-06 | Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion | Kira Prabhu et.al. | 2312.03869 | null |
2023-12-06 | Confining Strings and Glueballs in $\mathbb{Z}_N$ Gauge Theories | Andreas Athenodorou et.al. | 2312.03855 | null |
2023-12-06 | Alpha-CLIP: A CLIP Model Focusing on Wherever You Want | Zeyi Sun et.al. | 2312.03818 | link |
2023-12-06 | XCube ( $\mathcal{X}^3$ ): Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies | Xuanchi Ren et.al. | 2312.03806 | link |
2023-12-06 | AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation | Xinzhou Wang et.al. | 2312.03795 | null |
2023-12-06 | Novel class discovery meets foundation models for 3D semantic segmentation | Luigi Riz et.al. | 2312.03782 | null |
2023-12-06 | FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability | Linze Li et.al. | 2312.03775 | null |
2023-12-06 | OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries | Yuhang Lu et.al. | 2312.03774 | link |
2023-12-05 | Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing | Yushi Lan et.al. | 2312.03763 | null |
2023-12-06 | Relightable Gaussian Codec Avatars | Shunsuke Saito et.al. | 2312.03704 | null |
2023-12-06 | Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning | Xinshun Wang et.al. | 2312.03703 | link |
2023-12-06 | The SAMI Galaxy Survey: $Σ_{\rm SFR}$ drives the presence of complex emission line profiles in star-forming galaxies | Henry R. M. Zovaro et.al. | 2312.03659 | link |
2023-12-06 | DreamComposer: Controllable 3D Object Generation via Multi-View Conditions | Yunhan Yang et.al. | 2312.03611 | link |
2023-12-06 | MMM: Generative Masked Motion Model | Ekkasit Pinyoanuntapong et.al. | 2312.03596 | link |
2023-12-06 | Contact type solutions and non-mixing of the 3D Euler equations | Robert Cardona et.al. | 2312.03514 | null |
2023-12-06 | How to detect the spacetime curvature without rulers and clocks. II. Three-dimensional spacetime | A. V. Nenashev et.al. | 2312.03487 | null |
2023-12-06 | Molecule Joint Auto-Encoding: Trajectory Pretraining with 2D and 3D Diffusion | Weitao Du et.al. | 2312.03475 | null |
2023-12-07 | HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting | Yuheng Jiang et.al. | 2312.03461 | null |
2023-12-06 | High-Quality Facial Geometry and Appearance Capture at Home | Yuxuan Han et.al. | 2312.03442 | link |
2023-12-06 | Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle | Youtian Lin et.al. | 2312.03431 | null |
2023-12-06 | The three limits of the hydrostatic approximation | Ken Furukawa et.al. | 2312.03418 | null |
2023-12-06 | Viscous rebound of a quasi-2D cylinder on a solid wall | Alicia Aguilar-Corona et.al. | 2312.03416 | null |
2023-12-06 | Strange higher-spin topological systems in 3D | Nicolas Boulanger et.al. | 2312.03382 | null |
2023-12-06 | Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method | Hongyu Huang et.al. | 2312.03372 | null |
2023-12-06 | RING-NeRF: A Versatile Architecture based on Residual Implicit Neural Grids | Doriand Petit et.al. | 2312.03357 | null |
2023-12-06 | Bile Duct Segmentation Methods Under 3D Slicer Applied to ERCP: Advantages and Disadvantages | Abdelhadi Essamlali et.al. | 2312.03356 | null |
2023-12-06 | PointMoment:Mixed-Moment-based Self-Supervised Representation Learning for 3D Point Clouds | Xin Cao et.al. | 2312.03350 | null |
2023-12-06 | VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation | Naoki Yokoyama et.al. | 2312.03275 | link |
2023-12-06 | Efficiency of Terrestrial Laser Scanning in Survey Works: Assessment, Modelling, and Monitoring | Fayez Tarsha Kurdi et.al. | 2312.03254 | null |
2023-12-06 | Experimental Investigation of the Structural Performance of Composite Structures Produced using Additive Manufacturing | Hunter Watts et.al. | 2312.03230 | null |
2023-12-07 | Seamless monolithic three-dimensional integration of single-crystalline films by growth | Ki Seok Kim et.al. | 2312.03206 | null |
2023-12-06 | Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Shijie Zhou et.al. | 2312.03203 | link |
2023-12-05 | Predicting Bone Degradation Using Vision Transformer and Synthetic Cellular Microstructures Dataset | Mohammad Saber Hashemi et.al. | 2312.03133 | null |
2023-12-05 | The DUNE Far Detector Vertical Drift Technology, Technical Design Report | DUNE Collaboration et.al. | 2312.03130 | null |
2023-12-05 | Fully Convolutional Slice-to-Volume Reconstruction for Single-Stack MRI | Sean I. Young et.al. | 2312.03102 | link |
2023-12-05 | ScAR: Scaling Adversarial Robustness for LiDAR Object Detection | Xiaohu Lu et.al. | 2312.03085 | link |
2023-12-05 | LooseControl: Lifting ControlNet for Generalized Depth Conditioning | Shariq Farooq Bhat et.al. | 2312.03079 | null |
2023-12-05 | Inherent limitations of LLMs regarding spatial information | He Yan et.al. | 2312.03042 | link |
2023-12-05 | LiDAR-based Person Re-identification | Wenxuan Guo et.al. | 2312.03033 | link |
2023-12-05 | Zero-Shot Point Cloud Registration | Weijie Wang et.al. | 2312.03032 | null |
2023-12-05 | Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians | Yuelang Xu et.al. | 2312.03029 | link |
2023-12-05 | Uni3DL: Unified Model for 3D and Language Understanding | Xiang Li et.al. | 2312.03026 | null |
2023-12-05 | Protein Language Model-Powered 3D Ligand Binding Site Prediction from Protein Sequence | Shuo Zhang et.al. | 2312.03016 | null |
2023-12-05 | PartSLIP++: Enhancing Low-Shot 3D Part Segmentation via Multi-View Instance Segmentation and Maximum Likelihood Estimation | Yuchen Zhou et.al. | 2312.03015 | link |
2023-12-05 | ReconFusion: 3D Reconstruction with Diffusion Priors | Rundi Wu et.al. | 2312.02981 | null |
2023-12-05 | GPT4Point: A Unified Framework for Point-Language Understanding and Generation | Zhangyang Qi et.al. | 2312.02980 | null |
2023-12-05 | Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World | Kiana Ehsani et.al. | 2312.02976 | null |
2023-12-05 | GauHuman: Articulated Gaussian Splatting from Monocular Human Videos | Shoukang Hu et.al. | 2312.02973 | link |
2023-12-05 | Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection | Cheng-Ju Ho et.al. | 2312.02966 | link |
2023-12-05 | Some locally Kneser graphs | A. E. Brouwer et.al. | 2312.02964 | null |
2023-12-05 | MVHumanNet: A Large-scale Dataset of Multi-view Daily Dressing Human Captures | Zhangyang Xiong et.al. | 2312.02963 | null |
2023-12-05 | HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting | Helisa Dhamo et.al. | 2312.02902 | null |
2023-12-07 | Comparative study of quantum emitter fabrication in wide bandgap materials using localized electron irradiation | Anand Kumar et.al. | 2312.02856 | null |
2023-12-05 | PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features | Tianshun Han et.al. | 2312.02781 | null |
2023-12-05 | Hidden-bottom hadronic transitions of $Υ(10753)$ | Shidong Liu et.al. | 2312.02761 | null |
2023-12-05 | R3D-SWIN:Use Shifted Window Attention for Single-View 3D Reconstruction | Chenhuan Li et.al. | 2312.02725 | null |
2023-12-05 | MyPortrait: Morphable Prior-Guided Personalized Portrait Generation | Bo Ding et.al. | 2312.02703 | null |
2023-12-05 | Neural Sign Actors: A diffusion model for 3D sign language production from text | Vasileios Baltatzis et.al. | 2312.02702 | null |
2023-12-05 | Revisit Human-Scene Interaction via Space Occupancy | Xinpeng Liu et.al. | 2312.02700 | null |
2023-12-05 | Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection? An Investigation and the HOI-Synth Domain Adaptation Benchmark | Rosario Leonardi et.al. | 2312.02672 | link |
2023-12-05 | TPA3D: Triplane Attention for Fast Text-to-3D Generation | Hong-En Chen et.al. | 2312.02647 | null |
2023-12-05 | A 3D kinetic Monte Carlo study of streamer discharges in CO $_2$ | Robert Marskar et.al. | 2312.02634 | null |
2023-12-05 | DreaMo: Articulated 3D Reconstruction From A Single Casual Video | Tao Tu et.al. | 2312.02617 | null |
2023-12-05 | Panoptica – instance-wise evaluation of 3D semantic and instance segmentation maps | Florian Kofler et.al. | 2312.02608 | link |
2023-12-05 | 6D Assembly Pose Estimation by Point Cloud Registration for Robot Manipulation | K. Samarawickrama et.al. | 2312.02593 | link |
2023-12-05 | Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent | Jianmeng Liu et.al. | 2312.02568 | null |
2023-12-05 | BOgen: Generating Part-Level 3D Designs Based on User Intention Inference through Bayesian Optimization and Variational Autoencoder | Seung Won Lee et.al. | 2312.02557 | null |
2023-12-05 | MASP: Scalable GNN-based Planning for Multi-Agent Navigation | Xinyi Yang et.al. | 2312.02522 | null |
2023-12-05 | Robust UAV Position and Attitude Estimation using Multiple GNSS Receivers for Laser-based 3D Mapping | Taro Suzuki et.al. | 2312.02485 | null |
2023-12-05 | Applications of Domain Adversarial Neural Network in phase transition of 3D Potts model | Xiangna Chen et.al. | 2312.02479 | null |
2023-12-05 | Watermarking for Neural Radiation Fields by Invertible Neural Network | Wenquan Sun et.al. | 2312.02456 | null |
2023-12-05 | Time-Relative RTK-GNSS: GNSS Loop Closure in Pose Graph Optimization | Taro Suzuki et.al. | 2312.02448 | null |
2023-12-05 | Fast non-autoregressive inverse folding with discrete diffusion | John J. Yang et.al. | 2312.02447 | link |
2023-12-05 | FINER: Flexible spectral-bias tuning in Implicit NEural Representation by Variable-periodic Activation Functions | Zhen Liu et.al. | 2312.02434 | null |
2023-12-05 | GNSS Odometry: Precise Trajectory Estimation Based on Carrier Phase Cycle Slip Estimation | Taro Suzuki et.al. | 2312.02424 | null |
2023-12-04 | Unsupervised Change Detection for Space Habitats Using 3D Point Clouds | Jamie Santos et.al. | 2312.02396 | link |
2023-12-04 | Insights to Molecular and Bulk Mechanical Properties of Glassy Carbon Through Molecular Dynamics Simulation and Mechanical Tensile Testing | Manali Kuntea et.al. | 2312.02388 | null |
2023-12-04 | Fast Fourier Transform periodic interpolation method for superposition sums in a periodic unit cell | Fangzhou Ai et.al. | 2312.02376 | link |
2023-12-04 | Towards General Purpose Vision Foundation Models for Medical Image Analysis: An Experimental Study of DINOv2 on Radiology Benchmarks | Mohammed Baharoon et.al. | 2312.02366 | link |
2023-12-04 | Constraining the H2 column densities in the diffuse interstellar medium using dust extinction and HI data | Raphael Skalidis et.al. | 2312.02274 | null |
2023-12-04 | Study of a cubic cavity resonator for gravitational waves detection in the microwave frequency range | Pablo Navarro et.al. | 2312.02270 | null |
2023-12-04 | Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding | Guofeng Mei et.al. | 2312.02244 | link |
2023-12-04 | GenEM: Physics-Informed Generative Cryo-Electron Microscopy | Jiakai Zhang et.al. | 2312.02235 | null |
2023-12-03 | InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars | Xiaochen Zhao et.al. | 2312.02222 | link |
2023-12-03 | Slice3D: Multi-Slice, Occlusion-Revealing, Single View 3D Reconstruction | Yizhi Wang et.al. | 2312.02221 | null |
2023-12-03 | FlashAvatar: High-Fidelity Digital Avatar Rendering at 300FPS | Jun Xiang et.al. | 2312.02214 | null |
2023-12-03 | AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing | Fan Yang et.al. | 2312.02209 | null |
2023-12-03 | A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing | Kangcheng Liu et.al. | 2312.02208 | link |
2023-12-02 | ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation | Peng Wang et.al. | 2312.02201 | null |
2023-12-04 | PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness | Anh-Quan Cao et.al. | 2312.02158 | link |
2023-12-04 | Mesh-Guided Neural Implicit Field Editing | Can Wang et.al. | 2312.02157 | null |
2023-12-04 | GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis | Shunyuan Zheng et.al. | 2312.02155 | link |
2023-12-04 | Steerers: A framework for rotation equivariant keypoint descriptors | Georg Bökman et.al. | 2312.02152 | link |
2023-12-04 | Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation | Bingxin Ke et.al. | 2312.02145 | link |
2023-12-04 | iMatching: Imperative Correspondence Learning | Zitong Zhan et.al. | 2312.02141 | link |
2023-12-04 | MANUS: Markerless Hand-Object Grasp Capture using Articulated 3D Gaussians | Chandradeep Pokhariya et.al. | 2312.02137 | null |
2023-12-04 | BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation | Qihang Zhang et.al. | 2312.02136 | null |
2023-12-04 | GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians | Liangxiao Hu et.al. | 2312.02134 | link |
2023-12-04 | SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM | Nikhil Keetha et.al. | 2312.02126 | link |
2023-12-04 | A Framework for Self-Intersecting Surfaces (SOS): Symmetric Optimisation for Stability | Christian Amend et.al. | 2312.02113 | null |
2023-12-04 | GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians | Shenhan Qian et.al. | 2312.02069 | link |
2023-12-04 | Implicit Learning of Scene Geometry from Poses for Global Localization | Mohammad Altillawi et.al. | 2312.02029 | null |
2023-12-04 | ColonNeRF: Neural Radiance Fields for High-Fidelity Long-Sequence Colonoscopy Reconstruction | Yufei Shi et.al. | 2312.02015 | null |
2023-12-06 | Towards Learning a Generalist Model for Embodied Navigation | Duo Zheng et.al. | 2312.02010 | link |
2023-12-04 | Semantics-aware Motion Retargeting with Vision-Language Models | Haodong Zhang et.al. | 2312.01964 | null |
2023-12-04 | Instance-guided Cartoon Editing with a Large-scale Dataset | Jian Lin et.al. | 2312.01943 | null |
2023-12-04 | COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction | Qihang Ma et.al. | 2312.01919 | link |
2023-12-04 | Eruptive events with exceptionally bright emission in HI Ly-alpha observed by the Metis coronagraph | G. Russano et.al. | 2312.01899 | null |
2023-12-04 | Effective models for generalized Newtonian fluids through a thin porous medium following the Carreau law | Maria Anguiano et.al. | 2312.01844 | null |
2023-12-04 | VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior | Xusen Sun et.al. | 2312.01841 | null |
2023-12-04 | A simulation method for the wetting dynamics of liquid droplets on deformable membranes | Marcel Mokbel et.al. | 2312.01817 | null |
2023-12-04 | Non-saturation intensity dependence of anisotropic third-order optical nonlinearity approaching the damage threshold in ZnSe and GaP | Jianpeng Ye et.al. | 2312.01814 | null |
2023-12-04 | Light Field Imaging in the Restrictive Object Space based on Flexible Angular Plane | Ping Zhou et.al. | 2312.01761 | null |
2023-12-04 | Simultaneous Alignment and Surface Regression Using Hybrid 2D-3D Networks for 3D Coherent Layer Segmentation of Retinal OCT Images with Full and Sparse Annotations | Hong Liu et.al. | 2312.01726 | link |
2023-12-04 | Tracking complex singularities of fluids on log-lattices | Quentin Pikeroen et.al. | 2312.01702 | null |
2023-12-05 | Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Yizhou Wang et.al. | 2312.01697 | link |
2023-12-04 | BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection | Zhenxin Li et.al. | 2312.01696 | link |
2023-12-04 | Adversarial Medical Image with Hierarchical Feature Hiding | Qingsong Yao et.al. | 2312.01679 | link |
2023-12-04 | Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training | Runze He et.al. | 2312.01663 | null |
2023-12-04 | GaussianHead: Impressive 3D Gaussian-based Head Avatars with Dynamic Hybrid Neural Field | Jie Wang et.al. | 2312.01632 | link |
2023-12-04 | Three-Dimensional Quantum Anomalous Hall Effect in Magnetic Topological Insulator Trilayers of Hundred-Nanometer Thickness | Yi-Fan Zhao et.al. | 2312.01614 | null |
2023-12-04 | Negative Magnetization and Magnetic Ordering of Rare Earth and Transition Metal Sublattices in NdFe0.5Cr0.5O3 | S. Kanthal et.al. | 2312.01595 | null |
2023-12-04 | Multi-View Person Matching and 3D Pose Estimation with Arbitrary Uncalibrated Camera Networks | Yan Xu et.al. | 2312.01561 | null |
2023-12-04 | Axisymmetric Virtual Elements For Problems of Elasticity and Plasticity | Louie L. Yaw et.al. | 2312.01559 | null |
2023-12-04 | Hyperspectral Image Compression Using Sampling and Implicit Neural Representations | Shima Rezasoltani et.al. | 2312.01558 | null |
2023-12-04 | Tomographic projection optimization for volumetric additive manufacturing with general band constraint Lp-norm minimization | Chi Chung Li et.al. | 2312.01548 | link |
2023-12-03 | SANeRF-HQ: Segment Anything for NeRF in High Quality | Yichen Liu et.al. | 2312.01531 | null |
2023-12-05 | T3D: Towards 3D Medical Image Understanding through Vision-Language Pre-training | Che Liu et.al. | 2312.01529 | null |
2023-12-03 | CityGen: Infinite and Controllable 3D City Layout Generation | Jie Deng et.al. | 2312.01508 | null |
2023-12-03 | FeltingReel: Density Varying Soft Fabrication with Reeling and Felting | Ping-Yi Wang et.al. | 2312.01482 | null |
2023-12-03 | Exploring Adversarial Robustness of LiDAR-Camera Fusion Model in Autonomous Driving | Bo Yang et.al. | 2312.01468 | null |
2023-12-03 | D $^2$ ST-Adapter: Disentangled-and-Deformable Spatio-Temporal Adapter for Few-shot Action Recognition | Wenjie Pei et.al. | 2312.01431 | link |
2023-12-03 | Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models | Shengqu Cai et.al. | 2312.01409 | null |
2023-12-03 | MoEC: Mixture of Experts Implicit Neural Compression | Jianchen Zhao et.al. | 2312.01361 | null |
2023-12-03 | ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models | Jeong-gi Kwak et.al. | 2312.01305 | null |
2023-12-03 | A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors | Kangcheng Liu et.al. | 2312.01262 | null |
2023-12-02 | RNb-NeuS: Reflectance and Normal-based Multi-View 3D Reconstruction | Baptiste Brument et.al. | 2312.01215 | link |
2023-12-02 | Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction | Devikalyan Das et.al. | 2312.01196 | null |
2023-12-02 | Has Anything Changed? 3D Change Detection by 2D Segmentation Masks | Aikaterini Adam et.al. | 2312.01148 | null |
2023-12-02 | FDM Printing: a Fabrication Method for Fluidic Soft Circuits? | Savita V. Kendre et.al. | 2312.01131 | null |
2023-12-02 | STREAM: Software Tool for Routing Efficiently Advanced Macrofluidics | Lehong Wang et.al. | 2312.01130 | link |
2023-12-02 | ControlDreamer: Stylized 3D Generation with Multi-View ControlNet | Yeongtak Oh et.al. | 2312.01129 | null |
2023-12-02 | Paved2Paradise: Cost-Effective and Scalable LiDAR Simulation by Factoring the Real World | Michael A. Alcorn et.al. | 2312.01117 | link |
2023-12-02 | OpEnCam: Lensless Optical Encryption Camera | Salman S. Khan et.al. | 2312.01077 | null |
2023-12-02 | Spectral-wise Implicit Neural Representation for Hyperspectral Image Reconstruction | Huan Chen et.al. | 2312.01061 | null |
2023-12-02 | Exploring and Improving the Spatial Reasoning Abilities of Large Language Models | Manasi Sharma et.al. | 2312.01054 | null |
2023-12-02 | Self-Evolving Neural Radiance Fields | Jaewoo Jung et.al. | 2312.01003 | link |
2023-12-02 | Interplay between strain and size quantization in a class of topological insulators based on inverted-band semiconductors | Alexander Khaetskii et.al. | 2312.00986 | null |
2023-12-02 | Noisy probing dose facilitated dose prediction for pencil beam scanning proton therapy: physics enhances generalizability | Lian Zhang et.al. | 2312.00975 | null |
2023-12-01 | Consistent Mesh Diffusion | Julian Knodt et.al. | 2312.00971 | null |
2023-12-01 | Object 6D pose estimation meets zero-shot learning | Andrea Caraffa et.al. | 2312.00947 | null |
2023-12-01 | Enhancing Diffusion Models with 3D Perspective Geometry Constraints | Rishi Upadhyay et.al. | 2312.00944 | null |
2023-12-01 | 3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing | Balamurugan Thambiraja et.al. | 2312.00870 | null |
2023-12-01 | Segment Any 3D Gaussians | Jiazhong Cen et.al. | 2312.00860 | null |
2023-12-01 | NeuSG: Neural Implicit Surface Reconstruction with 3D Gaussian Splatting Guidance | Hanlin Chen et.al. | 2312.00846 | null |
2023-12-01 | Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion | Huadong Li et.al. | 2312.00844 | link |
2023-11-30 | Lasagna: Layered Score Distillation for Disentangled Object Relighting | Dina Bashkirova et.al. | 2312.00833 | link |
2023-12-01 | CompuCell3D Model of Cell Migration Reproduces Chemotaxis | Pedro C. Dal-Castel et.al. | 2312.00776 | link |
2023-12-01 | Effects of three-dimensional slit geometry on flashback of premixed hydrogen flames in perforated burners | Filippo Fruzza et.al. | 2312.00744 | null |
2023-12-01 | Adversarial Score Distillation: When score distillation meets GAN | Min Wei et.al. | 2312.00739 | link |
2023-12-01 | Gaussian Grouping: Segment and Edit Anything in 3D Scenes | Mingqiao Ye et.al. | 2312.00732 | link |
2023-12-01 | Unsupervised Adaptive Implicit Neural Representation Learning for Scan-Specific MRI Reconstruction | Junwei Yang et.al. | 2312.00677 | null |
2023-12-01 | The Automatic Identification and Tracking of Coronal Flux Ropes – Part II: New Mathematical Morphology-based Flux Rope Extraction Method and Deflection Analysis | Andreas Wagner et.al. | 2312.00673 | null |
2023-12-01 | Generalized Label-Efficient 3D Scene Parsing via Hierarchical Feature Aligned Pre-Training and Region-Aware Fine-tuning | Kangcheng Liu et.al. | 2312.00663 | link |
2023-12-01 | How the zebra got its stripes: Curvature-dependent diffusion orients Turing patterns on 3D surfaces | Michael F. Staddon et.al. | 2312.00637 | null |
2023-12-01 | Towards Efficient 3D Object Detection in Bird’s-Eye-View Space for Autonomous Driving: A Convolutional-Only Approach | Yuxin Li et.al. | 2312.00633 | null |
2023-12-01 | UAVs and Birds: Enhancing Short-Range Navigation through Budgerigar Flight Studies | Md. Mahmudur Rahman et.al. | 2312.00597 | null |
2023-11-30 | LucidDreaming: Controllable Object-Centric 3D Generation | Zhaoning Wang et.al. | 2312.00588 | null |
2023-11-30 | MD-Splatting: Learning Metric Deformation from 4D Gaussians in Highly Deformable Scenes | Bardienus P. Duisterhof et.al. | 2312.00583 | null |
2023-12-01 | Novel 3D Geometry-Based Stochastic Models for Non-Isotropic MIMO Vehicle-to-Vehicle Channels | Yi Yuan et.al. | 2312.00550 | null |
2023-12-01 | LiDAR-based curb detection for ground truth annotation in automated driving validation | Jose Luis Apellániz et.al. | 2312.00534 | null |
2023-12-01 | DeepDR: Deep Structure-Aware RGB-D Inpainting for Diminished Reality | Christina Gsaxner et.al. | 2312.00532 | null |
2023-12-01 | Weak Electronic Correlations Observed in Magnetic Weyl Semimetal Mn $_3$ Ge | Susmita Changdar et.al. | 2312.00511 | null |
2023-12-01 | Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras | Mohammad Altillawi et.al. | 2312.00500 | null |
2023-12-01 | Learning Unorthogonalized Matrices for Rotation Estimation | Kerui Gu et.al. | 2312.00462 | null |
2023-12-01 | FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting | Zehao Zhu et.al. | 2312.00451 | null |
2023-12-01 | Nonlinear interaction of two cross-propagating plane waves | A. Matalliotakis et.al. | 2312.00445 | null |
2023-12-01 | CoLLiE: Collaborative Training of Large Language Models in an Efficient Way | Kai Lv et.al. | 2312.00407 | link |
2023-12-01 | Text-Guided 3D Face Synthesis – From Generation to Editing | Yunjie Wu et.al. | 2312.00375 | link |
2023-12-01 | On Novel Fixed-Point-Type Iterations with Structure-Preserving Doubling Algorithms for Stochastic Continuous-time Algebraic Riccati equations | Tsung-Ming Huang et.al. | 2312.00328 | null |
2023-12-01 | Universal Energy Functionals for Trapped Fermi Gases in Low Dimensions | Jiansen Zhang et.al. | 2312.00325 | null |
2023-12-01 | Improving Normalization with the James-Stein Estimator | Seyedalireza Khoshsirat et.al. | 2312.00313 | null |
2023-12-04 | 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation | Zidu Wang et.al. | 2312.00311 | link |
2023-11-30 | SparseGS: Real-Time 360° Sparse View Synthesis using Gaussian Splatting | Haolin Xiong et.al. | 2312.00206 | link |
2023-11-30 | System for Analysis of Wind Collocations (SAWC): A Novel Archive and Collocation Software Application for the Intercomparison of Winds from Multiple Observing Platforms | Katherine E. Lukens et.al. | 2312.00190 | null |
2023-11-30 | DynMF: Neural Motion Factorization for Real-time Dynamic View Synthesis with 3D Gaussian Splatting | Agelos Kratimenos et.al. | 2312.00112 | null |
2023-11-30 | Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering | Tao Lu et.al. | 2312.00109 | link |
2023-11-30 | GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs | Gege Gao et.al. | 2312.00093 | null |
2023-11-30 | X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation | Yiwei Ma et.al. | 2312.00085 | link |
2023-11-29 | MoMask: Generative Masked Modeling of 3D Human Motions | Chuan Guo et.al. | 2312.00063 | link |
2023-11-30 | Just Add $π$ ! Pose Induced Video Transformers for Understanding Activities of Daily Living | Dominick Reilly et.al. | 2311.18840 | link |
2023-11-30 | PoseGPT: Chatting about 3D Human Pose | Yao Feng et.al. | 2311.18836 | null |
2023-11-30 | Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction | Hsin-Ying Lee et.al. | 2311.18832 | link |
2023-11-30 | FoundPose: Unseen Object Pose Estimation with Foundation Features | Evin Pınar Örnek et.al. | 2311.18809 | null |
2023-11-30 | X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning | Artemis Panagopoulou et.al. | 2311.18799 | link |
2023-11-30 | Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data | Yu Deng et.al. | 2311.18729 | null |
2023-11-30 | Seg2Reg: Differentiable 2D Segmentation to 1D Regression Rendering for 360 Room Layout Reconstruction | Cheng Sun et.al. | 2311.18695 | null |
2023-11-30 | Multi-task learning with cross-task consistency for improved depth estimation in colonoscopy | Pedro Esteban Chavarrias Solano et.al. | 2311.18664 | null |
2023-11-30 | LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning | Sijin Chen et.al. | 2311.18651 | link |
2023-11-30 | DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars | Tobias Kirschstein et.al. | 2311.18635 | null |
2023-11-30 | DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image | Daoyi Gao et.al. | 2311.18610 | null |
2023-11-30 | A New Old Idea: Beam-Steering Reflectarrays for Efficient Sub-THz Multiuser MIMO | Krishan Kumar Tiwari et.al. | 2311.18593 | null |
2023-11-30 | Periodic Vibration Gaussian: Dynamic Urban Scene Reconstruction and Real-time Rendering | Yurui Chen et.al. | 2311.18561 | null |
2023-11-30 | PRS: Sharp Feature Priors for Resolution-Free Surface Remeshing | Natalia Soboleva et.al. | 2311.18494 | null |
2023-11-30 | Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding | Jin-Chuan Shi et.al. | 2311.18482 | link |
2023-11-30 | DGMem: Learning Visual Navigation Policy without Any Labels by Dynamic Graph Memory | Wenzhe Cai et.al. | 2311.18473 | null |
2023-11-30 | HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video | Zicong Fan et.al. | 2311.18448 | link |
2023-11-30 | E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning | Xiuhong Lin et.al. | 2311.18433 | link |
2023-11-30 | MV-CLIP: Multi-View CLIP for Zero-shot 3D Shape Recognition | Dan Song et.al. | 2311.18402 | null |
2023-11-30 | RainAI – Precipitation Nowcasting from Satellite Data | Rafael Pablos Sarabia et.al. | 2311.18398 | link |
2023-11-30 | Measurement of Enthalpy and Entropy of the Rate-Determining Step of a Model Electrocatalyst for the Oxygen Evolution Reaction | Joaquín Morales-Santelices et.al. | 2311.18396 | null |
2023-11-30 | A Novel Variational Approach for Multiphoton Microscopy Image Restoration: from PSF Estimation to 3D Deconvolution | Julien Ajdenbaum et.al. | 2311.18386 | null |
2023-11-30 | Room temperature polariton condensation from Whispering gallery modes in CsPbBr3 microplatelets | Laura Polimeno et.al. | 2311.18379 | null |
2023-11-30 | Advances in 3D Neural Stylization: A Survey | Yingshu Chen et.al. | 2311.18328 | link |
2023-11-30 | Reconstructing the normal and shape at specularities in endoscopy | Karim Makki et.al. | 2311.18299 | null |
2023-11-30 | CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt | Haiyao Xiao et.al. | 2311.18288 | null |
2023-11-30 | Dispersed Structured Light for Hyperspectral 3D Imaging | Suhyun Shin et.al. | 2311.18287 | null |
2023-11-30 | 3D carbon allotropes: Topological quantum materials with obstructed atomic insulating phases, multiple bulk-boundary correspondences, and real topology | Jianhua Wang et.al. | 2311.18276 | null |
2023-11-30 | Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives | Kristen Grauman et.al. | 2311.18259 | link |
2023-11-30 | Determining the core-collapse supernova explosion mechanism with current and future gravitational-wave observatories | Jade Powell et.al. | 2311.18221 | null |
2023-11-30 | Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications | Karren D. Yang et.al. | 2311.18168 | null |
2023-11-30 | Compact3D: Compressing Gaussian Splat Radiance Field Models with Vector Quantization | KL Navaneet et.al. | 2311.18159 | link |
2023-11-29 | Data-Driven Shape Sensing in Continuum Manipulators via Sliding Resistive Flex Sensors | Chenhan Zhang et.al. | 2311.18154 | null |
2023-11-29 | STF: Spatial Temporal Fusion for Trajectory Prediction | Pengqian Han et.al. | 2311.18149 | link |
2023-11-29 | Symmetries and Wavefunctions of Photons Confined in 3D Photonic Band Gap Superlattices | Marek Kozoň et.al. | 2311.18123 | null |
2023-11-29 | Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features | Thomas Wimmer et.al. | 2311.18113 | link |
2023-11-29 | DiffGEPCI: 3D MRI Synthesis from mGRE Signals using 2.5D Diffusion Model | Yuyang Hu et.al. | 2311.18073 | null |
2023-11-29 | ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction | Silvan Weder et.al. | 2311.18068 | null |
2023-11-29 | A Data-Driven, Non-Linear, Parameterized Reduced Order Model of Metal 3D Printing | Aaron L. Brown et.al. | 2311.18036 | null |
2023-11-29 | 4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling | Sherwin Bahmani et.al. | 2311.17984 | link |
2023-11-29 | GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces | Yingwenqi Jiang et.al. | 2311.17977 | null |
2023-11-29 | GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation | Baorui Ma et.al. | 2311.17971 | link |
2023-11-29 | AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text | Jianfeng Zhang et.al. | 2311.17917 | null |
2023-11-29 | HUGS: Human Gaussian Splats | Muhammed Kocabas et.al. | 2311.17910 | link |
2023-11-29 | CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting | Alexander Vilesov et.al. | 2311.17907 | null |
2023-11-30 | Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis | Jinqi Luo et.al. | 2311.17898 | link |
2023-11-29 | BCFT One-point Functions of Coulomb Branch Operators | Davide Bason et.al. | 2311.17888 | null |
2023-11-29 | FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information | Wen Jiang et.al. | 2311.17874 | link |
2023-11-29 | $\mathbb{Z}_{2}=0$ is topological too | Chao Lei et.al. | 2311.17859 | null |
2023-11-29 | Gaussian Shell Maps for Efficient 3D Human Generation | Rameen Abdal et.al. | 2311.17857 | link |
2023-11-29 | Evaluating VLMs for Score-Based, Multi-Probe Annotation of 3D Objects | Rishabh Kabra et.al. | 2311.17851 | null |
2023-11-30 | SPiC-E : Structural Priors in 3D Diffusion Models using Cross-Entity Attention | Etai Sella et.al. | 2311.17834 | null |
2023-11-29 | Coloring the Past: Neural Historical Buildings Reconstruction from Archival Photography | David Komorowicz et.al. | 2311.17810 | null |
2023-11-29 | PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-based 3D Object Detection | Weixin Mao et.al. | 2311.17770 | null |
2023-11-29 | Experimental and Theoretical Brownian Dynamics Analysis of Ion Transport During Cellular Electroporation of E. coli Bacteria | Juan González-Cuevas et.al. | 2311.17755 | null |
2023-11-29 | Cinematic Behavior Transfer via NeRF-based Differentiable Filming | Xuekun Jiang et.al. | 2311.17754 | null |
2023-11-29 | Robust Localization and Tracking of UAVs in OTFS-based Networks | Alessandro Nordio et.al. | 2311.17742 | null |
2023-11-29 | GenZI: Zero-Shot 3D Human-Scene Interaction Generation | Lei Li et.al. | 2311.17737 | null |
2023-11-29 | SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation | Mutian Xu et.al. | 2311.17707 | link |
2023-11-29 | Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications | Junyi Ma et.al. | 2311.17663 | link |
2023-11-29 | Volumetric Cloud Field Reconstruction | Jacob Lin et.al. | 2311.17657 | null |
2023-11-29 | The Limits of Water Maser Kinematics: Insights from High-Mass Protostar AFGL 5142-MM1 | Zulfazli Rosli et.al. | 2311.17636 | null |
2023-12-01 | ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model | Fukun Yin et.al. | 2311.17618 | link |
2023-11-29 | SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Ziqiao Peng et.al. | 2311.17590 | link |
2023-11-29 | Deep Learning 21cm Lightcones in 3D | Caroline Heneka et.al. | 2311.17553 | null |
2023-11-29 | Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation | Xingqun Qi et.al. | 2311.17532 | null |
2023-11-29 | Combinatorial quantum gravity and emergent 3D quantum behaviour | Carlo A. Trugenberger et.al. | 2311.17526 | null |
2023-11-30 | StructRe: Rewriting for Structured Shape Modeling | Jiepeng Wang et.al. | 2311.17510 | null |
2023-11-29 | Observational Chemical Signatures of the Past FU Ori Outbursts | Lis Zwicky et.al. | 2311.17499 | null |
2023-11-30 | W-HMR: Human Mesh Recovery in World Space with Weak-supervised Camera Calibration and Orientation Correction | Wei Yao et.al. | 2311.17460 | link |
2023-11-29 | Electric Field-induced Charge Transport in Redox-active Molecular Junctions | Ritu Gupta et.al. | 2311.17457 | null |
2023-11-29 | DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Diffusion Model | Jiuming Liu et.al. | 2311.17456 | link |
2023-11-29 | Sewing skyrmion and antiskyrmion by quadrupole of Bloch points | Jin Tang et.al. | 2311.17422 | null |
2023-11-29 | Observation of Hybrid Magnetic Skyrmion Bubbles in Fe3Sn2 Nanodisks | Lingyao Kong et.al. | 2311.17413 | null |
2023-11-29 | Drone Delivery Optimization | Saayuj Deshpande et.al. | 2311.17375 | null |
2023-11-29 | Generative Hierarchical Temporal Transformer for Hand Action Recognition and Motion Prediction | Yilin Wen et.al. | 2311.17366 | null |
2023-11-29 | Implicit-explicit Integrated Representations for Multi-view Video Compression | Chen Zhu et.al. | 2311.17350 | link |
2023-11-29 | NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields | Xiaoliang Liu et.al. | 2311.17332 | null |
2023-11-29 | Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation | Zhen Zhao et.al. | 2311.17325 | link |
2023-11-28 | Exceptional Mechanical Performance by Spatial Printing with Continuous Fiber | Guoxin Fang et.al. | 2311.17265 | null |
2023-11-28 | SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors | Dave Zhenyu Chen et.al. | 2311.17261 | null |
2023-11-28 | LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Zhiwen Fan et.al. | 2311.17245 | link |
2023-11-28 | Dynamics and spin alignment in massive, gravito-turbulent circumbinary discs around supermassive black hole binaries | Martin A. Bourne et.al. | 2311.17144 | null |
2023-11-28 | ConTex-Human: Free-View Rendering of Human from a Single Image with Texture-Consistent Synthesis | Xiangjun Gao et.al. | 2311.17123 | null |
2023-11-28 | REF $^2$ -NeRF: Reflection and Refraction aware Neural Radiance Field | Wooseok Kim et.al. | 2311.17116 | link |
2023-11-28 | Human Gaussian Splatting: Real-time Rendering of Animatable Avatars | Arthur Moreau et.al. | 2311.17113 | link |
2023-11-28 | On the Calibration of Human Pose Estimation | Kerui Gu et.al. | 2311.17105 | null |
2023-11-28 | Dynamic Change of Amplitude for OCT Functional Imaging | Yang Jianlong et.al. | 2311.17090 | null |
2023-11-28 | Multi-Scale 3D Gaussian Splatting for Anti-Aliased Rendering | Zhiwen Yan et.al. | 2311.17089 | null |
2023-11-28 | DepthSSC: Depth-Spatial Alignment and Dynamic Voxel Resolution for Monocular 3D Semantic Scene Completion | Jiawei Yao et.al. | 2311.17084 | null |
2023-11-28 | DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling | Linqi Zhou et.al. | 2311.17082 | link |
2023-11-28 | HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting | Xian Liu et.al. | 2311.17061 | null |
2023-11-28 | Material Palette: Extraction of Materials from a Single Image | Ivan Lopes et.al. | 2311.17060 | null |
2023-11-28 | ReMoS: Reactive 3D Motion Synthesis for Two-Person Interactions | Anindita Ghosh et.al. | 2311.17057 | link |
2023-11-28 | Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models | Zhengming Yu et.al. | 2311.17050 | null |
2023-11-28 | Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features | Niladri Shekhar Dutt et.al. | 2311.17024 | link |
2023-11-28 | HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion | Jingbo Zhang et.al. | 2311.16961 | null |
2023-11-28 | Three-dimensional internal flow evolution of an evaporating droplet and its role in particle deposition pattern | Jiaqi Li et.al. | 2311.16951 | null |
2023-11-28 | UC-NeRF: Neural Radiance Field for Under-Calibrated multi-view cameras in autonomous driving | Kai Cheng et.al. | 2311.16945 | null |
2023-11-28 | A fast, matrix-based method to perform omnidirectional pressure integration | Fernando Zigunov et.al. | 2311.16935 | link |
2023-11-28 | RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D | Lingteng Qiu et.al. | 2311.16918 | null |
2023-11-28 | Is Betelgeuse really rotating? Synthetic ALMA observations of large-scale convection in 3D simulations of Red Supergiants | Jing-Ze Ma et.al. | 2311.16885 | null |
2023-11-29 | A Unified Approach for Text- and Image-guided 4D Scene Generation | Yufeng Zheng et.al. | 2311.16854 | null |
2023-11-28 | Decomposer: Semi-supervised Learning of Image Restoration and Image Decomposition | Boris Meinardus et.al. | 2311.16829 | null |
2023-11-28 | DI-Net : Decomposed Implicit Garment Transfer Network for Digital Clothed 3D Human | Xiaojing Zhong et.al. | 2311.16818 | null |
2023-11-28 | Acquisition of high-quality three-dimensional electron diffuse scattering data | Romy Poppe et.al. | 2311.16817 | null |
2023-11-28 | A Novel 3D Non-stationary Localization-assisted ISAC Channel Model | Runruo Yang et.al. | 2311.16798 | null |
2023-11-28 | A General 3D Non-Stationary 5G Wireless Channel Model | Shangbin Wu et.al. | 2311.16783 | null |
2023-11-28 | Gradient-based Local Next-best-view Planning for Improved Perception of Targeted Plant Nodes | Akshay K. Burusa et.al. | 2311.16759 | link |
2023-11-28 | As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors | Seungwoo Yoo et.al. | 2311.16739 | null |
2023-11-28 | Point’n Move: Interactive Scene Object Manipulation on Gaussian Splatting Radiance Fields | Jiajun Huang et.al. | 2311.16737 | null |
2023-11-28 | Dynamically coupled kinetic chemistry in brown dwarf atmospheres – II. Cloud and chemistry connections in directly imaged sub-Jupiter exoplanets | Elspeth K. H. Lee et.al. | 2311.16722 | null |
2023-11-28 | SCALAR-NeRF: SCAlable LARge-scale Neural Radiance Fields for Scene Reconstruction | Yu Chen et.al. | 2311.16657 | null |
2023-11-28 | Simultaneous Analysis of Continuously Embedded Reissner-Mindlin Shells in 3D Bulk Domains | Michael Wolfgang Kaiser et.al. | 2311.16638 | null |
2023-11-28 | RGBGrasp: Image-based Object Grasping by Capturing Multiple Views during Robot Arm Movement with Neural Radiance Fields | Chang Liu et.al. | 2311.16592 | null |
2023-11-28 | GeoScaler: Geometry and Rendering-Aware Downsampling of 3D Mesh Textures | Sai Karthikey Pentapati et.al. | 2311.16581 | null |
2023-11-28 | DiffusionTalker: Personalization and Acceleration for Speech-Driven 3D Face Diffuser | Peng Chen et.al. | 2311.16565 | null |
2023-11-28 | Personalized Predictions of Glioblastoma Infiltration: Mathematical Models, Physics-Informed Neural Networks and Multimodal Scans | Ray Zirui Zhang et.al. | 2311.16536 | link |
2023-11-28 | 3D Teeth Reconstruction from Panoramic Radiographs using Neural Implicit Functions | Sihwa Park et.al. | 2311.16524 | null |
2023-11-28 | Rethinking Directional Integration in Neural Radiance Fields | Congyue Deng et.al. | 2311.16504 | null |
2023-11-27 | Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images | Shiu-hong Kao et.al. | 2311.16499 | link |
2023-11-28 | Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement | Jian Wang et.al. | 2311.16495 | null |
2023-11-27 | Mip-Splatting: Alias-free 3D Gaussian Splatting | Zehao Yu et.al. | 2311.16493 | null |
2023-11-27 | Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars | Yang Liu et.al. | 2311.16482 | link |
2023-11-24 | UniHPE: Towards Unified Human Pose Estimation via Contrastive Learning | Zhongyu Jiang et.al. | 2311.16477 | null |
2023-11-26 | GS-IR: 3D Gaussian Splatting for Inverse Rendering | Zhihao Liang et.al. | 2311.16473 | link |
2023-11-29 | Multi-3D-Models Registration-Based Augmented Reality (AR) Instructions for Assembly | Seda Tuzun Canadinc et.al. | 2311.16337 | null |
2023-11-27 | VehicleGAN: Pair-flexible Pose Guided Image Synthesis for Vehicle Re-identification | Baolu Li et.al. | 2311.16278 | null |
2023-11-27 | Seeing Beyond Cancer: Multi-Institutional Validation of Object Localization and 3D Semantic Segmentation using Deep Learning for Breast MRI | Arda Pekis et.al. | 2311.16213 | null |
2023-11-27 | Symphony: Symmetry-Equivariant Point-Centered Spherical Harmonics for Molecule Generation | Ameya Daigavane et.al. | 2311.16199 | link |
2023-11-27 | Generation of patient specific cardiac chamber models using generative neural networks under a Bayesian framework for electroanatomical mapping | Sunil Mathew et.al. | 2311.16197 | null |
2023-11-27 | GART: Gaussian Articulated Template Models | Jiahui Lei et.al. | 2311.16099 | null |
2023-11-27 | CG-HOI: Contact-Guided 3D Human-Object Interaction Generation | Christian Diller et.al. | 2311.16097 | null |
2023-11-27 | Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling | Zhe Li et.al. | 2311.16096 | link |
2023-11-27 | Three-dimensional $\mathbb{Z}$ topological insulators without reflection symmetry | Alexander C. Tyner et.al. | 2311.16092 | null |
2023-11-27 | ViT-Lens-2: Gateway to Omni-modal Intelligence | Weixian Lei et.al. | 2311.16081 | link |
2023-11-27 | Exploring Attribute Variations in Style-based GANs using Diffusion Models | Rishubh Parihar et.al. | 2311.16052 | null |
2023-11-27 | Relightable 3D Gaussian: Real-time Point Cloud Relighting with BRDF Decomposition and Ray Tracing | Jian Gao et.al. | 2311.16043 | null |
2023-11-27 | Weakly-Supervised 3D Reconstruction of Clothed Humans via Normal Maps | Jane Wu et.al. | 2311.16042 | null |
2023-11-27 | OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving | Wenzhao Zheng et.al. | 2311.16038 | link |
2023-11-27 | GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions | Jiemin Fang et.al. | 2311.16037 | null |
2023-11-27 | Direct 3D imaging through spatial coherence of light | Gianlorenzo Massaro et.al. | 2311.16002 | null |
2023-11-27 | Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion | Yuanxun Lu et.al. | 2311.15980 | null |
2023-11-27 | Text2Loc: 3D Point Cloud Localization from Natural Language | Yan Xia et.al. | 2311.15977 | null |
2023-11-27 | Individual Nanostructures in an Epsilon-Near-Zero Material Probed with 3D-Sculpted Light | Brian Kantor et.al. | 2311.15942 | null |
2023-11-27 | SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion | Hsuan-I Ho et.al. | 2311.15855 | link |
2023-11-27 | Syn3DWound: A Synthetic Dataset for 3D Wound Bed Analysis | Léo Lebrat et.al. | 2311.15836 | null |
2023-11-27 | High throughput interactome determination via sulfur anomalous scattering | Mattia Miotto et.al. | 2311.15802 | null |
2023-11-27 | Impact of coordinate frames on mode formation in twisted waveguides | Johannes Bürger et.al. | 2311.15770 | null |
2023-11-27 | Topological skyrmion semimetals | Shu-Wei Liu et.al. | 2311.15753 | null |
2023-11-27 | Variational Autoencoders for Feature Exploration and Malignancy Prediction of Lung Lesions | Benjamin Keel et.al. | 2311.15719 | link |
2023-11-27 | SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation | Jiehong Lin et.al. | 2311.15707 | link |
2023-11-27 | MOT-DETR: 3D Single Shot Detection and Tracking with Transformers to build 3D representations for Agro-Food Robots | David Rapado-Rincon et.al. | 2311.15674 | link |
2023-11-27 | Deformation-Guided Unsupervised Non-Rigid Shape Matching | Aymen Merrouche et.al. | 2311.15668 | null |
2023-11-27 | PaintNeSF: Artistic Creation of Stylized Scenes with Vectorized 3D Strokes | Hao-Bin Duan et.al. | 2311.15637 | null |
2023-11-27 | 2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation | Ozan Unal et.al. | 2311.15605 | null |
2023-11-27 | EucliDreamer: Fast and High-Quality Texturing for 3D Models with Stable Diffusion Depth | Cindy Le et.al. | 2311.15573 | null |
2023-11-27 | ET3D: Efficient Text-to-3D Generation via Multi-View Distillation | Yiming Chen et.al. | 2311.15561 | null |
2023-11-27 | Arbitrary Engineering of Spatial Caustics with 3D-printed Metasurfaces | Xiaoyan Zhou et.al. | 2311.15542 | null |
2023-11-27 | Distributional Hessian and divdiv complexes on triangulation and cohomology | Kaibo Hu et.al. | 2311.15482 | null |
2023-11-27 | AerialBooth: Mutual Information Guidance for Text Controlled Aerial View Synthesis from a Single Image | Divya Kothandaraman et.al. | 2311.15478 | link |
2023-11-27 | Unexpected Field Evaporation Sequence in $γ$ -TiAl | Jiayuwen Qi et.al. | 2311.15472 | null |
2023-11-26 | Functional Diffusion | Biao Zhang et.al. | 2311.15435 | null |
2023-11-26 | Wired Perspectives: Multi-View Wire Art Embraces Generative AI | Zhiyu Qu et.al. | 2311.15421 | null |
2023-11-26 | The Cauchy problem for a helical vortex filament in 3D Navier Stokes | Francisco Gancedo et.al. | 2311.15413 | null |
2023-11-26 | Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding | Zhihao Yuan et.al. | 2311.15383 | link |
2023-11-26 | Exploring Mid-Air Hand Interaction in Data Visualization | Zona Kostic et.al. | 2311.15372 | null |
2023-11-26 | ASI: Accuracy-Stability Index for Evaluating Deep Learning Models | Wei Dai et.al. | 2311.15332 | null |
2023-11-26 | Colossal Magnetoresistance in Twisted Intertwined Graphene Spirals | Yiwen Zhang et.al. | 2311.15319 | null |
2023-11-26 | Obj-NeRF: Extract Object NeRFs from Multi-view Images | Zhiyi Li et.al. | 2311.15291 | null |
2023-11-26 | GAIA: Zero-shot Talking Avatar Generation | Tianyu He et.al. | 2311.15230 | null |
2023-11-26 | Asymptotically Compatible Schemes for Nonlocal Ohta Kawasaki Model | Wangbo Luo et.al. | 2311.15186 | null |
2023-11-26 | Sampling metrics for robust reconstructions in multislice ptychography: Theory and experiment | Colin Gilgenbach et.al. | 2311.15181 | null |
2023-11-26 | Angular-Distance Based Channel Estimation for Holographic MIMO | Yuanbin Chen et.al. | 2311.15158 | null |
2023-11-25 | Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets | Andreas Blattmann et.al. | 2311.15127 | link |
2023-11-25 | X-Ray to CT Rigid Registration Using Scene Coordinate Regression | Pragyan Shrestha et.al. | 2311.15087 | link |
2023-11-25 | Spectrum Sharing between UAV-based Wireless Mesh Networks and Ground Networks | Zhiqing Wei et.al. | 2311.15005 | null |
2023-11-25 | The Performance Analysis of Spectrum Sharing between UAV enabled Wireless Mesh Networks and Ground Networks | Zhiqing Wei et.al. | 2311.14988 | null |
2023-11-25 | SAME++: A Self-supervised Anatomical eMbeddings Enhanced medical image registration framework using stable sampling and regularized transformation | Lin Tian et.al. | 2311.14986 | link |
2023-11-25 | Multi-task Planar Reconstruction with Feature Warping Guidance | Luan Wei et.al. | 2311.14981 | link |
2023-11-25 | Imaging a Semi-Analytical Jet model Generated by 3D GRMHD Simulation | Ye Shen et.al. | 2311.14954 | null |
2023-11-25 | View-Based Luminance Mapping in Open Workplace | Guanzhou Ji et.al. | 2311.14927 | null |
2023-11-25 | Resolution- and Stimulus-agnostic Super-Resolution of Ultra-High-Field Functional MRI: Application to Visual Studies | Hongwei Bran Li et.al. | 2311.14918 | null |
2023-11-25 | Towards Scalable 3D Anomaly Detection and Localization: A Benchmark via 3D Anomaly Synthesis and A Self-Supervised Learning Network | Wenqiao Li et.al. | 2311.14897 | link |
2023-11-24 | Multi-scale energy homogenization for 3D printed microstructures with a Diritchlet boundary condition relaxation under plastic deformation | Antonio Tabanera et.al. | 2311.14870 | null |
2023-11-24 | Unified Medical Image Pre-training in Language-Guided Common Semantic Space | Xiaoxuan He et.al. | 2311.14851 | null |
2023-11-24 | Unveiling the 3D structure of nova shells with MUSE – The case of RR Pic | Lientur Celedón et.al. | 2311.14843 | null |
2023-11-24 | Magnetochiral anisotropy-induced nonlinear planar Hall effect in Topological Insulator surface states | D. C. Marinescu et.al. | 2311.14841 | null |
2023-11-24 | Robust Joint Estimation of Galaxy Redshift and Spectral Templates using Online Dictionary Learning | Sean Bryan et.al. | 2311.14812 | link |
2023-11-24 | Rashba splitting in polar-nonpolar sandwich heterostructure : A DFT Study | Sanchari Bhattacharya et.al. | 2311.14787 | null |
2023-11-24 | Neural Style Transfer for Computer Games | Eleftherios Ioannou et.al. | 2311.14617 | null |
2023-11-24 | Animate124: Animating One Image to 4D Dynamic Scene | Yuyang Zhao et.al. | 2311.14603 | null |
2023-11-24 | On the Existence and long time behaviour of $H^{1}$-Weak Solutions for $2, 3d$-Stochastic $3^{rd}$ -Grade Fluids Equations | Raya Nouira et.al. | 2311.14596 | null |
2023-11-24 | Visualizing Plasma Physics Simulations in Immersive Environments | Nuno Verdelho Trindade et.al. | 2311.14593 | null |
2023-11-24 | GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Yiwen Chen et.al. | 2311.14521 | link |
2023-11-24 | Morphing Graph Drawings in the Presence of Point Obstacles | Oksana Firman et.al. | 2311.14516 | null |
2023-11-24 | MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation | Zhiqi Li et.al. | 2311.14494 | link |
2023-11-24 | OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Maxim Kolodiazhnyi et.al. | 2311.14405 | link |
2023-11-24 | Analytic solution for pulse wave propagation in flexible tubes with application to patient-specific arterial tree | Peishuo Wu et.al. | 2311.14345 | null |
2023-11-24 | Binarized 3D Whole-body Human Mesh Recovery | Zhiteng Li et.al. | 2311.14323 | link |
2023-11-24 | Decouple Content and Motion for Conditional Image-to-Video Generation | Cuifeng Shen et.al. | 2311.14294 | null |
2023-11-24 | Electron and proton energization in 3D reconnecting current sheets in semirelativistic plasma with guide magnetic field | Gregory R. Werner et.al. | 2311.14290 | null |
2023-11-24 | ZeroPS: High-quality Cross-modal Knowledge Transfer for Zero-Shot 3D Part Segmentation | Yuheng Xue et.al. | 2311.14262 | null |
2023-11-24 | Exploring percolation phase transition in the three-dimensional Ising model with machine learning | Ranran Guo et.al. | 2311.14245 | null |
2023-11-24 | RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling | Xiaoyue Wan et.al. | 2311.14242 | null |
2023-11-23 | Enhancing mTBI Diagnosis with Residual Triplet Convolutional Neural Network Using 3D CT | Hanem Ellethy et.al. | 2311.14197 | null |
2023-11-23 | HACD: Hand-Aware Conditional Diffusion for Monocular Hand-Held Object Reconstruction | Bowen Fu et.al. | 2311.14189 | null |
2023-11-23 | Disc Tearing in a Be Star: Predicted 3D Observations | M. W. Suffak et.al. | 2311.14185 | null |
2023-11-23 | TCuPGAN: A novel framework developed for optimizing human-machine interactions in citizen science | Ramanakumar Sankar et.al. | 2311.14177 | null |
2023-11-23 | GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence | Van Nguyen Nguyen et.al. | 2311.14155 | link |
2023-11-23 | Automated 3D Tumor Segmentation using Temporal Cubic PatchGAN (TCuP-GAN) | Kameswara Bharadwaj Mantha et.al. | 2311.14148 | null |
2023-11-23 | An approach to solve the coarse-grained Protein folding problem in a Quantum Computer | Jaya Vasavi P et.al. | 2311.14141 | null |
2023-11-23 | Phonon collapse and anharmonic melting of the 3D charge-density wave in kagome metals | Martin Gutierrez-Amigo et.al. | 2311.14112 | null |
2023-11-23 | MonoNav: MAV Navigation via Monocular Depth Estimation and Reconstruction | Nathaniel Simon et.al. | 2311.14100 | null |
2023-11-23 | 3D Printed Discrete Dielectric Lens With Improved Matching Layers | Juan Andrés Vásquez-Peralvo et.al. | 2311.14065 | null |
2023-11-23 | Multidimensional surrogate modelling for Airborne TDEM data | Wouter Deleersnyder et.al. | 2311.13998 | null |
2023-11-23 | GRJointNET: Synergistic Completion and Part Segmentation on 3D Incomplete Point Clouds | Yigit Gurses et.al. | 2311.13997 | null |
2023-11-23 | Facilitating Human-Robot Collaboration through Natural Vocal Conversations | Davide Ferrari et.al. | 2311.13973 | null |
2023-11-23 | Investigating the use of publicly available natural videos to learn Dynamic MR image reconstruction | Olivier Jaubert et.al. | 2311.13963 | link |
2023-11-23 | Angular momentum and chemical transport by azimuthal magnetorotational instability in radiative stellar interiors | Domenico G. Meduri et.al. | 2311.13962 | null |
2023-11-23 | Investigating microstructure-property relationships of nonwovens by model-based virtual materials testing | Matthias Weber et.al. | 2311.13944 | null |
2023-11-23 | Coronal dimmings as indicators of early CME propagation direction | Shantanu Jain et.al. | 2311.13942 | null |
2023-11-23 | 3D microstructure characterization of Cu 25Cr solid state sintered alloy using X-ray computed tomography and machine learning assisted segmentation | Lucas Varoto et.al. | 2311.13904 | null |
2023-11-23 | A reduced basis warm-start iterative solver for the parameterized systems | Shijin Hou et.al. | 2311.13862 | null |
2023-11-23 | Existence of the axisymmetric weak solution to the 3D isothermal stationary compressible Navier-Stokes equations | Xinyu Fan et.al. | 2311.13791 | null |
2023-11-23 | GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence | Pengyuan Wang et.al. | 2311.13777 | null |
2023-11-23 | 3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology | Asma Ben Abacha et.al. | 2311.13752 | link |
2023-11-23 | Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder | Xiaohao Xu et.al. | 2311.13750 | null |
2023-11-22 | Multi-view Hybrid Graph Convolutional Network for Volume-to-mesh Reconstruction in Cardiovascular MRI | Nicolás Gaggion et.al. | 2311.13706 | link |
2023-11-22 | Compact 3D Gaussian Representation for Radiance Field | Joo Chan Lee et.al. | 2311.13681 | link |
2023-11-22 | GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar | Berna Kabadayi et.al. | 2311.13655 | null |
2023-11-22 | Quantum energetics of a non-commuting measurement | Xiayu Linpeng et.al. | 2311.13634 | null |
2023-11-22 | Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning | Kai Yu et.al. | 2311.13617 | null |
2023-11-22 | XAGen: 3D Expressive Human Avatars Generation | Zhongcong Xu et.al. | 2311.13574 | null |
2023-11-22 | WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space | Katja Schwarz et.al. | 2311.13570 | null |
2023-11-22 | Medical Image Retrieval Using Pretrained Embeddings | Farnaz Khun Jush et.al. | 2311.13547 | null |
2023-11-22 | Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression | Tam Thuc Do et.al. | 2311.13539 | null |
2023-11-22 | Volumetric 3D Point Cloud Attribute Compression: Learned polynomial bilateral filter for prediction | Tam Thuc Do et.al. | 2311.13533 | null |
2023-11-22 | Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure | Ian Dunn et.al. | 2311.13466 | link |
2023-11-22 | Animatable 3D Gaussians for High-fidelity Synthesis of Human Motions | Keyang Ye et.al. | 2311.13404 | null |
2023-11-22 | Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images | Jaeyoung Chung et.al. | 2311.13398 | link |
2023-11-23 | LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes | Jaeyoung Chung et.al. | 2311.13384 | null |
2023-11-22 | Global existence of suitable weak solutions to the 3D chemotaxis-Navier-Stokes equations | Xiaomeng Chen et.al. | 2311.13343 | null |
2023-11-22 | Deep Learning for Vascular Segmentation and Applications in Phase Contrast Tomography Imaging | Ekin Yagis et.al. | 2311.13319 | null |
2023-11-22 | On the parallel solution of hydro-mechanical problems with fracture networks and contact conditions | Jan Stebel et.al. | 2311.13310 | null |
2023-11-22 | Retargeting Visual Data with Deformation Fields | Tim Elsner et.al. | 2311.13297 | null |
2023-11-22 | Gravitational repulsive effects in 3D regular black holes | Orlando Luongo et.al. | 2311.13264 | null |
2023-11-22 | TSegFormer: 3D Tooth Segmentation in Intraoral Scans with Geometry Guided Transformer | Huimin Xiong et.al. | 2311.13234 | link |
2023-11-22 | Robot at the Mirror: Learning to Imitate via Associating Self-supervised Models | Andrej Lúčny et.al. | 2311.13226 | link |
2023-11-23 | DRIFu: Differentiable Rendering and Implicit Function-based Single-View 3D Reconstruction | Zijian Kuang et.al. | 2311.13199 | link |
2023-11-22 | Differentiable Radio Frequency Ray Tracing for Millimeter-Wave Sensing | Xingyu Chen et.al. | 2311.13182 | null |
2023-11-22 | Volumetric Reconstruction Resolves Off-Resonance Artifacts in Static and Dynamic PROPELLER MRI | Annesha Ghosh et.al. | 2311.13177 | link |
2023-11-22 | 3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization | Jianwei Feng et.al. | 2311.13168 | null |
2023-11-22 | Test-Time Augmentation for 3D Point Cloud Classification and Segmentation | Tuan-Anh Vu et.al. | 2311.13152 | null |
2023-11-22 | DAE-Net: Deforming Auto-Encoder for fine-grained shape co-segmentation | Zhiqin Chen et.al. | 2311.13125 | link |
2023-11-22 | Automated Measurement of Pericoronary Adipose Tissue Attenuation and Volume in CT Angiography | Andrew M. Nguyen et.al. | 2311.13100 | null |
2023-11-22 | Terminal Phase Navigation for AUV Docking: An Innovative Electromagnetic Approach | Yevgeni Gutnik et.al. | 2311.13078 | null |
2023-11-22 | Non-equilibrium dynamics of topological defects in the 3d O(2) model | Edgar López-Contreras et.al. | 2311.13074 | null |
2023-11-21 | Training Deep 3D Convolutional Neural Networks to Extract BSM Physics Parameters Directly from HEP Data: a Proof-of-Concept Study Using Monte Carlo Simulations | S. Dubey et.al. | 2311.13060 | link |
2023-11-21 | 3D Compression Using Neural Fields | Janis Postels et.al. | 2311.13009 | null |
2023-11-21 | A Cosheaf Theory of Reciprocal Figures: Planar and Higher Genus Graphic Statics | Zoe Cooperband et.al. | 2311.12946 | null |
2023-11-21 | Argyres-Douglas Theories, IR N-ality and Complete Graphs | Anindya Dey et.al. | 2311.12931 | null |
2023-11-21 | An Efficient 3D Gaussian Representation for Monocular/Multi-view Dynamic Scenes | Kai Katsumata et.al. | 2311.12897 | null |
2023-11-21 | Text-Guided Texturing by Synchronized Multi-View Diffusion | Yuxin Liu et.al. | 2311.12891 | link |
2023-11-19 | Global Strong Solutions to the incompressible Magnetohydrodynamic Equations with Density-Dependent Viscosity and Vacuum in 3D Exterior Domains | Bing Yuan et.al. | 2311.12873 | null |
2023-11-21 | Physics-guided Shape-from-Template: Monocular Video Perception through Neural Surrogate Models | David Stotko et.al. | 2311.12796 | link |
2023-11-21 | The need for spatially resolved observations of PAHs in protoplanetary discs | K. Lange et.al. | 2311.12794 | null |
2023-11-21 | SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering | Antoine Guédon et.al. | 2311.12775 | link |
2023-11-21 | SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction | Yuanhui Huang et.al. | 2311.12754 | link |
2023-11-21 | Non-radial NLS equation with competing inhomogeneous nonlinearities: Ground states, Blow-up and scattering | Tianxiang Gou et.al. | 2311.12693 | null |
2023-11-21 | Leveraging Unlabeled Data for 3D Medical Image Segmentation through Self-Supervised Contrastive Learning | Sanaz Karimijafarbigloo et.al. | 2311.12617 | null |
2023-11-21 | Reconstructing the Baryonic Acoustic Oscillations in the presence of photo- $z$ uncertainties | Kwan Chuen Chan et.al. | 2311.12611 | null |
2023-11-21 | TouchSDF: A DeepSDF Approach for 3D Shape Reconstruction using Vision-Based Tactile Sensing | Mauro Comi et.al. | 2311.12602 | null |
2023-11-21 | HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation | Yongliang Lin et.al. | 2311.12588 | link |
2023-11-21 | Watch Your Adjoints! Lack of Mesh Convergence in Inviscid Adjoint Solutions | Carlos Lozano et.al. | 2311.12504 | null |
2023-11-21 | AR Visualization System for Ship Detection and Recognition Based on AI | Ziqi Ye et.al. | 2311.12430 | null |
2023-11-21 | Two Views Are Better than One: Monocular 3D Pose Estimation with Multiview Consistency | Christian Keilstrup Ingwersen et.al. | 2311.12421 | null |
2023-11-21 | Autonomous Exploration of Unknown 3D Environments Using a Frontier-Based Collector Strategy | Ivan D. Changoluisa Caiza et.al. | 2311.12408 | link |
2023-11-21 | Learning Part Motion of Articulated Objects Using Spatially Continuous Neural Implicit Representations | Yushi Du et.al. | 2311.12407 | null |
2023-11-21 | Semi-supervised Medical Image Segmentation via Query Distribution Consistency | Rong Wu et.al. | 2311.12364 | link |
2023-11-21 | Evidence of filamentary superconductivity in pressurized La3Ni2O7 single crystals | Yazhou Zhou et.al. | 2311.12361 | null |
2023-11-21 | Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers | Bo Sun et.al. | 2311.12291 | null |
2023-11-21 | Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs | Zhentian Qian et.al. | 2311.12245 | null |
2023-11-22 | PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics | Tianyi Xie et.al. | 2311.12198 | null |
2023-11-20 | LABELMAKER: Automatic Semantic Label Generation from RGB-D Trajectories | Silvan Weder et.al. | 2311.12174 | link |
2023-11-20 | Model-aware 3D Eye Gaze from Weak and Few-shot Supervisions | Nikola Popovic et.al. | 2311.12157 | link |
2023-11-20 | Uncertainty Estimation in Contrast-Enhanced MR Image Translation with Multi-Axis Fusion | Ivo M. Baltruschat et.al. | 2311.12153 | null |
2023-11-20 | Mixing-Denoising Generalizable Occupancy Networks | Amine Ouasfi et.al. | 2311.12125 | null |
2023-11-20 | A Comprehensive Theory for Neutron Star and Black Hole Kicks and Induced Spins | Adam Burrows et.al. | 2311.12109 | null |
2023-11-20 | Pyramid Diffusion for Fine 3D Large Scene Generation | Yuheng Liu et.al. | 2311.12085 | link |
2023-11-18 | DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields | Yu Chi et.al. | 2311.12063 | link |
2023-11-18 | PBWR: Parametric Building Wireframe Reconstruction from Aerial LiDAR Point Clouds | Shangfeng Huang et.al. | 2311.12062 | null |
2023-11-18 | Towards Function Space Mesh Watermarking: Protecting the Copyright of Signed Distance Fields | Xingyu Zhu et.al. | 2311.12059 | null |
2023-11-18 | FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin | Zichen Yu et.al. | 2311.12058 | link |
2023-11-18 | 3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing | Haoran Li et.al. | 2311.12050 | null |
2023-11-17 | Efficient Domain Adaptation via Generative Prior for 3D Infant Pose Estimation | Zhuoran Zhou et.al. | 2311.12043 | null |
2023-11-20 | An on-chip platform for multi-degree-of-freedom control of two-dimensional quantum and nonlinear materials | Haoning Tang et.al. | 2311.12030 | null |
2023-11-20 | Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation | Wenhao Li et.al. | 2311.12028 | link |
2023-11-23 | PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction | Peng Wang et.al. | 2311.12024 | null |
2023-11-20 | LiDAR-HMR: 3D Human Mesh Recovery from LiDAR | Bohao Fan et.al. | 2311.11971 | link |
2023-11-20 | Bozonized Momentum Distribution of a Fermi Gas via Friedrichs Diagrams | Sascha Lill et.al. | 2311.11945 | null |
2023-11-20 | Phonon dynamics for light dark matter detection | Martí Raya-Moreno et.al. | 2311.11930 | null |
2023-11-20 | Deciphering the solar coronal heating: Energizing small-scale loops through surface convection | D. Nóbrega-Siverio et.al. | 2311.11912 | null |
2023-11-20 | GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding | Hao Li et.al. | 2311.11863 | null |
2023-11-20 | Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields | Zhiyuan Min et.al. | 2311.11845 | link |
2023-11-20 | Holistic Inverse Rendering of Complex Facade via Aerial 3D Scanning | Zixuan Xie et.al. | 2311.11825 | null |
2023-11-20 | CityScope: Enhanced Localozation and Synchronizing AR for Dynamic Urban Weather Visualization | Tzu Hsin Hsieh et.al. | 2311.11783 | null |
2023-11-20 | Polarization effects in ultrafast laser-written directional couplers | Zhi-Kai Pong et.al. | 2311.11743 | null |
2023-11-20 | A Fast and Scalable Computational Topology Framework for the Euler Characteristic | Daniel J. Laky et.al. | 2311.11740 | null |
2023-11-20 | Sparse4D v3: Advancing End-to-End 3D Detection and Tracking | Xuewu Lin et.al. | 2311.11722 | link |
2023-11-20 | Tailored Perdew-Burke-Ernzerhof functionals for improved band gap predictions in Zn monochalcogenides | Satadeep Bhattacharjee et.al. | 2311.11702 | null |
2023-11-21 | GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting | Chi Yan et.al. | 2311.11700 | null |
2023-11-20 | Quenched disorder and instability control dynamic fracture in three dimensions | Yuri Lubomirsky et.al. | 2311.11692 | null |
2023-11-20 | ViP-Mixer: A Convolutional Mixer for Video Prediction | Xin Zheng et.al. | 2311.11683 | null |
2023-11-20 | OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning | Haiyang Ying et.al. | 2311.11666 | link |
2023-11-20 | Enhanced Spatio-Temporal Context for Temporally Consistent Robust 3D Human Motion Recovery from Monocular Videos | Sushovan Chanda et.al. | 2311.11662 | null |
2023-11-20 | A 3D Multi-Style Cross-Modality Segmentation Framework for Segmenting Vestibular Schwannoma and Cochlea | Yuzhou Zhuang et.al. | 2311.11578 | null |
2023-11-20 | Ground state of the $S$ =1/2 pyrochlore Heisenberg antiferromagnet: A quantum spin liquid emergent from dimensional reduction | Rico Pohle et.al. | 2311.11561 | null |
2023-11-19 | Two-step BEC coming from a temperature dependent energy gap | Juan José Valencia Acevedo et.al. | 2311.11447 | null |
2023-11-19 | SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints | Aditya Nalgunda Ganesh et.al. | 2311.11371 | null |
2023-11-19 | Local environment-based machine learning for molecular adsorption energy prediction | Yifan Li et.al. | 2311.11364 | null |
2023-11-19 | The Gamma Ray Origin in RXJ0852.0-4622 Quantifying the Hadronic and Leptonic Components: Further Evidence for the Cosmic Ray Acceleration in Young Shell-type SNRs | Yasuo Fukui et.al. | 2311.11355 | null |
2023-11-22 | LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching | Yixun Liang et.al. | 2311.11284 | link |
2023-11-19 | Holography in Flat Spacetimes: the case for Carroll | Arjun Bagchi et.al. | 2311.11246 | null |
2023-11-19 | AtomXR: Streamlined XR Prototyping with Natural Language and Immersive Physical Interaction | Alice Cai et.al. | 2311.11238 | null |
2023-11-19 | A Universal Framework for Accurate and Efficient Geometric Deep Learning of Molecular Systems | Shuo Zhang et.al. | 2311.11228 | link |
2023-11-19 | Wireless Regional Imaging through Reconfigurable Intelligent Surfaces: Passive Mode | Fuhai Wang et.al. | 2311.11222 | null |
2023-11-19 | GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise | Xinhai Li et.al. | 2311.11221 | null |
2023-11-19 | 3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Images | Tudor Jianu et.al. | 2311.11209 | null |
2023-11-18 | Diverse Shape Completion via Style Modulated Generative Adversarial Networks | Wesley Khademi et.al. | 2311.11184 | null |
2023-11-18 | LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation | Sébastien Henry et.al. | 2311.11171 | null |
2023-11-18 | Invariant-based Mapping of Space During General Motion of an Observer | Juan D. Yepes et.al. | 2311.11130 | null |
2023-11-18 | SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation | Yamei Chen et.al. | 2311.11125 | link |
2023-11-18 | 6G Fresnel Spot Beamfocusing using Large-Scale Metasurfaces: A Distributed DRL-Based Approach | Mehdi Monemi et.al. | 2311.11109 | null |
2023-11-18 | Multiple View Geometry Transformers for 3D Human Pose Estimation | Ziwei Liao et.al. | 2311.10983 | link |
2023-11-18 | Structure-Aware Sparse-View X-ray 3D Reconstruction | Yuanhao Cai et.al. | 2311.10959 | link |
2023-11-18 | Hydrogen Doping Induced $p_x\pm ip_y$ Triplet Superconductivity in Quasi-One-Dimensional K$_2$Cr$_3$As$_3$ | Ming Zhang et.al. | 2311.10942 | null |
2023-11-17 | Path Planning in 3D with Motion Primitives for Wind Energy-Harvesting Fixed-Wing Aircraft | Seung-Keol Ryu et.al. | 2311.10915 | null |
2023-11-17 | Equivariant Neural Operator Learning with Graphon Convolution | Chaoran Cheng et.al. | 2311.10908 | link |
2023-11-17 | OCT2Confocal: 3D CycleGAN based Translation of Retinal OCT Images to Confocal Microscopy | Xin Tian et.al. | 2311.10902 | link |
2023-11-17 | Point Cloud Self-supervised Learning via 3D to Multi-view Masked Autoencoder | Zhimin Chen et.al. | 2311.10887 | link |
2023-11-17 | Pre- to Post-Contrast Breast MRI Synthesis for Enhanced Tumour Segmentation | Richard Osuala et.al. | 2311.10879 | link |
2023-11-17 | Domain Generalization of 3D Object Detection by Density-Resampling | Shuangzhi Li et.al. | 2311.10845 | link |
2023-11-17 | Artificial Intelligence in Fetal Resting-State Functional MRI Brain Segmentation: A Comparative Analysis of 3D UNet, VNet, and HighRes-Net Models | Farzan Vahedifard et.al. | 2311.10844 | null |
2023-11-17 | Simulating X-ray Reverberation in the UV-Emitting Regions of Active Galactic Nuclei Accretion Disks with 3D Multi-Frequency Magnetohydrodynamic Simulations | Amy Secunda et.al. | 2311.10820 | null |
2023-11-17 | SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos | Rohit Jena et.al. | 2311.10812 | null |
2023-11-17 | INSPECT: A Multimodal Dataset for Pulmonary Embolism Diagnosis and Prognosis | Shih-Cheng Huang et.al. | 2311.10798 | null |
2023-11-16 | Collection, Collation, and Comparison of 3D Coronal CME Reconstructions | C. Kay et.al. | 2311.10712 | null |
2023-11-17 | 3D-TexSeg: Unsupervised Segmentation of 3D Texture using Mutual Transformer Learning | Iyyakutti Iyappan Ganapathi et.al. | 2311.10651 | null |
2023-11-17 | Multi-delay arterial spin-labeled perfusion estimation with biophysics simulation and deep learning | Renjiu Hu et.al. | 2311.10640 | null |
2023-11-17 | Predicting the linear response of self-gravitating stellar spheres and discs with LinearResponse.jl | Michael S. Petersen et.al. | 2311.10630 | link |
2023-11-17 | Popularity on the 3D-Euclidean Stable Roommates | Steven Ge et.al. | 2311.10585 | null |
2023-11-17 | Phase Guided Light Field for Spatial-Depth High Resolution 3D Imaging | Geyou Zhang et.al. | 2311.10568 | null |
2023-11-17 | Cross-Modal Search and Exploration of Greek Painted Pottery | Elisabeth Trinkl et.al. | 2311.10567 | null |
2023-11-17 | Mutual Coupling in RIS-Aided Communication: Experimental Validation and Performance Evaluation | Pinjun Zheng et.al. | 2311.10544 | null |
2023-11-17 | Segment Anything Model with Uncertainty Rectification for Auto-Prompting Medical Image Segmentation | Yichi Zhang et.al. | 2311.10529 | null |
2023-11-17 | Mixed Reality UI Adaptations with Inaccurate and Incomplete Objectives | Christoph Albert Johns et.al. | 2311.10466 | null |
2023-11-17 | Evolution of X-ray galaxy Cluster Properties in a Representative Sample (EXCPReS). Optimal binning for temperature profile extraction | C. M. H. Chen et.al. | 2311.10397 | null |
2023-11-17 | Vision meets mmWave Radar: 3D Object Perception Benchmark for Autonomous Driving | Yizhou Wang et.al. | 2311.10261 | null |
2023-11-16 | CV-Attention UNet: Attention-based UNet for 3D Cerebrovascular Segmentation of Enhanced TOF-MRA Images | Syed Farhan Abbas et.al. | 2311.10224 | link |
2023-11-16 | A Chern-Simons approach to self-dual gravity in (2+1)-dimensions and quantisation of Poisson structure | Prince K. Osei et.al. | 2311.10220 | null |
2023-11-16 | Confinement of the Solar Tachocline by a Non-Axisymmetric Dynamo | Loren I. Matilsky et.al. | 2311.10202 | null |
2023-11-16 | MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture | Lincong Feng et.al. | 2311.10123 | null |
2023-11-16 | Slide-SAM: Medical SAM Meets Sliding Window | Quan Quan et.al. | 2311.10121 | link |
2023-11-16 | Improving 3D Synthetic Jet Modeling in a Crossflow | Howard Ho et.al. | 2311.10072 | null |
2023-11-17 | Visual Environment Assessment for Safe Autonomous Quadrotor Landing | Mattia Secchiero et.al. | 2311.10065 | null |
2023-11-16 | Dynamic CBCT Imaging using Prior Model-Free Spatiotemporal Implicit Neural Representation (PMF-STINR) | Hua-Chieh Shao et.al. | 2311.10036 | null |
2023-11-16 | On the Overconfidence Problem in Semantic 3D Mapping | Joao Marcos Correia Marques et.al. | 2311.10018 | link |
2023-11-16 | VertDetect: Fully End-to-End 3D Vertebral Instance Segmentation Model | Geoff Klein et.al. | 2311.09958 | null |
2023-11-16 | DSR-Diff: Depth Map Super-Resolution with Diffusion Model | Yuan Shi et.al. | 2311.09919 | null |
2023-11-16 | Visualizing acoustic levitation using COMSOL Multiphysics | Francisco M. Muñoz-Pérez et.al. | 2311.09913 | null |
2023-11-16 | LIO-EKF: High Frequency LiDAR-Inertial Odometry using Extended Kalman Filters | Yibin Wu et.al. | 2311.09887 | link |
2023-11-16 | A guide to numerical dispersion curve calculations: explanation, interpretation and basic Matlab code | Vanessa Cool et.al. | 2311.09843 | link |
2023-11-18 | EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices | Jingnan Gao et.al. | 2311.09806 | null |
2023-11-16 | Event-based Motion-Robust Accurate Shape Estimation for Mixed Reflectance Scenes | Aniket Dashpute et.al. | 2311.09652 | null |
2023-11-16 | Deep Neural Helmholtz Operators for 3D Elastic Wave Propagation and Inversion | Caifeng Zou et.al. | 2311.09608 | null |
2023-11-16 | 3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation | Dale Decatur et.al. | 2311.09571 | link |
2023-11-16 | Temporal-Aware Refinement for Video-based Human Pose and Shape Recovery | Ming Chen et.al. | 2311.09543 | null |
2023-11-15 | Unique Asymptotics of Steady Ricci Solitons with Symmetry | Zilu Ma et.al. | 2311.09405 | null |
2023-11-15 | Nondestructive, quantitative viability analysis of 3D tissue cultures using machine learning image segmentation | Kylie J. Trettner et.al. | 2311.09354 | null |
2023-11-15 | Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change | Tao Sun et.al. | 2311.09346 | null |
2023-11-15 | Survey of Rigid Body Simulation with Extended Position Based Dynamics | Miguel Luis Nunes Seabra et.al. | 2311.09327 | null |
2023-11-15 | H-Packer: Holographic Rotationally Equivariant Convolutional Neural Network for Protein Side-Chain Packing | Gian Marco Visani et.al. | 2311.09312 | link |
2023-11-15 | From Feast to Famine: A Systematic Study of Accretion onto Oblique Pulsars with 3D GRMHD Simulations | Ariadna Murguia-Berthier et.al. | 2311.09309 | null |
2023-11-15 | On the existence of a very metal-poor disc in the Milky Way | Hanyuan Zhang et.al. | 2311.09294 | null |
2023-11-15 | Single-Image 3D Human Digitization with Shape-Guided Diffusion | Badour AlBahar et.al. | 2311.09221 | null |
2023-11-15 | DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model | Yinghao Xu et.al. | 2311.09217 | null |
2023-11-15 | Digitally reproducing the artistic style of XVI century artist Antonio Campelo in Alegoria Prudencia | Joao Fradinho Oliveira et.al. | 2311.09211 | null |
2023-11-15 | An Open-Source 3D FE Quench Simulation Tool for No-Insulation HTS Pancake Coils | Sina Atalay et.al. | 2311.09177 | null |
2023-11-15 | Radiative Asymptotic Symmetries of 3D Einstein-Maxwell Theory | Jorrit Bosma et.al. | 2311.09156 | null |
2023-11-15 | Electroneutrality breakdown for electrolytes embedded in varying-section nanopores | P. Malgaretti et.al. | 2311.09113 | null |
2023-11-15 | Cross-view and Cross-pose Completion for 3D Human Understanding | Matthieu Armando et.al. | 2311.09104 | null |
2023-11-15 | Effects of rotation and surface forcing on deep stellar convection zones | Petri J. Käpylä et.al. | 2311.09082 | null |
2023-11-15 | Flow reconstruction and particle characterization from inertial Lagrangian tracks | Ke Zhou et.al. | 2311.09076 | null |
2023-11-15 | Automatic cable harness layout routing in a customizable 3D environment | T. Karlsson et.al. | 2311.09061 | null |
2023-11-15 | Self-Annotated 3D Geometric Learning for Smeared Points Removal | Miaowei Wang et.al. | 2311.09029 | null |
2023-11-15 | Variational manifolds for ground states and scarred dynamics of blockade-constrained spin models on two and three dimensional lattices | Joey Li et.al. | 2311.08965 | null |
2023-11-15 | Plasma sheath tailoring by a magnetic field for three-dimensional plasma etching | E. Jüngling et.al. | 2311.08916 | null |
2023-11-15 | Degradation Estimation Recurrent Neural Network with Local and Non-Local Priors for Compressive Spectral Imaging | Yubo Dong et.al. | 2311.08808 | link |
2023-11-15 | Quantification of cell contractile behavior based on non-destructive macroscopic measurement of tension forces on bioprinted hydrogel | Sarah Pragnere et.al. | 2311.08773 | null |
2023-11-15 | AdVENTR: Autonomous Robot Navigation in Complex Outdoor Environments | Kasun Weerakoon et.al. | 2311.08740 | null |
2023-11-15 | Velocity Gradient and Stellar Polarization: Magnetic Field Tomography towards the L1688 Cloud | Tyler Schmaltz et.al. | 2311.08681 | null |
2023-11-15 | High-Precision Fruit Localization Using Active Laser-Camera Scanning: Robust Laser Line Extraction for 2D-3D Transformation | Pengyu Chu et.al. | 2311.08674 | null |
2023-11-15 | Computing the k-th Eigenvalue of Symmetric $H^2$ -Matrices | M. Ridwan Apriansyah et.al. | 2311.08618 | null |
2023-11-15 | Multi-Radar Inertial Odometry for 3D State Estimation using mmWave Imaging Radar | Jui-Te Huang et.al. | 2311.08608 | null |
2023-11-14 | Drivable 3D Gaussian Avatars | Wojciech Zielonka et.al. | 2311.08581 | null |
2023-11-14 | Laterally constrained low-rank seismic data completion via cyclic-shear transform | David Vargas et.al. | 2311.08540 | null |
2023-11-14 | Physical Adversarial Examples for Multi-Camera Systems | Ana Răduţoiu et.al. | 2311.08539 | null |
2023-11-14 | Impact of inlet gas turbulence on the formation, development and breakup of interfacial waves in a two-phase mixing layer | Delin Jiang et.al. | 2311.08517 | null |
2023-11-14 | Real-time topology optimization via learnable mappings | Gabriel Garayalde et.al. | 2311.08473 | null |
2023-11-14 | Chemical homogeneity of wide binary system: An approach from Near-Infrared spectroscopy | Dongwook Lim et.al. | 2311.08461 | null |
2023-11-14 | LocaliseBot: Multi-view 3D object localisation with differentiable rendering for robot grasping | Sujal Vijayaraghavan et.al. | 2311.08438 | null |
2023-11-14 | Finite temperature quantum field theory under the influence of 3D lattices | Lucia Santamaria-Sanz et.al. | 2311.08435 | null |
2023-11-14 | Instant3D: Instant Text-to-3D Generation | Ming Li et.al. | 2311.08403 | link |
2023-11-14 | The Gaia-ESO Survey: 3D dynamics of young groups and clusters from GES and Gaia EDR3 | Nicholas J. Wright et.al. | 2311.08358 | null |
2023-11-14 | Speeding Up Optimization-based Motion Planning through Deep Learning | Johannes Tenhumberg et.al. | 2311.08345 | null |
2023-11-15 | Observation of high-temperature superconductivity in the high-pressure tetragonal phase of La2PrNi2O7-δ | Gang Wang et.al. | 2311.08212 | null |
2023-11-14 | Computational homogenization of higher-order electro-mechanical materials with built-in generalized periodicity conditions | J. Barceló-Mercader et.al. | 2311.08196 | null |
2023-11-14 | What holes in the gas distribution of nearly face-on galaxies can tell us about the host disk parameters: the case of the NGC 628 South-East superbubble | S. Jiménez et.al. | 2311.08178 | null |
2023-11-14 | DynamicSurf: Dynamic Neural RGB-D Surface Reconstruction with an Optimizable Feature Grid | Mirgahney Mohamed et.al. | 2311.08159 | null |
2023-11-14 | Time-efficient combined morphologic and quantitative joint MRI based on clinical image contrasts – An exploratory in-situ study of standardized cartilage defects | Teresa Lemainque et.al. | 2311.08036 | null |
2023-11-14 | ELF: An End-to-end Local and Global Multimodal Fusion Framework for Glaucoma Grading | Wenyun Li et.al. | 2311.08032 | null |
2023-11-14 | CP-SLAM: Collaborative Neural Point-based SLAM System | Jiarui Hu et.al. | 2311.08013 | null |
2023-11-14 | Nuclear spin ratios of deuterated ammonia in prestellar cores. LAsMA observations of H-MM1 and Oph D | Jorma Harju et.al. | 2311.08006 | null |
2023-11-14 | Probable Object Location (POLo) Score Estimation for Efficient Object Goal Navigation | Jiaming Wang et.al. | 2311.07992 | null |
2023-11-14 | Configurable convolutional neural networks for real-time pedestrian-level wind prediction in urban environments | Alfredo Vicente Clemente et.al. | 2311.07985 | null |
2023-11-14 | Roadside LiDAR Assisted Cooperative Localization for Connected Autonomous Vehicles | Yuze Jiang et.al. | 2311.07913 | null |
2023-11-14 | One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion | Minghua Liu et.al. | 2311.07885 | null |
2023-11-13 | Assessing Test-time Variability for Interactive 3D Medical Image Segmentation with Diverse Point Prompts | Hao Li et.al. | 2311.07806 | link |
2023-11-13 | Global Solutions For Systems of Quadratic Nonlinear Schrödinger Equations in 3D | Boyang Su et.al. | 2311.07802 | null |
2023-11-13 | Sensitivity of 3D Convective Urca Simulations to Changes in Urca Reactions | Brendan Boyd et.al. | 2311.07743 | null |
2023-11-13 | Primordial Black Holes and Wormholes from Domain Wall Networks | Yann Gouttenoire et.al. | 2311.07670 | null |
2023-11-15 | Strong size evolution of disc galaxies since z = 1: Readdressing galaxy growth using a physically-motivated size indicator | Fernando Buitrago et.al. | 2311.07656 | null |
2023-11-12 | CLAMP: A Contrastive Language And Molecule Pre-training Network | Neel Redkar et.al. | 2311.07617 | link |
2023-11-12 | ReIDTracker Sea: the technical report of BoaTrack and SeaDronesSee-MOT challenge at MaCVi of WACV24 | Kaer Huang et.al. | 2311.07616 | null |
2023-11-11 | PECoP: Parameter Efficient Continual Pretraining for Action Quality Assessment | Amirhossein Dadashzadeh et.al. | 2311.07603 | link |
2023-11-11 | Polarimetric PatchMatch Multi-View Stereo | Jinyu Zhao et.al. | 2311.07600 | null |
2023-11-13 | Fast Normalized Cross-Correlation for Template Matching with Rotations | José María Almira et.al. | 2311.07561 | null |
2023-11-13 | Scalar susceptibility of a diluted classical XY model | Reece Beattie-Hauser et.al. | 2311.07457 | null |
2023-11-15 | Trajectories and Platoon-forming Algorithm for Intersections with Heterogeneous Autonomous Traffic | P. C. Joshi et.al. | 2311.07435 | null |
2023-11-13 | Supersampling of Data from Structured-light Scanner with Deep Learning | Martin Melicherčík et.al. | 2311.07432 | link |
2023-11-13 | Emittance-preserving acceleration of high-quality positron beams using warm plasma filaments | Severin Diederichs et.al. | 2311.07402 | link |
2023-11-13 | Gate-Compatible Circuit QED in a Three-Dimensional Cavity Architecture | Zezhou Xia et.al. | 2311.07337 | null |
2023-11-13 | Evolution and final fate of massive post-common-envelope binaries | Dandan Wei et.al. | 2311.07278 | null |
2023-11-13 | Multi-task learning for joint weakly-supervised segmentation and aortic arch anomaly classification in fetal cardiac MRI | Paula Ramirez et.al. | 2311.07234 | link |
2023-11-13 | Modelling turbulence in axisymmetric wakes: an application to wind turbine wakes | Majid Bastankhah et.al. | 2311.07225 | null |
2023-11-13 | Quantal effect on the opening angle distribution between the fission fragment’s spins | Guillaume Scamps et.al. | 2311.07182 | null |
2023-11-13 | Liouville type theorems for stationary Navier-Stokes equations with Lebesgue spaces of variable exponent | Diego Chamorro et.al. | 2311.07173 | null |
2023-11-13 | NDDepth: Normal-Distance Assisted Monocular Depth Estimation and Completion | Shuwei Shao et.al. | 2311.07166 | link |
2023-11-13 | Detecting As Labeling: Rethinking LiDAR-camera Fusion in 3D Object Detection | Junjie Huang et.al. | 2311.07152 | link |
2023-11-13 | SpectralGPT: Spectral Foundation Model | Danfeng Hong et.al. | 2311.07113 | null |
2023-11-13 | GazeForensics: DeepFake Detection via Gaze-guided Spatial Inconsistency Learning | Qinlin He et.al. | 2311.07075 | null |
2023-11-13 | $L_0$-Sampler: An $L_{0}$ Model Guided Volume Sampling for NeRF | Liangchen Li et.al. | 2311.07044 | null |
2023-11-13 | PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation | Vikas Dwivedi et.al. | 2311.07002 | null |
2023-11-12 | UAV Formation Optimization for Communication-assisted InSAR Sensing | Mohamed-Amine Lahmeri et.al. | 2311.06959 | null |
2023-11-12 | Matrix-free polynomial preconditioning of saddle point systems using the hyper-power method | Michał Łukasz Mika et.al. | 2311.06926 | null |
2023-11-12 | Utilizing polydispersity in composite fibrous based sound absorbing materials | Quang Vu Tran et.al. | 2311.06819 | null |
2023-11-12 | Dual-Branch Reconstruction Network for Industrial Anomaly Detection with RGB-D Data | Chenyang Bi et.al. | 2311.06797 | null |
2023-11-12 | Rethinking Thorne-Żytkow Object Formation: The Fate of X-ray Binary LMC X-4 and Implications for Ultra-long Gamma-ray Bursts | Tenley Hutchinson-Smith et.al. | 2311.06741 | null |
2023-11-12 | Quantum Griffiths singularity in three-dimensional superconductor to Anderson critical insulator transition | Shichao Qi et.al. | 2311.06710 | null |
2023-11-11 | 3DFusion, A real-time 3D object reconstruction pipeline based on streamed instance segmented data | Xi Sun et.al. | 2311.06659 | null |
2023-11-11 | A 3D Conditional Diffusion Model for Image Quality Transfer – An Application to Low-Field MRI | Seunghoi Kim et.al. | 2311.06631 | link |
2023-11-11 | Surfaces in The Tesseract | Manuel Estévez et.al. | 2311.06596 | null |
2023-11-11 | Swin UNETR++: Advancing Transformer-Based Dense Dose Prediction Towards Fully Automated Radiation Oncology Treatments | Kuancheng Wang et.al. | 2311.06572 | null |
2023-11-11 | CrashCar101: Procedural Generation for Damage Assessment | Jens Parslov et.al. | 2311.06536 | null |
2023-11-11 | Angular-momentum modes in a bosonic condensate trapped in the inverse-square potential | Hidetsugu Sakaguchi et.al. | 2311.06507 | null |
2023-11-11 | Semantic Communication for Cooperative Perception based on Importance Map | Yucheng Sheng et.al. | 2311.06498 | null |
2023-11-11 | FiND: Few-shot three-dimensional image-free confocal focusing on point-like emitters | Swetapadma Sahoo et.al. | 2311.06479 | null |
2023-11-11 | CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer | Haoyu Ma et.al. | 2311.06443 | link |
2023-11-10 | A Computationally Efficient Hybrid Neural Network Architecture for Porous Media: Integrating CNNs and GNNs for Improved Permeability Prediction | Qingqi Zhao et.al. | 2311.06418 | null |
2023-11-10 | Going from 3D to 1D: A one-dimensional approach to common-envelope evolution | V. A. Bronner et.al. | 2311.06332 | null |
2023-11-10 | B2G4: A synthetic data pipeline for the integration of Blender models in Geant4 simulation toolkit | Angel Bueno Rodriguez et.al. | 2311.06327 | null |
2023-11-08 | Synthetic Speaking Children – Why We Need Them and How to Make Them | Muhammad Ali Farooq et.al. | 2311.06307 | null |
2023-11-10 | Semantic-aware Video Representation for Few-shot Action Recognition | Yutao Tang et.al. | 2311.06218 | null |
2023-11-10 | MultiIoT: Towards Large-scale Multisensory Learning for the Internet of Things | Shentong Mo et.al. | 2311.06217 | link |
2023-11-10 | Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model | Jiahao Li et.al. | 2311.06214 | null |
2023-11-10 | ASSIST: Interactive Scene Nodes for Scalable and Realistic Indoor Simulation | Zhide Zhong et.al. | 2311.06211 | null |
2023-11-10 | WInDI: a Warp-Induced Dust Instability in protoplanetary discs | Hossam Aly et.al. | 2311.06182 | link |
2023-11-10 | Converse Flexoelectricity of Low-Dimensional Bismuth Selenite (Bi2Se3) Revealed by Piezoresponse Force Microscopy (PFM) | Qiong Liu et.al. | 2311.06120 | null |
2023-11-10 | Learning-Based Biharmonic Augmentation for Point Cloud Classification | Jiacheng Wei et.al. | 2311.06070 | null |
2023-11-10 | Refining the ONCE Benchmark with Hyperparameter Tuning | Maksim Golyadkin et.al. | 2311.06054 | null |
2023-11-10 | Deep learning for 3D Object Detection and Tracking in Autonomous Driving: A Brief Survey | Yang Peng et.al. | 2311.06043 | null |
2023-11-10 | U3DS $^3$ : Unsupervised 3D Semantic Scene Segmentation | Jiaxu Liu et.al. | 2311.06018 | null |
2023-11-10 | Quantized Distillation: Optimizing Driver Activity Recognition Models for Resource-Constrained Environments | Calvin Tanama et.al. | 2311.05970 | link |
2023-11-10 | Efficient Learning of Fast Inverse Kinematics with Collision Avoidance | Johannes Tenhumberg et.al. | 2311.05938 | null |
2023-11-10 | Nanoscale Analysis of Frozen Water by Atom Probe Tomography Using Graphene Encapsulation and Cryo-Workflows: A Parametric Study | Florant Exertier et.al. | 2311.05923 | null |
2023-11-10 | Essential difference between 2D and 3D from the perspective of real-space renormalization group | Xinliang Lyu et.al. | 2311.05891 | link |
2023-11-10 | Central Angle Optimization for 360-degree Holographic 3D Content | Hakdong Kim et.al. | 2311.05878 | null |
2023-11-09 | Confinement induced three-dimensional trajectories of microswimmers in rectangular channels | Byjesh N. Radhakrishnan et.al. | 2311.05757 | null |
2023-11-07 | OmniVec: Learning robust representations with cross modal sharing | Siddharth Srivastava et.al. | 2311.05709 | null |
2023-11-09 | The Case for Hot-Mode Accretion in Abell 2029 | Deovrat Prasad et.al. | 2311.05704 | null |
2023-11-09 | 3DGAUnet: 3D generative adversarial networks with a 3D U-Net based generator to achieve the accurate and effective synthesis of clinical tumor image data for pancreatic cancer | Yu Shi et.al. | 2311.05697 | null |
2023-11-09 | 3D-QAE: Fully Quantum Auto-Encoding of 3D Point Clouds | Lakshika Rathi et.al. | 2311.05604 | null |
2023-11-09 | A Deep Learning Method for Simultaneous Denoising and Missing Wedge Reconstruction in Cryogenic Electron Tomography | Simon Wiedemann et.al. | 2311.05539 | link |
2023-11-09 | A critical evaluation of the added value of increased horizontal resolution in the hectometric range on the simulation of the mountain boundary layer | Brigitta Goger et.al. | 2311.05528 | null |
2023-11-09 | 3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models | Haibo Yang et.al. | 2311.05464 | link |
2023-11-09 | Control3D: Towards Controllable Text-to-3D Generation | Yang Chen et.al. | 2311.05461 | null |
2023-11-09 | Finite element modelling of immiscible two-phase flow in oil reservoirs | Taofik H. Nassan et.al. | 2311.05414 | null |
2023-11-09 | The Use of Quantitative Metrics and Machine Learning to Predict Radiologist Interpretations of MRI Image Quality and Artifacts | Lucas McCullum et.al. | 2311.05412 | null |
2023-11-09 | SIRE: scale-invariant, rotation-equivariant estimation of artery orientations using graph neural networks | Dieuwertje Alblas et.al. | 2311.05400 | null |
2023-11-09 | A Compact Form of 3D Conformal Block | Chaoming Song et.al. | 2311.05375 | null |
2023-11-09 | Real-time Addressee Estimation: Deployment of a Deep-Learning Model on the iCub Robot | Carlo Mazzola et.al. | 2311.05334 | null |
2023-11-09 | Liquid phase fast electron tomography unravels the true 3D structure of colloidal assemblies | Daniel Arenas Esteban et.al. | 2311.05309 | null |
2023-11-09 | Three-dimensional GRMHD simulations of neutron star jets | Pushpita Das et.al. | 2311.05301 | null |
2023-11-09 | VoxNeRF: Bridging Voxel Representation and Neural Radiance Fields for Enhanced Indoor View Synthesis | Sen Wang et.al. | 2311.05289 | null |
2023-11-09 | Single-shot Tomography of Discrete Dynamic Objects | Ajinkya Kadu et.al. | 2311.05269 | link |
2023-11-09 | ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image | Senthil Purushwalkam et.al. | 2311.05230 | null |
2023-11-09 | Super-Resolution Emulation of Large Cosmological Fields with a 3D Conditional Diffusion Model | Adam Rouhiainen et.al. | 2311.05217 | null |
2023-11-09 | Boundary vertex algebras for 3d $\mathcal{N}=4$ rank-0 SCFTs | Andrea E. V. Ferrari et.al. | 2311.05087 | null |
2023-11-09 | Detecting Phase Synchronization in Latent Variable Subspace: Non-generating Partitions and Symbol Sequence Statistics | Henrique Carvalho de Castro et.al. | 2311.05073 | null |
2023-11-08 | Dynamical Masses for the Hyades Binary System vB 120 | Guillermo Torres et.al. | 2311.05036 | null |
2023-11-08 | Transfer learning from a sparsely annotated dataset of 3D medical images | Gabriel Efrain Humpire-Mamani et.al. | 2311.05032 | link |
2023-11-08 | Reinforcement Learning Generalization for Nonlinear Systems Through Dual-Scale Homogeneity Transformations | Abdel Gafoor Haddad et.al. | 2311.05013 | null |
2023-11-08 | Implicit Neural Representations for Breathing-compensated Volume Reconstruction in Robotic Ultrasound Aorta Screening | Yordanka Velikova et.al. | 2311.04999 | null |
2023-11-08 | Digital Twin-based 3D Map Management for Edge-assisted Device Pose Tracking in Mobile AR | Conghao Zhou et.al. | 2311.04997 | null |
2023-11-08 | Turbulently-Driven Detonation Initiation in Electron-Degenerate Matter with Helium | Gabriel O. Casabona et.al. | 2311.04960 | null |
2023-11-08 | CSAM: A 2.5D Cross-Slice Attention Module for Anisotropic Volumetric Medical Image Segmentation | Alex Ling Yu Hung et.al. | 2311.04942 | link |
2023-11-08 | Linear dichroic x-ray absorption response of Ti-Ti dimers along the $c$ axis in Ti$_2$O$_3$ upon Mg substitution | M. Okawa et.al. | 2311.04814 | null |
2023-11-13 | DualTalker: A Cross-Modal Dual Learning Approach for Speech-Driven 3D Facial Animation | Guinan Su et.al. | 2311.04766 | null |
2023-11-08 | Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant Transformers | Pim de Haan et.al. | 2311.04744 | null |
2023-11-08 | Social Motion Prediction with Cognitive Hierarchies | Wentao Zhu et.al. | 2311.04726 | null |
2023-11-08 | 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud | Jianchao Ci et.al. | 2311.04699 | null |
2023-11-08 | 3D Global climate model of an exo-Venus: a modern Venus-like atmosphere for the nearby super-Earth LP 890-9 c | Diogo Quirino et.al. | 2311.04675 | null |
2023-11-08 | VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering | Linus Franke et.al. | 2311.04634 | link |
2023-11-09 | Rethinking Human Pose Estimation for Autonomous Driving with 3D Event Representations | Xiaoting Yin et.al. | 2311.04591 | link |
2023-11-08 | A 3D generative model of pathological multi-modal MR images and segmentations | Virginia Fernandez et.al. | 2311.04552 | link |
2023-11-08 | PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds | Hao Yang et.al. | 2311.04501 | null |
2023-11-08 | All-Optical Phase Conjugation Using Diffractive Wavefront Processing | Che-Yung Shen et.al. | 2311.04473 | null |
2023-11-08 | LRM: Large Reconstruction Model for Single Image to 3D | Yicong Hong et.al. | 2311.04400 | null |
2023-11-07 | 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features | Chenfeng Xu et.al. | 2311.04391 | null |
2023-11-07 | A new realization of quantum algebras in gauge theory and Ramification in the Langlands program | Nathan Haouzi et.al. | 2311.04367 | null |
2023-11-07 | Accreting Neutron Stars in 3D GRMHD Simulations: Jets, Magnetic Polarity, and the Interchange Slingshot | Kyle Parfrey et.al. | 2311.04291 | null |
2023-11-07 | The Survival and Entrainment of Molecules and Dust in Galactic Winds | Zirui Chen et.al. | 2311.04275 | null |
2023-11-07 | Dose-aware Diffusion Model for 3D Ultra Low-dose PET Imaging | Huidong Xie et.al. | 2311.04248 | null |
2023-11-06 | Toward Planet-Wide Traffic Camera Calibration | Khiem Vuong et.al. | 2311.04243 | null |
2023-11-07 | Dissipation anomaly and anomalous dissipation in incompressible fluid flows | Alexey Cheskidov et.al. | 2311.04182 | null |
2023-11-07 | Exploring Climate with Obliquity in a Variable-eccentricity Earth-like World | M. J. Way et.al. | 2311.04167 | null |
2023-11-07 | High-fidelity 3D Reconstruction of Plants using Neural Radiance Field | Kewei Hu et.al. | 2311.04154 | null |
2023-11-07 | Improved Topological Preservation in 3D Axon Segmentation and Centerline Detection using Geometric Assessment-driven Topological Smoothing (GATS) | Nina I. Shamsi et.al. | 2311.04116 | null |
2023-11-07 | SPIRAL: An Efficient Algorithm for the Integration of the Equation of Rotational Motion | Carlos Andrés del Valle et.al. | 2311.04106 | link |
2023-11-07 | DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding | Kehinde Ajayi et.al. | 2311.04098 | link |
2023-11-07 | Image-Pointcloud Fusion based Anomaly Detection using PD-REAL Dataset | Jianjian Qin et.al. | 2311.04095 | null |
2023-11-07 | mmFUSION: Multimodal Fusion for 3D Objects Detection | Javed Ahmad et.al. | 2311.04058 | null |
2023-11-07 | 3D EAGAN: 3D edge-aware attention generative adversarial network for prostate segmentation in transrectal ultrasound images | Mengqing Liu et.al. | 2311.04049 | null |
2023-11-07 | Multimodal extended reality applications offer benefits for volumetric biomedical image analysis in research and medicine | Kathrin Krieger et.al. | 2311.03986 | null |
2023-11-07 | Adaptive 3D Geometry-based Stochastic Channel Prediction for 3D DL Selection | Mervat Zarour et.al. | 2311.03975 | null |
2023-11-07 | CeCNN: Copula-enhanced convolutional neural networks in joint prediction of refraction error and axial length based on ultra-widefield fundus images | Chong Zhong et.al. | 2311.03967 | link |
2023-11-07 | Spectral functions of the strongly interacting 3D Fermi gas | Christian H. Johansen et.al. | 2311.03953 | null |
2023-11-07 | Dark-Field X-ray Microscopy for 2D and 3D imaging of Microstructural Dynamics at the European X-ray Free Electron Laser | Sara J. Irvine et.al. | 2311.03916 | null |
2023-11-07 | Toward ground-truth optical coherence tomography via three-dimensional unsupervised deep learning processing and data | Renxiong Wu et.al. | 2311.03887 | null |
2023-11-07 | Multiderivative time integration methods preserving nonlinear functionals via relaxation | Hendrik Ranocha et.al. | 2311.03883 | null |
2023-11-07 | Chiral Soliton Lattice turns into 3D crystal | Geraint W. Evans et.al. | 2311.03880 | null |
2023-11-07 | Terrain Recognition and Contact Force Estimation through a Sensorized Paw for Legged Robots | Aleksander Vangen et.al. | 2311.03855 | link |
2023-11-07 | An AFM-based approach for quantification of guest particle deformation during mechano-fusion | Phillip Gräfensteiner et.al. | 2311.03851 | null |
2023-11-07 | Smallest Enclosing Sphere in 3D – Particle Swarm Optimization Approach | Netzer Moriya et.al. | 2311.03843 | null |
2023-11-07 | 3DifFusionDet: Diffusion Model for 3D Object Detection with Robust LiDAR-Camera Fusion | Xinhao Xiang et.al. | 2311.03742 | null |
2023-11-07 | Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM | Seongwook Yoon et.al. | 2311.03722 | null |
2023-11-07 | Accelerating the Galerkin Reduced-Order Model with the Tensor Decomposition for Turbulent Flows | Ping-Hsuan Tsai et.al. | 2311.03694 | null |
2023-11-07 | Impact of the Ce $4f$ states in the electronic structure of the intermediate-valence superconductor CeIr$_3$ | Shin-ichi Fujimori et.al. | 2311.03640 | null |
2023-11-07 | FusionViT: Hierarchical 3D Object Detection via LiDAR-Camera Vision Transformer Fusion | Xinhao Xiang et.al. | 2311.03620 | null |
2023-11-06 | Modeling the Reverberation Response of the Broad Line Region in Active Galactic Nuclei | Sara Rosborough et.al. | 2311.03590 | null |
2023-11-06 | Afterglows from binary neutron star post-merger systems embedded in AGN disks | Adithan Kathirgamaraju et.al. | 2311.03571 | null |
2023-11-06 | Predicting Age from White Matter Diffusivity with Residual Learning | Chenyu Gao et.al. | 2311.03500 | null |
2023-11-06 | Dimensionality crossover to 2D vestigial nematicity from 3D zigzag antiferromagnetism in an XY-type honeycomb van der Waals magnet | Zeliang Sun et.al. | 2311.03493 | null |
2023-11-06 | Osprey: Multi-Session Autonomous Aerial Mapping with LiDAR-based SLAM and Next Best View Planning | Rowan Border et.al. | 2311.03484 | null |
2023-11-06 | Homogeneous crystallization in four-dimensional Lennard-Jones liquids | Robert S. Hoy et.al. | 2311.03465 | null |
2023-11-06 | Stable Envelopes, Vortex Moduli Spaces, and Verma Modules | Spencer Tamagni et.al. | 2311.03462 | null |
2023-11-06 | Nucleosynthetic Analysis of Long-Term Three-Dimensional Core-Collapse Supernova Simulations | Tianshu Wang et.al. | 2311.03446 | null |
2023-11-06 | A Generative Neural Network Approach for 3D Multi-Criteria Design Generation and Optimization of an Engine Mount for an Unmanned Air Vehicle | Christoph Petroll et.al. | 2311.03414 | null |
2023-11-06 | Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences | Zador Pataki et.al. | 2311.03345 | null |
2023-11-09 | A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation | Qitao Zhao et.al. | 2311.03312 | null |
2023-11-06 | Constraints on the accretion properties of quasi-periodic erupters from GRMHD simulations | Anna Chashkina et.al. | 2311.03296 | null |
2023-11-06 | LDM3D-VR: Latent Diffusion Model for 3D VR | Gabriela Ben Melech Stan et.al. | 2311.03226 | null |
2023-11-06 | Animating NeRFs from Texture Space: A Framework for Pose-Dependent Rendering of Human Performances | Paul Knoll et.al. | 2311.03140 | null |
2023-11-06 | Charge and energy deposition in the McDIPPER framework | Oscar Garcia-Montero et.al. | 2311.03125 | null |
2023-11-06 | A survey and classification of face alignment methods based on face models | Jagmohan Meher et.al. | 2311.03082 | link |
2023-11-06 | Reconfigurable, Transformable Soft Pneumatic Actuator with Tunable 3D Deformations for Dexterous Soft Robotics Applications | Dickson Chiu Yu Wong et.al. | 2311.03032 | null |
2023-11-06 | A relaxation approach to the minimisation of the neo-Hookean energy in 3D | Marco Barchiesi et.al. | 2311.02952 | null |
2023-11-06 | Voltage tunable quantum control of extraordinary optical transmission in the visible regime | Hira Asif et.al. | 2311.02949 | null |
2023-11-06 | Microwave generation and vortex jets in superconductor nanotubes | Igor Bogush et.al. | 2311.02946 | null |
2023-11-06 | Marker-Based Localisation System Using an Active PTZ Camera and CNN-Based Ellipse Detection | Xueyan Oh et.al. | 2311.02937 | null |
2023-11-06 | Auto-ICell: An Accessible and Cost-Effective Integrative Droplet Microfluidic System for Real-Time Single-Cell Morphological and Apoptotic Analysis | Yuanyuan Wei et.al. | 2311.02927 | null |
2023-11-06 | Monocular UAV Localisation with Deep Learning and Uncertainty Propagation | Xueyan Oh et.al. | 2311.02908 | null |
2023-11-06 | Human as Points: Explicit Point-based 3D Human Reconstruction from Single-view RGB Images | Yingzhi Tang et.al. | 2311.02892 | link |
2023-11-06 | OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data | Shiyang Lu et.al. | 2311.02873 | link |
2023-11-06 | FocusTune: Tuning Visual Localization through Focus-Guided Sampling | Son Tung Nguyen et.al. | 2311.02872 | link |
2023-11-06 | Temporal Shift – Multi-Objective Loss Function for Improved Anomaly Fall Detection | Stefan Denkovski et.al. | 2311.02863 | null |
2023-11-06 | Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video | Yanqin Jiang et.al. | 2311.02848 | null |
2023-11-08 | Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs | Wenke Xia et.al. | 2311.02847 | link |
2023-11-09 | SemanticTopoLoop: Semantic Loop Closure With 3D Topological Graph Based on Quadric-Level Object Map | Zhenzhong Cao et.al. | 2311.02831 | null |
2023-11-06 | InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image | Jianhui Li et.al. | 2311.02826 | link |
2023-11-06 | Mesh Neural Cellular Automata | Ehsan Pajouheshgar et.al. | 2311.02820 | null |
2023-11-06 | Safe-VLN: Collision Avoidance for Vision-and-Language Navigation of Autonomous Robots Operating in Continuous Environments | Lu Yue et.al. | 2311.02817 | null |
2023-11-05 | Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead | Yunkang Cao et.al. | 2311.02782 | link |
2023-11-05 | MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis | Xuqian Ren et.al. | 2311.02778 | null |
2023-11-05 | Actions on the quiver – Discrete quotients on the Coulomb branch | Amihay Hanany et.al. | 2311.02773 | null |
2023-11-05 | Fast Sparse 3D Convolution Network with VDB | Fangjun Zhou et.al. | 2311.02762 | link |
2023-11-05 | Charging solid partitions | Dmitry Galakhov et.al. | 2311.02751 | null |
2023-11-05 | Fast Point-cloud to Mesh Reconstruction for Deformable Object Tracking | Elham Amin Mansour et.al. | 2311.02749 | null |
2023-11-05 | Complete set of bounds for the technical moduli in 3D anisotropic elasticity | Paolo Vannucci et.al. | 2311.02712 | null |
2023-11-05 | A Generative Multi-Resolution Pyramid and Normal-Conditioning 3D Cloth Draping | Hunor Laczkó et.al. | 2311.02700 | link |
2023-11-05 | Octavius: Mitigating Task Interference in MLLMs via MoE | Zeren Chen et.al. | 2311.02684 | null |
2023-11-05 | Clustered helical vortices for 3D incompressible Euler equation in infinite cylinders | Daomin Cao et.al. | 2311.02676 | null |
2023-11-05 | PotholeGuard: A Pothole Detection Approach by Point Cloud Semantic Segmentation | Sahil Nawale et.al. | 2311.02641 | null |
2023-11-05 | Super-resolved snapshot hyperspectral imaging of solid-state quantum emitters for high-throughput integrated quantum technologies | Shunfa Liu et.al. | 2311.02626 | null |
2023-11-05 | Bulk-boundary-transport correspondence of the second-order topological insulators | Yuxiong Long^§ et.al. | 2311.02619 | null |
2023-11-05 | Deep Learning-based 3D Point Cloud Classification: A Systematic Survey and Outlook | Huang Zhang et.al. | 2311.02608 | null |
2023-11-05 | Optimizing Implicit Neural Representations from Point Clouds via Energy-Based Models | Ryutaro Yamauchi et.al. | 2311.02601 | null |
2023-11-05 | High-resolution 3D phase-contrast imaging beyond the depth of field limit via ptychographic multi-slice electron tomography | Andrey Romanov et.al. | 2311.02580 | null |
2023-11-05 | Multi-Agent 3D Map Reconstruction and Change Detection in Microgravity with Free-Flying Robots | Holly Dinkel et.al. | 2311.02558 | link |
2023-11-05 | IPVNet: Learning Implicit Point-Voxel Features for Open-Surface 3D Reconstruction | Mohammad Samiul Arshad et.al. | 2311.02552 | link |
2023-11-05 | 3D-Aware Talking-Head Video Motion Transfer | Haomiao Ni et.al. | 2311.02549 | null |
2023-11-04 | Neural Network Reconstruction of the Left Atrium using Sparse Catheter Paths | Alon Baram et.al. | 2311.02488 | null |
2023-11-04 | Evidence for Low-Level Dynamical Excitation in Near-Resonant Exoplanet Systems | Malena Rice et.al. | 2311.02478 | null |
2023-11-04 | Effect of streaks on hypersonic boundary layer instability | Clément Caillaud et.al. | 2311.02463 | null |
2023-11-04 | SPHEAR: Spherical Head Registration for Complete Statistical 3D Modeling | Eduard Gabriel Bazavan et.al. | 2311.02461 | null |
2023-11-04 | Effect of W alloying on the electronic structure, phase stability and thermoelectric properties of epitaxial CrN films | Niraj Kumar Singh et.al. | 2311.02453 | null |
2023-11-04 | Light sheet and light field microscopy based on scanning Bessel beam illumination | Chuhui Wang et.al. | 2311.02441 | null |
2023-11-04 | Backward Uniqueness for 3D Navier-Stokes Equations with Non-trivial Final Data and Applications | Zhen Lei et.al. | 2311.02429 | null |
2023-11-04 | P2O-Calib: Camera-LiDAR Calibration Using Point-Pair Spatial Occlusion Relationship | Su Wang et.al. | 2311.02413 | null |
2023-11-04 | LISNeRF Mapping: LiDAR-based Implicit Mapping via Semantic Neural Fields for Large-Scale 3D Scenes | Jianyuan Zhang et.al. | 2311.02313 | null |
2023-11-04 | 3D seismic survey design by maximizing the spectral gap | Yijun Zhang et.al. | 2311.02298 | null |
2023-11-04 | SMIwiz: An integrated toolbox for multidimensional seismic modelling and imaging | Pengliang Yang et.al. | 2311.02293 | link |
2023-11-04 | A Physics based Machine Learning Model to characterize Room Temperature Semiconductor Detectors in 3D | Srutarshi Banerjee et.al. | 2311.02290 | null |
2023-11-04 | Small-scale Dynamo in Cool Stars III. Changes in the photospheres of F3V to M0V stars | Tanayveer S. Bhatia et.al. | 2311.02286 | null |
2023-11-03 | All Hurwitz Algebras from 3D Geometric Algebras | Daniele Corradetti et.al. | 2311.02269 | null |
2023-11-03 | From 3D to 5D tracking: SMX ASIC-based Double-Sided Micro-Strip detectors for comprehensive space, time, and energy measurements | M. Teklishyn et.al. | 2311.02140 | null |
2023-11-03 | Enhancing Monocular Height Estimation from Aerial Images with Street-view Images | Xiaomou Hou et.al. | 2311.02121 | null |
2023-11-03 | EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision | Jiawei Yang et.al. | 2311.02077 | null |
2023-11-03 | Occlusion-Aware 2D and 3D Centerline Detection for Urban Driving via Automatic Label Generation | David Paz et.al. | 2311.02044 | null |
2023-11-03 | Gravitational waves radiated from axion string-wall networks | Yang Li et.al. | 2311.02011 | null |
2023-11-03 | Sharp Global Well-posedness and Scattering of the Boltzmann Equation | Xuwen Chen et.al. | 2311.02008 | null |
2023-11-03 | Towards Unsupervised Object Detection From LiDAR Point Clouds | Lunjun Zhang et.al. | 2311.02007 | null |
2023-11-06 | Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation | Shichao Dong et.al. | 2311.01989 | null |
2023-11-03 | Generalization of Graph-Based Active Learning Relaxation Strategies Across Materials | Xiaoxiao Wang et.al. | 2311.01987 | link |
2023-11-03 | End-to-End assessment of AR-assisted neurosurgery systems | Mahdi Bagheri et.al. | 2311.01912 | null |
2023-11-03 | 3-Dimensional residual neural architecture search for ultrasonic defect detection | Shaun McKnight et.al. | 2311.01867 | null |
2023-11-03 | Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields | Jianxiong Shen et.al. | 2311.01815 | null |
2023-11-03 | Core-collapse supernova inside the core of a young massive star cluster: 3D MHD simulations | D. V. Badmaev et.al. | 2311.01789 | null |
2023-11-03 | MixCon3D: Synergizing Multi-View and Cross-Modal Contrastive Learning for Enhancing 3D Representation | Yipeng Gao et.al. | 2311.01734 | link |
2023-11-03 | EXIM: A Hybrid Explicit-Implicit Representation for Text-Guided 3D Shape Generation | Zhengzhe Liu et.al. | 2311.01714 | link |
2023-11-03 | Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Haibao Yu et.al. | 2311.01682 | link |
2023-11-03 | INeAT: Iterative Neural Adaptive Tomography | Bo Xiong et.al. | 2311.01653 | null |
2023-11-02 | Vertical Decomposition in 3D and 4D with Applications to Line Nearest-Neighbor Searching in 3D | Pankaj K. Agarwal et.al. | 2311.01597 | null |
2023-11-02 | A time splitting spectral method for the Klein-Gordon-Maxwell system | Peter Allmer et.al. | 2311.01583 | null |
2023-11-02 | Numerical Solution of the Non-polynomial Schrödinger Equation | Peter Allmer et.al. | 2311.01576 | null |
2023-11-02 | Improving Lesion Segmentation in FDG-18 Whole-Body PET/CT scans using Multilabel approach: AutoPET II challenge | Gowtham Krishnan Murugesan et.al. | 2311.01574 | null |
2023-11-02 | MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory | Enxu Li et.al. | 2311.01556 | null |
2023-11-02 | Expanded stability of layered SnSe-PbSe alloys and evidence of displacive phase transformation from rocksalt in heteroepitaxial thin films | Pooja D. Reddy et.al. | 2311.01514 | null |
2023-11-02 | UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation | Yuwen Xiong et.al. | 2311.01448 | null |
2023-11-02 | CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation | Jingkang Wang et.al. | 2311.01447 | null |
2023-11-02 | Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation | Jay Sarva et.al. | 2311.01446 | null |
2023-11-02 | Checkerboard CFT | Mikhail Alfimov et.al. | 2311.01437 | null |
2023-11-04 | CenterRadarNet: Joint 3D Object Detection and Tracking Framework using 4D FMCW Radar | Jen-Hao Cheng et.al. | 2311.01423 | null |
2023-11-03 | Transverse Momentum Distributions from Lattice QCD without Wilson Lines | Yong Zhao et.al. | 2311.01391 | null |
2023-11-02 | Millimeter-scale exfoliation of hBN with tunable flake thickness | Amy S. McKeown-Green et.al. | 2311.01387 | null |
2023-11-02 | Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors | Gabriele M. Caddeo et.al. | 2311.01380 | link |
2023-11-02 | Analysis of tidal flows through the Strait of Gibraltar using Dynamic Mode Decomposition | Sathsara Dias et.al. | 2311.01377 | link |
2023-11-02 | Look at Robot Base Once: Hand-Eye Calibration with Point Clouds of Robot Base Leveraging Learning-Based 3D Vision | Leihui Li et.al. | 2311.01335 | link |
2023-11-02 | Quasi Two-dimensional Vortex Matter in ThH $_{10}$ Superhydride | Andrey V. Sadakov et.al. | 2311.01318 | null |
2023-11-02 | Hybrid-Fusion Transformer for Multisequence MRI | Jihoon Cho et.al. | 2311.01308 | link |
2023-11-02 | Joint 3D Shape and Motion Estimation from Rolling Shutter Light-Field Images | Hermes McGriff et.al. | 2311.01292 | link |
2023-11-02 | High-Quality Animatable Dynamic Garment Reconstruction from Monocular Videos | Xiongzheng Li et.al. | 2311.01214 | null |
2023-11-02 | Cross-Modal Information-Guided Network using Contrastive Learning for Point Cloud Registration | Yifan Xie et.al. | 2311.01202 | link |
2023-11-02 | An Optimal Medium for Haptics | Thomas Daunizeau et.al. | 2311.01179 | null |
2023-11-02 | Cheating Depth: Enhancing 3D Surface Anomaly Detection via Depth Simulation | Vitjan Zavrtanik et.al. | 2311.01117 | link |
2023-11-02 | The Operator Product Expansion for Radial Lattice Quantization of 3D $φ^4$ Theory | Venkitesh Ayyar et.al. | 2311.01100 | null |
2023-11-02 | Multi-agent robotic systems and exploration algorithms: Applications for data collection in construction sites | Samuel A. Prieto et.al. | 2311.01078 | null |
2023-11-02 | Novel View Synthesis from a Single RGBD Image for Indoor Scenes | Congrui Hetang et.al. | 2311.01065 | null |
2023-11-02 | LaughTalk: Expressive 3D Talking Head Generation with Laughter | Kim Sung-Bin et.al. | 2311.00994 | null |
2023-11-02 | M&M3D: Multi-Dataset Training and Efficient Network for Multi-view 3D Object Detection | Hang Zhang et.al. | 2311.00986 | link |
2023-11-02 | MAAIG: Motion Analysis And Instruction Generation | Wei-Hsin Yeh et.al. | 2311.00980 | null |
2023-11-02 | Spontaneous-Ordering Platoon Control for Multirobot Path Navigation Using Guiding Vector Fields | Bin-Bin Hu et.al. | 2311.00976 | null |
2023-11-02 | Effect of Confinement and Topology: 2-TIPS vs MIPS | Nayana Venkatareddy et.al. | 2311.00929 | null |
2023-11-02 | Quatro++: Robust Global Registration Exploiting Ground Segmentation for Loop Closing in LiDAR SLAM | Hyungtae Lim et.al. | 2311.00928 | null |
2023-11-01 | EMPOT: partial alignment of density maps and rigid body fitting using unbalanced Gromov-Wasserstein divergence | Aryan Tajmir Riahi et.al. | 2311.00850 | link |
2023-11-01 | Using the HOMFLY-PT polynomial to compute knot types | Eric J. Rawdon et.al. | 2311.00817 | null |
2023-11-01 | Shaking a container full of perfect liquid; a tractable case, a torus shell, exhibits a virtual wall | J. H. Hannay et.al. | 2311.00576 | null |
2023-11-01 | Accelerated particle beams in a 3D simulation of the quiet Sun. Effects of advanced beam propagation modelling | L. Frogner et.al. | 2311.00490 | null |
2023-11-01 | DEFN: Dual-Encoder Fourier Group Harmonics Network for Three-Dimensional Macular Hole Reconstruction with Stochastic Retinal Defect Augmentation and Dynamic Weight Composition | Xingru Huang et.al. | 2311.00483 | link |
2023-11-01 | Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture | Yixin Chen et.al. | 2311.00457 | null |
2023-11-01 | Artificial Intelligence-Facilitated Online Adaptive Proton Therapy Using Pencil Beam Scanning Proton Therapy | Hongying Feng et.al. | 2311.00448 | null |
2023-11-01 | Neural Implicit Field Editing Considering Object-environment Interaction | Zhihong Zeng et.al. | 2311.00425 | null |
2023-11-01 | Fixation-based Self-calibration for Eye Tracking in VR Headsets | Ryusei Uramune et.al. | 2311.00391 | null |
2023-11-01 | NeuralGF: Unsupervised Point Normal Estimation by Learning Neural Gradient Function | Qing Li et.al. | 2311.00389 | link |
2023-11-01 | MolecularWebXR: Multiuser discussions about chemistry and biology in immersive and inclusive VR | Fabio J. Cortes Rodriguez et.al. | 2311.00385 | null |
2023-11-01 | Space Narrative: Generating Images and 3D Scenes of Chinese Garden from Text using Deep Learning | Jiaxi Shi1 et.al. | 2311.00339 | null |
2023-11-01 | From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities | Md Farhan Ishmam et.al. | 2311.00308 | null |
2023-11-01 | Adaptive Latent Diffusion Model for 3D Medical Image to Image Translation: Multi-modal Magnetic Resonance Imaging Study | Jonghun Kim et.al. | 2311.00265 | link |
2023-10-31 | Mass and Angular Momentum Transport in a Gravitationally Unstable Protoplanetary Disk with Improved 3D Radiative Hydrodynamics | Thomas Y. Steiman-Cameron et.al. | 2311.00175 | null |
2023-10-31 | RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR | Yiwen Shao et.al. | 2311.00146 | null |
2023-10-31 | Non-destructive tomographic nanoscale imaging of ferroelectric domain walls | Jiali He et.al. | 2311.00139 | null |
2023-10-31 | Joint Depth Prediction and Semantic Segmentation with Multi-View SAM | Mykhailo Shvets et.al. | 2311.00134 | null |
2023-10-31 | Deep Compressed Learning for 3D Seismic Inversion | Maayan Gelboim et.al. | 2311.00107 | null |
2023-10-31 | Wavelet Based Statistics for Enhanced 21cm EoR Parameter Constraints | Ian Hothi et.al. | 2311.00036 | link |
2023-10-31 | Magnetorotational dynamo can generate large-scale vertical magnetic fields in 3D GRMHD simulations of accreting black holes | Jonatan Jacquemin-Ide et.al. | 2311.00034 | null |
2023-10-31 | Neutrino trapping and out-of-equilibrium effects in binary neutron star merger remnants | Pedro Luis Espino et.al. | 2311.00031 | null |
2023-10-31 | How to Turn Jets into Cylinders near Supermassive Black Holes in 3D GRMHD Simulations | Valeriia Rohoza et.al. | 2311.00018 | null |
2023-10-31 | HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception | Junkun Yuan et.al. | 2310.20695 | link |
2023-10-31 | StairNet: Visual Recognition of Stairs for Human-Robot Locomotion | Andrew Garrett Kurbis et.al. | 2310.20666 | null |
2023-10-31 | Higher-order reductions of the Mikhalev system | E. V. Ferapontov et.al. | 2310.20528 | null |
2023-10-31 | Large Language Model Can Interpret Latent Space of Sequential Recommender | Zhengyi Yang et.al. | 2310.20487 | link |
2023-10-31 | SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark | Zhengdi Yu et.al. | 2310.20436 | null |
2023-10-31 | Thermal-Infrared Remote Target Detection System for Maritime Rescue based on Data Augmentation with 3D Synthetic Data | Sungjin Cheong et.al. | 2310.20412 | null |
2023-11-02 | Two for One – Combined Morphologic and Quantitative Knee Joint MRI Using a Versatile Turbo Spin-Echo Platform | Teresa Lemainque et.al. | 2310.20362 | null |
2023-10-31 | Muscle volume quantification: guiding transformers with anatomical priors | Louise Piecuch et.al. | 2310.20355 | null |
2023-10-31 | GACE: Geometry Aware Confidence Enhancement for Black-Box 3D Object Detectors on LiDAR-Data | David Schinagl et.al. | 2310.20319 | link |
2023-10-31 | The theory of symmetric tensor field with boundary: Kac-Moody algebras in linearized gravity | Erica Bertolini et.al. | 2310.20303 | null |
2023-10-31 | Annotator: A Generic Active Learning Baseline for LiDAR Semantic Segmentation | Binhui Xie et.al. | 2310.20293 | null |
2023-11-03 | Contrast-agent-induced deterministic component of CT-density in the abdominal aorta during routine angiography: proof of concept study | Maria R. Kodenko et.al. | 2310.20243 | null |
2023-10-31 | Breathing Life into Faces: Speech-driven 3D Facial Animation with Natural Head Pose and Detailed Shape | Wei Zhao et.al. | 2310.20240 | null |
2023-10-31 | HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds | Gang Zhang et.al. | 2310.20234 | link |
2023-10-31 | Reconstructing Human Pose from Inertial Measurements: A Generative Model-based Compressive Sensing Approach | Nguyen Quang Hieu et.al. | 2310.20228 | null |
2023-10-31 | Refined Equivalent Pinhole Model for Large-scale 3D Reconstruction from Spaceborne CCD Imagery | Hong Danyang et.al. | 2310.20117 | null |
2023-10-30 | Does the $ν_{\max}$ scaling relation depend on metallicity? Insights from 3D convection simulations | Yixiao Zhou et.al. | 2310.20050 | null |
2023-10-30 | Computational Design of Magnetic Soft Shape-Forming Catheters using the Material Point Method | Joshua Davy et.al. | 2310.19983 | null |
2023-10-30 | Dynamic, viscoelasticity-driven shape change of elastomer bilayers | Wenya Shu et.al. | 2310.19954 | null |
2023-10-30 | Spectator-model studies for spin-dependent gluon TMD PDFs at the LHC and EIC | Alessandro Bacchetta et.al. | 2310.19916 | null |
2023-10-30 | GPCR-BERT: Interpreting Sequential Design of G Protein Coupled Receptors Using Protein Language Models | Seongwon Kim et.al. | 2310.19915 | null |
2023-10-30 | Quantum Monte Carlo Simulation of the 3D Ising Transition on the Fuzzy Sphere | Johannes S. Hofmann et.al. | 2310.19880 | null |
2023-10-30 | Metric Flows with Neural Networks | James Halverson et.al. | 2310.19870 | null |
2023-10-30 | CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models | Ziyang Yuan et.al. | 2310.19784 | null |
2023-10-30 | From Instability to Singularity Formation in Incompressible Fluids | Tarek M. Elgindi et.al. | 2310.19780 | null |
2023-10-31 | Promise:Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models | Hao Li et.al. | 2310.19721 | link |
2023-10-30 | Distributed multi-UAV shield formation based on virtual surface constraints | María Guinaldo et.al. | 2310.19681 | null |
2023-10-30 | Isolating the Nonlinear Optical Response of a MoS $_2$ Monolayer under Extreme Screening of a Metal Substrate | Tao Yang et.al. | 2310.19657 | null |
2023-10-30 | Automatic 3D modeling by combining SBFEM and transfinite element shape functions | Hauke Gravenkamp et.al. | 2310.19646 | null |
2023-10-30 | RayDF: Neural Ray-surface Distance Fields with Multi-view Consistency | Zhuoman Liu et.al. | 2310.19629 | link |
2023-10-30 | Self-assembled physical unclonable function labels based on plasmonic coupling | Mihir Dass et.al. | 2310.19587 | null |
2023-10-30 | A Perceptual Shape Loss for Monocular 3D Face Reconstruction | Christopher Otto et.al. | 2310.19580 | null |
2023-10-30 | Generating Context-Aware Natural Answers for Questions in 3D Scenes | Mohammed Munzer Dwedari et.al. | 2310.19516 | link |
2023-10-30 | Inverse folding for antibody sequence design using deep learning | Frédéric A. Dreyer et.al. | 2310.19513 | null |
2023-10-30 | Dynamic Gaussian Splatting from Markerless Motion Capture can Reconstruct Infants Movements | R. James Cotton et.al. | 2310.19441 | null |
2023-10-30 | Revision of Analytical Properties of Reaction Amplitude near Thresholds Using the Example of Muon-Induced Prompt Fission | F. F. Karpeshin et.al. | 2310.19421 | null |
2023-10-31 | Text-to-3D with Classifier Score Distillation | Xin Yu et.al. | 2310.19415 | null |
2023-10-30 | Computing decay widths of autoionizing Rydberg states with complex-variable coupled cluster theory | Joel Creutzberg et.al. | 2310.19377 | null |
2023-10-30 | FetusMapV2: Enhanced Fetal Pose Estimation in 3D Ultrasound | Chaoyu Chen et.al. | 2310.19293 | null |
2023-11-01 | rTsfNet: a DNN model with Multi-head 3D Rotation and Time Series Feature Extraction for IMU-based Human Activity Recognition | Yu Enokibori et.al. | 2310.19283 | null |
2023-10-30 | THz transition radiation of electron bunch laser-accelerated in long-scale near-critical density plasmas | D A Gorlova et.al. | 2310.19282 | null |
2023-10-30 | Prediction of Effective Elastic Moduli of Rocks using Graph Neural Networks | Jaehong Chung et.al. | 2310.19274 | link |
2023-10-29 | Popularity, face and voice: Predicting and interpreting livestreamers’ retail performance using machine learning techniques | Xiong Xiong et.al. | 2310.19200 | null |
2023-10-29 | Immersive 3D Simulator for Drone-as-a-Service | Jiamin Lin et.al. | 2310.19199 | null |
2023-10-29 | 3DMiner: Discovering Shapes from Large-Scale Unannotated Image Datasets | Ta-Ying Cheng et.al. | 2310.19188 | null |
2023-10-29 | Subjective Quality Evaluation of Point Clouds Using a Head Mounted Display | Joao Prazeres et.al. | 2310.19179 | null |
2023-10-29 | Predicting recovery following stroke: deep learning, multimodal data and feature selection using explainable AI | Adam White et.al. | 2310.19174 | null |
2023-10-29 | Bridging Scales in Black Hole Accretion and Feedback: Magnetized Bondi Accretion in 3D GRMHD | Hyerin Cho et.al. | 2310.19135 | null |
2023-10-29 | Prediction of local elasto-plastic stress and strain fields in a two-phase composite microstructure using a deep convolutional neural network | Indrashish Saha et.al. | 2310.19128 | null |
2023-10-29 | Dynamic V2X Autonomous Perception from Road-to-Vehicle Vision | Jiayao Tan et.al. | 2310.19113 | null |
2023-10-29 | TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding | Shuhuai Ren et.al. | 2310.19060 | link |
2023-10-31 | DynPoint: Dynamic Neural Point For View Synthesis | Kaichen Zhou et.al. | 2310.18999 | link |
2023-10-29 | Investigation of correlation effects in FeSe and FeTe by LDA + U method | H. Lohani et.al. | 2310.18994 | null |
2023-10-29 | Predicting RNA-small molecule binding sites by 3D structure | Nan Pan et.al. | 2310.18985 | null |
2023-10-29 | Band Structure of Topological Insulator BiSbTe1.25Se1.75 | H. Lohani et.al. | 2310.18922 | null |
2023-10-29 | TiV-NeRF: Tracking and Mapping via Time-Varying Representation with Dynamic Neural Radiance Fields | Chengyao Duan et.al. | 2310.18917 | null |
2023-10-29 | Macroscopic emulation of microscopic magnetic particle system | Viesturs Spūlis et.al. | 2310.18892 | null |
2023-10-29 | Dynamo-Depth: Fixing Unsupervised Depth Estimation for Dynamical Scenes | Yihong Sun et.al. | 2310.18887 | null |
2023-10-28 | INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings | Amirhossein Kazerouni et.al. | 2310.18846 | link |
2023-10-28 | A thousand fermions in a 3D harmonic trap via Monte Carlo simulations | Siu A. Chin et.al. | 2310.18818 | null |
2023-10-28 | Hierarchical assembly is more robust than egalitarian assembly in synthetic capsids | Wei-Shao Wei et.al. | 2310.18790 | null |
2023-10-28 | CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud Data | Taiki Miyanishi et.al. | 2310.18773 | link |
2023-10-28 | Integration of persistent Laplacian and pre-trained transformer for protein solubility changes upon mutation | Jiahui Chen et.al. | 2310.18760 | link |
2023-10-28 | Assessing global ion thermal confinement in critical-gradient-optimized stellarators | A. Bañón Navarro et.al. | 2310.18705 | null |
2023-10-28 | Med-DANet V2: A Flexible Dynamic Architecture for Efficient Medical Volumetric Segmentation | Haoran Shen et.al. | 2310.18656 | null |
2023-10-28 | ODM3D: Alleviating Foreground Sparsity for Enhanced Semi-Supervised Monocular 3D Object Detection | Weijia Zhang et.al. | 2310.18620 | link |
2023-10-28 | Deep3DSketch+: Obtaining Customized 3D Model by Single Free-Hand Sketch through Deep Learning | Ying Zang et.al. | 2310.18609 | null |
2023-10-28 | The Milky Way Bulge extra-tidal star survey: BH 261 (AL 3) | Andrea Kunder et.al. | 2310.18575 | null |
2023-10-31 | FPM-INR: Fourier ptychographic microscopy image stack reconstruction using implicit neural representations | Haowen Zhou et.al. | 2310.18529 | link |
2023-10-27 | Using convolutional neural networks for stereological characterization of 3D hetero-aggregates based on synthetic STEM data | Lukas Fuchs et.al. | 2310.18523 | null |
2023-10-27 | Learning to design protein-protein interactions with enhanced generalization | Anton Bushuiev et.al. | 2310.18515 | link |
2023-10-27 | 3DCoMPaT $^{++}$ : An improved Large-scale 3D Vision Dataset for Compositional Recognition | Habib Slim et.al. | 2310.18511 | link |
2023-10-27 | Using Lyman- $α$ transits to constrain models of atmospheric escape | Ethan Schreyer et.al. | 2310.18486 | null |
2023-10-27 | Kinematic signatures of planet-disk interactions in VSI-turbulent protoplanetary disks | Marcelo Barraza-Alfaro et.al. | 2310.18484 | null |
2023-10-27 | Semi-Synthetic Dataset Augmentation for Application-Specific Gaze Estimation | Cedric Leblond-Menard et.al. | 2310.18469 | null |
2023-10-27 | Understanding the effect of curvature on the magnetization reversal of three-dimensional nanohelices | John Fullerton et.al. | 2310.18456 | null |
2023-10-27 | Exploring Shape Embedding for Cloth-Changing Person Re-Identification via 2D-3D Correspondences | Yubin Wang et.al. | 2310.18438 | null |
2023-10-27 | Isotropic 3D topological phases with broken time reversal symmetry | Helene Spring et.al. | 2310.18400 | null |
2023-10-27 | Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models | Pushkal Katara et.al. | 2310.18308 | null |
2023-10-27 | Structure of $3D$ gravastars in the context of massive gravity | H. Barzegar et.al. | 2310.18287 | null |
2023-10-27 | FOUND: Foot Optimization with Uncertain Normals for Surface Deformation Using Synthetic Data | Oliver Boyne et.al. | 2310.18279 | link |
2023-10-27 | Moments for Perceptive Narration Analysis Through the Emotional Attachment of Audience to Discourse and Story | Gary Bruins et.al. | 2310.18273 | null |
2023-10-27 | Impact of Property Covariance on Cluster Weak lensing Scaling Relations | Zhuowen Zhang et.al. | 2310.18266 | null |
2023-10-27 | FLSH – Friendly Library for the Simulation of Humans | Pablo Ramón et.al. | 2310.18206 | null |
2023-10-27 | 3D atomic structure from a single XFEL pulse | G. Bortel et.al. | 2310.18203 | null |
2023-10-27 | An Energy-Efficient Near-Data Processing Accelerator for DNNs that Optimizes Data Accesses | Bahareh Khabbazan et.al. | 2310.18181 | null |
2023-10-27 | Deep3DSketch++: High-Fidelity 3D Modeling from Single Free-hand Sketches | Ying Zang et.al. | 2310.18178 | null |
2023-10-27 | Reality3DSketch: Rapid 3D Modeling of Objects from Single Freehand Sketches | Tianrun Chen et.al. | 2310.18148 | null |
2023-10-27 | Improving Intrinsic Exploration by Creating Stationary Objectives | Roger Creus Castanyer et.al. | 2310.18144 | null |
2023-10-27 | Unsupervised Representation Learning for Diverse Deformable Shape Collections | Sara Hahner et.al. | 2310.18141 | null |
2023-10-27 | TabAttention: Learning Attention Conditionally on Tabular Data | Michal K. Grzeszczyk et.al. | 2310.18129 | link |
2023-10-27 | Do we need scan-matching in radar odometry? | Vladimír Kubelka et.al. | 2310.18117 | link |
2023-10-27 | Physical properties of Centaur (60558) 174P/Echeclus from stellar occultations | C. L. Pereira et.al. | 2310.18084 | null |
2023-10-27 | Three-Dimensional Variable Slab-Selective Projection Acquisition Imaging | Jinil Park et.al. | 2310.18003 | null |
2023-10-27 | ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image | Kyle Sargent et.al. | 2310.17994 | link |
2023-10-27 | Autonomous 3D Exploration in Large-Scale Environments with Dynamic Obstacles | Emil Wiman et.al. | 2310.17977 | link |
2023-10-27 | On the existence of a tight planar relation between stellar specific angular momentum, mass and effective surface brightness for ALFALFA galaxies | E. Elson et.al. | 2310.17916 | null |
2023-10-27 | 3D-Aware Visual Question Answering about Parts, Poses and Occlusions | Xingrui Wang et.al. | 2310.17914 | link |
2023-10-27 | Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations | Tristan Aumentado-Armstrong et.al. | 2310.17880 | null |
2023-10-27 | What You See Is What You Detect: Towards better Object Densification in 3D detection | Tianran Liu et.al. | 2310.17842 | link |
2023-10-27 | Strongly anisotropic vortices in dipolar quantum droplets | Guilong Li et.al. | 2310.17840 | null |
2023-10-26 | AutoCT: Automated CT registration, segmentation, and quantification | Zhe Bai et.al. | 2310.17780 | null |
2023-10-26 | A Dataset of Relighted 3D Interacting Hands | Gyeongsik Moon et.al. | 2310.17768 | null |
2023-10-26 | 6-DoF Stability Field via Diffusion Models | Takuma Yoneda et.al. | 2310.17649 | null |
2023-10-26 | Simulation-based Inference of Reionization Parameters from 3D Tomographic 21 cm Light-cone Images – II: Application of Solid Harmonic Wavelet Scattering Transform | Xiaosheng Zhao et.al. | 2310.17602 | link |
2023-10-26 | Trading particle shape with fluid symmetry: on the mobility matrix in 3D chiral fluids | Tali Khain et.al. | 2310.17528 | null |
2023-10-26 | Masked Space-Time Hash Encoding for Efficient Dynamic Scene Reconstruction | Feng Wang et.al. | 2310.17527 | link |
2023-10-27 | FLARE: Fast Learning of Animatable and Relightable Mesh Avatars | Shrisha Bharadwaj et.al. | 2310.17519 | null |
2023-10-26 | Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving | Gilles Puy et.al. | 2310.17504 | link |
2023-10-30 | A Hybrid Graph Network for Complex Activity Detection in Video | Salman Khan et.al. | 2310.17493 | null |
2023-10-26 | Towards Learning Monocular 3D Object Localization From 2D Labels using the Physical Laws of Motion | Daniel Kienzle et.al. | 2310.17462 | link |
2023-10-26 | A hidden 2d CFT for self-dual Yang-Mills on the celestial sphere | Wei Bu et.al. | 2310.17457 | null |
2023-10-26 | SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation | Haobo Jiang et.al. | 2310.17359 | null |
2023-10-26 | IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting | Tim J. Schoonbeek et.al. | 2310.17323 | link |
2023-10-26 | BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds | Corentin Sautier et.al. | 2310.17281 | link |
2023-10-26 | Vorticity Alignment with Lyapunov Vectors and Rate-of-Strain Eigenvectors | Alex Encinas-Bartos et.al. | 2310.17267 | null |
2023-10-26 | On the origin of V-shaped polarisation spectra in molecular clouds | Daniel Seifried et.al. | 2310.17211 | null |
2023-10-26 | Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from a Monocular Camera and IMUs | Ryota Tanaka et.al. | 2310.17193 | link |
2023-10-26 | Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping | Feng Zhang et.al. | 2310.17190 | link |
2023-10-26 | Graphical Object-Centric Actor-Critic | Leonid Ugadiarov et.al. | 2310.17178 | null |
2023-10-26 | Structure discovery in Atomic Force Microscopy imaging of ice | F. Priante et.al. | 2310.17161 | link |
2023-10-26 | Simple Baselines for Projection-based Full-reference and No-reference Point Cloud Quality Assessment | Zicheng Zhang et.al. | 2310.17147 | null |
2023-10-26 | Optimizing the Temporal and Spatial Resolutions and Light Throughput of Fresnel Incoherent Correlation Holography in the Framework of Coded Aperture Imaging | Francis Gracy Arockiaraj et.al. | 2310.17103 | null |
2023-10-25 | Probing 3D magnetic fields using thermal dust polarization and grain alignment theory | Thiem Hoang et.al. | 2310.17048 | null |
2023-10-25 | Personalized Speech-driven Expressive 3D Facial Animation Synthesis with Style Control | Elif Bozkurt et.al. | 2310.17011 | null |
2023-10-25 | Hydrodynamic limit of multiscale viscoelastic models for rigid particle suspensions | Mitia Duerinckx et.al. | 2310.17008 | null |
2023-10-25 | Double-scaled SYK and de Sitter Holography | Vladimir Narovlansky et.al. | 2310.16994 | null |
2023-10-25 | Non-Clifford and parallelizable fault-tolerant logical gates on constant and almost-constant rate homological quantum LDPC codes via higher symmetries | Guanyu Zhu et.al. | 2310.16982 | null |
2023-10-25 | Post-dynamical inspiral phase of common envelope evolution. The role of magnetic fields | D. Gagnier et.al. | 2310.16880 | null |
2023-10-25 | SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation | Qianxu Wang et.al. | 2310.16838 | null |
2023-10-28 | PERF: Panoramic Neural Radiance Field from a Single Panorama | Guangcong Wang et.al. | 2310.16831 | link |
2023-10-26 | DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior | Jingxiang Sun et.al. | 2310.16818 | link |
2023-10-25 | Metal Nanoparticle-Functionalized Three-Dimensional Graphene: a versatile platform towards sensors and energy-related applications | Emanuele Pompei et.al. | 2310.16797 | null |
2023-10-25 | How to Extend 3D GBSM to Integrated Sensing and Communication Channel with Sharing Feature? | Yameng Liu et.al. | 2310.16765 | null |
2023-10-25 | Best practices for the manual curation of Intrinsically Disordered Proteins in DisProt | Federica Quaglia et.al. | 2310.16716 | null |
2023-10-25 | SkelFMM: A Simplified Fast Multipole Method Based on Recursive Skeletonization | Anna Yesypenko et.al. | 2310.16668 | null |
2023-10-25 | Flow-Attention-based Spatio-Temporal Aggregation Network for 3D Mask Detection | Yuxin Cao et.al. | 2310.16569 | link |
2023-10-25 | Lang3DSG: Language-based contrastive pre-training for 3D Scene Graph prediction | Sebastian Koch et.al. | 2310.16494 | null |
2023-10-25 | MVFAN: Multi-View Feature Assisted Network for 4D Radar Object Detection | Qiao Yan et.al. | 2310.16389 | null |
2023-10-25 | Deepfake Detection: Leveraging the Power of 2D and 3D CNN Ensembles | Aagam Bakliwal et.al. | 2310.16388 | null |
2023-10-25 | Open-NeRF: Towards Open Vocabulary NeRF Decomposition | Hao Zhang et.al. | 2310.16383 | null |
2023-10-25 | DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection | Se-Ho Kim et.al. | 2310.16349 | null |
2023-10-25 | MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network | Soroush Mehraban et.al. | 2310.16288 | link |
2023-10-25 | Directly 3D Printed, Pneumatically Actuated Multi-Material Robotic Hand | Hanna Matusik et.al. | 2310.16280 | null |
2023-10-24 | An augmented Lagrangian-based preconditioning technique for a class of block three-by-three linear systems | Fatemeh P. A. Beik et.al. | 2310.16216 | null |
2023-10-24 | Sea-Land-Cloud Segmentation in Satellite Hyperspectral Imagery by Deep Learning | Jon Alvarez Justo et.al. | 2310.16210 | link |
2023-10-24 | iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis | Yash Kant et.al. | 2310.16167 | null |
2023-10-24 | Granular packing simulation protocols: tap, press and relax | A. P. Santos et.al. | 2310.16114 | null |
2023-10-24 | EquivAct: SIM(3)-Equivariant Visuomotor Policies beyond Rigid Object Manipulation | Jingyun Yang et.al. | 2310.16050 | null |
2023-10-25 | Stanford-ORB: A Real-World 3D Object Inverse Rendering Benchmark | Zhengfei Kuang et.al. | 2310.16044 | link |
2023-10-24 | What’s Left? Concept Grounding with Logic-Enhanced Foundation Models | Joy Hsu et.al. | 2310.16035 | link |
2023-10-26 | ConvBKI: Real-Time Probabilistic Semantic Mapping Network with Quantifiable Uncertainty | Joey Wilson et.al. | 2310.16020 | null |
2023-10-24 | A New 2D Energy Balance Model For Simulating the Climates of Rapidly- and Slowly-Rotating Terrestrial Planets | Ramses M. Ramirez et.al. | 2310.15992 | null |
2023-10-24 | Geometry-Aware Video Quality Assessment for Dynamic Digital Human | Zicheng Zhang et.al. | 2310.15984 | null |
2023-10-24 | Frictional weakening of a granular sheared layer due to viscous rolling revealed by Discrete Element Modeling | Alexandre Sac–Morane et.al. | 2310.15945 | null |
2023-10-24 | Combining Behaviors with the Successor Features Keyboard | Wilka Carvalho et.al. | 2310.15940 | null |
2023-10-24 | A Spline-Based Collocation Method for Stokes and Navier-Stokes equations | Jinsil Lee et.al. | 2310.15825 | null |
2023-10-24 | 3D Masked Autoencoders for Enhanced Privacy in MRI Scans | Lennart Alexander Van der Goten et.al. | 2310.15778 | null |
2023-10-24 | Recurrent Linear Transformers | Subhojeet Pramanik et.al. | 2310.15719 | link |
2023-10-26 | GNeSF: Generalizable Neural Semantic Fields | Hanlin Chen et.al. | 2310.15712 | null |
2023-10-24 | Physics-Informed with Power-Enhanced Residual Network for Interpolation and Inverse Problems | Amir Noorizadegan et.al. | 2310.15690 | link |
2023-10-24 | Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation | Yinjie Lei et.al. | 2310.15676 | null |
2023-10-24 | Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection | Linyan Huang et.al. | 2310.15670 | link |
2023-10-24 | GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection | Yan Lu et.al. | 2310.15624 | link |
2023-10-24 | 3D Multi-Target Localization Via Intelligent Reflecting Surface: Protocol and Analysis | Meng Hua et.al. | 2310.15574 | null |
2023-10-24 | I $^2$ MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation | Yunyao Mao et.al. | 2310.15568 | null |
2023-10-24 | Topology Optimization with Text-Guided Stylization | Shengze Zhong et.al. | 2310.15506 | link |
2023-10-24 | Generalized Cardy conditions of topological defect lines | Xia Gu et.al. | 2310.15487 | null |
2023-10-26 | DeepIron: Predicting Unwarped Garment Texture from a Single Image | Hyun-Song Kwon et.al. | 2310.15447 | null |
2023-10-24 | Electric quadrupole second harmonic generation revealing dual magnetic orders in a magnetic Weyl semimetal | Youngjun Ahn et.al. | 2310.15423 | null |
2023-10-23 | Internally heated and fully compressible convection: flow morphology and scaling laws | Whitney T. Powers et.al. | 2310.15380 | link |
2023-10-23 | Vicinal Feature Statistics Augmentation for Federated 3D Medical Volume Segmentation | Yongsong Huang et.al. | 2310.15371 | null |
2023-10-23 | Curved Space-Filling Tiles Using Voronoi Decomposition with Line, and Curve Segments Closed Under Wallpaper Symmetries | Haard Panchal et.al. | 2310.15361 | null |
2023-10-23 | Non-invertible Symmetries in 2D from Type IIB String Theory | Xingyang Yu et.al. | 2310.15339 | null |
2023-10-23 | DeepVox and SAVE-CT: a contrast- and dose-independent 3D deep learning approach for thoracic aorta segmentation and aneurysm prediction using computed tomography scans | Matheus del-Valle et.al. | 2310.15328 | null |
2023-10-23 | ${\rm S{\scriptsize IM}BIG}$ : The First Cosmological Constraints from Non-Gaussian and Non-Linear Galaxy Clustering | ChangHoon Hahn et.al. | 2310.15246 | null |
2023-10-23 | Galaxies Going Bananas: Inferring the 3D Geometry of High-Redshift Galaxies with JWST-CEERS | Viraj Pandya et.al. | 2310.15232 | null |
2023-10-23 | The most stringent upper limit from dynamical models on the mass of a central black hole in 47 Tucanae | Alessandro Della Croce et.al. | 2310.15221 | null |
2023-10-24 | Ghost on the Shell: An Expressive Representation of General 3D Shapes | Zhen Liu et.al. | 2310.15168 | null |
2023-10-23 | SAM-Med3D | Haoyu Wang et.al. | 2310.15161 | link |
2023-10-23 | Accelerate Microstructure Evolution Simulation Using Graph Neural Networks with Adaptive Spatiotemporal Resolution | Shaoxun Fan et.al. | 2310.15153 | null |
2023-10-23 | DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design | Kevin Lin et.al. | 2310.15144 | link |
2023-10-23 | Novel-View Acoustic Synthesis from 3D Reconstructed Rooms | Byeongjoo Ahn et.al. | 2310.15130 | link |
2023-10-23 | Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model | Ruoxi Shi et.al. | 2310.15110 | link |
2023-10-23 | Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models | Iman Nematollahi et.al. | 2310.15059 | null |
2023-10-23 | Fast 2D Bicephalous Convolutional Autoencoder for Compressing 3D Time Projection Chamber Data | Yi Huang et.al. | 2310.15026 | link |
2023-10-23 | System Characterization of a Human-Sized 3D Real-Time Magnetic Particle Imaging Scanner for Cerebral Applications | Florian Thieben et.al. | 2310.15014 | null |
2023-10-24 | Wonder3D: Single Image to 3D using Cross-Domain Diffusion | Xiaoxiao Long et.al. | 2310.15008 | null |
2023-10-24 | Efficient Causal Discovery for Robotics Applications | Luca Castri et.al. | 2310.14925 | null |
2023-10-23 | Object Pose Estimation Annotation Pipeline for Multi-view Monocular Camera Systems in Industrial Settings | Hazem Youssef et.al. | 2310.14914 | null |
2023-10-23 | MAS: Multi-view Ancestral Sampling for 3D motion generation using 2D diffusion | Roy Kapon et.al. | 2310.14729 | null |
2023-10-23 | A Hybrid GNN approach for predicting node data for 3D meshes | Shwetha Salimath et.al. | 2310.14707 | null |
2023-10-23 | Interaction-Driven Active 3D Reconstruction with Object Interiors | Zihao Yan et.al. | 2310.14700 | null |
2023-10-23 | CAwa-NeRF: Instant Learning of Compression-Aware NeRF Features | Omnia Mahmoud et.al. | 2310.14695 | null |
2023-10-23 | Understanding Read Disturbance in High Bandwidth Memory: An Experimental Analysis of Real HBM2 DRAM Chips | Ataberk Olgun et.al. | 2310.14665 | link |
2023-10-23 | Relit-NeuLF: Efficient Relighting and Novel View Synthesis via Neural 4D Light Field | Zhong Li et.al. | 2310.14642 | link |
2023-10-23 | Pre-Training LiDAR-Based 3D Object Detectors Through Colorization | Tai-Yu Pan et.al. | 2310.14592 | link |
2023-10-23 | Modeling groundwater levels in California’s Central Valley by hierarchical Gaussian process and neural network regression | Anshuman Pradhan et.al. | 2310.14555 | link |
2023-10-23 | ADoPT: LiDAR Spoofing Attack Detection Based on Point-Level Temporal Consistency | Minkyoung Cho et.al. | 2310.14504 | null |
2023-10-23 | Quantum cluster algebras and 3D integrability: Tetrahedron and 3D reflection equations | Rei Inoue et.al. | 2310.14493 | null |
2023-10-23 | MSFormer: A Skeleton-multiview Fusion Method For Tooth Instance Segmentation | Yuan Li et.al. | 2310.14489 | null |
2023-10-23 | VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations | Yiying Yang et.al. | 2310.14487 | null |
2023-10-23 | First Detection and Modeling of Spatially Resolved Ly $α$ in TW Hya | Seok-Jun Chang et.al. | 2310.14477 | null |
2023-10-22 | Learning Generalizable Manipulation Policies with Object-Centric 3D Representations | Yifeng Zhu et.al. | 2310.14386 | null |
2023-10-22 | The Corona Australis star formation complex is accelerating away from the Galactic plane | L. Posch et.al. | 2310.14373 | null |
2023-10-22 | A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video | Jan Emily Mangulabnan et.al. | 2310.14364 | null |
2023-10-22 | A global product of fine-scale urban building height based on spaceborne lidar | Xiao Ma et.al. | 2310.14355 | null |
2023-10-22 | Learning a General Model of Single Phase Flow in Complex 3D Porous Media | Javier E. Santos et.al. | 2310.14298 | null |
2023-10-22 | High-Quality 3D Face Reconstruction with Affine Convolutional Networks | Zhiqian Lin et.al. | 2310.14237 | null |
2023-10-22 | Topologically Variable and Volumetric Morphing of 3D Architected Materials with Shape Locking | Kai Xiao et.al. | 2310.14220 | null |
2023-10-22 | Non-equilibrium Ionization Effects on Synthetic Spectra in the AWSoM Solar Corona | Judit Szente et.al. | 2310.14147 | null |
2023-10-21 | No-boundary Wave Functional and Own Mass of the Universe | Natalia Gorobey et.al. | 2310.14104 | null |
2023-10-21 | Robust NOMA-assisted OTFS-ISAC Network Design with 3D Motion Prediction Topology | Luping Xiang et.al. | 2310.13984 | null |
2023-10-21 | Linguistically Motivated Sign Language Segmentation | Amit Moryossef et.al. | 2310.13960 | link |
2023-10-21 | Uniaxial compression of 3D printed samples with voids: laboratory measurements compared with predictions from Effective Medium Theory | Filip P. Adamus et.al. | 2310.13956 | null |
2023-10-21 | Competitive Ensembling Teacher-Student Framework for Semi-Supervised Left Atrium MRI Segmentation | Yuyan Shi et.al. | 2310.13955 | null |
2023-10-21 | Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in NMS | Li Wang et.al. | 2310.13951 | null |
2023-10-20 | Dark matter distribution in Milky Way-analog galaxies | Natanael Gomes-Oliveira et.al. | 2310.13839 | null |
2023-10-20 | Morphological Study of Granular-Granular Impact Craters through Time-of-Flight Cameras: from Concept to Automation in Python | F. Corrales-Machín et.al. | 2310.13834 | null |
2023-10-20 | A Modular Framework for Implicit 3D-0D Coupling in Cardiac Mechanics | Aaron L. Brown et.al. | 2310.13780 | null |
2023-10-20 | TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models | Tianshi Cao et.al. | 2310.13772 | null |
2023-10-20 | Spikes and accretion of unbound, collisionless matter around black holes | Stuart L. Shapiro et.al. | 2310.13739 | null |
2023-10-20 | Hyperbolic Vacua in Minkowski Space | Walker Melton et.al. | 2310.13663 | null |
2023-10-20 | 3D-Mirrorcle: Bridging the Virtual and Real through Depth Alignment in Smart Mirror Systems | Yujia Liu et.al. | 2310.13617 | null |
2023-10-20 | Local symmetry groups for arbitrary wavevectors | Emanuele Maggio et.al. | 2310.13568 | null |
2023-10-20 | Maser Investigation toward Off-Plane Stars (MIOPS): detection of SiO masers in the Galactic thick disk and halo | Wenjin Yang et.al. | 2310.13489 | null |
2023-10-20 | Phase structure and critical phenomena in 2-flavor QCD by holography | Yan-Qing Zhao et.al. | 2310.13432 | null |
2023-10-20 | OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal 3D Data | Yijie Zhou et.al. | 2310.13398 | link |
2023-10-20 | RL-X: A Deep Reinforcement Learning Library (not only) for RoboCup | Nico Bohlinger et.al. | 2310.13396 | link |
2023-10-20 | Single-view 3D reconstruction via inverse procedural modeling | Albert Garifullin et.al. | 2310.13373 | null |
2023-10-20 | Elasto-plastic residual stress analysis of selective sintered porous materials based on 3D-multilayer thermo-structural phase-field simulations | Yangyiwei Yang et.al. | 2310.13351 | null |
2023-10-20 | EarlyBird: Early-Fusion for Multi-View Tracking in the Bird’s Eye View | Torben Teepe et.al. | 2310.13350 | link |
2023-10-20 | VR PreM+: An Immersive Pre-learning Branching Visualization System for Museum Tours | Ze Gao et.al. | 2310.13294 | null |
2023-10-20 | UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene | Jiaming Gu et.al. | 2310.13263 | null |
2023-10-19 | RMap: Millimeter-Wave Radar Mapping Through Volumetric Upsampling | Ajay Narasimha Mopidevi et.al. | 2310.13188 | link |
2023-10-19 | ITER-IA 3D MHD Simulations of Shattered Pellet Injection(SPI)- D1.1 Optimization of the SPI model | Charlson. C. Kim et.al. | 2310.13176 | null |
2023-10-19 | ITER-IA 3D MHD Simulations of Shattered Pellet Injection(SPI) – D1.3 Code Validation (DIII-D) | Charlson. C. Kim et.al. | 2310.13172 | null |
2023-10-19 | Conditional Generative Modeling for Images, 3D Animations, and Video | Vikram Voleti et.al. | 2310.13157 | null |
2023-10-19 | DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation | Bangbang Yang et.al. | 2310.13119 | null |
2023-10-19 | NeuroSMPC: A Neural Network guided Sampling Based MPC for On-Road Autonomous Driving | Kaustab Pal et.al. | 2310.13077 | null |
2023-10-19 | Global Symmetries, Code Ensembles, and Sums Over Geometries | Ahmed Barbar et.al. | 2310.13044 | null |
2023-10-19 | Human Pose-based Estimation, Tracking and Action Recognition with Deep Learning: A Survey | Lijuan Zhou et.al. | 2310.13039 | null |
2023-10-19 | FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects | Mayank Lunayach et.al. | 2310.12974 | link |
2023-10-19 | Frozen Transformers in Language Models Are Effective Visual Encoder Layers | Ziqi Pang et.al. | 2310.12973 | link |
2023-10-19 | 3D-GPT: Procedural 3D Modeling with Large Language Models | Chunyi Sun et.al. | 2310.12945 | null |
2023-10-19 | Plasmon Fizeau drag in 3D Dirac and Weyl semimetals | Morgan G. Blevins et.al. | 2310.12938 | null |
2023-10-19 | Fractal Subsystem Symmetries, ‘t Hooft Anomalies, and UV/IR Mixing | Heitor Casasola et.al. | 2310.12894 | null |
2023-10-19 | Statistical Process Monitoring of Isolated and Persistent Defects in Complex Geometrical Shapes | Sara Bonacina et.al. | 2310.12876 | null |
NeRF
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-15 | Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field | Jinlong Fan et.al. | 2505.10049 | link |
2025-05-15 | Large-Scale Gaussian Splatting SLAM | Zhe Xin et.al. | 2505.09915 | null |
2025-05-14 | FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling | Yue Wen et.al. | 2505.09406 | null |
2025-05-13 | FOCI: Trajectory Optimization on Gaussian Splats | Mario Gomez Andreu et.al. | 2505.08510 | null |
2025-05-13 | A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering | Chuanzhi Xu et.al. | 2505.08438 | null |
2025-05-12 | Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild | Lintao Xiang et.al. | 2505.07373 | null |
2025-05-11 | NeuGen: Amplifying the ‘Neural’ in Neural Radiance Fields for Domain Generalization | Ahmed Qazi et.al. | 2505.06894 | null |
2025-05-10 | 3D Characterization of Smoke Plume Dispersion Using Multi-View Drone Swarm | Nikil Krishnakumar et.al. | 2505.06638 | null |
2025-05-10 | FlexNeRFer: A Multi-Dataflow, Adaptive Sparsity-Aware Accelerator for On-Device NeRF Rendering | Seock-Hwan Noh et.al. | 2505.06504 | null |
2025-05-07 | GSsplat: Generalizable Semantic Gaussian Splatting for Novel-view Synthesis in 3D Scenes | Feng Xiao et.al. | 2505.04659 | link |
2025-05-04 | Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance Fields | Zhenxing Mi et.al. | 2505.02005 | link |
2025-05-03 | Visual enhancement and 3D representation for underwater scenes: a review | Guoxi Huang et.al. | 2505.01869 | null |
2025-05-03 | AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting | Junhao Shi et.al. | 2505.01799 | null |
2025-04-30 | A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond | Jiajia Li et.al. | 2505.00737 | null |
2025-05-01 | Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation | Feng Xue et.al. | 2505.00378 | null |
2025-04-29 | GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction | Yuhan Xie et.al. | 2504.21067 | link |
2025-04-28 | Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video | Hoang Chuong Nguyen et.al. | 2504.19819 | null |
2025-04-24 | CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos | Shucheng Gong et.al. | 2504.17728 | link |
2025-04-23 | Dual-Camera All-in-Focus Neural Radiance Fields | Xianrui Luo et.al. | 2504.16636 | null |
2025-04-23 | SaENeRF: Suppressing Artifacts in Event-based Neural Radiance Fields | Yuanjian Wang et.al. | 2504.16389 | link |
2025-04-10 | NeRF-APT: A New NeRF Framework for Wireless Channel Prediction | Jingzhou Shen et.al. | 2504.16094 | null |
2025-04-22 | Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models | Quentin Herau et.al. | 2504.15776 | null |
2025-04-18 | Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training | Andrea Amaduzzi et.al. | 2504.13995 | null |
2025-04-21 | SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM | Samuel Cerezo et.al. | 2504.13713 | link |
2025-04-16 | BEV-GS: Feed-forward Gaussian Splatting in Bird’s-Eye-View for Road Reconstruction | Wenhua Wu et.al. | 2504.13207 | null |
2025-04-15 | Explicit and Implicit Representations in AI-based 3D Reconstruction for Radiology: A systematic literature review | Yuezhe Yang et.al. | 2504.11349 | link |
2025-04-14 | Relative Illumination Fields: Learning Medium and Light Independent Underwater Scenes | Mengkun She et.al. | 2504.10024 | null |
2025-04-14 | MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling | Yunpeng Tan et.al. | 2504.09878 | null |
2025-04-12 | Text To 3D Object Generation For Scalable Room Assembly | Sonia Laguna et.al. | 2504.09328 | null |
2025-04-11 | HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields | Asterios Reppas et.al. | 2504.08901 | null |
2025-04-11 | Generative AI for Film Creation: A Survey of Recent Advances | Ruihan Zhang et.al. | 2504.08296 | null |
2025-04-09 | Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting | Daiwei Zhang et.al. | 2504.06978 | null |
2025-04-08 | Meta-Continual Learning of Neural Fields | Seungyoon Woo et.al. | 2504.05806 | null |
2025-04-08 | InvNeRF-Seg: Fine-Tuning a Pre-Trained NeRF for 3D Object Segmentation | Jiangsan Zhao et.al. | 2504.05751 | null |
2025-04-07 | L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery | Yi-Zhen Tsai et.al. | 2504.05517 | link |
2025-04-07 | DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal | Wanzhou Liu et.al. | 2504.04679 | null |
2025-04-06 | Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models | Etienne Chassaing et.al. | 2504.04448 | null |
2025-04-04 | NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices | Zhe Wang et.al. | 2504.03415 | null |
2025-04-03 | MultiNeRF: Multiple Watermark Embedding for Neural Radiance Fields | Yash Kulthe et.al. | 2504.02517 | null |
2025-04-01 | OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF | Jingyu Shi et.al. | 2504.02007 | null |
2025-04-02 | Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis | Niluthpol Chowdhury Mithun et.al. | 2504.01960 | null |
2025-04-02 | BOGausS: Better Optimized Gaussian Splatting | Stéphane Pateux et.al. | 2504.01844 | null |
2025-04-02 | RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars | Yahui Li et.al. | 2504.01559 | null |
2025-04-23 | Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment | Ziteng Cui et.al. | 2504.01503 | link |
2025-04-07 | Neural Pruning for 3D Scene Reconstruction: Efficient NeRF Acceleration | Tianqi Ding et.al. | 2504.00950 | null |
2025-04-09 | NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds | Mahan Rafidashti et.al. | 2504.00859 | null |
2025-03-31 | NeRF-Based defect detection | Tianqi et.al. | 2504.00270 | null |
2025-03-28 | ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting | Wenjie Liu et.al. | 2503.22218 | null |
2025-03-28 | LandMarkSystem Technical Report | Zhenxiang Ma et.al. | 2503.21364 | link |
2025-03-26 | AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Xiangwen Zhang et.al. | 2503.20654 | null |
2025-03-25 | Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals | Zhirui Dai et.al. | 2503.20066 | null |
2025-03-26 | A Survey on Event-driven 3D Reconstruction: Development under Different Categories | Chuanzhi Xu et.al. | 2503.19753 | null |
2025-03-25 | MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities | Federico Lincetto et.al. | 2503.19673 | null |
2025-03-25 | SINR: Sparsity Driven Compressed Implicit Neural Representations | Dhananjaya Jayasundara et.al. | 2503.19576 | null |
2025-04-02 | EmoHead: Emotional Talking Head via Manipulating Semantic Expression Parameters | Xuli Shen et.al. | 2503.19416 | null |
2025-03-24 | NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting | Yulong Zheng et.al. | 2503.18794 | null |
2025-03-30 | NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction | Wenyuan Zhang et.al. | 2503.18361 | null |
2025-03-21 | FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields | Kwan Yun et.al. | 2503.17095 | link |
2025-03-20 | Digitally Prototype Your Eye Tracker: Simulating Hardware Performance using 3D Synthetic Data | Esther Y. H. Lin et.al. | 2503.16742 | null |
2025-03-20 | Automating 3D Dataset Generation with Neural Radiance Fields | P. Schulz et.al. | 2503.15997 | link |
2025-03-20 | Enhancing Close-up Novel View Synthesis via Pseudo-labeling | Jiatong Xia et.al. | 2503.15908 | link |
2025-03-19 | DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis | Yuming Gu et.al. | 2503.15667 | link |
2025-03-19 | GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector | Zechuan Li et.al. | 2503.15211 | null |
2025-03-19 | MultiBARF: Integrating Imagery of Different Wavelength Regions by Using Neural Radiance Fields | Kana Kurata et.al. | 2503.15070 | null |
2025-03-20 | These Magic Moments: Differentiable Uncertainty Quantification of Radiance Field Models | Parker Ewen et.al. | 2503.14665 | null |
2025-03-18 | Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis | Yizhou Li et.al. | 2503.14219 | null |
2025-03-17 | Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios | Iryna Repinetska et.al. | 2503.13710 | null |
2025-03-17 | DivCon-NeRF: Generating Augmented Rays with Diversity and Consistency for Few-shot View Synthesis | Ingyun Lee et.al. | 2503.12947 | null |
2025-03-15 | FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields | Rui Qian et.al. | 2503.12086 | null |
2025-04-03 | 3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene Reconstruction | Peizhen Zheng et.al. | 2503.12001 | link |
2025-03-14 | Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation | Xianming Zeng et.al. | 2503.11731 | null |
2025-03-13 | Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations | Xunzhi Zheng et.al. | 2503.10464 | null |
2025-03-11 | Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming | Jiaxuan Zhu et.al. | 2503.08166 | null |
2025-03-11 | GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats | Kai Deng et.al. | 2503.08071 | link |
2025-03-11 | NeRF-VIO: Map-Based Visual-Inertial Odometry with Initialization Leveraging Neural Radiance Fields | Yanyu Zhang et.al. | 2503.07952 | null |
2025-03-10 | Neural Radiance and Gaze Fields for Visual Attention Modeling in 3D Environments | Andrei Chubarau et.al. | 2503.07828 | null |
2025-03-09 | Gaussian RBFNet: Gaussian Radial Basis Functions for Fast and Accurate Representation and Reconstruction of Neural Fields | Abdelaziz Bouzidi et.al. | 2503.06762 | null |
2025-03-08 | Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction | Kai Li et.al. | 2503.06161 | null |
2025-03-08 | NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features | Hongjia Zhai et.al. | 2503.06117 | null |
2025-03-06 | Metadata-free Georegistration of Ground and Airborne Imagery | Adam Bredvik et.al. | 2503.04927 | null |
2025-03-06 | Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering | Idris O. Sunmola et.al. | 2503.04079 | null |
2025-03-05 | LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation | Qian Feng et.al. | 2503.03890 | null |
2025-03-04 | 2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting | Qipeng Yan et.al. | 2503.02452 | null |
2025-03-04 | Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views | Yingji Zhong et.al. | 2503.02230 | null |
2025-03-04 | Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints | Yan Miao et.al. | 2503.02198 | null |
2025-03-03 | Data Augmentation for NeRFs in the Low Data Limit | Ayush Gaggar et.al. | 2503.02092 | null |
2025-03-03 | Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models | Jay Zhangjie Wu et.al. | 2503.01774 | null |
2025-02-26 | Glad: A Streaming Scene Generator for Autonomous Driving | Bin Xie et.al. | 2503.00045 | null |
2025-02-28 | EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering | John J. Han et.al. | 2502.20669 | null |
2025-03-25 | Identity-preserving Distillation Sampling by Fixed-Point Iterator | SeonHwa Kim et.al. | 2502.19930 | null |
2025-02-27 | NeRFCom: Feature Transform Coding Meets Neural Radiance Field for Free-View 3D Scene Semantic Transmission | Weijie Yue et.al. | 2502.19873 | null |
2025-02-26 | Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions | Muhammad Salman Ali et.al. | 2502.19457 | null |
2025-02-26 | Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? | Adam Celarek et.al. | 2502.19318 | link |
2025-02-26 | The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields | Ziyuan Luo et.al. | 2502.19125 | null |
2025-02-24 | Semantic Neural Radiance Fields for Multi-Date Satellite Data | Valentin Wagner et.al. | 2502.16992 | link |
2025-02-23 | ViSNeRF: Efficient Multidimensional Neural Radiance Field Representation for Visualization Synthesis of Dynamic Volumetric Scenes | Siyuan Yao et.al. | 2502.16731 | link |
2025-02-22 | AquaNeRF: Neural Radiance Fields in Underwater Media with Distractor Removal | Luca Gough et.al. | 2502.16351 | null |
2025-02-20 | NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis | Xiaoxing Liu et.al. | 2502.14178 | null |
2025-02-19 | GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian | Bang Du et.al. | 2502.14129 | null |
2025-02-18 | GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis | Pedro Martin et.al. | 2502.13196 | null |
2025-02-18 | ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition | Quoc-Anh Bui et.al. | 2502.12673 | null |
2025-04-28 | 3D Gaussian Inpainting with Depth-Guided Cross-View Consistency | Sheng-Yu Huang et.al. | 2502.11801 | null |
2025-02-14 | Multi-view 3D surface reconstruction from SAR images by inverse rendering | Emile Barbier–Renard et.al. | 2502.10492 | null |
2025-02-13 | Embed Any NeRF: Graph Meta-Networks for Neural Tasks on Arbitrary NeRF Architectures | Francesco Ballerini et.al. | 2502.09623 | null |
2025-02-12 | Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision | Tianle Liu et.al. | 2502.08352 | null |
2025-02-08 | GWRF: A Generalizable Wireless Radiance Field for Wireless Signal Propagation Modeling | Kang Yang et.al. | 2502.05708 | null |
2025-02-05 | VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning | Jayram Palamadai et.al. | 2502.05222 | null |
2025-02-04 | SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification | Yifu Tao et.al. | 2502.02657 | null |
2025-02-04 | MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning | Shengbo Gu et.al. | 2502.02372 | null |
2025-02-04 | Geometric Neural Process Fields | Wenzhe Yin et.al. | 2502.02338 | null |
2025-01-31 | Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields | Xingyu Miao et.al. | 2501.19084 | link |
2025-05-06 | Deformable Beta Splatting | Rong Liu et.al. | 2501.18630 | link |
2025-02-05 | GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting | Junzhe Jiang et.al. | 2501.13971 | link |
2025-01-22 | Neural Radiance Fields for the Real World: A Survey | Wenhui Xiao et.al. | 2501.13104 | null |
2025-02-02 | DWTNeRF: Boosting Few-shot Neural Radiance Fields via Discrete Wavelet Transform | Hung Nguyen et.al. | 2501.12637 | null |
2025-01-21 | Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging | Shuyi Hu et.al. | 2501.11884 | null |
2025-01-16 | Poxel: Voxel Reconstruction for 3D Printing | Ruixiang Cao et.al. | 2501.10474 | null |
2025-01-16 | Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective Scenes | Ji Shi et.al. | 2501.09460 | link |
2025-01-13 | Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes | Yuhang Zhang et.al. | 2501.08072 | null |
2025-04-11 | SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting | Yue Hu et.al. | 2501.07015 | null |
2025-02-02 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
2025-01-10 | UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping | Yanjie Li et.al. | 2501.05783 | null |
2025-01-07 | NeRFs are Mirror Detectors: Using Structural Similarity for Multi-View Mirror Scene Reconstruction with 3D Surface Primitives | Leif Van Holland et.al. | 2501.04074 | link |
2025-01-07 | NeuralSVG: An Implicit Representation for Text-to-Vector Generation | Sagi Polaczek et.al. | 2501.03992 | null |
2025-01-07 | AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene | Chaoran Feng et.al. | 2501.02807 | null |
2024-12-29 | Bringing Objects to Life: 4D generation from 3D objects | Ohad Rahamim et.al. | 2412.20422 | null |
2024-12-27 | Learning Radiance Fields from a Single Snapshot Compressive Image | Yunhao Li et.al. | 2412.19483 | null |
2025-01-05 | BeSplat: Gaussian Splatting from a Single Blurry Image and Event Stream | Gopi Raju Matta et.al. | 2412.19370 | null |
2024-12-26 | Generating Editable Head Avatars with 3D Gaussian GANs | Guohao Li et.al. | 2412.19149 | link |
2024-12-26 | MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo | Byeonggwon Lee et.al. | 2412.19130 | null |
2024-12-23 | Editing Implicit and Explicit Representations of Radiance Fields: A Survey | Arthur Hubert et.al. | 2412.17628 | null |
2024-12-23 | Exploring Dynamic Novel View Synthesis Technologies for Cinematography | Adrian Azzarelli et.al. | 2412.17532 | null |
2024-12-18 | AdvIRL: Reinforcement Learning-Based Adversarial Attacks on 3D NeRF Models | Tommy Nguyen et.al. | 2412.16213 | link |
2024-12-20 | NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems | Laura Weihl et.al. | 2412.16141 | null |
2025-01-11 | NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images | Yue Guo et.al. | 2412.15890 | null |
2024-12-26 | LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction | Pou-Chun Kung et.al. | 2412.15447 | null |
2024-12-18 | DreaMark: Rooting Watermark in Score Distillation Sampling Generated Neural Radiance Fields | Xingyu Zhu et.al. | 2412.15278 | null |
2024-12-19 | LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation | Chenxu Zhou et.al. | 2412.15199 | null |
2024-12-19 | Bright-NeRF:Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images | Min Wang et.al. | 2412.14547 | null |
2024-12-18 | GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians | Xiaobao Wei et.al. | 2412.13983 | link |
2025-03-25 | RelationField: Relate Anything in Radiance Fields | Sebastian Koch et.al. | 2412.13652 | link |
2024-12-18 | Optimize the Unseen – Fast NeRF Cleanup with Free Space Prior | Leo Segre et.al. | 2412.12772 | null |
2024-12-16 | VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression | Qiang Hu et.al. | 2412.11362 | null |
2025-01-10 | ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction | Yi Feng et.al. | 2412.11210 | link |
2024-12-13 | NeRF-Texture: Synthesizing Neural Radiance Field Textures | Yi-Hua Huang et.al. | 2412.10004 | null |
2024-12-13 | Sharpening Your Density Fields: Spiking Neuron Aided Fast Geometry Learning | Yi Gu et.al. | 2412.09881 | null |
2025-04-07 | PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields | Sean Wu et.al. | 2412.09680 | link |
2024-12-18 | GN-FR:Generalizable Neural Radiance Fields for Flare Removal | Gopi Raju Matta et.al. | 2412.08200 | null |
2024-12-10 | EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering | Toshiya Yura et.al. | 2412.07293 | null |
2024-12-03 | $ρ$ -NeRF: Leveraging Attenuation Priors in Neural Radiance Field for 3D Computed Tomography Reconstruction | Li Zhou et.al. | 2412.05322 | null |
2024-12-11 | MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting | Peng Chen et.al. | 2412.04955 | link |
2024-12-04 | NeRF and Gaussian Splatting SLAM in the Wild | Fabian Schmidt et.al. | 2412.03263 | link |
2024-12-03 | RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians | Qiankun Gao et.al. | 2412.02493 | link |
2024-12-02 | CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion | Kai He et.al. | 2412.01792 | null |
2024-12-01 | CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images | Jian Liu et.al. | 2412.00754 | null |
2025-03-08 | Incremental Multi-Scene Modeling via Continual Neural Graphics Primitives | Prajwal Singh et.al. | 2411.19903 | null |
2024-11-29 | Gaussian Splashing: Direct Volumetric Rendering Underwater | Nir Mualem et.al. | 2411.19588 | null |
2024-11-29 | Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook | Florinel-Alin Croitoru et.al. | 2411.19537 | link |
2024-12-23 | LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis | Tianqi Li et.al. | 2411.19525 | null |
2024-11-27 | Surf-NeRF: Surface Regularised Neural Radiance Fields | Jack Naylor et.al. | 2411.18652 | null |
2024-11-26 | MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields | Yixiong Yang et.al. | 2411.17235 | link |
2025-03-13 | SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving | Georg Hess et.al. | 2411.16816 | link |
2024-11-25 | Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction | Ziyu Zhang et.al. | 2411.16392 | null |
2024-11-25 | U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields | Vinayak Gupta et.al. | 2411.16172 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-12-20 | GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision | Baixin Xu et.al. | 2411.15723 | link |
2025-03-09 | NexusSplats: Efficient 3D Gaussian Splatting in the Wild | Yuzhou Tang et.al. | 2411.14514 | null |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting | Xiaobao Wei et.al. | 2411.12981 | null |
2024-11-19 | MTFusion: Reconstructing Any 3D Object from Single Image Using Multi-word Textual Inversion | Yu Liu et.al. | 2411.12197 | null |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting | Kang Chen et.al. | 2411.10504 | link |
2024-11-15 | GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization | Yanhao Sun et.al. | 2411.10033 | null |
2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
2024-11-14 | CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants | Albert J. Zhai et.al. | 2411.09693 | null |
2024-11-13 | Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models | Chengdong Dong et.al. | 2411.08642 | null |
2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-12 | TomoGRAF: A Robust and Generalizable Reconstruction Network for Single-View Computed Tomography | Di Xu et.al. | 2411.08158 | null |
2024-11-12 | Material Transforms from Disentangled NeRF Representations | Ivan Lopes et.al. | 2411.08037 | link |
2024-11-11 | LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes | Zefan Qu et.al. | 2411.06757 | link |
2025-01-20 | From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS | Haoran Zhang et.al. | 2411.05362 | link |
2024-11-08 | Rate-aware Compression for NeRF-based Volumetric Video | Zhiyu Zhang et.al. | 2411.05322 | null |
2024-11-07 | Planar Reflection-Aware Neural Radiance Fields | Chen Gao et.al. | 2411.04984 | null |
2024-11-06 | Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Rui Peng et.al. | 2411.03637 | link |
2025-05-05 | CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval | Xin Wen et.al. | 2411.02979 | null |
2024-11-05 | Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery | Liv Kåreborn et.al. | 2411.02972 | null |
2025-03-07 | NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields | Eric Zhu et.al. | 2411.02482 | null |
2024-11-05 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-12-04 | GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes | Gaochao Song et.al. | 2411.01853 | null |
2024-11-04 | A Probabilistic Formulation of LiDAR Mapping with Neural Radiance Fields | Matthew McDermott et.al. | 2411.01725 | link |
2024-10-30 | ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting | Muhammad Salman Ali et.al. | 2410.23213 | null |
2024-11-16 | EEG-Driven 3D Object Reconstruction with Style Consistency and Diffusion Prior | Xin Xiang et.al. | 2410.20981 | null |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | link |
2024-10-27 | Neural rendering enables dynamic tomography | Ivan Grega et.al. | 2410.20558 | null |
2024-10-27 | GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields | Yusuke Sekikawa et.al. | 2410.20306 | null |
2024-10-26 | Neural Fields in Robotics: A Survey | Muhammad Zubair Irshad et.al. | 2410.20220 | link |
2024-10-19 | GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration | Silong Yong et.al. | 2410.19831 | null |
2024-10-25 | Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization | Weihang Liu et.al. | 2410.19483 | link |
2024-10-25 | Evaluation of strategies for efficient rate-distortion NeRF streaming | Pedro Martin et.al. | 2410.19459 | null |
2024-10-24 | Real-time 3D-aware Portrait Video Relighting | Ziqi Cai et.al. | 2410.18355 | link |
2025-01-05 | Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies | Shrey Vishen et.al. | 2410.18137 | link |
2024-10-23 | VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points | Linus Franke et.al. | 2410.17932 | null |
2024-10-23 | Few-shot NeRF by Adaptive Rendering Loss Regularization | Qingshan Xu et.al. | 2410.17839 | null |
2024-10-23 | Efficient Neural Implicit Representation for 3D Human Reconstruction | Zexu Huang et.al. | 2410.17741 | link |
2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
2025-05-01 | EmoGene: Audio-Driven Emotional 3D Talking-Head Generation | Wenqing Wang et.al. | 2410.17262 | null |
2024-10-18 | GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting | Yusen Xie et.al. | 2410.17084 | null |
2025-04-11 | E-3DGS: Gaussian Splatting with Exposure and Motion Events | Xiaoting Yin et.al. | 2410.16995 | link |
2024-10-21 | Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions | Malte Prinzler et.al. | 2410.16395 | null |
2025-03-25 | FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors | Chin-Yang Lin et.al. | 2410.16271 | null |
2024-10-19 | Neural Radiance Field Image Refinement through End-to-End Sampling Point Optimization | Kazuhiro Ohta et.al. | 2410.14958 | null |
2024-10-18 | Learning autonomous driving from aerial imagery | Varun Murali et.al. | 2410.14177 | null |
2024-10-18 | DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction | Ange Lou et.al. | 2410.14169 | null |
2024-10-17 | Object Pose Estimation Using Implicit Representation For Transparent Objects | Varun Burde et.al. | 2410.13465 | null |
2024-12-19 | 3D Gaussian Splatting in Robotics: A Survey | Siting Zhu et.al. | 2410.12262 | link |
2024-10-16 | EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View | Zhaorong Wang et.al. | 2410.12242 | null |
2024-10-15 | TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Zhiwei Lin et.al. | 2410.11228 | link |
2024-10-14 | Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting | Raja Kumar et.al. | 2410.11080 | link |
2024-10-14 | NeRF-enabled Analysis-Through-Synthesis for ISAR Imaging of Small Everyday Objects with Sparse and Noisy UWB Radar Data | Md Farhan Tasnim Oshim et.al. | 2410.10085 | null |
2024-10-13 | Magnituder Layers for Implicit Neural Representations in 3D | Sang Min Kim et.al. | 2410.09771 | null |
2024-10-12 | Improving 3D Finger Traits Recognition via Generalizable Neural Rendering | Hongbin Xu et.al. | 2410.09582 | null |
2025-05-08 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints | Yicheng He et.al. | 2410.08780 | null |
2024-10-10 | Generalizable and Animatable Gaussian Head Avatar | Xuangeng Chu et.al. | 2410.07971 | link |
2024-10-11 | NeRF-Accelerated Ecological Monitoring in Mixed-Evergreen Redwood Forest | Adam Korycki et.al. | 2410.07418 | link |
2024-10-09 | DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation | Zhiqi Li et.al. | 2410.06756 | null |
2024-10-15 | MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Zhenhui Ye et.al. | 2410.06734 | null |
2024-10-09 | 3D Representation Methods: A Survey | Zhengren Wang et.al. | 2410.06475 | null |
2024-10-08 | Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters | Guoji Tian et.al. | 2410.05772 | null |
2024-10-07 | Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Ziwei Liao et.al. | 2410.05514 | link |
2024-10-11 | PH-Dropout: Practical Epistemic Uncertainty Quantification for View Synthesis | Chuanhao Sun et.al. | 2410.05468 | link |
2025-03-14 | LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting | Qifeng Chen et.al. | 2410.05111 | null |
2025-03-11 | 6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering | Zhongpai Gao et.al. | 2410.04974 | null |
2024-10-07 | TeX-NeRF: Neural Radiance Fields from Pseudo-TeX Vision | Chonghao Zhong et.al. | 2410.04873 | null |
2024-10-06 | In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding | Shenghao Li et.al. | 2410.04529 | null |
2024-10-06 | Deformable NeRF using Recursively Subdivided Tetrahedra | Zherui Qiu et.al. | 2410.04402 | null |
2025-02-28 | EndoPerfect: High-Accuracy Monocular Depth Estimation and 3D Reconstruction for Endoscopic Surgery via NeRF-Stereo Fusion | Pengcheng Chen et.al. | 2410.04041 | null |
2024-10-02 | MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis | Xiaobiao Du et.al. | 2410.02103 | link |
2024-10-02 | 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Yang Cao et.al. | 2410.01647 | link |
2025-04-24 | GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians | Shuyi Jiang et.al. | 2410.01535 | null |
2025-02-07 | AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction | Jingnan Gao et.al. | 2410.01202 | null |
2024-10-01 | GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer | Youngho Yoon et.al. | 2410.00672 | link |
2024-10-01 | Cafca: High-quality Novel View Synthesis of Expressive Faces from Casual Few-shot Captures | Marcel C. Bühler et.al. | 2410.00630 | null |
2024-10-01 | Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance | Hongchao Shu et.al. | 2410.00386 | null |
2024-09-30 | Distributed NeRF Learning for Collaborative Multi-Robot Perception | Hongrui Zhao et.al. | 2409.20289 | null |
2025-04-07 | RNG: Relightable Neural Gaussians | Jiahui Fan et.al. | 2409.19702 | null |
2024-11-07 | LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Huan Wang et.al. | 2409.18057 | link |
2024-09-26 | Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions | Weng Fei Low et.al. | 2409.17988 | null |
2024-09-26 | Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry | Qi Zhang et.al. | 2409.17729 | null |
2025-04-25 | Let’s Make a Splan: Risk-Aware Trajectory Optimization in a Normalized Gaussian Splat | Jonathan Michaux et.al. | 2409.16915 | null |
2024-09-25 | TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans | Aggelina Chatziagapi et.al. | 2409.16666 | null |
2024-09-24 | Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB | Jae Yong Lee et.al. | 2409.15689 | null |
2024-09-23 | AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions | Samarth Chopra et.al. | 2409.15487 | null |
2024-10-14 | SpikeGS: Learning 3D Gaussian Fields from Continuous Spike Stream | Jinze Yu et.al. | 2409.15176 | link |
2025-03-28 | FusionRF: High-Fidelity Satellite Neural Radiance Fields from Multispectral and Panchromatic Acquisitions | Michael Sprintson et.al. | 2409.15132 | null |
2024-09-22 | MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Wangze Xu et.al. | 2409.14316 | null |
2024-09-25 | BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling | Lulin Zhang et.al. | 2409.12014 | link |
2024-09-18 | Intraoperative Registration by Cross-Modal Inverse Neural Rendering | Maximilian Fehrentz et.al. | 2409.11983 | null |
2024-09-16 | DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments | Mahmud A. Mohamad et.al. | 2409.10041 | link |
2025-03-17 | SAFER-Splat: A Control Barrier Function for Safe Navigation with Online Gaussian Splatting Maps | Timothy Chen et.al. | 2409.09868 | null |
2024-09-15 | NARF24: Estimating Articulated Object Structure for Implicit Rendering | Stanley Lewis et.al. | 2409.09829 | null |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2025-04-05 | Expansive Supervision for Neural Radiance Field | Weixiang Zhang et.al. | 2409.08056 | null |
2025-04-22 | ThermalGaussian: Thermal 3D Gaussian Splatting | Rongfeng Lu et.al. | 2409.07200 | link |
2024-09-10 | LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation | Archana Swaminathan et.al. | 2409.06703 | null |
2024-09-10 | Sources of Uncertainty in 3D Scene Reconstruction | Marcus Klasson et.al. | 2409.06407 | link |
2024-09-09 | LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo | Wei Zhi Tang et.al. | 2409.06104 | link |
2024-09-09 | G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis | Lutao Jiang et.al. | 2409.05617 | null |
2024-09-09 | Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems | Jianheng Liu et.al. | 2409.05310 | null |
2024-09-06 | SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields | Yuze Wang et.al. | 2409.04482 | null |
2024-09-05 | Weight Conditioning for Smooth Optimization of Neural Networks | Hemanth Saratchandran et.al. | 2409.03424 | null |
2024-09-05 | Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction | Shen Chen et.al. | 2409.03213 | null |
2024-11-09 | UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo et.al. | 2409.02917 | link |
2024-09-25 | Learnable Wireless Digital Twins: Reconstructing Electromagnetic Field with Neural Representations | Shuaifeng Jiang et.al. | 2409.02564 | null |
2024-09-03 | $S^2$ NeRF: Privacy-preserving Training Framework for NeRF | Bokang Zhang et.al. | 2409.01661 | link |
2025-04-19 | OmniRe: Omni Urban Scene Reconstruction | Ziyu Chen et.al. | 2408.16760 | null |
2024-08-29 | NeRF-CA: Dynamic Reconstruction of X-ray Coronary Angiography with Extremely Sparse-views | Kirsten W. H. Maas et.al. | 2408.16355 | link |
2024-09-05 | G-Style: Stylized Gaussian Splatting | Áron Samuel Kovács et.al. | 2408.15695 | link |
2024-09-28 | GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning | Shubhendu Jena et.al. | 2408.14724 | null |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-23 | SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting | Jiseung Hong et.al. | 2408.13285 | link |
2024-10-19 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-21 | Irregularity Inspection using Neural Radiance Field | Tianqi Ding et.al. | 2408.11251 | null |
2024-08-20 | TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai et.al. | 2408.10739 | null |
2024-08-21 | NeRF-US: Removing Ultrasound Imaging Artifacts from Neural Radiance Fields in the Wild | Rishit Dagli et.al. | 2408.10258 | null |
2024-08-19 | $R^2$ -Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement | Haoyang Wang et.al. | 2408.10135 | null |
2024-09-06 | DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery | Corentin Dumery et.al. | 2408.09928 | null |
2024-08-18 | S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis | Dongze Li et.al. | 2408.09347 | null |
2024-08-17 | SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation | Xiao Cao et.al. | 2408.09144 | null |
2024-08-16 | VF-NeRF: Learning Neural Vector Fields for Indoor Scene Reconstruction | Albert Gassol Puigjaner et.al. | 2408.08766 | link |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-11-03 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | null |
2024-08-12 | 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) | Jaydeep Rade et.al. | 2408.06244 | null |
2024-09-26 | FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Lukas Meyer et.al. | 2408.06190 | link |
2024-08-10 | Radiance Field Learners As UAV First-Person Viewers | Liqi Yan et.al. | 2408.05533 | null |
2024-08-20 | Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE | Yiying Yang et.al. | 2408.05477 | null |
2024-10-09 | FlowDreamer: Exploring High Fidelity Text-to-3D Generation via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-09 | FewShotNeRF: Meta-Learning-based Novel View Synthesis for Rapid Scene-Specific Adaptation | Piraveen Sivakumar et.al. | 2408.04803 | null |
2024-08-08 | Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond | Ravi Ramamoorthi et.al. | 2408.04586 | null |
2024-11-14 | Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods | Yiming Zhou et.al. | 2408.04268 | null |
2024-08-07 | Goal-oriented Semantic Communication for the Metaverse Application | Zhe Wang et.al. | 2408.03646 | null |
2025-02-04 | RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis | Hugo Blanc et.al. | 2408.03356 | null |
2024-08-06 | Efficient NeRF Optimization – Not All Samples Remain Equally Hard | Juuso Korhonen et.al. | 2408.03193 | null |
2024-08-03 | FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields | Yifan Wu et.al. | 2408.01878 | null |
2024-08-03 | E $^3$ NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images | Yunshan Qi et.al. | 2408.01840 | null |
2024-10-03 | NeRFoot: Robot-Footprint Estimation for Image-Based Visual Servoing | Daoxin Zhong et.al. | 2408.01251 | null |
2024-09-13 | UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization | Ziwen Guo et.al. | 2408.00860 | null |
2024-07-31 | StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | Kaiyuan Tang et.al. | 2408.00150 | null |
2024-07-22 | PAV: Personalized Head Avatar from Unstructured Video Collection | Akin Caliskan et.al. | 2407.21047 | null |
2024-07-30 | A Comparative Study of Neural Surface Reconstruction for Scientific Visualization | Siyuan Yao et.al. | 2407.20868 | null |
2024-07-29 | Radiance Fields for Robotic Teleoperation | Maximum Wilder-Smith et.al. | 2407.20194 | link |
2024-07-29 | Garment Animation NeRF with Color Editing | Renke Wang et.al. | 2407.19774 | link |
2024-07-28 | FINER++: Building a Family of Variable-periodic Functions for Activating Implicit Neural Representation | Hao Zhu et.al. | 2407.19434 | null |
2025-02-17 | IOVS4NeRF:Incremental Optimal View Selection for Large-Scale NeRFs | Jingpeng Xie et.al. | 2407.18611 | null |
2024-07-22 | BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes | Chih-Hai Su et.al. | 2407.15848 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Semantic Communications for 3D Human Face Transmission with Neural Radiance Fields | Guanlin Wu et.al. | 2407.13992 | null |
2024-09-05 | EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting | Yuchen Weng et.al. | 2407.13520 | null |
2024-07-18 | GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields | Xiufeng Huang et.al. | 2407.13390 | null |
2024-07-18 | KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter | Yifan Zhan et.al. | 2407.13185 | null |
2024-07-17 | SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization | Yiyang Chen et.al. | 2407.12667 | link |
2024-07-17 | InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction | Xulong Wang et.al. | 2407.12661 | link |
2024-07-17 | Efficient Depth-Guided Urban View Synthesis | Sheng Miao et.al. | 2407.12395 | null |
2024-07-17 | Invertible Neural Warp for NeRF | Shin-Fang Chng et.al. | 2407.12354 | null |
2024-09-29 | Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections | Congrong Xu et.al. | 2407.12306 | null |
2024-07-18 | Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling | Jaehyeok Kim et.al. | 2407.11962 | null |
2024-07-18 | IPA-NeRF: Illusory Poisoning Attack Against Neural Radiance Fields | Wenxiang Jiang et.al. | 2407.11921 | link |
2025-02-11 | DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation | Jiwook Kim et.al. | 2407.11394 | link |
2024-07-16 | I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM | Gwangtak Bae et.al. | 2407.11347 | null |
2024-07-16 | Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering | Jingqian Wu et.al. | 2407.11343 | null |
2024-07-25 | Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method | Adam Korycki et.al. | 2407.11238 | null |
2024-07-15 | AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems | Alexey Kotcov et.al. | 2407.10865 | null |
2024-07-15 | Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis | Antoine Legrand et.al. | 2407.10762 | null |
2024-07-15 | Interactive Rendering of Relightable and Animatable Gaussian Avatars | Youyi Zhan et.al. | 2407.10707 | null |
2024-12-08 | IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild | Shuaixian Wang et.al. | 2407.10695 | null |
2024-07-14 | RS-NeRF: Neural Radiance Fields from Rolling Shutter Images | Muyao Niu et.al. | 2407.10267 | link |
2024-07-14 | SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion | Jiyuan Zhang et.al. | 2407.10062 | null |
2024-12-03 | Radiance Fields from Photons | Sacha Jungerman et.al. | 2407.09386 | null |
2024-08-03 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-11 | Feasibility of Neural Radiance Fields for Crime Scene Video Reconstruction | Shariq Nadeem Malik et.al. | 2407.08795 | null |
2024-07-11 | MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos | Yushuo Chen et.al. | 2407.08414 | link |
2024-09-20 | Explicit-NeRF-QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-11 | Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields | Haojie Lian et.al. | 2407.08154 | null |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2025-03-14 | Neural Geometry Processing via Spherical Neural Surfaces | Romy Williamson et.al. | 2407.07755 | null |
2024-07-10 | Protecting NeRFs’ Copyright via Plug-And-Play Watermarking Base Model | Qi Song et.al. | 2407.07735 | null |
2024-07-10 | Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field | Ganlin Yang et.al. | 2407.07461 | null |
2024-07-09 | Reference-based Controllable Scene Stylization with Gaussian Splatting | Yiqun Mei et.al. | 2407.07220 | null |
2024-07-09 | Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View | Dogyoon Lee et.al. | 2407.06613 | null |
2024-07-08 | Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views | Jiawei Guo et.al. | 2407.05666 | null |
2024-07-08 | GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields | Weiyi Xue et.al. | 2407.05597 | null |
2024-07-31 | Dynamic Neural Radiance Field From Defocused Monocular Video | Xianrui Luo et.al. | 2407.05586 | null |
2024-07-07 | GaussReg: Fast 3D Registration with Gaussian Splatting | Jiahao Chang et.al. | 2407.05254 | null |
2024-07-06 | SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction | Weixing Xie et.al. | 2407.05023 | link |
2024-07-04 | CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images | Junghe Lee et.al. | 2407.03923 | null |
2024-09-11 | BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Wenpu Li et.al. | 2407.02174 | link |
2024-07-01 | fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence | Francis Williams et.al. | 2407.01781 | null |
2024-07-01 | The Continuous Tensor Abstraction: Where Indices are Real | Jaeyeon Won et.al. | 2407.01742 | null |
2024-07-01 | RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields | Haochen Jiang et.al. | 2407.01303 | link |
2024-06-27 | Shorter SPECT Scans Using Self-supervised Coordinate Learning to Synthesize Skipped Projection Views | Zongyu Li et.al. | 2406.18840 | null |
2025-02-13 | Dream-in-Style: Text-to-3D Generation Using Stylized Score Distillation | Hubert Kompanowski et.al. | 2406.18581 | null |
2024-07-29 | Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning | Muhammad Salman Ali et.al. | 2406.18214 | link |
2024-06-25 | NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods | Jonas Kulhanek et.al. | 2406.17345 | null |
2024-06-24 | Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction | Tong Qin et.al. | 2406.16289 | null |
2024-06-23 | Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study | Zhe Wang et.al. | 2406.16068 | null |
2024-06-23 | LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control | Delin Qu et.al. | 2406.16038 | null |
2024-06-22 | psPRF:Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery | Tongtong Zhang et.al. | 2406.15707 | null |
2024-06-21 | A3D: Does Diffusion Dream about 3D Alignment? | Savva Ignatyev et.al. | 2406.15020 | null |
2024-06-21 | E2GS: Event Enhanced Gaussian Splatting | Hiroyuki Deguchi et.al. | 2406.14978 | link |
2024-06-21 | Relighting Scenes with Object Insertions in Neural Radiance Fields | Xuening Zhu et.al. | 2406.14806 | null |
2024-08-01 | Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment | Yunshan Qi et.al. | 2406.14360 | null |
2024-06-19 | Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields | Youngin Park et.al. | 2406.13251 | link |
2024-06-18 | Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings | Ruijie Tang et.al. | 2406.13048 | null |
2025-03-14 | Fast Global Localization on Neural Radiance Field | Mangyu Kong et.al. | 2406.12202 | link |
2024-10-31 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | Uncertainty modeling for fine-tuned implicit functions | Anna Susmelj et.al. | 2406.12082 | null |
2024-11-22 | LLaNA: Large Language and NeRF Assistant | Andrea Amaduzzi et.al. | 2406.11840 | null |
2024-06-17 | InterNeRF: Scaling Radiance Fields via Parameter Interpolation | Clinton Wang et.al. | 2406.11737 | null |
2024-06-16 | Learning Relighting and Intrinsic Decomposition in Neural Radiance Fields | Yixiong Yang et.al. | 2406.11077 | null |
2024-06-15 | fNeRF: High Quality Radiance Fields from Practical Cameras | Yi Hua et.al. | 2406.10633 | null |
2024-06-15 | Federated Neural Radiance Field for Distributed Intelligence | Yintian Zhang et.al. | 2406.10474 | null |
2024-06-14 | Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections | Jiacong Xu et.al. | 2406.10373 | null |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-13 | OpenMaterial: A Comprehensive Dataset of Complex Materials for 3D Reconstruction | Zheng Dang et.al. | 2406.08894 | null |
2024-06-12 | OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Yinan Deng et.al. | 2406.08009 | link |
2024-12-13 | Spatial Annealing for Efficient Few-shot Neural Rendering | Yuru Xiao et.al. | 2406.07828 | link |
2024-10-10 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431 | null |
2024-06-11 | Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion | Xin Yuan et.al. | 2406.06972 | null |
2024-06-15 | Neural Visibility Field for Uncertainty-Driven Active Mapping | Shangjie Xue et.al. | 2406.06948 | null |
2024-11-01 | IllumiNeRF: 3D Relighting Without Inverse Rendering | Xiaoming Zhao et.al. | 2406.06527 | null |
2024-06-10 | ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models | Meng-Li Shih et.al. | 2406.06133 | null |
2024-06-13 | GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement | Peiye Zhuang et.al. | 2406.05649 | null |
2024-06-07 | Multiplane Prior Guided Few-Shot Aerial Scene Rendering | Zihan Gao et.al. | 2406.04961 | null |
2024-06-07 | Multi-style Neural Radiance Field with AdaIN | Yu-Wen Pao et.al. | 2406.04960 | link |
2024-06-07 | DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data | Qihao Liu et.al. | 2406.04322 | link |
2024-06-14 | GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions | Salvatore Esposito et.al. | 2406.04254 | null |
2024-06-06 | A Survey on 3D Human Avatar Modeling – From Reconstruction to Generation | Ruihe Wang et.al. | 2406.04253 | null |
2024-06-06 | Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization | Takuhiro Kaneko et.al. | 2406.04155 | null |
2024-06-06 | How Far Can We Compress Instant-NGP-Based NeRF? | Yihang Chen et.al. | 2406.04101 | link |
2024-06-06 | Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling | Xinhang Liu et.al. | 2406.03723 | null |
2024-06-13 | 3D-HGS: 3D Half-Gaussian Splatting | Haolin Li et.al. | 2406.02720 | null |
2024-07-11 | Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting | Fang Li et.al. | 2406.01042 | link |
2024-06-02 | PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency | Yeonsung Jung et.al. | 2406.00798 | null |
2024-06-02 | Representing Animatable Avatar via Factorized Neural Fields | Chunjin Song et.al. | 2406.00637 | null |
2024-06-02 | Efficient Neural Light Fields (ENeLF) for Mobile Devices | Austin Peng et.al. | 2406.00598 | null |
2024-08-06 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-05-30 | $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323 | link |
2024-09-27 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-06-10 | IReNe: Instant Recoloring of Neural Radiance Fields | Alessio Mazzucchelli et.al. | 2405.19876 | null |
2024-07-18 | View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields | Haodi He et.al. | 2405.19678 | link |
2024-05-29 | Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy | Zijie Jiang et.al. | 2405.18863 | null |
2024-06-02 | NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild | Weining Ren et.al. | 2405.18715 | link |
2024-10-11 | Learning Shared RGB-D Fields: Unified Self-supervised Pre-training for Label-efficient LiDAR-Camera 3D Perception | Xiaohao Xu et.al. | 2405.17942 | link |
2024-05-28 | A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction | Bin Zhang et.al. | 2405.17891 | null |
2024-09-10 | HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction | Haoyu Zhao et.al. | 2405.17872 | link |
2024-05-28 | Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh | Xiangjun Gao et.al. | 2405.17811 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-29 | PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting | Zipeng Wang et.al. | 2405.16829 | null |
2024-05-24 | Neural Elevation Models for Terrain Mapping and Path Planning | Adam Dai et.al. | 2405.15227 | link |
2024-05-23 | NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections | Dor Verbin et.al. | 2405.14871 | null |
2024-05-23 | Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling | Liwen Wu et.al. | 2405.14847 | null |
2024-05-23 | Camera Relocalization in Shadow-free Neural Radiance Fields | Shiyao Xu et.al. | 2405.14824 | link |
2024-06-08 | JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression | Zihan Zheng et.al. | 2405.14452 | null |
2024-06-11 | Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations | Antoine Legrand et.al. | 2405.12728 | null |
2024-06-18 | Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents | Guanlin Wu et.al. | 2405.12155 | null |
2024-11-06 | R-NeRF: Neural Radiance Fields for Modeling RIS-enabled Wireless Environments | Huiying Yang et.al. | 2405.11541 | link |
2024-06-01 | Enhanced 3D Urban Scene Reconstruction and Point Cloud Densification using Gaussian Splatting and Google Earth Imagery | Kyle Gao et.al. | 2405.11021 | null |
2024-05-16 | When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models | Xianzheng Ma et.al. | 2405.10255 | link |
2024-08-13 | From NeRFs to Gaussian Splats, and Back | Siming He et.al. | 2405.09717 | link |
2024-05-14 | Dynamic NeRF: A Review | Jinwei Lin et.al. | 2405.08609 | null |
2024-06-05 | Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs | Mingyu Kim et.al. | 2405.07857 | link |
2024-10-07 | TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization | Zhen Tan et.al. | 2405.07027 | link |
2024-09-26 | Direct Learning of Mesh and Appearance via 3D Gaussian Splatting | Ancheng Lin et.al. | 2405.06945 | null |
2024-05-10 | OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | Jinwei Lin et.al. | 2405.06547 | link |
2024-05-10 | Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering | Xiaohan Zhang et.al. | 2405.06214 | null |
2024-05-10 | Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation | Bardienus P. Duisterhof et.al. | 2405.06181 | null |
2024-05-10 | NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior | Gihoon Kim et.al. | 2405.05749 | null |
2024-06-28 | NGM-SLAM: Gaussian Splatting SLAM with Radiance Field Submap | Mingrui Li et.al. | 2405.05702 | null |
2024-12-06 | Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview | Yuhang Ming et.al. | 2405.05526 | null |
2024-05-07 | Tactile-Augmented Radiance Fields | Yiming Dou et.al. | 2405.04534 | link |
2024-05-08 | DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid | Sidun Liu et.al. | 2405.04416 | null |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-06-10 | A Construct-Optimize Approach to Sparse View Synthesis without Camera Pose | Kaiwen Jiang et.al. | 2405.03659 | null |
2024-05-05 | Blending Distributed NeRFs with Tri-stage Robust Pose Optimization | Baijun Ye et.al. | 2405.02880 | null |
2024-09-18 | TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes | Christopher Maxey et.al. | 2405.02762 | null |
2024-06-10 | Active Neural 3D Reconstruction with Colorized Surface Voxel-based View Selection | Hyunseo Kim et.al. | 2405.02568 | null |
2024-05-03 | Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning | Dhruva Tirumala et.al. | 2405.02425 | null |
2024-05-03 | Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids | Junchen Liu et.al. | 2405.02386 | link |
2024-07-12 | WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights | Youngdong Jang et.al. | 2405.02066 | null |
2024-05-02 | Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy | Hoang-Quan Nguyen et.al. | 2405.01337 | null |
2024-05-02 | NeRF in Robotics: A Survey | Guangming Wang et.al. | 2405.01333 | null |
2024-05-04 | LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun et.al. | 2405.00900 | null |
2024-07-03 | Depth Priors in Removal Neural Radiance Fields | Zhihao Guo et.al. | 2405.00630 | null |
2024-06-20 | NeRF-Guided Unsupervised Learning of RGB-D Registration | Zhinan Yu et.al. | 2405.00507 | null |
2024-10-18 | MicroDreamer: Efficient 3D Generation in $\sim$ 20 Seconds by Score-based Iterative Reconstruction | Luxi Chen et.al. | 2404.19525 | link |
2024-04-29 | Embedded Representation Learning Network for Animating Styled Video Portrait | Tianyong Wang et.al. | 2404.19038 | null |
2024-05-27 | Simple-RF: Regularizing Sparse Input Radiance Fields with Simpler Solutions | Nagabhushan Somraj et.al. | 2404.19015 | null |
2024-07-22 | DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing | Minghao Chen et.al. | 2404.18929 | null |
2024-04-28 | S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM | Zhiyao Zhang et.al. | 2404.18284 | null |
2024-04-26 | Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields | Tianqi Liu et.al. | 2404.17528 | link |
2024-04-25 | Depth Supervised Neural Surface Reconstruction from Airborne Imagery | Vincent Hackstein et.al. | 2404.16429 | null |
2024-04-24 | NeRF-XL: Scaling NeRFs with Multiple GPUs | Ruilong Li et.al. | 2404.16221 | null |
2024-04-23 | DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft | Sam Earle et.al. | 2404.15538 | null |
2024-04-10 | Efficient EndoNeRF Reconstruction and Its Application for Data-driven Surgical Simulation | Yuehao Wang et.al. | 2404.15339 | null |
2024-08-09 | GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Hongyun Yu et.al. | 2404.14037 | null |
2024-04-23 | CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory | Yunlong Ran et.al. | 2404.13896 | null |
2024-04-26 | Neural Radiance Field in Autonomous Driving: A Survey | Lei He et.al. | 2404.13816 | null |
2024-04-26 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711 | link |
2024-10-30 | High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | Baoru Huang et.al. | 2404.13437 | null |
2024-10-18 | EC-SLAM: Effectively Constrained Neural RGB-D SLAM with Sparse TSDF Encoding and Global Bundle Adjustment | Guanghao Li et.al. | 2404.13346 | link |
2024-04-19 | FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction | Maria Dronova et.al. | 2404.12970 | null |
2024-05-23 | Evaluating Alternatives to SFM Point Cloud Initialization for Gaussian Splatting | Yalda Foroutan et.al. | 2404.12547 | null |
2024-04-18 | AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering | Jingfeng Guo et.al. | 2404.11897 | link |
2024-04-18 | Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations | Yu Feng et.al. | 2404.11852 | null |
2024-04-17 | SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping | Vincent Cartillier et.al. | 2404.11419 | null |
2024-04-17 | RainyScape: Unsupervised Rainy Scene Reconstruction using Decoupled Neural Rendering | Xianqiang Lyu et.al. | 2404.11401 | null |
2024-04-17 | REACTO: Reconstructing Articulated Objects from a Single Video | Chaoyue Song et.al. | 2404.11151 | null |
2024-04-16 | RapidVol: Rapid Reconstruction of 3D Ultrasound Volumes from Sensorless 2D Scans | Mark C. Eid et.al. | 2404.10766 | null |
2024-06-17 | Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2404.10625 | null |
2024-04-16 | Plug-and-Play Acceleration of Occupancy Grid-based NeRF Rendering using VDB Grid and Hierarchical Ray Traversal | Yoshio Kato et.al. | 2404.10272 | link |
2024-04-15 | Taming Latent Diffusion Model for Neural Radiance Field Inpainting | Chieh Hubert Lin et.al. | 2404.09995 | null |
2024-04-15 | Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video | Hongchi Xia et.al. | 2404.09833 | null |
2024-04-15 | ViFu: Multiple 360 $^\circ$ Objects Reconstruction with Clean Background via Visible Part Fusion | Tianhan Xu et.al. | 2404.09426 | null |
2024-05-06 | DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading | Tong Wu et.al. | 2404.09412 | null |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-08-22 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
2024-04-11 | Connecting NeRFs, Images, and Text | Francesco Ballerini et.al. | 2404.07993 | link |
2024-08-06 | Reinforcement Learning with Generalizable Gaussian Splatting | Jiaxu Wang et.al. | 2404.07950 | null |
2024-04-11 | Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation | Keonhee Han et.al. | 2404.07933 | null |
2024-04-10 | SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection | Mathis Kruse et.al. | 2404.06832 | link |
2024-04-10 | MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views | Runfa Li et.al. | 2404.06753 | null |
2024-04-10 | Bayesian NeRF: Quantifying Uncertainty with Volume Density in Neural Radiance Fields | Sibeak Lee et.al. | 2404.06727 | link |
2024-04-12 | SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera | Gaole Dai et.al. | 2404.06710 | null |
2024-04-14 | 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis | Zhicheng Lu et.al. | 2404.06270 | null |
2024-04-09 | GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields | Arnab Dey et.al. | 2404.06246 | null |
2024-04-09 | HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields | Arnab Dey et.al. | 2404.06152 | null |
2024-04-08 | Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation | Y. Wang et.al. | 2404.05236 | null |
2024-09-25 | CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis | Gyeongjin Kang et.al. | 2404.04913 | null |
2024-04-13 | GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF | Butian Xiong et.al. | 2404.04880 | null |
2024-04-07 | NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization | Peng Tu et.al. | 2404.04875 | null |
2024-08-01 | DATENeRF: Depth-Aware Text-based Editing of NeRFs | Sara Rojas et.al. | 2404.04526 | null |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | VF-NeRF: Viewshed Fields for Rigid NeRF Registration | Leo Segre et.al. | 2404.03349 | null |
2024-04-03 | LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Zehan Zheng et.al. | 2404.02742 | link |
2024-04-03 | Neural Radiance Fields with Torch Units | Bingnan Ni et.al. | 2404.02617 | null |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-17 | Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields | Joshua Ahn et.al. | 2404.02155 | null |
2024-08-19 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400 | null |
2024-07-18 | NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields | Muhammad Zubair Irshad et.al. | 2404.01300 | link |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-01 | Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting | Jiarui Meng et.al. | 2404.01168 | null |
2024-06-17 | SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance | Yuru Xiao et.al. | 2404.00992 | null |
2024-04-01 | MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements | Lisong C. Sun et.al. | 2404.00923 | null |
2024-08-07 | DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly | Fenggen Yu et.al. | 2404.00875 | null |
2024-09-26 | An Active Perception Game for Robust Information Gathering | Siming He et.al. | 2404.00769 | null |
2024-03-31 | Neural Radiance Field-based Visual Rendering: A Comprehensive Review | Mingyuan Yao et.al. | 2404.00714 | null |
2024-03-30 | MaGRITTe: Manipulative and Generative 3D Realization from Image, Topview and Text | Takayuki Hara et.al. | 2404.00345 | null |
2024-03-29 | Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior | Jaehoon Ko et.al. | 2403.20153 | link |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising | Tianchen Deng et.al. | 2403.20034 | link |
2024-03-29 | SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image | Yunhao Li et.al. | 2403.20018 | link |
2024-03-29 | DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal | Yunhao Li et.al. | 2403.20013 | link |
2024-04-03 | MI-NeRF: Learning a Single Face NeRF from Multiple Identities | Aggelina Chatziagapi et.al. | 2403.19920 | null |
2024-06-03 | Mitigating Motion Blur in Neural Radiance Fields with Events and Frames | Marco Cannici et.al. | 2403.19780 | link |
2024-04-05 | GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling | Bowen Zhang et.al. | 2403.19655 | null |
2024-03-28 | SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects | Avinash Ummadisingu et.al. | 2403.19607 | null |
2024-03-28 | CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians | Avinash Paliwal et.al. | 2403.19495 | link |
2024-09-05 | Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation | Yujin Chen et.al. | 2403.19319 | null |
2024-03-28 | Sine Activated Low-Rank Matrices for Parameter Efficient Learning | Yiping Ji et.al. | 2403.19243 | null |
2024-03-27 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D | Mukund Varma T et.al. | 2403.18922 | null |
2024-05-24 | Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | Qiuhong Shen et.al. | 2403.18795 | link |
2024-03-27 | SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery | Camille Billouard et.al. | 2403.18711 | link |
2024-03-27 | Modeling uncertainty for Gaussian Splatting | Luca Savant et.al. | 2403.18476 | null |
2024-03-26 | Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Kai Yuan et.al. | 2403.17607 | link |
2024-03-26 | NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation | Jiahao Chen et.al. | 2403.17537 | null |
2024-03-25 | CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs | Yingji Zhong et.al. | 2403.16885 | null |
2024-03-25 | Spike-NeRF: Neural Radiance Field Based On Spike Camera | Yijia Guo et.al. | 2403.16410 | null |
2024-03-24 | Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields | Haoyuan Wang et.al. | 2403.16224 | null |
2024-03-24 | Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes | Takashi Otonari et.al. | 2403.16141 | null |
2024-03-24 | CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Jiarui Hu et.al. | 2403.16095 | null |
2024-04-15 | Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap | Carl Lindström et.al. | 2403.16092 | null |
2024-04-02 | PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling | Xiaoyun Zheng et.al. | 2403.16080 | link |
2024-03-24 | Semantic Is Enough: Only Semantic Information For NeRF Reconstruction | Ruibo Wang et.al. | 2403.16043 | null |
2024-03-28 | Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields | Junhong Zhao et.al. | 2403.15981 | null |
2024-05-30 | DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation | Mu-Yi Shen et.al. | 2403.15791 | link |
2024-07-14 | Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections | Dongbin Zhang et.al. | 2403.15704 | null |
2024-08-23 | Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting | Jun Guo et.al. | 2403.15624 | null |
2024-03-21 | Hyperspectral Neural Radiance Fields | Gerry Chen et.al. | 2403.14839 | null |
2024-03-21 | CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis | Matteo Bonotto et.al. | 2403.14412 | link |
2024-09-16 | InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity | Jiabin Liang et.al. | 2403.14376 | null |
2024-03-21 | Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions | Jiacong Xu et.al. | 2403.14053 | link |
2024-09-15 | MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination | Weiying Wang et.al. | 2403.13348 | null |
2024-03-20 | Learning Novel View Synthesis from Heterogeneous Low-light Captures | Quan Zheng et.al. | 2403.13337 | null |
2024-09-04 | Depth-guided NeRF Training via Earth Mover’s Distance | Anita Rau et.al. | 2403.13206 | null |
2024-03-28 | DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images | Zaid Tasneem et.al. | 2403.13199 | null |
2024-09-13 | Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering | Mingqi Shao et.al. | 2403.12839 | null |
2024-03-19 | IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model | Matteo Bortolon et.al. | 2403.12682 | null |
2024-03-18 | FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos | Florian Philipp Stilz et.al. | 2403.12198 | null |
2024-03-18 | ThermoNeRF: Multimodal Neural Radiance Fields for Thermal Novel View Synthesis | Mariam Hassan et.al. | 2403.12154 | link |
2024-03-18 | GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors | LI Yang et.al. | 2403.11899 | null |
2024-08-23 | Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging | Mert Özer et.al. | 2403.11865 | null |
2024-03-19 | BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting | Lingzhe Zhao et.al. | 2403.11831 | link |
2024-03-18 | Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Yuqi Zhang et.al. | 2403.11812 | link |
2024-08-09 | OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation | Haochen Jiang et.al. | 2403.11796 | null |
2024-03-17 | Creating Seamless 3D Maps Using Radiance Fields | Sai Tarun Sathyan et.al. | 2403.11364 | null |
2024-03-17 | SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream | Lin Zhu et.al. | 2403.11222 | link |
2024-04-13 | Recent Advances in 3D Gaussian Splatting | Tong Wu et.al. | 2403.11134 | null |
2024-07-18 | Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields | Yonggan Fu et.al. | 2403.11131 | link |
2024-03-16 | Fast Sparse View Guided NeRF Update for Object Reconfigurations | Ziqi Lu et.al. | 2403.11024 | null |
2024-03-16 | HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering | Seunghyeon Seo et.al. | 2403.10906 | null |
2024-07-19 | MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field | Dongyu Yan et.al. | 2403.10840 | link |
2024-03-16 | DPPE: Dense Pose Estimation in a Plenoxels Environment using Gradient Approximation | Christopher Kolios et.al. | 2403.10773 | null |
2024-03-15 | Thermal-NeRF: Neural Radiance Fields from an Infrared Camera | Tianxiang Ye et.al. | 2403.10340 | link |
2024-03-20 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297 | link |
2024-03-15 | SemanticHuman-HD: High-Resolution Semantic Disentangled 3D Human Generation | Peng Zheng et.al. | 2403.10166 | null |
2024-03-25 | URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields | Bo Xu et.al. | 2403.10119 | null |
2024-03-19 | DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video | Huiqiang Sun et.al. | 2403.10103 | null |
2024-08-21 | The NeRFect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou et.al. | 2403.09577 | null |
2024-08-14 | VIRUS-NeRF – Vision, InfraRed and UltraSonic based Neural Radiance Fields | Nicolaj Schmid et.al. | 2403.09477 | link |
2024-07-15 | PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Tianyuan Yuan et.al. | 2403.09079 | link |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | link |
2024-07-30 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-12 | Q-SLAM: Quadric Representations for Monocular SLAM | Chensheng Peng et.al. | 2403.08125 | null |
2024-03-12 | SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields | Jungho Lee et.al. | 2403.07547 | link |
2024-03-11 | SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection | Yifu Tao et.al. | 2403.06877 | null |
2024-03-11 | Vosh: Voxel-Mesh Hybrid Representation for Real-Time View Synthesis | Chenhao Zhang et.al. | 2403.06505 | null |
2024-03-22 | S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes | Xingyi Li et.al. | 2403.06205 | null |
2024-03-10 | Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis? | Hanxin Zhu et.al. | 2403.06092 | null |
2024-03-09 | Large Generative Model Assisted 3D Semantic Communication | Feibo Jiang et.al. | 2403.05783 | null |
2024-03-24 | BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng et.al. | 2403.04926 | link |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508 | null |
2024-03-06 | DART: Implicit Doppler Tomography for Radar Novel View Synthesis | Tianshu Huang et.al. | 2403.03896 | null |
2024-03-06 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Zi-Ting Chou et.al. | 2403.03608 | null |
2024-06-14 | NeWRF: A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction | Haofan Lu et.al. | 2403.03241 | null |
2024-04-26 | Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps | Timothy Chen et.al. | 2403.02751 | link |
2024-03-04 | DaReNeRF: Direction-aware Representation for Dynamic Scenes | Ange Lou et.al. | 2403.02265 | null |
2024-03-02 | NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning | Linsheng Chen et.al. | 2403.01325 | link |
2024-05-09 | Neural radiance fields-based holography [Invited] | Minsung Kang et.al. | 2403.01137 | null |
2024-03-02 | Neural Field Classifiers via Target Encoding and Classification Loss | Xindi Yang et.al. | 2403.01058 | null |
2024-04-24 | NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Jingrui Yu et.al. | 2402.18196 | link |
2024-03-21 | Neural Radiance Fields in Medical Imaging: Challenges and Next Steps | Xin Wang et.al. | 2402.17797 | null |
2024-02-27 | Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis | Zicheng Zhang et.al. | 2402.17364 | link |
2024-02-27 | CharNeRF: 3D Character Generation from Concept Art | Eddy Chu et.al. | 2402.17115 | null |
2024-02-26 | Resolution-Agnostic Neural Compression for High-Fidelity Portrait Video Conferencing via Implicit Radiance Fields | Yifei Li et.al. | 2402.16599 | null |
2024-02-26 | CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency | Hanxin Zhu et.al. | 2402.16407 | null |
2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Zetian Song et.al. | 2402.16366 | null |
2024-07-30 | GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction | Xiao Chen et.al. | 2402.16174 | link |
2024-02-22 | Consolidating Attention Features for Multi-view Image Editing | Or Patashnik et.al. | 2402.14792 | null |
2024-02-22 | Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields | Seungtae Nam et.al. | 2402.14196 | null |
2024-02-21 | Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Joongho Jo et.al. | 2402.13827 | null |
2024-02-21 | SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields | Zhentao Huang et.al. | 2402.13510 | null |
2024-04-11 | How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey | Fabio Tosi et.al. | 2402.13255 | link |
2024-03-02 | NeRF Solves Undersampled MRI Reconstruction | Tae Jun Jang et.al. | 2402.13226 | null |
2024-02-20 | OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow | Simon Boeder et.al. | 2402.12792 | null |
2024-02-19 | Colorizing Monochromatic Radiance Fields | Yean Cheng et.al. | 2402.12184 | null |
2024-02-19 | One2Avatar: Generative Implicit Head Avatar For Few-shot User Adaptation | Zhixuan Yu et.al. | 2402.11909 | null |
2024-02-17 | Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review | Thang-Anh-Quan Nguyen et.al. | 2402.11141 | link |
2024-02-15 | Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions | Muhammad Arbab Arshad et.al. | 2402.10344 | null |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325 | link |
2024-02-13 | Preconditioners for the Stochastic Training of Implicit Neural Representations | Shin-Fang Chng et.al. | 2402.08784 | null |
2024-02-13 | NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs | Michael Fischer et.al. | 2402.08622 | null |
2024-03-08 | H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields | Minyoung Park et.al. | 2402.08138 | null |
2024-02-12 | DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation | Chenchang Li et.al. | 2402.07648 | null |
2024-03-25 | BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis | Leandro A. Passos et.al. | 2402.07310 | link |
2024-07-10 | 3D Gaussian as a New Era: A Survey | Ben Fei et.al. | 2402.07181 | null |
2024-02-09 | ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting | Georgii Stanishevskii et.al. | 2402.06390 | link |
2024-04-09 | SIR: Multi-view Inverse Rendering with Decomposable Shadow for Indoor Scenes | Xiaokang Wei et.al. | 2402.06136 | null |
2024-06-26 | Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Yuxi Wei et.al. | 2402.05746 | link |
2024-02-09 | NCRF: Neural Contact Radiance Fields for Free-Viewpoint Rendering of Hand-Object Interaction | Zhongqun Zhang et.al. | 2402.05532 | null |
2024-02-07 | Mesh-based Gaussian Splatting for Real-time Large-scale Deformation | Lin Gao et.al. | 2402.04796 | null |
2024-02-07 | OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding | Guibiao Liao et.al. | 2402.04648 | link |
2024-02-07 | GSN: Generalisable Segmentation in Neural Radiance Field | Vinayak Gupta et.al. | 2402.04632 | link |
2024-02-11 | BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery | Huiqing Zhang et.al. | 2402.04554 | null |
2024-02-20 | Denoising Diffusion via Image-Based Rendering | Titas Anciukevičius et.al. | 2402.03445 | null |
2024-02-05 | ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis | Bernard Spiegl et.al. | 2402.02906 | link |
2024-02-03 | S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation | Yurui Chen et.al. | 2402.02112 | null |
2024-06-11 | Robust Inverse Graphics via Probabilistic Inference | Tuan Anh Le et.al. | 2402.01915 | link |
2024-02-02 | HyperPlanes: Hypernetwork Approach to Rapid NeRF Adaptation | Paweł Batorski et.al. | 2402.01524 | link |
2024-02-02 | Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses | Mahboubeh Asadi et.al. | 2402.01485 | null |
2024-02-15 | GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting | Joanna Waczyńska et.al. | 2402.01459 | link |
2024-05-19 | ID-NeRF: Indirect Diffusion-guided Neural Radiance Fields for Generalizable View Synthesis | Yaokun Li et.al. | 2402.01217 | null |
2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Jiahua Dong et.al. | 2402.00864 | link |
2024-01-31 | CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting | Jiezhi Yang et.al. | 2401.18075 | null |
2024-01-31 | ReplaceAnything3D:Text-Guided 3D Scene Editing with Compositional Neural Radiance Fields | Edward Bartrum et.al. | 2401.17895 | null |
2024-05-17 | SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition | Xu Hu et.al. | 2401.17857 | link |
2024-01-30 | Physical Priors Augmented Event-Based 3D Reconstruction | Jiaxu Wang et.al. | 2401.17121 | link |
2024-04-02 | Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting | Yiming Huang et.al. | 2401.16416 | link |
2024-01-29 | SuNeRF: 3D reconstruction of the solar EUV corona using Neural Radiance Fields | Robert Jarolim et.al. | 2401.16388 | null |
2024-01-29 | Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields | Rongkai Ma et.al. | 2401.16144 | null |
2024-01-27 | AniDress: Animatable Loose-Dressed Avatar from Sparse Views Using Garment Rigging Model | Beijia Chen et.al. | 2401.15348 | null |
2024-01-26 | Learning Neural Radiance Fields of Forest Structure for Scalable and Fine Monitoring | Juan Castorena et.al. | 2401.15029 | null |
2024-01-26 | 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field | Zhenyu Bao et.al. | 2401.14726 | link |
2024-01-25 | Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation | Jiaxu Wang et.al. | 2401.14354 | null |
2024-01-27 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Minglin Chen et.al. | 2401.14257 | null |
2024-01-23 | NeRF-AD: Neural Radiance Field with Attention-based Disentanglement for Talking Face Synthesis | Chongke Bi et.al. | 2401.12568 | null |
2024-03-20 | DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations | Dogyun Park et.al. | 2401.12517 | link |
2024-03-14 | Template-Free Single-View 3D Human Digitalization with Diffusion-Guided LRM | Zhenzhen Weng et.al. | 2401.12175 | null |
2024-01-22 | HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs | Zelin Gao et.al. | 2401.11711 | null |
2024-01-23 | IPR-NeRF: Ownership Verification meets Neural Radiance Field | Win Kent Ong et.al. | 2401.09495 | null |
2024-01-17 | ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization | Weiyao Wang et.al. | 2401.08937 | null |
2023-12-11 | Creating Visual Effects with Neural Radiance Fields | Cyrus Vachha et.al. | 2401.08633 | null |
2024-01-18 | ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Process | Kiyohiro Nakayama et.al. | 2401.08140 | null |
2024-01-11 | TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation | Rajaei Khatib et.al. | 2401.06191 | null |
2023-11-30 | Redefining Recon: Bridging Gaps with UAVs, 360 degree Cameras, and Neural Radiance Fields | Hartmut Surmann et.al. | 2401.06143 | null |
2024-01-11 | Fast High Dynamic Range Radiance Fields for Dynamic Scenes | Guanjun Wu et.al. | 2401.06052 | null |
2024-01-11 | GO-NeRF: Generating Virtual Objects in Neural Radiance Fields | Peng Dai et.al. | 2401.05750 | null |
2024-01-10 | Diffusion Priors for Dynamic View Synthesis from Monocular Videos | Chaoyang Wang et.al. | 2401.05583 | null |
2024-01-10 | FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields | GeonU Kim et.al. | 2401.05516 | null |
2024-01-10 | CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video | Xingyu Miao et.al. | 2401.04861 | link |
2024-04-14 | A Survey on 3D Gaussian Splatting | Guikun Chen et.al. | 2401.03890 | link |
2024-01-06 | RustNeRF: Robust Neural Radiance Field with Low-Quality Images | Mengfei Li et.al. | 2401.03257 | null |
2024-01-06 | Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping | Tongyan Hua et.al. | 2401.03203 | null |
2024-01-05 | Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human | Song Bai et.al. | 2401.02620 | null |
2024-03-27 | SIGNeRF: Scene Integrated Generation for Neural Radiance Fields | Jan-Niklas Dihlmann et.al. | 2401.01647 | null |
2024-01-02 | Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise | Qinglong Huang et.al. | 2401.01216 | null |
2024-01-02 | 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands | Xuan Huang et.al. | 2401.00979 | link |
2023-12-30 | PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields | Zheng Chen et.al. | 2401.00871 | null |
2024-05-27 | Deblurring 3D Gaussian Splatting | Byeonghyeon Lee et.al. | 2401.00834 | null |
2024-01-01 | Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior | Byeonghyeon Lee et.al. | 2401.00825 | link |
2024-03-29 | GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields | Xiao Pan et.al. | 2401.00616 | null |
2023-12-30 | Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models | Han Jiang et.al. | 2401.00208 | null |
2023-12-29 | Informative Rays Selection for Few-Shot Neural Radiance Fields | Marco Orsingher et.al. | 2312.17561 | null |
2024-04-01 | City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web | Kaiwen Song et.al. | 2312.16457 | link |
2023-12-29 | DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision | Lu Ling et.al. | 2312.16256 | null |
2023-12-24 | SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition | Nikhil Behari et.al. | 2312.16215 | null |
2023-12-23 | INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields | Andrew Hou et.al. | 2312.16197 | null |
2023-12-26 | 2D-Guided 3D Gaussian Segmentation | Kun Lan et.al. | 2312.16047 | null |
2024-02-23 | Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images | Zhan Lu et.al. | 2312.15942 | link |
2023-12-25 | Neural BSSRDF: Object Appearance Representation Including Heterogeneous Subsurface Scattering | Thomson TG et.al. | 2312.15711 | null |
2023-12-23 | Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane | Chen Yang et.al. | 2312.15253 | link |
2023-12-22 | Deformable 3D Gaussian Splatting for Animatable Human Avatars | HyunJun Jung et.al. | 2312.15059 | null |
2023-12-22 | PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF | Mohsen Gholami et.al. | 2312.14915 | link |
2023-12-22 | Density Uncertainty Quantification with NeRF-Ensembles: Impact of Data and Scene Constraints | Miriam Jäger et.al. | 2312.14664 | null |
2024-04-05 | PlatoNeRF: 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar | Tzofi Klinghoffer et.al. | 2312.14239 | null |
2023-12-21 | Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation | Philipp Schröppel et.al. | 2312.14124 | link |
2024-04-09 | Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning | Desai Xie et.al. | 2312.13980 | null |
2024-02-18 | Gaussian Splatting with NeRF-based Color and Opacity | Dawid Malarz et.al. | 2312.13729 | link |
2024-03-29 | DyBluRF: Dynamic Deblurring Neural Radiance Fields for Blurry Monocular Video | Minh-Quan Viet Bui et.al. | 2312.13528 | null |
2023-12-20 | NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields | Jens Naumann et.al. | 2312.13471 | null |
2023-12-20 | ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors | Weijia Mao et.al. | 2312.13324 | null |
2024-05-02 | Compact 3D Scene Representation via Self-Organizing Gaussian Grids | Wieland Morgenstern et.al. | 2312.13299 | link |
2023-12-20 | Deep Learning on 3D Neural Fields | Pierluigi Zama Ramirez et.al. | 2312.13277 | null |
2024-05-16 | SpecNeRF: Gaussian Directional Encoding for Specular Reflections | Li Ma et.al. | 2312.13102 | null |
2023-12-20 | Reducing Shape-Radiance Ambiguity in Radiance Fields with a Closed-Form Color Estimation Method | Qihang Fang et.al. | 2312.12726 | link |
2023-12-19 | ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields | Xiang Feng et.al. | 2312.12122 | null |
2024-01-22 | MixRT: Mixed Neural Representations For Real-Time NeRF Rendering | Chaojian Li et.al. | 2312.11841 | null |
2023-12-19 | Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation | Yuze He et.al. | 2312.11774 | null |
2023-12-20 | FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline | Chien-Yu Lin et.al. | 2312.11537 | null |
2023-12-18 | GauFRe: Gaussian Deformation Fields for Real-time Dynamic Novel View Synthesis | Yiqing Liang et.al. | 2312.11458 | null |
2023-12-18 | AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis | Dongze Li et.al. | 2312.10921 | null |
2023-12-17 | PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields | Boming Zhao et.al. | 2312.10649 | null |
2023-12-19 | Learning Dense Correspondence for NeRF-Based Face Reenactment | Songlin Yang et.al. | 2312.10422 | null |
2023-12-15 | SlimmeRF: Slimmable Radiance Fields | Shiran Yuan et.al. | 2312.10034 | link |
2024-03-25 | LAENeRF: Local Appearance Editing for Neural Radiance Fields | Lukas Radl et.al. | 2312.09913 | null |
2024-04-19 | RANRAC: Robust Neural Scene Representations via Random Ray Consensus | Benno Buschmann et.al. | 2312.09780 | null |
2023-12-15 | SLS4D: Sparse Latent Space for 4D Novel View Synthesis | Qi-Yuan Feng et.al. | 2312.09743 | null |
2023-12-14 | ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining | Ruoxi Shi et.al. | 2312.09249 | null |
2024-03-30 | OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments | Chubin Zhang et.al. | 2312.09243 | link |
2024-04-04 | 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting | Zhiyin Qian et.al. | 2312.09228 | null |
2023-12-15 | ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field | Zhangkai Ni et.al. | 2312.09095 | link |
2024-01-24 | Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption | Ziteng Cui et.al. | 2312.09093 | link |
2024-03-20 | iComMa: Inverting 3D Gaussian Splatting for Camera Pose Estimation via Comparing and Matching | Yuan Sun et.al. | 2312.09031 | null |
2023-12-14 | Scene 3-D Reconstruction System in Scattering Medium | Zhuoyifan Zhang et.al. | 2312.09005 | null |
2023-12-14 | VaLID: Variable-Length Input Diffusion for Novel View Synthesis | Shijie Li et.al. | 2312.08892 | null |
2023-12-14 | CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning | Qingsong Yan et.al. | 2312.08760 | null |
2023-12-14 | SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field | Ru Li et.al. | 2312.08692 | link |
2023-12-13 | ProNeRF: Learning Efficient Projection-Aware Ray Sampling for Fine-Grained Implicit Neural Radiance Fields | Juan Luis Gonzalez Bello et.al. | 2312.08136 | null |
2023-12-13 | Neural Radiance Fields for Transparent Object Using Visual Hull | Heechan Yoon et.al. | 2312.08118 | null |
2023-12-13 | 3DGEN: A GAN-based approach for generating novel 3D models from image data | Antoine Schnepf et.al. | 2312.08094 | null |
2023-12-12 | COLMAP-Free 3D Gaussian Splatting | Yang Fu et.al. | 2312.07504 | link |
2024-01-19 | WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction | Jingchun Zhou et.al. | 2312.06946 | null |
2023-12-10 | TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video | Minye Wu et.al. | 2312.06713 | null |
2023-12-11 | Learning Naturally Aggregated Appearance for Efficient 3D Editing | Ka Leong Cheng et.al. | 2312.06657 | link |
2023-12-11 | CorresNeRF: Image Correspondence Priors for Neural Radiance Fields | Yixing Lao et.al. | 2312.06642 | link |
2023-12-10 | Learning for CasADi: Data-driven Models in Numerical Optimization | Tim Salzmann et.al. | 2312.05873 | link |
2023-12-10 | NeVRF: Neural Video-based Radiance Fields for Long-duration Sequences | Minye Wu et.al. | 2312.05855 | null |
2023-12-10 | IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment | Letian Zhang et.al. | 2312.05748 | null |
2024-04-22 | CoGS: Controllable Gaussian Splatting | Heng Yu et.al. | 2312.05664 | null |
2023-12-08 | 360° Volumetric Portrait Avatar | Jalees Nehvi et.al. | 2312.05311 | null |
2023-12-11 | Nuvo: Neural UV Mapping for Unruly 3D Representations | Pratul P. Srinivasan et.al. | 2312.05283 | null |
2024-04-14 | SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation | Thuan Hoang Nguyen et.al. | 2312.05239 | link |
2023-12-08 | TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis | Heming Zhu et.al. | 2312.05161 | null |
2023-12-07 | NeuSD: Surface Completion with Multi-View Text-to-Image Diffusion | Savva Ignatyev et.al. | 2312.04654 | null |
2023-12-07 | VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment | Phong Tran et.al. | 2312.04651 | null |
2024-04-24 | EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS | Sharath Girish et.al. | 2312.04564 | link |
2023-12-07 | Multi-View Unsupervised Image Generation with Cross Attention Guidance | Llukman Cerkezi et.al. | 2312.04337 | null |
2023-12-07 | Towards 4D Human Video Stylization | Tiantian Wang et.al. | 2312.04143 | link |
2023-12-07 | Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction | Jiayi Kong et.al. | 2312.04106 | null |
2023-12-06 | Artist-Friendly Relightable and Animatable Neural Heads | Yingyan Xu et.al. | 2312.03420 | null |
2023-12-06 | Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method | Hongyu Huang et.al. | 2312.03372 | null |
2023-12-06 | SO-NeRF: Active View Planning for NeRF using Surrogate Objectives | Keifer Lee et.al. | 2312.03266 | null |
2024-04-08 | Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Shijie Zhou et.al. | 2312.03203 | link |
2024-03-27 | HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces | Haithem Turki et.al. | 2312.03160 | null |
2023-12-05 | ReconFusion: 3D Reconstruction with Diffusion Priors | Rundi Wu et.al. | 2312.02981 | null |
2023-12-05 | HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting | Helisa Dhamo et.al. | 2312.02902 | null |
2023-12-23 | C-NERF: Representing Scene Changes as Directional Consistency Difference-based NeRF | Rui Huang et.al. | 2312.02751 | link |
2023-12-05 | FINER: Flexible spectral-bias tuning in Implicit NEural Representation by Variable-periodic Activation Functions | Zhen Liu et.al. | 2312.02434 | null |
2024-03-21 | PointNeRF++: A multi-scale, point-based Neural Radiance Field | Weiwei Sun et.al. | 2312.02362 | null |
2024-03-19 | Instant Uncertainty Calibration of NeRFs Using a Meta-calibrator | Niki Amini-Naieni et.al. | 2312.02350 | null |
2024-04-17 | Re-Nerfing: Improving Novel Views Synthesis through Novel Views Synthesis | Felix Tristram et.al. | 2312.02255 | null |
2024-01-23 | WavePlanes: A compact Wavelet representation for Dynamic Neural Radiance Fields | Adrian Azzarelli et.al. | 2312.02218 | link |
2023-12-02 | Volumetric Rendering with Baked Quadrature Fields | Gopal Sharma et.al. | 2312.02202 | null |
2023-12-02 | StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D | Pengsheng Guo et.al. | 2312.02189 | null |
2023-12-04 | Mesh-Guided Neural Implicit Field Editing | Can Wang et.al. | 2312.02157 | null |
2023-12-04 | Fast View Synthesis of Casual Videos | Yao-Chih Lee et.al. | 2312.02135 | null |
2024-03-21 | ColonNeRF: High-Fidelity Neural Reconstruction of Long Colonoscopy | Yufei Shi et.al. | 2312.02015 | null |
2024-01-17 | Fast and accurate sparse-view CBCT reconstruction using meta-learned neural attenuation field and hash-encoding regularization | Heejun Shin et.al. | 2312.01689 | null |
2023-12-04 | GaussianHead: Impressive 3D Gaussian-based Head Avatars with Dynamic Hybrid Neural Field | Jie Wang et.al. | 2312.01632 | link |
2024-04-06 | SANeRF-HQ: Segment Anything for NeRF in High Quality | Yichen Liu et.al. | 2312.01531 | null |
2023-12-03 | VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams | Liao Wang et.al. | 2312.01407 | null |
2023-12-05 | Self-Evolving Neural Radiance Fields | Jaewoo Jung et.al. | 2312.01003 | link |
2023-11-30 | PyNeRF: Pyramidal Neural Radiance Fields | Haithem Turki et.al. | 2312.00252 | link |
2023-11-30 | SparseGS: Real-Time 360° Sparse View Synthesis using Gaussian Splatting | Haolin Xiong et.al. | 2312.00206 | link |
2024-04-01 | Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing | Hyelin Nam et.al. | 2311.18608 | null |
2024-03-11 | Anisotropic Neural Representation Learning for High-Quality Neural Rendering | Y. Wang et.al. | 2311.18311 | null |
2023-11-29 | FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information | Wen Jiang et.al. | 2311.17874 | link |
2023-11-29 | SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Ziqiao Peng et.al. | 2311.17590 | link |
2023-11-29 | NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields | Xiaoliang Liu et.al. | 2311.17332 | null |
2023-12-11 | REF $^2$ -NeRF: Reflection and Refraction aware Neural Radiance Field | Wooseok Kim et.al. | 2311.17116 | link |
2024-03-28 | Human Gaussian Splatting: Real-time Rendering of Animatable Avatars | Arthur Moreau et.al. | 2311.17113 | link |
2023-12-11 | UC-NeRF: Neural Radiance Field for Under-Calibrated Multi-view Cameras in Autonomous Driving | Kai Cheng et.al. | 2311.16945 | null |
2023-11-29 | A Unified Approach for Text- and Image-guided 4D Scene Generation | Yufeng Zheng et.al. | 2311.16854 | null |
2023-11-28 | SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation | Jesus Zarzar et.al. | 2311.16671 | link |
2023-11-28 | DGNR: Density-Guided Neural Point Rendering of Large Driving Scenes | Zhuopeng Li et.al. | 2311.16664 | null |
2023-11-28 | SCALAR-NeRF: SCAlable LARge-scale Neural Radiance Fields for Scene Reconstruction | Yu Chen et.al. | 2311.16657 | null |
2024-03-14 | RGBGrasp: Image-based Object Grasping by Capturing Multiple Views during Robot Arm Movement with Neural Radiance Fields | Chang Liu et.al. | 2311.16592 | null |
2023-11-28 | Rethinking Directional Integration in Neural Radiance Fields | Congyue Deng et.al. | 2311.16504 | null |
2023-11-29 | Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars | Yang Liu et.al. | 2311.16482 | link |
2023-10-30 | SeamlessNeRF: Stitching Part NeRFs with Gradient Propagation | Bingchen Gong et.al. | 2311.16127 | null |
2024-03-31 | Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling | Zhe Li et.al. | 2311.16096 | link |
2024-03-27 | SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance Fields | Quentin Herau et.al. | 2311.15803 | null |
2024-03-12 | Neural 3D Strokes: Creating Stylized 3D Scenes with Vectorized 3D Strokes | Hao-Bin Duan et.al. | 2311.15637 | null |
2023-11-27 | CaesarNeRF: Calibrated Semantic Representation for Few-shot Generalizable Neural Rendering | Haidong Zhu et.al. | 2311.15510 | link |
2023-11-26 | Efficient Encoding of Graphics Primitives with Simplex-based Structures | Yibo Wen et.al. | 2311.15439 | null |
2023-11-26 | Obj-NeRF: Extract Object NeRFs from Multi-view Images | Zhiyi Li et.al. | 2311.15291 | null |
2023-12-05 | NeuRAD: Neural Rendering for Autonomous Driving | Adam Tonderski et.al. | 2311.15260 | link |
2024-02-19 | Animate124: Animating One Image to 4D Dynamic Scene | Yuyang Zhao et.al. | 2311.14603 | null |
2023-12-20 | GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Yiwen Chen et.al. | 2311.14521 | link |
2023-11-23 | ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain Optimization | Soonbin Lee et.al. | 2311.14208 | null |
2024-02-26 | Tube-NeRF: Efficient Imitation Learning of Visuomotor Policies from MPC using Tube-Guided Data Augmentation and NeRFs | Andrea Tagliabue et.al. | 2311.14153 | null |
2024-04-01 | Posterior Distillation Sampling | Juil Koo et.al. | 2311.13831 | null |
2023-12-06 | Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder | Xiaohao Xu et.al. | 2311.13750 | null |
2024-02-15 | Compact 3D Gaussian Representation for Radiance Field | Joo Chan Lee et.al. | 2311.13681 | link |
2023-11-22 | Retargeting Visual Data with Deformation Fields | Tim Elsner et.al. | 2311.13297 | null |
2023-11-22 | 3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization | Jianwei Feng et.al. | 2311.13168 | null |
2023-11-21 | Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields | Yifan Wang et.al. | 2311.12490 | null |
2023-11-18 | Towards Function Space Mesh Watermarking: Protecting the Copyright of Signed Distance Fields | Xingyu Zhu et.al. | 2311.12059 | null |
2024-03-12 | Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields | Zhiyuan Min et.al. | 2311.11845 | link |
2024-03-23 | Structure-Aware Sparse-View X-ray 3D Reconstruction | Yuanhao Cai et.al. | 2311.10959 | link |
2023-11-17 | Removing Adverse Volumetric Effects From Trained Neural Radiance Fields | Andreas L. Teigen et.al. | 2311.10523 | null |
2023-11-16 | Adaptive Shells for Efficient Neural Radiance Field Rendering | Zian Wang et.al. | 2311.10091 | null |
2023-11-18 | EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices | Jingnan Gao et.al. | 2311.09806 | null |
2023-11-16 | Reconstructing Continuous Light Field From Single Coded Image | Yuya Ishikawa et.al. | 2311.09646 | null |
2023-11-14 | Drivable 3D Gaussian Avatars | Wojciech Zielonka et.al. | 2311.08581 | null |
2023-11-13 | $L_0$-Sampler: An $L_{0}$ Model Guided Volume Sampling for NeRF | Liangchen Li et.al. | 2311.07044 | null |
2024-03-19 | Aria-NeRF: Multimodal Egocentric View Synthesis | Jiankai Sun et.al. | 2311.06455 | null |
2023-11-10 | ASSIST: Interactive Scene Nodes for Scalable and Realistic Indoor Simulation | Zhide Zhong et.al. | 2311.06211 | null |
2024-03-01 | UMedNeRF: Uncertainty-aware Single View Volumetric Rendering for Medical Neural Radiance Fields | Jing Hu et.al. | 2311.05836 | null |
2023-11-28 | BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis | Hao-Bin Duan et.al. | 2311.05521 | link |
2023-11-09 | VoxNeRF: Bridging Voxel Representation and Neural Radiance Fields for Enhanced Indoor View Synthesis | Sen Wang et.al. | 2311.05289 | null |
2023-11-09 | ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image | Senthil Purushwalkam et.al. | 2311.05230 | null |
2023-11-08 | Learning Robust Multi-Scale Representation for Neural Radiance Fields from Unposed Images | Nishant Jain et.al. | 2311.04521 | null |
2024-03-09 | LRM: Large Reconstruction Model for Single Image to 3D | Yicong Hong et.al. | 2311.04400 | null |
2023-11-07 | High-fidelity 3D Reconstruction of Plants using Neural Radiance Field | Kewei Hu et.al. | 2311.04154 | null |
2023-11-07 | Fast Sun-aligned Outdoor Scene Relighting based on TensoRF | Yeonjin Chang et.al. | 2311.03965 | null |
2023-11-08 | UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields | Injae Kim et.al. | 2311.03784 | link |
2023-11-06 | Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences | Zador Pataki et.al. | 2311.03345 | null |
2023-11-06 | Animating NeRFs from Texture Space: A Framework for Pose-Dependent Rendering of Human Performances | Paul Knoll et.al. | 2311.03140 | null |
2023-11-06 | Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video | Yanqin Jiang et.al. | 2311.02848 | null |
2024-02-02 | InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image | Jianhui Li et.al. | 2311.02826 | link |
2023-11-05 | VR-NeRF: High-Fidelity Virtualized Walkable Spaces | Linning Xu et.al. | 2311.02542 | null |
2023-11-03 | A Neural Radiance Field-Based Architecture for Intelligent Multilayered View Synthesis | D. Dhinakaran et.al. | 2311.01842 | null |
2023-11-26 | Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields | Jianxiong Shen et.al. | 2311.01815 | null |
2023-11-03 | Efficient Cloud Pipelines for Neural Radiance Fields | Derek Jacoby et.al. | 2311.01659 | null |
2023-11-03 | INeAT: Iterative Neural Adaptive Tomography | Bo Xiong et.al. | 2311.01653 | null |
2023-11-02 | Novel View Synthesis from a Single RGBD Image for Indoor Scenes | Congrui Hetang et.al. | 2311.01065 | null |
2023-10-31 | FPO++: Efficient Encoding and Rendering of Dynamic Neural Radiance Fields by Analyzing and Enhancing Fourier PlenOctrees | Saskia Rabich et.al. | 2310.20710 | link |
2024-01-19 | NeRF Revisited: Fixing Quadrature Instability in Volume Rendering | Mikaela Angelina Uy et.al. | 2310.20685 | null |
2024-01-19 | DynPoint: Dynamic Neural Point For View Synthesis | Kaichen Zhou et.al. | 2310.18999 | link |
2024-03-18 | TivNe-SLAM: Dynamic Mapping and Tracking via Time-Varying Neural Radiance Fields | Chengyao Duan et.al. | 2310.18917 | null |
2023-10-28 | INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings | Amirhossein Kazerouni et.al. | 2310.18846 | link |
2023-10-27 | Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations | Tristan Aumentado-Armstrong et.al. | 2310.17880 | null |
2023-10-27 | HyperFields: Towards Zero-Shot Generation of NeRFs from Text | Sudarshan Babu et.al. | 2310.17075 | null |
2023-11-06 | 4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via Semantic Distillation | Dadong Jiang et.al. | 2310.16858 | null |
2023-10-28 | PERF: Panoramic Neural Radiance Field from a Single Panorama | Guangcong Wang et.al. | 2310.16831 | link |
2023-10-25 | Open-NeRF: Towards Open Vocabulary NeRF Decomposition | Hao Zhang et.al. | 2310.16383 | null |
2023-10-24 | Cross-view Self-localization from Synthesized Scene-graphs | Ryogo Yamamoto et.al. | 2310.15504 | null |
2023-10-23 | CAwa-NeRF: Instant Learning of Compression-Aware NeRF Features | Omnia Mahmoud et.al. | 2310.14695 | null |
2023-10-20 | ManifoldNeRF: View-dependent Image Feature Supervision for Few-shot Neural Radiance Fields | Daiju Kanaoka et.al. | 2310.13670 | null |
2023-12-18 | Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos | Seoha Kim et.al. | 2310.13356 | link |
2023-10-20 | UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene | Jiaming Gu et.al. | 2310.13263 | null |
2023-09-14 | Spec-NeRF: Multi-spectral Neural Radiance Fields | Jiabao Li et.al. | 2310.12987 | link |
2023-10-18 | Towards Abdominal 3-D Scene Rendering from Laparoscopy Surgical Videos using NeRFs | Khoa Tuan Nguyen et.al. | 2310.11645 | null |
2023-10-16 | TraM-NeRF: Tracing Mirror and Near-Perfect Specular Reflections through Neural Radiance Fields | Leif Van Holland et.al. | 2310.10650 | link |
2023-12-07 | DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing | Jia-Wei Liu et.al. | 2310.10624 | null |
2024-01-25 | ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context | Binglun Wang et.al. | 2310.09965 | null |
2023-10-15 | Active Perception using Neural Radiance Fields | Siming He et.al. | 2310.09892 | link |
2023-10-15 | CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses | Hongyu Fu et.al. | 2310.09776 | null |
2023-12-10 | Dynamic Appearance Particle Neural Radiance Field | Ancheng Lin et.al. | 2310.07916 | null |
2023-10-11 | rpcPRF: Generalizable MPI Neural Radiance Field for Satellite Camera | Tongtong Zhang et.al. | 2310.07179 | null |
2023-10-10 | Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization | Le Chen et.al. | 2310.06984 | null |
2023-10-10 | High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field | Minghan Qin et.al. | 2310.06275 | null |
2023-10-09 | A Real-time Method for Inserting Virtual Objects into Neural Radiance Fields | Keyang Ye et.al. | 2310.05837 | null |
2023-10-09 | Neural Impostor: Editing Neural Radiance Fields with Explicit Shape Manipulation | Ruiyang Liu et.al. | 2310.05391 | null |
2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
2023-10-08 | Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation | Dominik Hollidt et.al. | 2310.05133 | null |
2024-03-18 | Improving Neural Radiance Field using Near-Surface Sampling with Point Cloud Generation | Hye Bin Yoo et.al. | 2310.04152 | null |
2023-10-05 | Targeted Adversarial Attacks on Generalizable Neural Radiance Fields | Andras Horvath et.al. | 2310.03578 | null |
2023-10-05 | BID-NeRF: RGB-D image pose estimation with inverted Neural Radiance Fields | Ágoston István Csehi et.al. | 2310.03563 | null |
2023-10-05 | Point-Based Radiance Fields for Controllable Human Motion Synthesis | Haitao Yu et.al. | 2310.03375 | link |
2023-10-04 | Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation | Yihan Wu et.al. | 2310.03125 | null |
2023-10-04 | Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day | Yifan Jiang et.al. | 2310.03015 | null |
2024-02-26 | USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields | Moyang Li et.al. | 2310.02687 | link |
2023-12-06 | EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields | Anish Bhattacharya et.al. | 2310.02437 | link |
2023-10-03 | Adaptive Multi-NeRF: Exploit Efficient Parallelism in Adaptive Multiple Scale Neural Radiance Field Rendering | Tong Wang et.al. | 2310.01881 | null |
2023-10-03 | MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields | Takuhiro Kaneko et.al. | 2310.01821 | null |
2023-10-02 | PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2310.00874 | link |
2024-02-13 | How Many Views Are Needed to Reconstruct an Unknown Object Using NeRF? | Sicong Pan et.al. | 2310.00684 | link |
2024-01-10 | Multi-tiling Neural Radiance Field (NeRF) – Geometric Assessment on Large-scale Aerial Datasets | Ningli Xu et.al. | 2310.00530 | null |
2023-09-30 | MMPI: a Flexible Radiance Field Representation by Multiple Multi-plane Images Blending | Yuze He et.al. | 2310.00249 | null |
2023-09-29 | Multi-task View Synthesis with Neural Radiance Fields | Shuhong Zheng et.al. | 2309.17450 | link |
2023-09-29 | Forward Flow for Novel View Synthesis of Dynamic Scenes | Xiang Guo et.al. | 2309.17390 | null |
2023-09-29 | HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field | Xiaochen Zhao et.al. | 2309.17128 | null |
2023-09-28 | DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation | Jiaxiang Tang et.al. | 2309.16653 | link |
2023-09-28 | MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond | Yixuan Li et.al. | 2309.16553 | null |
2023-10-04 | FG-NeRF: Flow-GAN based Probabilistic Neural Radiance Field for Independence-Assumption-Free Uncertainty Estimation | Songlin Wei et.al. | 2309.16364 | null |
2023-09-28 | Learning Effective NeRFs and SDFs Representations with 3D Generative Adversarial Networks for 3D Object Generation: Technical Report for ICCV 2023 OmniObject3D Challenge | Zheyuan Yang et.al. | 2309.16110 | null |
2023-09-27 | NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions | Zhang Chen et.al. | 2309.15426 | link |
2023-09-27 | BASED: Bundle-Adjusting Surgical Endoscopic Dynamic Video Reconstruction using Neural Radiance Fields | Shreya Saha et.al. | 2309.15329 | null |
2023-09-26 | 3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction | Miriam Jäger et.al. | 2309.14800 | null |
2023-12-11 | NAS-NeRF: Generative Neural Architecture Search for Neural Radiance Fields | Saeejith Nair et.al. | 2309.14293 | null |
2023-09-25 | Tiled Multiplane Images for Practical 3D Photography | Numair Khan et.al. | 2309.14291 | null |
2023-09-25 | Variational Inference for Scalable 3D Object-centric Learning | Tianyu Wang et.al. | 2309.14010 | null |
2023-11-28 | MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field | Zijiang Yang et.al. | 2309.13607 | null |
2023-09-22 | NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields | Xiaoxue Chen et.al. | 2309.13039 | link |
2023-09-22 | RHINO: Regularizing the Hash-based Implicit Neural Representation | Hao Zhu et.al. | 2309.12642 | null |
2023-09-21 | NeuralLabeling: A versatile toolset for labeling vision datasets using Neural Radiance Fields | Floris Erich et.al. | 2309.11966 | link |
2023-09-21 | Fast Satellite Tensorial Radiance Field for Multi-date Satellite Imagery of Large Size | Tongtong Zhang et.al. | 2309.11767 | null |
2023-09-21 | MarkNerf:Watermarking for Neural Radiance Field | Lifeng Chen et.al. | 2309.11747 | null |
2023-09-21 | Rendering stable features improves sampling-based localisation with Neural radiance fields | Boxuan Zhang et.al. | 2309.11698 | null |
2023-09-25 | Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates | Ka Chun Shum et.al. | 2309.11281 | link |
2023-09-21 | Controllable Dynamic Appearance for Neural 3D Portraits | ShahRukh Athar et.al. | 2309.11009 | null |
2023-11-13 | SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World | Xingting Yao et.al. | 2309.10987 | link |
2023-09-19 | Locally Stylized Neural Radiance Fields | Hong-Wing Pang et.al. | 2309.10684 | null |
2023-09-19 | Steganography for Neural Radiance Fields by Backdooring | Weina Dong et.al. | 2309.10503 | null |
2023-10-20 | Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach | Rong Liu et.al. | 2309.10011 | null |
2023-09-17 | NeRF-VINS: A Real-time Neural Radiance Field Map-based Visual-Inertial Navigation System | Saimouli Katragadda et.al. | 2309.09295 | null |
2023-09-16 | DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF | Mert Asim Karaoglu et.al. | 2309.08927 | link |
2023-09-15 | Robust e-NeRF: NeRF from Sparse & Noisy Events under Non-Uniform Motion | Weng Fei Low et.al. | 2309.08596 | link |
2023-10-18 | Breathing New Life into 3D Assets with Generative Repainting | Tianfu Wang et.al. | 2309.08523 | link |
2023-09-25 | Deformable Neural Radiance Fields using RGB and Event Cameras | Qi Ma et.al. | 2309.08416 | null |
2023-09-14 | Gradient based Grasp Pose Optimization on a NeRF that Approximates Grasp Success | Gergely Sóti et.al. | 2309.08040 | null |
2023-09-14 | MC-NeRF: Muti-Camera Neural Radiance Fields for Muti-Camera Image Acquisition Systems | Yu Gao et.al. | 2309.07846 | null |
2023-09-14 | DT-NeRF: Decomposed Triplane-Hash Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis | Yaoyu Su et.al. | 2309.07752 | null |
2023-09-14 | CoRF : Colorizing Radiance Fields using Knowledge Distillation | Ankit Dhiman et.al. | 2309.07668 | null |
2023-09-14 | Indoor Scene Reconstruction with Fine-Grained Details Using Hybrid Representation and Normal Prior Enhancement | Sheng Ye et.al. | 2309.07640 | link |
2023-09-13 | Text-Guided Generation and Editing of Compositional 3D Avatars | Hao Zhang et.al. | 2309.07125 | null |
2023-09-13 | Dynamic NeRFs for Soccer Scenes | Sacha Lewin et.al. | 2309.06802 | link |
2023-09-12 | Learning Disentangled Avatars with Hybrid 3D Representations | Yao Feng et.al. | 2309.06441 | null |
2023-09-12 | Federated Learning for Large-Scale Scene Modeling with Neural Radiance Fields | Teppei Suzuki et.al. | 2309.06030 | null |
2023-09-10 | SC-NeRF: Self-Correcting Neural Radiance Field with Sparse Views | Liang Song et.al. | 2309.05028 | null |
2023-09-09 | Mirror-Aware Neural Humans | Daniel Ajisafe et.al. | 2309.04750 | link |
2023-09-08 | DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields | Junzhe Zhang et.al. | 2309.04410 | link |
2023-09-14 | SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions | Nagabhushan Somraj et.al. | 2309.03955 | null |
2023-09-07 | BluNF: Blueprint Neural Field | Robin Courant et.al. | 2309.03933 | null |
2023-09-07 | Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model | Sungwon Hwang et.al. | 2309.03550 | null |
2023-09-06 | Bayes’ Rays: Uncertainty Quantification for Neural Radiance Fields | Lily Goli et.al. | 2309.03185 | link |
2023-09-06 | Instant Continual Learning of Neural Radiance Fields | Ryan Po et.al. | 2309.01811 | null |
2023-09-04 | Adv3D: Generating 3D Adversarial Examples in Driving Scenarios with NeRF | Leheng Li et.al. | 2309.01351 | null |
2023-09-01 | SparseSat-NeRF: Dense Depth Supervised Neural Radiance Fields for Sparse Satellite Images | Lulin Zhang et.al. | 2309.00277 | link |
2023-09-04 | Improving NeRF Quality by Progressive Camera Placement for Unrestricted Navigation in Complex Environments | Georgios Kopanas et.al. | 2309.00014 | null |
2023-08-30 | From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications | Shreyank N Gowda et.al. | 2308.16041 | null |
2023-08-30 | Drone-NeRF: Efficient NeRF Based 3D Scene Reconstruction for Large-Scale Drone Survey | Zhihao Jia et.al. | 2308.15733 | null |
2023-08-29 | Efficient Ray Sampling for Radiance Fields Reconstruction | Shilei Sun et.al. | 2308.15547 | null |
2023-08-29 | Pose-Free Neural Radiance Fields via Implicit Pose Regularization | Jiahui Zhang et.al. | 2308.15049 | null |
2023-08-28 | CLNeRF: Continual Learning Meets NeRF | Zhipeng Cai et.al. | 2308.14816 | link |
2023-08-28 | Flexible Techniques for Differentiable Rendering with 3D Gaussians | Leonid Keselman et.al. | 2308.14737 | null |
2023-08-28 | Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight ToF Sensor | Xinyang Liu et.al. | 2308.14383 | null |
2023-08-27 | Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code Diffusion using Transformers | Abril Corona-Figueroa et.al. | 2308.14152 | link |
2023-08-27 | Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views | Zi-Xin Zou et.al. | 2308.14078 | null |
2023-08-26 | InsertNeRF: Instilling Generalizability into NeRF with HyperNet Modules | Yanqi Bao et.al. | 2308.13897 | link |
2023-08-25 | Relighting Neural Radiance Fields with Shadow and Highlight Hints | Chong Zeng et.al. | 2308.13404 | link |
2023-09-06 | ARF-Plus: Controlling Perceptual Factors in Artistic Radiance Fields for 3D Scene Stylization | Wenzhao Li et.al. | 2308.12452 | null |
2023-09-11 | Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields | Hyeonseop Song et.al. | 2308.11974 | null |
2023-09-29 | Pose Modulated Avatars from Video | Chunjin Song et.al. | 2308.11951 | null |
2023-08-22 | SAMSNeRF: Segment Anything Model (SAM) Guides Dynamic Surgical Scene Reconstruction by Neural Radiance Field (NeRF) | Ange Lou et.al. | 2308.11774 | null |
Industry
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-01 | Controllable Weather Synthesis and Removal with Video Diffusion Models | Chih-Hao Lin et.al. | 2505.00704 | null |
2025-04-21 | LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception | Yuan-Hong Liao et.al. | 2504.15362 | null |
2025-04-15 | PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond | Minghua Liu et.al. | 2504.11451 | null |
2025-04-17 | VideoPanda: Video Panoramic Diffusion with Multi-view Attention | Kevin Xie et.al. | 2504.11389 | null |
2025-04-01 | Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control | NVIDIA et.al. | 2503.14492 | link |
2025-03-05 | GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control | Xuanchi Ren et.al. | 2503.03751 | link |
2025-03-03 | Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models | Jay Zhangjie Wu et.al. | 2503.01774 | null |
2025-03-22 | DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models | Ruofan Liang et.al. | 2501.18590 | null |
2025-03-18 | Cosmos World Foundation Model Platform for Physical AI | NVIDIA et.al. | 2501.03575 | link |
2024-12-05 | InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models | Yifan Lu et.al. | 2412.03934 | null |
2025-04-01 | Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos | Hanxue Liang et.al. | 2412.03526 | null |
2024-11-14 | LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models | Zhengyi Wang et.al. | 2411.09595 | null |
2025-02-28 | ReMatching Dynamic Reconstruction Flow | Sara Oblak et.al. | 2411.00705 | null |
2024-10-26 | SCube: Instant Large-Scale Scene Reconstruction using VoxSplats | Xuanchi Ren et.al. | 2410.20030 | null |
2025-02-11 | SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes | Tianchang Shen et.al. | 2409.20562 | null |
2024-09-28 | G3R: Gradient Guided Generalizable Reconstruction | Yun Chen et.al. | 2409.19405 | null |
2024-09-27 | UniCal: Unified Neural Sensor Calibration | Ze Yang et.al. | 2409.18953 | null |
2024-09-26 | Learning to Drive via Asymmetric Self-Play | Chris Zhang et.al. | 2409.18218 | null |
2024-09-15 | Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Yuan-Hong Liao et.al. | 2409.09788 | null |
2025-04-19 | OmniRe: Omni Urban Scene Reconstruction | Ziyu Chen et.al. | 2408.16760 | null |
2024-08-19 | Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering | Ruofan Liang et.al. | 2408.09702 | null |
2025-03-20 | Wolf: Dense Video Captioning with a World Summarization Framework | Boyi Li et.al. | 2407.18908 | null |
2024-07-15 | SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation | Jordan Juravsky et.al. | 2407.10481 | null |
2024-10-10 | 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes | Nicolas Moenne-Loccoz et.al. | 2407.07090 | null |
2024-07-01 | fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence | Francis Williams et.al. | 2407.01781 | null |
2024-10-31 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-14 | L4GM: Large 4D Gaussian Reconstruction Model | Jiawei Ren et.al. | 2406.10324 | null |
2024-06-12 | UnO: Unsupervised Occupancy Fields for Perception and Forecasting | Ben Agro et.al. | 2406.08691 | null |
2024-06-12 | Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata | Dongsu Zhang et.al. | 2406.08292 | null |
2024-06-13 | DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Sergio Casas et.al. | 2406.04426 | null |
2024-04-24 | NeRF-XL: Scaling NeRFs with Multiple GPUs | Ruilong Li et.al. | 2404.16221 | null |
2024-04-22 | Align Your Steps: Optimizing Sampling Schedules in Diffusion Models | Amirmojtaba Sabour et.al. | 2404.14507 | null |
2024-04-16 | RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting | Ashkan Mirzaei et.al. | 2404.10765 | null |
2024-04-09 | Can Feedback Enhance Semantic Grounding in Large Vision-Language Models? | Yuan-Hong Liao et.al. | 2404.06510 | null |
2024-04-01 | QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving | Sourav Biswas et.al. | 2404.01486 | null |
2024-03-22 | LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis | Kevin Xie et.al. | 2403.15385 | null |
2024-03-22 | Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks | Aqeel Anwar et.al. | 2403.15370 | null |
2024-01-22 | EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models | Koichi Namekata et.al. | 2401.11739 | null |
2023-12-28 | Compact Neural Graphics Primitives with Learned Hash Probing | Towaki Takikawa et.al. | 2312.17241 | null |
2024-01-03 | Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Huan Ling et.al. | 2312.13763 | null |
2023-12-11 | LightSim: Neural Lighting Simulation for Urban Scenes | Ava Pun et.al. | 2312.06654 | null |
2024-04-14 | Trajeglish: Traffic Modeling as Next-Token Prediction | Jonah Philion et.al. | 2312.04535 | null |
2024-06-25 | XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies | Xuanchi Ren et.al. | 2312.03806 | link |
2024-04-12 | WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space | Katja Schwarz et.al. | 2311.13570 | null |
2023-11-16 | Adaptive Shells for Efficient Neural Radiance Field Rendering | Zian Wang et.al. | 2311.10091 | null |
2023-11-09 | Real-Time Neural Rasterization for Large Scenes | Jeffrey Yunfan Liu et.al. | 2311.05607 | null |
2023-11-09 | Reconstructing Objects in-the-wild for Realistic Sensor Simulation | Ze Yang et.al. | 2311.05602 | null |
2023-11-07 | 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features | Chenfeng Xu et.al. | 2311.04391 | null |
2023-11-03 | EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision | Jiawei Yang et.al. | 2311.02077 | null |
2023-11-03 | Towards Unsupervised Object Detection From LiDAR Point Clouds | Lunjun Zhang et.al. | 2311.02007 | null |
2023-11-02 | MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory | Enxu Li et.al. | 2311.01556 | null |
2023-11-17 | 4D-Former: Multimodal 4D Panoptic Segmentation | Ali Athar et.al. | 2311.01520 | null |
2023-11-02 | UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation | Yuwen Xiong et.al. | 2311.01448 | null |
2023-11-02 | CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation | Jingkang Wang et.al. | 2311.01447 | null |
2023-11-02 | Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation | Jay Sarva et.al. | 2311.01446 | null |
2023-11-02 | LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds | Anqi Joyce Yang et.al. | 2311.01444 | null |
2023-11-02 | Learning Realistic Traffic Agents in Closed-loop | Chris Zhang et.al. | 2311.01394 | null |
2024-04-01 | Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion | Lunjun Zhang et.al. | 2311.01017 | null |
2024-01-26 | ViR: Towards Efficient Vision Retention Backbones | Ali Hatamizadeh et.al. | 2310.19731 | null |
2023-10-20 | TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models | Tianshi Cao et.al. | 2310.13772 | null |
2023-09-11 | Towards Viewpoint Robustness in Bird’s Eye View Segmentation | Tzofi Klinghoffer et.al. | 2309.05192 | null |
2023-08-10 | Flexible Isosurface Extraction for Gradient-Based Mesh Optimization | Tianchang Shen et.al. | 2308.05371 | null |
2023-08-03 | UniSim: A Neural Closed-Loop Sensor Simulator | Ze Yang et.al. | 2308.01898 | null |
2023-08-02 | Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving | Ben Agro et.al. | 2308.01471 | null |
2023-07-14 | DreamTeacher: Pretraining Image Backbones with Deep Generative Models | Daiqing Li et.al. | 2307.07487 | null |
2023-06-27 | Rethinking Closed-loop Training for Autonomous Driving | Chris Zhang et.al. | 2306.15713 | null |
2023-06-06 | ATT3D: Amortized Text-to-3D Object Synthesis | Jonathan Lorraine et.al. | 2306.07349 | null |
2023-06-09 | Neural Kernel Surface Reconstruction | Jiahui Huang et.al. | 2305.19590 | null |
2023-08-13 | Neural LiDAR Fields for Novel View Synthesis | Shengyu Huang et.al. | 2305.01643 | null |
2023-04-19 | NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models | Seung Wook Kim et.al. | 2304.09787 | null |
2023-12-28 | Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models | Andreas Blattmann et.al. | 2304.08818 | link |
2023-04-06 | Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes | Zian Wang et.al. | 2304.03266 | null |
2023-04-04 | Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion | Davis Rempe et.al. | 2304.01893 | null |
2023-03-25 | VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion | Yiming Li et.al. | 2302.12251 | link |
2023-02-09 | Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting | Viraj Prabhu et.al. | 2302.04832 | null |
2023-02-02 | Synthesizing Physical Character-Scene Interactions | Mohamed Hassan et.al. | 2302.00883 | null |
2023-01-31 | PADL: Language-Directed Physics-Based Character Control | Jordan Juravsky et.al. | 2301.13868 | link |
2023-03-25 | Magic3D: High-Resolution Text-to-3D Content Creation | Chen-Hsuan Lin et.al. | 2211.10440 | null |
2022-11-08 | GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting | Alexander Cui et.al. | 2211.02545 | null |
2022-10-12 | LION: Latent Point Diffusion Models for 3D Shape Generation | Xiaohui Zeng et.al. | 2210.06978 | link |
2022-10-06 | XDGAN: Multi-Modal 3D Shape Generation in 2D Space | Hassan Abu Alhaija et.al. | 2210.03007 | null |
2022-10-03 | Optimizing Data Collection for Machine Learning | Rafid Mahmood et.al. | 2210.01234 | null |
2022-09-26 | EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations | Ahmad Darkhalil et.al. | 2209.13064 | link |
2022-09-22 | GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images | Jun Gao et.al. | 2209.11163 | link |
2022-08-19 | Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion | Zian Wang et.al. | 2208.09480 | null |
2022-08-18 | MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation | Gopal Sharma et.al. | 2208.08580 | null |
2022-07-05 | Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention | Gary Leung et.al. | 2207.02126 | null |
2022-07-13 | How Much More Data Do I Need? Estimating Requirements for Downstream Tasks | Rafid Mahmood et.al. | 2207.01725 | null |
2022-06-19 | Scalable Neural Data Server: A Data Recommender for Transfer Learning | Tianshi Cao et.al. | 2206.09386 | null |
2022-06-16 | Virtual Correspondence: Humans as a Cue for Extreme-View Geometry | Wei-Chiu Ma et.al. | 2206.08365 | null |
2022-06-15 | Variable Bitrate Neural Fields | Towaki Takikawa et.al. | 2206.07707 | link |
2022-06-06 | Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps | Seung Wook Kim et.al. | 2206.02903 | null |
2022-05-05 | ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters | Xue Bin Peng et.al. | 2205.01906 | null |
2022-04-19 | M $^2$ BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation | Enze Xie et.al. | 2204.05088 | null |
2022-04-06 | AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis | Zhiqin Chen et.al. | 2204.03105 | null |
2022-05-10 | Learning Smooth Neural Functions via Lipschitz Regularization | Hsueh-Ti Derek Liu et.al. | 2202.08345 | null |
2022-02-10 | Domain Adversarial Training: A Game Perspective | David Acuna et.al. | 2202.05352 | null |
2022-04-21 | Causal Scene BERT: Improving object detection by searching for challenging groups of data | Cinjon Resnick et.al. | 2202.03651 | null |
2022-01-20 | Federated Learning with Heterogeneous Architectures using Graph HyperNetworks | Or Litany et.al. | 2201.08459 | null |
2022-01-12 | BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations | Daiqing Li et.al. | 2201.04684 | null |
2022-03-28 | Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior | Davis Rempe et.al. | 2112.05077 | null |
2022-08-26 | Frame Averaging for Equivariant Shape Space Learning | Matan Atzmon et.al. | 2112.01741 | null |
2021-12-02 | Hierarchical Neural Implicit Pose Network for Animation and Motion Retargeting | Sourav Biswas et.al. | 2112.00958 | null |
2021-11-26 | Neural Fields as Learnable Kernels for 3D Reconstruction | Francis Williams et.al. | 2111.13674 | null |
2023-04-11 | Extracting Triangular 3D Models, Materials, and Lighting From Images | Jacob Munkberg et.al. | 2111.12503 | link |
2021-11-15 | Towards Optimal Strategies for Training Self-Driving Perception Models in Simulation | David Acuna et.al. | 2111.07971 | null |
2021-11-08 | Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis | Tianchang Shen et.al. | 2111.04276 | null |
2021-11-04 | EditGAN: High-Precision Semantic Image Editing | Huan Ling et.al. | 2111.03186 | null |
2021-11-29 | Don’t Generate Me: Training Differentially Private Generative Models with Sinkhorn Divergence | Tianshi Cao et.al. | 2111.01177 | null |
2021-10-30 | DIB-R++: Learning to Predict Lighting and Material with a Hybrid Differentiable Renderer | Wenzheng Chen et.al. | 2111.00140 | null |
2021-10-07 | ATISS: Autoregressive Transformers for Indoor Scene Synthesis | Despoina Paschalidou et.al. | 2110.03675 | link |
2022-08-11 | Physics-based Human Motion Estimation and Synthesis from Videos | Kevin Xie et.al. | 2109.09913 | null |
2021-10-20 | Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting | Zian Wang et.al. | 2109.06061 | null |
2021-08-30 | 3DStyleNet: Creating 3D Shapes with Geometric and Texture Style Variations | Kangxue Yin et.al. | 2108.12958 | null |
2021-07-04 | NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation | Xiaohui Zeng et.al. | 2106.13435 | link |
2021-06-21 | f-Domain-Adversarial Learning: Theory and Algorithms | David Acuna et.al. | 2106.11344 | null |
2023-03-07 | Low Budget Active Learning via Wasserstein Distance: An Integer Programming Approach | Rafid Mahmood et.al. | 2106.02968 | null |
2021-04-30 | DriveGAN: Towards a Controllable High-Quality Neural Simulation | Seung Wook Kim et.al. | 2104.15060 | null |
2021-04-26 | Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets | Yuan-Hong Liao et.al. | 2104.12690 | null |
2021-04-20 | DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort | Yuxuan Zhang et.al. | 2104.06490 | link |
2021-04-12 | Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization | Daiqing Li et.al. | 2104.05833 | null |
2021-10-18 | Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection | Nadine Chang et.al. | 2104.05702 | link |
2021-04-08 | Just Label What You Need: Fine-Grained Active Selection for Perception and Prediction through Partially Labeled Scenes | Sean Segal et.al. | 2104.03956 | null |
2021-04-06 | gradSim: Differentiable simulation for system identification and visuomotor control | Krishna Murthy Jatavallabhula et.al. | 2104.02646 | null |
2021-03-18 | Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks | Despoina Paschalidou et.al. | 2103.10429 | link |
2021-01-26 | Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes | Towaki Takikawa et.al. | 2101.10994 | link |
2021-01-20 | IntentNet: Learning to Predict Intention from Raw Sensor Data | Sergio Casas et.al. | 2101.07907 | null |
2021-01-19 | Deep Feedback Inverse Problem Solver | Wei-Chiu Ma et.al. | 2101.07719 | null |
2021-01-18 | Non-parametric Memory for Spatio-Temporal Segmentation of Construction Zones for Self-Driving | Min Bai et.al. | 2101.06865 | null |
2021-11-25 | Mending Neural Implicit Modeling for 3D Vehicle Reconstruction in the Wild | Shivam Duggal et.al. | 2101.06860 | null |
2021-04-29 | Deep Structured Reactive Planning | Jerry Liu et.al. | 2101.06832 | null |
2021-01-18 | MP3: A Unified Model to Map, Perceive, Predict and Plan | Sergio Casas et.al. | 2101.06806 | null |
2022-01-07 | Exploring Adversarial Robustness of Multi-Sensor Perception Systems in Self Driving | James Tu et.al. | 2101.06784 | null |
2021-01-17 | Deep Parametric Continuous Convolutional Neural Networks | Shenlong Wang et.al. | 2101.06742 | null |
2021-04-10 | Deep Multi-Task Learning for Joint Localization, Perception, and Prediction | John Phillips et.al. | 2101.06720 | null |
2021-01-17 | End-to-end Interpretable Neural Motion Planner | Wenyuan Zeng et.al. | 2101.06679 | null |
2021-01-17 | LaneRCNN: Distributed Representations for Graph-Centric Motion Forecasting | Wenyuan Zeng et.al. | 2101.06653 | null |
2021-01-17 | Network Automatic Pruning: Start NAP and Take a Nap | Wenyuan Zeng et.al. | 2101.06608 | null |
2021-08-01 | PLUMENet: Efficient 3D Object Detection from Stereo Images | Yan Wang et.al. | 2101.06594 | null |
2021-01-17 | Cost-Efficient Online Hyperparameter Optimization | Jingkang Wang et.al. | 2101.06590 | null |
2021-03-11 | Auto4D: Learning to Label 4D Objects from Sequential Point Clouds | Bin Yang et.al. | 2101.06586 | null |
2021-01-17 | S3: Neural Shape, Skeleton, and Skinning Fields for 3D Human Modeling | Ze Yang et.al. | 2101.06571 | null |
2021-07-15 | Asynchronous Multi-View SLAM | Anqi Joyce Yang et.al. | 2101.06562 | null |
2021-10-12 | Adversarial Attacks On Multi-Agent Communication | James Tu et.al. | 2101.06560 | null |
2021-01-17 | TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors | Simon Suo et.al. | 2101.06557 | null |
2021-01-16 | Diverse Complexity Measures for Dataset Curation in Self-driving | Abbas Sadat et.al. | 2101.06554 | null |
2021-10-12 | Self-Supervised Representation Learning from Flow Equivariance | Yuwen Xiong et.al. | 2101.06553 | null |
2023-04-16 | AdvSim: Generating Safety-Critical Scenarios for Self-Driving Vehicles | Jingkang Wang et.al. | 2101.06549 | null |
2021-05-07 | LookOut: Diverse Multi-Future Prediction and Planning for Self-Driving | Alexander Cui et.al. | 2101.06547 | null |
2021-01-16 | VideoClick: Video Object Segmentation with a Single Click | Namdar Homayounfar et.al. | 2101.06545 | null |
2021-05-16 | GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving | Yun Chen et.al. | 2101.06543 | null |
2021-01-16 | SceneGen: Learning to Generate Realistic Traffic Scenes | Shuhan Tan et.al. | 2101.06541 | null |
2021-01-07 | Safety-Oriented Pedestrian Motion and Scene Occupancy Forecasting | Katie Luo et.al. | 2101.02385 | null |
2024-05-01 | Pit30M: A Benchmark for Global Localization in the Age of Self-Driving Cars | Julieta Martinez et.al. | 2012.12437 | link |
2020-12-22 | Learning Joint 2D-3D Representations for Depth Completion | Yun Chen et.al. | 2012.12402 | null |
2020-12-22 | Multi-Task Multi-Sensor Fusion for 3D Object Detection | Ming Liang et.al. | 2012.12397 | null |
2020-12-22 | Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net | Wenjie Luo et.al. | 2012.12395 | null |
2020-12-22 | DAGMapper: Learning to Map by Discovering Lane Topology | Namdar Homayounfar et.al. | 2012.12377 | null |
2020-12-22 | Hierarchical Recurrent Attention Networks for Structured Online Maps | Namdar Homayounfar et.al. | 2012.12314 | null |
2020-12-21 | Convolutional Recurrent Network for Road Boundary Extraction | Justin Liang et.al. | 2012.12160 | null |
2020-12-21 | HDNET: Exploiting HD Maps for 3D Object Detection | Bin Yang et.al. | 2012.11704 | null |
2021-01-14 | End-to-End Deep Structured Models for Drawing Crosswalks | Justin Liang et.al. | 2012.11585 | null |
2020-12-20 | Deep Continuous Fusion for Multi-Sensor 3D Object Detection | Ming Liang et.al. | 2012.10992 | null |
2020-12-20 | Learning to Localize Through Compressed Binary Maps | Xinkai Wei et.al. | 2012.10942 | null |
2020-12-20 | Learning to Localize Using a LiDAR Intensity Map | Ioan Andrei Bârsan et.al. | 2012.10902 | null |
2021-03-26 | Personalized Federated Learning with First Order Model Optimization | Michael Zhang et.al. | 2012.08565 | link |
2020-12-14 | A PAC-Bayesian Approach to Generalization Bounds for Graph Neural Networks | Renjie Liao et.al. | 2012.07690 | null |
2020-12-13 | GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation | Xiaojuan Qi et.al. | 2012.06980 | link |
2020-11-30 | UniCon: Universal Neural Controller For Physics-based Character Motion | Tingwu Wang et.al. | 2011.15119 | null |
2021-03-17 | Emergent Road Rules In Multi-Agent Driving Environments | Avik Pal et.al. | 2011.10753 | link |
2020-11-16 | Recovering and Simulating Pedestrians in the Wild | Ze Yang et.al. | 2011.08106 | null |
2021-01-08 | MuSCLE: Multi Sweep Compression of LiDAR using Deep Entropy Models | Sourav Biswas et.al. | 2011.07590 | null |
2020-11-13 | StrObe: Streaming Object Detection from LiDAR Packets | Davi Frossard et.al. | 2011.06425 | null |
2020-11-12 | Universal Embeddings for Spatio-Temporal Tagging of Self-Driving Logs | Sean Segal et.al. | 2011.06165 | null |
2020-11-10 | Learning to Communicate and Correct Pose Errors | Nicholas Vadivelu et.al. | 2011.05289 | null |
2020-11-23 | Learning Deformable Tetrahedral Meshes for 3D Reconstruction | Jun Gao et.al. | 2011.01437 | link |
2021-03-26 | Perceive, Attend, and Drive: Learning Spatial Attention for Safe Self-Driving | Bob Wei et.al. | 2011.01153 | null |
2021-04-10 | Permute, Quantize, and Fine-tune: Efficient Compression of Neural Networks | Julieta Martinez et.al. | 2010.15703 | link |
2021-05-03 | Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration | Xavier Puig et.al. | 2010.09890 | link |
2021-07-13 | The efficacy of Neural Planning Metrics: A meta-analysis of PKL on nuScenes | Yiluan Guo et.al. | 2010.09350 | null |
2021-04-20 | Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering | Yuxuan Zhang et.al. | 2010.09125 | null |
2020-11-12 | LiRaNet: End-to-End Trajectory Prediction using Spatio-Temporal Radar Fusion | Meet Shah et.al. | 2010.00731 | null |
2020-09-01 | Fed-Sim: Federated Simulation for Medical Imaging | Daiqing Li et.al. | 2009.00668 | null |
2020-08-26 | Expressive Telepresence via Modular Codec Avatars | Hang Chu et.al. | 2008.11789 | null |
2020-10-26 | Interactive Annotation of 3D Object Geometry using 2D Scribbles | Tianchang Shen et.al. | 2008.10719 | null |
2020-08-22 | ScribbleBox: Interactive Annotation Framework for Video Object Segmentation | Bowen Chen et.al. | 2008.09721 | null |
2023-04-03 | Beyond Fixed Grid: Learning Geometric Image Representation with a Deformable Grid | Jun Gao et.al. | 2008.09269 | null |
2020-08-20 | Conditional Entropy Coding for Efficient Video Compression | Jerry Liu et.al. | 2008.09180 | null |
2020-08-20 | Weakly-supervised 3D Shape Completion in the Wild | Jiayuan Gu et.al. | 2008.09110 | null |
2020-08-20 | Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation | Jeevan Devaranjan et.al. | 2008.09092 | null |
2020-08-18 | Learning to Generate Diverse Dance Motions with Transformer | Jiaman Li et.al. | 2008.08171 | null |
2020-08-17 | V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction | Tsun-Hsuan Wang et.al. | 2008.07519 | null |
2020-08-13 | DSDNet: Deep Structured self-Driving Network | Wenyuan Zeng et.al. | 2008.06041 | null |
2020-08-13 | Testing the Safety of Self-driving Vehicles by Simulating Perception and Prediction | Kelvin Wong et.al. | 2008.06020 | null |
2020-08-13 | Perceive, Predict, and Plan: Safe Motion Planning Through Interpretable Semantic Representations | Abbas Sadat et.al. | 2008.05930 | null |
2020-08-13 | End-to-end Contextual Perception and Prediction with Interaction Transformer | Lingyun Luke Li et.al. | 2008.05927 | null |
2020-08-13 | Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D | Jonah Philion et.al. | 2008.05711 | link |
2020-11-29 | LoCo: Local Contrastive Representation Learning | Yuwen Xiong et.al. | 2008.01342 | null |
2020-07-30 | LevelSet R-CNN: A Deep Variational Method for Instance Segmentation | Namdar Homayounfar et.al. | 2007.15629 | null |
2020-07-28 | RadarNet: Exploiting Radar for Robust Perception of Dynamic Objects | Bin Yang et.al. | 2007.14366 | null |
2020-07-27 | Learning Lane Graph Representations for Motion Forecasting | Ming Liang et.al. | 2007.13732 | link |
2020-07-23 | Implicit Latent Variable Model for Scene-Consistent Motion Forecasting | Sergio Casas et.al. | 2007.12036 | null |
2020-07-23 | Hierarchical Verification for Adversarial Robustness | Cong Han Lim et.al. | 2007.11826 | null |
2020-08-14 | Multi-Agent Routing Value Iteration Network | Quinlan Sykora et.al. | 2007.05096 | link |
2020-06-16 | LiDARsim: Realistic LiDAR Simulation by Leveraging the Real World | Sivabalan Manivasagam et.al. | 2006.09348 | null |
2020-06-04 | The Importance of Prior Knowledge in Precise Multimodal Prediction | Sergio Casas et.al. | 2006.02636 | null |
2020-06-27 | PnPNet: End-to-End Perception and Prediction with Tracking in the Loop | Ming Liang et.al. | 2005.14711 | null |
2020-05-25 | Learning to Simulate Dynamic Environments with GameGAN | Seung Wook Kim et.al. | 2005.12126 | null |
2020-05-24 | ShapeAdv: Generating Shape-Aware Adversarial 3D Point Clouds | Kibok Lee et.al. | 2005.11626 | null |
2021-01-08 | OctSqueeze: Octree-Structured Entropy Model for LiDAR Compression | Lila Huang et.al. | 2005.07178 | null |
2020-04-29 | The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines | Dima Damen et.al. | 2005.00343 | link |
2020-04-19 | Learning to Evaluate Perception Models Using Planner-Centric Metrics | Jonah Philion et.al. | 2004.08745 | null |
2020-04-02 | Physically Realizable Adversarial Examples for LiDAR Object Detection | James Tu et.al. | 2004.00543 | null |
2020-04-01 | Neural Data Server: A Large-Scale Search Engine for Transfer Learning Data | Xi Yan et.al. | 2001.02799 | null |
2020-01-01 | The Shmoop Corpus: A Dataset of Stories with Loosely Aligned Summaries | Atef Chaudhury et.al. | 1912.13082 | link |
2020-05-18 | Dense RepPoints: Representing Visual Objects with Dense Point Sets | Ze Yang et.al. | 1912.11473 | link |
2021-01-16 | PolyTransform: Deep Polygon Transformer for Instance Segmentation | Justin Liang et.al. | 1912.02801 | null |
2019-11-13 | Kaolin: A PyTorch Library for Accelerating 3D Deep Learning Research | Krishna Murthy Jatavallabhula et.al. | 1911.05063 | link |
2019-10-25 | CrevNet: Conditionally Reversible Video Prediction | Wei Yu et.al. | 1910.11577 | null |
2019-10-24 | Identifying Unknown Instances for Autonomous Driving | Kelvin Wong et.al. | 1910.11296 | null |
2019-10-18 | Spatially-Aware Graph Neural Networks for Relational Behavior Forecasting from Sensor Data | Sergio Casas et.al. | 1910.08233 | null |
2019-10-17 | Discrete Residual Flow for Probabilistic Pedestrian Behavior Prediction | Ajay Jain et.al. | 1910.08041 | null |
2020-02-13 | Learning to Remember from a Multi-Task Teacher | Yuwen Xiong et.al. | 1910.04650 | null |
2019-10-10 | Jointly Learnable Behavior and Trajectory Planning for Self-Driving Vehicles | Abbas Sadat et.al. | 1910.04586 | null |
2019-10-04 | Neural Turtle Graphics for Modeling City Road Layouts | Hang Chu et.al. | 1910.02055 | null |
2020-07-17 | Efficient Graph Generation with Graph Recurrent Attention Networks | Renjie Liao et.al. | 1910.00760 | link |
2019-09-27 | DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation | Xiaohui Zeng et.al. | 1909.12471 | link |
2020-02-14 | A Theoretical Analysis of the Number of Shots in Few-Shot Learning | Tianshi Cao et.al. | 1909.11722 | null |
2019-09-12 | DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch | Shivam Duggal et.al. | 1909.05845 | null |
2019-08-09 | DSIC: Deep Stereo Image Compression | Jerry Liu et.al. | 1908.03631 | null |
2019-08-20 | Video Face Clustering with Unknown Number of Clusters | Makarand Tapaswi et.al. | 1908.03381 | link |
2019-08-08 | Exploiting Sparse Semantic HD Maps for Self-Driving Vehicle Localization | Wei-Chiu Ma et.al. | 1908.03274 | null |
2019-11-21 | Learning to Predict 3D Objects with an Interpolation-based Differentiable Renderer | Wenzheng Chen et.al. | 1908.01210 | null |
2019-07-30 | Deformable Filter Convolution for Point Cloud Reasoning | Yuwen Xiong et.al. | 1907.13079 | null |
2019-07-12 | Gated-SCNN: Gated Shape CNNs for Semantic Segmentation | Towaki Takikawa et.al. | 1907.05740 | null |
2019-06-12 | Neural Graph Evolution: Towards Efficient Automatic Robot Design | Tingwu Wang et.al. | 1906.05370 | link |
2019-05-15 | EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis | Chaoqi Wang et.al. | 1905.05934 | link |
2019-05-15 | DARNet: Deep Active Ray Network for Building Segmentation | Dominic Cheng et.al. | 1905.05889 | null |
2019-05-04 | Deep Multi-Sensor Lane Detection | Min Bai et.al. | 1905.01555 | null |
2019-05-03 | DeepSignals: Predicting Intent of Drivers Through Visual Signals | Davi Frossard et.al. | 1905.01333 | null |
2019-04-25 | Meta-Sim: Learning to Generate Synthetic Datasets | Amlan Kar et.al. | 1904.11621 | null |
2019-04-18 | Deep Rigid Instance Scene Flow | Wei-Chiu Ma et.al. | 1904.08913 | null |
2019-06-09 | Devil is in the Edges: Learning Semantic Boundaries from Noisy Annotations | David Acuna et.al. | 1904.07934 | null |
2019-04-09 | Action Recognition from Single Timestamp Supervision in Untrimmed Videos | Davide Moltisanti et.al. | 1904.04689 | link |
2019-03-27 | Mimicking the In-Camera Color Pipeline for Camera-Aware Object Compositing | Jun Gao et.al. | 1903.11248 | null |
2019-03-16 | Fast Interactive Object Annotation with Curve-GCN | Huan Ling et.al. | 1903.06874 | link |
2019-03-02 | PIXOR: Real-time 3D Object Detection from Point Clouds | Bin Yang et.al. | 1902.06326 | null |
2019-02-12 | ACTRCE: Augmenting Experience via Teacher’s Advice For Multi-Goal Reinforcement Learning | Harris Chan et.al. | 1902.04546 | null |
2019-04-03 | UPSNet: A Unified Panoptic Segmentation Network | Yuwen Xiong et.al. | 1901.03784 | link |
2019-10-23 | LanczosNet: Multi-Scale Deep Graph Convolutional Networks | Renjie Liao et.al. | 1901.01484 | link |
2018-12-04 | A Face-to-Face Neural Conversation Model | Hang Chu et.al. | 1812.01525 | null |
2018-12-04 | SurfConv: Bridging 3D and 2D Convolution for RGBD Images | Hang Chu et.al. | 1812.01519 | link |
2019-03-21 | Learning to Caption Images through a Lifetime by Asking Questions | Kevin Shen et.al. | 1812.00235 | link |
2018-10-23 | A Neural Compositional Paradigm for Image Captioning | Bo Dai et.al. | 1810.09630 | null |
2018-10-13 | Pose Estimation for Objects with Rotational Symmetry | Enric Corona et.al. | 1810.05780 | null |
2020-12-18 | Graph HyperNetworks for Neural Architecture Search | Chris Zhang et.al. | 1810.05749 | null |
2018-10-26 | Neural Guided Constraint Logic Programming for Program Synthesis | Lisa Zhang et.al. | 1809.02840 | link |
2018-06-29 | End-to-end Learning of Multi-sensor 3D Tracking by Detection | Davi Frossard et.al. | 1806.11534 | null |
2018-06-19 | VirtualHome: Simulating Household Activities via Programs | Xavier Puig et.al. | 1806.07011 | null |
2018-06-07 | Color Sails: Discrete-Continuous Palettes for Deep Color Exploration | Maria Shugrina et.al. | 1806.02918 | null |
2018-09-27 | Visual Reasoning by Progressive Module Networks | Seung Wook Kim et.al. | 1806.02453 | null |
2018-07-31 | Scaling Egocentric Vision: The EPIC-KITCHENS Dataset | Dima Damen et.al. | 1804.02748 | null |
2018-03-26 | Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN++ | David Acuna et.al. | 1803.09693 | null |
2019-05-05 | Learning to Reweight Examples for Robust Deep Learning | Mengye Ren et.al. | 1803.09050 | link |
2019-06-27 | Inference in Probabilistic Graphical Models by Graph Neural Networks | KiJung Yoon et.al. | 1803.07710 | null |
2019-11-06 | Reviving and Improving Recurrent Back-Propagation | Renjie Liao et.al. | 1803.06396 | link |
2018-03-16 | Learning deep structured active contours end-to-end | Diego Marcos et.al. | 1803.06329 | link |
2018-03-16 | Graph Partition Neural Networks for Semi-Supervised Classification | Renjie Liao et.al. | 1803.06272 | link |
2018-06-07 | SBNet: Sparse Blocks Network for Fast Inference | Mengye Ren et.al. | 1801.02108 | link |
2018-06-15 | Learning to Act Properly: Predicting and Explaining Affordances from Images | Ching-Yao Chuang et.al. | 1712.07576 | null |
2018-04-15 | MovieGraphs: Towards Understanding Human-Centric Situations from Videos | Paul Vicol et.al. | 1712.06761 | null |
2017-10-19 | Be Your Own Prada: Fashion Synthesis with Structural Coherence | Shizhan Zhu et.al. | 1710.07346 | null |
2017-08-14 | Situation Recognition with Graph Neural Networks | Ruiyu Li et.al. | 1708.04320 | null |
2018-07-29 | VSE++: Improving Visual-Semantic Embeddings with Hard Negatives | Fartash Faghri et.al. | 1707.05612 | link |
2017-07-14 | The Reversible Residual Network: Backpropagation Without Storing Activations | Aidan N. Gomez et.al. | 1707.04585 | link |
2017-11-14 | Few-Shot Learning Through an Information Retrieval Lens | Eleni Triantafillou et.al. | 1707.02610 | null |
2017-06-05 | Teaching Machines to Describe Images via Natural Language Feedback | Huan Ling et.al. | 1706.00130 | null |
2017-04-18 | Annotating Object Instances with a Polygon-RNN | Lluis Castrejon et.al. | 1704.05548 | link |
2017-04-04 | Open Vocabulary Scene Parsing | Hang Zhao et.al. | 1703.08769 | null |
2017-08-11 | Towards Diverse and Natural Image Descriptions via a Conditional GAN | Bo Dai et.al. | 1703.06029 | null |
2017-01-25 | Understanding the Effective Receptive Field in Deep Convolutional Neural Networks | Wenjie Luo et.al. | 1701.04128 | null |
2018-05-08 | MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving | Marvin Teichmann et.al. | 1612.07695 | link |
2016-12-01 | TorontoCity: Seeing the World with a Million Eyes | Shenlong Wang et.al. | 1612.00423 | null |
2017-05-04 | Deep Watershed Transform for Instance Segmentation | Min Bai et.al. | 1611.08303 | null |
2017-03-06 | Normalizing the Normalizers: Comparing and Extending Network Normalization Schemes | Mengye Ren et.al. | 1611.04520 | null |
2016-11-10 | Song From PI: A Musically Plausible Network for Pop Music Generation | Hang Chu et.al. | 1611.03477 | null |
2016-11-10 | Efficient Summarization with Read-Again and Copy Mechanism | Wenyuan Zeng et.al. | 1611.03382 | null |
2017-04-25 | 3D Object Proposals using Stereo Imagery for Accurate Object Class Detection | Xiaozhi Chen et.al. | 1608.07711 | null |
2018-10-16 | Semantic Understanding of Scenes through the ADE20K Dataset | Bolei Zhou et.al. | 1608.05442 | link |
2016-06-23 | Find your Way by Observing the Sun and Other Semantic Cues | Wei-Chiu Ma et.al. | 1606.07415 | null |
2016-04-10 | Soccer Field Localization from a Single Image | Namdar Homayounfar et.al. | 1604.02715 | null |
2016-08-23 | Exploiting Semantic Information and Deep Matching for Optical Flow | Min Bai et.al. | 1604.01827 | null |
2016-04-27 | Instance-Level Segmentation for Autonomous Driving with Deep Densely Connected MRFs | Ziyu Zhang et.al. | 1512.06735 | null |
2016-09-21 | MovieQA: Understanding Stories in Movies through Question-Answering | Makarand Tapaswi et.al. | 1512.02902 | null |
2016-06-02 | Training Deep Neural Networks via Direct Loss Minimization | Yang Song et.al. | 1511.06411 | link |
2016-03-01 | Order-Embeddings of Images and Language | Ivan Vendrov et.al. | 1511.06361 | null |
2016-01-08 | Reduced-Precision Strategies for Bounded Memory in Deep Neural Nets | Patrick Judd et.al. | 1511.05236 | null |
2015-06-22 | Skip-Thought Vectors | Ryan Kiros et.al. | 1506.06726 | null |
2015-06-22 | Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books | Yukun Zhu et.al. | 1506.06724 | null |
2015-09-25 | Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions | Jimmy Ba et.al. | 1506.00511 | null |
2015-12-18 | Monocular Object Instance Segmentation and Depth Ordering with CNNs | Ziyu Zhang et.al. | 1505.03159 | null |
2015-03-09 | Fully Connected Deep Structured Networks | Alexander G. Schwing et.al. | 1503.02351 | null |
2015-02-28 | Generating Multi-Sentence Lingual Descriptions of Indoor Scenes | Dahua Lin et.al. | 1503.00064 | null |
2015-02-15 | segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection | Yukun Zhu et.al. | 1502.04275 | null |
2015-02-05 | A Framework for Symmetric Part Detection in Cluttered Scenes | Tom Lee et.al. | 1502.01761 | null |
2014-08-09 | Video In Sentences Out | Andrei Barbu et.al. | 1408.6418 | null |
2014-08-23 | Learning a Hierarchical Compositional Shape Vocabulary for Multi-class Object Representation | Sanja Fidler et.al. | 1408.5516 | null |
2014-12-25 | FollowMe: Efficient Online Min-Cost Flow Tracking with Bounded Memory and Computation | Philip Lenz et.al. | 1407.6251 | null |
2015-04-27 | Learning Deep Structured Models | Liang-Chieh Chen et.al. | 1407.2538 | null |
2014-06-16 | Human-Machine CRFs for Identifying Bottlenecks in Holistic Scene Understanding | Roozbeh Mottaghi et.al. | 1406.3906 | null |
2014-06-08 | Detect What You Can: Detecting and Representing Objects using Holistic Models and Body Parts | Xianjie Chen et.al. | 1406.2031 | null |
2013-08-30 | Blending Learning and Inference in Structured Prediction | Tamir Hazan et.al. | 1210.2346 | null |
2012-06-27 | Efficient Structured Prediction with Latent Variables for General Graphical Models | Alexander Schwing et.al. | 1206.6436 | null |
2012-06-13 | Multi-View Learning in the Presence of View Disagreement | C. Christoudias et.al. | 1206.3242 | null |
2012-04-12 | Video In Sentences Out | Andrei Barbu et.al. | 1204.2742 | null |
2012-04-06 | Continuous Markov Random Fields for Robust Stereo Estimation | Koichiro Yamaguchi et.al. | 1204.1393 | null |
2012-07-09 | Approximated Structured Prediction for Learning Large Scale Graphical Models | Tamir Hazan et.al. | 1006.2899 | null |
Autonomous Driving
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-15 | Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning | Milan Ganai et.al. | 2505.10547 | null |
2025-05-15 | Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps | Ningyuan Yang et.al. | 2505.10482 | null |
2025-05-15 | Inferring Driving Maps by Deep Learning-based Trail Map Extraction | Michael Hubbertz et.al. | 2505.10258 | null |
2025-05-15 | Large Wireless Localization Model (LWLM): A Foundation Model for Positioning in 6G Networks | Guangjin Pan et.al. | 2505.10134 | link |
2025-05-15 | Application of YOLOv8 in monocular downward multiple Car Target detection | Shijie Lyu et.al. | 2505.10016 | null |
2025-05-14 | Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object Shapes | Nicola Marinello et.al. | 2505.09562 | null |
2025-05-15 | SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation | Achref Doula et.al. | 2505.09427 | null |
2025-05-14 | MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection | Xiangyuan Peng et.al. | 2505.09422 | null |
2025-05-14 | FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling | Yue Wen et.al. | 2505.09406 | null |
2025-05-14 | APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression | Srinivas Ravuri et.al. | 2505.09356 | link |
2025-05-14 | TransDiffuser: End-to-end Trajectory Generation with Decorrelated Multi-modal Representation for Autonomous Driving | Xuefeng Jiang et.al. | 2505.09315 | null |
2025-05-14 | OpenLKA: An Open Dataset of Lane Keeping Assist from Recent Car Models under Real-world Driving Conditions | Yuhang Wang et.al. | 2505.09092 | link |
2025-05-14 | Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions | Letian Wang et.al. | 2505.09074 | null |
2025-05-13 | Towards Adaptive Meta-Gradient Adversarial Examples for Visual Tracking | Wei-Long Tian et.al. | 2505.08999 | link |
2025-05-13 | Generative AI for Autonomous Driving: Frontiers and Opportunities | Yuping Wang et.al. | 2505.08854 | link |
2025-05-13 | Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving | Zongchuang Zhao et.al. | 2505.08725 | link |
2025-05-13 | Optimal Trajectory Planning with Collision Avoidance for Autonomous Vehicle Maneuvering | Jason Zalev et.al. | 2505.08724 | null |
2025-05-13 | Explaining Autonomous Vehicles with Intention-aware Policy Graphs | Sara Montese et.al. | 2505.08404 | null |
2025-05-13 | A Practical Introduction to Deep Reinforcement Learning | Yinghan Sun et.al. | 2505.08295 | null |
2025-05-13 | Automatic Curriculum Learning for Driving Scenarios: Towards Robust and Efficient Reinforcement Learning | Ahmed Abouelazm et.al. | 2505.08264 | null |
2025-05-13 | Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix | Unai Gurbindo et.al. | 2505.08228 | null |
2025-05-12 | Topology-Guided Knowledge Distillation for Efficient Point Cloud Processing | Luu Tung Hai et.al. | 2505.08101 | link |
2025-05-12 | Vision Foundation Model Embedding-Based Semantic Anomaly Detection | Max Peter Ronecker et.al. | 2505.07998 | null |
2025-05-12 | Ranking-aware Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2505.07198 | null |
2025-05-11 | DriveSOTIF: Advancing Perception SOTIF Through Multimodal Large Language Models | Shucheng Huang et.al. | 2505.07084 | link |
2025-05-11 | Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation | Seokjun Kwon et.al. | 2505.06951 | null |
2025-05-11 | Towards Human-Centric Autonomous Driving: A Fast-Slow Architecture Integrating Large Language Model Guidance with Reinforcement Learning | Chengkai Xu et.al. | 2505.06875 | null |
2025-05-11 | Beyond Patterns: Harnessing Causal Logic for Autonomous Driving Trajectory Prediction | Bonan Wang et.al. | 2505.06856 | null |
2025-05-13 | Work-in-Progress: Multi-Deadline DAG Scheduling Model for Autonomous Driving Systems | Atsushi Yano et.al. | 2505.06780 | null |
2025-05-10 | AI-CDA4All: Democratizing Cooperative Autonomous Driving for All Drivers via Affordable Dash-cam Hardware and Open-source AI Software | Shengming Yuan et.al. | 2505.06749 | null |
2025-05-10 | M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | Morui Zhu et.al. | 2505.06746 | null |
2025-05-10 | TPK: Trustworthy Trajectory Prediction Integrating Prior Knowledge For Interpretability and Kinematic Feasibility | Marius Baden et.al. | 2505.06743 | null |
2025-05-10 | Boundary-Guided Trajectory Prediction for Road Aware and Physically Feasible Autonomous Driving | Ahmed Abouelazm et.al. | 2505.06740 | null |
2025-05-10 | Balancing Progress and Safety: A Novel Risk-Aware Objective for RL in Autonomous Driving | Ahmed Abouelazm et.al. | 2505.06737 | null |
2025-05-10 | A Contrastive Federated Semi-Supervised Learning Intrusion Detection Framework for Internet of Robotic Things | Yifan Zeng et.al. | 2505.06636 | null |
2025-05-10 | RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation | Zhiwen Zeng et.al. | 2505.06515 | null |
2025-05-10 | Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach | Minting Pan et.al. | 2505.06482 | null |
2025-05-09 | What Do People Want to Know About Artificial Intelligence (AI)? The Importance of Answering End-User Questions to Explain Autonomous Vehicle (AV) Decisions | Somayeh Molaei et.al. | 2505.06428 | link |
2025-05-09 | Natural Reflection Backdoor Attack on Vision Language Model for Autonomous Driving | Ming Liu et.al. | 2505.06413 | null |
2025-05-15 | UniVLA: Learning to Act Anywhere with Task-centric Latent Actions | Qingwen Bu et.al. | 2505.06111 | link |
2025-05-09 | Priority-Driven Safe Model Predictive Control Approach to Autonomous Driving Applications | Francesco Prignoli et.al. | 2505.05933 | null |
2025-05-08 | Closing the Loop: Motion Prediction Models beyond Open-Loop Benchmarks | Mohamed-Khalil Bouzidi et.al. | 2505.05638 | null |
2025-05-08 | Flight Validation of Learning-Based Trajectory Optimization for the Astrobee Free-Flyer | Somrita Banerjee et.al. | 2505.05588 | null |
2025-05-02 | MDDFNet: Mamba-based Dynamic Dual Fusion Network for Traffic Sign Detection | TianYi Yu et.al. | 2505.05491 | null |
2025-05-08 | 3D Scene Generation: A Survey | Beichen Wen et.al. | 2505.05474 | link |
2025-05-08 | DSDrive: Distilling Large Language Model for Lightweight End-to-End Autonomous Driving with Unified Reasoning and Planning | Wenru Liu et.al. | 2505.05360 | null |
2025-05-08 | PADriver: Towards Personalized Autonomous Driving | Genghua Kou et.al. | 2505.05240 | null |
2025-05-08 | Multi-Objective Reinforcement Learning for Adaptive Personalized Autonomous Driving | Hendrik Surmann et.al. | 2505.05223 | null |
2025-05-08 | X-Driver: Explainable Autonomous Driving with Vision-Language Models | Wei Liu et.al. | 2505.05098 | null |
2025-05-08 | LVLM-MPC Collaboration for Autonomous Driving: A Safety-Aware and Task-Scalable Control Architecture | Kazuki Atsuta et.al. | 2505.04980 | null |
2025-05-07 | Crafting Physical Adversarial Examples by Combining Differentiable and Physically Based Renders | Yuqiu Liu et.al. | 2505.04662 | null |
2025-05-07 | DFVO: Learning Darkness-free Visible and Infrared Image Disentanglement and Fusion All at Once | Qi Zhou et.al. | 2505.04526 | link |
2025-05-07 | Do We Still Need to Work on Odometry for Autonomous Driving? | Cedric Le Gentil et.al. | 2505.04438 | null |
2025-05-07 | Predicting Road Surface Anomalies by Visual Tracking of a Preceding Vehicle | Petr Jahoda et.al. | 2505.04392 | null |
2025-05-07 | Verification of Digital Twins using Classical and Statistical Model Checking | Raghavendran Gunasekaran et.al. | 2505.04322 | null |
2025-05-07 | Multi-Agent Reinforcement Learning-based Cooperative Autonomous Driving in Smart Intersections | Taoyuan Yu et.al. | 2505.04231 | null |
2025-05-07 | Reliable Disentanglement Multi-view Learning Against View Adversarial Attacks | Xuyang Wang et.al. | 2505.04046 | link |
2025-05-06 | Frenet Corridor Planner: An Optimal Local Path Planning Framework for Autonomous Driving | Faizan M. Tariq et.al. | 2505.03695 | null |
2025-05-06 | Moral Testing of Autonomous Driving Systems | Wenbing Tang et.al. | 2505.03683 | null |
2025-05-06 | CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting | Huawei Sun et.al. | 2505.03679 | null |
2025-05-06 | Coop-WD: Cooperative Perception with Weighting and Denoising for Robust V2V Communication | Chenguang Liu et.al. | 2505.03528 | null |
2025-05-06 | RIFT: Closed-Loop RL Fine-Tuning for Realistic and Controllable Traffic Simulation | Keyu Chen et.al. | 2505.03344 | null |
2025-05-06 | Artificial Behavior Intelligence: Technology, Challenges, and Future Directions | Kanghyun Jo et.al. | 2505.03315 | null |
2025-05-06 | 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation | Andrew Caunes et.al. | 2505.03300 | null |
2025-05-06 | RobotxR1: Enabling Embodied Robotic Intelligence on Large Language Models through Closed-Loop Reinforcement Learning | Liam Boyle et.al. | 2505.03238 | null |
2025-05-06 | VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis | Xinyuan Yan et.al. | 2505.03132 | null |
2025-05-04 | Risk Assessment and Threat Modeling for safe autonomous driving technology | Ian Alexis Wong Paz et.al. | 2505.02231 | null |
2025-05-04 | Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning | Shangzhe Li et.al. | 2505.02228 | null |
2025-05-04 | Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving | Alexey Nekrasov et.al. | 2505.02148 | null |
2025-05-04 | DriveAgent: Multi-Agent Structured Reasoning with LLM and Multimodal Sensor Fusion for Autonomous Driving | Xinmeng Hou et.al. | 2505.02123 | link |
2025-05-04 | Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation | Volodymyr Havrylov et.al. | 2505.02075 | link |
2025-05-03 | DriveNetBench: An Affordable and Configurable Single-Camera Benchmarking System for Autonomous Driving Networks | Ali Al-Bustami et.al. | 2505.01893 | link |
2025-05-03 | CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture | Vladimir Frants et.al. | 2505.01882 | null |
2025-05-03 | PhysNav-DG: A Novel Adaptive Framework for Robust VLM-Sensor Fusion in Navigation Applications | Trisanth Srinivasan et.al. | 2505.01881 | null |
2025-05-03 | DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Haoteng Li et.al. | 2505.01857 | null |
2025-05-03 | Edge-Cloud Collaborative Computing on Distributed Intelligence and Model Optimization: A Survey | Jing Liu et.al. | 2505.01821 | null |
2025-05-03 | PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth | Bu Jin et.al. | 2505.01729 | null |
2025-05-02 | Deformable Cargo Transport in Microgravity with Astrobee | Daniel Morton et.al. | 2505.01630 | null |
2025-04-28 | Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving | Alkis Sygkounas et.al. | 2505.01440 | null |
2025-05-02 | Multi-Objective Reinforcement Learning for Water Management | Zuzanna Osika et.al. | 2505.01094 | null |
2025-05-02 | LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment | Jiahuan Long et.al. | 2505.00980 | null |
2025-05-02 | Seeking to Collide: Online Safety-Critical Scenario Generation for Autonomous Driving with Retrieval Augmented Large Language Models | Yuewen Mei et.al. | 2505.00972 | null |
2025-05-01 | Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L | Woong-Chan Byun et.al. | 2505.00757 | null |
2025-05-01 | Controllable Weather Synthesis and Removal with Video Diffusion Models | Chih-Hao Lin et.al. | 2505.00704 | null |
2025-05-01 | Safety-Critical Traffic Simulation with Guided Latent Diffusion Model | Mingxing Peng et.al. | 2505.00515 | null |
2025-05-01 | Inconsistency-based Active Learning for LiDAR Object Detection | Esteban Rivera et.al. | 2505.00511 | null |
2025-05-05 | HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection | Esteban Rivera et.al. | 2505.00507 | null |
2025-05-01 | iMacSR: Intermediate Multi-Access Supervision and Regularization in Training Autonomous Driving Models | Wei-Bin Kou et.al. | 2505.00404 | null |
2025-05-01 | FedEMA: Federated Exponential Moving Averaging with Negative Entropy Regularizer in Autonomous Driving | Wei-Bin Kou et.al. | 2505.00318 | null |
2025-05-01 | LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving | Zhijie Qiao et.al. | 2505.00284 | link |
2025-04-30 | V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving | Jannik Lübberstedt et.al. | 2505.00156 | null |
2025-04-30 | TinyMA-IEI-PPO: Exploration Incentive-Driven Multi-Agent DRL with Self-Adaptive Pruning for Vehicular Embodied AI Agent Twins Migration | Zhuoqi Zeng et.al. | 2505.00055 | null |
2025-04-30 | A Survey of Interactive Generative Video | Jiwen Yu et.al. | 2504.21853 | null |
2025-05-08 | REHEARSE-3D: A Multi-modal Emulated Rain Dataset for 3D Point Cloud De-raining | Abu Mohammed Raisuddin et.al. | 2504.21699 | null |
2025-04-29 | Composite Safety Potential Field for Highway Driving Risk Assessment | Dachuan Zuo et.al. | 2504.21158 | null |
2025-04-29 | Automated Parking Trajectory Generation Using Deep Reinforcement Learning | Zheyu Zhang et.al. | 2504.21071 | null |
2025-04-29 | Neural Stereo Video Compression with Hybrid Disparity Compensation | Shiyin Jiang et.al. | 2504.20383 | null |
2025-04-28 | AI Recommendation Systems for Lane-Changing Using Adherence-Aware Reinforcement Learning | Weihao Sun et.al. | 2504.20187 | null |
2025-04-28 | Learning Streaming Video Representation via Multitask Training | Yibin Yan et.al. | 2504.20041 | null |
2025-04-28 | Socially-Aware Autonomous Driving: Inferring Yielding Intentions for Safer Interactions | Jing Wang et.al. | 2504.20004 | null |
2025-04-28 | The ATLAS of Traffic Lights: A Reliable Perception Framework for Autonomous Driving | Rupert Polley et.al. | 2504.19722 | null |
2025-04-28 | Open-set Anomaly Segmentation in Complex Scenarios | Song Xia et.al. | 2504.19706 | null |
2025-05-04 | ARTEMIS: Autoregressive End-to-End Trajectory Planning with Mixture of Experts for Autonomous Driving | Renju Feng et.al. | 2504.19580 | link |
2025-04-28 | CE-NPBG: Connectivity Enhanced Neural Point-Based Graphics for Novel View Synthesis in Autonomous Driving Scenes | Mohammad Altillawi et.al. | 2504.19557 | null |
2025-04-27 | CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis | Alexander Baumann et.al. | 2504.19223 | null |
2025-05-07 | LRFusionPR: A Polar BEV-Based LiDAR-Radar Fusion Network for Place Recognition | Zhangshuo Qi et.al. | 2504.19186 | link |
2025-04-27 | Segmenting Objectiveness and Task-awareness Unknown Region for Autonomous Driving | Mi Zheng et.al. | 2504.19183 | null |
2025-04-27 | Towards Latency-Aware 3D Streaming Perception for Autonomous Driving | Jiaqi Peng et.al. | 2504.19115 | null |
2025-04-26 | Safety Interventions against Adversarial Patches in an Open-Source Driver Assistance System | Cheng Chen et.al. | 2504.18990 | null |
2025-04-26 | Federated Learning-based Semantic Segmentation for Lane and Object Detection in Autonomous Driving | Gharbi Khamis Alshammari et.al. | 2504.18939 | null |
2025-04-26 | Advanced Longitudinal Control and Collision Avoidance for High-Risk Edge Cases in Autonomous Driving | Dianwei Chen et.al. | 2504.18931 | null |
2025-04-26 | Imitation Learning for Autonomous Driving: Insights from Real-World Testing | Hidayet Ersin Dursun et.al. | 2504.18847 | link |
2025-05-01 | Zero-Day Botnet Attack Detection in IoV: A Modular Approach Using Isolation Forests and Particle Swarm Optimization | Abdelaziz Amara Korba et.al. | 2504.18814 | null |
2025-04-26 | Depth as Points: Center Point-based Depth Estimation | Zhiheng Tu et.al. | 2504.18773 | null |
2025-04-22 | DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment | Xiaofan Li et.al. | 2504.18576 | null |
2025-04-25 | NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration | Haotian Dong et.al. | 2504.18448 | null |
2025-04-25 | What is the Added Value of UDA in the VFM Era? | Brunó B. Englert et.al. | 2504.18190 | null |
2025-04-25 | Study on Real-Time Road Surface Reconstruction Using Stereo Vision | Deepak Ghimire et.al. | 2504.18112 | null |
2025-04-24 | CaRL: Learning Scalable Planning Policies with Simple Rewards | Bernhard Jaeger et.al. | 2504.17838 | null |
2025-04-10 | My Precious Crash Data: Barriers and Opportunities in Encouraging Autonomous Driving Companies to Share Safety-Critical Data | Hauke Sandhaus et.al. | 2504.17792 | null |
2025-04-24 | Learning Isometric Embeddings of Road Networks using Multidimensional Scaling | Juan Carlos Climent Pardo et.al. | 2504.17534 | null |
2025-04-24 | Longitudinal Control for Autonomous Racing with Combustion Engine Vehicles | Phillip Pitschi et.al. | 2504.17418 | null |
2025-04-24 | S2S-Net: Addressing the Domain Gap of Heterogeneous Sensor Systems in LiDAR-Based Collective Perception | Sven Teufel et.al. | 2504.17399 | null |
2025-04-25 | Highly Accurate and Diverse Traffic Data: The DeepScenario Open 3D Dataset | Oussema Dhaouadi et.al. | 2504.17371 | null |
2025-04-23 | Meta-Learning Online Dynamics Model Adaptation in Off-Road Autonomous Driving | Jacob Levy et.al. | 2504.16923 | null |
2025-04-23 | Gaussian Splatting is an Effective Data Generator for 3D Object Detection | Farhad G. Zanjani et.al. | 2504.16740 | null |
2025-04-25 | Using Causal Inference to Test Systems with Hidden and Interacting Variables: An Evaluative Case Study | Michael Foster et.al. | 2504.16526 | null |
2025-04-23 | Circinus: Efficient Query Planner for Compound ML Serving | Banruo Liu et.al. | 2504.16397 | null |
2025-04-23 | SILM: A Subjective Intent Based Low-Latency Framework for Multiple Traffic Participants Joint Trajectory Prediction | Qu Weiming et.al. | 2504.16377 | null |
2025-04-23 | DPGP: A Hybrid 2D-3D Dual Path Potential Ghost Probe Zone Prediction Framework for Safe Autonomous Driving | Weiming Qu et.al. | 2504.16374 | null |
2025-04-23 | Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection | Linhua Kong et.al. | 2504.16368 | null |
2025-04-15 | Shape Your Ground: Refining Road Surfaces Beyond Planar Representations | Oussema Dhaouadi et.al. | 2504.16103 | null |
2025-04-22 | Describe Anything: Detailed Localized Image and Video Captioning | Long Lian et.al. | 2504.16072 | null |
2025-04-22 | MS-Occ: Multi-Stage LiDAR-Camera Fusion for 3D Semantic Occupancy Prediction | Zhiqiang Wei et.al. | 2504.15888 | null |
2025-04-22 | Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models | Quentin Herau et.al. | 2504.15776 | null |
2025-04-22 | Dynamic Intent Queries for Motion Transformer-based Trajectory Prediction | Tobias Demmler et.al. | 2504.15766 | null |
2025-04-22 | SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems | Manjunath D et.al. | 2504.15728 | null |
2025-04-22 | RiskNet: Interaction-Aware Risk Forecasting for Autonomous Driving in Long-Tail Scenarios | Qichao Liu et.al. | 2504.15541 | null |
2025-04-29 | Improving Human-AI Coordination through Adversarial Training and Generative Models | Paresh Chaudhary et.al. | 2504.15457 | null |
2025-04-22 | DRAWER: Digital Reconstruction and Articulation With Environment Realism | Hongchi Xia et.al. | 2504.15278 | null |
2025-04-20 | Adaptive Field Effect Planner for Safe Interactive Autonomous Driving on Curved Roads | Qinghao Li et.al. | 2504.14747 | null |
2025-04-20 | SMTT: Novel Structured Multi-task Tracking with Graph-Regularized Sparse Representation for Robust Thermal Infrared Target Tracking | Shang Zhang et.al. | 2504.14566 | null |
2025-04-24 | Should Benevolent Deception be Allowed in EHMI? A Mechanism Explanation Based on Game Theory | Linkun Liu et.al. | 2504.14539 | null |
2025-04-20 | Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding | Tong Zeng et.al. | 2504.14526 | link |
2025-04-19 | A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models | Chengming Wang et.al. | 2504.14241 | null |
2025-04-19 | ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision | Xie Liang et.al. | 2504.14240 | null |
2025-04-19 | Lightweight Road Environment Segmentation using Vector Quantization | Jiyong Kwag et.al. | 2504.14113 | null |
2025-04-18 | LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models | Haiwen Huang et.al. | 2504.14032 | link |
2025-04-18 | Statistical Analysis and End-to-End Performance Evaluation of Traffic Models for Automotive Data | Marcello Bullo et.al. | 2504.14017 | null |
2025-04-18 | LMPOcc: 3D Semantic Occupancy Prediction Utilizing Long-Term Memory Prior from Historical Traversals | Shanshuai Yuan et.al. | 2504.13596 | null |
2025-04-18 | Testing the Fault-Tolerance of Multi-Sensor Fusion Perception in Autonomous Driving Systems | Haoxiang Tian et.al. | 2504.13420 | null |
2025-04-21 | LangCoop: Collaborative Driving with Language | Xiangbo Gao et.al. | 2504.13406 | link |
2025-04-18 | Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Shashank Shriram et.al. | 2504.13399 | link |
2025-04-17 | UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty | Pengxuan Yang et.al. | 2504.12826 | link |
2025-04-17 | Explainable Scene Understanding with Qualitative Representations and Graph Neural Networks | Nassim Belmecheri et.al. | 2504.12817 | null |
2025-04-17 | Approaching Current Challenges in Developing a Software Stack for Fully Autonomous Driving | Simon Sagmeister et.al. | 2504.12813 | null |
2025-04-17 | Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving | Shumin Wang et.al. | 2504.12709 | null |
2025-04-17 | Collaborative Perception Datasets for Autonomous Driving: A Review | Naibang Wang et.al. | 2504.12696 | link |
2025-04-17 | Two Tasks, One Goal: Uniting Motion and Planning for Excellent End To End Autonomous Driving Performance | Lin Liu et.al. | 2504.12667 | null |
2025-04-16 | Self-Supervised Traversability Learning with Online Prototype Adaptation for Off-Road Autonomous Driving | Yafeng Bu et.al. | 2504.12109 | null |
2025-04-16 | Contract-based hierarchical control using predictive feasibility value functions | Felix Berkel et.al. | 2504.12036 | null |
2025-04-15 | Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Nicolas Baumann et.al. | 2504.11514 | link |
2025-04-11 | High Dynamic Range Modulo Imaging for Robust Object Detection in Autonomous Driving | Kebin Contreras et.al. | 2504.11472 | null |
2025-04-30 | GATE3D: Generalized Attention-based Task-synergized Estimation in 3D* | Eunsoo Im et.al. | 2504.11014 | null |
2025-04-15 | Can Vision-Language Models Understand and Interpret Dynamic Gestures from Pedestrians? Pilot Datasets and Exploration Towards Instructive Nonverbal Commands for Cooperative Autonomous Vehicles | Tonko E. W. Bossen et.al. | 2504.10873 | null |
2025-04-15 | PatrolVision: Automated License Plate Recognition in the wild | Anmol Singhal Navya Singhal et.al. | 2504.10810 | null |
2025-04-14 | ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models | Amirhosein Chahe et.al. | 2504.10757 | link |
2025-04-14 | FuzzSense: Towards A Modular Fuzzing Framework for Autonomous Driving Software | Andrew Roberts et.al. | 2504.10717 | null |
2025-04-14 | Decoupled Diffusion Sparks Adaptive Scene Generation | Yunsong Zhou et.al. | 2504.10485 | null |
2025-04-14 | Siamese Network with Dual Attention for EEG-Driven Social Learning: Bridging the Human-Robot Gap in Long-Tail Autonomous Driving | Xiaoshan Zhou et.al. | 2504.10296 | null |
2025-04-14 | LMFormer: Lane based Motion Prediction Transformer | Harsh Yadav et.al. | 2504.10275 | null |
2025-04-14 | Vision based driving agent for race car simulation environments | Gergely Bári et.al. | 2504.10266 | null |
2025-04-14 | Lightweight Trustworthy Distributed Clustering | Hongyang Li et.al. | 2504.10109 | null |
2025-04-14 | Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration | Jiani Ni et.al. | 2504.10007 | null |
2025-04-14 | Towards Resilient Tracking in Autonomous Vehicles: A Distributionally Robust Input and State Estimation Approach | Kasra Azizi et.al. | 2504.09974 | null |
2025-04-13 | FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird’s Eye View | Yuting Zhao et.al. | 2504.09535 | null |
2025-04-13 | ADDT – A Digital Twin Framework for Proactive Safety Validation in Autonomous Driving Systems | Bo Yu et.al. | 2504.09461 | null |
2025-04-12 | Minority Reports: Balancing Cost and Quality in Ground Truth Data Annotation | Hsuan Wei Liao et.al. | 2504.09341 | null |
2025-04-12 | ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking | Tzoulio Chamiti et.al. | 2504.09195 | null |
2025-04-11 | Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing | Vinal Asodia et.al. | 2504.08704 | null |
2025-04-11 | TinyCenterSpeed: Efficient Center-Based Object Detection for Autonomous Racing | Neil Reichlin et.al. | 2504.08655 | link |
2025-04-11 | Shadow Erosion and Nighttime Adaptability for Camera-Based Automated Driving Applications | Mohamed Sabry et.al. | 2504.08551 | null |
2025-04-11 | Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review | Jörg Gamerdinger et.al. | 2504.08540 | null |
2025-04-11 | Road Grip Uncertainty Estimation Through Surface State Segmentation | Jyri Maanpää et.al. | 2504.08452 | null |
2025-04-11 | Scholar Inbox: Personalized Paper Recommendations for Scientists | Markus Flicke et.al. | 2504.08385 | null |
2025-04-11 | SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis | Yi Chen et.al. | 2504.08361 | link |
2025-04-11 | InSPE: Rapid Evaluation of Heterogeneous Multi-Modal Infrastructure Sensor Placement | Zhaoliang Zheng et.al. | 2504.08240 | null |
2025-04-11 | VL-UR: Vision-Language-guided Universal Restoration of Images Degraded by Adverse Weather Conditions | Ziyan Liu et.al. | 2504.08219 | null |
2025-04-11 | EO-VLM: VLM-Guided Energy Overload Attacks on Vision Models | Minjae Seo et.al. | 2504.08205 | null |
2025-04-10 | Investigating Vision-Language Model for Point Cloud-based Vehicle Classification | Yiqiao Li et.al. | 2504.08154 | null |
2025-04-10 | X-DECODE: EXtreme Deblurring with Curriculum Optimization and Domain Equalization | Sushant Gautam et.al. | 2504.08072 | link |
2025-04-10 | Detect Anything 3D in the Wild | Hanxue Zhang et.al. | 2504.07958 | null |
2025-04-10 | RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions | Youngwan Jin et.al. | 2504.07603 | null |
2025-05-09 | Drive in Corridors: Enhancing the Safety of End-to-end Autonomous Driving via Corridor Learning and Planning | Zhiwei Zhang et.al. | 2504.07507 | null |
2025-04-09 | MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Chang Nie et.al. | 2504.06863 | null |
2025-04-09 | Dynamic Residual Safe Reinforcement Learning for Multi-Agent Safety-Critical Scenarios Decision-Making | Kaifeng Wang et.al. | 2504.06670 | null |
2025-04-10 | Uni-PrevPredMap: Extending PrevPredMap to a Unified Framework of Prior-Informed Modeling for Online Vectorized HD Map Construction | Nan Peng et.al. | 2504.06647 | link |
2025-04-09 | CAFE-AD: Cross-Scenario Adaptive Feature Enhancement for Trajectory Planning in Autonomous Driving | Junrui Zhang et.al. | 2504.06584 | link |
2025-04-09 | Robo-taxi Fleet Coordination at Scale via Reinforcement Learning | Luigi Tresca et.al. | 2504.06125 | link |
2025-04-08 | Uncertainty-Aware Hybrid Machine Learning in Virtual Sensors for Vehicle Sideslip Angle Estimation | Abinav Kalyanasundaram et.al. | 2504.06105 | null |
2025-04-08 | PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario | Sriram Mandalika et.al. | 2504.05908 | null |
2025-04-08 | Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments | Dolton Fernandes et.al. | 2504.05840 | null |
2025-04-08 | SAP-CoPE: Social-Aware Planning using Cooperative Pose Estimation with Infrastructure Sensor Nodes | Minghao Ning et.al. | 2504.05727 | link |
2025-04-08 | POD: Predictive Object Detection with Single-Frame FMCW LiDAR Point Cloud | Yining Shi et.al. | 2504.05649 | null |
2025-05-06 | DyTTP: Trajectory Prediction with Normalization-Free Transformers | JianLin Zhu et.al. | 2504.05356 | null |
2025-04-07 | Texture2LoD3: Enabling LoD3 Building Reconstruction With Panoramic Images | Wenzhao Tang et.al. | 2504.05249 | null |
2025-04-07 | Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework | Yu Min Park et.al. | 2504.05187 | null |
2025-04-07 | Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection | Jon Gutiérrez Zaballa et.al. | 2504.05119 | null |
2025-04-07 | MIAT: Maneuver-Intention-Aware Transformer for Spatio-Temporal Trajectory Prediction | Chandra Raskoti et.al. | 2504.05059 | null |
2025-04-07 | GAMDTP: Dynamic Trajectory Prediction with Graph Attention Mamba Network | Yunxiang Liu et.al. | 2504.04862 | null |
2025-04-07 | Prior2Former – Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation | Sebastian Schmidt et.al. | 2504.04841 | null |
2025-04-07 | Large-Scale Mixed-Traffic and Intersection Control using Multi-agent Reinforcement Learning | Songyang Liu et.al. | 2504.04691 | link |
2025-04-06 | Targetless LiDAR-Camera Calibration with Anchored 3D Gaussians | Haebeom Jung et.al. | 2504.04597 | null |
2025-05-05 | “Trust me on this” Explaining Agent Behavior to a Human Terminator | Uri Menkes et.al. | 2504.04592 | null |
2025-04-06 | Understanding Collective Stability of ACC Systems: From Theory to Real-World Observations | Raphael Korbmacher et.al. | 2504.04530 | null |
2025-04-06 | Driving-RAG: Driving Scenarios Embedding, Search, and RAG Applications | Cheng Chang et.al. | 2504.04419 | null |
2025-04-16 | OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning | Shihao Wang et.al. | 2504.04348 | null |
2025-04-06 | Data Scaling Laws for End-to-End Autonomous Driving | Alexander Naumann et.al. | 2504.04338 | null |
2025-04-05 | Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications | Brayan Monroy et.al. | 2504.04228 | null |
2025-04-05 | An Optimized Density-Based Lane Keeping System for A Cost-Efficient Autonomous Vehicle Platform: AurigaBot V1 | Farbod Younesi et.al. | 2504.04217 | null |
2025-04-05 | JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration | Yunlong Lin et.al. | 2504.04158 | null |
2025-04-05 | EMF: Event Meta Formers for Event-based Real-time Traffic Object Detection | Muhammad Ahmed Ullah Khan et.al. | 2504.04124 | null |
2025-04-10 | LATTE: Lightweight Attention-based Traffic Accident Anticipation Engine | Jiaxun Zhang et.al. | 2504.04103 | null |
2025-04-04 | Control Map Distribution using Map Query Bank for Online Map Generation | Ziming Liu et.al. | 2504.03868 | null |
2025-04-02 | Exploiting the Uncertainty of the Longest Paths: Response Time Analysis for Probabilistic DAG Tasks | Yiyang Gao et.al. | 2504.03754 | null |
2025-04-28 | Revisiting Outage for Edge Inference Systems | Zhanwei Wang et.al. | 2504.03686 | null |
2025-04-04 | PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector | Kaidong Li et.al. | 2504.03563 | null |
2025-04-07 | ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving | Sheng Yang et.al. | 2504.03438 | null |
2025-04-07 | NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving | Kexin Tian et.al. | 2504.03164 | null |
2025-04-04 | Taming High-Dimensional Dynamics: Learning Optimal Projections onto Spectral Submanifolds | Hugo Buurmeijer et.al. | 2504.03157 | link |
2025-04-03 | VIP: Video Inpainting Pipeline for Real World Human Removal | Huiming Sun et.al. | 2504.03041 | null |
2025-04-03 | Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments | Chenyu Zhang et.al. | 2504.02918 | null |
2025-04-02 | Enhancing Traffic Sign Recognition On The Performance Based On Yolov8 | Baba Ibrahim et.al. | 2504.02884 | null |
2025-04-28 | CHARMS: A Cognitive Hierarchical Agent for Reasoning and Motion Stylization in Autonomous Driving | Jingyi Wang et.al. | 2504.02450 | link |
2025-04-03 | MinkOcc: Towards real-time label-efficient semantic occupancy prediction | Samuel Sze et.al. | 2504.02270 | null |
2025-04-02 | On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software | Ali Nouri et.al. | 2504.02141 | null |
2025-04-01 | Semantic SLAM with Rolling-Shutter Cameras and Low-Precision INS in Outdoor Environments | Yuchen Zhang et.al. | 2504.01997 | null |
2025-03-31 | A Concise Survey on Lane Topology Reasoning for HD Mapping | Yi Yao et.al. | 2504.01989 | null |
2025-04-03 | Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting | Shu-Wei Lu et.al. | 2504.01957 | null |
2025-04-09 | End-to-End Driving with Online Trajectory Evaluation via BEV World Model | Yingyan Li et.al. | 2504.01941 | link |
2025-04-12 | Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation | Junjie Chen et.al. | 2504.01668 | null |
2025-04-02 | Building Knowledge from Interactions: An LLM-Based Architecture for Adaptive Tutoring and Social Reasoning | Luca Garello et.al. | 2504.01588 | null |
2025-04-02 | Deep LG-Track: An Enhanced Localization-Confidence-Guided Multi-Object Tracker | Ting Meng et.al. | 2504.01457 | null |
2025-04-02 | DF-Calib: Targetless LiDAR-Camera Calibration via Depth Flow | Shu Han et.al. | 2504.01416 | null |
2025-04-02 | Pedestrian-Aware Motion Planning for Autonomous Driving in Complex Urban Scenarios | Korbinian Moller et.al. | 2504.01409 | link |
2025-04-02 | From Shadows to Safety: Occlusion Tracking and Risk Mitigation for Urban Autonomous Driving | Korbinian Moller et.al. | 2504.01408 | link |
2025-03-31 | Cal or No Cal? – Real-Time Miscalibration Detection of LiDAR and Camera Sensors | Ilir Tahiraj et.al. | 2504.01040 | link |
2025-03-26 | Omnidirectional Depth-Aided Occupancy Prediction based on Cylindrical Voxel for Autonomous Driving | Chaofan Wu et.al. | 2504.01023 | null |
2025-04-01 | Foundation Models for Autonomous Driving System: An Initial Roadmap | Xiongfei Wu et.al. | 2504.00911 | null |
2025-04-09 | NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds | Mahan Rafidashti et.al. | 2504.00859 | null |
2025-04-01 | UnIRe: Unsupervised Instance Decomposition for Dynamic Urban Scene Reconstruction | Yunxuan Mao et.al. | 2504.00763 | null |
2025-04-01 | Coca-Splat: Collaborative Optimization for Camera Parameters and 3D Gaussians | Jiamin Wu et.al. | 2504.00639 | null |
2025-04-01 | ADGaussian: Generalizable Gaussian Splatting for Autonomous Driving with Multi-modal Inputs | Qi Song et.al. | 2504.00437 | null |
2025-04-01 | Intrinsic-feature-guided 3D Object Detection | Wanjing Zhang et.al. | 2504.00382 | null |
2025-04-01 | MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving | Zhiyuan Zhang et.al. | 2504.00379 | null |
2025-04-23 | CF-CAM: Cluster Filter Class Activation Mapping for Reliable Gradient-Based Interpretability | Hongjie He et.al. | 2504.00060 | null |
2025-03-31 | Easi3R: Estimating Disentangled Motion from DUSt3R Without Training | Xingyu Chen et.al. | 2503.24391 | link |
2025-03-31 | UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving | Yuping Wang et.al. | 2503.24381 | link |
2025-04-01 | Self-Supervised Pretraining for Aerial Road Extraction | Rupert Polley et.al. | 2503.24326 | null |
2025-03-31 | Can Test-Time Scaling Improve World Foundation Model? | Wenyan Cong et.al. | 2503.24320 | link |
2025-04-29 | 4D mmWave Radar for Sensing Enhancement in Adverse Environments: Advances and Challenges | Xiangyuan Peng et.al. | 2503.24091 | null |
2025-03-31 | DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model | Ming Yuan et.al. | 2503.23993 | null |
2025-03-31 | Video-based Traffic Light Recognition by Rockchip RV1126 for Autonomous Driving | Miao Fan et.al. | 2503.23965 | null |
2025-03-31 | A Benchmark for Vision-Centric HD Mapping by V2I Systems | Miao Fan et.al. | 2503.23963 | null |
2025-03-31 | GLane3D : Detecting Lanes with Graph of 3D Keypoints | Halil İbrahim Öztürk et.al. | 2503.23882 | null |
2025-04-21 | STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding? | Yun Li et.al. | 2503.23765 | null |
2025-04-07 | Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios | Jingzheng Li et.al. | 2503.23708 | null |
2025-03-31 | A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective | Zhuoren Li et.al. | 2503.23650 | null |
2025-03-30 | OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model | Xingcheng Zhou et.al. | 2503.23463 | link |
2025-04-13 | A Visual-Inertial Motion Prior SLAM for Dynamic Environments | Weilong Sun et.al. | 2503.23429 | null |
2025-03-30 | OnSiteVRU: A High-Resolution Trajectory Dataset for High-Density Vulnerable Road Users | Zhangcun Yan et.al. | 2503.23365 | null |
2025-03-29 | VLM-C4L: Continual Core Dataset Learning with Corner Case Optimization via Vision-Language Models for Autonomous Driving | Haibo Hu et.al. | 2503.23046 | null |
2025-03-28 | Markov Potential Game Construction and Multi-Agent Reinforcement Learning with Applications to Autonomous Driving | Huiwen Yan et.al. | 2503.22867 | null |
2025-03-28 | SafeCast: Risk-Responsive Motion Forecasting for Autonomous Vehicles | Haicheng Liao et.al. | 2503.22541 | null |
2025-03-28 | NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving | Fuhao Li et.al. | 2503.22436 | null |
2025-04-16 | VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene Flow | Yancong Lin et.al. | 2503.22328 | link |
2025-03-28 | A Dataset for Semantic Segmentation in the Presence of Unknowns | Zakaria Laskar et.al. | 2503.22309 | null |
2025-03-28 | CRLLK: Constrained Reinforcement Learning for Lane Keeping in Autonomous Driving | Xinwei Gao et.al. | 2503.22248 | null |
2025-04-05 | CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving | Yishen Ji et.al. | 2503.22231 | null |
2025-03-28 | Multi-modal Knowledge Distillation-based Human Trajectory Forecasting | Jaewoo Jeong et.al. | 2503.22201 | link |
2025-03-28 | Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy Prediction | Seokha Moon et.al. | 2503.22087 | link |
2025-03-28 | A Deep Learning Framework for Boundary-Aware Semantic Segmentation | Tai An et.al. | 2503.22050 | null |
2025-03-27 | Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video | David Yifan Yao et.al. | 2503.21761 | link |
2025-03-27 | InteractionMap: Improving Online Vectorized HDMap Construction with Interaction | Kuang Wu et.al. | 2503.21659 | null |
2025-03-27 | Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving | Yue Li et.al. | 2503.21505 | link |
2025-04-01 | Fine-Grained Behavior and Lane Constraints Guided Trajectory Prediction Method | Wenyi Xiong et.al. | 2503.21477 | null |
2025-03-27 | Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving | Lucas Nunes et.al. | 2503.21449 | link |
2025-03-27 | Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap | Tong Nie et.al. | 2503.21411 | link |
2025-03-28 | LandMarkSystem Technical Report | Zhenxiang Ma et.al. | 2503.21364 | link |
2025-03-27 | Large Language Models for Traffic and Transportation Research: Methodologies, State of the Art, and Future Opportunities | Yimo Yan et.al. | 2503.21330 | null |
2025-03-27 | Knowledge Graphs as World Models for Semantic Material-Aware Obstacle Handling in Autonomous Vehicles | Ayush Bheemaiah et.al. | 2503.21232 | null |
2025-03-29 | GenFusion: Closing the Loop between Reconstruction and Generation via Videos | Sibo Wu et.al. | 2503.21219 | null |
2025-03-27 | Extending Silicon Lifetime: A Review of Design Techniques for Reliable Integrated Circuits | Shaik Jani Babu et.al. | 2503.21165 | null |
2025-03-27 | Adversarial Wear and Tear: Exploiting Natural Damage for Generating Physical-World Adversarial Examples | Samra Irshad et.al. | 2503.21164 | null |
2025-03-24 | AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models | Le Qiu et.al. | 2503.20804 | null |
2025-03-26 | FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks | Jinwei Li et.al. | 2503.20784 | link |
2025-03-26 | ADS-Edit: A Multimodal Knowledge Editing Dataset for Autonomous Driving Systems | Chenxi Wang et.al. | 2503.20756 | link |
2025-03-26 | PhysGen3D: Crafting a Miniature Interactive World from a Single Image | Boyuan Chen et.al. | 2503.20746 | null |
2025-03-26 | AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Xiangwen Zhang et.al. | 2503.20654 | null |
2025-03-26 | SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments | Tanmoy Dam et.al. | 2503.20614 | link |
2025-03-26 | GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving | Lloyd Russell et.al. | 2503.20523 | null |
2025-03-26 | EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis | Sheng Miao et.al. | 2503.20168 | null |
2025-03-26 | Bandwidth Allocation for Cloud-Augmented Autonomous Driving | Peter Schafhalter et.al. | 2503.20127 | null |
2025-03-25 | SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining | Xiang Xu et.al. | 2503.19912 | link |
2025-03-25 | Scaling Vision Pre-Training to 4K Resolution | Baifeng Shi et.al. | 2503.19903 | null |
2025-03-25 | LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset | Manjushree Aithal et.al. | 2503.19804 | null |
2025-03-31 | Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion | Konyul Park et.al. | 2503.19776 | null |
2025-03-25 | ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation | Haoyu Fu et.al. | 2503.19755 | null |
2025-03-25 | Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving | Yusen Xie et.al. | 2503.19713 | link |
2025-03-27 | Risk-Aware Reinforcement Learning for Autonomous Driving: Improving Safety When Driving through Intersection | Bo Leng et.al. | 2503.19690 | null |
2025-03-25 | Multi-Agent Deep Reinforcement Learning for Safe Autonomous Driving with RICS-Assisted MEC | Xueyao Zhang et.al. | 2503.19418 | null |
2025-03-26 | ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models | Dohwan Ko et.al. | 2503.19355 | null |
2025-03-25 | A Reliable and Efficient 5G Vehicular MEC: Guaranteed Task Completion with Minimal Latency | Mahsa Paknejad et.al. | 2503.19320 | null |
2025-03-25 | BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation | Hanshuo Qiu et.al. | 2503.19303 | null |
2025-03-25 | Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications | Ben Rahman et.al. | 2503.19276 | null |
2025-03-24 | Enhancing V2X Communications with UAV-mounted Reconfigurable Intelligent Surfaces | Salim Janji et.al. | 2503.19038 | null |
2025-03-24 | Building Blocks for Robust and Effective Semi-Supervised Real-World Object Detection | Moussa Kassem Sbeyti et.al. | 2503.18903 | null |
2025-03-24 | Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving | Hongkuan Zhou et.al. | 2503.18730 | null |
2025-04-07 | AgentSpec: Customizable Runtime Enforcement for Safe and Reliable LLM Agents | Haoyu Wang et.al. | 2503.18666 | null |
2025-03-24 | Robust Lane Detection with Wavelet-Enhanced Context Modeling and Adaptive Sampling | Kunyang Li et.al. | 2503.18631 | null |
2025-03-24 | ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation | Guosheng Zhao et.al. | 2503.18438 | null |
2025-03-23 | Training A Neural Network For Partially Occluded Road Sign Identification In The Context Of Autonomous Vehicles | Gulnaz Gimaletdinova et.al. | 2503.18177 | null |
2025-03-23 | Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving | Junhao Ge et.al. | 2503.18108 | link |
2025-03-23 | M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous Driving | Xuesong Chen et.al. | 2503.18100 | link |
2025-04-15 | Text-Driven 3D Lidar Place Recognition for Autonomous Driving | Tianyi Shang et.al. | 2503.18035 | null |
2025-03-23 | PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos | Hanxiao Jiang et.al. | 2503.17973 | null |
2025-03-22 | LightLoc: Learning Outdoor LiDAR Localization at Light Speed | Wen Li et.al. | 2503.17814 | link |
2025-03-22 | HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving | R. D. Lin et.al. | 2503.17752 | link |
2025-03-22 | Multi-modality Anomaly Segmentation on the Road | Heng Gao et.al. | 2503.17712 | link |
2025-03-22 | Sense4FL: Vehicular Crowdsensing Enhanced Federated Learning for Autonomous Driving | Yanan Ma et.al. | 2503.17697 | null |
2025-03-18 | CP-NCBF: A Conformal Prediction-based Approach to Synthesize Verified Neural Control Barrier Functions | Manan Tayal et.al. | 2503.17395 | null |
2025-03-21 | How to Promote Autonomous Driving with Evolving Technology: Business Strategy and Pricing Decision | Mingliang Li et.al. | 2503.17174 | null |
2025-03-26 | Hi-ALPS – An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving | Alexandra Arzberger et.al. | 2503.17168 | null |
2025-03-21 | Enhancing Steering Estimation with Semantic-Aware GNNs | Fouad Makiyeh et.al. | 2503.17153 | null |
2025-03-26 | R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception | Jonas Mirlach et.al. | 2503.17122 | null |
2025-03-21 | Temporal Action Detection Model Compression by Progressive Block Drop | Xiaoyong Chen et.al. | 2503.16916 | null |
2025-03-21 | OpenCity3D: What do Vision-Language Models know about Urban Environments? | Valentin Bieri et.al. | 2503.16776 | null |
2025-03-19 | A Vehicle-Infrastructure Multi-layer Cooperative Decision-making Framework | Yiming Cui et.al. | 2503.16552 | null |
2025-03-04 | Injecting Conflict Situations in Autonomous Driving Simulation using CARLA | Tsvetomila Mihaylova et.al. | 2503.16476 | null |
2025-03-20 | Panoptic-CUDAL Technical Report: Rural Australia Point Cloud Dataset in Rainy Conditions | Tzu-Yun Tseng et.al. | 2503.16378 | null |
2025-03-20 | BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models | Zenghui Yuan et.al. | 2503.16023 | null |
2025-03-20 | MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving | Haiguang Wang et.al. | 2503.15875 | link |
2025-03-20 | AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models | Boshra Khalili et.al. | 2503.15778 | null |
2025-03-20 | Nano-3D: Metasurface-Based Neural Depth Imaging | Bingxuan Li et.al. | 2503.15770 | null |
2025-03-19 | GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving | William Ljungbergh et.al. | 2503.15672 | null |
2025-03-19 | V2X-DG: Domain Generalization for Vehicle-to-Everything Cooperative Perception | Baolu Li et.al. | 2503.15435 | null |
2025-03-19 | EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds | Yuanchao Yue et.al. | 2503.15284 | link |
2025-03-19 | An Investigation of Beam Density on LiDAR Object Detection Performance | Christoph Griesbacher et.al. | 2503.15087 | null |
2025-03-19 | DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling | Jianbo Zhao et.al. | 2503.15029 | null |
2025-03-19 | USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network | Joseph Emmanuel DL Dayo et.al. | 2503.14950 | null |
2025-03-26 | Generating Multimodal Driving Scenes via Next-Scene Prediction | Yanhao Wu et.al. | 2503.14945 | null |
2025-03-19 | SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments | Yinqi Chen et.al. | 2503.14837 | null |
2025-03-19 | MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models | Chejian Xu et.al. | 2503.14827 | null |
2025-03-18 | RAT: Boosting Misclassification Detection Ability without Extra Data | Ge Yan et.al. | 2503.14783 | null |
2025-03-21 | SuperPC: A Single Diffusion Model for Point Cloud Completion, Upsampling, Denoising, and Colorization | Yi Du et.al. | 2503.14558 | null |
2025-03-22 | Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey | Liewen Liao et.al. | 2503.14537 | null |
2025-03-19 | Advances in 4D Generation: A Survey | Qiaowei Miao et.al. | 2503.14501 | link |
2025-03-18 | Tracking Meets Large Multimodal Models for Driving Scenario Understanding | Ayesha Ishaq et.al. | 2503.14498 | link |
2025-03-18 | Driving behavior recognition via self-discovery learning | Yilin Wang et.al. | 2503.14194 | null |
2025-03-18 | Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning | Bozhou Zhang et.al. | 2503.14182 | link |
2025-03-18 | SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model | Xinqing Li et.al. | 2503.13952 | link |
2025-03-21 | ChatBEV: A Visual Language Model that Understands BEV Maps | Qingyao Xu et.al. | 2503.13938 | null |
2025-03-18 | PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds | Barza Nisar et.al. | 2503.13914 | null |
2025-03-18 | Robust3D-CIL: Robust Class-Incremental Learning for 3D Perception | Jinge Ma et.al. | 2503.13869 | null |
2025-03-18 | RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving | Yujin Wang et.al. | 2503.13861 | null |
2025-03-26 | MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations | Hongyu Ke et.al. | 2503.13858 | link |
2025-03-17 | AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction | Thomas Monninger et.al. | 2503.13430 | null |
2025-03-17 | A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives | Weiqiang Jin et.al. | 2503.13415 | null |
2025-03-17 | Clustering is back: Reaching state-of-the-art LiDAR instance segmentation without training | Corentin Sautier et.al. | 2503.13203 | null |
2025-03-17 | InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving | Ruiqi Song et.al. | 2503.13047 | null |
2025-03-17 | SparseAlign: A Fully Sparse Framework for Cooperative Object Detection | Yunshuang Yuan et.al. | 2503.12982 | null |
2025-03-17 | OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering | Guanhua Ding et.al. | 2503.12968 | null |
2025-03-17 | Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation | Kailin Li et.al. | 2503.12820 | null |
2025-03-17 | SAM2 for Image and Video Segmentation: A Comprehensive Survey | Zhang Jiaxing et.al. | 2503.12781 | null |
2025-03-17 | GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching | Feng Qiao et.al. | 2503.12720 | link |
2025-03-16 | Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding | Imran Kabir et.al. | 2503.12663 | link |
2025-03-23 | Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey | Yaoting Wang et.al. | 2503.12605 | link |
2025-03-16 | Point Cloud Based Scene Segmentation: A Survey | Dan Halperin et.al. | 2503.12595 | null |
2025-03-22 | MTGS: Multi-Traversal Gaussian Splatting | Tianyu Li et.al. | 2503.12552 | null |
2025-03-16 | Car-1000: A New Large Scale Fine-Grained Visual Categorization Dataset | Yutao Hu et.al. | 2503.12385 | null |
2025-03-16 | L2COcc: Lightweight Camera-Centric Semantic Scene Completion via Distillation of LiDAR Model | Ruoyu Wang et.al. | 2503.12369 | null |
2025-03-16 | ResLPR: A LiDAR Data Restoration Network and Benchmark for Robust Place Recognition Against Weather Corruptions | Wenqing Kuang et.al. | 2503.12350 | null |
2025-03-15 | Bench2FreeAD: A Benchmark for Vision-based End-to-end Navigation in Unstructured Robotic Environments | Yuhang Peng et.al. | 2503.12180 | link |
2025-03-15 | DiffAD: A Unified Diffusion Modeling Approach for Autonomous Driving | Tao Wang et.al. | 2503.12170 | null |
2025-03-15 | SFMNet: Sparse Focal Modulation for 3D Object Detection | Oren Shrout et.al. | 2503.12093 | null |
2025-03-15 | Generative Modeling of Adversarial Lane-Change Scenario | Chuancheng Zhang et.al. | 2503.12055 | null |
2025-03-15 | TACO: Taming Diffusion for in-the-wild Video Amodal Completion | Ruijie Lu et.al. | 2503.12049 | null |
2025-03-15 | Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training | Zhenxin Li et.al. | 2503.12030 | link |
2025-04-03 | 3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene Reconstruction | Peizhen Zheng et.al. | 2503.12001 | link |
2025-03-30 | Controllable Latent Diffusion for Traffic Simulation | Yizhuo Xiao et.al. | 2503.11771 | link |
2025-03-14 | Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation | Xianming Zeng et.al. | 2503.11731 | null |
2025-03-14 | Centaur: Robust End-to-End Autonomous Driving with Test-Time Training | Chonghao Sima et.al. | 2503.11650 | null |
2025-03-14 | A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving | Tin Stribor Sohn et.al. | 2503.11400 | null |
2025-03-14 | BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion Model | Ziyue Wang et.al. | 2503.11372 | link |
2025-03-14 | Learning-Based MPC for Efficient Control of Autonomous Vehicles | Samuel Mallick et.al. | 2503.11359 | link |
2025-03-14 | DynRsl-VLM: Enhancing Autonomous Driving Perception with Dynamic Resolution Vision-Language Models | Xirui Zhou et.al. | 2503.11265 | null |
2025-03-14 | DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation | Hongbin Lin et.al. | 2503.11122 | link |
2025-03-14 | Active Learning from Scene Embeddings for End-to-End Autonomous Driving | Wenhao Jiang et.al. | 2503.11062 | null |
2025-03-13 | Data-Driven Soft Robot Control via Adiabatic Spectral Submanifolds | Roshan S. Kaundinya et.al. | 2503.10919 | null |
2025-03-13 | Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM | Yizhou Huang et.al. | 2503.10898 | null |
2025-03-21 | TAIJI: Textual Anchoring for Immunizing Jailbreak Images in Vision Language Models | Xiangyu Yin et.al. | 2503.10872 | null |
2025-03-13 | DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding | Ayesha Ishaq et.al. | 2503.10621 | link |
2025-03-13 | OCCUQ: Exploring Efficient Uncertainty Quantification for 3D Occupancy Prediction | Severin Heidrich et.al. | 2503.10605 | link |
2025-03-13 | MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction | Yingshuang Zou et.al. | 2503.10604 | null |
2025-03-15 | Unlock the Power of Unlabeled Data in Language Driving Model | Chaoqun Wang et.al. | 2503.10586 | null |
2025-03-13 | Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback | Derun Li et.al. | 2503.10434 | null |
2025-03-13 | TARS: Traffic-Aware Radar Scene Flow Estimation | Jialong Wu et.al. | 2503.10210 | null |
2025-03-13 | GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction | Jianheng Liu et.al. | 2503.10170 | link |
2025-03-13 | Unlocking Generalization Power in LiDAR Point Cloud Registration | Zhenxuan Zeng et.al. | 2503.10149 | link |
2025-03-13 | Mamba-VA: A Mamba-based Approach for Continuous Emotion Recognition in Valence-Arousal Space | Yuheng Liang et.al. | 2503.10104 | link |
2025-03-13 | TGP: Two-modal occupancy prediction with 3D Gaussian and sparse points for 3D Environment Awareness | Mu Chen et.al. | 2503.09941 | null |
2025-03-12 | CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | Hariprasath Govindarajan et.al. | 2503.09878 | null |
2025-03-12 | A Comprehensive Multi-Vocal Empirical Study of ML Cloud Service Misuses | Hadil Ben Amor et.al. | 2503.09815 | null |
2025-03-12 | Evaluating the Impact of Synthetic Data on Object Detection Tasks in Autonomous Driving | Enes Özeren et.al. | 2503.09803 | null |
2025-03-12 | SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment | Katrin Renz et.al. | 2503.09594 | null |
2025-03-12 | Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation | Máté Tóth et.al. | 2503.09464 | null |
2025-03-13 | PCLA: A Framework for Testing Autonomous Agents in the CARLA Simulator | Masoud Jamshidiyan Tehrani et.al. | 2503.09385 | link |
2025-03-12 | Post-interactive Multimodal Trajectory Prediction for Autonomous Driving | Ziyi Huang et.al. | 2503.09366 | null |
2025-03-12 | A Case Study on Model Checking and Runtime Verification for Awkernel | Akira Hasegawa et.al. | 2503.09282 | null |
2025-03-17 | Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space | Jian Zhu et.al. | 2503.09215 | null |
2025-03-17 | Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection | Xuzhong Hu et.al. | 2503.08992 | null |
2025-03-11 | Simulator Ensembles for Trustworthy Autonomous Driving Testing | Lev Sorokin et.al. | 2503.08936 | null |
2025-04-05 | Out-of-Distribution Segmentation in Autonomous Driving: Problems and State of the Art | Youssef Shoeb et.al. | 2503.08695 | null |
2025-03-11 | CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving | Changxing Liu et.al. | 2503.08683 | link |
2025-04-12 | Language-Depth Navigated Thermal and Visible Image Fusion | Jinchang Zhang et.al. | 2503.08676 | null |
2025-03-11 | Task-Oriented Co-Design of Communication, Computing, and Control for Edge-Enabled Industrial Cyber-Physical Systems | Yufeng Diao et.al. | 2503.08661 | null |
2025-03-11 | HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder | Yingqi Tang et.al. | 2503.08612 | link |
2025-03-11 | LiSu: A Dataset and Method for LiDAR Surface Normal Estimation | Dušan Malić et.al. | 2503.08601 | null |
2025-03-13 | JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data | Runjian Chen et.al. | 2503.08422 | null |
2025-03-11 | V-Max: Making RL practical for Autonomous Driving | Valentin Charraut et.al. | 2503.08388 | link |
2025-03-11 | Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving | Runwei Guan et.al. | 2503.08336 | null |
2025-03-24 | Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios | Zikang Yuan et.al. | 2503.08317 | null |
2025-03-11 | FASIONAD++ : Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback | Kangan Qian et.al. | 2503.08162 | null |
2025-03-11 | GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats | Kai Deng et.al. | 2503.08071 | link |
2025-03-11 | Simulating Automotive Radar with Lidar and Camera Inputs | Peili Song et.al. | 2503.08068 | null |
2025-03-11 | SGNetPose+: Stepwise Goal-Driven Networks with Pose Information for Trajectory Prediction in Autonomous Driving | Akshat Ghiya et.al. | 2503.08016 | null |
2025-03-11 | STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive Applications | Andrew Gao et.al. | 2503.07942 | link |
2025-03-07 | DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving | Xiaosong Jia et.al. | 2503.07656 | link |
2025-03-10 | AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning | Bo Jiang et.al. | 2503.07608 | link |
2025-03-10 | Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru | Dunant Cusipuma et.al. | 2503.07587 | null |
2025-03-10 | Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction | Zongzheng Zhang et.al. | 2503.07485 | link |
2025-03-10 | CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving | Ziliang Xiong et.al. | 2503.07425 | null |
2025-03-26 | GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts | Minwen Liao et.al. | 2503.07417 | null |
2025-03-10 | LEGO-Motion: Learning-Enhanced Grids with Occupancy Instance Modeling for Class-Agnostic Motion Prediction | Kangan Qian et.al. | 2503.07367 | null |
2025-03-10 | Temporal Triplane Transformers as Occupancy World Models | Haoran Xu et.al. | 2503.07338 | null |
2025-03-10 | CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting | Haicheng Liao et.al. | 2503.07234 | null |
2025-03-12 | HisTrackMap: Global Vectorized High-Definition Map Construction via History Map Tracking | Jing Yang et.al. | 2503.07168 | null |
2025-03-10 | Controllable 3D Outdoor Scene Generation via Scene Graphs | Yuheng Liu et.al. | 2503.07152 | link |
2025-03-12 | RS2V-L: Vehicle-Mounted LiDAR Data Generation from Roadside Sensor Observations | Ruidan Xing et.al. | 2503.07085 | null |
2025-03-10 | Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera | Dong-Hee Paek et.al. | 2503.07029 | link |
2025-03-10 | Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense | Yuting Hu et.al. | 2503.07020 | null |
2025-03-10 | Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark | Jiahao Wang et.al. | 2503.06983 | link |
2025-03-10 | HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors | Siyu Li et.al. | 2503.06821 | link |
2025-03-09 | Chance-Constrained Trajectory Planning with Multimodal Environmental Uncertainty | Kai Ren et.al. | 2503.06779 | link |
2025-03-09 | CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving | Rui Song et.al. | 2503.06744 | null |
2025-03-09 | Safe, Task-Consistent Manipulation with Operational Space Control Barrier Functions | Daniel Morton et.al. | 2503.06736 | null |
2025-03-09 | Attention, Please! PixelSHAP Reveals What Vision-Language Models Actually Focus On | Roni Goldshmidt et.al. | 2503.06670 | null |
2025-03-13 | AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems | AgiBot-World-Contributors et.al. | 2503.06669 | link |
2025-03-09 | AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation | Yang Zou et.al. | 2503.06660 | null |
2025-03-09 | Steerable Pyramid Weighted Loss: Multi-Scale Adaptive Weighting for Semantic Segmentation | Renhao Lu et.al. | 2503.06604 | null |
2025-03-30 | StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place Recognition | Yanqing Shen et.al. | 2503.06601 | link |
2025-03-09 | Future-Aware Interaction Network For Motion Forecasting | Shijie Li et.al. | 2503.06565 | null |
2025-03-09 | Evaluation of Safety Cognition Capability in Vision-Language Models for Autonomous Driving | Enming Zhang et.al. | 2503.06497 | null |
2025-03-09 | OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection | Adrian Chow et.al. | 2503.06435 | null |
2025-03-08 | Advancing Autonomous Vehicle Intelligence: Deep Learning and Multimodal LLM for Traffic Sign Recognition and Robust Lane Detection | Chandan Kumar Sah et.al. | 2503.06313 | null |
2025-03-08 | ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation | Qizhen Lan et.al. | 2503.06307 | null |
2025-03-08 | From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning | Shuangzhi Li et.al. | 2503.06282 | null |
2025-03-08 | Segment Anything, Even Occluded | Wei-En Tai et.al. | 2503.06261 | null |
2025-03-08 | Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection | Yifan Chang et.al. | 2503.06237 | null |
2025-03-08 | Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations | Meng Wang et.al. | 2503.06222 | null |
2025-03-08 | VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion | Meng Wang et.al. | 2503.06219 | link |
2025-03-12 | Object-Centric World Model for Language-Guided Manipulation | Youngjoon Jeong et.al. | 2503.06170 | null |
2025-03-17 | Treble Counterfactual VLMs: A Causal Approach to Hallucination | Shawn Li et.al. | 2503.06169 | link |
2025-03-17 | Secure On-Device Video OOD Detection Without Backpropagation | Shawn Li et.al. | 2503.06166 | link |
2025-03-08 | TransParking: A Dual-Decoder Transformer Framework with Soft Localization for End-to-End Automatic Parking | Hangyu Du et.al. | 2503.06071 | null |
2025-03-08 | InfoFusion Controller: Informed TRRT Star with Mutual Information based on Fusion of Pure Pursuit and MPC for Enhanced Path Planning | Seongjun Choi et.al. | 2503.06010 | link |
2025-03-08 | Learning to Drive by Imitating Surrounding Vehicles | Yasin Sonmez et.al. | 2503.05997 | null |
2025-03-04 | DriveGen: Towards Infinite Diverse Traffic Scenarios with Large Models | Shenyu Zhang et.al. | 2503.05808 | null |
2025-03-13 | GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving | Zebin Xing et.al. | 2503.05689 | link |
2025-03-07 | InDRiVE: Intrinsic Disagreement based Reinforcement for Vehicle Exploration through Curiosity Driven Generalized World Model | Feeza Khan Khanzada et.al. | 2503.05573 | null |
2025-03-07 | FastMap: Fast Queries Initialization Based Vectorized HD Map Reconstruction Framework | Haotian Hu et.al. | 2503.05492 | link |
2025-03-07 | DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction | Miaowei Wang et.al. | 2503.05484 | null |
2025-03-07 | A Hybrid Approach for Extending Automotive Radar Operation to NLOS Urban Scenarios | Aviran Gal et.al. | 2503.05413 | null |
2025-03-07 | Evidential Uncertainty Estimation for Multi-Modal Trajectory Prediction | Sajad Marvi et.al. | 2503.05274 | null |
2025-03-07 | Discrete Contrastive Learning for Diffusion Policies in Autonomous Driving | Kalle Kujanpää et.al. | 2503.05229 | null |
2025-03-07 | A Comprehensive LLM-powered Framework for Driving Intelligence Evaluation | Shanhe You et.al. | 2503.05164 | null |
2025-03-07 | An End-to-End Learning-Based Multi-Sensor Fusion for Autonomous Vehicle Localization | Changhong Lin et.al. | 2503.05088 | null |
2025-03-07 | Adaptive-LIO: Enhancing Robustness and Precision through Environmental Adaptation in LiDAR Inertial Odometry | Chengwei Zhao et.al. | 2503.05077 | link |
2025-03-06 | Quantifying and Modeling Driving Styles in Trajectory Forecasting | Laura Zheng et.al. | 2503.04994 | null |
2025-03-06 | INTENT: Trajectory Prediction Framework with Intention-Guided Contrastive Clustering | Yihong Tang et.al. | 2503.04952 | null |
2025-03-06 | Manboformer: Learning Gaussian Representations via Spatial-temporal Attention Mechanism | Ziyue Zhao et.al. | 2503.04863 | null |
2025-03-05 | RTFusion: A depth estimation network based on multimodal fusion in challenging scenarios | Zelin Meng et.al. | 2503.04821 | null |
2025-03-06 | Research on a Driver’s Perceived Risk Prediction Model Considering Traffic Scene Interaction | Chenhao Yang et.al. | 2503.04516 | null |
2025-03-06 | Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes | Hui Zhang et.al. | 2503.04235 | null |
2025-03-06 | Simulation-based Analysis Of Highway Trajectory Planning Using High-Order Polynomial For Highly Automated Driving Function | Milin Patel et.al. | 2503.04159 | link |
2025-03-06 | H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision | Yunxiao Shi et.al. | 2503.04059 | null |
2025-03-05 | Enhancing Autonomous Driving Safety with Collision Scenario Integration | Zi Wang et.al. | 2503.03957 | null |
2025-03-03 | A Survey on Semantic Communications in Internet of Vehicles | Sha Ye et.al. | 2503.03767 | null |
2025-03-05 | CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization | Junhao Xu et.al. | 2503.03430 | link |
2025-03-05 | Trajectory Prediction for Autonomous Driving: Progress, Limitations, and Future Directions | Nadya Abdel Madjid et.al. | 2503.03262 | null |
2025-03-06 | Don’t Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving | Ziying Song et.al. | 2503.03125 | link |
2025-03-05 | Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria | Asma A. Almutairi et.al. | 2503.03100 | null |
2025-03-05 | BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving | Katharina Winter et.al. | 2503.03074 | link |
2025-03-04 | Text2Scenario: Text-Driven Scenario Generation for Autonomous Driving Test | Xuan Cai et.al. | 2503.02911 | null |
2025-03-04 | Federated Learning for Privacy-Preserving Feedforward Control in Multi-Agent Systems | Jakob Weber et.al. | 2503.02693 | link |
2025-03-04 | State of play and future directions in industrial computer vision AI standards | Artemis Stefanidou et.al. | 2503.02675 | null |
2025-03-04 | Human-aligned Safe Reinforcement Learning for Highway On-Ramp Merging in Dense Traffic | Yang Li et.al. | 2503.02624 | link |
2025-03-04 | TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping | Xinying Hong et.al. | 2503.02578 | link |
2025-03-04 | PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers | Wooju Lee et.al. | 2503.02388 | null |
2025-03-04 | Diffusion-Based mmWave Radar Point Cloud Enhancement Driven by Range Images | Ruixin Wu et.al. | 2503.02300 | null |
2025-03-03 | Road Boundary Detection Using 4D mmWave Radar for Autonomous Driving | Yuyan Wu et.al. | 2503.01930 | null |
2025-02-21 | Interaction-Aware Model Predictive Decision-Making for Socially-Compliant Autonomous Driving in Mixed Urban Traffic Scenarios | Balint Varga et.al. | 2503.01852 | null |
2025-03-03 | ECG-EmotionNet: Nested Mixture of Expert (NMoE) Adaptation of ECG-Foundation Model for Driver Emotion Recognition | Nastaran Mansourian et.al. | 2503.01750 | null |
2025-03-05 | Perceptual Motor Learning with Active Inference Framework for Robust Lateral Control | Elahe Delavari et.al. | 2503.01676 | null |
2025-03-03 | CAPS: Context-Aware Priority Sampling for Enhanced Imitation Learning in Autonomous Driving | Hamidreza Mirkhani et.al. | 2503.01650 | null |
2025-03-03 | DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution | Xingyuan Li et.al. | 2503.01187 | link |
2025-03-03 | Privacy-preserving Machine Learning in Internet of Vehicle Applications: Fundamentals, Recent Advances, and Future Direction | Nazmul Islam et.al. | 2503.01089 | null |
2025-03-02 | Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching | Jinyu Miao et.al. | 2503.00862 | null |
2025-03-02 | CARIL: Confidence-Aware Regression in Imitation Learning for Autonomous Driving | Elahe Delavari et.al. | 2503.00783 | link |
2025-03-02 | Enhancing Monocular 3D Scene Completion with Diffusion Model | Changlin Song et.al. | 2503.00726 | link |
2025-03-06 | Dur360BEV: A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving | Wenke E et.al. | 2503.00675 | link |
2025-03-01 | Actor-Critic Cooperative Compensation to Model Predictive Control for Off-Road Autonomous Vehicles Under Unknown Dynamics | Prakhar Gupta et.al. | 2503.00577 | null |
2025-02-28 | SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models | Jiawei Zhang et.al. | 2503.00211 | null |
2025-02-26 | Glad: A Streaming Scene Generator for Autonomous Driving | Bin Xie et.al. | 2503.00045 | null |
2025-02-28 | Dynamically Local-Enhancement Planner for Large-Scale Autonomous Driving | Nanshan Deng et.al. | 2502.21134 | null |
2025-02-28 | AuthSim: Towards Authentic and Effective Safety-critical Scenario Generation for Autonomous Driving Tests | Yukuan Yang et.al. | 2502.21100 | null |
2025-02-28 | Multimodal Learning for Just-In-Time Software Defect Prediction in Autonomous Driving Systems | Faisal Mohammad et.al. | 2502.20806 | null |
2025-02-28 | WorldModelBench: Judging Video Generation Models As World Models | Dacheng Li et.al. | 2502.20694 | null |
2025-02-28 | LV-DOT: LiDAR-visual dynamic obstacle detection and tracking for autonomous robot navigation | Zhefan Xu et.al. | 2502.20607 | link |
2025-02-28 | Map Space Belief Prediction for Manipulation-Enhanced Mapping | Joao Marcos Correia Marques et.al. | 2502.20606 | null |
2025-03-01 | VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers | Ziang Guo et.al. | 2502.20108 | null |
2025-02-27 | Minds on the Move: Decoding Trajectory Prediction in Autonomous Driving with Cognitive Insights | Haicheng Liao et.al. | 2502.20084 | null |
2025-02-28 | SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation | Zijie Zhou et.al. | 2502.20077 | link |
2025-03-24 | CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-scale Reinforcement Learning in Autonomous Driving | Dongkun Zhang et.al. | 2502.19908 | null |
2025-02-27 | Shared Autonomy for Proximal Teaching | Megha Srivastava et.al. | 2502.19899 | null |
2025-03-15 | You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving | Guangfeng Jiang et.al. | 2502.19698 | null |
2025-03-24 | BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance | Xin Ye et.al. | 2502.19694 | null |
2025-02-27 | Unveiling Security Weaknesses in Autonomous Driving Systems: An In-Depth Empirical Study | Wenyuan Cheng et.al. | 2502.19687 | null |
2025-02-26 | Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras | Hoonhee Cho et.al. | 2502.19630 | link |
2025-03-02 | EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving in the Arab Gulf Region | Nadya Abdel Madjid et.al. | 2502.19260 | link |
2025-02-26 | Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach | Anton Backhaus et.al. | 2502.19177 | null |
2025-02-26 | Learning Autonomy: Off-Road Navigation Enhanced by Human Input | Akhil Nagariya et.al. | 2502.18760 | null |
2025-02-25 | Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces | Amirhossein Roknilamouki et.al. | 2502.18655 | null |
2025-02-25 | VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion | Pei Liu et.al. | 2502.18042 | null |
2025-02-25 | Exploring the Effects of Traditional Chinese Medicine Scents on Mitigating Driving Fatigue | Nengyue Su et.al. | 2502.18013 | null |
2025-02-25 | InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer | Bo Zhang et.al. | 2502.17949 | null |
2025-02-25 | VVRec: Reconstruction Attacks on DL-based Volumetric Video Upstreaming via Latent Diffusion Model with Gamma Distribution | Rui Lu et.al. | 2502.17880 | null |
2025-02-26 | Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking | Peng Zhang et.al. | 2502.17822 | null |
2025-02-25 | CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems | Rui Liu et.al. | 2502.17821 | null |
2025-03-04 | CalibRefine: Deep Learning-Based Online Automatic Targetless LiDAR-Camera Calibration with Iterative and Attention-Driven Post-Refinement | Lei Cheng et.al. | 2502.17648 | link |
2025-02-25 | GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow | Simon Boeder et.al. | 2502.17288 | null |
2025-02-24 | MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation | Jiehao Luo et.al. | 2502.16907 | link |
2025-02-24 | Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances | Yaozu Wu et.al. | 2502.16804 | null |
2025-02-25 | AUKT: Adaptive Uncertainty-Guided Knowledge Transfer with Conformal Prediction | Rui Liu et.al. | 2502.16736 | null |
2025-02-25 | Co-MTP: A Cooperative Trajectory Prediction Framework with Multi-Temporal Fusion for Autonomous Driving | Xinyu Zhang et.al. | 2502.16589 | link |
2025-02-23 | An Expert Ensemble for Detecting Anomalous Scenes, Interactions, and Behaviors in Autonomous Driving | Tianchen Ji et.al. | 2502.16389 | null |
2025-02-22 | A Brain-Inspired Perception-Decision Driving Model Based on Neural Pathway Anatomical Alignment | Haidong Wang et.al. | 2502.16027 | null |
2025-02-22 | Cross-Model Transferability of Adversarial Patches in Real-time Segmentation for Autonomous Driving | Prashant Shekhar et.al. | 2502.16012 | link |
2025-02-21 | Computation Offloading Strategies in Integrated Terrestrial and Non-Terrestrial Networks | Muhammad Ahmed Mohsin et.al. | 2502.15903 | null |
2025-02-20 | Getting SMARTER for Motion Planning in Autonomous Driving Systems | Montgomery Alban et.al. | 2502.15824 | link |
2025-02-21 | VaViM and VaVAM: Autonomous Driving through Video Generative Modeling | Florent Bartoccioni et.al. | 2502.15672 | link |
2025-02-24 | Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis | Ziqian Ni et.al. | 2502.15635 | null |
2025-02-21 | Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection | Yue Sun et.al. | 2502.15516 | null |
2025-03-11 | Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection | Jiangyong Yu et.al. | 2502.15488 | null |
2025-02-21 | A modular risk concept for complex systems | Dag McGeorge et.al. | 2502.15482 | null |
2025-02-21 | Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence | Yufeng Diao et.al. | 2502.15472 | null |
2025-03-10 | OccLinker: Deflickering Occupancy Networks through Lightweight Spatio-Temporal Correlation | Fengcheng Yu et.al. | 2502.15438 | null |
2025-02-21 | Enhancing Vehicle Make and Model Recognition with 3D Attention Modules | Narges Semiromizadeh et.al. | 2502.15398 | null |
2025-02-26 | PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments | Yueting Liu et.al. | 2502.15342 | link |
2025-02-21 | OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework | Junliang Chen et.al. | 2502.15180 | link |
2025-02-21 | CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models | Zihao Sheng et.al. | 2502.15119 | null |
2025-02-20 | Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios | Richard Marcus et.al. | 2502.15076 | link |
2025-02-19 | Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning | Rui Zhao et.al. | 2502.14917 | null |
2025-02-17 | CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Zhe Huang et.al. | 2502.14891 | link |
2025-03-04 | AVD2: Accident Video Diffusion for Accident Video Description | Cheng Li et.al. | 2502.14801 | null |
2025-02-20 | RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation | Henrique Piñeiro Monteagudo et.al. | 2502.14792 | null |
2025-02-23 | Real-world Troublemaker: A 5G Cloud-controlled Track Testing Framework for Automated Driving Systems in Safety-critical Interaction Scenarios | Xinrui Zhang et.al. | 2502.14574 | null |
2025-02-20 | Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance | Meng Wang et.al. | 2502.14520 | null |
2025-02-20 | CrossFuse: Learning Infrared and Visible Image Fusion by Cross-Sensor Top-K Vision Alignment and Beyond | Yukai Shi et.al. | 2502.14493 | null |
2025-02-20 | Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2502.14416 | null |
2025-02-20 | ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 | Tianyou Jiang et.al. | 2502.14314 | null |
2025-02-20 | OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images | Zhichao Zheng et.al. | 2502.14279 | null |
2025-02-20 | OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving | Yedong Shen et.al. | 2502.14235 | null |
2025-02-19 | SegRet: An Efficient Design for Semantic Segmentation with Retentive Network | Zhiyuan Li et.al. | 2502.14014 | link |
2025-02-19 | MEX: Memory-efficient Approach to Referring Multi-Object Tracking | Huu-Thien Tran et.al. | 2502.13875 | null |
2025-02-18 | Uncertain Multi-Objective Recommendation via Orthogonal Meta-Learning Enhanced Bayesian Optimization | Hongxu Wang et.al. | 2502.13180 | null |
2025-02-18 | RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning | Hao Gao et.al. | 2502.13144 | null |
2025-02-18 | Fragility-aware Classification for Understanding Risk and Improving Generalization | Chen Yang et.al. | 2502.13024 | null |
2025-02-18 | RadSplatter: Extending 3D Gaussian Splatting to Radio Frequencies for Wireless Radiomap Extrapolation | Yiheng Wang et.al. | 2502.12686 | null |
2025-02-17 | Detecting Systematic Weaknesses in Vision Models along Predefined Human-Understandable Dimensions | Sujan Sai Gannamaneni et.al. | 2502.12360 | null |
2025-02-16 | AI-Augmented Metamorphic Testing for Comprehensive Validation of Autonomous Vehicles | Tony Zhang et.al. | 2502.12208 | null |
2025-02-17 | Bandwidth-Adaptive Spatiotemporal Correspondence Identification for Collaborative Perception | Peng Gao et.al. | 2502.12098 | null |
2025-02-17 | Residual Learning towards High-fidelity Vehicle Dynamics Modeling with Transformer | Jinyu Miao et.al. | 2502.11800 | null |
2025-02-17 | MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction | Jingcheng Ni et.al. | 2502.11663 | link |
2025-02-17 | PrivilegedDreamer: Explicit Imagination of Privileged Information for Rapid Adaptation of Learned Policies | Morgan Byrd et.al. | 2502.11377 | null |
2025-02-17 | A Framework for Learning Scoring Rules in Autonomous Driving Planning Systems | Zikang Xiong et.al. | 2502.11352 | null |
2025-03-10 | NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing | Shutong Zhang et.al. | 2502.10720 | null |
2025-03-02 | Adaptive Neural Networks for Intelligent Data-Driven Development | Youssef Shoeb et.al. | 2502.10603 | null |
2025-02-14 | The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey | Sifan Tu et.al. | 2502.10498 | link |
2025-02-14 | A Robust Attack: Displacement Backdoor Attack | Yong Li et.al. | 2502.10490 | null |
2025-02-13 | Knowledge Integration Strategies in Autonomous Vehicle Prediction and Planning: A Comprehensive Survey | Kumar Manas et.al. | 2502.10477 | null |
2025-02-12 | Deep Reinforcement Learning-Based User Scheduling for Collaborative Perception | Yandi Liu et.al. | 2502.10456 | null |
2025-02-11 | A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision | Hao Ai et.al. | 2502.10444 | null |
2025-02-17 | V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models | Hsu-kuang Chiu et.al. | 2502.09980 | null |
2025-02-14 | Dual Control for Interactive Autonomous Merging with Model Predictive Diffusion | Jacob Knaup et.al. | 2502.09918 | null |
2025-02-13 | LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh | Jing Wen et.al. | 2502.09617 | null |
2025-02-13 | Rolling Ahead Diffusion for Traffic Scene Simulation | Yunpeng Liu et.al. | 2502.09587 | null |
2025-02-13 | Generalizable Reinforcement Learning with Biologically Inspired Hyperdimensional Occupancy Grid Maps for Exploration and Goal-Directed Path Planning | Shay Snyder et.al. | 2502.09393 | null |
2025-02-13 | FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation | Bin Yang et.al. | 2502.09274 | null |
2025-02-13 | LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement | Daocheng Fu et.al. | 2502.09170 | link |
2025-02-13 | Topo2Seq: Enhanced Topology Reasoning via Topology Sequence Learning | Yiming Yang et.al. | 2502.08974 | null |
2025-02-10 | Motion Forecasting for Autonomous Vehicles: A Survey | Jianxin Shi et.al. | 2502.08664 | null |
2025-02-14 | Deployment-friendly Lane-changing Intention Prediction Powered by Brain-inspired Spiking Neural Networks | Shuqi Shen et.al. | 2502.08659 | null |
2025-02-12 | MoDitector: Module-Directed Testing for Autonomous Driving Systems | Renzhi Wang et.al. | 2502.08504 | null |
2025-02-12 | AdvSwap: Covert Adversarial Perturbation with High Frequency Info-swapping for Autonomous Driving Perception | Yuanhao Huang et.al. | 2502.08374 | null |
2025-02-12 | FixDrive: Automatically Repairing Autonomous Vehicle Driving Behaviour for $0.08 per Violation | Yang Sun et.al. | 2502.08260 | link |
2025-02-12 | End-to-End Predictive Planner for Autonomous Driving with Consistency Models | Anjian Li et.al. | 2502.08033 | null |
2025-02-10 | Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing | Sihao Wu et.al. | 2502.07829 | null |
2025-02-07 | CP-Guard+: A New Paradigm for Malicious Agent Detection and Defense in Collaborative Perception | Senkang Hu et.al. | 2502.07807 | null |
2025-02-11 | Divide and Merge: Motion and Semantic Learning in End-to-End Autonomous Driving | Yinzhe Shen et.al. | 2502.07631 | null |
2025-02-11 | Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving | Novendra Setyawan et.al. | 2502.07417 | null |
2025-02-11 | USRNet: Unified Scene Recovery Network for Enhancing Traffic Imaging under Multiple Adverse Weather Conditions | Yuxu Lu et.al. | 2502.07372 | link |
2025-02-11 | Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving | Xiang Li et.al. | 2502.07309 | link |
2025-02-11 | Online Aggregation of Trajectory Predictors | Alex Tong et.al. | 2502.07178 | null |
2025-02-06 | Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation | Namhee Kim et.al. | 2502.06843 | null |
2025-02-10 | Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene | Tai-Yu Pan et.al. | 2502.06682 | null |
2025-02-27 | A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems | Linxiao Gong et.al. | 2502.06581 | null |
2025-02-10 | Occ-LLM: Enhancing Autonomous Driving with Occupancy-Based Large Language Models | Tianshuo Xu et.al. | 2502.06419 | null |
2025-02-10 | Occlusion-Aware Contingency Safety-Critical Planning for Autonomous Vehicles | Lei Zheng et.al. | 2502.06359 | null |
2025-02-10 | Actual Achieved Gain and Optimal Perceived Gain: Modeling Human Take-over Decisions Towards Automated Vehicles’ Suggestions | Shuning Zhang et.al. | 2502.06179 | null |
2025-02-17 | Continual Adaptation for Autonomous Driving with the Mixture of Progressive Experts Network | Yixin Cui et.al. | 2502.05943 | null |
2025-02-09 | SphereFusion: Efficient Panorama Depth Estimation via Gated Fusion | Qingsong Yan et.al. | 2502.05859 | null |
2025-02-08 | Surprise Potential as a Measure of Interactivity in Driving Scenarios | Wenhao Ding et.al. | 2502.05677 | null |
2025-02-08 | TrackDiffuser: Nearly Model-Free Bayesian Filtering with Diffusion Model | Yangguang He et.al. | 2502.05629 | null |
2025-02-08 | Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture | Mitul Goswami et.al. | 2502.05476 | null |
2025-02-12 | Safety at Scale: A Comprehensive Survey of Large Model Safety | Xingjun Ma et.al. | 2502.05206 | link |
2025-02-07 | Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation | Steffen Eger et.al. | 2502.05151 | link |
2025-03-19 | GaussRender: Learning 3D Occupancy with Gaussian Rendering | Loïck Chambon et.al. | 2502.05040 | link |
2025-02-07 | Adaptive Learning-based Model Predictive Control Strategy for Drift Vehicles | Bei Zhou et.al. | 2502.04696 | null |
2025-02-05 | DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation | Luciano Baresi et.al. | 2502.04378 | link |
2025-02-05 | MapFusion: A Novel BEV Feature Fusion Network for Multi-modal Map Construction | Xiaoshuai Hao et.al. | 2502.04377 | null |
2025-02-06 | SMART: Advancing Scalable Map Priors for Driving Topology Reasoning | Junjie Ye et.al. | 2502.04329 | null |
2025-02-06 | sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views | Eyvaz Najafli et.al. | 2502.04318 | null |
2025-02-06 | Safeguarding connected autonomous vehicle communication: Protocols, intra- and inter-vehicular attacks and defenses | Mohammed Aledhari et.al. | 2502.04201 | null |
2025-02-06 | Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks | Yuhui Jin et.al. | 2502.03877 | null |
2025-02-06 | Optimized Unet with Attention Mechanism for Multi-Scale Semantic Segmentation | Xuan Li et.al. | 2502.03813 | null |
2025-02-06 | Reduce Lap Time for Autonomous Racing with Curvature-Integrated MPCC Local Trajectory Planning Method | Zhouheng Li et.al. | 2502.03695 | link |
2025-02-05 | Vehicle Routing Problems in the Age of Semi-Autonomous Driving | Hins Hu et.al. | 2502.03655 | null |
2025-02-05 | Robust Autonomy Emerges from Self-Play | Marco Cusumano-Towner et.al. | 2502.03349 | null |
2025-02-05 | A Scalable Approach to Probabilistic Neuro-Symbolic Verification | Vasileios Manginas et.al. | 2502.03274 | null |
2025-02-05 | Driver Assistance System Based on Multimodal Data Hazard Detection | Long Zhouxiang et.al. | 2502.03005 | null |
2025-02-05 | Label Anything: An Interpretable, High-Fidelity and Prompt-Free Annotator | Wei-Bin Kou et.al. | 2502.02972 | null |
2025-02-04 | SD++: Enhancing Standard Definition Maps by Incorporating Road Knowledge using LLMs | Hitvarth Diwanji et.al. | 2502.02773 | null |
2025-02-04 | Anytime Incremental $ρ$ POMDP Planning in Continuous Spaces | Ron Benchetrit et.al. | 2502.02549 | null |
2025-02-04 | Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks | Huiqun Huang et.al. | 2502.02537 | null |
2025-02-04 | Event-aided Semantic Scene Completion | Shangwei Guo et.al. | 2502.02334 | link |
2025-02-04 | Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features | Hsin-Cheng Lu et.al. | 2502.02322 | link |
2025-02-04 | Risk-Aware Driving Scenario Analysis with Large Language Models | Yuan Gao et.al. | 2502.02145 | link |
2025-02-04 | Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification | Rudolf Reiter et.al. | 2502.02133 | null |
2025-02-04 | From Accidents to Insights: Leveraging Multimodal Data for Scenario-Driven ADS Testing | Siwei Luo et.al. | 2502.02025 | null |
2025-02-04 | Toward a Low-Cost Perception System in Autonomous Vehicles: A Spectrum Learning Approach | Mohammed Alsakabi et.al. | 2502.01940 | null |
2025-02-04 | A Comprehensive Study of Bug-Fix Patterns in Autonomous Driving Systems | Yuntianyi Chen et.al. | 2502.01937 | null |
2025-02-04 | SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset | Goodarz Mehr et.al. | 2502.01894 | link |
2025-02-03 | Reliability-Driven LiDAR-Camera Fusion for Robust 3D Object Detection | Reza Sadeghian et.al. | 2502.01856 | null |
2025-02-01 | Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding | Jingming Xia et.al. | 2502.01666 | null |
2025-02-20 | TeLL-Drive: Enhancing Autonomous Driving with Teacher LLM-Guided Deep Reinforcement Learning | Chengkai Xu et.al. | 2502.01387 | null |
2025-02-02 | SAM-guided Pseudo Label Enhancement for Multi-modal 3D Semantic Segmentation | Mingyu Yang et.al. | 2502.00960 | null |
2025-02-02 | VLM-Assisted Continual learning for Visual Question Answering in Self-Driving | Yuxin Lin et.al. | 2502.00843 | null |
2025-02-01 | FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps | Maximilian Leitenstern et.al. | 2502.00395 | link |
2025-02-04 | INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation | Dianwei Chen et.al. | 2502.00262 | null |
2025-01-31 | SpikingRTNH: Spiking Neural Network for 4D Radar Object Detection | Dong-Hee Paek et.al. | 2502.00074 | link |
2025-01-31 | Quantum Internet Use Case Analysis for the Automotive Industry | K. L. van der Enden et.al. | 2501.19070 | null |
2025-01-31 | SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging | Javier Montalvo et.al. | 2501.19035 | link |
2025-01-31 | Open-Source Autonomous Driving Software Platforms: Comparison of Autoware and Apollo | Hee-Yang Jung et.al. | 2501.18942 | null |
2025-01-24 | STAMP: Scalable Task And Model-agnostic Collaborative Perception | Xiangbo Gao et.al. | 2501.18616 | link |
2025-01-30 | IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain | Zhe Wang et.al. | 2501.18162 | null |
2025-01-30 | DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems | Se-Wook Yoo et.al. | 2501.18086 | null |
2025-01-29 | TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection | Lei Cheng et.al. | 2501.17977 | link |
2025-01-23 | Ranging Performance Analysis in Automotive DToF Lidars | Xiao Guo et.al. | 2501.17884 | null |
2025-01-29 | SSF: Sparse Long-Range Scene Flow for Autonomous Driving | Ajinkya Khoche et.al. | 2501.17821 | link |
2025-01-28 | Anomaly Detection in Cooperative Vehicle Perception Systems under Imperfect Communication | Ashish Bastola et.al. | 2501.17329 | link |
2025-01-28 | A Contrastive Teacher-Student Framework for Novelty Detection under Style Shifts | Hossein Mirzaei et.al. | 2501.17289 | null |
2025-01-28 | Scenario Understanding of Traffic Scenes Through Large Visual Language Models | Rivera Esteban et.al. | 2501.17131 | null |
2025-01-28 | Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework | Longzhong Lin et.al. | 2501.17015 | null |
2025-02-27 | The Third Moment of AI Ethics: Developing Relatable and Contextualized Tools | Sarah Hladikova et.al. | 2501.16954 | null |
2025-01-28 | Target-driven Self-Distillation for Partial Observed Trajectories Forecasting | Pengfei Zhu et.al. | 2501.16767 | null |
2025-01-28 | Overcoming Semantic Dilution in Transformer-Based Next Frame Prediction | Hy Nguyen et.al. | 2501.16753 | null |
2025-01-28 | Dream to Drive with Predictive Individual World Model | Yinfeng Gao et.al. | 2501.16733 | link |
2025-01-28 | SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation | Jianing Li et.al. | 2501.16684 | link |
2025-01-27 | Modular Framework for Uncertainty Prediction in Autonomous Vehicle Motion Forecasting within Complex Traffic Scenarios | Han Wang et.al. | 2501.16480 | null |
2025-01-18 | Risk-Informed Diffusion Transformer for Long-Tail Trajectory Prediction in the Crash Scenario | Junlan Chen et.al. | 2501.16349 | null |
2025-02-08 | Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge | Anh-Kiet Duong et.al. | 2501.16037 | link |
2025-01-27 | LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models | Yuewen Mei et.al. | 2501.15850 | null |
2025-02-09 | Diffusion-Based Planning for Autonomous Driving with Flexible Guidance | Yinan Zheng et.al. | 2501.15564 | null |
2025-01-26 | Mitigating Spurious Negative Pairs for Robust Industrial Anomaly Detection | Hossein Mirzaei et.al. | 2501.15434 | link |
2025-03-03 | Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception | Lianqing Zheng et.al. | 2501.15394 | null |
2025-01-26 | MetaOcc: Surround-View 4D Radar and Camera Fusion Framework for 3D Occupancy Prediction with Dual Training Strategies | Long Yang et.al. | 2501.15384 | link |
2025-01-29 | Towards Robust Unsupervised Attention Prediction in Autonomous Driving | Mengshi Qi et.al. | 2501.15045 | null |
2025-01-24 | AI-driven Wireless Positioning: Fundamentals, Standards, State-of-the-art, and Challenges | Guangjin Pan et.al. | 2501.14970 | null |
2025-03-11 | Performance Evaluation of Satellite-Based Data Offloading on Starlink Constellations | Alexander Bonora et.al. | 2501.14878 | null |
2025-03-12 | HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Xin Zhou et.al. | 2501.14729 | link |
2025-01-24 | Relightable Full-Body Gaussian Codec Avatars | Shaofei Wang et.al. | 2501.14726 | null |
2025-01-24 | 3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving | Jules Sanchez et.al. | 2501.14605 | link |
2025-01-24 | Deep-BrownConrady: Prediction of Camera Calibration and Distortion Parameters Using Deep Learning and Synthetic Data | Faiz Muhammad Chaudhry et.al. | 2501.14510 | null |
2025-01-24 | MARL-OT: Multi-Agent Reinforcement Learning Guided Online Fuzzing to Detect Safety Violation in Autonomous Driving Systems | Linfeng Liang et.al. | 2501.14451 | null |
2025-01-24 | Prerequisite of superconductivity: SDW rather than tetragonal structure in double-layer La3Ni2O7-x | Mengzhu Shi et.al. | 2501.14202 | null |
2025-02-05 | GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting | Junzhe Jiang et.al. | 2501.13971 | link |
2025-01-23 | Black-Box Adversarial Attack on Vision Language Models for Autonomous Driving | Lu Wang et.al. | 2501.13563 | null |
2025-01-23 | Text-driven Online Action Detection | Manuel Benavent-Lledo et.al. | 2501.13518 | link |
2025-01-23 | Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything | Huilin Yin et.al. | 2501.13461 | null |
2025-01-23 | GeomGS: LiDAR-Guided Geometry-Aware Gaussian Splatting for Robot Localization | Jaewon Lee et.al. | 2501.13417 | null |
2025-01-22 | QuFeX: Quantum feature extraction module for hybrid quantum-classical deep neural networks | Naman Jain et.al. | 2501.13165 | null |
2025-01-23 | AdaWM: Adaptive World Model based Planning for Autonomous Driving | Hang Wang et.al. | 2501.13072 | null |
2025-01-22 | Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning | Xiaolei Chen et.al. | 2501.12799 | null |
2025-01-22 | PPO-Based Vehicle Control for Ramp Merging Scheme Assisted by Enhanced C-V2X | Qiong Wu et.al. | 2501.12656 | link |
2025-01-22 | Absence of superconductivity and density-wave transition in ambient-pressure tetragonal La $4$Ni$_3$O${10}$ | Mengzhu Shi et.al. | 2501.12647 | null |
2025-01-22 | Improved Detection and Diagnosis of Faults in Deep Neural Networks Using Hierarchical and Explainable Classification | Sigma Jahan et.al. | 2501.12560 | null |
2025-01-20 | Egoistic MDS-based Rigid Body Localization | Niclas Führling et.al. | 2501.12417 | null |
2025-03-03 | RALAD: Bridging the Real-to-Sim Domain Gap in Autonomous Driving with Retrieval-Augmented Learning | Jiacheng Zuo et.al. | 2501.12296 | link |
2025-01-21 | Video Deblurring by Sharpness Prior Detection and Edge Information | Yang Tian et.al. | 2501.12246 | link |
2025-01-21 | RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression | Uri Gadot et.al. | 2501.12216 | null |
2025-01-21 | Select2Drive: Pragmatic Communications for Real-Time Collaborative Autonomous Driving | Jiahao Huang et.al. | 2501.12040 | null |
2025-01-21 | Make Full Use of Testing Information: An Integrated Accelerated Testing and Evaluation Method for Autonomous Driving Systems | Xinzheng Wu et.al. | 2501.11924 | null |
2025-01-21 | Survey on Monocular Metric Depth Estimation | Jiuling Zhang et.al. | 2501.11841 | null |
2025-02-16 | A Survey of World Models for Autonomous Driving | Tuo Feng et.al. | 2501.11260 | null |
2025-01-19 | Car-GS: Addressing Reflective and Transparent Surface Challenges in 3D Car Reconstruction | Congcong Li et.al. | 2501.11020 | null |
2025-01-18 | Efficient and Safe Trajectory Planning for Autonomous Agricultural Vehicle Headland Turning in Cluttered Orchard Environments | Peng Wei et.al. | 2501.10636 | null |
2025-01-17 | MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection | Xiangyuan Peng et.al. | 2501.10266 | null |
2025-01-17 | Explainable artificial intelligence (XAI): from inherent explainability to large language models | Fuseini Mumuni et.al. | 2501.09967 | null |
2025-01-16 | Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing | Wenhan Wang et.al. | 2501.09866 | null |
2025-01-16 | Distilling Multi-modal Large Language Models for Autonomous Driving | Deepti Hegde et.al. | 2501.09757 | null |
2025-01-16 | The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning | Wonjun Jo et.al. | 2501.09485 | null |
2025-01-16 | MonoSOWA: Scalable monocular 3D Object detector Without human Annotations | Jan Skvrna et.al. | 2501.09481 | null |
2025-01-16 | RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection | Jianrui Shi et.al. | 2501.09465 | null |
2025-01-17 | On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Zichang Ge et.al. | 2501.09327 | link |
2025-01-16 | Modeling Language for Scenario Development of Autonomous Driving Systems | Toshiaki Aoki et.al. | 2501.09319 | null |
2025-01-15 | Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving | Tengpeng Li et.al. | 2501.08861 | link |
2025-01-16 | BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module | Dongzhihan Wang et.al. | 2501.08659 | null |
2025-01-14 | Decoding Interpretable Logic Rules from Neural Networks | Chuqin Geng et.al. | 2501.08281 | null |
2025-01-14 | LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Yukai Ma et.al. | 2501.08168 | null |
2025-01-14 | Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving | Guizhe Jin et.al. | 2501.08096 | null |
2025-01-27 | Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving | Mert Keser et.al. | 2501.08083 | null |
2025-01-14 | GAC-Net_Geometric and attention-based Network for Depth Completion | Kuang Zhu et.al. | 2501.07988 | null |
2025-01-14 | A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition | Mingke Xiao et.al. | 2501.07808 | null |
2025-01-14 | HgPCN: A Heterogeneous Architecture for E2E Embedded Point Cloud Inference | Yiming Gao et.al. | 2501.07767 | null |
2025-01-16 | PO-GVINS: Tightly Coupled GNSS-Visual-Inertial Integration with Pose-Only Representation | Zhuo Xu et.al. | 2501.07259 | null |
2025-01-13 | LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models | Mozhgan Nasr Azadani et.al. | 2501.06986 | link |
2025-01-12 | Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving | Haoxiang Gao et.al. | 2501.06680 | null |
2025-01-11 | Common Sense Is All You Need | Hugo Latapie et.al. | 2501.06642 | null |
2025-01-08 | NextStop: An Improved Tracker For Panoptic LIDAR Segmentation Data | Nirit Alkalay et.al. | 2501.06235 | null |
2025-02-23 | Leveraging Edge Intelligence and LLMs to Advance 6G-Enabled Internet of Automated Defense Vehicles | Murat Arda Onsu et.al. | 2501.06205 | null |
2025-01-10 | Vehicle-in-Virtual-Environment (VVE) Based Autonomous Driving Function Development and Evaluation Methodology for Vulnerable Road User Safety | Haochong Chen et.al. | 2501.06113 | null |
2025-01-10 | Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion | Sanjay Kumar et.al. | 2501.05997 | null |
2025-01-10 | TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos | Korawat Charoenpitaks et.al. | 2501.05733 | link |
2025-01-09 | Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding | Mohammed Elhenawy et.al. | 2501.05566 | null |
2025-01-07 | Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud: A Survey | Jingyuan Tang et.al. | 2501.05473 | link |
2025-01-09 | The global consensus on the risk management of autonomous driving | Sebastian Krügel et.al. | 2501.05391 | null |
2025-01-09 | Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions | Shishir Muralidhara et.al. | 2501.05246 | null |
2025-01-09 | CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection | Xiang Zhang et.al. | 2501.05132 | null |
2025-01-09 | DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving | Xuran Zheng et.al. | 2501.05081 | null |
2025-01-09 | LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models | Zengqi Peng et.al. | 2501.05057 | null |
2025-01-09 | CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving | Bhargava Uppuluri et.al. | 2501.04982 | null |
2025-01-09 | AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data | Haoran Zhu et.al. | 2501.04969 | link |
2025-01-08 | Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions | Doaa Mahmud et.al. | 2501.04437 | null |
2025-01-08 | FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection | Guoxin Zhang et.al. | 2501.04373 | null |
2025-01-08 | H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving | Siran Chen et.al. | 2501.04302 | null |
2025-01-07 | LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving | Lingdong Kong et.al. | 2501.04005 | null |
2025-01-07 | Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives | Shaoyuan Xie et.al. | 2501.04003 | link |
2025-01-19 | Image Segmentation: Inducing graph-based learning | Aryan Singh et.al. | 2501.03765 | link |
2025-01-07 | Hybrid Machine Learning Model with a Constrained Action Space for Trajectory Prediction | Alexander Fertig et.al. | 2501.03666 | null |
2025-01-08 | SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving | Xuewen Luo et.al. | 2501.03535 | null |
2025-01-06 | MObI: Multimodal Object Inpainting Using Diffusion Models | Alexandru Buburuzan et.al. | 2501.03173 | null |
2025-01-06 | 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation | Jiexi Zhong et.al. | 2501.02937 | null |
2025-01-06 | A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation | Toomas Tahves et.al. | 2501.02858 | null |
2025-01-13 | LDMapNet-U: An End-to-End System for City-Scale Lane-Level Map Updating | Deguo Xia et.al. | 2501.02763 | null |
2025-01-05 | UDMC: Unified Decision-Making and Control Framework for Urban Autonomous Driving with Motion Prediction of Traffic Participants | Haichao Liu et.al. | 2501.02530 | link |
2025-01-05 | GCP: Guarded Collaborative Perception with Spatial-Temporal Aware Malicious Agent Detection | Yihang Tao et.al. | 2501.02450 | null |
2025-01-04 | RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar | Liye Jia et.al. | 2501.02314 | link |
2025-01-01 | Communication Efficient Cooperative Edge AI via Event-Triggered Computation Offloading | You Zhou et.al. | 2501.02001 | null |
2025-01-03 | Evaluating Scenario-based Decision-making for Interactive Autonomous Driving Using Rational Criteria: A Survey | Zhen Tian et.al. | 2501.01886 | null |
2025-01-03 | Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models | Andrea Matteazzi et.al. | 2501.01761 | null |
2025-01-03 | Enhancing Large Vision Model in Street Scene Semantic Understanding through Leveraging Posterior Optimization Trajectory | Wei-Bin Kou et.al. | 2501.01710 | null |
2025-01-02 | MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception | Xiaoshuai Hao et.al. | 2501.01037 | null |
2024-12-31 | STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes | Jiawei Yang et.al. | 2501.00602 | null |
2025-01-03 | DreamDrive: Generative 4D Scene Modeling from Street View Images | Jiageng Mao et.al. | 2501.00601 | null |
2024-12-31 | Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method | Zhenpeng Huang et.al. | 2501.00584 | null |
2024-12-31 | Toward Information Theoretic Active Inverse Reinforcement Learning | Ondrej Bajgar et.al. | 2501.00381 | null |
2024-12-31 | Research on vehicle detection based on improved YOLOv8 network | Haocheng Guo et.al. | 2501.00300 | null |
2025-01-09 | Automotive Speed Estimation: Sensor Types and Error Characteristics from OBD-II to ADAS | Hany Ragab et.al. | 2501.00242 | null |
2024-12-31 | DecoratingFusion: A LiDAR-Camera Fusion Network with the Combination of Point-level and Feature-level Fusion | Zixuan Yin et.al. | 2501.00220 | null |
2025-01-02 | Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation | Yuanbo Yang et.al. | 2412.21117 | null |
2024-12-30 | TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation | Shaoqing Xu et.al. | 2412.20911 | link |
2024-12-30 | DEMO: A Dynamics-Enhanced Learning Model for Multi-Horizon Trajectory Prediction in Autonomous Vehicles | Chengyue Wang et.al. | 2412.20784 | null |
2024-12-29 | MR-Occ: Efficient Camera-LiDAR 3D Semantic Occupancy Prediction Using Hierarchical Multi-Resolution Voxel Representation | Minjae Seong et.al. | 2412.20480 | null |
2024-12-28 | Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception | Athanasios Karagounis et.al. | 2412.20230 | null |
2024-12-28 | Multi-Modality Driven LoRA for Adverse Condition Depth Estimation | Guanglei Yang et.al. | 2412.20162 | null |
2024-12-28 | Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses | Xinru Wen et.al. | 2412.20154 | null |
2024-12-28 | DepthMamba with Adaptive Fusion | Zelin Meng et.al. | 2412.19964 | null |
2024-12-27 | Zero-shot Hazard Identification in Autonomous Driving: A Case Study on the COOOL Benchmark | Lukas Picek et.al. | 2412.19944 | null |
2024-12-30 | DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT | Xiaotao Hu et.al. | 2412.19505 | link |
2024-12-30 | DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Yiyuan Liang et.al. | 2412.19458 | link |
2024-12-27 | MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios | Jiaqi Fan et.al. | 2412.19406 | link |
2024-12-25 | TopoBDA: Towards Bezier Deformable Attention for Road Topology Understanding | Muhammet Esat Kalfaoglu et.al. | 2412.18951 | null |
2024-12-30 | HV-BEV: Decoupling Horizontal and Vertical Feature Sampling for Multi-View 3D Object Detection | Di Wu et.al. | 2412.18884 | null |
2024-12-25 | TSceneJAL: Joint Active Learning of Traffic Scenes for 3D Object Detection | Chenyang Lei et.al. | 2412.18870 | link |
2024-12-25 | Evaluating the Adversarial Robustness of Detection Transformers | Amirhossein Nazeri et.al. | 2412.18718 | null |
2024-12-24 | Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving | Hao Pang et.al. | 2412.18511 | null |
2024-12-24 | GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications | Zhenzhou Jin et.al. | 2412.18281 | null |
2025-01-17 | High-Rank Irreducible Cartesian Tensor Decomposition and Bases of Equivariant Spaces | Shihao Shao et.al. | 2412.18263 | link |
2024-12-24 | Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing | Suwesh Prasad Sah et.al. | 2412.18165 | link |
2024-12-24 | Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner | Aizierjiang Aiersilan et.al. | 2412.18086 | link |
2024-12-23 | AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data | Mirko Zaffaroni et.al. | 2412.18038 | link |
2024-12-23 | Multimodal Learning with Uncertainty Quantification based on Discounted Belief Fusion | Grigor Bezirganyan et.al. | 2412.18024 | link |
2025-02-05 | Causal Composition Diffusion Model for Closed-loop Traffic Generation | Haohong Lin et.al. | 2412.17920 | null |
2024-12-23 | DeepMF: Deep Motion Factorization for Closed-Loop Safety-Critical Driving Scenario Simulation | Yizhe Li et.al. | 2412.17487 | null |
2025-01-04 | Feature Based Methods in Domain Adaptation for Object Detection: A Review Paper | Helia Mohamadi et.al. | 2412.17325 | null |
2024-12-23 | OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving | Tianyi Yan et.al. | 2412.17226 | null |
2024-12-22 | NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors | Ziqi Zhou et.al. | 2412.16955 | link |
2024-12-22 | Lightweight Design and Optimization methods for DCNNs: Progress and Futures | Hanhua Long et.al. | 2412.16886 | null |
2024-12-22 | Phase-change metasurfaces for reconfigurable image processing | Tingting Liu et.al. | 2412.16856 | null |
2024-12-21 | Towards Selection and Transition Between Behavior-Based Neural Networks for Automated Driving | Iqra Aslam et.al. | 2412.16764 | null |
2024-12-21 | A Method for the Runtime Validation of AI-based Environment Perception in Automated Driving System | Iqra Aslam et.al. | 2412.16762 | null |
2025-02-26 | Application of Multimodal Large Language Models in Autonomous Driving | Md Robiul Islam et.al. | 2412.16410 | null |
2024-12-20 | Mapping the Mind of an Instruction-based Image Editing using SMILE | Zeinab Dehghani et.al. | 2412.16277 | link |
2025-02-14 | Autoware.Flex: Human-Instructed Dynamically Reconfigurable Autonomous Driving Systems | Ziwei Song et.al. | 2412.16265 | null |
2024-12-20 | Optimizing Low-Speed Autonomous Driving: A Reinforcement Learning Approach to Route Stability and Maximum Speed | Benny Bao-Sheng Li et.al. | 2412.16248 | null |
2024-12-17 | CLIP-RLDrive: Human-Aligned Autonomous Driving via CLIP-Based Reward Shaping in Reinforcement Learning | Erfan Doroudian et.al. | 2412.16201 | null |
2024-12-20 | Camera-Based Localization and Enhanced Normalized Mutual Information | Vishnu Teja Kunde et.al. | 2412.16137 | null |
2024-12-20 | Sparse Point Clouds Assisted Learned Image Compression | Yiheng Jiang et.al. | 2412.15752 | null |
2024-12-20 | Mask-RadarNet: Enhancing Transformer With Spatial-Temporal Semantic Context for Radar Object Detection in Autonomous Driving | Yuzhi Wu et.al. | 2412.15595 | null |
2024-12-20 | VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving | Zilin Huang et.al. | 2412.15544 | null |
2024-12-26 | LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction | Pou-Chun Kung et.al. | 2412.15447 | null |
2025-02-14 | OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Shuo Xing et.al. | 2412.15208 | link |
2024-12-19 | AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Shuo Xing et.al. | 2412.15206 | link |
2024-12-25 | Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models | Zijun Chen et.al. | 2412.14660 | link |
2024-12-19 | Drive-1-to-3: Enriching Diffusion Priors for Novel View Synthesis of Real Vehicles | Chuang Lin et.al. | 2412.14494 | null |
2024-12-19 | VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision | Yi Xu et.al. | 2412.14446 | null |
2025-02-12 | DriveGPT: Scaling Autoregressive Behavior Models for Driving | Xin Huang et.al. | 2412.14415 | null |
2024-12-17 | A Comprehensive Review on Traffic Datasets and Simulators for Autonomous Vehicles | Supriya Sarker et.al. | 2412.14207 | null |
2024-12-18 | Joint Perception and Prediction for Autonomous Driving: A Survey | Lucas Dal’Col et.al. | 2412.14088 | link |
2025-02-04 | A Black-Box Evaluation Framework for Semantic Robustness in Bird’s Eye View Detection | Fu Wang et.al. | 2412.13913 | link |
2024-12-18 | Object Style Diffusion for Generalized Object Detection in Urban Scene | Hao Li et.al. | 2412.13815 | null |
2024-12-18 | SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems | Huiwen Yang et.al. | 2412.13802 | null |
2024-12-18 | An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training | Haiming Zhang et.al. | 2412.13772 | null |
2024-12-18 | Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration | Dominik Werner Wolf et.al. | 2412.13695 | null |
2024-12-18 | Level-Set Parameters: Novel Representation for 3D Shape Analysis | Huan Lei et.al. | 2412.13502 | null |
2024-12-18 | Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation | Xiaoqi An et.al. | 2412.13454 | link |
2024-12-18 | Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction | Chandra Raskoti et.al. | 2412.13419 | null |
2024-12-17 | Quantitative Predictive Monitoring and Control for Safe Human-Machine Interaction | Shuyang Dong et.al. | 2412.13365 | null |
2024-12-19 | SafeDrive: Knowledge- and Data-Driven Risk-Sensitive Decision-Making for Autonomous Vehicles with Large Language Models | Zhiyuan Zhou et.al. | 2412.13238 | null |
2024-12-24 | C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory Prediction | Zichen Wang et.al. | 2412.13231 | link |
2024-12-17 | GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Haoyi Jiang et.al. | 2412.13193 | link |
2024-12-17 | StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models | Yunzhi Yan et.al. | 2412.13188 | null |
2024-12-17 | Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks | Xiaxin Zhu et.al. | 2412.12843 | null |
2024-12-17 | Open-World Panoptic Segmentation | Matteo Sodano et.al. | 2412.12740 | null |
2024-12-17 | MapExpert: Online HD Map Construction with Simple and Efficient Sparse Map Element Expert | Dapeng Zhang et.al. | 2412.12704 | null |
2024-12-17 | DriveTester: A Unified Platform for Simulation-Based Autonomous Driving Testing | Mingfei Cheng et.al. | 2412.12656 | link |
2024-12-17 | Improving the Transferability of 3D Point Cloud Attack via Spectral-aware Admix and Optimization Designs | Shiyu Hu et.al. | 2412.12626 | null |
2024-12-17 | Task-Parameter Nexus: Task-Specific Parameter Learning for Model-Based Control | Sheng Cheng et.al. | 2412.12448 | null |
2024-12-16 | Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset | Madiyar Alimov et.al. | 2412.12349 | null |
2024-12-16 | PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Cheng Zhang et.al. | 2412.12096 | link |
2024-12-16 | CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception | Senkang Hu et.al. | 2412.12000 | null |
2024-12-16 | Point Cloud-Assisted Neural Image Compression | Ziqun Li et.al. | 2412.11771 | null |
2024-12-16 | NEST: A Neuromodulated Small-world Hypergraph Trajectory Prediction Model for Autonomous Driving | Chengyue Wang et.al. | 2412.11682 | null |
2024-12-16 | DINO-Foresight: Looking into the Future with DINO | Efstathios Karypidis et.al. | 2412.11673 | link |
2024-12-16 | AEPHORA: AI/ML-Based Energy-Efficient Proactive Handover and Resource Allocation | Bowen Xie et.al. | 2412.11491 | null |
2024-12-16 | HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection | Zijian Gu et.al. | 2412.11489 | link |
2024-12-16 | Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents | Wonje Choi et.al. | 2412.11484 | null |
2025-01-10 | ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction | Yi Feng et.al. | 2412.11210 | link |
2024-12-15 | GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control | Mariam Hassan et.al. | 2412.11198 | link |
2024-12-15 | RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models | Yujin Wang et.al. | 2412.11050 | null |
2024-12-15 | SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation | Hang Zhang et.al. | 2412.11026 | null |
2025-01-23 | OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving | Lianqing Zheng et.al. | 2412.10734 | null |
2024-12-11 | Automatic Image Annotation for Mapped Features Detection | Maxime Noizet et.al. | 2412.10438 | null |
2024-12-13 | GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction | Sicheng Zuo et.al. | 2412.10373 | link |
2024-12-13 | GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Wenzhao Zheng et.al. | 2412.10371 | link |
2024-12-13 | Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving | Zhihang Song et.al. | 2412.10033 | null |
2024-12-17 | WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model | Songyan Zhang et.al. | 2412.09951 | link |
2024-12-13 | EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models | Hanchu Zhou et.al. | 2412.09782 | null |
2024-12-11 | Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model | Junqi You et.al. | 2412.09647 | null |
2024-12-12 | Doe-1: Closed-Loop Autonomous Driving with Large World Model | Wenzhao Zheng et.al. | 2412.09627 | link |
2024-12-13 | Hidden Biases of End-to-End Driving Datasets | Julian Zimmerlin et.al. | 2412.09602 | link |
2024-12-12 | Slope Considered Online Nonlinear Trajectory Planning with Differential Energy Model for Autonomous Driving | Zhaofeng Tian et.al. | 2412.09424 | null |
2024-12-12 | MMD-OPT : Maximum Mean Discrepancy Based Sample Efficient Collision Risk Minimization for Autonomous Driving | Basant Sharma et.al. | 2412.09121 | null |
2024-12-12 | DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving | Hao Lu et.al. | 2412.09043 | link |
2024-12-12 | EMATO: Energy-Model-Aware Trajectory Optimization for Autonomous Driving | Zhaofeng Tian et.al. | 2412.08830 | null |
2024-12-11 | Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning | Prajwal Koirala et.al. | 2412.08794 | null |
2024-12-11 | GPD-1: Generative Pre-training for Driving | Zixun Xie et.al. | 2412.08643 | link |
2024-12-11 | An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios | Leandro Parada et.al. | 2412.08562 | null |
2024-12-13 | Physical Informed Driving World Model | Zhuoran Yang et.al. | 2412.08410 | null |
2024-12-11 | Task-specific Self-body Controller Acquisition by Musculoskeletal Humanoids: Application to Pedal Control in Autonomous Driving | Kento Kawaharazuka et.al. | 2412.08270 | null |
2024-12-11 | Neural Observation Field Guided Hybrid Optimization of Camera Placement | Yihan Cao et.al. | 2412.08266 | link |
2024-12-11 | Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | Zhigang Cen et.al. | 2412.08034 | null |
2024-12-10 | From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos | Matthew Wallingford et.al. | 2412.07770 | link |
2024-12-11 | Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting | Zetong Yang et.al. | 2412.07768 | null |
2024-12-10 | LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models | Ziqi Lu et.al. | 2412.07746 | null |
2024-12-13 | DriveMM: All-in-One Large Multimodal Model for Autonomous Driving | Zhijian Huang et.al. | 2412.07689 | link |
2024-12-26 | CoinCLIP: A Multimodal Framework for Assessing Viability in Web3 Memecoins | Hou-Wan Long et.al. | 2412.07591 | null |
2024-12-10 | Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Jiaqi Fan et.al. | 2412.07518 | link |
2024-12-10 | A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM | Zongbo Liao et.al. | 2412.07513 | null |
2024-12-10 | ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving | Rongqing Li et.al. | 2412.07369 | null |
2024-12-10 | Fast Occupancy Network | Mingjie Lu et.al. | 2412.07163 | null |
2024-12-09 | Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving | Xin Fei et.al. | 2412.06777 | link |
2024-12-14 | Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | Weichao Xu et.al. | 2412.06684 | null |
2024-12-09 | Prediction of Occluded Pedestrians in Road Scenes using Human-like Reasoning: Insights from the OccluRoads Dataset | Melo Castillo Angie Nataly et.al. | 2412.06549 | null |
2024-12-09 | PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting | Yihong Xu et.al. | 2412.06491 | null |
2025-01-02 | World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving | Mingliang Zhai et.al. | 2412.06324 | null |
2024-12-09 | Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction | Dongxu Wei et.al. | 2412.06273 | null |
2024-12-09 | Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization | Fei Yu et.al. | 2412.06208 | null |
2024-12-09 | AgentAlign: Misalignment-Adapted Multi-Agent Perception for Resilient Inter-Agent Sensor Correlations | Zonglin Meng et.al. | 2412.06142 | null |
2024-12-09 | HSDA: High-frequency Shuffle Data Augmentation for Bird’s-Eye-View Map Segmentation | Calvin Glisson et.al. | 2412.06127 | link |
2024-12-08 | GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion | Karlo Koledic et.al. | 2412.06080 | null |
2024-12-08 | Lightweight Spatial Embedding for Vision-based 3D Occupancy Prediction | Jinqing Zhang et.al. | 2412.05976 | null |
2024-12-08 | A Review on Multisensor Data Fusion for Wearable Health Monitoring | Arlene John et.al. | 2412.05895 | null |
2024-12-08 | doScenes: An Autonomous Driving Dataset with Natural Language Instruction for Human Interaction and Vision-Language Navigation | Parthib Roy et.al. | 2412.05893 | link |
2024-12-07 | Real-Time 3D Object Detection Using InnovizOne LiDAR and Low-Power Hailo-8 AI Accelerator | Itay Krispin-Avraham et.al. | 2412.05594 | link |
2024-12-06 | COOOL: Challenge Of Out-Of-Label A Novel Benchmark for Autonomous Driving | Ali K. AlShami et.al. | 2412.05462 | link |
2024-12-06 | UniScene: Unified Occupancy-centric Driving Scene Generation | Bohan Li et.al. | 2412.05435 | null |
2024-12-06 | Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images | Junno Yun et.al. | 2412.05341 | null |
2024-12-06 | ACT-Bench: Towards Action Controllable World Models for Autonomous Driving | Hidehisa Arai et.al. | 2412.05337 | null |
2024-12-05 | Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models | Zhejun Zhang et.al. | 2412.05334 | link |
2024-12-11 | Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model | Lening Wang et.al. | 2412.05280 | link |
2024-12-10 | Extrapolated Urban View Synthesis Benchmark | Xiangyu Han et.al. | 2412.05256 | link |
2024-12-06 | Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Chaoda Zheng et.al. | 2412.05154 | link |
2024-12-06 | Backdooring Outlier Detection Methods: A Novel Attack Approach | ZeinabSadat Taghavi et.al. | 2412.05010 | null |
2025-01-20 | UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving | Rui Chen et.al. | 2412.04842 | link |
2024-12-06 | GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction | Yuanhui Huang et.al. | 2412.04384 | link |
2024-12-05 | Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird’s-Eye-View via Uncertainty Measure | Saheli Hazra et.al. | 2412.04337 | null |
2024-12-05 | YOLO-CCA: A Context-Based Approach for Traffic Sign Detection | Linfeng Jiang et.al. | 2412.04289 | link |
2024-12-05 | CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model | Ruoyu Yao et.al. | 2412.04209 | null |
2024-12-05 | UNCOVER: Unknown Class Object Detection for Autonomous Vehicles in Real-time | Lars Schmarje et.al. | 2412.03986 | null |
2024-12-05 | Learning Based MPC for Autonomous Driving Using a Low Dimensional Residual Model | Yaoyu Li et.al. | 2412.03874 | null |
2024-12-05 | Using Cooperative Co-evolutionary Search to Generate Metamorphic Test Cases for Autonomous Driving Systems | Hossein Yousefizadeh et.al. | 2412.03843 | null |
2024-12-05 | Safe Adaptive Cruise Control Under Perception Uncertainty: A Deep Ensemble and Conformal Tube Model Predictive Control Approach | Xiao Li et.al. | 2412.03792 | null |
2024-12-04 | Advancing Auto-Regressive Continuation for Video Frames | Ruibo Ming et.al. | 2412.03758 | null |
2024-12-04 | Designing DNNs for a trade-off between robustness and processing performance in embedded devices | Jon Gutiérrez-Zaballa et.al. | 2412.03682 | null |
2024-12-04 | Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective | Jon Gutiérrez-Zaballa et.al. | 2412.03630 | link |
2024-12-04 | Streaming Detection of Queried Event Start | Cristobal Eyzaguirre et.al. | 2412.03567 | link |
2024-12-04 | FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes | Lue Fan et.al. | 2412.03566 | null |
2024-12-09 | Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention | Hannan Lu et.al. | 2412.03520 | null |
2024-12-04 | Data Fusion of Semantic and Depth Information in the Context of Object Detection | Md Abu Yusuf et.al. | 2412.03490 | null |
2024-12-04 | A Survey of Wireless Sensing Security from a Role-Based View: Victim, Weapon, and Shield | Ruixu Geng et.al. | 2412.03064 | link |
2024-12-04 | Lightweight Stochastic Video Prediction via Hybrid Warping | Kazuki Kotoyori et.al. | 2412.03061 | null |
2024-12-04 | Less is More: A Stealthy and Efficient Adversarial Attack Method for DRL-based Autonomous Driving Policies | Junchao Fan et.al. | 2412.03051 | null |
2024-12-03 | Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects | Abdurrahman Zeybey et.al. | 2412.02803 | null |
2024-12-09 | Synergistic Development of Perovskite Memristors and Algorithms for Robust Analog Computing | Nanyang Ye et.al. | 2412.02779 | null |
2024-12-13 | MVCTrack: Boosting 3D Point Cloud Tracking via Multimodal-Guided Virtual Cues | Zhaofeng Hu et.al. | 2412.02734 | link |
2024-12-03 | Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving | Yupeng Zheng et.al. | 2412.02689 | link |
2024-12-03 | Generating Critical Scenarios for Testing Automated Driving Systems | Trung-Hieu Nguyen et.al. | 2412.02574 | null |
2024-12-03 | Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction | Ziqian Zou et.al. | 2412.02395 | null |
2024-12-03 | Trajectory-based Road Autolabeling with Lidar-Camera Fusion in Winter Conditions | Eerik Alamikkotervo et.al. | 2412.02370 | link |
2024-12-03 | Underload: Defending against Latency Attacks for Object Detectors on Edge Devices | Tianyi Wang et.al. | 2412.02171 | null |
2024-12-02 | PKRD-CoT: A Unified Chain-of-thought Prompting for Multi-Modal Large Language Models in Autonomous Driving | Xuewen Luo et.al. | 2412.02025 | null |
2024-12-02 | HPRM: High-Performance Robotic Middleware for Intelligent Autonomous Systems | Jacky Kwok et.al. | 2412.01799 | null |
2024-12-02 | HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous Driving | Hongyu Zhou et.al. | 2412.01718 | null |
2024-12-02 | 6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting | Yufeng Jin et.al. | 2412.01543 | null |
2024-12-04 | InfinityDrive: Breaking Time Limits in Driving World Models | Xi Guo et.al. | 2412.01522 | null |
2024-12-03 | HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving | Zehuan Wu et.al. | 2412.01407 | null |
2024-12-02 | FedPAW: Federated Learning with Personalized Aggregation Weights for Urban Vehicle Speed Prediction | Yuepeng He et.al. | 2412.01281 | link |
2024-12-03 | Double-Directional V2V Channel Measurement using ReRoMA at 60 GHz | Hussein Hammoud et.al. | 2412.01165 | null |
2024-12-02 | STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation | Sunghun Yang et.al. | 2412.01090 | null |
2024-12-02 | LiDAR SLAMMOT based on Confidence-guided Data Association | Susu Fang et.al. | 2412.01041 | null |
2024-12-02 | Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control | Seongmin Park et.al. | 2412.01034 | null |
2024-12-01 | BDefects4NN: A Backdoor Defect Database for Controlled Localization Studies in Neural Networks | Yisong Xiao et.al. | 2412.00746 | null |
2025-02-03 | SEED4D: A Synthetic Ego–Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark | Marius Kästingschäfer et.al. | 2412.00730 | link |
2025-01-08 | Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning | Tianshuo Xu et.al. | 2412.00547 | link |
2024-11-30 | Density-aware Global-Local Attention Network for Point Cloud Segmentation | Chade Li et.al. | 2412.00489 | null |
2024-11-29 | FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation | Chang Won Lee et.al. | 2411.19888 | null |
2024-11-29 | SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Philipp Wolters et.al. | 2411.19860 | null |
2024-11-29 | A Multi-Loss Strategy for Vehicle Trajectory Prediction: Combining Off-Road, Diversity, and Directional Consistency Losses | Ahmad Rahimi et.al. | 2411.19747 | link |
2024-11-29 | AdvFuzz: Finding More Violations Caused by the EGO Vehicle in Simulation Testing by Adversarial NPC Vehicles | You Lu et.al. | 2411.19567 | null |
2024-11-29 | ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration | Chaojun Ni et.al. | 2411.19548 | null |
2024-11-28 | Mapping Public Perception of Artificial Intelligence: Expectations, Risk-Benefit Tradeoffs, and Value As Determinants for Societal Acceptance | Philipp Brauner et.al. | 2411.19356 | null |
2024-11-28 | UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation | Yichong Lu et.al. | 2411.19292 | null |
2024-11-28 | SADG: Segment Any Dynamic Gaussian Without Object Trackers | Yun-Jin Li et.al. | 2411.19290 | link |
2024-11-28 | On-chip Hyperspectral Image Segmentation with Fully Convolutional Networks for Scene Understanding in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.19274 | null |
2024-11-28 | InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception | Haijie Li et.al. | 2411.19235 | null |
2024-11-28 | Visual SLAMMOT Considering Multiple Motion Models | Peilin Tian et.al. | 2411.19134 | null |
2024-11-28 | Synergizing Decision Making and Trajectory Planning Using Two-Stage Optimization for Autonomous Vehicles | Wenru Liu et.al. | 2411.18974 | null |
2024-11-28 | T2SG: Traffic Topology Scene Graph for Topology Reasoning in Autonomous Driving | Changsheng Lv et.al. | 2411.18894 | null |
2024-11-28 | Improving Batch Normalization with TTA for Robust Object Detection in Self-Driving | Dacheng Liao et.al. | 2411.18860 | null |
2024-11-27 | TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video | Jinyuan Qu et.al. | 2411.18671 | null |
2024-11-30 | InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving | Xiyan Jiang et.al. | 2411.18302 | link |
2024-11-27 | Visual Adversarial Attack on Vision-Language Models for Autonomous Driving | Tianyuan Zhang et.al. | 2411.18275 | null |
2024-12-01 | From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Zizhao Li et.al. | 2411.18207 | link |
2024-12-16 | Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning | Di Zhang et.al. | 2411.18203 | null |
2024-11-27 | Edge-Assisted Accelerated Cooperative Sensing for CAVs: Task Placement and Resource Allocation | Yuxuan Wang et.al. | 2411.18129 | null |
2024-11-27 | FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback | Kangan Qian et.al. | 2411.18013 | null |
2024-11-26 | Stealthy Multi-Task Adversarial Attacks | Jiacheng Guo et.al. | 2411.17936 | null |
2024-11-26 | DECODE: Domain-aware Continual Domain Expansion for Motion Prediction | Boqi Li et.al. | 2411.17917 | link |
2024-11-26 | Multimodal Crash Likelihood Prediction: A Complexity-Infused Approach Integrating Semantic, Contextual, and Driving Features | Meng Wang et.al. | 2411.17886 | null |
2024-11-26 | OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection | Zhongyu Xia et.al. | 2411.17761 | link |
2024-11-26 | Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation | Niharika Hegde et.al. | 2411.17610 | null |
2024-11-26 | Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17543 | null |
2024-11-26 | HSI-Drive v2.0: More Data for New Challenges in Scene Understanding for Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17530 | null |
2024-11-26 | CoA: Chain-of-Action for Generative Semantic Labels | Meng Wei et.al. | 2411.17406 | link |
2024-11-26 | LHPF: Look back the History and Plan for the Future in Autonomous Driving | Sheng Wang et.al. | 2411.17253 | null |
2024-11-26 | Enhancing Lane Segment Perception and Topology Reasoning with Crowdsourcing Trajectory Priors | Peijin Jia et.al. | 2411.17161 | null |
2024-11-27 | SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving | Georg Hess et.al. | 2411.16816 | link |
2024-11-25 | One is Plenty: A Polymorphic Feature Interpreter for Immutable Heterogeneous Collaborative Perception | Yuchen Xia et.al. | 2411.16799 | null |
2024-11-25 | MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Vladimir Yugay et.al. | 2411.16785 | null |
2024-11-25 | SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models | Harsh Goel et.al. | 2411.16776 | null |
2024-11-23 | FollowGen: A Scaled Noise Conditional Diffusion Model for Car-Following Trajectory Prediction | Junwei You et.al. | 2411.16747 | null |
2024-11-23 | Gradient-Guided Parameter Mask for Multi-Scenario Image Restoration Under Adverse Weather | Jilong Guo et.al. | 2411.16739 | link |
2024-11-23 | Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method | Pan Yin et.al. | 2411.16733 | link |
2024-12-03 | Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification | S. P. Sharan et.al. | 2411.16718 | link |
2024-11-25 | Generating Out-Of-Distribution Scenarios Using Language Models | Erfan Aasi et.al. | 2411.16554 | null |
2024-11-25 | Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction | Haoming Li et.al. | 2411.16457 | null |
2024-11-25 | A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models | Manuel Schwonberg et.al. | 2411.16407 | null |
2024-12-11 | Monocular Lane Detection Based on Deep Learning: A Survey | Xin He et.al. | 2411.16316 | link |
2024-11-25 | A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads | Rafael S. Toledo et.al. | 2411.16295 | link |
2024-11-25 | End-to-End Steering for Autonomous Vehicles via Conditional Imitation Co-Learning | Mahmoud M. Kishky et.al. | 2411.16131 | null |
2024-11-25 | Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion | Jongseong Bae et.al. | 2411.16129 | null |
2024-11-24 | Performance Implications of Multi-Chiplet Neural Processing Units on Autonomous Driving Perception | Mohanad Odema et.al. | 2411.16007 | null |
2024-11-24 | SARS: A Resource Selection Algorithm for Autonomous Driving Tasks in Heterogeneous Mobile Edge Computing | Reza Zakerian et.al. | 2411.15989 | null |
2024-12-23 | DRIVE: Dual-Robustness via Information Variability and Entropic Consistency in Source-Free Unsupervised Domain Adaptation | Ruiqiang Xiao et.al. | 2411.15976 | null |
2024-11-24 | Algorithmics and Complexity of Cost-Driven Task Offloading with Submodular Optimization in Edge-Cloud Environments | Longkun Guo et.al. | 2411.15687 | null |
2024-11-23 | Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data | Rui Huang et.al. | 2411.15657 | null |
2024-11-23 | EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting | Xiaobao Wei et.al. | 2411.15582 | null |
2024-11-23 | SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving | Su Sun et.al. | 2411.15482 | null |
2024-11-22 | UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations | Yuan Ren et.al. | 2411.15355 | null |
2024-11-22 | Adversarial Prompt Distillation for Vision-Language Models | Lin Luo et.al. | 2411.15244 | null |
2024-11-22 | DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving | Bencheng Liao et.al. | 2411.15139 | link |
2024-11-25 | Enhancing Autonomous Driving Safety through World Model-Based Predictive Navigation and Adaptive Learning Algorithms for 5G Wireless Applications | Hong Ding et.al. | 2411.15042 | null |
2024-11-22 | MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving | Hongsi Liu et.al. | 2411.15016 | null |
2024-11-22 | FTA generation using GenAI with an Autonomy sensor Usecase | Sneha Sudhir Shetiya et.al. | 2411.15007 | null |
2024-11-22 | LiDAR-based End-to-end Temporal Perception for Vehicle-Infrastructure Cooperation | Zhenwei Yang et.al. | 2411.14927 | null |
2024-11-22 | Benchmarking the Robustness of Optical Flow Estimation to Corruptions | Zhonghua Yi et.al. | 2411.14865 | link |
2024-11-22 | TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior | Sen Yang et.al. | 2411.14751 | null |
2024-11-22 | VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving | Haiming Zhang et.al. | 2411.14716 | null |
2024-11-21 | A Systematic Study of Multi-Agent Deep Reinforcement Learning for Safe and Robust Autonomous Highway Ramp Entry | Larry Schester et.al. | 2411.14593 | null |
2024-11-21 | Open Challenges in the Formal Verification of Autonomous Driving | Paolo Burgio et.al. | 2411.14520 | null |
2024-11-21 | Understanding World or Predicting Future? A Comprehensive Survey of World Models | Jingtao Ding et.al. | 2411.14499 | null |
2024-11-21 | Model Checking for Reinforcement Learning in Autonomous Driving: One Can Do More Than You Think! | Rong Gu et.al. | 2411.14375 | null |
2024-11-21 | Formal Simulation and Visualisation of Hybrid Programs | Pedro Mendes et.al. | 2411.14365 | null |
2024-11-21 | Generalizing End-To-End Autonomous Driving In Real-World Environments Using Zero-Shot LLMs | Zeyu Dong et.al. | 2411.14256 | null |
2024-11-21 | FedRAV: Hierarchically Federated Region-Learning for Traffic Object Classification of Autonomous Vehicles | Yijun Zhai et.al. | 2411.13979 | link |
2024-11-21 | Trajectory Tracking Using Frenet Coordinates with Deep Deterministic Policy Gradient | Tongzhou Jiang et.al. | 2411.13885 | null |
2024-11-21 | MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control | Ruiyuan Gao et.al. | 2411.13807 | null |
2024-11-21 | A Survey on Adversarial Robustness of LiDAR-based Machine Learning Perception in Autonomous Vehicles | Junae Kim et.al. | 2411.13778 | null |
2024-11-20 | MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection | Tong Ning et.al. | 2411.13628 | null |
2024-11-20 | WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving | Siwei Chen et.al. | 2411.13340 | link |
2024-11-20 | A Resource Efficient Fusion Network for Object Detection in Bird’s-Eye View using Camera and Raw Radar Data | Kavin Chandrasekaran et.al. | 2411.13311 | link |
2024-11-20 | YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Thomas Pöllabauer et.al. | 2411.13149 | link |
2024-11-20 | Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning | Zhi Luo et.al. | 2411.13116 | null |
2024-11-26 | DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving | Xianda Guo et.al. | 2411.13112 | link |
2024-11-20 | Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving | Hao Zhou et.al. | 2411.13076 | null |
2024-11-20 | Study of Group III-V Waveguides on Sapphire Platform for Photonic Integrated Circuits | Manoj Kumar Shah et.al. | 2411.13035 | null |
2024-11-25 | LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement | Siwen Jiao et.al. | 2411.12980 | null |
2024-11-20 | M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction | Luoxi Zhang et.al. | 2411.12635 | link |
2024-11-19 | GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Shaoqing Xu et.al. | 2411.12452 | link |
2025-01-05 | Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph | Ziyang Chen et.al. | 2411.12426 | link |
2024-11-19 | C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention | Xiaohe Li et.al. | 2411.12313 | null |
2024-11-19 | Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation | Zhuangwei Zhuang et.al. | 2411.12177 | link |
2024-11-18 | Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation | Hanieh Shojaei Miandashti et.al. | 2411.11935 | null |
2024-11-18 | DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes | Chensheng Peng et.al. | 2411.11921 | link |
2024-11-17 | ModeSeq: Taming Sparse Multimodal Motion Prediction with Sequential Mode Modeling | Zikang Zhou et.al. | 2411.11911 | null |
2024-11-18 | SignEye: Traffic Sign Interpretation from Vehicle First-Person View | Chuang Yang et.al. | 2411.11507 | null |
2024-11-18 | MGNiceNet: Unified Monocular Geometric Scene Understanding | Markus Schön et.al. | 2411.11466 | null |
2024-11-18 | The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather | Markus Schön et.al. | 2411.11455 | null |
2024-11-18 | DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Tianyi Yan et.al. | 2411.11252 | link |
2024-11-17 | Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and Propagation | Nayeon Kim et.al. | 2411.11002 | null |
2024-11-17 | V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception | Lei Yang et.al. | 2411.10962 | null |
2024-11-16 | Attention-based U-Net Method for Autonomous Lane Detection | Mohammadhamed Tangestanizadeh et.al. | 2411.10902 | null |
2024-11-16 | Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation | Jaisidh Singh et.al. | 2411.10845 | null |
2024-11-16 | MTA: Multimodal Task Alignment for BEV Perception and Captioning | Yunsheng Ma et.al. | 2411.10639 | null |
2024-11-15 | A Novel MLLM-based Approach for Autonomous Driving in Different Weather Conditions | Sonda Fourati et.al. | 2411.10603 | null |
2024-11-15 | Advancing Autonomous Driving Perception: Analysis of Sensor Fusion and Computer Vision Techniques | Urvishkumar Bharti et.al. | 2411.10535 | null |
2024-11-15 | Prompt-Guided Environmentally Consistent Adversarial Patch | Chaoqun Li et.al. | 2411.10498 | null |
2024-11-15 | Guided Learning: Lubricating End-to-End Modeling for Multi-stage Decision-making | Jian Guo et.al. | 2411.10496 | null |
2024-11-15 | Moving Forward: A Review of Autonomous Driving Software and Hardware Systems | Xu Wang et.al. | 2411.10291 | null |
2024-11-15 | Imagine-2-Drive: High-Fidelity World Modeling in CARLA for Autonomous Vehicles | Anant Garg et.al. | 2411.10171 | null |
2024-11-15 | Better Safe Than Sorry: Enhancing Arbitration Graphs for Safe and Robust Autonomous Decision-Making | Piotr Spieker et.al. | 2411.10170 | link |
2024-11-15 | Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving | Shota Yamazaki et.al. | 2411.09971 | null |
2024-12-29 | A Self-Supervised Robotic System for Autonomous Contact-Based Spatial Mapping of Semiconductor Properties | Alexander E. Siemenn et.al. | 2411.09892 | link |
2024-11-15 | Planning by Simulation: Motion Planning with Learning-based Parallel Scenario Prediction for Autonomous Driving | Tian Niu et.al. | 2411.09887 | null |
2024-11-14 | CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants | Albert J. Zhai et.al. | 2411.09693 | null |
2024-11-14 | Modular Fault Diagnosis Framework for Complex Autonomous Driving Systems | Stefan Orf et.al. | 2411.09643 | null |
2024-11-13 | Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance | Anton Kuznietsov et.al. | 2411.08482 | null |
2024-11-13 | 3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter | Xiaoxiang Wang et.al. | 2411.08433 | null |
2024-11-12 | Imitation Learning from Observations: An Autoregressive Mixture of Experts Approach | Renzi Wang et.al. | 2411.08232 | null |
2024-11-12 | ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction | Dubing Chen et.al. | 2411.07725 | link |
2024-11-12 | EMPERROR: A Flexible Generative Perception Error Model for Probing Self-Driving Planners | Niklas Hanselmann et.al. | 2411.07719 | null |
2024-11-27 | OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework | Jiaxi Li et.al. | 2411.07711 | link |
2024-11-16 | A Simple Multi-agent Joint Prediction Method for Autonomous Driving | Mingyi Wang et.al. | 2411.07612 | null |
2024-11-11 | SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation | Jiale Chen et.al. | 2411.06991 | null |
2024-12-30 | Large-scale moral machine experiment on large language models | Muhammad Shahrul Zaim bin Ahmad et.al. | 2411.06790 | link |
2024-11-11 | Model Partition and Resource Allocation for Split Learning in Vehicular Edge Networks | Lu Yu et.al. | 2411.06773 | null |
2024-11-11 | DP and QP Based Decision-making and Planning for Autonomous Vehicle | Zhicheng Zhang et.al. | 2411.06751 | null |
2024-11-09 | Predictability Awareness for Efficient and Robust Multi-Agent Coordination | Roman Chiva Gil et.al. | 2411.06223 | null |
2024-11-19 | LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Weijie Ma et.al. | 2411.06173 | link |
2024-11-08 | Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent | Linfeng He et.al. | 2411.05898 | null |
2024-11-08 | MIPD: A Multi-sensory Interactive Perception Dataset for Embodied Intelligent Driving | Zhiwei Li et.al. | 2411.05881 | link |
2024-11-06 | Federated Data-Driven Kalman Filtering for State Estimation | Nikos Piperigkos et.al. | 2411.05847 | null |
2024-11-08 | Expectation vs. Reality: Towards Verification of Psychological Games | Marta Kwiatkowska et.al. | 2411.05599 | null |
2024-11-08 | Open-set object detection: towards unified problem formulation and benchmarking | Hejer Ammar et.al. | 2411.05564 | null |
2024-11-08 | ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving | Tao Ma et.al. | 2411.05311 | null |
2024-11-08 | SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection | Yun Zhao et.al. | 2411.05292 | null |
2024-11-07 | Few-Shot Task Learning through Inverse Generative Modeling | Aviv Netanyahu et.al. | 2411.04987 | null |
2024-11-07 | IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving | Clémence Grislain et.al. | 2411.04653 | link |
2024-11-07 | LidaRefer: Outdoor 3D Visual Grounding for Autonomous Driving with Transformers | Yeong-Seung Baek et.al. | 2411.04351 | null |
2024-11-06 | Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving | Depanshu Sani et.al. | 2411.03702 | null |
2024-11-06 | OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2411.03696 | null |
2024-11-06 | Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model | Yansong Qu et.al. | 2411.03672 | null |
2024-11-06 | Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions | Arunkumar Rathinam et.al. | 2411.03576 | null |
2024-11-07 | Knowledge Graphs of Driving Scenes to Empower the Emerging Capabilities of Neurosymbolic AI | Ruwan Wickramarachchi et.al. | 2411.03225 | null |
2024-11-05 | Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM challenge | Bin Huang et.al. | 2411.02999 | null |
2024-11-08 | Region-Guided Attack on the Segment Anything Model (SAM) | Xiaoliang Liu et.al. | 2411.02974 | null |
2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
2024-11-05 | Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey | Ao Fu et.al. | 2411.02914 | null |
2024-11-05 | Safety Verification for Evasive Collision Avoidance in Autonomous Vehicles with Enhanced Resolutions | Aliasghar Arab et.al. | 2411.02706 | null |
2024-11-04 | AutoVFX: Physically Realistic Video Editing from Natural Language Instructions | Hao-Yu Hsu et.al. | 2411.02394 | null |
2024-11-04 | Learning Multiple Initial Solutions to Optimization Problems | Elad Sharony et.al. | 2411.02158 | link |
2024-11-04 | Traffic and Safety Rule Compliance of Humans in Diverse Driving Situations | Michael Kurenkov et.al. | 2411.01909 | null |
2024-11-08 | ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Salman Khan et.al. | 2411.01683 | link |
2024-11-03 | Polar R-CNN: End-to-End Lane Detection with Fewer Anchors | Shengqi Wang et.al. | 2411.01499 | link |
2024-11-03 | Interaction-Aware Trajectory Prediction for Safe Motion Planning in Autonomous Driving: A Transformer-Transfer Learning Approach | Jinhao Liang et.al. | 2411.01475 | null |
2024-11-28 | On the Black-box Explainability of Object Detection Models for Safe and Trustworthy Industrial Applications | Alain Andres et.al. | 2411.00818 | link |
2024-11-01 | HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices | Xiang Li et.al. | 2411.00608 | null |
2024-11-01 | On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR | Li Li et.al. | 2411.00600 | link |
2024-11-26 | MAROON: A Framework for the Joint Characterization of Near-Field High-Resolution Radar and Optical Depth Imaging Techniques | Vanessa Wirth et.al. | 2411.00527 | null |
2024-11-01 | PlanScope: Learning to Plan Within Decision Scope Does Matter | Ren Xin et.al. | 2411.00476 | link |
2024-11-01 | PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling | Donghyun Kim et.al. | 2411.00432 | null |
2024-10-31 | Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving | Ce Zhou et.al. | 2411.00192 | null |
2024-10-31 | AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization | Amir Kazemi et.al. | 2410.24116 | null |
2024-10-31 | Transformer-based Model Predictive Control: Trajectory Optimization via Sequence Modeling | Davide Celestini et.al. | 2410.23916 | null |
2024-10-31 | Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map | Xinyuan Chang et.al. | 2410.23780 | null |
2024-10-15 | Trajectory Prediction for Autonomous Driving using Agent-Interaction Graph Embedding | Jilan Samiuddin et.al. | 2410.23298 | null |
2024-10-30 | OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction | Hongbo Zhao et.al. | 2410.23278 | null |
2024-11-04 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et.al. | 2410.23262 | null |
2024-10-31 | Enhancing Autonomous Driving Safety Analysis with Generative AI: A Comparative Study on Automated Hazard and Risk Assessment | Alireza Abbaspour et.al. | 2410.23207 | null |
2024-11-04 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak et.al. | 2410.23085 | null |
2024-10-30 | YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems | Mujadded Al Rabbani Alif et.al. | 2410.22898 | null |
2024-10-30 | A Graph-Based Model for Vehicle-Centric Data Sharing Ecosystem | Haiyue Yuan et.al. | 2410.22897 | null |
2024-10-30 | Self-Driving Car Racing: Application of Deep Reinforcement Learning | Florentiana Yuwono et.al. | 2410.22766 | null |
2024-10-30 | SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving | Minh Tri Huynh et.al. | 2410.22752 | null |
2024-10-30 | Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets | Andoni Cortés et.al. | 2410.22748 | null |
2024-10-29 | Pre-Trained Vision Models as Perception Backbones for Safety Filters in Autonomous Driving | Yuxuan Yang et.al. | 2410.22585 | null |
2024-10-29 | An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion | Minghao Ning et.al. | 2410.22314 | link |
2024-10-29 | Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving | Bo Jiang et.al. | 2410.22313 | link |
2024-10-29 | EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments | Linus Nwankwo et.al. | 2410.22200 | null |
2024-12-12 | Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models | Imad Ali Shah et.al. | 2410.22101 | link |
2024-10-29 | Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms | Feifei Zhao et.al. | 2410.21882 | null |
2024-11-07 | SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset | Yubin Hu et.al. | 2410.21739 | null |
2024-10-28 | Trustworthiness of Stochastic Gradient Descent in Distributed Learning | Hongyang Li et.al. | 2410.21491 | null |
2024-10-28 | Efficient Mixture-of-Expert for Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving | Jiyao Wang et.al. | 2410.21086 | null |
2024-10-28 | BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment | Mehdi Hosseinzadeh et.al. | 2410.20969 | null |
2024-10-28 | Active Legibility in Multiagent Reinforcement Learning | Yanyu Liu et.al. | 2410.20954 | null |
2024-10-28 | SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity | Kunyun Wang et.al. | 2410.20790 | null |
2024-10-26 | Neural Fields in Robotics: A Survey | Muhammad Zubair Irshad et.al. | 2410.20220 | link |
2024-10-25 | Multi-view biomedical foundation models for molecule-target and property prediction | Parthasarathy Suryanarayanan et.al. | 2410.19704 | link |
2024-11-04 | Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving | Liu Yunhao et.al. | 2410.19639 | null |
2024-10-25 | Multi-modal Motion Prediction using Temporal Ensembling with Learning-based Aggregation | Kai-Yin Hong et.al. | 2410.19606 | null |
2024-10-30 | Large Spatial Model: End-to-end Unposed Images to Semantic 3D | Zhiwen Fan et.al. | 2410.18956 | link |
2024-10-24 | Learning Transparent Reward Models via Unsupervised Feature Selection | Daulet Baimukashev et.al. | 2410.18608 | null |
2024-10-24 | Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning | Lachlan Mares et.al. | 2410.18462 | null |
2024-11-19 | CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator | Stefanos Pasios et.al. | 2410.18238 | link |
2024-10-23 | WorldSimBench: Towards Video Generation Models as World Simulators | Yiran Qin et.al. | 2410.18072 | null |
2024-10-23 | Pointer: An Energy-Efficient ReRAM-based Point Cloud Recognition Accelerator with Inter-layer and Intra-layer Optimizations | Qijun Zhang et.al. | 2410.17782 | null |
2024-10-23 | YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions | Xiguang Li et.al. | 2410.17734 | null |
2024-10-23 | Real-time Vehicle-to-Vehicle Communication Based Network Cooperative Control System through Distributed Database and Multimodal Perception: Demonstrated in Crossroads | Xinwen Zhu et.al. | 2410.17576 | link |
2024-10-22 | Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning | Dongsu Lee et.al. | 2410.17373 | null |
2024-10-22 | YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion | Junzhou Chen et.al. | 2410.17144 | null |
2024-10-22 | Pedestrian motion prediction evaluation for urban autonomous driving | Dmytro Zabolotnii et.al. | 2410.16864 | link |
2024-10-22 | SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition | Jiaqi Chen et.al. | 2410.16746 | link |
2024-11-07 | Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance | Zhangwei Gao et.al. | 2410.16261 | link |
2024-10-21 | Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving | Alexander Krentsel et.al. | 2410.16227 | null |
2024-10-24 | LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation | Hao Gao et.al. | 2410.16197 | link |
2024-10-21 | Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles | Zhengming Wang et.al. | 2410.15912 | link |
2024-10-21 | How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making? | Zuojin Tang et.al. | 2410.15885 | null |
2024-10-27 | WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction | Heng Zhai et.al. | 2410.15792 | null |
2024-10-29 | Generalizing Motion Planners with Mixture of Experts for Autonomous Driving | Qiao Sun et.al. | 2410.15774 | link |
2024-10-23 | SPARC: Prediction-Based Safe Control for Coupled Controllable and Uncontrollable Agents with Conformal Predictions | Shuqi Wang et.al. | 2410.15660 | null |
2024-10-20 | XAI-based Feature Ensemble for Enhanced Anomaly Detection in Autonomous Driving Systems | Sazid Nazat et.al. | 2410.15405 | link |
2024-10-20 | Explainability of Point Cloud Neural Networks Using SMILE: Statistical Model-Agnostic Interpretability with Local Explanations | Seyed Mohammad Ahmadi et.al. | 2410.15374 | link |
2024-10-20 | A Novel Characterization of the Population Area Under the Risk Coverage Curve (AURC) and Rates of Finite Sample Estimators | Han Zhou et.al. | 2410.15361 | null |
2024-10-20 | Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Simulation, and Real-Vehicle Experiment | Can Cui et.al. | 2410.15281 | null |
2024-10-19 | 3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving | Linh Van Ma et.al. | 2410.14977 | link |
2024-10-19 | Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding | Yi Liu et.al. | 2410.14944 | link |
2024-10-18 | A Hybrid Defense Strategy for Boosting Adversarial Robustness in Vision-Language Models | Yuhan Liang et.al. | 2410.14911 | null |
2024-12-11 | MultiOrg: A Multi-rater Organoid-detection Dataset | Christina Bukas et.al. | 2410.14612 | null |
2024-11-04 | Knowledge Transfer from Simple to Complex: A Safe and Efficient Reinforcement Learning Framework for Autonomous Driving Decision-Making | Rongliang Zhou et.al. | 2410.14468 | null |
2024-10-18 | Learning autonomous driving from aerial imagery | Varun Murali et.al. | 2410.14177 | null |
2024-10-17 | UniDrive: Towards Universal Driving Perception Across Camera Configurations | Ye Li et.al. | 2410.13864 | link |
2024-11-22 | DepthSplat: Connecting Gaussian Splatting and Depth | Haofei Xu et.al. | 2410.13862 | link |
2024-10-17 | Artificial Kuramoto Oscillatory Neurons | Takeru Miyato et.al. | 2410.13821 | link |
2024-10-17 | Optimizing Probabilistic Conformal Prediction with Vectorized Non-Conformity Scores | Minxing Zheng et.al. | 2410.13735 | null |
2024-11-25 | DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | Guosheng Zhao et.al. | 2410.13571 | null |
2024-10-17 | Accurate Checkerboard Corner Detection under Defoucs | Zezhun Shi et.al. | 2410.13371 | link |
2024-10-16 | MambaBEV: An efficient 3D detection model with Mamba2 | Zihan You et.al. | 2410.12673 | null |
2024-10-16 | Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion | Minkyoung Cho et.al. | 2410.12592 | null |
2024-10-20 | Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving | Sihao Wu et.al. | 2410.12568 | null |
2024-10-16 | Real-time Stereo-based 3D Object Detection for Streaming Perception | Changcai Li et.al. | 2410.12394 | link |
2024-10-16 | Consistency Calibration: Improving Uncertainty Calibration via Consistency among Perturbed Neighbors | Linwei Tao et.al. | 2410.12295 | null |
2024-10-16 | Sparse Prototype Network for Explainable Pedestrian Behavior Prediction | Yan Feng et.al. | 2410.12195 | link |
2024-10-16 | RTI-NMPC for Control of Autonomous Vehicles Using Implicit Discretization Methods | Matheus Wagner et.al. | 2410.12170 | null |
2024-10-18 | Augmented Intelligence in Smart Intersections: Local Digital Twins-Assisted Hybrid Autonomous Driving | Kui Wang et.al. | 2410.12163 | null |
2024-10-15 | System-Level Analysis of Module Uncertainty Quantification in the Autonomy Pipeline | Sampada Deglurkar et.al. | 2410.12019 | null |
2024-10-15 | An Online Self-learning Graph-based Lateral Controller for Self-Driving Cars | Jilan Samiuddin et.al. | 2410.11979 | null |
2024-10-14 | Study on the Helpfulness of Explainable Artificial Intelligence | Tobias Labarta et.al. | 2410.11896 | link |
2024-10-15 | Generalizable Spacecraft Trajectory Generation via Multimodal Learning with Transformers | Davide Celestini et.al. | 2410.11723 | null |
2024-10-15 | A Data-Driven Aggressive Autonomous Racing Framework Utilizing Local Trajectory Planning with Velocity Prediction | Zhouheng Li et.al. | 2410.11570 | link |
2024-10-15 | TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Zhiwei Lin et.al. | 2410.11228 | link |
2024-10-14 | 6G RIS-aided Single-LEO Localization with Slow and Fast Doppler Effects | Sharief Saleh et.al. | 2410.11010 | null |
2024-10-14 | Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes | Tim Broedermann et.al. | 2410.10791 | link |
2024-10-14 | 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications | Eduardo R. Corral-Soto et.al. | 2410.10782 | null |
2024-10-14 | Towards Calibrated Losses for Adversarial Robust Reject Option Classification | Vrund Shah et.al. | 2410.10736 | link |
2024-10-14 | Navigation under uncertainty: Trajectory prediction and occlusion reasoning with switching dynamical systems | Ran Wei et.al. | 2410.10653 | null |
2024-10-14 | Words to Wheels: Vision-Based Autonomous Driving Understanding Human Language Instructions Using Foundation Models | Chanhoe Ryu et.al. | 2410.10577 | null |
2024-10-14 | Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Daniel Fusaro et.al. | 2410.10510 | link |
2024-10-14 | In-Materia Speech Recognition | Mohamadreza Zolfagharinejad et.al. | 2410.10434 | null |
2024-10-14 | DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model | Songen Gu et.al. | 2410.10429 | null |
2024-10-14 | ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object | Jiwei Chen et.al. | 2410.10298 | null |
2024-10-14 | Exploring Semi-Supervised Learning for Online Mapping | Adam Lilja et.al. | 2410.10279 | null |
2024-10-13 | Symmetry Discovery for Different Data Types | Lexiang Hu et.al. | 2410.09841 | null |
2024-10-13 | LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Md Tanvir Islam et.al. | 2410.09831 | link |
2024-11-21 | t-READi: Transformer-Powered Robust and Efficient Multimodal Inference for Autonomous Driving | Pengfei Hu et.al. | 2410.09747 | null |
2024-10-15 | LoRD: Adapting Differentiable Driving Policies to Distribution Shifts | Christopher Diehl et.al. | 2410.09681 | link |
2024-10-12 | RailYolact – A Yolact Focused on edge for Real-Time Rail Segmentation | Qihao Qian et.al. | 2410.09612 | null |
2024-10-11 | Hybrid LLM-DDQN based Joint Optimization of V2I Communication and Autonomous Driving | Zijiang Yan et.al. | 2410.08854 | null |
2024-10-11 | VideoSAM: Open-World Video Segmentation | Pinxue Guo et.al. | 2410.08781 | null |
2024-10-11 | MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation | Qihang Yang et.al. | 2410.08739 | null |
2024-10-11 | Impact of Surface Reflections in Maritime Obstacle Detection | Samed Yalçın et.al. | 2410.08713 | link |
2024-10-11 | AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion | Yuting Xie et.al. | 2410.08453 | null |
2024-10-10 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Samir Abou Haidar et.al. | 2410.08365 | null |
2024-10-10 | AdaShadow: Responsive Test-time Model Adaptation in Non-stationary Mobile Environments | Cheng Fang et.al. | 2410.08256 | null |
2024-10-10 | RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image | Xiaoxue Chen et.al. | 2410.08181 | null |
2024-10-11 | Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation | Qingwen Bu et.al. | 2410.08001 | null |
2024-10-10 | Offline Hierarchical Reinforcement Learning via Inverse Optimization | Carolin Schmidt et.al. | 2410.07933 | null |
2024-10-10 | Autonomous Vehicles Path Planning under Temporal Logic Specifications | Akshay Dhonthi et.al. | 2410.07845 | null |
2024-10-21 | HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Pei Liu et.al. | 2410.07758 | null |
2024-11-01 | Autonomous Driving in Unstructured Environments: How Far Have We Come? | Chen Min et.al. | 2410.07701 | link |
2024-10-10 | 3D Vision-Language Gaussian Splatting | Qucheng Peng et.al. | 2410.07577 | null |
2024-10-09 | Pockels Laser Directly Driving Ultrafast Optical Metrology | Shixin Xue et.al. | 2410.07482 | null |
2024-10-09 | Progressive Multi-Modal Fusion for Robust 3D Object Detection | Rohit Mohan et.al. | 2410.07475 | null |
2024-10-09 | Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions | Isaac Remy et.al. | 2410.07409 | null |
2024-10-09 | Learning Content-Aware Multi-Modal Joint Input Pruning via Bird’s-Eye-View Representation | Yuxin Li et.al. | 2410.07268 | null |
2024-09-23 | Curb Your Attention: Causal Attention Gating for Robust Trajectory Prediction in Autonomous Driving | Ehsan Ahmadi et.al. | 2410.07191 | null |
2024-09-22 | Margin-bounded Confidence Scores for Out-of-Distribution Detection | Lakpa D. Tamang et.al. | 2410.07185 | link |
2024-10-09 | QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird’s-Eye-View Representation | Yuxin Li et.al. | 2410.06516 | null |
2024-10-09 | Overcoming Autoware-Ubuntu Incompatibility in Autonomous Driving Systems-Equipped Vehicles: Lessons Learned | Dada Zhang et.al. | 2410.06492 | null |
2024-10-08 | BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis | Christopher Klammer et.al. | 2410.06410 | link |
2024-10-08 | Work-in-Progress: Traded Control Transfer for Managing Real-Time Sensor Uncertainties in Autonomous Vehicle | Md Sakib Galib Sourav et.al. | 2410.06345 | null |
2024-10-08 | A New Architecture for Neural Enhanced Multiobject Tracking | Shaoxiu Wei et.al. | 2410.06294 | null |
2024-10-08 | Gaussian-Based and Outside-the-Box Runtime Monitoring Join Forces | Vahid Hashemi et.al. | 2410.06051 | null |
2024-10-08 | Motion Forecasting in Continuous Driving | Nan Song et.al. | 2410.06007 | link |
2024-10-08 | DeMo: Decoupling Motion Forecasting into Directional Intentions and Dynamic States | Bozhou Zhang et.al. | 2410.05982 | link |
2024-10-08 | Distributed Coordination for Multi-Vehicle Systems in the Presence of Misbehaving Vehicles | Dongkun Han et.al. | 2410.05793 | null |
2024-10-08 | Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards | Zhaohui Jiang et.al. | 2410.05782 | null |
2024-10-08 | Towards Robust Spacecraft Trajectory Optimization via Transformers | Yuji Takubo et.al. | 2410.05585 | null |
2024-10-08 | Gen-Drive: Enhancing Diffusion Generative Driving Policies with Reward Modeling and Reinforcement Learning Fine-tuning | Zhiyu Huang et.al. | 2410.05582 | null |
2024-10-07 | Salient Store: Enabling Smart Storage for Continuous Learning Edge Servers | Cyan Subhra Mishra et.al. | 2410.05435 | null |
2024-10-07 | STOP! Camera Spoofing via the in-Vehicle IP Network | Dror Peri et.al. | 2410.05417 | null |
2024-10-07 | LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting | Qifeng Chen et.al. | 2410.05111 | null |
2024-10-07 | HE-Drive: Human-Like End-to-End Driving with Vision Language Models | Junming Wang et.al. | 2410.05051 | null |
2024-10-07 | PRFusion: Toward Effective and Robust Multi-Modal Place Recognition with Image and Point Cloud Fusion | Sijie Wang et.al. | 2410.04939 | link |
2024-10-07 | Data-driven Diffusion Models for Enhancing Safety in Autonomous Vehicle Traffic Simulations | Jinxiong Lu et.al. | 2410.04809 | null |
2024-10-07 | WTCL-Dehaze: Rethinking Real-world Image Dehazing via Wavelet Transform and Contrastive Learning | Divine Joseph Appiah et.al. | 2410.04762 | null |
2024-10-15 | Diffusion Models in 3D Vision: A Survey | Zhen Wang et.al. | 2410.04738 | null |
2024-10-10 | Unpacking Failure Modes of Generative Policies: Runtime Monitoring of Consistency and Progress | Christopher Agia et.al. | 2410.04640 | null |
2024-10-06 | In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding | Shenghao Li et.al. | 2410.04529 | null |
2024-10-19 | StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting | Xiao Cui et.al. | 2410.04354 | null |
2024-10-13 | Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models | Gang Li et.al. | 2410.03955 | link |
2024-11-01 | STONE: A Submodular Optimization Framework for Active 3D Object Detection | Ruiyu Mao et.al. | 2410.03918 | link |
2024-10-04 | A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development | Jesper Knapp et.al. | 2410.03580 | null |
2024-10-04 | Make Interval Bound Propagation great again | Patryk Krukowski et.al. | 2410.03373 | link |
2024-10-04 | MetaOOD: Automatic Selection of OOD Detection Models | Yuehan Qin et.al. | 2410.03074 | null |
2024-11-21 | LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning | Di Zhang et.al. | 2410.02884 | null |
2024-10-03 | Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents | Hanrong Zhang et.al. | 2410.02644 | link |
2024-10-03 | Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking | Fabian Herzog et.al. | 2410.02638 | link |
2024-10-03 | Behavior Trees in Functional Safety Supervisors for Autonomous Vehicles | Carlos Conejo et.al. | 2410.02469 | link |
2024-10-03 | End-to-end Driving in High-Interaction Traffic Scenarios with Reinforcement Learning | Yueyuan Li et.al. | 2410.02253 | null |
2024-10-03 | Remember and Recall: Associative-Memory-based Trajectory Prediction | Hang Guo et.al. | 2410.02201 | null |
2024-10-03 | Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation | Shreyas Chaudhari et.al. | 2410.02172 | link |
2024-10-28 | Neural Eulerian Scene Flow Fields | Kyle Vedder et.al. | 2410.02031 | null |
2024-10-02 | Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking | Ayesha Ishaq et.al. | 2410.01678 | link |
2024-10-07 | Entropy-Based Uncertainty Modeling for Trajectory Prediction in Autonomous Driving | Aron Distelzweig et.al. | 2410.01628 | null |
2024-10-07 | Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions | Ashutosh Kumar et.al. | 2410.01225 | link |
2024-10-02 | Absolute State-wise Constrained Policy Optimization: High-Probability State-wise Constraints Satisfaction | Weiye Zhao et.al. | 2410.01212 | null |
2024-10-02 | Generative Diffusion-based Contract Design for Efficient AI Twins Migration in Vehicular Embodied AI Networks | Yue Zhong et.al. | 2410.01176 | null |
2024-10-01 | High-directivity multi-level beam switching with single-gate tunable metasurfaces based on graphene | Juho Park et.al. | 2410.00806 | null |
2024-10-01 | E-MPC: Edge-assisted Model Predictive Control | Yuan-Yao Lou et.al. | 2410.00695 | null |
2024-10-01 | SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs | Leheng Li et.al. | 2410.00337 | null |
2024-10-01 | GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving | Zhangshuo Qi et.al. | 2410.00299 | link |
2024-09-30 | Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models | Yizhou Huang et.al. | 2409.20364 | null |
2024-10-01 | OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity | Junming Wang et.al. | 2409.19987 | null |
2024-11-20 | DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Zhen Yang et.al. | 2409.19972 | link |
2024-09-29 | Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving | Wei-Bin Kou et.al. | 2409.19560 | null |
2024-09-28 | Spatial Reasoning and Planning for Deep Embodied Agents | Shu Ishida et.al. | 2409.19479 | null |
2024-09-27 | PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation | Shaowei Liu et.al. | 2409.18964 | link |
2024-11-22 | MemFusionMap: Working Memory Fusion for Online Vectorized HD Map Construction | Jingyu Song et.al. | 2409.18737 | null |
2024-09-27 | Analysis of Truncated Singular Value Decomposition for Koopman Operator-Based Lane Change Model | Chinnawut Nantabut et.al. | 2409.18586 | null |
2024-09-27 | BoT-Drive: Hierarchical Behavior and Trajectory Planning for Autonomous Driving using POMDPs | Xuanjin Jin et.al. | 2409.18411 | null |
2024-09-27 | Multimodal Trajectory Prediction for Autonomous Driving on Unstructured Roads using Deep Convolutional Network | Lei Li et.al. | 2409.18399 | null |
2024-09-26 | Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving | Zhenghao Peng et.al. | 2409.18343 | null |
2024-09-26 | Does End-to-End Autonomous Driving Really Need Perception Tasks? | Peidong Li et.al. | 2409.18341 | link |
2024-09-30 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-11-03 | DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Dingrui Wang et.al. | 2409.18053 | link |
2024-09-26 | Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving | Haochen Liu et.al. | 2409.18031 | link |
2024-09-26 | ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning | Song Wang et.al. | 2409.18026 | null |
2024-09-26 | Adaptive Stream Processing on Edge Devices through Active Inference | Boris Sedlak et.al. | 2409.17937 | null |
2024-09-26 | PhantomLiDAR: Cross-modality Signal Injection Attacks against LiDAR | Zizhi Jin et.al. | 2409.17907 | null |
2024-09-27 | A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts | Aurel Pjetri et.al. | 2409.17851 | null |
2024-09-26 | CASPFormer: Trajectory Prediction from BEV Images with Deformable Attention | Harsh Yadav et.al. | 2409.17790 | null |
2024-09-26 | AlterMOMA: Fusion Redundancy Pruning for Camera-LiDAR Fusion Models with Alternative Modality Masking | Shiqi Sun et.al. | 2409.17728 | null |
2024-09-26 | Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning | Siyi Lu et.al. | 2409.17659 | null |
2024-10-06 | System-Level Safety Monitoring and Recovery for Perception Failures in Autonomous Vehicles | Kaustav Chakraborty et.al. | 2409.17630 | null |
2024-09-27 | Learning Occlusion-aware Decision-making from Agent Interaction via Active Perception | Jie Jia et.al. | 2409.17618 | null |
2024-09-26 | Joint Source-Channel Coding: Fundamentals and Recent Progress in Practical Designs | Deniz Gündüz et.al. | 2409.17557 | null |
2024-09-25 | Transient Adversarial 3D Projection Attacks on Object Detection in Autonomous Driving | Ce Zhou et.al. | 2409.17403 | null |
2024-09-25 | Optical Lens Attack on Deep Learning Based Monocular Depth Estimation | Ce Zhou et.al. | 2409.17376 | null |
2024-09-25 | Energy-Efficient & Real-Time Computer Vision with Intelligent Skipping via Reconfigurable CMOS Image Sensors | Md Abdullah-Al Kaiser et.al. | 2409.17341 | null |
2024-09-25 | VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection | Liangyu Zhong et.al. | 2409.17330 | null |
2024-09-25 | Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Handy Appetizer | Benji Peng et.al. | 2409.17120 | null |
2024-10-04 | Performance assessment of ADAS in a representative subset of critical traffic situations | Luigi Di Lillo et.al. | 2409.16942 | null |
2024-09-25 | Skyeyes: Ground Roaming using Aerial View Images | Zhiyuan Gao et.al. | 2409.16685 | null |
2024-09-26 | Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models | Alexander Popov et.al. | 2409.16663 | null |
2024-09-25 | Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed | Alexander Prutsch et.al. | 2409.16154 | link |
2024-10-14 | MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving | Xiyang Wang et.al. | 2409.16149 | link |
2024-09-30 | Unimotion: Unifying 3D Human Motion Synthesis and Understanding | Chuqiao Li et.al. | 2409.15904 | null |
2024-09-24 | FSF-Net: Enhance 4D Occupancy Forecasting with Coarse BEV Scene Flow for Autonomous Driving | Erxin Guo et.al. | 2409.15841 | null |
2024-09-24 | Intention-based and Risk-Aware Trajectory Prediction for Autonomous Driving in Complex Traffic Scenarios | Wen Wei et.al. | 2409.15821 | null |
2024-09-27 | Diffusion Models for Intelligent Transportation Systems: A Survey | Mingxing Peng et.al. | 2409.15816 | null |
2024-09-24 | A Computer Vision Approach for Autonomous Cars to Drive Safe at Construction Zone | Abu Shad Ahammed et.al. | 2409.15809 | null |
2024-09-24 | Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving | Lingyu Xiao et.al. | 2409.15730 | link |
2024-09-24 | Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB | Jae Yong Lee et.al. | 2409.15689 | null |
2024-09-23 | Analyzing Privacy Implications of Data Collection in Android Automotive OS | Bulut Gözübüyük et.al. | 2409.15561 | null |
2024-09-23 | VLMine: Long-Tail Data Mining with Vision Language Models | Mao Ye et.al. | 2409.15486 | null |
2024-09-07 | Causality-Driven Reinforcement Learning for Joint Communication and Sensing | Anik Roy et.al. | 2409.15329 | null |
2024-09-23 | Enhancing Pedestrian Trajectory Prediction with Crowd Trip Information | Rei Tamaru et.al. | 2409.15224 | link |
2024-09-25 | Goal-based Neural Physics Vehicle Trajectory Prediction Model | Rui Gan et.al. | 2409.15182 | null |
2024-09-23 | Controllable Traffic Simulation through LLM-Guided Hierarchical Chain-of-Thought Reasoning | Zhiyuan Liu et.al. | 2409.15135 | null |
2024-09-23 | SPformer: A Transformer Based DRL Decision Making Method for Connected Automated Vehicles | Ye Han et.al. | 2409.15105 | null |
2024-09-23 | Online Adaptation of Learned Vehicle Dynamics Model with Meta-Learning Approach | Yuki Tsuchiya et.al. | 2409.14950 | null |
2024-11-17 | Adverse Weather-Immune Semantic Segmentation with Unfolded Regularization and Foundation Model Knowledge Distillation for Autonomous Driving | Wei-Bin Kou et.al. | 2409.14737 | null |
2024-09-23 | A Generalized Control Revision Method for Autonomous Driving Safety | Zehang Zhu et.al. | 2409.14688 | null |
2024-09-23 | S2O: An Integrated Driving Decision-making Performance Evaluation Method Bridging Subjective Feeling to Objective Evaluation | Yuning Wang et.al. | 2409.14680 | null |
2024-09-24 | First Field Trial of LLM-Powered AI Agent for Lifecycle Management of Autonomous Driving Optical Networks | Xiaomin Liu et.al. | 2409.14605 | null |
2024-09-22 | Enhancing LLM-based Autonomous Driving Agents to Mitigate Perception Attacks | Ruoyu Song et.al. | 2409.14488 | null |
2024-09-21 | LFP: Efficient and Accurate End-to-End Lane-Level Planning via Camera-LiDAR Fusion | Guoliang You et.al. | 2409.14170 | null |
2024-09-24 | Will Large Language Models be a Panacea to Autonomous Driving? | Yuxuan Zhu et.al. | 2409.14165 | null |
2024-09-21 | Integrated Decision Making and Trajectory Planning for Autonomous Driving Under Multimodal Uncertainties: A Bayesian Game Approach | Zhenmin Huang et.al. | 2409.13993 | null |
2024-09-20 | OneBEV: Using One Panoramic Image for Bird’s-Eye-View Semantic Mapping | Jiale Wei et.al. | 2409.13912 | link |
2024-09-20 | Efficient Domain Augmentation for Autonomous Driving Testing Using Diffusion Models | Luciano Baresi et.al. | 2409.13661 | null |
2024-09-20 | Autonomous Driving at Unsignalized Intersections: A Review of Decision-Making Challenges and Reinforcement Learning-Based Solutions | Mohammad Al-Sharman et.al. | 2409.13144 | null |
2024-09-22 | Exploiting Minority Pseudo-Labels for Semi-Supervised Semantic Segmentation in Autonomous Driving | Yuting Hong et.al. | 2409.12680 | null |
2024-09-19 | METDrive: Multi-modal End-to-end Autonomous Driving with Temporal Guidance | Ziang Guo et.al. | 2409.12667 | null |
2024-09-23 | Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving | Sándor Kunsági-Máté et.al. | 2409.12620 | link |
2024-09-19 | LLMs Can Check Their Own Results to Mitigate Hallucinations in Traffic Understanding Tasks | Malsha Ashani Mahawatta Dona et.al. | 2409.12580 | null |
2024-09-19 | LMT-Net: Lane Model Transformer Network for Automated HD Mapping from Sparse Vehicle Observations | Michael Mink et.al. | 2409.12409 | null |
2024-09-18 | The Finer Points: A Systematic Comparison of Point-Cloud Extractors for Radar Odometry | Elliot Preston-Krebs et.al. | 2409.12256 | null |
2024-10-14 | ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Han Ling et.al. | 2409.12202 | link |
2024-09-18 | Unveiling the Black Box: Independent Functional Module Evaluation for Bird’s-Eye-View Perception Model | Ludan Zhang et.al. | 2409.11969 | null |
2024-09-18 | RoboMorph: In-Context Meta-Learning for Robot Dynamics Modeling | Manuel Bianchi Bazzi et.al. | 2409.11815 | null |
2024-09-18 | Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic | Zhe Yu et.al. | 2409.11780 | null |
2024-09-18 | RopeBEV: A Multi-Camera Roadside Perception Network in Bird’s-Eye-View | Jinrang Jia et.al. | 2409.11706 | null |
2024-09-18 | From Words to Wheels: Automated Style-Customized Policy Generation for Autonomous Driving | Xu Han et.al. | 2409.11694 | null |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-17 | TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem | M. Esat Kalfaoglu et.al. | 2409.11325 | null |
2024-09-18 | High-Order Evolving Graphs for Enhanced Representation of Traffic Dynamics | Aditya Humnabadkar et.al. | 2409.11206 | null |
2024-09-17 | Optimization of Rulebooks via Asymptotically Representing Lexicographic Hierarchies for Autonomous Vehicles | Matteo Penlington et.al. | 2409.11199 | null |
2024-09-16 | Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving | Yunsheng Ma et.al. | 2409.11182 | null |
2024-09-18 | Annealed Winner-Takes-All for Motion Forecasting | Yihong Xu et.al. | 2409.11172 | link |
2024-09-17 | UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height | Zichen Yu et.al. | 2409.11160 | null |
2024-09-17 | Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | Rui Yu et.al. | 2409.11018 | null |
2024-09-17 | TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection | Philip Jacobson et.al. | 2409.10901 | null |
2024-09-20 | CoMamba: Real-time Cooperative Perception Unlocked with State Space Models | Jinlong Li et.al. | 2409.10699 | null |
2024-09-16 | Realistic Extreme Behavior Generation for Improved AV Testing | Robert Dyro et.al. | 2409.10669 | null |
2024-09-02 | An Examination of Offline-Trained Encoders in Vision-Based Deep Reinforcement Learning for Autonomous Driving | Shawan Mohammed et.al. | 2409.10554 | null |
2024-08-30 | 3CSim: CARLA Corner Case Simulation for Control Assessment in Autonomous Driving | Matúš Čávojský et.al. | 2409.10524 | null |
2024-09-16 | Radar Teach and Repeat: Architecture and Initial Field Testing | Xinyuan Qiao et.al. | 2409.10491 | link |
2024-09-16 | XLM for Autonomous Driving Systems: A Comprehensive Review | Sonda Fourati et.al. | 2409.10484 | null |
2024-09-16 | DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving | Songning Lai et.al. | 2409.10330 | null |
2024-09-16 | SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation | Benjamin Stoler et.al. | 2409.10320 | link |
2024-09-16 | Robust Bird’s Eye View Segmentation by Adapting DINOv2 | Merve Rabia Barın et.al. | 2409.10228 | null |
2024-09-16 | ExelMap: Explainable Element-based HD-Map Change Detection and Update | Lena Wild et.al. | 2409.10178 | null |
2024-09-16 | Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles | Mais Jamal et.al. | 2409.10165 | null |
2024-09-16 | Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | Huy-Dung Nguyen et.al. | 2409.10095 | null |
2024-09-16 | LeGEND: A Top-Down Approach to Scenario Generation of Autonomous Driving Systems Assisted by Large Language Models | Shuncheng Tang et.al. | 2409.10066 | link |
2024-09-17 | GlobalMapNet: An Online Framework for Vectorized Global HD Map Construction | Anqi Shi et.al. | 2409.10063 | null |
2024-09-15 | Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation | Zhanteng Xie et.al. | 2409.09899 | null |
2024-09-15 | A Comprehensive Survey of PID and Pure Pursuit Control Algorithms for Autonomous Vehicle Navigation | Harshit Jain et.al. | 2409.09848 | null |
2024-09-15 | DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving | Haisheng Su et.al. | 2409.09777 | link |
2024-09-15 | Risk-Aware Autonomous Driving for Linear Temporal Logic Specifications | Shuhao Qi et.al. | 2409.09769 | null |
2024-09-14 | Lab2Car: A Versatile Wrapper for Deploying Experimental Planners in Complex Real-world Environments | Marc Heim et.al. | 2409.09523 | null |
2024-09-14 | A Data-Informed Analysis of Scalable Supervision for Safety in Autonomous Vehicle Fleets | Cameron Hickert et.al. | 2409.09500 | null |
2024-09-14 | MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction | Yan Feng et.al. | 2409.09446 | link |
2024-10-31 | OPUS: Occupancy Prediction Using a Sparse Set | Jiabao Wang et.al. | 2409.09350 | link |
2024-09-11 | Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU | Zhenyu Ning et.al. | 2409.09086 | null |
2024-08-29 | Semantic Communication for Cooperative Perception using HARQ | Yucheng Sheng et.al. | 2409.09042 | null |
2024-10-16 | Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation | Qingwen Bu et.al. | 2409.09016 | link |
2024-09-13 | Causal Transformer for Fusion and Pose Estimation in Deep Visual Inertial Odometry | Yunus Bilge Kurt et.al. | 2409.08769 | link |
2024-09-13 | GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction | Siyu Li et.al. | 2409.08688 | link |
2024-09-13 | The Design of Informative Take-Over Requests for Semi-Autonomous Cyber-Physical Systems: Combining Spoken Language and Visual Icons in a Drone-Controller Setting | Ashwini Gundappa et.al. | 2409.08253 | null |
2024-09-12 | The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | André F. R. Guarda et.al. | 2409.08130 | null |
2024-09-12 | SoVAR: Building Generalizable Scenarios from Accident Reports for Autonomous Driving Testing | An Guo et.al. | 2409.08081 | link |
2024-10-18 | LED: Light Enhanced Depth Estimation at Night | Simon de Moreau et.al. | 2409.08031 | link |
2024-09-12 | Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes | Ming Li et.al. | 2409.07843 | null |
2024-09-12 | ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable | Yuan Yin et.al. | 2409.07830 | link |
2024-09-12 | GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions | Liang Feng et.al. | 2409.07798 | null |
2024-09-13 | ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation | Shiwei Feng et.al. | 2409.07774 | link |
2024-09-12 | GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution | Liang Feng et.al. | 2409.07752 | null |
2024-09-12 | Attack End-to-End Autonomous Driving through Module-Wise Noise | Lu Wang et.al. | 2409.07706 | null |
2024-09-21 | A Comprehensive Survey on Inverse Constrained Reinforcement Learning: Definitions, Progress and Challenges | Guiliang Liu et.al. | 2409.07569 | link |
2024-09-11 | Unsupervised Point Cloud Registration with Self-Distillation | Christian Löwens et.al. | 2409.07558 | link |
2024-09-11 | Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving | Tianyuan Zhang et.al. | 2409.07321 | null |
2024-09-25 | MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving | Enming Zhang et.al. | 2409.07267 | link |
2024-09-11 | Behavioral Cloning Models Reality Check for Autonomous Driving | Mustafa Yildirim et.al. | 2409.07218 | null |
2024-09-10 | Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds | Mu Cai et.al. | 2409.06827 | link |
2024-09-10 | Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving | Kairui Ding et.al. | 2409.06702 | null |
2024-09-10 | Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception | Xiang Zhang et.al. | 2409.06584 | null |
2024-09-10 | Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement | Yang Wen et.al. | 2409.06334 | null |
2024-09-10 | UdeerLID+: Integrating LiDAR, Image, and Relative Depth with Semi-Supervised | Tao Ni et.al. | 2409.06197 | null |
2024-09-11 | MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control | Yining Yao et.al. | 2409.06189 | null |
2024-09-09 | Promptable Closed-loop Traffic Simulation | Shuhan Tan et.al. | 2409.05863 | null |
2024-09-09 | Vision-Driven 2D Supervised Fine-Tuning Framework for Bird’s Eye View Perception | Lei He et.al. | 2409.05834 | null |
2024-09-09 | Replay Consolidation with Label Propagation for Continual Object Detection | Riccardo De Monte et.al. | 2409.05650 | null |
2024-09-12 | DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation | Wei Wu et.al. | 2409.05463 | null |
2024-09-11 | Distribution Discrepancy and Feature Heterogeneity for Active 3D Object Detection | Huang-Yu Chen et.al. | 2409.05425 | link |
2024-09-09 | ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions | Furqan Ahmed Shaik et.al. | 2409.05327 | null |
2024-10-22 | Developing Path Planning with Behavioral Cloning and Proximal Policy Optimization for Path-Tracking and Static Obstacle Nudging | Mingyan Zhou et.al. | 2409.05289 | link |
2024-09-08 | Enhancing the Performance of Multi-Vehicle Navigation in Unstructured Environments using Hard Sample Mining | Yining Ma et.al. | 2409.05119 | link |
2024-09-08 | RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network | Zhiwei Lin et.al. | 2409.04979 | null |
2024-09-07 | A Comprehensive Survey on Evidential Deep Learning and Its Applications | Junyu Gao et.al. | 2409.04720 | link |
2024-09-06 | Multi-scale Feature Fusion with Point Pyramid for 3D Object Detection | Weihao Lu et.al. | 2409.04601 | null |
2024-09-06 | Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences | Rui Yu et.al. | 2409.04390 | null |
2024-09-06 | Safe and Efficient Path Planning under Uncertainty via Deep Collision Probability Fields | Felix Herrmann et.al. | 2409.04306 | null |
2024-09-06 | Secure Traffic Sign Recognition: An Attention-Enabled Universal Image Inpainting Mechanism against Light Patch Attacks | Hangcheng Cao et.al. | 2409.04133 | null |
2024-09-06 | Towards Energy-Efficiency by Navigating the Trilemma of Energy, Latency, and Accuracy | Boyuan Tian et.al. | 2409.04018 | null |
2024-09-05 | Multi-agent Path Finding for Mixed Autonomy Traffic Coordination | Han Zheng et.al. | 2409.03881 | null |
2024-09-05 | Prediction Accuracy & Reliability: Classification and Object Localization under Distribution Shift | Fabian Diet et.al. | 2409.03543 | null |
2024-09-05 | Neural HD Map Generation from Multiple Vectorized Tiles Locally Produced by Autonomous Vehicles | Miao Fan et.al. | 2409.03445 | null |
2024-09-05 | YOLO-PPA based Efficient Traffic Sign Detection for Cruise Control in Autonomous Driving | Jingyu Zhang et.al. | 2409.03320 | null |
2024-09-05 | OccLLaMA: An Occupancy-Language-Action Generative World Model for Autonomous Driving | Julong Wei et.al. | 2409.03272 | null |
2024-09-05 | Multiple weather images restoration using the task transformer and adaptive mixup strategy | Yang Wen et.al. | 2409.03249 | null |
2024-09-05 | Autonomous Drifting Based on Maximal Safety Probability Learning | Hikaru Hoshino et.al. | 2409.03160 | link |
2024-09-04 | Developing, Analyzing, and Evaluating Self-Drive Algorithms Using Drive-by-Wire Electric Vehicles | Beñat Froemming-Aldanondo et.al. | 2409.03114 | link |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving | Yuhang Lu et.al. | 2409.02914 | null |
2024-09-08 | GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling | Huawei Sun et.al. | 2409.02720 | link |
2024-09-04 | Improved Single Camera BEV Perception Using Multi-Camera Training | Daniel Busch et.al. | 2409.02676 | null |
2024-09-04 | Want a Ride? Attitudes Towards Autonomous Driving and Behavior in Autonomous Vehicles | Enrico Del Re et.al. | 2409.02556 | null |
2024-09-04 | TLD: A Vehicle Tail Light signal Dataset and Benchmark | Jinhao Chai et.al. | 2409.02508 | null |
2024-09-04 | eRSS-RAMP: A Rule-Adherence Motion Planner Based on Extended Responsibility-Sensitive Safety for Autonomous Driving | Pengfei Lin et.al. | 2409.02503 | null |
2024-09-04 | A Learnable Color Correction Matrix for RAW Reconstruction | Anqi Liu et.al. | 2409.02497 | null |
2024-10-09 | TASAR: Transfer-based Attack on Skeletal Action Recognition | Yunfeng Diao et.al. | 2409.02483 | link |
2024-09-04 | Volumetric Surfaces: Representing Fuzzy Geometries with Multiple Meshes | Stefano Esposito et.al. | 2409.02482 | null |
2024-09-04 | Local map Construction Methods with SD map: A Novel Survey | Jiaqi Li et.al. | 2409.02415 | null |
2024-09-04 | GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving | Huasong Han et.al. | 2409.02382 | null |
2024-09-03 | AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions | Chenghao Qian et.al. | 2409.02045 | link |
2024-09-03 | Snapshot: Towards Application-centered Models for Pedestrian Trajectory Prediction in Urban Traffic Environments | Nico Uhlemann et.al. | 2409.01971 | link |
2024-09-03 | DiVE: DiT-based Video Generation with Enhanced Control | Junpeng Jiang et.al. | 2409.01595 | null |
2024-09-03 | Situation-aware Autonomous Driving Decision Making with Cooperative Perception on Demand | Wei Liu et.al. | 2409.01504 | null |
2024-09-07 | EarthGen: Generating the World from Top-Down Views | Ansh Sharma et.al. | 2409.01491 | link |
2024-09-02 | Mutual Benefit: The Case for Sharing Autonomous Vehicle Data with the Public | David Goedicke et.al. | 2409.01342 | null |
2024-09-02 | An Investigation of Denial of Service Attacks on Autonomous Driving Software and Hardware in Operation | Tillmann Stübler et.al. | 2409.01324 | null |
2024-09-02 | Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling | Haicheng Liao et.al. | 2409.01256 | null |
2024-10-04 | CyberCortex.AI: An AI-based Operating System for Autonomous Robotics and Complex Automation | Sorin Grigorescu et.al. | 2409.01241 | null |
2024-09-02 | Integrating End-to-End and Modular Driving Approaches for Online Corner Case Detection in Autonomous Driving | Gemb Kaljavesi et.al. | 2409.01178 | null |
2024-09-02 | From Bird’s-Eye to Street View: Crafting Diverse and Condition-Aligned Images with Latent Diffusion Model | Xiaojie Xu et.al. | 2409.01014 | null |
2024-09-02 | Development of Occupancy Prediction Algorithm for Underground Parking Lots | Shijie Wang et.al. | 2409.00923 | null |
2024-09-02 | Multi-scale Temporal Fusion Transformer for Incomplete Vehicle Trajectory Prediction | Zhanwen Liu et.al. | 2409.00904 | null |
2024-09-05 | Trustworthy Human-AI Collaboration: Reinforcement Learning with Human Feedback and Physics Knowledge for Safe Autonomous Driving | Zilin Huang et.al. | 2409.00858 | link |
2024-09-01 | Image-to-Lidar Relational Distillation for Autonomous Driving Data | Anas Mahmoud et.al. | 2409.00845 | null |
2024-09-01 | Study of Dropout in PointPillars with 3D Object Detection | Xiaoxiang Sun et.al. | 2409.00673 | null |
2024-09-01 | Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression | Dingyuan Zhang et.al. | 2409.00633 | link |
2024-09-01 | Enhancing Vectorized Map Perception with Historical Rasterized Maps | Xiaoyu Zhang et.al. | 2409.00620 | link |
2024-09-01 | Online Temporal Fusion for Vectorized Map Construction in Mapless Autonomous Driving | Jiagang Chen et.al. | 2409.00593 | null |
2024-08-31 | Online Learning of Interaction Dynamics with Dual Model Predictive Control for Multi-Agent Systems Using Gaussian Processes | T. M. J. T. Baltussen et.al. | 2409.00432 | null |
2024-08-30 | ContextVLM: Zero-Shot and Few-Shot Context Understanding for Autonomous Driving using Vision Language Models | Shounak Sural et.al. | 2409.00301 | null |
2024-09-17 | RING#: PR-by-PE Global Localization with Roto-translation Equivariant Gram Learning | Sha Lu et.al. | 2409.00206 | null |
2024-08-19 | No Need to Sacrifice Data Quality for Quantity: Crowd-Informed Machine Annotation for Cost-Effective Understanding of Visual Data | Christopher Klugmann et.al. | 2409.00048 | null |
2024-08-30 | Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations | Ahmed Hammam et.al. | 2408.17311 | null |
2024-08-30 | How Could Generative AI Support Compliance with the EU AI Act? A Review for Safe Automated Driving Perception | Mert Keser et.al. | 2408.17222 | null |
2024-08-30 | NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar | Runwei Guan et.al. | 2408.17207 | null |
2024-08-30 | UTrack: Multi-Object Tracking with Uncertain Detections | Edgardo Solano-Carrillo et.al. | 2408.17098 | link |
2024-08-30 | PIB: Prioritized Information Bottleneck Framework for Collaborative Edge Video Analytics | Zhengru Fang et.al. | 2408.17047 | link |
2024-08-30 | Transient Fault Tolerant Semantic Segmentation for Autonomous Driving | Leonardo Iurada et.al. | 2408.16952 | link |
2024-08-29 | OmniRe: Omni Urban Scene Reconstruction | Ziyu Chen et.al. | 2408.16760 | null |
2024-08-29 | DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Yongjie Fu et.al. | 2408.16647 | null |
2024-08-28 | A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions | Yu Wang et.al. | 2408.16530 | link |
2024-08-29 | CooTest: An Automated Testing Approach for V2X Communication Systems | An Guo et.al. | 2408.16470 | link |
2024-09-12 | BEVal: A Cross-dataset Evaluation Study of BEV Segmentation Models for Autonomous Driving | Manuel Alejandro Diaz-Zapata et.al. | 2408.16322 | link |
2024-08-29 | PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird’s-Eye-View | Zichen Yu et.al. | 2408.16200 | link |
2024-08-28 | GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model | Yongjie Fu et.al. | 2408.15868 | null |
2024-08-28 | Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph | Zherong Zhang et.al. | 2408.15750 | null |
2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et.al. | 2408.15657 | link |
2024-09-25 | RoboSense: Large-scale Dataset and Benchmark for Multi-sensor Low-speed Autonomous Driving | Haisheng Su et.al. | 2408.15503 | link |
2024-08-27 | Panoptic Perception for Autonomous Driving: A Survey | Yunge Li et.al. | 2408.15388 | null |
2024-08-27 | Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Saining Zhang et.al. | 2408.15242 | link |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-10-04 | T-FAKE: Synthesizing Thermal Images for Facial Landmarking | Philipp Flotho et.al. | 2408.15127 | link |
2024-08-27 | Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack | Naufal Suryanto et.al. | 2408.14879 | link |
2024-10-12 | Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving | Yu Yang et.al. | 2408.14197 | null |
2024-08-26 | EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection | Pengyu Li et.al. | 2408.14189 | null |
2024-08-26 | Quantitative Representation of Scenario Difficulty for Autonomous Driving Based on Adversarial Policy Search | Shuo Yang et.al. | 2408.14000 | null |
2024-08-26 | FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation | Daixun Li et.al. | 2408.13980 | null |
2024-08-25 | Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems | Mohammad Hossein Amini et.al. | 2408.13950 | null |
2024-08-25 | TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training | Li Li et.al. | 2408.13902 | null |
2024-08-25 | Making Large Language Models Better Planners with Reasoning-Decision Alignment | Zhijian Huang et.al. | 2408.13890 | null |
2024-08-25 | Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation | Yuwen Pan et.al. | 2408.13838 | null |
2024-08-25 | TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Xiongwei Zhao et.al. | 2408.13802 | link |
2024-08-25 | CV-MOS: A Cross-View Model for Motion Segmentation | Xiaoyu Tang et.al. | 2408.13790 | link |
2024-08-28 | Multi-modal Integrated Prediction and Decision-making with Adaptive Interaction Modality Explorations | Tong Li et.al. | 2408.13742 | link |
2024-08-24 | Perception-Guided Fuzzing for Simulated Scenario-Based Testing of Autonomous Driving Systems | Tri Minh Triet Pham et.al. | 2408.13686 | null |
2024-08-24 | Evaluating the Robustness of LiDAR-based 3D Obstacles Detection and Its Impacts on Autonomous Driving Systems | Tri Minh Triet Pham et.al. | 2408.13653 | null |
2024-08-24 | CSS-Segment: 2nd Place Report of LSVOS Challenge VOS Track | Jinming Chai et.al. | 2408.13582 | null |
2024-08-24 | AdaOcc: Adaptive-Resolution Occupancy Prediction | Chao Chen et.al. | 2408.13454 | null |
2024-08-23 | General Intelligent Imaging and Uncertainty Quantification by Deterministic Diffusion Model | Weiru Fan et.al. | 2408.13061 | null |
2024-08-23 | Courteous MPC for Autonomous Driving with CBF-inspired Risk Assessment | Yanze Zhang et.al. | 2408.12822 | null |
2024-08-23 | A Safe Self-evolution Algorithm for Autonomous Driving Based on Data-Driven Risk Quantification Model | Shuo Yang et.al. | 2408.12805 | null |
2024-08-22 | Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection | Ruixiao Zhang et.al. | 2408.12708 | null |
2024-09-01 | Can LLMs Understand Social Norms in Autonomous Driving Games? | Boxuan Wang et.al. | 2408.12680 | null |
2024-08-22 | Multimodal Foundational Models for Unsupervised 3D General Obstacle Detection | Tamás Matuszka et.al. | 2408.12322 | null |
2024-08-22 | A Safety-Oriented Self-Learning Algorithm for Autonomous Driving: Evolution Starting from a Basic Model | Shuo Yang et.al. | 2408.12190 | null |
2024-08-22 | A Safe and Efficient Self-evolving Algorithm for Decision-making and Control of Autonomous Driving Systems | Shuo Yang et.al. | 2408.12187 | null |
2024-08-22 | Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation | Woo Kyung Kim et.al. | 2408.12110 | null |
2024-08-22 | Enhancing Sampling Protocol for Robust Point Cloud Classification | Chongshou Li et.al. | 2408.12062 | null |
2024-08-21 | MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering | Yonglin Tian et.al. | 2408.11464 | null |
2024-08-20 | Enhancing End-to-End Autonomous Driving Systems Through Synchronized Human Behavior Data | Yiqun Duan et.al. | 2408.10908 | null |
2024-08-20 | Open 3D World in Autonomous Driving | Xinlong Cheng et.al. | 2408.10880 | null |
2024-08-19 | CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving | Hidehisa Arai et.al. | 2408.10845 | null |
2024-08-20 | Privacy-preserving Universal Adversarial Defense for Black-box Models | Qiao Li et.al. | 2408.10647 | null |
2024-08-20 | Fast Grid Emissions Sensitivities using Parallel Decentralized Implicit Differentiation | Anthony Degleris et.al. | 2408.10620 | null |
2024-08-20 | MV-MOS: Multi-View Feature Fusion for 3D Moving Object Segmentation | Jintao Cheng et.al. | 2408.10602 | link |
2024-08-20 | Constrained Behavior Cloning for Robotic Learning | Wensheng Liang et.al. | 2408.10568 | null |
2024-08-20 | Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception | Jiaru Zhong et.al. | 2408.10531 | null |
2024-09-25 | System-Level Design Space Exploration for High-Level Synthesis under End-to-End Latency Constraints | Yuchao Liao et.al. | 2408.10431 | null |
2024-08-16 | Diffusion Model for Planning: A Systematic Literature Review | Toshihide Ubukata et.al. | 2408.10266 | null |
2024-07-22 | Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications | Sinan Ibrahim et.al. | 2408.10215 | null |
2024-08-19 | Edge-Cloud Collaborative Motion Planning for Autonomous Driving with Large Language Models | Jiao Chen et.al. | 2408.09972 | null |
2024-10-01 | Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Jun Yan et.al. | 2408.09839 | link |
2024-08-19 | Automated Vehicle Driver Monitoring Dataset from Real-World Scenarios | Mohamed Sabry et.al. | 2408.09833 | null |
2024-08-19 | Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM | Sanghyun Hahn et.al. | 2408.09727 | link |
2024-08-19 | Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey | Ruiqi Zhang et.al. | 2408.09675 | link |
2024-08-18 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Muhammad Rameez Ur Rahman et.al. | 2408.09424 | link |
2024-08-17 | Reinforcement Learning Compensated Model Predictive Control for Off-road Driving on Unknown Deformable Terrain | Prakhar Gupta et.al. | 2408.09253 | null |
2024-09-16 | V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models | Junwei You et.al. | 2408.09251 | null |
2024-08-17 | MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation | Xiao Zhao et.al. | 2408.09122 | null |
2024-08-17 | LOID: Lane Occlusion Inpainting and Detection for Enhanced Autonomous Driving Systems | Aayush Agrawal et.al. | 2408.09117 | null |
2024-08-17 | HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction | Xiao Zhao et.al. | 2408.09104 | null |
2024-08-15 | A Survey of Trojan Attacks and Defenses to Deep Neural Networks | Lingxin Jin et.al. | 2408.08920 | null |
2024-08-20 | PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors | Rongxuan Wang et.al. | 2408.08802 | null |
2024-08-16 | A Transparency Paradox? Investigating the Impact of Explanation Specificity and Autonomous Vehicle Perceptual Inaccuracies on Passengers | Daniel Omeiza et.al. | 2408.08785 | null |
2024-08-16 | S-RAF: A Simulation-Based Robustness Assessment Framework for Responsible Autonomous Driving | Daniel Omeiza et.al. | 2408.08584 | link |
2024-08-16 | CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving | Shihan Peng et.al. | 2408.08500 | null |
2024-08-15 | A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts | Zhihao Lin et.al. | 2408.08242 | null |
2024-08-15 | Learned Multimodal Compression for Autonomous Driving | Hadi Hadizadeh et.al. | 2408.08211 | null |
2024-08-15 | Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Wenxuan Li et.al. | 2408.07999 | link |
2024-08-14 | Sum-of-Squares inspired Quantum Metaheuristic for Polynomial Optimization with the Hadamard Test and Approximate Amplitude Constraints | Iria W. Wang et.al. | 2408.07774 | null |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-14 | LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image | Fan Yang et.al. | 2408.07422 | null |
2024-08-17 | Risk Occupancy: A New and Efficient Paradigm through Vehicle-Road-Cloud Collaboration | Jiaxing Chen et.al. | 2408.07367 | null |
2024-08-13 | FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving | Yutao Zhu et.al. | 2408.06832 | null |
2024-08-13 | Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions | Miao Zhang et.al. | 2408.06772 | null |
2024-08-13 | A lightweight YOLOv5-FFM model for occlusion pedestrian detection | Xiangjie Luo et.al. | 2408.06633 | null |
2024-08-12 | IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI | Yash Rampuria et.al. | 2408.06113 | null |
2024-08-22 | Text2Interaction: Establishing Safe and Preferable Human-Robot Interaction | Jakob Thumm et.al. | 2408.06105 | link |
2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | null |
2024-08-12 | Adapting a Foundation Model for Space-based Tasks | Matthew Foutter et.al. | 2408.05924 | null |
2024-08-12 | Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network | Kailai Sun et.al. | 2408.05877 | null |
2024-08-11 | ICSFuzz: Collision Detector Bug Discovery in Autonomous Driving Simulators | Weiwei Fu et.al. | 2408.05694 | null |
2024-08-10 | What Matters in Autonomous Driving Anomaly Detection: A Weakly Supervised Horizon | Utkarsh Tiwari et.al. | 2408.05562 | link |
2024-08-15 | DeepInteraction++: Multi-Modality Interaction for Autonomous Driving | Zeyu Yang et.al. | 2408.05075 | link |
2024-08-09 | CTE-MLO: Continuous-time and Efficient Multi-LiDAR Odometry with Localizability-aware Point Cloud Sampling | Hongming Shen et.al. | 2408.04901 | link |
2024-10-03 | VLM-MPC: Vision Language Foundation Model (VLM)-Guided Model Predictive Controller (MPC) for Autonomous Driving | Keke Long et.al. | 2408.04821 | null |
2024-08-08 | Eliminating Backdoors in Neural Code Models via Trigger Inversion | Weisong Sun et.al. | 2408.04683 | null |
2024-08-08 | Field Testing and Detection of Camera Interference for Autonomous Driving | Ki Beom Park et.al. | 2408.04524 | null |
2024-08-08 | Reinforcement Learning from Human Feedback for Lane Changing of Autonomous Vehicles in Mixed Traffic | Yuting Wang et.al. | 2408.04447 | null |
2024-08-08 | Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform | Daniel Vargas et.al. | 2408.04195 | null |
2024-08-07 | MORTAR: A Model-based Runtime Action Repair Framework for AI-enabled Cyber-Physical Systems | Renzhi Wang et.al. | 2408.03892 | null |
2024-08-07 | Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Christian Fruhwirth-Reisinger et.al. | 2408.03790 | link |
2024-08-07 | Orthogonal and oriented Fano planes, triangular embeddings of $K_7,$ and geometrical representations of the Frobenius group $F_{21}$ | Simone Costa et.al. | 2408.03743 | null |
2024-08-07 | MS-Mapping: An Uncertainty-Aware Large-Scale Multi-Session LiDAR Mapping System | Xiangcheng Hu et.al. | 2408.03723 | link |
2024-08-14 | DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba | Chengran Yuan et.al. | 2408.03601 | null |
2024-08-07 | Leveraging LLMs for Enhanced Open-Vocabulary 3D Scene Understanding in Autonomous Driving | Amirhosein Chahe et.al. | 2408.03516 | link |
2024-08-06 | Communication-Aware Consistent Edge Selection for Mobile Users and Autonomous Vehicles | Nazish Tahir et.al. | 2408.03435 | null |
2024-08-06 | Integrated Intention Prediction and Decision-Making with Spectrum Attention Net and Proximal Policy Optimization | Xiao Zhou et.al. | 2408.03191 | null |
2024-08-06 | Research on Autonomous Driving Decision-making Strategies based Deep Reinforcement Learning | Zixiang Wang et.al. | 2408.03084 | null |
2024-08-06 | SCOPE: A Synthetic Multi-Modal Dataset for Collective Perception Including Physical-Correct Weather Conditions | Jörg Gamerdinger et.al. | 2408.03065 | null |
2024-08-06 | Cross-cultural analysis of pedestrian group behaviour influence on crossing decisions in interactions with autonomous vehicles | Sergio Martín Serrano et.al. | 2408.03003 | null |
2024-08-06 | Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions | Amanda Jayanetti et.al. | 2408.02938 | null |
2024-08-06 | Compromising Embodied Agents with Contextual Backdoor Attacks | Aishan Liu et.al. | 2408.02882 | null |
2024-08-04 | Model Hijacking Attack in Federated Learning | Zheng Li et.al. | 2408.02131 | null |
2024-08-27 | KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Zhihao Lai et.al. | 2408.02088 | null |
2024-08-03 | STDA: Spatio-Temporal Dual-Encoder Network Incorporating Driver Attention to Predict Driver Behaviors Under Safety-Critical Scenarios | Dongyang Xu et.al. | 2408.01774 | null |
2024-08-03 | LAM3D: Leveraging Attention for Monocular 3D Object Detection | Diana-Alexandra Sas et.al. | 2408.01739 | null |
2024-08-02 | Trainable Pointwise Decoder Module for Point Cloud Segmentation | Bike Chen et.al. | 2408.01548 | null |
2024-08-01 | Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps | Hengyuan Zhang et.al. | 2408.01471 | null |
2024-07-18 | SUSTechGAN: Image Generation for Object Recognition in Adverse Conditions of Autonomous Driving | Gongjin Lan et.al. | 2408.01430 | link |
2024-08-02 | CommonUppRoad: A Framework of Formal Modelling, Verifying, Learning, and Visualisation of Autonomous Vehicles | Rong Gu et.al. | 2408.01093 | null |
2024-08-02 | Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions | Ajinkya Shinde et.al. | 2408.01085 | null |
2024-08-02 | MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection | Xiangbo Gao et.al. | 2408.01037 | link |
2024-07-15 | Quantification and Validation for Degree of Understanding in M2M Semantic Communications | Linhan Xia et.al. | 2408.00767 | null |
2024-08-01 | Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang et.al. | 2408.00766 | null |
2024-08-01 | MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection | Xiangyuan Peng et.al. | 2408.00565 | null |
2024-08-01 | DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving | Xuemeng Yang et.al. | 2408.00415 | null |
2024-08-01 | Enabling Next-Generation V2X Perception: Wireless Rigid Body Localization and Tracking | Niclas Führling et.al. | 2408.00349 | null |
2024-08-01 | RoCo:Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment | Zhe Huang et.al. | 2408.00257 | link |
2024-07-31 | Areas of Improvement for Autonomous Vehicles: A Machine Learning Analysis of Disengagement Reports | Tyler Ward et.al. | 2408.00051 | null |
2024-07-31 | Diagnostic Runtime Monitoring with Martingales | Ali Hindy et.al. | 2407.21748 | null |
2024-07-31 | MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Seongju Lee et.al. | 2407.21635 | link |
2024-08-01 | Analysis of Functional Insufficiencies and Triggering Conditions to Improve the SOTIF of an MPC-based Trajectory Planner | Mirko Conrad et.al. | 2407.21569 | null |
2024-07-31 | SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving | Peiru Zheng et.al. | 2407.21293 | null |
2024-07-30 | Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving | Bernard Lange et.al. | 2407.21126 | null |
2024-07-30 | Learning Ordinality in Semantic Segmentation | Rafael Cristino et.al. | 2407.20959 | null |
2024-07-30 | Optimizing 5G-Advanced Networks for Time-critical Applications: The Role of L4S | Guangjin Pan et.al. | 2407.20852 | null |
2024-07-30 | Task-Oriented Communication for Vehicle-to-Infrastructure Cooperative Perception | Jiawei Shao et.al. | 2407.20748 | null |
2024-07-30 | Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization | Michael Kölle et.al. | 2407.20739 | null |
2024-07-30 | Scene-Specific Trajectory Sets: Maximizing Representation in Motion Forecasting | Abhishek Vivekanandan et.al. | 2407.20732 | null |
2024-07-30 | On-the-fly Communication-and-Computing to Enable Representation Learning for Distributed Point Clouds | Xu Chen et.al. | 2407.20710 | null |
2024-07-29 | Collision Probability Distribution Estimation via Temporal Difference Learning | Thomas Steinecker et.al. | 2407.20000 | link |
2024-07-29 | Hydrodynamics of pulsating active liquids | Tirthankar Banerjee et.al. | 2407.19955 | null |
2024-07-29 | ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement | Ezequiel Perez-Zarate et.al. | 2407.19708 | link |
2024-07-28 | HD-maps as Prior Information for Globally Consistent Mapping in GPS-denied Environments | Waqas Ali et.al. | 2407.19463 | null |
2024-07-28 | Reputation-Driven Asynchronous Federated Learning for Enhanced Trajectory Prediction with Blockchain | Weiliang Chen et.al. | 2407.19428 | null |
2024-07-27 | Large Language Models for Human-like Autonomous Driving: A Survey | Yun Li et.al. | 2407.19280 | null |
2024-07-26 | Addressing Behavior Model Inaccuracies for Safe Motion Control in Uncertain Dynamic Environments | Minjun Sung et.al. | 2407.19071 | null |
2024-07-26 | Sparse Refinement for Efficient High-Resolution Semantic Segmentation | Zhijian Liu et.al. | 2407.19014 | null |
2024-07-26 | Wolf: Captioning Everything with a World Summarization Framework | Boyi Li et.al. | 2407.18908 | null |
2024-07-26 | SHANGUS: Deep Reinforcement Learning Meets Heuristic Optimization for Speedy Frontier-Based Exploration of Autonomous Vehicles in Unknown Spaces | Seunghyeop Nam et.al. | 2407.18892 | null |
2024-07-26 | HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM | Zhe Xin et.al. | 2407.18813 | null |
2024-07-26 | Foundation Models for the Digital Twin Creation of Cyber-Physical Systems | Shaukat Ali et.al. | 2407.18779 | null |
2024-08-04 | PP-TIL: Personalized Planning for Autonomous Driving with Instance-based Transfer Imitation Learning | Fangze Lin et.al. | 2407.18569 | link |
2024-07-29 | Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network | Guipeng Xin et.al. | 2407.18551 | link |
2024-07-26 | Gaussian Lane Keeping: A Robust Prediction Baseline | David Isele et.al. | 2407.18451 | null |
2024-07-16 | Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC) | Seyed Nima Omidsajedi et.al. | 2407.18264 | null |
2024-07-25 | Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception | Julia Hindel et.al. | 2407.18145 | null |
2024-09-10 | TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework | Guanfeng Tang et.al. | 2407.18038 | null |
2024-07-25 | StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span Memory | Zhiheng Li et.al. | 2407.17905 | link |
2024-07-25 | Image Segmentation via Divisive Normalization: dealing with environmental diversity | Pablo Hernández-Cámara et.al. | 2407.17829 | null |
2024-07-25 | CRASH: Crash Recognition and Anticipation System Harnessing with Context-Aware and Temporal Focus Attentions | Haicheng Liao et.al. | 2407.17757 | null |
2024-07-25 | Control Informed Design of the IAC Autonomous Racecar for Operation at the Dynamic Envelope | Qilun Zhu et.al. | 2407.17737 | null |
2024-07-20 | CORT: Class-Oriented Real-time Tracking for Embedded Systems | Edoardo Cittadini et.al. | 2407.17521 | null |
2024-07-24 | $VILA^2$ : VILA Augmented VILA | Yunhao Fang et.al. | 2407.17453 | null |
2024-07-24 | 3D Question Answering for City Scene Understanding | Penglei Sun et.al. | 2407.17398 | null |
2024-07-24 | Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches | Chenxing Zhao et.al. | 2407.17312 | null |
2024-07-25 | LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume Rendering | Simon Boeder et.al. | 2407.17310 | null |
2024-07-24 | Testing Large Language Models on Driving Theory Knowledge and Skills for Connected Autonomous Vehicles | Zuoyin Tang et.al. | 2407.17211 | null |
2024-07-24 | Applications of Multi-Agent Deep Reinforcement Learning Communication in Network Management: A Survey | Yue Pi et.al. | 2407.17030 | null |
2024-07-24 | Progressive Query Refinement Framework for Bird’s-Eye-View Semantic Segmentation from Surrounding Images | Dooseop Choi et.al. | 2407.17003 | link |
2024-07-23 | SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving with Analytic Safety Guarantees | Tianyu Shi et.al. | 2407.16857 | null |
2024-07-24 | A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data | Adrian Remonda et.al. | 2407.16680 | link |
2024-07-23 | Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving | Anam Manzoor et.al. | 2407.16647 | null |
2024-07-24 | Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles | Seamie Hayes et.al. | 2407.16636 | null |
2024-07-23 | MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Youngmin Oh et.al. | 2407.16448 | link |
2024-07-23 | Cleaning Robots in Public Spaces: A Survey and Proposal for Benchmarking Based on Stakeholders Interviews | Raphael Memmesheimer et.al. | 2407.16393 | null |
2024-07-23 | Understanding Impacts of Electromagnetic Signal Injection Attacks on Object Detection | Youqian Zhang et.al. | 2407.16327 | null |
2024-07-23 | TAPTRv2: Attention-based Position Update Improves Tracking Any Point | Hongyang Li et.al. | 2407.16291 | null |
2024-07-26 | When, Where, and What? A Novel Benchmark for Accident Anticipation and Localization with Large Language Models | Haicheng Liao et.al. | 2407.16277 | null |
2024-07-23 | LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera | Yukai Ma et.al. | 2407.16197 | null |
2024-07-22 | MILAN: Milli-Annotations for Lidar Semantic Segmentation | Nermin Samet et.al. | 2407.15797 | null |
2024-07-22 | Flow-guided Motion Prediction with Semantics and Dynamic Occupancy Grid Maps | Rabbia Asghar et.al. | 2407.15675 | null |
2024-07-22 | DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving | Jiahang Tu et.al. | 2407.15661 | link |
2024-07-22 | Towards a Universal Evaluation Model for Careful and Competent Autonomous Driving | Kethan Reddy et.al. | 2407.15596 | null |
2024-07-22 | WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding | Quan Kong et.al. | 2407.15350 | null |
2024-07-22 | Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection | Yiran Yang et.al. | 2407.15334 | link |
2024-07-20 | Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Dylan J. Foster et.al. | 2407.15007 | null |
2024-07-19 | Complementary Learning for Real-World Model Failure Detection | Daniel Bogdoll et.al. | 2407.14306 | link |
2024-07-19 | Hyperparameter Optimization for Driving Strategies Based on Reinforcement Learning | Nihal Acharya Adde et.al. | 2407.14262 | null |
2024-07-17 | Continual Learning for Adaptable Car-Following in Dynamic Traffic Environments | Xianda Chen et.al. | 2407.14247 | null |
2024-07-19 | KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models | Kemou Jiang et.al. | 2407.14239 | null |
2024-07-18 | Boosting Online 3D Multi-Object Tracking through Camera-Radar Cross Check | Sheng-Yao Kuan et.al. | 2407.13937 | null |
2024-07-19 | Mask2Map: Vectorized HD Map Construction Using Bird’s Eye View Segmentation Masks | Sehwan Choi et.al. | 2407.13517 | link |
2024-07-18 | Risk-Aware Vehicle Trajectory Prediction Under Safety-Critical Scenarios | Qingfan Wang et.al. | 2407.13480 | null |
2024-08-26 | Improving Out-of-Distribution Generalization of Trajectory Prediction for Autonomous Driving via Polynomial Representations | Yue Yao et.al. | 2407.13431 | link |
2024-07-18 | Ultra-Low-Latency Edge Inference for Distributed Sensing | Zhanwei Wang et.al. | 2407.13360 | null |
2024-07-18 | $μ$ Drive: User-Controlled Autonomous Driving | Kun Wang et.al. | 2407.13201 | null |
2024-07-21 | Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement | Yulin He et.al. | 2407.13155 | null |
2024-07-18 | OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation | Jian Sun et.al. | 2407.13137 | null |
2024-07-18 | PG-Attack: A Precision-Guided Adversarial Attack Framework Against Vision Foundation Models for Autonomous Driving | Jiyuan Fu et.al. | 2407.13111 | link |
2024-07-17 | Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning | Minjae Cho et.al. | 2407.13006 | null |
2024-07-17 | KiGRAS: Kinematic-Driven Generative Model for Realistic Agent Simulation | Jianbo Zhao et.al. | 2407.12940 | null |
2024-07-17 | AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases | Zhaorun Chen et.al. | 2407.12784 | link |
2024-07-25 | Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving | Yuqi Dai et.al. | 2407.12491 | null |
2024-09-13 | Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Sangjun Lee et.al. | 2407.12405 | link |
2024-07-17 | Efficient Depth-Guided Urban View Synthesis | Sheng Miao et.al. | 2407.12395 | null |
2024-07-16 | Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation | Olga Zatsarynna et.al. | 2407.11954 | link |
2024-07-16 | MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation | Xiaoshuai Hao et.al. | 2407.11682 | null |
2024-07-16 | Perception Helps Planning: Facilitating Multi-Stage Lane-Level Integration via Double-Edge Structures | Guoliang You et.al. | 2407.11644 | null |
2024-07-17 | Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts | Jianhao Li et.al. | 2407.11382 | null |
2024-07-16 | Continuity Preserving Online CenterLine Graph Learning | Yunhui Han et.al. | 2407.11337 | link |
2024-07-02 | dAJC: A 2.02mW 50Mbps Direct Analog to MJPEG Converter for Video Sensor Node using Low-Noise Switched Capacitor MAC-Quantizer with Auto-Calibration and Sparsity-Aware ADC | Gourab Barik et.al. | 2407.11023 | null |
2024-09-04 | A unified theory and statistical learning approach for traffic conflict detection | Yiru Jiao et.al. | 2407.10959 | link |
2024-07-20 | RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Chunliang Li et.al. | 2407.10876 | link |
2024-07-15 | DINO Pre-training for Vision-based End-to-end Autonomous Driving | Shubham Juneja et.al. | 2407.10803 | null |
2024-07-20 | Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models | Yuchen Yang et.al. | 2407.10299 | link |
2024-09-13 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation | Li Li et.al. | 2407.10159 | link |
2024-07-14 | FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Zheng Jiang et.al. | 2407.10135 | link |
2024-07-13 | IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception | Shaohong Wang et.al. | 2407.09857 | link |
2024-07-12 | Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion | Shiqi Tan et.al. | 2407.09697 | null |
2024-06-25 | Optimization of Autonomous Driving Image Detection Based on RFAConv and Triplet Attention | Zhipeng Ling et.al. | 2407.09530 | null |
2024-07-12 | Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting | Jinning Li et.al. | 2407.09475 | null |
2024-07-12 | TRAVERSE: Traffic-Responsive Autonomous Vehicle Experience & Rare-event Simulation for Enhanced safety | Sandeep Thalapanane et.al. | 2407.09466 | null |
2024-07-12 | GNN with Model-based RL for Multi-agent Systems | Hanxiao Chen et.al. | 2407.09249 | null |
2024-07-12 | Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network | Shun Kotoku et.al. | 2407.09124 | null |
2024-07-11 | Real-Time Anomaly Detection and Reactive Planning with Large Language Models | Rohan Sinha et.al. | 2407.08735 | null |
2024-07-11 | MapLocNet: Coarse-to-Fine Feature Registration for Visual Re-Localization in Navigation Maps | Hang Wu et.al. | 2407.08561 | null |
2024-07-11 | BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight | Hang Wu et.al. | 2407.08526 | null |
2024-07-11 | Joint Optimization of Age of Information and Energy Consumption in NR-V2X System based on Deep Reinforcement Learning | Shulin Song et.al. | 2407.08458 | link |
2024-07-11 | CLEO: Continual Learning of Evolving Ontologies | Shishir Muralidhara et.al. | 2407.08411 | null |
2024-07-18 | Application of Data-Driven Model Predictive Control for Autonomous Vehicle Steering | Jiarui Zhang et.al. | 2407.08401 | null |
2024-07-11 | Accurate Cooperative Localization Utilizing LiDAR-equipped Roadside Infrastructure for Autonomous Driving | Yuze Jiang et.al. | 2407.08384 | null |
2024-07-11 | HDT: Hierarchical Document Transformer | Haoyu He et.al. | 2407.08330 | null |
2024-07-11 | WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving | Jannik Zürn et.al. | 2407.08280 | link |
2024-07-10 | NDST: Neural Driving Style Transfer for Human-Like Vision-Based Autonomous Driving | Donghyun Kim et.al. | 2407.08073 | null |
2024-07-10 | Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors | Lei Cheng et.al. | 2407.08049 | null |
2024-07-10 | Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation | Jaeyeul Kim et.al. | 2407.07995 | link |
2024-07-10 | RoBus: A Multimodal Dataset for Controllable Road Networks and Building Layouts Generation | Tao Li et.al. | 2407.07835 | link |
2024-07-10 | LSM: A Comprehensive Metric for Assessing the Safety of Lane Detection Systems in Autonomous Driving | Jörg Gamerdinger et.al. | 2407.07740 | null |
2024-07-10 | Towards Human-Like Driving: Active Inference in Autonomous Vehicle Control | Elahe Delavari et.al. | 2407.07684 | null |
2024-07-18 | Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction | Yili Liu et.al. | 2407.07587 | null |
2024-07-10 | Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles | Dongfang Guo et.al. | 2407.07510 | null |
2024-07-17 | Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining | Tianfang Sun et.al. | 2407.07465 | null |
2024-07-16 | Event-Aided Time-to-Collision Estimation for Autonomous Driving | Jinghang Li et.al. | 2407.07324 | null |
2024-07-09 | Exploring Camera Encoder Designs for Autonomous Driving Perception | Barath Lakshmanan et.al. | 2407.07276 | null |
2024-07-09 | Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction | Haicheng Liao et.al. | 2407.07020 | null |
2024-07-09 | Explainable AI for Enhancing Efficiency of DL-based Channel Estimation | Abdul Karim Gizzini et.al. | 2407.07009 | null |
2024-07-09 | Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention | Xunjiang Gu et.al. | 2407.06683 | link |
2024-07-19 | Exploring the Causality of End-to-End Autonomous Driving | Jiankun Li et.al. | 2407.06546 | link |
2024-07-10 | VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Yibo Liu et.al. | 2407.06516 | null |
2024-07-17 | Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation | Detian Chu et.al. | 2407.06317 | null |
2024-07-10 | 4D Contrastive Superflows are Dense 3D Representation Learners | Xiang Xu et.al. | 2407.06190 | link |
2024-07-16 | PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jinhua Zhang et.al. | 2407.06109 | link |
2024-07-08 | RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation | Sarah Elmahdy et.al. | 2407.06016 | null |
2024-07-08 | Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding | Aaron Lohner et.al. | 2407.05910 | null |
2024-07-08 | Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition | Yaozong Gan et.al. | 2407.05814 | null |
2024-07-08 | Boosting 3D Object Detection with Semantic-Aware Multi-Branch Framework | Hao Jing et.al. | 2407.05769 | null |
2024-07-18 | BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space | Yumeng Zhang et.al. | 2407.05679 | link |
2024-07-08 | MSTF: Multiscale Transformer for Incomplete Trajectory Prediction | Zhanwen Liu et.al. | 2407.05671 | null |
2024-07-08 | GenFollower: Enhancing Car-Following Prediction with Large Language Models | Xianda Chen et.al. | 2407.05611 | null |
2024-07-14 | Evolutionary Trigger Detection and Lightweight Model Repair Based Backdoor Defense | Qi Zhou et.al. | 2407.05396 | null |
2024-07-07 | SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning | Yi Feng et.al. | 2407.05283 | link |
2024-07-07 | Tracking Reflected Objects: A Benchmark | Xiaoyu Guo et.al. | 2407.05235 | null |
2024-07-06 | T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy | Fan Duan et.al. | 2407.05008 | link |
2024-07-15 | JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention | Brian Cheong et.al. | 2407.04926 | link |
2024-07-06 | SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions | Zaid A. El-Shair et.al. | 2407.04908 | null |
2024-07-05 | MUSIC-lite: Efficient MUSIC using Approximate Computing: An OFDM Radar Case Study | Rajat Bhattacharjya et.al. | 2407.04849 | null |
2024-07-05 | JaywalkerVR: A VR System for Collecting Safety-Critical Pedestrian-Vehicle Interactions | Kenta Mukoya et.al. | 2407.04843 | null |
2024-07-15 | LaRa: Efficient Large-Baseline Radiance Fields | Anpei Chen et.al. | 2407.04699 | null |
2024-07-05 | Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain | Christophe Karam et.al. | 2407.04484 | null |
2024-07-05 | Dance of the ADS: Orchestrating Failures through Historically-Informed Scenario Fuzzing | Tong Wang et.al. | 2407.04359 | null |
2024-07-05 | Towards Stable 3D Object Detection | Jiabao Wang et.al. | 2407.04305 | null |
2024-07-05 | WOMD-Reasoning: A Large-Scale Language Dataset for Interaction and Driving Intentions Reasoning | Yiheng Li et.al. | 2407.04281 | link |
2024-07-05 | Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey | Han Wang et.al. | 2407.04277 | null |
2024-07-04 | Behavioural gap assessment of human-vehicle interaction in real and virtual reality-based scenarios in autonomous driving | Sergio. Martín Serrano et.al. | 2407.04070 | null |
2024-07-12 | Detect Closer Surfaces that can be Seen: New Modeling and Evaluation in Cross-domain 3D Object Detection | Ruixiao Zhang et.al. | 2407.04061 | link |
2024-07-04 | Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation | Laiyan Ding et.al. | 2407.04041 | link |
2024-08-22 | StreamLTS: Query-based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection | Yunshuang Yuan et.al. | 2407.03825 | link |
2024-07-04 | A Fast Dynamic Point Detection Method for LiDAR-Inertial Odometry in Driving Scenarios | Zikang Yuan et.al. | 2407.03590 | link |
2024-07-17 | Efficient Fusion and Task Guided Embedding for End-to-end Autonomous Driving | Yipin Guo et.al. | 2407.02878 | null |
2024-07-03 | A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes | Li Fang et.al. | 2407.02830 | link |
2024-07-03 | Solving Motion Planning Tasks with a Scalable Generative Model | Yihan Hu et.al. | 2407.02797 | link |
2024-07-04 | AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction | Mustafa Khan et.al. | 2407.02598 | null |
2024-06-18 | Sample-efficient Imitative Multi-token Decision Transformer for Generalizable Real World Driving | Hang Zhou et.al. | 2407.02508 | null |
2024-05-10 | Light-SLAM: A Robust Deep-Learning Visual SLAM System Based on LightGlue under Challenging Lighting Conditions | Zhiqi Zhao et.al. | 2407.02382 | null |
2024-07-02 | Research on Reliable and Safe Occupancy Grid Prediction in Underground Parking Lots | JiaQi Luo et.al. | 2407.02197 | null |
2024-07-02 | I2EKF-LO: A Dual-Iteration Extended Kalman Filter Based LiDAR Odometry | Wenlu Yu et.al. | 2407.02190 | link |
2024-07-02 | DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection | Kaixin Xu et.al. | 2407.02098 | null |
2024-07-02 | LiDAR-based HD Map Localization using Semantic Generalized ICP with Road Marking Detection | Yansong Gong et.al. | 2407.02061 | null |
2024-07-02 | FlowTrack: Point-level Flow Network for 3D Single Object Tracking | Shuo Li et.al. | 2407.01959 | null |
2024-07-02 | Cloud-Edge-Terminal Collaborative AIGC for Autonomous Driving | Jianan Zhang et.al. | 2407.01956 | null |
2024-07-01 | Predicting Trust Dynamics with Dynamic SEM in Human-AI Cooperation | Sota Kaneko et.al. | 2407.01752 | null |
2024-07-01 | SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving | Qingwen Zhang et.al. | 2407.01702 | link |
2024-07-01 | Deep Reinforcement Learning for Adverse Garage Scenario Generation | Kai Li et.al. | 2407.01333 | null |
2024-07-01 | Let Hybrid A* Path Planner Obey Traffic Rules: A Deep Reinforcement Learning-Based Planning Framework | Xibo Li et.al. | 2407.01216 | null |
2024-07-01 | FedRC: A Rapid-Converged Hierarchical Federated Learning Framework in Street Scene Semantic Understanding | Wei-Bin Kou et.al. | 2407.01103 | null |
2024-07-01 | HGNET: A Hierarchical Feature Guided Network for Occupancy Flow Field Prediction | Zhan Chen et.al. | 2407.01097 | null |
2024-07-01 | Data on the Move: Traffic-Oriented Data Trading Platform Powered by AI Agent with Common Sense | Yi Yu et.al. | 2407.00995 | null |
2024-07-01 | Acceleration method for generating perception failure scenarios based on editing Markov process | Canjie Cai et.al. | 2407.00980 | null |
2024-07-01 | Locomotion as Manipulation with ReachBot | Tony G. Chen et.al. | 2407.00973 | null |
2024-07-01 | FALCON: Frequency Adjoint Link with CONtinuous Density Mask for Fast Single Image Dehazing | Donghyun Kim et.al. | 2407.00972 | null |
2024-07-01 | Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving | Ran Tian et.al. | 2407.00959 | null |
2024-07-01 | Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks: Design Optimization and Analysis | Xueyao Zhang et.al. | 2407.00933 | null |
2024-08-30 | CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation | Huawei Sun et.al. | 2407.00697 | link |
2024-06-29 | A Rule-Based Behaviour Planner for Autonomous Driving | Bouchard Frederic et.al. | 2407.00460 | null |
2024-05-03 | AdaBridge: Dynamic Data and Computation Reuse for Efficient Multi-task DNN Co-evolution in Edge Systems | Lehao Wang et.al. | 2407.00016 | null |
2024-06-28 | Automated Deep Neural Network Inference Partitioning for Distributed Embedded Systems | Fabian Kreß et.al. | 2406.19913 | null |
2024-06-28 | StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction | Jiaheng Zhuang et.al. | 2406.19844 | null |
2024-06-28 | LCSim: A Large-Scale Controllable Traffic Simulator | Yuheng Zhang et.al. | 2406.19781 | link |
2024-06-28 | Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey | Uchitha Rajapaksha et.al. | 2406.19675 | null |
2024-06-27 | BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection | Yang Song et.al. | 2406.19048 | null |
2024-06-27 | XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis | Hao Li et.al. | 2406.18360 | null |
2024-06-25 | End-to-End Autonomous Driving without Costly Modularization and 3D Manual Annotation | Mingzhe Guo et.al. | 2406.17680 | null |
2024-06-25 | MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection | Michelle Adeline et.al. | 2406.17654 | link |
2024-06-25 | Querying Labeled Time Series Data with Scenario Programs | Devan Shanker et.al. | 2406.17627 | null |
2024-06-25 | Image-Guided Outdoor LiDAR Perception Quality Assessment for Autonomous Driving | Ce Zhang et.al. | 2406.17265 | null |
2024-06-24 | GPT-4V Explorations: Mining Autonomous Driving | Zixuan Li et.al. | 2406.16817 | null |
2024-08-11 | ShanghaiTech Mapping Robot is All You Need: Robot System for Collecting Universal Ground Vehicle Datasets | Bowen Xu et.al. | 2406.16713 | null |
2024-06-24 | SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments | Neng Wang et.al. | 2406.16279 | link |
2024-06-23 | DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation | Yueru Luo et.al. | 2406.16072 | link |
2024-06-22 | ISS-Scenario: Scenario-based Testing in CARLA | Renjue Li et.al. | 2406.15777 | link |
2024-05-24 | Automated Parking Planning with Vision-Based BEV Approach | Yuxuan Zhao et.al. | 2406.15430 | null |
2024-05-24 | Automatic parking planning control method based on improved A* algorithm | Yuxuan Zhao et.al. | 2406.15429 | null |
2024-06-21 | NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking | Daniel Dauner et.al. | 2406.15349 | link |
2024-06-21 | Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection | Lynn Vonderhaar et.al. | 2406.15268 | null |
2024-06-20 | Multi-Task Lane-Free Driving Strategy for Connected and Automated Vehicles: A Multi-Agent Deep Reinforcement Learning Approach | Mehran Berahman et.al. | 2406.14766 | null |
2024-06-20 | Preferential Multi-Objective Bayesian Optimization | Raul Astudillo et.al. | 2406.14699 | null |
2024-06-24 | Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Hao Mark Chen et.al. | 2406.14593 | link |
2024-07-24 | Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Yuan Chen et.al. | 2406.14556 | link |
2024-06-20 | FutureNet-LOF: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding | Mingkun Wang et.al. | 2406.14422 | null |
2024-06-20 | PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions | Sihan Ma et.al. | 2406.14367 | null |
2024-06-20 | Uncertainty and Self-Supervision in Single-View Depth | Javier Rodriguez-Puigvert et.al. | 2406.14226 | null |
2024-06-20 | GTP-UDrive: Unified Game-Theoretic Trajectory Planner and Decision-Maker for Autonomous Driving in Mixed Traffic Environments | Nouhed Naidja et.al. | 2406.14077 | null |
2024-06-20 | Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing | Xinbo Zhao et.al. | 2406.14054 | null |
2024-06-20 | The Use of Multimodal Large Language Models to Detect Objects from Thermal Images: Transportation Applications | Huthaifa I. Ashqar et.al. | 2406.13898 | null |
2024-06-19 | Martian Exploration of Lava Tubes (MELT) with ReachBot: Scientific Investigation and Concept of Operations | Julia Di et.al. | 2406.13857 | null |
2024-07-30 | Safe and Non-Conservative Trajectory Planning for Autonomous Driving Handling Unanticipated Behaviors of Traffic Participants | Tommaso Benciolini et.al. | 2406.13396 | link |
2024-08-20 | ECAFormer: Low-light Image Enhancement using Cross Attention | Yudi Ruan et.al. | 2406.13281 | link |
2024-06-19 | Act Better by Timing: A timing-Aware Reinforcement Learning for Autonomous Driving | Guanzhou Li et.al. | 2406.13223 | null |
2024-06-18 | ABNet: Attention BarrierNet for Safe and Scalable Robot Learning | Wei Xiao et.al. | 2406.13025 | link |
2024-06-18 | Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly | Siddhant Shete et.al. | 2406.12698 | null |
2024-06-18 | Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation | Guoyu Yang et.al. | 2406.12496 | link |
2024-06-19 | Is Your HD Map Constructor Reliable under Sensor Corruptions? | Xiaoshuai Hao et.al. | 2406.12214 | null |
2024-06-17 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | Crossfusor: A Cross-Attention Transformer Enhanced Conditional Diffusion Model for Car-Following Trajectory Prediction | Junwei You et.al. | 2406.11941 | null |
2024-06-17 | A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving | Yang Lou et.al. | 2406.11707 | null |
2024-06-17 | Communication-Efficient MARL for Platoon Stability and Energy-efficiency Co-optimization in Cooperative Adaptive Cruise Control of CAVs | Min Hua et.al. | 2406.11653 | null |
2024-06-14 | Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization | Wonho Song et.al. | 2406.11599 | null |
2024-07-17 | Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Vaneet Aggarwal et.al. | 2406.11481 | null |
2024-06-17 | Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection | Yecheol Kim et.al. | 2406.11313 | link |
2024-06-17 | Model Adaptation for Time Constrained Embodied Control | Jaehyun Song et.al. | 2406.11128 | null |
2024-06-16 | SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection | Lin Liu et.al. | 2406.10907 | null |
2024-06-16 | TrafficBots V1.5: Traffic Simulation via Conditional VAEs and Transformers with Relative Pose Encoding | Zhejun Zhang et.al. | 2406.10898 | link |
2024-06-16 | An LLM-enhanced Multi-objective Evolutionary Search for Autonomous Driving Test Scenario Generation | Haoxiang Tian et.al. | 2406.10857 | null |
2024-06-16 | Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses | Zhiwen Fan et.al. | 2406.10789 | null |
2024-06-15 | GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR | Bharat Singh et.al. | 2406.10722 | null |
2024-06-15 | Planning with Adaptive World Models for Autonomous Driving | Arun Balajee Vasudevan et.al. | 2406.10714 | null |
2024-06-15 | Object Detection using Oriented Window Learning Vi-sion Transformer: Roadway Assets Recognition | Taqwa Alhadidi et.al. | 2406.10712 | null |
2024-07-17 | MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception | M. Mahbubur Rahman et.al. | 2406.10708 | link |
2024-06-15 | Semantic Communication for Edge Intelligence Enabled Autonomous Driving System | Yunqi Feng et.al. | 2406.10606 | null |
2024-07-16 | SparseRadNet: Sparse Perception Neural Network on Subsampled Radar Data | Jialong Wu et.al. | 2406.10600 | null |
2024-06-15 | Generating and Evolving Reward Functions for Highway Driving with Large Language Models | Xu Han et.al. | 2406.10540 | null |
2024-04-25 | Object criticality for safer navigation | Andrea Ceccarelli et.al. | 2406.10232 | null |
2024-06-14 | CarLLaVA: Vision language models for camera-only closed-loop driving | Katrin Renz et.al. | 2406.10165 | null |
2024-06-14 | MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report | Zhongyu Yang et.al. | 2406.10125 | null |
2024-06-14 | DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications | Li Li et.al. | 2406.10068 | link |
2024-06-14 | SemanticSpray++: A Multimodal Dataset for Autonomous Driving in Wet Surface Conditions | Aldi Piroli et.al. | 2406.09945 | null |
2024-06-14 | Globally Optimal GNSS Multi-Antenna Lever Arm Calibration | Thomas Wodtko et.al. | 2406.09866 | null |
2024-06-14 | A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion | Kailai Sun et.al. | 2406.09792 | link |
2024-06-14 | Research on Edge Detection of LiDAR Images Based on Artificial Intelligence Technology | Haowei Yang et.al. | 2406.09773 | null |
2024-07-17 | Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis | Zongyue Qin et.al. | 2406.09606 | null |
2024-06-13 | SimGen: Simulator-conditioned Driving Scene Generation | Yunsong Zhou et.al. | 2406.09386 | null |
2024-06-13 | Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns | Kaavya Rekanar et.al. | 2406.09203 | null |
2024-07-25 | Auto-Vocabulary Segmentation for LiDAR Points | Weijie Wei et.al. | 2406.09126 | link |
2024-06-26 | CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving | Jonathan Booher et.al. | 2406.08878 | null |
2024-06-13 | Trajectory Planning for Autonomous Driving in Unstructured Scenarios Based on Graph Neural Network and Numerical Optimization | Sumin Zhang et.al. | 2406.08855 | null |
2024-06-13 | BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection | Wenjie Wang et.al. | 2406.08785 | link |
2024-06-12 | Enhancing End-to-End Autonomous Driving with Latent World Model | Yingyan Li et.al. | 2406.08481 | link |
2024-06-12 | PRIBOOT: A New Data-Driven Expert for Improved Driving Simulations | Daniel Coelho et.al. | 2406.08421 | link |
2024-06-12 | LaneCPP: Continuous 3D Lane Detection using Physical Priors | Maximilian Pittner et.al. | 2406.08381 | null |
2024-08-12 | Utilizing Navigation Paths to Generate Target Points for Enhanced End-to-End Autonomous Driving Planning | Yuanhua Shen et.al. | 2406.08349 | null |
2024-06-12 | Valeo4Cast: A Modular Approach to End-to-End Forecasting | Yihong Xu et.al. | 2406.08113 | link |
2024-06-11 | PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow | Joshua Tokarsky et.al. | 2406.07667 | null |
2024-06-11 | Instruct Large Language Models to Drive like Humans | Ruijun Zhang et.al. | 2406.07296 | link |
2024-06-11 | EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Yining Shi et.al. | 2406.07042 | link |
2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037 | null |
2024-06-12 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023 | null |
2024-06-10 | PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation | Zhenyu Li et.al. | 2406.06679 | null |
2024-06-10 | Hybrid Video Anomaly Detection for Anomalous Scenarios in Autonomous Driving | Daniel Bogdoll et.al. | 2406.06423 | null |
2024-06-10 | UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving | Daniel Bogdoll et.al. | 2406.06370 | null |
2024-06-10 | DualAD: Disentangling the Dynamic and Static World for End-to-End Driving | Simon Doll et.al. | 2406.06264 | null |
2024-06-09 | Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks | Zhiyuan Cheng et.al. | 2406.05857 | link |
2024-06-09 | ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05810 | null |
2024-07-19 | SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05800 | null |
2024-06-09 | Certified Robustness to Data Poisoning in Gradient-Based Training | Philip Sosnin et.al. | 2406.05670 | link |
2024-06-09 | A Superalignment Framework in Autonomous Driving with Large Language Models | Xiangrui Kong et.al. | 2406.05651 | null |
2024-06-08 | Toward Autonomous Driving by Musculoskeletal Humanoids: A Study of Developed Hardware and Learning-Based Software | Kento Kawaharazuka et.al. | 2406.05573 | null |
2024-06-07 | CityCraft: A Real Crafter for 3D City Generation | Jie Deng et.al. | 2406.04983 | null |
2024-07-08 | A Survey of Fragile Model Watermarking | Zhenzhe Gao et.al. | 2406.04809 | null |
2024-06-07 | EAIA: An Efficient and Anonymous Identity Authentication Scheme in 5G-V2V | Qianmin Du et.al. | 2406.04705 | null |
2024-06-06 | Step Out and Seek Around: On Warm-Start Training with Incremental Data | Maying Shen et.al. | 2406.04484 | null |
2024-06-06 | Optimizing Autonomous Driving for Safety: A Human-Centric Approach with LLM-Enhanced RLHF | Yuan Sun et.al. | 2406.04481 | null |
2024-06-13 | DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Sergio Casas et.al. | 2406.04426 | null |
2024-06-06 | Frequency-based Matcher for Long-tailed Semantic Segmentation | Shan Li et.al. | 2406.03917 | link |
2024-06-11 | Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving | Xiaosong Jia et.al. | 2406.03877 | link |
2024-06-06 | Monocular Localization with Semantics Map for Autonomous Vehicles | Jixiang Wan et.al. | 2406.03835 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien Quéméneur et.al. | 2406.03611 | link |
2024-06-05 | AD-H: Autonomous Driving with Hierarchical Agents | Zaibin Zhang et.al. | 2406.03474 | null |
2024-06-11 | Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts | Dominik Scheuble et.al. | 2406.03461 | null |
2024-06-05 | Prompt-based Visual Alignment for Zero-shot Policy Transfer | Haihan Gao et.al. | 2406.03250 | null |
2024-06-05 | Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection | Qutub Syed et.al. | 2406.03188 | null |
2024-06-05 | Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework | Eliraz Orfaig et.al. | 2406.03129 | null |
2024-06-05 | Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors | Han Li et.al. | 2406.03105 | link |
2024-06-05 | Task-Oriented Wireless Communications for Collaborative Perception in Intelligent Unmanned Systems | Sheng Zhou et.al. | 2406.03086 | null |
2024-06-05 | Correlation of Software-in-the-Loop Simulation with Physical Testing for Autonomous Driving | Zhennan Fei et.al. | 2406.03040 | null |
2024-06-05 | DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Yidong Huang et.al. | 2406.03008 | link |
2024-06-05 | Dynamically Expanding Capacity of Autonomous Driving with Near-Miss Focused Training Framework | Ziyuan Yang et.al. | 2406.02865 | null |
2024-06-01 | Data Quality in Edge Machine Learning: A State-of-the-Art Survey | Mohammed Djameleddine Belgoumri et.al. | 2406.02600 | null |
2024-06-04 | Out-of-Distribution Runtime Adaptation with Conformalized Neural Network Ensembles | Polo Contreras et.al. | 2406.02436 | null |
2024-07-19 | Decoupling of neural network calibration measures | Dominik Werner Wolf et.al. | 2406.02411 | null |
2024-06-04 | Radar Spectra-Language Model for Automotive Scene Parsing | Mariia Pushkareva et.al. | 2406.02158 | null |
2024-06-04 | UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking | Lijun Zhou et.al. | 2406.02147 | null |
2024-06-05 | Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models | Samuel M. Bateman et.al. | 2406.01961 | null |
2024-06-04 | ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Chen Mao et.al. | 2406.01906 | link |
2024-06-03 | ZAPP! Zonotope Agreement of Prediction and Planning for Continuous-Time Collision Avoidance with Discrete-Time Dynamics | Luca Paparusso et.al. | 2406.01814 | null |
2024-04-12 | D2E-An Autonomous Decision-making Dataset involving Driver States and Human Evaluation | Zehong Ke et.al. | 2406.01598 | null |
2024-06-04 | PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning | Yupeng Zheng et.al. | 2406.01587 | null |
2024-06-03 | Learning from Mistakes: a Weakly-supervised Method for Mitigating the Distribution Shift in Autonomous Vehicle Planning | Fazel Arasteh et.al. | 2406.01544 | null |
2024-06-16 | Sensitivity-Informed Augmentation for Robust Segmentation | Laura Zheng et.al. | 2406.01425 | null |
2024-06-03 | Extending Structural Causal Models for Use in Autonomous Embodied Systems | Rhys Howard et.al. | 2406.01384 | link |
2024-06-03 | Convolutional Unscented Kalman Filter for Multi-Object Tracking with Outliers | Shiqi Liu et.al. | 2406.01380 | null |
2024-06-06 | Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation | Enhui Ma et.al. | 2406.01349 | null |
2024-06-03 | REvolve: Reward Evolution with Large Language Models for Autonomous Driving | Rishi Hazra et.al. | 2406.01309 | null |
2024-07-16 | LanEvil: Benchmarking the Robustness of Lane Detection to Environmental Illusions | Tianyuan Zhang et.al. | 2406.00934 | null |
2024-06-02 | A Survey of Deep Learning Based Radar and Vision Fusion for 3D Object Detection in Autonomous Driving | Di Wu et.al. | 2406.00714 | null |
2024-07-16 | SuperGaussian: Repurposing Video Models for 3D Super Resolution | Yuan Shen et.al. | 2406.00609 | null |
2024-06-01 | 2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation | Biao Wu et.al. | 2406.00500 | null |
2024-06-04 | Research on the Application of Computer Vision Based on Deep Learning in Autonomous Driving Technology | Jingyu Zhang et.al. | 2406.00490 | null |
2024-06-01 | Learning Manipulation by Predicting Interaction | Jia Zeng et.al. | 2406.00439 | link |
2024-06-01 | Over-the-Air Collaborative Inference with Feature Differential Privacy | Mohamed Seif et.al. | 2406.00256 | null |
2024-05-31 | Fairness in Autonomous Driving: Towards Understanding Confounding Factors in Object Detection under Challenging Weather | Bimsara Pathiraja et.al. | 2406.00219 | null |
2024-05-31 | Navigating Autonomous Vehicle on Unmarked Roads with Diffusion-Based Motion Prediction and Active Inference | Yufei Huang et.al. | 2406.00211 | null |
2024-05-31 | Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models | Yi Yang et.al. | 2405.20991 | link |
2024-05-31 | Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks | Linlin Yu et.al. | 2405.20986 | null |
2024-05-31 | Robust Stable Spiking Neural Networks | Jianhao Ding et.al. | 2405.20694 | link |
2024-07-05 | HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios | Mingyang Jiang et.al. | 2405.20579 | link |
2024-05-30 | Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning | Davide Corsi et.al. | 2405.20534 | link |
2024-05-30 | OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | Lening Wang et.al. | 2405.20337 | link |
2024-05-30 | $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323 | link |
2024-06-30 | Intrinsic Dynamics-Driven Generalizable Scene Representations for Vision-Oriented Decision-Making Applications | Dayang Liang et.al. | 2405.19736 | link |
2024-05-31 | Autonomous Driving with Spiking Neural Networks | Rui-Jie Zhu et.al. | 2405.19687 | link |
2024-05-31 | SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation | Wenchao Sun et.al. | 2405.19620 | link |
2024-05-29 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models | Tianrun Chen et.al. | 2405.19326 | null |
2024-05-29 | Real-Time Environment Condition Classification for Autonomous Vehicles | Marco Introvigne et.al. | 2405.19305 | link |
2024-05-29 | Conditional Latent ODEs for Motion Prediction in Autonomous Driving | Khang Truong Giang et.al. | 2405.19183 | link |
2024-05-29 | Quantum Optimal Control of Squeezing in Cavity Optomechanics | Anton Halaski et.al. | 2405.19070 | null |
2024-05-29 | A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation | Niclas Vödisch et.al. | 2405.19035 | link |
2024-05-29 | Optimizing Vehicular Networks with Variational Quantum Circuits-based Reinforcement Learning | Zijiang Yan et.al. | 2405.18984 | null |
2024-05-29 | SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving | Yiming Cui et.al. | 2405.18857 | null |
2024-05-29 | LetsMap: Unsupervised Representation Learning for Semantic BEV Mapping | Nikhil Gosala et.al. | 2405.18852 | null |
2024-05-29 | PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram | Sifan Zhou et.al. | 2405.18734 | null |
2024-05-30 | 3D StreetUnveiler with Semantic-Aware 2DGS | Jingwei Xu et.al. | 2405.18416 | null |
2024-05-28 | Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | Yifan Bai et.al. | 2405.18361 | null |
2024-05-28 | Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving | Zhi Zheng et.al. | 2405.18209 | link |
2024-05-28 | MULi-Ev: Maintaining Unperturbed LiDAR-Event Calibration | Mathieu Cocheteux et.al. | 2405.18021 | null |
2024-05-28 | Self-supervised Pre-training for Transferable Multi-modal Perception | Xiaohao Xu et.al. | 2405.17942 | link |
2024-05-28 | Online Analytic Exemplar-Free Continual Learning with Large Models for Imbalanced Autonomous Driving Task | Huiping Zhuang et.al. | 2405.17779 | link |
2024-05-27 | GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang et.al. | 2405.17429 | link |
2024-05-27 | Benchmarking and Improving Bird’s Eye View Perception Robustness in Autonomous Driving | Shaoyuan Xie et.al. | 2405.17426 | link |
2024-05-27 | Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection | Shuai Zeng et.al. | 2405.17422 | link |
2024-05-27 | MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities | Hao Dong et.al. | 2405.17419 | link |
2024-07-22 | Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability | Shenyuan Gao et.al. | 2405.17398 | link |
2024-05-27 | BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction | Zikang Zhou et.al. | 2405.17372 | null |
2024-05-27 | Towards Accurate Ego-lane Identification with Early Time Series Classification | Yuchuan Jin et.al. | 2405.17270 | null |
2024-05-29 | Memorize What Matters: Emergent Scene Decomposition from Multitraverse | Yiming Li et.al. | 2405.17187 | link |
2024-05-27 | DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge | Yifan Mao et.al. | 2405.17102 | null |
2024-05-27 | A Two-Level Stochastic Model for the Lateral Movement of Vehicles Within Their Lane Under Homogeneous Traffic Conditions | Nicole Neis et.al. | 2405.17080 | null |
2024-05-27 | SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving | Avinash Nittur Ramesh et.al. | 2405.17030 | null |
2024-06-24 | Bounding Random Test Set Size with Computational Learning Theory | Neil Walkinshaw et.al. | 2405.17019 | null |
2024-05-27 | Collective Perception Datasets for Autonomous Driving: A Comprehensive Review | Sven Teufel et.al. | 2405.16973 | null |
2024-05-27 | Rigorous Simulation-based Testing for Autonomous Driving Systems – Targeting the Achilles’ Heel of Four Open Autopilots | Changwen Li et.al. | 2405.16914 | link |
2024-05-27 | A re-calibration method for object detection with multi-modal alignment bias in autonomous driving | Zhihang Song et.al. | 2405.16848 | null |
2024-05-25 | Lane Detection using Graph Search and Geometric Constraints for Formula Student Driverless | Ivo Ivanov et.al. | 2405.16369 | link |
2024-05-25 | Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation | Huizhou Chen et.al. | 2405.16099 | null |
2024-05-25 | Uncertainty Measurement of Deep Learning System based on the Convex Hull of Training Sets | Hyekyoung Hwang et.al. | 2405.16082 | null |
2024-05-25 | Risk Scenario Generation for Autonomous Driving Systems based on Causal Bayesian Networks | Jiangnan Zhao et.al. | 2405.16063 | null |
2024-05-25 | DiffuBox: Refining 3D Object Detection with Point Diffusion | Xiangyu Chen et.al. | 2405.16034 | link |
2024-05-24 | SMART: Scalable Multi-agent Real-time Simulation via Next-token Prediction | Wei Wu et.al. | 2405.15677 | link |
2024-05-24 | Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving | Jianbiao Mei et.al. | 2405.15324 | link |
2024-05-24 | 3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving | Boyi Sun et.al. | 2405.15286 | link |
2024-05-24 | Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding | Yuhang Liu et.al. | 2405.15274 | null |
2024-05-24 | Label-efficient Semantic Scene Completion with Scribble Annotations | Song Wang et.al. | 2405.15170 | link |
2024-05-23 | ReachBot Field Tests in a Mojave Desert Lava Tube as a Martian Analog | Tony G. Chen et.al. | 2405.15005 | null |
2024-05-30 | An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models | Jiahao Sun et.al. | 2405.14870 | link |
2024-05-23 | TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving Scenes | Yanping Fu et.al. | 2405.14747 | null |
2024-05-23 | SE3D: A Framework For Saliency Method Evaluation In 3D Imaging | Mariusz Wiśniewski et.al. | 2405.14584 | link |
2024-05-23 | MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes | Ruiyuan Gao et.al. | 2405.14475 | null |
2024-05-24 | RoGS: Large Scale Road Surface Reconstruction based on 2D Gaussian Splatting | Zhiheng Feng et.al. | 2405.14342 | link |
2024-05-23 | NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation | Chaokang Jiang et.al. | 2405.14241 | link |
2024-05-23 | Eidos: Efficient, Imperceptible Adversarial 3D Point Clouds | Hanwei Zhang et.al. | 2405.14210 | null |
2024-05-31 | Awesome Multi-modal Object Tracking | Chunhui Zhang et.al. | 2405.14200 | link |
2024-05-23 | Towards Transferable Attacks Against Vision-LLMs in Autonomous Driving with Typography | Nhat Chung et.al. | 2405.14169 | null |
2024-05-22 | ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles | Jiawei Zhang et.al. | 2405.14062 | link |
2024-06-13 | RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar | Fangqiang Ding et.al. | 2405.14014 | link |
2024-05-22 | TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System | Diogo Lavado et.al. | 2405.13989 | null |
2024-05-22 | Traffic Scenario Logic: A Spatial-Temporal Logic for Modeling and Reasoning of Urban Traffic Scenarios | Ruolin Wang et.al. | 2405.13715 | link |
2024-05-22 | Safe and Personalizable Logical Guidance for Trajectory Planning of Autonomous Driving | Yuejiao Xu et.al. | 2405.13704 | null |
2024-05-22 | HighwayLLM: Decision-Making and Navigation in Highway Driving with RL-Informed Language Model | Mustafa Yildirim et.al. | 2405.13547 | null |
2024-05-22 | Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training | Zhiyuan Wang et.al. | 2405.13445 | null |
2024-05-22 | Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention | Sang-Hyun Lee et.al. | 2405.13345 | null |
2024-05-12 | Large Language Models for Education: A Survey | Hanyi Xu et.al. | 2405.13001 | null |
2024-05-21 | Transparency Distortion Robustness for SOTA Image Segmentation Tasks | Volker Knauthe et.al. | 2405.12864 | null |
2024-05-21 | CLRKDNet: Speeding up Lane Detection with Knowledge Distillation | Weiqing Qi et.al. | 2405.12503 | link |
2024-05-21 | Mutual Information Analysis in Multimodal Learning Systems | Hadi Hadizadeh et.al. | 2405.12456 | null |
2024-06-08 | EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving | Boyi Liu et.al. | 2405.12120 | null |
2024-05-20 | Safe by Design Autonomous Driving Systems | Marius Bozga et.al. | 2405.11995 | null |
2024-05-20 | Tutorial on Silicon Photonics Integrated Platform Fiber Edge Coupling | Sergey S. Avdeev et.al. | 2405.11980 | null |
2024-05-20 | A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation | Sushmita Sarker et.al. | 2405.11903 | null |
2024-05-20 | Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction | Aryan Garg et.al. | 2405.11823 | null |
2024-05-19 | FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention | Ziang Guo et.al. | 2405.11682 | link |
2024-05-19 | Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems | Shengxiang Sun et.al. | 2405.11629 | null |
2024-05-18 | Generalized Multi-Objective Reinforcement Learning with Envelope Updates in URLLC-enabled Vehicular Networks | Zijiang Yan et.al. | 2405.11331 | null |
2024-05-18 | RuleFuser: Injecting Rules in Evidential Networks for Robust Out-of-Distribution Trajectory Prediction | Jay Patrikar et.al. | 2405.11139 | null |
2024-03-21 | Application of Tensorized Neural Networks for Cloud Classification | Alifu Xiafukaiti et.al. | 2405.10946 | null |
2024-05-17 | GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision | Xin Tan et.al. | 2405.10591 | null |
2024-05-17 | Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track | Xiaoshuai Hao et.al. | 2405.10567 | null |
2024-05-28 | NeRO: Neural Road Surface Reconstruction | Ruibo Wang et.al. | 2405.10554 | link |
2024-05-07 | Detecting 5G Signal Jammers Using Spectrograms with Supervised and Unsupervised Learning | Matteo Varotto et.al. | 2405.10331 | null |
2024-05-16 | Towards Consistent and Explainable Motion Prediction using Heterogeneous Graph Attention | Tobias Demmler et.al. | 2405.10134 | null |
2024-05-16 | Cooperative Visual-LiDAR Extrinsic Calibration Technology for Intersection Vehicle-Infrastructure: A review | Xinyu Zhang et.al. | 2405.10132 | null |
2024-05-16 | Infrared Adversarial Car Stickers | Xiaopei Zhu et.al. | 2405.09924 | null |
2024-05-19 | PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features | Xusheng Li et.al. | 2405.09828 | null |
2024-07-08 | Collision Avoidance Metric for 3D Camera Evaluation | Vage Taamazyan et.al. | 2405.09755 | link |
2024-07-05 | UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation | Yachan Guo et.al. | 2405.09682 | null |
2024-03-20 | Mask-based Invisible Backdoor Attacks on Object Detection | Shin Jeong Jin et.al. | 2405.09550 | link |
2024-05-15 | CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving | Dechen Gao et.al. | 2405.09111 | link |
2024-05-20 | Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous Driving | Ross Greer et.al. | 2405.09049 | null |
2024-05-14 | The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks | Ziquan Liu et.al. | 2405.08886 | link |
2024-05-30 | The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition | Lingdong Kong et.al. | 2405.08816 | null |
2024-05-14 | Ambiguous Annotations: When is a Pedestrian not a Pedestrian? | Luisa Schwirten et.al. | 2405.08794 | null |
2024-05-14 | Work-in-Progress: Crash Course: Can (Under Attack) Autonomous Driving Beat Human Drivers? | Francesco Marchiori et.al. | 2405.08466 | null |
2024-05-13 | Equivariant Deep Learning of Mixed-Integer Optimal Control Solutions for Vehicle Decision Making and Motion Planning | Rudolf Reiter et.al. | 2405.08122 | null |
2024-06-05 | AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving | Daniel Bogdoll et.al. | 2405.07865 | link |
2024-05-13 | oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving | Abdul Hannan Khan et.al. | 2405.07698 | null |
2024-05-13 | MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving | Yiqun Duan et.al. | 2405.07573 | null |
2024-05-12 | Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising | Yao Liu et.al. | 2405.07164 | null |
2024-05-11 | Multi-agent Traffic Prediction via Denoised Endpoint Distribution | Yao Liu et.al. | 2405.07041 | null |
2024-05-20 | Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models | Lun Ai et.al. | 2405.06724 | link |
2024-05-10 | Multi-Object Tracking in the Dark | Xinzhe Wang et.al. | 2405.06600 | link |
2024-05-10 | Autonomous Driving with a Deep Dual-Model Solution for Steering and Braking Control | Ana Petra Jukić et.al. | 2405.06473 | null |
2024-05-10 | Efficient End-to-End Detection of 6-DoF Grasps for Robotic Bin Picking | Yushi Liu et.al. | 2405.06336 | null |
2024-05-10 | Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection | Yunqian Fan et.al. | 2405.06264 | null |
2024-05-09 | Probing Multimodal LLMs as World Models for Driving | Shiva Sreeram et.al. | 2405.05956 | link |
2024-05-09 | Co-driver: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes | Ziang Guo et.al. | 2405.05885 | link |
2024-07-01 | Towards Robust Physical-world Backdoor Attacks on Lane Detection | Xinwei Zhang et.al. | 2405.05553 | null |
2024-05-07 | Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared Weights | Soyed Tuhin Ahmed et.al. | 2405.05286 | null |
2024-05-08 | Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving | Lingdong Kong et.al. | 2405.05258 | link |
2024-05-18 | A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective | Huaiyuan Xu et.al. | 2405.05173 | link |
2024-05-08 | DenserRadar: A 4D millimeter-wave radar point cloud detector based on dense LiDAR point clouds | Zeyu Han et.al. | 2405.05131 | null |
2024-05-08 | Novel Actor-Critic Algorithm for Robust Decision Making of CAV under Delays and Loss of V2X Data | Zine el abidine Kherroubi et.al. | 2405.05072 | null |
2024-05-08 | Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models | Zhengxing Lan et.al. | 2405.04909 | null |
2024-05-07 | TorchDriveEnv: A Reinforcement Learning Benchmark for Autonomous Driving with Reactive, Realistic, and Diverse Non-Playable Characters | Jonathan Wilder Lavington et.al. | 2405.04491 | null |
2024-05-07 | DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving | Chen Min et.al. | 2405.04390 | null |
2024-05-07 | Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map | Yuxuan Xia et.al. | 2405.04290 | null |
2024-06-17 | pFedLVM: A Large Vision Model (LVM)-Driven and Latent Feature-Based Personalized Federated Learning Framework in Autonomous Driving | Wei-Bin Kou et.al. | 2405.04146 | null |
2024-05-07 | ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios | Dingrui Wang et.al. | 2405.04100 | null |
2024-05-07 | Feature Map Convergence Evaluation for Functional Module | Ludan Zhang et.al. | 2405.04041 | null |
2024-05-07 | Deep Event-based Object Detection in Autonomous Driving: A Survey | Bingquan Zhou et.al. | 2405.03995 | null |
2024-05-07 | Unified End-to-End V2X Cooperative Autonomous Driving | Zhiwei Li et.al. | 2405.03971 | null |
2024-05-07 | Role of Sensing and Computer Vision in 6G Wireless Communications | Seungnyun Kim et.al. | 2405.03945 | link |
2024-05-07 | Roadside Units Assisted Localized Automated Vehicle Maneuvering: An Offline Reinforcement Learning Approach | Kui Wang et.al. | 2405.03935 | null |
2024-05-06 | BadFusion: 2D-Oriented Backdoor Attacks against 3D Object Detection | Saket S. Chaturvedi et.al. | 2405.03884 | null |
2024-05-06 | SocialFormer: Social Interaction Modeling with Edge-enhanced Heterogeneous Graph Transformers for Trajectory Prediction | Zixu Wang et.al. | 2405.03809 | null |
2024-05-06 | UniGen: Unified Modeling of Initial Agent States and Trajectories for Generating Autonomous Driving Scenarios | Reza Mahjourian et.al. | 2405.03807 | null |
2024-05-06 | Language-Image Models with 3D Understanding | Jang Hyun Cho et.al. | 2405.03685 | null |
2024-05-06 | RoboCar: A Rapidly Deployable Open-Source Platform for Autonomous Driving Research | Mehdi Testouri et.al. | 2405.03572 | link |
2024-05-06 | Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond | Zheng Zhu et.al. | 2405.03520 | link |
2024-05-05 | SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection | Kassaw Abraham Mulat et.al. | 2405.02906 | null |
2024-05-04 | Accelerating Autonomy: Insights from Pro Racers in the Era of Autonomous Racing - An Expert Interview Study | Frederik Werner et.al. | 2405.02620 | link |
2024-05-04 | Vision-based 3D occupancy prediction in autonomous driving: a review and outlook | Yanan Zhang et.al. | 2405.02595 | link |
2024-05-03 | Characterized Diffusion and Spatial-Temporal Interaction Network for Trajectory Prediction in Autonomous Driving | Haicheng Liao et.al. | 2405.02145 | null |
2024-05-03 | Obstacle Avoidance of Autonomous Vehicles: An LPVMPC with Scheduling Trust Region | Maryam Nezami et.al. | 2405.02030 | null |
2024-05-03 | DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model | Peijin Jia et.al. | 2405.02008 | null |
2024-05-03 | M ${^2}$ Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation | Yingshuang Zou et.al. | 2405.02004 | null |
2024-05-02 | Language-Enhanced Latent Representations for Out-of-Distribution Detection in Autonomous Driving | Zhenjiang Mao et.al. | 2405.01691 | null |
2024-05-02 | Multi-Space Alignments Towards Universal LiDAR Segmentation | Youquan Liu et.al. | 2405.01538 | link |
2024-05-02 | OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning | Shihao Wang et.al. | 2405.01533 | link |
2024-04-12 | A Review of Reward Functions for Reinforcement Learning in the context of Autonomous Driving | Ahmed Abouelazm et.al. | 2405.01440 | null |
2024-03-21 | Analysis of a Modular Autonomous Driving Architecture: The Top Submission to CARLA Leaderboard 2.0 Challenge | Weize Zhang et.al. | 2405.01394 | null |
2024-05-02 | An Advanced Framework for Ultra-Realistic Simulation and Digital Twinning for Autonomous Vehicles | Yuankai He et.al. | 2405.01328 | null |
2024-05-02 | MFTraj: Map-Free, Behavior-Driven Trajectory Prediction for Autonomous Driving | Haicheng Liao et.al. | 2405.01266 | null |
2024-05-02 | A Survey on Semantic Communication Networks: Architecture, Security, and Privacy | Shaolong Guo et.al. | 2405.01221 | null |
2024-05-02 | Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection | Ahmad Khalil et.al. | 2405.01108 | link |
2024-05-02 | Poisoning Attacks on Federated Learning for Autonomous Driving | Sonakshi Garg et.al. | 2405.01073 | null |
2024-05-04 | LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun et.al. | 2405.00900 | null |
2024-05-01 | ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties | Jiahui Li et.al. | 2405.00797 | link |
2024-05-01 | Lane Segmentation Refinement with Diffusion Models | Antonio Ruiz et.al. | 2405.00620 | null |
2024-05-31 | GAD-Generative Learning for HD Map-Free Autonomous Driving | Weijian Sun et.al. | 2405.00515 | null |
2024-05-01 | On the Relevance of Byzantine Robust Optimization Against Data Poisoning | Sadegh Farhadkhani et.al. | 2405.00491 | null |
2024-05-01 | RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models | Mohamed Manzour Hussien et.al. | 2405.00449 | null |
2024-05-01 | Dual-Role AoI-based Incentive Mechanism for HD map Crowdsourcing | Wentao Ye et.al. | 2405.00353 | null |
2024-05-05 | Enhance Planning with Physics-informed Safety Controller for End-to-end Autonomous Driving | Hang Zhou et.al. | 2405.00316 | null |
2024-04-30 | SemVecNet: Generalizable Vector Map Generation for Arbitrary Sensor Configurations | Narayanan Elavathur Ranganatha et.al. | 2405.00250 | link |
2024-04-30 | Guiding Attention in End-to-End Driving Models | Diego Porres et.al. | 2405.00242 | link |
2024-04-30 | STT: Stateful Tracking with Transformers for Autonomous Driving | Longlong Jing et.al. | 2405.00236 | null |
2024-04-30 | Comparing Motion Distortion Between Vehicle Field Deployments | Nicolas Samson et.al. | 2405.00189 | null |
2024-04-30 | Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection | Zhanwei Zhang et.al. | 2404.19384 | null |
2024-07-01 | SemanticFormer: Holistic and Semantic Traffic Scene Representation for Trajectory Prediction using Knowledge Graphs | Zhigang Sun et.al. | 2404.19379 | link |
2024-04-30 | G2LTraj: A Global-to-Local Generation Approach for Trajectory Prediction | Zhanwei Zhang et.al. | 2404.19330 | link |
2024-05-05 | Multimodal Fusion on Low-quality Data: A Comprehensive Survey | Qingyang Zhang et.al. | 2404.18947 | null |
2024-05-22 | PlanNetX: Learning an Efficient Neural Network Planner from MPC for Longitudinal Control | Jasper Hoffmann et.al. | 2404.18863 | null |
2024-04-29 | Safe Reach Set Computation via Neural Barrier Certificates | Alessandro Abate et.al. | 2404.18813 | null |
2024-04-29 | Uncertainty-boosted Robust Video Activity Anticipation | Zhaobo Qi et.al. | 2404.18648 | link |
2024-04-29 | Assessing Quality Metrics for Neural Reality Gap Input Mitigation in Autonomous Driving Testing | Stefano Carlo Lambertenghi et.al. | 2404.18577 | link |
2024-04-29 | Predicting Safety Misbehaviours in Autonomous Driving Systems using Uncertainty Quantification | Ruben Grewal et.al. | 2404.18573 | link |
2024-04-29 | MRIC: Model-Based Reinforcement-Imitation Learning with Mixture-of-Codebooks for Autonomous Driving Simulation | Baotian He et.al. | 2404.18464 | null |
2024-04-29 | $ν$ -DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction | Yunxuan Mao et.al. | 2404.18439 | null |
2024-04-28 | RadSimReal: Bridging the Gap Between Synthetic and Real Data in Radar Object Detection With Simulation | Oded Bialer et.al. | 2404.18150 | null |
2024-04-27 | BoostRad: Enhancing Object Detection by Boosting Radar Reflections | Yuval Haitman et.al. | 2404.17861 | null |
2024-04-27 | Motion planning for off-road autonomous driving based on human-like cognition and weight adaptation | Yuchun Wang et.al. | 2404.17820 | null |
2024-06-19 | CLFT: Camera-LiDAR Fusion Transformer for Semantic Segmentation in Autonomous Driving | Junyi Gu et.al. | 2404.17793 | link |
2024-04-26 | CoCar NextGen: a Multi-Purpose Platform for Connected Autonomous Driving Research | Marc Heinrich et.al. | 2404.17550 | null |
2024-04-26 | A Cognitive-Driven Trajectory Prediction Model for Autonomous Driving in Mixed Autonomy Environment | Haicheng Liao et.al. | 2404.17520 | null |
2024-04-26 | Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection | Moussa Kassem Sbeyti et.al. | 2404.17427 | link |
2024-04-26 | On the Road to Clarity: Exploring Explainable AI for World Models in a Driver Assistance System | Mohamed Roshdi et.al. | 2404.17350 | null |
2024-04-26 | Scene-Extrapolation: Generating Interactive Traffic Scenarios | Maximilian Zipfl et.al. | 2404.17224 | null |
2024-04-26 | Beyond Imitation: A Life-long Policy Learning Framework for Path Tracking Control of Autonomous Driving | C. Gong et.al. | 2404.17198 | null |
2024-04-25 | Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey | Minrui Xu et.al. | 2404.16356 | null |
2024-04-29 | A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Yifan Zhao et.al. | 2404.16266 | link |
2024-04-28 | A Survey on Intermediate Fusion Methods for Collaborative Perception Categorized by Real World Challenges | Melih Yazgan et.al. | 2404.16139 | null |
2024-04-05 | Using Automated Vehicle Data as a Fitness Tracker for Sustainability | Xia Wang et.al. | 2404.16046 | null |
2024-04-24 | Learning Car-Following Behaviors Using Bayesian Matrix Normal Mixture Regression | Chengyuan Zhang et.al. | 2404.16023 | null |
2024-04-23 | OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving | Guoqing Wang et.al. | 2404.15014 | null |
2024-04-23 | LaneCorrect: Self-supervised Lane Detection | Ming Nie et.al. | 2404.14671 | null |
2024-04-22 | PLUTO: Pushing the Limit of Imitation Learning-based Planning for Autonomous Driving | Jie Cheng et.al. | 2404.14327 | null |
2024-04-22 | Localization Based on MIMO Backscattering from Retro-Directive Antenna Arrays | Marina Lotti et.al. | 2404.14206 | null |
2024-04-22 | PointDifformer: Robust Point Cloud Registration With Neural Diffusion and Transformer | Rui She et.al. | 2404.14034 | null |
2024-06-12 | OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks | Sophia Sirko-Galouchenko et.al. | 2404.14027 | link |
2024-04-22 | Collaborative Perception Datasets in Autonomous Driving: A Survey | Melih Yazgan et.al. | 2404.14022 | null |
2024-05-05 | How do LLMs Support Deep Learning Testing? A Comprehensive Study Through the Lens of Image Mutation | Liwen Wang et.al. | 2404.13945 | null |
2024-04-26 | Neural Radiance Field in Autonomous Driving: A Survey | Lei He et.al. | 2404.13816 | null |
2024-04-21 | Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving | Shuyao Shi et.al. | 2404.13786 | null |
2024-04-27 | FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving | Ganesh Sistu et.al. | 2404.13443 | null |
2024-04-20 | Social Force Embedded Mixed Graph Convolutional Network for Multi-class Trajectory Prediction | Quancheng Du et.al. | 2404.13378 | null |
2024-04-19 | BACS: Background Aware Continual Semantic Segmentation | Mostafa ElAraby et.al. | 2404.13148 | link |
2024-04-22 | Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models | Zhenyang Ni et.al. | 2404.12916 | link |
2024-04-19 | FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving | Xingtai Gui et.al. | 2404.12867 | link |
2024-06-18 | Language-Driven Active Learning for Diverse Open-Set 3D Object Detection | Ross Greer et.al. | 2404.12856 | link |
2024-04-19 | A Point-Based Approach to Efficient LiDAR Multi-Task Perception | Christopher Lang et.al. | 2404.12798 | null |
2024-04-19 | Camera Agnostic Two-Head Network for Ego-Lane Inference | Chaehyeon Song et.al. | 2404.12770 | null |
2024-04-19 | A Containerized Microservice Architecture for a ROS 2 Autonomous Driving Software: An End-to-End Latency Evaluation | Tobias Betz et.al. | 2404.12683 | null |
2024-04-19 | Dragtraffic: A Non-Expert Interactive and Point-Based Controllable Traffic Scene Generation Framework | Sheng Wang et.al. | 2404.12624 | null |
2024-04-30 | TrACT: A Training Dynamics Aware Contrastive Learning Framework for Long-tail Trajectory Prediction | Junrui Zhang et.al. | 2404.12538 | null |
2024-04-18 | SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation | Mykola Lavreniuk et.al. | 2404.12501 | link |
2024-04-18 | Reducing Bias in Pre-trained Models by Tuning while Penalizing Change | Niklas Penzel et.al. | 2404.12292 | null |
2024-04-18 | An Online Spatial-Temporal Graph Trajectory Planner for Autonomous Vehicles | Jilan Samiuddin et.al. | 2404.12256 | null |
2024-04-18 | Stability Certificates for Receding Horizon Games | Sophie Hall et.al. | 2404.12165 | null |
2024-04-18 | S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles | Xiao Wang et.al. | 2404.11946 | null |
2024-04-17 | TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation | Thomas Monninger et.al. | 2404.11803 | null |
2024-04-17 | Multimodal 3D Object Detection on Unseen Domains | Deepti Hegde et.al. | 2404.11764 | null |
2024-04-17 | Exploring DNN Robustness Against Adversarial Attacks Using Approximate Multipliers | Mohammad Javad Askarizadeh et.al. | 2404.11665 | null |
2024-04-17 | VG4D: Vision-Language Model Goes 4D Video Recognition | Zhichao Deng et.al. | 2404.11605 | link |
2024-04-18 | SERENE: A Collusion Resilient Replication-based Verification Framework | Amir Esmaeili et.al. | 2404.11410 | null |
2024-04-17 | Detector Collapse: Backdooring Object Detection to Catastrophic Overload or Blindness | Hangtao Zhang et.al. | 2404.11357 | null |
2024-04-19 | KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized Intersections | Chuheng Wei et.al. | 2404.11181 | link |
2024-04-17 | D-Aug: Enhancing Data Augmentation for Dynamic LiDAR Scenes | Jiaxing Zhao et.al. | 2404.11127 | null |
2024-04-17 | Sky-GVIO: an enhanced GNSS/INS/Vision navigation with FCN-based sky-segmentation in urban canyon | Jingrong Wang et.al. | 2404.11070 | link |
2024-04-17 | How to deal with glare for improved perception of Autonomous Vehicles | Muhammad Z. Alam et.al. | 2404.10992 | null |
2024-04-18 | End-To-End Training and Testing Gamification Framework to Learn Human Highway Driving | Satya R. Jaladi et.al. | 2404.10849 | null |
2024-04-12 | PASA: Attack Agnostic Unsupervised Adversarial Detection using Prediction & Attribution Sensitivity Analysis | Dipkamal Bhusal et.al. | 2404.10789 | link |
2024-04-16 | Gaussian Opacity Fields: Efficient and Compact Surface Reconstruction in Unbounded Scenes | Zehao Yu et.al. | 2404.10772 | null |
2024-04-16 | N-Agent Ad Hoc Teamwork | Caroline Wang et.al. | 2404.10740 | link |
2024-04-16 | Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases | Yanze Li et.al. | 2404.10595 | null |
2024-04-19 | SEVD: Synthetic Event-based Vision Dataset for Ego and Fixed Traffic Perception | Manideep Reddy Aliminati et.al. | 2404.10540 | link |
2024-04-16 | LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System | Shijing Hu et.al. | 2404.10498 | null |
2024-04-16 | PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network | Yuning Wang et.al. | 2404.10263 | null |
2024-04-15 | Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video | Hongchi Xia et.al. | 2404.09833 | null |
2024-04-15 | Sampling for Model Predictive Trajectory Planning in Autonomous Driving using Normalizing Flows | Georg Rabenstein et.al. | 2404.09657 | null |
2024-04-15 | SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction | Pin Tang et.al. | 2404.09502 | null |
2024-04-15 | Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System | Genjia Liu et.al. | 2404.09496 | link |
2024-04-15 | VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection | Bonan Ding et.al. | 2404.09431 | null |
2024-04-14 | SyntStereo2Real: Edge-Aware GAN for Remote Sensing Image-to-Image Translation while Maintaining Stereo Constraint | Vasudha Venkatesan et.al. | 2404.09277 | null |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-04-14 | Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration | Yanhao Zhang et.al. | 2404.09169 | link |
2024-05-25 | Intention-Aware Control Based on Belief-Space Specifications and Stochastic Expansion | Zengjie Zhang et.al. | 2404.09037 | link |
2024-04-12 | WROOM: An Autonomous Driving Approach for Off-Road Navigation | Dvij Kalaria et.al. | 2404.08855 | link |
2024-04-12 | FusionPortableV2: A Unified Multi-Sensor Dataset for Generalized SLAM Across Diverse Platforms and Scalable Environments | Hexiang Wei et.al. | 2404.08563 | null |
2024-04-12 | Maturity of Vehicle Digital Twins: From Monitoring to Enabling Autonomous Driving | Robert Klar et.al. | 2404.08438 | null |
2024-04-12 | Transfer Learning Study of Motion Transformer-based Trajectory Predictions | Lars Ullrich et.al. | 2404.08271 | null |
2024-04-12 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
2024-04-11 | Real-Time Detection and Analysis of Vehicles and Pedestrians using Deep Learning | Md Nahid Sadik et.al. | 2404.08081 | null |
2024-04-11 | VeTraSS: Vehicle Trajectory Similarity Search Through Graph Modeling and Representation Learning | Ming Cheng et.al. | 2404.08021 | null |
2024-04-09 | GRANP: A Graph Recurrent Attentive Neural Process Model for Vehicle Trajectory Prediction | Yuhao Luo et.al. | 2404.08004 | link |
2024-04-11 | GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh | Jing Wen et.al. | 2404.07991 | null |
2024-04-11 | Sparse Laneformer | Ji Liu et.al. | 2404.07821 | null |
2024-04-23 | NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | William Ljungbergh et.al. | 2404.07762 | link |
2024-04-11 | Homography Guided Temporal Fusion for Road Line and Marking Segmentation | Shan Wang et.al. | 2404.07626 | link |
2024-04-11 | Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios? | Marcel Hallgarten et.al. | 2404.07569 | link |
2024-04-11 | PillarTrack: Redesigning Pillar-based Transformer Network for Single Object Tracking on Point Clouds | Weisheng Xu et.al. | 2404.07495 | link |
2024-04-10 | Identification of Fine-grained Systematic Errors via Controlled Scene Generation | Valentyn Boreiko et.al. | 2404.07045 | null |
2024-04-10 | SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving | Diankun Zhang et.al. | 2404.06892 | null |
2024-04-19 | Monocular 3D lane detection for Autonomous Driving: Recent Achievements, Challenges, and Outlooks | Fulong Ma et.al. | 2404.06860 | null |
2024-04-10 | Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data | Aakash Kumar et.al. | 2404.06715 | null |
2024-05-10 | SAM-I-Am: Semantic Boosting for Zero-shot Atomic-Scale Electron Micrograph Segmentation | Waqwoya Abebe et.al. | 2404.06638 | link |
2024-04-20 | RoadBEV: Road Surface Reconstruction in Bird’s Eye View | Tong Zhao et.al. | 2404.06605 | link |
2024-04-11 | HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention | Xiaolong Tang et.al. | 2404.06351 | link |
2024-04-21 | AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning | Senkang Hu et.al. | 2404.06345 | null |
2024-04-09 | Label-Efficient 3D Object Detection For Road-Side Units | Minh-Quan Dao et.al. | 2404.06256 | null |
2024-04-09 | Towards Autonomous Driving with Small-Scale Cars: A Survey of Recent Development | Dianzhao Li et.al. | 2404.06229 | null |
2024-04-09 | Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation | Mariella Dreissig et.al. | 2404.06124 | null |
2024-04-09 | Passive None-line-of-sight imaging with arbitrary scene condition and detection pattern in small amount of prior data | Yunting Gui et.al. | 2404.06015 | null |
2024-04-08 | Residual Chain Prediction for Autonomous Driving Path Planning | Liguo Zhou et.al. | 2404.05423 | null |
2024-04-08 | Human Detection from 4D Radar Data in Low-Visibility Field Conditions | Mikael Skog et.al. | 2404.05307 | null |
2024-04-08 | Detecting Every Object from Events | Haitian Zhang et.al. | 2404.05285 | link |
2024-04-08 | MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues | Xiahan Chen et.al. | 2404.05280 | null |
2024-04-08 | UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather | Haimei Zhao et.al. | 2404.05145 | null |
2024-04-09 | Better Monocular 3D Detectors with LiDAR from the Past | Yurong You et.al. | 2404.05139 | link |
2024-04-07 | MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection | Hou-I Liu et.al. | 2404.04910 | link |
2024-04-07 | Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs | Yiqun Duan et.al. | 2404.04869 | null |
2024-04-07 | Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving | Jinlong Li et.al. | 2404.04804 | null |
2024-05-06 | HawkDrive: A Transformer-driven Visual Perception System for Autonomous Driving in Night Scene | Ziang Guo et.al. | 2404.04653 | link |
2024-05-22 | Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Prediction | Jingyi Pan et.al. | 2404.04561 | null |
2024-04-06 | Automated Lane Change Behavior Prediction and Environmental Perception Based on SLAM Technology | Han Lei et.al. | 2404.04492 | null |
2024-04-05 | Physical Property Understanding from Language-Embedded Feature Fields | Albert J. Zhai et.al. | 2404.04242 | null |
2024-04-05 | Exploring Probabilistic Models for Semi-supervised Learning | Jianfeng Wang et.al. | 2404.04199 | null |
2024-04-05 | You Can Use But Cannot Recognize: Preserving Visual Privacy in Deep Neural Networks | Qiushi Li et.al. | 2404.04098 | null |
2024-05-13 | Scaling Motion Forecasting Models with Ensemble Distillation | Scott Ettinger et.al. | 2404.03843 | null |
2024-04-04 | Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation | Elham Amin Mansour et.al. | 2404.03799 | null |
2024-04-04 | Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning | Spyridon Chavlis et.al. | 2404.03708 | null |
2024-04-04 | Is CLIP the main roadblock for fine-grained open-world perception? | Lorenzo Bianchi et.al. | 2404.03539 | link |
2024-04-04 | Materials for High Temperature Digital Electronics | Dhiren K. Pradhan et.al. | 2404.03510 | null |
2024-04-05 | A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data | Iqra Bano et.al. | 2404.03493 | null |
2024-04-08 | Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation | Asad Ali Shahid et.al. | 2404.03336 | null |
2024-05-06 | CORP: A Multi-Modal Dataset for Campus-Oriented Roadside Perception Tasks | Beibei Wang et.al. | 2404.03191 | null |
2024-04-03 | Ego-Motion Aware Target Prediction Module for Robust Multi-Object Tracking | Navid Mahdian et.al. | 2404.03110 | link |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903 | link |
2024-04-03 | One Stack to Rule them All: To Drive Automated Vehicles, and Reach for the 4th level | Sven Ochs et.al. | 2404.02645 | null |
2024-05-20 | HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Zhongyu Xia et.al. | 2404.02517 | link |
2024-04-03 | AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset | Dongsu Lee et.al. | 2404.02429 | null |
2024-04-03 | TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes | Cheng Zhao et.al. | 2404.02410 | null |
2024-04-02 | OFMPNet: Deep End-to-End Model for Occupancy and Flow Prediction in Urban Environment | Youshaa Murhij et.al. | 2404.02263 | link |
2024-04-02 | OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising | Haichao Zhang et.al. | 2404.02227 | link |
2024-04-01 | Versatile Navigation under Partial Observability via Value-guided Diffusion Policy | Gengyu Zhang et.al. | 2404.02176 | null |
2024-04-02 | Risk-Aware Real-Time Task Allocation for Stochastic Multi-Agent Systems under STL Specifications | Maico H. W. Engelaar et.al. | 2404.02111 | null |
2024-04-02 | WcDT: World-centric Diffusion Transformer for Traffic Scene Generation | Chen Yang et.al. | 2404.02082 | link |
2024-04-02 | Heuristic Optimization of Amplifier Reconfiguration Process for Autonomous Driving Optical Networks | Qizhi Qiu et.al. | 2404.01949 | null |
2024-04-02 | Improving Bird’s Eye View Semantic Segmentation by Task Decomposition | Tianhao Zhao et.al. | 2404.01925 | link |
2024-04-02 | Analyzing the Single Event Upset Vulnerability of Binarized Neural Networks on SRAM FPGAs | Ioanna Souvatzoglou et.al. | 2404.01757 | null |
2024-04-02 | Exploring Latent Pathways: Enhancing the Interpretability of Autonomous Driving with a Variational Autoencoder | Anass Bairouk et.al. | 2404.01750 | null |
2024-05-12 | Boosting Visual Recognition in Real-world Degradations via Unsupervised Feature Enhancement Module with Deep Channel Prior | Zhanwen Liu et.al. | 2404.01703 | link |
2024-04-02 | Learning Temporal Cues by Predicting Objects Move for Multi-camera 3D Object Detection | Seokha Moon et.al. | 2404.01580 | null |
2024-04-02 | Perfecting Periodic Trajectory Tracking: Model Predictive Control with a Periodic Observer ( $Π$ -MPC) | Luis Pabon et.al. | 2404.01550 | link |
2024-04-02 | Are Doppler Velocity Measurements Useful for Spinning Radar Odometry? | Daniil Lisus et.al. | 2404.01537 | null |
2024-04-01 | ML KPI Prediction in 5G and B5G Networks | Nguyen Phuc Tran et.al. | 2404.01530 | null |
2024-04-01 | QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving | Sourav Biswas et.al. | 2404.01486 | null |
2024-05-25 | BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks | Zhiyuan Cheng et.al. | 2404.00924 | link |
2024-04-01 | An Integrating Comprehensive Trajectory Prediction with Risk Potential Field Method for Autonomous Driving | Kailu Wu et.al. | 2404.00893 | null |
2024-03-31 | Adapting to Length Shift: FlexiLength Network for Trajectory Prediction | Yi Xu et.al. | 2404.00742 | null |
2024-04-20 | End-to-End Autonomous Driving through V2X Cooperation | Haibao Yu et.al. | 2404.00717 | link |
2024-03-31 | Weak-to-Strong 3D Object Detection with X-Ray Distillation | Alexander Gambashidze et.al. | 2404.00679 | link |
2024-03-31 | Denoising Low-dose Images Using Deep Learning of Time Series Images | Yang Shao et.al. | 2404.00510 | null |
2024-03-19 | Advancing Explainable Autonomous Vehicle Systems: A Comprehensive Review and Research Roadmap | Sule Tekkesinoglu et.al. | 2404.00019 | null |
2024-03-29 | InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds | Zhiwen Fan et.al. | 2403.20309 | link |
2024-03-29 | LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving | Pranjal Paul et.al. | 2403.20116 | null |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Zhuopeng Li et.al. | 2403.20032 | null |
2024-03-29 | PLoc: A New Evaluation Criterion Based on Physical Location for Autonomous Driving Datasets | Ruining Yang et.al. | 2403.19893 | null |
2024-05-09 | Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving | Akshay Gopalkrishnan et.al. | 2403.19838 | link |
2024-03-28 | Human-compatible driving partners through data-regularized self-play reinforcement learning | Daphne Cornelisse et.al. | 2403.19648 | link |
2024-04-25 | Learning Sampling Distribution and Safety Filter for Autonomous Driving with VQ-VAE and Differentiable Optimization | Simon Idoko et.al. | 2403.19461 | link |
2024-03-28 | SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control | Binyuan Huang et.al. | 2403.19438 | null |
2024-03-28 | Learning a Formally Verified Control Barrier Function in Stochastic Environment | Manan Tayal et.al. | 2403.19332 | link |
2024-03-28 | CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation | Lingjun Zhao et.al. | 2403.19104 | null |
2024-04-07 | GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving | Yunpeng Zhang et.al. | 2403.19098 | link |
2024-03-27 | GENESIS-RL: GEnerating Natural Edge-cases with Systematic Integration of Safety considerations and Reinforcement Learning | Hsin-Jung Yang et.al. | 2403.19062 | null |
2024-03-27 | Ensuring Safe Autonomy: Navigating the Future of Autonomous Vehicles | Patrick Wolf et.al. | 2403.19006 | null |
2024-03-27 | LORD: Large Models based Opposite Reward Design for Autonomous Driving | Xin Ye et.al. | 2403.18965 | null |
2024-05-13 | Sampling-Based Motion Planning with Online Racing Line Generation for Autonomous Driving on Three-Dimensional Race Tracks | Levent Ögretmen et.al. | 2403.18643 | link |
2024-03-27 | Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving | Xuemin Hu et.al. | 2403.18209 | null |
2024-03-27 | Road Obstacle Detection based on Unknown Objectness Scores | Chihiro Noguchi et.al. | 2403.18207 | null |
2024-03-26 | SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models | Kashyap Chitta et.al. | 2403.17933 | link |
2024-03-26 | 2D Gaussian Splatting for Geometrically Accurate Radiance Fields | Binbin Huang et.al. | 2403.17888 | link |
2024-03-26 | Low-Latency Neural Stereo Streaming | Qiqi Hou et.al. | 2403.17879 | null |
2024-03-26 | Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving | Axel Brunnbauer et.al. | 2403.17805 | link |
2024-05-08 | LiDAR-Based Crop Row Detection Algorithm for Over-Canopy Autonomous Navigation in Agriculture Fields | Ruiji Liu et.al. | 2403.17774 | link |
2024-03-28 | UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain Gaps | Maciej K Wozniak et.al. | 2403.17633 | link |
2024-03-26 | AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving | Mingfu Liang et.al. | 2403.17373 | null |
2024-03-27 | Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving | Junhao Zheng et.al. | 2403.17301 | link |
2024-03-25 | SynFog: A Photo-realistic Synthetic Fog Dataset based on End-to-end Imaging Simulation for Advancing Real-World Defogging in Autonomous Driving | Yiming Xie et.al. | 2403.17094 | null |
2024-03-25 | TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation | Quang-Huy Che et.al. | 2403.16958 | link |
2024-03-25 | Exploring Communication Technologies, Standards, and Challenges in Electrified Vehicle Charging | Xiang Ma et.al. | 2403.16830 | null |
2024-05-07 | Synapse: Learning Preferential Concepts from Visual Demonstrations | Sadanand Modak et.al. | 2403.16689 | null |
2024-03-25 | RCBEVDet: Radar-camera Fusion in Bird’s Eye View for 3D Object Detection | Zhiwei Lin et.al. | 2403.16440 | link |
2024-03-25 | Producing and Leveraging Online Map Uncertainty in Trajectory Prediction | Xunjiang Gu et.al. | 2403.16439 | link |
2024-03-25 | ProIn: Learning to Predict Trajectory Based on Progressive Interactions for Autonomous Driving | Yinke Dong et.al. | 2403.16374 | null |
2024-03-25 | Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks | Madhumitha Sakthi et.al. | 2403.16338 | null |
2024-03-24 | Engineering Safety Requirements for Autonomous Driving with Large Language Models | Ali Nouri et.al. | 2403.16289 | null |
2024-03-24 | Interference Management for Integrated Sensing and Communication Systems: A Survey | Yangyang Niu et.al. | 2403.16189 | null |
2024-03-24 | Self-Supervised Multi-Frame Neural Scene Flow | Dongrui Liu et.al. | 2403.16116 | null |
2024-04-15 | Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap | Carl Lindström et.al. | 2403.16092 | null |
2024-03-23 | iA $^$: Imperative Learning-based A$^$ Search for Pathfinding | Xiangyu Chen et.al. | 2403.15870 | null |
2024-03-23 | Spatio-Temporal Bi-directional Cross-frame Memory for Distractor Filtering Point Cloud Single Object Tracking | Shaoyu Sun et.al. | 2403.15831 | null |
2024-03-23 | DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation | Mu-Yi Shen et.al. | 2403.15791 | link |
2024-03-23 | PNAS-MOT: Multi-Modal Object Tracking with Pareto Neural Architecture Search | Chensheng Peng et.al. | 2403.15712 | link |
2024-03-22 | Autonomous Driving With Perception Uncertainties: Deep-Ensemble Based Adaptive Cruise Control | Xiao Li et.al. | 2403.15577 | null |
2024-03-20 | EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union | Brian Hsuan-Cheng Liao et.al. | 2403.15474 | null |
2024-03-26 | Metasurface-Enabled Multifunctional Single-Frequency Sensors without External Power | Masaya Tashiro et.al. | 2403.15427 | null |
2024-03-22 | Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks | Aqeel Anwar et.al. | 2403.15370 | null |
2024-03-22 | CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking | Nicolas Baumann et.al. | 2403.15313 | link |
2024-03-22 | IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Junbo Yin et.al. | 2403.15241 | link |
2024-03-22 | Learning from Visual Demonstrations through Differentiable Nonlinear MPC for Personalized Autonomous Driving | Flavia Sofia Acerbo et.al. | 2403.15102 | null |
2024-03-22 | Tri-Perspective View Decomposition for Geometry-Aware Depth Completion | Zhiqiang Yan et.al. | 2403.15008 | null |
2024-03-22 | Unifying Lane-Level Traffic Prediction from a Graph Structural Perspective: Benchmark and Baseline | Shuhao Li et.al. | 2403.14941 | link |
2024-03-21 | MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images | Yuedong Chen et.al. | 2403.14627 | link |
2024-03-21 | SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field | Lizhe Liu et.al. | 2403.14366 | null |
2024-05-10 | Credit vs. Discount-Based Congestion Pricing: A Comparison Study | Chih-Yuan Chiu et.al. | 2403.13923 | null |
2024-03-20 | Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study | Luca Giamattei et.al. | 2403.13729 | null |
2024-03-21 | AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving | Xiaosong Jia et.al. | 2403.13331 | null |
2024-03-21 | Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations | Kewei Wang et.al. | 2403.13261 | link |
2024-03-20 | A Rule-Compliance Path Planner for Lane-Merge Scenarios Based on Responsibility-Sensitive Safety | Pengfei Lin et.al. | 2403.13251 | null |
2024-03-19 | TAPTR: Tracking Any Point with Transformers as Detection | Hongyang Li et.al. | 2403.13042 | null |
2024-03-19 | HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting | Hongyu Zhou et.al. | 2403.12722 | null |
2024-03-19 | M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving | Dongyang Xu et.al. | 2403.12552 | null |
2024-05-07 | Safety Implications of Explainable Artificial Intelligence in End-to-End Autonomous Driving | Shahin Atakishiyev et.al. | 2403.12176 | null |
2024-03-18 | HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation | Ce Zhang et.al. | 2403.12033 | link |
2024-03-18 | Informed Spectral Normalized Gaussian Processes for Trajectory Prediction | Christian Schlauch et.al. | 2403.11966 | null |
2024-04-10 | GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2403.11848 | null |
2024-03-18 | EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding | Wenhua Wu et.al. | 2403.11789 | null |
2024-03-18 | TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction | Ali Asghar Sharifi et.al. | 2403.11695 | null |
2024-03-18 | SSAP: A Shape-Sensitive Adversarial Patch for Comprehensive Disruption of Monocular Depth Estimation in Autonomous Navigation Applications | Amira Guesmi et.al. | 2403.11515 | null |
2024-03-18 | MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception | Thien-Minh Nguyen et.al. | 2403.11496 | null |
2024-03-17 | Driving Style Alignment for LLM-powered Driver Agent | Ruoxuan Yang et.al. | 2403.11368 | link |
2024-03-17 | Multi-Sample Long Range Path Planning under Sensing Uncertainty for Off-Road Autonomous Driving | Matt Schmittle et.al. | 2403.11298 | null |
2024-03-17 | Large Language Models Powered Context-aware Motion Prediction | Xiaoji Zheng et.al. | 2403.11057 | link |
2024-03-16 | Task-Driven Manipulation with Reconfigurable Parallel Robots | Daniel Morton et.al. | 2403.10768 | null |
2024-03-15 | Gradient based Feature Attribution in Explainable AI: A Technical Review | Yongjie Wang et.al. | 2403.10415 | null |
2024-03-15 | Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search | Hongyuan Yu et.al. | 2403.10413 | link |
2024-03-15 | SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras | Yingqi Tang et.al. | 2403.10353 | link |
2024-03-15 | CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning | Yukun Li et.al. | 2403.10245 | link |
2024-03-31 | RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception | Ruiyang Hao et.al. | 2403.10145 | link |
2024-03-15 | Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning | Hang Zhang et.al. | 2403.10107 | null |
2024-03-15 | RangeLDM: Fast Realistic LiDAR Point Cloud Generation | Qianjiang Hu et.al. | 2403.10094 | link |
2024-03-15 | SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception | Yiheng Li et.al. | 2403.10036 | null |
2024-03-15 | Visual Foundation Models Boost Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation | Jingyi Xu et.al. | 2403.10001 | link |
2024-03-14 | Reality Bites: Assessing the Realism of Driving Scenarios with Large Language Models | Jiahui Wu et.al. | 2403.09906 | link |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630 | link |
2024-03-14 | Renovating Names in Open-Vocabulary Segmentation Benchmarks | Haiwen Huang et.al. | 2403.09593 | null |
2024-03-14 | Are you a robot? Detecting Autonomous Vehicles from Behavior Analysis | Fabio Maresca et.al. | 2403.09571 | null |
2024-03-14 | On STPA for Distributed Development of Safe Autonomous Driving: An Interview Study | Ali Nouri et.al. | 2403.09509 | null |
2024-03-14 | An Industrial Experience Report about Challenges from Continuous Monitoring, Improvement, and Deployment for Autonomous Driving Features | Ali Nouri et.al. | 2403.09474 | null |
2024-03-14 | EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection | Jiaqing Zhang et.al. | 2403.09323 | link |
2024-03-14 | Intention-aware Denoising Diffusion Model for Trajectory Prediction | Chen Liu et.al. | 2403.09190 | null |
2024-03-14 | PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Tianyuan Yuan et.al. | 2403.09079 | link |
2024-03-13 | FogGuard: guarding YOLO against fog using perceptual loss | Soheil Gharatappeh et.al. | 2403.08939 | link |
2024-03-13 | CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow | Chenbin Pan et.al. | 2403.08919 | null |
2024-04-30 | People Attribute Purpose to Autonomous Vehicles When Explaining Their Behavior | Balint Gyevnar et.al. | 2403.08828 | link |
2024-03-13 | FastMAC: Stochastic Spectral Sampling of Correspondence Graph | Yifei Zhang et.al. | 2403.08770 | link |
2024-03-13 | MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Jialv Zou et.al. | 2403.08760 | link |
2024-04-29 | Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution | Samuel Sze et.al. | 2403.08748 | null |
2024-03-13 | IAMCV Multi-Scenario Vehicle Interaction Dataset | Novel Certad et.al. | 2403.08455 | null |
2024-03-13 | DeepCSHAP: Utilizing Shapley Values to Explain Deep Complex-Valued Neural Networks | Florian Eilers et.al. | 2403.08428 | null |
2024-03-13 | Optimized Detection and Classification on GTRSB: Advancing Traffic Sign Recognition with Convolutional Neural Networks | Dhruv Toshniwal et.al. | 2403.08283 | null |
2024-03-13 | LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving | Sicen Guo et.al. | 2403.08215 | null |
2024-03-12 | Q-SLAM: Quadric Representations for Monocular SLAM | Chensheng Peng et.al. | 2403.08125 | null |
2024-03-12 | Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception | Philipp Wolters et.al. | 2403.07746 | link |
2024-03-12 | A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions | Quoc-Vinh Lai-Dang et.al. | 2403.07542 | null |
2024-03-12 | Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving | JunDa Cheng et.al. | 2403.07535 | link |
2024-03-12 | Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Alexander Timans et.al. | 2403.07263 | link |
2024-03-12 | Tractable Joint Prediction and Planning over Discrete Behavior Modes for Urban Driving | Adam Villaflor et.al. | 2403.07232 | null |
2024-03-11 | Mapping High-level Semantic Regions in Indoor Environments without Object Recognition | Roberto Bigazzi et.al. | 2403.07076 | null |
2024-03-11 | LISO: Lidar-only Self-Supervised 3D Object Detection | Stefan Baur et.al. | 2403.07071 | link |
2024-02-28 | Automatic driving lane change safety prediction model based on LSTM | Wenjian Sun et.al. | 2403.06993 | null |
2024-04-11 | DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Guosheng Zhao et.al. | 2403.06845 | null |
2024-03-11 | PCLD: Point Cloud Layerwise Diffusion for Adversarial Purification | Mert Gulsen et.al. | 2403.06698 | link |
2024-03-11 | 3D Semantic Segmentation-Driven Representations for 3D Object Detection | Hayeon O et.al. | 2403.06501 | link |
2024-03-10 | On depth prediction for autonomous driving using self-supervised learning | Houssem Boulahbal et.al. | 2403.06194 | null |
2024-03-10 | Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving | Zhili Chen et.al. | 2403.06166 | null |
2024-03-09 | Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving | Junyi Cao et.al. | 2403.05907 | link |
2024-03-09 | Fast Kernel Scene Flow | Xueqian Li et.al. | 2403.05896 | link |
2024-04-22 | SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection | Gang Zhang et.al. | 2403.05817 | link |
2024-03-08 | JointMotion: Joint Self-supervision for Joint Motion Prediction | Royden Wagner et.al. | 2403.05489 | link |
2024-03-08 | OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2403.05329 | null |
2024-03-08 | LVIC: Multi-modality segmentation by Lifting Visual Info as Cue | Zichao Dong et.al. | 2403.05159 | null |
2024-03-08 | LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves | Jiayan Cao et.al. | 2403.05155 | null |
2024-03-18 | DyRoNet: Dynamic Routing and Low-Rank Adapters for Autonomous Driving Streaming Perception | Xiang Huang et.al. | 2403.05050 | null |
2024-03-11 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002 | link |
2024-03-11 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
2024-04-21 | A General Calibrated Regret Metric for Detecting and Mitigating Human-Robot Interaction Failures | Kensuke Nakamura et.al. | 2403.04745 | null |
2024-03-07 | Embodied Understanding of Driving Scenarios | Yunsong Zhou et.al. | 2403.04593 | link |
2024-03-07 | FriendNet: Detection-Friendly Dehazing Network | Yihua Fan et.al. | 2403.04443 | link |
2024-05-01 | LitSim: A Conflict-aware Policy for Long-term Interactive Traffic Simulation | Haojie Xin et.al. | 2403.04299 | null |
2024-03-07 | Generalizing Cooperative Eco-driving via Multi-residual Task Learning | Vindula Jayawardana et.al. | 2403.04232 | null |
2024-03-07 | Incremental Bayesian Learning for Fail-Operational Control in Autonomous Driving | Lei Zheng et.al. | 2403.04143 | null |
2024-03-07 | Towards learning-based planning:The nuPlan benchmark for real-world autonomous driving | Napat Karnchanachari et.al. | 2403.04133 | null |
2024-03-06 | Multi-Object Tracking with Camera-LiDAR Fusion for Autonomous Driving | Riccardo Pieroni et.al. | 2403.04112 | null |
2024-03-06 | To Spend or to Gain: Online Learning in Repeated Karma Auctions | Damien Berriaud et.al. | 2403.04057 | null |
2024-03-06 | 3D Object Visibility Prediction in Autonomous Driving | Chuanyu Luo et.al. | 2403.03681 | null |
2024-03-20 | Learning Adversarial MDPs with Stochastic Hard Constraints | Francesco Emanuele Stradi et.al. | 2403.03672 | null |
2024-03-06 | Seamless Virtual Reality with Integrated Synchronizer and Synthesizer for Autonomous Driving | He Li et.al. | 2403.03541 | null |
2024-03-06 | Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator | Wonhyeok Choi et.al. | 2403.03468 | null |
2024-03-05 | Behavior Generation with Latent Actions | Seungjae Lee et.al. | 2403.03181 | link |
2024-03-05 | User-Driven Adaptation: Tailoring Autonomous Driving Systems with Dynamic Preferences | Mingyue Zhang et.al. | 2403.02928 | null |
2024-03-05 | ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving | Han Lu et.al. | 2403.02877 | null |
2024-03-05 | FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View | Jiawei Hou et.al. | 2403.02710 | null |
2024-03-05 | Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation | Gang Liu et.al. | 2403.02707 | null |
2024-03-26 | HoloVIC: Large-scale Dataset and Benchmark for Multi-Sensor Holographic Intersection and Vehicle-Infrastructure Cooperative | Cong Ma et.al. | 2403.02640 | null |
2024-03-05 | World Models for Autonomous Driving: An Initial Survey | Yanchen Guan et.al. | 2403.02622 | null |
2024-03-04 | Uncertainty-Aware Prediction and Application in Planning for Autonomous Driving: Definitions, Methods, and Comparison | Wenbo Shao et.al. | 2403.02297 | null |
2024-03-04 | Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views | Shuai Guo et.al. | 2403.02063 | null |
2024-03-04 | Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous Driving | Yuxuan Liu et.al. | 2403.02037 | link |
2024-03-04 | Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection | Shitao Chen et.al. | 2403.01978 | link |
2024-03-04 | Progressive Smoothing for Motion Planning in Real-Time NMPC | Rudolf Reiter et.al. | 2403.01830 | null |
2024-03-04 | PointCore: Efficient Unsupervised Point Cloud Anomaly Detector Using Local-Global Features | Baozhu Zhao et.al. | 2403.01804 | link |
2024-04-22 | OccFusion: A Straightforward and Effective Multi-Sensor Fusion Framework for 3D Occupancy Prediction | Zhenxing Ming et.al. | 2403.01644 | link |
2024-03-03 | A Unified Model Selection Technique for Spectral Clustering Based Motion Segmentation | Yuxiang Huang et.al. | 2403.01606 | null |
2024-03-02 | Decentralized Implicit Differentiation | Lucas Fuentes Valenzuela et.al. | 2403.01260 | null |
2024-04-15 | On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving | Kaituo Feng et.al. | 2403.01238 | link |
2024-03-20 | Results and Lessons Learned from Autonomous Driving Transportation Services in Airfield, Crowded Indoor, and Urban Environments | Doosan Baek et.al. | 2403.01233 | null |
2024-03-01 | Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change | Ruichen Xu et.al. | 2403.00446 | null |
2024-03-01 | MS-Net: A Multi-Path Sparse Model for Motion Prediction in Multi-Scenes | Xiaqiang Tang et.al. | 2403.00353 | null |
2024-02-29 | Genie: Smart ROS-based Caching for Connected Autonomous Robots | Zexin Li et.al. | 2402.19410 | null |
2024-02-29 | Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction | Wenbo Shao et.al. | 2402.19385 | null |
2024-03-03 | RoadRunner - Learning Traversability Estimation for Autonomous Off-road Driving | Jonas Frey et.al. | 2402.19341 | null |
2024-02-29 | T3DNet: Compressing Point Cloud Models for Lightweight 3D Recognition | Zhiyuan Yang et.al. | 2402.19264 | null |
2024-02-29 | A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Haicheng Liao et.al. | 2402.19251 | link |
2024-02-21 | Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting | Edgar Medina et.al. | 2402.19237 | link |
2024-02-29 | CollaFuse: Navigating Limited Resources and Privacy in Collaborative Generative AI | Domenique Zipperling et.al. | 2402.19105 | link |
2024-02-29 | GoalNet: Goal Areas Oriented Pedestrian Trajectory Prediction | Ching-Lin Lee et.al. | 2402.19002 | null |
2024-02-29 | Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey | Yang Liu et.al. | 2402.18844 | link |
2024-04-25 | Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks | Benjamin David Evans et.al. | 2402.18558 | link |
2024-02-28 | Evaluating Decision Optimality of Autonomous Driving via Metamorphic Testing | Mingfei Cheng et.al. | 2402.18393 | null |
2024-02-28 | Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis | Miriam Louise Carnot et.al. | 2402.18309 | null |
2024-02-28 | EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving | Jiacheng Lin et.al. | 2402.18302 | link |
2024-02-28 | PiShield: A NeSy Framework for Learning with Requirements | Mihaela Cătălina Stoian et.al. | 2402.18285 | link |
2024-03-08 | EAN-MapNet: Efficient Vectorized HD Map Construction with Anchor Neighborhoods | Huiyuan Xiong et.al. | 2402.18278 | null |
2024-04-07 | NiteDR: Nighttime Image De-Raining with Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes | Cidan Shi et.al. | 2402.18172 | link |
2024-03-01 | 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling | Chaokang Jiang et.al. | 2402.18146 | link |
2024-02-28 | OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction | Jian Liu et.al. | 2402.18140 | null |
2024-03-30 | Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging | Bhargav Ghanekar et.al. | 2402.18102 | null |
2024-03-06 | ICAT: An Indoor Connected and Autonomous Testbed for Vehicle Computing | Zhaofeng Tian et.al. | 2402.17933 | null |
2024-03-17 | SWTrack: Multiple Hypothesis Sliding Window 3D Multi-Object Tracking | Sandro Papais et.al. | 2402.17892 | null |
2024-02-27 | QoS prediction in radio vehicular environments via prior user information | Noor Ul Ain et.al. | 2402.17689 | null |
2024-02-27 | Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data | David S. W. Williams et.al. | 2402.17653 | null |
2024-02-27 | An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains | George Eskandar et.al. | 2402.17562 | null |
2024-02-27 | Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction | Zihao Liu et.al. | 2402.17430 | link |
2024-03-21 | ICP-Flow: LiDAR Scene Flow Estimation with ICP | Yancong Lin et.al. | 2402.17351 | link |
2024-02-26 | Parallelized Spatiotemporal Binding | Gautam Singh et.al. | 2402.17077 | null |
2024-02-26 | Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test | Kathy Jang et.al. | 2402.17050 | null |
2024-03-20 | Lightweight, error-tolerant edge detection using memristor-enabled stochastic logics | Lekai Song et.al. | 2402.16908 | null |
2024-02-26 | Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2) | Qifeng Li et.al. | 2402.16720 | null |
2024-02-26 | Learning Based NMPC Adaptation for Autonomous Driving using Parallelized Digital Twin | Jean Pierre Allamaa et.al. | 2402.16645 | null |
2024-02-26 | Trajectory Prediction for Autonomous Driving Using a Transformer Network | Zhenning Li et.al. | 2402.16501 | null |
2024-02-26 | Edge Detectors Can Make Deep Convolutional Neural Networks More Robust | Jin Ding et.al. | 2402.16479 | null |
2024-02-26 | Contingency Planning Using Bi-level Markov Decision Processes for Space Missions | Somrita Banerjee et.al. | 2402.16342 | link |
2024-02-26 | SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking | Yu Lin et.al. | 2402.16249 | link |
2024-02-25 | Catch Me If You Can: Combatting Fraud in Artificial Currency Based Government Benefits Programs | Devansh Jalota et.al. | 2402.16162 | null |
2024-02-25 | Machine Learning-Based Vehicle Intention Trajectory Recognition and Prediction for Autonomous Driving | Hanyi Yu et.al. | 2402.16036 | null |
2024-02-24 | Construction and application of artificial intelligence crowdsourcing map based on multi-track GPS data | Yong Wang et.al. | 2402.15796 | null |
2024-03-30 | Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space | Yuan Lin et.al. | 2402.15790 | null |
2024-04-06 | Detection Is Tracking: Point Cloud Multi-Sweep Deep Learning Models Revisited | Lingji Chen et.al. | 2402.15756 | null |
2024-04-16 | Multi-Constraint Safe RL with Objective Suppression for Safety-Critical Applications | Zihan Zhou et.al. | 2402.15650 | null |
2024-02-23 | Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving | Yichen Xie et.al. | 2402.15583 | null |
2024-02-21 | PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain | Liang Chen et.al. | 2402.15527 | link |
2024-02-23 | RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation | Hanxiao Jiang et.al. | 2402.15487 | link |
2024-02-23 | EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Zhe Wang et.al. | 2402.15272 | link |
2024-02-22 | Path Planning based on 2D Object Bounding-box | Yanliang Huang et.al. | 2402.14933 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642 | null |
2024-02-08 | Enhancement of High-definition Map Update Service Through Coverage-aware and Reinforcement Learning | Jeffrey Redondo et.al. | 2402.14582 | null |
2024-02-22 | RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation | Changsong Pang et.al. | 2402.14380 | link |
2024-02-23 | Blending Data-Driven Priors in Dynamic Games | Justin Lidard et.al. | 2402.14174 | null |
2024-03-18 | Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving | Mehdi Azarafza et.al. | 2402.13602 | link |
2024-02-21 | EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization | Zhendong Xiao et.al. | 2402.13537 | null |
2024-02-21 | Learning to Model Diverse Driving Behaviors in Highly Interactive Autonomous Driving Scenarios with Multi-Agent Reinforcement Learning | Liu Weiwei et.al. | 2402.13481 | null |
2024-02-20 | VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning | Shaoyu Chen et.al. | 2402.13243 | link |
2024-02-20 | 3D high-resolution imaging algorithm using 1D MIMO array for autonomous driving application | Sen Yuan et.al. | 2402.13062 | null |
2024-02-20 | Advancements in Point Cloud-Based 3D Defect Detection and Classification for Industrial Systems: A Comprehensive Survey | Anju Rani et.al. | 2402.12923 | null |
2024-02-20 | Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept | Kui Wang et.al. | 2402.12682 | null |
2024-02-20 | Pre-trained Transformer-Enabled Strategies with Human-Guided Fine-Tuning for End-to-end Navigation of Autonomous Vehicles | Dong Hu et.al. | 2402.12666 | null |
2024-02-19 | Binary Opacity Grids: Capturing Fine Geometric Detail for Mesh-Based View Synthesis | Christian Reiser et.al. | 2402.12377 | null |
2024-02-19 | UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object Tracking | Chang Won Lee et.al. | 2402.12303 | link |
2024-03-31 | DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models | Xiaoyu Tian et.al. | 2402.12289 | null |
2024-02-19 | Modified RRT* for Path Planning in Autonomous Driving | Sugirtha T et.al. | 2402.12129 | null |
2024-02-19 | Towards Explainable LiDAR Point Cloud Semantic Segmentation via Gradient Based Target Localization | Abhishek Kuriyal et.al. | 2402.12098 | link |
2024-02-21 | Surround-View Fisheye Optics in Computer Vision and Simulation: Survey and Challenges | Daniel Jakab et.al. | 2402.12041 | null |
2024-04-02 | SDGE: Stereo Guided Depth Estimation for 360 $^\circ$ Camera Sets | Jialei Xu et.al. | 2402.11791 | null |
2024-04-07 | GenAD: Generative End-to-End Autonomous Driving | Wenzhao Zheng et.al. | 2402.11502 | link |
2024-02-17 | Exploiting T-norms for Deep Learning in Autonomous Driving | Mihaela Cătălina Stoian et.al. | 2402.11362 | null |
2024-02-17 | CARLA-Autoware-Bridge: Facilitating Autonomous Driving Research with a Unified Framework for Simulation and Module Development | Gemb Kaljavesi et.al. | 2402.11239 | link |
2024-02-21 | When Simple is Near-Optimal in Security Games | Devansh Jalota et.al. | 2402.11209 | null |
2024-02-16 | RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Jianhao Yuan et.al. | 2402.10828 | null |
2024-02-16 | Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation | Steven Landgraf et.al. | 2402.10580 | null |
2024-02-26 | Barrier-Enhanced Homotopic Parallel Trajectory Optimization for Safety-Critical Autonomous Driving | Lei Zheng et.al. | 2402.10441 | null |
2024-02-15 | Benchmarking the Operation of Quantum Heuristics and Ising Machines: Scoring Parameter Setting Strategies on Optimization Applications | David E. Bernal Neira et.al. | 2402.10255 | null |
2024-02-03 | Simulation-based Analysis of a Novel Loop-based Road Topology for Autonomous Vehicles | Stefan Ramdhan et.al. | 2402.10226 | null |
2024-02-08 | Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review | Anton Kuznietsov et.al. | 2402.10086 | null |
2024-01-29 | Review of the Learning-based Camera and Lidar Simulation Methods for Autonomous Driving Systems | Hamed Haghighi et.al. | 2402.10079 | null |
2024-02-15 | Exploiting Alpha Transparency In Language And Vision-Based AI Systems | David Noever et.al. | 2402.09671 | null |
2024-02-14 | How Secure Are Large Language Models (LLMs) for Navigation in Urban Environments? | Congcong Wen et.al. | 2402.09546 | null |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325 | link |
2024-02-14 | Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning | Michael Lanier et.al. | 2402.09290 | null |
2024-02-14 | Design and Realization of a Benchmarking Testbed for Evaluating Autonomous Platooning Algorithms | Michael Shaham et.al. | 2402.09233 | null |
2024-02-13 | Vehicle Behavior Prediction by Episodic-Memory Implanted NDT | Peining Shen et.al. | 2402.08423 | link |
2024-02-13 | MetaTra: Meta-Learning for Generalized Trajectory Prediction in Unseen Domain | Xiaohe Li et.al. | 2402.08221 | null |
2024-02-29 | Inherent Diverse Redundant Safety Mechanisms for AI-based Software Elements in Automotive Applications | Mandar Pitale et.al. | 2402.08208 | null |
2024-02-13 | Translating Images to Road Network:A Non-Autoregressive Sequence-to-Sequence Approach | Jiachen Lu et.al. | 2402.08207 | link |
2024-02-12 | Interaction-Based Driving Scenario Classification and Labeling | Cheng Chang et.al. | 2402.07720 | null |
2024-02-12 | AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer | Tanmoy Dam et.al. | 2402.07680 | link |
2024-02-12 | DART: A Compact Platform For Autonomous Driving Research | Lorenzo Lyons et.al. | 2402.07602 | null |
2024-02-11 | Towards Explainable, Safe Autonomous Driving with Language Embeddings for Novelty Identification and Active Learning: Framework and Experimental Analysis with Real-World Data Sets | Ross Greer et.al. | 2402.07320 | null |
2024-02-09 | Neural Rendering based Urban Scene Reconstruction for Autonomous Driving | Shihao Shen et.al. | 2402.06826 | null |
2024-02-09 | Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following | Brian Yang et.al. | 2402.06559 | link |
2024-02-09 | CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention | Yifeng Bai et.al. | 2402.06423 | null |
2024-02-08 | Driving Everywhere with Large Language Model Policy Adaptation | Boyi Li et.al. | 2402.05932 | null |
2024-03-11 | Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Yuxi Wei et.al. | 2402.05746 | link |
2024-02-08 | Optimizing Delegation in Collaborative Human-AI Hybrid Teams | Andrew Fuchs et.al. | 2402.05605 | null |
2024-02-07 | Compressing Deep Reinforcement Learning Networks with a Dynamic Structured Pruning Method for Autonomous Driving | Wensheng Su et.al. | 2402.05146 | null |
2024-02-07 | Tuning the feedback controller gains is a simple way to improve autonomous driving performance | Wenyu Liang et.al. | 2402.05064 | null |
2024-02-07 | Investigating Driving Interactions: A Robust Multi-Agent Simulation Framework for Autonomous Vehicles | Marc Kaufeld et.al. | 2402.04720 | link |
2024-02-15 | LiDAR-Forest Dataset: LiDAR Point Cloud Simulation Dataset for Forestry Application | Yawen Lu et.al. | 2402.04546 | null |
2024-02-07 | BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision | Xin Zhao et.al. | 2402.04519 | null |
2024-03-08 | Human Observation-Inspired Trajectory Prediction for Autonomous Driving in Mixed-Autonomy Traffic Environments | Haicheng Liao et.al. | 2402.04318 | link |
2024-02-06 | Informed Reinforcement Learning for Situation-Aware Traffic Rule Exceptions | Daniel Bogdoll et.al. | 2402.04168 | link |
2024-02-06 | Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting | Yiming Xu et.al. | 2402.03981 | null |
2024-02-06 | OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving | Guohang Yan et.al. | 2402.03830 | link |
2024-02-05 | Efficient and Interpretable Traffic Destination Prediction using Explainable Boosting Machines | Yasin Yousif et.al. | 2402.03457 | link |
2024-02-05 | Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate | Can Jin et.al. | 2402.02769 | link |
2024-02-05 | Improving Robustness of LiDAR-Camera Fusion Model against Weather Corruption from Fusion Strategy Perspective | Yihao Huang et.al. | 2402.02738 | null |
2024-02-04 | A Review of Full-Sized Autonomous Racing Vehicle Sensor Architecture | Manuel Mar et.al. | 2402.02603 | null |
2024-02-04 | Synthesizing Follow-Up Drive Data for Enhanced Road Safety in Intelligent Driving Function Systems | Nico Schick et.al. | 2402.02598 | null |
2024-02-04 | SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving | Lu Zhang et.al. | 2402.02519 | link |
2024-02-04 | Hybrid-Prediction Integrated Planning for Autonomous Driving | Haochen Liu et.al. | 2402.02426 | null |
2024-02-03 | Evaluating the Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric analysis | Pankaj Deoli et.al. | 2402.02154 | link |
2024-02-03 | S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation | Yurui Chen et.al. | 2402.02112 | null |
2024-02-03 | Physical Perception Network and an All-weather Multi-modality Benchmark for Adverse Weather Image Fusion | Xilai Li et.al. | 2402.02090 | link |
2024-02-03 | RIDERS: Radar-Infrared Depth Estimation for Robust Sensing | Han Li et.al. | 2402.02067 | link |
2024-02-03 | Multimodal-Enhanced Objectness Learner for Corner Case Detection in Autonomous Driving | Lixing Xiao et.al. | 2402.02026 | link |
2024-02-03 | A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions | Hung Du et.al. | 2402.01968 | null |
2024-02-02 | Efficient and Interaction-Aware Trajectory Planning for Autonomous Vehicles with Particle Swarm Optimization | Lin Song et.al. | 2402.01575 | null |
2024-02-02 | Overcoming Blind Spots: Occlusion Considerations for Improved Autonomous Driving Safety | Korbinian Moller et.al. | 2402.01507 | link |
2024-02-02 | A Reinforcement Learning-Boosted Motion Planning Framework: Comprehensive Generalization Performance in Autonomous Driving | Rainer Trauth et.al. | 2402.01465 | link |
2024-02-02 | Frenetix Motion Planner: High-Performance and Modular Trajectory Planning Algorithm for Complex Autonomous Driving Scenarios | Korbinian Moller et.al. | 2402.01443 | link |
2024-02-08 | A survey on robustness in trajectory prediction for autonomous vehicles | Jeroen Hagenus et.al. | 2402.01397 | null |
2024-02-02 | LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving | Daocheng Fu et.al. | 2402.01246 | null |
2024-02-02 | A Survey for Foundation Models in Autonomous Driving | Haoxiang Gao et.al. | 2402.01105 | null |
2024-02-02 | Combining Belief Function Theory and Stochastic Model Predictive Control for Multi-Modal Uncertainty in Autonomous Driving | Tommaso Benciolini et.al. | 2402.00697 | null |
2024-02-01 | Fisheye Camera and Ultrasonic Sensor Fusion For Near-Field Obstacle Perception in Bird’s-Eye-View | Arindam Das et.al. | 2402.00637 | null |
2024-02-01 | Uncertainty-Aware Partial-Label Learning | Tobias Fuchs et.al. | 2402.00592 | link |
2024-02-01 | Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks | Bo Yang et.al. | 2402.00398 | null |
2024-02-01 | Multi-agent Path Finding for Cooperative Autonomous Driving | Zhongxia Yan et.al. | 2402.00334 | link |
2024-03-04 | SmartCooper: Vehicular Collaborative Perception with Adaptive Fusion and Judger Mechanism | Yuang Zhang et.al. | 2402.00321 | null |
2024-02-29 | Real-time Traffic Object Detection for Autonomous Driving | Abdul Hannan Khan et.al. | 2402.00128 | null |
2024-01-31 | CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting | Jiezhi Yang et.al. | 2401.18075 | null |
2024-02-19 | LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement | Renyuan Peng et.al. | 2401.17609 | link |
2024-01-30 | ATPPNet: Attention based Temporal Point cloud Prediction Network | Kaustab Pal et.al. | 2401.17399 | null |
2024-01-30 | MF-MOS: A Motion-Focused Model for Moving Object Segmentation | Jintao Cheng et.al. | 2401.17023 | link |
2024-01-30 | Evaluation of Out-of-Distribution Detection Performance on Autonomous Driving Datasets | Jens Henriksson et.al. | 2401.17013 | null |
2024-01-30 | The Why, When, and How to Use Active Learning in Large-Data-Driven 3D Object Detection for Safe Autonomous Driving: An Empirical Exploration | Ross Greer et.al. | 2401.16634 | null |
2024-01-30 | I came, I saw, I certified: some perspectives on the safety assurance of cyber-physical systems | Mithila Sivakumar et.al. | 2401.16633 | null |
2024-01-29 | FIMP: Future Interaction Modeling for Multi-Agent Motion Prediction | Sungmin Woo et.al. | 2401.16189 | null |
2024-01-29 | DeFlow: Decoder of Scene Flow Network in Autonomous Driving | Qingwen Zhang et.al. | 2401.16122 | link |
2024-01-29 | A Concise but Effective Network for Image Guided Depth Completion in Autonomous Driving | Moyun Liu et.al. | 2401.15902 | link |
2024-01-30 | GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes and Minimalist Workflow | Liguo Zhou et.al. | 2401.15803 | link |
2024-01-27 | You Only Look Bottom-Up for Monocular 3D Object Detection | Kaixin Xiong et.al. | 2401.15319 | null |
2024-01-27 | Learning Online Belief Prediction for Efficient POMDP Planning in Autonomous Driving | Zhiyu Huang et.al. | 2401.15315 | null |
2024-01-26 | DAM: Diffusion Activation Maximization for 3D Global Explanations | Hanxiao Tan et.al. | 2401.14938 | link |
2024-01-25 | Unlocking Past Information: Temporal Embeddings in Cooperative Bird’s Eye View Prediction | Dominik Rößle et.al. | 2401.14325 | null |
2024-01-25 | Optimization-based motion primitive automata for autonomous driving | Matheus V. A. Pedrosa et.al. | 2401.14276 | null |
2024-01-25 | Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks | Tianhe Ren et.al. | 2401.14159 | link |
2024-01-24 | S2TPVFormer: Spatio-Temporal Tri-Perspective View for temporally coherent 3D Semantic Occupancy Prediction | Sathira Silva et.al. | 2401.13785 | null |
2024-02-29 | ADMap: Anti-disturbance framework for reconstructing online vectorized HD map | Haotian Hu et.al. | 2401.13172 | link |
2024-01-23 | IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images | Zhi-Hao Lin et.al. | 2401.12977 | null |
2024-01-26 | Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies | Lincan Li et.al. | 2401.12888 | link |
2024-01-23 | Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration | Yifan Zhang et.al. | 2401.12452 | link |
2024-01-22 | Accelerating Continuous Variable Coherent Ising Machines via Momentum | Robin Brown et.al. | 2401.12135 | link |
2024-01-22 | Enhancing Safety in Nonlinear Systems: Design and Stability Analysis of Adaptive Cruise Control | Fan Yang et.al. | 2401.11961 | null |
2024-03-10 | Large receptive field strategy and important feature extraction strategy in 3D object detection | Leichao Cui et.al. | 2401.11913 | null |
2024-01-22 | First-principles Based 3D Virtual Simulation Testing for Discovering SOTIF Corner Cases of Autonomous Driving | Lehang Li et.al. | 2401.11876 | null |
2024-03-14 | Safe and Generalized end-to-end Autonomous Driving System with Reinforcement Learning and Demonstrations | Zuojin Tang et.al. | 2401.11792 | null |
2024-01-21 | Self-Supervised Bird’s Eye View Motion Prediction with Cross-Modality Signals | Shaoheng Fang et.al. | 2401.11499 | link |
2024-01-29 | S $^3$ M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving | Zhiyuan Wu et.al. | 2401.11414 | null |
2024-01-21 | Modeling Considerations for Developing Deep Space Autonomous Spacecraft and Simulators | Christopher Agia et.al. | 2401.11371 | null |
2024-03-01 | Enhancing System-Level Safety in Mixed-Autonomy Platoon via Safe Reinforcement Learning | Jingyuan Zhou et.al. | 2401.11148 | link |
2024-01-19 | Exploring Highly Quantised Neural Networks for Intrusion Detection in Automotive CAN | Shashwat Khandelwal et.al. | 2401.11030 | null |
2024-01-19 | A Lightweight Multi-Attack CAN Intrusion Detection System on Hybrid FPGAs | Shashwat Khandelwal et.al. | 2401.10689 | null |
2024-01-19 | Deep Learning-based Embedded Intrusion Detection System for Automotive CAN | Shashwat Khandelwal et.al. | 2401.10674 | null |
2024-01-19 | BadODD: Bangladeshi Autonomous Driving Object Detection Dataset | Mirza Nihal Baig et.al. | 2401.10659 | null |
2024-01-19 | Episodic Reinforcement Learning with Expanded State-reward Space | Dayang Liang et.al. | 2401.10516 | null |
2024-01-19 | Towards Automated Driving Violation Cause Analysis in Scenario-Based Testing for Autonomous Driving Systems | Ziwen Wan et.al. | 2401.10443 | null |
2024-01-18 | Reconstructing the Invisible: Video Frame Restoration through Siamese Masked Conditional Variational Autoencoder | Yongchen Zhou et.al. | 2401.10402 | null |
2024-01-18 | Analyzing and Mitigating Bias for Vulnerable Classes: Towards Balanced Representation in Dataset | Dewant Katare et.al. | 2401.10397 | null |
2024-01-18 | LangProp: A code optimization framework using Language Models applied to driving | Shu Ishida et.al. | 2401.10314 | link |
2024-01-18 | Hacking Predictors Means Hacking Cars: Using Sensitivity Analysis to Identify Trajectory Prediction Vulnerabilities for Autonomous Driving Security | Marsalis Gibson et.al. | 2401.10313 | null |
2024-01-18 | Model-Assisted Learning for Adaptive Cooperative Perception of Connected Autonomous Vehicles | Kaige Qu et.al. | 2401.10156 | null |
2024-01-16 | Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving | Jie Lv et.al. | 2401.10153 | null |
2024-01-17 | Objects With Lighting: A Real-World Dataset for Evaluating Reconstruction and Rendering for Object Relighting | Benjamin Ummenhofer et.al. | 2401.09126 | link |
2024-01-18 | Stream Query Denoising for Vectorized HD Map Construction | Shuo Wang et.al. | 2401.09112 | null |
2024-01-17 | Enhancing Campus Mobility: Achievements and Challenges of Autonomous Shuttle “Snow Lion’‘ | Yingbing Chen et.al. | 2401.08939 | null |
2023-12-27 | Risk-anticipatory autonomous driving strategies considering vehicles’ weights, based on hierarchical deep reinforcement learning | Di Chen et.al. | 2401.08661 | null |
2023-12-26 | End-To-End Planning of Autonomous Driving in Industry and Academia: 2022-2023 | Gongjin Lan et.al. | 2401.08658 | null |
2023-12-25 | Digital Twins for Autonomous Driving: A Comprehensive Implementation and Demonstration | Kui Wang et.al. | 2401.08653 | null |
2024-01-16 | Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities | Xu Yan et.al. | 2401.08045 | link |
2024-01-15 | Pedestrian Detection in Low-Light Conditions: A Comprehensive Survey | Bahareh Ghari et.al. | 2401.07801 | null |
2024-01-15 | Semantic Scene Segmentation for Robotics | Juana Valeria Hurtado et.al. | 2401.07589 | null |
2024-01-15 | Geo-locating Road Objects using Inverse Haversine Formula with NVIDIA Driveworks | Mamoona Birkhez Shami et.al. | 2401.07582 | null |
2024-02-09 | RSUD20K: A Dataset for Road Scene Understanding In Autonomous Driving | Hasib Zunair et.al. | 2401.07322 | link |
2024-01-14 | Photonic real time video image signal processor at 17Tb/s based on a Kerr microcomb | Mengxi Tan et.al. | 2401.07197 | null |
2024-01-13 | ACAV: A Framework for Automatic Causality Analysis in Autonomous Vehicle Accident Recordings | Huijia Sun et.al. | 2401.07063 | null |
2024-01-13 | UniVision: A Unified Framework for Vision-Centric 3D Perception | Yu Hong et.al. | 2401.06994 | null |
2024-01-12 | Open RAN LSTM Traffic Prediction and Slice Management using Deep Reinforcement Learning | Fatemeh Lotfi et.al. | 2401.06922 | null |
2024-01-12 | Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction | Muhammad Naveed Riaz et.al. | 2401.06757 | link |
2024-01-12 | Real-time MPC with Control Barrier Functions for Autonomous Driving using Safety Enhanced Collocation | Jean Pierre Allamaa et.al. | 2401.06648 | null |
2024-01-12 | Enhancing Throughput for TTEthernet via Co-optimizing Routing and Scheduling: An Online Time-Varying Graph-based Method | Yaoxu He et.al. | 2401.06579 | null |
2024-01-12 | Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook | Ziying Song et.al. | 2401.06542 | null |
2024-01-12 | Personalized Reinforcement Learning with a Budget of Policies | Dmitry Ivanov et.al. | 2401.06514 | link |
2024-01-12 | Multi-Profile Quadratic Programming (MPQP) for Optimal Gap Selection and Speed Planning of Autonomous Driving | Alexandre Miranda Anon et.al. | 2401.06305 | null |
2024-03-09 | VLP: Vision Language Planning for Autonomous Driving | Chenbin Pan et.al. | 2401.05577 | null |
2024-01-10 | Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects | Tianhang Cheng et.al. | 2401.05236 | link |
2024-01-10 | Autonomous Navigation of Tractor-Trailer Vehicles through Roundabout Intersections | Daniel Attard et.al. | 2401.04980 | null |
2024-01-10 | Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics | Beiwen Tian et.al. | 2401.04942 | null |
2024-01-10 | Learning Racing From an AI Coach: Effects of Multimodal Autonomous Driving Explanations on Driving Performance, Cognitive Load, Expertise, and Trust | Robert Kaufman et.al. | 2401.04206 | null |
2024-01-08 | RoboFusion: Towards Robust Multi-Modal 3D obiect Detection via SAM | Ziying Song et.al. | 2401.03907 | link |
2024-01-08 | UFO: Unidentified Foreground Object Detection in 3D Point Cloud | Hyunjun Choi et.al. | 2401.03846 | null |
2024-01-15 | WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Chenhongyi Yang et.al. | 2401.03836 | link |
2024-01-08 | Safe Chance-constrained Model Predictive Control under Gaussian Mixture Model Uncertainty | Kai Ren et.al. | 2401.03799 | null |
2024-01-08 | NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation | Casimir Feldmann et.al. | 2401.03771 | null |
2024-01-08 | DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving | Wencheng Han et.al. | 2401.03641 | null |
2024-01-08 | DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement | Jiaqi Liu et.al. | 2401.03629 | null |
2024-01-07 | Text-Driven Traffic Anomaly Detection with Temporal High-Frequency Modeling in Driving Videos | Rongqin Liang et.al. | 2401.03522 | null |
2024-01-16 | Reconfigurable Holographic Surface Aided Wireless Simultaneous Localization and Mapping | Haobo Zhang et.al. | 2401.03453 | null |
2024-01-06 | DistFormer: Enhancing Local and Global Features for Monocular Per-Object Distance Estimation | Aniello Panariello et.al. | 2401.03191 | link |
2024-02-19 | Human as AI Mentor: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving | Zilin Huang et.al. | 2401.03160 | link |
2024-01-06 | Transferable Learned Image Compression-Resistant Adversarial Perturbations | Yang Sui et.al. | 2401.03115 | null |
2024-01-08 | Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction | Yuxin Yang et.al. | 2401.02916 | null |
2024-01-04 | OptFlow: Fast Optimization-based Scene Flow Estimation without Supervision | Rahul Ahuja et.al. | 2401.02550 | null |
2024-01-04 | REDriver: Runtime Enforcement for Autonomous Vehicles | Yang Sun et.al. | 2401.02253 | null |
2024-01-04 | Inherently robust suboptimal MPC for autonomous racing with anytime feasible SQP | Logan Numerow et.al. | 2401.02194 | null |
2024-01-03 | Context-Aware Interaction Network for RGB-T Semantic Segmentation | Ying Lv et.al. | 2401.01624 | link |
2024-01-03 | Collaborative Perception for Connected and Autonomous Driving: Challenges, Possible Solutions and Opportunities | Senkang Hu et.al. | 2401.01544 | null |
2024-01-02 | A Survey on Autonomous Driving Datasets: Data Statistic, Annotation, and Outlook | Mingyu Liu et.al. | 2401.01454 | link |
2024-01-02 | Off-Road LiDAR Intensity Based Semantic Segmentation | Kasi Viswanath et.al. | 2401.01439 | link |
2023-12-28 | Fast Quantum Convolutional Neural Networks for Low-Complexity Object Detection in Autonomous Driving Applications | Hankyul Baek et.al. | 2401.01370 | null |
2024-01-02 | Temporal Adaptive RGBT Tracking with Modality Prompt | Hongyu Wang et.al. | 2401.01244 | null |
2024-01-05 | PLE-SLAM: A Visual-Inertial SLAM Based on Point-Line Features and Efficient IMU Initialization | Jiaming He et.al. | 2401.01081 | link |
2024-01-02 | BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving | Dafeng Wei et.al. | 2401.01065 | null |
2024-01-02 | Holistic Autonomous Driving Understanding by Bird’s-Eye-View Injected Multi-Modal Large Models | Xinpeng Ding et.al. | 2401.00988 | link |
2024-01-16 | WoodScape Motion Segmentation for Autonomous Driving – CVPR 2023 OmniCV Workshop Challenge | Saravanabalagi Ramachandran et.al. | 2401.00910 | null |
2024-01-01 | Socially Compliant Control of Autonomous Vehicles with Application to Eco-Driving | Shian Wang et.al. | 2401.00830 | null |
2023-12-31 | RainSD: Rain Style Diversification Module for Image Synthesis Enhancement using Feature-Level Style Distribution | Hyeonjae Jeon et.al. | 2401.00460 | null |
2023-12-31 | Controllable Safety-Critical Closed-loop Traffic Simulation via Guided Diffusion | Wei-Jer Chang et.al. | 2401.00391 | null |
2023-12-30 | LLM-Assist: Enhancing Closed-Loop Planning with Language-Based Reasoning | S P Sharan et.al. | 2401.00125 | null |
2024-01-07 | Generative AI-driven Semantic Communication Networks: Architecture, Technologies and Applications | Chengsi Liang et.al. | 2401.00124 | null |
2023-12-29 | Visual Point Cloud Forecasting enables Scalable Autonomous Driving | Zetong Yang et.al. | 2312.17655 | link |
2023-12-22 | TimePillars: Temporally-Recurrent 3D LiDAR Object Detection | Ernesto Lozano Calvo et.al. | 2312.17260 | null |
2024-01-29 | FENet: Focusing Enhanced Network for Lane Detection | Liman Wang et.al. | 2312.17163 | link |
2023-12-29 | Fully Sparse 3D Panoptic Occupancy Prediction | Haisong Liu et.al. | 2312.17118 | link |
2023-12-28 | DOEPatch: Dynamically Optimized Ensemble Model for Adversarial Patches Generation | Wenyi Tan et.al. | 2312.16907 | null |
2023-12-27 | LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization | Sai Shubodh Puligilla et.al. | 2312.16648 | null |
2023-12-27 | Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning | Amin Jalal Aghdasian et.al. | 2312.16620 | link |
2024-02-26 | LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving | Tianyu Li et.al. | 2312.16108 | link |
2023-12-26 | Adaptive Kalman-based hybrid car following strategy using TD3 and CACC | Yuqi Zheng et.al. | 2312.15993 | null |
2023-12-26 | Attention-aware Social Graph Transformer Networks for Stochastic Trajectory Prediction | Yao Liu et.al. | 2312.15881 | null |
2023-12-25 | Contrastive Learning-Based Framework for Sim-to-Real Mapping of Lidar Point Clouds in Autonomous Driving Systems | Hamed Haghighi et.al. | 2312.15817 | link |
2023-12-25 | A Survey on Open-Set Image Recognition | Jiayin Sun et.al. | 2312.15571 | null |
2023-12-23 | Pre-trained Trojan Attacks for Visual Recognition | Aishan Liu et.al. | 2312.15172 | null |
2024-02-08 | Scaling Is All You Need: Autonomous Driving with JAX-Accelerated Reinforcement Learning | Moritz Harmel et.al. | 2312.15122 | null |
2023-12-26 | Lift-Attend-Splat: Bird’s-eye-view camera-lidar fusion using transformers | James Gunn et.al. | 2312.14919 | null |
2023-12-22 | Explainable Multi-Camera 3D Object Detection with Transformer-Based Saliency Maps | Till Beemelmanns et.al. | 2312.14606 | null |
2023-12-22 | MonoLSS: Learnable Sample Selection For Monocular 3D Detection | Zhenjia Li et.al. | 2312.14474 | link |
2023-12-21 | DriveLM: Driving with Graph Visual Question Answering | Chonghao Sima et.al. | 2312.14150 | link |
2023-12-21 | LingoQA: Video Question Answering for Autonomous Driving | Ana-Maria Marcu et.al. | 2312.14115 | link |
2023-12-20 | Building Lane-Level Maps from Aerial Images | Jiawei Yao et.al. | 2312.13449 | link |
2023-12-20 | NeLF-Pro: Neural Light Field Probes | Zinuo You et.al. | 2312.13328 | null |
2023-12-19 | RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios | Wenhao Ding et.al. | 2312.13303 | null |
2023-12-29 | AccidentGPT: Accident Analysis and Prevention from V2X Environmental Perception with Multi-modal Large Model | Lening Wang et.al. | 2312.13156 | null |
2024-01-10 | Optimizing Ego Vehicle Trajectory Prediction: The Graph Enhancement Approach | Sushil Sharma et.al. | 2312.13104 | null |
2024-01-17 | PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation | Yue-Jiang Dong et.al. | 2312.13066 | null |
2023-12-20 | TADAP: Trajectory-Aided Drivable area Auto-labeling with Pre-trained self-supervised features in winter driving conditions | Eerik Alamikkotervo et.al. | 2312.12954 | null |
2023-12-20 | PointeNet: A Lightweight Framework for Effective and Efficient Point Cloud Analysis | Lipeng Gu et.al. | 2312.12743 | null |
2023-12-19 | Studying the Practices of Testing Machine Learning Software in the Wild | Moses Openja et.al. | 2312.12604 | link |
2024-01-23 | Tracking Any Object Amodally | Cheng-Yen Hsieh et.al. | 2312.12433 | link |
2023-12-19 | First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria | Stefan Schoder et.al. | 2312.12314 | null |
2023-12-19 | M-BEV: Masked BEV Perception for Robust Autonomous Driving | Siran Chen et.al. | 2312.12144 | link |
2023-12-19 | Parameterized Decision-making with Multi-modal Perception for Autonomous Driving | Yuyang Xia et.al. | 2312.11935 | null |
2023-12-19 | Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving | Junkai Xu et.al. | 2312.11837 | link |
2024-01-25 | A Survey of Reasoning with Foundation Models | Jiankai Sun et.al. | 2312.11562 | link |
2023-12-18 | DiffTune-MPC: Closed-Loop Learning for Model Predictive Control | Ran Tao et.al. | 2312.11384 | null |
2024-01-16 | Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects | Min Hua et.al. | 2312.11084 | link |
2023-12-18 | Multi-Correlation Siamese Transformer Network with Dense Connection for 3D Single Object Tracking | Shihao Feng et.al. | 2312.11051 | link |
2024-02-14 | Physics-Informed Representation and Learning: Control and Risk Quantification | Zhuoyuan Wang et.al. | 2312.10594 | link |
2023-12-16 | Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning | Rohan Banerjee et.al. | 2312.10557 | link |
2023-12-19 | Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing | Lyudong Jin et.al. | 2312.10418 | null |
2023-12-15 | Constrained Meta-Reinforcement Learning for Adaptable Safety Guarantee with Differentiable Convex Programming | Minjae Cho et.al. | 2312.10230 | link |
2023-12-15 | Communication-Efficient Soft Actor-Critic Policy Collaboration via Regulated Segment Mixture in Internet of Vehicles | Xiaoxue Yu et.al. | 2312.10123 | null |
2023-12-15 | Neurosymbolic Value-Inspired AI (Why, What, and How) | Amit Sheth et.al. | 2312.09928 | null |
2023-12-15 | NeuroFlow: Development of lightweight and efficient model integration scheduling strategy for autonomous driving system | Eunbin Seo et.al. | 2312.09588 | null |
2024-02-28 | Embodied Adversarial Attack: A Dynamic Robust Physical Attack in Autonomous Driving | Yitong Sun et.al. | 2312.09554 | null |
2023-12-15 | DriveTrack: A Benchmark for Long-Range Point Tracking in Real-World Videos | Arjun Balasingam et.al. | 2312.09523 | null |
2023-12-26 | SlowTrack: Increasing the Latency of Camera-based Perception in Autonomous Driving Using Adversarial Examples | Chen Ma et.al. | 2312.09520 | null |
2023-12-15 | EDA: Evolving and Distinct Anchors for Multimodal Motion Prediction | Longzhong Lin et.al. | 2312.09501 | link |
2024-02-04 | Large Language Models for Autonomous Driving: Real-World Experiments | Can Cui et.al. | 2312.09397 | null |
2023-12-25 | DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving | Wenhai Wang et.al. | 2312.09245 | link |
2023-12-14 | OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields | Chubin Zhang et.al. | 2312.09243 | link |
2023-12-15 | 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting | Zhiyin Qian et.al. | 2312.09228 | null |
2023-12-13 | An Invitation to Deep Reinforcement Learning | Bernhard Jaeger et.al. | 2312.08365 | null |
2023-12-14 | Semi-Supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix | Kewei Wang et.al. | 2312.08009 | link |
2023-12-13 | Instance-aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning | Yang Jiao et.al. | 2312.08004 | null |
2024-02-27 | DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes | Xiaoyu Zhou et.al. | 2312.07920 | link |
2023-12-11 | Spatiotemporal Event Graphs for Dynamic Scene Understanding | Salman Khan et.al. | 2312.07621 | null |
2023-12-21 | LMDrive: Closed-Loop End-to-End Driving with Large Language Models | Hao Shao et.al. | 2312.07488 | link |
2023-12-12 | Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analysis, and Insights into Open-set Object Discovery | Aitor Martinez Seras et.al. | 2312.07466 | link |
2024-02-25 | How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation | Zhongyi Han et.al. | 2312.07424 | link |
2023-12-12 | Autonomous driving of trucks in off-road environment | Kenny A. Q. Caldas et.al. | 2312.07382 | null |
2023-12-17 | MWSIS: Multimodal Weakly Supervised Instance Segmentation with 2D Box Annotations for Autonomous Driving | Guangfeng Jiang et.al. | 2312.06988 | link |
2023-12-11 | Scalable Decentralized Cooperative Platoon using Multi-Agent Deep Reinforcement Learning | Ahmed Abdelrahman et.al. | 2312.06858 | null |
2023-12-10 | Dynamic Adversarial Attacks on Autonomous Driving Systems | Amirhosein Chahe et.al. | 2312.06701 | link |
2023-12-15 | BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving | Haicheng Liao et.al. | 2312.06371 | link |
2023-12-11 | NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup Annotations | Yuichi Inoue et.al. | 2312.06352 | link |
2023-12-11 | Evaluation of Large Language Models for Decision Making in Autonomous Driving | Kotaro Tanahashi et.al. | 2312.06351 | null |
2023-12-11 | Attribute Annotation and Bias Evaluation in Visual Datasets for Autonomous Driving | David Fernández Llorca et.al. | 2312.06306 | link |
2023-12-11 | Interpretable Long Term Waypoint-Based Trajectory Prediction Model | Amina Ghoul et.al. | 2312.06219 | null |
2023-12-11 | Recent Advances in Deterministic Human Motion Prediction: A Review | Tenghao Deng et.al. | 2312.06184 | null |
2023-12-11 | M3SOT: Multi-frame, Multi-field, Multi-space 3D Single Object Tracking | Jiaming Liu et.al. | 2312.06117 | link |
2023-12-10 | GenDepth: Generalizing Monocular Depth Estimation for Arbitrary Camera Parameters via Ground Plane Embedding | Karlo Koledić et.al. | 2312.06021 | null |
2023-12-10 | Learning for CasADi: Data-driven Models in Numerical Optimization | Tim Salzmann et.al. | 2312.05873 | link |
2023-12-10 | Beyond One Model Fits All: Ensemble Deep Learning for Autonomous Vehicles | Hemanth Manjunatha et.al. | 2312.05759 | null |
2023-12-10 | Camera-based 3D Semantic Scene Completion with Sparse Guidance Network | Jianbiao Mei et.al. | 2312.05752 | link |
2023-12-08 | IntrinsicAvatar: Physically Based Inverse Rendering of Dynamic Humans from Monocular Videos via Explicit Ray Tracing | Shaofei Wang et.al. | 2312.05210 | null |
2023-12-08 | An Autonomous Driving model with BEV-V2X Perception, Trajectory Prediction and Driving Planning in Complex Traffic Intersections | Fukang Li et.al. | 2312.05104 | null |
2023-12-08 | Radar Perception in Autonomous Driving: Exploring Different Data Representations | Shanliang Yao et.al. | 2312.04861 | link |
2023-12-07 | Fine-Grained Extraction of Road Networks via Joint Learning of Connectivity and Segmentation | Yijia Xu et.al. | 2312.04744 | null |
2023-12-07 | PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction | Apoorva Sharma et.al. | 2312.04658 | null |
2023-12-07 | MuRF: Multi-Baseline Radiance Fields | Haofei Xu et.al. | 2312.04565 | link |
2023-12-07 | FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation | Xiang Xu et.al. | 2312.04484 | link |
2023-12-07 | Deep Dynamics: Vehicle Dynamics Modeling with a Physics-Informed Neural Network for Autonomous Racing | John Chrosniak et.al. | 2312.04374 | null |
2023-12-07 | LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs | Yunsheng Ma et.al. | 2312.04372 | link |
2023-12-27 | Towards Knowledge-driven Autonomous Driving | Xin Li et.al. | 2312.04316 | link |
2023-12-07 | Residual Graph Convolutional Network for Bird’s-Eye-View Semantic Segmentation | Qiuxiao Chen et.al. | 2312.04044 | null |
2023-12-15 | Natural-language-driven Simulation Benchmark and Copilot for Efficient Production of Object Interactions in Virtual Road Scenes | Kairui Yang et.al. | 2312.04008 | null |
2023-12-06 | Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving | Ming Nie et.al. | 2312.03661 | link |
2023-12-06 | GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models | Haicheng Liao et.al. | 2312.03543 | link |
2024-01-24 | Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future | Hongyang Li et.al. | 2312.03408 | link |
2023-12-05 | DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control | Yuru Jia et.al. | 2312.03048 | null |
2023-12-05 | Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving? | Zhiqi Li et.al. | 2312.03031 | link |
2023-12-05 | LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models | Hao Zhang et.al. | 2312.02949 | link |
2023-12-06 | WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation | Jiachen Lu et.al. | 2312.02934 | link |
2023-12-05 | Experimental Insights Towards Explainable and Interpretable Pedestrian Crossing Prediction | Angie Nataly Melo et.al. | 2312.02872 | null |
2023-12-05 | Estimation of articulated angle in six-wheeled dump trucks using multiple GNSS receivers for autonomous driving | Taro Suzuki et.al. | 2312.02510 | null |
2023-12-05 | Object Importance Estimation using Counterfactual Reasoning for Intelligent Driving | Pranay Gupta et.al. | 2312.02467 | null |
2024-02-05 | MGTR: Multi-Granular Transformer for Motion Prediction with LiDAR | Yiqian Gan et.al. | 2312.02409 | null |
2023-12-04 | PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness | Anh-Quan Cao et.al. | 2312.02158 | link |
2023-12-04 | COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction | Qihang Ma et.al. | 2312.01919 | link |
2023-12-04 | Analyze Drivers’ Intervention Behavior During Autonomous Driving – A VR-incorporated Approach | Zheng Xu et.al. | 2312.01669 | null |
2024-01-09 | Exploring Adversarial Robustness of LiDAR-Camera Fusion Model in Autonomous Driving | Bo Yang et.al. | 2312.01468 | null |
2023-12-02 | A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment | Lei Yuan et.al. | 2312.01058 | null |
2023-12-01 | AV4EV: Open-Source Modular Autonomous Electric Vehicle Platform to Make Mobility Research Accessible | Zhijie Qiao et.al. | 2312.00951 | link |
2023-12-18 | Empowering Autonomous Driving with Large Language Models: A Safety Perspective | Yixuan Wang et.al. | 2312.00812 | null |
2023-12-01 | Towards Efficient 3D Object Detection in Bird’s-Eye-View Space for Autonomous Driving: A Convolutional-Only Approach | Yuxin Li et.al. | 2312.00633 | null |
2023-12-01 | Dolphins: Multimodal Language Model for Driving | Yingzi Ma et.al. | 2312.00438 | null |
2023-12-01 | Improving Efficiency of DNN-based Relocalization Module for Autonomous Driving with Server-side Computing | Dengbo Li et.al. | 2312.00316 | null |
2023-11-30 | EpiTESTER: Testing Autonomous Vehicles with Epigenetic Algorithm and Attention Mechanism | Chengjie Lu et.al. | 2312.00207 | link |
2023-11-30 | GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs | Gege Gao et.al. | 2312.00093 | null |
2023-11-30 | Evaluating the Impact of Flaky Simulators on Testing Autonomous Driving Systems | Mohammad Hossein Amini et.al. | 2311.18768 | link |
2023-11-30 | VREM-FL: Mobility-Aware Computation-Scheduling Co-Design for Vehicular Federated Learning | Luca Ballotta et.al. | 2311.18741 | link |
2023-11-30 | Heterogeneous Graph-based Trajectory Prediction using Local Map Context and Social Interactions | Daniel Grimm et.al. | 2311.18553 | null |
2023-11-30 | Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control | Bernd Frauenknecht et.al. | 2311.18393 | null |
2023-11-30 | Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent | Yuxiao Chen et.al. | 2311.18307 | null |
2023-11-29 | Game Projection and Robustness for Game-Theoretic Autonomous Driving | Mushuang Liu et.al. | 2311.18074 | null |
2023-11-29 | Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving | Yuqi Wang et.al. | 2311.17918 | link |
2023-12-07 | Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications | Junyi Ma et.al. | 2311.17663 | link |
2023-11-29 | Erasing the Ephemeral: Joint Camera Refinement and Transient Object Removal for Street View Synthesis | Mreenav Shyam Deka et.al. | 2311.17634 | null |
2023-11-28 | DepthSSC: Depth-Spatial Alignment and Dynamic Voxel Resolution for Monocular 3D Semantic Scene Completion | Jiawei Yao et.al. | 2311.17084 | null |
2023-12-11 | UC-NeRF: Neural Radiance Field for Under-Calibrated Multi-view Cameras in Autonomous Driving | Kai Cheng et.al. | 2311.16945 | null |
2023-11-28 | Panacea: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2311.16813 | null |
2023-11-28 | Towards Full-scene Domain Generalization in Multi-agent Collaborative Bird’s Eye View Segmentation for Connected and Autonomous Driving | Senkang Hu et.al. | 2311.16754 | link |
2023-11-28 | DGNR: Density-Guided Neural Point Rendering of Large Driving Scenes | Zhuopeng Li et.al. | 2311.16664 | null |
2023-11-27 | Mip-Splatting: Alias-free 3D Gaussian Splatting | Zehao Yu et.al. | 2311.16493 | null |
2023-11-27 | OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving | Wenzhao Zheng et.al. | 2311.16038 | link |
2023-11-27 | SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance Fields | Quentin Herau et.al. | 2311.15803 | null |
2023-11-27 | Technical Report for Argoverse Challenges on 4D Occupancy Forecasting | Pengfei Zheng et.al. | 2311.15660 | null |
2023-11-27 | Technical Report for Argoverse Challenges on Unified Sensor-based Detection, Tracking, and Forecasting | Zhepeng Wang et.al. | 2311.15615 | null |
2023-11-27 | Sparse Pedestrian Character Learning for Trajectory Prediction | Yonghao Dong et.al. | 2311.15512 | null |
2023-11-26 | GAN-Based LiDAR Intensity Simulation | Richard Marcus et.al. | 2311.15415 | null |
2023-12-05 | NeuRAD: Neural Rendering for Autonomous Driving | Adam Tonderski et.al. | 2311.15260 | link |
2023-11-26 | CalibFormer: A Transformer-based Automatic LiDAR-Camera Calibration Network | Yuxuan Xiao et.al. | 2311.15241 | null |
2023-11-25 | OpenNet: Incremental Learning for Autonomous Driving Object Detection with Balanced Loss | Zezhou Wang et.al. | 2311.14939 | null |
2023-11-25 | GBD-TS: Goal-based Pedestrian Trajectory Prediction with Diffusion using Tree Sampling Algorithm | Ge Sun et.al. | 2311.14922 | null |
2023-11-24 | GPT-4V Takes the Wheel: Evaluating Promise and Challenges for Pedestrian Behavior Prediction | Jia Huang et.al. | 2311.14786 | null |
2023-11-24 | Safety Assessment of Vehicle Characteristics Variations in Autonomous Driving Systems | Qi Pan et.al. | 2311.14461 | link |
2023-11-23 | Security and Privacy Challenges in Deep Learning Models | Gopichandh Golla et.al. | 2311.13744 | null |
2023-11-22 | Visual In-Context Prompting | Feng Li et.al. | 2311.13601 | link |
2023-11-22 | WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space | Katja Schwarz et.al. | 2311.13570 | null |
2023-11-22 | ADriver-I: A General World Model for Autonomous Driving | Fan Jia et.al. | 2311.13549 | null |
2023-11-22 | An Empirical Study of Uncertainty Estimation Techniques for Detecting Drift in Data Streams | Anton Winter et.al. | 2311.13374 | null |
2023-11-22 | DoubleAUG: Single-domain Generalized Object Detector in Urban via Color Perturbation and Dual-style Memory | Lei Qi et.al. | 2311.13198 | null |
2023-10-25 | TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs | Haotian Tang et.al. | 2311.12862 | link |
2023-11-29 | SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction | Yuanhui Huang et.al. | 2311.12754 | link |
2023-11-21 | Attacking Motion Planners Using Adversarial Perception Errors | Jonathan Sadeghi et.al. | 2311.12722 | null |
2023-11-21 | A Survey on Multimodal Large Language Models for Autonomous Driving | Can Cui et.al. | 2311.12320 | link |
2023-12-04 | Applications of Large Scale Foundation Models for Autonomous Driving | Yu Huang et.al. | 2311.12144 | null |
2023-11-18 | FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin | Zichen Yu et.al. | 2311.12058 | link |
2023-11-20 | Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems | Guangjing Wang et.al. | 2311.11796 | null |
2023-11-23 | MUVO: A Multimodal Generative World Model for Autonomous Driving with Geometric Representations | Daniel Bogdoll et.al. | 2311.11762 | link |
2023-11-20 | A Large-Scale Car Parts (LSCP) Dataset for Lightweight Fine-Grained Detection | Wang Jie et.al. | 2311.11754 | null |
2023-11-20 | Sparse4D v3: Advancing End-to-End 3D Detection and Tracking | Xuewu Lin et.al. | 2311.11722 | link |
2023-11-19 | Pair-wise Layer Attention with Spatial Masking for Video Prediction | Ping Li et.al. | 2311.11289 | link |
2023-11-19 | Multi-Timescale Control and Communications with Deep Reinforcement Learning – Part I: Communication-Aware Vehicle Control | Tong Liu et.al. | 2311.11281 | null |
2023-11-18 | Tactics2D: A Multi-agent Reinforcement Learning Environment for Driving Decision-making | Yueyuan Li et.al. | 2311.11058 | link |
2023-11-18 | A Survey of Simulators for Autonomous Driving: Taxonomy, Challenges, and Evaluation Metrics | Yueyuan Li et.al. | 2311.11056 | null |
2023-11-27 | A Language Agent for Autonomous Driving | Jiageng Mao et.al. | 2311.10813 | link |
2023-11-21 | Safety-aware Causal Representation for Trustworthy Reinforcement Learning in Autonomous Driving | Haohong Lin et.al. | 2311.10747 | null |
2023-11-17 | Mind the map! Accounting for existing map information when estimating online HDMaps from sensor data | Rémy Sun et.al. | 2311.10517 | link |
2023-11-17 | Cooperative Perception with Learning-Based V2V communications | Chenguang Liu et.al. | 2311.10336 | null |
2023-11-17 | Imagination-augmented Hierarchical Reinforcement Learning for Safe and Interactive Autonomous Driving in Urban Environments | Sang-Hyun Lee et.al. | 2311.10309 | null |
2023-11-17 | Vision meets mmWave Radar: 3D Object Perception Benchmark for Autonomous Driving | Yizhou Wang et.al. | 2311.10261 | null |
2023-11-16 | Interpretable Reinforcement Learning for Robotics and Continuous Control | Rohan Paleja et.al. | 2311.10041 | link |
2023-11-16 | On the Overconfidence Problem in Semantic 3D Mapping | Joao Marcos Correia Marques et.al. | 2311.10018 | link |
2023-11-16 | Scan statistics for the detection of anomalies in M-dependent random fields with applications to image data | Claudia Kirch et.al. | 2311.09961 | null |
2023-11-16 | Automatic Generation of Scenarios for System-level Simulation-based Verification of Autonomous Driving Systems | Srajan Goyal et.al. | 2311.09784 | null |
2023-11-16 | Applications of Computer Vision in Autonomous Vehicles: Methods, Challenges and Future Directions | Xingshuai Dong et.al. | 2311.09093 | null |
2023-11-14 | Low-light Pedestrian Detection in Visible and Infrared Image Feeds: Issues and Challenges | Hrishikesh Vachhani et.al. | 2311.08557 | null |
2023-11-14 | Human-Centric Autonomous Systems With LLMs for User Command Reasoning | Yi Yang et.al. | 2311.08206 | link |
2023-11-14 | Lateral control for autonomous vehicles: A comparative evaluation | Antonio Artuñedo et.al. | 2311.07987 | null |
2023-11-14 | Towards Improving Robustness Against Common Corruptions in Object Detectors Using Adversarial Contrastive Learning | Shashank Kotyan et.al. | 2311.07928 | null |
2023-11-13 | Temporal Performance Prediction for Deep Convolutional Long Short-Term Memory Networks | Laura Fieback et.al. | 2311.07477 | null |
2023-11-11 | Semantic Communication for Cooperative Perception based on Importance Map | Yucheng Sheng et.al. | 2311.06498 | null |
2023-11-10 | Improved Positional Encoding for Implicit Neural Representation based Compact Data Representation | Bharath Bhushan Damodaran et.al. | 2311.06059 | null |
2023-11-10 | Refining the ONCE Benchmark with Hyperparameter Tuning | Maksim Golyadkin et.al. | 2311.06054 | null |
2023-11-10 | Deep learning for 3D Object Detection and Tracking in Autonomous Driving: A Brief Survey | Yang Peng et.al. | 2311.06043 | null |
2023-11-10 | Quantized Distillation: Optimizing Driver Activity Recognition Models for Resource-Constrained Environments | Calvin Tanama et.al. | 2311.05970 | link |
2023-11-09 | Real-time Control of Electric Autonomous Mobility-on-Demand Systems via Graph Reinforcement Learning | Aaryan Singhal et.al. | 2311.05780 | link |
2023-11-09 | Multi-Agent Quantum Reinforcement Learning using Evolutionary Optimization | Michael Kölle et.al. | 2311.05546 | link |
2023-11-28 | On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Licheng Wen et.al. | 2311.05332 | link |
2023-11-09 | TLCFuse: Temporal Multi-Modality Fusion Towards Occlusion-Aware Semantic Segmentation-Aided Motion Planning | Gustavo Salazar-Gomez et.al. | 2311.05319 | null |
2023-11-08 | Lidar Annotation Is All You Need | Dinar Sharafutdinov et.al. | 2311.04777 | link |
2023-11-08 | Image Patch-Matching with Graph-Based Learning in Street Scenes | Rui She et.al. | 2311.04617 | null |
2023-11-09 | Rethinking Human Pose Estimation for Autonomous Driving with 3D Event Representations | Xiaoting Yin et.al. | 2311.04591 | link |
2023-11-08 | FFINet: Future Feedback Interaction Network for Motion Forecasting | Miao Kang et.al. | 2311.04512 | null |
2023-11-08 | PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds | Hao Yang et.al. | 2311.04501 | null |
2023-11-12 | What Makes a Fantastic Passenger-Car Driver in Urban Contexts? | Yueteng Yu et.al. | 2311.04150 | null |
2023-11-07 | Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps | Katie Z Luo et.al. | 2311.04079 | link |
2023-11-07 | AGNES: Abstraction-guided Framework for Deep Neural Networks Security | Akshay Dhonthi et.al. | 2311.04009 | null |
2023-11-06 | COLA: COarse-LAbel multi-source LiDAR semantic segmentation for autonomous driving | Jules Sanchez et.al. | 2311.03017 | null |
2023-11-06 | IR-STP: Enhancing Autonomous Driving with Interaction Reasoning in Spatio-Temporal Planning | Yingbing Chen et.al. | 2311.02850 | link |
2023-11-06 | Flexible Multi-Generator Model with Fused Spatiotemporal Graph for Trajectory Prediction | Peiyuan Zhu et.al. | 2311.02835 | null |
2023-11-05 | Deep Learning-based 3D Point Cloud Classification: A Systematic Survey and Outlook | Huang Zhang et.al. | 2311.02608 | null |
2023-11-04 | Uncertainty Quantification of Deep Learning for Spatiotemporal Data: Challenges and Opportunities | Wenchong He et.al. | 2311.02485 | null |
2023-11-04 | Levels of AGI: Operationalizing Progress on the Path to AGI | Meredith Ringel Morris et.al. | 2311.02462 | null |
2023-11-04 | P2O-Calib: Camera-LiDAR Calibration Using Point-Pair Spatial Occlusion Relationship | Su Wang et.al. | 2311.02413 | null |
2023-11-04 | Continual Learning of Unsupervised Monocular Depth from Videos | Hemang Chawla et.al. | 2311.02393 | link |
2023-11-04 | OSM vs HD Maps: Map Representations for Trajectory Prediction | Jing-Yan Liao et.al. | 2311.02305 | null |
2023-11-03 | EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision | Jiawei Yang et.al. | 2311.02077 | null |
2023-11-03 | Quantitative Evaluation of a Multi-Modal Camera Setup for Fusing Event Data with RGB Images | Julian Moosmann et.al. | 2311.01881 | null |
2023-11-03 | Multi-LiDAR Localization and Mapping Pipeline for Urban Autonomous Driving | Florian Sauerbeck et.al. | 2311.01823 | null |
2023-11-30 | Towards Calibrated Robust Fine-Tuning of Vision-Language Models | Changdae Oh et.al. | 2311.01723 | link |
2023-11-03 | Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Haibao Yu et.al. | 2311.01682 | link |
2023-11-02 | DRNet: A Decision-Making Method for Autonomous Lane Changingwith Deep Reinforcement Learning | Kunpeng Xu et.al. | 2311.01602 | null |
2023-11-02 | Adversary ML Resilience in Autonomous Driving Through Human Centered Perception Mechanisms | Aakriti Shah et.al. | 2311.01478 | null |
2023-11-02 | Conformal Policy Learning for Sensorimotor Control Under Distribution Shifts | Huang Huang et.al. | 2311.01457 | null |
2023-11-02 | Efficient Vision Transformer for Accurate Traffic Sign Detection | Javad Mirzapour Kaleybar et.al. | 2311.01429 | null |
2023-11-04 | CenterRadarNet: Joint 3D Object Detection and Tracking Framework using 4D FMCW Radar | Jen-Hao Cheng et.al. | 2311.01423 | null |
2023-11-27 | LLM4Drive: A Survey of Large Language Models for Autonomous Driving | Zhenjie Yang et.al. | 2311.01043 | link |
2023-11-24 | Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion | Lunjun Zhang et.al. | 2311.01017 | null |
2023-11-02 | CML-MOTS: Collaborative Multi-task Learning for Multi-Object Tracking and Segmentation | Yiming Cui et.al. | 2311.00987 | null |
2023-11-01 | Learning Cooperative Trajectory Representations for Motion Forecasting | Hongzhi Ruan et.al. | 2311.00371 | link |
2023-10-31 | Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving | Luca Cultrera et.al. | 2310.20650 | null |
2023-10-31 | FLODCAST: Flow and Depth Forecasting via Multimodal Recurrent Architectures | Andrea Ciamarra et.al. | 2310.20593 | null |
2023-10-31 | Predictive Control for Autonomous Driving with Uncertain, Multi-modal Predictions | Siddharth H. Nair et.al. | 2310.20561 | null |
2023-10-31 | Collaborative Decision-Making Using Spatiotemporal Graphs in Connected Autonomy | Peng Gao et.al. | 2310.20491 | null |
2023-11-01 | Enhancing the Spatial Awareness Capability of Multi-Modal Large Language Model | Yongqiang Zhao et.al. | 2310.20357 | null |
2023-10-31 | HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds | Gang Zhang et.al. | 2310.20234 | link |
2023-10-30 | Large Trajectory Models are Scalable Motion Predictors and Planners | Qiao Sun et.al. | 2310.19620 | link |
2023-10-31 | Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles | Luca Crosato et.al. | 2310.18891 | null |
2023-11-07 | ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object Detection | Weijia Zhang et.al. | 2310.18620 | link |
2023-11-22 | Interactive Joint Planning for Autonomous Vehicles | Yuxiao Chen et.al. | 2310.18301 | null |
2023-10-27 | Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning | Nicholas E. Corrado et.al. | 2310.18247 | null |
2023-10-27 | Fine-Tuning Language Models Using Formal Methods Feedback | Yunhao Yang et.al. | 2310.18239 | null |
2023-10-27 | Siamese-DETR for Generic Multi-Object Tracking | Qiankun Liu et.al. | 2310.17875 | link |
2023-10-26 | Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models | Tsun-Hsuan Wang et.al. | 2310.17642 | null |
2023-10-26 | EqDrive: Efficient Equivariant Motion Forecasting with Multi-Modality for Autonomous Driving | Yuping Wang et.al. | 2310.17540 | null |
2023-10-26 | Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving | Gilles Puy et.al. | 2310.17504 | link |
2023-10-30 | A Hybrid Graph Network for Complex Activity Detection in Video | Salman Khan et.al. | 2310.17493 | null |
2023-10-26 | YOLO-BEV: Generating Bird’s-Eye View in the Same Way as 2D Object Detection | Chang Liu et.al. | 2310.17379 | null |
2023-10-27 | Navigating Data Heterogeneity in Federated Learning A Semi-Supervised Approach for Object Detection | Taehyeon Kim et.al. | 2310.17097 | link |
2023-10-25 | Using Knowledge Awareness to improve Safety of Autonomous Driving | Andrea Calvagna et.al. | 2310.16760 | null |
2023-10-26 | Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving | Jessica Echterhoff et.al. | 2310.16639 | link |
2023-10-25 | ParisLuco3D: A high-quality target dataset for domain generalization of LiDAR perception | Jules Sanchez et.al. | 2310.16542 | null |
2023-10-25 | MVFAN: Multi-View Feature Assisted Network for 4D Radar Object Detection | Qiao Yan et.al. | 2310.16389 | null |
2023-10-24 | Pixel-Level Clustering Network for Unsupervised Image Segmentation | Cuong Manh Hoang et.al. | 2310.16234 | null |
2023-10-24 | Data-driven Traffic Simulation: A Comprehensive Review | Di Chen et.al. | 2310.15975 | null |
2023-10-24 | Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation | Yinjie Lei et.al. | 2310.15676 | null |
2023-10-24 | Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection | Linyan Huang et.al. | 2310.15670 | link |
2023-10-23 | RoboDepth: Robust Out-of-Distribution Depth Estimation under Corruptions | Lingdong Kong et.al. | 2310.15171 | link |
2023-10-23 | P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation | Mohammed A. M. Elhassan et.al. | 2310.15025 | null |
2023-10-23 | End-to-End Learning of Behavioural Inputs for Autonomous Driving in Dense Traffic | Jatan Shrestha et.al. | 2310.14766 | link |
2023-10-23 | BM2CP: Efficient Collaborative Perception with LiDAR-Camera Modalities | Binyu Zhao et.al. | 2310.14702 | link |
2023-10-23 | DICE: Diverse Diffusion Model with Scoring for Trajectory Prediction | Younwoo Choi et.al. | 2310.14570 | null |
2023-10-22 | Vision Language Models in Autonomous Driving and Intelligent Transportation Systems | Xingcheng Zhou et.al. | 2310.14414 | link |
2023-10-22 | Detrive: Imitation Learning with Transformer Detection for End-to-End Autonomous Driving | Daoming Chen et.al. | 2310.14224 | link |
2023-10-21 | Equivariant Map and Agent Geometry for Autonomous Driving Motion Prediction | Yuping Wang et.al. | 2310.13922 | null |
2023-10-21 | Exploring Driving Behavior for Autonomous Vehicles Based on Gramian Angular Field Vision Transformer | Junwei You et.al. | 2310.13906 | null |
2023-10-20 | Transformers for Trajectory Optimization with Application to Spacecraft Rendezvous | Tommaso Guffanti et.al. | 2310.13831 | null |
2023-10-20 | OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal 3D Data | Yijie Zhou et.al. | 2310.13398 | link |
2023-10-20 | Combining Policy Gradient and Safety-Based Control for Autonomous Driving | Xi Xiong et.al. | 2310.13314 | null |
2023-10-20 | Higher or Lower: Challenges in Object based SLAM | Zhihe Zhang et.al. | 2310.13256 | null |
2023-11-10 | LeTFuser: Light-weight End-to-end Transformer-Based Sensor Fusion for Autonomous Driving with Multi-Task Learning | Pedram Agand et.al. | 2310.13135 | link |
2023-10-19 | NeuroSMPC: A Neural Network guided Sampling Based MPC for On-Road Autonomous Driving | Kaustab Pal et.al. | 2310.13077 | null |
2023-10-19 | Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative Pose Encoding | Zhejun Zhang et.al. | 2310.12970 | link |
2023-10-18 | Monte-Carlo Tree Search for Behavior Planning in Autonomous Driving | Qianfeng Wen et.al. | 2310.12075 | link |
2023-10-19 | One-Bit Byzantine-Tolerant Distributed Learning via Over-the-Air Computation | Yuhan Yang et.al. | 2310.11998 | null |
2023-10-18 | Malicious Agent Detection for Robust Multi-Agent Collaborative Perception | Yangheng Zhao et.al. | 2310.11901 | null |
2023-10-18 | Using Experience Classification for Training Non-Markovian Tasks | Ruixuan Miao et.al. | 2310.11678 | null |
2023-10-17 | Non-ergodicity in reinforcement learning: robustness via ergodicity transformations | Dominik Baumann et.al. | 2310.11335 | link |
2023-10-17 | LiDAR-based 4D Occupancy Completion and Forecasting | Xinhao Liu et.al. | 2310.11239 | link |
2023-10-19 | Path Following Control of Automated Vehicle Considering Uncertainties and Disturbances with Parametric Varying | Dan Shen et.al. | 2310.10925 | null |
2023-10-16 | GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers | Takeru Miyato et.al. | 2310.10375 | link |
2023-10-16 | BEVGPT: Generative Pre-trained Large Model for Autonomous Driving Prediction, Decision-Making, and Planning | Pengqin Wang et.al. | 2310.10357 | null |
2023-10-16 | Multimodal Object Query Initialization for 3D Object Detection | Mathijs R. van Geerenstein et.al. | 2310.10353 | null |
2023-10-16 | SoTTA: Robust Test-Time Adaptation on Noisy Data Streams | Taesik Gong et.al. | 2310.10074 | link |
2023-10-14 | Real-Time Traffic Sign Detection: A Case Study in a Santa Clara Suburban Neighborhood | Harish Loghashankar et.al. | 2310.09630 | null |
2023-10-20 | JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues | Jiayi Ji et.al. | 2310.09503 | link |
2023-10-13 | Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving | Feng Jiang et.al. | 2310.08826 | null |
2023-10-12 | PU-Ray: Point Cloud Upsampling via Ray Marching on Implicit Surface | Sangwon Lim et.al. | 2310.08755 | link |
2023-10-12 | Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research | Cole Gulino et.al. | 2310.08710 | null |
2023-10-12 | Data-driven Invariance for Reference Governors | Ali Kashani et.al. | 2310.08679 | null |
2023-10-16 | Deep Reinforcement Learning for Autonomous Vehicle Intersection Navigation | Badr Ben Elallid et.al. | 2310.08595 | null |
2023-10-12 | Performance/power assessment of CNN packages on embedded automotive platforms | Paolo Burgio et.al. | 2310.08401 | null |
2023-10-12 | UniPAD: A Universal Pre-training Paradigm for Autonomous Driving | Honghui Yang et.al. | 2310.08370 | link |
2023-10-12 | Impact of multi-armed bandit strategies on deep recurrent reinforcement learning | Valentina Zangirolami et.al. | 2310.08331 | link |
2023-10-12 | NSM4D: Neural Scene Model Based Online 4D Point Cloud Sequence Understanding | Yuhao Dong et.al. | 2310.08326 | null |
2023-10-12 | If our aim is to build morality into an artificial agent, how might we begin to go about doing so? | Reneira Seeamber et.al. | 2310.08295 | null |
2023-10-12 | Hilbert Space Embedding-based Trajectory Optimization for Multi-Modal Uncertain Obstacle Trajectory Prediction | Basant Sharma et.al. | 2310.08270 | link |
2023-10-12 | GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2310.08261 | null |
2023-10-12 | DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception | Xianghao Kong et.al. | 2310.08117 | link |
2023-10-20 | Model Predictive Inferential Control of Neural State-Space Models for Autonomous Vehicle Motion Planning | Iman Askari et.al. | 2310.08045 | null |
2023-10-12 | EC-Depth: Exploring the consistency of self-supervised monocular depth estimation under challenging scenes | Ruijie Zhu et.al. | 2310.08044 | link |
2023-10-12 | Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous Vehicles | Can Cui et.al. | 2310.08034 | null |
2023-10-12 | HeightFormer: A Multilevel Interaction and Image-adaptive Classification-regression Network for Monocular Height Estimation with Aerial Images | Zhan Chen et.al. | 2310.07995 | null |
2023-10-11 | CRITERIA: a New Benchmarking Paradigm for Evaluating Trajectory Prediction Models for Autonomous Driving | Changhe Chen et.al. | 2310.07794 | link |
2023-10-11 | DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model | Xiaofan Li et.al. | 2310.07771 | link |
2023-10-23 | Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous Driving | Xinyu Zhang et.al. | 2310.07602 | link |
2023-10-11 | Metamorphic Runtime Monitoring of Autonomous Driving Systems | Jon Ayerdi et.al. | 2310.07414 | link |
2023-10-11 | LESS-Map: Lightweight and Evolving Semantic Map in Parking Lots for Long-term Self-Localization | Mingrui Liu et.al. | 2310.07390 | null |
2023-10-11 | Optimizing the Placement of Roadside LiDARs for Autonomous Driving | Wentao Jiang et.al. | 2310.07247 | null |
2023-10-11 | Integrated Sensing and Communication enabled Multiple Base Stations Cooperative Sensing Towards 6G | Zhiqing Wei et.al. | 2310.07180 | null |
2023-11-01 | TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning | Dongming Wu et.al. | 2310.06753 | link |
2023-10-10 | Safe-by-Construction Autonomous Vehicle Overtaking using Control Barrier Functions and Model Predictive Control | Dingran Yuan et.al. | 2310.06553 | null |
2023-10-09 | Layout Sequence Prediction From Noisy Mobile Modality | Haichao Zhang et.al. | 2310.06138 | null |
2023-10-07 | DynamicBEV: Leveraging Dynamic Queries and Temporal Context for 3D Object Detection | Jiawei Yao et.al. | 2310.05989 | link |
2023-10-09 | DTPP: Differentiable Joint Conditional Prediction and Cost Evaluation for Tree Policy Planning in Autonomous Driving | Zhiyu Huang et.al. | 2310.05885 | link |
2023-10-09 | Joint object detection and re-identification for 3D obstacle multi-camera systems | Irene Cortés et.al. | 2310.05785 | null |
2023-10-09 | GPS Attack Detection and Mitigation for Safe Autonomous Driving using Image and Map based Lateral Direction Localization | Qingming Chen et.al. | 2310.05407 | null |
2023-10-08 | Influence of Camera-LiDAR Configuration on 3D Object Detection for Autonomous Driving | Ye Li et.al. | 2310.05245 | link |
2023-10-08 | Indoor Localization for an Autonomous Model Car: A Marker-Based Multi-Sensor Fusion Framework | Xibo Li et.al. | 2310.05198 | null |
2023-10-08 | DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather Data | Chengjie Lu et.al. | 2310.05170 | link |
2023-10-08 | A Privacy-Preserving Trajectory Synthesis Method Based on Vector Translation Invariance Supporting Traffic Constraints | Zechen Liu et.al. | 2310.05091 | null |
2023-10-08 | An Anomaly Behavior Analysis Framework for Securing Autonomous Vehicle Perception | Murad Mehrab Abrar et.al. | 2310.05041 | link |
2023-10-07 | Combining UPerNet and ConvNeXt for Contrails Identification to reduce Global Warming | Zhenkuan Wang et.al. | 2310.04808 | link |
2023-10-07 | Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Semantic Segmentation | Jingyi Pan et.al. | 2310.04747 | null |
2023-10-06 | DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions | Sanket Kalwar et.al. | 2310.04181 | null |
2023-10-05 | ContactGen: Generative Contact Modeling for Grasp Generation | Shaowei Liu et.al. | 2310.03740 | link |
2023-10-05 | High-Degrees-of-Freedom Dynamic Neural Fields for Robot Self-Modeling and Motion Planning | Lennart Schulze et.al. | 2310.03624 | null |
2023-10-05 | V2X Cooperative Perception for Autonomous Driving: Recent Advances and Challenges | Tao Huang et.al. | 2310.03525 | null |
2023-10-05 | RadaRays: Real-time Simulation of Rotating FMCW Radar for Mobile Robotics via Hardware-accelerated Ray Tracing | Alexander Mock et.al. | 2310.03505 | link |
2023-10-05 | A Two-stage Based Social Preference Recognition in Multi-Agent Autonomous Driving System | Jintao Xue et.al. | 2310.03303 | null |
2023-10-13 | LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving | Hao Sha et.al. | 2310.03026 | null |
2023-10-04 | Curve Trajectory Model for Human Preferred Path Planning of Automated Vehicles | Gergo Igneczi et.al. | 2310.02696 | null |
2023-10-04 | Adaptive Spatio-Temporal Voxels Based Trajectory Planning for Autonomous Driving in Highway Traffic Flow | Zhiqiang Jian et.al. | 2310.02625 | link |
2023-10-03 | Human-Like Autonomous Driving on Dense Traffic | Mustafa Yildirim et.al. | 2310.02477 | null |
2023-10-03 | RSRD: A Road Surface Reconstruction Dataset and Benchmark for Safe and Comfortable Autonomous Driving | Tong Zhao et.al. | 2310.02262 | null |
2023-10-03 | TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation | Yahia Dalbah et.al. | 2310.02260 | link |
2023-10-03 | Talk2BEV: Language-enhanced Bird’s-eye View Maps for Autonomous Driving | Vikrant Dewangan et.al. | 2310.02251 | null |
2023-10-13 | Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving | Long Chen et.al. | 2310.01957 | link |
2023-10-03 | DARTH: Holistic Test-time Adaptation for Multiple Object Tracking | Mattia Segu et.al. | 2310.01926 | link |
2023-10-03 | Trainable Noise Model as an XAI evaluation method: application on Sobol for remote sensing image segmentation | Hossein Shreim et.al. | 2310.01828 | link |
2023-10-03 | Predicting Future Spatiotemporal Occupancy Grids with Semantics for Autonomous Driving | Maneekwan Toyungyernsub et.al. | 2310.01723 | null |
2023-10-10 | You Only Look at Once for Real-time and Generic Multi-Task | Jiayuan Wang et.al. | 2310.01641 | link |
2023-10-02 | Elastic Interaction Energy Loss for Traffic Image Segmentation | Yaxin Feng et.al. | 2310.01449 | null |
2023-10-16 | GPT-Driver: Learning to Drive with GPT | Jiageng Mao et.al. | 2310.01415 | link |
2023-10-08 | DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model | Zhenhua Xu et.al. | 2310.01412 | null |
2023-10-02 | Streaming Motion Forecasting for Autonomous Driving | Ziqi Pang et.al. | 2310.01351 | link |
2023-10-02 | Offline Tracking with Object Permanence | Xianzhong Liu et.al. | 2310.01288 | link |
2023-10-02 | LS-VOS: Identifying Outliers in 3D Object Detections Using Latent Space Virtual Outlier Synthesis | Aldi Piroli et.al. | 2310.00952 | null |
2023-10-05 | Towards Robust 3D Object Detection In Rainy Conditions | Aldi Piroli et.al. | 2310.00944 | null |
2023-10-02 | Every Dataset Counts: Scaling up Monocular 3D Object Detection with Joint Datasets Training | Fulong Ma et.al. | 2310.00920 | null |
2023-10-02 | PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2310.00874 | link |
2023-09-30 | MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings | Lei Yang et.al. | 2310.00400 | null |
2023-09-30 | MMPI: a Flexible Radiance Field Representation by Multiple Multi-plane Images Blending | Yuze He et.al. | 2310.00249 | null |
2023-09-29 | A Survey on Deep Learning Techniques for Action Anticipation | Zeyun Zhong et.al. | 2309.17257 | null |
NeRF + Autonomous Driving
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-15 | Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps | Ningyuan Yang et.al. | 2505.10482 | null |
2025-05-15 | Inferring Driving Maps by Deep Learning-based Trail Map Extraction | Michael Hubbertz et.al. | 2505.10258 | null |
2025-05-15 | Large Wireless Localization Model (LWLM): A Foundation Model for Positioning in 6G Networks | Guangjin Pan et.al. | 2505.10134 | link |
2025-05-15 | Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field | Jinlong Fan et.al. | 2505.10049 | link |
2025-05-15 | Application of YOLOv8 in monocular downward multiple Car Target detection | Shijie Lyu et.al. | 2505.10016 | null |
2025-05-15 | Large-Scale Gaussian Splatting SLAM | Zhe Xin et.al. | 2505.09915 | null |
2025-05-14 | Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object Shapes | Nicola Marinello et.al. | 2505.09562 | null |
2025-05-15 | SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation | Achref Doula et.al. | 2505.09427 | null |
2025-05-14 | MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection | Xiangyuan Peng et.al. | 2505.09422 | null |
2025-05-14 | FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling | Yue Wen et.al. | 2505.09406 | null |
2025-05-14 | APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression | Srinivas Ravuri et.al. | 2505.09356 | link |
2025-05-14 | TransDiffuser: End-to-end Trajectory Generation with Decorrelated Multi-modal Representation for Autonomous Driving | Xuefeng Jiang et.al. | 2505.09315 | null |
2025-05-14 | OpenLKA: An Open Dataset of Lane Keeping Assist from Recent Car Models under Real-world Driving Conditions | Yuhang Wang et.al. | 2505.09092 | link |
2025-05-14 | Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions | Letian Wang et.al. | 2505.09074 | null |
2025-05-13 | Towards Adaptive Meta-Gradient Adversarial Examples for Visual Tracking | Wei-Long Tian et.al. | 2505.08999 | link |
2025-05-13 | Generative AI for Autonomous Driving: Frontiers and Opportunities | Yuping Wang et.al. | 2505.08854 | link |
2025-05-13 | Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving | Zongchuang Zhao et.al. | 2505.08725 | link |
2025-05-13 | Optimal Trajectory Planning with Collision Avoidance for Autonomous Vehicle Maneuvering | Jason Zalev et.al. | 2505.08724 | null |
2025-05-13 | FOCI: Trajectory Optimization on Gaussian Splats | Mario Gomez Andreu et.al. | 2505.08510 | null |
2025-05-13 | A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering | Chuanzhi Xu et.al. | 2505.08438 | null |
2025-05-13 | Explaining Autonomous Vehicles with Intention-aware Policy Graphs | Sara Montese et.al. | 2505.08404 | null |
2025-05-13 | A Practical Introduction to Deep Reinforcement Learning | Yinghan Sun et.al. | 2505.08295 | null |
2025-05-13 | Automatic Curriculum Learning for Driving Scenarios: Towards Robust and Efficient Reinforcement Learning | Ahmed Abouelazm et.al. | 2505.08264 | null |
2025-05-13 | Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix | Unai Gurbindo et.al. | 2505.08228 | null |
2025-05-12 | Topology-Guided Knowledge Distillation for Efficient Point Cloud Processing | Luu Tung Hai et.al. | 2505.08101 | link |
2025-05-12 | Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild | Lintao Xiang et.al. | 2505.07373 | null |
2025-05-12 | Ranking-aware Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2505.07198 | null |
2025-05-11 | DriveSOTIF: Advancing Perception SOTIF Through Multimodal Large Language Models | Shucheng Huang et.al. | 2505.07084 | link |
2025-05-11 | Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation | Seokjun Kwon et.al. | 2505.06951 | null |
2025-05-11 | NeuGen: Amplifying the ‘Neural’ in Neural Radiance Fields for Domain Generalization | Ahmed Qazi et.al. | 2505.06894 | null |
2025-05-11 | Towards Human-Centric Autonomous Driving: A Fast-Slow Architecture Integrating Large Language Model Guidance with Reinforcement Learning | Chengkai Xu et.al. | 2505.06875 | null |
2025-05-11 | Beyond Patterns: Harnessing Causal Logic for Autonomous Driving Trajectory Prediction | Bonan Wang et.al. | 2505.06856 | null |
2025-05-13 | Work-in-Progress: Multi-Deadline DAG Scheduling Model for Autonomous Driving Systems | Atsushi Yano et.al. | 2505.06780 | null |
2025-05-10 | AI-CDA4All: Democratizing Cooperative Autonomous Driving for All Drivers via Affordable Dash-cam Hardware and Open-source AI Software | Shengming Yuan et.al. | 2505.06749 | null |
2025-05-10 | M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | Morui Zhu et.al. | 2505.06746 | null |
2025-05-10 | TPK: Trustworthy Trajectory Prediction Integrating Prior Knowledge For Interpretability and Kinematic Feasibility | Marius Baden et.al. | 2505.06743 | null |
2025-05-10 | Boundary-Guided Trajectory Prediction for Road Aware and Physically Feasible Autonomous Driving | Ahmed Abouelazm et.al. | 2505.06740 | null |
2025-05-10 | Balancing Progress and Safety: A Novel Risk-Aware Objective for RL in Autonomous Driving | Ahmed Abouelazm et.al. | 2505.06737 | null |
2025-05-10 | 3D Characterization of Smoke Plume Dispersion Using Multi-View Drone Swarm | Nikil Krishnakumar et.al. | 2505.06638 | null |
2025-05-10 | A Contrastive Federated Semi-Supervised Learning Intrusion Detection Framework for Internet of Robotic Things | Yifan Zeng et.al. | 2505.06636 | null |
2025-05-10 | RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation | Zhiwen Zeng et.al. | 2505.06515 | null |
2025-05-10 | FlexNeRFer: A Multi-Dataflow, Adaptive Sparsity-Aware Accelerator for On-Device NeRF Rendering | Seock-Hwan Noh et.al. | 2505.06504 | null |
2025-05-10 | Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach | Minting Pan et.al. | 2505.06482 | null |
2025-05-09 | What Do People Want to Know About Artificial Intelligence (AI)? The Importance of Answering End-User Questions to Explain Autonomous Vehicle (AV) Decisions | Somayeh Molaei et.al. | 2505.06428 | link |
2025-05-09 | Natural Reflection Backdoor Attack on Vision Language Model for Autonomous Driving | Ming Liu et.al. | 2505.06413 | null |
2025-05-09 | Priority-Driven Safe Model Predictive Control Approach to Autonomous Driving Applications | Francesco Prignoli et.al. | 2505.05933 | null |
2025-05-08 | Closing the Loop: Motion Prediction Models beyond Open-Loop Benchmarks | Mohamed-Khalil Bouzidi et.al. | 2505.05638 | null |
2025-05-02 | MDDFNet: Mamba-based Dynamic Dual Fusion Network for Traffic Sign Detection | TianYi Yu et.al. | 2505.05491 | null |
2025-05-08 | 3D Scene Generation: A Survey | Beichen Wen et.al. | 2505.05474 | link |
2025-05-08 | DSDrive: Distilling Large Language Model for Lightweight End-to-End Autonomous Driving with Unified Reasoning and Planning | Wenru Liu et.al. | 2505.05360 | null |
2025-05-08 | PADriver: Towards Personalized Autonomous Driving | Genghua Kou et.al. | 2505.05240 | null |
2025-05-08 | Multi-Objective Reinforcement Learning for Adaptive Personalized Autonomous Driving | Hendrik Surmann et.al. | 2505.05223 | null |
2025-05-08 | X-Driver: Explainable Autonomous Driving with Vision-Language Models | Wei Liu et.al. | 2505.05098 | null |
2025-05-08 | LVLM-MPC Collaboration for Autonomous Driving: A Safety-Aware and Task-Scalable Control Architecture | Kazuki Atsuta et.al. | 2505.04980 | null |
2025-05-07 | Crafting Physical Adversarial Examples by Combining Differentiable and Physically Based Renders | Yuqiu Liu et.al. | 2505.04662 | null |
2025-05-07 | GSsplat: Generalizable Semantic Gaussian Splatting for Novel-view Synthesis in 3D Scenes | Feng Xiao et.al. | 2505.04659 | link |
2025-05-07 | DFVO: Learning Darkness-free Visible and Infrared Image Disentanglement and Fusion All at Once | Qi Zhou et.al. | 2505.04526 | link |
2025-05-07 | Do We Still Need to Work on Odometry for Autonomous Driving? | Cedric Le Gentil et.al. | 2505.04438 | null |
2025-05-07 | Predicting Road Surface Anomalies by Visual Tracking of a Preceding Vehicle | Petr Jahoda et.al. | 2505.04392 | null |
2025-05-07 | Verification of Digital Twins using Classical and Statistical Model Checking | Raghavendran Gunasekaran et.al. | 2505.04322 | null |
2025-05-07 | Multi-Agent Reinforcement Learning-based Cooperative Autonomous Driving in Smart Intersections | Taoyuan Yu et.al. | 2505.04231 | null |
2025-05-07 | Reliable Disentanglement Multi-view Learning Against View Adversarial Attacks | Xuyang Wang et.al. | 2505.04046 | link |
2025-05-06 | Frenet Corridor Planner: An Optimal Local Path Planning Framework for Autonomous Driving | Faizan M. Tariq et.al. | 2505.03695 | null |
2025-05-06 | Moral Testing of Autonomous Driving Systems | Wenbing Tang et.al. | 2505.03683 | null |
2025-05-06 | CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting | Huawei Sun et.al. | 2505.03679 | null |
2025-05-06 | Coop-WD: Cooperative Perception with Weighting and Denoising for Robust V2V Communication | Chenguang Liu et.al. | 2505.03528 | null |
2025-05-06 | RIFT: Closed-Loop RL Fine-Tuning for Realistic and Controllable Traffic Simulation | Keyu Chen et.al. | 2505.03344 | null |
2025-05-06 | Artificial Behavior Intelligence: Technology, Challenges, and Future Directions | Kanghyun Jo et.al. | 2505.03315 | null |
2025-05-06 | 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation | Andrew Caunes et.al. | 2505.03300 | null |
2025-05-06 | RobotxR1: Enabling Embodied Robotic Intelligence on Large Language Models through Closed-Loop Reinforcement Learning | Liam Boyle et.al. | 2505.03238 | null |
2025-05-06 | VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis | Xinyuan Yan et.al. | 2505.03132 | null |
2025-05-04 | Risk Assessment and Threat Modeling for safe autonomous driving technology | Ian Alexis Wong Paz et.al. | 2505.02231 | null |
2025-05-04 | Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning | Shangzhe Li et.al. | 2505.02228 | null |
2025-05-04 | Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving | Alexey Nekrasov et.al. | 2505.02148 | null |
2025-05-04 | DriveAgent: Multi-Agent Structured Reasoning with LLM and Multimodal Sensor Fusion for Autonomous Driving | Xinmeng Hou et.al. | 2505.02123 | link |
2025-05-04 | Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance Fields | Zhenxing Mi et.al. | 2505.02005 | link |
2025-05-03 | DriveNetBench: An Affordable and Configurable Single-Camera Benchmarking System for Autonomous Driving Networks | Ali Al-Bustami et.al. | 2505.01893 | link |
2025-05-03 | CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture | Vladimir Frants et.al. | 2505.01882 | null |
2025-05-03 | PhysNav-DG: A Novel Adaptive Framework for Robust VLM-Sensor Fusion in Navigation Applications | Trisanth Srinivasan et.al. | 2505.01881 | null |
2025-05-03 | Visual enhancement and 3D representation for underwater scenes: a review | Guoxi Huang et.al. | 2505.01869 | null |
2025-05-03 | DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Haoteng Li et.al. | 2505.01857 | null |
2025-05-03 | Edge-Cloud Collaborative Computing on Distributed Intelligence and Model Optimization: A Survey | Jing Liu et.al. | 2505.01821 | null |
2025-05-03 | AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting | Junhao Shi et.al. | 2505.01799 | null |
2025-05-03 | PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth | Bu Jin et.al. | 2505.01729 | null |
2025-04-28 | Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving | Alkis Sygkounas et.al. | 2505.01440 | null |
2025-05-02 | Multi-Objective Reinforcement Learning for Water Management | Zuzanna Osika et.al. | 2505.01094 | null |
2025-05-02 | LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment | Jiahuan Long et.al. | 2505.00980 | null |
2025-05-02 | Seeking to Collide: Online Safety-Critical Scenario Generation for Autonomous Driving with Retrieval Augmented Large Language Models | Yuewen Mei et.al. | 2505.00972 | null |
2025-05-01 | Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L | Woong-Chan Byun et.al. | 2505.00757 | null |
2025-04-30 | A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond | Jiajia Li et.al. | 2505.00737 | null |
2025-05-01 | Safety-Critical Traffic Simulation with Guided Latent Diffusion Model | Mingxing Peng et.al. | 2505.00515 | null |
2025-05-01 | Inconsistency-based Active Learning for LiDAR Object Detection | Esteban Rivera et.al. | 2505.00511 | null |
2025-05-05 | HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection | Esteban Rivera et.al. | 2505.00507 | null |
2025-05-01 | iMacSR: Intermediate Multi-Access Supervision and Regularization in Training Autonomous Driving Models | Wei-Bin Kou et.al. | 2505.00404 | null |
2025-05-01 | Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation | Feng Xue et.al. | 2505.00378 | null |
2025-05-01 | FedEMA: Federated Exponential Moving Averaging with Negative Entropy Regularizer in Autonomous Driving | Wei-Bin Kou et.al. | 2505.00318 | null |
2025-05-01 | LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving | Zhijie Qiao et.al. | 2505.00284 | link |
2025-04-30 | V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving | Jannik Lübberstedt et.al. | 2505.00156 | null |
2025-04-30 | TinyMA-IEI-PPO: Exploration Incentive-Driven Multi-Agent DRL with Self-Adaptive Pruning for Vehicular Embodied AI Agent Twins Migration | Zhuoqi Zeng et.al. | 2505.00055 | null |
2025-04-30 | A Survey of Interactive Generative Video | Jiwen Yu et.al. | 2504.21853 | null |
2025-05-08 | REHEARSE-3D: A Multi-modal Emulated Rain Dataset for 3D Point Cloud De-raining | Abu Mohammed Raisuddin et.al. | 2504.21699 | null |
2025-04-29 | Composite Safety Potential Field for Highway Driving Risk Assessment | Dachuan Zuo et.al. | 2504.21158 | null |
2025-04-29 | Automated Parking Trajectory Generation Using Deep Reinforcement Learning | Zheyu Zhang et.al. | 2504.21071 | null |
2025-04-29 | GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction | Yuhan Xie et.al. | 2504.21067 | link |
2025-04-29 | Neural Stereo Video Compression with Hybrid Disparity Compensation | Shiyin Jiang et.al. | 2504.20383 | null |
2025-04-28 | AI Recommendation Systems for Lane-Changing Using Adherence-Aware Reinforcement Learning | Weihao Sun et.al. | 2504.20187 | null |
2025-04-28 | Learning Streaming Video Representation via Multitask Training | Yibin Yan et.al. | 2504.20041 | null |
2025-04-28 | Socially-Aware Autonomous Driving: Inferring Yielding Intentions for Safer Interactions | Jing Wang et.al. | 2504.20004 | null |
2025-04-28 | Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video | Hoang Chuong Nguyen et.al. | 2504.19819 | null |
2025-04-28 | The ATLAS of Traffic Lights: A Reliable Perception Framework for Autonomous Driving | Rupert Polley et.al. | 2504.19722 | null |
2025-04-28 | Open-set Anomaly Segmentation in Complex Scenarios | Song Xia et.al. | 2504.19706 | null |
2025-05-04 | ARTEMIS: Autoregressive End-to-End Trajectory Planning with Mixture of Experts for Autonomous Driving | Renju Feng et.al. | 2504.19580 | link |
2025-04-28 | CE-NPBG: Connectivity Enhanced Neural Point-Based Graphics for Novel View Synthesis in Autonomous Driving Scenes | Mohammad Altillawi et.al. | 2504.19557 | null |
2025-04-27 | CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis | Alexander Baumann et.al. | 2504.19223 | null |
2025-05-07 | LRFusionPR: A Polar BEV-Based LiDAR-Radar Fusion Network for Place Recognition | Zhangshuo Qi et.al. | 2504.19186 | link |
2025-04-27 | Segmenting Objectiveness and Task-awareness Unknown Region for Autonomous Driving | Mi Zheng et.al. | 2504.19183 | null |
2025-04-27 | Towards Latency-Aware 3D Streaming Perception for Autonomous Driving | Jiaqi Peng et.al. | 2504.19115 | null |
2025-04-26 | Safety Interventions against Adversarial Patches in an Open-Source Driver Assistance System | Cheng Chen et.al. | 2504.18990 | null |
2025-04-26 | Federated Learning-based Semantic Segmentation for Lane and Object Detection in Autonomous Driving | Gharbi Khamis Alshammari et.al. | 2504.18939 | null |
2025-04-26 | Advanced Longitudinal Control and Collision Avoidance for High-Risk Edge Cases in Autonomous Driving | Dianwei Chen et.al. | 2504.18931 | null |
2025-04-26 | Imitation Learning for Autonomous Driving: Insights from Real-World Testing | Hidayet Ersin Dursun et.al. | 2504.18847 | link |
2025-05-01 | Zero-Day Botnet Attack Detection in IoV: A Modular Approach Using Isolation Forests and Particle Swarm Optimization | Abdelaziz Amara Korba et.al. | 2504.18814 | null |
2025-04-26 | Depth as Points: Center Point-based Depth Estimation | Zhiheng Tu et.al. | 2504.18773 | null |
2025-04-22 | DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment | Xiaofan Li et.al. | 2504.18576 | null |
2025-04-25 | NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration | Haotian Dong et.al. | 2504.18448 | null |
2025-04-25 | What is the Added Value of UDA in the VFM Era? | Brunó B. Englert et.al. | 2504.18190 | null |
2025-04-25 | Study on Real-Time Road Surface Reconstruction Using Stereo Vision | Deepak Ghimire et.al. | 2504.18112 | null |
2025-04-24 | CaRL: Learning Scalable Planning Policies with Simple Rewards | Bernhard Jaeger et.al. | 2504.17838 | null |
2025-04-10 | My Precious Crash Data: Barriers and Opportunities in Encouraging Autonomous Driving Companies to Share Safety-Critical Data | Hauke Sandhaus et.al. | 2504.17792 | null |
2025-04-24 | CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos | Shucheng Gong et.al. | 2504.17728 | link |
2025-04-24 | Learning Isometric Embeddings of Road Networks using Multidimensional Scaling | Juan Carlos Climent Pardo et.al. | 2504.17534 | null |
2025-04-24 | Longitudinal Control for Autonomous Racing with Combustion Engine Vehicles | Phillip Pitschi et.al. | 2504.17418 | null |
2025-04-24 | S2S-Net: Addressing the Domain Gap of Heterogeneous Sensor Systems in LiDAR-Based Collective Perception | Sven Teufel et.al. | 2504.17399 | null |
2025-04-25 | Highly Accurate and Diverse Traffic Data: The DeepScenario Open 3D Dataset | Oussema Dhaouadi et.al. | 2504.17371 | null |
2025-04-23 | Meta-Learning Online Dynamics Model Adaptation in Off-Road Autonomous Driving | Jacob Levy et.al. | 2504.16923 | null |
2025-04-23 | Gaussian Splatting is an Effective Data Generator for 3D Object Detection | Farhad G. Zanjani et.al. | 2504.16740 | null |
2025-04-23 | Dual-Camera All-in-Focus Neural Radiance Fields | Xianrui Luo et.al. | 2504.16636 | null |
2025-04-25 | Using Causal Inference to Test Systems with Hidden and Interacting Variables: An Evaluative Case Study | Michael Foster et.al. | 2504.16526 | null |
2025-04-23 | Circinus: Efficient Query Planner for Compound ML Serving | Banruo Liu et.al. | 2504.16397 | null |
2025-04-23 | SaENeRF: Suppressing Artifacts in Event-based Neural Radiance Fields | Yuanjian Wang et.al. | 2504.16389 | link |
2025-04-23 | SILM: A Subjective Intent Based Low-Latency Framework for Multiple Traffic Participants Joint Trajectory Prediction | Qu Weiming et.al. | 2504.16377 | null |
2025-04-23 | DPGP: A Hybrid 2D-3D Dual Path Potential Ghost Probe Zone Prediction Framework for Safe Autonomous Driving | Weiming Qu et.al. | 2504.16374 | null |
2025-04-23 | Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection | Linhua Kong et.al. | 2504.16368 | null |
2025-04-15 | Shape Your Ground: Refining Road Surfaces Beyond Planar Representations | Oussema Dhaouadi et.al. | 2504.16103 | null |
2025-04-10 | NeRF-APT: A New NeRF Framework for Wireless Channel Prediction | Jingzhou Shen et.al. | 2504.16094 | null |
2025-04-22 | MS-Occ: Multi-Stage LiDAR-Camera Fusion for 3D Semantic Occupancy Prediction | Zhiqiang Wei et.al. | 2504.15888 | null |
2025-04-22 | Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models | Quentin Herau et.al. | 2504.15776 | null |
2025-04-22 | Dynamic Intent Queries for Motion Transformer-based Trajectory Prediction | Tobias Demmler et.al. | 2504.15766 | null |
2025-04-22 | SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems | Manjunath D et.al. | 2504.15728 | null |
2025-04-22 | RiskNet: Interaction-Aware Risk Forecasting for Autonomous Driving in Long-Tail Scenarios | Qichao Liu et.al. | 2504.15541 | null |
2025-04-29 | Improving Human-AI Coordination through Adversarial Training and Generative Models | Paresh Chaudhary et.al. | 2504.15457 | null |
2025-04-20 | Adaptive Field Effect Planner for Safe Interactive Autonomous Driving on Curved Roads | Qinghao Li et.al. | 2504.14747 | null |
2025-04-20 | SMTT: Novel Structured Multi-task Tracking with Graph-Regularized Sparse Representation for Robust Thermal Infrared Target Tracking | Shang Zhang et.al. | 2504.14566 | null |
2025-04-24 | Should Benevolent Deception be Allowed in EHMI? A Mechanism Explanation Based on Game Theory | Linkun Liu et.al. | 2504.14539 | null |
2025-04-20 | Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding | Tong Zeng et.al. | 2504.14526 | link |
2025-04-19 | A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models | Chengming Wang et.al. | 2504.14241 | null |
2025-04-19 | ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision | Xie Liang et.al. | 2504.14240 | null |
2025-04-19 | Lightweight Road Environment Segmentation using Vector Quantization | Jiyong Kwag et.al. | 2504.14113 | null |
2025-04-18 | Statistical Analysis and End-to-End Performance Evaluation of Traffic Models for Automotive Data | Marcello Bullo et.al. | 2504.14017 | null |
2025-04-18 | Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training | Andrea Amaduzzi et.al. | 2504.13995 | null |
2025-04-21 | SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM | Samuel Cerezo et.al. | 2504.13713 | link |
2025-04-18 | LMPOcc: 3D Semantic Occupancy Prediction Utilizing Long-Term Memory Prior from Historical Traversals | Shanshuai Yuan et.al. | 2504.13596 | null |
2025-04-18 | Testing the Fault-Tolerance of Multi-Sensor Fusion Perception in Autonomous Driving Systems | Haoxiang Tian et.al. | 2504.13420 | null |
2025-04-21 | LangCoop: Collaborative Driving with Language | Xiangbo Gao et.al. | 2504.13406 | link |
2025-04-18 | Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Shashank Shriram et.al. | 2504.13399 | link |
2025-04-16 | BEV-GS: Feed-forward Gaussian Splatting in Bird’s-Eye-View for Road Reconstruction | Wenhua Wu et.al. | 2504.13207 | null |
2025-04-17 | UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty | Pengxuan Yang et.al. | 2504.12826 | link |
2025-04-17 | Explainable Scene Understanding with Qualitative Representations and Graph Neural Networks | Nassim Belmecheri et.al. | 2504.12817 | null |
2025-04-17 | Approaching Current Challenges in Developing a Software Stack for Fully Autonomous Driving | Simon Sagmeister et.al. | 2504.12813 | null |
2025-04-17 | Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving | Shumin Wang et.al. | 2504.12709 | null |
2025-04-17 | Collaborative Perception Datasets for Autonomous Driving: A Review | Naibang Wang et.al. | 2504.12696 | link |
2025-04-17 | Two Tasks, One Goal: Uniting Motion and Planning for Excellent End To End Autonomous Driving Performance | Lin Liu et.al. | 2504.12667 | null |
2025-04-16 | Self-Supervised Traversability Learning with Online Prototype Adaptation for Off-Road Autonomous Driving | Yafeng Bu et.al. | 2504.12109 | null |
2025-04-16 | Contract-based hierarchical control using predictive feasibility value functions | Felix Berkel et.al. | 2504.12036 | null |
2025-04-15 | Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Nicolas Baumann et.al. | 2504.11514 | link |
2025-04-11 | High Dynamic Range Modulo Imaging for Robust Object Detection in Autonomous Driving | Kebin Contreras et.al. | 2504.11472 | null |
2025-04-15 | Explicit and Implicit Representations in AI-based 3D Reconstruction for Radiology: A systematic literature review | Yuezhe Yang et.al. | 2504.11349 | link |
2025-04-30 | GATE3D: Generalized Attention-based Task-synergized Estimation in 3D* | Eunsoo Im et.al. | 2504.11014 | null |
2025-04-15 | Can Vision-Language Models Understand and Interpret Dynamic Gestures from Pedestrians? Pilot Datasets and Exploration Towards Instructive Nonverbal Commands for Cooperative Autonomous Vehicles | Tonko E. W. Bossen et.al. | 2504.10873 | null |
2025-04-15 | PatrolVision: Automated License Plate Recognition in the wild | Anmol Singhal Navya Singhal et.al. | 2504.10810 | null |
2025-04-14 | ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models | Amirhosein Chahe et.al. | 2504.10757 | link |
2025-04-14 | FuzzSense: Towards A Modular Fuzzing Framework for Autonomous Driving Software | Andrew Roberts et.al. | 2504.10717 | null |
2025-04-14 | Decoupled Diffusion Sparks Adaptive Scene Generation | Yunsong Zhou et.al. | 2504.10485 | null |
2025-04-14 | Siamese Network with Dual Attention for EEG-Driven Social Learning: Bridging the Human-Robot Gap in Long-Tail Autonomous Driving | Xiaoshan Zhou et.al. | 2504.10296 | null |
2025-04-14 | LMFormer: Lane based Motion Prediction Transformer | Harsh Yadav et.al. | 2504.10275 | null |
2025-04-14 | Vision based driving agent for race car simulation environments | Gergely Bári et.al. | 2504.10266 | null |
2025-04-14 | Relative Illumination Fields: Learning Medium and Light Independent Underwater Scenes | Mengkun She et.al. | 2504.10024 | null |
2025-04-14 | Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration | Jiani Ni et.al. | 2504.10007 | null |
2025-04-14 | Towards Resilient Tracking in Autonomous Vehicles: A Distributionally Robust Input and State Estimation Approach | Kasra Azizi et.al. | 2504.09974 | null |
2025-04-14 | MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling | Yunpeng Tan et.al. | 2504.09878 | null |
2025-04-13 | FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird’s Eye View | Yuting Zhao et.al. | 2504.09535 | null |
2025-04-13 | ADDT – A Digital Twin Framework for Proactive Safety Validation in Autonomous Driving Systems | Bo Yu et.al. | 2504.09461 | null |
2025-04-12 | Minority Reports: Balancing Cost and Quality in Ground Truth Data Annotation | Hsuan Wei Liao et.al. | 2504.09341 | null |
2025-04-12 | Text To 3D Object Generation For Scalable Room Assembly | Sonia Laguna et.al. | 2504.09328 | null |
2025-04-12 | ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking | Tzoulio Chamiti et.al. | 2504.09195 | null |
2025-04-11 | HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields | Asterios Reppas et.al. | 2504.08901 | null |
2025-04-11 | Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing | Vinal Asodia et.al. | 2504.08704 | null |
2025-04-11 | TinyCenterSpeed: Efficient Center-Based Object Detection for Autonomous Racing | Neil Reichlin et.al. | 2504.08655 | link |
2025-04-11 | Shadow Erosion and Nighttime Adaptability for Camera-Based Automated Driving Applications | Mohamed Sabry et.al. | 2504.08551 | null |
2025-04-11 | Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review | Jörg Gamerdinger et.al. | 2504.08540 | null |
2025-04-11 | Road Grip Uncertainty Estimation Through Surface State Segmentation | Jyri Maanpää et.al. | 2504.08452 | null |
2025-04-11 | SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis | Yi Chen et.al. | 2504.08361 | link |
2025-04-11 | Generative AI for Film Creation: A Survey of Recent Advances | Ruihan Zhang et.al. | 2504.08296 | null |
2025-04-11 | InSPE: Rapid Evaluation of Heterogeneous Multi-Modal Infrastructure Sensor Placement | Zhaoliang Zheng et.al. | 2504.08240 | null |
2025-04-11 | VL-UR: Vision-Language-guided Universal Restoration of Images Degraded by Adverse Weather Conditions | Ziyan Liu et.al. | 2504.08219 | null |
2025-04-11 | EO-VLM: VLM-Guided Energy Overload Attacks on Vision Models | Minjae Seo et.al. | 2504.08205 | null |
2025-04-10 | Investigating Vision-Language Model for Point Cloud-based Vehicle Classification | Yiqiao Li et.al. | 2504.08154 | null |
2025-04-10 | X-DECODE: EXtreme Deblurring with Curriculum Optimization and Domain Equalization | Sushant Gautam et.al. | 2504.08072 | link |
2025-04-10 | Detect Anything 3D in the Wild | Hanxue Zhang et.al. | 2504.07958 | null |
2025-04-10 | RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions | Youngwan Jin et.al. | 2504.07603 | null |
2025-05-09 | Drive in Corridors: Enhancing the Safety of End-to-end Autonomous Driving via Corridor Learning and Planning | Zhiwei Zhang et.al. | 2504.07507 | null |
2025-04-09 | Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting | Daiwei Zhang et.al. | 2504.06978 | null |
2025-04-09 | MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Chang Nie et.al. | 2504.06863 | null |
2025-04-09 | Dynamic Residual Safe Reinforcement Learning for Multi-Agent Safety-Critical Scenarios Decision-Making | Kaifeng Wang et.al. | 2504.06670 | null |
2025-04-10 | Uni-PrevPredMap: Extending PrevPredMap to a Unified Framework of Prior-Informed Modeling for Online Vectorized HD Map Construction | Nan Peng et.al. | 2504.06647 | link |
2025-04-09 | CAFE-AD: Cross-Scenario Adaptive Feature Enhancement for Trajectory Planning in Autonomous Driving | Junrui Zhang et.al. | 2504.06584 | link |
2025-04-08 | Uncertainty-Aware Hybrid Machine Learning in Virtual Sensors for Vehicle Sideslip Angle Estimation | Abinav Kalyanasundaram et.al. | 2504.06105 | null |
2025-04-08 | PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario | Sriram Mandalika et.al. | 2504.05908 | null |
2025-04-08 | Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments | Dolton Fernandes et.al. | 2504.05840 | null |
2025-04-08 | Meta-Continual Learning of Neural Fields | Seungyoon Woo et.al. | 2504.05806 | null |
2025-04-08 | InvNeRF-Seg: Fine-Tuning a Pre-Trained NeRF for 3D Object Segmentation | Jiangsan Zhao et.al. | 2504.05751 | null |
2025-04-08 | SAP-CoPE: Social-Aware Planning using Cooperative Pose Estimation with Infrastructure Sensor Nodes | Minghao Ning et.al. | 2504.05727 | link |
2025-04-08 | POD: Predictive Object Detection with Single-Frame FMCW LiDAR Point Cloud | Yining Shi et.al. | 2504.05649 | null |
2025-04-07 | L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery | Yi-Zhen Tsai et.al. | 2504.05517 | link |
2025-05-06 | DyTTP: Trajectory Prediction with Normalization-Free Transformers | JianLin Zhu et.al. | 2504.05356 | null |
2025-04-07 | Texture2LoD3: Enabling LoD3 Building Reconstruction With Panoramic Images | Wenzhao Tang et.al. | 2504.05249 | null |
2025-04-07 | Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework | Yu Min Park et.al. | 2504.05187 | null |
2025-04-07 | Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection | Jon Gutiérrez Zaballa et.al. | 2504.05119 | null |
2025-04-07 | MIAT: Maneuver-Intention-Aware Transformer for Spatio-Temporal Trajectory Prediction | Chandra Raskoti et.al. | 2504.05059 | null |
2025-04-07 | GAMDTP: Dynamic Trajectory Prediction with Graph Attention Mamba Network | Yunxiang Liu et.al. | 2504.04862 | null |
2025-04-07 | Prior2Former – Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation | Sebastian Schmidt et.al. | 2504.04841 | null |
2025-04-07 | Large-Scale Mixed-Traffic and Intersection Control using Multi-agent Reinforcement Learning | Songyang Liu et.al. | 2504.04691 | link |
2025-04-07 | DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal | Wanzhou Liu et.al. | 2504.04679 | null |
2025-04-06 | Targetless LiDAR-Camera Calibration with Anchored 3D Gaussians | Haebeom Jung et.al. | 2504.04597 | null |
2025-05-05 | “Trust me on this” Explaining Agent Behavior to a Human Terminator | Uri Menkes et.al. | 2504.04592 | null |
2025-04-06 | Understanding Collective Stability of ACC Systems: From Theory to Real-World Observations | Raphael Korbmacher et.al. | 2504.04530 | null |
2025-04-06 | Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models | Etienne Chassaing et.al. | 2504.04448 | null |
2025-04-06 | Driving-RAG: Driving Scenarios Embedding, Search, and RAG Applications | Cheng Chang et.al. | 2504.04419 | null |
2025-04-16 | OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning | Shihao Wang et.al. | 2504.04348 | null |
2025-04-06 | Data Scaling Laws for End-to-End Autonomous Driving | Alexander Naumann et.al. | 2504.04338 | null |
2025-04-05 | Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications | Brayan Monroy et.al. | 2504.04228 | null |
2025-04-05 | An Optimized Density-Based Lane Keeping System for A Cost-Efficient Autonomous Vehicle Platform: AurigaBot V1 | Farbod Younesi et.al. | 2504.04217 | null |
2025-04-05 | JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration | Yunlong Lin et.al. | 2504.04158 | null |
2025-04-05 | EMF: Event Meta Formers for Event-based Real-time Traffic Object Detection | Muhammad Ahmed Ullah Khan et.al. | 2504.04124 | null |
2025-04-10 | LATTE: Lightweight Attention-based Traffic Accident Anticipation Engine | Jiaxun Zhang et.al. | 2504.04103 | null |
2025-04-04 | Control Map Distribution using Map Query Bank for Online Map Generation | Ziming Liu et.al. | 2504.03868 | null |
2025-04-02 | Exploiting the Uncertainty of the Longest Paths: Response Time Analysis for Probabilistic DAG Tasks | Yiyang Gao et.al. | 2504.03754 | null |
2025-04-28 | Revisiting Outage for Edge Inference Systems | Zhanwei Wang et.al. | 2504.03686 | null |
2025-04-04 | PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector | Kaidong Li et.al. | 2504.03563 | null |
2025-04-07 | ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving | Sheng Yang et.al. | 2504.03438 | null |
2025-04-04 | NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices | Zhe Wang et.al. | 2504.03415 | null |
2025-04-07 | NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving | Kexin Tian et.al. | 2504.03164 | null |
2025-04-03 | Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments | Chenyu Zhang et.al. | 2504.02918 | null |
2025-04-02 | Enhancing Traffic Sign Recognition On The Performance Based On Yolov8 | Baba Ibrahim et.al. | 2504.02884 | null |
2025-04-03 | MultiNeRF: Multiple Watermark Embedding for Neural Radiance Fields | Yash Kulthe et.al. | 2504.02517 | null |
2025-04-28 | CHARMS: A Cognitive Hierarchical Agent for Reasoning and Motion Stylization in Autonomous Driving | Jingyi Wang et.al. | 2504.02450 | link |
2025-04-03 | MinkOcc: Towards real-time label-efficient semantic occupancy prediction | Samuel Sze et.al. | 2504.02270 | null |
2025-04-02 | On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software | Ali Nouri et.al. | 2504.02141 | null |
2025-04-01 | OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF | Jingyu Shi et.al. | 2504.02007 | null |
2025-04-01 | Semantic SLAM with Rolling-Shutter Cameras and Low-Precision INS in Outdoor Environments | Yuchen Zhang et.al. | 2504.01997 | null |
2025-03-31 | A Concise Survey on Lane Topology Reasoning for HD Mapping | Yi Yao et.al. | 2504.01989 | null |
2025-04-02 | Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis | Niluthpol Chowdhury Mithun et.al. | 2504.01960 | null |
2025-04-03 | Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting | Shu-Wei Lu et.al. | 2504.01957 | null |
2025-04-09 | End-to-End Driving with Online Trajectory Evaluation via BEV World Model | Yingyan Li et.al. | 2504.01941 | link |
2025-04-02 | BOGausS: Better Optimized Gaussian Splatting | Stéphane Pateux et.al. | 2504.01844 | null |
2025-04-12 | Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation | Junjie Chen et.al. | 2504.01668 | null |
2025-04-02 | Building Knowledge from Interactions: An LLM-Based Architecture for Adaptive Tutoring and Social Reasoning | Luca Garello et.al. | 2504.01588 | null |
2025-04-02 | RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars | Yahui Li et.al. | 2504.01559 | null |
2025-04-23 | Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment | Ziteng Cui et.al. | 2504.01503 | link |
2025-04-02 | Deep LG-Track: An Enhanced Localization-Confidence-Guided Multi-Object Tracker | Ting Meng et.al. | 2504.01457 | null |
2025-04-02 | DF-Calib: Targetless LiDAR-Camera Calibration via Depth Flow | Shu Han et.al. | 2504.01416 | null |
2025-04-02 | Pedestrian-Aware Motion Planning for Autonomous Driving in Complex Urban Scenarios | Korbinian Moller et.al. | 2504.01409 | link |
2025-04-02 | From Shadows to Safety: Occlusion Tracking and Risk Mitigation for Urban Autonomous Driving | Korbinian Moller et.al. | 2504.01408 | link |
2025-03-31 | Cal or No Cal? – Real-Time Miscalibration Detection of LiDAR and Camera Sensors | Ilir Tahiraj et.al. | 2504.01040 | link |
2025-03-26 | Omnidirectional Depth-Aided Occupancy Prediction based on Cylindrical Voxel for Autonomous Driving | Chaofan Wu et.al. | 2504.01023 | null |
2025-04-07 | Neural Pruning for 3D Scene Reconstruction: Efficient NeRF Acceleration | Tianqi Ding et.al. | 2504.00950 | null |
2025-04-01 | Foundation Models for Autonomous Driving System: An Initial Roadmap | Xiongfei Wu et.al. | 2504.00911 | null |
2025-04-09 | NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds | Mahan Rafidashti et.al. | 2504.00859 | null |
2025-04-01 | UnIRe: Unsupervised Instance Decomposition for Dynamic Urban Scene Reconstruction | Yunxuan Mao et.al. | 2504.00763 | null |
2025-04-01 | ADGaussian: Generalizable Gaussian Splatting for Autonomous Driving with Multi-modal Inputs | Qi Song et.al. | 2504.00437 | null |
2025-04-01 | Intrinsic-feature-guided 3D Object Detection | Wanjing Zhang et.al. | 2504.00382 | null |
2025-04-01 | MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving | Zhiyuan Zhang et.al. | 2504.00379 | null |
2025-03-31 | NeRF-Based defect detection | Tianqi et.al. | 2504.00270 | null |
2025-04-23 | CF-CAM: Cluster Filter Class Activation Mapping for Reliable Gradient-Based Interpretability | Hongjie He et.al. | 2504.00060 | null |
2025-03-31 | UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving | Yuping Wang et.al. | 2503.24381 | link |
2025-04-01 | Self-Supervised Pretraining for Aerial Road Extraction | Rupert Polley et.al. | 2503.24326 | null |
2025-03-31 | Can Test-Time Scaling Improve World Foundation Model? | Wenyan Cong et.al. | 2503.24320 | link |
2025-04-29 | 4D mmWave Radar for Sensing Enhancement in Adverse Environments: Advances and Challenges | Xiangyuan Peng et.al. | 2503.24091 | null |
2025-03-31 | DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model | Ming Yuan et.al. | 2503.23993 | null |
2025-03-31 | Video-based Traffic Light Recognition by Rockchip RV1126 for Autonomous Driving | Miao Fan et.al. | 2503.23965 | null |
2025-03-31 | A Benchmark for Vision-Centric HD Mapping by V2I Systems | Miao Fan et.al. | 2503.23963 | null |
2025-03-31 | GLane3D : Detecting Lanes with Graph of 3D Keypoints | Halil İbrahim Öztürk et.al. | 2503.23882 | null |
2025-04-21 | STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding? | Yun Li et.al. | 2503.23765 | null |
2025-04-07 | Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios | Jingzheng Li et.al. | 2503.23708 | null |
2025-03-31 | A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective | Zhuoren Li et.al. | 2503.23650 | null |
2025-03-30 | OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model | Xingcheng Zhou et.al. | 2503.23463 | link |
2025-04-13 | A Visual-Inertial Motion Prior SLAM for Dynamic Environments | Weilong Sun et.al. | 2503.23429 | null |
2025-03-30 | OnSiteVRU: A High-Resolution Trajectory Dataset for High-Density Vulnerable Road Users | Zhangcun Yan et.al. | 2503.23365 | null |
2025-03-29 | VLM-C4L: Continual Core Dataset Learning with Corner Case Optimization via Vision-Language Models for Autonomous Driving | Haibo Hu et.al. | 2503.23046 | null |
2025-03-28 | Markov Potential Game Construction and Multi-Agent Reinforcement Learning with Applications to Autonomous Driving | Huiwen Yan et.al. | 2503.22867 | null |
2025-03-28 | SafeCast: Risk-Responsive Motion Forecasting for Autonomous Vehicles | Haicheng Liao et.al. | 2503.22541 | null |
2025-03-28 | NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving | Fuhao Li et.al. | 2503.22436 | null |
2025-04-16 | VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene Flow | Yancong Lin et.al. | 2503.22328 | link |
2025-03-28 | A Dataset for Semantic Segmentation in the Presence of Unknowns | Zakaria Laskar et.al. | 2503.22309 | null |
2025-03-28 | CRLLK: Constrained Reinforcement Learning for Lane Keeping in Autonomous Driving | Xinwei Gao et.al. | 2503.22248 | null |
2025-04-05 | CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving | Yishen Ji et.al. | 2503.22231 | null |
2025-03-28 | ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting | Wenjie Liu et.al. | 2503.22218 | null |
2025-03-28 | Multi-modal Knowledge Distillation-based Human Trajectory Forecasting | Jaewoo Jeong et.al. | 2503.22201 | link |
2025-03-28 | Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy Prediction | Seokha Moon et.al. | 2503.22087 | link |
2025-03-28 | A Deep Learning Framework for Boundary-Aware Semantic Segmentation | Tai An et.al. | 2503.22050 | null |
2025-03-27 | InteractionMap: Improving Online Vectorized HDMap Construction with Interaction | Kuang Wu et.al. | 2503.21659 | null |
2025-03-27 | Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving | Yue Li et.al. | 2503.21505 | link |
2025-04-01 | Fine-Grained Behavior and Lane Constraints Guided Trajectory Prediction Method | Wenyi Xiong et.al. | 2503.21477 | null |
2025-03-27 | Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving | Lucas Nunes et.al. | 2503.21449 | link |
2025-03-27 | Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap | Tong Nie et.al. | 2503.21411 | link |
2025-03-28 | LandMarkSystem Technical Report | Zhenxiang Ma et.al. | 2503.21364 | link |
2025-03-27 | Large Language Models for Traffic and Transportation Research: Methodologies, State of the Art, and Future Opportunities | Yimo Yan et.al. | 2503.21330 | null |
2025-03-27 | Knowledge Graphs as World Models for Semantic Material-Aware Obstacle Handling in Autonomous Vehicles | Ayush Bheemaiah et.al. | 2503.21232 | null |
2025-03-27 | Extending Silicon Lifetime: A Review of Design Techniques for Reliable Integrated Circuits | Shaik Jani Babu et.al. | 2503.21165 | null |
2025-03-27 | Adversarial Wear and Tear: Exploiting Natural Damage for Generating Physical-World Adversarial Examples | Samra Irshad et.al. | 2503.21164 | null |
2025-03-24 | AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models | Le Qiu et.al. | 2503.20804 | null |
2025-03-26 | ADS-Edit: A Multimodal Knowledge Editing Dataset for Autonomous Driving Systems | Chenxi Wang et.al. | 2503.20756 | link |
2025-03-26 | AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Xiangwen Zhang et.al. | 2503.20654 | null |
2025-03-26 | SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments | Tanmoy Dam et.al. | 2503.20614 | link |
2025-03-26 | GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving | Lloyd Russell et.al. | 2503.20523 | null |
2025-03-26 | EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis | Sheng Miao et.al. | 2503.20168 | null |
2025-03-26 | Bandwidth Allocation for Cloud-Augmented Autonomous Driving | Peter Schafhalter et.al. | 2503.20127 | null |
2025-03-25 | Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals | Zhirui Dai et.al. | 2503.20066 | null |
2025-03-25 | SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining | Xiang Xu et.al. | 2503.19912 | link |
2025-03-25 | LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset | Manjushree Aithal et.al. | 2503.19804 | null |
2025-03-31 | Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion | Konyul Park et.al. | 2503.19776 | null |
2025-03-25 | ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation | Haoyu Fu et.al. | 2503.19755 | null |
2025-03-26 | A Survey on Event-driven 3D Reconstruction: Development under Different Categories | Chuanzhi Xu et.al. | 2503.19753 | null |
2025-03-25 | Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving | Yusen Xie et.al. | 2503.19713 | link |
2025-03-27 | Risk-Aware Reinforcement Learning for Autonomous Driving: Improving Safety When Driving through Intersection | Bo Leng et.al. | 2503.19690 | null |
2025-03-25 | MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities | Federico Lincetto et.al. | 2503.19673 | null |
2025-03-25 | SINR: Sparsity Driven Compressed Implicit Neural Representations | Dhananjaya Jayasundara et.al. | 2503.19576 | null |
2025-03-25 | Multi-Agent Deep Reinforcement Learning for Safe Autonomous Driving with RICS-Assisted MEC | Xueyao Zhang et.al. | 2503.19418 | null |
2025-04-02 | EmoHead: Emotional Talking Head via Manipulating Semantic Expression Parameters | Xuli Shen et.al. | 2503.19416 | null |
2025-03-26 | ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models | Dohwan Ko et.al. | 2503.19355 | null |
2025-03-25 | A Reliable and Efficient 5G Vehicular MEC: Guaranteed Task Completion with Minimal Latency | Mahsa Paknejad et.al. | 2503.19320 | null |
2025-03-25 | BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation | Hanshuo Qiu et.al. | 2503.19303 | null |
2025-03-25 | Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications | Ben Rahman et.al. | 2503.19276 | null |
2025-03-24 | Enhancing V2X Communications with UAV-mounted Reconfigurable Intelligent Surfaces | Salim Janji et.al. | 2503.19038 | null |
2025-03-24 | Building Blocks for Robust and Effective Semi-Supervised Real-World Object Detection | Moussa Kassem Sbeyti et.al. | 2503.18903 | null |
2025-03-24 | NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting | Yulong Zheng et.al. | 2503.18794 | null |
2025-03-24 | Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving | Hongkuan Zhou et.al. | 2503.18730 | null |
2025-04-07 | AgentSpec: Customizable Runtime Enforcement for Safe and Reliable LLM Agents | Haoyu Wang et.al. | 2503.18666 | null |
2025-03-24 | Robust Lane Detection with Wavelet-Enhanced Context Modeling and Adaptive Sampling | Kunyang Li et.al. | 2503.18631 | null |
2025-03-24 | ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation | Guosheng Zhao et.al. | 2503.18438 | null |
2025-03-30 | NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction | Wenyuan Zhang et.al. | 2503.18361 | null |
2025-03-23 | Training A Neural Network For Partially Occluded Road Sign Identification In The Context Of Autonomous Vehicles | Gulnaz Gimaletdinova et.al. | 2503.18177 | null |
2025-03-23 | Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving | Junhao Ge et.al. | 2503.18108 | link |
2025-03-23 | M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous Driving | Xuesong Chen et.al. | 2503.18100 | link |
2025-04-15 | Text-Driven 3D Lidar Place Recognition for Autonomous Driving | Tianyi Shang et.al. | 2503.18035 | null |
2025-03-22 | LightLoc: Learning Outdoor LiDAR Localization at Light Speed | Wen Li et.al. | 2503.17814 | link |
2025-03-22 | HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving | R. D. Lin et.al. | 2503.17752 | link |
2025-03-22 | Multi-modality Anomaly Segmentation on the Road | Heng Gao et.al. | 2503.17712 | link |
2025-03-22 | Sense4FL: Vehicular Crowdsensing Enhanced Federated Learning for Autonomous Driving | Yanan Ma et.al. | 2503.17697 | null |
2025-03-18 | CP-NCBF: A Conformal Prediction-based Approach to Synthesize Verified Neural Control Barrier Functions | Manan Tayal et.al. | 2503.17395 | null |
2025-03-21 | How to Promote Autonomous Driving with Evolving Technology: Business Strategy and Pricing Decision | Mingliang Li et.al. | 2503.17174 | null |
2025-03-26 | Hi-ALPS – An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving | Alexandra Arzberger et.al. | 2503.17168 | null |
2025-03-21 | Enhancing Steering Estimation with Semantic-Aware GNNs | Fouad Makiyeh et.al. | 2503.17153 | null |
2025-03-26 | R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception | Jonas Mirlach et.al. | 2503.17122 | null |
2025-03-21 | FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields | Kwan Yun et.al. | 2503.17095 | link |
2025-03-21 | Temporal Action Detection Model Compression by Progressive Block Drop | Xiaoyong Chen et.al. | 2503.16916 | null |
2025-03-21 | OpenCity3D: What do Vision-Language Models know about Urban Environments? | Valentin Bieri et.al. | 2503.16776 | null |
2025-03-20 | Digitally Prototype Your Eye Tracker: Simulating Hardware Performance using 3D Synthetic Data | Esther Y. H. Lin et.al. | 2503.16742 | null |
2025-03-19 | A Vehicle-Infrastructure Multi-layer Cooperative Decision-making Framework | Yiming Cui et.al. | 2503.16552 | null |
2025-03-04 | Injecting Conflict Situations in Autonomous Driving Simulation using CARLA | Tsvetomila Mihaylova et.al. | 2503.16476 | null |
2025-03-20 | Panoptic-CUDAL Technical Report: Rural Australia Point Cloud Dataset in Rainy Conditions | Tzu-Yun Tseng et.al. | 2503.16378 | null |
2025-03-20 | BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models | Zenghui Yuan et.al. | 2503.16023 | null |
2025-03-20 | Automating 3D Dataset Generation with Neural Radiance Fields | P. Schulz et.al. | 2503.15997 | link |
2025-03-20 | Enhancing Close-up Novel View Synthesis via Pseudo-labeling | Jiatong Xia et.al. | 2503.15908 | link |
2025-03-20 | MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving | Haiguang Wang et.al. | 2503.15875 | link |
2025-03-20 | AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models | Boshra Khalili et.al. | 2503.15778 | null |
2025-03-20 | Nano-3D: Metasurface-Based Neural Depth Imaging | Bingxuan Li et.al. | 2503.15770 | null |
2025-03-19 | GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving | William Ljungbergh et.al. | 2503.15672 | null |
2025-03-19 | DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis | Yuming Gu et.al. | 2503.15667 | link |
2025-03-19 | V2X-DG: Domain Generalization for Vehicle-to-Everything Cooperative Perception | Baolu Li et.al. | 2503.15435 | null |
2025-03-19 | EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds | Yuanchao Yue et.al. | 2503.15284 | link |
2025-03-19 | GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector | Zechuan Li et.al. | 2503.15211 | null |
2025-03-19 | An Investigation of Beam Density on LiDAR Object Detection Performance | Christoph Griesbacher et.al. | 2503.15087 | null |
2025-03-19 | MultiBARF: Integrating Imagery of Different Wavelength Regions by Using Neural Radiance Fields | Kana Kurata et.al. | 2503.15070 | null |
2025-03-19 | DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling | Jianbo Zhao et.al. | 2503.15029 | null |
2025-03-19 | USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network | Joseph Emmanuel DL Dayo et.al. | 2503.14950 | null |
2025-03-26 | Generating Multimodal Driving Scenes via Next-Scene Prediction | Yanhao Wu et.al. | 2503.14945 | null |
2025-03-19 | SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments | Yinqi Chen et.al. | 2503.14837 | null |
2025-03-19 | MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models | Chejian Xu et.al. | 2503.14827 | null |
2025-03-18 | RAT: Boosting Misclassification Detection Ability without Extra Data | Ge Yan et.al. | 2503.14783 | null |
2025-03-20 | These Magic Moments: Differentiable Uncertainty Quantification of Radiance Field Models | Parker Ewen et.al. | 2503.14665 | null |
2025-03-21 | SuperPC: A Single Diffusion Model for Point Cloud Completion, Upsampling, Denoising, and Colorization | Yi Du et.al. | 2503.14558 | null |
2025-03-22 | Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey | Liewen Liao et.al. | 2503.14537 | null |
2025-03-19 | Advances in 4D Generation: A Survey | Qiaowei Miao et.al. | 2503.14501 | link |
2025-03-18 | Tracking Meets Large Multimodal Models for Driving Scenario Understanding | Ayesha Ishaq et.al. | 2503.14498 | link |
2025-03-18 | Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis | Yizhou Li et.al. | 2503.14219 | null |
2025-03-18 | Driving behavior recognition via self-discovery learning | Yilin Wang et.al. | 2503.14194 | null |
2025-03-18 | Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning | Bozhou Zhang et.al. | 2503.14182 | link |
2025-03-18 | SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model | Xinqing Li et.al. | 2503.13952 | link |
2025-03-21 | ChatBEV: A Visual Language Model that Understands BEV Maps | Qingyao Xu et.al. | 2503.13938 | null |
2025-03-18 | PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds | Barza Nisar et.al. | 2503.13914 | null |
2025-03-18 | Robust3D-CIL: Robust Class-Incremental Learning for 3D Perception | Jinge Ma et.al. | 2503.13869 | null |
2025-03-18 | RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving | Yujin Wang et.al. | 2503.13861 | null |
2025-03-26 | MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations | Hongyu Ke et.al. | 2503.13858 | link |
2025-03-17 | Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios | Iryna Repinetska et.al. | 2503.13710 | null |
2025-03-17 | AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction | Thomas Monninger et.al. | 2503.13430 | null |
2025-03-17 | A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives | Weiqiang Jin et.al. | 2503.13415 | null |
2025-03-17 | Clustering is back: Reaching state-of-the-art LiDAR instance segmentation without training | Corentin Sautier et.al. | 2503.13203 | null |
2025-03-17 | InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving | Ruiqi Song et.al. | 2503.13047 | null |
2025-03-17 | SparseAlign: A Fully Sparse Framework for Cooperative Object Detection | Yunshuang Yuan et.al. | 2503.12982 | null |
2025-03-17 | OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering | Guanhua Ding et.al. | 2503.12968 | null |
2025-03-17 | DivCon-NeRF: Generating Augmented Rays with Diversity and Consistency for Few-shot View Synthesis | Ingyun Lee et.al. | 2503.12947 | null |
2025-03-17 | Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation | Kailin Li et.al. | 2503.12820 | null |
2025-03-17 | SAM2 for Image and Video Segmentation: A Comprehensive Survey | Zhang Jiaxing et.al. | 2503.12781 | null |
2025-03-17 | GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching | Feng Qiao et.al. | 2503.12720 | link |
2025-03-16 | Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding | Imran Kabir et.al. | 2503.12663 | link |
2025-03-23 | Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey | Yaoting Wang et.al. | 2503.12605 | link |
2025-03-16 | Point Cloud Based Scene Segmentation: A Survey | Dan Halperin et.al. | 2503.12595 | null |
2025-03-16 | Car-1000: A New Large Scale Fine-Grained Visual Categorization Dataset | Yutao Hu et.al. | 2503.12385 | null |
2025-03-16 | L2COcc: Lightweight Camera-Centric Semantic Scene Completion via Distillation of LiDAR Model | Ruoyu Wang et.al. | 2503.12369 | null |
2025-03-16 | ResLPR: A LiDAR Data Restoration Network and Benchmark for Robust Place Recognition Against Weather Corruptions | Wenqing Kuang et.al. | 2503.12350 | null |
2025-03-15 | Bench2FreeAD: A Benchmark for Vision-based End-to-end Navigation in Unstructured Robotic Environments | Yuhang Peng et.al. | 2503.12180 | link |
2025-03-15 | DiffAD: A Unified Diffusion Modeling Approach for Autonomous Driving | Tao Wang et.al. | 2503.12170 | null |
2025-03-15 | SFMNet: Sparse Focal Modulation for 3D Object Detection | Oren Shrout et.al. | 2503.12093 | null |
2025-03-15 | FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields | Rui Qian et.al. | 2503.12086 | null |
2025-03-15 | Generative Modeling of Adversarial Lane-Change Scenario | Chuancheng Zhang et.al. | 2503.12055 | null |
2025-03-15 | TACO: Taming Diffusion for in-the-wild Video Amodal Completion | Ruijie Lu et.al. | 2503.12049 | null |
2025-03-15 | Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training | Zhenxin Li et.al. | 2503.12030 | link |
2025-04-03 | 3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene Reconstruction | Peizhen Zheng et.al. | 2503.12001 | link |
2025-03-30 | Controllable Latent Diffusion for Traffic Simulation | Yizhuo Xiao et.al. | 2503.11771 | link |
2025-03-14 | Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation | Xianming Zeng et.al. | 2503.11731 | null |
2025-03-14 | Centaur: Robust End-to-End Autonomous Driving with Test-Time Training | Chonghao Sima et.al. | 2503.11650 | null |
2025-03-14 | A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving | Tin Stribor Sohn et.al. | 2503.11400 | null |
2025-03-14 | BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion Model | Ziyue Wang et.al. | 2503.11372 | link |
2025-03-14 | Learning-Based MPC for Efficient Control of Autonomous Vehicles | Samuel Mallick et.al. | 2503.11359 | link |
2025-03-14 | DynRsl-VLM: Enhancing Autonomous Driving Perception with Dynamic Resolution Vision-Language Models | Xirui Zhou et.al. | 2503.11265 | null |
2025-03-14 | DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation | Hongbin Lin et.al. | 2503.11122 | link |
2025-03-14 | Active Learning from Scene Embeddings for End-to-End Autonomous Driving | Wenhao Jiang et.al. | 2503.11062 | null |
2025-03-13 | Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM | Yizhou Huang et.al. | 2503.10898 | null |
2025-03-21 | TAIJI: Textual Anchoring for Immunizing Jailbreak Images in Vision Language Models | Xiangyu Yin et.al. | 2503.10872 | null |
2025-03-13 | DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding | Ayesha Ishaq et.al. | 2503.10621 | link |
2025-03-13 | OCCUQ: Exploring Efficient Uncertainty Quantification for 3D Occupancy Prediction | Severin Heidrich et.al. | 2503.10605 | link |
2025-03-13 | MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction | Yingshuang Zou et.al. | 2503.10604 | null |
2025-03-15 | Unlock the Power of Unlabeled Data in Language Driving Model | Chaoqun Wang et.al. | 2503.10586 | null |
2025-03-13 | Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations | Xunzhi Zheng et.al. | 2503.10464 | null |
2025-03-13 | Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback | Derun Li et.al. | 2503.10434 | null |
2025-03-13 | TARS: Traffic-Aware Radar Scene Flow Estimation | Jialong Wu et.al. | 2503.10210 | null |
2025-03-13 | GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction | Jianheng Liu et.al. | 2503.10170 | link |
2025-03-13 | Unlocking Generalization Power in LiDAR Point Cloud Registration | Zhenxuan Zeng et.al. | 2503.10149 | link |
2025-03-13 | Mamba-VA: A Mamba-based Approach for Continuous Emotion Recognition in Valence-Arousal Space | Yuheng Liang et.al. | 2503.10104 | link |
2025-03-13 | TGP: Two-modal occupancy prediction with 3D Gaussian and sparse points for 3D Environment Awareness | Mu Chen et.al. | 2503.09941 | null |
2025-03-12 | CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | Hariprasath Govindarajan et.al. | 2503.09878 | null |
2025-03-12 | A Comprehensive Multi-Vocal Empirical Study of ML Cloud Service Misuses | Hadil Ben Amor et.al. | 2503.09815 | null |
2025-03-12 | Evaluating the Impact of Synthetic Data on Object Detection Tasks in Autonomous Driving | Enes Özeren et.al. | 2503.09803 | null |
2025-03-12 | SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment | Katrin Renz et.al. | 2503.09594 | null |
2025-03-12 | Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation | Máté Tóth et.al. | 2503.09464 | null |
2025-03-13 | PCLA: A Framework for Testing Autonomous Agents in the CARLA Simulator | Masoud Jamshidiyan Tehrani et.al. | 2503.09385 | link |
2025-03-12 | Post-interactive Multimodal Trajectory Prediction for Autonomous Driving | Ziyi Huang et.al. | 2503.09366 | null |
2025-03-12 | A Case Study on Model Checking and Runtime Verification for Awkernel | Akira Hasegawa et.al. | 2503.09282 | null |
2025-03-17 | Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space | Jian Zhu et.al. | 2503.09215 | null |
2025-03-17 | Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection | Xuzhong Hu et.al. | 2503.08992 | null |
2025-03-11 | Simulator Ensembles for Trustworthy Autonomous Driving Testing | Lev Sorokin et.al. | 2503.08936 | null |
2025-04-05 | Out-of-Distribution Segmentation in Autonomous Driving: Problems and State of the Art | Youssef Shoeb et.al. | 2503.08695 | null |
2025-03-11 | CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving | Changxing Liu et.al. | 2503.08683 | link |
2025-04-12 | Language-Depth Navigated Thermal and Visible Image Fusion | Jinchang Zhang et.al. | 2503.08676 | null |
2025-03-11 | Task-Oriented Co-Design of Communication, Computing, and Control for Edge-Enabled Industrial Cyber-Physical Systems | Yufeng Diao et.al. | 2503.08661 | null |
2025-03-11 | HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder | Yingqi Tang et.al. | 2503.08612 | link |
2025-03-11 | LiSu: A Dataset and Method for LiDAR Surface Normal Estimation | Dušan Malić et.al. | 2503.08601 | null |
2025-03-13 | JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data | Runjian Chen et.al. | 2503.08422 | null |
2025-03-11 | V-Max: Making RL practical for Autonomous Driving | Valentin Charraut et.al. | 2503.08388 | link |
2025-03-11 | Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving | Runwei Guan et.al. | 2503.08336 | null |
2025-03-24 | Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios | Zikang Yuan et.al. | 2503.08317 | null |
2025-03-11 | Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming | Jiaxuan Zhu et.al. | 2503.08166 | null |
2025-03-11 | FASIONAD++ : Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback | Kangan Qian et.al. | 2503.08162 | null |
2025-03-11 | GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats | Kai Deng et.al. | 2503.08071 | link |
2025-03-11 | Simulating Automotive Radar with Lidar and Camera Inputs | Peili Song et.al. | 2503.08068 | null |
2025-03-11 | SGNetPose+: Stepwise Goal-Driven Networks with Pose Information for Trajectory Prediction in Autonomous Driving | Akshat Ghiya et.al. | 2503.08016 | null |
2025-03-11 | NeRF-VIO: Map-Based Visual-Inertial Odometry with Initialization Leveraging Neural Radiance Fields | Yanyu Zhang et.al. | 2503.07952 | null |
2025-03-11 | STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive Applications | Andrew Gao et.al. | 2503.07942 | link |
2025-03-10 | Neural Radiance and Gaze Fields for Visual Attention Modeling in 3D Environments | Andrei Chubarau et.al. | 2503.07828 | null |
2025-03-07 | DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving | Xiaosong Jia et.al. | 2503.07656 | link |
2025-03-10 | AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning | Bo Jiang et.al. | 2503.07608 | link |
2025-03-10 | Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru | Dunant Cusipuma et.al. | 2503.07587 | null |
2025-03-10 | Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction | Zongzheng Zhang et.al. | 2503.07485 | link |
2025-03-10 | CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving | Ziliang Xiong et.al. | 2503.07425 | null |
2025-03-26 | GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts | Minwen Liao et.al. | 2503.07417 | null |
2025-03-10 | LEGO-Motion: Learning-Enhanced Grids with Occupancy Instance Modeling for Class-Agnostic Motion Prediction | Kangan Qian et.al. | 2503.07367 | null |
2025-03-10 | Temporal Triplane Transformers as Occupancy World Models | Haoran Xu et.al. | 2503.07338 | null |
2025-03-10 | CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting | Haicheng Liao et.al. | 2503.07234 | null |
2025-03-12 | HisTrackMap: Global Vectorized High-Definition Map Construction via History Map Tracking | Jing Yang et.al. | 2503.07168 | null |
2025-03-10 | Controllable 3D Outdoor Scene Generation via Scene Graphs | Yuheng Liu et.al. | 2503.07152 | link |
2025-03-12 | RS2V-L: Vehicle-Mounted LiDAR Data Generation from Roadside Sensor Observations | Ruidan Xing et.al. | 2503.07085 | null |
2025-03-10 | Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera | Dong-Hee Paek et.al. | 2503.07029 | link |
2025-03-10 | Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense | Yuting Hu et.al. | 2503.07020 | null |
2025-03-10 | Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark | Jiahao Wang et.al. | 2503.06983 | link |
2025-03-10 | HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors | Siyu Li et.al. | 2503.06821 | link |
2025-03-09 | Chance-Constrained Trajectory Planning with Multimodal Environmental Uncertainty | Kai Ren et.al. | 2503.06779 | link |
2025-03-09 | Gaussian RBFNet: Gaussian Radial Basis Functions for Fast and Accurate Representation and Reconstruction of Neural Fields | Abdelaziz Bouzidi et.al. | 2503.06762 | null |
2025-03-09 | CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving | Rui Song et.al. | 2503.06744 | null |
2025-03-09 | Attention, Please! PixelSHAP Reveals What Vision-Language Models Actually Focus On | Roni Goldshmidt et.al. | 2503.06670 | null |
2025-03-09 | AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation | Yang Zou et.al. | 2503.06660 | null |
2025-03-09 | Steerable Pyramid Weighted Loss: Multi-Scale Adaptive Weighting for Semantic Segmentation | Renhao Lu et.al. | 2503.06604 | null |
2025-03-30 | StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place Recognition | Yanqing Shen et.al. | 2503.06601 | link |
2025-03-09 | Future-Aware Interaction Network For Motion Forecasting | Shijie Li et.al. | 2503.06565 | null |
2025-03-09 | Evaluation of Safety Cognition Capability in Vision-Language Models for Autonomous Driving | Enming Zhang et.al. | 2503.06497 | null |
2025-03-09 | OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection | Adrian Chow et.al. | 2503.06435 | null |
2025-03-08 | Advancing Autonomous Vehicle Intelligence: Deep Learning and Multimodal LLM for Traffic Sign Recognition and Robust Lane Detection | Chandan Kumar Sah et.al. | 2503.06313 | null |
2025-03-08 | ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation | Qizhen Lan et.al. | 2503.06307 | null |
2025-03-08 | From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning | Shuangzhi Li et.al. | 2503.06282 | null |
2025-03-08 | Segment Anything, Even Occluded | Wei-En Tai et.al. | 2503.06261 | null |
2025-03-08 | Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection | Yifan Chang et.al. | 2503.06237 | null |
2025-03-08 | Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations | Meng Wang et.al. | 2503.06222 | null |
2025-03-08 | VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion | Meng Wang et.al. | 2503.06219 | link |
2025-03-12 | Object-Centric World Model for Language-Guided Manipulation | Youngjoon Jeong et.al. | 2503.06170 | null |
2025-03-17 | Treble Counterfactual VLMs: A Causal Approach to Hallucination | Shawn Li et.al. | 2503.06169 | link |
2025-03-17 | Secure On-Device Video OOD Detection Without Backpropagation | Shawn Li et.al. | 2503.06166 | link |
2025-03-08 | Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction | Kai Li et.al. | 2503.06161 | null |
2025-03-08 | NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features | Hongjia Zhai et.al. | 2503.06117 | null |
2025-03-08 | TransParking: A Dual-Decoder Transformer Framework with Soft Localization for End-to-End Automatic Parking | Hangyu Du et.al. | 2503.06071 | null |
2025-03-08 | InfoFusion Controller: Informed TRRT Star with Mutual Information based on Fusion of Pure Pursuit and MPC for Enhanced Path Planning | Seongjun Choi et.al. | 2503.06010 | link |
2025-03-08 | Learning to Drive by Imitating Surrounding Vehicles | Yasin Sonmez et.al. | 2503.05997 | null |
2025-03-04 | DriveGen: Towards Infinite Diverse Traffic Scenarios with Large Models | Shenyu Zhang et.al. | 2503.05808 | null |
2025-03-13 | GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving | Zebin Xing et.al. | 2503.05689 | link |
2025-03-07 | InDRiVE: Intrinsic Disagreement based Reinforcement for Vehicle Exploration through Curiosity Driven Generalized World Model | Feeza Khan Khanzada et.al. | 2503.05573 | null |
2025-03-07 | FastMap: Fast Queries Initialization Based Vectorized HD Map Reconstruction Framework | Haotian Hu et.al. | 2503.05492 | link |
2025-03-07 | DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction | Miaowei Wang et.al. | 2503.05484 | null |
2025-03-07 | A Hybrid Approach for Extending Automotive Radar Operation to NLOS Urban Scenarios | Aviran Gal et.al. | 2503.05413 | null |
2025-03-07 | Evidential Uncertainty Estimation for Multi-Modal Trajectory Prediction | Sajad Marvi et.al. | 2503.05274 | null |
2025-03-07 | Discrete Contrastive Learning for Diffusion Policies in Autonomous Driving | Kalle Kujanpää et.al. | 2503.05229 | null |
2025-03-07 | A Comprehensive LLM-powered Framework for Driving Intelligence Evaluation | Shanhe You et.al. | 2503.05164 | null |
2025-03-07 | An End-to-End Learning-Based Multi-Sensor Fusion for Autonomous Vehicle Localization | Changhong Lin et.al. | 2503.05088 | null |
2025-03-07 | Adaptive-LIO: Enhancing Robustness and Precision through Environmental Adaptation in LiDAR Inertial Odometry | Chengwei Zhao et.al. | 2503.05077 | link |
2025-03-06 | Quantifying and Modeling Driving Styles in Trajectory Forecasting | Laura Zheng et.al. | 2503.04994 | null |
2025-03-06 | INTENT: Trajectory Prediction Framework with Intention-Guided Contrastive Clustering | Yihong Tang et.al. | 2503.04952 | null |
2025-03-06 | Metadata-free Georegistration of Ground and Airborne Imagery | Adam Bredvik et.al. | 2503.04927 | null |
2025-03-06 | Manboformer: Learning Gaussian Representations via Spatial-temporal Attention Mechanism | Ziyue Zhao et.al. | 2503.04863 | null |
2025-03-05 | RTFusion: A depth estimation network based on multimodal fusion in challenging scenarios | Zelin Meng et.al. | 2503.04821 | null |
2025-03-06 | Research on a Driver’s Perceived Risk Prediction Model Considering Traffic Scene Interaction | Chenhao Yang et.al. | 2503.04516 | null |
2025-03-06 | Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes | Hui Zhang et.al. | 2503.04235 | null |
2025-03-06 | Simulation-based Analysis Of Highway Trajectory Planning Using High-Order Polynomial For Highly Automated Driving Function | Milin Patel et.al. | 2503.04159 | link |
2025-03-06 | Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering | Idris O. Sunmola et.al. | 2503.04079 | null |
2025-03-06 | H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision | Yunxiao Shi et.al. | 2503.04059 | null |
2025-03-05 | Enhancing Autonomous Driving Safety with Collision Scenario Integration | Zi Wang et.al. | 2503.03957 | null |
2025-03-05 | LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation | Qian Feng et.al. | 2503.03890 | null |
2025-03-03 | A Survey on Semantic Communications in Internet of Vehicles | Sha Ye et.al. | 2503.03767 | null |
2025-03-05 | CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization | Junhao Xu et.al. | 2503.03430 | link |
2025-03-05 | Trajectory Prediction for Autonomous Driving: Progress, Limitations, and Future Directions | Nadya Abdel Madjid et.al. | 2503.03262 | null |
2025-03-06 | Don’t Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving | Ziying Song et.al. | 2503.03125 | link |
2025-03-05 | Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria | Asma A. Almutairi et.al. | 2503.03100 | null |
2025-03-05 | BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving | Katharina Winter et.al. | 2503.03074 | link |
2025-03-04 | Text2Scenario: Text-Driven Scenario Generation for Autonomous Driving Test | Xuan Cai et.al. | 2503.02911 | null |
2025-03-04 | Federated Learning for Privacy-Preserving Feedforward Control in Multi-Agent Systems | Jakob Weber et.al. | 2503.02693 | link |
2025-03-04 | State of play and future directions in industrial computer vision AI standards | Artemis Stefanidou et.al. | 2503.02675 | null |
2025-03-04 | Human-aligned Safe Reinforcement Learning for Highway On-Ramp Merging in Dense Traffic | Yang Li et.al. | 2503.02624 | link |
2025-03-04 | TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping | Xinying Hong et.al. | 2503.02578 | link |
2025-03-04 | 2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting | Qipeng Yan et.al. | 2503.02452 | null |
2025-03-04 | PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers | Wooju Lee et.al. | 2503.02388 | null |
2025-03-04 | Diffusion-Based mmWave Radar Point Cloud Enhancement Driven by Range Images | Ruixin Wu et.al. | 2503.02300 | null |
2025-03-04 | Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views | Yingji Zhong et.al. | 2503.02230 | null |
2025-03-04 | Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints | Yan Miao et.al. | 2503.02198 | null |
2025-03-03 | Data Augmentation for NeRFs in the Low Data Limit | Ayush Gaggar et.al. | 2503.02092 | null |
2025-03-03 | Road Boundary Detection Using 4D mmWave Radar for Autonomous Driving | Yuyan Wu et.al. | 2503.01930 | null |
2025-02-21 | Interaction-Aware Model Predictive Decision-Making for Socially-Compliant Autonomous Driving in Mixed Urban Traffic Scenarios | Balint Varga et.al. | 2503.01852 | null |
2025-03-03 | Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models | Jay Zhangjie Wu et.al. | 2503.01774 | null |
2025-03-03 | ECG-EmotionNet: Nested Mixture of Expert (NMoE) Adaptation of ECG-Foundation Model for Driver Emotion Recognition | Nastaran Mansourian et.al. | 2503.01750 | null |
2025-03-05 | Perceptual Motor Learning with Active Inference Framework for Robust Lateral Control | Elahe Delavari et.al. | 2503.01676 | null |
2025-03-03 | CAPS: Context-Aware Priority Sampling for Enhanced Imitation Learning in Autonomous Driving | Hamidreza Mirkhani et.al. | 2503.01650 | null |
2025-03-03 | DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution | Xingyuan Li et.al. | 2503.01187 | link |
2025-03-03 | Privacy-preserving Machine Learning in Internet of Vehicle Applications: Fundamentals, Recent Advances, and Future Direction | Nazmul Islam et.al. | 2503.01089 | null |
2025-03-02 | Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching | Jinyu Miao et.al. | 2503.00862 | null |
2025-03-02 | CARIL: Confidence-Aware Regression in Imitation Learning for Autonomous Driving | Elahe Delavari et.al. | 2503.00783 | link |
2025-03-02 | Enhancing Monocular 3D Scene Completion with Diffusion Model | Changlin Song et.al. | 2503.00726 | link |
2025-03-06 | Dur360BEV: A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving | Wenke E et.al. | 2503.00675 | link |
2025-03-01 | Actor-Critic Cooperative Compensation to Model Predictive Control for Off-Road Autonomous Vehicles Under Unknown Dynamics | Prakhar Gupta et.al. | 2503.00577 | null |
2025-02-28 | SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models | Jiawei Zhang et.al. | 2503.00211 | null |
2025-02-26 | Glad: A Streaming Scene Generator for Autonomous Driving | Bin Xie et.al. | 2503.00045 | null |
2025-02-28 | Dynamically Local-Enhancement Planner for Large-Scale Autonomous Driving | Nanshan Deng et.al. | 2502.21134 | null |
2025-02-28 | AuthSim: Towards Authentic and Effective Safety-critical Scenario Generation for Autonomous Driving Tests | Yukuan Yang et.al. | 2502.21100 | null |
2025-02-28 | Multimodal Learning for Just-In-Time Software Defect Prediction in Autonomous Driving Systems | Faisal Mohammad et.al. | 2502.20806 | null |
2025-02-28 | WorldModelBench: Judging Video Generation Models As World Models | Dacheng Li et.al. | 2502.20694 | null |
2025-02-28 | EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering | John J. Han et.al. | 2502.20669 | null |
2025-02-28 | LV-DOT: LiDAR-visual dynamic obstacle detection and tracking for autonomous robot navigation | Zhefan Xu et.al. | 2502.20607 | link |
2025-03-01 | VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers | Ziang Guo et.al. | 2502.20108 | null |
2025-02-27 | Minds on the Move: Decoding Trajectory Prediction in Autonomous Driving with Cognitive Insights | Haicheng Liao et.al. | 2502.20084 | null |
2025-02-28 | SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation | Zijie Zhou et.al. | 2502.20077 | link |
2025-03-25 | Identity-preserving Distillation Sampling by Fixed-Point Iterator | SeonHwa Kim et.al. | 2502.19930 | null |
2025-03-24 | CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-scale Reinforcement Learning in Autonomous Driving | Dongkun Zhang et.al. | 2502.19908 | null |
2025-02-27 | Shared Autonomy for Proximal Teaching | Megha Srivastava et.al. | 2502.19899 | null |
2025-02-27 | NeRFCom: Feature Transform Coding Meets Neural Radiance Field for Free-View 3D Scene Semantic Transmission | Weijie Yue et.al. | 2502.19873 | null |
2025-03-15 | You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving | Guangfeng Jiang et.al. | 2502.19698 | null |
2025-03-24 | BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance | Xin Ye et.al. | 2502.19694 | null |
2025-02-27 | Unveiling Security Weaknesses in Autonomous Driving Systems: An In-Depth Empirical Study | Wenyuan Cheng et.al. | 2502.19687 | null |
2025-02-26 | Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras | Hoonhee Cho et.al. | 2502.19630 | link |
2025-02-26 | Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions | Muhammad Salman Ali et.al. | 2502.19457 | null |
2025-02-26 | Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? | Adam Celarek et.al. | 2502.19318 | link |
2025-03-02 | EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving in the Arab Gulf Region | Nadya Abdel Madjid et.al. | 2502.19260 | link |
2025-02-26 | Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach | Anton Backhaus et.al. | 2502.19177 | null |
2025-02-26 | The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields | Ziyuan Luo et.al. | 2502.19125 | null |
2025-02-26 | Learning Autonomy: Off-Road Navigation Enhanced by Human Input | Akhil Nagariya et.al. | 2502.18760 | null |
2025-02-25 | Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces | Amirhossein Roknilamouki et.al. | 2502.18655 | null |
2025-02-25 | VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion | Pei Liu et.al. | 2502.18042 | null |
2025-02-25 | Exploring the Effects of Traditional Chinese Medicine Scents on Mitigating Driving Fatigue | Nengyue Su et.al. | 2502.18013 | null |
2025-02-25 | InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer | Bo Zhang et.al. | 2502.17949 | null |
2025-02-25 | VVRec: Reconstruction Attacks on DL-based Volumetric Video Upstreaming via Latent Diffusion Model with Gamma Distribution | Rui Lu et.al. | 2502.17880 | null |
2025-02-26 | Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking | Peng Zhang et.al. | 2502.17822 | null |
2025-02-25 | CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems | Rui Liu et.al. | 2502.17821 | null |
2025-03-04 | CalibRefine: Deep Learning-Based Online Automatic Targetless LiDAR-Camera Calibration with Iterative and Attention-Driven Post-Refinement | Lei Cheng et.al. | 2502.17648 | link |
2025-02-25 | GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow | Simon Boeder et.al. | 2502.17288 | null |
2025-02-24 | Semantic Neural Radiance Fields for Multi-Date Satellite Data | Valentin Wagner et.al. | 2502.16992 | link |
2025-02-24 | MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation | Jiehao Luo et.al. | 2502.16907 | link |
2025-02-24 | Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances | Yaozu Wu et.al. | 2502.16804 | null |
2025-02-25 | AUKT: Adaptive Uncertainty-Guided Knowledge Transfer with Conformal Prediction | Rui Liu et.al. | 2502.16736 | null |
2025-02-23 | ViSNeRF: Efficient Multidimensional Neural Radiance Field Representation for Visualization Synthesis of Dynamic Volumetric Scenes | Siyuan Yao et.al. | 2502.16731 | link |
2025-02-25 | Co-MTP: A Cooperative Trajectory Prediction Framework with Multi-Temporal Fusion for Autonomous Driving | Xinyu Zhang et.al. | 2502.16589 | link |
2025-02-23 | An Expert Ensemble for Detecting Anomalous Scenes, Interactions, and Behaviors in Autonomous Driving | Tianchen Ji et.al. | 2502.16389 | null |
2025-02-22 | AquaNeRF: Neural Radiance Fields in Underwater Media with Distractor Removal | Luca Gough et.al. | 2502.16351 | null |
2025-02-22 | A Brain-Inspired Perception-Decision Driving Model Based on Neural Pathway Anatomical Alignment | Haidong Wang et.al. | 2502.16027 | null |
2025-02-22 | Cross-Model Transferability of Adversarial Patches in Real-time Segmentation for Autonomous Driving | Prashant Shekhar et.al. | 2502.16012 | link |
2025-02-21 | Computation Offloading Strategies in Integrated Terrestrial and Non-Terrestrial Networks | Muhammad Ahmed Mohsin et.al. | 2502.15903 | null |
2025-02-20 | Getting SMARTER for Motion Planning in Autonomous Driving Systems | Montgomery Alban et.al. | 2502.15824 | link |
2025-02-21 | VaViM and VaVAM: Autonomous Driving through Video Generative Modeling | Florent Bartoccioni et.al. | 2502.15672 | link |
2025-02-24 | Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis | Ziqian Ni et.al. | 2502.15635 | null |
2025-02-21 | Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection | Yue Sun et.al. | 2502.15516 | null |
2025-03-11 | Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection | Jiangyong Yu et.al. | 2502.15488 | null |
2025-02-21 | A modular risk concept for complex systems | Dag McGeorge et.al. | 2502.15482 | null |
2025-02-21 | Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence | Yufeng Diao et.al. | 2502.15472 | null |
2025-03-10 | OccLinker: Deflickering Occupancy Networks through Lightweight Spatio-Temporal Correlation | Fengcheng Yu et.al. | 2502.15438 | null |
2025-02-21 | Enhancing Vehicle Make and Model Recognition with 3D Attention Modules | Narges Semiromizadeh et.al. | 2502.15398 | null |
2025-02-26 | PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments | Yueting Liu et.al. | 2502.15342 | link |
2025-02-21 | OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework | Junliang Chen et.al. | 2502.15180 | link |
2025-02-21 | CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models | Zihao Sheng et.al. | 2502.15119 | null |
2025-02-20 | Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios | Richard Marcus et.al. | 2502.15076 | link |
2025-02-19 | Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning | Rui Zhao et.al. | 2502.14917 | null |
2025-02-17 | CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Zhe Huang et.al. | 2502.14891 | link |
2025-03-04 | AVD2: Accident Video Diffusion for Accident Video Description | Cheng Li et.al. | 2502.14801 | null |
2025-02-20 | RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation | Henrique Piñeiro Monteagudo et.al. | 2502.14792 | null |
2025-02-23 | Real-world Troublemaker: A 5G Cloud-controlled Track Testing Framework for Automated Driving Systems in Safety-critical Interaction Scenarios | Xinrui Zhang et.al. | 2502.14574 | null |
2025-02-20 | Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance | Meng Wang et.al. | 2502.14520 | null |
2025-02-20 | CrossFuse: Learning Infrared and Visible Image Fusion by Cross-Sensor Top-K Vision Alignment and Beyond | Yukai Shi et.al. | 2502.14493 | null |
2025-02-20 | Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2502.14416 | null |
2025-02-20 | ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 | Tianyou Jiang et.al. | 2502.14314 | null |
2025-02-20 | OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images | Zhichao Zheng et.al. | 2502.14279 | null |
2025-02-20 | OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving | Yedong Shen et.al. | 2502.14235 | null |
2025-02-20 | NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis | Xiaoxing Liu et.al. | 2502.14178 | null |
2025-02-19 | GlossGau: Efficient Inverse Rendering for Glossy Surface with Anisotropic Spherical Gaussian | Bang Du et.al. | 2502.14129 | null |
2025-02-19 | SegRet: An Efficient Design for Semantic Segmentation with Retentive Network | Zhiyuan Li et.al. | 2502.14014 | link |
2025-02-19 | MEX: Memory-efficient Approach to Referring Multi-Object Tracking | Huu-Thien Tran et.al. | 2502.13875 | null |
2025-02-18 | GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis | Pedro Martin et.al. | 2502.13196 | null |
2025-02-18 | Uncertain Multi-Objective Recommendation via Orthogonal Meta-Learning Enhanced Bayesian Optimization | Hongxu Wang et.al. | 2502.13180 | null |
2025-02-18 | RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning | Hao Gao et.al. | 2502.13144 | null |
2025-02-18 | Fragility-aware Classification for Understanding Risk and Improving Generalization | Chen Yang et.al. | 2502.13024 | null |
2025-02-18 | RadSplatter: Extending 3D Gaussian Splatting to Radio Frequencies for Wireless Radiomap Extrapolation | Yiheng Wang et.al. | 2502.12686 | null |
2025-02-18 | ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition | Quoc-Anh Bui et.al. | 2502.12673 | null |
2025-02-17 | Detecting Systematic Weaknesses in Vision Models along Predefined Human-Understandable Dimensions | Sujan Sai Gannamaneni et.al. | 2502.12360 | null |
2025-02-16 | AI-Augmented Metamorphic Testing for Comprehensive Validation of Autonomous Vehicles | Tony Zhang et.al. | 2502.12208 | null |
2025-02-17 | Bandwidth-Adaptive Spatiotemporal Correspondence Identification for Collaborative Perception | Peng Gao et.al. | 2502.12098 | null |
2025-02-17 | 3D Gaussian Inpainting with Depth-Guided Cross-View Consistency | Sheng-Yu Huang et.al. | 2502.11801 | null |
2025-02-17 | Residual Learning towards High-fidelity Vehicle Dynamics Modeling with Transformer | Jinyu Miao et.al. | 2502.11800 | null |
2025-02-17 | MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction | Jingcheng Ni et.al. | 2502.11663 | link |
2025-02-17 | PrivilegedDreamer: Explicit Imagination of Privileged Information for Rapid Adaptation of Learned Policies | Morgan Byrd et.al. | 2502.11377 | null |
2025-02-17 | A Framework for Learning Scoring Rules in Autonomous Driving Planning Systems | Zikang Xiong et.al. | 2502.11352 | null |
2025-03-10 | NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing | Shutong Zhang et.al. | 2502.10720 | null |
2025-03-02 | Adaptive Neural Networks for Intelligent Data-Driven Development | Youssef Shoeb et.al. | 2502.10603 | null |
2025-02-14 | The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey | Sifan Tu et.al. | 2502.10498 | link |
2025-02-14 | Multi-view 3D surface reconstruction from SAR images by inverse rendering | Emile Barbier–Renard et.al. | 2502.10492 | null |
2025-02-14 | A Robust Attack: Displacement Backdoor Attack | Yong Li et.al. | 2502.10490 | null |
2025-02-13 | Knowledge Integration Strategies in Autonomous Vehicle Prediction and Planning: A Comprehensive Survey | Kumar Manas et.al. | 2502.10477 | null |
2025-02-12 | Deep Reinforcement Learning-Based User Scheduling for Collaborative Perception | Yandi Liu et.al. | 2502.10456 | null |
2025-02-11 | A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision | Hao Ai et.al. | 2502.10444 | null |
2025-02-17 | V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models | Hsu-kuang Chiu et.al. | 2502.09980 | null |
2025-02-14 | Dual Control for Interactive Autonomous Merging with Model Predictive Diffusion | Jacob Knaup et.al. | 2502.09918 | null |
2025-02-13 | Embed Any NeRF: Graph Meta-Networks for Neural Tasks on Arbitrary NeRF Architectures | Francesco Ballerini et.al. | 2502.09623 | null |
2025-02-13 | Rolling Ahead Diffusion for Traffic Scene Simulation | Yunpeng Liu et.al. | 2502.09587 | null |
2025-02-13 | Generalizable Reinforcement Learning with Biologically Inspired Hyperdimensional Occupancy Grid Maps for Exploration and Goal-Directed Path Planning | Shay Snyder et.al. | 2502.09393 | null |
2025-02-13 | FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation | Bin Yang et.al. | 2502.09274 | null |
2025-02-13 | LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement | Daocheng Fu et.al. | 2502.09170 | link |
2025-02-13 | Topo2Seq: Enhanced Topology Reasoning via Topology Sequence Learning | Yiming Yang et.al. | 2502.08974 | null |
2025-02-10 | Motion Forecasting for Autonomous Vehicles: A Survey | Jianxin Shi et.al. | 2502.08664 | null |
2025-02-14 | Deployment-friendly Lane-changing Intention Prediction Powered by Brain-inspired Spiking Neural Networks | Shuqi Shen et.al. | 2502.08659 | null |
2025-02-12 | MoDitector: Module-Directed Testing for Autonomous Driving Systems | Renzhi Wang et.al. | 2502.08504 | null |
2025-02-12 | AdvSwap: Covert Adversarial Perturbation with High Frequency Info-swapping for Autonomous Driving Perception | Yuanhao Huang et.al. | 2502.08374 | null |
2025-02-12 | Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision | Tianle Liu et.al. | 2502.08352 | null |
2025-02-12 | FixDrive: Automatically Repairing Autonomous Vehicle Driving Behaviour for $0.08 per Violation | Yang Sun et.al. | 2502.08260 | link |
2025-02-12 | End-to-End Predictive Planner for Autonomous Driving with Consistency Models | Anjian Li et.al. | 2502.08033 | null |
2025-02-10 | Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing | Sihao Wu et.al. | 2502.07829 | null |
2025-02-07 | CP-Guard+: A New Paradigm for Malicious Agent Detection and Defense in Collaborative Perception | Senkang Hu et.al. | 2502.07807 | null |
2025-02-11 | Divide and Merge: Motion and Semantic Learning in End-to-End Autonomous Driving | Yinzhe Shen et.al. | 2502.07631 | null |
2025-02-11 | Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving | Novendra Setyawan et.al. | 2502.07417 | null |
2025-02-11 | USRNet: Unified Scene Recovery Network for Enhancing Traffic Imaging under Multiple Adverse Weather Conditions | Yuxu Lu et.al. | 2502.07372 | link |
2025-02-11 | Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving | Xiang Li et.al. | 2502.07309 | link |
2025-02-11 | Online Aggregation of Trajectory Predictors | Alex Tong et.al. | 2502.07178 | null |
2025-02-06 | Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation | Namhee Kim et.al. | 2502.06843 | null |
2025-02-10 | Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene | Tai-Yu Pan et.al. | 2502.06682 | null |
2025-02-27 | A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems | Linxiao Gong et.al. | 2502.06581 | null |
2025-02-10 | Occ-LLM: Enhancing Autonomous Driving with Occupancy-Based Large Language Models | Tianshuo Xu et.al. | 2502.06419 | null |
2025-02-10 | Occlusion-Aware Contingency Safety-Critical Planning for Autonomous Vehicles | Lei Zheng et.al. | 2502.06359 | null |
2025-02-10 | Actual Achieved Gain and Optimal Perceived Gain: Modeling Human Take-over Decisions Towards Automated Vehicles’ Suggestions | Shuning Zhang et.al. | 2502.06179 | null |
2025-02-17 | Continual Adaptation for Autonomous Driving with the Mixture of Progressive Experts Network | Yixin Cui et.al. | 2502.05943 | null |
2025-02-09 | SphereFusion: Efficient Panorama Depth Estimation via Gated Fusion | Qingsong Yan et.al. | 2502.05859 | null |
2025-02-08 | GWRF: A Generalizable Wireless Radiance Field for Wireless Signal Propagation Modeling | Kang Yang et.al. | 2502.05708 | null |
2025-02-08 | TrackDiffuser: Nearly Model-Free Bayesian Filtering with Diffusion Model | Yangguang He et.al. | 2502.05629 | null |
2025-02-08 | Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture | Mitul Goswami et.al. | 2502.05476 | null |
2025-02-05 | VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning | Jayram Palamadai et.al. | 2502.05222 | null |
2025-02-12 | Safety at Scale: A Comprehensive Survey of Large Model Safety | Xingjun Ma et.al. | 2502.05206 | link |
2025-02-07 | Adaptive Learning-based Model Predictive Control Strategy for Drift Vehicles | Bei Zhou et.al. | 2502.04696 | null |
2025-02-05 | DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation | Luciano Baresi et.al. | 2502.04378 | link |
2025-02-05 | MapFusion: A Novel BEV Feature Fusion Network for Multi-modal Map Construction | Xiaoshuai Hao et.al. | 2502.04377 | null |
2025-02-06 | SMART: Advancing Scalable Map Priors for Driving Topology Reasoning | Junjie Ye et.al. | 2502.04329 | null |
2025-02-06 | Safeguarding connected autonomous vehicle communication: Protocols, intra- and inter-vehicular attacks and defenses | Mohammed Aledhari et.al. | 2502.04201 | null |
2025-02-06 | Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks | Yuhui Jin et.al. | 2502.03877 | null |
2025-02-06 | Optimized Unet with Attention Mechanism for Multi-Scale Semantic Segmentation | Xuan Li et.al. | 2502.03813 | null |
2025-02-06 | Reduce Lap Time for Autonomous Racing with Curvature-Integrated MPCC Local Trajectory Planning Method | Zhouheng Li et.al. | 2502.03695 | link |
2025-02-05 | Vehicle Routing Problems in the Age of Semi-Autonomous Driving | Hins Hu et.al. | 2502.03655 | null |
2025-02-05 | Robust Autonomy Emerges from Self-Play | Marco Cusumano-Towner et.al. | 2502.03349 | null |
2025-02-05 | A Scalable Approach to Probabilistic Neuro-Symbolic Verification | Vasileios Manginas et.al. | 2502.03274 | null |
2025-02-05 | Driver Assistance System Based on Multimodal Data Hazard Detection | Long Zhouxiang et.al. | 2502.03005 | null |
2025-02-05 | Label Anything: An Interpretable, High-Fidelity and Prompt-Free Annotator | Wei-Bin Kou et.al. | 2502.02972 | null |
2025-02-04 | SD++: Enhancing Standard Definition Maps by Incorporating Road Knowledge using LLMs | Hitvarth Diwanji et.al. | 2502.02773 | null |
2025-02-04 | SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification | Yifu Tao et.al. | 2502.02657 | null |
2025-02-04 | Anytime Incremental $ρ$ POMDP Planning in Continuous Spaces | Ron Benchetrit et.al. | 2502.02549 | null |
2025-02-04 | Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks | Huiqun Huang et.al. | 2502.02537 | null |
2025-02-04 | MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning | Shengbo Gu et.al. | 2502.02372 | null |
2025-02-04 | Geometric Neural Process Fields | Wenzhe Yin et.al. | 2502.02338 | null |
2025-02-04 | Event-aided Semantic Scene Completion | Shangwei Guo et.al. | 2502.02334 | link |
2025-02-04 | Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features | Hsin-Cheng Lu et.al. | 2502.02322 | link |
2025-02-04 | Risk-Aware Driving Scenario Analysis with Large Language Models | Yuan Gao et.al. | 2502.02145 | link |
2025-02-04 | Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification | Rudolf Reiter et.al. | 2502.02133 | null |
2025-02-04 | From Accidents to Insights: Leveraging Multimodal Data for Scenario-Driven ADS Testing | Siwei Luo et.al. | 2502.02025 | null |
2025-02-04 | Toward a Low-Cost Perception System in Autonomous Vehicles: A Spectrum Learning Approach | Mohammed Alsakabi et.al. | 2502.01940 | null |
2025-02-04 | A Comprehensive Study of Bug-Fix Patterns in Autonomous Driving Systems | Yuntianyi Chen et.al. | 2502.01937 | null |
2025-02-04 | SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset | Goodarz Mehr et.al. | 2502.01894 | link |
2025-02-03 | Reliability-Driven LiDAR-Camera Fusion for Robust 3D Object Detection | Reza Sadeghian et.al. | 2502.01856 | null |
2025-02-01 | Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding | Jingming Xia et.al. | 2502.01666 | null |
2025-02-20 | TeLL-Drive: Enhancing Autonomous Driving with Teacher LLM-Guided Deep Reinforcement Learning | Chengkai Xu et.al. | 2502.01387 | null |
2025-02-02 | SAM-guided Pseudo Label Enhancement for Multi-modal 3D Semantic Segmentation | Mingyu Yang et.al. | 2502.00960 | null |
2025-02-02 | VLM-Assisted Continual learning for Visual Question Answering in Self-Driving | Yuxin Lin et.al. | 2502.00843 | null |
2025-02-01 | FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps | Maximilian Leitenstern et.al. | 2502.00395 | link |
2025-02-04 | INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation | Dianwei Chen et.al. | 2502.00262 | null |
2025-01-31 | SpikingRTNH: Spiking Neural Network for 4D Radar Object Detection | Dong-Hee Paek et.al. | 2502.00074 | link |
2025-01-31 | Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields | Xingyu Miao et.al. | 2501.19084 | link |
2025-01-31 | Quantum Internet Use Case Analysis for the Automotive Industry | K. L. van der Enden et.al. | 2501.19070 | null |
2025-01-31 | SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging | Javier Montalvo et.al. | 2501.19035 | link |
2025-01-31 | Open-Source Autonomous Driving Software Platforms: Comparison of Autoware and Apollo | Hee-Yang Jung et.al. | 2501.18942 | null |
2025-01-27 | Deformable Beta Splatting | Rong Liu et.al. | 2501.18630 | link |
2025-01-24 | STAMP: Scalable Task And Model-agnostic Collaborative Perception | Xiangbo Gao et.al. | 2501.18616 | link |
2025-01-30 | IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain | Zhe Wang et.al. | 2501.18162 | null |
2025-01-30 | DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems | Se-Wook Yoo et.al. | 2501.18086 | null |
2025-01-29 | TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection | Lei Cheng et.al. | 2501.17977 | link |
2025-01-23 | Ranging Performance Analysis in Automotive DToF Lidars | Xiao Guo et.al. | 2501.17884 | null |
2025-01-29 | SSF: Sparse Long-Range Scene Flow for Autonomous Driving | Ajinkya Khoche et.al. | 2501.17821 | link |
2025-01-28 | Anomaly Detection in Cooperative Vehicle Perception Systems under Imperfect Communication | Ashish Bastola et.al. | 2501.17329 | link |
2025-01-28 | A Contrastive Teacher-Student Framework for Novelty Detection under Style Shifts | Hossein Mirzaei et.al. | 2501.17289 | null |
2025-01-28 | Scenario Understanding of Traffic Scenes Through Large Visual Language Models | Rivera Esteban et.al. | 2501.17131 | null |
2025-01-28 | Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework | Longzhong Lin et.al. | 2501.17015 | null |
2025-02-27 | The Third Moment of AI Ethics: Developing Relatable and Contextualized Tools | Sarah Hladikova et.al. | 2501.16954 | null |
2025-01-28 | Target-driven Self-Distillation for Partial Observed Trajectories Forecasting | Pengfei Zhu et.al. | 2501.16767 | null |
2025-01-28 | Overcoming Semantic Dilution in Transformer-Based Next Frame Prediction | Hy Nguyen et.al. | 2501.16753 | null |
2025-01-28 | Dream to Drive with Predictive Individual World Model | Yinfeng Gao et.al. | 2501.16733 | link |
2025-01-28 | SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation | Jianing Li et.al. | 2501.16684 | link |
2025-01-27 | Modular Framework for Uncertainty Prediction in Autonomous Vehicle Motion Forecasting within Complex Traffic Scenarios | Han Wang et.al. | 2501.16480 | null |
2025-01-18 | Risk-Informed Diffusion Transformer for Long-Tail Trajectory Prediction in the Crash Scenario | Junlan Chen et.al. | 2501.16349 | null |
2025-02-08 | Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge | Anh-Kiet Duong et.al. | 2501.16037 | link |
2025-01-27 | LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models | Yuewen Mei et.al. | 2501.15850 | null |
2025-02-09 | Diffusion-Based Planning for Autonomous Driving with Flexible Guidance | Yinan Zheng et.al. | 2501.15564 | null |
2025-01-26 | Mitigating Spurious Negative Pairs for Robust Industrial Anomaly Detection | Hossein Mirzaei et.al. | 2501.15434 | link |
2025-03-03 | Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception | Lianqing Zheng et.al. | 2501.15394 | null |
2025-01-26 | MetaOcc: Surround-View 4D Radar and Camera Fusion Framework for 3D Occupancy Prediction with Dual Training Strategies | Long Yang et.al. | 2501.15384 | link |
2025-01-29 | Towards Robust Unsupervised Attention Prediction in Autonomous Driving | Mengshi Qi et.al. | 2501.15045 | null |
2025-01-24 | AI-driven Wireless Positioning: Fundamentals, Standards, State-of-the-art, and Challenges | Guangjin Pan et.al. | 2501.14970 | null |
2025-01-24 | Performance Evaluation of Satellite-Based Data Offloading on Starlink Constellations | Alexander Bonora et.al. | 2501.14878 | null |
2025-01-24 | HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Xin Zhou et.al. | 2501.14729 | link |
2025-01-24 | 3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving | Jules Sanchez et.al. | 2501.14605 | link |
2025-01-24 | Deep-BrownConrady: Prediction of Camera Calibration and Distortion Parameters Using Deep Learning and Synthetic Data | Faiz Muhammad Chaudhry et.al. | 2501.14510 | null |
2025-01-24 | MARL-OT: Multi-Agent Reinforcement Learning Guided Online Fuzzing to Detect Safety Violation in Autonomous Driving Systems | Linfeng Liang et.al. | 2501.14451 | null |
2025-02-05 | GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting | Junzhe Jiang et.al. | 2501.13971 | link |
2025-01-23 | Black-Box Adversarial Attack on Vision Language Models for Autonomous Driving | Lu Wang et.al. | 2501.13563 | null |
2025-01-23 | Text-driven Online Action Detection | Manuel Benavent-Lledo et.al. | 2501.13518 | link |
2025-01-23 | Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything | Huilin Yin et.al. | 2501.13461 | null |
2025-01-23 | GeomGS: LiDAR-Guided Geometry-Aware Gaussian Splatting for Robot Localization | Jaewon Lee et.al. | 2501.13417 | null |
2025-01-22 | QuFeX: Quantum feature extraction module for hybrid quantum-classical deep neural networks | Naman Jain et.al. | 2501.13165 | null |
2025-01-22 | Neural Radiance Fields for the Real World: A Survey | Wenhui Xiao et.al. | 2501.13104 | null |
2025-01-23 | AdaWM: Adaptive World Model based Planning for Autonomous Driving | Hang Wang et.al. | 2501.13072 | null |
2025-01-22 | Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning | Xiaolei Chen et.al. | 2501.12799 | null |
2025-01-22 | PPO-Based Vehicle Control for Ramp Merging Scheme Assisted by Enhanced C-V2X | Qiong Wu et.al. | 2501.12656 | link |
2025-02-02 | DWTNeRF: Boosting Few-shot Neural Radiance Fields via Discrete Wavelet Transform | Hung Nguyen et.al. | 2501.12637 | null |
2025-01-22 | Improved Detection and Diagnosis of Faults in Deep Neural Networks Using Hierarchical and Explainable Classification | Sigma Jahan et.al. | 2501.12560 | null |
2025-01-20 | Egoistic MDS-based Rigid Body Localization | Niclas Führling et.al. | 2501.12417 | null |
2025-03-03 | RALAD: Bridging the Real-to-Sim Domain Gap in Autonomous Driving with Retrieval-Augmented Learning | Jiacheng Zuo et.al. | 2501.12296 | link |
2025-01-21 | Video Deblurring by Sharpness Prior Detection and Edge Information | Yang Tian et.al. | 2501.12246 | link |
2025-01-21 | RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression | Uri Gadot et.al. | 2501.12216 | null |
2025-01-21 | Select2Drive: Pragmatic Communications for Real-Time Collaborative Autonomous Driving | Jiahao Huang et.al. | 2501.12040 | null |
2025-01-21 | Make Full Use of Testing Information: An Integrated Accelerated Testing and Evaluation Method for Autonomous Driving Systems | Xinzheng Wu et.al. | 2501.11924 | null |
2025-01-21 | Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging | Shuyi Hu et.al. | 2501.11884 | null |
2025-01-21 | Survey on Monocular Metric Depth Estimation | Jiuling Zhang et.al. | 2501.11841 | null |
2025-02-16 | A Survey of World Models for Autonomous Driving | Tuo Feng et.al. | 2501.11260 | null |
2025-01-19 | Car-GS: Addressing Reflective and Transparent Surface Challenges in 3D Car Reconstruction | Congcong Li et.al. | 2501.11020 | null |
2025-01-18 | Efficient and Safe Trajectory Planning for Autonomous Agricultural Vehicle Headland Turning in Cluttered Orchard Environments | Peng Wei et.al. | 2501.10636 | null |
2025-01-16 | Poxel: Voxel Reconstruction for 3D Printing | Ruixiang Cao et.al. | 2501.10474 | null |
2025-01-17 | MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection | Xiangyuan Peng et.al. | 2501.10266 | null |
2025-01-17 | Explainable artificial intelligence (XAI): from inherent explainability to large language models | Fuseini Mumuni et.al. | 2501.09967 | null |
2025-01-16 | Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing | Wenhan Wang et.al. | 2501.09866 | null |
2025-01-16 | Distilling Multi-modal Large Language Models for Autonomous Driving | Deepti Hegde et.al. | 2501.09757 | null |
2025-01-16 | The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning | Wonjun Jo et.al. | 2501.09485 | null |
2025-01-16 | MonoSOWA: Scalable monocular 3D Object detector Without human Annotations | Jan Skvrna et.al. | 2501.09481 | null |
2025-01-16 | RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection | Jianrui Shi et.al. | 2501.09465 | null |
2025-01-16 | Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective Scenes | Ji Shi et.al. | 2501.09460 | link |
2025-01-17 | On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Zichang Ge et.al. | 2501.09327 | link |
2025-01-16 | Modeling Language for Scenario Development of Autonomous Driving Systems | Toshiaki Aoki et.al. | 2501.09319 | null |
2025-01-15 | Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving | Tengpeng Li et.al. | 2501.08861 | link |
2025-01-16 | BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module | Dongzhihan Wang et.al. | 2501.08659 | null |
2025-01-14 | Decoding Interpretable Logic Rules from Neural Networks | Chuqin Geng et.al. | 2501.08281 | null |
2025-01-14 | LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Yukai Ma et.al. | 2501.08168 | null |
2025-01-14 | Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving | Guizhe Jin et.al. | 2501.08096 | null |
2025-01-27 | Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving | Mert Keser et.al. | 2501.08083 | null |
2025-01-13 | Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes | Yuhang Zhang et.al. | 2501.08072 | null |
2025-01-14 | GAC-Net_Geometric and attention-based Network for Depth Completion | Kuang Zhu et.al. | 2501.07988 | null |
2025-01-14 | A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition | Mingke Xiao et.al. | 2501.07808 | null |
2025-01-14 | HgPCN: A Heterogeneous Architecture for E2E Embedded Point Cloud Inference | Yiming Gao et.al. | 2501.07767 | null |
2025-01-16 | PO-GVINS: Tightly Coupled GNSS-Visual-Inertial Integration with Pose-Only Representation | Zhuo Xu et.al. | 2501.07259 | null |
2025-01-14 | SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting | Yue Hu et.al. | 2501.07015 | null |
2025-01-13 | LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models | Mozhgan Nasr Azadani et.al. | 2501.06986 | link |
2025-02-02 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
2025-01-12 | Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving | Haoxiang Gao et.al. | 2501.06680 | null |
2025-01-11 | Common Sense Is All You Need | Hugo Latapie et.al. | 2501.06642 | null |
2025-01-08 | NextStop: An Improved Tracker For Panoptic LIDAR Segmentation Data | Nirit Alkalay et.al. | 2501.06235 | null |
2025-02-23 | Leveraging Edge Intelligence and LLMs to Advance 6G-Enabled Internet of Automated Defense Vehicles | Murat Arda Onsu et.al. | 2501.06205 | null |
2025-01-10 | Vehicle-in-Virtual-Environment (VVE) Based Autonomous Driving Function Development and Evaluation Methodology for Vulnerable Road User Safety | Haochong Chen et.al. | 2501.06113 | null |
2025-01-10 | Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion | Sanjay Kumar et.al. | 2501.05997 | null |
2025-01-10 | UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping | Yanjie Li et.al. | 2501.05783 | null |
2025-01-10 | TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos | Korawat Charoenpitaks et.al. | 2501.05733 | link |
2025-01-09 | Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding | Mohammed Elhenawy et.al. | 2501.05566 | null |
2025-01-07 | Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud: A Survey | Jingyuan Tang et.al. | 2501.05473 | link |
2025-01-09 | The global consensus on the risk management of autonomous driving | Sebastian Krügel et.al. | 2501.05391 | null |
2025-01-09 | Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions | Shishir Muralidhara et.al. | 2501.05246 | null |
2025-01-09 | CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection | Xiang Zhang et.al. | 2501.05132 | null |
2025-01-09 | DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving | Xuran Zheng et.al. | 2501.05081 | null |
2025-01-09 | LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models | Zengqi Peng et.al. | 2501.05057 | null |
2025-01-09 | CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving | Bhargava Uppuluri et.al. | 2501.04982 | null |
2025-01-09 | AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data | Haoran Zhu et.al. | 2501.04969 | link |
2025-01-08 | Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions | Doaa Mahmud et.al. | 2501.04437 | null |
2025-01-08 | FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection | Guoxin Zhang et.al. | 2501.04373 | null |
2025-01-08 | H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving | Siran Chen et.al. | 2501.04302 | null |
2025-01-07 | NeRFs are Mirror Detectors: Using Structural Similarity for Multi-View Mirror Scene Reconstruction with 3D Surface Primitives | Leif Van Holland et.al. | 2501.04074 | link |
2025-01-07 | LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving | Lingdong Kong et.al. | 2501.04005 | null |
2025-01-07 | Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives | Shaoyuan Xie et.al. | 2501.04003 | link |
2025-01-07 | NeuralSVG: An Implicit Representation for Text-to-Vector Generation | Sagi Polaczek et.al. | 2501.03992 | null |
2025-01-19 | Image Segmentation: Inducing graph-based learning | Aryan Singh et.al. | 2501.03765 | link |
2025-01-07 | Hybrid Machine Learning Model with a Constrained Action Space for Trajectory Prediction | Alexander Fertig et.al. | 2501.03666 | null |
2025-01-08 | SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving | Xuewen Luo et.al. | 2501.03535 | null |
2025-01-06 | MObI: Multimodal Object Inpainting Using Diffusion Models | Alexandru Buburuzan et.al. | 2501.03173 | null |
2025-01-06 | 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation | Jiexi Zhong et.al. | 2501.02937 | null |
2025-01-06 | A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation | Toomas Tahves et.al. | 2501.02858 | null |
2025-01-07 | AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene | Chaoran Feng et.al. | 2501.02807 | null |
2025-01-13 | LDMapNet-U: An End-to-End System for City-Scale Lane-Level Map Updating | Deguo Xia et.al. | 2501.02763 | null |
2025-01-05 | UDMC: Unified Decision-Making and Control Framework for Urban Autonomous Driving with Motion Prediction of Traffic Participants | Haichao Liu et.al. | 2501.02530 | link |
2025-01-05 | GCP: Guarded Collaborative Perception with Spatial-Temporal Aware Malicious Agent Detection | Yihang Tao et.al. | 2501.02450 | null |
2025-01-04 | RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar | Liye Jia et.al. | 2501.02314 | link |
2025-01-01 | Communication Efficient Cooperative Edge AI via Event-Triggered Computation Offloading | You Zhou et.al. | 2501.02001 | null |
2025-01-03 | Evaluating Scenario-based Decision-making for Interactive Autonomous Driving Using Rational Criteria: A Survey | Zhen Tian et.al. | 2501.01886 | null |
2025-01-03 | Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models | Andrea Matteazzi et.al. | 2501.01761 | null |
2025-01-03 | Enhancing Large Vision Model in Street Scene Semantic Understanding through Leveraging Posterior Optimization Trajectory | Wei-Bin Kou et.al. | 2501.01710 | null |
2025-01-02 | MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception | Xiaoshuai Hao et.al. | 2501.01037 | null |
2025-01-03 | DreamDrive: Generative 4D Scene Modeling from Street View Images | Jiageng Mao et.al. | 2501.00601 | null |
2024-12-31 | Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method | Zhenpeng Huang et.al. | 2501.00584 | null |
2024-12-31 | Toward Information Theoretic Active Inverse Reinforcement Learning | Ondrej Bajgar et.al. | 2501.00381 | null |
2024-12-31 | Research on vehicle detection based on improved YOLOv8 network | Haocheng Guo et.al. | 2501.00300 | null |
2025-01-09 | Automotive Speed Estimation: Sensor Types and Error Characteristics from OBD-II to ADAS | Hany Ragab et.al. | 2501.00242 | link |
2024-12-31 | DecoratingFusion: A LiDAR-Camera Fusion Network with the Combination of Point-level and Feature-level Fusion | Zixuan Yin et.al. | 2501.00220 | null |
2024-12-30 | TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation | Shaoqing Xu et.al. | 2412.20911 | link |
2024-12-30 | DEMO: A Dynamics-Enhanced Learning Model for Multi-Horizon Trajectory Prediction in Autonomous Vehicles | Chengyue Wang et.al. | 2412.20784 | null |
2024-12-29 | MR-Occ: Efficient Camera-LiDAR 3D Semantic Occupancy Prediction Using Hierarchical Multi-Resolution Voxel Representation | Minjae Seong et.al. | 2412.20480 | null |
2024-12-29 | Bringing Objects to Life: 4D generation from 3D objects | Ohad Rahamim et.al. | 2412.20422 | null |
2024-12-28 | Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception | Athanasios Karagounis et.al. | 2412.20230 | null |
2024-12-28 | Multi-Modality Driven LoRA for Adverse Condition Depth Estimation | Guanglei Yang et.al. | 2412.20162 | null |
2024-12-28 | Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses | Xinru Wen et.al. | 2412.20154 | null |
2024-12-28 | DepthMamba with Adaptive Fusion | Zelin Meng et.al. | 2412.19964 | null |
2024-12-27 | Zero-shot Hazard Identification in Autonomous Driving: A Case Study on the COOOL Benchmark | Lukas Picek et.al. | 2412.19944 | null |
2024-12-30 | DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT | Xiaotao Hu et.al. | 2412.19505 | link |
2024-12-27 | Learning Radiance Fields from a Single Snapshot Compressive Image | Yunhao Li et.al. | 2412.19483 | null |
2024-12-30 | DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Yiyuan Liang et.al. | 2412.19458 | link |
2024-12-27 | MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios | Jiaqi Fan et.al. | 2412.19406 | link |
2025-01-05 | BeSplat: Gaussian Splatting from a Single Blurry Image and Event Stream | Gopi Raju Matta et.al. | 2412.19370 | null |
2024-12-26 | Generating Editable Head Avatars with 3D Gaussian GANs | Guohao Li et.al. | 2412.19149 | link |
2024-12-26 | MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo | Byeonggwon Lee et.al. | 2412.19130 | null |
2024-12-25 | TopoBDA: Towards Bezier Deformable Attention for Road Topology Understanding | Muhammet Esat Kalfaoglu et.al. | 2412.18951 | null |
2024-12-30 | HV-BEV: Decoupling Horizontal and Vertical Feature Sampling for Multi-View 3D Object Detection | Di Wu et.al. | 2412.18884 | null |
2024-12-25 | TSceneJAL: Joint Active Learning of Traffic Scenes for 3D Object Detection | Chenyang Lei et.al. | 2412.18870 | link |
2024-12-25 | Evaluating the Adversarial Robustness of Detection Transformers | Amirhossein Nazeri et.al. | 2412.18718 | null |
2024-12-24 | Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving | Hao Pang et.al. | 2412.18511 | null |
2024-12-24 | GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications | Zhenzhou Jin et.al. | 2412.18281 | null |
2024-12-24 | Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing | Suwesh Prasad Sah et.al. | 2412.18165 | link |
2024-12-24 | Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner | Aizierjiang Aiersilan et.al. | 2412.18086 | link |
2024-12-23 | AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data | Mirko Zaffaroni et.al. | 2412.18038 | link |
2024-12-23 | Multimodal Learning with Uncertainty Quantification based on Discounted Belief Fusion | Grigor Bezirganyan et.al. | 2412.18024 | link |
2025-02-05 | Causal Composition Diffusion Model for Closed-loop Traffic Generation | Haohong Lin et.al. | 2412.17920 | null |
2024-12-23 | Editing Implicit and Explicit Representations of Radiance Fields: A Survey | Arthur Hubert et.al. | 2412.17628 | null |
2024-12-23 | Exploring Dynamic Novel View Synthesis Technologies for Cinematography | Adrian Azzarelli et.al. | 2412.17532 | null |
2024-12-23 | DeepMF: Deep Motion Factorization for Closed-Loop Safety-Critical Driving Scenario Simulation | Yizhe Li et.al. | 2412.17487 | null |
2025-01-04 | Feature Based Methods in Domain Adaptation for Object Detection: A Review Paper | Helia Mohamadi et.al. | 2412.17325 | null |
2024-12-23 | OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving | Tianyi Yan et.al. | 2412.17226 | null |
2024-12-22 | NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors | Ziqi Zhou et.al. | 2412.16955 | link |
2024-12-22 | Lightweight Design and Optimization methods for DCNNs: Progress and Futures | Hanhua Long et.al. | 2412.16886 | null |
2024-12-22 | Phase-change metasurfaces for reconfigurable image processing | Tingting Liu et.al. | 2412.16856 | null |
2024-12-21 | Towards Selection and Transition Between Behavior-Based Neural Networks for Automated Driving | Iqra Aslam et.al. | 2412.16764 | null |
2024-12-21 | A Method for the Runtime Validation of AI-based Environment Perception in Automated Driving System | Iqra Aslam et.al. | 2412.16762 | null |
2024-12-21 | Application of Multimodal Large Language Models in Autonomous Driving | Md Robiul Islam et.al. | 2412.16410 | null |
2024-12-20 | Mapping the Mind of an Instruction-based Image Editing using SMILE | Zeinab Dehghani et.al. | 2412.16277 | link |
2025-02-14 | Autoware.Flex: Human-Instructed Dynamically Reconfigurable Autonomous Driving Systems | Ziwei Song et.al. | 2412.16265 | null |
2024-12-20 | Optimizing Low-Speed Autonomous Driving: A Reinforcement Learning Approach to Route Stability and Maximum Speed | Benny Bao-Sheng Li et.al. | 2412.16248 | null |
2024-12-18 | AdvIRL: Reinforcement Learning-Based Adversarial Attacks on 3D NeRF Models | Tommy Nguyen et.al. | 2412.16213 | link |
2024-12-17 | CLIP-RLDrive: Human-Aligned Autonomous Driving via CLIP-Based Reward Shaping in Reinforcement Learning | Erfan Doroudian et.al. | 2412.16201 | null |
2024-12-20 | NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems | Laura Weihl et.al. | 2412.16141 | null |
2024-12-20 | Camera-Based Localization and Enhanced Normalized Mutual Information | Vishnu Teja Kunde et.al. | 2412.16137 | null |
2025-01-11 | NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images | Yue Guo et.al. | 2412.15890 | null |
2024-12-20 | Sparse Point Clouds Assisted Learned Image Compression | Yiheng Jiang et.al. | 2412.15752 | null |
2024-12-20 | Mask-RadarNet: Enhancing Transformer With Spatial-Temporal Semantic Context for Radar Object Detection in Autonomous Driving | Yuzhi Wu et.al. | 2412.15595 | null |
2024-12-20 | VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving | Zilin Huang et.al. | 2412.15544 | null |
2024-12-26 | LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction | Pou-Chun Kung et.al. | 2412.15447 | null |
2024-12-18 | DreaMark: Rooting Watermark in Score Distillation Sampling Generated Neural Radiance Fields | Xingyu Zhu et.al. | 2412.15278 | null |
2025-02-14 | OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Shuo Xing et.al. | 2412.15208 | link |
2024-12-19 | AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Shuo Xing et.al. | 2412.15206 | link |
2024-12-19 | LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation | Chenxu Zhou et.al. | 2412.15199 | null |
2024-12-25 | Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models | Zijun Chen et.al. | 2412.14660 | link |
2024-12-19 | Bright-NeRF:Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images | Min Wang et.al. | 2412.14547 | null |
2024-12-19 | Drive-1-to-3: Enriching Diffusion Priors for Novel View Synthesis of Real Vehicles | Chuang Lin et.al. | 2412.14494 | null |
2024-12-19 | VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision | Yi Xu et.al. | 2412.14446 | null |
2025-02-12 | DriveGPT: Scaling Autoregressive Behavior Models for Driving | Xin Huang et.al. | 2412.14415 | null |
2024-12-17 | A Comprehensive Review on Traffic Datasets and Simulators for Autonomous Vehicles | Supriya Sarker et.al. | 2412.14207 | null |
2024-12-18 | Joint Perception and Prediction for Autonomous Driving: A Survey | Lucas Dal’Col et.al. | 2412.14088 | link |
2024-12-18 | GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians | Xiaobao Wei et.al. | 2412.13983 | link |
2025-02-04 | A Black-Box Evaluation Framework for Semantic Robustness in Bird’s Eye View Detection | Fu Wang et.al. | 2412.13913 | link |
2024-12-18 | Object Style Diffusion for Generalized Object Detection in Urban Scene | Hao Li et.al. | 2412.13815 | null |
2024-12-18 | SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems | Huiwen Yang et.al. | 2412.13802 | null |
2024-12-18 | An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training | Haiming Zhang et.al. | 2412.13772 | null |
2024-12-18 | Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration | Dominik Werner Wolf et.al. | 2412.13695 | null |
2024-12-18 | RelationField: Relate Anything in Radiance Fields | Sebastian Koch et.al. | 2412.13652 | link |
2024-12-18 | Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation | Xiaoqi An et.al. | 2412.13454 | link |
2024-12-18 | Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction | Chandra Raskoti et.al. | 2412.13419 | null |
2024-12-17 | Quantitative Predictive Monitoring and Control for Safe Human-Machine Interaction | Shuyang Dong et.al. | 2412.13365 | null |
2024-12-19 | SafeDrive: Knowledge- and Data-Driven Risk-Sensitive Decision-Making for Autonomous Vehicles with Large Language Models | Zhiyuan Zhou et.al. | 2412.13238 | null |
2024-12-24 | C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory Prediction | Zichen Wang et.al. | 2412.13231 | link |
2024-12-17 | GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Haoyi Jiang et.al. | 2412.13193 | link |
2024-12-17 | StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models | Yunzhi Yan et.al. | 2412.13188 | null |
2024-12-17 | Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks | Xiaxin Zhu et.al. | 2412.12843 | null |
2024-12-18 | Optimize the Unseen – Fast NeRF Cleanup with Free Space Prior | Leo Segre et.al. | 2412.12772 | null |
2024-12-17 | Open-World Panoptic Segmentation | Matteo Sodano et.al. | 2412.12740 | null |
2024-12-17 | MapExpert: Online HD Map Construction with Simple and Efficient Sparse Map Element Expert | Dapeng Zhang et.al. | 2412.12704 | null |
2024-12-17 | DriveTester: A Unified Platform for Simulation-Based Autonomous Driving Testing | Mingfei Cheng et.al. | 2412.12656 | link |
2024-12-17 | Improving the Transferability of 3D Point Cloud Attack via Spectral-aware Admix and Optimization Designs | Shiyu Hu et.al. | 2412.12626 | null |
2024-12-16 | Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset | Madiyar Alimov et.al. | 2412.12349 | null |
2024-12-16 | PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Cheng Zhang et.al. | 2412.12096 | link |
2024-12-16 | CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception | Senkang Hu et.al. | 2412.12000 | null |
2024-12-16 | Point Cloud-Assisted Neural Image Compression | Ziqun Li et.al. | 2412.11771 | null |
2024-12-16 | NEST: A Neuromodulated Small-world Hypergraph Trajectory Prediction Model for Autonomous Driving | Chengyue Wang et.al. | 2412.11682 | null |
2024-12-16 | DINO-Foresight: Looking into the Future with DINO | Efstathios Karypidis et.al. | 2412.11673 | link |
2024-12-16 | AEPHORA: AI/ML-Based Energy-Efficient Proactive Handover and Resource Allocation | Bowen Xie et.al. | 2412.11491 | null |
2024-12-16 | HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection | Zijian Gu et.al. | 2412.11489 | link |
2024-12-16 | Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents | Wonje Choi et.al. | 2412.11484 | null |
2024-12-16 | VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression | Qiang Hu et.al. | 2412.11362 | null |
2025-01-10 | ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction | Yi Feng et.al. | 2412.11210 | link |
2024-12-15 | GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control | Mariam Hassan et.al. | 2412.11198 | link |
2024-12-15 | RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models | Yujin Wang et.al. | 2412.11050 | null |
2024-12-15 | SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation | Hang Zhang et.al. | 2412.11026 | null |
2025-01-23 | OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving | Lianqing Zheng et.al. | 2412.10734 | null |
2024-12-11 | Automatic Image Annotation for Mapped Features Detection | Maxime Noizet et.al. | 2412.10438 | null |
2024-12-13 | GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction | Sicheng Zuo et.al. | 2412.10373 | link |
2024-12-13 | GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Wenzhao Zheng et.al. | 2412.10371 | link |
2024-12-13 | Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving | Zhihang Song et.al. | 2412.10033 | null |
2024-12-13 | NeRF-Texture: Synthesizing Neural Radiance Field Textures | Yi-Hua Huang et.al. | 2412.10004 | null |
2024-12-17 | WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model | Songyan Zhang et.al. | 2412.09951 | link |
2024-12-13 | Sharpening Your Density Fields: Spiking Neuron Aided Fast Geometry Learning | Yi Gu et.al. | 2412.09881 | null |
2024-12-13 | EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models | Hanchu Zhou et.al. | 2412.09782 | null |
2024-12-12 | PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields | Sean Wu et.al. | 2412.09680 | link |
2024-12-11 | Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model | Junqi You et.al. | 2412.09647 | null |
2024-12-12 | Doe-1: Closed-Loop Autonomous Driving with Large World Model | Wenzhao Zheng et.al. | 2412.09627 | link |
2024-12-13 | Hidden Biases of End-to-End Driving Datasets | Julian Zimmerlin et.al. | 2412.09602 | link |
2024-12-12 | Slope Considered Online Nonlinear Trajectory Planning with Differential Energy Model for Autonomous Driving | Zhaofeng Tian et.al. | 2412.09424 | null |
2024-12-12 | MMD-OPT : Maximum Mean Discrepancy Based Sample Efficient Collision Risk Minimization for Autonomous Driving | Basant Sharma et.al. | 2412.09121 | null |
2024-12-12 | DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving | Hao Lu et.al. | 2412.09043 | link |
2024-12-12 | EMATO: Energy-Model-Aware Trajectory Optimization for Autonomous Driving | Zhaofeng Tian et.al. | 2412.08830 | null |
2024-12-11 | Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning | Prajwal Koirala et.al. | 2412.08794 | null |
2024-12-11 | GPD-1: Generative Pre-training for Driving | Zixun Xie et.al. | 2412.08643 | link |
2024-12-11 | An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios | Leandro Parada et.al. | 2412.08562 | null |
2024-12-13 | Physical Informed Driving World Model | Zhuoran Yang et.al. | 2412.08410 | null |
2024-12-11 | Task-specific Self-body Controller Acquisition by Musculoskeletal Humanoids: Application to Pedal Control in Autonomous Driving | Kento Kawaharazuka et.al. | 2412.08270 | null |
2024-12-11 | Neural Observation Field Guided Hybrid Optimization of Camera Placement | Yihan Cao et.al. | 2412.08266 | link |
2024-12-18 | GN-FR:Generalizable Neural Radiance Fields for Flare Removal | Gopi Raju Matta et.al. | 2412.08200 | null |
2024-12-11 | Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | Zhigang Cen et.al. | 2412.08034 | null |
2024-12-11 | Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting | Zetong Yang et.al. | 2412.07768 | null |
2024-12-13 | DriveMM: All-in-One Large Multimodal Model for Autonomous Driving | Zhijian Huang et.al. | 2412.07689 | link |
2024-12-10 | Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Jiaqi Fan et.al. | 2412.07518 | link |
2024-12-10 | A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM | Zongbo Liao et.al. | 2412.07513 | null |
2024-12-10 | ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving | Rongqing Li et.al. | 2412.07369 | null |
2024-12-10 | EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering | Toshiya Yura et.al. | 2412.07293 | null |
2024-12-10 | Fast Occupancy Network | Mingjie Lu et.al. | 2412.07163 | null |
2024-12-09 | Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving | Xin Fei et.al. | 2412.06777 | link |
2024-12-14 | Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | Weichao Xu et.al. | 2412.06684 | null |
2024-12-09 | Prediction of Occluded Pedestrians in Road Scenes using Human-like Reasoning: Insights from the OccluRoads Dataset | Melo Castillo Angie Nataly et.al. | 2412.06549 | null |
2024-12-09 | PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting | Yihong Xu et.al. | 2412.06491 | null |
2025-01-02 | World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving | Mingliang Zhai et.al. | 2412.06324 | null |
2024-12-09 | Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction | Dongxu Wei et.al. | 2412.06273 | null |
2024-12-09 | Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization | Fei Yu et.al. | 2412.06208 | null |
2024-12-09 | AgentAlign: Misalignment-Adapted Multi-Agent Perception for Resilient Inter-Agent Sensor Correlations | Zonglin Meng et.al. | 2412.06142 | null |
2024-12-09 | HSDA: High-frequency Shuffle Data Augmentation for Bird’s-Eye-View Map Segmentation | Calvin Glisson et.al. | 2412.06127 | link |
2024-12-08 | GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion | Karlo Koledic et.al. | 2412.06080 | null |
2024-12-08 | Lightweight Spatial Embedding for Vision-based 3D Occupancy Prediction | Jinqing Zhang et.al. | 2412.05976 | null |
2024-12-08 | A Review on Multisensor Data Fusion for Wearable Health Monitoring | Arlene John et.al. | 2412.05895 | null |
2024-12-08 | doScenes: An Autonomous Driving Dataset with Natural Language Instruction for Human Interaction and Vision-Language Navigation | Parthib Roy et.al. | 2412.05893 | link |
2024-12-07 | Real-Time 3D Object Detection Using InnovizOne LiDAR and Low-Power Hailo-8 AI Accelerator | Itay Krispin-Avraham et.al. | 2412.05594 | link |
2024-12-06 | COOOL: Challenge Of Out-Of-Label A Novel Benchmark for Autonomous Driving | Ali K. AlShami et.al. | 2412.05462 | link |
2024-12-06 | UniScene: Unified Occupancy-centric Driving Scene Generation | Bohan Li et.al. | 2412.05435 | null |
2024-12-06 | Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images | Junno Yun et.al. | 2412.05341 | null |
2024-12-06 | ACT-Bench: Towards Action Controllable World Models for Autonomous Driving | Hidehisa Arai et.al. | 2412.05337 | null |
2024-12-03 | $ρ$ -NeRF: Leveraging Attenuation Priors in Neural Radiance Field for 3D Computed Tomography Reconstruction | Li Zhou et.al. | 2412.05322 | null |
2024-12-11 | Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model | Lening Wang et.al. | 2412.05280 | link |
2024-12-06 | Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Chaoda Zheng et.al. | 2412.05154 | link |
2024-12-06 | Backdooring Outlier Detection Methods: A Novel Attack Approach | ZeinabSadat Taghavi et.al. | 2412.05010 | null |
2024-12-11 | MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting | Peng Chen et.al. | 2412.04955 | link |
2025-01-20 | UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving | Rui Chen et.al. | 2412.04842 | link |
2024-12-06 | GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction | Yuanhui Huang et.al. | 2412.04384 | link |
2024-12-05 | Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird’s-Eye-View via Uncertainty Measure | Saheli Hazra et.al. | 2412.04337 | null |
2024-12-05 | YOLO-CCA: A Context-Based Approach for Traffic Sign Detection | Linfeng Jiang et.al. | 2412.04289 | link |
2024-12-05 | CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model | Ruoyu Yao et.al. | 2412.04209 | null |
2024-12-05 | UNCOVER: Unknown Class Object Detection for Autonomous Vehicles in Real-time | Lars Schmarje et.al. | 2412.03986 | null |
2024-12-05 | Learning Based MPC for Autonomous Driving Using a Low Dimensional Residual Model | Yaoyu Li et.al. | 2412.03874 | null |
2024-12-05 | Using Cooperative Co-evolutionary Search to Generate Metamorphic Test Cases for Autonomous Driving Systems | Hossein Yousefizadeh et.al. | 2412.03843 | null |
2024-12-05 | Safe Adaptive Cruise Control Under Perception Uncertainty: A Deep Ensemble and Conformal Tube Model Predictive Control Approach | Xiao Li et.al. | 2412.03792 | null |
2024-12-04 | Advancing Auto-Regressive Continuation for Video Frames | Ruibo Ming et.al. | 2412.03758 | null |
2024-12-04 | Designing DNNs for a trade-off between robustness and processing performance in embedded devices | Jon Gutiérrez-Zaballa et.al. | 2412.03682 | null |
2024-12-04 | Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective | Jon Gutiérrez-Zaballa et.al. | 2412.03630 | link |
2024-12-04 | Streaming Detection of Queried Event Start | Cristobal Eyzaguirre et.al. | 2412.03567 | link |
2024-12-04 | FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes | Lue Fan et.al. | 2412.03566 | null |
2024-12-09 | Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention | Hannan Lu et.al. | 2412.03520 | null |
2024-12-04 | Data Fusion of Semantic and Depth Information in the Context of Object Detection | Md Abu Yusuf et.al. | 2412.03490 | null |
2024-12-04 | NeRF and Gaussian Splatting SLAM in the Wild | Fabian Schmidt et.al. | 2412.03263 | link |
2024-12-04 | A Survey of Wireless Sensing Security from a Role-Based View: Victim, Weapon, and Shield | Ruixu Geng et.al. | 2412.03064 | link |
2024-12-04 | Lightweight Stochastic Video Prediction via Hybrid Warping | Kazuki Kotoyori et.al. | 2412.03061 | null |
2024-12-04 | Less is More: A Stealthy and Efficient Adversarial Attack Method for DRL-based Autonomous Driving Policies | Junchao Fan et.al. | 2412.03051 | null |
2024-12-03 | Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects | Abdurrahman Zeybey et.al. | 2412.02803 | null |
2024-12-09 | Synergistic Development of Perovskite Memristors and Algorithms for Robust Analog Computing | Nanyang Ye et.al. | 2412.02779 | null |
2024-12-13 | MVCTrack: Boosting 3D Point Cloud Tracking via Multimodal-Guided Virtual Cues | Zhaofeng Hu et.al. | 2412.02734 | link |
2024-12-03 | Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving | Yupeng Zheng et.al. | 2412.02689 | link |
2024-12-03 | Generating Critical Scenarios for Testing Automated Driving Systems | Trung-Hieu Nguyen et.al. | 2412.02574 | null |
2024-12-03 | RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians | Qiankun Gao et.al. | 2412.02493 | link |
2024-12-03 | Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction | Ziqian Zou et.al. | 2412.02395 | null |
2024-12-03 | Trajectory-based Road Autolabeling with Lidar-Camera Fusion in Winter Conditions | Eerik Alamikkotervo et.al. | 2412.02370 | link |
2024-12-03 | Underload: Defending against Latency Attacks for Object Detectors on Edge Devices | Tianyi Wang et.al. | 2412.02171 | null |
2024-12-02 | PKRD-CoT: A Unified Chain-of-thought Prompting for Multi-Modal Large Language Models in Autonomous Driving | Xuewen Luo et.al. | 2412.02025 | null |
2024-12-02 | HPRM: High-Performance Robotic Middleware for Intelligent Autonomous Systems | Jacky Kwok et.al. | 2412.01799 | null |
2024-12-02 | CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion | Kai He et.al. | 2412.01792 | null |
2024-12-02 | HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous Driving | Hongyu Zhou et.al. | 2412.01718 | null |
2024-12-02 | 6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting | Yufeng Jin et.al. | 2412.01543 | null |
2024-12-04 | InfinityDrive: Breaking Time Limits in Driving World Models | Xi Guo et.al. | 2412.01522 | null |
2024-12-03 | HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving | Zehuan Wu et.al. | 2412.01407 | null |
2024-12-02 | FedPAW: Federated Learning with Personalized Aggregation Weights for Urban Vehicle Speed Prediction | Yuepeng He et.al. | 2412.01281 | link |
2024-12-03 | Double-Directional V2V Channel Measurement using ReRoMA at 60 GHz | Hussein Hammoud et.al. | 2412.01165 | null |
2024-12-02 | STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation | Sunghun Yang et.al. | 2412.01090 | null |
2024-12-02 | LiDAR SLAMMOT based on Confidence-guided Data Association | Susu Fang et.al. | 2412.01041 | null |
2024-12-02 | Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control | Seongmin Park et.al. | 2412.01034 | null |
2024-12-01 | CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images | Jian Liu et.al. | 2412.00754 | null |
2024-12-01 | BDefects4NN: A Backdoor Defect Database for Controlled Localization Studies in Neural Networks | Yisong Xiao et.al. | 2412.00746 | null |
2024-12-01 | SEED4D: A Synthetic Ego–Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark | Marius Kästingschäfer et.al. | 2412.00730 | link |
2025-01-08 | Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning | Tianshuo Xu et.al. | 2412.00547 | link |
2024-11-30 | Density-aware Global-Local Attention Network for Point Cloud Segmentation | Chade Li et.al. | 2412.00489 | null |
2024-11-29 | $C^{3}$ -NeRF: Modeling Multiple Scenes via Conditional-cum-Continual Neural Radiance Fields | Prajwal Singh et.al. | 2411.19903 | null |
2024-11-29 | FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation | Chang Won Lee et.al. | 2411.19888 | null |
2024-11-29 | SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Philipp Wolters et.al. | 2411.19860 | null |
2024-11-29 | A Multi-Loss Strategy for Vehicle Trajectory Prediction: Combining Off-Road, Diversity, and Directional Consistency Losses | Ahmad Rahimi et.al. | 2411.19747 | link |
2024-11-29 | Gaussian Splashing: Direct Volumetric Rendering Underwater | Nir Mualem et.al. | 2411.19588 | null |
2024-11-29 | AdvFuzz: Finding More Violations Caused by the EGO Vehicle in Simulation Testing by Adversarial NPC Vehicles | You Lu et.al. | 2411.19567 | null |
2024-11-29 | ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration | Chaojun Ni et.al. | 2411.19548 | null |
2024-11-29 | Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook | Florinel-Alin Croitoru et.al. | 2411.19537 | link |
2024-12-23 | LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis | Tianqi Li et.al. | 2411.19525 | null |
2024-11-28 | Mapping Public Perception of Artificial Intelligence: Expectations, Risk-Benefit Tradeoffs, and Value As Determinants for Societal Acceptance | Philipp Brauner et.al. | 2411.19356 | null |
2024-11-28 | UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation | Yichong Lu et.al. | 2411.19292 | null |
2024-11-28 | SADG: Segment Any Dynamic Gaussian Without Object Trackers | Yun-Jin Li et.al. | 2411.19290 | link |
2024-11-28 | On-chip Hyperspectral Image Segmentation with Fully Convolutional Networks for Scene Understanding in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.19274 | null |
2024-11-28 | InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception | Haijie Li et.al. | 2411.19235 | null |
2024-11-28 | Visual SLAMMOT Considering Multiple Motion Models | Peilin Tian et.al. | 2411.19134 | null |
2024-11-28 | Synergizing Decision Making and Trajectory Planning Using Two-Stage Optimization for Autonomous Vehicles | Wenru Liu et.al. | 2411.18974 | null |
2024-11-28 | T2SG: Traffic Topology Scene Graph for Topology Reasoning in Autonomous Driving | Changsheng Lv et.al. | 2411.18894 | null |
2024-11-28 | Improving Batch Normalization with TTA for Robust Object Detection in Self-Driving | Dacheng Liao et.al. | 2411.18860 | null |
2024-11-27 | Surf-NeRF: Surface Regularised Neural Radiance Fields | Jack Naylor et.al. | 2411.18652 | null |
2024-11-30 | InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving | Xiyan Jiang et.al. | 2411.18302 | link |
2024-11-27 | Visual Adversarial Attack on Vision-Language Models for Autonomous Driving | Tianyuan Zhang et.al. | 2411.18275 | null |
2024-12-01 | From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Zizhao Li et.al. | 2411.18207 | link |
2024-12-16 | Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning | Di Zhang et.al. | 2411.18203 | null |
2024-11-27 | Edge-Assisted Accelerated Cooperative Sensing for CAVs: Task Placement and Resource Allocation | Yuxuan Wang et.al. | 2411.18129 | null |
2024-11-27 | FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback | Kangan Qian et.al. | 2411.18013 | null |
2024-11-26 | Stealthy Multi-Task Adversarial Attacks | Jiacheng Guo et.al. | 2411.17936 | null |
2024-11-26 | DECODE: Domain-aware Continual Domain Expansion for Motion Prediction | Boqi Li et.al. | 2411.17917 | link |
2024-11-26 | Multimodal Crash Likelihood Prediction: A Complexity-Infused Approach Integrating Semantic, Contextual, and Driving Features | Meng Wang et.al. | 2411.17886 | null |
2024-11-26 | OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection | Zhongyu Xia et.al. | 2411.17761 | link |
2024-11-26 | Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation | Niharika Hegde et.al. | 2411.17610 | null |
2024-11-26 | Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17543 | null |
2024-11-26 | HSI-Drive v2.0: More Data for New Challenges in Scene Understanding for Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17530 | null |
2024-11-26 | CoA: Chain-of-Action for Generative Semantic Labels | Meng Wei et.al. | 2411.17406 | link |
2024-11-26 | LHPF: Look back the History and Plan for the Future in Autonomous Driving | Sheng Wang et.al. | 2411.17253 | null |
2024-11-26 | MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields | Yixiong Yang et.al. | 2411.17235 | link |
2024-11-26 | Enhancing Lane Segment Perception and Topology Reasoning with Crowdsourcing Trajectory Priors | Peijin Jia et.al. | 2411.17161 | null |
2024-11-27 | SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving | Georg Hess et.al. | 2411.16816 | link |
2024-11-25 | One is Plenty: A Polymorphic Feature Interpreter for Immutable Heterogeneous Collaborative Perception | Yuchen Xia et.al. | 2411.16799 | null |
2024-11-25 | MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Vladimir Yugay et.al. | 2411.16785 | null |
2024-11-25 | SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models | Harsh Goel et.al. | 2411.16776 | null |
2024-11-23 | FollowGen: A Scaled Noise Conditional Diffusion Model for Car-Following Trajectory Prediction | Junwei You et.al. | 2411.16747 | null |
2024-11-23 | Gradient-Guided Parameter Mask for Multi-Scenario Image Restoration Under Adverse Weather | Jilong Guo et.al. | 2411.16739 | link |
2024-11-23 | Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method | Pan Yin et.al. | 2411.16733 | link |
2024-12-03 | Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification | S. P. Sharan et.al. | 2411.16718 | link |
2024-11-25 | Generating Out-Of-Distribution Scenarios Using Language Models | Erfan Aasi et.al. | 2411.16554 | null |
2024-11-25 | Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction | Haoming Li et.al. | 2411.16457 | null |
2024-11-25 | A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models | Manuel Schwonberg et.al. | 2411.16407 | null |
2024-11-25 | Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction | Ziyu Zhang et.al. | 2411.16392 | null |
2024-12-11 | Monocular Lane Detection Based on Deep Learning: A Survey | Xin He et.al. | 2411.16316 | link |
2024-11-25 | A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads | Rafael S. Toledo et.al. | 2411.16295 | link |
2024-11-25 | U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields | Vinayak Gupta et.al. | 2411.16172 | null |
2024-11-25 | End-to-End Steering for Autonomous Vehicles via Conditional Imitation Co-Learning | Mahmoud M. Kishky et.al. | 2411.16131 | null |
2024-11-25 | Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion | Jongseong Bae et.al. | 2411.16129 | null |
2024-11-24 | Performance Implications of Multi-Chiplet Neural Processing Units on Autonomous Driving Perception | Mohanad Odema et.al. | 2411.16007 | null |
2024-11-24 | SARS: A Resource Selection Algorithm for Autonomous Driving Tasks in Heterogeneous Mobile Edge Computing | Reza Zakerian et.al. | 2411.15989 | null |
2024-12-23 | DRIVE: Dual-Robustness via Information Variability and Entropic Consistency in Source-Free Unsupervised Domain Adaptation | Ruiqiang Xiao et.al. | 2411.15976 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-12-20 | GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision | Baixin Xu et.al. | 2411.15723 | link |
2024-11-24 | Algorithmics and Complexity of Cost-Driven Task Offloading with Submodular Optimization in Edge-Cloud Environments | Longkun Guo et.al. | 2411.15687 | null |
2024-11-23 | Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data | Rui Huang et.al. | 2411.15657 | null |
2024-11-23 | EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting | Xiaobao Wei et.al. | 2411.15582 | null |
2024-11-23 | SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving | Su Sun et.al. | 2411.15482 | null |
2024-11-22 | UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations | Yuan Ren et.al. | 2411.15355 | null |
2024-11-22 | Adversarial Prompt Distillation for Vision-Language Models | Lin Luo et.al. | 2411.15244 | null |
2024-11-22 | DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving | Bencheng Liao et.al. | 2411.15139 | link |
2024-11-25 | Enhancing Autonomous Driving Safety through World Model-Based Predictive Navigation and Adaptive Learning Algorithms for 5G Wireless Applications | Hong Ding et.al. | 2411.15042 | null |
2024-11-22 | MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving | Hongsi Liu et.al. | 2411.15016 | null |
2024-11-22 | FTA generation using GenAI with an Autonomy sensor Usecase | Sneha Sudhir Shetiya et.al. | 2411.15007 | null |
2024-11-22 | LiDAR-based End-to-end Temporal Perception for Vehicle-Infrastructure Cooperation | Zhenwei Yang et.al. | 2411.14927 | null |
2024-11-22 | Benchmarking the Robustness of Optical Flow Estimation to Corruptions | Zhonghua Yi et.al. | 2411.14865 | link |
2024-11-22 | TopoSD: Topology-Enhanced Lane Segment Perception with SDMap Prior | Sen Yang et.al. | 2411.14751 | null |
2024-11-22 | VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving | Haiming Zhang et.al. | 2411.14716 | null |
2024-11-21 | A Systematic Study of Multi-Agent Deep Reinforcement Learning for Safe and Robust Autonomous Highway Ramp Entry | Larry Schester et.al. | 2411.14593 | null |
2024-11-21 | Open Challenges in the Formal Verification of Autonomous Driving | Paolo Burgio et.al. | 2411.14520 | null |
2024-11-21 | Understanding World or Predicting Future? A Comprehensive Survey of World Models | Jingtao Ding et.al. | 2411.14499 | null |
2024-11-21 | Model Checking for Reinforcement Learning in Autonomous Driving: One Can Do More Than You Think! | Rong Gu et.al. | 2411.14375 | null |
2024-11-21 | Formal Simulation and Visualisation of Hybrid Programs | Pedro Mendes et.al. | 2411.14365 | null |
2024-11-21 | Generalizing End-To-End Autonomous Driving In Real-World Environments Using Zero-Shot LLMs | Zeyu Dong et.al. | 2411.14256 | null |
2024-11-21 | FedRAV: Hierarchically Federated Region-Learning for Traffic Object Classification of Autonomous Vehicles | Yijun Zhai et.al. | 2411.13979 | link |
2024-11-21 | Trajectory Tracking Using Frenet Coordinates with Deep Deterministic Policy Gradient | Tongzhou Jiang et.al. | 2411.13885 | null |
2024-11-21 | MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control | Ruiyuan Gao et.al. | 2411.13807 | null |
2024-11-21 | A Survey on Adversarial Robustness of LiDAR-based Machine Learning Perception in Autonomous Vehicles | Junae Kim et.al. | 2411.13778 | null |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection | Tong Ning et.al. | 2411.13628 | null |
2024-11-20 | WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving | Siwei Chen et.al. | 2411.13340 | link |
2024-11-20 | A Resource Efficient Fusion Network for Object Detection in Bird’s-Eye View using Camera and Raw Radar Data | Kavin Chandrasekaran et.al. | 2411.13311 | link |
2024-11-20 | YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Thomas Pöllabauer et.al. | 2411.13149 | link |
2024-11-20 | Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning | Zhi Luo et.al. | 2411.13116 | null |
2024-11-26 | DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving | Xianda Guo et.al. | 2411.13112 | link |
2024-11-20 | Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving | Hao Zhou et.al. | 2411.13076 | null |
2024-11-20 | Study of Group III-V Waveguides on Sapphire Platform for Photonic Integrated Circuits | Manoj Kumar Shah et.al. | 2411.13035 | null |
2024-11-20 | GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting | Xiaobao Wei et.al. | 2411.12981 | null |
2024-11-25 | LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement | Siwen Jiao et.al. | 2411.12980 | null |
2024-11-20 | M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction | Luoxi Zhang et.al. | 2411.12635 | link |
2024-11-19 | GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Shaoqing Xu et.al. | 2411.12452 | link |
2024-11-19 | Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph | Ziyang Chen et.al. | 2411.12426 | link |
2024-11-19 | C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention | Xiaohe Li et.al. | 2411.12313 | null |
2024-11-19 | MTFusion: Reconstructing Any 3D Object from Single Image Using Multi-word Textual Inversion | Yu Liu et.al. | 2411.12197 | null |
2024-11-19 | Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation | Zhuangwei Zhuang et.al. | 2411.12177 | link |
2024-11-18 | Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation | Hanieh Shojaei Miandashti et.al. | 2411.11935 | null |
2024-11-18 | DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes | Chensheng Peng et.al. | 2411.11921 | link |
2024-11-17 | ModeSeq: Taming Sparse Multimodal Motion Prediction with Sequential Mode Modeling | Zikang Zhou et.al. | 2411.11911 | null |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-18 | SignEye: Traffic Sign Interpretation from Vehicle First-Person View | Chuang Yang et.al. | 2411.11507 | null |
2024-11-18 | MGNiceNet: Unified Monocular Geometric Scene Understanding | Markus Schön et.al. | 2411.11466 | null |
2024-11-18 | The ADUULM-360 Dataset – A Multi-Modal Dataset for Depth Estimation in Adverse Weather | Markus Schön et.al. | 2411.11455 | null |
2024-11-18 | DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Tianyi Yan et.al. | 2411.11252 | link |
2024-11-17 | Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and Propagation | Nayeon Kim et.al. | 2411.11002 | null |
2024-11-17 | V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception | Lei Yang et.al. | 2411.10962 | null |
2024-11-16 | Attention-based U-Net Method for Autonomous Lane Detection | Mohammadhamed Tangestanizadeh et.al. | 2411.10902 | null |
2024-11-16 | Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation | Jaisidh Singh et.al. | 2411.10845 | null |
2024-11-16 | MTA: Multimodal Task Alignment for BEV Perception and Captioning | Yunsheng Ma et.al. | 2411.10639 | null |
2024-11-15 | A Novel MLLM-based Approach for Autonomous Driving in Different Weather Conditions | Sonda Fourati et.al. | 2411.10603 | null |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | Advancing Autonomous Driving Perception: Analysis of Sensor Fusion and Computer Vision Techniques | Urvishkumar Bharti et.al. | 2411.10535 | null |
2024-11-15 | USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting | Kang Chen et.al. | 2411.10504 | link |
2024-11-15 | Prompt-Guided Environmentally Consistent Adversarial Patch | Chaoqun Li et.al. | 2411.10498 | null |
2024-11-15 | Guided Learning: Lubricating End-to-End Modeling for Multi-stage Decision-making | Jian Guo et.al. | 2411.10496 | null |
2024-11-15 | Moving Forward: A Review of Autonomous Driving Software and Hardware Systems | Xu Wang et.al. | 2411.10291 | null |
2024-11-15 | Imagine-2-Drive: High-Fidelity World Modeling in CARLA for Autonomous Vehicles | Anant Garg et.al. | 2411.10171 | null |
2024-11-15 | Better Safe Than Sorry: Enhancing Arbitration Graphs for Safe and Robust Autonomous Decision-Making | Piotr Spieker et.al. | 2411.10170 | link |
2024-11-15 | GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization | Yanhao Sun et.al. | 2411.10033 | null |
2024-11-15 | Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving | Shota Yamazaki et.al. | 2411.09971 | null |
2024-11-15 | Planning by Simulation: Motion Planning with Learning-based Parallel Scenario Prediction for Autonomous Driving | Tian Niu et.al. | 2411.09887 | null |
2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
2024-11-14 | CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants | Albert J. Zhai et.al. | 2411.09693 | null |
2024-11-14 | Modular Fault Diagnosis Framework for Complex Autonomous Driving Systems | Stefan Orf et.al. | 2411.09643 | null |
2024-11-13 | Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models | Chengdong Dong et.al. | 2411.08642 | null |
2024-11-13 | Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance | Anton Kuznietsov et.al. | 2411.08482 | null |
2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
2024-11-13 | 3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter | Xiaoxiang Wang et.al. | 2411.08433 | null |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-12 | Imitation Learning from Observations: An Autoregressive Mixture of Experts Approach | Renzi Wang et.al. | 2411.08232 | null |
2024-11-12 | TomoGRAF: A Robust and Generalizable Reconstruction Network for Single-View Computed Tomography | Di Xu et.al. | 2411.08158 | null |
2024-11-12 | Material Transforms from Disentangled NeRF Representations | Ivan Lopes et.al. | 2411.08037 | link |
2024-11-12 | ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction | Dubing Chen et.al. | 2411.07725 | link |
2024-11-27 | OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework | Jiaxi Li et.al. | 2411.07711 | link |
2024-11-16 | A Simple Multi-agent Joint Prediction Method for Autonomous Driving | Mingyi Wang et.al. | 2411.07612 | null |
2024-11-11 | SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation | Jiale Chen et.al. | 2411.06991 | null |
2024-11-11 | Large-scale moral machine experiment on large language models | Muhammad Shahrul Zaim bin Ahmad et.al. | 2411.06790 | link |
2024-11-11 | Model Partition and Resource Allocation for Split Learning in Vehicular Edge Networks | Lu Yu et.al. | 2411.06773 | null |
2024-11-11 | LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes | Zefan Qu et.al. | 2411.06757 | link |
2024-11-11 | DP and QP Based Decision-making and Planning for Autonomous Vehicle | Zhicheng Zhang et.al. | 2411.06751 | null |
2024-11-09 | Predictability Awareness for Efficient and Robust Multi-Agent Coordination | Roman Chiva Gil et.al. | 2411.06223 | null |
2024-11-19 | LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Weijie Ma et.al. | 2411.06173 | link |
2024-11-08 | Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent | Linfeng He et.al. | 2411.05898 | null |
2024-11-08 | MIPD: A Multi-sensory Interactive Perception Dataset for Embodied Intelligent Driving | Zhiwei Li et.al. | 2411.05881 | link |
2024-11-06 | Federated Data-Driven Kalman Filtering for State Estimation | Nikos Piperigkos et.al. | 2411.05847 | null |
2024-11-08 | Expectation vs. Reality: Towards Verification of Psychological Games | Marta Kwiatkowska et.al. | 2411.05599 | null |
2024-11-08 | Open-set object detection: towards unified problem formulation and benchmarking | Hejer Ammar et.al. | 2411.05564 | null |
2024-11-08 | From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS | Haoran Zhang et.al. | 2411.05362 | link |
2024-11-08 | Rate-aware Compression for NeRF-based Volumetric Video | Zhiyu Zhang et.al. | 2411.05322 | null |
2024-11-08 | ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving | Tao Ma et.al. | 2411.05311 | null |
2024-11-08 | SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection | Yun Zhao et.al. | 2411.05292 | null |
2024-11-07 | Few-Shot Task Learning through Inverse Generative Modeling | Aviv Netanyahu et.al. | 2411.04987 | null |
2024-11-07 | Planar Reflection-Aware Neural Radiance Fields | Chen Gao et.al. | 2411.04984 | null |
2024-11-07 | IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving | Clémence Grislain et.al. | 2411.04653 | link |
2024-11-07 | LidaRefer: Outdoor 3D Visual Grounding for Autonomous Driving with Transformers | Yeong-Seung Baek et.al. | 2411.04351 | null |
2024-11-06 | Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving | Depanshu Sani et.al. | 2411.03702 | null |
2024-11-06 | OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2411.03696 | null |
2024-11-06 | Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model | Yansong Qu et.al. | 2411.03672 | null |
2024-11-06 | Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Rui Peng et.al. | 2411.03637 | link |
2024-11-06 | Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions | Arunkumar Rathinam et.al. | 2411.03576 | null |
2024-11-07 | Knowledge Graphs of Driving Scenes to Empower the Emerging Capabilities of Neurosymbolic AI | Ruwan Wickramarachchi et.al. | 2411.03225 | null |
2024-11-05 | Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM challenge | Bin Huang et.al. | 2411.02999 | null |
2024-11-05 | CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval | Xin Wen et.al. | 2411.02979 | null |
2024-11-08 | Region-Guided Attack on the Segment Anything Model (SAM) | Xiaoliang Liu et.al. | 2411.02974 | null |
2024-11-05 | Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery | Liv Kåreborn et.al. | 2411.02972 | null |
2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
2024-11-05 | Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey | Ao Fu et.al. | 2411.02914 | null |
2024-11-05 | Safety Verification for Evasive Collision Avoidance in Autonomous Vehicles with Enhanced Resolutions | Aliasghar Arab et.al. | 2411.02706 | null |
2024-11-04 | NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields | Eric Zhu et.al. | 2411.02482 | null |
2024-11-05 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-11-04 | Learning Multiple Initial Solutions to Optimization Problems | Elad Sharony et.al. | 2411.02158 | link |
2024-11-04 | Traffic and Safety Rule Compliance of Humans in Diverse Driving Situations | Michael Kurenkov et.al. | 2411.01909 | null |
2024-12-04 | GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes | Gaochao Song et.al. | 2411.01853 | null |
2024-11-04 | A Probabilistic Formulation of LiDAR Mapping with Neural Radiance Fields | Matthew McDermott et.al. | 2411.01725 | link |
2024-11-08 | ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Salman Khan et.al. | 2411.01683 | link |
2024-11-03 | Polar R-CNN: End-to-End Lane Detection with Fewer Anchors | Shengqi Wang et.al. | 2411.01499 | link |
2024-11-03 | Interaction-Aware Trajectory Prediction for Safe Motion Planning in Autonomous Driving: A Transformer-Transfer Learning Approach | Jinhao Liang et.al. | 2411.01475 | null |
2024-11-28 | On the Black-box Explainability of Object Detection Models for Safe and Trustworthy Industrial Applications | Alain Andres et.al. | 2411.00818 | link |
2024-11-01 | HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices | Xiang Li et.al. | 2411.00608 | null |
2024-11-01 | On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR | Li Li et.al. | 2411.00600 | link |
2024-11-26 | MAROON: A Framework for the Joint Characterization of Near-Field High-Resolution Radar and Optical Depth Imaging Techniques | Vanessa Wirth et.al. | 2411.00527 | null |
2024-11-01 | PlanScope: Learning to Plan Within Decision Scope Does Matter | Ren Xin et.al. | 2411.00476 | link |
2024-11-01 | PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling | Donghyun Kim et.al. | 2411.00432 | null |
2024-10-31 | Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving | Ce Zhou et.al. | 2411.00192 | null |
2024-10-31 | AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization | Amir Kazemi et.al. | 2410.24116 | null |
2024-10-31 | Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map | Xinyuan Chang et.al. | 2410.23780 | null |
2024-10-15 | Trajectory Prediction for Autonomous Driving using Agent-Interaction Graph Embedding | Jilan Samiuddin et.al. | 2410.23298 | null |
2024-10-30 | OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction | Hongbo Zhao et.al. | 2410.23278 | null |
2024-11-04 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et.al. | 2410.23262 | null |
2024-10-30 | ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting | Muhammad Salman Ali et.al. | 2410.23213 | null |
2024-10-31 | Enhancing Autonomous Driving Safety Analysis with Generative AI: A Comparative Study on Automated Hazard and Risk Assessment | Alireza Abbaspour et.al. | 2410.23207 | null |
2024-11-04 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak et.al. | 2410.23085 | null |
2024-10-30 | YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems | Mujadded Al Rabbani Alif et.al. | 2410.22898 | null |
2024-10-30 | A Graph-Based Model for Vehicle-Centric Data Sharing Ecosystem | Haiyue Yuan et.al. | 2410.22897 | null |
2024-10-30 | Self-Driving Car Racing: Application of Deep Reinforcement Learning | Florentiana Yuwono et.al. | 2410.22766 | null |
2024-10-30 | SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving | Minh Tri Huynh et.al. | 2410.22752 | null |
2024-10-30 | Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets | Andoni Cortés et.al. | 2410.22748 | null |
2024-10-29 | Pre-Trained Vision Models as Perception Backbones for Safety Filters in Autonomous Driving | Yuxuan Yang et.al. | 2410.22585 | null |
2024-10-29 | An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion | Minghao Ning et.al. | 2410.22314 | link |
2024-10-29 | Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving | Bo Jiang et.al. | 2410.22313 | link |
2024-10-29 | EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments | Linus Nwankwo et.al. | 2410.22200 | null |
2024-12-12 | Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models | Imad Ali Shah et.al. | 2410.22101 | link |
2024-10-29 | Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms | Feifei Zhao et.al. | 2410.21882 | null |
2024-11-07 | SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset | Yubin Hu et.al. | 2410.21739 | null |
2024-10-28 | Efficient Mixture-of-Expert for Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving | Jiyao Wang et.al. | 2410.21086 | null |
2024-11-16 | EEG-Driven 3D Object Reconstruction with Style Consistency and Diffusion Prior | Xin Xiang et.al. | 2410.20981 | null |
2024-10-28 | BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment | Mehdi Hosseinzadeh et.al. | 2410.20969 | null |
2024-10-28 | Active Legibility in Multiagent Reinforcement Learning | Yanyu Liu et.al. | 2410.20954 | null |
2024-10-28 | SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity | Kunyun Wang et.al. | 2410.20790 | null |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | link |
2024-10-27 | Neural rendering enables dynamic tomography | Ivan Grega et.al. | 2410.20558 | null |
2024-10-27 | GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields | Yusuke Sekikawa et.al. | 2410.20306 | null |
2024-10-26 | Neural Fields in Robotics: A Survey | Muhammad Zubair Irshad et.al. | 2410.20220 | link |
2024-10-19 | GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration | Silong Yong et.al. | 2410.19831 | null |
2024-11-04 | Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving | Liu Yunhao et.al. | 2410.19639 | null |
2024-10-25 | Multi-modal Motion Prediction using Temporal Ensembling with Learning-based Aggregation | Kai-Yin Hong et.al. | 2410.19606 | null |
2024-10-25 | Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization | Weihang Liu et.al. | 2410.19483 | link |
2024-10-25 | Evaluation of strategies for efficient rate-distortion NeRF streaming | Pedro Martin et.al. | 2410.19459 | null |
2024-10-24 | Learning Transparent Reward Models via Unsupervised Feature Selection | Daulet Baimukashev et.al. | 2410.18608 | null |
2024-10-24 | Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning | Lachlan Mares et.al. | 2410.18462 | null |
2024-10-24 | Real-time 3D-aware Portrait Video Relighting | Ziqi Cai et.al. | 2410.18355 | link |
2024-11-19 | CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator | Stefanos Pasios et.al. | 2410.18238 | link |
2024-10-22 | Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies | Shrey Vishen et.al. | 2410.18137 | link |
2024-10-23 | WorldSimBench: Towards Video Generation Models as World Simulators | Yiran Qin et.al. | 2410.18072 | null |
2024-10-23 | VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points | Linus Franke et.al. | 2410.17932 | null |
2024-10-23 | Few-shot NeRF by Adaptive Rendering Loss Regularization | Qingshan Xu et.al. | 2410.17839 | null |
2024-10-23 | Pointer: An Energy-Efficient ReRAM-based Point Cloud Recognition Accelerator with Inter-layer and Intra-layer Optimizations | Qijun Zhang et.al. | 2410.17782 | null |
2024-10-23 | Efficient Neural Implicit Representation for 3D Human Reconstruction | Zexu Huang et.al. | 2410.17741 | link |
2024-10-23 | YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions | Xiguang Li et.al. | 2410.17734 | null |
2024-10-23 | Real-time Vehicle-to-Vehicle Communication Based Network Cooperative Control System through Distributed Database and Multimodal Perception: Demonstrated in Crossroads | Xinwen Zhu et.al. | 2410.17576 | link |
2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
2024-10-22 | Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning | Dongsu Lee et.al. | 2410.17373 | null |
2024-10-07 | Audio-Driven Emotional 3D Talking-Head Generation | Wenqing Wang et.al. | 2410.17262 | null |
2024-10-22 | YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion | Junzhou Chen et.al. | 2410.17144 | null |
2024-10-18 | GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting | Yusen Xie et.al. | 2410.17084 | null |
2024-10-22 | E-3DGS: Gaussian Splatting with Exposure and Motion Events | Xiaoting Yin et.al. | 2410.16995 | link |
2024-10-22 | Pedestrian motion prediction evaluation for urban autonomous driving | Dmytro Zabolotnii et.al. | 2410.16864 | link |
2024-10-22 | SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition | Jiaqi Chen et.al. | 2410.16746 | link |
2024-10-21 | Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions | Malte Prinzler et.al. | 2410.16395 | null |
2024-10-21 | FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors | Chin-Yang Lin et.al. | 2410.16271 | null |
2024-11-07 | Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance | Zhangwei Gao et.al. | 2410.16261 | link |
2024-10-21 | Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving | Alexander Krentsel et.al. | 2410.16227 | null |
2024-10-24 | LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation | Hao Gao et.al. | 2410.16197 | link |
2024-10-21 | Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles | Zhengming Wang et.al. | 2410.15912 | link |
2024-10-21 | How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making? | Zuojin Tang et.al. | 2410.15885 | null |
2024-10-27 | WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction | Heng Zhai et.al. | 2410.15792 | null |
2024-10-29 | Generalizing Motion Planners with Mixture of Experts for Autonomous Driving | Qiao Sun et.al. | 2410.15774 | link |
2024-10-23 | SPARC: Prediction-Based Safe Control for Coupled Controllable and Uncontrollable Agents with Conformal Predictions | Shuqi Wang et.al. | 2410.15660 | null |
2024-10-20 | XAI-based Feature Ensemble for Enhanced Anomaly Detection in Autonomous Driving Systems | Sazid Nazat et.al. | 2410.15405 | link |
2024-10-20 | Explainability of Point Cloud Neural Networks Using SMILE: Statistical Model-Agnostic Interpretability with Local Explanations | Seyed Mohammad Ahmadi et.al. | 2410.15374 | link |
2024-10-20 | A Novel Characterization of the Population Area Under the Risk Coverage Curve (AURC) and Rates of Finite Sample Estimators | Han Zhou et.al. | 2410.15361 | null |
2024-10-20 | Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Simulation, and Real-Vehicle Experiment | Can Cui et.al. | 2410.15281 | null |
2024-10-19 | 3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving | Linh Van Ma et.al. | 2410.14977 | link |
2024-10-19 | Neural Radiance Field Image Refinement through End-to-End Sampling Point Optimization | Kazuhiro Ohta et.al. | 2410.14958 | null |
2024-10-19 | Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding | Yi Liu et.al. | 2410.14944 | link |
2024-10-18 | A Hybrid Defense Strategy for Boosting Adversarial Robustness in Vision-Language Models | Yuhan Liang et.al. | 2410.14911 | null |
2024-10-18 | MultiOrg: A Multi-rater Organoid-detection Dataset | Christina Bukas et.al. | 2410.14612 | null |
2024-11-04 | Knowledge Transfer from Simple to Complex: A Safe and Efficient Reinforcement Learning Framework for Autonomous Driving Decision-Making | Rongliang Zhou et.al. | 2410.14468 | null |
2024-10-18 | Learning autonomous driving from aerial imagery | Varun Murali et.al. | 2410.14177 | null |
2024-10-18 | DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction | Ange Lou et.al. | 2410.14169 | null |
2024-10-17 | UniDrive: Towards Universal Driving Perception Across Camera Configurations | Ye Li et.al. | 2410.13864 | link |
2024-10-17 | Optimizing Probabilistic Conformal Prediction with Vectorized Non-Conformity Scores | Minxing Zheng et.al. | 2410.13735 | null |
2024-11-25 | DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | Guosheng Zhao et.al. | 2410.13571 | null |
2024-10-17 | Object Pose Estimation Using Implicit Representation For Transparent Objects | Varun Burde et.al. | 2410.13465 | null |
2024-10-17 | Accurate Checkerboard Corner Detection under Defoucs | Zezhun Shi et.al. | 2410.13371 | link |
2024-10-16 | MambaBEV: An efficient 3D detection model with Mamba2 | Zihan You et.al. | 2410.12673 | null |
2024-10-20 | Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving | Sihao Wu et.al. | 2410.12568 | null |
2024-10-16 | Real-time Stereo-based 3D Object Detection for Streaming Perception | Changcai Li et.al. | 2410.12394 | link |
2024-10-16 | Consistency Calibration: Improving Uncertainty Calibration via Consistency among Perturbed Neighbors | Linwei Tao et.al. | 2410.12295 | null |
2024-10-16 | 3D Gaussian Splatting in Robotics: A Survey | Siting Zhu et.al. | 2410.12262 | link |
2024-10-16 | EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View | Zhaorong Wang et.al. | 2410.12242 | null |
2024-10-16 | Sparse Prototype Network for Explainable Pedestrian Behavior Prediction | Yan Feng et.al. | 2410.12195 | link |
2024-10-16 | RTI-NMPC for Control of Autonomous Vehicles Using Implicit Discretization Methods | Matheus Wagner et.al. | 2410.12170 | null |
2024-10-18 | Augmented Intelligence in Smart Intersections: Local Digital Twins-Assisted Hybrid Autonomous Driving | Kui Wang et.al. | 2410.12163 | null |
2024-10-15 | System-Level Analysis of Module Uncertainty Quantification in the Autonomy Pipeline | Sampada Deglurkar et.al. | 2410.12019 | null |
2024-10-15 | An Online Self-learning Graph-based Lateral Controller for Self-Driving Cars | Jilan Samiuddin et.al. | 2410.11979 | null |
2024-10-14 | Study on the Helpfulness of Explainable Artificial Intelligence | Tobias Labarta et.al. | 2410.11896 | link |
2024-10-15 | A Data-Driven Aggressive Autonomous Racing Framework Utilizing Local Trajectory Planning with Velocity Prediction | Zhouheng Li et.al. | 2410.11570 | link |
2024-10-15 | TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Zhiwei Lin et.al. | 2410.11228 | link |
2024-10-14 | Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting | Raja Kumar et.al. | 2410.11080 | link |
2024-10-14 | 6G RIS-aided Single-LEO Localization with Slow and Fast Doppler Effects | Sharief Saleh et.al. | 2410.11010 | null |
2024-10-14 | Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes | Tim Broedermann et.al. | 2410.10791 | link |
2024-10-14 | 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications | Eduardo R. Corral-Soto et.al. | 2410.10782 | null |
2024-10-14 | Towards Calibrated Losses for Adversarial Robust Reject Option Classification | Vrund Shah et.al. | 2410.10736 | link |
2024-10-14 | Navigation under uncertainty: Trajectory prediction and occlusion reasoning with switching dynamical systems | Ran Wei et.al. | 2410.10653 | null |
2024-10-14 | Words to Wheels: Vision-Based Autonomous Driving Understanding Human Language Instructions Using Foundation Models | Chanhoe Ryu et.al. | 2410.10577 | null |
2024-10-14 | Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Daniel Fusaro et.al. | 2410.10510 | link |
2024-10-14 | In-Materia Speech Recognition | Mohamadreza Zolfagharinejad et.al. | 2410.10434 | null |
2024-10-14 | DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model | Songen Gu et.al. | 2410.10429 | null |
2024-10-14 | ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object | Jiwei Chen et.al. | 2410.10298 | null |
2024-10-14 | Exploring Semi-Supervised Learning for Online Mapping | Adam Lilja et.al. | 2410.10279 | null |
2024-10-14 | NeRF-enabled Analysis-Through-Synthesis for ISAR Imaging of Small Everyday Objects with Sparse and Noisy UWB Radar Data | Md Farhan Tasnim Oshim et.al. | 2410.10085 | null |
2024-10-13 | LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Md Tanvir Islam et.al. | 2410.09831 | link |
2024-10-13 | Magnituder Layers for Implicit Neural Representations in 3D | Sang Min Kim et.al. | 2410.09771 | null |
2024-11-21 | t-READi: Transformer-Powered Robust and Efficient Multimodal Inference for Autonomous Driving | Pengfei Hu et.al. | 2410.09747 | null |
2024-10-15 | LoRD: Adapting Differentiable Driving Policies to Distribution Shifts | Christopher Diehl et.al. | 2410.09681 | link |
2024-10-12 | RailYolact – A Yolact Focused on edge for Real-Time Rail Segmentation | Qihao Qian et.al. | 2410.09612 | null |
2024-10-12 | Improving 3D Finger Traits Recognition via Generalizable Neural Rendering | Hongbin Xu et.al. | 2410.09582 | null |
2024-11-22 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | Hybrid LLM-DDQN based Joint Optimization of V2I Communication and Autonomous Driving | Zijiang Yan et.al. | 2410.08854 | null |
2024-10-11 | VideoSAM: Open-World Video Segmentation | Pinxue Guo et.al. | 2410.08781 | null |
2024-10-11 | Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints | Yicheng He et.al. | 2410.08780 | null |
2024-10-11 | MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation | Qihang Yang et.al. | 2410.08739 | null |
2024-10-11 | Impact of Surface Reflections in Maritime Obstacle Detection | Samed Yalçın et.al. | 2410.08713 | link |
2024-10-11 | AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion | Yuting Xie et.al. | 2410.08453 | null |
2024-10-10 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Samir Abou Haidar et.al. | 2410.08365 | null |
2024-10-10 | AdaShadow: Responsive Test-time Model Adaptation in Non-stationary Mobile Environments | Cheng Fang et.al. | 2410.08256 | null |
2024-10-10 | RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image | Xiaoxue Chen et.al. | 2410.08181 | null |
2024-10-10 | Generalizable and Animatable Gaussian Head Avatar | Xuangeng Chu et.al. | 2410.07971 | link |
2024-10-10 | Autonomous Vehicles Path Planning under Temporal Logic Specifications | Akshay Dhonthi et.al. | 2410.07845 | null |
2024-10-21 | HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Pei Liu et.al. | 2410.07758 | null |
2024-11-01 | Autonomous Driving in Unstructured Environments: How Far Have We Come? | Chen Min et.al. | 2410.07701 | link |
2024-10-10 | 3D Vision-Language Gaussian Splatting | Qucheng Peng et.al. | 2410.07577 | null |
2024-10-09 | Pockels Laser Directly Driving Ultrafast Optical Metrology | Shixin Xue et.al. | 2410.07482 | null |
2024-10-09 | Progressive Multi-Modal Fusion for Robust 3D Object Detection | Rohit Mohan et.al. | 2410.07475 | null |
2024-10-11 | NeRF-Accelerated Ecological Monitoring in Mixed-Evergreen Redwood Forest | Adam Korycki et.al. | 2410.07418 | link |
2024-10-09 | Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions | Isaac Remy et.al. | 2410.07409 | null |
2024-10-09 | Learning Content-Aware Multi-Modal Joint Input Pruning via Bird’s-Eye-View Representation | Yuxin Li et.al. | 2410.07268 | null |
2024-09-23 | Curb Your Attention: Causal Attention Gating for Robust Trajectory Prediction in Autonomous Driving | Ehsan Ahmadi et.al. | 2410.07191 | null |
2024-09-22 | Margin-bounded Confidence Scores for Out-of-Distribution Detection | Lakpa D. Tamang et.al. | 2410.07185 | link |
2024-10-09 | DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation | Zhiqi Li et.al. | 2410.06756 | null |
2024-10-15 | MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Zhenhui Ye et.al. | 2410.06734 | null |
2024-10-09 | QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird’s-Eye-View Representation | Yuxin Li et.al. | 2410.06516 | null |
2024-10-09 | Overcoming Autoware-Ubuntu Incompatibility in Autonomous Driving Systems-Equipped Vehicles: Lessons Learned | Dada Zhang et.al. | 2410.06492 | null |
2024-10-09 | 3D Representation Methods: A Survey | Zhengren Wang et.al. | 2410.06475 | null |
2024-10-08 | BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis | Christopher Klammer et.al. | 2410.06410 | link |
2024-10-08 | Work-in-Progress: Traded Control Transfer for Managing Real-Time Sensor Uncertainties in Autonomous Vehicle | Md Sakib Galib Sourav et.al. | 2410.06345 | null |
2024-10-08 | A New Architecture for Neural Enhanced Multiobject Tracking | Shaoxiu Wei et.al. | 2410.06294 | null |
2024-10-08 | Gaussian-Based and Outside-the-Box Runtime Monitoring Join Forces | Vahid Hashemi et.al. | 2410.06051 | null |
2024-10-08 | Motion Forecasting in Continuous Driving | Nan Song et.al. | 2410.06007 | link |
2024-10-08 | DeMo: Decoupling Motion Forecasting into Directional Intentions and Dynamic States | Bozhou Zhang et.al. | 2410.05982 | link |
2024-10-08 | Distributed Coordination for Multi-Vehicle Systems in the Presence of Misbehaving Vehicles | Dongkun Han et.al. | 2410.05793 | null |
2024-10-08 | Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards | Zhaohui Jiang et.al. | 2410.05782 | null |
2024-10-08 | Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters | Guoji Tian et.al. | 2410.05772 | null |
2024-10-08 | Gen-Drive: Enhancing Diffusion Generative Driving Policies with Reward Modeling and Reinforcement Learning Fine-tuning | Zhiyu Huang et.al. | 2410.05582 | null |
2024-10-07 | Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Ziwei Liao et.al. | 2410.05514 | link |
2024-10-11 | PH-Dropout: Practical Epistemic Uncertainty Quantification for View Synthesis | Chuanhao Sun et.al. | 2410.05468 | link |
2024-10-07 | Salient Store: Enabling Smart Storage for Continuous Learning Edge Servers | Cyan Subhra Mishra et.al. | 2410.05435 | null |
2024-10-07 | STOP! Camera Spoofing via the in-Vehicle IP Network | Dror Peri et.al. | 2410.05417 | null |
2024-10-07 | LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting | Qifeng Chen et.al. | 2410.05111 | null |
2024-10-07 | HE-Drive: Human-Like End-to-End Driving with Vision Language Models | Junming Wang et.al. | 2410.05051 | null |
2024-10-10 | 6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering | Zhongpai Gao et.al. | 2410.04974 | null |
2024-10-07 | PRFusion: Toward Effective and Robust Multi-Modal Place Recognition with Image and Point Cloud Fusion | Sijie Wang et.al. | 2410.04939 | link |
2024-10-07 | TeX-NeRF: Neural Radiance Fields from Pseudo-TeX Vision | Chonghao Zhong et.al. | 2410.04873 | null |
2024-10-07 | Data-driven Diffusion Models for Enhancing Safety in Autonomous Vehicle Traffic Simulations | Jinxiong Lu et.al. | 2410.04809 | null |
2024-10-07 | WTCL-Dehaze: Rethinking Real-world Image Dehazing via Wavelet Transform and Contrastive Learning | Divine Joseph Appiah et.al. | 2410.04762 | null |
2024-10-15 | Diffusion Models in 3D Vision: A Survey | Zhen Wang et.al. | 2410.04738 | null |
2024-10-06 | In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding | Shenghao Li et.al. | 2410.04529 | null |
2024-10-06 | Deformable NeRF using Recursively Subdivided Tetrahedra | Zherui Qiu et.al. | 2410.04402 | null |
2024-10-19 | StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting | Xiao Cui et.al. | 2410.04354 | null |
2024-10-10 | Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy | Pengcheng Chen et.al. | 2410.04041 | null |
2024-10-13 | Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models | Gang Li et.al. | 2410.03955 | link |
2024-11-01 | STONE: A Submodular Optimization Framework for Active 3D Object Detection | Ruiyu Mao et.al. | 2410.03918 | link |
2024-10-04 | A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development | Jesper Knapp et.al. | 2410.03580 | null |
2024-10-04 | Make Interval Bound Propagation great again | Patryk Krukowski et.al. | 2410.03373 | link |
2024-10-04 | MetaOOD: Automatic Selection of OOD Detection Models | Yuehan Qin et.al. | 2410.03074 | null |
2024-10-03 | Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents | Hanrong Zhang et.al. | 2410.02644 | link |
2024-10-03 | Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking | Fabian Herzog et.al. | 2410.02638 | link |
2024-10-03 | Behavior Trees in Functional Safety Supervisors for Autonomous Vehicles | Carlos Conejo et.al. | 2410.02469 | link |
2024-10-03 | End-to-end Driving in High-Interaction Traffic Scenarios with Reinforcement Learning | Yueyuan Li et.al. | 2410.02253 | null |
2024-10-03 | Remember and Recall: Associative-Memory-based Trajectory Prediction | Hang Guo et.al. | 2410.02201 | null |
2024-10-03 | Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation | Shreyas Chaudhari et.al. | 2410.02172 | link |
2024-10-02 | MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis | Xiaobiao Du et.al. | 2410.02103 | link |
2024-10-28 | Neural Eulerian Scene Flow Fields | Kyle Vedder et.al. | 2410.02031 | null |
2024-10-02 | Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking | Ayesha Ishaq et.al. | 2410.01678 | link |
2024-10-02 | 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Yang Cao et.al. | 2410.01647 | link |
2024-10-07 | Entropy-Based Uncertainty Modeling for Trajectory Prediction in Autonomous Driving | Aron Distelzweig et.al. | 2410.01628 | null |
2024-10-06 | GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians | Shuyi Jiang et.al. | 2410.01535 | null |
2024-10-07 | Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions | Ashutosh Kumar et.al. | 2410.01225 | link |
2024-10-02 | Absolute State-wise Constrained Policy Optimization: High-Probability State-wise Constraints Satisfaction | Weiye Zhao et.al. | 2410.01212 | null |
2024-10-02 | AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction | Jingnan Gao et.al. | 2410.01202 | null |
2024-10-02 | Generative Diffusion-based Contract Design for Efficient AI Twins Migration in Vehicular Embodied AI Networks | Yue Zhong et.al. | 2410.01176 | null |
2024-10-01 | High-directivity multi-level beam switching with single-gate tunable metasurfaces based on graphene | Juho Park et.al. | 2410.00806 | null |
2024-10-01 | E-MPC: Edge-assisted Model Predictive Control | Yuan-Yao Lou et.al. | 2410.00695 | null |
2024-10-01 | GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer | Youngho Yoon et.al. | 2410.00672 | link |
2024-10-01 | Cafca: High-quality Novel View Synthesis of Expressive Faces from Casual Few-shot Captures | Marcel C. Bühler et.al. | 2410.00630 | null |
2024-10-01 | Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance | Hongchao Shu et.al. | 2410.00386 | null |
2024-10-01 | SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs | Leheng Li et.al. | 2410.00337 | null |
2024-10-01 | GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving | Zhangshuo Qi et.al. | 2410.00299 | link |
2024-09-30 | Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models | Yizhou Huang et.al. | 2409.20364 | null |
2024-09-30 | Distributed NeRF Learning for Collaborative Multi-Robot Perception | Hongrui Zhao et.al. | 2409.20289 | null |
2024-10-01 | OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity | Junming Wang et.al. | 2409.19987 | null |
2024-09-30 | DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Zhen Yang et.al. | 2409.19972 | link |
2024-10-24 | RNG: Relightable Neural Gaussians | Jiahui Fan et.al. | 2409.19702 | null |
2024-09-29 | Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving | Wei-Bin Kou et.al. | 2409.19560 | null |
2024-09-28 | Spatial Reasoning and Planning for Deep Embodied Agents | Shu Ishida et.al. | 2409.19479 | null |
2024-09-26 | MemFusionMap: Working Memory Fusion for Online Vectorized HD Map Construction | Jingyu Song et.al. | 2409.18737 | null |
2024-09-27 | Analysis of Truncated Singular Value Decomposition for Koopman Operator-Based Lane Change Model | Chinnawut Nantabut et.al. | 2409.18586 | null |
2024-09-27 | BoT-Drive: Hierarchical Behavior and Trajectory Planning for Autonomous Driving using POMDPs | Xuanjin Jin et.al. | 2409.18411 | null |
2024-09-27 | Multimodal Trajectory Prediction for Autonomous Driving on Unstructured Roads using Deep Convolutional Network | Lei Li et.al. | 2409.18399 | null |
2024-09-26 | Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving | Zhenghao Peng et.al. | 2409.18343 | null |
2024-09-26 | Does End-to-End Autonomous Driving Really Need Perception Tasks? | Peidong Li et.al. | 2409.18341 | link |
2024-09-30 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-11-07 | LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Huan Wang et.al. | 2409.18057 | link |
2024-11-03 | DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Dingrui Wang et.al. | 2409.18053 | link |
2024-09-26 | Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving | Haochen Liu et.al. | 2409.18031 | link |
2024-09-26 | ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning | Song Wang et.al. | 2409.18026 | null |
2024-09-26 | Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions | Weng Fei Low et.al. | 2409.17988 | null |
2024-09-26 | Adaptive Stream Processing on Edge Devices through Active Inference | Boris Sedlak et.al. | 2409.17937 | null |
2024-09-26 | PhantomLiDAR: Cross-modality Signal Injection Attacks against LiDAR | Zizhi Jin et.al. | 2409.17907 | null |
2024-09-27 | A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts | Aurel Pjetri et.al. | 2409.17851 | null |
2024-09-26 | CASPFormer: Trajectory Prediction from BEV Images with Deformable Attention | Harsh Yadav et.al. | 2409.17790 | null |
2024-09-26 | Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry | Qi Zhang et.al. | 2409.17729 | null |
2024-09-26 | AlterMOMA: Fusion Redundancy Pruning for Camera-LiDAR Fusion Models with Alternative Modality Masking | Shiqi Sun et.al. | 2409.17728 | null |
2024-09-26 | Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning | Siyi Lu et.al. | 2409.17659 | null |
2024-09-27 | Learning Occlusion-aware Decision-making from Agent Interaction via Active Perception | Jie Jia et.al. | 2409.17618 | null |
2024-09-26 | Joint Source-Channel Coding: Fundamentals and Recent Progress in Practical Designs | Deniz Gündüz et.al. | 2409.17557 | null |
2024-09-25 | Transient Adversarial 3D Projection Attacks on Object Detection in Autonomous Driving | Ce Zhou et.al. | 2409.17403 | null |
2024-09-25 | Optical Lens Attack on Deep Learning Based Monocular Depth Estimation | Ce Zhou et.al. | 2409.17376 | null |
2024-09-25 | Energy-Efficient & Real-Time Computer Vision with Intelligent Skipping via Reconfigurable CMOS Image Sensors | Md Abdullah-Al Kaiser et.al. | 2409.17341 | null |
2024-09-25 | VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection | Liangyu Zhong et.al. | 2409.17330 | null |
2024-09-25 | Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Handy Appetizer | Benji Peng et.al. | 2409.17120 | null |
2024-09-25 | Let’s Make a Splan: Risk-Aware Trajectory Optimization in a Normalized Gaussian Splat | Jonathan Michaux et.al. | 2409.16915 | null |
2024-09-25 | Skyeyes: Ground Roaming using Aerial View Images | Zhiyuan Gao et.al. | 2409.16685 | null |
2024-09-25 | TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans | Aggelina Chatziagapi et.al. | 2409.16666 | null |
2024-09-26 | Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models | Alexander Popov et.al. | 2409.16663 | null |
2024-09-25 | Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed | Alexander Prutsch et.al. | 2409.16154 | link |
2024-10-14 | MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving | Xiyang Wang et.al. | 2409.16149 | link |
2024-09-24 | FSF-Net: Enhance 4D Occupancy Forecasting with Coarse BEV Scene Flow for Autonomous Driving | Erxin Guo et.al. | 2409.15841 | null |
2024-09-24 | Intention-based and Risk-Aware Trajectory Prediction for Autonomous Driving in Complex Traffic Scenarios | Wen Wei et.al. | 2409.15821 | null |
2024-09-27 | Diffusion Models for Intelligent Transportation Systems: A Survey | Mingxing Peng et.al. | 2409.15816 | null |
2024-09-24 | A Computer Vision Approach for Autonomous Cars to Drive Safe at Construction Zone | Abu Shad Ahammed et.al. | 2409.15809 | null |
2024-09-24 | Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving | Lingyu Xiao et.al. | 2409.15730 | link |
2024-09-24 | Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB | Jae Yong Lee et.al. | 2409.15689 | null |
2024-09-23 | Analyzing Privacy Implications of Data Collection in Android Automotive OS | Bulut Gözübüyük et.al. | 2409.15561 | null |
2024-09-23 | AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions | Samarth Chopra et.al. | 2409.15487 | null |
2024-09-23 | VLMine: Long-Tail Data Mining with Vision Language Models | Mao Ye et.al. | 2409.15486 | null |
2024-09-07 | Causality-Driven Reinforcement Learning for Joint Communication and Sensing | Anik Roy et.al. | 2409.15329 | null |
2024-09-23 | Enhancing Pedestrian Trajectory Prediction with Crowd Trip Information | Rei Tamaru et.al. | 2409.15224 | link |
2024-09-25 | Goal-based Neural Physics Vehicle Trajectory Prediction Model | Rui Gan et.al. | 2409.15182 | null |
2024-10-14 | SpikeGS: Learning 3D Gaussian Fields from Continuous Spike Stream | Jinze Yu et.al. | 2409.15176 | link |
2024-09-23 | Controllable Traffic Simulation through LLM-Guided Hierarchical Chain-of-Thought Reasoning | Zhiyuan Liu et.al. | 2409.15135 | null |
2024-09-23 | FusionRF: High-Fidelity Satellite Neural Radiance Fields from Multispectral and Panchromatic Acquisitions | Michael Sprintson et.al. | 2409.15132 | null |
2024-09-23 | SPformer: A Transformer Based DRL Decision Making Method for Connected Automated Vehicles | Ye Han et.al. | 2409.15105 | null |
2024-09-23 | Online Adaptation of Learned Vehicle Dynamics Model with Meta-Learning Approach | Yuki Tsuchiya et.al. | 2409.14950 | null |
2024-09-23 | An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding | Wei-Bin Kou et.al. | 2409.14737 | null |
2024-09-23 | A Generalized Control Revision Method for Autonomous Driving Safety | Zehang Zhu et.al. | 2409.14688 | null |
2024-09-23 | S2O: An Integrated Driving Decision-making Performance Evaluation Method Bridging Subjective Feeling to Objective Evaluation | Yuning Wang et.al. | 2409.14680 | null |
2024-09-24 | First Field Trial of LLM-Powered AI Agent for Lifecycle Management of Autonomous Driving Optical Networks | Xiaomin Liu et.al. | 2409.14605 | null |
2024-09-22 | Enhancing LLM-based Autonomous Driving Agents to Mitigate Perception Attacks | Ruoyu Song et.al. | 2409.14488 | null |
2024-09-22 | MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Wangze Xu et.al. | 2409.14316 | null |
2024-09-21 | LFP: Efficient and Accurate End-to-End Lane-Level Planning via Camera-LiDAR Fusion | Guoliang You et.al. | 2409.14170 | null |
2024-09-24 | Will Large Language Models be a Panacea to Autonomous Driving? | Yuxuan Zhu et.al. | 2409.14165 | null |
2024-09-21 | Integrated Decision Making and Trajectory Planning for Autonomous Driving Under Multimodal Uncertainties: A Bayesian Game Approach | Zhenmin Huang et.al. | 2409.13993 | null |
2024-09-20 | OneBEV: Using One Panoramic Image for Bird’s-Eye-View Semantic Mapping | Jiale Wei et.al. | 2409.13912 | link |
2024-09-20 | Efficient Domain Augmentation for Autonomous Driving Testing Using Diffusion Models | Luciano Baresi et.al. | 2409.13661 | null |
2024-09-20 | Autonomous Driving at Unsignalized Intersections: A Review of Decision-Making Challenges and Reinforcement Learning-Based Solutions | Mohammad Al-Sharman et.al. | 2409.13144 | null |
2024-09-22 | Exploiting Minority Pseudo-Labels for Semi-Supervised Semantic Segmentation in Autonomous Driving | Yuting Hong et.al. | 2409.12680 | null |
2024-09-19 | METDrive: Multi-modal End-to-end Autonomous Driving with Temporal Guidance | Ziang Guo et.al. | 2409.12667 | null |
2024-09-23 | Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving | Sándor Kunsági-Máté et.al. | 2409.12620 | link |
2024-09-19 | LLMs Can Check Their Own Results to Mitigate Hallucinations in Traffic Understanding Tasks | Malsha Ashani Mahawatta Dona et.al. | 2409.12580 | null |
2024-09-19 | LMT-Net: Lane Model Transformer Network for Automated HD Mapping from Sparse Vehicle Observations | Michael Mink et.al. | 2409.12409 | null |
2024-09-18 | The Finer Points: A Systematic Comparison of Point-Cloud Extractors for Radar Odometry | Elliot Preston-Krebs et.al. | 2409.12256 | null |
2024-10-14 | ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Han Ling et.al. | 2409.12202 | link |
2024-09-25 | BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling | Lulin Zhang et.al. | 2409.12014 | link |
2024-09-18 | Intraoperative Registration by Cross-Modal Inverse Neural Rendering | Maximilian Fehrentz et.al. | 2409.11983 | null |
2024-09-18 | Unveiling the Black Box: Independent Functional Module Evaluation for Bird’s-Eye-View Perception Model | Ludan Zhang et.al. | 2409.11969 | null |
2024-09-18 | Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic | Zhe Yu et.al. | 2409.11780 | null |
2024-09-18 | RopeBEV: A Multi-Camera Roadside Perception Network in Bird’s-Eye-View | Jinrang Jia et.al. | 2409.11706 | null |
2024-09-18 | From Words to Wheels: Automated Style-Customized Policy Generation for Autonomous Driving | Xu Han et.al. | 2409.11694 | null |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-17 | TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem | M. Esat Kalfaoglu et.al. | 2409.11325 | null |
2024-09-18 | High-Order Evolving Graphs for Enhanced Representation of Traffic Dynamics | Aditya Humnabadkar et.al. | 2409.11206 | null |
2024-09-17 | Optimization of Rulebooks via Asymptotically Representing Lexicographic Hierarchies for Autonomous Vehicles | Matteo Penlington et.al. | 2409.11199 | null |
2024-09-16 | Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving | Yunsheng Ma et.al. | 2409.11182 | null |
2024-09-18 | Annealed Winner-Takes-All for Motion Forecasting | Yihong Xu et.al. | 2409.11172 | link |
2024-09-17 | UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height | Zichen Yu et.al. | 2409.11160 | null |
2024-09-17 | Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | Rui Yu et.al. | 2409.11018 | null |
2024-09-17 | TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection | Philip Jacobson et.al. | 2409.10901 | null |
2024-09-20 | CoMamba: Real-time Cooperative Perception Unlocked with State Space Models | Jinlong Li et.al. | 2409.10699 | null |
2024-09-02 | An Examination of Offline-Trained Encoders in Vision-Based Deep Reinforcement Learning for Autonomous Driving | Shawan Mohammed et.al. | 2409.10554 | null |
2024-08-30 | 3CSim: CARLA Corner Case Simulation for Control Assessment in Autonomous Driving | Matúš Čávojský et.al. | 2409.10524 | null |
2024-09-16 | Radar Teach and Repeat: Architecture and Initial Field Testing | Xinyuan Qiao et.al. | 2409.10491 | link |
2024-09-16 | XLM for Autonomous Driving Systems: A Comprehensive Review | Sonda Fourati et.al. | 2409.10484 | null |
2024-09-16 | DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving | Songning Lai et.al. | 2409.10330 | null |
2024-09-16 | SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation | Benjamin Stoler et.al. | 2409.10320 | link |
2024-09-16 | Robust Bird’s Eye View Segmentation by Adapting DINOv2 | Merve Rabia Barın et.al. | 2409.10228 | null |
2024-09-16 | ExelMap: Explainable Element-based HD-Map Change Detection and Update | Lena Wild et.al. | 2409.10178 | null |
2024-09-16 | Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles | Mais Jamal et.al. | 2409.10165 | null |
2024-09-16 | Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | Huy-Dung Nguyen et.al. | 2409.10095 | null |
2024-09-16 | LeGEND: A Top-Down Approach to Scenario Generation of Autonomous Driving Systems Assisted by Large Language Models | Shuncheng Tang et.al. | 2409.10066 | link |
2024-09-17 | GlobalMapNet: An Online Framework for Vectorized Global HD Map Construction | Anqi Shi et.al. | 2409.10063 | null |
2024-09-16 | DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments | Mahmud A. Mohamad et.al. | 2409.10041 | link |
2024-09-15 | Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation | Zhanteng Xie et.al. | 2409.09899 | null |
2024-09-15 | SAFER-Splat: A Control Barrier Function for Safe Navigation with Online Gaussian Splatting Maps | Timothy Chen et.al. | 2409.09868 | null |
2024-09-15 | A Comprehensive Survey of PID and Pure Pursuit Control Algorithms for Autonomous Vehicle Navigation | Harshit Jain et.al. | 2409.09848 | null |
2024-09-15 | NARF24: Estimating Articulated Object Structure for Implicit Rendering | Stanley Lewis et.al. | 2409.09829 | null |
2024-09-15 | DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving | Haisheng Su et.al. | 2409.09777 | link |
2024-09-15 | Risk-Aware Autonomous Driving for Linear Temporal Logic Specifications | Shuhao Qi et.al. | 2409.09769 | null |
2024-09-14 | Lab2Car: A Versatile Wrapper for Deploying Experimental Planners in Complex Real-world Environments | Marc Heim et.al. | 2409.09523 | null |
2024-09-14 | A Data-Informed Analysis of Scalable Supervision for Safety in Autonomous Vehicle Fleets | Cameron Hickert et.al. | 2409.09500 | null |
2024-09-14 | MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction | Yan Feng et.al. | 2409.09446 | link |
2024-09-14 | OPUS: Occupancy Prediction Using a Sparse Set | Jiabao Wang et.al. | 2409.09350 | link |
2024-09-11 | Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU | Zhenyu Ning et.al. | 2409.09086 | null |
2024-08-29 | Semantic Communication for Cooperative Perception using HARQ | Yucheng Sheng et.al. | 2409.09042 | null |
2024-09-13 | Causal Transformer for Fusion and Pose Estimation in Deep Visual Inertial Odometry | Yunus Bilge Kurt et.al. | 2409.08769 | link |
2024-09-13 | GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction | Siyu Li et.al. | 2409.08688 | link |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2024-09-13 | The Design of Informative Take-Over Requests for Semi-Autonomous Cyber-Physical Systems: Combining Spoken Language and Visual Icons in a Drone-Controller Setting | Ashwini Gundappa et.al. | 2409.08253 | null |
2024-09-12 | The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | André F. R. Guarda et.al. | 2409.08130 | null |
2024-09-12 | SoVAR: Building Generalizable Scenarios from Accident Reports for Autonomous Driving Testing | An Guo et.al. | 2409.08081 | link |
2024-09-12 | Expansive Supervision for Neural Radiance Field | Weixiang Zhang et.al. | 2409.08056 | null |
2024-10-18 | LED: Light Enhanced Depth Estimation at Night | Simon de Moreau et.al. | 2409.08031 | link |
2024-09-12 | Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes | Ming Li et.al. | 2409.07843 | null |
2024-09-12 | ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable | Yuan Yin et.al. | 2409.07830 | link |
2024-09-12 | GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions | Liang Feng et.al. | 2409.07798 | null |
2024-09-13 | ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation | Shiwei Feng et.al. | 2409.07774 | link |
2024-09-12 | GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution | Liang Feng et.al. | 2409.07752 | null |
2024-09-12 | Attack End-to-End Autonomous Driving through Module-Wise Noise | Lu Wang et.al. | 2409.07706 | null |
2024-09-21 | A Comprehensive Survey on Inverse Constrained Reinforcement Learning: Definitions, Progress and Challenges | Guiliang Liu et.al. | 2409.07569 | link |
2024-09-11 | Unsupervised Point Cloud Registration with Self-Distillation | Christian Löwens et.al. | 2409.07558 | link |
2024-09-11 | Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving | Tianyuan Zhang et.al. | 2409.07321 | null |
2024-09-25 | MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving | Enming Zhang et.al. | 2409.07267 | link |
2024-09-11 | Behavioral Cloning Models Reality Check for Autonomous Driving | Mustafa Yildirim et.al. | 2409.07218 | null |
2024-09-11 | ThermalGaussian: Thermal 3D Gaussian Splatting | Rongfeng Lu et.al. | 2409.07200 | link |
2024-09-10 | Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds | Mu Cai et.al. | 2409.06827 | link |
2024-09-10 | LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation | Archana Swaminathan et.al. | 2409.06703 | null |
2024-09-10 | Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving | Kairui Ding et.al. | 2409.06702 | null |
2024-09-10 | Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception | Xiang Zhang et.al. | 2409.06584 | null |
2024-09-10 | Sources of Uncertainty in 3D Scene Reconstruction | Marcus Klasson et.al. | 2409.06407 | link |
2024-09-10 | Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement | Yang Wen et.al. | 2409.06334 | null |
2024-09-10 | UdeerLID+: Integrating LiDAR, Image, and Relative Depth with Semi-Supervised | Tao Ni et.al. | 2409.06197 | null |
2024-09-11 | MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control | Yining Yao et.al. | 2409.06189 | null |
2024-09-09 | LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo | Wei Zhi Tang et.al. | 2409.06104 | link |
2024-09-09 | Promptable Closed-loop Traffic Simulation | Shuhan Tan et.al. | 2409.05863 | null |
2024-09-09 | Vision-Driven 2D Supervised Fine-Tuning Framework for Bird’s Eye View Perception | Lei He et.al. | 2409.05834 | null |
2024-09-09 | Replay Consolidation with Label Propagation for Continual Object Detection | Riccardo De Monte et.al. | 2409.05650 | null |
2024-09-09 | G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis | Lutao Jiang et.al. | 2409.05617 | null |
2024-09-12 | DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation | Wei Wu et.al. | 2409.05463 | null |
2024-09-11 | Distribution Discrepancy and Feature Heterogeneity for Active 3D Object Detection | Huang-Yu Chen et.al. | 2409.05425 | link |
2024-09-09 | ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions | Furqan Ahmed Shaik et.al. | 2409.05327 | null |
2024-09-09 | Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems | Jianheng Liu et.al. | 2409.05310 | null |
2024-10-22 | Developing Path Planning with Behavioral Cloning and Proximal Policy Optimization for Path-Tracking and Static Obstacle Nudging | Mingyan Zhou et.al. | 2409.05289 | link |
2024-09-08 | Enhancing the Performance of Multi-Vehicle Navigation in Unstructured Environments using Hard Sample Mining | Yining Ma et.al. | 2409.05119 | link |
2024-09-08 | RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network | Zhiwei Lin et.al. | 2409.04979 | null |
2024-09-07 | A Comprehensive Survey on Evidential Deep Learning and Its Applications | Junyu Gao et.al. | 2409.04720 | link |
2024-09-06 | Multi-scale Feature Fusion with Point Pyramid for 3D Object Detection | Weihao Lu et.al. | 2409.04601 | null |
2024-09-06 | SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields | Yuze Wang et.al. | 2409.04482 | null |
2024-09-06 | Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences | Rui Yu et.al. | 2409.04390 | null |
2024-09-06 | Safe and Efficient Path Planning under Uncertainty via Deep Collision Probability Fields | Felix Herrmann et.al. | 2409.04306 | null |
2024-09-06 | Secure Traffic Sign Recognition: An Attention-Enabled Universal Image Inpainting Mechanism against Light Patch Attacks | Hangcheng Cao et.al. | 2409.04133 | null |
2024-09-05 | Multi-agent Path Finding for Mixed Autonomy Traffic Coordination | Han Zheng et.al. | 2409.03881 | null |
2024-09-05 | Prediction Accuracy & Reliability: Classification and Object Localization under Distribution Shift | Fabian Diet et.al. | 2409.03543 | null |
2024-09-05 | Neural HD Map Generation from Multiple Vectorized Tiles Locally Produced by Autonomous Vehicles | Miao Fan et.al. | 2409.03445 | null |
2024-09-05 | Weight Conditioning for Smooth Optimization of Neural Networks | Hemanth Saratchandran et.al. | 2409.03424 | null |
2024-09-05 | YOLO-PPA based Efficient Traffic Sign Detection for Cruise Control in Autonomous Driving | Jingyu Zhang et.al. | 2409.03320 | null |
2024-09-05 | OccLLaMA: An Occupancy-Language-Action Generative World Model for Autonomous Driving | Julong Wei et.al. | 2409.03272 | null |
2024-09-05 | Multiple weather images restoration using the task transformer and adaptive mixup strategy | Yang Wen et.al. | 2409.03249 | null |
2024-09-05 | Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction | Shen Chen et.al. | 2409.03213 | null |
2024-09-05 | Autonomous Drifting Based on Maximal Safety Probability Learning | Hikaru Hoshino et.al. | 2409.03160 | link |
2024-09-04 | Developing, Analyzing, and Evaluating Self-Drive Algorithms Using Drive-by-Wire Electric Vehicles | Beñat Froemming-Aldanondo et.al. | 2409.03114 | link |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo et.al. | 2409.02917 | link |
2024-09-04 | Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving | Yuhang Lu et.al. | 2409.02914 | null |
2024-09-08 | GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling | Huawei Sun et.al. | 2409.02720 | link |
2024-09-04 | Improved Single Camera BEV Perception Using Multi-Camera Training | Daniel Busch et.al. | 2409.02676 | null |
2024-09-25 | Learnable Wireless Digital Twins: Reconstructing Electromagnetic Field with Neural Representations | Shuaifeng Jiang et.al. | 2409.02564 | null |
2024-09-04 | Want a Ride? Attitudes Towards Autonomous Driving and Behavior in Autonomous Vehicles | Enrico Del Re et.al. | 2409.02556 | null |
2024-09-04 | TLD: A Vehicle Tail Light signal Dataset and Benchmark | Jinhao Chai et.al. | 2409.02508 | null |
2024-09-04 | eRSS-RAMP: A Rule-Adherence Motion Planner Based on Extended Responsibility-Sensitive Safety for Autonomous Driving | Pengfei Lin et.al. | 2409.02503 | null |
2024-09-04 | A Learnable Color Correction Matrix for RAW Reconstruction | Anqi Liu et.al. | 2409.02497 | null |
2024-10-09 | TASAR: Transfer-based Attack on Skeletal Action Recognition | Yunfeng Diao et.al. | 2409.02483 | link |
2024-09-04 | Local map Construction Methods with SD map: A Novel Survey | Jiaqi Li et.al. | 2409.02415 | null |
2024-09-04 | GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving | Huasong Han et.al. | 2409.02382 | null |
2024-09-03 | AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions | Chenghao Qian et.al. | 2409.02045 | link |
2024-09-03 | Snapshot: Towards Application-centered Models for Pedestrian Trajectory Prediction in Urban Traffic Environments | Nico Uhlemann et.al. | 2409.01971 | link |
2024-09-03 | $S^2$ NeRF: Privacy-preserving Training Framework for NeRF | Bokang Zhang et.al. | 2409.01661 | link |
2024-09-03 | DiVE: DiT-based Video Generation with Enhanced Control | Junpeng Jiang et.al. | 2409.01595 | null |
2024-09-03 | Situation-aware Autonomous Driving Decision Making with Cooperative Perception on Demand | Wei Liu et.al. | 2409.01504 | null |
2024-09-02 | Mutual Benefit: The Case for Sharing Autonomous Vehicle Data with the Public | David Goedicke et.al. | 2409.01342 | null |
2024-09-02 | An Investigation of Denial of Service Attacks on Autonomous Driving Software and Hardware in Operation | Tillmann Stübler et.al. | 2409.01324 | null |
2024-09-02 | Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling | Haicheng Liao et.al. | 2409.01256 | null |
2024-10-04 | CyberCortex.AI: An AI-based Operating System for Autonomous Robotics and Complex Automation | Sorin Grigorescu et.al. | 2409.01241 | null |
2024-09-02 | Integrating End-to-End and Modular Driving Approaches for Online Corner Case Detection in Autonomous Driving | Gemb Kaljavesi et.al. | 2409.01178 | null |
2024-09-02 | From Bird’s-Eye to Street View: Crafting Diverse and Condition-Aligned Images with Latent Diffusion Model | Xiaojie Xu et.al. | 2409.01014 | null |
2024-09-02 | Development of Occupancy Prediction Algorithm for Underground Parking Lots | Shijie Wang et.al. | 2409.00923 | null |
2024-09-02 | Multi-scale Temporal Fusion Transformer for Incomplete Vehicle Trajectory Prediction | Zhanwen Liu et.al. | 2409.00904 | null |
2024-09-05 | Trustworthy Human-AI Collaboration: Reinforcement Learning with Human Feedback and Physics Knowledge for Safe Autonomous Driving | Zilin Huang et.al. | 2409.00858 | link |
2024-09-01 | Image-to-Lidar Relational Distillation for Autonomous Driving Data | Anas Mahmoud et.al. | 2409.00845 | null |
2024-09-01 | Study of Dropout in PointPillars with 3D Object Detection | Xiaoxiang Sun et.al. | 2409.00673 | null |
2024-09-01 | Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression | Dingyuan Zhang et.al. | 2409.00633 | link |
2024-09-01 | Enhancing Vectorized Map Perception with Historical Rasterized Maps | Xiaoyu Zhang et.al. | 2409.00620 | link |
2024-09-01 | Online Temporal Fusion for Vectorized Map Construction in Mapless Autonomous Driving | Jiagang Chen et.al. | 2409.00593 | null |
2024-08-31 | Online Learning of Interaction Dynamics with Dual Model Predictive Control for Multi-Agent Systems Using Gaussian Processes | T. M. J. T. Baltussen et.al. | 2409.00432 | null |
2024-08-30 | ContextVLM: Zero-Shot and Few-Shot Context Understanding for Autonomous Driving using Vision Language Models | Shounak Sural et.al. | 2409.00301 | null |
2024-09-17 | RING#: PR-by-PE Global Localization with Roto-translation Equivariant Gram Learning | Sha Lu et.al. | 2409.00206 | null |
2024-08-19 | No Need to Sacrifice Data Quality for Quantity: Crowd-Informed Machine Annotation for Cost-Effective Understanding of Visual Data | Christopher Klugmann et.al. | 2409.00048 | null |
2024-08-30 | Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations | Ahmed Hammam et.al. | 2408.17311 | null |
2024-08-30 | How Could Generative AI Support Compliance with the EU AI Act? A Review for Safe Automated Driving Perception | Mert Keser et.al. | 2408.17222 | null |
2024-08-30 | NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar | Runwei Guan et.al. | 2408.17207 | null |
2024-08-30 | UTrack: Multi-Object Tracking with Uncertain Detections | Edgardo Solano-Carrillo et.al. | 2408.17098 | link |
2024-08-30 | PIB: Prioritized Information Bottleneck Framework for Collaborative Edge Video Analytics | Zhengru Fang et.al. | 2408.17047 | link |
2024-08-30 | Transient Fault Tolerant Semantic Segmentation for Autonomous Driving | Leonardo Iurada et.al. | 2408.16952 | link |
2024-08-29 | OmniRe: Omni Urban Scene Reconstruction | Ziyu Chen et.al. | 2408.16760 | null |
2024-08-29 | DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Yongjie Fu et.al. | 2408.16647 | null |
2024-08-28 | A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions | Yu Wang et.al. | 2408.16530 | link |
2024-08-29 | CooTest: An Automated Testing Approach for V2X Communication Systems | An Guo et.al. | 2408.16470 | link |
2024-08-29 | NeRF-CA: Dynamic Reconstruction of X-ray Coronary Angiography with Extremely Sparse-views | Kirsten W. H. Maas et.al. | 2408.16355 | link |
2024-09-12 | BEVal: A Cross-dataset Evaluation Study of BEV Segmentation Models for Autonomous Driving | Manuel Alejandro Diaz-Zapata et.al. | 2408.16322 | link |
2024-08-29 | PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird’s-Eye-View | Zichen Yu et.al. | 2408.16200 | link |
2024-08-28 | GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model | Yongjie Fu et.al. | 2408.15868 | null |
2024-08-28 | Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph | Zherong Zhang et.al. | 2408.15750 | null |
2024-09-05 | G-Style: Stylized Gaussian Splatting | Áron Samuel Kovács et.al. | 2408.15695 | link |
2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et.al. | 2408.15657 | link |
2024-09-25 | RoboSense: Large-scale Dataset and Benchmark for Multi-sensor Low-speed Autonomous Driving | Haisheng Su et.al. | 2408.15503 | link |
2024-08-27 | Panoptic Perception for Autonomous Driving: A Survey | Yunge Li et.al. | 2408.15388 | null |
2024-08-27 | Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Saining Zhang et.al. | 2408.15242 | link |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-10-04 | T-FAKE: Synthesizing Thermal Images for Facial Landmarking | Philipp Flotho et.al. | 2408.15127 | link |
2024-08-27 | Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack | Naufal Suryanto et.al. | 2408.14879 | link |
2024-09-28 | GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning | Shubhendu Jena et.al. | 2408.14724 | null |
2024-08-26 | Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving | Yu Yang et.al. | 2408.14197 | null |
2024-08-26 | EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection | Pengyu Li et.al. | 2408.14189 | null |
2024-08-26 | Quantitative Representation of Scenario Difficulty for Autonomous Driving Based on Adversarial Policy Search | Shuo Yang et.al. | 2408.14000 | null |
2024-08-26 | FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation | Daixun Li et.al. | 2408.13980 | null |
2024-08-25 | Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems | Mohammad Hossein Amini et.al. | 2408.13950 | null |
2024-08-25 | TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training | Li Li et.al. | 2408.13902 | null |
2024-08-25 | Making Large Language Models Better Planners with Reasoning-Decision Alignment | Zhijian Huang et.al. | 2408.13890 | null |
2024-08-25 | Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation | Yuwen Pan et.al. | 2408.13838 | null |
2024-08-25 | TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Xiongwei Zhao et.al. | 2408.13802 | link |
2024-08-25 | CV-MOS: A Cross-View Model for Motion Segmentation | Xiaoyu Tang et.al. | 2408.13790 | link |
2024-08-28 | Multi-modal Integrated Prediction and Decision-making with Adaptive Interaction Modality Explorations | Tong Li et.al. | 2408.13742 | link |
2024-08-24 | Perception-Guided Fuzzing for Simulated Scenario-Based Testing of Autonomous Driving Systems | Tri Minh Triet Pham et.al. | 2408.13686 | null |
2024-08-24 | Evaluating the Robustness of LiDAR-based 3D Obstacles Detection and Its Impacts on Autonomous Driving Systems | Tri Minh Triet Pham et.al. | 2408.13653 | null |
2024-08-24 | CSS-Segment: 2nd Place Report of LSVOS Challenge VOS Track | Jinming Chai et.al. | 2408.13582 | null |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-24 | AdaOcc: Adaptive-Resolution Occupancy Prediction | Chao Chen et.al. | 2408.13454 | null |
2024-08-23 | SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting | Jiseung Hong et.al. | 2408.13285 | link |
2024-08-23 | General Intelligent Imaging and Uncertainty Quantification by Deterministic Diffusion Model | Weiru Fan et.al. | 2408.13061 | null |
2024-08-23 | Courteous MPC for Autonomous Driving with CBF-inspired Risk Assessment | Yanze Zhang et.al. | 2408.12822 | null |
2024-08-23 | A Safe Self-evolution Algorithm for Autonomous Driving Based on Data-Driven Risk Quantification Model | Shuo Yang et.al. | 2408.12805 | null |
2024-08-22 | Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection | Ruixiao Zhang et.al. | 2408.12708 | null |
2024-09-01 | Can LLMs Understand Social Norms in Autonomous Driving Games? | Boxuan Wang et.al. | 2408.12680 | null |
2024-08-22 | Multimodal Foundational Models for Unsupervised 3D General Obstacle Detection | Tamás Matuszka et.al. | 2408.12322 | null |
2024-08-22 | A Safety-Oriented Self-Learning Algorithm for Autonomous Driving: Evolution Starting from a Basic Model | Shuo Yang et.al. | 2408.12190 | null |
2024-08-22 | A Safe and Efficient Self-evolving Algorithm for Decision-making and Control of Autonomous Driving Systems | Shuo Yang et.al. | 2408.12187 | null |
2024-08-22 | Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation | Woo Kyung Kim et.al. | 2408.12110 | null |
2024-08-22 | Enhancing Sampling Protocol for Robust Point Cloud Classification | Chongshou Li et.al. | 2408.12062 | null |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-21 | MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering | Yonglin Tian et.al. | 2408.11464 | null |
2024-08-21 | Irregularity Inspection using Neural Radiance Field | Tianqi Ding et.al. | 2408.11251 | null |
2024-08-20 | Enhancing End-to-End Autonomous Driving Systems Through Synchronized Human Behavior Data | Yiqun Duan et.al. | 2408.10908 | null |
2024-08-20 | Open 3D World in Autonomous Driving | Xinlong Cheng et.al. | 2408.10880 | null |
2024-08-19 | CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving | Hidehisa Arai et.al. | 2408.10845 | null |
2024-08-20 | TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai et.al. | 2408.10739 | null |
2024-08-20 | Privacy-preserving Universal Adversarial Defense for Black-box Models | Qiao Li et.al. | 2408.10647 | null |
2024-08-20 | MV-MOS: Multi-View Feature Fusion for 3D Moving Object Segmentation | Jintao Cheng et.al. | 2408.10602 | link |
2024-08-20 | Constrained Behavior Cloning for Robotic Learning | Wensheng Liang et.al. | 2408.10568 | null |
2024-08-20 | Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception | Jiaru Zhong et.al. | 2408.10531 | null |
2024-09-25 | System-Level Design Space Exploration for High-Level Synthesis under End-to-End Latency Constraints | Yuchao Liao et.al. | 2408.10431 | null |
2024-08-16 | Diffusion Model for Planning: A Systematic Literature Review | Toshihide Ubukata et.al. | 2408.10266 | null |
2024-08-21 | NeRF-US: Removing Ultrasound Imaging Artifacts from Neural Radiance Fields in the Wild | Rishit Dagli et.al. | 2408.10258 | null |
2024-07-22 | Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications | Sinan Ibrahim et.al. | 2408.10215 | null |
2024-08-19 | $R^2$ -Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement | Haoyang Wang et.al. | 2408.10135 | null |
2024-08-19 | Edge-Cloud Collaborative Motion Planning for Autonomous Driving with Large Language Models | Jiao Chen et.al. | 2408.09972 | null |
2024-09-06 | DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery | Corentin Dumery et.al. | 2408.09928 | null |
2024-10-01 | Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Jun Yan et.al. | 2408.09839 | link |
2024-08-19 | Automated Vehicle Driver Monitoring Dataset from Real-World Scenarios | Mohamed Sabry et.al. | 2408.09833 | null |
2024-08-19 | Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM | Sanghyun Hahn et.al. | 2408.09727 | link |
2024-08-19 | Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey | Ruiqi Zhang et.al. | 2408.09675 | link |
2024-08-18 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Muhammad Rameez Ur Rahman et.al. | 2408.09424 | link |
2024-08-18 | S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis | Dongze Li et.al. | 2408.09347 | null |
2024-08-17 | Reinforcement Learning Compensated Model Predictive Control for Off-road Driving on Unknown Deformable Terrain | Prakhar Gupta et.al. | 2408.09253 | null |
2024-09-16 | V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models | Junwei You et.al. | 2408.09251 | null |
2024-08-17 | SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation | Xiao Cao et.al. | 2408.09144 | null |
2024-08-17 | MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation | Xiao Zhao et.al. | 2408.09122 | null |
2024-08-17 | LOID: Lane Occlusion Inpainting and Detection for Enhanced Autonomous Driving Systems | Aayush Agrawal et.al. | 2408.09117 | null |
2024-08-17 | HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction | Xiao Zhao et.al. | 2408.09104 | null |
2024-08-15 | A Survey of Trojan Attacks and Defenses to Deep Neural Networks | Lingxin Jin et.al. | 2408.08920 | null |
2024-08-20 | PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors | Rongxuan Wang et.al. | 2408.08802 | null |
2024-08-16 | A Transparency Paradox? Investigating the Impact of Explanation Specificity and Autonomous Vehicle Perceptual Inaccuracies on Passengers | Daniel Omeiza et.al. | 2408.08785 | null |
2024-08-16 | VF-NeRF: Learning Neural Vector Fields for Indoor Scene Reconstruction | Albert Gassol Puigjaner et.al. | 2408.08766 | link |
2024-08-16 | S-RAF: A Simulation-Based Robustness Assessment Framework for Responsible Autonomous Driving | Daniel Omeiza et.al. | 2408.08584 | link |
2024-08-16 | CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving | Shihan Peng et.al. | 2408.08500 | null |
2024-08-15 | A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts | Zhihao Lin et.al. | 2408.08242 | null |
2024-08-15 | Learned Multimodal Compression for Autonomous Driving | Hadi Hadizadeh et.al. | 2408.08211 | null |
2024-08-15 | Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Wenxuan Li et.al. | 2408.07999 | link |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-14 | LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image | Fan Yang et.al. | 2408.07422 | null |
2024-08-17 | Risk Occupancy: A New and Efficient Paradigm through Vehicle-Road-Cloud Collaboration | Jiaxing Chen et.al. | 2408.07367 | null |
2024-08-13 | FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving | Yutao Zhu et.al. | 2408.06832 | null |
2024-08-13 | Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions | Miao Zhang et.al. | 2408.06772 | null |
2024-08-13 | A lightweight YOLOv5-FFM model for occlusion pedestrian detection | Xiangjie Luo et.al. | 2408.06633 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) | Jaydeep Rade et.al. | 2408.06244 | null |
2024-09-26 | FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Lukas Meyer et.al. | 2408.06190 | link |
2024-08-12 | IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI | Yash Rampuria et.al. | 2408.06113 | null |
2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | null |
2024-08-12 | Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network | Kailai Sun et.al. | 2408.05877 | null |
2024-08-11 | ICSFuzz: Collision Detector Bug Discovery in Autonomous Driving Simulators | Weiwei Fu et.al. | 2408.05694 | null |
2024-08-10 | What Matters in Autonomous Driving Anomaly Detection: A Weakly Supervised Horizon | Utkarsh Tiwari et.al. | 2408.05562 | link |
2024-08-10 | Radiance Field Learners As UAV First-Person Viewers | Liqi Yan et.al. | 2408.05533 | null |
2024-08-20 | Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE | Yiying Yang et.al. | 2408.05477 | null |
2024-08-15 | DeepInteraction++: Multi-Modality Interaction for Autonomous Driving | Zeyu Yang et.al. | 2408.05075 | link |
2024-09-13 | FlowDreamer: exploring high fidelity text-to-3D generation via rectified flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-09 | CTE-MLO: Continuous-time and Efficient Multi-LiDAR Odometry with Localizability-aware Point Cloud Sampling | Hongming Shen et.al. | 2408.04901 | link |
2024-08-09 | VLM-MPC: Vision Language Foundation Model (VLM)-Guided Model Predictive Controller (MPC) for Autonomous Driving | Keke Long et.al. | 2408.04821 | null |
2024-08-09 | FewShotNeRF: Meta-Learning-based Novel View Synthesis for Rapid Scene-Specific Adaptation | Piraveen Sivakumar et.al. | 2408.04803 | null |
2024-08-08 | Eliminating Backdoors in Neural Code Models via Trigger Inversion | Weisong Sun et.al. | 2408.04683 | null |
2024-08-08 | Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond | Ravi Ramamoorthi et.al. | 2408.04586 | null |
2024-08-08 | Field Testing and Detection of Camera Interference for Autonomous Driving | Ki Beom Park et.al. | 2408.04524 | null |
2024-08-08 | Reinforcement Learning from Human Feedback for Lane Changing of Autonomous Vehicles in Mixed Traffic | Yuting Wang et.al. | 2408.04447 | null |
2024-08-08 | Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods | Yiming Zhou et.al. | 2408.04268 | null |
2024-08-08 | Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform | Daniel Vargas et.al. | 2408.04195 | null |
2024-08-07 | MORTAR: A Model-based Runtime Action Repair Framework for AI-enabled Cyber-Physical Systems | Renzhi Wang et.al. | 2408.03892 | null |
2024-08-07 | Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Christian Fruhwirth-Reisinger et.al. | 2408.03790 | link |
2024-08-07 | MS-Mapping: An Uncertainty-Aware Large-Scale Multi-Session LiDAR Mapping System | Xiangcheng Hu et.al. | 2408.03723 | link |
2024-08-07 | Goal-oriented Semantic Communication for the Metaverse Application | Zhe Wang et.al. | 2408.03646 | null |
2024-08-14 | DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba | Chengran Yuan et.al. | 2408.03601 | null |
2024-08-07 | Leveraging LLMs for Enhanced Open-Vocabulary 3D Scene Understanding in Autonomous Driving | Amirhosein Chahe et.al. | 2408.03516 | link |
2024-08-06 | Communication-Aware Consistent Edge Selection for Mobile Users and Autonomous Vehicles | Nazish Tahir et.al. | 2408.03435 | null |
2024-08-06 | RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis | Hugo Blanc et.al. | 2408.03356 | null |
2024-08-06 | Efficient NeRF Optimization – Not All Samples Remain Equally Hard | Juuso Korhonen et.al. | 2408.03193 | null |
2024-08-06 | Integrated Intention Prediction and Decision-Making with Spectrum Attention Net and Proximal Policy Optimization | Xiao Zhou et.al. | 2408.03191 | null |
2024-08-06 | Research on Autonomous Driving Decision-making Strategies based Deep Reinforcement Learning | Zixiang Wang et.al. | 2408.03084 | null |
2024-08-06 | SCOPE: A Synthetic Multi-Modal Dataset for Collective Perception Including Physical-Correct Weather Conditions | Jörg Gamerdinger et.al. | 2408.03065 | null |
2024-08-06 | Cross-cultural analysis of pedestrian group behaviour influence on crossing decisions in interactions with autonomous vehicles | Sergio Martín Serrano et.al. | 2408.03003 | null |
2024-08-06 | Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions | Amanda Jayanetti et.al. | 2408.02938 | null |
2024-08-06 | Compromising Embodied Agents with Contextual Backdoor Attacks | Aishan Liu et.al. | 2408.02882 | null |
2024-08-04 | Model Hijacking Attack in Federated Learning | Zheng Li et.al. | 2408.02131 | null |
2024-08-27 | KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Zhihao Lai et.al. | 2408.02088 | null |
2024-08-03 | FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields | Yifan Wu et.al. | 2408.01878 | null |
2024-08-03 | E $^3$ NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images | Yunshan Qi et.al. | 2408.01840 | null |
2024-08-03 | STDA: Spatio-Temporal Dual-Encoder Network Incorporating Driver Attention to Predict Driver Behaviors Under Safety-Critical Scenarios | Dongyang Xu et.al. | 2408.01774 | null |
2024-08-03 | LAM3D: Leveraging Attention for Monocular 3D Object Detection | Diana-Alexandra Sas et.al. | 2408.01739 | null |
2024-08-02 | Trainable Pointwise Decoder Module for Point Cloud Segmentation | Bike Chen et.al. | 2408.01548 | null |
2024-08-01 | Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps | Hengyuan Zhang et.al. | 2408.01471 | null |
2024-07-18 | SUSTechGAN: Image Generation for Object Recognition in Adverse Conditions of Autonomous Driving | Gongjin Lan et.al. | 2408.01430 | link |
2024-08-02 | NeRFoot: Robot-Footprint Estimation for Image-Based Visual Servoing | Daoxin Zhong et.al. | 2408.01251 | null |
2024-08-02 | CommonUppRoad: A Framework of Formal Modelling, Verifying, Learning, and Visualisation of Autonomous Vehicles | Rong Gu et.al. | 2408.01093 | null |
2024-08-02 | Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions | Ajinkya Shinde et.al. | 2408.01085 | null |
2024-08-02 | MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection | Xiangbo Gao et.al. | 2408.01037 | link |
2024-09-13 | UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization | Ziwen Guo et.al. | 2408.00860 | null |
2024-07-15 | Quantification and Validation for Degree of Understanding in M2M Semantic Communications | Linhan Xia et.al. | 2408.00767 | null |
2024-08-01 | Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang et.al. | 2408.00766 | null |
2024-08-01 | MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection | Xiangyuan Peng et.al. | 2408.00565 | null |
2024-08-01 | DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving | Xuemeng Yang et.al. | 2408.00415 | null |
2024-08-01 | Enabling Next-Generation V2X Perception: Wireless Rigid Body Localization and Tracking | Niclas Führling et.al. | 2408.00349 | null |
2024-08-01 | RoCo:Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment | Zhe Huang et.al. | 2408.00257 | link |
2024-07-31 | StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | Kaiyuan Tang et.al. | 2408.00150 | null |
2024-07-31 | Areas of Improvement for Autonomous Vehicles: A Machine Learning Analysis of Disengagement Reports | Tyler Ward et.al. | 2408.00051 | null |
2024-07-31 | MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Seongju Lee et.al. | 2407.21635 | link |
2024-08-01 | Analysis of Functional Insufficiencies and Triggering Conditions to Improve the SOTIF of an MPC-based Trajectory Planner | Mirko Conrad et.al. | 2407.21569 | null |
2024-07-31 | SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving | Peiru Zheng et.al. | 2407.21293 | null |
2024-07-30 | Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving | Bernard Lange et.al. | 2407.21126 | null |
2024-07-22 | PAV: Personalized Head Avatar from Unstructured Video Collection | Akin Caliskan et.al. | 2407.21047 | null |
2024-07-30 | Learning Ordinality in Semantic Segmentation | Rafael Cristino et.al. | 2407.20959 | null |
2024-07-30 | A Comparative Study of Neural Surface Reconstruction for Scientific Visualization | Siyuan Yao et.al. | 2407.20868 | null |
2024-07-30 | Optimizing 5G-Advanced Networks for Time-critical Applications: The Role of L4S | Guangjin Pan et.al. | 2407.20852 | null |
2024-07-30 | Task-Oriented Communication for Vehicle-to-Infrastructure Cooperative Perception | Jiawei Shao et.al. | 2407.20748 | null |
2024-07-30 | Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization | Michael Kölle et.al. | 2407.20739 | null |
2024-07-30 | Scene-Specific Trajectory Sets: Maximizing Representation in Motion Forecasting | Abhishek Vivekanandan et.al. | 2407.20732 | null |
2024-07-30 | On-the-fly Communication-and-Computing to Enable Representation Learning for Distributed Point Clouds | Xu Chen et.al. | 2407.20710 | null |
2024-07-29 | Radiance Fields for Robotic Teleoperation | Maximum Wilder-Smith et.al. | 2407.20194 | link |
2024-07-29 | Collision Probability Distribution Estimation via Temporal Difference Learning | Thomas Steinecker et.al. | 2407.20000 | link |
2024-07-29 | Hydrodynamics of pulsating active liquids | Tirthankar Banerjee et.al. | 2407.19955 | null |
2024-07-29 | Garment Animation NeRF with Color Editing | Renke Wang et.al. | 2407.19774 | link |
2024-07-29 | ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement | Ezequiel Perez-Zarate et.al. | 2407.19708 | link |
2024-07-28 | HD-maps as Prior Information for Globally Consistent Mapping in GPS-denied Environments | Waqas Ali et.al. | 2407.19463 | null |
2024-07-28 | FINER++: Building a Family of Variable-periodic Functions for Activating Implicit Neural Representation | Hao Zhu et.al. | 2407.19434 | null |
2024-07-28 | Reputation-Driven Asynchronous Federated Learning for Enhanced Trajectory Prediction with Blockchain | Weiliang Chen et.al. | 2407.19428 | null |
2024-07-27 | Large Language Models for Human-like Autonomous Driving: A Survey | Yun Li et.al. | 2407.19280 | null |
2024-07-26 | Addressing Behavior Model Inaccuracies for Safe Motion Control in Uncertain Dynamic Environments | Minjun Sung et.al. | 2407.19071 | null |
2024-07-26 | Sparse Refinement for Efficient High-Resolution Semantic Segmentation | Zhijian Liu et.al. | 2407.19014 | null |
2024-07-26 | Wolf: Captioning Everything with a World Summarization Framework | Boyi Li et.al. | 2407.18908 | null |
2024-07-26 | SHANGUS: Deep Reinforcement Learning Meets Heuristic Optimization for Speedy Frontier-Based Exploration of Autonomous Vehicles in Unknown Spaces | Seunghyeop Nam et.al. | 2407.18892 | null |
2024-07-26 | HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM | Zhe Xin et.al. | 2407.18813 | null |
2024-07-26 | Foundation Models for the Digital Twin Creation of Cyber-Physical Systems | Shaukat Ali et.al. | 2407.18779 | null |
2024-09-07 | IOVS4NeRF:Incremental Optimal View Selection for Large-Scale NeRFs | Jingpeng Xie et.al. | 2407.18611 | null |
2024-08-04 | PP-TIL: Personalized Planning for Autonomous Driving with Instance-based Transfer Imitation Learning | Fangze Lin et.al. | 2407.18569 | link |
2024-07-29 | Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network | Guipeng Xin et.al. | 2407.18551 | link |
2024-07-26 | Gaussian Lane Keeping: A Robust Prediction Baseline | David Isele et.al. | 2407.18451 | null |
2024-07-16 | Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC) | Seyed Nima Omidsajedi et.al. | 2407.18264 | null |
2024-07-25 | Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception | Julia Hindel et.al. | 2407.18145 | null |
2024-09-10 | TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework | Guanfeng Tang et.al. | 2407.18038 | null |
2024-07-25 | StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span Memory | Zhiheng Li et.al. | 2407.17905 | link |
2024-07-25 | Image Segmentation via Divisive Normalization: dealing with environmental diversity | Pablo Hernández-Cámara et.al. | 2407.17829 | null |
2024-07-25 | CRASH: Crash Recognition and Anticipation System Harnessing with Context-Aware and Temporal Focus Attentions | Haicheng Liao et.al. | 2407.17757 | null |
2024-07-25 | Control Informed Design of the IAC Autonomous Racecar for Operation at the Dynamic Envelope | Qilun Zhu et.al. | 2407.17737 | null |
2024-07-20 | CORT: Class-Oriented Real-time Tracking for Embedded Systems | Edoardo Cittadini et.al. | 2407.17521 | null |
2024-07-24 | 3D Question Answering for City Scene Understanding | Penglei Sun et.al. | 2407.17398 | null |
2024-07-24 | Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches | Chenxing Zhao et.al. | 2407.17312 | null |
2024-07-25 | LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume Rendering | Simon Boeder et.al. | 2407.17310 | null |
2024-07-24 | Testing Large Language Models on Driving Theory Knowledge and Skills for Connected Autonomous Vehicles | Zuoyin Tang et.al. | 2407.17211 | null |
2024-07-24 | Applications of Multi-Agent Deep Reinforcement Learning Communication in Network Management: A Survey | Yue Pi et.al. | 2407.17030 | null |
2024-07-24 | Progressive Query Refinement Framework for Bird’s-Eye-View Semantic Segmentation from Surrounding Images | Dooseop Choi et.al. | 2407.17003 | link |
2024-07-23 | SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving with Analytic Safety Guarantees | Tianyu Shi et.al. | 2407.16857 | null |
2024-07-24 | A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data | Adrian Remonda et.al. | 2407.16680 | link |
2024-07-23 | Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving | Anam Manzoor et.al. | 2407.16647 | null |
2024-07-24 | Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles | Seamie Hayes et.al. | 2407.16636 | null |
2024-07-23 | MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Youngmin Oh et.al. | 2407.16448 | link |
2024-07-23 | Cleaning Robots in Public Spaces: A Survey and Proposal for Benchmarking Based on Stakeholders Interviews | Raphael Memmesheimer et.al. | 2407.16393 | null |
2024-07-23 | Understanding Impacts of Electromagnetic Signal Injection Attacks on Object Detection | Youqian Zhang et.al. | 2407.16327 | null |
2024-07-26 | When, Where, and What? A Novel Benchmark for Accident Anticipation and Localization with Large Language Models | Haicheng Liao et.al. | 2407.16277 | null |
2024-07-23 | LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera | Yukai Ma et.al. | 2407.16197 | null |
2024-07-22 | BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes | Chih-Hai Su et.al. | 2407.15848 | null |
2024-07-22 | MILAN: Milli-Annotations for Lidar Semantic Segmentation | Nermin Samet et.al. | 2407.15797 | null |
2024-07-22 | Flow-guided Motion Prediction with Semantics and Dynamic Occupancy Grid Maps | Rabbia Asghar et.al. | 2407.15675 | null |
2024-07-22 | DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving | Jiahang Tu et.al. | 2407.15661 | link |
2024-07-22 | Towards a Universal Evaluation Model for Careful and Competent Autonomous Driving | Kethan Reddy et.al. | 2407.15596 | null |
2024-07-22 | WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding | Quan Kong et.al. | 2407.15350 | null |
2024-07-22 | Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection | Yiran Yang et.al. | 2407.15334 | link |
2024-07-20 | Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Dylan J. Foster et.al. | 2407.15007 | null |
2024-07-19 | Complementary Learning for Real-World Model Failure Detection | Daniel Bogdoll et.al. | 2407.14306 | link |
2024-07-19 | Hyperparameter Optimization for Driving Strategies Based on Reinforcement Learning | Nihal Acharya Adde et.al. | 2407.14262 | null |
2024-07-17 | Continual Learning for Adaptable Car-Following in Dynamic Traffic Environments | Xianda Chen et.al. | 2407.14247 | null |
2024-07-19 | KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models | Kemou Jiang et.al. | 2407.14239 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Semantic Communications for 3D Human Face Transmission with Neural Radiance Fields | Guanlin Wu et.al. | 2407.13992 | null |
2024-07-18 | Boosting Online 3D Multi-Object Tracking through Camera-Radar Cross Check | Sheng-Yao Kuan et.al. | 2407.13937 | null |
2024-09-05 | EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting | Yuchen Weng et.al. | 2407.13520 | null |
2024-07-19 | Mask2Map: Vectorized HD Map Construction Using Bird’s Eye View Segmentation Masks | Sehwan Choi et.al. | 2407.13517 | link |
2024-07-18 | Risk-Aware Vehicle Trajectory Prediction Under Safety-Critical Scenarios | Qingfan Wang et.al. | 2407.13480 | null |
2024-08-26 | Improving Out-of-Distribution Generalization of Trajectory Prediction for Autonomous Driving via Polynomial Representations | Yue Yao et.al. | 2407.13431 | link |
2024-07-18 | GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields | Xiufeng Huang et.al. | 2407.13390 | null |
2024-07-18 | Ultra-Low-Latency Edge Inference for Distributed Sensing | Zhanwei Wang et.al. | 2407.13360 | null |
2024-07-18 | $μ$ Drive: User-Controlled Autonomous Driving | Kun Wang et.al. | 2407.13201 | null |
2024-07-18 | KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter | Yifan Zhan et.al. | 2407.13185 | null |
2024-07-21 | Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement | Yulin He et.al. | 2407.13155 | null |
2024-07-18 | OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation | Jian Sun et.al. | 2407.13137 | null |
2024-07-18 | PG-Attack: A Precision-Guided Adversarial Attack Framework Against Vision Foundation Models for Autonomous Driving | Jiyuan Fu et.al. | 2407.13111 | link |
2024-07-17 | Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning | Minjae Cho et.al. | 2407.13006 | null |
2024-07-17 | KiGRAS: Kinematic-Driven Generative Model for Realistic Agent Simulation | Jianbo Zhao et.al. | 2407.12940 | null |
2024-07-17 | AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases | Zhaorun Chen et.al. | 2407.12784 | link |
2024-07-17 | SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization | Yiyang Chen et.al. | 2407.12667 | link |
2024-07-17 | InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction | Xulong Wang et.al. | 2407.12661 | link |
2024-07-25 | Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving | Yuqi Dai et.al. | 2407.12491 | null |
2024-07-19 | Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Sangjun Lee et.al. | 2407.12405 | link |
2024-07-17 | Efficient Depth-Guided Urban View Synthesis | Sheng Miao et.al. | 2407.12395 | null |
2024-07-17 | Invertible Neural Warp for NeRF | Shin-Fang Chng et.al. | 2407.12354 | null |
2024-07-17 | Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections | Congrong Xu et.al. | 2407.12306 | null |
2024-07-18 | Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling | Jaehyeok Kim et.al. | 2407.11962 | null |
2024-07-16 | Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation | Olga Zatsarynna et.al. | 2407.11954 | link |
2024-07-18 | IPA-NeRF: Illusory Poisoning Attack Against Neural Radiance Fields | Wenxiang Jiang et.al. | 2407.11921 | link |
2024-07-16 | MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation | Xiaoshuai Hao et.al. | 2407.11682 | null |
2024-07-16 | Perception Helps Planning: Facilitating Multi-Stage Lane-Level Integration via Double-Edge Structures | Guoliang You et.al. | 2407.11644 | null |
2024-07-17 | Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts | Jianhao Li et.al. | 2407.11382 | null |
2024-07-16 | I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM | Gwangtak Bae et.al. | 2407.11347 | null |
2024-07-16 | Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering | Jingqian Wu et.al. | 2407.11343 | null |
2024-07-16 | Continuity Preserving Online CenterLine Graph Learning | Yunhui Han et.al. | 2407.11337 | link |
2024-07-25 | Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method | Adam Korycki et.al. | 2407.11238 | null |
2024-07-02 | dAJC: A 2.02mW 50Mbps Direct Analog to MJPEG Converter for Video Sensor Node using Low-Noise Switched Capacitor MAC-Quantizer with Auto-Calibration and Sparsity-Aware ADC | Gourab Barik et.al. | 2407.11023 | null |
2024-09-04 | A unified theory and statistical learning approach for traffic conflict detection | Yiru Jiao et.al. | 2407.10959 | link |
2024-07-20 | RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Chunliang Li et.al. | 2407.10876 | link |
2024-07-15 | AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems | Alexey Kotcov et.al. | 2407.10865 | null |
2024-07-15 | DINO Pre-training for Vision-based End-to-end Autonomous Driving | Shubham Juneja et.al. | 2407.10803 | null |
2024-07-15 | Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis | Antoine Legrand et.al. | 2407.10762 | null |
2024-07-15 | Interactive Rendering of Relightable and Animatable Gaussian Avatars | Youyi Zhan et.al. | 2407.10707 | null |
2024-07-15 | IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild | Shuaixian Wang et.al. | 2407.10695 | null |
2024-07-20 | Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models | Yuchen Yang et.al. | 2407.10299 | link |
2024-07-14 | RS-NeRF: Neural Radiance Fields from Rolling Shutter Images | Muyao Niu et.al. | 2407.10267 | link |
2024-08-25 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation | Li Li et.al. | 2407.10159 | link |
2024-07-14 | FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Zheng Jiang et.al. | 2407.10135 | link |
2024-07-14 | SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion | Jiyuan Zhang et.al. | 2407.10062 | null |
2024-07-13 | IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception | Shaohong Wang et.al. | 2407.09857 | link |
2024-07-12 | Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion | Shiqi Tan et.al. | 2407.09697 | null |
2024-06-25 | Optimization of Autonomous Driving Image Detection Based on RFAConv and Triplet Attention | Zhipeng Ling et.al. | 2407.09530 | null |
2024-07-12 | Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting | Jinning Li et.al. | 2407.09475 | null |
2024-07-12 | TRAVERSE: Traffic-Responsive Autonomous Vehicle Experience & Rare-event Simulation for Enhanced safety | Sandeep Thalapanane et.al. | 2407.09466 | null |
2024-07-12 | Radiance Fields from Photons | Sacha Jungerman et.al. | 2407.09386 | null |
2024-07-12 | GNN with Model-based RL for Multi-agent Systems | Hanxiao Chen et.al. | 2407.09249 | null |
2024-07-12 | Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network | Shun Kotoku et.al. | 2407.09124 | null |
2024-08-03 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-11 | Feasibility of Neural Radiance Fields for Crime Scene Video Reconstruction | Shariq Nadeem Malik et.al. | 2407.08795 | null |
2024-07-11 | MapLocNet: Coarse-to-Fine Feature Registration for Visual Re-Localization in Navigation Maps | Hang Wu et.al. | 2407.08561 | null |
2024-07-11 | BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight | Hang Wu et.al. | 2407.08526 | null |
2024-07-11 | Joint Optimization of Age of Information and Energy Consumption in NR-V2X System based on Deep Reinforcement Learning | Shulin Song et.al. | 2407.08458 | link |
2024-07-11 | MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos | Yushuo Chen et.al. | 2407.08414 | link |
2024-07-11 | CLEO: Continual Learning of Evolving Ontologies | Shishir Muralidhara et.al. | 2407.08411 | null |
2024-07-18 | Application of Data-Driven Model Predictive Control for Autonomous Vehicle Steering | Jiarui Zhang et.al. | 2407.08401 | null |
2024-07-11 | Accurate Cooperative Localization Utilizing LiDAR-equipped Roadside Infrastructure for Autonomous Driving | Yuze Jiang et.al. | 2407.08384 | null |
2024-07-11 | WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving | Jannik Zürn et.al. | 2407.08280 | link |
2024-07-18 | Explicit-NeRF-QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-11 | Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields | Haojie Lian et.al. | 2407.08154 | null |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2024-07-10 | NDST: Neural Driving Style Transfer for Human-Like Vision-Based Autonomous Driving | Donghyun Kim et.al. | 2407.08073 | null |
2024-07-10 | Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors | Lei Cheng et.al. | 2407.08049 | null |
2024-07-10 | Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation | Jaeyeul Kim et.al. | 2407.07995 | link |
2024-07-10 | RoBus: A Multimodal Dataset for Controllable Road Networks and Building Layouts Generation | Tao Li et.al. | 2407.07835 | link |
2024-07-10 | Neural Geometry Processing via Spherical Neural Surfaces | Romy Williamson et.al. | 2407.07755 | null |
2024-07-10 | LSM: A Comprehensive Metric for Assessing the Safety of Lane Detection Systems in Autonomous Driving | Jörg Gamerdinger et.al. | 2407.07740 | null |
2024-07-10 | Protecting NeRFs’ Copyright via Plug-And-Play Watermarking Base Model | Qi Song et.al. | 2407.07735 | null |
2024-07-10 | Towards Human-Like Driving: Active Inference in Autonomous Vehicle Control | Elahe Delavari et.al. | 2407.07684 | null |
2024-07-18 | Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction | Yili Liu et.al. | 2407.07587 | null |
2024-07-10 | Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles | Dongfang Guo et.al. | 2407.07510 | null |
2024-07-17 | Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining | Tianfang Sun et.al. | 2407.07465 | null |
2024-07-10 | Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field | Ganlin Yang et.al. | 2407.07461 | null |
2024-07-16 | Event-Aided Time-to-Collision Estimation for Autonomous Driving | Jinghang Li et.al. | 2407.07324 | null |
2024-07-09 | Exploring Camera Encoder Designs for Autonomous Driving Perception | Barath Lakshmanan et.al. | 2407.07276 | null |
2024-07-09 | Reference-based Controllable Scene Stylization with Gaussian Splatting | Yiqun Mei et.al. | 2407.07220 | null |
2024-07-09 | Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction | Haicheng Liao et.al. | 2407.07020 | null |
2024-07-09 | Explainable AI for Enhancing Efficiency of DL-based Channel Estimation | Abdul Karim Gizzini et.al. | 2407.07009 | null |
2024-07-09 | Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View | Dogyoon Lee et.al. | 2407.06613 | null |
2024-07-19 | Exploring the Causality of End-to-End Autonomous Driving | Jiankun Li et.al. | 2407.06546 | link |
2024-07-10 | VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Yibo Liu et.al. | 2407.06516 | null |
2024-07-17 | Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation | Detian Chu et.al. | 2407.06317 | null |
2024-07-10 | 4D Contrastive Superflows are Dense 3D Representation Learners | Xiang Xu et.al. | 2407.06190 | link |
2024-07-16 | PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jinhua Zhang et.al. | 2407.06109 | link |
2024-07-08 | RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation | Sarah Elmahdy et.al. | 2407.06016 | null |
2024-07-08 | Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding | Aaron Lohner et.al. | 2407.05910 | null |
2024-07-08 | Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition | Yaozong Gan et.al. | 2407.05814 | null |
2024-07-08 | Boosting 3D Object Detection with Semantic-Aware Multi-Branch Framework | Hao Jing et.al. | 2407.05769 | null |
2024-07-18 | BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space | Yumeng Zhang et.al. | 2407.05679 | link |
2024-07-08 | MSTF: Multiscale Transformer for Incomplete Trajectory Prediction | Zhanwen Liu et.al. | 2407.05671 | null |
2024-07-08 | Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views | Jiawei Guo et.al. | 2407.05666 | null |
2024-07-08 | GenFollower: Enhancing Car-Following Prediction with Large Language Models | Xianda Chen et.al. | 2407.05611 | null |
2024-07-08 | GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields | Weiyi Xue et.al. | 2407.05597 | null |
2024-07-31 | Dynamic Neural Radiance Field From Defocused Monocular Video | Xianrui Luo et.al. | 2407.05586 | null |
2024-07-14 | Evolutionary Trigger Detection and Lightweight Model Repair Based Backdoor Defense | Qi Zhou et.al. | 2407.05396 | null |
2024-07-07 | SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning | Yi Feng et.al. | 2407.05283 | link |
2024-07-07 | GaussReg: Fast 3D Registration with Gaussian Splatting | Jiahao Chang et.al. | 2407.05254 | null |
2024-07-07 | Tracking Reflected Objects: A Benchmark | Xiaoyu Guo et.al. | 2407.05235 | null |
2024-07-06 | SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction | Weixing Xie et.al. | 2407.05023 | link |
2024-07-06 | T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy | Fan Duan et.al. | 2407.05008 | link |
2024-07-15 | JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention | Brian Cheong et.al. | 2407.04926 | link |
2024-07-06 | SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions | Zaid A. El-Shair et.al. | 2407.04908 | null |
2024-07-05 | MUSIC-lite: Efficient MUSIC using Approximate Computing: An OFDM Radar Case Study | Rajat Bhattacharjya et.al. | 2407.04849 | null |
2024-07-05 | JaywalkerVR: A VR System for Collecting Safety-Critical Pedestrian-Vehicle Interactions | Kenta Mukoya et.al. | 2407.04843 | null |
2024-07-05 | Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain | Christophe Karam et.al. | 2407.04484 | null |
2024-07-05 | Dance of the ADS: Orchestrating Failures through Historically-Informed Scenario Fuzzing | Tong Wang et.al. | 2407.04359 | null |
2024-07-05 | Towards Stable 3D Object Detection | Jiabao Wang et.al. | 2407.04305 | null |
2024-07-05 | WOMD-Reasoning: A Large-Scale Language Dataset for Interaction and Driving Intentions Reasoning | Yiheng Li et.al. | 2407.04281 | link |
2024-07-05 | Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey | Han Wang et.al. | 2407.04277 | null |
2024-07-04 | Behavioural gap assessment of human-vehicle interaction in real and virtual reality-based scenarios in autonomous driving | Sergio. Martín Serrano et.al. | 2407.04070 | null |
2024-07-12 | Detect Closer Surfaces that can be Seen: New Modeling and Evaluation in Cross-domain 3D Object Detection | Ruixiao Zhang et.al. | 2407.04061 | link |
2024-07-04 | Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation | Laiyan Ding et.al. | 2407.04041 | link |
2024-07-04 | CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images | Junghe Lee et.al. | 2407.03923 | null |
2024-08-22 | StreamLTS: Query-based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection | Yunshuang Yuan et.al. | 2407.03825 | link |
2024-07-04 | A Fast Dynamic Point Detection Method for LiDAR-Inertial Odometry in Driving Scenarios | Zikang Yuan et.al. | 2407.03590 | link |
2024-07-17 | Efficient Fusion and Task Guided Embedding for End-to-end Autonomous Driving | Yipin Guo et.al. | 2407.02878 | null |
2024-07-03 | A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes | Li Fang et.al. | 2407.02830 | link |
2024-07-03 | Solving Motion Planning Tasks with a Scalable Generative Model | Yihan Hu et.al. | 2407.02797 | link |
2024-07-04 | AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction | Mustafa Khan et.al. | 2407.02598 | null |
2024-06-18 | Sample-efficient Imitative Multi-token Decision Transformer for Generalizable Real World Driving | Hang Zhou et.al. | 2407.02508 | null |
2024-07-02 | Research on Reliable and Safe Occupancy Grid Prediction in Underground Parking Lots | JiaQi Luo et.al. | 2407.02197 | null |
2024-07-02 | I2EKF-LO: A Dual-Iteration Extended Kalman Filter Based LiDAR Odometry | Wenlu Yu et.al. | 2407.02190 | link |
2024-07-03 | BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Wenpu Li et.al. | 2407.02174 | link |
2024-07-02 | DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection | Kaixin Xu et.al. | 2407.02098 | null |
2024-07-02 | LiDAR-based HD Map Localization using Semantic Generalized ICP with Road Marking Detection | Yansong Gong et.al. | 2407.02061 | null |
2024-07-02 | FlowTrack: Point-level Flow Network for 3D Single Object Tracking | Shuo Li et.al. | 2407.01959 | null |
2024-07-02 | Cloud-Edge-Terminal Collaborative AIGC for Autonomous Driving | Jianan Zhang et.al. | 2407.01956 | null |
2024-07-01 | fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence | Francis Williams et.al. | 2407.01781 | null |
2024-07-01 | Predicting Trust Dynamics with Dynamic SEM in Human-AI Cooperation | Sota Kaneko et.al. | 2407.01752 | null |
2024-07-01 | The Continuous Tensor Abstraction: Where Indices are Real | Jaeyeon Won et.al. | 2407.01742 | null |
2024-07-01 | SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving | Qingwen Zhang et.al. | 2407.01702 | link |
2024-07-01 | Deep Reinforcement Learning for Adverse Garage Scenario Generation | Kai Li et.al. | 2407.01333 | null |
2024-07-01 | RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields | Haochen Jiang et.al. | 2407.01303 | link |
2024-07-01 | Let Hybrid A* Path Planner Obey Traffic Rules: A Deep Reinforcement Learning-Based Planning Framework | Xibo Li et.al. | 2407.01216 | null |
2024-07-01 | FedRC: A Rapid-Converged Hierarchical Federated Learning Framework in Street Scene Semantic Understanding | Wei-Bin Kou et.al. | 2407.01103 | null |
2024-07-01 | HGNET: A Hierarchical Feature Guided Network for Occupancy Flow Field Prediction | Zhan Chen et.al. | 2407.01097 | null |
2024-07-01 | Data on the Move: Traffic-Oriented Data Trading Platform Powered by AI Agent with Common Sense | Yi Yu et.al. | 2407.00995 | null |
2024-07-01 | Acceleration method for generating perception failure scenarios based on editing Markov process | Canjie Cai et.al. | 2407.00980 | null |
2024-07-01 | FALCON: Frequency Adjoint Link with CONtinuous Density Mask for Fast Single Image Dehazing | Donghyun Kim et.al. | 2407.00972 | null |
2024-07-01 | Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving | Ran Tian et.al. | 2407.00959 | null |
2024-07-01 | Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks: Design Optimization and Analysis | Xueyao Zhang et.al. | 2407.00933 | null |
2024-07-07 | CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation | Huawei Sun et.al. | 2407.00697 | link |
2024-06-29 | A Rule-Based Behaviour Planner for Autonomous Driving | Bouchard Frederic et.al. | 2407.00460 | null |
2024-06-28 | Automated Deep Neural Network Inference Partitioning for Distributed Embedded Systems | Fabian Kreß et.al. | 2406.19913 | null |
2024-06-28 | StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction | Jiaheng Zhuang et.al. | 2406.19844 | null |
2024-06-28 | LCSim: A Large-Scale Controllable Traffic Simulator | Yuheng Zhang et.al. | 2406.19781 | link |
2024-06-28 | Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey | Uchitha Rajapaksha et.al. | 2406.19675 | null |
2024-06-27 | BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection | Yang Song et.al. | 2406.19048 | null |
2024-06-27 | Shorter SPECT Scans Using Self-supervised Coordinate Learning to Synthesize Skipped Projection Views | Zongyu Li et.al. | 2406.18840 | null |
2024-06-05 | Dream-in-Style: Text-to-3D Generation using Stylized Score Distillation | Hubert Kompanowski et.al. | 2406.18581 | null |
2024-06-27 | XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis | Hao Li et.al. | 2406.18360 | null |
2024-07-29 | Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning | Muhammad Salman Ali et.al. | 2406.18214 | link |
2024-06-25 | End-to-End Autonomous Driving without Costly Modularization and 3D Manual Annotation | Mingzhe Guo et.al. | 2406.17680 | null |
2024-06-25 | MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection | Michelle Adeline et.al. | 2406.17654 | link |
2024-06-25 | Querying Labeled Time Series Data with Scenario Programs | Devan Shanker et.al. | 2406.17627 | null |
2024-06-25 | NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods | Jonas Kulhanek et.al. | 2406.17345 | null |
2024-06-25 | Image-Guided Outdoor LiDAR Perception Quality Assessment for Autonomous Driving | Ce Zhang et.al. | 2406.17265 | null |
2024-06-24 | GPT-4V Explorations: Mining Autonomous Driving | Zixuan Li et.al. | 2406.16817 | null |
2024-08-11 | ShanghaiTech Mapping Robot is All You Need: Robot System for Collecting Universal Ground Vehicle Datasets | Bowen Xu et.al. | 2406.16713 | null |
2024-06-24 | Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction | Tong Qin et.al. | 2406.16289 | null |
2024-06-24 | SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments | Neng Wang et.al. | 2406.16279 | link |
2024-06-23 | DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation | Yueru Luo et.al. | 2406.16072 | link |
2024-06-23 | Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study | Zhe Wang et.al. | 2406.16068 | null |
2024-06-23 | LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control | Delin Qu et.al. | 2406.16038 | null |
2024-06-22 | ISS-Scenario: Scenario-based Testing in CARLA | Renjue Li et.al. | 2406.15777 | link |
2024-06-22 | psPRF:Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery | Tongtong Zhang et.al. | 2406.15707 | null |
2024-05-24 | Automated Parking Planning with Vision-Based BEV Approach | Yuxuan Zhao et.al. | 2406.15430 | null |
2024-05-24 | Automatic parking planning control method based on improved A* algorithm | Yuxuan Zhao et.al. | 2406.15429 | null |
2024-06-21 | NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking | Daniel Dauner et.al. | 2406.15349 | link |
2024-06-21 | Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection | Lynn Vonderhaar et.al. | 2406.15268 | null |
2024-06-21 | A3D: Does Diffusion Dream about 3D Alignment? | Savva Ignatyev et.al. | 2406.15020 | null |
2024-06-21 | E2GS: Event Enhanced Gaussian Splatting | Hiroyuki Deguchi et.al. | 2406.14978 | link |
2024-06-21 | Relighting Scenes with Object Insertions in Neural Radiance Fields | Xuening Zhu et.al. | 2406.14806 | null |
2024-06-20 | Multi-Task Lane-Free Driving Strategy for Connected and Automated Vehicles: A Multi-Agent Deep Reinforcement Learning Approach | Mehran Berahman et.al. | 2406.14766 | null |
2024-06-20 | Preferential Multi-Objective Bayesian Optimization | Raul Astudillo et.al. | 2406.14699 | null |
2024-06-24 | Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Hao Mark Chen et.al. | 2406.14593 | link |
2024-07-24 | Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Yuan Chen et.al. | 2406.14556 | link |
2024-06-20 | FutureNet-LOF: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding | Mingkun Wang et.al. | 2406.14422 | null |
2024-06-20 | PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions | Sihan Ma et.al. | 2406.14367 | null |
2024-08-01 | Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment | Yunshan Qi et.al. | 2406.14360 | null |
2024-06-20 | Uncertainty and Self-Supervision in Single-View Depth | Javier Rodriguez-Puigvert et.al. | 2406.14226 | null |
2024-06-20 | GTP-UDrive: Unified Game-Theoretic Trajectory Planner and Decision-Maker for Autonomous Driving in Mixed Traffic Environments | Nouhed Naidja et.al. | 2406.14077 | null |
2024-06-20 | Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing | Xinbo Zhao et.al. | 2406.14054 | null |
2024-06-20 | The Use of Multimodal Large Language Models to Detect Objects from Thermal Images: Transportation Applications | Huthaifa I. Ashqar et.al. | 2406.13898 | null |
2024-07-30 | Safe and Non-Conservative Trajectory Planning for Autonomous Driving Handling Unanticipated Behaviors of Traffic Participants | Tommaso Benciolini et.al. | 2406.13396 | link |
2024-06-19 | ECAFormer: Low-light Image Enhancement using Cross Attention | Yudi Ruan et.al. | 2406.13281 | link |
2024-06-19 | Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields | Youngin Park et.al. | 2406.13251 | link |
2024-06-19 | Act Better by Timing: A timing-Aware Reinforcement Learning for Autonomous Driving | Guanzhou Li et.al. | 2406.13223 | null |
2024-06-18 | Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings | Ruijie Tang et.al. | 2406.13048 | null |
2024-06-18 | ABNet: Attention BarrierNet for Safe and Scalable Robot Learning | Wei Xiao et.al. | 2406.13025 | link |
2024-06-18 | Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly | Siddhant Shete et.al. | 2406.12698 | null |
2024-06-18 | Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation | Guoyu Yang et.al. | 2406.12496 | link |
2024-06-19 | Is Your HD Map Constructor Reliable under Sensor Corruptions? | Xiaoshuai Hao et.al. | 2406.12214 | null |
2024-06-18 | Fast Global Localization on Neural Radiance Field | Mangyu Kong et.al. | 2406.12202 | link |
2024-06-17 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | Uncertainty modeling for fine-tuned implicit functions | Anna Susmelj et.al. | 2406.12082 | null |
2024-06-17 | Crossfusor: A Cross-Attention Transformer Enhanced Conditional Diffusion Model for Car-Following Trajectory Prediction | Junwei You et.al. | 2406.11941 | null |
2024-06-17 | LLaNA: Large Language and NeRF Assistant | Andrea Amaduzzi et.al. | 2406.11840 | null |
2024-06-17 | InterNeRF: Scaling Radiance Fields via Parameter Interpolation | Clinton Wang et.al. | 2406.11737 | null |
2024-06-17 | A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving | Yang Lou et.al. | 2406.11707 | null |
2024-06-17 | Communication-Efficient MARL for Platoon Stability and Energy-efficiency Co-optimization in Cooperative Adaptive Cruise Control of CAVs | Min Hua et.al. | 2406.11653 | null |
2024-06-14 | Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization | Wonho Song et.al. | 2406.11599 | null |
2024-07-17 | Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Vaneet Aggarwal et.al. | 2406.11481 | null |
2024-06-17 | Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection | Yecheol Kim et.al. | 2406.11313 | link |
2024-06-17 | Model Adaptation for Time Constrained Embodied Control | Jaehyun Song et.al. | 2406.11128 | null |
2024-06-16 | Learning Relighting and Intrinsic Decomposition in Neural Radiance Fields | Yixiong Yang et.al. | 2406.11077 | null |
2024-06-16 | SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection | Lin Liu et.al. | 2406.10907 | null |
2024-06-16 | TrafficBots V1.5: Traffic Simulation via Conditional VAEs and Transformers with Relative Pose Encoding | Zhejun Zhang et.al. | 2406.10898 | link |
2024-06-16 | An LLM-enhanced Multi-objective Evolutionary Search for Autonomous Driving Test Scenario Generation | Haoxiang Tian et.al. | 2406.10857 | null |
2024-06-15 | GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR | Bharat Singh et.al. | 2406.10722 | null |
2024-06-15 | Planning with Adaptive World Models for Autonomous Driving | Arun Balajee Vasudevan et.al. | 2406.10714 | null |
2024-06-15 | Object Detection using Oriented Window Learning Vi-sion Transformer: Roadway Assets Recognition | Taqwa Alhadidi et.al. | 2406.10712 | null |
2024-07-17 | MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception | M. Mahbubur Rahman et.al. | 2406.10708 | link |
2024-06-15 | fNeRF: High Quality Radiance Fields from Practical Cameras | Yi Hua et.al. | 2406.10633 | null |
2024-06-15 | Semantic Communication for Edge Intelligence Enabled Autonomous Driving System | Yunqi Feng et.al. | 2406.10606 | null |
2024-07-16 | SparseRadNet: Sparse Perception Neural Network on Subsampled Radar Data | Jialong Wu et.al. | 2406.10600 | null |
2024-06-15 | Generating and Evolving Reward Functions for Highway Driving with Large Language Models | Xu Han et.al. | 2406.10540 | null |
2024-06-15 | Federated Neural Radiance Field for Distributed Intelligence | Yintian Zhang et.al. | 2406.10474 | null |
2024-06-14 | Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections | Jiacong Xu et.al. | 2406.10373 | null |
2024-06-14 | CarLLaVA: Vision language models for camera-only closed-loop driving | Katrin Renz et.al. | 2406.10165 | null |
2024-06-14 | MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report | Zhongyu Yang et.al. | 2406.10125 | null |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications | Li Li et.al. | 2406.10068 | link |
2024-06-14 | SemanticSpray++: A Multimodal Dataset for Autonomous Driving in Wet Surface Conditions | Aldi Piroli et.al. | 2406.09945 | null |
2024-06-14 | Globally Optimal GNSS Multi-Antenna Lever Arm Calibration | Thomas Wodtko et.al. | 2406.09866 | null |
2024-06-14 | A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion | Kailai Sun et.al. | 2406.09792 | link |
2024-06-14 | Research on Edge Detection of LiDAR Images Based on Artificial Intelligence Technology | Haowei Yang et.al. | 2406.09773 | null |
2024-07-17 | Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis | Zongyue Qin et.al. | 2406.09606 | null |
2024-06-13 | SimGen: Simulator-conditioned Driving Scene Generation | Yunsong Zhou et.al. | 2406.09386 | null |
2024-06-13 | Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns | Kaavya Rekanar et.al. | 2406.09203 | null |
2024-07-25 | Auto-Vocabulary Segmentation for LiDAR Points | Weijie Wei et.al. | 2406.09126 | link |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-13 | OpenMaterial: A Comprehensive Dataset of Complex Materials for 3D Reconstruction | Zheng Dang et.al. | 2406.08894 | null |
2024-06-26 | CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving | Jonathan Booher et.al. | 2406.08878 | null |
2024-06-13 | Trajectory Planning for Autonomous Driving in Unstructured Scenarios Based on Graph Neural Network and Numerical Optimization | Sumin Zhang et.al. | 2406.08855 | null |
2024-06-13 | BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection | Wenjie Wang et.al. | 2406.08785 | link |
2024-06-12 | Enhancing End-to-End Autonomous Driving with Latent World Model | Yingyan Li et.al. | 2406.08481 | link |
2024-06-12 | PRIBOOT: A New Data-Driven Expert for Improved Driving Simulations | Daniel Coelho et.al. | 2406.08421 | link |
2024-06-12 | LaneCPP: Continuous 3D Lane Detection using Physical Priors | Maximilian Pittner et.al. | 2406.08381 | null |
2024-06-12 | Utilizing Navigation Path to Generate Target Point for Enhanced End-to-End Autonomous Driving Planning | Yuanhua Shen et.al. | 2406.08349 | null |
2024-06-12 | Valeo4Cast: A Modular Approach to End-to-End Forecasting | Yihong Xu et.al. | 2406.08113 | link |
2024-06-12 | OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Yinan Deng et.al. | 2406.08009 | link |
2024-06-12 | Spatial Annealing Smoothing for Efficient Few-shot Neural Rendering | Yuru Xiao et.al. | 2406.07828 | link |
2024-06-11 | PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow | Joshua Tokarsky et.al. | 2406.07667 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431 | null |
2024-06-11 | Instruct Large Language Models to Drive like Humans | Ruijun Zhang et.al. | 2406.07296 | link |
2024-06-11 | EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Yining Shi et.al. | 2406.07042 | link |
2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037 | null |
2024-06-12 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023 | null |
2024-06-11 | Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion | Xin Yuan et.al. | 2406.06972 | null |
2024-06-15 | Neural Visibility Field for Uncertainty-Driven Active Mapping | Shangjie Xue et.al. | 2406.06948 | null |
2024-06-10 | PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation | Zhenyu Li et.al. | 2406.06679 | null |
2024-06-10 | IllumiNeRF: 3D Relighting without Inverse Rendering | Xiaoming Zhao et.al. | 2406.06527 | null |
2024-06-10 | Hybrid Video Anomaly Detection for Anomalous Scenarios in Autonomous Driving | Daniel Bogdoll et.al. | 2406.06423 | null |
2024-06-10 | UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving | Daniel Bogdoll et.al. | 2406.06370 | null |
2024-06-10 | DualAD: Disentangling the Dynamic and Static World for End-to-End Driving | Simon Doll et.al. | 2406.06264 | null |
2024-06-10 | ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models | Meng-Li Shih et.al. | 2406.06133 | null |
2024-06-09 | Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks | Zhiyuan Cheng et.al. | 2406.05857 | link |
2024-06-09 | ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05810 | null |
2024-07-19 | SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05800 | null |
2024-06-09 | Certified Robustness to Data Poisoning in Gradient-Based Training | Philip Sosnin et.al. | 2406.05670 | link |
2024-06-09 | A Superalignment Framework in Autonomous Driving with Large Language Models | Xiangrui Kong et.al. | 2406.05651 | null |
2024-06-13 | GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement | Peiye Zhuang et.al. | 2406.05649 | null |
2024-06-08 | Toward Autonomous Driving by Musculoskeletal Humanoids: A Study of Developed Hardware and Learning-Based Software | Kento Kawaharazuka et.al. | 2406.05573 | null |
2024-06-07 | CityCraft: A Real Crafter for 3D City Generation | Jie Deng et.al. | 2406.04983 | null |
2024-06-07 | Multiplane Prior Guided Few-Shot Aerial Scene Rendering | Zihan Gao et.al. | 2406.04961 | null |
2024-06-07 | Multi-style Neural Radiance Field with AdaIN | Yu-Wen Pao et.al. | 2406.04960 | link |
2024-07-08 | A Survey of Fragile Model Watermarking | Zhenzhe Gao et.al. | 2406.04809 | null |
2024-06-07 | EAIA: An Efficient and Anonymous Identity Authentication Scheme in 5G-V2V | Qianmin Du et.al. | 2406.04705 | null |
2024-06-06 | Step Out and Seek Around: On Warm-Start Training with Incremental Data | Maying Shen et.al. | 2406.04484 | null |
2024-06-06 | Optimizing Autonomous Driving for Safety: A Human-Centric Approach with LLM-Enhanced RLHF | Yuan Sun et.al. | 2406.04481 | null |
2024-06-13 | DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Sergio Casas et.al. | 2406.04426 | null |
2024-06-07 | DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data | Qihao Liu et.al. | 2406.04322 | link |
2024-06-14 | GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions | Salvatore Esposito et.al. | 2406.04254 | null |
2024-06-06 | A Survey on 3D Human Avatar Modeling – From Reconstruction to Generation | Ruihe Wang et.al. | 2406.04253 | null |
2024-06-06 | Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization | Takuhiro Kaneko et.al. | 2406.04155 | null |
2024-06-06 | How Far Can We Compress Instant-NGP-Based NeRF? | Yihang Chen et.al. | 2406.04101 | link |
2024-06-06 | Frequency-based Matcher for Long-tailed Semantic Segmentation | Shan Li et.al. | 2406.03917 | link |
2024-06-11 | Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving | Xiaosong Jia et.al. | 2406.03877 | link |
2024-06-06 | Monocular Localization with Semantics Map for Autonomous Vehicles | Jixiang Wan et.al. | 2406.03835 | null |
2024-06-06 | Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling | Xinhang Liu et.al. | 2406.03723 | null |
2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien Quéméneur et.al. | 2406.03611 | link |
2024-06-05 | AD-H: Autonomous Driving with Hierarchical Agents | Zaibin Zhang et.al. | 2406.03474 | null |
2024-06-11 | Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts | Dominik Scheuble et.al. | 2406.03461 | null |
2024-06-05 | Prompt-based Visual Alignment for Zero-shot Policy Transfer | Haihan Gao et.al. | 2406.03250 | null |
2024-06-05 | Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection | Qutub Syed et.al. | 2406.03188 | null |
2024-06-05 | Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework | Eliraz Orfaig et.al. | 2406.03129 | null |
2024-06-05 | Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors | Han Li et.al. | 2406.03105 | link |
2024-06-05 | Task-Oriented Wireless Communications for Collaborative Perception in Intelligent Unmanned Systems | Sheng Zhou et.al. | 2406.03086 | null |
2024-06-05 | Correlation of Software-in-the-Loop Simulation with Physical Testing for Autonomous Driving | Zhennan Fei et.al. | 2406.03040 | null |
2024-06-05 | DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Yidong Huang et.al. | 2406.03008 | link |
2024-06-05 | Dynamically Expanding Capacity of Autonomous Driving with Near-Miss Focused Training Framework | Ziyuan Yang et.al. | 2406.02865 | null |
2024-06-13 | 3D-HGS: 3D Half-Gaussian Splatting | Haolin Li et.al. | 2406.02720 | link |
2024-06-01 | Data Quality in Edge Machine Learning: A State-of-the-Art Survey | Mohammed Djameleddine Belgoumri et.al. | 2406.02600 | null |
2024-06-04 | Out-of-Distribution Runtime Adaptation with Conformalized Neural Network Ensembles | Polo Contreras et.al. | 2406.02436 | null |
2024-07-19 | Decoupling of neural network calibration measures | Dominik Werner Wolf et.al. | 2406.02411 | null |
2024-06-04 | Radar Spectra-Language Model for Automotive Scene Parsing | Mariia Pushkareva et.al. | 2406.02158 | null |
2024-06-04 | UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking | Lijun Zhou et.al. | 2406.02147 | null |
2024-06-05 | Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models | Samuel M. Bateman et.al. | 2406.01961 | null |
2024-06-04 | ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Chen Mao et.al. | 2406.01906 | link |
2024-06-04 | PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning | Yupeng Zheng et.al. | 2406.01587 | null |
2024-06-03 | Learning from Mistakes: a Weakly-supervised Method for Mitigating the Distribution Shift in Autonomous Vehicle Planning | Fazel Arasteh et.al. | 2406.01544 | null |
2024-06-16 | Sensitivity-Informed Augmentation for Robust Segmentation | Laura Zheng et.al. | 2406.01425 | null |
2024-06-03 | Extending Structural Causal Models for Use in Autonomous Embodied Systems | Rhys Howard et.al. | 2406.01384 | link |
2024-06-03 | Convolutional Unscented Kalman Filter for Multi-Object Tracking with Outliers | Shiqi Liu et.al. | 2406.01380 | null |
2024-06-06 | Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation | Enhui Ma et.al. | 2406.01349 | null |
2024-06-03 | REvolve: Reward Evolution with Large Language Models for Autonomous Driving | Rishi Hazra et.al. | 2406.01309 | null |
2024-07-11 | Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting | Fang Li et.al. | 2406.01042 | link |
2024-07-16 | LanEvil: Benchmarking the Robustness of Lane Detection to Environmental Illusions | Tianyuan Zhang et.al. | 2406.00934 | null |
2024-06-02 | PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency | Yeonsung Jung et.al. | 2406.00798 | null |
2024-06-02 | A Survey of Deep Learning Based Radar and Vision Fusion for 3D Object Detection in Autonomous Driving | Di Wu et.al. | 2406.00714 | null |
2024-06-02 | Representing Animatable Avatar via Factorized Neural Fields | Chunjin Song et.al. | 2406.00637 | null |
2024-06-02 | Efficient Neural Light Fields (ENeLF) for Mobile Devices | Austin Peng et.al. | 2406.00598 | null |
2024-06-01 | 2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation | Biao Wu et.al. | 2406.00500 | null |
2024-06-04 | Research on the Application of Computer Vision Based on Deep Learning in Autonomous Driving Technology | Jingyu Zhang et.al. | 2406.00490 | null |
2024-06-01 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-06-01 | Over-the-Air Collaborative Inference with Feature Differential Privacy | Mohamed Seif et.al. | 2406.00256 | null |
2024-05-31 | Fairness in Autonomous Driving: Towards Understanding Confounding Factors in Object Detection under Challenging Weather | Bimsara Pathiraja et.al. | 2406.00219 | null |
2024-05-31 | Navigating Autonomous Vehicle on Unmarked Roads with Diffusion-Based Motion Prediction and Active Inference | Yufei Huang et.al. | 2406.00211 | null |
2024-05-31 | Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models | Yi Yang et.al. | 2405.20991 | link |
2024-05-31 | Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks | Linlin Yu et.al. | 2405.20986 | null |
2024-05-31 | Robust Stable Spiking Neural Networks | Jianhao Ding et.al. | 2405.20694 | link |
2024-07-05 | HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios | Mingyang Jiang et.al. | 2405.20579 | link |
2024-05-30 | Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning | Davide Corsi et.al. | 2405.20534 | link |
2024-05-30 | OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | Lening Wang et.al. | 2405.20337 | link |
2024-05-30 | $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323 | link |
2024-05-31 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-06-10 | IReNe: Instant Recoloring of Neural Radiance Fields | Alessio Mazzucchelli et.al. | 2405.19876 | null |
2024-06-30 | Intrinsic Dynamics-Driven Generalizable Scene Representations for Vision-Oriented Decision-Making Applications | Dayang Liang et.al. | 2405.19736 | link |
2024-05-31 | Autonomous Driving with Spiking Neural Networks | Rui-Jie Zhu et.al. | 2405.19687 | link |
2024-05-30 | View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields | Haodi He et.al. | 2405.19678 | link |
2024-05-31 | SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation | Wenchao Sun et.al. | 2405.19620 | link |
2024-05-29 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models | Tianrun Chen et.al. | 2405.19326 | null |
2024-05-29 | Real-Time Environment Condition Classification for Autonomous Vehicles | Marco Introvigne et.al. | 2405.19305 | link |
2024-05-29 | Conditional Latent ODEs for Motion Prediction in Autonomous Driving | Khang Truong Giang et.al. | 2405.19183 | link |
2024-05-29 | Quantum Optimal Control of Squeezing in Cavity Optomechanics | Anton Halaski et.al. | 2405.19070 | null |
2024-05-29 | A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation | Niclas Vödisch et.al. | 2405.19035 | link |
2024-05-29 | Optimizing Vehicular Networks with Variational Quantum Circuits-based Reinforcement Learning | Zijiang Yan et.al. | 2405.18984 | null |
2024-05-29 | Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy | Zijie Jiang et.al. | 2405.18863 | null |
2024-05-29 | SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving | Yiming Cui et.al. | 2405.18857 | null |
2024-05-29 | LetsMap: Unsupervised Representation Learning for Semantic BEV Mapping | Nikhil Gosala et.al. | 2405.18852 | null |
2024-05-29 | PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram | Sifan Zhou et.al. | 2405.18734 | null |
2024-06-02 | NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild | Weining Ren et.al. | 2405.18715 | link |
2024-05-30 | 3D StreetUnveiler with Semantic-Aware 2DGS | Jingwei Xu et.al. | 2405.18416 | null |
2024-05-28 | Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | Yifan Bai et.al. | 2405.18361 | null |
2024-05-28 | Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving | Zhi Zheng et.al. | 2405.18209 | link |
2024-05-28 | MULi-Ev: Maintaining Unperturbed LiDAR-Event Calibration | Mathieu Cocheteux et.al. | 2405.18021 | null |
2024-05-28 | Self-supervised Pre-training for Transferable Multi-modal Perception | Xiaohao Xu et.al. | 2405.17942 | link |
2024-05-28 | A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction | Bin Zhang et.al. | 2405.17891 | null |
2024-05-29 | HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction | Haoyu Zhao et.al. | 2405.17872 | link |
2024-05-28 | Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh | Xiangjun Gao et.al. | 2405.17811 | null |
2024-05-28 | Online Analytic Exemplar-Free Continual Learning with Large Models for Imbalanced Autonomous Driving Task | Huiping Zhuang et.al. | 2405.17779 | link |
2024-05-27 | GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang et.al. | 2405.17429 | link |
2024-05-27 | Benchmarking and Improving Bird’s Eye View Perception Robustness in Autonomous Driving | Shaoyuan Xie et.al. | 2405.17426 | link |
2024-05-27 | Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection | Shuai Zeng et.al. | 2405.17422 | link |
2024-05-27 | MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities | Hao Dong et.al. | 2405.17419 | link |
2024-06-06 | Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability | Shenyuan Gao et.al. | 2405.17398 | link |
2024-05-27 | BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction | Zikang Zhou et.al. | 2405.17372 | null |
2024-05-27 | Towards Accurate Ego-lane Identification with Early Time Series Classification | Yuchuan Jin et.al. | 2405.17270 | null |
2024-05-27 | DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge | Yifan Mao et.al. | 2405.17102 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-27 | A Two-Level Stochastic Model for the Lateral Movement of Vehicles Within Their Lane Under Homogeneous Traffic Conditions | Nicole Neis et.al. | 2405.17080 | null |
2024-05-27 | SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving | Avinash Nittur Ramesh et.al. | 2405.17030 | null |
2024-06-24 | Bounding Random Test Set Size with Computational Learning Theory | Neil Walkinshaw et.al. | 2405.17019 | null |
2024-05-27 | Collective Perception Datasets for Autonomous Driving: A Comprehensive Review | Sven Teufel et.al. | 2405.16973 | null |
2024-05-27 | Rigorous Simulation-based Testing for Autonomous Driving Systems – Targeting the Achilles’ Heel of Four Open Autopilots | Changwen Li et.al. | 2405.16914 | link |
2024-05-27 | A re-calibration method for object detection with multi-modal alignment bias in autonomous driving | Zhihang Song et.al. | 2405.16848 | null |
2024-05-29 | PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting | Zipeng Wang et.al. | 2405.16829 | null |
2024-05-25 | Lane Detection using Graph Search and Geometric Constraints for Formula Student Driverless | Ivo Ivanov et.al. | 2405.16369 | link |
2024-05-25 | Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation | Huizhou Chen et.al. | 2405.16099 | null |
2024-05-25 | Uncertainty Measurement of Deep Learning System based on the Convex Hull of Training Sets | Hyekyoung Hwang et.al. | 2405.16082 | null |
2024-05-25 | Risk Scenario Generation for Autonomous Driving Systems based on Causal Bayesian Networks | Jiangnan Zhao et.al. | 2405.16063 | null |
2024-05-25 | DiffuBox: Refining 3D Object Detection with Point Diffusion | Xiangyu Chen et.al. | 2405.16034 | link |
2024-05-24 | SMART: Scalable Multi-agent Real-time Simulation via Next-token Prediction | Wei Wu et.al. | 2405.15677 | link |
2024-05-24 | Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving | Jianbiao Mei et.al. | 2405.15324 | link |
2024-05-24 | 3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving | Boyi Sun et.al. | 2405.15286 | link |
2024-05-24 | Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding | Yuhang Liu et.al. | 2405.15274 | null |
2024-05-24 | Neural Elevation Models for Terrain Mapping and Path Planning | Adam Dai et.al. | 2405.15227 | link |
2024-05-24 | Label-efficient Semantic Scene Completion with Scribble Annotations | Song Wang et.al. | 2405.15170 | link |
2024-05-23 | NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections | Dor Verbin et.al. | 2405.14871 | null |
2024-05-30 | An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models | Jiahao Sun et.al. | 2405.14870 | link |
2024-05-23 | Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling | Liwen Wu et.al. | 2405.14847 | null |
2024-05-23 | Camera Relocalization in Shadow-free Neural Radiance Fields | Shiyao Xu et.al. | 2405.14824 | link |
2024-05-23 | TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving Scenes | Yanping Fu et.al. | 2405.14747 | null |
2024-05-23 | SE3D: A Framework For Saliency Method Evaluation In 3D Imaging | Mariusz Wiśniewski et.al. | 2405.14584 | link |
2024-05-23 | MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes | Ruiyuan Gao et.al. | 2405.14475 | null |
2024-06-08 | JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression | Zihan Zheng et.al. | 2405.14452 | null |
2024-05-24 | RoGS: Large Scale Road Surface Reconstruction based on 2D Gaussian Splatting | Zhiheng Feng et.al. | 2405.14342 | link |
2024-05-23 | NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation | Chaokang Jiang et.al. | 2405.14241 | link |
2024-05-23 | Eidos: Efficient, Imperceptible Adversarial 3D Point Clouds | Hanwei Zhang et.al. | 2405.14210 | null |
2024-05-31 | Awesome Multi-modal Object Tracking | Chunhui Zhang et.al. | 2405.14200 | link |
2024-05-23 | Towards Transferable Attacks Against Vision-LLMs in Autonomous Driving with Typography | Nhat Chung et.al. | 2405.14169 | null |
2024-05-22 | ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles | Jiawei Zhang et.al. | 2405.14062 | link |
2024-06-13 | RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar | Fangqiang Ding et.al. | 2405.14014 | link |
2024-05-22 | TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System | Diogo Lavado et.al. | 2405.13989 | null |
2024-05-22 | Traffic Scenario Logic: A Spatial-Temporal Logic for Modeling and Reasoning of Urban Traffic Scenarios | Ruolin Wang et.al. | 2405.13715 | link |
2024-05-22 | Safe and Personalizable Logical Guidance for Trajectory Planning of Autonomous Driving | Yuejiao Xu et.al. | 2405.13704 | null |
2024-05-22 | HighwayLLM: Decision-Making and Navigation in Highway Driving with RL-Informed Language Model | Mustafa Yildirim et.al. | 2405.13547 | null |
2024-05-22 | Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training | Zhiyuan Wang et.al. | 2405.13445 | null |
2024-05-22 | Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention | Sang-Hyun Lee et.al. | 2405.13345 | null |
2024-05-12 | Large Language Models for Education: A Survey | Hanyi Xu et.al. | 2405.13001 | null |
2024-05-21 | Transparency Distortion Robustness for SOTA Image Segmentation Tasks | Volker Knauthe et.al. | 2405.12864 | null |
2024-06-11 | Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations | Antoine Legrand et.al. | 2405.12728 | null |
2024-05-21 | CLRKDNet: Speeding up Lane Detection with Knowledge Distillation | Weiqing Qi et.al. | 2405.12503 | link |
2024-05-21 | Mutual Information Analysis in Multimodal Learning Systems | Hadi Hadizadeh et.al. | 2405.12456 | null |
2024-06-18 | Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents | Guanlin Wu et.al. | 2405.12155 | null |
2024-06-08 | EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving | Boyi Liu et.al. | 2405.12120 | null |
2024-05-20 | Safe by Design Autonomous Driving Systems | Marius Bozga et.al. | 2405.11995 | null |
2024-05-20 | Tutorial on Silicon Photonics Integrated Platform Fiber Edge Coupling | Sergey S. Avdeev et.al. | 2405.11980 | null |
2024-05-20 | A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation | Sushmita Sarker et.al. | 2405.11903 | null |
2024-05-20 | Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction | Aryan Garg et.al. | 2405.11823 | null |
2024-05-19 | FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention | Ziang Guo et.al. | 2405.11682 | link |
2024-05-19 | Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems | Shengxiang Sun et.al. | 2405.11629 | null |
2024-05-19 | R-NeRF: Neural Radiance Fields for Modeling RIS-enabled Wireless Environments | Huiying Yang et.al. | 2405.11541 | link |
2024-05-18 | Generalized Multi-Objective Reinforcement Learning with Envelope Updates in URLLC-enabled Vehicular Networks | Zijiang Yan et.al. | 2405.11331 | null |
2024-05-18 | RuleFuser: Injecting Rules in Evidential Networks for Robust Out-of-Distribution Trajectory Prediction | Jay Patrikar et.al. | 2405.11139 | null |
2024-06-01 | Enhanced 3D Urban Scene Reconstruction and Point Cloud Densification using Gaussian Splatting and Google Earth Imagery | Kyle Gao et.al. | 2405.11021 | null |
2024-05-17 | GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision | Xin Tan et.al. | 2405.10591 | null |
2024-05-17 | Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track | Xiaoshuai Hao et.al. | 2405.10567 | null |
2024-05-28 | NeRO: Neural Road Surface Reconstruction | Ruibo Wang et.al. | 2405.10554 | link |
2024-05-07 | Detecting 5G Signal Jammers Using Spectrograms with Supervised and Unsupervised Learning | Matteo Varotto et.al. | 2405.10331 | null |
2024-05-16 | When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models | Xianzheng Ma et.al. | 2405.10255 | link |
2024-05-16 | Towards Consistent and Explainable Motion Prediction using Heterogeneous Graph Attention | Tobias Demmler et.al. | 2405.10134 | null |
2024-05-16 | Cooperative Visual-LiDAR Extrinsic Calibration Technology for Intersection Vehicle-Infrastructure: A review | Xinyu Zhang et.al. | 2405.10132 | null |
2024-05-16 | Infrared Adversarial Car Stickers | Xiaopei Zhu et.al. | 2405.09924 | null |
2024-05-19 | PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features | Xusheng Li et.al. | 2405.09828 | null |
2024-05-16 | Collision Avoidance Metric for 3D Camera Evaluation | Vage Taamazyan et.al. | 2405.09755 | link |
2024-06-10 | From NeRFs to Gaussian Splats, and Back | Siming He et.al. | 2405.09717 | link |
2024-05-22 | Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation | Yachan Guo et.al. | 2405.09682 | null |
2024-05-15 | CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving | Dechen Gao et.al. | 2405.09111 | link |
2024-05-20 | Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous Driving | Ross Greer et.al. | 2405.09049 | null |
2024-05-14 | The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks | Ziquan Liu et.al. | 2405.08886 | link |
2024-05-30 | The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition | Lingdong Kong et.al. | 2405.08816 | null |
2024-05-14 | Ambiguous Annotations: When is a Pedestrian not a Pedestrian? | Luisa Schwirten et.al. | 2405.08794 | null |
2024-05-14 | Dynamic NeRF: A Review | Jinwei Lin et.al. | 2405.08609 | null |
2024-05-14 | Work-in-Progress: Crash Course: Can (Under Attack) Autonomous Driving Beat Human Drivers? | Francesco Marchiori et.al. | 2405.08466 | null |
2024-05-13 | Equivariant Deep Learning of Mixed-Integer Optimal Control Solutions for Vehicle Decision Making and Motion Planning | Rudolf Reiter et.al. | 2405.08122 | null |
2024-06-05 | AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving | Daniel Bogdoll et.al. | 2405.07865 | link |
2024-06-05 | Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs | Mingyu Kim et.al. | 2405.07857 | link |
2024-05-13 | oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving | Abdul Hannan Khan et.al. | 2405.07698 | null |
2024-05-13 | MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving | Yiqun Duan et.al. | 2405.07573 | null |
2024-05-12 | Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising | Yao Liu et.al. | 2405.07164 | null |
2024-05-11 | Multi-agent Traffic Prediction via Denoised Endpoint Distribution | Yao Liu et.al. | 2405.07041 | null |
2024-05-11 | TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization | Zhen Tan et.al. | 2405.07027 | link |
2024-05-11 | Direct Learning of Mesh and Appearance via 3D Gaussian Splatting | Ancheng Lin et.al. | 2405.06945 | null |
2024-05-20 | Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models | Lun Ai et.al. | 2405.06724 | link |
2024-05-10 | Multi-Object Tracking in the Dark | Xinzhe Wang et.al. | 2405.06600 | link |
2024-05-10 | OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | Jinwei Lin et.al. | 2405.06547 | link |
2024-05-10 | Autonomous Driving with a Deep Dual-Model Solution for Steering and Braking Control | Ana Petra Jukić et.al. | 2405.06473 | null |
2024-05-10 | Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection | Yunqian Fan et.al. | 2405.06264 | null |
2024-05-10 | Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering | Xiaohan Zhang et.al. | 2405.06214 | null |
2024-05-10 | Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation | Bardienus P. Duisterhof et.al. | 2405.06181 | null |
2024-05-09 | Probing Multimodal LLMs as World Models for Driving | Shiva Sreeram et.al. | 2405.05956 | link |
2024-05-09 | Co-driver: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes | Ziang Guo et.al. | 2405.05885 | link |
2024-05-10 | NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior | Gihoon Kim et.al. | 2405.05749 | null |
2024-05-27 | NGM-SLAM: Gaussian Splatting SLAM with Radiance Field Submap | Mingrui Li et.al. | 2405.05702 | null |
2024-06-04 | Towards Robust Physical-world Backdoor Attacks on Lane Detection | Xinwei Zhang et.al. | 2405.05553 | null |
2024-05-09 | Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview | Yuhang Ming et.al. | 2405.05526 | null |
2024-05-07 | Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared Weights | Soyed Tuhin Ahmed et.al. | 2405.05286 | null |
2024-05-08 | Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving | Lingdong Kong et.al. | 2405.05258 | link |
2024-05-18 | A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective | Huaiyuan Xu et.al. | 2405.05173 | link |
2024-05-08 | DenserRadar: A 4D millimeter-wave radar point cloud detector based on dense LiDAR point clouds | Zeyu Han et.al. | 2405.05131 | null |
2024-05-08 | Novel Actor-Critic Algorithm for Robust Decision Making of CAV under Delays and Loss of V2X Data | Zine el abidine Kherroubi et.al. | 2405.05072 | null |
2024-05-08 | Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models | Zhengxing Lan et.al. | 2405.04909 | null |
2024-05-07 | Tactile-Augmented Radiance Fields | Yiming Dou et.al. | 2405.04534 | link |
2024-05-07 | TorchDriveEnv: A Reinforcement Learning Benchmark for Autonomous Driving with Reactive, Realistic, and Diverse Non-Playable Characters | Jonathan Wilder Lavington et.al. | 2405.04491 | null |
2024-05-08 | DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid | Sidun Liu et.al. | 2405.04416 | null |
2024-05-07 | DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving | Chen Min et.al. | 2405.04390 | null |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-07 | Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map | Yuxuan Xia et.al. | 2405.04290 | null |
2024-06-17 | pFedLVM: A Large Vision Model (LVM)-Driven and Latent Feature-Based Personalized Federated Learning Framework in Autonomous Driving | Wei-Bin Kou et.al. | 2405.04146 | null |
2024-05-07 | ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios | Dingrui Wang et.al. | 2405.04100 | null |
2024-05-07 | Feature Map Convergence Evaluation for Functional Module | Ludan Zhang et.al. | 2405.04041 | null |
2024-05-07 | Deep Event-based Object Detection in Autonomous Driving: A Survey | Bingquan Zhou et.al. | 2405.03995 | null |
2024-05-07 | Unified End-to-End V2X Cooperative Autonomous Driving | Zhiwei Li et.al. | 2405.03971 | null |
2024-05-07 | Role of Sensing and Computer Vision in 6G Wireless Communications | Seungnyun Kim et.al. | 2405.03945 | link |
2024-05-07 | Roadside Units Assisted Localized Automated Vehicle Maneuvering: An Offline Reinforcement Learning Approach | Kui Wang et.al. | 2405.03935 | null |
2024-05-06 | BadFusion: 2D-Oriented Backdoor Attacks against 3D Object Detection | Saket S. Chaturvedi et.al. | 2405.03884 | null |
2024-05-06 | SocialFormer: Social Interaction Modeling with Edge-enhanced Heterogeneous Graph Transformers for Trajectory Prediction | Zixu Wang et.al. | 2405.03809 | null |
2024-05-06 | UniGen: Unified Modeling of Initial Agent States and Trajectories for Generating Autonomous Driving Scenarios | Reza Mahjourian et.al. | 2405.03807 | null |
2024-06-10 | A Construct-Optimize Approach to Sparse View Synthesis without Camera Pose | Kaiwen Jiang et.al. | 2405.03659 | null |
2024-05-06 | RoboCar: A Rapidly Deployable Open-Source Platform for Autonomous Driving Research | Mehdi Testouri et.al. | 2405.03572 | link |
2024-05-06 | Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond | Zheng Zhu et.al. | 2405.03520 | link |
2024-05-05 | SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection | Kassaw Abraham Mulat et.al. | 2405.02906 | null |
2024-05-05 | Blending Distributed NeRFs with Tri-stage Robust Pose Optimization | Baijun Ye et.al. | 2405.02880 | null |
2024-05-04 | TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes | Christopher Maxey et.al. | 2405.02762 | null |
2024-05-04 | Accelerating Autonomy: Insights from Pro Racers in the Era of Autonomous Racing - An Expert Interview Study | Frederik Werner et.al. | 2405.02620 | link |
2024-05-04 | Vision-based 3D occupancy prediction in autonomous driving: a review and outlook | Yanan Zhang et.al. | 2405.02595 | link |
2024-06-10 | Active Neural 3D Reconstruction with Colorized Surface Voxel-based View Selection | Hyunseo Kim et.al. | 2405.02568 | null |
2024-05-03 | Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning | Dhruva Tirumala et.al. | 2405.02425 | null |
2024-05-03 | Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids | Junchen Liu et.al. | 2405.02386 | link |
2024-05-03 | Characterized Diffusion and Spatial-Temporal Interaction Network for Trajectory Prediction in Autonomous Driving | Haicheng Liao et.al. | 2405.02145 | null |
2024-05-27 | WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights | Youngdong Jang et.al. | 2405.02066 | null |
2024-05-03 | Obstacle Avoidance of Autonomous Vehicles: An LPVMPC with Scheduling Trust Region | Maryam Nezami et.al. | 2405.02030 | null |
2024-05-03 | DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model | Peijin Jia et.al. | 2405.02008 | null |
2024-05-03 | M ${^2}$ Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation | Yingshuang Zou et.al. | 2405.02004 | null |
2024-05-02 | Language-Enhanced Latent Representations for Out-of-Distribution Detection in Autonomous Driving | Zhenjiang Mao et.al. | 2405.01691 | null |
2024-05-02 | Multi-Space Alignments Towards Universal LiDAR Segmentation | Youquan Liu et.al. | 2405.01538 | link |
2024-05-02 | OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning | Shihao Wang et.al. | 2405.01533 | link |
2024-04-12 | A Review of Reward Functions for Reinforcement Learning in the context of Autonomous Driving | Ahmed Abouelazm et.al. | 2405.01440 | null |
2024-05-02 | Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy | Hoang-Quan Nguyen et.al. | 2405.01337 | null |
2024-05-02 | NeRF in Robotics: A Survey | Guangming Wang et.al. | 2405.01333 | null |
2024-05-02 | An Advanced Framework for Ultra-Realistic Simulation and Digital Twinning for Autonomous Vehicles | Yuankai He et.al. | 2405.01328 | null |
2024-05-02 | MFTraj: Map-Free, Behavior-Driven Trajectory Prediction for Autonomous Driving | Haicheng Liao et.al. | 2405.01266 | null |
2024-05-02 | A Survey on Semantic Communication Networks: Architecture, Security, and Privacy | Shaolong Guo et.al. | 2405.01221 | null |
2024-05-02 | Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection | Ahmad Khalil et.al. | 2405.01108 | link |
2024-05-02 | Poisoning Attacks on Federated Learning for Autonomous Driving | Sonakshi Garg et.al. | 2405.01073 | null |
2024-05-04 | LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun et.al. | 2405.00900 | null |
2024-05-01 | ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties | Jiahui Li et.al. | 2405.00797 | link |
2024-05-13 | Depth Priors in Removal Neural Radiance Fields | Zhihao Guo et.al. | 2405.00630 | null |
2024-05-01 | Lane Segmentation Refinement with Diffusion Models | Antonio Ruiz et.al. | 2405.00620 | null |
2024-05-31 | GAD-Generative Learning for HD Map-Free Autonomous Driving | Weijian Sun et.al. | 2405.00515 | null |
2024-05-01 | NeRF-Guided Unsupervised Learning of RGB-D Registration | Zhinan Yu et.al. | 2405.00507 | null |
2024-05-01 | On the Relevance of Byzantine Robust Optimization Against Data Poisoning | Sadegh Farhadkhani et.al. | 2405.00491 | null |
2024-05-01 | RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models | Mohamed Manzour Hussien et.al. | 2405.00449 | null |
2024-05-01 | Dual-Role AoI-based Incentive Mechanism for HD map Crowdsourcing | Wentao Ye et.al. | 2405.00353 | null |
2024-05-05 | Enhance Planning with Physics-informed Safety Controller for End-to-end Autonomous Driving | Hang Zhou et.al. | 2405.00316 | null |
2024-04-30 | SemVecNet: Generalizable Vector Map Generation for Arbitrary Sensor Configurations | Narayanan Elavathur Ranganatha et.al. | 2405.00250 | link |
2024-04-30 | Guiding Attention in End-to-End Driving Models | Diego Porres et.al. | 2405.00242 | link |
2024-04-30 | STT: Stateful Tracking with Transformers for Autonomous Driving | Longlong Jing et.al. | 2405.00236 | null |
2024-04-30 | Comparing Motion Distortion Between Vehicle Field Deployments | Nicolas Samson et.al. | 2405.00189 | null |
2024-05-28 | MicroDreamer: Zero-shot 3D Generation in $\sim$ 20 Seconds by Score-based Iterative Reconstruction | Luxi Chen et.al. | 2404.19525 | link |
2024-04-30 | Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection | Zhanwei Zhang et.al. | 2404.19384 | null |
2024-05-27 | SemanticFormer: Holistic and Semantic Traffic Scene Representation for Trajectory Prediction using Knowledge Graphs | Zhigang Sun et.al. | 2404.19379 | link |
2024-04-30 | G2LTraj: A Global-to-Local Generation Approach for Trajectory Prediction | Zhanwei Zhang et.al. | 2404.19330 | link |
2024-04-29 | Embedded Representation Learning Network for Animating Styled Video Portrait | Tianyong Wang et.al. | 2404.19038 | null |
2024-05-27 | Simple-RF: Regularizing Sparse Input Radiance Fields with Simpler Solutions | Nagabhushan Somraj et.al. | 2404.19015 | null |
2024-05-05 | Multimodal Fusion on Low-quality Data: A Comprehensive Survey | Qingyang Zhang et.al. | 2404.18947 | null |
2024-04-29 | DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing | Minghao Chen et.al. | 2404.18929 | null |
2024-05-22 | PlanNetX: Learning an Efficient Neural Network Planner from MPC for Longitudinal Control | Jasper Hoffmann et.al. | 2404.18863 | null |
2024-04-29 | Safe Reach Set Computation via Neural Barrier Certificates | Alessandro Abate et.al. | 2404.18813 | null |
2024-04-29 | Uncertainty-boosted Robust Video Activity Anticipation | Zhaobo Qi et.al. | 2404.18648 | link |
2024-04-29 | Assessing Quality Metrics for Neural Reality Gap Input Mitigation in Autonomous Driving Testing | Stefano Carlo Lambertenghi et.al. | 2404.18577 | link |
2024-04-29 | Predicting Safety Misbehaviours in Autonomous Driving Systems using Uncertainty Quantification | Ruben Grewal et.al. | 2404.18573 | link |
2024-04-29 | MRIC: Model-Based Reinforcement-Imitation Learning with Mixture-of-Codebooks for Autonomous Driving Simulation | Baotian He et.al. | 2404.18464 | null |
2024-04-29 | $ν$ -DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction | Yunxuan Mao et.al. | 2404.18439 | null |
2024-04-28 | S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM | Zhiyao Zhang et.al. | 2404.18284 | null |
2024-04-28 | RadSimReal: Bridging the Gap Between Synthetic and Real Data in Radar Object Detection With Simulation | Oded Bialer et.al. | 2404.18150 | null |
2024-04-27 | BoostRad: Enhancing Object Detection by Boosting Radar Reflections | Yuval Haitman et.al. | 2404.17861 | null |
2024-04-27 | Motion planning for off-road autonomous driving based on human-like cognition and weight adaptation | Yuchun Wang et.al. | 2404.17820 | null |
2024-04-27 | CLFT: Camera-LiDAR Fusion Transformer for Semantic Segmentation in Autonomous Driving | Junyi Gu et.al. | 2404.17793 | link |
2024-04-26 | CoCar NextGen: a Multi-Purpose Platform for Connected Autonomous Driving Research | Marc Heinrich et.al. | 2404.17550 | null |
2024-04-26 | Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields | Tianqi Liu et.al. | 2404.17528 | link |
2024-04-26 | A Cognitive-Driven Trajectory Prediction Model for Autonomous Driving in Mixed Autonomy Environment | Haicheng Liao et.al. | 2404.17520 | null |
2024-04-26 | Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection | Moussa Kassem Sbeyti et.al. | 2404.17427 | link |
2024-04-26 | On the Road to Clarity: Exploring Explainable AI for World Models in a Driver Assistance System | Mohamed Roshdi et.al. | 2404.17350 | null |
2024-04-26 | Scene-Extrapolation: Generating Interactive Traffic Scenarios | Maximilian Zipfl et.al. | 2404.17224 | null |
2024-04-26 | Beyond Imitation: A Life-long Policy Learning Framework for Path Tracking Control of Autonomous Driving | C. Gong et.al. | 2404.17198 | null |
2024-04-25 | Depth Supervised Neural Surface Reconstruction from Airborne Imagery | Vincent Hackstein et.al. | 2404.16429 | null |
2024-04-25 | Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey | Minrui Xu et.al. | 2404.16356 | null |
2024-04-29 | A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Yifan Zhao et.al. | 2404.16266 | link |
2024-04-24 | NeRF-XL: Scaling NeRFs with Multiple GPUs | Ruilong Li et.al. | 2404.16221 | null |
2024-04-28 | A Survey on Intermediate Fusion Methods for Collaborative Perception Categorized by Real World Challenges | Melih Yazgan et.al. | 2404.16139 | null |
2024-04-05 | Using Automated Vehicle Data as a Fitness Tracker for Sustainability | Xia Wang et.al. | 2404.16046 | null |
2024-04-24 | Learning Car-Following Behaviors Using Bayesian Matrix Normal Mixture Regression | Chengyuan Zhang et.al. | 2404.16023 | null |
2024-04-23 | DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft | Sam Earle et.al. | 2404.15538 | null |
2024-04-10 | Efficient EndoNeRF Reconstruction and Its Application for Data-driven Surgical Simulation | Yuehao Wang et.al. | 2404.15339 | null |
2024-04-23 | OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving | Guoqing Wang et.al. | 2404.15014 | null |
2024-04-23 | LaneCorrect: Self-supervised Lane Detection | Ming Nie et.al. | 2404.14671 | null |
2024-04-22 | PLUTO: Pushing the Limit of Imitation Learning-based Planning for Autonomous Driving | Jie Cheng et.al. | 2404.14327 | null |
2024-04-22 | Localization Based on MIMO Backscattering from Retro-Directive Antenna Arrays | Marina Lotti et.al. | 2404.14206 | null |
2024-04-28 | GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Hongyun Yu et.al. | 2404.14037 | null |
2024-04-22 | PointDifformer: Robust Point Cloud Registration With Neural Diffusion and Transformer | Rui She et.al. | 2404.14034 | null |
2024-05-15 | OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks | Sophia Sirko-Galouchenko et.al. | 2404.14027 | link |
2024-04-22 | Collaborative Perception Datasets in Autonomous Driving: A Survey | Melih Yazgan et.al. | 2404.14022 | null |
2024-05-05 | How do LLMs Support Deep Learning Testing? A Comprehensive Study Through the Lens of Image Mutation | Liwen Wang et.al. | 2404.13945 | null |
2024-04-23 | CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory | Yunlong Ran et.al. | 2404.13896 | null |
2024-04-26 | Neural Radiance Field in Autonomous Driving: A Survey | Lei He et.al. | 2404.13816 | null |
2024-04-21 | Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving | Shuyao Shi et.al. | 2404.13786 | null |
2024-04-26 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711 | link |
2024-04-27 | FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving | Ganesh Sistu et.al. | 2404.13443 | null |
2024-04-20 | High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | Baoru Huang et.al. | 2404.13437 | null |
2024-04-20 | Social Force Embedded Mixed Graph Convolutional Network for Multi-class Trajectory Prediction | Quancheng Du et.al. | 2404.13378 | null |
2024-04-20 | EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment | Guanghao Li et.al. | 2404.13346 | link |
2024-04-19 | BACS: Background Aware Continual Semantic Segmentation | Mostafa ElAraby et.al. | 2404.13148 | link |
2024-04-19 | FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction | Maria Dronova et.al. | 2404.12970 | null |
2024-04-22 | Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models | Zhenyang Ni et.al. | 2404.12916 | link |
2024-04-19 | FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving | Xingtai Gui et.al. | 2404.12867 | link |
2024-04-19 | Language-Driven Active Learning for Diverse Open-Set 3D Object Detection | Ross Greer et.al. | 2404.12856 | link |
2024-04-19 | A Point-Based Approach to Efficient LiDAR Multi-Task Perception | Christopher Lang et.al. | 2404.12798 | null |
2024-04-19 | Camera Agnostic Two-Head Network for Ego-Lane Inference | Chaehyeon Song et.al. | 2404.12770 | null |
2024-04-19 | A Containerized Microservice Architecture for a ROS 2 Autonomous Driving Software: An End-to-End Latency Evaluation | Tobias Betz et.al. | 2404.12683 | null |
2024-04-19 | Dragtraffic: A Non-Expert Interactive and Point-Based Controllable Traffic Scene Generation Framework | Sheng Wang et.al. | 2404.12624 | null |
2024-05-23 | Evaluating Alternatives to SFM Point Cloud Initialization for Gaussian Splatting | Yalda Foroutan et.al. | 2404.12547 | null |
2024-04-30 | TrACT: A Training Dynamics Aware Contrastive Learning Framework for Long-tail Trajectory Prediction | Junrui Zhang et.al. | 2404.12538 | null |
2024-04-18 | SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation | Mykola Lavreniuk et.al. | 2404.12501 | link |
2024-04-18 | Reducing Bias in Pre-trained Models by Tuning while Penalizing Change | Niklas Penzel et.al. | 2404.12292 | null |
2024-04-18 | An Online Spatial-Temporal Graph Trajectory Planner for Autonomous Vehicles | Jilan Samiuddin et.al. | 2404.12256 | null |
2024-04-18 | Stability Certificates for Receding Horizon Games | Sophie Hall et.al. | 2404.12165 | null |
2024-04-18 | S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles | Xiao Wang et.al. | 2404.11946 | null |
2024-04-18 | AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering | Jingfeng Guo et.al. | 2404.11897 | link |
2024-04-18 | Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations | Yu Feng et.al. | 2404.11852 | null |
2024-04-17 | TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation | Thomas Monninger et.al. | 2404.11803 | null |
2024-04-17 | Multimodal 3D Object Detection on Unseen Domains | Deepti Hegde et.al. | 2404.11764 | null |
2024-04-17 | Exploring DNN Robustness Against Adversarial Attacks Using Approximate Multipliers | Mohammad Javad Askarizadeh et.al. | 2404.11665 | null |
2024-04-17 | VG4D: Vision-Language Model Goes 4D Video Recognition | Zhichao Deng et.al. | 2404.11605 | link |
2024-04-17 | SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping | Vincent Cartillier et.al. | 2404.11419 | null |
2024-04-18 | SERENE: A Collusion Resilient Replication-based Verification Framework | Amir Esmaeili et.al. | 2404.11410 | null |
2024-04-17 | RainyScape: Unsupervised Rainy Scene Reconstruction using Decoupled Neural Rendering | Xianqiang Lyu et.al. | 2404.11401 | null |
2024-04-17 | Detector Collapse: Backdooring Object Detection to Catastrophic Overload or Blindness | Hangtao Zhang et.al. | 2404.11357 | null |
2024-04-19 | KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized Intersections | Chuheng Wei et.al. | 2404.11181 | link |
2024-04-17 | REACTO: Reconstructing Articulated Objects from a Single Video | Chaoyue Song et.al. | 2404.11151 | null |
2024-04-17 | D-Aug: Enhancing Data Augmentation for Dynamic LiDAR Scenes | Jiaxing Zhao et.al. | 2404.11127 | null |
2024-04-17 | Sky-GVIO: an enhanced GNSS/INS/Vision navigation with FCN-based sky-segmentation in urban canyon | Jingrong Wang et.al. | 2404.11070 | link |
2024-04-17 | How to deal with glare for improved perception of Autonomous Vehicles | Muhammad Z. Alam et.al. | 2404.10992 | null |
2024-04-18 | End-To-End Training and Testing Gamification Framework to Learn Human Highway Driving | Satya R. Jaladi et.al. | 2404.10849 | null |
2024-04-12 | PASA: Attack Agnostic Unsupervised Adversarial Detection using Prediction & Attribution Sensitivity Analysis | Dipkamal Bhusal et.al. | 2404.10789 | link |
2024-04-16 | RapidVol: Rapid Reconstruction of 3D Ultrasound Volumes from Sensorless 2D Scans | Mark C. Eid et.al. | 2404.10766 | null |
2024-04-16 | N-Agent Ad Hoc Teamwork | Caroline Wang et.al. | 2404.10740 | link |
2024-04-16 | Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2404.10625 | null |
2024-04-16 | Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases | Yanze Li et.al. | 2404.10595 | null |
2024-04-19 | SEVD: Synthetic Event-based Vision Dataset for Ego and Fixed Traffic Perception | Manideep Reddy Aliminati et.al. | 2404.10540 | link |
2024-04-16 | LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System | Shijing Hu et.al. | 2404.10498 | null |
2024-04-16 | Plug-and-Play Acceleration of Occupancy Grid-based NeRF Rendering using VDB Grid and Hierarchical Ray Traversal | Yoshio Kato et.al. | 2404.10272 | link |
2024-04-16 | PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network | Yuning Wang et.al. | 2404.10263 | null |
2024-04-15 | Taming Latent Diffusion Model for Neural Radiance Field Inpainting | Chieh Hubert Lin et.al. | 2404.09995 | null |
2024-04-15 | Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video | Hongchi Xia et.al. | 2404.09833 | null |
2024-04-15 | Sampling for Model Predictive Trajectory Planning in Autonomous Driving using Normalizing Flows | Georg Rabenstein et.al. | 2404.09657 | null |
2024-04-15 | SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction | Pin Tang et.al. | 2404.09502 | null |
2024-04-15 | Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System | Genjia Liu et.al. | 2404.09496 | link |
2024-04-15 | VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection | Bonan Ding et.al. | 2404.09431 | null |
2024-04-15 | ViFu: Multiple 360 $^\circ$ Objects Reconstruction with Clean Background via Visible Part Fusion | Tianhan Xu et.al. | 2404.09426 | null |
2024-05-06 | DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading | Tong Wu et.al. | 2404.09412 | null |
2024-04-14 | SyntStereo2Real: Edge-Aware GAN for Remote Sensing Image-to-Image Translation while Maintaining Stereo Constraint | Vasudha Venkatesan et.al. | 2404.09277 | null |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-04-14 | Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration | Yanhao Zhang et.al. | 2404.09169 | link |
2024-05-25 | Intention-Aware Control Based on Belief-Space Specifications and Stochastic Expansion | Zengjie Zhang et.al. | 2404.09037 | link |
2024-04-12 | WROOM: An Autonomous Driving Approach for Off-Road Navigation | Dvij Kalaria et.al. | 2404.08855 | link |
2024-04-12 | FusionPortableV2: A Unified Multi-Sensor Dataset for Generalized SLAM Across Diverse Platforms and Scalable Environments | Hexiang Wei et.al. | 2404.08563 | null |
2024-04-12 | Maturity of Vehicle Digital Twins: From Monitoring to Enabling Autonomous Driving | Robert Klar et.al. | 2404.08438 | null |
2024-04-12 | Transfer Learning Study of Motion Transformer-based Trajectory Predictions | Lars Ullrich et.al. | 2404.08271 | null |
2024-04-12 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
2024-04-11 | Real-Time Detection and Analysis of Vehicles and Pedestrians using Deep Learning | Md Nahid Sadik et.al. | 2404.08081 | null |
2024-04-11 | VeTraSS: Vehicle Trajectory Similarity Search Through Graph Modeling and Representation Learning | Ming Cheng et.al. | 2404.08021 | null |
2024-04-09 | GRANP: A Graph Recurrent Attentive Neural Process Model for Vehicle Trajectory Prediction | Yuhao Luo et.al. | 2404.08004 | link |
2024-04-11 | Connecting NeRFs, Images, and Text | Francesco Ballerini et.al. | 2404.07993 | link |
2024-03-18 | Reinforcement Learning with Generalizable Gaussian Splatting | Jiaxu Wang et.al. | 2404.07950 | null |
2024-04-11 | Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation | Keonhee Han et.al. | 2404.07933 | null |
2024-04-11 | Sparse Laneformer | Ji Liu et.al. | 2404.07821 | null |
2024-04-23 | NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | William Ljungbergh et.al. | 2404.07762 | link |
2024-04-11 | Homography Guided Temporal Fusion for Road Line and Marking Segmentation | Shan Wang et.al. | 2404.07626 | link |
2024-04-11 | Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios? | Marcel Hallgarten et.al. | 2404.07569 | link |
2024-04-11 | PillarTrack: Redesigning Pillar-based Transformer Network for Single Object Tracking on Point Clouds | Weisheng Xu et.al. | 2404.07495 | link |
2024-04-10 | Identification of Fine-grained Systematic Errors via Controlled Scene Generation | Valentyn Boreiko et.al. | 2404.07045 | null |
2024-04-10 | SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving | Diankun Zhang et.al. | 2404.06892 | null |
2024-04-19 | Monocular 3D lane detection for Autonomous Driving: Recent Achievements, Challenges, and Outlooks | Fulong Ma et.al. | 2404.06860 | null |
2024-04-10 | SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection | Mathis Kruse et.al. | 2404.06832 | link |
2024-04-10 | MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views | Runfa Li et.al. | 2404.06753 | null |
2024-04-10 | Bayesian NeRF: Quantifying Uncertainty with Volume Density in Neural Radiance Fields | Sibeak Lee et.al. | 2404.06727 | link |
2024-04-10 | Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data | Aakash Kumar et.al. | 2404.06715 | null |
2024-04-12 | SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera | Gaole Dai et.al. | 2404.06710 | null |
2024-05-10 | SAM-I-Am: Semantic Boosting for Zero-shot Atomic-Scale Electron Micrograph Segmentation | Waqwoya Abebe et.al. | 2404.06638 | link |
2024-04-20 | RoadBEV: Road Surface Reconstruction in Bird’s Eye View | Tong Zhao et.al. | 2404.06605 | link |
2024-04-11 | HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention | Xiaolong Tang et.al. | 2404.06351 | link |
2024-04-21 | AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning | Senkang Hu et.al. | 2404.06345 | null |
2024-04-14 | 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis | Zhicheng Lu et.al. | 2404.06270 | null |
2024-04-09 | Label-Efficient 3D Object Detection For Road-Side Units | Minh-Quan Dao et.al. | 2404.06256 | null |
2024-04-09 | GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields | Arnab Dey et.al. | 2404.06246 | null |
2024-04-09 | Towards Autonomous Driving with Small-Scale Cars: A Survey of Recent Development | Dianzhao Li et.al. | 2404.06229 | null |
2024-04-09 | HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields | Arnab Dey et.al. | 2404.06152 | null |
2024-04-09 | Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation | Mariella Dreissig et.al. | 2404.06124 | null |
2024-04-09 | Passive None-line-of-sight imaging with arbitrary scene condition and detection pattern in small amount of prior data | Yunting Gui et.al. | 2404.06015 | null |
2024-04-08 | Residual Chain Prediction for Autonomous Driving Path Planning | Liguo Zhou et.al. | 2404.05423 | null |
2024-04-08 | Human Detection from 4D Radar Data in Low-Visibility Field Conditions | Mikael Skog et.al. | 2404.05307 | null |
2024-04-08 | Detecting Every Object from Events | Haitian Zhang et.al. | 2404.05285 | link |
2024-04-08 | MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues | Xiahan Chen et.al. | 2404.05280 | null |
2024-04-08 | Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation | Y. Wang et.al. | 2404.05236 | null |
2024-04-08 | UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather | Haimei Zhao et.al. | 2404.05145 | null |
2024-04-09 | Better Monocular 3D Detectors with LiDAR from the Past | Yurong You et.al. | 2404.05139 | link |
2024-04-07 | CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis | Gyeongjin Kang et.al. | 2404.04913 | null |
2024-04-07 | MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection | Hou-I Liu et.al. | 2404.04910 | link |
2024-04-13 | GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF | Butian Xiong et.al. | 2404.04880 | null |
2024-04-07 | NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization | Peng Tu et.al. | 2404.04875 | null |
2024-04-07 | Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs | Yiqun Duan et.al. | 2404.04869 | null |
2024-04-07 | Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving | Jinlong Li et.al. | 2404.04804 | null |
2024-05-06 | HawkDrive: A Transformer-driven Visual Perception System for Autonomous Driving in Night Scene | Ziang Guo et.al. | 2404.04653 | link |
2024-05-22 | Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Prediction | Jingyi Pan et.al. | 2404.04561 | null |
2024-04-06 | DATENeRF: Depth-Aware Text-based Editing of NeRFs | Sara Rojas et.al. | 2404.04526 | null |
2024-04-06 | Automated Lane Change Behavior Prediction and Environmental Perception Based on SLAM Technology | Han Lei et.al. | 2404.04492 | null |
2024-04-05 | Exploring Probabilistic Models for Semi-supervised Learning | Jianfeng Wang et.al. | 2404.04199 | null |
2024-04-05 | You Can Use But Cannot Recognize: Preserving Visual Privacy in Deep Neural Networks | Qiushi Li et.al. | 2404.04098 | null |
2024-05-13 | Scaling Motion Forecasting Models with Ensemble Distillation | Scott Ettinger et.al. | 2404.03843 | null |
2024-04-04 | Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation | Elham Amin Mansour et.al. | 2404.03799 | null |
2024-04-04 | Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning | Spyridon Chavlis et.al. | 2404.03708 | null |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | Is CLIP the main roadblock for fine-grained open-world perception? | Lorenzo Bianchi et.al. | 2404.03539 | link |
2024-04-04 | Materials for High Temperature Digital Electronics | Dhiren K. Pradhan et.al. | 2404.03510 | null |
2024-04-05 | A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data | Iqra Bano et.al. | 2404.03493 | null |
2024-04-04 | VF-NeRF: Viewshed Fields for Rigid NeRF Registration | Leo Segre et.al. | 2404.03349 | null |
2024-05-06 | CORP: A Multi-Modal Dataset for Campus-Oriented Roadside Perception Tasks | Beibei Wang et.al. | 2404.03191 | null |
2024-04-03 | Ego-Motion Aware Target Prediction Module for Robust Multi-Object Tracking | Navid Mahdian et.al. | 2404.03110 | link |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903 | link |
2024-04-03 | LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Zehan Zheng et.al. | 2404.02742 | link |
2024-04-03 | One Stack to Rule them All: To Drive Automated Vehicles, and Reach for the 4th level | Sven Ochs et.al. | 2404.02645 | null |
2024-04-03 | Neural Radiance Fields with Torch Units | Bingnan Ni et.al. | 2404.02617 | null |
2024-05-20 | HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Zhongyu Xia et.al. | 2404.02517 | link |
2024-04-03 | AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset | Dongsu Lee et.al. | 2404.02429 | null |
2024-04-03 | TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes | Cheng Zhao et.al. | 2404.02410 | null |
2024-04-02 | OFMPNet: Deep End-to-End Model for Occupancy and Flow Prediction in Urban Environment | Youshaa Murhij et.al. | 2404.02263 | link |
2024-04-02 | OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising | Haichao Zhang et.al. | 2404.02227 | link |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-01 | Versatile Navigation under Partial Observability via Value-guided Diffusion Policy | Gengyu Zhang et.al. | 2404.02176 | null |
2024-04-17 | Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields | Joshua Ahn et.al. | 2404.02155 | null |
2024-04-02 | Risk-Aware Real-Time Task Allocation for Stochastic Multi-Agent Systems under STL Specifications | Maico H. W. Engelaar et.al. | 2404.02111 | null |
2024-04-02 | WcDT: World-centric Diffusion Transformer for Traffic Scene Generation | Chen Yang et.al. | 2404.02082 | link |
2024-04-02 | Heuristic Optimization of Amplifier Reconfiguration Process for Autonomous Driving Optical Networks | Qizhi Qiu et.al. | 2404.01949 | null |
2024-04-02 | Improving Bird’s Eye View Semantic Segmentation by Task Decomposition | Tianhao Zhao et.al. | 2404.01925 | link |
2024-04-02 | Analyzing the Single Event Upset Vulnerability of Binarized Neural Networks on SRAM FPGAs | Ioanna Souvatzoglou et.al. | 2404.01757 | null |
2024-04-02 | Exploring Latent Pathways: Enhancing the Interpretability of Autonomous Driving with a Variational Autoencoder | Anass Bairouk et.al. | 2404.01750 | null |
2024-05-12 | Boosting Visual Recognition in Real-world Degradations via Unsupervised Feature Enhancement Module with Deep Channel Prior | Zhanwen Liu et.al. | 2404.01703 | link |
2024-04-02 | Learning Temporal Cues by Predicting Objects Move for Multi-camera 3D Object Detection | Seokha Moon et.al. | 2404.01580 | null |
2024-04-02 | Are Doppler Velocity Measurements Useful for Spinning Radar Odometry? | Daniil Lisus et.al. | 2404.01537 | null |
2024-04-01 | ML KPI Prediction in 5G and B5G Networks | Nguyen Phuc Tran et.al. | 2404.01530 | null |
2024-04-01 | QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving | Sourav Biswas et.al. | 2404.01486 | null |
2024-04-01 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400 | null |
2024-04-18 | NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields | Muhammad Zubair Irshad et.al. | 2404.01300 | link |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-01 | Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting | Jiarui Meng et.al. | 2404.01168 | null |
2024-04-01 | SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance | Yuru Xiao et.al. | 2404.00992 | null |
2024-04-01 | BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks | Zhiyuan Cheng et.al. | 2404.00924 | link |
2024-04-01 | MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements | Lisong C. Sun et.al. | 2404.00923 | null |
2024-04-01 | An Integrating Comprehensive Trajectory Prediction with Risk Potential Field Method for Autonomous Driving | Kailu Wu et.al. | 2404.00893 | null |
2024-04-02 | DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly | Fenggen Yu et.al. | 2404.00875 | null |
2024-03-31 | An Active Perception Game for Robust Autonomous Exploration | Siming He et.al. | 2404.00769 | null |
2024-03-31 | Adapting to Length Shift: FlexiLength Network for Trajectory Prediction | Yi Xu et.al. | 2404.00742 | null |
2024-04-20 | End-to-End Autonomous Driving through V2X Cooperation | Haibao Yu et.al. | 2404.00717 | link |
2024-03-31 | Neural Radiance Field-based Visual Rendering: A Comprehensive Review | Mingyuan Yao et.al. | 2404.00714 | null |
2024-03-31 | Weak-to-Strong 3D Object Detection with X-Ray Distillation | Alexander Gambashidze et.al. | 2404.00679 | link |
2024-03-31 | Denoising Low-dose Images Using Deep Learning of Time Series Images | Yang Shao et.al. | 2404.00510 | null |
2024-03-30 | MaGRITTe: Manipulative and Generative 3D Realization from Image, Topview and Text | Takayuki Hara et.al. | 2404.00345 | null |
2024-03-19 | Advancing Explainable Autonomous Vehicle Systems: A Comprehensive Review and Research Roadmap | Sule Tekkesinoglu et.al. | 2404.00019 | null |
2024-03-29 | Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior | Jaehoon Ko et.al. | 2403.20153 | link |
2024-03-29 | LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving | Pranjal Paul et.al. | 2403.20116 | null |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising | Tianchen Deng et.al. | 2403.20034 | link |
2024-03-29 | HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Zhuopeng Li et.al. | 2403.20032 | null |
2024-03-29 | SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image | Yunhao Li et.al. | 2403.20018 | link |
2024-03-29 | DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal | Yunhao Li et.al. | 2403.20013 | link |
2024-04-03 | MI-NeRF: Learning a Single Face NeRF from Multiple Identities | Aggelina Chatziagapi et.al. | 2403.19920 | null |
2024-03-29 | PLoc: A New Evaluation Criterion Based on Physical Location for Autonomous Driving Datasets | Ruining Yang et.al. | 2403.19893 | null |
2024-05-09 | Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving | Akshay Gopalkrishnan et.al. | 2403.19838 | link |
2024-03-28 | Mitigating Motion Blur in Neural Radiance Fields with Events and Frames | Marco Cannici et.al. | 2403.19780 | link |
2024-04-05 | GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling | Bowen Zhang et.al. | 2403.19655 | null |
2024-03-28 | Human-compatible driving partners through data-regularized self-play reinforcement learning | Daphne Cornelisse et.al. | 2403.19648 | link |
2024-03-28 | SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects | Avinash Ummadisingu et.al. | 2403.19607 | null |
2024-03-28 | CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians | Avinash Paliwal et.al. | 2403.19495 | link |
2024-04-25 | Learning Sampling Distribution and Safety Filter for Autonomous Driving with VQ-VAE and Differentiable Optimization | Simon Idoko et.al. | 2403.19461 | link |
2024-03-28 | SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control | Binyuan Huang et.al. | 2403.19438 | null |
2024-03-28 | Learning a Formally Verified Control Barrier Function in Stochastic Environment | Manan Tayal et.al. | 2403.19332 | link |
2024-03-28 | Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation | Yujin Chen et.al. | 2403.19319 | null |
2024-03-28 | Sine Activated Low-Rank Matrices for Parameter Efficient Learning | Yiping Ji et.al. | 2403.19243 | null |
2024-03-28 | CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation | Lingjun Zhao et.al. | 2403.19104 | null |
2024-04-07 | GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving | Yunpeng Zhang et.al. | 2403.19098 | link |
2024-03-27 | GENESIS-RL: GEnerating Natural Edge-cases with Systematic Integration of Safety considerations and Reinforcement Learning | Hsin-Jung Yang et.al. | 2403.19062 | null |
2024-03-27 | Ensuring Safe Autonomy: Navigating the Future of Autonomous Vehicles | Patrick Wolf et.al. | 2403.19006 | null |
2024-03-27 | LORD: Large Models based Opposite Reward Design for Autonomous Driving | Xin Ye et.al. | 2403.18965 | null |
2024-03-27 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D | Mukund Varma T et.al. | 2403.18922 | null |
2024-03-29 | Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | Qiuhong Shen et.al. | 2403.18795 | link |
2024-03-27 | SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery | Camille Billouard et.al. | 2403.18711 | link |
2024-04-28 | Sampling-Based Motion Planning with Online Racing Line Generation for Autonomous Driving on Three-Dimensional Race Tracks | Levent Ögretmen et.al. | 2403.18643 | link |
2024-03-27 | Modeling uncertainty for Gaussian Splatting | Luca Savant et.al. | 2403.18476 | null |
2024-03-27 | Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving | Xuemin Hu et.al. | 2403.18209 | null |
2024-03-27 | Road Obstacle Detection based on Unknown Objectness Scores | Chihiro Noguchi et.al. | 2403.18207 | null |
2024-03-26 | Low-Latency Neural Stereo Streaming | Qiqi Hou et.al. | 2403.17879 | null |
2024-03-26 | Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving | Axel Brunnbauer et.al. | 2403.17805 | link |
2024-03-26 | LiDAR-Based Crop Row Detection Algorithm for Over-Canopy Autonomous Navigation in Agriculture Fields | Ruiji Liu et.al. | 2403.17774 | link |
2024-03-28 | UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain Gaps | Maciej K Wozniak et.al. | 2403.17633 | link |
2024-03-26 | Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Kai Yuan et.al. | 2403.17607 | link |
2024-03-26 | NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation | Jiahao Chen et.al. | 2403.17537 | null |
2024-03-26 | AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving | Mingfu Liang et.al. | 2403.17373 | null |
2024-03-27 | Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving | Junhao Zheng et.al. | 2403.17301 | link |
2024-03-25 | SynFog: A Photo-realistic Synthetic Fog Dataset based on End-to-end Imaging Simulation for Advancing Real-World Defogging in Autonomous Driving | Yiming Xie et.al. | 2403.17094 | null |
2024-03-25 | TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation | Quang-Huy Che et.al. | 2403.16958 | link |
2024-03-25 | CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs | Yingji Zhong et.al. | 2403.16885 | null |
2024-03-25 | Exploring Communication Technologies, Standards, and Challenges in Electrified Vehicle Charging | Xiang Ma et.al. | 2403.16830 | null |
2024-05-07 | Synapse: Learning Preferential Concepts from Visual Demonstrations | Sadanand Modak et.al. | 2403.16689 | null |
2024-03-25 | RCBEVDet: Radar-camera Fusion in Bird’s Eye View for 3D Object Detection | Zhiwei Lin et.al. | 2403.16440 | link |
2024-03-25 | Spike-NeRF: Neural Radiance Field Based On Spike Camera | Yijia Guo et.al. | 2403.16410 | null |
2024-03-25 | ProIn: Learning to Predict Trajectory Based on Progressive Interactions for Autonomous Driving | Yinke Dong et.al. | 2403.16374 | null |
2024-03-25 | Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks | Madhumitha Sakthi et.al. | 2403.16338 | null |
2024-03-24 | Engineering Safety Requirements for Autonomous Driving with Large Language Models | Ali Nouri et.al. | 2403.16289 | null |
2024-03-24 | Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields | Haoyuan Wang et.al. | 2403.16224 | null |
2024-03-24 | Interference Management for Integrated Sensing and Communication Systems: A Survey | Yangyang Niu et.al. | 2403.16189 | null |
2024-03-24 | Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes | Takashi Otonari et.al. | 2403.16141 | null |
2024-03-24 | Self-Supervised Multi-Frame Neural Scene Flow | Dongrui Liu et.al. | 2403.16116 | null |
2024-03-24 | CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Jiarui Hu et.al. | 2403.16095 | null |
2024-04-15 | Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap | Carl Lindström et.al. | 2403.16092 | null |
2024-04-02 | PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling | Xiaoyun Zheng et.al. | 2403.16080 | link |
2024-03-24 | Semantic Is Enough: Only Semantic Information For NeRF Reconstruction | Ruibo Wang et.al. | 2403.16043 | null |
2024-03-28 | Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields | Junhong Zhao et.al. | 2403.15981 | null |
2024-03-23 | iA $^$: Imperative Learning-based A$^$ Search for Pathfinding | Xiangyu Chen et.al. | 2403.15870 | null |
2024-03-23 | Spatio-Temporal Bi-directional Cross-frame Memory for Distractor Filtering Point Cloud Single Object Tracking | Shaoyu Sun et.al. | 2403.15831 | null |
2024-03-23 | DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation | Mu-Yi Shen et.al. | 2403.15791 | link |
2024-03-23 | PNAS-MOT: Multi-Modal Object Tracking with Pareto Neural Architecture Search | Chensheng Peng et.al. | 2403.15712 | link |
2024-03-23 | Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections | Dongbin Zhang et.al. | 2403.15704 | null |
2024-03-22 | Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting | Jun Guo et.al. | 2403.15624 | null |
2024-03-22 | Autonomous Driving With Perception Uncertainties: Deep-Ensemble Based Adaptive Cruise Control | Xiao Li et.al. | 2403.15577 | null |
2024-03-20 | EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union | Brian Hsuan-Cheng Liao et.al. | 2403.15474 | null |
2024-03-26 | Metasurface-Enabled Multifunctional Single-Frequency Sensors without External Power | Masaya Tashiro et.al. | 2403.15427 | null |
2024-03-22 | Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks | Aqeel Anwar et.al. | 2403.15370 | null |
2024-03-22 | CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking | Nicolas Baumann et.al. | 2403.15313 | link |
2024-03-22 | IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Junbo Yin et.al. | 2403.15241 | link |
2024-03-22 | Learning from Visual Demonstrations through Differentiable Nonlinear MPC for Personalized Autonomous Driving | Flavia Sofia Acerbo et.al. | 2403.15102 | null |
2024-03-22 | Tri-Perspective View Decomposition for Geometry-Aware Depth Completion | Zhiqiang Yan et.al. | 2403.15008 | null |
2024-03-22 | Unifying Lane-Level Traffic Prediction from a Graph Structural Perspective: Benchmark and Baseline | Shuhao Li et.al. | 2403.14941 | link |
2024-03-21 | Hyperspectral Neural Radiance Fields | Gerry Chen et.al. | 2403.14839 | null |
2024-03-21 | CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis | Matteo Bonotto et.al. | 2403.14412 | link |
2024-03-21 | InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity | Jiabin Liang et.al. | 2403.14376 | null |
2024-03-21 | SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field | Lizhe Liu et.al. | 2403.14366 | null |
2024-03-21 | Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions | Jiacong Xu et.al. | 2403.14053 | link |
2024-03-20 | Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study | Luca Giamattei et.al. | 2403.13729 | null |
2024-03-20 | MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination | Weiying Wang et.al. | 2403.13348 | null |
2024-03-20 | Learning Novel View Synthesis from Heterogeneous Low-light Captures | Quan Zheng et.al. | 2403.13337 | null |
2024-03-21 | AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving | Xiaosong Jia et.al. | 2403.13331 | null |
2024-03-21 | Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations | Kewei Wang et.al. | 2403.13261 | link |
2024-03-20 | A Rule-Compliance Path Planner for Lane-Merge Scenarios Based on Responsibility-Sensitive Safety | Pengfei Lin et.al. | 2403.13251 | null |
2024-03-19 | Depth-guided NeRF Training via Earth Mover’s Distance | Anita Rau et.al. | 2403.13206 | null |
2024-03-28 | DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images | Zaid Tasneem et.al. | 2403.13199 | null |
2024-03-19 | Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering | Mingqi Shao et.al. | 2403.12839 | null |
2024-03-19 | IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model | Matteo Bortolon et.al. | 2403.12682 | null |
2024-03-19 | M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving | Dongyang Xu et.al. | 2403.12552 | null |
2024-03-18 | FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos | Florian Philipp Stilz et.al. | 2403.12198 | null |
2024-03-18 | Safety Implications of Explainable Artificial Intelligence in End-to-End Autonomous Driving | Shahin Atakishiyev et.al. | 2403.12176 | null |
2024-03-18 | ThermoNeRF: Multimodal Neural Radiance Fields for Thermal Novel View Synthesis | Mariam Hassan et.al. | 2403.12154 | link |
2024-03-18 | HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation | Ce Zhang et.al. | 2403.12033 | link |
2024-03-18 | Informed Spectral Normalized Gaussian Processes for Trajectory Prediction | Christian Schlauch et.al. | 2403.11966 | null |
2024-03-18 | GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors | LI Yang et.al. | 2403.11899 | null |
2024-03-18 | Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging | Mert Özer et.al. | 2403.11865 | null |
2024-04-10 | GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2403.11848 | null |
2024-03-19 | BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting | Lingzhe Zhao et.al. | 2403.11831 | link |
2024-03-18 | Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Yuqi Zhang et.al. | 2403.11812 | link |
2024-03-18 | OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation | Haochen Jiang et.al. | 2403.11796 | null |
2024-03-18 | EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding | Wenhua Wu et.al. | 2403.11789 | null |
2024-03-18 | TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction | Ali Asghar Sharifi et.al. | 2403.11695 | null |
2024-03-18 | SSAP: A Shape-Sensitive Adversarial Patch for Comprehensive Disruption of Monocular Depth Estimation in Autonomous Navigation Applications | Amira Guesmi et.al. | 2403.11515 | null |
2024-03-18 | MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception | Thien-Minh Nguyen et.al. | 2403.11496 | null |
2024-03-17 | Driving Style Alignment for LLM-powered Driver Agent | Ruoxuan Yang et.al. | 2403.11368 | link |
2024-03-17 | Creating Seamless 3D Maps Using Radiance Fields | Sai Tarun Sathyan et.al. | 2403.11364 | null |
2024-03-17 | Multi-Sample Long Range Path Planning under Sensing Uncertainty for Off-Road Autonomous Driving | Matt Schmittle et.al. | 2403.11298 | null |
2024-03-17 | SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream | Lin Zhu et.al. | 2403.11222 | link |
2024-04-13 | Recent Advances in 3D Gaussian Splatting | Tong Wu et.al. | 2403.11134 | null |
2024-03-17 | Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications | Yonggan Fu et.al. | 2403.11131 | link |
2024-03-17 | Large Language Models Powered Context-aware Motion Prediction | Xiaoji Zheng et.al. | 2403.11057 | link |
2024-03-16 | Fast Sparse View Guided NeRF Update for Object Reconfigurations | Ziqi Lu et.al. | 2403.11024 | null |
2024-03-16 | HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering | Seunghyeon Seo et.al. | 2403.10906 | null |
2024-03-16 | MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field | Dongyu Yan et.al. | 2403.10840 | link |
2024-03-16 | DPPE: Dense Pose Estimation in a Plenoxels Environment using Gradient Approximation | Christopher Kolios et.al. | 2403.10773 | null |
2024-03-15 | Gradient based Feature Attribution in Explainable AI: A Technical Review | Yongjie Wang et.al. | 2403.10415 | null |
2024-03-15 | Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search | Hongyuan Yu et.al. | 2403.10413 | link |
2024-03-15 | SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras | Yingqi Tang et.al. | 2403.10353 | link |
2024-03-15 | Thermal-NeRF: Neural Radiance Fields from an Infrared Camera | Tianxiang Ye et.al. | 2403.10340 | link |
2024-03-20 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297 | link |
2024-03-15 | CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning | Yukun Li et.al. | 2403.10245 | link |
2024-03-15 | SemanticHuman-HD: High-Resolution Semantic Disentangled 3D Human Generation | Peng Zheng et.al. | 2403.10166 | null |
2024-03-31 | RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception | Ruiyang Hao et.al. | 2403.10145 | link |
2024-03-25 | URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields | Bo Xu et.al. | 2403.10119 | null |
2024-03-15 | Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning | Hang Zhang et.al. | 2403.10107 | null |
2024-03-19 | DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video | Huiqiang Sun et.al. | 2403.10103 | null |
2024-03-15 | RangeLDM: Fast Realistic LiDAR Point Cloud Generation | Qianjiang Hu et.al. | 2403.10094 | link |
2024-03-15 | Visual Foundation Models Boost Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation | Jingyi Xu et.al. | 2403.10001 | link |
2024-03-14 | Reality Bites: Assessing the Realism of Driving Scenarios with Large Language Models | Jiahui Wu et.al. | 2403.09906 | link |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630 | link |
2024-03-14 | The NeRFect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou et.al. | 2403.09577 | null |
2024-03-14 | Are you a robot? Detecting Autonomous Vehicles from Behavior Analysis | Fabio Maresca et.al. | 2403.09571 | null |
2024-03-14 | On STPA for Distributed Development of Safe Autonomous Driving: An Interview Study | Ali Nouri et.al. | 2403.09509 | null |
2024-03-14 | VIRUS-NeRF – Vision, InfraRed and UltraSonic based Neural Radiance Fields | Nicolaj Schmid et.al. | 2403.09477 | link |
2024-03-14 | An Industrial Experience Report about Challenges from Continuous Monitoring, Improvement, and Deployment for Autonomous Driving Features | Ali Nouri et.al. | 2403.09474 | null |
2024-03-14 | EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection | Jiaqing Zhang et.al. | 2403.09323 | link |
2024-03-14 | Intention-aware Denoising Diffusion Model for Trajectory Prediction | Chen Liu et.al. | 2403.09190 | null |
2024-03-14 | PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Tianyuan Yuan et.al. | 2403.09079 | link |
2024-03-13 | FogGuard: guarding YOLO against fog using perceptual loss | Soheil Gharatappeh et.al. | 2403.08939 | link |
2024-03-13 | CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow | Chenbin Pan et.al. | 2403.08919 | null |
2024-03-11 | People Attribute Purpose to Autonomous Vehicles When Explaining Their Behavior | Balint Gyevnar et.al. | 2403.08828 | link |
2024-03-13 | MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Jialv Zou et.al. | 2403.08760 | link |
2024-03-13 | Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution | Samuel Sze et.al. | 2403.08748 | null |
2024-03-13 | IAMCV Multi-Scenario Vehicle Interaction Dataset | Novel Certad et.al. | 2403.08455 | null |
2024-03-13 | DeepCSHAP: Utilizing Shapley Values to Explain Deep Complex-Valued Neural Networks | Florian Eilers et.al. | 2403.08428 | null |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | link |
2024-03-13 | Optimized Detection and Classification on GTRSB: Advancing Traffic Sign Recognition with Convolutional Neural Networks | Dhruv Toshniwal et.al. | 2403.08283 | null |
2024-03-13 | LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving | Sicen Guo et.al. | 2403.08215 | null |
2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-12 | Q-SLAM: Quadric Representations for Monocular SLAM | Chensheng Peng et.al. | 2403.08125 | null |
2024-03-12 | Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception | Philipp Wolters et.al. | 2403.07746 | link |
2024-03-12 | SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields | Jungho Lee et.al. | 2403.07547 | link |
2024-03-12 | A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions | Quoc-Vinh Lai-Dang et.al. | 2403.07542 | null |
2024-03-12 | Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving | JunDa Cheng et.al. | 2403.07535 | link |
2024-03-12 | Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Alexander Timans et.al. | 2403.07263 | link |
2024-03-12 | Tractable Joint Prediction and Planning over Discrete Behavior Modes for Urban Driving | Adam Villaflor et.al. | 2403.07232 | null |
2024-02-28 | Automatic driving lane change safety prediction model based on LSTM | Wenjian Sun et.al. | 2403.06993 | null |
2024-03-11 | SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection | Yifu Tao et.al. | 2403.06877 | null |
2024-04-11 | DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Guosheng Zhao et.al. | 2403.06845 | null |
2024-03-11 | PCLD: Point Cloud Layerwise Diffusion for Adversarial Purification | Mert Gulsen et.al. | 2403.06698 | link |
2024-03-11 | Vosh: Voxel-Mesh Hybrid Representation for Real-Time View Synthesis | Chenhao Zhang et.al. | 2403.06505 | null |
2024-03-11 | 3D Semantic Segmentation-Driven Representations for 3D Object Detection | Hayeon O et.al. | 2403.06501 | link |
2024-03-22 | S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes | Xingyi Li et.al. | 2403.06205 | null |
2024-03-10 | On depth prediction for autonomous driving using self-supervised learning | Houssem Boulahbal et.al. | 2403.06194 | null |
2024-03-10 | Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving | Zhili Chen et.al. | 2403.06166 | null |
2024-03-10 | Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis? | Hanxin Zhu et.al. | 2403.06092 | null |
2024-03-09 | Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving | Junyi Cao et.al. | 2403.05907 | link |
2024-03-09 | Fast Kernel Scene Flow | Xueqian Li et.al. | 2403.05896 | link |
2024-03-09 | SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection | Gang Zhang et.al. | 2403.05817 | link |
2024-03-09 | Large Generative Model Assisted 3D Semantic Communication | Feibo Jiang et.al. | 2403.05783 | null |
2024-03-08 | JointMotion: Joint Self-supervision for Joint Motion Prediction | Royden Wagner et.al. | 2403.05489 | link |
2024-03-08 | OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy Prediction | Ji Zhang et.al. | 2403.05329 | null |
2024-03-08 | LVIC: Multi-modality segmentation by Lifting Visual Info as Cue | Zichao Dong et.al. | 2403.05159 | null |
2024-03-08 | LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves | Jiayan Cao et.al. | 2403.05155 | null |
2024-03-18 | DyRoNet: Dynamic Routing and Low-Rank Adapters for Autonomous Driving Streaming Perception | Xiang Huang et.al. | 2403.05050 | null |
2024-03-11 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002 | link |
2024-03-11 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
2024-03-24 | BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng et.al. | 2403.04926 | link |
2024-03-07 | A General Calibrated Regret Metric for Detecting and Mitigating Human-Robot Interaction Failures | Kensuke Nakamura et.al. | 2403.04745 | null |
2024-03-07 | Embodied Understanding of Driving Scenarios | Yunsong Zhou et.al. | 2403.04593 | link |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508 | null |
2024-03-07 | FriendNet: Detection-Friendly Dehazing Network | Yihua Fan et.al. | 2403.04443 | link |
2024-03-07 | LitSim: Conflict-aware Policy for Long-term Interactive Traffic Simulation | Haojie Xin et.al. | 2403.04299 | null |
2024-03-07 | Generalizing Cooperative Eco-driving via Multi-residual Task Learning | Vindula Jayawardana et.al. | 2403.04232 | null |
2024-03-07 | Incremental Bayesian Learning for Fail-Operational Control in Autonomous Driving | Lei Zheng et.al. | 2403.04143 | null |
2024-03-07 | Towards learning-based planning:The nuPlan benchmark for real-world autonomous driving | Napat Karnchanachari et.al. | 2403.04133 | null |
2024-03-06 | Multi-Object Tracking with Camera-LiDAR Fusion for Autonomous Driving | Riccardo Pieroni et.al. | 2403.04112 | null |
2024-03-06 | DART: Implicit Doppler Tomography for Radar Novel View Synthesis | Tianshu Huang et.al. | 2403.03896 | null |
2024-03-06 | 3D Object Visibility Prediction in Autonomous Driving | Chuanyu Luo et.al. | 2403.03681 | null |
2024-03-20 | Learning Adversarial MDPs with Stochastic Hard Constraints | Francesco Emanuele Stradi et.al. | 2403.03672 | null |
2024-03-06 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Zi-Ting Chou et.al. | 2403.03608 | null |
2024-03-06 | Seamless Virtual Reality with Integrated Synchronizer and Synthesizer for Autonomous Driving | He Li et.al. | 2403.03541 | null |
2024-03-06 | Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator | Wonhyeok Choi et.al. | 2403.03468 | null |
2024-03-05 | A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction | Haofan Lu et.al. | 2403.03241 | null |
2024-03-05 | Behavior Generation with Latent Actions | Seungjae Lee et.al. | 2403.03181 | link |
2024-03-05 | User-Driven Adaptation: Tailoring Autonomous Driving Systems with Dynamic Preferences | Mingyue Zhang et.al. | 2403.02928 | null |
2024-03-05 | ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving | Han Lu et.al. | 2403.02877 | null |
2024-03-05 | Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps | Timothy Chen et.al. | 2403.02751 | link |
2024-03-05 | FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View | Jiawei Hou et.al. | 2403.02710 | null |
2024-03-26 | HoloVIC: Large-scale Dataset and Benchmark for Multi-Sensor Holographic Intersection and Vehicle-Infrastructure Cooperative | Cong Ma et.al. | 2403.02640 | null |
2024-03-05 | World Models for Autonomous Driving: An Initial Survey | Yanchen Guan et.al. | 2403.02622 | null |
2024-03-04 | Uncertainty-Aware Prediction and Application in Planning for Autonomous Driving: Definitions, Methods, and Comparison | Wenbo Shao et.al. | 2403.02297 | null |
2024-03-04 | DaReNeRF: Direction-aware Representation for Dynamic Scenes | Ange Lou et.al. | 2403.02265 | null |
2024-03-04 | Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views | Shuai Guo et.al. | 2403.02063 | null |
2024-03-04 | Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous Driving | Yuxuan Liu et.al. | 2403.02037 | link |
2024-03-04 | Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection | Shitao Chen et.al. | 2403.01978 | link |
2024-03-04 | Progressive Smoothing for Motion Planning in Real-Time NMPC | Rudolf Reiter et.al. | 2403.01830 | null |
2024-03-04 | PointCore: Efficient Unsupervised Point Cloud Anomaly Detector Using Local-Global Features | Baozhu Zhao et.al. | 2403.01804 | link |
2024-03-13 | OccFusion: A Straightforward and Effective Multi-Sensor Fusion Framework for 3D Occupancy Prediction | Zhenxing Ming et.al. | 2403.01644 | link |
2024-03-03 | A Unified Model Selection Technique for Spectral Clustering Based Motion Segmentation | Yuxiang Huang et.al. | 2403.01606 | null |
2024-03-02 | NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning | Linsheng Chen et.al. | 2403.01325 | link |
2024-03-02 | On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving | Kaituo Feng et.al. | 2403.01238 | link |
2024-03-20 | Results and Lessons Learned from Autonomous Driving Transportation Services in Airfield, Crowded Indoor, and Urban Environments | Doosan Baek et.al. | 2403.01233 | null |
2024-03-02 | Neural radiance fields-based holography [Invited] | Minsung Kang et.al. | 2403.01137 | null |
2024-03-02 | Neural Field Classifiers via Target Encoding and Classification Loss | Xindi Yang et.al. | 2403.01058 | null |
2024-03-01 | Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change | Ruichen Xu et.al. | 2403.00446 | null |
2024-03-01 | MS-Net: A Multi-Path Sparse Model for Motion Prediction in Multi-Scenes | Xiaqiang Tang et.al. | 2403.00353 | null |
2024-02-29 | Genie: Smart ROS-based Caching for Connected Autonomous Robots | Zexin Li et.al. | 2402.19410 | null |
2024-02-29 | Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction | Wenbo Shao et.al. | 2402.19385 | null |
2024-03-03 | RoadRunner - Learning Traversability Estimation for Autonomous Off-road Driving | Jonas Frey et.al. | 2402.19341 | null |
2024-02-29 | T3DNet: Compressing Point Cloud Models for Lightweight 3D Recognition | Zhiyuan Yang et.al. | 2402.19264 | null |
2024-02-29 | A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Haicheng Liao et.al. | 2402.19251 | link |
2024-02-21 | Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting | Edgar Medina et.al. | 2402.19237 | link |
2024-02-29 | CollaFuse: Navigating Limited Resources and Privacy in Collaborative Generative AI | Domenique Zipperling et.al. | 2402.19105 | link |
2024-02-29 | GoalNet: Goal Areas Oriented Pedestrian Trajectory Prediction | Ching-Lin Lee et.al. | 2402.19002 | null |
2024-02-29 | Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey | Yang Liu et.al. | 2402.18844 | link |
2024-02-28 | Evaluating Decision Optimality of Autonomous Driving via Metamorphic Testing | Mingfei Cheng et.al. | 2402.18393 | null |
2024-02-28 | Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis | Miriam Louise Carnot et.al. | 2402.18309 | null |
2024-02-28 | EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving | Jiacheng Lin et.al. | 2402.18302 | link |
2024-02-28 | PiShield: A NeSy Framework for Learning with Requirements | Mihaela Cătălina Stoian et.al. | 2402.18285 | link |
2024-03-08 | EAN-MapNet: Efficient Vectorized HD Map Construction with Anchor Neighborhoods | Huiyuan Xiong et.al. | 2402.18278 | null |
2024-02-28 | NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Jingrui Yu et.al. | 2402.18196 | link |
2024-02-28 | NiteDR: Nighttime Image De-Raining with Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes | Cidan Shi et.al. | 2402.18172 | link |
2024-03-01 | 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling | Chaokang Jiang et.al. | 2402.18146 | link |
2024-02-28 | OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction | Jian Liu et.al. | 2402.18140 | null |
2024-03-30 | Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging | Bhargav Ghanekar et.al. | 2402.18102 | null |
2024-03-06 | ICAT: An Indoor Connected and Autonomous Testbed for Vehicle Computing | Zhaofeng Tian et.al. | 2402.17933 | null |
2024-03-17 | SWTrack: Multiple Hypothesis Sliding Window 3D Multi-Object Tracking | Sandro Papais et.al. | 2402.17892 | null |
2024-03-21 | Neural Radiance Fields in Medical Imaging: Challenges and Next Steps | Xin Wang et.al. | 2402.17797 | null |
2024-02-27 | QoS prediction in radio vehicular environments via prior user information | Noor Ul Ain et.al. | 2402.17689 | null |
2024-02-27 | Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data | David S. W. Williams et.al. | 2402.17653 | null |
2024-02-27 | An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains | George Eskandar et.al. | 2402.17562 | null |
2024-02-27 | Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction | Zihao Liu et.al. | 2402.17430 | link |
2024-02-27 | Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis | Zicheng Zhang et.al. | 2402.17364 | link |
2024-03-21 | ICP-Flow: LiDAR Scene Flow Estimation with ICP | Yancong Lin et.al. | 2402.17351 | link |
2024-02-27 | CharNeRF: 3D Character Generation from Concept Art | Eddy Chu et.al. | 2402.17115 | null |
2024-02-26 | Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test | Kathy Jang et.al. | 2402.17050 | null |
2024-03-20 | Lightweight, error-tolerant edge detection using memristor-enabled stochastic logics | Lekai Song et.al. | 2402.16908 | null |
2024-02-26 | Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2) | Qifeng Li et.al. | 2402.16720 | null |
2024-02-26 | Learning Based NMPC Adaptation for Autonomous Driving using Parallelized Digital Twin | Jean Pierre Allamaa et.al. | 2402.16645 | null |
2024-02-26 | Resolution-Agnostic Neural Compression for High-Fidelity Portrait Video Conferencing via Implicit Radiance Fields | Yifei Li et.al. | 2402.16599 | null |
2024-02-26 | Trajectory Prediction for Autonomous Driving Using a Transformer Network | Zhenning Li et.al. | 2402.16501 | null |
2024-02-26 | Edge Detectors Can Make Deep Convolutional Neural Networks More Robust | Jin Ding et.al. | 2402.16479 | null |
2024-02-26 | CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency | Hanxin Zhu et.al. | 2402.16407 | null |
2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Zetian Song et.al. | 2402.16366 | null |
2024-02-26 | SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking | Yu Lin et.al. | 2402.16249 | link |
2024-02-25 | GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction | Xiao Chen et.al. | 2402.16174 | link |
2024-02-25 | Machine Learning-Based Vehicle Intention Trajectory Recognition and Prediction for Autonomous Driving | Hanyi Yu et.al. | 2402.16036 | null |
2024-02-24 | Construction and application of artificial intelligence crowdsourcing map based on multi-track GPS data | Yong Wang et.al. | 2402.15796 | null |
2024-03-30 | Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space | Yuan Lin et.al. | 2402.15790 | null |
2024-03-22 | Detection Is Tracking: Point Cloud Multi-Sweep Deep Learning Models Revisited | Lingji Chen et.al. | 2402.15756 | null |
2024-02-23 | Multi-Constraint Safe RL with Objective Suppression for Safety-Critical Applications | Zihan Zhou et.al. | 2402.15650 | null |
2024-02-23 | Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving | Yichen Xie et.al. | 2402.15583 | null |
2024-02-21 | PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain | Liang Chen et.al. | 2402.15527 | link |
2024-02-23 | EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Zhe Wang et.al. | 2402.15272 | link |
2024-02-22 | Path Planning based on 2D Object Bounding-box | Yanliang Huang et.al. | 2402.14933 | null |
2024-02-22 | Consolidating Attention Features for Multi-view Image Editing | Or Patashnik et.al. | 2402.14792 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642 | null |
2024-02-08 | Enhancement of High-definition Map Update Service Through Coverage-aware and Reinforcement Learning | Jeffrey Redondo et.al. | 2402.14582 | null |
2024-02-22 | RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation | Changsong Pang et.al. | 2402.14380 | link |
2024-02-22 | Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields | Seungtae Nam et.al. | 2402.14196 | null |
2024-02-23 | Blending Data-Driven Priors in Dynamic Games | Justin Lidard et.al. | 2402.14174 | null |
2024-02-21 | Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Joongho Jo et.al. | 2402.13827 | null |
2024-03-18 | Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving | Mehdi Azarafza et.al. | 2402.13602 | link |
2024-02-21 | EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization | Zhendong Xiao et.al. | 2402.13537 | null |
2024-02-21 | SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields | Zhentao Huang et.al. | 2402.13510 | null |
2024-02-21 | Learning to Model Diverse Driving Behaviors in Highly Interactive Autonomous Driving Scenarios with Multi-Agent Reinforcement Learning | Liu Weiwei et.al. | 2402.13481 | null |
2024-02-20 | How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey | Fabio Tosi et.al. | 2402.13255 | link |
2024-02-20 | VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning | Shaoyu Chen et.al. | 2402.13243 | link |
2024-03-02 | NeRF Solves Undersampled MRI Reconstruction | Tae Jun Jang et.al. | 2402.13226 | null |
2024-02-20 | 3D high-resolution imaging algorithm using 1D MIMO array for autonomous driving application | Sen Yuan et.al. | 2402.13062 | null |
2024-02-20 | Advancements in Point Cloud-Based 3D Defect Detection and Classification for Industrial Systems: A Comprehensive Survey | Anju Rani et.al. | 2402.12923 | null |
2024-02-20 | OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow | Simon Boeder et.al. | 2402.12792 | null |
2024-02-20 | Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept | Kui Wang et.al. | 2402.12682 | null |
2024-02-20 | Pre-trained Transformer-Enabled Strategies with Human-Guided Fine-Tuning for End-to-end Navigation of Autonomous Vehicles | Dong Hu et.al. | 2402.12666 | null |
2024-02-19 | UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object Tracking | Chang Won Lee et.al. | 2402.12303 | link |
2024-03-31 | DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models | Xiaoyu Tian et.al. | 2402.12289 | null |
2024-02-19 | Colorizing Monochromatic Radiance Fields | Yean Cheng et.al. | 2402.12184 | null |
2024-02-19 | Modified RRT* for Path Planning in Autonomous Driving | Sugirtha T et.al. | 2402.12129 | null |
2024-02-19 | Towards Explainable LiDAR Point Cloud Semantic Segmentation via Gradient Based Target Localization | Abhishek Kuriyal et.al. | 2402.12098 | link |
2024-02-21 | Surround-View Fisheye Optics in Computer Vision and Simulation: Survey and Challenges | Daniel Jakab et.al. | 2402.12041 | null |
2024-02-19 | One2Avatar: Generative Implicit Head Avatar For Few-shot User Adaptation | Zhixuan Yu et.al. | 2402.11909 | null |
2024-02-29 | SDGE: Stereo Guided Depth Estimation for 360 $^\circ$ Camera Sets | Jialei Xu et.al. | 2402.11791 | null |
2024-02-20 | GenAD: Generative End-to-End Autonomous Driving | Wenzhao Zheng et.al. | 2402.11502 | link |
2024-02-17 | Exploiting T-norms for Deep Learning in Autonomous Driving | Mihaela Cătălina Stoian et.al. | 2402.11362 | null |
2024-02-17 | CARLA-Autoware-Bridge: Facilitating Autonomous Driving Research with a Unified Framework for Simulation and Module Development | Gemb Kaljavesi et.al. | 2402.11239 | link |
2024-02-17 | Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review | Thang-Anh-Quan Nguyen et.al. | 2402.11141 | link |
2024-02-16 | RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Jianhao Yuan et.al. | 2402.10828 | null |
2024-02-16 | Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation | Steven Landgraf et.al. | 2402.10580 | null |
2024-02-26 | Barrier-Enhanced Homotopic Parallel Trajectory Optimization for Safety-Critical Autonomous Driving | Lei Zheng et.al. | 2402.10441 | null |
2024-02-15 | Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions | Muhammad Arbab Arshad et.al. | 2402.10344 | null |
2024-02-03 | Simulation-based Analysis of a Novel Loop-based Road Topology for Autonomous Vehicles | Stefan Ramdhan et.al. | 2402.10226 | null |
2024-02-08 | Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review | Anton Kuznietsov et.al. | 2402.10086 | null |
2024-01-29 | Review of the Learning-based Camera and Lidar Simulation Methods for Autonomous Driving Systems | Hamed Haghighi et.al. | 2402.10079 | null |
2024-02-15 | Exploiting Alpha Transparency In Language And Vision-Based AI Systems | David Noever et.al. | 2402.09671 | null |
2024-02-14 | How Secure Are Large Language Models (LLMs) for Navigation in Urban Environments? | Congcong Wen et.al. | 2402.09546 | null |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325 | link |
2024-02-14 | Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning | Michael Lanier et.al. | 2402.09290 | null |
2024-02-14 | Design and Realization of a Benchmarking Testbed for Evaluating Autonomous Platooning Algorithms | Michael Shaham et.al. | 2402.09233 | null |
2024-02-13 | Preconditioners for the Stochastic Training of Implicit Neural Representations | Shin-Fang Chng et.al. | 2402.08784 | null |
2024-02-13 | NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs | Michael Fischer et.al. | 2402.08622 | null |
2024-02-13 | Vehicle Behavior Prediction by Episodic-Memory Implanted NDT | Peining Shen et.al. | 2402.08423 | link |
2024-02-13 | MetaTra: Meta-Learning for Generalized Trajectory Prediction in Unseen Domain | Xiaohe Li et.al. | 2402.08221 | null |
2024-02-29 | Inherent Diverse Redundant Safety Mechanisms for AI-based Software Elements in Automotive Applications | Mandar Pitale et.al. | 2402.08208 | null |
2024-03-08 | H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields | Minyoung Park et.al. | 2402.08138 | null |
2024-02-12 | Interaction-Based Driving Scenario Classification and Labeling | Cheng Chang et.al. | 2402.07720 | null |
2024-02-12 | AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer | Tanmoy Dam et.al. | 2402.07680 | link |
2024-02-12 | DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation | Chenchang Li et.al. | 2402.07648 | null |
2024-02-12 | DART: A Compact Platform For Autonomous Driving Research | Lorenzo Lyons et.al. | 2402.07602 | null |
2024-02-11 | Towards Explainable, Safe Autonomous Driving with Language Embeddings for Novelty Identification and Active Learning: Framework and Experimental Analysis with Real-World Data Sets | Ross Greer et.al. | 2402.07320 | null |
2024-03-25 | BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis | Leandro A. Passos et.al. | 2402.07310 | link |
2024-02-11 | 3D Gaussian as a New Vision Era: A Survey | Ben Fei et.al. | 2402.07181 | null |
2024-02-09 | Neural Rendering based Urban Scene Reconstruction for Autonomous Driving | Shihao Shen et.al. | 2402.06826 | null |
2024-02-09 | Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following | Brian Yang et.al. | 2402.06559 | link |
2024-02-09 | CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention | Yifeng Bai et.al. | 2402.06423 | null |
2024-02-09 | ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting | Georgii Stanishevskii et.al. | 2402.06390 | link |
2024-02-25 | SIR: Multi-view Inverse Rendering with Decomposable Shadow for Indoor Scenes | Xiaokang Wei et.al. | 2402.06136 | null |
2024-02-08 | Driving Everywhere with Large Language Model Policy Adaptation | Boyi Li et.al. | 2402.05932 | null |
2024-03-11 | Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Yuxi Wei et.al. | 2402.05746 | link |
2024-02-08 | Optimizing Delegation in Collaborative Human-AI Hybrid Teams | Andrew Fuchs et.al. | 2402.05605 | null |
2024-02-09 | NCRF: Neural Contact Radiance Fields for Free-Viewpoint Rendering of Hand-Object Interaction | Zhongqun Zhang et.al. | 2402.05532 | null |
2024-02-07 | Compressing Deep Reinforcement Learning Networks with a Dynamic Structured Pruning Method for Autonomous Driving | Wensheng Su et.al. | 2402.05146 | null |
2024-02-07 | Tuning the feedback controller gains is a simple way to improve autonomous driving performance | Wenyu Liang et.al. | 2402.05064 | null |
2024-02-07 | Mesh-based Gaussian Splatting for Real-time Large-scale Deformation | Lin Gao et.al. | 2402.04796 | null |
2024-02-07 | Investigating Driving Interactions: A Robust Multi-Agent Simulation Framework for Autonomous Vehicles | Marc Kaufeld et.al. | 2402.04720 | link |
2024-02-07 | OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding | Guibiao Liao et.al. | 2402.04648 | link |
2024-02-07 | GSN: Generalisable Segmentation in Neural Radiance Field | Vinayak Gupta et.al. | 2402.04632 | link |
2024-02-11 | BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery | Huiqing Zhang et.al. | 2402.04554 | null |
2024-02-15 | LiDAR-Forest Dataset: LiDAR Point Cloud Simulation Dataset for Forestry Application | Yawen Lu et.al. | 2402.04546 | null |
2024-02-07 | BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision | Xin Zhao et.al. | 2402.04519 | null |
2024-03-08 | Human Observation-Inspired Trajectory Prediction for Autonomous Driving in Mixed-Autonomy Traffic Environments | Haicheng Liao et.al. | 2402.04318 | link |
2024-02-06 | Informed Reinforcement Learning for Situation-Aware Traffic Rule Exceptions | Daniel Bogdoll et.al. | 2402.04168 | link |
2024-02-06 | Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting | Yiming Xu et.al. | 2402.03981 | null |
2024-02-06 | OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving | Guohang Yan et.al. | 2402.03830 | link |
2024-02-05 | Efficient and Interpretable Traffic Destination Prediction using Explainable Boosting Machines | Yasin Yousif et.al. | 2402.03457 | link |
2024-02-20 | Denoising Diffusion via Image-Based Rendering | Titas Anciukevičius et.al. | 2402.03445 | null |
2024-02-05 | ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis | Bernard Spiegl et.al. | 2402.02906 | link |
2024-02-05 | Improving Robustness of LiDAR-Camera Fusion Model against Weather Corruption from Fusion Strategy Perspective | Yihao Huang et.al. | 2402.02738 | null |
2024-02-04 | A Review of Full-Sized Autonomous Racing Vehicle Sensor Architecture | Manuel Mar et.al. | 2402.02603 | null |
2024-02-04 | Synthesizing Follow-Up Drive Data for Enhanced Road Safety in Intelligent Driving Function Systems | Nico Schick et.al. | 2402.02598 | null |
2024-02-04 | SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving | Lu Zhang et.al. | 2402.02519 | link |
2024-02-04 | Hybrid-Prediction Integrated Planning for Autonomous Driving | Haochen Liu et.al. | 2402.02426 | null |
2024-02-03 | Evaluating the Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric analysis | Pankaj Deoli et.al. | 2402.02154 | link |
2024-02-03 | S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation | Yurui Chen et.al. | 2402.02112 | null |
2024-02-03 | Physical Perception Network and an All-weather Multi-modality Benchmark for Adverse Weather Image Fusion | Xilai Li et.al. | 2402.02090 | link |
2024-02-03 | RIDERS: Radar-Infrared Depth Estimation for Robust Sensing | Han Li et.al. | 2402.02067 | link |
2024-02-03 | Multimodal-Enhanced Objectness Learner for Corner Case Detection in Autonomous Driving | Lixing Xiao et.al. | 2402.02026 | link |
2024-02-03 | A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions | Hung Du et.al. | 2402.01968 | null |
2024-02-02 | Robust Inverse Graphics via Probabilistic Inference | Tuan Anh Le et.al. | 2402.01915 | link |
2024-02-02 | Efficient and Interaction-Aware Trajectory Planning for Autonomous Vehicles with Particle Swarm Optimization | Lin Song et.al. | 2402.01575 | null |
2024-02-02 | HyperPlanes: Hypernetwork Approach to Rapid NeRF Adaptation | Paweł Batorski et.al. | 2402.01524 | link |
2024-02-02 | Overcoming Blind Spots: Occlusion Considerations for Improved Autonomous Driving Safety | Korbinian Moller et.al. | 2402.01507 | link |
2024-02-02 | Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses | Mahboubeh Asadi et.al. | 2402.01485 | null |
2024-02-02 | A Reinforcement Learning-Boosted Motion Planning Framework: Comprehensive Generalization Performance in Autonomous Driving | Rainer Trauth et.al. | 2402.01465 | link |
2024-02-15 | GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting | Joanna Waczyńska et.al. | 2402.01459 | link |
2024-02-02 | Frenetix Motion Planner: High-Performance and Modular Trajectory Planning Algorithm for Complex Autonomous Driving Scenarios | Korbinian Moller et.al. | 2402.01443 | link |
2024-02-08 | A survey on robustness in trajectory prediction for autonomous vehicles | Jeroen Hagenus et.al. | 2402.01397 | null |
2024-02-02 | LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving | Daocheng Fu et.al. | 2402.01246 | null |
2024-02-06 | Taming Uncertainty in Sparse-view Generalizable NeRF via Indirect Diffusion Guidance | Yaokun Li et.al. | 2402.01217 | null |
2024-02-02 | A Survey for Foundation Models in Autonomous Driving | Haoxiang Gao et.al. | 2402.01105 | null |
2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Jiahua Dong et.al. | 2402.00864 | link |
2024-02-02 | Combining Belief Function Theory and Stochastic Model Predictive Control for Multi-Modal Uncertainty in Autonomous Driving | Tommaso Benciolini et.al. | 2402.00697 | null |
2024-02-01 | Fisheye Camera and Ultrasonic Sensor Fusion For Near-Field Obstacle Perception in Bird’s-Eye-View | Arindam Das et.al. | 2402.00637 | null |
2024-02-01 | Uncertainty-Aware Partial-Label Learning | Tobias Fuchs et.al. | 2402.00592 | link |
2024-02-01 | Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks | Bo Yang et.al. | 2402.00398 | null |
2024-02-01 | Multi-agent Path Finding for Cooperative Autonomous Driving | Zhongxia Yan et.al. | 2402.00334 | link |
2024-03-04 | SmartCooper: Vehicular Collaborative Perception with Adaptive Fusion and Judger Mechanism | Yuang Zhang et.al. | 2402.00321 | null |
2024-02-29 | Real-time Traffic Object Detection for Autonomous Driving | Abdul Hannan Khan et.al. | 2402.00128 | null |
2024-01-31 | CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting | Jiezhi Yang et.al. | 2401.18075 | null |
2024-01-31 | ReplaceAnything3D:Text-Guided 3D Scene Editing with Compositional Neural Radiance Fields | Edward Bartrum et.al. | 2401.17895 | null |
2024-02-01 | Segment Anything in 3D Gaussians | Xu Hu et.al. | 2401.17857 | link |
2024-02-19 | LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement | Renyuan Peng et.al. | 2401.17609 | link |
2024-01-30 | ATPPNet: Attention based Temporal Point cloud Prediction Network | Kaustab Pal et.al. | 2401.17399 | null |
2024-01-30 | Physical Priors Augmented Event-Based 3D Reconstruction | Jiaxu Wang et.al. | 2401.17121 | link |
2024-01-30 | MF-MOS: A Motion-Focused Model for Moving Object Segmentation | Jintao Cheng et.al. | 2401.17023 | link |
2024-01-30 | Evaluation of Out-of-Distribution Detection Performance on Autonomous Driving Datasets | Jens Henriksson et.al. | 2401.17013 | null |
2024-01-30 | The Why, When, and How to Use Active Learning in Large-Data-Driven 3D Object Detection for Safe Autonomous Driving: An Empirical Exploration | Ross Greer et.al. | 2401.16634 | null |
2024-01-30 | I came, I saw, I certified: some perspectives on the safety assurance of cyber-physical systems | Mithila Sivakumar et.al. | 2401.16633 | null |
2024-03-09 | Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting | Yiming Huang et.al. | 2401.16416 | link |
2024-01-29 | SuNeRF: 3D reconstruction of the solar EUV corona using Neural Radiance Fields | Robert Jarolim et.al. | 2401.16388 | null |
2024-01-29 | FIMP: Future Interaction Modeling for Multi-Agent Motion Prediction | Sungmin Woo et.al. | 2401.16189 | null |
2024-01-29 | Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields | Rongkai Ma et.al. | 2401.16144 | null |
2024-01-29 | DeFlow: Decoder of Scene Flow Network in Autonomous Driving | Qingwen Zhang et.al. | 2401.16122 | link |
2024-01-29 | A Concise but Effective Network for Image Guided Depth Completion in Autonomous Driving | Moyun Liu et.al. | 2401.15902 | link |
2024-01-30 | GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes and Minimalist Workflow | Liguo Zhou et.al. | 2401.15803 | link |
2024-01-27 | AniDress: Animatable Loose-Dressed Avatar from Sparse Views Using Garment Rigging Model | Beijia Chen et.al. | 2401.15348 | null |
2024-01-27 | You Only Look Bottom-Up for Monocular 3D Object Detection | Kaixin Xiong et.al. | 2401.15319 | null |
2024-01-27 | Learning Online Belief Prediction for Efficient POMDP Planning in Autonomous Driving | Zhiyu Huang et.al. | 2401.15315 | null |
2024-01-26 | Learning Neural Radiance Fields of Forest Structure for Scalable and Fine Monitoring | Juan Castorena et.al. | 2401.15029 | null |
2024-01-26 | DAM: Diffusion Activation Maximization for 3D Global Explanations | Hanxiao Tan et.al. | 2401.14938 | link |
2024-01-26 | 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field | Zhenyu Bao et.al. | 2401.14726 | link |
2024-01-25 | Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation | Jiaxu Wang et.al. | 2401.14354 | null |
2024-01-25 | Unlocking Past Information: Temporal Embeddings in Cooperative Bird’s Eye View Prediction | Dominik Rößle et.al. | 2401.14325 | null |
2024-01-25 | Optimization-based motion primitive automata for autonomous driving | Matheus V. A. Pedrosa et.al. | 2401.14276 | null |
2024-01-27 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Minglin Chen et.al. | 2401.14257 | null |
2024-01-24 | S2TPVFormer: Spatio-Temporal Tri-Perspective View for temporally coherent 3D Semantic Occupancy Prediction | Sathira Silva et.al. | 2401.13785 | null |
2024-02-29 | ADMap: Anti-disturbance framework for reconstructing online vectorized HD map | Haotian Hu et.al. | 2401.13172 | link |
2024-01-26 | Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies | Lincan Li et.al. | 2401.12888 | link |
2024-01-23 | NeRF-AD: Neural Radiance Field with Attention-based Disentanglement for Talking Face Synthesis | Chongke Bi et.al. | 2401.12568 | null |
2024-01-23 | DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations | Dogyun Park et.al. | 2401.12517 | link |
2024-01-23 | Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration | Yifan Zhang et.al. | 2401.12452 | link |
2024-03-14 | Template-Free Single-View 3D Human Digitalization with Diffusion-Guided LRM | Zhenzhen Weng et.al. | 2401.12175 | null |
2024-01-22 | Enhancing Safety in Nonlinear Systems: Design and Stability Analysis of Adaptive Cruise Control | Fan Yang et.al. | 2401.11961 | null |
2024-03-10 | Large receptive field strategy and important feature extraction strategy in 3D object detection | Leichao Cui et.al. | 2401.11913 | null |
2024-01-22 | First-principles Based 3D Virtual Simulation Testing for Discovering SOTIF Corner Cases of Autonomous Driving | Lehang Li et.al. | 2401.11876 | null |
2024-03-14 | Safe and Generalized end-to-end Autonomous Driving System with Reinforcement Learning and Demonstrations | Zuojin Tang et.al. | 2401.11792 | null |
2024-01-22 | HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs | Zelin Gao et.al. | 2401.11711 | null |
2024-01-21 | Self-Supervised Bird’s Eye View Motion Prediction with Cross-Modality Signals | Shaoheng Fang et.al. | 2401.11499 | link |
2024-01-29 | S $^3$ M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving | Zhiyuan Wu et.al. | 2401.11414 | null |
2024-03-01 | Enhancing System-Level Safety in Mixed-Autonomy Platoon via Safe Reinforcement Learning | Jingyuan Zhou et.al. | 2401.11148 | link |
2024-01-19 | Exploring Highly Quantised Neural Networks for Intrusion Detection in Automotive CAN | Shashwat Khandelwal et.al. | 2401.11030 | null |
2024-01-19 | A Lightweight Multi-Attack CAN Intrusion Detection System on Hybrid FPGAs | Shashwat Khandelwal et.al. | 2401.10689 | null |
2024-01-19 | Deep Learning-based Embedded Intrusion Detection System for Automotive CAN | Shashwat Khandelwal et.al. | 2401.10674 | null |
2024-01-19 | BadODD: Bangladeshi Autonomous Driving Object Detection Dataset | Mirza Nihal Baig et.al. | 2401.10659 | null |
2024-01-19 | Episodic Reinforcement Learning with Expanded State-reward Space | Dayang Liang et.al. | 2401.10516 | null |
2024-01-19 | Towards Automated Driving Violation Cause Analysis in Scenario-Based Testing for Autonomous Driving Systems | Ziwen Wan et.al. | 2401.10443 | null |
2024-01-18 | Reconstructing the Invisible: Video Frame Restoration through Siamese Masked Conditional Variational Autoencoder | Yongchen Zhou et.al. | 2401.10402 | null |
2024-01-18 | Analyzing and Mitigating Bias for Vulnerable Classes: Towards Balanced Representation in Dataset | Dewant Katare et.al. | 2401.10397 | null |
2024-01-18 | LangProp: A code optimization framework using Language Models applied to driving | Shu Ishida et.al. | 2401.10314 | link |
2024-01-18 | Hacking Predictors Means Hacking Cars: Using Sensitivity Analysis to Identify Trajectory Prediction Vulnerabilities for Autonomous Driving Security | Marsalis Gibson et.al. | 2401.10313 | null |
2024-01-18 | Model-Assisted Learning for Adaptive Cooperative Perception of Connected Autonomous Vehicles | Kaige Qu et.al. | 2401.10156 | null |
2024-01-16 | Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving | Jie Lv et.al. | 2401.10153 | null |
2024-01-23 | IPR-NeRF: Ownership Verification meets Neural Radiance Field | Win Kent Ong et.al. | 2401.09495 | null |
2024-01-18 | Stream Query Denoising for Vectorized HD Map Construction | Shuo Wang et.al. | 2401.09112 | null |
2024-01-17 | Enhancing Campus Mobility: Achievements and Challenges of Autonomous Shuttle “Snow Lion’‘ | Yingbing Chen et.al. | 2401.08939 | null |
2024-01-17 | ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization | Weiyao Wang et.al. | 2401.08937 | null |
2023-12-27 | Risk-anticipatory autonomous driving strategies considering vehicles’ weights, based on hierarchical deep reinforcement learning | Di Chen et.al. | 2401.08661 | null |
2023-12-26 | End-To-End Planning of Autonomous Driving in Industry and Academia: 2022-2023 | Gongjin Lan an Qi Hao et.al. | 2401.08658 | null |
2023-12-25 | Digital Twins for Autonomous Driving: A Comprehensive Implementation and Demonstration | Kui Wang et.al. | 2401.08653 | null |
2024-01-18 | ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Process | Kiyohiro Nakayama et.al. | 2401.08140 | null |
2024-01-16 | Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities | Xu Yan et.al. | 2401.08045 | link |
2024-01-15 | Pedestrian Detection in Low-Light Conditions: A Comprehensive Survey | Bahareh Ghari et.al. | 2401.07801 | null |
2024-01-15 | Semantic Scene Segmentation for Robotics | Juana Valeria Hurtado et.al. | 2401.07589 | null |
2024-01-15 | Geo-locating Road Objects using Inverse Haversine Formula with NVIDIA Driveworks | Mamoona Birkhez Shami et.al. | 2401.07582 | null |
2024-02-09 | RSUD20K: A Dataset for Road Scene Understanding In Autonomous Driving | Hasib Zunair et.al. | 2401.07322 | link |
2024-01-14 | Photonic real time video image signal processor at 17Tb/s based on a Kerr microcomb | Mengxi Tan et.al. | 2401.07197 | null |
2024-01-13 | ACAV: A Framework for Automatic Causality Analysis in Autonomous Vehicle Accident Recordings | Huijia Sun et.al. | 2401.07063 | null |
2024-01-13 | UniVision: A Unified Framework for Vision-Centric 3D Perception | Yu Hong et.al. | 2401.06994 | null |
2024-01-12 | Open RAN LSTM Traffic Prediction and Slice Management using Deep Reinforcement Learning | Fatemeh Lotfi et.al. | 2401.06922 | null |
2024-01-12 | Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction | Muhammad Naveed Riaz et.al. | 2401.06757 | link |
2024-01-12 | Real-time MPC with Control Barrier Functions for Autonomous Driving using Safety Enhanced Collocation | Jean Pierre Allamaa et.al. | 2401.06648 | null |
2024-01-12 | Enhancing Throughput for TTEthernet via Co-optimizing Routing and Scheduling: An Online Time-Varying Graph-based Method | Yaoxu He et.al. | 2401.06579 | null |
2024-01-12 | Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook | Ziying Song et.al. | 2401.06542 | null |
2024-01-12 | Personalized Reinforcement Learning with a Budget of Policies | Dmitry Ivanov et.al. | 2401.06514 | link |
2024-01-12 | Multi-Profile Quadratic Programming (MPQP) for Optimal Gap Selection and Speed Planning of Autonomous Driving | Alexandre Miranda Anon et.al. | 2401.06305 | null |
2024-01-11 | TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation | Rajaei Khatib et.al. | 2401.06191 | null |
2024-01-11 | Fast High Dynamic Range Radiance Fields for Dynamic Scenes | Guanjun Wu et.al. | 2401.06052 | null |
2024-01-11 | GO-NeRF: Generating Virtual Objects in Neural Radiance Fields | Peng Dai et.al. | 2401.05750 | null |
2024-01-10 | Diffusion Priors for Dynamic View Synthesis from Monocular Videos | Chaoyang Wang et.al. | 2401.05583 | null |
2024-03-09 | VLP: Vision Language Planning for Autonomous Driving | Chenbin Pan et.al. | 2401.05577 | null |
2024-01-10 | FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields | GeonU Kim et.al. | 2401.05516 | null |
2024-01-10 | Autonomous Navigation of Tractor-Trailer Vehicles through Roundabout Intersections | Daniel Attard et.al. | 2401.04980 | null |
2024-01-10 | Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics | Beiwen Tian et.al. | 2401.04942 | null |
2024-01-10 | CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video | Xingyu Miao et.al. | 2401.04861 | link |
2024-01-10 | Learning Racing From an AI Coach: Effects of Multimodal Autonomous Driving Explanations on Driving Performance, Cognitive Load, Expertise, and Trust | Robert Kaufman et.al. | 2401.04206 | null |
2024-01-08 | RoboFusion: Towards Robust Multi-Modal 3D obiect Detection via SAM | Ziying Song et.al. | 2401.03907 | link |
2024-01-08 | A Survey on 3D Gaussian Splatting | Guikun Chen et.al. | 2401.03890 | link |
2024-01-08 | UFO: Unidentified Foreground Object Detection in 3D Point Cloud | Hyunjun Choi et.al. | 2401.03846 | null |
2024-01-15 | WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Chenhongyi Yang et.al. | 2401.03836 | link |
2024-01-08 | Safe Chance-constrained Model Predictive Control under Gaussian Mixture Model Uncertainty | Kai Ren et.al. | 2401.03799 | null |
2024-01-08 | NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation | Casimir Feldmann et.al. | 2401.03771 | null |
2024-01-08 | DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving | Wencheng Han et.al. | 2401.03641 | null |
2024-01-08 | DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement | Jiaqi Liu et.al. | 2401.03629 | null |
2024-01-07 | Text-Driven Traffic Anomaly Detection with Temporal High-Frequency Modeling in Driving Videos | Rongqin Liang et.al. | 2401.03522 | null |
2024-01-16 | Reconfigurable Holographic Surface Aided Wireless Simultaneous Localization and Mapping | Haobo Zhang et.al. | 2401.03453 | null |
2024-01-06 | RustNeRF: Robust Neural Radiance Field with Low-Quality Images | Mengfei Li et.al. | 2401.03257 | null |
2024-01-06 | Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping | Tongyan Hua et.al. | 2401.03203 | null |
2024-01-06 | DistFormer: Enhancing Local and Global Features for Monocular Per-Object Distance Estimation | Aniello Panariello et.al. | 2401.03191 | link |
2024-02-19 | Human as AI Mentor: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving | Zilin Huang et.al. | 2401.03160 | link |
2024-01-06 | Transferable Learned Image Compression-Resistant Adversarial Perturbations | Yang Sui et.al. | 2401.03115 | null |
2024-01-08 | Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction | Yuxin Yang et.al. | 2401.02916 | null |
2024-01-05 | Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human | Song Bai et.al. | 2401.02620 | null |
2024-01-04 | OptFlow: Fast Optimization-based Scene Flow Estimation without Supervision | Rahul Ahuja et.al. | 2401.02550 | null |
2024-01-04 | REDriver: Runtime Enforcement for Autonomous Vehicles | Yang Sun et.al. | 2401.02253 | null |
2024-01-04 | Inherently robust suboptimal MPC for autonomous racing with anytime feasible SQP | Logan Numerow et.al. | 2401.02194 | null |
2024-01-03 | SIGNeRF: Scene Integrated Generation for Neural Radiance Fields | Jan-Niklas Dihlmann et.al. | 2401.01647 | null |
2024-01-03 | Context-Aware Interaction Network for RGB-T Semantic Segmentation | Ying Lv et.al. | 2401.01624 | link |
2024-01-03 | Collaborative Perception for Connected and Autonomous Driving: Challenges, Possible Solutions and Opportunities | Senkang Hu et.al. | 2401.01544 | null |
2024-01-02 | A Survey on Autonomous Driving Datasets: Data Statistic, Annotation, and Outlook | Mingyu Liu et.al. | 2401.01454 | link |
2024-01-02 | Off-Road LiDAR Intensity Based Semantic Segmentation | Kasi Viswanath et.al. | 2401.01439 | link |
2023-12-28 | Fast Quantum Convolutional Neural Networks for Low-Complexity Object Detection in Autonomous Driving Applications | Hankyul Baek et.al. | 2401.01370 | null |
2024-01-02 | Temporal Adaptive RGBT Tracking with Modality Prompt | Hongyu Wang et.al. | 2401.01244 | null |
2024-01-02 | Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise | Qinglong Huang et.al. | 2401.01216 | null |
2024-01-05 | PLE-SLAM: A Visual-Inertial SLAM Based on Point-Line Features and Efficient IMU Initialization | Jiaming He et.al. | 2401.01081 | link |
2024-01-02 | BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving | Dafeng Wei et.al. | 2401.01065 | null |
2024-01-02 | Holistic Autonomous Driving Understanding by Bird’s-Eye-View Injected Multi-Modal Large Models | Xinpeng Ding et.al. | 2401.00988 | link |
2024-01-02 | 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands | Xuan Huang et.al. | 2401.00979 | link |
2024-01-16 | WoodScape Motion Segmentation for Autonomous Driving – CVPR 2023 OmniCV Workshop Challenge | Saravanabalagi Ramachandran et.al. | 2401.00910 | null |
2023-12-30 | PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields | Zheng Chen et.al. | 2401.00871 | null |
2024-01-01 | Deblurring 3D Gaussian Splatting | Byeonghyeon Lee et.al. | 2401.00834 | null |
2024-01-01 | Socially Compliant Control of Autonomous Vehicles with Application to Eco-Driving | Shian Wang et.al. | 2401.00830 | null |
2024-01-01 | Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior | Byeonghyeon Lee et.al. | 2401.00825 | link |
2024-01-02 | GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields | Xiao Pan et.al. | 2401.00616 | null |
2023-12-31 | RainSD: Rain Style Diversification Module for Image Synthesis Enhancement using Feature-Level Style Distribution | Hyeonjae Jeon et.al. | 2401.00460 | null |
2023-12-31 | Controllable Safety-Critical Closed-loop Traffic Simulation via Guided Diffusion | Wei-Jer Chang et.al. | 2401.00391 | null |
2023-12-30 | Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models | Han Jiang et.al. | 2401.00208 | null |
2023-12-30 | LLM-Assist: Enhancing Closed-Loop Planning with Language-Based Reasoning | S P Sharan et.al. | 2401.00125 | null |
2024-01-07 | Generative AI-driven Semantic Communication Networks: Architecture, Technologies and Applications | Chengsi Liang et.al. | 2401.00124 | null |
2023-12-29 | Visual Point Cloud Forecasting enables Scalable Autonomous Driving | Zetong Yang et.al. | 2312.17655 | link |
2023-12-29 | Informative Rays Selection for Few-Shot Neural Radiance Fields | Marco Orsingher et.al. | 2312.17561 | null |
2023-12-22 | TimePillars: Temporally-Recurrent 3D LiDAR Object Detection | Ernesto Lozano Calvo et.al. | 2312.17260 | null |
2024-01-29 | FENet: Focusing Enhanced Network for Lane Detection | Liman Wang et.al. | 2312.17163 | link |
2023-12-29 | Fully Sparse 3D Panoptic Occupancy Prediction | Haisong Liu et.al. | 2312.17118 | link |
2023-12-28 | DOEPatch: Dynamically Optimized Ensemble Model for Adversarial Patches Generation | Wenyi Tan et.al. | 2312.16907 | null |
2023-12-27 | LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization | Sai Shubodh Puligilla et.al. | 2312.16648 | null |
2023-12-27 | Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning | Amin Jalal Aghdasian et.al. | 2312.16620 | link |
2023-12-29 | DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision | Lu Ling et.al. | 2312.16256 | null |
2023-12-24 | SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition | Nikhil Behari et.al. | 2312.16215 | null |
2023-12-23 | INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields | Andrew Hou et.al. | 2312.16197 | null |
2024-02-26 | LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving | Tianyu Li et.al. | 2312.16108 | link |
2023-12-26 | 2D-Guided 3D Gaussian Segmentation | Kun Lan et.al. | 2312.16047 | null |
2023-12-26 | Adaptive Kalman-based hybrid car following strategy using TD3 and CACC | Yuqi Zheng et.al. | 2312.15993 | null |
2024-02-23 | Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images | Zhan Lu et.al. | 2312.15942 | link |
2023-12-26 | Attention-aware Social Graph Transformer Networks for Stochastic Trajectory Prediction | Yao Liu et.al. | 2312.15881 | null |
2023-12-25 | Contrastive Learning-Based Framework for Sim-to-Real Mapping of Lidar Point Clouds in Autonomous Driving Systems | Hamed Haghighi et.al. | 2312.15817 | link |
2023-12-25 | Neural BSSRDF: Object Appearance Representation Including Heterogeneous Subsurface Scattering | Thomson TG et.al. | 2312.15711 | null |
2023-12-25 | A Survey on Open-Set Image Recognition | Jiayin Sun et.al. | 2312.15571 | null |
2023-12-23 | Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane | Chen Yang et.al. | 2312.15253 | link |
2023-12-23 | Pre-trained Trojan Attacks for Visual Recognition | Aishan Liu et.al. | 2312.15172 | null |
2024-02-08 | Scaling Is All You Need: Autonomous Driving with JAX-Accelerated Reinforcement Learning | Moritz Harmel et.al. | 2312.15122 | null |
2023-12-22 | Deformable 3D Gaussian Splatting for Animatable Human Avatars | HyunJun Jung et.al. | 2312.15059 | null |
2023-12-26 | Lift-Attend-Splat: Bird’s-eye-view camera-lidar fusion using transformers | James Gunn et.al. | 2312.14919 | null |
2023-12-22 | PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF | Mohsen Gholami et.al. | 2312.14915 | link |
2023-12-22 | Density Uncertainty Quantification with NeRF-Ensembles: Impact of Data and Scene Constraints | Miriam Jäger et.al. | 2312.14664 | null |
2023-12-22 | Explainable Multi-Camera 3D Object Detection with Transformer-Based Saliency Maps | Till Beemelmanns et.al. | 2312.14606 | null |
2023-12-22 | MonoLSS: Learnable Sample Selection For Monocular 3D Detection | Zhenjia Li et.al. | 2312.14474 | link |
2023-12-21 | PlatoNeRF: 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar | Tzofi Klinghoffer et.al. | 2312.14239 | null |
2023-12-21 | DriveLM: Driving with Graph Visual Question Answering | Chonghao Sima et.al. | 2312.14150 | link |
2023-12-21 | Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation | Philipp Schröppel et.al. | 2312.14124 | link |
2023-12-21 | LingoQA: Video Question Answering for Autonomous Driving | Ana-Maria Marcu et.al. | 2312.14115 | link |
2024-02-18 | Gaussian Splatting with NeRF-based Color and Opacity | Dawid Malarz et.al. | 2312.13729 | link |
2023-12-21 | DyBluRF: Dynamic Deblurring Neural Radiance Fields for Blurry Monocular Video | Minh-Quan Viet Bui et.al. | 2312.13528 | null |
2023-12-20 | NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields | Jens Naumann et.al. | 2312.13471 | null |
2023-12-20 | Building Lane-Level Maps from Aerial Images | Jiawei Yao et.al. | 2312.13449 | link |
2023-12-20 | ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors | Weijia Mao et.al. | 2312.13324 | null |
2023-12-19 | Compact 3D Scene Representation via Self-Organizing Gaussian Grids | Wieland Morgenstern et.al. | 2312.13299 | link |
2023-12-20 | Deep Learning on 3D Neural Fields | Pierluigi Zama Ramirez et.al. | 2312.13277 | null |
2023-12-29 | AccidentGPT: Accident Analysis and Prevention from V2X Environmental Perception with Multi-modal Large Model | Lening Wang et.al. | 2312.13156 | null |
2024-01-10 | Optimizing Ego Vehicle Trajectory Prediction: The Graph Enhancement Approach | Sushil Sharma et.al. | 2312.13104 | null |
2023-12-20 | SpecNeRF: Gaussian Directional Encoding for Specular Reflections | Li Ma et.al. | 2312.13102 | null |
2024-01-17 | PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation | Yue-Jiang Dong et.al. | 2312.13066 | null |
2023-12-20 | TADAP: Trajectory-Aided Drivable area Auto-labeling with Pre-trained self-supervised features in winter driving conditions | Eerik Alamikkotervo et.al. | 2312.12954 | null |
2023-12-20 | PointeNet: A Lightweight Framework for Effective and Efficient Point Cloud Analysis | Lipeng Gu et.al. | 2312.12743 | null |
2023-12-20 | Reducing Shape-Radiance Ambiguity in Radiance Fields with a Closed-Form Color Estimation Method | Qihang Fang et.al. | 2312.12726 | link |
2023-12-19 | Studying the Practices of Testing Machine Learning Software in the Wild | Moses Openja et.al. | 2312.12604 | link |
2024-01-23 | Tracking Any Object Amodally | Cheng-Yen Hsieh et.al. | 2312.12433 | link |
2023-12-19 | First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria | Stefan Schoder et.al. | 2312.12314 | null |
2023-12-19 | M-BEV: Masked BEV Perception for Robust Autonomous Driving | Siran Chen et.al. | 2312.12144 | link |
2023-12-19 | ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields | Xiang Feng et.al. | 2312.12122 | null |
2023-12-19 | Parameterized Decision-making with Multi-modal Perception for Autonomous Driving | Yuyang Xia et.al. | 2312.11935 | null |
2024-01-22 | MixRT: Mixed Neural Representations For Real-Time NeRF Rendering | Chaojian Li et.al. | 2312.11841 | null |
2023-12-19 | Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving | Junkai Xu et.al. | 2312.11837 | link |
2023-12-19 | Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation | Yuze He et.al. | 2312.11774 | null |
2023-12-20 | FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline | Chien-Yu Lin et.al. | 2312.11537 | null |
2023-12-18 | GauFRe: Gaussian Deformation Fields for Real-time Dynamic Novel View Synthesis | Yiqing Liang et.al. | 2312.11458 | null |
2023-12-18 | Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects | Min Hua et.al. | 2312.11084 | link |
2023-12-18 | Multi-Correlation Siamese Transformer Network with Dense Connection for 3D Single Object Tracking | Shihao Feng et.al. | 2312.11051 | link |
2023-12-18 | AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis | Dongze Li et.al. | 2312.10921 | null |
2023-12-17 | PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields | Boming Zhao et.al. | 2312.10649 | null |
2023-12-17 | Physics-informed Representation and Learning: Control and Risk Quantification | Zhuoyuan Wang et.al. | 2312.10594 | link |
2023-12-16 | Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning | Rohan Banerjee et.al. | 2312.10557 | link |
2023-12-19 | Learning Dense Correspondence for NeRF-Based Face Reenactment | Songlin Yang et.al. | 2312.10422 | null |
2023-12-19 | Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing | Lyudong Jin et.al. | 2312.10418 | null |
2023-12-15 | Constrained Meta-Reinforcement Learning for Adaptable Safety Guarantee with Differentiable Convex Programming | Minjae Cho et.al. | 2312.10230 | link |
2023-12-15 | Communication-Efficient Soft Actor-Critic Policy Collaboration via Regulated Segment Mixture in Internet of Vehicles | Xiaoxue Yu et.al. | 2312.10123 | null |
2023-12-15 | SlimmeRF: Slimmable Radiance Fields | Shiran Yuan et.al. | 2312.10034 | link |
2023-12-15 | Neurosymbolic Value-Inspired AI (Why, What, and How) | Amit Sheth et.al. | 2312.09928 | null |
2023-12-15 | LAENeRF: Local Appearance Editing for Neural Radiance Fields | Lukas Radl et.al. | 2312.09913 | null |
2023-12-15 | RANRAC: Robust Neural Scene Representations via Random Ray Consensus | Benno Buschmann et.al. | 2312.09780 | null |
2023-12-15 | SLS4D: Sparse Latent Space for 4D Novel View Synthesis | Qi-Yuan Feng et.al. | 2312.09743 | null |
2023-12-15 | NeuroFlow: Development of lightweight and efficient model integration scheduling strategy for autonomous driving system | Eunbin Seo et.al. | 2312.09588 | null |
2023-12-15 | Embodied Adversarial Attack: A Dynamic Robust Physical Attack in Autonomous Driving | Yitong Sun et.al. | 2312.09554 | null |
2023-12-15 | DriveTrack: A Benchmark for Long-Range Point Tracking in Real-World Videos | Arjun Balasingam et.al. | 2312.09523 | null |
2023-12-26 | SlowTrack: Increasing the Latency of Camera-based Perception in Autonomous Driving Using Adversarial Examples | Chen Ma et.al. | 2312.09520 | null |
2023-12-15 | EDA: Evolving and Distinct Anchors for Multimodal Motion Prediction | Longzhong Lin et.al. | 2312.09501 | link |
2023-12-14 | Large Language Models for Autonomous Driving: Real-World Experiments | Can Cui et.al. | 2312.09397 | null |
2023-12-14 | ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining | Ruoxi Shi et.al. | 2312.09249 | null |
2023-12-25 | DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving | Wenhai Wang et.al. | 2312.09245 | link |
2023-12-14 | OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields | Chubin Zhang et.al. | 2312.09243 | link |
2023-12-15 | 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting | Zhiyin Qian et.al. | 2312.09228 | null |
2023-12-15 | ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field | Zhangkai Ni et.al. | 2312.09095 | link |
2023-12-15 | Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption | Ziteng Cui et.al. | 2312.09093 | link |
2023-12-14 | iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching | Yuan Sun et.al. | 2312.09031 | null |
2023-12-14 | Scene 3-D Reconstruction System in Scattering Medium | Zhuoyifan Zhang et.al. | 2312.09005 | null |
2023-12-14 | VaLID: Variable-Length Input Diffusion for Novel View Synthesis | Shijie Li et.al. | 2312.08892 | null |
2023-12-14 | CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning | Qingsong Yan et.al. | 2312.08760 | null |
2023-12-14 | SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field | Ru Li et.al. | 2312.08692 | link |
2023-12-13 | ProNeRF: Learning Efficient Projection-Aware Ray Sampling for Fine-Grained Implicit Neural Radiance Fields | Juan Luis Gonzalez Bello et.al. | 2312.08136 | null |
2023-12-13 | Neural Radiance Fields for Transparent Object Using Visual Hull | Heechan Yoon et.al. | 2312.08118 | null |
2023-12-13 | 3DGEN: A GAN-based approach for generating novel 3D models from image data | Antoine Schnepf et.al. | 2312.08094 | null |
2023-12-14 | Semi-Supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix | Kewei Wang et.al. | 2312.08009 | link |
2023-12-13 | Instance-aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning | Yang Jiao et.al. | 2312.08004 | null |
2023-12-13 | DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes | Xiaoyu Zhou et.al. | 2312.07920 | link |
2023-12-11 | Spatiotemporal Event Graphs for Dynamic Scene Understanding | Salman Khan et.al. | 2312.07621 | null |
2023-12-12 | COLMAP-Free 3D Gaussian Splatting | Yang Fu et.al. | 2312.07504 | link |
2023-12-21 | LMDrive: Closed-Loop End-to-End Driving with Large Language Models | Hao Shao et.al. | 2312.07488 | link |
2023-12-12 | Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analysis, and Insights into Open-set Object Discovery | Aitor Martinez Seras et.al. | 2312.07466 | link |
2023-12-13 | How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation | Zhongyi Han et.al. | 2312.07424 | link |
2023-12-12 | Autonomous driving of trucks in off-road environment | Kenny A. Q. Caldas et.al. | 2312.07382 | null |
2023-12-17 | MWSIS: Multimodal Weakly Supervised Instance Segmentation with 2D Box Annotations for Autonomous Driving | Guangfeng Jiang et.al. | 2312.06988 | link |
2023-12-12 | WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction | Jingchun Zhou et.al. | 2312.06946 | null |
2023-12-11 | Scalable Decentralized Cooperative Platoon using Multi-Agent Deep Reinforcement Learning | Ahmed Abdelrahman et.al. | 2312.06858 | null |
2023-12-10 | TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video | Minye Wu et.al. | 2312.06713 | null |
2023-12-10 | Dynamic Adversarial Attacks on Autonomous Driving Systems | Amirhosein Chahe et.al. | 2312.06701 | link |
2023-12-11 | Learning Naturally Aggregated Appearance for Efficient 3D Editing | Ka Leong Cheng et.al. | 2312.06657 | link |
2023-12-11 | CorresNeRF: Image Correspondence Priors for Neural Radiance Fields | Yixing Lao et.al. | 2312.06642 | link |
2023-12-15 | BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving | Haicheng Liao et.al. | 2312.06371 | link |
2023-12-11 | NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup Annotations | Yuichi Inoue et.al. | 2312.06352 | link |
2023-12-11 | Evaluation of Large Language Models for Decision Making in Autonomous Driving | Kotaro Tanahashi et.al. | 2312.06351 | null |
2023-12-11 | Attribute Annotation and Bias Evaluation in Visual Datasets for Autonomous Driving | David Fernández Llorca et.al. | 2312.06306 | link |
2023-12-11 | Interpretable Long Term Waypoint-Based Trajectory Prediction Model | Amina Ghoul et.al. | 2312.06219 | null |
2023-12-11 | Recent Advances in Deterministic Human Motion Prediction: A Review | Tenghao Deng et.al. | 2312.06184 | null |
2023-12-11 | M3SOT: Multi-frame, Multi-field, Multi-space 3D Single Object Tracking | Jiaming Liu et.al. | 2312.06117 | link |
2023-12-10 | GenDepth: Generalizing Monocular Depth Estimation for Arbitrary Camera Parameters via Ground Plane Embedding | Karlo Koledić et.al. | 2312.06021 | null |
2023-12-10 | Learning for CasADi: Data-driven Models in Numerical Optimization | Tim Salzmann et.al. | 2312.05873 | link |
2023-12-10 | NeVRF: Neural Video-based Radiance Fields for Long-duration Sequences | Minye Wu et.al. | 2312.05855 | null |
2023-12-10 | Beyond One Model Fits All: Ensemble Deep Learning for Autonomous Vehicles | Hemanth Manjunatha et.al. | 2312.05759 | null |
2023-12-10 | Camera-based 3D Semantic Scene Completion with Sparse Guidance Network | Jianbiao Mei et.al. | 2312.05752 | link |
2023-12-10 | IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment | Letian Zhang et.al. | 2312.05748 | null |
2023-12-09 | CoGS: Controllable Gaussian Splatting | Heng Yu et.al. | 2312.05664 | null |
2023-12-08 | 360° Volumetric Portrait Avatar | Jalees Nehvi et.al. | 2312.05311 | null |
2023-12-11 | Nuvo: Neural UV Mapping for Unruly 3D Representations | Pratul P. Srinivasan et.al. | 2312.05283 | null |
2023-12-08 | SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation | Thuan Hoang Nguyen et.al. | 2312.05239 | link |
2023-12-08 | TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis | Heming Zhu et.al. | 2312.05161 | null |
2023-12-08 | An Autonomous Driving model with BEV-V2X Perception, Trajectory Prediction and Driving Planning in Complex Traffic Intersections | Fukang Li et.al. | 2312.05104 | null |
2023-12-08 | Radar Perception in Autonomous Driving: Exploring Different Data Representations | Shanliang Yao et.al. | 2312.04861 | link |
2023-12-07 | Fine-Grained Extraction of Road Networks via Joint Learning of Connectivity and Segmentation | Yijia Xu et.al. | 2312.04744 | null |
2023-12-07 | NeuSD: Surface Completion with Multi-View Text-to-Image Diffusion | Savva Ignatyev et.al. | 2312.04654 | null |
2023-12-07 | VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment | Phong Tran et.al. | 2312.04651 | null |
2023-12-07 | EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS | Sharath Girish et.al. | 2312.04564 | link |
2023-12-07 | FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation | Xiang Xu et.al. | 2312.04484 | link |
2023-12-07 | Deep Dynamics: Vehicle Dynamics Modeling with a Physics-Informed Neural Network for Autonomous Racing | John Chrosniak et.al. | 2312.04374 | null |
2023-12-07 | LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs | Yunsheng Ma et.al. | 2312.04372 | link |
2023-12-07 | Multi-View Unsupervised Image Generation with Cross Attention Guidance | Llukman Cerkezi et.al. | 2312.04337 | null |
2023-12-12 | Towards Knowledge-driven Autonomous Driving | Xin Li et.al. | 2312.04316 | link |
2023-12-07 | Towards 4D Human Video Stylization | Tiantian Wang et.al. | 2312.04143 | link |
2023-12-07 | Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction | Jiayi Kong et.al. | 2312.04106 | null |
2023-12-07 | Residual Graph Convolutional Network for Bird’s-Eye-View Semantic Segmentation | Qiuxiao Chen et.al. | 2312.04044 | null |
2023-12-15 | Natural-language-driven Simulation Benchmark and Copilot for Efficient Production of Object Interactions in Virtual Road Scenes | Kairui Yang et.al. | 2312.04008 | null |
2023-12-06 | Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving | Ming Nie et.al. | 2312.03661 | link |
2023-12-06 | GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models | Haicheng Liao et.al. | 2312.03543 | link |
2023-12-06 | Artist-Friendly Relightable and Animatable Neural Heads | Yingyan Xu et.al. | 2312.03420 | null |
2023-12-06 | Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future | Hongyang Li et.al. | 2312.03408 | link |
2023-12-06 | Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method | Hongyu Huang et.al. | 2312.03372 | null |
2023-12-06 | SO-NeRF: Active View Planning for NeRF using Surrogate Objectives | Keifer Lee et.al. | 2312.03266 | null |
2023-12-06 | Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Shijie Zhou et.al. | 2312.03203 | link |
2023-12-05 | HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces | Haithem Turki et.al. | 2312.03160 | null |
2023-12-05 | DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control | Yuru Jia et.al. | 2312.03048 | null |
2023-12-05 | Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving? | Zhiqi Li et.al. | 2312.03031 | link |
2023-12-05 | ReconFusion: 3D Reconstruction with Diffusion Priors | Rundi Wu et.al. | 2312.02981 | null |
2023-12-06 | WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation | Jiachen Lu et.al. | 2312.02934 | link |
2023-12-05 | HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting | Helisa Dhamo et.al. | 2312.02902 | null |
2023-12-05 | Experimental Insights Towards Explainable and Interpretable Pedestrian Crossing Prediction | Angie Nataly Melo et.al. | 2312.02872 | null |
2023-12-05 | C-NERF: Representing Scene Changes as Directional Consistency Difference-based NeRF | Rui Huang et.al. | 2312.02751 | link |
2023-12-05 | Estimation of articulated angle in six-wheeled dump trucks using multiple GNSS receivers for autonomous driving | Taro Suzuki et.al. | 2312.02510 | null |
2023-12-05 | Object Importance Estimation using Counterfactual Reasoning for Intelligent Driving | Pranay Gupta et.al. | 2312.02467 | null |
2023-12-05 | FINER: Flexible spectral-bias tuning in Implicit NEural Representation by Variable-periodic Activation Functions | Zhen Liu et.al. | 2312.02434 | null |
2023-12-05 | MGTR: Multi-Granular Transformer for Motion Prediction with LiDAR | Yiqian Gan et.al. | 2312.02409 | null |
2023-12-04 | PointNeRF++: A multi-scale, point-based Neural Radiance Field | Weiwei Sun et.al. | 2312.02362 | null |
2023-12-04 | Calibrated Uncertainties for Neural Radiance Fields | Niki Amini-Naieni et.al. | 2312.02350 | null |
2023-12-04 | Re-Nerfing: Enforcing Geometric Constraints on Neural Radiance Fields through Novel Views Synthesis | Felix Tristram et.al. | 2312.02255 | null |
2023-12-03 | WavePlanes: A compact Wavelet representation for Dynamic Neural Radiance Fields | Adrian Azzarelli et.al. | 2312.02218 | link |
2023-12-02 | Volumetric Rendering with Baked Quadrature Fields | Gopal Sharma et.al. | 2312.02202 | null |
2023-12-02 | StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D | Pengsheng Guo et.al. | 2312.02189 | null |
2023-12-04 | PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness | Anh-Quan Cao et.al. | 2312.02158 | link |
2023-12-04 | Mesh-Guided Neural Implicit Field Editing | Can Wang et.al. | 2312.02157 | null |
2023-12-04 | Fast View Synthesis of Casual Videos | Yao-Chih Lee et.al. | 2312.02135 | null |
2023-12-04 | ColonNeRF: Neural Radiance Fields for High-Fidelity Long-Sequence Colonoscopy Reconstruction | Yufei Shi et.al. | 2312.02015 | null |
2023-12-04 | COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction | Qihang Ma et.al. | 2312.01919 | link |
2023-12-04 | Fast and accurate sparse-view CBCT reconstruction using meta-learned neural attenuation field and hash-encoding regularization | Heejun Shin et.al. | 2312.01689 | null |
2023-12-04 | Analyze Drivers’ Intervention Behavior During Autonomous Driving – A VR-incorporated Approach | Zheng Xu et.al. | 2312.01669 | null |
2023-12-04 | GaussianHead: Impressive 3D Gaussian-based Head Avatars with Dynamic Hybrid Neural Field | Jie Wang et.al. | 2312.01632 | link |
2023-12-03 | SANeRF-HQ: Segment Anything for NeRF in High Quality | Yichen Liu et.al. | 2312.01531 | null |
2023-12-03 | Exploring Adversarial Robustness of LiDAR-Camera Fusion Model in Autonomous Driving | Bo Yang et.al. | 2312.01468 | null |
2023-12-03 | VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams | Liao Wang et.al. | 2312.01407 | null |
2023-12-02 | A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment | Lei Yuan et.al. | 2312.01058 | null |
2023-12-05 | Self-Evolving Neural Radiance Fields | Jaewoo Jung et.al. | 2312.01003 | link |
2023-12-01 | AV4EV: Open-Source Modular Autonomous Electric Vehicle Platform to Make Mobility Research Accessible | Zhijie Qiao et.al. | 2312.00951 | link |
2023-11-28 | Empowering Autonomous Driving with Large Language Models: A Safety Perspective | Yixuan Wang et.al. | 2312.00812 | null |
2023-12-01 | Towards Efficient 3D Object Detection in Bird’s-Eye-View Space for Autonomous Driving: A Convolutional-Only Approach | Yuxin Li et.al. | 2312.00633 | null |
2023-12-01 | Improving Efficiency of DNN-based Relocalization Module for Autonomous Driving with Server-side Computing | Dengbo Li et.al. | 2312.00316 | null |
2023-11-30 | PyNeRF: Pyramidal Neural Radiance Fields | Haithem Turki et.al. | 2312.00252 | link |
2023-11-30 | EpiTESTER: Testing Autonomous Vehicles with Epigenetic Algorithm and Attention Mechanism | Chengjie Lu et.al. | 2312.00207 | link |
2023-11-30 | SparseGS: Real-Time 360° Sparse View Synthesis using Gaussian Splatting | Haolin Xiong et.al. | 2312.00206 | link |
2023-11-30 | Evaluating the Impact of Flaky Simulators on Testing Autonomous Driving Systems | Mohammad Hossein Amini et.al. | 2311.18768 | link |
2023-11-30 | VREM-FL: Mobility-Aware Computation-Scheduling Co-Design for Vehicular Federated Learning | Luca Ballotta et.al. | 2311.18741 | link |
2023-11-30 | Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing | Hyelin Nam et.al. | 2311.18608 | null |
2023-11-30 | Heterogeneous Graph-based Trajectory Prediction using Local Map Context and Social Interactions | Daniel Grimm et.al. | 2311.18553 | null |
2023-11-30 | Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control | Bernd Frauenknecht et.al. | 2311.18393 | null |
2023-11-30 | Anisotropic Neural Representation Learning for High-Quality Neural Rendering | Y. Wang et.al. | 2311.18311 | null |
2023-11-29 | Game Projection and Robustness for Game-Theoretic Autonomous Driving | Mushuang Liu et.al. | 2311.18074 | null |
2023-11-29 | Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving | Yuqi Wang et.al. | 2311.17918 | link |
2023-11-29 | FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information | Wen Jiang et.al. | 2311.17874 | link |
2023-12-07 | Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications | Junyi Ma et.al. | 2311.17663 | link |
2023-11-29 | Erasing the Ephemeral: Joint Camera Refinement and Transient Object Removal for Street View Synthesis | Mreenav Shyam Deka et.al. | 2311.17634 | null |
2023-11-29 | SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Ziqiao Peng et.al. | 2311.17590 | link |
2023-11-29 | NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields | Xiaoliang Liu et.al. | 2311.17332 | null |
2023-11-30 | REF $^2$ -NeRF: Reflection and Refraction aware Neural Radiance Field | Wooseok Kim et.al. | 2311.17116 | link |
2023-11-28 | Human Gaussian Splatting: Real-time Rendering of Animatable Avatars | Arthur Moreau et.al. | 2311.17113 | link |
2023-11-28 | DepthSSC: Depth-Spatial Alignment and Dynamic Voxel Resolution for Monocular 3D Semantic Scene Completion | Jiawei Yao et.al. | 2311.17084 | null |
2023-12-11 | UC-NeRF: Neural Radiance Field for Under-Calibrated Multi-view Cameras in Autonomous Driving | Kai Cheng et.al. | 2311.16945 | null |
2023-11-29 | A Unified Approach for Text- and Image-guided 4D Scene Generation | Yufeng Zheng et.al. | 2311.16854 | null |
2023-11-28 | Panacea: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2311.16813 | null |
2023-11-28 | Towards Full-scene Domain Generalization in Multi-agent Collaborative Bird’s Eye View Segmentation for Connected and Autonomous Driving | Senkang Hu et.al. | 2311.16754 | link |
2023-11-28 | SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation | Jesus Zarzar et.al. | 2311.16671 | link |
2023-11-28 | DGNR: Density-Guided Neural Point Rendering of Large Driving Scenes | Zhuopeng Li et.al. | 2311.16664 | null |
2023-11-28 | SCALAR-NeRF: SCAlable LARge-scale Neural Radiance Fields for Scene Reconstruction | Yu Chen et.al. | 2311.16657 | null |
2023-11-28 | RGBGrasp: Image-based Object Grasping by Capturing Multiple Views during Robot Arm Movement with Neural Radiance Fields | Chang Liu et.al. | 2311.16592 | null |
2023-11-28 | Rethinking Directional Integration in Neural Radiance Fields | Congyue Deng et.al. | 2311.16504 | null |
2023-11-29 | Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars | Yang Liu et.al. | 2311.16482 | link |
2023-11-27 | Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling | Zhe Li et.al. | 2311.16096 | link |
2023-11-27 | OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving | Wenzhao Zheng et.al. | 2311.16038 | link |
2023-11-27 | SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance Fields | Quentin Herau et.al. | 2311.15803 | null |
2023-11-27 | Technical Report for Argoverse Challenges on 4D Occupancy Forecasting | Pengfei Zheng et.al. | 2311.15660 | null |
2023-11-27 | PaintNeSF: Artistic Creation of Stylized Scenes with Vectorized 3D Strokes | Hao-Bin Duan et.al. | 2311.15637 | null |
2023-11-27 | Technical Report for Argoverse Challenges on Unified Sensor-based Detection, Tracking, and Forecasting | Zhepeng Wang et.al. | 2311.15615 | null |
2023-11-27 | Sparse Pedestrian Character Learning for Trajectory Prediction | Yonghao Dong et.al. | 2311.15512 | null |
2023-11-27 | CaesarNeRF: Calibrated Semantic Representation for Few-shot Generalizable Neural Rendering | Haidong Zhu et.al. | 2311.15510 | link |
2023-11-26 | Efficient Encoding of Graphics Primitives with Simplex-based Structures | Yibo Wen et.al. | 2311.15439 | null |
2023-11-26 | GAN-Based LiDAR Intensity Simulation | Richard Marcus et.al. | 2311.15415 | null |
2023-11-26 | Obj-NeRF: Extract Object NeRFs from Multi-view Images | Zhiyi Li et.al. | 2311.15291 | null |
2023-12-05 | NeuRAD: Neural Rendering for Autonomous Driving | Adam Tonderski et.al. | 2311.15260 | link |
2023-11-26 | CalibFormer: A Transformer-based Automatic LiDAR-Camera Calibration Network | Yuxuan Xiao et.al. | 2311.15241 | null |
2023-11-25 | OpenNet: Incremental Learning for Autonomous Driving Object Detection with Balanced Loss | Zezhou Wang et.al. | 2311.14939 | null |
2023-11-25 | GBD-TS: Goal-based Pedestrian Trajectory Prediction with Diffusion using Tree Sampling Algorithm | Ge Sun et.al. | 2311.14922 | null |
2023-11-24 | GPT-4V Takes the Wheel: Evaluating Promise and Challenges for Pedestrian Behavior Prediction | Jia Huang et.al. | 2311.14786 | null |
2023-11-24 | Animate124: Animating One Image to 4D Dynamic Scene | Yuyang Zhao et.al. | 2311.14603 | null |
2023-12-01 | GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Yiwen Chen et.al. | 2311.14521 | link |
2023-11-24 | Safety Assessment of Vehicle Characteristics Variations in Autonomous Driving Systems | Qi Pan et.al. | 2311.14461 | link |
2023-11-23 | ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain Optimization | Soonbin Lee et.al. | 2311.14208 | null |
2023-11-23 | Tube-NeRF: Efficient Imitation Learning of Visuomotor Policies from MPC using Tube-Guided Data Augmentation and NeRFs | Andrea Tagliabue et.al. | 2311.14153 | null |
2023-11-23 | Posterior Distillation Sampling | Juil Koo et.al. | 2311.13831 | null |
2023-12-06 | Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder | Xiaohao Xu et.al. | 2311.13750 | null |
2023-11-23 | Security and Privacy Challenges in Deep Learning Models | Gopichandh Golla et.al. | 2311.13744 | null |
2023-11-22 | Compact 3D Gaussian Representation for Radiance Field | Joo Chan Lee et.al. | 2311.13681 | link |
2023-11-22 | ADriver-I: A General World Model for Autonomous Driving | Fan Jia et.al. | 2311.13549 | null |
2023-11-22 | An Empirical Study of Uncertainty Estimation Techniques for Detecting Drift in Data Streams | Anton Winter et.al. | 2311.13374 | null |
2023-11-22 | Retargeting Visual Data with Deformation Fields | Tim Elsner et.al. | 2311.13297 | null |
2023-11-22 | DoubleAUG: Single-domain Generalized Object Detector in Urban via Color Perturbation and Dual-style Memory | Lei Qi et.al. | 2311.13198 | null |
2023-11-22 | 3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization | Jianwei Feng et.al. | 2311.13168 | null |
2023-11-29 | SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction | Yuanhui Huang et.al. | 2311.12754 | link |
2023-11-21 | Attacking Motion Planners Using Adversarial Perception Errors | Jonathan Sadeghi et.al. | 2311.12722 | null |
2023-11-21 | Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields | Yifan Wang et.al. | 2311.12490 | null |
2023-11-21 | A Survey on Multimodal Large Language Models for Autonomous Driving | Can Cui et.al. | 2311.12320 | link |
2023-12-04 | Applications of Large Scale Foundation Models for Autonomous Driving | Yu Huang et.al. | 2311.12144 | null |
2023-11-18 | Towards Function Space Mesh Watermarking: Protecting the Copyright of Signed Distance Fields | Xingyu Zhu et.al. | 2311.12059 | null |
2023-11-18 | FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin | Zichen Yu et.al. | 2311.12058 | link |
2023-11-20 | Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields | Zhiyuan Min et.al. | 2311.11845 | link |
2023-11-20 | Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems | Guangjing Wang et.al. | 2311.11796 | null |
2023-11-23 | MUVO: A Multimodal Generative World Model for Autonomous Driving with Geometric Representations | Daniel Bogdoll et.al. | 2311.11762 | link |
2023-11-20 | A Large-Scale Car Parts (LSCP) Dataset for Lightweight Fine-Grained Detection | Wang Jie et.al. | 2311.11754 | null |
2023-11-20 | Sparse4D v3: Advancing End-to-End 3D Detection and Tracking | Xuewu Lin et.al. | 2311.11722 | link |
2023-11-19 | Pair-wise Layer Attention with Spatial Masking for Video Prediction | Ping Li et.al. | 2311.11289 | link |
2023-11-19 | Multi-Timescale Control and Communications with Deep Reinforcement Learning – Part I: Communication-Aware Vehicle Control | Tong Liu et.al. | 2311.11281 | null |
2023-11-18 | Tactics2D: A Multi-agent Reinforcement Learning Environment for Driving Decision-making | Yueyuan Li et.al. | 2311.11058 | link |
2023-11-18 | A Survey of Simulators for Autonomous Driving: Taxonomy, Challenges, and Evaluation Metrics | Yueyuan Li et.al. | 2311.11056 | null |
2023-11-18 | Structure-Aware Sparse-View X-ray 3D Reconstruction | Yuanhao Cai et.al. | 2311.10959 | link |
2023-11-27 | A Language Agent for Autonomous Driving | Jiageng Mao et.al. | 2311.10813 | link |
2023-10-31 | Safety-aware Causal Representation for Trustworthy Reinforcement Learning in Autonomous Driving | Haohong Lin et.al. | 2311.10747 | null |
2023-11-17 | Removing Adverse Volumetric Effects From Trained Neural Radiance Fields | Andreas L. Teigen et.al. | 2311.10523 | null |
2023-11-17 | Mind the map! Accounting for existing map information when estimating online HDMaps from sensor data | Rémy Sun et.al. | 2311.10517 | link |
2023-11-17 | Cooperative Perception with Learning-Based V2V communications | Chenguang Liu et.al. | 2311.10336 | null |
2023-11-17 | Imagination-augmented Hierarchical Reinforcement Learning for Safe and Interactive Autonomous Driving in Urban Environments | Sang-Hyun Lee et.al. | 2311.10309 | null |
2023-11-17 | Vision meets mmWave Radar: 3D Object Perception Benchmark for Autonomous Driving | Yizhou Wang et.al. | 2311.10261 | null |
2023-11-16 | Adaptive Shells for Efficient Neural Radiance Field Rendering | Zian Wang et.al. | 2311.10091 | null |
2023-11-16 | Interpretable Reinforcement Learning for Robotics and Continuous Control | Rohan Paleja et.al. | 2311.10041 | link |
2023-11-16 | Scan statistics for the detection of anomalies in M-dependent random fields with applications to image data | Claudia Kirch et.al. | 2311.09961 | null |
2023-11-18 | EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices | Jingnan Gao et.al. | 2311.09806 | null |
2023-11-16 | Automatic Generation of Scenarios for System-level Simulation-based Verification of Autonomous Driving Systems | Srajan Goyal et.al. | 2311.09784 | null |
2023-11-16 | Reconstructing Continuous Light Field From Single Coded Image | Yuya Ishikawa et.al. | 2311.09646 | null |
2023-11-16 | Applications of Computer Vision in Autonomous Vehicles: Methods, Challenges and Future Directions | Xingshuai Dong et.al. | 2311.09093 | null |
2023-11-14 | Drivable 3D Gaussian Avatars | Wojciech Zielonka et.al. | 2311.08581 | null |
2023-11-14 | Low-light Pedestrian Detection in Visible and Infrared Image Feeds: Issues and Challenges | Hrishikesh Vachhani et.al. | 2311.08557 | null |
2023-11-14 | Human-Centric Autonomous Systems With LLMs for User Command Reasoning | Yi Yang et.al. | 2311.08206 | link |
2023-11-14 | Lateral control for autonomous vehicles: A comparative evaluation | Antonio Artuñedo et.al. | 2311.07987 | null |
2023-11-14 | Towards Improving Robustness Against Common Corruptions in Object Detectors Using Adversarial Contrastive Learning | Shashank Kotyan et.al. | 2311.07928 | null |
2023-11-13 | Temporal Performance Prediction for Deep Convolutional Long Short-Term Memory Networks | Laura Fieback et.al. | 2311.07477 | null |
2023-11-13 | $L_0$-Sampler: An $L_{0}$ Model Guided Volume Sampling for NeRF | Liangchen Li et.al. | 2311.07044 | null |
2023-11-11 | Semantic Communication for Cooperative Perception based on Importance Map | Yucheng Sheng et.al. | 2311.06498 | null |
2023-11-11 | Aria-NeRF: Multimodal Egocentric View Synthesis | Jiankai Sun et.al. | 2311.06455 | null |
2023-11-10 | ASSIST: Interactive Scene Nodes for Scalable and Realistic Indoor Simulation | Zhide Zhong et.al. | 2311.06211 | null |
2023-11-10 | Improved Positional Encoding for Implicit Neural Representation based Compact Data Representation | Bharath Bhushan Damodaran et.al. | 2311.06059 | null |
2023-11-10 | Refining the ONCE Benchmark with Hyperparameter Tuning | Maksim Golyadkin et.al. | 2311.06054 | null |
2023-11-10 | Deep learning for 3D Object Detection and Tracking in Autonomous Driving: A Brief Survey | Yang Peng et.al. | 2311.06043 | null |
2023-11-10 | Quantized Distillation: Optimizing Driver Activity Recognition Models for Resource-Constrained Environments | Calvin Tanama et.al. | 2311.05970 | link |
2023-11-17 | UMedNeRF: Uncertainty-aware Single View Volumetric Rendering for Medical Neural Radiance Fields | Jing Hu et.al. | 2311.05836 | null |
2023-11-09 | Multi-Agent Quantum Reinforcement Learning using Evolutionary Optimization | Michael Kölle et.al. | 2311.05546 | link |
2023-11-28 | BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis | Hao-Bin Duan et.al. | 2311.05521 | link |
2023-11-28 | On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Licheng Wen et.al. | 2311.05332 | link |
2023-11-09 | TLCFuse: Temporal Multi-Modality Fusion Towards Occlusion-Aware Semantic Segmentation-Aided Motion Planning | Gustavo Salazar-Gomez et.al. | 2311.05319 | null |
2023-11-09 | VoxNeRF: Bridging Voxel Representation and Neural Radiance Fields for Enhanced Indoor View Synthesis | Sen Wang et.al. | 2311.05289 | null |
2023-11-09 | ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image | Senthil Purushwalkam et.al. | 2311.05230 | null |
2023-11-08 | Lidar Annotation Is All You Need | Dinar Sharafutdinov et.al. | 2311.04777 | link |
2023-11-08 | Image Patch-Matching with Graph-Based Learning in Street Scenes | Rui She et.al. | 2311.04617 | null |
2023-11-09 | Rethinking Human Pose Estimation for Autonomous Driving with 3D Event Representations | Xiaoting Yin et.al. | 2311.04591 | link |
2023-11-08 | Learning Robust Multi-Scale Representation for Neural Radiance Fields from Unposed Images | Nishant Jain et.al. | 2311.04521 | null |
2023-11-08 | FFINet: Future Feedback Interaction Network for Motion Forecasting | Miao Kang et.al. | 2311.04512 | null |
2023-11-08 | PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds | Hao Yang et.al. | 2311.04501 | null |
2023-11-08 | LRM: Large Reconstruction Model for Single Image to 3D | Yicong Hong et.al. | 2311.04400 | null |
2023-11-07 | High-fidelity 3D Reconstruction of Plants using Neural Radiance Field | Kewei Hu et.al. | 2311.04154 | null |
2023-11-12 | What Makes a Fantastic Passenger-Car Driver in Urban Contexts? | Yueteng Yu et.al. | 2311.04150 | null |
2023-11-07 | Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps | Katie Z Luo et.al. | 2311.04079 | link |
2023-11-07 | AGNES: Abstraction-guided Framework for Deep Neural Networks Security | Akshay Dhonthi et.al. | 2311.04009 | null |
2023-11-07 | Fast Sun-aligned Outdoor Scene Relighting based on TensoRF | Yeonjin Chang et.al. | 2311.03965 | null |
2023-11-08 | UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields | Injae Kim et.al. | 2311.03784 | link |
2023-11-06 | Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences | Zador Pataki et.al. | 2311.03345 | null |
2023-11-06 | Animating NeRFs from Texture Space: A Framework for Pose-Dependent Rendering of Human Performances | Paul Knoll et.al. | 2311.03140 | null |
2023-11-06 | COLA: COarse-LAbel multi-source LiDAR semantic segmentation for autonomous driving | Jules Sanchez et.al. | 2311.03017 | null |
2023-11-06 | IR-STP: Enhancing Autonomous Driving with Interaction Reasoning in Spatio-Temporal Planning | Yingbing Chen et.al. | 2311.02850 | link |
2023-11-06 | Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video | Yanqin Jiang et.al. | 2311.02848 | null |
2023-11-06 | Flexible Multi-Generator Model with Fused Spatiotemporal Graph for Trajectory Prediction | Peiyuan Zhu et.al. | 2311.02835 | null |
2023-11-06 | InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image | Jianhui Li et.al. | 2311.02826 | link |
2023-11-05 | Deep Learning-based 3D Point Cloud Classification: A Systematic Survey and Outlook | Huang Zhang et.al. | 2311.02608 | null |
2023-11-05 | VR-NeRF: High-Fidelity Virtualized Walkable Spaces | Linning Xu et.al. | 2311.02542 | null |
2023-11-04 | Uncertainty Quantification of Deep Learning for Spatiotemporal Data: Challenges and Opportunities | Wenchong He et.al. | 2311.02485 | null |
2023-11-04 | Levels of AGI: Operationalizing Progress on the Path to AGI | Meredith Ringel Morris et.al. | 2311.02462 | null |
2023-11-04 | P2O-Calib: Camera-LiDAR Calibration Using Point-Pair Spatial Occlusion Relationship | Su Wang et.al. | 2311.02413 | null |
2023-11-04 | Continual Learning of Unsupervised Monocular Depth from Videos | Hemang Chawla et.al. | 2311.02393 | link |
2023-11-04 | OSM vs HD Maps: Map Representations for Trajectory Prediction | Jing-Yan Liao et.al. | 2311.02305 | null |
2023-11-03 | Quantitative Evaluation of a Multi-Modal Camera Setup for Fusing Event Data with RGB Images | Julian Moosmann et.al. | 2311.01881 | null |
2023-11-03 | A Neural Radiance Field-Based Architecture for Intelligent Multilayered View Synthesis | D. Dhinakaran et.al. | 2311.01842 | null |
2023-11-03 | Multi-LiDAR Localization and Mapping Pipeline for Urban Autonomous Driving | Florian Sauerbeck et.al. | 2311.01823 | null |
2023-11-03 | Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields | Jianxiong Shen et.al. | 2311.01815 | null |
2023-11-06 | Towards Calibrated Robust Fine-Tuning of Vision-Language Models | Changdae Oh et.al. | 2311.01723 | link |
2023-11-03 | Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Haibao Yu et.al. | 2311.01682 | link |
2023-11-03 | Efficient Cloud Pipelines for Neural Radiance Fields | Derek Jacoby et.al. | 2311.01659 | null |
2023-11-03 | INeAT: Iterative Neural Adaptive Tomography | Bo Xiong et.al. | 2311.01653 | null |
2023-11-02 | DRNet: A Decision-Making Method for Autonomous Lane Changingwith Deep Reinforcement Learning | Kunpeng Xu et.al. | 2311.01602 | null |
2023-11-02 | Adversary ML Resilience in Autonomous Driving Through Human Centered Perception Mechanisms | Aakriti Shah et.al. | 2311.01478 | null |
2023-11-02 | Conformal Policy Learning for Sensorimotor Control Under Distribution Shifts | Huang Huang et.al. | 2311.01457 | null |
2023-11-02 | Efficient Vision Transformer for Accurate Traffic Sign Detection | Javad Mirzapour Kaleybar et.al. | 2311.01429 | null |
2023-11-04 | CenterRadarNet: Joint 3D Object Detection and Tracking Framework using 4D FMCW Radar | Jen-Hao Cheng et.al. | 2311.01423 | null |
2023-11-02 | Novel View Synthesis from a Single RGBD Image for Indoor Scenes | Congrui Hetang et.al. | 2311.01065 | null |
2023-11-02 | A Survey of Large Language Models for Autonomous Driving | Zhenjie Yang et.al. | 2311.01043 | link |
2023-11-02 | Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion | Lunjun Zhang et.al. | 2311.01017 | null |
2023-11-02 | CML-MOTS: Collaborative Multi-task Learning for Multi-Object Tracking and Segmentation | Yiming Cui et.al. | 2311.00987 | null |
2023-11-01 | Learning Cooperative Trajectory Representations for Motion Forecasting | Hongzhi Ruan et.al. | 2311.00371 | link |
2023-10-31 | FPO++: Efficient Encoding and Rendering of Dynamic Neural Radiance Fields by Analyzing and Enhancing Fourier PlenOctrees | Saskia Rabich et.al. | 2310.20710 | link |
2023-10-31 | NeRF Revisited: Fixing Quadrature Instability in Volume Rendering | Mikaela Angelina Uy et.al. | 2310.20685 | null |
2023-10-31 | Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving | Luca Cultrera et.al. | 2310.20650 | null |
2023-10-31 | FLODCAST: Flow and Depth Forecasting via Multimodal Recurrent Architectures | Andrea Ciamarra et.al. | 2310.20593 | null |
2023-10-31 | Predictive Control for Autonomous Driving with Uncertain, Multi-modal Predictions | Siddharth H. Nair et.al. | 2310.20561 | null |
2023-10-31 | Collaborative Decision-Making Using Spatiotemporal Graphs in Connected Autonomy | Peng Gao et.al. | 2310.20491 | null |
2023-11-01 | Enhancing the Spatial Awareness Capability of Multi-Modal Large Language Model | Yongqiang Zhao et.al. | 2310.20357 | null |
2023-10-31 | HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds | Gang Zhang et.al. | 2310.20234 | link |
2023-10-30 | Large Trajectory Models are Scalable Motion Predictors and Planners | Qiao Sun et.al. | 2310.19620 | link |
2023-10-31 | DynPoint: Dynamic Neural Point For View Synthesis | Kaichen Zhou et.al. | 2310.18999 | link |
2023-11-04 | TiV-NeRF: Tracking and Mapping via Time-Varying Representation with Dynamic Neural Radiance Fields | Chengyao Duan et.al. | 2310.18917 | null |
2023-10-31 | Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles | Luca Crosato et.al. | 2310.18891 | null |
2023-10-28 | INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings | Amirhossein Kazerouni et.al. | 2310.18846 | link |
2023-11-07 | ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object Detection | Weijia Zhang et.al. | 2310.18620 | link |
2023-10-27 | Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning | Nicholas E. Corrado et.al. | 2310.18247 | null |
2023-10-27 | Fine-Tuning Language Models Using Formal Methods Feedback | Yunhao Yang et.al. | 2310.18239 | null |
2023-10-27 | Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations | Tristan Aumentado-Armstrong et.al. | 2310.17880 | null |
2023-10-27 | Siamese-DETR for Generic Multi-Object Tracking | Qiankun Liu et.al. | 2310.17875 | link |
2023-10-26 | Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models | Tsun-Hsuan Wang et.al. | 2310.17642 | null |
2023-10-26 | EqDrive: Efficient Equivariant Motion Forecasting with Multi-Modality for Autonomous Driving | Yuping Wang et.al. | 2310.17540 | null |
2023-10-26 | Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving | Gilles Puy et.al. | 2310.17504 | link |
2023-10-30 | A Hybrid Graph Network for Complex Activity Detection in Video | Salman Khan et.al. | 2310.17493 | null |
2023-10-26 | YOLO-BEV: Generating Bird’s-Eye View in the Same Way as 2D Object Detection | Chang Liu et.al. | 2310.17379 | null |
2023-10-27 | Navigating Data Heterogeneity in Federated Learning A Semi-Supervised Approach for Object Detection | Taehyeon Kim et.al. | 2310.17097 | link |
2023-10-27 | HyperFields: Towards Zero-Shot Generation of NeRFs from Text | Sudarshan Babu et.al. | 2310.17075 | null |
2023-11-06 | 4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via Semantic Distillation | Dadong Jiang et.al. | 2310.16858 | null |
2023-10-28 | PERF: Panoramic Neural Radiance Field from a Single Panorama | Guangcong Wang et.al. | 2310.16831 | link |
2023-10-25 | Using Knowledge Awareness to improve Safety of Autonomous Driving | Andrea Calvagna et.al. | 2310.16760 | null |
2023-10-26 | Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving | Jessica Echterhoff et.al. | 2310.16639 | link |
2023-10-25 | ParisLuco3D: A high-quality target dataset for domain generalization of LiDAR perception | Jules Sanchez et.al. | 2310.16542 | null |
2023-10-25 | MVFAN: Multi-View Feature Assisted Network for 4D Radar Object Detection | Qiao Yan et.al. | 2310.16389 | null |
2023-10-25 | Open-NeRF: Towards Open Vocabulary NeRF Decomposition | Hao Zhang et.al. | 2310.16383 | null |
2023-10-24 | Pixel-Level Clustering Network for Unsupervised Image Segmentation | Cuong Manh Hoang et.al. | 2310.16234 | null |
2023-10-24 | Data-driven Traffic Simulation: A Comprehensive Review | Di Chen et.al. | 2310.15975 | null |
2023-10-24 | Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation | Yinjie Lei et.al. | 2310.15676 | null |
2023-10-24 | Cross-view Self-localization from Synthesized Scene-graphs | Ryogo Yamamoto et.al. | 2310.15504 | null |
2023-10-23 | RoboDepth: Robust Out-of-Distribution Depth Estimation under Corruptions | Lingdong Kong et.al. | 2310.15171 | link |
2023-10-23 | P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation | Mohammed A. M. Elhassan et.al. | 2310.15025 | null |
2023-10-23 | End-to-End Learning of Behavioural Inputs for Autonomous Driving in Dense Traffic | Jatan Shrestha et.al. | 2310.14766 | link |
2023-10-23 | BM2CP: Efficient Collaborative Perception with LiDAR-Camera Modalities | Binyu Zhao et.al. | 2310.14702 | link |
2023-10-23 | CAwa-NeRF: Instant Learning of Compression-Aware NeRF Features | Omnia Mahmoud et.al. | 2310.14695 | null |
2023-10-23 | DICE: Diverse Diffusion Model with Scoring for Trajectory Prediction | Younwoo Choi et.al. | 2310.14570 | null |
2023-10-22 | Vision Language Models in Autonomous Driving and Intelligent Transportation Systems | Xingcheng Zhou et.al. | 2310.14414 | link |
2023-10-22 | Detrive: Imitation Learning with Transformer Detection for End-to-End Autonomous Driving | Daoming Chen et.al. | 2310.14224 | link |
2023-10-21 | Equivariant Map and Agent Geometry for Autonomous Driving Motion Prediction | Yuping Wang et.al. | 2310.13922 | null |
2023-10-21 | Exploring Driving Behavior for Autonomous Vehicles Based on Gramian Angular Field Vision Transformer | Junwei You et.al. | 2310.13906 | null |
2023-10-20 | ManifoldNeRF: View-dependent Image Feature Supervision for Few-shot Neural Radiance Fields | Daiju Kanaoka et.al. | 2310.13670 | null |
2023-10-20 | OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal 3D Data | Yijie Zhou et.al. | 2310.13398 | link |
2023-10-20 | Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos | Seoha Kim et.al. | 2310.13356 | link |
2023-10-20 | Combining Policy Gradient and Safety-Based Control for Autonomous Driving | Xi Xiong et.al. | 2310.13314 | null |
2023-10-20 | UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene | Jiaming Gu et.al. | 2310.13263 | null |
2023-10-20 | Higher or Lower: Challenges in Object based SLAM | Zhihe Zhang et.al. | 2310.13256 | null |
2023-10-19 | LeTFuser: Light-weight End-to-end Transformer-Based Sensor Fusion for Autonomous Driving with Multi-Task Learning | Pedram Agand et.al. | 2310.13135 | link |
2023-10-19 | NeuroSMPC: A Neural Network guided Sampling Based MPC for On-Road Autonomous Driving | Kaustab Pal et.al. | 2310.13077 | null |
2023-10-19 | Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative Pose Encoding | Zhejun Zhang et.al. | 2310.12970 | link |
2023-10-18 | Monte-Carlo Tree Search for Behavior Planning in Autonomous Driving | Qianfeng Wen et.al. | 2310.12075 | link |
2023-10-19 | One-Bit Byzantine-Tolerant Distributed Learning via Over-the-Air Computation | Yuhan Yang et.al. | 2310.11998 | null |
2023-10-18 | Malicious Agent Detection for Robust Multi-Agent Collaborative Perception | Yangheng Zhao et.al. | 2310.11901 | null |
2023-10-18 | Using Experience Classification for Training Non-Markovian Tasks | Ruixuan Miao et.al. | 2310.11678 | null |
2023-10-18 | Towards Abdominal 3-D Scene Rendering from Laparoscopy Surgical Videos using NeRFs | Khoa Tuan Nguyen et.al. | 2310.11645 | null |
2023-10-17 | Non-ergodicity in reinforcement learning: robustness via ergodicity transformations | Dominik Baumann et.al. | 2310.11335 | link |
2023-10-17 | LiDAR-based 4D Occupancy Completion and Forecasting | Xinhao Liu et.al. | 2310.11239 | link |
2023-10-19 | Path Following Control of Automated Vehicle Considering Uncertainties and Disturbances with Parametric Varying | Dan Shen et.al. | 2310.10925 | null |
2023-10-16 | TraM-NeRF: Tracing Mirror and Near-Perfect Specular Reflections through Neural Radiance Fields | Leif Van Holland et.al. | 2310.10650 | link |
2023-10-16 | DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing | Jia-Wei Liu et.al. | 2310.10624 | null |
2023-10-16 | BEVGPT: Generative Pre-trained Large Model for Autonomous Driving Prediction, Decision-Making, and Planning | Pengqin Wang et.al. | 2310.10357 | null |
2023-10-16 | Multimodal Object Query Initialization for 3D Object Detection | Mathijs R. van Geerenstein et.al. | 2310.10353 | null |
2023-10-16 | SoTTA: Robust Test-Time Adaptation on Noisy Data Streams | Taesik Gong et.al. | 2310.10074 | link |
2023-10-15 | ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context | Binglun Wang et.al. | 2310.09965 | null |
2023-10-15 | Active Perception using Neural Radiance Fields | Siming He et.al. | 2310.09892 | link |
2023-10-15 | CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses | Hongyu Fu et.al. | 2310.09776 | null |
2023-10-14 | Real-Time Traffic Sign Detection: A Case Study in a Santa Clara Suburban Neighborhood | Harish Loghashankar et.al. | 2310.09630 | null |
2023-10-20 | JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues | Jiayi Ji et.al. | 2310.09503 | link |
2023-10-13 | Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving | Feng Jiang et.al. | 2310.08826 | null |
2023-10-12 | PU-Ray: Point Cloud Upsampling via Ray Marching on Implicit Surface | Sangwon Lim et.al. | 2310.08755 | link |
2023-10-12 | Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research | Cole Gulino et.al. | 2310.08710 | null |
2023-10-12 | Data-driven Invariance for Reference Governors | Ali Kashani et.al. | 2310.08679 | null |
2023-10-12 | Performance/power assessment of CNN packages on embedded automotive platforms | Paolo Burgio et.al. | 2310.08401 | null |
2023-10-12 | UniPAD: A Universal Pre-training Paradigm for Autonomous Driving | Honghui Yang et.al. | 2310.08370 | link |
2023-10-12 | Impact of multi-armed bandit strategies on deep recurrent reinforcement learning | Valentina Zangirolami et.al. | 2310.08331 | link |
2023-10-12 | NSM4D: Neural Scene Model Based Online 4D Point Cloud Sequence Understanding | Yuhao Dong et.al. | 2310.08326 | null |
2023-10-12 | If our aim is to build morality into an artificial agent, how might we begin to go about doing so? | Reneira Seeamber et.al. | 2310.08295 | null |
2023-10-12 | Hilbert Space Embedding-based Trajectory Optimization for Multi-Modal Uncertain Obstacle Trajectory Prediction | Basant Sharma et.al. | 2310.08270 | link |
2023-10-12 | GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2310.08261 | null |
2023-10-12 | DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception | Xianghao Kong et.al. | 2310.08117 | link |
2023-10-20 | Model Predictive Inferential Control of Neural State-Space Models for Autonomous Vehicle Motion Planning | Iman Askari et.al. | 2310.08045 | null |
2023-10-12 | EC-Depth: Exploring the consistency of self-supervised monocular depth estimation under challenging scenes | Ruijie Zhu et.al. | 2310.08044 | link |
2023-10-12 | Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous Vehicles | Can Cui et.al. | 2310.08034 | null |
2023-10-12 | HeightFormer: A Multilevel Interaction and Image-adaptive Classification-regression Network for Monocular Height Estimation with Aerial Images | Zhan Chen et.al. | 2310.07995 | null |
2023-10-11 | Dynamic Appearance Particle Neural Radiance Field | Ancheng Lin et.al. | 2310.07916 | null |
2023-10-11 | CRITERIA: a New Benchmarking Paradigm for Evaluating Trajectory Prediction Models for Autonomous Driving | Changhe Chen et.al. | 2310.07794 | link |
2023-10-11 | DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model | Xiaofan Li et.al. | 2310.07771 | link |
2023-10-23 | Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous Driving | Xinyu Zhang et.al. | 2310.07602 | link |
2023-10-11 | Metamorphic Runtime Monitoring of Autonomous Driving Systems | Jon Ayerdi et.al. | 2310.07414 | link |
2023-10-11 | LESS-Map: Lightweight and Evolving Semantic Map in Parking Lots for Long-term Self-Localization | Mingrui Liu et.al. | 2310.07390 | null |
2023-10-11 | Optimizing the Placement of Roadside LiDARs for Autonomous Driving | Wentao Jiang et.al. | 2310.07247 | null |
2023-10-11 | Integrated Sensing and Communication enabled Multiple Base Stations Cooperative Sensing Towards 6G | Zhiqing Wei et.al. | 2310.07180 | null |
2023-10-11 | rpcPRF: Generalizable MPI Neural Radiance Field for Satellite Camera | Tongtong Zhang et.al. | 2310.07179 | null |
2023-10-10 | Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization | Le Chen et.al. | 2310.06984 | null |
2023-11-01 | TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning | Dongming Wu et.al. | 2310.06753 | link |
2023-10-10 | Safe-by-Construction Autonomous Vehicle Overtaking using Control Barrier Functions and Model Predictive Control | Dingran Yuan et.al. | 2310.06553 | null |
2023-10-10 | High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field | Minghan Qin et.al. | 2310.06275 | null |
2023-10-09 | Layout Sequence Prediction From Noisy Mobile Modality | Haichao Zhang et.al. | 2310.06138 | null |
2023-10-07 | DynamicBEV: Leveraging Dynamic Queries and Temporal Context for 3D Object Detection | Jiawei Yao et.al. | 2310.05989 | link |
2023-10-09 | DTPP: Differentiable Joint Conditional Prediction and Cost Evaluation for Tree Policy Planning in Autonomous Driving | Zhiyu Huang et.al. | 2310.05885 | link |
2023-10-09 | A Real-time Method for Inserting Virtual Objects into Neural Radiance Fields | Keyang Ye et.al. | 2310.05837 | null |
2023-10-09 | Joint object detection and re-identification for 3D obstacle multi-camera systems | Irene Cortés et.al. | 2310.05785 | null |
2023-10-09 | GPS Attack Detection and Mitigation for Safe Autonomous Driving using Image and Map based Lateral Direction Localization | Qingming Chen et.al. | 2310.05407 | null |
2023-10-09 | Neural Impostor: Editing Neural Radiance Fields with Explicit Shape Manipulation | Ruiyang Liu et.al. | 2310.05391 | null |
2023-10-08 | Influence of Camera-LiDAR Configuration on 3D Object Detection for Autonomous Driving | Ye Li et.al. | 2310.05245 | link |
2023-10-08 | Indoor Localization for an Autonomous Model Car: A Marker-Based Multi-Sensor Fusion Framework | Xibo Li et.al. | 2310.05198 | null |
2023-10-08 | DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather Data | Chengjie Lu et.al. | 2310.05170 | link |
2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
2023-10-08 | Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation | Dominik Hollidt et.al. | 2310.05133 | null |
2023-10-08 | A Privacy-Preserving Trajectory Synthesis Method Based on Vector Translation Invariance Supporting Traffic Constraints | Zechen Liu et.al. | 2310.05091 | null |
2023-10-08 | An Anomaly Behavior Analysis Framework for Securing Autonomous Vehicle Perception | Murad Mehrab Abrar et.al. | 2310.05041 | link |
2023-10-07 | Combining UPerNet and ConvNeXt for Contrails Identification to reduce Global Warming | Zhenkuan Wang et.al. | 2310.04808 | link |
2023-10-07 | Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Semantic Segmentation | Jingyi Pan et.al. | 2310.04747 | null |
2023-10-06 | DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions | Sanket Kalwar et.al. | 2310.04181 | null |
2023-10-06 | Improving Neural Radiance Field using Near-Surface Sampling with Point Cloud Generation | Hye Bin Yoo et.al. | 2310.04152 | null |
2023-10-05 | High-Degrees-of-Freedom Dynamic Neural Fields for Robot Self-Modeling and Motion Planning | Lennart Schulze et.al. | 2310.03624 | null |
2023-10-05 | Targeted Adversarial Attacks on Generalizable Neural Radiance Fields | Andras Horvath et.al. | 2310.03578 | null |
2023-10-05 | BID-NeRF: RGB-D image pose estimation with inverted Neural Radiance Fields | Ágoston István Csehi et.al. | 2310.03563 | null |
2023-10-05 | V2X Cooperative Perception for Autonomous Driving: Recent Advances and Challenges | Tao Huang et.al. | 2310.03525 | null |
2023-10-05 | RadaRays: Real-time Simulation of Rotating FMCW Radar for Mobile Robotics via Hardware-accelerated Ray Tracing | Alexander Mock et.al. | 2310.03505 | link |
2023-10-05 | Point-Based Radiance Fields for Controllable Human Motion Synthesis | Haitao Yu et.al. | 2310.03375 | link |
2023-10-05 | A Two-stage Based Social Preference Recognition in Multi-Agent Autonomous Driving System | Jintao Xue et.al. | 2310.03303 | null |
2023-10-04 | Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation | Yihan Wu et.al. | 2310.03125 | null |
2023-10-13 | LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving | Hao Sha et.al. | 2310.03026 | null |
2023-10-04 | Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day | Yifan Jiang et.al. | 2310.03015 | null |
2023-10-04 | Curve Trajectory Model for Human Preferred Path Planning of Automated Vehicles | Gergo Igneczi et.al. | 2310.02696 | null |
2023-10-05 | USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields | Moyang Li et.al. | 2310.02687 | link |
2023-10-04 | Adaptive Spatio-Temporal Voxels Based Trajectory Planning for Autonomous Driving in Highway Traffic Flow | Zhiqiang Jian et.al. | 2310.02625 | link |
Traffic Simulation
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-15 | AORRTC: Almost-Surely Asymptotically Optimal Planning with RRT-Connect | Tyler Wilson et.al. | 2505.10542 | null |
2025-05-15 | pc-dbCBS: Kinodynamic Motion Planning of Physically-Coupled Robot Teams | Khaled Wahba et.al. | 2505.10355 | null |
2025-05-15 | MambaControl: Anatomy Graph-Enhanced Mamba ControlNet with Fourier Refinement for Diffusion-Based Disease Trajectory Prediction | Hao Yang et.al. | 2505.09965 | null |
2025-05-14 | Quantum-Enhanced Parameter-Efficient Learning for Typhoon Trajectory Forecasting | Chen-Yu Liu et.al. | 2505.09395 | null |
2025-05-14 | Improved Corner Cutting Constraints for Mixed-Integer Motion Planning of a Differential Drive Micro-Mobility Vehicle | Angelo Caregnato-Neto et.al. | 2505.09359 | null |
2025-05-14 | Robot-Assisted Drone Recovery on a Wavy Surface Using Error-State Kalman Filter and Receding Horizon Model Predictive Control | Yimou Wu et.al. | 2505.09145 | null |
2025-05-14 | Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions | Letian Wang et.al. | 2505.09074 | null |
2025-05-13 | Multi-step manipulation task and motion planning guided by video demonstration | Kateryna Zorina et.al. | 2505.08949 | null |
2025-05-13 | Continuous World Coverage Path Planning for Fixed-Wing UAVs using Deep Reinforcement Learning | Mirco Theile et.al. | 2505.08382 | null |
2025-05-12 | Virtual Holonomic Constraints in Motion Planning: Revisiting Feasibility and Limitations | Maksim Surov et.al. | 2505.07983 | null |
2025-05-08 | A Physics-informed End-to-End Occupancy Framework for Motion Planning of Autonomous Vehicles | Shuqi Shen et.al. | 2505.07855 | null |
2025-05-12 | Intuitive Human-Robot Interfaces Leveraging on Autonomy Features for the Control of Highly-redundant Robots | Davide Torielli et.al. | 2505.07668 | null |
2025-05-12 | AIS Data-Driven Maritime Monitoring Based on Transformer: A Comprehensive Review | Zhiye Xie et.al. | 2505.07374 | link |
2025-05-13 | Human Motion Prediction via Test-domain-aware Adaptation with Easily-available Human Motions Estimated from Videos | Katsuki Shimbo et.al. | 2505.07301 | null |
2025-05-12 | A Framework for Joint Grasp and Motion Planning in Confined Spaces | Martin Rudorfer et.al. | 2505.07259 | null |
2025-05-12 | Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking | Mohamed Nagy et.al. | 2505.07254 | null |
2025-05-11 | YOPOv2-Tracker: An End-to-End Agile Tracking and Navigation Framework from Perception to Action | Junjie Lu et.al. | 2505.06923 | null |
2025-05-11 | Beyond Patterns: Harnessing Causal Logic for Autonomous Driving Trajectory Prediction | Bonan Wang et.al. | 2505.06856 | null |
2025-05-11 | cpRRTC: GPU-Parallel RRT-Connect for Constrained Motion Planning | Jiaming Hu et.al. | 2505.06791 | null |
2025-05-10 | TPK: Trustworthy Trajectory Prediction Integrating Prior Knowledge For Interpretability and Kinematic Feasibility | Marius Baden et.al. | 2505.06743 | null |
2025-05-10 | Boundary-Guided Trajectory Prediction for Road Aware and Physically Feasible Autonomous Driving | Ahmed Abouelazm et.al. | 2505.06740 | null |
2025-05-10 | Motion Planning for Autonomous Vehicles: When Model Predictive Control Meets Ensemble Kalman Smoothing | Iman Askari et.al. | 2505.06666 | null |
2025-05-09 | Realistic Adversarial Attacks for Robustness Evaluation of Trajectory Prediction Models via Future State Perturbation | Julian F. Schumann et.al. | 2505.06134 | link |
2025-05-09 | KRRF: Kinodynamic Rapidly-exploring Random Forest algorithm for multi-goal motion planning | Petr Ježek et.al. | 2505.06126 | null |
2025-05-09 | Collecting Human Motion Data in Large and Occlusion-Prone Environments using Ultra-Wideband Localization | Janik Kaden et.al. | 2505.05851 | null |
2025-05-09 | Physics-informed Temporal Difference Metric Learning for Robot Motion Planning | Ruiqi Ni et.al. | 2505.05691 | link |
2025-05-08 | Closing the Loop: Motion Prediction Models beyond Open-Loop Benchmarks | Mohamed-Khalil Bouzidi et.al. | 2505.05638 | null |
2025-05-07 | Occupancy World Model for Robots | Zhang Zhang et.al. | 2505.05512 | null |
2025-05-08 | A Vehicle System for Navigating Among Vulnerable Road Users Including Remote Operation | Oscar de Groot et.al. | 2505.04982 | null |
2025-05-08 | LVLM-MPC Collaboration for Autonomous Driving: A Safety-Aware and Task-Scalable Control Architecture | Kazuki Atsuta et.al. | 2505.04980 | null |
2025-05-08 | Real-Time Model Predictive Control of Vehicles with Convex-Polygon-Aware Collision Avoidance in Tight Spaces | Haruki Kojima et.al. | 2505.04935 | null |
2025-05-07 | Dynamic Network Flow Optimization for Task Scheduling in PTZ Camera Surveillance Systems | Mohammad Merati et.al. | 2505.04596 | null |
2025-05-07 | Stow: Robotic Packing of Items into Fabric Pods | Nicolas Hudson et.al. | 2505.04572 | null |
2025-05-07 | TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution | Zhikai Zhao et.al. | 2505.04480 | link |
2025-05-06 | Meta-Optimization and Program Search using Language Models for Task and Motion Planning | Denis Shcherba et.al. | 2505.03725 | null |
2025-05-06 | Simulation to Reality: Testbeds and Architectures for Connected and Automated Vehicles | David Klüner et.al. | 2505.03472 | null |
2025-05-06 | RIFT: Closed-Loop RL Fine-Tuning for Realistic and Controllable Traffic Simulation | Keyu Chen et.al. | 2505.03344 | null |
2025-05-06 | Enabling Robots to Autonomously Search Dynamic Cluttered Post-Disaster Environments | Karlo Rado et.al. | 2505.03283 | null |
2025-04-29 | Floating Car Observers in Intelligent Transportation Systems: Detection Modeling and Temporal Insights | Jeremias Gerner et.al. | 2505.02845 | null |
2025-05-05 | ZeloS – A Research Platform for Early-Stage Validation of Research Findings Related to Automated Driving | Christopher Bohn et.al. | 2505.02460 | null |
2025-05-05 | A Real-Time Control Barrier Function-Based Safety Filter for Motion Planning with Arbitrary Road Boundary Constraints | Jianye Xu et.al. | 2505.02395 | link |
2025-05-04 | Probabilistic Method for Optimizing Submarine Search and Rescue Strategy Under Environmental Uncertainty | Runhao Liu et.al. | 2505.02186 | null |
2025-05-02 | Phasing Through the Flames: Rapid Motion Planning with the AGHF PDE for Arbitrary Objective Functions and Constraints | Challen Enninful Adu et.al. | 2505.01589 | null |
2025-05-13 | Open-Source LLM-Driven Federated Transformer for Predictive IoV Management | Yazan Otoum et.al. | 2505.00651 | null |
2025-05-01 | Visual Trajectory Prediction of Vessels for Inland Navigation | Alexander Puzicha et.al. | 2505.00599 | null |
2025-05-01 | ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models | Jiarong Wei et.al. | 2505.00586 | null |
2025-05-01 | Safety-Critical Traffic Simulation with Guided Latent Diffusion Model | Mingxing Peng et.al. | 2505.00515 | null |
2025-05-02 | InterLoc: LiDAR-based Intersection Localization using Road Segmentation with Automated Evaluation Method | Nguyen Hoang Khoi Tran et.al. | 2505.00512 | null |
2025-05-01 | Future-Oriented Navigation: Dynamic Obstacle Avoidance with One-Shot Energy-Based Multimodal Motion Prediction | Ze Zhang et.al. | 2505.00237 | null |
2025-04-30 | Leveraging Pre-trained Large Language Models with Refined Prompting for Online Task and Motion Planning | Huihui Guo et.al. | 2504.21596 | null |
2025-04-29 | Confidence-based Intent Prediction for Teleoperation in Bimanual Robotic Suturing | Zhaoyang Jacopo Hu et.al. | 2504.20761 | null |
2025-04-28 | Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer | Daniel Kienzle et.al. | 2504.19863 | link |
2025-04-28 | Robot Motion Planning using One-Step Diffusion with Noise-Optimized Approximate Motions | Tomoharu Aizu et.al. | 2504.19652 | null |
2025-04-27 | Geometric Gait Optimization for Kinodynamic Systems Using a Lie Group Integrator | Yanhao Yang et.al. | 2504.19072 | null |
2025-04-26 | A biconvex method for minimum-time motion planning through sequences of convex sets | Tobia Marcucci et.al. | 2504.18978 | null |
2025-04-26 | Demonstrating DVS: Dynamic Virtual-Real Simulation Platform for Mobile Robotic Tasks | Zijie Zheng et.al. | 2504.18944 | null |
2025-04-29 | Hierarchical Temporal Logic Task and Motion Planning for Multi-Robot Systems | Zhongqi Wei et.al. | 2504.18899 | link |
2025-04-25 | Collaborative Object Transportation in Space via Impact Interactions | Joris Verhagen et.al. | 2504.18667 | null |
2025-04-25 | Enhancing System Self-Awareness and Trust of AI: A Case Study in Trajectory Prediction and Planning | Lars Ullrich et.al. | 2504.18421 | null |
2025-04-24 | Mixed Bernstein-Fourier Approximants for Optimal Trajectory Generation with Periodic Behavior | Liraz Mudrik et.al. | 2504.17969 | null |
2025-04-24 | Beyond Task and Motion Planning: Hierarchical Robot Planning with General-Purpose Policies | Benned Hedegaard et.al. | 2504.17901 | null |
2025-04-24 | Terrain-Aware Kinodynamic Planning with Efficiently Adaptive State Lattices for Mobile Robot Navigation in Off-Road Environments | Eric R. Damm et.al. | 2504.17889 | null |
2025-04-24 | Learning Isometric Embeddings of Road Networks using Multidimensional Scaling | Juan Carlos Climent Pardo et.al. | 2504.17534 | null |
2025-04-25 | Highly Accurate and Diverse Traffic Data: The DeepScenario Open 3D Dataset | Oussema Dhaouadi et.al. | 2504.17371 | null |
2025-04-23 | Zero-shot Sim-to-Real Transfer for Reinforcement Learning-based Visual Servoing of Soft Continuum Arms | Hsin-Jung Yang et.al. | 2504.16916 | null |
2025-04-23 | MOSAIC: A Skill-Centric Algorithmic Framework for Long-Horizon Manipulation Planning | Itamar Mishani et.al. | 2504.16738 | null |
2025-04-24 | Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes | Duy-Tho Le et.al. | 2504.16443 | null |
2025-04-23 | SILM: A Subjective Intent Based Low-Latency Framework for Multiple Traffic Participants Joint Trajectory Prediction | Qu Weiming et.al. | 2504.16377 | null |
2025-04-22 | Monocular inspection of spacecraft under illumination constraints and avoidance regions | Tochukwu Elijah Ogri et.al. | 2504.15954 | null |
2025-04-23 | Bidirectional Task-Motion Planning Based on Hierarchical Reinforcement Learning for Strategic Confrontation | Qizhen Wu et.al. | 2504.15876 | null |
2025-04-22 | Dynamic Intent Queries for Motion Transformer-based Trajectory Prediction | Tobias Demmler et.al. | 2504.15766 | null |
2025-04-22 | SocialMOIF: Multi-Order Intention Fusion for Pedestrian Trajectory Prediction | Kai Chen et.al. | 2504.15616 | null |
2025-04-22 | RiskNet: Interaction-Aware Risk Forecasting for Autonomous Driving in Long-Tail Scenarios | Qichao Liu et.al. | 2504.15541 | null |
2025-04-18 | Learning Through Retrospection: Improving Trajectory Prediction for Automated Driving with Error Feedback | Steffen Hagedorn et.al. | 2504.13785 | null |
2025-04-25 | Equi-Euler GraphNet: An Equivariant, Temporal-Dynamics Informed Graph Neural Network for Dual Force and Trajectory Prediction in Multi-Body Systems | Vinay Sharma et.al. | 2504.13768 | null |
2025-04-18 | Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Yushen He et.al. | 2504.13647 | link |
2025-04-18 | Robot Navigation in Dynamic Environments using Acceleration Obstacles | Asher Stern et.al. | 2504.13637 | null |
2025-04-18 | An Addendum to NeBula: Towards Extending TEAM CoSTAR’s Solution to Larger Scale Environments | Ali Agha et.al. | 2504.13461 | null |
2025-04-17 | Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep Classification | Kumar Manas et.al. | 2504.13111 | null |
2025-04-17 | Versatile, Robust, and Explosive Locomotion with Rigid and Articulated Compliant Quadrupeds | Jiatao Ding et.al. | 2504.12854 | null |
2025-04-17 | UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty | Pengxuan Yang et.al. | 2504.12826 | link |
2025-04-17 | B*: Efficient and Optimal Base Placement for Fixed-Base Manipulators | Zihang Zhao et.al. | 2504.12719 | link |
2025-04-17 | UniPhys: Unified Planner and Controller with Diffusion for Flexible Physics-Based Character Control | Yan Wu et.al. | 2504.12540 | null |
2025-04-16 | Self-Supervised Traversability Learning with Online Prototype Adaptation for Off-Road Autonomous Driving | Yafeng Bu et.al. | 2504.12109 | null |
2025-04-16 | Generation of Paths for Motion Planning for a Dubins Vehicle on Sphere | Deepak Prakash Kumar et.al. | 2504.11832 | link |
2025-04-15 | LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation | Wei-Jer Chang et.al. | 2504.11521 | null |
2025-04-15 | GC-GAT: Multimodal Vehicular Trajectory Prediction using Graph Goal Conditioning and Cross-context Attention | Mahir Gulzar et.al. | 2504.11150 | null |
2025-04-15 | Superfast Configuration-Space Convex Set Computation on GPUs for Online Motion Planning | Peter Werner et.al. | 2504.10783 | link |
2025-04-14 | HyRRT-Connect: Bidirectional Motion Planning for Hybrid Dynamical Systems | Nan Wang et.al. | 2504.10699 | null |
2025-04-14 | Layered Multirate Control of Constrained Linear Systems | Charis Stamouli et.al. | 2504.10461 | null |
2025-04-14 | LMFormer: Lane based Motion Prediction Transformer | Harsh Yadav et.al. | 2504.10275 | null |
2025-04-12 | IMPACT: Behavioral Intention-aware Multimodal Trajectory Prediction with Adaptive Context Trimming | Jiawei Sun et.al. | 2504.09103 | null |
2025-04-12 | Synthetic Aircraft Trajectory Generation Using Time-Based VQ-VAE | Abdulmajid Murad et.al. | 2504.09101 | null |
2025-05-03 | Safe Flow Matching: Robot Motion Planning with Control Barrier Functions | Xiaobing Dai et.al. | 2504.08661 | null |
2025-04-21 | Tactile sensing enables vertical obstacle negotiation for elongate many-legged robots | Juntao He et.al. | 2504.08615 | null |
2025-04-14 | RINGO: Real-time Navigation with a Guiding Trajectory for Aerial Manipulators in Unknown Environments | Zhaopeng Zhang et.al. | 2504.08338 | null |
2025-04-11 | Interior Point Differential Dynamic Programming, Redux | Ming Xu et.al. | 2504.08278 | null |
2025-04-10 | Efficient Swept Volume-Based Trajectory Generation for Arbitrary-Shaped Ground Robot Navigation | Yisheng Li et.al. | 2504.07554 | null |
2025-04-10 | Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction | Junyi Ma et.al. | 2504.07375 | link |
2025-05-01 | Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments | Licheng Luo et.al. | 2504.07283 | null |
2025-04-09 | Leveraging GCN-based Action Recognition for Teleoperation in Daily Activity Assistance | Thomas M. Kwok et.al. | 2504.07001 | null |
2025-04-09 | Overcoming Dynamic Environments: A Hybrid Approach to Motion Planning for Manipulators | Ho Minh Quang Ngo et.al. | 2504.06596 | null |
2025-04-08 | Extended Version: Multi-Robot Motion Planning with Cooperative Localization | Anne Theurkauf et.al. | 2504.06429 | null |
2025-04-08 | Dictionary-free Koopman Predictive Control for Autonomous Vehicles in Mixed Traffic | Xu Shang et.al. | 2504.06240 | null |
2025-04-11 | A Self-Supervised Framework for Space Object Behaviour Characterisation | Ian Groves et.al. | 2504.06176 | null |
2025-04-08 | Accelerated Reeds-Shepp and Under-Specified Reeds-Shepp Algorithms for Mobile Robot Path Planning | Ibrahim Ibrahim et.al. | 2504.05921 | null |
2025-04-08 | POD: Predictive Object Detection with Single-Frame FMCW LiDAR Point Cloud | Yining Shi et.al. | 2504.05649 | null |
2025-04-07 | Lazy-DaSH: Lazy Approach for Hypergraph-based Multi-robot Task and Motion Planning | Seongwon Lee et.al. | 2504.05552 | link |
2025-04-07 | Path Database Guidance for Motion Planning | Amnon Attali et.al. | 2504.05550 | null |
2025-05-06 | DyTTP: Trajectory Prediction with Normalization-Free Transformers | JianLin Zhu et.al. | 2504.05356 | null |
2025-04-07 | MIAT: Maneuver-Intention-Aware Transformer for Spatio-Temporal Trajectory Prediction | Chandra Raskoti et.al. | 2504.05059 | null |
2025-05-11 | Wavelet Policy: Imitation Policy Learning in Frequency Domain with Wavelet Transforms | Changchuan Yang et.al. | 2504.04991 | link |
2025-04-07 | Constrained Gaussian Process Motion Planning via Stein Variational Newton Inference | Jiayun Li et.al. | 2504.04936 | null |
2025-04-07 | GAMDTP: Dynamic Trajectory Prediction with Graph Attention Mamba Network | Yunxiang Liu et.al. | 2504.04862 | null |
2025-04-06 | B4P: Simultaneous Grasp and Motion Planning for Object Placement via Parallelized Bidirectional Forests and Path Repair | Benjamin H. Leebron et.al. | 2504.04598 | null |
2025-04-06 | Data Scaling Laws for End-to-End Autonomous Driving | Alexander Naumann et.al. | 2504.04338 | null |
2025-04-04 | Deep Learning-Enhanced Robotic Subretinal Injection with Real-Time Retinal Motion Compensation | Tianle Wu et.al. | 2504.03939 | null |
2025-04-04 | Energy Efficient Planning for Repetitive Heterogeneous Tasks in Precision Agriculture | Shuangyu Xie et.al. | 2504.03938 | null |
2025-04-04 | Online Traffic Density Estimation using Physics-Informed Neural Networks | Dennis Wilkman et.al. | 2504.03483 | null |
2025-04-04 | Dynamic Objective MPC for Motion Planning of Seamless Docking Maneuvers | Oliver Schumann et.al. | 2504.03280 | link |
2025-05-07 | Mitigating the Impact of Electrode Shift on Classification Performance in Electromyography-Based Motion Prediction Using Sliding-Window Normalization | Taichi Tanaka et.al. | 2504.03196 | null |
2025-03-25 | Curvature-Constrained Vector Field for Motion Planning of Nonholonomic Robots | Yike Qiao et.al. | 2504.02852 | link |
2025-04-03 | L-LBVC: Long-Term Motion Estimation and Prediction for Learned Bi-Directional Video Compression | Yongqi Zhai et.al. | 2504.02560 | null |
2025-04-03 | Data-Driven Object Tracking: Integrating Modular Neural Networks into a Kalman Framework | Christian Alexander Holz et.al. | 2504.02519 | null |
2025-04-09 | End-to-End Driving with Online Trajectory Evaluation via BEV World Model | Yingyan Li et.al. | 2504.01941 | link |
2025-04-02 | Focal Mechanism Uncertainty Quantification In Ground Motion Simulations Of Le Teil Earthquake | Valeria Soto et.al. | 2504.01868 | null |
2025-04-02 | Virtual Target Trajectory Prediction for Stochastic Targets | Marc Schneider et.al. | 2504.01851 | null |
2025-04-02 | Pedestrian-Aware Motion Planning for Autonomous Driving in Complex Urban Scenarios | Korbinian Moller et.al. | 2504.01409 | link |
2025-04-02 | From Shadows to Safety: Occlusion Tracking and Risk Mitigation for Urban Autonomous Driving | Korbinian Moller et.al. | 2504.01408 | link |
2025-04-02 | A Retina-Inspired Pathway to Real-Time Motion Prediction inside Image Sensors for Extreme-Edge Intelligence | Subhradip Chakraborty et.al. | 2504.01275 | null |
2025-04-01 | A New Approach to Motion Planning in 3D for a Dubins Vehicle: Special Case on a Sphere | Deepak Prakash Kumar et.al. | 2504.01215 | link |
2025-03-27 | Gaze-Guided 3D Hand Motion Prediction for Detecting Intent in Egocentric Grasping Tasks | Yufei He et.al. | 2504.01024 | null |
2025-04-01 | Time-optimal Convexified Reeds-Shepp Paths on a Sphere | Sixu Li et.al. | 2504.00966 | link |
2025-04-01 | Design and Validation of an Intention-Aware Probabilistic Framework for Trajectory Prediction: Integrating COLREGS, Grounding Hazards, and Planned Routes | Dhanika Mahipala et.al. | 2504.00731 | null |
2025-04-01 | DecoFuse: Decomposing and Fusing the “What”, “Where”, and “How” for Brain-Inspired fMRI-to-Video Decoding | Chong Li et.al. | 2504.00432 | null |
2025-03-31 | Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction | Yizhou Huang et.al. | 2503.24272 | null |
2025-03-31 | Joint Modeling of Multiple Longitudinal Biomarkers and Survival Outcomes via Threshold Regression: Variability as a Predictor | Mingyan Yu et.al. | 2503.24146 | null |
2025-03-31 | A Reactive Framework for Whole-Body Motion Planning of Mobile Manipulators Combining Reinforcement Learning and SDF-Constrained Quadratic Programmi | Chenyu Zhang et.al. | 2503.23975 | null |
2025-03-31 | Less is More: Contextual Sampling for Nonlinear Data-Enabled Predictive Control | Julius Beerwerth et.al. | 2503.23890 | null |
2025-03-31 | A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective | Zhuoren Li et.al. | 2503.23650 | null |
2025-03-30 | Enhancing Human Motion Prediction via Multi-range Decoupling Decoding with Gating-adjusting Aggregation | Jiexin Wang et.al. | 2503.23381 | null |
2025-03-30 | OnSiteVRU: A High-Resolution Trajectory Dataset for High-Density Vulnerable Road Users | Zhangcun Yan et.al. | 2503.23365 | null |
2025-03-29 | Energy-Aware Lane Planning for Connected Electric Vehicles in Urban Traffic: Design and Vehicle-in-the-Loop Validation | Hansung Kim et.al. | 2503.23228 | null |
2025-03-29 | Adaptive Interactive Navigation of Quadruped Robots using Large Language Models | Kangjie Zhou et.al. | 2503.22942 | null |
2025-04-04 | Predictive Traffic Rule Compliance using Reinforcement Learning | Yanliang Huang et.al. | 2503.22925 | null |
2025-03-27 | Bayesian Inferential Motion Planning Using Heavy-Tailed Distributions | Ali Vaziri et.al. | 2503.22030 | null |
2025-03-27 | A Multi-Modal Knowledge-Enhanced Framework for Vessel Trajectory Prediction | Haomin Yu et.al. | 2503.21834 | null |
2025-03-27 | Cooking Task Planning using LLM and Verified by Graph Network | Ryunosuke Takebayashi et.al. | 2503.21564 | null |
2025-03-27 | Combining Graph Attention Networks and Distributed Optimization for Multi-Robot Mixed-Integer Convex Programming | Viet-Anh Le et.al. | 2503.21548 | null |
2025-04-01 | Fine-Grained Behavior and Lane Constraints Guided Trajectory Prediction Method | Wenyi Xiong et.al. | 2503.21477 | null |
2025-03-31 | On the order of the shortest solution sequences for the pebble motion problems | Tomoki Nakamigawa et.al. | 2503.20550 | null |
2025-03-26 | Combining Machine Learning and Sampling-Based Search for Multi-Goal Motion Planning with Dynamics | Yuanjie Lu et.al. | 2503.20530 | null |
2025-03-25 | Direct Post-Training Preference Alignment for Multi-Agent Motion Generation Models Using Implicit Feedback from Pre-training Demonstrations | Ran Tian et.al. | 2503.20105 | null |
2025-03-25 | ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation | Haoyu Fu et.al. | 2503.19755 | null |
2025-03-25 | Multi-Object Sketch Animation by Scene Decomposition and Motion Planning | Jingyu Liu et.al. | 2503.19351 | null |
2025-03-24 | Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving | Hongkuan Zhou et.al. | 2503.18730 | null |
2025-03-25 | A Universal Model Combining Differential Equations and Neural Networks for Ball Trajectory Prediction | Zhiwei Shi et.al. | 2503.18584 | null |
2025-03-23 | PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos | Hanxiao Jiang et.al. | 2503.17973 | null |
2025-03-21 | Physical Plausibility-aware Trajectory Prediction via Locomotion Embodiment | Hiromu Taketsugu et.al. | 2503.17267 | link |
2025-03-20 | Ground and Flight Locomotion for Two-Wheeled Drones via Model Predictive Path Integral Control | Gosuke Kojima et.al. | 2503.16715 | null |
2025-03-05 | Pedestrians and Robots: A Novel Dataset for Learning Distinct Social Navigation Forces | Subham Agrawal et.al. | 2503.16481 | null |
2025-04-28 | APEX-MR: Multi-Robot Asynchronous Planning and Execution for Cooperative Assembly | Philip Huang et.al. | 2503.15836 | null |
2025-03-20 | MobiFuse: Learning Universal Human Mobility Patterns through Cross-domain Data Fusion | Haoxuan Ma et.al. | 2503.15779 | null |
2025-03-19 | Experience-based Optimal Motion Planning Algorithm for Solving Difficult Planning Problems Using a Limited Dataset | Ryota Takamido et.al. | 2503.15715 | null |
2025-03-19 | GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving | William Ljungbergh et.al. | 2503.15672 | null |
2025-03-19 | Geometric Iterative Approach for Efficient Inverse Kinematics and Planning of Continuum Robots with a Floating Base Under Environment Constraints | Congjun Ma et.al. | 2503.14848 | null |
2025-03-18 | Stochastic Trajectory Prediction under Unstructured Constraints | Hao Ma et.al. | 2503.14203 | null |
2025-03-18 | Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning | Bozhou Zhang et.al. | 2503.14182 | link |
2025-03-18 | GPU-Accelerated Motion Planning of an Underactuated Forestry Crane in Cluttered Environments | Minh Nhat Vu et.al. | 2503.14160 | null |
2025-03-17 | MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation | Zhenyu Wu et.al. | 2503.13446 | null |
2025-03-17 | InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving | Ruiqi Song et.al. | 2503.13047 | null |
2025-03-17 | TA-GNN: Physics Inspired Time-Agnostic Graph Neural Network for Finger Motion Prediction | Tinghui Li et.al. | 2503.13034 | null |
2025-03-17 | COSMOS: Continuous Simplicial Neural Networks | Aref Einizade et.al. | 2503.12919 | null |
2025-03-16 | CDKFormer: Contextual Deviation Knowledge-Based Transformer for Long-Tail Trajectory Prediction | Yuansheng Lian et.al. | 2503.12695 | null |
2025-03-16 | Iterative Motion Planning in Multi-agent Systems with Opportunistic Communication under Disturbance | Neelanga Thelasingha et.al. | 2503.12457 | null |
2025-03-15 | Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation | Jianqi Gao et.al. | 2503.12036 | null |
2025-03-15 | Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training | Zhenxin Li et.al. | 2503.12030 | link |
2025-03-15 | DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes | Runfa Blark Li et.al. | 2503.11979 | null |
2025-03-30 | Controllable Latent Diffusion for Traffic Simulation | Yizhuo Xiao et.al. | 2503.11771 | link |
2025-04-06 | A High-Speed Time-Optimal Trajectory Generation Strategy via a Two-layer Planning Model | Haotian Tan et.al. | 2503.11072 | null |
2025-03-14 | Enhancing Adaptivity of Two-Fingered Object Reorientation Using Tactile-based Online Optimization of Deconstructed Actions | Qiyin Huang et.al. | 2503.11041 | null |
2025-03-13 | Graph-Grounded LLMs: Leveraging Graphical Function Calling to Minimize LLM Hallucinations | Piyush Gupta et.al. | 2503.10941 | null |
2025-03-13 | Transferring Kinesthetic Demonstrations across Diverse Objects for Manipulation Planning | Dibyendu Das et.al. | 2503.10904 | null |
2025-03-13 | Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM | Yizhou Huang et.al. | 2503.10898 | null |
2025-03-13 | Stratified Topological Autonomy for Long-Range Coordination (STALC) | Cora A. Dimmig et.al. | 2503.10475 | null |
2025-03-13 | Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback | Derun Li et.al. | 2503.10434 | null |
2025-04-07 | CODEI: Resource-Efficient Task-Driven Co-Design of Perception and Decision Making for Mobile Robots Applied to Autonomous Vehicles | Dejan Milojevic et.al. | 2503.10296 | null |
2025-03-13 | IMPACT: Intelligent Motion Planning with Acceptable Contact Trajectories via Vision-Language Models | Yiyang Ling et.al. | 2503.10110 | null |
2025-03-13 | MoFlow: One-Step Flow Matching for Human Trajectory Forecasting via Implicit Maximum Likelihood Estimation based Distillation | Yuxiang Fu et.al. | 2503.09950 | null |
2025-03-12 | Vi-LAD: Vision-Language Attention Distillation for Socially-Aware Robot Navigation in Dynamic Environments | Mohamed Elnoor et.al. | 2503.09820 | null |
2025-03-12 | Post-interactive Multimodal Trajectory Prediction for Autonomous Driving | Ziyi Huang et.al. | 2503.09366 | null |
2025-03-12 | Large-scale Regional Traffic Signal Control Based on Single-Agent Reinforcement Learning | Qiang Li et.al. | 2503.09252 | null |
2025-05-12 | SICNav-Diffusion: Safe and Interactive Crowd Navigation with Diffusion Trajectory Predictions | Sepehr Samavi et.al. | 2503.08858 | null |
2025-03-11 | Geometric Data-Driven Multi-Jet Locomotion Inspired by Salps | Yanhao Yang et.al. | 2503.08817 | null |
2025-03-11 | Cross-Embodiment Robotic Manipulation Synthesis via Guided Demonstrations through CycleVAE and Human Behavior Transformer | Apan Dastider et.al. | 2503.08622 | null |
2025-03-11 | HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder | Yingqi Tang et.al. | 2503.08612 | link |
2025-03-11 | LiSu: A Dataset and Method for LiDAR Surface Normal Estimation | Dušan Malić et.al. | 2503.08601 | null |
2025-03-11 | Soft Actor-Critic-based Control Barrier Adaptation for Robust Autonomous Navigation in Unknown Environments | Nicholas Mohammad et.al. | 2503.08479 | link |
2025-03-11 | General-Purpose Aerial Intelligent Agents Empowered by Large Language Models | Ji Zhao et.al. | 2503.08302 | null |
2025-03-11 | Control Barrier Functions for Prescribed-time Reach-Avoid-Stay Tasks using Spatiotemporal Tubes | Ratnangshu Das et.al. | 2503.08106 | null |
2025-03-11 | STGDPM:Vessel Trajectory Prediction with Spatio-Temporal Graph Diffusion Probabilistic Model | Jin Wenzhe et.al. | 2503.08065 | null |
2025-03-11 | Elastic Motion Policy: An Adaptive Dynamical System for Robust and Efficient One-Shot Imitation Learning | Tianyu Li et.al. | 2503.08029 | null |
2025-03-11 | SGNetPose+: Stepwise Goal-Driven Networks with Pose Information for Trajectory Prediction in Autonomous Driving | Akshat Ghiya et.al. | 2503.08016 | null |
2025-03-11 | HEATS: A Hierarchical Framework for Efficient Autonomous Target Search with Mobile Manipulators | Hao Zhang et.al. | 2503.07986 | null |
2025-03-10 | LTLCodeGen: Code Generation of Syntactically Correct Temporal Logic for Robot Task Planning | Behrad Rabiei et.al. | 2503.07902 | null |
2025-05-15 | Multi-layer Motion Planning with Kinodynamic and Spatio-Temporal Constraints | Jeel Chatrola et.al. | 2503.07762 | null |
2025-03-10 | A Task and Motion Planning Framework Using Iteratively Deepened AND/OR Graph Networks | Hossein Karami et.al. | 2503.07700 | null |
2025-03-05 | Impact of Level 2/3 Automated Driving Technology on Road Work Zone Safety | Zhepu Xu et.al. | 2503.07634 | null |
2025-03-10 | CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving | Ziliang Xiong et.al. | 2503.07425 | null |
2025-03-10 | LEGO-Motion: Learning-Enhanced Grids with Occupancy Instance Modeling for Class-Agnostic Motion Prediction | Kangan Qian et.al. | 2503.07367 | null |
2025-05-15 | Temporal Triplane Transformers as Occupancy World Models | Haoran Xu et.al. | 2503.07338 | null |
2025-04-25 | Multi-Robot System for Cooperative Exploration in Unknown Environments: A Survey | Chuqi Wang et.al. | 2503.07278 | null |
2025-05-07 | Generative AI in Transportation Planning: A Survey | Longchao Da et.al. | 2503.07158 | null |
2025-03-10 | Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense | Yuting Hu et.al. | 2503.07020 | null |
2025-03-10 | GUIDE-CoT: Goal-driven and User-Informed Dynamic Estimation for Pedestrian Trajectory using Chain-of-Thought | Sungsik Kim et.al. | 2503.06832 | null |
2025-03-09 | Chance-Constrained Trajectory Planning with Multimodal Environmental Uncertainty | Kai Ren et.al. | 2503.06779 | link |
2025-03-09 | pRRTC: GPU-Parallel RRT-Connect for Fast, Consistent, and Low-Cost Motion Planning | Chih H. Huang et.al. | 2503.06757 | link |
2025-03-09 | Quantum Speedup in Dissecting Roots and Solving Nonlinear Algebraic Equations | Nhat A. Nghiem et.al. | 2503.06609 | null |
2025-03-09 | Reduced-Order Model-Based Gait Generation for Snake Robot Locomotion using NMPC | Adarsh Salagame et.al. | 2503.06402 | null |
2025-03-08 | FlowMP: Learning Motion Fields for Robot Planning with Conditional Flow Matching | Khang Nguyen et.al. | 2503.06135 | null |
2025-03-08 | T-CBF: Traversability-based Control Barrier Function to Navigate Vertically Challenging Terrain | Manas Gupta et.al. | 2503.06083 | null |
2025-03-08 | TransParking: A Dual-Decoder Transformer Framework with Soft Localization for End-to-End Automatic Parking | Hangyu Du et.al. | 2503.06071 | null |
2025-05-07 | Evaluation Framework for Sensor Configuration Impact on Deep Learning-Based Perception | A Gamage et.al. | 2503.05939 | link |
2025-03-04 | DriveGen: Towards Infinite Diverse Traffic Scenarios with Large Models | Shenyu Zhang et.al. | 2503.05808 | null |
2025-03-10 | Accelerating db-A* for Kinodynamic Motion Planning Using Diffusion | Julius Franke et.al. | 2503.05539 | null |
2025-03-07 | Self-Modeling Robots by Photographing | Kejun Hu et.al. | 2503.05398 | null |
2025-03-07 | Evidential Uncertainty Estimation for Multi-Modal Trajectory Prediction | Sajad Marvi et.al. | 2503.05274 | null |
2025-03-07 | Safety-Critical Traffic Simulation with Adversarial Transfer of Driving Intentions | Zherui Huang et.al. | 2503.05180 | null |
2025-03-07 | A Comprehensive LLM-powered Framework for Driving Intelligence Evaluation | Shanhe You et.al. | 2503.05164 | null |
2025-03-06 | INTENT: Trajectory Prediction Framework with Intention-Guided Contrastive Clustering | Yihong Tang et.al. | 2503.04952 | null |
2025-03-06 | SAFE-TAXI: A Hierarchical Multi-UAS Safe Auto-Taxiing Framework with Runtime Safety Assurance and Conflict Resolution | Kartik A. Pant et.al. | 2503.04942 | null |
2025-03-06 | Curiosity-Driven Imagination: Discovering Plan Operators and Learning Associated Policies for Open-World Adaptation | Pierrick Lorang et.al. | 2503.04931 | null |
2025-05-06 | Neural Configuration-Space Barriers for Manipulation Planning and Control | Kehan Long et.al. | 2503.04929 | null |
2025-03-06 | AUTOFRAME – A Software-driven Integration Framework for Automotive Systems | Sven Kirchner et.al. | 2503.04928 | null |
2025-03-13 | DA-STGCN: 4D Trajectory Prediction Based on Spatiotemporal Feature Extraction | Yuheng Kuang et.al. | 2503.04823 | null |
2025-03-06 | SeGMan: Sequential and Guided Manipulation Planner for Robust Planning in 2D Constrained Environments | Cankut Bora Tuncer et.al. | 2503.04409 | null |
2025-03-07 | Towards Autonomous Reinforcement Learning for Real-World Robotic Manipulation with Large Language Models | Niccolò Turcato et.al. | 2503.04280 | null |
2025-03-06 | Simulation-based Analysis Of Highway Trajectory Planning Using High-Order Polynomial For Highly Automated Driving Function | Milin Patel et.al. | 2503.04159 | link |
2025-03-05 | GO-VMP: Global Optimization for View Motion Planning in Fruit Mapping | Allen Isaac Jose et.al. | 2503.03912 | null |
2025-03-05 | Motion Planning and Control with Unknown Nonlinear Dynamics through Predicted Reachability | Zhiquan Zhang et.al. | 2503.03633 | null |
2025-04-02 | TeraSim: Uncovering Unknown Unsafe Events for Autonomous Vehicles through Generative Simulation | Haowei Sun et.al. | 2503.03629 | link |
2025-03-17 | Digital Twin-Enabled Blockage-Aware Dynamic mmWave Multi-Hop V2X Communication | Supat Roongpraiwan et.al. | 2503.03590 | null |
2025-03-05 | A Generative System for Robot-to-Human Handovers: from Intent Inference to Spatial Configuration Imagery | Hanxin Zhang et.al. | 2503.03579 | null |
2025-03-05 | Unified Human Localization and Trajectory Prediction with Monocular Vision | Po-Chien Luan et.al. | 2503.03535 | link |
2025-03-05 | Trajectory Prediction for Autonomous Driving: Progress, Limitations, and Future Directions | Nadya Abdel Madjid et.al. | 2503.03262 | null |
2025-05-08 | Don’t Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving | Ziying Song et.al. | 2503.03125 | link |
2025-03-05 | BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving | Katharina Winter et.al. | 2503.03074 | link |
2025-05-01 | ArticuBot: Learning Universal Articulated Object Manipulation Policy via Large Scale Simulation | Yufei Wang et.al. | 2503.03045 | null |
2025-03-04 | Scalable Multi-Robot Task Allocation and Coordination under Signal Temporal Logic Specifications | Wenliang Liu et.al. | 2503.02719 | null |
2025-03-05 | SEB-Naver: A SE(2)-based Local Navigation Framework for Car-like Robots on Uneven Terrain | Xiaoying Li et.al. | 2503.02412 | link |
2025-03-04 | Controllable Motion Generation via Diffusion Modal Coupling | Luobin Wang et.al. | 2503.02353 | link |
2025-03-04 | DeLTa: A Decoding Strategy based on Logit Trajectory Prediction Improves Factuality and Reasoning Ability | Yunzhen He et.al. | 2503.02343 | link |
2025-03-03 | NavG: Risk-Aware Navigation in Crowded Environments Based on Reinforcement Learning with Guidance Points | Qianyi Zhang et.al. | 2503.02111 | null |
2025-03-03 | Code-as-Symbolic-Planner: Foundation Model-Based Robot Planning via Symbolic Code Generation | Yongchao Chen et.al. | 2503.01700 | null |
2025-03-03 | Trajectory Planning with Signal Temporal Logic Costs using Deterministic Path Integral Optimization | Patrick Halder et.al. | 2503.01476 | null |
2025-03-03 | Rational sequential parametrized topological complexity | Yuki Minowa et.al. | 2503.01123 | null |
2025-03-02 | Real-World Deployment and Assessment of a Multi-Agent Reinforcement Learning-Based Variable Speed Limit Control System | Yuhang Zhang et.al. | 2503.01017 | link |
2025-03-02 | TRACE: A Self-Improving Framework for Robot Behavior Forecasting with Vision-Language Models | Gokul Puthumanaillam et.al. | 2503.00761 | null |
2025-03-01 | Sampling-Based Motion Planning with Discrete Configuration-Space Symmetries | Thomas Cohn et.al. | 2503.00614 | null |
2025-03-01 | Space-Time Graphs of Convex Sets for Multi-Robot Motion Planning | Jingtao Tang et.al. | 2503.00583 | link |
2025-03-01 | Enhancing Context-Aware Human Motion Prediction for Efficient Robot Handovers | Gerard Gómez-Izquierdo et.al. | 2503.00576 | null |
2025-03-06 | Particle Trajectory Prediction in Discrete Element Simulations using a Graph-Based Interaction-Aware Model | Abhishek Setty et.al. | 2503.00215 | null |
2025-03-25 | RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete | Yuheng Ji et.al. | 2502.21257 | null |
2025-02-28 | A Minor-Testing Approach for Coordinated Motion Planning with Sliding Robots | Eduard Eiben et.al. | 2502.21175 | null |
2025-02-28 | Delayed-Decision Motion Planning in the Presence of Multiple Predictions | David Isele et.al. | 2502.20636 | null |
2025-03-04 | Unifying Model Predictive Path Integral Control, Reinforcement Learning, and Diffusion Models for Optimal Control and Planning | Yankai Li et.al. | 2502.20476 | null |
2025-02-27 | Minds on the Move: Decoding Trajectory Prediction in Autonomous Driving with Cognitive Insights | Haicheng Liao et.al. | 2502.20084 | null |
2025-02-27 | RouteRL: Multi-agent reinforcement learning framework for urban route choice with autonomous vehicles | Ahmet Onur Akman et.al. | 2502.20065 | link |
2025-02-27 | Tracailer: An Efficient Trajectory Planner for Tractor-Trailer Vehicles in Unstructured Environments | Long Xu et.al. | 2502.19832 | null |
2025-02-27 | Risk-aware Integrated Task and Motion Planning for Versatile Snake Robots under Localization Failures | Ashkan Jasour et.al. | 2502.19690 | null |
2025-02-26 | Image-Based Roadmaps for Vision-Only Planning and Control of Robotic Manipulators | Sreejani Chatterjee et.al. | 2502.19617 | null |
2025-02-26 | Hybrid Robot Learning for Automatic Robot Motion Planning in Manufacturing | Siddharth Singh et.al. | 2502.19340 | null |
2025-03-03 | Interpretable Data-Driven Ship Dynamics Model: Enhancing Physics-Based Motion Prediction with Parameter Optimization | Christos Papandreou et.al. | 2502.18696 | null |
2025-02-25 | Controllability and Displacement Analysis of a Three-Link Elastic Microswimmer: A Geometric Control Approach | Rossella Attanasi et.al. | 2502.18286 | null |
2025-02-25 | Patient Trajectory Prediction: Integrating Clinical Notes with Transformers | Sifal Klioui et.al. | 2502.18009 | link |
2025-02-24 | The Geometry of Optimal Gait Families for Steering Kinematic Locomoting Systems | Jinwoo Choi et.al. | 2502.17672 | null |
2025-02-24 | V-HOP: Visuo-Haptic 6D Object Pose Tracking | Hongyu Li et.al. | 2502.17434 | null |
2025-02-24 | HVIS: A Human-like Vision and Inference System for Human Motion Prediction | Kedi Lyu et.al. | 2502.16913 | null |
2025-02-24 | Characterizing Structured versus Unstructured Environments based on Pedestrians’ and Vehicles’ Motion Trajectories | Mahsa Golchoubian et.al. | 2502.16847 | null |
2025-02-25 | Co-MTP: A Cooperative Trajectory Prediction Framework with Multi-Temporal Fusion for Autonomous Driving | Xinyu Zhang et.al. | 2502.16589 | link |
2025-02-23 | An Expert Ensemble for Detecting Anomalous Scenes, Interactions, and Behaviors in Autonomous Driving | Tianchen Ji et.al. | 2502.16389 | null |
2025-02-21 | Human Motion Prediction, Reconstruction, and Generation | Canxuan Gang et.al. | 2502.15956 | null |
2025-02-20 | Getting SMARTER for Motion Planning in Autonomous Driving Systems | Montgomery Alban et.al. | 2502.15824 | link |
2025-02-21 | Enhanced Probabilistic Collision Detection for Motion Planning Under Sensing Uncertainty | Xiaoli Wang et.al. | 2502.15525 | null |
2025-02-19 | Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning | Rui Zhao et.al. | 2502.14917 | null |
2025-02-21 | BP-SGCN: Behavioral Pseudo-Label Informed Sparse Graph Convolution Network for Pedestrian and Heterogeneous Trajectory Prediction | Ruochen Li et.al. | 2502.14676 | link |
2025-02-20 | Watch Less, Feel More: Sim-to-Real RL for Generalizable Articulated Object Manipulation via Motion Adaptation and Impedance Control | Tan-Dzung Do et.al. | 2502.14457 | null |
2025-02-25 | Towards Robust and Secure Embodied AI: A Survey on Vulnerabilities and Attacks | Wenpeng Xing et.al. | 2502.13175 | null |
2025-02-17 | Calibration of Vehicular Traffic Simulation Models by Local Optimization | Davide Andrea Guastella et.al. | 2502.11585 | null |
2025-02-17 | A Framework for Learning Scoring Rules in Autonomous Driving Planning Systems | Zikang Xiong et.al. | 2502.11352 | null |
2025-02-18 | Motion planning for highly-dynamic unconditioned reflexes based on chained Signed Distance Functions | Ken Lin et.al. | 2502.10734 | null |
2025-02-15 | Semantics-aware Test-time Adaptation for 3D Human Pose Estimation | Qiuxia Lin et.al. | 2502.10724 | null |
2025-02-14 | Prediction uncertainty-aware planning using deep ensembles and trajectory optimisation | Anshul Nayak et.al. | 2502.10585 | null |
2025-02-13 | Knowledge Integration Strategies in Autonomous Vehicle Prediction and Planning: A Comprehensive Survey | Kumar Manas et.al. | 2502.10477 | null |
2025-02-14 | Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language Models | Chenrui Tie et.al. | 2502.10090 | null |
2025-02-13 | A Discontinuous Galerkin Method for Simulating 3D Seismic Wave Propagation in Nonlinear Rock Models: Verification and Application to the 2015 Mw 7.8 Gorkha Earthquake | Zihua Niu et.al. | 2502.09714 | link |
2025-02-13 | Enhancing Traffic Safety Analysis with Digital Twin Technology: Integrating Vehicle Dynamics and Environmental Factors into Microscopic Traffic Simulation | Guanhao Xu et.al. | 2502.09561 | null |
2025-02-13 | Real-Time Fast Marching Tree for Mobile Robot Motion Planning in Dynamic Environments | Jefferson Silveira et.al. | 2502.09556 | null |
2025-02-13 | Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model | Fei Shen et.al. | 2502.09533 | null |
2025-02-13 | Training Trajectory Predictors Without Ground-Truth Data | Mikolaj Kliniewski et.al. | 2502.08957 | null |
2025-03-13 | Knowledge-data fusion dominated vehicle platoon dynamics modeling and analysis: A physics-encoded deep learning approach | Hao Lyu et.al. | 2502.08658 | link |
2025-02-12 | Poly-Autoregressive Prediction for Modeling Interactions | Neerja Thakkar et.al. | 2502.08646 | null |
2025-03-12 | Not All Frame Features Are Equal: Video-to-4D Generation via Decoupling Dynamic-Static Features | Liying Yang et.al. | 2502.08377 | null |
2025-05-02 | Predictive Planner for Autonomous Driving with Consistency Models | Anjian Li et.al. | 2502.08033 | null |
2025-02-14 | A Simulation-Based Framework for Leveraging Shared Autonomous Vehicles to Enhance Disaster Evacuations in Rural Regions with a Focus on Vulnerable Populations | Alican Sevim et.al. | 2502.07787 | null |
2025-02-11 | Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving | Xiang Li et.al. | 2502.07309 | link |
2025-02-11 | Online Aggregation of Trajectory Predictors | Alex Tong et.al. | 2502.07178 | null |
2025-04-22 | AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection | Roohan Ahmed Khan et.al. | 2502.06725 | null |
2025-02-10 | Deferred-Decision Trajectory Optimization | Purnanand Elango et.al. | 2502.06623 | null |
2025-02-10 | Gaussian Process-driven Hidden Markov Models for Early Diagnosis of Infant Gait Anomalies | Luis Torres-Torres F. et.al. | 2502.06334 | null |
2025-02-10 | Interaction-aware Conformal Prediction for Crowd Navigation | Zhe Huang et.al. | 2502.06221 | link |
2025-04-29 | EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds | Lu Chen et.al. | 2502.05857 | null |
2025-02-12 | AToM: Adaptive Theory-of-Mind-Based Human Motion Prediction in Long-Term Human-Robot Interactions | Yuwen Liao et.al. | 2502.05792 | link |
2025-02-08 | Towards Learning Scalable Agile Dynamic Motion Planning for Robosoccer Teams with Policy Optimization | Brandon Ho et.al. | 2502.05526 | null |
2025-02-08 | Motion Planning of Nonholonomic Cooperative Mobile Manipulators | Keshab Patra et.al. | 2502.05462 | null |
2025-02-07 | Effective Sampling for Robot Motion Planning Through the Lens of Lattices | Itai Panasoff et.al. | 2502.04908 | null |
2025-02-07 | Online Robot Motion Planning Methodology Guided by Group Social Proxemics Feature | Xuan Mu et.al. | 2502.04837 | null |
2025-02-07 | The $α$ -Alternator: Dynamic Adaptation To Varying Noise Levels In Sequences Using The Vendi Score For Improved Robustness and Performance | Mohammad Reza Rezaei et.al. | 2502.04593 | null |
2025-02-06 | From Configuration-Space Clearance to Feature-Space Margin: Sample Complexity in Learning-Based Collision Detection | Sapir Tubul et.al. | 2502.04170 | null |
2025-02-12 | Large Language Models for Multi-Robot Systems: A Survey | Peihan Li et.al. | 2502.03814 | link |
2025-02-05 | Simultaneous Multi-Robot Motion Planning with Projected Diffusion Models | Jinhao Liang et.al. | 2502.03607 | null |
2025-02-05 | Contact-Aware Motion Planning Among Movable Objects | Haokun Wang et.al. | 2502.03317 | null |
2025-02-05 | Conditional Prediction by Simulation for Automated Driving | Fabian Konstantinidis et.al. | 2502.03286 | null |
2025-05-04 | Global Contact-Rich Planning with Sparsity-Rich Semidefinite Relaxations | Shucheng Kang et.al. | 2502.02829 | null |
2025-02-04 | Planning with affordances: Integrating learned affordance models and symbolic planning | Rajesh Mangannavar et.al. | 2502.02768 | null |
2025-02-04 | Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory Prediction | Ruochen Li et.al. | 2502.02504 | link |
2025-02-04 | VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models | Hila Chefer et.al. | 2502.02492 | null |
2025-02-04 | Risk-Aware Driving Scenario Analysis with Large Language Models | Yuan Gao et.al. | 2502.02145 | link |
2025-02-03 | Bayesian Approximation-Based Trajectory Prediction and Tracking with 4D Radar | Dong-In Kim et.al. | 2502.01357 | null |
2025-04-26 | On the Surprising Robustness of Sequential Convex Optimization for Contact-Implicit Motion Planning | Yulin Li et.al. | 2502.01055 | null |
2025-02-03 | Multi-Object Active Search and Tracking by Multiple Agents in Untrusted, Dynamically Changing Environments | Mingi Jeong et.al. | 2502.01041 | null |
2025-02-03 | Robust Trajectory Generation and Control for Quadrotor Motion Planning with Field-of-View Control Barrier Certification | Lishuo Pan et.al. | 2502.01009 | null |
2025-02-03 | Wizard of Shopping: Target-Oriented E-commerce Dialogue Generation with Decision Tree Branching | Xiangci Li et.al. | 2502.00969 | null |
2025-02-01 | FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps | Maximilian Leitenstern et.al. | 2502.00395 | link |
2025-01-28 | DISC: Dataset for Analyzing Driving Styles In Simulated Crashes for Mixed Autonomy | Sandip Sharan Senthil Kumar et.al. | 2502.00050 | null |
2025-04-22 | Mitigating Traffic Oscillations in Mixed Traffic Flow with Scalable Deep Koopman Predictive Control | Hao Lyu et.al. | 2502.00043 | link |
2025-01-30 | Agile and Cooperative Aerial Manipulation of a Cable-Suspended Load | Sihao Sun et.al. | 2501.18802 | null |
2025-01-30 | Pathways to Bubble and Skyrmion Lattice Formation in Fe/Gd Multilayers | Tim Titze et.al. | 2501.18459 | null |
2025-01-30 | GPD: Guided Polynomial Diffusion for Motion Planning | Ajit Srikanth et.al. | 2501.18229 | null |
2025-05-01 | Belief Roadmaps with Uncertain Landmark Evanescence | Erick Fuentes et.al. | 2501.17982 | null |
2025-01-29 | Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction | Kaiwei Luo et.al. | 2501.17459 | null |
2025-01-28 | Overcoming Semantic Dilution in Transformer-Based Next Frame Prediction | Hy Nguyen et.al. | 2501.16753 | null |
2025-01-28 | Dream to Drive with Predictive Individual World Model | Yinfeng Gao et.al. | 2501.16733 | link |
2025-01-28 | Toward Safe Integration of UAM in Terminal Airspace: UAM Route Feasibility Assessment using Probabilistic Aircraft Trajectory Prediction | Jungwoo Cho et.al. | 2501.16599 | null |
2025-01-27 | PackDiT: Joint Human Motion and Text Generation via Mutual Prompting | Zhongyu Jiang et.al. | 2501.16551 | null |
2025-01-27 | Modular Framework for Uncertainty Prediction in Autonomous Vehicle Motion Forecasting within Complex Traffic Scenarios | Han Wang et.al. | 2501.16480 | null |
2025-01-18 | Risk-Informed Diffusion Transformer for Long-Tail Trajectory Prediction in the Crash Scenario | Junlan Chen et.al. | 2501.16349 | null |
2025-01-27 | The Components of Collaborative Joint Perception and Prediction – A Conceptual Framework | Lei Wan et.al. | 2501.15860 | null |
2025-01-27 | Beyond In-Distribution Performance: A Cross-Dataset Study of Trajectory Prediction Robustness | Yue Yao et.al. | 2501.15842 | null |
2025-01-27 | Navigation Framework for Blind and Visually Impaired Persons based on Sensor Fusion | Chathurika S. Silva et.al. | 2501.15819 | null |
2025-01-25 | Zero-shot Robotic Manipulation with Language-guided Instruction and Formal Task Planning | Junfeng Tang et.al. | 2501.15214 | null |
2025-01-25 | Understanding via Gaze: Gaze-based Task Decomposition for Imitation Learning of Robot Manipulation | Ryo Takizawa et.al. | 2501.15071 | link |
2025-01-24 | Robustified Time-optimal Point-to-point Motion Planning and Control under Uncertainty | Shuhao Zhang et.al. | 2501.14526 | null |
2025-01-23 | Time-Dependent Queuing Model for Traffic Congestion Using Mt/D/1/K: Simulation and Policy Insights | Jyoutir Raj et.al. | 2501.14132 | null |
2025-01-22 | A Spatio-temporal Graph Network Allowing Incomplete Trajectory Input for Pedestrian Trajectory Prediction | Juncen Long et.al. | 2501.13973 | null |
2025-01-23 | Where Do You Go? Pedestrian Trajectory Prediction using Scene Features | Mohammad Ali Rezaei et.al. | 2501.13848 | null |
2025-01-23 | Towards Real-World Validation of a Physics-Based Ship Motion Prediction Model | Michail Mathioudakis et.al. | 2501.13804 | null |
2025-01-23 | Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything | Huilin Yin et.al. | 2501.13461 | null |
2025-01-22 | Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning | Xiaolei Chen et.al. | 2501.12799 | null |
2025-01-21 | Multi-Agent Feedback Motion Planning using Probably Approximately Correct Nonlinear Model Predictive Control | Mark Gonzales et.al. | 2501.12234 | null |
2025-02-14 | DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM | Jesse Morris et.al. | 2501.11893 | link |
2025-01-20 | An Incremental Sampling and Segmentation-Based Approach for Motion Planning Infeasibility | Antony Thomas et.al. | 2501.11434 | null |
2025-01-20 | A Dynamic Improvement Framework for Vehicular Task Offloading | Qianren Li et.al. | 2501.11333 | null |
2025-02-16 | A Survey of World Models for Autonomous Driving | Tuo Feng et.al. | 2501.11260 | null |
2025-01-19 | Automatic Calibration of Mesoscopic Traffic Simulation Using Vehicle Trajectory Data | Ran Sun et.al. | 2501.10934 | null |
2025-01-18 | Graph Coloring to Reduce Computation Time in Prioritized Planning | Patrick Scheffe et.al. | 2501.10812 | link |
2025-01-18 | Simultaneous Computation with Multiple Prioritizations in Multi-Agent Motion Planning | Patrick Scheffe et.al. | 2501.10781 | link |
2025-01-18 | Assessing Markov Property in Driving Behaviors: Insights from Statistical Tests | Zheng Li et.al. | 2501.10625 | null |
2025-01-18 | RoMu4o: A Robotic Manipulation Unit For Orchard Operations Automating Proximal Hyperspectral Leaf Sensing | Mehrad Mortazavi et.al. | 2501.10621 | link |
2025-01-16 | ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction | Izzeddin Teeti et.al. | 2501.09878 | null |
2025-01-16 | Distilling Multi-modal Large Language Models for Autonomous Driving | Deepti Hegde et.al. | 2501.09757 | null |
2025-01-16 | Monte Carlo Tree Search with Velocity Obstacles for safe and efficient motion planning in dynamic environments | Lorenzo Bonanni et.al. | 2501.09649 | null |
2025-01-16 | Interoceptive Robots for Convergent Shared Control in Collaborative Construction Work | Xiaoshan Zhou et.al. | 2501.09290 | link |
2025-01-15 | Combining Movement Primitives with Contraction Theory | Moses C. Nah et.al. | 2501.09198 | null |
2025-01-14 | Data-driven Spatial Classification using Multi-Arm Bandits for Monitoring with Energy-Constrained Mobile Robots | Xiaoshan Lin et.al. | 2501.08222 | null |
2025-01-13 | Pedestrian Trajectory Prediction Based on Social Interactions Learning With Random Weights | Jiajia Xie et.al. | 2501.07711 | null |
2025-01-13 | Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis | Luwei Zeng et.al. | 2501.07034 | null |
2025-01-12 | Hierarchical Sampling-based Planner with LTL Constraints and Text Prompting | Jingzhan Ge et.al. | 2501.06719 | null |
2025-01-14 | Cooperative Aerial Robot Inspection Challenge: A Benchmark for Heterogeneous Multi-UAV Planning and Lessons Learned | Muqing Cao et.al. | 2501.06566 | null |
2025-01-11 | Whole-Body Integrated Motion Planning for Aerial Manipulators | Weiliang Deng et.al. | 2501.06493 | null |
2025-01-10 | CoDriveVLM: VLM-Enhanced Urban Cooperative Dispatching and Motion Planning for Future Autonomous Mobility on Demand Systems | Haichao Liu et.al. | 2501.06132 | link |
2025-04-03 | Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction | Cecilia Curreli et.al. | 2501.06035 | null |
2025-01-09 | Intelligent Sailing Model for Open Sea Navigation | Hanna Krasowski et.al. | 2501.04988 | null |
2025-01-08 | Towards Generalizable Trajectory Prediction Using Dual-Level Representation Learning And Adaptive Prompting | Kaouther Messaoud et.al. | 2501.04815 | null |
2025-01-08 | Traffic Simulations: Multi-City Calibration of Metropolitan Highway Networks | Chao Zhang et.al. | 2501.04783 | null |
2025-01-08 | A Survey on Path Planning Problem of Rolling Contacts: Approaches, Applications and Future Challenges | Seyed Amir Tafrishi et.al. | 2501.04442 | null |
2025-01-08 | Frenet-Serret-Based Trajectory Prediction | Shashank Verma et.al. | 2501.04273 | null |
2025-01-07 | Hybrid Machine Learning Model with a Constrained Action Space for Trajectory Prediction | Alexander Fertig et.al. | 2501.03666 | null |
2025-01-05 | Markov Decision Processes for Satellite Maneuver Planning and Collision Avoidance | William Kuhl et.al. | 2501.02667 | null |
2025-01-05 | UDMC: Unified Decision-Making and Control Framework for Urban Autonomous Driving with Motion Prediction of Traffic Participants | Haichao Liu et.al. | 2501.02530 | link |
2025-02-01 | Interpretable Neural ODEs for Gene Regulatory Network Discovery under Perturbations | Zaikang Lin et.al. | 2501.02409 | null |
2025-01-03 | Social Processes: Probabilistic Meta-learning for Adaptive Multiparty Interaction Forecasting | Augustinas Jučas et.al. | 2501.01915 | link |
2025-01-02 | K-ARC: Adaptive Robot Coordination for Multi-Robot Kinodynamic Planning | Mike Qin et.al. | 2501.01559 | null |
2025-01-01 | Spatial Temporal Attention based Target Vehicle Trajectory Prediction for Internet of Vehicles | Ouhan Huang et.al. | 2501.00890 | null |
2024-12-31 | Real-Time Sampling-Based Safe Motion Planning for Robotic Manipulators in Dynamic Environments | Nermin Covic et.al. | 2501.00507 | link |
2024-12-31 | Spatio-Temporal Multi-Subgraph GCN for 3D Human Motion Prediction | Jiexin Wang et.al. | 2501.00317 | null |
2024-12-31 | Temporal Dynamics Decoupling with Inverse Processing for Enhancing Human Motion Prediction | Jiexin Wang et.al. | 2501.00315 | null |
2024-12-31 | Distributed Traffic Control in Complex Dynamic Roadblocks: A Multi-Agent Deep RL Approach | Noor Aboueleneen et.al. | 2501.00211 | null |
2025-04-04 | TrajLearn: Trajectory Prediction Learning using Deep Generative Models | Amirhossein Nadiri et.al. | 2501.00184 | link |
2024-12-30 | DEMO: A Dynamics-Enhanced Learning Model for Multi-Horizon Trajectory Prediction in Autonomous Vehicles | Chengyue Wang et.al. | 2412.20784 | null |
2024-12-30 | Highway Managed Lane Usage and Tolling for Mixed Traffic Flows with Connected Automated Vehicles (CAVs) and High-Occupancy Vehicles (HOVs) | Max T. M. Ng et.al. | 2412.20667 | null |
2024-12-29 | A Predefined-Time Convergent and Noise-Tolerant Zeroing Neural Network Model for Time Variant Quadratic Programming With Application to Robot Motion Planning | Yi Yang et.al. | 2412.20477 | null |
2025-02-22 | Learning Policies for Dynamic Coalition Formation in Multi-Robot Task Allocation | Lucas C. D. Bezerra et.al. | 2412.20397 | null |
2024-12-29 | Subconscious Robotic Imitation Learning | Jun Xie et.al. | 2412.20368 | null |
2024-12-28 | Pushing Blocks via Checkable Gadgets: PSPACE-completeness of Push-1F and Block/Box Dude | Hayashi Ani et.al. | 2412.20079 | null |
2024-12-27 | Motion Planning Diffusion: Learning and Adapting Robot Motion Planning with Diffusion Models | J. Carvalho et.al. | 2412.19948 | null |
2024-12-27 | RobotDiffuse: Motion Planning for Redundant Manipulator based on Diffusion Model | Xiaohan Zhang et.al. | 2412.19500 | link |
2025-01-07 | Sketch-MoMa: Teleoperation for Mobile Manipulator via Interpretation of Hand-Drawn Sketches | Kosei Tanada et.al. | 2412.19153 | null |
2024-12-25 | Goal State Generation for Robotic Manipulation Based on Linguistically Guided Hybrid Gaussian Diffusion | Yichen Xu et.al. | 2412.18877 | null |
2024-12-22 | Optimal Traffic Flow in Quantum Annealing-Supported Virtual Traffic Lights | Abyad Enan et.al. | 2412.18776 | null |
2024-12-24 | Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner | Aizierjiang Aiersilan et.al. | 2412.18086 | link |
2024-12-23 | Falsification of Autonomous Systems in Rich Environments | Khen Elimelech et.al. | 2412.17992 | null |
2024-12-23 | DeepMF: Deep Motion Factorization for Closed-Loop Safety-Critical Driving Scenario Simulation | Yizhe Li et.al. | 2412.17487 | null |
2024-12-23 | Sampling-Based Constrained Motion Planning with Products of Experts | Amirreza Razmjoo et.al. | 2412.17462 | null |
2025-02-18 | Gradient-based Trajectory Optimization with Parallelized Differentiable Traffic Simulation | Sanghyun Son et.al. | 2412.16750 | link |
2024-12-21 | Effective and Efficient Representation Learning for Flight Trajectories | Shuo Liu et.al. | 2412.16581 | link |
2024-12-20 | Learning Group Interactions and Semantic Intentions for Multi-Object Trajectory Prediction | Mengshi Qi et.al. | 2412.15673 | link |
2024-12-19 | Co-optimization of Vehicle Dynamics and Powertrain Management for Connected and Automated Electric Vehicles | Zongtan Li et.al. | 2412.14984 | null |
2025-02-13 | EPN: An Ego Vehicle Planning-Informed Network for Target Trajectory Prediction | Saiqian Peng et.al. | 2412.14442 | link |
2024-12-17 | A Comprehensive Review on Traffic Datasets and Simulators for Autonomous Vehicles | Supriya Sarker et.al. | 2412.14207 | null |
2024-12-18 | On the Use of Abundant Road Speed Data for Travel Demand Calibration of Urban Traffic Simulators | Suyash Vishnoi et.al. | 2412.14089 | null |
2024-12-18 | Joint Perception and Prediction for Autonomous Driving: A Survey | Lucas Dal’Col et.al. | 2412.14088 | link |
2024-12-18 | An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training | Haiming Zhang et.al. | 2412.13772 | null |
2024-12-23 | THÖR-MAGNI Act: Actions for Human Motion Modeling in Robot-Shared Industrial Spaces | Tiago Rodrigues de Almeida et.al. | 2412.13729 | link |
2024-12-18 | Planning Human-Robot Co-manipulation with Human Motor Control Objectives and Multi-component Reaching Strategies | Kevin Haninger et.al. | 2412.13474 | null |
2024-12-18 | Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction | Chandra Raskoti et.al. | 2412.13419 | null |
2024-12-17 | Multi-Agent Motion Planning For Differential Drive Robots Through Stationary State Search | Jingtian Yan et.al. | 2412.13359 | link |
2024-12-17 | A Scalable Method for Optimal Path Planning on Manifolds via a Hopf-Lax Type Formula | Edward Huynh et.al. | 2412.13346 | link |
2024-12-24 | C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory Prediction | Zichen Wang et.al. | 2412.13231 | link |
2024-12-18 | HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction | Chen Bao et.al. | 2412.13187 | null |
2025-03-12 | Equivariant and Invariant Parametrized Topological Complexity | Ramandeep Singh Arora et.al. | 2412.12921 | null |
2025-01-27 | Towards Physically Interpretable World Models: Meaningful Weakly Supervised Representations for Visual Trajectory Prediction | Zhenjiang Mao et.al. | 2412.12870 | null |
2024-12-17 | DriveTester: A Unified Platform for Simulation-Based Autonomous Driving Testing | Mingfei Cheng et.al. | 2412.12656 | link |
2024-12-14 | Anti-bullying Adaptive Cruise Control: A proactive right-of-way protection approach | Jia Hu et.al. | 2412.12197 | null |
2024-12-05 | SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and Rollout | Chiyu Max Jiang et.al. | 2412.12129 | null |
2024-12-16 | Multimodal LLM for Intelligent Transportation Systems | Dexter Le et.al. | 2412.11683 | null |
2024-12-16 | NEST: A Neuromodulated Small-world Hypergraph Trajectory Prediction Model for Autonomous Driving | Chengyue Wang et.al. | 2412.11682 | null |
2024-12-16 | Multi-Scale Incremental Modeling for Enhanced Human Motion Prediction in Human-Robot Collaboration | Juncheng Zou et.al. | 2412.11632 | null |
2024-12-16 | Efficient Avoidance of Ellipsoidal Obstacles with Model Predictive Control for Mobile Robots and Vehicles | Mario Rosenfelder et.al. | 2412.11552 | null |
2025-03-07 | Budget-optimal multi-robot layout design for box sorting | Peiyu Zeng et.al. | 2412.11281 | null |
2024-12-14 | Impact of Trip Distance Distribution Time Dependency and Aggregation Levels in Bathtub Models – A Comparative Simulation Analysis | Jiayi Guo et.al. | 2412.10763 | null |
2024-12-13 | GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Wenzhao Zheng et.al. | 2412.10371 | link |
2024-12-13 | Adaptive Dual-Headway Unicycle Pose Control and Motion Prediction for Optimal Sampling-Based Feedback Motion Planning | Aykut İşleyen et.al. | 2412.10350 | null |
2024-12-12 | Should We Learn Contact-Rich Manipulation Policies from Sampling-Based Planners? | Huaijiang Zhu et.al. | 2412.09743 | null |
2024-12-12 | Doe-1: Closed-Loop Autonomous Driving with Large World Model | Wenzhao Zheng et.al. | 2412.09627 | link |
2025-03-16 | BaB-ND: Long-Horizon Motion Planning with Branch-and-Bound and Neural Dynamics | Keyi Shen et.al. | 2412.09584 | null |
2024-12-12 | Temporal-Assisted Beamforming and Trajectory Prediction in Sensing-Enabled UAV Communications | Shengcai Zhou et.al. | 2412.09097 | null |
2024-12-12 | Real-Time Algorithms for Game-Theoretic Motion Planning and Control in Autonomous Racing using Near-Potential Function | Dvij Kalaria et.al. | 2412.08855 | null |
2024-12-11 | GPD-1: Generative Pre-training for Driving | Zixun Xie et.al. | 2412.08643 | link |
2024-12-11 | Real-Time Trajectory Generation for Soft Robot Manipulators Using Differential Flatness | Akua Dickson et.al. | 2412.08568 | null |
2024-12-11 | SwarmGPT-Primitive: A Language-Driven Choreographer for Drone Swarms Using Safe Motion Primitive Composition | Vedant Vyas et.al. | 2412.08428 | null |
2024-12-13 | Physical Informed Driving World Model | Zhuoran Yang et.al. | 2412.08410 | null |
2024-12-11 | THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots | Zeshun Li et.al. | 2412.08096 | null |
2024-12-10 | Robust Multiple Description Neural Video Codec with Masked Transformer for Dynamic and Noisy Networks | Xinyue Hu et.al. | 2412.07922 | null |
2024-12-10 | RRT-GPMP2: A Motion Planner for Mobile Robots in Complex Maze Environments | Jiawei Meng et.al. | 2412.07683 | null |
2024-12-10 | Dynamic Obstacle Avoidance of Unmanned Surface Vehicles in Maritime Environments Using Gaussian Processes Based Motion Planning | Jiawei Meng et.al. | 2412.07664 | null |
2024-12-10 | Ontology-driven Prompt Tuning for LLM-based Task and Motion Planning | Muhayy Ud Din et.al. | 2412.07493 | null |
2024-12-10 | ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving | Rongqing Li et.al. | 2412.07369 | null |
2024-12-09 | Variable Selection for Comparing High-dimensional Time-Series Data | Kensuke Mitsuzawa et.al. | 2412.06870 | null |
2024-12-08 | doScenes: An Autonomous Driving Dataset with Natural Language Instruction for Human Interaction and Vision-Language Navigation | Parthib Roy et.al. | 2412.05893 | link |
2024-12-07 | Asymptotically Optimal Sampling-Based Path Planning Using Bidirectional Guidance Heuristic | Yi Wang et.al. | 2412.05754 | null |
2024-12-07 | Learning Soft Driving Constraints from Vectorized Scene Embeddings while Imitating Expert Trajectories | Niloufar Saeidi Mobarakeh et.al. | 2412.05717 | null |
2024-12-07 | Active Sequential Posterior Estimation for Sample-Efficient Simulation-Based Inference | Sam Griesemer et.al. | 2412.05590 | link |
2024-12-06 | FogROS2-FT: Fault Tolerant Cloud Robotics | Kaiyuan Chen et.al. | 2412.05408 | null |
2025-03-14 | Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models | Zhejun Zhang et.al. | 2412.05334 | link |
2025-03-04 | λ: A Benchmark for Data-Efficiency in Long-Horizon Indoor Mobile Manipulation Robotics | Ahmed Jaafar et.al. | 2412.05313 | null |
2024-12-05 | Socially-Informed Reconstruction for Pedestrian Trajectory Forecasting | Haleh Damirchi et.al. | 2412.04673 | link |
2024-12-05 | CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model | Ruoyu Yao et.al. | 2412.04209 | null |
2024-12-05 | Exploring Behaviors of Hybrid Systems via the Voronoi Bias over Output Signals | Gidon Ernst et.al. | 2412.04203 | null |
2024-12-05 | Towards Fast and Safety-Guaranteed Trajectory Planning and Tracking for Time-Varying Systems | Seth Siriya et.al. | 2412.04129 | null |
2025-03-10 | PriorMotion: Generative Class-Agnostic Motion Prediction with Raster-Vector Motion Field Priors | Kangan Qian et.al. | 2412.04020 | null |
2024-12-05 | Traffic Co-Simulation Framework Empowered by Infrastructure Camera Sensing and Reinforcement Learning | Talha Azfar et.al. | 2412.03925 | null |
2024-12-05 | A Two-stage Approach for Variable Selection in Joint Modeling of Multiple Longitudinal Markers and Competing Risk Outcomes | Taban Baghfalaki et.al. | 2412.03797 | link |
2024-12-04 | Predicting Pedestrian Crossing Behavior in Germany and Japan: Insights into Model Transferability | Chi Zhang et.al. | 2412.03689 | null |
2024-12-04 | Gaussian Processes for Probabilistic Estimates of Earthquake Ground Shaking: A 1-D Proof-of-Concept | Sam A. Scivier et.al. | 2412.03299 | link |
2024-12-04 | Resilient Timed Elastic Band Planner for Collision-Free Navigation in Unknown Environments | Geesara Kulathunga et.al. | 2412.03174 | null |
2024-12-03 | Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction | Ziqian Zou et.al. | 2412.02395 | null |
2024-12-09 | PROFIT: A Specialized Optimizer for Deep Fine Tuning | Anirudh S Chakravarthy et.al. | 2412.01930 | null |
2024-12-02 | Real-time Traffic Simulation and Management for Large-scale Urban Air Mobility: Integrating Route Guidance and Collision Avoidance | Canqiang Weng et.al. | 2412.01235 | null |
2024-12-02 | Integrating Decision-Making Into Differentiable Optimization Guided Learning for End-to-End Planning of Autonomous Vehicles | Wenru Liu et.al. | 2412.01234 | null |
2024-12-02 | A Hybrid Evolutionary Approach for Multi Robot Coordinated Planning at Intersections | Victor Parque et.al. | 2412.01082 | null |
2024-12-01 | QuakeFormer: A Uniform Approach to Earthquake Ground Motion Prediction Using Masked Transformers | Yitian Feng et.al. | 2412.00815 | null |
2024-11-30 | TAROT: Targeted Data Selection via Optimal Transport | Lan Feng et.al. | 2412.00420 | link |
2024-11-30 | ARMOR: Egocentric Perception for Humanoid Robot Collision Avoidance and Motion Planning | Daehwa Kim et.al. | 2412.00396 | null |
2024-11-30 | Efficient Multi-Robot Motion Planning for Manifold-Constrained Manipulators by Randomized Scheduling and Informed Path Generation | Weihang Guo et.al. | 2412.00366 | null |
2025-01-30 | Assessing How Ride-hailing Rebalancing Strategies Improve the Resilience of Multi-modal Transportation Systems | Euntak Lee et.al. | 2412.00276 | null |
2024-11-29 | A Multi-Loss Strategy for Vehicle Trajectory Prediction: Combining Off-Road, Diversity, and Directional Consistency Losses | Ahmad Rahimi et.al. | 2411.19747 | link |
2024-12-31 | Global Tensor Motion Planning | An T. Le et.al. | 2411.19393 | link |
2024-11-28 | Efficient calculation of time-optimal motion primitives for systems exhibiting oscillatory internal dynamics with multiple applications | Thomas Auer et.al. | 2411.19148 | null |
2024-11-28 | Computationally efficient trajectory design from motion primitives for near time-optimal transitions for systems with oscillating internal dynamics | Thomas Auer et.al. | 2411.19144 | null |
2024-11-28 | Planning Shorter Paths in Graphs of Convex Sets by Undistorting Parametrized Configuration Spaces | Shruti Garg et.al. | 2411.18913 | null |
2024-11-28 | ETSM: Automating Dissection Trajectory Suggestion and Confidence Map-Based Safety Margin Prediction for Robot-assisted Endoscopic Submucosal Dissection | Mengya Xu et.al. | 2411.18884 | null |
2024-11-27 | Towards Motion Compensation in Autonomous Robotic Subretinal Injections | Demir Arikan et.al. | 2411.18521 | null |
2024-11-26 | DECODE: Domain-aware Continual Domain Expansion for Motion Prediction | Boqi Li et.al. | 2411.17917 | link |
2025-03-07 | Nearest-Neighbourless Asymptotically Optimal Motion Planning with Fully Connected Informed Trees (FCIT*) | Tyler S. Wilson et.al. | 2411.17902 | null |
2024-11-26 | SIL-RRT*: Learning Sampling Distribution through Self Imitation Learning | Xuzhe Dang et.al. | 2411.17293 | null |
2024-11-27 | MotionWavelet: Human Motion Prediction via Wavelet Manifold Learning | Yuming Feng et.al. | 2411.16964 | null |
2024-11-23 | FollowGen: A Scaled Noise Conditional Diffusion Model for Car-Following Trajectory Prediction | Junwei You et.al. | 2411.16747 | null |
2024-12-17 | DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation | Zun Wang et.al. | 2411.16657 | null |
2024-11-25 | Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction | Haoming Li et.al. | 2411.16457 | null |
2024-11-25 | Deep Learning for Motion Classification in Ankle Exoskeletons Using Surface EMG and IMU Signals | Silas Ruhrberg Estévez et.al. | 2411.16273 | null |
2024-11-01 | Data-driven Modeling of Granular Chains with Modern Koopman Theory | Atoosa Parsa et.al. | 2411.15142 | null |
2024-11-21 | Energy Efficient Automated Driving as a GNEP: Vehicle-in-the-loop Experiments | Viranjan Bhattacharyya et.al. | 2411.14567 | null |
2024-11-21 | Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning | Jiange Yang et.al. | 2411.14519 | link |
2025-03-10 | U-Motion: Learned Point Cloud Video Compression with U-Structured Motion Estimation | Tingyu Fan et.al. | 2411.14501 | null |
2024-11-21 | Landing Trajectory Prediction for UAS Based on Generative Adversarial Network | Jun Xiang et.al. | 2411.14403 | null |
2024-11-21 | A Multi-Layer Blockchain Simulator and Performance Evaluation of Social Internet of Vehicles with Multi-Connectivity Management | Yi-Ting Sun et.al. | 2411.14000 | link |
2025-03-02 | Learning Two-agent Motion Planning Strategies from Generalized Nash Equilibrium for Model Predictive Control | Hansung Kim et.al. | 2411.13983 | link |
2024-11-20 | Bezier Reachable Polytopes: Efficient Certificates for Robust Motion Planning with Layered Architectures | Noel Csomay-Shanklin et.al. | 2411.13506 | null |
2024-11-20 | REVISE: Robust Probabilistic Motion Planning in a Gaussian Random Field | Alex Rose et.al. | 2411.13369 | null |
2024-11-26 | DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving | Xianda Guo et.al. | 2411.13112 | link |
2024-11-20 | On the relationship between Koopman operator approximations and neural ordinary differential equations for data-driven time-evolution predictions | Jake Buzhardt et.al. | 2411.12940 | null |
2024-11-19 | C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention | Xiaohe Li et.al. | 2411.12313 | null |
2024-11-18 | On-the-Go Path Planning and Repair in Static and Dynamic Scenarios | Daniel Ajeleye et.al. | 2411.12014 | null |
2024-11-17 | ModeSeq: Taming Sparse Multimodal Motion Prediction with Sequential Mode Modeling | Zikang Zhou et.al. | 2411.11911 | null |
2024-11-18 | Differentiable GPU-Parallelized Task and Motion Planning | William Shen et.al. | 2411.11833 | null |
2024-12-10 | cHyRRT and cHySST: Two Motion Planning Tools for Hybrid Dynamical Systems | Beverly Xu et.al. | 2411.11812 | null |
2024-11-17 | Map-Free Trajectory Prediction with Map Distillation and Hierarchical Encoding | Xiaodong Liu et.al. | 2411.10961 | null |
2024-11-16 | Hierarchical Adaptive Motion Planning with Nonlinear Model Predictive Control for Safety-Critical Collaborative Loco-Manipulation | Mohsen Sombolestan et.al. | 2411.10699 | link |
2024-12-20 | BMP: Bridging the Gap between B-Spline and Movement Primitives | Weiran Liao et.al. | 2411.10336 | null |
2024-11-15 | Imagine-2-Drive: High-Fidelity World Modeling in CARLA for Autonomous Vehicles | Anant Garg et.al. | 2411.10171 | null |
2024-11-15 | Planning by Simulation: Motion Planning with Learning-based Parallel Scenario Prediction for Autonomous Driving | Tian Niu et.al. | 2411.09887 | null |
2024-11-14 | MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation | Jonas Serych et.al. | 2411.09551 | link |
2024-11-14 | Risk-aware MPPI for Stochastic Hybrid Systems | Hardik Parwana et.al. | 2411.09198 | link |
2024-11-13 | Experience-based Subproblem Planning for Multi-Robot Motion Planning | Irving Solis et.al. | 2411.08851 | null |
2024-11-13 | DiVR: incorporating context from diverse VR scenes for human trajectory prediction | Franz Franco Gallo et.al. | 2411.08409 | null |
2024-11-13 | Open-World Task and Motion Planning via Vision-Language Model Inferred Constraints | Nishanth Kumar et.al. | 2411.08253 | null |
2024-11-12 | Trust-Aware Sybil Attack Detection for Resilient Vehicular Communication | Mortan Thomas et.al. | 2411.07520 | null |
2024-11-11 | Robust Nonprehensile Object Transportation with Uncertain Inertial Parameters | Adam Heins et.al. | 2411.07079 | link |
2024-11-11 | Scaling Long-Horizon Online POMDP Planning via Rapid State Space Sampling | Yuanchu Liang et.al. | 2411.07032 | null |
2024-11-11 | LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes | Zefan Qu et.al. | 2411.06757 | link |
2024-11-10 | Results of the 2023 CommonRoad Motion Planning Competition for Autonomous Vehicles | Niklas Kochdumper et.al. | 2411.06425 | null |
2024-11-10 | Impact-Aware Robotic Manipulation: Quantifying the Sim-To-Real Gap for Velocity Jumps | Jari van Steen et.al. | 2411.06319 | null |
2024-11-09 | Predictability Awareness for Efficient and Robust Multi-Agent Coordination | Roman Chiva Gil et.al. | 2411.06223 | null |
2024-11-09 | RRT* Based Optimal Trajectory Generation with Linear Temporal Logic Specifications under Kinodynamic Constraints | Saksham Gautam et.al. | 2411.06219 | null |
2024-11-12 | Cross-Domain Transfer Learning using Attention Latent Features for Multi-Agent Trajectory Prediction | Jia Quan Loh et.al. | 2411.06087 | null |
2024-11-07 | Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping | Bavo Lesy et.al. | 2411.04915 | null |
2024-11-07 | Rapid Quadrotor Navigation in Diverse Environments using an Onboard Depth Camera | Jonathan Lee et.al. | 2411.04326 | null |
2024-11-06 | UnityGraph: Unified Learning of Spatio-temporal features for Multi-person Motion Prediction | Kehua Qu et.al. | 2411.04151 | null |
2024-11-06 | Relation Learning and Aggregate-attention for Multi-person Motion Prediction | Kehua Qu et.al. | 2411.03729 | null |
2024-11-21 | Accelerating Gaussian Variational Inference for Motion Planning Under Uncertainty | Zinuo Chang et.al. | 2411.03416 | link |
2024-11-15 | Energy-Aware Predictive Motion Planning for Autonomous Vehicles Using a Hybrid Zonotope Constraint Representation | Joshua A. Robbins et.al. | 2411.03189 | null |
2024-11-04 | Multi-Transmotion: Pre-trained Model for Human Motion Prediction | Yang Gao et.al. | 2411.02673 | link |
2024-11-04 | SIRA: Scalable Inter-frame Relation and Association for Radar Perception | Ryoma Yataka et.al. | 2411.02220 | null |
2024-11-04 | Traffic and Safety Rule Compliance of Humans in Diverse Driving Situations | Michael Kurenkov et.al. | 2411.01909 | null |
2024-11-04 | Enhancing Social Robot Navigation with Integrated Motion Prediction and Trajectory Planning in Dynamic Human Environments | Thanh Nguyen Canh et.al. | 2411.01814 | link |
2024-11-03 | Interaction-Aware Trajectory Prediction for Safe Motion Planning in Autonomous Driving: A Transformer-Transfer Learning Approach | Jinhao Liang et.al. | 2411.01475 | null |
2024-11-03 | Wallbounce : Push wall to navigate with Contact-Implicit MPC | Xiaohan Liu et.al. | 2411.01387 | null |
2024-11-02 | Mixed-Integer MPC-Based Motion Planning Using Hybrid Zonotopes with Tight Relaxations | Joshua A. Robbins et.al. | 2411.01286 | null |
2024-11-02 | Generation of Conservative Dynamical Systems Based on Stiffness Encoding | Tengyu Hou et.al. | 2411.01120 | null |
2024-11-01 | NAMR-RRT: Neural Adaptive Motion Planning for Mobile Robots in Dynamic Environments | Zhirui Sun et.al. | 2411.00440 | null |
2024-11-01 | An Improved Rapidly Exploring Random Tree Algorithm for Path Planning in Configuration Spaces with Narrow Channels | Mathew Mithra Noel et.al. | 2411.00357 | null |
2024-10-31 | BOMP: Bin-Optimized Motion Planning | Zachary Tam et.al. | 2411.00221 | null |
2024-10-31 | Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and Benchmarking | Pranav Singh Chib et.al. | 2411.00174 | link |
2024-10-24 | VECTOR: Velocity-Enhanced GRU Neural Network for Real-Time 3D UAV Trajectory Prediction | Omer Nacar et.al. | 2410.23305 | link |
2024-10-15 | Trajectory Prediction for Autonomous Driving using Agent-Interaction Graph Embedding | Jilan Samiuddin et.al. | 2410.23298 | null |
2024-11-04 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et.al. | 2410.23262 | null |
2024-10-30 | An Efficient Representation of Whole-body Model Predictive Control for Online Compliant Dual-arm Mobile Manipulation | Wenqian Du et.al. | 2410.22910 | null |
2024-10-30 | A time (anti)symmetric approach to the double solution theory | Pierre Jamet et.al. | 2410.22838 | null |
2024-10-30 | SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving | Minh Tri Huynh et.al. | 2410.22752 | null |
2024-11-04 | Intelligent Mobility System with Integrated Motion Planning and Control Utilizing Infrastructure Sensor Nodes | Yufeng Yang et.al. | 2410.22527 | null |
2024-10-29 | Local Policies Enable Zero-shot Long-horizon Manipulation | Murtaza Dalal et.al. | 2410.22332 | null |
2024-10-29 | Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving | Bo Jiang et.al. | 2410.22313 | link |
2024-10-29 | CaStL: Constraints as Specifications through LLM Translation for Long-Horizon Task and Motion Planning | Weihang Guo et.al. | 2410.22225 | null |
2024-10-29 | On the Synthesis of Reactive Collision-Free Whole-Body Robot Motions: A Complementarity-based Approach | Haowen Yao et.al. | 2410.22049 | null |
2024-10-29 | Constrained Nonlinear Kaczmarz Projection on Intersections of Manifolds for Coordinated Multi-Robot Mobile Manipulation | Akshaya Agrawal et.al. | 2410.21630 | null |
2024-10-28 | Heterogeneous Interaction Modeling With Reduced Accumulated Error for Multi-Agent Trajectory Prediction | Siyuan Chen et.al. | 2410.21342 | null |
2024-10-28 | Combining Deep Reinforcement Learning with a Jerk-Bounded Trajectory Generator for Kinematically Constrained Motion Planning | Seyed Adel Alizadeh Kolagar et.al. | 2410.20907 | null |
2024-10-27 | Uncertainty-Aware Decision-Making and Planning for Autonomous Forced Merging | Jian Zhou et.al. | 2410.20514 | link |
2024-10-26 | Learning Approximated Maximal Safe Sets via Hypernetworks for MPC-Based Local Motion Planning | Bojan Derajić et.al. | 2410.20267 | null |
2025-02-13 | FRTree Planner: Robot Navigation in Cluttered and Unknown Environments with Tree of Free Regions | Yulin Li et.al. | 2410.20230 | null |
2024-10-25 | Multi-modal Motion Prediction using Temporal Ensembling with Learning-based Aggregation | Kai-Yin Hong et.al. | 2410.19606 | null |
2024-10-25 | PMM-Net: Single-stage Multi-agent Trajectory Prediction with Patching-based Embedding and Explicit Modal Modulation | Huajian Liu et.al. | 2410.19544 | link |
2024-10-28 | Motion Planning for Robotics: A Review for Sampling-based Planners | Liding Zhang et.al. | 2410.19414 | null |
2024-10-24 | SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment | Caelan Garrett et.al. | 2410.18907 | null |
2024-10-31 | Continuous Dynamic Modeling via Neural ODEs for Popularity Trajectory Prediction | Songbo Yang et.al. | 2410.18742 | null |
2024-10-23 | SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation | Zihan Zhou et.al. | 2410.18065 | null |
2024-10-23 | Markov Potential Game with Final-time Reach-Avoid Objectives | Sarah H. Q. Li et.al. | 2410.17690 | null |
2024-10-23 | Generalizable Motion Planning via Operator Learning | Sharath Matada et.al. | 2410.17547 | null |
2024-12-04 | Multimodal LLM Guided Exploration and Active Mapping using Fisher Information | Wen Jiang et.al. | 2410.17422 | null |
2024-10-28 | Non-myopic Generation of Language Models for Reasoning and Planning | Chang Ma et.al. | 2410.17195 | link |
2024-10-22 | Pedestrian motion prediction evaluation for urban autonomous driving | Dmytro Zabolotnii et.al. | 2410.16864 | link |
2024-10-22 | Traj-Explainer: An Explainable and Robust Multi-modal Trajectory Prediction Approach | Pei Liu et.al. | 2410.16795 | null |
2024-10-22 | DiffusionSeeder: Seeding Motion Optimization with Diffusion for Rapid Motion Planning | Huang Huang et.al. | 2410.16727 | null |
2025-01-27 | Automated Planning Domain Inference for Task and Motion Planning | Jinbang Huang et.al. | 2410.16445 | null |
2024-10-24 | LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation | Hao Gao et.al. | 2410.16197 | link |
2024-10-21 | Critical Example Mining for Vehicle Trajectory Prediction using Flow-based Generative Models | Zhezhang Ding et.al. | 2410.16083 | null |
2025-02-06 | Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles | Zhengming Wang et.al. | 2410.15912 | link |
2024-10-21 | LiMTR: Time Series Motion Prediction for Diverse Road Users through Multimodal Feature Integration | Camiel Oerlemans et.al. | 2410.15819 | link |
2024-10-21 | Hierarchical Search-Based Cooperative Motion Planning | Yuchen Wu et.al. | 2410.15710 | link |
2024-10-20 | Lie Theory Based Optimization for Unified State Planning of Mobile Manipulators | William Smith et.al. | 2410.15443 | link |
2024-10-19 | IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning | Vindula Jayawardana et.al. | 2410.15221 | link |
2025-01-30 | MeshDMP: Motion Planning on Discrete Manifolds using Dynamic Movement Primitives | Matteo Dalle Vedove et.al. | 2410.15123 | null |
2024-10-19 | EDRF: Enhanced Driving Risk Field Based on Multimodal Trajectory Prediction and Its Applications | Junkai Jiang et.al. | 2410.14996 | null |
2025-01-09 | CoMAL: Collaborative Multi-Agent Large Language Models for Mixed-Autonomy Traffic | Huaiyuan Yao et.al. | 2410.14368 | link |
2024-09-30 | PC-Planner: Physics-Constrained Self-Supervised Learning for Robust Neural Motion Planning with Shape-Aware Distance Function | Xujie Shen et.al. | 2410.12805 | null |
2024-11-13 | Faster Algorithms for Growing Collision-Free Convex Polytopes in Robot Configuration Space | Peter Werner et.al. | 2410.12649 | null |
2024-10-16 | Fast Online Learning of CLiFF-maps in Changing Environments | Yufei Zhu et.al. | 2410.12237 | null |
2024-10-16 | Trajectory Manifold Optimization for Fast and Adaptive Kinodynamic Motion Planning | Yonghyeon Lee et.al. | 2410.12193 | null |
2024-11-09 | Towards Local Minima-free Robotic Navigation: Model Predictive Path Integral Control via Repulsive Potential Augmentation | Takahiro Fuke et.al. | 2410.11379 | null |
2024-10-15 | Biologically Inspired Swarm Dynamic Target Tracking and Obstacle Avoidance | Lucas Page et.al. | 2410.11237 | null |
2024-10-15 | Motion Planning for Automata-based Objectives using Efficient Gradient-based Methods | Anand Balakrishnan et.al. | 2410.11156 | null |
2024-10-14 | Safety-critical Motion Planning for Collaborative Legged Loco-Manipulation over Discrete Terrain | Mohsen Sombolestan et.al. | 2410.11023 | null |
2024-10-14 | TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction | Qingze et.al. | 2410.10804 | link |
2024-10-14 | Navigation under uncertainty: Trajectory prediction and occlusion reasoning with switching dynamical systems | Ran Wei et.al. | 2410.10653 | null |
2024-10-14 | Feedback Favors the Generalization of Neural ODEs | Jindou Jia et.al. | 2410.10253 | null |
2024-10-13 | Conformalized Reachable Sets for Obstacle Avoidance With Spheres | Yongseok Kwon et.al. | 2410.09924 | null |
2024-10-13 | Physics-informed Neural Mapping and Motion Planning in Unknown Environments | Yuchen Liu et.al. | 2410.09883 | link |
2024-10-13 | Socially Aware Motion Planning for Service Robots Using LiDAR and RGB-D Camera | Duc Phu Nguyen et.al. | 2410.09803 | null |
2024-10-13 | Model Predictive Control for Optimal Motion Planning of Unmanned Aerial Vehicles | Duy-Nam Bui et.al. | 2410.09799 | null |
2024-10-15 | LoRD: Adapting Differentiable Driving Policies to Distribution Shifts | Christopher Diehl et.al. | 2410.09681 | link |
2024-10-12 | DiffuTraj: A Stochastic Vessel Trajectory Prediction Approach via Guided Diffusion Process | Changlin Li et.al. | 2410.09550 | null |
2025-01-17 | ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras | Junkai Niu et.al. | 2410.09374 | link |
2024-10-11 | Motion Planning for Object Manipulation by Edge-Rolling | Maede Boroji et.al. | 2410.09301 | null |
2024-10-11 | Implicit Graph Search for Planning on Graphs of Convex Sets | Ramkumar Natarajan et.al. | 2410.08909 | null |
2024-10-11 | VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model | Beichen Wang et.al. | 2410.08792 | null |
2024-10-11 | SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction | Yang Zhou et.al. | 2410.08669 | link |
2024-12-10 | Snap and Jump: How Elastic Shells Pop Out | Takara Abe et.al. | 2410.08525 | null |
2024-10-10 | CE-MRS: Contrastive Explanations for Multi-Robot Systems | Ethan Schneider et.al. | 2410.08408 | null |
2024-10-10 | Safe and Dynamically-Feasible Motion Planning using Control Lyapunov and Barrier Functions | Pol Mestres et.al. | 2410.08364 | null |
2024-10-10 | Guiding Collision-Free Humanoid Multi-Contact Locomotion using Convex Kinematic Relaxations and Dynamic Optimization | Carlos Gonzalez et.al. | 2410.08335 | null |
2024-10-10 | Dynamic Object Catching with Quadruped Robot Front Legs | André Schakkal et.al. | 2410.08065 | null |
2024-10-10 | Stop-N-Go: Search-based Conflict Resolution for Motion Planning of Multiple Robotic Manipulators | Gidon Han et.al. | 2410.07606 | null |
2024-09-23 | Curb Your Attention: Causal Attention Gating for Robust Trajectory Prediction in Autonomous Driving | Ehsan Ahmadi et.al. | 2410.07191 | null |
2024-10-09 | Online Epsilon Net and Piercing Set for Geometric Concepts | Sujoy Bhore et.al. | 2410.07059 | null |
2024-10-09 | Combining Planning and Diffusion for Mobility with Unknown Dynamics | Yajvan Ravan et.al. | 2410.06911 | null |
2024-10-10 | Reliable Probabilistic Human Trajectory Prediction for Autonomous Applications | Manuel Hetzel et.al. | 2410.06905 | link |
2024-12-17 | Meta-Learning Augmented MPC for Disturbance-Aware Motion Planning and Control of Quadrotors | Dženan Lapandić et.al. | 2410.06325 | null |
2024-10-08 | Suitability Analysis of Ground Motion Prediction Equations for Western and Central Himalayas and Indo-Gangetic Plains | S. Selvan et.al. | 2410.05918 | null |
2024-10-08 | Effort Allocation for Deadline-Aware Task and Motion Planning: A Metareasoning Approach | Yoonchang Sung et.al. | 2410.05828 | null |
2024-10-08 | GlucoBench: Curated List of Continuous Glucose Monitoring Datasets with Prediction Benchmarks | Renat Sergazinov et.al. | 2410.05780 | link |
2024-10-08 | Noncrossing Longest Paths and Cycles | Greg Aloupis et.al. | 2410.05580 | null |
2024-10-07 | MultiNash-PF: A Particle Filtering Approach for Computing Multiple Local Generalized Nash Equilibria in Trajectory Games | Maulik Bhatt et.al. | 2410.05554 | null |
2024-10-07 | State Estimation of Marine Vessels Affected by Waves by Unmanned Aerial Vehicles | Filip Novák et.al. | 2410.05186 | null |
2024-11-28 | Predictive Spliner: Data-Driven Overtaking in Autonomous Racing Using Opponent Trajectory Prediction | Nicolas Baumann et.al. | 2410.04868 | link |
2024-10-07 | Data-driven Diffusion Models for Enhancing Safety in Autonomous Vehicle Traffic Simulations | Jinxiong Lu et.al. | 2410.04809 | null |
2024-10-05 | Fast Object Detection with a Machine Learning Edge Device | Richard C. Rodriguez et.al. | 2410.04173 | null |
2024-10-04 | Improving Efficiency of Sampling-based Motion Planning via Message-Passing Monte Carlo | Makram Chahine et.al. | 2410.03909 | null |
2024-10-04 | MDMP: Multi-modal Diffusion for supervised Motion Predictions with uncertainty | Leo Bringer et.al. | 2410.03860 | link |
2024-10-04 | CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control | Guy Tevet et.al. | 2410.03441 | link |
2024-10-04 | SPHINX: Structural Prediction using Hypergraph Inference Network | Iulia Duta et.al. | 2410.03208 | null |
2024-10-04 | Multi-Robot Motion Planning with Diffusion Models | Yorai Shaoul et.al. | 2410.03072 | link |
2024-10-03 | A Schema-aware Logic Reformulation for Graph Reachability | Davide Di Pierro et.al. | 2410.02533 | null |
2024-10-03 | Remember and Recall: Associative-Memory-based Trajectory Prediction | Hang Guo et.al. | 2410.02201 | null |
2024-10-03 | Guiding Long-Horizon Task and Motion Planning with Vision Language Models | Zhutian Yang et.al. | 2410.02193 | null |
2024-10-07 | Entropy-Based Uncertainty Modeling for Trajectory Prediction in Autonomous Driving | Aron Distelzweig et.al. | 2410.01628 | null |
2024-10-02 | MARLens: Understanding Multi-agent Reinforcement Learning for Traffic Signal Control via Visual Analytics | Yutian Zhang et.al. | 2410.01364 | null |
2024-10-02 | Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy | Ricardo Garcia et.al. | 2410.01345 | link |
2024-11-12 | Towards Efficient Motion Planning for UAVs: Lazy A* Search with Motion Primitives | Wentao Wang et.al. | 2410.01230 | null |
2024-10-01 | Collaborative motion planning for multi-manipulator systems through Reinforcement Learning and Dynamic Movement Primitives | Siddharth Singh et.al. | 2410.00757 | null |
2024-10-01 | LASMP: Language Aided Subset Sampling Based Motion Planner | Saswati Bhattacharjee et.al. | 2410.00649 | link |
2024-10-01 | ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI | Stone Tao et.al. | 2410.00425 | link |
2024-10-01 | AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation | Jiafei Duan et.al. | 2410.00371 | null |
2024-10-01 | RRT-CBF Based Motion Planning | Leonas Liu et.al. | 2410.00343 | null |
2024-11-16 | Sectional category with respect to group actions and sequential topological complexity of fibre bundles | Ramandeep Singh Arora et.al. | 2410.00139 | null |
2024-09-30 | Online identification of skidding modes with interactive multiple model estimation | Ameya Salvi et.al. | 2409.20554 | null |
2024-09-30 | COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models | Divyanshu Daiya et.al. | 2409.20502 | null |
2024-09-30 | HEADS-UP: Head-Mounted Egocentric Dataset for Trajectory Prediction in Blind Assistance Systems | Yasaman Haghighi et.al. | 2409.20324 | null |
2024-09-30 | Self-Assessment of Evidential Grid Map Fusion for Robust Motion Planning | Oliver Schumann et.al. | 2409.20286 | null |
2024-09-29 | Generalizability of Graph Neural Networks for Decentralized Unlabeled Motion Planning | Shreyas Muthusamy et.al. | 2409.19829 | null |
2024-09-29 | Learning Wheelchair Tennis Navigation from Broadcast Videos with Domain Knowledge Transfer and Diffusion Motion Planning | Zixuan Wu et.al. | 2409.19771 | null |
2024-09-29 | Fine-Tuning Hybrid Physics-Informed Neural Networks for Vehicle Dynamics Model Estimation | Shiming Fang et.al. | 2409.19647 | null |
2024-09-29 | BadHMP: Backdoor Attack against Human Motion Prediction | Chaohui Xu et.al. | 2409.19638 | null |
2024-09-29 | Multi-Query Shortest-Path Problem in Graphs of Convex Sets | Savva Morozov et.al. | 2409.19543 | null |
2024-09-28 | Robot Guided Evacuation with Viewpoint Constraints | Gong Chen et.al. | 2409.19466 | null |
2024-09-28 | How connected cars could capture cloud dynamics – first evidence from two simulation scenarios | Tobias Veihelmann et.al. | 2409.19351 | null |
2024-09-27 | Optimization-based Task and Motion Planning under Signal Temporal Logic Specifications using Logic Network Flow | Xuan Lin et.al. | 2409.19168 | null |
2024-09-27 | S-RRT*-based Obstacle Avoidance Autonomous Motion Planner for Continuum-rigid Manipulator | Yulin Li et.al. | 2409.19110 | null |
2024-12-01 | Towards Super-Nominal Payload Handling: Inverse Dynamics Analysis for Multi-Skill Robotic Manipulation | Anuj Pasricha et.al. | 2409.18939 | null |
2024-09-27 | S2O: Static to Openable Enhancement for Articulated 3D Objects | Denys Iliash et.al. | 2409.18896 | null |
2024-09-27 | Pseudo-kinematic trajectory control of tracked vehicles | Michele Focchi et.al. | 2409.18641 | null |
2024-09-27 | Multimodal Trajectory Prediction for Autonomous Driving on Unstructured Roads using Deep Convolutional Network | Lei Li et.al. | 2409.18399 | null |
2024-09-26 | Learning to Drive via Asymmetric Self-Play | Chris Zhang et.al. | 2409.18218 | null |
2024-09-26 | RT-GuIDE: Real-Time Gaussian splatting for Information-Driven Exploration | Yuezhan Tao et.al. | 2409.18122 | null |
2024-09-26 | GSON: A Group-based Social Navigation Framework with Large Multimodal Model | Shangyi Luo et.al. | 2409.18084 | null |
2024-09-26 | CASPFormer: Trajectory Prediction from BEV Images with Deformable Attention | Harsh Yadav et.al. | 2409.17790 | null |
2024-09-26 | Computation Pre-Offloading for MEC-Enabled Vehicular Networks via Trajectory Prediction | Ting Zhang et.al. | 2409.17681 | null |
2024-09-25 | Data-efficient Trajectory Prediction via Coreset Selection | Ruining Yang et.al. | 2409.17385 | null |
2024-09-25 | Building Real-time Awareness of Out-of-distribution in Trajectory Prediction for Autonomous Vehicles | Tongfei et.al. | 2409.17277 | null |
2024-09-25 | Blox-Net: Generative Design-for-Robot-Assembly Using VLM Supervision, Physics Simulation, and a Robot with Reset | Andrew Goldberg et.al. | 2409.17126 | null |
2024-09-25 | Communication Backbone Reconfiguration with Connectivity Maintenance | Leonardo Santos et.al. | 2409.16851 | null |
2024-09-25 | Dashing for the Golden Snitch: Multi-Drone Time-Optimal Motion Planning with Multi-Agent Reinforcement Learning | Xian Wang et.al. | 2409.16720 | link |
2024-09-25 | Stochastic Shortest Path Problem with Failure Probability | Ritsusamuel Otsubo et.al. | 2409.16672 | null |
2024-09-24 | Bound-preserving OEDG schemes for Aw-Rascle-Zhang traffic models on networks | Wei Chen et.al. | 2409.16269 | null |
2024-09-25 | Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed | Alexander Prutsch et.al. | 2409.16154 | link |
2024-09-24 | PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation | Mingyo Seo et.al. | 2409.16012 | null |
2024-09-24 | Intention-based and Risk-Aware Trajectory Prediction for Autonomous Driving in Complex Traffic Scenarios | Wen Wei et.al. | 2409.15821 | null |
2024-09-27 | Diffusion Models for Intelligent Transportation Systems: A Survey | Mingxing Peng et.al. | 2409.15816 | null |
2024-09-23 | XMoP: Whole-Body Control Policy for Zero-shot Cross-Embodiment Neural Motion Planning | Prabin Kumar Rath et.al. | 2409.15585 | null |
2024-09-10 | Ultrafast vision perception by neuromorphic optical flow | Shengbo Wang et.al. | 2409.15345 | null |
2024-09-23 | Enhancing Pedestrian Trajectory Prediction with Crowd Trip Information | Rei Tamaru et.al. | 2409.15224 | link |
2024-09-25 | Goal-based Neural Physics Vehicle Trajectory Prediction Model | Rui Gan et.al. | 2409.15182 | null |
2024-09-23 | Terrain-Aware Model Predictive Control of Heterogeneous Bipedal and Aerial Robot Coordination for Search and Rescue Tasks | Abdulaziz Shamsah et.al. | 2409.15174 | null |
2024-09-23 | Controllable Traffic Simulation through LLM-Guided Hierarchical Chain-of-Thought Reasoning | Zhiyuan Liu et.al. | 2409.15135 | null |
2024-09-23 | SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory Prediction | Conghao Wong et.al. | 2409.14984 | link |
2024-09-23 | Kinodynamic Motion Planning for Collaborative Object Transportation by Multiple Mobile Manipulators | Keshab Patra et.al. | 2409.14910 | null |
2024-09-23 | Automatic Geometric Decomposition for Analytical Inverse Kinematics | Daniel Ostermeier et.al. | 2409.14815 | null |
2024-09-22 | TrackNetV4: Enhancing Fast Sports Object Tracking with Motion Attention Maps | Arjun Raj et.al. | 2409.14543 | null |
2024-09-21 | Adversarial and Reactive Traffic Agents for Realistic Driving Simulation | Joshua Ransiek et.al. | 2409.14196 | null |
2024-09-20 | Neural Configuration Distance Function for Continuum Robot Control | Kehan Long et.al. | 2409.13865 | link |
2024-09-20 | Remote Interactions between tropical cyclones: The case of Hurricane Michael and Leslie’s high predictability uncertainty | Mauricio López-Reyes et.al. | 2409.13839 | null |
2024-09-20 | Key-Scan-Based Mobile Robot Navigation: Integrated Mapping, Planning, and Control using Graphs of Scan Regions | Dharshan Bashkaran Latha et.al. | 2409.13838 | null |
2024-09-20 | OLiVia-Nav: An Online Lifelong Vision Language Approach for Mobile Robot Social Navigation | Siddarth Narasimhan et.al. | 2409.13675 | null |
2024-09-20 | From Cognition to Precognition: A Future-Aware Framework for Social Navigation | Zeying Gong et.al. | 2409.13244 | link |
2024-09-19 | Fast End-to-End Generation of Belief Space Paths for Minimum Sensing Navigation | Lukas Taus et.al. | 2409.12902 | null |
2024-09-19 | Towards adaptive trajectories for mixed autonomous and human-operated ships | Danilo Pianini et.al. | 2409.12714 | null |
2024-09-19 | Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction | Sibo Tian et.al. | 2409.12456 | null |
2024-09-18 | C-Uniform Trajectory Sampling For Fast Motion Planning | O. Goktug Poyrazoglu et.al. | 2409.12266 | null |
2024-10-12 | Bootstrapping Object-level Planning with Large Language Models | David Paulius et.al. | 2409.12262 | link |
2024-10-14 | ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Han Ling et.al. | 2409.12202 | link |
2024-09-18 | Real-Time-Feasible Collision-Free Motion Planning For Ellipsoidal Objects | Yunfan Gao et.al. | 2409.12007 | null |
2024-09-18 | LMMCoDrive: Cooperative Driving with Large Multimodal Model | Haichao Liu et.al. | 2409.11981 | link |
2024-09-18 | XP-MARL: Auxiliary Prioritization in Multi-Agent Reinforcement Learning to Address Non-Stationarity | Jianye Xu et.al. | 2409.11852 | link |
2024-09-18 | RMP-YOLO: A Robust Motion Predictor for Partially Observable Scenarios even if You Only Look Once | Jiawei Sun et.al. | 2409.11696 | null |
2024-09-18 | Hypergraph-based Motion Generation with Multi-modal Interaction Relational Reasoning | Keshu Wu et.al. | 2409.11676 | null |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-17 | Optimization of Rulebooks via Asymptotically Representing Lexicographic Hierarchies for Autonomous Vehicles | Matteo Penlington et.al. | 2409.11199 | null |
2024-09-18 | Annealed Winner-Takes-All for Motion Forecasting | Yihong Xu et.al. | 2409.11172 | link |
2024-10-21 | Uncovering the Secrets of Human-Like Movement: A Fresh Perspective on Motion Planning | Lei Shi et.al. | 2409.10747 | null |
2024-09-16 | Safe Interval Motion Planning for Quadrotors in Dynamic Environments | Songhao Huang et.al. | 2409.10647 | null |
2024-09-20 | Motion Forecasting via Model-Based Risk Minimization | Aron Distelzweig et.al. | 2409.10585 | null |
2024-09-16 | Optimal Geodesic Curvature Constrained Dubins’ Path on Sphere with Free Terminal Orientation | Deepak Prakash Kumar et.al. | 2409.10363 | null |
2024-09-16 | Digital Twins Meet the Koopman Operator: Data-Driven Learning for Robust Autonomy | Chinmay Vilas Samak et.al. | 2409.10347 | null |
2024-09-16 | Safety-critical Locomotion of Biped Robots in Infeasible Paths: Overcoming Obstacles during Navigation toward Destination | Jaemin Lee et.al. | 2409.10274 | null |
2024-09-16 | Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles | Mais Jamal et.al. | 2409.10165 | null |
2024-09-16 | Embodiment-Agnostic Action Planning via Object-Part Scene Flow | Weiliang Tang et.al. | 2409.10032 | null |
2024-10-07 | ViewActive: Active viewpoint optimization from a single image | Jiayi Wu et.al. | 2409.09997 | link |
2024-09-16 | Generalization of Optimal Geodesic Curvature Constrained Dubins’ Path on Sphere with Free Terminal Orientation | Deepak Prakash Kumar et.al. | 2409.09954 | null |
2024-09-15 | Fast Shortest Path Polyline Smoothing With G1 Continuity and Bounded Curvature | Patrick Pastorelli et.al. | 2409.09816 | null |
2024-11-26 | DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Self-Driving | Haisheng Su et.al. | 2409.09777 | link |
2024-08-29 | ChatSUMO: Large Language Model for Automating Traffic Scenario Generation in Simulation of Urban MObility | Shuyang Li et.al. | 2409.09040 | null |
2024-09-22 | Agile Decision-Making and Safety-Critical Motion Planning for Emergency Autonomous Vehicles | Yiming Shu et.al. | 2409.08665 | null |
2024-09-12 | Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos | Yuheng Jiang et.al. | 2409.08353 | null |
2024-09-12 | Graph Inspection for Robotic Motion Planning: Do Arithmetic Circuits Help? | Matthias Bentert et.al. | 2409.08219 | link |
2024-09-10 | PRO-MIND: Proximity and Reactivity Optimisation of robot Motion to tune safety limits, human stress, and productivity in INDustrial settings | Marta Lagomarsino et.al. | 2409.06864 | null |
2024-11-01 | Asymptotically Optimal Lazy Lifelong Sampling-based Algorithm for Efficient Motion Planning in Dynamic Environments | Lu Huang et.al. | 2409.06521 | null |
2024-09-10 | Coordinated Motion Planning: Multi-Agent Path Finding in a Densely Packed, Bounded Domain | Sándor P. Fekete et.al. | 2409.06486 | null |
2024-09-10 | Human Impedance Modulation to Improve Visuo-Haptic Perception | Xiaoxiao Cheng et.al. | 2409.06124 | null |
2024-09-09 | Reduced-order modeling for complex 3D seismic wave propagation | John M. Rekoske et.al. | 2409.06102 | null |
2024-09-09 | Neural MP: A Generalist Neural Motion Planner | Murtaza Dalal et.al. | 2409.05864 | null |
2024-09-09 | Promptable Closed-loop Traffic Simulation | Shuhan Tan et.al. | 2409.05863 | null |
2024-09-09 | Interpretable Responsibility Sharing as a Heuristic for Task and Motion Planning | Arda Sarp Yenicesu et.al. | 2409.05586 | link |
2024-09-09 | Adaptive Visual Servoing for On-Orbit Servicing | Farhad Aghili et.al. | 2409.05295 | null |
2024-09-09 | Path-Parameterised RRTs for Underactuated Systems | Damian Abood et.al. | 2409.05278 | null |
2024-09-06 | Synergy and Synchrony in Couple Dances | Vongani Maluleke et.al. | 2409.04440 | null |
2024-09-09 | Online Residual Learning from Offline Experts for Pedestrian Tracking | Anastasios Vlachos et.al. | 2409.04069 | null |
2024-09-05 | KiloBot: A Programming Language for Deploying Perception-Guided Industrial Manipulators at Scale | Wei Gao et.al. | 2409.03439 | null |
2024-09-05 | OccLLaMA: An Occupancy-Language-Action Generative World Model for Autonomous Driving | Julong Wei et.al. | 2409.03272 | null |
2024-09-04 | Improved Single Camera BEV Perception Using Multi-Camera Training | Daniel Busch et.al. | 2409.02676 | null |
2024-09-04 | MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos | Junyi Ma et.al. | 2409.02638 | null |
2024-09-25 | Mamba as a motion encoder for robotic imitation learning | Toshiaki Tsuji et.al. | 2409.02636 | null |
2024-09-04 | eRSS-RAMP: A Rule-Adherence Motion Planner Based on Extended Responsibility-Sensitive Safety for Autonomous Driving | Pengfei Lin et.al. | 2409.02503 | null |
2024-09-03 | Snapshot: Towards Application-centered Models for Pedestrian Trajectory Prediction in Urban Traffic Environments | Nico Uhlemann et.al. | 2409.01971 | link |
2024-09-02 | Direct Kinematics, Inverse Kinematics, and Motion Planning of 1-DoF Rational Linkages | Daniel Huczala et.al. | 2409.01198 | null |
2024-09-02 | Multi-scale Temporal Fusion Transformer for Incomplete Vehicle Trajectory Prediction | Zhanwen Liu et.al. | 2409.00904 | null |
2024-09-01 | Automated Cinematography Motion Planning for UAVs | Animesh Nema et.al. | 2409.00864 | null |
2024-09-01 | SITUATE: Indoor Human Trajectory Prediction through Geometric Features and Self-Supervised Vision Representation | Luigi Capogrosso et.al. | 2409.00774 | link |
2024-09-01 | Roundabout Dilemma Zone Data Mining and Forecasting with Trajectory Prediction and Graph Neural Networks | Manthan Chelenahalli Satish et.al. | 2409.00622 | null |
2024-09-04 | GenAI-powered Multi-Agent Paradigm for Smart Urban Mobility: Opportunities and Challenges for Integrating Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) with Intelligent Transportation Systems | Haowen Xu et.al. | 2409.00494 | null |
2024-08-18 | Digital Twin-Empowered Routing Management for Reliable Multi-Hop Millimeter Wave V2X | Supat Roongpraiwan et.al. | 2409.00040 | null |
2024-08-16 | DivDiff: A Conditional Diffusion Model for Diverse Human Motion Prediction | Hua Yu et.al. | 2409.00014 | null |
2024-10-15 | Explicit Contact Optimization in Whole-Body Contact-Rich Manipulation | Victor Leve et.al. | 2408.15726 | null |
2024-08-28 | CAPER: Enhancing Career Trajectory Prediction using Temporal Knowledge Graph and Ternary Relationship | Yeon-Chang Lee et.al. | 2408.15620 | link |
2024-10-21 | TrafficGamer: Reliable and Flexible Traffic Simulation for Safety-Critical Scenarios with Game-Theoretic Oracles | Guanren Qiao et.al. | 2408.15538 | link |
2024-08-27 | Fast and Modular Autonomy Software for Autonomous Racing Vehicles | Andrew Saba et.al. | 2408.15425 | null |
2024-08-09 | Pedestrian Motion Prediction Using Transformer-based Behavior Clustering and Data-Driven Reachability Analysis | Kleio Fragkedaki et.al. | 2408.15250 | null |
2024-08-26 | Model Predictive Parkour Control of a Monoped Hopper in Dynamically Changing Environments | Maximilian Albracht et.al. | 2408.14362 | link |
2024-08-24 | Evaluating the Robustness of LiDAR-based 3D Obstacles Detection and Its Impacts on Autonomous Driving Systems | Tri Minh Triet Pham et.al. | 2408.13653 | null |
2024-08-23 | Safe Bubble Cover for Motion Planning on Distance Fields | Ki Myung Brian Lee et.al. | 2408.13377 | null |
2024-08-23 | SIMPNet: Spatial-Informed Motion Planning Network | Davood Soleymanzadeh et.al. | 2408.12831 | null |
2024-08-22 | Probabilistic Homotopy Optimization for Dynamic Motion Planning | Shayan Pardis et.al. | 2408.12490 | null |
2024-09-18 | Dynamic PDB: A New Dataset and a SE(3) Model Extension by Integrating Dynamic Behaviors and Physical Properties in Protein Structures | Ce Liu et.al. | 2408.12413 | null |
2024-08-17 | Automated and Connected Driving: State-of-the-Art and Implications for Future Scenario Analysis | Edgar Jungblut et.al. | 2408.11864 | null |
2024-08-28 | ViIK: Flow-based Vision Inverse Kinematics Solver with Fusing Collision Checking | Qinglong Meng et.al. | 2408.11293 | link |
2024-08-20 | Target-Oriented Object Grasping via Multimodal Human Guidance | Pengwei Xie et.al. | 2408.11138 | null |
2024-08-20 | Towards reliable real-time trajectory optimization | Fatemeh Rastgar et.al. | 2408.10731 | null |
2024-08-19 | Edge-Cloud Collaborative Motion Planning for Autonomous Driving with Large Language Models | Jiao Chen et.al. | 2408.09972 | null |
2024-08-31 | Harnessing the Potential of Omnidirectional Multi-Rotor Aerial Vehicles in Cooperative Jamming Against Eavesdropping | Daniel Bonilla Licea et.al. | 2408.09753 | null |
2024-08-18 | SynTraC: A Synthetic Dataset for Traffic Signal Control from Traffic Monitoring Cameras | Tiejin Chen et.al. | 2408.09588 | link |
2024-08-17 | MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model | Changcheng Xiao et.al. | 2408.09178 | null |
2024-10-01 | Towards Practical Human Motion Prediction with LiDAR Point Clouds | Xiao Han et.al. | 2408.08202 | null |
2024-08-15 | Autonomous on-Demand Shuttles for First Mile-Last Mile Connectivity: Design, Optimization, and Impact Assessment | Sudipta Roy et.al. | 2408.07872 | null |
2024-08-14 | From Decision to Action in Surgical Autonomy: Multi-Modal Large Language Models for Robot-Assisted Blood Suction | Sadra Zargarzadeh et.al. | 2408.07806 | null |
2024-08-14 | SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning | Jianye Xu et.al. | 2408.07644 | link |
2024-08-14 | The Design of Autonomous UAV Prototypes for Inspecting Tunnel Construction Environment | Yiping Dong et.al. | 2408.07286 | null |
2024-08-13 | Learn2Decompose: Learning Problem Decomposition for Efficient Task and Motion Planning | Yan Zhang et.al. | 2408.06843 | null |
2024-08-13 | A hybrid neural network for real-time OD demand calibration under disruptions | Takao Dantsuji et.al. | 2408.06659 | null |
2024-08-12 | Motion Planning for Minimally Actuated Serial Robots | Avi Cohen et.al. | 2408.06143 | null |
2024-08-12 | Developing Smart MAVs for Autonomous Inspection in GPS-denied Constructions | Paoqiang Pan et.al. | 2408.06030 | null |
2024-08-11 | A Discrete Topological Complexity of Discrete Motion Planning | Hadi Hassanzada et.al. | 2408.05858 | null |
2024-08-11 | A Meta-Engine Framework for Interleaved Task and Motion Planning using Topological Refinements | Elisa Tosello et.al. | 2408.05795 | null |
2024-08-09 | Towards Intelligent Cooperative Robotics in Additive Manufacturing: Past, Present and Future | Sean Rescsanski et.al. | 2408.04827 | null |
2024-08-08 | Open-Source Software Architecture for Multi-Robot Wire Arc Additive Manufacturing (WAAM) | Honglu He et.al. | 2408.04677 | null |
2024-10-17 | PLANRL: A Motion Planning and Imitation Learning Framework to Bootstrap Reinforcement Learning | Amisha Bhaskar et.al. | 2408.04054 | null |
2024-08-07 | Improving the Intelligent Driver Model by Incorporating Vehicle Dynamics: Microscopic Calibration and Macroscopic Validation | Dominik Salles et.al. | 2408.03722 | link |
2024-08-14 | DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba | Chengran Yuan et.al. | 2408.03601 | null |
2024-08-04 | Past Movements-Guided Motion Representation Learning for Human Motion Prediction | Junyu Shi et.al. | 2408.02091 | null |
2024-08-04 | Individualized multi-horizon MRI trajectory prediction for Alzheimer’s Disease | Rosemary He et.al. | 2408.02018 | link |
2024-08-03 | LF-3PM: a LiDAR-based Framework for Perception-aware Planning with Perturbation-induced Metric | Kaixin Chai et.al. | 2408.01649 | null |
2024-08-02 | CommonUppRoad: A Framework of Formal Modelling, Verifying, Learning, and Visualisation of Autonomous Vehicles | Rong Gu et.al. | 2408.01093 | null |
2024-08-01 | Data-Driven Traffic Simulation for an Intersection in a Metropolis | Chengbo Zang et.al. | 2408.00943 | null |
2024-08-01 | Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang et.al. | 2408.00766 | null |
2024-08-01 | DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving | Xuemeng Yang et.al. | 2408.00415 | null |
2024-08-02 | Conformal Trajectory Prediction with Multi-View Data Integration in Cooperative Driving | Xi Chen et.al. | 2408.00374 | link |
2024-08-06 | A Reinforcement Learning Based Motion Planner for Quadrotor Autonomous Flight in Dense Environment | Zhaohong Liu et.al. | 2408.00275 | link |
2024-07-31 | MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Seongju Lee et.al. | 2407.21635 | link |
2024-08-01 | Analysis of Functional Insufficiencies and Triggering Conditions to Improve the SOTIF of an MPC-based Trajectory Planner | Mirko Conrad et.al. | 2407.21569 | null |
2024-08-02 | Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation | Sudhir Yarram et.al. | 2407.21450 | null |
2024-08-02 | MSMA: Multi-agent Trajectory Prediction in Connected and Autonomous Vehicle Environment with Multi-source Data Integration | Xi Chen et.al. | 2407.21310 | link |
2024-07-31 | DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations | Dongwon Son et.al. | 2407.21267 | null |
2024-07-30 | Zero Shot Health Trajectory Prediction Using Transformer | Pawel Renc et.al. | 2407.21124 | link |
2024-07-28 | Forecast-PEFT: Parameter-Efficient Fine-Tuning for Pre-trained Motion Forecasting Models | Jifeng Wang et.al. | 2407.19564 | link |
2024-07-28 | Reputation-Driven Asynchronous Federated Learning for Enhanced Trajectory Prediction with Blockchain | Weiliang Chen et.al. | 2407.19428 | null |
2024-07-26 | Addressing Behavior Model Inaccuracies for Safe Motion Control in Uncertain Dynamic Environments | Minjun Sung et.al. | 2407.19071 | null |
2024-07-26 | Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation | Aditya Penumarti et.al. | 2407.19046 | null |
2024-07-26 | Evaluating Human Trajectory Prediction with Metamorphic Testing | Helge Spieker et.al. | 2407.18756 | null |
2024-08-04 | PP-TIL: Personalized Planning for Autonomous Driving with Instance-based Transfer Imitation Learning | Fangze Lin et.al. | 2407.18569 | link |
2024-07-29 | Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network | Guipeng Xin et.al. | 2407.18551 | link |
2024-07-25 | Optimal Control using Composite Bernstein Approximants | Gage MacLin et.al. | 2407.18081 | null |
2024-07-17 | Driving pattern interpretation based on action phases clustering | Xue Yao et.al. | 2407.17518 | null |
2024-09-16 | Towards Practical Finite Sample Bounds for Motion Planning in TAMP | Seiji Shaw et.al. | 2407.17394 | null |
2024-07-24 | Context-aware Multi-task Learning for Pedestrian Intent and Trajectory Prediction | Farzeen Munir et.al. | 2407.17162 | link |
2024-07-24 | AI-Gadget Kit: Integrating Swarm User Interfaces with LLM-driven Agents for Rich Tabletop Game Applications | Yijie Guo et.al. | 2407.17086 | null |
2024-08-20 | Topology-Guided ORCA: Smooth Multi-Agent Motion Planning in Constrained Environments | Fatemeh Cheraghi Pouria et.al. | 2407.16771 | null |
2024-10-01 | Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles | Seamie Hayes et.al. | 2407.16636 | null |
2024-07-23 | Cross Anything: General Quadruped Robot Navigation through Complex Terrains | Shaoting Zhu et.al. | 2407.16412 | null |
2024-07-23 | Flatness-based motion planning for a non-uniform moving cantilever Euler-Bernoulli beam with a tip-mass | Soham Chatterjee et.al. | 2407.16195 | null |
2024-07-22 | Flow-guided Motion Prediction with Semantics and Dynamic Occupancy Grid Maps | Rabbia Asghar et.al. | 2407.15675 | null |
2024-07-18 | Anticipatory Task and Motion Planning | Roshan Dhakal et.al. | 2407.13694 | null |
2024-07-18 | Risk-Aware Vehicle Trajectory Prediction Under Safety-Critical Scenarios | Qingfan Wang et.al. | 2407.13480 | null |
2024-08-26 | Improving Out-of-Distribution Generalization of Trajectory Prediction for Autonomous Driving via Polynomial Representations | Yue Yao et.al. | 2407.13431 | link |
2024-07-17 | Trajectory Planning Using Tire Thermodynamics for Automated Drifting | Takao Kobayashi et.al. | 2407.12989 | null |
2024-07-17 | Self-Adaptive Robust Motion Planning for High DoF Robot Manipulator using Deep MPC | Ye Zhang et.al. | 2407.12887 | null |
2024-07-17 | VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions | Seokha Moon et.al. | 2407.12345 | null |
2024-07-16 | $α$ -SGHN: A Robust Model for Learning Particle Interactions in Lattice Systems | Yixian Gao et.al. | 2407.11684 | null |
2024-07-16 | Progressive Pretext Task Learning for Human Trajectory Prediction | Xiaotong Lin et.al. | 2407.11588 | link |
2024-07-16 | Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction | Guowei Xu et.al. | 2407.11494 | link |
2024-07-16 | Multi-Goal Motion Memory | Yuanjie Lu et.al. | 2407.11399 | null |
2024-07-19 | A Survey of Distance-Based Vessel Trajectory Clustering: Data Pre-processing, Methodologies, Applications, and Experimental Evaluation | Maohan Liang et.al. | 2407.11084 | null |
2024-07-15 | Risk-aware Trajectory Prediction by Incorporating Spatio-temporal Traffic Interaction Analysis | Divya Thuremella et.al. | 2407.10639 | link |
2024-07-15 | Communication- and Computation-Efficient Distributed Decision-Making in Multi-Robot Networks | Zirui Xu et.al. | 2407.10382 | link |
2024-10-16 | ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Han Ling et.al. | 2407.09797 | link |
2024-07-12 | Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting | Jinning Li et.al. | 2407.09475 | null |
2024-07-12 | TRAVERSE: Traffic-Responsive Autonomous Vehicle Experience & Rare-event Simulation for Enhanced safety | Sandeep Thalapanane et.al. | 2407.09466 | null |
2024-07-12 | Fast and Accurate Multi-Agent Trajectory Prediction For Crowded Unknown Scenes | Xiuye Tao et.al. | 2407.09068 | null |
2024-09-28 | Deep Attention Driven Reinforcement Learning (DAD-RL) for Autonomous Decision-Making in Dynamic Environment | Jayabrata Chowdhury et.al. | 2407.08932 | link |
2024-09-18 | GCS*: Forward Heuristic Search on Implicit Graphs of Convex Sets | Shao Yuan Chew Chia et.al. | 2407.08848 | null |
2024-07-10 | Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors | Lei Cheng et.al. | 2407.08049 | null |
2024-07-10 | Field Deployment of Multi-Agent Reinforcement Learning Based Variable Speed Limit Controllers | Yuhang Zhang et.al. | 2407.08021 | null |
2024-07-10 | CATP: Context-Aware Trajectory Prediction with Competition Symbiosis | Jiang Wu et.al. | 2407.07328 | null |
2024-07-09 | Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction | Haicheng Liao et.al. | 2407.07020 | null |
2024-07-17 | Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation | Detian Chu et.al. | 2407.06317 | null |
2024-07-08 | Potential Based Diffusion Motion Planning | Yunhao Luo et.al. | 2407.06169 | null |
2024-08-20 | Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation | Jiaqi Chen et.al. | 2407.05890 | null |
2024-10-01 | MapsTP: HD Map Images Based Multimodal Trajectory Prediction for Automated Vehicles | Sushil Sharma et.al. | 2407.05811 | null |
2024-07-18 | BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space | Yumeng Zhang et.al. | 2407.05679 | link |
2024-07-08 | MSTF: Multiscale Transformer for Incomplete Trajectory Prediction | Zhanwen Liu et.al. | 2407.05671 | null |
2024-07-07 | Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization | Liam Schramm et.al. | 2407.05511 | null |
2024-07-07 | Rethinking Closed-loop Planning Framework for Imitation-based Model Integrating Prediction and Planning | Jiayu Guo et.al. | 2407.05376 | null |
2024-07-06 | Toward Precise Robotic Weed Flaming Using a Mobile Manipulator with a Flamethrower | Di Wang et.al. | 2407.04929 | null |
2024-07-05 | WOMD-Reasoning: A Large-Scale Language Dataset for Interaction and Driving Intentions Reasoning | Yiheng Li et.al. | 2407.04281 | link |
2024-07-03 | Online Time-Informed Kinodynamic Motion Planning of Nonlinear Systems | Fei Meng et.al. | 2407.02933 | link |
2024-07-03 | Solving Motion Planning Tasks with a Scalable Generative Model | Yihan Hu et.al. | 2407.02797 | link |
2024-06-25 | Performance Comparison of Deep RL Algorithms for Mixed Traffic Cooperative Lane-Changing | Xue Yao et.al. | 2407.02521 | null |
2024-06-11 | Impact of an Autonomous Shuttle Service on Urban Road Capacity: Experiments by Microscopic Traffic Simulation | Sudipta Roy et.al. | 2407.02502 | null |
2024-04-15 | Malleable Robots: Reconfigurable Robotic Arms with Continuum Links of Variable Stiffness | Angus B. Clark et.al. | 2407.02374 | null |
2024-07-02 | ReliaAvatar: A Robust Real-Time Avatar Animator with Integrated Motion Prediction | Bo Qian et.al. | 2407.02129 | null |
2024-09-16 | Universal Plans: One Action Sequence to Solve Them All! | Kalle G. Timperi et.al. | 2407.02090 | null |
2024-07-02 | Cloud-Edge-Terminal Collaborative AIGC for Autonomous Driving | Jianan Zhang et.al. | 2407.01956 | null |
2024-07-01 | Active Human Pose Estimation via an Autonomous UAV Agent | Jingxi Chen et.al. | 2407.01811 | null |
2024-06-24 | neuROSym: Deployment and Evaluation of a ROS-based Neuro-Symbolic Model for Human Motion Prediction | Sariah Mghames et.al. | 2407.01593 | null |
2024-05-28 | Model-Based Diffusion for Trajectory Optimization | Chaoyi Pan et.al. | 2407.01573 | null |
2024-07-01 | HGNET: A Hierarchical Feature Guided Network for Occupancy Flow Field Prediction | Zhan Chen et.al. | 2407.01097 | null |
2024-07-01 | Data on the Move: Traffic-Oriented Data Trading Platform Powered by AI Agent with Common Sense | Yi Yu et.al. | 2407.00995 | null |
2024-07-01 | Locomotion as Manipulation with ReachBot | Tony G. Chen et.al. | 2407.00973 | null |
2024-06-30 | Engineering an Efficient Object Tracker for Non-Linear Motion | Momir Adžemović et.al. | 2407.00738 | null |
2024-06-30 | OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration | Fengyuan Yang et.al. | 2407.00574 | null |
2024-10-15 | Divide And Conquer: Learning Chaotic Dynamical Systems With Multistep Penalty Neural Ordinary Differential Equations | Dibyajyoti Chakraborty et.al. | 2407.00568 | null |
2024-06-28 | SPITE: Simple Polyhedral Intersection Techniques for modified Environments | Stav Ashur et.al. | 2407.00259 | null |
2024-06-28 | Koopman based trajectory model and computation offloading for high mobility paradigm in ISAC enabled IoT system | Minh-Tuan Tran et.al. | 2406.19871 | null |
2024-06-28 | FootBots: A Transformer-based Architecture for Motion Prediction in Soccer | Guillem Capellera et.al. | 2406.19852 | null |
2024-06-28 | StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction | Jiaheng Zhuang et.al. | 2406.19844 | null |
2024-06-28 | Modeling the Real World with High-Density Visual Particle Dynamics | William F. Whitney et.al. | 2406.19800 | null |
2024-06-28 | Integrating occlusion awareness in urban motion prediction for enhanced autonomous vehicle navigation | Vinicius Trentin et.al. | 2406.19798 | null |
2024-06-28 | LCSim: A Large-Scale Controllable Traffic Simulator | Yuheng Zhang et.al. | 2406.19781 | link |
2024-06-27 | A Max Pressure Algorithm for Traffic Signals Considering Pedestrian Queues | Hao Liu et.al. | 2406.19305 | null |
2024-06-26 | A Multi-Stage Goal-Driven Network for Pedestrian Trajectory Prediction | Xiuen Wu et.al. | 2406.18050 | null |
2024-06-25 | Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning | Jintao Yan et.al. | 2406.17470 | null |
2024-06-25 | Task Adaptation in Industrial Human-Robot Interaction: Leveraging Riemannian Motion Policies | Mike Allenspach et.al. | 2406.17333 | null |
2024-06-25 | Parametrized topological complexity of spherical fibrations over spheres | Yuki Minowa et.al. | 2406.17227 | null |
2024-06-24 | Socially Acceptable Bipedal Robot Navigation via Social Zonotope Network Model Predictive Control | Abdulaziz Shamsah et.al. | 2406.17151 | null |
2024-06-23 | MetaFollower: Adaptable Personalized Autonomous Car Following | Xianda Chen et.al. | 2406.16978 | null |
2024-04-29 | An Exploratory Study on Human-Centric Video Anomaly Detection through Variational Autoencoders and Trajectory Prediction | Ghazal Alinezhad Noghre et.al. | 2406.15395 | link |
2024-06-20 | Relational Reasoning On Graphs Using Opinion Dynamics | Yulong Yang et.al. | 2406.14746 | null |
2024-07-24 | Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Yuan Chen et.al. | 2406.14556 | link |
2024-06-20 | FutureNet-LOF: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding | Mingkun Wang et.al. | 2406.14422 | null |
2024-06-20 | A-OctoMap: An Adaptive OctoMap for Online Motion Planning | Yihui Mao et.al. | 2406.13910 | null |
2024-06-19 | Group-Control Motion Planning Framework for Microrobot Swarms in a Global Field | Siyu Li et.al. | 2406.13829 | null |
2024-06-19 | Tactical Game-theoretic Decision-making with Homotopy Class Constraints | Michael Khayyat et.al. | 2406.13656 | null |
2024-06-18 | Transforming Surgical Interventions with Embodied Intelligence for Ultrasound Robotics | Huan Xu et.al. | 2406.12651 | null |
2024-09-20 | CUQDS: Conformal Uncertainty Quantification under Distribution Shift for Trajectory Prediction | Huiqun Huang et.al. | 2406.12100 | null |
2024-06-17 | Crossfusor: A Cross-Attention Transformer Enhanced Conditional Diffusion Model for Car-Following Trajectory Prediction | Junwei You et.al. | 2406.11941 | null |
2024-06-17 | FetchBench: A Simulation Benchmark for Robot Fetching | Beining Han et.al. | 2406.11793 | null |
2024-06-17 | A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving | Yang Lou et.al. | 2406.11707 | null |
2024-07-04 | ESI-GAL: EEG Source Imaging-based Kinematics Parameter Estimation for Grasp and Lift Task | Anant Jain et.al. | 2406.11500 | null |
2024-06-16 | TrafficBots V1.5: Traffic Simulation via Conditional VAEs and Transformers with Relative Pose Encoding | Zhejun Zhang et.al. | 2406.10898 | link |
2024-09-19 | Planning with Adaptive World Models for Autonomous Driving | Arun Balajee Vasudevan et.al. | 2406.10714 | null |
2024-10-02 | A GPU-accelerated Large-scale Simulator for Transportation System Optimization Benchmarking | Jun Zhang et.al. | 2406.10661 | link |
2024-06-15 | Learning Temporal Logic Predicates from Data with Statistical Guarantees | Emi Soroka et.al. | 2406.10449 | null |
2024-06-14 | Constrained Motion Planning for a Robotic Endoscope Holder based on Hierarchical Quadratic Programming | Jacinto Colan et.al. | 2406.09982 | null |
2024-06-13 | Optimal Convex Cover as Collision-free Space Approximation for Trajectory Generation | Yuwei Wu et.al. | 2406.09631 | null |
2024-06-17 | Search-based versus Sampling-based Robot Motion Planning: A Comparative Study | Georgios Sotirchos et.al. | 2406.09623 | link |
2024-06-13 | Trajectory Planning for Autonomous Driving in Unstructured Scenarios Based on Graph Neural Network and Numerical Optimization | Sumin Zhang et.al. | 2406.08855 | null |
2024-06-12 | UnO: Unsupervised Occupancy Fields for Perception and Forecasting | Ben Agro et.al. | 2406.08691 | null |
2024-04-25 | LPSim: Large Scale Multi-GPU Parallel Computing based Regional Scale Traffic Simulation Framework | Xuan Jiang et.al. | 2406.08496 | link |
2024-07-30 | Eyes Wide Unshut: Unsupervised Mistake Detection in Egocentric Procedural Video by Detecting Unpredictable Gaze | Michele Mazzamuto et.al. | 2406.08379 | null |
2024-06-12 | A Hybrid Task-Constrained Motion Planning for Collaborative Robots in Intelligent Remanufacturing | Wansong Liu et.al. | 2406.08283 | null |
2024-06-11 | Scalable Optimal Motion Planning for Multi-Agent Systems by Cosserat Theory of Rods | Amirreza Fahim Golestaneh et.al. | 2406.07684 | null |
2024-06-11 | Instruct Large Language Models to Drive like Humans | Ruijun Zhang et.al. | 2406.07296 | link |
2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032 | null |
2024-06-11 | iMotion-LLM: Motion Prediction Instruction Tuning | Abdulwahab Felemban et.al. | 2406.06211 | null |
2024-06-10 | WoCoCo: Learning Whole-Body Humanoid Control with Sequential Contacts | Chong Zhang et.al. | 2406.06005 | null |
2024-06-09 | Towards A General-Purpose Motion Planning for Autonomous Vehicles Using Fluid Dynamics | MReza Alipour Sormoli et.al. | 2406.05708 | null |
2024-06-08 | A Survey on Hybrid Motion Planning Methods for Automated Driving Systems | MReza Alipour Sormoli et.al. | 2406.05575 | null |
2024-06-07 | CityCraft: A Real Crafter for 3D City Generation | Jie Deng et.al. | 2406.04983 | null |
2024-06-07 | SLOPE: Search with Learned Optimal Pruning-based Expansion | Davor Bokan et.al. | 2406.04935 | link |
2024-06-07 | Scaling Motion Planning Infeasibility Proofs | Sihui Li et.al. | 2406.04795 | null |
2024-06-07 | SMC++: Masked Learning of Unsupervised Video Semantic Compression | Yuan Tian et.al. | 2406.04765 | link |
2024-06-07 | Evaluating Data-driven Performances of Mixed Integer Bilinear Formulations for Book Placement Planning | Xuan Lin et.al. | 2406.04616 | null |
2024-09-18 | RiskMap: A Unified Driving Context Representation for Autonomous Motion Planning in Urban Driving Environment | Ren Xin et.al. | 2406.04451 | null |
2024-07-12 | Traffic signal optimization in large-scale urban road networks: an adaptive-predictive controller using Ising models | Daisuke Inoue et.al. | 2406.03690 | link |
2024-06-13 | Task and Motion Planning for Execution in the Real | Tianyang Pan et.al. | 2406.03641 | null |
2024-06-05 | Adaptive Distance Functions via Kelvin Transformation | Rafael I. Cabral Muchacho et.al. | 2406.03200 | null |
2024-06-05 | Real-time Motion Planning for autonomous vehicles in dynamic environments | Mohammad Dehghani Tezerjani et.al. | 2406.02916 | null |
2024-07-26 | Towards Interactive Autonomous Vehicle Testing: Vehicle-Under-Test-Centered Traffic Simulation | Yiru Liu et.al. | 2406.02860 | link |
2024-06-04 | Collision-Affording Point Trees: SIMD-Amenable Nearest Neighbors for Fast Collision Checking | Clayton W. Ramsey et.al. | 2406.02807 | link |
2024-06-04 | Feasibility of State Space Models for Network Traffic Generation | Andrew Chu et.al. | 2406.02784 | null |
2024-06-04 | Improved context-sensitive transformer model for inland vessel trajectory prediction | Kathrin Donandt et.al. | 2406.02771 | null |
2024-06-04 | Short-term Inland Vessel Trajectory Prediction with Encoder-Decoder Models | Kathrin Donandt et.al. | 2406.02770 | null |
2024-06-04 | Spatial and social situation-aware transformer-based trajectory prediction of autonomous systems | Kathrin Donandt et.al. | 2406.02767 | null |
2024-06-04 | Out-of-Distribution Runtime Adaptation with Conformalized Neural Network Ensembles | Polo Contreras et.al. | 2406.02436 | null |
2024-06-05 | Incorporating Navigation Context into Inland Vessel Trajectory Prediction: A Gaussian Mixture Model and Transformer Approach | Kathrin Donandt et.al. | 2406.02344 | null |
2024-06-03 | ZAPP! Zonotope Agreement of Prediction and Planning for Continuous-Time Collision Avoidance with Discrete-Time Dynamics | Luca Paparusso et.al. | 2406.01814 | null |
2024-06-03 | Motion Planning for Hybrid Dynamical Systems: Framework, Algorithm Template, and a Sampling-based Approach | Nan Wang et.al. | 2406.01802 | null |
2024-06-03 | Walk on Spheres for PDE-based Path Planning | Rafael I. Cabral Muchacho et.al. | 2406.01713 | null |
2024-06-04 | PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning | Yupeng Zheng et.al. | 2406.01587 | null |
2024-06-03 | Extremum Seeking Control for Scalar Maps with Distributed Diffusion PDEs | Pedro Henrique Silva Coutinho et.al. | 2406.01564 | null |
2024-09-06 | Deep Stochastic Kinematic Models for Probabilistic Motion Forecasting in Traffic | Laura Zheng et.al. | 2406.01431 | null |
2024-06-02 | CCF: Cross Correcting Framework for Pedestrian Trajectory Prediction | Pranav Singh Chib et.al. | 2406.00749 | null |
2024-06-02 | An Efficient Trajectory Generation for Bi-copter Flight in Tight Space | Xin Dong et.al. | 2406.00671 | null |
2024-05-31 | Navigating Autonomous Vehicle on Unmarked Roads with Diffusion-Based Motion Prediction and Active Inference | Yufei Huang et.al. | 2406.00211 | null |
2024-05-31 | Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models | Yi Yang et.al. | 2405.20991 | link |
2024-05-31 | Transforming Japan Real Estate | Diabul Haque et.al. | 2405.20715 | null |
2024-05-30 | A Structure-Aware Lane Graph Transformer Model for Vehicle Trajectory Prediction | Sun Zhanbo et.al. | 2405.20121 | null |
2024-05-31 | SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation | Wenchao Sun et.al. | 2405.19620 | link |
2024-05-29 | Predicting Long-Term Human Behaviors in Discrete Representations via Physics-Guided Diffusion | Zhitian Zhang et.al. | 2405.19528 | null |
2024-05-29 | Conditional Latent ODEs for Motion Prediction in Autonomous Driving | Khang Truong Giang et.al. | 2405.19183 | link |
2024-05-29 | Exploring Probabilistic Distance Fields in Robotics | Lan Wu et.al. | 2405.18965 | null |
2024-05-29 | WTTFNet: A Weather-Time-Trajectory Fusion Network for Pedestrian Trajectory Prediction in Urban Complex | Ho Chun Wu et.al. | 2405.18945 | null |
2024-05-29 | Development of a Novel Impedance-Controlled Quasi-Direct-Drive Robotic Hand | Jay Best et.al. | 2405.18730 | null |
2024-05-30 | Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction | Xuehao Gao et.al. | 2405.18700 | null |
2024-06-07 | Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree | Lang Feng et.al. | 2405.17879 | link |
2024-05-27 | Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent | Yi Xu et.al. | 2405.17680 | link |
2024-05-29 | A note on the error analysis of data-driven closure models for large eddy simulations of turbulence | Dibyajyoti Chakraborty et.al. | 2405.17612 | null |
2024-05-28 | Controllable Longer Image Animation with Diffusion Models | Qiang Wang et.al. | 2405.17306 | null |
2024-05-27 | Motion Primitives Planning For Center-Articulated Vehicles | Jiangpeng Hu et.al. | 2405.17127 | null |
2024-05-27 | A Two-Level Stochastic Model for the Lateral Movement of Vehicles Within Their Lane Under Homogeneous Traffic Conditions | Nicole Neis et.al. | 2405.17080 | null |
2024-05-26 | Towards Imitation Learning in Real World Unstructured Social Mini-Games in Pedestrian Crowds | Rohan Chandra et.al. | 2405.16439 | null |
2024-05-25 | RoboArm-NMP: a Learning Environment for Neural Motion Planning | Tom Jurgenson et.al. | 2405.16335 | null |
2024-05-25 | Neural Network-Based Tracking and 3D Reconstruction of Baseball Pitch Trajectories from Single-View 2D Video | Jhen Hsieh et.al. | 2405.16296 | null |
2024-05-25 | FlightPatchNet: Multi-Scale Patch Network with Differential Coding for Flight Trajectory Prediction | Lan Wu et.al. | 2405.16200 | null |
2024-05-24 | TD3 Based Collision Free Motion Planning for Robot Navigation | Hao Liu et.al. | 2405.15460 | null |
2024-06-21 | Dynamic Planning for Sequential Whole-body Mobile Manipulation | Zhitian Li et.al. | 2405.15377 | null |
2024-05-24 | Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor | Haoxuan Qu et.al. | 2405.15267 | null |
2024-05-23 | Metric Flow Matching for Smooth Interpolations on the Data Manifold | Kacper Kapusniak et.al. | 2405.14780 | link |
2024-05-23 | Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond | Zhechao Wang et.al. | 2405.14674 | link |
2024-05-23 | Reliable Trajectory Prediction and Uncertainty Quantification with Conditioned Diffusion Models | Marion Neumeier et.al. | 2405.14384 | null |
2024-05-23 | Optimal Whole Body Trajectory Planning for Mobile Manipulators in Planetary Exploration and Construction | Federica Storiale et.al. | 2405.14363 | null |
2024-05-22 | BenchNav: Simulation Platform for Benchmarking Off-road Navigation Algorithms with Probabilistic Traversability | Masafumi Endo et.al. | 2405.13318 | link |
2024-05-21 | Empowering Urban Traffic Management: Elevated 3D LiDAR for Data Collection and Advanced Object Detection Analysis | Nawfal Guefrachi et.al. | 2405.13202 | null |
2024-05-21 | A machine learning framework for interpretable predictions in patient pathways: The case of predicting ICU admission for patients with symptoms of sepsis | Sandra Zilker et.al. | 2405.13187 | null |
2024-05-21 | Enhancing Interaction Modeling with Agent Selection and Physical Methods for Trajectory Prediction | Shiji Huang et.al. | 2405.13152 | link |
2024-05-21 | Towards Using Fast Embedded Model Predictive Control for Human-Aware Predictive Robot Navigation | Till Hielscher et.al. | 2405.12616 | null |
2024-05-21 | MOSS: A Large-scale Open Microscopic Traffic Simulation System | Jun Zhang et.al. | 2405.12520 | link |
2024-05-20 | Design, Control, and Motion-Planning for a Root-Perching Rotor-Distributed Manipulator | Takuzumi Nishio et.al. | 2405.12125 | null |
2024-05-20 | CDM-MPC: An Integrated Dynamic Planning and Control Framework for Bipedal Robots Jumping | Zhicheng He et.al. | 2405.11773 | null |
2024-05-20 | AI Algorithm for Predicting and Optimizing Trajectory of UAV Swarm | Amit Raj et.al. | 2405.11722 | null |
2024-05-29 | Track Anything Rapter(TAR) | Tharun V. Puthanveettil et.al. | 2405.11655 | link |
2024-08-12 | Neural Randomized Planning for Whole Body Robot Motion | Yunfan Lu et.al. | 2405.11317 | null |
2024-08-09 | RuleFuser: An Evidential Bayes Approach for Rule Injection in Imitation Learned Planners for Robustness under Distribution Shifts | Jay Patrikar et.al. | 2405.11139 | null |
2024-05-17 | Model Predictive Contouring Control for Vehicle Obstacle Avoidance at the Limit of Handling Using Torque Vectoring | Alberto Bertipaglia et.al. | 2405.10847 | null |
2024-05-22 | Fast Collision Probability Estimation for Automated Driving using Multi-circular Shape Approximations | Leon Tolksdorf et.al. | 2405.10765 | null |
2024-05-17 | You Can’t Solve These Super Mario Bros. Levels: Undecidable Mario Games | MIT Hardness Group et.al. | 2405.10546 | null |
2024-05-16 | RGB Guided ToF Imaging System: A Survey of Deep Learning-based Methods | Xin Qiao et.al. | 2405.10357 | null |
2024-05-16 | Towards Consistent and Explainable Motion Prediction using Heterogeneous Graph Attention | Tobias Demmler et.al. | 2405.10134 | null |
2024-05-16 | Integrating Uncertainty-Aware Human Motion Prediction into Graph-Based Manipulator Motion Planning | Wansong Liu et.al. | 2405.09779 | null |
2024-05-18 | Motion Prediction with Gaussian Processes for Safe Human-Robot Interaction in Virtual Environments | Stanley Mugisha et.al. | 2405.09109 | null |
2024-05-20 | Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous Driving | Ross Greer et.al. | 2405.09049 | null |
2024-05-14 | COAST: Constraints and Streams for Task and Motion Planning | Brandon Vu et.al. | 2405.08572 | null |
2024-07-17 | Vector Field-Guided Learning Predictive Control for Motion Planning of Mobile Robots with Uncertain Dynamics | Yang Lu et.al. | 2405.08283 | null |
2024-05-13 | Equivariant Deep Learning of Mixed-Integer Optimal Control Solutions for Vehicle Decision Making and Motion Planning | Rudolf Reiter et.al. | 2405.08122 | null |
2024-05-13 | Fighter flight trajectory prediction based on spatio-temporal graphcial attention network | Yao Sun et.al. | 2405.08034 | null |
2024-05-12 | Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising | Yao Liu et.al. | 2405.07164 | null |
2024-05-11 | Optimal Multilayered Motion Planning for Multiple Differential Drive Mobile Robots with Hierarchical Prioritization (OM-MP) | Zong Chen et.al. | 2405.07043 | null |
2024-05-11 | Multi-agent Traffic Prediction via Denoised Endpoint Distribution | Yao Liu et.al. | 2405.07041 | null |
2024-05-10 | Hierarchical Learned Risk-Aware Planning Framework for Human Driving Modeling | Nathan Ludlow et.al. | 2405.06578 | null |
2024-05-09 | A Mixture of Experts Approach to 3D Human Motion Prediction | Edmund Shieh et.al. | 2405.06088 | link |
2024-05-08 | Planning with Probabilistic Opacity and Transparency: A Computational Model of Opaque/Transparent Observations | Sumukha Udupa et.al. | 2405.05408 | null |
2024-05-08 | Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models | Zhengxing Lan et.al. | 2405.04909 | null |
2024-05-07 | Physics-data hybrid dynamic model of a multi-axis manipulator for sensorless dexterous manipulation and high-performance motion planning | Wu-Te Yang et.al. | 2405.04503 | null |
2024-06-28 | Adjoint Sensitivity Analysis on Multi-Scale Bioprocess Stochastic Reaction Network | Keilung Choy et.al. | 2405.04011 | null |
2024-05-07 | Unified End-to-End V2X Cooperative Autonomous Driving | Zhiwei Li et.al. | 2405.03971 | null |
2024-05-06 | SocialFormer: Social Interaction Modeling with Edge-enhanced Heterogeneous Graph Transformers for Trajectory Prediction | Zixu Wang et.al. | 2405.03809 | null |
2024-05-06 | Motion Planning under Uncertainty: Integrating Learning-Based Multi-Modal Predictors into Branch Model Predictive Control | Mohamed-Khalil Bouzidi et.al. | 2405.03470 | null |
2024-05-06 | Greedy Heuristics for Sampling-based Motion Planning in High-Dimensional State Spaces | Phone Thiha Kyaw et.al. | 2405.03411 | link |
2024-05-05 | A Long-Short-Term Mixed-Integer Formulation for Highway Lane Change Planning | Rudolf Reiter et.al. | 2405.02979 | null |
2024-05-07 | CoverLib: Classifiers-equipped Experience Library by Iterative Problem Distribution Coverage Maximization for Domain-tuned Motion Planning | Hirokazu Ishida et.al. | 2405.02968 | null |
2024-05-05 | Multimodal Sense-Informed Prediction of 3D Human Motions | Zhenyu Lou et.al. | 2405.02911 | null |
2024-05-03 | Characterized Diffusion and Spatial-Temporal Interaction Network for Trajectory Prediction in Autonomous Driving | Haicheng Liao et.al. | 2405.02145 | null |
2024-05-03 | Solving Sequential Manipulation Puzzles by Finding Easier Subproblems | Svetlana Levit et.al. | 2405.02053 | link |
2024-07-26 | Unconstraining Multi-Robot Manipulation: Enabling Arbitrary Constraints in ECBS with Bounded Sub-Optimality | Yorai Shaoul et.al. | 2405.01772 | null |
2024-05-02 | Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks | Murtaza Dalal et.al. | 2405.01534 | null |
2024-05-02 | StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation | Yupeng Zhou et.al. | 2405.01434 | link |
2024-05-02 | MFTraj: Map-Free, Behavior-Driven Trajectory Prediction for Autonomous Driving | Haicheng Liao et.al. | 2405.01266 | null |
2024-05-02 | Poisoning Attacks on Federated Learning for Autonomous Driving | Sonakshi Garg et.al. | 2405.01073 | null |
2024-08-22 | Addressing Diverging Training Costs using BEVRestore for High-resolution Bird’s Eye View Map Construction | Minsu Kim et.al. | 2405.01016 | null |
2024-05-01 | A Differentiable Dynamic Modeling Approach to Integrated Motion Planning and Actuator Physical Design for Mobile Manipulators | Zehui Lu et.al. | 2405.00882 | null |
2024-05-01 | ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties | Jiahui Li et.al. | 2405.00797 | link |
2024-05-01 | A Preprocessing and Evaluation Toolbox for Trajectory Prediction Research on the Drone Datasets | Theodor Westny et.al. | 2405.00604 | link |
2024-05-01 | Long-Term Human Trajectory Prediction using 3D Dynamic Scene Graphs | Nicolas Gorlo et.al. | 2405.00552 | link |
2024-05-01 | Enhancing Surgical Robots with Embodied Intelligence for Autonomous Ultrasound Scanning | Huan Xu et.al. | 2405.00461 | null |
2024-05-05 | Enhance Planning with Physics-informed Safety Controller for End-to-end Autonomous Driving | Hang Zhou et.al. | 2405.00316 | null |
2024-04-30 | Reactive Temporal Logic-based Planning and Control for Interactive Robotic Tasks | Farhad Nawaz et.al. | 2404.19594 | null |
2024-04-30 | MoST: Multi-modality Scene Tokenization for Motion Prediction | Norman Mu et.al. | 2404.19531 | null |
2024-04-30 | Transformer-Enhanced Motion Planner: Attention-Guided Sampling for State-Specific Decision Making | Lei Zhuang et.al. | 2404.19403 | null |
2024-07-01 | SemanticFormer: Holistic and Semantic Traffic Scene Representation for Trajectory Prediction using Knowledge Graphs | Zhigang Sun et.al. | 2404.19379 | link |
2024-04-30 | G2LTraj: A Global-to-Local Generation Approach for Trajectory Prediction | Zhanwei Zhang et.al. | 2404.19330 | link |
2024-04-30 | MAP-Former: Multi-Agent-Pair Gaussian Joint Prediction | Marlon Steiner et.al. | 2404.19283 | null |
2024-04-30 | Flight Trajectory Prediction Using an Enhanced CNN-LSTM Network | Qinzhi Hao et.al. | 2404.19218 | null |
2024-04-29 | Socially Adaptive Path Planning Based on Generative Adversarial Network | Yao Wang et.al. | 2404.18687 | null |
2024-07-24 | IncidentResponseGPT: Generating Traffic Incident Response Plans with Generative Artificial Intelligence | Artur Grigorev et.al. | 2404.18550 | null |
2024-04-27 | Motion planning for off-road autonomous driving based on human-like cognition and weight adaptation | Yuchun Wang et.al. | 2404.17820 | null |
2024-04-26 | A Cognitive-Driven Trajectory Prediction Model for Autonomous Driving in Mixed Autonomy Environment | Haicheng Liao et.al. | 2404.17520 | null |
2024-04-26 | Clustering of Motion Trajectories by a Distance Measure Based on Semantic Features | Christoph Zelch et.al. | 2404.17269 | link |
2024-04-25 | Motor Focus: Ego-Motion Prediction with All-Pixel Matching | Hao Wang et.al. | 2404.17031 | link |
2024-04-25 | Neural Interaction Energy for Multi-Agent Trajectory Prediction | Kaixin Shen et.al. | 2404.16579 | null |
2024-06-23 | Logic Learning from Demonstrations for Multi-step Manipulation Tasks in Dynamic Environments | Yan Zhang et.al. | 2404.16138 | null |
2024-04-24 | Learning Car-Following Behaviors Using Bayesian Matrix Normal Mixture Regression | Chengyuan Zhang et.al. | 2404.16023 | null |
2024-04-24 | Parameterized Algorithms for Coordinated Motion Planning: Minimizing Energy | Argyrios Deligkas et.al. | 2404.15950 | null |
2024-04-23 | Safe POMDP Online Planning among Dynamic Agents via Adaptive Conformal Prediction | Shili Sheng et.al. | 2404.15557 | null |
2024-04-23 | Designing, simulating, and performing the 100-AV field test for the CIRCLES consortium: Methodology and Implementation of the Largest mobile traffic control experiment to date | Mostafa Ameli et.al. | 2404.15533 | null |
2024-04-23 | Planning the path with Reinforcement Learning: Optimal Robot Motion Planning in RoboCup Small Size League Environments | Mateus G. Machado et.al. | 2404.15410 | link |
2024-07-12 | TOP-Nav: Legged Navigation Integrating Terrain, Obstacle and Proprioception Estimation | Junli Ren et.al. | 2404.15256 | null |
2024-09-02 | Unmanned Vehicles in 6G Networks: A Unifying Treatment of Problems, Formulations, and Tools | Winston Hurst et.al. | 2404.14738 | null |
2024-04-22 | Integrating Disambiguation and User Preferences into Large Language Models for Robot Motion Planning | Mohammed Abugurain et.al. | 2404.14547 | null |
2024-04-22 | Edge-Assisted ML-Aided Uncertainty-Aware Vehicle Collision Avoidance at Urban Intersections | Dinesh Cyril Selvaraj et.al. | 2404.14523 | null |
2024-04-20 | Social Force Embedded Mixed Graph Convolutional Network for Multi-class Trajectory Prediction | Quancheng Du et.al. | 2404.13378 | null |
2024-07-29 | Action Contextualization: Adaptive Task Planning and Action Tuning using Large Language Models | Sthithpragya Gupta et.al. | 2404.13191 | null |
2024-04-19 | PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation | Tianyuan Zhang et.al. | 2404.13026 | null |
2024-07-24 | FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving | Xingtai Gui et.al. | 2404.12867 | link |
2024-04-19 | uTRAND: Unsupervised Anomaly Detection in Traffic Trajectories | Giacomo D’Amicantonio et.al. | 2404.12712 | null |
2024-04-19 | SA-Attack: Speed-adaptive stealthy adversarial attack on trajectory prediction | Huilin Yin et.al. | 2404.12612 | link |
2024-04-30 | TrACT: A Training Dynamics Aware Contrastive Learning Framework for Long-tail Trajectory Prediction | Junrui Zhang et.al. | 2404.12538 | null |
2024-04-18 | Hybrid Dynamics Modeling and Trajectory Planning for a Cable-Trailer System with a Quadruped Robot | Wentao Zhang et.al. | 2404.12220 | null |
2024-04-18 | S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles | Xiao Wang et.al. | 2404.11946 | null |
2024-04-17 | State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend | Fei Cui et.al. | 2404.11576 | null |
2024-04-17 | Towards Human Awareness in Robot Task Planning with Large Language Models | Yuchen Liu et.al. | 2404.11267 | null |
2024-04-19 | KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized Intersections | Chuheng Wei et.al. | 2404.11181 | link |
2024-04-18 | FlexMap Fusion: Georeferencing and Automated Conflation of HD Maps with OpenStreetMap | Maximilian Leitenstern et.al. | 2404.10879 | link |
2024-04-16 | MPCOM: Robotic Data Gathering with Radio Mapping and Model Predictive Communication | Zhiyou Ji et.al. | 2404.10541 | null |
2024-04-16 | A Methodology of Cooperative Driving based on Microscopic Traffic Prediction | Boris S. Kerner et.al. | 2404.10375 | null |
2024-04-17 | ControlMTR: Control-Guided Motion Transformer with Scene-Compliant Intention Points for Feasible Motion Prediction | Jiawei Sun et.al. | 2404.10295 | null |
2024-04-16 | PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network | Yuning Wang et.al. | 2404.10263 | null |
2024-04-15 | Characterization and Mitigation of Insufficiencies in Automated Driving Systems | Yuting Fu et.al. | 2404.09557 | null |
2024-04-12 | Inverse Kinematics for Neuro-Robotic Grasping with Humanoid Embodied Agents | Jan-Gerrit Habekost et.al. | 2404.08825 | link |
2024-04-12 | Non-impulsive Contact-Implicit Motion Planning for Morpho-functional Loco-manipulation | Adarsh Salagame et.al. | 2404.08714 | null |
2024-04-12 | Safe Start Regions for Medical Steerable Needle Automation | Janine Hoelscher et.al. | 2404.08558 | null |
2024-08-13 | Let-It-Flow: Simultaneous Optimization of 3D Flow and Object Clustering | Patrik Vacek et.al. | 2404.08363 | link |
2024-08-07 | Transfer Learning Study of Motion Transformer-based Trajectory Predictions | Lars Ullrich et.al. | 2404.08271 | null |
2024-04-09 | GRANP: A Graph Recurrent Attentive Neural Process Model for Vehicle Trajectory Prediction | Yuhao Luo et.al. | 2404.08004 | link |
2024-04-15 | A Novel Optimization-Based Collision Avoidance For Autonomous On-Orbit Assembly | Siavash Tavana et.al. | 2404.07916 | null |
2024-04-11 | Q-ITAGS: Quality-Optimized Spatio-Temporal Heterogeneous Task Allocation with a Time Budget | Glen Neville et.al. | 2404.07902 | link |
2024-04-11 | Estimating Visibility from Alternate Perspectives for Motion Planning with Occlusions | Barry Gilhuly et.al. | 2404.07781 | link |
2024-04-11 | Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios? | Marcel Hallgarten et.al. | 2404.07569 | link |
2024-05-02 | Graph Attention Network for Lane-Wise and Topology-Invariant Intersection Traffic Simulation | Nooshin Yousefzadeh et.al. | 2404.07446 | link |
2024-04-10 | VN-EGNN: E(3)-Equivariant Graph Neural Networks with Virtual Nodes Enhance Protein Binding Site Identification | Florian Sestak et.al. | 2404.07194 | link |
2024-04-10 | CBFKIT: A Control Barrier Function Toolbox for Robotics Applications | Mitchell Black et.al. | 2404.07158 | link |
2024-04-10 | Algebraic Proofs of Path Disconnectedness using Time-Dependent Barrier Functions | Didier Henrion et.al. | 2404.06985 | link |
2024-04-10 | TrajPRed: Trajectory Prediction with Region-based Relation Learning | Chen Zhou et.al. | 2404.06971 | link |
2024-04-10 | SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving | Diankun Zhang et.al. | 2404.06892 | null |
2024-04-10 | Toward Holistic Planning and Control Optimization for Dual-Arm Rearrangement | Kai Gao et.al. | 2404.06758 | null |
2024-04-30 | An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video | Xingyu Song et.al. | 2404.06741 | link |
2024-04-09 | Traffic Signal Control and Speed Offset Coordination Using Q-Learning for Arterial Road Networks | Tianchen Yuan et.al. | 2404.06382 | null |
2024-04-09 | Two-Person Interaction Augmentation with Skeleton Priors | Baiyi Li et.al. | 2404.05490 | null |
2024-04-08 | Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models | Yutao Ouyang et.al. | 2404.05291 | null |
2024-04-07 | Legibot: Generating Legible Motions for Service Robots Using Cost-Based Local Planners | Javad Amirian et.al. | 2404.05100 | null |
2024-04-05 | Generating Synthetic Ground Truth Distributions for Multi-step Trajectory Prediction using Probabilistic Composite Bézier Curves | Ronny Hug et.al. | 2404.04397 | null |
2024-04-04 | Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture | Juanwu Lu et.al. | 2404.03789 | link |
2024-04-04 | SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Zijie Wu et.al. | 2404.03736 | link |
2024-04-04 | Towards more realistic human motion prediction with attention to motion coordination | Pengxiang Ding et.al. | 2404.03584 | null |
2024-04-04 | Factored Task and Motion Planning with Combined Optimization, Sampling and Learning | Joaquim Ortiz-Haro et.al. | 2404.03567 | null |
2024-04-04 | DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models | Yuchen Liu et.al. | 2404.03275 | null |
2024-04-04 | Traversability-aware Adaptive Optimization for Path Planning and Control in Mountainous Terrain | Se-Wook Yoo et.al. | 2404.03274 | null |
2024-04-04 | A Framework for Guided Motion Planning | Amnon Attali et.al. | 2404.03133 | null |
2024-06-30 | A Survey of Optimization-based Task and Motion Planning: From Classical To Learning Approaches | Zhigen Zhao et.al. | 2404.02817 | null |
2024-04-03 | Leveraging Swarm Intelligence to Drive Autonomously: A Particle Swarm Optimization based Approach to Motion Planning | Sven Ochs et.al. | 2404.02644 | null |
2024-04-03 | Versatile Scene-Consistent Traffic Scenario Generation as Optimization with Diffusion | Zhiyu Huang et.al. | 2404.02524 | null |
2024-04-02 | APEX: Ambidextrous Dual-Arm Robotic Manipulation Using Collision-Free Generative Diffusion Models | Apan Dastider et.al. | 2404.02284 | null |
2024-04-02 | OFMPNet: Deep End-to-End Model for Occupancy and Flow Prediction in Urban Environment | Youshaa Murhij et.al. | 2404.02263 | link |
2024-04-02 | OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising | Haichao Zhang et.al. | 2404.02227 | link |
2024-04-02 | Interaction-Aware Vehicle Motion Planning with Collision Avoidance Constraints in Highway Traffic | Dongryul Kim et.al. | 2404.01661 | null |
2024-04-02 | PhysORD: A Neuro-Symbolic Approach for Physics-infused Motion Prediction in Off-road Driving | Zhipeng Zhao et.al. | 2404.01596 | link |
2024-04-01 | CyberShake Earthquake Fault Rupture Modeling and Ground Motion Simulations for the Southwest Iceland Transform Zone | Otilio Rojas et.al. | 2404.01533 | null |
2024-04-01 | QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving | Sourav Biswas et.al. | 2404.01486 | null |
2024-04-01 | Efficient Motion Planning for Manipulators with Control Barrier Function-Induced Neural Controller | Mingxin Yu et.al. | 2404.01184 | null |
2024-04-01 | An Integrating Comprehensive Trajectory Prediction with Risk Potential Field Method for Autonomous Driving | Kailu Wu et.al. | 2404.00893 | null |
2024-03-31 | Using Explainable AI and Hierarchical Planning for Outreach with Robots | Daksh Dobhal et.al. | 2404.00808 | null |
2024-03-31 | Adapting to Length Shift: FlexiLength Network for Trajectory Prediction | Yi Xu et.al. | 2404.00742 | null |
2024-03-30 | CBF-Based Motion Planning for Socially Responsible Robot Navigation Guaranteeing STL Specification | Andrea Ruo et.al. | 2404.00356 | null |
2024-03-30 | CBF-Based STL Motion Planning for Social Navigation in Crowded Environment | Andrea Ruo et.al. | 2404.00353 | null |
2024-03-30 | Joint Pedestrian Trajectory Prediction through Posterior Sampling | Haotian Lin et.al. | 2404.00237 | null |
2024-03-29 | Accelerating Search-Based Planning for Multi-Robot Manipulation by Leveraging Online-Generated Experiences | Yorai Shaoul et.al. | 2404.00143 | null |
2024-03-28 | RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents | Zeren Chen et.al. | 2403.19622 | null |
2024-03-30 | Egocentric Scene-aware Human Trajectory Prediction | Weizhuo Wang et.al. | 2403.19026 | null |
2024-03-27 | An Efficient Risk-aware Branch MPC for Automated Driving that is Robust to Uncertain Vehicle Behaviors | Luyao Zhang et.al. | 2403.18695 | null |
2024-05-13 | Sampling-Based Motion Planning with Online Racing Line Generation for Autonomous Driving on Three-Dimensional Race Tracks | Levent Ögretmen et.al. | 2403.18643 | link |
2024-03-27 | Optimal Control Synthesis of Markov Decision Processes for Efficiency with Surveillance Tasks | Yu Chen et.al. | 2403.18632 | null |
2024-03-27 | Bridging the Gap: Regularized Reinforcement Learning for Improved Classical Motion Planning with Safety Modules | Elias Goldsztejn et.al. | 2403.18524 | null |
2024-03-27 | SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model | Inhwan Bae et.al. | 2403.18452 | link |
2024-03-27 | Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction | Inhwan Bae et.al. | 2403.18447 | link |
2024-03-30 | HyRRT-Connect: A Bidirectional Rapidly-Exploring Random Trees Motion Planning Algorithm for Hybrid Systems | Nan Wang et.al. | 2403.18413 | link |
2024-03-27 | LC-LLM: Explainable Lane-Change Intention and Trajectory Predictions with Large Language Models | Mingxing Peng et.al. | 2403.18344 | null |
2024-03-26 | Solution for Point Tracking Task of ICCV 1st Perception Test Challenge 2023 | Hongpeng Pan et.al. | 2403.17994 | null |
2024-07-11 | SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic | Kashyap Chitta et.al. | 2403.17933 | link |
2024-03-26 | CMP: Cooperative Motion Prediction with Multi-Agent Communication | Zhuoyuan Wu et.al. | 2403.17916 | null |
2024-05-23 | LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation | Ke Guo et.al. | 2403.17601 | link |
2024-03-25 | Temporal and Semantic Evaluation Metrics for Foundation Models in Post-Hoc Analysis of Robotic Sub-tasks | Jonathan Salfity et.al. | 2403.17238 | link |
2024-03-25 | PROSPECT: Precision Robot Spectroscopy Exploration and Characterization Tool | Nathaniel Hanson et.al. | 2403.17232 | null |
2024-03-25 | Vision-Based Dexterous Motion Planning by Dynamic Movement Primitives with Human Hand Demonstration | Nuo Chen et.al. | 2403.17111 | null |
2024-03-25 | Spline Trajectory Tracking and Obstacle Avoidance for Mobile Agents via Convex Optimization | Akua Dickson et.al. | 2403.16900 | null |
2024-03-25 | Real-time Model Predictive Control with Zonotope-Based Neural Networks for Bipedal Social Navigation | Abdulaziz Shamsah et.al. | 2403.16485 | null |
2024-03-25 | Towards Cooperative Maneuver Planning in Mixed Traffic at Urban Intersections | Marvin Klimke et.al. | 2403.16478 | null |
2024-03-25 | Producing and Leveraging Online Map Uncertainty in Trajectory Prediction | Xunjiang Gu et.al. | 2403.16439 | link |
2024-03-25 | ProIn: Learning to Predict Trajectory Based on Progressive Interactions for Autonomous Driving | Yinke Dong et.al. | 2403.16374 | null |
2024-03-24 | Combined Task and Motion Planning Via Sketch Decompositions (Extended Version with Supplementary Material) | Magí Dalmau-Moreno et.al. | 2403.16277 | null |
2024-03-28 | Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method | Jie Tian et.al. | 2403.16169 | null |
2024-04-23 | Risk-Calibrated Human-Robot Interaction via Set-Valued Intent Prediction | Justin Lidard et.al. | 2403.15959 | null |
2024-03-23 | Human Motion Prediction under Unexpected Perturbation | Jiangbei Yue et.al. | 2403.15891 | null |
2024-03-23 | A Comparative Study of Artificial Potential Fields and Safety Filters | Ming Li et.al. | 2403.15743 | null |
2024-03-23 | Motion Planning for Identification of Linear Classifiers | Aneesh Raghavan et.al. | 2403.15687 | null |
2024-03-22 | Autonomous Driving With Perception Uncertainties: Deep-Ensemble Based Adaptive Cruise Control | Xiao Li et.al. | 2403.15577 | null |
2024-03-22 | OceanPlan: Hierarchical Planning and Replanning for Natural Language AUV Piloting in Large-scale Unexplored Ocean Environments | Ruochu Yang et.al. | 2403.15369 | null |
2024-03-22 | ALPINE: a climbing robot for operations in mountain environments | Michele Focchi et.al. | 2403.15142 | link |
2024-03-27 | UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction | Lan Feng et.al. | 2403.15098 | link |
2024-03-22 | Linear Quadratic Guidance Law for Joint Motion Planning of a Pursuer-Turret Assembly | Bhargav Jha et.al. | 2403.14997 | null |
2024-03-22 | Boundary-Aware Value Function Generation for Safe Stochastic Motion Planning | Junhong Xu et.al. | 2403.14956 | null |
2024-04-10 | Learning Hierarchical Control For Multi-Agent Capacity-Constrained Systems | Charlott Vallon et.al. | 2403.14545 | null |
2024-03-21 | Exosense: A Vision-Centric Scene Understanding System For Safe Exoskeleton Navigation | Jianeng Wang et.al. | 2403.14320 | null |
2024-03-21 | Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration | Zhihao Wang et.al. | 2403.14104 | null |
2024-03-20 | Motion Prediction of Multi-agent systems with Multi-view clustering | Anegi James et.al. | 2403.13905 | null |
2024-03-20 | Certified Human Trajectory Prediction | Mohammadhossein Bahari et.al. | 2403.13778 | null |
2024-03-20 | LaCE-LHMP: Airflow Modelling-Inspired Long-Term Human Motion Prediction By Enhancing Laminar Characteristics in Human Flow | Yufei Zhu et.al. | 2403.13640 | link |
2024-06-07 | SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors | Chenyang Ma et.al. | 2403.13438 | null |
2024-03-20 | ManiPose: A Comprehensive Benchmark for Pose-aware Object Manipulation in Robotics | Qiaojun Yu et.al. | 2403.13365 | null |
2024-03-21 | AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving | Xiaosong Jia et.al. | 2403.13331 | null |
2024-03-21 | Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations | Kewei Wang et.al. | 2403.13261 | link |
2024-04-07 | Federated reinforcement learning for robot motion planning with zero-shot generalization | Zhenyuan Yuan et.al. | 2403.13245 | null |
2024-03-19 | Shortest Trajectory of a Dubins Vehicle with a Controllable Laser | Shivam Bajaj et.al. | 2403.12346 | null |
2024-03-18 | Reachability-based Trajectory Design via Exact Formulation of Implicit Neural Signed Distance Functions | Jonathan Michaux et.al. | 2403.12280 | null |
2024-03-18 | IKSPARK: An Inverse Kinematics Solver using Semidefinite Relaxation and Rank Minimization | Liangting Wu et.al. | 2403.12235 | null |
2024-03-18 | Informed Spectral Normalized Gaussian Processes for Trajectory Prediction | Christian Schlauch et.al. | 2403.11966 | null |
2024-03-18 | TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction | Ali Asghar Sharifi et.al. | 2403.11695 | null |
2024-03-18 | Diffusion-Based Environment-Aware Trajectory Prediction | Theodor Westny et.al. | 2403.11643 | null |
2024-03-20 | LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Shu Wang et.al. | 2403.11552 | link |
2024-03-19 | SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction | Yang Zhou et.al. | 2403.11492 | link |
2024-03-18 | Robot Navigation in Unknown and Cluttered Workspace with Dynamical System Modulation in Starshaped Roadmap | Kai Chen et.al. | 2403.11484 | null |
2024-03-17 | Driving Style Alignment for LLM-powered Driver Agent | Ruoxuan Yang et.al. | 2403.11368 | link |
2024-03-17 | Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments | Hongrui Zheng et.al. | 2403.11334 | null |
2024-03-17 | Pioneering SE(2)-Equivariant Trajectory Planning for Automated Driving | Steffen Hagedorn et.al. | 2403.11304 | null |
2024-03-17 | Large Language Models Powered Context-aware Motion Prediction | Xiaoji Zheng et.al. | 2403.11057 | link |
2024-03-16 | PAAMP: Polytopic Action-Set And Motion Planning For Long Horizon Dynamic Motion Planning via Mixed Integer Linear Programming | Akshay Jaitly et.al. | 2403.10924 | null |
2024-03-16 | Exploring Learning-based Motion Models in Multi-Object Tracking | Hsiang-Wei Huang et.al. | 2403.10826 | null |
2024-03-16 | Efficient Trajectory Forecasting and Generation with Conditional Flow Matching | Sean Ye et.al. | 2403.10809 | link |
2024-03-16 | Diffusion-Reinforcement Learning Hierarchical Motion Planning in Adversarial Multi-agent Games | Zixuan Wu et.al. | 2403.10794 | link |
2024-03-16 | iDb-RRT: Sampling-based Kinodynamic Motion Planning with Motion Primitives and Trajectory Optimization | Joaquim Ortiz-Haro et.al. | 2403.10745 | null |
2024-03-15 | Partially Observable Task and Motion Planning with Uncertainty and Risk Awareness | Aidan Curtis et.al. | 2403.10454 | null |
2024-03-15 | H-MaP: An Iterative and Hybrid Sequential Manipulation Planner | Berk Cicek et.al. | 2403.10436 | null |
2024-03-15 | Revolutionizing Packaging: A Robotic Bagging Pipeline with Constraint-aware Structure-of-Interest Planning | Jiaming Qi et.al. | 2403.10309 | null |
2024-03-15 | T4P: Test-Time Training of Trajectory Prediction via Masked Autoencoder and Actor-specific Token Memory | Daehee Park et.al. | 2403.10052 | link |
2024-03-15 | Interactive Distance Field Mapping and Planning to Enable Human-Robot Collaboration | Usama Ali et.al. | 2403.09988 | link |
2024-03-14 | Intention-aware Denoising Diffusion Model for Trajectory Prediction | Chen Liu et.al. | 2403.09190 | null |
2024-03-13 | Autonomous Underground Freight Transport Systems – The Future of Urban Logistics? | Lasse Bienzeisler et.al. | 2403.08841 | null |
2024-03-13 | CoPa: General Robotic Manipulation through Spatial Constraints of Parts with Foundation Models | Haoxu Huang et.al. | 2403.08248 | null |
2024-03-13 | SpaceOctopus: An Octopus-inspired Motion Planning Framework for Multi-arm Space Robot | Wenbo Zhao et.al. | 2403.08219 | null |
2024-03-14 | V-PRISM: Probabilistic Mapping of Unknown Tabletop Scenes | Herbert Wright et.al. | 2403.08106 | link |
2024-03-12 | Task and Motion Planning in Hierarchical 3D Scene Graphs | Aaron Ray et.al. | 2403.08094 | null |
2024-03-12 | LG-Traj: LLM Guided Pedestrian Trajectory Prediction | Pranav Singh Chib et.al. | 2403.08032 | null |
2024-06-04 | The Virtues of Laziness: Multi-Query Kinodynamic Motion Planning with Lazy Methods | Anuj Pasricha et.al. | 2403.07867 | null |
2024-03-12 | Online Adaptation of Sampling-Based Motion Planning with Inaccurate Models | Marco Faroni et.al. | 2403.07638 | null |
2024-03-12 | DrPlanner: Diagnosis and Repair of Motion Planners Using Large Language Models | Yuanfei Lin et.al. | 2403.07470 | link |
2024-03-12 | Tractable Joint Prediction and Planning over Discrete Behavior Modes for Urban Driving | Adam Villaflor et.al. | 2403.07232 | null |
2024-02-29 | Physics Sensor Based Deep Learning Fall Detection System | Zeyuan Qu et.al. | 2403.06994 | null |
2024-02-28 | Automatic driving lane change safety prediction model based on LSTM | Wenjian Sun et.al. | 2403.06993 | null |
2024-03-11 | Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction | Qing Xiao et.al. | 2403.06940 | null |
2024-03-11 | A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation | Pan He et.al. | 2403.06884 | null |
2024-03-11 | Hybrid optimal control with mixed-integer Lagrangian methods | Viktoriya Nikitina et.al. | 2403.06842 | null |
2024-03-12 | Enhancing Joint Motion Prediction for Individuals with Limb Loss Through Model Reprogramming | Sharmita Dey et.al. | 2403.06569 | null |
2024-03-10 | Robust Predictive Motion Planning by Learning Obstacle Uncertainty | Jian Zhou et.al. | 2403.06222 | link |
2024-03-10 | Towards Generalizable and Interpretable Motion Prediction: A Deep Variational Bayes Approach | Juanwu Lu et.al. | 2403.06086 | null |
2024-03-09 | MATRIX: Multi-Agent Trajectory Generation with Diverse Contexts | Zhuo Xu et.al. | 2403.06041 | null |
2024-03-09 | Recurrent Aligned Network for Generalized Pedestrian Trajectory Prediction | Yonghao Dong et.al. | 2403.05810 | null |
2024-03-09 | Physics-informed Neural Motion Planning on Constraint Manifolds | Ruiqi Ni et.al. | 2403.05765 | null |
2024-03-17 | A Motion Planning Algorithm in a Figure Eight Track | Cristian Jardon et.al. | 2403.05570 | null |
2024-03-08 | JointMotion: Joint Self-supervision for Joint Motion Prediction | Royden Wagner et.al. | 2403.05489 | link |
2024-03-11 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
2024-05-01 | LitSim: A Conflict-aware Policy for Long-term Interactive Traffic Simulation | Haojie Xin et.al. | 2403.04299 | null |
2024-03-06 | Temporal Enhanced Floating Car Observers | Jeremias Gerner et.al. | 2403.03825 | null |
2024-03-06 | Time-optimal Point-to-point Motion Planning: A Two-stage Approach | Shuhao Zhang et.al. | 2403.03573 | null |
2024-03-04 | Approximation of the Koopman operator via Bernstein polynomials | Rishikesh Yadav et.al. | 2403.02438 | null |
2024-03-20 | DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction | Weiyi Lv et.al. | 2403.02075 | null |
2024-03-04 | Progressive Smoothing for Motion Planning in Real-Time NMPC | Rudolf Reiter et.al. | 2403.01830 | null |
2024-03-04 | DEMOS: Dynamic Environment Motion Synthesis in 3D Scenes via Local Spherical-BEV Perception | Jingyu Gong et.al. | 2403.01740 | null |
2024-03-03 | Cooperative Automated Driving for Bottleneck Scenarios in Mixed Traffic | M. V. Baumann et.al. | 2403.01512 | null |
2024-04-15 | On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving | Kaituo Feng et.al. | 2403.01238 | link |
2024-04-17 | A Comparative Study of Rapidly-exploring Random Tree Algorithms Applied to Ship Trajectory Planning and Behavior Generation | Trym Tengesdal et.al. | 2403.01194 | null |
2024-03-01 | Complete and Near-Optimal Robotic Crack Coverage and Filling in Civil Infrastructure | Vishnu Veeraraghavan et.al. | 2403.00613 | link |
2024-03-01 | MS-Net: A Multi-Path Sparse Model for Motion Prediction in Multi-Scenes | Xiaqiang Tang et.al. | 2403.00353 | null |
2024-03-01 | Model-Based Planning and Control for Terrestrial-Aerial Bimodal Vehicles with Passive Wheels | Ruibin Zhang et.al. | 2403.00322 | null |
2024-03-04 | TEXterity – Tactile Extrinsic deXterity: Simultaneous Tactile Estimation and Control for Extrinsic Dexterity | Sangwoon Kim et.al. | 2403.00049 | null |
2024-06-02 | Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction | Wenbo Shao et.al. | 2402.19385 | null |
2024-02-29 | Attacks Against Mobility Prediction in 5G Networks | Syafiq Al Atiiq et.al. | 2402.19319 | null |
2024-02-29 | A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Haicheng Liao et.al. | 2402.19251 | link |
2024-02-21 | Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting | Edgar Medina et.al. | 2402.19237 | link |
2024-02-29 | ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration | Angelo Caregnato-Neto et.al. | 2402.19128 | null |
2024-02-29 | High-Speed Motion Planning for Aerial Swarms in Unknown and Cluttered Environments | Charbel Toumieh et.al. | 2402.19033 | link |
2024-02-29 | GoalNet: Goal Areas Oriented Pedestrian Trajectory Prediction | Ching-Lin Lee et.al. | 2402.19002 | null |
2024-03-13 | On properties of effective topological complexity and effective Lusternik-Schnirelmann category | Zbigniew Błaszczyk et.al. | 2402.18524 | null |
2024-02-29 | A Probabilistic Motion Model for Skid-Steer Wheeled Mobile Robot Navigation on Off-Road Terrains | Ananya Trivedi et.al. | 2402.18065 | link |
2024-03-03 | On the Parameterized Complexity of Motion Planning for Rectangular Robots | Iyad Kanj et.al. | 2402.17846 | null |
2024-02-26 | EEG classifier cross-task transfer to avoid training sessions in robot-assisted rehabilitation | Niklas Kueper et.al. | 2402.17790 | null |
2024-02-27 | Opening Cabinets and Drawers in the Real World using a Commodity Mobile Manipulator | Arjun Gupta et.al. | 2402.17767 | null |
2024-02-27 | Reducing Unnecessary Alerts in Pedestrian Protection Systems Based on P2V Communications | Ignacio Soto et.al. | 2402.17763 | null |
2024-03-08 | Backpropagation-Based Analytical Derivatives of EKF Covariance for Active Sensing | Jonas Benhamou et.al. | 2402.17569 | null |
2024-02-27 | RACP: Risk-Aware Contingency Planning with Multi-Modal Predictions | Khaled A. Mustafa et.al. | 2402.17387 | null |
2024-02-27 | SocialCVAE: Predicting Pedestrian Trajectory via Interaction Conditioned Latents | Wei Xiang et.al. | 2402.17339 | link |
2024-03-24 | SwarmPRM: Probabilistic Roadmap Motion Planning for Large-Scale Swarm Robotic Systems | Yunze Hu et.al. | 2402.16699 | null |
2024-03-15 | Risk-Aware Non-Myopic Motion Planner for Large-Scale Robotic Swarm Using CVaR Constraints | Xuru Yang et.al. | 2402.16690 | null |
2024-02-26 | Trajectory Prediction for Autonomous Driving Using a Transformer Network | Zhenning Li et.al. | 2402.16501 | null |
2024-02-23 | Homeostatic motion planning with innate physics knowledge | Giulia Lafratta et.al. | 2402.15384 | null |
2024-03-13 | Neural Implicit Swept Volume Models for Fast Collision Detection | Dominik Joho et.al. | 2402.15281 | null |
2024-02-22 | Path Planning based on 2D Object Bounding-box | Yanliang Huang et.al. | 2402.14933 | null |
2024-02-22 | Learning Inverse Kinodynamics for Autonomous Vehicle Drifting | M. Suvarna et.al. | 2402.14928 | link |
2024-02-22 | RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation | Junting Chen et.al. | 2402.14623 | null |
2024-04-03 | Quaternion recurrent neural network with real-time recurrent learning and maximum correntropy criterion | Pauline Bourigault et.al. | 2402.14227 | null |
2024-02-21 | Towards Contact-Aided Motion Planning for Tendon-Driven Continuum Robots | Priyanka Rao et.al. | 2402.14175 | null |
2024-02-23 | Blending Data-Driven Priors in Dynamic Games | Justin Lidard et.al. | 2402.14174 | null |
2024-02-21 | Driving Towards Stability and Efficiency: A Variable Time Gap Strategy for Adaptive Cruise Control | Shaimaa K. El-Baklish et.al. | 2402.14110 | null |
2024-02-20 | A Recurrent Neural Network Enhanced Unscented Kalman Filter for Human Motion Prediction | Wansong Liu et.al. | 2402.13045 | null |
2024-02-20 | Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept | Kui Wang et.al. | 2402.12682 | null |
2024-02-19 | Mixed Gaussian Flow for Diverse Trajectory Prediction | Jiahe Chen et.al. | 2402.12238 | link |
2024-03-04 | From Reals to Logic and Back: Inventing Symbolic Vocabularies, Actions, and Models for Planning from Raw Data | Naman Shah et.al. | 2402.11871 | null |
2024-02-19 | Decentralized Lifelong Path Planning for Multiple Ackerman Car-Like Robots | Teng Guo et.al. | 2402.11767 | null |
2024-04-07 | GenAD: Generative End-to-End Autonomous Driving | Wenzhao Zheng et.al. | 2402.11502 | link |
2024-02-18 | Verifiably Following Complex Robot Instructions with Foundation Models | Benedict Quartey et.al. | 2402.11498 | null |
2024-02-15 | Towards Tight Convex Relaxations for Contact-Rich Manipulation | Bernhard P. Graesdal et.al. | 2402.10312 | link |
2024-02-03 | Simulation-based Analysis of a Novel Loop-based Road Topology for Autonomous Vehicles | Stefan Ramdhan et.al. | 2402.10226 | null |
2024-02-15 | Pheno-Robot: An Auto-Digital Modelling System for In-Situ Phenotyping in the Field | Yaoqiang Pan et.al. | 2402.09685 | null |
2024-05-15 | Conformalized Adaptive Forecasting of Heterogeneous Trajectories | Yanfei Zhou et.al. | 2402.09623 | link |
2024-02-16 | Auto-Encoding Bayesian Inverse Games | Xinjie Liu et.al. | 2402.08902 | null |
2024-02-13 | Safe Planning for Articulated Robots Using Reachability-based Obstacle Avoidance With Spheres | Jonathan Michaux et.al. | 2402.08857 | null |
2024-04-26 | AMEND: A Mixture of Experts Framework for Long-tailed Trajectory Prediction | Ray Coden Mercurius et.al. | 2402.08698 | null |
2024-05-16 | Provable Traffic Rule Compliance in Safe Reinforcement Learning on the Open Sea | Hanna Krasowski et.al. | 2402.08502 | null |
2024-02-13 | MetaTra: Meta-Learning for Generalized Trajectory Prediction in Unseen Domain | Xiaohe Li et.al. | 2402.08221 | null |
2024-02-29 | Inherent Diverse Redundant Safety Mechanisms for AI-based Software Elements in Automotive Applications | Mandar Pitale et.al. | 2402.08208 | null |
2024-05-14 | VistaScenario: Interaction Scenario Engineering for Vehicles with Intelligent Systems for Transport Automation | Cheng Chang et.al. | 2402.07720 | null |
2024-02-12 | DART: A Compact Platform For Autonomous Driving Research | Lorenzo Lyons et.al. | 2402.07602 | null |
2024-02-11 | Geometric and topological properties of manifolds in robot motion planning | Stephan Mescher et.al. | 2402.07265 | null |
2024-04-23 | UVTM: Universal Vehicle Trajectory Modeling with ST Feature Domain Generation | Yan Lin et.al. | 2402.07232 | link |
2024-03-13 | ASAP-MPC: An Asynchronous Update Scheme for Online Motion Planning with Nonlinear Model Predictive Control | Dries Dirckx et.al. | 2402.06263 | null |
2024-02-09 | CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models | Longchao Da et.al. | 2402.06127 | link |
2024-02-08 | A versatile robotic hand with 3D perception, force sensing for autonomous manipulation | Nikolaus Correll et.al. | 2402.06018 | link |
2024-04-10 | Driving Everywhere with Large Language Model Policy Adaptation | Boyi Li et.al. | 2402.05932 | null |
2024-02-08 | On Experimental Emulation of Printability and Fleet Aware Generic Mesh Decomposition for Enabling Aerial 3D Printing | Marios-Nektarios Stamatopoulos et.al. | 2402.05853 | null |
2024-02-08 | Real-World Robot Applications of Foundation Models: A Review | Kento Kawaharazuka et.al. | 2402.05741 | null |
2024-02-09 | An Optimal Control Formulation of Tool Affordance Applied to Impact Tasks | Boyang Ti et.al. | 2402.05502 | null |
2024-02-07 | Safe Human-UAS Collaboration Abstraction | Hossein Rastgoftar et.al. | 2402.05277 | null |
2024-02-07 | Exploration Without Maps via Zero-Shot Out-of-Distribution Deep Reinforcement Learning | Shathushan Sivashangaran et.al. | 2402.05066 | null |
2024-02-07 | Smooth real-time motion planning based on a cascade dual-quaternion screw-geometry MPC | Ainoor Teimoorzadeh et.al. | 2402.05037 | null |
2024-02-07 | Entanglement Definitions for Tethered Robots: Exploration and Analysis | Gianpietro Battocletti et.al. | 2402.04909 | null |
2024-02-07 | Hierarchical Motion Planning and Offline Robust Model Predictive Control for Autonomous Vehicles | Hung Duy Nguyen et.al. | 2402.04769 | null |
2024-03-08 | Human Observation-Inspired Trajectory Prediction for Autonomous Driving in Mixed-Autonomy Traffic Environments | Haicheng Liao et.al. | 2402.04318 | link |
2024-02-06 | Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting | Yiming Xu et.al. | 2402.03981 | null |
2024-04-10 | Prediction Horizon Requirements for Automated Driving: Optimizing Safety, Comfort, and Efficiency | Manuel Muñoz Sánchez et.al. | 2402.03893 | null |
2024-02-05 | Efficient and Interpretable Traffic Destination Prediction using Explainable Boosting Machines | Yasin Yousif et.al. | 2402.03457 | link |
2024-04-30 | Risk-Aware MPC for Stochastic Systems with Runtime Temporal Logics | Maico H. W. Engelaar et.al. | 2402.03165 | link |
2024-02-04 | SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving | Lu Zhang et.al. | 2402.02519 | link |
2024-02-04 | Robot Trajectron: Trajectory Prediction-based Shared Control for Robot Manipulation | Pinhao Song et.al. | 2402.02499 | link |
2024-02-04 | Hybrid-Prediction Integrated Planning for Autonomous Driving | Haochen Liu et.al. | 2402.02426 | null |
2024-02-03 | Data-Driven Prediction of Seismic Intensity Distributions Featuring Hybrid Classification-Regression Models | Koyu Mizutani et.al. | 2402.02150 | link |
2024-02-02 | A GP-based Robust Motion Planning Framework for Agile Autonomous Robot Navigation and Recovery in Unknown Environments | Nicholas Mohammad et.al. | 2402.01617 | null |
2024-02-02 | Hyperparameter tuning via trajectory predictions: Stochastic prox-linear methods in matrix sensing | Mengqi Lou et.al. | 2402.01599 | null |
2024-02-02 | Equivariant topological complexities | Mark Grant et.al. | 2402.01540 | null |
2024-02-02 | A Reinforcement Learning-Boosted Motion Planning Framework: Comprehensive Generalization Performance in Autonomous Driving | Rainer Trauth et.al. | 2402.01465 | link |
2024-04-20 | A survey on robustness in trajectory prediction for autonomous vehicles | Jeroen Hagenus et.al. | 2402.01397 | null |
2024-04-09 | CC-VPSTO: Chance-Constrained Via-Point-based Stochastic Trajectory Optimisation for Safe and Efficient Online Robot Motion Planning | Lara Brudermüller et.al. | 2402.01370 | null |
2024-02-11 | Neural Trajectory Model: Implicit Neural Trajectory Representation for Trajectories Generation | Zihan Yu et.al. | 2402.01254 | link |
2024-04-29 | Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions | Hansung Kim et.al. | 2402.01116 | link |
2024-04-17 | Distance and Collision Probability Estimation from Gaussian Surface Models | Kshitij Goel et.al. | 2402.00186 | null |
2024-01-25 | Design and Implementation of Hardware Accelerators for Neural Processing Applications | Shilpa Mayannavar et.al. | 2402.00051 | null |
2024-01-30 | Multi-FLEX: An Automatic Task Sequence Execution Framework to Enable Reactive Motion Planning for Multi-Robot Applications | Gaurav Misra et.al. | 2401.17214 | null |
2024-01-30 | Multi-Camera Asynchronous Ball Localization and Trajectory Prediction with Factor Graphs and Human Poses | Qingyu Xiao et.al. | 2401.17185 | null |
2024-01-29 | Leveraging Public Cloud Infrastructure for Real-time Connected Vehicle Speed Advisory at a Signalized Corridor | Hsien-Wen Deng et.al. | 2401.16545 | null |
2024-01-29 | FIMP: Future Interaction Modeling for Multi-Agent Motion Prediction | Sungmin Woo et.al. | 2401.16189 | null |
2024-01-29 | Decentralized Robust Data-driven Predictive Control for Smoothing Mixed Traffic Flow | Xu Shang et.al. | 2401.15826 | link |
2024-01-28 | Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data | Young-Jae Park et.al. | 2401.15726 | null |
2024-01-28 | Design of UAV flight state recognition and trajectory prediction system based on trajectory feature construction | Xingyu Zhou et.al. | 2401.15564 | null |
2024-01-26 | Overview of Sensing Attacks on Autonomous Vehicle Technologies and Impact on Traffic Flow | Zihao Li et.al. | 2401.15193 | null |
2024-01-26 | Fast Long-Term Multi-Scenario Prediction for Maneuver Planning at Unsignalized Intersections | Max Bastian Mertens et.al. | 2401.14879 | null |
2024-01-23 | Workspace Optimization Techniques to Improve Prediction of Human Motion During Human-Robot Collaboration | Yi-Shiuan Tung et.al. | 2401.12965 | null |
2024-01-23 | Control-Aware Trajectory Predictions for Communication-Efficient Drone Swarm Coordination in Cluttered Environments | Longhao Yan et.al. | 2401.12852 | null |
2024-01-22 | Towards a prioritised use of transportation infrastructures: the case of vehicle-specific dynamic access restrictions to city centres | Holger Billhardt et.al. | 2401.12329 | null |
2024-01-22 | Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation | Jiachen Li et.al. | 2401.12275 | null |
2024-03-03 | Adaptive Motion Planning for Multi-fingered Functional Grasp via Force Feedback | Dongying Tian et.al. | 2401.11977 | null |
2024-01-22 | A Comparative Study of Numerical Methods for Approximating the Solutions of a Macroscopic Automated-Vehicle Traffic Flow Model | George Titakis et.al. | 2401.11787 | null |
2024-01-21 | Self-Supervised Bird’s Eye View Motion Prediction with Cross-Modality Signals | Shaoheng Fang et.al. | 2401.11499 | link |
2024-01-21 | Towards Non-Robocentric Dynamic Landing of Quadrotor UAVs | Li-Yu Lo et.al. | 2401.11445 | link |
2024-01-18 | Hacking Predictors Means Hacking Cars: Using Sensitivity Analysis to Identify Trajectory Prediction Vulnerabilities for Autonomous Driving Security | Marsalis Gibson et.al. | 2401.10313 | null |
2024-01-22 | TEXterity: Tactile Extrinsic deXterity | Antonia Bronars et.al. | 2401.10230 | null |
2024-01-18 | Robotic Test Tube Rearrangement Using Combined Reinforcement Learning and Motion Planning | Hao Chen et.al. | 2401.09772 | null |
2024-01-17 | Biased-MPPI: Informing Sampling-Based Model Predictive Control by Fusing Ancillary Controllers | Elia Trevisan et.al. | 2401.09241 | null |
2024-01-17 | Improved Consensus ADMM for Cooperative Motion Planning of Large-Scale Connected Autonomous Vehicles with Limited Communication | Haichao Liu et.al. | 2401.09032 | null |
2024-03-16 | PINSAT: Parallelized Interleaving of Graph Search and Trajectory Optimization for Kinodynamic Motion Planning | Ramkumar Natarajan et.al. | 2401.08948 | null |
2024-01-16 | Centralized vs. Decoupled Dual-Arm Planning Taking into Account Path Quality | Jonas Wittmann et.al. | 2401.08443 | null |
2024-01-16 | CycLight: learning traffic signal cooperation with a cycle-level strategy | Gengyue Han et.al. | 2401.08121 | null |
2024-03-16 | Preprocessing-based Kinodynamic Motion Planning Framework for Intercepting Projectiles using a Robot Manipulator | Ramkumar Natarajan et.al. | 2401.08022 | null |
2024-01-15 | SSL-Interactions: Pretext Tasks for Interactive Trajectory Prediction | Prarthana Bhattacharyya et.al. | 2401.07729 | null |
2024-01-12 | EUSO-SPB1 Mission and Science | JEM-EUSO Collaboration et.al. | 2401.06525 | null |
2024-01-12 | Design and Nonlinear Modeling of a Modular Cable Driven Soft Robotic Arm | Xinda Qi et.al. | 2401.06377 | null |
2024-01-12 | Hyper-STTN: Social Group-aware Spatial-Temporal Transformer Network for Human Trajectory Prediction with Hypergraph Reasoning | Weizheng Wang et.al. | 2401.06344 | null |
2024-03-09 | VLP: Vision Language Planning for Autonomous Driving | Chenbin Pan et.al. | 2401.05577 | null |
2024-01-10 | Current Effect-eliminated Optimal Target Assignment and Motion Planning for a Multi-UUV System | Danjie Zhu et.al. | 2401.05521 | null |
2023-12-14 | Online Action Recognition for Human Risk Prediction with Anticipated Haptic Alert via Wearables | Cheng Guo et.al. | 2401.05365 | link |
2024-02-19 | AdvMT: Adversarial Motion Transformer for Long-term Human Motion Prediction | Sarmad Idrees et.al. | 2401.05018 | null |
2024-01-10 | Knowledge-aware Graph Transformer for Pedestrian Trajectory Prediction | Yu Liu et.al. | 2401.04872 | null |
2024-01-09 | A Payne-Whitham model of urban traffic networks in the presence of traffic lights and its application to traffic optimisation | Mauritz Cartier van Dissel et.al. | 2401.04436 | null |
2024-02-10 | Distributional Topological Complexity and LS-category | Alexander Dranishnikov et.al. | 2401.04272 | null |
2024-01-08 | Safe Chance-constrained Model Predictive Control under Gaussian Mixture Model Uncertainty | Kai Ren et.al. | 2401.03799 | null |
2024-01-07 | Disentangled Neural Relational Inference for Interpretable Motion Prediction | Victoria M. Dax et.al. | 2401.03599 | null |
2024-01-08 | Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction | Yuxin Yang et.al. | 2401.02916 | null |
2024-01-05 | iPolicy: Incremental Policy Algorithms for Feedback Motion Planning | Guoxiang Zhao et.al. | 2401.02883 | null |
2024-01-08 | Predicting Infant Brain Connectivity with Federated Multi-Trajectory GNNs using Scarce Data | Michalis Pistos et.al. | 2401.01383 | link |
2023-12-30 | Gridlock Models with the IBM Mega Traffic Simulator: Dependency on Vehicle Acceleration and Road Structure | Bruce G. Elmegreen et.al. | 2401.00882 | null |
2023-12-31 | Effect of Optimizer, Initializer, and Architecture of Hypernetworks on Continual Learning from Demonstration | Sayantan Auddy et.al. | 2401.00524 | link |
2023-12-31 | Controllable Safety-Critical Closed-loop Traffic Simulation via Guided Diffusion | Wei-Jer Chang et.al. | 2401.00391 | null |
2023-12-29 | Unified Task and Motion Planning using Object-centric Abstractions of Motion Constraints | Alejandro Agostini et.al. | 2312.17605 | null |
2023-12-28 | InsActor: Instruction-driven Physics-based Characters | Jiawei Ren et.al. | 2312.17135 | null |
2024-04-16 | Social-Transmotion: Promptable Human Trajectory Prediction | Saeed Saadatnejad et.al. | 2312.16168 | link |
2023-12-26 | Improving Transferability for Cross-domain Trajectory Prediction via Neural Stochastic Differential Equation | Daehee Park et.al. | 2312.15906 | link |
2023-12-26 | Attention-aware Social Graph Transformer Networks for Stochastic Trajectory Prediction | Yao Liu et.al. | 2312.15881 | null |
2023-12-22 | Traffic Reconstruction and Analysis of Natural Driving Behaviors at Unsignalized Intersections | Supriya Sarker et.al. | 2312.14561 | null |
2023-12-22 | AdapTraj: A Multi-Source Domain Generalization Framework for Multi-Agent Trajectory Prediction | Tangwen Qian et.al. | 2312.14394 | null |
2023-12-22 | Learning Socio-Temporal Graphs for Multi-Agent Trajectory Prediction | Yuke Li et.al. | 2312.14373 | null |
2023-12-21 | Modular Neural Network Policies for Learning In-Flight Object Catching with a Robot Hand-Arm System | Wenbin Hu et.al. | 2312.13987 | null |
2024-01-03 | Manipulating Trajectory Prediction with Backdoors | Kaouther Messaoud et.al. | 2312.13863 | null |
2024-01-10 | Optimizing Ego Vehicle Trajectory Prediction: The Graph Enhancement Approach | Sushil Sharma et.al. | 2312.13104 | null |
2023-12-20 | BEVSeg2TP: Surround View Camera Bird’s-Eye-View Based Joint Vehicle Segmentation and Ego Vehicle Trajectory Prediction | Sushil Sharma et.al. | 2312.13081 | null |
2023-12-19 | Path Planning for Continuum Rods Using Bernstein Surfaces | Maxwell Hammond et.al. | 2312.12333 | null |
2023-12-19 | Probabilistic Prediction of Longitudinal Trajectory Considering Driving Heterogeneity with Interpretability | Shuli Wang et.al. | 2312.12123 | null |
2023-12-19 | GazeMoDiff: Gaze-guided Diffusion Model for Stochastic Human Motion Prediction | Haodong Yan et.al. | 2312.12090 | null |
2024-01-18 | Adaptive Tracking and Perching for Quadrotor in Dynamic Scenarios | Yuman Gao et.al. | 2312.11866 | null |
2023-12-19 | GCNext: Towards the Unity of Graph Convolutions for Human Motion Prediction | Xinshun Wang et.al. | 2312.11850 | link |
2023-12-18 | Multiple Hypothesis Dropout: Estimating the Parameters of Multi-Modal Output Distributions | David D. Nguyen et.al. | 2312.11735 | link |
2023-12-18 | Energy-Aware Hierarchical Control of Joint Velocities | Jonas Wittmann et.al. | 2312.11163 | null |
2023-12-18 | Visualizing High-Dimensional Configuration Spaces For Robots: A Comprehensive Approach for Quantitative and Qualitative Analysis | Jorge Ocampo Jimenez et.al. | 2312.10918 | link |
2023-12-18 | Sharable Clothoid-based Continuous Motion Planning for Connected Automated Vehicles | Sanghoon Oh et.al. | 2312.10880 | null |
2023-12-17 | Multi-level Reasoning for Robotic Assembly: From Sequence Inference to Contact Selection | Xinghao Zhu et.al. | 2312.10571 | null |
2023-11-24 | Digital Twin Technology Enabled Proactive Safety Application for Vulnerable Road Users: A Real-World Case Study | Erik Rua et.al. | 2312.10041 | null |
2023-12-15 | nuScenes Knowledge Graph – A comprehensive semantic representation of traffic scenes for trajectory prediction | Leon Mlodzian et.al. | 2312.09676 | link |
2023-12-15 | EDA: Evolving and Distinct Anchors for Multimodal Motion Prediction | Longzhong Lin et.al. | 2312.09501 | link |
2023-11-26 | Enhancing Trajectory Prediction through Self-Supervised Waypoint Noise Prediction | Pranav Singh Chib et.al. | 2312.09466 | null |
2023-12-25 | DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving | Wenhai Wang et.al. | 2312.09245 | link |
2023-12-14 | Safe Motion Planning for Quadruped Robots Using Density Functions | Sriram S. K. S Narayanan et.al. | 2312.09173 | link |
2023-12-14 | Optimal Motion Planning using Finite Fourier Series in a Learning-based Collision Field | Feng Yichang et.al. | 2312.09073 | null |
2023-12-14 | Bayes Net based highbrid Monte Carlo Optimization for Redundant Manipulator | Feng Yichang et.al. | 2312.09024 | null |
2023-12-14 | Motion Flow Matching for Human Motion Synthesis and Editing | Vincent Tao Hu et.al. | 2312.08895 | null |
2023-12-14 | Motion Planning and Control of Hybrid Flying-Crawling Quadrotors | Dongnan Hu et.al. | 2312.08718 | null |
2023-12-14 | Versatile Telescopic-Wheeled-Legged Locomotion of Tachyon 3 via Full-Centroidal Nonlinear Model Predictive Control | Sotaro Katayama et.al. | 2312.08668 | null |
2023-12-13 | G-MEMP: Gaze-Enhanced Multimodal Ego-Motion Prediction in Driving | M. Eren Akbiyik et.al. | 2312.08558 | null |
2023-12-13 | Adaptive Robot Coordination: A Subproblem-based Approach for Hybrid Multi-Robot Motion Planning | Irving Solis et.al. | 2312.08554 | null |
2024-03-27 | World Models via Policy-Guided Trajectory Diffusion | Marc Rigter et.al. | 2312.08533 | link |
2023-12-13 | A Survey of Generative AI for Intelligent Transportation Systems | Huan Yan et.al. | 2312.08248 | null |
2023-12-14 | Semi-Supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix | Kewei Wang et.al. | 2312.08009 | link |
2023-12-12 | Scalarizing Multi-Objective Robot Planning Problems using Weighted Maximization | Nils Wilde et.al. | 2312.07227 | null |
2023-12-16 | The Parameterized Complexity of Coordinated Motion Planning | Eduard Eiben et.al. | 2312.07144 | null |
2023-12-12 | Motion Planning and Control of A Morphing Quadrotor in Restricted Scenarios | Guiyang Cui et.al. | 2312.07075 | null |
2023-12-11 | Adaptive Human Trajectory Prediction via Latent Corridors | Neerja Thakkar et.al. | 2312.06653 | null |
2023-12-15 | BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving | Haicheng Liao et.al. | 2312.06371 | link |
2023-12-11 | Interpretable Long Term Waypoint-Based Trajectory Prediction Model | Amina Ghoul et.al. | 2312.06219 | null |
2023-12-11 | Recent Advances in Deterministic Human Motion Prediction: A Review | Tenghao Deng et.al. | 2312.06184 | null |
2023-12-11 | Motion Planning for Multiple Mobile Manipulator System in Complex Flipping Manipulation | Wenhang Liu et.al. | 2312.06168 | null |
2023-12-10 | Graph-based Prediction and Planning Policy Network (GP3Net) for scalable self-driving in dynamic environments using Deep Reinforcement Learning | Jayabrata Chowdhury et.al. | 2312.05784 | null |
2023-12-10 | Minimum-Time Trajectory Optimization With Data-Based Models: A Linear Programming Approach | Nan Li et.al. | 2312.05724 | null |
2023-12-07 | Image and AIS Data Fusion Technique for Maritime Computer Vision Applications | Emre Gülsoylu et.al. | 2312.05270 | link |
2023-12-08 | Kraken: enabling joint trajectory prediction by utilizing Mode Transformer and Greedy Mode Processing | Daniil S. Antonenko et.al. | 2312.05144 | null |
2023-12-08 | An Autonomous Driving model with BEV-V2X Perception, Trajectory Prediction and Driving Planning in Complex Traffic Intersections | Fukang Li et.al. | 2312.05104 | null |
2023-12-08 | Synthesizing Traffic Datasets using Graph Neural Networks | Daniel Rodriguez-Criado et.al. | 2312.05031 | link |
2023-11-15 | Harnessing LSTM for Nonlinear Ship Deck Motion Prediction in UAV Autonomous Landing amidst High Sea States | Feifan Yu et.al. | 2312.04572 | null |
2023-12-07 | GSGFormer: Generative Social Graph Transformer for Multimodal Pedestrian Trajectory Prediction | Zhongchang Luo et.al. | 2312.04479 | null |
2023-12-06 | Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning | Xinshun Wang et.al. | 2312.03703 | link |
2023-12-06 | Cooperative Probabilistic Trajectory Forecasting under Occlusion | Anshul Nayak et.al. | 2312.03296 | null |
2023-12-05 | Role of Uncertainty in Anticipatory Trajectory Prediction for a Ping-Pong Playing Robot | Nima Rahmanian et.al. | 2312.03024 | null |
2024-03-06 | D-LGP: Dynamic Logic-Geometric Program for Reactive Task and Motion Planning | Teng Xue et.al. | 2312.02731 | null |
2024-02-05 | MGTR: Multi-Granular Transformer for Motion Prediction with LiDAR | Yiqian Gan et.al. | 2312.02409 | null |
2023-12-04 | Multi-Modal MPPI and Active Inference for Reactive Task and Motion Planning | Yuezhe Zhang et.al. | 2312.02328 | null |
2023-12-03 | Deeper into Self-Supervised Monocular Indoor Depth Estimation | Chao Fan et.al. | 2312.01283 | link |
2023-12-02 | Vehicle path and traffic flow optimization via lane changing of automated or semi-automated vehicles on motorways | Antonios Georgantas et.al. | 2312.01193 | null |
2023-12-02 | Swarm-GPT: Combining Large Language Models with Safe Motion Planning for Robot Choreography Design | Aoran Jiao et.al. | 2312.01059 | null |
2024-02-14 | Extrapolatable Transformer Pre-training for Ultra Long Time-Series Forecasting | Ziyang Song et.al. | 2312.00817 | null |
2023-12-01 | A bilevel optimal motion planning (BOMP) model with application to autonomous parking | Shenglei Shi et.al. | 2312.00314 | null |
2023-11-30 | Heterogeneous Graph-based Trajectory Prediction using Local Map Context and Social Interactions | Daniel Grimm et.al. | 2311.18553 | null |
2023-11-30 | Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent | Yuxiao Chen et.al. | 2311.18307 | null |
2023-11-30 | S-T CRF: Spatial-Temporal Conditional Random Field for Human Trajectory Prediction | Pengqian Han et.al. | 2311.18198 | null |
2023-11-29 | STF: Spatial Temporal Fusion for Trajectory Prediction | Pengqian Han et.al. | 2311.18149 | link |
2023-11-29 | Deep Reinforcement Learning Graphs: Feedback Motion Planning via Neural Lyapunov Verification | Armin Ghanbarzadeh et.al. | 2311.17587 | null |
2023-11-29 | Dynamic Dense Graph Convolutional Network for Skeleton-based Human Motion Prediction | Xinshun Wang et.al. | 2311.17408 | null |
2023-12-25 | Generative Hierarchical Temporal Transformer for Hand Action Recognition and Motion Prediction | Yilin Wen et.al. | 2311.17366 | null |
2023-11-27 | A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning | Jianxiong Li et.al. | 2311.15920 | null |
2023-11-27 | Sparse Pedestrian Character Learning for Trajectory Prediction | Yonghao Dong et.al. | 2311.15512 | null |
2024-01-25 | IA-LSTM: Interaction-Aware LSTM for Pedestrian Trajectory Prediction | Yuehai Chen et.al. | 2311.15193 | null |
2023-11-25 | GBD-TS: Goal-based Pedestrian Trajectory Prediction with Diffusion using Tree Sampling Algorithm | Ge Sun et.al. | 2311.14922 | null |
2024-03-08 | Automated Lane Merging via Game Theory and Branch Model Predictive Control | Luyao Zhang et.al. | 2311.14916 | link |
2023-11-24 | Uncertainties in Robust Planning and Control of Autonomous Tractor-Trailer Vehicles | Theodor Westny et.al. | 2311.14573 | null |
2023-11-24 | Receding Horizon Optimization with PPUM: An Approach for Autonomous Robot Path Planning in Uncertain Environments | Zijian Ge et.al. | 2311.14411 | null |
2023-11-24 | Offline Skill Generalization via Task and Motion Planning | Shin Watanabe et.al. | 2311.14328 | null |
2024-03-09 | Multi-Agent Motion Planning with Bézier Curve Optimization under Kinodynamic Constraints | Jingtian Yan et.al. | 2311.14145 | link |
2023-11-23 | Dynamic Compositional Graph Convolutional Network for Efficient Composite Human Motion Prediction | Wanying Zhang et.al. | 2311.13781 | link |
2023-11-23 | Trace-enabled Timing Model Synthesis for ROS2-based Autonomous Applications | Hazem Abaza et.al. | 2311.13333 | null |
2023-12-23 | A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs | Jiageng Zhong et.al. | 2311.12893 | null |
2023-11-21 | Total Turning and Motion Range Prediction for Safe Unicycle Control | Abdulla Tarshahani et.al. | 2311.12532 | null |
2023-11-21 | A Random Walk Approach for Simulation-Based Continuous Dynamic Traffic Assignment | Kaveh Khoshkhah et.al. | 2311.12440 | null |
2023-11-21 | Joint-Space Multi-Robot Motion Planning with Learned Decentralized Heuristics | Fengze Xie et.al. | 2311.12385 | null |
2023-11-20 | Teaching Robots to Build Simulations of Themselves | Yuhang Hu et.al. | 2311.12151 | null |
2023-11-20 | SeaDSC: A video-based unsupervised method for dynamic scene change detection in unmanned surface vehicles | Linh Trinh et.al. | 2311.11580 | null |
2023-11-17 | Path Planning in 3D with Motion Primitives for Wind Energy-Harvesting Fixed-Wing Aircraft | Seung-Keol Ryu et.al. | 2311.10915 | null |
2023-11-27 | A Language Agent for Autonomous Driving | Jiageng Mao et.al. | 2311.10813 | link |
2023-11-17 | Minimum Star Partitions of Simple Polygons in Polynomial Time | Mikkel Abrahamsen et.al. | 2311.10631 | null |
2023-11-17 | Human motion trajectory prediction using the Social Force Model for real-time and low computational cost applications | Oscar Gil et.al. | 2311.10582 | null |
2023-11-16 | Hypergraph-based Multi-robot Motion Planning with Topological Guidance | Courtney McBeth et.al. | 2311.10176 | null |
2024-03-10 | Robust Conformal Prediction for STL Runtime Verification under Distribution Shift | Yiqi Zhao et.al. | 2311.09482 | link |
2023-10-17 | Neural Packing: from Visual Sensing to Reinforcement Learning | Juzhan Xu et.al. | 2311.09233 | null |
2023-11-15 | Brain Functional Connectivity under Teleoperation Latency: a fNIRS Study | Yang Ye et.al. | 2311.09062 | null |
2023-11-15 | Edge Accelerated Robot Navigation with Hierarchical Motion Planning | Guoliang Li et.al. | 2311.08983 | null |
2023-11-14 | Cooperative Bidirectional Mixed-Traffic Overtaking | Faizan M. Tariq et.al. | 2311.08491 | null |
2023-11-14 | Purpose in the Machine: Do Traffic Simulators Produce Distributionally Equivalent Outcomes for Reinforcement Learning Applications? | Rex Chen et.al. | 2311.08429 | null |
2023-11-14 | Speeding Up Optimization-based Motion Planning through Deep Learning | Johannes Tenhumberg et.al. | 2311.08345 | null |
2023-11-14 | Calibration of an Elastic Humanoid Upper Body and Efficient Compensation for Motion Planning | Johannes Tenhumberg et.al. | 2311.08333 | null |
2023-11-29 | DeepEMplanner: An End-to-End EM Motion Planner with Iterative Interactions | Zhili Chen et.al. | 2311.08100 | link |
2023-11-14 | CPSOR-GCN: A Vehicle Trajectory Prediction Method Powered by Emotion and Cognitive Theory | L. Tang et.al. | 2311.08086 | null |
2023-11-14 | VegaEdge: Edge AI Confluence Anomaly Detection for Real-Time Highway IoT-Applications | Vinit Katariya et.al. | 2311.07880 | null |
2024-02-08 | VT-Former: A Transformer-based Vehicle Trajectory Prediction Approach For Intelligent Highway Transportation Systems | Armin Danesh Pazho et.al. | 2311.06623 | null |
2023-10-03 | A Co-Simulation Study to Assess the Impacts of Connected and Autonomous Vehicles on Traffic Flow Stability during Hurricane Evacuation | Zaheen E Muktadi Syed et.al. | 2311.06267 | null |
2023-11-10 | Efficient Learning of Fast Inverse Kinematics with Collision Avoidance | Johannes Tenhumberg et.al. | 2311.05938 | null |
2023-11-10 | Interactive Motion Planning for Autonomous Vehicles via Adaptive Interactive MPC | Viranjan Bhattacharyya et.al. | 2311.05810 | link |
2023-11-09 | FogROS2-Sky: Optimizing Latency and Cost for Multi-Cloud Robot Applications | Kaiyuan Chen et.al. | 2311.05600 | null |
2023-11-09 | Improving Human Legibility in Collaborative Robot Tasks through Augmented Reality and Workspace Preparation | Yi-Shiuan Tung et.al. | 2311.05562 | null |
2023-11-09 | TLCFuse: Temporal Multi-Modality Fusion Towards Occlusion-Aware Semantic Segmentation-Aided Motion Planning | Gustavo Salazar-Gomez et.al. | 2311.05319 | null |
2023-11-09 | Latent Task-Specific Graph Network Simulators | Philipp Dahlinger et.al. | 2311.05256 | link |
2023-11-08 | Social Motion Prediction with Cognitive Hierarchies | Wentao Zhu et.al. | 2311.04726 | null |
2023-11-08 | FFINet: Future Feedback Interaction Network for Motion Forecasting | Miao Kang et.al. | 2311.04512 | null |
2023-11-07 | Active Collision Avoidance System for E-Scooters in Pedestrian Environment | Xuke Yan et.al. | 2311.04383 | null |
2023-11-06 | iDb-A*: Iterative Search and Optimization for Optimal Kinodynamic Motion Planning | Joaquim Ortiz-Haro et.al. | 2311.03553 | link |
2024-02-16 | IR-STP: Enhancing Autonomous Driving with Interaction Reasoning in Spatio-Temporal Planning | Yingbing Chen et.al. | 2311.02850 | link |
2023-11-06 | Flexible Multi-Generator Model with Fused Spatiotemporal Graph for Trajectory Prediction | Peiyuan Zhu et.al. | 2311.02835 | null |
2023-11-04 | OSM vs HD Maps: Map Representations for Trajectory Prediction | Jing-Yan Liao et.al. | 2311.02305 | null |
2023-11-03 | Second-Order Convergent Collision-Constrained Optimization-Based Planner | Chen Liang et.al. | 2311.01717 | null |
2023-11-02 | Variable Selection in Maximum Mean Discrepancy for Interpretable Distribution Comparison | Kensuke Mitsuzawa et.al. | 2311.01537 | null |
2023-11-02 | NOD-TAMP: Multi-Step Manipulation Planning with Neural Object Descriptors | Shuo Cheng et.al. | 2311.01530 | null |
2023-11-13 | RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation | Yufei Wang et.al. | 2311.01455 | null |
2023-11-02 | Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation | Jay Sarva et.al. | 2311.01446 | null |
2023-11-02 | Learning Realistic Traffic Agents in Closed-loop | Chris Zhang et.al. | 2311.01394 | null |
2023-11-01 | An efficient tangent based topologically distinctive path finding for grid maps | Zhuo Yao et.al. | 2311.00853 | null |
2023-11-01 | Constant-time Motion Planning with Anytime Refinement for Manipulation | Itamar Mishani et.al. | 2311.00837 | null |
2023-11-01 | PIAug – Physics Informed Augmentation for Learning Vehicle Dynamics for Off-Road Navigation | Parv Maheshwari et.al. | 2311.00815 | null |
2023-10-31 | Large-Scale Multi-Robot Assembly Planning for Autonomous Manufacturing | Kyle Brown et.al. | 2311.00192 | link |
2023-10-31 | Safe multi-agent motion planning under uncertainty for drones using filtered reinforcement learning | Sleiman Safaoui et.al. | 2311.00063 | null |
2024-01-03 | Modeling multi-legged robot locomotion with slipping and its experimental validation | Ziyou Wu et.al. | 2310.20669 | null |
2023-10-31 | Near-Optimal Min-Sum Motion Planning for Two Square Robots in a Polygonal Environment | Pankaj K. Agarwal et.al. | 2310.20615 | null |
2024-02-14 | Learning Lyapunov-Stable Polynomial Dynamical Systems Through Imitation | Amin Abyaneh et.al. | 2310.20605 | link |
2023-10-31 | STDA-Meta: A Meta-Learning Framework for Few-Shot Traffic Prediction | Maoxiang Sun et.al. | 2310.20223 | null |
2023-11-26 | GraphTransformers for Geospatial Forecasting of Hurricane Trajectories | Pallavi Banerjee et.al. | 2310.20174 | null |
2023-11-01 | Decision-Making for Autonomous Vehicles with Interaction-Aware Behavioral Prediction and Social-Attention Neural Network | Xiao Li et.al. | 2310.20148 | null |
2023-10-30 | GG-LLM: Geometrically Grounding Large Language Models for Zero-shot Human Activity Forecasting in Human-Aware Task Planning | Moritz A. Graule et.al. | 2310.20034 | null |
2024-02-27 | Conditional Unscented Autoencoders for Trajectory Prediction | Faris Janjoš et.al. | 2310.19944 | link |
2024-02-28 | Large Trajectory Models are Scalable Motion Predictors and Planners | Qiao Sun et.al. | 2310.19620 | link |
2023-10-30 | Rule-Based Lloyd Algorithm for Multi-Robot Motion Planning and Control with Safety and Convergence Guarantees | Manuel Boldrer et.al. | 2310.19511 | link |
2023-10-28 | Triplet Attention Transformer for Spatiotemporal Predictive Learning | Xuesong Nie et.al. | 2310.18698 | null |
2023-11-22 | Interactive Joint Planning for Autonomous Vehicles | Yuxiao Chen et.al. | 2310.18301 | null |
2023-10-27 | Decision-theoretic MPC: Motion Planning with Weighted Maneuver Preferences Under Uncertainty | Ömer Şahin Taş et.al. | 2310.17963 | null |
2023-10-26 | 6-DoF Stability Field via Diffusion Models | Takuma Yoneda et.al. | 2310.17649 | null |
2023-11-02 | Detection Defenses: An Empty Promise against Adversarial Patch Attacks on Optical Flow | Erik Scheurer et.al. | 2310.17403 | link |
2023-10-25 | Toward the use of proxies for efficient learning manipulation and locomotion strategies on soft robots | Etienne Ménager et.al. | 2310.17029 | null |
2023-10-25 | Using Knowledge Awareness to improve Safety of Autonomous Driving | Andrea Calvagna et.al. | 2310.16760 | null |
2024-02-23 | Certifying Bimanual RRT Motion Plans in a Second | Alexandre Amice et.al. | 2310.16603 | link |
2023-10-25 | Topological Complexity Related To Multi-Valued Functions | Melih İs et.al. | 2310.16422 | null |
2023-10-25 | Neural Potential Field for Obstacle-Aware Local Motion Planning | Muhammad Alhaddad et.al. | 2310.16362 | link |
2023-10-24 | Human-in-the-Loop Task and Motion Planning for Imitation Learning | Ajay Mandlekar et.al. | 2310.16014 | null |
2023-11-23 | Data-driven Traffic Simulation: A Comprehensive Review | Di Chen et.al. | 2310.15975 | null |
2023-10-24 | Graph-based Trajectory Prediction with Cooperative Information | Jan Strohbeck et.al. | 2310.15692 | null |
2023-10-23 | Parallel Quantum Rapidly-Exploring Random Trees | Paul Lathrop et.al. | 2310.15303 | null |
2024-02-06 | Orientation-Aware Leg Movement Learning for Action-Driven Human Motion Prediction | Chunzhi Gu et.al. | 2310.14907 | null |
2023-10-30 | Generalized Multi-Level Replanning TAMP Framework for Dynamic Environment | Tao Lin et.al. | 2310.14816 | null |
2023-10-23 | End-to-End Learning of Behavioural Inputs for Autonomous Driving in Dense Traffic | Jatan Shrestha et.al. | 2310.14766 | link |
2023-10-23 | DICE: Diverse Diffusion Model with Scoring for Trajectory Prediction | Younwoo Choi et.al. | 2310.14570 | null |
2023-10-22 | Motion Planning for Autonomous Ground Vehicles Using Artificial Potential Fields: A Review | Aziz ur Rehman et.al. | 2310.14339 | null |
2023-10-21 | Robust NOMA-assisted OTFS-ISAC Network Design with 3D Motion Prediction Topology | Luping Xiang et.al. | 2310.13984 | null |
2023-10-21 | Equivariant Map and Agent Geometry for Autonomous Driving Motion Prediction | Yuping Wang et.al. | 2310.13922 | null |
2023-10-19 | Closed-Loop Motion Planning for Differentially Flat Systems: A Time-Varying Optimization Framework | Tianqi Zheng et.al. | 2310.13090 | null |
2023-10-19 | NeuroSMPC: A Neural Network guided Sampling Based MPC for On-Road Autonomous Driving | Kaustab Pal et.al. | 2310.13077 | null |
2023-10-19 | Creative Robot Tool Use with Large Language Models | Mengdi Xu et.al. | 2310.13065 | null |
2023-10-19 | Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative Pose Encoding | Zhejun Zhang et.al. | 2310.12970 | link |
2023-10-19 | Local Non-Cooperative Games with Principled Player Selection for Scalable Motion Planning | Makram Chahine et.al. | 2310.12958 | null |
2023-10-19 | A Markovian dynamics for $C. elegans$ behavior across scales | Antonio C. Costa et.al. | 2310.12883 | link |
2023-10-19 | The origins of unpredictability in life trajectory prediction tasks | Ian Lundberg et.al. | 2310.12871 | null |
2023-10-19 | Flexible Informed Trees (FIT*): Adaptive Batch-Size Approach for Informed Sampling-Based Planner | Liding Zhang et.al. | 2310.12828 | null |
2023-10-19 | Multi-Robot Local Motion Planning Using Dynamic Optimization Fabrics | Saray Bakker et.al. | 2310.12816 | link |
2023-10-19 | Multiscale Motion-Aware and Spatial-Temporal-Channel Contextual Coding Network for Learned Video Compression | Yiming Wang et.al. | 2310.12733 | null |
2024-02-12 | Denoising Heat-inspired Diffusion with Insulators for Collision Free Motion Planning | Junwoo Chang et.al. | 2310.12609 | null |
2023-10-19 | CAT: Closed-loop Adversarial Training for Safe End-to-End Driving | Linrui Zhang et.al. | 2310.12432 | null |
2023-10-17 | Signal Temporal Logic-Guided Model Predictive Control for Robust Bipedal Locomotion Resilient to Runtime External Perturbations | Zhaoyuan Gu et.al. | 2310.11290 | null |
2023-10-17 | Self-Supervised 3D Scene Flow Estimation and Motion Prediction using Local Rigidity Prior | Ruibo Li et.al. | 2310.11284 | link |
2023-10-16 | Temporally Robust Multi-Agent STL Motion Planning in Continuous Time | Joris Verhagen et.al. | 2310.10585 | null |
2023-10-16 | A Novel Benchmarking Paradigm and a Scale- and Motion-Aware Model for Egocentric Pedestrian Trajectory Prediction | Amir Rasouli et.al. | 2310.10424 | null |
2023-10-16 | BEVGPT: Generative Pre-trained Large Model for Autonomous Driving Prediction, Decision-Making, and Planning | Pengqin Wang et.al. | 2310.10357 | null |
2024-02-06 | Multi-Body Neural Scene Flow | Kavisha Vidanapathirana et.al. | 2310.10301 | link |
2023-10-13 | AMSwarmX: Safe Swarm Coordination in CompleX Environments via Implicit Non-Convex Decomposition of the Obstacle-Free Space | Vivek K. Adajania et.al. | 2310.09195 | link |
2023-10-13 | Multi-Robot Geometric Task-and-Motion Planning for Collaborative Manipulation Tasks | Hejia Zhang et.al. | 2310.08802 | null |
2023-10-12 | An Experience-based TAMP Framework for Foliated Manifolds | Jiaming Hu et.al. | 2310.08494 | null |
2023-10-12 | Hilbert Space Embedding-based Trajectory Optimization for Multi-Modal Uncertain Obstacle Trajectory Prediction | Basant Sharma et.al. | 2310.08270 | link |
2023-10-12 | Multi-Modal Sensor Fusion and Object Tracking for Autonomous Racing | Phillip Karle et.al. | 2310.08114 | link |
2023-10-20 | Model Predictive Inferential Control of Neural State-Space Models for Autonomous Vehicle Motion Planning | Iman Askari et.al. | 2310.08045 | null |
2023-10-11 | VaPr: Variable-Precision Tensors to Accelerate Robot Motion Planning | Yu-Shun Hsiao et.al. | 2310.07854 | null |
2023-10-11 | CRITERIA: a New Benchmarking Paradigm for Evaluating Trajectory Prediction Models for Autonomous Driving | Changhe Chen et.al. | 2310.07794 | link |
2023-10-11 | Pixel State Value Network for Combined Prediction and Planning in Interactive Environments | Sascha Rosbach et.al. | 2310.07706 | null |
2023-10-11 | DESTINE: Dynamic Goal Queries with Temporal Transductive Alignment for Trajectory Prediction | Rezaul Karim et.al. | 2310.07438 | null |
2023-10-11 | CoPAL: Corrective Planning of Robot Actions with Large Language Models | Frank Joublin et.al. | 2310.07263 | null |
2023-10-10 | EARL: Eye-on-Hand Reinforcement Learner for Dynamic Grasping with Active Pose Estimation | Baichuan Huang et.al. | 2310.06751 | null |
2023-10-10 | TANGO: Time-Reversal Latent GraphODE for Multi-Agent Dynamical Systems | Zijie Huang et.al. | 2310.06427 | null |
2023-10-09 | CAT-RRT: Motion Planning that Admits Contact One Link at a Time | Nataliya Nechyporenko et.al. | 2310.06210 | null |
2023-10-09 | Human-Robot Gym: Benchmarking Reinforcement Learning in Human-Robot Collaboration | Jakob Thumm et.al. | 2310.06208 | link |
2023-10-09 | Motion Memory: Leveraging Past Experiences to Accelerate Future Motion Planning | Dibyendu Das et.al. | 2310.06198 | null |
2023-10-09 | Layout Sequence Prediction From Noisy Mobile Modality | Haichao Zhang et.al. | 2310.06138 | null |
2023-10-09 | DTPP: Differentiable Joint Conditional Prediction and Cost Evaluation for Tree Policy Planning in Autonomous Driving | Zhiyu Huang et.al. | 2310.05885 | link |
2023-10-09 | SocialCircle: Learning the Angle-based Social Interaction Representation for Pedestrian Trajectory Prediction | Conghao Wong et.al. | 2310.05370 | link |
2023-10-08 | MSight: An Edge-Cloud Infrastructure-based Perception System for Connected Automated Vehicles | Rusheng Zhang et.al. | 2310.05290 | null |
2023-10-06 | The WayHome: Long-term Motion Prediction on Dynamically Scaled | Kay Scheerer et.al. | 2310.04232 | null |
2023-10-05 | Probabilistic Generative Modeling for Procedural Roundabout Generation for Developing Countries | Zarif Ikram et.al. | 2310.03687 | null |
2023-10-05 | Enhanced Human-Robot Collaboration using Constrained Probabilistic Human-Motion Prediction | Aadi Kothari et.al. | 2310.03314 | null |
2023-10-04 | Incorporating Target Vehicle Trajectories Predicted by Deep Learning Into Model Predictive Controlled Vehicles | Ni Dang et.al. | 2310.02843 | null |
2023-10-11 | Video Transformers under Occlusion: How Physics and Background Attributes Impact Large Models for Robotic Manipulation | Shutong Jin et.al. | 2310.02044 | link |
2023-10-02 | EAST: Environment Aware Safe Tracking using Planning and Control Co-Design | Zhichao Li et.al. | 2310.01363 | link |
2023-11-13 | HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object Count | Noah Wiederhold et.al. | 2310.00723 | null |
2023-09-30 | Smoothing Mixed Traffic with Robust Data-driven Predictive Control for Connected and Autonomous Vehicles | Xu Shang et.al. | 2310.00509 | null |
2023-11-26 | Improving Trajectory Prediction in Dynamic Multi-Agent Environment by Dropping Waypoints | Pranav Singh Chib et.al. | 2309.17338 | null |
2023-09-29 | Robots That Can See: Leveraging Human Pose for Trajectory Prediction | Tim Salzmann et.al. | 2309.17209 | link |
2023-09-29 | UXsim: An open source macroscopic and mesoscopic traffic simulator in Python – a technical overview | Toru Seo et.al. | 2309.17114 | link |
2023-09-28 | Social Navigation in Crowded Environments with Model Predictive Control and Deep Learning-Based Human Trajectory Prediction | Viet-Anh Le et.al. | 2309.16838 | null |
2023-09-11 | A mobile observer method for the estimation of road traffic using communicating vehicles | Cyril Nguyen Van Phu et.al. | 2309.16717 | null |
2023-08-23 | Towards Safe Autonomy in Hybrid Traffic: Detecting Unpredictable Abnormal Behaviors of Human Drivers via Information Sharing | Jiangwei Wang et.al. | 2309.16716 | null |
2023-09-28 | MotionLM: Multi-Agent Motion Forecasting as Language Modeling | Ari Seff et.al. | 2309.16534 | null |
2023-09-27 | Improving Autonomous Driving Safety with POP: A Framework for Accurate Partially Observed Trajectory Predictions | Sheng Wang et.al. | 2309.15685 | null |
2023-10-18 | SEPT: Towards Efficient Scene Representation Learning for Motion Prediction | Zhiqian Lan et.al. | 2309.15289 | null |
2023-09-26 | A Physics Enhanced Residual Learning (PERL) Framework for Traffic State Prediction | Keke Long et.al. | 2309.15284 | null |
2023-09-26 | Near Real-Time Position Tracking for Robot-Guided Evacuation | Mollik Nayyar et.al. | 2309.15054 | null |
2023-09-26 | Context-Aware Generative Models for Prediction of Aircraft Ground Tracks | Nick Pepper et.al. | 2309.14957 | null |
2023-09-26 | Learning Generative Models for Climbing Aircraft from Radar Data | Nick Pepper et.al. | 2309.14941 | null |
2023-10-24 | Interaction-Aware Sampling-Based MPC with Learned Local Goal Predictions | Walter Jansma et.al. | 2309.14931 | null |
2023-09-28 | Semantic Map Learning of Traffic Light to Lane Assignment based on Motion Data | Thomas Monninger et.al. | 2309.14793 | link |
2023-09-25 | Scene Informer: Anchor-based Occlusion Inference and Trajectory Prediction in Partially Observable Environments | Bernard Lange et.al. | 2309.13893 | link |
2023-09-24 | DROP: Dynamics Responses from Human Motion Prior and Projective Dynamics | Yifeng Jiang et.al. | 2309.13742 | null |
2023-09-22 | SoRTS: Learned Tree Search for Long Horizon Social Robot Navigation | Ingrid Navarro et.al. | 2309.13144 | link |
2023-09-19 | A Novel Deep Neural Network for Trajectory Prediction in Automated Vehicles Using Velocity Vector Field | MReza Alipour Sormoli et.al. | 2309.10948 | link |
2023-09-20 | Pre-training on Synthetic Driving Data for Trajectory Prediction | Yiheng Li et.al. | 2309.10121 | link |
2023-11-30 | Moving Object Detection and Tracking with 4D Radar Point Cloud | Zhijun Pan et.al. | 2309.09737 | link |
2023-09-17 | Kinematics-aware Trajectory Generation and Prediction with Latent Stochastic Differential Modeling | Ruochen Jiao et.al. | 2309.09317 | null |
2023-09-17 | Trajectory Forecasting with Loose Clothing Using Left-to-Right Hidden Markov Model | Tianchen Shen et.al. | 2309.09237 | null |
2023-09-17 | Hamiltonian Dynamics Learning from Point Cloud Observations for Nonholonomic Mobile Robot Control | Abdullah Altawaitan et.al. | 2309.09163 | link |
2023-09-19 | Trajectory Prediction for Robot Navigation using Flow-Guided Markov Neural Operator | Rashmi Bhaskara et.al. | 2309.09137 | null |
2023-09-16 | Pedestrian Trajectory Prediction Using Dynamics-based Deep Learning | Honghui Wang et.al. | 2309.09021 | link |
2023-09-16 | RMP: A Random Mask Pretrain Framework for Motion Prediction | Yi Yang et.al. | 2309.08989 | link |
2023-09-16 | Staged Contact-Aware Global Human Motion Forecasting | Luca Scofano et.al. | 2309.08947 | link |
2023-09-16 | SafeShift: Safety-Informed Distribution Shifts for Robust Trajectory Prediction in Autonomous Driving | Benjamin Stoler et.al. | 2309.08889 | link |
2023-09-16 | Intention-Aware Planner for Robust and Safe Aerial Tracking | Qiuyu Ren et.al. | 2309.08854 | null |
2023-09-16 | Distributionally Robust CVaR-Based Safety Filtering for Motion Planning in Uncertain Environments | Sleiman Safaoui et.al. | 2309.08821 | link |
2023-09-11 | Model-based traffic state estimation for link traffic using moving cameras | Tanay Rastogi et.al. | 2309.07162 | null |
2023-09-13 | CLiFF-LHMP: Using Spatial Dynamics Patterns for Long-Term Human Motion Prediction | Yufei Zhu et.al. | 2309.07066 | link |
2023-09-13 | Utilizing Hybrid Trajectory Prediction Models to Recognize Highly Interactive Traffic Scenarios | Maximilian Zipfl et.al. | 2309.06887 | null |
2023-09-13 | A Multi-task Learning Framework for Drone State Identification and Trajectory Prediction | Antreas Palamas et.al. | 2309.06741 | null |
2023-09-11 | EANet: Expert Attention Network for Online Trajectory Prediction | Pengfei Yao et.al. | 2309.05683 | null |
2023-09-11 | Dynamic Handover: Throw and Catch with Bimanual Hands | Binghao Huang et.al. | 2309.05655 | null |
2023-09-11 | Real-Time Parallel Trajectory Optimization with Spatiotemporal Safety Constraints for Autonomous Driving in Congested Traffic | Lei Zheng et.al. | 2309.05298 | null |
2023-09-13 | Can you text what is happening? Integrating pre-trained language encoders into trajectory prediction models for autonomous driving | Ali Keysan et.al. | 2309.05282 | null |
2023-09-10 | AVARS – Alleviating Unexpected Urban Road Traffic Congestion using UAVs | Jiaying Guo et.al. | 2309.04976 | link |
2023-09-07 | PBP: Path-based Trajectory Prediction for Autonomous Driving | Sepideh Afshar et.al. | 2309.03750 | null |
2023-10-31 | Efficient Baselines for Motion Prediction in Autonomous Driving | Carlos Gómez-Huélamo et.al. | 2309.03387 | link |
2023-09-05 | Generalized Simplicial Attention Neural Networks | Claudio Battiloro et.al. | 2309.02138 | link |
2023-09-05 | Graph-Based Interaction-Aware Multimodal 2D Vehicle Trajectory Prediction using Diffusion Graph Convolutional Networks | Keshu Wu et.al. | 2309.01981 | null |
2023-09-01 | Reinforcement Learning with Human Feedback for Realistic Traffic Simulation | Yulong Cao et.al. | 2309.00709 | null |
2023-09-01 | Human trajectory prediction using LSTM with Attention mechanism | Amin Manafi Soltan Ahmadi et.al. | 2309.00331 | null |
2023-08-31 | Multiscale Residual Learning of Graph Convolutional Sequence Chunks for Human Motion Prediction | Mohsen Zand et.al. | 2308.16801 | null |
2023-08-31 | MMVP: Motion-Matrix-based Video Prediction | Yiqi Zhong et.al. | 2308.16154 | link |
2023-08-24 | Interaction-Aware Trajectory Prediction and Planning in Dense Highway Traffic using Distributed Model Predictive Control | Erik Börve et.al. | 2308.13053 | null |
2023-08-23 | Multi-object Detection, Tracking and Prediction in Rugged Dynamic Environments | Shixing Huang et.al. | 2308.11870 | null |
2023-08-31 | MacFormer: Map-Agent Coupled Transformer for Real-time and Robust Trajectory Prediction | Chen Feng et.al. | 2308.10280 | null |
2023-09-22 | CTP:A Causal Interpretable Model for Non-Communicable Disease Progression Prediction | Zhoujian Sun et.al. | 2308.09735 | link |
2023-09-02 | Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction | Chenxin Xu et.al. | 2308.08942 | link |
2023-08-17 | Fast Inference and Update of Probabilistic Density Estimation on Trajectory Prediction | Takahiro Maeda et.al. | 2308.08824 | link |
2023-08-17 | XVTP3D: Cross-view Trajectory Prediction Using Shared 3D Queries for Autonomous Driving | Zijian Song et.al. | 2308.08764 | null |
2023-08-15 | CASPNet++: Joint Multi-Agent Motion Prediction | Maximilian Schäfer et.al. | 2308.07751 | null |
2023-08-16 | Interaction-Aware Personalized Vehicle Trajectory Prediction Using Temporal Graph Neural Networks | Amr Abdelraouf et.al. | 2308.07439 | null |
2023-08-14 | UniWorld: Autonomous Driving Pre-training via World Models | Chen Min et.al. | 2308.07234 | link |
2023-08-15 | FOLT: Fast Multiple Object Tracking from UAV-captured Videos Based on Optical Flow | Mufeng Yao et.al. | 2308.07207 | null |
2023-08-14 | Masked Motion Predictors are Strong 3D Action Representation Learners | Yunyao Mao et.al. | 2308.07092 | link |
2023-08-13 | Polar Collision Grids: Effective Interaction Modelling for Pedestrian Trajectory Prediction in Shared Space Using Collision Checks | Mahsa Golchoubian et.al. | 2308.06654 | null |
2023-08-12 | 3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking | Shuxiao Ding et.al. | 2308.06635 | link |
2023-08-29 | EquiDiff: A Conditional Equivariant Diffusion Model For Trajectory Prediction | Kehua Chen et.al. | 2308.06564 | null |
2023-08-11 | Pedestrian Trajectory Prediction in Pedestrian-Vehicle Mixed Environments: A Systematic Review | Mahsa Golchoubian et.al. | 2308.06419 | null |
2023-08-11 | TrajPAC: Towards Robustness Verification of Pedestrian Trajectory Prediction Models | Liang Zhang et.al. | 2308.05985 | link |
2023-08-11 | Spatiotemporal Receding Horizon Control with Proactive Interaction Towards Safe and Efficient Autonomous Driving in Dense Traffic | Lei Zheng et.al. | 2308.05929 | null |
Diffusion
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-15 | 3D-Fixup: Advancing Photo Editing with 3D Priors | Yen-Chi Cheng et.al. | 2505.10566 | null |
2025-05-15 | End-to-End Vision Tokenizer Tuning | Wenxuan Wang et.al. | 2505.10562 | null |
2025-05-15 | Style Customization of Text-to-Vector Generation with Image Diffusion Priors | Peiying Zhang et.al. | 2505.10558 | null |
2025-05-15 | Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data | Yiwen Liu et.al. | 2505.10551 | link |
2025-05-15 | Pharmacophore-Conditioned Diffusion Model for Ligand-Based De Novo Drug Design | Amira Alakhdar et.al. | 2505.10545 | null |
2025-05-15 | CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs | Raman Dutt et.al. | 2505.10496 | null |
2025-05-15 | Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps | Ningyuan Yang et.al. | 2505.10482 | null |
2025-05-15 | Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models | Zemin Huang et.al. | 2505.10446 | null |
2025-05-15 | Score-based diffusion nowcasting of GOES imagery | Randy J. Chase et.al. | 2505.10432 | null |
2025-05-15 | Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems | Jeffrey Alido et.al. | 2505.10311 | null |
2025-05-15 | MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation | Yanbo Ding et.al. | 2505.10238 | null |
2025-05-15 | FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation | Jun Guo et.al. | 2505.10075 | null |
2025-05-15 | ToonifyGB: StyleGAN-based Gaussian Blendshapes for 3D Stylized Head Avatars | Rui-Yang Ju et.al. | 2505.10072 | null |
2025-05-15 | Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis | Bingda Tang et.al. | 2505.10046 | null |
2025-05-15 | ORL-LDM: Offline Reinforcement Learning Guided Latent Diffusion Model Super-Resolution Reconstruction | Shijie Lyu et.al. | 2505.10027 | null |
2025-05-15 | From Air to Wear: Personalized 3D Digital Fashion with AR/VR Immersive 3D Sketching | Ying Zang et.al. | 2505.09998 | null |
2025-05-15 | Ordered-subsets Multi-diffusion Model for Sparse-view CT Reconstruction | Pengfei Yu et.al. | 2505.09985 | null |
2025-05-15 | Improving the Euclidean Diffusion Generation of Manifold Data by Mitigating Score Function Singularity | Zichen Liu et.al. | 2505.09922 | null |
2025-05-15 | Diffusion-SAFE: Shared Autonomy Framework with Diffusion for Safe Human-to-Robot Driving Handover | Yunxin Fan et.al. | 2505.09889 | null |
2025-05-15 | Unsupervised Radar Point Cloud Enhancement via Arbitrary LiDAR Guided Diffusion Prior | Yanlong Yang et.al. | 2505.09887 | null |
2025-05-14 | Mission Balance: Generating Under-represented Class Samples using Video Diffusion Models | Danush Kumar Venkatesh et.al. | 2505.09858 | null |
2025-05-14 | On the Well-Posedness of Green’s Function Reconstruction via the Kirchhoff-Helmholtz Equation for One-Speed Neutron Diffusion | Roberto Ponciroli et.al. | 2505.09766 | null |
2025-05-14 | EnerVerse-AC: Envisioning Embodied Environments with Action Condition | Yuxin Jiang et.al. | 2505.09723 | null |
2025-05-14 | EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models | Hu Yue et.al. | 2505.09694 | null |
2025-05-11 | Joint Source-Channel Noise Adding with Adaptive Denoising for Diffusion-Based Semantic Communications | Chengyang Liang et.al. | 2505.09644 | null |
2025-05-01 | Generative diffusion model surrogates for mechanistic agent-based biological models | Tien Comlekoglu et.al. | 2505.09630 | null |
2025-05-14 | LightLab: Controlling Light Sources in Images with Diffusion Models | Nadav Magar et.al. | 2505.09608 | null |
2025-05-14 | Don’t Forget your Inverse DDIM for Image Editing | Guillermo Gomez-Trenado et.al. | 2505.09571 | null |
2025-05-14 | BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset | Jiuhai Chen et.al. | 2505.09568 | link |
2025-05-14 | Train a Multi-Task Diffusion Policy on RLBench-18 in One Day with One GPU | Yutong Hu et.al. | 2505.09430 | link |
2025-05-14 | Unraveling spin entanglement using quantum gates with scanning tunneling microscopy-driven electron spin resonance | Eric D. Switzer et.al. | 2505.09428 | null |
2025-05-14 | Diffusion Recommender Models and the Illusion of Progress: A Concerning Study of Reproducibility and a Conceptual Mismatch | Michael Benigni et.al. | 2505.09364 | null |
2025-05-14 | Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis | Bingxin Ke et.al. | 2505.09358 | null |
2025-05-14 | TransDiffuser: End-to-end Trajectory Generation with Decorrelated Multi-modal Representation for Autonomous Driving | Xuefeng Jiang et.al. | 2505.09315 | null |
2025-05-14 | Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations | Panqi Chen et.al. | 2505.09284 | null |
2025-05-14 | A Note on Semantic Diffusion | Alexander P. Ryjov et.al. | 2505.09283 | null |
2025-05-14 | Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation | Guan Gui et.al. | 2505.09263 | link |
2025-05-14 | An Initial Exploration of Default Images in Text-to-Image Generation | Hannu Simonen et.al. | 2505.09166 | null |
2025-05-15 | Generating time-consistent dynamics with discriminator-guided image diffusion models | Philipp Hess et.al. | 2505.09089 | null |
2025-05-13 | Predictive Digital Twins with Quantified Uncertainty for Patient-Specific Decision Making in Oncology | Graham Pash et.al. | 2505.08927 | null |
2025-05-15 | IntrinsicEdit: Precise generative image manipulation in intrinsic space | Linjie Lyu et.al. | 2505.08889 | null |
2025-05-13 | Generative AI for Autonomous Driving: Frontiers and Opportunities | Yuping Wang et.al. | 2505.08854 | link |
2025-05-13 | Generative AI for Urban Planning: Synthesizing Satellite Imagery via Diffusion Models | Qingyi Wang et.al. | 2505.08833 | null |
2025-05-12 | Towards SFW sampling for diffusion models via external conditioning | Camilo Carvajal Reyes et.al. | 2505.08817 | link |
2025-05-12 | MixBridge: Heterogeneous Image-to-Image Backdoor Attack through Mixture of Schrödinger Bridges | Shixi Qin et.al. | 2505.08809 | link |
2025-05-11 | TokenProber: Jailbreaking Text-to-image Models via Fine-grained Word Impact Analysis | Longtian Wang et.al. | 2505.08804 | null |
2025-05-10 | Multi-modal Synthetic Data Training and Model Collapse: Insights from VLMs and Diffusion Models | Zizhao Hu et.al. | 2505.08803 | null |
2025-05-13 | Generative Molecular Design with Steerable and Granular Synthesizability Control | Jeff Guo et.al. | 2505.08774 | link |
2025-05-13 | Controllable Image Colorization with Instance-aware Texts and Masks | Yanru An et.al. | 2505.08705 | null |
2025-05-13 | Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models | Donghoon Kim et.al. | 2505.08622 | null |
2025-05-13 | Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World | Yuran Wang et.al. | 2505.08607 | null |
2025-05-15 | Diffusion-assisted Model Predictive Control Optimization for Power System Real-Time Operation | Linna Xu et.al. | 2505.08535 | null |
2025-05-13 | Building-Block Aware Generative Modeling for 3D Crystals of Metal Organic Frameworks | Chenru Duan et.al. | 2505.08531 | link |
2025-05-14 | Improving Data Fidelity via Diffusion Model-based Correction and Super-Resolution | Wuzhe Xu et.al. | 2505.08526 | null |
2025-05-13 | Symbolically-Guided Visual Plan Inference from Uncurated Video Data | Wenyan Yang et.al. | 2505.08444 | null |
2025-05-13 | ConDiSim: Conditional Diffusion Models for Simulation Based Inference | Mayank Nautiyal et.al. | 2505.08403 | null |
2025-05-13 | Adaptive Diffusion Policy Optimization for Robotic Manipulation | Huiyun Jiang et.al. | 2505.08376 | null |
2025-05-13 | Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion | Anle Ke et.al. | 2505.08281 | null |
2025-05-13 | Skeleton-Guided Diffusion Model for Accurate Foot X-ray Synthesis in Hallux Valgus Diagnosis | Midi Wan et.al. | 2505.08247 | link |
2025-05-13 | Identifying Memorization of Diffusion Models through p-Laplace Analysis | Jonathan Brokman et.al. | 2505.08246 | null |
2025-05-13 | ACT-R: Adaptive Camera Trajectories for 3D Reconstruction from Single Image | Yizhi Wang et.al. | 2505.08239 | null |
2025-05-13 | EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation | Hanle Zheng et.al. | 2505.08235 | null |
2025-05-13 | Removing Watermarks with Partial Regeneration using Semantic Information | Krti Tallam et.al. | 2505.08234 | link |
2025-05-13 | Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix | Unai Gurbindo et.al. | 2505.08228 | null |
2025-05-13 | Visual Watermarking in the Era of Diffusion Models: Advances and Challenges | Junxian Duan et.al. | 2505.08197 | null |
2025-05-13 | Unsupervised Raindrop Removal from a Single Image using Conditional Diffusion Models | Lhuqita Fazry et.al. | 2505.08190 | link |
2025-05-13 | Highly Undersampled MRI Reconstruction via a Single Posterior Sampling of Diffusion Models | Jin Liu et.al. | 2505.08142 | null |
2025-05-12 | Task-Adaptive Semantic Communications with Controllable Diffusion-based Data Regeneration | Fupei Guo et.al. | 2505.07980 | null |
2025-05-12 | Image-Guided Microstructure Optimization using Diffusion Models: Validated with Li-Mn-rich Cathode Precursors | Geunho Choi et.al. | 2505.07906 | null |
2025-05-12 | Latent Behavior Diffusion for Sequential Reaction Generation in Dyadic Setting | Minh-Duc Nguyen et.al. | 2505.07901 | null |
2025-05-12 | EnvCDiff: Joint Refinement of Environmental Information and Channel Fingerprints via Conditional Generative Diffusion Model | Zhenzhou Jin et.al. | 2505.07894 | null |
2025-05-12 | Channel Fingerprint Construction for Massive MIMO: A Deep Conditional Generative Approach | Zhenzhou Jin et.al. | 2505.07893 | null |
2025-05-09 | VISTA: Generative Visual Imagination for Vision-and-Language Navigation | Yanjia Huang et.al. | 2505.07868 | null |
2025-05-09 | Computationally Efficient Diffusion Models in Medical Imaging: A Comprehensive Review | Abdullah et.al. | 2505.07866 | null |
2025-05-12 | DanceGRPO: Unleashing GRPO on Visual Generation | Zeyue Xue et.al. | 2505.07818 | null |
2025-05-12 | Pixel Motion as Universal Representation for Robot Control | Kanchana Ranasinghe et.al. | 2505.07817 | null |
2025-05-12 | Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets | Weiyu Li et.al. | 2505.07747 | null |
2025-05-12 | LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention | Jiangling Zhang et.al. | 2505.07734 | null |
2025-05-12 | ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models | Ozgur Kara et.al. | 2505.07652 | null |
2025-05-12 | Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models | Riccardo Passoni et.al. | 2505.07615 | null |
2025-05-12 | Noise Optimized Conditional Diffusion for Domain Adaptation | Lingkun Luo et.al. | 2505.07548 | null |
2025-05-12 | Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning | Bohan Wang et.al. | 2505.07538 | null |
2025-05-12 | Addressing degeneracies in latent interpolation for diffusion models | Erik Landolsi et.al. | 2505.07481 | null |
2025-05-12 | You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts | Hongkun Dou et.al. | 2505.07477 | link |
2025-05-13 | Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model | Wei Li et.al. | 2505.07449 | link |
2025-05-12 | DiffCrysGen: A Score-Based Diffusion Model for Design of Diverse Inorganic Crystalline Materials | Sourav Mal et.al. | 2505.07442 | null |
2025-05-12 | Diffusion-driven SpatioTemporal Graph KANsformer for Medical Examination Recommendation | Jianan Li et.al. | 2505.07431 | null |
2025-05-12 | GAN-based synthetic FDG PET images from T1 brain MRI can serve to improve performance of deep unsupervised anomaly detection models | Daria Zotova et.al. | 2505.07364 | null |
2025-05-15 | Generative Pre-trained Autoregressive Diffusion Transformer | Yuan Zhang et.al. | 2505.07344 | null |
2025-05-12 | Metrics that matter: Evaluating image quality metrics for medical image generation | Yash Deo et.al. | 2505.07175 | link |
2025-05-11 | Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution | Zihang Liu et.al. | 2505.07071 | link |
2025-05-11 | DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models | Junhao Xia et.al. | 2505.07057 | null |
2025-05-11 | CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation | Peng Li et.al. | 2505.07003 | null |
2025-05-11 | Replay-Based Continual Learning with Dual-Layered Distillation and a Streamlined U-Net for Efficient Text-to-Image Generation | Md. Naimur Asif Borno et.al. | 2505.06995 | null |
2025-05-11 | BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation | Panwen Hu et.al. | 2505.06985 | null |
2025-05-11 | Unsupervised Learning for Class Distribution Mismatch | Pan Du et.al. | 2505.06948 | link |
2025-05-11 | Near-Field Channel Estimation for XL-MIMO: A Deep Generative Model Guided by Side Information | Zhenzhou Jin et.al. | 2505.06900 | null |
2025-05-11 | Image Classification Using a Diffusion Model as a Pre-Training Model | Kosuke Ukita et.al. | 2505.06890 | null |
2025-05-11 | Topology Guidance: Controlling the Outputs of Generative Models via Vector Field Topology | Xiaohan Wang et.al. | 2505.06804 | null |
2025-05-11 | HistDiST: Histopathological Diffusion-based Stain Transfer | Erik Großkopf et.al. | 2505.06793 | null |
2025-05-15 | Learning Graph Representation of Agent Diffusers | Youcef Djenouri et.al. | 2505.06761 | link |
2025-05-10 | Jailbreaking the Text-to-Video Generative Models | Jiayang Liu et.al. | 2505.06679 | null |
2025-05-10 | Video Dataset Condensation with Diffusion Models | Zhe Li et.al. | 2505.06670 | null |
2025-05-10 | StableMotion: Repurposing Diffusion-Based Image Priors for Motion Estimation | Ziyi Wang et.al. | 2505.06668 | null |
2025-05-10 | ReplayCAD: Generative Diffusion Replay for Continual Anomaly Detection | Lei Hu et.al. | 2505.06603 | null |
2025-05-10 | Optimal Transport for Machine Learners | Gabriel Peyré et.al. | 2505.06589 | null |
2025-05-10 | Dynamic Uncertainty Learning with Noisy Correspondence for Text-Based Person Search | Zequn Xie et.al. | 2505.06566 | null |
2025-05-10 | HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models | Shuhan Zhuang et.al. | 2505.06543 | null |
2025-05-10 | ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images | Xianghao Kong et.al. | 2505.06537 | null |
2025-05-15 | HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation | Hang Wang et.al. | 2505.06512 | link |
2025-05-10 | Climate in a Bottle: Towards a Generative Foundation Model for the Kilometer-Scale Global Atmosphere | Noah D. Brenowitz et.al. | 2505.06474 | null |
2025-05-09 | PromptIQ: Who Cares About Prompts? Let System Handle It – A Component-Aware Framework for T2I Generation | Nisan Chhetri et.al. | 2505.06467 | null |
2025-05-09 | Long time behaviour of Mean Field Games with fractional diffusion | Olav Ersland et.al. | 2505.06183 | null |
2025-05-09 | DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models | Radu Alexandru Rosu et.al. | 2505.06166 | null |
2025-05-09 | Photovoltaic Defect Image Generator with Boundary Alignment Smoothing Constraint for Domain Shift Mitigation | Dongying Li et.al. | 2505.06117 | null |
2025-05-09 | Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation | Kunpeng Qiu et.al. | 2505.06068 | link |
2025-05-09 | Discovery of the Polar Ring Galaxies with deep learning | D. V. Dobrycheva et.al. | 2505.05890 | null |
2025-05-09 | A 3D pocket-aware and evolutionary conserved interaction guided diffusion model for molecular optimization | Anjie Qiao et.al. | 2505.05874 | null |
2025-05-09 | PICD: Versatile Perceptual Image Compression with Diffusion Rendering | Tongda Xu et.al. | 2505.05853 | null |
2025-05-09 | Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition | Zhiyuan Chen et.al. | 2505.05829 | link |
2025-05-09 | Demystifying Diffusion Policies: Action Memorization and Simple Lookup Table Alternatives | Chengyang He et.al. | 2505.05787 | null |
2025-05-09 | Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions | Dhruvesh Patel et.al. | 2505.05755 | null |
2025-05-09 | Automated Learning of Semantic Embedding Representations for Diffusion Models | Limai Jiang et.al. | 2505.05732 | null |
2025-05-09 | Towards Secure Semantic Transmission In the Era of GenAI: A Diffusion-based Framework | Boxiang He et.al. | 2505.05724 | null |
2025-05-09 | Semantic-Space-Intervened Diffusive Alignment for Visual Classification | Zixuan Li et.al. | 2505.05721 | null |
2025-05-12 | InstanceGen: Image Generation with Instance-level Instructions | Etai Sella et.al. | 2505.05678 | link |
2025-05-08 | Unsupervised Blind Speech Separation with a Diffusion Prior | Zhongweiyang Xu et.al. | 2505.05657 | link |
2025-05-08 | A Preliminary Study for GPT-4o on Image Restoration | Hao Yang et.al. | 2505.05621 | link |
2025-05-08 | ReactDance: Progressive-Granular Representation for Long-Term Coherent Reactive Dance Generation | Jingzhong Lin et.al. | 2505.05589 | null |
2025-05-12 | Prompt to Polyp: Medical Text-Conditioned Image Synthesis with Diffusion Models | Mikhail Chaichuk et.al. | 2505.05573 | link |
2025-05-07 | Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation | Yiming Qin et.al. | 2505.05505 | link |
2025-05-06 | Preliminary Explorations with GPT-4o(mni) Native Image Generation | Pu Cao et.al. | 2505.05501 | null |
2025-05-05 | Learning 3D Persistent Embodied World Models | Siyuan Zhou et.al. | 2505.05495 | null |
2025-05-08 | SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation | Yonwoo Choi et.al. | 2505.05475 | link |
2025-05-08 | 3D Scene Generation: A Survey | Beichen Wen et.al. | 2505.05474 | link |
2025-05-08 | DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion | Qitao Zhao et.al. | 2505.05473 | null |
2025-05-11 | Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation | Chao Liao et.al. | 2505.05472 | null |
2025-05-11 | Flow-GRPO: Training Flow Matching Models via Online RL | Jie Liu et.al. | 2505.05470 | link |
2025-05-08 | Denoising Diffusion Probabilistic Models for Coastal Inundation Forecasting | Kazi Ashik Islam et.al. | 2505.05381 | null |
2025-05-08 | Normalize Everything: A Preconditioned Magnitude-Preserving Architecture for Diffusion-Based Speech Enhancement | Julius Richter et.al. | 2505.05216 | null |
2025-05-08 | Diffusion Model Quantization: A Review | Qian Zeng et.al. | 2505.05215 | null |
2025-05-08 | EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution | Haizhen Xie et.al. | 2505.05209 | null |
2025-05-08 | Overcoming Dimensional Factorization Limits in Discrete Diffusion Models through Quantum Joint Distribution Learning | Chuangtao Chen et.al. | 2505.05151 | link |
2025-05-08 | Research on Anomaly Detection Methods Based on Diffusion Models | Yi Chen et.al. | 2505.05137 | null |
2025-05-08 | MDAA-Diff: CT-Guided Multi-Dose Adaptive Attention Diffusion Model for PET Denoising | Xiaolong Niu et.al. | 2505.05112 | null |
2025-05-12 | MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models | Hongyang Zhu et.al. | 2505.05101 | null |
2025-05-11 | ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model | Sagnik Bhattacharya et.al. | 2505.05082 | null |
2025-05-12 | PIDiff: Image Customization for Personalized Identities with Diffusion Models | Jinyu Gu et.al. | 2505.05081 | null |
2025-05-08 | Divide-and-Conquer: Cold-Start Bundle Recommendation via Mixture of Diffusion Experts | Ming Li et.al. | 2505.05035 | null |
2025-05-08 | SOAP: Style-Omniscient Animatable Portraits | Tingting Liao et.al. | 2505.05022 | link |
2025-05-08 | Inter-Diffusion Generation Model of Speakers and Listeners for Effective Communication | Jinhe Huang et.al. | 2505.04996 | null |
2025-05-08 | ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment | Wanjiang Weng et.al. | 2505.04974 | null |
2025-05-08 | Graffe: Graph Representation Learning via Diffusion Probabilistic Models | Dingshuo Chen et.al. | 2505.04956 | null |
2025-05-08 | T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models | Xuyang Guo et.al. | 2505.04946 | null |
2025-05-08 | Accurate and Fast Channel Estimation for Fluid Antenna Systems with Diffusion Models | Erqiang Tang et.al. | 2505.04930 | null |
2025-05-08 | GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing | Tong Wang et.al. | 2505.04915 | null |
2025-05-08 | D-CODA: Diffusion for Coordinated Dual-Arm Data Augmentation | I-Chun Arthur Liu et.al. | 2505.04860 | null |
2025-05-07 | CRAFT: Cultural Russian-Oriented Dataset Adaptation for Focused Text-to-Image Generation | Viacheslav Vasilev et.al. | 2505.04851 | null |
2025-05-07 | Steerable Scene Generation with Post Training and Inference-Time Search | Nicholas Pfaff et.al. | 2505.04831 | link |
2025-05-07 | Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers | Divyansh Srivastava et.al. | 2505.04718 | null |
2025-05-07 | AI-Generated Fall Data: Assessing LLMs and Diffusion Model for Wearable Fall Detection | Sana Alamgeer et.al. | 2505.04660 | null |
2025-05-07 | MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation | Zilong Chen et.al. | 2505.04656 | link |
2025-05-06 | Multimodal Benchmarking and Recommendation of Text-to-Image Generation Models | Kapil Wanaskar et.al. | 2505.04650 | link |
2025-05-06 | ChannelExplorer: Exploring Class Separability Through Activation Channel Visualization | Md Rahat-uz- Zaman et.al. | 2505.04647 | null |
2025-05-04 | Language translation, and change of accent for speech-to-speech task using diffusion model | Abhishek Mishra et.al. | 2505.04639 | null |
2025-05-07 | Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond | Jessie Richter-Powell et.al. | 2505.04621 | null |
2025-05-07 | Perpetuating Misogyny with Generative AI: How Model Personalization Normalizes Gendered Harm | Laura Wagner et.al. | 2505.04600 | null |
2025-05-07 | Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model | Pengfei Guo et.al. | 2505.04522 | null |
2025-05-08 | HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation | Teng Hu et.al. | 2505.04512 | null |
2025-05-08 | Defining and Quantifying Creative Behavior in Popular Image Generators | Aditi Ramaswamy et.al. | 2505.04497 | null |
2025-05-07 | Efficient Flow Matching using Latent Variables | Anirban Samaddar et.al. | 2505.04486 | null |
2025-05-07 | Localized Diffusion Models for High Dimensional Distributions Generation | Georg A. Gottwald et.al. | 2505.04417 | null |
2025-05-07 | CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion | Yanyu Li et.al. | 2505.04347 | null |
2025-05-07 | MoDE: Mixture of Diffusion Experts for Any Occluded Face Recognition | Qiannan Fan et.al. | 2505.04306 | null |
2025-05-07 | TS-Diff: Two-Stage Diffusion Model for Low-Light RAW Image Enhancement | Yi Li et.al. | 2505.04281 | link |
2025-05-07 | HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation | Yajie Fu et.al. | 2505.04276 | link |
2025-05-07 | Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting | Feng Yang et.al. | 2505.04262 | null |
2025-05-07 | DiffPattern-Flex: Efficient Layout Pattern Generation via Discrete Diffusion | Zixiao Wang et.al. | 2505.04173 | null |
2025-05-07 | Unmasking the Canvas: A Dynamic Benchmark for Image Generation Jailbreaking and LLM Content Safety | Variath Madhupal Gautham Nair et.al. | 2505.04146 | null |
2025-05-07 | RFNNS: Robust Fixed Neural Network Steganography with Popular Deep Generative Models | Yu Cheng et.al. | 2505.04116 | null |
2025-05-07 | Person-In-Situ: Scene-Consistent Human Image Insertion with Occlusion-Aware Pose Control | Shun Masuda et.al. | 2505.04052 | null |
2025-05-07 | BuildingBlock: A Hybrid Approach for Structured Building Generation | Junming Huang et.al. | 2505.04051 | null |
2025-05-07 | TerraFusion: Joint Generation of Terrain Geometry and Texture Using Latent Diffusion Models | Kazuki Higo et.al. | 2505.04050 | null |
2025-05-06 | Diffusion Models are Secretly Exchangeable: Parallelizing DDPMs via Autospeculation | Hengyuan Hu et.al. | 2505.03983 | null |
2025-05-06 | nuGAN: Generative Adversarial Emulator for Cosmic Web with Neutrinos | Neerav Kaushal et.al. | 2505.03936 | null |
2025-05-06 | Deepfakes on Demand: the rise of accessible non-consensual deepfake image generators | Will Hawkins et.al. | 2505.03859 | link |
2025-05-11 | From Spaceborne to Airborne: SAR Image Synthesis Using Foundation Models for Multi-Scale Adaptation | Solene Debuysere et.al. | 2505.03844 | null |
2025-05-06 | CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting | Huawei Sun et.al. | 2505.03679 | null |
2025-05-06 | Distribution-Conditional Generation: From Class Distribution to Creative Generation | Fu Feng et.al. | 2505.03667 | null |
2025-05-06 | Revolutionizing Brain Tumor Imaging: Generating Synthetic 3D FA Maps from T1-Weighted MRI using CycleGAN Models | Xin Du et.al. | 2505.03662 | null |
2025-05-06 | Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map | Alessandro Simoni et.al. | 2505.03623 | link |
2025-05-11 | PAHA: Parts-Aware Audio-Driven Human Animation with Diffusion Model | S. Z. Zhou et.al. | 2505.03603 | null |
2025-05-06 | Real-Time Person Image Synthesis Using a Flow Matching Model | Jiwoo Jeong et.al. | 2505.03562 | null |
2025-05-06 | A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges | Feibo Jiang et.al. | 2505.03556 | link |
2025-05-06 | Phenotype-Guided Generative Model for High-Fidelity Cardiac MRI Synthesis: Advancing Pretraining and Clinical Applications | Ziyu Li et.al. | 2505.03426 | null |
2025-05-06 | Safer Prompts: Reducing IP Risk in Visual Generative AI | Lena Reissinger et.al. | 2505.03338 | null |
2025-05-06 | FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing | Rui Lan et.al. | 2505.03329 | null |
2025-05-06 | Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning | Yibin Wang et.al. | 2505.03318 | null |
2025-05-06 | Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation | Jincheng Zhang et.al. | 2505.03314 | link |
2025-05-06 | A piston to counteract diffusion: The influence of an inward-shifting boundary on the heat equation in half-space | Samuel Tréton et.al. | 2505.03304 | null |
2025-05-06 | DiffVQA: Video Quality Assessment Using Diffusion Feature Extractor | Wei-Ting Chen et.al. | 2505.03261 | null |
2025-05-06 | Seeing the Abstract: Translating the Abstract Language for Vision Language Models | Davide Talon et.al. | 2505.03242 | link |
2025-05-06 | Transformers for Learning on Noisy and Task-Level Manifolds: Approximation and Generalization Insights | Zhaiming Shen et.al. | 2505.03205 | null |
2025-05-06 | PiCo: Enhancing Text-Image Alignment with Improved Noise Selection and Precise Mask Control in Diffusion Models | Chang Xie et.al. | 2505.03203 | null |
2025-05-06 | Convergence Of Consistency Model With Multistep Sampling Under General Data Assumptions | Yiding Chen et.al. | 2505.03194 | null |
2025-05-06 | DiffusionInv: Prior-enhanced Bayesian Full Waveform Inversion using Diffusion models | Yuanyuan Li et.al. | 2505.03138 | null |
2025-05-06 | Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control | Sajjad Rezvani Boroujeni et.al. | 2505.03134 | null |
2025-05-06 | Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability | Lei Wang et.al. | 2505.03097 | null |
2025-05-05 | Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models | Kuofeng Gao et.al. | 2505.02824 | link |
2025-05-05 | Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models | Yankai Jiang et.al. | 2505.02753 | link |
2025-05-06 | MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation | Mingcheng Li et.al. | 2505.02648 | null |
2025-05-07 | Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities | Xinjie Zhang et.al. | 2505.02567 | link |
2025-05-05 | Text to Image Generation and Editing: A Survey | Pengfei Yang et.al. | 2505.02527 | null |
2025-05-06 | Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces | Yang Lyu et.al. | 2505.02508 | null |
2025-05-07 | Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction | Inclusion AI et.al. | 2505.02471 | link |
2025-05-05 | Predicting the Dynamics of Complex System via Multiscale Diffusion Autoencoder | Ruikun Li et.al. | 2505.02450 | null |
2025-05-08 | T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models | Yunfeng Ge et.al. | 2505.02417 | link |
2025-05-08 | Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset | Jakub Wasala et.al. | 2505.02255 | null |
2025-05-04 | Quantizing Diffusion Models from a Sampling-Aware Perspective | Qian Zeng et.al. | 2505.02242 | null |
2025-05-04 | Improving Physical Object State Representation in Text-to-Image Generative Systems | Tianle Chen et.al. | 2505.02236 | link |
2025-05-04 | DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization | Wenchuan Wang et.al. | 2505.02192 | null |
2025-05-06 | Regression is all you need for medical image translation | Sebastian Rassmann et.al. | 2505.02048 | link |
2025-05-03 | Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling | Javier E. Santos et.al. | 2505.01917 | null |
2025-05-03 | Rethinking Score Distilling Sampling for 3D Editing and Generation | Xingyu Miao et.al. | 2505.01888 | null |
2025-05-03 | DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Haoteng Li et.al. | 2505.01857 | null |
2025-05-03 | MVHumanNet++: A Large-scale Dataset of Multi-view Daily Dressing Human Captures with Richer Annotations for 3D Human Digitization | Chenghong Li et.al. | 2505.01838 | null |
2025-05-03 | PhytoSynth: Leveraging Multi-modal Generative Models for Crop Disease Data Generation with Novel Benchmarking and Prompt Engineering Approach | Nitin Rai et.al. | 2505.01823 | null |
2025-05-03 | Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning | Jifeng Hu et.al. | 2505.01822 | null |
2025-05-03 | PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth | Bu Jin et.al. | 2505.01729 | null |
2025-05-03 | RAGAR: Retrieval Augment Personalized Image Generation Guided by Recommendation | Run Ling et.al. | 2505.01657 | null |
2025-05-02 | The DCR Delusion: Measuring the Privacy Risk of Synthetic Data | Zexi Yao et.al. | 2505.01524 | null |
2025-05-02 | WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation | Daoan Zhang et.al. | 2505.01490 | null |
2025-05-02 | VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos | Zongxia Li et.al. | 2505.01481 | link |
2025-04-26 | Global Stress Generation and Spatiotemporal Super-Resolution Physics-Informed Operator under Dynamic Loading for Two-Phase Random Materials | Tengfei Xing et.al. | 2505.01438 | null |
2025-05-02 | VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models | Mohammadreza Teymoorianfard et.al. | 2505.01406 | link |
2025-05-02 | Provable Efficiency of Guidance in Diffusion Models for General Data Distribution | Gen Li et.al. | 2505.01382 | null |
2025-05-02 | FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors | Chenxi Li et.al. | 2505.01322 | null |
2025-05-02 | Model See Model Do: Speech-Driven Facial Animation with Style Control | Yifang Pan et.al. | 2505.01319 | null |
2025-05-05 | Enabling Training-Free Semantic Communication Systems with Generative Diffusion Models | Shunpu Tang et.al. | 2505.01209 | null |
2025-05-02 | FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis | Jiangtong Tan et.al. | 2505.01172 | link |
2025-05-02 | VSC: Visual Search Compositional Text-to-Image Diffusion Model | Do Huu Dat et.al. | 2505.01104 | null |
2025-05-02 | Improving Editability in Image Generation with Layer-wise Memory | Daneul Kim et.al. | 2505.01079 | null |
2025-05-02 | Multi-Step Consistency Models: Fast Generation with Theoretical Guarantees | Nishant Jain et.al. | 2505.01049 | null |
2025-05-02 | Where’s the liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content | Haoyue Bai et.al. | 2505.01008 | null |
2025-05-12 | Multi-Modal Language Models as Text-to-Image Model Evaluators | Jiahui Chen et.al. | 2505.00759 | null |
2025-05-01 | InstructAttribute: Fine-grained Object Attributes editing with Instruction | Xingxi Yin et.al. | 2505.00751 | null |
2025-05-05 | Generalized $θ$ -Parametric Metric Spaces: Fixed Point Theorems and Applications to Fractional Economic Models | Abhishikta Das et.al. | 2505.00722 | null |
2025-05-01 | Controllable Weather Synthesis and Removal with Video Diffusion Models | Chih-Hao Lin et.al. | 2505.00704 | null |
2025-05-01 | T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT | Dongzhi Jiang et.al. | 2505.00703 | link |
2025-05-01 | GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution | Aditya Arora et.al. | 2505.00687 | null |
2025-05-01 | ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models | Jiarong Wei et.al. | 2505.00586 | null |
2025-05-01 | Safety-Critical Traffic Simulation with Guided Latent Diffusion Model | Mingxing Peng et.al. | 2505.00515 | null |
2025-05-01 | JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers | Kwon Byung-Ki et.al. | 2505.00482 | null |
2025-05-01 | Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly | Ruiyuan Zhang et.al. | 2505.00426 | null |
2025-05-01 | Denoising weak lensing mass maps with diffusion model: systematic comparison with generative adversarial network | Shohei D. Aoyama et.al. | 2505.00345 | null |
2025-05-01 | T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation | Xuyang Guo et.al. | 2505.00337 | null |
2025-05-05 | Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution | Luigi Sigillo et.al. | 2505.00334 | null |
2025-04-30 | Generative Machine Learning in Adaptive Control of Dynamic Manufacturing Processes: A Review | Suk Ki Lee et.al. | 2505.00210 | null |
2025-04-30 | Direct Motion Models for Assessing Generated Videos | Kelsey Allen et.al. | 2505.00209 | null |
2025-04-30 | Generative Multimodal Multiscale Data Fusion for Digital Twins in Aerosol Jet Electronics Printing | Fatemeh Elhambakhsh et.al. | 2505.00176 | null |
2025-04-30 | Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis | Michal Geyer et.al. | 2505.00135 | null |
2025-04-30 | Materials discovery acceleration by using condition generative methodology | Caiyuan Ye et.al. | 2505.00076 | link |
2025-04-27 | Keep the General, Inject the Specific: Structured Dialogue Fine-Tuning for Knowledge Injection without Catastrophic Forgetting | Yijie Hong et.al. | 2505.00029 | null |
2025-04-30 | ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction | Qihao Liu et.al. | 2504.21855 | null |
2025-04-30 | 3D Stylization via Large Reconstruction Model | Ipek Oztas et.al. | 2504.21836 | null |
2025-04-30 | Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields | Yixin Gao et.al. | 2504.21814 | null |
2025-04-30 | Generalizing Biased Backpressure Routing and Scheduling to Wireless Multi-hop Networks with Advanced Air-interfaces | Zhongyuan Zhao et.al. | 2504.21721 | null |
2025-05-13 | HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation | Haiyang Zhou et.al. | 2504.21650 | link |
2025-04-30 | Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection | Liqin Wang et.al. | 2504.21646 | null |
2025-04-30 | ODE and PDE models for COVID-19, with reinfection and vaccination process for Cameroon and Germany | Hamadjam Abboubakar et.al. | 2504.21613 | null |
2025-04-30 | Latent Feature-Guided Conditional Diffusion for High-Fidelity Generative Image Semantic Communication | Zehao Chen et.al. | 2504.21577 | null |
2025-05-10 | MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance | Mengting Wei et.al. | 2504.21497 | link |
2025-05-08 | DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration | Hebaixu Wang et.al. | 2504.21487 | link |
2025-04-30 | Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision | Weicai Yan et.al. | 2504.21423 | null |
2025-04-30 | IDDM: Bridging Synthetic-to-Real Domain Gap from Physics-Guided Diffusion for Real-world Image Dehazing | Shijun Zhou et.al. | 2504.21385 | null |
2025-04-30 | Sparse-to-Sparse Training of Diffusion Models | Inês Cardoso Oliveira et.al. | 2504.21380 | null |
2025-05-08 | Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing | Hong Zhang et.al. | 2504.21356 | link |
2025-04-30 | Simple Visual Artifact Detection in Sora-Generated Videos | Misora Sugiyama et.al. | 2504.21334 | null |
2025-04-30 | Text-Conditioned Diffusion Model for High-Fidelity Korean Font Generation | Abdul Sami et.al. | 2504.21325 | null |
2025-04-30 | Capturing Conditional Dependence via Auto-regressive Diffusion Models | Xunpeng Huang et.al. | 2504.21314 | null |
2025-04-30 | AGHI-QA: A Subjective-Aligned Dataset and Metric for AI-Generated Human Images | Yunhao Li et.al. | 2504.21308 | null |
2025-04-30 | The Dual Power of Interpretable Token Embeddings: Jailbreaking Attacks and Defenses for Diffusion Model Unlearning | Siyi Chen et.al. | 2504.21307 | null |
2025-04-30 | Can We Achieve Efficient Diffusion without Self-Attention? Distilling Self-Attention into Convolutions | ZiYi Dong et.al. | 2504.21292 | null |
2025-04-30 | CoCoDiff: Diversifying Skeleton Action Features via Coarse-Fine Text-Co-Guided Latent Diffusion | Zhifu Zhao et.al. | 2504.21266 | null |
2025-04-29 | T2ID-CAS: Diffusion Model and Class Aware Sampling to Mitigate Class Imbalance in Neck Ultrasound Anatomical Landmark Detection | Manikanta Varaganti et.al. | 2504.21231 | null |
2025-04-29 | ProT-GFDM: A Generative Fractional Diffusion Model for Protein Generation | Xiao Liang et.al. | 2504.21092 | null |
2025-04-29 | Erased but Not Forgotten: How Backdoors Compromise Concept Erasure | Jonas Henry Grebe et.al. | 2504.21072 | null |
2025-04-29 | A 3D pocket-aware and affinity-guided diffusion model for lead optimization | Anjie Qiao et.al. | 2504.21065 | null |
2025-04-29 | YoChameleon: Personalized Vision and Language Generation | Thao Nguyen et.al. | 2504.20998 | null |
2025-04-29 | X-Fusion: Introducing New Modality to Frozen Large Language Models | Sicheng Mo et.al. | 2504.20996 | null |
2025-04-29 | TesserAct: Learning 4D Embodied World Models | Haoyu Zhen et.al. | 2504.20995 | null |
2025-04-29 | AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection | Lorenzo Pellegrini et.al. | 2504.20865 | null |
2025-04-29 | SoccerDiffusion: Toward Learning End-to-End Humanoid Robot Soccer from Gameplay Recordings | Florian Vahl et.al. | 2504.20808 | null |
2025-04-29 | JTreeformer: Graph-Transformer via Latent-Diffusion Model for Molecular Generation | Ji Shi et.al. | 2504.20770 | null |
2025-04-29 | DDPS: Discrete Diffusion Posterior Sampling for Paths in Layered Graphs | Hao Luan et.al. | 2504.20754 | null |
2025-04-29 | Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion | Zesheng Wang et.al. | 2504.20685 | null |
2025-04-29 | Advance Fake Video Detection via Vision Transformers | Joy Battocchio et.al. | 2504.20669 | null |
2025-04-29 | LDPoly: Latent Diffusion for Polygonal Road Outline Extraction in Large-Scale Topographic Mapping | Weiqin Jiao et.al. | 2504.20645 | null |
2025-04-29 | DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models | Sagi Della Torre et.al. | 2504.20625 | null |
2025-04-29 | TriniMark: A Robust Generative Speech Watermarking Method for Trinity-Level Attribution | Yue Li et.al. | 2504.20532 | null |
2025-04-29 | Dynamic Attention Analysis for Backdoor Detection in Text-to-Image Diffusion Models | Zhongqi Wang et.al. | 2504.20518 | null |
2025-04-29 | Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding | Gabe Guo et.al. | 2504.20456 | link |
2025-04-30 | PixelHacker: Image Inpainting with Structural and Semantic Consistency | Ziyang Xu et.al. | 2504.20438 | null |
2025-04-29 | ADiff4TPP: Asynchronous Diffusion Models for Temporal Point Processes | Amartya Mukherjee et.al. | 2504.20411 | null |
2025-04-29 | Inception: Jailbreak the Memory Mechanism of Text-to-Image Generation Systems | Shiqian Zhao et.al. | 2504.20376 | null |
2025-04-29 | A Picture is Worth a Thousand Prompts? Efficacy of Iterative Human-Driven Prompt Refinement in Image Regeneration Tasks | Khoi Trinh et.al. | 2504.20340 | null |
2025-04-28 | Image Interpolation with Score-based Riemannian Metrics of Diffusion Models | Shinnosuke Saito et.al. | 2504.20288 | null |
2025-04-28 | Generative Diffusion Models for Resource Allocation in Wireless Networks | Yigit Berkay Uslu et.al. | 2504.20277 | null |
2025-04-28 | Physics-Informed Diffusion Models for SAR Ship Wake Generation from Text Prompts | Kamirul Kamirul et.al. | 2504.20241 | null |
2025-04-28 | Integration Flow Models | Jingjing Wang et.al. | 2504.20179 | null |
2025-04-27 | Forging and Removing Latent-Noise Diffusion Watermarks Using a Single Image | Anubhav Jain et.al. | 2504.20111 | null |
2025-04-28 | CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition | Quynh Phung et.al. | 2504.19894 | null |
2025-04-28 | DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Mamadou Keita et.al. | 2504.19876 | link |
2025-04-28 | CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback | Chenhan Jiang et.al. | 2504.19860 | null |
2025-04-28 | RepText: Rendering Visual Text via Replicating | Haofan Wang et.al. | 2504.19724 | null |
2025-04-28 | Interactive Discovery and Exploration of Visual Bias in Generative Text-to-Image Models | Johannes Eschner et.al. | 2504.19703 | null |
2025-04-28 | Multimodal Conditioned Diffusive Time Series Forecasting | Chen Su et.al. | 2504.19669 | null |
2025-04-28 | Robot Motion Planning using One-Step Diffusion with Noise-Optimized Approximate Motions | Tomoharu Aizu et.al. | 2504.19652 | null |
2025-04-28 | AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis | Haroui Ma et.al. | 2504.19621 | link |
2025-04-28 | DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer | Junpeng Jiang et.al. | 2504.19614 | null |
2025-04-28 | Image Generation Method Based on Heat Diffusion Models | Pengfei Zhang et.al. | 2504.19600 | null |
2025-04-29 | WILD: a new in-the-Wild Image Linkage Dataset for synthetic image attribution | Pietro Bongini et.al. | 2504.19595 | null |
2025-04-28 | GenPTW: In-Generation Image Watermarking for Provenance Tracing and Tamper Localization | Zhenliang Gan et.al. | 2504.19567 | null |
2025-04-28 | SynergyAmodal: Deocclude Anything with Text Control | Xinyang Li et.al. | 2504.19506 | null |
2025-04-28 | Simultaneous Pick and Place Detection by Combining SE(3) Diffusion Models with Differential Kinematics | Tianyi Ko et.al. | 2504.19502 | null |
2025-04-28 | Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition | Yuki Hirakawa et.al. | 2504.19455 | null |
2025-04-28 | GTSD: Generative Text Steganography Based on Diffusion Model | Zhengxian Wu et.al. | 2504.19433 | null |
2025-04-28 | Boosting 3D Liver Shape Datasets with Diffusion Models and Implicit Neural Representations | Khoa Tuan Nguyen et.al. | 2504.19402 | null |
2025-04-27 | Flow Along the K-Amplitude for Generative Modeling | Weitao Du et.al. | 2504.19353 | null |
2025-04-27 | Sketch2Anim: Towards Transferring Sketch Storyboards into 3D Animation | Lei Zhong et.al. | 2504.19189 | null |
2025-04-29 | IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos | Yuan Li et.al. | 2504.19165 | null |
2025-04-27 | Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions | Mohammad Mahdi Abootorabi et.al. | 2504.19056 | link |
2025-04-26 | Learning Stochastic Thermodynamics Directly from Correlation and Trajectory-Fluctuation Currents | Jinghao Lyu et.al. | 2504.19007 | null |
2025-04-26 | REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models | Gal Almog et.al. | 2504.18989 | link |
2025-04-26 | Predicting Stress in Two-phase Random Materials and Super-Resolution Method for Stress Images by Embedding Physical Information | Tengfei Xing et.al. | 2504.18854 | null |
2025-04-26 | Audio-Driven Talking Face Video Generation with Joint Uncertainty Learning | Yifan Xie et.al. | 2504.18810 | null |
2025-04-26 | Stealing Creator’s Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation | Jong Inn Park et.al. | 2504.18805 | null |
2025-04-25 | Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection | Brian K. S. Isaac-Medina et.al. | 2504.18746 | null |
2025-04-25 | Appa: Bending Weather Dynamics with Latent Diffusion Models for Global Data Assimilation | Gérôme Andry et.al. | 2504.18720 | null |
2025-04-22 | DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment | Xiaofan Li et.al. | 2504.18576 | null |
2025-04-21 | Backdoor Defense in Diffusion Models via Spatial Attention Unlearning | Abha Jha et.al. | 2504.18563 | null |
2025-04-25 | TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation | Gwen Yidou Weng et.al. | 2504.18535 | null |
2025-04-25 | NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration | Haotian Dong et.al. | 2504.18448 | null |
2025-04-25 | HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models | Jens Hooge et.al. | 2504.18405 | null |
2025-04-25 | Energy Security and Resilience: Reviewing Concepts and Advancing Planning Perspectives for Transforming Integrated Energy Systems | Richard Schmitz et.al. | 2504.18396 | null |
2025-04-24 | Fast Autoregressive Models for Continuous Latent Generation | Tiankai Hang et.al. | 2504.18391 | null |
2025-04-25 | SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations | Shuting Zhao et.al. | 2504.18332 | null |
2025-04-25 | STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting | Yunze Deng et.al. | 2504.18318 | null |
2025-04-25 | TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation | Shintaro Ozaki et.al. | 2504.18269 | null |
2025-04-25 | Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding | Kun Li et.al. | 2504.18204 | null |
2025-04-25 | Generative AI for Physical-Layer Authentication | Rui Meng et.al. | 2504.18175 | null |
2025-04-25 | Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation | Weipeng Tan et.al. | 2504.18087 | null |
2025-04-25 | Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models | Chen Chen et.al. | 2504.18032 | null |
2025-04-25 | Diffusion-Driven Universal Model Inversion Attack for Face Recognition | Hanrui Wang et.al. | 2504.18015 | null |
2025-04-24 | DCT-Shield: A Robust Frequency Domain Defense against Malicious Image Editing | Aniruddha Bala et.al. | 2504.17894 | null |
2025-04-30 | Evolution Meets Diffusion: Efficient Neural Architecture Generation | Bingye Zhou et.al. | 2504.17827 | null |
2025-04-24 | FashionM3: Multimodal, Multitask, and Multiround Fashion Assistant based on Unified Vision-Language Model | Kaicheng Pang et.al. | 2504.17826 | null |
2025-04-24 | Dual Prompting Image Restoration with Diffusion Transformers | Dehong Kong et.al. | 2504.17825 | null |
2025-04-23 | Subject-driven Video Generation via Disentangled Identity and Motion | Daneul Kim et.al. | 2504.17816 | null |
2025-04-23 | Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning | Mingxuan Cui et.al. | 2504.17815 | link |
2025-04-24 | LiDPM: Rethinking Point Diffusion for Lidar Scene Completion | Tetiana Martyniuk et.al. | 2504.17791 | null |
2025-04-27 | Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models | Xu Ma et.al. | 2504.17789 | null |
2025-04-24 | Dynamic Camera Poses and Where to Find Them | Chris Rockwell et.al. | 2504.17788 | null |
2025-04-24 | Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields | Zhuo He et.al. | 2504.17712 | null |
2025-04-24 | polyGen: A Learning Framework for Atomic-level Polymer Structure Generation | Ayush Jain et.al. | 2504.17656 | null |
2025-04-24 | Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization | Abderrachid Hamrani et.al. | 2504.17628 | null |
2025-04-24 | STCL:Curriculum learning Strategies for deep learning image steganography models | Fengchun Liu et.al. | 2504.17609 | link |
2025-04-24 | Text-to-Image Alignment in Denoising-Based Models through Step Selection | Paul Grimal et.al. | 2504.17525 | null |
2025-04-24 | ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting | Junyan Zhang et.al. | 2504.17524 | null |
2025-04-24 | RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation | Aviv Slobodkin et.al. | 2504.17502 | null |
2025-04-24 | Longitudinal Control for Autonomous Racing with Combustion Engine Vehicles | Phillip Pitschi et.al. | 2504.17418 | null |
2025-04-24 | 3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models | Min Wei et.al. | 2504.17414 | null |
2025-04-24 | StereoMamba: Real-time and Robust Intraoperative Stereo Disparity Estimation via Long-range Spatial Dependencies | Xu Wang et.al. | 2504.17401 | null |
2025-04-24 | DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition | Yiyan Xu et.al. | 2504.17349 | null |
2025-04-24 | CKMDiff: A Generative Diffusion Model for CKM Construction via Inverse Problems with Learned Priors | Shen Fu et.al. | 2504.17323 | null |
2025-04-24 | Physics-based super-resolved simulation of 3D elastic wave propagation adopting scalable Diffusion Transformer | Hugo Gabrielidis et.al. | 2504.17308 | null |
2025-04-24 | Towards Generalized and Training-Free Text-Guided Semantic Manipulation | Yu Hong et.al. | 2504.17269 | null |
2025-04-24 | MV-Crafter: An Intelligent System for Music-guided Video Generation | Chuer Chen et.al. | 2504.17267 | null |
2025-04-24 | DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks | Yinqi Li et.al. | 2504.17253 | link |
2025-04-24 | Scene Perceived Image Perceptual Score (SPIPS): combining global and local perception for image quality assessment | Zhiqiang Lao et.al. | 2504.17234 | null |
2025-04-25 | We’ll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback | Minkyu Choi et.al. | 2504.17180 | null |
2025-04-24 | AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models | Mohammad Zarei et.al. | 2504.17179 | null |
2025-04-25 | Latent Video Dataset Distillation | Ning Li et.al. | 2504.17132 | null |
2025-04-23 | Physics-guided and fabrication-aware inverse design of photonic devices using diffusion models | Dongjin Seo et.al. | 2504.17077 | link |
2025-04-23 | Distilling semantically aware orders for autoregressive image generation | Rishav Pramanik et.al. | 2504.17069 | null |
2025-04-23 | Diffusion Probabilistic Models for Compressive SAR Imaging | Odysseas Pappas et.al. | 2504.17053 | null |
2025-04-22 | Self-Controlled Diffusion for Denoising in Scientific Imaging | Nikolay Falaleev et.al. | 2504.16951 | null |
2025-04-23 | BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation | Ruotong Wang et.al. | 2504.16907 | null |
2025-04-23 | Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials | Peichen Zhong et.al. | 2504.16893 | null |
2025-04-23 | Planning with Diffusion Models for Target-Oriented Dialogue Systems | Hanwen Du et.al. | 2504.16858 | null |
2025-04-23 | Physically Consistent Humanoid Loco-Manipulation using Latent Diffusion Models | Ilyass Taouil et.al. | 2504.16843 | null |
2025-04-24 | Simple Graph Contrastive Learning via Fractional-order Neural Diffusion Networks | Yanan Zhao et.al. | 2504.16748 | null |
2025-04-23 | MOSAIC: A Skill-Centric Algorithmic Framework for Long-Horizon Manipulation Planning | Itamar Mishani et.al. | 2504.16738 | null |
2025-04-24 | Hyper-Transforming Latent Diffusion Models | Ignacio Peis et.al. | 2504.16580 | null |
2025-05-10 | A Comprehensive Survey of Synthetic Tabular Data Generation | Ruxue Shi et.al. | 2504.16506 | null |
2025-04-23 | Generalized vector equilibrium problems with pairs of bifunctions and some applications | Hung Bui The et.al. | 2504.16497 | null |
2025-04-23 | The Dance of Atoms-De Novo Protein Design with Diffusion Model | Yujie Qin et.al. | 2504.16479 | null |
2025-04-23 | ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance | Ying Li et.al. | 2504.16464 | null |
2025-04-23 | Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion | Ruixiang Zhang et.al. | 2504.16431 | null |
2025-04-23 | CLPSTNet: A Progressive Multi-Scale Convolutional Steganography Model Integrating Curriculum Learning | Fengchun Liu et.al. | 2504.16364 | link |
2025-04-23 | VideoMark: A Distortion-Free Robust Watermarking Framework for Video Diffusion Models | Xuming Hu et.al. | 2504.16359 | null |
2025-04-22 | SignX: The Foundation Model for Sign Recognition | Sen Fang et.al. | 2504.16315 | null |
2025-04-22 | Learning Energy-Based Generative Models via Potential Flow: A Variational Principle Approach to Probability Density Homotopy Matching | Junn Yong Loo et.al. | 2504.16262 | null |
2025-04-22 | Aerial Active STAR-RIS-assisted Satellite-Terrestrial Covert Communications | Chuang Zhang et.al. | 2504.16146 | null |
2025-04-22 | Survey of Video Diffusion Models: Foundations, Implementations, and Applications | Yimu Wang et.al. | 2504.16081 | null |
2025-04-22 | From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning | Le Zhuo et.al. | 2504.16080 | null |
2025-04-22 | Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation | Yuanpeng Qu et.al. | 2504.16077 | null |
2025-04-22 | Boosting Generative Image Modeling via Joint Image-Feature Synthesis | Theodoros Kouzelis et.al. | 2504.16064 | null |
2025-04-22 | Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework | Xinyuan Song et.al. | 2504.16016 | null |
2025-04-26 | FreeGraftor: Training-Free Cross-Image Feature Grafting for Subject-Driven Text-to-Image Generation | Zebin Yao et.al. | 2504.15958 | null |
2025-04-22 | Adversarial Observations in Weather Forecasting | Erik Imgrund et.al. | 2504.15942 | null |
2025-04-22 | Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning | Wang Lin et.al. | 2504.15932 | null |
2025-04-22 | Text-based Animatable 3D Avatars with Morphable Model Alignment | Yiqian Wu et.al. | 2504.15835 | null |
2025-04-22 | DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers | Xuyang Zhong et.al. | 2504.15827 | null |
2025-04-22 | Satellite to GroundScape – Large-scale Consistent Ground View Generation from Satellite Views | Ningli Xu et.al. | 2504.15786 | null |
2025-04-24 | Clifford Group Equivariant Diffusion Models for 3D Molecular Generation | Cong Liu et.al. | 2504.15773 | null |
2025-04-22 | Structure-Preserving Zero-Shot Image Editing via Stage-Wise Latent Injection in Diffusion Models | Dasol Jeong et.al. | 2504.15723 | null |
2025-04-22 | DiTPainter: Efficient Video Inpainting with Diffusion Transformers | Xian Wu et.al. | 2504.15661 | null |
2025-04-22 | RadioDiff- $k^2$ : Helmholtz Equation Informed Generative Diffusion Model for Multi-Path Aware Radio Map Construction | Xiucheng Wang et.al. | 2504.15623 | null |
2025-04-22 | InstaRevive: One-Step Image Enhancement via Dynamic Score Matching | Yixuan Zhu et.al. | 2504.15513 | null |
2025-04-21 | Emergence and Evolution of Interpretable Concepts in Diffusion Models | Berk Tinaz et.al. | 2504.15473 | null |
2025-04-21 | Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images | Jonathan Brokman et.al. | 2504.15470 | null |
2025-04-21 | MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World | Ankit Dhiman et.al. | 2504.15397 | null |
2025-04-21 | Solving New Tasks by Adapting Internet Video Knowledge | Calvin Luo et.al. | 2504.15369 | null |
2025-04-20 | Diffusion-Driven Inertial Generated Data for Smartphone Location Classification | Noa Cohen et.al. | 2504.15315 | null |
2025-04-19 | LLM-Enabled Style and Content Regularization for Personalized Text-to-Image Generation | Anran Yu et.al. | 2504.15309 | null |
2025-04-23 | Surface to Seafloor: A Generative AI Framework for Decoding the Ocean Interior State | Andre N. Souza et.al. | 2504.15308 | null |
2025-04-16 | Diffusion Models on the Edge: Challenges, Optimizations, and Applications | Dongqi Zheng et.al. | 2504.15298 | null |
2025-04-21 | Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction | Vaishnavh Nagarajan et.al. | 2504.15266 | link |
2025-04-21 | Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation | Yunxuan Cai et.al. | 2504.15259 | null |
2025-04-21 | DRAGON: Distributional Rewards Optimize Diffusion Generative Models | Yatong Bai et.al. | 2504.15217 | null |
2025-04-22 | LACE: Controlled Image Prompting and Iterative Refinement with GenAI for Professional Visual Art Creators | Yenkai Huang et.al. | 2504.15189 | null |
2025-04-21 | Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform | Xianpan Zhou et.al. | 2504.15182 | null |
2025-04-21 | FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image | Fei Yin et.al. | 2504.15179 | null |
2025-04-21 | DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution | Miaomiao Cai et.al. | 2504.15176 | null |
2025-04-21 | Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration | Junyuan Deng et.al. | 2504.15159 | null |
2025-04-21 | GIFDL: Generated Image Fluctuation Distortion Learning for Enhancing Steganographic Security | Xiangkun Wang et.al. | 2504.15139 | null |
2025-04-21 | Automatic Generation of Aerobatic Flight in Complex Environments via Diffusion Models | Yuhang Zhong et.al. | 2504.15138 | null |
2025-04-27 | VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation | Mingxia Zhan et.al. | 2504.15095 | null |
2025-04-21 | Generative Artificial Intelligence for Beamforming in Low-Altitude Economy | Geng Sun et.al. | 2504.15079 | null |
2025-04-21 | SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation | Yue Li et.al. | 2504.15035 | null |
2025-04-30 | DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation | Weijie He et.al. | 2504.15032 | null |
2025-04-21 | Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models | Zijin Yang et.al. | 2504.15026 | null |
2025-04-21 | PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV | Qianyu Zhu et.al. | 2504.14952 | link |
2025-04-21 | TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models | Mazharul Islam Rakib et.al. | 2504.14933 | null |
2025-04-21 | Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation | Chenjie Cao et.al. | 2504.14899 | link |
2025-04-21 | Twin Co-Adaptive Dialogue for Progressive Image Generation | Jianhui Wang et.al. | 2504.14868 | null |
2025-04-21 | LACE: Exploring Turn-Taking and Parallel Interaction Modes in Human-AI Co-Creation for Iterative Image Generation | YenKai Huang et.al. | 2504.14827 | null |
2025-04-21 | What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale | Xiaoyong Yuan et.al. | 2504.14815 | null |
2025-04-21 | When Cloud Removal Meets Diffusion Model in Remote Sensing | Zhenyu Yu et.al. | 2504.14785 | null |
2025-04-21 | Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model | Ahmed Sobhi Saleh et.al. | 2504.14782 | null |
2025-04-20 | Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens | Kaihang Pan et.al. | 2504.14666 | null |
2025-04-20 | REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models | Chongye Guo et.al. | 2504.14554 | null |
2025-04-20 | VGNC: Reducing the Overfitting of Sparse-view 3DGS via Validation-guided Gaussian Number Control | Lifeng Lin et.al. | 2504.14548 | null |
2025-04-20 | FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models | Kuanting Wu et.al. | 2504.14535 | null |
2025-04-20 | SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization | Liang Peng et.al. | 2504.14534 | link |
2025-04-25 | DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning | Fulong Ye et.al. | 2504.14509 | link |
2025-04-20 | Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis | Jingjing Ren et.al. | 2504.14470 | null |
2025-04-24 | Causal Disentanglement for Robust Long-tail Medical Image Generation | Weizhi Nie et.al. | 2504.14450 | null |
2025-04-19 | SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation | Minho Park et.al. | 2504.14396 | link |
2025-04-25 | Manipulating Multimodal Agents via Cross-Modal Prompt Injection | Le Wang et.al. | 2504.14348 | null |
2025-04-19 | Visual Prompting for One-shot Controllable Video Editing without Inversion | Zhengbo Zhang et.al. | 2504.14335 | null |
2025-04-19 | Diffusion-based Dynamic Contract for Federated AI Agent Construction in Mobile Metaverses | Jinbo Wen et.al. | 2504.14326 | null |
2025-04-19 | RadioDiff-Inverse: Diffusion Enhanced Bayesian Inverse Estimation for ISAC Radio Map Construction | Xiucheng Wang et.al. | 2504.14298 | null |
2025-04-19 | From Missing Pieces to Masterpieces: Image Completion with Context-Adaptive Diffusion | Pourya Shamsolmoali et.al. | 2504.14294 | null |
2025-04-19 | Towards NSFW-Free Text-to-Image Generation via Safety-Constraint Direct Preference Optimization | Shouwei Ruan et.al. | 2504.14290 | null |
2025-04-19 | Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction | Li Yu et.al. | 2504.14267 | null |
2025-04-19 | Cross-attention for State-based model RWKV-7 | Liu Xiao et.al. | 2504.14260 | link |
2025-04-19 | Towards Explainable Fake Image Detection with Multi-Modal Large Language Models | Yikun Ji et.al. | 2504.14245 | link |
2025-04-19 | PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling | Alara Dirik et.al. | 2504.14219 | null |
2025-04-19 | Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis | Zichuan Liu et.al. | 2504.14202 | null |
2025-04-19 | Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach | Hangyu Liu et.al. | 2504.14137 | null |
2025-04-19 | Exploring Language Patterns of Prompts in Text-to-Image Generation and Their Impact on Visual Diversity | Maria-Teresa De Rosa Palmini et.al. | 2504.14125 | null |
2025-04-18 | System of Agentic AI for the Discovery of Metal-Organic Frameworks | Theo Jaffrelot Inizan et.al. | 2504.14110 | null |
2025-04-18 | Point-Driven Interactive Text and Image Layer Editing Using Diffusion Models | Zhenyu Yu et.al. | 2504.14108 | null |
2025-04-18 | A thermodynamically consistent and robust four-equation model for multi-phase multi-component compressible flows using ENO-type schemes including interface regularization | Henry Collis et.al. | 2504.14063 | null |
2025-04-28 | Adaptive Diffusion Models for Sparse-View Motion-Corrected Head Cone-beam CT | Antoine De Paepe et.al. | 2504.14033 | null |
2025-04-18 | Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation | Fulvio Sanguigni et.al. | 2504.14011 | null |
2025-04-18 | Entropy Rectifying Guidance for Diffusion and Flow Models | Tariq Berrada Ifriqi et.al. | 2504.13987 | null |
2025-04-23 | Decoding Vision Transformers: the Diffusion Steering Lens | Ryota Takatsuki et.al. | 2504.13763 | link |
2025-04-18 | ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis | Andrea Rigo et.al. | 2504.13745 | null |
2025-04-18 | MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection | Lin Yuan et.al. | 2504.13726 | null |
2025-04-18 | Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning | Tao He et.al. | 2504.13643 | null |
2025-04-18 | SupResDiffGAN a new approach for the Super-Resolution task | Dawid Kopeć et.al. | 2504.13622 | null |
2025-05-04 | Entropic Time Schedulers for Generative Diffusion Models | Dejan Stancevic et.al. | 2504.13612 | null |
2025-04-18 | WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion | Yang Wu et.al. | 2504.13561 | null |
2025-04-18 | Task Assignment and Exploration Optimization for Low Altitude UAV Rescue via Generative AI Enhanced Multi-agent Reinforcement Learning | Xin Tang et.al. | 2504.13554 | null |
2025-04-18 | Q-FAKER: Query-free Hard Black-box Attack via Controlled Generation | CheolWon Na et.al. | 2504.13551 | null |
2025-04-18 | Beyond One-Hot Labels: Semantic Mixing for Model Calibration | Haoyang Luo et.al. | 2504.13548 | link |
2025-04-26 | U-Shape Mamba: State Space Model for faster diffusion | Alex Ergasti et.al. | 2504.13499 | link |
2025-04-18 | Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing | Joowon Kim et.al. | 2504.13490 | null |
2025-04-18 | POET: Supporting Prompting Creativity and Personalization with Automated Expansion of Text-to-Image Generation | Evans Xu Han et.al. | 2504.13392 | null |
2025-04-17 | SMPL-GPTexture: Dual-View 3D Human Texture Estimation using Text-to-Image Generation Models | Mingxiao Tu et.al. | 2504.13378 | null |
2025-04-17 | On the minimax optimality of Flow Matching through the connection to kernel density estimation | Lea Kunkel et.al. | 2504.13336 | null |
2025-04-17 | Image Editing with Diffusion Models: A Survey | Jia Wang et.al. | 2504.13226 | null |
2025-04-17 | ICAS: IP Adapter and ControlNet-based Attention Structure for Multi-Subject Style Transfer Optimization | Fuwei Liu et.al. | 2504.13224 | null |
2025-04-16 | Wavelet-based Variational Autoencoders for High-Resolution Image Generation | Andrew Kiruluta et.al. | 2504.13214 | null |
2025-04-17 | IMAGGarment-1: Fine-Grained Garment Generation for Controllable Fashion Design | Fei Shen et.al. | 2504.13176 | link |
2025-04-17 | SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs | Haoxuan Li et.al. | 2504.13172 | null |
2025-04-17 | Personalized Text-to-Image Generation with Auto-Regressive Models | Kaiyue Sun et.al. | 2504.13162 | link |
2025-04-18 | Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo | João Loula et.al. | 2504.13139 | null |
2025-04-17 | Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training | Xinsong Zhang et.al. | 2504.13123 | null |
2025-04-17 | UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models | Guanlong Jiao et.al. | 2504.13109 | null |
2025-04-21 | SkyReels-V2: Infinite-length Film Generative Model | Guibin Chen et.al. | 2504.13074 | link |
2025-04-17 | HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation | Wenqi Dong et.al. | 2504.13072 | null |
2025-04-17 | ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models | Linkang Du et.al. | 2504.13061 | link |
2025-04-17 | RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins | Yao Mu et.al. | 2504.13059 | null |
2025-04-17 | TTRD3: Texture Transfer Residual Denoising Dual Diffusion Model for Remote Sensing Image Super-Resolution | Yide Liu et.al. | 2504.13026 | link |
2025-04-17 | Image-Editing Specialists: An RLAIF Approach for Diffusion Models | Elior Benarous et.al. | 2504.12833 | link |
2025-04-17 | Set You Straight: Auto-Steering Denoising Trajectories to Sidestep Unwanted Concepts | Leyang Li et.al. | 2504.12782 | link |
2025-04-17 | Privacy Protection Against Personalized Text-to-Image Synthesis via Cross-image Consistency Constraints | Guanyu Wang et.al. | 2504.12747 | null |
2025-04-17 | SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding | Qianqian Sun et.al. | 2504.12704 | null |
2025-04-21 | A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation | Rongtao Xu et.al. | 2504.12636 | null |
2025-04-21 | Packing Input Frame Context in Next-Frame Prediction Models for Video Generation | Lvmin Zhang et.al. | 2504.12626 | link |
2025-04-17 | Prompt-Driven and Training-Free Forgetting Approach and Dataset for Large Language Models | Zhenyu Yu et.al. | 2504.12574 | null |
2025-04-16 | Generalization through variance: how noise shapes inductive biases in diffusion models | John J. Vastola et.al. | 2504.12532 | link |
2025-04-16 | Diffusion Based Robust LiDAR Place Recognition | Benjamin Krummenacher et.al. | 2504.12412 | null |
2025-04-16 | InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework | Jiale Tao et.al. | 2504.12395 | link |
2025-04-16 | DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging | Tianhui Song et.al. | 2504.12364 | link |
2025-04-18 | WaterFlow: Learning Fast & Robust Watermarks using Stable Diffusion | Vinay Shukla et.al. | 2504.12354 | null |
2025-04-23 | Deep Generative Model-Based Generation of Synthetic Individual-Specific Brain MRI Segmentations | Ruijie Wang et.al. | 2504.12352 | link |
2025-04-15 | Prototype-Guided Diffusion for Digital Pathology: Achieving Foundation Model Performance with Minimal Clinical Data | Ekaterina Redekop et.al. | 2504.12351 | null |
2025-04-16 | VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate | Zhihang Yuan et.al. | 2504.12259 | link |
2025-04-16 | SIDME: Self-supervised Image Demoiréing via Masked Encoder-Decoder Reconstruction | Xia Wang et.al. | 2504.12245 | null |
2025-05-03 | Cobra: Efficient Line Art COlorization with BRoAder References | Junhao Zhuang et.al. | 2504.12240 | null |
2025-04-16 | Coding-Prior Guided Diffusion Network for Video Deblurring | Yike Liu et.al. | 2504.12222 | null |
2025-04-23 | Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis | Songping Wang et.al. | 2504.12129 | null |
2025-04-16 | A Diffusion-Based Framework for Terrain-Aware Remote Sensing Image Reconstruction | Zhenyu Yu et.al. | 2504.12112 | null |
2025-04-16 | Generalized Visual Relation Detection with Diffusion Models | Kaifeng Gao et.al. | 2504.12100 | null |
2025-04-16 | Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM | Zirui Pan et.al. | 2504.12048 | null |
2025-04-16 | Contract-based hierarchical control using predictive feasibility value functions | Felix Berkel et.al. | 2504.12036 | null |
2025-04-17 | Understanding Attention Mechanism in Video Diffusion Models | Bingyan Liu et.al. | 2504.12027 | null |
2025-04-16 | Instruction-augmented Multimodal Alignment for Image-Text and Element Matching | Xinli Yue et.al. | 2504.12018 | null |
2025-04-17 | Dual-Energy Cone-Beam CT Using Two Orthogonal Projection Views: A Phantom Study | Junbo Peng et.al. | 2504.12010 | null |
2025-04-16 | Generative Recommendation with Continuous-Token Diffusion | Haohao Qu et.al. | 2504.12007 | null |
2025-04-16 | Novel-view X-ray Projection Synthesis through Geometry-Integrated Deep Learning | Daiqi Liu et.al. | 2504.11953 | link |
2025-04-16 | R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors | Haoyang Wang et.al. | 2504.11946 | null |
2025-04-18 | Mind2Matter: Creating 3D Models from EEG Signals | Xia Deng et.al. | 2504.11936 | link |
2025-04-16 | SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models | Zeyu Dai et.al. | 2504.11923 | null |
2025-04-16 | A Bidirectional DeepParticle Method for Efficiently Solving Low-dimensional Transport Map Problems | Tan Zhang et.al. | 2504.11851 | null |
2025-04-16 | ACE: Attentional Concept Erasure in Diffusion Models | Finn Carter et.al. | 2504.11850 | null |
2025-04-16 | Generation of Paths for Motion Planning for a Dubins Vehicle on Sphere | Deepak Prakash Kumar et.al. | 2504.11832 | link |
2025-04-16 | TextDiffSeg: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation | Kangbo Ma et.al. | 2504.11825 | null |
2025-04-16 | PCDiff: Proactive Control for Ownership Protection in Diffusion Models with Watermark Compatibility | Keke Gai et.al. | 2504.11774 | null |
2025-04-16 | The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation | Bingjie Gao et.al. | 2504.11739 | null |
2025-04-16 | EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos | Jilan Xu et.al. | 2504.11732 | null |
2025-04-16 | Towards Safe Synthetic Image Generation On the Web: A Multimodal Robust NSFW Defense and Million Scale Dataset | Muhammad Shahid Muneer et.al. | 2504.11707 | link |
2025-04-16 | DM-OSVP++: One-Shot View Planning Using 3D Diffusion Models for Active RGB-Based Object Reconstruction | Sicong Pan et.al. | 2504.11674 | link |
2025-04-15 | LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation | Wei-Jer Chang et.al. | 2504.11521 | null |
2025-04-19 | Flux Already Knows – Activating Subject-Driven Image Generation without Training | Hao Kang et.al. | 2504.11478 | null |
2025-04-15 | Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception | Ziqi Pang et.al. | 2504.11457 | link |
2025-04-15 | SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL | Junke Wang et.al. | 2504.11455 | link |
2025-04-16 | Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion | An Zhao et.al. | 2504.11447 | link |
2025-04-15 | NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors | Yanrui Bin et.al. | 2504.11427 | null |
2025-04-15 | ADT: Tuning Diffusion Models with Adversarial Supervision | Dazhong Shen et.al. | 2504.11423 | null |
2025-04-17 | VideoPanda: Video Panoramic Diffusion with Multi-view Attention | Kevin Xie et.al. | 2504.11389 | null |
2025-04-15 | Omni $^2$ : Unifying Omnidirectional Image Generation and Editing in an Omni Model | Liu Yang et.al. | 2504.11379 | null |
2025-04-16 | Seedream 3.0 Technical Report | Yu Gao et.al. | 2504.11346 | null |
2025-04-15 | Autoregressive Distillation of Diffusion Transformers | Yeongmin Kim et.al. | 2504.11295 | link |
2025-04-15 | UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer | Xiang Wang et.al. | 2504.11289 | link |
2025-04-15 | SAR-to-RGB Translation with Latent Diffusion for Earth Observation | Kaan Aydin et.al. | 2504.11154 | null |
2025-04-15 | Taming Consistency Distillation for Accelerated Human Image Animation | Xiang Wang et.al. | 2504.11143 | null |
2025-04-15 | Token-Level Constraint Boundary Search for Jailbreaking Text-to-Image Models | Jiangtao Liu et.al. | 2504.11106 | null |
2025-04-15 | Using LLMs as prompt modifier to avoid biases in AI image generators | René Peinl et.al. | 2504.11104 | null |
2025-04-15 | Defending Against Frequency-Based Attacks with Diffusion Models | Fatemeh Amerehi et.al. | 2504.11034 | null |
2025-04-15 | AnimeDL-2M: Million-Scale AI-Generated Anime Image Detection and Localization in Diffusion Era | Chenyang Zhu et.al. | 2504.11015 | null |
2025-04-15 | TMCIR: Token Merge Benefits Composed Image Retrieval | Chaoyang Wang et.al. | 2504.10995 | null |
2025-04-15 | ProtFlow: Fast Protein Sequence Design via Flow Matching on Compressed Protein Language Model Embeddings | Zitai Kong et.al. | 2504.10983 | null |
2025-04-15 | InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation | Yukang Lin et.al. | 2504.10905 | null |
2025-04-15 | Bringing together invertible UNets with invertible attention modules for memory-efficient diffusion models | Karan Jain et.al. | 2504.10883 | null |
2025-04-18 | PT-Mark: Invisible Watermarking for Text-to-image Diffusion Models via Semantic-aware Pivotal Tuning | Yaopeng Wang et.al. | 2504.10853 | null |
2025-04-15 | SteerMusic: Enhanced Musical Consistency for Zero-shot Text-Guided and Personalized Music Editing | Xinlei Niu et.al. | 2504.10826 | null |
2025-04-15 | OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding | Dianbing Xi et.al. | 2504.10825 | null |
2025-04-15 | IlluSign: Illustrating Sign Language Videos by Leveraging the Attention Mechanism | Janna Bruner et.al. | 2504.10822 | null |
2025-04-17 | GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR | Christophe Bolduc et.al. | 2504.10809 | null |
2025-04-14 | SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models | Stathis Galanakis et.al. | 2504.10716 | null |
2025-04-14 | H-MoRe: Learning Human-centric Motion Representation for Action Analysis | Zhanbo Huang et.al. | 2504.10676 | null |
2025-04-14 | On the Contractivity of Stochastic Interpolation Flow | Max Daniels et.al. | 2504.10653 | null |
2025-04-14 | H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models | Yushu Wu et.al. | 2504.10567 | null |
2025-04-14 | Beyond the Generative Learning Trilemma: Generative Model Assessment in Data Scarcity Domains | Marco Salmè et.al. | 2504.10555 | null |
2025-04-13 | AB-Cache: Training-Free Acceleration of Diffusion Models via Adams-Bashforth Cached Feature Reuse | Zichao Yu et.al. | 2504.10540 | null |
2025-04-14 | REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers | Xingjian Leng et.al. | 2504.10483 | null |
2025-04-14 | Art3D: Training-Free 3D Generation from Flat-Colored Illustration | Xiaoyan Cong et.al. | 2504.10466 | null |
2025-04-14 | Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing | Taihang Hu et.al. | 2504.10434 | link |
2025-04-14 | MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model | Jian Liu et.al. | 2504.10433 | link |
2025-04-14 | FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos | Rui Chen et.al. | 2504.10358 | null |
2025-04-14 | Improving diffusion modeling in all-solid-state lithium batteries: a novel approach for grain boundary effects | Lena Scholz et.al. | 2504.10348 | null |
2025-04-21 | InstructEngine: Instruction-driven Text-to-Image Alignment | Xingyu Lu et.al. | 2504.10329 | null |
2025-04-14 | Analysis of Attention in Video Diffusion Transformers | Yuxin Wen et.al. | 2504.10317 | null |
2025-04-14 | ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting | Huiqi Wu et.al. | 2504.10316 | null |
2025-04-14 | DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing | Jinyue Zhang et.al. | 2504.10278 | null |
2025-04-14 | VibrantLeaves: A principled parametric image generator for training deep restoration models | Raphael Achddou et.al. | 2504.10201 | link |
2025-04-14 | Efficient Generative Model Training via Embedded Representation Warmup | Deyuan Liu et.al. | 2504.10188 | link |
2025-04-21 | Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers | Chunyang Zhang et.al. | 2504.10148 | null |
2025-04-14 | GeoUni: A Unified Model for Generating Geometry Diagrams, Problems and Problem Solutions | Jo-Ku Cheng et.al. | 2504.10146 | null |
2025-04-14 | Aligning Anime Video Generation with Human Feedback | Bingwen Zhu et.al. | 2504.10044 | null |
2025-04-14 | Masked Autoencoder Self Pre-Training for Defect Detection in Microelectronics | Nikolai Röhrich et.al. | 2504.10021 | null |
2025-04-14 | Improving Controller Generalization with Dimensionless Markov Decision Processes | Valentin Charvet et.al. | 2504.10006 | null |
2025-04-14 | NaviDiffusor: Cost-Guided Diffusion Model for Visual Navigation | Yiming Zeng et.al. | 2504.10003 | null |
2025-04-16 | GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting | Junlin Hao et.al. | 2504.10001 | null |
2025-04-15 | OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation | Si-Tong Wei et.al. | 2504.09975 | link |
2025-04-14 | Semi-implicit-explicit Runge-Kutta method for nonlinear differential equations | Lingyun Ding et.al. | 2504.09969 | link |
2025-04-14 | Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes | Huijie Liu et.al. | 2504.09948 | null |
2025-04-14 | Efficient Task-specific Conditional Diffusion Policies: Shortcut Model Acceleration and SO(3) Optimization | Haiyong Yu et.al. | 2504.09927 | null |
2025-04-14 | Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis | Zihao Liu et.al. | 2504.09885 | null |
2025-04-14 | EquiVDM: Equivariant Video Diffusion Models with Temporally Consistent Noise | Chao Liu et.al. | 2504.09789 | null |
2025-04-13 | Stochastic generative methods for stable and accurate closure modeling of chaotic dynamical systems | Emily Williams et.al. | 2504.09750 | null |
2025-04-13 | SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow | Kenan Tang et.al. | 2504.09697 | link |
2025-04-13 | KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation | Xingrui Wang et.al. | 2504.09656 | null |
2025-04-13 | Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training | Lexington Whalen et.al. | 2504.09606 | null |
2025-04-16 | Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark | Jinhao Li et.al. | 2504.09555 | null |
2025-04-13 | DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion | Puyu Han et.al. | 2504.09513 | null |
2025-04-13 | CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models | Pooja Guhan et.al. | 2504.09472 | null |
2025-04-13 | D $^2$ iT: Dynamic Diffusion Transformer for Accurate Image Generation | Weinan Jia et.al. | 2504.09454 | null |
2025-04-13 | Structure-Accurate Medical Image Translation based on Dynamic Frequency Balance and Knowledge Guidance | Jiahua Xu et.al. | 2504.09441 | null |
2025-04-13 | Scalable Motion In-betweening via Diffusion and Physics-Based Character Adaptation | Jia Qin et.al. | 2504.09413 | null |
2025-04-12 | Hierarchical protein backbone generation with latent and structure diffusion | Jason Yim et.al. | 2504.09374 | null |
2025-04-12 | Text To 3D Object Generation For Scalable Room Assembly | Sonia Laguna et.al. | 2504.09328 | null |
2025-04-12 | MedIL: Implicit Latent Spaces for Generating Heterogeneous Medical Images at Arbitrary Resolutions | Tyler Spears et.al. | 2504.09322 | null |
2025-04-12 | Towards Explainable Partial-AIGC Image Quality Assessment | Jiaying Qian et.al. | 2504.09291 | null |
2025-04-12 | No-Regret Generative Modeling via Parabolic Monge-Ampère PDE | Nabarun Deb et.al. | 2504.09279 | null |
2025-04-12 | Head-Aware KV Cache Compression for Efficient Visual Autoregressive Modeling | Ziran Qin et.al. | 2504.09261 | null |
2025-04-12 | Ensemble Score Filter for Data Assimilation of Two-Phase Flow Models in Porous Media | Ruoyu Hu et.al. | 2504.09245 | null |
2025-04-12 | REALM: Real-Time Estimates of Assistance for Learned Models in Human-Robot Interaction | Michael Hagenow et.al. | 2504.09243 | link |
2025-04-12 | Generation of Musical Timbres using a Text-Guided Diffusion Model | Weixuan Yuan et.al. | 2504.09219 | null |
2025-04-12 | From Visual Explanations to Counterfactual Explanations with Latent Diffusion | Tung Luu et.al. | 2504.09202 | null |
2025-04-12 | seg2med: a segmentation-based medical image generation framework using denoising diffusion probabilistic models | Zeyu Yang et.al. | 2504.09182 | null |
2025-04-12 | BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting | Jeongwan On et.al. | 2504.09097 | null |
2025-04-12 | Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization | Gen Li et.al. | 2504.09039 | null |
2025-04-11 | PolyConf: Unlocking Polymer Conformation Generation through Hierarchical Generative Models | Fanmeng Wang et.al. | 2504.08859 | link |
2025-04-09 | Probabilistic QoS Metric Forecasting in Delay-Tolerant Networks Using Conditional Diffusion Models on Latent Dynamics | Enming Zhang et.al. | 2504.08821 | null |
2025-04-05 | Embedding Hidden Adversarial Capabilities in Pre-Trained Diffusion Models | Lucas Beerens et.al. | 2504.08782 | link |
2025-04-11 | GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation | Tianwei Xiong et.al. | 2504.08736 | link |
2025-04-11 | Generating Fine Details of Entity Interactions | Xinyi Gu et.al. | 2504.08714 | null |
2025-04-11 | Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model | Team Seawead et.al. | 2504.08685 | null |
2025-04-11 | Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization | Jialu Li et.al. | 2504.08641 | null |
2025-04-11 | Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging | Gabriele Lozupone et.al. | 2504.08635 | link |
2025-04-11 | Discretization Error Analysis of a High Order Unfitted Space-Time Method for moving domain problems | Fabian Heimann et.al. | 2504.08608 | null |
2025-04-11 | Neural Fidelity Calibration for Informative Sim-to-Real Adaptation | Youwei Yu et.al. | 2504.08604 | null |
2025-04-11 | ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration | Yongsheng Yu et.al. | 2504.08591 | null |
2025-04-14 | COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails | Miguel Espinosa et.al. | 2504.08548 | null |
2025-04-11 | Discriminator-Free Direct Preference Optimization for Video Diffusion | Haoran Cheng et.al. | 2504.08542 | null |
2025-04-11 | Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Bram Vanherle et.al. | 2504.08473 | link |
2025-04-11 | On the Design of Diffusion-based Neural Speech Codecs | Pietro Foti et.al. | 2504.08470 | null |
2025-04-11 | Muon-Accelerated Attention Distillation for Real-Time Edge Synthesis via Optimized Latent Diffusion | Weiye Chen et.al. | 2504.08451 | link |
2025-04-11 | Diffusion Models for Robotic Manipulation: A Survey | Rosa Wolf et.al. | 2504.08438 | null |
2025-04-11 | MixDiT: Accelerating Image Diffusion Transformer Inference with Mixed-Precision MX Quantization | Daeun Kim et.al. | 2504.08398 | null |
2025-04-11 | LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs | Jiarui Wang et.al. | 2504.08358 | link |
2025-04-11 | Single View Garment Reconstruction Using Diffusion Mapping Via Pattern Coordinates | Ren Li et.al. | 2504.08353 | null |
2025-04-11 | Geometric Consistency Refinement for Single Image Novel View Synthesis via Test-Time Adaptation of Diffusion Models | Josef Bengtson et.al. | 2504.08348 | null |
2025-04-11 | EasyGenNet: An Efficient Framework for Audio-Driven Gesture Video Generation Based on Diffusion Model | Renda Li et.al. | 2504.08344 | null |
2025-04-11 | Generative AI for Film Creation: A Survey of Recent Advances | Ruihan Zhang et.al. | 2504.08296 | null |
2025-04-11 | Palmprint De-Identification Using Diffusion Model for High-Quality and Diverse Synthesis | Licheng Yan et.al. | 2504.08272 | null |
2025-04-11 | CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model | Ruohao Zhan et.al. | 2504.08259 | null |
2025-04-11 | RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements | Guangcong Zheng et.al. | 2504.08212 | link |
2025-04-11 | Particle Hit Clustering and Identification Using Point Set Transformers in Liquid Argon Time Projection Chambers | Edgar E. Robles et.al. | 2504.08182 | null |
2025-04-11 | TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation | Ruineng Li et.al. | 2504.08181 | null |
2025-04-10 | POEM: Precise Object-level Editing via MLLM control | Marco Schouten et.al. | 2504.08111 | null |
2025-04-10 | ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting | Junbang Liu et.al. | 2504.08100 | link |
2025-04-10 | Teaching Humans Subtle Differences with DIFFusion | Mia Chiquier et.al. | 2504.08046 | null |
2025-04-09 | Have we unified image generation and understanding yet? An empirical study of GPT-4o’s image generation ability | Ning Li et.al. | 2504.08003 | null |
2025-04-09 | IGG: Image Generation Informed by Geodesic Dynamics in Deformation Spaces | Nian Wu et.al. | 2504.07999 | link |
2025-04-08 | CDM-QTA: Quantized Training Acceleration for Efficient LoRA Fine-Tuning of Diffusion Model | Jinming Lu et.al. | 2504.07998 | null |
2025-04-10 | PixelFlow: Pixel-Space Generative Models with Flow | Shoufa Chen et.al. | 2504.07963 | link |
2025-04-10 | Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction | Zeren Jiang et.al. | 2504.07961 | link |
2025-04-10 | VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning | Zhong-Yu Li et.al. | 2504.07960 | null |
2025-04-10 | GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces | Hao Yu et.al. | 2504.07945 | null |
2025-04-17 | Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos | Rundong Luo et.al. | 2504.07940 | null |
2025-04-10 | Optimal Control For Anti-Abeta Treatment in Alzheimer’s Disease using a Reaction-Diffusion Model | Wenrui Hao et.al. | 2504.07913 | null |
2025-04-10 | DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows | Mashrur M. Morshed et.al. | 2504.07894 | null |
2025-04-10 | Towards Sustainable Creativity Support: An Exploratory Study on Prompt Based Image Generation | Daniel Hove Paludan et.al. | 2504.07879 | null |
2025-04-10 | Revisiting Likelihood-Based Out-of-Distribution Detection by Modeling Representations | Yifan Ding et.al. | 2504.07793 | link |
2025-04-10 | Generalized Passivity Sensitivity Methodology for Small-Signal Stability Analysis | Dongyeong Lee et.al. | 2504.07788 | null |
2025-04-10 | Virtual-mask Informed Prior for Sparse-view Dual-Energy CT Reconstruction | Zini Chen et.al. | 2504.07753 | null |
2025-04-10 | Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval | Zehong Ma et.al. | 2504.07718 | null |
2025-04-10 | Conditional Conformal Risk Adaptation | Rui Luo et.al. | 2504.07611 | null |
2025-04-18 | Diffusion Transformers for Tabular Data Time Series Generation | Fabrizio Garuti et.al. | 2504.07566 | link |
2025-04-10 | PhaseGen: A Diffusion-Based Approach for Complex-Valued MRI Data Generation | Moritz Rempe et.al. | 2504.07560 | link |
2025-04-10 | TokenFocus-VQA: Enhancing Text-to-Image Alignment with Position-Aware Focus and Multi-Perspective Aggregations on LVLMs | Zijian Zhang et.al. | 2504.07556 | null |
2025-04-10 | STeP: A General and Scalable Framework for Solving Video Inverse Problems with Spatiotemporal Diffusion Priors | Bingliang Zhang et.al. | 2504.07549 | link |
2025-04-10 | A mass conserved reaction-diffusion system reveals switching between coexisting polar and oscillatory cell motility states | Jack M. Hughes et.al. | 2504.07446 | null |
2025-04-10 | Unifying and extending Diffusion Models through PDEs for solving Inverse Problems | Agnimitra Dasgupta et.al. | 2504.07437 | null |
2025-04-10 | Conditional Data Synthesis Augmentation | Xinyu Tian et.al. | 2504.07426 | null |
2025-04-10 | Routing to the Right Expertise: A Trustworthy Judge for Instruction-based Image Editing | Chenxi Sun et.al. | 2504.07424 | null |
2025-04-10 | FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation | Linyan Huang et.al. | 2504.07405 | null |
2025-04-18 | ID-Booth: Identity-consistent Face Generation with Diffusion Models | Darian Tomašević et.al. | 2504.07392 | link |
2025-04-10 | Model Discrepancy Learning: Synthetic Faces Detection Based on Multi-Reconstruction | Qingchao Jiang et.al. | 2504.07382 | link |
2025-04-10 | Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction | Junyi Ma et.al. | 2504.07375 | link |
2025-04-09 | A Unified Framework for Large-Scale Classification: Error Rate Control and Optimality | Yinrui Sun et.al. | 2504.07321 | null |
2025-04-09 | MoEDiff-SR: Mixture of Experts-Guided Diffusion Model for Region-Adaptive MRI Super-Resolution | Zhe Wang et.al. | 2504.07308 | link |
2025-04-14 | MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data | Paul Borne–Pons et.al. | 2504.07210 | link |
2025-04-09 | OmniCaptioner: One Captioner to Rule Them All | Yiting Lu et.al. | 2504.07089 | link |
2025-04-09 | A Unified Agentic Framework for Evaluating Conditional Image Generation | Jifang Wang et.al. | 2504.07046 | link |
2025-04-09 | Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies | Jonas Loos et.al. | 2504.07008 | link |
2025-04-09 | PathSegDiff: Pathology Segmentation using Diffusion model representations | Sachin Kumar Danisetty et.al. | 2504.06950 | null |
2025-04-09 | MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs | Jiawei Mao et.al. | 2504.06897 | null |
2025-04-09 | EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation | Diljeet Jagpal et.al. | 2504.06861 | null |
2025-04-09 | CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading | Mishan Aliev et.al. | 2504.06856 | null |
2025-04-16 | DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation | Wangbo Zhao et.al. | 2504.06803 | link |
2025-04-09 | A Meaningful Perturbation Metric for Evaluating Explainability Methods | Danielle Cohen et.al. | 2504.06800 | null |
2025-04-09 | DIMA: DIffusing Motion Artifacts for unsupervised correction in brain MRI images | Paolo Angella et.al. | 2504.06767 | null |
2025-04-10 | Compass Control: Multi Object Orientation Control for Text-to-Image Generation | Rishubh Parihar et.al. | 2504.06752 | null |
2025-04-09 | Probability Density Geodesics in Image Diffusion Latent Space | Qingtao Yu et.al. | 2504.06675 | null |
2025-04-09 | RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism | Elia Peruzzo et.al. | 2504.06672 | null |
2025-04-09 | Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception | Ruotian Peng et.al. | 2504.06666 | null |
2025-04-09 | Collision avoidance from monocular vision trained with novel view synthesis | Valentin Tordjman–Levavasseur et.al. | 2504.06651 | null |
2025-04-09 | PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering | Yifan Gao et.al. | 2504.06632 | null |
2025-04-09 | Diffusion Factor Models: Generating High-Dimensional Returns with Factor Structure | Minshuo Chen et.al. | 2504.06566 | null |
2025-04-09 | DiffusionCom: Structure-Aware Multimodal Diffusion Model for Multimodal Knowledge Graph Completion | Wei Huang et.al. | 2504.06543 | null |
2025-04-08 | Towards Holistic Prompt Craft | Joseph Lindley et.al. | 2504.06496 | null |
2025-04-11 | D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition | Rupayan Mallick et.al. | 2504.06432 | null |
2025-04-08 | Unifying Autoregressive and Diffusion-Based Sequence Generation | Nima Fathi et.al. | 2504.06416 | null |
2025-04-12 | Text-to-Image Models and Their Representation of People from Different Nationalities Engaging in Activities | Abdulkareem Alsudais et.al. | 2504.06313 | null |
2025-04-08 | DMol: A Schedule-Driven Diffusion Model for Highly Efficient and Versatile Molecule Generation | Peizhi Niu et.al. | 2504.06312 | null |
2025-04-08 | Transfer between Modalities with MetaQueries | Xichen Pan et.al. | 2504.06256 | null |
2025-04-08 | Electronic Structure Guided Inverse Design Using Generative Models | Shuyi Jia et.al. | 2504.06249 | link |
2025-04-08 | HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance | Jiazi Bu et.al. | 2504.06232 | null |
2025-04-08 | A Training-Free Style-aligned Image Generation with Scale-wise Autoregressive Model | Jihun Park et.al. | 2504.06144 | null |
2025-04-08 | Rhythmic neuromorphic control of a pendulum: A hybrid systems analysis | E. Petri et.al. | 2504.06046 | null |
2025-04-08 | OSDM-MReg: Multimodal Image Registration based One Step Diffusion Model | Xiaochen Wei et.al. | 2504.06027 | null |
2025-04-08 | CamContextI2V: Context-aware Controllable Video Generation | Luis Denninger et.al. | 2504.06022 | link |
2025-04-10 | An Empirical Study of GPT-4o Image Generation Capabilities | Sixiang Chen et.al. | 2504.05979 | link |
2025-04-08 | Diffusion Based Ambiguous Image Segmentation | Jakob Lønborg Christensen et.al. | 2504.05977 | null |
2025-04-08 | SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation | Hao Du et.al. | 2504.05925 | null |
2025-04-08 | Physics-aware generative models for turbulent fluid flows through energy-consistent stochastic interpolants | Nikolaj T. Mücke et.al. | 2504.05852 | link |
2025-04-08 | On the Importance of Conditioning for Privacy-Preserving Data Augmentation | Julian Lorenz et.al. | 2504.05849 | null |
2025-04-08 | Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking | Junxi Chen et.al. | 2504.05838 | link |
2025-04-08 | Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models | Jiahao Chen et.al. | 2504.05815 | null |
2025-04-08 | Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling | Jaskirat Singh et.al. | 2504.05800 | null |
2025-04-08 | QEMesh: Employing A Quadric Error Metrics-Based Representation for Mesh Generation | Jiaqi Li et.al. | 2504.05720 | null |
2025-04-08 | Reconstruction-Free Anomaly Detection with Diffusion Models via Direct Latent Likelihood Evaluation | Shunsuke Sakai et.al. | 2504.05662 | link |
2025-04-08 | Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model | Qi Mao et.al. | 2504.05594 | null |
2025-04-07 | PartStickers: Generating Parts of Objects for Rapid Prototyping | Mo Zhou et.al. | 2504.05508 | null |
2025-04-07 | Studying Image Diffusion Features for Zero-Shot Video Object Segmentation | Thanos Delatolas et.al. | 2504.05468 | null |
2025-04-07 | EP-Diffuser: An Efficient Diffusion Model for Traffic Scene Generation and Prediction via Polynomial Representations | Yue Yao et.al. | 2504.05422 | null |
2025-04-07 | Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling | Benjamin Lipkin et.al. | 2504.05410 | null |
2025-04-07 | CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models | Kavana Venkatesh et.al. | 2504.05306 | null |
2025-04-07 | Gaussian Mixture Flow Matching Models | Hansheng Chen et.al. | 2504.05304 | link |
2025-04-07 | Dimension-Free Convergence of Diffusion Models for Approximate Gaussian Mixtures | Gen Li et.al. | 2504.05300 | null |
2025-04-07 | One-Minute Video Generation with Test-Time Training | Karan Dalal et.al. | 2504.05298 | null |
2025-04-07 | DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration | Jiamei Xiong et.al. | 2504.05135 | null |
2025-04-07 | Graph-based Diffusion Model for Collaborative Filtering | Xuan Zhang et.al. | 2504.05029 | null |
2025-04-07 | SILVIA: Ultra-precision formation flying demonstration for space-based interferometry | Takahiro Ito et.al. | 2504.05001 | null |
2025-04-08 | REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning | Jihyun Lee et.al. | 2504.04956 | null |
2025-04-07 | Video-Bench: Human-Aligned Video Generation Benchmark | Hui Han et.al. | 2504.04907 | null |
2025-04-07 | Imagining the Far East: Exploring Perceived Biases in AI-Generated Images of East Asian Women | Xingyu Lan et.al. | 2504.04865 | null |
2025-04-07 | FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis | Mengchao Wang et.al. | 2504.04842 | null |
2025-04-09 | TabRep: a Simple and Effective Continuous Representation for Training Tabular Diffusion Models | Jacob Si et.al. | 2504.04798 | link |
2025-04-07 | Disentangling Instruction Influence in Diffusion Transformers for Parallel Multi-Instruction-Guided Image Editing | Hui Liu et.al. | 2504.04784 | null |
2025-04-07 | Continuous Locomotive Crowd Behavior Generation | Inhwan Bae et.al. | 2504.04756 | link |
2025-04-07 | Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches | Eloi Moliner et.al. | 2504.04751 | null |
2025-04-07 | AnyArtisticGlyph: Multilingual Controllable Artistic Glyph Generation | Xiongbo Lu et.al. | 2504.04743 | null |
2025-04-07 | TactileNet: Bridging the Accessibility Gap with AI-Generated Tactile Graphics for Individuals with Vision Impairment | Adnan Khan et.al. | 2504.04722 | null |
2025-04-13 | Diffusion-Based Approximate MPC: Fast and Consistent Imitation of Multi-Modal Action Distributions | Pau Marquez Julbe et.al. | 2504.04603 | null |
2025-04-08 | Your Image Generator Is Your New Private Dataset | Nicolo Resmini et.al. | 2504.04582 | null |
2025-04-06 | Cramer-Rao Bounds for Laplacian Matrix Estimation | Morad Halihal et.al. | 2504.04576 | null |
2025-04-06 | BrainMRDiff: A Diffusion Model for Anatomically Consistent Brain MRI Synthesis | Moinak Bhattacharya et.al. | 2504.04532 | null |
2025-04-06 | Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification | Shijian Wang et.al. | 2504.04510 | null |
2025-04-06 | PRISM: Probabilistic Representation for Integrated Shape Modeling and Generation | Lei Cheng et.al. | 2504.04454 | null |
2025-04-06 | UniToken: Harmonizing Multimodal Understanding and Generation through Unified Visual Encoding | Yang Jiao et.al. | 2504.04423 | null |
2025-04-06 | From Coarse to Fine: A Physics-Informed Self-Guided Flow Diffusion Model | Ruoyan Li et.al. | 2504.04375 | null |
2025-04-06 | DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation | Jinyang Li et.al. | 2504.04351 | null |
2025-04-05 | SDEIT: Semantic-Driven Electrical Impedance Tomography | Dong Liu et.al. | 2504.04185 | null |
2025-04-09 | Digital Gene: Learning about the Physical World through Analytic Concepts | Jianhua Sun et.al. | 2504.04170 | null |
2025-04-05 | Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization | Yikai Wang et.al. | 2504.04153 | link |
2025-04-05 | Multi-identity Human Image Animation with Structural Video Diffusion | Zhenzhi Wang et.al. | 2504.04126 | null |
2025-04-05 | Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models | Xuyang Guo et.al. | 2504.04051 | null |
2025-04-05 | Multi-resolution Score-Based Variational Graphical Diffusion for Causal Disaster System Modeling and Inference | Xuechun Li et.al. | 2504.04015 | link |
2025-04-05 | DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion | Maksim Siniukov et.al. | 2504.04010 | null |
2025-04-04 | Detection Limits and Statistical Separability of Tree Ring Watermarks in Rectified Flow-based Text-to-Image Generation Models | Ved Umrajkar et.al. | 2504.03850 | link |
2025-04-04 | A Hybrid Wavelet-Fourier Method for Next-Generation Conditional Diffusion Models | Andrew Kiruluta et.al. | 2504.03821 | null |
2025-04-02 | Proof of Humanity: A Multi-Layer Network Framework for Certifying Human-Originated Content in an AI-Dominated Internet | Sebastian Barros et.al. | 2504.03752 | null |
2025-04-01 | Attention in Diffusion Model: A Survey | Litao Hua et.al. | 2504.03738 | null |
2025-03-28 | Multi-Objective Quality-Diversity in Unstructured and Unbounded Spaces | Hannah Janmohamed et.al. | 2504.03715 | link |
2025-03-27 | Geometric Flow Models over Neural Network Weights | Ege Erdogan et.al. | 2504.03710 | null |
2025-04-07 | MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models | Wulin Xie et.al. | 2504.03641 | null |
2025-04-04 | Enhancing Causal Effect Estimation with Diffusion-Generated Data | Li Chen et.al. | 2504.03630 | null |
2025-04-04 | Quantifying the uncertainty of model-based synthetic image quality metrics | Ciaran Bench et.al. | 2504.03623 | null |
2025-04-04 | Multimodal Diffusion Bridge with Attention-Based SAR Fusion for Satellite Image Cloud Removal | Yuyang Hu et.al. | 2504.03607 | null |
2025-04-04 | Diffusion Active Learning: Towards Data-Driven Experimental Design in Computed Tomography | Luis Barba et.al. | 2504.03491 | null |
2025-04-04 | BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution | Zihao He et.al. | 2504.03490 | null |
2025-04-04 | Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis | Xi Wang et.al. | 2504.03471 | link |
2025-04-04 | D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations | Antoine Dumoulin et.al. | 2504.03468 | null |
2025-04-04 | QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning | Quanxing Xu et.al. | 2504.03337 | null |
2025-04-04 | FaR: Enhancing Multi-Concept Text-to-Image Diffusion via Concept Fusion and Localized Refinement | Gia-Nghia Tran et.al. | 2504.03292 | null |
2025-04-04 | On the Connection Between Diffusion Models and Molecular Dynamics | Liam Harcombe et.al. | 2504.03187 | null |
2025-04-04 | Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models | Xuran Ma et.al. | 2504.03140 | link |
2025-04-03 | How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models | Pascal Chang et.al. | 2504.03072 | null |
2025-04-03 | Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization | Junying Wang et.al. | 2504.03011 | null |
2025-04-03 | DiSRT-In-Bed: Diffusion-Based Sim-to-Real Transfer Framework for In-Bed Human Mesh Recovery | Jing Gao et.al. | 2504.03006 | null |
2025-04-03 | VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning | Xianwei Zhuang et.al. | 2504.02949 | link |
2025-04-03 | Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments | Chenyu Zhang et.al. | 2504.02918 | null |
2025-04-03 | Bias in Large Language Models Across Clinical Applications: A Systematic Review | Thanathip Suenghataiphorn et.al. | 2504.02917 | null |
2025-04-02 | Robust AI-Synthesized Image Detection via Multi-feature Frequency-aware Learning | Hongfei Cai et.al. | 2504.02879 | null |
2025-03-28 | The epistemic dimension of algorithmic fairness: assessing its impact in innovation diffusion and fair policy making | Eugenia Villa et.al. | 2504.02856 | null |
2025-04-03 | Concept Lancet: Image Editing with Compositional Representation Transplant | Jinqi Luo et.al. | 2504.02828 | null |
2025-04-03 | F-ViTA: Foundation Model Guided Visible to Thermal Translation | Jay N. Paranjape et.al. | 2504.02801 | link |
2025-04-16 | Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets | Chuning Zhu et.al. | 2504.02792 | null |
2025-04-03 | GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation | Zhiyuan Yan et.al. | 2504.02782 | link |
2025-04-03 | Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model | Shengjun Zhang et.al. | 2504.02764 | null |
2025-04-03 | MD-ProjTex: Texturing 3D Shapes with Multi-Diffusion Projection | Ahmet Burak Yildirim et.al. | 2504.02762 | null |
2025-04-04 | RBT4DNN: Requirements-based Testing of Neural Networks | Nusrat Jahan Mozumder et.al. | 2504.02737 | link |
2025-04-03 | RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models | ZhongLi Fang et.al. | 2504.02640 | null |
2025-04-03 | Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation | Jiwoo Chung et.al. | 2504.02612 | null |
2025-04-03 | Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression | Lucas Relic et.al. | 2504.02579 | null |
2025-04-03 | MAD: Makeup All-in-One with Cross-Domain Diffusion Model | Bo-Kai Ruan et.al. | 2504.02545 | null |
2025-04-07 | Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation | Fa-Ting Hong et.al. | 2504.02542 | link |
2025-04-03 | ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer | Jiayi Gao et.al. | 2504.02451 | link |
2025-04-03 | SkyReels-A2: Compose Anything in Video Diffusion Transformers | Zhengcong Fei et.al. | 2504.02436 | link |
2025-04-03 | Translation of Fetal Brain Ultrasound Images into Pseudo-MRI Images using Artificial Intelligence | Naomi Silverstein et.al. | 2504.02408 | null |
2025-04-03 | Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation | Laibin Chang et.al. | 2504.02391 | null |
2025-04-04 | MG-Gen: Single Image to Motion Graphics Generation with Layer Decomposition | Takahiro Shirakawa et.al. | 2504.02361 | null |
2025-04-03 | ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation | Yuan Zhou et.al. | 2504.02316 | link |
2025-04-03 | OmniCam: Unified Multimodal Video Generation via Camera Control | Xiaoda Yang et.al. | 2504.02312 | null |
2025-04-03 | WonderTurbo: Generating Interactive 3D World in 0.72 Seconds | Chaojun Ni et.al. | 2504.02261 | null |
2025-04-03 | AC-LoRA: Auto Component LoRA for Personalized Artistic Style Image Generation | Zhipu Cui et.al. | 2504.02231 | null |
2025-04-02 | Foreground Focus: Enhancing Coherence and Fidelity in Camouflaged Image Generation | Pei-Chi Chen et.al. | 2504.02180 | null |
2025-04-02 | Less-to-More Generalization: Unlocking More Controllability by In-Context Generation | Shaojin Wu et.al. | 2504.02160 | link |
2025-04-02 | FreSca: Unveiling the Scaling Space in Diffusion Models | Chao Huang et.al. | 2504.02154 | null |
2025-04-02 | WorldPrompter: Traversable Text-to-Scene Generation | Zhaoyang Zhang et.al. | 2504.02045 | null |
2025-04-02 | Instruction-Guided Autoregressive Neural Network Parameter Generation | Soro Bedionita et.al. | 2504.02012 | null |
2025-04-02 | Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression | Dohyun Kim et.al. | 2504.02011 | null |
2025-04-01 | OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF | Jingyu Shi et.al. | 2504.02007 | null |
2025-04-02 | Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis | Niluthpol Chowdhury Mithun et.al. | 2504.01960 | null |
2025-04-03 | VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step | Hanyang Wang et.al. | 2504.01956 | null |
2025-04-02 | A Unified Approach to Analysis and Design of Denoising Markov Models | Yinuo Ren et.al. | 2504.01938 | null |
2025-04-03 | ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement | Runhui Huang et.al. | 2504.01934 | null |
2025-04-02 | FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs | Mothilal Asokan et.al. | 2504.01916 | null |
2025-04-02 | Multi-fidelity Parameter Estimation Using Conditional Diffusion Models | Caroline Tatsuoka et.al. | 2504.01894 | null |
2025-04-02 | A Diffusion-Based Framework for Occluded Object Movement | Zheng-Peng Duan et.al. | 2504.01873 | null |
2025-04-02 | Implicit Bias Injection Attacks against Text-to-Image Diffusion Models | Huayang Huang et.al. | 2504.01819 | link |
2025-04-02 | The protein escape process at the ribosomal exit tunnel has conserved mechanisms across the domains of life | Phuong Thuy Bui et.al. | 2504.01731 | null |
2025-04-02 | InvFussion: Bridging Supervised and Zero-shot Diffusion for Inverse Problems | Noam Elata et.al. | 2504.01689 | link |
2025-04-02 | Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology | Lirui Qi et.al. | 2504.01577 | null |
2025-04-02 | Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training | Luca Ciampi et.al. | 2504.01547 | link |
2025-04-10 | Hyperbolic Diffusion Recommender Model | Meng Yuan et.al. | 2504.01541 | null |
2025-04-02 | Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model | Jincheng Zhong et.al. | 2504.01521 | link |
2025-04-02 | High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model | Yiyang Shen et.al. | 2504.01512 | null |
2025-04-02 | Generalized Assignment and Knapsack Problems in the Random-Order Model | Max Klimm et.al. | 2504.01486 | null |
2025-04-02 | Dual first-order methods for efficient computation of convex hull prices | Sofiane Tanji et.al. | 2504.01474 | null |
2025-04-02 | From Easy to Hard: Building a Shortcut for Differentially Private Image Synthesis | Kecen Li et.al. | 2504.01395 | link |
2025-04-07 | Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks | Jiawei Wang et.al. | 2504.01308 | link |
2025-04-01 | Prompting Forgetting: Unlearning in GANs via Textual Guidance | Piyush Nagasubramaniam et.al. | 2504.01218 | null |
2025-04-01 | Articulated Kinematics Distillation from Video Diffusion Models | Xuan Li et.al. | 2504.01204 | null |
2025-04-09 | Towards Signed Distance Function based Metamaterial Design: Neural Operator Transformer for Forward Prediction and Diffusion Model for Inverse Design | Qibang Liu et.al. | 2504.01195 | link |
2025-04-01 | Neural Approaches to SAT Solving: Design Choices and Interpretability | David Mojžíšek et.al. | 2504.01173 | null |
2025-04-01 | Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models | Guy Kaplan et.al. | 2504.01137 | link |
2025-04-08 | ShieldGemma 2: Robust and Tractable Image Content Moderation | Wenjun Zeng et.al. | 2504.01081 | null |
2025-04-01 | MixerMDM: Learnable Composition of Human Motion Diffusion Models | Pablo Ruiz-Ponce et.al. | 2504.01019 | null |
2025-04-01 | GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors | Tian-Xing Xu et.al. | 2504.01016 | null |
2025-04-01 | AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction | Junhao Cheng et.al. | 2504.01014 | link |
2025-04-01 | IntrinsiX: High-Quality PBR Generation using Image Priors | Peter Kocsis et.al. | 2504.01008 | null |
2025-04-01 | Enhancing 3T BOLD fMRI SNR using Unpaired 7T Data with Schrödinger Bridge Diffusion | Yujian Xiong et.al. | 2504.01004 | null |
2025-04-01 | MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization | Siyuan Li et.al. | 2504.00999 | null |
2025-04-01 | TurboFill: Adapting Few-step Text-to-image Model for Fast Image Inpainting | Liangbin Xie et.al. | 2504.00996 | null |
2025-04-01 | WorldScore: A Unified Evaluation Benchmark for World Generation | Haoyi Duan et.al. | 2504.00983 | null |
2025-04-01 | Personalized Federated Training of Diffusion Models with Privacy Guarantees | Kumar Kshitij Patel et.al. | 2504.00952 | null |
2025-04-01 | Diffusion-model approach to flavor models: A case study for $S_4^\prime$ modular flavor model | Satsuki Nishimura et.al. | 2504.00944 | null |
2025-04-01 | Data-free Knowledge Distillation with Diffusion Models | Xiaohua Qi et.al. | 2504.00870 | null |
2025-04-01 | Integrating Fourier Neural Operators with Diffusion Models to improve Spectral Representation of Synthetic Earthquake Ground Motion Response | Niccolò Perrone et.al. | 2504.00757 | null |
2025-04-03 | Geometric Median Matching for Robust k-Subset Selection from Noisy Data | Anish Acharya et.al. | 2504.00564 | null |
2025-04-01 | Diffusion Model-Based Size Variable Virtual Try-On Technology and Evaluation Method | Shufang Zhang et.al. | 2504.00562 | null |
2025-04-01 | Galaxy Morphology Classification via Deep Semi-Supervised Learning with Limited Labeled Data | Zhijian Luo et.al. | 2504.00500 | null |
2025-04-03 | Distilling Multi-view Diffusion Models into 3D Generators | Hao Qin et.al. | 2504.00457 | null |
2025-04-01 | DecoFuse: Decomposing and Fusing the “What”, “Where”, and “How” for Brain-Inspired fMRI-to-Video Decoding | Chong Li et.al. | 2504.00432 | null |
2025-04-01 | Beyond Wide-Angle Images: Unsupervised Video Portrait Correction via Spatiotemporal Diffusion Adaptation | Wenbo Nie et.al. | 2504.00401 | null |
2025-04-04 | SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning | Xiaole Xian et.al. | 2504.00396 | null |
2025-04-01 | AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline | Lei Wang et.al. | 2504.00394 | null |
2025-04-01 | Using complex prompts to identify fine-grained biases in image generation through ChatGPT-4o | Marinus Ferreira et.al. | 2504.00388 | null |
2025-04-01 | Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration | Zilong Huang et.al. | 2504.00387 | null |
2025-04-01 | Hierarchical Flow Diffusion for Efficient Frame Interpolation | Yang Hai et.al. | 2504.00380 | null |
2025-04-01 | Aligning Diffusion Model with Problem Constraints for Trajectory Optimization | Anjian Li et.al. | 2504.00342 | null |
2025-04-01 | Diffusion models for probabilistic precipitation generation from atmospheric variables | Michael Aich et.al. | 2504.00307 | null |
2025-03-31 | DiffDenoise: Self-Supervised Medical Image Denoising with Conditional Diffusion Models | Basar Demir et.al. | 2504.00264 | null |
2025-04-02 | Dynamics-aware Diffusion Models for Planning and Control | Darshan Gadginmath et.al. | 2504.00236 | null |
2025-03-31 | GazeLLM: Multimodal LLMs incorporating Human Visual Attention | Jun Rekimoto et.al. | 2504.00221 | null |
2025-03-31 | Can Diffusion Models Disentangle? A Theoretical Perspective | Liming Wang et.al. | 2504.00220 | null |
2025-03-31 | Leveraging Diffusion Model and Image Foundation Model for Improved Correspondence Matching in Coronary Angiography | Lin Zhao et.al. | 2504.00191 | null |
2025-03-31 | Few-Shot Generation of Brain Tumors for Secure and Fair Data Sharing | Yongyi Shi et.al. | 2504.00150 | null |
2025-04-03 | Quantum Generative Models for Image Generation: Insights from MNIST and MedMNIST | Chi-Sheng Chen et.al. | 2504.00034 | null |
2025-03-28 | Diffusion models applied to skin and oral cancer classification | José J. M. Uliana et.al. | 2504.00026 | null |
2025-03-25 | LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration | Yuyao Zhang et.al. | 2504.00010 | link |
2025-03-31 | RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy | Zhonghan Zhao et.al. | 2503.24388 | null |
2025-03-31 | Consistent Subject Generation via Contrastive Instantiated Concepts | Lee Hsin-Ying et.al. | 2503.24387 | null |
2025-03-31 | Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation | Shengqiong Wu et.al. | 2503.24379 | null |
2025-03-31 | Style Quantization for Data-Efficient GAN Training | Jian Wang et.al. | 2503.24282 | null |
2025-03-31 | Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach | Francesco Pio Ramunno et.al. | 2503.24271 | link |
2025-04-01 | Visual Acoustic Fields | Yuelei Li et.al. | 2503.24270 | null |
2025-03-31 | FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics | Yixuan Li et.al. | 2503.24267 | null |
2025-03-31 | Threats and Opportunities in AI-generated Images for Armed Forces | Raphael Meier et.al. | 2503.24095 | null |
2025-04-07 | Controlled Latent Diffusion Models for 3D Porous Media Reconstruction | Danilo Naiff et.al. | 2503.24083 | link |
2025-04-08 | A robot-assisted pipeline to rapidly scan 1.7 million historical aerial photographs | Sheila Masson et.al. | 2503.24063 | null |
2025-04-01 | HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation | Boyuan Wang et.al. | 2503.24026 | null |
2025-03-31 | DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model | Ming Yuan et.al. | 2503.23993 | null |
2025-03-31 | JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation | Fangda Chen et.al. | 2503.23951 | null |
2025-03-31 | AI2Agent: An End-to-End Framework for Deploying AI Projects as Autonomous Agents | Jiaxiang Chen et.al. | 2503.23948 | link |
2025-03-31 | DiffuSE: Cross-Layer Design Space Exploration of DNN Accelerator via Diffusion-Driven Optimization | Yi Ren et.al. | 2503.23945 | null |
2025-03-31 | Training-Free Text-Guided Image Editing with Visual Autoregressive Model | Yufei Wang et.al. | 2503.23897 | link |
2025-03-31 | DiffScale: Continuous Downscaling and Bias Correction of Subseasonal Wind Speed Forecasts using Diffusion Models | Maximilian Springenberg et.al. | 2503.23893 | null |
2025-03-31 | MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach | Xin Zhang et.al. | 2503.23888 | null |
2025-03-31 | ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image | Tianyi Gong et.al. | 2503.23881 | null |
2025-04-01 | On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices | Bosung Kim et.al. | 2503.23796 | link |
2025-03-31 | Biologically Inspired Spiking Diffusion Model with Adaptive Lateral Selection Mechanism | Linghao Feng et.al. | 2503.23767 | null |
2025-03-31 | StrokeFusion: Vector Sketch Generation via Joint Stroke-UDF Encoding and Latent Sequence Diffusion | Jin Zhou et.al. | 2503.23752 | null |
2025-03-31 | Semantic Packet Aggregation and Repeated Transmission for Text-to-Image Generation | Seunghun Lee et.al. | 2503.23734 | null |
2025-03-31 | Effective Cloud Removal for Remote Sensing Images by an Improved Mean-Reverting Denoising Model with Elucidated Design Space | Yi Liu et.al. | 2503.23717 | link |
2025-03-31 | HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation | Kun Liu et.al. | 2503.23715 | null |
2025-03-31 | Expanding-and-Shrinking Binary Neural Networks | Xulong Shi et.al. | 2503.23709 | link |
2025-03-31 | Bayesian Inference for a Time-Fractional HIV Model with Nonlinear Diffusion | Mohamed BenSalah et.al. | 2503.23638 | null |
2025-03-30 | Language-Guided Trajectory Traversal in Disentangled Stable Diffusion Latent Space for Factorized Medical Image Generation | Zahra TehraniNasab et.al. | 2503.23623 | null |
2025-03-30 | Leveraging Vision-Language Foundation Models to Reveal Hidden Image-Attribute Relationships in Medical Imaging | Amar Kumar et.al. | 2503.23618 | null |
2025-03-30 | Make Autoregressive Great Again: Diffusion-Free Graph Generation with Next-Scale Prediction | Samuel Belkadi et.al. | 2503.23612 | null |
2025-03-30 | DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution | Zheng-Peng Duan et.al. | 2503.23580 | null |
2025-03-30 | Enhancing Creative Generation on Stable Diffusion-based Models | Jiyeon Han et.al. | 2503.23538 | link |
2025-04-01 | TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes | Nikai Du et.al. | 2503.23461 | link |
2025-03-30 | VideoGen-Eval: Agent-based System for Video Generation Evaluation | Yuhang Yang et.al. | 2503.23452 | link |
2025-03-30 | Diffusion Meets Few-shot Class Incremental Learning | Junsu Kim et.al. | 2503.23402 | null |
2025-03-30 | A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models | Leander Girrbach et.al. | 2503.23398 | null |
2025-03-30 | JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization | Kai Liu et.al. | 2503.23377 | null |
2025-04-04 | VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior | Xindi Yang et.al. | 2503.23368 | null |
2025-03-30 | DSPFusion: Image Fusion via Degradation and Semantic Dual-Prior Guidance | Linfeng Tang et.al. | 2503.23355 | null |
2025-03-30 | Object Isolated Attention for Consistent Story Visualization | Xiangyang Luo et.al. | 2503.23353 | null |
2025-03-30 | TraceMark-LDM: Authenticatable Watermarking for Latent Diffusion Models via Binary-Guided Rearrangement | Wenhao Luo et.al. | 2503.23332 | null |
2025-03-30 | MoCha: Towards Movie-Grade Talking Character Synthesis | Cong Wei et.al. | 2503.23307 | null |
2025-03-30 | SketchVideo: Sketch-based Video Generation and Editing | Feng-Lin Liu et.al. | 2503.23284 | null |
2025-03-30 | Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models | Haonan Chen et.al. | 2503.23271 | null |
2025-04-02 | Geometry in Style: 3D Stylization via Surface Normal Deformation | Nam Anh Dinh et.al. | 2503.23241 | null |
2025-03-29 | Towards Interpretable Counterfactual Generation via Multimodal Autoregression | Chenglong Ma et.al. | 2503.23149 | null |
2025-03-29 | Galaxy Imaging with Generative Models: Insights from a Two-Models Framework | Jean-Eric Campagne et.al. | 2503.23127 | link |
2025-03-29 | Evaluating Compositional Scene Understanding in Multimodal Generative Models | Shuhao Fu et.al. | 2503.23125 | link |
2025-03-29 | MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs | Xianglong He et.al. | 2503.23022 | null |
2025-03-29 | On Geometrical Properties of Text Token Embeddings for Strong Semantic Binding in Text-to-Image Generation | Hoigi Seo et.al. | 2503.23011 | null |
2025-03-28 | DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers | Hanling Zhang et.al. | 2503.22796 | null |
2025-03-28 | Patronus: Bringing Transparency to Diffusion Models with Prototypes | Nina Weng et.al. | 2503.22782 | null |
2025-03-27 | Ignite Forecasting with SPARK: An Efficient Generative Framework for Refining LLMs in Temporal Knowledge Graph Forecasting | Gongzhu Yin et.al. | 2503.22748 | link |
2025-03-26 | A Spatial-temporal Deep Probabilistic Diffusion Model for Reliable Hail Nowcasting with Radar Echo Extrapolation | Haonan Shi et.al. | 2503.22724 | null |
2025-03-28 | DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness | Ruining Li et.al. | 2503.22677 | null |
2025-03-28 | Evaluation of Machine-generated Biomedical Images via A Tally-based Similarity Measure | Frank J. Brooks et.al. | 2503.22658 | null |
2025-03-28 | Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model | Jangho Park et.al. | 2503.22622 | null |
2025-03-28 | Generative Latent Neural PDE Solver using Flow Matching | Zijie Li et.al. | 2503.22600 | null |
2025-03-28 | RELD: Regularization by Latent Diffusion Models for Image Restoration | Pasquale Cascarano et.al. | 2503.22563 | null |
2025-03-28 | Deterministic Medical Image Translation via High-fidelity Brownian Bridges | Qisheng He et.al. | 2503.22531 | null |
2025-03-28 | Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments | Luke Rowe et.al. | 2503.22496 | null |
2025-03-28 | Volumetric Material Decomposition Using Spectral Diffusion Posterior Sampling with a Compressed Polychromatic Forward Model | Xiao Jiang et.al. | 2503.22392 | null |
2025-03-28 | EchoFlow: A Foundation Model for Cardiac Ultrasound Image and Video Generation | Hadrien Reynaud et.al. | 2503.22357 | null |
2025-04-09 | Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization | Barış Batuhan Topal et.al. | 2503.22352 | null |
2025-03-28 | GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion | Li-Heng Chen et.al. | 2503.22349 | null |
2025-03-28 | Semantix: An Energy Guided Sampler for Semantic Style Transfer | Huiang He et.al. | 2503.22344 | null |
2025-03-28 | Imperceptible but Forgeable: Practical Invisible Watermark Forgery via Diffusion Models | Ziping Dong et.al. | 2503.22330 | null |
2025-03-28 | Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion | Songsong Yu et.al. | 2503.22262 | null |
2025-04-05 | CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving | Yishen Ji et.al. | 2503.22231 | null |
2025-03-28 | Follow Your Motion: A Generic Temporal Consistency Portrait Editing Framework with Trajectory Guidance | Haijie Yang et.al. | 2503.22225 | null |
2025-03-28 | Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces | Wonhyeok Choi et.al. | 2503.22209 | null |
2025-03-28 | ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation | Yunhong Min et.al. | 2503.22194 | null |
2025-03-28 | Limiting Disease Spreading in Human Networks | Gargi Bakshi et.al. | 2503.22191 | null |
2025-03-28 | Sell It Before You Make It: Revolutionizing E-Commerce with Personalized AI-Generated Items | Jianghao Lin et.al. | 2503.22182 | null |
2025-03-28 | High-Fidelity Diffusion Face Swapping with ID-Constrained Facial Conditioning | Dailan He et.al. | 2503.22179 | null |
2025-03-28 | Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation | Minho Park et.al. | 2503.22172 | null |
2025-03-28 | An Empirical Study of Validating Synthetic Data for Text-Based Person Retrieval | Min Cao et.al. | 2503.22171 | link |
2025-03-28 | Spatial Transport Optimization by Repositioning Attention Map for Training-Free Text-to-Image Synthesis | Woojung Han et.al. | 2503.22168 | null |
2025-03-28 | Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model | Changchang Sun et.al. | 2503.22138 | null |
2025-03-28 | Improving the generalization of deep learning models in the segmentation of mammography images | Jan Hurtado et.al. | 2503.22052 | null |
2025-03-27 | AGILE: A Diffusion-Based Attention-Guided Image and Label Translation for Efficient Cross-Domain Plant Trait Identification | Earl Ranario et.al. | 2503.22019 | link |
2025-03-27 | Improving Equivariant Networks with Probabilistic Symmetry Breaking | Hannah Lawrence et.al. | 2503.21985 | null |
2025-03-27 | Harmonizing Visual Representations for Unified Multimodal Understanding and Generation | Size Wu et.al. | 2503.21979 | link |
2025-04-07 | Parametric Shadow Control for Portrait Generation in Text-to-Image Diffusion Models | Haoming Cai et.al. | 2503.21943 | null |
2025-03-27 | KernelFusion: Assumption-Free Blind Super-Resolution via Patch Diffusion | Oliver Heinimann et.al. | 2503.21907 | null |
2025-03-26 | Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations | Haitong Liu et.al. | 2503.21824 | link |
2025-03-25 | IPGO: Indirect Prompt Gradient Optimization on Text-to-Image Generative Models with High Data Efficiency | Jianping Ye et.al. | 2503.21812 | null |
2025-03-27 | VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models | Chi-Pin Huang et.al. | 2503.21781 | null |
2025-03-27 | StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion | Ziyu Guo et.al. | 2503.21775 | null |
2025-03-27 | Optimal Stepsize for Diffusion Sampling | Jianning Pei et.al. | 2503.21774 | link |
2025-03-27 | A Unified Image-Dense Annotation Generation Model for Underwater Scenes | Hongkai Lin et.al. | 2503.21771 | link |
2025-03-27 | Exploring the Evolution of Physics Cognition in Video Generation: A Survey | Minghui Lin et.al. | 2503.21765 | link |
2025-03-27 | Lumina-Image 2.0: A Unified and Efficient Image Generative Framework | Qi Qin et.al. | 2503.21758 | link |
2025-03-27 | A Unified Framework for Diffusion Bridge Problems: Flow Matching and Schrödinger Matching into One | Minyoung Kim et.al. | 2503.21756 | null |
2025-03-27 | VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness | Dian Zheng et.al. | 2503.21755 | link |
2025-03-27 | LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis | Shitian Zhao et.al. | 2503.21749 | null |
2025-03-27 | CTRL-O: Language-Controllable Object-Centric Visual Representation Learning | Aniket Didolkar et.al. | 2503.21747 | null |
2025-03-27 | 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models | Yuhan Zhang et.al. | 2503.21745 | null |
2025-03-27 | Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance | Jaywon Koo et.al. | 2503.21721 | null |
2025-03-27 | Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data | Zhiyuan Ma et.al. | 2503.21694 | link |
2025-03-27 | Audio-driven Gesture Generation via Deviation Feature in the Latent Space | Jiahui Chen et.al. | 2503.21616 | null |
2025-03-27 | Critical Iterative Denoising: A Discrete Generative Model Applied to Graphs | Yoann Boget et.al. | 2503.21592 | null |
2025-03-27 | AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion | Liuyue Xie et.al. | 2503.21581 | null |
2025-03-27 | SyncSDE: A Probabilistic Framework for Diffusion Synchronization | Hyunjun Lee et.al. | 2503.21555 | null |
2025-03-28 | LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing | Achint Soni et.al. | 2503.21541 | link |
2025-03-27 | Nonlinear Stability of Large-Period Traveling Waves Bifurcating from the Heteroclinic Loop in the FitzHugh-Nagumo Equation | Ji Li et.al. | 2503.21509 | null |
2025-03-27 | Invert2Restore: Zero-Shot Degradation-Blind Image Restoration | Hamadi Chihaoui et.al. | 2503.21486 | null |
2025-03-27 | Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving | Lucas Nunes et.al. | 2503.21449 | link |
2025-03-27 | Exploring the flavor structure of leptons via diffusion models | Satsuki Nishimura et.al. | 2503.21432 | null |
2025-03-27 | Diffusion Image Prior | Hamadi Chihaoui et.al. | 2503.21410 | null |
2025-03-27 | HORT: Monocular Hand-held Objects Reconstruction with Transformers | Zerui Chen et.al. | 2503.21313 | null |
2025-04-01 | Zero-Shot Visual Concept Blending Without Text Guidance | Hiroya Makino et.al. | 2503.21277 | link |
2025-03-29 | GenFusion: Closing the Loop between Reconstruction and Generation via Videos | Sibo Wu et.al. | 2503.21219 | null |
2025-03-27 | UGen: Unified Autoregressive Multimodal Model with Progressive Vocabulary Learning | Hongxuan Tang et.al. | 2503.21193 | null |
2025-03-27 | Model as a Game: On Numerical and Spatial Consistency for Generative Games | Jingye Chen et.al. | 2503.21172 | null |
2025-03-27 | ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model | Jinwei Qi et.al. | 2503.21144 | null |
2025-03-27 | Can Video Diffusion Model Reconstruct 4D Geometry? | Jinjie Mai et.al. | 2503.21082 | null |
2025-03-27 | Efficient Multi-Instance Generation with Janus-Pro-Dirven Prompt Parsing | Fan Qi et.al. | 2503.21069 | null |
2025-03-26 | TransDiffSBDD: Causality-Aware Multi-Modal Structure-Based Drug Design | Xiuyuan Hu et.al. | 2503.20913 | null |
2025-03-26 | Unified Multimodal Discrete Diffusion | Alexander Swerdlow et.al. | 2503.20853 | link |
2025-03-26 | Debiasing Kernel-Based Generative Models | Tian Qin et.al. | 2503.20825 | null |
2025-03-26 | Synthetic Video Enhances Physical Fidelity in Video Synthesis | Qi Zhao et.al. | 2503.20822 | null |
2025-03-26 | Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency | Tianqi Liu et.al. | 2503.20785 | link |
2025-03-26 | FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks | Jinwei Li et.al. | 2503.20784 | link |
2025-03-26 | High Quality Diffusion Distillation on a Single GPU with Relative and Absolute Position Matching | Guoqiang Zhang et.al. | 2503.20744 | null |
2025-03-26 | RecTable: Fast Modeling Tabular Data with Rectified Flow | Masane Fuchi et.al. | 2503.20731 | link |
2025-03-26 | Dynamic Motion Blending for Versatile Motion Editing | Nan Jiang et.al. | 2503.20724 | null |
2025-03-26 | BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation | Yuyang Peng et.al. | 2503.20672 | null |
2025-03-26 | ARMO: Autoregressive Rigging for Multi-Category Objects | Mingze Sun et.al. | 2503.20663 | null |
2025-03-26 | AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Xiangwen Zhang et.al. | 2503.20654 | null |
2025-03-26 | MMGen: Unified Multi-modal Image Generation and Understanding in One Go | Jiepeng Wang et.al. | 2503.20644 | null |
2025-03-26 | Stochastic Transport Maps in Diffusion Models and Sampling | Xicheng Zhang et.al. | 2503.20573 | null |
2025-03-26 | Exploring Robustness of Cortical Morphometry in the presence of white matter lesions, using Diffusion Models for Lesion Filling | Vinzenz Uhr et.al. | 2503.20571 | null |
2025-03-26 | Beyond Intermediate States: Explaining Visual Redundancy through Language | Dingchen Yang et.al. | 2503.20540 | link |
2025-03-26 | TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration | Ziying Zhang et.al. | 2503.20537 | null |
2025-03-26 | GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving | Lloyd Russell et.al. | 2503.20523 | null |
2025-03-26 | VPO: Aligning Text-to-Video Generation Models with Prompt Optimization | Jiale Cheng et.al. | 2503.20491 | link |
2025-03-26 | Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation | Qi Si et.al. | 2503.20484 | null |
2025-03-26 | Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability | Yingdong Shi et.al. | 2503.20483 | null |
2025-03-26 | Latent Beam Diffusion Models for Decoding Image Sequences | Guilherme Fernandes et.al. | 2503.20429 | null |
2025-03-26 | ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On | Ji Woo Hong et.al. | 2503.20418 | null |
2025-03-27 | Consistency Trajectory Matching for One-Step Generative Super-Resolution | Weiyi You et.al. | 2503.20349 | null |
2025-03-26 | Wan: Open and Advanced Large-Scale Video Generative Models | WanTeam et.al. | 2503.20314 | link |
2025-03-26 | EGVD: Event-Guided Video Diffusion Model for Physically Realistic Large-Motion Frame Interpolation | Ziran Zhang et.al. | 2503.20268 | link |
2025-03-29 | Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models | Prin Phunyaphibarn et.al. | 2503.20240 | null |
2025-03-26 | Automated UI Interface Generation via Diffusion Models: Enhancing Personalization and Efficiency | Yifei Duan et.al. | 2503.20229 | null |
2025-03-26 | Video Motion Graphs | Haiyang Liu et.al. | 2503.20218 | null |
2025-03-26 | Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models | Alex Jinpeng Wang et.al. | 2503.20198 | null |
2025-03-26 | AIGC-assisted Federated Learning for Edge Intelligence: Architecture Design, Research Challenges and Future Directions | Xianke Qiang et.al. | 2503.20166 | link |
2025-03-25 | Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors | Yuke Lou et.al. | 2503.20118 | null |
2025-03-27 | Adaptive Orchestration for Large-Scale Inference on Heterogeneous Accelerator Systems Balancing Cost, Performance, and Resilience | Yahav Biran et.al. | 2503.20074 | link |
2025-03-25 | Conditional Deep Generative Models for Simultaneous Simulation and Reconstruction of Entire Events | Etienne Dreyer et.al. | 2503.19981 | link |
2025-03-25 | Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals | Stefan Stojanov et.al. | 2503.19953 | null |
2025-03-25 | FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling | Qiusheng Huang et.al. | 2503.19940 | null |
2025-03-25 | Reverse Prompt: Cracking the Recipe Inside Text-to-Image Generation | Zhiyao Ren et.al. | 2503.19937 | null |
2025-03-25 | Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models | Sangwon Beak et.al. | 2503.19914 | null |
2025-03-25 | PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model | Mingju Gao et.al. | 2503.19913 | null |
2025-03-25 | FullDiT: Multi-Task Video Generative Foundation Model with Full Attention | Xuan Ju et.al. | 2503.19907 | null |
2025-03-26 | AvatarArtist: Open-Domain 4D Avatarization | Hongyu Liu et.al. | 2503.19906 | null |
2025-03-25 | ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models | Fernando Julio Cendra et.al. | 2503.19902 | null |
2025-03-25 | Scaling Down Text Encoders of Text-to-Image Diffusion Models | Lifu Wang et.al. | 2503.19897 | link |
2025-03-25 | Mask $^2$ DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation | Tianhao Qi et.al. | 2503.19881 | null |
2025-03-29 | FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model | Jun Zhou et.al. | 2503.19839 | null |
2025-03-25 | AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers | Jiazhi Guan et.al. | 2503.19824 | null |
2025-03-25 | Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models | Ruixi You et.al. | 2503.19798 | null |
2025-03-26 | In the Blink of an Eye: Instant Game Map Editing using a Generative-AI Smart Brush | Vitaly Gnatyuk et.al. | 2503.19793 | null |
2025-03-25 | SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation | Jingdan Kang et.al. | 2503.19791 | link |
2025-03-25 | Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models | Kartik Thakral et.al. | 2503.19783 | null |
2025-03-25 | PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models | Junhyuk So et.al. | 2503.19731 | null |
2025-03-25 | CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation | Rupak Bose et.al. | 2503.19661 | null |
2025-03-30 | OpenSDI: Spotting Diffusion-Generated Images in the Open World | Yabin Wang et.al. | 2503.19653 | link |
2025-03-25 | GIViC: Generative Implicit Video Compression | Ge Gao et.al. | 2503.19604 | null |
2025-03-25 | VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models | Suhas G Hegde et.al. | 2503.19530 | null |
2025-03-25 | Single-Step Latent Consistency Model for Remote Sensing Image Super-Resolution | Xiaohui Sun et.al. | 2503.19505 | null |
2025-03-25 | Exploring Disentangled and Controllable Human Image Synthesis: From End-to-End to Stage-by-Stage | Zhengwentai Sun et.al. | 2503.19486 | null |
2025-03-25 | AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset | Haiyu Zhang et.al. | 2503.19462 | null |
2025-03-25 | Towards Robust Time-of-Flight Depth Denoising with Confidence-Aware Diffusion Model | Changyong He et.al. | 2503.19448 | null |
2025-03-25 | Quantifying the Ease of Reproducing Training Data in Unconditional Diffusion Models | Masaya Hasegawa et.al. | 2503.19429 | null |
2025-03-26 | Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing | Jaihoon Kim et.al. | 2503.19385 | null |
2025-03-25 | MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation | Yukang Lin et.al. | 2503.19383 | null |
2025-03-25 | Interpretable Generative Models through Post-hoc Concept Bottlenecks | Akshay Kulkarni et.al. | 2503.19377 | link |
2025-03-25 | DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image | Hyeongjin Nam et.al. | 2503.19373 | null |
2025-03-26 | EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models | Yufei Cai et.al. | 2503.19369 | link |
2025-03-25 | Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly Detection | Farzad Beizaee et.al. | 2503.19357 | link |
2025-03-25 | Data-driven Mesoscale Weather Forecasting Combining Swin-Unet and Diffusion Models | Yuta Hirabayashi et.al. | 2503.19354 | null |
2025-03-25 | BADGR: Bundle Adjustment Diffusion Conditioned by GRadients for Wide-Baseline Floor Plan Reconstruction | Yuguang Li et.al. | 2503.19340 | null |
2025-03-25 | Long-Context Autoregressive Video Modeling with Next-Frame Prediction | Yuchao Gu et.al. | 2503.19325 | link |
2025-03-25 | ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning | Jiaqi Liao et.al. | 2503.19312 | null |
2025-03-25 | LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Weizhi Chen et.al. | 2503.19311 | link |
2025-03-25 | UniMoMo: Unified Generative Modeling of 3D Molecules for De Novo Binder Design | Xiangzhe Kong et.al. | 2503.19300 | null |
2025-03-25 | Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval | Haoqiang Lin et.al. | 2503.19296 | link |
2025-03-25 | ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency | Yang Ren et.al. | 2503.19283 | link |
2025-03-25 | Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing | Ruiyi Wang et.al. | 2503.19262 | null |
2025-03-25 | MIRAGE: Multi-model Interface for Reviewing and Auditing Generative Text-to-Image AI | Matheus Kunzler Maldaner et.al. | 2503.19252 | null |
2025-03-24 | FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing | Yufan Ren et.al. | 2503.19191 | null |
2025-03-24 | Color Conditional Generation with Sliced Wasserstein Guidance | Alexander Lobashev et.al. | 2503.19034 | null |
2025-03-24 | DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding | Lingyan Ran et.al. | 2503.19012 | null |
2025-03-24 | RomanTex: Decoupling 3D-aware Rotary Positional Embedded Multi-Attention Network for Texture Synthesis | Yifei Feng et.al. | 2503.19011 | null |
2025-03-24 | DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model | Kangwei Liu et.al. | 2503.19001 | null |
2025-03-19 | Advancing Deep Learning through Probability Engineering: A Pragmatic Paradigm for Modern AI | Jianyi Zhang et.al. | 2503.18958 | null |
2025-04-02 | Target-Aware Video Diffusion Models | Taeksoo Kim et.al. | 2503.18950 | null |
2025-03-25 | Aether: Geometric-Aware Unified World Modeling | Aether Team et.al. | 2503.18945 | null |
2025-04-01 | Video-T1: Test-Time Scaling for Video Generation | Fangfu Liu et.al. | 2503.18942 | null |
2025-03-27 | Training-free Diffusion Acceleration with Bottleneck Sampling | Ye Tian et.al. | 2503.18940 | null |
2025-03-24 | SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction | Enrico Pallotta et.al. | 2503.18933 | link |
2025-04-03 | CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models | Weichen Fan et.al. | 2503.18886 | link |
2025-03-25 | HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation | Zunnan Xu et.al. | 2503.18860 | null |
2025-03-24 | Dual-domain Multi-path Self-supervised Diffusion Model for Accelerated MRI Reconstruction | Yuxuan Zhang et.al. | 2503.18836 | null |
2025-03-24 | SKDU at De-Factify 4.0: Vision Transformer with Data Augmentation for AI-Generated Image Detection | Shrikant Malviya et.al. | 2503.18812 | link |
2025-03-24 | Self-Supervised Learning based on Transformed Image Reconstruction for Equivariance-Coherent Feature Representation | Qin Wang et.al. | 2503.18753 | null |
2025-03-24 | Thermalizer: Stable autoregressive neural emulation of spatiotemporal chaos | Chris Pedersen et.al. | 2503.18731 | null |
2025-03-24 | Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings | Cong Liu et.al. | 2503.18719 | null |
2025-03-24 | Human Motion Unlearning | Edoardo De Matteis et.al. | 2503.18674 | null |
2025-03-24 | Dig2DIG: Dig into Diffusion Information Gains for Image Fusion | Bing Cao et.al. | 2503.18627 | null |
2025-03-24 | Generative Dataset Distillation using Min-Max Diffusion Model | Junqiao Fan et.al. | 2503.18626 | null |
2025-03-29 | Unified Uncertainty-Aware Diffusion for Multi-Agent Trajectory Modeling | Guillem Capellera et.al. | 2503.18589 | null |
2025-04-02 | Adapting Video Diffusion Models for Time-Lapse Microscopy | Alexander Holmberg et.al. | 2503.18583 | link |
2025-03-25 | AMD-Hummingbird: Towards an Efficient Text-to-Video Model | Takashi Isobe et.al. | 2503.18559 | link |
2025-03-24 | Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models | Bin Li et.al. | 2503.18556 | null |
2025-03-24 | EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation | Qiang Qu et.al. | 2503.18552 | null |
2025-03-24 | Discriminative protein sequence modelling with Latent Space Diffusion | Eoin Quinn et.al. | 2503.18551 | null |
2025-03-24 | DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels | Erjian Guo et.al. | 2503.18536 | null |
2025-03-25 | AIM2PC: Aerial Image to 3D Building Point Cloud Reconstruction | Soulaimene Turki et.al. | 2503.18527 | null |
2025-03-24 | Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model | Leheng Zhang et.al. | 2503.18512 | null |
2025-03-24 | Can Text-to-Video Generation help Video-Language Alignment? | Luca Zanella et.al. | 2503.18507 | null |
2025-03-24 | Hiding Images in Diffusion Models by Editing Learned Score Functions | Haoyu Chen et.al. | 2503.18459 | null |
2025-03-24 | InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment | Yunhong Lu et.al. | 2503.18454 | link |
2025-03-25 | Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models | Jinho Jeong et.al. | 2503.18446 | link |
2025-03-24 | Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation | Dingcheng Zhen et.al. | 2503.18429 | null |
2025-03-24 | Panorama Generation From NFoV Image Done Right | Dian Zheng et.al. | 2503.18420 | link |
2025-03-25 | Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning | Sherry X. Chen et.al. | 2503.18406 | link |
2025-03-24 | PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes | Xinhua Xu et.al. | 2503.18393 | null |
2025-03-24 | Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance | Sicong Feng et.al. | 2503.18386 | null |
2025-03-24 | Efficient Inference in First Passage Time Models | Sicheng Liu et.al. | 2503.18381 | null |
2025-03-24 | DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation | Raquel Vidaurre et.al. | 2503.18370 | null |
2025-03-28 | Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models | Jinjin Zhang et.al. | 2503.18352 | link |
2025-03-24 | Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners | Wen Zheng Terence Ng et.al. | 2503.18347 | null |
2025-03-24 | Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control | Basim Azam et.al. | 2503.18324 | null |
2025-03-24 | Diff-Palm: Realistic Palmprint Generation with Polynomial Creases and Intra-Class Variation Controllable Diffusion Models | Jianlong Jin et.al. | 2503.18312 | link |
2025-03-24 | DiffMove: Group Mobility Tendency Enhanced Trajectory Recovery via Diffusion Model | Qingyue Long et.al. | 2503.18302 | null |
2025-03-24 | DiffGED: Computing Graph Edit Distance via Diffusion-based Graph Matching | Wei Huang et.al. | 2503.18245 | null |
2025-03-23 | Decoupling Angles and Strength in Low-rank Adaptation | Massimo Bini et.al. | 2503.18225 | link |
2025-03-23 | Self-Attention Diffusion Models for Zero-Shot Biomedical Image Segmentation: Unlocking New Frontiers in Medical Imaging | Abderrachid Hamrani et.al. | 2503.18170 | null |
2025-03-23 | DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation | Peng Chen et.al. | 2503.18159 | link |
2025-03-23 | Adoption of Watermarking for Generative AI Systems in Practice and Implications under the new EU AI Act | Bram Rijsbosch et.al. | 2503.18156 | null |
2025-03-23 | LongDiff: Training-Free Long Video Generation in One Go | Zhuoling Li et.al. | 2503.18150 | null |
2025-03-23 | LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space | Zhangyu Wang et.al. | 2503.18142 | null |
2025-03-23 | TCFG: Tangential Damping Classifier-free Guidance | Mingi Kwon et.al. | 2503.18137 | null |
2025-03-23 | An Image-like Diffusion Method for Human-Object Interaction Detection | Xiaofei Hui et.al. | 2503.18134 | null |
2025-03-23 | Unified Geometry and Color Compression Framework for Point Clouds via Generative Diffusion Priors | Tianxin Huang et.al. | 2503.18083 | null |
2025-03-23 | Model-Guardian: Protecting against Data-Free Model Stealing Using Gradient Representations and Deceptive Predictions | Yunfei Yang et.al. | 2503.18081 | null |
2025-03-23 | GenMetaLoc: Learning to Learn Environment-Aware Fingerprint Generation for Sample Efficient Wireless Localization | Jun Gao et.al. | 2503.18078 | null |
2025-03-23 | Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation | Ziming Wei et.al. | 2503.18065 | link |
2025-03-23 | OmnimatteZero: Training-free Real-time Omnimatte with Pre-trained Video Diffusion Models | Dvir Samuel et.al. | 2503.18033 | null |
2025-03-23 | Metaphor-based Jailbreaking Attacks on Text-to-Image Models | Chenyu Zhang et.al. | 2503.17987 | null |
2025-03-23 | TransAnimate: Taming Layer Diffusion to Generate RGBA Video | Xuewei Chen et.al. | 2503.17934 | null |
2025-03-23 | Guided Diffusion for the Extension of Machine Vision to Human Visual Perception | Takahiro Shindo et.al. | 2503.17907 | null |
2025-03-22 | FundusGAN: A Hierarchical Feature-Aware Generative Framework for High-Fidelity Fundus Image Generation | Qingshan Hou et.al. | 2503.17831 | null |
2025-03-22 | DVG-Diffusion: Dual-View Guided Diffusion Model for CT Reconstruction from X-Rays | Xing Xie et.al. | 2503.17804 | null |
2025-03-29 | Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models | Ketan Suhaas Saichandran et.al. | 2503.17794 | null |
2025-03-22 | Aligning Foundation Model Priors and Diffusion-Based Hand Interactions for Occlusion-Resistant Two-Hand Reconstruction | Gaoge Han et.al. | 2503.17788 | null |
2025-03-22 | Probabilistic Net Load Forecasting for High-Penetration RES Grids Utilizing Enhanced Conditional Diffusion Model | Yixiang Huang et.al. | 2503.17770 | null |
2025-03-22 | RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation | Zhiqiang Yuan et.al. | 2503.17735 | null |
2025-03-22 | DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis | Yongjin Choi et.al. | 2503.17728 | null |
2025-03-22 | Towards Invisible Backdoor Attack on Text-to-Image Diffusion Model | Jie Zhang et.al. | 2503.17724 | link |
2025-03-22 | Conditional Diffusion Model with OOD Mitigation as High-Dimensional Offline Resource Allocation Planner in Clustered Ad Hoc Networks | Kechen Meng et.al. | 2503.17693 | null |
2025-03-22 | Towards Transformer-Based Aligned Generation with Self-Coherence Guidance | Shulei Wang et.al. | 2503.17675 | null |
2025-03-22 | ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation | Oucheng Huang et.al. | 2503.17671 | null |
2025-03-22 | TDRI: Two-Phase Dialogue Refinement and Co-Adaptation for Interactive Image Generation | Yuheng Feng et.al. | 2503.17669 | null |
2025-03-22 | OMR-Diffusion:Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Intent Understanding | Kun Li et.al. | 2503.17660 | null |
2025-03-22 | Efficient Diffusion Training through Parallelization with Truncated Karhunen-Loève Expansion | Yumeng Ren et.al. | 2503.17657 | null |
2025-03-22 | LZMidi: Compression-Based Symbolic Music Generation | Connor Ding et.al. | 2503.17654 | null |
2025-03-22 | AI-Based Screening for Depression and Social Anxiety Through Eye Tracking: An Exploratory Study | Karol Chlasta et.al. | 2503.17625 | null |
2025-03-22 | Guidance Free Image Editing via Explicit Conditioning | Mehdi Noroozi et.al. | 2503.17593 | null |
2025-03-21 | PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning | Yan Zhang et.al. | 2503.17544 | null |
2025-03-21 | Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks | Bhishma Dedhia et.al. | 2503.17539 | null |
2025-03-21 | DermDiff: Generative Diffusion Model for Mitigating Racial Biases in Dermatology Diagnosis | Nusrat Munia et.al. | 2503.17536 | link |
2025-03-21 | Towards Understanding the Benefits of Neural Network Parameterizations in Geophysical Inversions: A Study With Neural Fields | Anran Xu et.al. | 2503.17503 | null |
2025-03-21 | ProDehaze: Prompting Diffusion Models Toward Faithful Image Dehazing | Tianwen Zhou et.al. | 2503.17488 | link |
2025-03-21 | What’s Producible May Not Be Reachable: Measuring the Steerability of Generative Models | Keyon Vafa et.al. | 2503.17482 | null |
2025-03-21 | Bayesian generative models can flag performance loss, bias, and out-of-distribution image content | Miguel López-Pérez et.al. | 2503.17477 | null |
2025-03-21 | Every Nearby Energetic Pulsar Is Surrounded by a Region of Inhibited Diffusion | Isabelle John et.al. | 2503.17442 | null |
2025-03-21 | A nonlocal degenerate macroscopic model of traffic dynamics with saturated diffusion: modeling and calibration theory | Dawson Do et.al. | 2503.17413 | null |
2025-03-17 | Beyond Group Means and Into the World of Individuals: A Distributional Spotlight for Experimental Effects on Individuals | Roussel Rahman et.al. | 2503.17390 | link |
2025-03-21 | Position: Interactive Generative Video as Next-Generation Game Engine | Jiwen Yu et.al. | 2503.17359 | null |
2025-03-21 | Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer | Qingyu Shi et.al. | 2503.17350 | null |
2025-03-21 | Preference-Guided Diffusion for Multi-Objective Offline Optimization | Yashas Annadani et.al. | 2503.17299 | null |
2025-03-21 | Deep End-to-End Posterior ENergy (DEEPEN) for image recovery | Jyothi Rikhab Chand et.al. | 2503.17244 | null |
2025-03-21 | Leveraging Text-to-Image Generation for Handling Spurious Correlation | Aryan Yazdan Parast et.al. | 2503.17226 | null |
2025-03-28 | UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models | Fanghua Yu et.al. | 2503.17221 | null |
2025-03-21 | FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy | Xingchao Yang et.al. | 2503.17197 | null |
2025-03-21 | D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens | Panpan Wang et.al. | 2503.17155 | null |
2025-03-21 | R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model | Boyuan Zheng et.al. | 2503.17097 | null |
2025-03-21 | Halton Scheduler For Masked Generative Image Transformer | Victor Besnier et.al. | 2503.17076 | link |
2025-03-24 | Zero-Shot Styled Text Image Generation, but Make It Autoregressive | Vittorio Pippi et.al. | 2503.17074 | null |
2025-03-21 | DIDiffGes: Decoupled Semi-Implicit Diffusion Models for Real-time Gesture Generation from Speech | Yongkang Cheng et.al. | 2503.17059 | null |
2025-03-21 | AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process | Junjie Hu et.al. | 2503.17029 | null |
2025-03-21 | Enabling Versatile Controls for Video Diffusion Models | Xu Zhang et.al. | 2503.16983 | link |
2025-03-21 | Real-Time Diffusion Policies for Games: Enhancing Consistency Policies with Q-Ensembles | Ruoqi Zhang et.al. | 2503.16978 | null |
2025-03-21 | Multiple Ultrasound Image Generation based on Tuned Alignment of Amplitude Hologram over Spatially non-Uniform Ultrasound Source | Keisuke Hasegawa et.al. | 2503.16949 | null |
2025-03-25 | Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model | Yingying Fan et.al. | 2503.16942 | null |
2025-03-21 | When Preferences Diverge: Aligning Diffusion Models with Minority-Aware Adaptive DPO | Lingfan Zhang et.al. | 2503.16921 | null |
2025-03-21 | Malliavin-Bismut Score-based Diffusion Models | Ehsan Mirafzali et.al. | 2503.16917 | null |
2025-03-21 | Safe and Reliable Diffusion Models via Subspace Projection | Huiqiang Chen et.al. | 2503.16835 | null |
2025-03-21 | Auto-Regressive Diffusion for Generating 3D Human-Object Interactions | Zichen Geng et.al. | 2503.16801 | link |
2025-03-20 | Automated Harmfulness Testing for Code Large Language Models | Honghao Tan et.al. | 2503.16740 | null |
2025-03-20 | EDiT: Efficient Diffusion Transformers with Linear Compressed Attention | Philipp Becker et.al. | 2503.16726 | null |
2025-03-20 | WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching | Tianze Luo et.al. | 2503.16689 | link |
2025-03-20 | Fed-NDIF: A Noise-Embedded Federated Diffusion Model For Low-Count Whole-Body PET Denoising | Yinchi Zhou et.al. | 2503.16635 | null |
2025-03-20 | TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features | Dana Cohen-Bar et.al. | 2503.16630 | null |
2025-03-20 | A Recipe for Generating 3D Worlds From a Single Image | Katja Schwarz et.al. | 2503.16611 | null |
2025-03-20 | World Knowledge from AI Image Generation for Robot Control | Jonas Krumme et.al. | 2503.16579 | null |
2025-03-20 | Bezier Distillation | Ling Feng et.al. | 2503.16562 | null |
2025-03-17 | Adams Bashforth Moulton Solver for Inversion and Editing in Rectified Flow | Yongjia Ma et.al. | 2503.16522 | null |
2025-03-20 | XAttention: Block Sparse Attention with Antidiagonal Scoring | Ruyi Xu et.al. | 2503.16428 | link |
2025-03-20 | Tokenize Image as a Set | Zigang Geng et.al. | 2503.16425 | link |
2025-03-20 | MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance | Quanhao Li et.al. | 2503.16421 | null |
2025-03-20 | SynCity: Training-Free Generation of 3D Worlds | Paul Engstler et.al. | 2503.16420 | null |
2025-03-20 | InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity | Liming Jiang et.al. | 2503.16418 | link |
2025-03-20 | DreamTexture: Shape from Virtual Texture with Analysis by Augmentation | Ananta R. Bhattarai et.al. | 2503.16412 | null |
2025-03-20 | VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness | SeungJu Cha et.al. | 2503.16406 | link |
2025-03-27 | ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos | Haolin Yang et.al. | 2503.16400 | null |
2025-03-20 | Scale-wise Distillation of Diffusion Models | Nikita Starodubcev et.al. | 2503.16397 | null |
2025-03-25 | SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation | Chun-Han Yao et.al. | 2503.16396 | null |
2025-03-20 | Do Visual Imaginations Improve Vision-and-Language Navigation Agents? | Akhil Perincherry et.al. | 2503.16394 | null |
2025-03-20 | LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images | Leyang Wang et.al. | 2503.16376 | null |
2025-03-20 | Heat transfer and mixing in initiated Chemical Vapor Deposition analyzed by in-situ gas composition sensing | Simon Shindler et.al. | 2503.16373 | null |
2025-03-20 | LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates | Ying Shen et.al. | 2503.16334 | null |
2025-03-20 | Ultra-Resolution Adaptation with Ease | Ruonan Yu et.al. | 2503.16322 | link |
2025-03-26 | Unleashing Vecset Diffusion Model for Fast Shape Generation | Zeqiang Lai et.al. | 2503.16302 | link |
2025-03-20 | Diffusion-augmented Graph Contrastive Learning for Collaborative Filter | Fan Huang et.al. | 2503.16290 | null |
2025-03-20 | SceneMI: Motion In-betweening for Modeling Human-Scene Interactions | Inwoo Hwang et.al. | 2503.16289 | null |
2025-03-21 | Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens | Shuqi Lu et.al. | 2503.16278 | link |
2025-03-20 | Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts | Yu Cao et.al. | 2503.16218 | null |
2025-03-20 | Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction | Ziyao Guo et.al. | 2503.16194 | null |
2025-03-19 | Guardians of Generation: Dynamic Inference-Time Copyright Shielding with Adaptive Guidance for AI Image Generation | Soham Roy et.al. | 2503.16171 | null |
2025-03-20 | FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing | Tianyi Wei et.al. | 2503.16153 | null |
2025-03-20 | Improving Discriminator Guidance in Diffusion Models | Alexandre Verine et.al. | 2503.16117 | null |
2025-03-20 | PromptMobile: Efficient Promptus for Low Bandwidth Mobile Video Streaming | Liming Liu et.al. | 2503.16112 | null |
2025-03-20 | Universal class of exactly solvable diffusions from space-time transformations | Costantino Di Bello et.al. | 2503.16090 | null |
2025-03-20 | PoseTraj: Pose-Aware Trajectory Control in Video Diffusion | Longbin Ji et.al. | 2503.16068 | null |
2025-03-20 | Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model | Yingmao Miao et.al. | 2503.16065 | null |
2025-03-20 | PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval | Qiang Zou et.al. | 2503.16064 | link |
2025-03-25 | Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts | Yike Yuan et.al. | 2503.16057 | null |
2025-03-20 | Single Image Iterative Subject-driven Generation and Editing | Yair Shpitzer et.al. | 2503.16025 | link |
2025-03-20 | Animating the Uncaptured: Humanoid Mesh Animation with Video Diffusion Models | Marc Benedí San Millán et.al. | 2503.15996 | null |
2025-03-20 | A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli | Pengyu Liu et.al. | 2503.15978 | null |
2025-03-20 | Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation | Kendong Liu et.al. | 2503.15975 | null |
2025-03-20 | BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers | Hui Zhang et.al. | 2503.15927 | null |
2025-03-20 | Text-Driven Diffusion Model for Sign Language Production | Jiayi He et.al. | 2503.15914 | null |
2025-03-20 | Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation | Jiyuan Wang et.al. | 2503.15905 | null |
2025-03-20 | Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation | Tiange Xiang et.al. | 2503.15877 | null |
2025-03-20 | MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving | Haiguang Wang et.al. | 2503.15875 | link |
2025-03-21 | UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations | Debabrata Mandal et.al. | 2503.15868 | null |
2025-03-20 | TruthLens: Explainable DeepFake Detection for Face Manipulated and Fully Synthetic Data | Rohit Kundu et.al. | 2503.15867 | null |
2025-03-20 | VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling | Hyojun Go et.al. | 2503.15855 | null |
2025-03-25 | Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion | Zhenglin Zhou et.al. | 2503.15851 | link |
2025-03-20 | EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation | Zihao Zhang et.al. | 2503.15831 | null |
2025-03-20 | Controlling Avatar Diffusion with Learnable Gaussian Embedding | Xuan Gao et.al. | 2503.15809 | null |
2025-03-20 | RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models | Parham Saremi et.al. | 2503.15784 | null |
2025-03-20 | ATTENTION2D: Communication Efficient Distributed Self-Attention Mechanism | Venmugil Elango et.al. | 2503.15758 | null |
2025-03-19 | Uncertainty-Aware Diffusion Guided Refinement of 3D Scenes | Sarosij Bose et.al. | 2503.15742 | null |
2025-03-23 | Multi-focal Conditioned Latent Diffusion for Person Image Synthesis | Jiaqi Liu et.al. | 2503.15686 | link |
2025-03-19 | CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image | Arindam Dutta et.al. | 2503.15671 | null |
2025-03-19 | CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation | Masud Ahmed et.al. | 2503.15617 | link |
2025-03-19 | Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling | Yanchen Luo et.al. | 2503.15567 | null |
2025-03-19 | FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers | Ruichen Chen et.al. | 2503.15465 | link |
2025-03-19 | Di $\mathtt{[M]}$ O: Distilling Masked Diffusion Models into One-step Generator | Yuanzhi Zhu et.al. | 2503.15457 | null |
2025-03-19 | MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space | Lixing Xiao et.al. | 2503.15451 | null |
2025-03-19 | Temporal Regularization Makes Your Video Generator Stronger | Harold Haodong Chen et.al. | 2503.15417 | null |
2025-03-24 | Visual Persona: Foundation Model for Full-Body Human Customization | Jisu Nam et.al. | 2503.15406 | null |
2025-03-19 | CCDP: Composition of Conditional Diffusion Policies with Guided Sampling | Amirreza Razmjoo et.al. | 2503.15386 | null |
2025-03-19 | Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers | Corentin Vazia et.al. | 2503.15383 | null |
2025-03-19 | TruthLens:A Training-Free Paradigm for DeepFake Detection | Ritabrata Chakraborty et.al. | 2503.15342 | null |
2025-03-19 | Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images | Euclid Collaboration et.al. | 2503.15321 | null |
2025-03-19 | TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models | Teng-Fang Hsiao et.al. | 2503.15283 | null |
2025-03-19 | LEGION: Learning to Ground and Explain for Synthetic Image Detection | Hengrui Kang et.al. | 2503.15264 | null |
2025-03-19 | Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization | Feifei Li et.al. | 2503.15197 | null |
2025-03-18 | Diffusion-based G-buffer generation and rendering | Bowen Xue et.al. | 2503.15147 | null |
2025-03-20 | VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention | Mingzhe Zheng et.al. | 2503.15138 | null |
2025-03-19 | Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings | Zonghao Ying et.al. | 2503.15092 | link |
2025-03-20 | Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis | Imanol G. Estepa et.al. | 2503.15060 | null |
2025-03-19 | Single-Step Bidirectional Unpaired Image Translation Using Implicit Bridge Consistency Distillation | Suhyeon Lee et.al. | 2503.15056 | null |
2025-03-19 | Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training | Yunwei Lan et.al. | 2503.15017 | link |
2025-03-19 | Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening | Zihan Cao et.al. | 2503.14975 | null |
2025-03-19 | Language-based Image Colorization: A Benchmark and Beyond | Yifan Li et.al. | 2503.14974 | link |
2025-03-19 | Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models | Tingxiu Chen et.al. | 2503.14966 | link |
2025-03-19 | POSTA: A Go-to Framework for Customized Artistic Poster Generation | Haoyu Chen et.al. | 2503.14908 | null |
2025-03-19 | FetalFlex: Anatomy-Guided Diffusion Model for Flexible Control on Fetal Ultrasound Image Synthesis | Yaofei Duan et.al. | 2503.14906 | null |
2025-03-19 | Efficient Personalization of Quantized Diffusion Model without Backpropagation | Hoigi Seo et.al. | 2503.14868 | null |
2025-03-19 | Temporal-Consistent Video Restoration with Pre-trained Diffusion Models | Hengkang Wang et.al. | 2503.14863 | null |
2025-03-19 | Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability | Zihao Liu et.al. | 2503.14833 | link |
2025-03-19 | MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models | Chejian Xu et.al. | 2503.14827 | null |
2025-03-18 | ShapeShift: Towards Text-to-Shape Arrangement Synthesis with Content-Aware Geometric Constraints | Vihaan Misra et.al. | 2503.14720 | null |
2025-03-18 | A Simple Combination of Diffusion Models for Better Quality Trade-Offs in Image Denoising | Jonas Dornbusch et.al. | 2503.14654 | null |
2025-03-18 | Potential Score Matching: Debiasing Molecular Structure Sampling with Potential Energy Guidance | Liya Guo et.al. | 2503.14569 | null |
2025-03-21 | SuperPC: A Single Diffusion Model for Point Cloud Completion, Upsampling, Denoising, and Colorization | Yi Du et.al. | 2503.14558 | null |
2025-03-18 | MusicInfuser: Making Video Diffusion Listen and Dance | Susung Hong et.al. | 2503.14505 | null |
2025-03-18 | The Power of Context: How Multimodality Improves Image Super-Resolution | Kangfu Mei et.al. | 2503.14503 | null |
2025-03-18 | Deeply Supervised Flow-Based Generative Models | Inkyu Shin et.al. | 2503.14494 | null |
2025-03-18 | Stable Virtual Camera: Generative View Synthesis with Diffusion Models | Jensen et.al. | 2503.14489 | null |
2025-03-18 | DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers | Minglei Shi et.al. | 2503.14487 | null |
2025-03-18 | Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset | Yiqun Mei et.al. | 2503.14485 | null |
2025-03-18 | ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing | Yulin Pan et.al. | 2503.14482 | null |
2025-03-18 | SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model | Yucheng Mao et.al. | 2503.14463 | null |
2025-03-18 | Bolt3D: Generating 3D Scenes in Seconds | Stanislaw Szymanowicz et.al. | 2503.14445 | null |
2025-03-18 | MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation | Hongyu Zhang et.al. | 2503.14428 | null |
2025-03-18 | Impossible Videos | Zechen Bai et.al. | 2503.14378 | null |
2025-03-18 | RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment | Chao Wang et.al. | 2503.14358 | null |
2025-03-19 | VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation | Shoubin Yu et.al. | 2503.14350 | null |
2025-03-18 | LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models | Yu Cheng et.al. | 2503.14325 | link |
2025-03-21 | Free-Lunch Color-Texture Disentanglement for Stylized Image Generation | Jiang Qin et.al. | 2503.14275 | null |
2025-03-19 | CTSR: Controllable Fidelity-Realness Trade-off Distillation for Real-World Image Super Resolution | Runyi Li et.al. | 2503.14272 | null |
2025-03-18 | CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models | Yuyang Xue et.al. | 2503.14232 | null |
2025-03-18 | Stochastic Trajectory Prediction under Unstructured Constraints | Hao Ma et.al. | 2503.14203 | null |
2025-03-18 | Concat-ID: Towards Universal Identity-Preserving Video Synthesis | Yong Zhong et.al. | 2503.14151 | null |
2025-03-18 | Fast Autoregressive Video Generation with Diagonal Decoding | Yang Ye et.al. | 2503.14070 | null |
2025-03-18 | AIGVE-Tool: AI-Generated Video Evaluation Toolkit with Multifaceted Benchmark | Xinhao Xiang et.al. | 2503.14064 | link |
2025-03-27 | DefectFill: Realistic Defect Generation with Inpainting Diffusion Model for Visual Inspection | Jaewoo Song et.al. | 2503.13985 | null |
2025-03-18 | DIFFVSGG: Diffusion-Driven Online Video Scene Graph Generation | Mu Chen et.al. | 2503.13957 | link |
2025-03-18 | SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model | Xinqing Li et.al. | 2503.13952 | link |
2025-03-18 | Make the Most of Everything: Further Considerations on Disrupting Diffusion-based Customization | Long Tang et.al. | 2503.13945 | null |
2025-03-18 | COLSON: Controllable Learning-Based Social Navigation via Diffusion-Based Reinforcement Learning | Yuki Tomita et.al. | 2503.13934 | null |
2025-03-18 | Existence and Regularizing Effects of a Nonlinear Diffusion Model for Plasma Instabilities | William Porteous et.al. | 2503.13922 | null |
2025-03-18 | Less is More: Improving Motion Diffusion Models with Sparse Keyframes | Jinseok Bae et.al. | 2503.13859 | null |
2025-03-18 | SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing | Seokhyeon Hong et.al. | 2503.13836 | link |
2025-03-18 | General mean-field stochastic linear quadratic control problem driven by Lévy processes with random coefficients | Yanyan Tang et.al. | 2503.13835 | null |
2025-03-18 | VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences | Anukriti Singh et.al. | 2503.13817 | null |
2025-03-21 | Continual Unlearning for Foundational Text-to-Image Models without Generalization Erosion | Kartik Thakral et.al. | 2503.13769 | null |
2025-03-17 | TextInVision: Text and Prompt Complexity Driven Visual Text Generation Benchmark | Forouzan Fallah et.al. | 2503.13730 | null |
2025-03-17 | Mitigating Spectral Bias in Neural Operators via High-Frequency Scaling for Physical Systems | Siavash Khodakarami et.al. | 2503.13695 | link |
2025-03-17 | Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization | Hao Li et.al. | 2503.13617 | null |
2025-03-17 | A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models | Ziqiang Li et.al. | 2503.13576 | null |
2025-03-16 | Adaptive AUV Hunting Policy with Covert Communication via Diffusion Model | Xu Guo et.al. | 2503.13547 | null |
2025-03-16 | CNCast: Leveraging 3D Swin Transformer and DiT for Enhanced Regional Weather Forecasting | Hongli Liang et.al. | 2503.13546 | null |
2025-03-16 | DDPM-Polycube: A Denoising Diffusion Probabilistic Model for Polycube-Based Hexahedral Mesh Generation and Volumetric Spline Construction | Yuxuan Yu et.al. | 2503.13541 | null |
2025-03-12 | Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection | Yucheng Suo et.al. | 2503.13500 | null |
2025-03-17 | Unified Autoregressive Visual Generation and Understanding with Continuous Tokens | Lijie Fan et.al. | 2503.13436 | null |
2025-03-17 | BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing | Yaowei Li et.al. | 2503.13434 | null |
2025-03-17 | One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation | Daniil Selikhanovych et.al. | 2503.13358 | null |
2025-03-17 | Edit Transfer: Learning Image Editing via Vision In-Context Relations | Lan Chen et.al. | 2503.13327 | null |
2025-03-17 | Progressive Human Motion Generation Based on Text and Few Motion Frames | Ling-An Zeng et.al. | 2503.13300 | link |
2025-03-17 | Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors | Katja Schwarz et.al. | 2503.13272 | null |
2025-03-19 | FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis | Luxi Chen et.al. | 2503.13265 | null |
2025-03-17 | MAME: Multidimensional Adaptive Metamer Exploration with Human Perceptual Feedback | Mina Kamao et.al. | 2503.13212 | null |
2025-03-17 | MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis | Marvin Seyfarth et.al. | 2503.13211 | null |
2025-03-17 | Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images | Yaxi Chen et.al. | 2503.13131 | null |
2025-03-17 | DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry | Jing Li et.al. | 2503.13110 | link |
2025-03-17 | Beyond Classical Diffusion: Fractional Derivatives in Transport and Stochastic Systems | Cypres Verbeeck et.al. | 2503.13096 | null |
2025-03-19 | Orbit-Controlled Generation of Two-color Attosecond Mode-locked Free-electron Lasers | Tu Lingjun et.al. | 2503.13088 | null |
2025-03-17 | Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation | Yihong Luo et.al. | 2503.13070 | null |
2025-03-17 | Dynamic Relation Inference via Verb Embeddings | Omri Suissa et.al. | 2503.13021 | null |
2025-03-17 | TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba | Jiaxu Liu et.al. | 2503.13004 | null |
2025-03-17 | Training Video Foundation Models with NVIDIA NeMo | Zeeshan Patel et.al. | 2503.12964 | null |
2025-03-17 | Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait | Chaolong Yang et.al. | 2503.12963 | link |
2025-03-17 | Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction | Zheyuan Liu et.al. | 2503.12953 | null |
2025-03-17 | FNSE-SBGAN: Far-field Speech Enhancement with Schrodinger Bridge and Generative Adversarial Networks | Tong Lei et.al. | 2503.12936 | link |
2025-03-17 | AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction | Xuying Zhang et.al. | 2503.12929 | null |
2025-03-17 | DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models | Dewei Zhou et.al. | 2503.12885 | null |
2025-03-17 | DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode | Junjia Huang et.al. | 2503.12838 | null |
2025-03-17 | AUTV: Creating Underwater Video Datasets with Pixel-wise Annotations | Quang Trung Truong et.al. | 2503.12828 | null |
2025-03-17 | VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis | Zhifeng Wang et.al. | 2503.12758 | null |
2025-03-17 | GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching | Feng Qiao et.al. | 2503.12720 | link |
2025-03-16 | UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing | Tsu-Jui Fu et.al. | 2503.12652 | null |
2025-03-16 | Understanding Driver Cognition and Decision-Making Behaviors in High-Risk Scenarios: A Drift Diffusion Perspective | Heye Huang et.al. | 2503.12637 | null |
2025-03-16 | LATINO-PRO: LAtent consisTency INverse sOlver with PRompt Optimization | Alessio Spagnoletti et.al. | 2503.12615 | null |
2025-03-16 | Personalize Anything for Free with Diffusion Transformer | Haoran Feng et.al. | 2503.12590 | null |
2025-03-16 | BalancedDPO: Adaptive Multi-Metric Alignment | Dipesh Tamboli et.al. | 2503.12575 | null |
2025-03-16 | Diffusion on Graph: Augmentation of Graph Structure for Node Classification | Yancheng Wang et.al. | 2503.12563 | null |
2025-03-16 | Debiasing Diffusion Model: Enhancing Fairness through Latent Representation Learning in Stable Diffusion Model | Lin-Chun Huang et.al. | 2503.12536 | null |
2025-03-16 | SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs | Guibiao Liao et.al. | 2503.12535 | null |
2025-03-16 | Towards Suturing World Models: Learning Predictive Models for Robotic Surgical Tasks | Mehmet Kerem Turkcan et.al. | 2503.12531 | null |
2025-03-16 | EditID: Training-Free Editable ID Customization for Text-to-Image Generation | Guandong Li et.al. | 2503.12526 | null |
2025-03-16 | Segment Any-Quality Images with Generative Latent Space Enhancement | Guangqian Guo et.al. | 2503.12507 | null |
2025-03-16 | SING: Semantic Image Communications using Null-Space and INN-Guided Diffusion Models | Jiakang Chen et.al. | 2503.12484 | null |
2025-03-16 | LazyMAR: Accelerating Masked Autoregressive Models via Feature Caching | Feihong Yan et.al. | 2503.12450 | null |
2025-03-16 | MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification | Jianwei Zhao et.al. | 2503.12401 | null |
2025-03-16 | Pathology Image Restoration via Mixture of Prompts | Jiangdong Cai et.al. | 2503.12399 | link |
2025-03-16 | Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation | Byung Hyun Lee et.al. | 2503.12356 | link |
2025-03-15 | Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection | Shufan Li et.al. | 2503.12271 | null |
2025-03-15 | Generalized transition uncertainties in constrained Markov decision processes | V Varagapriya et.al. | 2503.12238 | null |
2025-03-15 | STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation | Ruyu Wang et.al. | 2503.12213 | null |
2025-03-15 | D4orm: Multi-Robot Trajectories with Dynamics-aware Diffusion Denoised Deformations | Yuhao Zhang et.al. | 2503.12204 | null |
2025-03-15 | FAILS: A Framework for Automated Collection and Analysis of LLM Service Incidents | Sándor Battaglini-Fischer et.al. | 2503.12185 | link |
2025-03-15 | LAPIG: Language Guided Projector Image Generation with Surface Adaptation and Stylization | Yuchen Deng et.al. | 2503.12173 | null |
2025-03-15 | SEAL: Semantic Aware Image Watermarking | Kasra Arabi et.al. | 2503.12172 | link |
2025-03-15 | DiffAD: A Unified Diffusion Modeling Approach for Autonomous Driving | Tao Wang et.al. | 2503.12170 | null |
2025-03-15 | Z-Magic: Zero-shot Multiple Attributes Guided Image Creator | Yingying Deng et.al. | 2503.12124 | null |
2025-03-15 | A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI | Paula Andrea Pérez-Toro et.al. | 2503.12102 | null |
2025-03-15 | A Comprehensive Survey on Knowledge Distillation | Amir M. Mansourian et.al. | 2503.12067 | link |
2025-03-15 | TACO: Taming Diffusion for in-the-wild Video Amodal Completion | Ruijie Lu et.al. | 2503.12049 | null |
2025-03-15 | SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering | Byeongjun Park et.al. | 2503.12024 | link |
2025-03-15 | Compose Your Aesthetics: Empowering Text-to-Image Models with the Principles of Art | Zhe Jin et.al. | 2503.12018 | null |
2025-03-15 | QDM: Quadtree-Based Region-Adaptive Sparse Diffusion Models for Efficient Image Super-Resolution | Donglin Yang et.al. | 2503.12015 | link |
2025-03-15 | Winning the MIDST Challenge: New Membership Inference Attacks on Diffusion Models for Tabular Data Synthesis | Xiaoyu Wu et.al. | 2503.12008 | link |
2025-03-15 | WiFi-Diffusion: Achieving Fine-Grained WiFi Radio Map Estimation With Ultra-Low Sampling Rate by Diffusion Models | Zhiyuan Liu et.al. | 2503.12004 | null |
2025-03-15 | Diffusion Dynamics Models with Generative State Estimation for Cloth Manipulation | Tongxuan Tian et.al. | 2503.11999 | null |
2025-03-15 | DecompDreamer: Advancing Structured 3D Asset Generation with Multi-Object Decomposition and Gaussian Splatting | Utkarsh Nath et.al. | 2503.11981 | null |
2025-03-15 | MoDM: Efficient Serving for Image Generation via Mixture-of-Diffusion Models | Yuchen Xia et.al. | 2503.11972 | null |
2025-03-15 | Your Text Encoder Can Be An Object-Level Watermarking Controller | Naresh Kumar Devulapally et.al. | 2503.11945 | null |
2025-03-15 | Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder | Wonwoong Cho et.al. | 2503.11937 | null |
2025-03-15 | Generating a Biometrically Unique and Realistic Iris Database | Jingxuan Zhang et.al. | 2503.11930 | null |
2025-03-14 | Upcycling Text-to-Image Diffusion Models for Multi-Task Capabilities | Ruchika Chavhan et.al. | 2503.11905 | null |
2025-03-14 | Diffuse-CLoC: Guided Diffusion for Physics-based Character Look-ahead Control | Xiaoyu Huang et.al. | 2503.11801 | null |
2025-03-19 | Controllable Latent Diffusion for Traffic Simulation | Yizhuo Xiao et.al. | 2503.11771 | link |
2025-03-14 | Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation | Xianming Zeng et.al. | 2503.11731 | null |
2025-03-13 | Fine-Tuning Diffusion Generative Models via Rich Preference Optimization | Hanyang Zhao et.al. | 2503.11720 | null |
2025-03-14 | ReCamMaster: Camera-Controlled Generative Rendering from A Single Video | Jianhong Bai et.al. | 2503.11647 | null |
2025-03-14 | Pathology Image Compression with Pre-trained Autoencoders | Srikar Yellapragada et.al. | 2503.11591 | null |
2025-03-14 | Dynamics of a coupled nonlocal PDE-ODE system with spatial memory: well-posedness, stability, and bifurcation analysis | Yurij Salmaniw et.al. | 2503.11550 | null |
2025-03-14 | HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models | Ziqin Zhou et.al. | 2503.11513 | null |
2025-03-14 | T2I-FineEval: Fine-Grained Compositional Metric for Text-to-Image Evaluation | Seyed Mohammad Hadi Hosseini et.al. | 2503.11481 | null |
2025-03-14 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation | Hongxiang Zhao et.al. | 2503.11423 | null |
2025-03-14 | MTV-Inpaint: Multi-Task Long Video Inpainting | Shiyuan Yang et.al. | 2503.11412 | null |
2025-03-14 | Towards A Correct Usage of Cryptography in Semantic Watermarks for Diffusion Models | Jonas Thietke et.al. | 2503.11404 | null |
2025-03-14 | BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion Model | Ziyue Wang et.al. | 2503.11372 | link |
2025-03-14 | Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking | Ziyi Wang et.al. | 2503.11324 | null |
2025-03-14 | BriLLM: Brain-inspired Large Language Model | Hai Zhao et.al. | 2503.11299 | null |
2025-03-14 | CyclePose – Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy | Jonas Utz et.al. | 2503.11266 | null |
2025-03-14 | Noise Synthesis for Low-Light Image Denoising with Diffusion Models | Liying Lu et.al. | 2503.11262 | null |
2025-03-14 | Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model | Haoyang Huang et.al. | 2503.11251 | link |
2025-03-14 | Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards | Zijing Hu et.al. | 2503.11240 | link |
2025-03-19 | Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption | Du Chen et.al. | 2503.11221 | null |
2025-03-14 | Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation | Fengchen He et.al. | 2503.11213 | link |
2025-03-14 | Provenance Detection for AI-Generated Images: Combining Perceptual Hashing, Homomorphic Encryption, and AI Detection Models | Shree Singhi et.al. | 2503.11195 | null |
2025-03-14 | Cross-Modal Learning for Music-to-Music-Video Description Generation | Zhuoyuan Mao et.al. | 2503.11190 | null |
2025-03-14 | Multi-Stage Generative Upscaler: Reconstructing Football Broadcast Images via Diffusion Models | Luca Martini et.al. | 2503.11181 | null |
2025-03-14 | Neurons: Emulating the Human Visual Cortex Improves Fidelity and Interpretability in fMRI-to-Video Reconstruction | Haonan Wang et.al. | 2503.11167 | link |
2025-03-14 | Direction-Aware Diagonal Autoregressive Image Generation | Yijia Xu et.al. | 2503.11129 | null |
2025-03-14 | DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation | Hongbin Lin et.al. | 2503.11122 | link |
2025-03-14 | Vipera: Towards systematic auditing of generative text-to-image models at scale | Yanwei Huang et.al. | 2503.11113 | null |
2025-03-14 | Understanding Flatness in Generative Models: Its Role and Benefits | Taehwan Lee et.al. | 2503.11078 | null |
2025-03-14 | Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models | Hongyang Wei et.al. | 2503.11073 | link |
2025-03-17 | Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models | Zhenguang Liu et.al. | 2503.11071 | link |
2025-03-14 | Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization | Kyle Sargent et.al. | 2503.11056 | null |
2025-03-14 | LUSD: Localized Update Score Distillation for Text-Guided Image Editing | Worameth Chinchuthakun et.al. | 2503.11054 | link |
2025-03-19 | PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing | Hasan Iqbal et.al. | 2503.11044 | null |
2025-03-14 | InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences | Hongkai Zheng et.al. | 2503.11043 | null |
2025-03-14 | ACMo: Attribute Controllable Motion Generation | Mingjie Wei et.al. | 2503.11038 | null |
2025-03-14 | EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models | Yixuan Zhang et.al. | 2503.11028 | null |
2025-03-13 | RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors | Avinash Paliwal et.al. | 2503.10860 | link |
2025-03-13 | The Power of One: A Single Example is All it Takes for Segmentation in VLMs | Mir Rayat Imtiaz Hossain et.al. | 2503.10779 | null |
2025-03-13 | Visual Polarization Measurement Using Counterfactual Image Generation | Mohammad Mosaffa et.al. | 2503.10738 | null |
2025-03-12 | 3D Multiphase Heterogeneous Microstructure Generation Using Conditional Latent Diffusion Models | Nirmal Baishnab et.al. | 2503.10711 | null |
2025-03-12 | Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework | Jing Wang et.al. | 2503.10704 | null |
2025-03-12 | TA-V2A: Textually Assisted Video-to-Audio Generation | Yuhuan You et.al. | 2503.10700 | null |
2025-03-12 | Zero-Shot Subject-Centric Generation for Creative Application Using Entropy Fusion | Kaifeng Zou et.al. | 2503.10697 | null |
2025-03-12 | Neighboring Autoregressive Modeling for Efficient Visual Generation | Yefei He et.al. | 2503.10696 | link |
2025-03-12 | Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation | Qiji Zhou et.al. | 2503.10691 | null |
2025-03-12 | Context-guided Responsible Data Augmentation with Diffusion Models | Khawar Islam et.al. | 2503.10687 | link |
2025-03-11 | Understanding the Quality-Diversity Trade-off in Diffusion Language Models | Zak Buzzard et.al. | 2503.10683 | link |
2025-03-11 | End-to-end Learning of Sparse Interventions on Activations to Steer Generation | Pau Rodriguez et.al. | 2503.10679 | null |
2025-03-11 | VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion | Lehan Yang et.al. | 2503.10678 | link |
2025-03-08 | Text-to-3D Generation using Jensen-Shannon Score Distillation | Khoi Do et.al. | 2503.10660 | null |
2025-03-13 | GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing | Rongyao Fang et.al. | 2503.10639 | link |
2025-03-13 | Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective | Xiaoming Zhao et.al. | 2503.10638 | null |
2025-03-14 | Distilling Diversity and Control in Diffusion Models | Rohit Gandikota et.al. | 2503.10637 | null |
2025-03-14 | V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes | Yanming Zhang et.al. | 2503.10634 | null |
2025-03-17 | HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model | Jiaming Liu et.al. | 2503.10631 | null |
2025-03-13 | NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models | Mert Albaba et.al. | 2503.10626 | null |
2025-03-14 | DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation | Chen Chen et.al. | 2503.10618 | null |
2025-03-13 | CoSTA $\ast$ : Cost-Sensitive Toolpath Agent for Multi-turn Image Editing | Advait Gupta et.al. | 2503.10613 | link |
2025-03-13 | MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction | Yingshuang Zou et.al. | 2503.10604 | null |
2025-03-13 | CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models | Hao He et.al. | 2503.10592 | null |
2025-03-13 | Long Context Tuning for Video Generation | Yuwei Guo et.al. | 2503.10589 | null |
2025-03-13 | Autoregressive Image Generation with Randomized Parallel Decoding | Haopeng Li et.al. | 2503.10568 | link |
2025-03-13 | Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion | Evgeniia Vu et.al. | 2503.10488 | null |
2025-03-13 | RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models | Yijing Lin et.al. | 2503.10406 | null |
2025-03-13 | CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance | Yufan Deng et.al. | 2503.10391 | null |
2025-03-13 | ConceptGuard: Continual Personalized Text-to-Image Generation with Forgetting and Confusion Mitigation | Zirun Guo et.al. | 2503.10358 | null |
2025-03-13 | Do I look like a cat.n.01 to you? A Taxonomy Image Generation Benchmark |
Viktor Moskvoretskii et.al. | 2503.10357 | null |
2025-03-13 | Enhancing Facial Privacy Protection via Weakening Diffusion Purification | Ali Salar et.al. | 2503.10350 | link |
2025-03-13 | DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image | Qi Zhao et.al. | 2503.10342 | null |
2025-03-13 | CoDiPhy: A General Framework for Applying Denoising Diffusion Models to the Physical Layer of Wireless Communication Systems | Peyman Neshaastegaran et.al. | 2503.10297 | null |
2025-03-13 | MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment | Hao Zhou et.al. | 2503.10287 | null |
2025-03-13 | Efficient Diffusion Posterior Sampling for Noisy Inverse Problems | Ji Li et.al. | 2503.10237 | null |
2025-03-13 | Probability-Flow ODE in Infinite-Dimensional Function Spaces | Kunwoo Na et.al. | 2503.10219 | null |
2025-03-13 | ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning | Pengfei Luo et.al. | 2503.10166 | link |
2025-03-13 | Data augmentation using diffusion models to enhance inverse Ising inference | Yechan Lim et.al. | 2503.10154 | null |
2025-03-13 | PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models | Runze He et.al. | 2503.10127 | null |
2025-03-13 | Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation | Yi Wu et.al. | 2503.10125 | null |
2025-03-13 | MoEdit: On Learning Quantity Perception for Multi-object Image Editing | Yanfeng Li et.al. | 2503.10112 | link |
2025-03-16 | Improving Diffusion-based Inverse Algorithms under Few-Step Constraint via Learnable Linear Extrapolation | Jiawei Zhang et.al. | 2503.10103 | link |
2025-03-13 | Semantic Latent Motion for Portrait Video Generation | Qiyuan Zhang et.al. | 2503.10096 | null |
2025-03-13 | Light-weighted foundation model for seismic data processing based on representative and non-redundant pre-training dataset | Xintong Dong et.al. | 2503.10092 | null |
2025-03-13 | AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption | Joonsung Jeon et.al. | 2503.10081 | link |
2025-03-16 | VMBench: A Benchmark for Perception-Aligned Video Motion Generation | Xinran Ling et.al. | 2503.10076 | link |
2025-03-13 | Provably Secure Covert Messaging Using Image-based Diffusion Processes | Luke A. Bauer et.al. | 2503.10063 | null |
2025-03-13 | Investigating and Improving Counter-Stereotypical Action Relation in Text-to-Image Diffusion Models | Sina Malakouti et.al. | 2503.10037 | null |
2025-03-13 | Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes | JunYong Choi et.al. | 2503.09993 | null |
2025-03-13 | A Conditional Point Cloud Diffusion Model for Deformable Liver Motion Tracking Via a Single Arbitrarily-Angled X-ray Projection | Jiacheng Xie et.al. | 2503.09978 | null |
2025-03-13 | ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content | Bhavik Chandna et.al. | 2503.09964 | null |
2025-03-13 | Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification | Jiayu Jiang et.al. | 2503.09962 | link |
2025-03-21 | UVE: Are MLLMs Unified Evaluators for AI-Generated Videos? | Yuanxin Liu et.al. | 2503.09949 | link |
2025-03-13 | Cosh-DiT: Co-Speech Gesture Video Synthesis via Hybrid Audio-Visual Diffusion Transformers | Yasheng Sun et.al. | 2503.09942 | null |
2025-03-13 | PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation | Sen Wang et.al. | 2503.09938 | null |
2025-03-13 | VideoMerge: Towards Training-free Long Video Generation | Siyang Zhang et.al. | 2503.09926 | null |
2025-03-12 | LuciBot: Automated Robot Policy Learning from Generated Videos | Xiaowen Qiu et.al. | 2503.09871 | null |
2025-03-12 | Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models | Héctor Laria et.al. | 2503.09864 | null |
2025-03-14 | On the Limitations of Vision-Language Models in Understanding Image Transforms | Ahmad Mustafa Anis et.al. | 2503.09837 | null |
2025-03-12 | Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation | Feng Zhou et.al. | 2503.09830 | null |
2025-03-12 | Constrained Language Generation with Discrete Diffusion Models | Michael Cardei et.al. | 2503.09790 | null |
2025-03-12 | BiasConnect: Investigating Bias Interactions in Text-to-Image Models | Pushkar Shukla et.al. | 2503.09763 | null |
2025-03-12 | Solving Bayesian inverse problems with diffusion priors and off-policy RL | Luca Scimeca et.al. | 2503.09746 | null |
2025-03-12 | I2V3D: Controllable image-to-video generation with 3D guidance | Zhiyuan Zhang et.al. | 2503.09733 | null |
2025-03-12 | Accelerating Diffusion Sampling via Exploiting Local Transition Coherence | Shangwen Zhu et.al. | 2503.09675 | null |
2025-03-12 | Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models | Sangwon Jang et.al. | 2503.09669 | null |
2025-03-12 | CoRe^2: Collect, Reflect and Refine to Generate Better and Faster | Shitong Shao et.al. | 2503.09662 | link |
2025-03-12 | Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k | Xiangyu Peng et.al. | 2503.09642 | link |
2025-03-12 | SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation | Junsong Chen et.al. | 2503.09641 | link |
2025-03-11 | Identity Preserving Latent Diffusion for Brain Aging Modeling | Gexin Huang et.al. | 2503.09634 | null |
2025-03-11 | Adaptive Anomaly Recovery for Telemanipulation: A Diffusion Model Approach to Vision-Based Tracking | Haoyang Wang et.al. | 2503.09632 | null |
2025-03-11 | V2M4: 4D Mesh Animation Reconstruction from a Single Monocular Video | Jianqi Chen et.al. | 2503.09631 | null |
2025-03-11 | CASteer: Steering Diffusion Models for Controllable Generation | Tatiana Gaintseva et.al. | 2503.09630 | link |
2025-03-13 | RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling | Itay Chachy et.al. | 2503.09601 | link |
2025-03-12 | PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop | Chenyu Li et.al. | 2503.09595 | link |
2025-03-12 | Minimax Optimality of the Probability Flow ODE for Diffusion Models | Changxiao Cai et.al. | 2503.09583 | null |
2025-03-18 | Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models | Marianne Arriola et.al. | 2503.09573 | link |
2025-03-12 | TPDiff: Temporal Pyramid Video Diffusion Model | Lingmin Ran et.al. | 2503.09566 | null |
2025-03-12 | FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model | Jiahao Xia et.al. | 2503.09560 | null |
2025-03-12 | CM-Diff: A Single Generative Network for Bidirectional Cross-Modality Translation Diffusion Model Between Infrared and Visible Images | Bin Hu et.al. | 2503.09514 | null |
2025-03-12 | DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction | Junjie Zhou et.al. | 2503.09491 | link |
2025-03-18 | Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models | Zhihua Tian et.al. | 2503.09446 | link |
2025-03-12 | SuperCarver: Texture-Consistent 3D Geometry Super-Resolution for High-Fidelity Surface Detail Generation | Qijian Zhang et.al. | 2503.09439 | null |
2025-03-12 | PromptMap: An Alternative Interaction Style for AI-Based Image Generation | Krzysztof Adamkiewicz et.al. | 2503.09436 | link |
2025-03-12 | LHC Triggers using FPGA Image Recognition | James Brooke et.al. | 2503.09428 | null |
2025-03-12 | Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space | Yifan Zhou et.al. | 2503.09419 | link |
2025-03-12 | Diff-CL: A Novel Cross Pseudo-Supervision Method for Semi-supervised Medical Image Segmentation | Xiuzhen Guo et.al. | 2503.09408 | null |
2025-03-12 | Unified Dense Prediction of Video Diffusion | Lehan Yang et.al. | 2503.09344 | null |
2025-03-12 | Revealing the Implicit Noise-based Imprint of Generative Models | Xinghan Li et.al. | 2503.09314 | null |
2025-03-12 | Revealing Unintentional Information Leakage in Low-Dimensional Facial Portrait Representations | Kathleen Anderson et.al. | 2503.09306 | link |
2025-03-12 | UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer | Haoxuan Wang et.al. | 2503.09277 | null |
2025-03-12 | NAMI: Efficient Image Generation via Progressive Rectified Flow Transformers | Yuhang Ma et.al. | 2503.09242 | null |
2025-03-12 | Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets | Hannah Kniesel et.al. | 2503.09221 | null |
2025-03-17 | Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space | Jian Zhu et.al. | 2503.09215 | null |
2025-03-15 | WonderVerse: Extendable 3D Scene Generation with Video Generative Models | Hao Feng et.al. | 2503.09160 | null |
2025-03-17 | Reangle-A-Video: 4D Video Generation as Video-to-Video Translation | Hyeonho Jeong et.al. | 2503.09151 | null |
2025-03-12 | Spiritus: An AI-Assisted Tool for Creating 2D Characters and Animations | Qirui Sun et.al. | 2503.09127 | null |
2025-03-12 | AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks | Jin Li et.al. | 2503.09124 | null |
2025-03-12 | Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training? | Yuechen Xie et.al. | 2503.09122 | link |
2025-03-12 | Sequential Multi-Object Grasping with One Dexterous Hand | Sicheng He et.al. | 2503.09078 | null |
2025-03-12 | Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows | Chengyue Gong et.al. | 2503.09069 | null |
2025-03-11 | A Deep Bayesian Nonparametric Framework for Robust Mutual Information Estimation | Forough Fazeliasl et.al. | 2503.08902 | null |
2025-03-11 | SICNav-Diffusion: Safe and Interactive Crowd Navigation with Diffusion Trajectory Predictions | Sepehr Samavi et.al. | 2503.08858 | null |
2025-03-11 | Representing 3D Shapes With 64 Latent Vectors for 3D Diffusion Models | In Cho et.al. | 2503.08737 | null |
2025-03-11 | Preserving Product Fidelity in Large Scale Image Recontextualization with Diffusion Models | Ishaan Malhi et.al. | 2503.08729 | null |
2025-03-16 | Versatile Multimodal Controls for Whole-Body Talking Human Animation | Zheng Qin et.al. | 2503.08714 | null |
2025-03-11 | GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing | Yuanhao Wang et.al. | 2503.08678 | null |
2025-03-11 | Language-Depth Navigated Thermal and Visible Image Fusion | Jinchang Zhang et.al. | 2503.08676 | null |
2025-03-11 | Modeling Stock Return Distributions and Pricing Options | Xinxin Jiang et.al. | 2503.08666 | null |
2025-03-11 | REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder | Yitian Zhang et.al. | 2503.08665 | null |
2025-03-11 | MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention | Yuhan Wang et.al. | 2503.08664 | link |
2025-03-11 | Generating Robot Constitutions & Benchmarks for Semantic Safety | Pierre Sermanet et.al. | 2503.08663 | null |
2025-03-11 | MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input | Zhenchen Wan et.al. | 2503.08650 | null |
2025-03-11 | Rethinking Diffusion Model in High Dimension | Zhenxin Zheng et.al. | 2503.08643 | link |
2025-03-11 | LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization | Xianfeng Wu et.al. | 2503.08619 | link |
2025-03-11 | Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling | Subin Kim et.al. | 2503.08605 | null |
2025-03-11 | Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation | Mingkang Zhu et.al. | 2503.08575 | null |
2025-03-11 | Posterior-Mean Denoising Diffusion Model for Realistic PET Image Reconstruction | Yiran Sun et.al. | 2503.08546 | null |
2025-03-11 | SAS: Segment Any 3D Scene with Integrated 2D Priors | Zhuoyuan Li et.al. | 2503.08512 | null |
2025-03-11 | Learning to Match Unpaired Data with Minimum Entropy Coupling | Mustapha Bounoua et.al. | 2503.08501 | null |
2025-03-11 | Generalizable AI-Generated Image Detection Based on Fractal Self-Similarity in the Spectrum | Shengpeng Xiao et.al. | 2503.08484 | null |
2025-03-11 | NullFace: Training-Free Localized Face Anonymization | Han-Wei Kung et.al. | 2503.08478 | link |
2025-03-11 | Controlling Latent Diffusion Using Latent CLIP | Jason Becker et.al. | 2503.08455 | link |
2025-03-13 | Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models | Armando Fortes et.al. | 2503.08434 | null |
2025-03-11 | Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing | Chen Liao et.al. | 2503.08429 | link |
2025-03-11 | AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models | Kwan Yun et.al. | 2503.08417 | link |
2025-03-14 | Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens | Qingsong Xie et.al. | 2503.08377 | null |
2025-03-17 | Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis | Kai Qiu et.al. | 2503.08354 | link |
2025-03-11 | Pathology-Aware Adaptive Watermarking for Text-Driven Medical Image Synthesis | Chanyoung Kim et.al. | 2503.08346 | null |
2025-03-11 | KiteRunner: Language-Driven Cooperative Local-Global Navigation Policy with UAV Mapping in Outdoor Environments | Shibo Huang et.al. | 2503.08330 | null |
2025-03-12 | $^R$ FLAV: Rolling Flow matching for infinite Audio Video generation | Alex Ergasti et.al. | 2503.08307 | link |
2025-03-11 | D3PO: Preference-Based Alignment of Discrete Diffusion Models | Umberto Borso et.al. | 2503.08295 | null |
2025-03-11 | OminiControl2: Efficient Conditioning for Diffusion Transformers | Zhenxiong Tan et.al. | 2503.08280 | null |
2025-03-11 | PromptLNet: Region-Adaptive Aesthetic Enhancement via Prompt Guidance in Low-Light Enhancement Net | Jun Yin et.al. | 2503.08276 | null |
2025-03-11 | SARA: Structural and Adversarial Representation Alignment for Training-efficient Diffusion Models | Hesen Chen et.al. | 2503.08253 | null |
2025-03-11 | Aligning Text to Image in Diffusion Models is Easier Than You Think | Jaa-Yeon Lee et.al. | 2503.08250 | null |
2025-03-11 | MVD-HuGaS: Human Gaussians from a Single Image via 3D Human Multi-view Diffusion Prior | Kaiqiang Xiong et.al. | 2503.08218 | null |
2025-03-11 | TSCnet: A Text-driven Semantic-level Controllable Framework for Customized Low-Light Image Enhancement | Miao Zhang et.al. | 2503.08168 | null |
2025-03-11 | Multimodal Generation of Animatable 3D Human Models with AvatarForge | Xinhang Liu et.al. | 2503.08165 | null |
2025-03-11 | Concept-Driven Deep Learning for Enhanced Protein-Specific Molecular Generation | Taojie Kuang et.al. | 2503.08160 | null |
2025-03-11 | WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation | Jing Wang et.al. | 2503.08153 | null |
2025-03-11 | FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems | Jeongsol Kim et.al. | 2503.08136 | null |
2025-03-11 | ACE: Concept Editing in Diffusion Models without Performance Degradation | Ruipeng Wang et.al. | 2503.08116 | null |
2025-03-11 | MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution | Xinrui Li et.al. | 2503.08096 | link |
2025-03-11 | Seeing Beyond Haze: Generative Nighttime Image Dehazing | Beibei Lin et.al. | 2503.08073 | null |
2025-03-11 | STGDPM:Vessel Trajectory Prediction with Spatio-Temporal Graph Diffusion Probabilistic Model | Jin Wenzhe et.al. | 2503.08065 | null |
2025-03-11 | ObjectMover: Generative Object Movement with Video Prior | Xin Yu et.al. | 2503.08037 | null |
2025-03-11 | HOFAR: High-Order Augmentation of Flow Autoregressive Transformers | Yingyu Liang et.al. | 2503.08032 | null |
2025-03-11 | Exploring Bias in over 100 Text-to-Image Generative Models | Jordan Vice et.al. | 2503.08012 | null |
2025-03-12 | CDI3D: Cross-guided Dense-view Interpolation for 3D Reconstruction | Zhiyuan Wu et.al. | 2503.08005 | null |
2025-03-11 | How Can Video Generative AI Transform K-12 Education? Examining Teachers’ Perspectives through TPACK and TAM | Unggi Lee et.al. | 2503.08003 | null |
2025-03-11 | DiffEGG: Diffusion-Driven Edge Generation as a Pixel-Annotation-Free Alternative for Instance Annotation | Sanghyun Jo et.al. | 2503.07982 | null |
2025-03-11 | Generalizations of Total Dual Integrality | Bertrand Guenin et.al. | 2503.07925 | null |
2025-03-18 | Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia | Samuel Cahyawijaya et.al. | 2503.07920 | link |
2025-03-10 | Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models? | Yuru Jia et.al. | 2503.07890 | null |
2025-03-10 | AdaptSR: Low-Rank Adaptation for Efficient and Scalable Real-World Super-Resolution | Cansu Korkmaz et.al. | 2503.07748 | null |
2025-03-10 | Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model | Lixue Gong et.al. | 2503.07703 | null |
2025-03-10 | RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories | Huiyang Shao et.al. | 2503.07699 | null |
2025-03-16 | PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity | Kwanyoung Kim et.al. | 2503.07677 | null |
2025-03-08 | Disrupting Model Merging: A Parameter-Level Defense Without Sacrificing Accuracy | Wei Junhao et.al. | 2503.07661 | null |
2025-03-06 | The day-ahead scenario generation method for new energy based on an improved conditional generative diffusion model | Changgang Wang et.al. | 2503.07648 | null |
2025-03-10 | DreamRelation: Relation-Centric Video Customization | Yujie Wei et.al. | 2503.07602 | null |
2025-03-10 | Balanced Image Stylization with Style Matching Score | Yuxin Jiang et.al. | 2503.07601 | null |
2025-03-11 | VACE: All-in-One Video Creation and Editing | Zeyinzi Jiang et.al. | 2503.07598 | null |
2025-03-10 | Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation | Tianyu Chen et.al. | 2503.07578 | null |
2025-03-12 | Inductive Moment Matching | Linqi Zhou et.al. | 2503.07565 | null |
2025-03-10 | V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Guiwei Zhang et.al. | 2503.07493 | link |
2025-03-10 | GenAIReading: Augmenting Human Cognition with Interactive Digital Textbooks Using Large Language Models and Image Generation Models | Ryugo Morita et.al. | 2503.07463 | null |
2025-03-10 | DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks | Feiran You et.al. | 2503.07433 | link |
2025-03-10 | AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion | Mingzhen Sun et.al. | 2503.07418 | null |
2025-03-10 | TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision | Shaobin Zhuang et.al. | 2503.07416 | null |
2025-03-10 | SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models | Ouxiang Li et.al. | 2503.07392 | link |
2025-03-12 | PersonaBooth: Personalized Text-to-Motion Generation | Boeun Kim et.al. | 2503.07390 | null |
2025-03-10 | TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models | Ruidong Chen et.al. | 2503.07389 | link |
2025-03-10 | Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation Alignment | Xing Xie et.al. | 2503.07334 | link |
2025-03-10 | Automated Movie Generation via Multi-Agent CoT Planning | Weijia Wu et.al. | 2503.07314 | link |
2025-03-10 | AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models | Bo Huang et.al. | 2503.07307 | link |
2025-03-10 | Efficient Distillation of Classifier-Free Guidance using Adapters | Cristian Perez Jensen et.al. | 2503.07274 | null |
2025-03-10 | WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation | Yuwei Niu et.al. | 2503.07265 | link |
2025-03-11 | AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis | Zhangyu Lai et.al. | 2503.07253 | null |
2025-03-11 | Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios | Chenglu Pan et.al. | 2503.07232 | null |
2025-03-10 | Synthetic Lung X-ray Generation through Cross-Attention and Affinity Transformation | Ruochen Pi et.al. | 2503.07209 | null |
2025-03-10 | Effective and Efficient Masked Image Generation Models | Zebin You et.al. | 2503.07197 | link |
2025-03-11 | Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms | Jiaming Song et.al. | 2503.07154 | null |
2025-03-10 | Controllable 3D Outdoor Scene Generation via Scene Graphs | Yuheng Liu et.al. | 2503.07152 | link |
2025-03-10 | VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation | Hanzhi Chen et.al. | 2503.07135 | null |
2025-03-10 | NFIG: Autoregressive Image Generation with Next-Frequency Prediction | Zhihao Huang et.al. | 2503.07076 | null |
2025-03-10 | TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation | Victor Shea-Jay Huang et.al. | 2503.07050 | null |
2025-03-10 | Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion | Yongle Zhang et.al. | 2503.07047 | null |
2025-03-10 | EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer | Yuxuan Zhang et.al. | 2503.07027 | null |
2025-03-10 | NukesFormers: Unpaired Hyperspectral Image Generation with Non-Uniform Domain Alignment | Jiaojiao Li et.al. | 2503.07004 | null |
2025-03-10 | SOYO: A Tuning-Free Approach for Video Style Morphing via Style-Adaptive Interpolation in Diffusion Models | Haoyu Zheng et.al. | 2503.06998 | null |
2025-03-10 | Synchronized Video-to-Audio Generation via Mel Quantization-Continuum Decomposition | Juncheng Wang et.al. | 2503.06984 | null |
2025-03-10 | Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation | Pengchen Liang et.al. | 2503.06976 | null |
2025-03-10 | LatexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending | Jian Jin et.al. | 2503.06956 | null |
2025-03-10 | Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping | Ning Ding et.al. | 2503.06930 | null |
2025-03-10 | From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers | Jiacheng Liu et.al. | 2503.06923 | link |
2025-03-10 | Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help | Yuefan Cao et.al. | 2503.06884 | null |
2025-03-10 | Generic linear convergence for algorithms of non-linear least squares over smooth varieties | Shenglong Hu et.al. | 2503.06877 | null |
2025-03-10 | Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting | Cagri Gungor et.al. | 2503.06860 | null |
2025-03-10 | Interactive Tumor Progression Modeling via Sketch-Based Image Editing | Gexin Huang et.al. | 2503.06809 | null |
2025-03-09 | VideoPhy-2: A Challenging Action-Centric Physical Commonsense Evaluation in Video Generation | Hritik Bansal et.al. | 2503.06800 | null |
2025-03-09 | RoboDesign1M: A Large-scale Dataset for Robot Design Understanding | Tri Le et.al. | 2503.06796 | null |
2025-03-09 | GenDR: Lightning Generative Detail Restorator | Yan Wang et.al. | 2503.06790 | null |
2025-03-09 | Infinite Leagues Under the Sea: Photorealistic 3D Underwater Terrain Generation by Latent Fractal Diffusion Models | Tianyi Zhang et.al. | 2503.06784 | null |
2025-03-09 | DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion | Hantao Zhang et.al. | 2503.06748 | null |
2025-03-09 | Color Alignment in Diffusion | Ka Chun Shum et.al. | 2503.06746 | null |
2025-03-09 | D3DR: Lighting-Aware Object Insertion in Gaussian Splatting | Vsevolod Skorokhodov et.al. | 2503.06740 | null |
2025-03-09 | What’s in a Latent? Leveraging Diffusion Latent Space for Domain Generalization | Xavier Thomas et.al. | 2503.06698 | link |
2025-03-09 | Diffusion Model Based Probabilistic Day-ahead Load Forecasting | Ding Lin et.al. | 2503.06697 | null |
2025-03-09 | UniGenX: Unified Generation of Sequence and Structure with Autoregressive Diffusion | Gongbo Zhang et.al. | 2503.06687 | null |
2025-03-09 | PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation | Yanjie Pan et.al. | 2503.06684 | null |
2025-03-12 | Learning Few-Step Diffusion Models by Trajectory Distribution Matching | Yihong Luo et.al. | 2503.06674 | link |
2025-03-09 | AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation | Yang Zou et.al. | 2503.06660 | null |
2025-03-12 | Adding Additional Control to One-Step Diffusion with Joint Distribution Matching | Yihong Luo et.al. | 2503.06652 | null |
2025-03-09 | CLAD: Constrained Latent Action Diffusion for Vision-Language Procedure Planning | Lei Shi et.al. | 2503.06637 | null |
2025-03-09 | Towards More Accurate Personalized Image Generation: Addressing Overfitting and Evaluation Bias | Mingxiao Li et.al. | 2503.06632 | link |
2025-03-09 | Conceptrol: Concept Control of Zero-shot Personalized Image Generation | Qiyuan He et.al. | 2503.06568 | link |
2025-03-09 | TR-DQ: Time-Rotation Diffusion Quantization | Yihua Shao et.al. | 2503.06564 | null |
2025-03-09 | Generative modelling with jump-diffusions | Adrian Baule et.al. | 2503.06558 | link |
2025-03-09 | QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation | Junyi Wu et.al. | 2503.06545 | link |
2025-03-09 | ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy | Jianwen Sun et.al. | 2503.06542 | null |
2025-03-09 | One-Step Diffusion Model for Image Motion-Deblurring | Xiaoyang Liu et.al. | 2503.06537 | link |
2025-03-11 | LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation | Quanjian Song et.al. | 2503.06508 | link |
2025-03-09 | Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation | Amir Mohammad Izadi et.al. | 2503.06506 | null |
2025-03-09 | DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability | Xirui Hu et.al. | 2503.06505 | null |
2025-03-09 | Seismic wavefield solutions via physics-guided generative neural operator | Shijun Cheng et.al. | 2503.06488 | null |
2025-03-09 | A Mesh Is Worth 512 Numbers: Spectral-domain Diffusion Modeling for High-dimension Shape Generation | Jiajie Fan et.al. | 2503.06485 | null |
2025-03-09 | NaviDet: Efficient Input-level Backdoor Detection on Text-to-Image Synthesis via Neuron Activation Variation | Shengfang Zhai et.al. | 2503.06453 | null |
2025-03-09 | CtrTab: Tabular Data Synthesis with High-Dimensional and Limited Data | Zuqing Li et.al. | 2503.06444 | null |
2025-03-09 | Federated Learning for Diffusion Models | Zihao Peng et.al. | 2503.06426 | null |
2025-03-09 | Consistent Image Layout Editing with Diffusion Models | Tao Xia et.al. | 2503.06419 | null |
2025-03-09 | ProSE: Diffusion Priors for Speech Enhancement | Sonal Kumar et.al. | 2503.06375 | null |
2025-03-09 | Generative Video Bi-flow | Chen Liu et.al. | 2503.06364 | null |
2025-03-08 | Backdoor Attacks on Discrete Graph Diffusion Models | Jiawen Wang et.al. | 2503.06340 | null |
2025-03-08 | Text2Story: Advancing Video Storytelling with Text Guidance | Taewon Kang et.al. | 2503.06310 | null |
2025-03-08 | Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding | Seil Kang et.al. | 2503.06287 | null |
2025-03-08 | The Correspondence Between Leaky-Box and Diffusion Models of Cosmic-Ray Propagation | Ramanath Cowsik et.al. | 2503.06281 | null |
2025-03-08 | WaveStitch: Flexible and Fast Conditional Time Series Generation with Diffusion Models | Aditya Shankar et.al. | 2503.06231 | link |
2025-03-08 | Reinforced Diffuser for Red Teaming Large Vision-Language Models | Ruofan Wang et.al. | 2503.06223 | null |
2025-03-08 | Explainable Synthetic Image Detection through Diffusion Timestep Ensembling | Yixin Wu et.al. | 2503.06201 | null |
2025-03-14 | PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model | Xiang Gao et.al. | 2503.06186 | null |
2025-03-08 | FORESCENE: FOREcasting human activity via latent SCENE graphs diffusion | Antonio Alliegro et.al. | 2503.06182 | null |
2025-03-08 | ROCM: RLHF on consistency models | Shivanshu Shekhar et.al. | 2503.06171 | null |
2025-03-12 | Object-Centric World Model for Language-Guided Manipulation | Youngjoon Jeong et.al. | 2503.06170 | null |
2025-03-08 | VACT: A Video Automatic Causal Testing System and a Benchmark | Haotong Yang et.al. | 2503.06163 | null |
2025-03-08 | BioMoDiffuse: Physics-Guided Biomechanical Diffusion for Controllable and Authentic Human Motion Synthesis | Zixi Kang et.al. | 2503.06151 | null |
2025-03-08 | VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models | Xinan He et.al. | 2503.06142 | null |
2025-03-08 | GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation | Ye Tao et.al. | 2503.06136 | null |
2025-03-08 | FlowMP: Learning Motion Fields for Robot Planning with Conditional Flow Matching | Khang Nguyen et.al. | 2503.06135 | null |
2025-03-08 | X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation | Jian Ma et.al. | 2503.06134 | link |
2025-03-08 | USP: Unified Self-Supervised Pretraining for Image Generation and Understanding | Xiangxiang Chu et.al. | 2503.06132 | link |
2025-03-11 | PointDiffuse: A Dual-Conditional Diffusion Model for Enhanced Point Cloud Semantic Segmentation | Yong He et.al. | 2503.06094 | null |
2025-03-08 | DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation | Runze Zhang et.al. | 2503.06053 | null |
2025-03-08 | The Role of Affective States in Computational Psychiatry | David Benrimoh et.al. | 2503.06049 | null |
2025-03-08 | Invasion dynamics of super invaders: Elimination of Allee effects by a strategy at the range boundary | Yihong Du et.al. | 2503.06020 | null |
2025-03-07 | MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice | Hongwei Yi et.al. | 2503.05978 | null |
2025-03-07 | LapLoss: Laplacian Pyramid-based Multiscale loss for Image Translation | Krish Didwania et.al. | 2503.05974 | null |
2025-03-11 | An Unsupervised C-Uniform Trajectory Sampler with Applications to Model Predictive Path Integral Control | O. Goktug Poyrazoglu et.al. | 2503.05819 | null |
2025-03-04 | Multi-agent Auto-Bidding with Latent Graph Diffusion Models | Dom Huh et.al. | 2503.05805 | null |
2025-03-07 | AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data | Zengqun Zhao et.al. | 2503.05665 | link |
2025-03-07 | TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models | Mark YU et.al. | 2503.05638 | null |
2025-03-07 | Anti-Diffusion: Preventing Abuse of Modifications of Diffusion-Based Models | Zheng Li et.al. | 2503.05595 | link |
2025-03-07 | Diffusion Models for Cayley Graphs | Michael R. Douglas et.al. | 2503.05558 | null |
2025-03-10 | Accelerating db-A* for Kinodynamic Motion Planning Using Diffusion | Julius Franke et.al. | 2503.05539 | null |
2025-03-07 | Noise-Robust Radio Frequency Fingerprint Identification Using Denoise Diffusion Model | Guolin Yin et.al. | 2503.05514 | null |
2025-03-07 | Generating Building-Level Heat Demand Time Series by Combining Occupancy Simulations and Thermal Modeling | Simon Malacek et.al. | 2503.05427 | null |
2025-03-07 | Frequency Autoregressive Image Generation with Continuous Tokens | Hu Yu et.al. | 2503.05305 | null |
2025-03-07 | MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio | Xuenan Xu et.al. | 2503.05242 | link |
2025-03-07 | Unified Reward Model for Multimodal Understanding and Generation | Yibin Wang et.al. | 2503.05236 | null |
2025-03-07 | RecipeGen: A Benchmark for Real-World Recipe Image Generation | Ruoxuan Zhang et.al. | 2503.05228 | null |
2025-03-07 | Policy Constraint by Only Support Constraint for Offline Reinforcement Learning | Yunkai Gao et.al. | 2503.05207 | link |
2025-03-07 | Generative Trajectory Stitching through Diffusion Composition | Yunhao Luo et.al. | 2503.05153 | null |
2025-03-07 | Development and Enhancement of Text-to-Image Diffusion Models | Rajdeep Roshan Sahu et.al. | 2503.05149 | null |
2025-03-07 | Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs | Yingji Zhong et.al. | 2503.05082 | null |
2025-03-06 | Energy-Weighted Flow Matching for Offline Reinforcement Learning | Shiyuan Zhang et.al. | 2503.04975 | null |
2025-03-06 | Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation | Alexey Buzovkin et.al. | 2503.04871 | link |
2025-03-05 | ProReflow: Progressive Reflow with Decomposed Velocity | Lei Ke et.al. | 2503.04824 | null |
2025-03-06 | FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video | Yue Gao et.al. | 2503.04720 | null |
2025-03-06 | Compositional World Knowledge leads to High Utility Synthetic data | Sachit Gaudi et.al. | 2503.04687 | null |
2025-03-06 | What Are You Doing? A Closer Look at Controllable Human Video Generation | Emanuele Bugliarello et.al. | 2503.04666 | null |
2025-03-08 | The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation | Aoxiong Yin et.al. | 2503.04606 | link |
2025-03-07 | LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding | Shen Zhang et.al. | 2503.04344 | null |
2025-03-06 | S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting | Yecong Wan et.al. | 2503.04314 | null |
2025-03-06 | How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects | Wonkwang Lee et.al. | 2503.04257 | null |
2025-03-06 | Synthetic Data is an Elegant GIFT for Continual Vision-Language Models | Bin Wu et.al. | 2503.04229 | null |
2025-03-06 | Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models | Rui Jiang et.al. | 2503.04215 | null |
2025-03-06 | CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation | Yuki Tanaka et.al. | 2503.04164 | null |
2025-03-07 | Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration | Qianliang Wu et.al. | 2503.04127 | null |
2025-03-06 | FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis | Ziqi Ni et.al. | 2503.04067 | null |
2025-03-06 | RA-DP: Rapid Adaptive Diffusion Policy for Training-Free High-frequency Robotics Replanning | Xi Ye et.al. | 2503.04051 | null |
2025-03-06 | Underlying Semantic Diffusion for Effective and Efficient In-Context Learning | Zhong Ji et.al. | 2503.04050 | null |
2025-03-06 | Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details | Yifei Gao et.al. | 2503.04037 | null |
2025-03-06 | TextDoctor: Unified Document Image Inpainting via Patch Pyramid Diffusion Models | Wanglong Lu et.al. | 2503.04021 | null |
2025-03-06 | DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation | Amin Karimi et.al. | 2503.04006 | null |
2025-03-05 | All-atom Diffusion Transformers: Unified generative modelling of molecules and materials | Chaitanya K. Joshi et.al. | 2503.03965 | link |
2025-03-05 | Generative Learning of Densities on Manifolds | Dimitris G. Giovanis et.al. | 2503.03963 | null |
2025-03-05 | GuardDoor: Safeguarding Against Malicious Diffusion Editing via Protective Backdoors | Yaopei Zeng et.al. | 2503.03944 | null |
2025-03-05 | A non-homogeneous, non-stationary and path-dependent Markov anomalous diffusion model | Nestor Barraza et.al. | 2503.03896 | null |
2025-03-05 | Metallicity Gradients in Modern Cosmological Simulations I: Tension Between Smooth Stellar Feedback Models and Observations | Alex M. Garcia et.al. | 2503.03804 | null |
2025-03-05 | Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation | Hiroshi Takahashi et.al. | 2503.03789 | null |
2025-03-05 | Tackling Few-Shot Segmentation in Remote Sensing via Inpainting Diffusion Model | Steve Andreas Immanuel et.al. | 2503.03785 | link |
2025-03-07 | Generating Novel Brain Morphology by Deforming Learned Templates | Alan Q. Wang et.al. | 2503.03778 | link |
2025-03-05 | GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control | Xuanchi Ren et.al. | 2503.03751 | link |
2025-03-08 | Rethinking Video Tokenization: A Conditioned Diffusion-based Approach | Nianzu Yang et.al. | 2503.03708 | link |
2025-03-05 | DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Zhao Yang et.al. | 2503.03689 | link |
2025-03-05 | A Generative Approach to High Fidelity 3D Reconstruction from Text Data | Venkat Kumar R et.al. | 2503.03664 | null |
2025-03-05 | DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles | Rui Zhao et.al. | 2503.03651 | link |
2025-03-05 | Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias | Rui Lu et.al. | 2503.03595 | null |
2025-03-05 | High-Quality Virtual Single-Viewpoint Surgical Video: Geometric Autocalibration of Multiple Cameras in Surgical Lights | Yuna Kato et.al. | 2503.03558 | link |
2025-03-11 | Generative Artificial Intelligence in Robotic Manipulation: A Survey | Kun Zhang et.al. | 2503.03464 | null |
2025-03-05 | Stability analysis for set-valued optimization in Geoffroy spaces | James Larrouy et.al. | 2503.03405 | null |
2025-03-05 | Top-K Maximum Intensity Projection Priors for 3D Liver Vessel Segmentation | Xiaotong Zhang et.al. | 2503.03367 | null |
2025-03-13 | Video Super-Resolution: All You Need is a Video Diffusion Model | Zhihao Zhan et.al. | 2503.03355 | null |
2025-03-13 | Optimizing for the Shortest Path in Denoising Diffusion Model | Ping Chen et.al. | 2503.03265 | link |
2025-03-05 | GenColor: Generative Color-Concept Association in Visual Design | Yihan Hou et.al. | 2503.03236 | null |
2025-03-06 | Mocap-2-to-3: Lifting 2D Diffusion-Based Pretrained Models for 3D Motion Capture | Zhumei Wang et.al. | 2503.03222 | null |
2025-03-05 | An Analytical Theory of Power Law Spectral Bias in the Learning Dynamics of Diffusion Models | Binxu Wang et.al. | 2503.03206 | null |
2025-03-05 | Find Matching Faces Based On Face Parameters | Setu A. Bhatt et.al. | 2503.03204 | null |
2025-03-05 | WarmFed: Federated Learning with Warm-Start for Globalization and Personalization Via Personalized Diffusion Models | Tao Feng et.al. | 2503.03110 | null |
2025-03-05 | From Architectural Sketch to Conceptual Representation: Using Structure-Aware Diffusion Model to Generate Renderings of School Buildings | Zhengyang Wang et.al. | 2503.03090 | null |
2025-03-05 | Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings | Xusheng Du et.al. | 2503.03068 | null |
2025-03-04 | Can Diffusion Models Provide Rigorous Uncertainty Quantification for Bayesian Inverse Problems? | Evan Scope Crafts et.al. | 2503.03007 | link |
2025-03-08 | Robust time series generation via Schrödinger Bridge: a comprehensive evaluation | Alexandre Alouadi et.al. | 2503.02943 | null |
2025-03-04 | Diverse Controllable Diffusion Policy with Signal Temporal Logic | Yue Meng et.al. | 2503.02924 | link |
2025-03-04 | Straight-Line Diffusion Model for Efficient 3D Molecular Generation | Yuyan Ni et.al. | 2503.02918 | link |
2025-03-04 | ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models | Qinyu Zhao et.al. | 2503.02883 | link |
2025-03-04 | Prompting Generative AI with Interaction-Augmented Instructions | Leixian Shen et.al. | 2503.02874 | null |
2025-03-04 | Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts | Marta Skreta et.al. | 2503.02819 | link |
2025-03-04 | Generating Reliable Initial Velocity Models for Full-waveform Inversion with Well and Structural Constraints | Qingchen Zhang et.al. | 2503.02815 | null |
2025-03-04 | Undertrained Image Reconstruction for Realistic Degradation in Blind Image Super-Resolution | Ru Ito et.al. | 2503.02767 | null |
2025-03-04 | Generative Modeling of Microweather Wind Velocities for Urban Air Mobility | Tristan A. Shah et.al. | 2503.02690 | link |
2025-03-04 | Reflection on Data Storytelling Tools in the Generative AI Era from the Human-AI Collaboration Perspective | Haotian Li et.al. | 2503.02631 | null |
2025-03-04 | StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts | Zhaoxing Gan et.al. | 2503.02595 | null |
2025-03-04 | TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping | Xinying Hong et.al. | 2503.02578 | link |
2025-03-04 | SPG: Improving Motion Diffusion by Smooth Perturbation Guidance | Boseong Jeon et.al. | 2503.02577 | null |
2025-03-04 | PVTree: Realistic and Controllable Palm Vein Generation for Recognition Tasks | Sheng Shang et.al. | 2503.02547 | null |
2025-03-04 | RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification | Zhen Yang et.al. | 2503.02537 | null |
2025-03-04 | Q&C: When Quantization Meets Cache in Efficient Image Generation | Xin Ding et.al. | 2503.02508 | null |
2025-03-05 | BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Iterative Optimization and Diffusion Modelling | Hao Li et.al. | 2503.02445 | null |
2025-03-04 | Teaching Metric Distance to Autoregressive Multimodal Foundational Models | Jiwan Chung et.al. | 2503.02379 | null |
2025-03-05 | Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content | Zicheng Zhang et.al. | 2503.02357 | link |
2025-03-04 | Controllable Motion Generation via Diffusion Modal Coupling | Luobin Wang et.al. | 2503.02353 | link |
2025-03-04 | CQ CNN: A Hybrid Classical Quantum Convolutional Neural Network for Alzheimer’s Disease Detection Using Diffusion Generated and U Net Segmented 3D MRI | Mominul Islam et.al. | 2503.02345 | link |
2025-03-04 | GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning | Zhun Mou et.al. | 2503.02341 | null |
2025-03-04 | Diffusion-Based mmWave Radar Point Cloud Enhancement Driven by Range Images | Ruixin Wu et.al. | 2503.02300 | null |
2025-03-04 | Language-Guided Visual Perception Disentanglement for Image Quality Assessment and Conditional Image Generation | Zhichao Yang et.al. | 2503.02206 | null |
2025-03-04 | h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform | Toan Nguyen et.al. | 2503.02187 | link |
2025-03-03 | HanDrawer: Leveraging Spatial Information to Render Realistic Hands Using a Conditional Diffusion Model in Single Stage | Qifan Fu et.al. | 2503.02127 | null |
2025-03-03 | Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection | Boyong He et.al. | 2503.02101 | null |
2025-03-09 | Superscopes: Amplifying Internal Feature Representations for Language Model Interpretation | Jonathan Jacobi et.al. | 2503.02078 | link |
2025-03-03 | FRMD: Fast Robot Motion Diffusion with Consistency-Distilled Movement Primitives for Smooth Action Generation | Xirui Shi et.al. | 2503.02048 | null |
2025-03-03 | Quantifying Point Contributions: A Lightweight Framework for Efficient and Effective Query-Driven Trajectory Simplification | Yumeng Song et.al. | 2503.02047 | null |
2025-03-03 | Dynamic Search for Inference-Time Alignment in Diffusion Models | Xiner Li et.al. | 2503.02039 | null |
2025-03-03 | Morpheus: Text-Driven 3D Gaussian Splat Shape and Color Stylization | Jamie Wynn et.al. | 2503.02009 | null |
2025-03-03 | TactStyle: Generating Tactile Textures with Generative AI for Digital Fabrication | Faraz Faruqi et.al. | 2503.02007 | null |
2025-02-27 | LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces | Rashid Mushkani et.al. | 2503.01894 | null |
2025-02-26 | Uncertainty Comes for Free: Human-in-the-Loop Policies with Diffusion Models | Zhanpeng He et.al. | 2503.01876 | null |
2025-02-26 | Online Pseudo-average Shifting Attention(PASA) for Robust Low-precision LLM Inference: Algorithms and Numerical Analysis | Long Cheng et.al. | 2503.01873 | null |
2025-02-25 | FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance | Mintong Kang et.al. | 2503.01872 | null |
2025-03-03 | Denoising Functional Maps: Diffusion Models for Shape Correspondence | Aleksei Zhuravlev et.al. | 2503.01845 | null |
2025-03-03 | Jailbreaking Safeguarded Text-to-Image Models via Large Language Models | Zhengyuan Jiang et.al. | 2503.01839 | null |
2025-03-03 | Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models | Jay Zhangjie Wu et.al. | 2503.01774 | null |
2025-03-03 | VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation | Wenhao Wang et.al. | 2503.01739 | null |
2025-03-03 | Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios | Mohammad Rafid Ul Islam et.al. | 2503.01737 | null |
2025-03-03 | ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts | Linhao Huang et.al. | 2503.01667 | link |
2025-03-03 | DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models | Zhendong Wang et.al. | 2503.01645 | null |
2025-03-03 | Divide and Conquer: Heterogeneous Noise Integration for Diffusion-based Adversarial Purification | Gaozheng Pei et.al. | 2503.01407 | null |
2025-03-03 | The Road Less Traveled: Investigating Robustness and Explainability in CNN Malware Detection | Matteo Brosolo et.al. | 2503.01391 | null |
2025-03-03 | Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation | Jiantao Lin et.al. | 2503.01370 | link |
2025-03-03 | Diffusion-based Virtual Staining from Polarimetric Mueller Matrix Imaging | Xiaoyu Zheng et.al. | 2503.01352 | null |
2025-03-03 | CacheQuant: Comprehensively Accelerated Diffusion Models | Xuewen Liu et.al. | 2503.01323 | null |
2025-03-03 | HI-Series Algorithms A Hybrid of Substance Diffusion Algorithm and Collaborative Filtering | Yu Peng et.al. | 2503.01305 | null |
2025-03-03 | MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation | Yi Wang et.al. | 2503.01298 | null |
2025-03-03 | Fine-Grained Controllable Apparel Showcase Image Generation via Garment-Centric Outpainting | Rong Zhang et.al. | 2503.01294 | null |
2025-03-03 | Reconciling Stochastic and Deterministic Strategies for Zero-shot Image Restoration using Diffusion Model in Dual | Chong Wang et.al. | 2503.01288 | link |
2025-03-04 | DnD Filter: Differentiable State Estimation for Dynamic Systems using Diffusion Models | Ziyu Wan et.al. | 2503.01274 | link |
2025-03-11 | Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text | Guotao Liang et.al. | 2503.01261 | null |
2025-03-03 | Diffusion Stabilizer Policy for Automated Surgical Robot Manipulations | Chonlam Ho et.al. | 2503.01252 | null |
2025-03-03 | Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling | Jonathan Fhima et.al. | 2503.01190 | null |
2025-03-03 | DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution | Xingyuan Li et.al. | 2503.01187 | link |
2025-03-03 | Enhancing Vision-Language Compositional Understanding with Multimodal Synthetic Data | Haoxin Li et.al. | 2503.01167 | null |
2025-03-03 | Split Gibbs Discrete Diffusion Posterior Sampling | Wenda Chu et.al. | 2503.01161 | null |
2025-03-03 | EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting | Suzhen Wang et.al. | 2503.01158 | null |
2025-03-03 | CoInD: Enabling Logical Compositions in Diffusion Models | Sachit Gaudi et.al. | 2503.01145 | link |
2025-03-03 | DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning | Teng Pang et.al. | 2503.01143 | null |
2025-03-03 | ACCORD: Alleviating Concept Coupling through Dependence Regularization for Text-to-Image Diffusion Personalization | Shizhan Liu et.al. | 2503.01122 | null |
2025-03-03 | VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors | Juil Koo et.al. | 2503.01107 | null |
2025-03-03 | Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator | Kaiwen Zheng et.al. | 2503.01103 | null |
2025-03-03 | Tackling Hallucination from Conditional Models for Medical Image Reconstruction with DynamicDPS | Seunghoi Kim et.al. | 2503.01075 | null |
2025-03-02 | Data Unlearning in Diffusion Models | Silas Alberti et.al. | 2503.01034 | link |
2025-03-06 | MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations | Ziyang Zhang et.al. | 2503.01019 | null |
2025-03-02 | Generative Motion Infilling From Imprecisely Timed Keyframes | Purvi Goel et.al. | 2503.01016 | null |
2025-03-02 | Underdamped Diffusion Bridges with Applications to Sampling | Denis Blessing et.al. | 2503.01006 | link |
2025-03-02 | Molecule Generation for Target Protein Binding with Hierarchical Consistency Diffusion Model | Guanlue Li et.al. | 2503.00975 | link |
2025-03-02 | Using Synthetic Images to Augment Small Medical Image Datasets | Minh H. Vu et.al. | 2503.00962 | null |
2025-03-02 | Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models | Xingzhuo Guo et.al. | 2503.00951 | link |
2025-03-02 | Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think | Jie Tian et.al. | 2503.00948 | link |
2025-03-06 | A Simple and Effective Reinforcement Learning Method for Text-to-Image Diffusion Fine-tuning | Shashank Gupta et.al. | 2503.00897 | null |
2025-03-02 | Evaluating and Predicting Distorted Human Body Parts for Generated Images | Lu Ma et.al. | 2503.00811 | link |
2025-03-02 | Expandora: Broadening Design Exploration with Text-to-Image Model | DaEun Choi et.al. | 2503.00791 | null |
2025-03-02 | Geodesic Diffusion Models for Medical Image-to-Image Generation | Teng Zhang et.al. | 2503.00745 | link |
2025-03-06 | LesionDiffusion: Towards Text-controlled General Lesion Synthesis | Henrui Tian et.al. | 2503.00741 | link |
2025-03-02 | FaceShot: Bring Any Character into Life | Junyao Gao et.al. | 2503.00740 | null |
2025-03-02 | Enhancing Monocular 3D Scene Completion with Diffusion Model | Changlin Song et.al. | 2503.00726 | link |
2025-03-01 | Development of an Unpaired Deep Neural Network for Synthesizing X-ray Fluoroscopic Images from Digitally Reconstructed Tomography in Image Guided Radiotherapy | Chisako Hayashi et.al. | 2503.00665 | null |
2025-03-01 | SeisDiff-deno: A Diffusion-Based Denoising Framework for Tube Wave Attenuation in VSP Data | Donglin Zhu et.al. | 2503.00637 | null |
2025-03-01 | SolidMark: Evaluating Image Memorization in Generative Models | Nicky Kriplani et.al. | 2503.00592 | link |
2025-03-01 | What Makes a Good Diffusion Planner for Decision Making? | Haofei Lu et.al. | 2503.00535 | null |
2025-03-01 | End-To-End Learning of Gaussian Mixture Priors for Diffusion Sampler | Denis Blessing et.al. | 2503.00524 | null |
2025-03-01 | Periodic Materials Generation using Text-Guided Joint Diffusion Model | Kishalay Das et.al. | 2503.00522 | link |
2025-03-01 | HGDiffuser: Efficient Task-Oriented Grasp Generation via Human-Guided Grasp Diffusion Models | Dehao Huang et.al. | 2503.00508 | null |
2025-03-01 | Bayesian Inference for Non-Synchronously Observed Diffusions | Ajay Jasra et.al. | 2503.00465 | null |
2025-03-01 | Taming Large Multimodal Agents for Ultra-low Bitrate Semantically Disentangled Image Compression | Juan Song et.al. | 2503.00399 | null |
2025-03-01 | Remasking Discrete Diffusion Models with Inference-Time Scaling | Guanghan Wang et.al. | 2503.00307 | null |
2025-03-10 | Learning to Animate Images from A Few Videos to Portray Delicate Human Actions | Haoxin Li et.al. | 2503.00276 | null |
2025-03-01 | Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality | Milad Yazdani et.al. | 2503.00266 | link |
2025-03-04 | Unified Video Action Model | Shuang Li et.al. | 2503.00200 | null |
2025-02-28 | PRISM: High-Resolution & Precise Counterfactual Medical Image Generation using Language-guided Stable Diffusion | Amar Kumar et.al. | 2503.00196 | null |
2025-02-28 | ProDapt: Proprioceptive Adaptation using Long-term Memory Diffusion | Federico Pizarro Bejarano et.al. | 2503.00193 | link |
2025-02-28 | InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation | Chong Zhang et.al. | 2503.00084 | link |
2025-02-26 | Improved YOLOv12 with LLM-Generated Synthetic Data for Enhanced Apple Detection and Benchmarking Against YOLOv11 and YOLOv10 | Ranjan Sapkota et.al. | 2503.00057 | null |
2025-02-26 | Glad: A Streaming Scene Generator for Autonomous Driving | Bin Xie et.al. | 2503.00045 | null |
2025-02-23 | A Systematic Review of Open Datasets Used in Text-to-Image (T2I) Gen AI Model Safety | Rakeen Rouf et.al. | 2503.00020 | null |
2025-02-28 | How far can we go with ImageNet for Text-to-Image generation? | L. Degeorge et.al. | 2502.21318 | null |
2025-03-07 | Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos | Zhiyu Tan et.al. | 2502.21314 | null |
2025-03-03 | MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing | Xueyun Tian et.al. | 2502.21291 | link |
2025-02-28 | Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion | Kulin Shah et.al. | 2502.21278 | null |
2025-03-10 | A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images | Zineb Sordo et.al. | 2502.21151 | null |
2025-02-28 | Training-free and Adaptive Sparse Attention for Efficient Long Video Generation | Yifei Xia et.al. | 2502.21079 | null |
2025-02-28 | Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport | Jingru Fu et.al. | 2502.21049 | link |
2025-02-28 | Generative Uncertainty in Diffusion Models | Metod Jazbec et.al. | 2502.20946 | null |
2025-02-28 | DiffBrush:Just Painting the Art by Your Hands | Jiaming Chu et.al. | 2502.20904 | null |
2025-02-28 | HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models | Xiao Wang et.al. | 2502.20811 | null |
2025-02-28 | CADDreamer: CAD object Generation from Single-view Images | Yuan Li et.al. | 2502.20732 | null |
2025-02-28 | WorldModelBench: Judging Video Generation Models As World Models | Dacheng Li et.al. | 2502.20694 | null |
2025-02-28 | Diffusion Restoration Adapter for Real-World Image Restoration | Hanbang Liang et.al. | 2502.20679 | null |
2025-02-28 | Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA | Ojonugwa Oluwafemi Ejiga Peter et.al. | 2502.20667 | null |
2025-02-28 | Wavelet-based density sketching with functional hierarchical tensor | Xun Tang et.al. | 2502.20655 | null |
2025-02-28 | Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion Models | Yu Pan et.al. | 2502.20650 | link |
2025-02-28 | T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting | Yifei Qian et.al. | 2502.20625 | null |
2025-02-28 | SafeText: Safe Text-to-image Models via Aligning the Text Encoder | Yuepeng Hu et.al. | 2502.20623 | null |
2025-03-04 | Unifying Model Predictive Path Integral Control, Reinforcement Learning, and Diffusion Models for Optimal Control and Planning | Yankai Li et.al. | 2502.20476 | null |
2025-02-27 | Broken Letters, Broken Narratives: A Case Study on Arabic Script in DALL-E 3 | Arshia Sobhan et.al. | 2502.20459 | null |
2025-02-27 | Tight Inversion: Image-Conditioned Inversion for Real Image Editing | Edo Kadosh et.al. | 2502.20376 | null |
2025-02-27 | Constrained Generative Modeling with Manually Bridged Diffusion Models | Saeid Naderiparizi et.al. | 2502.20371 | null |
2025-02-27 | FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction | Siyu Jiao et.al. | 2502.20313 | link |
2025-02-27 | Mobius: Text to Seamless Looping Video Generation via Latent Shift | Xiuli Bi et.al. | 2502.20307 | link |
2025-02-27 | Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions | Palawat Busaranuvong et.al. | 2502.20277 | null |
2025-02-27 | Attention Distillation: A Unified Approach to Visual Characteristics Transfer | Yang Zhou et.al. | 2502.20235 | link |
2025-02-27 | Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think | Liang Chen et.al. | 2502.20172 | link |
2025-02-27 | FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute | Sotiris Anagnostidis et.al. | 2502.20126 | null |
2025-02-27 | Scalability of the second-order reliability method for stochastic differential equations with multiplicative noise | Timo Schorlepp et.al. | 2502.20114 | null |
2025-02-28 | New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration | Xuzheng Yang et.al. | 2502.20104 | null |
2025-02-27 | Generative augmentations for improved cardiac ultrasound segmentation using diffusion models | Gilles Van De Vyver et.al. | 2502.20100 | link |
2025-02-27 | Image Referenced Sketch Colorization Based on Animation Creation Workflow | Dingkun Yan et.al. | 2502.19937 | link |
2025-02-27 | DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models | Weihao wu et.al. | 2502.19924 | null |
2025-02-27 | High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model | Mingtao Guo et.al. | 2502.19894 | link |
2025-02-27 | C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation | Yuhao Li et.al. | 2502.19868 | link |
2025-02-27 | One-for-More: Continual Diffusion Model for Anomaly Detection | Xiaofan Li et.al. | 2502.19848 | link |
2025-02-28 | CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation | Reza Abbasi et.al. | 2502.19842 | link |
2025-02-27 | Analyzing CLIP’s Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study | Reza Abbasi et.al. | 2502.19828 | null |
2025-02-27 | Implicit Search via Discrete Diffusion: A Study on Chess | Jiacheng Ye et.al. | 2502.19805 | link |
2025-02-27 | UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition | Xiao Lin et.al. | 2502.19803 | link |
2025-02-27 | MFSR: Multi-fractal Feature for Super-resolution Reconstruction with Fine Details Recovery | Lianping Yang et.al. | 2502.19797 | null |
2025-02-27 | The erasure of intensive livestock farming in text-to-image generative AI | Kehan Sheng et.al. | 2502.19771 | link |
2025-03-04 | Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network | Xingyu Qiu et.al. | 2502.19754 | link |
2025-02-27 | Recent Advances on Generalizable Diffusion-generated Image Detection | Qijie Xu et.al. | 2502.19716 | link |
2025-02-27 | SAP-DIFF: Semantic Adversarial Patch Generation for Black-Box Face Recognition Models via Diffusion Models | Mingsi Wang et.al. | 2502.19710 | null |
2025-03-04 | Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model | Yimin Zhu et.al. | 2502.19700 | null |
2025-02-27 | BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance | Xin Ye et.al. | 2502.19694 | null |
2025-02-27 | SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization | Shubhankar Borse et.al. | 2502.19673 | null |
2025-02-26 | 3D Nephrographic Image Synthesis in CT Urography with the Diffusion Model and Swin Transformer | Hongkun Yu et.al. | 2502.19623 | null |
2025-02-26 | Diffusion-based Planning with Learned Viability Filters | Nicholas Ioannidis et.al. | 2502.19564 | null |
2025-02-26 | On the Interpolation Effect of Score Smoothing | Zhengdao Chen et.al. | 2502.19499 | null |
2025-02-26 | FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion mode | Lingzhou Mu et.al. | 2502.19455 | null |
2025-03-03 | TransVDM: Motion-Constrained Video Diffusion Model for Transparent Video Synthesis | Menghao Li et.al. | 2502.19454 | null |
2025-02-27 | On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | Ruben T. Lucassen et.al. | 2502.19285 | null |
2025-02-26 | HDM: Hybrid Diffusion Model for Unified Image Anomaly Detection | Zekang Weng et.al. | 2502.19200 | null |
2025-02-27 | RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images | Yuhan Tang et.al. | 2502.19153 | null |
2025-02-26 | Modulation of the galactic cosmic ray spectrum in an anisotropic diffusion approach | V. D. Borisov et.al. | 2502.19062 | null |
2025-03-02 | A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models | Vu Tuan Truong et.al. | 2502.19047 | null |
2025-02-26 | DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model | Lei Zhao et.al. | 2502.18952 | null |
2025-02-26 | Physics-Aware Inverse Design for Nanowire Single-Photon Avalanche Detectors via Deep Learning | Boyang Zhang et.al. | 2502.18857 | null |
2025-02-26 | Reimagining Personal Data: Unlocking the Potential of AI-Generated Images in Personal Data Meaning-Making | Soobin Park et.al. | 2502.18853 | null |
2025-02-26 | Optimal Stochastic Trace Estimation in Generative Modeling | Xinyang Liu et.al. | 2502.18808 | null |
2025-02-26 | Ptychographic Image Reconstruction from Limited Data via Score-Based Diffusion Models with Physics-Guidance | Refik Mert Cam et.al. | 2502.18767 | null |
2025-02-26 | AI-Instruments: Embodying Prompts as Instruments to Abstract & Reflect Graphical Interface Commands as General-Purpose Tools | Nathalie Riche et.al. | 2502.18736 | null |
2025-02-25 | Adaptive conditional latent diffusion maps beam loss to 2D phase space projections | Alexander Scheinker et.al. | 2502.18684 | null |
2025-02-25 | Diffusion Models for conditional MRI generation | Miguel Herencia García del Castillo et.al. | 2502.18620 | null |
2025-02-25 | Investigating Youth AI Auditing | Jaemarie Solyst et.al. | 2502.18576 | null |
2025-03-02 | K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs | Ziheng Ouyang et.al. | 2502.18461 | null |
2025-02-25 | ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies | Pedro Sequeira et.al. | 2502.18438 | null |
2025-02-25 | ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation | Yifan Pu et.al. | 2502.18364 | null |
2025-02-25 | LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation | Pengzhi Li et.al. | 2502.18302 | null |
2025-02-25 | Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training | Botao Ye et.al. | 2502.18219 | null |
2025-02-25 | Training Consistency Models with Variational Noise Coupling | Gianluigi Silvestri et.al. | 2502.18197 | link |
2025-02-25 | Multi-Perspective Data Augmentation for Few-shot Object Detection | Anh-Khoa Nguyen Vu et.al. | 2502.18195 | link |
2025-03-08 | Realistic Clothed Human and Object Joint Reconstruction from a Single Image | Ayushi Dutta et.al. | 2502.18150 | null |
2025-02-25 | SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference | Jintao Zhang et.al. | 2502.18137 | link |
2025-02-26 | Bayesian Optimization for Controlled Image Editing via LLMs | Chengkun Cai et.al. | 2502.18116 | null |
2025-02-25 | PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching | Han Nie et.al. | 2502.18104 | link |
2025-02-25 | Robust Polyp Detection and Diagnosis through Compositional Prompt-Guided Diffusion Models | Jia Yu et.al. | 2502.17951 | null |
2025-02-25 | 3D Anatomical Structure-guided Deep Learning for Accurate Diffusion Microstructure Imaging | Xinrui Ma et.al. | 2502.17933 | null |
2025-02-25 | Structure-prior Informed Diffusion Model for Graph Source Localization with Limited Data | Hongyi Chen et.al. | 2502.17928 | null |
2025-02-25 | VVRec: Reconstruction Attacks on DL-based Volumetric Video Upstreaming via Latent Diffusion Model with Gamma Distribution | Rui Lu et.al. | 2502.17880 | null |
2025-02-25 | ASurvey: Spatiotemporal Consistency in Video Generation | Zhiyu Yin et.al. | 2502.17863 | null |
2025-02-25 | HRR: Hierarchical Retrospection Refinement for Generated Image Detection | Peipei Yuan et.al. | 2502.17862 | null |
2025-02-25 | Synthia: Novel Concept Design with Affordance Composition | Xiaomeng Jin et.al. | 2502.17793 | link |
2025-02-25 | FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks | Tanawan Premsri et.al. | 2502.17775 | link |
2025-02-24 | Aligning Compound AI Systems via System-level DPO | Xiangwen Wang et.al. | 2502.17721 | null |
2025-02-24 | Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures | Akhila Yerukola et.al. | 2502.17710 | link |
2025-02-26 | Learning Decentralized Swarms Using Rotation Equivariant Graph Neural Networks | Taos Transue et.al. | 2502.17612 | null |
2025-02-24 | On the Vulnerability of Concept Erasure in Diffusion Models | Lucas Beerens et.al. | 2502.17537 | link |
2025-02-22 | A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models | Zihao Lin et.al. | 2502.17516 | null |
2025-02-25 | Fractal Generative Models | Tianhong Li et.al. | 2502.17437 | link |
2025-02-24 | GCC: Generative Color Constancy via Diffusing a Color Checker | Chen-Wei Chang et.al. | 2502.17435 | null |
2025-02-24 | S4S: Solving for a Diffusion Model Solver | Eric Frankel et.al. | 2502.17423 | null |
2025-02-24 | X-Dancer: Expressive Music to Human Dance Video Generation | Zeyuan Chen et.al. | 2502.17414 | null |
2025-02-24 | RELICT: A Replica Detection Framework for Medical Image Generation | Orhun Utku Aydin et.al. | 2502.17360 | link |
2025-02-24 | Goal-Oriented Middleware Filtering at Transport Layer Based on Value of Updates | Polina Kutsevol et.al. | 2502.17350 | null |
2025-02-24 | AnyTop: Character Animation Diffusion with Any Topology | Inbar Gat et.al. | 2502.17327 | link |
2025-02-24 | VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing | Xiangpeng Yang et.al. | 2502.17258 | null |
2025-02-24 | Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation | Baptiste Chopin et.al. | 2502.17198 | null |
2025-02-25 | DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks | Canyu Zhao et.al. | 2502.17157 | link |
2025-02-24 | Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions | Zhong Li et.al. | 2502.17119 | link |
2025-02-24 | SFLD: Reducing the content bias for AI-generated Image Detection | Seoyeon Gye et.al. | 2502.17105 | null |
2025-02-25 | Generative Models in Decision Making: A Survey | Yinchuan Li et.al. | 2502.17100 | null |
2025-02-24 | Conditional Diffusion-Flow models for generating 3D cosmic density fields: applications to f(R) cosmologies | Julieth Katherine Riveros et.al. | 2502.17087 | null |
2025-02-24 | SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations | Wendi Liu et.al. | 2502.17056 | null |
2025-02-24 | Distributional Vision-Language Alignment by Cauchy-Schwarz Divergence | Wenzhe Yin et.al. | 2502.17028 | null |
2025-02-24 | TraFlow: Trajectory Distillation on Pre-Trained Rectified Flow | Zhangkai Wu et.al. | 2502.16972 | null |
2025-02-26 | Autoregressive Image Generation Guided by Chains of Thought | Miaomiao Cai et.al. | 2502.16965 | null |
2025-02-24 | MAD-AD: Masked Diffusion for Unsupervised Brain Anomaly Detection | Farzad Beizaee et.al. | 2502.16943 | link |
2025-02-24 | Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model | Kang Fu et.al. | 2502.16915 | null |
2025-02-24 | Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinment | Suchae Jeong et.al. | 2502.16902 | null |
2025-02-24 | Mitigating Hallucinations in Diffusion Models through Adaptive Attention Modulation | Trevine Oorloff et.al. | 2502.16872 | null |
2025-02-24 | A Survey of fMRI to Image Reconstruction | Weiyu Guo et.al. | 2502.16861 | null |
2025-02-24 | Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization | Taeyoung Yun et.al. | 2502.16824 | link |
2025-02-24 | Fast, Accurate Manifold Denoising by Tunneling Riemannian Optimization | Shiyu Wang et.al. | 2502.16819 | null |
2025-02-24 | DiffKAN-Inpainting: KAN-based Diffusion model for brain tumor inpainting | Tianli Tao et.al. | 2502.16771 | null |
2025-02-23 | DOSE3 : Diffusion-based Out-of-distribution detection on SE(3) trajectories | Hongzhe Cheng et.al. | 2502.16725 | null |
2025-02-23 | Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries | Yin Wu et.al. | 2502.16636 | link |
2025-02-23 | AdverX-Ray: Ensuring X-Ray Integrity Through Frequency-Sensitive Adversarial VAEs | Francisco Caetano et.al. | 2502.16610 | link |
2025-02-23 | Human2Robot: Learning Robot Actions from Paired Human-Robot Videos | Sicheng Xie et.al. | 2502.16587 | null |
2025-02-23 | Rebalancing the Scales: A Systematic Mapping Study of Generative Adversarial Networks (GANs) in Addressing Data Imbalance | Pankaj Yadav et.al. | 2502.16535 | null |
2025-02-23 | Dragen3D: Multiview Geometry Consistent 3D Gaussian Generation with Drag-Based Control | Jinbo Yan et.al. | 2502.16475 | null |
2025-03-06 | Iterative Flow Matching – Path Correction and Gradual Refinement for Enhanced Generative Modeling | Eldad Haber et.al. | 2502.16445 | null |
2025-02-23 | Unified Prompt Attack Against Text-to-Image Generation Models | Duo Peng et.al. | 2502.16423 | null |
2025-02-23 | High-resolution Rainy Image Synthesis: Learning from Rendering | Kaibin Zhou et.al. | 2502.16421 | link |
2025-03-08 | Concept Corrector: Erase concepts on the fly for text-to-image diffusion models | Zheling Meng et.al. | 2502.16368 | null |
2025-02-22 | Ultra fast, event-by-event heavy-ion simulations for next generation experiments | Manjunath Omana Kuttan et.al. | 2502.16330 | null |
2025-02-22 | DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation | Yuxuan Xiong et.al. | 2502.16302 | null |
2025-02-22 | PersGuard: Preventing Malicious Personalization via Backdoor Attacks on Pre-trained Text-to-Image Diffusion Models | Xinwei Liu et.al. | 2502.16167 | null |
2025-02-22 | USegMix: Unsupervised Segment Mix for Efficient Data Augmentation in Pathology Images | Jiamu Wang et.al. | 2502.16160 | null |
2025-02-22 | Creative Blends of Visual Concepts | Zhida Sun et.al. | 2502.16062 | null |
2025-02-21 | Mean-Shift Distillation for Diffusion Mode Seeking | Vikas Thamizharasan et.al. | 2502.15989 | null |
2025-02-21 | Multi-Agent Multimodal Models for Multicultural Text to Image Generation | Parth Bhalerao et.al. | 2502.15972 | link |
2025-02-21 | Human Motion Prediction, Reconstruction, and Generation | Canxuan Gang et.al. | 2502.15956 | null |
2025-02-21 | RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers | Min Zhao et.al. | 2502.15894 | null |
2025-02-21 | ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval | Guanqi Zhan et.al. | 2502.15682 | null |
2025-02-21 | One-step Diffusion Models with $f$ -Divergence Distribution Matching | Yilun Xu et.al. | 2502.15681 | null |
2025-02-21 | VaViM and VaVAM: Autonomous Driving through Video Generative Modeling | Florent Bartoccioni et.al. | 2502.15672 | link |
2025-02-21 | Modeling Infectious Diseases: From SIR Models to Diffusion-Based Approaches and Numerical Solutions | Ayesha Baig et.al. | 2502.15439 | null |
2025-02-21 | Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks | Tianyou Jiang et.al. | 2502.15286 | null |
2025-02-21 | BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization | Tonghan Wang et.al. | 2502.15283 | null |
2025-02-21 | CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models | Shunchang Liu et.al. | 2502.15278 | null |
2025-03-01 | Unsettling the Hegemony of Intention: Agonistic Image Generation | Andrew Shaw et.al. | 2502.15242 | null |
2025-02-21 | Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis | Yifan Jiang et.al. | 2502.15204 | link |
2025-02-21 | FlipConcept: Tuning-Free Multi-Concept Personalization for Text-to-Image Generation | Young Beom Woo et.al. | 2502.15203 | null |
2025-02-21 | Methods and Trends in Detecting Generated Images: A Comprehensive Review | Arpan Mahara et.al. | 2502.15176 | null |
2025-02-20 | Hardware-Friendly Static Quantization Method for Video Diffusion Transformers | Sanghyun Yi et.al. | 2502.15077 | null |
2025-02-20 | Pseudoinverse Diffusion Models for Generative CT Image Reconstruction from Low Dose Data | Matthew Tivnan et.al. | 2502.15064 | null |
2025-02-20 | FIP: Endowing Robust Motion Capture on Daily Garment by Fusing Flex and Inertial Sensors | Jiawei Fang et.al. | 2502.15058 | null |
2025-02-20 | Generative Super-Resolution PET Imaging with Fourier Diffusion Models | Matthew Tivnan et.al. | 2502.15055 | null |
2025-02-20 | DDAT: Diffusion Policies Enforcing Dynamically Admissible Robot Trajectories | Jean-Baptiste Bouvier et.al. | 2502.15043 | null |
2025-02-20 | Generative Modeling of Individual Behavior at Scale | Nabil Omi et.al. | 2502.14998 | null |
2025-02-20 | LAVID: An Agentic LVLM Framework for Diffusion-Generated Video Detection | Qingyuan Liu et.al. | 2502.14994 | null |
2025-02-20 | Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design | Masatoshi Uehara et.al. | 2502.14944 | link |
2025-02-20 | FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models | Thomas Froech et.al. | 2502.14940 | link |
2025-02-17 | A Comprehensive Survey on Concept Erasure in Text-to-Image Diffusion Models | Changhoon Kim et.al. | 2502.14896 | null |
2025-02-17 | CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Zhe Huang et.al. | 2502.14891 | link |
2025-02-16 | The Multi-Faceted Monosemanticity in Multimodal Representations | Hanqi Yan et.al. | 2502.14888 | null |
2025-02-16 | Vision-Enhanced Time Series Forecasting via Latent Diffusion Models | Weilin Ruan et.al. | 2502.14887 | null |
2025-02-20 | Dynamic Concepts Personalization from Single Videos | Rameen Abdal et.al. | 2502.14844 | null |
2025-02-20 | Improving the Diffusability of Autoencoders | Ivan Skorokhodov et.al. | 2502.14831 | null |
2025-03-04 | AVD2: Accident Video Diffusion for Accident Video Description | Cheng Li et.al. | 2502.14801 | null |
2025-02-20 | A Survey on Text-Driven 360-Degree Panorama Generation | Hai Wang et.al. | 2502.14799 | null |
2025-02-20 | DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models | Hongji Yang et.al. | 2502.14779 | null |
2025-02-20 | AIdeation: Designing a Human-AI Collaborative Ideation System for Concept Designers | Wen-Fan Wang et.al. | 2502.14747 | null |
2025-02-25 | Sentence Smith: Formally Controllable Text Transformation and its Application to Evaluation of Text Embedding Models | Hongji Li et.al. | 2502.14734 | null |
2025-02-28 | RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers | Ke Cao et.al. | 2502.14377 | null |
2025-02-20 | Textured 3D Regenerative Morphing with 3D Diffusion Prior | Songlin Yang et.al. | 2502.14316 | null |
2025-02-20 | On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective | Yue Huang et.al. | 2502.14296 | null |
2025-02-20 | Vulnerability of Text-to-Image Models to Prompt Template Stealing: A Differential Evolution Approach | Yurong Wu et.al. | 2502.14285 | null |
2025-02-21 | Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture Generation | Jiayu Yang et.al. | 2502.14247 | link |
2025-02-20 | Designing Parameter and Compute Efficient Diffusion Transformers using Distillation | Vignesh Sundaresha et.al. | 2502.14226 | null |
2025-02-20 | Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition | Tianyi Shang et.al. | 2502.14195 | link |
2025-02-19 | DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models | Daewon Chae et.al. | 2502.14070 | null |
2025-02-19 | d-Sketch: Improving Visual Fidelity of Sketch-to-Image Translation with Pretrained Latent Diffusion Models without Retraining | Prasun Roy et.al. | 2502.14007 | link |
2025-02-19 | Im2SurfTex: Surface Texture Generation via Neural Backprojection of Multi-View Images | Yiangos Georgiou et.al. | 2502.14006 | null |
2025-02-19 | DP-Adapter: Dual-Pathway Adapter for Boosting Fidelity and Text Consistency in Customizable Human Image Generation | Ye Wang et.al. | 2502.13999 | null |
2025-02-19 | SigStyle: Signature Style Transfer via Personalized Text-to-Image Models | Ye Wang et.al. | 2502.13997 | null |
2025-02-19 | FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation | Yunpeng Zhang et.al. | 2502.13995 | link |
2025-02-19 | Generative Detail Enhancement for Physically Based Materials | Saeed Hadadan et.al. | 2502.13994 | null |
2025-02-19 | Erasing with Precision: Evaluating Specific Concept Erasure from Text-to-Image Generative Models | Masane Fuchi et.al. | 2502.13989 | link |
2025-02-19 | SelfAge: Personalized Facial Age Transformation Using Self-reference Images | Taishi Ito et.al. | 2502.13987 | link |
2025-02-19 | FlexTok: Resampling Images into 1D Token Sequences of Flexible Length | Roman Bachmann et.al. | 2502.13967 | null |
2025-02-19 | IP-Composer: Semantic Composition of Visual Concepts | Sara Dorfman et.al. | 2502.13951 | null |
2025-02-19 | TESS 2: A Large-Scale Generalist Diffusion Language Model | Jaesung Tae et.al. | 2502.13917 | link |
2025-02-19 | MagicGeo: Training-Free Text-Guided Geometric Diagram Generation | Junxiao Wang et.al. | 2502.13855 | null |
2025-02-19 | Reverse Markov Learning: Multi-Step Generative Models for Complex Distributions | Xinwei Shen et.al. | 2502.13747 | null |
2025-02-19 | RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior | Ching-Hua Lee et.al. | 2502.13574 | null |
2025-02-19 | Diffusion Model Agnostic Social Influence Maximization in Hyperbolic Space | Hongliang Qiao et.al. | 2502.13571 | null |
2025-02-19 | Interleaved Gibbs Diffusion for Constrained Generation | Gautham Govind Anil et.al. | 2502.13450 | null |
2025-02-19 | Generative Predictive Control: Flow Matching Policies for Dynamic and Difficult-to-Demonstrate Tasks | Vince Kurtz et.al. | 2502.13406 | null |
2025-02-19 | Flow-based generative models as iterative algorithms in probability space | Yao Xie et.al. | 2502.13394 | null |
2025-02-18 | Secure and Efficient Watermarking for Latent Diffusion Models in Model Distribution Scenarios | Liangqi Lei et.al. | 2502.13345 | null |
2025-02-18 | Geometry-Aware Diffusion Models for Multiview Scene Inpainting | Ahmad Salimi et.al. | 2502.13335 | null |
2025-02-18 | Breaking the bonds of generative artificial intelligence by minimizing the maximum entropy | Mattia Miotto et.al. | 2502.13287 | null |
2025-02-18 | MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching | Yen-Siang Wu et.al. | 2502.13234 | null |
2025-02-18 | Fundus2Globe: Generative AI-Driven 3D Digital Twins for Personalized Myopia Management | Danli Shi et.al. | 2502.13182 | null |
2025-02-18 | Is Noise Conditioning Necessary for Denoising Generative Models? | Qiao Sun et.al. | 2502.13129 | null |
2025-02-18 | Score Matching Riemannian Diffusion Means | Frederik Möbius Rygaard et.al. | 2502.13106 | null |
2025-02-18 | Personalized Image Generation with Deep Generative Models: A Decade Survey | Yuxiang Wei et.al. | 2502.13081 | link |
2025-02-18 | Does Training with Synthetic Data Truly Protect Privacy? | Yunpeng Zhao et.al. | 2502.12976 | link |
2025-02-18 | Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression | Jaemoon Lee et.al. | 2502.12951 | null |
2025-02-19 | LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation | Junchen Fu et.al. | 2502.12945 | null |
2025-02-18 | Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options | Lakshmi Nair et.al. | 2502.12929 | link |
2025-02-18 | RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models | Tanqiu Jiang et.al. | 2502.12794 | link |
2025-02-18 | Composition and Control with Distilled Energy Diffusion Models and Sequential Monte Carlo | James Thornton et.al. | 2502.12786 | null |
2025-02-18 | VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation | Xinlong Chen et.al. | 2502.12782 | null |
2025-02-18 | High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion | Xiang Zhang et.al. | 2502.12752 | null |
2025-02-18 | 3D Shape-to-Image Brownian Bridge Diffusion for Brain MRI Synthesis from Cortical Surfaces | Fabian Bongratz et.al. | 2502.12742 | null |
2025-02-19 | Spherical Dense Text-to-Image Synthesis | Timon Winter et.al. | 2502.12691 | null |
2025-02-27 | NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation | Zhiyuan Liu et.al. | 2502.12638 | link |
2025-02-18 | MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation | Sihyun Yu et.al. | 2502.12632 | null |
2025-02-18 | Generative AI Enabled Robust Data Augmentation for Wireless Sensing in ISAC Networks | Jiacheng Wang et.al. | 2502.12622 | null |
2025-02-18 | CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation | Minghao Fu et.al. | 2502.12579 | null |
2025-02-18 | DeltaDiff: A Residual-Guided Diffusion Model for Enhanced Image Super-Resolution | Chao Yang et.al. | 2502.12567 | null |
2025-02-18 | Comprehensive Assessment and Analysis for NSFW Content Erasure in Text-to-Image Diffusion Models | Die Chen et.al. | 2502.12527 | null |
2025-02-18 | Computational Safety for Generative AI: A Signal Processing Perspective | Pin-Yu Chen et.al. | 2502.12445 | null |
2025-02-18 | Reward-Safety Balance in Offline Safe RL via Diffusion Regularization | Junyu Guo et.al. | 2502.12391 | null |
2025-02-17 | UltraGen: Extremely Fine-grained Controllable Generation via Attribute Reconstruction and Global Preference Optimization | Longfei Yun et.al. | 2502.12375 | null |
2025-02-17 | Bayesian inference from time series of allele frequency data using exact simulation techniques | Jaromir Sant et.al. | 2502.12279 | null |
2025-02-16 | AI-Augmented Metamorphic Testing for Comprehensive Validation of Autonomous Vehicles | Tony Zhang et.al. | 2502.12208 | null |
2025-02-15 | Boosting Generalization in Diffusion-Based Neural Combinatorial Solver via Energy-guided Sampling | Haoyu Lei et.al. | 2502.12188 | null |
2025-02-14 | Direct Preference Optimization-Enhanced Multi-Guided Diffusion Model for Traffic Scenario Generation | Seungjun Yu et.al. | 2502.12178 | null |
2025-02-17 | Diffusion Models without Classifier-free Guidance | Zhicong Tang et.al. | 2502.12154 | link |
2025-02-17 | Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening | Ye Tian et.al. | 2502.12146 | link |
2025-02-21 | LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities | Florian Sestak et.al. | 2502.12128 | link |
2025-02-17 | Descriminative-Generative Custom Tokens for Vision-Language Models | Pramuditha Perera et.al. | 2502.12095 | null |
2025-02-17 | How compositional generalization and creativity improve as diffusion models are trained | Alessandro Favero et.al. | 2502.12089 | null |
2025-02-21 | HumanGif: Single-View Human Diffusion with Generative Prior | Shoukang Hu et.al. | 2502.12080 | link |
2025-02-18 | A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond | Shreya Shukla et.al. | 2502.12048 | null |
2025-02-17 | Planning minimum regret $CO_2$ pipeline networks | Stephan Bogs et.al. | 2502.12035 | null |
2025-02-17 | Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images | Negar Kamali et.al. | 2502.11989 | link |
2025-02-17 | Image Inversion: A Survey from GANs to Diffusion and Beyond | Yinan Chen et.al. | 2502.11974 | link |
2025-02-17 | GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs | Yi Fang et.al. | 2502.11925 | null |
2025-02-17 | Approximating a spatially-heterogeneously mass-emitting object by multiple point sources in a diffusion model | Qiyao Peng et.al. | 2502.11908 | null |
2025-02-17 | DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation | Zhihang Yuan et.al. | 2502.11897 | link |
2025-02-17 | BackdoorDM: A Comprehensive Benchmark for Backdoor Learning in Diffusion Model | Weilin Lin et.al. | 2502.11798 | link |
2025-02-17 | ILIAS: Instance-Level Image retrieval At Scale | Giorgos Kordopatis-Zilos et.al. | 2502.11748 | null |
2025-02-17 | MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow | Hanzhuo Huang et.al. | 2502.11697 | null |
2025-02-17 | Object-Centric Image to Video Generation with Language Guidance | Angel Villar-Corrales et.al. | 2502.11655 | null |
2025-02-17 | GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text | Gyumin Shim et.al. | 2502.11642 | null |
2025-02-17 | Membership Inference Attacks for Face Images Against Fine-Tuned Latent Diffusion Models | Lauritz Christian Holme et.al. | 2502.11619 | null |
2025-02-18 | Maximum Entropy Reinforcement Learning with Diffusion Policy | Xiaoyi Dong et.al. | 2502.11612 | link |
2025-02-17 | Continuous Diffusion Model for Language Modeling | Jaehyeong Jo et.al. | 2502.11564 | link |
2025-02-17 | Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation | Zexi Jia et.al. | 2502.11532 | null |
2025-02-17 | SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion | Junxian Ma et.al. | 2502.11515 | null |
2025-02-17 | Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation | Taeyoung Yun et.al. | 2502.11477 | link |
2025-02-17 | Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models | Yingqing Guo et.al. | 2502.11420 | null |
2025-02-17 | Inverse Flow and Consistency Models | Yuchen Zhang et.al. | 2502.11333 | null |
2025-02-17 | Deep Learning of Proteins with Local and Global Regions of Disorder | Oufan Zhang et.al. | 2502.11326 | link |
2025-02-16 | MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation | Michael Fuest et.al. | 2502.11234 | null |
2025-02-18 | AnyRefill: A Unified, Data-Efficient Framework for Left-Prompt-Guided Vision Tasks | Ming Xie et.al. | 2502.11158 | null |
2025-02-16 | Phantom: Subject-consistent video generation via cross-modal alignment | Lijie Liu et.al. | 2502.11079 | null |
2025-02-19 | Collaborative Deterministic-Diffusion Model for Probabilistic Urban Spatiotemporal Prediction | Zhi Sheng et.al. | 2502.11013 | null |
2025-02-16 | ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations | Bowen Jiang et.al. | 2502.10999 | link |
2025-02-16 | Skillful Nowcasting of Convective Clouds With a Cascade Diffusion Model | Haoming Chen et.al. | 2502.10957 | null |
2025-02-15 | SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers | Di Qiu et.al. | 2502.10841 | link |
2025-02-15 | PDA: Generalizable Detection of AI-Generated Images via Post-hoc Distribution Alignment | Li Wang et.al. | 2502.10803 | null |
2025-02-15 | Multi-objective Aerial IRS-assisted ISAC Optimization via Generative AI-enhanced Deep Reinforcement Learning | Wenwen Xie et.al. | 2502.10687 | null |
2025-02-15 | Hybrid Deepfake Image Detection: A Comprehensive Dataset-Driven Approach Integrating Convolutional and Attention Mechanisms with Frequency Domain Features | Kafi Anan et.al. | 2502.10682 | null |
2025-02-15 | REAL: Realism Evaluation of Text-to-Image Generation Models for Effective Data Augmentation | Ran Li et.al. | 2502.10663 | null |
2025-02-14 | HIPPo: Harnessing Image-to-3D Priors for Model-free Zero-shot 6D Pose Estimation | Yibo Liu et.al. | 2502.10606 | null |
2025-02-14 | EVODMs: variational learning of PDEs for stochastic systems via diffusion models with quantified epistemic uncertainty | Zequn He et.al. | 2502.10588 | null |
2025-02-14 | Classifier-free Guidance with Adaptive Scaling | Dawid Malarz et.al. | 2502.10574 | link |
2025-02-14 | SWA-LDM: Toward Stealthy Watermarks for Latent Diffusion Models | Zhonghao Yang et.al. | 2502.10495 | null |
2025-02-13 | Knowledge Integration Strategies in Autonomous Vehicle Prediction and Planning: A Comprehensive Survey | Kumar Manas et.al. | 2502.10477 | null |
2025-02-12 | Image Watermarking of Generative Diffusion Models | Yunzhuo Chen et.al. | 2502.10465 | null |
2025-02-12 | I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models | Zhenxing Mi et.al. | 2502.10458 | null |
2025-02-20 | FlexControl: Computation-Aware ControlNet with Differentiable Router for Text-to-Image Generation | Zheng Fang et.al. | 2502.10451 | null |
2025-02-11 | A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision | Hao Ai et.al. | 2502.10444 | null |
2025-02-14 | Region-Adaptive Sampling for Diffusion Transformers | Ziming Liu et.al. | 2502.10389 | null |
2025-02-14 | ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences | Liyuan Zhu et.al. | 2502.10377 | null |
2025-02-14 | Dimension-free Score Matching and Time Bootstrapping for Diffusion Models | Syamantak Kumar et.al. | 2502.10354 | null |
2025-02-14 | DiOpt: Self-supervised Diffusion for Constrained Optimization | Shutong Ding et.al. | 2502.10330 | null |
2025-02-14 | Generalised Parallel Tempering: Flexible Replica Exchange via Flows and Diffusions | Leo Zhang et.al. | 2502.10328 | null |
2025-02-14 | Dark Matter Attenuation Effects: Sensitivity Ceilings for Spin-Dependent and Spin-Independent Interactions | QUEST-DMC Collaboration et.al. | 2502.10251 | null |
2025-02-24 | Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model | Guoqing Ma et.al. | 2502.10248 | link |
2025-02-14 | Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control | Thomas Jiralerspong et.al. | 2502.10236 | null |
2025-02-14 | Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model | Bo Ni et.al. | 2502.10173 | null |
2025-02-14 | IRS-assisted Edge Computing for Vehicular Networks: A Generative Diffusion Model-based Stackelberg Game Approach | Yixian Wang et.al. | 2502.10149 | null |
2025-02-14 | RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control | Teng Li et.al. | 2502.10059 | null |
2025-02-14 | Diffusion Trajectory-guided Policy for Long-horizon Robot Manipulation | Shichao Fan et.al. | 2502.10040 | null |
2025-02-14 | ManiTrend: Bridging Future Generation and Action Prediction with 3D Flow for Robotic Manipulation | Yuxin He et.al. | 2502.10028 | null |
2025-02-18 | Large Language Diffusion Models | Shen Nie et.al. | 2502.09992 | null |
2025-02-14 | Generating on Generated: An Approach Towards Self-Evolving Diffusion Models | Xulu Zhang et.al. | 2502.09963 | null |
2025-02-14 | Precise Parameter Localization for Textual Generation in Diffusion Models | Łukasz Staniszewski et.al. | 2502.09935 | null |
2025-02-14 | Symmetry-Preserving Diffusion Models via Target Symmetrization | Vinh Tong et.al. | 2502.09890 | null |
2025-02-19 | Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal | Jinpei Guo et.al. | 2502.09873 | link |
2025-02-20 | DesignWeaver: Dimensional Scaffolding for Text-to-Image Product Design | Sirui Tao et.al. | 2502.09867 | null |
2025-02-13 | Noise Controlled CT Super-Resolution with Conditional Diffusion Model | Yuang Wang et.al. | 2502.09793 | null |
2025-02-13 | CellFlow: Simulating Cellular Morphology Changes via Flow Matching | Yuhui Zhang et.al. | 2502.09775 | null |
2025-02-13 | Non-Markovian Discrete Diffusion with Causal Language Models | Yangtian Zhang et.al. | 2502.09767 | null |
2025-02-12 | Revealing Subtle Phenotypes in Small Microscopy Datasets Using Latent Diffusion Models | Anis Bourou et.al. | 2502.09665 | null |
2025-02-12 | DiffEx: Explaining a Classifier with Diffusion Models to Identify Microscopic Cellular Variations | Anis Bourou et.al. | 2502.09663 | null |
2025-02-13 | Theoretical Benefit and Limitation of Diffusion Language Model | Guhao Feng et.al. | 2502.09622 | null |
2025-02-13 | RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets | Isabella Liu et.al. | 2502.09615 | null |
2025-02-13 | Designing a Conditional Prior Distribution for Flow-Based Generative Models | Noam Issachar et.al. | 2502.09611 | null |
2025-02-14 | Score-of-Mixture Training: Training One-Step Generative Models Made Simple via Score Estimation of Mixture Distributions | Tejas Jayashankar et.al. | 2502.09609 | null |
2025-02-13 | Rolling Ahead Diffusion for Traffic Scene Simulation | Yunpeng Liu et.al. | 2502.09587 | null |
2025-02-13 | Memorization and Generalization in Generative Diffusion under the Manifold Hypothesis | Beatrice Achilli et.al. | 2502.09578 | null |
2025-02-13 | DiffMS: Diffusion Generation of Molecules Conditioned on Mass Spectra | Montgomery Bohde et.al. | 2502.09571 | link |
2025-02-16 | Diffusing DeBias: a Recipe for Turning a Bug into a Feature | Massimiliano Ciranni et.al. | 2502.09564 | null |
2025-02-13 | Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model | Fei Shen et.al. | 2502.09533 | null |
2025-02-13 | Diffusion Models for Molecules: A Survey of Methods and Tasks | Liang Wang et.al. | 2502.09511 | link |
2025-02-13 | Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models | Xiaoliu Guan et.al. | 2502.09434 | link |
2025-02-13 | ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation | Rotem Shalev-Arkushin et.al. | 2502.09411 | null |
2025-02-13 | When the LM misunderstood the human chuckled: Analyzing garden path effects in humans and language models | Samuel Joseph Amouyal et.al. | 2502.09307 | null |
2025-02-13 | Non-asymptotic Analysis of Diffusion Annealed Langevin Monte Carlo for Generative Modelling | Paula Cordero-Encinar et.al. | 2502.09306 | null |
2025-02-25 | ConsistentDreamer: View-Consistent Meshes Through Balanced Multi-View Gaussian Optimization | Onat Şahin et.al. | 2502.09278 | null |
2025-02-14 | GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation | Hongyin Zhang et.al. | 2502.09268 | null |
2025-02-13 | Sequential Covariance Fitting for InSAR Phase Linking | Dana El Hajjar et.al. | 2502.09248 | null |
2025-02-13 | From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine | Lukas Buess et.al. | 2502.09242 | null |
2025-02-13 | E-MD3C: Taming Masked Diffusion Transformers for Efficient Zero-Shot Object Customization | Trung X. Pham et.al. | 2502.09164 | null |
2025-02-13 | Regularization can make diffusion models more efficient | Mahsa Taheri et.al. | 2502.09151 | null |
2025-02-13 | Exact Bayesian inference for Markov switching diffusions | Timothée Stumpf-Fétizon et.al. | 2502.09126 | null |
2025-02-13 | StyleBlend: Enhancing Style-Specific Content Creation in Text-to-Image Diffusion Models | Zichong Chen et.al. | 2502.09064 | link |
2025-02-13 | MTDP: Modulated Transformer Diffusion Policy Model | Qianhao Wang et.al. | 2502.09029 | null |
2025-02-13 | Dynamic watermarks in images generated by diffusion models | Yunzhuo Chen et.al. | 2502.08927 | null |
2025-02-13 | Detecting Malicious Concepts Without Image Generation in AIGC | Kun Xu et.al. | 2502.08921 | null |
2025-02-13 | Diffusion Models Through a Global Lens: Are They Culturally Inclusive? | Zahra Bayramli et.al. | 2502.08914 | null |
2025-02-12 | A Reversible Solver for Diffusion SDEs | Zander W. Blasingame et.al. | 2502.08834 | null |
2025-02-12 | DejAIvu: Identifying and Explaining AI Art on the Web in Real-Time with Saliency Maps | Jocelyn Dzuong et.al. | 2502.08821 | link |
2025-02-12 | A First-order Generative Bilevel Optimization Framework for Diffusion Models | Quan Xiao et.al. | 2502.08808 | null |
2025-02-12 | HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification | Valentina Vadori et.al. | 2502.08754 | link |
2025-02-17 | Scalable Discrete Diffusion Samplers: Combinatorial Optimization and Statistical Physics | Sebastian Sanokowski et.al. | 2502.08696 | null |
2025-02-12 | Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation | Hoigi Seo et.al. | 2502.08690 | null |
2025-02-12 | SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation | Ellie Arar et.al. | 2502.08642 | null |
2025-02-12 | CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation | Qinghe Wang et.al. | 2502.08639 | null |
2025-02-12 | Chasing Charge Carriers: Diffusion Dynamics in Mixed-n Quasi-Two-Dimensional Colloidal MAPbBr3 Perovskites | Ronja Maria Piehler et.al. | 2502.08601 | null |
2025-02-12 | Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio | Khaled Kahouli et.al. | 2502.08598 | link |
2025-02-12 | Light-A-Video: Training-free Video Relighting via Progressive Light Fusion | Yujie Zhou et.al. | 2502.08590 | link |
2025-02-12 | Ultrasound Image Generation using Latent Diffusion Models | Benoit Freiche et.al. | 2502.08580 | null |
2025-02-12 | Mapping the Landscape of Generative AI in Network Monitoring and Management | Giampaolo Bovenzi et.al. | 2502.08576 | null |
2025-02-12 | Statistically validated projection of bipartite signed networks | Anna Gallo et.al. | 2502.08567 | null |
2025-02-12 | BCDDM: Branch-Corrected Denoising Diffusion Model for Black Hole Image Generation | Ao liu et.al. | 2502.08528 | null |
2025-02-12 | One-Shot Federated Learning with Classifier-Free Diffusion Models | Obaidullah Zaland et.al. | 2502.08488 | null |
2025-02-12 | A Survey on Pre-Trained Diffusion Model Distillations | Xuhui Fan et.al. | 2502.08364 | null |
2025-02-12 | A posteriori error control for a finite volume scheme for a cross-diffusion model of ion transport | Arne Berrens et.al. | 2502.08306 | null |
2025-02-12 | BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video | Yu Hong et.al. | 2502.08297 | null |
2025-02-12 | FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis | Wonjoon Jin et.al. | 2502.08244 | null |
2025-02-12 | Learning Human Skill Generators at Key-Step Levels | Yilu Wu et.al. | 2502.08234 | null |
2025-02-12 | AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance | Zhao Wang et.al. | 2502.08189 | null |
2025-02-12 | DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias | Song Park et.al. | 2502.08167 | null |
2025-02-19 | PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation | Ziyan Wang et.al. | 2502.08106 | null |
2025-02-12 | ID-Cloak: Crafting Identity-Specific Cloaks Against Personalized Text-to-Image Generation | Qianrui Teng et.al. | 2502.08097 | null |
2025-02-12 | End-to-End Predictive Planner for Autonomous Driving with Consistency Models | Anjian Li et.al. | 2502.08033 | null |
2025-02-13 | Training-Free Safe Denoisers for Safe Use of Diffusion Models | Mingyu Kim et.al. | 2502.08011 | null |
2025-02-11 | Greed is Good: Guided Generation from a Greedy Perspective | Zander W. Blasingame et.al. | 2502.08006 | null |
2025-02-11 | Towards Training One-Step Diffusion Models Without Distillation | Mingtian Zhang et.al. | 2502.08005 | null |
2025-02-11 | SurGrID: Controllable Surgical Simulation via Scene Graph to Image Diffusion | Yannik Frisch et.al. | 2502.07945 | null |
2025-02-11 | Consistent Solutions of the Radiation Diffusion Equation in Spherical and Cylindrical Geometries | Ethan Smith et.al. | 2502.07930 | null |
2025-02-11 | TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation | Alex Jinpeng Wang et.al. | 2502.07870 | link |
2025-02-19 | MRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers | Ao Li et.al. | 2502.07856 | null |
2025-02-11 | Understanding Classifier-Free Guidance: High-Dimensional Theory and Non-Linear Generalizations | Krunoslav Lehman Pavasovic et.al. | 2502.07849 | null |
2025-02-11 | Spread them Apart: Towards Robust Watermarking of Generated Content | Mikhail Pautov et.al. | 2502.07845 | null |
2025-02-11 | TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation | Jeongyun Kim et.al. | 2502.07840 | link |
2025-02-10 | Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing | Sihao Wu et.al. | 2502.07829 | null |
2025-02-10 | Pre-Trained Video Generative Models as World Simulators | Haoran He et.al. | 2502.07825 | null |
2025-02-09 | Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution | Siwei Tu et.al. | 2502.07814 | null |
2025-02-11 | MatSwap: Light-aware material transfers in images | Ivan Lopes et.al. | 2502.07784 | null |
2025-02-11 | Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models | Stanislav Fort et.al. | 2502.07753 | null |
2025-02-11 | CausalGeD: Blending Causality and Diffusion for Spatial Gene Expression Generation | Rabeya Tus Sadia et.al. | 2502.07751 | null |
2025-02-12 | Next Block Prediction: Video Generation via Semi-Autoregressive Modeling | Shuhuai Ren et.al. | 2502.07737 | null |
2025-02-17 | Magic 1-For-1: Generating One Minute Video Clips within One Minute | Hongwei Yi et.al. | 2502.07701 | link |
2025-02-11 | Consistency Training with Physical Constraints | Che-Chia Chang et.al. | 2502.07636 | null |
2025-02-11 | Generative Modeling with Bayesian Sample Inference | Marten Lienen et.al. | 2502.07580 | link |
2025-02-11 | Single-Step Consistent Diffusion Samplers | Pascal Jutras-Dubé et.al. | 2502.07579 | null |
2025-02-11 | SketchFlex: Facilitating Spatial-Semantic Coherence in Text-to-Image Generation with Region-Based Sketches | Haichuan Lin et.al. | 2502.07556 | link |
2025-02-12 | VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation | Sixiao Zheng et.al. | 2502.07531 | null |
2025-02-14 | The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray Generation | Raman Dutt et.al. | 2502.07516 | link |
2025-02-13 | Enhance-A-Video: Better Generated Video for Free | Yang Luo et.al. | 2502.07508 | link |
2025-02-11 | Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models | Lin Zhu et.al. | 2502.07466 | link |
2025-02-11 | RusCode: Russian Cultural Code Benchmark for Text-to-Image Generation | Viacheslav Vasilev et.al. | 2502.07455 | link |
2025-02-11 | Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers | Zhaodong Bing et.al. | 2502.07436 | null |
2025-02-12 | Spatial Degradation-Aware and Temporal Consistent Diffusion Model for Compressed Video Super-Resolution | Hongyu An et.al. | 2502.07381 | null |
2025-02-11 | Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos | Haowen Gao et.al. | 2502.07327 | null |
2025-02-11 | Semantic to Structure: Learning Structural Representations for Infringement Detection | Chuanwei Huang et.al. | 2502.07323 | null |
2025-02-11 | Generation of Drug-Induced Cardiac Reactions towards Virtual Clinical Trials | Qian Shao et.al. | 2502.07297 | null |
2025-02-11 | Exploratory Diffusion Policy for Unsupervised Reinforcement Learning | Chengyang Ying et.al. | 2502.07279 | null |
2025-02-11 | Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization | Aditya Vora et.al. | 2502.07278 | null |
2025-02-11 | Exploring Active Data Selection Strategies for Continuous Training in Deepfake Detection | Yoshihiko Furuhashi et.al. | 2502.07269 | null |
2025-02-11 | Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement | Xueyao Zhang et.al. | 2502.07243 | null |
2025-02-11 | Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation | Pinxin Liu et.al. | 2502.07239 | null |
2025-02-11 | CAT: Contrastive Adversarial Training for Evaluating the Robustness of Protective Perturbations in Latent Diffusion Models | Sen Peng et.al. | 2502.07225 | link |
2025-02-11 | Improve the Training Efficiency of DRL for Wireless Communication Resource Allocation: The Role of Generative Diffusion Models | Xinren Zhang et.al. | 2502.07211 | null |
2025-02-11 | Monte Carlo Tree Diffusion for System 2 Planning | Jaesik Yoon et.al. | 2502.07202 | null |
2025-02-19 | HDCompression: Hybrid-Diffusion Image Compression for Ultra-Low Bitrates | Lei Lu et.al. | 2502.07160 | null |
2025-02-10 | Lotus: Creating Short Videos From Long Videos With Abstractive and Extractive Summarization | Aadit Barua et.al. | 2502.07096 | null |
2025-02-10 | Generative Distribution Prediction: A Unified Approach to Multimodal Learning | Xinyu Tian et.al. | 2502.07090 | null |
2025-02-12 | TRADES: Generating Realistic Market Simulations with Diffusion Models | Leonardo Berti et.al. | 2502.07071 | link |
2025-02-10 | From Image to Video: An Empirical Study of Diffusion Representations | Pedro Vélez et.al. | 2502.07001 | null |
2025-02-10 | Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models | Siddarth Venkatraman et.al. | 2502.06999 | null |
2025-02-19 | Conditional diffusion model with spatial attention and latent embedding for medical image segmentation | Behzad Hejrati et.al. | 2502.06997 | link |
2025-02-10 | Model Diffusion for Certifiable Few-shot Transfer Learning | Fady Rezk et.al. | 2502.06970 | null |
2025-02-10 | GAS: Generative Avatar Synthesis from a Single Image | Yixing Lu et.al. | 2502.06957 | null |
2025-02-09 | Enabling Autoregressive Models to Fill In Masked Tokens | Daniel Israel et.al. | 2502.06901 | null |
2025-02-09 | PyPotteryInk: One-Step Diffusion Model for Sketch to Publication-ready Archaeological Drawings | Lorenzo Cardarelli et.al. | 2502.06897 | null |
2025-02-08 | FlavorDiffusion: Predicting Food Pairings and Chemical Interactions Using Diffusion Models | Seo Jun Pyo et.al. | 2502.06871 | null |
2025-02-08 | BF-GAN: Development of an AI-driven Bubbly Flow Image Generation Model Using Generative Adversarial Networks | Wen Zhou et.al. | 2502.06863 | link |
2025-02-06 | DiffNMR3: Advancing NMR Resolution Beyond Instrumental Limits | Sen Yan et.al. | 2502.06845 | null |
2025-02-06 | TorchResist: Open-Source Differentiable Resist Simulator | Zixiao Wang et.al. | 2502.06838 | link |
2025-02-05 | CTR-Driven Advertising Image Generation with Multimodal Large Language Models | Xingye Chen et.al. | 2502.06823 | link |
2025-02-05 | DiffListener: Discrete Diffusion Model for Listener Generation | Siyeol Jung et.al. | 2502.06822 | null |
2025-02-04 | Diffusion Instruction Tuning | Chen Jin et.al. | 2502.06814 | null |
2025-02-04 | Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models | Shuting Wang et.al. | 2502.06812 | null |
2025-02-03 | Efficient Diffusion Models: A Survey | Hui Shen et.al. | 2502.06805 | link |
2025-01-29 | Prompt-Aware Scheduling for Efficient Text-to-Image Inferencing System | Shubham Agarwal et.al. | 2502.06798 | null |
2025-02-12 | Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT | Dongyang Liu et.al. | 2502.06782 | null |
2025-02-10 | Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions | Jaeyeon Kim et.al. | 2502.06768 | null |
2025-02-10 | History-Guided Video Diffusion | Kiwhan Song et.al. | 2502.06764 | null |
2025-02-10 | Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists | Bojia Zi et.al. | 2502.06734 | null |
2025-02-10 | Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene | Tai-Yu Pan et.al. | 2502.06682 | null |
2025-02-11 | Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification | Jiachen Li et.al. | 2502.06619 | link |
2025-02-10 | TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models | Yangguang Li et.al. | 2502.06608 | link |
2025-02-12 | MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models | Kamil Garifullin et.al. | 2502.06606 | null |
2025-02-10 | A Large-scale AI-generated Image Inpainting Benchmark | Paschalis Giakoumoglou et.al. | 2502.06593 | null |
2025-02-10 | Diffusion Models for Computational Neuroimaging: A Survey | Haokai Zhao et.al. | 2502.06552 | link |
2025-02-20 | CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers | D. She et.al. | 2502.06527 | null |
2025-02-10 | Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority Generation | Soobin Um et.al. | 2502.06516 | null |
2025-02-10 | WyckoffDiff – A Generative Diffusion Model for Crystal Symmetry | Filip Ekström Kelvinius et.al. | 2502.06485 | link |
2025-02-10 | Habitizing Diffusion Planning for Efficient and Effective Decision Making | Haofei Lu et.al. | 2502.06401 | link |
2025-02-10 | TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints | Pengyu Long et.al. | 2502.06392 | null |
2025-02-10 | Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo | Filip Ekström Kelvinius et.al. | 2502.06379 | null |
2025-02-10 | Guidance-base Diffusion Models for Improving Photoacoustic Image Quality | Tatsuhiro Eguchi et.al. | 2502.06354 | null |
2025-02-10 | Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior | Lee Hyoseok et.al. | 2502.06338 | null |
2025-02-10 | Universal Approximation of Visual Autoregressive Transformers | Yifang Chen et.al. | 2502.06167 | null |
2025-02-17 | Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile | Hangliang Ding et.al. | 2502.06155 | null |
2025-02-10 | Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance | Li Hu et.al. | 2502.06145 | null |
2025-02-10 | Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models | Ce Zhang et.al. | 2502.06130 | link |
2025-02-10 | CDM: Contact Diffusion Model for Multi-Contact Point Localization | Seo Wook Han et.al. | 2502.06109 | null |
2025-02-17 | Debiasing Guidance for Discrete Diffusion with Sequential Monte Carlo | Cheuk Kit Lee et.al. | 2502.06079 | null |
2025-02-09 | Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization | Jiajun Fan et.al. | 2502.06061 | null |
2025-02-09 | Generating 3D Binding Molecules Using Shape-Conditioned Diffusion Models with Guidance | Ziqi Chen et.al. | 2502.06027 | null |
2025-02-09 | Dual Caption Preference Optimization for Diffusion Models | Amir Saeidi et.al. | 2502.06023 | link |
2025-02-09 | Diffusion Models for Inverse Problems in the Exponential Family | Alessandro Micheli et.al. | 2502.05994 | null |
2025-02-11 | VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer | Xinyu Liu et.al. | 2502.05979 | null |
2025-02-09 | Inverse Problem Sampling in Latent Space Using Sequential Monte Carlo | Idan Achituve et.al. | 2502.05908 | null |
2025-02-09 | Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image Generation | Vera Soboleva et.al. | 2502.05895 | link |
2025-02-09 | MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation | Zhifei Yang et.al. | 2502.05874 | link |
2025-02-09 | Understanding Design Fixation in Generative AI | Liuqing Chen et.al. | 2502.05870 | null |
2025-02-09 | Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries | Jen-tse Huang et.al. | 2502.05849 | link |
2025-02-09 | Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models | Rafał Karczewski et.al. | 2502.05807 | null |
2025-02-09 | Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling | Xiao Li et.al. | 2502.05743 | null |
2025-02-08 | SSDD-GAN: Single-Step Denoising Diffusion GAN for Cochlear Implant Surgical Scene Completion | Yike Zhang et.al. | 2502.05710 | null |
2025-02-08 | Semantic-Aware Adaptive Video Streaming Using Latent Diffusion Models for Wireless Networks | Zijiang Yan et.al. | 2502.05695 | null |
2025-02-08 | Towards AI-driven Sign Language Generation with Non-manual Markers | Han Zhang et.al. | 2502.05661 | null |
2025-02-08 | TrackDiffuser: Nearly Model-Free Bayesian Filtering with Diffusion Model | Yangguang He et.al. | 2502.05629 | null |
2025-02-08 | Training-Free Constrained Generation With Stable Diffusion Models | Stefano Zampini et.al. | 2502.05625 | null |
2025-02-08 | Discrete-Time Approximations of Controlled Diffusions with Infinite Horizon Discounted and Average Cost | Somnath Pradhan et.al. | 2502.05596 | null |
2025-02-13 | Diffusion Model for Interest Refinement in Multi-Interest Recommendation | Yankun Le et.al. | 2502.05561 | null |
2025-02-08 | Physics-Conditioned Diffusion Models for Lattice Gauge Theory | Qianteng Zhu et.al. | 2502.05504 | link |
2025-02-18 | A Physical Coherence Benchmark for Evaluating Video Generation Models via Optical Flow-guided Frame Prediction | Yongfan Chen et.al. | 2502.05503 | link |
2025-02-08 | Stochastic Forward-Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets | Haoye Lu et.al. | 2502.05446 | null |
2025-02-08 | Diverse Image Generation with Diffusion Models and Cross Class Label Learning for Polyp Classification | Vanshali Sharma et.al. | 2502.05444 | link |
2025-02-08 | Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation | Chenkai Xu et.al. | 2502.05415 | null |
2025-02-08 | Beyond and Free from Diffusion: Invertible Guided Consistency Training | Chia-Hong Hsu et.al. | 2502.05391 | null |
2025-02-06 | DiffNMR2: NMR Guided Sampling Acquisition Through Diffusion Model Uncertainty | Etienne Goffinet et.al. | 2502.05230 | null |
2025-02-05 | Blackout DIFUSCO | Jun Pyo Seo et.al. | 2502.05221 | link |
2025-02-04 | CoRPA: Adversarial Image Generation for Chest X-rays Using Concept Vector Perturbations and Generative Models | Amy Rafferty et.al. | 2502.05214 | null |
2025-02-12 | Safety at Scale: A Comprehensive Survey of Large Model Safety | Xingjun Ma et.al. | 2502.05206 | link |
2025-02-07 | FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation | Shilong Zhang et.al. | 2502.05179 | link |
2025-02-07 | QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation | Yue Zhao et.al. | 2502.05178 | null |
2025-02-07 | Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment | Minh-Quan Le et.al. | 2502.05153 | null |
2025-02-10 | Beautiful Images, Toxic Words: Understanding and Addressing Offensive Text in Generated Images | Aditya Kumar et.al. | 2502.05066 | link |
2025-02-07 | Robust Graph Learning Against Adversarial Evasion Attacks via Prior-Free Diffusion-Based Structure Purification | Jiayi Luo et.al. | 2502.05000 | link |
2025-02-07 | C2GM: Cascading Conditional Generation of Multi-scale Maps from Remote Sensing Images Constrained by Geographic Features | Chenxing Sun et.al. | 2502.04991 | null |
2025-02-07 | Cached Multi-Lora Composition for Multi-Concept Image Generation | Xiandong Zou et.al. | 2502.04923 | link |
2025-02-10 | Goku: Flow Based Video Generative Foundation Models | Shoufa Chen et.al. | 2502.04896 | null |
2025-02-07 | Advancing Wasserstein Convergence Analysis of Score-Based Models: Insights from Discretization and Second-Order Acceleration | Yifeng Yu et.al. | 2502.04849 | null |
2025-02-10 | HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation | Qijun Gan et.al. | 2502.04847 | null |
2025-02-07 | Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning | Chen-Xiao Gao et.al. | 2502.04778 | null |
2025-02-07 | Can Diffusion Models Learn Hidden Inter-Feature Rules Behind Images? | Yujin Han et.al. | 2502.04725 | null |
2025-02-11 | G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models | Mengdi Liu et.al. | 2502.04684 | null |
2025-02-07 | CCS: Controllable and Constrained Sampling with Diffusion Models via Initial Noise Perturbation | Bowen Song et.al. | 2502.04670 | null |
2025-02-07 | A Comprehensive Review on Noise Control of Diffusion Model | Zhehao Guo et.al. | 2502.04669 | null |
2025-02-07 | Fuzzy Linkography: Automatic Graphical Summarization of Creative Activity Traces | Amy Smith et.al. | 2502.04599 | link |
2025-02-06 | Mechanisms of Projective Composition of Diffusion Models | Arwen Bradley et.al. | 2502.04549 | null |
2025-02-06 | Fast Video Generation with Sliding Tile Attention | Peiyuan Zhang et.al. | 2502.04507 | null |
2025-02-06 | Provable Sample-Efficient Transfer Learning Conditional Diffusion Models via Representation Learning | Ziheng Cheng et.al. | 2502.04491 | null |
2025-02-06 | Augmented Conditioning Is Enough For Effective Training Image Generation | Jiahui Chen et.al. | 2502.04475 | null |
2025-02-06 | Iterative Importance Fine-tuning of Diffusion Models | Alexander Denker et.al. | 2502.04468 | null |
2025-02-06 | Decoder-Only LLMs are Better Controllers for Diffusion Models | Ziyi Dong et.al. | 2502.04412 | null |
2025-02-06 | UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation | Wenzhang Sun et.al. | 2502.04393 | null |
2025-02-06 | Towards Fair and Robust Face Parsing for Generative AI: A Multi-Objective Approach | Sophia J. Abraham et.al. | 2502.04391 | null |
2025-02-05 | DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation | Luciano Baresi et.al. | 2502.04378 | link |
2025-02-05 | Lost in Edits? A $λ$ -Compass for AIGC Provenance | Wenhao You et.al. | 2502.04364 | null |
2025-02-05 | On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices | Bosung Kim et.al. | 2502.04363 | link |
2025-02-01 | Analysis of Diffusion Models for Manifold Data | Anand Jerry George et.al. | 2502.04339 | null |
2025-02-06 | HOG-Diff: Higher-Order Guided Diffusion for Graph Generation | Yiming Huang et.al. | 2502.04308 | link |
2025-02-06 | MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation | Jinbo Xing et.al. | 2502.04299 | null |
2025-02-06 | Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression | Lirui Wang et.al. | 2502.04296 | null |
2025-02-06 | Realistic Image-to-Image Machine Unlearning via Decoupling and Knowledge Retention | Ayush K. Varshney et.al. | 2502.04260 | null |
2025-02-06 | Multi-fidelity emulator for large-scale 21 cm lightcone images: a few-shot transfer learning approach with generative adversarial network | Kangning Diao et.al. | 2502.04246 | null |
2025-02-06 | Diffusion-based mass map reconstruction from weak lensing data | Supranta S. Boruah et.al. | 2502.04158 | null |
2025-02-06 | Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis | Zhen Ye et.al. | 2502.04128 | link |
2025-02-09 | Generative Adversarial Networks Bridging Art and Machine Intelligence | Junhao Song et.al. | 2502.04116 | null |
2025-02-06 | Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency | Shangkun Sun et.al. | 2502.04076 | link |
2025-02-06 | TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers | Younghye Hwang et.al. | 2502.04056 | null |
2025-02-06 | PartEdit: Fine-Grained Image Editing using Pre-Trained Diffusion Models | Aleksandar Cvejic et.al. | 2502.04050 | null |
2025-02-08 | UniForm: A Unified Diffusion Transformer for Audio-Video Generation | Lei Zhao et.al. | 2502.03897 | null |
2025-02-06 | Hierarchical Entropic Diffusion for Ransomware Detection: A Probabilistic Approach to Behavioral Anomaly Isolation | Vasili Iskorohodov et.al. | 2502.03882 | null |
2025-02-06 | FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing | Jinya Sakurai et.al. | 2502.03826 | null |
2025-02-06 | DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models | Lingshun Kong et.al. | 2502.03810 | null |
2025-02-06 | DICE: Distilling Classifier-Free Guidance into Text Embeddings | Zhenyu Zhou et.al. | 2502.03726 | null |
2025-02-06 | Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free | Gian Mario Favero et.al. | 2502.03687 | null |
2025-02-06 | Variational Control for Guidance in Diffusion Models | Kushagra Pandey et.al. | 2502.03686 | null |
2025-02-05 | Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach | Yunuo Chen et.al. | 2502.03639 | null |
2025-02-05 | SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models | Daniel Levy et.al. | 2502.03638 | link |
2025-02-05 | DynVFX: Augmenting Real Videos with Dynamic Content | Danah Yatim et.al. | 2502.03621 | null |
2025-02-05 | Simultaneous Multi-Robot Motion Planning with Projected Diffusion Models | Jinhao Liang et.al. | 2502.03607 | null |
2025-02-17 | Path Planning for Masked Diffusion Model Sampling | Fred Zhangzhi Peng et.al. | 2502.03540 | null |
2025-02-10 | YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment | Amitava Das et.al. | 2502.03512 | null |
2025-02-05 | DC-VSR: Spatially and Temporally Consistent Video Super-Resolution with Video Diffusion Prior | Janghyeok Han et.al. | 2502.03502 | null |
2025-02-05 | Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control | Xianghui Ze et.al. | 2502.03498 | null |
2025-02-05 | FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise | Yunlong Yuan et.al. | 2502.03496 | null |
2025-02-05 | MetaFE-DE: Learning Meta Feature Embedding for Depth Estimation from Monocular Endoscopic Images | Dawei Lu et.al. | 2502.03493 | null |
2025-02-05 | Lanpaint: Training-Free Diffusion Inpainting with Exact and Fast Conditional Inference | Candi Zheng et.al. | 2502.03491 | link |
2025-02-05 | Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics | Xuan Li et.al. | 2502.03449 | null |
2025-02-05 | Masked Autoencoders Are Effective Tokenizers for Diffusion Models | Hao Chen et.al. | 2502.03444 | null |
2025-02-05 | On Fairness of Unified Multimodal Large Language Model for Image Generation | Ming Liu et.al. | 2502.03429 | null |
2025-02-05 | TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer | Zhihong Xu et.al. | 2502.03426 | null |
2025-02-05 | Can Text-to-Image Generative Models Accurately Depict Age? A Comparative Study on Synthetic Portrait Generation and Age Estimation | Alexey A. Novikov et.al. | 2502.03420 | null |
2025-02-05 | A Mixture-Based Framework for Guiding Diffusion Models | Yazid Janati et.al. | 2502.03332 | null |
2025-02-05 | An efficient end-to-end computational framework for the generation of ECG calibrated volumetric models of human atrial electrophysiology | Elena Zappon et.al. | 2502.03322 | null |
2025-02-05 | MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent | Xinyao Liao et.al. | 2502.03207 | null |
2025-02-05 | Poisson Flow Joint Model for Multiphase contrast-enhanced CT | Rongjun Ge et.al. | 2502.03079 | null |
2025-02-05 | Direct Distributional Optimization for Provable Alignment of Diffusion Models | Ryotaro Kawata et.al. | 2502.02954 | null |
2025-02-05 | Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization | Yang Li et.al. | 2502.02941 | null |
2025-02-05 | Elucidating the Preconditioning in Consistency Distillation | Kaiwen Zheng et.al. | 2502.02922 | null |
2025-02-05 | A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges | Lei Ding et.al. | 2502.02835 | null |
2025-02-04 | When are Diffusion Priors Helpful in Sparse Reconstruction? A Study with Sparse-view CT | Matt Y. Cheung et.al. | 2502.02771 | null |
2025-02-04 | Controllable Video Generation with Provable Disentanglement | Yifan Shen et.al. | 2502.02690 | null |
2025-02-03 | Secure & Personalized Music-to-Video Generation via CHARCHA | Mehul Agarwal et.al. | 2502.02610 | null |
2025-01-31 | Physically Interpretable Representation and Controlled Generation for Turbulence Data | Tiffany Fan et.al. | 2502.02605 | null |
2025-02-04 | COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation | Xueqing Deng et.al. | 2502.02589 | null |
2025-02-04 | Calibrated Multi-Preference Optimization for Aligning Diffusion Models | Kyungmin Lee et.al. | 2502.02588 | null |
2025-02-04 | Open Materials Generation with Stochastic Interpolants | Philipp Hoellmer et.al. | 2502.02582 | null |
2025-02-04 | Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation | Jian Liu et.al. | 2502.02525 | link |
2025-02-04 | Privacy Attacks on Image AutoRegressive Models | Antoni Kowalczuk et.al. | 2502.02514 | link |
2025-02-04 | VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models | Hila Chefer et.al. | 2502.02492 | null |
2025-02-04 | Do Graph Diffusion Models Accurately Capture and Generate Substructure Distributions? | Xiyuan Wang et.al. | 2502.02488 | null |
2025-02-04 | Distributional Diffusion Models with Scoring Rules | Valentin De Bortoli et.al. | 2502.02483 | null |
2025-02-09 | Towards Consistent and Controllable Image Synthesis for Face Editing | Mengting Wei et.al. | 2502.02465 | null |
2025-02-04 | Sparse Data Generation Using Diffusion Models | Phil Ostheimer et.al. | 2502.02448 | null |
2025-02-04 | Towards Fast Graph Generation via Autoregressive Noisy Filtration Modeling | Markus Krimmel et.al. | 2502.02415 | link |
2025-02-04 | Stochastic optimal control problems with measurable coefficients via $L^p$ -viscosity solutions and applications to optimal advertising models | Filippo de Feo et.al. | 2502.02352 | null |
2025-02-04 | DIME:Diffusion-Based Maximum Entropy Reinforcement Learning | Onur Celik et.al. | 2502.02316 | null |
2025-02-04 | Exploring the latent space of diffusion models directly through singular value decomposition | Li Wang et.al. | 2502.02225 | null |
2025-02-04 | Flatten Graphs as Sequences: Transformers are Scalable Graph Generators | Dexiong Chen et.al. | 2502.02216 | null |
2025-02-04 | InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration | Senmao Li et.al. | 2502.02215 | null |
2025-02-04 | From Uncertain to Safe: Conformal Fine-Tuning of Diffusion Models for Safe PDE Control | Peiyan Hu et.al. | 2502.02205 | null |
2025-02-04 | End-to-End Detector Optimization with Diffusion models: A Case Study in Sampling Calorimeters | Kylian Schmidt et.al. | 2502.02152 | null |
2025-02-04 | On the Guidance of Flow Matching | Ruiqi Feng et.al. | 2502.02150 | link |
2025-02-05 | IPO: Iterative Preference Optimization for Text-to-Video Generation | Xiaomeng Yang et.al. | 2502.02088 | null |
2025-02-12 | One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation | Jianze Li et.al. | 2502.01993 | link |
2025-02-04 | Rethinking Timesteps Samplers and Prediction Types | Bin Xie et.al. | 2502.01990 | null |
2025-02-05 | T-SCEND: Test-time Scalable MCTS-enhanced Diffusion Model | Tao Zhang et.al. | 2502.01989 | link |
2025-02-04 | Generative Data Mining with Longtail-Guided Diffusion | David S. Hayden et.al. | 2502.01980 | null |
2025-02-11 | UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping | Aashish Rai et.al. | 2502.01846 | null |
2025-02-03 | Diffusion Model for Multiple Antenna Communications | Jia Guo et.al. | 2502.01841 | null |
2025-02-03 | Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning | Hanyang Zhao et.al. | 2502.01819 | null |
2025-02-03 | VILP: Imitation Learning with Latent Video Planning | Zhengtong Xu et.al. | 2502.01784 | link |
2025-02-03 | Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity | Haocheng Xi et.al. | 2502.01776 | null |
2025-02-03 | Generating Multi-Image Synthetic Data for Text-to-Image Customization | Nupur Kumari et.al. | 2502.01720 | null |
2025-02-07 | MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation | Haibo Tong et.al. | 2502.01719 | null |
2025-02-06 | Fast Direct: Query-Efficient Online Black-box Guidance for Diffusion-model Target Generation | Kim Yong Tan et.al. | 2502.01692 | link |
2025-02-02 | HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment | Lifan Jiang et.al. | 2502.01690 | null |
2025-02-01 | Refining Alignment Framework for Diffusion Models with Intermediate-Step Preference Ranking | Jie Ren et.al. | 2502.01667 | null |
2025-02-03 | SliderSpace: Decomposing the Visual Capabilities of Diffusion Models | Rohit Gandikota et.al. | 2502.01639 | link |
2025-02-05 | MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation | Yiren Song et.al. | 2502.01572 | null |
2025-02-03 | Scalable Language Models with Posterior Inference of Latent Thought Vectors | Deqian Kong et.al. | 2502.01567 | null |
2025-02-03 | Transformers trained on proteins can learn to attend to Euclidean distance | Isaac Ellmen et.al. | 2502.01533 | link |
2025-02-03 | BD-Diff: Generative Diffusion Model for Image Deblurring on Unknown Domains with Blur-Decoupled Learning | Junhao Cheng et.al. | 2502.01522 | null |
2025-02-03 | End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings | Yeruru Asrar Ahmed et.al. | 2502.01507 | null |
2025-02-03 | Improved Training Technique for Latent Consistency Models | Quan Dao et.al. | 2502.01441 | link |
2025-02-03 | Assessing the use of Diffusion models for motion artifact correction in brain MRI | Paolo Angella et.al. | 2502.01418 | null |
2025-02-03 | Human Body Restoration with One-Step Diffusion Model and A New Benchmark | Jue Gong et.al. | 2502.01411 | null |
2025-02-03 | Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods | Oussama Zekri et.al. | 2502.01384 | link |
2025-02-03 | Inverse Bridge Matching Distillation | Nikita Gushchin et.al. | 2502.01362 | null |
2025-02-03 | Diffusion at Absolute Zero: Langevin Sampling Using Successive Moreau Envelopes | Andreas Habring et.al. | 2502.01358 | null |
2025-02-03 | Heterogeneous Image GNN: Graph-Conditioned Diffusion for Image Synthesis | Rupert Menneer et.al. | 2502.01309 | null |
2025-02-10 | Compressed Image Generation with Denoising Diffusion Codebook Models | Guy Ohayon et.al. | 2502.01189 | null |
2025-02-03 | A generative foundation model for an all-in-one seismic processing framework | Shijun Cheng et.al. | 2502.01111 | null |
2025-02-03 | VidSketch: Hand-drawn Sketch-Driven Video Generation with Diffusion Control | Lifan Jiang et.al. | 2502.01101 | link |
2025-02-13 | OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models | Gaojie Lin et.al. | 2502.01061 | null |
2025-02-03 | Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization | Tao Zhang et.al. | 2502.01051 | link |
2025-02-03 | WonderHuman: Hallucinating Unseen Parts in Dynamic 3D Human Reconstruction | Zilong Wang et.al. | 2502.01045 | null |
2025-02-03 | Pushing the Boundaries of State Space Models for Image and Video Generation | Yicong Hong et.al. | 2502.00972 | null |
2025-02-03 | CoDe: Blockwise Control for Denoising Diffusion Models | Anuj Singh et.al. | 2502.00968 | link |
2025-02-03 | CLIP-UP: A Simple and Efficient Mixture-of-Experts CLIP Training Recipe with Sparse Upcycling | Xinze Wang et.al. | 2502.00965 | null |
2025-02-02 | Blink of an eye: a simple theory for feature localization in generative models | Marvin Li et.al. | 2502.00921 | null |
2025-02-02 | Cosmological super-resolution of the 21-cm signal | Simon Pochinda et.al. | 2502.00852 | null |
2025-02-02 | RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning | Yuanhuiyi Lyu et.al. | 2502.00848 | null |
2025-02-02 | Weak Supervision Dynamic KL-Weighted Diffusion Models Guided by Large Language Models | Julian Perry et.al. | 2502.00826 | null |
2025-02-02 | Data Fusion for Full-Range Response Reconstruction via Diffusion Models | Wingho Feng et.al. | 2502.00795 | null |
2025-02-02 | A method for estimating forest carbon storage distribution density via artificial intelligence generated content model | Zhenyu Yu et.al. | 2502.00783 | null |
2025-02-02 | Understanding and Mitigating the High Computational Cost in Path Data Diffusion | Dingyuan Shi et.al. | 2502.00725 | null |
2025-02-02 | High-Order Matching for One-Step Shortcut Diffusion Models | Bo Chen et.al. | 2502.00688 | null |
2025-02-02 | Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer | Tao Ren et.al. | 2502.00639 | null |
2025-02-02 | Strengthening Generative Robot Policies through Predictive World Modeling | Han Qi et.al. | 2502.00622 | null |
2025-02-01 | Deep Task-Based Beamforming and Channel Data Augmentations for Enhanced Ultrasound Imaging | Ariel Amar et.al. | 2502.00524 | null |
2025-02-04 | Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation | Yang Cao et.al. | 2502.00500 | null |
2025-02-01 | A framework for river connectivity classification using temporal image processing and attention based neural networks | Timothy James Becker et.al. | 2502.00474 | null |
2025-02-01 | Enhancing Memory and Imagination Consistency in Diffusion-based World Models via Linear-Time Sequence Modeling | Jia-Hua Lee et.al. | 2502.00466 | null |
2025-02-01 | CAT Pruning: Cluster-Aware Token Pruning For Text-to-Image Diffusion Models | Xinle Cheng et.al. | 2502.00433 | null |
2025-02-01 | Masked Generative Nested Transformers with Decode Time Scaling | Sahil Goyal et.al. | 2502.00382 | null |
2025-02-12 | Soft Diffusion Actor-Critic: Efficient Online Reinforcement Learning for Diffusion Policy | Haitong Ma et.al. | 2502.00361 | null |
2025-02-01 | Shape from Semantics: 3D Shape Generation from Multi-View Semantics | Liangchen Li et.al. | 2502.00360 | null |
2025-02-01 | Exploring Representation-Aligned Latent Space for Better Generation | Wanghan Xu et.al. | 2502.00359 | null |
2025-02-01 | Denoising Score Matching with Random Features: Insights on Diffusion Models from Precise Learning Curves | Anand Jerry George et.al. | 2502.00336 | null |
2025-02-04 | BiMaCoSR: Binary One-Step Diffusion Model Leveraging Flexible Matrix Compression for Real Super-Resolution | Kai Liu et.al. | 2502.00333 | link |
2025-02-01 | A Diffusion Model Translator for Efficient Image-to-Image Translation | Mengfei Xia et.al. | 2502.00307 | null |
2025-02-01 | MCM: Multi-layer Concept Map for Efficient Concept Learning from Masked Images | Yuwei Sun et.al. | 2502.00266 | null |
2025-02-01 | Fast Solvers for Discrete Diffusion Models: Theory and Applications of High-Order Algorithms | Yinuo Ren et.al. | 2502.00234 | null |
2025-01-31 | Designing Scheduling for Diffusion Models via Spectral Analysis | Roi Benita et.al. | 2502.00180 | null |
2025-01-31 | Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions | Sören Christensen et.al. | 2501.19373 | null |
2025-01-31 | Pathological MRI Segmentation by Synthetic Pathological Data Generation in Fetuses and Neonates | Misha P. T Kaandorp et.al. | 2501.19338 | null |
2025-01-31 | Consistent Video Colorization via Palette Guidance | Han Wang et.al. | 2501.19331 | null |
2025-01-31 | Medical Semantic Segmentation with Diffusion Pretrain | David Li et.al. | 2501.19265 | null |
2025-01-31 | Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search | Yuta Oshima et.al. | 2501.19252 | null |
2025-01-31 | PSyDUCK: Training-Free Steganography for Latent Diffusion | Georgia Channing et.al. | 2501.19172 | null |
2025-01-31 | RMDM: Radio Map Diffusion Model with Physics Informed | Haozhe Jia et.al. | 2501.19160 | link |
2025-01-31 | Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data | Xichen Xu et.al. | 2501.19094 | null |
2025-01-31 | MotionPCM: Real-Time Motion Synthesis with Phased Consistency Model | Lei Jiang et.al. | 2501.19083 | null |
2025-01-31 | Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations | Dahye Kim et.al. | 2501.19066 | link |
2025-01-31 | Collaborative Diffusion Model for Recommender System | Gyuseok Lee et.al. | 2501.18997 | null |
2025-01-31 | OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation | Yuchen Lin et.al. | 2501.18982 | null |
2025-01-31 | BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics | Yuxuan Liu et.al. | 2501.18972 | null |
2025-01-31 | Fantastic Targets for Concept Erasure in Diffusion Models and Where To Find Them | Anh Bui et.al. | 2501.18950 | link |
2025-01-31 | Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior | Tongda Xu et.al. | 2501.18913 | link |
2025-01-31 | Trustworthy Evaluation of Generative AI Models | Zijun Gao et.al. | 2501.18897 | null |
2025-01-31 | Distorting Embedding Space for Safety: A Defense Mechanism for Adversarially Robust Diffusion Models | Jaesin Ahn et.al. | 2501.18877 | link |
2025-01-31 | REG: Rectified Gradient Guidance for Conditional Diffusion Models | Zhengqi Gao et.al. | 2501.18865 | null |
2025-01-31 | Equivariant Hypergraph Diffusion for Crystal Structure Prediction | Yang Liu et.al. | 2501.18850 | null |
2025-01-31 | Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential | Chenyu Gao et.al. | 2501.18834 | null |
2025-01-30 | Every Image Listens, Every Image Dances: Music-Driven Image Animation | Zhikang Dong et.al. | 2501.18801 | null |
2025-01-30 | Distillation-Driven Diffusion Model for Multi-Scale MRI Super-Resolution: Make 1.5T MRI Great Again | Zhe Wang et.al. | 2501.18736 | link |
2025-01-30 | Strong and Controllable 3D Motion Generation | Canxuan Gang et.al. | 2501.18726 | null |
2025-02-07 | Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting | Yansong Qu et.al. | 2501.18672 | null |
2025-01-30 | High-Accuracy ECG Image Interpretation using Parameter-Efficient LoRA Fine-Tuning with Multimodal LLaMA 3.2 | Nandakishor M et.al. | 2501.18670 | null |
2025-01-28 | DebiasPI: Inference-time Debiasing by Prompt Iteration of a Text-to-Image Generative Model | Sarah Bonna et.al. | 2501.18642 | null |
2025-01-30 | Diffusion Autoencoders are Scalable Image Tokenizers | Yinbo Chen et.al. | 2501.18593 | null |
2025-01-30 | DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models | Ruofan Liang et.al. | 2501.18590 | null |
2025-01-30 | Inkspire: Supporting Design Exploration with Generative AI through Analogical Sketching | David Chuan-En Lin et.al. | 2501.18588 | null |
2025-02-05 | SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer | Enze Xie et.al. | 2501.18427 | null |
2025-01-31 | Generator Sets for the Minkowski Sum Problem – Theory and Insights | Mark Lyngesen et.al. | 2501.18420 | null |
2025-01-30 | Simulation of microstructures and machine learning | Katja Schladitz et.al. | 2501.18313 | null |
2025-01-30 | Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss | Wenshuo Chen et.al. | 2501.18232 | link |
2025-01-30 | Inverse source problem of sub-diffusion of variable exponent | Zhiyuan Li et.al. | 2501.18228 | null |
2025-01-30 | LLMs can see and hear without any training | Kumar Ashutosh et.al. | 2501.18096 | link |
2025-01-31 | SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders | Bartosz Cywiński et.al. | 2501.18052 | link |
2025-01-29 | Generative AI for Vision: A Comprehensive Study of Frameworks and Applications | Fouad Bousetouane et.al. | 2501.18033 | null |
2025-01-28 | ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models | Ruiqi Xu et.al. | 2501.17895 | null |
2025-02-02 | SCDM: Score-Based Channel Denoising Model for Digital Semantic Communications | Hao Mo et.al. | 2501.17876 | null |
2025-01-29 | Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling | Xiaokang Chen et.al. | 2501.17811 | link |
2025-01-29 | VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback | Sayeh Gholipour Picha et.al. | 2501.17726 | null |
2025-01-29 | A Framework for Generating Realistic Synthetic Tabular Data in a Randomized Controlled Trial Setting | Niki Z. Petrakos et.al. | 2501.17719 | null |
2025-01-29 | Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment | Zixue Zeng et.al. | 2501.17690 | link |
2025-02-11 | Distinguished Quantized Guidance for Diffusion-based Sequence Recommendation | Wenyu Mao et.al. | 2501.17670 | null |
2025-01-29 | Solving Inverse Problems using Diffusion with Fast Iterative Renoising | Matt C. Bendel et.al. | 2501.17468 | null |
2025-01-28 | MDDM: A Molecular Dynamics Diffusion Model to Predict Particle Self-Assembly | Kevin Ferguson et.al. | 2501.17319 | null |
2025-01-28 | CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation | Nikolai Kalischek et.al. | 2501.17162 | null |
2025-01-31 | IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait | Han Yang et.al. | 2501.17159 | null |
2025-01-28 | Text-to-Image Generation for Vocabulary Learning Using the Keyword Method | Nuwan T. Attygalle et.al. | 2501.17099 | null |
2025-01-28 | Generative diffusion models from a PDE perspective | Fei Cao et.al. | 2501.17054 | null |
2025-02-04 | MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition | Philippe Pasquier et.al. | 2501.17011 | null |
2025-01-28 | RODEO: Robust Outlier Detection via Exposing Adaptive Out-of-Distribution Samples | Hossein Mirzaei et.al. | 2501.16971 | link |
2025-01-28 | Generating Random Vectors satisfying Linear and Nonlinear Constraints | Rick S. H. Willemsen et.al. | 2501.16936 | null |
2025-01-28 | Adversarial Masked Autoencoder Purifier with Defense Transferability | Yuan-Chih Chen et.al. | 2501.16904 | null |
2025-01-28 | DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model | Josua Spisak et.al. | 2501.16800 | null |
2025-01-28 | FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation | Arvin Tashakori et.al. | 2501.16778 | null |
2025-01-28 | DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation | Chenguo Lin et.al. | 2501.16764 | null |
2025-01-28 | ITVTON:Virtual Try-On Diffusion Transformer Model Based on Integrated Image and Text | Haifeng Ni et.al. | 2501.16757 | null |
2025-01-31 | Consistency Diffusion Models for Single-Image 3D Reconstruction with Priors | Chenru Jiang et.al. | 2501.16737 | null |
2025-01-28 | Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models | Huijie Liu et.al. | 2501.16714 | null |
2025-01-29 | Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion | Shengyuan Liu et.al. | 2501.16679 | link |
2025-01-28 | Variational Schrödinger Momentum Diffusion | Kevin Rojas et.al. | 2501.16675 | null |
2025-01-28 | CascadeV: An Implementation of Wurstchen Architecture for Video Generation | Wenfeng Lin et.al. | 2501.16612 | link |
2025-02-04 | LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation | Farzad Farhadzadeh et.al. | 2501.16559 | null |
2025-01-27 | PackDiT: Joint Human Motion and Text Generation via Mutual Prompting | Zhongyu Jiang et.al. | 2501.16551 | null |
2025-01-27 | PhysAnimator: Physics-Guided Generative Cartoon Animation | Tianyi Xie et.al. | 2501.16550 | null |
2025-01-27 | Decrypting the temperature field in flow boiling with latent diffusion models | UngJin Na et.al. | 2501.16510 | null |
2025-01-24 | UDiTQC: U-Net-Style Diffusion Transformer for Quantum Circuit Synthesis | Zhiwei Chen et.al. | 2501.16380 | null |
2025-01-19 | Synthetic Data Generation by Supervised Neural Gas Network for Physiological Emotion Recognition Data | S. Muhammad Hossein Mousavi et.al. | 2501.16353 | link |
2025-01-18 | An Integrated Approach to AI-Generated Content in e-health | Tasnim Ahmed et.al. | 2501.16348 | null |
2025-01-27 | RelightVid: Temporal-Consistent Diffusion Model for Video Relighting | Ye Fang et.al. | 2501.16330 | null |
2025-01-27 | Congested Crossing Pedestrian Traffic Flow : Dispersion vs. Transport in Crowded Areas | Mariam Al Khatib et.al. | 2501.16275 | null |
2025-01-27 | UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images | Tatiana Taís Schein et.al. | 2501.16211 | link |
2025-01-27 | Multi-front dynamics in spatially inhomogeneous Allen-Cahn equations | Robbin Bastiaansen et.al. | 2501.16195 | null |
2025-01-27 | BAG: Body-Aligned 3D Wearable Asset Generation | Zhongjin Luo et.al. | 2501.16177 | null |
2025-01-27 | Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors | Zhiyuan Lu et.al. | 2501.16147 | null |
2025-01-27 | Using Generative Models to Produce Realistic Populations of UK Windstorms | Yee Chun Tsoi et.al. | 2501.16110 | null |
2025-01-27 | Improving Tropical Cyclone Forecasting With Video Diffusion Models | Zhibo Ren et.al. | 2501.16003 | link |
2025-01-27 | MatCLIP: Light- and Shape-Insensitive Assignment of PBR Material Models | Michael Birsak et.al. | 2501.15981 | null |
2025-01-27 | Generative AI for Lyapunov Optimization Theory in UAV-based Low-Altitude Economy Networking | Zhang Liu et.al. | 2501.15928 | null |
2025-01-28 | Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation | Adil Kaan Akan et.al. | 2501.15878 | null |
2025-01-30 | Can Location Embeddings Enhance Super-Resolution of Satellite Imagery? | Daniel Panangian et.al. | 2501.15847 | null |
2025-01-27 | Autonomous Horizon-based Asteroid Navigation With Observability-constrained Maneuvers | Aditya Arjun Anibha et.al. | 2501.15806 | null |
2025-01-27 | Memorization and Regularization in Generative Diffusion Models | Ricardo Baptista et.al. | 2501.15785 | link |
2025-01-27 | Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models? | Yunbo Lyu et.al. | 2501.15775 | null |
2025-01-26 | Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting | Yuxin Zhang et.al. | 2501.15641 | null |
2025-01-26 | BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation | Ali Khodabandeh Yalabadi et.al. | 2501.15631 | link |
2025-01-26 | Comparative clinical evaluation of “memory-efficient” synthetic 3d generative adversarial networks (gan) head-to-head to state of art: results on computed tomography of the chest | Mahshid shiri et.al. | 2501.15572 | null |
2025-01-26 | Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models | Spencer Ramsey et.al. | 2501.15571 | null |
2025-01-26 | CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary | Jiahang Tu et.al. | 2501.15562 | null |
2025-01-26 | Distributionally Robust Graph Out-of-Distribution Recommendation via Diffusion Model | Chu Zhao et.al. | 2501.15555 | link |
2025-01-26 | LoRAGuard: An Effective Black-box Watermarking Approach for LoRAs | Peizhuo Lv et.al. | 2501.15478 | null |
2025-01-26 | “See What I Imagine, Imagine What I See”: Human-AI Co-Creation System for 360 $^\circ$ Panoramic Video Generation in VR | Yunge Wen et.al. | 2501.15456 | null |
2025-01-26 | SQ-DM: Accelerating Diffusion Models with Aggressive Quantization and Temporal Sparsity | Zichen Fan et.al. | 2501.15448 | null |
2025-01-26 | StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces | Kyeongmin Yeo et.al. | 2501.15445 | null |
2025-01-31 | Dfilled: Repurposing Edge-Enhancing Diffusion for Guided DSM Void Filling | Daniel Panangian et.al. | 2501.15440 | null |
2025-01-26 | Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations | Zijun Long et.al. | 2501.15379 | null |
2025-01-25 | Data-Driven Distributionally Robust Optimization for Long-Term Contract vs. Spot Allocation Decisions: Application to Electricity Markets | Dimitri J. Papageorgiou et.al. | 2501.15340 | null |
2025-01-25 | Efficient Point Clouds Upsampling via Flow Matching | Zhi-Song Liu et.al. | 2501.15286 | null |
2025-01-25 | Generalizable Deepfake Detection via Effective Local-Global Feature Extraction | Jiazhen Yan et.al. | 2501.15253 | null |
2025-01-25 | Enhancing Fetal Plane Classification Accuracy with Data Augmentation Using Diffusion Models | Yueying Tian et.al. | 2501.15248 | null |
2025-01-25 | Enhancing Intent Understanding for Ambiguous Prompts through Human-Machine Co-Adaptation | Yangfan He et.al. | 2501.15167 | null |
2025-01-25 | MAP-based Problem-Agnostic diffusion model for Inverse Problems | Pingping Tao et.al. | 2501.15128 | null |
2025-01-25 | KETA: Kinematic-Phrases-Enhanced Text-to-Motion Generation via Fine-grained Alignment | Yu Jiang et.al. | 2501.15058 | link |
2025-01-25 | Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval | Bingjun Luo et.al. | 2501.15052 | null |
2025-01-25 | Controllable Protein Sequence Generation with LLM Preference Optimization | Xiangyu Liu et.al. | 2501.15007 | link |
2025-01-24 | MATCHA:Towards Matching Anything | Fei Xue et.al. | 2501.14945 | null |
2025-01-23 | Resource Allocation Driven by Large Models in Future Semantic-Aware Networks | Haijun Zhang et.al. | 2501.14832 | null |
2025-01-21 | Controlling Ensemble Variance in Diffusion Models: An Application for Reanalyses Downscaling | Fabio Merizzi et.al. | 2501.14822 | null |
2025-01-11 | HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators | Le Chen et.al. | 2501.14794 | null |
2025-01-09 | Towards Dynamic Neural Communication and Speech Neuroprosthesis Based on Viseme Decoding | Ji-Ha Park et.al. | 2501.14790 | null |
2025-01-27 | Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jisi Zhang et.al. | 2501.14680 | null |
2025-01-24 | Towards Scalable Topological Regularizers | Hiu-Tung Wong et.al. | 2501.14641 | null |
2025-01-24 | Training-Free Style and Content Transfer by Leveraging U-Net Skip Connections in Stable Diffusion 2.* | Ludovica Schaerf et.al. | 2501.14524 | null |
2025-01-24 | Advancing data-driven broadband seismic wavefield simulation with multi-conditional diffusion model | Zhengfa Bi et.al. | 2501.14348 | null |
2025-01-24 | Stochastic Method for Delayed Neutron Precursors Transport in Liquid Fuel | Mathis Caprais et.al. | 2501.14332 | null |
2025-01-24 | PAID: A Framework of Product-Centric Advertising Image Design | Hongyu Chen et.al. | 2501.14316 | null |
2025-01-24 | CDI: Blind Image Restoration Fidelity Evaluation based on Consistency with Degraded Image | Xiaojun Tang et.al. | 2501.14264 | null |
2025-01-24 | TFG-Flow: Training-free Guidance in Multimodal Generative Flow | Haowei Lin et.al. | 2501.14216 | link |
2025-01-24 | VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking | Runyi Hu et.al. | 2501.14195 | link |
2025-01-23 | LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps | Andrey Palaev et.al. | 2501.14046 | link |
2025-01-23 | INDIGO+: A Unified INN-Guided Probabilistic Diffusion Algorithm for Blind and Non-Blind Image Restoration | Di You et.al. | 2501.14014 | null |
2025-01-22 | Synthetic CT image generation from CBCT: A Systematic Review | Alzahra Altalib et.al. | 2501.13972 | null |
2025-01-22 | InsTex: Indoor Scenes Stylized Texture Synthesis | Yunfan Zhang et.al. | 2501.13969 | null |
2025-01-22 | Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation | Kenta Uesugi et.al. | 2501.13968 | null |
2025-01-18 | Fanar: An Arabic-Centric Multimodal Generative AI Platform | Fanar Team et.al. | 2501.13944 | null |
2025-01-23 | Can We Generate Images with CoT? Let’s Verify and Reinforce Image Generation Step by Step | Ziyu Guo et.al. | 2501.13926 | link |
2025-01-23 | IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models | Jiayi Lei et.al. | 2501.13920 | null |
2025-01-23 | Improving Video Generation with Human Feedback | Jie Liu et.al. | 2501.13918 | null |
2025-01-23 | Generating Realistic Forehead-Creases for User Verification via Conditioned Piecewise Polynomial Curves | Abhishek Tandon et.al. | 2501.13889 | link |
2025-01-23 | Unveiling the Power of Noise Priors: Enhancing Diffusion Models for Mobile Traffic Prediction | Zhi Sheng et.al. | 2501.13794 | null |
2025-01-23 | An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem | Mingzhao Wang et.al. | 2501.13767 | link |
2025-01-23 | A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation | Dario Serez et.al. | 2501.13718 | null |
2025-01-23 | Training-Free Consistency Pipeline for Fashion Repose | Potito Aghilar et.al. | 2501.13692 | null |
2025-02-05 | One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt | Tao Liu et.al. | 2501.13554 | link |
2025-01-23 | Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse | Wenzhuo Ma et.al. | 2501.13528 | null |
2025-01-23 | LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation | JiaXin Chen et.al. | 2501.13475 | null |
2025-01-23 | Zero-Shot Trajectory Planning for Signal Temporal Logic Tasks | Ruijia Liu et.al. | 2501.13457 | null |
2025-01-23 | EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion | Jiangchuan Wei et.al. | 2501.13452 | null |
2025-01-23 | Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement | Meng-Ping Lin et.al. | 2501.13375 | null |
2025-01-23 | MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize | Haohang Xu et.al. | 2501.13349 | null |
2025-01-23 | One Fits All: General Mobility Trajectory Modeling via Masked Conditional Diffusion | Qingyue Long et.al. | 2501.13347 | null |
2025-01-23 | Retrievals Can Be Detrimental: A Contrastive Backdoor Attack Paradigm on Retrieval-Augmented Diffusion Models | Hao Fang et.al. | 2501.13340 | null |
2025-01-23 | Gradient-Free Adversarial Purification with Diffusion Models | Xuelong Dai et.al. | 2501.13336 | null |
2025-01-22 | State Combinatorial Generalization In Decision Making With Conditional Diffusion Models | Xintong Duan et.al. | 2501.13241 | null |
2025-01-22 | Graph Representation Learning with Diffusion Generative Models | Daniel Wesego et.al. | 2501.13133 | null |
2025-01-23 | Accelerate High-Quality Diffusion Models with Inner Loop Feedback | Matthew Gwilliam et.al. | 2501.13107 | null |
2025-01-22 | Robust Representation Consistency Model via Contrastive Denoising | Jiachen Lei et.al. | 2501.13094 | link |
2025-01-22 | Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation | Akshay Krishnan et.al. | 2501.13087 | null |
2025-01-22 | Robust Body Composition Analysis by Generating 3D CT Volumes from Limited 2D Slices | Lianrui Zuo et.al. | 2501.13071 | null |
2025-01-22 | Beyond the Lungs: Extending the Field of View in Chest CT with Latent Diffusion Models | Lianrui Zuo et.al. | 2501.13068 | null |
2025-01-22 | Low-dimensional adaptation of diffusion models: Convergence in total variation | Jiadong Liang et.al. | 2501.12982 | null |
2025-01-22 | LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation | Jiahao Wang et.al. | 2501.12976 | null |
2025-01-22 | 3D Object Manipulation in a Single Image using Generative Models | Ruisi Zhao et.al. | 2501.12935 | null |
2025-01-22 | PreciseCam: Precise Camera Control for Text-to-Image Generation | Edurne Bernal-Berdun et.al. | 2501.12910 | null |
2025-01-22 | CrossDiff: Diffusion Probabilistic Model With Cross-conditional Encoder-Decoder for Crack Segmentation | Xianglong Shi et.al. | 2501.12860 | null |
2025-01-22 | AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation | Aghiles Kebaili et.al. | 2501.12840 | null |
2025-01-22 | Certified Guidance for Planning with Deep Generative Models | Francesco Giacomarra et.al. | 2501.12815 | null |
2025-01-22 | T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation | Lijun Li et.al. | 2501.12612 | link |
2025-01-22 | Image Motion Blur Removal in the Temporal Dimension with Video Diffusion Models | Wang Pang et.al. | 2501.12604 | null |
2025-01-21 | Federated Discrete Denoising Diffusion Model for Molecular Generation with OpenFL | Kevin Ta et.al. | 2501.12523 | link |
2025-01-21 | Owls are wise and foxes are unfaithful: Uncovering animal stereotypes in vision-language models | Tabinda Aman et.al. | 2501.12433 | null |
2025-01-20 | Ensemble score filter with image inpainting for data assimilation in tracking surface quasi-geostrophic dynamics with partial observations | Siming Liang et.al. | 2501.12419 | link |
2025-01-21 | Towards Affordance-Aware Articulation Synthesis for Rigged Objects | Yu-Chu Yu et.al. | 2501.12393 | null |
2025-01-22 | GPS as a Control Signal for Image Generation | Chao Feng et.al. | 2501.12390 | null |
2025-01-21 | Taming Teacher Forcing for Masked Autoregressive Video Generation | Deyu Zhou et.al. | 2501.12389 | null |
2025-01-21 | Audio Texture Manipulation by Exemplar-Based Analogy | Kan Jen Cheng et.al. | 2501.12385 | null |
2025-01-21 | DiffDoctor: Diagnosing Image Diffusion Models Before Treating | Yiyang Wang et.al. | 2501.12382 | null |
2025-01-21 | Parallel Sequence Modeling via Generalized Spatial Propagation Network | Hongjun Wang et.al. | 2501.12381 | null |
2025-01-22 | Video Depth Anything: Consistent Depth Estimation for Super-Long Videos | Sili Chen et.al. | 2501.12375 | null |
2025-01-21 | Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists | Thomas F. Eisenmann et.al. | 2501.12374 | link |
2025-01-21 | VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models | Chaohao Xie et.al. | 2501.12267 | null |
2025-02-01 | Solving Blind Inverse Problems: Adaptive Diffusion Models for Motion-corrected Sparse-view 4DCT | Antoine De Paepe et.al. | 2501.12249 | null |
2025-01-21 | TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space | Daniel Garibi et.al. | 2501.12224 | null |
2025-01-22 | Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation | Zibo Zhao et.al. | 2501.12202 | link |
2025-01-21 | ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions | Shiyue Zhang et.al. | 2501.12173 | link |
2025-01-22 | CogMorph: Cognitive Morphing Attacks for Text-to-Image Models | Zonglei Jing et.al. | 2501.11815 | null |
2025-01-20 | EfficientVITON: An Efficient Virtual Try-On Model using Optimized Diffusion Process | Mostafa Atef et.al. | 2501.11776 | null |
2025-01-20 | Are generative models fair? A study of racial bias in dermatological image generation | Miguel López-Pérez et.al. | 2501.11752 | null |
2025-01-20 | SILO: Solving Inverse Problems with Latent Operators | Ron Raphaeli et.al. | 2501.11746 | null |
2025-01-20 | Exploring Preference-Guided Diffusion Model for Cross-Domain Recommendation | Xiaodong Li et.al. | 2501.11671 | null |
2025-01-20 | Recurrent Diffusion for Large-Scale Parameter Generation | Kai Wang et.al. | 2501.11587 | link |
2025-01-20 | Graph Defense Diffusion Model | Xin He et.al. | 2501.11568 | null |
2025-01-24 | A Survey on Diffusion Models for Anomaly Detection | Jing Liu et.al. | 2501.11430 | link |
2025-01-20 | GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video | Zhenliang Ni et.al. | 2501.11340 | null |
2025-01-20 | CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation | Zheng Chong et.al. | 2501.11325 | link |
2025-01-20 | Nested Annealed Training Scheme for Generative Adversarial Networks | Chang Wan et.al. | 2501.11318 | null |
2025-01-20 | MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching | Yepeng Liu et.al. | 2501.11299 | null |
2025-01-20 | A New Formulation of Lipschitz Constrained With Functional Gradient Learning for GANs | Chang Wan et.al. | 2501.11236 | link |
2025-01-20 | Successive Interference Cancellation-aided Diffusion Models for Joint Channel Estimation and Data Detection in Low Rank Channel Scenarios | Sagnik Bhattacharya et.al. | 2501.11229 | null |
2025-01-20 | Ditto: Accelerating Diffusion Model via Temporal Value Similarity | Sungbin Kim et.al. | 2501.11211 | null |
2025-01-19 | Quantum Latent Diffusion Models | Francesca De Falco et.al. | 2501.11174 | null |
2025-01-19 | Know “No” Better: A Data-Driven Approach for Enhancing Negation Awareness in CLIP | Junsung Park et.al. | 2501.10913 | null |
2025-01-18 | Diffusion-Based Imitation Learning for Social Pose Generation | Antonio Lech Martin-Ozimek et.al. | 2501.10869 | null |
2025-01-18 | Addressing Multilabel Imbalance with an Efficiency-Focused Approach Using Diffusion Model-Generated Synthetic Samples | Francisco Charte et.al. | 2501.10822 | link |
2025-01-18 | GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation | Yannik Frisch et.al. | 2501.10819 | null |
2025-01-18 | FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation | Jaekwon Im et.al. | 2501.10807 | null |
2025-01-18 | EMO2: End-Effector Guided Audio-Driven Avatar Video Generation | Linrui Tian et.al. | 2501.10687 | null |
2025-01-17 | Generic uniqueness and conjugate points for optimal control problems | Alberto Bressan et.al. | 2501.10572 | null |
2025-01-17 | Diffusion Models in Recommendation Systems: A Survey | Ting-Ruen Wei et.al. | 2501.10548 | link |
2025-01-15 | BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation | Xiaolu Hou et.al. | 2501.10462 | link |
2025-01-17 | DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration | Huiyun Cao et.al. | 2501.10325 | null |
2025-01-20 | DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency | Xiaohui Li et.al. | 2501.10110 | null |
2025-01-17 | Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning | Shengkui Zhao et.al. | 2501.10052 | link |
2025-01-17 | DiffuEraser: A Diffusion Model for Video Inpainting | Xiaowen Li et.al. | 2501.10018 | link |
2025-01-17 | Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks | Junlan Chen et.al. | 2501.10017 | null |
2025-02-02 | RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation | Yuefan Cao et.al. | 2501.09982 | null |
2025-01-17 | Physics-informed DeepCT: Sinogram Wavelet Decomposition Meets Masked Diffusion | Zekun Zhou et.al. | 2501.09935 | link |
2025-01-17 | IE-Bench: Advancing the Measurement of Text-Driven Image Editing for Human Perception Alignment | Shangkun Sun et.al. | 2501.09927 | null |
2025-01-16 | Geometry-Preserving Encoder/Decoder in Latent Generative Models | Wonjun Lee et.al. | 2501.09876 | null |
2025-01-16 | CrossModalityDiffusion: Multi-Modal Novel View Synthesis with Unified Intermediate Representation | Alex Berian et.al. | 2501.09838 | link |
2025-01-16 | EraseBench: Understanding The Ripple Effects of Concept Erasure Techniques | Ibtihel Amara et.al. | 2501.09833 | null |
2025-01-16 | PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery | Shristi Das Biswas et.al. | 2501.09826 | link |
2025-01-16 | Lossy Compression with Pretrained Diffusion Models | Jeremy Vonderfecht et.al. | 2501.09815 | link |
2025-01-16 | VideoWorld: Exploring Knowledge Learning from Unlabeled Videos | Zhongwei Ren et.al. | 2501.09781 | null |
2025-01-16 | SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces | Sumit Chaturvedi et.al. | 2501.09756 | null |
2025-01-16 | Learnings from Scaling Visual Tokenizers for Reconstruction and Generation | Philippe Hansen-Estruch et.al. | 2501.09755 | null |
2025-01-16 | Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps | Nanye Ma et.al. | 2501.09732 | null |
2025-01-20 | Inference-Time Alignment in Diffusion Models with Reward-Guided Generation: Tutorial and Review | Masatoshi Uehara et.al. | 2501.09685 | null |
2025-01-16 | Model Predictive Path Integral Docking of Fully Actuated Surface Vessel | Akash Vijayakumar et.al. | 2501.09668 | null |
2025-02-01 | AdaFV: Rethinking of Visual-Language alignment for VLM acceleration | Jiayi Han et.al. | 2501.09532 | null |
2025-01-16 | AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation | Junjie He et.al. | 2501.09503 | link |
2025-01-16 | Pruning for Sparse Diffusion Models based on Gradient Flow | Ben Wan et.al. | 2501.09464 | null |
2025-01-16 | CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation | Hwan Heo et.al. | 2501.09433 | link |
2025-01-16 | Dynamic Neural Style Transfer for Artistic Image Generation using VGG19 | Kapil Kashyap et.al. | 2501.09420 | null |
2025-01-16 | SVIA: A Street View Image Anonymization Framework for Self-Driving Applications | Dongyu Liu et.al. | 2501.09393 | link |
2025-01-16 | Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse | Guangyuan Liu et.al. | 2501.09391 | null |
2025-01-16 | UVRM: A Scalable 3D Reconstruction Model from Unposed Videos | Shiu-hong Kao et.al. | 2501.09347 | null |
2025-01-16 | Domain-conditioned and Temporal-guided Diffusion Modeling for Accelerated Dynamic MRI Reconstruction | Liping Zhang et.al. | 2501.09305 | null |
2025-01-17 | SEAL: Entangled White-box Watermarks on Low-Rank Adaptation | Giyeong Oh et.al. | 2501.09284 | null |
2025-01-16 | Text Semantics to Flexible Design: A Residential Layout Generation Method Based on Stable Diffusion Model | Zijin Qiu et.al. | 2501.09279 | null |
2025-01-16 | PATCHEDSERVE: A Patch Management Framework for SLO-Optimized Hybrid Resolution Diffusion Serving | Desen Sun et.al. | 2501.09253 | null |
2025-01-15 | Grounding Text-To-Image Diffusion Models For Controlled High-Quality Image Generation | Ahmad Süleyman et.al. | 2501.09194 | null |
2025-01-15 | Generative diffusion model with inverse renormalization group flows | Kanta Masuki et.al. | 2501.09064 | link |
2025-01-15 | SHYI: Action Support for Contrastive Learning in High-Fidelity Text-to-Image Generation | Tianxiang Xia et.al. | 2501.09055 | null |
2025-01-17 | NeurOp-Diff:Continuous Remote Sensing Image Super-Resolution via Neural Operator Diffusion | Zihao Xu et.al. | 2501.09054 | link |
2025-01-15 | Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks | Shuang Cui et.al. | 2501.09052 | null |
2025-01-15 | CookingDiffusion: Cooking Procedural Image Generation with Stable Diffusion | Yuan Wang et.al. | 2501.09042 | null |
2025-01-14 | Do generative video models learn physical principles from watching videos? | Saman Motamed et.al. | 2501.09038 | link |
2025-01-15 | Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion | Jingyuan Chen et.al. | 2501.09019 | null |
2025-01-15 | How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias | Tosin Fadahunsi et.al. | 2501.09014 | link |
2025-01-15 | Multimodal LLMs Can Reason about Aesthetics in Zero-Shot | Ruixiang Jiang et.al. | 2501.09012 | link |
2025-01-15 | SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation | Aditya Bhat et.al. | 2501.09008 | null |
2025-01-15 | RepVideo: Rethinking Cross-Layer Representation for Video Generation | Chenyang Si et.al. | 2501.08994 | null |
2025-01-15 | Enhanced Multi-Scale Cross-Attention for Person Image Generation | Hao Tang et.al. | 2501.08900 | null |
2025-01-23 | Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution | Shao-Hao Lu et.al. | 2501.08819 | link |
2025-01-15 | Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models | Zerui Tao et.al. | 2501.08727 | null |
2025-01-15 | FlexiClip: Locality-Preserving Free-Form Character Animation | Anant Khandelwal et.al. | 2501.08676 | null |
2025-01-15 | TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis | Bailiang Jian et.al. | 2501.08667 | null |
2025-01-15 | Product of Gaussian Mixture Diffusion Model for non-linear MRI Inversion | Laurenz Nagler et.al. | 2501.08662 | null |
2025-01-15 | StereoGen: High-quality Stereo Image Generation from a Single Image | Xianqi Wang et.al. | 2501.08654 | null |
2025-01-15 | Joint Learning of Depth and Appearance for Portrait Image Animation | Xinya Ji et.al. | 2501.08649 | null |
2025-01-15 | Watermarking in Diffusion Model: Gaussian Shading with Exact Diffusion Inversion via Coupled Transformations (EDICT) | Krishna Panthi et.al. | 2501.08604 | null |
2025-01-15 | DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors | Runqi Wang et.al. | 2501.08553 | null |
2025-01-31 | Comprehensive Subjective and Objective Evaluation Method for Text-generated Video | Zelu Qi et.al. | 2501.08545 | null |
2025-01-15 | Complexity Control Facilitates Reasoning-Based Compositional Generalization in Transformers | Zhongwang Zhang et.al. | 2501.08537 | link |
2025-01-15 | Yuan: Yielding Unblemished Aesthetics Through A Unified Network for Visual Imperfections Removal in Generated Images | Zhenyu Yu et.al. | 2501.08505 | link |
2025-01-14 | Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models | Weichen Fan et.al. | 2501.08453 | null |
2025-01-14 | Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies | Ajwad Abrar et.al. | 2501.08441 | link |
2025-01-14 | 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering | Meenakshi Krishnan et.al. | 2501.08370 | null |
2025-01-14 | DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models | Hyeonwoo Kim et.al. | 2501.08333 | null |
2025-01-14 | MangaNinja: Line Art Colorization with Precise Reference Following | Zhiheng Liu et.al. | 2501.08332 | null |
2025-01-23 | Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise | Ryan Burgert et.al. | 2501.08331 | link |
2025-01-14 | GameFactory: Creating New Games with Generative Interactive Videos | Jiwen Yu et.al. | 2501.08325 | null |
2025-01-14 | Diffusion Adversarial Post-Training for One-Step Video Generation | Shanchuan Lin et.al. | 2501.08316 | null |
2025-01-17 | LayerAnimate: Layer-specific Control for Animation | Yuxue Yang et.al. | 2501.08295 | null |
2025-01-14 | Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints | Jonathan Nöther et.al. | 2501.08246 | null |
2025-01-14 | FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors | Yabo Zhang et.al. | 2501.08225 | link |
2025-01-14 | D $^2$ -DPM: Dual Denoising for Quantized Diffusion Probabilistic Models | Qian Zeng et.al. | 2501.08180 | link |
2025-01-14 | Benchmarking Multimodal Models for Fine-Grained Image Analysis: A Comparative Study Across Diverse Visual Features | Evgenii Evstafev et.al. | 2501.08170 | null |
2025-01-14 | Decision Transformers for RIS-Assisted Systems with Diffusion Model-Based Channel Acquisition | Jie Zhang et.al. | 2501.08007 | null |
2025-01-14 | GDiffRetro: Retrosynthesis Prediction with Dual Graph Enhanced Molecular Representation and Diffusion Generation | Shengyin Sun et.al. | 2501.08001 | link |
2025-01-14 | VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models | Hui Kuurila-Zhang et.al. | 2501.07922 | link |
2025-01-14 | Advanced representation learning for flow field analysis and reconstruction | Yikai Wang et.al. | 2501.07835 | null |
2025-02-03 | Symmetry-Aware Generative Modeling through Learned Canonicalization | Kusha Sareen et.al. | 2501.07773 | null |
2025-01-14 | On the Statistical Capacity of Deep Generative Models | Edric Tam et.al. | 2501.07763 | link |
2025-01-13 | Concentration of Measure for Distributions Generated via Diffusion Models | Reza Ghane et.al. | 2501.07741 | null |
2025-01-13 | Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens | Dongwon Kim et.al. | 2501.07730 | null |
2025-01-13 | BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations | Weixi Feng et.al. | 2501.07647 | null |
2025-01-13 | Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss | Xinyu Zhang et.al. | 2501.07563 | null |
2025-01-13 | Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection | Shiman Zhang et.al. | 2501.07533 | link |
2025-01-13 | IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion | Tharun Anand et.al. | 2501.07530 | null |
2025-01-13 | PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations | Ting-Yu Dai et.al. | 2501.07447 | null |
2025-01-13 | Diff-Ensembler: Learning to Ensemble 2D Diffusion Models for Volume-to-Volume Medical Image Translation | Xiyue Zhu et.al. | 2501.07430 | null |
2025-01-13 | OCORD: Open-Campus Object Removal Dataset | Shuo Zhang et.al. | 2501.07397 | null |
2025-01-13 | Bigger Isn’t Always Better: Towards a General Prior for Medical Image Reconstruction | Lukas Glaszner et.al. | 2501.07376 | link |
2025-01-13 | Generating Poisoning Attacks against Ridge Regression Models with Categorical Features | Monse Guedes-Ayala et.al. | 2501.07275 | null |
2025-01-13 | Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion | Li Liang et.al. | 2501.07260 | link |
2025-01-13 | Boosting Text-To-Image Generation via Multilingual Prompting in Large Multimodal Models | Yongyu Mu et.al. | 2501.07086 | link |
2025-01-13 | D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation | Zhejun Zhang et.al. | 2501.07077 | link |
2025-01-13 | Enhancing Image Generation Fidelity via Progressive Prompts | Zhen Xiong et.al. | 2501.07070 | link |
2025-01-13 | Detection of AI Deepfake and Fraud in Online Payments Using GAN-Based Models | Zong Ke et.al. | 2501.07033 | null |
2025-01-13 | Erasing Noise in Signal Detection with Diffusion Model: From Theory to Application | Xiucheng Wang et.al. | 2501.07030 | null |
2025-01-13 | Global Search for Optimal Low Thrust Spacecraft Trajectories using Diffusion Models and the Indirect Method | Jannik Graebner et.al. | 2501.07005 | null |
2025-01-13 | Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps | Henry Li et.al. | 2501.06999 | link |
2025-01-16 | A General Framework for Inference-time Scaling and Steering of Diffusion Models | Raghav Singhal et.al. | 2501.06848 | link |
2025-01-24 | Eliza: A Web3 friendly AI Agent Operating System | Shaw Walters et.al. | 2501.06781 | link |
2025-01-12 | ODPG: Outfitting Diffusion with Pose Guided Condition | Seohyun Lee et.al. | 2501.06769 | null |
2025-01-12 | Generative AI Enabled Robust Sensor Placement in Cyber-Physical Power Systems: A Graph Diffusion Approach | Changyuan Zhao et.al. | 2501.06756 | null |
2025-01-12 | Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models | Michael Toker et.al. | 2501.06751 | null |
2025-01-12 | DRDT3: Diffusion-Refined Decision Test-Time Training Model | Xingshuai Huang et.al. | 2501.06718 | null |
2025-01-11 | Personalized Preference Fine-tuning of Diffusion Models | Meihua Dang et.al. | 2501.06655 | null |
2025-01-11 | Denoising Diffusion Probabilistic Model for Radio Map Estimation in Generative Wireless Networks | Xuanhao Luo et.al. | 2501.06604 | null |
2025-01-11 | Boundary-enhanced time series data imputation with long-term dependency diffusion models | Chunjing Xiao et.al. | 2501.06585 | null |
2025-01-11 | DivTrackee versus DynTracker: Promoting Diversity in Anti-Facial Recognition against Dynamic FR Strategy | Wenshu Fan et.al. | 2501.06533 | null |
2025-01-11 | A Diffusive Data Augmentation Framework for Reconstruction of Complex Network Evolutionary History | En Xu et.al. | 2501.06485 | null |
2025-01-11 | Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation | Xiaoying Xing et.al. | 2501.06481 | null |
2025-01-11 | Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning | Maomao Li et.al. | 2501.06438 | null |
2025-01-10 | MEt3R: Measuring Multi-View Consistency in Generated Images | Mohammad Asim et.al. | 2501.06336 | null |
2025-01-08 | Generative AI for Cel-Animation: A Survey | Yunlong Tang et.al. | 2501.06250 | link |
2025-01-07 | Generating and Detecting Various Types of Fake Image and Audio Content: A Review of Modern Deep Learning Technologies and Tools | Arash Dehghani et.al. | 2501.06227 | null |
2025-01-10 | Multi-subject Open-set Personalization in Video Generation | Tsai-Shien Chen et.al. | 2501.06187 | null |
2025-01-10 | VideoAuteur: Towards Long Narrative Video Generation | Junfei Xiao et.al. | 2501.06173 | null |
2025-01-10 | From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training | Julius Berner et.al. | 2501.06148 | link |
2025-01-10 | Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction | Cecilia Curreli et.al. | 2501.06035 | null |
2025-01-10 | CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control | Stefan Popov et.al. | 2501.06006 | null |
2025-01-10 | Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion | Michal Švento et.al. | 2501.05959 | link |
2025-01-10 | Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation | Minxing Luo et.al. | 2501.05892 | null |
2025-01-10 | Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models | Sofia Jamil et.al. | 2501.05839 | link |
2025-01-10 | Diffusion Models for Smarter UAVs: Decision-Making and Modeling | Yousef Emami et.al. | 2501.05819 | null |
2025-01-10 | Alignment without Over-optimization: Training-Free Solution for Diffusion Models | Sunwoo Kim et.al. | 2501.05803 | link |
2025-01-10 | Conditional Diffusion Model for Electrical Impedance Tomography | Duanpeng Shi et.al. | 2501.05769 | null |
2025-01-10 | Controlling Large Language Models Through Concept Activation Vectors | Hanyu Zhang et.al. | 2501.05764 | null |
2025-01-10 | StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation | Shangjin Zhai et.al. | 2501.05763 | null |
2025-01-10 | UAV Swarm-enabled Collaborative Post-disaster Communications in Low Altitude Economy via a Two-stage Optimization Approach | Xiaoya Zheng et.al. | 2501.05742 | null |
2025-01-10 | EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model | Yi He et.al. | 2501.05710 | null |
2025-01-10 | EXION: Exploiting Inter- and Intra-Iteration Output Sparsity for Diffusion Models | Jaehoon Heo et.al. | 2501.05680 | null |
2025-01-10 | Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration | Zuyuan Zhang et.al. | 2501.05673 | null |
2025-01-10 | Diffusion-Enhanced Optimization of Variational Quantum Eigensolver for General Hamiltonians | Shikun Zhang et.al. | 2501.05666 | null |
2025-01-10 | HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection | Anant Mehta et.al. | 2501.05631 | link |
2025-01-08 | Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion | Yongjia Ma et.al. | 2501.05484 | null |
2025-01-10 | Decentralized Diffusion Models | David McAllister et.al. | 2501.05450 | null |
2025-01-09 | Consistent Flow Distillation for Text-to-3D Generation | Runjie Yan et.al. | 2501.05445 | null |
2025-01-09 | Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces | Aniruddha Mahapatra et.al. | 2501.05442 | null |
2025-01-09 | The GAN is dead; long live the GAN! A Modern GAN Baseline | Yiwen Huang et.al. | 2501.05441 | link |
2025-01-09 | Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation | Xuyi Meng et.al. | 2501.05427 | null |
2025-01-09 | Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation | Darius Petermann et.al. | 2501.05413 | null |
2025-01-09 | TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts | Yu-Hao Huang et.al. | 2501.05403 | link |
2025-01-09 | Accelerated Diffusion Models via Speculative Sampling | Valentin De Bortoli et.al. | 2501.05370 | null |
2025-01-09 | CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models | Junha Park et.al. | 2501.05359 | null |
2025-01-09 | Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal | Wanli Ma et.al. | 2501.05265 | null |
2025-01-13 | Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes | Ludwic Leonard et.al. | 2501.05226 | link |
2025-01-10 | FaceMe: Robust Blind Face Restoration with Personal Identification | Siyu Liu et.al. | 2501.05177 | null |
2025-01-09 | 3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering | Dewei Zhou et.al. | 2501.05131 | null |
2025-01-09 | EquiBoost: An Equivariant Boosting Approach to Molecular Conformation Generation | Yixuan Yang et.al. | 2501.05109 | link |
2025-01-09 | Recovery of activation propagation and self-sustained oscillation abilities in stroke brain networks | Yingpeng Liu et.al. | 2501.05099 | null |
2025-01-10 | ResPanDiff: Diffusion Model for Pansharpening by Inferring Residual Inference | Shiqi Cao et.al. | 2501.05091 | null |
2025-01-13 | D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription | Hounsu Kim et.al. | 2501.05068 | link |
2025-01-09 | On a reaction-diffusion virus model with general boundary conditions in heterogeneous environments | Mingxin Wang et.al. | 2501.04992 | null |
2025-01-09 | FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching | Jun-Hak Yun et.al. | 2501.04926 | link |
2025-01-08 | Geophysical inverse problems with measurement-guided diffusion models | Matteo Ravasi et.al. | 2501.04881 | null |
2025-01-08 | Using Diffusion Models for Reducing Spatiotemporal Errors of Deep Learning Based Urban Microclimate Predictions at Post-Processing Stage | Sepehrdad Tahmasebi et.al. | 2501.04847 | null |
2025-01-08 | TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training | Felix Krause et.al. | 2501.04765 | link |
2025-01-08 | Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration | Laibin Chang et.al. | 2501.04740 | null |
2025-01-08 | EditAR: Unified Conditional Generation with Autoregressive Models | Jiteng Mu et.al. | 2501.04699 | null |
2025-01-08 | ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning | Yuzhou Huang et.al. | 2501.04698 | null |
2025-01-08 | SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images | Zixuan Huang et.al. | 2501.04689 | null |
2025-01-08 | A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI | Kazusato Oko et.al. | 2501.04641 | link |
2025-01-08 | Disentangled Clothed Avatar Generation with Layered Representation | Weitian Zhang et.al. | 2501.04631 | null |
2025-01-09 | MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation | Daniele Molino et.al. | 2501.04614 | null |
2025-01-08 | Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion | Yangfan He et.al. | 2501.04606 | link |
2025-01-08 | ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training | Xinfa Zhu et.al. | 2501.04416 | null |
2025-01-08 | On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis | Yekun Ke et.al. | 2501.04377 | null |
2025-01-08 | Edit as You See: Image-guided Video Editing via Masked Motion Modeling | Zhi-Lin Huang et.al. | 2501.04325 | null |
2025-01-08 | DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models | Hyogon Ryu et.al. | 2501.04304 | link |
2025-01-08 | Circuit Complexity Bounds for Visual Autoregressive Model | Yekun Ke et.al. | 2501.04299 | null |
2025-01-09 | ContextMRI: Enhancing Compressed Sensing MRI through Metadata Conditioning | Hyungjin Chung et.al. | 2501.04284 | link |
2025-01-08 | DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions | Weidong Chen et.al. | 2501.04256 | null |
2025-01-08 | LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition | Bowen Hao et.al. | 2501.04204 | null |
2025-01-21 | HistoryPalette: Supporting Exploration and Reuse of Past Alternatives in Image Generation and Editing | Karim Benharrak et.al. | 2501.04163 | null |
2025-01-07 | NeuralSVG: An Implicit Representation for Text-to-Vector Generation | Sagi Polaczek et.al. | 2501.03992 | null |
2025-01-07 | Stabilising effect of generic anomalous diffusion independent of the Rayleigh number | Antonio Barletta et.al. | 2501.03990 | null |
2025-01-07 | A precise asymptotic analysis of learning diffusion models: theory and insights | Hugo Cui et.al. | 2501.03937 | link |
2025-01-07 | Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers | Yuechen Zhang et.al. | 2501.03931 | link |
2025-01-09 | Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control | Zekai Gu et.al. | 2501.03847 | link |
2025-01-07 | Impact of diffusion mechanisms on persistence and spreading | Nathanaël Boutillon et.al. | 2501.03816 | null |
2025-01-07 | Mixing by Internal Gravity Waves in Stars: Assessing Numerical Simulations Against Theory | Jack Morton et.al. | 2501.03796 | null |
2025-01-07 | Motion-Aware Generative Frame Interpolation | Guozhen Zhang et.al. | 2501.03699 | null |
2025-01-07 | Exploring Molecule Generation Using Latent Space Graph Diffusion | Prashanth Pombala et.al. | 2501.03696 | link |
2025-01-10 | MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer | Junsheng Luan et.al. | 2501.03630 | null |
2025-01-08 | Evaluating Image Caption via Cycle-consistent Text-to-Image Generation | Tianyu Cui et.al. | 2501.03567 | null |
2025-01-07 | PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models | Lingzhi Yuan et.al. | 2501.03544 | null |
2025-01-07 | FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis | Xiaojiao Xiao et.al. | 2501.03526 | link |
2025-01-07 | Textualize Visual Prompt for Image Editing via Diffusion Bridge | Pengcheng Xu et.al. | 2501.03495 | null |
2025-01-07 | SceneBooth: Diffusion-based Framework for Subject-preserved Text-to-Image Generation | Shang Chai et.al. | 2501.03490 | null |
2025-01-06 | A Self-supervised Diffusion Bridge for MRI Reconstruction | Harry Gao et.al. | 2501.03430 | null |
2025-01-06 | License Plate Images Generation with Diffusion Models | Mariia Shpir et.al. | 2501.03374 | null |
2025-01-10 | K-space Diffusion Model Based MR Reconstruction Method for Simultaneous Multislice Imaging | Ting Zhao et.al. | 2501.03293 | null |
2025-01-06 | MObI: Multimodal Object Inpainting Using Diffusion Models | Alexandru Buburuzan et.al. | 2501.03173 | null |
2025-01-06 | Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches | Alhassan Mumuni et.al. | 2501.03151 | null |
2025-01-06 | Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation | Guy Yariv et.al. | 2501.03059 | null |
2025-01-06 | DDRM-PR: Fourier Phase Retrieval using Denoising Diffusion Restoration Models | Mehmet Onurcan Kaya et.al. | 2501.03030 | null |
2025-01-20 | TransPixeler: Advancing Text-to-Video Generation with Transparency | Luozhou Wang et.al. | 2501.03006 | link |
2025-01-06 | STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution | Rui Xie et.al. | 2501.02976 | null |
2025-01-07 | SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | Jiawei Liu et.al. | 2501.02962 | null |
2025-01-06 | Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions | Jianhua Pei et.al. | 2501.02928 | null |
2025-01-06 | Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis | Thang-Anh-Quan Nguyen et.al. | 2501.02913 | null |
2025-01-06 | Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems | Shayan Mohajer Hamidi et.al. | 2501.02880 | null |
2025-01-06 | Towards HRTF Personalization using Denoising Diffusion Models | Juan Camilo Albarracín Sánchez et.al. | 2501.02871 | null |
2025-01-07 | Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans | Rezkellah Noureddine Khiati et.al. | 2501.02867 | null |
2025-01-06 | Synthetic Fungi Datasets: A Time-Aligned Approach | A. Rani et.al. | 2501.02855 | null |
2025-01-06 | InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models | Kai Wang et.al. | 2501.02816 | null |
2025-01-06 | Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising | Yunlong Yuan et.al. | 2501.02741 | null |
2025-01-06 | Artificial Intelligence in Creative Industries: Advances Prior to 2025 | Nantheera Anantrasirichai et.al. | 2501.02725 | null |
2025-01-06 | Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment | Jiaze Li et.al. | 2501.02706 | null |
2025-01-05 | GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking | Weikang Bian et.al. | 2501.02690 | null |
2025-01-05 | From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering | Wen-ran Li et.al. | 2501.02680 | null |
2025-01-05 | DepthMaster: Taming Diffusion Models for Monocular Depth Estimation | Ziyang Song et.al. | 2501.02576 | link |
2025-01-05 | Decoding fMRI Data into Captions using Prefix Language Modeling | Vyacheslav Shen et.al. | 2501.02570 | link |
2025-01-05 | Unified Guidance for Geometry-Conditioned Molecular Generation | Sirine Ayadi et.al. | 2501.02526 | null |
2025-01-05 | Face-MakeUp: Multimodal Facial Prompts for Text-to-Image Generation | Dawei Dai et.al. | 2501.02523 | link |
2025-01-05 | Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors | Minglin Chen et.al. | 2501.02519 | null |
2025-01-15 | ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling | Chaojie Mao et.al. | 2501.02487 | null |
2025-01-05 | DeTrack: In-model Latent Denoising Learning for Visual Object Tracking | Xinyu Zhou et.al. | 2501.02467 | null |
2025-01-05 | MedSegDiffNCA: Diffusion Models With Neural Cellular Automata for Skin Lesion Segmentation | Avni Mittal et.al. | 2501.02447 | null |
2025-01-04 | Generalizable Origin Identification for Text-Guided Image-to-Image Diffusion Models | Wenhao Wang et.al. | 2501.02376 | null |
2025-01-04 | CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models | Kuan-Hung Liu et.al. | 2501.02355 | null |
2025-01-04 | DiffGraph: Heterogeneous Graph Diffusion Model | Zongwei Li et.al. | 2501.02313 | link |
2025-01-04 | TDM: Temporally-Consistent Diffusion Model for All-in-One Real-World Video Restoration | Yizhou Li et.al. | 2501.02269 | null |
2025-01-04 | Unsupervised Class Generation to Expand Semantic Segmentation Datasets | Javier Montalvo et.al. | 2501.02264 | null |
2025-01-09 | MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control | Mengting Wei et.al. | 2501.02260 | link |
2025-01-04 | Diffusion Model-Based Data Synthesis Aided Federated Semi-Supervised Learning | Zhongwei Wang et.al. | 2501.02219 | null |
2025-01-10 | Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey | Zongxia Li et.al. | 2501.02189 | link |
2025-01-04 | Generating Multimodal Images with GAN: Integrating Text, Image, and Style | Chaoyi Tan et.al. | 2501.02167 | null |
2025-01-04 | Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN | Yanxi Chen et.al. | 2501.02146 | null |
2025-01-04 | Establishing baselines for generative discovery of inorganic crystals | Nathan J. Szymanski et.al. | 2501.02144 | link |
2025-01-03 | Generalized Twice Differentiability and Quadratic Bundles in Second-Order Variational Analysis | Pham Duy Khanh et.al. | 2501.02067 | null |
2025-01-03 | ArtCrafter: Text-Image Aligning Style Transfer via Embedding Reframing | Nisha Huang et.al. | 2501.02064 | null |
2025-01-01 | SmartSpatial: Enhancing the 3D Spatial Arrangement Capabilities of Stable Diffusion Models and Introducing a Novel 3D Spatial Evaluation Framework | Mao Xun Huang et.al. | 2501.01998 | null |
2025-01-10 | Gender Bias in Text-to-Video Generation Models: A case study of Sora | Mohammad Nadeem et.al. | 2501.01987 | null |
2025-01-09 | INFELM: In-depth Fairness Evaluation of Large Text-To-Image Models | Di Jin et.al. | 2501.01973 | null |
2025-01-03 | Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models | Manh Duong Nguyen et.al. | 2501.01932 | link |
2025-01-03 | JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing | Qili Wang et.al. | 2501.01798 | link |
2025-01-03 | Ingredients: Blending Custom Photos with Video Diffusion Transformers | Zhengcong Fei et.al. | 2501.01790 | link |
2025-01-03 | Nonparametric estimation of a factorizable density using diffusion models | Hyeok Kyu Kwon et.al. | 2501.01783 | null |
2025-01-03 | Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models | Andrea Matteazzi et.al. | 2501.01761 | null |
2025-01-03 | Controlling your Attributes in Voice | Xuyuan Li et.al. | 2501.01674 | null |
2025-01-03 | ACE: Anti-Editing Concept Erasure in Text-to-Image Models | Zihao Wang et.al. | 2501.01633 | link |
2025-01-03 | Multivariate Time Series Anomaly Detection using DiffGAN Model | Guangqiang Wu et.al. | 2501.01591 | link |
2025-01-02 | Denoising Diffused Embeddings: a Generative Approach for Hypergraphs | Shihao Wu et.al. | 2501.01541 | null |
2024-12-30 | LS-GAN: Human Motion Synthesis with Latent-space GANs | Avinash Amballa et.al. | 2501.01449 | null |
2025-01-07 | VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control | Yuanpeng Tu et.al. | 2501.01427 | null |
2025-01-03 | Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions | Xincheng Shuai et.al. | 2501.01425 | null |
2025-01-02 | Object-level Visual Prompts for Compositional Image Generation | Gaurav Parmar et.al. | 2501.01424 | null |
2025-01-06 | Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models | Jingfeng Yao et.al. | 2501.01423 | link |
2025-01-02 | On Unifying Video Generation and Camera Pose Estimation | Chun-Hao Paul Huang et.al. | 2501.01409 | null |
2025-01-02 | Nested Attention: Semantic-aware Attention Values for Concept Personalization | Or Patashnik et.al. | 2501.01407 | null |
2025-01-02 | ProjectedEx: Enhancing Generation in Explainable AI for Prostate Cancer | Xuyin Qi et.al. | 2501.01392 | link |
2025-01-02 | Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement | Z. Zhang et.al. | 2501.01368 | null |
2025-01-03 | SVFR: A Unified Framework for Generalized Video Face Restoration | Zhiyao Wang et.al. | 2501.01235 | link |
2025-01-03 | Conditional Consistency Guided Image Translation and Enhancement | Amil Bhagat et.al. | 2501.01223 | link |
2025-01-02 | LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge | Kyoungkook Kang et.al. | 2501.01197 | null |
2025-01-02 | TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions | Vriksha Srihari et.al. | 2501.01156 | null |
2025-01-02 | Semantics-Guided Diffusion for Deep Joint Source-Channel Coding in Wireless Image Transmission | Maojun Zhang et.al. | 2501.01138 | link |
2025-01-02 | DuMo: Dual Encoder Modulation Network for Precise Concept Erasure | Feng Han et.al. | 2501.01125 | link |
2025-01-02 | HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment | Zitong Xu et.al. | 2501.01116 | null |
2025-01-21 | EliGen: Entity-Level Controlled Image Generation with Regional Attention | Hong Zhang et.al. | 2501.01097 | link |
2025-01-02 | DiffCL: A Diffusion-Based Contrastive Learning Framework with Semantic Alignment for Multimodal Recommendations | Qiya Song et.al. | 2501.01066 | null |
2025-01-02 | Optimizing Noise Schedules of Generative Models in High Dimensionss | Santiago Aranguri et.al. | 2501.00988 | null |
2025-01-01 | OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes | Sepehr Dehdashtian et.al. | 2501.00962 | null |
2025-01-01 | Enhancing Early Diabetic Retinopathy Detection through Synthetic DR1 Image Generation: A StyleGAN3 Approach | Sagarnil Das et.al. | 2501.00954 | null |
2025-01-01 | Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model | Omid Saghatchian et.al. | 2501.00946 | link |
2025-01-11 | Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion | Hao Wang et.al. | 2501.00944 | null |
2025-01-01 | A Novel Diffusion Model for Pairwise Geoscience Data Generation with Unbalanced Training Dataset | Junhuan Yang et.al. | 2501.00941 | null |
2025-01-01 | Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models | Emily Johnson et.al. | 2501.00917 | null |
2025-01-01 | Diffusion Policies for Generative Modeling of Spacecraft Trajectories | Julia Briden et.al. | 2501.00915 | null |
2025-01-01 | AutoPresent: Designing Structured Visuals from Scratch | Jiaxin Ge et.al. | 2501.00912 | link |
2025-01-01 | Population Aware Diffusion for Time Series Generation | Yang Li et.al. | 2501.00910 | link |
2025-01-01 | Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model | Chenyang Liu et.al. | 2501.00895 | null |
2025-01-01 | Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform | Cheonsu Jeong et.al. | 2501.00750 | null |
2025-01-01 | A Distributional Evaluation of Generative Image Models | Edric Tam et.al. | 2501.00744 | null |
2025-01-01 | RORem: Training a Robust Object Remover with Human-in-the-Loop | Ruibin Li et.al. | 2501.00740 | link |
2024-12-31 | SoundBrush: Sound as a Brush for Visual Scene Editing | Kim Sung-Bin et.al. | 2501.00645 | null |
2024-12-31 | Flash-Split: 2D Reflection Removal with Flash Cues and Latent Diffusion Separation | Tianfu Wang et.al. | 2501.00637 | null |
2024-12-31 | DiC: Rethinking Conv3x3 Designs in Diffusion Models | Yuchuan Tian et.al. | 2501.00603 | link |
2025-01-03 | DreamDrive: Generative 4D Scene Modeling from Street View Images | Jiageng Mao et.al. | 2501.00601 | null |
2024-12-31 | Probing Visual Language Priors in VLMs | Tiange Luo et.al. | 2501.00569 | null |
2024-12-31 | Polynomial time sampling from log-smooth distributions in fixed dimension under semi-log-concavity of the forward diffusion with application to strongly dissipative distributions | Adrien Vacher et.al. | 2501.00565 | null |
2024-12-31 | Score-Based Metropolis-Hastings Algorithms | Ahmed Aloui et.al. | 2501.00467 | null |
2024-12-31 | SAT-LDM: Provably Generalizable Image Watermarking for Latent Diffusion Models with Self-Augmented Training | Lu Zhang et.al. | 2501.00463 | null |
2024-12-31 | Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning | Jianjie Luo et.al. | 2501.00437 | null |
2024-12-31 | S-Diff: An Anisotropic Diffusion Model for Collaborative Filtering in Spectral Domain | Rui Xia et.al. | 2501.00384 | null |
2024-12-31 | Token Pruning for Caching Better: 9 Times Acceleration on Stable Diffusion for Free | Evelyn Zhang et.al. | 2501.00375 | link |
2024-12-31 | diffIRM: A Diffusion-Augmented Invariant Risk Minimization Framework for Spatiotemporal Prediction over Graphs | Zhaobin Mo et.al. | 2501.00305 | null |
2024-12-31 | Dual Diffusion for Unified Image Generation and Understanding | Zijie Li et.al. | 2501.00289 | null |
2024-12-31 | Denoising Data with Measurement Error Using a Reproducing Kernel-based Diffusion Model | Mingyang Yi et.al. | 2501.00212 | null |
2024-12-31 | MLLM-as-a-Judge for Image Safety without Human Labeling | Zhenting Wang et.al. | 2501.00192 | null |
2024-12-30 | PQD: Post-training Quantization for Efficient Diffusion Models | Jiaojiao Ye et.al. | 2501.00124 | null |
2024-12-30 | Text-to-Image GAN with Pretrained Representations | Xiaozhou You et.al. | 2501.00116 | null |
2024-12-30 | LTX-Video: Realtime Video Latent Diffusion | Yoav HaCohen et.al. | 2501.00103 | link |
2024-12-28 | AdvAnchor: Enhancing Diffusion Model Unlearning with Adversarial Anchors | Mengnan Zhao et.al. | 2501.00054 | null |
2025-01-02 | Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation | Yuanbo Yang et.al. | 2412.21117 | null |
2024-12-30 | Quantum Diffusion Model for Quark and Gluon Jet Generation | Mariia Baidachna et.al. | 2412.21082 | link |
2024-12-30 | Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model | Yifei Huang et.al. | 2412.21080 | link |
2025-01-14 | Edicho: Consistent Image Editing in the Wild | Qingyan Bai et.al. | 2412.21079 | link |
2024-12-30 | Varformer: Adapting VAR’s Generative Prior for Image Restoration | Siyang Wang et.al. | 2412.21063 | link |
2024-12-30 | VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation | Jiazheng Xu et.al. | 2412.21059 | link |
2024-12-30 | E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models | Zhiyu Tan et.al. | 2412.21044 | null |
2024-12-30 | Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration | Wanglong Lu et.al. | 2412.21042 | link |
2024-12-30 | AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies | Yibo Wen et.al. | 2412.20984 | null |
2024-12-30 | Influence Maximization in Temporal Networks with Persistent and Reactive Behaviors | Aaqib Zahoor et.al. | 2412.20936 | null |
2024-12-30 | ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation | Ting Zhang et.al. | 2412.20901 | null |
2024-12-30 | DDIM sampling for Generative AIBIM, a faster intelligent structural design framework | Zhili He et.al. | 2412.20899 | null |
2024-12-30 | VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control | Shaojin Wu et.al. | 2412.20800 | link |
2024-12-30 | Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling | Min Zhang et.al. | 2412.20725 | null |
2024-12-30 | M $^3$ oralBench: A MultiModal Moral Benchmark for LVLMs | Bei Yan et.al. | 2412.20718 | link |
2024-12-30 | HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images | Sungik Choi et.al. | 2412.20704 | null |
2024-12-30 | Diffgrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model | Yonghao Zhang et.al. | 2412.20657 | null |
2024-12-30 | Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis | Yousef Yeganeh et.al. | 2412.20651 | null |
2024-12-29 | Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond) | Tomer Garber et.al. | 2412.20596 | link |
2024-12-29 | Testing and Improving the Robustness of Amortized Bayesian Inference for Cognitive Models | Yufei Wu et.al. | 2412.20586 | link |
2024-12-29 | Derivations of Animal Movement Models with Explicit Memory | Tianxu Wang et.al. | 2412.20568 | null |
2024-12-29 | DPBridge: Latent Diffusion Bridge for Dense Prediction | Haorui Ji et.al. | 2412.20506 | null |
2024-12-29 | Single-image reflection removal via self-supervised diffusion models | Zhengyang Lu et.al. | 2412.20466 | null |
2024-12-29 | Image Augmentation Agent for Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2412.20439 | null |
2024-12-29 | Bringing Objects to Life: 4D generation from 3D objects | Ohad Rahamim et.al. | 2412.20422 | null |
2024-12-29 | Diff4MMLiTS: Advanced Multimodal Liver Tumor Segmentation via Diffusion-Based Image Synthesis and Alignment | Shiyun Chen et.al. | 2412.20418 | null |
2025-01-02 | EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers | Daiheng Gao et.al. | 2412.20413 | link |
2024-12-29 | Open-Sora: Democratizing Efficient Video Production for All | Zangwei Zheng et.al. | 2412.20404 | link |
2024-12-29 | FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation | Yan Luo et.al. | 2412.20374 | link |
2024-12-29 | Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition | Xiu-Feng Huang et.al. | 2412.20327 | link |
2024-12-28 | An analytic theory of creativity in convolutional diffusion models | Mason Kamb et.al. | 2412.20292 | null |
2024-12-28 | Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games | Guan-Horng Liu et.al. | 2412.20279 | null |
2024-12-28 | High-Performance Model Predictive Control for Quadcopters with Formal Stability Guarantees | Maedeh Izadi et.al. | 2412.20277 | null |
2024-12-28 | Multi-Modality Driven LoRA for Adverse Condition Depth Estimation | Guanglei Yang et.al. | 2412.20162 | null |
2025-01-13 | SyncDiff: Synchronized Motion Diffusion for Multi-Body Human-Object Interaction Synthesis | Wenkun He et.al. | 2412.20104 | null |
2024-12-28 | Parameter spaces for cross-diffusive-driven instability in a reaction-diffusion system on an annular domain | Gulsemay Yigit et.al. | 2412.20097 | null |
2024-12-28 | MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion | Zechao Zhan et.al. | 2412.20062 | null |
2024-12-28 | Enhancing Diffusion Models for Inverse Problems with Covariance-Aware Posterior Sampling | Shayan Mohajer Hamidi et.al. | 2412.20045 | null |
2024-12-28 | An Ordinary Differential Equation Sampler with Stochastic Start for Diffusion Bridge Models | Yuang Wang et.al. | 2412.19992 | null |
2024-12-28 | MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation | Haoyu Zheng et.al. | 2412.19978 | null |
2024-12-27 | Motion Planning Diffusion: Learning and Adapting Robot Motion Planning with Diffusion Models | J. Carvalho et.al. | 2412.19948 | null |
2024-12-27 | Chemotaxis and Reactions in Anomalous Diffusion Dynamics | Crystianne L. De Andrade et.al. | 2412.19940 | null |
2024-12-27 | Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales | Shuokai Pan et.al. | 2412.19867 | null |
2024-12-25 | Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation | Nadav Z. Cohen et.al. | 2412.19853 | null |
2024-12-24 | A Review of Latent Representation Models in Neuroimaging | C. Vázquez-García et.al. | 2412.19844 | null |
2024-12-22 | RoboSignature: Robust Signature and Watermarking on Network Attacks | Aryaman Shaan et.al. | 2412.19834 | link |
2024-12-27 | Generative Video Propagation | Shaoteng Liu et.al. | 2412.19761 | null |
2024-12-30 | VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models | Tao Wu et.al. | 2412.19645 | null |
2024-12-27 | ReNeg: Learning Negative Embedding with Reward Guidance | Xiaomin Li et.al. | 2412.19637 | link |
2024-12-27 | StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture | Miaomiao Dai et.al. | 2412.19535 | null |
2025-01-06 | P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision | Junjie Hu et.al. | 2412.19533 | null |
2024-12-27 | Is Your Text-to-Image Model Robust to Caption Noise? | Weichen Yu et.al. | 2412.19531 | null |
2024-12-30 | DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT | Xiaotao Hu et.al. | 2412.19505 | link |
2024-12-27 | RobotDiffuse: Motion Planning for Redundant Manipulator based on Diffusion Model | Xiaohan Zhang et.al. | 2412.19500 | link |
2024-12-27 | RAIN: Real-time Animation of Infinite Video Stream | Zhilei Shu et.al. | 2412.19489 | null |
2024-12-30 | DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Yiyuan Liang et.al. | 2412.19458 | link |
2024-12-27 | Focusing Image Generation to Mitigate Spurious Correlations | Xuewei Li et.al. | 2412.19457 | null |
2024-12-27 | Multi-scale Latent Point Consistency Models for 3D Shape Generation | Bi’an Du et.al. | 2412.19413 | null |
2024-12-27 | A Generalized Einstein Relation for Markovian Friction Coefficients from Molecular Trajectories | J. M. Hall et.al. | 2412.19398 | null |
2024-12-26 | 6Diffusion: IPv6 Target Generation Using a Diffusion Model with Global-Local Attention Mechanisms for Internet-wide IPv6 Scanning | Nabo He et.al. | 2412.19243 | null |
2024-12-26 | Mask Approximation Net: Merging Feature Extraction and Distribution Learning for Remote Sensing Change Captioning | Dongwei Sun et.al. | 2412.19179 | null |
2024-12-26 | Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models | Hyesong Choi et.al. | 2412.19104 | null |
2024-12-26 | Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation | Haotian Qian et.al. | 2412.19080 | null |
2024-12-26 | Hierarchical Multi-agent Meta-Reinforcement Learning for Cross-channel Bidding | Shenghong He et.al. | 2412.19064 | null |
2024-12-25 | UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation | Lunhao Duan et.al. | 2412.18928 | null |
2024-12-25 | Accelerating Diffusion Transformers with Dual Feature Caching | Chang Zou et.al. | 2412.18911 | link |
2024-12-25 | EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation | Carl Qi et.al. | 2412.18907 | null |
2024-12-25 | DiFiC: Your Diffusion Model Holds the Secret to Fine-Grained Clustering | Ruohong Yang et.al. | 2412.18838 | null |
2024-12-25 | DebiasDiff: Debiasing Text-to-image Diffusion Models with Self-discovering Latent Attribute Directions | Yilei Jiang et.al. | 2412.18810 | null |
2024-12-25 | DRDM: A Disentangled Representations Diffusion Model for Synthesizing Realistic Person Images | Enbo Huang et.al. | 2412.18797 | null |
2024-12-25 | Protective Perturbations against Unauthorized Data Usage in Diffusion-based Image Generation | Sen Peng et.al. | 2412.18791 | null |
2024-12-25 | Elucidating Flow Matching ODE Dynamics with respect to Data Geometries | Gal Mishne et.al. | 2412.18730 | null |
2024-12-25 | MRI Reconstruction with Regularized 3D Diffusion Model (R3DM) | Arya Bangun et.al. | 2412.18723 | null |
2024-12-24 | Video Is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation | Faraz Waseem et.al. | 2412.18688 | null |
2024-12-24 | 1.58-bit FLUX | Chenglin Yang et.al. | 2412.18653 | null |
2024-12-24 | Dissecting CLIP: Decomposition with a Schur Complement-based Approach | Azim Ospanov et.al. | 2412.18645 | link |
2024-12-29 | PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models | Minghao Chen et.al. | 2412.18608 | null |
2024-12-24 | DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers | Yuntao Chen et.al. | 2412.18607 | null |
2024-12-24 | Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models | Tahira Kazimi et.al. | 2412.18604 | null |
2024-12-24 | ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation | Hongjie Li et.al. | 2412.18600 | null |
2024-12-24 | DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Minghong Cai et.al. | 2412.18597 | link |
2024-12-24 | LatentCRF: Continuous CRF for Efficient Latent Diffusion | Kanchana Ranasinghe et.al. | 2412.18596 | null |
2024-12-24 | Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation | Anselm Krainovic et.al. | 2412.18584 | null |
2024-12-24 | 3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement | Yihang Luo et.al. | 2412.18565 | null |
2024-12-24 | Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models | Qice Qin et.al. | 2412.18421 | null |
2024-12-24 | Discovery of 2D Materials via Symmetry-Constrained Diffusion Model | Shihang Xu et.al. | 2412.18414 | null |
2024-12-24 | Extract Free Dense Misalignment from CLIP | JeongYeon Nam et.al. | 2412.18404 | link |
2024-12-24 | FameBias: Embedding Manipulation Bias Attack in Text-to-Image Models | Jaechul Roh et.al. | 2412.18302 | null |
2024-12-24 | GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications | Zhenzhou Jin et.al. | 2412.18281 | null |
2024-12-24 | Schödinger Bridge Type Diffusion Models as an Extension of Variational Autoencoders | Kentaro Kaba et.al. | 2412.18237 | null |
2024-12-24 | Expand VSR Benchmark for VLLM to Expertize in Spatial Rules | Peijin Xie et.al. | 2412.18224 | link |
2024-12-24 | Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks | Changfu Xu et.al. | 2412.18212 | link |
2024-12-30 | TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization | Yucong Luo et.al. | 2412.18185 | null |
2024-12-24 | Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence | Yinbin Han et.al. | 2412.18164 | null |
2024-12-25 | EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation | Shuhao Han et.al. | 2412.18150 | link |
2024-12-24 | Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction | Xiao Guo et.al. | 2412.18149 | null |
2024-12-24 | Ensuring Consistency for In-Image Translation | Chengpeng Fu et.al. | 2412.18139 | null |
2024-12-24 | AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models | Yiming Wang et.al. | 2412.18123 | null |
2024-12-23 | A physics-engineering-economic model coupling approach for estimating the socio-economic impacts of space weather scenarios | Edward J. Oughton et.al. | 2412.18032 | null |
2024-12-23 | Multi-Agent Path Finding in Continuous Spaces with Projected Diffusion Models | Jinhao Liang et.al. | 2412.17993 | null |
2024-12-23 | Causal Composition Diffusion Model for Closed-loop Traffic Generation | Haohong Lin et.al. | 2412.17920 | null |
2024-12-18 | LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency | Achintha Wijesinghe et.al. | 2412.17839 | null |
2024-12-23 | FaceLift: Single Image to 3D Head with View Generation and GS-LRM | Weijie Lyu et.al. | 2412.17812 | null |
2024-12-23 | Large Motion Video Autoencoding with Cross-modal Video VAE | Yazhou Xing et.al. | 2412.17805 | null |
2025-01-01 | PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion | Sophia Tang et.al. | 2412.17780 | null |
2024-12-23 | The Superposition of Diffusion Models Using the Itô Density Estimator | Marta Skreta et.al. | 2412.17762 | null |
2024-12-23 | VidTwin: Video VAE with Decoupled Structure and Dynamics | Yuchi Wang et.al. | 2412.17726 | link |
2024-12-23 | A Bias-Free Training Paradigm for More General AI-generated Image Detection | Fabrizio Guillaro et.al. | 2412.17671 | null |
2024-12-23 | Benchmarking Generative AI Models for Deep Learning Test Input Generation | Maryam et.al. | 2412.17652 | link |
2024-12-25 | DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder | Ente Lin et.al. | 2412.17644 | null |
2024-12-23 | ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance | Renyang Liu et.al. | 2412.17632 | link |
2024-12-23 | Editing Implicit and Explicit Representations of Radiance Fields: A Survey | Arthur Hubert et.al. | 2412.17628 | null |
2024-12-23 | Personalized Large Vision-Language Models | Chau Pham et.al. | 2412.17610 | null |
2024-12-23 | Retention Score: Quantifying Jailbreak Risks for Vision Language Models | Zaitang Li et.al. | 2412.17544 | null |
2025-01-05 | DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak | Hao Wang et.al. | 2412.17522 | null |
2024-12-23 | Heterogeneous carrying capacities and global extinction in metapopulations | Jakub Hesoun et.al. | 2412.17461 | null |
2024-12-23 | AeroDiT: Diffusion Transformers for Reynolds-Averaged Navier-Stokes Simulations of Airfoil Flows | Hui Xiang et.al. | 2412.17394 | null |
2024-12-24 | Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement | Hyeonjin Kim et.al. | 2412.17387 | link |
2024-12-23 | FFA Sora, video generation as fundus fluorescein angiography simulator | Xinyuan Wu et.al. | 2412.17346 | null |
2024-12-23 | Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition | Jaeheun Jung et.al. | 2412.17333 | null |
2024-12-27 | Free-viewpoint Human Animation with Pose-correlated Reference Selection | Fa-Ting Hong et.al. | 2412.17290 | null |
2024-12-23 | Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory | Xingyao Li et.al. | 2412.17254 | null |
2024-12-23 | OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving | Tianyi Yan et.al. | 2412.17226 | null |
2024-12-23 | CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder | Lichen Ma et.al. | 2412.17225 | null |
2024-12-25 | Discriminative Image Generation with Diffusion Models for Zero-Shot Learning | Dingjie Fu et.al. | 2412.17219 | null |
2024-12-22 | Generative Diffusion Modeling: A Practical Handbook | Zihan Ding et.al. | 2412.17162 | null |
2024-12-24 | Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching | Enshu Liu et.al. | 2412.17153 | link |
2024-12-22 | Similarity Trajectories: Linking Sampling Process to Artifacts in Diffusion-Generated Images | Dennis Menn et.al. | 2412.17109 | null |
2024-12-22 | DreamOmni: Unified Image Generation and Editing | Bin Xia et.al. | 2412.17098 | null |
2024-12-22 | SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults | Jinzhi Wang et.al. | 2412.17077 | null |
2024-12-22 | Modular Conversational Agents for Surveys and Interviews | Jiangbo Yu et.al. | 2412.17049 | null |
2025-01-08 | Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation | Luoxu Jin et.al. | 2412.17042 | null |
2024-12-22 | HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories | Eric Hedlin et.al. | 2412.17040 | null |
2024-12-22 | A Conditional Diffusion Model for Electrical Impedance Tomography Image Reconstruction | Shuaikai Shi et.al. | 2412.16979 | link |
2024-12-22 | PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask | Jeongho Kim et.al. | 2412.16978 | link |
2024-12-22 | Learning an Adaptive Fall Recovery Controller for Quadrupeds on Complex Terrains | Yidan Lu et.al. | 2412.16924 | null |
2024-12-22 | FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation | Tianyun Zhong et.al. | 2412.16915 | null |
2024-12-22 | Map Imagination Like Blind Humans: Group Diffusion Model for Robotic Map Generation | Qijin Song et.al. | 2412.16908 | null |
2024-12-22 | Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation | Quan Dao et.al. | 2412.16906 | null |
2024-12-22 | Diffusion-Based Approaches in Medical Image Generation and Analysis | Abdullah al Nomaan Nafi et.al. | 2412.16860 | null |
2024-12-22 | Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation | Jongmin Yu et.al. | 2412.16859 | null |
2024-12-26 | Sim911: Towards Effective and Equitable 9-1-1 Dispatcher Training with an LLM-Enabled Simulation | Zirong Chen et.al. | 2412.16844 | null |
2024-12-24 | Human-Guided Image Generation for Expanding Small-Scale Training Image Datasets | Changjian Chen et.al. | 2412.16839 | link |
2024-12-22 | RealisID: Scale-Robust and Fine-Controllable Identity Customization via Local and Global Complementation | Zhaoyang Sun et.al. | 2412.16832 | null |
2024-12-22 | Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers | Haoran You et.al. | 2412.16822 | null |
2024-12-21 | RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing | Zhipeng Huang et.al. | 2412.16778 | null |
2024-12-21 | GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space | Souhaib Attaiki et.al. | 2412.16717 | null |
2024-12-21 | TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models | Haocheng Huang et.al. | 2412.16700 | null |
2024-12-21 | VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation | Chi Zhang et.al. | 2412.16677 | null |
2024-12-21 | Optoelectronic generative adversarial networks | Jumin Qiu et.al. | 2412.16672 | link |
2024-12-24 | Adversarial Attack Against Images Classification based on Generative Adversarial Networks | Yahe Yang et.al. | 2412.16662 | null |
2024-12-21 | A Generalizable 3D Diffusion Framework for Low-Dose and Few-View Cardiac SPECT | Huidong Xie et.al. | 2412.16573 | null |
2024-12-21 | Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution | Jiarui Yang et.al. | 2412.16552 | link |
2024-12-21 | TrojFlow: Flow Models are Natural Targets for Trojan Attacks | Zhengyang Qi et.al. | 2412.16512 | null |
2024-12-25 | Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance | Beiyuan Zhang et.al. | 2412.16495 | null |
2024-12-20 | When Worse is Better: Navigating the compression-generation tradeoff in visual tokenization | Vivek Ramanujan et.al. | 2412.16326 | null |
2024-12-20 | Mapping the Mind of an Instruction-based Image Editing using SMILE | Zeinab Dehghani et.al. | 2412.16277 | link |
2024-12-20 | MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design | Jingyuan Qi et.al. | 2412.16270 | null |
2024-12-20 | PromptLA: Towards Integrity Verification of Black-box Text-to-Image Diffusion Models | Zhuomeng Zhang et.al. | 2412.16257 | null |
2024-12-20 | Interactive Scene Authoring with Specialized Generative Primitives | Clément Jambon et.al. | 2412.16253 | null |
2024-12-18 | GALOT: Generative Active Learning via Optimizable Zero-shot Text-to-image Generation | Hanbin Hong et.al. | 2412.16227 | null |
2024-12-18 | ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping | Youxin Pang et.al. | 2412.16212 | null |
2024-12-17 | Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation | Yiping Wang et.al. | 2412.16211 | null |
2024-12-13 | A Decade of Deep Learning: A Survey on The Magnificent Seven | Dilshod Azizov et.al. | 2412.16188 | null |
2024-12-20 | Personalized Representation from Personalized Generation | Shobhita Sundaram et.al. | 2412.16156 | link |
2024-12-20 | NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems | Laura Weihl et.al. | 2412.16141 | null |
2024-12-20 | Predicting human cooperation: sensitizing drift-diffusion model to interaction and external stimuli | Lucila G. Alvarez-Zuzek et.al. | 2412.16121 | null |
2024-12-20 | CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up | Songhua Liu et.al. | 2412.16112 | link |
2024-12-20 | Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation | Timur Sattarov et.al. | 2412.16083 | null |
2025-01-08 | Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy | Shaoyan Pan et.al. | 2412.16050 | null |
2024-12-20 | SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation | Jiadong Pan et.al. | 2412.16039 | null |
2024-12-20 | Semi-Supervised Adaptation of Diffusion Models for Handwritten Text Generation | Kai Brandenbusch et.al. | 2412.15853 | null |
2024-12-20 | Electromagnetic particle-in-cell modeling of an electron cyclotron resonance plasma discharge in hydrogen | D. Eremin et.al. | 2412.15802 | null |
2024-12-20 | Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance | Hyunsoo Lee et.al. | 2412.15798 | null |
2024-12-20 | DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization | Zihan Ding et.al. | 2412.15689 | null |
2024-12-20 | PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium | Xinzhe Li et.al. | 2412.15674 | link |
2024-12-20 | Learning Group Interactions and Semantic Intentions for Multi-Object Trajectory Prediction | Mengshi Qi et.al. | 2412.15673 | link |
2024-12-30 | BS-LDM: Effective Bone Suppression in High-Resolution Chest X-Ray Images with Conditional Latent Diffusion Models | Yifei Sun et.al. | 2412.15670 | link |
2024-12-20 | SCENIC: Scene-aware Semantic Navigation with Instruction-guided Control | Xiaohan Zhang et.al. | 2412.15664 | null |
2024-12-23 | CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training | Xiuli Bi et.al. | 2412.15646 | link |
2024-12-20 | Score-based Generative Diffusion Models for Social Recommendations | Chengyi Liu et.al. | 2412.15579 | link |
2024-12-20 | DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation | Yichun Tai et.al. | 2412.15570 | link |
2024-12-20 | ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model | Qi Zang et.al. | 2412.15541 | link |
2024-12-20 | Stylish and Functional: Guided Interpolation Subject to Physical Constraints | Yan-Ying Chen et.al. | 2412.15507 | null |
2024-12-20 | GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators | Hengjia Li et.al. | 2412.15491 | null |
2024-12-19 | Spatiotemporally Coherent Probabilistic Generation of Weather from Climate | Jonathan Schmidt et.al. | 2412.15361 | link |
2024-12-19 | Dataset Augmentation by Mixing Visual Concepts | Abdullah Al Rahat et.al. | 2412.15358 | null |
2024-12-19 | Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models | Reza Shirkavand et.al. | 2412.15341 | link |
2025-01-02 | Next Patch Prediction for Autoregressive Visual Generation | Yatian Pang et.al. | 2412.15321 | link |
2024-12-19 | LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Hanlin Wang et.al. | 2412.15214 | link |
2024-12-19 | Flowing from Words to Pixels: A Framework for Cross-Modality Evolution | Qihao Liu et.al. | 2412.15213 | null |
2024-12-19 | Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation | Hadi Alzayer et.al. | 2412.15211 | null |
2024-12-19 | FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching | Sucheng Ren et.al. | 2412.15205 | link |
2024-12-19 | AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Moayed Haji-Ali et.al. | 2412.15191 | null |
2024-12-26 | LMFusion: Adapting Pretrained Language Models for Multimodal Generation | Weijia Shi et.al. | 2412.15188 | null |
2024-12-19 | Tiled Diffusion | Or Madar et.al. | 2412.15185 | null |
2024-12-19 | OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization | Jiacheng Zhang et.al. | 2412.15159 | null |
2024-12-19 | Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM | Yatai Ji et.al. | 2412.15156 | link |
2024-12-19 | Jet: A Modern Transformer-Based Normalizing Flow | Alexander Kolesnikov et.al. | 2412.15129 | null |
2024-12-19 | Parallelized Autoregressive Visual Generation | Yuqing Wang et.al. | 2412.15119 | null |
2024-12-26 | Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion | Zhifei Chen et.al. | 2412.15050 | null |
2024-12-19 | DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space | Mang Ning et.al. | 2412.15032 | link |
2025-01-02 | Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls | Riccardo Fosco Gramaccioni et.al. | 2412.15023 | null |
2024-12-19 | MagicNaming: Consistent Identity Generation by Finding a “Name Space” in T2I Diffusion Models | Jing Zhao et.al. | 2412.14902 | null |
2024-12-19 | Diffusion priors for Bayesian 3D reconstruction from incomplete measurements | Julian L. Möbius et.al. | 2412.14897 | null |
2024-12-19 | Generative CKM Construction using Partially Observed Data with Diffusion Model | Shen Fu et.al. | 2412.14812 | null |
2024-12-19 | Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Yucheng Hu et.al. | 2412.14803 | null |
2024-12-19 | A General Control Method for Human-Robot Integration | Maddalena Feder et.al. | 2412.14762 | null |
2024-12-19 | EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space | Jianrong Zhang et.al. | 2412.14706 | null |
2024-12-19 | Event-assisted 12-stop HDR Imaging of Dynamic Scene | Shi Guo et.al. | 2412.14705 | null |
2024-12-19 | Length Controlled Generation for Black-box LLMs | Yuxuan Gu et.al. | 2412.14656 | null |
2024-12-19 | Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model | Minglong Xue et.al. | 2412.14630 | link |
2024-12-19 | Qua $^2$ SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models | Keith G. Mills et.al. | 2412.14628 | null |
2024-12-19 | LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining | Huawen Shen et.al. | 2412.14596 | null |
2024-12-19 | DiffSim: Taming Diffusion Models for Evaluating Visual Similarity | Yiren Song et.al. | 2412.14580 | link |
2024-12-19 | Downscaling Precipitation with Bias-informed Conditional Diffusion Model | Ran Lyu et.al. | 2412.14539 | link |
2024-12-19 | Consistent Human Image and Video Generation with Spatially Conditioned Diffusion | Mingdeng Cao et.al. | 2412.14531 | link |
2024-12-19 | Guided Diffusion Model for Sensor Data Obfuscation | Xin Yang et.al. | 2412.14499 | null |
2024-12-19 | Content-style disentangled representation for controllable artistic image stylization and generation | Ma Zhuoqi et.al. | 2412.14496 | null |
2024-12-19 | Drive-1-to-3: Enriching Diffusion Priors for Novel View Synthesis of Real Vehicles | Chuang Lin et.al. | 2412.14494 | null |
2024-12-19 | DirectorLLM for Human-Centric Video Generation | Kunpeng Song et.al. | 2412.14484 | null |
2024-12-19 | DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On | Wengyi Zhan et.al. | 2412.14465 | null |
2024-12-19 | LiftRefine: Progressively Refined View Synthesis from 3D Lifting with Volume-Triplane Representations | Tung Do et.al. | 2412.14464 | null |
2024-12-19 | LEDiff: Latent Exposure Diffusion for HDR Generation | Chao Wang et.al. | 2412.14456 | null |
2024-12-19 | Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation | Shengqi Liu et.al. | 2412.14453 | null |
2024-12-19 | Diffusion and Discrete Temporal Models of the Growth of Free-Ranging Cats in Urban Areas | Rodrigo Perusquía Cortés et.al. | 2412.14445 | null |
2024-12-19 | IntroStyle: Training-Free Introspective Style Attribution using Diffusion Features | Anand Kumar et.al. | 2412.14432 | null |
2024-12-19 | Enhancing Diffusion Models for High-Quality Image Generation | Jaineet Shah et.al. | 2412.14422 | null |
2024-12-19 | Comparing noisy neural population dynamics using optimal transport distances | Amin Nejatbakhsh et.al. | 2412.14421 | null |
2024-12-19 | Cutting Sequence Diffuser: Sim-to-Real Transferable Planning for Object Shaping by Grinding | Takumi Hachimine et.al. | 2412.14417 | null |
2024-12-18 | Surrealistic-like Image Generation with Vision-Language Models | Elif Ayten et.al. | 2412.14366 | link |
2024-12-18 | Personalized Generative Low-light Image Denoising and Enhancement | Xijun Wang et.al. | 2412.14327 | null |
2024-12-18 | PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation | Liyao Jiang et.al. | 2412.14283 | link |
2024-12-18 | GraphicsDreamer: Image to 3D Generation with Physical Consistency | Pei Chen et.al. | 2412.14214 | null |
2024-12-18 | AniDoc: Animation Creation Made Easier | Yihao Meng et.al. | 2412.14173 | null |
2024-12-19 | E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling | Zhihang Yuan et.al. | 2412.14170 | null |
2024-12-18 | Autoregressive Video Generation without Vector Quantization | Haoge Deng et.al. | 2412.14169 | link |
2024-12-19 | FashionComposer: Compositional Fashion Image Generation | Sihui Ji et.al. | 2412.14168 | null |
2024-12-18 | VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | Runtao Liu et.al. | 2412.14167 | null |
2024-12-29 | AKiRa: Augmentation Kit on Rays for optical video generation | Xi Wang et.al. | 2412.14158 | null |
2024-12-18 | MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation | Shenhao Zhu et.al. | 2412.14148 | null |
2024-12-18 | SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation | Tong Chen et.al. | 2412.14018 | null |
2024-12-18 | What makes a good metric? Evaluating automatic metrics for text-to-image consistency | Candace Ross et.al. | 2412.13989 | null |
2024-12-25 | Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates | Sen Yan et.al. | 2412.13966 | null |
2024-12-18 | Anomalous Diffusion of Superparamagnetic Walkers with Tailored Statistics | Alessia Gentili et.al. | 2412.13960 | null |
2024-12-18 | Generation of Large District Heating System Models Using Open-Source Data and Tools: An Exemplary Workflow | Jan Stock et.al. | 2412.13950 | null |
2024-12-18 | IDEQ: an improved diffusion model for the TSP | Mickael Basson et.al. | 2412.13858 | null |
2024-12-18 | Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval | Giacomo Pacini et.al. | 2412.13834 | null |
2024-12-18 | Object Style Diffusion for Generalized Object Detection in Urban Scene | Hao Li et.al. | 2412.13815 | null |
2024-12-18 | Text2Relight: Creative Portrait Relighting with Text Guidance | Junuk Cha et.al. | 2412.13734 | null |
2024-12-18 | Diffusion models and stochastic quantisation in lattice field theory | Gert Aarts et.al. | 2412.13704 | null |
2024-12-18 | MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing | Chuang Yang et.al. | 2412.13684 | null |
2024-12-18 | VIIS: Visible and Infrared Information Synthesis for Severe Low-light Image Enhancement | Chen Zhao et.al. | 2412.13655 | link |
2024-12-18 | Self-control: A Better Conditional Mechanism for Masked Autoregressive Model | Qiaoying Qu et.al. | 2412.13635 | null |
2024-12-18 | TAUDiff: Improving statistical downscaling for extreme weather events using generative diffusion models | Rahul Sundar et.al. | 2412.13627 | null |
2024-12-18 | SemiDFL: A Semi-Supervised Paradigm for Decentralized Federated Learning | Xinyang Liu et.al. | 2412.13589 | link |
2024-12-18 | Urban Air Temperature Prediction using Conditional Diffusion Models | Siyang Dai et.al. | 2412.13504 | null |
2024-12-18 | VaeDiff-DocRE: End-to-end Data Augmentation Framework for Document-level Relation Extraction | Khai Phan Tran et.al. | 2412.13503 | link |
2024-12-18 | Real-time One-Step Diffusion-based Expressive Portrait Videos Generation | Hanzhong Guo et.al. | 2412.13479 | link |
2024-12-18 | SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation | Kazuki Shimada et.al. | 2412.13462 | null |
2024-12-22 | Zero-Shot Low Light Image Enhancement with Diffusion Prior | Joshua Cho et.al. | 2412.13401 | link |
2024-12-18 | Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion | Massimiliano Viola et.al. | 2412.13389 | null |
2024-12-19 | Posterior Mean Matching: Generative Modeling through Online Bayesian Inference | Sebastian Salazar et.al. | 2412.13286 | null |
2024-12-17 | CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices | Andrei Znobishchev et.al. | 2412.13273 | null |
2024-12-17 | Optimized two-stage AI-based Neural Decoding for Enhanced Visual Stimulus Reconstruction from fMRI Data | Lorenzo Veronese et.al. | 2412.13237 | null |
2024-12-17 | CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models | Gaoyang Zhang et.al. | 2412.13195 | link |
2024-12-23 | MotionBridge: Dynamic Video Inbetweening with Flexible Controls | Maham Tanveer et.al. | 2412.13190 | null |
2024-12-17 | StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models | Yunzhi Yan et.al. | 2412.13188 | null |
2024-12-17 | Move-in-2D: 2D-Conditioned Human Motion Generation | Hsin-Ping Huang et.al. | 2412.13185 | null |
2024-12-17 | F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration | Lu Liu et.al. | 2412.13155 | null |
2024-12-17 | Prompt Augmentation for Self-supervised Text-guided Image Manipulation | Rumeysa Bodur et.al. | 2412.13081 | null |
2024-12-17 | VidTok: A Versatile and Open-Source Video Tokenizer | Anni Tang et.al. | 2412.13061 | link |
2024-12-17 | 3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation | Haoshen Wang et.al. | 2412.13059 | null |
2024-12-17 | Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression | Ruijie Chen et.al. | 2412.12982 | null |
2024-12-19 | Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance | Wenhao Sun et.al. | 2412.12974 | link |
2024-12-17 | ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecasting | Guillaume Couairon et.al. | 2412.12971 | link |
2024-12-17 | Generation of cosmic ray trajectories by a Diffusion Model trained on test particles in 3D magnetohydrodynamic turbulence | Johannes Martin et.al. | 2412.12923 | null |
2024-12-17 | Unsupervised Region-Based Image Editing of Denoising Diffusion Models | Zixiang Li et.al. | 2412.12912 | null |
2024-12-18 | ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction | Zhongjie Duan et.al. | 2412.12888 | link |
2024-12-17 | Rethinking Diffusion-Based Image Generators for Fundus Fluorescein Angiography Synthesis on Limited Data | Chengzhou Yu et.al. | 2412.12778 | null |
2024-12-17 | A Framework for Critical Evaluation of Text-to-Image Models: Integrating Art Historical Analysis, Artistic Exploration, and Critical Prompt Engineering | Amalia Foka et.al. | 2412.12774 | null |
2024-12-17 | Guided and Variance-Corrected Fusion with One-shot Style Alignment for Large-Content Image Generation | Shoukun Sun et.al. | 2412.12771 | link |
2024-12-17 | Towards a Training Free Approach for 3D Scene Editing | Vivek Madhavaram et.al. | 2412.12766 | null |
2024-12-17 | RDPI: A Refine Diffusion Probability Generation Method for Spatiotemporal Data Imputation | Zijin Liu et.al. | 2412.12642 | null |
2024-12-17 | A Simple and Efficient Baseline for Zero-Shot Generative Classification | Zipeng Qi et.al. | 2412.12594 | null |
2024-12-17 | Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration | Xinlong Cheng et.al. | 2412.12550 | null |
2024-12-17 | Stiefel Flow Matching for Moment-Constrained Structure Elucidation | Austin Cheng et.al. | 2412.12540 | null |
2024-12-17 | Addressing Small and Imbalanced Medical Image Datasets Using Generative Models: A Comparative Study of DDPM and PGGANs with Random and Greedy K Sampling | Iman Khazrak et.al. | 2412.12532 | link |
2024-12-17 | Pattern Analogies: Learning to Perform Programmatic Image Edits by Analogy | Aditya Ganeshan et.al. | 2412.12463 | null |
2024-12-17 | Numerical Pruning for Efficient Autoregressive Models | Xuan Shen et.al. | 2412.12441 | null |
2024-12-16 | DeepSN: A Sheaf Neural Framework for Influence Maximization | Asela Hevapathige et.al. | 2412.12416 | null |
2024-12-16 | Efficient Scaling of Diffusion Transformers for Text-to-Image Generation | Hao Li et.al. | 2412.12391 | null |
2024-12-16 | OmniPrism: Learning Disentangled Visual Concept for Image Generation | Yangyang Li et.al. | 2412.12242 | null |
2024-12-16 | You Only Submit One Image to Find the Most Suitable Generative Model | Zhi Zhou et.al. | 2412.12232 | null |
2024-12-16 | Can video generation replace cinematographers? Research on the cinematic language of generated video | Xiaozhe Li et.al. | 2412.12223 | null |
2024-12-15 | Finding a Wolf in Sheep’s Clothing: Combating Adversarial Text-To-Image Prompts with Text Summarization | Portia Cooper et.al. | 2412.12212 | null |
2024-12-15 | Provably Secure Robust Image Steganography via Cross-Modal Error Correction | Yuang Qi et.al. | 2412.12206 | null |
2024-12-11 | Multimodal Approaches to Fair Image Classification: An Ethical Perspective | Javon Hickmon et.al. | 2412.12165 | null |
2024-12-10 | Generative Modeling and Data Augmentation for Power System Production Simulation | Linna Xu et.al. | 2412.12146 | link |
2024-12-17 | Causal Diffusion Transformers for Generative Modeling | Chaorui Deng et.al. | 2412.12095 | link |
2024-12-16 | CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models | Felix Taubner et.al. | 2412.12093 | null |
2024-12-16 | Wonderland: Navigating 3D Scenes from a Single Image | Hanwen Liang et.al. | 2412.12091 | null |
2024-12-16 | A LoRA is Worth a Thousand Pictures | Chenxi Liu et.al. | 2412.12048 | null |
2024-12-16 | The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation | Gilles Mordant et.al. | 2412.12007 | null |
2024-12-16 | Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data | Onur Tasar et.al. | 2412.11972 | null |
2024-12-16 | ColorFlow: Retrieval-Augmented Image Sequence Colorization | Junhao Zhuang et.al. | 2412.11815 | null |
2024-12-16 | InterDyn: Controllable Interactive Dynamics with Video Diffusion Models | Rick Akkerman et.al. | 2412.11785 | null |
2024-12-19 | Joint Reconstruction of the Activity and the Attenuation in PET by Diffusion Posterior Sampling: a Feasibility Study | Clémentine Phung-Ngoc et.al. | 2412.11776 | null |
2024-12-17 | No More Adam: Learning Rate Scaling at Initialization is All You Need | Minghao Xu et.al. | 2412.11768 | link |
2024-12-16 | Generative Inbetweening through Frame-wise Conditions-Driven Video Generation | Tianyi Zhu et.al. | 2412.11755 | link |
2024-12-18 | Conditional Diffusion Models Based Conditional Independence Testing | Yanfeng Yang et.al. | 2412.11744 | link |
2024-12-16 | Re-Attentional Controllable Video Diffusion Editing | Yuanzhi Wang et.al. | 2412.11710 | link |
2024-12-16 | AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration | Wenhao Sun et.al. | 2412.11706 | null |
2024-12-16 | IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation | Yiren Song et.al. | 2412.11638 | null |
2024-12-16 | VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting | Muhammet Furkan Ilaslan et.al. | 2412.11621 | link |
2024-12-16 | 3D $^2$ -Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling | Zichen Tang et.al. | 2412.11599 | link |
2024-12-17 | VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis | Zhipeng Chen et.al. | 2412.11594 | link |
2024-12-19 | StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors | Xiaokun Sun et.al. | 2412.11586 | link |
2024-12-16 | MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models | Weilun Feng et.al. | 2412.11549 | link |
2024-12-16 | EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting | Dong In Lee et.al. | 2412.11520 | null |
2024-12-16 | LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model | Xi Wang et.al. | 2412.11519 | null |
2024-12-16 | IGR: Improving Diffusion Model for Garment Restoration from Person Image | Le Shen et.al. | 2412.11513 | null |
2024-12-16 | FedCAR: Cross-client Adaptive Re-weighting for Generative Models in Federated Learning | Minjun Kim et.al. | 2412.11463 | link |
2024-12-16 | MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes | Ruijie Lu et.al. | 2412.11457 | null |
2024-12-16 | UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models | Yuning Han et.al. | 2412.11441 | null |
2024-12-16 | Bayesian Flow Is All You Need to Sample Out-of-Distribution Chemical Spaces | Nianze Tao et.al. | 2412.11439 | link |
2024-12-16 | View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection | Qi Zhang et.al. | 2412.11428 | link |
2024-12-16 | Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models | Namhyuk Ahn et.al. | 2412.11423 | null |
2024-12-16 | Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion | Adam Bethell et.al. | 2412.11420 | null |
2024-12-16 | Quantization of Climate Change Impacts on Renewable Energy Generation Capacity: A Super-Resolution Recurrent Diffusion Model | Xiaochong Dong et.al. | 2412.11399 | null |
2024-12-15 | Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models | Xiaochen Zhu et.al. | 2412.11333 | null |
2024-12-15 | Sonicmesh: Enhancing 3D Human Mesh Reconstruction in Vision-Impaired Environments With Acoustic Signals | Xiaoxuan Liang et.al. | 2412.11325 | null |
2024-12-15 | Grassmannian Geometry Meets Dynamic Mode Decomposition in DMD-GEN: A New Metric for Mode Collapse in Time Series Generative Models | Amime Mohamed Aboussalah et.al. | 2412.11292 | null |
2024-12-15 | VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping | Hao Shao et.al. | 2412.11279 | null |
2024-12-15 | Wasserstein Bounds for generative diffusion models with Gaussian tail targets | Xixian Wang et.al. | 2412.11251 | null |
2024-12-15 | GenLit: Reformulating Single-Image Relighting as Video Generation | Shrisha Bharadwaj et.al. | 2412.11224 | null |
2024-12-15 | OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation | Bohan Li et.al. | 2412.11183 | null |
2024-12-15 | Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing | Jiancheng Huang et.al. | 2412.11152 | null |
2024-12-15 | Plug-and-Play Priors as a Score-Based Method | Chicago Y. Park et.al. | 2412.11108 | link |
2024-12-15 | DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes | Jinxiu Liu et.al. | 2412.11100 | null |
2024-12-15 | EquiFlow: Equivariant Conditional Flow Matching with Optimal Transport for 3D Molecular Conformation Prediction | Qingwen Tian et.al. | 2412.11082 | null |
2024-12-15 | SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models | Zhaoyang Sun et.al. | 2412.11058 | link |
2024-12-15 | Understanding and Mitigating Memorization in Diffusion Models for Tabular Data | Zhengyu Fang et.al. | 2412.11044 | null |
2024-12-20 | SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer | Hao Chen et.al. | 2412.10958 | link |
2024-12-14 | Generative Modeling with Diffusion | Justin Le et.al. | 2412.10948 | link |
2024-12-26 | Progressive Compression with Universally Quantized Diffusion Models | Yibo Yang et.al. | 2412.10935 | null |
2024-12-17 | Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection | Lichen Bai et.al. | 2412.10891 | link |
2024-12-14 | Fast and Robust Visuomotor Riemannian Flow Matching Policy | Haoran Ding et.al. | 2412.10855 | null |
2024-12-14 | Unbiased General Annotated Dataset Generation | Dengyang Jiang et.al. | 2412.10831 | null |
2024-12-18 | Diffusion Model from Scratch | Wang Zhen et.al. | 2412.10824 | null |
2024-12-14 | Diffusion-based Method for Satellite Pattern-of-Life Identification | Yongchao Ye et.al. | 2412.10814 | null |
2024-12-14 | StyleDiT: A Unified Framework for Diverse Child and Partner Faces Synthesis with Style Latent Diffusion Transformer | Pin-Yen Chiu et.al. | 2412.10785 | null |
2024-12-20 | Video Diffusion Transformers are In-Context Learners | Zhengcong Fei et.al. | 2412.10783 | link |
2024-12-17 | GridShow: Omni Visual Generation | Cong Wan et.al. | 2412.10718 | link |
2024-12-18 | EvalGIM: A Library for Evaluating Generative Image Models | Melissa Hall et.al. | 2412.10604 | link |
2024-12-13 | SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device | Yushu Wu et.al. | 2412.10494 | null |
2024-12-13 | SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation | Runtao Liu et.al. | 2412.10493 | null |
2024-12-13 | Dynamic Entity-Masked Graph Diffusion Model for histopathological image Representation Learning | Zhenfeng Zhuang et.al. | 2412.10482 | link |
2024-12-11 | Unlocking Visual Secrets: Inverting Features with Diffusion Priors for Image Reconstruction | Sai Qian Zhang et.al. | 2412.10448 | null |
2024-12-17 | SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization | Zhentao Tan et.al. | 2412.10443 | null |
2024-12-11 | SVGFusion: Scalable Text-to-SVG Generation via Vector Space Diffusion | Ximing Xing et.al. | 2412.10437 | null |
2024-12-11 | GPTDrawer: Enhancing Visual Synthesis through ChatGPT | Kun Li et.al. | 2412.10429 | null |
2024-12-10 | CAP: Evaluation of Persuasive and Creative Image Generation | Aysan Aghazadeh et.al. | 2412.10426 | link |
2024-12-10 | Personalized and Sequential Text-to-Image Generation | Ofir Nabati et.al. | 2412.10419 | null |
2024-12-13 | OP-LoRA: The Blessing of Dimensionality | Piotr Teterwak et.al. | 2412.10362 | null |
2024-12-13 | Towards a foundation model for heavy-ion collision experiments through point cloud diffusion | Manjunath Omana Kuttan et.al. | 2412.10352 | null |
2024-12-16 | BrushEdit: All-In-One Image Inpainting and Editing | Yaowei Li et.al. | 2412.10316 | null |
2024-12-13 | Coherent 3D Scene Diffusion From a Single RGB Image | Manuel Dahnert et.al. | 2412.10294 | null |
2024-12-16 | TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation | Xingrui Wang et.al. | 2412.10275 | null |
2024-12-19 | AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era | Yudong Jiang et.al. | 2412.10255 | link |
2024-12-13 | GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion | Jiapeng Tang et.al. | 2412.10209 | null |
2024-12-16 | Efficient Generative Modeling with Residual Vector Quantization-Based Tokens | Jaehyeon Kim et.al. | 2412.10208 | null |
2024-12-13 | Simple Guidance Mechanisms for Discrete Diffusion Models | Yair Schiff et.al. | 2412.10193 | link |
2024-12-18 | SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models | Hung Nguyen et.al. | 2412.10178 | null |
2024-12-13 | The Art of Deception: Color Visual Illusions and Diffusion Models | Alex Gomez-Villa et.al. | 2412.10122 | null |
2024-12-13 | SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution | Runyi Hu et.al. | 2412.10049 | null |
2024-12-13 | Emergence of complexity in opinion propagation: A reaction-diffusion model | Romain Ducasse et.al. | 2412.10000 | null |
2024-12-13 | Cycle-Consistent Bridge Diffusion Model for Accelerated MRI Reconstruction | Tao Song et.al. | 2412.09998 | null |
2024-12-13 | EP-CFG: Energy-Preserving Classifier-Free Guidance | Kai Zhang et.al. | 2412.09966 | null |
2024-12-13 | Generating 3D Pseudo-Healthy Knee MR Images to Support Trochleoplasty Planning | Michael Wehrli et.al. | 2412.09962 | link |
2024-12-13 | Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization | Xinhao Zhong et.al. | 2412.09959 | null |
2024-12-13 | FaceShield: Defending Facial Image against Deepfake Threats | Jaehwan Jeong et.al. | 2412.09921 | null |
2024-12-13 | Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images | Yasamin Medghalchi et.al. | 2412.09910 | link |
2024-12-13 | Financial Fine-tuning a Large Time Series Model | Xinghong Fu et.al. | 2412.09880 | link |
2024-12-13 | LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity | Hongjie Wang et.al. | 2412.09856 | null |
2024-12-13 | Real-time Identity Defenses against Malicious Personalization of Diffusion Models | Hanzhong Guo et.al. | 2412.09844 | link |
2024-12-13 | Leveraging Programmatically Generated Synthetic Data for Differentially Private Diffusion Training | Yujin Choi et.al. | 2412.09842 | link |
2024-12-13 | MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion | Xunnong Xu et.al. | 2412.09828 | null |
2024-12-12 | The Unreasonable Effectiveness of Gaussian Score Approximation for Diffusion Models and its Applications | Binxu Wang et.al. | 2412.09726 | null |
2024-12-12 | Human vs. AI: A Novel Benchmark and a Comparative Study on the Detection of Generated Images and the Impact of Prompts | Philipp Moeßner et.al. | 2412.09715 | link |
2024-12-12 | Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation | Chun-Mei Feng et.al. | 2412.09706 | link |
2024-12-12 | Vision-Language Models Represent Darker-Skinned Black Individuals as More Homogeneous than Lighter-Skinned Black Individuals | Messi H. J. Lee et.al. | 2412.09668 | null |
2024-12-12 | From Noise to Nuance: Advances in Deep Generative Image Models | Benji Peng et.al. | 2412.09656 | null |
2024-12-11 | DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models | Kevin Miao et.al. | 2412.09648 | null |
2024-12-11 | Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model | Junqi You et.al. | 2412.09647 | null |
2024-12-16 | Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models | Fan Zhang et.al. | 2412.09645 | link |
2024-12-12 | Doe-1: Closed-Loop Autonomous Driving with Large World Model | Wenzhao Zheng et.al. | 2412.09627 | link |
2024-12-12 | FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion | Haonan Qiu et.al. | 2412.09626 | null |
2024-12-12 | Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors | Yue Feng et.al. | 2412.09625 | null |
2024-12-12 | OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation | Weiqi Li et.al. | 2412.09623 | null |
2024-12-12 | LoRACLR: Contrastive Adaptation for Customization of Diffusion Models | Enis Simsar et.al. | 2412.09622 | null |
2024-12-12 | SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training | Dongting Hu et.al. | 2412.09619 | null |
2024-12-12 | EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Zhuofan Zong et.al. | 2412.09618 | null |
2024-12-12 | Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG | Kavana Venkatesh et.al. | 2412.09614 | null |
2024-12-12 | FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers | Yusuf Dalva et.al. | 2412.09611 | null |
2024-12-12 | Spectral Image Tokenizer | Carlos Esteves et.al. | 2412.09607 | null |
2024-12-12 | Owl-1: Omni World Model for Consistent Long Video Generation | Yuanhui Huang et.al. | 2412.09600 | link |
2024-12-12 | LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors | Yabo Chen et.al. | 2412.09597 | null |
2024-12-12 | Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion | Zexin He et.al. | 2412.09593 | null |
2024-12-12 | Video Creation by Demonstration | Yihong Sun et.al. | 2412.09551 | null |
2024-12-19 | SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing | Xueting Li et.al. | 2412.09545 | null |
2024-12-12 | Learned Compression for Compressed Learning | Dan Jacobellis et.al. | 2412.09405 | link |
2024-12-12 | UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer | Delong Liu et.al. | 2412.09389 | link |
2024-12-12 | Diffusion Model with Representation Alignment for Protein Inverse Folding | Chenglin Wang et.al. | 2412.09380 | null |
2024-12-12 | Diffusion Predictive Control with Constraints | Ralf Römer et.al. | 2412.09342 | link |
2024-12-12 | Auto-Regressive Moving Diffusion Models for Time Series Forecasting | Jiaxin Gao et.al. | 2412.09328 | link |
2024-12-13 | Are Conditional Latent Diffusion Models Effective for Image Restoration? | Yunchen Yuan et.al. | 2412.09324 | null |
2024-12-12 | T-SVG: Text-Driven Stereoscopic Video Generation | Qiao Jin et.al. | 2412.09323 | null |
2024-12-13 | GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression | Ziqi Zhou et.al. | 2412.09296 | link |
2024-12-12 | InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption | Tiehan Fan et.al. | 2412.09283 | null |
2024-12-12 | LatentSync: Audio Conditioned Latent Diffusion Models for Lip Sync | Chunyu Li et.al. | 2412.09262 | link |
2024-12-12 | ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring | Zhongbao Yang et.al. | 2412.09193 | null |
2024-12-19 | RAD: Region-Aware Diffusion Models for Image Inpainting | Sora Kim et.al. | 2412.09191 | null |
2024-12-12 | DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization | Geonhui Jang et.al. | 2412.09169 | null |
2024-12-12 | LVMark: Robust Watermark for latent video diffusion models | MinHyuk Jang et.al. | 2412.09122 | null |
2024-12-12 | General Markovian randomized equilibrium existence and construction in zero-sum Dynkin games for diffusions | Sören Christensen et.al. | 2412.09087 | null |
2024-12-13 | An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques | Chunxiao Li et.al. | 2412.09063 | null |
2024-12-12 | Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model | Hang Zhou et.al. | 2412.09026 | link |
2024-12-12 | Arbitrary-steps Image Super-resolution via Diffusion Inversion | Zongsheng Yue et.al. | 2412.09013 | link |
2024-12-12 | Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation | Lianrui Mu et.al. | 2412.08976 | null |
2024-12-12 | Mojito: Motion Trajectory and Intensity Control for Video Generation | Xuehai He et.al. | 2412.08948 | null |
2024-12-12 | Interpreting Graphic Notation with MusicLDM: An AI Improvisation of Cornelius Cardew’s Treatise | Tornike Karchkhadze et.al. | 2412.08944 | null |
2024-12-12 | Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Ali Mollaahmadi Dehaghi et.al. | 2412.08912 | link |
2024-12-12 | Inference-Time Diffusion Model Distillation | Geon Yeong Park et.al. | 2412.08871 | null |
2024-12-12 | ViUniT: Visual Unit Tests for More Robust Visual Programming | Artemis Panagopoulou et.al. | 2412.08859 | null |
2024-12-12 | Complex-Cycle-Consistent Diffusion Model for Monaural Speech Enhancement | Yi Li et.al. | 2412.08856 | null |
2024-12-11 | Generative Modeling with Explicit Memory | Yi Tang et.al. | 2412.08781 | link |
2024-12-11 | A Physics-based Generative Model to Synthesize Training Datasets for MRI-based Fat Quantification | Juan P. Meneses et.al. | 2412.08741 | null |
2024-12-13 | $\texttt{UFig v1}$ : The ultra-fast image generator | Silvan Fischbacher et.al. | 2412.08716 | null |
2024-12-11 | ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation | Daniel Winter et.al. | 2412.08645 | null |
2024-12-11 | Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren et.al. | 2412.08642 | null |
2024-12-11 | Fast Prompt Alignment for Text-to-Image Generation | Khalil Mrini et.al. | 2412.08639 | link |
2024-12-11 | DMin: Scalable Training Data Influence Estimation for Diffusion Models | Huawei Lin et.al. | 2412.08637 | link |
2024-12-11 | Multimodal Latent Language Modeling with Next-Token Diffusion | Yutao Sun et.al. | 2412.08635 | link |
2024-12-11 | FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models | Vladimir Kulikov et.al. | 2412.08629 | link |
2024-12-13 | LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations | Zejian Li et.al. | 2412.08580 | link |
2024-12-11 | TryOffAnyone: Tiled Cloth Generation from a Dressed Person | Ioannis Xarchakos et.al. | 2412.08573 | link |
2024-12-11 | StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements | Mingkun Lei et.al. | 2412.08503 | null |
2024-12-12 | Learning Flow Fields in Attention for Controllable Person Image Generation | Zijian Zhou et.al. | 2412.08486 | link |
2024-12-11 | InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models | Min Hou et.al. | 2412.08480 | link |
2024-12-11 | CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis | Mu Zhang et.al. | 2412.08464 | null |
2024-12-11 | Reliable Uncertainty Quantification for Fiber Orientation in Composite Molding Processes using Multilevel Polynomial Surrogates | Stjepan Salatovic et.al. | 2412.08459 | null |
2024-12-12 | Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views | Songchun Zhang et.al. | 2412.08412 | null |
2024-12-13 | Physical Informed Driving World Model | Zhuoran Yang et.al. | 2412.08410 | null |
2024-12-11 | Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR3 | Joao Carvalho et.al. | 2412.08398 | null |
2024-12-11 | Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion | Jisheng Chu et.al. | 2412.08326 | link |
2024-12-15 | GDSG: Graph Diffusion-based Solution Generator for Optimization Problems in MEC Networks | Ruihuai Liang et.al. | 2412.08296 | link |
2024-12-11 | Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal Iterations | Nikil Roashan Selvam et.al. | 2412.08292 | link |
2024-12-11 | Toward Near-Globally Optimal Nonlinear Model Predictive Control via Diffusion Models | Tzu-Yuan Huang et.al. | 2412.08278 | null |
2024-12-11 | FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks | Chongkai Gao et.al. | 2412.08261 | null |
2024-12-11 | VSD2M: A Large-scale Vision-language Sticker Dataset for Multi-frame Animated Sticker Generation | Zhiqiang Yuan et.al. | 2412.08259 | null |
2024-12-16 | Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming | Ziqi Gao et.al. | 2412.08221 | link |
2024-12-11 | Unicorn: Unified Neural Image Compression with One Number Reconstruction | Qi Zheng et.al. | 2412.08210 | null |
2024-12-11 | Analyzing and Improving Model Collapse in Rectified Flow Models | Huminhao Zhu et.al. | 2412.08175 | null |
2024-12-11 | AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting | Zihao Han et.al. | 2412.08149 | null |
2024-12-11 | LatentSpeech: Latent Diffusion for Text-To-Speech Generation | Haowei Lou et.al. | 2412.08117 | null |
2024-12-11 | DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation | Jaeho Moon et.al. | 2412.08116 | null |
2024-12-11 | Seeing Syntax: Uncovering Syntactic Learning Limitations in Vision-Language Models | Sri Harsha Dumpala et.al. | 2412.08111 | null |
2024-12-11 | Generative Zoo | Tomasz Niewiadomski et.al. | 2412.08101 | null |
2024-12-11 | MAGIC: Mastering Physical Adversarial Generation in Context through Collaborative LLM Agents | Yun Xing et.al. | 2412.08014 | null |
2024-12-10 | Diffusion-Based Attention Warping for Consistent 3D Scene Editing | Eyal Gomel et.al. | 2412.07984 | null |
2024-12-10 | Non-Normal Diffusion Models | Henry Li et.al. | 2412.07935 | null |
2024-12-10 | Score Change of Variables | Stephen Robbins et.al. | 2412.07904 | null |
2024-12-10 | Score-Optimal Diffusion Schedules | Christopher Williams et.al. | 2412.07877 | null |
2024-12-09 | Boosting Alignment for Post-Unlearning Text-to-Image Generative Models | Myeongseob Ko et.al. | 2412.07808 | link |
2024-12-08 | Language Model as Visual Explainer | Xingyi Yang et.al. | 2412.07802 | null |
2024-12-10 | Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets | Zhen Liu et.al. | 2412.07775 | null |
2024-12-11 | UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics | Xi Chen et.al. | 2412.07774 | null |
2024-12-10 | From Slow Bidirectional to Fast Causal Video Generators | Tianwei Yin et.al. | 2412.07772 | null |
2024-12-12 | Learning Visual Generative Priors without Text | Shuailei Ma et.al. | 2412.07767 | null |
2024-12-10 | Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds | Xiaoyu Xiang et.al. | 2412.07766 | null |
2024-12-10 | Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation | Jingxi Chen et.al. | 2412.07761 | null |
2024-12-10 | SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints | Jianhong Bai et.al. | 2412.07760 | link |
2024-12-10 | 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation | Xiao Fu et.al. | 2412.07759 | null |
2024-12-10 | Multi-Shot Character Consistency for Text-to-Video Generation | Yuval Atzmon et.al. | 2412.07750 | null |
2024-12-10 | StyleMaster: Stylize Your Video with Artistic Generation and Translation | Zixuan Ye et.al. | 2412.07744 | null |
2024-12-10 | STIV: Scalable Text and Image Conditioned Video Generation | Zongyu Lin et.al. | 2412.07730 | null |
2024-12-10 | ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer | Jinyi Hu et.al. | 2412.07720 | link |
2024-12-10 | A Joint Energy and Differentially-Private Smart Meter Data Market | Saurab Chhachhi et.al. | 2412.07688 | null |
2024-12-10 | FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models | Tong Wu et.al. | 2412.07674 | null |
2024-12-10 | TraSCE: Trajectory Steering for Concept Erasure | Anubhav Jain et.al. | 2412.07658 | link |
2024-12-11 | Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model | Jiahua Xu et.al. | 2412.07590 | link |
2024-12-10 | DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation | Jianzong Wu et.al. | 2412.07589 | null |
2024-12-10 | Mobile Video Diffusion | Haitam Ben Yahia et.al. | 2412.07583 | null |
2024-12-12 | Parallel simulation for sampling under isoperimetry and score-based diffusion models | Huanjian Zhou et.al. | 2412.07435 | null |
2024-12-10 | Non-Progressive Influence Maximization in Dynamic Social Networks | Yunming Hui et.al. | 2412.07402 | null |
2024-12-17 | StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization | Jinlu Zhang et.al. | 2412.07375 | link |
2024-12-10 | Fusion Embedding for Pose-Guided Person Image Synthesis with Diffusion Model | Donghwna Lee et.al. | 2412.07333 | null |
2024-12-10 | A Generative Victim Model for Segmentation | Aixuan Li et.al. | 2412.07274 | null |
2024-12-10 | AppGen: Mobility-aware App Usage Behavior Generation for Mobile Users | Zihan Huang et.al. | 2412.07267 | null |
2024-12-10 | Buster: Incorporating Backdoor Attacks into Text Encoder to Mitigate NSFW Content Generation | Xin Zhao et.al. | 2412.07249 | null |
2024-12-10 | Optimization Can Learn Johnson Lindenstrauss Embeddings | Nikos Tsikouras et.al. | 2412.07242 | null |
2024-12-10 | ArtFormer: Controllable Generation of Diverse 3D Articulated Objects | Jiayi Su et.al. | 2412.07237 | link |
2024-12-10 | Moderating the Generalization of Score-based Generative Model | Wan Jiang et.al. | 2412.07229 | null |
2024-12-15 | Fine-grained Text to Image Synthesis | Xu Ouyang et.al. | 2412.07196 | null |
2024-12-10 | Hero-SR: One-Step Diffusion for Super-Resolution with Human Perception Priors | Jiangang Wang et.al. | 2412.07152 | null |
2024-12-10 | RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Resolution | Jiangang Wang et.al. | 2412.07149 | link |
2024-12-10 | FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction Error | Beilin Chu et.al. | 2412.07140 | link |
2024-12-10 | A Review of Human Emotion Synthesis Based on Generative Technology | Fei Ma et.al. | 2412.07116 | null |
2024-12-09 | Diffusing Differentiable Representations | Yash Savani et.al. | 2412.06981 | null |
2024-12-11 | Diff-GO $^\text{n}$ : Enhancing Diffusion Models for Goal-Oriented Communications | Suchinthaka Wanninayaka et.al. | 2412.06980 | null |
2024-12-09 | Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning | Mehdi Noroozi et.al. | 2412.06978 | null |
2024-12-09 | Improving Source Extraction with Diffusion and Consistency Models | Tornike Karchkhadze et.al. | 2412.06965 | link |
2024-12-09 | Geological and Well prior assisted full waveform inversion using conditional diffusion models | Fu Wang et.al. | 2412.06959 | null |
2024-12-09 | SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations | Zhaorun Chen et.al. | 2412.06878 | null |
2024-12-09 | Generating floorplans for various building functionalities via latent diffusion model | Mohamed R. Ibrahim et.al. | 2412.06859 | null |
2024-12-07 | MDiFF: Exploiting Multimodal Score-based Diffusion Models for New Fashion Product Performance Forecasting | Andrea Avogaro et.al. | 2412.06840 | null |
2024-12-10 | [MASK] is All You Need | Vincent Tao Hu et.al. | 2412.06787 | link |
2024-12-09 | Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation | Ruihan Gao et.al. | 2412.06785 | link |
2024-12-09 | Diverse Score Distillation | Yanbo Xu et.al. | 2412.06780 | null |
2024-12-09 | Visual Lexicon: Rich Image Features in Language Space | XuDong Wang et.al. | 2412.06774 | null |
2024-12-09 | Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty | Meera Hahn et.al. | 2412.06771 | link |
2024-12-09 | InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention | Howard Zhang et.al. | 2412.06753 | null |
2024-12-10 | ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet | Andrei-Robert Alexandrescu et.al. | 2412.06742 | null |
2024-12-09 | Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection | Caiyun Xie et.al. | 2412.06727 | link |
2024-12-14 | You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale | Baorui Ma et.al. | 2412.06699 | link |
2024-12-09 | Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy | Yuxuan Xue et.al. | 2412.06698 | null |
2024-12-09 | EMOv2: Pushing 5M Vision Model Frontier | Jiangning Zhang et.al. | 2412.06674 | link |
2024-12-09 | ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance | Chunwei Wang et.al. | 2412.06673 | null |
2024-12-09 | Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset | Shanshan Wang et.al. | 2412.06666 | null |
2024-12-09 | Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion | Shuaiting Li et.al. | 2412.06661 | null |
2024-12-09 | MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences | Weitao Wang et.al. | 2412.06614 | null |
2024-12-10 | AnomalyControl: Learning Cross-modal Semantic Features for Controllable Anomaly Synthesis | Shidan He et.al. | 2412.06510 | null |
2024-12-09 | Diffusion on the circle and a stochastic correlation model | Sourav Majumdar et.al. | 2412.06343 | null |
2024-12-10 | Normalizing Flows are Capable Generative Models | Shuangfei Zhai et.al. | 2412.06329 | link |
2024-12-09 | See Further When Clear: Curriculum Consistency Model | Yunpeng Liu et.al. | 2412.06295 | null |
2024-12-18 | No Annotations for Object Detection in Art through Stable Diffusion | Patrick Ramos et.al. | 2412.06286 | link |
2024-12-09 | Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction | Dongxu Wei et.al. | 2412.06273 | null |
2024-12-09 | Rendering-Refined Stable Diffusion for Privacy Compliant Synthetic Data | Kartik Patwari et.al. | 2412.06248 | null |
2024-12-09 | Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment | Kim Sung-Bin et.al. | 2412.06209 | link |
2024-12-09 | ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance | Yuming Li et.al. | 2412.06163 | null |
2024-12-09 | Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters | Yuan Wang et.al. | 2412.06143 | link |
2024-12-09 | SGIA: Enhancing Fine-Grained Visual Classification with Sequence Generative Image Augmentation | Qiyu Liao et.al. | 2412.06138 | null |
2024-12-08 | GraPE: A Generate-Plan-Edit Framework for Compositional T2I Synthesis | Ashish Goswami et.al. | 2412.06089 | null |
2024-12-08 | Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training | Zhenghong Zhou et.al. | 2412.06029 | null |
2024-12-08 | FlexDiT: Dynamic Token Density Control for Diffusion Transformer | Shuning Chang et.al. | 2412.06028 | link |
2024-12-10 | Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation | Hyeonho Jeong et.al. | 2412.06016 | null |
2024-12-08 | TopoCellGen: Generating Histopathology Cell Topology with a Diffusion Model | Meilong Xu et.al. | 2412.06011 | link |
2024-12-08 | Nested Diffusion Models Using Hierarchical Latent Priors | Xiao Zhang et.al. | 2412.05984 | null |
2024-12-08 | Anti-Reference: Universal and Immediate Defense Against Reference-Based Generation | Yiren Song et.al. | 2412.05980 | link |
2024-12-08 | Enhanced 3D Generation by 2D Editing | Haoran Li et.al. | 2412.05929 | null |
2024-12-08 | BiDM: Pushing the Limit of Quantization for Diffusion Models | Xingyu Zheng et.al. | 2412.05926 | link |
2024-12-08 | Accelerating Video Diffusion Models via Distribution Matching | Yuanzhi Zhu et.al. | 2412.05899 | null |
2024-12-08 | 3D-Consistent Image Inpainting with Diffusion Models | Leonid Antsfeld et.al. | 2412.05881 | null |
2024-12-08 | MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation | Shuwei Shi et.al. | 2412.05848 | null |
2024-12-08 | CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation | Elay Dahan et.al. | 2412.05833 | null |
2024-12-08 | Self-Guidance: Boosting Flow and Diffusion Generation on Their Own | Tiancheng Li et.al. | 2412.05827 | null |
2024-12-08 | SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation | Leigang Qu et.al. | 2412.05818 | null |
2024-12-08 | Language-Guided Image Tokenization for Generation | Kaiwen Zha et.al. | 2412.05796 | null |
2024-12-08 | On Diffusion Posterior Sampling via Sequential Monte Carlo for Zero-Shot Scaffolding of Protein Motifs | James Matthew Young et.al. | 2412.05788 | link |
2024-12-10 | Open-Source Acceleration of Stable-Diffusion.cpp | Jingxu Ng et.al. | 2412.05781 | link |
2024-12-10 | BudgetFusion: Perceptually-Guided Adaptive Diffusion Models | Qinchan Li et.al. | 2412.05780 | null |
2024-12-07 | A Tiered GAN Approach for Monet-Style Image Generation | FNU Neha et.al. | 2412.05724 | null |
2024-12-07 | Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent | Ziyuan Qin et.al. | 2412.05722 | null |
2024-12-07 | Combining Genre Classification and Harmonic-Percussive Features with Diffusion Models for Music-Video Generation | Leonardo Pina et.al. | 2412.05694 | null |
2024-12-07 | Deep Reinforcement Learning-Based Resource Allocation for Hybrid Bit and Generative Semantic Communications in Space-Air-Ground Integrated Networks | Chong Huang et.al. | 2412.05647 | null |
2024-12-07 | Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising | Gongfan Fang et.al. | 2412.05628 | link |
2024-12-07 | Do We Need to Design Specific Diffusion Models for Different Tasks? Try ONE-PIC | Ming Tao et.al. | 2412.05619 | null |
2024-12-07 | DM-SBL: Channel Estimation under Structured Interference | Yifan Wang et.al. | 2412.05582 | null |
2024-12-07 | Dif4FF: Leveraging Multimodal Diffusion Models and Graph Neural Networks for Accurate New Fashion Product Performance Forecasting | Andrea Avogaro et.al. | 2412.05566 | link |
2024-12-07 | Uncovering Vision Modality Threats in Image-to-Image Tasks | Hao Cheng et.al. | 2412.05538 | null |
2024-12-07 | Enhancing Sample Generation of Diffusion Models using Noise Level Correction | Abulikemu Abuduweili et.al. | 2412.05488 | null |
2024-12-06 | MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance | Hidir Yesiltepe et.al. | 2412.05355 | null |
2024-12-06 | Generalized Separation of Collections of Sets | Nguyen Duy Cuong et.al. | 2412.05336 | null |
2024-12-04 | The Role of Text-to-Image Models in Advanced Style Transfer Applications: A Case Study with DALL-E 3 | Ebubechukwu Ike et.al. | 2412.05325 | null |
2024-12-11 | Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model | Lening Wang et.al. | 2412.05280 | link |
2024-12-06 | Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories | Susung Hong et.al. | 2412.05279 | null |
2024-12-06 | Birth and Death of a Rose | Chen Geng et.al. | 2412.05278 | null |
2024-12-06 | MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models | Tuna Han Salih Meral et.al. | 2412.05275 | null |
2024-12-06 | Mind the Time: Temporally-Controlled Multi-Event Video Generation | Ziyi Wu et.al. | 2412.05263 | null |
2024-12-06 | Constructing Uncertainty Sets for Robust Risk Measures: A Composition of $φ$ -Divergences Approach to Combat Tail Uncertainty | Guanyu Jin et.al. | 2412.05234 | null |
2024-12-06 | Go-or-Grow Models in Biology: a Monster on a Leash | R. Thiessen et.al. | 2412.05191 | null |
2024-12-06 | DNF: Unconditional 4D Generation with Dictionary-based Neural Fields | Xinyi Zhang et.al. | 2412.05161 | null |
2024-12-06 | LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation | Donald Shenaj et.al. | 2412.05148 | link |
2024-12-12 | Probabilistic Galaxy Field Generation with Diffusion Models | Tanner Sether et.al. | 2412.05131 | null |
2024-12-06 | The Silent Prompt: Initial Noise as Implicit Guidance for Goal-Driven Image Generation | Ruoyu Wang et.al. | 2412.05101 | null |
2024-12-06 | ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration | Chi-Wei Hsiao et.al. | 2412.05043 | null |
2024-12-06 | SLayR: Scene Layout Generation with Rectified Flow | Cameron Braunstein et.al. | 2412.05003 | null |
2024-12-06 | Noise Matters: Diffusion Model-based Urban Mobility Generation with Collaborative Noise Priors | Yuheng Zhang et.al. | 2412.05000 | null |
2024-12-09 | Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction | Gaurav Shrivastava et.al. | 2412.04929 | null |
2024-12-06 | SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models | Zilan Wang et.al. | 2412.04852 | null |
2024-12-06 | UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving | Rui Chen et.al. | 2412.04842 | link |
2024-12-06 | Wavelet Diffusion Neural Operator | Peiyan Hu et.al. | 2412.04833 | link |
2024-12-06 | Customized Generation Reimagined: Fidelity and Editability Harmonized | Jian Jin et.al. | 2412.04831 | link |
2024-12-06 | DAWN-SI: Data-Aware and Noise-Informed Stochastic Interpolation for Solving Inverse Problems | Shadab Ahamed et.al. | 2412.04766 | null |
2024-12-06 | Diff4Steer: Steerable Diffusion Prior for Generative Music Retrieval with Semantic Guidance | Xuchan Bao et.al. | 2412.04746 | null |
2024-12-12 | Addressing Attribute Leakages in Diffusion-based Image Editing without Training | Sunung Mun et.al. | 2412.04715 | null |
2024-12-06 | Parametric-ControlNet: Multimodal Control in Foundation Models for Precise Engineering Design Synthesis | Rui Zhou et.al. | 2412.04707 | null |
2024-12-06 | Unsupervised Segmentation by Diffusing, Walking and Cutting | Daniela Ivanova et.al. | 2412.04678 | null |
2024-12-11 | Hidden in the Noise: Two-Stage Robust Watermarking for Images | Kasra Arabi et.al. | 2412.04653 | link |
2024-12-05 | One Communication Round is All It Needs for Federated Fine-Tuning Foundation Models | Ziyao Wang et.al. | 2412.04650 | null |
2024-12-05 | A practical guide to feedback control for Pound-Drever-Hall laser linewidth narrowing | Wance Wang et.al. | 2412.04635 | null |
2024-12-05 | Using Diffusion Priors for Video Amodal Segmentation | Kaihua Chen et.al. | 2412.04623 | null |
2024-12-05 | Inverting the Markovian projection for pure jump processes | Martin Larsson et.al. | 2412.04589 | null |
2024-12-05 | PaintScene4D: Consistent 4D Scene Generation from Text Prompts | Vinayak Gupta et.al. | 2412.04471 | null |
2024-12-05 | 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion | Chaoyang Wang et.al. | 2412.04462 | null |
2024-12-05 | LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors | Yusuf Dalva et.al. | 2412.04460 | null |
2024-12-05 | Four-Plane Factorized Video Autoencoders | Mohammed Suhail et.al. | 2412.04452 | null |
2024-12-05 | MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation | Longtao Zheng et.al. | 2412.04448 | null |
2024-12-05 | DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models | Yizhuo Li et.al. | 2412.04446 | null |
2024-12-05 | Learning Artistic Signatures: Symmetry Discovery and Style Transfer | Emma Finn et.al. | 2412.04441 | null |
2024-12-05 | GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration | Kaiyi Huang et.al. | 2412.04440 | null |
2024-12-05 | Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation | Yuying Ge et.al. | 2412.04432 | link |
2024-12-05 | Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis | Jian Han et.al. | 2412.04431 | link |
2024-12-12 | Reversible molecular simulation for training classical and machine learning force fields | Joe G Greener et.al. | 2412.04374 | link |
2024-12-05 | ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation | Dayoung Gong et.al. | 2412.04353 | null |
2024-12-05 | RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse | Zhouyingcheng Liao et.al. | 2412.04343 | null |
2024-12-05 | Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction | George Webber et.al. | 2412.04324 | null |
2024-12-05 | The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation | Fredrik Carlsson et.al. | 2412.04318 | null |
2024-12-07 | SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion | Trong-Tung Nguyen et.al. | 2412.04301 | null |
2024-12-07 | T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts | Ziwei Huang et.al. | 2412.04300 | null |
2024-12-05 | Structure-Aware Stylized Image Synthesis for Robust Medical Image Segmentation | Jie Bao et.al. | 2412.04296 | link |
2024-12-05 | LMDM:Latent Molecular Diffusion Model For 3D Molecule Generation | Xiang Chen et.al. | 2412.04242 | null |
2024-12-05 | CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model | Ruoyu Yao et.al. | 2412.04209 | null |
2024-12-11 | Instructional Video Generation | Yayuan Li et.al. | 2412.04189 | null |
2024-12-05 | AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models | Xinghui Li et.al. | 2412.04146 | null |
2024-12-05 | Understanding Memorization in Generative Models via Sharpness in Probability Landscapes | Dongjae Jeon et.al. | 2412.04140 | null |
2024-12-05 | Compositional Generative Multiphysics and Multi-component Simulation | Tao Zhang et.al. | 2412.04134 | link |
2024-12-04 | MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities | Haoning Wu et.al. | 2412.04106 | link |
2024-12-06 | BodyMetric: Evaluating the Realism of Human Bodies in Text-to-Image Generation | Nefeli Andreou et.al. | 2412.04086 | null |
2024-12-05 | ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality | Yefei He et.al. | 2412.04062 | link |
2024-12-10 | IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation | Sejong Yang et.al. | 2412.04000 | null |
2024-12-05 | Blind Underwater Image Restoration using Co-Operational Regressor Networks | Ozer Can Devecioglu et.al. | 2412.03995 | null |
2024-12-06 | Local Curvature Smoothing with Stein’s Identity for Efficient Score Matching | Genki Osada et.al. | 2412.03962 | null |
2024-12-05 | A Framework For Image Synthesis Using Supervised Contrastive Learning | Yibin Liu et.al. | 2412.03957 | null |
2024-12-05 | Enhancing and Accelerating Diffusion-Based Inverse Problem Solving through Measurements Optimization | Tianyu Chen et.al. | 2412.03941 | null |
2024-12-05 | A Noise is Worth Diffusion Guidance | Donghoon Ahn et.al. | 2412.03895 | null |
2024-12-05 | DiffSign: AI-Assisted Generation of Customizable Sign Language Videos With Enhanced Realism | Sudha Krishnamurthy et.al. | 2412.03878 | link |
2024-12-05 | Safeguarding Text-to-Image Generation via Inference-Time Prompt-Noise Optimization | Jiangweizhi Peng et.al. | 2412.03876 | link |
2024-12-05 | CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation | Hui Zhang et.al. | 2412.03859 | null |
2024-12-05 | Movie Gen: SWOT Analysis of Meta’s Generative AI Foundation Model for Transforming Media Generation, Advertising, and Entertainment Industries | Abul Ehtesham et.al. | 2412.03837 | null |
2024-12-05 | CLIP-FSAC++: Few-Shot Anomaly Classification with Anomaly Descriptor Based on CLIP | Zuo Zuo et.al. | 2412.03829 | null |
2024-12-05 | EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM | Quang Nguyen et.al. | 2412.03809 | null |
2024-12-04 | Diffusion in Zero-Shot Learning for Environmental Audio | Ysobel Sims et.al. | 2412.03771 | link |
2024-12-04 | Thermodynamic Fidelity of Generative Models for Ising System | Brian H. Lee et.al. | 2412.03764 | null |
2024-12-04 | Advancing Auto-Regressive Continuation for Video Frames | Ruibo Ming et.al. | 2412.03758 | null |
2024-12-04 | Multi-view Image Diffusion via Coordinate Noise and Fourier Attention | Justin Theiss et.al. | 2412.03756 | null |
2024-12-04 | Sprite Sheet Diffusion: Generate Game Character for Animation | Cheng-An Hsieh et.al. | 2412.03685 | null |
2024-12-04 | MV-Adapter: Multi-view Consistent Image Generation Made Easy | Zehuan Huang et.al. | 2412.03632 | null |
2024-12-04 | DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining | Youssof Nawar et.al. | 2412.03629 | null |
2024-12-04 | Network-aided Efficient Large Language Model Services With Denoising-inspired Prompt Compression | Feiran You et.al. | 2412.03621 | null |
2024-12-06 | HunyuanVideo: A Systematic Framework For Large Video Generative Models | Weijie Kong et.al. | 2412.03603 | link |
2024-12-04 | Navigation World Models | Amir Bar et.al. | 2412.03572 | null |
2024-12-04 | MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation | Zehuan Huang et.al. | 2412.03558 | null |
2024-12-04 | Imagine360: Immersive 360 Video Generation from Perspective Anchor | Jing Tan et.al. | 2412.03552 | null |
2024-12-09 | Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention | Hannan Lu et.al. | 2412.03520 | null |
2024-12-06 | NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images | Lingen Li et.al. | 2412.03517 | null |
2024-12-04 | Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion | Shengyuan Zhang et.al. | 2412.03515 | link |
2024-12-04 | Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective | Neta Shaul et.al. | 2412.03487 | null |
2024-12-04 | CleanDIFT: Diffusion Features without Noise | Nick Stracke et.al. | 2412.03439 | link |
2024-12-04 | SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model | Yan Li et.al. | 2412.03430 | null |
2024-12-04 | Skel3D: Skeleton Guided Novel View Synthesis | Aron Fóthi et.al. | 2412.03407 | null |
2024-12-04 | Implicit Priors Editing in Stable Diffusion via Targeted Token Adjustment | Feng He et.al. | 2412.03400 | null |
2024-12-04 | Diamond-defect engineering of NV- centers using ion beam irradiation | J. L. Sánchez Toural et.al. | 2412.03386 | null |
2024-12-06 | Identifiability implies consistency of MLE in partially observed diffusions on a torus | Ibrahim Ekren et.al. | 2412.03380 | null |
2024-12-04 | TASR: Timestep-Aware Diffusion Model for Image Super-Resolution | Qinwei Lin et.al. | 2412.03355 | link |
2024-12-04 | Fairer Analysis and Demographically Balanced Face Generation for Fairer Face Verification | Alexandre Fournier-Montgieux et.al. | 2412.03349 | link |
2024-12-04 | DIVE: Taming DINO for Subject-Driven Video Editing | Yi Huang et.al. | 2412.03347 | null |
2024-12-04 | Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis | Tao Jun Lin et.al. | 2412.03315 | null |
2024-12-04 | Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression | Junjie Wen et.al. | 2412.03293 | null |
2024-12-04 | Integrating Generative AI into Art Therapy: A Technical Showcase | Yannis Valentin Schmutz et.al. | 2412.03287 | link |
2024-12-04 | Black-Box Forgery Attacks on Semantic Watermarks for Diffusion Models | Andreas Müller et.al. | 2412.03283 | null |
2024-12-04 | Generating Synthetic Genotypes using Diffusion Models | Philip Kenneweg et.al. | 2412.03278 | link |
2024-12-04 | RFSR: Improving ISR Diffusion Models via Reward Feedback Learning | Xiaopeng Sun et.al. | 2412.03268 | link |
2024-12-04 | DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation | Qingdong He et.al. | 2412.03255 | null |
2024-12-06 | MaterialPicker: Multi-Modal Material Generation with Diffusion Transformers | Xiaohe Ma et.al. | 2412.03225 | null |
2024-12-04 | Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation | Gianni Franchi et.al. | 2412.03178 | null |
2024-12-04 | PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation | Qihan Huang et.al. | 2412.03177 | link |
2024-12-04 | A seamless local-nonlocal coupling diffusion model with $H^1$ vanishing nonlocality convergence | Yanzun Meng et.al. | 2412.03153 | null |
2024-12-04 | Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis | Siyoon Jin et.al. | 2412.03150 | null |
2024-12-04 | Generalized Diffusion Model with Adjusted Offset Noise | Takuro Kutsuna et.al. | 2412.03134 | null |
2024-12-04 | MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction | Gangjian Zhang et.al. | 2412.03103 | null |
2024-12-04 | Mimir: Improving Video Diffusion Models for Precise Text Understanding | Shuai Tan et.al. | 2412.03085 | null |
2024-12-05 | Align3R: Aligned Monocular Depth Estimation for Dynamic Videos | Jiahao Lu et.al. | 2412.03079 | null |
2024-12-04 | TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation | Liao Qu et.al. | 2412.03069 | link |
2024-12-04 | UTSD: Unified Time Series Diffusion Model | Xiangkai Ma et.al. | 2412.03068 | null |
2024-12-04 | Frequency-Guided Diffusion Model with Perturbation Training for Skeleton-Based Video Anomaly Detection | Xiaofeng Tan et.al. | 2412.03044 | link |
2024-12-04 | Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations | Yu Feng et.al. | 2412.03011 | null |
2024-12-04 | Partially Conditioned Patch Parallelism for Accelerated Diffusion Model Inference | XiuYu Zhang et.al. | 2412.02962 | null |
2024-12-04 | Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution | Jiahua Xiao et.al. | 2412.02960 | null |
2024-12-04 | Panoptic Diffusion Models: co-generation of images and segmentation maps | Yinghan Long et.al. | 2412.02929 | null |
2024-12-03 | ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts | Dmitry Petrov et.al. | 2412.02912 | null |
2024-12-03 | Effortless Efficiency: Low-Cost Pruning of Diffusion Models | Yang Zhang et.al. | 2412.02852 | null |
2024-12-03 | Grayscale to Hyperspectral at Any Resolution Using a Phase-Only Lens | Dean Hazineh et.al. | 2412.02798 | null |
2024-12-03 | Motion Prompting: Controlling Video Generation with Motion Trajectories | Daniel Geng et.al. | 2412.02700 | null |
2024-12-03 | Diffusion-based Visual Anagram as Multi-task Learning | Zhiyuan Xu et.al. | 2412.02693 | link |
2024-12-03 | Taming Scalable Visual Tokenizer for Autoregressive Image Generation | Fengyuan Shi et.al. | 2412.02692 | link |
2024-12-04 | FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation | Kefan Chen et.al. | 2412.02690 | null |
2024-12-04 | SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance | Viet Nguyen et.al. | 2412.02687 | null |
2024-12-03 | AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction | Lingteng Qiu et.al. | 2412.02684 | null |
2024-12-03 | Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation | Yiftach Edelstein et.al. | 2412.02631 | null |
2024-12-03 | Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Hiroki Furuta et.al. | 2412.02617 | null |
2024-12-03 | Unveiling Concept Attribution in Diffusion Models | Quang H. Nguyen et.al. | 2412.02542 | link |
2024-12-03 | WEM-GAN: Wavelet transform based facial expression manipulation | Dongya Sun et.al. | 2412.02530 | null |
2024-12-03 | It Takes Two: Real-time Co-Speech Two-person’s Interaction Generation via Reactive Auto-regressive Diffusion Model | Mingyi Shi et.al. | 2412.02419 | null |
2024-12-03 | ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation? | Leixin Zhang et.al. | 2412.02368 | link |
2024-12-06 | GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing | Khawar Islam et.al. | 2412.02366 | null |
2024-12-03 | LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization | Ethan Smith et.al. | 2412.02352 | null |
2024-12-03 | SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models | Sabina Martyniak et.al. | 2412.02332 | link |
2024-12-03 | Controlling the Latent Diffusion Model for Generative Image Shadow Removal via Residual Generation | Xinjie Li et.al. | 2412.02322 | null |
2024-12-03 | Viewpoint Consistency in 3D Generation via Attention and CLIP Guidance | Qing Zhang et.al. | 2412.02287 | null |
2024-12-03 | VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation | Mingzhe Zheng et.al. | 2412.02259 | link |
2024-12-03 | Fast LiDAR Data Generation with Rectified Flows | Kazuto Nakashima et.al. | 2412.02241 | link |
2024-12-03 | Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Jungwon Park et.al. | 2412.02237 | link |
2024-12-03 | How to Use Diffusion Priors under Sparse Views? | Qisen Wang et.al. | 2412.02225 | link |
2024-12-03 | 3D representation in 512-Byte:Variational tokenizer is the key for autoregressive 3D generation | Jinzhi Zhang et.al. | 2412.02202 | null |
2024-12-04 | Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis | Yu Yuan et.al. | 2412.02168 | null |
2024-12-03 | AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion Extrapolation | Zhihang Lin et.al. | 2412.02099 | link |
2024-12-02 | Generalized EXTRA stochastic gradient Langevin dynamics | Mert Gurbuzbalaban et.al. | 2412.01993 | null |
2024-12-02 | ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions | Tomáš Souček et.al. | 2412.01987 | link |
2024-12-02 | Diffusion models learn distributions generated by complex Langevin dynamics | Diaa E. Habibi et.al. | 2412.01919 | null |
2024-12-02 | X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models | Zeyi Sun et.al. | 2412.01824 | link |
2024-12-02 | World-consistent Video Diffusion with Explicit 3D Modeling | Qihang Zhang et.al. | 2412.01821 | null |
2024-12-05 | Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis | Anton Voronov et.al. | 2412.01819 | null |
2024-12-03 | SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation | Alexey Bokhovkin et.al. | 2412.01801 | null |
2024-12-02 | IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models | Khaled Abud et.al. | 2412.01794 | link |
2024-12-02 | Driving Scene Synthesis on Free-form Trajectories with Generative Prior | Zeyu Yang et.al. | 2412.01717 | null |
2024-12-03 | Diffusion Models with Anisotropic Gaussian Splatting for Image Inpainting | Jacob Fein-Ashley et.al. | 2412.01682 | link |
2024-12-02 | Gen-SIS: Generative Self-augmentation Improves Self-supervised Learning | Varun Belagali et.al. | 2412.01672 | null |
2024-12-02 | Vision-based Tactile Image Generation via Contact Condition-guided Diffusion Model | Xi Lin et.al. | 2412.01639 | null |
2024-12-02 | CopyrightShield: Spatial Similarity Guided Backdoor Defense against Copyright Infringement in Diffusion Models | Zhixiang Guo et.al. | 2412.01528 | null |
2024-12-04 | InfinityDrive: Breaking Time Limits in Driving World Models | Xi Guo et.al. | 2412.01522 | null |
2024-12-02 | HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition | Anton Nuzhdin et.al. | 2412.01508 | link |
2024-12-02 | RaD: A Metric for Medical Image Distribution Comparison in Out-of-Domain Detection and Other Applications | Nicholas Konz et.al. | 2412.01496 | link |
2024-12-02 | SerialGen: Personalized Image Generation by First Standardization Then Personalization | Cong Xie et.al. | 2412.01485 | null |
2024-12-02 | Improving Object Detection by Modifying Synthetic Data with Explainable AI | Nitish Mital et.al. | 2412.01477 | null |
2024-12-02 | DiffPatch: Generating Customizable Adversarial Patches using Diffusion Model | Zhixiang Wang et.al. | 2412.01440 | link |
2024-12-02 | CPA: Camera-pose-awareness Diffusion Transformer for Video Generation | Yuelei Wang et.al. | 2412.01429 | null |
2024-12-02 | An overview of diffusion models for generative artificial intelligence | Davide Gallon et.al. | 2412.01371 | null |
2024-12-02 | Exploring the Robustness of AI-Driven Tools in Digital Forensics: A Preliminary Study | Silvia Lucia Sanna et.al. | 2412.01363 | null |
2024-12-02 | MoTrans: Customized Motion Transfer with Text-driven Video Diffusion Models | Xiaomin Li et.al. | 2412.01343 | null |
2024-12-05 | Negative Token Merging: Image-based Adversarial Feature Guidance | Jaskirat Singh et.al. | 2412.01339 | null |
2024-12-02 | Physically Constrained 3D Diffusion for Inverse Design of Fiber-reinforced Polymer Composite Materials | Pei Xu et.al. | 2412.01321 | null |
2024-12-02 | Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation | Xin Yan et.al. | 2412.01316 | null |
2024-12-02 | MFTF: Mask-free Training-free Object Level Layout Control Diffusion Model | Shan Yang et.al. | 2412.01284 | link |
2024-12-02 | MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost | Sen Xing et.al. | 2412.01271 | null |
2024-12-02 | Embryo 2.0: Merging Synthetic and Real Data for Advanced AI Predictions | Oriana Presacan et.al. | 2412.01255 | link |
2024-12-02 | EmojiDiff: Advanced Facial Expression Control with High Identity Preservation in Portrait Generation | Liangwei Jiang et.al. | 2412.01254 | null |
2024-12-02 | Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective | Jinouwen Zhang et.al. | 2412.01245 | link |
2024-12-03 | Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization | Lingyun Zhang et.al. | 2412.01244 | null |
2024-12-02 | Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation | Zilyu Ye et.al. | 2412.01243 | null |
2024-12-02 | PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control | Ruichen Wang et.al. | 2412.01223 | null |
2024-12-02 | TinyFusion: Diffusion Transformers Learned Shallow | Gongfan Fang et.al. | 2412.01199 | link |
2024-12-03 | InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences | Chenyang Zhu et.al. | 2412.01197 | link |
2024-12-02 | Rectified Flow For Structure Based Drug Design | Daiheng Zhang et.al. | 2412.01174 | null |
2024-12-02 | OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows | Shufan Li et.al. | 2412.01169 | link |
2024-12-02 | TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition | Xingsong Ye et.al. | 2412.01137 | link |
2024-12-02 | LoyalDiffusion: A Diffusion Model Guarding Against Data Replication | Chenghao Li et.al. | 2412.01118 | null |
2024-12-02 | DIR: Retrieval-Augmented Image Captioning with Comprehensive Understanding | Hao Wu et.al. | 2412.01115 | null |
2024-12-02 | One Shot, One Talk: Whole-body Talking Avatar from a Single Image | Jun Xiang et.al. | 2412.01106 | null |
2024-12-03 | DuoCast: Duo-Probabilistic Meteorology-Aware Model for Extended Precipitation Nowcasting | Penghui Wen et.al. | 2412.01091 | link |
2024-12-04 | FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait | Taekyung Ki et.al. | 2412.01064 | null |
2024-12-03 | Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation | Bolin Lai et.al. | 2412.01027 | null |
2024-12-02 | On the Feature Learning in Diffusion Models | Andi Han et.al. | 2412.01021 | null |
2024-12-01 | Playable Game Generation | Mingyu Yang et.al. | 2412.00887 | link |
2024-12-06 | Beyond Pixels: Text Enhances Generalization in Real-World Image Restoration | Haoze Sun et.al. | 2412.00878 | null |
2024-12-01 | AniMer: Animal Pose and Shape Estimation Using Family Aware Transformer | Jin Lyu et.al. | 2412.00837 | null |
2024-12-01 | Particle-based 6D Object Pose Estimation from Point Clouds using Diffusion Models | Christian Möller et.al. | 2412.00835 | link |
2024-12-01 | Categorical Keypoint Positional Embedding for Robust Animal Re-Identification | Yuhao Lin et.al. | 2412.00818 | null |
2024-12-01 | Memories of Forgotten Concepts | Matan Rusanovsky et.al. | 2412.00782 | null |
2024-12-01 | DIVD: Deblurring with Improved Video Diffusion Model | Haoyang Long et.al. | 2412.00773 | null |
2024-12-01 | Learning to Forget using Hypernetworks | Jose Miguel Lara Rangel et.al. | 2412.00761 | null |
2024-12-03 | DyMO: Training-Free Diffusion Model Alignment with Dynamic Multi-Objective Scheduling | Xin Xie et.al. | 2412.00759 | null |
2024-12-01 | CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images | Jian Liu et.al. | 2412.00754 | null |
2024-12-05 | Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks | Jiahao Cui et.al. | 2412.00733 | link |
2024-12-01 | Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation | Shuling Zhao et.al. | 2412.00719 | null |
2024-12-01 | FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation | Yunpeng Bai et.al. | 2412.00671 | null |
2024-12-01 | Learning on Less: Constraining Pre-trained Model Learning for Generalizable Diffusion-Generated Image Detection | Yingjian Chen et.al. | 2412.00665 | null |
2024-12-01 | Improving Decoupled Posterior Sampling for Inverse Problems using Data Consistency Constraint | Zhi Qi et.al. | 2412.00664 | null |
2024-12-01 | Sketch-Guided Motion Diffusion for Stylized Cinemagraph Synthesis | Hao Jin et.al. | 2412.00638 | null |
2024-12-07 | A Lesson in Splats: Teacher-Guided Diffusion for 3D Gaussian Splats Generation with 2D Supervision | Chensheng Peng et.al. | 2412.00623 | null |
2024-11-30 | PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation | Qiyao Xue et.al. | 2412.00596 | link |
2024-11-30 | Continuous Concepts Removal in Text-to-image Diffusion Models | Tingxu Han et.al. | 2412.00580 | null |
2024-11-30 | Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion | Michail Dontas et.al. | 2412.00557 | null |
2024-11-30 | Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning | Tianshuo Xu et.al. | 2412.00547 | link |
2024-11-30 | Human Action CLIPS: Detecting AI-generated Human Motion | Matyas Bohacek et.al. | 2412.00526 | null |
2024-11-30 | Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects | Amir Barda et.al. | 2412.00518 | null |
2024-11-30 | Energy-Based Prior Latent Space Diffusion model for Reconstruction of Lumbar Vertebrae from Thick Slice MRI | Yanke Wang et.al. | 2412.00511 | link |
2024-11-30 | DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses | Yatian Pang et.al. | 2412.00397 | null |
2024-11-30 | DogLayout: Denoising Diffusion GAN for Discrete and Continuous Layout Generation | Zhaoxing Gan et.al. | 2412.00381 | link |
2024-11-30 | Safety Alignment Backfires: Preventing the Re-emergence of Suppressed Concepts in Fine-tuned Text-to-Image Diffusion Models | Sanghyun Kim et.al. | 2412.00357 | null |
2024-11-30 | Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment | Yizhi Song et.al. | 2412.00306 | null |
2024-11-29 | Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation | Michele De Vita et.al. | 2412.00205 | link |
2024-12-03 | LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting | Xiaoyan Xing et.al. | 2412.00177 | null |
2024-11-29 | Art-Free Generative Models: Art Creation Without Graphic Art Knowledge | Hui Ren et.al. | 2412.00176 | null |
2024-11-29 | Dynamic High-Order Control Barrier Functions with Diffuser for Safety-Critical Trajectory Planning at Signal-Free Intersections | Di Chen et.al. | 2412.00162 | null |
2024-11-29 | AerialGo: Walking-through City View Generation from Aerial Perspectives | Fuqiang Zhao et.al. | 2412.00157 | null |
2024-12-03 | VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models | Taesung Kwon et.al. | 2412.00156 | null |
2024-11-29 | Motion Modes: What Could Happen Next? | Karran Pandey et.al. | 2412.00148 | null |
2024-11-28 | MPQ-Diff: Mixed Precision Quantization for Diffusion Models | Rocco Manz Maruzzelli et.al. | 2412.00144 | null |
2024-11-28 | EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval | Muhammad Huzaifa et.al. | 2412.00139 | null |
2024-11-28 | FonTS: Text Rendering with Typography and Style Controls | Wenda Shi et.al. | 2412.00136 | null |
2024-11-28 | Open-Sora Plan: Open-Source Large Video Generation Model | Bin Lin et.al. | 2412.00131 | link |
2024-11-28 | Bridging the Gap: Aligning Text-to-Image Diffusion Models with Specific Feedback | Xuexiang Niu et.al. | 2412.00122 | null |
2024-12-03 | OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation | Hui Li et.al. | 2412.00115 | null |
2024-11-27 | Steering Rectified Flow Models in the Vector Field for Controlled Image Generation | Maitreya Patel et.al. | 2412.00100 | null |
2024-11-26 | Addressing Vulnerabilities in AI-Image Detection: Challenges and Proposed Solutions | Justin Jiang et.al. | 2412.00073 | null |
2024-11-25 | DiffGuard: Text-Based Safety Checker for Diffusion Models | Massine El Khader et.al. | 2412.00064 | null |
2024-11-29 | MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks | Yiming Wu et.al. | 2411.19786 | null |
2024-11-29 | Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy | Jeheon Woo et.al. | 2411.19769 | null |
2024-11-29 | JetFormer: An Autoregressive Generative Model of Raw Images and Text | Michael Tschannen et.al. | 2411.19722 | null |
2024-11-29 | TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting | Bojun Xiong et.al. | 2411.19654 | link |
2024-11-29 | Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing | Wenyi Mo et.al. | 2411.19652 | link |
2024-11-29 | Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook | Florinel-Alin Croitoru et.al. | 2411.19537 | link |
2024-11-29 | QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain | Wenfang Sun et.al. | 2411.19534 | null |
2024-11-29 | Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis | Tianqi Li et.al. | 2411.19509 | link |
2024-11-29 | Diffusion Models Meet Network Management: Improving Traffic Matrix Analysis with Diffusion-based Approach | Xinyu Yuan et.al. | 2411.19493 | link |
2024-12-08 | Robust Bayesian Scene Reconstruction by Leveraging Retrieval-Augmented Priors | Herbert Wright et.al. | 2411.19461 | null |
2024-11-29 | Fleximo: Towards Flexible Text-to-Human Motion Video Generation | Yuhang Zhang et.al. | 2411.19459 | null |
2024-11-29 | Achromatic single-layer hologram | Zhi Li et.al. | 2411.19445 | null |
2024-11-28 | AMO Sampler: Enhancing Text Rendering with Overshooting | Xixi Hu et.al. | 2411.19415 | link |
2024-11-28 | DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models | Shwetha Ram et.al. | 2411.19390 | null |
2024-11-28 | Enhancing Sketch Animation: Text-to-Video Diffusion Models with Temporal Consistency and Rigidity Constraints | Gaurav Rai et.al. | 2411.19381 | null |
2024-11-28 | Towards a Mechanistic Explanation of Diffusion Model Generalization | Matthew Niedoba et.al. | 2411.19339 | null |
2024-11-28 | Trajectory Attention for Fine-grained Video Motion Control | Zeqi Xiao et.al. | 2411.19324 | null |
2024-11-28 | Generalized Polyhedral DC Optimization Problems | Vu Thi Huong et.al. | 2411.19272 | null |
2024-11-28 | Improving Multi-Subject Consistency in Open-Domain Image Generation with Isolation and Reposition Attention | Huiguo He et.al. | 2411.19261 | null |
2024-11-28 | Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes | Thomas Wimmer et.al. | 2411.19233 | link |
2024-11-28 | Z-STAR+: A Zero-shot Style Transfer Method via Adjusting Style Distribution | Yingying Deng et.al. | 2411.19231 | null |
2024-11-28 | Video Depth without Video Models | Bingxin Ke et.al. | 2411.19189 | null |
2024-11-28 | SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation | Yuhan Pei et.al. | 2411.19182 | null |
2024-11-28 | Bayesian Deconvolution of Astronomical Images with Diffusion Models: Quantifying Prior-Driven Features in Reconstructions | Alessio Spagnoletti et.al. | 2411.19158 | link |
2024-11-28 | MSG score: A Comprehensive Evaluation for Multi-Scene Video Generation | Daewon Yoon et.al. | 2411.19121 | null |
2024-11-28 | Timestep Embedding Tells: It’s Time to Cache for Video Diffusion Model | Feng Liu et.al. | 2411.19108 | null |
2024-12-06 | I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting | Nicola Fanelli et.al. | 2411.19050 | link |
2024-11-28 | 3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes | Tejaswini Medi et.al. | 2411.19037 | null |
2024-11-28 | Locally-Focused Face Representation for Sketch-to-Image Generation Using Noise-Induced Refinement | Muhammad Umer Ramzan et.al. | 2411.19005 | null |
2024-11-28 | SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing | Rong-Cheng Tu et.al. | 2411.18983 | null |
2024-11-28 | Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects | Weimin Qiu et.al. | 2411.18936 | null |
2024-11-28 | VIPaint: Image Inpainting with Pre-Trained Diffusion Models via Variational Inference | Sakshi Agarwal et.al. | 2411.18929 | null |
2024-11-28 | Data Augmentation with Diffusion Models for Colon Polyp Localization on the Low Data Regime: How much real data is enough? | Adrian Tormos et.al. | 2411.18926 | null |
2024-12-03 | CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion | Yuke Li et.al. | 2411.18918 | null |
2024-11-27 | FaithDiff: Unleashing Diffusion Priors for Faithful Image Super-resolution | Junyang Chen et.al. | 2411.18824 | null |
2024-12-02 | Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds | Shuangqi Li et.al. | 2411.18810 | null |
2024-11-27 | Lifting Motion to the 3D World via 2D Diffusion | Jiaman Li et.al. | 2411.18808 | null |
2024-11-27 | Random Walks with Tweedie: A Unified Framework for Diffusion Models | Chicago Y. Park et.al. | 2411.18702 | link |
2024-11-27 | An indicator for effectiveness of text-to-image guardrails utilizing the Single-Turn Crescendo Attack (STCA) | Ted Kwartler et.al. | 2411.18699 | null |
2024-11-27 | MatchDiffusion: Training-free Generation of Match-cuts | Alejandro Pardo et.al. | 2411.18677 | link |
2024-12-02 | AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers | Sherwin Bahmani et.al. | 2411.18673 | null |
2024-11-27 | Towards Chunk-Wise Generation for Long Videos | Siyang Zhang et.al. | 2411.18668 | null |
2024-11-27 | SpotLight: Shadow-Guided Object Relighting via Diffusion | Frédéric Fortier-Chouinard et.al. | 2411.18665 | null |
2024-11-27 | Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling | Junha Hyung et.al. | 2411.18664 | null |
2024-11-27 | HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior | Li-Yuan Tsao et.al. | 2411.18662 | link |
2024-11-27 | OOD-HOI: Text-Driven 3D Whole-Body Human-Object Interactions Generation Beyond Training Domains | Yixuan Zhang et.al. | 2411.18660 | null |
2024-11-26 | Scene Co-pilot: Procedural Text to Video Generation with Human in the Loop | Zhaofang Qian et.al. | 2411.18644 | null |
2024-11-27 | GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data | Wentao Wang et.al. | 2411.18624 | null |
2024-11-27 | Diffusion Self-Distillation for Zero-Shot Customized Image Generation | Shengqu Cai et.al. | 2411.18616 | null |
2024-11-27 | CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models | Rundi Wu et.al. | 2411.18613 | null |
2024-11-27 | Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis | Eva Prakash et.al. | 2411.18602 | null |
2024-11-27 | FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion | Haosen Yang et.al. | 2411.18552 | null |
2024-11-28 | Enhancing weed detection performance by means of GenAI-based image augmentation | Sourav Modak et.al. | 2411.18513 | null |
2024-11-27 | Learning the Evolution of Physical Structure of Galaxies via Diffusion Models | Andrew Lizarraga et.al. | 2411.18440 | link |
2024-11-27 | Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models | Yiming Wu et.al. | 2411.18375 | null |
2024-11-27 | TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models | Riza Velioglu et.al. | 2411.18350 | link |
2024-11-27 | Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation | Tianyi Wei et.al. | 2411.18301 | link |
2024-11-27 | HiFiVFS: High Fidelity Video Face Swapping | Xu Chen et.al. | 2411.18293 | null |
2024-11-30 | MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation | Haopeng Fang et.al. | 2411.18281 | null |
2024-11-27 | TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution | Linwei Dong et.al. | 2411.18263 | link |
2024-11-27 | Dependency-Aware CAV Task Scheduling via Diffusion-Based Reinforcement Learning | Xiang Cheng et.al. | 2411.18230 | null |
2024-11-27 | Uniqueness and regularity of weak solutions of a drift-diffusion system for perovskite solar cells | Annegret Glitzky et.al. | 2411.18223 | null |
2024-11-27 | Prediction with Action: Visual Policy Learning via Joint Denoising Process | Yanjiang Guo et.al. | 2411.18179 | null |
2024-11-27 | Type-R: Automatically Retouching Typos for Text-to-Image Generation | Wataru Shimoda et.al. | 2411.18159 | null |
2024-11-27 | ModeDreamer: Mode Guiding Score Distillation for Text-to-3D Generation using Reference Image Prompts | Uy Dieu Tran et.al. | 2411.18135 | null |
2024-11-27 | Training Data Synthesis with Difficulty Controlled Diffusion Model | Zerun Wang et.al. | 2411.18109 | null |
2024-11-27 | PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion | Gwanghyun Kim et.al. | 2411.18068 | null |
2024-11-27 | Generative Semantic Communication for Joint Image Transmission and Segmentation | Weiwen Yuan et.al. | 2411.18005 | null |
2024-11-28 | Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models | Shuyang Hao et.al. | 2411.18000 | null |
2024-11-27 | Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery | Zhenyu Yu et.al. | 2411.17973 | null |
2024-11-27 | ROICtrl: Boosting Instance Control for Visual Generation | Yuchao Gu et.al. | 2411.17949 | null |
2024-11-26 | Passive Deepfake Detection Across Multi-modalities: A Comprehensive Survey | Hong-Hanh Nguyen-Le et.al. | 2411.17911 | null |
2024-11-26 | From memorization to generalization: a theoretical framework for diffusion-based generative models | Indranil Halder et.al. | 2411.17807 | null |
2024-11-26 | Signs as Tokens: An Autoregressive Multilingual Sign Language Generator | Ronglai Zuo et.al. | 2411.17799 | null |
2024-11-26 | Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient | Zigeng Chen et.al. | 2411.17787 | link |
2024-11-26 | DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching | Emanuele Aiello et.al. | 2411.17786 | null |
2024-11-27 | Diffusion Autoencoders for Few-shot Image Generation in Hyperbolic Space | Lingxiao Li et.al. | 2411.17784 | null |
2024-12-02 | MVBoost: Boost 3D Reconstruction with Multi-View Refinement | Xiangyu Liu et.al. | 2411.17772 | null |
2024-11-26 | Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis | Xinyu Hou et.al. | 2411.17769 | link |
2024-11-26 | Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation | Xiang Li et.al. | 2411.17763 | null |
2024-11-25 | UVCG: Leveraging Temporal Consistency for Universal Video Protection | KaiZhou Li et.al. | 2411.17746 | null |
2024-11-20 | Generating CKM Using Others’ Data: Cross-AP CKM Inference with Deep Learning | Zhuoyin Dai et.al. | 2411.17716 | null |
2024-11-27 | StableAnimator: High-Quality Identity-Preserving Human Image Animation | Shuyuan Tu et.al. | 2411.17697 | link |
2024-11-26 | ScribbleLight: Single Image Indoor Relighting with Scribbles | Jun Myeong Choi et.al. | 2411.17696 | null |
2024-11-26 | GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2411.17687 | null |
2024-11-27 | Accelerating Vision Diffusion Transformers with Skip Branches | Guanjie Chen et.al. | 2411.17616 | link |
2024-11-29 | VideoDirector: Precise Video Editing via Text-to-Video Models | Yukun Wang et.al. | 2411.17592 | null |
2024-11-26 | IMPROVE: Improving Medical Plausibility without Reliance on HumanValidation – An Enhanced Prototype-Guided Diffusion Framework | Anurag Shandilya et.al. | 2411.17535 | null |
2024-11-26 | FTMoMamba: Motion Generation with Frequency and Text State Space Models | Chengjian Li et.al. | 2411.17532 | null |
2024-11-25 | Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory | Eric Hanchen Jiang et.al. | 2411.17472 | null |
2024-11-25 | Towards Precise Scaling Laws for Video Diffusion Transformers | Yuanyang Yin et.al. | 2411.17470 | null |
2024-11-27 | WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model | Zongjian Li et.al. | 2411.17459 | link |
2024-12-05 | Identity-Preserving Text-to-Video Generation by Frequency Decomposition | Shenghai Yuan et.al. | 2411.17440 | link |
2024-11-26 | Image Generation with Multimodule Semantic Feature-Aided Selection for Semantic Communications | Chengyang Liang et.al. | 2411.17428 | null |
2024-11-28 | Cross-modal Medical Image Generation Based on Pyramid Convolutional Attention Network | Fuyou Mao et.al. | 2411.17420 | null |
2024-11-26 | AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation | Ziyi Xu et.al. | 2411.17383 | null |
2024-11-26 | Reward Incremental Learning in Text-to-Image Generation | Maorong Wang et.al. | 2411.17310 | null |
2024-11-29 | APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents | Jun Yu Chen et.al. | 2411.17255 | link |
2024-11-26 | DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model | JiHwan Moon et.al. | 2411.17248 | null |
2024-11-26 | Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration | Junyuan Deng et.al. | 2411.17240 | link |
2024-11-26 | From Graph Diffusion to Graph Classification | Jia Jun Cheng Xian et.al. | 2411.17236 | null |
2024-11-26 | DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting | Yicheng Yang et.al. | 2411.17223 | link |
2024-11-26 | AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM | Jiarui Wang et.al. | 2411.17221 | link |
2024-11-26 | cWDM: Conditional Wavelet Diffusion Models for Cross-Modality 3D Medical Image Synthesis | Paul Friedrich et.al. | 2411.17203 | link |
2024-11-26 | The Role of Urban Designers in the Era of AIGC: An Experimental Study Based on Public Participation | Di Mo et.al. | 2411.17194 | null |
2024-11-28 | PhysMotion: Physics-Grounded Dynamics From a Single Image | Xiyang Tan et.al. | 2411.17189 | null |
2024-11-26 | Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment | Dongping Chen et.al. | 2411.17188 | null |
2024-11-26 | LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization | Rui Xie et.al. | 2411.17178 | null |
2024-11-26 | ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting | Chengyou Jia et.al. | 2411.17176 | null |
2024-11-26 | OSDFace: One-Step Diffusion Model for Face Restoration | Jingkai Wang et.al. | 2411.17163 | link |
2024-11-26 | Contrastive CFG: Improving CFG in Diffusion Models by Contrasting Positive and Negative Concepts | Jinho Chang et.al. | 2411.17077 | null |
2024-11-26 | Relations, Negations, and Numbers: Looking for Logic in Generative Text-to-Image Models | Colin Conwell et.al. | 2411.17066 | link |
2024-11-26 | A generalised novel loss function for computational fluid dynamics | Zachary Cooper-Baldock et.al. | 2411.17059 | null |
2024-11-26 | PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation | Hengjia Li et.al. | 2411.17048 | null |
2024-11-26 | Free $^2$ Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models | Jaemin Kim et.al. | 2411.17041 | null |
2024-12-01 | TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On | Zhenchen Wan et.al. | 2411.17017 | link |
2024-11-25 | Generative vs. Predictive Models in Massive MIMO Channel Prediction | Ju-Hyung Lee et.al. | 2411.16971 | null |
2024-11-25 | ZoomLDM: Latent Diffusion Model for multi-scale image generation | Srikar Yellapragada et.al. | 2411.16969 | null |
2024-11-27 | MotionWavelet: Human Motion Prediction via Wavelet Manifold Learning | Yuming Feng et.al. | 2411.16964 | null |
2024-11-25 | Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing | Hanhui Wang et.al. | 2411.16832 | link |
2024-11-27 | Pathways on the Image Manifold: Image Editing via Video Generation | Noam Rotstein et.al. | 2411.16819 | null |
2024-11-25 | Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation | Shengeng Tang et.al. | 2411.16810 | null |
2024-11-25 | InTraGen: Trajectory-controlled Video Generation for Object Interactions | Zuhao Liu et.al. | 2411.16804 | link |
2024-11-25 | Controllable Human Image Generation with Personalized Multi-Garments | Yisol Choi et.al. | 2411.16801 | null |
2024-11-27 | Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image | Jiajing Lin et.al. | 2411.16800 | null |
2024-11-25 | From Diffusion to Resolution: Leveraging 2D Diffusion Models for 3D Super-Resolution Task | Bohao Chen et.al. | 2411.16792 | null |
2024-11-25 | Staleness-Centric Optimizations for Efficient Diffusion MoE Inference | Jiajun Luo et.al. | 2411.16786 | null |
2024-11-25 | CoCoNO: Attention Contrast-and-Complete for Initial Noise Optimization in Text-to-Image Synthesis | Aravindan Sundaram et.al. | 2411.16783 | null |
2024-11-25 | NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model | Jinpeng Liu et.al. | 2411.16779 | null |
2024-11-25 | SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models | Harsh Goel et.al. | 2411.16776 | null |
2024-11-25 | In-Context Experience Replay Facilitates Safety Red-Teaming of Text-to-Image Diffusion Models | Zhi-Yi Chin et.al. | 2411.16769 | null |
2024-11-24 | Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI) | Nasrin Imanpour et.al. | 2411.16754 | null |
2024-11-24 | PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation | Ziyao Zeng et.al. | 2411.16750 | null |
2024-12-02 | AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks | You Li et.al. | 2411.16749 | null |
2024-11-24 | LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis | Haojie Zhang et.al. | 2411.16748 | null |
2024-11-23 | FollowGen: A Scaled Noise Conditional Diffusion Model for Car-Following Trajectory Prediction | Junwei You et.al. | 2411.16747 | null |
2024-11-23 | Classifier-Free Guidance inside the Attraction Basin May Cause Memorization | Anubhav Jain et.al. | 2411.16738 | link |
2024-11-23 | DiM-Gestor: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 | Fan Zhang et.al. | 2411.16729 | null |
2024-11-23 | EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion | Haotian Wang et.al. | 2411.16726 | null |
2024-11-23 | $\textit{Revelio}$ : Interpreting and leveraging semantic information in diffusion models | Dahye Kim et.al. | 2411.16725 | link |
2024-11-23 | Importance-based Token Merging for Diffusion Models | Haoyu Wu et.al. | 2411.16720 | null |
2024-11-29 | Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification | S. P. Sharan et.al. | 2411.16718 | link |
2024-11-22 | TPIE: Topology-Preserved Image Editing With Text Instructions | Nivetha Jayakumar et.al. | 2411.16714 | null |
2024-11-22 | Conditional Text-to-Image Generation with Reference Guidance | Taewook Kim et.al. | 2411.16713 | null |
2024-11-25 | Generative Omnimatte: Learning to Decompose Video into Layers | Yao-Chih Lee et.al. | 2411.16683 | null |
2024-11-27 | Factorized Visual Tokenization and Generation | Zechen Bai et.al. | 2411.16681 | null |
2024-11-25 | Diffusion Features for Zero-Shot 6DoF Object Pose Estimation | Bernd Von Gimborn et.al. | 2411.16668 | null |
2024-11-25 | DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation | Zun Wang et.al. | 2411.16657 | null |
2024-11-25 | LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction | Yiran Sun et.al. | 2411.16629 | link |
2024-11-25 | Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric | Zhichao Zhang et.al. | 2411.16619 | null |
2024-11-25 | Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models | Ronghuan Wu et.al. | 2411.16602 | null |
2024-11-25 | Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification | Andre Kassis et.al. | 2411.16598 | link |
2024-11-25 | Rethinking Diffusion for Text-Driven Human Motion Generation | Zichong Meng et.al. | 2411.16575 | null |
2024-11-25 | Representation Collapsing Problems in Vector Quantization | Wenhao Zhao et.al. | 2411.16550 | null |
2024-11-25 | ADOBI: Adaptive Diffusion Bridge For Blind Inverse Problems with Application to MRI Reconstruction | Yuyang Hu et.al. | 2411.16535 | null |
2024-11-25 | Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis | Boming Miao et.al. | 2411.16503 | null |
2024-11-25 | Model-based reinforcement corrosion prediction: Continuous calibration with Bayesian optimization and corrosion wire sensor data | A. Potnis et.al. | 2411.16447 | null |
2024-11-25 | Privacy Protection in Personalized Diffusion Models via Targeted Cross-Attention Adversarial Attack | Xide Xu et.al. | 2411.16437 | null |
2024-11-25 | Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing | Kaifeng Gao et.al. | 2411.16375 | link |
2024-11-25 | CapHDR2IR: Caption-Driven Transfer from Visible Light to Infrared Domain | Jingchao Peng et.al. | 2411.16327 | null |
2024-11-25 | One Diffusion to Generate Them All | Duong H. Le et.al. | 2411.16318 | link |
2024-11-27 | An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models | Wentao Qu et.al. | 2411.16308 | link |
2024-11-25 | DiffDesign: Controllable Diffusion with Meta Prior for Efficient Interior Design Generation | Yuxuan Yang et.al. | 2411.16301 | null |
2024-11-25 | SMGDiff: Soccer Motion Generation using diffusion probabilistic models | Hongdi Yang et.al. | 2411.16216 | null |
2024-11-25 | Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation | Qiao Yu et.al. | 2411.16185 | link |
2024-11-25 | Image Generation Diversity Issues and How to Tame Them | Mischa Dombrowski et.al. | 2411.16171 | link |
2024-11-25 | Text-to-Image Synthesis: A Decade Survey | Nonghai Zhang et.al. | 2411.16164 | null |
2024-11-26 | MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model | Chenjie Cao et.al. | 2411.16157 | link |
2024-11-25 | ENCLIP: Ensembling and Clustering-Based Contrastive Language-Image Pretraining for Fashion Multimodal Search with Limited Data and Low-Quality Images | Prithviraj Purushottam Naik et.al. | 2411.16096 | null |
2024-11-25 | AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity | Jili Xia et.al. | 2411.16087 | null |
2024-11-25 | Boosting 3D Object Generation through PBR Materials | Yitong Wang et.al. | 2411.16080 | null |
2024-11-25 | Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models | Donggeun Ko et.al. | 2411.16079 | null |
2024-11-25 | Geometry Distributions | Biao Zhang et.al. | 2411.16076 | null |
2024-11-25 | Label-Free Intraoperative Mean-Transition-Time Image Generation Using Statistical Gating and Deep Learning | Yan Shi et.al. | 2411.16039 | null |
2024-11-24 | Enhancing Quantum Diffusion Models with Pairwise Bell State Entanglement | Shivalee Shah et.al. | 2411.15973 | null |
2024-11-24 | Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors | Soumava Paul et.al. | 2411.15966 | null |
2024-11-24 | A Training-Free Approach for Music Style Transfer with Latent Diffusion Models | Sooyoung Kim et.al. | 2411.15913 | null |
2024-11-24 | Bimanual Grasp Synthesis for Dexterous Robot Hands | Yanming Shao et.al. | 2411.15903 | null |
2024-11-24 | PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs | Teng Zhou et.al. | 2411.15867 | link |
2024-11-24 | Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching | Yujing Sun et.al. | 2411.15860 | link |
2024-11-24 | Efficient Multi-user Offloading of Personalized Diffusion Models: A DRL-Convex Hybrid Solution | Wanting Yang et.al. | 2411.15781 | null |
2024-11-24 | Test-time Alignment-Enhanced Adapter for Vision-Language Models | Baoshun Tong et.al. | 2411.15735 | link |
2024-11-24 | DynamicAvatars: Accurate Dynamic Facial Avatars Reconstruction and Precise Editing with Diffusion Models | Yangyang Qian et.al. | 2411.15732 | null |
2024-11-24 | Fixing the Perspective: A Critical Examination of Zero-1-to-3 | Jack Yu et.al. | 2411.15706 | null |
2024-11-23 | An adversarial feature learning based semantic communication method for Human 3D Reconstruction | Shaojiang Liu et.al. | 2411.15595 | null |
2024-11-23 | TKG-DM: Training-free Chroma Key Content Generation Diffusion Model | Ryugo Morita et.al. | 2411.15580 | link |
2024-11-23 | Radio Halo Detection in MWA Data using Deep Neural Networks and Generative Data Augmentation | Ashutosh K. Mishra et.al. | 2411.15559 | null |
2024-11-23 | NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation | Menglin Zhang et.al. | 2411.15551 | null |
2024-11-23 | Optical-Flow Guided Prompt Optimization for Coherent Video Generation | Hyelin Nam et.al. | 2411.15540 | null |
2024-11-23 | MUNBa: Machine Unlearning via Nash Bargaining | Jing Wu et.al. | 2411.15537 | link |
2024-11-23 | When Image Generation Goes Wrong: A Safety Analysis of Stable Diffusion Models | Matthias Schneider et.al. | 2411.15516 | null |
2024-11-23 | Interactive Visual Assessment for Text-to-Image Generation Models | Xiaoyue Mi et.al. | 2411.15509 | null |
2024-11-23 | Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark | Rong-Cheng Tu et.al. | 2411.15488 | link |
2024-11-23 | Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator | Chaehun Shin et.al. | 2411.15466 | null |
2024-11-23 | Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy | Te Yang et.al. | 2411.15453 | null |
2024-11-23 | ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance | Haijie Yang et.al. | 2411.15436 | null |
2024-11-23 | What Makes a Scene ? Scene Graph-based Evaluation and Feedback for Controllable Generation | Zuyao Chen et.al. | 2411.15435 | null |
2024-11-23 | LDM-Morph: Latent diffusion model guided deformable image registration | Jiong Wu et.al. | 2411.15426 | link |
2024-11-23 | Gradient-Free Classifier Guidance for Diffusion Model Sampling | Rahul Shenoy et.al. | 2411.15393 | null |
2024-11-22 | DiffServe: Efficiently Serving Text-to-Image Diffusion Models with Query-Aware Model Scaling | Sohaib Ahmad et.al. | 2411.15381 | null |
2024-11-26 | Exploiting Watermark-Based Defense Mechanisms in Text-to-Image Diffusion Models for Unauthorized Data Usage | Soumil Datta et.al. | 2411.15367 | null |
2024-11-22 | Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration | Darshan Thaker et.al. | 2411.15295 | null |
2024-11-22 | Foundation Cures Personalization: Recovering Facial Personalized Models’ Prompt Consistency | Yiyang Cai et.al. | 2411.15277 | null |
2024-11-22 | EADReg: Probabilistic Correspondence Generation with Efficient Autoregressive Diffusion Model for Outdoor Point Cloud Registration | Linrui Gong et.al. | 2411.15271 | null |
2024-11-22 | Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI | Won Jun Kim et.al. | 2411.15265 | null |
2024-11-22 | MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation | Weijia Wu et.al. | 2411.15262 | link |
2024-11-22 | OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction | Gehui Li et.al. | 2411.15255 | null |
2024-11-22 | LocRef-Diffusion:Tuning-Free Layout and Appearance-Guided Generation | Fan Deng et.al. | 2411.15252 | null |
2024-11-22 | Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward | Zhiwei Jia et.al. | 2411.15247 | null |
2024-11-22 | AnyText2: Visual Text Generation and Editing With Customizable Attributes | Yuxiang Tuo et.al. | 2411.15245 | link |
2024-11-21 | Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps | Jeeyung Kim et.al. | 2411.15236 | null |
2024-11-21 | IterIS: Iterative Inference-Solving Alignment for LoRA Merging | Hongxu Chen et.al. | 2411.15231 | null |
2024-11-19 | Adaptively Controllable Diffusion Model for Efficient Conditional Image Generation | Yucheng Xing et.al. | 2411.15199 | null |
2024-11-22 | DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving | Bencheng Liao et.al. | 2411.15139 | link |
2024-11-22 | Material Anything: Generating Materials for Any 3D Object via Diffusion | Xin Huang et.al. | 2411.15138 | null |
2024-11-22 | VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement | Daeun Lee et.al. | 2411.15115 | null |
2024-11-22 | Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion | Samarth N Ramesh et.al. | 2411.15113 | null |
2024-12-02 | OminiControl: Minimal and Universal Control for Diffusion Transformer | Zhenxiong Tan et.al. | 2411.15098 | link |
2024-11-22 | Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation | Lakshmikar R. Polamreddy et.al. | 2411.15084 | link |
2024-11-22 | Empowering Clients: Transformation of Design Processes Due to Generative AI | Johannes Schneider et.al. | 2411.15061 | null |
2024-11-22 | The 1D nonlocal Fisher-KPP equation with a top hat kernel. Part 3. The effect of perturbations in the kernel | David John Needham et.al. | 2411.15054 | null |
2024-11-22 | HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention Heads | Yu Xu et.al. | 2411.15034 | null |
2024-11-22 | FloAt: Flow Warping of Self-Attention for Clothing Animation Generation | Swasti Shreya Mishra et.al. | 2411.15028 | null |
2024-11-22 | Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation | Huy Le et.al. | 2411.14913 | null |
2024-11-22 | Prioritize Denoising Steps on Diffusion Model Preference Alignment via Explicit Denoised Distribution Estimation | Dingyuan Shi et.al. | 2411.14871 | null |
2024-11-22 | Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation | Jeongsol Kim et.al. | 2411.14863 | null |
2024-11-22 | Unsupervised Multi-view UAV Image Geo-localization via Iterative Rendering | Haoyuan Li et.al. | 2411.14816 | null |
2024-11-22 | High-Resolution Image Synthesis via Next-Token Prediction | Dengsheng Chen et.al. | 2411.14808 | null |
2024-11-22 | Style-Friendly SNR Sampler for Style-Driven Generation | Jooyoung Choi et.al. | 2411.14793 | null |
2024-11-22 | FastGrasp: Efficient Grasp Synthesis with Diffusion | Xiaofei Wu et.al. | 2411.14786 | link |
2024-11-22 | Kolmogorov Modes and Linear Response of Jump-Diffusion Models: Applications to Stochastic Excitation of the ENSO Recharge Oscillator | Mickaël D. Chekroun et.al. | 2411.14769 | null |
2024-11-22 | FairAdapter: Detecting AI-generated Images with Improved Fairness | Feng Ding et.al. | 2411.14755 | link |
2024-11-22 | Measurement of the dynamic charge susceptibility near the charge density wave transition in ErTe $_3$ | Dipanjan Chaudhuri et.al. | 2411.14746 | null |
2024-11-22 | TEXGen: a Generative Diffusion Model for Mesh Textures | Xin Yu et.al. | 2411.14740 | link |
2024-11-22 | AI Tailoring: Evaluating Influence of Image Features on Fashion Product Popularity | Xiaomin Li et.al. | 2411.14737 | null |
2024-11-22 | Any-to-3D Generation via Hybrid Diffusion Supervision | Yijun Fan et.al. | 2411.14715 | null |
2024-11-22 | TrojanEdit: Backdooring Text-Based Image Editing Models | Ji Guo et.al. | 2411.14681 | null |
2024-11-22 | Differentially Private Adaptation of Diffusion Models via Noisy Aggregated Embeddings | Pura Peetathawatchai et.al. | 2411.14639 | link |
2024-11-21 | Understanding World or Predicting Future? A Comprehensive Survey of World Models | Jingtao Ding et.al. | 2411.14499 | null |
2024-11-21 | Test-Time Adaptation of 3D Point Clouds via Denoising Diffusion Models | Hamidreza Dastmalchi et.al. | 2411.14495 | link |
2024-11-21 | Stable Flow: Vital Layers for Training-Free Image Editing | Omri Avrahami et.al. | 2411.14430 | link |
2024-11-21 | Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation | Zhuoman Liu et.al. | 2411.14423 | null |
2024-11-26 | Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation | Yuanhao Cai et.al. | 2411.14384 | null |
2024-11-21 | CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields | Xin-Yang Liu et.al. | 2411.14378 | null |
2024-11-21 | Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models | Houze Liu et.al. | 2411.14353 | null |
2024-11-21 | StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart | Jian Shi et.al. | 2411.14295 | link |
2024-11-21 | Guided MRI Reconstruction via Schrödinger Bridge | Yue Wang et.al. | 2411.14269 | null |
2024-11-21 | Novel View Extrapolation with Video Diffusion Priors | Kunhao Liu et.al. | 2411.14208 | null |
2024-11-21 | Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body | Zeqing Wang et.al. | 2411.14205 | null |
2024-11-21 | ComfyGI: Automatic Improvement of Image Generation Workflows | Dominik Sobania et.al. | 2411.14193 | null |
2024-11-21 | TaQ-DiT: Time-aware Quantization for Diffusion Transformers | Xinyan Liu et.al. | 2411.14172 | null |
2024-11-21 | RestorerID: Towards Tuning-Free Face Restoration with ID Preservation | Jiacheng Ying et.al. | 2411.14125 | link |
2024-11-21 | Point Cloud Resampling with Learnable Heat Diffusion | Wenqiang Xu et.al. | 2411.14120 | null |
2024-11-21 | MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective | Hailang Huang et.al. | 2411.14062 | link |
2024-11-21 | Safety Without Semantic Disruptions: Editing-free Safe Image Generation via Context-preserving Dual Latent Reconstruction | Jordan Vice et.al. | 2411.13982 | null |
2024-11-21 | On the Fairness, Diversity and Reliability of Text-to-Image Generative Models | Jordan Vice et.al. | 2411.13981 | null |
2024-11-21 | Transforming Static Images Using Generative Models for Video Salient Object Detection | Suhwan Cho et.al. | 2411.13975 | link |
2024-11-21 | Zero-Shot Low-Light Image Enhancement via Joint Frequency Domain Priors Guided Diffusion | Jinhong He et.al. | 2411.13961 | link |
2024-11-25 | iHQGAN: A Lightweight Invertible Hybrid Quantum-Classical Generative Adversarial Network for Unsupervised Image-to-Image Translation | Xue Yang et.al. | 2411.13920 | link |
2024-11-21 | Decoupled Sparse Priors Guided Diffusion Compression Model for Point Clouds | Xiaoge Zhang et.al. | 2411.13860 | null |
2024-11-21 | Dealing with Synthetic Data Contamination in Online Continual Learning | Maorong Wang et.al. | 2411.13852 | link |
2024-11-21 | Detecting Human Artifacts from Text-to-Image Models | Kaihong Wang et.al. | 2411.13842 | link |
2024-11-21 | CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation | Lin Sun et.al. | 2411.13836 | link |
2024-11-21 | MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control | Ruiyuan Gao et.al. | 2411.13807 | null |
2024-11-21 | GalaxyEdit: Large-Scale Image Editing Dataset with Enhanced Diffusion Adapter | Aniruddha Bala et.al. | 2411.13794 | null |
2024-11-21 | Edge-Cloud Routing for Text-to-Image Model with Token-Level Multi-Metric Prediction | Zewei Xin et.al. | 2411.13787 | null |
2024-11-20 | Non-Linear Outlier Synthesis for Out-of-Distribution Detection | Lars Doorenbos et.al. | 2411.13619 | link |
2024-11-24 | What You See Is What Matters: A Novel Visual and Physics-Based Metric for Evaluating Video Generation Quality | Zihan Wang et.al. | 2411.13609 | null |
2024-11-20 | AI-generated Image Detection: Passive or Watermark? | Moyang Guo et.al. | 2411.13553 | link |
2024-11-26 | REDUCIO! Generating 1024 $\times$ 1024 Video within 16 Seconds using Extremely Compressed Motion Latents | Rui Tian et.al. | 2411.13552 | link |
2024-11-20 | Identity Preserving 3D Head Stylization with Multiview Score Distillation | Bahri Batuhan Bilecen et.al. | 2411.13536 | null |
2024-11-20 | VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models | Ziqi Huang et.al. | 2411.13503 | link |
2024-11-20 | From Prompt Engineering to Prompt Craft | Joseph Lindley et.al. | 2411.13422 | null |
2024-11-20 | Heuristically Adaptive Diffusion-Model Evolutionary Strategy | Benedikt Hartl et.al. | 2411.13420 | null |
2024-11-20 | XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation | Ziyi Wang et.al. | 2411.13243 | link |
2024-11-20 | A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM) | Antonino Visalli et.al. | 2411.13203 | link |
2024-11-20 | RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation | Christoph Reinders et.al. | 2411.13150 | link |
2024-11-20 | CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models | Naen Xu et.al. | 2411.13144 | null |
2024-11-20 | Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry | Yijie Zhang et.al. | 2411.13120 | null |
2024-11-19 | Breaking the wire: the impact of critical length on melting pathways in silver nanowires | Kannan M Ridings et.al. | 2411.12891 | null |
2024-11-22 | From Text to Pose to Image: Improving Diffusion Model Control and Quality | Clément Bonnet et.al. | 2411.12872 | link |
2024-11-24 | CDI: Copyrighted Data Identification in Diffusion Models | Jan Dubiński et.al. | 2411.12858 | link |
2024-11-19 | Towards motion from video diffusion models | Paul Janson et.al. | 2411.12831 | null |
2024-11-19 | Stylecodes: Encoding Stylistic Information For Image Generation | Ciara Rowles et.al. | 2411.12811 | link |
2024-11-19 | Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting | Haoyu Zhao et.al. | 2411.12789 | null |
2024-11-18 | Decoupling Training-Free Guided Diffusion by ADMM | Youyuan Zhang et.al. | 2411.12773 | null |
2024-11-19 | PoM: Efficient Image and Video Generation with the Polynomial Mixer | David Picard et.al. | 2411.12663 | link |
2024-11-21 | Improving Controllability and Editability for Pretrained Text-to-Music Generation Models | Yixiao Zhang et.al. | 2411.12641 | null |
2024-11-19 | Data Pruning in Generative Diffusion Models | Rania Briq et.al. | 2411.12523 | link |
2024-11-19 | Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models | Jun Xiao et.al. | 2411.12450 | null |
2024-11-27 | Combinational Backdoor Attack against Customized Text-to-Image Models | Wenbo Jiang et.al. | 2411.12389 | null |
2024-11-19 | Scalable and Effective Negative Sample Generation for Hyperedge Prediction | Shilin Qu et.al. | 2411.12354 | null |
2024-11-19 | Diffusion Product Quantization | Jie Shao et.al. | 2411.12306 | null |
2024-11-19 | SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model | Haowen Zheng et.al. | 2411.12290 | link |
2024-12-01 | HouseLLM: LLM-Assisted Two-Phase Text-to-Floorplan Generation | Ziyang Zong et.al. | 2411.12279 | null |
2024-11-19 | Wavespeed selection of travelling wave solutions of a two-component reaction-diffusion model of cell invasion | Yuhui Chen et.al. | 2411.12232 | null |
2024-11-19 | CCIS-Diff: A Generative Model with Stable Diffusion Prior for Controlled Colonoscopy Image Synthesis | Yifan Xie et.al. | 2411.12198 | null |
2024-11-19 | Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models | Shuntaro Okada et.al. | 2411.12188 | null |
2024-11-19 | Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive Testing | Haiping Ma et.al. | 2411.12182 | link |
2024-11-19 | Enhancing Low Dose Computed Tomography Images Using Consistency Training Techniques | Mahmut S. Gokmen et.al. | 2411.12181 | null |
2024-11-21 | FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting | Fangyu Wu et.al. | 2411.12089 | null |
2024-11-18 | Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning | Arundhati S. Shanbhag et.al. | 2411.12073 | link |
2024-11-18 | Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Brian B. Moser et.al. | 2411.12072 | link |
2024-11-18 | Medical Video Generation for Disease Progression Simulation | Xu Cao et.al. | 2411.11943 | null |
2024-11-18 | SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input | Zhen Lv et.al. | 2411.11934 | null |
2024-11-22 | FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training | Anjia Cao et.al. | 2411.11927 | link |
2024-11-18 | Continuous Speculative Decoding for Autoregressive Image Generation | Zili Wang et.al. | 2411.11925 | link |
2024-11-18 | From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing | Jingxuan Wei et.al. | 2411.11916 | null |
2024-11-16 | DiHuR: Diffusion-Guided Generalizable Human Reconstruction | Jinnan Chen et.al. | 2411.11903 | null |
2024-11-18 | Aligning Few-Step Diffusion Models with Dense Reward Difference Learning | Ziyi Zhang et.al. | 2411.11727 | link |
2024-11-18 | Robust Reinforcement Learning under Diffusion Models for Data with Jumps | Chenyang Jiang et.al. | 2411.11697 | null |
2024-11-18 | Conceptwm: A Diffusion Model Watermark for Concept Protection | Liangqi Lei et.al. | 2411.11688 | null |
2024-11-19 | Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation | Rüveyda Yilmaz et.al. | 2411.11515 | link |
2024-11-18 | A Modular Open Source Framework for Genomic Variant Calling | Ankita Vaishnobi Bisoi et.al. | 2411.11513 | null |
2024-11-19 | SoK: On the Role and Future of AIGC Watermarking in the Era of Gen-AI | Kui Ren et.al. | 2411.11478 | null |
2024-11-18 | MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion | Dongseok Shim et.al. | 2411.11475 | null |
2024-11-27 | CLUE-MARK: Watermarking Diffusion Models using CLWE | Kareem Shehata et.al. | 2411.11434 | null |
2024-11-18 | Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge | Qinglong Cao et.al. | 2411.11343 | null |
2024-11-18 | Stochastic quantization and diffusion models | Kenji Fukushima et.al. | 2411.11297 | null |
2024-11-18 | MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis | Yingjie Zhou et.al. | 2411.11235 | null |
2024-11-24 | BeautyBank: Encoding Facial Makeup in Latent Space | Qianwen Lu et.al. | 2411.11231 | null |
2024-11-17 | Stealing Training Graphs from Graph Neural Networks | Minhua Lin et.al. | 2411.11197 | null |
2024-11-17 | DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images | Zhen Yuan et.al. | 2411.11190 | null |
2024-11-17 | Enhanced Anime Image Generation Using USE-CMHSA-GAN | J. Lu et.al. | 2411.11179 | null |
2024-11-17 | Integrated Ising Model with global inhibition for decision making | Olga Tapinova et.al. | 2411.11143 | null |
2024-11-17 | Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method | Yan Zheng et.al. | 2411.11135 | null |
2024-11-17 | Dynamic Dimensioning of Frequency Containment Reserves: The Case of the Nordic Grid | Jöbke Janssen et.al. | 2411.11093 | null |
2024-11-17 | D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification | Minhee Jang et.al. | 2411.11087 | link |
2024-11-20 | Time Step Generating: A Universal Synthesized Deepfake Image Detector | Ziyue Zeng et.al. | 2411.11016 | link |
2024-11-17 | SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration | Jintao Zhang et.al. | 2411.10958 | link |
2024-11-17 | Direct and Explicit 3D Generation from a Single Image | Haoyu Wu et.al. | 2411.10947 | null |
2024-11-17 | Iterative Camera-LiDAR Extrinsic Optimization via Surrogate Diffusion | Ni Ou et.al. | 2411.10936 | null |
2024-11-17 | Constrained Diffusion with Trust Sampling | William Huang et.al. | 2411.10932 | link |
2024-11-16 | Generating Compositional Scenes via Text-to-image RGBA Instance Generation | Alessandro Fontanella et.al. | 2411.10913 | null |
2024-11-16 | MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation | Ansh Shah et.al. | 2411.10886 | link |
2024-11-16 | ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models | Vipula Rawte et.al. | 2411.10867 | null |
2024-11-16 | Improvement in Facial Emotion Recognition using Synthetic Data Generated by Diffusion Model | Arnab Kumar Roy et.al. | 2411.10863 | link |
2024-11-16 | AnimateAnything: Consistent and Controllable Animation for Video Generation | Guojun Lei et.al. | 2411.10836 | null |
2024-11-16 | FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations | Hmrishav Bandyopadhyay et.al. | 2411.10818 | link |
2024-11-16 | Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay | Feng Chen et.al. | 2411.10809 | null |
2024-11-16 | Test-time Conditional Text-to-Image Synthesis Using Diffusion Models | Tripti Shukla et.al. | 2411.10800 | null |
2024-11-23 | C-DiffSET: Leveraging Latent Diffusion for SAR-to-EO Image Translation with Confidence-Guided Reliable Object Generation | Jeonghyeok Do et.al. | 2411.10788 | link |
2024-11-16 | Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer | Shitong Shao et.al. | 2411.10781 | link |
2024-11-19 | Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients | Maria Monzon et.al. | 2411.10755 | link |
2024-11-22 | TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition | Jeonghyeok Do et.al. | 2411.10745 | link |
2024-11-16 | Diffusion-based Layer-wise Semantic Reconstruction for Unsupervised Out-of-Distribution Detection | Ying Yang et.al. | 2411.10701 | link |
2024-11-16 | MaskMedPaint: Masked Medical Image Inpainting with Diffusion Models for Mitigation of Spurious Correlations | Qixuan Jin et.al. | 2411.10686 | link |
2024-11-15 | Motion Diffusion-Guided 3D Global HMR from a Dynamic Camera | Jaewoo Heo et.al. | 2411.10582 | null |
2024-11-15 | SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers | Joseph Liu et.al. | 2411.10510 | link |
2024-11-15 | DR-BFR: Degradation Representation with Diffusion Models for Blind Face Restoration | Xinmin Qiu et.al. | 2411.10508 | null |
2024-11-15 | OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models | Mathis Koroglu et.al. | 2411.10501 | null |
2024-11-15 | Prompt-Guided Environmentally Consistent Adversarial Patch | Chaoqun Li et.al. | 2411.10498 | null |
2024-11-15 | Boundary Attention Constrained Zero-Shot Layout-To-Image Generation | Huancheng Chen et.al. | 2411.10495 | null |
2024-11-11 | Efficient Denoising Method to Improve The Resolution of Satellite Images | Jhanavi Hegde et.al. | 2411.10476 | null |
2024-11-15 | M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation | Sucheng Ren et.al. | 2411.10433 | link |
2024-11-15 | Mitigating Parameter Degeneracy using Joint Conditional Diffusion Model for WECC Composite Load Model in Power Systems | Feiqin Zhu et.al. | 2411.10431 | null |
2024-11-15 | Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion | Haoran Wei et.al. | 2411.10369 | null |
2024-11-15 | Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding | Huming Qiu et.al. | 2411.10329 | null |
2024-11-15 | Probabilistic Prior Driven Attention Mechanism Based on Diffusion Model for Imaging Through Atmospheric Turbulence | Guodong Sun et.al. | 2411.10321 | null |
2024-11-15 | Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting | Ziqi Xie et.al. | 2411.10309 | link |
2024-11-15 | The Unreasonable Effectiveness of Guidance for Diffusion Models | Tim Kaiser et.al. | 2411.10257 | null |
2024-11-15 | ColorEdit: Training-free Image-Guided Color editing with diffusion model | Xingxi Yin et.al. | 2411.10232 | null |
2024-11-15 | Visual question answering based evaluation metrics for text-to-image generation | Mizuki Miyamoto et.al. | 2411.10183 | null |
2024-11-15 | CART: Compositional Auto-Regressive Transformer for Image Generation | Siddharth Roheda et.al. | 2411.10180 | null |
2024-11-15 | Evaluating Text-to-Image Diffusion Models for Texturing Synthetic Data | Thomas Lips et.al. | 2411.10164 | link |
2024-11-15 | Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning | Yushen Zuo et.al. | 2411.10130 | null |
2024-11-15 | SPLIT: SE(3)-diffusion via Local Geometry-based Score Prediction for 3D Scene-to-Pose-Set Matching Problems | Kanghyun Kim et.al. | 2411.10049 | null |
2024-11-15 | GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization | Yanhao Sun et.al. | 2411.10033 | null |
2024-11-15 | EyeDiff: text-to-image diffusion model improves rare eye disease diagnosis | Ruoyu Chen et.al. | 2411.10004 | null |
2024-11-15 | Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training | Myunsoo Kim et.al. | 2411.09998 | null |
2024-11-21 | Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era | Thanh Tam Nguyen et.al. | 2411.09955 | link |
2024-11-15 | Maximum entropy inference of reaction-diffusion models | Olga Movilla Miangolarra et.al. | 2411.09880 | link |
2024-11-15 | Content-Aware Preserving Image Generation | Giang H. Le et.al. | 2411.09871 | null |
2024-11-15 | Face De-identification: State-of-the-art Methods and Comparative Studies | Jingyi Cao et.al. | 2411.09863 | null |
2024-11-15 | Enhancing Diffusion Posterior Sampling for Inverse Problems by Integrating Crafted Measurements | Shijie Zhou et.al. | 2411.09850 | null |
2024-11-14 | Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting | Yian Wang et.al. | 2411.09823 | null |
2024-11-14 | GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising | Yunuo Wang et.al. | 2411.09512 | null |
2024-11-14 | Golden Noise for Diffusion Models: A Learning Framework | Zikai Zhou et.al. | 2411.09502 | link |
2024-11-14 | DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing | Junjie Zhou et.al. | 2411.09451 | null |
2024-11-14 | Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models | Chutian Meng et.al. | 2411.09449 | null |
2024-11-12 | Mediffusion: Joint Diffusion for Self-Explainable Semi-Supervised Classification and Medical Image Generation | Joanna Kaleta et.al. | 2411.09434 | null |
2024-11-14 | A survey of probabilistic generative frameworks for molecular simulations | Richard John et.al. | 2411.09388 | link |
2024-11-14 | EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models | Soowon Kim et.al. | 2411.09302 | null |
2024-11-14 | Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance | Md Fahim Anjum et.al. | 2411.09174 | null |
2024-11-14 | VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation | Youpeng Wen et.al. | 2411.09153 | null |
2024-11-14 | General linear threshold models with application to influence maximization | Alexander Kagan et.al. | 2411.09100 | link |
2024-11-15 | Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples | Noël Vouitsis et.al. | 2411.08954 | link |
2024-11-12 | Structured Pattern Expansion with Diffusion Models | Marzia Riso et.al. | 2411.08930 | null |
2024-11-01 | Advancements in Data Processing and Calibration for the Hyperspectral Imaging Satellite (HySIS) | Ankur Garg et.al. | 2411.08917 | null |
2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879 | null |
2024-11-13 | Offline Adaptation of Quadruped Locomotion using Diffusion Models | Reece O’Mahoney et.al. | 2411.08832 | null |
2024-11-16 | A Survey on Vision Autoregressive Model | Kai Jiang et.al. | 2411.08666 | null |
2024-11-13 | Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models | Chengdong Dong et.al. | 2411.08642 | null |
2024-11-13 | I Can Embrace and Avoid Vagueness Myself: Supporting the Design Process by Balancing Vagueness through Text-to-Image Generative AI | Myungjin Kim et.al. | 2411.08588 | null |
2024-11-18 | V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion | Xun Huang et.al. | 2411.08402 | link |
2024-11-13 | EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation | Xiaofeng Wang et.al. | 2411.08380 | null |
2024-11-13 | Physics Informed Distillation for Diffusion Models | Joshua Tian Jin Tee et.al. | 2411.08378 | link |
2024-11-13 | Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study | Jinbo Wen et.al. | 2411.08341 | null |
2024-11-13 | Motion Control for Enhanced Complex Action Video Generation | Qiang Zhou et.al. | 2411.08328 | null |
2024-11-13 | DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach | Xin Tang et.al. | 2411.08299 | null |
2024-11-13 | Control of Biohybrid Actuators using NeuroEvolution | Hugo Alcaraz-Herrera et.al. | 2411.08261 | null |
2024-11-18 | Joint Diffusion models in Continual Learning | Paweł Skierś et.al. | 2411.08224 | null |
2024-11-12 | Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing | Zitao Shuai et.al. | 2411.08196 | null |
2024-11-12 | Well-posedness of a Variable-Exponent Telegraph Equation Applied to Image Despeckling | Sudeb Majee et.al. | 2411.08175 | null |
2024-11-22 | TIPO: Text to Image with Text Presampling for Prompt Optimization | Shih-Ying Yeh et.al. | 2411.08127 | null |
2024-11-12 | An age-structured diffusive model for epidemic modelling: Lie symmetries and exact solutions | Roman Cherniha et.al. | 2411.08083 | null |
2024-11-17 | Scaling Properties of Diffusion Models for Perceptual Tasks | Rahul Ravishankar et.al. | 2411.08034 | null |
2024-11-12 | GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation | Yushi Lan et.al. | 2411.08033 | null |
2024-11-12 | Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules | Binxu Wang et.al. | 2411.07873 | null |
2024-11-12 | Novel View Synthesis with Pixel-Space Diffusion Models | Noam Elata et.al. | 2411.07765 | null |
2024-11-12 | Nanosecond nanothermometry in an electron microscope | Florian Castioni et.al. | 2411.07764 | null |
2024-11-12 | Evaluating the Generation of Spatial Relations in Text and Image Generative Models | Shang Hong Sim et.al. | 2411.07664 | null |
2024-11-12 | Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion | Kaiyu Song et.al. | 2411.07627 | null |
2024-11-12 | Unraveling the Connections between Flow Matching and Diffusion Probabilistic Models in Training-free Conditional Generation | Kaiyu Song et.al. | 2411.07625 | null |
2024-11-12 | Artificial Intelligence for Biomedical Video Generation | Linyuan Li et.al. | 2411.07619 | null |
2024-11-12 | Harmonizing Pixels and Melodies: Maestro-Guided Film Score Generation and Composition Style Transfer | F. Qi et.al. | 2411.07539 | null |
2024-11-12 | FM-TS: Flow Matching for Time Series Generation | Yang Hu et.al. | 2411.07506 | link |
2024-11-12 | GUS-IR: Gaussian Splatting with Unified Shading for Inverse Rendering | Zhihao Liang et.al. | 2411.07478 | null |
2024-11-12 | Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors | Anisha Pal et.al. | 2411.07472 | link |
2024-11-12 | Tracing the Roots: Leveraging Temporal Dynamics in Diffusion Trajectories for Origin Attribution | Andreas Floros et.al. | 2411.07449 | null |
2024-11-12 | All-in-one Weather-degraded Image Restoration via Adaptive Degradation-aware Self-prompting Model | Yuanbo Wen et.al. | 2411.07445 | null |
2024-11-11 | Exploring Variational Autoencoders for Medical Image Generation: A Comprehensive Study | Khadija Rais et.al. | 2411.07348 | null |
2024-11-11 | Score-based generative diffusion with “active” correlated noise sources | Alexandra Lamtyugina et.al. | 2411.07233 | null |
2024-11-12 | Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models | Yoad Tewel et.al. | 2411.07232 | null |
2024-11-11 | Learning from Limited and Imperfect Data | Harsh Rangwani et.al. | 2411.07229 | null |
2024-11-11 | DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID | Nyle Siddiqui et.al. | 2411.07205 | link |
2024-11-11 | Crossover from inhomogeneous to homogeneous response of a resonantly driven hBN quantum emitter | Domitille Gérard et.al. | 2411.07202 | null |
2024-11-11 | OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision | Cong Wei et.al. | 2411.07199 | null |
2024-11-14 | More Expressive Attention with Negative Weights | Ang Lv et.al. | 2411.07176 | link |
2024-11-11 | Edify 3D: Scalable High-Quality 3D Asset Generation | NVIDIA et.al. | 2411.07135 | null |
2024-11-11 | Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis | Taihang Hu et.al. | 2411.07132 | link |
2024-11-11 | Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models | NVIDIA et.al. | 2411.07126 | null |
2024-11-17 | Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models | Yanchen Wang et.al. | 2411.07121 | link |
2024-11-11 | Generalized Wasserstein Barycenters | Francesco Tornabene et.al. | 2411.06838 | null |
2024-11-16 | White-Box Diffusion Transformer for single-cell RNA-seq generation | Zhuorui Cui et.al. | 2411.06785 | link |
2024-11-11 | DiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite Observations | Xuming He et.al. | 2411.06714 | null |
2024-11-11 | Layout Control and Semantic Guidance with Attention Loss Backward for T2I Diffusion Model | Guandong Li et.al. | 2411.06692 | null |
2024-11-11 | SeedEdit: Align Image Re-Generation to Image Editing | Yichun Shi et.al. | 2411.06686 | null |
2024-11-10 | Using Diffusion Models as Generative Replay in Continual Federated Learning – What will Happen? | Yongsheng Mei et.al. | 2411.06618 | null |
2024-11-15 | Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement | Zhennan Chen et.al. | 2411.06558 | link |
2024-11-10 | CASC: Condition-Aware Semantic Communication with Latent Diffusion Models | Weixuan Chen et.al. | 2411.06552 | null |
2024-11-10 | I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength | Wanquan Feng et.al. | 2411.06525 | null |
2024-11-10 | Numerical analysis of the cross-diffusion Cahn-Hilliard model in lymphangiogenesis | Boyi Wang et.al. | 2411.06488 | null |
2024-11-19 | DDIM-Driven Coverless Steganography Scheme with Real Key | Mingyu Yu et.al. | 2411.06486 | null |
2024-11-10 | Improved Video VAE for Latent Video Diffusion Model | Pingyu Wu et.al. | 2411.06449 | null |
2024-11-10 | Detecting AutoEncoder is Enough to Catch LDM Generated Images | Dmitry Vesnin et.al. | 2411.06441 | link |
2024-11-10 | PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling | Hyukhun Koh et.al. | 2411.06438 | null |
2024-11-09 | Exploring Out-of-distribution Detection for Sparse-view Computed Tomography with Diffusion Models | Ezgi Demircan-Tureyen et.al. | 2411.06308 | null |
2024-11-09 | Text2CAD: Text to 3D CAD Generation via Technical Drawings | Mohsen Yavartanoo et.al. | 2411.06206 | null |
2024-11-09 | Scalable, Tokenization-Free Diffusion Model Architectures with Efficient Initial Convolution and Fixed-Size Reusable Structures for On-Device Image Generation | Sanchar Palit et.al. | 2411.06119 | null |
2024-11-09 | PointCG: Self-supervised Point Cloud Learning via Joint Completion and Generation | Yun Liu et.al. | 2411.06041 | null |
2024-11-08 | Energy Efficient Protein Language Models: Leveraging Small Language Models with LoRA for Controllable Protein Generation | Aayush Shah et.al. | 2411.05966 | null |
2024-11-08 | Autoregressive Models in Vision: A Survey | Jing Xiong et.al. | 2411.05902 | link |
2024-11-07 | Conditional Diffusion Model for Longitudinal Medical Image Generation | Duy-Phuong Dao et.al. | 2411.05860 | null |
2024-11-06 | Multivariate Data Augmentation for Predictive Maintenance using Diffusion | Andrew Thompson et.al. | 2411.05848 | null |
2024-11-05 | From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing | Xintian Sun et.al. | 2411.05826 | null |
2024-11-05 | FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models | Zhanwei Zhang et.al. | 2411.05823 | null |
2024-11-14 | Quantitative Assessment of Intersectional Empathetic Bias and Understanding | Vojtech Formanek et.al. | 2411.05777 | link |
2024-11-08 | StdGEN: Semantic-Decomposed 3D Character Generation from Single Images | Yuze He et.al. | 2411.05738 | null |
2024-11-08 | Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models | Jia-Hong Huang et.al. | 2411.05706 | null |
2024-11-08 | Improving Molecular Graph Generation with Flow Matching and Optimal Transport | Xiaoyang Hou et.al. | 2411.05676 | null |
2024-11-08 | WHALE: Towards Generalizable and Scalable World Models for Embodied Decision-making | Zhilong Zhang et.al. | 2411.05619 | null |
2024-11-08 | A Nerf-Based Color Consistency Method for Remote Sensing Images | Zongcheng Zuo et.al. | 2411.05557 | null |
2024-11-08 | Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion | Nan Song et.al. | 2411.05544 | null |
2024-11-08 | Improving image synthesis with diffusion-negative sampling | Alakh Desai et.al. | 2411.05473 | null |
2024-11-08 | Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation | Peidong Liu et.al. | 2411.05472 | link |
2024-11-08 | RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction | Xingyu Ai et.al. | 2411.05354 | link |
2024-11-08 | Electro-diffusive modeling and the role of spine geometry on action potential propagation in neurons | Rahul Gulati et.al. | 2411.05329 | null |
2024-11-08 | Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet | Boxiao Yu et.al. | 2411.05302 | null |
2024-11-07 | Generalizable Single-Source Cross-modality Medical Image Segmentation via Invariant Causal Mechanisms | Boqi Chen et.al. | 2411.05223 | link |
2024-11-07 | Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model | Sheng Cheng et.al. | 2411.05079 | link |
2024-11-08 | SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing | Jun-Kun Chen et.al. | 2411.05006 | null |
2024-11-07 | Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models | Shuhong Zheng et.al. | 2411.05005 | null |
2024-11-07 | ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning | David Junhao Zhang et.al. | 2411.05003 | null |
2024-11-07 | Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models | Weixin Liang et.al. | 2411.04996 | null |
2024-11-07 | SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation | Koichi Namekata et.al. | 2411.04989 | null |
2024-11-07 | AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation | Anil Kag et.al. | 2411.04967 | null |
2024-11-07 | Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification | Mischa Dombrowski et.al. | 2411.04956 | null |
2024-11-07 | DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | Wenqiang Sun et.al. | 2411.04928 | null |
2024-11-11 | StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration | Panwen Hu et.al. | 2411.04925 | null |
2024-11-07 | MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views | Yuedong Chen et.al. | 2411.04924 | link |
2024-11-13 | Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion | Kaizhe Hu et.al. | 2411.04919 | link |
2024-11-06 | Boosting Latent Diffusion with Perceptual Objectives | Tariq Berrada et.al. | 2411.04873 | null |
2024-11-07 | Taming Rectified Flow for Inversion and Editing | Jiangshan Wang et.al. | 2411.04746 | link |
2024-11-07 | Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation | Benito Buchheim et.al. | 2411.04724 | null |
2024-11-06 | Multi-Reward as Condition for Instruction-based Image Editing | Xin Gu et.al. | 2411.04713 | null |
2024-11-06 | SEE-DPO: Self Entropy Enhanced Direct Preference Optimization | Shivanshu Shekhar et.al. | 2411.04712 | null |
2024-11-05 | TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation | Wenhao Wang et.al. | 2411.04709 | null |
2024-11-07 | DanceFusion: A Spatio-Temporal Skeleton Diffusion Transformer for Audio-Driven Dance Motion Reconstruction | Li Zhao et.al. | 2411.04646 | null |
2024-11-07 | Brain Tumour Removing and Missing Modality Generation using 3D WDM | André Ferreira et.al. | 2411.04630 | link |
2024-11-07 | Social EgoMesh Estimation | Luca Scofano et.al. | 2411.04598 | link |
2024-11-07 | DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning | Yuxuan Duan et.al. | 2411.04571 | link |
2024-11-07 | Series-to-Series Diffusion Bridge Model | Hao Yang et.al. | 2411.04491 | null |
2024-11-07 | BendVLM: Test-Time Debiasing of Vision-Language Embeddings | Walter Gerych et.al. | 2411.04420 | link |
2024-11-07 | Image Understanding Makes for A Good Tokenizer for Image Generation | Luting Wang et.al. | 2411.04406 | link |
2024-11-11 | HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images | Zhenyue Qin et.al. | 2411.04332 | null |
2024-11-08 | PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing | Siddharth Seth et.al. | 2411.04249 | link |
2024-11-06 | Quantum Diffusion Models for Few-Shot Learning | Ruhan Wang et.al. | 2411.04217 | null |
2024-11-18 | General monotonicity | M. D. Voisei et.al. | 2411.04212 | null |
2024-11-06 | DiMSUM: Diffusion Mamba – A Scalable and Unified Spatial-Frequency Method for Image Generation | Hao Phung et.al. | 2411.04168 | link |
2024-10-29 | Generative AI Enabled Matching for 6G Multiple Access | Xudong Wang et.al. | 2411.04137 | null |
2024-11-06 | Community Forensics: Using Thousands of Generators to Train Fake Image Detectors | Jeongsoo Park et.al. | 2411.04125 | null |
2024-11-06 | Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging | Yuan Bi et.al. | 2411.04004 | null |
2024-11-06 | ParaGAN: A Scalable Distributed Training Framework for Generative Adversarial Networks | Ziji Shi et.al. | 2411.03999 | null |
2024-11-06 | ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy | Chenrui Tie et.al. | 2411.03990 | null |
2024-11-06 | ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models | Ashutosh Srivastava et.al. | 2411.03982 | null |
2024-11-06 | ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization | Huayang Huang et.al. | 2411.03862 | link |
2024-11-06 | Sub-DM:Subspace Diffusion Model with Orthogonal Decomposition for MRI Reconstruction | Yu Guan et.al. | 2411.03758 | link |
2024-11-06 | Zero-shot Dynamic MRI Reconstruction with Global-to-local Diffusion Model | Yu Guan et.al. | 2411.03723 | link |
2024-11-06 | Investigating Conceptual Blending of a Diffusion Model for Improving Nonword-to-Image Generation | Chihaya Matsuhira et.al. | 2411.03595 | null |
2024-11-05 | Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data | Seunggeun Chi et.al. | 2411.03561 | null |
2024-11-05 | Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation | Zhiling Yue et.al. | 2411.03551 | null |
2024-11-05 | SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture | Andrew Heschl et.al. | 2411.03505 | link |
2024-11-13 | DM4Steal: Diffusion Model For Link Stealing Attack On Graph Neural Networks | Jinyin Chen et.al. | 2411.03364 | null |
2024-11-07 | DiT4Edit: Diffusion Transformer for Image Editing | Kunyu Feng et.al. | 2411.03286 | null |
2024-11-05 | DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | Ying Zhou et.al. | 2411.03250 | null |
2024-11-05 | On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models | Tariq Berrada Ifriqi et.al. | 2411.03177 | null |
2024-11-05 | Unleashing the power of novel conditional generative approaches for new materials discovery | Lev Novitskiy et.al. | 2411.03156 | link |
2024-11-05 | Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising | Tao Huang et.al. | 2411.03053 | null |
2024-11-05 | GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details | Zhongjin Luo et.al. | 2411.03047 | null |
2024-11-05 | IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems | Heiko Oppel et.al. | 2411.02954 | null |
2024-11-05 | LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior | Xingjian Tang et.al. | 2411.02951 | null |
2024-11-05 | Textual Aesthetics in Large Language Models | Lingjie Jiang et.al. | 2411.02930 | link |
2024-11-05 | Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey | Ao Fu et.al. | 2411.02914 | null |
2024-11-05 | BrainBits: How Much of the Brain are Generative Reconstruction Methods Using? | David Mayo et.al. | 2411.02783 | null |
2024-11-05 | How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion | Giannis Daras et.al. | 2411.02780 | link |
2024-11-04 | Modelling Alzheimer’s Protein Dynamics: A Data-Driven Integration of Stochastic Methods, Machine Learning and Connectome Insights | Alec MacIver et.al. | 2411.02644 | null |
2024-10-29 | Decoupled Data Augmentation for Improving Image Classification | Ruoxin Chen et.al. | 2411.02592 | null |
2024-11-04 | TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives | Maitreya Patel et.al. | 2411.02545 | null |
2024-11-11 | INQUIRE: A Natural World Text-to-Image Retrieval Benchmark | Edward Vendrow et.al. | 2411.02537 | link |
2024-11-02 | TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models | Georgia Gabriela Sampaio et.al. | 2411.02437 | null |
2024-11-07 | Adaptive Caching for Faster Video Generation with Diffusion Transformers | Kumara Kahatapitiya et.al. | 2411.02397 | null |
2024-11-04 | Training-free Regional Prompting for Diffusion Transformers | Anthony Chen et.al. | 2411.02395 | link |
2024-11-04 | How Far is Video Generation from World Model: A Physical Law Perspective | Bingyi Kang et.al. | 2411.02385 | null |
2024-11-04 | Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition | Xinkai Liu et.al. | 2411.02334 | null |
2024-11-04 | LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation | Mufei Li et.al. | 2411.02322 | link |
2024-11-05 | Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation | Xianghui Yang et.al. | 2411.02293 | null |
2024-11-05 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-11-06 | Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models | Anjith George et.al. | 2411.02188 | null |
2024-11-04 | CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality | Yiqin Zhao et.al. | 2411.02179 | null |
2024-11-04 | Model Integrity when Unlearning with T2I Diffusion Models | Andrea Schioppa et.al. | 2411.02068 | null |
2024-11-04 | DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability | Bo Gao et.al. | 2411.01819 | null |
2024-11-04 | MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence | Fuming You et.al. | 2411.01805 | null |
2024-11-04 | A Regressor-Guided Graph Diffusion Model for Predicting Enzyme Mutations to Enhance Turnover Number | Xiaozhu Yu et.al. | 2411.01745 | link |
2024-11-04 | xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism | Jiarui Fang et.al. | 2411.01738 | link |
2024-11-04 | LaGDif: Latent Graph Diffusion Model for Efficient Protein Inverse Folding with Self-Ensemble | Taoyu Wu et.al. | 2411.01737 | link |
2024-11-03 | Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation | Zhenbin Wang et.al. | 2411.01647 | null |
2024-11-03 | DreamPolish: Domain Score Distillation With Progressive Geometry Generation | Yean Cheng et.al. | 2411.01602 | null |
2024-11-03 | HC $^3$ L-Diff: Hybrid conditional latent diffusion with high frequency enhancement for CBCT-to-CT synthesis | Shi Yin et.al. | 2411.01575 | null |
2024-11-03 | Conditional Controllable Image Fusion | Bing Cao et.al. | 2411.01573 | link |
2024-11-03 | Statistical guarantees for denoising reflected diffusion models | Asbjørn Holk et.al. | 2411.01563 | null |
2024-11-03 | Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach | Qihe Pan et.al. | 2411.01545 | link |
2024-11-03 | Digressions on Irreversibility and Stochastic Systems | Giorgio Picci et.al. | 2411.01516 | null |
2024-11-06 | Teaching Models to Improve on Tape | Liat Bezalel et.al. | 2411.01483 | null |
2024-11-03 | DPCL-Diff: The Temporal Knowledge Graph Reasoning based on Graph Node Diffusion Model with Dual-Domain Periodic Contrastive Learning | Yukun Cao et.al. | 2411.01477 | null |
2024-11-03 | Two-Timescale Model Caching and Resource Allocation for Edge-Enabled AI-Generated Content Services | Zhang Liu et.al. | 2411.01458 | null |
2024-11-02 | Guided Synthesis of Labeled Brain MRI Data Using Latent Diffusion Models for Segmentation of Enlarged Ventricles | Tim Ruschke et.al. | 2411.01351 | null |
2024-11-02 | Diffusion Models as Cartoonists! The Curious Case of High Density Regions | Rafał Karczewski et.al. | 2411.01293 | null |
2024-11-02 | Infinite-Resolution Integral Noise Warping for Diffusion Models | Yitong Deng et.al. | 2411.01212 | null |
2024-11-02 | Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models | Wonguk Cho et.al. | 2411.01179 | null |
2024-11-02 | Fast and Memory-Efficient Video Diffusion Using Streamlined Inference | Zheng Zhan et.al. | 2411.01171 | link |
2024-11-02 | Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization | Shengchao Hu et.al. | 2411.01168 | null |
2024-11-02 | Supervised Score-Based Modeling by Gradient Boosting | Changyuan Zhao et.al. | 2411.01159 | null |
2024-11-02 | X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios | Yichen Xie et.al. | 2411.01123 | link |
2024-11-01 | Spatial profiles of a reaction-diffusion epidemic model with nonlinear incidence mechanism and constant total population | Rui Peng et.al. | 2411.01041 | null |
2024-11-01 | Evaluation Metric for Quality Control and Generative Models in Histopathology Images | Pranav Jeevan et.al. | 2411.01034 | null |
2024-11-01 | From Fake Perfects to Conversational Imperfects: Exploring Image-Generative AI as a Boundary Object for Participatory Design of Public Spaces | Jose A. Guridi et.al. | 2411.00949 | null |
2024-10-30 | Accelerated AI Inference via Dynamic Execution Methods | Haim Barad et.al. | 2411.00853 | null |
2024-10-29 | IDEATOR: Jailbreaking VLMs Using VLMs | Ruofan Wang et.al. | 2411.00827 | null |
2024-11-01 | Randomized Autoregressive Visual Generation | Qihang Yu et.al. | 2411.00776 | link |
2024-11-01 | GameGen-X: Interactive Open-world Game Video Generation | Haoxuan Che et.al. | 2411.00769 | link |
2024-11-01 | Face Anonymization Made Simple | Han-Wei Kung et.al. | 2411.00762 | link |
2024-11-01 | A Graph Attention-Guided Diffusion Model for Liver Vessel Segmentation | Xiaotong Zhang et.al. | 2411.00617 | null |
2024-11-01 | pcaGAN: Improving Posterior-Sampling cGANs via Principal Component Regularization | Matthew C. Bendel et.al. | 2411.00605 | link |
2024-11-01 | Spatial profiles of a reaction-diffusion epidemic model with nonlinear incidence mechanism and varying total population | Rui Peng et.al. | 2411.00582 | null |
2024-11-01 | Conditional Synthesis of 3D Molecules with Time Correction Sampler | Hojung Jung et.al. | 2411.00551 | null |
2024-11-01 | Generative AI-based Pipeline Architecture for Increasing Training Efficiency in Intelligent Weed Control Systems | Sourav Modak et.al. | 2411.00548 | null |
2024-11-01 | Unleashing the full potential of the North Sea – Identifying key energy infrastructure synergies for 2030 and 2040 | Jan F. Wiegner et.al. | 2411.00540 | null |
2024-11-04 | Diffusion Models as Network Optimizers: Explorations and Analysis | Ruihuai Liang et.al. | 2411.00453 | link |
2024-11-01 | StyleTex: Style Image-Guided Texture Generation for 3D Models | Zhiyu Xie et.al. | 2411.00399 | null |
2024-11-01 | Constrained Diffusion Implicit Models | Vivek Jayaram et.al. | 2411.00359 | null |
2024-11-01 | TextDestroyer: A Training- and Annotation-Free Diffusion Method for Destroying Anomal Text from Images | Mengcheng Li et.al. | 2411.00355 | null |
2024-11-01 | NCST: Neural-based Color Style Transfer for Video Retouching | Xintao Jiang et.al. | 2411.00335 | null |
2024-10-31 | Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem | Declan Campbell et.al. | 2411.00238 | null |
2024-11-04 | Fashion-VDM: Video Diffusion Model for Virtual Try-On | Johanna Karras et.al. | 2411.00225 | null |
2024-10-31 | Denoising study of Fluoroscopic Images in real time tumor tracking System based on Statistical model of noise | Yongxuan Yan et.al. | 2411.00199 | null |
2024-10-31 | Clinical Evaluation of Medical Image Synthesis: A Case Study in Wireless Capsule Endoscopy | Panagiota Gatoula et.al. | 2411.00178 | null |
2024-10-31 | Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning | Penghui Ruan et.al. | 2410.24219 | link |
2024-10-31 | DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion | Weicai Ye et.al. | 2410.24203 | link |
2024-10-31 | P-Masking: Power Law Masking Improves Multi-attribute Controlled Generation | Mohamed Elgaar et.al. | 2410.24201 | null |
2024-10-31 | Multi-Attribute Linguistic Tuning for Controlled Paraphrase Generation | Mohamed Elgaar et.al. | 2410.24199 | null |
2024-10-31 | **Redefining |
Fu Feng et.al. | 2410.24160 | null |
2024-10-31 | Scaling Concept With Text-Guided Diffusion Models | Chao Huang et.al. | 2410.24151 | null |
2024-11-18 | Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure | Xiang Li et.al. | 2410.24060 | link |
2024-10-31 | TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation | Sunjae Yoon et.al. | 2410.24037 | null |
2024-11-14 | DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination | Jia Fu et.al. | 2410.24006 | link |
2024-11-01 | Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model | Wenjia Xie et.al. | 2410.23994 | null |
2024-10-31 | Stochastic Reconstruction of Gappy Lagrangian Turbulent Signals by Conditional Diffusion Models | Tianyi Li et.al. | 2410.23971 | null |
2024-10-31 | Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation | Yihang Zhou et.al. | 2410.23962 | null |
2024-10-31 | Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model | Hao Zhang et.al. | 2410.23905 | link |
2024-10-29 | Temporal and Spatial Super Resolution with Latent Diffusion Model in Medical MRI images | Vishal Dubey et.al. | 2410.23898 | null |
2024-11-08 | DiffBatt: A Diffusion Model for Battery Degradation Prediction and Synthesis | Hamidreza Eivazi et.al. | 2410.23893 | link |
2024-10-31 | Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts | Xiang Deng et.al. | 2410.23836 | null |
2024-10-31 | Denoising Diffusion Models for Anomaly Localization in Medical Images | Cosmin I. Bercea et.al. | 2410.23834 | null |
2024-10-31 | Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models | Youngjun Jun et.al. | 2410.23820 | null |
2024-10-31 | EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching | Xinwang Chen et.al. | 2410.23788 | link |
2024-11-05 | In-Context LoRA for Diffusion Transformers | Lianghua Huang et.al. | 2410.23775 | link |
2024-10-31 | Radiation forces and torques in optics and acoustics | Ivan Toftul et.al. | 2410.23670 | null |
2024-10-31 | On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection | Xiufeng Song et.al. | 2410.23623 | link |
2024-10-31 | Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization | Xiao Guo et.al. | 2410.23556 | null |
2024-10-31 | There and Back Again: On the relation between noises, images, and their inversions in diffusion models | Łukasz Staniszewski et.al. | 2410.23530 | null |
2024-10-30 | MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts | Jie Zhu et.al. | 2410.23332 | null |
2024-10-29 | Improved Patch Denoising Diffusion Probabilistic Models for Magnetic Resonance Fingerprinting | Perla Mayo et.al. | 2410.23318 | null |
2024-10-30 | ReferEverything: Towards Segmenting Everything We Can Speak of in Videos | Anurag Bagchi et.al. | 2410.23287 | null |
2024-11-03 | Provable Acceleration for Diffusion Models under Minimal Assumptions | Gen Li et.al. | 2410.23285 | null |
2024-11-05 | RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi et.al. | 2410.23280 | null |
2024-10-31 | SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation | Yining Hong et.al. | 2410.23277 | null |
2024-10-30 | Multi-student Diffusion Distillation for Better One-step Generators | Yanke Song et.al. | 2410.23274 | null |
2024-10-30 | Generalized Short Path Algorithms: Towards Super-Quadratic Speedup over Markov Chain Search for Combinatorial Optimization | Shouvanik Chakrabarti et.al. | 2410.23270 | null |
2024-10-30 | Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms | Jordan Meyer et.al. | 2410.23144 | null |
2024-11-12 | CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense | Mingkun Zhang et.al. | 2410.23091 | link |
2024-10-30 | Controlling Language and Diffusion Models by Transporting Activations | Pau Rodriguez et.al. | 2410.23054 | link |
2024-10-30 | Improving Musical Accompaniment Co-creation via Diffusion Transformers | Javier Nistal et.al. | 2410.23005 | null |
2024-10-30 | DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes | Jialiang Zhang et.al. | 2410.23004 | null |
2024-10-30 | LumiSculpt: A Consistency Lighting Control Network for Video Generation | Yuxin Zhang et.al. | 2410.22979 | null |
2024-10-30 | Private Synthetic Text Generation with Diffusion Models | Sebastian Ochs et.al. | 2410.22971 | link |
2024-10-31 | DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data | Hanyang Chen et.al. | 2410.22938 | link |
2024-10-30 | An Individual Identity-Driven Framework for Animal Re-Identification | Yihao Wu et.al. | 2410.22927 | link |
2024-10-30 | HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models | Shengkai Zhang et.al. | 2410.22901 | link |
2024-10-30 | Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Hanlin Wu et.al. | 2410.22830 | null |
2024-10-30 | Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models | Arash Marioriyad et.al. | 2410.22775 | null |
2024-11-11 | FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images | Zheng Yu et.al. | 2410.22771 | link |
2024-10-30 | Identifying Drift, Diffusion, and Causal Structure from Temporal Snapshots | Vincent Guan et.al. | 2410.22729 | link |
2024-10-31 | One Prompt to Verify Your Models: Black-Box Text-to-Image Models Verification via Non-Transferable Adversarial Attacks | Ji Guo et.al. | 2410.22725 | null |
2024-10-30 | FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution | Shuai Wang et.al. | 2410.22655 | null |
2024-10-31 | Consistency Diffusion Bridge Models | Guande He et.al. | 2410.22637 | null |
2024-10-29 | GRADE: Quantifying Sample Diversity in Text-to-Image Models | Royi Rassin et.al. | 2410.22592 | null |
2024-10-29 | Stochastic Trajectories and Spectral Boundary Conditions for Enhanced Diffusion in Immersed Boundary Problems | Rômulo Damasclin Chaves dos Santos et.al. | 2410.22579 | null |
2024-10-29 | Unpicking Data at the Seams: VAEs, Disentanglement and Independent Components | Carl Allen et.al. | 2410.22559 | null |
2024-10-31 | FairSkin: Fair Diffusion for Skin Disease Image Generation | Ruichen Zhang et.al. | 2410.22551 | null |
2024-10-29 | Embedding Watermarks in Diffusion Process for Model Intellectual Property Protection | Jijia Yang et.al. | 2410.22445 | null |
2024-11-06 | Point cloud-based diffusion models for the Electron-Ion Collider | Jack Y. Araz et.al. | 2410.22421 | link |
2024-10-29 | Discrete Modeling via Boundary Conditional Diffusion Processes | Yuxuan Gu et.al. | 2410.22380 | null |
2024-10-29 | Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance | Dongmin Park et.al. | 2410.22376 | link |
2024-10-28 | Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders | Viacheslav Surkov et.al. | 2410.22366 | link |
2024-10-26 | MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation | Jialin Luo et.al. | 2410.22362 | link |
2024-10-29 | Observation of a Bilayer Superfluid with Interlayer Coherence | Erik Rydow et.al. | 2410.22326 | null |
2024-10-29 | Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier | Kai Wang et.al. | 2410.22317 | link |
2024-10-29 | Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models | Raman Dutt et.al. | 2410.22149 | link |
2024-10-29 | Variational inference for pile-up removal at hadron colliders with diffusion models | Malte Algren et.al. | 2410.22074 | null |
2024-10-29 | Dual Conditional Diffusion Models for Sequential Recommendation | Hongtao Huang et.al. | 2410.21967 | null |
2024-11-02 | PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference | Kendong Liu et.al. | 2410.21966 | null |
2024-10-29 | CT to PET Translation: A Large-scale Dataset and Domain-Knowledge-Guided Diffusion Approach | Dac Thai Nguyen et.al. | 2410.21932 | link |
2024-10-29 | Guided Diffusion-based Counterfactual Augmentation for Robust Session-based Recommendation | Muskan Gupta et.al. | 2410.21892 | null |
2024-10-29 | Diffusion as Reasoning: Enhancing Object Goal Navigation with LLM-Biased Diffusion Model | Yiming Ji et.al. | 2410.21842 | null |
2024-10-29 | Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images | Suhyun Ahn et.al. | 2410.21826 | link |
2024-10-29 | HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion | Yu Zeng et.al. | 2410.21789 | null |
2024-10-29 | DiffusionVel: Multi-Information Integrated Velocity Inversion Using Generative Diffusion Models | Hao Zhang et.al. | 2410.21776 | null |
2024-10-30 | IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models | Hang Guo et.al. | 2410.21759 | link |
2024-10-29 | DiffSTR: Controlled Diffusion Models for Scene Text Removal | Sanhita Pathak et.al. | 2410.21721 | null |
2024-10-29 | Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation | Ruihao Xia et.al. | 2410.21708 | link |
2024-10-29 | Investigating Memorization in Video Diffusion Models | Chen Chen et.al. | 2410.21669 | null |
2024-10-29 | Exploring Local Memorization in Diffusion Models via Bright Ending Attention | Chen Chen et.al. | 2410.21665 | null |
2024-10-29 | Fingerprints of Super Resolution Networks | Jeremy Vonderfecht et.al. | 2410.21653 | null |
2024-10-29 | Applications of the Second-Order Esscher Pricing in Risk Management | Tahir Choulli et.al. | 2410.21649 | null |
2024-10-29 | RDSinger: Reference-based Diffusion Network for Singing Voice Synthesis | Kehan Sui et.al. | 2410.21641 | null |
2024-10-29 | Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis | Deepak Sridhar et.al. | 2410.21638 | link |
2024-10-29 | OFER: Occluded Face Expression Reconstruction | Pratheba Selvaraju et.al. | 2410.21629 | null |
2024-10-28 | CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation | Claudius Krause et.al. | 2410.21611 | null |
2024-10-28 | Diffusion-nested Auto-Regressive Synthesis of Heterogeneous Tabular Data | Hengrui Zhang et.al. | 2410.21523 | null |
2024-10-28 | Denoising Diffusion Planner: Learning Complex Paths from Low-Quality Demonstrations | Michiel Nikken et.al. | 2410.21497 | link |
2024-10-28 | Enhancing CTR Prediction in Recommendation Domain with Search Query Representation | Yuening Wang et.al. | 2410.21487 | null |
2024-11-01 | AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models | Yaopei Zeng et.al. | 2410.21471 | link |
2024-10-31 | Generator Subadditive Functions for Mixed-Integer Programs | Gustavo Ivan Angulo Olivares et.al. | 2410.21467 | null |
2024-10-28 | Energy-Based Diffusion Language Models for Text Generation | Minkai Xu et.al. | 2410.21357 | null |
2024-10-28 | Absorb & Escape: Overcoming Single Model Limitations in Generating Genomic Sequences | Zehui Li et.al. | 2410.21345 | link |
2024-10-28 | Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments | Yuzhe Yang et.al. | 2410.21340 | null |
2024-10-31 | E(3)-invariant diffusion model for pocket-aware peptide generation | Po-Yu Liang et.al. | 2410.21335 | null |
2024-10-26 | Multi-path Exploration and Feedback Adjustment for Text-to-Image Person Retrieval | Bin Kang et.al. | 2410.21318 | null |
2024-11-04 | Decoding Diffusion: A Scalable Framework for Unsupervised Analysis of Latent Space Biases and Representations Using Natural Language Prompts | E. Zhixuan Zeng et.al. | 2410.21314 | null |
2024-10-31 | Domain-Adaptive Pre-training of Self-Supervised Foundation Models for Medical Image Classification in Gastrointestinal Endoscopy | Marcel Roth et.al. | 2410.21302 | null |
2024-10-21 | Evaluating the Posterior Sampling Ability of Plug&Play Diffusion Methods in Sparse-View CT | Liam Moroy et.al. | 2410.21301 | null |
2024-10-28 | On Inductive Biases That Enable Generalization of Diffusion Transformers | Jie An et.al. | 2410.21273 | link |
2024-10-28 | LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior | Hanyu Wang et.al. | 2410.21264 | null |
2024-10-29 | AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? | Han Bao et.al. | 2410.21259 | link |
2024-10-28 | One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation | Zhendong Wang et.al. | 2410.21257 | null |
2024-10-28 | On learning higher-order cumulants in diffusion models | Gert Aarts et.al. | 2410.21212 | null |
2024-11-05 | Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences | Zhihao Zhao et.al. | 2410.21130 | null |
2024-10-28 | Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models | Wenda Li et.al. | 2410.21088 | link |
2024-10-28 | Federated Time Series Generation on Feature and Temporally Misaligned Data | Chenrui Fan et.al. | 2410.21072 | null |
2024-10-28 | Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework | Vladimir Arkhipkin et.al. | 2410.21061 | link |
2024-10-28 | Beyond Autoregression: Fast LLMs via Self-Distillation Through Time | Justin Deschenaux et.al. | 2410.21035 | link |
2024-10-29 | EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior | Xin Xiang et.al. | 2410.20981 | null |
2024-10-28 | Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models! | Arash Marioriyad et.al. | 2410.20972 | null |
2024-10-28 | Markov spin models for image generation : explicit large deviations with respect to the number of pixels | Cecile Monthus et.al. | 2410.20906 | null |
2024-10-28 | Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models | Weijian Luo et.al. | 2410.20898 | link |
2024-10-28 | General task optimal planning for heterogeneous teams with precedence and compatibility constraints and its application on power grid inspection using unmanned aerial vehicles | Antonio Sojo et.al. | 2410.20849 | link |
2024-10-28 | Novel Object Synthesis via Adaptive Text-Image Harmony | Zeren Xiong et.al. | 2410.20823 | null |
2024-10-28 | Development of a conditional diffusion model to predict process parameters and microstructures of dendrite crystals of matrix resin based on mechanical properties | Arisa Ikeda et.al. | 2410.20822 | null |
2024-10-28 | Matryoshka: Learning to Drive Black-Box LLMs with LLMs | Changhao Li et.al. | 2410.20749 | null |
2024-10-28 | Murine AI excels at cats and cheese: Structural differences between human and mouse neurons and their implementation in generative AIs | Rino Saiga et.al. | 2410.20735 | null |
2024-10-28 | CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians | Chongjian Ge et.al. | 2410.20723 | null |
2024-10-28 | Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design | Xiangxin Zhou et.al. | 2410.20688 | link |
2024-10-28 | Video to Video Generative Adversarial Network for Few-shot Learning Based on Policy Gradient | Yintai Ma et.al. | 2410.20657 | null |
2024-10-29 | TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation | Juntong Shi et.al. | 2410.20626 | link |
2024-10-27 | Generator Matching: Generative modeling with arbitrary Markov processes | Peter Holderrieth et.al. | 2410.20587 | null |
2024-10-27 | ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation | Zongyi Li et.al. | 2410.20502 | null |
2024-11-01 | GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation | Phillip Y. Lee et.al. | 2410.20474 | link |
2024-10-27 | Hamiltonian Score Matching and Generative Flows | Peter Holderrieth et.al. | 2410.20470 | null |
2024-10-27 | Lodge++: High-quality and Long Dance Generation with Vivid Choreography Patterns | Ronghui Li et.al. | 2410.20389 | null |
2024-11-01 | Conditional GAN for Enhancing Diffusion Models in Efficient and Authentic Global Gesture Generation from Audios | Yongkang Cheng et.al. | 2410.20359 | null |
2024-10-26 | Classification under strategic adversary manipulation using pessimistic bilevel optimisation | David Benfield et.al. | 2410.20284 | null |
2024-10-26 | MarDini: Masked Autoregressive Diffusion for Video Generation at Scale | Haozhe Liu et.al. | 2410.20280 | null |
2024-10-26 | Equivariant Blurring Diffusion for Hierarchical Molecular Conformer Generation | Jiwoong Park et.al. | 2410.20255 | link |
2024-10-26 | An Efficient Watermarking Method for Latent Diffusion Models via Low-Rank Adaptation | Dongdong Lin et.al. | 2410.20202 | null |
2024-11-06 | Copyright-Aware Incentive Scheme for Generative Art Models Using Hierarchical Reinforcement Learning | Zhuan Shi et.al. | 2410.20180 | null |
2024-10-26 | Image Generation from Image Captioning – Invertible Approach | Nandakishore S Menon et.al. | 2410.20171 | null |
2024-10-26 | Diff-CXR: Report-to-CXR generation through a disease-knowledge enhanced diffusion model | Peng Huang et.al. | 2410.20165 | null |
2024-10-26 | Prompt Diffusion Robustifies Any-Modality Prompt Learning | Yingjun Du et.al. | 2410.20164 | null |
2024-10-26 | Your Image is Secretly the Last Frame of a Pseudo Video | Wenlong Chen et.al. | 2410.20158 | null |
2024-10-26 | Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models | Liulei Li et.al. | 2410.20155 | null |
2024-10-26 | GiVE: Guiding Visual Encoder to Perceive Overlooked Information | Junjie Li et.al. | 2410.20109 | null |
2024-10-26 | Super-resolved virtual staining of label-free tissue using diffusion models | Yijie Zhang et.al. | 2410.20073 | null |
2024-10-26 | SCube: Instant Large-Scale Scene Reconstruction using VoxSplats | Xuanchi Ren et.al. | 2410.20030 | null |
2024-10-26 | GHIL-Glue: Hierarchical Control with Filtered Subgoal Images | Kyle B. Hatch et.al. | 2410.20018 | null |
2024-10-25 | Maximizing User Engagement in Social Networks: A Game-Theoretic Approach to Network Participation and Resource Sharing | Ahmed Luqman et.al. | 2410.19966 | null |
2024-10-25 | Gravitational-Wave Parameter Estimation in non-Gaussian noise using Score-Based Likelihood Characterization | Ronan Legin et.al. | 2410.19956 | null |
2024-10-18 | Automating Video Thumbnails Selection and Generation with Multimodal and Multistage Analysis | Elia Fantini et.al. | 2410.19825 | null |
2024-10-18 | The impact of a wind switch on the stability of traveling fronts in a reaction-diffusion model of fire propagation | Olivia Chandrasekhar et.al. | 2410.19824 | null |
2024-10-16 | Stable Diffusion with Continuous-time Neural Network | Andras Horvath et.al. | 2410.19798 | null |
2024-10-15 | DiffGAN: A Test Generation Approach for Differential Testing of Deep Neural Networks | Zohreh Aghababaeyan et.al. | 2410.19794 | null |
2024-10-14 | How to Backdoor Consistency Models? | Chengen Wang et.al. | 2410.19785 | link |
2024-10-25 | Adversarial Environment Design via Regret-Guided Diffusion Models | Hojun Chung et.al. | 2410.19715 | null |
2024-10-30 | DiffGS: Functional Gaussian Splatting Diffusion | Junsheng Zhou et.al. | 2410.19657 | null |
2024-10-25 | Diffusion models for lattice gauge field simulations | Qianteng Zhu et.al. | 2410.19602 | null |
2024-10-25 | Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series | Ilan Naiman et.al. | 2410.19538 | null |
2024-10-25 | Ensemble Data Assimilation for Particle-based Methods | Marius Duvillard et.al. | 2410.19525 | null |
2024-10-28 | NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction | Zixuan Gong et.al. | 2410.19452 | link |
2024-10-25 | Learned Reference-based Diffusion Sampling for multi-modal distributions | Maxence Noble et.al. | 2410.19449 | null |
2024-10-25 | Generative Diffusion Models for Sequential Recommendations | Sharare Zolghadr et.al. | 2410.19429 | null |
2024-10-28 | KAHANI: Culturally-Nuanced Visual Storytelling Pipeline for Non-Western Cultures | Hamna et.al. | 2410.19419 | null |
2024-10-25 | FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality | Zhengyao Lv et.al. | 2410.19355 | null |
2024-10-25 | High Resolution Seismic Waveform Generation using Denoising Diffusion | Andreas Bergmeister et.al. | 2410.19343 | null |
2024-10-25 | Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion | Emiel Hoogeboom et.al. | 2410.19324 | null |
2024-10-25 | A prescriptive theory for brain-like inference | Hadi Vafaii et.al. | 2410.19315 | null |
2024-10-25 | Flow Generator Matching | Zemin Huang et.al. | 2410.19310 | null |
2024-10-25 | Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning | Yujian Liu et.al. | 2410.19290 | link |
2024-10-25 | A Flow-based Truncated Denoising Diffusion Model for Super-resolution Magnetic Resonance Spectroscopic Imaging | Siyuan Dong et.al. | 2410.19288 | null |
2024-10-25 | Learning Diffusion Policies from Demonstrations For Compliant Contact-rich Manipulation | Malek Aburub et.al. | 2410.19235 | null |
2024-10-24 | Structured Diffusion Models with Mixture of Gaussians as Prior Distribution | Nanshan Jia et.al. | 2410.19149 | null |
2024-10-28 | BIFRÖST: 3D-Aware Image compositing with Language Instructions | Lingxiao Li et.al. | 2410.19079 | link |
2024-11-04 | Framer: Interactive Frame Interpolation | Wen Wang et.al. | 2410.18978 | null |
2024-10-24 | MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms | Ling-Hao Chen et.al. | 2410.18977 | null |
2024-10-24 | 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation | Hansheng Chen et.al. | 2410.18974 | link |
2024-10-24 | On the Crucial Role of Initialization for Matrix Factorization | Bingcong Li et.al. | 2410.18965 | null |
2024-10-24 | Stable Consistency Tuning: Understanding and Improving Consistency Models | Fu-Yun Wang et.al. | 2410.18958 | link |
2024-10-24 | Generation of synthetic financial time series by diffusion models | Tomonori Takahashi et.al. | 2410.18897 | null |
2024-10-24 | Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences | Weijian Luo et.al. | 2410.18881 | null |
2024-10-24 | The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods | Linda Laurier et.al. | 2410.18866 | null |
2024-10-24 | Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation | Xiaoyu Zhang et.al. | 2410.18830 | null |
2024-10-29 | Towards Visual Text Design Transfer Across Languages | Yejin Choi et.al. | 2410.18823 | null |
2024-10-24 | Fast constrained sampling in pre-trained diffusion models | Alexandros Graikos et.al. | 2410.18804 | null |
2024-10-24 | Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances | Shilin Lu et.al. | 2410.18775 | link |
2024-10-28 | Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing | Haonan Lin et.al. | 2410.18756 | null |
2024-10-24 | Rectified Diffusion Guidance for Conditional Generation | Mengfei Xia et.al. | 2410.18737 | null |
2024-10-24 | Retrieval-Augmented Diffusion Models for Time Series Forecasting | Jingwei Liu et.al. | 2410.18712 | link |
2024-10-24 | Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model | Ali Hamza et.al. | 2410.18678 | null |
2024-10-29 | DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation | Yuang Ai et.al. | 2410.18666 | link |
2024-10-25 | Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Model | Jinxu Lin et.al. | 2410.18639 | null |
2024-10-24 | FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation | Christopher T. H Teo et.al. | 2410.18615 | null |
2024-10-24 | SMITE: Segment Me In TimE | Amirhossein Alimohammadi et.al. | 2410.18538 | link |
2024-10-24 | Beyond Color and Lines: Zero-Shot Style-Specific Image Variations with Coordinated Semantics | Jinghao Hu et.al. | 2410.18537 | null |
2024-10-24 | Scaling up Masked Diffusion Models on Text | Shen Nie et.al. | 2410.18514 | link |
2024-10-24 | Generalized conditional gradient methods for multiobjective composite optimization problems with H{ö}lder condition | Wang Chen et.al. | 2410.18465 | null |
2024-10-24 | FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling | Zhengqiang Zhang et.al. | 2410.18410 | link |
2024-10-25 | Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing | Dongliang Guo et.al. | 2410.18267 | null |
2024-10-23 | DMTG: A Human-Like Mouse Trajectory Generation Bot Based on Entropy-Controlled Diffusion Networks | Jiahua Liu et.al. | 2410.18233 | null |
2024-10-23 | DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes | Hengwei Bian et.al. | 2410.18084 | null |
2024-10-23 | Prioritized Generative Replay | Renhao Wang et.al. | 2410.18082 | null |
2024-10-23 | WorldSimBench: Towards Video Generation Models as World Simulators | Yiran Qin et.al. | 2410.18072 | null |
2024-10-23 | Training Free Guided Flow Matching with Optimal Control | Luran Wang et.al. | 2410.18070 | null |
2024-10-30 | Scalable Ranked Preference Optimization for Text-to-Image Generation | Shyamgopal Karthik et.al. | 2410.18013 | null |
2024-10-23 | Optical Generative Models | Shiqi Chen et.al. | 2410.17970 | null |
2024-10-23 | A Wavelet Diffusion GAN for Image Super-Resolution | Lorenzo Aloisi et.al. | 2410.17966 | null |
2024-10-23 | Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation | Wenfang Yao et.al. | 2410.17918 | link |
2024-10-23 | Scaling Diffusion Language Models via Adaptation from Autoregressive Models | Shansan Gong et.al. | 2410.17891 | link |
2024-10-23 | TAGE: Trustworthy Attribute Group Editing for Stable Few-shot Image Generation | Ruicheng Zhang et.al. | 2410.17855 | null |
2024-10-23 | Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech | Danilo de Oliveira et.al. | 2410.17834 | null |
2024-10-23 | PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation | Feiyan Feng et.al. | 2410.17812 | null |
2024-10-23 | AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution | Yuanting Fan et.al. | 2410.17752 | null |
2024-10-30 | VISAGE: Video Synthesis using Action Graphs for Surgery | Yousef Yeganeh et.al. | 2410.17751 | null |
2024-10-23 | Strategic Irreversible Investment | Jan-Henrik Steg et.al. | 2410.17673 | null |
2024-10-23 | Deep Generative Models for 3D Medical Image Synthesis | Paul Friedrich et.al. | 2410.17664 | null |
2024-10-23 | Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation | Muquan Li et.al. | 2410.17606 | link |
2024-10-23 | How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization? | Jiahua Dong et.al. | 2410.17594 | link |
2024-10-23 | GDDA: Semantic OOD Detection on Graphs under Covariate Shift via Score-Based Diffusion Models | Zhixia He et.al. | 2410.17526 | null |
2024-10-23 | Physics-driven AI for Channel Estimation in Cellular Network | Xiaoqian Qi et.al. | 2410.17525 | null |
2024-10-23 | Diffusion Priors for Variational Likelihood Estimation and Image Denoising | Jun Cheng et.al. | 2410.17521 | link |
2024-10-31 | Univariate Conditional Variational Autoencoder for Morphogenic Patterns Design in Frontal Polymerization-Based Manufacturing | Qibang Liu et.al. | 2410.17518 | link |
2024-10-22 | EEG-DIF: Early Warning of Epileptic Seizures through Generative Diffusion Model-based Multi-channel EEG Signals Forecasting | Zekun Jiang et.al. | 2410.17343 | link |
2024-10-22 | Offline Evaluation of Set-Based Text-to-Image Generation | Negar Arabzadeh et.al. | 2410.17331 | link |
2024-10-19 | Stool Recognition for Colorectal Cancer Detection through Deep Learning | Glenda Hui En Tan et.al. | 2410.17288 | null |
2024-10-22 | Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu et.al. | 2410.17251 | link |
2024-10-22 | Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding | Yasha Ektefaie et.al. | 2410.17173 | link |
2024-10-22 | Insights on Disagreement Patterns in Multimodal Safety Perception across Diverse Rater Groups | Charvi Rastogi et.al. | 2410.17032 | null |
2024-10-22 | IdenBAT: Disentangled Representation Learning for Identity-Preserved Brain Age Transformation | Junyeong Maeng et.al. | 2410.16945 | link |
2024-10-22 | DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization | Haowei Zhu et.al. | 2410.16942 | null |
2024-10-22 | Hierarchical Clustering for Conditional Diffusion in Image Generation | Jorge da Silva Goncalves et.al. | 2410.16910 | link |
2024-10-22 | VistaDream: Sampling multiview consistent images for single-view scene reconstruction | Haiping Wang et.al. | 2410.16892 | null |
2024-10-22 | MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model | Meng Xu et.al. | 2410.16840 | null |
2024-10-22 | AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models | Yongjian Wu et.al. | 2410.16820 | link |
2024-10-22 | Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection | Laurent Colbois et.al. | 2410.16802 | link |
2024-10-22 | One-Step Diffusion Distillation through Score Implicit Matching | Weijian Luo et.al. | 2410.16794 | link |
2024-10-22 | LLM-Assisted Red Teaming of Diffusion Models through “Failures Are Fated, But Can Be Faded” | Som Sagar et.al. | 2410.16738 | null |
2024-10-22 | Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing | Runpu Wei et.al. | 2410.16732 | null |
2024-10-22 | DiffusionSeeder: Seeding Motion Optimization with Diffusion for Rapid Motion Planning | Huang Huang et.al. | 2410.16727 | null |
2024-10-22 | Progressive Compositionality In Text-to-Image Generative Models | Xu Han et.al. | 2410.16719 | link |
2024-10-22 | DARE: Diffusion Policy for Autonomous Robot Exploration | Yuhong Cao et.al. | 2410.16687 | null |
2024-10-22 | NucleiMix: Realistic Data Augmentation for Nuclei Instance Segmentation | Jiamu Wang et.al. | 2410.16671 | null |
2024-10-22 | Dual-Model Defense: Safeguarding Diffusion Models from Membership Inference Attacks through Disjoint Data Splitting | Bao Q. Tran et.al. | 2410.16657 | null |
2024-10-22 | TopoDiffusionNet: A Topology-aware Diffusion Model | Saumya Gupta et.al. | 2410.16646 | link |
2024-10-22 | GALA: Graph Diffusion-based Alignment with Jigsaw for Source-free Domain Adaptation | Junyu Luo et.al. | 2410.16606 | link |
2024-10-21 | Large Body Language Models | Saif Punjwani et.al. | 2410.16533 | null |
2024-10-30 | SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects | Jiayi Liu et.al. | 2410.16499 | null |
2024-10-25 | AttentionPainter: An Efficient and Adaptive Stroke Predictor for Scene Painting | Yizhe Tang et.al. | 2410.16418 | null |
2024-10-21 | On conditional diffusion models for PDE simulations | Aliaksandra Shysheya et.al. | 2410.16415 | link |
2024-10-21 | Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and Perspectives | Angelo Moroncelli et.al. | 2410.16411 | link |
2024-10-21 | Exploring how deep learning decodes anomalous diffusion via Grad-CAM | Jaeyong Bae et.al. | 2410.16345 | link |
2024-10-21 | MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors | Honghua Chen et.al. | 2410.16272 | null |
2024-10-21 | 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | Xi Liu et.al. | 2410.16266 | null |
2024-10-21 | Elucidating the design space of language models for image generation | Xuantong Liu et.al. | 2410.16257 | link |
2024-10-21 | A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data | Simon Deltadahl et.al. | 2410.16177 | null |
2024-10-22 | Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models | Giannis Daras et.al. | 2410.16152 | null |
2024-10-21 | SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation | Xinyi Zhou et.al. | 2410.16119 | null |
2024-10-21 | Continuous Speech Synthesis using per-token Latent Diffusion | Arnon Turetzky et.al. | 2410.16048 | null |
2024-10-22 | CamI2V: Camera-Controlled Image-to-Video Diffusion Model | Guangcong Zheng et.al. | 2410.15957 | link |
2024-10-21 | TexPro: Text-guided PBR Texturing with Procedural Material Modeling | Ziqiang Dang et.al. | 2410.15891 | null |
2024-10-22 | Three connected problems: principal with multiple agents in cooperation, Principal–Agent with Mckean–Vlasov dynamics and multitask Principal–Agent | Mao Fabrice Djete et.al. | 2410.15818 | null |
2024-10-21 | Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces | Jifeng Hu et.al. | 2410.15698 | null |
2024-10-29 | Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation | Anh Bui et.al. | 2410.15618 | link |
2024-10-20 | Data Augmentation via Diffusion Model to Enhance AI Fairness | Christina Hastings Blow et.al. | 2410.15470 | null |
2024-10-20 | EVA: An Embodied World Model for Future Video Anticipation | Xiaowei Chi et.al. | 2410.15461 | null |
2024-10-20 | Allegro: Open the Black Box of Commercial-Level Video Generation Model | Yuan Zhou et.al. | 2410.15458 | link |
2024-10-20 | MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications | Yongrui Yu et.al. | 2410.15432 | null |
2024-10-20 | FrameBridge: Improving Image-to-Video Generation with Bridge Models | Yuji Wang et.al. | 2410.15371 | null |
2024-10-20 | ConSinger: Efficient High-Fidelity Singing Voice Generation with Minimal Steps | Yulin Song et.al. | 2410.15342 | null |
2024-10-20 | Diffusion-PINN Sampler | Zhekun Shi et.al. | 2410.15336 | null |
2024-10-20 | FoMo: A Foundation Model for Mobile Traffic Forecasting with Diffusion Model | Haoye Chai et.al. | 2410.15322 | null |
2024-10-20 | Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image | Yu Zhao et.al. | 2410.15312 | null |
2024-10-20 | FastSTI: A Fast Conditional Pseudo Numerical Diffusion Model for Spatio-temporal Traffic Data Imputation | Shaokang Cheng et.al. | 2410.15248 | null |
2024-10-19 | Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization | Zichen Wang et.al. | 2410.15040 | null |
2024-10-19 | DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer | Ying Hu et.al. | 2410.15007 | link |
2024-10-19 | How Many Van Goghs Does It Take to Van Gogh? Finding the Imitation Threshold | Sahil Verma et.al. | 2410.15002 | link |
2024-10-19 | SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning | Zhewei Dai et.al. | 2410.14987 | link |
2024-10-19 | Attack as Defense: Run-time Backdoor Implantation for Image Content Protection | Haichuan Zhang et.al. | 2410.14966 | link |
2024-10-19 | Straightness of Rectified Flow: A Theoretical Insight into Wasserstein Convergence | Vansh Bansal et.al. | 2410.14949 | link |
2024-10-19 | ImmerseDiffusion: A Generative Spatial Audio Latent Diffusion Model | Mojtaba Heydari et.al. | 2410.14945 | null |
2024-10-31 | Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step | Mingyuan Zhou et.al. | 2410.14919 | link |
2024-10-18 | Truncated Consistency Models | Sangyun Lee et.al. | 2410.14895 | null |
2024-10-18 | Mitigating Embedding Collapse in Diffusion Models for Categorical Data | Bac Nguyen et.al. | 2410.14758 | null |
2024-10-16 | On the Relation Between Linear Diffusion and Power Iteration | Dana Weitzner et.al. | 2410.14730 | null |
2024-10-10 | Animating the Past: Reconstruct Trilobite via Video Generation | Xiaoran Wu et.al. | 2410.14715 | null |
2024-10-09 | G2D2: Gradient-guided Discrete Diffusion for image inverse problem solving | Naoki Murata et.al. | 2410.14710 | null |
2024-10-18 | BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities | Shaozhe Hao et.al. | 2410.14672 | link |
2024-10-18 | Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior | Calvin-Khang Ta et.al. | 2410.14540 | null |
2024-10-18 | LEAD: Latent Realignment for Human Motion Diffusion | Nefeli Andreou et.al. | 2410.14508 | null |
2024-10-18 | Reinforcement Learning in Non-Markov Market-Making | Luca Lalor et.al. | 2410.14504 | null |
2024-10-18 | ANT: Adaptive Noise Schedule for Time Series Diffusion Models | Seunghan Lee et.al. | 2410.14488 | link |
2024-10-18 | DRL Optimization Trajectory Generation via Wireless Network Intent-Guided Diffusion Models for Optimizing Resource Allocation | Junjie Wu et.al. | 2410.14481 | null |
2024-10-18 | FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models | Rui Hu et.al. | 2410.14429 | null |
2024-10-18 | Dynamic Negative Guidance of Diffusion Models | Felix Koulischer et.al. | 2410.14398 | link |
2024-10-18 | HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation | Bo Cheng et.al. | 2410.14324 | link |
2024-10-18 | ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer | Yuhao Wan et.al. | 2410.14279 | link |
2024-10-18 | HYPNOS : Highly Precise Foreground-focused Diffusion Finetuning for Inanimate Objects | Oliverio Theophilus Nathanael et.al. | 2410.14265 | null |
2024-10-18 | ERDDCI: Exact Reversible Diffusion via Dual-Chain Inversion for High-Quality Image Editing | Jimin Dai et.al. | 2410.14247 | null |
2024-10-18 | Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers | Runjia Li et.al. | 2410.14237 | null |
2024-10-18 | Text-to-Image Representativity Fairness Evaluation Framework | Asma Yamani et.al. | 2410.14201 | null |
2024-10-29 | Heavy-Tailed Diffusion Models | Kushagra Pandey et.al. | 2410.14171 | null |
2024-10-18 | Personalized Image Generation with Large Multimodal Models | Yiyan Xu et.al. | 2410.14170 | link |
2024-10-18 | Assessing Open-world Forgetting in Generative Image Model Customization | Héctor Laria et.al. | 2410.14159 | null |
2024-10-18 | Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning | Jiacheng Ye et.al. | 2410.14157 | link |
2024-10-26 | Extreme Precipitation Nowcasting using Multi-Task Latent Diffusion Models | Li Chaorong et.al. | 2410.14103 | null |
2024-10-17 | MMAD-Purify: A Precision-Optimized Framework for Efficient and Scalable Multi-Modal Attacks | Xinxin Liu et.al. | 2410.14089 | null |
2024-10-17 | DiFuseR: A Distributed Sketch-based Influence Maximization Algorithm for GPUs | Gökhan Göktürk et.al. | 2410.14047 | null |
2024-10-17 | Latent Weight Diffusion: Generating Policies from Trajectories | Shashank Hegde et.al. | 2410.14040 | null |
2024-10-17 | Ensemble-based, large-eddy reconstruction of wind turbine inflow in a near-stationary atmospheric boundary layer through generative artificial intelligence | Alex Rybchuk et.al. | 2410.14024 | null |
2024-10-17 | Vision-Language-Action Model and Diffusion Policy Switching Enables Dexterous Control of an Anthropomorphic Hand | Cheng Pan et.al. | 2410.14022 | null |
2024-10-17 | On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow | Tonghan Wang et.al. | 2410.13953 | null |
2024-10-17 | Inference of morphology and dynamical state of nearby $Planck$ -SZ galaxy clusters with Zernike polynomials | Valentina Capalbo et.al. | 2410.13929 | null |
2024-10-17 | FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model | ZiDong Wang et.al. | 2410.13925 | link |
2024-10-17 | ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding | Guangda Ji et.al. | 2410.13924 | link |
2024-10-15 | A Formal Framework for Assessing and Mitigating Emergent Security Risks in Generative AI Models: Bridging Theory and Dynamic Risk Mitigation | Aviral Srivastava et.al. | 2410.13897 | null |
2024-10-12 | Turing chemotactic instability in an HIV model | Florinda Capone et.al. | 2410.13889 | null |
2024-10-17 | Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | Lijie Fan et.al. | 2410.13863 | null |
2024-10-21 | PUMA: Empowering Unified MLLM with Multi-granular Visual Generation | Rongyao Fang et.al. | 2410.13861 | link |
2024-10-17 | Diffusing States and Matching Scores: A New Framework for Imitation Learning | Runzhe Wu et.al. | 2410.13855 | link |
2024-10-24 | Influence Functions for Scalable Data Attribution in Diffusion Models | Bruno Mlodozeniec et.al. | 2410.13850 | null |
2024-10-27 | VidPanos: Generative Panoramic Videos from Casual Panning Videos | Jingwei Ma et.al. | 2410.13832 | null |
2024-10-17 | DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control | Yujie Wei et.al. | 2410.13830 | null |
2024-10-17 | Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning | Xiaodan Xing et.al. | 2410.13823 | link |
2024-10-17 | ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution | Junhao Gu et.al. | 2410.13807 | null |
2024-10-17 | Probing the Latent Hierarchical Structure of Data via Diffusion Models | Antonio Sclocchi et.al. | 2410.13770 | null |
2024-10-17 | Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers | Yuchen Liang et.al. | 2410.13746 | null |
2024-10-17 | Improved Convergence Rate for Diffusion Probabilistic Models | Gen Li et.al. | 2410.13738 | null |
2024-10-18 | DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation | Hanbo Cheng et.al. | 2410.13726 | link |
2024-10-17 | Movie Gen: A Cast of Media Foundation Models | Adam Polyak et.al. | 2410.13720 | link |
2024-10-18 | Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion | Yijun Liang et.al. | 2410.13674 | link |
2024-10-17 | Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design | Chenyu Wang et.al. | 2410.13643 | link |
2024-10-17 | LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning | Yiming Shi et.al. | 2410.13618 | link |
2024-10-17 | Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control | Xinyi Yuan et.al. | 2410.13586 | null |
2024-10-21 | DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | Guosheng Zhao et.al. | 2410.13571 | null |
2024-10-17 | Can Medical Vision-Language Pre-training Succeed with Purely Synthetic Data? | Che Liu et.al. | 2410.13523 | null |
2024-10-17 | Solving Prior Distribution Mismatch in Diffusion Models via Optimal Transport | Zhanpeng Wang et.al. | 2410.13431 | null |
2024-10-17 | MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models | Donghao Zhou et.al. | 2410.13370 | null |
2024-10-17 | DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone | Hongfan Gao et.al. | 2410.13338 | null |
2024-10-17 | Assessing the techno-economic benefits of LEMs for different grid topologies and prosumer shares | Markus Doepfert et.al. | 2410.13330 | link |
2024-10-17 | An Online Learning Approach to Prompt-based Selection of Generative Models | Xiaoyan Hu et.al. | 2410.13287 | null |
2024-10-31 | FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling | Jintao Zhang et.al. | 2410.13253 | link |
2024-10-18 | Fundus to Fluorescein Angiography Video Generation as a Retinal Generative Foundation Model | Weiyi Zhang et.al. | 2410.13242 | null |
2024-10-17 | AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations | Qian Tao et.al. | 2410.13212 | null |
2024-10-17 | Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration | Yun-Yen Chuang et.al. | 2410.13201 | link |
2024-10-17 | TCP-Diffusion: A Multi-modal Diffusion Model for Global Tropical Cyclone Precipitation Forecasting with Change Awareness | Cheng Huang et.al. | 2410.13175 | null |
2024-10-17 | Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance | Jiwan Hur et.al. | 2410.13136 | link |
2024-10-17 | Boosting Imperceptibility of Stable Diffusion-based Adversarial Examples Generation with Momentum | Nashrah Haque et.al. | 2410.13122 | link |
2024-10-17 | Preference Diffusion for Recommendation | Shuo Liu et.al. | 2410.13117 | link |
2024-10-17 | Controllable Generation via Locally Constrained Resampling | Kareem Ahmed et.al. | 2410.13111 | null |
2024-10-16 | An end-to-end generative diffusion model for heavy-ion collisions | Jing-An Sun et.al. | 2410.13069 | null |
2024-10-16 | Geometric Trajectory Diffusion Models | Jiaqi Han et.al. | 2410.13027 | link |
2024-10-16 | Super-resolving Real-world Image Illumination Enhancement: A New Dataset and A Conditional Diffusion Model | Yang Liu et.al. | 2410.12961 | null |
2024-10-16 | Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar | Aayush Agrawal et.al. | 2410.12953 | null |
2024-10-09 | TextLap: Customizing Language Models for Text-to-Layout Planning | Jian Chen et.al. | 2410.12844 | link |
2024-10-18 | UniAutoML: A Human-Centered Framework for Unified Discriminative and Generative AutoML with Large Language Models | Jiayi Guo et.al. | 2410.12841 | link |
2024-10-16 | Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts | Hongcheng Gao et.al. | 2410.12777 | link |
2024-10-16 | SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation | Jaehong Yoon et.al. | 2410.12761 | null |
2024-10-16 | Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization | Xingqi Wang et.al. | 2410.12700 | link |
2024-10-16 | AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing | DuoSheng Chen et.al. | 2410.12696 | link |
2024-10-16 | 3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation | Dewei Zhou et.al. | 2410.12669 | link |
2024-10-16 | One Step Diffusion via Shortcut Models | Kevin Frans et.al. | 2410.12557 | link |
2024-10-16 | Evaluating Utility of Memory Efficient Medical Image Generation: A Study on Lung Nodule Segmentation | Kathrin Khadra et.al. | 2410.12542 | null |
2024-10-16 | Disentangling data distribution for Federated Learning | Xinyuan Zhao et.al. | 2410.12530 | null |
2024-10-16 | Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing | Mingce Guo et.al. | 2410.12526 | null |
2024-10-16 | Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective | Yongxin Zhu et.al. | 2410.12490 | link |
2024-10-16 | Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks | Pranjali Pathre et.al. | 2410.12432 | link |
2024-10-16 | Generalized Smooth Stochastic Variational Inequalities: Almost Sure Convergence and Convergence Rates | Daniil Vankov et.al. | 2410.12334 | null |
2024-10-25 | FaceChain-FACT: Face Adapter with Decoupled Training for Identity-preserved Personalization | Cheng Yu et.al. | 2410.12312 | link |
2024-10-16 | DaDiff: Domain-aware Diffusion Model for Nighttime UAV Tracking | Haobo Zuo et.al. | 2410.12270 | link |
2024-10-16 | FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation | Huadai Liu et.al. | 2410.12266 | null |
2024-10-16 | Facing Identity: The Formation and Performance of Identity via Face-Based Artificial Intelligence Technologies | Wells Lucas Santo et.al. | 2410.12148 | null |
2024-10-16 | Preference Optimization with Multi-Sample Comparisons | Chaoqi Wang et.al. | 2410.12138 | null |
2024-10-15 | DDIL: Improved Diffusion Distillation With Imitation Learning | Risheek Garrepalli et.al. | 2410.11971 | null |
2024-10-15 | CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning | Qingqing Cao et.al. | 2410.11963 | null |
2024-10-15 | ChatHouseDiffusion: Prompt-Guided Generation and Editing of Floor Plans | Sizhong Qin et.al. | 2410.11908 | link |
2024-10-10 | Neural Metamorphosis | Xingyi Yang et.al. | 2410.11878 | null |
2024-10-15 | High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion | Junhwa Hur et.al. | 2410.11838 | null |
2024-10-15 | On the Effectiveness of Dataset Alignment for Fake Image Detection | Anirudh Sundara Rajan et.al. | 2410.11835 | null |
2024-10-15 | Bayesian Experimental Design via Contrastive Diffusions | Jacopo Iollo et.al. | 2410.11826 | link |
2024-10-15 | KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities | Hsin-Ping Huang et.al. | 2410.11824 | null |
2024-10-15 | Improving Long-Text Alignment for Text-to-Image Diffusion Models | Luping Liu et.al. | 2410.11817 | link |
2024-10-15 | SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing | Zhiyuan Zhang et.al. | 2410.11815 | null |
2024-10-16 | Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices | Zhiyuan Ma et.al. | 2410.11795 | null |
2024-10-15 | Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems | Jason Hu et.al. | 2410.11730 | null |
2024-10-22 | Generative Image Steganography Based on Point Cloud | Zhong Yangjie et.al. | 2410.11673 | null |
2024-10-15 | Narrowband gamma-ray radiation generation by acoustically driven crystalline undulators | Konstantinos Kaleris et.al. | 2410.11621 | null |
2024-10-15 | DeformPAM: Data-Efficient Learning for Long-horizon Deformable Object Manipulation via Preference-based Action Alignment | Wendi Chen et.al. | 2410.11584 | link |
2024-10-15 | Riemann-Liouville fractional Brownian motion with random Hurst exponent | Hubert Woszczek et.al. | 2410.11546 | null |
2024-10-15 | InvSeg: Test-Time Prompt Inversion for Semantic Segmentation | Jiayi Lin et.al. | 2410.11473 | null |
2024-10-15 | A Simple Approach to Unifying Diffusion-based Conditional Generation | Xirui Li et.al. | 2410.11439 | null |
2024-10-15 | Advection-nonlinear-diffusion model of flare accelerated electron transport in Type III solar radio bursts | Eduard P. Kontar et.al. | 2410.11409 | null |
2024-10-15 | M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes | Sixu Yan et.al. | 2410.11402 | null |
2024-10-15 | DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Jaehyun Park et.al. | 2410.11338 | null |
2024-10-15 | Evolutionary Retrofitting | Mathurin Videau et.al. | 2410.11330 | null |
2024-10-15 | Diff-SAGe: End-to-End Spatial Audio Generation Using Diffusion Models | Saksham Singh Kushwaha et.al. | 2410.11299 | null |
2024-10-15 | Shallow diffusion networks provably learn hidden low-dimensional structure | Nicholas M. Boffi et.al. | 2410.11275 | null |
2024-10-15 | Learning Diffusion Model from Noisy Measurement using Principled Expectation-Maximization Method | Weimin Bai et.al. | 2410.11241 | null |
2024-10-15 | Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling | Guiyu Zhang et.al. | 2410.11236 | null |
2024-10-15 | DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models | Zhengyang Yu et.al. | 2410.11208 | link |
2024-10-15 | Free Hunch: Denoiser Covariance Estimation for Diffusion Models Without Extra Costs | Severi Rissanen et.al. | 2410.11149 | null |
2024-10-14 | DMDSpeech: Distilled Diffusion Model Surpassing The Teacher in Zero-shot Speech Synthesis via Direct Metric Optimization | Yingahao Aaron Li et.al. | 2410.11097 | null |
2024-10-14 | Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models | Cheng Lu et.al. | 2410.11081 | null |
2024-10-14 | Incorporating Task Progress Knowledge for Subgoal Generation in Robotic Manipulation through Image Edits | Xuhui Kang et.al. | 2410.11013 | null |
2024-10-14 | Cultural Heritage 3D Reconstruction with Diffusion Networks | Pablo Jaramillo et.al. | 2410.10927 | link |
2024-10-14 | A reaction-diffusion model for Mycobacterium tuberculosis infection | C. Accarino et.al. | 2410.10918 | null |
2024-10-25 | Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models | Jingzhi Bao et.al. | 2410.10821 | link |
2024-10-14 | When Does Perceptual Alignment Benefit Vision Representations? | Shobhita Sundaram et.al. | 2410.10817 | null |
2024-10-14 | LVD-2M: A Long-take Video Dataset with Temporally Dense Captions | Tianwei Xiong et.al. | 2410.10816 | link |
2024-10-14 | Depth Any Video with Scalable Synthetic Data | Honghui Yang et.al. | 2410.10815 | link |
2024-10-14 | HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | Haotian Tang et.al. | 2410.10812 | link |
2024-10-14 | TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction | Qingze et.al. | 2410.10804 | link |
2024-10-14 | Boosting Camera Motion Control for Video Diffusion Transformers | Soon Yau Cheong et.al. | 2410.10802 | null |
2024-10-15 | MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling | Jian Yang et.al. | 2410.10798 | null |
2024-10-14 | Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations | Litu Rout et.al. | 2410.10792 | null |
2024-10-14 | ControlMM: Controllable Masked Motion Generation | Ekkasit Pinyoanuntapong et.al. | 2410.10780 | null |
2024-10-14 | Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention | Dejia Xu et.al. | 2410.10774 | null |
2024-10-14 | Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation | Youwei Yu et.al. | 2410.10766 | link |
2024-10-14 | DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships | Zhang Wan et.al. | 2410.10751 | null |
2024-10-14 | FlexGen: Flexible Multi-View Generation from Text and Image Inputs | Xinli Xu et.al. | 2410.10745 | null |
2024-10-14 | Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models | Junyu Chen et.al. | 2410.10733 | link |
2024-10-14 | TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model | Jiazhi Guan et.al. | 2410.10696 | null |
2024-10-14 | Evaluating SQL Understanding in Large Language Models | Ananya Rahaman et.al. | 2410.10680 | null |
2024-10-14 | Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation | Peiwen Sun et.al. | 2410.10676 | null |
2024-10-14 | Generating Model Parameters for Controlling: Parameter Diffusion for Controllable Multi-Task Recommendation | Chenglei Shen et.al. | 2410.10639 | null |
2024-10-20 | SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers | Enze Xie et.al. | 2410.10629 | null |
2024-10-14 | Multilingual Controlled Generation And Gold-Standard-Agnostic Evaluation of Code-Mixed Sentences | Ayushman Gupta et.al. | 2410.10580 | null |
2024-10-14 | ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection | Martin Aubard et.al. | 2410.10554 | link |
2024-10-14 | UniGEM: A Unified Approach to Generation and Property Prediction for Molecules | Shikun Feng et.al. | 2410.10516 | null |
2024-10-14 | Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling | Wenze Liu et.al. | 2410.10511 | link |
2024-10-14 | Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing | Kejie Wang et.al. | 2410.10496 | link |
2024-10-14 | An efficient numerical method for American options and their Greeks under the two-asset Kou jump-diffusion model | Karel J. in ‘t Hout et.al. | 2410.10444 | null |
2024-10-14 | Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models | Boheng Li et.al. | 2410.10437 | link |
2024-10-14 | DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model | Songen Gu et.al. | 2410.10429 | null |
2024-10-14 | Sequential drone routing for data assimilation on a 2D airborne contaminant dispersion problem | Daniele Giovanni Gioia et.al. | 2410.10346 | null |
2024-10-18 | Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective | Xiangru Zhu et.al. | 2410.10291 | link |
2024-10-14 | Saliency Guided Optimization of Diffusion Latents | Xiwen Wang et.al. | 2410.10257 | null |
2024-10-14 | Convergence properties of Markov models for image generation with applications to spin-flip dynamics and to diffusion processes | Cecile Monthus et.al. | 2410.10255 | null |
2024-10-14 | A Geometric Model with Stochastic Error for Abnormal Motion Detection of Portal Crane Bucket Grab | Baichen Yu et.al. | 2410.10246 | null |
2024-10-14 | MagicEraser: Erasing Any Objects via Semantics-Aware Control | Fan Li et.al. | 2410.10207 | link |
2024-10-14 | Identity-Focused Inference and Extraction Attacks on Diffusion Models | Jayneel Vora et.al. | 2410.10177 | null |
2024-10-14 | First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending | Zhenhang Li et.al. | 2410.10168 | link |
2024-10-14 | Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models | Yongjin Yang et.al. | 2410.10166 | link |
2024-10-14 | TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control | Weichao Zeng et.al. | 2410.10133 | link |
2024-10-14 | Generative Deep Learning and Signal Processing for Data Augmentation of Cardiac Auscultation Signals: Improving Model Robustness Using Synthetic Audio | Leigh Abbott et.al. | 2410.10125 | null |
2024-10-16 | MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting | Yue Zhang et.al. | 2410.10122 | link |
2024-10-14 | High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Qian Yu et.al. | 2410.10105 | null |
2024-10-14 | REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation | Zhiyun Song et.al. | 2410.10097 | null |
2024-10-14 | The Ingredients for Robotic Diffusion Transformers | Sudeep Dasari et.al. | 2410.10088 | null |
2024-10-15 | VideoAgent: Self-Improving Video Generation | Achint Soni et.al. | 2410.10076 | link |
2024-10-14 | Learning to Customize Text-to-Image Diffusion In Diverse Context | Taewook Kim et.al. | 2410.10058 | null |
2024-10-14 | DINTR: Tracking via Diffusion-based Interpolation | Pha Nguyen et.al. | 2410.10053 | null |
2024-10-13 | TULIP: Token-length Upgraded CLIP | Ivona Najdenkoska et.al. | 2410.10034 | link |
2024-10-16 | InterMask: 3D Human Interaction Generation via Collaborative Masked Modelling | Muhammad Gohar Javed et.al. | 2410.10010 | link |
2024-10-13 | Variational Diffusion Posterior Sampling with Midpoint Guidance | Badr Moufad et.al. | 2410.09945 | link |
2024-10-13 | Multi class activity classification in videos using Motion History Image generation | Senthilkumar Gopal et.al. | 2410.09902 | link |
2024-10-13 | Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy | Hancheng Ye et.al. | 2410.09873 | link |
2024-10-13 | AuthFace: Towards Authentic Blind Face Restoration with Face-oriented Generative Diffusion Prior | Guoqiang Liang et.al. | 2410.09864 | link |
2024-10-13 | Conditioning 3D Diffusion Models with 2D Images: Towards Standardized OCT Volumes through En Face-Informed Super-Resolution | Coen de Vente et.al. | 2410.09862 | null |
2024-10-13 | EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models | Eungbean Lee et.al. | 2410.09802 | null |
2024-10-20 | Generating Intermediate Representations for Compositional Text-To-Image Generation | Ran Galun et.al. | 2410.09792 | link |
2024-10-13 | No arbitrage and the existence of ACLMMs in general diffusion models | David Criens et.al. | 2410.09789 | null |
2024-10-13 | Generalization of Compositional Tasks with Logical Specification via Implicit Planning | Duo Xu et.al. | 2410.09686 | null |
2024-10-13 | General Constrained Matrix Optimization | Casey Garner et.al. | 2410.09682 | null |
2024-10-12 | DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach | Daniel Gallo Fernández et.al. | 2410.09633 | link |
2024-10-12 | Exploring Behavior-Relevant and Disentangled Neural Dynamics with Generative Diffusion Models | Yule Wang et.al. | 2410.09614 | link |
2024-10-12 | Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors | Hritam Basak et.al. | 2410.09467 | null |
2024-10-12 | CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation | Yifeng Xu et.al. | 2410.09400 | link |
2024-10-12 | ExpGest: Expressive Speaker Generation Using Diffusion Model and Hybrid Audio-Text Guidance | Yongkang Cheng et.al. | 2410.09396 | null |
2024-10-12 | Zero-shot Commonsense Reasoning over Machine Imagination | Hyuntae Park et.al. | 2410.09329 | link |
2024-10-11 | TD-Paint: Faster Diffusion Inpainting Through Time Aware Pixel Conditioning | Tsiry Mayet et.al. | 2410.09306 | null |
2024-10-11 | nach0-pc: Multi-task Language Model with Molecular Point Cloud Encoder | Maksim Kuznetsov et.al. | 2410.09240 | null |
2024-10-11 | RealEra: Semantic-level Concept Erasure via Neighbor-Concept Mining | Yufan Liu et.al. | 2410.09140 | null |
2024-10-10 | IceDiff: High Resolution and High-Quality Sea Ice Forecasting with Generative Diffusion Prior | Jingyi Xu et.al. | 2410.09111 | null |
2024-10-08 | Reflections on Disentanglement and the Latent Space | Ludovica Schaerf et.al. | 2410.09094 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | Linear Convergence of Diffusion Models Under the Manifold Hypothesis | Peter Potaptchik et.al. | 2410.09046 | null |
2024-10-11 | MiRAGeNews: Multimodal Realistic AI-Generated News Detection | Runsheng Huang et.al. | 2410.09045 | link |
2024-10-11 | Semantic Score Distillation Sampling for Compositional Text-to-3D Generation | Ling Yang et.al. | 2410.09009 | link |
2024-10-11 | WaveDiffusion: Exploring Full Waveform Inversion via Joint Diffusion in the Latent Space | Hanchen Wang et.al. | 2410.09002 | null |
2024-10-11 | DiffPO: A causal diffusion model for learning distributions of potential outcomes | Yuchen Ma et.al. | 2410.08924 | null |
2024-10-11 | One-shot Generative Domain Adaptation in 3D GANs | Ziqiang Li et.al. | 2410.08824 | link |
2024-10-11 | Distillation of Discrete Diffusion through Dimensional Correlations | Satoshi Hayakawa et.al. | 2410.08709 | link |
2024-10-14 | Gait Sequence Upsampling using Diffusion Models for Single LiDAR Sensors | Jeongho Ahn et.al. | 2410.08680 | null |
2024-10-11 | E-Motion: Future Motion Simulation via Event Sequence Diffusion | Song Wu et.al. | 2410.08649 | link |
2024-10-11 | Natural Language Induced Adversarial Images | Xiaopei Zhu et.al. | 2410.08620 | link |
2024-10-11 | Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT Prompting | Purushothaman Natarajan et.al. | 2410.08612 | link |
2024-10-11 | Text-To-Image with Generative Adversarial Networks | Mehrshad Momen-Tayefeh et.al. | 2410.08608 | null |
2024-10-11 | Achieving multi uav best viewpoint coordination in obstructed environments | Mirko Baglioni et.al. | 2410.08602 | null |
2024-10-17 | Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models | Pascal Zwick et.al. | 2410.08551 | link |
2024-10-20 | Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities | Abhijay Ghildyal et.al. | 2410.08534 | null |
2024-10-11 | Diffusion Models Need Visual Priors for Image Generation | Xiaoyu Yue et.al. | 2410.08531 | null |
2024-10-11 | AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion | Yuting Xie et.al. | 2410.08453 | null |
2024-10-11 | Symbolic Music Generation with Fine-grained Interactive Textural Guidance | Tingyu Zhu et.al. | 2410.08435 | null |
2024-10-10 | Avoiding mode collapse in diffusion models fine-tuned with reinforcement learning | Roberto Barceló et.al. | 2410.08315 | null |
2024-10-10 | Dynamics of Concept Learning and Compositional Generalization | Yongyi Yang et.al. | 2410.08309 | null |
2024-10-10 | Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis | Jinbin Bai et.al. | 2410.08261 | link |
2024-10-10 | Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content | Qiuheng Wang et.al. | 2410.08260 | null |
2024-10-10 | DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Xiaoxiao He et.al. | 2410.08207 | null |
2024-10-10 | HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation | Shanyan Guan et.al. | 2410.08192 | null |
2024-10-10 | DifFRelight: Diffusion-Based Facial Performance Relighting | Mingming He et.al. | 2410.08188 | null |
2024-10-10 | Scaling Laws For Diffusion Transformers | Zhengyang Liang et.al. | 2410.08184 | null |
2024-10-10 | ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion | Zitian Zhang et.al. | 2410.08168 | link |
2024-10-10 | DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation | Jiatao Gu et.al. | 2410.08159 | null |
2024-10-10 | RayEmb: Arbitrary Landmark Detection in X-Ray Images Using Ray Embedding Subspace | Pragyan Shrestha et.al. | 2410.08152 | link |
2024-10-10 | Progressive Autoregressive Video Diffusion Models | Desai Xie et.al. | 2410.08151 | link |
2024-10-10 | Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction | Jarrid Rector-Brooks et.al. | 2410.08134 | null |
2024-10-10 | Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models | Vinith M. Suriyakumar et.al. | 2410.08074 | null |
2024-10-10 | LADIMO: Face Morph Generation through Biometric Template Inversion with Latent Diffusion | Marcel Grimmer et.al. | 2410.07988 | link |
2024-10-10 | AI Surrogate Model for Distributed Computing Workloads | David K. Park et.al. | 2410.07940 | null |
2024-10-10 | Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image Generative Models | Abhishek Mandal et.al. | 2410.07884 | null |
2024-10-10 | FDDM: Frequency-Decomposed Diffusion Model for Rectum Cancer Dose Prediction in Radiotherapy | Xin Liao et.al. | 2410.07876 | null |
2024-10-10 | RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation | Songming Liu et.al. | 2410.07864 | link |
2024-10-10 | MinorityPrompt: Text to Minority Image Generation via Prompt Optimization | Soobin Um et.al. | 2410.07838 | link |
2024-10-17 | Simulating images of radio galaxies with diffusion models | Tobias Vičánek Martínez et.al. | 2410.07794 | link |
2024-10-10 | HARIVO: Harnessing Text-to-Image Models for Video Generation | Mingi Kwon et.al. | 2410.07763 | null |
2024-10-10 | $\textit{Jump Your Steps}$ : Optimizing Sampling Schedule of Discrete Diffusion Models | Yong-Hyun Park et.al. | 2410.07761 | null |
2024-10-10 | Synthesizing Multi-Class Surgical Datasets with Anatomy-Aware Diffusion Models | Danush Kumar Venkatesh et.al. | 2410.07753 | link |
2024-10-14 | Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation | Jiahao Cui et.al. | 2410.07718 | link |
2024-10-10 | Flow control-oriented coherent mode prediction via Grassmann-kNN manifold learning | Hongfu Zhang et.al. | 2410.07683 | null |
2024-10-12 | Relational Diffusion Distillation for Efficient Image Generation | Weilun Feng et.al. | 2410.07679 | link |
2024-10-10 | MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion | Onkar Susladkar et.al. | 2410.07659 | link |
2024-10-10 | SeMv-3D: Towards Semantic and Mutil-view Consistency simultaneously for General Text-to-3D Generation with Triplane Priors | Xiao Cai et.al. | 2410.07658 | null |
2024-10-10 | FLIER: Few-shot Language Image Models Embedded with Latent Representations | Zhinuo Zhou et.al. | 2410.07648 | null |
2024-10-10 | MorCode: Face Morphing Attack Generation using Generative Codebooks | Aravinda Reddy PN et.al. | 2410.07625 | null |
2024-10-10 | Moyun: A Diffusion-Based Model for Style-Specific Chinese Calligraphy Generation | Kaiyuan Liu et.al. | 2410.07618 | null |
2024-10-10 | Parallel Digital Twin-driven Deep Reinforcement Learning for User Association and Load Balancing in Dynamic Wireless Networks | Zhenyu Tao et.al. | 2410.07611 | null |
2024-10-10 | A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Hoin Jung et.al. | 2410.07593 | link |
2024-10-10 | Conditional Lagrangian Wasserstein Flow for Time Series Imputation | Weizhu Qian et.al. | 2410.07550 | null |
2024-10-15 | I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow | Ruoyi Du et.al. | 2410.07536 | null |
2024-10-09 | Fast reaction limit for a Leslie-Gower model including preys, meso-predators and top-predators | Desvillettes Laurent et.al. | 2410.07474 | null |
2024-10-09 | Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation | Susan Liang et.al. | 2410.07463 | null |
2024-10-09 | An undetectable watermark for generative image models | Sam Gunn et.al. | 2410.07369 | link |
2024-10-11 | Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow | Fu-Yun Wang et.al. | 2410.07303 | link |
2024-10-15 | ReinDiffuse: Crafting Physically Plausible Motions with Reinforced Diffusion Model | Gaoge Han et.al. | 2410.07296 | null |
2024-10-09 | BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models | Fangyikang Wang et.al. | 2410.07273 | null |
2024-10-09 | IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | Xinchen Zhang et.al. | 2410.07171 | link |
2024-10-09 | AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation | Yukang Cao et.al. | 2410.07164 | null |
2024-10-09 | InstructG2I: Synthesizing Images from Multimodal Attributed Graphs | Bowen Jin et.al. | 2410.07157 | link |
2024-10-09 | Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis | Bohan Zeng et.al. | 2410.07155 | link |
2024-10-10 | EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models | Rui Zhao et.al. | 2410.07133 | link |
2024-10-09 | Personalized Visual Instruction Tuning | Renjie Pi et.al. | 2410.07113 | link |
2024-10-09 | Diffusion Density Estimators | Akhil Premkumar et.al. | 2410.06986 | null |
2024-10-09 | Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control | Shimon Vainer et.al. | 2410.06985 | null |
2024-10-09 | Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think | Sihyun Yu et.al. | 2410.06940 | link |
2024-10-09 | Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis | Ahmed Abdullah et.al. | 2410.06841 | null |
2024-10-09 | Diffuse or Confuse: A Diffusion Deepfake Speech Dataset | Anton Firc et.al. | 2410.06796 | link |
2024-10-09 | Patterns of Creativity: How User Input Shapes AI-Generated Visual Diversity | Maria-Teresa De Rosa Palmini et.al. | 2410.06768 | null |
2024-10-09 | Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography | Qianqian Xue et.al. | 2410.06757 | null |
2024-10-18 | Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques | Benyuan Meng et.al. | 2410.06719 | link |
2024-10-09 | Decouple-Then-Merge: Towards Better Training for Diffusion Models | Qianli Ma et.al. | 2410.06664 | null |
2024-10-09 | Chemistry-Inspired Diffusion with Non-Differentiable Guidance | Yuchen Shen et.al. | 2410.06502 | null |
2024-10-09 | HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution | Hua Li et.al. | 2410.06488 | link |
2024-10-09 | On the Solution of Linearized Inverse Scattering Problems in Near-Field Microwave Imaging by Operator Inversion and Matched Filtering | Matthias M. Saurer et.al. | 2410.06465 | null |
2024-10-15 | Generative Artificial Intelligence (GAI) for Mobile Communications: A Diffusion Model Perspective | Xiaoxia Xu et.al. | 2410.06389 | link |
2024-10-08 | SymDiff: Equivariant Diffusion via Stochastic Symmetrisation | Leo Zhang et.al. | 2410.06262 | null |
2024-10-08 | Story-Adapter: A Training-free Iterative Framework for Long Story Visualization | Jiawei Mao et.al. | 2410.06244 | null |
2024-10-16 | BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way | Jiazi Bu et.al. | 2410.06241 | null |
2024-10-08 | SD- $π$ XL: Generating Low-Resolution Quantized Imagery via Score Distillation | Alexandre Binninger et.al. | 2410.06236 | link |
2024-10-08 | GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation | Chi-Lam Cheang et.al. | 2410.06158 | null |
2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | null |
2024-10-08 | Estimating the Number of HTTP/3 Responses in QUIC Using Deep Learning | Barak Gahtan et.al. | 2410.06140 | link |
2024-10-08 | Learning AND-OR Templates for Professional Photograph Parsing and Guidance | Xin Jin et.al. | 2410.06124 | null |
2024-10-08 | AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation | Boyuan Cao et.al. | 2410.06055 | link |
2024-10-08 | HyperDet: Generalizable Detection of Synthesized Images by Generating and Merging A Mixture of Hyper LoRAs | Huangsen Cao et.al. | 2410.06044 | null |
2024-10-10 | Sparse Repellency for Shielded Generation in Text-to-image Diffusion Models | Michael Kirchhof et.al. | 2410.06025 | null |
2024-10-08 | Pyramidal Flow Matching for Efficient Video Generative Modeling | Yang Jin et.al. | 2410.05954 | link |
2024-10-08 | TIMBA: Time series Imputation with Bi-directional Mamba Blocks and Diffusion models | Javier Solís-García et.al. | 2410.05916 | null |
2024-10-16 | Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion | Enrico Ventura et.al. | 2410.05898 | null |
2024-10-08 | Unobserved Object Detection using Generative Models | Subhransu S. Bhattacharjee et.al. | 2410.05869 | link |
2024-10-08 | A noise-corrected Langevin algorithm and sampling by half-denoising | Aapo Hyvärinen et.al. | 2410.05837 | null |
2024-10-17 | SeeClear: Semantic Distillation Enhances Pixel Condensation for Video Super-Resolution | Qi Tang et.al. | 2410.05799 | link |
2024-10-08 | FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance | Ruocheng Wang et.al. | 2410.05791 | null |
2024-10-08 | Training-free Diffusion Model Alignment with Sampling Demons | Po-Hung Yeh et.al. | 2410.05760 | link |
2024-10-08 | DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing | June Suk Choi et.al. | 2410.05694 | link |
2024-10-11 | T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design | Jiachen Li et.al. | 2410.05677 | null |
2024-10-08 | Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning | Saemi Moon et.al. | 2410.05664 | null |
2024-10-08 | ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler | Serin Yang et.al. | 2410.05651 | null |
2024-10-08 | TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation | Gihyun Kwon et.al. | 2410.05591 | link |
2024-10-08 | Gen-Drive: Enhancing Diffusion Generative Driving Policies with Reward Modeling and Reinforcement Learning Fine-tuning | Zhiyu Huang et.al. | 2410.05582 | null |
2024-10-07 | Generative Portrait Shadow Removal | Jae Shin Yoon et.al. | 2410.05525 | null |
2024-10-07 | Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Ziwei Liao et.al. | 2410.05514 | link |
2024-10-07 | Sparsity of Fourier mass of passively advected scalars in the Batchelor regime | Alex Blumenthal et.al. | 2410.05473 | null |
2024-10-07 | Image Watermarks are Removable Using Controllable Regeneration from Clean Noise | Yepeng Liu et.al. | 2410.05470 | link |
2024-10-07 | Continuous Ensemble Weather Forecasting with Diffusion models | Martin Andrae et.al. | 2410.05431 | link |
2024-10-07 | Diffusion Imitation from Observation | Bo-Ruei Huang et.al. | 2410.05429 | null |
2024-10-07 | Diffusion Model Predictive Control | Guangyao Zhou et.al. | 2410.05364 | null |
2024-10-07 | Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation | Fanqing Meng et.al. | 2410.05363 | link |
2024-10-05 | From Incomplete Coarse-Grained to Complete Fine-Grained: A Two-Stage Framework for Spatiotemporal Data Reconstruction | Ziyu Sun et.al. | 2410.05323 | null |
2024-10-05 | Noise Crystallization and Liquid Noise: Zero-shot Video Generation using Image Diffusion Models | Muhammad Haaris Khan et.al. | 2410.05322 | null |
2024-10-14 | Accelerating Diffusion Transformers with Token-wise Feature Caching | Chang Zou et.al. | 2410.05317 | link |
2024-10-04 | ShieldDiff: Suppressing Sexual Content Generation from Diffusion Models through Reinforcement Learning | Dong Han et.al. | 2410.05309 | null |
2024-10-04 | Diffusion-based Unsupervised Audio-visual Speech Enhancement | Jean-Eudes Ayilo et.al. | 2410.05301 | null |
2024-10-07 | DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control | Kaifeng Zhao et.al. | 2410.05260 | null |
2024-10-07 | GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting | Yukang Cao et.al. | 2410.05259 | null |
2024-10-07 | SePPO: Semi-Policy Preference Optimization for Diffusion Alignment | Daoan Zhang et.al. | 2410.05255 | link |
2024-10-07 | DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration | Yongtai Zhuo et.al. | 2410.05234 | link |
2024-10-10 | The Dawn of Video Generation: Preliminary Explorations with SORA-like Models | Ailing Zeng et.al. | 2410.05227 | null |
2024-10-08 | Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality | Ge Ya Luo et.al. | 2410.05203 | link |
2024-10-07 | Presto! Distilling Steps and Layers for Accelerating Music Generation | Zachary Novack et.al. | 2410.05167 | null |
2024-10-08 | A Simulation-Free Deep Learning Approach to Stochastic Optimal Control | Mengjian Hua et.al. | 2410.05163 | null |
2024-10-07 | Leveraging Multimodal Diffusion Models to Accelerate Imaging with Side Information | Timofey Efimov et.al. | 2410.05143 | null |
2024-10-07 | Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning | Ayano Hiranaka et.al. | 2410.05116 | null |
2024-10-07 | DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects | Nidhi Mathihalli et.al. | 2410.05097 | link |
2024-10-07 | A nodally bound-preserving discontinuous Galerkin method for the drift-diffusion equation | Gabriel R. Barrenechea et.al. | 2410.05040 | null |
2024-10-07 | Revealing Directions for Text-guided 3D Face Editing | Zhuo Chen et.al. | 2410.04965 | null |
2024-10-07 | OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction | Leheng Li et.al. | 2410.04932 | null |
2024-10-07 | Low-Rank Continual Personalization of Diffusion Models | Łukasz Staniszewski et.al. | 2410.04891 | link |
2024-10-07 | Patch is Enough: Naturalistic Adversarial Patch against Vision-Language Pre-training Models | Dehong Kong et.al. | 2410.04884 | null |
2024-10-07 | PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing | Feng Tian et.al. | 2410.04844 | link |
2024-10-07 | Real-time cardiac cine MRI – A comparison of a diffusion probabilistic model with alternative state-of-the-art image reconstruction techniques for undersampled spiral acquisitions | Oliver Schad et.al. | 2410.04843 | null |
2024-10-07 | Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration | Zhiyu Zhu et.al. | 2410.04811 | link |
2024-10-07 | FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models | Haokun Chen et.al. | 2410.04810 | null |
2024-10-07 | Data-driven Diffusion Models for Enhancing Safety in Autonomous Vehicle Traffic Simulations | Jinxiong Lu et.al. | 2410.04809 | null |
2024-10-07 | Stochastic Runge-Kutta Methods: Provable Acceleration of Diffusion Models | Yuchen Wu et.al. | 2410.04760 | null |
2024-10-15 | Numerical analysis of American option pricing in a two-asset jump-diffusion model | Hao Zhou et.al. | 2410.04745 | null |
2024-10-15 | Diffusion Models in 3D Vision: A Survey | Zhen Wang et.al. | 2410.04738 | null |
2024-10-07 | Dynamics of Chemical Orders in Formation of Striped Patterns in Metamorphic Rocks | Bikash Kumar Sarkar et.al. | 2410.04735 | null |
2024-10-07 | ACDC: Autoregressive Coherent Multimodal Generation using Diffusion Correction | Hyungjin Chung et.al. | 2410.04721 | null |
2024-10-07 | CAR: Controllable Autoregressive Modeling for Visual Generation | Ziyu Yao et.al. | 2410.04671 | link |
2024-10-07 | Federated Learning Nodes Can Reconstruct Peers’ Image Data | Ethan Wilson et.al. | 2410.04661 | null |
2024-10-06 | AdaptDiff: Cross-Modality Domain Adaptation via Weak Conditional Semantic Diffusion for Retinal Vessel Segmentation | Dewei Hu et.al. | 2410.04648 | link |
2024-10-06 | Is What You Ask For What You Get? Investigating Concept Associations in Text-to-Image Models | Salma Abdel Magid et.al. | 2410.04634 | null |
2024-10-06 | Control Large Language Models via Divide and Conquer | Bingxuan Li et.al. | 2410.04628 | null |
2024-10-06 | Towards Unsupervised Blind Face Restoration using Diffusion Prior | Tianshu Kuai et.al. | 2410.04618 | null |
2024-10-06 | Realizing Video Summarization from the Path of Language-based Semantic Understanding | Kuan-Chen Mu et.al. | 2410.04511 | null |
2024-10-06 | uDiG-DIP: Unrolled Diffusion-Guided Deep Image Prior For Medical Image Reconstruction | Shijun Liang et.al. | 2410.04482 | null |
2024-10-06 | SITCOM: Step-wise Triple-Consistent Diffusion Sampling for Inverse Problems | Ismail Alkhouri et.al. | 2410.04479 | link |
2024-10-06 | Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training | Wenbo Li et.al. | 2410.04439 | null |
2024-10-11 | Disentangling Regional Primitives for Image Generation | Zhengting Chen et.al. | 2410.04421 | null |
2024-10-06 | DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion | Ke Sun et.al. | 2410.04372 | link |
2024-10-06 | RespDiff: An End-to-End Multi-scale RNN Diffusion Model for Respiratory Waveform Estimation from PPG Signals | Yuyang Miao et.al. | 2410.04366 | link |
2024-10-08 | VideoGuide: Improving Video Diffusion Models without Training Through a Teacher’s Guide | Dohun Lee et.al. | 2410.04364 | null |
2024-10-05 | The Visualization JUDGE : Can Multimodal Foundation Models Guide Visualization Design Through Visual Perception? | Matthew Berger et.al. | 2410.04280 | null |
2024-10-05 | DeFoG: Discrete Flow Matching for Graph Generation | Yiming Qin et.al. | 2410.04263 | link |
2024-10-05 | Compositional Diffusion Models for Powered Descent Trajectory Generation with Flexible Constraints | Julia Briden et.al. | 2410.04261 | null |
2024-10-10 | Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution | Jianze Li et.al. | 2410.04224 | link |
2024-10-05 | TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio Motion Embedding and Diffusion Interpolation | Haiyang Liu et.al. | 2410.04221 | null |
2024-10-05 | Boosting Visual Fidelity in Driving Simulations through Diffusion Models | Fanjun Bu et.al. | 2410.04214 | null |
2024-10-15 | Learning on LoRAs: GL-Equivariant Processing of Low-Rank Weight Spaces for Large Finetuned Models | Theo Putterman et.al. | 2410.04207 | null |
2024-10-05 | Accelerating Diffusion Models with One-to-Many Knowledge Distillation | Linfeng Zhang et.al. | 2410.04191 | null |
2024-10-08 | IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis | Shitong Shao et.al. | 2410.04171 | link |
2024-10-05 | Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model | Keda Tao et.al. | 2410.04161 | null |
2024-10-05 | Lane Detection System for Driver Assistance in Vehicles | Kauan Divino Pouso Mariano et.al. | 2410.04046 | null |
2024-10-05 | Multiscale Latent Diffusion Model for Enhanced Feature Extraction from Medical Images | Rabeya Tus Sadia et.al. | 2410.04000 | null |
2024-10-04 | AutoLoRA: AutoGuidance Meets Low-Rank Adaptation for Diffusion Models | Artur Kasymov et.al. | 2410.03941 | link |
2024-10-04 | Online Posterior Sampling with a Diffusion Prior | Branislav Kveton et.al. | 2410.03919 | null |
2024-10-04 | SONIQUE: Video Background Music Generation Using Unpaired Audio-Visual Data | Liqian Zhang et.al. | 2410.03879 | link |
2024-10-04 | Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step | Wenxuan Wang et.al. | 2410.03869 | null |
2024-10-04 | MDMP: Multi-modal Diffusion for supervised Motion Predictions with uncertainty | Leo Bringer et.al. | 2410.03860 | link |
2024-10-04 | Text-guided Diffusion Model for 3D Molecule Generation | Yanchen Luo et.al. | 2410.03803 | null |
2024-10-03 | People are poorly equipped to detect AI-powered voice clones | Sarah Barrington et.al. | 2410.03791 | null |
2024-10-02 | Denoising with a Joint-Embedding Predictive Architecture | Dengsheng Chen et.al. | 2410.03755 | null |
2024-10-01 | Khattat: Enhancing Readability and Concept Representation of Semantic Typography | Ahmed Hussein et.al. | 2410.03748 | null |
2024-10-04 | Estimating Body and Hand Motion in an Ego-sensed World | Brent Yi et.al. | 2410.03665 | null |
2024-10-04 | Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models | Chumeng Liang et.al. | 2410.03640 | link |
2024-10-04 | How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework | Yinuo Ren et.al. | 2410.03601 | null |
2024-10-10 | Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features | Benyuan Meng et.al. | 2410.03558 | link |
2024-10-04 | Diffusion State-Guided Projected Gradient for Inverse Problems | Rayhan Zirvi et.al. | 2410.03463 | link |
2024-10-04 | Generative Semantic Communication for Text-to-Speech Synthesis | Jiahao Zheng et.al. | 2410.03459 | null |
2024-10-09 | Dynamic Diffusion Transformer | Wangbo Zhao et.al. | 2410.03456 | link |
2024-10-04 | CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control | Guy Tevet et.al. | 2410.03441 | link |
2024-10-04 | Images Speak Volumes: User-Centric Assessment of Image Generation for Accessible Communication | Miriam Anschütz et.al. | 2410.03430 | link |
2024-10-04 | The scaling behaviour of localised and extended states in one-dimensional tight-binding models with disorder | Luca Schaefer et.al. | 2410.03405 | null |
2024-10-04 | Latent Abstractions in Generative Diffusion Models | Giulio Franzese et.al. | 2410.03368 | null |
2024-10-04 | LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding | Doohyuk Jang et.al. | 2410.03355 | null |
2024-10-04 | Generalized Ordered Weighted Aggregation Robustness to Solve Uncertain Single Objective Optimization Problems | Nand Kishor et.al. | 2410.03222 | null |
2024-10-04 | Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization | Zichen Miao et.al. | 2410.03190 | null |
2024-10-08 | Autonomous Character-Scene Interaction Synthesis from Text Instruction | Nan Jiang et.al. | 2410.03187 | null |
2024-10-04 | Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach | Yaofang Liu et.al. | 2410.03160 | link |
2024-10-12 | ECHOPulse: ECG controlled echocardio-grams video generation | Yiwei Li et.al. | 2410.03143 | link |
2024-10-04 | A Training-Free Conditional Diffusion Model for Learning Stochastic Dynamical Systems | Yanfang Liu et.al. | 2410.03108 | link |
2024-10-04 | Combing Text-based and Drag-based Editing for Precise and Flexible Image Editing | Ziqi Jiang et.al. | 2410.03097 | null |
2024-10-04 | Generative Edge Detection with Stable Diffusion | Caixia Zhou et.al. | 2410.03080 | null |
2024-10-04 | Multi-Robot Motion Planning with Diffusion Models | Yorai Shaoul et.al. | 2410.03072 | link |
2024-10-03 | Revealing the Unseen: Guiding Personalized Diffusion Models to Expose Training Data | Xiaoyu Wu et.al. | 2410.03039 | null |
2024-10-03 | Flow Matching with Gaussian Process Priors for Probabilistic Time Series Forecasting | Marcel Kollovieh et.al. | 2410.03024 | null |
2024-10-03 | PixelShuffler: A Simple Image Translation Through Pixel Rearrangement | Omar Zamzam et.al. | 2410.03021 | link |
2024-10-03 | Learning Optimal Control and Dynamical Structure of Global Trajectory Search Problems with Diffusion Models | Jannik Graebner et.al. | 2410.02976 | null |
2024-10-03 | SymmetricDiffusers: Learning Discrete Diffusion on Finite Symmetric Groups | Yongxing Zhang et.al. | 2410.02942 | link |
2024-10-03 | Reconstructing Galaxy Cluster Mass Maps using Score-based Generative Modeling | Alan Hsu et.al. | 2410.02857 | null |
2024-09-19 | KLDD: Kalman Filter based Linear Deformable Diffusion Model in Retinal Image Segmentation | Zhihao Zhao et.al. | 2410.02808 | null |
2024-10-03 | Loong: Generating Minute-level Long Videos with Autoregressive Language Models | Yuqing Wang et.al. | 2410.02757 | null |
2024-10-03 | Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models | Zhengfeng Lai et.al. | 2410.02740 | null |
2024-10-03 | SteerDiff: Steering towards Safe Text-to-Image Diffusion Models | Hongxiang Zhang et.al. | 2410.02710 | null |
2024-10-03 | ControlAR: Controllable Image Generation with Autoregressive Models | Zongming Li et.al. | 2410.02705 | link |
2024-10-03 | GUD: Generation with Unified Diffusion | Mathis Gerdes et.al. | 2410.02667 | null |
2024-10-03 | Grounded Answers for Multi-agent Decision-making Problem through Generative World Model | Zeyang Liu et.al. | 2410.02664 | null |
2024-10-03 | Efficient calibration of the shifted square-root diffusion model to credit default swap spreads using asymptotic approximations | Ankush Agarwal et.al. | 2410.02645 | null |
2024-10-03 | NL-Eye: Abductive NLI for Images | Mor Ventura et.al. | 2410.02613 | null |
2024-10-04 | Diffusion Models are Evolutionary Algorithms | Yanbo Zhang et.al. | 2410.02543 | link |
2024-10-03 | Lightweight Diffusion Models for Resource-Constrained Semantic Communication | Giovanni Pignata et.al. | 2410.02491 | link |
2024-10-03 | Event-Customized Image Generation | Zhen Wang et.al. | 2410.02483 | null |
2024-10-13 | Towards a Theoretical Understanding of Memorization in Diffusion Models | Yunhao Chen et.al. | 2410.02467 | null |
2024-10-03 | Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models | Seyedmorteza Sadat et.al. | 2410.02416 | null |
2024-10-03 | Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks | Zeyu Feng et.al. | 2410.02389 | null |
2024-10-04 | Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation | Muzhi Zhu et.al. | 2410.02369 | link |
2024-10-03 | SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration | Jintao Zhang et.al. | 2410.02367 | link |
2024-10-03 | Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis | Zikun Zhang et.al. | 2410.02321 | null |
2024-10-03 | Channel-aware Contrastive Conditional Diffusion for Multivariate Probabilistic Time Series Forecasting | Siyang Li et.al. | 2410.02168 | link |
2024-10-03 | Controlled Generation of Natural Adversarial Documents for Stealthy Retrieval Poisoning | Collin Zhang et.al. | 2410.02163 | link |
2024-10-03 | SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model | Xinlei Niu et.al. | 2410.02144 | null |
2024-10-03 | Plug-and-Play Controllable Generation for Discrete Masked Models | Wei Guo et.al. | 2410.02143 | null |
2024-10-03 | MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation | Trung X. Pham et.al. | 2410.02130 | null |
2024-10-03 | SC-CDM: Enhancing Quality of Image Semantic Communication with a Compact Diffusion Model | Kexin Zhang et.al. | 2410.02121 | null |
2024-10-04 | EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing | Haotian Sun et.al. | 2410.02098 | null |
2024-10-02 | DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation | Jing He et.al. | 2410.02067 | null |
2024-10-02 | Stochastic Deep Restoration Priors for Imaging Inverse Problems | Yuyang Hu et.al. | 2410.02057 | null |
2024-10-02 | Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data | Sreyan Ghosh et.al. | 2410.02056 | link |
2024-10-02 | Using Style Ambiguity Loss to Improve Aesthetics of Diffusion Models | James Baker et.al. | 2410.02055 | link |
2024-10-05 | Normalizing Flow-Based Metric for Image Generation | Pranav Jeevan et.al. | 2410.02004 | link |
2024-10-02 | Discrete Copula Diffusion | Anji Liu et.al. | 2410.01949 | null |
2024-10-02 | A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation | Liang Chen et.al. | 2410.01912 | link |
2024-10-02 | FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images | Cheng Zhang et.al. | 2410.01801 | null |
2024-10-02 | Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space | Yangming Li et.al. | 2410.01796 | null |
2024-10-02 | Dynamical-generative downscaling of climate model ensembles | Ignacio Lopez-Gomez et.al. | 2410.01776 | null |
2024-10-02 | ImageFolder: Autoregressive Image Generation with Folded Tokens | Xiang Li et.al. | 2410.01756 | link |
2024-10-02 | VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models | Kailai Feng et.al. | 2410.01738 | link |
2024-10-02 | ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation | Rinon Gal et.al. | 2410.01731 | null |
2024-10-04 | HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration | Yushi Huang et.al. | 2410.01723 | link |
2024-10-02 | COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation | Mingzhen Sun et.al. | 2410.01718 | null |
2024-10-02 | Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding | Yao Teng et.al. | 2410.01699 | link |
2024-10-02 | COSMIC: Compress Satellite Images Efficiently via Diffusion Compensation | Ziyuan Zhang et.al. | 2410.01698 | link |
2024-10-02 | Data Extrapolation for Text-to-image Generation on Small Datasets | Senmao Ye et.al. | 2410.01638 | link |
2024-10-11 | KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models | Pouyan Navard et.al. | 2410.01595 | link |
2024-10-02 | MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation | Mingzhen Sun et.al. | 2410.01594 | link |
2024-10-02 | HRTF Estimation using a Score-based Prior | Etienne Thuillier et.al. | 2410.01562 | null |
2024-10-02 | Edge-preserving noise for diffusion models | Jente Vandersanden et.al. | 2410.01540 | null |
2024-10-02 | Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models | Ching-Chia Kao et.al. | 2410.01438 | null |
2024-10-02 | Harnessing the Latent Diffusion Model for Training-Free Image Style Transfer | Kento Masui et.al. | 2410.01366 | null |
2024-10-02 | Aggregation of Multi Diffusion Models for Enhancing Learned Representations | Conghan Yue et.al. | 2410.01262 | link |
2024-10-03 | The SynCOM Flow Tracking Challenge | Valmir Moraes Filho et.al. | 2410.01233 | null |
2024-10-02 | Generative Diffusion-based Contract Design for Efficient AI Twins Migration in Vehicular Embodied AI Networks | Yue Zhong et.al. | 2410.01176 | null |
2024-10-02 | Text2PDE: Latent Diffusion Models for Accessible Physics Simulation | Anthony Zhou et.al. | 2410.01153 | link |
2024-10-01 | Generative AI Application for Building Industry | Hanlong Wan et.al. | 2410.01098 | null |
2024-10-01 | “Hiding in Plain Sight”: Designing Synthetic Dialog Generation for Uncovering Socially Situated Norms | Chengfei Wu et.al. | 2410.00998 | null |
2024-10-01 | Removing Distributional Discrepancies in Captions Improves Image-Text Alignment | Yuheng Li et.al. | 2410.00905 | null |
2024-10-02 | Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation | Junlin Han et.al. | 2410.00890 | null |
2024-10-01 | Diffusion-Informed Probabilistic Contact Search for Multi-Finger Manipulation | Abhinav Kumar et.al. | 2410.00841 | null |
2024-10-01 | Absorbing State Phase Transitions and Stability of Long-Range Coherence in Dissipative Quantum State Preparation | Matthew Wampler et.al. | 2410.00819 | null |
2024-10-01 | Modeling Neural Switching via Drift-Diffusion Models | Nicholas Marco et.al. | 2410.00781 | link |
2024-10-01 | Improved Generation of Synthetic Imaging Data Using Feature-Aligned Diffusion | Lakshmi Nair et.al. | 2410.00731 | link |
2024-10-03 | NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models | Chi-Sheng Chen et.al. | 2410.00712 | null |
2024-10-02 | Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models | Saurav Jha et.al. | 2410.00700 | null |
2024-10-08 | Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining | Jie Cheng et.al. | 2410.00564 | link |
2024-10-01 | MCGM: Mask Conditional Text-to-Image Generative Model | Rami Skaik et.al. | 2410.00483 | null |
2024-10-01 | Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation | Yunnan Wang et.al. | 2410.00447 | null |
2024-10-01 | CusConcept: Customized Visual Concept Decomposition with Diffusion Models | Zhi Xu et.al. | 2410.00398 | link |
2024-10-01 | Generative Precipitation Downscaling using Score-based Diffusion with Wasserstein Regularization | Yuhao Liu et.al. | 2410.00381 | null |
2024-10-01 | SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs | Leheng Li et.al. | 2410.00337 | null |
2024-10-11 | A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding Optimization | Chieh-Yun Chen et.al. | 2410.00321 | link |
2024-10-01 | RadGazeGen: Radiomics and Gaze-guided Medical Image Generation using Diffusion Models | Moinak Bhattacharya et.al. | 2410.00307 | null |
2024-09-30 | ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning | Jian Shi et.al. | 2410.00262 | link |
2024-09-30 | Volumetric Conditional Score-based Residual Diffusion Model for PET/MR Denoising | Siyeop Yoon et.al. | 2410.00184 | link |
2024-09-30 | GaNDLF-Synth: A Framework to Democratize Generative AI for (Bio)Medical Imaging | Sarthak Pati et.al. | 2410.00173 | null |
2024-09-30 | ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer | Zhen Han et.al. | 2410.00086 | null |
2024-09-30 | A Survey on Diffusion Models for Inverse Problems | Giannis Daras et.al. | 2410.00083 | null |
2024-09-30 | Graph Residual Noise Learner Network for Brain Connectivity Graph Prediction | Oytun Demirbilek et.al. | 2410.00082 | link |
2024-09-28 | Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization | Haoran Li et.al. | 2410.00051 | null |
2024-10-11 | Inverse Painting: Reconstructing The Painting Process | Bowei Chen et.al. | 2409.20556 | null |
2024-09-30 | COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models | Divyanshu Daiya et.al. | 2409.20502 | null |
2024-09-30 | FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing | Lingling Cai et.al. | 2409.20500 | null |
2024-09-30 | All-optical autoencoder machine learning framework using diffractive processors | Peijie Feng et.al. | 2409.20346 | null |
2024-09-30 | Ensemble Kalman Diffusion Guidance: A Derivative-free Method for Inverse Problems | Hongkai Zheng et.al. | 2409.20175 | null |
2024-09-30 | Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model | Fulong Ma et.al. | 2409.20164 | null |
2024-09-30 | Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation | Rong Tang et.al. | 2409.20124 | null |
2024-09-30 | Reaction-diffusion model for a population structured in phenotype and space I – Criterion for persistence | Nathanaël Boutillon et.al. | 2409.20118 | null |
2024-09-30 | Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs | Zicheng Zhang et.al. | 2409.20063 | null |
2024-09-30 | RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models | Jangyeong Kim et.al. | 2409.19989 | null |
2024-09-30 | Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function | Chenyi Zhuang et.al. | 2409.19967 | link |
2024-10-02 | Image Copy Detection for Diffusion Models | Wenhao Wang et.al. | 2409.19952 | null |
2024-09-30 | Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner | Chenyou Fan et.al. | 2409.19949 | null |
2024-09-30 | Illustrious: an Open Advanced Illustration Model | Sang Hyun Park et.al. | 2409.19946 | null |
2024-09-30 | MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation | Wenchao Chen et.al. | 2409.19937 | null |
2024-09-30 | Replace Anyone in Videos | Xiang Wang et.al. | 2409.19911 | link |
2024-09-30 | GameLabel-10K: Collecting Image Preference Data Through Mobile Game Crowdsourcing | Jonathan Zhou et.al. | 2409.19830 | null |
2024-09-29 | OrganiQ: Mitigating Classical Resource Bottlenecks of Quantum Generative Adversarial Networks on NISQ-Era Machines | Daniel Silver et.al. | 2409.19823 | null |
2024-09-29 | Generating peak-aware pseudo-measurements for low-voltage feeders using metadata of distribution system operators | Manuel Treutlein et.al. | 2409.19713 | null |
2024-09-29 | Text-driven Human Motion Generation with Motion Masked Diffusion Model | Xingyu Chen et.al. | 2409.19686 | null |
2024-09-29 | Simple and Fast Distillation of Diffusion Models | Zhenyu Zhou et.al. | 2409.19681 | link |
2024-09-29 | SemiDDM-Weather: A Semi-supervised Learning Framework for All-in-one Adverse Weather Removal | Fang Long et.al. | 2409.19679 | link |
2024-09-29 | Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection | Yuhang Ma et.al. | 2409.19624 | null |
2024-09-29 | MCDDPM: Multichannel Conditional Denoising Diffusion Model for Unsupervised Anomaly Detection in Brain MRI | Vivek Kumar Trivedi et.al. | 2409.19623 | link |
2024-09-29 | Causal Deciphering and Inpainting in Spatio-Temporal Dynamics via Diffusion Model | Yifan Duan et.al. | 2409.19608 | null |
2024-09-29 | DiffCP: Ultra-Low Bit Collaborative Perception via Diffusion Model | Ruiqing Mao et.al. | 2409.19592 | null |
2024-09-29 | Effective Diffusion Transformer Architecture for Image Super-Resolution | Kun Cheng et.al. | 2409.19589 | link |
2024-09-29 | High Quality Human Image Animation using Regional Supervision and Motion Blur Condition | Zhongcong Xu et.al. | 2409.19580 | null |
2024-09-28 | Introducing SDICE: An Index for Assessing Diversity of Synthetic Medical Datasets | Mohammed Talha Alam et.al. | 2409.19436 | null |
2024-09-28 | Multi-Factor Polynomial Diffusion Models and Inter-Temporal Futures Dynamics | Peilun He et.al. | 2409.19386 | null |
2024-09-28 | PDSim: A Shiny App for Polynomial Diffusion Model Simulation and Estimation | Peilun He et.al. | 2409.19385 | null |
2024-09-28 | Efficient Semantic Diffusion Architectures for Model Training on Synthetic Echocardiograms | David Stojanovski et.al. | 2409.19371 | link |
2024-10-03 | Conditional Image Synthesis with Diffusion Models: A Survey | Zheyuan Zhan et.al. | 2409.19365 | link |
2024-09-28 | CausalVE: Face Video Privacy Encryption via Causal Video Prediction | Yubo Huang et.al. | 2409.19306 | null |
2024-09-28 | FINE: Factorizing Knowledge for Initialization of Variable-sized Diffusion Models | Yucheng Xie et.al. | 2409.19289 | null |
2024-09-28 | SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement | Ishani Mondal et.al. | 2409.19242 | null |
2024-09-27 | Multimodal Pragmatic Jailbreak on Text-to-image Models | Tong Liu et.al. | 2409.19149 | null |
2024-10-01 | Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models | Yize Li et.al. | 2409.19128 | link |
2024-09-27 | Secure Multiparty Generative AI | Manil Shrestha et.al. | 2409.19120 | null |
2024-10-02 | Fusion is all you need: Face Fusion for Customized Identity-Preserving Image Synthesis | Salaheldin Mohamed et.al. | 2409.19111 | null |
2024-09-27 | PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation | Shaowei Liu et.al. | 2409.18964 | link |
2024-09-27 | $O(d/T)$ Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions | Gen Li et.al. | 2409.18959 | null |
2024-10-01 | Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models | Jiaming Li et.al. | 2409.18943 | link |
2024-09-27 | ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions | Wenfeng Huang et.al. | 2409.18932 | null |
2024-09-27 | Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors | Yunlong Lin et.al. | 2409.18899 | null |
2024-09-27 | Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis | Songrui Wang et.al. | 2409.18897 | null |
2024-10-04 | Explainable Artifacts for Synthetic Western Blot Source Attribution | João Phillipe Cardenuto et.al. | 2409.18881 | link |
2024-09-27 | Emu3: Next-Token Prediction is All You Need | Xinlong Wang et.al. | 2409.18869 | null |
2024-09-27 | Convergence of Diffusion Models Under the Manifold Hypothesis in High-Dimensions | Iskander Azangulov et.al. | 2409.18804 | null |
2024-09-27 | Learning from Pattern Completion: Self-supervised Controllable Generation | Zhiqiang Chen et.al. | 2409.18694 | link |
2024-09-27 | Unsupervised Fingerphoto Presentation Attack Detection With Diffusion Models | Hailin Li et.al. | 2409.18636 | null |
2024-09-27 | Treating Brain-inspired Memories as Priors for Diffusion Model to Forecast Multivariate Time Series | Muyao Wang et.al. | 2409.18491 | null |
2024-09-27 | Gradient-free Decoder Inversion in Latent Diffusion Models | Seongmin Hong et.al. | 2409.18442 | null |
2024-09-27 | GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation | Jiawei Lu et.al. | 2409.18401 | null |
2024-10-04 | Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded Images | Donghwan Kim et.al. | 2409.18364 | link |
2024-09-27 | Generative AI for fast and accurate Statistical Computation of Fluids | Roberto Molinaro et.al. | 2409.18359 | link |
2024-09-26 | Realistic Evaluation of Model Merging for Compositional Generalization | Derek Tam et.al. | 2409.18314 | link |
2024-09-26 | Harnessing Wavelet Transformations for Generalizable Deepfake Forgery Detection | Lalith Bharadwaj Baru et.al. | 2409.18301 | link |
2024-10-01 | Synthesizing beta-amyloid PET images from T1-weighted Structural MRI: A Preliminary Study | Qing Lyu et.al. | 2409.18282 | null |
2024-10-04 | Amodal Instance Segmentation with Diffusion Shape Prior Estimation | Minh Tran et.al. | 2409.18256 | null |
2024-09-26 | PDFed: Privacy-Preserving and Decentralized Asynchronous Federated Learning for Diffusion Models | Kar Balan et.al. | 2409.18245 | null |
2024-09-26 | Trustworthy Text-to-Image Diffusion Models: A Timely and Focused Survey | Yi Zhang et.al. | 2409.18214 | link |
2024-09-26 | Loop-Diffusion: an equivariant diffusion model for designing and scoring protein loops | Kevin Borisiak et.al. | 2409.18201 | null |
2024-09-26 | FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Wenliang Zhao et.al. | 2409.18128 | link |
2024-10-02 | Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Jing He et.al. | 2409.18124 | null |
2024-09-26 | EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation | Jiaxiang Tang et.al. | 2409.18114 | null |
2024-09-26 | StackGen: Generating Stable Structures from Silhouettes via Diffusion | Luzhe Sun et.al. | 2409.18098 | null |
2024-09-30 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-09-26 | Stable Video Portraits | Mirela Ostrek et.al. | 2409.18083 | null |
2024-10-07 | PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging | Xin Cai et.al. | 2409.17996 | null |
2024-09-26 | Joint Localization and Planning using Diffusion | L. Lao Beyer et.al. | 2409.17995 | null |
2024-09-26 | CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors | Linye Lyu et.al. | 2409.17963 | link |
2024-09-26 | Relativistic diffusion model for hadron production in p-Pb collisions at the LHC | Philipp Schulz et.al. | 2409.17960 | null |
2024-09-26 | Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion | Hengrui Gu et.al. | 2409.17928 | link |
2024-09-26 | Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation | Qihan Huang et.al. | 2409.17920 | link |
2024-09-26 | Continual learning with task specialist | Indu Solomon et.al. | 2409.17806 | null |
2024-09-26 | Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs | Qinpeng Cui et.al. | 2409.17778 | link |
2024-09-26 | Integrating Hierarchical Semantic into Iterative Generation Model for Entailment Tree Explanation | Qin Wang et.al. | 2409.17757 | null |
2024-09-26 | Text Image Generation for Low-Resource Languages with Dual Translation Learning | Chihiro Noguchi et.al. | 2409.17747 | null |
2024-09-26 | AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status | Jinghao Zhang et.al. | 2409.17740 | null |
2024-09-26 | Dark Miner: Defend against unsafe generation for text-to-image diffusion models | Zheling Meng et.al. | 2409.17682 | null |
2024-09-26 | Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation | Huan Yang et.al. | 2409.17674 | null |
2024-09-26 | ID $^3$ : Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition | Shen Li et.al. | 2409.17576 | null |
2024-09-26 | Flexiffusion: Segment-wise Neural Architecture Search for Flexible Denoising Schedule | Hongtao Huang et.al. | 2409.17566 | null |
2024-09-26 | Pixel-Space Post-Training of Latent Diffusion Models | Christina Zhang et.al. | 2409.17565 | null |
2024-09-26 | A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation | Masato Ishii et.al. | 2409.17550 | link |
2024-09-26 | JoyType: A Robust Design for Multilingual Visual Text Creation | Chao Li et.al. | 2409.17524 | null |
2024-09-26 | Optimizing Resource Allocation for Multi-modal Semantic Communication in Mobile AIGC Networks: A Diffusion-based Game Approach | Jian Liu et.al. | 2409.17506 | null |
2024-09-26 | Learning Quantized Adaptive Conditions for Diffusion Models | Yuchen Liang et.al. | 2409.17487 | null |
2024-09-26 | Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | Yongrok Kim et.al. | 2409.17451 | null |
2024-09-26 | Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis | Chirag Vashist et.al. | 2409.17439 | null |
2024-09-25 | Copying style, Extracting value: Illustrators’ Perception of AI Style Transfer and its Impact on Creative Labor | Julien Porquet et.al. | 2409.17410 | null |
2024-09-25 | Consistent estimation of generative model representations in the data kernel perspective space | Aranyak Acharyya et.al. | 2409.17308 | null |
2024-09-25 | Disco4D: Disentangled 4D Human Generation and Animation from a Single Image | Hui En Pang et.al. | 2409.17280 | null |
2024-09-25 | DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion | Yukun Huang et.al. | 2409.17145 | link |
2024-09-25 | Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model | Xinfeng Wei et.al. | 2409.17104 | null |
2024-09-25 | Generic Diagonalizability, Structural Functional Observability and Output Controllability | Yuan Zhang et.al. | 2409.17100 | link |
2024-09-25 | Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification | Xinrui Zhou et.al. | 2409.17091 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis | Fangshuo Zhou et.al. | 2409.17049 | link |
2024-09-25 | GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design | Phillip Mueller et.al. | 2409.17045 | null |
2024-09-25 | Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion | Vineet Punyamoorty et.al. | 2409.16950 | null |
2024-09-25 | DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Kyuheon Jung et.al. | 2409.16949 | link |
2024-09-25 | Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model | Hongliang Zhong et.al. | 2409.16938 | link |
2024-09-25 | A Versatile and Differentiable Hand-Object Interaction Representation | Théo Morales et.al. | 2409.16855 | null |
2024-09-25 | Analytical assessment of workers’ safety concerning direct and indirect ways of getting infected by dangerous pathogen | Krzysztof Domino et.al. | 2409.16809 | null |
2024-09-25 | Pose-Guided Fine-Grained Sign Language Video Generation | Tongkai Shi et.al. | 2409.16709 | null |
2024-09-25 | Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation | Youngwan Jin et.al. | 2409.16706 | link |
2024-09-25 | Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model | Shoma Iwai et.al. | 2409.16689 | null |
2024-09-25 | Morphological-consistent Diffusion Network for Ultrasound Coronal Image Enhancement | Yihao Zhou et.al. | 2409.16661 | null |
2024-09-25 | CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models | Xin Jing et.al. | 2409.16619 | null |
2024-09-25 | ECG-Image-Database: A Dataset of ECG Images with Real-World Imaging and Scanning Artifacts; A Foundation for Computerized ECG Image Digitization and Analysis | Matthew A. Reyna et.al. | 2409.16612 | link |
2024-09-25 | Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models | Deepak Sridhar et.al. | 2409.16535 | link |
2024-09-24 | Diffusion Models to Enhance the Resolution of Microscopy Images: A Tutorial | Harshith Bachimanchi et.al. | 2409.16488 | null |
2024-09-24 | Beyond Text-to-Text: An Overview of Multimodal and Generative Artificial Intelligence for Education Using Topic Modeling | Ville Heilala et.al. | 2409.16376 | null |
2024-09-18 | A Generative Diffusion Model for Probabilistic Ensembles of Precipitation Maps Conditioned on Multisensor Satellite Observations | Clement Guilloteau et.al. | 2409.16319 | null |
2024-09-24 | Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation | Homanga Bharadhwaj et.al. | 2409.16283 | null |
2024-09-24 | MonoFormer: One Transformer for Both Diffusion and Autoregression | Chuyang Zhao et.al. | 2409.16280 | link |
2024-09-24 | Generative Factor Chaining: Coordinated Manipulation with Diffusion-based Factor Graph | Utkarsh A. Mishra et.al. | 2409.16275 | null |
2024-09-24 | Label-Augmented Dataset Distillation | Seoungyoon Kang et.al. | 2409.16239 | null |
2024-09-24 | MaskBit: Embedding-free Image Generation via Bit Tokens | Mark Weber et.al. | 2409.16211 | link |
2024-09-23 | Fine Tuning Text-to-Image Diffusion Models for Correcting Anomalous Images | Hyunwoo Yoo et.al. | 2409.16174 | link |
2024-09-24 | MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling | Yifang Men et.al. | 2409.16160 | null |
2024-09-24 | Spreading dynamics of a Fisher-KPP nonlocal diffusion model with a free boundary | Lei Li et.al. | 2409.16101 | null |
2024-09-26 | Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients | Wanchen Zhao et.al. | 2409.16042 | null |
2024-09-24 | Deep chroma compression of tone-mapped images | Xenios Milidonis et.al. | 2409.16032 | link |
2024-09-24 | PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation | Mingyo Seo et.al. | 2409.16012 | null |
2024-09-24 | Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification | Leire Benito-Del-Valle et.al. | 2409.16002 | link |
2024-09-26 | Improvements to SDXL in NovelAI Diffusion V3 | Juan Ossa et.al. | 2409.15997 | null |
2024-09-24 | ASD-Diffusion: Anomalous Sound Detection with Diffusion Models | Fengrun Zhang et.al. | 2409.15957 | null |
2024-09-24 | Multiscale method for image denoising using nonlinear diffusion process: local denoising and spectral multiscale basis functions | Maria Vasilyeva et.al. | 2409.15952 | null |
2024-09-24 | Identifying early tumour states in a Cahn-Hilliard-reaction-diffusion model | Abramo Agosti et.al. | 2409.15925 | null |
2024-09-27 | Diffusion Models for Intelligent Transportation Systems: A Survey | Mingxing Peng et.al. | 2409.15816 | null |
2024-09-24 | Training Data Attribution: Was Your Model Secretly Trained On Data Created By Mine? | Likun Zhang et.al. | 2409.15781 | null |
2024-09-24 | TFG: Unified Training-Free Guidance for Diffusion Models | Haotian Ye et.al. | 2409.15761 | link |
2024-09-24 | ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models | Divya Kothandaraman et.al. | 2409.15650 | link |
2024-09-23 | Critic Loss for Image Classification | Brendan Hogan Rappazzo et.al. | 2409.15565 | null |
2024-09-23 | Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection | Alireza Ganjdanesh et.al. | 2409.15557 | null |
2024-09-23 | Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance | Kin Man Lee et.al. | 2409.15528 | null |
2024-09-23 | Speech2rtMRI: Speech-Guided Diffusion Model for Real-time MRI Video of the Vocal Tract during Speech | Hong Nguyen et.al. | 2409.15525 | link |
2024-09-23 | Bayesian computation with generative diffusion models by Multilevel Monte Carlo | Abdul-Lateef Haji-Ali et.al. | 2409.15511 | link |
2024-09-23 | Revealing an Unattractivity Bias in Mental Reconstruction of Occluded Faces using Generative Image Models | Frederik Riedmann et.al. | 2409.15443 | null |
2024-09-23 | Uncovering Coordinated Cross-Platform Information Operations Threatening the Integrity of the 2024 U.S. Presidential Election Online Discussion | Marco Minici et.al. | 2409.15402 | null |
2024-09-21 | Adversarial Attacks on Parts of Speech: An Empirical Study in Text-to-Image Generation | G M Shahariar et.al. | 2409.15381 | link |
2024-10-05 | PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions | Weifeng Lin et.al. | 2409.15278 | link |
2024-09-23 | MaterialFusion: Enhancing Inverse Rendering with Material Diffusion Priors | Yehonathan Litman et.al. | 2409.15273 | null |
2024-09-23 | S $^2$ AG-Vid: Enhancing Multi-Motion Alignment in Video Diffusion Models via Spatial and Syntactic Attention-Based Guidance | Yuanhang Li et.al. | 2409.15259 | null |
2024-09-18 | Recommendation with Generative Models | Yashar Deldjoo et.al. | 2409.15173 | null |
2024-09-23 | LoVA: Long-form Video-to-Audio Generation | Xin Cheng et.al. | 2409.15157 | null |
2024-10-03 | GALD-SE: Guided Anisotropic Lightweight Diffusion for Efficient Speech Enhancement | Chengzhong Wang et.al. | 2409.15101 | link |
2024-09-23 | Can CLIP Count Stars? An Empirical Study on Quantity Bias in CLIP | Zeliang Zhang et.al. | 2409.15035 | null |
2024-09-23 | DepthART: Monocular Depth Estimation as Autoregressive Refinement Task | Bulat Gabdullin et.al. | 2409.15010 | null |
2024-09-23 | Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond | Hong Chen et.al. | 2409.14993 | null |
2024-09-23 | Advancing Video Quality Assessment for AIGC | Xinli Yue et.al. | 2409.14888 | null |
2024-09-23 | Video-to-Audio Generation with Fine-grained Temporal Semantics | Yuchen Hu et.al. | 2409.14709 | null |
2024-09-23 | VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models | Jingtao Cao et.al. | 2409.14704 | link |
2024-09-24 | Rate-Splitting for Cell-Free Massive MIMO: Performance Analysis and Generative AI Approach | Jiakang Zheng et.al. | 2409.14702 | null |
2024-09-23 | EDGE-Rec: Efficient and Data-Guided Edge Diffusion For Recommender Systems Graphs | Utkarsh Priyam et.al. | 2409.14689 | null |
2024-09-23 | Reflecting Reality: Enabling Diffusion Models to Produce Faithful Mirror Reflections | Ankit Dhiman et.al. | 2409.14677 | link |
2024-09-22 | LatentQGAN: A Hybrid QGAN with Classical Convolutional Autoencoder | Vieloszynski Alexis et.al. | 2409.14622 | null |
2024-10-03 | Implicit Dynamical Flow Fusion (IDFF) for Generative Modeling | Mohammad R. Rezaei et.al. | 2409.14599 | link |
2024-09-22 | URSimulator: Human-Perception-Driven Prompt Tuning for Enhanced Virtual Urban Renewal via Diffusion Models | Chuanbo Hu et.al. | 2409.14589 | null |
2024-09-22 | Sampling-Pattern-Agnostic MRI Reconstruction through Adaptive Consistency Enforcement with Diffusion Model | Anurag Malyala et.al. | 2409.14479 | null |
2024-09-22 | Contact Compliance Visuo-Proprioceptive Policy for Contact-Rich Manipulation with Cost-Efficient Haptic Hand-Arm Teleoperation System | Bo Zhou et.al. | 2409.14440 | null |
2024-09-22 | Dormant: Defending against Pose-driven Human Image Animation | Jiachen Zhou et.al. | 2409.14424 | link |
2024-09-22 | The route of random process to ultraslow aging phenomena | Chunyan Li et.al. | 2409.14422 | null |
2024-09-22 | Self-Supervised Audio-Visual Soundscape Stylization | Tingle Li et.al. | 2409.14340 | null |
2024-09-22 | Anisotropic Diffusion Probabilistic Model for Imbalanced Image Classification | Jingyu Kong et.al. | 2409.14313 | null |
2024-09-25 | DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation | Xuewen Liu et.al. | 2409.14307 | null |
2024-09-22 | A competitive baseline for deep learning enhanced data assimilation using conditional Gaussian ensemble Kalman filtering | Zachariah Malik et.al. | 2409.14300 | null |
2024-09-21 | Cloud Adversarial Example Generation for Remote Sensing Image Classification | Fei Ma et.al. | 2409.14240 | null |
2024-09-21 | Content-aware Tile Generation using Exterior Boundary Inpainting | Sam Sartor et.al. | 2409.14184 | link |
2024-09-27 | JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation | Hadrien Reynaud et.al. | 2409.14149 | null |
2024-09-21 | Present and Future Generalization of Synthetic Image Detectors | Pablo Bernabeu-Perez et.al. | 2409.14128 | link |
2024-09-21 | Recovering Global Data Distribution Locally in Federated Learning | Ziyu Yao et.al. | 2409.14063 | null |
2024-09-21 | Signal Detection in Near-field Communication with Unknown Noise Characteristics: A Diffusion Model Method | Changyuan Zhao et.al. | 2409.14031 | null |
2024-09-21 | BrainDreamer: Reasoning-Coherent and Controllable Image Generation from EEG Brain Signals via Language Guidance | Ling Wang et.al. | 2409.14021 | null |
2024-09-21 | Mitigating Exposure Bias in Score-Based Generation of Molecular Conformations | Sijia Wang et.al. | 2409.14014 | link |
2024-09-20 | PureDiffusion: Using Backdoor to Counter Backdoor in Generative Diffusion Models | Vu Tuan Truong et.al. | 2409.13945 | null |
2024-09-20 | RN-SDEs: Limited-Angle CT Reconstruction with Residual Null-Space Diffusion Stochastic Differential Equations | Jiaqi Guo et.al. | 2409.13930 | link |
2024-09-28 | Nonlinear Inverse Design of Mechanical Multi-Material Metamaterials Enabled by Video Denoising Diffusion and Structure Identifier | Jaewan Park et.al. | 2409.13908 | null |
2024-09-20 | PTQ4ADM: Post-Training Quantization for Efficient Text Conditional Audio Diffusion Models | Jayneel Vora et.al. | 2409.13894 | null |
2024-09-10 | Table-to-Text Generation with Pretrained Diffusion Models | Aleksei S. Krylov et.al. | 2409.13739 | null |
2024-09-20 | DiffFluid: Plain Diffusion Models are Effective Predictors of Flow Dynamics | Dongyu Luo et.al. | 2409.13665 | link |
2024-09-20 | Efficient Domain Augmentation for Autonomous Driving Testing Using Diffusion Models | Luciano Baresi et.al. | 2409.13661 | null |
2024-09-20 | Human-Robot Cooperative Distribution Coupling for Hamiltonian-Constrained Social Navigation | Weizheng Wang et.al. | 2409.13573 | null |
2024-09-20 | Efficient Visualization of Neural Networks with Generative Models and Adversarial Perturbations | Athanasios Karagounis et.al. | 2409.13559 | null |
2024-10-01 | Physics-Informed Latent Diffusion for Multimodal Brain MRI Synthesis | Sven Lüpke et.al. | 2409.13532 | link |
2024-09-20 | Towards the Discovery of Down Syndrome Brain Biomarkers Using Generative Models | Jordi Malé et.al. | 2409.13437 | null |
2024-09-20 | HMD $^2$ : Environment-aware Motion Generation from Single Egocentric Head-Mounted Device | Vladimir Guzov et.al. | 2409.13426 | null |
2024-09-20 | Imagine yourself: Tuning-Free Personalized Image Generation | Zecheng He et.al. | 2409.13346 | null |
2024-09-20 | Generative Aerodynamic Design with Diffusion Probabilistic Models | Thomas Wagenaar et.al. | 2409.13328 | null |
2024-09-20 | JoyHallo: Digital human model for Mandarin | Sheng Shi et.al. | 2409.13268 | null |
2024-09-20 | Deep Learning based Optical Image Super-Resolution via Generative Diffusion Models for Layerwise in-situ LPBF Monitoring | Francis Ogoke et.al. | 2409.13171 | null |
2024-09-19 | What does guidance do? A fine-grained analysis in a simple setting | Muthu Chidambaram et.al. | 2409.13074 | null |
2024-09-19 | LVCD: Reference-based Lineart Video Colorization with Diffusion Models | Zhitong Huang et.al. | 2409.12960 | null |
2024-10-01 | Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering | Youngsun Lim et.al. | 2409.12784 | link |
2024-09-19 | Financial Stochastic Models Diffusion: From Risk-Neutral to Real-World Measure | Mohamed Ben Alaya et.al. | 2409.12783 | null |
2024-09-19 | Algorithmic and High-Frequency Trading Problems for Semi-Markov and Hawkes Jump-Diffusion Models | Luca Lalor et.al. | 2409.12776 | null |
2024-09-19 | StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation | Zhengguang Zhou et.al. | 2409.12576 | link |
2024-09-19 | Improving Cone-Beam CT Image Quality with Knowledge Distillation-Enhanced Diffusion Model in Imbalanced Data Settings | Joonil Hwang et.al. | 2409.12539 | null |
2024-09-19 | Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation | Chenyu Wang et.al. | 2409.12532 | null |
2024-09-19 | Arena 4.0: A Comprehensive ROS2 Development and Benchmarking Platform for Human-centric Navigation Using Generative-Model-based Environment Generation | Volodymyr Shcherbyna1 et.al. | 2409.12471 | null |
2024-09-19 | HSIGene: A Foundation Model For Hyperspectral Image Generation | Li Pang et.al. | 2409.12470 | link |
2024-09-29 | AudioEditor: A Training-Free Diffusion-Based Audio Editing Framework | Yuhang Jia et.al. | 2409.12466 | link |
2024-09-19 | Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction | Sibo Tian et.al. | 2409.12456 | null |
2024-10-04 | Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework | Xinyi Ying et.al. | 2409.12448 | link |
2024-09-25 | FlexiTex: Enhancing Texture Generation with Visual Guidance | DaDong Jiang et.al. | 2409.12431 | null |
2024-09-19 | I2I-Galip: Unsupervised Medical Image Translation Using Generative Adversarial CLIP | Yilmaz Korkmaz et.al. | 2409.12399 | link |
2024-09-19 | Fundus image enhancement through direct diffusion bridges | Sehui Kim et.al. | 2409.12377 | link |
2024-09-18 | Dynamics of Post-disaster Recovery in Behavior-dependent Business Networks | Chia-Fu Liu1 et.al. | 2409.12357 | null |
2024-09-18 | Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models | Tornike Karchkhadze et.al. | 2409.12346 | link |
2024-09-18 | Understanding Implosion in Text-to-Image Generative Models | Wenxin Ding et.al. | 2409.12314 | null |
2024-09-17 | Sparks of Artificial General Intelligence(AGI) in Semiconductor Material Science: Early Explorations into the Next Frontier of Generative AI-Assisted Electron Micrograph Analysis | Sakhinana Sagar Srinivas et.al. | 2409.12244 | null |
2024-09-18 | Massively Multi-Person 3D Human Motion Forecasting with Scene Context | Felix B Mueller et.al. | 2409.12189 | link |
2024-09-18 | MoRAG – Multi-Fusion Retrieval Augmented Generation for Human Motion | Kalakonda Sai Shashank et.al. | 2409.12140 | link |
2024-09-18 | Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance | Jaehoon Joo et.al. | 2409.12099 | null |
2024-09-18 | Denoising diffusion models for high-resolution microscopy image restoration | Pamela Osuna-Vargas et.al. | 2409.12078 | null |
2024-09-18 | LEMON: Localized Editing with Mesh Optimization and Neural Shaders | Furkan Mert Algan et.al. | 2409.12024 | null |
2024-09-18 | ChefFusion: Multimodal Foundation Model Integrating Recipe and Food Image Generation | Peiyu Li et.al. | 2409.12010 | link |
2024-09-18 | Tracking Any Point with Frame-Event Fusion Network at High Frame Rate | Jiaxiong Liu et.al. | 2409.11953 | null |
2024-09-18 | Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models | Lorenzo Mandelli et.al. | 2409.11920 | null |
2024-09-18 | Finding the Subjective Truth: Collecting 2 Million Votes for Comprehensive Gen-AI Model Evaluation | Dimitrios Christodoulou et.al. | 2409.11904 | null |
2024-09-18 | ABHINAW: A method for Automatic Evaluation of Typography within AI-Generated Images | Abhinaw Jagtap et.al. | 2409.11874 | null |
2024-09-18 | DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech | Xin Qi et.al. | 2409.11835 | null |
2024-09-18 | RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets | Jikai Ye et.al. | 2409.11831 | null |
2024-09-18 | InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models | Yan Zheng et.al. | 2409.11734 | null |
2024-09-18 | GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation | Shuowen Liang et.al. | 2409.11689 | link |
2024-09-18 | Recurrent Interpolants for Probabilistic Time Series Prediction | Yu Chen et.al. | 2409.11684 | null |
2024-09-18 | SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation | Mingze Sun et.al. | 2409.11682 | link |
2024-09-18 | PainDiffusion: Can robot express pain? | Quang Tien Dam et.al. | 2409.11635 | null |
2024-09-17 | Context-Generative Default Policy for Bounded Rational Agent | Durgakant Pushp et.al. | 2409.11604 | null |
2024-09-17 | DiffESM: Conditional Emulation of Temperature and Precipitation in Earth System Models with 3D Diffusion Models | Seth Bassetti et.al. | 2409.11601 | null |
2024-09-17 | Using Physics Informed Generative Adversarial Networks to Model 3D porous media | Zihan Ren et.al. | 2409.11541 | null |
2024-09-20 | Machine Learning for Analyzing Atomic Force Microscopy (AFM) Images Generated from Polymer Blends | Aanish Paruchuri et.al. | 2409.11438 | link |
2024-09-17 | Ultrasound Image Enhancement with the Variance of Diffusion Models | Yuxin Zhang et.al. | 2409.11380 | link |
2024-09-17 | OSV: One Step is Enough for High-Quality Image to Video Generation | Xiaofeng Mao et.al. | 2409.11367 | null |
2024-09-17 | Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Gonzalo Martin Garcia et.al. | 2409.11355 | link |
2024-09-17 | OmniGen: Unified Image Generation | Shitao Xiao et.al. | 2409.11340 | link |
2024-09-17 | fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction | Jianxiong Gao et.al. | 2409.11315 | null |
2024-09-17 | DroneDiffusion: Robust Quadrotor Dynamics Learning with Diffusion Models | Avirup Das et.al. | 2409.11292 | null |
2024-09-19 | The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives | Samee Arif et.al. | 2409.11261 | link |
2024-09-17 | Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models | Tianqi Chen et.al. | 2409.11219 | null |
2024-09-13 | MAISI: Medical AI for Synthetic Imaging | Pengfei Guo et.al. | 2409.11169 | link |
2024-09-17 | Improving the Efficiency of Visually Augmented Language Models | Paula Ontalvilla et.al. | 2409.11148 | link |
2024-09-17 | High-Resolution Speech Restoration with Latent Diffusion Model | Tushar Dhyani et.al. | 2409.11145 | link |
2024-09-17 | In-situ measurements of light diffusion in an optically dense atomic ensemble | Antoine Glicenstein et.al. | 2409.11117 | null |
2024-09-17 | TacDiffusion: Force-domain Diffusion Policy for Precise Tactile Manipulation | Yansong Wu et.al. | 2409.11047 | null |
2024-09-17 | Enhanced segmentation of femoral bone metastasis in CT scans of patients using synthetic data generation with 3D diffusion models | Emile Saillard et.al. | 2409.11011 | null |
2024-09-17 | MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance | Debin Meng et.al. | 2409.11010 | link |
2024-09-17 | Edge-based Denoising Image Compression | Ryugo Morita et.al. | 2409.10978 | null |
2024-09-17 | Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending | Yongyang Pan et.al. | 2409.10958 | null |
2024-09-17 | EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer | Jiarui Hai et.al. | 2409.10819 | null |
2024-09-16 | Using Generative Models to Produce Realistic Populations of the United Kingdom Windstorms | Etron Yee Chun Tsoi et.al. | 2409.10696 | null |
2024-09-16 | Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models | Bingchen Liu et.al. | 2409.10695 | null |
2024-09-16 | Online Diffusion-Based 3D Occupancy Prediction at the Frontier with Probabilistic Map Reconciliation | Alec Reed et.al. | 2409.10681 | null |
2024-09-16 | Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection | Federico Betti et.al. | 2409.10597 | null |
2024-09-30 | Manifold-Constrained Nucleus-Level Denoising Diffusion Model for Structure-Based Drug Design | Shengchao Liu et.al. | 2409.10584 | null |
2024-09-15 | GLEAN: Generative Learning for Eliminating Adversarial Noise | Justin Lyu Kim et.al. | 2409.10578 | null |
2024-09-02 | Agentic Society: Merging skeleton from real world and texture from Large Language Model | Yuqi Bai et.al. | 2409.10550 | link |
2024-08-30 | Bridging User Dynamics: Transforming Sequential Recommendations with Schrödinger Bridge and Diffusion Models | Wenjia Xie et.al. | 2409.10522 | null |
2024-09-16 | Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation | Noah Buchanan et.al. | 2409.10494 | null |
2024-09-16 | SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing | Qi Qian et.al. | 2409.10476 | null |
2024-09-16 | MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion | Lehong Wu et.al. | 2409.10473 | null |
2024-09-16 | Mamba-ST: State Space Model for Efficient Style Transfer | Filippo Botti et.al. | 2409.10385 | link |
2024-09-16 | Taming Diffusion Models for Image Restoration: A Review | Ziwei Luo et.al. | 2409.10353 | null |
2024-09-16 | VAE-QWGAN: Improving Quantum GANs for High Resolution Image Generation | Aaron Mark Thomas et.al. | 2409.10339 | null |
2024-09-16 | Fairness, not Emotion, Drives Socioeconomic Decision Making | Rudra Mukhopadhyay et.al. | 2409.10322 | null |
2024-09-16 | On Synthetic Texture Datasets: Challenges, Creation, and Curation | Blaine Hoak et.al. | 2409.10297 | null |
2024-09-16 | DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis | Fa-Ting Hong et.al. | 2409.10281 | null |
2024-09-16 | RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models | Başak Melis Öcal et.al. | 2409.10180 | null |
2024-09-16 | PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion | Peng Li et.al. | 2409.10141 | null |
2024-09-16 | DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection | Kun Fang et.al. | 2409.10094 | null |
2024-09-16 | MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior | Weijing Tao et.al. | 2409.10090 | link |
2024-09-16 | Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models | Alexander Koch et.al. | 2409.10089 | null |
2024-09-16 | StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion | Yinghao Aaron Li et.al. | 2409.10058 | null |
2024-09-16 | Embodiment-Agnostic Action Planning via Object-Part Scene Flow | Weiliang Tang et.al. | 2409.10032 | null |
2024-09-16 | AttnMod: Attention-Based New Art Styles | Shih-Chieh Su et.al. | 2409.10028 | null |
2024-09-16 | Generalization of Optimal Geodesic Curvature Constrained Dubins’ Path on Sphere with Free Terminal Orientation | Deepak Prakash Kumar et.al. | 2409.09954 | null |
2024-09-15 | GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion | Vitor Guizilini et.al. | 2409.09896 | null |
2024-09-15 | Latent Diffusion Models for Controllable RNA Sequence Generation | Kaixuan Huang et.al. | 2409.09828 | null |
2024-09-15 | Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$ -divergence Minimization | Haoyuan Sun et.al. | 2409.09774 | null |
2024-09-15 | MFCLIP: Multi-modal Fine-grained CLIP for Generalizable Diffusion Face Forgery Detection | Yaning Zhang et.al. | 2409.09724 | link |
2024-09-15 | Finetuning CLIP to Reason about Pairwise Differences | Dylan Sam et.al. | 2409.09721 | link |
2024-09-15 | E-Commerce Inpainting with Mask Guidance in Controlnet for Reducing Overcompletion | Guandong Li et.al. | 2409.09681 | null |
2024-09-15 | EditBoard: Towards A Comprehensive Evaluation Benchmark for Text-based Video Editing Models | Yupeng Chen et.al. | 2409.09668 | link |
2024-09-15 | Conditional sampling within generative diffusion models | Zheng Zhao et.al. | 2409.09650 | link |
2024-09-15 | Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement | Yudong Yang et.al. | 2409.09642 | null |
2024-09-15 | HJ-sampler: A Bayesian sampler for inverse problems of a stochastic process by leveraging Hamilton-Jacobi PDEs and score-based generative models | Tingwei Meng et.al. | 2409.09614 | null |
2024-09-18 | DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion | Liao Shen et.al. | 2409.09605 | null |
2024-09-15 | Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models | Sahil Kuchlous et.al. | 2409.09569 | null |
2024-09-14 | Integrating Deep Unfolding with Direct Diffusion Bridges for Computed Tomography Reconstruction | Herman Verinaz-Jadan et.al. | 2409.09477 | null |
2024-09-14 | Prototypical Prompting for Text-to-image Person Re-identification | Shuanglin Yan et.al. | 2409.09427 | null |
2024-09-14 | Real-world Adversarial Defense against Patch Attacks based on Diffusion Model | Xingxing Wei et.al. | 2409.09406 | link |
2024-09-14 | Towards Diverse and Efficient Audio Captioning via Diffusion Models | Manjie Xu et.al. | 2409.09401 | null |
2024-09-14 | Schrödinger Bridge Flow for Unpaired Data Translation | Valentin De Bortoli et.al. | 2409.09347 | null |
2024-09-14 | Improving Robustness of Diffusion-Based Zero-Shot Speech Synthesis via Stable Formant Generation | Changjin Han et.al. | 2409.09311 | null |
2024-09-13 | Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss | Qifan Fu et.al. | 2409.09149 | link |
2024-09-13 | PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage | Denis Zavadski et.al. | 2409.09144 | link |
2024-09-13 | Neural Message Passing Induced by Energy-Constrained Diffusion | Qitian Wu et.al. | 2409.09111 | null |
2024-09-28 | Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation | Qingwen Bu et.al. | 2409.09016 | link |
2024-09-17 | A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis | Yohan Poirier-Ginter et.al. | 2409.08947 | null |
2024-09-13 | Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation | Guojun Liang et.al. | 2409.08917 | link |
2024-09-13 | Gaussian is All You Need: A Unified Framework for Solving Inverse Problems via Diffusion Posterior Sampling | Nebiyou Yismaw et.al. | 2409.08906 | null |
2024-09-13 | Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control | Carles Domingo-Enrich et.al. | 2409.08861 | null |
2024-09-13 | InstantDrag: Improving Interactivity in Drag-based Image Editing | Joonghyuk Shin et.al. | 2409.08857 | null |
2024-09-13 | DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s) | Yun Su Jeong et.al. | 2409.08850 | null |
2024-09-12 | DeCLIP: Decoding CLIP representations for deepfake localization | Stefan Smeu et.al. | 2409.08849 | link |
2024-09-13 | DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset | Jiawei Du et.al. | 2409.08731 | link |
2024-09-13 | STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment | Yong Ren et.al. | 2409.08601 | null |
2024-09-13 | LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling | Yubo Huang et.al. | 2409.08583 | null |
2024-09-13 | Generalization of Gershgorin’s theorem. Analysis and design of control laws | Igor Furtat et.al. | 2409.08576 | null |
2024-09-13 | DiffFAS: Face Anti-Spoofing via Generative Diffusion Models | Xinxu Ge et.al. | 2409.08572 | link |
2024-09-13 | Think Twice Before You Act: Improving Inverse Problem Solving With MCMC | Yaxuan Zhu et.al. | 2409.08551 | null |
2024-09-13 | GroundingBooth: Grounding Text-to-Image Customization | Zhexiao Xiong et.al. | 2409.08520 | null |
2024-09-13 | Enhancing Privacy in ControlNet and Stable Diffusion via Split Learning | Dixi Yao et.al. | 2409.08503 | null |
2024-09-13 | Cross-conditioned Diffusion Model for Medical Image to Image Translation | Zhaohu Xing et.al. | 2409.08500 | null |
2024-09-13 | Sub-graph Based Diffusion Model for Link Prediction | Hang Li et.al. | 2409.08487 | null |
2024-09-13 | Risks When Sharing LoRA Fine-Tuned Diffusion Model Weights | Dixi Yao et.al. | 2409.08482 | null |
2024-09-13 | Integrating Neural Operators with Diffusion Models Improves Spectral Representation in Turbulence Modeling | Vivek Oommen et.al. | 2409.08477 | link |
2024-09-12 | SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer | Helin Wang et.al. | 2409.08425 | link |
2024-09-12 | Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning | Hanyang Zhao et.al. | 2409.08400 | null |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2024-09-12 | Click2Mask: Local Editing with Dynamic Mask Generation | Omer Regev et.al. | 2409.08272 | link |
2024-09-12 | DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer | Runjia Li et.al. | 2409.08271 | null |
2024-09-12 | Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation | Samanta Rodriguez et.al. | 2409.08269 | null |
2024-09-12 | Improving Text-guided Object Inpainting with Semantic Pre-inpainting | Yifu Chen et.al. | 2409.08260 | link |
2024-09-12 | Improving Virtual Try-On with Garment-focused Diffusion Models | Siqi Wan et.al. | 2409.08258 | link |
2024-09-12 | LoRID: Low-Rank Iterative Diffusion for Adversarial Purification | Geigh Zollicoffer et.al. | 2409.08255 | null |
2024-09-12 | Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding | Hongyu Li et.al. | 2409.08251 | null |
2024-09-12 | TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder | NaHyeon Park et.al. | 2409.08248 | link |
2024-09-19 | IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation | Yinwei Wu et.al. | 2409.08240 | null |
2024-09-12 | LT3SD: Latent Trees for 3D Scene Diffusion | Quan Meng et.al. | 2409.08215 | null |
2024-09-12 | VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis | Hao Chen et.al. | 2409.08207 | null |
2024-09-27 | High-Frequency Anti-DreamBooth: Robust Defense against Personalized Image Synthesis | Takuto Onikubo et.al. | 2409.08167 | link |
2024-09-12 | MagicStyle: Portrait Stylization Based on Reference Image | Zhaoli Deng et.al. | 2409.08156 | null |
2024-09-12 | EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance | Zicheng Duan et.al. | 2409.08091 | link |
2024-09-12 | Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation | Junsung Lee et.al. | 2409.08077 | null |
2024-09-12 | AI-accelerated discovery of high critical temperature superconductors | Xiao-Qi Han et.al. | 2409.08065 | link |
2024-09-12 | Scribble-Guided Diffusion for Training-free Text-to-Image Generation | Seonho Lee et.al. | 2409.08026 | link |
2024-09-13 | Estimating Atmospheric Variables from Digital Typhoon Satellite Images via Conditional Denoising Diffusion Models | Zhangyue Ling et.al. | 2409.07961 | link |
2024-09-12 | Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models | Nikolai L. Kühne et.al. | 2409.07936 | link |
2024-09-12 | UGAD: Universal Generative AI Detector utilizing Frequency Fingerprints | Inzamamul Alam et.al. | 2409.07913 | null |
2024-09-12 | XMOL: Explainable Multi-property Optimization of Molecules | Aye Phyu Phyu Aung et.al. | 2409.07786 | null |
2024-09-12 | DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing | Zhenyuan Dong et.al. | 2409.07756 | link |
2024-09-11 | DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures | Steven Hogue et.al. | 2409.07649 | null |
2024-09-04 | MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model | Junjie Li et.al. | 2409.07486 | link |
2024-08-27 | Reflective Human-Machine Co-adaptation for Enhanced Text-to-Image Generation Dialogue System | Yuheng Feng et.al. | 2409.07464 | null |
2024-09-11 | DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Haibo Yang et.al. | 2409.07454 | null |
2024-09-11 | Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models | Haibo Yang et.al. | 2409.07452 | link |
2024-09-11 | FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process | Yang Luo et.al. | 2409.07451 | null |
2024-09-11 | StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos | Sijie Zhao et.al. | 2409.07447 | null |
2024-09-11 | Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy | Somayeh Pakdelmoez et.al. | 2409.07422 | null |
2024-09-11 | Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging | Yunzhen Wang et.al. | 2409.07417 | null |
2024-09-11 | Training-Free Guidance for Discrete Diffusion Models for Molecular Generation | Thomas J. Kerby et.al. | 2409.07359 | null |
2024-09-11 | Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching | Eugenio Chisari et.al. | 2409.07343 | null |
2024-09-11 | Efficient and Unbiased Sampling of Boltzmann Distributions via Consistency Models | Fengzhe Zhang et.al. | 2409.07323 | null |
2024-09-11 | Exploring User-level Gradient Inversion with a Diffusion Prior | Zhuohang Li et.al. | 2409.07291 | null |
2024-09-27 | CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals | Weixiang Gao et.al. | 2409.07271 | link |
2024-09-11 | Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models | Sanoojan Baliah et.al. | 2409.07269 | link |
2024-09-11 | EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion | Jian Zhang et.al. | 2409.07255 | link |
2024-09-12 | Alignment of Diffusion Models: Fundamentals, Challenges, and Future | Buhua Liu et.al. | 2409.07253 | link |
2024-09-11 | Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning | Yingling Lu et.al. | 2409.07238 | link |
2024-09-11 | Phy124: Fast Physics-Driven 4D Content Generation from a Single Image | Jiajing Lin et.al. | 2409.07179 | null |
2024-09-11 | Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models | Jiahang Cao et.al. | 2409.07163 | null |
2024-09-11 | MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis | Hanyu Jiang et.al. | 2409.07129 | null |
2024-09-11 | Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education | Ali Forootani et.al. | 2409.07110 | link |
2024-09-11 | From optimal score matching to optimal sampling | Zehao Dou et.al. | 2409.07032 | null |
2024-09-11 | CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion | Joshua Kazdan et.al. | 2409.07025 | null |
2024-09-11 | Towards Predicting Temporal Changes in a Patient’s Chest X-ray Images based on Electronic Health Records | Daeun Kyung et.al. | 2409.07012 | link |
2024-09-13 | ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics | Xiaomin Lin et.al. | 2409.07003 | null |
2024-09-11 | AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models | Boming Miao et.al. | 2409.07002 | null |
2024-09-10 | Human Motion Synthesis_ A Diffusion Approach for Motion Stitching and In-Betweening | Michael Adewole et.al. | 2409.06791 | null |
2024-09-10 | Generative Hierarchical Materials Search | Sherry Yang et.al. | 2409.06762 | null |
2024-08-26 | FCDM: Sparse-view Sinogram Inpainting with Frequency Domain Convolution Enhanced Diffusion Models | Jiaze E et.al. | 2409.06714 | null |
2024-09-10 | DANCE: Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images | Taslim Murad et.al. | 2409.06694 | null |
2024-09-10 | SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation | Teng Hu et.al. | 2409.06633 | null |
2024-09-10 | PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation | Ginger Delmas et.al. | 2409.06535 | null |
2024-09-09 | Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models | Rohit Jena et.al. | 2409.06493 | null |
2024-09-10 | Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models | Xin Jing et.al. | 2409.06451 | null |
2024-09-11 | AMNS: Attention-Weighted Selective Mask and Noise Label Suppression for Text-to-Image Person Retrieval | Runqing Zhang et.al. | 2409.06385 | null |
2024-09-10 | Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition | Junzheng Zhang et.al. | 2409.06371 | null |
2024-09-26 | What happens to diffusion model likelihood when your model is conditional? | Mattias Cross et.al. | 2409.06364 | null |
2024-09-10 | DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement | Jia-Wei Liao et.al. | 2409.06355 | null |
2024-09-10 | G3PT: Unleash the power of Autoregressive Modeling in 3D Generation via Cross-scale Querying Transformer | Jinzhi Zhang et.al. | 2409.06322 | null |
2024-09-13 | Multi-Source Music Generation with Latent Diffusion | Zhongweiyang Xu et.al. | 2409.06190 | link |
2024-09-11 | MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control | Yining Yao et.al. | 2409.06189 | null |
2024-09-10 | EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation | Nischal Khanal et.al. | 2409.06183 | link |
2024-09-12 | Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer | Michele Mancusi et.al. | 2409.06096 | null |
2024-09-09 | SVS-GAN: Leveraging GANs for Semantic Video Synthesis | Khaled M. Seyam et.al. | 2409.06074 | null |
2024-09-09 | DiffusionPen: Towards Controlling the Style of Handwritten Text Generation | Konstantina Nikolaidou et.al. | 2409.06065 | link |
2024-09-12 | Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance | Quang-Huy Che et.al. | 2409.06002 | null |
2024-09-09 | CoDiCast: Conditional Diffusion Model for Weather Prediction with Uncertainty Quantification | Jimeng Shi et.al. | 2409.05975 | link |
2024-09-09 | SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values | Chengwei Sun et.al. | 2409.05926 | null |
2024-08-23 | Enabling Distributed Generative Artificial Intelligence in 6G: Mobile Edge Generation | Ruikang Zhong et.al. | 2409.05870 | null |
2024-09-09 | Enhancing Preference-based Linear Bandits via Human Response Time | Shen Li et.al. | 2409.05798 | null |
2024-09-14 | Vector Quantized Diffusion Model Based Speech Bandwidth Extension | Yuan Fang et.al. | 2409.05784 | null |
2024-09-09 | AS-Speech: Adaptive Style For Speech Synthesis | Zhipeng Li et.al. | 2409.05730 | null |
2024-09-09 | pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning | Jiahao Lai et.al. | 2409.05701 | null |
2024-09-09 | Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models | Aakash Sen Sharma et.al. | 2409.05668 | null |
2024-09-09 | Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Zhao Shan et.al. | 2409.05622 | null |
2024-09-11 | CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization | Nan Chen et.al. | 2409.05606 | null |
2024-09-12 | DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation | Wei Wu et.al. | 2409.05463 | null |
2024-09-09 | CipherDM: Secure Three-Party Inference for Diffusion Model Sampling | Xin Zhao et.al. | 2409.05414 | null |
2024-09-09 | Sequential Posterior Sampling with Diffusion Models | Tristan S. W. Stevens et.al. | 2409.05399 | null |
2024-09-09 | TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors | Yichuan Mo et.al. | 2409.05294 | link |
2024-09-08 | Can OOD Object Detectors Learn from Foundation Models? | Jiahui Liu et.al. | 2409.05162 | link |
2024-09-11 | Nuclear transparencies with a two-step process of the $A(e,e’π^+)$ reaction | Tae Keun Choi et.al. | 2409.05129 | null |
2024-09-15 | A Survey on Diffusion Models for Recommender Systems | Jianghao Lin et.al. | 2409.05033 | link |
2024-09-07 | Plug-and-Hide: Provable and Adjustable Diffusion Generative Steganography | Jiahao Zhu et.al. | 2409.04878 | null |
2024-09-07 | Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation | Jiaxin Cheng et.al. | 2409.04847 | link |
2024-09-07 | Reward-Directed Score-Based Diffusion Models via q-Learning | Xuefeng Gao et.al. | 2409.04832 | null |
2024-09-07 | SpotActor: Training-Free Layout-Controlled Consistent Image Generation | Jiahao Wang et.al. | 2409.04801 | null |
2024-09-07 | Training-Free Style Consistent Image Synthesis with Condition and Mask Guidance in E-Commerce | Guandong Li et.al. | 2409.04750 | null |
2024-09-07 | Multi-Conditioned Denoising Diffusion Probabilistic Model (mDDPM) for Medical Image Synthesis | Arjun Krishna et.al. | 2409.04670 | null |
2024-09-11 | Thinking Outside the BBox: Unconstrained Generative Object Compositing | Gemma Canet Tarrés et.al. | 2409.04559 | null |
2024-09-10 | Diff-INR: Generative Regularization for Electrical Impedance Tomography | Bowen Tong et.al. | 2409.04494 | null |
2024-09-06 | VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Yecheng Wu et.al. | 2409.04429 | link |
2024-09-06 | Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques | Davide Clode da Silva et.al. | 2409.04424 | null |
2024-09-06 | Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation | Zhuoyan Luo et.al. | 2409.04410 | link |
2024-09-06 | How Fair is Your Diffusion Recommender Model? | Daniele Malitesta et.al. | 2409.04339 | null |
2024-09-06 | Random effects estimation in a fractional diffusion model based on continuous observations | Nesrine Chebli et.al. | 2409.04331 | null |
2024-09-06 | Breaking the Brownian Barrier: Models and Manifestations of Molecular Diffusion in Complex Fluids | Harish Srinivasan et.al. | 2409.04199 | null |
2024-09-06 | GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers | Lorenza Prospero et.al. | 2409.04196 | link |
2024-09-06 | Secure Traffic Sign Recognition: An Attention-Enabled Universal Image Inpainting Mechanism against Light Patch Attacks | Hangcheng Cao et.al. | 2409.04133 | null |
2024-09-06 | D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection | Kentaro Hirahara et.al. | 2409.04060 | null |
2024-09-06 | Qihoo-T2X: An Efficiency-Focused Diffusion Transformer via Proxy Tokens for Text-to-Any-Task | Jing Wang et.al. | 2409.04005 | link |
2024-09-11 | One-Shot Diffusion Mimicker for Handwritten Text Generation | Gang Dai et.al. | 2409.04004 | link |
2024-09-06 | DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes | Jianbiao Mei et.al. | 2409.04003 | link |
2024-09-06 | Automating Robot Failure Recovery Using Vision-Language Models With Optimized Prompts | Hongyi Chen et.al. | 2409.03966 | null |
2024-09-05 | Data-Efficient Generation for Dataset Distillation | Zhe Li et.al. | 2409.03929 | null |
2024-09-05 | Generating High Dimensional User-Specific Wireless Channels using Diffusion Models | Taekyun Lee et.al. | 2409.03924 | null |
2024-09-05 | Neural Entropy | Akhil Premkumar et.al. | 2409.03817 | null |
2024-09-04 | Protecting Activity Sensing Data Privacy Using Hierarchical Information Dissociation | Guangjing Wang et.al. | 2409.03796 | null |
2024-09-05 | Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Yunze Man et.al. | 2409.03757 | link |
2024-09-05 | ArtiFade: Learning to Generate High-quality Subject from Blemished Images | Shuya Yang et.al. | 2409.03745 | null |
2024-09-05 | Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation | Slava Elizarov et.al. | 2409.03718 | null |
2024-09-05 | RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images | Benzhi Wang et.al. | 2409.03644 | link |
2024-09-20 | Toward Any-to-Any Emotion Voice Conversion using Disentangled Diffusion Framework | Hsing-Hang Chou et.al. | 2409.03636 | null |
2024-09-05 | Generalizing Linear Graphs and Bond Graph Models with Hetero-functional Graphs for System-of-Systems Engineering Applications | Ehsanoddin Ghorbanichemazkati et.al. | 2409.03630 | null |
2024-09-05 | TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces | Bernardo Biesseck et.al. | 2409.03600 | link |
2024-09-05 | DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture | Qianlong Xiang et.al. | 2409.03550 | link |
2024-09-05 | Blended Latent Diffusion under Attention Control for Real-World Video Editing | Deyin Liu et.al. | 2409.03514 | null |
2024-09-05 | Non-Uniform Illumination Attack for Fooling Convolutional Neural Networks | Akshay Jain et.al. | 2409.03458 | link |
2024-09-18 | LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors | Hanyang Yu et.al. | 2409.03456 | null |
2024-09-05 | Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration | Pei Wang et.al. | 2409.03455 | null |
2024-09-05 | Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities | Wei Lu et.al. | 2409.03444 | link |
2024-09-09 | RoVi-Aug: Robot and Viewpoint Augmentation for Cross-Embodiment Robot Learning | Lawrence Yunliang Chen et.al. | 2409.03403 | null |
2024-09-05 | Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning | Huaxi Huang et.al. | 2409.03326 | null |
2024-09-05 | SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model | Weipeng Tan et.al. | 2409.03270 | null |
2024-09-05 | Enhancing digital core image resolution using optimal upscaling algorithm: with application to paired SEM images | Shaohua You et.al. | 2409.03265 | null |
2024-09-05 | RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry | Zhaowei Wang et.al. | 2409.03198 | null |
2024-08-22 | FIDAVL: Fake Image Detection and Attribution using Vision-Language Model | Mamadou Keita et.al. | 2409.03109 | link |
2024-09-04 | Spatial Diffusion for Cell Layout Generation | Chen Li et.al. | 2409.03106 | link |
2024-09-04 | How DREAMS are made: Emulating Satellite Galaxy and Subhalo Populations with Diffusion Models and Point Clouds | Tri Nguyen et.al. | 2409.02980 | link |
2024-09-09 | HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Xinyu Liu et.al. | 2409.02919 | link |
2024-09-22 | Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling | Kaiwen Zheng et.al. | 2409.02908 | null |
2024-09-04 | Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu et.al. | 2409.02851 | link |
2024-09-04 | Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model | Tornike Karchkhadze et.al. | 2409.02845 | null |
2024-09-04 | Independence Constrained Disentangled Representation Learning from Epistemological Perspective | Ruoyu Wang et.al. | 2409.02672 | null |
2024-09-04 | Generalized Individual Q-learning for Polymatrix Games with Partial Observations | Ahmed Said Donmez et.al. | 2409.02663 | null |
2024-09-04 | PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation | Jun Ling et.al. | 2409.02657 | null |
2024-09-04 | Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects | Kyungmin Jo et.al. | 2409.02653 | null |
2024-09-04 | MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos | Junyi Ma et.al. | 2409.02638 | null |
2024-09-05 | Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency | Jianwen Jiang et.al. | 2409.02634 | null |
2024-09-04 | Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models | Pujing Yang et.al. | 2409.02597 | null |
2024-09-04 | Solving Video Inverse Problems Using Image Diffusion Models | Taesung Kwon et.al. | 2409.02574 | null |
2024-09-04 | StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models | Wen Li et.al. | 2409.02543 | link |
2024-09-04 | Sample what you cant compress | Vighnesh Birodkar et.al. | 2409.02529 | null |
2024-09-04 | Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal | Jifeng Hu et.al. | 2409.02512 | link |
2024-09-04 | A Learnable Color Correction Matrix for RAW Reconstruction | Anqi Liu et.al. | 2409.02497 | null |
2024-09-04 | Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis | Aishwarya Agarwal et.al. | 2409.02429 | null |
2024-09-04 | Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering | Peng Wang et.al. | 2409.02426 | link |
2024-09-10 | Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing | Siyi Chen et.al. | 2409.02374 | link |
2024-09-03 | QID $^2$ : An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data | Zijian Chen et.al. | 2409.02309 | null |
2024-09-03 | FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation | Takuhiro Kaneko et.al. | 2409.02245 | null |
2024-09-02 | A Financial Time Series Denoiser Based on Diffusion Model | Zhuohan Wang et.al. | 2409.02138 | null |
2024-09-01 | TrajWeaver: Trajectory Recovery with State Propagation Diffusion Model | Jinming Wang et.al. | 2409.02124 | null |
2024-09-05 | LinFusion: 1 GPU, 1 Minute, 16K Image | Songhua Liu et.al. | 2409.02097 | link |
2024-09-03 | DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos | Wenbo Hu et.al. | 2409.02095 | link |
2024-09-03 | Probing Noncentrosymmetric 2D Materials by Fourier Space Second Harmonic Imaging | Lucas Lafeta et.al. | 2409.02071 | null |
2024-09-03 | ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis | Wangbo Yu et.al. | 2409.02048 | null |
2024-09-03 | Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment | Konstantin Schall et.al. | 2409.01936 | link |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-05 | CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention | Gaojie Lin et.al. | 2409.01876 | null |
2024-09-07 | Towards Generative Class Prompt Learning for Fine-grained Visual Recognition | Soumitri Chattopadhyay et.al. | 2409.01835 | link |
2024-09-03 | Classifier-Free Diffusion-Based Weakly-Supervised Approach for Health Indicator Derivation in Rotating Machines: Advancing Early Fault Detection and Condition Monitoring | Wenyang Hu et.al. | 2409.01676 | null |
2024-09-18 | A novel machine learning method to detect double- $Λ$ hypernuclear events in nuclear emulsions | Yan He et.al. | 2409.01657 | null |
2024-09-03 | Constraining anisotropic diffusion between Geminga and Earth with the cosmic-ray electron and positron spectrum | Junji Xia et.al. | 2409.01653 | null |
2024-09-03 | Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement | Kun Zhou et.al. | 2409.01641 | link |
2024-09-03 | A Time-Intensity Aware Pipeline for Generating Late-Stage Breast DCE-MRI using Generative Adversarial Models | Ruben D. Fonnegra et.al. | 2409.01596 | null |
2024-09-03 | DiVE: DiT-based Video Generation with Enhanced Control | Junpeng Jiang et.al. | 2409.01595 | null |
2024-09-03 | CT-SDM: A Sampling Diffusion Model for Sparse-View CT Reconstruction across All Sampling Rates | Liutao Yang et.al. | 2409.01571 | null |
2024-09-02 | AMG: Avatar Motion Guided Video Generation | Zhangsihao Yang et.al. | 2409.01502 | link |
2024-09-07 | EarthGen: Generating the World from Top-Down Views | Ansh Sharma et.al. | 2409.01491 | link |
2024-09-14 | Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization | Gao Tianci et.al. | 2409.01427 | link |
2024-09-02 | Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance | Cunzheng Wang et.al. | 2409.01347 | link |
2024-09-02 | SPDiffusion: Semantic Protection Diffusion for Multi-concept Text-to-image Generation | Yang Zhang et.al. | 2409.01327 | null |
2024-09-09 | Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing | Vadim Titov et.al. | 2409.01322 | link |
2024-09-02 | Disentangling Mean Embeddings for Better Diagnostics of Image Generators | Sebastian G. Gruber et.al. | 2409.01314 | link |
2024-09-09 | OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model | Liuhan Chen et.al. | 2409.01199 | link |
2024-09-04 | Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning | Jinglin Liang et.al. | 2409.01128 | link |
2024-09-14 | DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing | Xiaolong Wang et.al. | 2409.01086 | null |
2024-09-02 | From Bird’s-Eye to Street View: Crafting Diverse and Condition-Aligned Images with Latent Diffusion Model | Xiaojie Xu et.al. | 2409.01014 | null |
2024-09-12 | 3D Priors-Guided Diffusion for Blind Face Restoration | Xiaobin Lu et.al. | 2409.00991 | link |
2024-09-01 | Diffusion based multi-domain neuroimaging harmonization method with preservation of anatomical details | Haoyu Lan et.al. | 2409.00807 | null |
2024-09-01 | Zero-Shot Paragraph-level Handwriting Imitation with Latent Diffusion Models | Martin Mayr et.al. | 2409.00786 | null |
2024-09-01 | Generalized Multi-hop Traffic Pressure for Heterogeneous Traffic Perimeter Control | Xiaocan Li et.al. | 2409.00753 | null |
2024-09-01 | LPUWF-LDM: Enhanced Latent Diffusion Model for Precise Late-phase UWF-FA Generation on Limited Dataset | Zhaojie Fang et.al. | 2409.00726 | link |
2024-09-01 | ReMOVE: A Reference-free Metric for Object Erasure | Aditya Chandrasekar et.al. | 2409.00707 | null |
2024-09-01 | Seed-to-Seed: Image Translation in Diffusion Seed Space | Or Greenberg et.al. | 2409.00654 | null |
2024-09-01 | McCaD: Multi-Contrast MRI Conditioned, Adaptive Adversarial Diffusion Model for High-Fidelity MRI Synthesis | Sanuwani Dayarathna et.al. | 2409.00585 | null |
2024-08-31 | Compositional 3D-aware Video Generation with LLM Director | Hanxin Zhu et.al. | 2409.00558 | null |
2024-08-31 | Data Augmentation for Image Classification using Generative AI | Fazle Rahat et.al. | 2409.00547 | null |
2024-08-31 | EraseDraw: Learning to Insert Objects by Erasing Them from Images | Alper Canberk et.al. | 2409.00522 | null |
2024-08-31 | RevCD – Reversed Conditional Diffusion for Generalized Zero-Shot Learning | William Heyden et.al. | 2409.00511 | null |
2024-08-31 | Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization | Vage Egiazarian et.al. | 2409.00492 | null |
2024-08-31 | Towards understanding Diffusion Models (on Graphs) | Solveig Klepper et.al. | 2409.00374 | null |
2024-09-12 | AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation | Zanlin Ni et.al. | 2409.00342 | link |
2024-08-31 | LightPure: Realtime Adversarial Image Purification for Mobile Devices Using Diffusion Models | Hossein Khalili et.al. | 2409.00340 | null |
2024-08-31 | Training-Free Sketch-Guided Diffusion with Latent Optimization | Sandra Zhang Ding et.al. | 2409.00313 | null |
2024-08-30 | Spatially-Aware Diffusion Models with Cross-Attention for Global Field Reconstruction with Sparse Observations | Yilin Zhuang et.al. | 2409.00230 | link |
2024-08-27 | Evaluating the Impact of Multiple DER Aggregators on Wholesale Energy Markets: A Hybrid Mean Field Approach | Jun He et.al. | 2409.00107 | null |
2024-09-04 | Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation | Mohammad Nadeem et.al. | 2409.00105 | null |
2024-08-19 | A More Accurate Approximation of Activation Function with Few Spikes Neurons | Dayena Jeong et.al. | 2409.00044 | null |
2024-08-16 | DivDiff: A Conditional Diffusion Model for Diverse Human Motion Prediction | Hua Yu et.al. | 2409.00014 | null |
2024-08-30 | SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists | Raoyuan Zhao et.al. | 2408.17437 | link |
2024-08-30 | CinePreGen: Camera Controllable Video Previsualization via Engine-powered Diffusion | Yiran Chen et.al. | 2408.17424 | null |
2024-08-30 | Subspace Diffusion Posterior Sampling for Travel-Time Tomography | Xiang Cao et.al. | 2408.17333 | null |
2024-08-30 | Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution | Yixin Wu et.al. | 2408.17285 | null |
2024-08-30 | VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers | Juncan Deng et.al. | 2408.17131 | null |
2024-09-02 | RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance | Avideep Mukherjee et.al. | 2408.17095 | null |
2024-08-30 | FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition | Chen Hu et.al. | 2408.17090 | link |
2024-09-02 | Instant Adversarial Purification with Adversarial Consistency Distillation | Chun Tong Lei et.al. | 2408.17064 | null |
2024-08-30 | Text-to-Image Generation Via Energy-Based CLIP | Roy Ganz et.al. | 2408.17046 | null |
2024-08-30 | AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding | Yonghui Wang et.al. | 2408.16986 | link |
2024-08-30 | Contrastive Learning with Synthetic Positives | Dewen Zeng et.al. | 2408.16965 | link |
2024-09-02 | Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis | Theodoros Kouzelis et.al. | 2408.16845 | null |
2024-08-29 | STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models | Koushik Srivatsan et.al. | 2408.16807 | link |
2024-08-29 | ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model | Fangfu Liu et.al. | 2408.16767 | null |
2024-09-04 | CSGO: Content-Style Composition in Text-to-Image Generation | Peng Xing et.al. | 2408.16766 | null |
2024-08-29 | One-Shot Learning Meets Depth Diffusion in Multi-Object Videos | Anisha Jain et.al. | 2408.16704 | null |
2024-08-29 | GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models | Moreno D’Incà et.al. | 2408.16700 | link |
2024-08-29 | DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Yongjie Fu et.al. | 2408.16647 | null |
2024-09-02 | RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model | Zhuan Shi et.al. | 2408.16634 | null |
2024-08-29 | A Score-based Generative Solver for PDE-constrained Inverse Problems with Complex Priors | Yankun Hong et.al. | 2408.16626 | null |
2024-08-29 | GRPose: Learning Graph Relations for Human Image Generation with Pose Priors | Xiangchen Yin et.al. | 2408.16540 | link |
2024-08-28 | Are Pose Estimators Ready for the Open World? STAGE: Synthetic Data Generation Toolkit for Auditing 3D Human Pose Estimators | Nikita Kister et.al. | 2408.16536 | null |
2024-08-29 | Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation | Xiaoyu Jin et.al. | 2408.16506 | null |
2024-08-29 | Spiking Diffusion Models | Jiahang Cao et.al. | 2408.16467 | link |
2024-08-29 | What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer | Chaeyeon Chung et.al. | 2408.16450 | link |
2024-08-29 | COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation | Jiefeng Li et.al. | 2408.16426 | null |
2024-08-29 | Self-Improving Diffusion Models with Synthetic Data | Sina Alemohammad et.al. | 2408.16333 | null |
2024-08-29 | ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding | Minghang Zheng et.al. | 2408.16314 | link |
2024-08-29 | Enhanced Control for Diffusion Bridge in Image Restoration | Conghan Yue et.al. | 2408.16303 | link |
2024-08-29 | Improving Diffusion-based Data Augmentation with Inversion Spherical Interpolation | Yanghao Wang et.al. | 2408.16266 | link |
2024-08-29 | Advancing Architectural Floorplan Design with Geometry-enhanced Graph Diffusion | Sizhe Hu et.al. | 2408.16258 | link |
2024-09-03 | Error analysis of finite element method for nonlocal diffusion model | Zuoqiang Shi et.al. | 2408.16243 | null |
2024-08-29 | Enhancing Conditional Image Generation with Explainable Latent Space Manipulation | Kshitij Pathania et.al. | 2408.16232 | link |
2024-08-29 | Anchor-Controlled Generative Adversarial Network for High-Fidelity Electromagnetic and Structurally Diverse Metasurface Design | Yunhui Zeng et.al. | 2408.16231 | null |
2024-08-28 | TEDRA: Text-based Editing of Dynamic and Photoreal Actors | Basavaraj Sunagad et.al. | 2408.15995 | null |
2024-08-28 | Distribution Backtracking Builds A Faster Convergence Trajectory for One-step Diffusion Distillation | Shengyuan Zhang et.al. | 2408.15991 | link |
2024-08-28 | CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization | Feize Wu et.al. | 2408.15914 | null |
2024-08-28 | Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones | Carlos Plou et.al. | 2408.15899 | null |
2024-09-13 | Airfoil Diffusion: Denoising Diffusion Model For Conditional Airfoil Generation | Reid Graves et.al. | 2408.15898 | link |
2024-08-28 | Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data | Ayodeji Ijishakin et.al. | 2408.15890 | null |
2024-08-28 | GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model | Yongjie Fu et.al. | 2408.15868 | null |
2024-08-28 | Defending Text-to-image Diffusion Models: Surprising Efficacy of Textual Perturbations Against Backdoor Attacks | Oscar Chew et.al. | 2408.15721 | null |
2024-08-28 | Synthetic Forehead-creases Biometric Generation for Reliable User Verification | Abhishek Tandon et.al. | 2408.15693 | link |
2024-08-28 | Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas | Fabio Quattrini et.al. | 2408.15660 | link |
2024-08-28 | Grand canonical generative diffusion model for crystalline phases and grain boundaries | Bo Lei et.al. | 2408.15601 | null |
2024-08-28 | MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning | Yifu Yuan et.al. | 2408.15501 | null |
2024-08-28 | On the implementation of linear finite element method for nonlocal diffusion model over 2D domain | Zuoqiang Shi et.al. | 2408.15472 | null |
2024-09-04 | Hand1000: Generating Realistic Hands from Text with Only 1,000 Images | Haozhuo Zhang et.al. | 2408.15461 | null |
2024-08-28 | Avoiding Generative Model Writer’s Block With Embedding Nudging | Ali Zand et.al. | 2408.15450 | null |
2024-08-27 | Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution | Marcelo dos Santos et.al. | 2408.15386 | link |
2024-08-22 | 3D Photon Counting CT Image Super-Resolution Using Conditional Diffusion Model | Chuang Niu et.al. | 2408.15283 | null |
2024-08-10 | Civiverse: A Dataset for Analyzing User Engagement with Open-Source Text-to-Image Models | Maria-Teresa De Rosa Palmini et.al. | 2408.15261 | null |
2024-08-09 | A generative foundation model for five-class sleep staging with arbitrary sensor input | Hans van Gorp et.al. | 2408.15253 | null |
2024-08-27 | GenRec: Unifying Video Generation and Recognition with Diffusion Models | Zejia Weng et.al. | 2408.15241 | link |
2024-08-27 | Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation | Xiaojuan Wang et.al. | 2408.15239 | null |
2024-08-27 | Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance | Weiyi Zhang et.al. | 2408.15217 | link |
2024-08-27 | Simulation of Stochastic Discrete Dislocation Dynamics in Ductile Vs Brittle Materials | Santosh Chhetri et.al. | 2408.15157 | null |
2024-08-27 | DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays | Yiran Sun et.al. | 2408.15118 | link |
2024-08-27 | Constrained Diffusion Models via Dual Training | Shervin Khalafi et.al. | 2408.15094 | null |
2024-08-27 | LN-Gen: Rectal Lymph Nodes Generation via Anatomical Features | Weidong Guo et.al. | 2408.14977 | null |
2024-08-27 | MegActor- $Σ$ : Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion Transformer | Shurong Yang et.al. | 2408.14975 | null |
2024-08-27 | MeshUp: Multi-Target Mesh Deformation via Blended Score Distillation | Hyunwoo Kim et.al. | 2408.14899 | null |
2024-08-27 | DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose | Yusuke Yoshiyasu et.al. | 2408.14860 | null |
2024-09-09 | Diffusion-Occ: 3D Point Cloud Completion via Occupancy Diffusion | Guoqing Zhang et.al. | 2408.14846 | null |
2024-08-27 | Diffusion based Semantic Outlier Generation via Nuisance Awareness for Out-of-Distribution Detection | Suhee Yoon et.al. | 2408.14841 | null |
2024-08-27 | Diffusion Models Are Real-Time Game Engines | Dani Valevski et.al. | 2408.14837 | null |
2024-08-27 | Alfie: Democratising RGBA Image Generation With No $$$ | Fabio Quattrini et.al. | 2408.14826 | link |
2024-08-27 | Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation | Abdelrahman Eldesokey et.al. | 2408.14819 | null |
2024-08-27 | A versatile informative diffusion model for single-cell ATAC-seq data generation and analysis | Lei Huang et.al. | 2408.14801 | null |
2024-08-27 | CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis | Weijia Li et.al. | 2408.14765 | null |
2024-08-27 | Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation | Qiaoxin Li et.al. | 2408.14754 | null |
2024-08-27 | Learning Differentially Private Diffusion Models via Stochastic Adversarial Distillation | Bochao Liu et.al. | 2408.14738 | null |
2024-08-27 | OctFusion: Octree-based Diffusion Models for 3D Shape Generation | Bojun Xiong et.al. | 2408.14732 | link |
2024-08-26 | Global analysis of the extended cosmic-ray decreases observed with world-wide networks of neutron monitors and muon detectors; temporal variation of the rigidity spectrum and its implication | K. Munakata et.al. | 2408.14696 | null |
2024-08-26 | DIAGen: Diverse Image Augmentation with Generative Models | Tobias Lingenberg et.al. | 2408.14584 | link |
2024-08-25 | Variational autoencoder-based neural network model compression | Liang Cheng et.al. | 2408.14513 | null |
2024-08-26 | K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences | Zhikai Li et.al. | 2408.14468 | null |
2024-08-26 | GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal Conditioned Policy | Peiyan Li et.al. | 2408.14368 | link |
2024-08-29 | Dual-Domain CLIP-Assisted Residual Optimization Perception Model for Metal Artifact Reduction | Xinrui Zhang et.al. | 2408.14342 | null |
2024-09-03 | Foundation Models for Music: A Survey | Yinghao Ma et.al. | 2408.14340 | link |
2024-08-26 | ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty | Xindi Wu et.al. | 2408.14339 | null |
2024-08-26 | TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation | Anh-Dzung Doan et.al. | 2408.14227 | link |
2024-08-26 | MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement | Xu He et.al. | 2408.14211 | null |
2024-08-26 | Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving | Yu Yang et.al. | 2408.14197 | null |
2024-08-27 | SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Trung Dao et.al. | 2408.14176 | link |
2024-08-26 | Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models | Chaohua Shi et.al. | 2408.14135 | null |
2024-08-28 | SurGen: Text-Guided Diffusion Model for Surgical Video Generation | Joseph Cho et.al. | 2408.14028 | null |
2024-08-26 | Pixel-Aligned Multi-View Generation with Depth Guided Decoder | Zhenggang Tang et.al. | 2408.14016 | null |
2024-08-25 | ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Models | Yeji Park et.al. | 2408.13906 | link |
2024-08-27 | RT-Attack: Jailbreaking Text-to-Image Models via Random Token | Sensen Gao et.al. | 2408.13896 | null |
2024-08-28 | SimpleSpeech 2: Towards Simple and Efficient Text-to-Speech with Flow-based Scalar Latent Transformer Diffusion Models | Dongchao Yang et.al. | 2408.13893 | null |
2024-08-25 | Particle-Filtering-based Latent Diffusion for Inverse Problems | Amir Nazemi et.al. | 2408.13868 | null |
2024-08-25 | Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching | Minghao Liu et.al. | 2408.13858 | null |
2024-08-25 | Bring the Power of Diffusion Model to Defect Detection | Xuyi Yu et.al. | 2408.13845 | null |
2024-08-25 | Prior Learning in Introspective VAEs | Ioannis Athanasiadis et.al. | 2408.13805 | null |
2024-08-25 | 3D-VirtFusion: Synthetic 3D Data Augmentation through Generative Diffusion Models and Controllable Editing | Shichao Dong et.al. | 2408.13788 | null |
2024-08-25 | SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting | Wenrui Li et.al. | 2408.13711 | link |
2024-08-25 | Guided and Fused: Efficient Frozen CLIP-ViT with Feature Guidance and Multi-Stage Feature Fusion for Generalizable Deepfake Detection | Yingjian Chen et.al. | 2408.13697 | null |
2024-08-24 | GenCA: A Text-conditioned Generative Model for Realistic and Drivable Codec Avatars | Keqiang Sun et.al. | 2408.13674 | null |
2024-08-27 | Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing | Yitong Yang et.al. | 2408.13623 | null |
2024-08-28 | DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation | Ying Jin et.al. | 2408.13509 | link |
2024-08-24 | Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model | Chen Rao et.al. | 2408.13459 | link |
2024-08-24 | Explainable Concept Generation through Vision-Language Preference Learning | Aditya Taparia et.al. | 2408.13438 | null |
2024-09-02 | Training-free Long Video Generation with Chain of Diffusion Model Experts | Wenhao Li et.al. | 2408.13423 | null |
2024-08-24 | TVG: A Training-free Transition Video Generation Method with Diffusion Models | Rui Zhang et.al. | 2408.13413 | null |
2024-08-23 | Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing | Yangyang Xu et.al. | 2408.13395 | null |
2024-08-23 | Shape-Preserving Generation of Food Images for Automatic Dietary Assessment | Guangzong Chen et.al. | 2408.13358 | null |
2024-08-23 | Latent Space Disentanglement in Diffusion Transformers Enables Zero-shot Fine-grained Semantic Editing | Zitao Shuai et.al. | 2408.13335 | null |
2024-08-23 | Abstract Art Interpretation Using ControlNet | Rishabh Srivastava et.al. | 2408.13287 | link |
2024-08-23 | How Diffusion Models Learn to Factorize and Compose | Qiyao Liang et.al. | 2408.13256 | null |
2024-08-23 | CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities | Tao Wu et.al. | 2408.13239 | link |
2024-08-26 | Focus on Neighbors and Know the Whole: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation | Bonan Li et.al. | 2408.13149 | null |
2024-08-23 | Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning | Jihwan Oh et.al. | 2408.13092 | null |
2024-08-23 | General Intelligent Imaging and Uncertainty Quantification by Deterministic Diffusion Model | Weiru Fan et.al. | 2408.13061 | null |
2024-08-23 | Atlas Gaussians Diffusion for 3D Generation with Infinite Number of Points | Haitao Yang et.al. | 2408.13055 | null |
2024-08-23 | G3FA: Geometry-guided GAN for Face Animation | Alireza Javanmardi et.al. | 2408.13049 | null |
2024-08-23 | Adaptive complexity of log-concave sampling | Huanjian Zhou et.al. | 2408.13045 | null |
2024-08-23 | EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation | Cong Wang et.al. | 2408.13005 | null |
2024-09-01 | Controllable Financial Market Generation with Diffusion Guided Meta Agent | Yu-Hao Huang et.al. | 2408.12991 | null |
2024-08-23 | What Do You Want? User-centric Prompt Generation for Text-to-image Synthesis via Multi-turn Guidance | Yilun Liu et.al. | 2408.12910 | link |
2024-08-23 | When Diffusion MRI Meets Diffusion Model: A Novel Deep Generative Model for Diffusion MRI Generation | Xi Zhu et.al. | 2408.12897 | null |
2024-08-22 | Generating Realistic X-ray Scattering Images Using Stable Diffusion and Human-in-the-loop Annotations | Zhuowen Zhao et.al. | 2408.12720 | link |
2024-08-22 | Unlocking Intrinsic Fairness in Stable Diffusion | Eunji Kim et.al. | 2408.12692 | null |
2024-08-22 | Generative Diffusion Model-based Downscaling of Observed Sea Surface Height over Kuroshio Extension since 2000 | Qiuchang Han et.al. | 2408.12632 | null |
2024-08-09 | Quantum Generative Adversarial Networks: Generating and Detecting Quantum Product States | James E. Steck et.al. | 2408.12620 | null |
2024-08-31 | xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations | Can Qin et.al. | 2408.12590 | null |
2024-08-22 | Real-Time Video Generation with Pyramid Attention Broadcast | Xuanlei Zhao et.al. | 2408.12588 | link |
2024-08-22 | ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation | Lujia Zhong et.al. | 2408.12561 | link |
2024-09-11 | Show-o: One Single Transformer to Unify Multimodal Understanding and Generation | Jinheng Xie et.al. | 2408.12528 | null |
2024-08-22 | FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing | Jue Wang et.al. | 2408.12429 | link |
2024-08-22 | 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment | Kaihui Cheng et.al. | 2408.12419 | null |
2024-08-22 | CODE: Confident Ordinary Differential Editing | Bastien van Delft et.al. | 2408.12418 | link |
2024-09-04 | Dynamic PDB: A New Dataset and a SE(3) Model Extension by Integrating Dynamic Behaviors and Physical Properties in Protein Structures | Ce Liu et.al. | 2408.12413 | null |
2024-08-22 | Dynamic Product Image Generation and Recommendation at Scale for Personalized E-commerce | Ádám Tibor Czapp et.al. | 2408.12392 | null |
2024-08-22 | LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation | Shihao Chen et.al. | 2408.12354 | null |
2024-08-23 | GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections | Shiyue Zhang et.al. | 2408.12352 | null |
2024-08-22 | Variance reduction of diffusion model’s gradients with Taylor approximation-based control variate | Paul Jeha et.al. | 2408.12270 | null |
2024-08-22 | Scalable Autoregressive Image Generation with Mamba | Haopeng Li et.al. | 2408.12245 | link |
2024-08-22 | MedDiT: A Knowledge-Controlled Diffusion Transformer Framework for Dynamic Medical Image Generation in Virtual Simulated Patient | Yanzeng Li et.al. | 2408.12236 | null |
2024-08-22 | BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking | Hanzheng Wang et.al. | 2408.12232 | null |
2024-08-22 | DimeRec: A Unified Framework for Enhanced Sequential Recommendation via Generative Diffusion Models | Wuchao Li et.al. | 2408.12153 | null |
2024-08-22 | An evidence-accumulating drift-diffusion model of competing information spread on networks | Julien Corsin et.al. | 2408.12127 | null |
2024-08-22 | ZipGait: Bridging Skeleton and Silhouette with Diffusion Model for Advancing Gait Recognition | Fanxu Min et.al. | 2408.12111 | null |
2024-08-22 | Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation | Woo Kyung Kim et.al. | 2408.12110 | null |
2024-08-22 | Spin relaxation in graphite due to spin-orbital-phonon interaction from first-principles density-matrix approach | Junqing Xu et.al. | 2408.12054 | null |
2024-08-21 | CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion | Yunlong Tang et.al. | 2408.12009 | null |
2024-08-21 | Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models | Chun-Yen Shih et.al. | 2408.11810 | null |
2024-08-21 | Approaching Deep Learning through the Spectral Dynamics of Weights | David Yunis et.al. | 2408.11804 | link |
2024-08-21 | DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework | Zhifei Xie et.al. | 2408.11788 | null |
2024-08-21 | Timeline and Boundary Guided Diffusion Network for Video Shadow Detection | Haipeng Zhou et.al. | 2408.11785 | link |
2024-08-21 | JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet | Yujia Gu et.al. | 2408.11744 | null |
2024-08-21 | Iterative Object Count Optimization for Text-to-image Diffusion Models | Oz Zafar et.al. | 2408.11721 | null |
2024-08-21 | FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting | Liyao Jiang et.al. | 2408.11706 | null |
2024-08-21 | Moderate deviation principles for a reaction diffusion model in non-equilibrium | Linjie Zhao et.al. | 2408.11633 | null |
2024-08-21 | Bayesian inversion for the identification of the doping profile in unipolar semiconductor devices | Leila Taghizadeh et.al. | 2408.11485 | null |
2024-08-21 | TrackGo: A Flexible and Efficient Method for Controllable Video Generation | Haitao Zhou et.al. | 2408.11475 | null |
2024-08-21 | Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection | Jingwei Sun et.al. | 2408.11408 | link |
2024-09-02 | Video Diffusion Models are Strong Video Inpainter | Minhyeok Lee et.al. | 2408.11402 | null |
2024-08-21 | Generative AI based Secure Wireless Sensing for ISAC Networks | Jiacheng Wang et.al. | 2408.11398 | null |
2024-08-21 | Gender Bias Evaluation in Text-to-image Generation: A Survey | Yankun Wu et.al. | 2408.11358 | null |
2024-08-21 | HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model | Yi Wang et.al. | 2408.11357 | null |
2024-08-21 | UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Xiangyu Zhao et.al. | 2408.11305 | link |
2024-08-21 | Taming Generative Diffusion for Universal Blind Image Restoration | Siwei Tu et.al. | 2408.11287 | null |
2024-08-20 | Compress Guidance in Conditional Diffusion Sampling | Anh-Dung Dinh et.al. | 2408.11194 | null |
2024-08-20 | MS $^3$ D: A RG Flow-Based Regularization for GAN Training with Limited Data | Jian Wang et.al. | 2408.11135 | null |
2024-08-18 | DiffZOO: A Purely Query-Based Black-Box Attack for Red-teaming Text-to-Image Generative Model via Zeroth Order Optimization | Pucheng Dang et.al. | 2408.11071 | null |
2024-08-20 | Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model | Chunting Zhou et.al. | 2408.11039 | null |
2024-09-07 | MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning | Haoning Wu et.al. | 2408.11001 | link |
2024-08-20 | GreediRIS: Scalable Influence Maximization using Distributed Streaming Maximum Cover | Reet Barik et.al. | 2408.10982 | null |
2024-08-20 | Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling | Jaideep Pathak et.al. | 2408.10958 | null |
2024-08-20 | Large Point-to-Gaussian Model for Image-to-3D Generation | Longfei Lu et.al. | 2408.10935 | null |
2024-09-02 | A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse | Zhongliang Guo et.al. | 2408.10901 | null |
2024-08-26 | Perception-guided Jailbreak against Text-to-Image Models | Yihao Huang et.al. | 2408.10848 | null |
2024-09-01 | Harmonizing Attention: Training-free Texture-aware Geometry Transfer | Eito Ikuta et.al. | 2408.10846 | null |
2024-08-20 | Hedging in Jump Diffusion Model with Transaction Costs | Hamidreza Maleki Almani et.al. | 2408.10785 | null |
2024-08-20 | Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation | Md Fahim Sikder et.al. | 2408.10755 | null |
2024-08-20 | Iterative Window Mean Filter: Thwarting Diffusion-based Adversarial Purification | Hanrui Wang et.al. | 2408.10673 | null |
2024-08-20 | TextMastero: Mastering High-Quality Scene Text Editing in Diverse Languages and Styles | Tong Wang et.al. | 2408.10623 | null |
2024-08-20 | Novel Change Detection Framework in Remote Sensing Imagery Using Diffusion Models and Structural Similarity Index (SSIM) | Andrew Kiruluta et.al. | 2408.10619 | null |
2024-08-21 | MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration | Yanbo Ding et.al. | 2408.10605 | link |
2024-08-30 | Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models | Cong Wan et.al. | 2408.10571 | link |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-20 | Generative Diffusion Models for High Dimensional Channel Estimation | Xingyu Zhou et.al. | 2408.10501 | null |
2024-08-19 | Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | Liu He et.al. | 2408.10453 | null |
2024-08-19 | The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks | Niyar R Barman et.al. | 2408.10446 | null |
2024-09-04 | SDE-based Multiplicative Noise Removal | An Vuong et.al. | 2408.10283 | link |
2024-08-16 | Diffusion Model for Planning: A Systematic Literature Review | Toshihide Ubukata et.al. | 2408.10266 | null |
2024-08-21 | NeRF-US: Removing Ultrasound Imaging Artifacts from Neural Radiance Fields in the Wild | Rishit Dagli et.al. | 2408.10258 | null |
2024-08-05 | AltCanvas: A Tile-Based Image Editor with Generative AI for Blind or Visually Impaired People | Seonghee Lee et.al. | 2408.10240 | null |
2024-08-19 | MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model | Minghua Liu et.al. | 2408.10198 | null |
2024-08-19 | SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views | Chao Xu et.al. | 2408.10195 | null |
2024-08-19 | Geometry Informed Tokenization of Molecules for Language Model Generation | Xiner Li et.al. | 2408.10120 | null |
2024-08-19 | Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data | Tao Yang et.al. | 2408.10119 | null |
2024-08-19 | General Impedance Modeling for Modular Multilevel Converter with Grid-forming and Grid-following Control | Chu Sun et.al. | 2408.10017 | null |
2024-08-19 | Multi-layer diffusion model of photovoltaic installations | Tomasz Weron et.al. | 2408.09904 | null |
2024-08-19 | Instruction-Based Molecular Graph Generation with Unified Text-Graph Diffusion Model | Yuran Xiang et.al. | 2408.09896 | link |
2024-08-23 | SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models | Danush Kumar Venkatesh et.al. | 2408.09822 | link |
2024-08-19 | Latent Diffusion for Guided Document Table Generation | Syed Jawwad Haider Hamdani et.al. | 2408.09800 | null |
2024-08-19 | Unsupervised Composable Representations for Audio | Giovanni Bindi et.al. | 2408.09792 | link |
2024-08-19 | Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation | Yunxin Li et.al. | 2408.09787 | link |
2024-08-19 | Propagating the prior from shallow to deep with a pre-trained velocity-model Generative Transformer network | Randy Harsuko et.al. | 2408.09767 | null |
2024-08-19 | RealCustom++: Representing Images as Real-Word for Real-Time Customization | Zhendong Mao et.al. | 2408.09744 | null |
2024-08-19 | TraDiffusion: Trajectory-Based Training-Free Image Generation | Mingrui Wu et.al. | 2408.09739 | link |
2024-08-21 | Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning | Zhi Qiao et.al. | 2408.09731 | null |
2024-08-19 | Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering | Ruofan Liang et.al. | 2408.09702 | null |
2024-08-19 | ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement | Eashan Adhikarla et.al. | 2408.09650 | link |
2024-08-18 | Moonshine: Distilling Game Content Generators into Steerable Generative Models | Yuhe Nie et.al. | 2408.09594 | null |
2024-08-18 | AnomalyFactory: Regard Anomaly Generation as Unsupervised Anomaly Localization | Ying Zhao et.al. | 2408.09533 | null |
2024-08-18 | Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning | Zhiwei Xu et.al. | 2408.09501 | null |
2024-08-18 | Deformation-aware GAN for Medical Image Synthesis with Substantially Misaligned Pairs | Bowen Xin et.al. | 2408.09432 | null |
2024-08-18 | FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model | Ziyu Yao et.al. | 2408.09384 | null |
2024-08-28 | SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama | Jing Tang et.al. | 2408.09333 | link |
2024-08-18 | Unpaired Volumetric Harmonization of Brain MRI with Conditional Latent Diffusion | Mengqi Wu et.al. | 2408.09315 | null |
2024-08-20 | MagicID: Flexible ID Fidelity Generation System | Zhaoli Deng et.al. | 2408.09248 | null |
2024-08-17 | RepControlNet: ControlNet Reparameterization | Zhaoli Deng et.al. | 2408.09240 | null |
2024-08-17 | Are CLIP features all you need for Universal Synthetic Image Origin Attribution? | Dario Cioni et.al. | 2408.09153 | link |
2024-08-17 | Realistic Extreme Image Rescaling via Generative Latent Space Learning | Ce Wang et.al. | 2408.09151 | link |
2024-08-27 | Barbie: Text to Barbie-Style 3D Avatars | Xiaokun Sun et.al. | 2408.09126 | link |
2024-08-17 | Fragment-Masked Molecular Optimization | Kun Li et.al. | 2408.09106 | null |
2024-08-16 | Efficient Autoregressive Audio Modeling via Next-Scale Prediction | Kai Qiu et.al. | 2408.09027 | link |
2024-08-23 | Classifier-Free Guidance is a Predictor-Corrector | Arwen Bradley et.al. | 2408.09000 | null |
2024-08-21 | MR Optimized Reconstruction of Simultaneous Multi-Slice Imaging Using Diffusion Model | Ting Zhao et.al. | 2408.08883 | null |
2024-08-16 | PFDiff: Training-free Acceleration of Diffusion Models through the Gradient Guidance of Past and Future | Guangyi Wang et.al. | 2408.08822 | null |
2024-08-16 | Comparative Analysis of Generative Models: Enhancing Image Synthesis with VAEs, GANs, and Stable Diffusion | Sanchayan Vivekananthan et.al. | 2408.08751 | null |
2024-08-16 | An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation | Peiming Guo et.al. | 2408.08650 | link |
2024-08-16 | Modeling the Neonatal Brain Development Using Implicit Neural Representations | Florentin Bieder et.al. | 2408.08647 | link |
2024-08-16 | Sampling effects on Lasso estimation of drift functions in high-dimensional diffusion processes | Chiara Amorino et.al. | 2408.08638 | null |
2024-08-16 | Generative Dataset Distillation Based on Diffusion Model | Duo Su et.al. | 2408.08610 | link |
2024-08-16 | RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction | Xiucheng Wang et.al. | 2408.08593 | link |
2024-08-22 | A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth | Yujia Gu et.al. | 2408.08561 | null |
2024-08-16 | Linear combinations of latents in diffusion models: interpolation and beyond | Erik Bodin et.al. | 2408.08558 | null |
2024-08-16 | Inverse design with conditional cascaded diffusion models | Milad Habibi et.al. | 2408.08526 | null |
2024-08-16 | Visual-Friendly Concept Protection via Selective Adversarial Perturbations | Xiaoyue Mi et.al. | 2408.08518 | link |
2024-08-16 | Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness | Hefei Mei et.al. | 2408.08502 | link |
2024-08-16 | Achieving Complex Image Edits via Function Aggregation with Diffusion Models | Mohammadreza Samadi et.al. | 2408.08495 | null |
2024-08-21 | JPEG-LM: LLMs as Image Generators with Canonical Codec Representations | Xiaochuang Han et.al. | 2408.08459 | null |
2024-08-15 | Scalable Computation of $\mathcal{H}_\infty$ Energy Functions for Polynomial Drift Nonlinear Systems | Nicholas A. Corbin et.al. | 2408.08387 | link |
2024-08-15 | CT4D: Consistent Text-to-4D Generation with Animatable Meshes | Ce Chen et.al. | 2408.08342 | null |
2024-08-15 | METR: Image Watermarking with Large Number of Unique Messages | Alexander Varlamov et.al. | 2408.08340 | link |
2024-08-14 | TurboEdit: Instant text-based image editing | Zongze Wu et.al. | 2408.08332 | null |
2024-07-31 | Segment Anything for Videos: A Systematic Survey | Chunhui Zhang et.al. | 2408.08315 | link |
2024-08-15 | Can Large Language Models Understand Symbolic Graphics Programs? | Zeju Qiu et.al. | 2408.08313 | null |
2024-08-15 | Accelerated Image-Aware Generative Diffusion Modeling | Tanmay Asthana et.al. | 2408.08306 | null |
2024-08-15 | Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding | Xiner Li et.al. | 2408.08252 | link |
2024-08-16 | FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance | Jiasong Feng et.al. | 2408.08189 | null |
2024-08-15 | Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion | Adi Haviv et.al. | 2408.08184 | null |
2024-08-15 | General single-loop methods for bilevel parameter learning | Ensio Suonperä et.al. | 2408.08123 | link |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-20 | Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation | Seon-Hoon Kim et.al. | 2408.07947 | link |
2024-08-15 | Robust Offline Active Learning on Graphs | Yuanchen Wu et.al. | 2408.07941 | link |
2024-08-15 | A Novel Generative Artificial Intelligence Method for Interference Study on Multiplex Brightfield Immunohistochemistry Images | Satarupa Mukherjee et.al. | 2408.07860 | null |
2024-08-14 | Moderator: Moderating Text-to-Image Diffusion Models through Fine-grained Context-based Policies | Peiran Wang et.al. | 2408.07728 | link |
2024-08-10 | Pretrained-Guided Conditional Diffusion Models for Microbiome Data Analysis | Xinyuan Shi et.al. | 2408.07709 | link |
2024-08-14 | Boosting Unconstrained Face Recognition with Targeted Style Adversary | Mohammad Saeed Ebrahimi Saadabadi et.al. | 2408.07642 | null |
2024-08-14 | Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding | Bing Hu et.al. | 2408.07636 | null |
2024-08-14 | Anisotropic Diffusion Model of Communication in 2D Biofilm | Yanahan Paramalingam et.al. | 2408.07626 | null |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-14 | DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model | Erez Yosef et.al. | 2408.07541 | null |
2024-08-15 | DIffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution | Yuanbo Zhou et.al. | 2408.07516 | null |
2024-08-14 | DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency | Xiaojing Zhong et.al. | 2408.07481 | null |
2024-08-14 | One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Xiao He et.al. | 2408.07476 | link |
2024-08-14 | Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models | Jean-Marie Lemercier et.al. | 2408.07472 | null |
2024-08-14 | KIND: Knowledge Integration and Diversion in Diffusion Models | Yucheng Xie et.al. | 2408.07337 | null |
2024-08-14 | GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models | Lei Kang et.al. | 2408.07259 | link |
2024-08-13 | Representation-space diffusion models for generating periodic materials | Anshuman Sinha et.al. | 2408.07213 | null |
2024-08-13 | SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis | Yuchen Mao et.al. | 2408.07196 | null |
2024-08-16 | Generative Photomontage | Sean J. Liu et.al. | 2408.07116 | null |
2024-08-15 | MathBridge: A Large Corpus Dataset for Translating Spoken Mathematical Expressions into $LaTeX$ Formulas for Improved Readability | Kyudan Jung et.al. | 2408.07081 | null |
2024-08-13 | Imagen 3 | Imagen-Team-Google et.al. | 2408.07009 | null |
2024-08-13 | Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models | Cheng Chen et.al. | 2408.06995 | null |
2024-08-13 | DCMSA: Multi-Head Self-Attention Mechanism Based on Deformable Convolution For Seismic Data Denoising | Wang Mingwei et.al. | 2408.06963 | null |
2024-08-13 | Definition of multispectral camera system parameters to model the asteroid 2001 SN263 | Gabriela de Carvalho Assis Goulart et.al. | 2408.06886 | null |
2024-08-13 | Diffusion Model for Slate Recommendation | Federico Tomasi et.al. | 2408.06883 | null |
2024-08-13 | Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective | Ouxiang Li et.al. | 2408.06741 | link |
2024-08-18 | DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion | Yujia Wu et.al. | 2408.06740 | null |
2024-08-13 | DiffSG: A Generative Solver for Network Optimization with Diffusion Model | Ruihuai Liang et.al. | 2408.06701 | link |
2024-08-13 | DC3DO: Diffusion Classifier for 3D Objects | Nursena Koprucu et.al. | 2408.06693 | link |
2024-08-13 | Leveraging Priors via Diffusion Bridge for Time Series Generation | Jinseong Park et.al. | 2408.06672 | null |
2024-08-13 | Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models | Chenqian Yan et.al. | 2408.06646 | null |
2024-08-13 | ViMo: Generating Motions from Casual Videos | Liangdong Qiu et.al. | 2408.06614 | null |
2024-08-12 | Prompt Recovery for Image Generation Models: A Comparative Study of Discrete Optimizers | Joshua Nathaniel Williams et.al. | 2408.06502 | null |
2024-08-12 | Synthetic Photography Detection: A Visual Guidance for Identifying Synthetic Images Created by AI | Melanie Mathys et.al. | 2408.06398 | null |
2024-09-01 | The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery | Chris Lu et.al. | 2408.06292 | link |
2024-08-12 | 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) | Jaydeep Rade et.al. | 2408.06244 | null |
2024-08-12 | Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance | Taewon Kang et.al. | 2408.06157 | null |
2024-08-12 | Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models | Ioannis Romanelis et.al. | 2408.06145 | link |
2024-08-12 | CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer | Zhuoyi Yang et.al. | 2408.06072 | link |
2024-08-15 | ControlNeXt: Powerful and Efficient Control for Image and Video Generation | Bohao Peng et.al. | 2408.06070 | link |
2024-08-12 | BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training | Xuanpu Zhang et.al. | 2408.06047 | link |
2024-08-12 | Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models | Haifan Gong et.al. | 2408.05985 | null |
2024-08-12 | UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization | Junjie He et.al. | 2408.05939 | link |
2024-08-12 | Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation | Utkarsh Nath et.al. | 2408.05938 | null |
2024-08-12 | A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models | Taehong Moon et.al. | 2408.05927 | link |
2024-08-12 | Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information | Mingkun Zhang et.al. | 2408.05900 | null |
2024-08-22 | LaWa: Using Latent Space for In-Generation Image Watermarking | Ahmad Rezaei et.al. | 2408.05868 | null |
2024-08-11 | Egocentric Vision Language Planning | Zhirui Fang et.al. | 2408.05802 | null |
2024-08-11 | MTSCI: A Conditional Diffusion Model for Multivariate Time Series Consistent Imputation | Jianping Zhou et.al. | 2408.05740 | link |
2024-08-19 | SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Du Chen et.al. | 2408.05713 | link |
2024-08-11 | TC-KANRecon: High-Quality and Accelerated MRI Reconstruction via Adaptive KAN Mechanisms and Intelligent Feature Scaling | Ruiquan Ge et.al. | 2408.05705 | link |
2024-08-11 | StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model | Ziyin Zhou et.al. | 2408.05669 | link |
2024-08-16 | Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion | Jacob K Christopher et.al. | 2408.05636 | null |
2024-08-10 | Diffusion Model-based Contrastive Learning for Human Activity Recognition | Chunjing Xiao et.al. | 2408.05567 | null |
2024-08-10 | ZePo: Zero-Shot Portrait Stylization with Faster Sampling | Jin Liu et.al. | 2408.05492 | link |
2024-08-10 | ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack | Ziyi Gao et.al. | 2408.05479 | null |
2024-08-20 | Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE | Yiying Yang et.al. | 2408.05477 | null |
2024-08-10 | Artworks Reimagined: Exploring Human-AI Co-Creation through Body Prompting | Jonas Oppenlaender et.al. | 2408.05476 | null |
2024-08-10 | Multimodal generative semantic communication based on latent diffusion model | Weiqi Fu et.al. | 2408.05455 | null |
2024-08-08 | Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness | Stanislav Fort et.al. | 2408.05446 | link |
2024-08-10 | High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model | Weizhi Zhong et.al. | 2408.05416 | null |
2024-08-10 | Style-Preserving Lip Sync via Audio-Aware Style Reference | Weizhi Zhong et.al. | 2408.05412 | null |
2024-08-09 | Multi-Garment Customized Model Generation | Yichen Liu et.al. | 2408.05206 | null |
2024-07-31 | Semantic Successive Refinement: A Generative AI-aided Semantic Communication Framework | Kexin Zhang et.al. | 2408.05112 | null |
2024-07-24 | PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control | Rishubh Parihar et.al. | 2408.05083 | null |
2024-08-09 | Instruction Tuning-free Visual Token Complement for Multimodal LLMs | Dongsheng Wang et.al. | 2408.05019 | null |
2024-08-09 | DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-09 | DAFT-GAN: Dual Affine Transformation Generative Adversarial Network for Text-Guided Image Inpainting | Jihoon Lee et.al. | 2408.04962 | null |
2024-08-09 | TEAdapter: Supply abundant guidance for controllable text-to-music generation | Jialing Zou et.al. | 2408.04865 | link |
2024-08-09 | Adversarially Robust Industrial Anomaly Detection Through Diffusion Model | Yuanpu Cao et.al. | 2408.04839 | null |
2024-08-09 | Next-Generation Wi-Fi Networks with Generative AI: Design and Insights | Jingyu Wang et.al. | 2408.04835 | null |
2024-08-08 | BRAT: Bonus oRthogonAl Token for Architecture Agnostic Textual Inversion | James Baker et.al. | 2408.04785 | link |
2024-08-08 | Deep Learning-based Unsupervised Domain Adaptation via a Unified Model for Prostate Lesion Detection Using Multisite Bi-parametric MRI Datasets | Hao Li et.al. | 2408.04777 | null |
2024-08-08 | Zero-Shot Uncertainty Quantification using Diffusion Probabilistic Models | Dule Shu et.al. | 2408.04718 | null |
2024-08-08 | Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics | Ruining Li et.al. | 2408.04631 | null |
2024-08-08 | Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User’s Casual Sketches | Yongzhi Xu et.al. | 2408.04567 | null |
2024-08-08 | Open-domain Implicit Format Control for Large Language Model Generation | Yiqun Yao et.al. | 2408.04392 | link |
2024-08-21 | Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations | Julen Urain et.al. | 2408.04380 | null |
2024-08-26 | InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting | Xin-Yi Yu et.al. | 2408.04249 | null |
2024-08-08 | LLDif: Diffusion Models for Low-light Emotion Recognition | Zhifeng Wang et.al. | 2408.04235 | null |
2024-08-08 | Connective Viewpoints of Signal-to-Noise Diffusion Models | Khanh Doan et.al. | 2408.04221 | null |
2024-08-08 | Diffusion Guided Language Modeling | Justin Lovelace et.al. | 2408.04220 | link |
2024-08-07 | ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling | William Y. Zhu et.al. | 2408.04102 | link |
2024-07-22 | Prompting for products: Investigating design space exploration strategies for text-to-image generative models | Leah Chong et.al. | 2408.03946 | null |
2024-08-07 | Counterfactuals and Uncertainty-Based Explainable Paradigm for the Automated Detection and Segmentation of Renal Cysts in Computed Tomography Images: A Multi-Center Study | Zohaib Salahuddin et.al. | 2408.03789 | null |
2024-08-07 | Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model | Guoqing Zhu et.al. | 2408.03748 | link |
2024-08-07 | Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling | Zilyu Ye et.al. | 2408.03695 | link |
2024-08-07 | Unsupervised Detection of Fetal Brain Anomalies using Denoising Diffusion Models | Markus Ditlev Sjøgren Olsen et.al. | 2408.03654 | null |
2024-08-07 | TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization | Kien T. Pham et.al. | 2408.03637 | null |
2024-08-22 | Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis | Zebin Yao et.al. | 2408.03632 | link |
2024-08-07 | A comparative study of generative adversarial networks for image recognition algorithms based on deep learning and traditional methods | Yihao Zhong et.al. | 2408.03568 | null |
2024-08-07 | Dirichlet forms of diffusion processes on Thoma simplex | Sergei Korotkikh et.al. | 2408.03553 | null |
2024-08-06 | Hybrid diffusion models: combining supervised and generative pretraining for label-efficient fine-tuning of segmentation models | Bruno Sauvalle et.al. | 2408.03433 | null |
2024-08-06 | Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey | Vu Tuan Truong et.al. | 2408.03400 | null |
2024-08-06 | Adversarial Domain Adaptation for Cross-user Activity Recognition Using Diffusion-based Noise-centred Learning | Xiaozhou Ye et.al. | 2408.03353 | link |
2024-08-06 | MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation | Xiaofeng Mao et.al. | 2408.03312 | null |
2024-08-27 | IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts | Ciara Rowles et.al. | 2408.03209 | null |
2024-08-06 | An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion | Xingguang Yan et.al. | 2408.03178 | null |
2024-08-06 | Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models | Sho Ozaki et.al. | 2408.03156 | null |
2024-07-31 | Closed-loop Diffusion Control of Complex Physical Systems | Long Wei et.al. | 2408.03124 | link |
2024-08-06 | Training-Free Condition Video Diffusion Models for single frame Spatial-Semantic Echocardiogram Synthesis | Van Phi Nguyen et.al. | 2408.03035 | link |
2024-08-06 | Multitask and Multimodal Neural Tuning for Large Models | Hao Sun et.al. | 2408.03001 | null |
2024-08-09 | DreamLCM: Towards High-Quality Text-to-3D Generation via Latent Consistency Model | Yiming Zhong et.al. | 2408.02993 | link |
2024-08-06 | Diffusion Model Meets Non-Exemplar Class-Incremental Learning and Beyond | Jichuan Zhang et.al. | 2408.02983 | null |
2024-08-06 | Data-Driven Stochastic Closure Modeling via Conditional Diffusion Model and Neural Operator | Xinghao Dong et.al. | 2408.02965 | null |
2024-08-06 | Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection | Sen Nie et.al. | 2408.02891 | null |
2024-08-09 | Back-Projection Diffusion: Solving the Wideband Inverse Scattering Problem with Diffusion Models | Borong Zhang et.al. | 2408.02866 | link |
2024-08-05 | Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services | Shaopeng Fu et.al. | 2408.02814 | link |
2024-07-20 | Diffusion Models as Data Mining Tools | Ioannis Siglidis et.al. | 2408.02752 | null |
2024-08-05 | Text Conditioned Symbolic Drumbeat Generation using Latent Diffusion Models | Pushkar Jajoria et.al. | 2408.02711 | null |
2024-08-05 | RCDM: Enabling Robustness for Conditional Diffusion Model | Weifeng Xu et.al. | 2408.02710 | null |
2024-08-19 | Diff-PIC: Revolutionizing Particle-In-Cell Simulation for Advancing Nuclear Fusion with Diffusion Models | Chuan Liu et.al. | 2408.02693 | null |
2024-08-05 | Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining | Dongyang Liu et.al. | 2408.02657 | link |
2024-08-05 | VidGen-1M: A Large-Scale Dataset for Text-to-video Generation | Zhiyu Tan et.al. | 2408.02629 | null |
2024-08-05 | LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba | Yunxiang Fu et.al. | 2408.02615 | link |
2024-08-05 | Generative AI as a Service in 6G Edge-Cloud: Generation Task Offloading by In-context Learning | Hao Zhou et.al. | 2408.02549 | null |
2024-08-05 | Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models | Tongtong Feng et.al. | 2408.02408 | null |
2024-08-05 | A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models | Gen Li et.al. | 2408.02320 | null |
2024-08-05 | Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders | Muhammad Abdullah Jamal et.al. | 2408.02245 | null |
2024-08-05 | REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models | Agneet Chatterjee et.al. | 2408.02231 | null |
2024-08-06 | ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation | Jack Lu et.al. | 2408.02226 | link |
2024-08-04 | PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aoming Liu et.al. | 2408.02157 | null |
2024-08-04 | LDFaceNet: Latent Diffusion-based Network for High-Fidelity Deepfake Generation | Dwij Mehta et.al. | 2408.02078 | null |
2024-07-22 | FDiff-Fusion:Denoising diffusion fusion network based on fuzzy learning for 3D medical image segmentation | Weiping Ding et.al. | 2408.02075 | null |
2024-08-04 | Step Saver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation | Jean Yu et.al. | 2408.02054 | null |
2024-08-04 | Robustness of Watermarking on Text-to-Image Diffusion Models | Xiaodong Wu et.al. | 2408.02035 | null |
2024-08-04 | Faster Diffusion Action Segmentation | Shuaibing Wang et.al. | 2408.02024 | null |
2024-08-04 | AnomalySD: Few-Shot Multi-Class Anomaly Detection with Stable Diffusion Model | Zhenyu Yan et.al. | 2408.01960 | null |
2024-08-04 | Dataset Scale and Societal Consistency Mediate Facial Impression Bias in Vision-Language AI | Robert Wolfe et.al. | 2408.01959 | null |
2024-08-04 | Why Perturbing Symbolic Music is Necessary: Fitting the Distribution of Never-used Notes through a Joint Probabilistic Diffusion Model | Shipei Liu et.al. | 2408.01950 | null |
2024-08-16 | GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer | Yihong Lin et.al. | 2408.01826 | null |
2024-08-17 | SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm | Junyan Ye et.al. | 2408.01812 | null |
2024-08-03 | Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation | Jintao Tan et.al. | 2408.01732 | null |
2024-08-03 | A Novel Evaluation Framework for Image2Text Generation | Jia-Hong Huang et.al. | 2408.01723 | null |
2024-08-03 | Controllable Unlearning for Image-to-Image Generative Models via $\varepsilon$ -Constrained Optimization | Xiaohua Feng et.al. | 2408.01689 | null |
2024-08-02 | “I don’t see myself represented here at all”: User Experiences of Stable Diffusion Outputs Containing Representational Harms across Gender Identities and Nationalities | Sourojit Ghosh et.al. | 2408.01594 | null |
2024-08-02 | Interpretations, Representations, and Stereotypes of Caste within Text-to-Image Generators | Sourojit Ghosh et.al. | 2408.01590 | null |
2024-08-02 | Conformal Diffusion Models for Individual Treatment Effect Estimation and Inference | Hengrui Cai et.al. | 2408.01582 | null |
2024-08-02 | NeuralFactors: A Novel Factor Learning Approach to Generative Modeling of Equities | Achintya Gopal et.al. | 2408.01499 | null |
2024-07-18 | SUSTechGAN: Image Generation for Object Recognition in Adverse Conditions of Autonomous Driving | Gongjin Lan et.al. | 2408.01430 | link |
2024-08-02 | Conditional LoRA Parameter Generation | Xiaolong Jin et.al. | 2408.01415 | null |
2024-08-02 | TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling | Dong Huo et.al. | 2408.01291 | null |
2024-08-02 | A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness | Lutao Jiang et.al. | 2408.01269 | null |
2024-08-13 | CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models | Kushal Kumar Jain et.al. | 2408.01233 | null |
2024-08-02 | VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling | Qian Zhang et.al. | 2408.01181 | link |
2024-08-02 | PINNs for Medical Image Analysis: A Survey | Chayan Banerjee et.al. | 2408.01026 | null |
2024-08-02 | EIUP: A Training-Free Approach to Erase Non-Compliant Concepts Conditioned on Implicit Unsafe Prompts | Die Chen et.al. | 2408.01014 | null |
2024-08-06 | FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation | Xiang Gao et.al. | 2408.00998 | link |
2024-08-05 | CIResDiff: A Clinically-Informed Residual Diffusion Model for Predicting Idiopathic Pulmonary Fibrosis Progression | Caiwen Jiang et.al. | 2408.00938 | null |
2024-08-01 | Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang et.al. | 2408.00766 | null |
2024-08-01 | Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention | Susung Hong et.al. | 2408.00760 | link |
2024-08-01 | TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models | Gilad Deutch et.al. | 2408.00735 | null |
2024-08-01 | MotionFix: Text-Driven 3D Human Motion Editing | Nikos Athanasiou et.al. | 2408.00712 | null |
2024-08-01 | Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function | Matias Oscar Volman Stern et.al. | 2408.00707 | null |
2024-08-01 | Evaluation Metrics and Methods for Generative Models in the Wireless PHY Layer | Michael Baur et.al. | 2408.00634 | null |
2024-08-01 | Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model | Felipe Mahlow et.al. | 2408.00544 | null |
2024-08-01 | Jailbreaking Text-to-Image Models with LLM-Based Agents | Yingkai Dong et.al. | 2408.00523 | null |
2024-08-01 | A new approach for encoding code and assisting code understanding | Mengdan Fan et.al. | 2408.00521 | null |
2024-08-01 | Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion | Manuel Kansy et.al. | 2408.00458 | null |
2024-08-01 | Towards Reliable Advertising Image Generation Using Human Feedback | Zhenbang Du et.al. | 2408.00418 | link |
2024-08-01 | DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving | Xuemeng Yang et.al. | 2408.00415 | null |
2024-08-13 | Deepfake Media Forensics: State of the Art and Challenges Ahead | Irene Amerini et.al. | 2408.00388 | null |
2024-08-01 | On the Limitations and Prospects of Machine Unlearning for Generative AI | Shiji Zhou et.al. | 2408.00376 | null |
2024-08-01 | Few-shot Defect Image Generation based on Consistency Modeling | Qingfeng Shi et.al. | 2408.00372 | link |
2024-08-01 | DiM-Gesture: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 framework | Fan Zhang et.al. | 2408.00370 | null |
2024-08-01 | Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion | Honglei Miao et.al. | 2408.00352 | null |
2024-08-01 | A Simple Background Augmentation Method for Object Detection with Diffusion Model | Yuhang Li et.al. | 2408.00350 | null |
2024-08-01 | ADBM: Adversarial diffusion bridge model for reliable adversarial purification | Xiao Li et.al. | 2408.00315 | null |
2024-08-01 | Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection | Jiacheng Deng et.al. | 2408.00286 | null |
2024-08-01 | Navigating Text-to-Image Generative Bias across Indic Languages | Surbhi Mittal et.al. | 2408.00283 | null |
2024-08-05 | Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models | Juntu Zhao et.al. | 2408.00230 | link |
2024-07-31 | Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution | Mridul Khurana et.al. | 2408.00160 | null |
2024-07-31 | Generative Learning of the Solution of Parametric Partial Differential Equations Using Guided Diffusion Models and Virtual Observations | Han Gao et.al. | 2408.00157 | null |
2024-07-31 | WAS: Dataset and Methods for Artistic Text Segmentation | Xudong Xie et.al. | 2408.00106 | link |
2024-07-31 | From Attributes to Natural Language: A Survey and Foresight on Text-based Person Re-identification | Fanzhi Jiang et.al. | 2408.00096 | null |
2024-07-31 | Localized Gaussian Splatting Editing with Contextual Awareness | Hanyuan Xiao et.al. | 2408.00083 | null |
2024-07-31 | Detecting, Explaining, and Mitigating Memorization in Diffusion Models | Yuxin Wen et.al. | 2407.21720 | link |
2024-07-31 | Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Zhenghao Zhang et.al. | 2407.21705 | link |
2024-07-31 | Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification | Xingchen Shi et.al. | 2407.21683 | null |
2024-07-31 | Explainable and Controllable Motion Curve Guided Cardiac Ultrasound Video Generation | Junxuan Yu et.al. | 2407.21490 | null |
2024-07-31 | Fine-gained Zero-shot Video Sampling | Dengsheng Chen et.al. | 2407.21475 | null |
2024-07-31 | Deformable 3D Shape Diffusion Model | Dengsheng Chen et.al. | 2407.21428 | null |
2024-07-31 | Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model | Zhichao Zhang et.al. | 2407.21408 | null |
2024-07-31 | Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging | Wenhua Wu et.al. | 2407.21381 | null |
2024-07-31 | ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images | Xilei Zhu et.al. | 2407.21363 | null |
2024-07-31 | Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models | Jiang Hao et.al. | 2407.21316 | link |
2024-07-31 | State-observation augmented diffusion model for nonlinear assimilation | Zhuoyuan Li et.al. | 2407.21314 | link |
2024-07-31 | DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations | Dongwon Son et.al. | 2407.21267 | null |
2024-07-30 | Informed Correctors for Discrete Diffusion Models | Yixiu Zhao et.al. | 2407.21243 | null |
2024-07-30 | Diffusion-Based Generation of Neural Activity from Disentangled Latent Codes | Jonathan D. McCart et.al. | 2407.21195 | null |
2024-07-30 | Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models | Jack He et.al. | 2407.21159 | null |
2024-07-30 | On the optimal design of a new class of proportional portfolio insurance strategies in a jump-diffusion framework | Katia Colaneri et.al. | 2407.21148 | null |
2024-08-04 | Adding Multimodal Controls to Whole-body Human Motion Generation | Yuxuan Bian et.al. | 2407.21136 | link |
2024-07-17 | Direct Unlearning Optimization for Robust and Safe Text-to-Image Models | Yong-Hyun Park et.al. | 2407.21035 | null |
2024-07-17 | Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion | Sanghyun Kim et.al. | 2407.21032 | null |
2024-07-30 | Matting by Generation | Zhixiang Wang et.al. | 2407.21017 | null |
2024-07-30 | Add-SD: Rational Generation without Manual Reference | Lingfeng Yang et.al. | 2407.21016 | link |
2024-08-18 | GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models | Ali Abdollahi et.al. | 2407.21001 | link |
2024-07-30 | Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks | Yunfeng Diao et.al. | 2407.20836 | null |
2024-07-30 | Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning | Norman Di Palo et.al. | 2407.20798 | null |
2024-07-29 | Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory | Xiaoyan Xing et.al. | 2407.20785 | null |
2024-07-27 | Inverse Problems with Diffusion Models: A MAP Estimation Perspective | Sai bharath chandra Gutha et.al. | 2407.20784 | link |
2024-08-10 | SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models | Zheng Liu et.al. | 2407.20756 | link |
2024-07-30 | Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications | Yi Ju et.al. | 2407.20717 | null |
2024-07-30 | DocXPand-25k: a large and diverse benchmark dataset for identity documents analysis | Julien Lerouge et.al. | 2407.20662 | link |
2024-07-30 | Autonomous Improvement of Instruction Following Skills via Foundation Models | Zhiyuan Zhou et.al. | 2407.20635 | link |
2024-07-30 | EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos | Aashish Rai et.al. | 2407.20592 | null |
2024-07-30 | DiffusionCounterfactuals: Inferring High-dimensional Counterfactuals with Guidance of Causal Representations | Jiageng Zhu et.al. | 2407.20553 | null |
2024-07-29 | Learning Feature-Preserving Portrait Editing from Generated Pairs | Bowei Chen et.al. | 2407.20455 | null |
2024-07-29 | Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities | Lorenzo Baraldi et.al. | 2407.20337 | link |
2024-07-29 | Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception | Konstantinos Tzevelekakis et.al. | 2407.20336 | null |
2024-07-20 | Improving EEG Classification Through Randomly Reassembling Original and Generated Data with Transformer-based Diffusion Models | Mingzhi Chen et.al. | 2407.20253 | null |
2024-07-29 | Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Ekaterina Iakovleva et.al. | 2407.20232 | null |
2024-07-29 | LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework | Zhenqi He et.al. | 2407.20172 | link |
2024-08-18 | Diffusion Feedback Helps CLIP See Better | Wenxuan Wang et.al. | 2407.20171 | link |
2024-07-29 | DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models | Jing Yang et.al. | 2407.20141 | null |
2024-07-29 | Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | Liyuan Mao et.al. | 2407.20109 | null |
2024-07-29 | Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations | Fangyijie Wang et.al. | 2407.20072 | link |
2024-07-29 | MaskInversion: Localized Embeddings via Optimization of Explainability Maps | Walid Bousselham et.al. | 2407.20034 | null |
2024-07-29 | ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning | Delyan Boychev et.al. | 2407.20020 | link |
2024-07-29 | Reproducibility Study of “ITI-GEN: Inclusive Text-to-Image Generation” | Daniel Gallo Fernández et.al. | 2407.19996 | link |
2024-07-29 | MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion | Chencan Fu et.al. | 2407.19976 | null |
2024-07-29 | FedDEO: Description-Enhanced One-Shot Federated Learning with Diffusion Models | Mingzhao Yang et.al. | 2407.19953 | null |
2024-07-29 | FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention | Yu Lu et.al. | 2407.19918 | null |
2024-07-29 | Synthetic Thermal and RGB Videos for Automatic Pain Assessment utilizing a Vision-MLP Architecture | Stefanos Gkikas et.al. | 2407.19811 | null |
2024-07-29 | Map2Traj: Street Map Piloted Zero-shot Trajectory Generation with Diffusion Model | Zhenyu Tao et.al. | 2407.19765 | null |
2024-07-29 | Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular Transformer | Yang Wu et.al. | 2407.19628 | null |
2024-07-30 | Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture | ShahRukh Athar et.al. | 2407.19593 | null |
2024-07-28 | Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle | Zhenyu Tang et.al. | 2407.19548 | null |
2024-08-07 | Temporal Feature Matters: A Framework for Diffusion Model Quantization | Yushi Huang et.al. | 2407.19547 | null |
2024-08-16 | VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary | Hanjun Luo et.al. | 2407.19524 | link |
2024-07-28 | Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models | Nitzan Bitton-Guetta et.al. | 2407.19474 | null |
2024-07-28 | MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability | Buyu Liu et.al. | 2407.19468 | link |
2024-07-28 | White Matter Geometry-Guided Score-Based Diffusion Model for Tissue Microstructure Imputation in Tractography Imaging | Yui Lo et.al. | 2407.19460 | null |
2024-07-28 | FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models | Changgu Chen et.al. | 2407.19453 | link |
2024-08-08 | Perm: A Parametric Representation for Multi-Style 3D Hair Modeling | Chengan He et.al. | 2407.19451 | link |
2024-07-28 | Innovative RIS Prototyping Enhancing Wireless Communication with Real-Time Spot Beam Tracking and OAM Wavefront Manipulation | Yufei Zhao et.al. | 2407.19379 | null |
2024-07-28 | ClickDiff: Click to Induce Semantic Contact Map for Controllable Grasp Generation with Diffusion Models | Peiming Li et.al. | 2407.19370 | link |
2024-07-27 | Radio Frequency Signal based Human Silhouette Segmentation: A Sequential Diffusion Approach | Penghui Wen et.al. | 2407.19244 | link |
2024-07-27 | Faster Image2Video Generation: A Closer Look at CLIP Image Embedding’s Impact on Spatio-Temporal Cross-Attentions | Ashkan Taghipour et.al. | 2407.19205 | null |
2024-07-27 | Data Processing Techniques for Modern Multimodal Models | Yinheng Li et.al. | 2407.19180 | null |
2024-07-26 | VIMs: Virtual Immunohistochemistry Multiplex staining via Text-to-Stain Diffusion Trained on Uniplex Stains | Shikha Dubey et.al. | 2407.19113 | null |
2024-07-26 | UniForensics: Face Forgery Detection via General Facial Representation | Ziyuan Fang et.al. | 2407.19079 | null |
2024-07-26 | ScalingGaussian: Enhancing 3D Content Creation with Generative Gaussian Splatting | Shen Chen et.al. | 2407.19035 | null |
2024-07-12 | Real Face Video Animation Platform | Xiaokai Chen et.al. | 2407.18955 | null |
2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907 | null |
2024-07-26 | Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment | Yuze Zheng et.al. | 2407.18854 | null |
2024-07-26 | Revision of calcium and scandium abundances in Am stars based on NLTE calculations and comparison with diffusion stellar evolution models | L. I. Mashonkina et.al. | 2407.18736 | null |
2024-07-26 | Adversarial Robustification via Text-to-Image Diffusion Models | Daewon Choi et.al. | 2407.18658 | link |
2024-07-26 | How To Segment in 3D Using 2D Models: Automated 3D Segmentation of Prostate Cancer Metastatic Lesions on PET Volumes Using Multi-Angle Maximum Intensity Projections and Diffusion Models | Amirhosein Toosi et.al. | 2407.18555 | link |
2024-07-26 | Answerability Fields: Answerable Location Estimation via Diffusion Models | Daichi Azuma et.al. | 2407.18497 | null |
2024-07-26 | Diffusion-Driven Semantic Communication for Generative Models with Bandwidth Constraints | Lei Guo et.al. | 2407.18468 | null |
2024-07-26 | Lensless fiber endomicroscopic phase imaging with speckle-conditioned diffusion model | Zhaoqing Chen et.al. | 2407.18456 | null |
2024-08-04 | Diffusion-based subsurface multiphysics monitoring and forecasting | Xinquan Huang et.al. | 2407.18426 | null |
2024-07-25 | RegionDrag: Fast Region-Based Image Editing with Diffusion Models | Jingyi Lu et.al. | 2407.18247 | null |
2024-07-25 | VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads | Orest Kupyn et.al. | 2407.18245 | link |
2024-07-25 | Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images | Roberto Di Via et.al. | 2407.18125 | null |
2024-07-25 | AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild | Junho Park et.al. | 2407.18034 | link |
2024-07-25 | Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions | Jan Nikolas Morshuis et.al. | 2407.18026 | link |
2024-07-25 | Self-Supervision Improves Diffusion Models for Tabular Data Imputation | Yixin Liu et.al. | 2407.18013 | link |
2024-07-25 | Lightweight Language-driven Grasp Detection using Conditional Consistency Model | Nghia Nguyen et.al. | 2407.17967 | null |
2024-07-25 | Guided Latent Slot Diffusion for Object-Centric Learning | Krishnakant Singh et.al. | 2407.17929 | null |
2024-07-25 | ReCorD: Reasoning and Correcting Diffusion for HOI Generation | Jian-Yu Jiang-Lin et.al. | 2407.17911 | link |
2024-07-25 | Amortized Posterior Sampling with Diffusion Prior Distillation | Abbas Mammadov et.al. | 2407.17907 | null |
2024-07-25 | Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion | Xiaodan Xing et.al. | 2407.17882 | null |
2024-07-25 | DragText: Rethinking Text Embedding in Point-based Image Editing | Gayoon Choi et.al. | 2407.17843 | link |
2024-07-25 | Mpox Detection Advanced: Rapid Epidemic Response Through Synthetic Data | Yudara Kularathne et.al. | 2407.17762 | null |
2024-07-25 | Multi-physics Simulation Guided Generative Diffusion Models with Applications in Fluid and Heat Dynamics | Naichen Shi et.al. | 2407.17720 | link |
2024-07-24 | Diffusion Models for Multi-Task Generative Modeling | Changyou Chen et.al. | 2407.17571 | null |
2024-07-24 | SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency | Yiming Xie et.al. | 2407.17470 | null |
2024-07-28 | HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation | Zhenzhi Wang et.al. | 2407.17438 | link |
2024-07-24 | CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction | Paul Goyes-Peñafiel et.al. | 2407.17402 | link |
2024-07-24 | ViPer: Visual Personalization of Generative Models via Individual Preference Learning | Sogand Salehi et.al. | 2407.17365 | null |
2024-07-24 | Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation | Yongqi Li et.al. | 2407.17274 | null |
2024-08-12 | LPGen: Enhancing High-Fidelity Landscape Painting Generation through Diffusion Model | Wanggong Yang et.al. | 2407.17229 | null |
2024-07-24 | Unpaired Photo-realistic Image Deraining with Energy-informed Diffusion Model | Yuanbo Wen et.al. | 2407.17193 | null |
2024-07-24 | Generalized Ordinal Priority Approach for Multi-Attribute Decision-Making under Incomplete Preference Information | Renlong Wang et.al. | 2407.17099 | null |
2024-07-24 | MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models | Chunsan Hong et.al. | 2407.17095 | link |
2024-07-24 | Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference | Jian Xu et.al. | 2407.17033 | null |
2024-07-24 | Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model | Lirui Zhao et.al. | 2407.16982 | link |
2024-08-02 | An Adaptive Gradient Regularization Method | Huixiu Jiang et.al. | 2407.16944 | null |
2024-07-24 | Synthetic Trajectory Generation Through Convolutional Neural Networks | Jesse Merhi et.al. | 2407.16938 | link |
2024-07-24 | SAR to Optical Image Translation with Color Supervised Diffusion Model | Xinyu Bai et.al. | 2407.16921 | null |
2024-07-23 | VisMin: Visual Minimal-Change Understanding | Rabiul Awal et.al. | 2407.16772 | null |
2024-07-10 | Applying generative neural networks for fast simulations of the ALICE (CERN) experiment | Maksymilian Wojnar et.al. | 2407.16704 | link |
2024-07-23 | Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Fabio Tosi et.al. | 2407.16698 | link |
2024-07-23 | From Imitation to Refinement – Residual RL for Precise Visual Assembly | Lars Ankile et.al. | 2407.16677 | null |
2024-07-23 | MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence | Canyu Zhao et.al. | 2407.16655 | null |
2024-07-23 | DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models | Zhenyu Xie et.al. | 2407.16511 | null |
2024-07-23 | A rectangular loop interferometer for scalar optical computations and controlled generation of higher-order vector vortex modes using spin-orbit interaction of light | Ram Nandan Kumar et.al. | 2407.16501 | null |
2024-07-23 | MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Youngmin Oh et.al. | 2407.16448 | link |
2024-07-23 | On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models | Deniz Daum et.al. | 2407.16405 | link |
2024-07-24 | Visual Stereotypes of Autism Spectrum in DALL-E, Stable Diffusion, SDXL, and Midjourney | Maciej Wodziński et.al. | 2407.16292 | null |
2024-07-23 | DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors | Zizheng Yan et.al. | 2407.16260 | null |
2024-07-23 | OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person | Ke Sun et.al. | 2407.16224 | null |
2024-07-23 | Diff-Shadow: Global-guided Diffusion Model for Shadow Removal | Jinting Luo et.al. | 2407.16214 | link |
2024-07-23 | CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation | Hajin Shim et.al. | 2407.16193 | null |
2024-07-23 | No Re-Train, More Gain: Upgrading Backbones with Diffusion Model for Few-Shot Segmentation | Shuai Chen et.al. | 2407.16182 | null |
2024-07-23 | Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality | Kyu Ri Park et.al. | 2407.16171 | link |
2024-07-23 | Diffusion Models as Optimizers for Efficient Planning in Offline RL | Renming Huang et.al. | 2407.16142 | link |
2024-07-23 | Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data | Hengyu Fu et.al. | 2407.16134 | null |
2024-07-23 | Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems | Sojin Lee et.al. | 2407.16125 | link |
2024-07-23 | Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos | Jiahe Liu et.al. | 2407.16124 | link |
2024-07-22 | Setting of the Poincaré section for accurately calculating the phase of rhythmic spatiotemporal dynamics | Takahiro Arai et.al. | 2407.16080 | null |
2024-07-22 | The Shadow of Fraud: The Emerging Danger of AI-powered Social Engineering and its Possible Cure | Jingru Yu et.al. | 2407.15912 | null |
2024-07-21 | CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models | Zheng Chong et.al. | 2407.15886 | link |
2024-07-20 | Diff4VS: HIV-inhibiting Molecules Generation with Classifier Guidance Diffusion for Virtual Screening | Jiaqing Lyu et.al. | 2407.15880 | link |
2024-07-10 | Adversarial Attacks and Defenses on Text-to-Image Diffusion Models: A Survey | Chenyu Zhang et.al. | 2407.15861 | link |
2024-07-22 | Artist: Aesthetically Controllable Text-Driven Stylization without Training | Ruixiang Jiang et.al. | 2407.15842 | link |
2024-07-22 | Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget | Vikash Sehwag et.al. | 2407.15811 | link |
2024-07-22 | Diffusion Model Based Resource Allocation Strategy in Ultra-Reliable Wireless Networked Control Systems | Amirhassan Babazadeh Darabi et.al. | 2407.15784 | null |
2024-07-22 | A Hamilton-Jacobi approach to road-field reaction-diffusion models | Christopher Henderson et.al. | 2407.15760 | null |
2024-07-22 | Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond | Silvio Galesso et.al. | 2407.15739 | link |
2024-07-22 | DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design | Zhi Hao Luo et.al. | 2407.15723 | link |
2024-07-22 | Estimating Probability Densities with Transformer and Denoising Diffusion | Henry W. Leung et.al. | 2407.15703 | link |
2024-07-22 | Voltage mapping in subcellular nanodomains using electro-diffusion modeling | Frédéric Paquin-Lefebvre et.al. | 2407.15697 | null |
2024-07-23 | Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models | Xin Ma et.al. | 2407.15642 | link |
2024-07-23 | A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control | Karim Kadry et.al. | 2407.15631 | null |
2024-07-22 | StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation | Nauman Riaz et.al. | 2407.15608 | null |
2024-07-22 | Discrete Flow Matching | Itai Gat et.al. | 2407.15595 | null |
2024-07-22 | SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time | Stanislav Frolov et.al. | 2407.15507 | link |
2024-07-22 | TextureCrop: Enhancing Synthetic Image Detection through Texture-based Cropping | Despina Konstantinidou et.al. | 2407.15500 | link |
2024-08-06 | DiffX: Guide Your Layout to Cross-Modal Generative Modeling | Zeyu Wang et.al. | 2407.15488 | link |
2024-07-22 | A New Perspective on the Diffuse Gamma-Ray Emission Excess | Ensheng Chen et.al. | 2407.15474 | null |
2024-07-22 | Text2Place: Affordance-aware Text Guided Human Placement | Rishubh Parihar et.al. | 2407.15446 | null |
2024-07-22 | A vector-host epidemic model with spatial structure and seasonality | Mingxin Wang et.al. | 2407.15361 | null |
2024-07-31 | Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models | Xiao Liu et.al. | 2407.15328 | link |
2024-07-21 | MedEdit: Counterfactual Diffusion-based Image Editing on Brain MRI | Malek Ben Alaya et.al. | 2407.15270 | null |
2024-07-23 | BIGbench: A Unified Benchmark for Social Bias in Text-to-Image Generative Models Based on Multi-modal LLM | Hanjun Luo et.al. | 2407.15240 | link |
2024-07-21 | Variational Potential Flow: A Novel Probabilistic Framework for Energy-Based Generative Modelling | Junn Yong Loo et.al. | 2407.15238 | null |
2024-07-23 | CGB-DM: Content and Graphic Balance Layout Generation with Transformer-based Diffusion Model | Yu Li et.al. | 2407.15233 | null |
2024-07-21 | Flow as the Cross-Domain Manipulation Interface | Mengda Xu et.al. | 2407.15208 | null |
2024-07-21 | Thermodynamics inconsistencies in cosmological unimodular gravity models | Miguel Cruz et.al. | 2407.15207 | null |
2024-07-21 | HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions | Haiyang Zhou et.al. | 2407.15187 | null |
2024-07-21 | Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu et.al. | 2407.15171 | link |
2024-07-21 | Back-in-Time Diffusion: Unsupervised Detection of Medical Deepfakes | Fred Grabovski et.al. | 2407.15169 | link |
2024-07-21 | The VEP Booster: A Closed-Loop AI System for Visual EEG Biomarker Auto-generation | Junwen Luo et.al. | 2407.15167 | null |
2024-07-21 | Anchored Diffusion for Video Face Reenactment | Idan Kligvasser et.al. | 2407.15153 | null |
2024-07-21 | D $^4$ M: Dataset Distillation via Disentangled Diffusion Model | Duo Su et.al. | 2407.15138 | null |
2024-07-21 | Diffusion Models for Unsupervised Anomaly Detection in Fetal Brain Ultrasound | Hanna Mykula et.al. | 2407.15119 | null |
2024-07-21 | Determining a Time-Varying Potential in Time-Fractional Diffusion from Observation at a Single Point | Siyu Cen et.al. | 2407.15094 | null |
2024-07-21 | LSReGen: Large-Scale Regional Generator via Backward Guidance Framework | Bowen Zhang et.al. | 2407.15066 | null |
2024-07-20 | Improving Citation Text Generation: Overcoming Limitations in Length Control | Biswadip Mandal et.al. | 2407.14997 | null |
2024-07-20 | Non-Reference Quality Assessment for Medical Imaging: Application to Synthetic Brain MRIs | Karl Van Eeden Risager et.al. | 2407.14994 | null |
2024-07-20 | GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation | Jingzhi Gong et.al. | 2407.14982 | link |
2024-07-20 | CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation | Chen Wei et.al. | 2407.14949 | link |
2024-07-20 | Automatic Generation of Fashion Images using Prompting in Generative Machine Learning Models | Georgia Argyrou et.al. | 2407.14944 | link |
2024-07-20 | EidetiCom: A Cross-modal Brain-Computer Semantic Communication Paradigm for Decoding Visual Perception | Linfeng Zheng et.al. | 2407.14936 | null |
2024-07-23 | AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement | Yunlong Lin et.al. | 2407.14900 | null |
2024-07-20 | Latent Pollution Model: The Hidden Carbon Footprint in 3D Image Synthesis | Marvin Seyfarth et.al. | 2407.14892 | null |
2024-07-20 | Text-based Talking Video Editing with Cascaded Conditional Diffusion | Bo Han et.al. | 2407.14841 | null |
2024-08-03 | Do Generative AI Models Output Harm while Representing Non-Western Cultures: Evidence from A Community-Centered Approach | Sourojit Ghosh et.al. | 2407.14779 | null |
2024-07-20 | Difflare: Removing Image Lens Flare with Latent Diffusion Model | Tianwen Zhou et.al. | 2407.14746 | link |
2024-07-20 | FedDM: Enhancing Communication Efficiency and Handling Data Heterogeneity in Federated Diffusion Models | Jayneel Vora et.al. | 2407.14730 | null |
2024-07-20 | $\infty$ -Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions | Minh-Quan Le et.al. | 2407.14709 | null |
2024-07-19 | OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning | Yihang Yao et.al. | 2407.14653 | link |
2024-07-19 | Pumping Iron: How turbulent metal diffusion impacts multiphase galactic outflows | Ulrich P. Steinwandel et.al. | 2407.14599 | null |
2024-07-23 | Are handcrafted filters helpful for attributing AI-generated images? | Jialiang Li et.al. | 2407.14570 | null |
2024-07-19 | DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | Sarah Jabbour et.al. | 2407.14509 | null |
2024-07-19 | T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation | Kaiyue Sun et.al. | 2407.14505 | link |
2024-07-19 | M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models | Seunggeun Chi et.al. | 2407.14502 | null |
2024-07-19 | Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model | Seonghui Min et.al. | 2407.14434 | null |
2024-07-19 | Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models | Hyun-Jic Oh et.al. | 2407.14426 | null |
2024-07-19 | Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations | Decheng Liu et.al. | 2407.14367 | link |
2024-07-19 | As Generative Models Improve, People Adapt Their Prompts | Eaman Jahani et.al. | 2407.14333 | null |
2024-07-19 | Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model | Kun Zhao et.al. | 2407.14326 | null |
2024-07-19 | Time-dependent condensate formation in ultracold atoms with energy-dependent transport coefficients | M. Larsson et.al. | 2407.14307 | null |
2024-07-19 | How to Blend Concepts in Diffusion Models | Giorgio Longari et.al. | 2407.14280 | link |
2024-07-30 | The spatial evolution of economic activities: from theory to estimation | Davide Fiaschi et.al. | 2407.14267 | null |
2024-07-19 | Unlearning Concepts from Text-to-Video Diffusion Models | Shiqi Liu et.al. | 2407.14209 | null |
2024-07-19 | Normative Diffusion Autoencoders: Application to Amyotrophic Lateral Sclerosis | Ayodeji Ijishakin et.al. | 2407.14191 | null |
2024-07-19 | Machine learning emulation of precipitation from km-scale regional climate simulations using a diffusion model | Henry Addison et.al. | 2407.14158 | link |
2024-07-19 | Visual Text Generation in the Wild | Yuanzhi Zhu et.al. | 2407.14138 | link |
2024-07-19 | Stable-Hair: Real-World Hair Transfer via Diffusion Model | Yuxuan Zhang et.al. | 2407.14078 | link |
2024-07-27 | Not All Noises Are Created Equally:Diffusion Noise Selection and Optimization | Zipeng Qi et.al. | 2407.14041 | null |
2024-07-19 | Time Series Generative Learning with Application to Brain Imaging Analysis | Zhenghao Li et.al. | 2407.14003 | null |
2024-07-19 | Decomposed Direct Preference Optimization for Structure-Based Drug Design | Xiwei Cheng et.al. | 2407.13981 | null |
2024-07-19 | PlacidDreamer: Advancing Harmony in Text-to-3D Generation | Shuo Huang et.al. | 2407.13976 | link |
2024-07-25 | Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance | Toan Nguyen et.al. | 2407.13842 | null |
2024-07-25 | Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion | Boyang Deng et.al. | 2407.13759 | null |
2024-07-18 | LogoSticker: Inserting Logos into Diffusion Models for Customized Generation | Mingkang Zhu et.al. | 2407.13752 | null |
2024-07-18 | Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review | Masatoshi Uehara et.al. | 2407.13734 | link |
2024-07-18 | Electrically Controlled Interfacial Charge Transfer Induced Excitons in MoSe2-WSe2 Lateral Heterostructure | Baisali Kundu et.al. | 2407.13724 | null |
2024-07-25 | MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis | Ziming Zhong et.al. | 2407.13675 | link |
2024-07-18 | Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models | Xiaoyu Zhu et.al. | 2407.13642 | null |
2024-07-18 | Training-free Composite Scene Generation for Layout-to-Image Synthesis | Jiaqi Liu et.al. | 2407.13609 | link |
2024-07-18 | EnergyDiff: Universal Time-Series Energy Data Generation using Diffusion Models | Nan Lin et.al. | 2407.13538 | link |
2024-07-18 | All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models | Charumathi Badrinath et.al. | 2407.13449 | link |
2024-07-18 | Movement-based models for abundance data | Ricardo Carrizo Vergara et.al. | 2407.13384 | null |
2024-07-18 | URCDM: Ultra-Resolution Image Synthesis in Histopathology | Sarah Cechnicka et.al. | 2407.13277 | link |
2024-07-18 | Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models | Qiao Li et.al. | 2407.13252 | null |
2024-07-18 | MEDIC: Zero-shot Music Editing with Disentangled Inversion Control | Huadai Liu et.al. | 2407.13220 | null |
2024-07-18 | Multi-sentence Video Grounding for Long Video Generation | Wei Feng et.al. | 2407.13219 | null |
2024-07-19 | Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking | Zhiyuan Ma et.al. | 2407.13188 | null |
2024-07-18 | SpaDiT: Diffusion Transformer for Spatial Gene Expression Prediction using scRNA-seq | Xiaoyu Li et.al. | 2407.13182 | link |
2024-07-18 | Training-Free Large Model Priors for Multiple-in-One Image Restoration | Xuanhua He et.al. | 2407.13181 | null |
2024-07-18 | Image Inpainting Models are Effective Tools for Instruction-guided Image Editing | Xuan Ju et.al. | 2407.13139 | null |
2024-07-18 | FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection | Jianwei Zhao et.al. | 2407.13133 | null |
2024-07-19 | From Principles to Practices: Lessons Learned from Applying Partnership on AI’s (PAI) Synthetic Media Framework to 11 Use Cases | Claire R. Leibowicz et.al. | 2407.13025 | null |
2024-07-17 | Denoising Diffusions in Latent Space for Medical Image Segmentation | Fahim Ahmed Zaman et.al. | 2407.12952 | link |
2024-07-17 | DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion | Huiguo He et.al. | 2407.12899 | null |
2024-07-17 | GeoGuide: Geometric guidance of diffusion models | Mateusz Poleski et.al. | 2407.12889 | link |
2024-07-17 | SMooDi: Stylized Motion Diffusion Model | Lei Zhong et.al. | 2407.12783 | null |
2024-07-20 | VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control | Sherwin Bahmani et.al. | 2407.12781 | null |
2024-07-17 | Hallucination Index: An Image Quality Metric for Generative Reconstruction Models | Matthew Tivnan et.al. | 2407.12780 | null |
2024-07-17 | GroundUp: Rapid Sketch-Based 3D City Massing | Gizem Esra Unlu et.al. | 2407.12739 | null |
2024-07-17 | NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Zhongqun Zhang et.al. | 2407.12727 | null |
2024-07-18 | SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow | Yuanzhi Zhu et.al. | 2407.12718 | link |
2024-08-06 | IMAGDressing-v1: Customizable Virtual Dressing | Fei Shen et.al. | 2407.12705 | link |
2024-07-17 | 4Dynamic: Text-to-4D Generation with Hybrid Priors | Yu-Jie Yuan et.al. | 2407.12684 | null |
2024-07-17 | Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs | Yiqing Shen et.al. | 2407.12678 | link |
2024-07-17 | CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems | Jiankun Zhao et.al. | 2407.12676 | link |
2024-07-17 | Zero-shot Text-guided Infinite Image Synthesis with LLM guidance | Soyeong Kwon et.al. | 2407.12642 | null |
2024-07-17 | VegeDiff: Latent Diffusion Model for Geospatial Vegetation Forecasting | Sijie Zhao et.al. | 2407.12592 | null |
2024-07-17 | Towards Understanding Unsafe Video Generation | Yan Pang et.al. | 2407.12581 | link |
2024-07-17 | The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation | Yi Yao et.al. | 2407.12579 | null |
2024-07-17 | High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion | Juan Song et.al. | 2407.12538 | link |
2024-07-17 | Leveraging the Mahalanobis Distance to enhance Unsupervised Brain MRI Anomaly Detection | Finn Behrendt et.al. | 2407.12474 | link |
2024-07-17 | Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning | Xu-Hui Liu et.al. | 2407.12448 | link |
2024-07-17 | Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models | Chao Gong et.al. | 2407.12383 | link |
2024-07-17 | HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects | Xintao Lv et.al. | 2407.12371 | null |
2024-07-17 | LLM-based query paraphrasing for video search | Jiaxin Wu et.al. | 2407.12341 | null |
2024-07-17 | I2AM: Interpreting Image-to-Image Latent Diffusion Models via Attribution Maps | Junseo Park et.al. | 2407.12331 | null |
2024-07-17 | Label-Efficient 3D Brain Segmentation via Complementary 2D Diffusion Models with Orthogonal Views | Jihoon Cho et.al. | 2407.12329 | null |
2024-07-17 | JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation | Chenhan Jiang et.al. | 2407.12291 | null |
2024-07-17 | Chip Placement with Diffusion | Vint Lee et.al. | 2407.12282 | null |
2024-07-17 | Voltage-Controlled Magnetoelectric Devices for Neuromorphic Diffusion Process | Yang Cheng et.al. | 2407.12261 | null |
2024-07-19 | Parameter Generation of Quantum Approximate Optimization Algorithm with Diffusion Model | Fanxu Meng et.al. | 2407.12242 | null |
2024-07-18 | Towards Dataset-scale and Feature-oriented Evaluation of Text Summarization in Large Language Model Prompts | Sam Yu-Te Lee et.al. | 2407.12192 | null |
2024-07-16 | Beta Sampling is All You Need: Efficient Image Generation Strategy for Diffusion Models using Stepwise Spectral Analysis | Haeil Lee et.al. | 2407.12173 | null |
2024-07-16 | Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning | Yanting Miao et.al. | 2407.12164 | link |
2024-07-16 | Bellman Diffusion Models | Liam Schramm et.al. | 2407.12163 | null |
2024-07-16 | Efficient Training with Denoised Neural Weights | Yifan Gong et.al. | 2407.11966 | null |
2024-07-16 | Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design | Leo Klarner et.al. | 2407.11942 | link |
2024-07-16 | Contrastive Sequential-Diffusion Learning: An approach to Multi-Scene Instructional Video Synthesis | Vasco Ramos et.al. | 2407.11814 | link |
2024-07-16 | Diffusion-driven self-assembly of emerin nanodomains at the nuclear envelope | Carlos D. Alas et.al. | 2407.11758 | null |
2024-07-16 | Mask-guided cross-image attention for zero-shot in-silico histopathologic image generation with a diffusion model | Dominik Winter et.al. | 2407.11664 | null |
2024-08-02 | CCVA-FL: Cross-Client Variations Adaptive Federated Learning for Medical Imaging | Sunny Gupta et.al. | 2407.11652 | null |
2024-07-16 | Scaling Diffusion Transformers to 16 Billion Parameters | Zhengcong Fei et.al. | 2407.11633 | link |
2024-07-16 | DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training | Guillermo Jimenez-Perez et.al. | 2407.11594 | null |
2024-07-17 | QVD: Post-training Quantization for Video Diffusion Models | Shilong Tian et.al. | 2407.11585 | null |
2024-07-17 | UP-Diff: Latent Diffusion Model for Remote Sensing Urban Prediction | Zeyu Wang et.al. | 2407.11578 | link |
2024-07-16 | TGIF: Text-Guided Inpainting Forgery Dataset | Hannes Mareen et.al. | 2407.11566 | link |
2024-07-16 | Self-Guided Generation of Minority Samples Using Diffusion Models | Soobin Um et.al. | 2407.11555 | link |
2024-07-16 | Length-Aware Motion Synthesis via Latent Diffusion | Alessio Sampieri et.al. | 2407.11532 | link |
2024-07-21 | How Control Information Influences Multilingual Text Image Generation and Editing? | Boqiang Zhang et.al. | 2407.11502 | link |
2024-07-16 | Diff-MTS: Temporal-Augmented Conditional Diffusion-based AIGC for Industrial Time Series Towards the Large Model Era | Lei Ren et.al. | 2407.11501 | null |
2024-07-16 | Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction | Guowei Xu et.al. | 2407.11494 | link |
2024-07-16 | AIGC for Industrial Time Series: From Deep Generative Models to Large Generative Models | Lei Ren et.al. | 2407.11480 | null |
2024-07-16 | Isometric Representation Learning for Disentangled Latent Space of Diffusion Models | Jaehoon Hahm et.al. | 2407.11451 | link |
2024-07-16 | Repurformer: Transformers for Repurposing-Aware Molecule Generation | Changhun Lee et.al. | 2407.11439 | null |
2024-07-16 | CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation | Yisen Wang et.al. | 2407.11433 | null |
2024-07-16 | Model Inversion Attacks Through Target-Specific Conditional Diffusion Models | Ouxiang Li et.al. | 2407.11424 | link |
2024-07-16 | TeethDreamer: 3D Teeth Reconstruction from Five Intra-oral Photographs | Chenfan Xu et.al. | 2407.11419 | link |
2024-07-16 | Cover-separable Fixed Neural Network Steganography via Deep Generative Models | Guobiao Li et.al. | 2407.11405 | link |
2024-07-16 | Animate3D: Animating Any 3D Model with Multi-view Video Diffusion | Yanqin Jiang et.al. | 2407.11398 | null |
2024-07-16 | DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation | Jiwook Kim et.al. | 2407.11394 | link |
2024-07-16 | Flatfish Disease Detection Based on Part Segmentation Approach and Disease Image Generation | Seo-Bin Hwang et.al. | 2407.11348 | null |
2024-07-16 | Gaussian Splatting LK | Liuyue Xie et.al. | 2407.11309 | null |
2024-07-16 | Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems | Yaşar Utku Alçalar et.al. | 2407.11288 | null |
2024-07-17 | Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion | Philipp Allgeuer et.al. | 2407.11211 | link |
2024-07-15 | Stationary CT Imaging of Intracranial Hemorrhage with Diffusion Posterior Sampling Reconstruction | Alejandro Lopez-Montes et.al. | 2407.11196 | null |
2024-07-15 | Integrating Amortized Inference with Diffusion Models for Learning Clean Distribution from Corrupted Images | Yifei Wang et.al. | 2407.11162 | null |
2024-07-15 | Discrete generative diffusion models without stochastic differential equations: a tensor network approach | Luke Causer et.al. | 2407.11133 | null |
2024-07-15 | SSSD-ECG-nle: New Label Embeddings with Structured State-Space Models for ECG generation | Sergey Skorik et.al. | 2407.11108 | null |
2024-07-15 | Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion | Yongyuan Liang et.al. | 2407.10973 | null |
2024-07-15 | InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models | Nirat Saini et.al. | 2407.10958 | null |
2024-07-15 | IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation | Yuanhao Zhai et.al. | 2407.10937 | link |
2024-07-15 | OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting | Penglei Gao et.al. | 2407.10923 | null |
2024-07-16 | DataDream: Few-shot Guided Dataset Generation | Jae Myung Kim et.al. | 2407.10910 | link |
2024-07-15 | Optical Diffusion Models for Image Generation | Ilker Oguz et.al. | 2407.10897 | null |
2024-07-15 | R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection | Zheyuan Zhou et.al. | 2407.10862 | null |
2024-07-15 | Physics-Inspired Generative Models in Medical Imaging: A Review | Dennis Hein et.al. | 2407.10856 | null |
2024-07-15 | Generalization Bounds for Contextual Stochastic Optimization using Kernel Regression | Yijie Wang et.al. | 2407.10764 | null |
2024-07-15 | An Autonomous Drone Swarm for Detecting and Tracking Anomalies among Dense Vegetation | Rakesh John Amala Arokia Nathan et.al. | 2407.10754 | null |
2024-07-18 | AccDiffusion: An Accurate Method for Higher-Resolution Image Generation | Zhihang Lin et.al. | 2407.10738 | link |
2024-07-18 | Conditional Guided Generative Diffusion for Particle Accelerator Beam Diagnostics | Alexander Scheinker et.al. | 2407.10693 | null |
2024-07-15 | Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Youngsun Lim et.al. | 2407.10683 | null |
2024-07-15 | Spatio-temporal neural distance fields for conditional generative modeling of the heart | Kristine Sørensen et.al. | 2407.10663 | link |
2024-07-15 | Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction | Lin Zhu et.al. | 2407.10636 | null |
2024-07-15 | WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models | Zijian He et.al. | 2407.10625 | null |
2024-07-15 | InsertDiffusion: Identity Preserving Visualization of Objects through a Training-Free Diffusion Architecture | Phillip Mueller et.al. | 2407.10592 | link |
2024-07-15 | A Survey of Defenses against AI-generated Visual Media: Detection, Disruption, and Authentication | Jingyi Deng et.al. | 2407.10575 | null |
2024-07-15 | Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation | Peng Jin et.al. | 2407.10528 | null |
2024-07-15 | Kinetic Typography Diffusion Model | Seonmi Park et.al. | 2407.10476 | null |
2024-07-17 | GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis | Weizhi Liu et.al. | 2407.10471 | null |
2024-07-15 | LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis | Zhenxiong Tan et.al. | 2407.10468 | link |
2024-07-15 | BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features | Jing Luo et.al. | 2407.10462 | link |
2024-07-15 | DiffStega: Towards Universal Training-Free Coverless Image Steganography with Diffusion Models | Yiwei Yang et.al. | 2407.10459 | link |
2024-07-15 | Melon Fruit Detection and Quality Assessment Using Generative AI-Based Image Data Augmentation | Seungri Yoon et.al. | 2407.10413 | null |
2024-07-15 | Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion | Jian Ma et.al. | 2407.10373 | null |
2024-07-14 | On an age-structured model in moving boundaries: The effects of nonlocal diffusion and harvesting pulse | Haiyan Xu et.al. | 2407.10363 | null |
2024-07-14 | Addressing Class Imbalance and Data Limitations in Advanced Node Semiconductor Defect Inspection: A Generative Approach for SEM Images | Bappaditya Dey et.al. | 2407.10348 | null |
2024-07-14 | Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors | Jae Joong Lee et.al. | 2407.10330 | null |
2024-07-14 | Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models | Qinyu Yang et.al. | 2407.10285 | link |
2024-07-14 | Disrupting Diffusion-based Inpainters with Semantic Digression | Geonho Son et.al. | 2407.10277 | null |
2024-07-14 | 3DEgo: 3D Editing on the Go! | Umar Khalid et.al. | 2407.10102 | null |
2024-07-14 | Transferable 3D Adversarial Shape Completion using Diffusion Models | Xuelong Dai et.al. | 2407.10077 | link |
2024-07-14 | What Appears Appealing May Not be Significant! – A Clinical Perspective of Diffusion Models | Vanshali Sharma et.al. | 2407.10029 | null |
2024-07-13 | Learning Online Scale Transformation for Talking Head Video Generation | Fa-Ting Hong et.al. | 2407.09965 | null |
2024-07-13 | DexGrasp-Diffusion: Diffusion-based Unified Functional Grasp Synthesis Pipeline for Multi-Dexterous Robotic Hands | Zhengshen Zhang et.al. | 2407.09899 | null |
2024-07-13 | Zero-Shot Image Compression with Diffusion-Based Posterior Sampling | Noam Elata et.al. | 2407.09896 | link |
2024-07-13 | Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation | Kangyeol Kim et.al. | 2407.09779 | null |
2024-07-13 | TemporalStory: Enhancing Consistency in Story Visualization using Spatial-Temporal Attention | Sixiao Zheng et.al. | 2407.09774 | link |
2024-07-13 | Prototype Clustered Diffusion Models for Versatile Inverse Problems | Jinghao Zhang et.al. | 2407.09768 | null |
2024-07-12 | Investigating the Interplay of Prioritized Replay and Generalization | Parham Mohammad Panahi et.al. | 2407.09702 | null |
2024-07-12 | Unsupervised Anomaly Detection Using Diffusion Trend Analysis | Eunwoo Kim et.al. | 2407.09578 | null |
2024-07-12 | A new approach to principal-agent problems with volatility control | Alessandro Chiusolo et.al. | 2407.09471 | null |
2024-07-12 | FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 | Georgios Makridis et.al. | 2407.09467 | null |
2024-07-15 | Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees | Alexia Jolicoeur-Martineau et.al. | 2407.09357 | link |
2024-07-12 | PID: Physics-Informed Diffusion Model for Infrared Image Generation | Fangyuan Mao et.al. | 2407.09299 | link |
2024-07-12 | Surgical Text-to-Image Generation | Chinedu Innocent Nwoye et.al. | 2407.09230 | null |
2024-07-12 | Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy | Julian Wyatt et.al. | 2407.09192 | null |
2024-07-29 | DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training | Chen Xin et.al. | 2407.09174 | link |
2024-07-12 | Machine Apophenia: The Kaleidoscopic Generation of Architectural Images | Alexey Tikhonov et.al. | 2407.09172 | null |
2024-07-12 | Inference Optimization of Foundation Models on AI Accelerators | Youngsuk Park et.al. | 2407.09111 | null |
2024-07-12 | Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control | Huayu Chen et.al. | 2407.09024 | link |
2024-07-12 | TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models | Jeongho Kim et.al. | 2407.09012 | null |
2024-07-12 | Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive Training | Yunshu Wu et.al. | 2407.08946 | link |
2024-07-16 | Bora: Biomedical Generalist Video Generation Model | Weixiang Sun et.al. | 2407.08944 | null |
2024-07-12 | LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models | Hai Jiang et.al. | 2407.08939 | link |
2024-07-12 | Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning | Chuang Zhang et.al. | 2407.08914 | null |
2024-07-12 | AirSketch: Generative Motion to Sketch | Hui Xian Grace Lim et.al. | 2407.08906 | null |
2024-07-11 | Video Diffusion Alignment via Reward Gradients | Mihir Prabhudesai et.al. | 2407.08737 | link |
2024-07-11 | Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models | Zhening Xing et.al. | 2407.08701 | null |
2024-07-11 | SEED-Story: Multimodal Long Story Generation with Large Language Model | Shuai Yang et.al. | 2407.08683 | link |
2024-07-22 | CAD-Prompted Generative Models: A Pathway to Feasible and Novel Engineering Designs | Leah Chong et.al. | 2407.08675 | null |
2024-07-11 | Still-Moving: Customized Video Generation without Customized Video Data | Hila Chefer et.al. | 2407.08674 | null |
2024-07-11 | Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density | Shuangqi Li et.al. | 2407.08659 | null |
2024-07-13 | Fine-Tuning Stable Diffusion XL for Stylistic Icon Generation: A Comparison of Caption Size | Youssef Sultan et.al. | 2407.08513 | null |
2024-07-20 | Latent Conditional Diffusion-based Data Augmentation for Continuous-Time Dynamic Graph Model | Yuxing Tian et.al. | 2407.08500 | null |
2024-07-11 | A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights | Wentao Lei et.al. | 2407.08428 | link |
2024-07-16 | Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers | Zhengbo Zhang et.al. | 2407.08394 | null |
2024-07-11 | Wind Power Assessment based on Super-Resolution and Downscaling – A Comparison of Deep Learning Methods | Luca Schmidt et.al. | 2407.08259 | null |
2024-07-11 | Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling | Noam Elata et.al. | 2407.08256 | null |
2024-07-11 | E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors | Jinxiu Liang et.al. | 2407.08231 | null |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2024-07-11 | Non-convergence of Adam and other adaptive stochastic gradient descent optimization methods for non-vanishing learning rates | Steffen Dereich et.al. | 2407.08100 | null |
2024-07-10 | Geospecific View Generation – Geometry-Context Aware High-resolution Ground View Inference from Satellite Views | Ningli Xu et.al. | 2407.08061 | null |
2024-07-10 | Coherent and Multi-modality Image Inpainting via Latent Space Optimization | Lingzhi Pan et.al. | 2407.08019 | link |
2024-07-10 | Generative Image as Action Models | Mohit Shridhar et.al. | 2407.07875 | link |
2024-07-10 | Dynamical Measure Transport and Neural PDE Solvers for Sampling | Jingtong Sun et.al. | 2407.07873 | null |
2024-07-10 | Controlling Space and Time with Diffusion Models | Daniel Watson et.al. | 2407.07860 | null |
2024-07-10 | RoBus: A Multimodal Dataset for Controllable Road Networks and Building Layouts Generation | Tao Li et.al. | 2407.07835 | link |
2024-07-10 | Generic Numerical Analysis of Stochastic Reaction Diffusion Model with applications in excitable media | Yahya Alnashri et.al. | 2407.07834 | null |
2024-07-10 | Universal and non-universal signatures in the scaling functions of critical variables | Gianluca Teza et.al. | 2407.07782 | null |
2024-07-10 | StoryDiffusion: How to Support UX Storyboarding With Generative-AI | Zhaohui Liang et.al. | 2407.07672 | null |
2024-07-10 | VEnhancer: Generative Space-Time Enhancement for Video Generation | Jingwen He et.al. | 2407.07667 | null |
2024-07-11 | MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis | Wanggui He et.al. | 2407.07614 | link |
2024-07-11 | Trainable Highly-expressive Activation Functions | Irit Chelly et.al. | 2407.07564 | link |
2024-07-10 | Video-to-Audio Generation with Hidden Alignment | Manjie Xu et.al. | 2407.07464 | null |
2024-07-10 | Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field | Ganlin Yang et.al. | 2407.07461 | null |
2024-07-10 | Secondary Structure-Guided Novel Protein Sequence Generation with Latent Graph Diffusion | Yutong Hu et.al. | 2407.07443 | link |
2024-07-21 | Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis | Jian-Qing Zheng et.al. | 2407.07295 | link |
2024-07-09 | A Very Effective and Simple Diffusion Reconstruction for the Diluted Ising Model | Stefano Bae et.al. | 2407.07266 | null |
2024-07-08 | VIMI: Grounding Video Generation through Multi-modal Instruction | Yuwei Fang et.al. | 2407.06304 | null |
2024-07-02 | OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation | Kepan Nan et.al. | 2407.02371 | null |
2024-06-29 | SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix | Peng Dai et.al. | 2407.00367 | null |
2024-06-27 | What Matters in Detecting AI-Generated Videos like Sora? | Chirui Chang et.al. | 2406.19568 | null |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524 | null |
2024-06-24 | FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models | Haonan Qiu et.al. | 2406.16863 | link |
2024-06-24 | Dreamitate: Real-World Visuomotor Policy Learning via Video Generation | Junbang Liang et.al. | 2406.16862 | null |
2024-06-24 | Video-Infinity: Distributed Long Video Generation | Zhenxiong Tan et.al. | 2406.16260 | null |
2024-06-22 | Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model | Min Zhao et.al. | 2406.15735 | null |
2024-06-20 | ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning | Zhongjie Duan et.al. | 2406.14130 | link |
2024-06-19 | ARDuP: Active Region Video Diffusion for Universal Policies | Shuaiyi Huang et.al. | 2406.13301 | null |
2024-07-07 | Vid3D: Synthesis of Dynamic 3D Scenes using 2D Video Diffusion | Rishab Parthasarathy et.al. | 2406.11196 | link |
2024-06-16 | ViD-GPT: Introducing GPT-style Autoregressive Generation in Video Diffusion Models | Kaifeng Gao et.al. | 2406.10981 | link |
2024-06-14 | Training-free Camera Control for Video Generation | Chen Hou et.al. | 2406.10126 | null |
2024-06-13 | Turns Out I’m Not Real: Towards Robust Detection of AI-Generated Videos | Qingyuan Liu et.al. | 2406.09601 | null |
2024-06-12 | Vivid-ZOO: Multi-View Video Generation with Diffusion Model | Bing Li et.al. | 2406.08659 | null |
2024-06-12 | Diffusion-Promoted HDR Video Reconstruction | Yuanshen Guan et.al. | 2406.08204 | null |
2024-06-11 | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Heng Yu et.al. | 2406.07472 | null |
2024-06-27 | AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising | Zigeng Chen et.al. | 2406.06911 | link |
2024-06-11 | Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation | Yuanhao Zhai et.al. | 2406.06890 | link |
2024-04-23 | Interactive Generation of Laparoscopic Videos with Diffusion Models | Ivan Iliash et.al. | 2406.06537 | null |
2024-06-10 | AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction | Zhen Xing et.al. | 2406.06465 | null |
2024-06-28 | MotionClone: Training-Free Motion Cloning for Controllable Video Generation | Pengyang Ling et.al. | 2406.05338 | link |
2024-06-07 | CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion | Xingrui Wang et.al. | 2406.05082 | null |
2024-06-07 | Online Continual Learning of Video Diffusion Models From a Single Video Stream | Jason Yoo et.al. | 2406.04814 | null |
2024-06-07 | Evaluating and Mitigating IP Infringement in Visual Generative AI | Zhenting Wang et.al. | 2406.04662 | link |
2024-06-11 | Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion | Fangfu Liu et.al. | 2406.04338 | null |
2024-06-06 | SF-V: Single Forward Video Generation Model | Zhixing Zhang et.al. | 2406.04324 | link |
2024-06-05 | Searching Priors Makes Text-to-Video Synthesis Better | Haoran Cheng et.al. | 2406.03215 | null |
2024-06-04 | Neural Representations of Dynamic Visual Stimuli | Jacob Yeung et.al. | 2406.02659 | null |
2024-06-06 | Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting | Inkyu Shin et.al. | 2406.02541 | null |
2024-06-04 | CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation | Dejia Xu et.al. | 2406.02509 | null |
2024-06-04 | I4VGen: Image as Stepping Stone for Text-to-Video Generation | Xiefan Guo et.al. | 2406.02230 | null |
2024-06-03 | Turning Text and Imagery into Captivating Visual Video | Mingming Wang et.al. | 2406.01851 | null |
2024-06-04 | Learning Temporally Consistent Video Depth from Video Diffusion Priors | Jiahao Shao et.al. | 2406.01493 | null |
2024-06-03 | DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors | Tianyu Huang et.al. | 2406.01476 | link |
2024-06-03 | UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation | Xiang Wang et.al. | 2406.01188 | null |
2024-06-03 | ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation | Shaoshu Yang et.al. | 2406.00908 | link |
2024-05-31 | SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model | Zhengang Li et.al. | 2406.00195 | null |
2024-05-31 | Bootstrap3D: Improving 3D Content Creation with Synthetic Data | Zeyi Sun et.al. | 2406.00093 | null |
2024-05-31 | 4Diffusion: Multi-view Video Diffusion Model for 4D Generation | Haiyu Zhang et.al. | 2405.20674 | null |
2024-05-30 | VividDream: Generating 3D Scene with Ambient Dynamics | Yao-Chih Lee et.al. | 2405.20334 | null |
2024-07-11 | MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model | Muyao Niu et.al. | 2405.20222 | link |
2024-05-30 | MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models | Lukas Uzolas et.al. | 2405.20155 | null |
2024-05-30 | Streaming Video Diffusion: Online Video Editing with Diffusion Models | Feng Chen et.al. | 2405.19726 | link |
2024-05-28 | RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives | Jaehong Yoon et.al. | 2405.18406 | link |
2024-05-28 | VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation | Qilin Wang et.al. | 2405.18156 | null |
2024-05-28 | EG4D: Explicit Generation of 4D Object without Score Distillation | Qi Sun et.al. | 2405.18132 | link |
2024-05-27 | RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance | Jiaojiao Fan et.al. | 2405.17661 | null |
2024-05-27 | Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control | Zhengfei Kuang et.al. | 2405.17414 | null |
2024-05-28 | Controllable Longer Image Animation with Diffusion Models | Qiang Wang et.al. | 2405.17306 | null |
2024-05-27 | Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models | Qian Wang et.al. | 2405.16947 | link |
2024-05-26 | Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models | Hanwen Liang et.al. | 2405.16645 | null |
2024-05-26 | I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models | Wenqi Ouyang et.al. | 2405.16537 | null |
2024-05-24 | NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer | Meng You et.al. | 2405.15364 | link |
2024-05-23 | Video Diffusion Models are Training-free Motion Interpreter and Controller | Zeqi Xiao et.al. | 2405.14864 | null |
2024-05-22 | MotionCraft: Physics-based Zero-Shot Video Generation | Luca Savant Aira et.al. | 2405.13557 | link |
2024-05-22 | Enhanced Creativity and Ideation through Stable Video Synthesis | Elijah Miller et.al. | 2405.13357 | null |
2024-05-06 | Video Diffusion Models: A Survey | Andrew Melnik et.al. | 2405.03150 | link |
2024-04-30 | Semantically Consistent Video Inpainting with Conditional Diffusion Models | Dylan Green et.al. | 2405.00251 | null |
2024-04-25 | TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models | Haomiao Ni et.al. | 2404.16306 | link |
2024-06-02 | X-Ray: A Sequential 3D Representation For Generation | Tao Hu et.al. | 2404.14329 | link |
2024-04-19 | ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model | Dingming Liu et.al. | 2404.12903 | null |
2024-04-18 | AniClipart: Clipart Animation with Text-to-Video Priors | Ronghuan Wu et.al. | 2404.12347 | null |
2024-04-18 | Dynamic Typography: Bringing Text to Life via Video Diffusion Prior | Zichen Liu et.al. | 2404.11614 | null |
2024-05-24 | Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model | Han Lin et.al. | 2404.09967 | null |
2024-04-08 | Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models | Saman Motamed et.al. | 2404.05519 | null |
2024-04-02 | Upsample Guidance: Scale Up Diffusion Models without Training | Juno Hwang et.al. | 2404.01709 | null |
2024-03-29 | Motion Inversion for Video Customization | Luozhou Wang et.al. | 2403.20193 | null |
2024-03-28 | Frame by Familiar Frame: Understanding Replication in Video Diffusion Models | Aimon Rahman et.al. | 2403.19593 | null |
2024-03-26 | Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields | Rüveyda Yilmaz et.al. | 2403.17808 | link |
2024-03-25 | TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models | Zhongwei Zhang et.al. | 2403.17005 | null |
2024-03-22 | Spectral Motion Alignment for Video Motion Transfer using Diffusion Models | Geon Yeong Park et.al. | 2403.15249 | null |
2024-03-22 | STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians | Yifei Zeng et.al. | 2403.14939 | null |
2024-03-21 | StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text | Roberto Henschel et.al. | 2403.14773 | link |
2024-03-21 | Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition | Sihyun Yu et.al. | 2403.14148 | null |
2024-03-20 | TimeRewind: Rewinding Time with Image-and-Events Video Diffusion | Jingxi Chen et.al. | 2403.13800 | null |
2024-07-06 | Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation | Zixin Zhu et.al. | 2403.12042 | link |
2024-07-18 | VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models | Junlin Han et.al. | 2403.12034 | null |
2024-03-18 | SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion | Vikram Voleti et.al. | 2403.12008 | null |
2024-07-15 | DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing | Hyeonho Jeong et.al. | 2403.12002 | null |
2024-03-18 | EchoReel: Enhancing Action Generation of Existing Video Diffusion Models | Jianzhi liu et.al. | 2403.11535 | link |
2024-03-13 | Envision3D: One Image to 3D with Anchor Views Interpolation | Yatian Pang et.al. | 2403.08902 | link |
2024-03-10 | WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs | Deshun Yang et.al. | 2403.07944 | null |
2024-04-02 | SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces | Yuta Oshima et.al. | 2403.07711 | link |
2024-03-11 | V3D: Video Diffusion Models are Effective 3D Generators | Zilong Chen et.al. | 2403.06738 | link |
2024-05-14 | VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models | Wenhao Wang et.al. | 2403.06098 | link |
2024-03-08 | VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models | Yabo Zhang et.al. | 2403.05438 | link |
2024-03-05 | Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation | Weijie Li et.al. | 2403.02827 | null |
2024-03-06 | UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control | Xuweiyi Chen et.al. | 2403.02332 | link |
2024-03-01 | Abductive Ego-View Accident Video Understanding for Safe Driving Perception | Jianwu Fang et.al. | 2403.00436 | null |
2024-02-28 | Context-aware Talking Face Video Generation | Meidai Xuanyuan et.al. | 2402.18092 | null |
2024-02-22 | Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Yixuan Ren et.al. | 2402.14780 | null |
2024-04-03 | Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation | Kihong Kim et.al. | 2402.13729 | null |
2024-02-08 | Animated Stickers: Bringing Stickers to Life with Video Diffusion | David Yan et.al. | 2402.06088 | null |
2024-05-06 | Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion | Shiyuan Yang et.al. | 2402.03162 | null |
2024-02-02 | Boximator: Generating Rich and Controllable Motions for Video Synthesis | Jiawei Wang et.al. | 2402.01566 | null |
2024-02-01 | AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning | Fu-Yun Wang et.al. | 2402.00769 | link |
2024-01-29 | Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval | Dezhao Luo et.al. | 2401.13329 | null |
2024-02-05 | Lumiere: A Space-Time Diffusion Model for Video Generation | Omer Bar-Tal et.al. | 2401.12945 | null |
2024-01-19 | ActAnywhere: Subject-Aware Video Background Generation | Boxiao Pan et.al. | 2401.10822 | null |
2024-01-22 | Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation | Changgu Chen et.al. | 2401.10150 | null |
2024-05-22 | CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | Zhao Wang et.al. | 2401.09962 | null |
2024-01-17 | Vlogger: Make Your Dream A Vlog | Shaobin Zhuang et.al. | 2401.09414 | link |
2024-01-17 | VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models | Haoxin Chen et.al. | 2401.09047 | link |
2024-05-10 | 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model | Qian Wang et.al. | 2401.06578 | null |
2024-01-11 | HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models | Hanzhang Wang et.al. | 2401.05870 | null |
2024-01-09 | MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation | Weimin Wang et.al. | 2401.04468 | null |
2024-01-03 | Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions | David Junhao Zhang et.al. | 2401.01827 | link |
2024-03-17 | 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency | Yuyang Yin et.al. | 2312.17225 | null |
2024-06-10 | DreamGaussian4D: Generative 4D Gaussian Splatting | Jiawei Ren et.al. | 2312.17142 | link |
2024-03-18 | Diffusion Reward: Learning Rewards via Conditional Video Diffusion | Tao Huang et.al. | 2312.14134 | link |
2023-12-19 | InstructVideo: Instructing Video Diffusion Models with Human Feedback | Hangjie Yuan et.al. | 2312.12490 | null |
2023-12-14 | VideoLCM: Video Latent Consistency Model | Xiang Wang et.al. | 2312.09109 | null |
2024-07-25 | FreeInit: Bridging Initialization Gap in Video Diffusion Models | Tianxing Wu et.al. | 2312.07537 | link |
2023-12-11 | Photorealistic Video Generation with Diffusion Models | Agrim Gupta et.al. | 2312.06662 | null |
2024-06-20 | Precipitation Downscaling with Spatiotemporal Video Diffusion | Prakhar Srivastava et.al. | 2312.06071 | null |
2023-12-07 | Customizing Motion in Text-to-Video Diffusion Models | Joanna Materzynska et.al. | 2312.04966 | null |
2023-12-07 | DreamVideo: Composing Your Dream Videos with Customized Subject and Motion | Yujie Wei et.al. | 2312.04433 | link |
2024-07-16 | MEVG: Multi-event Video Generation with Text-to-Video Models | Gyeongrok Oh et.al. | 2312.04086 | null |
2023-12-06 | AnimateZero: Video Diffusion Models are Zero-Shot Image Animators | Jiwen Yu et.al. | 2312.03793 | link |
2023-12-12 | DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance | Cong Wang et.al. | 2312.03018 | null |
2024-04-09 | BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models | Fengyuan Shi et.al. | 2312.02813 | link |
2024-07-22 | DragVideo: Interactive Drag-style Video Editing | Yufan Deng et.al. | 2312.02216 | link |
2023-12-03 | Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models | Shengqu Cai et.al. | 2312.01409 | null |
2023-12-03 | ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models | Jeong-gi Kwak et.al. | 2312.01305 | null |
2023-12-01 | VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models | Hyeonho Jeong et.al. | 2312.00845 | link |
2024-03-20 | TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models | Pengxiang Li et.al. | 2312.00651 | null |
2024-07-15 | MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing | Haoyu Zhao et.al. | 2311.17338 | link |
2023-12-03 | Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer | Danah Yatim et.al. | 2311.17009 | null |
2023-11-28 | SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models | Yuwei Guo et.al. | 2311.16933 | null |
2024-05-07 | A Unified Approach for Text- and Image-guided 4D Scene Generation | Yufeng Zheng et.al. | 2311.16854 | null |
2023-11-27 | MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model | Zhongcong Xu et.al. | 2311.16498 | null |
2023-11-26 | Flow-Guided Diffusion for Video Inpainting | Bohai Gu et.al. | 2311.15368 | link |
2023-11-25 | Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets | Andreas Blattmann et.al. | 2311.15127 | link |
2024-02-19 | Animate124: Animating One Image to 4D Dynamic Scene | Yuyang Zhao et.al. | 2311.14603 | null |
2023-11-21 | Breathing Life Into Sketches Using Text-to-Video Priors | Rinon Gal et.al. | 2311.13608 | null |
2023-12-04 | AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance | Zuozhuo Dai et.al. | 2311.12886 | link |
2023-11-02 | Infusion: Internal Diffusion for Video Inpainting | Nicolas Cherel et.al. | 2311.01090 | link |
2023-11-06 | SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction | Xinyuan Chen et.al. | 2310.20700 | null |
2024-01-30 | FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling | Haonan Qiu et.al. | 2310.15169 | link |
2023-11-27 | DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors | Jinbo Xing et.al. | 2310.12190 | link |
2023-10-16 | LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation | Ruiqi Wu et.al. | 2310.10769 | link |
2023-10-16 | A Survey on Video Diffusion Models | Zhen Xing et.al. | 2310.10647 | link |
2023-10-12 | MotionDirector: Motion Customization of Text-to-Video Diffusion Models | Rui Zhao et.al. | 2310.08465 | link |
2023-10-11 | Echocardiography video synthesis from end diastolic semantic map via diffusion model | Phi Nguyen Van et.al. | 2310.07131 | null |
2024-05-04 | LLM-grounded Video Diffusion Models | Long Lian et.al. | 2309.17444 | null |
2023-10-17 | Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation | David Junhao Zhang et.al. | 2309.15818 | link |
2023-09-21 | Compositional Foundation Models for Hierarchical Planning | Anurag Ajay et.al. | 2309.08587 | null |
2024-03-19 | Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs | Hao Fei et.al. | 2308.13812 | null |
2023-08-03 | VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet | Zhihao Hu et.al. | 2307.14073 | null |
2024-05-03 | DORSal: Diffusion for Object-centric Representations of Scenes et al | Allan Jabri et.al. | 2306.08068 | null |
2023-06-05 | Video Diffusion Models with Local-Global Context Guidance | Siyuan Yang et.al. | 2306.02562 | link |
2023-06-02 | Probabilistic Adaptation of Text-to-Video Models | Mengjiao Yang et.al. | 2306.01872 | null |
2023-05-31 | Inverse-design of nonlinear mechanical metamaterials via video denoising diffusion models | Jan-Hendrik Bastek et.al. | 2305.19836 | link |
2023-05-29 | Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising | Fu-Yun Wang et.al. | 2305.18264 | link |
2023-10-11 | VDT: General-purpose Video Diffusion Transformers via Mask Modeling | Haoyu Lu et.al. | 2305.13311 | link |
2024-03-26 | Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models | Songwei Ge et.al. | 2305.10474 | null |
2024-02-19 | Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts | Yuyang Zhao et.al. | 2305.08850 | null |
2023-05-09 | Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer | Nisha Huang et.al. | 2305.05464 | link |
2023-04-18 | Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation | Jie An et.al. | 2304.08477 | null |
2024-02-21 | Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis | Hadrien Reynaud et.al. | 2303.12644 | link |
2023-03-30 | Video Probabilistic Diffusion Models in Projected Latent Space | Sihyun Yu et.al. | 2302.07685 | null |
2023-02-06 | Structure and Content-Guided Video Synthesis with Diffusion Models | Patrick Esser et.al. | 2302.03011 | null |
2023-02-02 | Dreamix: Video Diffusion Models are General Video Editors | Eyal Molad et.al. | 2302.01329 | null |
2022-12-06 | Neural Cell Video Synthesis via Optical-Flow Diffusion | Manuel Serna-Aguilera et.al. | 2212.03250 | null |
2023-03-20 | Latent Video Diffusion Models for High-Fidelity Long Video Generation | Yingqing He et.al. | 2211.13221 | link |
2023-05-11 | MagicVideo: Efficient Video Generation With Latent Diffusion Models | Daquan Zhou et.al. | 2211.11018 | null |
2023-07-24 | Broken Neural Scaling Laws | Ethan Caballero et.al. | 2210.14891 | link |
2022-10-05 | Imagen Video: High Definition Video Generation with Diffusion Models | Jonathan Ho et.al. | 2210.02303 | null |
2022-11-14 | Diffusion Models for Video Prediction and Infilling | Tobias Höppe et.al. | 2206.07696 | link |
2022-10-12 | MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation | Vikram Voleti et.al. | 2205.09853 | link |
2022-06-22 | Video Diffusion Models | Jonathan Ho et.al. | 2204.03458 | null |
2022-12-08 | Diffusion Probabilistic Modeling for Video Generation | Ruihan Yang et.al. | 2203.09481 | link |