🎓 Automatically Update CV Papers Daily using GitHub Actions (Updates Every 12 Hours)

DZRRRRRR/cv-arxiv-daily

[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]

Updated on 2025.01.18

Usage instructions: here

Table of Contents
  1. Diffusion
  2. Implicit
  3. UnderWater
  4. image_dehaze
  5. Restoration
  6. Image_Translation

Diffusion

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2025-01-16 | SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces | Sumit Chaturvedi et.al. | 2501.09756 | null |
| 2025-01-16 | Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps | Nanye Ma et.al. | 2501.09732 | null |
| 2025-01-16 | Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review | Masatoshi Uehara et.al. | 2501.09685 | null |
| 2025-01-16 | Pruning for Sparse Diffusion Models based on Gradient Flow | Ben Wan et.al. | 2501.09464 | null |
| 2025-01-16 | CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation | Hwan Heo et.al. | 2501.09433 | null |
| 2025-01-16 | Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse | Guangyuan Liu et.al. | 2501.09391 | null |
| 2025-01-16 | UVRM: A Scalable 3D Reconstruction Model from Unposed Videos | Shiu-hong Kao et.al. | 2501.09347 | null |
| 2025-01-16 | Domain-conditioned and Temporal-guided Diffusion Modeling for Accelerated Dynamic MRI Reconstruction | Liping Zhang et.al. | 2501.09305 | null |
| 2025-01-16 | Text Semantics to Flexible Design: A Residential Layout Generation Method Based on Stable Diffusion Model | Zijin Qiu et.al. | 2501.09279 | null |
| 2025-01-16 | PATCHEDSERVE: A Patch Management Framework for SLO-Optimized Hybrid Resolution Diffusion Serving | Desen Sun et.al. | 2501.09253 | null |
| 2025-01-15 | SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation | Aditya Bhat et.al. | 2501.09008 | null |
| 2025-01-15 | RepVideo: Rethinking Cross-Layer Representation for Video Generation | Chenyang Si et.al. | 2501.08994 | null |
| 2025-01-15 | Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution | Shao-Hao Lu et.al. | 2501.08819 | link |
| 2025-01-15 | Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models | Zerui Tao et.al. | 2501.08727 | null |
| 2025-01-15 | FlexiClip: Locality-Preserving Free-Form Character Animation | Anant Khandelwal et.al. | 2501.08676 | null |
| 2025-01-15 | TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis | Bailiang Jian et.al. | 2501.08667 | null |
| 2025-01-15 | Product of Gaussian Mixture Diffusion Model for non-linear MRI Inversion | Laurenz Nagler et.al. | 2501.08662 | null |
| 2025-01-15 | Joint Learning of Depth and Appearance for Portrait Image Animation | Xinya Ji et.al. | 2501.08649 | null |
| 2025-01-15 | Watermarking in Diffusion Model: Gaussian Shading with Exact Diffusion Inversion via Coupled Transformations (EDICT) | Krishna Panthi et.al. | 2501.08604 | null |
| 2025-01-15 | DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors | Runqi Wang et.al. | 2501.08553 | null |
| 2025-01-14 | DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models | Hyeonwoo Kim et.al. | 2501.08333 | null |
| 2025-01-14 | MangaNinja: Line Art Colorization with Precise Reference Following | Zhiheng Liu et.al. | 2501.08332 | null |
| 2025-01-14 | Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise | Ryan Burgert et.al. | 2501.08331 | link |
| 2025-01-14 | GameFactory: Creating New Games with Generative Interactive Videos | Jiwen Yu et.al. | 2501.08325 | null |
| 2025-01-14 | Diffusion Adversarial Post-Training for One-Step Video Generation | Shanchuan Lin et.al. | 2501.08316 | null |
| 2025-01-14 | LayerAnimate: Layer-specific Control for Animation | Yuxue Yang et.al. | 2501.08295 | null |
| 2025-01-14 | Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints | Jonathan Nöther et.al. | 2501.08246 | null |
| 2025-01-14 | FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors | Yabo Zhang et.al. | 2501.08225 | link |
| 2025-01-14 | D$^2$-DPM: Dual Denoising for Quantized Diffusion Probabilistic Models | Qian Zeng et.al. | 2501.08180 | link |
| 2025-01-14 | Decision Transformers for RIS-Assisted Systems with Diffusion Model-Based Channel Acquisition | Jie Zhang et.al. | 2501.08007 | null |
| 2025-01-13 | Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss | Xinyu Zhang et.al. | 2501.07563 | null |
| 2025-01-13 | Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection | Shiman Zhang et.al. | 2501.07533 | link |
| 2025-01-13 | IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion | Tharun Anand et.al. | 2501.07530 | null |
| 2025-01-13 | PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations | Ting-Yu Dai et.al. | 2501.07447 | null |
| 2025-01-13 | Diff-Ensembler: Learning to Ensemble 2D Diffusion Models for Volume-to-Volume Medical Image Translation | Xiyue Zhu et.al. | 2501.07430 | null |
| 2025-01-13 | OCORD: Open-Campus Object Removal Dataset | Shuo Zhang et.al. | 2501.07397 | null |
| 2025-01-13 | Bigger Isn't Always Better: Towards a General Prior for Medical Image Reconstruction | Lukas Glaszner et.al. | 2501.07376 | null |
| 2025-01-13 | Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion | Li Liang et.al. | 2501.07260 | link |
| 2025-01-13 | D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation | Zhejun Zhang et.al. | 2501.07077 | link |
| 2025-01-13 | Erasing Noise in Signal Detection with Diffusion Model: From Theory to Application | Xiucheng Wang et.al. | 2501.07030 | null |
| 2025-01-10 | From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training | Julius Berner et.al. | 2501.06148 | link |
| 2025-01-10 | Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction | Cecilia Curreli et.al. | 2501.06035 | null |
| 2025-01-10 | CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control | Stefan Popov et.al. | 2501.06006 | null |
| 2025-01-10 | Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion | Michal Švento et.al. | 2501.05959 | null |
| 2025-01-10 | Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation | Minxing Luo et.al. | 2501.05892 | null |
| 2025-01-10 | Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models | Sofia Jamil et.al. | 2501.05839 | link |
| 2025-01-10 | Diffusion Models for Smarter UAVs: Decision-Making and Modeling | Yousef Emami et.al. | 2501.05819 | null |
| 2025-01-10 | Alignment without Over-optimization: Training-Free Solution for Diffusion Models | Sunwoo Kim et.al. | 2501.05803 | link |
| 2025-01-10 | Conditional Diffusion Model for Electrical Impedance Tomography | Duanpeng Shi et.al. | 2501.05769 | null |
| 2025-01-10 | StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation | Shangjin Zhai et.al. | 2501.05763 | null |
| 2025-01-09 | Decentralized Diffusion Models | David McAllister et.al. | 2501.05450 | null |
| 2025-01-09 | Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces | Aniruddha Mahapatra et.al. | 2501.05442 | null |
| 2025-01-09 | The GAN is dead; long live the GAN! A Modern GAN Baseline | Yiwen Huang et.al. | 2501.05441 | link |
| 2025-01-09 | Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation | Xuyi Meng et.al. | 2501.05427 | null |
| 2025-01-09 | TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts | Yu-Hao Huang et.al. | 2501.05403 | null |
| 2025-01-09 | Accelerated Diffusion Models via Speculative Sampling | Valentin De Bortoli et.al. | 2501.05370 | null |
| 2025-01-09 | CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models | Junha Park et.al. | 2501.05359 | null |
| 2025-01-09 | Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes | Ludwic Leonard et.al. | 2501.05226 | null |
| 2025-01-09 | FaceMe: Robust Blind Face Restoration with Personal Identification | Siyu Liu et.al. | 2501.05177 | null |
| 2025-01-09 | EquiBoost: An Equivariant Boosting Approach to Molecular Conformation Generation | Yixuan Yang et.al. | 2501.05109 | null |
| 2025-01-08 | EditAR: Unified Conditional Generation with Autoregressive Models | Jiteng Mu et.al. | 2501.04699 | null |
| 2025-01-08 | ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning | Yuzhou Huang et.al. | 2501.04698 | null |
| 2025-01-08 | SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images | Zixuan Huang et.al. | 2501.04689 | null |
| 2025-01-08 | A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI | Kazusato Oko et.al. | 2501.04641 | link |
| 2025-01-08 | Disentangled Clothed Avatar Generation with Layered Representation | Weitian Zhang et.al. | 2501.04631 | null |
| 2025-01-09 | MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation | Daniele Molino et.al. | 2501.04614 | null |
| 2025-01-08 | Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion | Yangfan He et.al. | 2501.04606 | link |
| 2025-01-08 | ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training | Xinfa Zhu et.al. | 2501.04416 | null |
| 2025-01-08 | Edit as You See: Image-guided Video Editing via Masked Motion Modeling | Zhi-Lin Huang et.al. | 2501.04325 | null |
| 2025-01-08 | DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models | Hyogon Ryu et.al. | 2501.04304 | null |
| 2025-01-07 | NeuralSVG: An Implicit Representation for Text-to-Vector Generation | Sagi Polaczek et.al. | 2501.03992 | null |
| 2025-01-07 | Stabilising effect of generic anomalous diffusion independent of the Rayleigh number | Antonio Barletta et.al. | 2501.03990 | null |
| 2025-01-07 | A precise asymptotic analysis of learning diffusion models: theory and insights | Hugo Cui et.al. | 2501.03937 | link |
| 2025-01-07 | Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers | Yuechen Zhang et.al. | 2501.03931 | link |
| 2025-01-07 | Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control | Zekai Gu et.al. | 2501.03847 | link |
| 2025-01-07 | Impact of diffusion mechanisms on persistence and spreading | Nathanaël Boutillon et.al. | 2501.03816 | null |
| 2025-01-07 | Mixing by Internal Gravity Waves in Stars: Assessing Numerical Simulations Against Theory | Jack Morton et.al. | 2501.03796 | null |
| 2025-01-07 | Exploring Molecule Generation Using Latent Space Graph Diffusion | Prashanth Pombala et.al. | 2501.03696 | link |
| 2025-01-07 | MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer | Junsheng Luan et.al. | 2501.03630 | null |
| 2025-01-07 | FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis | Xiaojiao Xiao et.al. | 2501.03526 | link |
| 2025-01-06 | MObI: Multimodal Object Inpainting Using Diffusion Models | Alexandru Buburuzan et.al. | 2501.03173 | null |
| 2025-01-06 | Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches | Alhassan Mumuni et.al. | 2501.03151 | null |
| 2025-01-06 | DDRM-PR: Fourier Phase Retrieval using Denoising Diffusion Restoration Models | Mehmet Onurcan Kaya et.al. | 2501.03030 | link |
| 2025-01-06 | STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution | Rui Xie et.al. | 2501.02976 | null |
| 2025-01-07 | SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | Jiawei Liu et.al. | 2501.02962 | null |
| 2025-01-06 | Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions | Jianhua Pei et.al. | 2501.02928 | null |
| 2025-01-06 | Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis | Thang-Anh-Quan Nguyen et.al. | 2501.02913 | null |
| 2025-01-06 | Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems | Shayan Mohajer Hamidi et.al. | 2501.02880 | null |
| 2025-01-06 | Towards HRTF Personalization using Denoising Diffusion Models | Juan Camilo Albarracín Sánchez et.al. | 2501.02871 | null |
| 2025-01-06 | Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans | Rezkellah Noureddine Khiati et.al. | 2501.02867 | null |
| 2025-01-03 | Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models | Manh Duong Nguyen et.al. | 2501.01932 | link |
| 2025-01-03 | Nonparametric estimation of a factorizable density using diffusion models | Hyeok Kyu Kwon et.al. | 2501.01783 | null |
| 2025-01-03 | Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models | Andrea Matteazzi et.al. | 2501.01761 | null |
| 2025-01-03 | ACE: Anti-Editing Concept Erasure in Text-to-Image Models | Zihao Wang et.al. | 2501.01633 | link |
| 2025-01-03 | Multivariate Time Series Anomaly Detection using DiffGAN Model | Guangqiang Wu et.al. | 2501.01591 | link |
| 2025-01-02 | Denoising Diffused Embeddings: a Generative Approach for Hypergraphs | Shihao Wu et.al. | 2501.01541 | null |
| 2025-01-02 | Object-level Visual Prompts for Compositional Image Generation | Gaurav Parmar et.al. | 2501.01424 | null |
| 2025-01-02 | Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models | Jingfeng Yao et.al. | 2501.01423 | link |
| 2025-01-02 | Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement | Z. Zhang et.al. | 2501.01368 | null |
| 2025-01-03 | Conditional Consistency Guided Image Translation and Enhancement | Amil Bhagat et.al. | 2501.01223 | link |
| 2025-01-02 | Semantics-Guided Diffusion for Deep Joint Source-Channel Coding in Wireless Image Transmission | Maojun Zhang et.al. | 2501.01138 | null |
| 2025-01-02 | EliGen: Entity-Level Controlled Image Generation with Regional Attention | Hong Zhang et.al. | 2501.01097 | link |
| 2025-01-02 | DiffCL: A Diffusion-Based Contrastive Learning Framework with Semantic Alignment for Multimodal Recommendations | Qiya Song et.al. | 2501.01066 | null |
| 2025-01-02 | Optimizing Noise Schedules of Generative Models in High Dimensionss | Santiago Aranguri et.al. | 2501.00988 | null |
| 2025-01-01 | Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model | Omid Saghatchian et.al. | 2501.00946 | link |
| 2025-01-01 | Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion | Hao Wang et.al. | 2501.00944 | null |
| 2025-01-02 | Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation | Yuanbo Yang et.al. | 2412.21117 | null |
| 2024-12-30 | Quantum Diffusion Model for Quark and Gluon Jet Generation | Mariia Baidachna et.al. | 2412.21082 | link |
| 2025-01-02 | Edicho: Consistent Image Editing in the Wild | Qingyan Bai et.al. | 2412.21079 | link |
| 2024-12-30 | Varformer: Adapting VAR's Generative Prior for Image Restoration | Siyang Wang et.al. | 2412.21063 | link |
| 2024-12-30 | E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models | Zhiyu Tan et.al. | 2412.21044 | null |
| 2024-12-30 | Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration | Wanglong Lu et.al. | 2412.21042 | link |
| 2024-12-30 | AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies | Yibo Wen et.al. | 2412.20984 | null |
| 2024-12-30 | Influence Maximization in Temporal Networks with Persistent and Reactive Behaviors | Aaqib Zahoor et.al. | 2412.20936 | null |
| 2024-12-30 | DDIM sampling for Generative AIBIM, a faster intelligent structural design framework | Zhili He et.al. | 2412.20899 | null |
| 2024-12-30 | VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control | Shaojin Wu et.al. | 2412.20800 | link |
| 2024-12-27 | VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models | Tao Wu et.al. | 2412.19645 | null |
| 2024-12-27 | StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture | Miaomiao Dai et.al. | 2412.19535 | null |
| 2024-12-27 | RobotDiffuse: Motion Planning for Redundant Manipulator based on Diffusion Model | Xiaohan Zhang et.al. | 2412.19500 | link |
| 2024-12-27 | RAIN: Real-time Animation of Infinite Video Stream | Zhilei Shu et.al. | 2412.19489 | null |
| 2024-12-27 | DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Yiyuan Liang et.al. | 2412.19458 | link |
| 2024-12-27 | Multi-scale Latent Point Consistency Models for 3D Shape Generation | Bi'an Du et.al. | 2412.19413 | null |
| 2024-12-27 | A Generalized Einstein Relation for Markovian Friction Coefficients from Molecular Trajectories | J. M. Hall et.al. | 2412.19398 | null |
| 2024-12-26 | 6Diffusion: IPv6 Target Generation Using a Diffusion Model with Global-Local Attention Mechanisms for Internet-wide IPv6 Scanning | Nabo He et.al. | 2412.19243 | null |
| 2024-12-26 | Mask Approximation Net: Merging Feature Extraction and Distribution Learning for Remote Sensing Change Captioning | Dongwei Sun et.al. | 2412.19179 | null |
| 2024-12-26 | Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models | Hyesong Choi et.al. | 2412.19104 | null |
| 2024-12-24 | PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models | Minghao Chen et.al. | 2412.18608 | null |
| 2024-12-24 | DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers | Yuntao Chen et.al. | 2412.18607 | null |
| 2024-12-24 | Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models | Tahira Kazimi et.al. | 2412.18604 | null |
| 2024-12-24 | DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Minghong Cai et.al. | 2412.18597 | link |
| 2024-12-24 | LatentCRF: Continuous CRF for Efficient Latent Diffusion | Kanchana Ranasinghe et.al. | 2412.18596 | null |
| 2024-12-24 | Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation | Anselm Krainovic et.al. | 2412.18584 | null |
| 2024-12-24 | 3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement | Yihang Luo et.al. | 2412.18565 | null |
| 2024-12-24 | Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models | Qice Qin et.al. | 2412.18421 | null |
| 2024-12-24 | Discovery of 2D Materials via Symmetry-Constrained Diffusion Model | Shihang Xu et.al. | 2412.18414 | null |
| 2024-12-24 | FameBias: Embedding Manipulation Bias Attack in Text-to-Image Models | Jaechul Roh et.al. | 2412.18302 | null |
| 2024-12-23 | FaceLift: Single Image to 3D Head with View Generation and GS-LRM | Weijie Lyu et.al. | 2412.17812 | null |
| 2024-12-23 | PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion | Sophia Tang et.al. | 2412.17780 | null |
| 2024-12-23 | The Superposition of Diffusion Models Using the Itô Density Estimator | Marta Skreta et.al. | 2412.17762 | null |
| 2024-12-23 | A Bias-Free Training Paradigm for More General AI-generated Image Detection | Fabrizio Guillaro et.al. | 2412.17671 | null |
| 2024-12-23 | Benchmarking Generative AI Models for Deep Learning Test Input Generation | Maryam et.al. | 2412.17652 | link |
| 2024-12-23 | DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder | Ente Lin et.al. | 2412.17644 | null |
| 2024-12-23 | Retention Score: Quantifying Jailbreak Risks for Vision Language Models | Zaitang Li et.al. | 2412.17544 | null |
| 2024-12-23 | DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak | Hao Wang et.al. | 2412.17522 | null |
| 2024-12-23 | Heterogeneous carrying capacities and global extinction in metapopulations | Jakub Hesoun et.al. | 2412.17461 | null |
| 2024-12-23 | AeroDiT: Diffusion Transformers for Reynolds-Averaged Navier-Stokes Simulations of Airfoil Flows | Hui Xiang et.al. | 2412.17394 | null |
| 2024-12-20 | Personalized Representation from Personalized Generation | Shobhita Sundaram et.al. | 2412.16156 | link |
| 2024-12-20 | Predicting human cooperation: sensitizing drift-diffusion model to interaction and external stimuli | Lucila G. Alvarez-Zuzek et.al. | 2412.16121 | null |
| 2024-12-20 | Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation | Timur Sattarov et.al. | 2412.16083 | null |
| 2024-12-20 | Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy | Shaoyan Pan et.al. | 2412.16050 | null |
| 2024-12-20 | SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation | Jiadong Pan et.al. | 2412.16039 | null |
| 2024-12-20 | Semi-Supervised Adaptation of Diffusion Models for Handwritten Text Generation | Kai Brandenbusch et.al. | 2412.15853 | null |
| 2024-12-20 | Electromagnetic particle-in-cell modeling of an electron cyclotron resonance plasma discharge in hydrogen | D. Eremin et.al. | 2412.15802 | null |
| 2024-12-20 | Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance | Hyunsoo Lee et.al. | 2412.15798 | null |
| 2024-12-20 | Learning Group Interactions and Semantic Intentions for Multi-Object Trajectory Prediction | Mengshi Qi et.al. | 2412.15673 | link |
| 2024-12-20 | BS-LDM: Effective Bone Suppression in High-Resolution Chest X-Ray Images with Conditional Latent Diffusion Models | Yifei Sun et.al. | 2412.15670 | link |
| 2024-12-19 | LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Hanlin Wang et.al. | 2412.15214 | link |
| 2024-12-19 | Flowing from Words to Pixels: A Framework for Cross-Modality Evolution | Qihao Liu et.al. | 2412.15213 | null |
| 2024-12-19 | Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation | Hadi Alzayer et.al. | 2412.15211 | null |
| 2024-12-19 | AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Moayed Haji-Ali et.al. | 2412.15191 | null |
| 2024-12-19 | Tiled Diffusion | Or Madar et.al. | 2412.15185 | null |
| 2024-12-19 | OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization | Jiacheng Zhang et.al. | 2412.15159 | null |
| 2024-12-19 | Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM | Yatai Ji et.al. | 2412.15156 | link |
| 2024-12-19 | Jet: A Modern Transformer-Based Normalizing Flow | Alexander Kolesnikov et.al. | 2412.15129 | null |
| 2024-12-19 | Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion | Zhifei Chen et.al. | 2412.15050 | null |
| 2024-12-19 | DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space | Mang Ning et.al. | 2412.15032 | link |
| 2024-12-18 | AniDoc: Animation Creation Made Easier | Yihao Meng et.al. | 2412.14173 | null |
| 2024-12-19 | E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling | Zhihang Yuan et.al. | 2412.14170 | null |
| 2024-12-18 | Autoregressive Video Generation without Vector Quantization | Haoge Deng et.al. | 2412.14169 | link |
| 2024-12-18 | VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | Runtao Liu et.al. | 2412.14167 | null |
| 2024-12-18 | MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation | Shenhao Zhu et.al. | 2412.14148 | null |
| 2024-12-18 | SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation | Tong Chen et.al. | 2412.14018 | null |
| 2024-12-18 | Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates | Sen Yan et.al. | 2412.13966 | null |
| 2024-12-18 | IDEQ: an improved diffusion model for the TSP | Mickael Basson et.al. | 2412.13858 | null |
| 2024-12-18 | Object Style Diffusion for Generalized Object Detection in Urban Scene | Hao Li et.al. | 2412.13815 | null |
| 2024-12-18 | Text2Relight: Creative Portrait Relighting with Text Guidance | Junuk Cha et.al. | 2412.13734 | null |
| 2024-12-17 | CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models | Gaoyang Zhang et.al. | 2412.13195 | link |
| 2024-12-17 | StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models | Yunzhi Yan et.al. | 2412.13188 | null |
| 2024-12-17 | Move-in-2D: 2D-Conditioned Human Motion Generation | Hsin-Ping Huang et.al. | 2412.13185 | null |
| 2024-12-17 | Prompt Augmentation for Self-supervised Text-guided Image Manipulation | Rumeysa Bodur et.al. | 2412.13081 | null |
| 2024-12-17 | 3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation | Haoshen Wang et.al. | 2412.13059 | null |
| 2024-12-18 | Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance | Wenhao Sun et.al. | 2412.12974 | link |
| 2024-12-17 | ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecasting | Guillaume Couairon et.al. | 2412.12971 | link |
| 2024-12-17 | Generation of cosmic ray trajectories by a Diffusion Model trained on test particles in 3D magnetohydrodynamic turbulence | Johannes Martin et.al. | 2412.12923 | null |
| 2024-12-17 | Unsupervised Region-Based Image Editing of Denoising Diffusion Models | Zixiang Li et.al. | 2412.12912 | null |
| 2024-12-17 | ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction | Zhongjie Duan et.al. | 2412.12888 | link |
| 2024-12-16 | Causal Diffusion Transformers for Generative Modeling | Chaorui Deng et.al. | 2412.12095 | link |
| 2024-12-16 | CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models | Felix Taubner et.al. | 2412.12093 | null |
| 2024-12-16 | Wonderland: Navigating 3D Scenes from a Single Image | Hanwen Liang et.al. | 2412.12091 | null |
| 2024-12-16 | A LoRA is Worth a Thousand Pictures | Chenxi Liu et.al. | 2412.12048 | null |
| 2024-12-16 | The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation | Gilles Mordant et.al. | 2412.12007 | null |
| 2024-12-16 | Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data | Onur Tasar et.al. | 2412.11972 | null |
| 2024-12-16 | ColorFlow: Retrieval-Augmented Image Sequence Colorization | Junhao Zhuang et.al. | 2412.11815 | null |
| 2024-12-16 | InterDyn: Controllable Interactive Dynamics with Video Diffusion Models | Rick Akkerman et.al. | 2412.11785 | null |
| 2024-12-16 | Joint Reconstruction of the Activity and the Attenuation in PET by Diffusion Posterior Sampling: a Feasibility Study | Clémentine Phung-Ngoc et.al. | 2412.11776 | null |
| 2024-12-16 | No More Adam: Learning Rate Scaling at Initialization is All You Need | Minghao Xu et.al. | 2412.11768 | link |
| 2024-12-13 | Towards a foundation model for heavy-ion collision experiments through point cloud diffusion | Manjunath Omana Kuttan et.al. | 2412.10352 | null |
| 2024-12-13 | BrushEdit: All-In-One Image Inpainting and Editing | Yaowei Li et.al. | 2412.10316 | null |
| 2024-12-13 | Coherent 3D Scene Diffusion From a Single RGB Image | Manuel Dahnert et.al. | 2412.10294 | null |
| 2024-12-13 | GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion | Jiapeng Tang et.al. | 2412.10209 | null |
| 2024-12-13 | Efficient Generative Modeling with Residual Vector Quantization-Based Tokens | Jaehyeon Kim et.al. | 2412.10208 | null |
| 2024-12-13 | Simple Guidance Mechanisms for Discrete Diffusion Models | Yair Schiff et.al. | 2412.10193 | link |
| 2024-12-13 | SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models | Hung Nguyen et.al. | 2412.10178 | null |
| 2024-12-13 | The Art of Deception: Color Visual Illusions and Diffusion Models | Alex Gomez-Villa et.al. | 2412.10122 | null |
| 2024-12-13 | SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution | Runyi Hu et.al. | 2412.10049 | null |
| 2024-12-13 | Emergence of complexity in opinion propagation: A reaction-diffusion model | Romain Ducasse et.al. | 2412.10000 | null |
| 2024-12-12 | FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion | Haonan Qiu et.al. | 2412.09626 | null |
| 2024-12-12 | Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors | Yue Feng et.al. | 2412.09625 | null |
| 2024-12-12 | OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation | Weiqi Li et.al. | 2412.09623 | null |
| 2024-12-12 | LoRACLR: Contrastive Adaptation for Customization of Diffusion Models | Enis Simsar et.al. | 2412.09622 | null |
| 2024-12-12 | SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training | Dongting Hu et.al. | 2412.09619 | null |
| 2024-12-12 | EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Zhuofan Zong et.al. | 2412.09618 | null |
| 2024-12-12 | Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG | Kavana Venkatesh et.al. | 2412.09614 | null |
| 2024-12-12 | LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors | Yabo Chen et.al. | 2412.09597 | null |
| 2024-12-12 | Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion | Zexin He et.al. | 2412.09593 | null |
| 2024-12-12 | SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing | Xueting Li et.al. | 2412.09545 | null |
| 2024-12-11 | Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren et.al. | 2412.08642 | null |
| 2024-12-11 | DMin: Scalable Training Data Influence Estimation for Diffusion Models | Huawei Lin et.al. | 2412.08637 | link |
| 2024-12-11 | TryOffAnyone: Tiled Cloth Generation from a Dressed Person | Ioannis Xarchakos et.al. | 2412.08573 | link |
| 2024-12-11 | Learning Flow Fields in Attention for Controllable Person Image Generation | Zijian Zhou et.al. | 2412.08486 | link |
| 2024-12-11 | InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models | Min Hou et.al. | 2412.08480 | link |
| 2024-12-11 | CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis | Mu Zhang et.al. | 2412.08464 | null |
| 2024-12-11 | Reliable Uncertainty Quantification for Fiber Orientation in Composite Molding Processes using Multilevel Polynomial Surrogates | Stjepan Salatovic et.al. | 2412.08459 | null |
| 2024-12-12 | Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views | Songchun Zhang et.al. | 2412.08412 | null |
| 2024-12-11 | Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR3 | Joao Carvalho et.al. | 2412.08398 | null |
| 2024-12-11 | Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion | Jisheng Chu et.al. | 2412.08326 | link |
| 2024-12-10 | Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets | Zhen Liu et.al. | 2412.07775 | null |
| 2024-12-10 | From Slow Bidirectional to Fast Causal Video Generators | Tianwei Yin et.al. | 2412.07772 | null |
| 2024-12-10 | Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds | Xiaoyu Xiang et.al. | 2412.07766 | null |
| 2024-12-10 | Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation | Jingxi Chen et.al. | 2412.07761 | null |
| 2024-12-10 | SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints | Jianhong Bai et.al. | 2412.07760 | link |
| 2024-12-10 | Multi-Shot Character Consistency for Text-to-Video Generation | Yuval Atzmon et.al. | 2412.07750 | null |
| 2024-12-10 | FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models | Tong Wu et.al. | 2412.07674 | null |
| 2024-12-10 | TraSCE: Trajectory Steering for Concept Erasure | Anubhav Jain et.al. | 2412.07658 | link |
| 2024-12-11 | Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model | Jiahua Xu et.al. | 2412.07590 | link |
| 2024-12-10 | DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation | Jianzong Wu et.al. | 2412.07589 | null |
| 2024-12-09 | [MASK] is All You Need | Vincent Tao Hu et.al. | 2412.06787 | link |
| 2024-12-09 | Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation | Ruihan Gao et.al. | 2412.06785 | link |
| 2024-12-09 | Diverse Score Distillation | Yanbo Xu et.al. | 2412.06780 | null |
| 2024-12-09 | Visual Lexicon: Rich Image Features in Language Space | XuDong Wang et.al. | 2412.06774 | null |
| 2024-12-09 | InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention | Howard Zhang et.al. | 2412.06753 | null |
| 2024-12-09 | ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet | Andrei-Robert Alexandrescu et.al. | 2412.06742 | null |
| 2024-12-09 | Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection | Caiyun Xie et.al. | 2412.06727 | link |
| 2024-12-09 | You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale | Baorui Ma et.al. | 2412.06699 | link |
| 2024-12-09 | Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy | Yuxuan Xue et.al. | 2412.06698 | null |
| 2024-12-09 | Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset | Shanshan Wang et.al. | 2412.06666 | null |
| 2024-12-06 | Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories | Susung Hong et.al. | 2412.05279 | null |
| 2024-12-06 | Birth and Death of a Rose | Chen Geng et.al. | 2412.05278 | null |
| 2024-12-06 | MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models | Tuna Han Salih Meral et.al. | 2412.05275 | null |
| 2024-12-06 | Go-or-Grow Models in Biology: a Monster on a Leash | R. Thiessen et.al. | 2412.05191 | null |
| 2024-12-06 | DNF: Unconditional 4D Generation with Dictionary-based Neural Fields | Xinyi Zhang et.al. | 2412.05161 | null |
| 2024-12-06 | Probabilistic Galaxy Field Generation with Diffusion Models | Tanner Sether et.al. | 2412.05131 | null |
| 2024-12-06 | The Silent Prompt: Initial Noise as Implicit Guidance for Goal-Driven Image Generation | Ruoyu Wang et.al. | 2412.05101 | null |
| 2024-12-06 | ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration | Chi-Wei Hsiao et.al. | 2412.05043 | null |
| 2024-12-06 | Noise Matters: Diffusion Model-based Urban Mobility Generation with Collaborative Noise Priors | Yuheng Zhang et.al. | 2412.05000 | null |
| 2024-12-06 | Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction | Gaurav Shrivastava et.al. | 2412.04929 | null |
| 2024-12-05 | PaintScene4D: Consistent 4D Scene Generation from Text Prompts | Vinayak Gupta et.al. | 2412.04471 | null |
| 2024-12-05 | LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors | Yusuf Dalva et.al. | 2412.04460 | null |
| 2024-12-05 | Four-Plane Factorized Video Autoencoders | Mohammed Suhail et.al. | 2412.04452 | null |
| 2024-12-05 | MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation | Longtao Zheng et.al. | 2412.04448 | null |
| 2024-12-05 | DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models | Yizhuo Li et.al. | 2412.04446 | null |
| 2024-12-05 | Learning Artistic Signatures: Symmetry Discovery and Style Transfer | Emma Finn et.al. | 2412.04441 | null |
| 2024-12-05 | Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation | Yuying Ge et.al. | 2412.04432 | link |
2024-12-05 Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis Jian Han et.al. 2412.04431 link
2024-12-05 Reversible molecular simulation for training classical and machine learning force fields Joe G Greener et.al. 2412.04374 link
2024-12-05 ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation Dayoung Gong et.al. 2412.04353 null
2024-12-04 MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation Zehuan Huang et.al. 2412.03558 null
2024-12-04 NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Lingen Li et.al. 2412.03517 null
2024-12-04 Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion Shengyuan Zhang et.al. 2412.03515 link
2024-12-04 CleanDIFT: Diffusion Features without Noise Nick Stracke et.al. 2412.03439 link
2024-12-04 SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model Yan Li et.al. 2412.03430 null
2024-12-04 Skel3D: Skeleton Guided Novel View Synthesis Aron Fóthi et.al. 2412.03407 null
2024-12-04 Identifiability implies consistency of MLE in partially observed diffusions on a torus Ibrahim Ekren et.al. 2412.03380 null
2024-12-04 TASR: Timestep-Aware Diffusion Model for Image Super-Resolution Qinwei Lin et.al. 2412.03355 link
2024-12-04 DIVE: Taming DINO for Subject-Driven Video Editing Yi Huang et.al. 2412.03347 null
2024-12-04 Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis Tao Jun Lin et.al. 2412.03315 null
2024-12-03 Diffusion-based Visual Anagram as Multi-task Learning Zhiyuan Xu et.al. 2412.02693 link
2024-12-03 FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation Kefan Chen et.al. 2412.02690 null
2024-12-04 SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance Viet Nguyen et.al. 2412.02687 null
2024-12-03 Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation Yiftach Edelstein et.al. 2412.02631 null
2024-12-03 Unveiling Concept Attribution in Diffusion Models Quang H. Nguyen et.al. 2412.02542 null
2024-12-03 It Takes Two: Real-time Co-Speech Two-person's Interaction Generation via Reactive Auto-regressive Diffusion Model Mingyi Shi et.al. 2412.02419 null
2024-12-03 GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing Khawar Islam et.al. 2412.02366 null
2024-12-03 LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization Ethan Smith et.al. 2412.02352 null
2024-12-03 SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models Sabina Martyniak et.al. 2412.02332 link
2024-12-03 Controlling the Latent Diffusion Model for Generative Image Shadow Removal via Residual Generation Xinjie Li et.al. 2412.02322 null
2024-11-29 MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks Yiming Wu et.al. 2411.19786 null
2024-11-29 Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy Jeheon Woo et.al. 2411.19769 null
2024-11-29 TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting Bojun Xiong et.al. 2411.19654 null
2024-11-29 Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing Wenyi Mo et.al. 2411.19652 link
2024-11-29 Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook Florinel-Alin Croitoru et.al. 2411.19537 link
2024-11-29 Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis Tianqi Li et.al. 2411.19509 null
2024-11-29 Diffusion Models Meet Network Management: Improving Traffic Matrix Analysis with Diffusion-based Approach Xinyu Yuan et.al. 2411.19493 link
2024-11-28 DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models Shwetha Ram et.al. 2411.19390 null
2024-11-28 Enhancing Sketch Animation: Text-to-Video Diffusion Models with Temporal Consistency and Rigidity Constraints Gaurav Rai et.al. 2411.19381 null
2024-11-28 Towards a Mechanistic Explanation of Diffusion Model Generalization Matthew Niedoba et.al. 2411.19339 null
2024-11-27 GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data Wentao Wang et.al. 2411.18624 null
2024-11-27 Diffusion Self-Distillation for Zero-Shot Customized Image Generation Shengqu Cai et.al. 2411.18616 null
2024-11-27 CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Rundi Wu et.al. 2411.18613 null
2024-11-27 Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis Eva Prakash et.al. 2411.18602 null
2024-11-27 FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion Haosen Yang et.al. 2411.18552 null
2024-11-28 Enhancing weed detection performance by means of GenAI-based image augmentation Sourav Modak et.al. 2411.18513 null
2024-11-27 Learning the Evolution of Physical Structure of Galaxies via Diffusion Models Andrew Lizarraga et.al. 2411.18440 link
2024-11-27 Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models Yiming Wu et.al. 2411.18375 null
2024-11-27 TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models Riza Velioglu et.al. 2411.18350 link
2024-11-27 HiFiVFS: High Fidelity Video Face Swapping Xu Chen et.al. 2411.18293 null
2024-11-27 StableAnimator: High-Quality Identity-Preserving Human Image Animation Shuyuan Tu et.al. 2411.17697 link
2024-11-26 ScribbleLight: Single Image Indoor Relighting with Scribbles Jun Myeong Choi et.al. 2411.17696 null
2024-11-26 GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration Sudarshan Rajagopalan et.al. 2411.17687 null
2024-11-26 Accelerating Vision Diffusion Transformers with Skip Branches Guanjie Chen et.al. 2411.17616 link
2024-11-26 VideoDirector: Precise Video Editing via Text-to-Video Models Yukun Wang et.al. 2411.17592 null
2024-11-26 FTMoMamba: Motion Generation with Frequency and Text State Space Models Chengjian Li et.al. 2411.17532 null
2024-11-26 WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model Zongjian Li et.al. 2411.17459 link
2024-11-26 Image Generation with Multimodule Semantic Feature-Aided Selection for Semantic Communications Chengyang Liang et.al. 2411.17428 null
2024-11-26 Reward Incremental Learning in Text-to-Image Generation Maorong Wang et.al. 2411.17310 null
2024-11-26 APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents Jun Yu Chen et.al. 2411.17255 link
2024-11-25 Generative Omnimatte: Learning to Decompose Video into Layers Yao-Chih Lee et.al. 2411.16683 null
2024-11-25 Diffusion Features for Zero-Shot 6DoF Object Pose Estimation Bernd Von Gimborn et.al. 2411.16668 null
2024-11-25 LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction Yiran Sun et.al. 2411.16629 link
2024-11-25 Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models Ronghuan Wu et.al. 2411.16602 null
2024-11-25 Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification Andre Kassis et.al. 2411.16598 link
2024-11-25 Rethinking Diffusion for Text-Driven Human Motion Generation Zichong Meng et.al. 2411.16575 null
2024-11-25 Representation Collapsing Problems in Vector Quantization Wenhao Zhao et.al. 2411.16550 null
2024-11-25 ADOBI: Adaptive Diffusion Bridge For Blind Inverse Problems with Application to MRI Reconstruction Yuyang Hu et.al. 2411.16535 null
2024-11-25 Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis Boming Miao et.al. 2411.16503 null
2024-11-25 Model-based reinforcement corrosion prediction: Continuous calibration with Bayesian optimization and corrosion wire sensor data A. Potnis et.al. 2411.16447 null
2024-11-22 DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving Bencheng Liao et.al. 2411.15139 link
2024-11-22 Material Anything: Generating Materials for Any 3D Object via Diffusion Xin Huang et.al. 2411.15138 null
2024-11-22 VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement Daeun Lee et.al. 2411.15115 null
2024-11-22 Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation Lakshmikar R. Polamreddy et.al. 2411.15084 link
2024-11-22 The 1D nonlocal Fisher-KPP equation with a top hat kernel. Part 3. The effect of perturbations in the kernel David John Needham et.al. 2411.15054 null
2024-11-22 FloAt: Flow Warping of Self-Attention for Clothing Animation Generation Swasti Shreya Mishra et.al. 2411.15028 null
2024-11-22 Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation Huy Le et.al. 2411.14913 null
2024-11-22 Prioritize Denoising Steps on Diffusion Model Preference Alignment via Explicit Denoised Distribution Estimation Dingyuan Shi et.al. 2411.14871 null
2024-11-22 Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation Jeongsol Kim et.al. 2411.14863 null
2024-11-22 Style-Friendly SNR Sampler for Style-Driven Generation Jooyoung Choi et.al. 2411.14793 null
2024-11-21 Stable Flow: Vital Layers for Training-Free Image Editing Omri Avrahami et.al. 2411.14430 null
2024-11-21 Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation Yuanhao Cai et.al. 2411.14384 null
2024-11-21 CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields Xin-Yang Liu et.al. 2411.14378 null
2024-11-21 Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models Houze Liu et.al. 2411.14353 null
2024-11-21 StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart Jian Shi et.al. 2411.14295 null
2024-11-21 Guided MRI Reconstruction via Schrödinger Bridge Yue Wang et.al. 2411.14269 null
2024-11-21 TaQ-DiT: Time-aware Quantization for Diffusion Transformers Xinyan Liu et.al. 2411.14172 null
2024-11-21 RestorerID: Towards Tuning-Free Face Restoration with ID Preservation Jiacheng Ying et.al. 2411.14125 link
2024-11-21 Point Cloud Resampling with Learnable Heat Diffusion Wenqiang Xu et.al. 2411.14120 null
2024-11-21 Transforming Static Images Using Generative Models for Video Salient Object Detection Suhwan Cho et.al. 2411.13975 link
2024-11-20 REDUCIO! Generating 1024 $\times$ 1024 Video within 16 Seconds using Extremely Compressed Motion Latents Rui Tian et.al. 2411.13552 link
2024-11-20 Identity Preserving 3D Head Stylization with Multiview Score Distillation Bahri Batuhan Bilecen et.al. 2411.13536 null
2024-11-20 Heuristically Adaptive Diffusion-Model Evolutionary Strategy Benedikt Hartl et.al. 2411.13420 null
2024-11-20 XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation Ziyi Wang et.al. 2411.13243 link
2024-11-20 A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM) Antonino Visalli et.al. 2411.13203 link
2024-11-20 RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation Christoph Reinders et.al. 2411.13150 link
2024-11-20 CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models Naen Xu et.al. 2411.13144 null
2024-11-20 Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry Yijie Zhang et.al. 2411.13120 null
2024-11-19 Breaking the wire: the impact of critical length on melting pathways in silver nanowires Kannan M Ridings et.al. 2411.12891 null
2024-11-19 From Text to Pose to Image: Improving Diffusion Model Control and Quality Clément Bonnett et.al. 2411.12872 link
2024-11-19 PoM: Efficient Image and Video Generation with the Polynomial Mixer David Picard et.al. 2411.12663 link
2024-11-19 Improving Controllability and Editability for Pretrained Text-to-Music Generation Models Yixiao Zhang et.al. 2411.12641 null
2024-11-19 Data Pruning in Generative Diffusion Models Rania Briq et.al. 2411.12523 null
2024-11-19 Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models Jun Xiao et.al. 2411.12450 null
2024-11-19 Combinational Backdoor Attack against Customized Text-to-Image Models Wenbo Jiang et.al. 2411.12389 null
2024-11-19 Scalable and Effective Negative Sample Generation for Hyperedge Prediction Shilin Qu et.al. 2411.12354 null
2024-11-19 Diffusion Product Quantization Jie Shao et.al. 2411.12306 null
2024-11-19 SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model Haowen Zheng et.al. 2411.12290 link
2024-11-20 HouseLLM: LLM-Assisted Two-Phase Text-to-Floorplan Generation Ziyang Zong et.al. 2411.12279 null
2024-11-19 Wavespeed selection of travelling wave solutions of a two-component reaction-diffusion model of cell invasion Yuhui Chen et.al. 2411.12232 null
2024-11-18 Aligning Few-Step Diffusion Models with Dense Reward Difference Learning Ziyi Zhang et.al. 2411.11727 link
2024-11-18 Robust Reinforcement Learning under Diffusion Models for Data with Jumps Chenyang Jiang et.al. 2411.11697 null
2024-11-18 Conceptwm: A Diffusion Model Watermark for Concept Protection Liangqi Lei et.al. 2411.11688 null
2024-11-19 Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation Rüveyda Yilmaz et.al. 2411.11515 null
2024-11-18 MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion Dongseok Shim et.al. 2411.11475 null
2024-11-18 CLUE-MARK: Watermarking Diffusion Models using CLWE Kareem Shehata et.al. 2411.11434 null
2024-11-18 Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge Qinglong Cao et.al. 2411.11343 null
2024-11-18 Stochastic quantization and diffusion models Kenji Fukushima et.al. 2411.11297 null
2024-11-17 Stealing Training Graphs from Graph Neural Networks Minhua Lin et.al. 2411.11197 null
2024-11-17 DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images Zhen Yuan et.al. 2411.11190 null
2024-11-15 M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation Sucheng Ren et.al. 2411.10433 link
2024-11-15 Mitigating Parameter Degeneracy using Joint Conditional Diffusion Model for WECC Composite Load Model in Power Systems Feiqin Zhu et.al. 2411.10431 null
2024-11-15 Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion Haoran Wei et.al. 2411.10369 null
2024-11-15 Probabilistic Prior Driven Attention Mechanism Based on Diffusion Model for Imaging Through Atmospheric Turbulence Guodong Sun et.al. 2411.10321 null
2024-11-15 Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting Ziqi Xie et.al. 2411.10309 link
2024-11-15 The Unreasonable Effectiveness of Guidance for Diffusion Models Tim Kaiser et.al. 2411.10257 null
2024-11-15 ColorEdit: Training-free Image-Guided Color editing with diffusion model Xingxi Yin et.al. 2411.10232 null
2024-11-15 Evaluating Text-to-Image Diffusion Models for Texturing Synthetic Data Thomas Lips et.al. 2411.10164 link
2024-11-15 Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning Yushen Zuo et.al. 2411.10130 null
2024-11-15 SPLIT: SE(3)-diffusion via Local Geometry-based Score Prediction for 3D Scene-to-Pose-Set Matching Problems Kanghyun Kim et.al. 2411.10049 null
2024-11-14 Golden Noise for Diffusion Models: A Learning Framework Zikai Zhou et.al. 2411.09502 link
2024-11-14 DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing Junjie Zhou et.al. 2411.09451 null
2024-11-14 Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models Chutian Meng et.al. 2411.09449 null
2024-11-14 A survey of probabilistic generative frameworks for molecular simulations Richard John et.al. 2411.09388 link
2024-11-14 EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models Soowon Kim et.al. 2411.09302 null
2024-11-14 Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance Md Fahim Anjum et.al. 2411.09174 null
2024-11-14 VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation Youpeng Wen et.al. 2411.09153 null
2024-11-14 General linear threshold models with application to influence maximization Alexander Kagan et.al. 2411.09100 link
2024-11-13 Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples Noël Vouitsis et.al. 2411.08954 link
2024-11-13 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization Mijeong Kim et.al. 2411.08879 null
2024-11-13 Offline Adaptation of Quadruped Locomotion using Diffusion Models Reece O'Mahoney et.al. 2411.08832 null
2024-11-13 Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models Chengdong Dong et.al. 2411.08642 null
2024-11-13 V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion Xun Huang et.al. 2411.08402 link
2024-11-13 Physics Informed Distillation for Diffusion Models Joshua Tian Jin Tee et.al. 2411.08378 link
2024-11-13 Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study Jinbo Wen et.al. 2411.08341 null
2024-11-13 Motion Control for Enhanced Complex Action Video Generation Qiang Zhou et.al. 2411.08328 null
2024-11-13 DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach Xin Tang et.al. 2411.08299 null
2024-11-12 Joint Diffusion models in Continual Learning Paweł Skierś et.al. 2411.08224 null
2024-11-12 Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing Zitao Shuai et.al. 2411.08196 null
2024-11-12 Scaling Properties of Diffusion Models for Perceptual Tasks Rahul Ravishankar et.al. 2411.08034 null
2024-11-12 GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation Yushi Lan et.al. 2411.08033 null
2024-11-12 Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules Binxu Wang et.al. 2411.07873 null
2024-11-12 Novel View Synthesis with Pixel-Space Diffusion Models Noam Elata et.al. 2411.07765 null
2024-11-12 Nanosecond nanothermometry in an electron microscope Florian Castioni et.al. 2411.07764 null
2024-11-12 Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion Kaiyu Song et.al. 2411.07627 null
2024-11-12 Unraveling the Connections between Flow Matching and Diffusion Probabilistic Models in Training-free Conditional Generation Kaiyu Song et.al. 2411.07625 null
2024-11-12 Harmonizing Pixels and Melodies: Maestro-Guided Film Score Generation and Composition Style Transfer F. Qi et.al. 2411.07539 null
2024-11-12 FM-TS: Flow Matching for Time Series Generation Yang Hu et.al. 2411.07506 link
2024-11-12 Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors Anisha Pal et.al. 2411.07472 link
2024-11-11 Score-based generative diffusion with "active" correlated noise sources Alexandra Lamtyugina et.al. 2411.07233 null
2024-11-11 Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models Yoad Tewel et.al. 2411.07232 null
2024-11-11 DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID Nyle Siddiqui et.al. 2411.07205 link
2024-11-11 Crossover from inhomogeneous to homogeneous response of a resonantly driven hBN quantum emitter Domitille Gérard et.al. 2411.07202 null
2024-11-11 OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision Cong Wei et.al. 2411.07199 null
2024-11-11 More Expressive Attention with Negative Weights Ang Lv et.al. 2411.07176 link
2024-11-11 Edify 3D: Scalable High-Quality 3D Asset Generation NVIDIA et.al. 2411.07135 null
2024-11-11 Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models NVIDIA et.al. 2411.07126 null
2024-11-11 White-Box Diffusion Transformer for single-cell RNA-seq generation Zhuorui Cui et.al. 2411.06785 link
2024-11-11 DiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite Observations Xuming He et.al. 2411.06714 null
2024-11-08 StdGEN: Semantic-Decomposed 3D Character Generation from Single Images Yuze He et.al. 2411.05738 null
2024-11-08 Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models Jia-Hong Huang et.al. 2411.05706 null
2024-11-08 Improving Molecular Graph Generation with Flow Matching and Optimal Transport Xiaoyang Hou et.al. 2411.05676 null
2024-11-08 Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion Nan Song et.al. 2411.05544 null
2024-11-08 Improving image synthesis with diffusion-negative sampling Alakh Desai et.al. 2411.05473 null
2024-11-08 Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation Peidong Liu et.al. 2411.05472 link
2024-11-08 RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction Xingyu Ai et.al. 2411.05354 null
2024-11-08 Electro-diffusive modeling and the role of spine geometry on action potential propagation in neurons Rahul Gulati et.al. 2411.05329 null
2024-11-08 Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet Boxiao Yu et.al. 2411.05302 null
2024-11-07 Generalizable Single-Source Cross-modality Medical Image Segmentation via Invariant Causal Mechanisms Boqi Chen et.al. 2411.05223 null
2024-11-07 SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models Muyang Li et.al. 2411.05007 link
2024-11-07 ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing Jun-Kun Chen et.al. 2411.05006 null
2024-11-07 Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models Shuhong Zheng et.al. 2411.05005 null
2024-11-07 ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning David Junhao Zhang et.al. 2411.05003 null
2024-11-07 SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation Koichi Namekata et.al. 2411.04989 null
2024-11-07 Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification Mischa Dombrowski et.al. 2411.04956 null
2024-11-07 DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion Wenqiang Sun et.al. 2411.04928 null
2024-11-07 Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion Kaizhe Hu et.al. 2411.04919 link
2024-11-07 Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation Benito Buchheim et.al. 2411.04724 null
2024-11-07 DanceFusion: A Spatio-Temporal Skeleton Diffusion Transformer for Audio-Driven Dance Motion Reconstruction Li Zhao et.al. 2411.04646 null
2024-11-06 Community Forensics: Using Thousands of Generators to Train Fake Image Detectors Jeongsoo Park et.al. 2411.04125 null
2024-11-06 Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging Yuan Bi et.al. 2411.04004 null
2024-11-06 ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy Chenrui Tie et.al. 2411.03990 null
2024-11-06 ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models Ashutosh Srivastava et.al. 2411.03982 null
2024-11-06 ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization Huayang Huang et.al. 2411.03862 link
2024-11-06 Sub-DM: Subspace Diffusion Model with Orthogonal Decomposition for MRI Reconstruction Yu Guan et.al. 2411.03758 null
2024-11-06 Zero-shot Dynamic MRI Reconstruction with Global-to-local Diffusion Model Yu Guan et.al. 2411.03723 link
2024-11-06 Investigating Conceptual Blending of a Diffusion Model for Improving Nonword-to-Image Generation Chihaya Matsuhira et.al. 2411.03595 null
2024-11-05 Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data Seunggeun Chi et.al. 2411.03561 null
2024-11-05 SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture Andrew Heschl et.al. 2411.03505 link
2024-11-05 DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models Ying Zhou et.al. 2411.03250 null
2024-11-05 On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models Tariq Berrada Ifriqi et.al. 2411.03177 null
2024-11-05 Unleashing the power of novel conditional generative approaches for new materials discovery Lev Novitskiy et.al. 2411.03156 link
2024-11-05 Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising Tao Huang et.al. 2411.03053 null
2024-11-05 GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details Zhongjin Luo et.al. 2411.03047 null
2024-11-05 IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems Heiko Oppel et.al. 2411.02954 null
2024-11-05 LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior Xingjian Tang et.al. 2411.02951 null
2024-11-05 How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion Giannis Daras et.al. 2411.02780 link
2024-11-04 Modelling Alzheimer's Protein Dynamics: A Data-Driven Integration of Stochastic Methods, Machine Learning and Connectome Insights Alec MacIver et.al. 2411.02644 null
2024-11-04 Training-free Regional Prompting for Diffusion Transformers Anthony Chen et.al. 2411.02395 link
2024-11-04 Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition Xinkai Liu et.al. 2411.02334 null
2024-11-04 LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation Mufei Li et.al. 2411.02322 link
2024-11-04 Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation Xianghui Yang et.al. 2411.02293 null
2024-11-04 FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training Ruihong Yin et.al. 2411.02229 null
2024-11-04 CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality Yiqin Zhao et.al. 2411.02179 null
2024-11-04 Model Integrity when Unlearning with T2I Diffusion Models Andrea Schioppa et.al. 2411.02068 null
2024-11-04 DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability Bo Gao et.al. 2411.01819 null
2024-11-04 MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence Fuming You et.al. 2411.01805 null
2024-11-04 A Regressor-Guided Graph Diffusion Model for Predicting Enzyme Mutations to Enhance Turnover Number Xiaozhu Yu et.al. 2411.01745 link
2024-10-31 DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion Weicai Ye et.al. 2410.24203 link
2024-10-31 Redefining in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation Fu Feng et.al. 2410.24160 null
2024-10-31 Scaling Concept With Text-Guided Diffusion Models Chao Huang et.al. 2410.24151 null
2024-10-31 Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure Xiang Li et.al. 2410.24060 link
2024-10-31 TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation Sunjae Yoon et.al. 2410.24037 null
2024-10-31 DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination Jia Fu et.al. 2410.24006 link
2024-11-01 Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model Wenjia Xie et.al. 2410.23994 null
2024-10-31 Stochastic Reconstruction of Gappy Lagrangian Turbulent Signals by Conditional Diffusion Models Tianyi Li et.al. 2410.23971 null
2024-10-31 Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation Yihang Zhou et.al. 2410.23962 null
2024-10-31 Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model Hao Zhang et.al. 2410.23905 link
2024-10-30 ReferEverything: Towards Segmenting Everything We Can Speak of in Videos Anurag Bagchi et.al. 2410.23287 null
2024-10-30 Provable acceleration for diffusion models under minimal assumptions Gen Li et.al. 2410.23285 null
2024-10-30 RelationBooth: Towards Relation-Aware Customized Object Generation Qingyu Shi et.al. 2410.23280 null
2024-10-30 SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation Yining Hong et.al. 2410.23277 null
2024-10-30 Multi-student Diffusion Distillation for Better One-step Generators Yanke Song et.al. 2410.23274 null
2024-10-30 CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense Mingkun Zhang et.al. 2410.23091 link
2024-10-30 Controlling Language and Diffusion Models by Transporting Activations Pau Rodriguez et.al. 2410.23054 link
2024-10-30 Improving Musical Accompaniment Co-creation via Diffusion Transformers Javier Nistal et.al. 2410.23005 null
2024-10-30 DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes Jialiang Zhang et.al. 2410.23004 null
2024-10-30 LumiSculpt: A Consistency Lighting Control Network for Video Generation Yuxin Zhang et.al. 2410.22979 null
2024-10-29 Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models Raman Dutt et.al. 2410.22149 link
2024-10-29 Variational inference for pile-up removal at hadron colliders with diffusion models Malte Algren et.al. 2410.22074 null
2024-10-29 Dual Conditional Diffusion Models for Sequential Recommendation Hongtao Huang et.al. 2410.21967 null
2024-10-29 PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference Kendong Liu et.al. 2410.21966 null
2024-10-29 CT to PET Translation: A Large-scale Dataset and Domain-Knowledge-Guided Diffusion Approach Dac Thai Nguyen et.al. 2410.21932 link
2024-10-29 Guided Diffusion-based Counterfactual Augmentation for Robust Session-based Recommendation Muskan Gupta et.al. 2410.21892 null
2024-10-29 Diffusion as Reasoning: Enhancing Object Goal Navigation with LLM-Biased Diffusion Model Yiming Ji et.al. 2410.21842 null
2024-10-29 Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images Suhyun Ahn et.al. 2410.21826 link
2024-10-29 HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion Yu Zeng et.al. 2410.21789 null
2024-10-29 DiffusionVel: Multi-Information Integrated Velocity Inversion Using Generative Diffusion Models Hao Zhang et.al. 2410.21776 null
2024-10-28 On Inductive Biases That Enable Generalization of Diffusion Transformers Jie An et.al. 2410.21273 link
2024-10-28 One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation Zhendong Wang et.al. 2410.21257 null
2024-10-28 On learning higher-order cumulants in diffusion models Gert Aarts et.al. 2410.21212 null
2024-10-28 Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences Zhihao Zhao et.al. 2410.21130 null
2024-10-28 Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models Wenda Li et.al. 2410.21088 link
2024-10-28 Federated Time Series Generation on Feature and Temporally Misaligned Data Chenrui Fan et.al. 2410.21072 null
2024-10-28 Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework Vladimir Arkhipkin et.al. 2410.21061 link
2024-10-28 Beyond Autoregression: Fast LLMs via Self-Distillation Through Time Justin Deschenaux et.al. 2410.21035 link
2024-10-29 EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior Xin Xiang et.al. 2410.20981 null
2024-10-28 Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models! Arash Marioriyad et.al. 2410.20972 null
2024-10-25 Adversarial Environment Design via Regret-Guided Diffusion Models Hojun Chung et.al. 2410.19715 null
2024-10-25 DiffGS: Functional Gaussian Splatting Diffusion Junsheng Zhou et.al. 2410.19657 null
2024-10-25 Diffusion models for lattice gauge field simulations Qianteng Zhu et.al. 2410.19602 null
2024-10-25 Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series Ilan Naiman et.al. 2410.19538 null
2024-10-25 Ensemble Data Assimilation for Particle-based Methods Marius Duvillard et.al. 2410.19525 null
2024-10-28 NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction Zixuan Gong et.al. 2410.19452 link
2024-10-25 Learned Reference-based Diffusion Sampling for multi-modal distributions Maxence Noble et.al. 2410.19449 null
2024-10-25 Generative Diffusion Models for Sequential Recommendations Sharare Zolghadr et.al. 2410.19429 null
2024-10-25 FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality Zhengyao Lv et.al. 2410.19355 null
2024-10-25 High Resolution Seismic Waveform Generation using Denoising Diffusion Andreas Bergmeister et.al. 2410.19343 null
2024-10-24 MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms Ling-Hao Chen et.al. 2410.18977 null
2024-10-24 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation Hansheng Chen et.al. 2410.18974 link
2024-10-24 On the Crucial Role of Initialization for Matrix Factorization Bingcong Li et.al. 2410.18965 null
2024-10-24 Stable Consistency Tuning: Understanding and Improving Consistency Models Fu-Yun Wang et.al. 2410.18958 link
2024-10-24 Generation of synthetic financial time series by diffusion models Tomonori Takahashi et.al. 2410.18897 null
2024-10-24 The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods Linda Laurier et.al. 2410.18866 null
2024-10-24 Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation Xiaoyu Zhang et.al. 2410.18830 null
2024-10-24 Fast constrained sampling in pre-trained diffusion models Alexandros Graikos et.al. 2410.18804 null
2024-10-24 Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances Shilin Lu et.al. 2410.18775 link
2024-10-25 Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing Haonan Lin et.al. 2410.18756 null
2024-10-23 DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes Hengwei Bian et.al. 2410.18084 null
2024-10-23 Prioritized Generative Replay Renhao Wang et.al. 2410.18082 null
2024-10-23 Optical Generative Models Shiqi Chen et.al. 2410.17970 null
2024-10-23 A Wavelet Diffusion GAN for Image Super-Resolution Lorenzo Aloisi et.al. 2410.17966 null
2024-10-23 Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation Wenfang Yao et.al. 2410.17918 link
2024-10-23 Scaling Diffusion Language Models via Adaptation from Autoregressive Models Shansan Gong et.al. 2410.17891 link
2024-10-23 Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech Danilo de Oliveira et.al. 2410.17834 null
2024-10-23 PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation Feiyan Feng et.al. 2410.17812 null
2024-10-23 AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution Yuanting Fan et.al. 2410.17752 null
2024-10-23 VISAGE: Video Synthesis using Action Graphs for Surgery Yousef Yeganeh et.al. 2410.17751 null
2024-10-22 Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding Yasha Ektefaie et.al. 2410.17173 link
2024-10-22 DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization Haowei Zhu et.al. 2410.16942 null
2024-10-22 Hierarchical Clustering for Conditional Diffusion in Image Generation Jorge da Silva Goncalves et.al. 2410.16910 link
2024-10-22 VistaDream: Sampling multiview consistent images for single-view scene reconstruction Haiping Wang et.al. 2410.16892 null
2024-10-22 MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model Meng Xu et.al. 2410.16840 null
2024-10-22 Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection Laurent Colbois et.al. 2410.16802 link
2024-10-22 One-Step Diffusion Distillation through Score Implicit Matching Weijian Luo et.al. 2410.16794 link
2024-10-22 LLM-Assisted Red Teaming of Diffusion Models through "Failures Are Fated, But Can Be Faded" Som Sagar et.al. 2410.16738 null
2024-10-22 Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing Runpu Wei et.al. 2410.16732 null
2024-10-22 DiffusionSeeder: Seeding Motion Optimization with Diffusion for Rapid Motion Planning Huang Huang et.al. 2410.16727 null
2024-10-21 MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors Honghua Chen et.al. 2410.16272 null
2024-10-21 A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data Simon Deltadahl et.al. 2410.16177 null
2024-10-22 Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models Giannis Daras et.al. 2410.16152 null
2024-10-21 SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation Xinyi Zhou et.al. 2410.16119 null
2024-10-21 Continuous Speech Synthesis using per-token Latent Diffusion Arnon Turetzky et.al. 2410.16048 null
2024-10-22 CamI2V: Camera-Controlled Image-to-Video Diffusion Model Guangcong Zheng et.al. 2410.15957 link
2024-10-21 Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces Jifeng Hu et.al. 2410.15698 null
2024-10-21 Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation Anh Bui et.al. 2410.15618 link
2024-10-20 Data Augmentation via Diffusion Model to Enhance AI Fairness Christina Hastings Blow et.al. 2410.15470 null
2024-10-20 MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications Yongrui Yu et.al. 2410.15432 null
2024-10-18 Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior Calvin-Khang Ta et.al. 2410.14540 null
2024-10-18 LEAD: Latent Realignment for Human Motion Diffusion Nefeli Andreou et.al. 2410.14508 null
2024-10-18 Reinforcement Learning in Non-Markov Market-Making Luca Lalor et.al. 2410.14504 null
2024-10-18 ANT: Adaptive Noise Schedule for Time Series Diffusion Models Seunghan Lee et.al. 2410.14488 link
2024-10-18 DRL Optimization Trajectory Generation via Wireless Network Intent-Guided Diffusion Models for Optimizing Resource Allocation Junjie Wu et.al. 2410.14481 null
2024-10-18 FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models Rui Hu et.al. 2410.14429 null
2024-10-18 Dynamic Negative Guidance of Diffusion Models Felix Koulischer et.al. 2410.14398 link
2024-10-18 HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation Bo Cheng et.al. 2410.14324 link
2024-10-18 ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer Yuhao Wan et.al. 2410.14279 null
2024-10-18 HYPNOS : Highly Precise Foreground-focused Diffusion Finetuning for Inanimate Objects Oliverio Theophilus Nathanael et.al. 2410.14265 null
2024-10-17 Diffusing States and Matching Scores: A New Framework for Imitation Learning Runzhe Wu et.al. 2410.13855 link
2024-10-17 Influence Functions for Scalable Data Attribution in Diffusion Models Bruno Mlodozeniec et.al. 2410.13850 null
2024-10-17 Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning Xiaodan Xing et.al. 2410.13823 link
2024-10-17 ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution Junhao Gu et.al. 2410.13807 null
2024-10-17 Probing the Latent Hierarchical Structure of Data via Diffusion Models Antonio Sclocchi et.al. 2410.13770 null
2024-10-17 Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers Yuchen Liang et.al. 2410.13746 null
2024-10-17 Improved Convergence Rate for Diffusion Probabilistic Models Gen Li et.al. 2410.13738 null
2024-10-18 DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation Hanbo Cheng et.al. 2410.13726 link
2024-10-18 Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion Yijun Liang et.al. 2410.13674 link
2024-10-17 Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design Chenyu Wang et.al. 2410.13643 link
2024-10-16 Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts Hongcheng Gao et.al. 2410.12777 link
2024-10-16 SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation Jaehong Yoon et.al. 2410.12761 null
2024-10-16 Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization Xingqi Wang et.al. 2410.12700 link
2024-10-16 AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing DuoSheng Chen et.al. 2410.12696 null
2024-10-16 One Step Diffusion via Shortcut Models Kevin Frans et.al. 2410.12557 link
2024-10-16 Disentangling data distribution for Federated Learning Xinyuan Zhao et.al. 2410.12530 null
2024-10-16 Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing Mingce Guo et.al. 2410.12526 null
2024-10-16 Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective Yongxin Zhu et.al. 2410.12490 link
2024-10-16 DaDiff: Domain-aware Diffusion Model for Nighttime UAV Tracking Haobo Zuo et.al. 2410.12270 link
2024-10-16 FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation Huadai Liu et.al. 2410.12266 null
2024-10-15 High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion Junhwa Hur et.al. 2410.11838 null
2024-10-15 On the Effectiveness of Dataset Alignment for Fake Image Detection Anirudh Sundara Rajan et.al. 2410.11835 null
2024-10-15 Bayesian Experimental Design via Contrastive Diffusions Jacopo Iollo et.al. 2410.11826 link
2024-10-15 Improving Long-Text Alignment for Text-to-Image Diffusion Models Luping Liu et.al. 2410.11817 link
2024-10-15 SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing Zhiyuan Zhang et.al. 2410.11815 null
2024-10-16 Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices Zhiyuan Ma et.al. 2410.11795 null
2024-10-15 Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems Jason Hu et.al. 2410.11730 null
2024-10-15 DeformPAM: Data-Efficient Learning for Long-horizon Deformable Object Manipulation via Preference-based Action Alignment Wendi Chen et.al. 2410.11584 link
2024-10-15 Riemann-Liouville fractional Brownian motion with random Hurst exponent Hubert Woszczek et.al. 2410.11546 null
2024-10-15 InvSeg: Test-Time Prompt Inversion for Semantic Segmentation Jiayi Lin et.al. 2410.11473 null
2024-10-14 Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models Jingzhi Bao et.al. 2410.10821 link
2024-10-14 Depth Any Video with Scalable Synthetic Data Honghui Yang et.al. 2410.10815 link
2024-10-14 HART: Efficient Visual Generation with Hybrid Autoregressive Transformer Haotian Tang et.al. 2410.10812 link
2024-10-14 TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction Qingze et.al. 2410.10804 link
2024-10-14 Boosting Camera Motion Control for Video Diffusion Transformers Soon Yau Cheong et.al. 2410.10802 null
2024-10-14 Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Litu Rout et.al. 2410.10792 null
2024-10-14 ControlMM: Controllable Masked Motion Generation Ekkasit Pinyoanuntapong et.al. 2410.10780 null
2024-10-14 Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation Youwei Yu et.al. 2410.10766 null
2024-10-14 DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships Zhang Wan et.al. 2410.10751 null
2024-10-14 FlexGen: Flexible Multi-View Generation from Text and Image Inputs Xinli Xu et.al. 2410.10745 null
2024-10-11 SceneCraft: Layout-Guided 3D Scene Generation Xiuyu Yang et.al. 2410.09049 link
2024-10-11 Linear Convergence of Diffusion Models Under the Manifold Hypothesis Peter Potaptchik et.al. 2410.09046 null
2024-10-11 Semantic Score Distillation Sampling for Compositional Text-to-3D Generation Ling Yang et.al. 2410.09009 link
2024-10-11 WaveDiffusion: Exploring Full Waveform Inversion via Joint Diffusion in the Latent Space Hanchen Wang et.al. 2410.09002 null
2024-10-11 DiffPO: A causal diffusion model for learning distributions of potential outcomes Yuchen Ma et.al. 2410.08924 null
2024-10-11 Distillation of Discrete Diffusion through Dimensional Correlations Satoshi Hayakawa et.al. 2410.08709 null
2024-10-11 Gait Sequence Upsampling using Diffusion Models for single LiDAR sensors Jeongho Ahn et.al. 2410.08680 null
2024-10-11 E-Motion: Future Motion Simulation via Event Sequence Diffusion Song Wu et.al. 2410.08649 link
2024-10-11 Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT Prompting Purushothaman Natarajan et.al. 2410.08612 link
2024-10-11 Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models Pascal Zwick et.al. 2410.08551 link
2024-10-10 DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models Xiaoxiao He et.al. 2410.08207 null
2024-10-10 HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation Shanyan Guan et.al. 2410.08192 null
2024-10-10 DifFRelight: Diffusion-Based Facial Performance Relighting Mingming He et.al. 2410.08188 null
2024-10-10 ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion Zitian Zhang et.al. 2410.08168 link
2024-10-10 DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Jiatao Gu et.al. 2410.08159 null
2024-10-10 Progressive Autoregressive Video Diffusion Models Desai Xie et.al. 2410.08151 link
2024-10-10 Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction Jarrid Rector-Brooks et.al. 2410.08134 null
2024-10-10 Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models Vinith M. Suriyakumar et.al. 2410.08074 null
2024-10-10 LADIMO: Face Morph Generation through Biometric Template Inversion with Latent Diffusion Marcel Grimmer et.al. 2410.07988 link
2024-10-10 AI Surrogate Model for Distributed Computing Workloads David K. Park et.al. 2410.07940 null
2024-10-09 IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation Xinchen Zhang et.al. 2410.07171 link
2024-10-09 AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation Yukang Cao et.al. 2410.07164 null
2024-10-09 InstructG2I: Synthesizing Images from Multimodal Attributed Graphs Bowen Jin et.al. 2410.07157 link
2024-10-09 Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis Bohan Zeng et.al. 2410.07155 link
2024-10-09 Diffusion Density Estimators Akhil Premkumar et.al. 2410.06986 null
2024-10-09 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control Shimon Vainer et.al. 2410.06985 null
2024-10-09 Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think Sihyun Yu et.al. 2410.06940 link
2024-10-09 Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis Ahmed Abdullah et.al. 2410.06841 null
2024-10-09 Diffuse or Confuse: A Diffusion Deepfake Speech Dataset Anton Firc et.al. 2410.06796 link
2024-10-09 Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography Qianqian Xue et.al. 2410.06757 null
2024-10-07 DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control Kaifeng Zhao et.al. 2410.05260 null
2024-10-07 GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting Yukang Cao et.al. 2410.05259 null
2024-10-07 SePPO: Semi-Policy Preference Optimization for Diffusion Alignment Daoan Zhang et.al. 2410.05255 link
2024-10-07 DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration Yongtai Zhuo et.al. 2410.05234 link
2024-10-07 Presto! Distilling Steps and Layers for Accelerating Music Generation Zachary Novack et.al. 2410.05167 null
2024-10-08 A Simulation-Free Deep Learning Approach to Stochastic Optimal Control Mengjian Hua et.al. 2410.05163 null
2024-10-07 Leveraging Multimodal Diffusion Models to Accelerate Imaging with Side Information Timofey Efimov et.al. 2410.05143 null
2024-10-07 Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning Ayano Hiranaka et.al. 2410.05116 null
2024-10-07 DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects Nidhi Mathihalli et.al. 2410.05097 link
2024-10-07 A nodally bound-preserving discontinuous Galerkin method for the drift-diffusion equation Gabriel R. Barrenechea et.al. 2410.05040 null
2024-10-04 Estimating Body and Hand Motion in an Ego-sensed World Brent Yi et.al. 2410.03665 null
2024-10-04 Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models Chumeng Liang et.al. 2410.03640 link
2024-10-04 How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework Yinuo Ren et.al. 2410.03601 null
2024-10-04 Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features Benyuan Meng et.al. 2410.03558 link
2024-10-04 Diffusion State-Guided Projected Gradient for Inverse Problems Rayhan Zirvi et.al. 2410.03463 null
2024-10-04 Generative Semantic Communication for Text-to-Speech Synthesis Jiahao Zheng et.al. 2410.03459 null
2024-10-04 Dynamic Diffusion Transformer Wangbo Zhao et.al. 2410.03456 link
2024-10-04 CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control Guy Tevet et.al. 2410.03441 link
2024-10-04 The scaling behaviour of localised and extended states in one-dimensional tight-binding models with disorder Luca Schaefer et.al. 2410.03405 null
2024-10-04 Latent Abstractions in Generative Diffusion Models Giulio Franzese et.al. 2410.03368 null
2024-10-03 Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models Zhengfeng Lai et.al. 2410.02740 null
2024-10-03 SteerDiff: Steering towards Safe Text-to-Image Diffusion Models Hongxiang Zhang et.al. 2410.02710 null
2024-10-03 ControlAR: Controllable Image Generation with Autoregressive Models Zongming Li et.al. 2410.02705 link
2024-10-03 GUD: Generation with Unified Diffusion Mathis Gerdes et.al. 2410.02667 null
2024-10-03 Efficient calibration of the shifted square-root diffusion model to credit default swap spreads using asymptotic approximations Ankush Agarwal et.al. 2410.02645 null
2024-10-04 Diffusion Models are Evolutionary Algorithms Yanbo Zhang et.al. 2410.02543 link
2024-10-03 Lightweight Diffusion Models for Resource-Constrained Semantic Communication Giovanni Pignata et.al. 2410.02491 link
2024-10-03 Towards a Theoretical Understanding of Memorization in Diffusion Models Yunhao Chen et.al. 2410.02467 null
2024-10-03 Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models Seyedmorteza Sadat et.al. 2410.02416 null
2024-10-03 Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks Zeyu Feng et.al. 2410.02389 null
2024-10-02 FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images Cheng Zhang et.al. 2410.01801 null
2024-10-02 Dynamical-generative downscaling of climate model ensembles Ignacio Lopez-Gomez et.al. 2410.01776 null
2024-10-02 ImageFolder: Autoregressive Image Generation with Folded Tokens Xiang Li et.al. 2410.01756 link
2024-10-02 VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models Kailai Feng et.al. 2410.01738 link
2024-10-02 HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration Yushi Huang et.al. 2410.01723 null
2024-10-02 KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models Pouyan Navard et.al. 2410.01595 link
2024-10-02 MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation Mingzhen Sun et.al. 2410.01594 link
2024-10-02 HRTF Estimation using a Score-based Prior Etienne Thuillier et.al. 2410.01562 null
2024-10-02 Edge-preserving noise for diffusion models Jente Vandersanden et.al. 2410.01540 null
2024-10-02 Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models Ching-Chia Kao et.al. 2410.01438 null
2024-09-30 COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models Divyanshu Daiya et.al. 2409.20502 null
2024-09-30 FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing Lingling Cai et.al. 2409.20500 null
2024-09-30 Ensemble Kalman Diffusion Guidance: A Derivative-free Method for Inverse Problems Hongkai Zheng et.al. 2409.20175 null
2024-09-30 Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model Fulong Ma et.al. 2409.20164 null
2024-09-30 Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation Rong Tang et.al. 2409.20124 null
2024-09-30 Reaction-diffusion model for a population structured in phenotype and space I -- Criterion for persistence Nathanaël Boutillon et.al. 2409.20118 null
2024-09-30 RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models Jangyeong Kim et.al. 2409.19989 null
2024-09-30 Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function Chenyi Zhuang et.al. 2409.19967 link
2024-10-02 Image Copy Detection for Diffusion Models Wenhao Wang et.al. 2409.19952 null
2024-09-30 Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner Chenyou Fan et.al. 2409.19949 null
2024-09-27 $O(d/T)$ Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions Gen Li et.al. 2409.18959 null
2024-09-27 ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions Wenfeng Huang et.al. 2409.18932 null
2024-09-27 Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors Yunlong Lin et.al. 2409.18899 null
2024-09-27 Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis Songrui Wang et.al. 2409.18897 null
2024-09-27 Explainable Artifacts for Synthetic Western Blot Source Attribution João Phillipe Cardenuto et.al. 2409.18881 link
2024-09-27 Emu3: Next-Token Prediction is All You Need Xinlong Wang et.al. 2409.18869 null
2024-09-27 Convergence of Diffusion Models Under the Manifold Hypothesis in High-Dimensions Iskander Azangulov et.al. 2409.18804 null
2024-09-27 Unsupervised Fingerphoto Presentation Attack Detection With Diffusion Models Hailin Li et.al. 2409.18636 null
2024-09-27 Treating Brain-inspired Memories as Priors for Diffusion Model to Forecast Multivariate Time Series Muyao Wang et.al. 2409.18491 null
2024-09-27 Gradient-free Decoder Inversion in Latent Diffusion Models Seongmin Hong et.al. 2409.18442 null
2024-09-26 FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner Wenliang Zhao et.al. 2409.18128 link
2024-09-26 Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction Jing He et.al. 2409.18124 null
2024-09-26 EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation Jiaxiang Tang et.al. 2409.18114 null
2024-09-26 StackGen: Generating Stable Structures from Silhouettes via Diffusion Luzhe Sun et.al. 2409.18098 null
2024-09-26 DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models Helin Cao et.al. 2409.18092 null
2024-09-26 Stable Video Portraits Mirela Ostrek et.al. 2409.18083 null
2024-09-26 PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging Xin Cai et.al. 2409.17996 null
2024-09-26 Joint Localization and Planning using Diffusion L. Lao Beyer et.al. 2409.17995 null
2024-09-26 CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors Linye Lyu et.al. 2409.17963 link
2024-09-26 Relativistic diffusion model for hadron production in p-Pb collisions at the LHC Philipp Schulz et.al. 2409.17960 null
2024-09-25 DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion Yukun Huang et.al. 2409.17145 link
2024-09-25 Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model Xinfeng Wei et.al. 2409.17104 null
2024-09-25 Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors Aiping Zhang et.al. 2409.17058 link
2024-09-25 ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis Fangshuo Zhou et.al. 2409.17049 link
2024-09-25 Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion Vineet Punyamoorty et.al. 2409.16950 null
2024-09-25 DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling Kyuheon Jung et.al. 2409.16949 link
2024-09-25 Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model Hongliang Zhong et.al. 2409.16938 link
2024-09-25 A Versatile and Differentiable Hand-Object Interaction Representation Théo Morales et.al. 2409.16855 null
2024-09-25 Analytical assessment of workers' safety concerning direct and indirect ways of getting infected by dangerous pathogen Krzysztof Domino et.al. 2409.16809 null
2024-09-25 Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model Shoma Iwai et.al. 2409.16689 null
2024-09-24 Generative Factor Chaining: Coordinated Manipulation with Diffusion-based Factor Graph Utkarsh A. Mishra et.al. 2409.16275 null
2024-09-24 MaskBit: Embedding-free Image Generation via Bit Tokens Mark Weber et.al. 2409.16211 link
2024-09-24 MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling Yifang Men et.al. 2409.16160 null
2024-09-24 Spreading dynamics of a Fisher-KPP nonlocal diffusion model with a free boundary Lei Li et.al. 2409.16101 null
2024-09-24 PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation Mingyo Seo et.al. 2409.16012 null
2024-09-24 Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification Leire Benito-Del-Valle et.al. 2409.16002 link
2024-09-24 ASD-Diffusion: Anomalous Sound Detection with Diffusion Models Fengrun Zhang et.al. 2409.15957 null
2024-09-24 Multiscale method for image denoising using nonlinear diffusion process: local denoising and spectral multiscale basis functions Maria Vasilyeva et.al. 2409.15952 null
2024-09-24 Identifying early tumour states in a Cahn-Hilliard-reaction-diffusion model Abramo Agosti et.al. 2409.15925 null
2024-09-24 Diffusion Models for Intelligent Transportation Systems: A Survey Mingxing Peng et.al. 2409.15816 null
2024-09-18 Massively Multi-Person 3D Human Motion Forecasting with Scene Context Felix B Mueller et.al. 2409.12189 link
2024-09-18 MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion Kalakonda Sai Shashank et.al. 2409.12140 null
2024-09-18 Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance Jaehoon Joo et.al. 2409.12099 null
2024-09-18 Denoising diffusion models for high-resolution microscopy image restoration Pamela Osuna-Vargas et.al. 2409.12078 null
2024-09-18 LEMON: Localized Editing with Mesh Optimization and Neural Shaders Furkan Mert Algan et.al. 2409.12024 null
2024-09-18 Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models Lorenzo Mandelli et.al. 2409.11920 null
2024-09-18 DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech Xin Qi et.al. 2409.11835 null
2024-09-18 RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets Jikai Ye et.al. 2409.11831 null
2024-09-18 InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models Yan Zheng et.al. 2409.11734 null
2024-09-18 GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation Shuowen Liang et.al. 2409.11689 link
2024-09-17 Ultrasound Image Enhancement with the Variance of Diffusion Models Yuxin Zhang et.al. 2409.11380 link
2024-09-17 OSV: One Step is Enough for High-Quality Image to Video Generation Xiaofeng Mao et.al. 2409.11367 null
2024-09-17 Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Gonzalo Martin Garcia et.al. 2409.11355 link
2024-09-17 OmniGen: Unified Image Generation Shitao Xiao et.al. 2409.11340 link
2024-09-17 fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction Jianxiong Gao et.al. 2409.11315 null
2024-09-17 DroneDiffusion: Robust Quadrotor Dynamics Learning with Diffusion Models Avirup Das et.al. 2409.11292 null
2024-09-17 Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models Tianqi Chen et.al. 2409.11219 null
2024-09-17 High-Resolution Speech Restoration with Latent Diffusion Model Tushar Dhyani et.al. 2409.11145 null
2024-09-17 In-situ measurements of light diffusion in an optically dense atomic ensemble Antoine Glicenstein et.al. 2409.11117 null
2024-09-17 TacDiffusion: Force-domain Diffusion Policy for Precise Tactile Manipulation Yansong Wu et.al. 2409.11047 null
2024-09-16 Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation Noah Buchanan et.al. 2409.10494 null
2024-09-16 SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing Qi Qian et.al. 2409.10476 null
2024-09-16 MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion Lehong Wu et.al. 2409.10473 null
2024-09-16 Mamba-ST: State Space Model for Efficient Style Transfer Filippo Botti et.al. 2409.10385 link
2024-09-16 Taming Diffusion Models for Image Restoration: A Review Ziwei Luo et.al. 2409.10353 null
2024-09-16 Fairness, not Emotion, Drives Socioeconomic Decision Making Rudra Mukhopadhyay et.al. 2409.10322 null
2024-09-16 DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis Fa-Ting Hong et.al. 2409.10281 null
2024-09-16 RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models Başak Melis Öcal et.al. 2409.10180 null
2024-09-16 PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion Peng Li et.al. 2409.10141 null
2024-09-16 DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection Kun Fang et.al. 2409.10094 null
2024-09-13 Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation Qingwen Bu et.al. 2409.09016 link
2024-09-13 A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis Yohan Poirier-Ginter et.al. 2409.08947 null
2024-09-13 Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation Guojun Liang et.al. 2409.08917 link
2024-09-13 Gaussian is All You Need: A Unified Framework for Solving Inverse Problems via Diffusion Posterior Sampling Nebiyou Yismaw et.al. 2409.08906 null
2024-09-13 Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control Carles Domingo-Enrich et.al. 2409.08861 null
2024-09-13 InstantDrag: Improving Interactivity in Drag-based Image Editing Joonghyuk Shin et.al. 2409.08857 null
2024-09-13 DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s) Yun Su Jeong et.al. 2409.08850 null
2024-09-13 DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset Jiawei Du et.al. 2409.08731 link
2024-09-13 STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment Yong Ren et.al. 2409.08601 null
2024-09-13 LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling Yubo Huang et.al. 2409.08583 null
2024-09-12 DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors Thomas Hanwen Zhu et.al. 2409.08278 null
2024-09-12 DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer Runjia Li et.al. 2409.08271 null
2024-09-12 Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation Samanta Rodriguez et.al. 2409.08269 null
2024-09-12 Improving Text-guided Object Inpainting with Semantic Pre-inpainting Yifu Chen et.al. 2409.08260 link
2024-09-12 Improving Virtual Try-On with Garment-focused Diffusion Models Siqi Wan et.al. 2409.08258 link
2024-09-12 LoRID: Low-Rank Iterative Diffusion for Adversarial Purification Geigh Zollicoffer et.al. 2409.08255 null
2024-09-12 Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding Hongyu Li et.al. 2409.08251 null
2024-09-12 IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation Yinwei Wu et.al. 2409.08240 null
2024-09-12 LT3SD: Latent Trees for 3D Scene Diffusion Quan Meng et.al. 2409.08215 null
2024-09-12 VI3DRM: Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis Hao Chen et.al. 2409.08207 null
2024-09-11 DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation Haibo Yang et.al. 2409.07454 null
2024-09-11 Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Haibo Yang et.al. 2409.07452 link
2024-09-11 FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process Yang Luo et.al. 2409.07451 null
2024-09-11 Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging Yunzhen Wang et.al. 2409.07417 null
2024-09-11 Training-Free Guidance for Discrete Diffusion Models for Molecular Generation Thomas J. Kerby et.al. 2409.07359 null
2024-09-11 Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching Eugenio Chisari et.al. 2409.07343 null
2024-09-11 Efficient and Unbiased Sampling of Boltzmann Distributions via Consistency Models Fengzhe Zhang et.al. 2409.07323 null
2024-09-11 Exploring User-level Gradient Inversion with a Diffusion Prior Zhuohang Li et.al. 2409.07291 null
2024-09-11 CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals Weixiang Gao et.al. 2409.07271 link
2024-09-11 Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models Sanoojan Baliah et.al. 2409.07269 link
2024-09-10 SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation Teng Hu et.al. 2409.06633 null
2024-09-10 Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models Xin Jing et.al. 2409.06451 null
2024-09-10 Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition Junzheng Zhang et.al. 2409.06371 null
2024-09-10 What happens to diffusion model likelihood when your model is conditional? Mattias Cross et.al. 2409.06364 null
2024-09-10 DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement Jia-Wei Liao et.al. 2409.06355 null
2024-09-10 Multi-Source Music Generation with Latent Diffusion Zhongweiyang Xu et.al. 2409.06190 link
2024-09-11 MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control Yining Yao et.al. 2409.06189 null
2024-09-10 EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation Nischal Khanal et.al. 2409.06183 link
2024-09-09 Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer Michele Mancusi et.al. 2409.06096 null
2024-09-09 SVS-GAN: Leveraging GANs for Semantic Video Synthesis Khaled M. Seyam et.al. 2409.06074 null
2024-09-09 Enhancing Preference-based Linear Bandits via Human Response Time Shen Li et.al. 2409.05798 null
2024-09-09 Vector Quantized Diffusion Model Based Speech Bandwidth Extension Yuan Fang et.al. 2409.05784 null
2024-09-09 AS-Speech: Adaptive Style For Speech Synthesis Zhipeng Li et.al. 2409.05730 null
2024-09-09 pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning Jiahao Lai et.al. 2409.05701 null
2024-09-09 Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models Aakash Sen Sharma et.al. 2409.05668 null
2024-09-09 Forward KL Regularized Preference Optimization for Aligning Diffusion Policies Zhao Shan et.al. 2409.05622 null
2024-09-09 CipherDM: Secure Three-Party Inference for Diffusion Model Sampling Xin Zhao et.al. 2409.05414 null
2024-09-09 Sequential Posterior Sampling with Diffusion Models Tristan S. W. Stevens et.al. 2409.05399 null
2024-09-09 TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors Yichuan Mo et.al. 2409.05294 link
2024-09-08 Nuclear transparencies with a two step process of the $A(e,e'π^+)$ reactions Tae Keun Choi et.al. 2409.05129 null
2024-09-06 VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation Yecheng Wu et.al. 2409.04429 link
2024-09-06 Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques Davide Clode da Silva et.al. 2409.04424 null
2024-09-06 How Fair is Your Diffusion Recommender Model? Daniele Malitesta et.al. 2409.04339 null
2024-09-06 Random effects estimation in a fractional diffusion model based on continuous observations Nesrine Chebli et.al. 2409.04331 null
2024-09-06 Breaking the Brownian Barrier: Models and Manifestations of Molecular Diffusion in Complex Fluids Harish Srinivasan et.al. 2409.04199 null
2024-09-06 GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers Lorenza Prospero et.al. 2409.04196 null
2024-09-06 D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection Kentaro Hirahara et.al. 2409.04060 null
2024-09-06 One-Shot Diffusion Mimicker for Handwritten Text Generation Gang Dai et.al. 2409.04004 link
2024-09-06 DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes Jianbiao Mei et.al. 2409.04003 link
2024-09-05 Data-Efficient Generation for Dataset Distillation Zhe Li et.al. 2409.03929 null
2024-09-05 Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding Yunze Man et.al. 2409.03757 link
2024-09-05 ArtiFade: Learning to Generate High-quality Subject from Blemished Images Shuya Yang et.al. 2409.03745 null
2024-09-05 RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images Benzhi Wang et.al. 2409.03644 link
2024-09-05 DiffEVC: Any-to-Any Emotion Voice Conversion with Expressive Guidance Hsing-Hang Chou et.al. 2409.03636 null
2024-09-05 TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces Bernardo Biesseck et.al. 2409.03600 link
2024-09-05 DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture Qianlong Xiang et.al. 2409.03550 null
2024-09-05 Blended Latent Diffusion under Attention Control for Real-World Video Editing Deyin Liu et.al. 2409.03514 null
2024-09-05 Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration Pei Wang et.al. 2409.03455 null
2024-09-05 Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning Huaxi Huang et.al. 2409.03326 null
2024-09-05 SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model Weipeng Tan et.al. 2409.03270 null
2024-09-04 HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts Xinyu Liu et.al. 2409.02919 link
2024-09-04 Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling Kaiwen Zheng et.al. 2409.02908 null
2024-09-04 Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models Zhibin Liu et.al. 2409.02851 link
2024-09-04 Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model Tornike Karchkhadze et.al. 2409.02845 null
2024-09-04 Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects Kyungmin Jo et.al. 2409.02653 null
2024-09-04 MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos Junyi Ma et.al. 2409.02638 null
2024-09-04 Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Jianwen Jiang et.al. 2409.02634 null
2024-09-04 Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models Pujing Yang et.al. 2409.02597 null
2024-09-04 Solving Video Inverse Problems Using Image Diffusion Models Taesung Kwon et.al. 2409.02574 null
2024-09-04 StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models Wen Li et.al. 2409.02543 link
2024-08-30 Subspace Diffusion Posterior Sampling for Travel-Time Tomography Xiang Cao et.al. 2408.17333 null
2024-09-02 RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance Avideep Mukherjee et.al. 2408.17095 null
2024-09-02 Instant Adversarial Purification with Adversarial Consistency Distillation Chun Tong Lei et.al. 2408.17064 null
2024-08-30 Text-to-Image Generation Via Energy-Based CLIP Roy Ganz et.al. 2408.17046 null
2024-08-30 Contrastive Learning with Synthetic Positives Dewen Zeng et.al. 2408.16965 link
2024-09-02 Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis Theodoros Kouzelis et.al. 2408.16845 null
2024-08-29 ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model Fangfu Liu et.al. 2408.16767 null
2024-09-04 CSGO: Content-Style Composition in Text-to-Image Generation Peng Xing et.al. 2408.16766 null
2024-08-29 DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving Yongjie Fu et.al. 2408.16647 null
2024-09-02 RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model Zhuan Shi et.al. 2408.16634 null
2024-08-29 A Score-based Generative Solver for PDE-constrained Inverse Problems with Complex Priors Yankun Hong et.al. 2408.16626 null
2024-08-29 GRPose: Learning Graph Relations for Human Image Generation with Pose Priors Xiangchen Yin et.al. 2408.16540 link
2024-08-29 Spiking Diffusion Models Jiahang Cao et.al. 2408.16467 link
2024-08-29 What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer Chaeyeon Chung et.al. 2408.16450 link
2024-08-29 COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation Jiefeng Li et.al. 2408.16426 null
2024-08-29 Self-Improving Diffusion Models with Synthetic Data Sina Alemohammad et.al. 2408.16333 null
2024-08-28 TEDRA: Text-based Editing of Dynamic and Photoreal Actors Basavaraj Sunagad et.al. 2408.15995 null
2024-08-28 Distribution Backtracking Builds A Faster Convergence Trajectory for One-step Diffusion Distillation Shengyuan Zhang et.al. 2408.15991 link
2024-08-28 Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones Carlos Plou et.al. 2408.15899 null
2024-08-28 Airfoil Diffusion: Denoising Diffusion Model For Conditional Airfoil Generation Reid Graves et.al. 2408.15898 link
2024-08-28 Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data Ayodeji Ijishakin et.al. 2408.15890 null
2024-08-28 GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model Yongjie Fu et.al. 2408.15868 null
2024-08-28 Defending Text-to-image Diffusion Models: Surprising Efficacy of Textual Perturbations Against Backdoor Attacks Oscar Chew et.al. 2408.15721 null
2024-08-28 Synthetic Forehead-creases Biometric Generation for Reliable User Verification Abhishek Tandon et.al. 2408.15693 link
2024-08-28 Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas Fabio Quattrini et.al. 2408.15660 link
2024-08-28 Grand canonical generative diffusion model for crystalline phases and grain boundaries Bo Lei et.al. 2408.15601 null
2024-08-27 GenRec: Unifying Video Generation and Recognition with Diffusion Models Zejia Weng et.al. 2408.15241 link
2024-08-27 Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation Xiaojuan Wang et.al. 2408.15239 null
2024-08-27 Simulation of Stochastic Discrete Dislocation Dynamics in Ductile Vs Brittle Materials Santosh Chhetri et.al. 2408.15157 null
2024-08-27 DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays Yiran Sun et.al. 2408.15118 link
2024-08-27 Constrained Diffusion Models via Dual Training Shervin Khalafi et.al. 2408.15094 null
2024-08-27 LN-Gen: Rectal Lymph Nodes Generation via Anatomical Features Weidong Guo et.al. 2408.14977 null
2024-08-27 MegActor-$Σ$: Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion Transformer Shurong Yang et.al. 2408.14975 null
2024-08-27 MeshUp: Multi-Target Mesh Deformation via Blended Score Distillation Hyunwoo Kim et.al. 2408.14899 null
2024-08-27 DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose Yusuke Yoshiyasu et.al. 2408.14860 null
2024-08-27 Diffusion-Occ: 3D Point Cloud Completion via Occupancy Diffusion Guoqing Zhang et.al. 2408.14846 null
2024-08-27 Foundation Models for Music: A Survey Yinghao Ma et.al. 2408.14340 link
2024-08-26 TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation Anh-Dzung Doan et.al. 2408.14227 link
2024-08-26 MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement Xu He et.al. 2408.14211 null
2024-08-27 SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher Trung Dao et.al. 2408.14176 link
2024-08-26 Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models Chaohua Shi et.al. 2408.14135 null
2024-08-26 SurGen: Text-Guided Diffusion Model for Surgical Video Generation Joseph Cho et.al. 2408.14028 null
2024-08-26 Pixel-Aligned Multi-View Generation with Depth Guided Decoder Zhenggang Tang et.al. 2408.14016 null
2024-08-25 SimpleSpeech 2: Towards Simple and Efficient Text-to-Speech with Flow-based Scalar Latent Transformer Diffusion Models Dongchao Yang et.al. 2408.13893 null
2024-08-25 Particle-Filtering-based Latent Diffusion for Inverse Problems Amir Nazemi et.al. 2408.13868 null
2024-08-25 Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching Minghao Liu et.al. 2408.13858 null
2024-08-23 How Diffusion Models Learn to Factorize and Compose Qiyao Liang et.al. 2408.13256 null
2024-08-23 CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities Tao Wu et.al. 2408.13239 link
2024-08-23 Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning Jihwan Oh et.al. 2408.13092 null
2024-08-23 General Intelligent Imaging and Uncertainty Quantification by Deterministic Diffusion Model Weiru Fan et.al. 2408.13061 null
2024-08-23 Atlas Gaussians Diffusion for 3D Generation with Infinite Number of Points Haitao Yang et.al. 2408.13055 null
2024-08-23 Adaptive complexity of log-concave sampling Huanjian Zhou et.al. 2408.13045 null
2024-08-23 EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation Cong Wang et.al. 2408.13005 null
2024-08-23 Controllable Financial Market Generation with Diffusion Guided Meta Agent Yu-Hao Huang et.al. 2408.12991 null
2024-08-23 When Diffusion MRI Meets Diffusion Model: A Novel Deep Generative Model for Diffusion MRI Generation Xi Zhu et.al. 2408.12897 null
2024-08-22 Generating Realistic X-ray Scattering Images Using Stable Diffusion and Human-in-the-loop Annotations Zhuowen Zhao et.al. 2408.12720 link
2024-08-22 xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations Can Qin et.al. 2408.12590 null
2024-08-22 ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation Lujia Zhong et.al. 2408.12561 link
2024-08-22 Show-o: One Single Transformer to Unify Multimodal Understanding and Generation Jinheng Xie et.al. 2408.12528 null
2024-08-22 FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing Jue Wang et.al. 2408.12429 link
2024-08-22 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment Kaihui Cheng et.al. 2408.12419 null
2024-08-22 CODE: Confident Ordinary Differential Editing Bastien van Delft et.al. 2408.12418 link
2024-08-22 Dynamic PDB: A New Dataset and a SE(3) Model Extension by Integrating Dynamic Behaviors and Physical Properties in Protein Structures Ce Liu et.al. 2408.12413 null
2024-08-22 LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation Shihao Chen et.al. 2408.12354 null
2024-08-23 GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections Shiyue Zhang et.al. 2408.12352 null
2024-08-22 Variance reduction of diffusion model's gradients with Taylor approximation-based control variate Paul Jeha et.al. 2408.12270 null
2024-08-21 Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models Chun-Yen Shih et.al. 2408.11810 null
2024-08-21 Timeline and Boundary Guided Diffusion Network for Video Shadow Detection Haipeng Zhou et.al. 2408.11785 link
2024-08-21 JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet Yujia Gu et.al. 2408.11744 null
2024-08-21 Iterative Object Count Optimization for Text-to-image Diffusion Models Oz Zafar et.al. 2408.11721 null
2024-08-21 FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting Liyao Jiang et.al. 2408.11706 null
2024-08-21 Moderate deviation principles for a reaction diffusion model in non-equilibrium Linjie Zhao et.al. 2408.11633 null
2024-08-21 Bayesian inversion for the identification of the doping profile in unipolar semiconductor devices Leila Taghizadeh et.al. 2408.11485 null
2024-08-21 Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection Jingwei Sun et.al. 2408.11408 null
2024-08-21 Video Diffusion Models are Strong Video Inpainter Minhyeok Lee et.al. 2408.11402 null
2024-08-21 Generative AI based Secure Wireless Sensing for ISAC Networks Jiacheng Wang et.al. 2408.11398 null
2024-08-20 Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Chunting Zhou et.al. 2408.11039 null
2024-08-20 MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning Haoning Wu et.al. 2408.11001 link
2024-08-20 GreediRIS: Scalable Influence Maximization using Distributed Streaming Maximum Cover Reet Barik et.al. 2408.10982 null
2024-08-20 Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling Jaideep Pathak et.al. 2408.10958 null
2024-08-20 Large Point-to-Gaussian Model for Image-to-3D Generation Longfei Lu et.al. 2408.10935 null
2024-08-20 A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse Zhongliang Guo et.al. 2408.10901 null
2024-08-20 Hedging in Jump Diffusion Model with Transaction Costs Hamidreza Maleki Almani et.al. 2408.10785 null
2024-08-20 Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation Md Fahim Sikder et.al. 2408.10755 null
2024-08-20 Iterative Window Mean Filter: Thwarting Diffusion-based Adversarial Purification Hanrui Wang et.al. 2408.10673 null
2024-08-20 TextMastero: Mastering High-Quality Scene Text Editing in Diverse Languages and Styles Tong Wang et.al. 2408.10623 null
2024-08-19 MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model Minghua Liu et.al. 2408.10198 null
2024-08-19 SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views Chao Xu et.al. 2408.10195 null
2024-08-19 Multi-layer diffusion model of photovoltaic installations Tomasz Weron et.al. 2408.09904 null
2024-08-19 Instruction-Based Molecular Graph Generation with Unified Text-Graph Diffusion Model Yuran Xiang et.al. 2408.09896 link
2024-08-19 SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models Danush Kumar Venkatesh et.al. 2408.09822 link
2024-08-19 Latent Diffusion for Guided Document Table Generation Syed Jawwad Haider Hamdani et.al. 2408.09800 null
2024-08-19 Unsupervised Composable Representations for Audio Giovanni Bindi et.al. 2408.09792 link
2024-08-19 Propagating the prior from shallow to deep with a pre-trained velocity-model Generative Transformer network Randy Harsuko et.al. 2408.09767 null
2024-08-19 Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering Ruofan Liang et.al. 2408.09702 null
2024-08-19 ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement Eashan Adhikarla et.al. 2408.09650 link
2024-08-16 PFDiff: Training-free Acceleration of Diffusion Models through the Gradient Guidance of Past and Future Guangyi Wang et.al. 2408.08822 null
2024-08-16 Comparative Analysis of Generative Models: Enhancing Image Synthesis with VAEs, GANs, and Stable Diffusion Sanchayan Vivekananthan et.al. 2408.08751 null
2024-08-16 An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation Peiming Guo et.al. 2408.08650 null
2024-08-16 Modeling the Neonatal Brain Development Using Implicit Neural Representations Florentin Bieder et.al. 2408.08647 link
2024-08-16 Sampling effects on Lasso estimation of drift functions in high-dimensional diffusion processes Chiara Amorino et.al. 2408.08638 null
2024-08-16 Generative Dataset Distillation Based on Diffusion Model Duo Su et.al. 2408.08610 link
2024-08-16 RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction Xiucheng Wang et.al. 2408.08593 link
2024-08-16 A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth Yujia Gu et.al. 2408.08561 null
2024-08-16 Linear combinations of latents in diffusion models: interpolation and beyond Erik Bodin et.al. 2408.08558 null
2024-08-16 Inverse design with conditional cascaded diffusion models Milad Habibi et.al. 2408.08526 null
2024-08-15 Accelerated Image-Aware Generative Diffusion Modeling Tanmay Asthana et.al. 2408.08306 null
2024-08-15 Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding Xiner Li et.al. 2408.08252 link
2024-08-15 Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion Adi Haviv et.al. 2408.08184 null
2024-08-15 Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation Seon-Hoon Kim et.al. 2408.07947 link
2024-08-14 Moderator: Moderating Text-to-Image Diffusion Models through Fine-grained Context-based Policies Peiran Wang et.al. 2408.07728 link
2024-08-14 Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding Bing Hu et.al. 2408.07636 null
2024-08-14 Anisotropic Diffusion Model of Communication in 2D Biofilm Yanahan Paramalingam et.al. 2408.07626 null
2024-08-14 DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model Erez Yosef et.al. 2408.07541 null
2024-08-14 DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency Xiaojing Zhong et.al. 2408.07481 null
2024-08-14 One Step Diffusion-based Super-Resolution with Time-Aware Distillation Xiao He et.al. 2408.07476 link
2024-08-14 Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models Jean-Marie Lemercier et.al. 2408.07472 null
2024-08-14 KIND: Knowledge Integration and Diversion in Diffusion Models Yucheng Xie et.al. 2408.07337 null
2024-08-14 GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models Lei Kang et.al. 2408.07259 link
2024-08-13 Representation-space diffusion models for generating periodic materials Anshuman Sinha et.al. 2408.07213 null
2024-08-13 SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis Yuchen Mao et.al. 2408.07196 null
2024-08-13 Imagen 3 Imagen-Team-Google et.al. 2408.07009 null
2024-08-13 Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models Cheng Chen et.al. 2408.06995 null
2024-08-13 DCMSA: Multi-Head Self-Attention Mechanism Based on Deformable Convolution For Seismic Data Denoising Wang Mingwei et.al. 2408.06963 null
2024-08-13 Diffusion Model for Slate Recommendation Federico Tomasi et.al. 2408.06883 null
2024-08-13 DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion Yujia Wu et.al. 2408.06740 null
2024-08-13 DiffSG: A Generative Solver for Network Optimization with Diffusion Model Ruihuai Liang et.al. 2408.06701 link
2024-08-13 DC3DO: Diffusion Classifier for 3D Objects Nursena Koprucu et.al. 2408.06693 link
2024-08-13 Leveraging Priors via Diffusion Bridge for Time Series Generation Jinseong Park et.al. 2408.06672 null
2024-08-13 Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models Chenqian Yan et.al. 2408.06646 null
2024-08-13 ViMo: Generating Motions from Casual Videos Liangdong Qiu et.al. 2408.06614 null
2024-08-12 The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Chris Lu et.al. 2408.06292 link
2024-08-12 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) Jaydeep Rade et.al. 2408.06244 null
2024-08-12 Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance Taewon Kang et.al. 2408.06157 null
2024-08-12 Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models Ioannis Romanelis et.al. 2408.06145 link
2024-08-12 CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer Zhuoyi Yang et.al. 2408.06072 link
2024-08-12 ControlNeXt: Powerful and Efficient Control for Image and Video Generation Bohao Peng et.al. 2408.06070 link
2024-08-12 BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training Xuanpu Zhang et.al. 2408.06047 link
2024-08-12 Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models Haifan Gong et.al. 2408.05985 null
2024-08-12 UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization Junjie He et.al. 2408.05939 null
2024-08-12 Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation Utkarsh Nath et.al. 2408.05938 null
2024-08-09 Multi-Garment Customized Model Generation Yichen Liu et.al. 2408.05206 null
2024-08-09 DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow Hangyu Li et.al. 2408.05008 null
2024-08-09 TEAdapter: Supply abundant guidance for controllable text-to-music generation Jialing Zou et.al. 2408.04865 link
2024-08-09 Adversarially Robust Industrial Anomaly Detection Through Diffusion Model Yuanpu Cao et.al. 2408.04839 null
2024-08-09 Next-Generation Wi-Fi Networks with Generative AI: Design and Insights Jingyu Wang et.al. 2408.04835 null
2024-08-08 BRAT: Bonus oRthogonAl Token for Architecture Agnostic Textual Inversion James Baker et.al. 2408.04785 link
2024-08-08 Zero-Shot Uncertainty Quantification using Diffusion Probabilistic Models Dule Shu et.al. 2408.04718 null
2024-08-08 Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics Ruining Li et.al. 2408.04631 null
2024-08-08 Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches Yongzhi Xu et.al. 2408.04567 null
2024-08-08 Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations Julen Urain et.al. 2408.04380 null
2024-08-08 InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting Xin-Yi Yu et.al. 2408.04249 null
2024-08-08 LLDif: Diffusion Models for Low-light Emotion Recognition Zhifeng Wang et.al. 2408.04235 null
2024-08-08 Connective Viewpoints of Signal-to-Noise Diffusion Models Khanh Doan et.al. 2408.04221 null
2024-08-08 Diffusion Guided Language Modeling Justin Lovelace et.al. 2408.04220 link
2024-08-07 Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model Guoqing Zhu et.al. 2408.03748 link
2024-08-07 Unsupervised Detection of Fetal Brain Anomalies using Denoising Diffusion Models Markus Ditlev Sjøgren Olsen et.al. 2408.03654 null
2024-08-07 TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization Kien T. Pham et.al. 2408.03637 null
2024-08-07 Dirichlet forms of diffusion processes on Thoma simplex Sergei Korotkikh et.al. 2408.03553 null
2024-08-06 Hybrid diffusion models: combining supervised and generative pretraining for label-efficient fine-tuning of segmentation models Bruno Sauvalle et.al. 2408.03433 null
2024-08-06 Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey Vu Tuan Truong et.al. 2408.03400 null
2024-08-06 Adversarial Domain Adaptation for Cross-user Activity Recognition Using Diffusion-based Noise-centred Learning Xiaozhou Ye et.al. 2408.03353 link
2024-08-06 MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation Xiaofeng Mao et.al. 2408.03312 null
2024-08-06 IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts Ciara Rowles et.al. 2408.03209 null
2024-08-06 Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models Sho Ozaki et.al. 2408.03156 null
2024-08-06 Training-Free Condition Video Diffusion Models for single frame Spatial-Semantic Echocardiogram Synthesis Van Phi Nguyen et.al. 2408.03035 link
2024-08-06 Diffusion Model Meets Non-Exemplar Class-Incremental Learning and Beyond Jichuan Zhang et.al. 2408.02983 null
2024-08-06 Data-Driven Stochastic Closure Modeling via Conditional Diffusion Model and Neural Operator Xinghao Dong et.al. 2408.02965 null
2024-08-06 Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection Sen Nie et.al. 2408.02891 null
2024-08-05 Back-Projection Diffusion: Solving the Wideband Inverse Scattering Problem with Diffusion Models Borong Zhang et.al. 2408.02866 null
2024-08-05 Text Conditioned Symbolic Drumbeat Generation using Latent Diffusion Models Pushkar Jajoria et.al. 2408.02711 null
2024-08-05 LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba Yunxiang Fu et.al. 2408.02615 link
2024-08-05 Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models Tongtong Feng et.al. 2408.02408 null
2024-08-05 A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models Gen Li et.al. 2408.02320 null
2024-08-05 Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders Muhammad Abdullah Jamal et.al. 2408.02245 null
2024-08-04 LDFaceNet: Latent Diffusion-based Network for High-Fidelity Deepfake Generation Dwij Mehta et.al. 2408.02078 null
2024-08-04 Step Saver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation Jean Yu et.al. 2408.02054 null
2024-08-04 Robustness of Watermarking on Text-to-Image Diffusion Models Xiaodong Wu et.al. 2408.02035 null
2024-08-04 Faster Diffusion Action Segmentation Shuaibing Wang et.al. 2408.02024 null
2024-08-04 AnomalySD: Few-Shot Multi-Class Anomaly Detection with Stable Diffusion Model Zhenyu Yan et.al. 2408.01960 null
2024-08-04 Dataset Scale and Societal Consistency Mediate Facial Impression Bias in Vision-Language AI Robert Wolfe et.al. 2408.01959 null
2024-08-02 Conditional LoRA Parameter Generation Xiaolong Jin et.al. 2408.01415 null
2024-08-02 TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling Dong Huo et.al. 2408.01291 null
2024-08-02 A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness Lutao Jiang et.al. 2408.01269 null
2024-08-02 CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models Kushal Kumar Jain et.al. 2408.01233 null
2024-08-02 EIUP: A Training-Free Approach to Erase Non-Compliant Concepts Conditioned on Implicit Unsafe Prompts Die Chen et.al. 2408.01014 null
2024-08-02 FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation Xiang Gao et.al. 2408.00998 link
2024-08-05 CIResDiff: A Clinically-Informed Residual Diffusion Model for Predicting Idiopathic Pulmonary Fibrosis Progression Caiwen Jiang et.al. 2408.00938 null
2024-08-01 Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation Yixiao Wang et.al. 2408.00766 null
2024-08-01 Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention Susung Hong et.al. 2408.00760 link
2024-08-01 TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models Gilad Deutch et.al. 2408.00735 null
2024-08-01 MotionFix: Text-Driven 3D Human Motion Editing Nikos Athanasiou et.al. 2408.00712 null
2024-08-01 Evaluation Metrics and Methods for Generative Models in the Wireless PHY Layer Michael Baur et.al. 2408.00634 null
2024-08-01 Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model Felipe Mahlow et.al. 2408.00544 null
2024-08-01 Towards Reliable Advertising Image Generation Using Human Feedback Zhenbang Du et.al. 2408.00418 link
2024-08-01 Deepfake Media Forensics: State of the Art and Challenges Ahead Irene Amerini et.al. 2408.00388 null
2024-08-01 On the Limitations and Prospects of Machine Unlearning for Generative AI Shiji Zhou et.al. 2408.00376 null
2024-08-01 DiM-Gesture: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 framework Fan Zhang et.al. 2408.00370 null
2024-07-31 Detecting, Explaining, and Mitigating Memorization in Diffusion Models Yuxin Wen et.al. 2407.21720 link
2024-07-31 Tora: Trajectory-oriented Diffusion Transformer for Video Generation Zhenghao Zhang et.al. 2407.21705 link
2024-07-31 Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification Xingchen Shi et.al. 2407.21683 null
2024-07-31 Explainable and Controllable Motion Curve Guided Cardiac Ultrasound Video Generation Junxuan Yu et.al. 2407.21490 null
2024-07-31 Fine-gained Zero-shot Video Sampling Dengsheng Chen et.al. 2407.21475 null
2024-07-31 Deformable 3D Shape Diffusion Model Dengsheng Chen et.al. 2407.21428 null
2024-07-31 Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models Jiang Hao et.al. 2407.21316 link
2024-07-31 State-observation augmented diffusion model for nonlinear assimilation Zhuoyuan Li et.al. 2407.21314 link
2024-07-31 DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations Dongwon Son et.al. 2407.21267 null
2024-07-30 Informed Correctors for Discrete Diffusion Models Yixiu Zhao et.al. 2407.21243 null
2024-07-30 Matting by Generation Zhixiang Wang et.al. 2407.21017 null
2024-07-30 Add-SD: Rational Generation without Manual Reference Lingfeng Yang et.al. 2407.21016 link
2024-07-30 Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks Yunfeng Diao et.al. 2407.20836 null
2024-07-30 Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning Norman Di Palo et.al. 2407.20798 null
2024-07-30 SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models Zheng Liu et.al. 2407.20756 link
2024-07-30 EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos Aashish Rai et.al. 2407.20592 null
2024-07-30 DiffusionCounterfactuals: Inferring High-dimensional Counterfactuals with Guidance of Causal Representations Jiageng Zhu et.al. 2407.20553 null
2024-07-29 Learning Feature-Preserving Portrait Editing from Generated Pairs Bowei Chen et.al. 2407.20455 null
2024-07-29 Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities Lorenzo Baraldi et.al. 2407.20337 link
2024-07-29 Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception Konstantinos Tzevelekakis et.al. 2407.20336 null
2024-07-29 Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing Ekaterina Iakovleva et.al. 2407.20232 null
2024-07-29 LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework Zhenqi He et.al. 2407.20172 link
2024-07-29 Diffusion Feedback Helps CLIP See Better Wenxuan Wang et.al. 2407.20171 link
2024-07-29 DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models Jing Yang et.al. 2407.20141 null
2024-07-29 Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning Liyuan Mao et.al. 2407.20109 null
2024-07-29 Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations Fangyijie Wang et.al. 2407.20072 link
2024-07-29 ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning Delyan Boychev et.al. 2407.20020 link
2024-07-29 MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion Chencan Fu et.al. 2407.19976 null
2024-07-29 FedDEO: Description-Enhanced One-Shot Federated Learning with Diffusion Models Mingzhao Yang et.al. 2407.19953 null
2024-07-29 FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention Yu Lu et.al. 2407.19918 null
2024-07-26 Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment Yuze Zheng et.al. 2407.18854 null
2024-07-26 Revision of calcium and scandium abundances in Am stars based on NLTE calculations and comparison with diffusion stellar evolution models L. I. Mashonkina et.al. 2407.18736 null
2024-07-26 Adversarial Robustification via Text-to-Image Diffusion Models Daewon Choi et.al. 2407.18658 link
2024-07-26 How To Segment in 3D Using 2D Models: Automated 3D Segmentation of Prostate Cancer Metastatic Lesions on PET Volumes Using Multi-Angle Maximum Intensity Projections and Diffusion Models Amirhosein Toosi et.al. 2407.18555 link
2024-07-26 Answerability Fields: Answerable Location Estimation via Diffusion Models Daichi Azuma et.al. 2407.18497 null
2024-07-26 Diffusion-Driven Semantic Communication for Generative Models with Bandwidth Constraints Lei Guo et.al. 2407.18468 null
2024-07-26 Lensless fiber endomicroscopic phase imaging with speckle-conditioned diffusion model Zhaoqing Chen et.al. 2407.18456 null
2024-07-25 Diffusion-based subsurface multiphysics monitoring and forecasting Xinquan Huang et.al. 2407.18426 null
2024-07-25 RegionDrag: Fast Region-Based Image Editing with Diffusion Models Jingyi Lu et.al. 2407.18247 null
2024-07-25 VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads Orest Kupyn et.al. 2407.18245 link
2024-07-25 Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images Roberto Di Via et.al. 2407.18125 null
2024-07-25 Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions Jan Nikolas Morshuis et.al. 2407.18026 link
2024-07-25 Self-Supervision Improves Diffusion Models for Tabular Data Imputation Yixin Liu et.al. 2407.18013 link
2024-07-25 Lightweight Language-driven Grasp Detection using Conditional Consistency Model Nghia Nguyen et.al. 2407.17967 null
2024-07-25 ReCorD: Reasoning and Correcting Diffusion for HOI Generation Jian-Yu Jiang-Lin et.al. 2407.17911 link
2024-07-25 Amortized Posterior Sampling with Diffusion Prior Distillation Abbas Mammadov et.al. 2407.17907 null
2024-07-25 Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion Xiaodan Xing et.al. 2407.17882 null
2024-07-25 DragText: Rethinking Text Embedding in Point-based Image Editing Gayoon Choi et.al. 2407.17843 link
2024-07-24 SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency Yiming Xie et.al. 2407.17470 null
2024-07-24 CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction Paul Goyes-Peñafiel et.al. 2407.17402 link
2024-07-25 LPGen: Enhancing High-Fidelity Landscape Painting Generation through Diffusion Model Wanggong Yang et.al. 2407.17229 null
2024-07-24 Unpaired Photo-realistic Image Deraining with Energy-informed Diffusion Model Yuanbo Wen et.al. 2407.17193 null
2024-07-24 MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models Chunsan Hong et.al. 2407.17095 link
2024-07-24 Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference Jian Xu et.al. 2407.17033 null
2024-07-24 Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model Lirui Zhao et.al. 2407.16982 link
2024-07-24 SAR to Optical Image Translation with Color Supervised Diffusion Model Xinyu Bai et.al. 2407.16921 null
2024-07-23 VisMin: Visual Minimal-Change Understanding Rabiul Awal et.al. 2407.16772 null
2024-07-23 Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions Fabio Tosi et.al. 2407.16698 link
2024-07-23 From Imitation to Refinement -- Residual RL for Precise Visual Assembly Lars Ankile et.al. 2407.16677 null
2024-07-23 MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence Canyu Zhao et.al. 2407.16655 null
2024-07-23 DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models Zhenyu Xie et.al. 2407.16511 null
2024-07-23 MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection Youngmin Oh et.al. 2407.16448 link
2024-07-23 On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models Deniz Daum et.al. 2407.16405 link
2024-07-23 DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors Zizheng Yan et.al. 2407.16260 null
2024-07-23 OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person Ke Sun et.al. 2407.16224 null
2024-07-23 Diff-Shadow: Global-guided Diffusion Model for Shadow Removal Jinting Luo et.al. 2407.16214 link
2024-07-23 CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation Hajin Shim et.al. 2407.16193 null
2024-07-22 Artist: Aesthetically Controllable Text-Driven Stylization without Training Ruixiang Jiang et.al. 2407.15842 link
2024-07-22 Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget Vikash Sehwag et.al. 2407.15811 link
2024-07-22 Diffusion Model Based Resource Allocation Strategy in Ultra-Reliable Wireless Networked Control Systems Amirhassan Babazadeh Darabi et.al. 2407.15784 null
2024-07-22 A Hamilton-Jacobi approach to road-field reaction-diffusion models Christopher Henderson et.al. 2407.15760 null
2024-07-22 Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond Silvio Galesso et.al. 2407.15739 link
2024-07-22 Estimating Probability Densities with Transformer and Denoising Diffusion Henry W. Leung et.al. 2407.15703 link
2024-07-22 Voltage mapping in subcellular nanodomains using electro-diffusion modeling Frédéric Paquin-Lefebvre et.al. 2407.15697 null
2024-07-23 Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models Xin Ma et.al. 2407.15642 link
2024-07-23 A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control Karim Kadry et.al. 2407.15631 null
2024-07-22 StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation Nauman Riaz et.al. 2407.15608 null
2024-07-19 DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks Sarah Jabbour et.al. 2407.14509 null
2024-07-19 M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models Seunggeun Chi et.al. 2407.14502 null
2024-07-19 Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model Seonghui Min et.al. 2407.14434 null
2024-07-19 Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models Hyun-Jic Oh et.al. 2407.14426 null
2024-07-19 As Generative Models Improve, People Adapt Their Prompts Eaman Jahani et.al. 2407.14333 null
2024-07-19 Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model Kun Zhao et.al. 2407.14326 null
2024-07-19 Time-dependent condensate formation in ultracold atoms with energy-dependent transport coefficients M. Larsson et.al. 2407.14307 null
2024-07-19 How to Blend Concepts in Diffusion Models Giorgio Longari et.al. 2407.14280 link
2024-07-19 The time-space evolution of economic activities: theory and estimation Davide Fiaschi et.al. 2407.14267 null
2024-07-19 Unlearning Concepts from Text-to-Video Diffusion Models Shiqi Liu et.al. 2407.14209 null
2024-07-18 LogoSticker: Inserting Logos into Diffusion Models for Customized Generation Mingkang Zhu et.al. 2407.13752 null
2024-07-18 Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review Masatoshi Uehara et.al. 2407.13734 link
2024-07-18 MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis Ziming Zhong et.al. 2407.13675 link
2024-07-18 Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models Xiaoyu Zhu et.al. 2407.13642 null
2024-07-18 Training-free Composite Scene Generation for Layout-to-Image Synthesis Jiaqi Liu et.al. 2407.13609 link
2024-07-18 EnergyDiff: Universal Time-Series Energy Data Generation using Diffusion Models Nan Lin et.al. 2407.13538 null
2024-07-18 All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models Charumathi Badrinath et.al. 2407.13449 link
2024-07-18 Movement-based models for abundance data Ricardo Carrizo Vergara et.al. 2407.13384 null
2024-07-18 URCDM: Ultra-Resolution Image Synthesis in Histopathology Sarah Cechnicka et.al. 2407.13277 link
2024-07-18 Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models Qiao Li et.al. 2407.13252 null
2024-07-17 SMooDi: Stylized Motion Diffusion Model Lei Zhong et.al. 2407.12783 null
2024-07-17 VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control Sherwin Bahmani et.al. 2407.12781 null
2024-07-17 Hallucination Index: An Image Quality Metric for Generative Reconstruction Models Matthew Tivnan et.al. 2407.12780 null
2024-07-17 GroundUp: Rapid Sketch-Based 3D City Massing Gizem Esra Unlu et.al. 2407.12739 null
2024-07-17 NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model Zhongqun Zhang et.al. 2407.12727 null
2024-07-18 SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow Yuanzhi Zhu et.al. 2407.12718 link
2024-07-17 IMAGDressing-v1: Customizable Virtual Dressing Fei Shen et.al. 2407.12705 link
2024-07-17 4Dynamic: Text-to-4D Generation with Hybrid Priors Yu-Jie Yuan et.al. 2407.12684 null
2024-07-17 Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs Yiqing Shen et.al. 2407.12678 link
2024-07-17 CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems Jiankun Zhao et.al. 2407.12676 link
2024-07-16 Efficient Training with Denoised Neural Weights Yifan Gong et.al. 2407.11966 null
2024-07-16 Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design Leo Klarner et.al. 2407.11942 link
2024-07-16 Diffusion-driven self-assembly of emerin nanodomains at the nuclear envelope Carlos D. Alas et.al. 2407.11758 null
2024-07-16 Mask-guided cross-image attention for zero-shot in-silico histopathologic image generation with a diffusion model Dominik Winter et.al. 2407.11664 null
2024-07-16 CCVA-FL: Cross-Client Variations Adaptive Federated Learning for Medical Imaging Sunny Gupta et.al. 2407.11652 null
2024-07-16 Scaling Diffusion Transformers to 16 Billion Parameters Zhengcong Fei et.al. 2407.11633 link
2024-07-16 DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training Guillermo Jimenez-Perez et.al. 2407.11594 null
2024-07-17 QVD: Post-training Quantization for Video Diffusion Models Shilong Tian et.al. 2407.11585 null
2024-07-17 UP-Diff: Latent Diffusion Model for Remote Sensing Urban Prediction Zeyu Wang et.al. 2407.11578 link
2024-07-16 TGIF: Text-Guided Inpainting Forgery Dataset Hannes Mareen et.al. 2407.11566 link
2024-07-15 Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion Yongyuan Liang et.al. 2407.10973 null
2024-07-15 InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models Nirat Saini et.al. 2407.10958 null
2024-07-16 DataDream: Few-shot Guided Dataset Generation Jae Myung Kim et.al. 2407.10910 link
2024-07-15 Optical Diffusion Models for Image Generation Ilker Oguz et.al. 2407.10897 null
2024-07-15 R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection Zheyuan Zhou et.al. 2407.10862 null
2024-07-15 Physics-Inspired Generative Models in Medical Imaging: A Review Dennis Hein et.al. 2407.10856 null
2024-07-15 Conditional Guided Generative Diffusion for Particle Accelerator Beam Diagnostics Alexander Scheinker et.al. 2407.10693 null
2024-07-15 Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval Youngsun Lim et.al. 2407.10683 null
2024-07-15 Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction Lin Zhu et.al. 2407.10636 null
2024-07-15 WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models Zijian He et.al. 2407.10625 null
2024-07-12 Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees Alexia Jolicoeur-Martineau et.al. 2407.09357 link
2024-07-12 PID: Physics-Informed Diffusion Model for Infrared Image Generation Fangyuan Mao et.al. 2407.09299 link
2024-07-12 Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy Julian Wyatt et.al. 2407.09192 null
2024-07-12 Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control Huayu Chen et.al. 2407.09024 link
2024-07-12 TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models Jeongho Kim et.al. 2407.09012 null
2024-07-12 Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive Training Yunshu Wu et.al. 2407.08946 link
2024-07-12 Bora: Biomedical Generalist Video Generation Model Weixiang Sun et.al. 2407.08944 null
2024-07-12 LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models Hai Jiang et.al. 2407.08939 link
2024-07-12 Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning Chuang Zhang et.al. 2407.08914 null
2024-07-12 AirSketch: Generative Motion to Sketch Hui Xian Grace Lim et.al. 2407.08906 null
2024-07-11 Video Diffusion Alignment via Reward Gradients Mihir Prabhudesai et.al. 2407.08737 link
2024-07-11 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models Zhening Xing et.al. 2407.08701 null
2024-07-11 Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density Shuangqi Li et.al. 2407.08659 null
2024-07-11 Latent Conditional Diffusion-based Data Augmentation for Continuous-Time Dynamic Graph Mode Yuxing Tian et.al. 2407.08500 null
2024-07-11 Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers Zhengbo Zhang et.al. 2407.08394 null
2024-07-11 Wind Power Assessment based on Super-Resolution and Downscaling -- A Comparison of Deep Learning Methods Luca Schmidt et.al. 2407.08259 null
2024-07-11 Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling Noam Elata et.al. 2407.08256 null
2024-07-11 E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors Jinxiu Liang et.al. 2407.08231 null
2024-07-11 Survey on Fundamental Deep Learning 3D Reconstruction Techniques Yonge Bai et.al. 2407.08137 null
2024-07-10 Geospecific View Generation -- Geometry-Context Aware High-resolution Ground View Inference from Satellite Views Ningli Xu et.al. 2407.08061 null
2024-07-10 Generative Image as Action Models Mohit Shridhar et.al. 2407.07875 link
2024-07-10 Dynamical Measure Transport and Neural PDE Solvers for Sampling Jingtong Sun et.al. 2407.07873 null
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860 null
2024-07-10 Generic Numerical Analysis of Stochastic Reaction Diffusion Model with applications in excitable media Yahya Alnashri et.al. 2407.07834 null
2024-07-10 Universal and non-universal signatures in the scaling functions of critical variables Gianluca Teza et.al. 2407.07782 null
2024-07-10 VEnhancer: Generative Space-Time Enhancement for Video Generation Jingwen He et.al. 2407.07667 null
2024-07-11 MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis Wanggui He et.al. 2407.07614 link
2024-07-10 Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field Ganlin Yang et.al. 2407.07461 null
2024-07-10 Secondary Structure-Guided Novel Protein Sequence Generation with Latent Graph Diffusion Yutong Hu et.al. 2407.07443 link
2024-07-10 Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis Jian-Qing Zheng et.al. 2407.07295 link
2024-07-09 ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction Shaozhe Hao et.al. 2407.07077 link
2024-07-09 RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models Bowen Zhang et.al. 2407.06938 null
2024-07-09 HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance Guian Fang et.al. 2407.06937 link
2024-07-09 A reaction-diffusion model for relapsing-remitting multiple sclerosis with a treatment term Romina Travaglini et.al. 2407.06802 null
2024-07-09 Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning Fanyue Wei et.al. 2407.06642 link
2024-07-09 Mobius: An High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task Yiran Yang et.al. 2407.06617 link
2024-07-09 VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving Yibo Liu et.al. 2407.06516 null
2024-07-09 Sketch-Guided Scene Image Generation Tianyu Zhang et.al. 2407.06469 null
2024-07-10 Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation Jianuo Huang et.al. 2407.06317 null
2024-07-08 VIMI: Grounding Video Generation through Multi-modal Instruction Yuwei Fang et.al. 2407.06304 null
2024-07-08 JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation Yu Zeng et.al. 2407.06187 null
2024-07-08 The Tug-of-War Between Deepfake Generation and Detection Hannah Lee et.al. 2407.06174 null
2024-07-08 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Ethan Chern et.al. 2407.06135 link
2024-07-08 Structured Generations: Using Hierarchical Clusters to guide Diffusion Models Jorge da Silva Goncalves et.al. 2407.06124 link
2024-07-08 PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models Jinhua Zhang et.al. 2407.06109 link
2024-07-08 Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation Xinyu Bai et.al. 2407.06095 null
2024-07-08 Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis Emaad Khwaja et.al. 2407.06079 null
2024-07-08 Analysis and finite element approximation of a diffuse interface approach to the Stokes–Biot coupling Francis R. A. Aznaran et.al. 2407.05949 null
2024-07-08 Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling Lintao Zhang et.al. 2407.05875 link
2024-07-08 RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features Inye Na et.al. 2407.05683 link
2024-07-05 Structural Constraint Integration in Generative Model for Discovery of Quantum Material Candidates Ryotaro Okabe et.al. 2407.04557 null
2024-07-05 Unified continuous-time q-learning for mean-field game and mean-field control problems Xiaoli Wei et.al. 2407.04521 null
2024-07-08 Speed-accuracy trade-off for the diffusion models: Wisdom from nonequilibrium thermodynamics and optimal transport Kotaro Ikeda et.al. 2407.04495 null
2024-07-05 PROUD: PaRetO-gUided Diffusion Model for Multi-objective Generation Yinghua Yao et.al. 2407.04493 link
2024-07-05 VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing Shang Liu et.al. 2407.04461 null
2024-07-05 Comparing metallicity correlations in nearby non-AGN and AGN-host galaxies Song-lin Li et.al. 2407.04252 null
2024-07-05 GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction Yuxuan Mu et.al. 2407.04237 null
2024-07-05 T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models Zhongqi Wang et.al. 2407.04215 link
2024-07-05 TimeLDM: Latent Diffusion Model for Unconditional Time Series Generation Jian Qian et.al. 2407.04211 null
2024-07-04 Advances in Diffusion Models for Image Data Augmentation: A Review of Methods, Models, Evaluation Metrics and Future Research Directions Panagiotis Alimisis et.al. 2407.04103 null
2024-07-03 DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents Yilun Xu et.al. 2407.03300 link
2024-07-03 Improved Noise Schedule for Diffusion Training Tiankai Hang et.al. 2407.03297 null
2024-07-04 Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis Tong Zhou et.al. 2407.03089 null
2024-07-03 Electromagnetic Property Sensing Based on Diffusion Model in ISAC System Yuhua Jiang et.al. 2407.03075 null
2024-07-03 Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models Chunmei Xu et.al. 2407.03050 null
2024-07-03 SlerpFace: Face Template Protection via Spherical Linear Interpolation Zhizhou Zhong et.al. 2407.03043 null
2024-07-03 Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation Xiang Gao et.al. 2407.03006 link
2024-07-04 VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors Sungwon Hwang et.al. 2407.02945 link
2024-07-03 Single Image Rolling Shutter Removal with Diffusion Models Zhanglei Yang et.al. 2407.02906 null
2024-07-03 Robot Shape and Location Retention in Video Generation Using Diffusion Models Peng Wang et.al. 2407.02873 link
2024-07-02 Magic Insert: Style-Aware Drag-and-Drop Nataniel Ruiz et.al. 2407.02489 null
2024-07-02 Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models Fei Shen et.al. 2407.02482 link
2024-07-02 GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models Jian Ma et.al. 2407.02252 link
2024-07-02 LaMoD: Latent Motion Diffusion Model For Myocardial Strain Generation Jiarui Xing et.al. 2407.02229 link
2024-07-02 UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks Jingjing Ren et.al. 2407.02158 null
2024-07-02 Counterfactual Data Augmentation with Denoising Diffusion for Graph Anomaly Detection Chunjing Xiao et.al. 2407.02143 link
2024-07-02 Latent Diffusion Model for Generating Ensembles of Climate Simulations Johannes Meuer et.al. 2407.02070 null
2024-07-02 Accompanied Singing Voice Synthesis with Fully Text-controlled Melody Ruiqi Li et.al. 2407.02049 null
2024-07-02 ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation Zhiyuan Ma et.al. 2407.02040 link
2024-07-02 SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules Suyi Li et.al. 2407.02031 null
2024-06-28 HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model Hieu T. Nguyen et.al. 2406.20077 null
2024-06-28 Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence Xiantao Fan et.al. 2406.20047 null
2024-06-28 HAITCH: A Framework for Distortion and Motion Correction in Fetal Multi-Shell Diffusion-Weighted MRI Haykel Snoussi et.al. 2406.20042 null
2024-06-28 Deceptive Diffusion: Generating Synthetic Adversarial Examples Lucas Beerens et.al. 2406.19807 null
2024-06-28 Comprehensive Generative Replay for Task-Incremental Segmentation with Concurrent Appearance and Semantic Forgetting Wei Li et.al. 2406.19796 link
2024-06-28 Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels Jie Zhang et.al. 2406.19769 null
2024-06-28 DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems Kexiong Yu et.al. 2406.19705 null
2024-06-28 Network Bending of Diffusion Models for Audio-Visual Generation Luke Dzwonczyk et.al. 2406.19589 link
2024-06-27 A Thermal Study of Terahertz Induced Protein Interactions Hadeel Elayan et.al. 2406.19521 null
2024-06-27 pop-cosmos: Scaleable inference of galaxy properties and redshifts with a data-driven population model Stephen Thorp et.al. 2406.19437 null
2024-06-27 Accelerating Multiphase Flow Simulations with Denoising Diffusion Model Driven Initializations Jaehong Chung et.al. 2406.19333 null
2024-06-27 Subtractive Training for Music Stem Insertion using Latent Diffusion Models Ivan Villa-Renteria et.al. 2406.19328 null
2024-06-27 Compositional Image Decomposition with Diffusion Models Jocelin Su et.al. 2406.19298 null
2024-06-27 Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model Jiangtong Tan et.al. 2406.19030 link
2024-06-28 AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation Yanan Sun et.al. 2406.18958 link
2024-06-27 Investigating and Defending Shortcut Learning in Personalized Diffusion Models Yixin Liu et.al. 2406.18944 link
2024-06-28 AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models Aishwarya Agarwal et.al. 2406.18893 null
2024-06-27 Chemical Continuous Time Random Walks under Anomalous Diffusion Hong Zhang et.al. 2406.18869 null
2024-06-26 MultiDiff: Consistent Novel View Synthesis from a Single Image Norman Müller et.al. 2406.18524 null
2024-06-26 Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration Kang Liao et.al. 2406.18516 link
2024-06-26 DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance Younghyun Kim et.al. 2406.18459 link
2024-06-26 Towards diffusion models for large-scale sea-ice modelling Tobias Sebastian Finn et.al. 2406.18417 null
2024-06-27 Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process Tianyu Lin et.al. 2406.18361 link
2024-06-26 Molecular Diffusion Models with Virtual Receptors Matan Halfon et.al. 2406.18330 null
2024-06-26 Galaxy spectroscopy without spectra: Galaxy properties from photometric images with conditional diffusion models Lars Doorenbos et.al. 2406.18175 link
2024-06-26 Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models Xiaolin Hong et.al. 2406.18159 null
2024-06-26 Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation Qilai Zhang et.al. 2406.18054 link
2024-06-25 DiffusionPDE: Generative PDE-Solving Under Partial Observation Jiahe Huang et.al. 2406.17763 link
2024-06-25 Unified Auto-Encoding with Masked Diffusion Philippe Hansen-Estruch et.al. 2406.17688 link
2024-06-25 LaTable: Towards Large Tabular Models Boris van Breugel et.al. 2406.17673 null
2024-06-25 Aligning Diffusion Models with Noise-Conditioned Perception Alexander Gambashidze et.al. 2406.17636 null
2024-06-25 Diffusion-based Adversarial Purification for Intrusion Detection Mohamed Amine Merzouk et.al. 2406.17606 null
2024-06-25 Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text Xinyang Li et.al. 2406.17601 link
2024-06-25 Detection of Synthetic Face Images: Accuracy, Robustness, Generalization Nela Petrzelkova et.al. 2406.17547 null
2024-06-25 Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Felix Stillger et.al. 2406.17541 null
2024-06-25 The Tree of Diffusion Life: Evolutionary Embeddings to Understand the Generation Process of Diffusion Models Vidya Prasad et.al. 2406.17462 null
2024-06-25 SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing Ruihuang Li et.al. 2406.17396 null
2024-06-24 FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models Haonan Qiu et.al. 2406.16863 link
2024-06-24 Dreamitate: Real-World Visuomotor Policy Learning via Video Generation Junbang Liang et.al. 2406.16862 null
2024-06-24 General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design Yue Jian et.al. 2406.16821 null
2024-06-24 Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image Jinkun Hao et.al. 2406.16710 null
2024-06-24 Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling Min-Seop Kwak et.al. 2406.16695 null
2024-06-24 Repulsive Score Distillation for Diverse Sampling of Diffusion Models Nicolas Zilberstein et.al. 2406.16683 link
2024-06-24 OAML: Outlier Aware Metric Learning for OOD Detection Enhancement Heng Gao et.al. 2406.16525 link
2024-06-24 DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution Aiwen Jiang et.al. 2406.16477 link
2024-06-24 ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance Shuwei Shi et.al. 2406.16476 null
2024-06-24 Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models Yichen Sun et.al. 2406.16333 null
2024-06-21 Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild Nadav Orzech et.al. 2406.15331 null
2024-06-21 You Only Acquire Sparse-channel (YOAS): A Unified Framework for Dense-channel EEG Generation Hongyu Chen et.al. 2406.15269 null
2024-06-21 Unsupervised Bayesian Generation of Synthetic CT from CBCT Using Patient-Specific Score-Based Prior Junbo Peng et.al. 2406.15219 null
2024-06-21 A3D: Does Diffusion Dream about 3D Alignment? Savva Ignatyev et.al. 2406.15020 null
2024-06-21 Probabilistic and Differentiable Wireless Simulation with Geometric Transformers Thomas Hehn et.al. 2406.14995 null
2024-06-21 VividDreamer: Towards High-Fidelity and Efficient Text-to-3D Generation Zixuan Chen et.al. 2406.14964 null
2024-06-21 LatentExplainer: Explaining Latent Representations in Deep Generative Models with Multi-modal Foundation Models Mengdan Zhu et.al. 2406.14862 link
2024-06-21 Six-CD: Benchmarking Concept Removals for Benign Text-to-image Diffusion Models Jie Ren et.al. 2406.14855 link
2024-06-21 DExter: Learning and Controlling Performance Expression with Diffusion Models Huan Zhang et.al. 2406.14850 link
2024-06-21 Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning Xu Han et.al. 2406.14847 null
2024-06-20 A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models Xincheng Shuai et.al. 2406.14555 link
2024-06-21 Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation Eyal Michaeli et.al. 2406.14551 link
2024-06-20 Consistency Models Made Easy Zhengyang Geng et.al. 2406.14548 link
2024-06-20 Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps Nikita Starodubcev et.al. 2406.14539 null
2024-06-20 V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data Rotem Shalev-Arkushin et.al. 2406.14510 null
2024-06-20 SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset Josef Dai et.al. 2406.14477 link
2024-06-20 CollaFuse: Collaborative Diffusion Models Simeon Allmendinger et.al. 2406.14429 link
2024-06-20 Active Diffusion Subsampling Oisin Nolan et.al. 2406.14388 link
2024-06-20 In Tree Structure Should Sentence Be Generated Yaguang Li et.al. 2406.14189 link
2024-06-20 CriDiff: Criss-cross Injection Diffusion Framework via Generative Pre-train for Prostate Segmentation Tingwei Liu et.al. 2406.14186 link
2024-06-18 Evaluating the design space of diffusion-based generative models Yuqing Wang et.al. 2406.12839 null
2024-06-18 Neural Approximate Mirror Maps for Constrained Diffusion Models Berthy T. Feng et.al. 2406.12816 null
2024-06-18 Extracting Training Data from Unconditional Diffusion Models Yunhao Chen et.al. 2406.12752 null
2024-06-18 Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation Miseul Kim et.al. 2406.12688 null
2024-06-18 GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models Yongtao Ge et.al. 2406.12671 link
2024-06-18 Unmasking the Veil: An Investigation into Concept Ablation for Privacy and Copyright Protection in Images Shivank Garg et.al. 2406.12592 link
2024-06-18 Training Diffusion Models with Federated Learning Matthijs de Goede et.al. 2406.12575 null
2024-06-18 Variational Distillation of Diffusion Policies into Mixture of Experts Hongyi Zhou et.al. 2406.12538 null
2024-06-18 HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors Panwang Pan et.al. 2406.12459 link
2024-06-18 Planning Using Schrödinger Bridge Diffusion Models Adarsh Srivastava et.al. 2406.12458 link
2024-06-17 Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models Bingqi Ma et.al. 2406.11831 null
2024-06-17 MegaScenes: Scene-Level View Synthesis at Scale Joseph Tung et.al. 2406.11819 link
2024-06-17 DiffMM: Multi-Modal Diffusion Model for Recommendation Yangqin Jiang et.al. 2406.11781 link
2024-06-17 Latent Denoising Diffusion GAN: Faster sampling, Higher image quality Luan Thanh Trinh et.al. 2406.11713 link
2024-06-17 MusicScore: A Dataset for Music Score Modeling and Generation Yuheng Lin et.al. 2406.11462 link
2024-06-17 AnyTrans: Translate AnyText in the Image with Large Scale Models Zhipeng Qian et.al. 2406.11432 null
2024-06-17 DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer Keon Lee et.al. 2406.11427 null
2024-06-17 Unfolding Time: Generative Modeling for Turbulent Flows in 4D Abdullah Saydemir et.al. 2406.11390 null
2024-06-17 Diffusion Models in Low-Level Vision: A Survey Chunming He et.al. 2406.11138 link
2024-06-16 Exploiting Diffusion Prior for Out-of-Distribution Detection Armando Zhu et.al. 2406.11105 null
2024-06-14 SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models Zhaoxu Luo et.al. 2406.10225 null
2024-06-14 DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography Reconstruction Bowen Song et.al. 2406.10211 null
2024-06-14 Make It Count: Text-to-Image Generation with an Accurate Number of Objects Lital Binyamin et.al. 2406.10210 null
2024-06-14 Crafting Parts for Expressive Object Composition Harsh Rangwani et.al. 2406.10197 null
2024-06-14 Training-free Camera Control for Video Generation Chen Hou et.al. 2406.10126 null
2024-06-14 Group and Shuffle: Efficient Structured Orthogonal Parametrization Mikhail Gorbunov et.al. 2406.10019 null
2024-06-14 OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control Yuzhong Huang et.al. 2406.10000 null
2024-06-14 InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement Learning Tiancheng Li et.al. 2406.09973 null
2024-06-14 GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion Trapoom Ukarapol et.al. 2406.09850 link
2024-06-14 Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion Runze Liu et.al. 2406.09782 null
2024-06-13 Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models Qihao Liu et.al. 2406.09416 null
2024-06-13 An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Duy-Kien Nguyen et.al. 2406.09415 null
2024-06-13 Interpreting the Weight Space of Customized Diffusion Models Amil Dravid et.al. 2406.09413 link
2024-06-13 ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing Jun-Kun Chen et.al. 2406.09404 null
2024-06-13 Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion Linzhan Mou et.al. 2406.09402 null
2024-06-13 OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation Junke Wang et.al. 2406.09399 link
2024-06-13 SimGen: Simulator-conditioned Driving Scene Generation Yunsong Zhou et.al. 2406.09386 null
2024-06-13 CLIPAway: Harmonizing Focused Embeddings for Removing Objects via Diffusion Models Yigit Ekin et.al. 2406.09368 link
2024-06-13 Understanding Hallucinations in Diffusion Models through Mode Interpolation Sumukh K Aithal et.al. 2406.09358 link
2024-06-13 Advancing Graph Generation through Beta Diffusion Yilin He et.al. 2406.09357 link
2024-06-12 Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation Raphael Tang et.al. 2406.08482 null
2024-06-12 Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models Yuxuan Xue et.al. 2406.08475 null
2024-06-12 $\texttt{DiffLense}$: A Conditional Diffusion Model for Super-Resolution of Gravitational Lensing Data Pranath Reddy et.al. 2406.08442 null
2024-06-12 Diffusion Soup: Model Merging for Text-to-Image Diffusion Models Benjamin Biggs et.al. 2406.08431 null
2024-06-12 FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation Xinzhi Mu et.al. 2406.08392 null
2024-06-12 Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models Javier Nistal et.al. 2406.08384 null
2024-06-12 2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction Tianqi Chen et.al. 2406.08374 null
2024-06-12 WMAdapter: Adding WaterMark Control to Latent Diffusion Models Hai Ci et.al. 2406.08337 null
2024-06-12 Dataset Enhancement with Instance-Level Augmentations Orest Kupyn et.al. 2406.08249 link
2024-06-12 Diffusion-Promoted HDR Video Reconstruction Yuanshen Guan et.al. 2406.08204 null
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550 link
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487 link
2024-06-11 Image Neural Field Diffusion Models Yinbo Chen et.al. 2406.07480 null
2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Heng Yu et.al. 2406.07472 null
2024-06-11 Noise-robust Speech Separation with Fast Generative Correction Helin Wang et.al. 2406.07461 link
2024-06-10 IllumiNeRF: 3D Relighting without Inverse Rendering Xiaoming Zhao et.al. 2406.06527 null
2024-06-10 Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Peize Sun et.al. 2406.06525 link
2024-06-10 Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer Sigal Raab et.al. 2406.06508 link
2024-06-10 AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction Zhen Xing et.al. 2406.06465 null
2024-06-10 Cometh: A continuous-time discrete-state graph diffusion model Antoine Siraudin et.al. 2406.06449 null
2024-06-10 Margin-aware Preference Optimization for Aligning Diffusion Models without Reference Jiwoo Hong et.al. 2406.06424 null
2024-06-10 Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization Yi Gu et.al. 2406.06382 link
2024-06-10 Improving Deep Learning-based Automatic Cranial Defect Reconstruction by Heavy Data Augmentation: From Image Registration to Latent Diffusion Models Marek Wodzinski et.al. 2406.06372 null
2024-06-10 MVGamba: Unify 3D Content Generation as State Space Sequence Modeling Xuanyu Yi et.al. 2406.06367 link
2024-06-11 Tuning-Free Visual Customization via View Iterative Self-Attention Control Xiaojie Li et.al. 2406.06258 link
2024-06-07 CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion Xingrui Wang et.al. 2406.05082 null
2024-06-07 Generative diffusion models for synthetic trajectories of heavy and light particles in turbulence Tianyi Li et.al. 2406.05008 null
2024-06-07 Learning Divergence Fields for Shift-Robust Graph Representations Qitian Wu et.al. 2406.04963 link
2024-06-07 Combinatorial Complex Score-based Diffusion Modelling through Stochastic Differential Equations Adrien Carrel et.al. 2406.04916 link
2024-06-07 Online Continual Learning of Video Diffusion Models From a Single Video Stream Jason Yoo et.al. 2406.04814 null
2024-06-07 TEDi Policy: Temporally Entangled Diffusion for Robotic Control Sigmund H. Høeg et.al. 2406.04806 link
2024-06-07 Diffusion-based Generative Image Outpainting for Recovery of FOV-Truncated CT Images Michelle Espranita Liman et.al. 2406.04769 link
2024-06-07 PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction Eduard Poesina et.al. 2406.04746 link
2024-06-07 FlowMM: Generating Materials with Riemannian Flow Matching Benjamin Kurt Miller et.al. 2406.04713 null
2024-06-07 MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models Sanjoy Chowdhury et.al. 2406.04673 link
2024-06-07 Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion Fangfu Liu et.al. 2406.04338 null
2024-06-06 Coherent Zero-Shot Visual Instruction Generation Quynh Phung et.al. 2406.04337 null
2024-06-06 BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Yang Sui et.al. 2406.04333 link
2024-06-06 Simplified and Generalized Masked Diffusion for Discrete Data Jiaxin Shi et.al. 2406.04329 link
2024-06-06 SF-V: Single Forward Video Generation Model Zhixing Zhang et.al. 2406.04324 link
2024-06-06 ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories Qianlan Yang et.al. 2406.04323 null
2024-06-07 DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data Qihao Liu et.al. 2406.04322 link
2024-06-06 Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step Zhanhao Liang et.al. 2406.04314 link
2024-06-06 Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment Jiayi Guo et.al. 2406.04295 link
2024-06-06 VideoTetris: Towards Compositional Text-to-Video Generation Ye Tian et.al. 2406.04277 link
2024-06-05 Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input Joachim Ott et.al. 2406.03439 null
2024-06-05 Text-to-Image Rectified Flow as Plug-and-Play Priors Xiaofeng Yang et.al. 2406.03293 link
2024-06-05 Generative Diffusion Models for Fast Simulations of Particle Collisions at CERN Mikołaj Kita et.al. 2406.03233 null
2024-06-05 Searching Priors Makes Text-to-Video Synthesis Better Haoran Cheng et.al. 2406.03215 null
2024-06-05 Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion Hao Wen et.al. 2406.03184 link
2024-06-05 Tiny models from tiny data: Textual and null-text inversion for few-shot distillation Erik Landolsi et.al. 2406.03146 link
2024-06-05 Floating Anchor Diffusion Model for Multi-motif Scaffolding Ke Liu et.al. 2406.03141 link
2024-06-05 Phy-Diff: Physics-guided Hourglass Diffusion Model for Diffusion MRI Synthesis Juanhua Zhang et.al. 2406.03002 null
2024-06-05 Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models Zihan Ye et.al. 2406.02929 null
2024-06-06 U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation Chenxin Li et.al. 2406.02918 null
2024-06-04 Dreamguider: Improved Training free Diffusion-based Conditional Generation Nithin Gopalakrishnan Nair et.al. 2406.02549 null
2024-06-05 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509 null
2024-06-04 Guiding a Diffusion Model with a Bad Version of Itself Tero Karras et.al. 2406.02507 link
2024-06-04 Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation Jiajun Wang et.al. 2406.02485 link
2024-06-04 Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion Colin Hansen et.al. 2406.02477 null
2024-06-04 Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems Jason Hu et.al. 2406.02462 link
2024-06-04 RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting Qi Wang et.al. 2406.02461 null
2024-06-04 Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models Dominik Hintersdorf et.al. 2406.02366 link
2024-06-04 Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Clement Chadebec et.al. 2406.02347 link
2024-05-31 Mixed Diffusion for 3D Indoor Scene Synthesis Siyi Hu et.al. 2405.21066 link
2024-05-31 Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models Jingjing Wang et.al. 2405.21059 null
2024-05-31 Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models Xinxi Zhang et.al. 2405.21050 null
2024-05-31 Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu et.al. 2405.21048 null
2024-05-31 Amortizing intractable inference in diffusion models for vision, language, and control Siddarth Venkatraman et.al. 2405.20971 link
2024-05-31 Flow matching achieves minimax optimal convergence Kenji Fukumizu et.al. 2405.20879 null
2024-05-31 MegActor: Harness the Power of Raw Video for Vivid Portrait Animation Shurong Yang et.al. 2405.20851 link
2024-05-31 Share Your Secrets for Privacy! Confidential Forecasting with Vertical Federated Learning Aditya Shankar et.al. 2405.20761 link
2024-05-31 Information Theoretic Text-to-Image Alignment Chao Wang et.al. 2405.20759 null
2024-05-31 Diffusion Models Are Innate One-Step Generators Bowen Zheng et.al. 2405.20750 link
2024-05-30 Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image Kailu Wu et.al. 2405.20343 link
2024-05-30 VividDream: Generating 3D Scene with Ambient Dynamics Yao-Chih Lee et.al. 2405.20334 null
2024-05-30 MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion Shuyuan Tu et.al. 2405.20325 link
2024-05-30 Don't drop your samples! Coherence-aware training benefits Conditional diffusion Nicolas Dufour et.al. 2405.20324 null
2024-05-30 Improving the Training of Rectified Flows Sangyun Lee et.al. 2405.20320 link
2024-05-30 DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation Zachary Novack et.al. 2405.20289 null
2024-05-30 MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model Muyao Niu et.al. 2405.20222 link
2024-05-30 Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback Sanghyeon Na et.al. 2405.20216 null
2024-05-30 MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models Lukas Uzolas et.al. 2405.20155 null
2024-05-31 DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild Honghao Fu et.al. 2405.19996 link
2024-05-29 ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning Ruchika Chavhan et.al. 2405.19237 link
2024-05-30 $E^{3}$Gen: Efficient, Expressive and Editable Avatars Generation Weitian Zhang et.al. 2405.19203 null
2024-05-29 Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning Hanye Zhao et.al. 2405.19189 link
2024-05-29 Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization Zhiwei Tang et.al. 2405.18881 link
2024-05-29 Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors Zihui Wu et.al. 2405.18782 link
2024-05-29 RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching Divya Nori et.al. 2405.18768 link
2024-05-29 Stationary distribution approximations of Two-island Wright-Fisher and seed-bank models using Stein's method Han L. Gan et.al. 2405.18763 null
2024-05-29 Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning Tianle Zhang et.al. 2405.18729 null
2024-05-29 Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRI Che Liu et.al. 2405.18726 null
2024-05-29 Learning Diffeomorphism for Image Registration with Time-Continuous Networks using Semigroup Regularization Mohammadjavad Matinkia et.al. 2405.18684 link
2024-05-28 DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention Lianghui Zhu et.al. 2405.18428 link
2024-05-28 Phased Consistency Model Fu-Yun Wang et.al. 2405.18407 link
2024-05-28 RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives Jaehong Yoon et.al. 2405.18406 link
2024-05-28 Multi-modal Generation via Cross-Modal In-Context Learning Amandeep Kumar et.al. 2405.18304 link
2024-05-28 CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths Reihaneh Teimouri et.al. 2405.18267 link
2024-05-28 EG4D: Explicit Generation of 4D Object without Score Distillation Qi Sun et.al. 2405.18132 link
2024-05-28 Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? Zebin You et.al. 2405.18029 null
2024-05-28 Unveiling the Power of Diffusion Features For Personalized Segmentation and Retrieval Dvir Samuel et.al. 2405.18025 link
2024-05-28 MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling Bowen Zhang et.al. 2405.18003 link
2024-05-28 AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization Junjie Shentu et.al. 2405.17965 link
2024-05-27 Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer Ruizhi Shao et.al. 2405.17405 null
2024-05-27 A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training Kai Wang et.al. 2405.17403 link
2024-05-27 RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control Litu Rout et.al. 2405.17401 null
2024-05-27 EASI-Tex: Edge-Aware Mesh Texturing from Single Image Sai Raj Kishore Perla et.al. 2405.17393 null
2024-05-28 Controllable Longer Image Animation with Diffusion Models Qiang Wang et.al. 2405.17306 null
2024-05-27 Does Diffusion Beat GAN in Image Super Resolution? Denis Kuznedelev et.al. 2405.17261 link
2024-05-27 DreamMat: High-quality PBR Material Generation with Geometry- and Light-aware Diffusion Models Yuqing Zhang et.al. 2405.17176 null
2024-05-27 Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction Wenhao Zhang et.al. 2405.17167 null
2024-05-27 PatchScaler: An Efficient Patch-independent Diffusion Model for Super-Resolution Yong Liu et.al. 2405.17158 link
2024-05-27 Ensembling Diffusion Models via Adaptive Feature Aggregation Cong Wang et.al. 2405.17082 link
2024-05-24 Looking Backward: Streaming Video-to-Video Translation with Feature Banks Feng Liang et.al. 2405.15757 link
2024-05-24 Taming Score-Based Diffusion Priors for Infinite-Dimensional Nonlinear Inverse Problems Lorenzo Baldassari et.al. 2405.15676 null
2024-05-24 Reducing the cost of posterior sampling in linear inverse problems via task-dependent score learning Fabian Schneider et.al. 2405.15643 null
2024-05-24 DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation Xiankang He et.al. 2405.15619 null
2024-05-24 Learning to Discretize Denoising Diffusion ODEs Vinh Tong et.al. 2405.15506 link
2024-05-24 Out of Many, One: Designing and Scaffolding Proteins at the Scale of the Structural Universe with Genie 2 Yeqing Lin et.al. 2405.15489 link
2024-05-24 NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer Meng You et.al. 2405.15364 link
2024-05-24 SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation Xinlei Niu et.al. 2405.15338 null
2024-05-24 Challenges and Opportunities in 3D Content Generation Ke Zhao et.al. 2405.15335 null
2024-05-24 Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model Mingyang Yi et.al. 2405.15330 null
2024-05-24 Improved Distribution Matching Distillation for Fast Image Synthesis Tianwei Yin et.al. 2405.14867 link
2024-05-23 Video Diffusion Models are Training-free Motion Interpreter and Controller Zeqi Xiao et.al. 2405.14864 null
2024-05-23 Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models Gen Li et.al. 2405.14861 null
2024-05-23 Semantica: An Adaptable Image-Conditioned Diffusion Model Manoj Kumar et.al. 2405.14857 null
2024-05-23 TerDiT: Ternary Diffusion Models with Transformers Xudong Lu et.al. 2405.14854 link
2024-05-23 Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer Shuang Wu et.al. 2405.14832 null
2024-05-23 Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models Katherine Xu et.al. 2405.14828 null
2024-05-23 PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher Dongjun Kim et.al. 2405.14822 link
2024-05-24 Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation Hongxu Jiang et.al. 2405.14802 link
2024-05-23 Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy Shengfang Zhai et.al. 2405.14800 link
2024-05-21 Personalized Residuals for Concept-Driven Text-to-Image Generation Cusuh Ham et.al. 2405.12978 null
2024-05-21 Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control Yue Han et.al. 2405.12970 null
2024-05-21 Impact of inhomogeneous diffusion on secondary cosmic ray and antiproton local spectra Álvaro Tovar-Pardo et.al. 2405.12918 null
2024-05-21 Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images Xiaofei Yu et.al. 2405.12875 link
2024-05-21 Model Free Prediction with Uncertainty Assessment Yuling Jiao et.al. 2405.12684 null
2024-05-21 CustomText: Customized Textual Image Generation using Diffusion Models Shubham Paliwal et.al. 2405.12531 null
2024-05-21 Customize Your Own Paired Data via Few-shot Way Jinshu Chen et.al. 2405.12490 null
2024-05-21 One-step data-driven generative model via Schrödinger Bridge Hanwen Huang et.al. 2405.12453 null
2024-05-20 Diffusion for World Modeling: Visual Details Matter in Atari Eloi Alonso et.al. 2405.12399 link
2024-05-20 Images that Sound: Composing Images and Sounds on a Single Canvas Ziyang Chen et.al. 2405.12221 null
2024-05-20 Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen et.al. 2405.12211 link
2024-05-20 Nonequilbrium physics of generative diffusion models Zhendong Yu et.al. 2405.11932 null
2024-05-20 "Set It Up!": Functional Object Arrangement with Compositional Generative Models Yiqing Xu et.al. 2405.11928 null
2024-05-20 Diff-BGM: A Diffusion Model for Video Background Music Generation Sizhe Li et.al. 2405.11913 link
2024-05-20 Out-of-Distribution Detection with a Single Unconditional Diffusion Model Alvin Heng et.al. 2405.11881 link
2024-05-20 Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models Xiyu Wang et.al. 2405.11852 null
2024-05-20 Alternators For Sequence Modeling Mohammad Reza Rezaei et.al. 2405.11848 null
2024-05-20 ViViD: Video Virtual Try-on using Diffusion Models Zixun Fang et.al. 2405.11794 null
2024-05-20 Guided Multi-objective Generative AI to Enhance Structure-based Drug Design Amit Kadan et.al. 2405.11785 link
2024-05-17 Improving face generation quality and prompt following with synthetic captions Michail Tarasiou et.al. 2405.10864 null
2024-05-17 Deep Data Consistency: a Fast and Robust Diffusion Model-based Solver for Inverse Problems Hanyu Chen et.al. 2405.10748 link
2024-05-17 Numerical Recovery of the Diffusion Coefficient in Diffusion Equations from Terminal Measurement Bangti Jin et.al. 2405.10708 null
2024-05-17 LoCI-DiffCom: Longitudinal Consistency-Informed Diffusion Model for 3D Infant Brain Image Completion Zihao Zhu et.al. 2405.10691 null
2024-05-17 LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion Tong Chen et.al. 2405.10550 link
2024-05-17 ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation Pengzhi Li et.al. 2405.10508 null
2024-05-16 Text-to-Vector Generation with Neural Path Representation Peiying Zhang et.al. 2405.10317 null
2024-05-16 Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model Zheng Gu et.al. 2405.10316 null
2024-05-16 CAT3D: Create Anything in 3D with Multi-View Diffusion Models Ruiqi Gao et.al. 2405.10314 null
2024-05-16 Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks João Bordalo et.al. 2405.10122 null
2024-05-16 Spurious reconstruction from brain activity Ken Shirakawa et.al. 2405.10078 link
2024-05-16 Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution Xingjian Wang et.al. 2405.10014 null
2024-05-16 VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing Binghui Chen et.al. 2405.09985 null
2024-05-16 Language-Oriented Semantic Latent Representation for Image Transmission Giordano Cicchetti et.al. 2405.09976 link
2024-05-16 Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models Ziyu Wang et.al. 2405.09901 link
2024-05-16 DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection Yuhao Sun et.al. 2405.09882 link
2024-05-16 MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer Chengyu Wu et.al. 2405.09539 link
2024-05-15 Diffusion-based Contrastive Learning for Sequential Recommendation Ziqiang Cui et.al. 2405.09369 link
2024-05-15 Dance Any Beat: Blending Beats with Visuals in Dance Video Generation Xuanchen Wang et.al. 2405.09266 null
2024-05-15 SOEDiff: Efficient Distillation for Small Object Editing Qihe Pan et.al. 2405.09114 null
2024-05-15 RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing Jiamei Xiong et.al. 2405.09083 link
2024-05-15 Naturalistic Music Decoding from EEG Data via Latent Diffusion Models Emilian Postolache et.al. 2405.09062 null
2024-05-15 Response Matching for generating materials and molecules Bingqing Cheng et.al. 2405.09057 null
2024-05-15 CTS: A Consistency-Based Medical Image Segmentation Model Kejia Zhang et.al. 2405.09056 link
2024-05-14 Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models Bingdong Li et.al. 2405.08674 null
2024-05-14 Towards Multi-Task Generative-AI Edge Services with an Attention-based Diffusion DRL Approach Yaju Liu et.al. 2405.08328 null
2024-05-14 Compositional Text-to-Image Generation with Dense Blob Representations Weili Nie et.al. 2405.08246 null
2024-05-13 Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis Yifan Wang et.al. 2405.08210 null
2024-05-13 Do Bayesian imaging methods report trustworthy probabilities? David Y. W. Thong et.al. 2405.08179 null
2024-05-13 DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation Ziang Cao et.al. 2405.08055 link
2024-05-13 Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning Wenqi Dong et.al. 2405.08054 null
2024-05-13 Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data Mahdi Morafah et.al. 2405.07925 null
2024-05-13 CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models Nick Stracke et.al. 2405.07913 null
2024-05-13 SAR Image Synthesis with Diffusion Models Denisa Qosja et.al. 2405.07776 null
2024-05-13 CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution Qingguo Liu et.al. 2405.07648 link
2024-05-13 De novo antibody design with SE(3) diffusion Daniel Cutting et.al. 2405.07622 null
2024-05-13 Reducing Risk for Assistive Reinforcement Learning Policies with Diffusion Models Andrii Tytarenko et.al. 2405.07603 null
2024-05-13 PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator Hanshu Yan et.al. 2405.07510 link
2024-05-13 GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting Haodong Chen et.al. 2405.07472 null
2024-05-12 Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning Masane Fuchi et.al. 2405.07288 link
2024-05-12 Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising Yao Liu et.al. 2405.07164 null
2024-05-10 OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation Jinwei Lin et.al. 2405.06547 link
2024-05-10 SketchDream: Sketch-based Text-to-3D Generation and Editing Feng-Lin Liu et.al. 2405.06461 null
2024-05-10 PUMA: margin-based data pruning Javier Maroto et.al. 2405.06298 null
2024-05-10 Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging Zhuchen Shao et.al. 2405.06175 null
2024-05-09 Distilling Diffusion Models into Conditional GANs Minguk Kang et.al. 2405.05967 null
2024-05-09 Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask Zineb Senane et.al. 2405.05959 link
2024-05-09 Frame Interpolation with Consecutive Brownian Bridge Diffusion Zonglin Lyu et.al. 2405.05953 link
2024-05-09 Composable Part-Based Manipulation Weiyu Liu et.al. 2405.05876 null
2024-05-09 Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control Gunshi Gupta et.al. 2405.05852 link
2024-05-09 Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models Zhe Ma et.al. 2405.05846 link
2024-05-09 MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction Pinhuang Tan et.al. 2405.05814 null
2024-05-10 MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation Yuxiang Wei et.al. 2405.05806 link
2024-05-09 DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation Sitian Shen et.al. 2405.05800 null
2024-05-09 Sequential Amodal Segmentation via Cumulative Occlusion Learning Jiayang Ao et.al. 2405.05791 null
2024-05-08 Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo Nayantara Mudur et.al. 2405.05255 link
2024-05-08 Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models Hongjie Wang et.al. 2405.05252 null
2024-05-08 Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation Jonas Kohler et.al. 2405.05224 null
2024-05-08 FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models Jinglin Xu et.al. 2405.05216 link
2024-05-08 An anti-noise seismic inversion method based on diffusion model Yingtian Liu et.al. 2405.05026 link
2024-05-08 Discrepancy-based Diffusion Models for Lesion Detection in Brain MRI Keqiang Fan et.al. 2405.04974 null
2024-05-08 Empowering Wireless Networks with Artificial Intelligence Generated Graph Jiacheng Wang et.al. 2405.04907 null
2024-05-08 Fast LiDAR Upsampling using Conditional Diffusion Models Sander Elias Magnussen Helgesen et.al. 2405.04889 link
2024-05-08 FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation Xuehai He et.al. 2405.04834 null
2024-05-08 Variational Schrödinger Diffusion Models Wei Deng et.al. 2405.04795 null
2024-05-07 Tactile-Augmented Radiance Fields Yiming Dou et.al. 2405.04534 link
2024-05-07 Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing Yi Zuo et.al. 2405.04496 null
2024-05-07 CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model Haixia Xiao et.al. 2405.04483 null
2024-05-07 Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos Junyi Ma et.al. 2405.04370 link
2024-05-07 Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation Jihyun Kim et.al. 2405.04356 link
2024-05-08 Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer Zhuoyi Yang et.al. 2405.04312 link
2024-05-07 BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models Eloi Moliner et.al. 2405.04272 null
2024-05-07 Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models Fan Bao et.al. 2405.04233 null
2024-05-07 Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model Joo Young Choi et.al. 2405.03958 null
2024-05-06 MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View Emmanuelle Bourigault et.al. 2405.03894 null
2024-05-06 Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models Ludwig Winkler et.al. 2405.03549 null
2024-05-06 CCDM: Continuous Conditional Diffusion Models for Image Generation Xin Ding et.al. 2405.03546 link
2024-05-06 LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model Haowen Sun et.al. 2405.03485 link
2024-05-06 Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond Jiuxiang Gu et.al. 2405.03251 null
2024-05-06 Hyperbolic Geometric Latent Diffusion Model for Graph Generation Xingcheng Fu et.al. 2405.03188 link
2024-05-06 DeepMpMRI: Tensor-decomposition Regularized Learning for Fast and High-Fidelity Multi-Parametric Microstructural MR Imaging Wenxin Fan et.al. 2405.03159 null
2024-05-06 Video Diffusion Models: A Survey Andrew Melnik et.al. 2405.03150 link
2024-05-06 AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding Tao Liu et.al. 2405.03121 link
2024-05-05 Matten: Video Generation with Mamba-Attention Yu Gao et.al. 2405.03025 null
2024-05-05 Exploring Text-based Realistic Building Facades Editing Applicaiton Jing Wang et.al. 2405.02967 null
2024-05-03 DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos Wen-Hsuan Chu et.al. 2405.02280 link
2024-05-03 Multi-grid reaction-diffusion master equation: applications to morphogen gradient modelling Radek Erban et.al. 2405.02117 null
2024-05-03 DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model Peijin Jia et.al. 2405.02008 null
2024-05-03 Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition Yichun Tai et.al. 2405.01872 null
2024-05-03 Creation of Novel Soft Robot Designs using Generative AI Wee Kiat Chan et.al. 2405.01824 null
2024-05-03 Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics Rucha Deshpande et.al. 2405.01822 null
2024-05-02 Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model Zongyang Du et.al. 2405.01730 null
2024-05-02 Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning Rafael Elberg et.al. 2405.01705 link
2024-05-02 LocInv: Localization-aware Inversion for Text-Guided Image Editing Chuanming Tang et.al. 2405.01496 link
2024-05-02 Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models Matias Mendieta et.al. 2405.01494 null
2024-05-02 Statistical algorithms for low-frequency diffusion data: A PDE approach Matteo Giordano et.al. 2405.01372 link
2024-05-02 DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines Ye Tian et.al. 2405.01248 null
2024-05-02 Automated Virtual Product Placement and Assessment in Images using Diffusion Models Mohammad Mahmudul Alam et.al. 2405.01130 null
2024-05-02 Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields Yuhang Huang et.al. 2405.00998 null
2024-05-02 Generative manufacturing systems using diffusion models and ChatGPT Xingyu Li et.al. 2405.00958 null
2024-05-02 EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion Guangyao Zhai et.al. 2405.00915 null
2024-05-01 SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models Burak Can Biner et.al. 2405.00878 null
2024-05-01 Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers Palawat Busaranuvong et.al. 2405.00858 null
2024-05-01 TexSliders: Diffusion-Based Texture Editing in CLIP Space Julia Guerrero-Viu et.al. 2405.00672 null
2024-05-01 RGB $\leftrightarrow$ X: Image decomposition and synthesis using material- and lighting-aware diffusion models Zheng Zeng et.al. 2405.00666 null
2024-05-01 Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure Assefa Seyoum Wahd et.al. 2405.00631 null
2024-05-01 Lane Segmentation Refinement with Diffusion Models Antonio Ruiz et.al. 2405.00620 null
2024-05-01 Pricing and delta computation in jump-diffusion models with stochastic intensity by Malliavin calculus Ayub Ahmadi et.al. 2405.00473 null
2024-05-01 Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable Haozhe Liu et.al. 2405.00466 null
2024-05-01 Detail-Enhancing Framework for Reference-Based Image Super-Resolution Zihan Wang et.al. 2405.00431 null
2024-05-01 Streamlining Image Editing with Layered Diffusion Brushes Peyman Gholami et.al. 2405.00313 null
2024-05-02 An Unstructured Mesh Reaction-Drift-Diffusion Master Equation with Reversible Reactions Samuel A. Isaacson et.al. 2405.00283 null
2024-05-01 ASAM: Boosting Segment Anything Model with Adversarial Tuning Bo Li et.al. 2405.00256 link
2024-04-30 MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model Wenxun Dai et.al. 2404.19759 link
2024-04-30 Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting Paul Engstler et.al. 2404.19758 null
2024-04-30 Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation Ian Dunn et.al. 2404.19739 link
2024-04-30 X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models Emmanuelle Bourigault et.al. 2404.19604 null
2024-04-30 MicroDreamer: Zero-shot 3D Generation in $\sim$ 20 Seconds by Score-based Iterative Reconstruction Luxi Chen et.al. 2404.19525 link
2024-04-30 TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models Teng Zhou et.al. 2404.19475 link
2024-04-30 Probing Unlearned Diffusion Models: A Transferable Adversarial Attack Perspective Xiaoxuan Han et.al. 2404.19382 link
2024-04-30 Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model Wentao Lei et.al. 2404.19277 null
2024-04-30 DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets Xiaoyu Huang et.al. 2404.19264 null
2024-04-30 CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition Jianzong Wang et.al. 2404.19187 null
2024-04-29 Stylus: Automatic Adapter Selection for Diffusion Models Michael Luo et.al. 2404.18928 null
2024-04-29 TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation Junhao Cheng et.al. 2404.18919 link
2024-04-29 Learning general Gaussian mixtures with efficient score matching Sitan Chen et.al. 2404.18893 null
2024-04-29 A Survey on Diffusion Models for Time Series and Spatio-Temporal Data Yiyuan Yang et.al. 2404.18886 link
2024-04-29 Learning Mixtures of Gaussians Using Diffusion Models Khashayar Gatmiry et.al. 2404.18869 null
2024-04-29 Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior Zhiyuan Li et.al. 2404.18820 link
2024-04-29 Bootstrap 3D Reconstructed Scenes from 3D Gaussian Splatting Yifei Gao et.al. 2404.18669 null
2024-04-29 FlexiFilm: Long Video Generation with Flexible Conditions Yichen Ouyang et.al. 2404.18620 link
2024-04-29 Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting Tianyidan Xie et.al. 2404.18598 null
2024-04-29 U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models Song Mei et.al. 2404.18444 null
2024-04-26 MaPa: Text-driven Photorealistic Material Painting for 3D Shapes Shangzhan Zhang et.al. 2404.17569 null
2024-04-26 Chemotaxis-inspired PDE model for airborne infectious disease transmission: analysis and simulations Pierluigi Colli et.al. 2404.17506 null
2024-04-26 Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation Seungwook Kim et.al. 2404.17419 null
2024-04-29 MV-VTON: Multi-View Virtual Try-On with Diffusion Models Haoyu Wang et.al. 2404.17364 link
2024-04-26 Simultaneous Tri-Modal Medical Image Fusion and Super-Resolution using Conditional Diffusion Model Yushen Xu et.al. 2404.17357 link
2024-04-26 Trinity Detector:text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection Jiawei Song et.al. 2404.17254 null
2024-04-26 Few-shot Calligraphy Style Learning Fangda Chen et.al. 2404.17199 link
2024-04-25 CyNetDiff -- A Python Library for Accelerated Implementation of Network Diffusion Models Eliot W. Robson et.al. 2404.17059 link
2024-04-25 Universal fragmentation in annihilation reactions with constrained kinetics Enrique Rozas Garcia et.al. 2404.16950 null
2024-04-25 Inferring solid-state diffusivity in lithium-ion battery active materials: improving upon the classical GITT method A. Emir Gumrukcuoglu et.al. 2404.16658 null
2024-04-25 MuseumMaker: Continual Style Customization without Catastrophic Forgetting Chenxi Liu et.al. 2404.16612 null
2024-04-25 Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models Parul Gupta et.al. 2404.16556 null
2024-04-25 DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference Zhihao Shuai et.al. 2404.16474 null
2024-04-25 TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models Haomiao Ni et.al. 2404.16306 link
2024-04-25 CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions Haoyuan Li et.al. 2404.16302 link
2024-04-25 One Noise to Rule Them All: Learning a Unified Model of Spatially-Varying Noise Patterns Arman Maesumi et.al. 2404.16292 null
2024-04-24 Editable Image Elements for Controllable Synthesis Jiteng Mu et.al. 2404.16029 null
2024-04-24 RetinaRegNet: A Versatile Approach for Retinal Image Registration Vishal Balaji Sivaraman et.al. 2404.16017 link
2024-04-24 MYCloth: Towards Intelligent and Interactive Online T-Shirt Customization based on User's Preference Yexin Liu et.al. 2404.15801 null
2024-04-24 Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models Xu Shen et.al. 2404.15625 null
2024-04-24 A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution Zhixiong Yang et.al. 2404.15620 link
2024-04-23 ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning Weifeng Chen et.al. 2404.15449 null
2024-04-23 GLoD: Composing Global Contexts and Local Details in Image Generation Moyuru Yamada et.al. 2404.15447 null
2024-04-23 ControlTraj: Controllable Trajectory Generation with Topology-Constrained Diffusion Model Yuanshao Zhu et.al. 2404.15380 null
2024-04-23 Heat flow, log-concavity, and Lipschitz transport maps Giovanni Brigati et.al. 2404.15205 null
2024-04-23 CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method Mingbao Lin et.al. 2404.15141 link
2024-04-23 Taming Diffusion Probabilistic Models for Character Control Rui Chen et.al. 2404.15121 null
2024-04-23 Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models Jingyao Xu et.al. 2404.15081 link
2024-04-23 Music Style Transfer With Diffusion Model Hong Huang et.al. 2404.14771 null
2024-04-23 Gradient Guidance for Diffusion Models: An Optimization Perspective Yingqing Guo et.al. 2404.14743 link
2024-04-23 FlashSpeech: Efficient Zero-Shot Speech Synthesis Zhen Ye et.al. 2404.14700 null
2024-04-23 DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance Linxuan Xin et.al. 2404.14676 null
2024-04-22 UVMap-ID: A Controllable and Personalized UV Map Generative Model Weijie Wang et.al. 2404.14568 link
2024-04-22 Align Your Steps: Optimizing Sampling Schedules in Diffusion Models Amirmojtaba Sabour et.al. 2404.14507 null
2024-04-22 Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses Inhee Lee et.al. 2404.14410 null
2024-04-22 GeoDiffuser: Geometry-Based Image Editing with Diffusion Models Rahul Sajnani et.al. 2404.14403 null
2024-04-22 TAVGBench: Benchmarking Text to Audible-Video Generation Yuxin Mao et.al. 2404.14381 link
2024-04-22 Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion Alexander Shmakov et.al. 2404.14332 null
2024-04-22 X-Ray: A Sequential 3D Representation for Generation Tao Hu et.al. 2404.14329 link
2024-04-22 Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity Yu Hou et.al. 2404.14240 link
2024-04-22 MultiBooth: Towards Generating All Your Concepts in an Image from Text Chenyang Zhu et.al. 2404.14239 link
2024-04-22 Face2Face: Label-driven Facial Retouching Restoration Guanhua Zhao et.al. 2404.14177 null
2024-04-22 FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on Chenhui Wang et.al. 2404.14162 null
2024-04-22 Generative Artificial Intelligence Assisted Wireless Sensing: Human Flow Detection in Practical Communication Environments Jiacheng Wang et.al. 2404.14140 null
2024-04-19 Analysis of Classifier-Free Guidance Weight Schedulers Xi Wang et.al. 2404.13040 null
2024-04-19 RadRotator: 3D Rotation of Radiographs with Diffusion Models Pouria Rouzrokh et.al. 2404.13000 null
2024-04-19 Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics Xiaofei Wang et.al. 2404.12973 null
2024-04-19 Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling Grigory Bartosh et.al. 2404.12940 null
2024-04-19 Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models Konstantinos Vilouras et.al. 2404.12920 null
2024-04-19 Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images Santosh et.al. 2404.12908 link
2024-04-19 ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model Dingming Liu et.al. 2404.12903 null
2024-04-19 Training-and-prompt-free General Painterly Harmonization Using Image-wise Attention Sharing Teng-Fang Hsiao et.al. 2404.12900 link
2024-04-19 MCM: Multi-condition Motion Synthesis Framework Zeyu Ling et.al. 2404.12886 null
2024-04-19 Detecting Out-Of-Distribution Earth Observation Images with Diffusion Models Georges Le Bellier et.al. 2404.12667 null
2024-04-18 G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis Yufei Ye et.al. 2404.12383 null
2024-04-18 Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models Trevor J. Chan et.al. 2404.12361 null
2024-04-18 AniClipart: Clipart Animation with Text-to-Video Priors Ronghuan Wu et.al. 2404.12347 null
2024-04-18 Guided Discrete Diffusion for Electronic Health Record Generation Zixiang Chen et.al. 2404.12314 null
2024-04-18 StyleBooth: Image Style Editing with Multimodal Instruction Zhen Han et.al. 2404.12154 link
2024-04-18 LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights Thibault Castells et.al. 2404.11936 null
2024-04-18 FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models Wei Wu et.al. 2404.11895 link
2024-04-17 Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning Marzi Heidari et.al. 2404.11795 null
2024-04-17 Diffusion Schrödinger Bridge Models for High-Quality MR-to-CT Synthesis for Head and Neck Proton Treatment Planning Muheng Li et.al. 2404.11741 null
2024-04-17 Factorized Diffusion: Perceptual Illusions by Noise Decomposition Daniel Geng et.al. 2404.11615 null
2024-04-17 IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination Xi Chen et.al. 2404.11593 null
2024-04-17 Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding Zezhong Fan et.al. 2404.11589 null
2024-04-17 MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation Kuan-Chieh et.al. 2404.11565 null
2024-04-17 Predicting Long-horizon Futures by Conditioning on Geometry and Time Tarasha Khurana et.al. 2404.11554 null
2024-04-17 SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening Yu Zhong et.al. 2404.11537 null
2024-04-17 Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt Zhanjie Zhang et.al. 2404.11474 link
2024-04-17 Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption Buzhen Huang et.al. 2404.11291 link
2024-04-17 Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case João Gabriel Vinholi et.al. 2404.11243 null
2024-04-17 RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models Han Huang et.al. 2404.11199 link
2024-04-16 RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting Ashkan Mirzaei et.al. 2404.10765 null
2024-04-16 LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? Yuchi Wang et.al. 2404.10763 link
2024-04-16 GazeHTA: End-to-end Gaze Target Detection with Head-Target Association Zhi-Yi Lin et.al. 2404.10718 null
2024-04-16 Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution Yutao Yuan et.al. 2404.10688 link
2024-04-16 Generating Human Interaction Motions in Scenes with Text Control Hongwei Yi et.al. 2404.10685 null
2024-04-16 StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization Yingshu Chen et.al. 2404.10681 null
2024-04-16 Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay Jinmei Liu et.al. 2404.10662 link
2024-04-16 Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences Seungwook Kim et.al. 2404.10603 null
2024-04-17 Do Counterfactual Examples Complicate Adversarial Training? Eric Yeats et.al. 2404.10588 null
2024-04-17 AAVDiff: Experimental Validation of Enhanced Viability and Diversity in Recombinant Adeno-Associated Virus (AAV) Capsids through Diffusion Generation Lijun Liu et.al. 2404.10573 null
2024-04-15 Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement Wenyi Lian et.al. 2404.09735 link
2024-04-15 Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Ziwei Luo et.al. 2404.09732 link
2024-04-15 All-in-one simulation-based inference Manuel Gloeckler et.al. 2404.09636 link
2024-04-15 TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models Haojun Sun et.al. 2404.09532 null
2024-04-15 Magic Clothing: Controllable Garment-Driven Image Synthesis Weifeng Chen et.al. 2404.09512 link
2024-04-15 PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI Yandan Yang et.al. 2404.09465 null
2024-04-15 Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models Peifei Zhu et.al. 2404.09401 null
2024-04-14 Fault Detection in Mobile Networks Using Diffusion Models Mohamad Nabeel et.al. 2404.09240 null
2024-04-14 DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling Xuening Yuan et.al. 2404.09227 null
2024-04-16 LoopAnimate: Loopable Salient Object Animation Fanyi Wang et.al. 2404.09172 null
2024-04-12 Lossy Image Compression with Foundation Diffusion Models Lucas Relic et.al. 2404.08580 null
2024-04-12 PiRD: Physics-informed Residual Diffusion for Flow Field Reconstruction Siming Shan et.al. 2404.08412 null
2024-04-12 Struggle with Adversarial Defense? Try Diffusion Yujie Li et.al. 2404.08273 link
2024-04-12 Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models Zeyu Yang et.al. 2404.08254 link
2024-04-12 Interest Maximization in Social Networks Rahul Kumar Gautam et.al. 2404.08236 null
2024-04-11 ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback Ming Li et.al. 2404.07987 link
2024-04-11 Taming Stable Diffusion for Text to 360° Panorama Image Generation Cheng Zhang et.al. 2404.07949 link
2024-04-11 Adaptive Hyperbolic-cross-space Mapped Jacobi Method on Unbounded Domains with Applications to Solving Multidimensional Spatiotemporal Integrodifferential Equations Yunhong Deng et.al. 2404.07844 null
2024-04-11 ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model Lifan Jiang et.al. 2404.07773 link
2024-04-11 An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization Minshuo Chen et.al. 2404.07771 null
2024-04-11 Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations Yufeng Yue et.al. 2404.07770 null
2024-04-11 Diffusing in Someone Else's Shoes: Robotic Perspective Taking with Diffusion Josua Spisak et.al. 2404.07735 null
2024-04-11 Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models Tuomas Kynkäänniemi et.al. 2404.07724 link
2024-04-11 Implicit and Explicit Language Guidance for Diffusion-based Visual Perception Hefeng Wang et.al. 2404.07600 null
2024-04-11 ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation Stanislav Frolov et.al. 2404.07564 null
2024-04-10 GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models Zewei Zhang et.al. 2404.07206 null
2024-04-10 RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion Jaidev Shriram et.al. 2404.07199 null
2024-04-10 InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models Jiale Xu et.al. 2404.07191 link
2024-04-10 Move Anything with Layered Scene Diffusion Jiawei Ren et.al. 2404.07178 null
2024-04-10 Diffusion-based inpainting of incomplete Euclidean distance matrices of trajectories generated by a fractional Brownian motion Alexander Lobashev et.al. 2404.07029 link
2024-04-10 DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting Shijie Zhou et.al. 2404.06903 null
2024-04-10 Fine color guidance in diffusion models and its application to image compression at extremely low bitrates Tom Bordin et.al. 2404.06865 null
2024-04-10 UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion Junsheng Zhou et.al. 2404.06851 null
2024-04-10 Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer Yanqi Ge et.al. 2404.06835 null
2024-04-10 Zero-shot Point Cloud Completion Via 2D Priors Tianxin Huang et.al. 2404.06814 null
2024-04-09 GeoDirDock: Guiding Docking Along Geodesic Paths Raúl Miñán et.al. 2404.06481 null
2024-04-09 Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion Fan Yang et.al. 2404.06429 link
2024-04-09 ZeST: Zero-Shot Material Transfer from a Single Image Ta-Ying Cheng et.al. 2404.06425 null
2024-04-09 Policy-Guided Diffusion Matthew Thomas Jackson et.al. 2404.06356 link
2024-04-09 Quantum State Generation with Structure-Preserving Diffusion Model Yuchen Zhu et.al. 2404.06336 null
2024-04-09 DiffHarmony: Latent Diffusion Model Meets Image Harmonization Pengfei Zhou et.al. 2404.06139 link
2024-04-09 Hash3D: Training-free Acceleration for 3D Generation Xingyi Yang et.al. 2404.06091 link
2024-04-09 Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data Kai Luan et.al. 2404.06012 null
2024-04-09 Tackling Structural Hallucination in Image Translation with Local Diffusion Seunghoi Kim et.al. 2404.05980 link
2024-04-09 Map Optical Properties to Subwavelength Structures Directly via a Diffusion Model Shijie Rao et.al. 2404.05959 null
2024-04-08 MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Kunpeng Song et.al. 2404.05674 link
2024-04-08 YaART: Yet Another ART Rendering Technology Sergey Kastryulin et.al. 2404.05666 null
2024-04-08 BinaryDM: Towards Accurate Binarization of Diffusion Model Xingyu Zheng et.al. 2404.05662 link
2024-04-08 Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model Jichang Yang et.al. 2404.05648 link
2024-04-08 Learning a Category-level Object Pose Estimator without Pose Annotations Fengrui Tian et.al. 2404.05626 null
2024-04-08 UniFL: Improve Stable Diffusion via Unified Feedback Learning Jiacheng Zhang et.al. 2404.05595 null
2024-04-08 Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models Saman Motamed et.al. 2404.05519 null
2024-04-08 Taming Transformers for Realistic Lidar Point Cloud Generation Hamed Haghighi et.al. 2404.05505 link
2024-04-08 Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance Dazhong Shen et.al. 2404.05384 link
2024-04-08 Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt Zhiqi Huang et.al. 2404.05331 null
2024-04-05 Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models Sangwon Jang et.al. 2404.04243 null
2024-04-05 ToolEENet: Tool Affordance 6D Pose Estimation Yunlong Wang et.al. 2404.04193 null
2024-04-05 Dynamic Prompt Optimizing for Text-to-Image Generation Wenyi Mo et.al. 2404.04095 link
2024-04-05 Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation Mingyuan Zhou et.al. 2404.04057 link
2024-04-05 Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models Gihyun Kwon et.al. 2404.03913 null
2024-04-04 MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation Hanzhe Hu et.al. 2404.03656 null
2024-04-04 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching Dongzhi Jiang et.al. 2404.03653 link
2024-04-04 The More You See in 2D, the More You Perceive in 3D Xinyang Han et.al. 2404.03652 null
2024-04-04 DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior Yiming Zhang et.al. 2404.03642 null
2024-04-04 LCM-Lookahead for Encoder-based Text-to-Image Personalization Rinon Gal et.al. 2404.03620 null
2024-04-04 DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images Zhou Jie et.al. 2404.03595 link
2024-04-04 PointInfinity: Resolution-Invariant Point Diffusion Models Zixuan Huang et.al. 2404.03566 null
2024-04-04 Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models Siyuan Mei et.al. 2404.03541 null
2024-04-04 A Directional Diffusion Graph Transformer for Recommendation Zixuan Yi et.al. 2404.03326 null
2024-04-04 SiloFuse: Cross-silo Synthetic Data Generation with Latent Tabular Diffusion Models Aditya Shankar et.al. 2404.03299 null
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903 link
2024-04-03 Fast Diffusion Model For Seismic Data Noise Attenuation Junheng Peng et.al. 2404.02767 null
2024-04-03 Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models Wentian Zhang et.al. 2404.02747 link
2024-04-03 Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition Behrooz Razeghi et.al. 2404.02696 null
2024-04-03 Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models Matteo Pennisi et.al. 2404.02618 null
2024-04-03 A Unified Editing Method for Co-Speech Gesture Generation via Diffusion Inversion Zeyu Zhao et.al. 2404.02411 null
2024-04-03 Enhancing Diffusion-based Point Cloud Generation with Smoothness Constraint Yukun Li et.al. 2404.02396 null
2024-04-02 Semantic Augmentation in Images using Language Sahiti Yerramilli et.al. 2404.02353 null
2024-04-02 Heat Death of Generative Models in Closed-Loop Learning Matteo Marchi et.al. 2404.02325 null
2024-04-02 APEX: Ambidextrous Dual-Arm Robotic Manipulation Using Collision-Free Generative Diffusion Models Apan Dastider et.al. 2404.02284 null
2024-04-02 Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148 link
2024-04-02 WcDT: World-centric Diffusion Transformer for Traffic Scene Generation Chen Yang et.al. 2404.02082 link
2024-04-03 AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design Xinze Li et.al. 2404.02003 null
2024-04-02 Bi-LORA: A Vision-Language Approach for Synthetic Image Detection Mamadou Keita et.al. 2404.01959 link
2024-04-02 Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model Xu He et.al. 2404.01862 link
2024-04-02 Upsample Guidance: Scale Up Diffusion Models without Training Juno Hwang et.al. 2404.01709 null
2024-04-02 FashionEngine: Interactive Generation and Editing of 3D Clothed Humans Tao Hu et.al. 2404.01655 null
2024-04-02 Diffusion Deepfake Chaitali Bhattacharyya et.al. 2404.01579 null
2024-04-01 Prior Frequency Guided Diffusion Model for Limited Angle (LA)-CBCT Reconstruction Jiacheng Xie et.al. 2404.01448 null
2024-04-01 DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery Yixuan Zhu et.al. 2404.01424 link
2024-03-29 Relation Rectification in Diffusion Model Yinwei Wu et.al. 2403.20249 null
2024-03-29 Motion Inversion for Video Customization Luozhou Wang et.al. 2403.20193 null
2024-03-29 FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models Barbara Toniella Corradini et.al. 2403.20105 null
2024-03-29 SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior Zhongrui Yu et.al. 2403.20079 null
2024-03-29 Probing solar modulation analytic models with cosmic ray periodic spectra Wei-Cheng Long et.al. 2403.20038 null
2024-04-01 Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting Haipeng Liu et.al. 2403.19898 link
2024-03-28 Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks Pooria Ashrafian et.al. 2403.19880 link
2024-03-28 ShapeFusion: A 3D diffusion model for localized shape editing Rolandos Alexandros Potamias et.al. 2403.19773 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652 null
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645 null
2024-03-28 In the driver's mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles Samir H. A. Mohammad et.al. 2403.19637 null
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600 link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593 null
2024-03-28 Impact of Resin Molecular Weight on Drying Kinetics and Sag of Coatings Marola W. Issa et.al. 2403.19544 null
2024-03-28 Debiasing Cardiac Imaging with Controlled Latent Diffusion Models Grzegorz Skorupko et.al. 2403.19508 link
2024-03-28 Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality Kyotaro Tokoro et.al. 2403.19428 link
2024-03-28 Imperceptible Protection against Style Imitation from Diffusion Models Namhyuk Ahn et.al. 2403.19254 null
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818 null
2024-03-28 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807 link
2024-03-27 Object Pose Estimation via the Aggregation of Diffusion Features Tianfu Wang et.al. 2403.18791 link
2024-03-27 ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object Chenshuang Zhang et.al. 2403.18775 link
2024-03-27 A Diffusion-Based Generative Equalizer for Music Restoration Eloi Moliner et.al. 2403.18636 link
2024-03-27 HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions Hao Xu et.al. 2403.18575 link
2024-03-27 Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning -- A Review Mohammadreza Amirian et.al. 2403.18565 null
2024-03-27 CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection Jiayi Zhu et.al. 2403.18554 null
2024-03-27 CT-3DFlow: Leveraging 3D Normalizing Flows for Unsupervised Detection of Pathological Pulmonary CT scans Aissam Djahnine et.al. 2403.18514 null
2024-03-27 Synthesizing EEG Signals from Event-Related Potential Paradigms with Conditional Diffusion Models Guido Klein et.al. 2403.18486 link
2024-03-26 AID: Attention Interpolation of Text-to-Image Diffusion Qiyuan He et.al. 2403.17924 link
2024-03-26 Boosting Diffusion Models with Moving Average Sampling in Frequency Domain Yurui Qian et.al. 2403.17870 null
2024-03-26 DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions Sammy Christen et.al. 2403.17827 null
2024-03-26 Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields Rüveyda Yilmaz et.al. 2403.17808 link
2024-03-26 GenesisTex: Adapting Image Denoising Diffusion to Texture Space Chenjian Gao et.al. 2403.17782 null
2024-03-26 CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation Yongrui Yu et.al. 2403.17770 null
2024-03-26 AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation Huawei Wei et.al. 2403.17694 link
2024-03-26 Manifold-Guided Lyapunov Control with Diffusion Models Amartya Mukherjee et.al. 2403.17692 link
2024-03-26 Not All Similarities Are Created Equal: Leveraging Data-Driven Biases to Inform GenAI Copyright Disputes Uri Hacohen et.al. 2403.17691 null
2024-03-26 DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation Qilin Wang et.al. 2403.17664 null
2024-03-25 SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Yuda Song et.al. 2403.16627 link
2024-03-25 SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation Aysim Toker et.al. 2403.16605 null
2024-03-25 Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization Xiangxin Zhou et.al. 2403.16576 null
2024-03-25 An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models Zizhao Hu et.al. 2403.16530 null
2024-03-25 Let Real Images be as a Judger, Spotting Fake Images Synthesized with Generative Models Ziyou Liang et.al. 2403.16513 null
2024-03-25 Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework Ziyao Huang et.al. 2403.16510 link
2024-03-25 Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation Sanyam Lakhanpal et.al. 2403.16422 null
2024-03-25 FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models Lin Zhao et.al. 2403.16379 null
2024-03-24 Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis Atefeh Khoshkhahtinat et.al. 2403.16258 null
2024-03-24 Skull-to-Face: Anatomy-Guided 3D Facial Reconstruction and Editing Yongqing Liang et.al. 2403.16207 null
2024-03-22 DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Hanrong Ye et.al. 2403.15389 null
2024-03-22 Ultrasound Imaging based on the Variance of a Diffusion Restoration Model Yuxin Zhang et.al. 2403.15316 link
2024-03-22 Controlled Training Data Generation with Diffusion Models Teresa Yeo et.al. 2403.15309 null
2024-03-22 Spectral Motion Alignment for Video Motion Transfer using Diffusion Models Geon Yeong Park et.al. 2403.15249 null
2024-03-22 Shadow Generation for Composite Image Using Diffusion model Qingyang Liu et.al. 2403.15234 link
2024-03-22 MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration Zhichao Wei et.al. 2403.15059 null
2024-03-22 Toward Tiny and High-quality Facial Makeup with Data Amplify Learning Qiaoqiao Jin et.al. 2403.15033 null
2024-03-22 Dynamics of a memory-based diffusion model with spatial heterogeneity and nonlinear boundary condition Quanli Ji et.al. 2403.14969 null
2024-03-22 DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow Kyungmin Lee et.al. 2403.14966 null
2024-03-22 CLIP-VQDiffusion: Language Free Training of Text To Image generation using CLIP and vector quantized diffusion model Seungdae Han et.al. 2403.14944 link
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621 link
2024-03-21 DreamReward: Text-to-3D Generation with Human Preference Junliang Ye et.al. 2403.14613 null
2024-03-21 ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi et.al. 2403.14602 null
2024-03-21 Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting Alicia Durrer et.al. 2403.14499 link
2024-03-21 Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation Mathias Öttl et.al. 2403.14429 null
2024-03-21 DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning Jonathan Lebensold et.al. 2403.14421 link
2024-03-21 Physics-Informed Diffusion Models Jan-Hendrik Bastek et.al. 2403.14404 link
2024-03-21 Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models Pablo Marcos-Manchón et.al. 2403.14291 link
2024-03-21 Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation Francesco Di Felice et.al. 2403.14279 null
2024-03-21 Diffusion Models with Ensembled Structure-Based Anomaly Scoring for Unsupervised Anomaly Detection Finn Behrendt et.al. 2403.14262 link
2024-03-20 Editing Massive Concepts in Text-to-Image Diffusion Models Tianwei Xiong et.al. 2403.13807 link
2024-03-20 ZigMa: Zigzag Mamba Diffusion Model Vincent Tao Hu et.al. 2403.13802 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800 null
2024-03-20 DepthFM: Fast Monocular Depth Estimation with Flow Matching Ming Gui et.al. 2403.13788 link
2024-03-20 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation Fu-Yun Wang et.al. 2403.13745 link
2024-03-20 DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance Zixuan Wang et.al. 2403.13667 link
2024-03-20 ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer Hiroki Azuma et.al. 2403.13652 link
2024-03-20 ReGround: Improving Textual and Spatial Grounding at No Cost Yuseung Lee et.al. 2403.13589 null
2024-03-20 Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing Hangeol Chang et.al. 2403.13551 link
2024-03-20 Compress3D: a Compressed Latent Space for 3D Generation from a Single Image Bowen Zhang et.al. 2403.13524 null
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963 link
2024-03-19 FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation Shuai Yang et.al. 2403.12962 link
2024-03-19 Zero-Reference Low-Light Enhancement via Physical Quadruple Priors Wenjing Wang et.al. 2403.12933 null
2024-03-19 Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model Jiajie Yang et.al. 2403.12915 link
2024-03-19 D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation Jun Yamada et.al. 2403.12861 null
2024-03-19 Generative Enhancement for 3D Medical Images Lingting Zhu et.al. 2403.12852 link
2024-03-19 Compositional 3D Scene Synthesis with Scene Graph Guided Layout-Shape Generation Yao Wei et.al. 2403.12848 null
2024-03-19 DreamDA: Generative Data Augmentation with Diffusion Models Yunxiang Fu et.al. 2403.12803 link
2024-03-19 WaveFace: Authentic Face Restoration with Efficient Frequency Recovery Yunqi Miao et.al. 2403.12760 null
2024-03-19 Towards Controllable Face Generation with Semantic Latent Diffusion Models Alex Ergasti et.al. 2403.12743 link
2024-03-18 Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models Emilian Postolache et.al. 2403.11706 link
2024-03-19 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697 null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667 link
2024-03-18 Arc2Face: A Foundation Model of Human Faces Foivos Paraperas Papantoniou et.al. 2403.11641 link
2024-03-18 LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Yang Yang et.al. 2403.11627 link
2024-03-18 CRS-Diff: Controllable Generative Remote Sensing Foundation Model Datao Tang et.al. 2403.11614 link
2024-03-18 EffiVED: Efficient Video Editing via Text-instruction Diffusion Models Zhenghao Zhang et.al. 2403.11568 link
2024-03-18 EchoReel: Enhancing Action Generation of Existing Video Diffusion Models Jianzhi Liu et.al. 2403.11535 link
2024-03-18 Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors Ruicheng Wang et.al. 2403.11503 null
2024-03-18 SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction Shuang Wang et.al. 2403.11482 link
2024-03-15 Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives Ronghui Li et.al. 2403.10518 link
2024-03-15 Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding Pengkun Liu et.al. 2403.10395 link
2024-03-15 Denoising Task Difficulty-based Curriculum for Training Diffusion Models Jin-Young Kim et.al. 2403.10348 null
2024-03-15 Optimal Control of Stationary Doubly Diffusive Flows on Two and Three Dimensional Bounded Lipschitz Domains: Numerical Analysis Jai Tushar et.al. 2403.10282 null
2024-03-15 Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder Jinseok Kim et.al. 2403.10255 null
2024-03-15 FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model Qijun Feng et.al. 2403.10242 null
2024-03-15 BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution Feng Li et.al. 2403.10211 link
2024-03-15 Spectral CT Two-step and One-step Material Decomposition using Diffusion Posterior Sampling Corentin Vazia et.al. 2403.10183 null
2024-03-15 Animate Your Motion: Turning Still Images into Dynamic Videos Mingxiao Li et.al. 2403.10179 null
2024-03-15 Being heterogeneous is disadvantageous: Brownian non-Gaussian searches Vittoria Sposini et.al. 2403.10138 null
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638 null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631 null
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630 link
2024-03-14 Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Fangfu Liu et.al. 2403.09625 null
2024-03-14 Score-Guided Diffusion for 3D Human Recovery Anastasis Stathopoulos et.al. 2403.09623 link
2024-03-14 Explore In-Context Segmentation via Latent Diffusion Models Chaoyang Wang et.al. 2403.09616 null
2024-03-14 MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models Zunnan Xu et.al. 2403.09471 link
2024-03-14 Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing Wonjun Kang et.al. 2403.09468 link
2024-03-14 Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk Zhangheng Li et.al. 2403.09450 link
2024-03-14 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation Frank Zhang et.al. 2403.09439 null
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764 null
2024-03-13 Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08758 null
2024-03-13 Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08749 null
2024-03-14 GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing Jing Wu et.al. 2403.08733 link
2024-03-13 Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data Asad Aali et.al. 2403.08728 link
2024-03-13 Data Augmentation in Human-Centric Vision Wentao Jiang et.al. 2403.08650 null
2024-03-13 ActionDiffusion: An Action-aware Diffusion Model for Procedure Planning in Instructional Videos Lei Shi et.al. 2403.08591 null
2024-03-13 Federated Knowledge Graph Unlearning via Diffusion Model Bingchen Liu et.al. 2403.08554 null
2024-03-13 Model Will Tell: Training Membership Inference for Diffusion Models Xiaomeng Fu et.al. 2403.08487 null
2024-03-13 MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction Linjie Fu et.al. 2403.08479 link
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860 link
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842 null
2024-03-12 MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model Guibo Luo et.al. 2403.07838 null
2024-03-13 SemCity: Semantic Scene Generation with Triplane Diffusion Jumin Lee et.al. 2403.07773 link
2024-03-12 Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model Yuxuan Zhang et.al. 2403.07764 link
2024-03-12 SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces Yuta Oshima et.al. 2403.07711 link
2024-03-12 Visual Privacy Auditing with Diffusion Models Kristian Schwethelm et.al. 2403.07588 null
2024-03-12 D4D: An RGBD diffusion model to boost monocular depth estimation L. Papa et.al. 2403.07516 link
2024-03-12 Block-wise LoRA: Revisiting Fine-grained LoRA for Effective Personalization and Stylization in Text-to-Image Generation Likun Li et.al. 2403.07500 null
2024-03-12 Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models Phuong Dam et.al. 2403.07371 null
2024-03-11 BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion Xuan Ju et.al. 2403.06976 link
2024-03-11 Bayesian Diffusion Models for 3D Shape Reconstruction Haiyang Xu et.al. 2403.06973 null
2024-03-11 POD-ROM methods: from a finite set of snapshots to continuous-in-time approximations Bosco Garcia-Archilla et.al. 2403.06967 null
2024-03-11 SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li et.al. 2403.06952 null
2024-03-12 DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations Tianhao Qi et.al. 2403.06951 link
2024-03-11 Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction Qing Xiao et.al. 2403.06940 null
2024-03-11 Estimation of parameters and local times in a discretely observed threshold diffusion model Sara Mazzonetto et.al. 2403.06858 null
2024-03-11 Multistep Consistency Models Jonathan Heek et.al. 2403.06807 null
2024-03-11 Distribution-Aware Data Expansion with Diffusion Models Haowei Zhu et.al. 2403.06741 link
2024-03-11 V3D: Video Diffusion Models are Effective 3D Generators Zilong Chen et.al. 2403.06738 link
2024-03-08 VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models Yabo Zhang et.al. 2403.05438 link
2024-03-08 DiffSF: Diffusion Models for Scene Flow Estimation Yushan Zhang et.al. 2403.05327 link
2024-03-08 Noise Level Adaptive Diffusion Model for Robust Reconstruction of Accelerated MRI Shoujin Huang et.al. 2403.05245 link
2024-03-08 Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation Junyan Wang et.al. 2403.05239 null
2024-03-08 Denoising Autoregressive Representation Learning Yazhe Li et.al. 2403.05196 null
2024-03-08 DiffuLT: How to Make Diffusion Model Useful for Long-tail Recognition Jie Shao et.al. 2403.05170 null
2024-03-08 GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting Francesco Palandra et.al. 2403.05154 null
2024-03-08 Improving Diffusion Models for Virtual Try-on Yisol Choi et.al. 2403.05139 link
2024-03-08 ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment Xiwei Hu et.al. 2403.05135 null
2024-03-08 CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion Wendi Zheng et.al. 2403.05121 null
2024-03-07 ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes Hashmat Shadab Malik et.al. 2403.04701 link
2024-03-07 Delving into the Trajectory Long-tail Distribution for Multi-object Tracking Sijia Chen et.al. 2403.04700 link
2024-03-07 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Junsong Chen et.al. 2403.04692 link
2024-03-08 Pix2Gif: Motion-Guided Diffusion for GIF Generation Hitesh Kandala et.al. 2403.04634 link
2024-03-07 A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images Cristiana Tiago et.al. 2403.04612 null
2024-03-07 Anatomy-Guided Surface Diffusion Model for Alzheimer's Disease Normative Modeling Jianwei Zhang et.al. 2403.04531 null
2024-03-07 Effect of turbulent diffusion in modeling anaerobic digestion Jeremy Z. Yan et.al. 2403.04457 null
2024-03-07 Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser Qingyuan Cai et.al. 2403.04444 link
2024-03-07 StableDrag: Stable Dragging for Point-based Image Editing Yutao Cui et.al. 2403.04437 null
2024-03-07 On-demand Quantization for Green Federated Generative Diffusion in Mobile Edge Networks Bingkun Lai et.al. 2403.04430 null
2024-03-06 GUIDE: Guidance-based Incremental Learning with Diffusion Models Bartosz Cywiński et.al. 2403.03938 link
2024-03-06 Latent Dataset Distillation with Diffusion Models Brian B. Moser et.al. 2403.03881 null
2024-03-06 Accelerating Convergence of Score-Based Diffusion Models, Provably Gen Li et.al. 2403.03852 null
2024-03-06 Diffusion on language model embeddings for protein sequence generation Viacheslav Meshchaninov et.al. 2403.03726 null
2024-03-06 Efficient Search and Learning for Agile Locomotion on Stepping Stones Adithya Kumar Chinnakkonda Ravi et.al. 2403.03639 null
2024-03-06 Diffusion-based Generative Prior for Low-Complexity MIMO Channel Estimation Benedikt Fesl et.al. 2403.03545 link
2024-03-06 NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging Takahiro Shirakawa et.al. 2403.03485 link
2024-03-06 FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion Hao Wang et.al. 2403.03463 link
2024-03-06 Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing Bingyan Liu et.al. 2403.03431 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206 null
2024-03-05 MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets Hossein Aboutalebi et.al. 2403.03194 link
2024-03-05 NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models Zeqian Ju et.al. 2403.03100 null
2024-03-05 Global N-body Simulation of Gap Edge Structures Created by Perturbations from a Small Satellite Embedded in Saturn's Rings Naoya Torii et.al. 2403.03012 null
2024-03-05 Cross-Domain Image Conversion by CycleDM Sho Shimotsumagari et.al. 2403.02919 null
2024-03-05 MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model Sen Wang et.al. 2403.02905 link
2024-03-05 Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders Daniele Mari et.al. 2403.02887 null
2024-03-05 Zero-LED: Zero-Reference Lighting Estimation Diffusion Model for Low-Light Image Enhancement Jinhong He et.al. 2403.02879 null
2024-03-05 Scalable Continuous-time Diffusion Framework for Network Inference and Influence Estimation Keke Huang et.al. 2403.02867 link
2024-03-05 Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation Weijie Li et.al. 2403.02827 null
2024-03-02 DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction Junwen Xiong et.al. 2403.01226 null
2024-03-02 TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion Salaheldin Mohamed et.al. 2403.01212 null
2024-03-02 Training Unbiased Diffusion Models From Biased Dataset Yeongmin Kim et.al. 2403.01189 link
2024-03-02 Volume diffusion modelling of a sheared granular gas Duncan Dockar et.al. 2403.01188 null
2024-03-02 Text-guided Explorable Image Super-resolution Kanchana Vaishnavi Gandikota et.al. 2403.01124 null
2024-03-02 Face Swap via Diffusion Model Feifei Wang et.al. 2403.01108 link
2024-03-01 A time-stepping deep gradient flow method for option pricing in (rough) diffusion models Antonis Papapantoleon et.al. 2403.00746 null
2024-03-01 Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks Yuhao Liu et.al. 2403.00644 null
2024-03-01 Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset Ander Salaberria et.al. 2403.00587 link
2024-03-01 Rethinking cluster-conditioned diffusion models Nikolas Adaloglou et.al. 2403.00570 link
2024-02-29 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Muyang Li et.al. 2402.19481 link
2024-02-29 Towards Generalizable Tumor Synthesis Qi Chen et.al. 2402.19470 link
2024-02-29 Listening to the Noise: Blind Denoising with Gibbs Diffusion David Heurtel-Depeiges et.al. 2402.19455 link
2024-02-29 Structure Preserving Diffusion Models Haoye Lu et.al. 2402.19369 null
2024-02-29 A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation Hanxi Li et.al. 2402.19330 link
2024-02-29 DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly Gianluca Scarpellini et.al. 2402.19302 link
2024-02-29 TEncDM: Understanding the Properties of Diffusion Model in the Space of Language Model Encodings Alexander Shabalin et.al. 2402.19097 link
2024-03-01 Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach Sarina Thomas et.al. 2402.19062 null
2024-02-29 WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis Paul Friedrich et.al. 2402.19043 link
2024-02-29 Generating, Reconstructing, and Representing Discrete and Continuous Data: Generalized Diffusion with Learnable Encoding-Decoding Guangyi Liu et.al. 2402.19009 null
2024-02-28 Logarithmic Sobolev Inequalities for Bounded Domains and Applications to Drift-Diffusion Equations Elie Abdo et.al. 2402.18572 null
2024-02-28 Dynamical Regimes of Diffusion Models Giulio Biroli et.al. 2402.18491 null
2024-02-28 Deep Confident Steps to New Pockets: Strategies for Docking Generalization Gabriele Corso et.al. 2402.18396 link
2024-02-28 Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model Sangjoon Park et.al. 2402.18362 null
2024-02-28 FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes Ziying Pan et.al. 2402.18331 link
2024-02-28 Balancing Act: Distribution-Guided Debiasing in Diffusion Models Rishubh Parihar et.al. 2402.18206 null
2024-02-28 Diffusion-based Neural Network Weights Generation Bedionita Soro et.al. 2402.18153 link
2024-02-28 Context-aware Talking Face Video Generation Meidai Xuanyuan et.al. 2402.18092 null
2024-02-28 Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis Yanzuo Lu et.al. 2402.18078 link
2024-02-28 SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model Bin Cao et.al. 2402.18068 link
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768 null
2024-02-27 Structure-Guided Adversarial Training of Diffusion Models Ling Yang et.al. 2402.17563 null
2024-02-27 Diffusion Model-Based Image Editing: A Survey Yi Huang et.al. 2402.17525 link
2024-02-27 Label-Noise Robust Diffusion Models Byeonghu Na et.al. 2402.17517 link
2024-02-27 EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Linrui Tian et.al. 2402.17485 null
2024-02-27 DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model Shyam Marjit et.al. 2402.17412 null
2024-02-27 Generative diffusion model for surface structure discovery Nikolaj Rønne et.al. 2402.17404 null
2024-02-27 Denoising Diffusion Models for Inpainting of Healthy Brain Tissue Alicia Durrer et.al. 2402.17307 null
2024-02-27 DivAvatar: Diverse 3D Avatar Generation with a Single Prompt Weijing Tao et.al. 2402.17292 null
2024-02-27 Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network Zhaoyang Wang et.al. 2402.17285 link
2024-02-26 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506 link
2024-02-26 Outline-Guided Object Inpainting with Diffusion Models Markus Pobitzer et.al. 2402.16421 null
2024-02-26 Placing Objects in Context via Inpainting for Out-of-distribution Segmentation Pau de Jorge et.al. 2402.16392 link
2024-02-26 Generative AI in Vision: A Survey on Models, Metrics and Applications Gaurav Raut et.al. 2402.16369 null
2024-02-26 Feedback Efficient Online Fine-Tuning of Diffusion Models Masatoshi Uehara et.al. 2402.16359 null
2024-02-26 Graph Diffusion Policy Optimization Yijing Liu et.al. 2402.16302 link
2024-02-25 Photon-counting CT using a Conditional Diffusion Model for Super-resolution and Texture-preservation Christopher Wiedeman et.al. 2402.16212 null
2024-02-25 Towards Efficient Quantum Hybrid Diffusion Models Francesca De Falco et.al. 2402.16147 null
2024-02-25 Cinematographic Camera Diffusion Model Hongda Jiang et.al. 2402.16143 link
2024-02-25 Behavioral Refinement via Interpolant-based Policy Diffusion Kaiqi Chen et.al. 2402.16075 link
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504 link
2024-02-23 ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation Yi Zhang et.al. 2402.15429 link
2024-02-23 Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models Shunyu Liu et.al. 2402.15289 link
2024-02-23 Weak Reproductive Solutions for a Convection-Diffusion Model Describing a Binary Alloy Solidification Processes Blanca Climent-Ezquerra et.al. 2402.15221 null
2024-02-23 Label-efficient Multi-organ Segmentation Method with Diffusion Model Yongzhi Huang et.al. 2402.15216 null
2024-02-23 Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control Masatoshi Uehara et.al. 2402.15194 null
2024-02-23 Dynamics-Guided Diffusion Model for Robot Manipulator Design Xiaomeng Xu et.al. 2402.15038 null
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817 null
2024-02-22 Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren et.al. 2402.14780 null
2024-02-22 Debiasing Text-to-Image Diffusion Models Ruifei He et.al. 2402.14577 null
2024-02-22 Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems Christina Schenk et.al. 2402.14446 null
2024-02-22 Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning Haoran He et.al. 2402.14407 null
2024-02-22 Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment Zhaoyang Wang et.al. 2402.14401 link
2024-02-22 Typographic Text Generation with Off-the-Shelf Diffusion Model KhayTze Peong et.al. 2402.14314 null
2024-02-22 Font Style Interpolation with Diffusion Models Tetta Kondo et.al. 2402.14311 null
2024-02-22 Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion Yujia Huang et.al. 2402.14285 link
2024-02-22 MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion Xin-Yang Zheng et.al. 2402.14253 null
2024-02-21 Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate Yuchen Liang et.al. 2402.13901 null
2024-02-21 NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion Haoyu Li et.al. 2402.13809 null
2024-02-21 Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions Jiayu Chen et.al. 2402.13777 link
2024-02-21 Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion Lianghu Guo et.al. 2402.13776 null
2024-02-21 Music Style Transfer with Time-Varying Inversion of Diffusion Models Sifei Li et.al. 2402.13763 null
2024-02-21 SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model Xudong Ling et.al. 2402.13737 link
2024-02-21 Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation Kihong Kim et.al. 2402.13729 null
2024-02-21 Flexible Physical Camouflage Generation Based on a Differential Approach Yang Li et.al. 2402.13575 null
2024-02-21 ToDo: Token Downsampling for Efficient Generation of High-Resolution Images Ethan Smith et.al. 2402.13573 null
2024-02-21 Generative AI for Secure Physical Layer Communications: A Survey Changyuan Zhao et.al. 2402.13553 null
2024-02-20 Neural Network Diffusion Kai Wang et.al. 2402.13144 link
2024-02-20 Text-Guided Molecule Generation with Diffusion Language Model Haisong Gong et.al. 2402.13040 link
2024-02-20 Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong et.al. 2402.12974 link
2024-02-20 CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection Sohail Ahmed Khan et.al. 2402.12927 link
2024-02-20 RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models Xinchen Zhang et.al. 2402.12908 link
2024-02-20 Two-stage Rainfall-Forecasting Diffusion Model XuDong Ling et.al. 2402.12779 link
2024-02-20 MuLan: Multimodal-LLM Agent for Progressive Multi-Object Diffusion Sen Li et.al. 2402.12741 link
2024-02-20 Diffusion Posterior Sampling is Computationally Intractable Shivam Gupta et.al. 2402.12727 null
2024-02-20 MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction Shitao Tang et.al. 2402.12712 null
2024-02-20 SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion Liumeng Xue et.al. 2402.12660 link
2024-02-19 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu et.al. 2402.12376 link
2024-02-19 Synthetic location trajectory generation using categorical diffusion models Simon Dirmeier et.al. 2402.12242 link
2024-02-19 Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training Leo Hyun Park et.al. 2402.12187 null
2024-02-19 Human Video Translation via Query Warping Haiming Zhu et.al. 2402.12099 null
2024-02-19 Direct Consistency Optimization for Compositional Text-to-Image Personalization Kyungmin Lee et.al. 2402.12004 null
2024-02-19 Privacy-Preserving Low-Rank Adaptation for Latent Diffusion Models Zihao Luo et.al. 2402.11989 link
2024-02-19 DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation Chong Zeng et.al. 2402.11929 link
2024-02-19 A Generative Pre-Training Framework for Spatio-Temporal Graph Transfer Learning Yuan Yuan et.al. 2402.11922 link
2024-02-19 ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image Yan Hong et.al. 2402.11849 null
2024-02-19 UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models Yihua Zhang et.al. 2402.11846 link
2024-02-16 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations Tsung-Wei Ke et.al. 2402.10885 null
2024-02-16 Training Class-Imbalanced Diffusion Model Via Overlap Optimization Divin Yan et.al. 2402.10821 link
2024-02-16 VATr++: Choose Your Words Wisely for Handwritten Text Generation Bram Vanherle et.al. 2402.10798 null
2024-02-16 Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation Hongbin Na et.al. 2402.10699 null
2024-02-16 Generative AI and Attentive User Interfaces: Five Strategies to Enhance Take-Over Quality in Automated Driving Patrick Ebel et.al. 2402.10664 null
2024-02-16 Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model Xiangyu Zhang et.al. 2402.10642 null
2024-02-16 U$^2$MRPD: Unsupervised undersampled MRI reconstruction by prompting a large latent diffusion model Ziqi Gao et.al. 2402.10609 link
2024-02-16 A maximum likelihood estimation of Lévy-driven stochastic systems for univariate and multivariate time series of observations Babak M. S. Arani et.al. 2402.10608 null
2024-02-16 Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation Lanqing Guo et.al. 2402.10491 link
2024-02-16 Explaining generative diffusion models via visual analysis for interpretable decision-making process Ji-Hoon Park et.al. 2402.10404 link
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210 null
2024-02-15 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207 link
2024-02-15 Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model Mariia Drozdova et.al. 2402.10204 link
2024-02-15 Classification Diffusion Models Shahar Yadin et.al. 2402.10095 null
2024-02-15 Diffusion Models Meet Contextual Bandits with Large Action Spaces Imad Aouali et.al. 2402.10028 null
2024-02-15 Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion Hila Manor et.al. 2402.10009 null
2024-02-15 Accelerating Parallel Sampling of Diffusion Models Zhiwei Tang et.al. 2402.09970 link
2024-02-15 Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation Junjie Shentu et.al. 2402.09966 link
2024-02-15 Lester: rotoscope animation through video object segmentation and tracking Ruben Tous et.al. 2402.09883 link
2024-02-15 Diffusion Models for Audio Restoration Jean-Marie Lemercier et.al. 2402.09821 null
2024-02-14 Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection Pengfei Zhou et.al. 2402.09242 link
2024-02-14 Semi-Supervised Diffusion Model for Brain Age Prediction Ayodeji Ijishakin et.al. 2402.09137 null
2024-02-14 L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects Yutaro Yamada et.al. 2402.09052 null
2024-02-14 Extreme Video Compression with Pre-trained Diffusion Models Bohan Li et.al. 2402.08934 link
2024-02-14 The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes Myeongseob Ko et.al. 2402.08922 link
2024-02-13 Percolating transition to turbulence without puffs or bands Sébastien Gomé et.al. 2402.08829 null
2024-02-13 LDTrack: Dynamic People Tracking by Service Robots using Diffusion Models Angus Fung et.al. 2402.08774 null
2024-02-13 Towards the Detection of AI-Synthesized Human Face Images Yuhang Lu et.al. 2402.08750 null
2024-02-13 PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models Fei Deng et.al. 2402.08714 null
2024-02-13 Chain Reaction of Ideas: Can Radioactive Decay Predict Technological Innovation? Guilherme S. Y. Giardini et.al. 2402.08681 null
2024-02-13 Target Score Matching Valentin De Bortoli et.al. 2402.08667 null
2024-02-13 Learning Continuous 3D Words for Text-to-Image Generation Ta-Ying Cheng et.al. 2402.08654 link
2024-02-13 Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator Amartya Mukherjee et.al. 2402.08563 null
2024-02-13 Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases Ziyi Zhang et.al. 2402.08552 link
2024-02-13 A Dense Reward View on Aligning Text-to-Image Diffusion with Preference Shentao Yang et.al. 2402.08265 link
2024-02-13 Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation AprilPyone MaungMaung et.al. 2402.08200 null
2024-02-12 Convergence Analysis of Discrete Diffusion Model: Exact Implementation through Uniformization Hongrui Chen et.al. 2402.08095 null
2024-02-12 Nearest Neighbour Score Estimators for Diffusion Generative Models Matthew Niedoba et.al. 2402.08018 link
2024-02-12 Towards a mathematical theory for consistency training in diffusion models Gen Li et.al. 2402.07802 null
2024-02-12 Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models Jiacheng Ye et.al. 2402.07754 link
2024-02-12 Cosmology at the Field Level with Probabilistic Machine Learning Adam Rouhiainen et.al. 2402.07694 null
2024-02-12 Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback Cansu Korkmaz et.al. 2402.07597 null
2024-02-12 Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial Wenpin Tang et.al. 2402.07487 null
2024-02-12 SALAD: Smart AI Language Assistant Daily Ragib Amin Nihal et.al. 2402.07431 null
2024-02-12 Diff-RNTraj: A Structure-aware Diffusion Model for Road Network-constrained Trajectory Generation Tonglong Wei et.al. 2402.07369 link
2024-02-11 Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL Sungyoon Kim et.al. 2402.07226 link
2024-02-11 Towards Fast Stochastic Sampling in Diffusion Generative Models Kushagra Pandey et.al. 2402.07211 null
2024-02-10 Synthesizing CTA Image Data for Type-B Aortic Dissection using Stable Diffusion Models Ayman Abaid et.al. 2402.06969 null
2024-02-09 Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following Brian Yang et.al. 2402.06559 link
2024-02-09 Sequential Flow Matching for Generative Modeling Jongmin Yoon et.al. 2402.06461 null
2024-02-09 ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation Fengyi Shen et.al. 2402.06446 null
2024-02-09 Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation Peter Hönig et.al. 2402.06436 null
2024-02-09 Particle Denoising Diffusion Sampler Angus Phillips et.al. 2402.06320 link
2024-02-09 Controllable seismic velocity synthesis using generative diffusion models Fu Wang et.al. 2402.06277 null
2024-02-09 MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models Yixiao Zhang et.al. 2402.06178 link
2024-02-08 CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models Maitreya Suin et.al. 2402.06106 null
2024-02-08 Animated Stickers: Bringing Stickers to Life with Video Diffusion David Yan et.al. 2402.06088 null
2024-02-08 DiscDiff: Latent Diffusion Model for DNA Sequence Generation Zehui Li et.al. 2402.06079 null
2024-02-08 InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Chengjian Feng et.al. 2402.05937 null
2024-02-08 Time Series Diffusion in the Frequency Domain Jonathan Crabbé et.al. 2402.05933 link
2024-02-08 AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning Wamiq Reyaz Para et.al. 2402.05803 null
2024-02-08 DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer Zhiyuan Ma et.al. 2402.05712 link
2024-02-08 Scalable Diffusion Models with State Space Backbone Zhengcong Fei et.al. 2402.05608 link
2024-02-08 Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models Senmao Li et.al. 2402.05375 link
2024-02-08 Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model Junghun Cha et.al. 2402.05350 null
2024-02-07 SPAD : Spatially Aware Multiview Diffusers Yash Kant et.al. 2402.05235 null
2024-02-07 Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models Nicholas Konz et.al. 2402.05210 link
2024-02-07 $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space Maitreya Patel et.al. 2402.05195 null
2024-02-07 On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling Marcin Sendera et.al. 2402.05098 link
2024-02-07 NITO: Neural Implicit Fields for Resolution-free Topology Optimization Amin Heyrani Nobari et.al. 2402.05073 link
2024-02-07 LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation Jiaxiang Tang et.al. 2402.05054 null
2024-02-07 Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design Andrew Campbell et.al. 2402.04997 link
2024-02-07 Blue noise for diffusion models Xingchang Huang et.al. 2402.04930 link
2024-02-07 Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation Shivang Chopra et.al. 2402.04929 null
2024-02-07 Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints Jian Chen et.al. 2402.04754 link
2024-02-07 Cortical Surface Diffusion Generative Models Zhenshan Xie et.al. 2402.04753 null
2024-02-07 EvoSeed: Unveiling the Threat on Deep Neural Networks with Real-World Illusions Shashank Kotyan et.al. 2402.04699 link
2024-02-07 Noise Map Guidance: Inversion with Spatial Context for Real Image Editing Hansam Cho et.al. 2402.04625 link
2024-02-06 Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation Zolnamar Dorjsembe et.al. 2402.04031 link
2024-02-06 Space Group Constrained Crystal Generation Rui Jiao et.al. 2402.03992 null
2024-02-06 Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting Yiming Xu et.al. 2402.03981 null
2024-02-06 EscherNet: A Generative Model for Scalable View Synthesis Xin Kong et.al. 2402.03908 link
2024-02-06 On gauge freedom, conservativity and intrinsic dimensionality estimation in diffusion models Christian Horvat et.al. 2402.03845 null
2024-02-06 SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising Yu-Tung Liu et.al. 2402.03808 link
2024-02-06 FoolSDEdit: Deceptively Steering Your Edits Towards Targeted Attribute-aware Distribution Qi Zhou et.al. 2402.03705 null
2024-02-06 Improving and Unifying Discrete&Continuous-time Discrete Denoising Diffusion Lingxiao Zhao et.al. 2402.03701 link
2024-02-06 Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation Lingxiao Zhao et.al. 2402.03687 link
2024-02-06 QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning Haoxuan Wang et.al. 2402.03666 link
2024-02-05 Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? Qiyao Liang et.al. 2402.03305 null
2024-02-05 Zero-shot Object-Level OOD Detection with Context-Aware Inpainting Quang-Huy Nguyen et.al. 2402.03292 null
2024-02-05 InstanceDiffusion: Instance-level Control for Image Generation Xudong Wang et.al. 2402.03290 link
2024-02-05 Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? Anna Yoo Jeong Ha et.al. 2402.03214 null
2024-02-05 Light and Optimal Schrödinger Bridge Matching Nikita Gushchin et.al. 2402.03207 link
2024-02-05 Guidance with Spherical Gaussian Constraint for Conditional Diffusion Lingxiao Yang et.al. 2402.03201 link
2024-02-05 Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion Shiyuan Yang et.al. 2402.03162 null
2024-02-05 PFDM: Parser-Free Virtual Try-on via Diffusion Model Yunfang Niu et.al. 2402.03047 null
2024-02-05 Diffusive Gibbs Sampling Wenlin Chen et.al. 2402.03008 link
2024-02-05 DexDiffuser: Generating Dexterous Grasps with Diffusion Models Zehang Weng et.al. 2402.02989 null
2024-02-02 NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties Jingyuan Sun et.al. 2402.01590 null
2024-02-02 Boximator: Generating Rich and Controllable Motions for Video Synthesis Jiawei Wang et.al. 2402.01566 null
2024-02-02 Cross-view Masked Diffusion Transformers for Person Image Synthesis Trung X. Pham et.al. 2402.01516 link
2024-02-02 Conditioning non-linear and infinite-dimensional diffusion processes Elizabeth Louise Baker et.al. 2402.01434 link
2024-02-02 Bass Accompaniment Generation via Latent Diffusion Marco Pasini et.al. 2402.01412 null
2024-02-02 Cheating Suffix: Targeted Attack to Text-To-Image Diffusion Models with Multi-Modal Priors Dingcheng Yang et.al. 2402.01369 link
2024-02-02 Unsupervised Generation of Pseudo Normal PET from MRI with Diffusion Model for Epileptic Focus Localization Wentao Chen et.al. 2402.01191 null
2024-02-01 Unconditional Latent Diffusion Models Memorize Patient Imaging Data Salman Ul Hassan Dar et.al. 2402.01054 link
2024-02-01 pop-cosmos: A comprehensive picture of the galaxy population from COSMOS data Justin Alsing et.al. 2402.00935 null
2024-02-01 Data-Space Validation of High-Dimensional Models by Comparing Sample Quantiles Stephen Thorp et.al. 2402.00930 null
2024-02-01 ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields Jiahua Dong et.al. 2402.00864 link
2024-02-01 An Analysis of the Variance of Diffusion-based Speech Enhancement Bunlong Lay et.al. 2402.00811 null
2024-02-01 Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching Shangzhe Li et.al. 2402.00807 null
2024-02-01 AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Fu-Yun Wang et.al. 2402.00769 link
2024-02-01 Cylindrically symmetric diffusion model for relativistic heavy-ion collisions Johannes Hoelck et.al. 2402.00628 null
2024-02-01 CapHuman: Capture Your Moments in Parallel Universes Chao Liang et.al. 2402.00627 link
2024-02-01 Masked Conditional Diffusion Model for Enhancing Deepfake Detection Tiewen Chen et.al. 2402.00541 null
2024-02-01 Energetic Particles in the Central Starburst, Disc, and Halo of NGC253 Yoel Rephaeli et.al. 2402.00523 null
2024-02-01 LRDif: Diffusion Models for Under-Display Camera Emotion Recognition Zhifeng Wang et.al. 2402.00250 null
2024-01-31 SuperDiff: Diffusion Models for Conditional Generation of Hypothetical New Families of Superconductors Samuel Yuan et.al. 2402.00198 link
2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators Daniel Geng et.al. 2401.18085 null
2024-01-31 Ljusternik-Schnirelmann eigenvalues for the fractional $m-$Laplacian without the $Δ_2$ condition Julian Fernandez Bonder et.al. 2401.18041 null
2024-01-31 Diagnosing the particle transport mechanism in the pulsar halo via X-ray observations Qi-Zuo Wu et.al. 2401.17982 null
2024-01-31 Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances Xuefeng Gao et.al. 2401.17958 null
2024-01-31 AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error Jonas Ricker et.al. 2401.17879 link
2024-01-31 Drift Diffusion Model to understand (mis)information sharing dynamic in complex networks Lucila G. Alvarez-Zuzek et.al. 2401.17846 null
2024-01-31 A new class of efficient high order semi-Lagrangian IMEX discontinuous Galerkin methods on staggered unstructured meshes M. Tavelli et.al. 2401.17806 null
2024-01-31 Dance-to-Music Generation with Encoder-based Textual Inversion of Diffusion Models Sifei Li et.al. 2401.17800 link
2024-01-31 Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation Yuanhuiyi Lyu et.al. 2401.17664 null
2024-01-31 Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models Kyungsung Lee et.al. 2401.17629 null
2024-01-30 You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation Mehdi Noroozi et.al. 2401.17258 null
2024-01-30 ContactGen: Contact-Guided Interactive 3D Human Generation for Partners Dongjun Gu et.al. 2401.17212 null
2024-01-30 Transfer Learning for Text Diffusion Models Kehang Han et.al. 2401.17181 null
2024-01-30 PlantoGraphy: Incorporating Iterative Design Process into Generative Artificial Intelligence for Landscape Rendering Rong Huang et.al. 2401.17120 null
2024-01-30 Local modification of subdiffusion by initial Fickian diffusion: Multiscale modeling, analysis and computation Xiangcheng Zheng et.al. 2401.16885 null
2024-01-30 A Literature Review on Fetus Brain Motion Correction in MRI Haoran Zhang et.al. 2401.16782 null
2024-01-30 BoostDream: Efficient Refining for High-Quality Text-to-3D Generation from Multi-View Diffusion Yonghao Yu et.al. 2401.16764 null
2024-01-30 Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization Henglei Lv et.al. 2401.16762 null
2024-01-30 Diffusion model for relational inference Shuhan Zheng et.al. 2401.16755 null
2024-01-29 Using multiple Dirac delta points to describe inhomogeneous flux density over a cell boundary in a single-cell diffusion model Qiyao Peng et.al. 2401.16261 null
2024-01-29 Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models Zhongjie Duan et.al. 2401.16224 null
2024-01-29 Spatial-Aware Latent Initialization for Controllable Image Generation Wenqiang Sun et.al. 2401.16157 null
2024-01-29 DMCE: Diffusion Model Channel Enhancer for Multi-User Semantic Communication Systems Youcheng Zeng et.al. 2401.16017 null
2024-01-29 Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling Xiaoyu Shi et.al. 2401.15977 null
2024-01-29 EmoDM: A Diffusion Model for Evolutionary Multi-objective Optimization Xueming Yan et.al. 2401.15931 null
2024-01-28 Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding Jianxiang Lu et.al. 2401.15708 null
2024-01-28 Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance Qingcheng Zhao et.al. 2401.15687 null
2024-01-28 CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement Xiaowen Shi et.al. 2401.15649 null
2024-01-28 FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models Feihong He et.al. 2401.15636 link
2024-01-26 Annotated Hands for Generative Models Yue Yang et.al. 2401.15075 link
2024-01-26 Text Image Inpainting via Global Structure-Guided Diffusion Models Shipeng Zhu et.al. 2401.14832 link
2024-01-25 Opposite variations for pore pressure on and off the fault during simulated earthquakes in the laboratory Dong Liu et.al. 2401.14506 null
2024-01-25 Deconstructing Denoising Diffusion Models for Self-Supervised Learning Xinlei Chen et.al. 2401.14404 null
2024-01-25 pix2gestalt: Amodal Segmentation by Synthesizing Wholes Ege Ozguroglu et.al. 2401.14398 link
2024-01-25 UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models Timo Kapsalis et.al. 2401.14379 null
2024-01-25 Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation Minglin Chen et.al. 2401.14257 null
2024-01-26 Image Synthesis with Graph Conditioning: CLIP-Guided Diffusion Models for Scene Graphs Rameshwar Mishra et.al. 2401.14111 null
2024-01-25 CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion Nisha Huang et.al. 2401.14066 link
2024-01-25 Diffusion-based Data Augmentation for Object Counting Problems Zhen Wang et.al. 2401.13992 null
2024-01-25 BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models Senthil Purushwalkam et.al. 2401.13974 link
2024-01-25 StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models Yalong Bai et.al. 2401.13942 null
2024-01-24 Inverse Molecular Design with Multi-Conditional Diffusion Guidance Gang Liu et.al. 2401.13858 link
2024-01-24 Guided Diffusion for Fast Inverse Design of Density-based Mechanical Metamaterials Yanyan Yang et.al. 2401.13570 link
2024-01-24 UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion Wei Li et.al. 2401.13388 null
2024-01-24 Generative Design of Crystal Structures by Point Cloud Representations and Diffusion Model Zhelin Li et.al. 2401.13192 link
2024-01-24 Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model Yuanming Li et.al. 2401.13191 null
2024-01-24 Compositional Generative Inverse Design Tailin Wu et.al. 2401.13171 link
2024-01-24 Choose Your Diffusion: Efficient and flexible ways to accelerate the diffusion model in fast high energy physics simulation Cheng Jiang et.al. 2401.13162 null
2024-01-23 GALA: Generating Animatable Layered Assets from a Single Scan Taeksoo Kim et.al. 2401.12979 null
2024-01-24 Zero-Shot Learning for the Primitives of 3D Affordance in General Objects Hyeonwoo Kim et.al. 2401.12978 link
2024-01-23 Lumiere: A Space-Time Diffusion Model for Video Generation Omer Bar-Tal et.al. 2401.12945 null
2024-01-23 UniHDA: Towards Universal Hybrid Domain Adaptation of Image Generators Hengjia Li et.al. 2401.12596 null
2024-01-23 ToDA: Target-oriented Diffusion Attacker against Recommendation System Xiaohao Liu et.al. 2401.12578 null
2024-01-23 DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations Dogyun Park et.al. 2401.12517 link
2024-01-22 DITTO: Diffusion Inference-Time T-Optimization for Music Generation Zachary Novack et.al. 2401.12179 null
2024-01-22 Single-View 3D Human Digitalization with Large Reconstruction Models Zhenzhen Weng et.al. 2401.12175 null
2024-01-22 Feature Denoising Diffusion Model for Blind Image Quality Assessment Xudong Li et.al. 2401.11949 null
2024-01-22 EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models Koichi Namekata et.al. 2401.11739 null
2024-01-22 Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Ling Yang et.al. 2401.11708 link
2024-01-21 Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers Katherine Crowson et.al. 2401.11605 link
2024-01-20 Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient Weiguo Lu et.al. 2401.11261 null
2024-01-20 Product-Level Try-on: Characteristics-preserving Try-on with Realistic Clothes Shading and Wrinkles Yanlong Zang et.al. 2401.11239 null
2024-01-20 MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation Nhat M. Hoang et.al. 2401.11115 link
2024-01-20 UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures Mingyuan Zhou et.al. 2401.11078 null
2024-01-19 Synthesizing Moving People with 3D Control Boyi Li et.al. 2401.10889 null
2024-01-19 ActAnywhere: Subject-Aware Video Background Generation Boxiao Pan et.al. 2401.10822 null
2024-01-19 From Market Saturation to Social Reinforcement: Understanding the Impact of Non-Linearity in Information Diffusion Models Tobias Friedrich et.al. 2401.10818 null
2024-01-19 Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion Zuoyue Li et.al. 2401.10786 null
2024-01-19 Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model Yinan Zheng et.al. 2401.10700 link
2024-01-19 MAEDiff: Masked Autoencoder-enhanced Diffusion Models for Unsupervised Anomaly Detection in Brain Images Rui Xu et.al. 2401.10561 null
2024-01-18 Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution Xin Yuan et.al. 2401.10404 null
2024-01-18 A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting Wouter Van Gansbeke et.al. 2401.10227 link
2024-01-19 Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation Changgu Chen et.al. 2401.10150 null
2024-01-18 DiffusionGPT: LLM-Driven Text-to-Image Generation System Jie Qin et.al. 2401.10061 null
2024-01-18 CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects Zhao Wang et.al. 2401.09962 null
2024-01-18 BlenDA: Domain Adaptive Object Detection through diffusion-based blending Tzuhsuan Huang et.al. 2401.09921 link
2024-01-18 Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework Junkun Jiang et.al. 2401.09836 link
2024-01-18 Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing Gwanhyeong Koo et.al. 2401.09794 null
2024-01-18 Image Translation as Diffusion Visual Programmers Cheng Han et.al. 2401.09742 null
2024-01-17 Total fraction of drug released from diffusion-controlled delivery systems with binding reactions Elliot J. Carr et.al. 2401.09644 link
2024-01-17 Efficient generative adversarial networks using linear additive-attention Transformers Emilio Morales-Juarez et.al. 2401.09596 link
2024-01-17 TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion Yu-Ying Yeh et.al. 2401.09416 null
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414 link
2024-01-17 On the $\varepsilon$-Euler-Maruyama scheme for time inhomogeneous jump-driven SDEs Mireille Bossy et.al. 2401.09338 null
2024-01-17 Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery Jia Jia et.al. 2401.09325 null
2024-01-17 T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis Yoonjin Chung et.al. 2401.09294 link
2024-01-17 Training-Free Semantic Video Composition via Pre-trained Diffusion Model Jiaqi Guo et.al. 2401.09195 null
2024-01-17 Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior Zike Wu et.al. 2401.09050 link
2024-01-17 Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis Jonghyun Lee et.al. 2401.09048 link
2024-01-17 VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models Haoxin Chen et.al. 2401.09047 link
2024-01-17 Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation Tong Xie et.al. 2401.09031 link
2024-01-16 Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing Bin Zhang et.al. 2401.08275 null
2024-01-16 Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization Chongzhi Zhang et.al. 2401.08232 null
2024-01-16 Photonic Modes Prediction via Multi-Modal Diffusion Model Jinyang Sun et.al. 2401.08199 null
2024-01-16 Key-point Guided Deformable Image Manipulation Using Diffusion Model Seok-Hwan Oh et.al. 2401.08178 null
2024-01-16 SpecSTG: A Fast Spectral Diffusion Framework for Probabilistic Spatio-Temporal Traffic Forecasting Lequan Lin et.al. 2401.08119 null
2024-01-16 DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech Jaekwon Im et.al. 2401.08102 null
2024-01-16 EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model Bingyuan Zhang et.al. 2401.08049 null
2024-01-16 Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities Xu Yan et.al. 2401.08045 link
2024-01-15 Regularity in diffusion models with gradient activation Damião Araújo et.al. 2401.07979 null
2024-01-15 HexaGen3D: StableDiffusion is just one step away from Fast and Diverse Text-to-3D Generation Antoine Mercier et.al. 2401.07727 null
2024-01-12 A deep implicit-explicit minimizing movement method for option pricing in jump-diffusion models Emmanuil H. Georgoulis et.al. 2401.06740 null
2024-01-12 Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks Stefan Blücher et.al. 2401.06654 link
2024-01-12 Adversarial Examples are Misaligned in Diffusion Model Manifolds Peter Lorenz et.al. 2401.06637 null
2024-01-12 Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking Wei Cao et.al. 2401.06614 null
2024-01-12 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model Qian Wang et.al. 2401.06578 null
2024-01-12 RotationDrag: Point-based Image Editing with Rotated Diffusion Features Minxing Luo et.al. 2401.06442 link
2024-01-12 Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering Chang Yu et.al. 2401.06345 null
2024-01-11 Frequency-Time Diffusion with Neural Cellular Automata John Kalkhof et.al. 2401.06291 null
2024-01-11 Demystifying Variational Diffusion Models Fabio De Sousa Ribeiro et.al. 2401.06281 null
2024-01-11 E$^{2}$GAN: Efficient Training of Efficient GANs for Image-to-Image Translation Yifan Gong et.al. 2401.06127 null
2024-01-11 DiffDA: a diffusion model for weather-scale data assimilation Langwen Huang et.al. 2401.05932 link
2024-01-11 Efficient Image Deblurring Networks based on Diffusion Models Kang Chen et.al. 2401.05907 link
2024-01-11 HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models Hanzhang Wang et.al. 2401.05870 null
2024-01-11 EraseDiff: Erasing Data Influence in Diffusion Models Jing Wu et.al. 2401.05779 link
2024-01-10 Diffusion Priors for Dynamic View Synthesis from Monocular Videos Chaoyang Wang et.al. 2401.05583 null
2024-01-10 From Pampas to Pixels: Fine-Tuning Diffusion Models for Gaúcho Heritage Marcellus Amadeus et.al. 2401.05520 null
2024-01-10 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Mohamad Shahbazi et.al. 2401.05335 null
2024-01-10 Score Distillation Sampling with Learned Manifold Corrective Thiemo Alldieck et.al. 2401.05293 null
2024-01-10 PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models Junsong Chen et.al. 2401.05252 link
2024-01-10 Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN Muhammad Ali Farooq et.al. 2401.05159 null
2024-01-10 CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model Yinghui Xing et.al. 2401.05153 null
2024-01-10 SwiMDiff: Scene-wide Matching Contrastive Learning with Diffusion Constraint for Remote Sensing Image Jiayuan Tian et.al. 2401.05093 null
2024-01-10 A novel bond-based nonlocal diffusion model with matrix-valued coefficients in non-divergence form and its collocation discretization Lili Ju et.al. 2401.04973 null
2024-01-09 Transmission-eigenchannel velocity and diffusion Azriel Z. Genack et.al. 2401.04818 null
2024-01-09 Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation Xiyi Chen et.al. 2401.04728 link
2024-01-09 Efficient estimation for ergodic diffusion processes sampled at high frequency Michael Sørensen et.al. 2401.04689 null
2024-01-09 EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models Jingyuan Yang et.al. 2401.04608 null
2024-01-09 Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models Xuewen Liu et.al. 2401.04585 link
2024-01-09 MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation Weimin Wang et.al. 2401.04468 null
2024-01-09 D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly Detection Justin Tebbe et.al. 2401.04463 link
2024-01-09 SonicVisionLM: Playing Sound with Vision Language Models Zhifeng Xie et.al. 2401.04394 null
2024-01-09 Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example Kwan Yun et.al. 2401.04362 null
2024-01-09 Memory-Efficient Personalization using Quantized Diffusion Model Hyogon Ryu et.al. 2401.04339 null
2024-01-08 FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation Yang Liu et.al. 2401.04283 null
2024-01-08 scDiffusion: conditional generation of high-quality single-cell data using diffusion model Erpai Luo et.al. 2401.03968 link
2024-01-08 D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement Danqi Yan et.al. 2401.03914 null
2024-01-08 DDM-Lag: A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement Jiaqi Liu et.al. 2401.03629 null
2024-01-07 ROIC-DM: Robust Text Inference and Classification via Diffusion Model Shilong Yuan et.al. 2401.03514 null
2024-01-07 Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness Sicheng Yang et.al. 2401.03476 null
2024-01-07 Deep Learning-based Image and Video Inpainting: A Survey Weize Quan et.al. 2401.03395 null
2024-01-06 Reflected Schrödinger Bridge for Constrained Generative Modeling Wei Deng et.al. 2401.03228 null
2024-01-06 MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond Yupei Lin et.al. 2401.03221 null
2024-01-06 Fair Sampling in Diffusion Models through Switching Mechanism Yujin Choi et.al. 2401.03140 link
2024-01-05 Latte: Latent Diffusion Transformer for Video Generation Xin Ma et.al. 2401.03048 link
2024-01-05 Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction Yuxin Yang et.al. 2401.02916 null
2024-01-05 Plug-in Diffusion Model for Sequential Recommendation Haokai Ma et.al. 2401.02913 link
2024-01-05 Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors Top Piriyakulkij et.al. 2401.02739 null
2024-01-05 Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation Can Xu et.al. 2401.02683 link
2024-01-04 Comprehensive Exploration of Synthetic Data Generation: A Survey André Bauer et.al. 2401.02524 null
2024-01-04 VASE: Object-Centric Appearance and Shape Manipulation of Real Videos Elia Peruzzo et.al. 2401.02473 null
2024-01-04 Bring Metric Functions into Diffusion Models Jie An et.al. 2401.02414 null
2024-01-06 GUESS: GradUally Enriching SyntheSis for Text-Driven Human Motion Generation Xuehao Gao et.al. 2401.02142 link
2024-01-04 Preserving Image Properties Through Initializations in Diffusion Models Jeffrey Zhang et.al. 2401.02097 null
2024-01-04 Energy based diffusion generator for efficient sampling of Boltzmann distributions Yan Wang et.al. 2401.02080 null
2024-01-04 DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection Yunfan Ye et.al. 2401.02032 link
2024-01-04 Improving Diffusion-Based Image Synthesis with Context Prediction Ling Yang et.al. 2401.02015 null
2024-01-03 Instruct-Imagen: Image Generation with Multi-modal Instruction Hexiang Hu et.al. 2401.01952 null
2024-01-03 Can We Generate Realistic Hands Only Using Convolution? Mehran Hosseini et.al. 2401.01951 null
2024-01-03 Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions David Junhao Zhang et.al. 2401.01827 link
2024-01-03 DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models Yichen Liu et.al. 2401.01659 null
2024-01-03 SIGNeRF: Scene Integrated Generation for Neural Radiance Fields Jan-Niklas Dihlmann et.al. 2401.01647 null
2024-01-03 S$^{2}$-DMs: Skip-Step Diffusion Models Yixuan Wang et.al. 2401.01520 link
2024-01-02 ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text Dingkun Yan et.al. 2401.01456 link
2024-01-02 VALD-MD: Visual Attribution via Latent Diffusion for Medical Diagnostics Ammar A. Siddiqui et.al. 2401.01414 null
2024-01-02 VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM Fuchen Long et.al. 2401.01256 link
2024-01-02 Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation Renshuai Liu et.al. 2401.01207 null
2024-01-02 A comparative study of resistivity models for simulations of magnetic reconnection in the solar atmosphere. II. Plasmoid formation Øystein Håvard Færder et.al. 2401.01177 null
2024-01-02 Joint Generative Modeling of Scene Graphs and Images via Diffusion Models Bicheng Xu et.al. 2401.01130 null
2024-01-02 Robust single-particle cryo-EM image denoising and restoration Jing Zhang et.al. 2401.01097 null
2024-01-02 Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation Jinlong Xue et.al. 2401.01044 link
2024-01-01 DiffMorph: Text-less Image Morphing with Diffusion Models Shounak Chatterjee et.al. 2401.00739 null
2024-01-01 Diffusion Models, Image Super-Resolution And Everything: A Survey Brian B. Moser et.al. 2401.00736 null
2024-01-02 GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields Xiao Pan et.al. 2401.00616 null
2023-12-31 Diff-PCR: Diffusion-Based Correspondence Searching in Doubly Stochastic Matrix Space for Point Cloud Registration Qianliang Wu et.al. 2401.00436 null
2023-12-29 FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis Feng Liang et.al. 2312.17681 null
2023-12-29 Data Augmentation for Supervised Graph Outlier Detection with Latent Diffusion Models Kay Liu et.al. 2312.17679 link
2023-12-29 Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation Tuan-Anh Vu et.al. 2312.17505 null
2023-12-28 Classifier-free graph diffusion for molecular property targeting Matteo Ninniri et.al. 2312.17397 link
2023-12-28 iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views Chin-Hsuan Wu et.al. 2312.17250 link
2023-12-28 Personalized Restoration via Dual-Pivot Tuning Pradyumna Chari et.al. 2312.17234 null
2023-12-28 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency Yuyang Yin et.al. 2312.17225 null
2023-12-28 Restoration by Generation with Constrained Priors Zheng Ding et.al. 2312.17161 null
2023-12-28 DiffKG: Knowledge Graph Diffusion Model for Recommendation Yangqin Jiang et.al. 2312.16890 link
2023-12-29 DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors Biwen Lei et.al. 2312.16837 null
2023-12-27 I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models Xun Guo et.al. 2312.16693 link
2023-12-27 Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection Huan Liu et.al. 2312.16649 link
2023-12-27 Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance Tomer Garber et.al. 2312.16519 link
2023-12-29 PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion Guansong Lu et.al. 2312.16486 null
2023-12-26 One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications Mengyao Lyu et.al. 2312.16145 null
2023-12-26 Compositional Search of Stable Crystalline Structures in Multi-Component Alloys Using Generative Diffusion Models Grzegorz Kaszuba et.al. 2312.16073 null
2023-12-26 HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D Sangmin Woo et.al. 2312.15980 link
2023-12-26 Semantic Guidance Tuning for Text-To-Image Diffusion Models Hyun Kang et.al. 2312.15964 link
2023-12-26 Implied volatility (also) is path-dependent Hervé Andrès et.al. 2312.15950 link
2023-12-26 EnchantDance: Unveiling the Potential of Music-Driven Dance Movement Bo Han et.al. 2312.15946 link
2023-12-26 Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection Songmin Dai et.al. 2312.15911 null
2023-12-26 Cross Initialization for Personalized Text-to-Image Generation Lianyu Pang et.al. 2312.15905 link
2023-12-25 Adversarial Item Promotion on Visually-Aware Recommender Systems by Guided Diffusion Lijian Chen et.al. 2312.15826 null
2023-12-25 High-Fidelity Diffusion-based Image Editing Chen Hou et.al. 2312.15707 null
2023-12-22 MACS: Mass Conditioned 3D Hand and Object Motion Synthesis Soshi Shimada et.al. 2312.14929 null
2023-12-22 BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction Honghao Fu et.al. 2312.14871 link
2023-12-22 Neural-network-based regularization methods for inverse problems in imaging Andreas Habring et.al. 2312.14849 null
2023-12-22 Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models Tanish Baranwal et.al. 2312.14830 link
2023-12-22 Neural network models for preferential concentration of particles in two-dimensional turbulence Thibault Maurel-Oujia et.al. 2312.14829 null
2023-12-22 Plan, Posture and Go: Towards Open-World Text-to-Motion Generation Jinpeng Liu et.al. 2312.14828 null
2023-12-22 Harnessing Diffusion Models for Visual Perception with Meta Prompts Qiang Wan et.al. 2312.14733 link
2023-12-22 FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection Dongmei Zhang et.al. 2312.14465 null
2023-12-22 Generative AI Beyond LLMs: System Implications of Multi-Modal Generation Alicia Golden et.al. 2312.14385 null
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134 link
2023-12-21 Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation Philipp Schröppel et.al. 2312.14124 link
2023-12-21 HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models Hayk Manukyan et.al. 2312.14091 link
2023-12-21 Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning Desai Xie et.al. 2312.13980 null
2023-12-21 Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models Xianfang Zeng et.al. 2312.13913 link
2023-12-21 Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models Huan Ling et.al. 2312.13763 null
2023-12-21 Free-Editor: Zero-shot Text-driven 3D Scene Editing Nazmul Karim et.al. 2312.13663 link
2023-12-21 Diff-Oracle: Diffusion Model for Oracle Character Generation with Controllable Styles and Contents Jing Li et.al. 2312.13631 null
2023-12-21 Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion Nishtha Madaan et.al. 2312.13616 null
2023-12-21 Front stability of infinitely steep travelling waves in population biology Matthew J Simpson et.al. 2312.13601 link
2023-12-20 Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting Junwu Zhang et.al. 2312.13271 link
2023-12-20 Conditional Image Generation with Pretrained Generative Model Rajesh Shrestha et.al. 2312.13253 null
2023-12-20 Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model Saurabh Saxena et.al. 2312.13252 null
2023-12-20 Diffusion Models With Learned Adaptive Noise Subham Sekhar Sahoo et.al. 2312.13236 link
2023-12-20 DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis Yuming Gu et.al. 2312.13016 link
2023-12-20 RadEdit: stress-testing biomedical vision models via diffusion image editing Fernando Pérez-García et.al. 2312.12865 null
2023-12-20 ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement Yuhui Wu et.al. 2312.12826 null
2023-12-20 All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models Seunghoo Hong et.al. 2312.12807 null
2023-12-21 AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion Beibei Jing et.al. 2312.12763 null
2023-12-20 How Good Are Deep Generative Models for Solving Inverse Problems? Shichong Peng et.al. 2312.12691 null
2023-12-19 On Inference Stability for Diffusion Models Viet Nguyen et.al. 2312.12431 link
2023-12-19 Scene-Conditional 3D Object Stylization and Composition Jinghao Zhou et.al. 2312.12419 null
2023-12-19 Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models Shweta Mahajan et.al. 2312.12416 null
2023-12-19 Travelling pulses on three spatial scales in a Klausmeier-type vegetation-autotoxicity model Paul Carter et.al. 2312.12277 null
2023-12-19 Intrinsic Image Diffusion for Single-view Material Estimation Peter Kocsis et.al. 2312.12274 link
2023-12-19 Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model Lingjun Zhang et.al. 2312.12232 link
2023-12-19 HuTuMotion: Human-Tuned Navigation of Latent Motion Diffusion Models with Minimal Feedback Gaoge Han et.al. 2312.12227 null
2023-12-19 FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning Zhenhua Yang et.al. 2312.12142 link
2023-12-19 GazeMoDiff: Gaze-guided Diffusion Model for Stochastic Human Motion Prediction Haodong Yan et.al. 2312.12090 null
2023-12-19 Learning Subject-Aware Cropping by Outpainting Professional Photos James Hong et.al. 2312.12080 null
2023-12-18 A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm Yong Niu et.al. 2312.10885 null
2023-12-17 Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models Nikita Starodubcev et.al. 2312.10835 link
2023-12-17 CogCartoon: Towards Practical Story Visualization Zhongyang Zhu et.al. 2312.10718 null
2023-12-17 VidToMe: Video Token Merging for Zero-Shot Video Editing Xirui Li et.al. 2312.10656 link
2023-12-16 VecFusion: Vector Font Generation with Diffusion Vikas Thamizharasan et.al. 2312.10540 null
2023-12-16 A Unified Filter Method for Jointly Estimating State and Parameters of Stochastic Dynamical Systems via the Ensemble Score Filter Feng Bao et.al. 2312.10503 null
2023-12-16 Continuous Diffusion for Mixed-Type Tabular Data Markus Mueller et.al. 2312.10431 link
2023-12-16 Lecture Notes in Probabilistic Diffusion Models Inga Strümke et.al. 2312.10393 null
2023-12-16 Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge Conghan Yue et.al. 2312.10299 link
2023-12-15 Two simple criterion to prove the existence of patterns in reaction-diffusion models of two components Francisco J. Vielma-Leal et.al. 2312.10231 null

(back to top)

Implicit

Publish Date Title Authors PDF Code
2025-01-16 Bias for Action: Video Implicit Neural Representations with Bias Modulation Alper Kayabasi et.al. 2501.09277 null
2025-01-15 Dynamic-Aware Spatio-temporal Representation Learning for Dynamic MRI Reconstruction Dayoung Baik et.al. 2501.09049 null
2025-01-13 Implicit Neural Representations for Registration of Left Ventricle Myocardium During a Cardiac Cycle Mathias Micheelsen Lowes et.al. 2501.07248 link
2025-01-14 Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution Du Chen et.al. 2501.06838 null
2025-01-07 NeuralSVG: An Implicit Representation for Text-to-Vector Generation Sagi Polaczek et.al. 2501.03992 null
2025-01-06 Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks Théophane Vallaeys et.al. 2501.03078 link
2025-01-05 MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance Jialong Guo et.al. 2501.02427 null
2025-01-03 Few-shot Implicit Function Generation via Equivariance Suizhi Huang et.al. 2501.01601 null
2025-01-02 Incomplete Data Multi-Source Static Computed Tomography Reconstruction with Diffusion Priors and Implicit Neural Representation Ziju Shen et.al. 2501.01013 null
2025-01-01 CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation Daniel Silver et.al. 2501.00975 null
2024-12-19 Quantum Implicit Neural Compression Takuya Fujihashi et.al. 2412.19828 null
2025-01-09 STITCH: Surface reconstrucTion using Implicit neural representations with Topology Constraints and persistent Homology Anushrut Jignasu et.al. 2412.18696 null
2024-12-29 PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models Minghao Chen et.al. 2412.18608 null
2025-01-04 S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field Zixi Liang et.al. 2412.17561 link
2024-12-26 LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction Pou-Chun Kung et.al. 2412.15447 null
2024-12-17 iRBSM: A Deep Implicit 3D Breast Shape Model Maximilian Weiherer et.al. 2412.13244 null
2024-12-17 Subspace Implicit Neural Representations for Real-Time Cardiac Cine MR Imaging Wenqi Huang et.al. 2412.12742 null
2024-12-15 Semi-Implicit Neural Ordinary Differential Equations Hong Zhang et.al. 2412.11301 link
2024-12-11 Implicit Neural Compression of Point Clouds Hongning Ruan et.al. 2412.10433 null
2024-12-13 EVOS: Efficient Implicit Neural Training via EVOlutionary Selector Weixiang Zhang et.al. 2412.10153 null
2024-12-12 Enhancing Implicit Neural Representations via Symmetric Power Transformation Weixiang Zhang et.al. 2412.09213 link
2024-12-11 Unicorn: Unified Neural Image Compression with One Number Reconstruction Qi Zheng et.al. 2412.08210 null
2024-12-11 INRetouch: Context Aware Implicit Neural Representation for Photography Retouching Omar Elezabi et.al. 2412.03848 null
2024-12-04 HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution Yuxuan Jiang et.al. 2412.03748 null
2024-12-03 Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance Jing Zeng et.al. 2412.02249 null
2024-12-02 Efficient Compression of Sparse Accelerator Data Using Implicit Neural Representations and Importance Sampling Xihaier Luo et.al. 2412.01754 link
2024-12-02 SUICA: Learning Super-high Dimensional Sparse Implicit Neural Representations for Spatial Transcriptomics Qingtian Zhu et.al. 2412.01124 null
2024-11-27 Towards Lensless Image Deblurring with Prior-Embedded Implicit Neural Representations in the Low-Data Regime Abeer Banerjee et.al. 2411.18189 null
2024-11-27 MeltpoolINR: Predicting temperature field, melt pool geometry, and their rate of change in laser powder bed fusion Manav Manav et.al. 2411.18048 null
2024-11-21 Geometric Algebra Planes: Convex Implicit Neural Volumes Irmak Sivgin et.al. 2411.13525 null
2024-11-16 $\text{S}^{3}$Mamba: Arbitrary-Scale Super-Resolution via Scaleable State Space Model Peizhe Xia et.al. 2411.11906 null
2024-11-20 TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection Mengxuan Li et.al. 2411.11641 link
2024-11-18 Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data Jiayi Li et.al. 2411.11356 null
2024-11-18 Continuous K-space Recovery Network with Image Guidance for Fast MRI Reconstruction Yucong Meng et.al. 2411.11282 null
2024-11-17 VeGaS: Video Gaussian Splatting Weronika Smolak-Dyżewska et.al. 2411.11024 link
2024-11-12 Numerical Homogenization by Continuous Super-Resolution Zhi-Song Liu et.al. 2411.07576 null
2024-11-10 Local Implicit Wavelet Transformer for Arbitrary-Scale Super-Resolution Minghong Duan et.al. 2411.06442 link
2024-11-09 HiHa: Introducing Hierarchical Harmonic Decomposition to Implicit Neural Compression for Atmospheric Data Zhewen Xu et.al. 2411.06155 null
2024-11-07 LoFi: Scalable Local Image Reconstruction with Implicit Neural Representation AmirEhsan Khorashadizadeh et.al. 2411.04995 link
2024-11-07 VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes Advaith V. Sethuraman et.al. 2411.04963 null
2024-11-06 Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey Amer Essakine et.al. 2411.03688 null
2024-10-31 MS-Glance: Non-semantic context vectors and the applications in supervising image reconstruction Ziqi Gao et.al. 2410.23577 link
2024-10-30 Understanding Representation of Deep Equilibrium Models from Neural Collapse Perspective Haixiang Sun et.al. 2410.23391 null
2024-10-29 Predicting the Encoding Error of SIRENs Jeremy Vonderfecht et.al. 2410.21645 null
2024-10-29 Neural Experts: Mixture of Experts for Implicit Neural Representations Yizhak Ben-Shabat et.al. 2410.21643 null
2024-10-29 EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior Xin Xiang et.al. 2410.20981 null
2024-10-16 Radon Implicit Field Transform (RIFT): Learning Scenes from Radar Signals Daqian Bao et.al. 2410.19801 null
2024-10-25 ST-NeRP: Spatial-Temporal Neural Representation Learning with Prior Embedding for Patient-specific Imaging Study Liang Qiu et.al. 2410.19283 null
2024-10-24 Environment Maps Editing using Inverse Rendering and Adversarial Implicit Functions Antonio D'Orazio et.al. 2410.18622 null
2024-10-22 Scalable Implicit Graphon Learning Ali Azizpour et.al. 2410.17464 link
2024-10-19 Implicit neural representation for free-breathing MR fingerprinting (INR-MRF): co-registered 3D whole-liver water T1, water T2, proton density fat fraction, and R2* mapping Chao Li et.al. 2410.15175 null
2024-10-17 Object Pose Estimation Using Implicit Representation For Transparent Objects Varun Burde et.al. 2410.13465 null
2024-10-17 Inductive Gradient Adjustment For Spectral Bias In Implicit Neural Representations Kexuan Shi et.al. 2410.13271 null
2024-10-16 Optimizing 3D Geometry Reconstruction from Implicit Neural Representations Shen Fan et.al. 2410.12725 null
2024-10-16 MING: A Functional Approach to Learning Molecular Generative Models Van Khoa Nguyen et.al. 2410.12522 null
2024-10-14 StegaINR4MIH: steganography by implicit neural representation for multi-image hiding Weina Dong et.al. 2410.10117 link
2024-10-13 Magnituder Layers for Implicit Neural Representations in 3D Sang Min Kim et.al. 2410.09771 null
2024-10-18 IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera Jian Huang et.al. 2410.08107 link
2024-10-09 DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation Zhiqi Li et.al. 2410.06756 null
2024-10-08 Training Stiff Neural Ordinary Differential Equations with Implicit Single-Step Methods Colby Fronk et.al. 2410.05592 null
2024-10-11 Implicitly Learned Neural Phase Functions for Basis-Free Point Spread Function Engineering Aleksey Valouev et.al. 2410.05413 null
2024-10-08 FreSh: Frequency Shifting for Accelerated Neural Representation Learning Adam Kania et.al. 2410.05050 link
2024-10-07 H-SIREN: Improving implicit neural representations with hyperbolic periodic functions Rui Gao et.al. 2410.04716 null
2024-10-07 Neural Fourier Modelling: A Highly Compact Approach to Time-Series Analysis Minjung Kim et.al. 2410.04703 link
2024-10-07 SegINR: Segment-wise Implicit Neural Representation for Sequence Alignment in Neural Text-to-Speech Minchan Kim et.al. 2410.04690 null
2024-10-04 Shrinking: Reconstruction of Parameterized Surfaces from Signed Distance Fields Haotian Yin et.al. 2410.03123 null
2024-10-03 On Logical Extrapolation for Mazes with Recurrent and Implicit Networks Brandon Knutson et.al. 2410.03020 link
2024-10-02 MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis Xiaobiao Du et.al. 2410.02103 link
2024-10-03 Releasing the Parameter Latency of Neural Representation for High-Efficiency Video Compression Gai Zhang et.al. 2410.01654 null
2024-10-02 Coordinate-Based Neural Representation Enabling Zero-Shot Learning for 3D Multiparametric Quantitative MRI Guoyan Lao et.al. 2410.01577 null
2024-10-02 MiraGe: Editable 2D Images using Gaussian Splatting Joanna Waczyńska et.al. 2410.01521 link
2024-09-30 WildFusion: Multimodal Implicit 3D Reconstructions in the Wild Yanbaihui Liu et.al. 2409.19904 null
2024-09-28 Towards Croppable Implicit Neural Representations Maor Ashkenazi et.al. 2409.19472 link
2024-09-28 Fast Encoding and Decoding for Implicit Video Representation Hao Chen et.al. 2409.19429 null
2024-09-27 Neural Video Representation for Redundancy Reduction and Consistency Preservation Taiga Hayami et.al. 2409.18497 null
2024-09-25 Implicit Neural Representations for Simultaneous Reduction and Continuous Reconstruction of Multi-Altitude Climate Data Alif Bin Abdul Qayyum et.al. 2409.17367 link
2024-09-25 Streaming Neural Images Marcos V. Conde et.al. 2409.17134 null
2024-09-25 Moner: Motion Correction in Undersampled Radial MRI with Unsupervised Neural Representation Qing Wu et.al. 2409.16921 null
2024-09-25 Ring Artifacts Removal Based on Implicit Neural Representation of Sinogram Data Ligen Shi et.al. 2409.15731 null
2024-09-21 Implicit Neural Representations for Speed-of-Sound Estimation in Ultrasound Michal Byra et.al. 2409.14035 null
2024-09-21 MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors Zhenhua Du et.al. 2409.14019 null
2024-09-20 Occupancy-Based Dual Contouring Jisung Hwang et.al. 2409.13418 link
2024-09-19 Breaking the Barriers of One-to-One Usage of Implicit Neural Representation in Image Compression: A Linear Combination Approach with Performance Guarantees Sai Sanjeet et.al. 2409.13117 link
2024-09-18 Intraoperative Registration by Cross-Modal Inverse Neural Rendering Maximilian Fehrentz et.al. 2409.11983 null
2024-09-18 Monomial Matrix Group Equivariant Neural Functional Networks Hoang V. Tran et.al. 2409.11697 link
2024-09-17 Compact Implicit Neural Representations for Plane Wave Images Mathilde Monvoisin et.al. 2409.11370 null
2024-09-17 SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction Marko Mihajlovic et.al. 2409.11211 null
2024-09-17 Neural Fields for Adaptive Photoacoustic Computed Tomography Tianao Li et.al. 2409.10876 null
2024-09-18 Single-Layer Learnable Activation for Implicit Neural Representation (SL$^{2}$A-INR) Moein Heidari et.al. 2409.10836 null
2024-09-15 Learning Transferable Features for Implicit Neural Representations Kushal Vyas et.al. 2409.09566 null
2024-09-14 Estimating Neural Orientation Distribution Fields on High Resolution Diffusion MRI Scans Mohammed Munzer Dwedari et.al. 2409.09387 link
2024-09-20 Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks Ali Mehrabian et.al. 2409.09323 link
2024-09-12 DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors Thomas Hanwen Zhu et.al. 2409.08278 null
2024-09-11 NVRC: Neural Video Representation Compression Ho Man Kwan et.al. 2409.07414 null
2024-09-11 AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution Wangduo Xie et.al. 2409.07171 null
2024-09-11 Fast Medical Shape Reconstruction via Meta-learned Implicit Neural Representations Gaia Romana De Paolis et.al. 2409.07100 null
2024-09-10 A Latent Implicit 3D Shape Model for Multiple Levels of Detail Benoit Guillard et.al. 2409.06231 null
2024-09-09 G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis Lutao Jiang et.al. 2409.05617 null
2024-09-06 NeCA: 3D Coronary Artery Tree Reconstruction from Two 2D Projections by Neural Implicit Representation Yiying Wang et.al. 2409.04596 link
2024-09-10 Diff-INR: Generative Regularization for Electrical Impedance Tomography Bowen Tong et.al. 2409.04494 null
2024-09-02 SeCo-INR: Semantically Conditioned Implicit Neural Representations for Improved Medical Image Super-Resolution Mevan Ekanayake et.al. 2409.01013 null
2024-09-02 PNVC: Towards Practical INR-based Video Compression Ge Gao et.al. 2409.00953 null
2024-08-29 RMMI: Enhanced Obstacle Avoidance for Reactive Mobile Manipulation using an Implicit Neural Map Nicolas Marticorena et.al. 2408.16206 null
2024-08-20 NeR-VCP: A Video Content Protection Method Based on Implicit Neural Representation Yangping Lin et.al. 2408.15281 null
2024-08-27 Few-Shot Unsupervised Implicit Neural Shape Representation Learning with Spatial Adversaries Amine Ouasfi et.al. 2408.15114 null
2024-08-27 Depth Restoration of Hand-Held Transparent Objects for Human-to-Robot Handover Ran Yu et.al. 2408.14997 null
2024-08-27 OctFusion: Octree-based Diffusion Models for 3D Shape Generation Bojun Xiong et.al. 2408.14732 link
2024-08-25 FreqINR: Frequency Consistency for Implicit Neural Representation with Adaptive DCT Frequency Loss Meiyi Wei et.al. 2408.13716 null
2024-08-23 S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points Bing He et.al. 2408.13036 link
2024-08-16 Modeling the Neonatal Brain Development Using Implicit Neural Representations Florentin Bieder et.al. 2408.08647 link
2024-08-16 Reference-free Axial Super-resolution of 3D Microscopy Images using Implicit Neural Representation with a 2D Diffusion Prior Kyungryun Lee et.al. 2408.08616 link
2024-08-12 Implicit Neural Representation For Accurate CFD Flow Field Prediction Laurent de Vito et.al. 2408.06486 null
2024-08-12 Uncertainty-Informed Volume Visualization using Implicit Neural Representation Shanu Saklani et.al. 2408.06018 null
2024-08-10 Residual-INR: Communication Efficient On-Device Learning Using Implicit Neural Representation Hanqiu Chen et.al. 2408.05617 link
2024-08-20 Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE Yiying Yang et.al. 2408.05477 null
2024-08-09 EclipseNETs: a differentiable description of irregular eclipse conditions Giacomo Acciarini et.al. 2408.05387 null
2024-08-07 PHOCUS: Physics-Based Deconvolution for Ultrasound Resolution Enhancement Felix Duelmer et.al. 2408.03657 link
2024-08-06 Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement Hao Xu et.al. 2408.02966 null
2024-08-05 Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics Shishira R Maiya et.al. 2408.02672 null
2024-08-04 AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos Feichi Lu et.al. 2408.02110 null
2024-08-05 UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization Ziwen Guo et.al. 2408.00860 null
2024-07-30 Neural Fields for Continuous Periodic Motion Estimation in 4D Cardiovascular Imaging Simone Garzia et.al. 2407.20728 null
2024-07-29 Registering Neural 4D Gaussians for Endoscopic Surgery Yiming Huang et.al. 2407.20213 null
2024-07-29 Aero-Nef: Neural Fields for Rapid Aircraft Aerodynamics Simulations Giovanni Catalani et.al. 2407.19916 link
2024-07-28 UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation Shuang Wu et.al. 2407.19542 link
2024-07-28 FINER++: Building a Family of Variable-periodic Functions for Activating Implicit Neural Representation Hao Zhu et.al. 2407.19434 null
2024-07-26 ObjectCarver: Semi-automatic segmentation, reconstruction and separation of 3D objects Gemmechu Hassena et.al. 2407.19108 null
2024-07-26 Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations Zipeng Wang et.al. 2407.18500 null
2024-07-25 GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution Jintong Hu et.al. 2407.18046 null
2024-07-23 Uncertainty-Aware Deep Neural Representations for Visual Analysis of Vector Field Data Atul Kumar et.al. 2407.16119 null
2024-07-22 Attention Beats Linear for Fast Implicit Neural Representation Generation Shuyi Zhang et.al. 2407.15355 link
2024-07-19 SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization Mae Younes et.al. 2407.14257 null
2024-07-18 DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays Xuhui Liu et.al. 2407.13545 null
2024-07-18 Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM Baicheng Li et.al. 2407.13338 null
2024-07-17 A Resolution Independent Neural Operator Bahador Bahmani et.al. 2407.13010 null
2024-07-17 Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations Tomáš Chobola et.al. 2407.12511 link
2024-07-18 IPA-NeRF: Illusory Poisoning Attack Against Neural Radiance Fields Wenxiang Jiang et.al. 2407.11921 link
2024-07-12 Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending Delong Wu et.al. 2407.08457 null
2024-07-09 PDEformer-1: A Foundation Model for One-Dimensional Partial Differential Equations Zhanhong Ye et.al. 2407.06664 null
2024-07-09 Implicit Regression in Subspace for High-Sensitivity CEST Imaging Chu Chen et.al. 2407.06614 null
2024-07-08 LINEAR: Learning Implicit Neural Representation With Explicit Physical Priors for Accelerated Quantitative T1rho Mapping Yuanyuan Liu et.al. 2407.05617 null
2024-07-03 IM-MoCo: Self-supervised MRI Motion Correction using Motion-Guided Implicit Neural Representations Ziad Al-Haj Hemidi et.al. 2407.02974 link
2024-07-03 Highly Accelerated MRI via Implicit Neural Representation Guided Posterior Sampling of Diffusion Models Jiayue Chu et.al. 2407.02744 null
2024-07-03 BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream Wenpu Li et.al. 2407.02174 link
2024-07-04 UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks Jingjing Ren et.al. 2407.02158 null
2024-07-07 Learning 3D Gaussians for Extremely Sparse-View Cone-Beam CT Reconstruction Yiqun Lin et.al. 2407.01090 link
2024-06-27 PNeRV: A Polynomial Neural Representation for Videos Sonam Gupta et.al. 2406.19299 null
2024-06-25 Efficient and Effective Implicit Dynamic Graph Neural Network Yongjian Zhong et.al. 2406.17894 link
2024-06-25 Sparse-view Signal-domain Photoacoustic Tomography Reconstruction Method Based on Neural Representation Bowei Yao et.al. 2406.17578 null
2024-06-21 CoCPF: Coordinate-based Continuous Projection Field for Ill-Posed Inverse Problem in Imaging Zixuan Chen et.al. 2406.14976 null
2024-06-19 INFusion: Diffusion Regularized Implicit Neural Representations for 2D and 3D accelerated MRI reconstruction Yamin Arefeen et.al. 2406.13895 null
2024-06-19 Enhance the Image: Super Resolution using Artificial Intelligence in MRI Ziyu Li et.al. 2406.13625 null
2024-06-13 CodedEvents: Optimal Point-Spread-Function Engineering for 3D-Tracking with Event Cameras Sachin Shah et.al. 2406.09409 null
2024-06-13 OpenMaterial: A Comprehensive Dataset of Complex Materials for 3D Reconstruction Zheng Dang et.al. 2406.08894 null
2024-06-13 Generalizable Implicit Neural Representation As a Universal Spatiotemporal Traffic Data Learner Tong Nie et.al. 2406.08743 null
2024-06-11 NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images Yufei Han et.al. 2406.07111 null
2024-06-09 A Low Rank Neural Representation of Entropy Solutions Donsub Rim et.al. 2406.05694 null
2024-06-06 Conv-INR: Convolutional Implicit Neural Representation for Multimodal Visual Signals Zhicheng Cai et.al. 2406.04249 null
2024-06-06 Encoding Semantic Priors into the Weights of Implicit Neural Representation Zhicheng Cai et.al. 2406.04178 null
2024-06-06 C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction Yiqun Lin et.al. 2406.03902 link
2024-06-06 Quantum Implicit Neural Representations Jiaming Zhao et.al. 2406.03873 link
2024-06-04 ReLUs Are Sufficient for Learning Implicit Neural Representations Joseph Shenouda et.al. 2406.02529 link
2024-06-04 Image steganography based on generative implicit neural representation Zhong Yangjie et.al. 2406.01918 link
2024-06-01 Modeling Randomly Observed Spatiotemporal Dynamical Systems Valerii Iakovlev et.al. 2406.00368 null
2024-05-31 ImplicitTerrain: a Continuous Surface Model for Terrain Data Analysis Haoan Feng et.al. 2406.00227 null
2024-05-31 MeshXL: Neural Coordinate Field for Generative 3D Foundation Models Sijin Chen et.al. 2405.20853 link
2024-05-29 Implicit Neural Image Field for Biological Microscopy Image Compression Gaole Dai et.al. 2405.19012 link
2024-05-28 Towards a Sampling Theory for Implicit Neural Representations Mahrokh Najaf et.al. 2405.18410 null
2024-05-28 A Grid-Free Fluid Solver based on Gaussian Spatial Representation Jingrui Xing et.al. 2405.18133 null
2024-05-27 UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation Runzhao Yang et.al. 2405.16850 null
2024-06-04 Extreme Compression of Adaptive Neural Images Leo Hoshikawa et.al. 2405.16807 null
2024-05-27 Transport of Algebraic Structure to Latent Embeddings Samuel Pfrommer et.al. 2405.16763 link
2024-05-24 CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation Xia Li et.al. 2405.15385 null
2024-05-23 Multi-view Remote Sensing Image Segmentation With SAM priors Zipeng Qi et.al. 2405.14171 null
2024-05-22 HR-INR: Continuous Space-Time Video Super-Resolution via Event Camera Yunfan Lu et.al. 2405.13389 null
2024-05-20 GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details Boqian Li et.al. 2405.12420 link
2024-05-20 ASMR: Activation-sharing Multi-resolution Coordinate Networks For Efficient Inference Jason Chun Lok Li et.al. 2405.12398 link
2024-05-19 Point Cloud Compression with Implicit Neural Representations: A Unified Framework Hongning Ruan et.al. 2405.11493 null
2024-05-18 HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos Qifeng Chen et.al. 2405.11270 null
2024-05-17 Nonparametric Teaching of Implicit Neural Representations Chen Zhang et.al. 2405.10531 link
2024-05-14 Achieving Resolution-Agnostic DNN-based Image Watermarking: A Novel Perspective of Implicit Neural Representation Yuchen Wang et.al. 2405.08340 null
2024-05-11 Unsupervised Density Neural Representation for CT Metal Artifact Reduction Qing Wu et.al. 2405.07047 null
2024-05-10 I3DGS: Improve 3D Gaussian Splatting from Multiple Dimensions Jinwei Lin et.al. 2405.06408 null
2024-05-10 Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera Haixin Shi et.al. 2405.05858 null
2024-05-09 NeuRSS: Enhancing AUV Localization and Bathymetric Mapping with Neural Rendering for Sidescan SLAM Yiping Xie et.al. 2405.05807 null
2024-05-09 Radar Fields: Frequency-Space Neural Scene Representations for FMCW Radar David Borts et.al. 2405.04662 null
2024-05-06 3D LiDAR Mapping in Dynamic Environments Using a 4D Implicit Neural Representation Xingguang Zhong et.al. 2405.03388 link
2024-05-06 Spatiotemporal Implicit Neural Representation as a Generalized Traffic Data Learner Tong Nie et.al. 2405.03185 link
2024-05-03 Implicit Neural Representations for Robust Joint Sparse-View CT Reconstruction Jiayang Shi et.al. 2405.02509 null
2024-05-01 Continuous sPatial-Temporal Deformable Image Registration (CPT-DIR) for motion modelling in radiotherapy: beyond classic voxel-based methods Xia Li et.al. 2405.00430 null
2024-04-29 Distributed Stochastic Optimization of a Neural Representation Network for Time-Space Tomography Reconstruction K. Aditya Mohan et.al. 2404.19075 null
2024-04-27 DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction Chenhe Du et.al. 2404.17890 null
2024-04-25 Latent Modulated Function for Computational Optimal Continuous Image Representation Zongyao He et.al. 2404.16451 link
2024-04-23 Fourier-enhanced Implicit Neural Fusion Network for Multispectral and Hyperspectral Image Fusion Yu-Jie Liang et.al. 2404.15174 null
2024-04-23 HOIN: High-Order Implicit Neural Representations Yang Chen et.al. 2404.14674 null
2024-04-22 Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer Eric Brachmann et.al. 2404.14351 null
2024-04-18 Mapping back and forth between model predictive control and neural networks Ross Drummond et.al. 2404.12030 null
2024-04-16 Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration Jing Zeng et.al. 2404.10218 null
2024-04-15 Q2A: Querying Implicit Fully Continuous Feature Pyramid to Align Features for Medical Image Segmentation Jiahao Yu et.al. 2404.09472 null
2024-04-03 Dynamic Neural Control Flow Execution: An Agent-Based Deep Equilibrium Approach for Binary Vulnerability Detection Litao Li et.al. 2404.08562 null
2024-04-09 Studying the Impact of Latent Representations in Implicit Neural Networks for Scientific Continuous Field Reconstruction Wei Xu et.al. 2404.06418 null
2024-04-03 JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients Woo Kyoung Han et.al. 2404.05558 link
2024-04-07 CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data Wei Fang et.al. 2404.04878 null
2024-04-05 Rethinking Non-Negative Matrix Factorization with Implicit Neural Representations Krishna Subramani et.al. 2404.04439 link
2024-04-05 Deep Phase Coded Image Prior Nimrod Shabtay et.al. 2404.03906 null
2024-04-04 CSR-dMRI: Continuous Super-Resolution of Diffusion MRI with Anatomical Structure-assisted Implicit Neural Representation Learning Ruoyou Wu et.al. 2404.03209 null
2024-04-03 Unsupervised Occupancy Learning from Sparse Point Cloud Amine Ouasfi et.al. 2404.02759 null
2024-04-02 Unmasking Correlations in Nuclear Cross Sections with Graph Neural Networks Sinjini Mitra et.al. 2404.02332 null
2024-04-02 Federated Multi-Agent Mapping for Planetary Exploration Tiberiu-Ioan Szatmari et.al. 2404.02289 null
2024-04-02 Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining Xiang Chen et.al. 2404.01547 link
2024-03-29 NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising Tianchen Deng et.al. 2403.20034 link
2024-03-28 Benchmarking Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAM Tongyan Hua et.al. 2403.19473 link
2024-03-28 D'OH: Decoder-Only random Hypernetworks for Implicit Neural Representations Cameron Gordon et.al. 2403.19163 null
2024-03-25 INPC: Implicit Neural Point Clouds for Radiance Field Rendering Florian Hahlbohm et.al. 2403.16862 null
2024-03-23 DS-NeRV: Implicit Neural Video Representation with Decomposed Static and Dynamic Codes Hao Yan et.al. 2403.15679 null
2024-03-21 Toward Multi-class Anomaly Detection: Exploring Class-aware Unified Model against Inter-class Interference Xi Jiang et.al. 2403.14213 null
2024-03-20 Visual Imitation Learning of Task-Oriented Object Grasping and Rearrangement Yichen Cai et.al. 2403.14000 null
2024-03-20 MIMO Channel as a Neural Function: Implicit Neural Representations for Extreme CSI Compression in Massive MIMO Systems Haotian Wu et.al. 2403.13615 null
2024-03-19 VQ-NeRV: A Vector Quantized Neural Representation for Videos Yunjie Xu et.al. 2403.12401 link
2024-03-18 Reachability-based Trajectory Design via Exact Formulation of Implicit Neural Signed Distance Functions Jonathan Michaux et.al. 2403.12280 null
2024-03-20 Graph Neural Networks for Learning Equivariant Representations of Neural Networks Miltiadis Kofinas et.al. 2403.12143 link
2024-03-18 3DGS-Calib: 3D Gaussian Splatting for Multimodal SpatioTemporal Calibration Quentin Herau et.al. 2403.11577 null
2024-03-17 STAIR: Semantic-Targeted Active Implicit Reconstruction Liren Jin et.al. 2403.11233 link
2024-03-16 MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections Mude Hui et.al. 2403.10815 link
2024-03-15 SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians Hiba Dahmani et.al. 2403.10427 null
2024-03-15 Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder Jinseok Kim et.al. 2403.10255 null
2024-03-14 SketchINR: A First Look into Sketches as Implicit Neural Representations Hmrishav Bandyopadhyay et.al. 2403.09344 link
2024-03-13 Representing Anatomical Trees by Denoising Diffusion of Implicit Neural Fields Ashish Sinha et.al. 2403.08974 link
2024-03-13 A Novel Implicit Neural Representation for Volume Data Armin Sheibanifard et.al. 2403.08566 null
2024-03-14 GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting Xinjie Zhang et.al. 2403.08551 link
2024-03-13 CINA: Conditional Implicit Neural Atlas for Spatio-Temporal Representation of Fetal Brains Maik Dannecker et.al. 2403.08550 null
2024-03-11 Multi-Scale Implicit Transformer with Re-parameterize for Arbitrary-Scale Super-Resolution Jinchen Zhu et.al. 2403.06536 null
2024-03-09 Fast Kernel Scene Flow Xueqian Li et.al. 2403.05896 link
2024-03-04 Ice-Tide: Implicit Cryo-ET Imaging and Deformation Estimation Valentin Debarnot et.al. 2403.02182 link
2024-02-28 NERV++: An Enhanced Implicit Neural Video Representation Ahmed Ghorbel et.al. 2402.18305 null
2024-03-08 Boosting Neural Representations for Videos with a Conditional Decoder Xinjie Zhang et.al. 2402.18152 link
2024-02-27 LoDIP: Low light phase retrieval with deep image prior Raunak Manekar et.al. 2402.17745 null
2024-02-27 Mesh-Agnostic Decoders for Supercritical Airfoil Prediction and Inverse Design Runze Li et.al. 2402.17299 null
2024-02-26 Neural Mesh Fusion: Unsupervised 3D Planar Surface Understanding Farhad G. Zanjani et.al. 2402.16739 null
2024-02-23 Smooth and Sparse Latent Dynamics in Operator Learning with Jerk Regularization Xiaoyu Xie et.al. 2402.15636 null
2024-02-22 CoLoRA: Continuous low-rank adaptation for reduced implicit neural modeling of parameterized partial differential equations Jules Berman et.al. 2402.14646 link
2024-02-21 Improving Efficiency of Iso-Surface Extraction on Implicit Neural Representations Using Uncertainty Propagation Haoyu Li et.al. 2402.13861 null
2024-02-21 SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields Zhentao Huang et.al. 2402.13510 null
2024-03-02 NeRF Solves Undersampled MRI Reconstruction Tae Jun Jang et.al. 2402.13226 null
2024-02-20 PDEformer: Towards a Foundation Model for One-Dimensional Partial Differential Equations Zhanhong Ye et.al. 2402.12652 null
2024-02-14 DUDF: Differentiable Unsigned Distance Fields with Hyperbolic Scaling Miguel Fainstein et.al. 2402.08876 link
2024-02-13 Preconditioners for the Stochastic Training of Implicit Neural Representations Shin-Fang Chng et.al. 2402.08784 null
2024-02-13 Pix2Code: Learning to Compose Neural Visual Concepts as Programs Antonia Wüst et.al. 2402.08280 link
2024-02-10 Training dynamics in Physics-Informed Neural Networks with feature mapping Chengxi Zeng et.al. 2402.06955 link
2024-02-08 A Sampling Theory Perspective on Activations for Implicit Neural Representations Hemanth Saratchandran et.al. 2402.05427 null
2024-02-06 OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving Guohang Yan et.al. 2402.03830 link
2024-02-05 Deep Equilibrium Models are Almost Equivalent to Not-so-deep Explicit Models for High-dimensional Gaussian Mixtures Zenan Ling et.al. 2402.02697 link
2024-02-03 Implicit Neural Representation of Tileable Material Textures Hallison Paz et.al. 2402.02208 null
2024-02-02 Immersive Video Compression using Implicit Neural Representations Ho Man Kwan et.al. 2402.01596 link
2024-02-11 Neural Trajectory Model: Implicit Neural Trajectory Representation for Trajectories Generation Zihan Yu et.al. 2402.01254 link
**202
