[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]
Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-16 | SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces | Sumit Chaturvedi et.al. | 2501.09756 | null |
2025-01-16 | Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps | Nanye Ma et.al. | 2501.09732 | null |
2025-01-16 | Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review | Masatoshi Uehara et.al. | 2501.09685 | null |
2025-01-16 | Pruning for Sparse Diffusion Models based on Gradient Flow | Ben Wan et.al. | 2501.09464 | null |
2025-01-16 | CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation | Hwan Heo et.al. | 2501.09433 | null |
2025-01-16 | Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse | Guangyuan Liu et.al. | 2501.09391 | null |
2025-01-16 | UVRM: A Scalable 3D Reconstruction Model from Unposed Videos | Shiu-hong Kao et.al. | 2501.09347 | null |
2025-01-16 | Domain-conditioned and Temporal-guided Diffusion Modeling for Accelerated Dynamic MRI Reconstruction | Liping Zhang et.al. | 2501.09305 | null |
2025-01-16 | Text Semantics to Flexible Design: A Residential Layout Generation Method Based on Stable Diffusion Model | Zijin Qiu et.al. | 2501.09279 | null |
2025-01-16 | PATCHEDSERVE: A Patch Management Framework for SLO-Optimized Hybrid Resolution Diffusion Serving | Desen Sun et.al. | 2501.09253 | null |
2025-01-15 | SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation | Aditya Bhat et.al. | 2501.09008 | null |
2025-01-15 | RepVideo: Rethinking Cross-Layer Representation for Video Generation | Chenyang Si et.al. | 2501.08994 | null |
2025-01-15 | Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution | Shao-Hao Lu et.al. | 2501.08819 | link |
2025-01-15 | Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models | Zerui Tao et.al. | 2501.08727 | null |
2025-01-15 | FlexiClip: Locality-Preserving Free-Form Character Animation | Anant Khandelwal et.al. | 2501.08676 | null |
2025-01-15 | TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis | Bailiang Jian et.al. | 2501.08667 | null |
2025-01-15 | Product of Gaussian Mixture Diffusion Model for non-linear MRI Inversion | Laurenz Nagler et.al. | 2501.08662 | null |
2025-01-15 | Joint Learning of Depth and Appearance for Portrait Image Animation | Xinya Ji et.al. | 2501.08649 | null |
2025-01-15 | Watermarking in Diffusion Model: Gaussian Shading with Exact Diffusion Inversion via Coupled Transformations (EDICT) | Krishna Panthi et.al. | 2501.08604 | null |
2025-01-15 | DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors | Runqi Wang et.al. | 2501.08553 | null |
2025-01-14 | DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models | Hyeonwoo Kim et.al. | 2501.08333 | null |
2025-01-14 | MangaNinja: Line Art Colorization with Precise Reference Following | Zhiheng Liu et.al. | 2501.08332 | null |
2025-01-14 | Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise | Ryan Burgert et.al. | 2501.08331 | link |
2025-01-14 | GameFactory: Creating New Games with Generative Interactive Videos | Jiwen Yu et.al. | 2501.08325 | null |
2025-01-14 | Diffusion Adversarial Post-Training for One-Step Video Generation | Shanchuan Lin et.al. | 2501.08316 | null |
2025-01-14 | LayerAnimate: Layer-specific Control for Animation | Yuxue Yang et.al. | 2501.08295 | null |
2025-01-14 | Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints | Jonathan Nöther et.al. | 2501.08246 | null |
2025-01-14 | FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors | Yabo Zhang et.al. | 2501.08225 | link |
2025-01-14 | D |
Qian Zeng et.al. | 2501.08180 | link |
2025-01-14 | Decision Transformers for RIS-Assisted Systems with Diffusion Model-Based Channel Acquisition | Jie Zhang et.al. | 2501.08007 | null |
2025-01-13 | Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss | Xinyu Zhang et.al. | 2501.07563 | null |
2025-01-13 | Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection | Shiman Zhang et.al. | 2501.07533 | link |
2025-01-13 | IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion | Tharun Anand et.al. | 2501.07530 | null |
2025-01-13 | PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations | Ting-Yu Dai et.al. | 2501.07447 | null |
2025-01-13 | Diff-Ensembler: Learning to Ensemble 2D Diffusion Models for Volume-to-Volume Medical Image Translation | Xiyue Zhu et.al. | 2501.07430 | null |
2025-01-13 | OCORD: Open-Campus Object Removal Dataset | Shuo Zhang et.al. | 2501.07397 | null |
2025-01-13 | Bigger Isn't Always Better: Towards a General Prior for Medical Image Reconstruction | Lukas Glaszner et.al. | 2501.07376 | null |
2025-01-13 | Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion | Li Liang et.al. | 2501.07260 | link |
2025-01-13 | D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation | Zhejun Zhang et.al. | 2501.07077 | link |
2025-01-13 | Erasing Noise in Signal Detection with Diffusion Model: From Theory to Application | Xiucheng Wang et.al. | 2501.07030 | null |
2025-01-10 | From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training | Julius Berner et.al. | 2501.06148 | link |
2025-01-10 | Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction | Cecilia Curreli et.al. | 2501.06035 | null |
2025-01-10 | CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control | Stefan Popov et.al. | 2501.06006 | null |
2025-01-10 | Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion | Michal Švento et.al. | 2501.05959 | null |
2025-01-10 | Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation | Minxing Luo et.al. | 2501.05892 | null |
2025-01-10 | Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models | Sofia Jamil et.al. | 2501.05839 | link |
2025-01-10 | Diffusion Models for Smarter UAVs: Decision-Making and Modeling | Yousef Emami et.al. | 2501.05819 | null |
2025-01-10 | Alignment without Over-optimization: Training-Free Solution for Diffusion Models | Sunwoo Kim et.al. | 2501.05803 | link |
2025-01-10 | Conditional Diffusion Model for Electrical Impedance Tomography | Duanpeng Shi et.al. | 2501.05769 | null |
2025-01-10 | StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation | Shangjin Zhai et.al. | 2501.05763 | null |
2025-01-09 | Decentralized Diffusion Models | David McAllister et.al. | 2501.05450 | null |
2025-01-09 | Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces | Aniruddha Mahapatra et.al. | 2501.05442 | null |
2025-01-09 | The GAN is dead; long live the GAN! A Modern GAN Baseline | Yiwen Huang et.al. | 2501.05441 | link |
2025-01-09 | Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation | Xuyi Meng et.al. | 2501.05427 | null |
2025-01-09 | TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts | Yu-Hao Huang et.al. | 2501.05403 | null |
2025-01-09 | Accelerated Diffusion Models via Speculative Sampling | Valentin De Bortoli et.al. | 2501.05370 | null |
2025-01-09 | CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models | Junha Park et.al. | 2501.05359 | null |
2025-01-09 | Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes | Ludwic Leonard et.al. | 2501.05226 | null |
2025-01-09 | FaceMe: Robust Blind Face Restoration with Personal Identification | Siyu Liu et.al. | 2501.05177 | null |
2025-01-09 | EquiBoost: An Equivariant Boosting Approach to Molecular Conformation Generation | Yixuan Yang et.al. | 2501.05109 | null |
2025-01-08 | EditAR: Unified Conditional Generation with Autoregressive Models | Jiteng Mu et.al. | 2501.04699 | null |
2025-01-08 | ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning | Yuzhou Huang et.al. | 2501.04698 | null |
2025-01-08 | SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images | Zixuan Huang et.al. | 2501.04689 | null |
2025-01-08 | A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI | Kazusato Oko et.al. | 2501.04641 | link |
2025-01-08 | Disentangled Clothed Avatar Generation with Layered Representation | Weitian Zhang et.al. | 2501.04631 | null |
2025-01-09 | MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation | Daniele Molino et.al. | 2501.04614 | null |
2025-01-08 | Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion | Yangfan He et.al. | 2501.04606 | link |
2025-01-08 | ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training | Xinfa Zhu et.al. | 2501.04416 | null |
2025-01-08 | Edit as You See: Image-guided Video Editing via Masked Motion Modeling | Zhi-Lin Huang et.al. | 2501.04325 | null |
2025-01-08 | DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models | Hyogon Ryu et.al. | 2501.04304 | null |
2025-01-07 | NeuralSVG: An Implicit Representation for Text-to-Vector Generation | Sagi Polaczek et.al. | 2501.03992 | null |
2025-01-07 | Stabilising effect of generic anomalous diffusion independent of the Rayleigh number | Antonio Barletta et.al. | 2501.03990 | null |
2025-01-07 | A precise asymptotic analysis of learning diffusion models: theory and insights | Hugo Cui et.al. | 2501.03937 | link |
2025-01-07 | Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers | Yuechen Zhang et.al. | 2501.03931 | link |
2025-01-07 | Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control | Zekai Gu et.al. | 2501.03847 | link |
2025-01-07 | Impact of diffusion mechanisms on persistence and spreading | Nathanaël Boutillon et.al. | 2501.03816 | null |
2025-01-07 | Mixing by Internal Gravity Waves in Stars: Assessing Numerical Simulations Against Theory | Jack Morton et.al. | 2501.03796 | null |
2025-01-07 | Exploring Molecule Generation Using Latent Space Graph Diffusion | Prashanth Pombala et.al. | 2501.03696 | link |
2025-01-07 | MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer | Junsheng Luan et.al. | 2501.03630 | null |
2025-01-07 | FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis | Xiaojiao Xiao et.al. | 2501.03526 | link |
2025-01-06 | MObI: Multimodal Object Inpainting Using Diffusion Models | Alexandru Buburuzan et.al. | 2501.03173 | null |
2025-01-06 | Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches | Alhassan Mumuni et.al. | 2501.03151 | null |
2025-01-06 | DDRM-PR: Fourier Phase Retrieval using Denoising Diffusion Restoration Models | Mehmet Onurcan Kaya et.al. | 2501.03030 | link |
2025-01-06 | STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution | Rui Xie et.al. | 2501.02976 | null |
2025-01-07 | SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | Jiawei Liu et.al. | 2501.02962 | null |
2025-01-06 | Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions | Jianhua Pei et.al. | 2501.02928 | null |
2025-01-06 | Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis | Thang-Anh-Quan Nguyen et.al. | 2501.02913 | null |
2025-01-06 | Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems | Shayan Mohajer Hamidi et.al. | 2501.02880 | null |
2025-01-06 | Towards HRTF Personalization using Denoising Diffusion Models | Juan Camilo Albarracín Sánchez et.al. | 2501.02871 | null |
2025-01-06 | Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans | Rezkellah Noureddine Khiati et.al. | 2501.02867 | null |
2025-01-03 | Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models | Manh Duong Nguyen et.al. | 2501.01932 | link |
2025-01-03 | Nonparametric estimation of a factorizable density using diffusion models | Hyeok Kyu Kwon et.al. | 2501.01783 | null |
2025-01-03 | Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models | Andrea Matteazzi et.al. | 2501.01761 | null |
2025-01-03 | ACE: Anti-Editing Concept Erasure in Text-to-Image Models | Zihao Wang et.al. | 2501.01633 | link |
2025-01-03 | Multivariate Time Series Anomaly Detection using DiffGAN Model | Guangqiang Wu et.al. | 2501.01591 | link |
2025-01-02 | Denoising Diffused Embeddings: a Generative Approach for Hypergraphs | Shihao Wu et.al. | 2501.01541 | null |
2025-01-02 | Object-level Visual Prompts for Compositional Image Generation | Gaurav Parmar et.al. | 2501.01424 | null |
2025-01-02 | Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models | Jingfeng Yao et.al. | 2501.01423 | link |
2025-01-02 | Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement | Z. Zhang et.al. | 2501.01368 | null |
2025-01-03 | Conditional Consistency Guided Image Translation and Enhancement | Amil Bhagat et.al. | 2501.01223 | link |
2025-01-02 | Semantics-Guided Diffusion for Deep Joint Source-Channel Coding in Wireless Image Transmission | Maojun Zhang et.al. | 2501.01138 | null |
2025-01-02 | EliGen: Entity-Level Controlled Image Generation with Regional Attention | Hong Zhang et.al. | 2501.01097 | link |
2025-01-02 | DiffCL: A Diffusion-Based Contrastive Learning Framework with Semantic Alignment for Multimodal Recommendations | Qiya Song et.al. | 2501.01066 | null |
2025-01-02 | Optimizing Noise Schedules of Generative Models in High Dimensionss | Santiago Aranguri et.al. | 2501.00988 | null |
2025-01-01 | Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model | Omid Saghatchian et.al. | 2501.00946 | link |
2025-01-01 | Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion | Hao Wang et.al. | 2501.00944 | null |
2025-01-02 | Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation | Yuanbo Yang et.al. | 2412.21117 | null |
2024-12-30 | Quantum Diffusion Model for Quark and Gluon Jet Generation | Mariia Baidachna et.al. | 2412.21082 | link |
2025-01-02 | Edicho: Consistent Image Editing in the Wild | Qingyan Bai et.al. | 2412.21079 | link |
2024-12-30 | Varformer: Adapting VAR's Generative Prior for Image Restoration | Siyang Wang et.al. | 2412.21063 | link |
2024-12-30 | E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models | Zhiyu Tan et.al. | 2412.21044 | null |
2024-12-30 | Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration | Wanglong Lu et.al. | 2412.21042 | link |
2024-12-30 | AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies | Yibo Wen et.al. | 2412.20984 | null |
2024-12-30 | Influence Maximization in Temporal Networks with Persistent and Reactive Behaviors | Aaqib Zahoor et.al. | 2412.20936 | null |
2024-12-30 | DDIM sampling for Generative AIBIM, a faster intelligent structural design framework | Zhili He et.al. | 2412.20899 | null |
2024-12-30 | VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control | Shaojin Wu et.al. | 2412.20800 | link |
2024-12-27 | VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models | Tao Wu et.al. | 2412.19645 | null |
2024-12-27 | StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture | Miaomiao Dai et.al. | 2412.19535 | null |
2024-12-27 | RobotDiffuse: Motion Planning for Redundant Manipulator based on Diffusion Model | Xiaohan Zhang et.al. | 2412.19500 | link |
2024-12-27 | RAIN: Real-time Animation of Infinite Video Stream | Zhilei Shu et.al. | 2412.19489 | null |
2024-12-27 | DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Yiyuan Liang et.al. | 2412.19458 | link |
2024-12-27 | Multi-scale Latent Point Consistency Models for 3D Shape Generation | Bi'an Du et.al. | 2412.19413 | null |
2024-12-27 | A Generalized Einstein Relation for Markovian Friction Coefficients from Molecular Trajectories | J. M. Hall et.al. | 2412.19398 | null |
2024-12-26 | 6Diffusion: IPv6 Target Generation Using a Diffusion Model with Global-Local Attention Mechanisms for Internet-wide IPv6 Scanning | Nabo He et.al. | 2412.19243 | null |
2024-12-26 | Mask Approximation Net: Merging Feature Extraction and Distribution Learning for Remote Sensing Change Captioning | Dongwei Sun et.al. | 2412.19179 | null |
2024-12-26 | Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models | Hyesong Choi et.al. | 2412.19104 | null |
2024-12-24 | PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models | Minghao Chen et.al. | 2412.18608 | null |
2024-12-24 | DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers | Yuntao Chen et.al. | 2412.18607 | null |
2024-12-24 | Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models | Tahira Kazimi et.al. | 2412.18604 | null |
2024-12-24 | DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Minghong Cai et.al. | 2412.18597 | link |
2024-12-24 | LatentCRF: Continuous CRF for Efficient Latent Diffusion | Kanchana Ranasinghe et.al. | 2412.18596 | null |
2024-12-24 | Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation | Anselm Krainovic et.al. | 2412.18584 | null |
2024-12-24 | 3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement | Yihang Luo et.al. | 2412.18565 | null |
2024-12-24 | Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models | Qice Qin et.al. | 2412.18421 | null |
2024-12-24 | Discovery of 2D Materials via Symmetry-Constrained Diffusion Model | Shihang Xu et.al. | 2412.18414 | null |
2024-12-24 | FameBias: Embedding Manipulation Bias Attack in Text-to-Image Models | Jaechul Roh et.al. | 2412.18302 | null |
2024-12-23 | FaceLift: Single Image to 3D Head with View Generation and GS-LRM | Weijie Lyu et.al. | 2412.17812 | null |
2024-12-23 | PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion | Sophia Tang et.al. | 2412.17780 | null |
2024-12-23 | The Superposition of Diffusion Models Using the Itô Density Estimator | Marta Skreta et.al. | 2412.17762 | null |
2024-12-23 | A Bias-Free Training Paradigm for More General AI-generated Image Detection | Fabrizio Guillaro et.al. | 2412.17671 | null |
2024-12-23 | Benchmarking Generative AI Models for Deep Learning Test Input Generation | Maryam et.al. | 2412.17652 | link |
2024-12-23 | DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder | Ente Lin et.al. | 2412.17644 | null |
2024-12-23 | Retention Score: Quantifying Jailbreak Risks for Vision Language Models | Zaitang Li et.al. | 2412.17544 | null |
2024-12-23 | DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak | Hao Wang et.al. | 2412.17522 | null |
2024-12-23 | Heterogeneous carrying capacities and global extinction in metapopulations | Jakub Hesoun et.al. | 2412.17461 | null |
2024-12-23 | AeroDiT: Diffusion Transformers for Reynolds-Averaged Navier-Stokes Simulations of Airfoil Flows | Hui Xiang et.al. | 2412.17394 | null |
2024-12-20 | Personalized Representation from Personalized Generation | Shobhita Sundaram et.al. | 2412.16156 | link |
2024-12-20 | Predicting human cooperation: sensitizing drift-diffusion model to interaction and external stimuli | Lucila G. Alvarez-Zuzek et.al. | 2412.16121 | null |
2024-12-20 | Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation | Timur Sattarov et.al. | 2412.16083 | null |
2024-12-20 | Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy | Shaoyan Pan et.al. | 2412.16050 | null |
2024-12-20 | SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation | Jiadong Pan et.al. | 2412.16039 | null |
2024-12-20 | Semi-Supervised Adaptation of Diffusion Models for Handwritten Text Generation | Kai Brandenbusch et.al. | 2412.15853 | null |
2024-12-20 | Electromagnetic particle-in-cell modeling of an electron cyclotron resonance plasma discharge in hydrogen | D. Eremin et.al. | 2412.15802 | null |
2024-12-20 | Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance | Hyunsoo Lee et.al. | 2412.15798 | null |
2024-12-20 | Learning Group Interactions and Semantic Intentions for Multi-Object Trajectory Prediction | Mengshi Qi et.al. | 2412.15673 | link |
2024-12-20 | BS-LDM: Effective Bone Suppression in High-Resolution Chest X-Ray Images with Conditional Latent Diffusion Models | Yifei Sun et.al. | 2412.15670 | link |
2024-12-19 | LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Hanlin Wang et.al. | 2412.15214 | link |
2024-12-19 | Flowing from Words to Pixels: A Framework for Cross-Modality Evolution | Qihao Liu et.al. | 2412.15213 | null |
2024-12-19 | Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation | Hadi Alzayer et.al. | 2412.15211 | null |
2024-12-19 | AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Moayed Haji-Ali et.al. | 2412.15191 | null |
2024-12-19 | Tiled Diffusion | Or Madar et.al. | 2412.15185 | null |
2024-12-19 | OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization | Jiacheng Zhang et.al. | 2412.15159 | null |
2024-12-19 | Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM | Yatai Ji et.al. | 2412.15156 | link |
2024-12-19 | Jet: A Modern Transformer-Based Normalizing Flow | Alexander Kolesnikov et.al. | 2412.15129 | null |
2024-12-19 | Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion | Zhifei Chen et.al. | 2412.15050 | null |
2024-12-19 | DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space | Mang Ning et.al. | 2412.15032 | link |
2024-12-18 | AniDoc: Animation Creation Made Easier | Yihao Meng et.al. | 2412.14173 | null |
2024-12-19 | E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling | Zhihang Yuan et.al. | 2412.14170 | null |
2024-12-18 | Autoregressive Video Generation without Vector Quantization | Haoge Deng et.al. | 2412.14169 | link |
2024-12-18 | VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | Runtao Liu et.al. | 2412.14167 | null |
2024-12-18 | MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation | Shenhao Zhu et.al. | 2412.14148 | null |
2024-12-18 | SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation | Tong Chen et.al. | 2412.14018 | null |
2024-12-18 | Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates | Sen Yan et.al. | 2412.13966 | null |
2024-12-18 | IDEQ: an improved diffusion model for the TSP | Mickael Basson et.al. | 2412.13858 | null |
2024-12-18 | Object Style Diffusion for Generalized Object Detection in Urban Scene | Hao Li et.al. | 2412.13815 | null |
2024-12-18 | Text2Relight: Creative Portrait Relighting with Text Guidance | Junuk Cha et.al. | 2412.13734 | null |
2024-12-17 | CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models | Gaoyang Zhang et.al. | 2412.13195 | link |
2024-12-17 | StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models | Yunzhi Yan et.al. | 2412.13188 | null |
2024-12-17 | Move-in-2D: 2D-Conditioned Human Motion Generation | Hsin-Ping Huang et.al. | 2412.13185 | null |
2024-12-17 | Prompt Augmentation for Self-supervised Text-guided Image Manipulation | Rumeysa Bodur et.al. | 2412.13081 | null |
2024-12-17 | 3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation | Haoshen Wang et.al. | 2412.13059 | null |
2024-12-18 | Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance | Wenhao Sun et.al. | 2412.12974 | link |
2024-12-17 | ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecasting | Guillaume Couairon et.al. | 2412.12971 | link |
2024-12-17 | Generation of cosmic ray trajectories by a Diffusion Model trained on test particles in 3D magnetohydrodynamic turbulence | Johannes Martin et.al. | 2412.12923 | null |
2024-12-17 | Unsupervised Region-Based Image Editing of Denoising Diffusion Models | Zixiang Li et.al. | 2412.12912 | null |
2024-12-17 | ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction | Zhongjie Duan et.al. | 2412.12888 | link |
2024-12-16 | Causal Diffusion Transformers for Generative Modeling | Chaorui Deng et.al. | 2412.12095 | link |
2024-12-16 | CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models | Felix Taubner et.al. | 2412.12093 | null |
2024-12-16 | Wonderland: Navigating 3D Scenes from a Single Image | Hanwen Liang et.al. | 2412.12091 | null |
2024-12-16 | A LoRA is Worth a Thousand Pictures | Chenxi Liu et.al. | 2412.12048 | null |
2024-12-16 | The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation | Gilles Mordant et.al. | 2412.12007 | null |
2024-12-16 | Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data | Onur Tasar et.al. | 2412.11972 | null |
2024-12-16 | ColorFlow: Retrieval-Augmented Image Sequence Colorization | Junhao Zhuang et.al. | 2412.11815 | null |
2024-12-16 | InterDyn: Controllable Interactive Dynamics with Video Diffusion Models | Rick Akkerman et.al. | 2412.11785 | null |
2024-12-16 | Joint Reconstruction of the Activity and the Attenuation in PET by Diffusion Posterior Sampling: a Feasibility Study | Clémentine Phung-Ngoc et.al. | 2412.11776 | null |
2024-12-16 | No More Adam: Learning Rate Scaling at Initialization is All You Need | Minghao Xu et.al. | 2412.11768 | link |
2024-12-13 | Towards a foundation model for heavy-ion collision experiments through point cloud diffusion | Manjunath Omana Kuttan et.al. | 2412.10352 | null |
2024-12-13 | BrushEdit: All-In-One Image Inpainting and Editing | Yaowei Li et.al. | 2412.10316 | null |
2024-12-13 | Coherent 3D Scene Diffusion From a Single RGB Image | Manuel Dahnert et.al. | 2412.10294 | null |
2024-12-13 | GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion | Jiapeng Tang et.al. | 2412.10209 | null |
2024-12-13 | Efficient Generative Modeling with Residual Vector Quantization-Based Tokens | Jaehyeon Kim et.al. | 2412.10208 | null |
2024-12-13 | Simple Guidance Mechanisms for Discrete Diffusion Models | Yair Schiff et.al. | 2412.10193 | link |
2024-12-13 | SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models | Hung Nguyen et.al. | 2412.10178 | null |
2024-12-13 | The Art of Deception: Color Visual Illusions and Diffusion Models | Alex Gomez-Villa et.al. | 2412.10122 | null |
2024-12-13 | SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution | Runyi Hu et.al. | 2412.10049 | null |
2024-12-13 | Emergence of complexity in opinion propagation: A reaction-diffusion model | Romain Ducasse et.al. | 2412.10000 | null |
2024-12-12 | FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion | Haonan Qiu et.al. | 2412.09626 | null |
2024-12-12 | Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors | Yue Feng et.al. | 2412.09625 | null |
2024-12-12 | OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation | Weiqi Li et.al. | 2412.09623 | null |
2024-12-12 | LoRACLR: Contrastive Adaptation for Customization of Diffusion Models | Enis Simsar et.al. | 2412.09622 | null |
2024-12-12 | SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training | Dongting Hu et.al. | 2412.09619 | null |
2024-12-12 | EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Zhuofan Zong et.al. | 2412.09618 | null |
2024-12-12 | Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG | Kavana Venkatesh et.al. | 2412.09614 | null |
2024-12-12 | LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors | Yabo Chen et.al. | 2412.09597 | null |
2024-12-12 | Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion | Zexin He et.al. | 2412.09593 | null |
2024-12-12 | SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing | Xueting Li et.al. | 2412.09545 | null |
2024-12-11 | Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren et.al. | 2412.08642 | null |
2024-12-11 | DMin: Scalable Training Data Influence Estimation for Diffusion Models | Huawei Lin et.al. | 2412.08637 | link |
2024-12-11 | TryOffAnyone: Tiled Cloth Generation from a Dressed Person | Ioannis Xarchakos et.al. | 2412.08573 | link |
2024-12-11 | Learning Flow Fields in Attention for Controllable Person Image Generation | Zijian Zhou et.al. | 2412.08486 | link |
2024-12-11 | InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models | Min Hou et.al. | 2412.08480 | link |
2024-12-11 | CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis | Mu Zhang et.al. | 2412.08464 | null |
2024-12-11 | Reliable Uncertainty Quantification for Fiber Orientation in Composite Molding Processes using Multilevel Polynomial Surrogates | Stjepan Salatovic et.al. | 2412.08459 | null |
2024-12-12 | Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views | Songchun Zhang et.al. | 2412.08412 | null |
2024-12-11 | Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR3 | Joao Carvalho et.al. | 2412.08398 | null |
2024-12-11 | Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion | Jisheng Chu et.al. | 2412.08326 | link |
2024-12-10 | Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets | Zhen Liu et.al. | 2412.07775 | null |
2024-12-10 | From Slow Bidirectional to Fast Causal Video Generators | Tianwei Yin et.al. | 2412.07772 | null |
2024-12-10 | Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds | Xiaoyu Xiang et.al. | 2412.07766 | null |
2024-12-10 | Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation | Jingxi Chen et.al. | 2412.07761 | null |
2024-12-10 | SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints | Jianhong Bai et.al. | 2412.07760 | link |
2024-12-10 | Multi-Shot Character Consistency for Text-to-Video Generation | Yuval Atzmon et.al. | 2412.07750 | null |
2024-12-10 | FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models | Tong Wu et.al. | 2412.07674 | null |
2024-12-10 | TraSCE: Trajectory Steering for Concept Erasure | Anubhav Jain et.al. | 2412.07658 | link |
2024-12-11 | Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model | Jiahua Xu et.al. | 2412.07590 | link |
2024-12-10 | DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation | Jianzong Wu et.al. | 2412.07589 | null |
2024-12-09 | [MASK] is All You Need | Vincent Tao Hu et.al. | 2412.06787 | link |
2024-12-09 | Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation | Ruihan Gao et.al. | 2412.06785 | link |
2024-12-09 | Diverse Score Distillation | Yanbo Xu et.al. | 2412.06780 | null |
2024-12-09 | Visual Lexicon: Rich Image Features in Language Space | XuDong Wang et.al. | 2412.06774 | null |
2024-12-09 | InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention | Howard Zhang et.al. | 2412.06753 | null |
2024-12-09 | ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet | Andrei-Robert Alexandrescu et.al. | 2412.06742 | null |
2024-12-09 | Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection | Caiyun Xie et.al. | 2412.06727 | link |
2024-12-09 | You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale | Baorui Ma et.al. | 2412.06699 | link |
2024-12-09 | Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy | Yuxuan Xue et.al. | 2412.06698 | null |
2024-12-09 | Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset | Shanshan Wang et.al. | 2412.06666 | null |
2024-12-06 | Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories | Susung Hong et.al. | 2412.05279 | null |
2024-12-06 | Birth and Death of a Rose | Chen Geng et.al. | 2412.05278 | null |
2024-12-06 | MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models | Tuna Han Salih Meral et.al. | 2412.05275 | null |
2024-12-06 | Go-or-Grow Models in Biology: a Monster on a Leash | R. Thiessen et.al. | 2412.05191 | null |
2024-12-06 | DNF: Unconditional 4D Generation with Dictionary-based Neural Fields | Xinyi Zhang et.al. | 2412.05161 | null |
2024-12-06 | Probabilistic Galaxy Field Generation with Diffusion Models | Tanner Sether et.al. | 2412.05131 | null |
2024-12-06 | The Silent Prompt: Initial Noise as Implicit Guidance for Goal-Driven Image Generation | Ruoyu Wang et.al. | 2412.05101 | null |
2024-12-06 | ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration | Chi-Wei Hsiao et.al. | 2412.05043 | null |
2024-12-06 | Noise Matters: Diffusion Model-based Urban Mobility Generation with Collaborative Noise Priors | Yuheng Zhang et.al. | 2412.05000 | null |
2024-12-06 | Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction | Gaurav Shrivastava et.al. | 2412.04929 | null |
2024-12-05 | PaintScene4D: Consistent 4D Scene Generation from Text Prompts | Vinayak Gupta et.al. | 2412.04471 | null |
2024-12-05 | LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors | Yusuf Dalva et.al. | 2412.04460 | null |
2024-12-05 | Four-Plane Factorized Video Autoencoders | Mohammed Suhail et.al. | 2412.04452 | null |
2024-12-05 | MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation | Longtao Zheng et.al. | 2412.04448 | null |
2024-12-05 | DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models | Yizhuo Li et.al. | 2412.04446 | null |
2024-12-05 | Learning Artistic Signatures: Symmetry Discovery and Style Transfer | Emma Finn et.al. | 2412.04441 | null |
2024-12-05 | Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation | Yuying Ge et.al. | 2412.04432 | link |
2024-12-05 | Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis | Jian Han et.al. | 2412.04431 | link |
2024-12-05 | Reversible molecular simulation for training classical and machine learning force fields | Joe G Greener et.al. | 2412.04374 | link |
2024-12-05 | ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation | Dayoung Gong et.al. | 2412.04353 | null |
2024-12-04 | MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation | Zehuan Huang et.al. | 2412.03558 | null |
2024-12-04 | NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images | Lingen Li et.al. | 2412.03517 | null |
2024-12-04 | Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion | Shengyuan Zhang et.al. | 2412.03515 | link |
2024-12-04 | CleanDIFT: Diffusion Features without Noise | Nick Stracke et.al. | 2412.03439 | link |
2024-12-04 | SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model | Yan Li et.al. | 2412.03430 | null |
2024-12-04 | Skel3D: Skeleton Guided Novel View Synthesis | Aron Fóthi et.al. | 2412.03407 | null |
2024-12-04 | Identifiability implies consistency of MLE in partially observed diffusions on a torus | Ibrahim Ekren et.al. | 2412.03380 | null |
2024-12-04 | TASR: Timestep-Aware Diffusion Model for Image Super-Resolution | Qinwei Lin et.al. | 2412.03355 | link |
2024-12-04 | DIVE: Taming DINO for Subject-Driven Video Editing | Yi Huang et.al. | 2412.03347 | null |
2024-12-04 | Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis | Tao Jun Lin et.al. | 2412.03315 | null |
2024-12-03 | Diffusion-based Visual Anagram as Multi-task Learning | Zhiyuan Xu et.al. | 2412.02693 | link |
2024-12-03 | FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation | Kefan Chen et.al. | 2412.02690 | null |
2024-12-04 | SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance | Viet Nguyen et.al. | 2412.02687 | null |
2024-12-03 | Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation | Yiftach Edelstein et.al. | 2412.02631 | null |
2024-12-03 | Unveiling Concept Attribution in Diffusion Models | Quang H. Nguyen et.al. | 2412.02542 | null |
2024-12-03 | It Takes Two: Real-time Co-Speech Two-person's Interaction Generation via Reactive Auto-regressive Diffusion Model | Mingyi Shi et.al. | 2412.02419 | null |
2024-12-03 | GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing | Khawar Islam et.al. | 2412.02366 | null |
2024-12-03 | LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization | Ethan Smith et.al. | 2412.02352 | null |
2024-12-03 | SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models | Sabina Martyniak et.al. | 2412.02332 | link |
2024-12-03 | Controlling the Latent Diffusion Model for Generative Image Shadow Removal via Residual Generation | Xinjie Li et.al. | 2412.02322 | null |
2024-11-29 | MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks | Yiming Wu et.al. | 2411.19786 | null |
2024-11-29 | Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy | Jeheon Woo et.al. | 2411.19769 | null |
2024-11-29 | TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting | Bojun Xiong et.al. | 2411.19654 | null |
2024-11-29 | Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing | Wenyi Mo et.al. | 2411.19652 | link |
2024-11-29 | Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook | Florinel-Alin Croitoru et.al. | 2411.19537 | link |
2024-11-29 | Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis | Tianqi Li et.al. | 2411.19509 | null |
2024-11-29 | Diffusion Models Meet Network Management: Improving Traffic Matrix Analysis with Diffusion-based Approach | Xinyu Yuan et.al. | 2411.19493 | link |
2024-11-28 | DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models | Shwetha Ram et.al. | 2411.19390 | null |
2024-11-28 | Enhancing Sketch Animation: Text-to-Video Diffusion Models with Temporal Consistency and Rigidity Constraints | Gaurav Rai et.al. | 2411.19381 | null |
2024-11-28 | Towards a Mechanistic Explanation of Diffusion Model Generalization | Matthew Niedoba et.al. | 2411.19339 | null |
2024-11-27 | GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data | Wentao Wang et.al. | 2411.18624 | null |
2024-11-27 | Diffusion Self-Distillation for Zero-Shot Customized Image Generation | Shengqu Cai et.al. | 2411.18616 | null |
2024-11-27 | CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models | Rundi Wu et.al. | 2411.18613 | null |
2024-11-27 | Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis | Eva Prakash et.al. | 2411.18602 | null |
2024-11-27 | FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion | Haosen Yang et.al. | 2411.18552 | null |
2024-11-28 | Enhancing weed detection performance by means of GenAI-based image augmentation | Sourav Modak et.al. | 2411.18513 | null |
2024-11-27 | Learning the Evolution of Physical Structure of Galaxies via Diffusion Models | Andrew Lizarraga et.al. | 2411.18440 | link |
2024-11-27 | Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models | Yiming Wu et.al. | 2411.18375 | null |
2024-11-27 | TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models | Riza Velioglu et.al. | 2411.18350 | link |
2024-11-27 | HiFiVFS: High Fidelity Video Face Swapping | Xu Chen et.al. | 2411.18293 | null |
2024-11-27 | StableAnimator: High-Quality Identity-Preserving Human Image Animation | Shuyuan Tu et.al. | 2411.17697 | link |
2024-11-26 | ScribbleLight: Single Image Indoor Relighting with Scribbles | Jun Myeong Choi et.al. | 2411.17696 | null |
2024-11-26 | GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2411.17687 | null |
2024-11-26 | Accelerating Vision Diffusion Transformers with Skip Branches | Guanjie Chen et.al. | 2411.17616 | link |
2024-11-26 | VideoDirector: Precise Video Editing via Text-to-Video Models | Yukun Wang et.al. | 2411.17592 | null |
2024-11-26 | FTMoMamba: Motion Generation with Frequency and Text State Space Models | Chengjian Li et.al. | 2411.17532 | null |
2024-11-26 | WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model | Zongjian Li et.al. | 2411.17459 | link |
2024-11-26 | Image Generation with Multimodule Semantic Feature-Aided Selection for Semantic Communications | Chengyang Liang et.al. | 2411.17428 | null |
2024-11-26 | Reward Incremental Learning in Text-to-Image Generation | Maorong Wang et.al. | 2411.17310 | null |
2024-11-26 | APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents | Jun Yu Chen et.al. | 2411.17255 | link |
2024-11-25 | Generative Omnimatte: Learning to Decompose Video into Layers | Yao-Chih Lee et.al. | 2411.16683 | null |
2024-11-25 | Diffusion Features for Zero-Shot 6DoF Object Pose Estimation | Bernd Von Gimborn et.al. | 2411.16668 | null |
2024-11-25 | LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction | Yiran Sun et.al. | 2411.16629 | link |
2024-11-25 | Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models | Ronghuan Wu et.al. | 2411.16602 | null |
2024-11-25 | Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification | Andre Kassis et.al. | 2411.16598 | link |
2024-11-25 | Rethinking Diffusion for Text-Driven Human Motion Generation | Zichong Meng et.al. | 2411.16575 | null |
2024-11-25 | Representation Collapsing Problems in Vector Quantization | Wenhao Zhao et.al. | 2411.16550 | null |
2024-11-25 | ADOBI: Adaptive Diffusion Bridge For Blind Inverse Problems with Application to MRI Reconstruction | Yuyang Hu et.al. | 2411.16535 | null |
2024-11-25 | Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis | Boming Miao et.al. | 2411.16503 | null |
2024-11-25 | Model-based reinforcement corrosion prediction: Continuous calibration with Bayesian optimization and corrosion wire sensor data | A. Potnis et.al. | 2411.16447 | null |
2024-11-22 | DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving | Bencheng Liao et.al. | 2411.15139 | link |
2024-11-22 | Material Anything: Generating Materials for Any 3D Object via Diffusion | Xin Huang et.al. | 2411.15138 | null |
2024-11-22 | VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement | Daeun Lee et.al. | 2411.15115 | null |
2024-11-22 | Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation | Lakshmikar R. Polamreddy et.al. | 2411.15084 | link |
2024-11-22 | The 1D nonlocal Fisher-KPP equation with a top hat kernel. Part 3. The effect of perturbations in the kernel | David John Needham et.al. | 2411.15054 | null |
2024-11-22 | FloAt: Flow Warping of Self-Attention for Clothing Animation Generation | Swasti Shreya Mishra et.al. | 2411.15028 | null |
2024-11-22 | Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation | Huy Le et.al. | 2411.14913 | null |
2024-11-22 | Prioritize Denoising Steps on Diffusion Model Preference Alignment via Explicit Denoised Distribution Estimation | Dingyuan Shi et.al. | 2411.14871 | null |
2024-11-22 | Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation | Jeongsol Kim et.al. | 2411.14863 | null |
2024-11-22 | Style-Friendly SNR Sampler for Style-Driven Generation | Jooyoung Choi et.al. | 2411.14793 | null |
2024-11-21 | Stable Flow: Vital Layers for Training-Free Image Editing | Omri Avrahami et.al. | 2411.14430 | null |
2024-11-21 | Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation | Yuanhao Cai et.al. | 2411.14384 | null |
2024-11-21 | CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields | Xin-Yang Liu et.al. | 2411.14378 | null |
2024-11-21 | Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models | Houze Liu et.al. | 2411.14353 | null |
2024-11-21 | StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart | Jian Shi et.al. | 2411.14295 | null |
2024-11-21 | Guided MRI Reconstruction via Schrödinger Bridge | Yue Wang et.al. | 2411.14269 | null |
2024-11-21 | TaQ-DiT: Time-aware Quantization for Diffusion Transformers | Xinyan Liu et.al. | 2411.14172 | null |
2024-11-21 | RestorerID: Towards Tuning-Free Face Restoration with ID Preservation | Jiacheng Ying et.al. | 2411.14125 | link |
2024-11-21 | Point Cloud Resampling with Learnable Heat Diffusion | Wenqiang Xu et.al. | 2411.14120 | null |
2024-11-21 | Transforming Static Images Using Generative Models for Video Salient Object Detection | Suhwan Cho et.al. | 2411.13975 | link |
2024-11-20 | REDUCIO! Generating 1024 |
Rui Tian et.al. | 2411.13552 | link |
2024-11-20 | Identity Preserving 3D Head Stylization with Multiview Score Distillation | Bahri Batuhan Bilecen et.al. | 2411.13536 | null |
2024-11-20 | Heuristically Adaptive Diffusion-Model Evolutionary Strategy | Benedikt Hartl et.al. | 2411.13420 | null |
2024-11-20 | XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation | Ziyi Wang et.al. | 2411.13243 | link |
2024-11-20 | A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM) | Antonino Visalli et.al. | 2411.13203 | link |
2024-11-20 | RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation | Christoph Reinders et.al. | 2411.13150 | link |
2024-11-20 | CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models | Naen Xu et.al. | 2411.13144 | null |
2024-11-20 | Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry | Yijie Zhang et.al. | 2411.13120 | null |
2024-11-19 | Breaking the wire: the impact of critical length on melting pathways in silver nanowires | Kannan M Ridings et.al. | 2411.12891 | null |
2024-11-19 | From Text to Pose to Image: Improving Diffusion Model Control and Quality | Clément Bonnett et.al. | 2411.12872 | link |
2024-11-19 | PoM: Efficient Image and Video Generation with the Polynomial Mixer | David Picard et.al. | 2411.12663 | link |
2024-11-19 | Improving Controllability and Editability for Pretrained Text-to-Music Generation Models | Yixiao Zhang et.al. | 2411.12641 | null |
2024-11-19 | Data Pruning in Generative Diffusion Models | Rania Briq et.al. | 2411.12523 | null |
2024-11-19 | Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models | Jun Xiao et.al. | 2411.12450 | null |
2024-11-19 | Combinational Backdoor Attack against Customized Text-to-Image Models | Wenbo Jiang et.al. | 2411.12389 | null |
2024-11-19 | Scalable and Effective Negative Sample Generation for Hyperedge Prediction | Shilin Qu et.al. | 2411.12354 | null |
2024-11-19 | Diffusion Product Quantization | Jie Shao et.al. | 2411.12306 | null |
2024-11-19 | SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model | Haowen Zheng et.al. | 2411.12290 | link |
2024-11-20 | HouseLLM: LLM-Assisted Two-Phase Text-to-Floorplan Generation | Ziyang Zong et.al. | 2411.12279 | null |
2024-11-19 | Wavespeed selection of travelling wave solutions of a two-component reaction-diffusion model of cell invasion | Yuhui Chen et.al. | 2411.12232 | null |
2024-11-18 | Aligning Few-Step Diffusion Models with Dense Reward Difference Learning | Ziyi Zhang et.al. | 2411.11727 | link |
2024-11-18 | Robust Reinforcement Learning under Diffusion Models for Data with Jumps | Chenyang Jiang et.al. | 2411.11697 | null |
2024-11-18 | Conceptwm: A Diffusion Model Watermark for Concept Protection | Liangqi Lei et.al. | 2411.11688 | null |
2024-11-19 | Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation | Rüveyda Yilmaz et.al. | 2411.11515 | null |
2024-11-18 | MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion | Dongseok Shim et.al. | 2411.11475 | null |
2024-11-18 | CLUE-MARK: Watermarking Diffusion Models using CLWE | Kareem Shehata et.al. | 2411.11434 | null |
2024-11-18 | Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge | Qinglong Cao et.al. | 2411.11343 | null |
2024-11-18 | Stochastic quantization and diffusion models | Kenji Fukushima et.al. | 2411.11297 | null |
2024-11-17 | Stealing Training Graphs from Graph Neural Networks | Minhua Lin et.al. | 2411.11197 | null |
2024-11-17 | DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images | Zhen Yuan et.al. | 2411.11190 | null |
2024-11-15 | M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation | Sucheng Ren et.al. | 2411.10433 | link |
2024-11-15 | Mitigating Parameter Degeneracy using Joint Conditional Diffusion Model for WECC Composite Load Model in Power Systems | Feiqin Zhu et.al. | 2411.10431 | null |
2024-11-15 | Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion | Haoran Wei et.al. | 2411.10369 | null |
2024-11-15 | Probabilistic Prior Driven Attention Mechanism Based on Diffusion Model for Imaging Through Atmospheric Turbulence | Guodong Sun et.al. | 2411.10321 | null |
2024-11-15 | Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting | Ziqi Xie et.al. | 2411.10309 | link |
2024-11-15 | The Unreasonable Effectiveness of Guidance for Diffusion Models | Tim Kaiser et.al. | 2411.10257 | null |
2024-11-15 | ColorEdit: Training-free Image-Guided Color editing with diffusion model | Xingxi Yin et.al. | 2411.10232 | null |
2024-11-15 | Evaluating Text-to-Image Diffusion Models for Texturing Synthetic Data | Thomas Lips et.al. | 2411.10164 | link |
2024-11-15 | Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning | Yushen Zuo et.al. | 2411.10130 | null |
2024-11-15 | SPLIT: SE(3)-diffusion via Local Geometry-based Score Prediction for 3D Scene-to-Pose-Set Matching Problems | Kanghyun Kim et.al. | 2411.10049 | null |
2024-11-14 | Golden Noise for Diffusion Models: A Learning Framework | Zikai Zhou et.al. | 2411.09502 | link |
2024-11-14 | DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing | Junjie Zhou et.al. | 2411.09451 | null |
2024-11-14 | Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models | Chutian Meng et.al. | 2411.09449 | null |
2024-11-14 | A survey of probabilistic generative frameworks for molecular simulations | Richard John et.al. | 2411.09388 | link |
2024-11-14 | EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models | Soowon Kim et.al. | 2411.09302 | null |
2024-11-14 | Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance | Md Fahim Anjum et.al. | 2411.09174 | null |
2024-11-14 | VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation | Youpeng Wen et.al. | 2411.09153 | null |
2024-11-14 | General linear threshold models with application to influence maximization | Alexander Kagan et.al. | 2411.09100 | link |
2024-11-13 | Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples | Noël Vouitsis et.al. | 2411.08954 | link |
2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879 | null |
2024-11-13 | Offline Adaptation of Quadruped Locomotion using Diffusion Models | Reece O'Mahoney et.al. | 2411.08832 | null |
2024-11-13 | Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models | Chengdong Dong et.al. | 2411.08642 | null |
2024-11-13 | V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion | Xun Huang et.al. | 2411.08402 | link |
2024-11-13 | Physics Informed Distillation for Diffusion Models | Joshua Tian Jin Tee et.al. | 2411.08378 | link |
2024-11-13 | Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study | Jinbo Wen et.al. | 2411.08341 | null |
2024-11-13 | Motion Control for Enhanced Complex Action Video Generation | Qiang Zhou et.al. | 2411.08328 | null |
2024-11-13 | DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach | Xin Tang et.al. | 2411.08299 | null |
2024-11-12 | Joint Diffusion models in Continual Learning | Paweł Skierś et.al. | 2411.08224 | null |
2024-11-12 | Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing | Zitao Shuai et.al. | 2411.08196 | null |
2024-11-12 | Scaling Properties of Diffusion Models for Perceptual Tasks | Rahul Ravishankar et.al. | 2411.08034 | null |
2024-11-12 | GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation | Yushi Lan et.al. | 2411.08033 | null |
2024-11-12 | Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules | Binxu Wang et.al. | 2411.07873 | null |
2024-11-12 | Novel View Synthesis with Pixel-Space Diffusion Models | Noam Elata et.al. | 2411.07765 | null |
2024-11-12 | Nanosecond nanothermometry in an electron microscope | Florian Castioni et.al. | 2411.07764 | null |
2024-11-12 | Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion | Kaiyu Song et.al. | 2411.07627 | null |
2024-11-12 | Unraveling the Connections between Flow Matching and Diffusion Probabilistic Models in Training-free Conditional Generation | Kaiyu Song et.al. | 2411.07625 | null |
2024-11-12 | Harmonizing Pixels and Melodies: Maestro-Guided Film Score Generation and Composition Style Transfer | F. Qi et.al. | 2411.07539 | null |
2024-11-12 | FM-TS: Flow Matching for Time Series Generation | Yang Hu et.al. | 2411.07506 | link |
2024-11-12 | Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors | Anisha Pal et.al. | 2411.07472 | link |
2024-11-11 | Score-based generative diffusion with "active" correlated noise sources | Alexandra Lamtyugina et.al. | 2411.07233 | null |
2024-11-11 | Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models | Yoad Tewel et.al. | 2411.07232 | null |
2024-11-11 | DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID | Nyle Siddiqui et.al. | 2411.07205 | link |
2024-11-11 | Crossover from inhomogeneous to homogeneous response of a resonantly driven hBN quantum emitter | Domitille Gérard et.al. | 2411.07202 | null |
2024-11-11 | OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision | Cong Wei et.al. | 2411.07199 | null |
2024-11-11 | More Expressive Attention with Negative Weights | Ang Lv et.al. | 2411.07176 | link |
2024-11-11 | Edify 3D: Scalable High-Quality 3D Asset Generation | NVIDIA et.al. | 2411.07135 | null |
2024-11-11 | Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models | NVIDIA et.al. | 2411.07126 | null |
2024-11-11 | White-Box Diffusion Transformer for single-cell RNA-seq generation | Zhuorui Cui et.al. | 2411.06785 | link |
2024-11-11 | DiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite Observations | Xuming He et.al. | 2411.06714 | null |
2024-11-08 | StdGEN: Semantic-Decomposed 3D Character Generation from Single Images | Yuze He et.al. | 2411.05738 | null |
2024-11-08 | Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models | Jia-Hong Huang et.al. | 2411.05706 | null |
2024-11-08 | Improving Molecular Graph Generation with Flow Matching and Optimal Transport | Xiaoyang Hou et.al. | 2411.05676 | null |
2024-11-08 | Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion | Nan Song et.al. | 2411.05544 | null |
2024-11-08 | Improving image synthesis with diffusion-negative sampling | Alakh Desai et.al. | 2411.05473 | null |
2024-11-08 | Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation | Peidong Liu et.al. | 2411.05472 | link |
2024-11-08 | RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction | Xingyu Ai et.al. | 2411.05354 | null |
2024-11-08 | Electro-diffusive modeling and the role of spine geometry on action potential propagation in neurons | Rahul Gulati et.al. | 2411.05329 | null |
2024-11-08 | Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet | Boxiao Yu et.al. | 2411.05302 | null |
2024-11-07 | Generalizable Single-Source Cross-modality Medical Image Segmentation via Invariant Causal Mechanisms | Boqi Chen et.al. | 2411.05223 | null |
2024-11-07 | SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing | Jun-Kun Chen et.al. | 2411.05006 | null |
2024-11-07 | Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models | Shuhong Zheng et.al. | 2411.05005 | null |
2024-11-07 | ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning | David Junhao Zhang et.al. | 2411.05003 | null |
2024-11-07 | SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation | Koichi Namekata et.al. | 2411.04989 | null |
2024-11-07 | Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification | Mischa Dombrowski et.al. | 2411.04956 | null |
2024-11-07 | DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | Wenqiang Sun et.al. | 2411.04928 | null |
2024-11-07 | Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion | Kaizhe Hu et.al. | 2411.04919 | link |
2024-11-07 | Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation | Benito Buchheim et.al. | 2411.04724 | null |
2024-11-07 | DanceFusion: A Spatio-Temporal Skeleton Diffusion Transformer for Audio-Driven Dance Motion Reconstruction | Li Zhao et.al. | 2411.04646 | null |
2024-11-06 | Community Forensics: Using Thousands of Generators to Train Fake Image Detectors | Jeongsoo Park et.al. | 2411.04125 | null |
2024-11-06 | Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging | Yuan Bi et.al. | 2411.04004 | null |
2024-11-06 | ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy | Chenrui Tie et.al. | 2411.03990 | null |
2024-11-06 | ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models | Ashutosh Srivastava et.al. | 2411.03982 | null |
2024-11-06 | ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization | Huayang Huang et.al. | 2411.03862 | link |
2024-11-06 | Sub-DM:Subspace Diffusion Model with Orthogonal Decomposition for MRI Reconstruction | Yu Guan et.al. | 2411.03758 | null |
2024-11-06 | Zero-shot Dynamic MRI Reconstruction with Global-to-local Diffusion Model | Yu Guan et.al. | 2411.03723 | link |
2024-11-06 | Investigating Conceptual Blending of a Diffusion Model for Improving Nonword-to-Image Generation | Chihaya Matsuhira et.al. | 2411.03595 | null |
2024-11-05 | Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data | Seunggeun Chi et.al. | 2411.03561 | null |
2024-11-05 | SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture | Andrew Heschl et.al. | 2411.03505 | link |
2024-11-05 | DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | Ying Zhou et.al. | 2411.03250 | null |
2024-11-05 | On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models | Tariq Berrada Ifriqi et.al. | 2411.03177 | null |
2024-11-05 | Unleashing the power of novel conditional generative approaches for new materials discovery | Lev Novitskiy et.al. | 2411.03156 | link |
2024-11-05 | Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising | Tao Huang et.al. | 2411.03053 | null |
2024-11-05 | GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details | Zhongjin Luo et.al. | 2411.03047 | null |
2024-11-05 | IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems | Heiko Oppel et.al. | 2411.02954 | null |
2024-11-05 | LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior | Xingjian Tang et.al. | 2411.02951 | null |
2024-11-05 | How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion | Giannis Daras et.al. | 2411.02780 | link |
2024-11-04 | Modelling Alzheimer's Protein Dynamics: A Data-Driven Integration of Stochastic Methods, Machine Learning and Connectome Insights | Alec MacIver et.al. | 2411.02644 | null |
2024-11-04 | Training-free Regional Prompting for Diffusion Transformers | Anthony Chen et.al. | 2411.02395 | link |
2024-11-04 | Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition | Xinkai Liu et.al. | 2411.02334 | null |
2024-11-04 | LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation | Mufei Li et.al. | 2411.02322 | link |
2024-11-04 | Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation | Xianghui Yang et.al. | 2411.02293 | null |
2024-11-04 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-11-04 | CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality | Yiqin Zhao et.al. | 2411.02179 | null |
2024-11-04 | Model Integrity when Unlearning with T2I Diffusion Models | Andrea Schioppa et.al. | 2411.02068 | null |
2024-11-04 | DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability | Bo Gao et.al. | 2411.01819 | null |
2024-11-04 | MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence | Fuming You et.al. | 2411.01805 | null |
2024-11-04 | A Regressor-Guided Graph Diffusion Model for Predicting Enzyme Mutations to Enhance Turnover Number | Xiaozhu Yu et.al. | 2411.01745 | link |
2024-10-31 | DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion | Weicai Ye et.al. | 2410.24203 | link |
2024-10-31 | Redefining in Dictionary: Towards a Enhanced Semantic Understanding of Creative Generation | Fu Feng et.al. | 2410.24160 | null |
2024-10-31 | Scaling Concept With Text-Guided Diffusion Models | Chao Huang et.al. | 2410.24151 | null |
2024-10-31 | Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure | Xiang Li et.al. | 2410.24060 | link |
2024-10-31 | TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation | Sunjae Yoon et.al. | 2410.24037 | null |
2024-10-31 | DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination | Jia Fu et.al. | 2410.24006 | link |
2024-11-01 | Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model | Wenjia Xie et.al. | 2410.23994 | null |
2024-10-31 | Stochastic Reconstruction of Gappy Lagrangian Turbulent Signals by Conditional Diffusion Models | Tianyi Li et.al. | 2410.23971 | null |
2024-10-31 | Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation | Yihang Zhou et.al. | 2410.23962 | null |
2024-10-31 | Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model | Hao Zhang et.al. | 2410.23905 | link |
2024-10-30 | ReferEverything: Towards Segmenting Everything We Can Speak of in Videos | Anurag Bagchi et.al. | 2410.23287 | null |
2024-10-30 | Provable acceleration for diffusion models under minimal assumptions | Gen Li et.al. | 2410.23285 | null |
2024-10-30 | RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi et.al. | 2410.23280 | null |
2024-10-30 | SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation | Yining Hong et.al. | 2410.23277 | null |
2024-10-30 | Multi-student Diffusion Distillation for Better One-step Generators | Yanke Song et.al. | 2410.23274 | null |
2024-10-30 | CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense | Mingkun Zhang et.al. | 2410.23091 | link |
2024-10-30 | Controlling Language and Diffusion Models by Transporting Activations | Pau Rodriguez et.al. | 2410.23054 | link |
2024-10-30 | Improving Musical Accompaniment Co-creation via Diffusion Transformers | Javier Nistal et.al. | 2410.23005 | null |
2024-10-30 | DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes | Jialiang Zhang et.al. | 2410.23004 | null |
2024-10-30 | LumiSculpt: A Consistency Lighting Control Network for Video Generation | Yuxin Zhang et.al. | 2410.22979 | null |
2024-10-29 | Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models | Raman Dutt et.al. | 2410.22149 | link |
2024-10-29 | Variational inference for pile-up removal at hadron colliders with diffusion models | Malte Algren et.al. | 2410.22074 | null |
2024-10-29 | Dual Conditional Diffusion Models for Sequential Recommendation | Hongtao Huang et.al. | 2410.21967 | null |
2024-10-29 | PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference | Kendong Liu et.al. | 2410.21966 | null |
2024-10-29 | CT to PET Translation: A Large-scale Dataset and Domain-Knowledge-Guided Diffusion Approach | Dac Thai Nguyen et.al. | 2410.21932 | link |
2024-10-29 | Guided Diffusion-based Counterfactual Augmentation for Robust Session-based Recommendation | Muskan Gupta et.al. | 2410.21892 | null |
2024-10-29 | Diffusion as Reasoning: Enhancing Object Goal Navigation with LLM-Biased Diffusion Model | Yiming Ji et.al. | 2410.21842 | null |
2024-10-29 | Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images | Suhyun Ahn et.al. | 2410.21826 | link |
2024-10-29 | HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion | Yu Zeng et.al. | 2410.21789 | null |
2024-10-29 | DiffusionVel: Multi-Information Integrated Velocity Inversion Using Generative Diffusion Models | Hao Zhang et.al. | 2410.21776 | null |
2024-10-28 | On Inductive Biases That Enable Generalization of Diffusion Transformers | Jie An et.al. | 2410.21273 | link |
2024-10-28 | One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation | Zhendong Wang et.al. | 2410.21257 | null |
2024-10-28 | On learning higher-order cumulants in diffusion models | Gert Aarts et.al. | 2410.21212 | null |
2024-10-28 | Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences | Zhihao Zhao et.al. | 2410.21130 | null |
2024-10-28 | Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models | Wenda Li et.al. | 2410.21088 | link |
2024-10-28 | Federated Time Series Generation on Feature and Temporally Misaligned Data | Chenrui Fan et.al. | 2410.21072 | null |
2024-10-28 | Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework | Vladimir Arkhipkin et.al. | 2410.21061 | link |
2024-10-28 | Beyond Autoregression: Fast LLMs via Self-Distillation Through Time | Justin Deschenaux et.al. | 2410.21035 | link |
2024-10-29 | EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior | Xin Xiang et.al. | 2410.20981 | null |
2024-10-28 | Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models! | Arash Marioriyad et.al. | 2410.20972 | null |
2024-10-25 | Adversarial Environment Design via Regret-Guided Diffusion Models | Hojun Chung et.al. | 2410.19715 | null |
2024-10-25 | DiffGS: Functional Gaussian Splatting Diffusion | Junsheng Zhou et.al. | 2410.19657 | null |
2024-10-25 | Diffusion models for lattice gauge field simulations | Qianteng Zhu et.al. | 2410.19602 | null |
2024-10-25 | Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series | Ilan Naiman et.al. | 2410.19538 | null |
2024-10-25 | Ensemble Data Assimilation for Particle-based Methods | Marius Duvillard et.al. | 2410.19525 | null |
2024-10-28 | NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction | Zixuan Gong et.al. | 2410.19452 | link |
2024-10-25 | Learned Reference-based Diffusion Sampling for multi-modal distributions | Maxence Noble et.al. | 2410.19449 | null |
2024-10-25 | Generative Diffusion Models for Sequential Recommendations | Sharare Zolghadr et.al. | 2410.19429 | null |
2024-10-25 | FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality | Zhengyao Lv et.al. | 2410.19355 | null |
2024-10-25 | High Resolution Seismic Waveform Generation using Denoising Diffusion | Andreas Bergmeister et.al. | 2410.19343 | null |
2024-10-24 | MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms | Ling-Hao Chen et.al. | 2410.18977 | null |
2024-10-24 | 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation | Hansheng Chen et.al. | 2410.18974 | link |
2024-10-24 | On the Crucial Role of Initialization for Matrix Factorization | Bingcong Li et.al. | 2410.18965 | null |
2024-10-24 | Stable Consistency Tuning: Understanding and Improving Consistency Models | Fu-Yun Wang et.al. | 2410.18958 | link |
2024-10-24 | Generation of synthetic financial time series by diffusion models | Tomonori Takahashi et.al. | 2410.18897 | null |
2024-10-24 | The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods | Linda Laurier et.al. | 2410.18866 | null |
2024-10-24 | Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation | Xiaoyu Zhang et.al. | 2410.18830 | null |
2024-10-24 | Fast constrained sampling in pre-trained diffusion models | Alexandros Graikos et.al. | 2410.18804 | null |
2024-10-24 | Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances | Shilin Lu et.al. | 2410.18775 | link |
2024-10-25 | Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing | Haonan Lin et.al. | 2410.18756 | null |
2024-10-23 | DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes | Hengwei Bian et.al. | 2410.18084 | null |
2024-10-23 | Prioritized Generative Replay | Renhao Wang et.al. | 2410.18082 | null |
2024-10-23 | Optical Generative Models | Shiqi Chen et.al. | 2410.17970 | null |
2024-10-23 | A Wavelet Diffusion GAN for Image Super-Resolution | Lorenzo Aloisi et.al. | 2410.17966 | null |
2024-10-23 | Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation | Wenfang Yao et.al. | 2410.17918 | link |
2024-10-23 | Scaling Diffusion Language Models via Adaptation from Autoregressive Models | Shansan Gong et.al. | 2410.17891 | link |
2024-10-23 | Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech | Danilo de Oliveira et.al. | 2410.17834 | null |
2024-10-23 | PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation | Feiyan Feng et.al. | 2410.17812 | null |
2024-10-23 | AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution | Yuanting Fan et.al. | 2410.17752 | null |
2024-10-23 | VISAGE: Video Synthesis using Action Graphs for Surgery | Yousef Yeganeh et.al. | 2410.17751 | null |
2024-10-22 | Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding | Yasha Ektefaie et.al. | 2410.17173 | link |
2024-10-22 | DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization | Haowei Zhu et.al. | 2410.16942 | null |
2024-10-22 | Hierarchical Clustering for Conditional Diffusion in Image Generation | Jorge da Silva Goncalves et.al. | 2410.16910 | link |
2024-10-22 | VistaDream: Sampling multiview consistent images for single-view scene reconstruction | Haiping Wang et.al. | 2410.16892 | null |
2024-10-22 | MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model | Meng Xu et.al. | 2410.16840 | null |
2024-10-22 | Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection | Laurent Colbois et.al. | 2410.16802 | link |
2024-10-22 | One-Step Diffusion Distillation through Score Implicit Matching | Weijian Luo et.al. | 2410.16794 | link |
2024-10-22 | LLM-Assisted Red Teaming of Diffusion Models through "Failures Are Fated, But Can Be Faded" | Som Sagar et.al. | 2410.16738 | null |
2024-10-22 | Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing | Runpu Wei et.al. | 2410.16732 | null |
2024-10-22 | DiffusionSeeder: Seeding Motion Optimization with Diffusion for Rapid Motion Planning | Huang Huang et.al. | 2410.16727 | null |
2024-10-21 | MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors | Honghua Chen et.al. | 2410.16272 | null |
2024-10-21 | A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data | Simon Deltadahl et.al. | 2410.16177 | null |
2024-10-22 | Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models | Giannis Daras et.al. | 2410.16152 | null |
2024-10-21 | SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation | Xinyi Zhou et.al. | 2410.16119 | null |
2024-10-21 | Continuous Speech Synthesis using per-token Latent Diffusion | Arnon Turetzky et.al. | 2410.16048 | null |
2024-10-22 | CamI2V: Camera-Controlled Image-to-Video Diffusion Model | Guangcong Zheng et.al. | 2410.15957 | link |
2024-10-21 | Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces | Jifeng Hu et.al. | 2410.15698 | null |
2024-10-21 | Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation | Anh Bui et.al. | 2410.15618 | link |
2024-10-20 | Data Augmentation via Diffusion Model to Enhance AI Fairness | Christina Hastings Blow et.al. | 2410.15470 | null |
2024-10-20 | MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications | Yongrui Yu et.al. | 2410.15432 | null |
2024-10-18 | Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior | Calvin-Khang Ta et.al. | 2410.14540 | null |
2024-10-18 | LEAD: Latent Realignment for Human Motion Diffusion | Nefeli Andreou et.al. | 2410.14508 | null |
2024-10-18 | Reinforcement Learning in Non-Markov Market-Making | Luca Lalor et.al. | 2410.14504 | null |
2024-10-18 | ANT: Adaptive Noise Schedule for Time Series Diffusion Models | Seunghan Lee et.al. | 2410.14488 | link |
2024-10-18 | DRL Optimization Trajectory Generation via Wireless Network Intent-Guided Diffusion Models for Optimizing Resource Allocation | Junjie Wu et.al. | 2410.14481 | null |
2024-10-18 | FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models | Rui Hu et.al. | 2410.14429 | null |
2024-10-18 | Dynamic Negative Guidance of Diffusion Models | Felix Koulischer et.al. | 2410.14398 | link |
2024-10-18 | HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation | Bo Cheng et.al. | 2410.14324 | link |
2024-10-18 | ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer | Yuhao Wan et.al. | 2410.14279 | null |
2024-10-18 | HYPNOS : Highly Precise Foreground-focused Diffusion Finetuning for Inanimate Objects | Oliverio Theophilus Nathanael et.al. | 2410.14265 | null |
2024-10-17 | Diffusing States and Matching Scores: A New Framework for Imitation Learning | Runzhe Wu et.al. | 2410.13855 | link |
2024-10-17 | Influence Functions for Scalable Data Attribution in Diffusion Models | Bruno Mlodozeniec et.al. | 2410.13850 | null |
2024-10-17 | Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning | Xiaodan Xing et.al. | 2410.13823 | link |
2024-10-17 | ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution | Junhao Gu et.al. | 2410.13807 | null |
2024-10-17 | Probing the Latent Hierarchical Structure of Data via Diffusion Models | Antonio Sclocchi et.al. | 2410.13770 | null |
2024-10-17 | Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers | Yuchen Liang et.al. | 2410.13746 | null |
2024-10-17 | Improved Convergence Rate for Diffusion Probabilistic Models | Gen Li et.al. | 2410.13738 | null |
2024-10-18 | DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation | Hanbo Cheng et.al. | 2410.13726 | link |
2024-10-18 | Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion | Yijun Liang et.al. | 2410.13674 | link |
2024-10-17 | Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design | Chenyu Wang et.al. | 2410.13643 | link |
2024-10-16 | Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts | Hongcheng Gao et.al. | 2410.12777 | link |
2024-10-16 | SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation | Jaehong Yoon et.al. | 2410.12761 | null |
2024-10-16 | Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization | Xingqi Wang et.al. | 2410.12700 | link |
2024-10-16 | AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing | DuoSheng Chen et.al. | 2410.12696 | null |
2024-10-16 | One Step Diffusion via Shortcut Models | Kevin Frans et.al. | 2410.12557 | link |
2024-10-16 | Disentangling data distribution for Federated Learning | Xinyuan Zhao et.al. | 2410.12530 | null |
2024-10-16 | Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing | Mingce Guo et.al. | 2410.12526 | null |
2024-10-16 | Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective | Yongxin Zhu et.al. | 2410.12490 | link |
2024-10-16 | DaDiff: Domain-aware Diffusion Model for Nighttime UAV Tracking | Haobo Zuo et.al. | 2410.12270 | link |
2024-10-16 | FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation | Huadai Liu et.al. | 2410.12266 | null |
2024-10-15 | High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion | Junhwa Hur et.al. | 2410.11838 | null |
2024-10-15 | On the Effectiveness of Dataset Alignment for Fake Image Detection | Anirudh Sundara Rajan et.al. | 2410.11835 | null |
2024-10-15 | Bayesian Experimental Design via Contrastive Diffusions | Jacopo Iollo et.al. | 2410.11826 | link |
2024-10-15 | Improving Long-Text Alignment for Text-to-Image Diffusion Models | Luping Liu et.al. | 2410.11817 | link |
2024-10-15 | SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing | Zhiyuan Zhang et.al. | 2410.11815 | null |
2024-10-16 | Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices | Zhiyuan Ma et.al. | 2410.11795 | null |
2024-10-15 | Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems | Jason Hu et.al. | 2410.11730 | null |
2024-10-15 | DeformPAM: Data-Efficient Learning for Long-horizon Deformable Object Manipulation via Preference-based Action Alignment | Wendi Chen et.al. | 2410.11584 | link |
2024-10-15 | Riemann-Liouville fractional Brownian motion with random Hurst exponent | Hubert Woszczek et.al. | 2410.11546 | null |
2024-10-15 | InvSeg: Test-Time Prompt Inversion for Semantic Segmentation | Jiayi Lin et.al. | 2410.11473 | null |
2024-10-14 | Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models | Jingzhi Bao et.al. | 2410.10821 | link |
2024-10-14 | Depth Any Video with Scalable Synthetic Data | Honghui Yang et.al. | 2410.10815 | link |
2024-10-14 | HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | Haotian Tang et.al. | 2410.10812 | link |
2024-10-14 | TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction | Qingze et.al. | 2410.10804 | link |
2024-10-14 | Boosting Camera Motion Control for Video Diffusion Transformers | Soon Yau Cheong et.al. | 2410.10802 | null |
2024-10-14 | Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations | Litu Rout et.al. | 2410.10792 | null |
2024-10-14 | ControlMM: Controllable Masked Motion Generation | Ekkasit Pinyoanuntapong et.al. | 2410.10780 | null |
2024-10-14 | Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation | Youwei Yu et.al. | 2410.10766 | null |
2024-10-14 | DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships | Zhang Wan et.al. | 2410.10751 | null |
2024-10-14 | FlexGen: Flexible Multi-View Generation from Text and Image Inputs | Xinli Xu et.al. | 2410.10745 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | Linear Convergence of Diffusion Models Under the Manifold Hypothesis | Peter Potaptchik et.al. | 2410.09046 | null |
2024-10-11 | Semantic Score Distillation Sampling for Compositional Text-to-3D Generation | Ling Yang et.al. | 2410.09009 | link |
2024-10-11 | WaveDiffusion: Exploring Full Waveform Inversion via Joint Diffusion in the Latent Space | Hanchen Wang et.al. | 2410.09002 | null |
2024-10-11 | DiffPO: A causal diffusion model for learning distributions of potential outcomes | Yuchen Ma et.al. | 2410.08924 | null |
2024-10-11 | Distillation of Discrete Diffusion through Dimensional Correlations | Satoshi Hayakawa et.al. | 2410.08709 | null |
2024-10-11 | Gait Sequence Upsampling using Diffusion Models for single LiDAR sensors | Jeongho Ahn et.al. | 2410.08680 | null |
2024-10-11 | E-Motion: Future Motion Simulation via Event Sequence Diffusion | Song Wu et.al. | 2410.08649 | link |
2024-10-11 | Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT Prompting | Purushothaman Natarajan et.al. | 2410.08612 | link |
2024-10-11 | Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models | Pascl Zwick et.al. | 2410.08551 | link |
2024-10-10 | DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Xiaoxiao He et.al. | 2410.08207 | null |
2024-10-10 | HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation | Shanyan Guan et.al. | 2410.08192 | null |
2024-10-10 | DifFRelight: Diffusion-Based Facial Performance Relighting | Mingming He et.al. | 2410.08188 | null |
2024-10-10 | ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion | Zitian Zhang et.al. | 2410.08168 | link |
2024-10-10 | DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation | Jiatao Gu et.al. | 2410.08159 | null |
2024-10-10 | Progressive Autoregressive Video Diffusion Models | Desai Xie et.al. | 2410.08151 | link |
2024-10-10 | Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction | Jarrid Rector-Brooks et.al. | 2410.08134 | null |
2024-10-10 | Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models | Vinith M. Suriyakumar et.al. | 2410.08074 | null |
2024-10-10 | LADIMO: Face Morph Generation through Biometric Template Inversion with Latent Diffusion | Marcel Grimmer et.al. | 2410.07988 | link |
2024-10-10 | AI Surrogate Model for Distributed Computing Workloads | David K. Park et.al. | 2410.07940 | null |
2024-10-09 | IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | Xinchen Zhang et.al. | 2410.07171 | link |
2024-10-09 | AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation | Yukang Cao et.al. | 2410.07164 | null |
2024-10-09 | InstructG2I: Synthesizing Images from Multimodal Attributed Graphs | Bowen Jin et.al. | 2410.07157 | link |
2024-10-09 | Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis | Bohan Zeng et.al. | 2410.07155 | link |
2024-10-09 | Diffusion Density Estimators | Akhil Premkumar et.al. | 2410.06986 | null |
2024-10-09 | Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control | Shimon Vainer et.al. | 2410.06985 | null |
2024-10-09 | Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think | Sihyun Yu et.al. | 2410.06940 | link |
2024-10-09 | Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis | Ahmed Abdullah et.al. | 2410.06841 | null |
2024-10-09 | Diffuse or Confuse: A Diffusion Deepfake Speech Dataset | Anton Firc et.al. | 2410.06796 | link |
2024-10-09 | Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography | Qianqian Xue et.al. | 2410.06757 | null |
2024-10-07 | DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control | Kaifeng Zhao et.al. | 2410.05260 | null |
2024-10-07 | GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting | Yukang Cao et.al. | 2410.05259 | null |
2024-10-07 | SePPO: Semi-Policy Preference Optimization for Diffusion Alignment | Daoan Zhang et.al. | 2410.05255 | link |
2024-10-07 | DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration | Yongtai Zhuo et.al. | 2410.05234 | link |
2024-10-07 | Presto! Distilling Steps and Layers for Accelerating Music Generation | Zachary Novack et.al. | 2410.05167 | null |
2024-10-08 | A Simulation-Free Deep Learning Approach to Stochastic Optimal Control | Mengjian Hua et.al. | 2410.05163 | null |
2024-10-07 | Leveraging Multimodal Diffusion Models to Accelerate Imaging with Side Information | Timofey Efimov et.al. | 2410.05143 | null |
2024-10-07 | Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning | Ayano Hiranaka et.al. | 2410.05116 | null |
2024-10-07 | DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects | Nidhi Mathihalli et.al. | 2410.05097 | link |
2024-10-07 | A nodally bound-preserving discontinuous Galerkin method for the drift-diffusion equation | Gabriel R. Barrenechea et.al. | 2410.05040 | null |
2024-10-04 | Estimating Body and Hand Motion in an Ego-sensed World | Brent Yi et.al. | 2410.03665 | null |
2024-10-04 | Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models | Chumeng Liang et.al. | 2410.03640 | link |
2024-10-04 | How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework | Yinuo Ren et.al. | 2410.03601 | null |
2024-10-04 | Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features | Benyuan Meng et.al. | 2410.03558 | link |
2024-10-04 | Diffusion State-Guided Projected Gradient for Inverse Problems | Rayhan Zirvi et.al. | 2410.03463 | null |
2024-10-04 | Generative Semantic Communication for Text-to-Speech Synthesis | Jiahao Zheng et.al. | 2410.03459 | null |
2024-10-04 | Dynamic Diffusion Transformer | Wangbo Zhao et.al. | 2410.03456 | link |
2024-10-04 | CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control | Guy Tevet et.al. | 2410.03441 | link |
2024-10-04 | The scaling behaviour of localised and extended states in one-dimensional tight-binding models with disorder | Luca Schaefer et.al. | 2410.03405 | null |
2024-10-04 | Latent Abstractions in Generative Diffusion Models | Giulio Franzese et.al. | 2410.03368 | null |
2024-10-03 | Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models | Zhengfeng Lai et.al. | 2410.02740 | null |
2024-10-03 | SteerDiff: Steering towards Safe Text-to-Image Diffusion Models | Hongxiang Zhang et.al. | 2410.02710 | null |
2024-10-03 | ControlAR: Controllable Image Generation with Autoregressive Models | Zongming Li et.al. | 2410.02705 | link |
2024-10-03 | GUD: Generation with Unified Diffusion | Mathis Gerdes et.al. | 2410.02667 | null |
2024-10-03 | Efficient calibration of the shifted square-root diffusion model to credit default swap spreads using asymptotic approximations | Ankush Agarwal et.al. | 2410.02645 | null |
2024-10-04 | Diffusion Models are Evolutionary Algorithms | Yanbo Zhang et.al. | 2410.02543 | link |
2024-10-03 | Lightweight Diffusion Models for Resource-Constrained Semantic Communication | Giovanni Pignata et.al. | 2410.02491 | link |
2024-10-03 | Towards a Theoretical Understanding of Memorization in Diffusion Models | Yunhao Chen et.al. | 2410.02467 | null |
2024-10-03 | Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models | Seyedmorteza Sadat et.al. | 2410.02416 | null |
2024-10-03 | Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks | Zeyu Feng et.al. | 2410.02389 | null |
2024-10-02 | FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images | Cheng Zhang et.al. | 2410.01801 | null |
2024-10-02 | Dynamical-generative downscaling of climate model ensembles | Ignacio Lopez-Gomez et.al. | 2410.01776 | null |
2024-10-02 | ImageFolder: Autoregressive Image Generation with Folded Tokens | Xiang Li et.al. | 2410.01756 | link |
2024-10-02 | VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models | Kailai Feng et.al. | 2410.01738 | link |
2024-10-02 | HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration | Yushi Huang et.al. | 2410.01723 | null |
2024-10-02 | KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models | Pouyan Navard et.al. | 2410.01595 | link |
2024-10-02 | MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation | Mingzhen Sun et.al. | 2410.01594 | link |
2024-10-02 | HRTF Estimation using a Score-based Prior | Etienne Thuillier et.al. | 2410.01562 | null |
2024-10-02 | Edge-preserving noise for diffusion models | Jente Vandersanden et.al. | 2410.01540 | null |
2024-10-02 | Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models | Ching-Chia Kao et.al. | 2410.01438 | null |
2024-09-30 | COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models | Divyanshu Daiya et.al. | 2409.20502 | null |
2024-09-30 | FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing | Lingling Cai et.al. | 2409.20500 | null |
2024-09-30 | Ensemble Kalman Diffusion Guidance: A Derivative-free Method for Inverse Problems | Hongkai Zheng et.al. | 2409.20175 | null |
2024-09-30 | Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model | Fulong Ma et.al. | 2409.20164 | null |
2024-09-30 | Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation | Rong Tang et.al. | 2409.20124 | null |
2024-09-30 | Reaction-diffusion model for a population structured in phenotype and space I -- Criterion for persistence | Nathanaël Boutillon et.al. | 2409.20118 | null |
2024-09-30 | RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models | Jangyeong Kim et.al. | 2409.19989 | null |
2024-09-30 | Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function | Chenyi Zhuang et.al. | 2409.19967 | link |
2024-10-02 | Image Copy Detection for Diffusion Models | Wenhao Wang et.al. | 2409.19952 | null |
2024-09-30 | Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner | Chenyou Fan et.al. | 2409.19949 | null |
2024-09-27 | Gen Li et.al. | 2409.18959 | null | |
2024-09-27 | ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions | Wenfeng Huang et.al. | 2409.18932 | null |
2024-09-27 | Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors | Yunlong Lin et.al. | 2409.18899 | null |
2024-09-27 | Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis | Songrui Wang et.al. | 2409.18897 | null |
2024-09-27 | Explainable Artifacts for Synthetic Western Blot Source Attribution | João Phillipe Cardenuto et.al. | 2409.18881 | link |
2024-09-27 | Emu3: Next-Token Prediction is All You Need | Xinlong Wang et.al. | 2409.18869 | null |
2024-09-27 | Convergence of Diffusion Models Under the Manifold Hypothesis in High-Dimensions | Iskander Azangulov et.al. | 2409.18804 | null |
2024-09-27 | Unsupervised Fingerphoto Presentation Attack Detection With Diffusion Models | Hailin Li et.al. | 2409.18636 | null |
2024-09-27 | Treating Brain-inspired Memories as Priors for Diffusion Model to Forecast Multivariate Time Series | Muyao Wang et.al. | 2409.18491 | null |
2024-09-27 | Gradient-free Decoder Inversion in Latent Diffusion Models | Seongmin Hong et.al. | 2409.18442 | null |
2024-09-26 | FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Wenliang Zhao et.al. | 2409.18128 | link |
2024-09-26 | Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Jing He et.al. | 2409.18124 | null |
2024-09-26 | EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation | Jiaxiang Tang et.al. | 2409.18114 | null |
2024-09-26 | StackGen: Generating Stable Structures from Silhouettes via Diffusion | Luzhe Sun et.al. | 2409.18098 | null |
2024-09-26 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-09-26 | Stable Video Portraits | Mirela Ostrek et.al. | 2409.18083 | null |
2024-09-26 | PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging | Xin Cai et.al. | 2409.17996 | null |
2024-09-26 | Joint Localization and Planning using Diffusion | L. Lao Beyer et.al. | 2409.17995 | null |
2024-09-26 | CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors | Linye Lyu et.al. | 2409.17963 | link |
2024-09-26 | Relativistic diffusion model for hadron production in p-Pb collisions at the LHC | Philipp Schulz et.al. | 2409.17960 | null |
2024-09-25 | DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion | Yukun Huang et.al. | 2409.17145 | link |
2024-09-25 | Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model | Xinfeng Wei et.al. | 2409.17104 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis | Fangshuo Zhou et.al. | 2409.17049 | link |
2024-09-25 | Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion | Vineet Punyamoorty et.al. | 2409.16950 | null |
2024-09-25 | DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Kyuheon Jung et.al. | 2409.16949 | link |
2024-09-25 | Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model | Hongliang Zhong et.al. | 2409.16938 | link |
2024-09-25 | A Versatile and Differentiable Hand-Object Interaction Representation | Théo Morales et.al. | 2409.16855 | null |
2024-09-25 | Analytical assessment of workers' safety concerning direct and indirect ways of getting infected by dangerous pathogen | Krzysztof Domino et.al. | 2409.16809 | null |
2024-09-25 | Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model | Shoma Iwai et.al. | 2409.16689 | null |
2024-09-24 | Generative Factor Chaining: Coordinated Manipulation with Diffusion-based Factor Graph | Utkarsh A. Mishra et.al. | 2409.16275 | null |
2024-09-24 | MaskBit: Embedding-free Image Generation via Bit Tokens | Mark Weber et.al. | 2409.16211 | link |
2024-09-24 | MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling | Yifang Men et.al. | 2409.16160 | null |
2024-09-24 | Spreading dynamics of a Fisher-KPP nonlocal diffusion model with a free boundary | Lei Li et.al. | 2409.16101 | null |
2024-09-24 | PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation | Mingyo Seo et.al. | 2409.16012 | null |
2024-09-24 | Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification | Leire Benito-Del-Valle et.al. | 2409.16002 | link |
2024-09-24 | ASD-Diffusion: Anomalous Sound Detection with Diffusion Models | Fengrun Zhang et.al. | 2409.15957 | null |
2024-09-24 | Multiscale method for image denoising using nonlinear diffusion process: local denoising and spectral multiscale basis functions | Maria Vasilyeva et.al. | 2409.15952 | null |
2024-09-24 | Identifying early tumour states in a Cahn-Hilliard-reaction-diffusion model | Abramo Agosti et.al. | 2409.15925 | null |
2024-09-24 | Diffusion Models for Intelligent Transportation Systems: A Survey | Mingxing Peng et.al. | 2409.15816 | null |
2024-09-18 | Massively Multi-Person 3D Human Motion Forecasting with Scene Context | Felix B Mueller et.al. | 2409.12189 | link |
2024-09-18 | MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion | Kalakonda Sai Shashank et.al. | 2409.12140 | null |
2024-09-18 | Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance | Jaehoon Joo et.al. | 2409.12099 | null |
2024-09-18 | Denoising diffusion models for high-resolution microscopy image restoration | Pamela Osuna-Vargas et.al. | 2409.12078 | null |
2024-09-18 | LEMON: Localized Editing with Mesh Optimization and Neural Shaders | Furkan Mert Algan et.al. | 2409.12024 | null |
2024-09-18 | Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models | Lorenzo Mandelli et.al. | 2409.11920 | null |
2024-09-18 | DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech | Xin Qi et.al. | 2409.11835 | null |
2024-09-18 | RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets | Jikai Ye et.al. | 2409.11831 | null |
2024-09-18 | InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models | Yan Zheng et.al. | 2409.11734 | null |
2024-09-18 | GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation | Shuowen Liang et.al. | 2409.11689 | link |
2024-09-17 | Ultrasound Image Enhancement with the Variance of Diffusion Models | Yuxin Zhang et.al. | 2409.11380 | link |
2024-09-17 | OSV: One Step is Enough for High-Quality Image to Video Generation | Xiaofeng Mao et.al. | 2409.11367 | null |
2024-09-17 | Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Gonzalo Martin Garcia et.al. | 2409.11355 | link |
2024-09-17 | OmniGen: Unified Image Generation | Shitao Xiao et.al. | 2409.11340 | link |
2024-09-17 | fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction | Jianxiong Gao et.al. | 2409.11315 | null |
2024-09-17 | DroneDiffusion: Robust Quadrotor Dynamics Learning with Diffusion Models | Avirup Das et.al. | 2409.11292 | null |
2024-09-17 | Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models | Tianqi Chen et.al. | 2409.11219 | null |
2024-09-17 | High-Resolution Speech Restoration with Latent Diffusion Model | Tushar Dhyani et.al. | 2409.11145 | null |
2024-09-17 | In-situ measurements of light diffusion in an optically dense atomic ensemble | Antoine Glicenstein et.al. | 2409.11117 | null |
2024-09-17 | TacDiffusion: Force-domain Diffusion Policy for Precise Tactile Manipulation | Yansong Wu et.al. | 2409.11047 | null |
2024-09-16 | Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation | Noah Buchanan et.al. | 2409.10494 | null |
2024-09-16 | SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing | Qi Qian et.al. | 2409.10476 | null |
2024-09-16 | MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion | Lehong Wu et.al. | 2409.10473 | null |
2024-09-16 | Mamba-ST: State Space Model for Efficient Style Transfer | Filippo Botti et.al. | 2409.10385 | link |
2024-09-16 | Taming Diffusion Models for Image Restoration: A Review | Ziwei Luo et.al. | 2409.10353 | null |
2024-09-16 | Fairness, not Emotion, Drives Socioeconomic Decision Making | Rudra Mukhopadhyay et.al. | 2409.10322 | null |
2024-09-16 | DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis | Fa-Ting Hong et.al. | 2409.10281 | null |
2024-09-16 | RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models | Başak Melis Öcal et.al. | 2409.10180 | null |
2024-09-16 | PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion | Peng Li et.al. | 2409.10141 | null |
2024-09-16 | DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection | Kun Fang et.al. | 2409.10094 | null |
2024-09-13 | Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation | Qingwen Bu et.al. | 2409.09016 | link |
2024-09-13 | A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis | Yohan Poirier-Ginter et.al. | 2409.08947 | null |
2024-09-13 | Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation | Guojun Liang et.al. | 2409.08917 | link |
2024-09-13 | Gaussian is All You Need: A Unified Framework for Solving Inverse Problems via Diffusion Posterior Sampling | Nebiyou Yismaw et.al. | 2409.08906 | null |
2024-09-13 | Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control | Carles Domingo-Enrich et.al. | 2409.08861 | null |
2024-09-13 | InstantDrag: Improving Interactivity in Drag-based Image Editing | Joonghyuk Shin et.al. | 2409.08857 | null |
2024-09-13 | DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s) | Yun Su Jeong et.al. | 2409.08850 | null |
2024-09-13 | DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset | Jiawei Du et.al. | 2409.08731 | link |
2024-09-13 | STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment | Yong Ren et.al. | 2409.08601 | null |
2024-09-13 | LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling | Yubo Huang et.al. | 2409.08583 | null |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2024-09-12 | DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer | Runjia Li et.al. | 2409.08271 | null |
2024-09-12 | Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation | Samanta Rodriguez et.al. | 2409.08269 | null |
2024-09-12 | Improving Text-guided Object Inpainting with Semantic Pre-inpainting | Yifu Chen et.al. | 2409.08260 | link |
2024-09-12 | Improving Virtual Try-On with Garment-focused Diffusion Models | Siqi Wan et.al. | 2409.08258 | link |
2024-09-12 | LoRID: Low-Rank Iterative Diffusion for Adversarial Purification | Geigh Zollicoffer et.al. | 2409.08255 | null |
2024-09-12 | Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding | Hongyu Li et.al. | 2409.08251 | null |
2024-09-12 | IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation | Yinwei Wu et.al. | 2409.08240 | null |
2024-09-12 | LT3SD: Latent Trees for 3D Scene Diffusion | Quan Meng et.al. | 2409.08215 | null |
2024-09-12 | VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis | Hao Chen et.al. | 2409.08207 | null |
2024-09-11 | DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Haibo Yang et.al. | 2409.07454 | null |
2024-09-11 | Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models | Haibo Yang et.al. | 2409.07452 | link |
2024-09-11 | FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process | Yang Luo et.al. | 2409.07451 | null |
2024-09-11 | Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging | Yunzhen Wang et.al. | 2409.07417 | null |
2024-09-11 | Training-Free Guidance for Discrete Diffusion Models for Molecular Generation | Thomas J. Kerby et.al. | 2409.07359 | null |
2024-09-11 | Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching | Eugenio Chisari et.al. | 2409.07343 | null |
2024-09-11 | Efficient and Unbiased Sampling of Boltzmann Distributions via Consistency Models | Fengzhe Zhang et.al. | 2409.07323 | null |
2024-09-11 | Exploring User-level Gradient Inversion with a Diffusion Prior | Zhuohang Li et.al. | 2409.07291 | null |
2024-09-11 | CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals | Weixiang Gao et.al. | 2409.07271 | link |
2024-09-11 | Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models | Sanoojan Baliah et.al. | 2409.07269 | link |
2024-09-10 | SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation | Teng Hu et.al. | 2409.06633 | null |
2024-09-10 | Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models | Xin Jing et.al. | 2409.06451 | null |
2024-09-10 | Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition | Junzheng Zhang et.al. | 2409.06371 | null |
2024-09-10 | What happens to diffusion model likelihood when your model is conditional? | Mattias Cross et.al. | 2409.06364 | null |
2024-09-10 | DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement | Jia-Wei Liao et.al. | 2409.06355 | null |
2024-09-10 | Multi-Source Music Generation with Latent Diffusion | Zhongweiyang Xu et.al. | 2409.06190 | link |
2024-09-11 | MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control | Yining Yao et.al. | 2409.06189 | null |
2024-09-10 | EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation | Nischal Khanal et.al. | 2409.06183 | link |
2024-09-09 | Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer | Michele Mancusi et.al. | 2409.06096 | null |
2024-09-09 | SVS-GAN: Leveraging GANs for Semantic Video Synthesis | Khaled M. Seyam et.al. | 2409.06074 | null |
2024-09-09 | Enhancing Preference-based Linear Bandits via Human Response Time | Shen Li et.al. | 2409.05798 | null |
2024-09-09 | Vector Quantized Diffusion Model Based Speech Bandwidth Extension | Yuan Fang et.al. | 2409.05784 | null |
2024-09-09 | AS-Speech: Adaptive Style For Speech Synthesis | Zhipeng Li et.al. | 2409.05730 | null |
2024-09-09 | pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning | Jiahao Lai et.al. | 2409.05701 | null |
2024-09-09 | Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models | Aakash Sen Sharma et.al. | 2409.05668 | null |
2024-09-09 | Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Zhao Shan et.al. | 2409.05622 | null |
2024-09-09 | CipherDM: Secure Three-Party Inference for Diffusion Model Sampling | Xin Zhao et.al. | 2409.05414 | null |
2024-09-09 | Sequential Posterior Sampling with Diffusion Models | Tristan S. W. Stevens et.al. | 2409.05399 | null |
2024-09-09 | TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors | Yichuan Mo et.al. | 2409.05294 | link |
2024-09-08 | Nuclear transparencies with a two step process of the |
Tae Keun Choi et.al. | 2409.05129 | null |
2024-09-06 | VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Yecheng Wu et.al. | 2409.04429 | link |
2024-09-06 | Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques | Davide Clode da Silva et.al. | 2409.04424 | null |
2024-09-06 | How Fair is Your Diffusion Recommender Model? | Daniele Malitesta et.al. | 2409.04339 | null |
2024-09-06 | Random effects estimation in a fractional diffusion model based on continuous observations | Nesrine Chebli et.al. | 2409.04331 | null |
2024-09-06 | Breaking the Brownian Barrier: Models and Manifestations of Molecular Diffusion in Complex Fluids | Harish Srinivasan et.al. | 2409.04199 | null |
2024-09-06 | GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers | Lorenza Prospero et.al. | 2409.04196 | null |
2024-09-06 | D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection | Kentaro Hirahara et.al. | 2409.04060 | null |
2024-09-06 | One-Shot Diffusion Mimicker for Handwritten Text Generation | Gang Dai et.al. | 2409.04004 | link |
2024-09-06 | DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes | Jianbiao Mei et.al. | 2409.04003 | link |
2024-09-05 | Data-Efficient Generation for Dataset Distillation | Zhe Li et.al. | 2409.03929 | null |
2024-09-05 | Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Yunze Man et.al. | 2409.03757 | link |
2024-09-05 | ArtiFade: Learning to Generate High-quality Subject from Blemished Images | Shuya Yang et.al. | 2409.03745 | null |
2024-09-05 | RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images | Benzhi Wang et.al. | 2409.03644 | link |
2024-09-05 | DiffEVC: Any-to-Any Emotion Voice Conversion with Expressive Guidance | Hsing-Hang Chou et.al. | 2409.03636 | null |
2024-09-05 | TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces | Bernardo Biesseck et.al. | 2409.03600 | link |
2024-09-05 | DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture | Qianlong Xiang et.al. | 2409.03550 | null |
2024-09-05 | Blended Latent Diffusion under Attention Control for Real-World Video Editing | Deyin Liu et.al. | 2409.03514 | null |
2024-09-05 | Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration | Pei Wang et.al. | 2409.03455 | null |
2024-09-05 | Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning | Huaxi Huang et.al. | 2409.03326 | null |
2024-09-05 | SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model | Weipeng Tan et.al. | 2409.03270 | null |
2024-09-04 | HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Xinyu Liu et.al. | 2409.02919 | link |
2024-09-04 | Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling | Kaiwen Zheng et.al. | 2409.02908 | null |
2024-09-04 | Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu et.al. | 2409.02851 | link |
2024-09-04 | Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model | Tornike Karchkhadze et.al. | 2409.02845 | null |
2024-09-04 | Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects | Kyungmin Jo et.al. | 2409.02653 | null |
2024-09-04 | MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos | Junyi Ma et.al. | 2409.02638 | null |
2024-09-04 | Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency | Jianwen Jiang et.al. | 2409.02634 | null |
2024-09-04 | Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models | Pujing Yang et.al. | 2409.02597 | null |
2024-09-04 | Solving Video Inverse Problems Using Image Diffusion Models | Taesung Kwon et.al. | 2409.02574 | null |
2024-09-04 | StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models | Wen Li et.al. | 2409.02543 | link |
2024-08-30 | Subspace Diffusion Posterior Sampling for Travel-Time Tomography | Xiang Cao et.al. | 2408.17333 | null |
2024-09-02 | RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance | Avideep Mukherjee et.al. | 2408.17095 | null |
2024-09-02 | Instant Adversarial Purification with Adversarial Consistency Distillation | Chun Tong Lei et.al. | 2408.17064 | null |
2024-08-30 | Text-to-Image Generation Via Energy-Based CLIP | Roy Ganz et.al. | 2408.17046 | null |
2024-08-30 | Contrastive Learning with Synthetic Positives | Dewen Zeng et.al. | 2408.16965 | link |
2024-09-02 | Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis | Theodoros Kouzelis et.al. | 2408.16845 | null |
2024-08-29 | ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model | Fangfu Liu et.al. | 2408.16767 | null |
2024-09-04 | CSGO: Content-Style Composition in Text-to-Image Generation | Peng Xing et.al. | 2408.16766 | null |
2024-08-29 | DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Yongjie Fu et.al. | 2408.16647 | null |
2024-09-02 | RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model | Zhuan Shi et.al. | 2408.16634 | null |
2024-08-29 | A Score-based Generative Solver for PDE-constrained Inverse Problems with Complex Priors | Yankun Hong et.al. | 2408.16626 | null |
2024-08-29 | GRPose: Learning Graph Relations for Human Image Generation with Pose Priors | Xiangchen Yin et.al. | 2408.16540 | link |
2024-08-29 | Spiking Diffusion Models | Jiahang Cao et.al. | 2408.16467 | link |
2024-08-29 | What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer | Chaeyeon Chung et.al. | 2408.16450 | link |
2024-08-29 | COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation | Jiefeng Li et.al. | 2408.16426 | null |
2024-08-29 | Self-Improving Diffusion Models with Synthetic Data | Sina Alemohammad et.al. | 2408.16333 | null |
2024-08-28 | TEDRA: Text-based Editing of Dynamic and Photoreal Actors | Basavaraj Sunagad et.al. | 2408.15995 | null |
2024-08-28 | Distribution Backtracking Builds A Faster Convergence Trajectory for One-step Diffusion Distillation | Shengyuan Zhang et.al. | 2408.15991 | link |
2024-08-28 | Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones | Carlos Plou et.al. | 2408.15899 | null |
2024-08-28 | Airfoil Diffusion: Denoising Diffusion Model For Conditional Airfoil Generation | Reid Graves et.al. | 2408.15898 | link |
2024-08-28 | Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data | Ayodeji Ijishakin et.al. | 2408.15890 | null |
2024-08-28 | GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model | Yongjie Fu et.al. | 2408.15868 | null |
2024-08-28 | Defending Text-to-image Diffusion Models: Surprising Efficacy of Textual Perturbations Against Backdoor Attacks | Oscar Chew et.al. | 2408.15721 | null |
2024-08-28 | Synthetic Forehead-creases Biometric Generation for Reliable User Verification | Abhishek Tandon et.al. | 2408.15693 | link |
2024-08-28 | Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas | Fabio Quattrini et.al. | 2408.15660 | link |
2024-08-28 | Grand canonical generative diffusion model for crystalline phases and grain boundaries | Bo Lei et.al. | 2408.15601 | null |
2024-08-27 | GenRec: Unifying Video Generation and Recognition with Diffusion Models | Zejia Weng et.al. | 2408.15241 | link |
2024-08-27 | Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation | Xiaojuan Wang et.al. | 2408.15239 | null |
2024-08-27 | Simulation of Stochastic Discrete Dislocation Dynamics in Ductile Vs Brittle Materials | Santosh Chhetri et.al. | 2408.15157 | null |
2024-08-27 | DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays | Yiran Sun et.al. | 2408.15118 | link |
2024-08-27 | Constrained Diffusion Models via Dual Training | Shervin Khalafi et.al. | 2408.15094 | null |
2024-08-27 | LN-Gen: Rectal Lymph Nodes Generation via Anatomical Features | Weidong Guo et.al. | 2408.14977 | null |
2024-08-27 | MegActor- |
Shurong Yang et.al. | 2408.14975 | null |
2024-08-27 | MeshUp: Multi-Target Mesh Deformation via Blended Score Distillation | Hyunwoo Kim et.al. | 2408.14899 | null |
2024-08-27 | DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose | Yusuke Yoshiyasu et.al. | 2408.14860 | null |
2024-08-27 | Diffusion-Occ: 3D Point Cloud Completion via Occupancy Diffusion | Guoqing Zhang et.al. | 2408.14846 | null |
2024-08-27 | Foundation Models for Music: A Survey | Yinghao Ma et.al. | 2408.14340 | link |
2024-08-26 | TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation | Anh-Dzung Doan et.al. | 2408.14227 | link |
2024-08-26 | MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement | Xu He et.al. | 2408.14211 | null |
2024-08-27 | SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Trung Dao et.al. | 2408.14176 | link |
2024-08-26 | Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models | Chaohua Shi et.al. | 2408.14135 | null |
2024-08-26 | SurGen: Text-Guided Diffusion Model for Surgical Video Generation | Joseph Cho et.al. | 2408.14028 | null |
2024-08-26 | Pixel-Aligned Multi-View Generation with Depth Guided Decoder | Zhenggang Tang et.al. | 2408.14016 | null |
2024-08-25 | SimpleSpeech 2: Towards Simple and Efficient Text-to-Speech with Flow-based Scalar Latent Transformer Diffusion Models | Dongchao Yang et.al. | 2408.13893 | null |
2024-08-25 | Particle-Filtering-based Latent Diffusion for Inverse Problems | Amir Nazemi et.al. | 2408.13868 | null |
2024-08-25 | Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching | Minghao Liu et.al. | 2408.13858 | null |
2024-08-23 | How Diffusion Models Learn to Factorize and Compose | Qiyao Liang et.al. | 2408.13256 | null |
2024-08-23 | CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities | Tao Wu et.al. | 2408.13239 | link |
2024-08-23 | Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning | Jihwan Oh et.al. | 2408.13092 | null |
2024-08-23 | General Intelligent Imaging and Uncertainty Quantification by Deterministic Diffusion Model | Weiru Fan et.al. | 2408.13061 | null |
2024-08-23 | Atlas Gaussians Diffusion for 3D Generation with Infinite Number of Points | Haitao Yang et.al. | 2408.13055 | null |
2024-08-23 | Adaptive complexity of log-concave sampling | Huanjian Zhou et.al. | 2408.13045 | null |
2024-08-23 | EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation | Cong Wang et.al. | 2408.13005 | null |
2024-08-23 | Controllable Financial Market Generation with Diffusion Guided Meta Agent | Yu-Hao Huang et.al. | 2408.12991 | null |
2024-08-23 | When Diffusion MRI Meets Diffusion Model: A Novel Deep Generative Model for Diffusion MRI Generation | Xi Zhu et.al. | 2408.12897 | null |
2024-08-22 | Generating Realistic X-ray Scattering Images Using Stable Diffusion and Human-in-the-loop Annotations | Zhuowen Zhao et.al. | 2408.12720 | link |
2024-08-22 | xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations | Can Qin et.al. | 2408.12590 | null |
2024-08-22 | ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation | Lujia Zhong et.al. | 2408.12561 | link |
2024-08-22 | Show-o: One Single Transformer to Unify Multimodal Understanding and Generation | Jinheng Xie et.al. | 2408.12528 | null |
2024-08-22 | FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing | Jue Wang et.al. | 2408.12429 | link |
2024-08-22 | 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment | Kaihui Cheng et.al. | 2408.12419 | null |
2024-08-22 | CODE: Confident Ordinary Differential Editing | Bastien van Delft et.al. | 2408.12418 | link |
2024-08-22 | Dynamic PDB: A New Dataset and a SE(3) Model Extension by Integrating Dynamic Behaviors and Physical Properties in Protein Structures | Ce Liu et.al. | 2408.12413 | null |
2024-08-22 | LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation | Shihao Chen et.al. | 2408.12354 | null |
2024-08-23 | GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections | Shiyue Zhang et.al. | 2408.12352 | null |
2024-08-22 | Variance reduction of diffusion model's gradients with Taylor approximation-based control variate | Paul Jeha et.al. | 2408.12270 | null |
2024-08-21 | Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models | Chun-Yen Shih et.al. | 2408.11810 | null |
2024-08-21 | Timeline and Boundary Guided Diffusion Network for Video Shadow Detection | Haipeng Zhou et.al. | 2408.11785 | link |
2024-08-21 | JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet | Yujia Gu et.al. | 2408.11744 | null |
2024-08-21 | Iterative Object Count Optimization for Text-to-image Diffusion Models | Oz Zafar et.al. | 2408.11721 | null |
2024-08-21 | FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting | Liyao Jiang et.al. | 2408.11706 | null |
2024-08-21 | Moderate deviation principles for a reaction diffusion model in non-equilibrium | Linjie Zhao et.al. | 2408.11633 | null |
2024-08-21 | Bayesian inversion for the identification of the doping profile in unipolar semiconductor devices | Leila Taghizadeh et.al. | 2408.11485 | null |
2024-08-21 | Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection | Jingwei Sun et.al. | 2408.11408 | null |
2024-08-21 | Video Diffusion Models are Strong Video Inpainter | Minhyeok Lee et.al. | 2408.11402 | null |
2024-08-21 | Generative AI based Secure Wireless Sensing for ISAC Networks | Jiacheng Wang et.al. | 2408.11398 | null |
2024-08-20 | Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model | Chunting Zhou et.al. | 2408.11039 | null |
2024-08-20 | MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning | Haoning Wu et.al. | 2408.11001 | link |
2024-08-20 | GreediRIS: Scalable Influence Maximization using Distributed Streaming Maximum Cover | Reet Barik et.al. | 2408.10982 | null |
2024-08-20 | Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling | Jaideep Pathak et.al. | 2408.10958 | null |
2024-08-20 | Large Point-to-Gaussian Model for Image-to-3D Generation | Longfei Lu et.al. | 2408.10935 | null |
2024-08-20 | A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse | Zhongliang Guo et.al. | 2408.10901 | null |
2024-08-20 | Hedging in Jump Diffusion Model with Transaction Costs | Hamidreza Maleki Almani et.al. | 2408.10785 | null |
2024-08-20 | Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation | Md Fahim Sikder et.al. | 2408.10755 | null |
2024-08-20 | Iterative Window Mean Filter: Thwarting Diffusion-based Adversarial Purification | Hanrui Wang et.al. | 2408.10673 | null |
2024-08-20 | TextMastero: Mastering High-Quality Scene Text Editing in Diverse Languages and Styles | Tong Wang et.al. | 2408.10623 | null |
2024-08-19 | MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model | Minghua Liu et.al. | 2408.10198 | null |
2024-08-19 | SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views | Chao Xu et.al. | 2408.10195 | null |
2024-08-19 | Multi-layer diffusion model of photovoltaic installations | Tomasz Weron et.al. | 2408.09904 | null |
2024-08-19 | Instruction-Based Molecular Graph Generation with Unified Text-Graph Diffusion Model | Yuran Xiang et.al. | 2408.09896 | link |
2024-08-19 | SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models | Danush Kumar Venkatesh et.al. | 2408.09822 | link |
2024-08-19 | Latent Diffusion for Guided Document Table Generation | Syed Jawwad Haider Hamdani et.al. | 2408.09800 | null |
2024-08-19 | Unsupervised Composable Representations for Audio | Giovanni Bindi et.al. | 2408.09792 | link |
2024-08-19 | Propagating the prior from shallow to deep with a pre-trained velocity-model Generative Transformer network | Randy Harsuko et.al. | 2408.09767 | null |
2024-08-19 | Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering | Ruofan Liang et.al. | 2408.09702 | null |
2024-08-19 | ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement | Eashan Adhikarla et.al. | 2408.09650 | link |
2024-08-16 | PFDiff: Training-free Acceleration of Diffusion Models through the Gradient Guidance of Past and Future | Guangyi Wang et.al. | 2408.08822 | null |
2024-08-16 | Comparative Analysis of Generative Models: Enhancing Image Synthesis with VAEs, GANs, and Stable Diffusion | Sanchayan Vivekananthan et.al. | 2408.08751 | null |
2024-08-16 | An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation | Peiming Guo et.al. | 2408.08650 | null |
2024-08-16 | Modeling the Neonatal Brain Development Using Implicit Neural Representations | Florentin Bieder et.al. | 2408.08647 | link |
2024-08-16 | Sampling effects on Lasso estimation of drift functions in high-dimensional diffusion processes | Chiara Amorino et.al. | 2408.08638 | null |
2024-08-16 | Generative Dataset Distillation Based on Diffusion Model | Duo Su et.al. | 2408.08610 | link |
2024-08-16 | RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction | Xiucheng Wang et.al. | 2408.08593 | link |
2024-08-16 | A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth | Yujia Gu et.al. | 2408.08561 | null |
2024-08-16 | Linear combinations of latents in diffusion models: interpolation and beyond | Erik Bodin et.al. | 2408.08558 | null |
2024-08-16 | Inverse design with conditional cascaded diffusion models | Milad Habibi et.al. | 2408.08526 | null |
2024-08-15 | Accelerated Image-Aware Generative Diffusion Modeling | Tanmay Asthana et.al. | 2408.08306 | null |
2024-08-15 | Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding | Xiner Li et.al. | 2408.08252 | link |
2024-08-15 | Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion | Adi Haviv et.al. | 2408.08184 | null |
2024-08-15 | Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation | Seon-Hoon Kim et.al. | 2408.07947 | link |
2024-08-14 | Moderator: Moderating Text-to-Image Diffusion Models through Fine-grained Context-based Policies | Peiran Wang et.al. | 2408.07728 | link |
2024-08-14 | Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding | Bing Hu et.al. | 2408.07636 | null |
2024-08-14 | Anisotropic Diffusion Model of Communication in 2D Biofilm | Yanahan Paramalingam et.al. | 2408.07626 | null |
2024-08-14 | DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model | Erez Yosef et.al. | 2408.07541 | null |
2024-08-14 | DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency | Xiaojing Zhong et.al. | 2408.07481 | null |
2024-08-14 | One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Xiao He et.al. | 2408.07476 | link |
2024-08-14 | Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models | Jean-Marie Lemercier et.al. | 2408.07472 | null |
2024-08-14 | KIND: Knowledge Integration and Diversion in Diffusion Models | Yucheng Xie et.al. | 2408.07337 | null |
2024-08-14 | GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models | Lei Kang et.al. | 2408.07259 | link |
2024-08-13 | Representation-space diffusion models for generating periodic materials | Anshuman Sinha et.al. | 2408.07213 | null |
2024-08-13 | SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis | Yuchen Mao et.al. | 2408.07196 | null |
2024-08-13 | Imagen 3 | Imagen-Team-Google et.al. | 2408.07009 | null |
2024-08-13 | Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models | Cheng Chen et.al. | 2408.06995 | null |
2024-08-13 | DCMSA: Multi-Head Self-Attention Mechanism Based on Deformable Convolution For Seismic Data Denoising | Wang Mingwei et.al. | 2408.06963 | null |
2024-08-13 | Diffusion Model for Slate Recommendation | Federico Tomasi et.al. | 2408.06883 | null |
2024-08-13 | DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion | Yujia Wu et.al. | 2408.06740 | null |
2024-08-13 | DiffSG: A Generative Solver for Network Optimization with Diffusion Model | Ruihuai Liang et.al. | 2408.06701 | link |
2024-08-13 | DC3DO: Diffusion Classifier for 3D Objects | Nursena Koprucu et.al. | 2408.06693 | link |
2024-08-13 | Leveraging Priors via Diffusion Bridge for Time Series Generation | Jinseong Park et.al. | 2408.06672 | null |
2024-08-13 | Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models | Chenqian Yan et.al. | 2408.06646 | null |
2024-08-13 | ViMo: Generating Motions from Casual Videos | Liangdong Qiu et.al. | 2408.06614 | null |
2024-08-12 | The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery | Chris Lu et.al. | 2408.06292 | link |
2024-08-12 | 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) | Jaydeep Rade et.al. | 2408.06244 | null |
2024-08-12 | Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance | Taewon Kang et.al. | 2408.06157 | null |
2024-08-12 | Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models | Ioannis Romanelis et.al. | 2408.06145 | link |
2024-08-12 | CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer | Zhuoyi Yang et.al. | 2408.06072 | link |
2024-08-12 | ControlNeXt: Powerful and Efficient Control for Image and Video Generation | Bohao Peng et.al. | 2408.06070 | link |
2024-08-12 | BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training | Xuanpu Zhang et.al. | 2408.06047 | link |
2024-08-12 | Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models | Haifan Gong et.al. | 2408.05985 | null |
2024-08-12 | UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization | Junjie He et.al. | 2408.05939 | null |
2024-08-12 | Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation | Utkarsh Nath et.al. | 2408.05938 | null |
2024-08-09 | Multi-Garment Customized Model Generation | Yichen Liu et.al. | 2408.05206 | null |
2024-08-09 | DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-09 | TEAdapter: Supply abundant guidance for controllable text-to-music generation | Jialing Zou et.al. | 2408.04865 | link |
2024-08-09 | Adversarially Robust Industrial Anomaly Detection Through Diffusion Model | Yuanpu Cao et.al. | 2408.04839 | null |
2024-08-09 | Next-Generation Wi-Fi Networks with Generative AI: Design and Insights | Jingyu Wang et.al. | 2408.04835 | null |
2024-08-08 | BRAT: Bonus oRthogonAl Token for Architecture Agnostic Textual Inversion | James Baker et.al. | 2408.04785 | link |
2024-08-08 | Zero-Shot Uncertainty Quantification using Diffusion Probabilistic Models | Dule Shu et.al. | 2408.04718 | null |
2024-08-08 | Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics | Ruining Li et.al. | 2408.04631 | null |
2024-08-08 | Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches | Yongzhi Xu et.al. | 2408.04567 | null |
2024-08-08 | Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations | Julen Urain et.al. | 2408.04380 | null |
2024-08-08 | InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting | Xin-Yi Yu et.al. | 2408.04249 | null |
2024-08-08 | LLDif: Diffusion Models for Low-light Emotion Recognition | Zhifeng Wang et.al. | 2408.04235 | null |
2024-08-08 | Connective Viewpoints of Signal-to-Noise Diffusion Models | Khanh Doan et.al. | 2408.04221 | null |
2024-08-08 | Diffusion Guided Language Modeling | Justin Lovelace et.al. | 2408.04220 | link |
2024-08-07 | Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model | Guoqing Zhu et.al. | 2408.03748 | link |
2024-08-07 | Unsupervised Detection of Fetal Brain Anomalies using Denoising Diffusion Models | Markus Ditlev Sjøgren Olsen et.al. | 2408.03654 | null |
2024-08-07 | TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization | Kien T. Pham et.al. | 2408.03637 | null |
2024-08-07 | Dirichlet forms of diffusion processes on Thoma simplex | Sergei Korotkikh et.al. | 2408.03553 | null |
2024-08-06 | Hybrid diffusion models: combining supervised and generative pretraining for label-efficient fine-tuning of segmentation models | Bruno Sauvalle et.al. | 2408.03433 | null |
2024-08-06 | Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey | Vu Tuan Truong et.al. | 2408.03400 | null |
2024-08-06 | Adversarial Domain Adaptation for Cross-user Activity Recognition Using Diffusion-based Noise-centred Learning | Xiaozhou Ye et.al. | 2408.03353 | link |
2024-08-06 | MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation | Xiaofeng Mao et.al. | 2408.03312 | null |
2024-08-06 | IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts | Ciara Rowles et.al. | 2408.03209 | null |
2024-08-06 | Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models | Sho Ozaki et.al. | 2408.03156 | null |
2024-08-06 | Training-Free Condition Video Diffusion Models for single frame Spatial-Semantic Echocardiogram Synthesis | Van Phi Nguyen et.al. | 2408.03035 | link |
2024-08-06 | Diffusion Model Meets Non-Exemplar Class-Incremental Learning and Beyond | Jichuan Zhang et.al. | 2408.02983 | null |
2024-08-06 | Data-Driven Stochastic Closure Modeling via Conditional Diffusion Model and Neural Operator | Xinghao Dong et.al. | 2408.02965 | null |
2024-08-06 | Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection | Sen Nie et.al. | 2408.02891 | null |
2024-08-05 | Back-Projection Diffusion: Solving the Wideband Inverse Scattering Problem with Diffusion Models | Borong Zhang et.al. | 2408.02866 | null |
2024-08-05 | Text Conditioned Symbolic Drumbeat Generation using Latent Diffusion Models | Pushkar Jajoria et.al. | 2408.02711 | null |
2024-08-05 | LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba | Yunxiang Fu et.al. | 2408.02615 | link |
2024-08-05 | Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models | Tongtong Feng et.al. | 2408.02408 | null |
2024-08-05 | A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models | Gen Li et.al. | 2408.02320 | null |
2024-08-05 | Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders | Muhammad Abdullah Jamal et.al. | 2408.02245 | null |
2024-08-04 | LDFaceNet: Latent Diffusion-based Network for High-Fidelity Deepfake Generation | Dwij Mehta et.al. | 2408.02078 | null |
2024-08-04 | Step Saver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation | Jean Yu et.al. | 2408.02054 | null |
2024-08-04 | Robustness of Watermarking on Text-to-Image Diffusion Models | Xiaodong Wu et.al. | 2408.02035 | null |
2024-08-04 | Faster Diffusion Action Segmentation | Shuaibing Wang et.al. | 2408.02024 | null |
2024-08-04 | AnomalySD: Few-Shot Multi-Class Anomaly Detection with Stable Diffusion Model | Zhenyu Yan et.al. | 2408.01960 | null |
2024-08-04 | Dataset Scale and Societal Consistency Mediate Facial Impression Bias in Vision-Language AI | Robert Wolfe et.al. | 2408.01959 | null |
2024-08-02 | Conditional LoRA Parameter Generation | Xiaolong Jin et.al. | 2408.01415 | null |
2024-08-02 | TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling | Dong Huo et.al. | 2408.01291 | null |
2024-08-02 | A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness | Lutao Jiang et.al. | 2408.01269 | null |
2024-08-02 | CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models | Kushal Kumar Jain et.al. | 2408.01233 | null |
2024-08-02 | EIUP: A Training-Free Approach to Erase Non-Compliant Concepts Conditioned on Implicit Unsafe Prompts | Die Chen et.al. | 2408.01014 | null |
2024-08-02 | FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation | Xiang Gao et.al. | 2408.00998 | link |
2024-08-05 | CIResDiff: A Clinically-Informed Residual Diffusion Model for Predicting Idiopathic Pulmonary Fibrosis Progression | Caiwen Jiang et.al. | 2408.00938 | null |
2024-08-01 | Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang et.al. | 2408.00766 | null |
2024-08-01 | Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention | Susung Hong et.al. | 2408.00760 | link |
2024-08-01 | TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models | Gilad Deutch et.al. | 2408.00735 | null |
2024-08-01 | MotionFix: Text-Driven 3D Human Motion Editing | Nikos Athanasiou et.al. | 2408.00712 | null |
2024-08-01 | Evaluation Metrics and Methods for Generative Models in the Wireless PHY Layer | Michael Baur et.al. | 2408.00634 | null |
2024-08-01 | Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model | Felipe Mahlow et.al. | 2408.00544 | null |
2024-08-01 | Towards Reliable Advertising Image Generation Using Human Feedback | Zhenbang Du et.al. | 2408.00418 | link |
2024-08-01 | Deepfake Media Forensics: State of the Art and Challenges Ahead | Irene Amerini et.al. | 2408.00388 | null |
2024-08-01 | On the Limitations and Prospects of Machine Unlearning for Generative AI | Shiji Zhou et.al. | 2408.00376 | null |
2024-08-01 | DiM-Gesture: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 framework | Fan Zhang et.al. | 2408.00370 | null |
2024-07-31 | Detecting, Explaining, and Mitigating Memorization in Diffusion Models | Yuxin Wen et.al. | 2407.21720 | link |
2024-07-31 | Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Zhenghao Zhang et.al. | 2407.21705 | link |
2024-07-31 | Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification | Xingchen Shi et.al. | 2407.21683 | null |
2024-07-31 | Explainable and Controllable Motion Curve Guided Cardiac Ultrasound Video Generation | Junxuan Yu et.al. | 2407.21490 | null |
2024-07-31 | Fine-gained Zero-shot Video Sampling | Dengsheng Chen et.al. | 2407.21475 | null |
2024-07-31 | Deformable 3D Shape Diffusion Model | Dengsheng Chen et.al. | 2407.21428 | null |
2024-07-31 | Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models | Jiang Hao et.al. | 2407.21316 | link |
2024-07-31 | State-observation augmented diffusion model for nonlinear assimilation | Zhuoyuan Li et.al. | 2407.21314 | link |
2024-07-31 | DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations | Dongwon Son et.al. | 2407.21267 | null |
2024-07-30 | Informed Correctors for Discrete Diffusion Models | Yixiu Zhao et.al. | 2407.21243 | null |
2024-07-30 | Matting by Generation | Zhixiang Wang et.al. | 2407.21017 | null |
2024-07-30 | Add-SD: Rational Generation without Manual Reference | Lingfeng Yang et.al. | 2407.21016 | link |
2024-07-30 | Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks | Yunfeng Diao et.al. | 2407.20836 | null |
2024-07-30 | Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning | Norman Di Palo et.al. | 2407.20798 | null |
2024-07-30 | SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models | Zheng Liu et.al. | 2407.20756 | link |
2024-07-30 | EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos | Aashish Rai et.al. | 2407.20592 | null |
2024-07-30 | DiffusionCounterfactuals: Inferring High-dimensional Counterfactuals with Guidance of Causal Representations | Jiageng Zhu et.al. | 2407.20553 | null |
2024-07-29 | Learning Feature-Preserving Portrait Editing from Generated Pairs | Bowei Chen et.al. | 2407.20455 | null |
2024-07-29 | Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities | Lorenzo Baraldi et.al. | 2407.20337 | link |
2024-07-29 | Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception | Konstantinos Tzevelekakis et.al. | 2407.20336 | null |
2024-07-29 | Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Ekaterina Iakovleva et.al. | 2407.20232 | null |
2024-07-29 | LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework | Zhenqi He et.al. | 2407.20172 | link |
2024-07-29 | Diffusion Feedback Helps CLIP See Better | Wenxuan Wang et.al. | 2407.20171 | link |
2024-07-29 | DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models | Jing Yang et.al. | 2407.20141 | null |
2024-07-29 | Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | Liyuan Mao et.al. | 2407.20109 | null |
2024-07-29 | Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations | Fangyijie Wang et.al. | 2407.20072 | link |
2024-07-29 | ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning | Delyan Boychev et.al. | 2407.20020 | link |
2024-07-29 | MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion | Chencan Fu et.al. | 2407.19976 | null |
2024-07-29 | FedDEO: Description-Enhanced One-Shot Federated Learning with Diffusion Models | Mingzhao Yang et.al. | 2407.19953 | null |
2024-07-29 | FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention | Yu Lu et.al. | 2407.19918 | null |
2024-07-26 | Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment | Yuze Zheng et.al. | 2407.18854 | null |
2024-07-26 | Revision of calcium and scandium abundances in Am stars based on NLTE calculations and comparison with diffusion stellar evolution models | L. I. Mashonkina et.al. | 2407.18736 | null |
2024-07-26 | Adversarial Robustification via Text-to-Image Diffusion Models | Daewon Choi et.al. | 2407.18658 | link |
2024-07-26 | How To Segment in 3D Using 2D Models: Automated 3D Segmentation of Prostate Cancer Metastatic Lesions on PET Volumes Using Multi-Angle Maximum Intensity Projections and Diffusion Models | Amirhosein Toosi et.al. | 2407.18555 | link |
2024-07-26 | Answerability Fields: Answerable Location Estimation via Diffusion Models | Daichi Azuma et.al. | 2407.18497 | null |
2024-07-26 | Diffusion-Driven Semantic Communication for Generative Models with Bandwidth Constraints | Lei Guo et.al. | 2407.18468 | null |
2024-07-26 | Lensless fiber endomicroscopic phase imaging with speckle-conditioned diffusion model | Zhaoqing Chen et.al. | 2407.18456 | null |
2024-07-25 | Diffusion-based subsurface multiphysics monitoring and forecasting | Xinquan Huang et.al. | 2407.18426 | null |
2024-07-25 | RegionDrag: Fast Region-Based Image Editing with Diffusion Models | Jingyi Lu et.al. | 2407.18247 | null |
2024-07-25 | VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads | Orest Kupyn et.al. | 2407.18245 | link |
2024-07-25 | Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images | Roberto Di Via et.al. | 2407.18125 | null |
2024-07-25 | Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions | Jan Nikolas Morshuis et.al. | 2407.18026 | link |
2024-07-25 | Self-Supervision Improves Diffusion Models for Tabular Data Imputation | Yixin Liu et.al. | 2407.18013 | link |
2024-07-25 | Lightweight Language-driven Grasp Detection using Conditional Consistency Model | Nghia Nguyen et.al. | 2407.17967 | null |
2024-07-25 | ReCorD: Reasoning and Correcting Diffusion for HOI Generation | Jian-Yu Jiang-Lin et.al. | 2407.17911 | link |
2024-07-25 | Amortized Posterior Sampling with Diffusion Prior Distillation | Abbas Mammadov et.al. | 2407.17907 | null |
2024-07-25 | Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion | Xiaodan Xing et.al. | 2407.17882 | null |
2024-07-25 | DragText: Rethinking Text Embedding in Point-based Image Editing | Gayoon Choi et.al. | 2407.17843 | link |
2024-07-24 | SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency | Yiming Xie et.al. | 2407.17470 | null |
2024-07-24 | CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction | Paul Goyes-Peñafiel et.al. | 2407.17402 | link |
2024-07-25 | LPGen: Enhancing High-Fidelity Landscape Painting Generation through Diffusion Model | Wanggong Yang et.al. | 2407.17229 | null |
2024-07-24 | Unpaired Photo-realistic Image Deraining with Energy-informed Diffusion Model | Yuanbo Wen et.al. | 2407.17193 | null |
2024-07-24 | MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models | Chunsan Hong et.al. | 2407.17095 | link |
2024-07-24 | Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference | Jian Xu et.al. | 2407.17033 | null |
2024-07-24 | Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model | Lirui Zhao et.al. | 2407.16982 | link |
2024-07-24 | SAR to Optical Image Translation with Color Supervised Diffusion Model | Xinyu Bai et.al. | 2407.16921 | null |
2024-07-23 | VisMin: Visual Minimal-Change Understanding | Rabiul Awal et.al. | 2407.16772 | null |
2024-07-23 | Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Fabio Tosi et.al. | 2407.16698 | link |
2024-07-23 | From Imitation to Refinement -- Residual RL for Precise Visual Assembly | Lars Ankile et.al. | 2407.16677 | null |
2024-07-23 | MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence | Canyu Zhao et.al. | 2407.16655 | null |
2024-07-23 | DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models | Zhenyu Xie et.al. | 2407.16511 | null |
2024-07-23 | MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Youngmin Oh et.al. | 2407.16448 | link |
2024-07-23 | On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models | Deniz Daum et.al. | 2407.16405 | link |
2024-07-23 | DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors | Zizheng Yan et.al. | 2407.16260 | null |
2024-07-23 | OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person | Ke Sun et.al. | 2407.16224 | null |
2024-07-23 | Diff-Shadow: Global-guided Diffusion Model for Shadow Removal | Jinting Luo et.al. | 2407.16214 | link |
2024-07-23 | CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation | Hajin Shim et.al. | 2407.16193 | null |
2024-07-22 | Artist: Aesthetically Controllable Text-Driven Stylization without Training | Ruixiang Jiang et.al. | 2407.15842 | link |
2024-07-22 | Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget | Vikash Sehwag et.al. | 2407.15811 | link |
2024-07-22 | Diffusion Model Based Resource Allocation Strategy in Ultra-Reliable Wireless Networked Control Systems | Amirhassan Babazadeh Darabi et.al. | 2407.15784 | null |
2024-07-22 | A Hamilton-Jacobi approach to road-field reaction-diffusion models | Christopher Henderson et.al. | 2407.15760 | null |
2024-07-22 | Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond | Silvio Galesso et.al. | 2407.15739 | link |
2024-07-22 | Estimating Probability Densities with Transformer and Denoising Diffusion | Henry W. Leung et.al. | 2407.15703 | link |
2024-07-22 | Voltage mapping in subcellular nanodomains using electro-diffusion modeling | Frédéric Paquin-Lefebvre et.al. | 2407.15697 | null |
2024-07-23 | Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models | Xin Ma et.al. | 2407.15642 | link |
2024-07-23 | A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control | Karim Kadry et.al. | 2407.15631 | null |
2024-07-22 | StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation | Nauman Riaz et.al. | 2407.15608 | null |
2024-07-19 | DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | Sarah Jabbour et.al. | 2407.14509 | null |
2024-07-19 | M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models | Seunggeun Chi et.al. | 2407.14502 | null |
2024-07-19 | Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model | Seonghui Min et.al. | 2407.14434 | null |
2024-07-19 | Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models | Hyun-Jic Oh et.al. | 2407.14426 | null |
2024-07-19 | As Generative Models Improve, People Adapt Their Prompts | Eaman Jahani et.al. | 2407.14333 | null |
2024-07-19 | Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model | Kun Zhao et.al. | 2407.14326 | null |
2024-07-19 | Time-dependent condensate formation in ultracold atoms with energy-dependent transport coefficients | M. Larsson et.al. | 2407.14307 | null |
2024-07-19 | How to Blend Concepts in Diffusion Models | Giorgio Longari et.al. | 2407.14280 | link |
2024-07-19 | The time-space evolution of economic activities: theory and estimation | Davide Fiaschi et.al. | 2407.14267 | null |
2024-07-19 | Unlearning Concepts from Text-to-Video Diffusion Models | Shiqi Liu et.al. | 2407.14209 | null |
2024-07-18 | LogoSticker: Inserting Logos into Diffusion Models for Customized Generation | Mingkang Zhu et.al. | 2407.13752 | null |
2024-07-18 | Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review | Masatoshi Uehara et.al. | 2407.13734 | link |
2024-07-18 | MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis | Ziming Zhong et.al. | 2407.13675 | link |
2024-07-18 | Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models | Xiaoyu Zhu et.al. | 2407.13642 | null |
2024-07-18 | Training-free Composite Scene Generation for Layout-to-Image Synthesis | Jiaqi Liu et.al. | 2407.13609 | link |
2024-07-18 | EnergyDiff: Universal Time-Series Energy Data Generation using Diffusion Models | Nan Lin et.al. | 2407.13538 | null |
2024-07-18 | All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models | Charumathi Badrinath et.al. | 2407.13449 | link |
2024-07-18 | Movement-based models for abundance data | Ricardo Carrizo Vergara et.al. | 2407.13384 | null |
2024-07-18 | URCDM: Ultra-Resolution Image Synthesis in Histopathology | Sarah Cechnicka et.al. | 2407.13277 | link |
2024-07-18 | Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models | Qiao Li et.al. | 2407.13252 | null |
2024-07-17 | SMooDi: Stylized Motion Diffusion Model | Lei Zhong et.al. | 2407.12783 | null |
2024-07-17 | VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control | Sherwin Bahmani et.al. | 2407.12781 | null |
2024-07-17 | Hallucination Index: An Image Quality Metric for Generative Reconstruction Models | Matthew Tivnan et.al. | 2407.12780 | null |
2024-07-17 | GroundUp: Rapid Sketch-Based 3D City Massing | Gizem Esra Unlu et.al. | 2407.12739 | null |
2024-07-17 | NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Zhongqun Zhang et.al. | 2407.12727 | null |
2024-07-18 | SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow | Yuanzhi Zhu et.al. | 2407.12718 | link |
2024-07-17 | IMAGDressing-v1: Customizable Virtual Dressing | Fei Shen et.al. | 2407.12705 | link |
2024-07-17 | 4Dynamic: Text-to-4D Generation with Hybrid Priors | Yu-Jie Yuan et.al. | 2407.12684 | null |
2024-07-17 | Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs | Yiqing Shen et.al. | 2407.12678 | link |
2024-07-17 | CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems | Jiankun Zhao et.al. | 2407.12676 | link |
2024-07-16 | Efficient Training with Denoised Neural Weights | Yifan Gong et.al. | 2407.11966 | null |
2024-07-16 | Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design | Leo Klarner et.al. | 2407.11942 | link |
2024-07-16 | Diffusion-driven self-assembly of emerin nanodomains at the nuclear envelope | Carlos D. Alas et.al. | 2407.11758 | null |
2024-07-16 | Mask-guided cross-image attention for zero-shot in-silico histopathologic image generation with a diffusion model | Dominik Winter et.al. | 2407.11664 | null |
2024-07-16 | CCVA-FL: Cross-Client Variations Adaptive Federated Learning for Medical Imaging | Sunny Gupta et.al. | 2407.11652 | null |
2024-07-16 | Scaling Diffusion Transformers to 16 Billion Parameters | Zhengcong Fei et.al. | 2407.11633 | link |
2024-07-16 | DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training | Guillermo Jimenez-Perez et.al. | 2407.11594 | null |
2024-07-17 | QVD: Post-training Quantization for Video Diffusion Models | Shilong Tian et.al. | 2407.11585 | null |
2024-07-17 | UP-Diff: Latent Diffusion Model for Remote Sensing Urban Prediction | Zeyu Wang et.al. | 2407.11578 | link |
2024-07-16 | TGIF: Text-Guided Inpainting Forgery Dataset | Hannes Mareen et.al. | 2407.11566 | link |
2024-07-15 | Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion | Yongyuan Liang et.al. | 2407.10973 | null |
2024-07-15 | InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models | Nirat Saini et.al. | 2407.10958 | null |
2024-07-16 | DataDream: Few-shot Guided Dataset Generation | Jae Myung Kim et.al. | 2407.10910 | link |
2024-07-15 | Optical Diffusion Models for Image Generation | Ilker Oguz et.al. | 2407.10897 | null |
2024-07-15 | R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection | Zheyuan Zhou et.al. | 2407.10862 | null |
2024-07-15 | Physics-Inspired Generative Models in Medical Imaging: A Review | Dennis Hein et.al. | 2407.10856 | null |
2024-07-15 | Conditional Guided Generative Diffusion for Particle Accelerator Beam Diagnostics | Alexander Scheinker et.al. | 2407.10693 | null |
2024-07-15 | Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Youngsun Lim et.al. | 2407.10683 | null |
2024-07-15 | Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction | Lin Zhu et.al. | 2407.10636 | null |
2024-07-15 | WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models | Zijian He et.al. | 2407.10625 | null |
2024-07-12 | Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees | Alexia Jolicoeur-Martineau et.al. | 2407.09357 | link |
2024-07-12 | PID: Physics-Informed Diffusion Model for Infrared Image Generation | Fangyuan Mao et.al. | 2407.09299 | link |
2024-07-12 | Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy | Julian Wyatt et.al. | 2407.09192 | null |
2024-07-12 | Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control | Huayu Chen et.al. | 2407.09024 | link |
2024-07-12 | TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models | Jeongho Kim et.al. | 2407.09012 | null |
2024-07-12 | Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive Training | Yunshu Wu et.al. | 2407.08946 | link |
2024-07-12 | Bora: Biomedical Generalist Video Generation Model | Weixiang Sun et.al. | 2407.08944 | null |
2024-07-12 | LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models | Hai Jiang et.al. | 2407.08939 | link |
2024-07-12 | Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning | Chuang Zhang et.al. | 2407.08914 | null |
2024-07-12 | AirSketch: Generative Motion to Sketch | Hui Xian Grace Lim et.al. | 2407.08906 | null |
2024-07-11 | Video Diffusion Alignment via Reward Gradients | Mihir Prabhudesai et.al. | 2407.08737 | link |
2024-07-11 | Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models | Zhening Xing et.al. | 2407.08701 | null |
2024-07-11 | Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density | Shuangqi Li et.al. | 2407.08659 | null |
2024-07-11 | Latent Conditional Diffusion-based Data Augmentation for Continuous-Time Dynamic Graph Mode | Yuxing Tian et.al. | 2407.08500 | null |
2024-07-11 | Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers | Zhengbo Zhang et.al. | 2407.08394 | null |
2024-07-11 | Wind Power Assessment based on Super-Resolution and Downscaling -- A Comparison of Deep Learning Methods | Luca Schmidt et.al. | 2407.08259 | null |
2024-07-11 | Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling | Noam Elata et.al. | 2407.08256 | null |
2024-07-11 | E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors | Jinxiu Liang et.al. | 2407.08231 | null |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2024-07-10 | Geospecific View Generation -- Geometry-Context Aware High-resolution Ground View Inference from Satellite Views | Ningli Xu et.al. | 2407.08061 | null |
2024-07-10 | Generative Image as Action Models | Mohit Shridhar et.al. | 2407.07875 | link |
2024-07-10 | Dynamical Measure Transport and Neural PDE Solvers for Sampling | Jingtong Sun et.al. | 2407.07873 | null |
2024-07-10 | Controlling Space and Time with Diffusion Models | Daniel Watson et.al. | 2407.07860 | null |
2024-07-10 | Generic Numerical Analysis of Stochastic Reaction Diffusion Model with applications in excitable media | Yahya Alnashri et.al. | 2407.07834 | null |
2024-07-10 | Universal and non-universal signatures in the scaling functions of critical variables | Gianluca Teza et.al. | 2407.07782 | null |
2024-07-10 | VEnhancer: Generative Space-Time Enhancement for Video Generation | Jingwen He et.al. | 2407.07667 | null |
2024-07-11 | MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis | Wanggui He et.al. | 2407.07614 | link |
2024-07-10 | Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field | Ganlin Yang et.al. | 2407.07461 | null |
2024-07-10 | Secondary Structure-Guided Novel Protein Sequence Generation with Latent Graph Diffusion | Yutong Hu et.al. | 2407.07443 | link |
2024-07-10 | Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis | Jian-Qing Zheng et.al. | 2407.07295 | link |
2024-07-09 | ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction | Shaozhe Hao et.al. | 2407.07077 | link |
2024-07-09 | RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models | Bowen Zhang et.al. | 2407.06938 | null |
2024-07-09 | HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance | Guian Fang et.al. | 2407.06937 | link |
2024-07-09 | A reaction-diffusion model for relapsing-remitting multiple sclerosis with a treatment term | Romina Travaglini et.al. | 2407.06802 | null |
2024-07-09 | Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning | Fanyue Wei et.al. | 2407.06642 | link |
2024-07-09 | Mobius: An High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task | Yiran Yang et.al. | 2407.06617 | link |
2024-07-09 | VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Yibo Liu et.al. | 2407.06516 | null |
2024-07-09 | Sketch-Guided Scene Image Generation | Tianyu Zhang et.al. | 2407.06469 | null |
2024-07-10 | Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation | Jianuo Huang et.al. | 2407.06317 | null |
2024-07-08 | VIMI: Grounding Video Generation through Multi-modal Instruction | Yuwei Fang et.al. | 2407.06304 | null |
2024-07-08 | JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation | Yu Zeng et.al. | 2407.06187 | null |
2024-07-08 | The Tug-of-War Between Deepfake Generation and Detection | Hannah Lee et.al. | 2407.06174 | null |
2024-07-08 | ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation | Ethan Chern et.al. | 2407.06135 | link |
2024-07-08 | Structured Generations: Using Hierarchical Clusters to guide Diffusion Models | Jorge da Silva Goncalves et.al. | 2407.06124 | link |
2024-07-08 | PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jinhua Zhang et.al. | 2407.06109 | link |
2024-07-08 | Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation | Xinyu Bai et.al. | 2407.06095 | null |
2024-07-08 | Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis | Emaad Khwaja et.al. | 2407.06079 | null |
2024-07-08 | Analysis and finite element approximation of a diffuse interface approach to the Stokes--Biot coupling | Francis R. A. Aznaran et.al. | 2407.05949 | null |
2024-07-08 | Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling | Lintao Zhang et.al. | 2407.05875 | link |
2024-07-08 | RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features | Inye Na et.al. | 2407.05683 | link |
2024-07-05 | Structural Constraint Integration in Generative Model for Discovery of Quantum Material Candidates | Ryotaro Okabe et.al. | 2407.04557 | null |
2024-07-05 | Unified continuous-time q-learning for mean-field game and mean-field control problems | Xiaoli Wei et.al. | 2407.04521 | null |
2024-07-08 | Speed-accuracy trade-off for the diffusion models: Wisdom from nonequilibrium thermodynamics and optimal transport | Kotaro Ikeda et.al. | 2407.04495 | null |
2024-07-05 | PROUD: PaRetO-gUided Diffusion Model for Multi-objective Generation | Yinghua Yao et.al. | 2407.04493 | link |
2024-07-05 | VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing | Shang Liu et.al. | 2407.04461 | null |
2024-07-05 | Comparing metallicity correlations in nearby non-AGN and AGN-host galaxies | Song-lin Li et.al. | 2407.04252 | null |
2024-07-05 | GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction | Yuxuan Mu et.al. | 2407.04237 | null |
2024-07-05 | T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models | Zhongqi Wang et.al. | 2407.04215 | link |
2024-07-05 | TimeLDM: Latent Diffusion Model for Unconditional Time Series Generation | Jian Qian et.al. | 2407.04211 | null |
2024-07-04 | Advances in Diffusion Models for Image Data Augmentation: A Review of Methods, Models, Evaluation Metrics and Future Research Directions | Panagiotis Alimisis et.al. | 2407.04103 | null |
2024-07-03 | DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents | Yilun Xu et.al. | 2407.03300 | link |
2024-07-03 | Improved Noise Schedule for Diffusion Training | Tiankai Hang et.al. | 2407.03297 | null |
2024-07-04 | Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis | Tong Zhou et.al. | 2407.03089 | null |
2024-07-03 | Electromagnetic Property Sensing Based on Diffusion Model in ISAC System | Yuhua Jiang et.al. | 2407.03075 | null |
2024-07-03 | Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models | Chunmei Xu et.al. | 2407.03050 | null |
2024-07-03 | SlerpFace: Face Template Protection via Spherical Linear Interpolation | Zhizhou Zhong et.al. | 2407.03043 | null |
2024-07-03 | Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation | Xiang Gao et.al. | 2407.03006 | link |
2024-07-04 | VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors | Sungwon Hwang et.al. | 2407.02945 | link |
2024-07-03 | Single Image Rolling Shutter Removal with Diffusion Models | Zhanglei Yang et.al. | 2407.02906 | null |
2024-07-03 | Robot Shape and Location Retention in Video Generation Using Diffusion Models | Peng Wang et.al. | 2407.02873 | link |
2024-07-02 | Magic Insert: Style-Aware Drag-and-Drop | Nataniel Ruiz et.al. | 2407.02489 | null |
2024-07-02 | Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models | Fei Shen et.al. | 2407.02482 | link |
2024-07-02 | GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models | Jian Ma et.al. | 2407.02252 | link |
2024-07-02 | LaMoD: Latent Motion Diffusion Model For Myocardial Strain Generation | Jiarui Xing et.al. | 2407.02229 | link |
2024-07-02 | UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks | Jingjing Ren et.al. | 2407.02158 | null |
2024-07-02 | Counterfactual Data Augmentation with Denoising Diffusion for Graph Anomaly Detection | Chunjing Xiao et.al. | 2407.02143 | link |
2024-07-02 | Latent Diffusion Model for Generating Ensembles of Climate Simulations | Johannes Meuer et.al. | 2407.02070 | null |
2024-07-02 | Accompanied Singing Voice Synthesis with Fully Text-controlled Melody | Ruiqi Li et.al. | 2407.02049 | null |
2024-07-02 | ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation | Zhiyuan Ma et.al. | 2407.02040 | link |
2024-07-02 | SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules | Suyi Li et.al. | 2407.02031 | null |
2024-06-28 | HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model | Hieu T. Nguyen et.al. | 2406.20077 | null |
2024-06-28 | Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence | Xiantao Fan et.al. | 2406.20047 | null |
2024-06-28 | HAITCH: A Framework for Distortion and Motion Correction in Fetal Multi-Shell Diffusion-Weighted MRI | Haykel Snoussi et.al. | 2406.20042 | null |
2024-06-28 | Deceptive Diffusion: Generating Synthetic Adversarial Examples | Lucas Beerens et.al. | 2406.19807 | null |
2024-06-28 | Comprehensive Generative Replay for Task-Incremental Segmentation with Concurrent Appearance and Semantic Forgetting | Wei Li et.al. | 2406.19796 | link |
2024-06-28 | Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels | Jie Zhang et.al. | 2406.19769 | null |
2024-06-28 | DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems | Kexiong Yu et.al. | 2406.19705 | null |
2024-06-28 | Network Bending of Diffusion Models for Audio-Visual Generation | Luke Dzwonczyk et.al. | 2406.19589 | link |
2024-06-27 | A Thermal Study of Terahertz Induced Protein Interactions | Hadeel Elayan et.al. | 2406.19521 | null |
2024-06-27 | pop-cosmos: Scaleable inference of galaxy properties and redshifts with a data-driven population model | Stephen Thorp et.al. | 2406.19437 | null |
2024-06-27 | Accelerating Multiphase Flow Simulations with Denoising Diffusion Model Driven Initializations | Jaehong Chung et.al. | 2406.19333 | null |
2024-06-27 | Subtractive Training for Music Stem Insertion using Latent Diffusion Models | Ivan Villa-Renteria et.al. | 2406.19328 | null |
2024-06-27 | Compositional Image Decomposition with Diffusion Models | Jocelin Su et.al. | 2406.19298 | null |
2024-06-27 | Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model | Jiangtong Tan et.al. | 2406.19030 | link |
2024-06-28 | AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation | Yanan Sun et.al. | 2406.18958 | link |
2024-06-27 | Investigating and Defending Shortcut Learning in Personalized Diffusion Models | Yixin Liu et.al. | 2406.18944 | link |
2024-06-28 | AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models | Aishwarya Agarwal et.al. | 2406.18893 | null |
2024-06-27 | Chemical Continuous Time Random Walks under Anomalous Diffusion | Hong Zhang et.al. | 2406.18869 | null |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524 | null |
2024-06-26 | Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration | Kang Liao et.al. | 2406.18516 | link |
2024-06-26 | DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance | Younghyun Kim et.al. | 2406.18459 | link |
2024-06-26 | Towards diffusion models for large-scale sea-ice modelling | Tobias Sebastian Finn et.al. | 2406.18417 | null |
2024-06-27 | Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process | Tianyu Lin et.al. | 2406.18361 | link |
2024-06-26 | Molecular Diffusion Models with Virtual Receptors | Matan Halfon et.al. | 2406.18330 | null |
2024-06-26 | Galaxy spectroscopy without spectra: Galaxy properties from photometric images with conditional diffusion models | Lars Doorenbos et.al. | 2406.18175 | link |
2024-06-26 | Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models | Xiaolin Hong et.al. | 2406.18159 | null |
2024-06-26 | Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation | Qilai Zhang et.al. | 2406.18054 | link |
2024-06-25 | DiffusionPDE: Generative PDE-Solving Under Partial Observation | Jiahe Huang et.al. | 2406.17763 | link |
2024-06-25 | Unified Auto-Encoding with Masked Diffusion | Philippe Hansen-Estruch et.al. | 2406.17688 | link |
2024-06-25 | LaTable: Towards Large Tabular Models | Boris van Breugel et.al. | 2406.17673 | null |
2024-06-25 | Aligning Diffusion Models with Noise-Conditioned Perception | Alexander Gambashidze et.al. | 2406.17636 | null |
2024-06-25 | Diffusion-based Adversarial Purification for Intrusion Detection | Mohamed Amine Merzouk et.al. | 2406.17606 | null |
2024-06-25 | Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text | Xinyang Li et.al. | 2406.17601 | link |
2024-06-25 | Detection of Synthetic Face Images: Accuracy, Robustness, Generalization | Nela Petrzelkova et.al. | 2406.17547 | null |
2024-06-25 | Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation | Felix Stillger et.al. | 2406.17541 | null |
2024-06-25 | The Tree of Diffusion Life: Evolutionary Embeddings to Understand the Generation Process of Diffusion Models | Vidya Prasad et.al. | 2406.17462 | null |
2024-06-25 | SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing | Ruihuang Li et.al. | 2406.17396 | null |
2024-06-24 | FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models | Haonan Qiu et.al. | 2406.16863 | link |
2024-06-24 | Dreamitate: Real-World Visuomotor Policy Learning via Video Generation | Junbang Liang et.al. | 2406.16862 | null |
2024-06-24 | General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design | Yue Jian et.al. | 2406.16821 | null |
2024-06-24 | Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image | Jinkun Hao et.al. | 2406.16710 | null |
2024-06-24 | Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling | Min-Seop Kwak et.al. | 2406.16695 | null |
2024-06-24 | Repulsive Score Distillation for Diverse Sampling of Diffusion Models | Nicolas Zilberstein et.al. | 2406.16683 | link |
2024-06-24 | OAML: Outlier Aware Metric Learning for OOD Detection Enhancement | Heng Gao et.al. | 2406.16525 | link |
2024-06-24 | DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Aiwen Jiang et.al. | 2406.16477 | link |
2024-06-24 | ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance | Shuwei Shi et.al. | 2406.16476 | null |
2024-06-24 | Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models | Yichen Sun et.al. | 2406.16333 | null |
2024-06-21 | Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild | Nadav Orzech et.al. | 2406.15331 | null |
2024-06-21 | You Only Acquire Sparse-channel (YOAS): A Unified Framework for Dense-channel EEG Generation | Hongyu Chen et.al. | 2406.15269 | null |
2024-06-21 | Unsupervised Bayesian Generation of Synthetic CT from CBCT Using Patient-Specific Score-Based Prior | Junbo Peng et.al. | 2406.15219 | null |
2024-06-21 | A3D: Does Diffusion Dream about 3D Alignment? | Savva Ignatyev et.al. | 2406.15020 | null |
2024-06-21 | Probabilistic and Differentiable Wireless Simulation with Geometric Transformers | Thomas Hehn et.al. | 2406.14995 | null |
2024-06-21 | VividDreamer: Towards High-Fidelity and Efficient Text-to-3D Generation | Zixuan Chen et.al. | 2406.14964 | null |
2024-06-21 | LatentExplainer: Explaining Latent Representations in Deep Generative Models with Multi-modal Foundation Models | Mengdan Zhu et.al. | 2406.14862 | link |
2024-06-21 | Six-CD: Benchmarking Concept Removals for Benign Text-to-image Diffusion Models | Jie Ren et.al. | 2406.14855 | link |
2024-06-21 | DExter: Learning and Controlling Performance Expression with Diffusion Models | Huan Zhang et.al. | 2406.14850 | link |
2024-06-21 | Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning | Xu Han et.al. | 2406.14847 | null |
2024-06-20 | A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models | Xincheng Shuai et.al. | 2406.14555 | link |
2024-06-21 | Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation | Eyal Michaeli et.al. | 2406.14551 | link |
2024-06-20 | Consistency Models Made Easy | Zhengyang Geng et.al. | 2406.14548 | link |
2024-06-20 | Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps | Nikita Starodubcev et.al. | 2406.14539 | null |
2024-06-20 | V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data | Rotem Shalev-Arkushin et.al. | 2406.14510 | null |
2024-06-20 | SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset | Josef Dai et.al. | 2406.14477 | link |
2024-06-20 | CollaFuse: Collaborative Diffusion Models | Simeon Allmendinger et.al. | 2406.14429 | link |
2024-06-20 | Active Diffusion Subsampling | Oisin Nolan et.al. | 2406.14388 | link |
2024-06-20 | In Tree Structure Should Sentence Be Generated | Yaguang Li et.al. | 2406.14189 | link |
2024-06-20 | CriDiff: Criss-cross Injection Diffusion Framework via Generative Pre-train for Prostate Segmentation | Tingwei Liu et.al. | 2406.14186 | link |
2024-06-18 | Evaluating the design space of diffusion-based generative models | Yuqing Wang et.al. | 2406.12839 | null |
2024-06-18 | Neural Approximate Mirror Maps for Constrained Diffusion Models | Berthy T. Feng et.al. | 2406.12816 | null |
2024-06-18 | Extracting Training Data from Unconditional Diffusion Models | Yunhao Chen et.al. | 2406.12752 | null |
2024-06-18 | Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation | Miseul Kim et.al. | 2406.12688 | null |
2024-06-18 | GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models | Yongtao Ge et.al. | 2406.12671 | link |
2024-06-18 | Unmasking the Veil: An Investigation into Concept Ablation for Privacy and Copyright Protection in Images | Shivank Garg et.al. | 2406.12592 | link |
2024-06-18 | Training Diffusion Models with Federated Learning | Matthijs de Goede et.al. | 2406.12575 | null |
2024-06-18 | Variational Distillation of Diffusion Policies into Mixture of Experts | Hongyi Zhou et.al. | 2406.12538 | null |
2024-06-18 | HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors | Panwang Pan et.al. | 2406.12459 | link |
2024-06-18 | Planning Using Schrödinger Bridge Diffusion Models | Adarsh Srivastava et.al. | 2406.12458 | link |
2024-06-17 | Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models | Bingqi Ma et.al. | 2406.11831 | null |
2024-06-17 | MegaScenes: Scene-Level View Synthesis at Scale | Joseph Tung et.al. | 2406.11819 | link |
2024-06-17 | DiffMM: Multi-Modal Diffusion Model for Recommendation | Yangqin Jiang et.al. | 2406.11781 | link |
2024-06-17 | Latent Denoising Diffusion GAN: Faster sampling, Higher image quality | Luan Thanh Trinh et.al. | 2406.11713 | link |
2024-06-17 | MusicScore: A Dataset for Music Score Modeling and Generation | Yuheng Lin et.al. | 2406.11462 | link |
2024-06-17 | AnyTrans: Translate AnyText in the Image with Large Scale Models | Zhipeng Qian et.al. | 2406.11432 | null |
2024-06-17 | DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer | Keon Lee et.al. | 2406.11427 | null |
2024-06-17 | Unfolding Time: Generative Modeling for Turbulent Flows in 4D | Abdullah Saydemir et.al. | 2406.11390 | null |
2024-06-17 | Diffusion Models in Low-Level Vision: A Survey | Chunming He et.al. | 2406.11138 | link |
2024-06-16 | Exploiting Diffusion Prior for Out-of-Distribution Detection | Armando Zhu et.al. | 2406.11105 | null |
2024-06-14 | SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models | Zhaoxu Luo et.al. | 2406.10225 | null |
2024-06-14 | DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography Reconstruction | Bowen Song et.al. | 2406.10211 | null |
2024-06-14 | Make It Count: Text-to-Image Generation with an Accurate Number of Objects | Lital Binyamin et.al. | 2406.10210 | null |
2024-06-14 | Crafting Parts for Expressive Object Composition | Harsh Rangwani et.al. | 2406.10197 | null |
2024-06-14 | Training-free Camera Control for Video Generation | Chen Hou et.al. | 2406.10126 | null |
2024-06-14 | Group and Shuffle: Efficient Structured Orthogonal Parametrization | Mikhail Gorbunov et.al. | 2406.10019 | null |
2024-06-14 | OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control | Yuzhong Huang et.al. | 2406.10000 | null |
2024-06-14 | InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement Learning | Tiancheng Li et.al. | 2406.09973 | null |
2024-06-14 | GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion | Trapoom Ukarapol et.al. | 2406.09850 | link |
2024-06-14 | Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion | Runze Liu et.al. | 2406.09782 | null |
2024-06-13 | Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models | Qihao Liu et.al. | 2406.09416 | null |
2024-06-13 | An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels | Duy-Kien Nguyen et.al. | 2406.09415 | null |
2024-06-13 | Interpreting the Weight Space of Customized Diffusion Models | Amil Dravid et.al. | 2406.09413 | link |
2024-06-13 | ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing | Jun-Kun Chen et.al. | 2406.09404 | null |
2024-06-13 | Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion | Linzhan Mou et.al. | 2406.09402 | null |
2024-06-13 | OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation | Junke Wang et.al. | 2406.09399 | link |
2024-06-13 | SimGen: Simulator-conditioned Driving Scene Generation | Yunsong Zhou et.al. | 2406.09386 | null |
2024-06-13 | CLIPAway: Harmonizing Focused Embeddings for Removing Objects via Diffusion Models | Yigit Ekin et.al. | 2406.09368 | link |
2024-06-13 | Understanding Hallucinations in Diffusion Models through Mode Interpolation | Sumukh K Aithal et.al. | 2406.09358 | link |
2024-06-13 | Advancing Graph Generation through Beta Diffusion | Yilin He et.al. | 2406.09357 | link |
2024-06-12 | Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation | Raphael Tang et.al. | 2406.08482 | null |
2024-06-12 | Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models | Yuxuan Xue et.al. | 2406.08475 | null |
2024-06-12 | Pranath Reddy et.al. | 2406.08442 | null | |
2024-06-12 | Diffusion Soup: Model Merging for Text-to-Image Diffusion Models | Benjamin Biggs et.al. | 2406.08431 | null |
2024-06-12 | FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation | Xinzhi Mu et.al. | 2406.08392 | null |
2024-06-12 | Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models | Javier Nistal et.al. | 2406.08384 | null |
2024-06-12 | 2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction | Tianqi Chen et.al. | 2406.08374 | null |
2024-06-12 | WMAdapter: Adding WaterMark Control to Latent Diffusion Models | Hai Ci et.al. | 2406.08337 | null |
2024-06-12 | Dataset Enhancement with Instance-Level Augmentations | Orest Kupyn et.al. | 2406.08249 | link |
2024-06-12 | Diffusion-Promoted HDR Video Reconstruction | Yuanshen Guan et.al. | 2406.08204 | null |
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550 | link |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507 | null |
2024-06-11 | GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao et.al. | 2406.07487 | link |
2024-06-11 | Image Neural Field Diffusion Models | Yinbo Chen et.al. | 2406.07480 | null |
2024-06-11 | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Heng Yu et.al. | 2406.07472 | null |
2024-06-11 | Noise-robust Speech Separation with Fast Generative Correction | Helin Wang et.al. | 2406.07461 | link |
2024-06-10 | IllumiNeRF: 3D Relighting without Inverse Rendering | Xiaoming Zhao et.al. | 2406.06527 | null |
2024-06-10 | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Peize Sun et.al. | 2406.06525 | link |
2024-06-10 | Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer | Sigal Raab et.al. | 2406.06508 | link |
2024-06-10 | AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction | Zhen Xing et.al. | 2406.06465 | null |
2024-06-10 | Cometh: A continuous-time discrete-state graph diffusion model | Antoine Siraudin et.al. | 2406.06449 | null |
2024-06-10 | Margin-aware Preference Optimization for Aligning Diffusion Models without Reference | Jiwoo Hong et.al. | 2406.06424 | null |
2024-06-10 | Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization | Yi Gu et.al. | 2406.06382 | link |
2024-06-10 | Improving Deep Learning-based Automatic Cranial Defect Reconstruction by Heavy Data Augmentation: From Image Registration to Latent Diffusion Models | Marek Wodzinski et.al. | 2406.06372 | null |
2024-06-10 | MVGamba: Unify 3D Content Generation as State Space Sequence Modeling | Xuanyu Yi et.al. | 2406.06367 | link |
2024-06-11 | Tuning-Free Visual Customization via View Iterative Self-Attention Control | Xiaojie Li et.al. | 2406.06258 | link |
2024-06-07 | CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion | Xingrui Wang et.al. | 2406.05082 | null |
2024-06-07 | Generative diffusion models for synthetic trajectories of heavy and light particles in turbulence | Tianyi Li et.al. | 2406.05008 | null |
2024-06-07 | Learning Divergence Fields for Shift-Robust Graph Representations | Qitian Wu et.al. | 2406.04963 | link |
2024-06-07 | Combinatorial Complex Score-based Diffusion Modelling through Stochastic Differential Equations | Adrien Carrel et.al. | 2406.04916 | link |
2024-06-07 | Online Continual Learning of Video Diffusion Models From a Single Video Stream | Jason Yoo et.al. | 2406.04814 | null |
2024-06-07 | TEDi Policy: Temporally Entangled Diffusion for Robotic Control | Sigmund H. Høeg et.al. | 2406.04806 | link |
2024-06-07 | Diffusion-based Generative Image Outpainting for Recovery of FOV-Truncated CT Images | Michelle Espranita Liman et.al. | 2406.04769 | link |
2024-06-07 | PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction | Eduard Poesina et.al. | 2406.04746 | link |
2024-06-07 | FlowMM: Generating Materials with Riemannian Flow Matching | Benjamin Kurt Miller et.al. | 2406.04713 | null |
2024-06-07 | MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models | Sanjoy Chowdhury et.al. | 2406.04673 | link |
2024-06-07 | Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion | Fangfu Liu et.al. | 2406.04338 | null |
2024-06-06 | Coherent Zero-Shot Visual Instruction Generation | Quynh Phung et.al. | 2406.04337 | null |
2024-06-06 | BitsFusion: 1.99 bits Weight Quantization of Diffusion Model | Yang Sui et.al. | 2406.04333 | link |
2024-06-06 | Simplified and Generalized Masked Diffusion for Discrete Data | Jiaxin Shi et.al. | 2406.04329 | link |
2024-06-06 | SF-V: Single Forward Video Generation Model | Zhixing Zhang et.al. | 2406.04324 | link |
2024-06-06 | ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories | Qianlan Yang et.al. | 2406.04323 | null |
2024-06-07 | DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data | Qihao Liu et.al. | 2406.04322 | link |
2024-06-06 | Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step | Zhanhao Liang et.al. | 2406.04314 | link |
2024-06-06 | Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment | Jiayi Guo et.al. | 2406.04295 | link |
2024-06-06 | VideoTetris: Towards Compositional Text-to-Video Generation | Ye Tian et.al. | 2406.04277 | link |
2024-06-05 | Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input | Joachim Ott et.al. | 2406.03439 | null |
2024-06-05 | Text-to-Image Rectified Flow as Plug-and-Play Priors | Xiaofeng Yang et.al. | 2406.03293 | link |
2024-06-05 | Generative Diffusion Models for Fast Simulations of Particle Collisions at CERN | Mikołaj Kita et.al. | 2406.03233 | null |
2024-06-05 | Searching Priors Makes Text-to-Video Synthesis Better | Haoran Cheng et.al. | 2406.03215 | null |
2024-06-05 | Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion | Hao Wen et.al. | 2406.03184 | link |
2024-06-05 | Tiny models from tiny data: Textual and null-text inversion for few-shot distillation | Erik Landolsi et.al. | 2406.03146 | link |
2024-06-05 | Floating Anchor Diffusion Model for Multi-motif Scaffolding | Ke Liu et.al. | 2406.03141 | link |
2024-06-05 | Phy-Diff: Physics-guided Hourglass Diffusion Model for Diffusion MRI Synthesis | Juanhua Zhang et.al. | 2406.03002 | null |
2024-06-05 | Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models | Zihan Ye et.al. | 2406.02929 | null |
2024-06-06 | U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation | Chenxin Li et.al. | 2406.02918 | null |
2024-06-04 | Dreamguider: Improved Training free Diffusion-based Conditional Generation | Nithin Gopalakrishnan Nair et.al. | 2406.02549 | null |
2024-06-05 | Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting | Inkyu Shin et.al. | 2406.02541 | null |
2024-06-04 | CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation | Dejia Xu et.al. | 2406.02509 | null |
2024-06-04 | Guiding a Diffusion Model with a Bad Version of Itself | Tero Karras et.al. | 2406.02507 | link |
2024-06-04 | Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation | Jiajun Wang et.al. | 2406.02485 | link |
2024-06-04 | Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion | Colin Hansen et.al. | 2406.02477 | null |
2024-06-04 | Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems | Jason Hu et.al. | 2406.02462 | link |
2024-06-04 | RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting | Qi Wang et.al. | 2406.02461 | null |
2024-06-04 | Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models | Dominik Hintersdorf et.al. | 2406.02366 | link |
2024-06-04 | Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation | Clement Chadebec et.al. | 2406.02347 | link |
2024-05-31 | Mixed Diffusion for 3D Indoor Scene Synthesis | Siyi Hu et.al. | 2405.21066 | link |
2024-05-31 | Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models | Jingjing Wang et.al. | 2405.21059 | null |
2024-05-31 | Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models | Xinxi Zhang et.al. | 2405.21050 | null |
2024-05-31 | Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | Jiatao Gu et.al. | 2405.21048 | null |
2024-05-31 | Amortizing intractable inference in diffusion models for vision, language, and control | Siddarth Venkatraman et.al. | 2405.20971 | link |
2024-05-31 | Flow matching achieves minimax optimal convergence | Kenji Fukumizu et.al. | 2405.20879 | null |
2024-05-31 | MegActor: Harness the Power of Raw Video for Vivid Portrait Animation | Shurong Yang et.al. | 2405.20851 | link |
2024-05-31 | Share Your Secrets for Privacy! Confidential Forecasting with Vertical Federated Learning | Aditya Shankar et.al. | 2405.20761 | link |
2024-05-31 | Information Theoretic Text-to-Image Alignment | Chao Wang et.al. | 2405.20759 | null |
2024-05-31 | Diffusion Models Are Innate One-Step Generators | Bowen Zheng et.al. | 2405.20750 | link |
2024-05-30 | Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image | Kailu Wu et.al. | 2405.20343 | link |
2024-05-30 | VividDream: Generating 3D Scene with Ambient Dynamics | Yao-Chih Lee et.al. | 2405.20334 | null |
2024-05-30 | MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion | Shuyuan Tu et.al. | 2405.20325 | link |
2024-05-30 | Don't drop your samples! Coherence-aware training benefits Conditional diffusion | Nicolas Dufour et.al. | 2405.20324 | null |
2024-05-30 | Improving the Training of Rectified Flows | Sangyun Lee et.al. | 2405.20320 | link |
2024-05-30 | DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation | Zachary Novack et.al. | 2405.20289 | null |
2024-05-30 | MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model | Muyao Niu et.al. | 2405.20222 | link |
2024-05-30 | Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback | Sanghyeon Na et.al. | 2405.20216 | null |
2024-05-30 | MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models | Lukas Uzolas et.al. | 2405.20155 | null |
2024-05-31 | DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild | Honghao Fu et.al. | 2405.19996 | link |
2024-05-29 | ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning | Ruchika Chavhan et.al. | 2405.19237 | link |
2024-05-30 | Weitian Zhang et.al. | 2405.19203 | null | |
2024-05-29 | Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning | Hanye Zhao et.al. | 2405.19189 | link |
2024-05-29 | Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization | Zhiwei Tang et.al. | 2405.18881 | link |
2024-05-29 | Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors | Zihui Wu et.al. | 2405.18782 | link |
2024-05-29 | RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching | Divya Nori et.al. | 2405.18768 | link |
2024-05-29 | Stationary distribution approximations of Two-island Wright-Fisher and seed-bank models using Stein's method | Han L. Gan et.al. | 2405.18763 | null |
2024-05-29 | Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning | Tianle Zhang et.al. | 2405.18729 | null |
2024-05-29 | Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRI | Che Liu et.al. | 2405.18726 | null |
2024-05-29 | Learning Diffeomorphism for Image Registration with Time-Continuous Networks using Semigroup Regularization | Mohammadjavad Matinkia et.al. | 2405.18684 | link |
2024-05-28 | DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention | Lianghui Zhu et.al. | 2405.18428 | link |
2024-05-28 | Phased Consistency Model | Fu-Yun Wang et.al. | 2405.18407 | link |
2024-05-28 | RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives | Jaehong Yoon et.al. | 2405.18406 | link |
2024-05-28 | Multi-modal Generation via Cross-Modal In-Context Learning | Amandeep Kumar et.al. | 2405.18304 | link |
2024-05-28 | CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths | Reihaneh Teimouri et.al. | 2405.18267 | link |
2024-05-28 | EG4D: Explicit Generation of 4D Object without Score Distillation | Qi Sun et.al. | 2405.18132 | link |
2024-05-28 | Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? | Zebin You et.al. | 2405.18029 | null |
2024-05-28 | Unveiling the Power of Diffusion Features For Personalized Segmentation and Retrieval | Dvir Samuel et.al. | 2405.18025 | link |
2024-05-28 | MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling | Bowen Zhang et.al. | 2405.18003 | link |
2024-05-28 | AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization | Junjie Shentu et.al. | 2405.17965 | link |
2024-05-27 | Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer | Ruizhi Shao et.al. | 2405.17405 | null |
2024-05-27 | A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training | Kai Wang et.al. | 2405.17403 | link |
2024-05-27 | RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control | Litu Rout et.al. | 2405.17401 | null |
2024-05-27 | EASI-Tex: Edge-Aware Mesh Texturing from Single Image | Sai Raj Kishore Perla et.al. | 2405.17393 | null |
2024-05-28 | Controllable Longer Image Animation with Diffusion Models | Qiang Wang et.al. | 2405.17306 | null |
2024-05-27 | Does Diffusion Beat GAN in Image Super Resolution? | Denis Kuznedelev et.al. | 2405.17261 | link |
2024-05-27 | DreamMat: High-quality PBR Material Generation with Geometry- and Light-aware Diffusion Models | Yuqing Zhang et.al. | 2405.17176 | null |
2024-05-27 | Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction | Wenhao Zhang et.al. | 2405.17167 | null |
2024-05-27 | PatchScaler: An Efficient Patch-independent Diffusion Model for Super-Resolution | Yong Liu et.al. | 2405.17158 | link |
2024-05-27 | Ensembling Diffusion Models via Adaptive Feature Aggregation | Cong Wang et.al. | 2405.17082 | link |
2024-05-24 | Looking Backward: Streaming Video-to-Video Translation with Feature Banks | Feng Liang et.al. | 2405.15757 | link |
2024-05-24 | Taming Score-Based Diffusion Priors for Infinite-Dimensional Nonlinear Inverse Problems | Lorenzo Baldassari et.al. | 2405.15676 | null |
2024-05-24 | Reducing the cost of posterior sampling in linear inverse problems via task-dependent score learning | Fabian Schneider et.al. | 2405.15643 | null |
2024-05-24 | DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation | Xiankang He et.al. | 2405.15619 | null |
2024-05-24 | Learning to Discretize Denoising Diffusion ODEs | Vinh Tong et.al. | 2405.15506 | link |
2024-05-24 | Out of Many, One: Designing and Scaffolding Proteins at the Scale of the Structural Universe with Genie 2 | Yeqing Lin et.al. | 2405.15489 | link |
2024-05-24 | NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer | Meng You et.al. | 2405.15364 | link |
2024-05-24 | SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation | Xinlei Niu et.al. | 2405.15338 | null |
2024-05-24 | Challenges and Opportunities in 3D Content Generation | Ke Zhao et.al. | 2405.15335 | null |
2024-05-24 | Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model | Mingyang Yi et.al. | 2405.15330 | null |
2024-05-24 | Improved Distribution Matching Distillation for Fast Image Synthesis | Tianwei Yin et.al. | 2405.14867 | link |
2024-05-23 | Video Diffusion Models are Training-free Motion Interpreter and Controller | Zeqi Xiao et.al. | 2405.14864 | null |
2024-05-23 | Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models | Gen Li et.al. | 2405.14861 | null |
2024-05-23 | Semantica: An Adaptable Image-Conditioned Diffusion Model | Manoj Kumar et.al. | 2405.14857 | null |
2024-05-23 | TerDiT: Ternary Diffusion Models with Transformers | Xudong Lu et.al. | 2405.14854 | link |
2024-05-23 | Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer | Shuang Wu et.al. | 2405.14832 | null |
2024-05-23 | Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models | Katherine Xu et.al. | 2405.14828 | null |
2024-05-23 | PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher | Dongjun Kim et.al. | 2405.14822 | link |
2024-05-24 | Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation | Hongxu Jiang et.al. | 2405.14802 | link |
2024-05-23 | Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy | Shengfang Zhai et.al. | 2405.14800 | link |
2024-05-21 | Personalized Residuals for Concept-Driven Text-to-Image Generation | Cusuh Ham et.al. | 2405.12978 | null |
2024-05-21 | Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control | Yue Han et.al. | 2405.12970 | null |
2024-05-21 | Impact of inhomogeneous diffusion on secondary cosmic ray and antiproton local spectra | Álvaro Tovar-Pardo et.al. | 2405.12918 | null |
2024-05-21 | Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images | Xiaofei Yu et.al. | 2405.12875 | link |
2024-05-21 | Model Free Prediction with Uncertainty Assessment | Yuling Jiao et.al. | 2405.12684 | null |
2024-05-21 | CustomText: Customized Textual Image Generation using Diffusion Models | Shubham Paliwal et.al. | 2405.12531 | null |
2024-05-21 | Customize Your Own Paired Data via Few-shot Way | Jinshu Chen et.al. | 2405.12490 | null |
2024-05-21 | One-step data-driven generative model via Schrödinger Bridge | Hanwen Huang et.al. | 2405.12453 | null |
2024-05-20 | Diffusion for World Modeling: Visual Details Matter in Atari | Eloi Alonso et.al. | 2405.12399 | link |
2024-05-20 | Images that Sound: Composing Images and Sounds on a Single Canvas | Ziyang Chen et.al. | 2405.12221 | null |
2024-05-20 | Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices | Nathaniel Cohen et.al. | 2405.12211 | link |
2024-05-20 | Nonequilbrium physics of generative diffusion models | Zhendong Yu et.al. | 2405.11932 | null |
2024-05-20 | "Set It Up!": Functional Object Arrangement with Compositional Generative Models | Yiqing Xu et.al. | 2405.11928 | null |
2024-05-20 | Diff-BGM: A Diffusion Model for Video Background Music Generation | Sizhe Li et.al. | 2405.11913 | link |
2024-05-20 | Out-of-Distribution Detection with a Single Unconditional Diffusion Model | Alvin Heng et.al. | 2405.11881 | link |
2024-05-20 | Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models | Xiyu Wang et.al. | 2405.11852 | null |
2024-05-20 | Alternators For Sequence Modeling | Mohammad Reza Rezaei et.al. | 2405.11848 | null |
2024-05-20 | ViViD: Video Virtual Try-on using Diffusion Models | Zixun Fang et.al. | 2405.11794 | null |
2024-05-20 | Guided Multi-objective Generative AI to Enhance Structure-based Drug Design | Amit Kadan et.al. | 2405.11785 | link |
2024-05-17 | Improving face generation quality and prompt following with synthetic captions | Michail Tarasiou et.al. | 2405.10864 | null |
2024-05-17 | Deep Data Consistency: a Fast and Robust Diffusion Model-based Solver for Inverse Problems | Hanyu Chen et.al. | 2405.10748 | link |
2024-05-17 | Numerical Recovery of the Diffusion Coefficient in Diffusion Equations from Terminal Measurement | Bangti Jin et.al. | 2405.10708 | null |
2024-05-17 | LoCI-DiffCom: Longitudinal Consistency-Informed Diffusion Model for 3D Infant Brain Image Completion | Zihao Zhu et.al. | 2405.10691 | null |
2024-05-17 | LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion | Tong Chen et.al. | 2405.10550 | link |
2024-05-17 | ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation | Pengzhi Li et.al. | 2405.10508 | null |
2024-05-16 | Text-to-Vector Generation with Neural Path Representation | Peiying Zhang et.al. | 2405.10317 | null |
2024-05-16 | Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model | Zheng Gu et.al. | 2405.10316 | null |
2024-05-16 | CAT3D: Create Anything in 3D with Multi-View Diffusion Models | Ruiqi Gao et.al. | 2405.10314 | null |
2024-05-16 | Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks | João Bordalo et.al. | 2405.10122 | null |
2024-05-16 | Spurious reconstruction from brain activity | Ken Shirakawa et.al. | 2405.10078 | link |
2024-05-16 | Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution | Xingjian Wang et.al. | 2405.10014 | null |
2024-05-16 | VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing | Binghui Chen et.al. | 2405.09985 | null |
2024-05-16 | Language-Oriented Semantic Latent Representation for Image Transmission | Giordano Cicchetti et.al. | 2405.09976 | link |
2024-05-16 | Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models | Ziyu Wang et.al. | 2405.09901 | link |
2024-05-16 | DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection | Yuhao Sun et.al. | 2405.09882 | link |
2024-05-16 | MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer | Chengyu Wu et.al. | 2405.09539 | link |
2024-05-15 | Diffusion-based Contrastive Learning for Sequential Recommendation | Ziqiang Cui et.al. | 2405.09369 | link |
2024-05-15 | Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | Xuanchen Wang et.al. | 2405.09266 | null |
2024-05-15 | SOEDiff: Efficient Distillation for Small Object Editing | Qihe Pan et.al. | 2405.09114 | null |
2024-05-15 | RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing | Jiamei Xiong et.al. | 2405.09083 | link |
2024-05-15 | Naturalistic Music Decoding from EEG Data via Latent Diffusion Models | Emilian Postolache et.al. | 2405.09062 | null |
2024-05-15 | Response Matching for generating materials and molecules | Bingqing Cheng et.al. | 2405.09057 | null |
2024-05-15 | CTS: A Consistency-Based Medical Image Segmentation Model | Kejia Zhang et.al. | 2405.09056 | link |
2024-05-14 | Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models | Bingdong Li et.al. | 2405.08674 | null |
2024-05-14 | Towards Multi-Task Generative-AI Edge Services with an Attention-based Diffusion DRL Approach | Yaju Liu et.al. | 2405.08328 | null |
2024-05-14 | Compositional Text-to-Image Generation with Dense Blob Representations | Weili Nie et.al. | 2405.08246 | null |
2024-05-13 | Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis | Yifan Wang et.al. | 2405.08210 | null |
2024-05-13 | Do Bayesian imaging methods report trustworthy probabilities? | David Y. W. Thong et.al. | 2405.08179 | null |
2024-05-13 | DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation | Ziang Cao et.al. | 2405.08055 | link |
2024-05-13 | Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning | Wenqi Dong et.al. | 2405.08054 | null |
2024-05-13 | Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data | Mahdi Morafah et.al. | 2405.07925 | null |
2024-05-13 | CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models | Nick Stracke et.al. | 2405.07913 | null |
2024-05-13 | SAR Image Synthesis with Diffusion Models | Denisa Qosja et.al. | 2405.07776 | null |
2024-05-13 | CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution | Qingguo Liu et.al. | 2405.07648 | link |
2024-05-13 | De novo antibody design with SE(3) diffusion | Daniel Cutting et.al. | 2405.07622 | null |
2024-05-13 | Reducing Risk for Assistive Reinforcement Learning Policies with Diffusion Models | Andrii Tytarenko et.al. | 2405.07603 | null |
2024-05-13 | PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator | Hanshu Yan et.al. | 2405.07510 | link |
2024-05-13 | GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting | Haodong Chen et.al. | 2405.07472 | null |
2024-05-12 | Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning | Masane Fuchi et.al. | 2405.07288 | link |
2024-05-12 | Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising | Yao Liu et.al. | 2405.07164 | null |
2024-05-10 | OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | Jinwei Lin et.al. | 2405.06547 | link |
2024-05-10 | SketchDream: Sketch-based Text-to-3D Generation and Editing | Feng-Lin Liu et.al. | 2405.06461 | null |
2024-05-10 | PUMA: margin-based data pruning | Javier Maroto et.al. | 2405.06298 | null |
2024-05-10 | Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging | Zhuchen Shao et.al. | 2405.06175 | null |
2024-05-09 | Distilling Diffusion Models into Conditional GANs | Minguk Kang et.al. | 2405.05967 | null |
2024-05-09 | Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask | Zineb Senane et.al. | 2405.05959 | link |
2024-05-09 | Frame Interpolation with Consecutive Brownian Bridge Diffusion | Zonglin Lyu et.al. | 2405.05953 | link |
2024-05-09 | Composable Part-Based Manipulation | Weiyu Liu et.al. | 2405.05876 | null |
2024-05-09 | Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control | Gunshi Gupta et.al. | 2405.05852 | link |
2024-05-09 | Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models | Zhe Ma et.al. | 2405.05846 | link |
2024-05-09 | MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction | Pinhuang Tan et.al. | 2405.05814 | null |
2024-05-10 | MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation | Yuxiang Wei et.al. | 2405.05806 | link |
2024-05-09 | DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation | Sitian Shen et.al. | 2405.05800 | null |
2024-05-09 | Sequential Amodal Segmentation via Cumulative Occlusion Learning | Jiayang Ao et.al. | 2405.05791 | null |
2024-05-08 | Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo | Nayantara Mudur et.al. | 2405.05255 | link |
2024-05-08 | Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models | Hongjie Wang et.al. | 2405.05252 | null |
2024-05-08 | Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation | Jonas Kohler et.al. | 2405.05224 | null |
2024-05-08 | FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Jinglin Xu et.al. | 2405.05216 | link |
2024-05-08 | An anti-noise seismic inversion method based on diffusion model | Yingtian Liu et.al. | 2405.05026 | link |
2024-05-08 | Discrepancy-based Diffusion Models for Lesion Detection in Brain MRI | Keqiang Fan et.al. | 2405.04974 | null |
2024-05-08 | Empowering Wireless Networks with Artificial Intelligence Generated Graph | Jiacheng Wang et.al. | 2405.04907 | null |
2024-05-08 | Fast LiDAR Upsampling using Conditional Diffusion Models | Sander Elias Magnussen Helgesen et.al. | 2405.04889 | link |
2024-05-08 | FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation | Xuehai He et.al. | 2405.04834 | null |
2024-05-08 | Variational Schrödinger Diffusion Models | Wei Deng et.al. | 2405.04795 | null |
2024-05-07 | Tactile-Augmented Radiance Fields | Yiming Dou et.al. | 2405.04534 | link |
2024-05-07 | Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing | Yi Zuo et.al. | 2405.04496 | null |
2024-05-07 | CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model | Haixia Xiao et.al. | 2405.04483 | null |
2024-05-07 | Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos | Junyi Ma et.al. | 2405.04370 | link |
2024-05-07 | Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation | Jihyun Kim et.al. | 2405.04356 | link |
2024-05-08 | Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer | Zhuoyi Yang et.al. | 2405.04312 | link |
2024-05-07 | BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models | Eloi Moliner et.al. | 2405.04272 | null |
2024-05-07 | Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models | Fan Bao et.al. | 2405.04233 | null |
2024-05-07 | Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model | Joo Young Choi et.al. | 2405.03958 | null |
2024-05-06 | MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View | Emmanuelle Bourigault et.al. | 2405.03894 | null |
2024-05-06 | Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models | Ludwig Winkler et.al. | 2405.03549 | null |
2024-05-06 | CCDM: Continuous Conditional Diffusion Models for Image Generation | Xin Ding et.al. | 2405.03546 | link |
2024-05-06 | LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model | Haowen Sun et.al. | 2405.03485 | link |
2024-05-06 | Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond | Jiuxiang Gu et.al. | 2405.03251 | null |
2024-05-06 | Hyperbolic Geometric Latent Diffusion Model for Graph Generation | Xingcheng Fu et.al. | 2405.03188 | link |
2024-05-06 | DeepMpMRI: Tensor-decomposition Regularized Learning for Fast and High-Fidelity Multi-Parametric Microstructural MR Imaging | Wenxin Fan et.al. | 2405.03159 | null |
2024-05-06 | Video Diffusion Models: A Survey | Andrew Melnik et.al. | 2405.03150 | link |
2024-05-06 | AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding | Tao Liu et.al. | 2405.03121 | link |
2024-05-05 | Matten: Video Generation with Mamba-Attention | Yu Gao et.al. | 2405.03025 | null |
2024-05-05 | Exploring Text-based Realistic Building Facades Editing Applicaiton | Jing Wang et.al. | 2405.02967 | null |
2024-05-03 | DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos | Wen-Hsuan Chu et.al. | 2405.02280 | link |
2024-05-03 | Multi-grid reaction-diffusion master equation: applications to morphogen gradient modelling | Radek Erban et.al. | 2405.02117 | null |
2024-05-03 | DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model | Peijin Jia et.al. | 2405.02008 | null |
2024-05-03 | Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition | Yichun Tai et.al. | 2405.01872 | null |
2024-05-03 | Creation of Novel Soft Robot Designs using Generative AI | Wee Kiat Chan et.al. | 2405.01824 | null |
2024-05-03 | Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics | Rucha Deshpande et.al. | 2405.01822 | null |
2024-05-02 | Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model | Zongyang Du et.al. | 2405.01730 | null |
2024-05-02 | Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning | Rafael Elberg et.al. | 2405.01705 | link |
2024-05-02 | LocInv: Localization-aware Inversion for Text-Guided Image Editing | Chuanming Tang et.al. | 2405.01496 | link |
2024-05-02 | Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models | Matias Mendieta et.al. | 2405.01494 | null |
2024-05-02 | Statistical algorithms for low-frequency diffusion data: A PDE approach | Matteo Giordano et.al. | 2405.01372 | link |
2024-05-02 | DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines | Ye Tian et.al. | 2405.01248 | null |
2024-05-02 | Automated Virtual Product Placement and Assessment in Images using Diffusion Models | Mohammad Mahmudul Alam et.al. | 2405.01130 | null |
2024-05-02 | Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields | Yuhang Huang et.al. | 2405.00998 | null |
2024-05-02 | Generative manufacturing systems using diffusion models and ChatGPT | Xingyu Li et.al. | 2405.00958 | null |
2024-05-02 | EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion | Guangyao Zhai et.al. | 2405.00915 | null |
2024-05-01 | SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models | Burak Can Biner et.al. | 2405.00878 | null |
2024-05-01 | Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers | Palawat Busaranuvong et.al. | 2405.00858 | null |
2024-05-01 | TexSliders: Diffusion-Based Texture Editing in CLIP Space | Julia Guerrero-Viu et.al. | 2405.00672 | null |
2024-05-01 | RGB |
Zheng Zeng et.al. | 2405.00666 | null |
2024-05-01 | Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure | Assefa Seyoum Wahd et.al. | 2405.00631 | null |
2024-05-01 | Lane Segmentation Refinement with Diffusion Models | Antonio Ruiz et.al. | 2405.00620 | null |
2024-05-01 | Pricing and delta computation in jump-diffusion models with stochastic intensity by Malliavin calculus | Ayub Ahmadi et.al. | 2405.00473 | null |
2024-05-01 | Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable | Haozhe Liu et.al. | 2405.00466 | null |
2024-05-01 | Detail-Enhancing Framework for Reference-Based Image Super-Resolution | Zihan Wang et.al. | 2405.00431 | null |
2024-05-01 | Streamlining Image Editing with Layered Diffusion Brushes | Peyman Gholami et.al. | 2405.00313 | null |
2024-05-02 | An Unstructured Mesh Reaction-Drift-Diffusion Master Equation with Reversible Reactions | Samuel A. Isaacson et.al. | 2405.00283 | null |
2024-05-01 | ASAM: Boosting Segment Anything Model with Adversarial Tuning | Bo Li et.al. | 2405.00256 | link |
2024-04-30 | MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model | Wenxun Dai et.al. | 2404.19759 | link |
2024-04-30 | Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting | Paul Engstler et.al. | 2404.19758 | null |
2024-04-30 | Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation | Ian Dunn et.al. | 2404.19739 | link |
2024-04-30 | X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models | Emmanuelle Bourigault et.al. | 2404.19604 | null |
2024-04-30 | MicroDreamer: Zero-shot 3D Generation in |
Luxi Chen et.al. | 2404.19525 | link |
2024-04-30 | TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models | Teng Zhou et.al. | 2404.19475 | link |
2024-04-30 | Probing Unlearned Diffusion Models: A Transferable Adversarial Attack Perspective | Xiaoxuan Han et.al. | 2404.19382 | link |
2024-04-30 | Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model | Wentao Lei et.al. | 2404.19277 | null |
2024-04-30 | DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets | Xiaoyu Huang et.al. | 2404.19264 | null |
2024-04-30 | CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition | Jianzong Wang et.al. | 2404.19187 | null |
2024-04-29 | Stylus: Automatic Adapter Selection for Diffusion Models | Michael Luo et.al. | 2404.18928 | null |
2024-04-29 | TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation | Junhao Cheng et.al. | 2404.18919 | link |
2024-04-29 | Learning general Gaussian mixtures with efficient score matching | Sitan Chen et.al. | 2404.18893 | null |
2024-04-29 | A Survey on Diffusion Models for Time Series and Spatio-Temporal Data | Yiyuan Yang et.al. | 2404.18886 | link |
2024-04-29 | Learning Mixtures of Gaussians Using Diffusion Models | Khashayar Gatmiry et.al. | 2404.18869 | null |
2024-04-29 | Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Zhiyuan Li et.al. | 2404.18820 | link |
2024-04-29 | Bootstrap 3D Reconstructed Scenes from 3D Gaussian Splatting | Yifei Gao et.al. | 2404.18669 | null |
2024-04-29 | FlexiFilm: Long Video Generation with Flexible Conditions | Yichen Ouyang et.al. | 2404.18620 | link |
2024-04-29 | Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting | Tianyidan Xie et.al. | 2404.18598 | null |
2024-04-29 | U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models | Song Mei et.al. | 2404.18444 | null |
2024-04-26 | MaPa: Text-driven Photorealistic Material Painting for 3D Shapes | Shangzhan Zhang et.al. | 2404.17569 | null |
2024-04-26 | Chemotaxis-inspired PDE model for airborne infectious disease transmission: analysis and simulations | Pierluigi Colli et.al. | 2404.17506 | null |
2024-04-26 | Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation | Seungwook Kim et.al. | 2404.17419 | null |
2024-04-29 | MV-VTON: Multi-View Virtual Try-On with Diffusion Models | Haoyu Wang et.al. | 2404.17364 | link |
2024-04-26 | Simultaneous Tri-Modal Medical Image Fusion and Super-Resolution using Conditional Diffusion Model | Yushen Xu et.al. | 2404.17357 | link |
2024-04-26 | Trinity Detector:text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection | Jiawei Song et.al. | 2404.17254 | null |
2024-04-26 | Few-shot Calligraphy Style Learning | Fangda Chen et.al. | 2404.17199 | link |
2024-04-25 | CyNetDiff -- A Python Library for Accelerated Implementation of Network Diffusion Models | Eliot W. Robson et.al. | 2404.17059 | link |
2024-04-25 | Universal fragmentation in annihilation reactions with constrained kinetics | Enrique Rozas Garcia et.al. | 2404.16950 | null |
2024-04-25 | Inferring solid-state diffusivity in lithium-ion battery active materials: improving upon the classical GITT method | A. Emir Gumrukcuoglu et.al. | 2404.16658 | null |
2024-04-25 | MuseumMaker: Continual Style Customization without Catastrophic Forgetting | Chenxi Liu et.al. | 2404.16612 | null |
2024-04-25 | Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models | Parul Gupta et.al. | 2404.16556 | null |
2024-04-25 | DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference | Zhihao Shuai et.al. | 2404.16474 | null |
2024-04-25 | TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models | Haomiao Ni et.al. | 2404.16306 | link |
2024-04-25 | CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions | Haoyuan Li et.al. | 2404.16302 | link |
2024-04-25 | One Noise to Rule Them All: Learning a Unified Model of Spatially-Varying Noise Patterns | Arman Maesumi et.al. | 2404.16292 | null |
2024-04-24 | Editable Image Elements for Controllable Synthesis | Jiteng Mu et.al. | 2404.16029 | null |
2024-04-24 | RetinaRegNet: A Versatile Approach for Retinal Image Registration | Vishal Balaji Sivaraman et.al. | 2404.16017 | link |
2024-04-24 | MYCloth: Towards Intelligent and Interactive Online T-Shirt Customization based on User's Preference | Yexin Liu et.al. | 2404.15801 | null |
2024-04-24 | Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models | Xu Shen et.al. | 2404.15625 | null |
2024-04-24 | A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution | Zhixiong Yang et.al. | 2404.15620 | link |
2024-04-23 | ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning | Weifeng Chen et.al. | 2404.15449 | null |
2024-04-23 | GLoD: Composing Global Contexts and Local Details in Image Generation | Moyuru Yamada et.al. | 2404.15447 | null |
2024-04-23 | ControlTraj: Controllable Trajectory Generation with Topology-Constrained Diffusion Model | Yuanshao Zhu et.al. | 2404.15380 | null |
2024-04-23 | Heat flow, log-concavity, and Lipschitz transport maps | Giovanni Brigati et.al. | 2404.15205 | null |
2024-04-23 | CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method | Mingbao Lin et.al. | 2404.15141 | link |
2024-04-23 | Taming Diffusion Probabilistic Models for Character Control | Rui Chen et.al. | 2404.15121 | null |
2024-04-23 | Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models | Jingyao Xu et.al. | 2404.15081 | link |
2024-04-23 | Music Style Transfer With Diffusion Model | Hong Huang et.al. | 2404.14771 | null |
2024-04-23 | Gradient Guidance for Diffusion Models: An Optimization Perspective | Yingqing Guo et.al. | 2404.14743 | link |
2024-04-23 | FlashSpeech: Efficient Zero-Shot Speech Synthesis | Zhen Ye et.al. | 2404.14700 | null |
2024-04-23 | DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance | Linxuan Xin et.al. | 2404.14676 | null |
2024-04-22 | UVMap-ID: A Controllable and Personalized UV Map Generative Model | Weijie Wang et.al. | 2404.14568 | link |
2024-04-22 | Align Your Steps: Optimizing Sampling Schedules in Diffusion Models | Amirmojtaba Sabour et.al. | 2404.14507 | null |
2024-04-22 | Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses | Inhee Lee et.al. | 2404.14410 | null |
2024-04-22 | GeoDiffuser: Geometry-Based Image Editing with Diffusion Models | Rahul Sajnani et.al. | 2404.14403 | null |
2024-04-22 | TAVGBench: Benchmarking Text to Audible-Video Generation | Yuxin Mao et.al. | 2404.14381 | link |
2024-04-22 | Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion | Alexander Shmakov et.al. | 2404.14332 | null |
2024-04-22 | X-Ray: A Sequential 3D Representation for Generation | Tao Hu et.al. | 2404.14329 | link |
2024-04-22 | Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity | Yu Hou et.al. | 2404.14240 | link |
2024-04-22 | MultiBooth: Towards Generating All Your Concepts in an Image from Text | Chenyang Zhu et.al. | 2404.14239 | link |
2024-04-22 | Face2Face: Label-driven Facial Retouching Restoration | Guanhua Zhao et.al. | 2404.14177 | null |
2024-04-22 | FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on | Chenhui Wang et.al. | 2404.14162 | null |
2024-04-22 | Generative Artificial Intelligence Assisted Wireless Sensing: Human Flow Detection in Practical Communication Environments | Jiacheng Wang et.al. | 2404.14140 | null |
2024-04-19 | Analysis of Classifier-Free Guidance Weight Schedulers | Xi Wang et.al. | 2404.13040 | null |
2024-04-19 | RadRotator: 3D Rotation of Radiographs with Diffusion Models | Pouria Rouzrokh et.al. | 2404.13000 | null |
2024-04-19 | Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics | Xiaofei Wang et.al. | 2404.12973 | null |
2024-04-19 | Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling | Grigory Bartosh et.al. | 2404.12940 | null |
2024-04-19 | Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models | Konstantinos Vilouras et.al. | 2404.12920 | null |
2024-04-19 | Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images | Santosh et.al. | 2404.12908 | link |
2024-04-19 | ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model | Dingming Liu et.al. | 2404.12903 | null |
2024-04-19 | Training-and-prompt-free General Painterly Harmonization Using Image-wise Attention Sharing | Teng-Fang Hsiao et.al. | 2404.12900 | link |
2024-04-19 | MCM: Multi-condition Motion Synthesis Framework | Zeyu Ling et.al. | 2404.12886 | null |
2024-04-19 | Detecting Out-Of-Distribution Earth Observation Images with Diffusion Models | Georges Le Bellier et.al. | 2404.12667 | null |
2024-04-18 | G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis | Yufei Ye et.al. | 2404.12383 | null |
2024-04-18 | Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models | Trevor J. Chan et.al. | 2404.12361 | null |
2024-04-18 | AniClipart: Clipart Animation with Text-to-Video Priors | Ronghuan Wu et.al. | 2404.12347 | null |
2024-04-18 | Guided Discrete Diffusion for Electronic Health Record Generation | Zixiang Chen et.al. | 2404.12314 | null |
2024-04-18 | StyleBooth: Image Style Editing with Multimodal Instruction | Zhen Han et.al. | 2404.12154 | link |
2024-04-18 | LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights | Thibault Castells et.al. | 2404.11936 | null |
2024-04-18 | FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models | Wei Wu et.al. | 2404.11895 | link |
2024-04-17 | Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning | Marzi Heidari et.al. | 2404.11795 | null |
2024-04-17 | Diffusion Schrödinger Bridge Models for High-Quality MR-to-CT Synthesis for Head and Neck Proton Treatment Planning | Muheng Li et.al. | 2404.11741 | null |
2024-04-17 | Factorized Diffusion: Perceptual Illusions by Noise Decomposition | Daniel Geng et.al. | 2404.11615 | null |
2024-04-17 | IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination | Xi Chen et.al. | 2404.11593 | null |
2024-04-17 | Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding | Zezhong Fan et.al. | 2404.11589 | null |
2024-04-17 | MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation | Kuan-Chieh et.al. | 2404.11565 | null |
2024-04-17 | Predicting Long-horizon Futures by Conditioning on Geometry and Time | Tarasha Khurana et.al. | 2404.11554 | null |
2024-04-17 | SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening | Yu Zhong et.al. | 2404.11537 | null |
2024-04-17 | Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt | Zhanjie Zhang et.al. | 2404.11474 | link |
2024-04-17 | Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption | Buzhen Huang et.al. | 2404.11291 | link |
2024-04-17 | Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case | João Gabriel Vinholi et.al. | 2404.11243 | null |
2024-04-17 | RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models | Han Huang et.al. | 2404.11199 | link |
2024-04-16 | RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting | Ashkan Mirzaei et.al. | 2404.10765 | null |
2024-04-16 | LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? | Yuchi Wang et.al. | 2404.10763 | link |
2024-04-16 | GazeHTA: End-to-end Gaze Target Detection with Head-Target Association | Zhi-Yi Lin et.al. | 2404.10718 | null |
2024-04-16 | Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution | Yutao Yuan et.al. | 2404.10688 | link |
2024-04-16 | Generating Human Interaction Motions in Scenes with Text Control | Hongwei Yi et.al. | 2404.10685 | null |
2024-04-16 | StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization | Yingshu Chen et.al. | 2404.10681 | null |
2024-04-16 | Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay | Jinmei Liu et.al. | 2404.10662 | link |
2024-04-16 | Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences | Seungwook Kim et.al. | 2404.10603 | null |
2024-04-17 | Do Counterfactual Examples Complicate Adversarial Training? | Eric Yeats et.al. | 2404.10588 | null |
2024-04-17 | AAVDiff: Experimental Validation of Enhanced Viability and Diversity in Recombinant Adeno-Associated Virus (AAV) Capsids through Diffusion Generation | Lijun Liu et.al. | 2404.10573 | null |
2024-04-15 | Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement | Wenyi Lian et.al. | 2404.09735 | link |
2024-04-15 | Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Ziwei Luo et.al. | 2404.09732 | link |
2024-04-15 | All-in-one simulation-based inference | Manuel Gloeckler et.al. | 2404.09636 | link |
2024-04-15 | TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models | Haojun Sun et.al. | 2404.09532 | null |
2024-04-15 | Magic Clothing: Controllable Garment-Driven Image Synthesis | Weifeng Chen et.al. | 2404.09512 | link |
2024-04-15 | PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI | Yandan Yang et.al. | 2404.09465 | null |
2024-04-15 | Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models | Peifei Zhu et.al. | 2404.09401 | null |
2024-04-14 | Fault Detection in Mobile Networks Using Diffusion Models | Mohamad Nabeel et.al. | 2404.09240 | null |
2024-04-14 | DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling | Xuening Yuan et.al. | 2404.09227 | null |
2024-04-16 | LoopAnimate: Loopable Salient Object Animation | Fanyi Wang et.al. | 2404.09172 | null |
2024-04-12 | Lossy Image Compression with Foundation Diffusion Models | Lucas Relic et.al. | 2404.08580 | null |
2024-04-12 | PiRD: Physics-informed Residual Diffusion for Flow Field Reconstruction | Siming Shan et.al. | 2404.08412 | null |
2024-04-12 | Struggle with Adversarial Defense? Try Diffusion | Yujie Li et.al. | 2404.08273 | link |
2024-04-12 | Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models | Zeyu Yang et.al. | 2404.08254 | link |
2024-04-12 | Interest Maximization in Social Networks | Rahul Kumar Gautam et.al. | 2404.08236 | null |
2024-04-11 | ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback | Ming Li et.al. | 2404.07987 | link |
2024-04-11 | Taming Stable Diffusion for Text to 360° Panorama Image Generation | Cheng Zhang et.al. | 2404.07949 | link |
2024-04-11 | Adaptive Hyperbolic-cross-space Mapped Jacobi Method on Unbounded Domains with Applications to Solving Multidimensional Spatiotemporal Integrodifferential Equations | Yunhong Deng et.al. | 2404.07844 | null |
2024-04-11 | ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model | Lifan Jiang et.al. | 2404.07773 | link |
2024-04-11 | An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization | Minshuo Chen et.al. | 2404.07771 | null |
2024-04-11 | Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations | Yufeng Yue et.al. | 2404.07770 | null |
2024-04-11 | Diffusing in Someone Else's Shoes: Robotic Perspective Taking with Diffusion | Josua Spisak et.al. | 2404.07735 | null |
2024-04-11 | Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models | Tuomas Kynkäänniemi et.al. | 2404.07724 | link |
2024-04-11 | Implicit and Explicit Language Guidance for Diffusion-based Visual Perception | Hefeng Wang et.al. | 2404.07600 | null |
2024-04-11 | ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation | Stanislav Frolov et.al. | 2404.07564 | null |
2024-04-10 | GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models | Zewei Zhang et.al. | 2404.07206 | null |
2024-04-10 | RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion | Jaidev Shriram et.al. | 2404.07199 | null |
2024-04-10 | InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models | Jiale Xu et.al. | 2404.07191 | link |
2024-04-10 | Move Anything with Layered Scene Diffusion | Jiawei Ren et.al. | 2404.07178 | null |
2024-04-10 | Diffusion-based inpainting of incomplete Euclidean distance matrices of trajectories generated by a fractional Brownian motion | Alexander Lobashev et.al. | 2404.07029 | link |
2024-04-10 | DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting | Shijie Zhou et.al. | 2404.06903 | null |
2024-04-10 | Fine color guidance in diffusion models and its application to image compression at extremely low bitrates | Tom Bordin et.al. | 2404.06865 | null |
2024-04-10 | UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion | Junsheng Zhou et.al. | 2404.06851 | null |
2024-04-10 | Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer | Yanqi Ge et.al. | 2404.06835 | null |
2024-04-10 | Zero-shot Point Cloud Completion Via 2D Priors | Tianxin Huang et.al. | 2404.06814 | null |
2024-04-09 | GeoDirDock: Guiding Docking Along Geodesic Paths | Raúl Miñán et.al. | 2404.06481 | null |
2024-04-09 | Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion | Fan Yang et.al. | 2404.06429 | link |
2024-04-09 | ZeST: Zero-Shot Material Transfer from a Single Image | Ta-Ying Cheng et.al. | 2404.06425 | null |
2024-04-09 | Policy-Guided Diffusion | Matthew Thomas Jackson et.al. | 2404.06356 | link |
2024-04-09 | Quantum State Generation with Structure-Preserving Diffusion Model | Yuchen Zhu et.al. | 2404.06336 | null |
2024-04-09 | DiffHarmony: Latent Diffusion Model Meets Image Harmonization | Pengfei Zhou et.al. | 2404.06139 | link |
2024-04-09 | Hash3D: Training-free Acceleration for 3D Generation | Xingyi Yang et.al. | 2404.06091 | link |
2024-04-09 | Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data | Kai Luan et.al. | 2404.06012 | null |
2024-04-09 | Tackling Structural Hallucination in Image Translation with Local Diffusion | Seunghoi Kim et.al. | 2404.05980 | link |
2024-04-09 | Map Optical Properties to Subwavelength Structures Directly via a Diffusion Model | Shijie Rao et.al. | 2404.05959 | null |
2024-04-08 | MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Kunpeng Song et.al. | 2404.05674 | link |
2024-04-08 | YaART: Yet Another ART Rendering Technology | Sergey Kastryulin et.al. | 2404.05666 | null |
2024-04-08 | BinaryDM: Towards Accurate Binarization of Diffusion Model | Xingyu Zheng et.al. | 2404.05662 | link |
2024-04-08 | Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model | Jichang Yang et.al. | 2404.05648 | link |
2024-04-08 | Learning a Category-level Object Pose Estimator without Pose Annotations | Fengrui Tian et.al. | 2404.05626 | null |
2024-04-08 | UniFL: Improve Stable Diffusion via Unified Feedback Learning | Jiacheng Zhang et.al. | 2404.05595 | null |
2024-04-08 | Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models | Saman Motamed et.al. | 2404.05519 | null |
2024-04-08 | Taming Transformers for Realistic Lidar Point Cloud Generation | Hamed Haghighi et.al. | 2404.05505 | link |
2024-04-08 | Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance | Dazhong Shen et.al. | 2404.05384 | link |
2024-04-08 | Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt | Zhiqi Huang et.al. | 2404.05331 | null |
2024-04-05 | Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models | Sangwon Jang et.al. | 2404.04243 | null |
2024-04-05 | ToolEENet: Tool Affordance 6D Pose Estimation | Yunlong Wang et.al. | 2404.04193 | null |
2024-04-05 | Dynamic Prompt Optimizing for Text-to-Image Generation | Wenyi Mo et.al. | 2404.04095 | link |
2024-04-05 | Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation | Mingyuan Zhou et.al. | 2404.04057 | link |
2024-04-05 | Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models | Gihyun Kwon et.al. | 2404.03913 | null |
2024-04-04 | MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation | Hanzhe Hu et.al. | 2404.03656 | null |
2024-04-04 | CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Dongzhi Jiang et.al. | 2404.03653 | link |
2024-04-04 | The More You See in 2D, the More You Perceive in 3D | Xinyang Han et.al. | 2404.03652 | null |
2024-04-04 | DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior | Yiming Zhang et.al. | 2404.03642 | null |
2024-04-04 | LCM-Lookahead for Encoder-based Text-to-Image Personalization | Rinon Gal et.al. | 2404.03620 | null |
2024-04-04 | DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images | Zhou Jie et.al. | 2404.03595 | link |
2024-04-04 | PointInfinity: Resolution-Invariant Point Diffusion Models | Zixuan Huang et.al. | 2404.03566 | null |
2024-04-04 | Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models | Siyuan Mei et.al. | 2404.03541 | null |
2024-04-04 | A Directional Diffusion Graph Transformer for Recommendation | Zixuan Yi et.al. | 2404.03326 | null |
2024-04-04 | SiloFuse: Cross-silo Synthetic Data Generation with Latent Tabular Diffusion Models | Aditya Shankar et.al. | 2404.03299 | null |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903 | link |
2024-04-03 | Fast Diffusion Model For Seismic Data Noise Attenuation | Junheng Peng et.al. | 2404.02767 | null |
2024-04-03 | Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models | Wentian Zhang et.al. | 2404.02747 | link |
2024-04-03 | Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition | Behrooz Razeghi et.al. | 2404.02696 | null |
2024-04-03 | Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models | Matteo Pennisi et.al. | 2404.02618 | null |
2024-04-03 | A Unified Editing Method for Co-Speech Gesture Generation via Diffusion Inversion | Zeyu Zhao et.al. | 2404.02411 | null |
2024-04-03 | Enhancing Diffusion-based Point Cloud Generation with Smoothness Constraint | Yukun Li et.al. | 2404.02396 | null |
2024-04-02 | Semantic Augmentation in Images using Language | Sahiti Yerramilli et.al. | 2404.02353 | null |
2024-04-02 | Heat Death of Generative Models in Closed-Loop Learning | Matteo Marchi et.al. | 2404.02325 | null |
2024-04-02 | APEX: Ambidextrous Dual-Arm Robotic Manipulation Using Collision-Free Generative Diffusion Models | Apan Dastider et.al. | 2404.02284 | null |
2024-04-02 | Diffusion |
Zeyu Yang et.al. | 2404.02148 | link |
2024-04-02 | WcDT: World-centric Diffusion Transformer for Traffic Scene Generation | Chen Yang et.al. | 2404.02082 | link |
2024-04-03 | AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design | Xinze Li et.al. | 2404.02003 | null |
2024-04-02 | Bi-LORA: A Vision-Language Approach for Synthetic Image Detection | Mamadou Keita et.al. | 2404.01959 | link |
2024-04-02 | Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model | Xu He et.al. | 2404.01862 | link |
2024-04-02 | Upsample Guidance: Scale Up Diffusion Models without Training | Juno Hwang et.al. | 2404.01709 | null |
2024-04-02 | FashionEngine: Interactive Generation and Editing of 3D Clothed Humans | Tao Hu et.al. | 2404.01655 | null |
2024-04-02 | Diffusion Deepfake | Chaitali Bhattacharyya et.al. | 2404.01579 | null |
2024-04-01 | Prior Frequency Guided Diffusion Model for Limited Angle (LA)-CBCT Reconstruction | Jiacheng Xie et.al. | 2404.01448 | null |
2024-04-01 | DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery | Yixuan Zhu et.al. | 2404.01424 | link |
2024-03-29 | Relation Rectification in Diffusion Model | Yinwei Wu et.al. | 2403.20249 | null |
2024-03-29 | Motion Inversion for Video Customization | Luozhou Wang et.al. | 2403.20193 | null |
2024-03-29 | FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models | Barbara Toniella Corradini et.al. | 2403.20105 | null |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | Probing solar modulation analytic models with cosmic ray periodic spectra | Wei-Cheng Long et.al. | 2403.20038 | null |
2024-04-01 | Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting | Haipeng Liu et.al. | 2403.19898 | link |
2024-03-28 | Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks | Pooria Ashrafian et.al. | 2403.19880 | link |
2024-03-28 | ShapeFusion: A 3D diffusion model for localized shape editing | Rolandos Alexandros Potamias et.al. | 2403.19773 | null |
2024-03-28 | Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond | Katherine Xu et.al. | 2403.19653 | link |
2024-03-28 | InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Sirui Xu et.al. | 2403.19652 | null |
2024-03-28 | GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models | Yusuf Dalva et.al. | 2403.19645 | null |
2024-03-28 | In the driver's mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles | Samir H. A. Mohammad et.al. | 2403.19637 | null |
2024-03-28 | Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model | Zhicai Wang et.al. | 2403.19600 | link |
2024-03-28 | Frame by Familiar Frame: Understanding Replication in Video Diffusion Models | Aimon Rahman et.al. | 2403.19593 | null |
2024-03-28 | Impact of Resin Molecular Weight on Drying Kinetics and Sag of Coatings | Marola W. Issa et.al. | 2403.19544 | null |
2024-03-28 | Debiasing Cardiac Imaging with Controlled Latent Diffusion Models | Grzegorz Skorupko et.al. | 2403.19508 | link |
2024-03-28 | Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality | Kyotaro Tokoro et.al. | 2403.19428 | link |
2024-03-28 | Imperceptible Protection against Style Imitation from Diffusion Models | Namhyuk Ahn et.al. | 2403.19254 | null |
2024-03-27 | ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion | Daniel Winter et.al. | 2403.18818 | null |
2024-03-28 | ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation | Suraj Patni et.al. | 2403.18807 | link |
2024-03-27 | Object Pose Estimation via the Aggregation of Diffusion Features | Tianfu Wang et.al. | 2403.18791 | link |
2024-03-27 | ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object | Chenshuang Zhang et.al. | 2403.18775 | link |
2024-03-27 | A Diffusion-Based Generative Equalizer for Music Restoration | Eloi Moliner et.al. | 2403.18636 | link |
2024-03-27 | HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions | Hao Xu et.al. | 2403.18575 | link |
2024-03-27 | Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning -- A Review | Mohammadreza Amirian et.al. | 2403.18565 | null |
2024-03-27 | CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection | Jiayi Zhu et.al. | 2403.18554 | null |
2024-03-27 | CT-3DFlow : Leveraging 3D Normalizing Flows for Unsupervised Detection of Pathological Pulmonary CT scans | Aissam Djahnine et.al. | 2403.18514 | null |
2024-03-27 | Synthesizing EEG Signals from Event-Related Potential Paradigms with Conditional Diffusion Models | Guido Klein et.al. | 2403.18486 | link |
2024-03-26 | AID: Attention Interpolation of Text-to-Image Diffusion | Qiyuan He et.al. | 2403.17924 | link |
2024-03-26 | Boosting Diffusion Models with Moving Average Sampling in Frequency Domain | Yurui Qian et.al. | 2403.17870 | null |
2024-03-26 | DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions | Sammy Christen et.al. | 2403.17827 | null |
2024-03-26 | Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields | Rüveyda Yilmaz et.al. | 2403.17808 | link |
2024-03-26 | GenesisTex: Adapting Image Denoising Diffusion to Texture Space | Chenjian Gao et.al. | 2403.17782 | null |
2024-03-26 | CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation | Yongrui Yu et.al. | 2403.17770 | null |
2024-03-26 | AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation | Huawei Wei et.al. | 2403.17694 | link |
2024-03-26 | Manifold-Guided Lyapunov Control with Diffusion Models | Amartya Mukherjee et.al. | 2403.17692 | link |
2024-03-26 | Not All Similarities Are Created Equal: Leveraging Data-Driven Biases to Inform GenAI Copyright Disputes | Uri Hacohen et.al. | 2403.17691 | null |
2024-03-26 | DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation | Qilin Wang et.al. | 2403.17664 | null |
2024-03-25 | SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions | Yuda Song et.al. | 2403.16627 | link |
2024-03-25 | SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation | Aysim Toker et.al. | 2403.16605 | null |
2024-03-25 | Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization | Xiangxin Zhou et.al. | 2403.16576 | null |
2024-03-25 | An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models | Zizhao Hu et.al. | 2403.16530 | null |
2024-03-25 | Let Real Images be as a Judger, Spotting Fake Images Synthesized with Generative Models | Ziyou Liang et.al. | 2403.16513 | null |
2024-03-25 | Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework | Ziyao Huang et.al. | 2403.16510 | link |
2024-03-25 | Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation | Sanyam Lakhanpal et.al. | 2403.16422 | null |
2024-03-25 | FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models | Lin Zhao et.al. | 2403.16379 | null |
2024-03-24 | Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis | Atefeh Khoshkhahtinat et.al. | 2403.16258 | null |
2024-03-24 | Skull-to-Face: Anatomy-Guided 3D Facial Reconstruction and Editing | Yongqing Liang et.al. | 2403.16207 | null |
2024-03-22 | DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data | Hanrong Ye et.al. | 2403.15389 | null |
2024-03-22 | Ultrasound Imaging based on the Variance of a Diffusion Restoration Model | Yuxin Zhang et.al. | 2403.15316 | link |
2024-03-22 | Controlled Training Data Generation with Diffusion Models | Teresa Yeo et.al. | 2403.15309 | null |
2024-03-22 | Spectral Motion Alignment for Video Motion Transfer using Diffusion Models | Geon Yeong Park et.al. | 2403.15249 | null |
2024-03-22 | Shadow Generation for Composite Image Using Diffusion model | Qingyang Liu et.al. | 2403.15234 | link |
2024-03-22 | MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration | Zhichao Wei et.al. | 2403.15059 | null |
2024-03-22 | Toward Tiny and High-quality Facial Makeup with Data Amplify Learning | Qiaoqiao Jin et.al. | 2403.15033 | null |
2024-03-22 | Dynamics of a memory-based diffusion model with spatial heterogeneity and nonlinear boundary condition | Quanli Ji et.al. | 2403.14969 | null |
2024-03-22 | DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow | Kyungmin Lee et.al. | 2403.14966 | null |
2024-03-22 | CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model | Seungdae Han et.al. | 2403.14944 | link |
2024-03-21 | GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation | Yinghao Xu et.al. | 2403.14621 | link |
2024-03-21 | DreamReward: Text-to-3D Generation with Human Preference | Junliang Ye et.al. | 2403.14613 | null |
2024-03-21 | ReNoise: Real Image Inversion Through Iterative Noising | Daniel Garibi et.al. | 2403.14602 | null |
2024-03-21 | Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting | Alicia Durrer et.al. | 2403.14499 | link |
2024-03-21 | Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation | Mathias Öttl et.al. | 2403.14429 | null |
2024-03-21 | DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning | Jonathan Lebensold et.al. | 2403.14421 | link |
2024-03-21 | Physics-Informed Diffusion Models | Jan-Hendrik Bastek et.al. | 2403.14404 | link |
2024-03-21 | Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Pablo Marcos-Manchón et.al. | 2403.14291 | link |
2024-03-21 | Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation | Francesco Di Felice et.al. | 2403.14279 | null |
2024-03-21 | Diffusion Models with Ensembled Structure-Based Anomaly Scoring for Unsupervised Anomaly Detection | Finn Behrendt et.al. | 2403.14262 | link |
2024-03-20 | Editing Massive Concepts in Text-to-Image Diffusion Models | Tianwei Xiong et.al. | 2403.13807 | link |
2024-03-20 | ZigMa: Zigzag Mamba Diffusion Model | Vincent Tao Hu et.al. | 2403.13802 | link |
2024-03-20 | TimeRewind: Rewinding Time with Image-and-Events Video Diffusion | Jingxi Chen et.al. | 2403.13800 | null |
2024-03-20 | DepthFM: Fast Monocular Depth Estimation with Flow Matching | Ming Gui et.al. | 2403.13788 | link |
2024-03-20 | Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation | Fu-Yun Wang et.al. | 2403.13745 | link |
2024-03-20 | DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance | Zixuan Wang et.al. | 2403.13667 | link |
2024-03-20 | ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer | Hiroki Azuma et.al. | 2403.13652 | link |
2024-03-20 | ReGround: Improving Textual and Spatial Grounding at No Cost | Yuseung Lee et.al. | 2403.13589 | null |
2024-03-20 | Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing | Hangeol Chang et.al. | 2403.13551 | link |
2024-03-20 | Compress3D: a Compressed Latent Space for 3D Generation from a Single Image | Bowen Zhang et.al. | 2403.13524 | null |
2024-03-19 | FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis | Linjiang Huang et.al. | 2403.12963 | link |
2024-03-19 | FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation | Shuai Yang et.al. | 2403.12962 | link |
2024-03-19 | Zero-Reference Low-Light Enhancement via Physical Quadruple Priors | Wenjing Wang et.al. | 2403.12933 | null |
2024-03-19 | Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model | Jiajie Yang et.al. | 2403.12915 | link |
2024-03-19 | D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation | Jun Yamada et.al. | 2403.12861 | null |
2024-03-19 | Generative Enhancement for 3D Medical Images | Lingting Zhu et.al. | 2403.12852 | link |
2024-03-19 | Compositional 3D Scene Synthesis with Scene Graph Guided Layout-Shape Generation | Yao Wei et.al. | 2403.12848 | null |
2024-03-19 | DreamDA: Generative Data Augmentation with Diffusion Models | Yunxiang Fu et.al. | 2403.12803 | link |
2024-03-19 | WaveFace: Authentic Face Restoration with Efficient Frequency Recovery | Yunqi Miao et.al. | 2403.12760 | null |
2024-03-19 | Towards Controllable Face Generation with Semantic Latent Diffusion Models | Alex Ergasti et.al. | 2403.12743 | link |
2024-03-18 | Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models | Emilian Postolache et.al. | 2403.11706 | link |
2024-03-19 | Urban Scene Diffusion through Semantic Occupancy Map | Junge Zhang et.al. | 2403.11697 | null |
2024-03-18 | Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection | Julia Wolleb et.al. | 2403.11667 | link |
2024-03-18 | Arc2Face: A Foundation Model of Human Faces | Foivos Paraperas Papantoniou et.al. | 2403.11641 | link |
2024-03-18 | LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models | Yang Yang et.al. | 2403.11627 | link |
2024-03-18 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614 | link |
2024-03-18 | EffiVED:Efficient Video Editing via Text-instruction Diffusion Models | Zhenghao Zhang et.al. | 2403.11568 | link |
2024-03-18 | EchoReel: Enhancing Action Generation of Existing Video Diffusion Models | Jianzhi liu et.al. | 2403.11535 | link |
2024-03-18 | Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors | Ruicheng Wang et.al. | 2403.11503 | null |
2024-03-18 | SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction | Shuang Wang et.al. | 2403.11482 | link |
2024-03-15 | Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives | Ronghui Li et.al. | 2403.10518 | link |
2024-03-15 | Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding | Pengkun Liu et.al. | 2403.10395 | link |
2024-03-15 | Denoising Task Difficulty-based Curriculum for Training Diffusion Models | Jin-Young Kim et.al. | 2403.10348 | null |
2024-03-15 | Optimal Control of Stationary Doubly Diffusive Flows on Two and Three Dimensional Bounded Lipschitz Domains: Numerical Analysis | Jai Tushar et.al. | 2403.10282 | null |
2024-03-15 | Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder | Jinseok Kim et.al. | 2403.10255 | null |
2024-03-15 | FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model | Qijun Feng et.al. | 2403.10242 | null |
2024-03-15 | BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution | Feng Li et.al. | 2403.10211 | link |
2024-03-15 | Spectral CT Two-step and One-step Material Decomposition using Diffusion Posterior Sampling | Corentin Vazia et.al. | 2403.10183 | null |
2024-03-15 | Animate Your Motion: Turning Still Images into Dynamic Videos | Mingxiao Li et.al. | 2403.10179 | null |
2024-03-15 | Being heterogeneous is disadvantageous: Brownian non-Gaussian searches | Vittoria Sposini et.al. | 2403.10138 | null |
2024-03-14 | SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior | Huan-ang Gao et.al. | 2403.09638 | null |
2024-03-14 | 3D-VLA: A 3D Vision-Language-Action Generative World Model | Haoyu Zhen et.al. | 2403.09631 | null |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630 | link |
2024-03-14 | Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation | Fangfu Liu et.al. | 2403.09625 | null |
2024-03-14 | Score-Guided Diffusion for 3D Human Recovery | Anastasis Stathopoulos et.al. | 2403.09623 | link |
2024-03-14 | Explore In-Context Segmentation via Latent Diffusion Models | Chaoyang Wang et.al. | 2403.09616 | null |
2024-03-14 | MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | Zunnan Xu et.al. | 2403.09471 | link |
2024-03-14 | Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing | Wonjun Kang et.al. | 2403.09468 | link |
2024-03-14 | Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk | Zhangheng Li et.al. | 2403.09450 | link |
2024-03-14 | 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation | Frank Zhang et.al. | 2403.09439 | null |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764 | null |
2024-03-13 | Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08758 | null |
2024-03-13 | Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08749 | null |
2024-03-14 | GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Jing Wu et.al. | 2403.08733 | link |
2024-03-13 | Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data | Asad Aali et.al. | 2403.08728 | link |
2024-03-13 | Data Augmentation in Human-Centric Vision | Wentao Jiang et.al. | 2403.08650 | null |
2024-03-13 | ActionDiffusion: An Action-aware Diffusion Model for Procedure Planning in Instructional Videos | Lei Shi et.al. | 2403.08591 | null |
2024-03-13 | Federated Knowledge Graph Unlearning via Diffusion Model | Bingchen Liu et.al. | 2403.08554 | null |
2024-03-13 | Model Will Tell: Training Membership Inference for Diffusion Models | Xiaomeng Fu et.al. | 2403.08487 | null |
2024-03-13 | MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction | Linjie Fu et.al. | 2403.08479 | link |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860 | link |
2024-03-12 | Quantifying and Mitigating Privacy Risks for Tabular Generative Models | Chaoyi Zhu et.al. | 2403.07842 | null |
2024-03-12 | MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model | Guibo Luo et.al. | 2403.07838 | null |
2024-03-13 | SemCity: Semantic Scene Generation with Triplane Diffusion | Jumin Lee et.al. | 2403.07773 | link |
2024-03-12 | Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model | Yuxuan Zhang et.al. | 2403.07764 | link |
2024-03-12 | SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces | Yuta Oshima et.al. | 2403.07711 | link |
2024-03-12 | Visual Privacy Auditing with Diffusion Models | Kristian Schwethelm et.al. | 2403.07588 | null |
2024-03-12 | D4D: An RGBD diffusion model to boost monocular depth estimation | L. Papa et.al. | 2403.07516 | link |
2024-03-12 | Block-wise LoRA: Revisiting Fine-grained LoRA for Effective Personalization and Stylization in Text-to-Image Generation | Likun Li et.al. | 2403.07500 | null |
2024-03-12 | Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models | Phuong Dam et.al. | 2403.07371 | null |
2024-03-11 | BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Xuan Ju et.al. | 2403.06976 | link |
2024-03-11 | Bayesian Diffusion Models for 3D Shape Reconstruction | Haiyang Xu et.al. | 2403.06973 | null |
2024-03-11 | POD-ROM methods: from a finite set of snapshots to continuous-in-time approximations | Bosco Garcia-Archilla et.al. | 2403.06967 | null |
2024-03-11 | SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data | Jialu Li et.al. | 2403.06952 | null |
2024-03-12 | DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations | Tianhao Qi et.al. | 2403.06951 | link |
2024-03-11 | Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction | Qing Xiao et.al. | 2403.06940 | null |
2024-03-11 | Estimation of parameters and local times in a discretely observed threshold diffusion model | Sara Mazzonetto et.al. | 2403.06858 | null |
2024-03-11 | Multistep Consistency Models | Jonathan Heek et.al. | 2403.06807 | null |
2024-03-11 | Distribution-Aware Data Expansion with Diffusion Models | Haowei Zhu et.al. | 2403.06741 | link |
2024-03-11 | V3D: Video Diffusion Models are Effective 3D Generators | Zilong Chen et.al. | 2403.06738 | link |
2024-03-08 | VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models | Yabo Zhang et.al. | 2403.05438 | link |
2024-03-08 | DiffSF: Diffusion Models for Scene Flow Estimation | Yushan Zhang et.al. | 2403.05327 | link |
2024-03-08 | Noise Level Adaptive Diffusion Model for Robust Reconstruction of Accelerated MRI | Shoujin Huang et.al. | 2403.05245 | link |
2024-03-08 | Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation | Junyan Wang et.al. | 2403.05239 | null |
2024-03-08 | Denoising Autoregressive Representation Learning | Yazhe Li et.al. | 2403.05196 | null |
2024-03-08 | DiffuLT: How to Make Diffusion Model Useful for Long-tail Recognition | Jie Shao et.al. | 2403.05170 | null |
2024-03-08 | GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting | Francesco Palandra et.al. | 2403.05154 | null |
2024-03-08 | Improving Diffusion Models for Virtual Try-on | Yisol Choi et.al. | 2403.05139 | link |
2024-03-08 | ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | Xiwei Hu et.al. | 2403.05135 | null |
2024-03-08 | CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion | Wendi Zheng et.al. | 2403.05121 | null |
2024-03-07 | ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes | Hashmat Shadab Malik et.al. | 2403.04701 | link |
2024-03-07 | Delving into the Trajectory Long-tail Distribution for Muti-object Tracking | Sijia Chen et.al. | 2403.04700 | link |
2024-03-07 | PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Junsong Chen et.al. | 2403.04692 | link |
2024-03-08 | Pix2Gif: Motion-Guided Diffusion for GIF Generation | Hitesh Kandala et.al. | 2403.04634 | link |
2024-03-07 | A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images | Cristiana Tiago et.al. | 2403.04612 | null |
2024-03-07 | Anatomy-Guided Surface Diffusion Model for Alzheimer's Disease Normative Modeling | Jianwei Zhang et.al. | 2403.04531 | null |
2024-03-07 | Effect of turbulent diffusion in modeling anaerobic digestion | Jeremy Z. Yan et.al. | 2403.04457 | null |
2024-03-07 | Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser | Qingyuan Cai et.al. | 2403.04444 | link |
2024-03-07 | StableDrag: Stable Dragging for Point-based Image Editing | Yutao Cui et.al. | 2403.04437 | null |
2024-03-07 | On-demand Quantization for Green Federated Generative Diffusion in Mobile Edge Networks | Bingkun Lai et.al. | 2403.04430 | null |
2024-03-06 | GUIDE: Guidance-based Incremental Learning with Diffusion Models | Bartosz Cywiński et.al. | 2403.03938 | link |
2024-03-06 | Latent Dataset Distillation with Diffusion Models | Brian B. Moser et.al. | 2403.03881 | null |
2024-03-06 | Accelerating Convergence of Score-Based Diffusion Models, Provably | Gen Li et.al. | 2403.03852 | null |
2024-03-06 | Diffusion on language model embeddings for protein sequence generation | Viacheslav Meshchaninov et.al. | 2403.03726 | null |
2024-03-06 | Efficient Search and Learning for Agile Locomotion on Stepping Stones | Adithya Kumar Chinnakkonda Ravi et.al. | 2403.03639 | null |
2024-03-06 | Diffusion-based Generative Prior for Low-Complexity MIMO Channel Estimation | Benedikt Fesl et.al. | 2403.03545 | link |
2024-03-06 | NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging | Takahiro Shirakawa et.al. | 2403.03485 | link |
2024-03-06 | FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion | Hao Wang et.al. | 2403.03463 | link |
2024-03-06 | Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing | Bingyan Liu et.al. | 2403.03431 | null |
2024-03-05 | Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Patrick Esser et.al. | 2403.03206 | null |
2024-03-05 | MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets | Hossein Aboutalebi et.al. | 2403.03194 | link |
2024-03-05 | NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models | Zeqian Ju et.al. | 2403.03100 | null |
2024-03-05 | Global N-body Simulation of Gap Edge Structures Created by Perturbations from a Small Satellite Embedded in Saturn's Rings | Naoya Torii et.al. | 2403.03012 | null |
2024-03-05 | Cross-Domain Image Conversion by CycleDM | Sho Shimotsumagari et.al. | 2403.02919 | null |
2024-03-05 | MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model | Sen Wang et.al. | 2403.02905 | link |
2024-03-05 | Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders | Daniele Mari et.al. | 2403.02887 | null |
2024-03-05 | Zero-LED: Zero-Reference Lighting Estimation Diffusion Model for Low-Light Image Enhancement | Jinhong He et.al. | 2403.02879 | null |
2024-03-05 | Scalable Continuous-time Diffusion Framework for Network Inference and Influence Estimation | Keke Huang et.al. | 2403.02867 | link |
2024-03-05 | Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation | Weijie Li et.al. | 2403.02827 | null |
2024-03-02 | DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction | Junwen Xiong et.al. | 2403.01226 | null |
2024-03-02 | TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion | Salaheldin Mohamed et.al. | 2403.01212 | null |
2024-03-02 | Training Unbiased Diffusion Models From Biased Dataset | Yeongmin Kim et.al. | 2403.01189 | link |
2024-03-02 | Volume diffusion modelling of a sheared granular gas | Duncan Dockar et.al. | 2403.01188 | null |
2024-03-02 | Text-guided Explorable Image Super-resolution | Kanchana Vaishnavi Gandikota et.al. | 2403.01124 | null |
2024-03-02 | Face Swap via Diffusion Model | Feifei Wang et.al. | 2403.01108 | link |
2024-03-01 | A time-stepping deep gradient flow method for option pricing in (rough) diffusion models | Antonis Papapantoleon et.al. | 2403.00746 | null |
2024-03-01 | Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks | Yuhao Liu et.al. | 2403.00644 | null |
2024-03-01 | Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset | Ander Salaberria et.al. | 2403.00587 | link |
2024-03-01 | Rethinking cluster-conditioned diffusion models | Nikolas Adaloglou et.al. | 2403.00570 | link |
2024-02-29 | DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Muyang Li et.al. | 2402.19481 | link |
2024-02-29 | Towards Generalizable Tumor Synthesis | Qi Chen et.al. | 2402.19470 | link |
2024-02-29 | Listening to the Noise: Blind Denoising with Gibbs Diffusion | David Heurtel-Depeiges et.al. | 2402.19455 | link |
2024-02-29 | Structure Preserving Diffusion Models | Haoye Lu et.al. | 2402.19369 | null |
2024-02-29 | A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation | Hanxi Li et.al. | 2402.19330 | link |
2024-02-29 | DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly | Gianluca Scarpellini et.al. | 2402.19302 | link |
2024-02-29 | TEncDM: Understanding the Properties of Diffusion Model in the Space of Language Model Encodings | Alexander Shabalin et.al. | 2402.19097 | link |
2024-03-01 | Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach | Sarina Thomas et.al. | 2402.19062 | null |
2024-02-29 | WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis | Paul Friedrich et.al. | 2402.19043 | link |
2024-02-29 | Generating, Reconstructing, and Representing Discrete and Continuous Data: Generalized Diffusion with Learnable Encoding-Decoding | Guangyi Liu et.al. | 2402.19009 | null |
2024-02-28 | Logarithmic Sobolev Inequalities for Bounded Domains and Applications to Drift-Diffusion Equations | Elie Abdo et.al. | 2402.18572 | null |
2024-02-28 | Dynamical Regimes of Diffusion Models | Giulio Biroli et.al. | 2402.18491 | null |
2024-02-28 | Deep Confident Steps to New Pockets: Strategies for Docking Generalization | Gabriele Corso et.al. | 2402.18396 | link |
2024-02-28 | Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model | Sangjoon Park et.al. | 2402.18362 | null |
2024-02-28 | FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes | Ziying Pan et.al. | 2402.18331 | link |
2024-02-28 | Balancing Act: Distribution-Guided Debiasing in Diffusion Models | Rishubh Parihar et.al. | 2402.18206 | null |
2024-02-28 | Diffusion-based Neural Network Weights Generation | Bedionita Soro et.al. | 2402.18153 | link |
2024-02-28 | Context-aware Talking Face Video Generation | Meidai Xuanyuan et.al. | 2402.18092 | null |
2024-02-28 | Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis | Yanzuo Lu et.al. | 2402.18078 | link |
2024-02-28 | SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model | Bin Cao et.al. | 2402.18068 | link |
2024-02-27 | Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning | Xiaoyu Zhang et.al. | 2402.17768 | null |
2024-02-27 | Structure-Guided Adversarial Training of Diffusion Models | Ling Yang et.al. | 2402.17563 | null |
2024-02-27 | Diffusion Model-Based Image Editing: A Survey | Yi Huang et.al. | 2402.17525 | link |
2024-02-27 | Label-Noise Robust Diffusion Models | Byeonghu Na et.al. | 2402.17517 | link |
2024-02-27 | EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | Linrui Tian et.al. | 2402.17485 | null |
2024-02-27 | DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model | Shyam Marjit et.al. | 2402.17412 | null |
2024-02-27 | Generative diffusion model for surface structure discovery | Nikolaj Rønne et.al. | 2402.17404 | null |
2024-02-27 | Denoising Diffusion Models for Inpainting of Healthy Brain Tissue | Alicia Durrer et.al. | 2402.17307 | null |
2024-02-27 | DivAvatar: Diverse 3D Avatar Generation with a Single Prompt | Weijing Tao et.al. | 2402.17292 | null |
2024-02-27 | Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network | Zhaoyang Wang et.al. | 2402.17285 | link |
2024-02-26 | Stochastic Conditional Diffusion Models for Semantic Image Synthesis | Juyeon Ko et.al. | 2402.16506 | link |
2024-02-26 | Outline-Guided Object Inpainting with Diffusion Models | Markus Pobitzer et.al. | 2402.16421 | null |
2024-02-26 | Placing Objects in Context via Inpainting for Out-of-distribution Segmentation | Pau de Jorge et.al. | 2402.16392 | link |
2024-02-26 | Generative AI in Vision: A Survey on Models, Metrics and Applications | Gaurav Raut et.al. | 2402.16369 | null |
2024-02-26 | Feedback Efficient Online Fine-Tuning of Diffusion Models | Masatoshi Uehara et.al. | 2402.16359 | null |
2024-02-26 | Graph Diffusion Policy Optimization | Yijing Liu et.al. | 2402.16302 | link |
2024-02-25 | Photon-counting CT using a Conditional Diffusion Model for Super-resolution and Texture-preservation | Christopher Wiedeman et.al. | 2402.16212 | null |
2024-02-25 | Towards Efficient Quantum Hybrid Diffusion Models | Francesca De Falco et.al. | 2402.16147 | null |
2024-02-25 | Cinematographic Camera Diffusion Model | Hongda Jiang et.al. | 2402.16143 | link |
2024-02-25 | Behavioral Refinement via Interpolant-based Policy Diffusion | Kaiqi Chen et.al. | 2402.16075 | link |
2024-02-23 | Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition | Chun-Hsiao Yeh et.al. | 2402.15504 | link |
2024-02-23 | ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation | Yi Zhang et.al. | 2402.15429 | link |
2024-02-23 | Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models | Shunyu Liu et.al. | 2402.15289 | link |
2024-02-23 | Weak Reproductive Solutions for a Convection-Diffusion Model Describing a Binary Alloy Solidification Processes | Blanca Climent-Ezquerra et.al. | 2402.15221 | null |
2024-02-23 | Label-efficient Multi-organ Segmentation Method with Diffusion Model | Yongzhi Huang et.al. | 2402.15216 | null |
2024-02-23 | Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control | Masatoshi Uehara et.al. | 2402.15194 | null |
2024-02-23 | Dynamics-Guided Diffusion Model for Robot Manipulator Design | Xiaomeng Xu et.al. | 2402.15038 | null |
2024-02-22 | Cameras as Rays: Pose Estimation via Ray Diffusion | Jason Y. Zhang et.al. | 2402.14817 | null |
2024-02-22 | Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Yixuan Ren et.al. | 2402.14780 | null |
2024-02-22 | Debiasing Text-to-Image Diffusion Models | Ruifei He et.al. | 2402.14577 | null |
2024-02-22 | Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems | Christina Schenk et.al. | 2402.14446 | null |
2024-02-22 | Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning | Haoran He et.al. | 2402.14407 | null |
2024-02-22 | Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment | Zhaoyang Wang et.al. | 2402.14401 | link |
2024-02-22 | Typographic Text Generation with Off-the-Shelf Diffusion Model | KhayTze Peong et.al. | 2402.14314 | null |
2024-02-22 | Font Style Interpolation with Diffusion Models | Tetta Kondo et.al. | 2402.14311 | null |
2024-02-22 | Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion | Yujia Huang et.al. | 2402.14285 | link |
2024-02-22 | MVD |
Xin-Yang Zheng et.al. | 2402.14253 | null |
2024-02-21 | Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate | Yuchen Liang et.al. | 2402.13901 | null |
2024-02-21 | NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion | Haoyu Li et.al. | 2402.13809 | null |
2024-02-21 | Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Jiayu Chen et.al. | 2402.13777 | link |
2024-02-21 | Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion | Lianghu Guo et.al. | 2402.13776 | null |
2024-02-21 | Music Style Transfer with Time-Varying Inversion of Diffusion Models | Sifei Li et.al. | 2402.13763 | null |
2024-02-21 | SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model | Xudong Ling et.al. | 2402.13737 | link |
2024-02-21 | Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation | Kihong Kim et.al. | 2402.13729 | null |
2024-02-21 | Flexible Physical Camouflage Generation Based on a Differential Approach | Yang Li et.al. | 2402.13575 | null |
2024-02-21 | ToDo: Token Downsampling for Efficient Generation of High-Resolution Images | Ethan Smith et.al. | 2402.13573 | null |
2024-02-21 | Generative AI for Secure Physical Layer Communications: A Survey | Changyuan Zhao et.al. | 2402.13553 | null |
2024-02-20 | Neural Network Diffusion | Kai Wang et.al. | 2402.13144 | link |
2024-02-20 | Text-Guided Molecule Generation with Diffusion Language Model | Haisong Gong et.al. | 2402.13040 | link |
2024-02-20 | Visual Style Prompting with Swapping Self-Attention | Jaeseok Jeong et.al. | 2402.12974 | link |
2024-02-20 | CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection | Sohail Ahmed Khan et.al. | 2402.12927 | link |
2024-02-20 | RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models | Xinchen Zhang et.al. | 2402.12908 | link |
2024-02-20 | Two-stage Rainfall-Forecasting Diffusion Model | XuDong Ling et.al. | 2402.12779 | link |
2024-02-20 | MuLan: Multimodal-LLM Agent for Progressive Multi-Object Diffusion | Sen Li et.al. | 2402.12741 | link |
2024-02-20 | Diffusion Posterior Sampling is Computationally Intractable | Shivam Gupta et.al. | 2402.12727 | null |
2024-02-20 | MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction | Shitao Tang et.al. | 2402.12712 | null |
2024-02-20 | SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion | Liumeng Xue et.al. | 2402.12660 | link |
2024-02-19 | FiT: Flexible Vision Transformer for Diffusion Model | Zeyu Lu et.al. | 2402.12376 | link |
2024-02-19 | Synthetic location trajectory generation using categorical diffusion models | Simon Dirmeier et.al. | 2402.12242 | link |
2024-02-19 | Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training | Leo Hyun Park et.al. | 2402.12187 | null |
2024-02-19 | Human Video Translation via Query Warping | Haiming Zhu et.al. | 2402.12099 | null |
2024-02-19 | Direct Consistency Optimization for Compositional Text-to-Image Personalization | Kyungmin Lee et.al. | 2402.12004 | null |
2024-02-19 | Privacy-Preserving Low-Rank Adaptation for Latent Diffusion Models | Zihao Luo et.al. | 2402.11989 | link |
2024-02-19 | DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation | Chong Zeng et.al. | 2402.11929 | link |
2024-02-19 | A Generative Pre-Training Framework for Spatio-Temporal Graph Transfer Learning | Yuan Yuan et.al. | 2402.11922 | link |
2024-02-19 | ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image | Yan Hong et.al. | 2402.11849 | null |
2024-02-19 | UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models | Yihua Zhang et.al. | 2402.11846 | link |
2024-02-16 | 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Tsung-Wei Ke et.al. | 2402.10885 | null |
2024-02-16 | Training Class-Imbalanced Diffusion Model Via Overlap Optimization | Divin Yan et.al. | 2402.10821 | link |
2024-02-16 | VATr++: Choose Your Words Wisely for Handwritten Text Generation | Bram Vanherle et.al. | 2402.10798 | null |
2024-02-16 | Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation | Hongbin Na et.al. | 2402.10699 | null |
2024-02-16 | Generative AI and Attentive User Interfaces: Five Strategies to Enhance Take-Over Quality in Automated Driving | Patrick Ebel et.al. | 2402.10664 | null |
2024-02-16 | Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model | Xiangyu Zhang et.al. | 2402.10642 | null |
2024-02-16 | U |
Ziqi Gao et.al. | 2402.10609 | link |
2024-02-16 | A maximum likelihood estimation of Lévy-driven stochastic systems for univariate and multivariate time series of observations | Babak M. S. Arani et.al. | 2402.10608 | null |
2024-02-16 | Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation | Lanqing Guo et.al. | 2402.10491 | link |
2024-02-16 | Explaining generative diffusion models via visual analysis for interpretable decision-making process | Ji-Hoon Park et.al. | 2402.10404 | link |
2024-02-15 | Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation | Huizhuo Yuan et.al. | 2402.10210 | null |
2024-02-15 | Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Rui Yang et.al. | 2402.10207 | link |
2024-02-15 | Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model | Mariia Drozdova et.al. | 2402.10204 | link |
2024-02-15 | Classification Diffusion Models | Shahar Yadin et.al. | 2402.10095 | null |
2024-02-15 | Diffusion Models Meet Contextual Bandits with Large Action Spaces | Imad Aouali et.al. | 2402.10028 | null |
2024-02-15 | Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion | Hila Manor et.al. | 2402.10009 | null |
2024-02-15 | Accelerating Parallel Sampling of Diffusion Models | Zhiwei Tang et.al. | 2402.09970 | link |
2024-02-15 | Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation | Junjie Shentu et.al. | 2402.09966 | link |
2024-02-15 | Lester: rotoscope animation through video object segmentation and tracking | Ruben Tous et.al. | 2402.09883 | link |
2024-02-15 | Diffusion Models for Audio Restoration | Jean-Marie Lemercier et.al. | 2402.09821 | null |
2024-02-14 | Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection | Pengfei Zhou et.al. | 2402.09242 | link |
2024-02-14 | Semi-Supervised Diffusion Model for Brain Age Prediction | Ayodeji Ijishakin et.al. | 2402.09137 | null |
2024-02-14 | L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects | Yutaro Yamada et.al. | 2402.09052 | null |
2024-02-14 | Extreme Video Compression with Pre-trained Diffusion Models | Bohan Li et.al. | 2402.08934 | link |
2024-02-14 | The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes | Myeongseob Ko et.al. | 2402.08922 | link |
2024-02-13 | Percolating transition to turbulence without puffs or bands | Sébastien Gomé et.al. | 2402.08829 | null |
2024-02-13 | LDTrack: Dynamic People Tracking by Service Robots using Diffusion Models | Angus Fung et.al. | 2402.08774 | null |
2024-02-13 | Towards the Detection of AI-Synthesized Human Face Images | Yuhang Lu et.al. | 2402.08750 | null |
2024-02-13 | PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models | Fei Deng et.al. | 2402.08714 | null |
2024-02-13 | Chain Reaction of Ideas: Can Radioactive Decay Predict Technological Innovation? | Guilherme S. Y. Giardini et.al. | 2402.08681 | null |
2024-02-13 | Target Score Matching | Valentin De Bortoli et.al. | 2402.08667 | null |
2024-02-13 | Learning Continuous 3D Words for Text-to-Image Generation | Ta-Ying Cheng et.al. | 2402.08654 | link |
2024-02-13 | Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator | Amartya Mukherjee et.al. | 2402.08563 | null |
2024-02-13 | Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases | Ziyi Zhang et.al. | 2402.08552 | link |
2024-02-13 | A Dense Reward View on Aligning Text-to-Image Diffusion with Preference | Shentao Yang et.al. | 2402.08265 | link |
2024-02-13 | Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation | AprilPyone MaungMaung et.al. | 2402.08200 | null |
2024-02-12 | Convergence Analysis of Discrete Diffusion Model: Exact Implementation through Uniformization | Hongrui Chen et.al. | 2402.08095 | null |
2024-02-12 | Nearest Neighbour Score Estimators for Diffusion Generative Models | Matthew Niedoba et.al. | 2402.08018 | link |
2024-02-12 | Towards a mathematical theory for consistency training in diffusion models | Gen Li et.al. | 2402.07802 | null |
2024-02-12 | Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models | Jiacheng Ye et.al. | 2402.07754 | link |
2024-02-12 | Cosmology at the Field Level with Probabilistic Machine Learning | Adam Rouhiainen et.al. | 2402.07694 | null |
2024-02-12 | Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback | Cansu Korkmaz et.al. | 2402.07597 | null |
2024-02-12 | Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial | Wenpin Tang et.al. | 2402.07487 | null |
2024-02-12 | SALAD: Smart AI Language Assistant Daily | Ragib Amin Nihal et.al. | 2402.07431 | null |
2024-02-12 | Diff-RNTraj: A Structure-aware Diffusion Model for Road Network-constrained Trajectory Generation | Tonglong Wei et.al. | 2402.07369 | link |
2024-02-11 | Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL | Sungyoon Kim et.al. | 2402.07226 | link |
2024-02-11 | Towards Fast Stochastic Sampling in Diffusion Generative Models | Kushagra Pandey et.al. | 2402.07211 | null |
2024-02-10 | Synthesizing CTA Image Data for Type-B Aortic Dissection using Stable Diffusion Models | Ayman Abaid et.al. | 2402.06969 | null |
2024-02-09 | Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following | Brian Yang et.al. | 2402.06559 | link |
2024-02-09 | Sequential Flow Matching for Generative Modeling | Jongmin Yoon et.al. | 2402.06461 | null |
2024-02-09 | ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation | Fengyi Shen et.al. | 2402.06446 | null |
2024-02-09 | Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation | Peter Hönig et.al. | 2402.06436 | null |
2024-02-09 | Particle Denoising Diffusion Sampler | Angus Phillips et.al. | 2402.06320 | link |
2024-02-09 | Controllable seismic velocity synthesis using generative diffusion models | Fu Wang et.al. | 2402.06277 | null |
2024-02-09 | MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models | Yixiao Zhang et.al. | 2402.06178 | link |
2024-02-08 | CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models | Maitreya Suin et.al. | 2402.06106 | null |
2024-02-08 | Animated Stickers: Bringing Stickers to Life with Video Diffusion | David Yan et.al. | 2402.06088 | null |
2024-02-08 | DiscDiff: Latent Diffusion Model for DNA Sequence Generation | Zehui Li et.al. | 2402.06079 | null |
2024-02-08 | InstaGen: Enhancing Object Detection by Training on Synthetic Dataset | Chengjian Feng et.al. | 2402.05937 | null |
2024-02-08 | Time Series Diffusion in the Frequency Domain | Jonathan Crabbé et.al. | 2402.05933 | link |
2024-02-08 | AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning | Wamiq Reyaz Para et.al. | 2402.05803 | null |
2024-02-08 | DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer | Zhiyuan Ma et.al. | 2402.05712 | link |
2024-02-08 | Scalable Diffusion Models with State Space Backbone | Zhengcong Fei et.al. | 2402.05608 | link |
2024-02-08 | Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models | Senmao Li et.al. | 2402.05375 | link |
2024-02-08 | Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model | Junghun Cha et.al. | 2402.05350 | null |
2024-02-07 | SPAD : Spatially Aware Multiview Diffusers | Yash Kant et.al. | 2402.05235 | null |
2024-02-07 | Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models | Nicholas Konz et.al. | 2402.05210 | link |
2024-02-07 | Maitreya Patel et.al. | 2402.05195 | null | |
2024-02-07 | On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling | Marcin Sendera et.al. | 2402.05098 | link |
2024-02-07 | NITO: Neural Implicit Fields for Resolution-free Topology Optimization | Amin Heyrani Nobari et.al. | 2402.05073 | link |
2024-02-07 | LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation | Jiaxiang Tang et.al. | 2402.05054 | null |
2024-02-07 | Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design | Andrew Campbell et.al. | 2402.04997 | link |
2024-02-07 | Blue noise for diffusion models | Xingchang Huang et.al. | 2402.04930 | link |
2024-02-07 | Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation | Shivang Chopra et.al. | 2402.04929 | null |
2024-02-07 | Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints | Jian Chen et.al. | 2402.04754 | link |
2024-02-07 | Cortical Surface Diffusion Generative Models | Zhenshan Xie et.al. | 2402.04753 | null |
2024-02-07 | EvoSeed: Unveiling the Threat on Deep Neural Networks with Real-World Illusions | Shashank Kotyan et.al. | 2402.04699 | link |
2024-02-07 | Noise Map Guidance: Inversion with Spatial Context for Real Image Editing | Hansam Cho et.al. | 2402.04625 | link |
2024-02-06 | Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation | Zolnamar Dorjsembe et.al. | 2402.04031 | link |
2024-02-06 | Space Group Constrained Crystal Generation | Rui Jiao et.al. | 2402.03992 | null |
2024-02-06 | Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting | Yiming Xu et.al. | 2402.03981 | null |
2024-02-06 | EscherNet: A Generative Model for Scalable View Synthesis | Xin Kong et.al. | 2402.03908 | link |
2024-02-06 | On gauge freedom, conservativity and intrinsic dimensionality estimation in diffusion models | Christian Horvat et.al. | 2402.03845 | null |
2024-02-06 | SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising | Yu-Tung Liu et.al. | 2402.03808 | link |
2024-02-06 | FoolSDEdit: Deceptively Steering Your Edits Towards Targeted Attribute-aware Distribution | Qi Zhou et.al. | 2402.03705 | null |
2024-02-06 | Improving and Unifying Discrete&Continuous-time Discrete Denoising Diffusion | Lingxiao Zhao et.al. | 2402.03701 | link |
2024-02-06 | Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation | Lingxiao Zhao et.al. | 2402.03687 | link |
2024-02-06 | QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning | Haoxuan Wang et.al. | 2402.03666 | link |
2024-02-05 | Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? | Qiyao Liang et.al. | 2402.03305 | null |
2024-02-05 | Zero-shot Object-Level OOD Detection with Context-Aware Inpainting | Quang-Huy Nguyen et.al. | 2402.03292 | null |
2024-02-05 | InstanceDiffusion: Instance-level Control for Image Generation | Xudong Wang et.al. | 2402.03290 | link |
2024-02-05 | Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? | Anna Yoo Jeong Ha et.al. | 2402.03214 | null |
2024-02-05 | Light and Optimal Schrödinger Bridge Matching | Nikita Gushchin et.al. | 2402.03207 | link |
2024-02-05 | Guidance with Spherical Gaussian Constraint for Conditional Diffusion | Lingxiao Yang et.al. | 2402.03201 | link |
2024-02-05 | Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion | Shiyuan Yang et.al. | 2402.03162 | null |
2024-02-05 | PFDM: Parser-Free Virtual Try-on via Diffusion Model | Yunfang Niu et.al. | 2402.03047 | null |
2024-02-05 | Diffusive Gibbs Sampling | Wenlin Chen et.al. | 2402.03008 | link |
2024-02-05 | DexDiffuser: Generating Dexterous Grasps with Diffusion Models | Zehang Weng et.al. | 2402.02989 | null |
2024-02-02 | NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties | Jingyuan Sun et.al. | 2402.01590 | null |
2024-02-02 | Boximator: Generating Rich and Controllable Motions for Video Synthesis | Jiawei Wang et.al. | 2402.01566 | null |
2024-02-02 | Cross-view Masked Diffusion Transformers for Person Image Synthesis | Trung X. Pham et.al. | 2402.01516 | link |
2024-02-02 | Conditioning non-linear and infinite-dimensional diffusion processes | Elizabeth Louise Baker et.al. | 2402.01434 | link |
2024-02-02 | Bass Accompaniment Generation via Latent Diffusion | Marco Pasini et.al. | 2402.01412 | null |
2024-02-02 | Cheating Suffix: Targeted Attack to Text-To-Image Diffusion Models with Multi-Modal Priors | Dingcheng Yang et.al. | 2402.01369 | link |
2024-02-02 | Unsupervised Generation of Pseudo Normal PET from MRI with Diffusion Model for Epileptic Focus Localization | Wentao Chen et.al. | 2402.01191 | null |
2024-02-01 | Unconditional Latent Diffusion Models Memorize Patient Imaging Data | Salman Ul Hassan Dar et.al. | 2402.01054 | link |
2024-02-01 | pop-cosmos: A comprehensive picture of the galaxy population from COSMOS data | Justin Alsing et.al. | 2402.00935 | null |
2024-02-01 | Data-Space Validation of High-Dimensional Models by Comparing Sample Quantiles | Stephen Thorp et.al. | 2402.00930 | null |
2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Jiahua Dong et.al. | 2402.00864 | link |
2024-02-01 | An Analysis of the Variance of Diffusion-based Speech Enhancement | Bunlong Lay et.al. | 2402.00811 | null |
2024-02-01 | Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching | Shangzhe Li et.al. | 2402.00807 | null |
2024-02-01 | AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning | Fu-Yun Wang et.al. | 2402.00769 | link |
2024-02-01 | Cylindrically symmetric diffusion model for relativistic heavy-ion collisions | Johannes Hoelck et.al. | 2402.00628 | null |
2024-02-01 | CapHuman: Capture Your Moments in Parallel Universes | Chao Liang et.al. | 2402.00627 | link |
2024-02-01 | Masked Conditional Diffusion Model for Enhancing Deepfake Detection | Tiewen Chen et.al. | 2402.00541 | null |
2024-02-01 | Energetic Particles in the Central Starburst, Disc, and Halo of NGC253 | Yoel Rephaeli et.al. | 2402.00523 | null |
2024-02-01 | LRDif: Diffusion Models for Under-Display Camera Emotion Recognition | Zhifeng Wang et.al. | 2402.00250 | null |
2024-01-31 | SuperDiff: Diffusion Models for Conditional Generation of Hypothetical New Families of Superconductors | Samuel Yuan et.al. | 2402.00198 | link |
2024-01-31 | Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators | Daniel Geng et.al. | 2401.18085 | null |
2024-01-31 | Ljusternik-Schnirelmann eigenvalues for the fractional $m-$Laplacian without the |
Julian Fernandez Bonder et.al. | 2401.18041 | null |
2024-01-31 | Diagnosing the particle transport mechanism in the pulsar halo via X-ray observations | Qi-Zuo Wu et.al. | 2401.17982 | null |
2024-01-31 | Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances | Xuefeng Gao et.al. | 2401.17958 | null |
2024-01-31 | AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error | Jonas Ricker et.al. | 2401.17879 | link |
2024-01-31 | Drift Diffusion Model to understand (mis)information sharing dynamic in complex networks | Lucila G. Alvarez-Zuzek et.al. | 2401.17846 | null |
2024-01-31 | A new class of efficient high order semi-Lagrangian IMEX discontinuous Galerkin methods on staggered unstructured meshes | M. Tavelli et.al. | 2401.17806 | null |
2024-01-31 | Dance-to-Music Generation with Encoder-based Textual Inversion of Diffusion Models | Sifei Li et.al. | 2401.17800 | link |
2024-01-31 | Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation | Yuanhuiyi Lyu et.al. | 2401.17664 | null |
2024-01-31 | Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models | Kyungsung Lee et.al. | 2401.17629 | null |
2024-01-30 | You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation | Mehdi Noroozi et.al. | 2401.17258 | null |
2024-01-30 | ContactGen: Contact-Guided Interactive 3D Human Generation for Partners | Dongjun Gu et.al. | 2401.17212 | null |
2024-01-30 | Transfer Learning for Text Diffusion Models | Kehang Han et.al. | 2401.17181 | null |
2024-01-30 | PlantoGraphy: Incorporating Iterative Design Process into Generative Artificial Intelligence for Landscape Rendering | Rong Huang et.al. | 2401.17120 | null |
2024-01-30 | Local modification of subdiffusion by initial Fickian diffusion: Multiscale modeling, analysis and computation | Xiangcheng Zheng et.al. | 2401.16885 | null |
2024-01-30 | A Literature Review on Fetus Brain Motion Correction in MRI | Haoran Zhang et.al. | 2401.16782 | null |
2024-01-30 | BoostDream: Efficient Refining for High-Quality Text-to-3D Generation from Multi-View Diffusion | Yonghao Yu et.al. | 2401.16764 | null |
2024-01-30 | Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization | Henglei Lv et.al. | 2401.16762 | null |
2024-01-30 | Diffusion model for relational inference | Shuhan Zheng et.al. | 2401.16755 | null |
2024-01-29 | Using multiple Dirac delta points to describe inhomogeneous flux density over a cell boundary in a single-cell diffusion model | Qiyao Peng et.al. | 2401.16261 | null |
2024-01-29 | Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models | Zhongjie Duan et.al. | 2401.16224 | null |
2024-01-29 | Spatial-Aware Latent Initialization for Controllable Image Generation | Wenqiang Sun et.al. | 2401.16157 | null |
2024-01-29 | DMCE: Diffusion Model Channel Enhancer for Multi-User Semantic Communication Systems | Youcheng Zeng et.al. | 2401.16017 | null |
2024-01-29 | Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling | Xiaoyu Shi et.al. | 2401.15977 | null |
2024-01-29 | EmoDM: A Diffusion Model for Evolutionary Multi-objective Optimization | Xueming Yan et.al. | 2401.15931 | null |
2024-01-28 | Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding | Jianxiang Lu et.al. | 2401.15708 | null |
2024-01-28 | Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance | Qingcheng Zhao et.al. | 2401.15687 | null |
2024-01-28 | CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement | Xiaowen Shi et.al. | 2401.15649 | null |
2024-01-28 | FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models | Feihong He et.al. | 2401.15636 | link |
2024-01-26 | Annotated Hands for Generative Models | Yue Yang et.al. | 2401.15075 | link |
2024-01-26 | Text Image Inpainting via Global Structure-Guided Diffusion Models | Shipeng Zhu et.al. | 2401.14832 | link |
2024-01-25 | Opposite variations for pore pressure on and off the fault during simulated earthquakes in the laboratory | Dong Liu et.al. | 2401.14506 | null |
2024-01-25 | Deconstructing Denoising Diffusion Models for Self-Supervised Learning | Xinlei Chen et.al. | 2401.14404 | null |
2024-01-25 | pix2gestalt: Amodal Segmentation by Synthesizing Wholes | Ege Ozguroglu et.al. | 2401.14398 | link |
2024-01-25 | UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models | Timo Kapsalis et.al. | 2401.14379 | null |
2024-01-25 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Minglin Chen et.al. | 2401.14257 | null |
2024-01-26 | Image Synthesis with Graph Conditioning: CLIP-Guided Diffusion Models for Scene Graphs | Rameshwar Mishra et.al. | 2401.14111 | null |
2024-01-25 | CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion | Nisha Huang et.al. | 2401.14066 | link |
2024-01-25 | Diffusion-based Data Augmentation for Object Counting Problems | Zhen Wang et.al. | 2401.13992 | null |
2024-01-25 | BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models | Senthil Purushwalkam et.al. | 2401.13974 | link |
2024-01-25 | StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models | Yalong Bai et.al. | 2401.13942 | null |
2024-01-24 | Inverse Molecular Design with Multi-Conditional Diffusion Guidance | Gang Liu et.al. | 2401.13858 | link |
2024-01-24 | Guided Diffusion for Fast Inverse Design of Density-based Mechanical Metamaterials | Yanyan Yang et.al. | 2401.13570 | link |
2024-01-24 | UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion | Wei Li et.al. | 2401.13388 | null |
2024-01-24 | Generative Design of Crystal Structures by Point Cloud Representations and Diffusion Model | Zhelin Li et.al. | 2401.13192 | link |
2024-01-24 | Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model | Yuanming Li et.al. | 2401.13191 | null |
2024-01-24 | Compositional Generative Inverse Design | Tailin Wu et.al. | 2401.13171 | link |
2024-01-24 | Choose Your Diffusion: Efficient and flexible ways to accelerate the diffusion model in fast high energy physics simulation | Cheng Jiang et.al. | 2401.13162 | null |
2024-01-23 | GALA: Generating Animatable Layered Assets from a Single Scan | Taeksoo Kim et.al. | 2401.12979 | null |
2024-01-24 | Zero-Shot Learning for the Primitives of 3D Affordance in General Objects | Hyeonwoo Kim et.al. | 2401.12978 | link |
2024-01-23 | Lumiere: A Space-Time Diffusion Model for Video Generation | Omer Bar-Tal et.al. | 2401.12945 | null |
2024-01-23 | UniHDA: Towards Universal Hybrid Domain Adaptation of Image Generators | Hengjia Li et.al. | 2401.12596 | null |
2024-01-23 | ToDA: Target-oriented Diffusion Attacker against Recommendation System | Xiaohao Liu et.al. | 2401.12578 | null |
2024-01-23 | DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations | Dogyun Park et.al. | 2401.12517 | link |
2024-01-22 | DITTO: Diffusion Inference-Time T-Optimization for Music Generation | Zachary Novack et.al. | 2401.12179 | null |
2024-01-22 | Single-View 3D Human Digitalization with Large Reconstruction Models | Zhenzhen Weng et.al. | 2401.12175 | null |
2024-01-22 | Feature Denoising Diffusion Model for Blind Image Quality Assessment | Xudong Li et.al. | 2401.11949 | null |
2024-01-22 | EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models | Koichi Namekata et.al. | 2401.11739 | null |
2024-01-22 | Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs | Ling Yang et.al. | 2401.11708 | link |
2024-01-21 | Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers | Katherine Crowson et.al. | 2401.11605 | link |
2024-01-20 | Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient | Weiguo Lu et.al. | 2401.11261 | null |
2024-01-20 | Product-Level Try-on: Characteristics-preserving Try-on with Realistic Clothes Shading and Wrinkles | Yanlong Zang et.al. | 2401.11239 | null |
2024-01-20 | MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation | Nhat M. Hoang et.al. | 2401.11115 | link |
2024-01-20 | UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures | Mingyuan Zhou et.al. | 2401.11078 | null |
2024-01-19 | Synthesizing Moving People with 3D Control | Boyi Li et.al. | 2401.10889 | null |
2024-01-19 | ActAnywhere: Subject-Aware Video Background Generation | Boxiao Pan et.al. | 2401.10822 | null |
2024-01-19 | From Market Saturation to Social Reinforcement: Understanding the Impact of Non-Linearity in Information Diffusion Models | Tobias Friedrich et.al. | 2401.10818 | null |
2024-01-19 | Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion | Zuoyue Li et.al. | 2401.10786 | null |
2024-01-19 | Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model | Yinan Zheng et.al. | 2401.10700 | link |
2024-01-19 | MAEDiff: Masked Autoencoder-enhanced Diffusion Models for Unsupervised Anomaly Detection in Brain Images | Rui Xu et.al. | 2401.10561 | null |
2024-01-18 | Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution | Xin Yuan et.al. | 2401.10404 | null |
2024-01-18 | A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Wouter Van Gansbeke et.al. | 2401.10227 | link |
2024-01-19 | Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation | Changgu Chen et.al. | 2401.10150 | null |
2024-01-18 | DiffusionGPT: LLM-Driven Text-to-Image Generation System | Jie Qin et.al. | 2401.10061 | null |
2024-01-18 | CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | Zhao Wang et.al. | 2401.09962 | null |
2024-01-18 | BlenDA: Domain Adaptive Object Detection through diffusion-based blending | Tzuhsuan Huang et.al. | 2401.09921 | link |
2024-01-18 | Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework | Junkun Jiang et.al. | 2401.09836 | link |
2024-01-18 | Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing | Gwanhyeong Koo et.al. | 2401.09794 | null |
2024-01-18 | Image Translation as Diffusion Visual Programmers | Cheng Han et.al. | 2401.09742 | null |
2024-01-17 | Total fraction of drug released from diffusion-controlled delivery systems with binding reactions | Elliot J. Carr et.al. | 2401.09644 | link |
2024-01-17 | Efficient generative adversarial networks using linear additive-attention Transformers | Emilio Morales-Juarez et.al. | 2401.09596 | link |
2024-01-17 | TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion | Yu-Ying Yeh et.al. | 2401.09416 | null |
2024-01-17 | Vlogger: Make Your Dream A Vlog | Shaobin Zhuang et.al. | 2401.09414 | link |
2024-01-17 | On the |
Mireille Bossy et.al. | 2401.09338 | null |
2024-01-17 | Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery | Jia Jia et.al. | 2401.09325 | null |
2024-01-17 | T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis | Yoonjin Chung et.al. | 2401.09294 | link |
2024-01-17 | Training-Free Semantic Video Composition via Pre-trained Diffusion Model | Jiaqi Guo et.al. | 2401.09195 | null |
2024-01-17 | Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior | Zike Wu et.al. | 2401.09050 | link |
2024-01-17 | Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis | Jonghyun Lee et.al. | 2401.09048 | link |
2024-01-17 | VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models | Haoxin Chen et.al. | 2401.09047 | link |
2024-01-17 | Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation | Tong Xie et.al. | 2401.09031 | link |
2024-01-16 | Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing | Bin Zhang et.al. | 2401.08275 | null |
2024-01-16 | Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization | Chongzhi Zhang et.al. | 2401.08232 | null |
2024-01-16 | Photonic Modes Prediction via Multi-Modal Diffusion Model | Jinyang Sun et.al. | 2401.08199 | null |
2024-01-16 | Key-point Guided Deformable Image Manipulation Using Diffusion Model | Seok-Hwan Oh et.al. | 2401.08178 | null |
2024-01-16 | SpecSTG: A Fast Spectral Diffusion Framework for Probabilistic Spatio-Temporal Traffic Forecasting | Lequan Lin et.al. | 2401.08119 | null |
2024-01-16 | DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech | Jaekwon Im et.al. | 2401.08102 | null |
2024-01-16 | EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model | Bingyuan Zhang et.al. | 2401.08049 | null |
2024-01-16 | Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities | Xu Yan et.al. | 2401.08045 | link |
2024-01-15 | Regularity in diffusion models with gradient activation | Damião Araújo et.al. | 2401.07979 | null |
2024-01-15 | HexaGen3D: StableDiffusion is just one step away from Fast and Diverse Text-to-3D Generation | Antoine Mercier et.al. | 2401.07727 | null |
2024-01-12 | A deep implicit-explicit minimizing movement method for option pricing in jump-diffusion models | Emmanuil H. Georgoulis et.al. | 2401.06740 | null |
2024-01-12 | Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks | Stefan Blücher et.al. | 2401.06654 | link |
2024-01-12 | Adversarial Examples are Misaligned in Diffusion Model Manifolds | Peter Lorenz et.al. | 2401.06637 | null |
2024-01-12 | Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking | Wei Cao et.al. | 2401.06614 | null |
2024-01-12 | 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model | Qian Wang et.al. | 2401.06578 | null |
2024-01-12 | RotationDrag: Point-based Image Editing with Rotated Diffusion Features | Minxing Luo et.al. | 2401.06442 | link |
2024-01-12 | Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering | Chang Yu et.al. | 2401.06345 | null |
2024-01-11 | Frequency-Time Diffusion with Neural Cellular Automata | John Kalkhof et.al. | 2401.06291 | null |
2024-01-11 | Demystifying Variational Diffusion Models | Fabio De Sousa Ribeiro et.al. | 2401.06281 | null |
2024-01-11 | E |
Yifan Gong et.al. | 2401.06127 | null |
2024-01-11 | DiffDA: a diffusion model for weather-scale data assimilation | Langwen Huang et.al. | 2401.05932 | link |
2024-01-11 | Efficient Image Deblurring Networks based on Diffusion Models | Kang Chen et.al. | 2401.05907 | link |
2024-01-11 | HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models | Hanzhang Wang et.al. | 2401.05870 | null |
2024-01-11 | EraseDiff: Erasing Data Influence in Diffusion Models | Jing Wu et.al. | 2401.05779 | link |
2024-01-10 | Diffusion Priors for Dynamic View Synthesis from Monocular Videos | Chaoyang Wang et.al. | 2401.05583 | null |
2024-01-10 | From Pampas to Pixels: Fine-Tuning Diffusion Models for Gaúcho Heritage | Marcellus Amadeus et.al. | 2401.05520 | null |
2024-01-10 | InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes | Mohamad Shahbazi et.al. | 2401.05335 | null |
2024-01-10 | Score Distillation Sampling with Learned Manifold Corrective | Thiemo Alldieck et.al. | 2401.05293 | null |
2024-01-10 | PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models | Junsong Chen et.al. | 2401.05252 | link |
2024-01-10 | Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN | Muhammad Ali Farooq et.al. | 2401.05159 | null |
2024-01-10 | CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model | Yinghui Xing et.al. | 2401.05153 | null |
2024-01-10 | SwiMDiff: Scene-wide Matching Contrastive Learning with Diffusion Constraint for Remote Sensing Image | Jiayuan Tian et.al. | 2401.05093 | null |
2024-01-10 | A novel bond-based nonlocal diffusion model with matrix-valued coefficients in non-divergence form and its collocation discretization | Lili Ju et.al. | 2401.04973 | null |
2024-01-09 | Transmission-eigenchannel velocity and diffusion | Azriel Z. Genack et.al. | 2401.04818 | null |
2024-01-09 | Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation | Xiyi Chen et.al. | 2401.04728 | link |
2024-01-09 | Efficient estimation for ergodic diffusion processes sampled at high frequency | Michael Sørensen et.al. | 2401.04689 | null |
2024-01-09 | EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models | Jingyuan Yang et.al. | 2401.04608 | null |
2024-01-09 | Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models | Xuewen Liu et.al. | 2401.04585 | link |
2024-01-09 | MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation | Weimin Wang et.al. | 2401.04468 | null |
2024-01-09 | D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly Detection | Justin Tebbe et.al. | 2401.04463 | link |
2024-01-09 | SonicVisionLM: Playing Sound with Vision Language Models | Zhifeng Xie et.al. | 2401.04394 | null |
2024-01-09 | Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example | Kwan Yun et.al. | 2401.04362 | null |
2024-01-09 | Memory-Efficient Personalization using Quantized Diffusion Model | Hyogon Ryu et.al. | 2401.04339 | null |
2024-01-08 | FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation | Yang Liu et.al. | 2401.04283 | null |
2024-01-08 | scDiffusion: conditional generation of high-quality single-cell data using diffusion model | Erpai Luo et.al. | 2401.03968 | link |
2024-01-08 | D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement | Danqi Yan et.al. | 2401.03914 | null |
2024-01-08 | DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement | Jiaqi Liu et.al. | 2401.03629 | null |
2024-01-07 | ROIC-DM: Robust Text Inference and Classification via Diffusion Model | Shilong Yuan et.al. | 2401.03514 | null |
2024-01-07 | Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness | Sicheng Yang et.al. | 2401.03476 | null |
2024-01-07 | Deep Learning-based Image and Video Inpainting: A Survey | Weize Quan et.al. | 2401.03395 | null |
2024-01-06 | Reflected Schrödinger Bridge for Constrained Generative Modeling | Wei Deng et.al. | 2401.03228 | null |
2024-01-06 | MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond | Yupei Lin et.al. | 2401.03221 | null |
2024-01-06 | Fair Sampling in Diffusion Models through Switching Mechanism | Yujin Choi et.al. | 2401.03140 | link |
2024-01-05 | Latte: Latent Diffusion Transformer for Video Generation | Xin Ma et.al. | 2401.03048 | link |
2024-01-05 | Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction | Yuxin Yang et.al. | 2401.02916 | null |
2024-01-05 | Plug-in Diffusion Model for Sequential Recommendation | Haokai Ma et.al. | 2401.02913 | link |
2024-01-05 | Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors | Top Piriyakulkij et.al. | 2401.02739 | null |
2024-01-05 | Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation | Can Xu et.al. | 2401.02683 | link |
2024-01-04 | Comprehensive Exploration of Synthetic Data Generation: A Survey | André Bauer et.al. | 2401.02524 | null |
2024-01-04 | VASE: Object-Centric Appearance and Shape Manipulation of Real Videos | Elia Peruzzo et.al. | 2401.02473 | null |
2024-01-04 | Bring Metric Functions into Diffusion Models | Jie An et.al. | 2401.02414 | null |
2024-01-06 | GUESS:GradUally Enriching SyntheSis for Text-Driven Human Motion Generation | Xuehao Gao et.al. | 2401.02142 | link |
2024-01-04 | Preserving Image Properties Through Initializations in Diffusion Models | Jeffrey Zhang et.al. | 2401.02097 | null |
2024-01-04 | Energy based diffusion generator for efficient sampling of Boltzmann distributions | Yan Wang et.al. | 2401.02080 | null |
2024-01-04 | DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection | Yunfan Ye et.al. | 2401.02032 | link |
2024-01-04 | Improving Diffusion-Based Image Synthesis with Context Prediction | Ling Yang et.al. | 2401.02015 | null |
2024-01-03 | Instruct-Imagen: Image Generation with Multi-modal Instruction | Hexiang Hu et.al. | 2401.01952 | null |
2024-01-03 | Can We Generate Realistic Hands Only Using Convolution? | Mehran Hosseini et.al. | 2401.01951 | null |
2024-01-03 | Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions | David Junhao Zhang et.al. | 2401.01827 | link |
2024-01-03 | DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models | Yichen Liu et.al. | 2401.01659 | null |
2024-01-03 | SIGNeRF: Scene Integrated Generation for Neural Radiance Fields | Jan-Niklas Dihlmann et.al. | 2401.01647 | null |
2024-01-03 | S |
Yixuan Wang et.al. | 2401.01520 | link |
2024-01-02 | ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text | Dingkun Yan et.al. | 2401.01456 | link |
2024-01-02 | VALD-MD: Visual Attribution via Latent Diffusion for Medical Diagnostics | Ammar A. Siddiqui et.al. | 2401.01414 | null |
2024-01-02 | VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM | Fuchen Long et.al. | 2401.01256 | link |
2024-01-02 | Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation | Renshuai Liu et.al. | 2401.01207 | null |
2024-01-02 | A comparative study of resistivity models for simulations of magnetic reconnection in the solar atmosphere. II. Plasmoid formation | Øystein Håvard Færder et.al. | 2401.01177 | null |
2024-01-02 | Joint Generative Modeling of Scene Graphs and Images via Diffusion Models | Bicheng Xu et.al. | 2401.01130 | null |
2024-01-02 | Robust single-particle cryo-EM image denoising and restoration | Jing Zhang et.al. | 2401.01097 | null |
2024-01-02 | Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation | Jinlong Xue et.al. | 2401.01044 | link |
2024-01-01 | DiffMorph: Text-less Image Morphing with Diffusion Models | Shounak Chatterjee et.al. | 2401.00739 | null |
2024-01-01 | Diffusion Models, Image Super-Resolution And Everything: A Survey | Brian B. Moser et.al. | 2401.00736 | null |
2024-01-02 | GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields | Xiao Pan et.al. | 2401.00616 | null |
2023-12-31 | Diff-PCR: Diffusion-Based Correspondence Searching in Doubly Stochastic Matrix Space for Point Cloud Registration | Qianliang Wu et.al. | 2401.00436 | null |
2023-12-29 | FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis | Feng Liang et.al. | 2312.17681 | null |
2023-12-29 | Data Augmentation for Supervised Graph Outlier Detection with Latent Diffusion Models | Kay Liu et.al. | 2312.17679 | link |
2023-12-29 | Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation | Tuan-Anh Vu et.al. | 2312.17505 | null |
2023-12-28 | Classifier-free graph diffusion for molecular property targeting | Matteo Ninniri et.al. | 2312.17397 | link |
2023-12-28 | iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views | Chin-Hsuan Wu et.al. | 2312.17250 | link |
2023-12-28 | Personalized Restoration via Dual-Pivot Tuning | Pradyumna Chari et.al. | 2312.17234 | null |
2023-12-28 | 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency | Yuyang Yin et.al. | 2312.17225 | null |
2023-12-28 | Restoration by Generation with Constrained Priors | Zheng Ding et.al. | 2312.17161 | null |
2023-12-28 | DiffKG: Knowledge Graph Diffusion Model for Recommendation | Yangqin Jiang et.al. | 2312.16890 | link |
2023-12-29 | DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors | Biwen Lei et.al. | 2312.16837 | null |
2023-12-27 | I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models | Xun Guo et.al. | 2312.16693 | link |
2023-12-27 | Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection | Huan Liu et.al. | 2312.16649 | link |
2023-12-27 | Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance | Tomer Garber et.al. | 2312.16519 | link |
2023-12-29 | PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion | Guansong Lu et.al. | 2312.16486 | null |
2023-12-26 | One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications | Mengyao Lyu et.al. | 2312.16145 | null |
2023-12-26 | Compositional Search of Stable Crystalline Structures in Multi-Component Alloys Using Generative Diffusion Models | Grzegorz Kaszuba et.al. | 2312.16073 | null |
2023-12-26 | HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D | Sangmin Woo et.al. | 2312.15980 | link |
2023-12-26 | Semantic Guidance Tuning for Text-To-Image Diffusion Models | Hyun Kang et.al. | 2312.15964 | link |
2023-12-26 | Implied volatility (also) is path-dependent | Hervé Andrès et.al. | 2312.15950 | link |
2023-12-26 | EnchantDance: Unveiling the Potential of Music-Driven Dance Movement | Bo Han et.al. | 2312.15946 | link |
2023-12-26 | Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection | Songmin Dai et.al. | 2312.15911 | null |
2023-12-26 | Cross Initialization for Personalized Text-to-Image Generation | Lianyu Pang et.al. | 2312.15905 | link |
2023-12-25 | Adversarial Item Promotion on Visually-Aware Recommender Systems by Guided Diffusion | Lijian Chen et.al. | 2312.15826 | null |
2023-12-25 | High-Fidelity Diffusion-based Image Editing | Chen Hou et.al. | 2312.15707 | null |
2023-12-22 | MACS: Mass Conditioned 3D Hand and Object Motion Synthesis | Soshi Shimada et.al. | 2312.14929 | null |
2023-12-22 | BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction | Honghao Fu et.al. | 2312.14871 | link |
2023-12-22 | Neural-network-based regularization methods for inverse problems in imaging | Andreas Habring et.al. | 2312.14849 | null |
2023-12-22 | Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models | Tanish Baranwal et.al. | 2312.14830 | link |
2023-12-22 | Neural network models for preferential concentration of particles in two-dimensional turbulence | Thibault Maurel-Oujia et.al. | 2312.14829 | null |
2023-12-22 | Plan, Posture and Go: Towards Open-World Text-to-Motion Generation | Jinpeng Liu et.al. | 2312.14828 | null |
2023-12-22 | Harnessing Diffusion Models for Visual Perception with Meta Prompts | Qiang Wan et.al. | 2312.14733 | link |
2023-12-22 | FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection | Dongmei Zhang et.al. | 2312.14465 | null |
2023-12-22 | Generative AI Beyond LLMs: System Implications of Multi-Modal Generation | Alicia Golden et.al. | 2312.14385 | null |
2023-12-21 | Diffusion Reward: Learning Rewards via Conditional Video Diffusion | Tao Huang et.al. | 2312.14134 | link |
2023-12-21 | Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation | Philipp Schröppel et.al. | 2312.14124 | link |
2023-12-21 | HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models | Hayk Manukyan et.al. | 2312.14091 | link |
2023-12-21 | Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning | Desai Xie et.al. | 2312.13980 | null |
2023-12-21 | Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models | Xianfang Zeng et.al. | 2312.13913 | link |
2023-12-21 | Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Huan Ling et.al. | 2312.13763 | null |
2023-12-21 | Free-Editor: Zero-shot Text-driven 3D Scene Editing | Nazmul Karim et.al. | 2312.13663 | link |
2023-12-21 | Diff-Oracle: Diffusion Model for Oracle Character Generation with Controllable Styles and Contents | Jing Li et.al. | 2312.13631 | null |
2023-12-21 | Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion | Nishtha Madaan et.al. | 2312.13616 | null |
2023-12-21 | Front stability of infinitely steep travelling waves in population biology | Matthew J Simpson et.al. | 2312.13601 | link |
2023-12-20 | Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting | Junwu Zhang et.al. | 2312.13271 | link |
2023-12-20 | Conditional Image Generation with Pretrained Generative Model | Rajesh Shrestha et.al. | 2312.13253 | null |
2023-12-20 | Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model | Saurabh Saxena et.al. | 2312.13252 | null |
2023-12-20 | Diffusion Models With Learned Adaptive Noise | Subham Sekhar Sahoo et.al. | 2312.13236 | link |
2023-12-20 | DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis | Yuming Gu et.al. | 2312.13016 | link |
2023-12-20 | RadEdit: stress-testing biomedical vision models via diffusion image editing | Fernando Pérez-García et.al. | 2312.12865 | null |
2023-12-20 | ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement | Yuhui Wu et.al. | 2312.12826 | null |
2023-12-20 | All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models | Seunghoo Hong et.al. | 2312.12807 | null |
2023-12-21 | AMD:Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion | Beibei Jing et.al. | 2312.12763 | null |
2023-12-20 | How Good Are Deep Generative Models for Solving Inverse Problems? | Shichong Peng et.al. | 2312.12691 | null |
2023-12-19 | On Inference Stability for Diffusion Models | Viet Nguyen et.al. | 2312.12431 | link |
2023-12-19 | Scene-Conditional 3D Object Stylization and Composition | Jinghao Zhou et.al. | 2312.12419 | null |
2023-12-19 | Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models | Shweta Mahajan et.al. | 2312.12416 | null |
2023-12-19 | Travelling pulses on three spatial scales in a Klausmeier-type vegetation-autotoxicity model | Paul Carter et.al. | 2312.12277 | null |
2023-12-19 | Intrinsic Image Diffusion for Single-view Material Estimation | Peter Kocsis et.al. | 2312.12274 | link |
2023-12-19 | Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model | Lingjun Zhang et.al. | 2312.12232 | link |
2023-12-19 | HuTuMotion: Human-Tuned Navigation of Latent Motion Diffusion Models with Minimal Feedback | Gaoge Han et.al. | 2312.12227 | null |
2023-12-19 | FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning | Zhenhua Yang et.al. | 2312.12142 | link |
2023-12-19 | GazeMoDiff: Gaze-guided Diffusion Model for Stochastic Human Motion Prediction | Haodong Yan et.al. | 2312.12090 | null |
2023-12-19 | Learning Subject-Aware Cropping by Outpainting Professional Photos | James Hong et.al. | 2312.12080 | null |
2023-12-18 | A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm | Yong Niu et.al. | 2312.10885 | null |
2023-12-17 | Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models | Nikita Starodubcev et.al. | 2312.10835 | link |
2023-12-17 | CogCartoon: Towards Practical Story Visualization | Zhongyang Zhu et.al. | 2312.10718 | null |
2023-12-17 | VidToMe: Video Token Merging for Zero-Shot Video Editing | Xirui Li et.al. | 2312.10656 | link |
2023-12-16 | VecFusion: Vector Font Generation with Diffusion | Vikas Thamizharasan et.al. | 2312.10540 | null |
2023-12-16 | A Unified Filter Method for Jointly Estimating State and Parameters of Stochastic Dynamical Systems via the Ensemble Score Filter | Feng Bao et.al. | 2312.10503 | null |
2023-12-16 | Continuous Diffusion for Mixed-Type Tabular Data | Markus Mueller et.al. | 2312.10431 | link |
2023-12-16 | Lecture Notes in Probabilistic Diffusion Models | Inga Strümke et.al. | 2312.10393 | null |
2023-12-16 | Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge | Conghan Yue et.al. | 2312.10299 | link |
2023-12-15 | Two simple criterion to prove the existence of patterns in reaction-diffusion models of two components | Francisco J. Vielma-Leal et.al. | 2312.10231 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-16 | Bias for Action: Video Implicit Neural Representations with Bias Modulation | Alper Kayabasi et.al. | 2501.09277 | null |
2025-01-15 | Dynamic-Aware Spatio-temporal Representation Learning for Dynamic MRI Reconstruction | Dayoung Baik et.al. | 2501.09049 | null |
2025-01-13 | Implicit Neural Representations for Registration of Left Ventricle Myocardium During a Cardiac Cycle | Mathias Micheelsen Lowes et.al. | 2501.07248 | link |
2025-01-14 | Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution | Du Chen et.al. | 2501.06838 | null |
2025-01-07 | NeuralSVG: An Implicit Representation for Text-to-Vector Generation | Sagi Polaczek et.al. | 2501.03992 | null |
2025-01-06 | Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks | Théophane Vallaeys et.al. | 2501.03078 | link |
2025-01-05 | MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance | Jialong Guo et.al. | 2501.02427 | null |
2025-01-03 | Few-shot Implicit Function Generation via Equivariance | Suizhi Huang et.al. | 2501.01601 | null |
2025-01-02 | Incomplete Data Multi-Source Static Computed Tomography Reconstruction with Diffusion Priors and Implicit Neural Representation | Ziju Shen et.al. | 2501.01013 | null |
2025-01-01 | CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation | Daniel Silver et.al. | 2501.00975 | null |
2024-12-19 | Quantum Implicit Neural Compression | Takuya Fujihashi et.al. | 2412.19828 | null |
2025-01-09 | STITCH: Surface reconstrucTion using Implicit neural representations with Topology Constraints and persistent Homology | Anushrut Jignasu et.al. | 2412.18696 | null |
2024-12-29 | PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models | Minghao Chen et.al. | 2412.18608 | null |
2025-01-04 | S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field | Zixi Liang et.al. | 2412.17561 | link |
2024-12-26 | LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction | Pou-Chun Kung et.al. | 2412.15447 | null |
2024-12-17 | iRBSM: A Deep Implicit 3D Breast Shape Model | Maximilian Weiherer et.al. | 2412.13244 | null |
2024-12-17 | Subspace Implicit Neural Representations for Real-Time Cardiac Cine MR Imaging | Wenqi Huang et.al. | 2412.12742 | null |
2024-12-15 | Semi-Implicit Neural Ordinary Differential Equations | Hong Zhang et.al. | 2412.11301 | link |
2024-12-11 | Implicit Neural Compression of Point Clouds | Hongning Ruan et.al. | 2412.10433 | null |
2024-12-13 | EVOS: Efficient Implicit Neural Training via EVOlutionary Selector | Weixiang Zhang et.al. | 2412.10153 | null |
2024-12-12 | Enhancing Implicit Neural Representations via Symmetric Power Transformation | Weixiang Zhang et.al. | 2412.09213 | link |
2024-12-11 | Unicorn: Unified Neural Image Compression with One Number Reconstruction | Qi Zheng et.al. | 2412.08210 | null |
2024-12-11 | INRetouch: Context Aware Implicit Neural Representation for Photography Retouching | Omar Elezabi et.al. | 2412.03848 | null |
2024-12-04 | HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution | Yuxuan Jiang et.al. | 2412.03748 | null |
2024-12-03 | Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance | Jing Zeng et.al. | 2412.02249 | null |
2024-12-02 | Efficient Compression of Sparse Accelerator Data Using Implicit Neural Representations and Importance Sampling | Xihaier Luo et.al. | 2412.01754 | link |
2024-12-02 | SUICA: Learning Super-high Dimensional Sparse Implicit Neural Representations for Spatial Transcriptomics | Qingtian Zhu et.al. | 2412.01124 | null |
2024-11-27 | Towards Lensless Image Deblurring with Prior-Embedded Implicit Neural Representations in the Low-Data Regime | Abeer Banerjee et.al. | 2411.18189 | null |
2024-11-27 | MeltpoolINR: Predicting temperature field, melt pool geometry, and their rate of change in laser powder bed fusion | Manav Manav et.al. | 2411.18048 | null |
2024-11-21 | Geometric Algebra Planes: Convex Implicit Neural Volumes | Irmak Sivgin et.al. | 2411.13525 | null |
2024-11-16 | Peizhe Xia et.al. | 2411.11906 | null | |
2024-11-20 | TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection | Mengxuan Li et.al. | 2411.11641 | link |
2024-11-18 | Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data | Jiayi Li et.al. | 2411.11356 | null |
2024-11-18 | Continuous K-space Recovery Network with Image Guidance for Fast MRI Reconstruction | Yucong Meng et.al. | 2411.11282 | null |
2024-11-17 | VeGaS: Video Gaussian Splatting | Weronika Smolak-Dyżewska et.al. | 2411.11024 | link |
2024-11-12 | Numerical Homogenization by Continuous Super-Resolution | Zhi-Song Liu et.al. | 2411.07576 | null |
2024-11-10 | Local Implicit Wavelet Transformer for Arbitrary-Scale Super-Resolution | Minghong Duan et.al. | 2411.06442 | link |
2024-11-09 | HiHa: Introducing Hierarchical Harmonic Decomposition to Implicit Neural Compression for Atmospheric Data | Zhewen Xu et.al. | 2411.06155 | null |
2024-11-07 | LoFi: Scalable Local Image Reconstruction with Implicit Neural Representation | AmirEhsan Khorashadizadeh et.al. | 2411.04995 | link |
2024-11-07 | VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes | Advaith V. Sethuraman et.al. | 2411.04963 | null |
2024-11-06 | Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey | Amer Essakine et.al. | 2411.03688 | null |
2024-10-31 | MS-Glance: Non-semantic context vectors and the applications in supervising image reconstruction | Ziqi Gao et.al. | 2410.23577 | link |
2024-10-30 | Understanding Representation of Deep Equilibrium Models from Neural Collapse Perspective | Haixiang Sun et.al. | 2410.23391 | null |
2024-10-29 | Predicting the Encoding Error of SIRENs | Jeremy Vonderfecht et.al. | 2410.21645 | null |
2024-10-29 | Neural Experts: Mixture of Experts for Implicit Neural Representations | Yizhak Ben-Shabat et.al. | 2410.21643 | null |
2024-10-29 | EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior | Xin Xiang et.al. | 2410.20981 | null |
2024-10-16 | Radon Implicit Field Transform (RIFT): Learning Scenes from Radar Signals | Daqian Bao et.al. | 2410.19801 | null |
2024-10-25 | ST-NeRP: Spatial-Temporal Neural Representation Learning with Prior Embedding for Patient-specific Imaging Study | Liang Qiu et.al. | 2410.19283 | null |
2024-10-24 | Environment Maps Editing using Inverse Rendering and Adversarial Implicit Functions | Antonio D'Orazio et.al. | 2410.18622 | null |
2024-10-22 | Scalable Implicit Graphon Learning | Ali Azizpour et.al. | 2410.17464 | link |
2024-10-19 | Implicit neural representation for free-breathing MR fingerprinting (INR-MRF): co-registered 3D whole-liver water T1, water T2, proton density fat fraction, and R2 mapping* | Chao Li et.al. | 2410.15175 | null |
2024-10-17 | Object Pose Estimation Using Implicit Representation For Transparent Objects | Varun Burde et.al. | 2410.13465 | null |
2024-10-17 | Inductive Gradient Adjustment For Spectral Bias In Implicit Neural Representations | Kexuan Shi et.al. | 2410.13271 | null |
2024-10-16 | Optimizing 3D Geometry Reconstruction from Implicit Neural Representations | Shen Fan et.al. | 2410.12725 | null |
2024-10-16 | MING: A Functional Approach to Learning Molecular Generative Models | Van Khoa Nguyen et.al. | 2410.12522 | null |
2024-10-14 | StegaINR4MIH: steganography by implicit neural representation for multi-image hiding | Weina Dong et.al. | 2410.10117 | link |
2024-10-13 | Magnituder Layers for Implicit Neural Representations in 3D | Sang Min Kim et.al. | 2410.09771 | null |
2024-10-18 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
2024-10-09 | DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation | Zhiqi Li et.al. | 2410.06756 | null |
2024-10-08 | Training Stiff Neural Ordinary Differential Equations with Implicit Single-Step Methods | Colby Fronk et.al. | 2410.05592 | null |
2024-10-11 | Implicitly Learned Neural Phase Functions for Basis-Free Point Spread Function Engineering | Aleksey Valouev et.al. | 2410.05413 | null |
2024-10-08 | FreSh: Frequency Shifting for Accelerated Neural Representation Learning | Adam Kania et.al. | 2410.05050 | link |
2024-10-07 | H-SIREN: Improving implicit neural representations with hyperbolic periodic functions | Rui Gao et.al. | 2410.04716 | null |
2024-10-07 | Neural Fourier Modelling: A Highly Compact Approach to Time-Series Analysis | Minjung Kim et.al. | 2410.04703 | link |
2024-10-07 | SegINR: Segment-wise Implicit Neural Representation for Sequence Alignment in Neural Text-to-Speech | Minchan Kim et.al. | 2410.04690 | null |
2024-10-04 | Shrinking: Reconstruction of Parameterized Surfaces from Signed Distance Fields | Haotian Yin et.al. | 2410.03123 | null |
2024-10-03 | On Logical Extrapolation for Mazes with Recurrent and Implicit Networks | Brandon Knutson et.al. | 2410.03020 | link |
2024-10-02 | MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis | Xiaobiao Du et.al. | 2410.02103 | link |
2024-10-03 | Releasing the Parameter Latency of Neural Representation for High-Efficiency Video Compression | Gai Zhang et.al. | 2410.01654 | null |
2024-10-02 | Coordinate-Based Neural Representation Enabling Zero-Shot Learning for 3D Multiparametric Quantitative MRI | Guoyan Lao et.al. | 2410.01577 | null |
2024-10-02 | MiraGe: Editable 2D Images using Gaussian Splatting | Joanna Waczyńska et.al. | 2410.01521 | link |
2024-09-30 | WildFusion: Multimodal Implicit 3D Reconstructions in the Wild | Yanbaihui Liu et.al. | 2409.19904 | null |
2024-09-28 | Towards Croppable Implicit Neural Representations | Maor Ashkenazi et.al. | 2409.19472 | link |
2024-09-28 | Fast Encoding and Decoding for Implicit Video Representation | Hao Chen et.al. | 2409.19429 | null |
2024-09-27 | Neural Video Representation for Redundancy Reduction and Consistency Preservation | Taiga Hayami et.al. | 2409.18497 | null |
2024-09-25 | Implicit Neural Representations for Simultaneous Reduction and Continuous Reconstruction of Multi-Altitude Climate Data | Alif Bin Abdul Qayyum et.al. | 2409.17367 | link |
2024-09-25 | Streaming Neural Images | Marcos V. Conde et.al. | 2409.17134 | null |
2024-09-25 | Moner: Motion Correction in Undersampled Radial MRI with Unsupervised Neural Representation | Qing Wu et.al. | 2409.16921 | null |
2024-09-25 | Ring Artifacts Removal Based on Implicit Neural Representation of Sinogram Data | Ligen Shi et.al. | 2409.15731 | null |
2024-09-21 | Implicit Neural Representations for Speed-of-Sound Estimation in Ultrasound | Michal Byra et.al. | 2409.14035 | null |
2024-09-21 | MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors | Zhenhua Du et.al. | 2409.14019 | null |
2024-09-20 | Occupancy-Based Dual Contouring | Jisung Hwang et.al. | 2409.13418 | link |
2024-09-19 | Breaking the Barriers of One-to-One Usage of Implicit Neural Representation in Image Compression: A Linear Combination Approach with Performance Guarantees | Sai Sanjeet et.al. | 2409.13117 | link |
2024-09-18 | Intraoperative Registration by Cross-Modal Inverse Neural Rendering | Maximilian Fehrentz et.al. | 2409.11983 | null |
2024-09-18 | Monomial Matrix Group Equivariant Neural Functional Networks | Hoang V. Tran et.al. | 2409.11697 | link |
2024-09-17 | Compact Implicit Neural Representations for Plane Wave Images | Mathilde Monvoisin et.al. | 2409.11370 | null |
2024-09-17 | SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction | Marko Mihajlovic et.al. | 2409.11211 | null |
2024-09-17 | Neural Fields for Adaptive Photoacoustic Computed Tomography | Tianao Li et.al. | 2409.10876 | null |
2024-09-18 | Single-Layer Learnable Activation for Implicit Neural Representation (SL |
Moein Heidari et.al. | 2409.10836 | null |
2024-09-15 | Learning Transferable Features for Implicit Neural Representations | Kushal Vyas et.al. | 2409.09566 | null |
2024-09-14 | Estimating Neural Orientation Distribution Fields on High Resolution Diffusion MRI Scans | Mohammed Munzer Dwedari et.al. | 2409.09387 | link |
2024-09-20 | Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks | Ali Mehrabian et.al. | 2409.09323 | link |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2024-09-11 | NVRC: Neural Video Representation Compression | Ho Man Kwan et.al. | 2409.07414 | null |
2024-09-11 | AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution | Wangduo Xie et.al. | 2409.07171 | null |
2024-09-11 | Fast Medical Shape Reconstruction via Meta-learned Implicit Neural Representations | Gaia Romana De Paolis et.al. | 2409.07100 | null |
2024-09-10 | A Latent Implicit 3D Shape Model for Multiple Levels of Detail | Benoit Guillard et.al. | 2409.06231 | null |
2024-09-09 | G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis | Lutao Jiang et.al. | 2409.05617 | null |
2024-09-06 | NeCA: 3D Coronary Artery Tree Reconstruction from Two 2D Projections by Neural Implicit Representation | Yiying Wang et.al. | 2409.04596 | link |
2024-09-10 | Diff-INR: Generative Regularization for Electrical Impedance Tomography | Bowen Tong et.al. | 2409.04494 | null |
2024-09-02 | SeCo-INR: Semantically Conditioned Implicit Neural Representations for Improved Medical Image Super-Resolution | Mevan Ekanayake et.al. | 2409.01013 | null |
2024-09-02 | PNVC: Towards Practical INR-based Video Compression | Ge Gao et.al. | 2409.00953 | null |
2024-08-29 | RMMI: Enhanced Obstacle Avoidance for Reactive Mobile Manipulation using an Implicit Neural Map | Nicolas Marticorena et.al. | 2408.16206 | null |
2024-08-20 | NeR-VCP: A Video Content Protection Method Based on Implicit Neural Representation | Yangping Lin et.al. | 2408.15281 | null |
2024-08-27 | Few-Shot Unsupervised Implicit Neural Shape Representation Learning with Spatial Adversaries | Amine Ouasfi et.al. | 2408.15114 | null |
2024-08-27 | Depth Restoration of Hand-Held Transparent Objects for Human-to-Robot Handover | Ran Yu et.al. | 2408.14997 | null |
2024-08-27 | OctFusion: Octree-based Diffusion Models for 3D Shape Generation | Bojun Xiong et.al. | 2408.14732 | link |
2024-08-25 | FreqINR: Frequency Consistency for Implicit Neural Representation with Adaptive DCT Frequency Loss | Meiyi Wei et.al. | 2408.13716 | null |
2024-08-23 | S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points | Bing He et.al. | 2408.13036 | link |
2024-08-16 | Modeling the Neonatal Brain Development Using Implicit Neural Representations | Florentin Bieder et.al. | 2408.08647 | link |
2024-08-16 | Reference-free Axial Super-resolution of 3D Microscopy Images using Implicit Neural Representation with a 2D Diffusion Prior | Kyungryun Lee et.al. | 2408.08616 | link |
2024-08-12 | Implicit Neural Representation For Accurate CFD Flow Field Prediction | Laurent de Vito et.al. | 2408.06486 | null |
2024-08-12 | Uncertainty-Informed Volume Visualization using Implicit Neural Representation | Shanu Saklani et.al. | 2408.06018 | null |
2024-08-10 | Residual-INR: Communication Efficient On-Device Learning Using Implicit Neural Representation | Hanqiu Chen et.al. | 2408.05617 | link |
2024-08-20 | Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE | Yiying Yang et.al. | 2408.05477 | null |
2024-08-09 | EclipseNETs: a differentiable description of irregular eclipse conditions | Giacomo Acciarini et.al. | 2408.05387 | null |
2024-08-07 | PHOCUS: Physics-Based Deconvolution for Ultrasound Resolution Enhancement | Felix Duelmer et.al. | 2408.03657 | link |
2024-08-06 | Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement | Hao Xu et.al. | 2408.02966 | null |
2024-08-05 | Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics | Shishira R Maiya et.al. | 2408.02672 | null |
2024-08-04 | AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos | Feichi Lu et.al. | 2408.02110 | null |
2024-08-05 | UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization | Ziwen Guo et.al. | 2408.00860 | null |
2024-07-30 | Neural Fields for Continuous Periodic Motion Estimation in 4D Cardiovascular Imaging | Simone Garzia et.al. | 2407.20728 | null |
2024-07-29 | Registering Neural 4D Gaussians for Endoscopic Surgery | Yiming Huang et.al. | 2407.20213 | null |
2024-07-29 | Aero-Nef: Neural Fields for Rapid Aircraft Aerodynamics Simulations | Giovanni Catalani et.al. | 2407.19916 | link |
2024-07-28 | UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation | Shuang Wu et.al. | 2407.19542 | link |
2024-07-28 | FINER++: Building a Family of Variable-periodic Functions for Activating Implicit Neural Representation | Hao Zhu et.al. | 2407.19434 | null |
2024-07-26 | ObjectCarver: Semi-automatic segmentation, reconstruction and separation of 3D objects | Gemmechu Hassena et.al. | 2407.19108 | null |
2024-07-26 | Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations | Zipeng Wang et.al. | 2407.18500 | null |
2024-07-25 | GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution | Jintong Hu et.al. | 2407.18046 | null |
2024-07-23 | Uncertainty-Aware Deep Neural Representations for Visual Analysis of Vector Field Data | Atul Kumar et.al. | 2407.16119 | null |
2024-07-22 | Attention Beats Linear for Fast Implicit Neural Representation Generation | Shuyi Zhang et.al. | 2407.15355 | link |
2024-07-19 | SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization | Mae Younes et.al. | 2407.14257 | null |
2024-07-18 | DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays | Xuhui Liu et.al. | 2407.13545 | null |
2024-07-18 | Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM | Baicheng Li et.al. | 2407.13338 | null |
2024-07-17 | A Resolution Independent Neural Operator | Bahador Bahmani et.al. | 2407.13010 | null |
2024-07-17 | Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations | Tomáš Chobola et.al. | 2407.12511 | link |
2024-07-18 | IPA-NeRF: Illusory Poisoning Attack Against Neural Radiance Fields | Wenxiang Jiang et.al. | 2407.11921 | link |
2024-07-12 | Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending | Delong Wu et.al. | 2407.08457 | null |
2024-07-09 | PDEformer-1: A Foundation Model for One-Dimensional Partial Differential Equations | Zhanhong Ye et.al. | 2407.06664 | null |
2024-07-09 | Implicit Regression in Subspace for High-Sensitivity CEST Imaging | Chu Chen et.al. | 2407.06614 | null |
2024-07-08 | LINEAR: Learning Implicit Neural Representation With Explicit Physical Priors for Accelerated Quantitative T1rho Mapping | Yuanyuan Liu et.al. | 2407.05617 | null |
2024-07-03 | IM-MoCo: Self-supervised MRI Motion Correction using Motion-Guided Implicit Neural Representations | Ziad Al-Haj Hemidi et.al. | 2407.02974 | link |
2024-07-03 | Highly Accelerated MRI via Implicit Neural Representation Guided Posterior Sampling of Diffusion Models | Jiayue Chu et.al. | 2407.02744 | null |
2024-07-03 | BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Wenpu Li et.al. | 2407.02174 | link |
2024-07-04 | UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks | Jingjing Ren et.al. | 2407.02158 | null |
2024-07-07 | Learning 3D Gaussians for Extremely Sparse-View Cone-Beam CT Reconstruction | Yiqun Lin et.al. | 2407.01090 | link |
2024-06-27 | PNeRV: A Polynomial Neural Representation for Videos | Sonam Gupta et.al. | 2406.19299 | null |
2024-06-25 | Efficient and Effective Implicit Dynamic Graph Neural Network | Yongjian Zhong et.al. | 2406.17894 | link |
2024-06-25 | Sparse-view Signal-domain Photoacoustic Tomography Reconstruction Method Based on Neural Representation | Bowei Yao et.al. | 2406.17578 | null |
2024-06-21 | CoCPF: Coordinate-based Continuous Projection Field for Ill-Posed Inverse Problem in Imaging | Zixuan Chen et.al. | 2406.14976 | null |
2024-06-19 | INFusion: Diffusion Regularized Implicit Neural Representations for 2D and 3D accelerated MRI reconstruction | Yamin Arefeen et.al. | 2406.13895 | null |
2024-06-19 | Enhance the Image: Super Resolution using Artificial Intelligence in MRI | Ziyu Li et.al. | 2406.13625 | null |
2024-06-13 | CodedEvents: Optimal Point-Spread-Function Engineering for 3D-Tracking with Event Cameras | Sachin Shah et.al. | 2406.09409 | null |
2024-06-13 | OpenMaterial: A Comprehensive Dataset of Complex Materials for 3D Reconstruction | Zheng Dang et.al. | 2406.08894 | null |
2024-06-13 | Generalizable Implicit Neural Representation As a Universal Spatiotemporal Traffic Data Learner | Tong Nie et.al. | 2406.08743 | null |
2024-06-11 | NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images | Yufei Han et.al. | 2406.07111 | null |
2024-06-09 | A Low Rank Neural Representation of Entropy Solutions | Donsub Rim et.al. | 2406.05694 | null |
2024-06-06 | Conv-INR: Convolutional Implicit Neural Representation for Multimodal Visual Signals | Zhicheng Cai et.al. | 2406.04249 | null |
2024-06-06 | Encoding Semantic Priors into the Weights of Implicit Neural Representation | Zhicheng Cai et.al. | 2406.04178 | null |
2024-06-06 | C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction | Yiqun Lin et.al. | 2406.03902 | link |
2024-06-06 | Quantum Implicit Neural Representations | Jiaming Zhao et.al. | 2406.03873 | link |
2024-06-04 | ReLUs Are Sufficient for Learning Implicit Neural Representations | Joseph Shenouda et.al. | 2406.02529 | link |
2024-06-04 | Image steganography based on generative implicit neural representation | Zhong Yangjie et.al. | 2406.01918 | link |
2024-06-01 | Modeling Randomly Observed Spatiotemporal Dynamical Systems | Valerii Iakovlev et.al. | 2406.00368 | null |
2024-05-31 | ImplicitTerrain: a Continuous Surface Model for Terrain Data Analysis | Haoan Feng et.al. | 2406.00227 | null |
2024-05-31 | MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | Sijin Chen et.al. | 2405.20853 | link |
2024-05-29 | Implicit Neural Image Field for Biological Microscopy Image Compression | Gaole Dai et.al. | 2405.19012 | link |
2024-05-28 | Towards a Sampling Theory for Implicit Neural Representations | Mahrokh Najaf et.al. | 2405.18410 | null |
2024-05-28 | A Grid-Free Fluid Solver based on Gaussian Spatial Representation | Jingrui Xing et.al. | 2405.18133 | null |
2024-05-27 | UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation | Runzhao Yang et.al. | 2405.16850 | null |
2024-06-04 | Extreme Compression of Adaptive Neural Images | Leo Hoshikawa et.al. | 2405.16807 | null |
2024-05-27 | Transport of Algebraic Structure to Latent Embeddings | Samuel Pfrommer et.al. | 2405.16763 | link |
2024-05-24 | CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation | Xia Li et.al. | 2405.15385 | null |
2024-05-23 | Multi-view Remote Sensing Image Segmentation With SAM priors | Zipeng Qi et.al. | 2405.14171 | null |
2024-05-22 | HR-INR: Continuous Space-Time Video Super-Resolution via Event Camera | Yunfan Lu et.al. | 2405.13389 | null |
2024-05-20 | GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details | Boqian Li et.al. | 2405.12420 | link |
2024-05-20 | ASMR: Activation-sharing Multi-resolution Coordinate Networks For Efficient Inference | Jason Chun Lok Li et.al. | 2405.12398 | link |
2024-05-19 | Point Cloud Compression with Implicit Neural Representations: A Unified Framework | Hongning Ruan et.al. | 2405.11493 | null |
2024-05-18 | HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos | Qifeng Chen et.al. | 2405.11270 | null |
2024-05-17 | Nonparametric Teaching of Implicit Neural Representations | Chen Zhang et.al. | 2405.10531 | link |
2024-05-14 | Achieving Resolution-Agnostic DNN-based Image Watermarking:A Novel Perspective of Implicit Neural Representation | Yuchen Wang et.al. | 2405.08340 | null |
2024-05-11 | Unsupervised Density Neural Representation for CT Metal Artifact Reduction | Qing Wu et.al. | 2405.07047 | null |
2024-05-10 | I3DGS: Improve 3D Gaussian Splatting from Multiple Dimensions | Jinwei Lin et.al. | 2405.06408 | null |
2024-05-10 | Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera | Haixin Shi et.al. | 2405.05858 | null |
2024-05-09 | NeuRSS: Enhancing AUV Localization and Bathymetric Mapping with Neural Rendering for Sidescan SLAM | Yiping Xie et.al. | 2405.05807 | null |
2024-05-09 | Radar Fields: Frequency-Space Neural Scene Representations for FMCW Radar | David Borts et.al. | 2405.04662 | null |
2024-05-06 | 3D LiDAR Mapping in Dynamic Environments Using a 4D Implicit Neural Representation | Xingguang Zhong et.al. | 2405.03388 | link |
2024-05-06 | Spatiotemporal Implicit Neural Representation as a Generalized Traffic Data Learner | Tong Nie et.al. | 2405.03185 | link |
2024-05-03 | Implicit Neural Representations for Robust Joint Sparse-View CT Reconstruction | Jiayang Shi et.al. | 2405.02509 | null |
2024-05-01 | Continuous sPatial-Temporal Deformable Image Registration (CPT-DIR) for motion modelling in radiotherapy: beyond classic voxel-based methods | Xia Li et.al. | 2405.00430 | null |
2024-04-29 | Distributed Stochastic Optimization of a Neural Representation Network for Time-Space Tomography Reconstruction | K. Aditya Mohan et.al. | 2404.19075 | null |
2024-04-27 | DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction | Chenhe Du et.al. | 2404.17890 | null |
2024-04-25 | Latent Modulated Function for Computational Optimal Continuous Image Representation | Zongyao He et.al. | 2404.16451 | link |
2024-04-23 | Fourier-enhanced Implicit Neural Fusion Network for Multispectral and Hyperspectral Image Fusion | Yu-Jie Liang et.al. | 2404.15174 | null |
2024-04-23 | HOIN: High-Order Implicit Neural Representations | Yang Chen et.al. | 2404.14674 | null |
2024-04-22 | Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Eric Brachmann et.al. | 2404.14351 | null |
2024-04-18 | Mapping back and forth between model predictive control and neural networks | Ross Drummond et.al. | 2404.12030 | null |
2024-04-16 | Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration | Jing Zeng et.al. | 2404.10218 | null |
2024-04-15 | Q2A: Querying Implicit Fully Continuous Feature Pyramid to Align Features for Medical Image Segmentation | Jiahao Yu et.al. | 2404.09472 | null |
2024-04-03 | Dynamic Neural Control Flow Execution: An Agent-Based Deep Equilibrium Approach for Binary Vulnerability Detection | Litao Li et.al. | 2404.08562 | null |
2024-04-09 | Studying the Impact of Latent Representations in Implicit Neural Networks for Scientific Continuous Field Reconstruction | Wei Xu et.al. | 2404.06418 | null |
2024-04-03 | JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients | Woo Kyoung Han et.al. | 2404.05558 | link |
2024-04-07 | CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data | Wei Fang et.al. | 2404.04878 | null |
2024-04-05 | Rethinking Non-Negative Matrix Factorization with Implicit Neural Representations | Krishna Subramani et.al. | 2404.04439 | link |
2024-04-05 | Deep Phase Coded Image Prior | Nimrod Shabtay et.al. | 2404.03906 | null |
2024-04-04 | CSR-dMRI: Continuous Super-Resolution of Diffusion MRI with Anatomical Structure-assisted Implicit Neural Representation Learning | Ruoyou Wu et.al. | 2404.03209 | null |
2024-04-03 | Unsupervised Occupancy Learning from Sparse Point Cloud | Amine Ouasfi et.al. | 2404.02759 | null |
2024-04-02 | Unmasking Correlations in Nuclear Cross Sections with Graph Neural Networks | Sinjini Mitra et.al. | 2404.02332 | null |
2024-04-02 | Federated Multi-Agent Mapping for Planetary Exploration | Tiberiu-Ioan Szatmari et.al. | 2404.02289 | null |
2024-04-02 | Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining | Xiang Chen et.al. | 2404.01547 | link |
2024-03-29 | NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising | Tianchen Deng et.al. | 2403.20034 | link |
2024-03-28 | Benchmarking Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAM | Tongyan Hua et.al. | 2403.19473 | link |
2024-03-28 | D'OH: Decoder-Only random Hypernetworks for Implicit Neural Representations | Cameron Gordon et.al. | 2403.19163 | null |
2024-03-25 | INPC: Implicit Neural Point Clouds for Radiance Field Rendering | Florian Hahlbohm et.al. | 2403.16862 | null |
2024-03-23 | DS-NeRV: Implicit Neural Video Representation with Decomposed Static and Dynamic Codes | Hao Yan et.al. | 2403.15679 | null |
2024-03-21 | Toward Multi-class Anomaly Detection: Exploring Class-aware Unified Model against Inter-class Interference | Xi Jiang et.al. | 2403.14213 | null |
2024-03-20 | Visual Imitation Learning of Task-Oriented Object Grasping and Rearrangement | Yichen Cai et.al. | 2403.14000 | null |
2024-03-20 | MIMO Channel as a Neural Function: Implicit Neural Representations for Extreme CSI Compression in Massive MIMO Systems | Haotian Wu et.al. | 2403.13615 | null |
2024-03-19 | VQ-NeRV: A Vector Quantized Neural Representation for Videos | Yunjie Xu et.al. | 2403.12401 | link |
2024-03-18 | Reachability-based Trajectory Design via Exact Formulation of Implicit Neural Signed Distance Functions | Jonathan Michaux et.al. | 2403.12280 | null |
2024-03-20 | Graph Neural Networks for Learning Equivariant Representations of Neural Networks | Miltiadis Kofinas et.al. | 2403.12143 | link |
2024-03-18 | 3DGS-Calib: 3D Gaussian Splatting for Multimodal SpatioTemporal Calibration | Quentin Herau et.al. | 2403.11577 | null |
2024-03-17 | STAIR: Semantic-Targeted Active Implicit Reconstruction | Liren Jin et.al. | 2403.11233 | link |
2024-03-16 | MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections | Mude Hui et.al. | 2403.10815 | link |
2024-03-15 | SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians | Hiba Dahmani et.al. | 2403.10427 | null |
2024-03-15 | Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder | Jinseok Kim et.al. | 2403.10255 | null |
2024-03-14 | SketchINR: A First Look into Sketches as Implicit Neural Representations | Hmrishav Bandyopadhyay et.al. | 2403.09344 | link |
2024-03-13 | Representing Anatomical Trees by Denoising Diffusion of Implicit Neural Fields | Ashish Sinha et.al. | 2403.08974 | link |
2024-03-13 | A Novel Implicit Neural Representation for Volume Data | Armin Sheibanifard et.al. | 2403.08566 | null |
2024-03-14 | GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting | Xinjie Zhang et.al. | 2403.08551 | link |
2024-03-13 | CINA: Conditional Implicit Neural Atlas for Spatio-Temporal Representation of Fetal Brains | Maik Dannecker et.al. | 2403.08550 | null |
2024-03-11 | Multi-Scale Implicit Transformer with Re-parameterize for Arbitrary-Scale Super-Resolution | Jinchen Zhu et.al. | 2403.06536 | null |
2024-03-09 | Fast Kernel Scene Flow | Xueqian Li et.al. | 2403.05896 | link |
2024-03-04 | Ice-Tide: Implicit Cryo-ET Imaging and Deformation Estimation | Valentin Debarnot et.al. | 2403.02182 | link |
2024-02-28 | NERV++: An Enhanced Implicit Neural Video Representation | Ahmed Ghorbel et.al. | 2402.18305 | null |
2024-03-08 | Boosting Neural Representations for Videos with a Conditional Decoder | Xinjie Zhang et.al. | 2402.18152 | link |
2024-02-27 | LoDIP: Low light phase retrieval with deep image prior | Raunak Manekar et.al. | 2402.17745 | null |
2024-02-27 | Mesh-Agnostic Decoders for Supercritical Airfoil Prediction and Inverse Design | Runze Li et.al. | 2402.17299 | null |
2024-02-26 | Neural Mesh Fusion: Unsupervised 3D Planar Surface Understanding | Farhad G. Zanjani et.al. | 2402.16739 | null |
2024-02-23 | Smooth and Sparse Latent Dynamics in Operator Learning with Jerk Regularization | Xiaoyu Xie et.al. | 2402.15636 | null |
2024-02-22 | CoLoRA: Continuous low-rank adaptation for reduced implicit neural modeling of parameterized partial differential equations | Jules Berman et.al. | 2402.14646 | link |
2024-02-21 | Improving Efficiency of Iso-Surface Extraction on Implicit Neural Representations Using Uncertainty Propagation | Haoyu Li et.al. | 2402.13861 | null |
2024-02-21 | SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields | Zhentao Huang et.al. | 2402.13510 | null |
2024-03-02 | NeRF Solves Undersampled MRI Reconstruction | Tae Jun Jang et.al. | 2402.13226 | null |
2024-02-20 | PDEformer: Towards a Foundation Model for One-Dimensional Partial Differential Equations | Zhanhong Ye et.al. | 2402.12652 | null |
2024-02-14 | DUDF: Differentiable Unsigned Distance Fields with Hyperbolic Scaling | Miguel Fainstein et.al. | 2402.08876 | link |
2024-02-13 | Preconditioners for the Stochastic Training of Implicit Neural Representations | Shin-Fang Chng et.al. | 2402.08784 | null |
2024-02-13 | Pix2Code: Learning to Compose Neural Visual Concepts as Programs | Antonia Wüst et.al. | 2402.08280 | link |
2024-02-10 | Training dynamics in Physics-Informed Neural Networks with feature mapping | Chengxi Zeng et.al. | 2402.06955 | link |
2024-02-08 | A Sampling Theory Perspective on Activations for Implicit Neural Representations | Hemanth Saratchandran et.al. | 2402.05427 | null |
2024-02-06 | OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving | Guohang Yan et.al. | 2402.03830 | link |
2024-02-05 | Deep Equilibrium Models are Almost Equivalent to Not-so-deep Explicit Models for High-dimensional Gaussian Mixtures | Zenan Ling et.al. | 2402.02697 | link |
2024-02-03 | Implicit Neural Representation of Tileable Material Textures | Hallison Paz et.al. | 2402.02208 | null |
2024-02-02 | Immersive Video Compression using Implicit Neural Representations | Ho Man Kwan et.al. | 2402.01596 | link |
2024-02-11 | Neural Trajectory Model: Implicit Neural Trajectory Representation for Trajectories Generation | Zihan Yu et.al. | 2402.01254 | link |
**202 |