001-nnFormer: Volumetric Medical Image Segmentation via a 3D Transformer |
Report |
Code |
Ref |
002-Flattening-Net: Deep Regular 2D Representation for 3D Point Cloud Analysis |
Report |
Code |
Ref |
003-Efficient Memory Management for Large Language Model Serving with PagedAttention |
Report |
Code |
Ref |
004-HS2P: Hierarchical Spectral and Structure-Preserving Fusion Network for Multimodal Remote Sensing Image Cloud and Shadow Removal |
Report |
Code |
Ref |
005-Rethink the Scan in MVCC Databases |
Report |
Code |
Ref |
006-Mask3D: Mask Transformer for 3D Instance Segmentation |
Report |
Code |
Ref |
007-Increased Alpha-Band Connectivity During Tic Suppression in Children with Tourette Syndrome Revealed by Source EEG Analyses |
Report |
Code |
Ref |
008-Personalized Federated Learning with Feature Alignment and Classifier Collaboration |
Report |
Code |
Ref |
009-OmniControl: Control Any Joint at Any Time for Human Motion Generation |
Report |
Code |
Ref |
010-SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation |
Report |
Code |
Ref |
011-GRACE: Loss-Resilient Real-Time Video through Neural Codecs |
Report |
Code |
Ref |
012-SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising |
Report |
Code |
Ref |
013-DiskANN: Fast Accurate Billion-point Nearest Neighbor Search on a Single Node |
Report |
Code |
Ref |
014-GradSafe: Detecting Jailbreak Prompts for LLMs via Safety-Critical Gradient Analysis |
Report |
Code |
Ref |
015-OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning |
Report |
Code |
Ref |
016-GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting |
Report |
Code |
Ref |
017-Instruction Backdoor Attacks Against Customized LLMs |
Report |
Code |
Ref |
018-Causal Incremental Graph Convolution for Recommender System Retraining |
Report |
Code |
Ref |
019-Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models |
Report |
Code |
Ref |
020-U-TILISE: A Sequence-to-sequence Model for Cloud Removal in Optical Satellite Time Series |
Report |
Code |
Ref |
021-4D Gaussian Splatting for Real-Time Dynamic Scene Rendering |
Report |
Code |
Ref |
022-Practical Cloud Storage Auditing using Serverless Computing |
Report |
Code |
Ref |
023-SELF-DISCOVER: Large Language Models Self-Compose Reasoning Structures |
Report |
Code |
Ref |
024-ProAgent: Building Proactive Cooperative Agents with Large Language Models |
Report |
Code |
Ref |
025-FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning |
Report |
Code |
Ref |
026-Identifying Sleep Spindles with Multichannel EEG and Classification Optimization |
Report |
Code |
Ref |
027-GraphZoom: A Multi-level Spectral Approach for Accurate and Scalable Graph Embedding |
Report |
Code |
Ref |
028-DETRs Beat YOLOs on Real-time Object Detection |
Report |
Code |
Ref |
029-Cluster-Based Graph Collaborative Filtering |
Report |
Code |
Ref |
030-Real3D-AD: A Dataset of Point Cloud Anomaly Detection |
Report |
Code |
Ref |
031-A ConvNet for the 2020s |
Report |
Code |
Ref |
032-CDNet: Centripetal Direction Network for Nuclear Instance Segmentation |
Report |
Code |
Ref |
033-Masked Autoencoders for Point Cloud Self-supervised Learning |
Report |
Code |
Ref |
034-QueryCheetah: Fast Automated Discovery of Attribute Inference Attacks Against Query-Based Systems |
Report |
Code |
Ref |
035-DCN-T: Dual Context Network With Transformer for Hyperspectral Image Classification |
Report |
Code |
Ref |
036-Graph Anomaly Detection via Multi-Scale Contrastive Learning Networks with Augmented View |
Report |
Code |
Ref |
037-Federated Learning over Wireless Networks: Optimization Model Design and Analysis |
Report |
Code |
Ref |
038-MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation |
Report |
Code |
Ref |
039-STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay |
Report |
Code |
Ref |
040-HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression |
Report |
Code |
Ref |
041-U-Net: Convolutional Networks for Biomedical Image Segmentation |
Report |
Code |
Ref |
042-FineSurE: Fine-grained Summarization Evaluation using LLMs |
Report |
Code |
Ref |
043-Temporally and Distributionally Robust Optimization for Cold-Start Recommendation |
Report |
Code |
Ref |
044-EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models |
Report |
Code |
Ref |
045-Recommender Systems with Generative Retrieval |
Report |
Code |
Ref |
046-Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models |
Report |
Code |
Ref |
047-SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation |
Report |
Code |
Ref |
048-Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction |
Report |
Code |
Ref |
049-StrongSORT: Make DeepSORT Great Again |
Report |
Code |
Ref |
050-Diversity of Information Pathways Drives Sparsity in Real-World Networks |
Report |
Code |
Ref |
051-Multimodal Industrial Anomaly Detection via Hybrid Fusion |
Report |
Code |
Ref |
052-SeGA: Preference-Aware Self-Contrastive Learning with Prompts for Anomalous User Detection on Twitter |
Report |
Code |
Ref |
053-Neural Network-Based Knowledge Transfer for Multitask Optimization |
Report |
Code |
Ref |
054-Reinforcement Learning Multiple Access for Heterogeneous Wireless Networks |
Report |
Code |
Ref |
055-Many-Objective Jaccard-based Evolutionary Feature Selection for High-Dimensional Imbalanced Data Classification |
Report |
Code |
Ref |
056-High-Fidelity Audio Compression with Improved RVQGAN |
Report |
Code |
Ref |
057-LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation |
Report |
Code |
Ref |
058-MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs |
Report |
Code |
Ref |
059-Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing Systems |
Report |
Code |
Ref |
060-InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models |
Report |
Code |
Ref |
061-An EEG Motor Imagery Dataset for Brain Computer Interface in Acute Stroke Patients |
Report |
Code |
Ref |
062-Vision-LSTM: xLSTM as Generic Vision Backbone |
Report |
Code |
Ref |
063-TGCA-PVT: Topic-Guided Context-Aware Pyramid Vision Transformer for Sticker Emotion Recognition |
Report |
Code |
Ref |
064-Transfer Learning in Cross-Domain Sequential Recommendation |
Report |
Code |
Ref |
065-Holodeck: Language Guided Generation of 3D Embodied AI Environments |
Report |
Code |
Ref |
066-Incentive-Aware Federated Learning with Training-Time Model Rewards |
Report |
Code |
Ref |
067-IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models |
Report |
Code |
Ref |
068-WeakPolyp: You Only Look Bounding Box for Polyp Segmentation |
Report |
Code |
Ref |
069-IMUNet: Efficient Regression Architecture for Inertial IMU Navigation and Positioning |
Report |
Code |
Ref |
070-Using Machine Learning Algorithms With In Situ Hyperspectral Reflectance Data to Assess Comprehensive Water Quality of Urban Rivers |
Report |
Code |
Ref |
071-Solving High-Dimensional Expensive Multiobjective Optimization Problems by Adaptive Decision Variable Grouping |
Report |
Code |
Ref |
072-Semantics-aware BERT for Language Understanding |
Report |
Code |
Ref |
073-Fast and Secure Distributed Nonnegative Matrix Factorization |
Report |
Code |
Ref |
074-TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation |
Report |
Code |
Ref |
075-iTransformer: Inverted Transformers Are Effective for Time Series Forecasting |
Report |
Code |
Ref |
076-Reconstructing Higher-Order Interactions in Coupled Dynamical Systems |
Report |
Code |
Ref |
077-Preserving Fairness Generalization in Deepfake Detection |
Report |
Code |
Ref |
078-AdaMotif: Graph Simplification via Adaptive Motif Design |
Report |
Code |
Ref |
079-Incentivizing Massive Unknown Workers for Budget-Limited Crowdsensing: From Off-Line and On-Line Perspectives |
Report |
Code |
Ref |
080-Dr.Jit: A Just-In-Time Compiler for Differentiable Rendering |
Report |
Code |
Ref |
081-ComplexGen: CAD Reconstruction by B-Rep Chain Complex Generation |
Report |
Code |
Ref |
082-M3TN: Multi-Gate Mixture-of-Experts Based Multi-Valued Treatment Network for Uplift Modeling |
Report |
Code |
Ref |
083-ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning |
Report |
Code |
Ref |
084-Variational Transformer Networks for Layout Generation |
Report |
Code |
Ref |
085-Tambur: Efficient Loss Recovery for Videoconferencing via Streaming Codes |
Report |
Code |
Ref |
086-Towards Practical Plug-and-Play Diffusion Model |
Report |
Code |
Ref |
087-Real-Time View Synthesis for Large Scenes with Millions of Square Meters |
Report |
Code |
Ref |
088-Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection |
Report |
Code |
Ref |
089-SALI: Short-term Alignment and Long-term Interaction Network for Colonoscopy Video Polyp Segmentation |
Report |
Code |
Ref |
090-CodeBERT: A Pre-Trained Model for Programming and Natural Languages |
Report |
Code |
Ref |
091-Effective Processor Verification with Logic Fuzzer Enhanced Co-simulation |
Report |
Code |
Ref |
092-AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning |
Report |
Code |
Ref |
093-MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion |
Report |
Code |
Ref |
094-A Deep Learning-Based Super-Resolution Method for Building Height Estimation at 2.5m Spatial Resolution in the Northern Hemisphere |
Report |
Code |
Ref |
095-DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization |
Report |
Code |
Ref |
096-Learning Smooth Humanoid Locomotion Through Lipschitz-Constrained Policies |
Report |
Code |
Ref |
097-TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation |
Report |
Code |
Ref |
098-EFVAE: Efficient Federated Variational Autoencoder for Collaborative Filtering |
Report |
Code |
Ref |
099-AlignMixup: Improving Representations By Interpolating Aligned Features |
Report |
Code |
Ref |
100-An Efficient and Adaptive Granular-Ball Generation Method in Classification Problem |
Report |
Code |
Ref |
101-EG-NAS: Neural Architecture Search with Fast Evolutionary Exploration |
Report |
Code |
Ref |
102-MVOT: Mahalanobis Distance based Multi-view Optimal Transport for Multi-view Crowd Localization |
Report |
Code |
Ref |
103-Learning to Prompt for Vision-Language Models |
Report |
Code |
Ref |
104-Bilateral Normal Integration |
Report |
Code |
Ref |
105-Prompt-to-Prompt Image Editing with Cross-Attention Control |
Report |
Code |
Ref |
106-BrainIB: Interpretable Brain Network-based Psychiatric Diagnosis with Graph Information Bottleneck |
Report |
Code |
Ref |
107-Semi-Supervised Dual-Stream Self-Attentive Adversarial Graph Contrastive Learning for Cross-Subject EEG-based Emotion Recognition |
Report |
Code |
Ref |
108-DreamFusion: Text-to-3D using 2D Diffusion |
Report |
Code |
Ref |
109-Drone Path Planning for Detecting Diff Scene |
Report |
Code |
Ref |
110-Iterative Poisson Surface Reconstruction (iPSR) for Unoriented Points |
Report |
Code |
Ref |
111-Urban Drone Occupancy Perception |
Report |
Code |
Ref |
112-MIMO Channel Estimation Using Score-Based Generative Models |
Report |
Code |
Ref |
113-Chatting Makes Perfect: Chat-based Image Retrieval |
Report |
Code |
Ref |
114-Fast Approximate Nearest Neighbor Search with The Navigating Spreading-out Graph |
Report |
Code |
Ref |
115-U-Net: Convolutional Networks for Biomedical Image Segmentation |
Report |
Code |
Ref |
116-Learn to Compress CSI and Allocate Resources in Vehicular Networks |
Report |
Code |
Ref |
117-Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning |
Report |
Code |
Ref |
118-Noise Learning of Instruments for High-Contrast, High-Resolution and Fast Hyperspectral Microscopy and Nanoscopy |
Report |
Code |
Ref |
119-Feature Fusion Based on Mutual-Cross-Attention Mechanism for EEG Emotion Recognition |
Report |
Code |
Ref |
120-A PPM-like, Tag-based Branch Predictor |
Report |
Code |
Ref |
121-Implicit Style-Content Separation using B-LoRA |
Report |
Code |
Ref |
122-A Case Study on Formal Equivalence Verification Between a C/C++ Model and Its RTL Design |
Report |
Code |
Ref |
123-MLDA-Net: Multi-Level Dual Attention-Based Network for Self-Supervised Monocular Depth Estimation |
Report |
Code |
Ref |
124-DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction |
Report |
Code |
Ref |
125-Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models |
Report |
Code |
Ref |
126-SVGDreamer: Text Guided SVG Generation with Diffusion Model |
Report |
Code |
Ref |
127-Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications |
Report |
Code |
Ref |
128-MENTOR: Multi-level Self-supervised Learning for Multimodal Recommendation |
Report |
Code |
Ref |
129-A Federated Feature Selection Algorithm Based on Particle Swarm Optimization under Privacy Protection |
Report |
Code |
Ref |
130-Sparse Enhanced Network: An Adversarial Generation Method for Robust Augmentation in Sequential Recommendation |
Report |
Code |
Ref |
131-DETRs Beat YOLOs on Real-time Object Detection |
Report |
Code |
Ref |
132-Graph Neural Networks for Scalable Radio Resource Management: Architecture Design and Theoretical Analysis |
Report |
Code |
Ref |
133-IoT Enabled Smart Lighting System using STM32 Microcontroller with High Performance ARM Cortex-M3 core |
Report |
Code |
Ref |
134-LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation |
Report |
Code |
Ref |
135-Variational Inference with Gaussian Score Matching |
Report |
Code |
Ref |
136-MambaHSI: Spatial–Spectral Mamba for Hyperspectral Image Classification |
Report |
Code |
Ref |
137-An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale |
Report |
Code |
Ref |
138-TrustGeo: Uncertainty-Aware Dynamic Graph Learning for Trustworthy IP Geolocation |
Report |
Code |
Ref |
139-SteganoGAN: High Capacity Image Steganography with GANs |
Report |
Code |
Ref |
140-ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction |
Report |
Code |
Ref |
141-Diffusion Policy: Visuomotor Policy Learning via Action Diffusion |
Report |
Code |
Ref |
142-Decomposition with Adaptive Composite Norm for Evolutionary Multi-Objective Combinatorial Optimization |
Report |
Code |
Ref |
143-Self-Propagation Graph Neural Network for Recommendation |
Report |
Code |
Ref |
144-A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution |
Report |
Code |
Ref |
145-Deep learning resilience inference for complex networked systems |
Report |
Code |
Ref |
146-Dual-Task Learning for Multi-Behavior Sequential Recommendation |
Report |
Code |
Ref |
147-Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting |
Report |
Code |
Ref |
148-Model-Contrastive Federated Learning |
Report |
Code |
Ref |
149-Adaptive Prescribed-Time Control for a Class of Uncertain Nonlinear Systems |
Report |
Code |
Ref |
150-Neighborhood Rough Set Based Heterogeneous Feature Subset Selection |
Report |
Code |
Ref |
151-DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control |
Report |
Code |
Ref |
152-Who Should Be Given Incentives? Counterfactual Optimal Treatment Regimes Learning for Recommendation |
Report |
Code |
Ref |
153-Diff-BGM: A Diffusion Model for Video Background Music Generation |
Report |
Code |
Ref |
154-Simple Semantic-Aided Few-Shot Learning |
Report |
Code |
Ref |
155-True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning |
Report |
Code |
Ref |
156-UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery |
Report |
Code |
Ref |
157-Predicting Lysine Methylation Sites using a Convolutional Neural Network |
Report |
Code |
Ref |
158-What Makes Good In-context Demonstrations for Code Intelligence Tasks with LLMs? |
Report |
Code |
Ref |
159-FloatZone: Accelerating Memory Error Detection using the Floating Point Unit |
Report |
Code |
Ref |
160-Debloating Address Sanitizer |
Report |
Code |
Ref |
161-DRCT: Saving Image Super-resolution away from Information Bottleneck |
Report |
Code |
Ref |
162-CLAP: Isolating Content from Style Through Contrastive Learning with Augmented Prompts |
Report |
Code |
Ref |
163-Long-Range Ambient LoRa Backscatter with Parallel Decoding |
Report |
Code |
Ref |
164-Towards Generalizable Neural Solvers for Vehicle Routing Problems via Ensemble with Transferrable Local Policy |
Report |
Code |
Ref |
165-Bypass and Insertion Algorithms for Exclusive Last-level Caches |
Report |
Code |
Ref |
166-MH-pFLID: Model Heterogeneous personalized Federated Learning via Injection and Distillation for Medical Data Analysis |
Report |
Code |
Ref |
167-Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization |
Report |
Code |
Ref |
168-MoME: Mixture of Multimodal Experts for Cancer Survival Prediction |
Report |
Code |
Ref |
169-Structure-aware Visualization Retrieval |
Report |
Code |
Ref |
170-LLM Embeddings Improve Test-time Adaptation to Y X-Shifts |
Report |
Code |
Ref |
171-InstanceDiffusion: Instance-level Control for Image Generation |
Report |
Code |
Ref |
172-MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality Hybrid |
Report |
Code |
Ref |
173-Robust Lightweight Facial Expression Recognition Network with Label Distribution Training |
Report |
Code |
Ref |
174-Intent Contrastive Learning with Cross Subsequences for Sequential Recommendation |
Report |
Code |
Ref |
175-DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models |
Report |
Code |
Ref |
176-Deciphering Spatial Domains from Spatial Multi-omics with SpatialGlue |
Report |
Code |
Ref |
177-Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform |
Report |
Code |
Ref |
178-RealFill: Reference-Driven Generation for Authentic Image Completion |
Report |
Code |
Ref |
179-Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt |
Report |
Code |
Ref |
180-Position-guided Text Prompt for Vision-Language Pre-training |
Report |
Code |
Ref |
181-F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching |
Report |
Code |
Ref |
182-YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss |
Report |
Code |
Ref |
183-RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning |
Report |
Code |
Ref |
184-Training Bayesian Neural Networks with Sparse Subspace Variational Inference |
Report |
Code |
Ref |
185-Object Detection in Hyperspectral Image via Unified Spectral-Spatial Feature Aggregation |
Report |
Code |
Ref |
186-SVGDreamer: Text Guided SVG Generation with Diffusion Model |
Report |
Code |
Ref |
187-Berti: an Accurate Local-Delta Data Prefetcher |
Report |
Code |
Ref |
188-Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective |
Report |
Code |
Ref |
189-Diffusion Policy: Visuomotor Policy Learning via Action Diffusion |
Report |
Code |
Ref |
190-Stepwise Reconstruction of Higher-Order Networks from Dynamics |
Report |
Code |
Ref |
191-Learning An Animatable Detailed 3D Face Model from in-the-wild Images |
Report |
Code |
Ref |
192-Triplet Spectralwise Transformer Network for Hyperspectral Target Detection |
Report |
Code |
Ref |
193-A ConvNet for the 2020s |
Report |
Code |
Ref |
194-NeurCADRecon: Neural Representation for Reconstructing CAD Surfaces by Enforcing Zero Gaussian Curvature |
Report |
Code |
Ref |
195-MambaIR: A Simple Baseline for Image Restoration with State-Space Model |
Report |
Code |
Ref |
196-Hybrid Federated Learning for Multimodal IoT Systems |
Report |
Code |
Ref |
197-Constrained Multiobjective Optimization via Multitasking and Knowledge Transfer |
Report |
Code |
Ref |
198-Scalable Sparse Subspace Clustering by Orthogonal Matching Pursuit |
Report |
Code |
Ref |
199-Residual Denoising Diffusion Models |
Report |
Code |
Ref |
200-I-Design: Personalized LLM Interior Designer |
Report |
Code |
Ref |
201-An Attentive Inductive Bias for Sequential Recommendation beyond the Self-Attention |
Report |
Code |
Ref |
202-Higher-order Granger Reservoir Computing: Simultaneously Achieving Scalable Complex Structures Inference and Accurate Dynamics Prediction |
Report |
Code |
Ref |
203-RealFill: Reference-Driven Generation for Authentic Image Completion |
Report |
Code |
Ref |
204-Efficient Test-Time Adaptation of Vision-Language Models |
Report |
Code |
Ref |
205-Evolutionary Bilevel Optimization via Multiobjective Transformation-Based Lower-Level Search |
Report |
Code |
Ref |
206-Variational AutoEncoder For Regression: Application to Brain Aging Analysis |
Report |
Code |
Ref |
207-PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning |
Report |
Code |
Ref |
208-Revisiting Adversarial Training under Long-Tailed Distributions |
Report |
Code |
Ref |
209-DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation |
Report |
Code |
Ref |
210-Vision-Language Models are Strong Noisy Label Detectors |
Report |
Code |
Ref |
211-Robust Quantum Dots Charge Autotuning using Neural Network Uncertainty |
Report |
Code |
Ref |
212-YOLO-World: Real-Time Open-Vocabulary Object Detection |
Report |
Code |
Ref |
213-L0 Gradient-Regularization and Scale Space Representation Model for Cartoon and Texture Decomposition |
Report |
Code |
Ref |
214-YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications |
Report |
Code |
Ref |
215-Deep Unfolded Simulated Bifurcation for Massive MIMO Signal Detection |
Report |
Code |
Ref |
216-LightGlue: Local Feature Matching at Light Speed |
Report |
Code |
Ref |
217-Multisurrogate-Assisted Ant Colony Optimization for Expensive Optimization Problems With Continuous and Categorical Variables |
Report |
Code |
Ref |
218-HAIChart: Human and AI Paired Visualization System |
Report |
Code |
Ref |
219-Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising |
Report |
Code |
Ref |
220-MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors |
Report |
Code |
Ref |
221-Earthformer: Exploring Space-Time Transformers for Earth System Forecasting |
Report |
Code |
Ref |
222-An End-to-End Transformer Model for Crowd Localization |
Report |
Code |
Ref |
223-DRAMsim3: A Cycle-Accurate, Thermal-Capable DRAM Simulator |
Report |
Code |
Ref |
224-DiffuRec: A Diffusion Model for Sequential Recommendation |
Report |
Code |
Ref |
225-Hairpin: Rethinking Packet Loss Recovery in Edge-based Interactive Video Streaming |
Report |
Code |
Ref |
226-Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks |
Report |
Code |
Ref |
227-WiFall: Device-Free Fall Detection by Wireless Networks |
Report |
Code |
Ref |
228-3D Gaussian Splatting for Real-Time Radiance Field Rendering |
Report |
Code |
Ref |
229-Swin-UMamba: Mamba-based UNet with ImageNet-based Pretraining |
Report |
Code |
Ref |
230-USFM: A Universal Ultrasound Foundation Model Generalized to Tasks and Organs Towards Label Efficient Image Analysis |
Report |
Code |
Ref |
231-Normal Integration via Inverse Plane Fitting With Minimum Point-to-Plane Distance |
Report |
Code |
Ref |
232-DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation Framework for Efficient Device Model Generalization |
Report |
Code |
Ref |
233-Harmonious Feature Learning for Interactive Hand-Object Pose Estimation |
Report |
Code |
Ref |
234-On-Device Learning for Model Personalization with Large-Scale Cloud-Coordinated Domain Adaption |
Report |
Code |
Ref |
235-Causal-Guided Active Learning for Debiasing Large Language Models |
Report |
Code |
Ref |
236-Bad Actor, Good Advisor: Exploring the Role of Large Language Models in Fake News Detection |
Report |
Code |
Ref |
237-FedProto: Federated Prototype Learning across Heterogeneous Clients |
Report |
Code |
Ref |
238-A Novel Algorithmic Structure of EEG Channel Attention Combined With Swin Transformer for Motor Patterns Classification |
Report |
Code |
Ref |
239-CWASI: A WebAssembly Runtime Shim for Inter-function Communication in the Serverless Edge-Cloud Continuum |
Report |
Code |
Ref |
240-TFDet: Target-Aware Fusion for RGB-T Pedestrian Detection |
Report |
Code |
Ref |
241-Diffusion Policy: Visuomotor Policy Learning via Action Diffusion |
Report |
Code |
Ref |
242-Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions |
Report |
Code |
Ref |
243-UVM Based Testbench Architecture for Unit Verification |
Report |
Code |
Ref |
244-DETRs Beat YOLOs on Real-time Object Detection |
Report |
Code |
Ref |
245-On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling |
Report |
Code |
Ref |
246-Omni Aggregation Networks for Lightweight Image Super-Resolution |
Report |
Code |
Ref |
247-Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning |
Report |
Code |
Ref |
248-Code as Policies: Language Model Programs for Embodied Control |
Report |
Code |
Ref |
249-Polarization and Tipping Points |
Report |
Code |
Ref |
250-Prototypical Networks for Few-shot Learning |
Report |
Code |
Ref |
251-Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning |
Report |
Code |
Ref |
252-AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality |
Report |
Code |
Ref |
253-LLaRA: Aligning Large Language Models with Sequential Recommenders |
Report |
Code |
Ref |
254-ODRL: A Benchmark for Off-Dynamics Reinforcement Learning |
Report |
Code |
Ref |
255-ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores |
Report |
Code |
Ref |
256-WeTune: Automatic Discovery and Verification of Query Rewrite Rules |
Report |
Code |
Ref |
257-Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models |
Report |
Code |
Ref |
258-Textured Mesh Quality Assessment: Large-scale Dataset and Deep Learning-based Quality Metric |
Report |
Code |
Ref |
259-Single-Cell Biological Network Inference using a Heterogeneous Graph Transformer |
Report |
Code |
Ref |
260-FedCP: Separating Feature Information for Personalized Federated Learning via Conditional Policy |
Report |
Code |
Ref |
261-Robust Reflection Removal with Reflection-free Flash-only Cues |
Report |
Code |
Ref |
262-CFR-RL: Traffic Engineering With Reinforcement Learning in SDN |
Report |
Code |
Ref |
263-Identifying Partial Topology of Simplicial Complexes |
Report |
Code |
Ref |
264-CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition |
Report |
Code |
Ref |
265-Microservice Deployment in Edge Computing Based on Deep Q Learning |
Report |
Code |
Ref |
266-Cross-View Object Geo-Localization in a Local Region With Satellite Imagery |
Report |
Code |
Ref |
267-DUSt3R: Geometric 3D Vision Made Easy |
Report |
Code |
Ref |
268-Language Models are Few-Shot Learners |
Report |
Code |
Ref |
269-FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction |
Report |
Code |
Ref |
270-A Recurrent Network Model of Planning Explains Hippocampal Replay and Human Behavior |
Report |
Code |
Ref |
271-BELM: High-quality Exact Inversion sampler of Diffusion Models |
Report |
Code |
Ref |
272-Pseudo Label-Guided Model Inversion Attack via Conditional Generative Adversarial Network |
Report |
Code |
Ref |
273-TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting |
Report |
Code |
Ref |
274-LineVul: A Transformer-based Line-Level Vulnerability Prediction |
Report |
Code |
Ref |
275-XNet: Wavelet-Based Low and High Frequency Fusion Networks for Fully- and Semi-Supervised Semantic Segmentation of Biomedical Images |
Report |
Code |
Ref |
276-Dirichlet-Based Prediction Calibration for Learning with Noisy Labels |
Report |
Code |
Ref |
277-MARBLE: Music Audio Representation Benchmark for Universal Evaluation |
Report |
Code |
Ref |
278-GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models |
Report |
Code |
Ref |
279-Neural Network-Based Dimensionality Reduction for Large-Scale Binary Optimization with Millions of Variables |
Report |
Code |
Ref |
280-RePlAce: Advancing Solution Quality and Routability Validation in Global Placement |
Report |
Code |
Ref |
281-Combining Implicit-Explicit View Correlation for Light Field Semantic Segmentation |
Report |
Code |
Ref |
282-StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation |
Report |
Code |
Ref |
283-SBT-LLM: Structural Balance and Text-Enhanced Large Language Model-based Signed Link Prediction |
Report |
Code |
Ref |
284-Towards Open-World Recommendation with Knowledge Augmentation from Large Language Models |
Report |
Code |
Ref |
285-ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation |
Report |
Code |
Ref |
286-Anomaly Detection on Attributed Networks via Contrastive Self-Supervised Learning |
Report |
Code |
Ref |
287-nanoBench: A Low-Overhead Tool for Running Microbenchmarks on x86 Systems |
Report |
Code |
Ref |
288-An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale |
Report |
Code |
Ref |
289-Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation |
Report |
Code |
Ref |
290-LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval |
Report |
Code |
Ref |
291-MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition |
Report |
Code |
Ref |
292-OpenVLA: An Open-Source Vision-Language-Action Model |
Report |
Code |
Ref |
293-DLCRec: A Novel Approach for Managing Diversity in LLM-Based Recommender Systems |
Report |
Code |
Ref |
294-Improved DeepFake Detection Using Whisper Features |
Report |
Code |
Ref |
295-Optimal Generalized H-Tree Topology and Buffering for High-Performance and Low-Power Clock Distribution |
Report |
Code |
Ref |
296-Continuous Optical Zooming: A Benchmark for Arbitrary-Scale Image Super-Resolution in Real World |
Report |
Code |
Ref |
297-Value-Evolutionary-Based Reinforcement Learning |
Report |
Code |
Ref |
298-CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving |
Report |
Code |
Ref |
299-AnyGraph: Graph Foundation Model in the Wild |
Report |
Code |
Ref |
300-Information Gain-Based Multi-Objective Evolutionary Algorithm for Feature Selection |
Report |
Code |
Ref |
301-DRCT: Diffusion Reconstruction Contrastive Training towards Universal Detection of Diffusion Generated Images |
Report |
Code |
Ref |
302-An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale |
Report |
Code |
Ref |
303-VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud |
Report |
Code |
Ref |
304-Self-orienting in human and machine learning |
Report |
Code |
Ref |
305-Going Beyond Popularity and Positivity Bias: Correcting for Multifactorial Bias in Recommender Systems |
Report |
Code |
Ref |
306-A Novel Normalized-Cut Solver With Nearest Neighbor Hierarchical Initialization |
Report |
Code |
Ref |
307-DoRA: Weight-Decomposed Low-Rank Adaptation |
Report |
Code |
Ref |
308-SURE: SUrvey REcipes for Building Reliable and Robust Deep Networks |
Report |
Code |
Ref |
309-UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems |
Report |
Code |
Ref |
310-InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models |
Report |
Code |
Ref |
311-FedFTHA: A Fine-Tuning and Head Aggregation Method in Federated Learning |
Report |
Code |
Ref |
312-Practical Efficient Microservice Autoscaling with QoS Assurance |
Report |
Code |
Ref |
313-Robust Test-Time Adaptation in Dynamic Scenarios |
Report |
Code |
Ref |
314-Spatial-Temporal Knowledge Transfer for Dynamic Constrained Multiobjective Optimization |
Report |
Code |
Ref |
315-Seamless Manga Inpainting with Semantics Awareness |
Report |
Code |
Ref |
316-Deep Semi-Supervised Anomaly Detection |
Report |
Code |
Ref |
317-Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing |
Report |
Code |
Ref |
318-EGTR: Extracting Graph from Transformer for Scene Graph Generation |
Report |
Code |
Ref |
319-YOLOv10: Real-Time End-to-End Object Detection |
Report |
Code |
Ref |
320-ReconFusion: 3D Reconstruction with Diffusion Priors |
Report |
Code |
Ref |
321-Ego-Body Pose Estimation via Ego-Head Pose Estimation |
Report |
Code |
Ref |
322-OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models |
Report |
Code |
Ref |
323-LinkGPT: Teaching Large Language Models To Predict Missing Links |
Report |
Code |
Ref |
324-YOLOv11: An Overview of the Key Architectural Enhancements |
Report |
Code |
Ref |
325-OpenVLA: An Open-Source Vision-Language-Action Model |
Report |
Code |
Ref |
326-A Variable Granularity Search-Based Multiobjective Feature Selection Algorithm for High-Dimensional Data Classification |
Report |
Code |
Ref |
327-SocialLGN: Light Graph Convolution Network for Social Recommendation |
Report |
Code |
Ref |
328-SRL-SOA: Self-Representation Learning with Sparse 1D-Operational Autoencoder for Hyperspectral Image Band Selection |
Report |
Code |
Ref |
329-Self-Supervised Medical Image Segmentation Using Deep Reinforced Adaptive Masking |
Report |
Code |
Ref |
330-Brain-inspired Global-local Learning Incorporated with Neuromorphic Computing |
Report |
Code |
Ref |
331-Joint Face Detection and Alignment using Multitask Cascaded Convolutional Networks |
Report |
Code |
Ref |
332-Segment Anything in Medical Images and Videos: Benchmark and Deployment |
Report |
Code |
Ref |
333-MedNAS: Multiscale Training-Free Neural Architecture Search for Medical Image Analysis |
Report |
Code |
Ref |
334-AutoDock Koto: A Gradient Boosting Differential Evolution for Molecular Docking |
Report |
Code |
Ref |
335-PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change |
Report |
Code |
Ref |
336-End-to-End Object Detection with Transformers |
Report |
Code |
Ref |
337-Dung Beetle Optimizer: A New Meta-Heuristic Algorithm for Global Optimization |
Report |
Code |
Ref |
338-EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation |
Report |
Code |
Ref |
339-SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention |
Report |
Code |
Ref |
340-Lite-HRNet: A Lightweight High-Resolution Network |
Report |
Code |
Ref |
341-Robust Chemical Analysis with Graphene Chemosensors and Machine Learning |
Report |
Code |
Ref |
342-Sapling Similarity: A Performing and Interpretable Memory-Based Tool for Recommendation |
Report |
Code |
Ref |
343-HETEROFL: Computation and Communication Efficient Federated Learning for Heterogeneous Clients |
Report |
Code |
Ref |
344-Read, Watch and Scream! Sound Generation from Text and Video |
Report |
Code |
Ref |
345-RoBERTa: A Robustly Optimized BERT Pretraining Approach |
Report |
Code |
Ref |
346-PromptIR: Prompting for All-in-One Blind Image Restoration |
Report |
Code |
Ref |
347-Efficient Deweather Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation |
Report |
Code |
Ref |
348-Adapt PointFormer: 3D Point Cloud Analysis via Adapting 2D Visual Transformers |
Report |
Code |
Ref |
349-WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation |
Report |
Code |
Ref |
350-3D Object Localization in RGB-D Scans using Natural Language |
Report |
Code |
Ref |
351-Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection |
Report |
Code |
Ref |
352-Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts |
Report |
Code |
Ref |
353-Building Customized Chatbots for Document Summarization and Question Answering using Large Language Models using a Framework with OpenAI, LangChain, and Streamlit |
Report |
Code |
Ref |
354-MedMamba: Vision Mamba for Medical Image Classification |
Report |
Code |
Ref |
355-Performance Analysis of Cambricon MLU100 |
Report |
Code |
Ref |
356-QUIC meets ICN: A Versatile Wireless Transport Strategy in Multi-access Edge Environments |
Report |
Code |
Ref |
357-Grounding DINO: Marrying DINO with Grounded Pre-training for Open-Set Object Detection |
Report |
Code |
Ref |
358-A Fast Blind Zero-shot Denoiser |
Report |
Code |
Ref |
359-Meshed-Memory Transformer for Image Captioning |
Report |
Code |
Ref |
360-DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly Detection |
Report |
Code |
Ref |
361-CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer |
Report |
Code |
Ref |
362-For SALE: State-Action Representation Learning for Deep Reinforcement Learning |
Report |
Code |
Ref |
363-Gated Linear Attention Transformers with Hardware-Efficient Training |
Report |
Code |
Ref |
364-4D Gaussian Splatting for Real-Time Dynamic Scene Rendering |
Report |
Code |
Ref |
365-CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement |
Report |
Code |
Ref |
366-Exploratory Combinatorial Optimization with Reinforcement Learning |
Report |
Code |
Ref |
367-MaPLe: Multi-modal Prompt Learning |
Report |
Code |
Ref |
368-M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models |
Report |
Code |
Ref |
369-Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model |
Report |
Code |
Ref |
370-PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers |
Report |
Code |
Ref |
371-R2-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction |
Report |
Code |
Ref |
372-AffordPose: A Large-scale Dataset of Hand-Object Interactions with Affordance-driven Hand Pose |
Report |
Code |
Ref |
373-LiFteR: Unleash Learned Codecs in Video Streaming with Loose Frame Referencing |
Report |
Code |
Ref |
374-Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning |
Report |
Code |
Ref |
375-EmoTracer: A Wearable Physiological and Psychological Monitoring System With Multi-modal Sensors |
Report |
Code |
Ref |
376-Intelligent Reflecting Surface Configuration for Smart Radio Using Deep Reinforcement Learning |
Report |
Code |
Ref |
377-RGD-Net: Controllable Floorplan Creation via Room Guidance Diffusion |
Report |
Code |
Ref |
378-A Queue Waiting Cost-Aware Control Model for Large Scale Heterogeneous Cloud Datacenter |
Report |
Code |
Ref |
379-Faster-LIO: Lightweight Tightly Coupled Lidar-Inertial Odometry Using Parallel Sparse Incremental Voxels |
Report |
Code |
Ref |
380-AttAcc! Unleashing the Power of PIM for Batched Transformer-based Generative Model Inference |
Report |
Code |
Ref |
381-Reflexion: Language Agents with Verbal Reinforcement Learning |
Report |
Code |
Ref |
382-USFM: A Universal Ultrasound Foundation Model Generalized to Tasks and Organs towards Label Efficient Image Analysis |
Report |
Code |
Ref |
383-HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model |
Report |
Code |
Ref |
384-A Learned Index for Exact Similarity Search in Metric Spaces |
Report |
Code |
Ref |
385-Mimicking The Brain's Cognition of Sarcasm from Multidisciplines for Twitter Sarcasm Detection |
Report |
Code |
Ref |
386-FedProto: Federated Prototype Learning across Heterogeneous Clients |
Report |
Code |
Ref |
387-U-RED: Unsupervised 3D Shape Retrieval and Deformation for Partial Point Clouds |
Report |
Code |
Ref |
388-ReAct: Synergizing Reasoning and Acting in Language Models |
Report |
Code |
Ref |
389-UNETR++: Delving Into Efficient and Accurate 3D Medical Image Segmentation |
Report |
Code |
Ref |
390-Adaptive Particle Swarm Optimization |
Report |
Code |
Ref |