Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-23 | LTLZinc: a Benchmarking Framework for Continual Learning and Neuro-Symbolic Temporal Reasoning | Luca Salvatore Lorello et.al. | 2507.17482 | null |
2025-07-23 | Filter-And-Refine: A MLLM Based Cascade System for Industrial-Scale Video Content Moderation | Zixuan Wang et.al. | 2507.17204 | null |
2025-07-22 | Combining Language and Topic Models for Hierarchical Text Classification | Jaco du Toit et.al. | 2507.16490 | null |
2025-07-22 | The Cost of Compression: Tight Quadratic Black-Box Attacks on Sketches for |
Sara Ahmadian et.al. | 2507.16345 | null |
2025-07-22 | Cross-Modal Distillation For Widely Differing Modalities | Cairong Zhao et.al. | 2507.16296 | null |
2025-07-22 | MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks | Junhao Su et.al. | 2507.16279 | null |
2025-07-22 | Quality Text, Robust Vision: The Role of Language in Enhancing Visual Robustness of Vision-Language Models | Futa Waseda et.al. | 2507.16257 | null |
2025-07-21 | Stop-band Energy Constraint for Orthogonal Tunable Wavelet Units in Convolutional Neural Networks for Computer Vision problems | An D. Le et.al. | 2507.16114 | null |
2025-07-21 | Optimizing Canaries for Privacy Auditing with Metagradient Descent | Matteo Boglioni et.al. | 2507.15836 | null |
2025-07-21 | GeMix: Conditional GAN-Based Mixup for Improved Medical Image Augmentation | Hugo Carlesso et.al. | 2507.15577 | null |
2025-07-21 | Smart Eyes for Silent Threats: VLMs and In-Context Learning for THz Imaging | Nicolas Poggi et.al. | 2507.15576 | null |
2025-07-21 | An Investigation of Test-time Adaptation for Audio Classification under Background Noise | Weichuang Shao et.al. | 2507.15523 | null |
2025-07-20 | Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices | Saeid Ghafouri et.al. | 2507.14959 | null |
2025-07-20 | Probabilistic smooth attention for deep multiple instance learning in medical imaging | Francisco M. Castro-Macías et.al. | 2507.14932 | null |
2025-07-20 | Semantic-Aware Representation Learning for Multi-label Image Classification | Ren-Dong Xie et.al. | 2507.14918 | null |
2025-07-20 | The Tsetlin Machine Goes Deep: Logical Learning and Reasoning With Graphs | Ole-Christoffer Granmo et.al. | 2507.14874 | null |
2025-07-19 | Performance comparison of medical image classification systems using TensorFlow Keras, PyTorch, and JAX | Merjem Bećirović et.al. | 2507.14587 | null |
2025-07-18 | Classification of Histopathology Slides with Persistence Homology Convolutions | Shrunal Pothagoni et.al. | 2507.14378 | null |
2025-07-18 | Quantum Boltzmann Machines using Parallel Annealing for Medical Image Classification | Daniëlle Schuman et.al. | 2507.14116 | null |
2025-07-18 | Foundation Models as Class-Incremental Learners for Dermatological Image Classification | Mohamed Elkhayat et.al. | 2507.14050 | null |
2025-07-18 | Evaluating the Effectiveness of Cost-Efficient Large Language Models in Benchmark Biomedical Tasks | Israt Jahan et.al. | 2507.14045 | null |
2025-07-18 | Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations | Yong Feng et.al. | 2507.14010 | null |
2025-07-18 | Feature Engineering is Not Dead: Reviving Classical Machine Learning with Entropy, HOG, and LBP Feature Fusion for Image Classification | Abhijit Sen et.al. | 2507.13772 | null |
2025-07-18 | Adversarial Training Improves Generalization Under Distribution Shifts in Bioacoustics | René Heinrich et.al. | 2507.13727 | null |
2025-07-18 | Enhanced image classification via hybridizing quantum dynamics with classical neural networks | Ruiyang Zhou et.al. | 2507.13587 | null |
2025-07-17 | Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy | Yiting Yang et.al. | 2507.13260 | null |
2025-07-17 | Adversarial attacks to image classification systems using evolutionary algorithms | Sergio Nesmachnow et.al. | 2507.13136 | null |
2025-07-17 | MUPAX: Multidimensional Problem Agnostic eXplainable AI | Vincenzo Dentamaro et.al. | 2507.13090 | null |
2025-07-17 | Making Language Model a Hierarchical Classifier and Generator | Yihong Wang et.al. | 2507.12930 | null |
2025-07-17 | Federated Learning for Commercial Image Sources | Shreyansh Jain et.al. | 2507.12903 | null |
2025-07-17 | LanePerf: a Performance Estimation Framework for Lane Detection | Yin Wu et.al. | 2507.12894 | null |
2025-07-17 | Feature-Enhanced TResNet for Fine-Grained Food Image Classification | Lulu Liu et.al. | 2507.12828 | null |
2025-07-17 | Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine | Anastasia Kuznetsova et.al. | 2507.12701 | null |
2025-07-16 | Comparative Analysis of CNN Performance in Keras, PyTorch and JAX on PathMNIST | Anida Nezović et.al. | 2507.12248 | null |
2025-07-16 | PRISM: Distributed Inference for Foundation Models at Edge | Muhammad Azlan Qazi et.al. | 2507.12145 | null |
2025-07-16 | Effective Fine-Tuning of Vision Transformers with Low-Rank Adaptation for Privacy-Preserving Image Classification | Haiwei Lin et.al. | 2507.11943 | null |
2025-07-16 | Spatial Frequency Modulation for Semantic Segmentation | Linwei Chen et.al. | 2507.11893 | null |
2025-07-16 | ProtoConNet: Prototypical Augmentation and Alignment for Open-Set Few-Shot Image Classification | Kexuan Shi et.al. | 2507.11845 | null |
2025-07-15 | Quantum Adaptive Excitation Network with Variational Quantum Circuits for Channel Attention | Yu-Chao Hsu et.al. | 2507.11217 | null |
2025-07-15 | Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking | Yuan Yao et.al. | 2507.11137 | null |
2025-07-15 | Focus on Texture: Rethinking Pre-training in Masked Autoencoders for Medical Image Classification | Chetan Madan et.al. | 2507.10869 | null |
2025-07-14 | AudioMAE++: learning better masked audio representations with SwiGLU FFNs | Sarthak Yadav et.al. | 2507.10464 | null |
2025-07-14 | Improving Remote Sensing Classification using Topological Data Analysis and Convolutional Neural Networks | Aaryam Sharma et.al. | 2507.10381 | null |
2025-07-14 | FTCFormer: Fuzzy Token Clustering Transformer for Image Classification | Muyi Bao et.al. | 2507.10283 | null |
2025-07-14 | Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks | Ben Hamscher et.al. | 2507.10239 | null |
2025-07-14 | MEDebiaser: A Human-AI Feedback System for Mitigating Bias in Multi-label Medical Image Classification | Shaohan Shi et.al. | 2507.10044 | null |
2025-07-14 | Effects of structural properties of neural networks on machine learning performance | Yash Arya et.al. | 2507.10005 | null |
2025-07-14 | Hierarchical Job Classification with Similarity Graph Integration | Md Ahsanul Kabir et.al. | 2507.09949 | null |
2025-07-13 | Post-Training Quantization of Generative and Discriminative LSTM Text Classifiers: A Study of Calibration, Class Balance, and Robustness | Md Mushfiqur Rahaman et.al. | 2507.09687 | null |
2025-07-13 | MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression | Ofir Gordon et.al. | 2507.09616 | null |
2025-07-13 | SDTN and TRN: Adaptive Spectral-Spatial Feature Extraction for Hyperspectral Image Classification | Fuyin Ye et.al. | 2507.09492 | null |
2025-07-11 | A Hybrid Multi-Well Hopfield-CNN with Feature Extraction and K-Means for MNIST Classification | Ahmed Farooq et.al. | 2507.08766 | null |
2025-07-11 | DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images | Haoran Sun et.al. | 2507.08648 | null |
2025-07-11 | Onboard Neuromorphic Split Computing via Optical Links for LEO Remote Sensing | Zihang Song et.al. | 2507.08490 | null |
2025-07-11 | Interpretability-Aware Pruning for Efficient Medical Image Analysis | Nikita Malik et.al. | 2507.08330 | null |
2025-07-11 | Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks | Sofia Ivolgina et.al. | 2507.08261 | null |
2025-07-10 | A Hybrid Multilayer Extreme Learning Machine for Image Classification with an Application to Quadcopters | Rolando A. Hernandez-Hernandez et.al. | 2507.08047 | null |
2025-07-10 | Where are we with calibration under dataset shift in image classification? | Mélanie Roschewitz et.al. | 2507.07780 | null |
2025-07-10 | TRIX- Trading Adversarial Fairness via Mixed Adversarial Training | Tejaswini Medi et.al. | 2507.07768 | null |
2025-07-10 | OPC: One-Point-Contraction Unlearning Toward Deep Feature Forgetting | Jaeheun Jung et.al. | 2507.07754 | null |
2025-07-10 | Temporal Unlearnable Examples: Preventing Personal Video Data from Unauthorized Exploitation by Object Tracking | Qiangqiang Wu et.al. | 2507.07483 | null |
2025-07-10 | EPIC: Efficient Prompt Interaction for Text-Image Classification | Xinyao Yu et.al. | 2507.07415 | null |
2025-07-10 | GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation | Fardin Rastakhiz et.al. | 2507.07414 | null |
2025-07-09 | GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning | S M Taslim Uddin Raju et.al. | 2507.07006 | null |
2025-07-09 | Unifying Re-Identification, Attribute Inference, and Data Reconstruction Risks in Differential Privacy | Bogdan Kulynych et.al. | 2507.06969 | null |
2025-07-09 | Steps Adaptive Decay DPSGD: Enhancing Performance on Imbalanced Datasets with Differential Privacy with HAM10000 | Xiaobo Huang et.al. | 2507.06619 | null |
2025-07-08 | Capsule-ConvKAN: A Hybrid Neural Approach to Medical Image Classification | Laura Pituková et.al. | 2507.06417 | null |
2025-07-08 | SoftReMish: A Novel Activation Function for Enhanced Convolutional Neural Networks for Visual Recognition Performance | Mustafa Bayram Gücen et.al. | 2507.06148 | null |
2025-07-08 | On the Effectiveness of Methods and Metrics for Explainable AI in Remote Sensing Image Scene Classification | Jonas Klotz et.al. | 2507.05916 | null |
2025-07-08 | Knowledge-guided Complex Diffusion Model for PolSAR Image Classification in Contourlet Domain | Junfei Shi et.al. | 2507.05666 | null |
2025-07-08 | Model-free Optical Processors using In Situ Reinforcement Learning with Proximal Policy Optimization | Yuhang Li et.al. | 2507.05583 | null |
2025-07-07 | Experimental data re-uploading with provable enhanced learning capabilities | Martin F. X. Mauser et.al. | 2507.05120 | null |
2025-07-07 | Verified Language Processing with Hybrid Explainability: A Technical Report | Oliver Robert Fox et.al. | 2507.05017 | null |
2025-07-07 | Co-DETECT: Collaborative Discovery of Edge Cases in Text Classification | Chenfei Xiong et.al. | 2507.05010 | null |
2025-07-07 | Bridging KAN and MLP: MJKAN, a Hybrid Architecture with Both Efficiency and Expressiveness | Hanseon Joo et.al. | 2507.04690 | null |
2025-07-07 | Recovering Plasticity of Neural Networks via Soft Weight Rescaling | Seungwon Oh et.al. | 2507.04683 | null |
2025-07-07 | VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents | Rui Meng et.al. | 2507.04590 | null |
2025-07-06 | MVNet: Hyperspectral Remote Sensing Image Classification Based on Hybrid Mamba-Transformer Vision Backbone Architecture | Guandong Li et.al. | 2507.04409 | null |
2025-07-06 | Transferring Visual Explainability of Self-Explaining Models through Task Arithmetic | Yuya Yoshikawa et.al. | 2507.04380 | null |
2025-07-06 | Efficient Training of Deep Networks using Guided Spectral Data Selection: A Step Toward Learning What You Need | Mohammadreza Sharifi et.al. | 2507.04269 | null |
2025-07-06 | Siberian radioheliograph image classification using ensemble of CLIP, EfficientNet and CatBoost models | Yaroslav Egorov et.al. | 2507.04211 | null |
2025-07-03 | Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and Physics | Alex Colagrande et.al. | 2507.02748 | null |
2025-07-03 | ASDA: Audio Spectrogram Differential Attention Mechanism for Self-Supervised Representation Learning | Junyu Wang et.al. | 2507.02666 | null |
2025-07-03 | MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention | Zunhui Xia et.al. | 2507.02488 | null |
2025-07-03 | F^2TTA: Free-Form Test-Time Adaptation on Cross-Domain Medical Image Classification via Image-Level Disentangled Prompt Tuning | Wei Li et.al. | 2507.02437 | null |
2025-07-03 | Cross-domain Hyperspectral Image Classification based on Bi-directional Domain Adaptation | Yuxiang Zhang et.al. | 2507.02268 | null |
2025-07-03 | High-Fidelity Differential-information Driven Binary Vision Transformer | Tian Gao et.al. | 2507.02222 | null |
2025-07-02 | Selective Feature Re-Encoded Quantum Convolutional Neural Network with Joint Optimization for Image Classification | Shaswata Mahernob Sarkar et.al. | 2507.02086 | null |
2025-07-02 | How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks | Rahul Ramachandran et.al. | 2507.01955 | null |
2025-07-02 | evMLP: An Efficient Event-Driven MLP Architecture for Vision | Zhentan Zheng et.al. | 2507.01927 | null |
2025-07-02 | mGRADE: Minimal Recurrent Gating Meets Delay Convolutions for Lightweight Sequence Modeling | Tristan Torchet et.al. | 2507.01829 | null |
2025-07-02 | Are Vision Transformer Representations Semantically Meaningful? A Case Study in Medical Imaging | Montasir Shams et.al. | 2507.01788 | null |
2025-07-02 | Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation | Andrei Jelea et.al. | 2507.01347 | null |
2025-07-01 | Biorthogonal Tunable Wavelet Unit with Lifting Scheme in Convolutional Neural Network | An Le et.al. | 2507.00739 | null |
2025-07-01 | Rectifying Magnitude Neglect in Linear Attention | Qihang Fan et.al. | 2507.00698 | null |
2025-07-01 | Few-shot Classification as Multi-instance Verification: Effective Backbone-agnostic Transfer across Domains | Xin Xu et.al. | 2507.00401 | null |
2025-06-30 | Two-Stage Reasoning-Infused Learning: Improving Classification with LLM-Generated Reasoning | Mads Henrichsen et.al. | 2507.00214 | null |
2025-06-30 | Toward Simple and Robust Contrastive Explanations for Image Classification by Leveraging Instance Similarity and Concept Relevance | Yuliia Kaidashova et.al. | 2506.23975 | null |
2025-06-30 | Unveiling Decision-Making in LLMs for Text Classification : Extraction of influential and interpretable concepts with Sparse Autoencoders | Mathis Le Bail et.al. | 2506.23951 | null |
2025-06-30 | Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors | Ce Wang et.al. | 2506.23801 | null |
2025-07-01 | Towards the Training of Deeper Predictive Coding Neural Networks | Chang Qi et.al. | 2506.23800 | null |
2025-06-30 | A Unified Framework for Stealthy Adversarial Generation via Latent Optimization and Transferability Enhancement | Gaozheng Pei et.al. | 2506.23676 | null |
2025-06-30 | Robustness of Misinformation Classification Systems to Adversarial Examples Through BeamAttack | Arnisa Fazla et.al. | 2506.23661 | null |
2025-06-30 | AdFair-CLIP: Adversarial Fair Contrastive Language-Image Pre-training for Chest X-rays | Chenlang Yi et.al. | 2506.23467 | null |
2025-06-29 | Federated Breast Cancer Detection Enhanced by Synthetic Ultrasound Image Augmentation | Hongyi Pan et.al. | 2506.23334 | null |
2025-07-01 | Exposing and Mitigating Calibration Biases and Demographic Unfairness in MLLM Few-Shot In-Context Learning for Medical Image Classification | Xing Shen et.al. | 2506.23298 | null |
2025-06-29 | Aggregating Local Saliency Maps for Semi-Global Explainable Image Classification | James Hinns et.al. | 2506.23247 | null |
2025-06-27 | Boosting Classification with Quantum-Inspired Augmentations | Matthias Tschöpe et.al. | 2506.22241 | null |
2025-06-27 | Remote Sensing Large Vision-Language Model: Semantic-augmented Multi-level Alignment and Semantic-aware Expert Modeling | Sungjune Park et.al. | 2506.21863 | null |
2025-06-27 | LinguaSynth: Heterogeneous Linguistic Signals for News Classification | Duo Zhang et.al. | 2506.21848 | null |
2025-06-25 | Disentangled representations of microscopy images | Jacopo Dapueto et.al. | 2506.20649 | null |
2025-06-25 | Counterfactual Influence as a Distributional Quantity | Matthieu Meeus et.al. | 2506.20481 | null |
2025-06-25 | Practical insights on the effect of different encodings, ansätze and measurements in quantum and hybrid convolutional neural networks | Jesús Lozano-Cruz et.al. | 2506.20355 | link |
2025-06-25 | Learning Moderately Input-Sensitive Functions: A Case Study in QR Code Decoding | Kazuki Yoda et.al. | 2506.20305 | null |
2025-06-25 | Hierarchical Mask-Enhanced Dual Reconstruction Network for Few-Shot Fine-Grained Image Classification | Ning Luo et.al. | 2506.20263 | null |
2025-06-25 | Perspectives in Play: A Multi-Perspective Approach for More Inclusive NLP Systems | Benedetta Muscato et.al. | 2506.20209 | null |
2025-06-26 | Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition | Man Duc Chuc et.al. | 2506.20174 | null |
2025-06-24 | Neuromorphic Wireless Split Computing with Resonate-and-Fire Neurons | Dengyu Wu et.al. | 2506.20015 | null |
2025-06-24 | Ensemble nonlinear optical learner by electrically tunable linear scattering | Tunan Xia et.al. | 2506.19976 | null |
2025-06-25 | One Prototype Is Enough: Single-Prototype Activation for Interpretable Image Classification | Yitao Peng et.al. | 2506.19808 | null |
2025-06-24 | MambaOutRS: A Hybrid CNN-Fourier Architecture for Remote Sensing Image Classification | Minjong Cheon et.al. | 2506.19561 | null |
2025-06-24 | Iterative Quantum Feature Maps | Nasa Matsumoto et.al. | 2506.19461 | null |
2025-06-24 | Comparative Performance of Finetuned ImageNet Pre-trained Models for Electronic Component Classification | Yidi Shao et.al. | 2506.19330 | null |
2025-06-23 | LKA: Large Kernel Adapter for Enhanced Medical Image Classification | Ziquan Zhu et.al. | 2506.19118 | null |
2025-06-23 | Sensitivity Analysis of Image Classification Models using Generalized Polynomial Chaos | Lukas Bahr et.al. | 2506.18751 | null |
2025-06-23 | SIM-Net: A Multimodal Fusion Network Using Inferred 3D Object Shape Point Clouds from RGB Images for 2D Classification | Youcef Sklab et.al. | 2506.18683 | null |
2025-06-23 | SpaNN: Detecting Multiple Adversarial Patches on CNNs by Spanning Saliency Thresholds | Mauricio Byrd Victorica et.al. | 2506.18591 | null |
2025-06-23 | Geometry-aware Distance Measure for Diverse Hierarchical Structures in Hyperbolic Spaces | Pengxiang Li et.al. | 2506.18533 | null |
2025-06-23 | A Set-to-Set Distance Measure in Hyperbolic Space | Pengxiang Li et.al. | 2506.18529 | null |
2025-06-23 | Fully Few-shot Class-incremental Audio Classification Using Multi-level Embedding Extractor and Ridge Regression Classifier | Yongjie Si et.al. | 2506.18406 | null |
2025-06-23 | Open Set Recognition for Endoscopic Image Classification: A Deep Learning Approach on the Kvasir Dataset | Kasra Moazzami et.al. | 2506.18284 | null |
2025-06-22 | Pitfalls of Conformal Predictions for Medical Image Classification | Hendrik Mehrtens et.al. | 2506.18162 | null |
2025-06-22 | HE-LRM: Encrypted Deep Learning Recommendation Models using Fully Homomorphic Encryption | Karthik Garimella et.al. | 2506.18150 | null |
2025-06-22 | Training-free Test-time Improvement for Explainable Medical Image Classification | Hangzhou He et.al. | 2506.18070 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133 | null |
2025-06-20 | Acquiring and Accumulating Knowledge from Diverse Datasets for Multi-label Driving Scene Classification | Ke Li et.al. | 2506.17101 | null |
2025-06-20 | From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers | Jingtong Su et.al. | 2506.17052 | null |
2025-06-20 | With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You | Fabian Gröger et.al. | 2506.16895 | null |
2025-06-20 | Transition of AI Models in dependence of noise | Thomas Seidler et.al. | 2506.16715 | null |
2025-06-19 | Efficient Transformations in Deep Learning Convolutional Neural Networks | Berk Yilmaz et.al. | 2506.16418 | null |
2025-06-19 | SHREC and PHEONA: Using Large Language Models to Advance Next-Generation Computational Phenotyping | Sarah Pungitore et.al. | 2506.16359 | null |
2025-06-19 | Polyline Path Masked Attention for Vision Transformer | Zhongchen Zhao et.al. | 2506.15940 | null |
2025-06-18 | FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | Haolong Jin et.al. | 2506.15365 | link |
2025-06-18 | Enhancing One-run Privacy Auditing with Quantile Regression-Based Membership Inference | Terrance Liu et.al. | 2506.15349 | null |
2025-06-19 | OpenPath: Open-Set Active Learning for Pathology Image Classification via Pre-trained Vision-Language Models | Lanfeng Zhong et.al. | 2506.15318 | null |
2025-06-18 | J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor | Benoit Tain et.al. | 2506.15316 | null |
2025-06-18 | Domain Adaptation for Image Classification of Defects in Semiconductor Manufacturing | Adrian Poniatowski et.al. | 2506.15260 | null |
2025-06-18 | A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals | Andrea Cadeddu et.al. | 2506.15208 | null |
2025-06-18 | Identifying social isolation themes in NVDRS text narratives using topic modeling and text-classification methods | Drew Walker et.al. | 2506.15030 | null |
2025-06-17 | DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification | Matt Poyser et.al. | 2506.14667 | null |
2025-06-17 | Train Once, Forget Precisely: Anchored Optimization for Efficient Post-Hoc Unlearning | Prabhav Sanga et.al. | 2506.14515 | null |
2025-06-17 | Compositional Attribute Imbalance in Vision Datasets | Jiayi Chen et.al. | 2506.14418 | null |
2025-06-17 | One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification | Renao Yan et.al. | 2506.14176 | null |
2025-06-17 | SeqPE: Transformer with Sequential Position Encoding | Huayang Li et.al. | 2506.13277 | null |
2025-06-15 | Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs | Lu Chen et.al. | 2506.12875 | null |
2025-06-15 | Medical Argument Mining: Exploitation of Scarce Data Using NLI Systems | Maitane Urruela et.al. | 2506.12823 | null |
2025-06-15 | Cross-architecture universal feature coding via distribution alignment | Changsheng Gao et.al. | 2506.12737 | null |
2025-06-15 | Unsupervised Contrastive Learning Using Out-Of-Distribution Data for Long-Tailed Dataset | Cuong Manh Hoang et.al. | 2506.12698 | null |
2025-06-15 | Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context | Samarth Singhal et.al. | 2506.12683 | null |
2025-06-14 | OscNet v1.5: Energy Efficient Hopfield Network on CMOS Oscillators for Image Classification | Wenxiao Cai et.al. | 2506.12610 | null |
2025-06-14 | DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification | Darryl Ho et.al. | 2506.12585 | null |
2025-06-14 | MVP-CBM:Multi-layer Visual Preference-enhanced Concept Bottleneck Model for Explainable Medical Image Classification | Chunjiang Wang et.al. | 2506.12568 | null |
2025-06-14 | PLD: A Choice-Theoretic List-Wise Knowledge Distillation | Ejafa Bassam et.al. | 2506.12542 | null |
2025-06-13 | GeistBERT: Breathing Life into German NLP | Raphael Scheible-Schmitt et.al. | 2506.11903 | null |
2025-06-13 | Evaluating Fairness and Mitigating Bias in Machine Learning: A Novel Technique using Tensor Data and Bayesian Regression | Kuniko Paxton et.al. | 2506.11627 | null |
2025-06-13 | Machine Unlearning for Robust DNNs: Attribution-Guided Partitioning and Neuron Pruning in Noisy Environments | Deliang Jin et.al. | 2506.11615 | null |
2025-06-13 | Black-Box Edge AI Model Selection with Conformal Latency and Accuracy Guarantees | Anders E. Kalør et.al. | 2506.11391 | null |
2025-06-12 | SNR and Resource Adaptive Deep JSCC for Distributed IoT Image Classification | Ali Waqas et.al. | 2506.10699 | null |
2025-06-13 | PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis | Marzieh Oghbaie et.al. | 2506.10669 | link |
2025-06-12 | Boosting Adversarial Transferability for Hyperspectral Image Classification Using 3D Structure-invariant Transformation and Intermediate Feature Distance | Chun Liu et.al. | 2506.10459 | null |
2025-06-12 | Can We Infer Confidential Properties of Training Data from LLMs? | Penguin Huang et.al. | 2506.10364 | null |
2025-06-12 | Flick: Few Labels Text Classification using K-Aware Intermediate Learning in Multi-Task Low-Resource Languages | Ali Almutairi et.al. | 2506.10292 | null |
2025-06-11 | FedMLAC: Mutual Learning Driven Heterogeneous Federated Audio Classification | Jun Bai et.al. | 2506.10207 | null |
2025-06-11 | Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers | Natanael Lucena et.al. | 2506.10119 | null |
2025-06-11 | DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding | Bin Guo et.al. | 2506.10084 | null |
2025-06-11 | Evidential Deep Learning with Spectral-Spatial Uncertainty Disentanglement for Open-Set Hyperspectral Domain Generalization | Amirreza Khoshbakht et.al. | 2506.09460 | null |
2025-06-11 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning | Tong Wang et.al. | 2506.09327 | null |
2025-06-10 | ScalableHD: Scalable and High-Throughput Hyperdimensional Computing Inference on Multi-Core CPUs | Dhruv Parikh et.al. | 2506.09282 | null |
2025-06-10 | Hyperbolic Dual Feature Augmentation for Open-Environment | Peilin Yu et.al. | 2506.08906 | null |
2025-06-10 | Normalized Radon Cumulative Distribution Transforms for Invariance and Robustness in Optimal Transport Based Image Classification | Matthias Beckmann et.al. | 2506.08761 | null |
2025-06-12 | InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck Mamba | Yuhang Wang et.al. | 2506.08735 | null |
2025-06-10 | Biologically Inspired Deep Learning Approaches for Fetal Ultrasound Image Classification | Rinat Prochii et.al. | 2506.08623 | null |
2025-06-10 | mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks | Luel Hagos Beyene et.al. | 2506.08400 | null |
2025-06-10 | An Adaptive Method Stabilizing Activations for Enhanced Generalization | Hyunseok Seung et.al. | 2506.08353 | null |
2025-06-11 | Hyperspectral Image Classification via Transformer-based Spectral-Spatial Attention Decoupling and Adaptive Gating | Guandong Li et.al. | 2506.08324 | null |
2025-06-09 | TokenBreak: Bypassing Text Classification Models Through Token Manipulation | Kasimir Schulz et.al. | 2506.07948 | null |
2025-06-09 | MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification | Iustin Sirbu et.al. | 2506.07801 | null |
2025-06-09 | Improving Memory Efficiency for Training KANs via Meta Learning | Zhangchi Zhao et.al. | 2506.07549 | null |
2025-06-09 | Mind the Gap: Removing the Discretization Gap in Differentiable Logic Gate Networks | Shakir Yousefi et.al. | 2506.07500 | null |
2025-06-08 | Mobility-Aware Asynchronous Federated Learning with Dynamic Sparsification | Jintao Yan et.al. | 2506.07328 | null |
2025-06-08 | A Stable Whitening Optimizer for Efficient Neural Network Training | Kevin Frans et.al. | 2506.07254 | null |
2025-06-08 | Hierarchical Feature-level Reverse Propagation for Post-Training Neural Networks | Ni Ding et.al. | 2506.07188 | null |
2025-06-08 | CTDGSI: A comprehensive exploitation of instance selection methods for automatic text classification. VII Concurso de Teses, Dissertações e Trabalhos de Graduação em SI -- XXI Simpósio Brasileiro de Sistemas de Informação | Washington Cunha et.al. | 2506.07169 | null |
2025-06-08 | pFedSOP : Accelerating Training Of Personalized Federated Learning Using Second-Order Optimization | Mrinmay Sen et.al. | 2506.07159 | null |
2025-06-07 | Rewriting the Budget: A General Framework for Black-Box Attacks Under Cost Asymmetry | Mahdi Salmani et.al. | 2506.06933 | null |
2025-06-06 | Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias | Yuanzhe Hu et.al. | 2506.06280 | null |
2025-06-06 | FPDANet: A Multi-Section Classification Model for Intelligent Screening of Fetal Ultrasound | Minglang Chen et.al. | 2506.06054 | null |
2025-06-06 | Enhancing Orthopox Image Classification Using Hybrid Machine Learning and Deep Learning Models | Alejandro Puente-Castro et.al. | 2506.06007 | null |
2025-06-06 | LTG at SemEval-2025 Task 10: Optimizing Context for Classification of Narrative Roles | Egil Rønningstad et.al. | 2506.05976 | null |
2025-06-06 | Integer Binary-Range Alignment Neuron for Spiking Neural Networks | Binghao Ye et.al. | 2506.05679 | null |
2025-06-05 | FRAME: Pre-Training Video Feature Representations via Anticipation and Memory | Sethuraman TV et.al. | 2506.05543 | null |
2025-06-05 | Spectral Graph Neural Networks are Incomplete on Graphs with a Simple Spectrum | Snir Hordan et.al. | 2506.05530 | null |
2025-06-05 | Robustness Evaluation for Video Models with Reinforcement Learning | Ashwin Ramesh Babu et.al. | 2506.05431 | null |
2025-06-05 | Interpretable Few-Shot Image Classification via Prototypical Concept-Guided Mixture of LoRA Experts | Zhong Ji et.al. | 2506.04673 | null |
2025-06-04 | Deep Learning for Absorption-Image Analysis | Jacob Morrey et.al. | 2506.04517 | null |
2025-06-04 | KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products | Zixuan Xia et.al. | 2506.04432 | null |
2025-06-04 | Benchmarking Time-localized Explanations for Audio Classification Models | Cecilia Bolaños et.al. | 2506.04391 | null |
2025-06-04 | Hierarchical Text Classification Using Contrastive Learning Informed Path Guided Hierarchy | Neeraj Agrawal et.al. | 2506.04381 | null |
2025-06-04 | Recent Advances in Medical Image Classification | Loan Dao et.al. | 2506.04129 | null |
2025-06-04 | Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation | Mingxuan Xia et.al. | 2506.03857 | null |
2025-06-04 | RhoDARTS: Differentiable Quantum Architecture Search with Density Matrix Simulations | Swagat Kumar et.al. | 2506.03697 | null |
2025-06-04 | Directional Non-Commutative Monoidal Embeddings for MNIST | Mahesh Godavarti et.al. | 2506.03472 | null |
2025-06-03 | RoNFA: Robust Neural Field-based Approach for Few-Shot Image Classification with Noisy Labels | Nan Xiang et.al. | 2506.03461 | null |
2025-06-02 | Quantifying task-relevant representational similarity using decision variable correlation | Yu et.al. | 2506.02164 | null |
2025-06-02 | Towards Better Generalization and Interpretability in Unsupervised Concept-Based Models | Francesco De Santis et.al. | 2506.02092 | null |
2025-06-02 | OD3: Optimization-free Dataset Distillation for Object Detection | Salwa K. Al Khatib et.al. | 2506.01942 | null |
2025-06-02 | Generalized Gradient Norm Clipping & Non-Euclidean |
Thomas Pethick et.al. | 2506.01913 | null |
2025-06-02 | Beyond Static Responses: Multi-Agent LLM Systems as a New Paradigm for Social Science Research | Jennifer Haase et.al. | 2506.01839 | null |
2025-06-02 | mdok of KInIT: Robustly Fine-tuned LLM for Binary and Multiclass AI-Generated Text Detection | Dominik Macko et.al. | 2506.01702 | null |
2025-06-02 | Data Pruning by Information Maximization | Haoru Tan et.al. | 2506.01701 | null |
2025-06-02 | Domain Lexical Knowledge-based Word Embedding Learning for Text Classification under Small Data | Zixiao Zhu et.al. | 2506.01621 | null |
2025-06-02 | Speed-up of Vision Transformer Models by Attention-aware Token Filtering | Takahiro Naruko et.al. | 2506.01519 | null |
2025-06-02 | A Novel Context-Adaptive Fusion of Shadow and Highlight Regions for Efficient Sonar Image Classification | Kamal Basha S et.al. | 2506.01445 | null |
2025-05-30 | Optimal Weighted Convolution for Classification and Denosing | Simone Cammarasana et.al. | 2505.24558 | null |
2025-05-30 | SASP: Strip-Aware Spatial Perception for Fine-Grained Bird Image Classification | Zheng Wang et.al. | 2505.24380 | null |
2025-05-30 | Spatiotemporal Analysis of Forest Machine Operations Using 3D Video Classification | Maciej Wielgosz et.al. | 2505.24375 | null |
2025-05-30 | GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | Gilles Quentin Hacheme et.al. | 2505.24340 | null |
2025-05-30 | Provably Improving Generalization of Few-Shot Models with Synthetic Data | Lan-Cuong Nguyen et.al. | 2505.24190 | null |
2025-05-30 | FeatureSense: Protecting Speaker Attributes in Always-On Audio Sensing System | Bhawana Chhaglani et.al. | 2505.24115 | null |
2025-05-30 | Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting | Chen Huang et.al. | 2505.24088 | null |
2025-05-29 | BIRD: Behavior Induction via Representation-structure Distillation | Galen Pogoncheff et.al. | 2505.23933 | null |
2025-05-29 | Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need | Qiang Wang et.al. | 2505.23744 | null |
2025-05-29 | Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds | Andrew Chang et.al. | 2505.23509 | link |
2025-05-29 | MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification | Yang Qiao et.al. | 2505.23365 | null |
2025-05-29 | DSAGL: Dual-Stream Attention-Guided Learning for Weakly Supervised Whole Slide Image Classification | Daoxi Cao et.al. | 2505.23341 | null |
2025-05-29 | Deep Modeling and Optimization of Medical Image Classification | Yihang Wu et.al. | 2505.23040 | link |
2025-05-28 | Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification | Sylvey Lin et.al. | 2505.22926 | null |
2025-05-28 | Frequency-Adaptive Discrete Cosine-ViT-ResNet Architecture for Sparse-Data Vision | Ziyue Kang et.al. | 2505.22701 | null |
2025-05-28 | S2AFormer: Strip Self-Attention for Efficient Vision Transformer | Guoan Xu et.al. | 2505.22195 | null |
2025-05-28 | Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets | Dongyue Li et.al. | 2505.21930 | null |
2025-05-28 | Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation | Mehrdad Noori et.al. | 2505.21844 | null |
2025-05-27 | MedBridge: Bridging Foundation Vision-Language Models to Medical Image Diagnosis | Yitong Li et.al. | 2505.21698 | null |
2025-05-27 | Leveraging large language models and traditional machine learning ensembles for ADHD detection from narrative transcripts | Yuxin Zhu et.al. | 2505.21324 | null |
2025-05-27 | Making Every Event Count: Balancing Data Efficiency and Accuracy in Event Camera Subsampling | Hesam Araghi et.al. | 2505.21187 | null |
2025-05-27 | Information-Theoretic Complementary Prompts for Improved Continual Text Classification | Duzhen Zhang et.al. | 2505.20933 | null |
2025-05-27 | Evidential Deep Active Learning for Semi-Supervised Classification | Shenkai Zhao et.al. | 2505.20691 | null |
2025-05-26 | UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models | Xueyan Zhang et.al. | 2505.20154 | null |
2025-05-26 | Improvement Strategies for Few-Shot Learning in OCT Image Classification of Rare Retinal Diseases | Cheng-Yu Tai et.al. | 2505.20149 | null |
2025-05-26 | Differential Privacy Analysis of Decentralized Gossip Averaging under Varying Threat Models | Antti Koskela et.al. | 2505.19969 | null |
2025-05-26 | Task-Oriented Low-Label Semantic Communication With Self-Supervised Learning | Run Gu et.al. | 2505.19940 | null |
2025-05-26 | Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation Models | Mobina Mansoori et.al. | 2505.19779 | link |
2025-05-26 | Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments | Junming Liu et.al. | 2505.19699 | null |
2025-05-26 | Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models | Rui Cai et.al. | 2505.19616 | null |
2025-05-26 | Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning | Jiyu Hu et.al. | 2505.19522 | null |
2025-05-26 | DiSa: Directional Saliency-Aware Prompt Learning for Generalizable Vision-Language Models | Niloufar Alipour Talemi et.al. | 2505.19373 | null |
2025-05-25 | Remote Sensing Image Classification with Decoupled Knowledge Distillation | Yaping He et.al. | 2505.19111 | null |
2025-05-24 | MoMBS: Mixed-order minibatch sampling enhances model training from diverse-quality images | Han Li et.al. | 2505.18741 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015 | null |
2025-05-23 | KITINet: Kinetics Theory Inspired Network Architectures with PDE Simulation Approaches | Mingquan Feng et.al. | 2505.17919 | null |
2025-05-23 | Ownership Verification of DNN Models Using White-Box Adversarial Attacks with Specified Probability Manipulation | Teruki Sano et.al. | 2505.17579 | null |
2025-05-23 | Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning | Cheng Peng et.al. | 2505.17436 | null |
2025-05-23 | EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion | Zichuan Yang et.al. | 2505.17367 | null |
2025-05-22 | Extending Dataset Pruning to Object Detection: A Variance-based Approach | Ryota Yagi et.al. | 2505.17245 | null |
2025-05-23 | TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation | Yuhui Zhang et.al. | 2505.16923 | null |
2025-05-22 | Incremental Sequence Classification with Temporal Consistency | Lucas Maystre et.al. | 2505.16548 | null |
2025-05-22 | Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification | Amirreza Mahbod et.al. | 2505.16338 | null |
2025-05-22 | Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings | Arjhun Swaminathan et.al. | 2505.16313 | link |
2025-05-22 | Swin Transformer for Robust CGI Images Detection: Intra- and Inter-Dataset Analysis across Multiple Color Spaces | Preeti Mehta et.al. | 2505.16253 | null |
2025-05-22 | When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification | Zirui Pang et.al. | 2505.16149 | null |
2025-05-21 | Small Language Models in the Real World: Insights from Industrial Text Classification | Lujun Li et.al. | 2505.16078 | null |
2025-05-21 | GradPCA: Leveraging NTK Alignment for Reliable Out-of-Distribution Detection | Mariia Seleznova et.al. | 2505.16017 | null |
2025-05-21 | Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers | Mehran Zoravar et.al. | 2505.15997 | null |
2025-05-21 | Large Language Models as Computable Approximations to Solomonoff Induction | Jun Wan et.al. | 2505.15784 | null |
2025-05-21 | FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models | Zhen Sun et.al. | 2505.15644 | null |
2025-05-21 | SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks | Iuliia Kotseruba et.al. | 2505.15628 | link |
2025-05-21 | Aligning Explanations with Human Communication | Jacopo Teneggi et.al. | 2505.15626 | null |
2025-05-21 | Beyond Linearity: Squeeze-and-Recalibrate Blocks for Few-Shot Whole Slide Image Classification | Conghao Xiong et.al. | 2505.15504 | null |
2025-05-21 | Adaptive Temperature Scaling with Conformal Prediction | Nikita Kotelevskii et.al. | 2505.15437 | null |
2025-05-21 | Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification | Bernardin Ligan et.al. | 2505.15334 | null |
2025-05-21 | Multicrossmodal Automated Agent for Integrating Diverse Materials Science Data | Adib Bazgir et.al. | 2505.15132 | null |
2025-05-20 | Reliable Decision Support with LLMs: A Framework for Evaluating Consistency in Binary Text Classification Applications | Fadel M. Megahed et.al. | 2505.14918 | null |
2025-05-20 | Solving MNIST with a globally trained Mixture of Quantum Experts | Paolo Alessandro Xavier Tognini et.al. | 2505.14789 | null |
2025-05-20 | Guarded Query Routing for Large Language Models | Richard Šléher et.al. | 2505.14524 | null |
2025-05-20 | PRL: Prompts from Reinforcement Learning | Paweł Batorski et.al. | 2505.14412 | null |
2025-05-20 | Domain Adaptation for Multi-label Image Classification: a Discriminator-free Approach | Inder Pal Singh et.al. | 2505.14333 | link |
2025-05-20 | HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing | Shamsuddeen Hassan Muhammad et.al. | 2505.14311 | null |
2025-05-20 | Intra-class Patch Swap for Self-Distillation | Hongjun Choi et.al. | 2505.14124 | link |
2025-05-20 | Scaling Vision Mamba Across Resolutions via Fractal Traversal | Bo Li et.al. | 2505.14062 | null |
2025-05-20 | Learning Concept-Driven Logical Rules for Interpretable and Generalizable Medical Image Classification | Yibo Gao et.al. | 2505.14049 | null |
2025-05-20 | A Challenge to Build Neuro-Symbolic Video Agents | Sahil Shah et.al. | 2505.13851 | null |
2025-05-19 | Synthetic-Powered Predictive Inference | Meshi Bashari et.al. | 2505.13432 | null |
2025-05-20 | Unlabeled Data or Pre-trained Model: Rethinking Semi-Supervised Learning and Pretrain-Finetuning | Song-Lin Li et.al. | 2505.13317 | null |
2025-05-19 | A Physics-Inspired Optimizer: Velocity Regularized Adam | Pranav Vaidhyanathan et.al. | 2505.13196 | null |
2025-05-19 | Emergence of Fixational and Saccadic Movements in a Multi-Level Recurrent Attention Model for Vision | Pengcheng Pan et.al. | 2505.13191 | null |
2025-05-19 | Learning to Adapt to Position Bias in Vision Transformer Classifiers | Robert-Jan Bruintjes et.al. | 2505.13137 | link |
2025-05-19 | When majority rules, minority loses: bias amplification of gradient descent | François Bachoc et.al. | 2505.13122 | null |
2025-05-19 | Expert-Like Reparameterization of Heterogeneous Pyramid Receptive Fields in Efficient CNNs for Fair Medical Image Classification | Xiao Wu et.al. | 2505.13039 | null |
2025-05-19 | EPIC: Explanation of Pretrained Image Classification Networks via Prototype | Piotr Borycki et.al. | 2505.12897 | link |
2025-05-19 | Enhancing Transformers Through Conditioned Embedded Tokens | Hemanth Saratchandran et.al. | 2505.12789 | null |
2025-05-19 | An approach based on class activation maps for investigating the effects of data augmentation on neural networks for image classification | Lucas M. Dorneles et.al. | 2505.12581 | null |
2025-05-16 | Energy efficiency analysis of Spiking Neural Networks for space applications | Paolo Lunghi et.al. | 2505.11418 | null |
2025-05-16 | Harnessing Photon Indistinguishability in Quantum Extreme Learning Machines | Malo Joly et.al. | 2505.11238 | null |
2025-05-16 | CheX-DS: Improving Chest X-ray Image Classification with Ensemble Learning Based on DenseNet and Swin Transformer | Xinran Li et.al. | 2505.11168 | null |
2025-05-16 | Privacy-Aware Lifelong Learning | Ozan Özdenizci et.al. | 2505.10941 | null |
2025-05-16 | MCU: Improving Machine Unlearning through Mode Connectivity | Yingdan Shi et.al. | 2505.10859 | null |
2025-05-15 | CLIP Embeddings for AI-Generated Image Detection: A Few-Shot Study with Lightweight Classifier | Ziyang Ou et.al. | 2505.10664 | null |
2025-05-15 | Research of the Variational Shadow Quantum Circuit Based on the Whale Optimization Algorithm in Image Classification | Shuang Wu et.al. | 2505.09994 | null |
2025-05-14 | Quantum-Enhanced Parameter-Efficient Learning for Typhoon Trajectory Forecasting | Chen-Yu Liu et.al. | 2505.09395 | null |
2025-05-14 | Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis | Bingxin Ke et.al. | 2505.09358 | link |
2025-05-17 | PrePrompt: Predictive prompting for class incremental learning | Libo Huang et.al. | 2505.08586 | link |
2025-05-13 | Convolutional Spiking Neural Network for Image Classification | Mikhail Kiselev et.al. | 2505.08514 | null |
2025-05-13 | CNN and ViT Efficiency Study on Tiny ImageNet and DermaMNIST Datasets | Aidar Amangeldi et.al. | 2505.08259 | null |
2025-05-13 | Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification | Xiaoshuo Yan et.al. | 2505.08173 | null |
2025-05-13 | MoKD: Multi-Task Optimization for Knowledge Distillation | Zeeshan Hayder et.al. | 2505.08170 | null |
2025-05-12 | Hierarchical Sparse Attention Framework for Computationally Efficient Classification of Biological Cells | Elad Yoshai et.al. | 2505.07661 | null |
2025-05-12 | Synthetic Similarity Search in Automotive Production | Christoph Huber et.al. | 2505.07256 | null |
2025-05-12 | Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models | Yan Xie et.al. | 2505.07209 | null |
2025-05-12 | KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification | Hajar Sakai et.al. | 2505.07162 | null |
2025-05-11 | A Vision-Language Foundation Model for Leaf Disease Identification | Khang Nguyen Quoc et.al. | 2505.07019 | null |
2025-05-11 | Image Classification Using a Diffusion Model as a Pre-Training Model | Kosuke Ukita et.al. | 2505.06890 | null |
2025-05-11 | NeuRN: Neuro-inspired Domain Generalization for Image Classification | Hamd Jalil et.al. | 2505.06881 | null |
2025-05-11 | Active Learning for Multi-class Image Classification | Thien Nhan Vo et.al. | 2505.06825 | null |
2025-05-10 | FNBench: Benchmarking Robust Federated Learning against Noisy Labels | Xuefeng Jiang et.al. | 2505.06684 | link |
2025-05-10 | The Efficiency of Pre-training with Objective Masking in Pseudo Labeling for Semi-Supervised Text Classification | Arezoo Hatefi et.al. | 2505.06624 | null |
2025-05-09 | Adapting a Segmentation Foundation Model for Medical Image Classification | Pengfei Gu et.al. | 2505.06217 | null |
2025-05-09 | Towards Robust Few-Shot Text Classification Using Transformer Architectures and Dual Loss Strategies | Xu Han et.al. | 2505.06145 | null |
2025-05-09 | Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification | Leon Eshuijs et.al. | 2505.06032 | link |
2025-05-09 | Efficient Quantum Convolutional Neural Networks for Image Classification: Overcoming Hardware Constraints | Peter Röseler et.al. | 2505.05957 | null |
2025-05-09 | Achieving 3D Attention via Triplet Squeeze and Excitation Block | Maan Alhazmi et.al. | 2505.05943 | null |
2025-05-09 | Improving Generalizability of Kolmogorov-Arnold Networks via Error-Correcting Output Codes | Youngjoon Lee et.al. | 2505.05798 | null |
2025-05-09 | Variational Bayesian Logistic Tensor Regression with Application to Image Recognition | Yunzhi Jin et.al. | 2505.05730 | null |
2025-05-08 | V-EfficientNets: Vector-Valued Efficiently Scaled Convolutional Neural Network Models | Guilherme Vieira Neto et.al. | 2505.05659 | link |
2025-05-08 | KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification | Qianbo Zang et.al. | 2505.05583 | link |
2025-05-08 | Hide & Seek: Transformer Symmetries Obscure Sharpness & Riemannian Geometry Finds It | Marvin F. da Silva et.al. | 2505.05409 | null |
2025-05-08 | Quantum Surrogate-Driven Image Classifier: A Gradient-Free Approach to Avoid Barren Plateaus | Yichen Xie et.al. | 2505.05249 | null |
2025-05-08 | Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models | Wei Peng et.al. | 2505.05189 | null |
2025-05-08 | CacheFL: Efficient Federated Cache Model Fine-Tuning for Vision-Language Models | Mengjun Yi et.al. | 2505.05130 | null |
2025-05-08 | Direct Image Classification from Fourier Ptychographic Microscopy Measurements without Reconstruction | Navya Sonal Agarwal et.al. | 2505.05054 | null |
2025-05-07 | ORXE: Orchestrating Experts for Dynamically Configurable Efficiency | Qingyuan Wang et.al. | 2505.04850 | null |
2025-05-07 | Label-efficient Single Photon Images Classification via Active Learning | Zili Zhang et.al. | 2505.04376 | null |
2025-05-07 | FRAIN to Train: A Fast-and-Reliable Solution for Decentralized Federated Learning | Sanghyeon Park et.al. | 2505.04223 | null |
2025-05-06 | Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment | João Alves et.al. | 2505.03554 | null |
2025-05-06 | Noisy HQNNs: A Comprehensive Analysis of Noise Robustness in Hybrid Quantum Neural Networks | Tasnim Ahmed et.al. | 2505.03378 | null |
2025-05-06 | A Vision-Language Model for Focal Liver Lesion Classification | Song Jian et.al. | 2505.03350 | null |
2025-05-06 | Comparative Analysis of Lightweight Deep Learning Models for Memory-Constrained Devices | Tasnim Shahriar et.al. | 2505.03303 | null |
2025-05-06 | Survey of Abstract Meaning Representation: Then, Now, Future | Behrooz Mansouri et.al. | 2505.03229 | null |
2025-05-06 | seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models | Hafez Ghaemi et.al. | 2505.03176 | null |
2025-05-06 | Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control | Sajjad Rezvani Boroujeni et.al. | 2505.03134 | null |
2025-05-05 | Bayesian Robust Aggregation for Federated Learning | Aleksandr Karakulev et.al. | 2505.02490 | null |
2025-05-06 | Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets | Wei Liu et.al. | 2505.02118 | null |
2025-05-03 | Backdoor Attacks Against Patch-based Mixture of Experts | Cedric Chan et.al. | 2505.01811 | null |
2025-05-03 | Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge | Florian Schmid et.al. | 2505.01747 | null |
2025-05-03 | CLOG-CD: Curriculum Learning based on Oscillating Granularity of Class Decomposed Medical Image Classification | Asmaa Abbas et.al. | 2505.01741 | null |
2025-05-02 | TActiLE: Tiny Active LEarning for wearable devices | Massimo Pavan et.al. | 2505.01160 | null |
2025-04-30 | Towards Improved Cervical Cancer Screening: Vision Transformer-Based Classification and Interpretability | Khoa Tuan Nguyen et.al. | 2504.21340 | null |
2025-04-28 | AGATE: Stealthy Black-box Watermarking for Multimodal Model Copyright Protection | Jianbo Gao et.al. | 2504.21044 | null |
2025-04-29 | Photonic Quantum Convolutional Neural Networks with Adaptive State Injection | Léo Monbroussou et.al. | 2504.20989 | null |
2025-04-30 | DS_FusionNet: Dynamic Dual-Stream Fusion with Bidirectional Knowledge Distillation for Plant Disease Recognition | Yanghui Song et.al. | 2504.20948 | link |
2025-04-29 | MambaMoE: Mixture-of-Spectral-Spatial-Experts State Space Model for Hyperspectral Image Classification | Yichu Xu et.al. | 2504.20509 | null |
2025-04-28 | DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes | Junlin Guo et.al. | 2504.20303 | null |
2025-04-28 | GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets | Mingqian He et.al. | 2504.19898 | null |
2025-04-28 | Reinforcement Learning-Based Heterogeneous Multi-Task Optimization in Semantic Broadcast Communications | Zhilin Lu et.al. | 2504.19806 | null |
2025-04-28 | Explaining Vision GNNs: A Semantic and Visual Analysis of Graph-based Image Classification | Nikolaos Chaidos et.al. | 2504.19682 | null |
2025-04-28 | Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs | Muhammad Sabih et.al. | 2504.19659 | null |
2025-04-28 | Neural network task specialization via domain constraining | Roman Malashin et.al. | 2504.19592 | null |
2025-04-28 | GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability | Sehyeong Jo et.al. | 2504.19414 | null |
2025-04-27 | Dual-Branch Residual Network for Cross-Domain Few-Shot Hyperspectral Image Classification with Refined Prototype | Anyong Qin et.al. | 2504.19074 | null |
2025-04-26 | Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting | Zhyar Rzgar K Rostam et.al. | 2504.19021 | null |
2025-04-26 | A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification | Junichiro Niimi et.al. | 2504.18884 | link |
2025-04-26 | IoT Botnet Detection: Application of Vision Transformer to Classification of Network Flow Traffic | Hassan Wasswa et.al. | 2504.18781 | null |
2025-04-25 | Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models | Patrick Müller et.al. | 2504.18510 | null |
2025-04-25 | Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training | Hiroki Naganuma et.al. | 2504.18454 | null |
2025-04-25 | Passive All-Optical Nonlinear Neuron Activation via PPLN Nanophotonic Waveguides | Wujie Fu et.al. | 2504.18145 | null |
2025-04-25 | DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification | Guohao Huo et.al. | 2504.18046 | null |
2025-04-24 | Disaggregated Deep Learning via In-Physics Computing at Radio Frequency | Zhihui Gao et.al. | 2504.17752 | null |
2025-04-24 | Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction | Farhad Pourkamali-Anaraki et.al. | 2504.17655 | null |
2025-04-24 | Enhanced Sample Selection with Confidence Tracking: Identifying Correctly Labeled yet Hard-to-Learn Samples in Noisy Data | Weiran Pan et.al. | 2504.17474 | null |
2025-04-24 | Dual-Individual Genetic Algorithm: A Dual-Individual Approach for Efficient Training of Multi-Layer Neural Networks | Tran Thuy Nga Truong et.al. | 2504.17346 | null |
2025-04-24 | Evaluating and Mitigating Bias in AI-Based Medical Text Generation | Xiuying Chen et.al. | 2504.17279 | null |
2025-04-24 | Group Downsampling with Equivariant Anti-aliasing | Md Ashiqur Rahman et.al. | 2504.17258 | link |
2025-04-24 | Multi-Modal Traffic Analysis: Integrating Time-Series Forecasting, Accident Prediction, and Image Classification | Nivedita M et.al. | 2504.17232 | null |
2025-04-23 | A Diff-Attention Aware State Space Fusion Model for Remote Sensing Classification | Wenping Ma et.al. | 2504.16665 | null |
2025-04-23 | Streetscape Analysis with Generative AI (SAGAI): Vision-Language Assessment and Mapping of Urban Scenes | Joan Perez et.al. | 2504.16538 | null |
2025-04-24 | An Effective Gram Matrix Characterizes Generalization in Deep Networks | Rubing Yang et.al. | 2504.16450 | null |
2025-04-23 | FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote Sensing | Hariseetharam Gunduboina et.al. | 2504.16433 | null |
2025-04-22 | CLIP-IT: CLIP-based Pairing for Histology Images Classification | Banafsheh Karimian et.al. | 2504.16181 | null |
2025-04-22 | Automated Bug Report Prioritization in Large Open-Source Projects | Riley Pierson et.al. | 2504.15912 | null |
2025-04-22 | Generative AI for Research Data Processing: Lessons Learnt From Three Use Cases | Modhurita Mitra et.al. | 2504.15829 | null |
2025-04-22 | DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers | Xuyang Zhong et.al. | 2504.15827 | null |
2025-04-22 | HS-Mamba: Full-Field Interaction Multi-Groups Mamba for Hyperspectral Image Classification | Hongxing Peng et.al. | 2504.15612 | null |
2025-04-22 | LLM-based Semantic Augmentation for Harmful Content Detection | Elyas Meguellati et.al. | 2504.15548 | null |
2025-04-21 | Feeding LLM Annotations to BERT Classifiers at Your Own Risk | Yucheng Lu et.al. | 2504.15432 | null |
2025-04-21 | Dynamic 3D KAN Convolution with Adaptive Grid Optimization for Hyperspectral Image Classification | Guandong Li et.al. | 2504.15155 | null |
2025-04-21 | Application of Sensitivity Analysis Methods for Studying Neural Network Models | Jiaxuan Miao et.al. | 2504.15100 | null |
2025-04-21 | Trainable Quantum Neural Network for Multiclass Image Classification with the Power of Pre-trained Tree Tensor Networks | Keisuke Murota et.al. | 2504.14995 | null |
2025-04-21 | ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages | Zhoujie Qian et.al. | 2504.14825 | null |
2025-04-21 | What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale | Xiaoyong Yuan et.al. | 2504.14815 | null |
2025-04-21 | A Basic Evaluation of Neural Networks Trained with the Error Diffusion Learning Algorithm | Kazuhisa Fujita et.al. | 2504.14814 | null |
2025-04-19 | Learning from Stochastic Teacher Representations Using Student-Guided Knowledge Distillation | Muhammad Haseeb Aslam et.al. | 2504.14307 | null |
2025-04-19 | Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation | Johannes Spoecklberger et.al. | 2504.14231 | null |
2025-04-19 | Enhancing Multimodal In-Context Learning for Image Classification through Coreset Optimization | Huiyi Chen et.al. | 2504.14200 | null |
2025-04-19 | ThyroidEffi 1.0: A Cost-Effective System for High-Performance Multi-Class Thyroid Carcinoma Classification | Hai Pham-Ngoc et.al. | 2504.14139 | null |
2025-04-18 | Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | Junjie Yang et.al. | 2504.13825 | null |
2025-04-18 | CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning | Yang Yue et.al. | 2504.13820 | link |
2025-04-18 | Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis | Zhu Zhu et.al. | 2504.13754 | null |
2025-04-18 | Human-aligned Deep Learning: Explainability, Causality, and Biological Inspiration | Gianluca Carloni et.al. | 2504.13717 | null |
2025-04-18 | Word Embedding Techniques for Classification of Star Ratings | Hesham Abdelmotaleb et.al. | 2504.13653 | null |
2025-04-18 | Cross-Hierarchical Bidirectional Consistency Learning for Fine-Grained Visual Classification | Pengxiang Gao et.al. | 2504.13608 | null |
2025-04-18 | MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework | Zhenkai Qin et.al. | 2504.13574 | null |
2025-04-18 | Bayesian continual learning and forgetting in neural networks | Djohan Bonnet et.al. | 2504.13569 | null |
2025-04-17 | Dynamic Memory-enhanced Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2504.13242 | null |
2025-04-17 | Perception Encoder: The best visual embeddings are not at the output of the network | Daniel Bolya et.al. | 2504.13181 | null |
2025-04-17 | Expert Kernel Generation Network Driven by Contextual Mapping for Hyperspectral Image Classification | Guandong Li et.al. | 2504.13045 | null |
2025-04-17 | Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification | Reek Majumder et.al. | 2504.12644 | null |
2025-04-16 | GLUSE: Enhanced Channel-Wise Adaptive Gated Linear Units SE for Onboard Satellite Earth Observation Image Classification | Thanh-Dung Le et.al. | 2504.12484 | null |
2025-04-16 | FLIP Reasoning Challenge | Andreas Plesner et.al. | 2504.12256 | null |
2025-04-16 | Weakly Semi-supervised Whole Slide Image Classification by Two-level Cross Consistency Supervision | Linhao Qu et.al. | 2504.12132 | null |
2025-04-16 | Exploring Video-Based Driver Activity Recognition under Noisy Labels | Linjuan Fan et.al. | 2504.11966 | link |
2025-04-17 | Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification | Yue Li et.al. | 2504.11793 | null |
2025-04-15 | The Pontryagin Maximum Principle for Training Convolutional Neural Networks | Sebastian Hofmann et.al. | 2504.11647 | null |
2025-04-15 | Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability: A Comprehensive Survey | Siteng Ma et.al. | 2504.11588 | null |
2025-04-15 | Diversity-Driven Learning: Tackling Spurious Correlations and Data Heterogeneity in Federated Models | Gergely D. Németh et.al. | 2504.11216 | null |
2025-04-15 | Embedding Radiomics into Vision Transformers for Multimodal Medical Image Classification | Zhenyu Yang et.al. | 2504.10916 | null |
2025-04-15 | Progressive Rock Music Classification | Arpan Nagar et.al. | 2504.10821 | null |
2025-04-15 | 3D Wavelet Convolutions with Extended Receptive Fields for Hyperspectral Image Classification | Guandong Li et.al. | 2504.10795 | null |
2025-04-14 | Quantum Image Classification: Experiments on Utility-Scale Quantum Computers | Hrant Gharibyan et.al. | 2504.10595 | null |
2025-04-14 | LEMUR Neural Network Dataset: Towards Seamless AutoML | Arash Torabi Goodarzi et.al. | 2504.10552 | null |
2025-04-13 | An Efficient Quantum Classifier Based on Hamiltonian Representations | Federico Tiblias et.al. | 2504.10542 | null |
2025-04-14 | Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning | LeiLei Ma et.al. | 2504.09990 | null |
2025-04-14 | GFT: Gradient Focal Transformer | Boris Kriuk et.al. | 2504.09852 | null |
2025-04-13 | PCM-SAR: Physics-Driven Contrastive Mutual Learning for SAR Classification | Pengfei Wang et.al. | 2504.09502 | null |
2025-04-13 | InfoBound: A Provable Information-Bounds Inspired Framework for Both OoD Generalization and OoD Detection | Lin Zhu et.al. | 2504.09448 | null |
2025-04-13 | Sparse Deformable Mamba for Hyperspectral Image Classification | Lincoln Linlin Xu et.al. | 2504.09446 | null |
2025-04-12 | Cycle Training with Semi-Supervised Domain Adaptation: Bridging Accuracy and Efficiency for Real-Time Mobile Scene Detection | Huu-Phong Phan-Nguyen et.al. | 2504.09297 | null |
2025-04-12 | Sparse Hybrid Linear-Morphological Networks | Konstantinos Fotopoulos et.al. | 2504.09289 | null |
2025-04-12 | Mixture of Group Experts for Learning Invariant Representations | Lei Kang et.al. | 2504.09265 | null |
2025-04-12 | Langformers: Unified NLP Pipelines for Language Models | Rabindra Lamsal et.al. | 2504.09170 | null |
2025-04-12 | Evolved Hierarchical Masking for Self-Supervised Learning | Zhanzhou Feng et.al. | 2504.09155 | null |
2025-04-11 | Hypergraph Vision Transformers: Images are More than Nodes, More than Edges | Joshua Fixelle et.al. | 2504.08710 | null |
2025-04-11 | Integrated ensemble of BERT- and features-based models for authorship attribution in Japanese literary works | Taisei Kanda et.al. | 2504.08527 | null |
2025-04-11 | An Early Experience with Confidential Computing Architecture for On-Device Model Protection | Sina Abdollahi et.al. | 2504.08508 | null |
2025-04-11 | The inherent convolution property of quantum neural networks | Guangkai Qu et.al. | 2504.08487 | null |
2025-04-11 | A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Medical Image Classification | Kerol Djoumessi et.al. | 2504.08481 | null |
2025-04-11 | FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations | Cheng-Yu Hsieh et.al. | 2504.08368 | null |
2025-04-11 | Comparative Analysis of Different Methods for Classifying Polychromatic Sketches | Fahd Baba et.al. | 2504.08186 | null |
2025-04-11 | Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks | Erin Carson et.al. | 2504.07835 | null |
2025-04-10 | Traversal Learning Coordination For Lossless And Efficient Distributed Learning | Erdenebileg Batbaatar et.al. | 2504.07471 | null |
2025-04-09 | Identifying regions of interest in whole slide images of renal cell carcinoma | Mohammed Lamine Benomar et.al. | 2504.07313 | null |
2025-04-09 | A new training approach for text classification in Mental Health: LatentGLoss | Korhan Sevinç et.al. | 2504.07245 | null |
2025-04-09 | Deep Learning for Cardiovascular Risk Assessment: Proxy Features from Carotid Sonography as Predictors of Arterial Damage | Christoph Balada et.al. | 2504.06680 | null |
2025-04-08 | Memory-Modular Classification: Learning to Generalize with Memory Replacement | Dahyun Kang et.al. | 2504.06021 | null |
2025-04-08 | Federated Unlearning Made Practical: Seamless Integration via Negated Pseudo-Gradients | Alessio Mora et.al. | 2504.05822 | null |
2025-04-08 | DefMamba: Deformable Visual State Space Model | Leiye Liu et.al. | 2504.05794 | null |
2025-04-08 | Layer-Aware Embedding Fusion for LLMs in Text Classifications | Jiho Gwak et.al. | 2504.05764 | null |
2025-04-07 | REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding | Sakib Reza et.al. | 2504.05491 | null |
2025-04-07 | Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability | Mohammad Hossein Najafi et.al. | 2504.05483 | null |
2025-04-07 | Explaining Low Perception Model Competency with High-Competency Counterfactuals | Sara Pohland et.al. | 2504.05254 | null |
2025-04-07 | Federated Learning for Medical Image Classification: A Comprehensive Benchmark | Zhekai Zhou et.al. | 2504.05238 | null |
2025-04-07 | Batch Aggregation: An Approach to Enhance Text Classification with Correlated Augmented Data | Charco Hui et.al. | 2504.05020 | null |
2025-04-07 | RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model | Congcong Wen et.al. | 2504.04988 | null |
2025-04-06 | Your Image Generator Is Your New Private Dataset | Nicolo Resmini et.al. | 2504.04582 | null |
2025-04-06 | Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification | Shijian Wang et.al. | 2504.04510 | null |
2025-04-06 | Spatial-Geometry Enhanced 3D Dynamic Snake Convolutional Neural Network for Hyperspectral Image Classification | Guandong Li et.al. | 2504.04463 | null |
2025-04-05 | A Comparative Study of Explainable AI Methods: Model-Agnostic vs. Model-Specific Approaches | Keerthi Devireddy et.al. | 2504.04276 | null |
2025-04-05 | GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models | Hengyu Luo et.al. | 2504.04155 | null |
2025-04-05 | Scaling Federated Learning Solutions with Kubernetes for Synthesizing Histopathology Images | Andrei-Alexandru Preda et.al. | 2504.04130 | null |
2025-04-04 | Adaptive Classification of Interval-Valued Time Series | Wan Tian et.al. | 2504.03318 | null |
2025-04-04 | Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Junlang Qian et.al. | 2504.03159 | null |
2025-04-03 | HQViT: Hybrid Quantum Vision Transformer for Image Classification | Hui Zhang et.al. | 2504.02730 | null |
2025-04-03 | LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection | YiMing Yu et.al. | 2504.02280 | null |
2025-04-02 | Neural Style Transfer for Synthesising a Dataset of Ancient Egyptian Hieroglyphs | Lewis Matheson Creed et.al. | 2504.02163 | null |
2025-04-02 | A thorough benchmark of automatic text classification: From traditional approaches to large language models | Washington Cunha et.al. | 2504.01930 | link |
2025-04-02 | A Randomized Zeroth-Order Hierarchical Framework for Heterogeneous Federated Learning | Yuyang Qiu et.al. | 2504.01839 | null |
2025-04-02 | A Novel Approach To Implementing Knowledge Distillation In Tsetlin Machines | Calvin Kinateder et.al. | 2504.01798 | null |
2025-04-02 | Token Pruning in Audio Transformers: Optimizing Performance and Decoding Patch Importance | Taehan Lee et.al. | 2504.01690 | link |
2025-04-02 | All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning | Zheng Yang et.al. | 2504.01396 | null |
2025-04-01 | TenAd: A Tensor-based Low-rank Black Box Adversarial Attack for Video Classification | Kimia haghjooei et.al. | 2504.01228 | null |
2025-04-01 | PolygoNet: Leveraging Simplified Polygonal Representation for Effective Image Classification | Salim Khazem et.al. | 2504.01214 | link |
2025-04-01 | Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems | Rachmad Vidya Wicaksana Putra et.al. | 2504.00957 | null |
2025-04-01 | Impact of Data Duplication on Deep Neural Network-Based Image Classifiers: Robust vs. Standard Models | Alireza Aghabagherloo et.al. | 2504.00638 | null |
2025-04-01 | Geometric Median Matching for Robust k-Subset Selection from Noisy Data | Anish Acharya et.al. | 2504.00564 | null |
2025-03-31 | NoProp: Training Neural Networks without Back-propagation or Forward-propagation | Qinyu Li et.al. | 2503.24322 | null |
2025-03-31 | CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization | Yingrui Ji et.al. | 2503.24182 | null |
2025-03-31 | PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI Localization | Alexis Guichemerre et.al. | 2503.24135 | link |
2025-03-31 | Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification | Chenqi Guo et.al. | 2503.24017 | null |
2025-03-31 | FlexiMo: A Flexible Remote Sensing Foundation Model | Xuyang Li et.al. | 2503.23844 | null |
2025-03-31 | Expanding-and-Shrinking Binary Neural Networks | Xulong Shi et.al. | 2503.23709 | link |
2025-03-31 | WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation | Zhengyi Zhao et.al. | 2503.23673 | null |
2025-03-30 | Efficient Dynamic Attention 3D Convolution for Hyperspectral Image Classification | Guandong Li et.al. | 2503.23472 | null |
2025-03-30 | KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters | Haiduo Huang et.al. | 2503.23379 | link |
2025-03-29 | Optimizing Distributed Training Approaches for Scaling Neural Networks | Vishnu Vardhan Baligodugula et.al. | 2503.23186 | null |
2025-03-28 | Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models | YangTian Yan et.al. | 2503.22205 | link |
2025-03-28 | Route-and-Aggregate Decentralized Federated Learning Under Communication Errors | Weicai Li et.al. | 2503.22186 | null |
2025-03-27 | On Large Multimodal Models as Open-World Image Classifiers | Alessandro Conti et.al. | 2503.21851 | link |
2025-03-27 | Bayesian Pseudo Posterior Mechanism for Differentially Private Machine Learning | Robert Chew et.al. | 2503.21528 | null |
2025-03-27 | Retinal Fundus Multi-Disease Image Classification using Hybrid CNN-Transformer-Ensemble Architectures | Deependra Singh et.al. | 2503.21465 | link |
2025-03-27 | Fine-Tuning LLMs on Small Medical Datasets: Text Classification and Normalization Effectiveness on Cardiology reports and Discharge records | Noah Losch et.al. | 2503.21349 | null |
2025-03-27 | Improving |
Mario García-Márquez et.al. | 2503.21244 | link |
2025-03-27 | Neural Architecture Search by Learning a Hierarchical Search Space | Mehraveh Javan Roshtkhari et.al. | 2503.21061 | null |
2025-03-26 | TS-Inverse: A Gradient Inversion Attack Tailored for Federated Time Series Forecasting Models | Caspar Meijer et.al. | 2503.20952 | link |
2025-03-26 | VESTA: A Versatile SNN-Based Transformer Accelerator with Unified PEs for Multiple Computational Layers | Ching-Yao Chen et.al. | 2503.20246 | null |
2025-03-26 | BeLightRec: A lightweight recommender system enhanced with BERT | Manh Mai Van et.al. | 2503.20206 | null |
2025-03-25 | Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders | Paul Koch et.al. | 2503.19947 | null |
2025-03-25 | Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification | Daniel G. P. Petrini et.al. | 2503.19945 | null |
2025-03-25 | Extensions of regret-minimization algorithm for optimal design | Youguang Chen et.al. | 2503.19874 | null |
2025-03-25 | VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models | Suhas G Hegde et.al. | 2503.19530 | null |
2025-03-25 | LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Weizhi Chen et.al. | 2503.19311 | null |
2025-03-25 | Face Spoofing Detection using Deep Learning | Najeebullah et.al. | 2503.19223 | link |
2025-03-24 | Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation | DeShin Hwa et.al. | 2503.18862 | null |
2025-03-24 | Latent Space Class Dispersion: Effective Test Data Quality Assessment for DNNs | Vivek Vekariya et.al. | 2503.18799 | null |
2025-03-24 | Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks | Nina Shvetsova et.al. | 2503.18637 | null |
2025-03-24 | Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification | Zequn Zeng et.al. | 2503.18483 | null |
2025-03-24 | Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning | Junsong Li et.al. | 2503.18432 | null |
2025-03-24 | Sun-Shine: A Large Language Model for Tibetan Culture | Cheng Huang et.al. | 2503.18288 | null |
2025-03-23 | Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry | Chi-Ning Chou et.al. | 2503.18114 | null |
2025-03-23 | What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images | Dongheng Lin et.al. | 2503.17899 | null |
2025-03-21 | Spatiotemporal Learning with Context-aware Video Tubelets for Ultrasound Video Analysis | Gary Y. Li et.al. | 2503.17475 | null |
2025-03-21 | Leveraging Text-to-Image Generation for Handling Spurious Correlation | Aryan Yazdan Parast et.al. | 2503.17226 | null |
2025-03-21 | CoRLD: Contrastive Representation Learning Of Deformable Shapes In Images | Tonmoy Hossain ana Miaomiao Zhang et.al. | 2503.17162 | null |
2025-03-21 | Beyond Accuracy: What Matters in Designing Well-Behaved Models? | Robin Hesse et.al. | 2503.17110 | null |
2025-03-21 | Symbolic Audio Classification via Modal Decision Tree Learning | Enrico Marzano et.al. | 2503.17018 | null |
2025-03-21 | EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision | Xiaofeng Mao et.al. | 2503.16975 | null |
2025-03-21 | City2Scene: Improving Acoustic Scene Classification with City Features | Yiqiang Cai et.al. | 2503.16862 | null |
2025-03-20 | MobilePlantViT: A Mobile-friendly Hybrid ViT for Generalized Plant Disease Image Classification | Moshiur Rahman Tonmoy et.al. | 2503.16628 | null |
2025-03-20 | PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification | Sharon Peled et.al. | 2503.16284 | link |
2025-03-20 | CLS-RL: Image Classification with Rule-Based Reinforcement Learning | Ming Li et.al. | 2503.16188 | null |
2025-03-20 | Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models | Mario Sanz-Guerrero et.al. | 2503.16022 | link |
2025-03-20 | Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation | Clive Tinashe Marimo et.al. | 2503.15969 | null |
2025-03-19 | Graph-Weighted Contrastive Learning for Semi-Supervised Hyperspectral Image Classification | Yuqing Zhang et.al. | 2503.15731 | null |
2025-03-20 | Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification | ZhengLin Lai et.al. | 2503.15469 | link |
2025-03-19 | Test-Time Backdoor Detection for Object Detection Models | Hangtao Zhang et.al. | 2503.15293 | null |
2025-03-19 | Efficient allocation of image recognition and LLM tasks on multi-GPU system | Marcin Lawenda et.al. | 2503.15252 | null |
2025-03-19 | Comparing Llama3 and DeepSeekR1 on Biomedical Text Classification Tasks | Yuting Guo et.al. | 2503.15169 | null |
2025-03-19 | ARC: Anchored Representation Clouds for High-Resolution INR Classification | Joost Luijmes et.al. | 2503.15156 | null |
2025-03-19 | Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models | Tingxiu Chen et.al. | 2503.14966 | null |
2025-03-19 | Optimal Transport Adapter Tuning for Bridging Modality Gaps in Few-Shot Remote Sensing Scene Classification | Zhong Ji et.al. | 2503.14938 | null |
2025-03-18 | RAT: Boosting Misclassification Detection Ability without Extra Data | Ge Yan et.al. | 2503.14783 | null |
2025-03-18 | LipShiFT: A Certifiably Robust Shift-based Vision Transformer | Rohan Menon et.al. | 2503.14751 | null |
2025-03-18 | Utilization of Neighbor Information for Image Classification with Different Levels of Supervision | Gihan Jayatilaka et.al. | 2503.14500 | null |
2025-03-17 | Neural Edge Histogram Descriptors for Underwater Acoustic Target Recognition | Atharva Agashe et.al. | 2503.13763 | null |
2025-03-17 | Micro Text Classification Based on Balanced Positive-Unlabeled Learning | Lin-Han Jia et.al. | 2503.13562 | null |
2025-03-17 | Escaping Plato's Cave: Robust Conceptual Reasoning through Interpretable 3D Neural Object Volumes | Nhi Pham et.al. | 2503.13429 | null |
2025-03-17 | Do Vision Models Develop Human-Like Progressive Difficulty Understanding? | Zeyi Huang et.al. | 2503.13058 | null |
2025-03-16 | Domain Generalization for Improved Human Activity Recognition in Office Space Videos Using Adaptive Pre-processing | Partho Ghosh et.al. | 2503.12678 | null |
2025-03-16 | Scaling Semantic Categories: Investigating the Impact on Vision Transformer Labeling Performance | Anthony Lamelas et.al. | 2503.12617 | null |
2025-03-16 | Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy | Jian-Ping Mei et.al. | 2503.12497 | null |
2025-03-16 | GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | Zilun Zhang et.al. | 2503.12490 | null |
2025-03-16 | Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation | Edgar Heinert et.al. | 2503.12453 | null |
2025-03-16 | MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification | Jianwei Zhao et.al. | 2503.12401 | null |
2025-03-15 | TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification | Ans Munir et.al. | 2503.12206 | null |
2025-03-15 | Goal-Oriented Source Coding using LDPC Codes for Compressed-Domain Image Classification | Ahcen Aliouat et.al. | 2503.11954 | null |
2025-03-14 | Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification | Tobias Morocutti et.al. | 2503.11363 | null |
2025-03-14 | PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models | Mayank Nautiyal et.al. | 2503.11360 | null |
2025-03-14 | APLA: A Simple Adaptation Method for Vision Transformers | Moein Sorkhei et.al. | 2503.11335 | null |
2025-03-14 | Open-Set Plankton Recognition | Joona Kareinen et.al. | 2503.11318 | null |
2025-03-14 | MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery | Yansheng Li et.al. | 2503.11219 | null |
2025-03-14 | Falcon: A Remote Sensing Vision-Language Foundation Model | Kelu Yao et.al. | 2503.11070 | null |
2025-03-13 | Juan Felipe Gomez et.al. | 2503.10945 | null | |
2025-03-13 | Learning Interpretable Logic Rules from Deep Vision Models | Chuqin Geng et.al. | 2503.10547 | null |
2025-03-13 | Extreme Learning Machines for Attention-based Multiple Instance Learning in Whole-Slide Image Classification | Rajiv Krishnakumar et.al. | 2503.10510 | null |
2025-03-13 | RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Fengxiang Wang et.al. | 2503.10392 | link |
2025-03-13 | PS3C: An Ensemble-Based Two-Step Framework for Classification of Pep Smear Cell Images | Theo Di Piazza et.al. | 2503.10312 | link |
2025-03-13 | Wikipedia is Not a Dictionary, Delete! Text Classification as a Proxy for Analysing Wiki Deletion Discussions | Hsuvas Borkakoty et.al. | 2503.10294 | null |
2025-03-13 | A Multi-Modal Federated Learning Framework for Remote Sensing Image Classification | Barış Büyüktaş et.al. | 2503.10262 | null |
2025-03-13 | Interpretable Image Classification via Non-parametric Part Prototype Learning | Zhijie Zhu et.al. | 2503.10247 | null |
2025-03-13 | Multiplicative Learning | Han Kim et.al. | 2503.10144 | null |
2025-03-13 | Cognitive-Mental-LLM: Leveraging Reasoning in Large Language Models for Mental Health Prediction via Online Text | Avinash Patil et.al. | 2503.10095 | null |
2025-03-13 | Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild | Damien Teney et.al. | 2503.10065 | null |
2025-03-12 | Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching | Nannan Wu et.al. | 2503.09587 | null |
2025-03-12 | Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework | Bakary Badjie et.al. | 2503.09504 | null |
2025-03-12 | ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation | Tobias Christian Nauen et.al. | 2503.09399 | null |
2025-03-12 | Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity | Daniel Jiménez-López et.al. | 2503.09365 | null |
2025-03-12 | Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X | Katharina Prasse et.al. | 2503.09361 | null |
2025-03-12 | Bayesian Test-Time Adaptation for Vision-Language Models | Lihua Zhou et.al. | 2503.09248 | null |
2025-03-12 | Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information | Youngju Joung et.al. | 2503.09068 | null |
2025-03-12 | Discovering Influential Neuron Path in Vision Transformers | Yifan Wang et.al. | 2503.09046 | null |
2025-03-11 | KAN-Mixers: a new deep learning architecture for image classification | Jorge Luiz dos Santos Canuto et.al. | 2503.08939 | null |
2025-03-12 | MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification | Jiangping Wen et.al. | 2503.08581 | null |
2025-03-11 | Generalizable and Explainable Deep Learning for Medical Image Computing: An Overview | Ahmad Chaddad et.al. | 2503.08420 | null |
2025-03-11 | Prototype-Based Multiple Instance Learning for Gigapixel Whole Slide Image Classification | Susu Sun et.al. | 2503.08384 | null |
2025-03-11 | Tangentially Aligned Integrated Gradients for User-Friendly Explanations | Lachlan Simpson et.al. | 2503.08240 | null |
2025-03-11 | EnergyFormer: Energy Attention with Fourier Embedding for Hyperspectral Image Classification | Saad Sohail et.al. | 2503.08239 | null |
2025-03-11 | Identification of Star Clusters in M31 from PAndAS Images Based on Deep Learning | Baisong Zhang et.al. | 2503.08130 | null |
2025-03-11 | LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking | Yan Yan et.al. | 2503.07968 | null |
2025-03-12 | Measuring directional bias amplification in image captions using predictability | Rahul Nair et.al. | 2503.07878 | null |
2025-03-10 | Fair Text Classification via Transferable Representations | Thibaud Leteno et.al. | 2503.07691 | null |
2025-03-10 | Keeping Representation Similarity in Finetuning for Medical Image Analysis | Wenqiang Zu et.al. | 2503.07399 | null |
2025-03-10 | Brain Inspired Adaptive Memory Dual-Net for Few-Shot Image Classification | Kexin Di et.al. | 2503.07396 | null |
2025-03-10 | Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs | Gonzalo Mancera et.al. | 2503.07384 | null |
2025-03-10 | Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification | Thomas Boucher et.al. | 2503.07294 | null |
2025-03-10 | A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding | Bingchen Liu et.al. | 2503.07202 | null |
2025-03-10 | Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization | Ziqing Xu et.al. | 2503.06982 | null |
2025-03-10 | Task Vector Quantization for Memory-Efficient Model Merging | Youngeun Kim et.al. | 2503.06921 | null |
2025-03-10 | MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification | Xiangyan Qu et.al. | 2503.06847 | null |
2025-03-09 | Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals | Hanze Li et.al. | 2503.06473 | null |
2025-03-09 | M |
Mingxiang Cao et.al. | 2503.06446 | null |
2025-03-07 | Similarity-Based Domain Adaptation with LLMs | Jie He et.al. | 2503.05281 | null |
2025-03-07 | Spatial Context-Driven Positive Pair Sampling for Enhanced Histopathology Image Classification | Willmer Rafell Quinones Robles et.al. | 2503.05170 | null |
2025-03-07 | Ensemble Debiasing Across Class and Sample Levels for Fairer Prompting Accuracy | Ruixi Lin et.al. | 2503.05157 | null |
2025-03-07 | Grouped Sequential Optimization Strategy -- the Application of Hyperparameter Importance Assessment in Deep Learning | Ruinan Wang et.al. | 2503.05106 | null |
2025-03-06 | HieroLM: Egyptian Hieroglyph Recovery with Next Word Prediction Language Model | Xuheng Cai et.al. | 2503.04996 | null |
2025-03-06 | Label Distribution Learning-Enhanced Dual-KNN for Text Classification | Bo Yuan et.al. | 2503.04869 | null |
2025-03-06 | Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification | Van Bach Nguyen et.al. | 2503.04463 | null |
2025-03-06 | WeakSupCon: Weakly Supervised Contrastive Learning for Encoder Pre-training | Bodong Zhang et.al. | 2503.04165 | null |
2025-03-04 | Measurement noise scaling laws for cellular representation learning | Gokul Gowri et.al. | 2503.02726 | null |
2025-03-04 | XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification | Xiaoyu Zheng et.al. | 2503.02619 | null |
2025-03-04 | Remote Sensing Image Classification Using Convolutional Neural Network (CNN) and Transfer Learning Techniques | Mustafa Majeed Abd Zaid et.al. | 2503.02510 | null |
2025-03-06 | Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer | Yujiao Yang et.al. | 2503.02495 | null |
2025-03-04 | Making Better Mistakes in CLIP-Based Zero-Shot Classification with Hierarchy-Aware Language Prompts | Tong Liang et.al. | 2503.02248 | null |
2025-03-04 | Sharpness-Aware Minimization: General Analysis and Improved Rates | Dimitris Oikonomou et.al. | 2503.02225 | null |
2025-03-03 | Mathematical Foundation of Interpretable Equivariant Surrogate Models | Jacopo Joy Colombini et.al. | 2503.01942 | null |
2025-03-03 | Visual-RFT: Visual Reinforcement Fine-Tuning | Ziyu Liu et.al. | 2503.01785 | link |
2025-03-03 | Mamba base PKD for efficient knowledge compression | José Medina et.al. | 2503.01727 | null |
2025-03-04 | SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting | Ali Caglayan et.al. | 2503.01181 | null |
2025-03-03 | Large Language Models for Healthcare Text Classification: A Systematic Review | Hajar Sakai et.al. | 2503.01159 | null |
2025-03-03 | Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning | Jiuyang Dong et.al. | 2502.21130 | null |
2025-02-28 | Comparative study of the ansätze in quantum language models | Jordi Del Castillo et.al. | 2502.20744 | null |
2025-02-28 | Exploring the Impact of Temperature Scaling in Softmax for Classification and Adversarial Robustness | Hao Xuan et.al. | 2502.20604 | null |
2025-02-27 | In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models | Hu Wang et.al. | 2502.20516 | null |
2025-02-27 | Online Meta-learning for AutoML in Real-time (OnMAR) | Mia Gerber et.al. | 2502.20279 | null |
2025-03-03 | Gradient-Guided Annealing for Domain Generalization | Aristotelis Ballas et.al. | 2502.20162 | link |
2025-02-27 | QPM: Discrete Optimization for Globally Interpretable Image Classification | Thomas Norrenbrock et.al. | 2502.20130 | link |
2025-02-27 | ProAPO: Progressively Automatic Prompt Optimization for Visual Classification | Xiangyan Qu et.al. | 2502.19844 | link |
2025-02-27 | Text classification using machine learning methods | Bogdan Oancea et.al. | 2502.19801 | null |
2025-02-27 | InPK: Infusing Prior Knowledge into Prompt for Vision-Language Models | Shuchang Zhou et.al. | 2502.19777 | null |
2025-02-27 | Learning Mask Invariant Mutual Information for Masked Image Modeling | Tao Huang et.al. | 2502.19718 | null |
2025-02-27 | Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model | Yimin Zhu et.al. | 2502.19700 | null |
2025-02-27 | Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification | Yimin Zhu et.al. | 2502.19699 | null |
2025-02-27 | A Residual Multi-task Network for Joint Classification and Regression in Medical Imaging | Junji Lin et.al. | 2502.19692 | null |
2025-02-26 | I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning | Stephan Rabanser et.al. | 2502.19335 | null |
2025-02-26 | Active Few-Shot Learning for Text Classification | Saeed Ahmadnia et.al. | 2502.18782 | null |
2025-02-25 | Enhancing Image Classification with Augmentation: Data Augmentation Techniques for Improved Image Classification | Saorj Kumar et.al. | 2502.18691 | null |
2025-02-25 | Enhancing Text Classification with a Novel Multi-Agent Collaboration Framework Leveraging BERT | Hediyeh Baban et.al. | 2502.18653 | null |
2025-02-25 | MedKAN: An Advanced Kolmogorov-Arnold Network for Medical Image Classification | Zhuoqin Yang et.al. | 2502.18416 | null |
2025-02-26 | A Fusion Model for Art Author Identification Based on Convolutional Neural Networks and Transformers | Zhenyu Wang et.al. | 2502.18083 | null |
2025-02-25 | MAGE: Multi-Head Attention Guided Embeddings for Low Resource Sentiment Classification | Varun Vashisht et.al. | 2502.17987 | null |
2025-02-25 | Dual Classification Head Self-training Network for Cross-scene Hyperspectral Image Classification | Rong Liu et.al. | 2502.17879 | null |
2025-02-24 | Can Score-Based Generative Modeling Effectively Handle Medical Image Classification? | Sushmita Sarker et.al. | 2502.17727 | null |
2025-02-24 | A Priori Generalizability Estimate for a CNN | Cito Balsells et.al. | 2502.17622 | null |
2025-02-24 | Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models | Andrew DiGiugno et.al. | 2502.17206 | null |
2025-02-24 | Disentangling Visual Transformers: Patch-level Interpretability for Image Classification | Guillaume Jeanneret et.al. | 2502.17196 | null |
2025-02-24 | Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment | Chenghao Fan et.al. | 2502.16894 | null |
2025-02-24 | Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data | Yejian Zhang et.al. | 2502.16892 | null |
2025-02-24 | A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition | Dewan Tauhid Rahman et.al. | 2502.16762 | null |
2025-02-23 | AUKT: Adaptive Uncertainty-Guided Knowledge Transfer with Conformal Prediction | Rui Liu et.al. | 2502.16736 | null |
2025-02-22 | MOB-GCN: A Novel Multiscale Object-Based Graph Neural Network for Hyperspectral Image Classification | Tuan-Anh Yang et.al. | 2502.16289 | link |
2025-02-22 | A Multi-Scale Isolation Forest Approach for Real-Time Detection and Filtering of FGSM Adversarial Attacks in Video Streams of Autonomous Vehicles | Richard Abhulimhen et.al. | 2502.16044 | null |
2025-02-21 | MMRAG: Multi-Mode Retrieval-Augmented Generation with Large Language Models for Biomedical In-Context Learning | Zaifu Zhan et.al. | 2502.15954 | null |
2025-02-21 | Directional Gradient Projection for Robust Fine-Tuning of Foundation Models | Chengyue Huang et.al. | 2502.15895 | null |
2025-02-21 | MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models | Suraj Racha et.al. | 2502.15418 | null |
2025-02-21 | A Novel Riemannian Sparse Representation Learning Network for Polarimetric SAR Image Classification | Junfei Shi et.al. | 2502.15302 | null |
2025-02-21 | Quantum autoencoders for image classification | Hinako Asaoka et.al. | 2502.15254 | null |
2025-02-21 | Steganographic Embeddings as an Effective Data Augmentation | Nicholas DiSalvo et.al. | 2502.15245 | null |
2025-02-21 | Learning to Collaborate: A Capability Vectors-based Architecture for Adaptive Human-AI Decision Making | Renlong Jie et.al. | 2502.15196 | null |
2025-02-21 | TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba | Xiuwei Chen et.al. | 2502.15130 | null |
2025-02-20 | Fundamental Survey on Neuromorphic Based Audio Classification | Amlan Basu et.al. | 2502.15056 | null |
2025-02-20 | Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Maha Ezzelarab et.al. | 2502.14995 | null |
2025-02-20 | Sparse Activations as Conformal Predictors | Margarida M. Campos et.al. | 2502.14773 | link |
2025-02-20 | An Enhancement of Jiang, Z., et al.s Compression-Based Classification Algorithm Applied to News Article Categorization | Sean Lester C. Benavides et.al. | 2502.14444 | null |
2025-02-20 | Stochastic Resonance Improves the Detection of Low Contrast Images in Deep Learning Models | Siegfried Ludwig et.al. | 2502.14442 | null |
2025-02-20 | Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models | Artem Vazhentsev et.al. | 2502.14427 | null |
2025-02-20 | Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2502.14416 | null |
2025-02-20 | QUAD-LLM-MLTC: Large Language Models Ensemble Learning for Healthcare Text Multi-Label Classification | Hajar Sakai et.al. | 2502.14189 | null |
2025-02-19 | Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification | Xuansheng Wu et.al. | 2502.14133 | null |
2025-02-19 | Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention | Omid Nejati Manzari et.al. | 2502.13693 | link |
2025-02-18 | Language Models Can Predict Their Own Behavior | Dhananjay Ashok et.al. | 2502.13329 | null |
2025-02-18 | Performance Evaluation of Sentiment Analysis on Text and Emoji Data Using End-to-End, Transfer Learning, Distributed and Explainable AI Models | Sirisha Velampalli et.al. | 2502.13278 | null |
2025-02-18 | Private Text Generation by Seeding Large Language Model Prompts | Supriya Nagesh et.al. | 2502.13193 | null |
2025-02-18 | RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals | Jaemu Heo et.al. | 2502.13181 | null |
2025-02-18 | Benchmarking MedMNIST dataset on real quantum hardware | Gurinder Singh et.al. | 2502.13056 | null |
2025-02-18 | Likelihood-Ratio Regularized Quantile Regression: Adapting Conformal Prediction to High-Dimensional Covariate Shifts | Sunay Joshi et.al. | 2502.13030 | null |
2025-02-18 | A Survey of Text Classification Under Class Distribution Shift | Adriana Valentina Costache et.al. | 2502.12965 | null |
2025-02-18 | Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text | Andrei Jarca et.al. | 2502.12953 | null |
2025-02-18 | DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Tanzhe Li et.al. | 2502.12627 | null |
2025-02-18 | When Segmentation Meets Hyperspectral Image: New Paradigm for Hyperspectral Image Classification | Weilian Zhou et.al. | 2502.12541 | null |
2025-02-17 | Achieving Upper Bound Accuracy of Joint Training in Continual Learning | Saleh Momeni et.al. | 2502.12388 | null |
2025-02-17 | OCT Data is All You Need: How Vision Transformers with and without Pre-training Benefit Imaging | Zihao Han et.al. | 2502.12379 | null |
2025-02-17 | AdaSplash: Adaptive Sparse Flash Attention | Nuno Gonçalves et.al. | 2502.12082 | null |
2025-02-17 | Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning | Aurian Quelennec et.al. | 2502.12031 | null |
2025-02-17 | Text Classification in the LLM Era - Where do we stand? | Sowmya Vajjala et.al. | 2502.11830 | null |
2025-02-17 | Variable-frame CNNLSTM for Breast Nodule Classification using Ultrasound Videos | Xiangxiang Cui et.al. | 2502.11481 | null |
2025-02-16 | Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification | Thanushon Sivakaran et.al. | 2502.11258 | null |
2025-02-16 | UNITE-FND: Reframing Multimodal Fake News Detection through Unimodal Scene Translation | Arka Mukherjee et.al. | 2502.11132 | null |
2025-02-16 | Towards Achieving Concept Completeness for Unsupervised Textual Concept Bottleneck Models | Milan Bhan et.al. | 2502.11100 | null |
2025-02-16 | Leveraging Large Language Models for Cybersecurity: Enhancing SMS Spam Detection with Robust and Context-Aware Text Classification | Mohsen Ahmadi et.al. | 2502.11014 | null |
2025-02-15 | Simulations of Common Unsupervised Domain Adaptation Algorithms for Image Classification | Ahmad Chaddad et.al. | 2502.10694 | null |
2025-02-15 | REAL: Realism Evaluation of Text-to-Image Generation Models for Effective Data Augmentation | Ran Li et.al. | 2502.10663 | null |
2025-02-14 | Simplifying DINO via Coding Rate Regularization | Ziyang Wu et.al. | 2502.10385 | null |
2025-02-14 | Ocular Disease Classification Using CNN with Deep Convolutional Generative Adversarial Network | Arun Kunwar et.al. | 2502.10334 | null |
2025-02-14 | SeWA: Selective Weight Average via Probabilistic Masking | Peng Wang et.al. | 2502.10119 | null |
2025-02-14 | On Space Folds of ReLU Neural Networks | Michal Lewandowski et.al. | 2502.09954 | null |
2025-02-13 | A CNN Approach to Automated Detection and Classification of Brain Tumors | Md. Zahid Hasan et.al. | 2502.09731 | null |
2025-02-13 | GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis | Angelos Zavras et.al. | 2502.09598 | link |
2025-02-14 | Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering | Mark Beliaev et.al. | 2502.09573 | null |
2025-02-13 | Feature-based Graph Attention Networks Improve Online Continual Learning | Adjovi Sim et.al. | 2502.09143 | null |
2025-02-13 | A Hybrid Model for Few-Shot Text Classification Using Transfer and Meta-Learning | Jia Gao et.al. | 2502.09086 | null |
2025-02-13 | Hierarchical Vision Transformer with Prototypes for Interpretable Medical Image Classification | Luisa Gallée et.al. | 2502.08997 | null |
2025-02-13 | Quantum Approaches for Dysphonia Assessment in Small Speech Datasets | Ha Tran et.al. | 2502.08968 | null |
2025-02-12 | Measuring Diversity in Synthetic Datasets | Yuchang Zhu et.al. | 2502.08512 | null |
2025-02-12 | ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification | Jiangbo Shi et.al. | 2502.08391 | null |
2025-02-12 | Keep your distance: learning dispersed embeddings on |
Evgeniia Tokarchuk et.al. | 2502.08231 | null |
2025-02-12 | Riemannian Complex Hermit Positive Definite Convolution Network for Polarimetric SAR Image Classification | Junfei Shi et.al. | 2502.08137 | null |
2025-02-12 | Knowledge Swapping via Learning and Unlearning | Mingyu Xing et.al. | 2502.08075 | null |
2025-02-12 | Can Machine Learning Support the Selection of Studies for Systematic Literature Review Updates? | Marcelo Costalonga et.al. | 2502.08050 | null |
2025-02-11 | ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans | Ashkan Shahbazi et.al. | 2502.07962 | null |
2025-02-11 | Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers | Zhaodong Bing et.al. | 2502.07436 | null |
2025-02-11 | MoENAS: Mixture-of-Expert based Neural Architecture Search for jointly Accurate, Fair, and Robust Edge Deep Neural Networks | Lotfi Abdelkrim Mecharbat et.al. | 2502.07422 | null |
2025-02-11 | MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification | Anh-Tien Nguyen et.al. | 2502.07409 | null |
2025-02-11 | Don't Just Demo, Teach Me the Principles: A Principle-Based Multi-Agent Prompting Strategy for Text Classification | Peipei Wei et.al. | 2502.07165 | null |
2025-02-10 | From Image to Video: An Empirical Study of Diffusion Representations | Pedro Vélez et.al. | 2502.07001 | null |
2025-02-10 | Krum Federated Chain (KFC): Using blockchain to defend against adversarial attacks in Federated Learning | Mario García-Márquez et.al. | 2502.06917 | null |
2025-02-10 | Enhancing Performance of Explainable AI Models with Constrained Concept Refinement | Geyu Liang et.al. | 2502.06775 | null |
2025-02-10 | Efficient Scientific Full Text Classification: The Case of EICAT Impact Assessments | Marc Felix Brinner et.al. | 2502.06551 | null |
2025-02-10 | Hybrid State-Space and GRU-based Graph Tokenization Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2502.06427 | null |
2025-02-10 | Provably Near-Optimal Federated Ensemble Distillation with Negligible Overhead | Won-Jun Jang et.al. | 2502.06349 | null |
2025-02-10 | From Pixels to Components: Eigenvector Masking for Visual Representation Learning | Alice Bizeul et.al. | 2502.06314 | null |
2025-02-10 | Beyond Batch Learning: Global Awareness Enhanced Domain Adaptation | Lingkun Luo et.al. | 2502.06272 | null |
2025-02-10 | Multi-Scale Transformer Architecture for Accurate Medical Image Classification | Jiacheng Hu et.al. | 2502.06243 | null |
2025-02-10 | Low Tensor-Rank Adaptation of Kolmogorov--Arnold Networks | Yihang Gao et.al. | 2502.06153 | null |
2025-02-09 | Benchmarking Prompt Sensitivity in Large Language Models | Amirhossein Razavi et.al. | 2502.06065 | null |
2025-02-09 | ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification | Yashwanth M. et.al. | 2502.05923 | null |
2025-02-07 | Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights | Ondřej Týbl et.al. | 2502.04975 | null |
2025-02-07 | Enhancing Disinformation Detection with Explainable AI and Named Entity Replacement | Santiago González-Silot et.al. | 2502.04863 | null |
2025-02-07 | AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | Runqing Jiang et.al. | 2502.04628 | null |
2025-02-06 | Augmented Conditioning Is Enough For Effective Training Image Generation | Jiahui Chen et.al. | 2502.04475 | null |
2025-02-06 | How does a Multilingual LM Handle Multiple Languages? | Santhosh Kakarla et.al. | 2502.04269 | null |
2025-02-06 | Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion | Marco Mistretta et.al. | 2502.04263 | null |
2025-02-06 | Expanding Training Data for Endoscopic Phenotyping of Eosinophilic Esophagitis | Juming Xiong et.al. | 2502.04199 | null |
2025-02-06 | Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis | Lin Yuan et.al. | 2502.03843 | null |
2025-02-06 | Self-Supervised Learning for Solar Radio Spectrum Classification | Siqi Li et.al. | 2502.03778 | null |
2025-02-06 | Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free | Gian Mario Favero et.al. | 2502.03687 | null |
2025-02-05 | A Study in Dataset Distillation for Image Super-Resolution | Tobias Dietz et.al. | 2502.03656 | null |
2025-02-05 | Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Indrashis Das et.al. | 2502.03654 | null |
2025-02-05 | Clinically-Inspired Hierarchical Multi-Label Classification of Chest X-rays with a Penalty-Based Loss Function | Mehrdad Asadi et.al. | 2502.03591 | link |
2025-02-05 | Optimal Task Order for Continual Learning of Multiple Tasks | Ziyan Li et.al. | 2502.03350 | null |
2025-02-05 | Out-of-Distribution Detection using Synthetic Data Generation | Momin Abbas et.al. | 2502.03323 | null |
2025-02-05 | Long-tailed Medical Diagnosis with Relation-aware Representation Learning and Iterative Classifier Calibration | Li Pan et.al. | 2502.03238 | null |
2025-02-05 | Adversarial Dependence Minimization | Pierre-François De Plaen et.al. | 2502.03227 | null |
2025-02-05 | Disentangling CLIP Features for Enhanced Localized Understanding | Samyak Rawelekar et.al. | 2502.02977 | null |
2025-02-05 | Slowing Learning by Erasing Simple Features | Lucia Quirke et.al. | 2502.02820 | null |
2025-02-04 | The Skin Game: Revolutionizing Standards for AI Dermatology Model Comparison | Łukasz Miętkiewicz et.al. | 2502.02500 | null |
2025-02-04 | BRIDLE: Generalized Self-supervised Learning with Quantization | Hoang M. Nguyen et.al. | 2502.02118 | null |
2025-02-04 | DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification | Weijia Cao et.al. | 2502.01986 | null |
2025-02-04 | Generative Data Mining with Longtail-Guided Diffusion | David S. Hayden et.al. | 2502.01980 | null |
2025-02-03 | A Multi-Scale Feature Fusion Framework Integrating Frequency Domain and Cross-View Attention for Dual-View X-ray Security Inspections | Shilong Hong et.al. | 2502.01710 | null |
2025-02-03 | Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss | Sangyeon Park et.al. | 2502.01342 | null |
2025-02-03 | A Framework for Double-Blind Federated Adaptation of Foundation Models | Nurbek Tastan et.al. | 2502.01289 | null |
2025-02-02 | Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications | Yixin Wu et.al. | 2502.00808 | null |
2025-02-02 | Enhanced Convolutional Neural Networks for Improved Image Classification | Xiaoran Yang et.al. | 2502.00663 | null |
2025-02-01 | Fast Vision Mamba: Pooling Spatial Dimensions for Accelerated Processing | Saarthak Kapse et.al. | 2502.00594 | null |
2025-01-31 | Redefining Machine Unlearning: A Conformal Prediction-Motivated Approach | Yingdan Shi et.al. | 2501.19403 | null |
2025-01-31 | An All-digital 65-nm Tsetlin Machine Image Classification Accelerator with 8.6 nJ per MNIST Frame at 60.3k Frames per Second | Svein Anders Tunheim et.al. | 2501.19347 | null |
2025-01-31 | Through the Looking Glass: LLM-Based Analysis of AR/VR Android Applications Privacy Policies | Abdulaziz Alghamdi et.al. | 2501.19223 | null |
2025-01-31 | Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification | Xiangyu Sun et.al. | 2501.19086 | null |
2025-01-31 | Memory-Efficient Fine-Tuning of Transformers via Token Selection | Antoine Simoulin et.al. | 2501.18824 | null |
2025-01-30 | OT-Transformer: A Continuous-time Transformer Architecture with Optimal Transport Regularization | Kelvin Kan et.al. | 2501.18793 | null |
2025-01-29 | Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis | Kunrong Li et.al. | 2501.17598 | null |
2025-01-28 | Extending Information Bottleneck Attribution to Video Sequences | Veronika Solopova et.al. | 2501.16889 | link |
2025-01-28 | Misspellings in Natural Language Processing: A survey | Gianluca Sperduti et.al. | 2501.16836 | null |
2025-01-28 | DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging | Muxi Chen et.al. | 2501.16751 | null |
2025-01-28 | Toward Relative Positional Encoding in Spiking Transformers | Changze Lv et.al. | 2501.16745 | null |
2025-01-28 | Improving Interpretability and Accuracy in Neuro-Symbolic Rule Extraction Using Class-Specific Sparse Filters | Parth Padalkar et.al. | 2501.16677 | null |
2025-01-27 | Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM | Payal Kamboj et.al. | 2501.16481 | link |
2025-01-28 | SPECIAL: Zero-shot Hyperspectral Image Classification With CLIP | Li Pang et.al. | 2501.16222 | link |
2025-01-27 | The Linear Attention Resurrection in Vision Transformer | Chuanyang Zheng et.al. | 2501.16182 | null |
2025-01-27 | Enhancing the Convergence of Federated Learning Aggregation Strategies with Limited Data | Judith Sáinz-Pardo Díaz et.al. | 2501.15949 | null |
2025-01-26 | Quantum-Enhanced Attention Mechanism in NLP: A Hybrid Classical-Quantum Approach | S. M. Yousuf Iqbal Tomal et.al. | 2501.15630 | null |
2025-01-26 | Building Efficient Lightweight CNN Models | Nathan Isong et.al. | 2501.15547 | null |
2025-01-26 | Fuzzy-aware Loss for Source-free Domain Adaptation in Visual Emotion Recognition | Ying Zheng et.al. | 2501.15519 | null |
2025-01-26 | Variational Bayesian Adaptive Learning of Deep Latent Variables for Acoustic Knowledge Transfer | Hu Hu et.al. | 2501.15496 | null |
2025-01-25 | Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning | Yu Qiao et.al. | 2501.15257 | null |
2025-01-24 | Feasible Learning | Juan Ramirez et.al. | 2501.14912 | link |
2025-01-24 | Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST | Fuping Wu et.al. | 2501.14685 | null |
2025-01-24 | Geometric Mean Improves Loss For Few-Shot Learning | Tong Wu et.al. | 2501.14593 | null |
2025-01-24 | Idiom Detection in Sorani Kurdish Texts | Skala Kamaran Omer et.al. | 2501.14528 | null |
2025-01-24 | Guobin Shen et.al. | 2501.14484 | null | |
2025-01-24 | Impact of Batch Normalization on Convolutional Network Representations | Hermanus L. Potgieter et.al. | 2501.14441 | null |
2025-01-24 | Quantum Neural Networks: A Comparative Analysis and Noise Robustness Evaluation | Tasnim Ahmed et.al. | 2501.14412 | null |
2025-01-24 | Correlation-Based Band Selection for Hyperspectral Image Classification | Dibyabha Deb et.al. | 2501.14338 | link |
2025-01-24 | Relative Layer-Wise Relevance Propagation: a more Robust Neural Networks eXplaination | Eric Nyiri et.al. | 2501.14322 | null |
2025-01-24 | A Comprehensive Framework for Semantic Similarity Detection Using Transformer Architectures and Enhanced Ensemble Techniques | Lifu Gao et.al. | 2501.14288 | null |
2025-01-24 | TLXML: Task-Level Explanation of Meta-Learning via Influence Functions | Yoshihiro Mitsuka et.al. | 2501.14271 | null |
2025-01-23 | A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference | Duc Hau Nguyen et.al. | 2501.13735 | null |
2025-01-23 | A Transformer-based Autoregressive Decoder Architecture for Hierarchical Text Classification | Younes Yousef et.al. | 2501.13598 | link |
2025-01-23 | Multi-Level Attention and Contrastive Learning for Enhanced Text Classification with an Optimized Transformer | Jia Gao et.al. | 2501.13467 | null |
2025-01-23 | Atmospheric Noise-Resilient Image Classification in a Real-World Scenario: Using Hybrid CNN and Pin-GTSVM | Shlok Mehendale et.al. | 2501.13422 | null |
2025-01-23 | AEON: Adaptive Estimation of Instance-Dependent In-Distribution and Out-of-Distribution Label Noise for Robust Learning | Arpit Garg et.al. | 2501.13389 | null |
2025-01-23 | Multi-aspect Knowledge Distillation with Large Language Model | Taegyeong Lee et.al. | 2501.13341 | null |
2025-01-22 | Revisiting Data Augmentation for Ultrasound Images | Adam Tupper et.al. | 2501.13193 | link |
2025-01-22 | Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation | Duc Hau Nguyen et.al. | 2501.12775 | link |
2025-01-22 | Estimating the Conformal Prediction Threshold from Noisy Labels | Coby Penso et.al. | 2501.12749 | link |
2025-01-22 | Adapting OpenAI's CLIP Model for Few-Shot Image Inspection in Manufacturing Quality Control: An Expository Case Study with Multiple Application Examples | Fadel M. Megahed et.al. | 2501.12596 | null |
2025-01-21 | Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor | Jiaqi Guo et.al. | 2501.12524 | null |
2025-01-21 | CCESAR: Coastline Classification-Extraction From SAR Images Using CNN-U-Net Combination | Vidhu Arora et.al. | 2501.12384 | null |
2025-01-21 | CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification | Cristiano Patrício et.al. | 2501.12266 | null |
2025-01-21 | Early Detection and Classification of Breast Cancer Using Deep Learning Techniques | Mst. Mumtahina Labonno et.al. | 2501.12217 | null |
2025-01-21 | UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model | Branislava Jankovic et.al. | 2501.12087 | null |
2025-01-20 | Communication-Efficient Federated Learning Based on Explanation-Guided Pruning for Remote Sensing Image Classification | Jonas Klotz et.al. | 2501.11493 | null |
2025-01-22 | QGAIC: Quantum Inspired Genetic Algorithm for Image Classification | Akhilesh Kumar Singh et.al. | 2501.11477 | null |
2025-01-20 | GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video | Zhenliang Ni et.al. | 2501.11340 | null |
2025-01-20 | KPL: Training-Free Medical Knowledge Mining of Vision-Language Models | Jiaxiang Liu et.al. | 2501.11231 | link |
2025-01-19 | CLOFAI: A Dataset of Real And Fake Image Classification Tasks for Continual Learning | William Doherty et.al. | 2501.11140 | link |
2025-01-19 | Leveraging counterfactual concepts for debugging and improving CNN model performance | Syed Ali Tariq et.al. | 2501.11087 | null |
2025-01-17 | A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features | Enes Karanfil et.al. | 2501.10144 | null |
2025-01-17 | Classifier Ensemble for Efficient Uncertainty Calibration of Deep Neural Networks for Image Classification | Michael Schulze et.al. | 2501.10089 | null |
2025-01-17 | One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression | Keita Miwa et.al. | 2501.10064 | null |
2025-01-17 | LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Wei Lu et.al. | 2501.10040 | link |
2025-01-16 | Empirical Evaluation of Embedding Models in the Context of Text Classification in Document Review in Construction Delay Disputes | Fusheng Wei et.al. | 2501.09859 | null |
2025-01-16 | SRE-Conv: Symmetric Rotation Equivariant Convolution for Biomedical Image Classification | Yuexi Du et.al. | 2501.09753 | link |
2025-01-16 | Practical Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao et.al. | 2501.09705 | link |
2025-01-16 | Multimodal Marvels of Deep Learning in Medical Diagnosis: A Comprehensive Review of COVID-19 Detection | Md Shofiqul Islama et.al. | 2501.09506 | link |
2025-01-16 | HydraMix: Multi-Image Feature Mixing for Small Data Image Classification | Christoph Reinders et.al. | 2501.09504 | null |
2025-01-16 | Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments | Minh K. Quan et.al. | 2501.09394 | null |
2025-01-16 | Shape-Based Single Object Classification Using Ensemble Method Classifiers | Nur Shazwani Kamarudin et.al. | 2501.09311 | null |
2025-01-16 | Efficient Few-Shot Medical Image Analysis via Hierarchical Contrastive Vision-Language Learning | Harrison Fuller et.al. | 2501.09294 | null |
2025-01-16 | A Simple Graph Contrastive Learning Framework for Short Text Classification | Yonghao Liu et.al. | 2501.09219 | link |
2025-01-16 | Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning | Yonghao Liu et.al. | 2501.09214 | link |
2025-01-15 | Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment | Conrad Borchers et.al. | 2501.09126 | null |
2025-01-15 | IDEA: Image Description Enhanced CLIP-Adapter | Zhipeng Ye et.al. | 2501.08816 | null |
2025-01-15 | MIAFEx: An Attention-based Feature Extraction Method for Medical Image Classification | Oscar Ramos-Soto et.al. | 2501.08562 | null |
2025-01-14 | Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time | Mihai Masala et.al. | 2501.08460 | null |
2025-01-14 | Large Language Models For Text Classification: Case Study And Comprehensive Review | Arina Kostina et.al. | 2501.08457 | null |
2025-01-14 | READ: Reinforcement-based Adversarial Learning for Text Classification with Limited Labeled Data | Rohit Sharma et.al. | 2501.08035 | null |
2025-01-14 | Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins | Ilker Oguz et.al. | 2501.07991 | null |
2025-01-14 | deepTerra -- AI Land Classification Made Easy | Andrew Keith Wilkinson et.al. | 2501.07859 | null |
2025-01-14 | A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition | Mingke Xiao et.al. | 2501.07808 | null |
2025-01-14 | Balance Divergence for Knowledge Distillation | Yafei Qi et.al. | 2501.07804 | null |
2025-01-14 | Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding | Zhaokai Wang et.al. | 2501.07783 | link |
2025-01-13 | Universal Training of Neural Networks to Achieve Bayes Optimal Classification Accuracy | Mohammadreza Tavasoli Naeini et.al. | 2501.07754 | null |
2025-01-13 | Uncertainty Guarantees on Automated Precision Weeding using Conformal Prediction | Paul Melki et.al. | 2501.07185 | null |
2025-01-13 | Adaptive Noise-Tolerant Network for Image Segmentation | Weizhi Li et.al. | 2501.07163 | null |
2025-01-12 | LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier | Haojun Yu et.al. | 2501.06862 | link |
2025-01-12 | Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures | Samia Mehnaz et.al. | 2501.06740 | null |
2025-01-12 | Multi-Label Scene Classification in Remote Sensing Benefits from Image Super-Resolution | Ashitha Mudraje et.al. | 2501.06720 | null |
2025-01-11 | Synthetic Feature Augmentation Improves Generalization Performance of Language Models | Ashok Choudhary et.al. | 2501.06434 | null |
2025-01-10 | Kolmogorov-Arnold networks for metal surface defect classification | Maciej Krzywda et.al. | 2501.06389 | null |
2025-01-10 | Merging Feed-Forward Sublayers for Compressed Transformers | Neha Verma et.al. | 2501.06126 | link |
2025-01-10 | Averaged Adam accelerates stochastic optimization in the training of deep neural network approximations for partial differential equation and optimal control problems | Steffen Dereich et.al. | 2501.06081 | link |
2025-01-10 | Constrained Over-the-Air Model Updating for Wireless Online Federated Learning with Delayed Information | Juncheng Wang et.al. | 2501.05637 | null |
2025-01-10 | The Impact of Model Scaling on Seen and Unseen Language Performance | Rhitabrat Pokharel et.al. | 2501.05629 | null |
2025-01-09 | Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding | Mohammed Elhenawy et.al. | 2501.05566 | null |
2025-01-09 | Spatial Information Integration in Small Language Models for Document Layout Generation and Classification | Pablo Melendez et.al. | 2501.05497 | null |
2025-01-09 | An Empirical Study of Autoregressive Pre-training from Videos | Jathushan Rajasegaran et.al. | 2501.05453 | null |
2025-01-09 | A 1Mb mixed-precision quantized encoder for image classification and patch-based compression | Van Thien Nguyen et.al. | 2501.05097 | null |
2025-01-09 | A CT Image Classification Network Framework for Lung Tumors Based on Pre-trained MobileNetV2 Model and Transfer learning, And Its Application and Market Analysis in the Medical field | Ziyang Gao et.al. | 2501.04996 | null |
2025-01-09 | MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification | Yapeng Li et.al. | 2501.04944 | null |
2025-01-09 | A New Perspective on Privacy Protection in Federated Learning with Granular-Ball Computing | Guannan Lai et.al. | 2501.04940 | link |
2025-01-09 | ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries | Keke Huang et.al. | 2501.04901 | null |
2025-01-09 | Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks | Seyed Amir Bidaki et.al. | 2501.04897 | link |
2025-01-08 | Planarian Neural Networks: Evolutionary Patterns from Basic Bilateria Shaping Modern Artificial Neural Network Architectures | Ziyuan Huang et.al. | 2501.04700 | null |
2025-01-08 | Discrete Wavelet Transform-Based Capsule Network for Hyperspectral Image Classification | Zhiqiang Gao et.al. | 2501.04643 | null |
2025-01-08 | Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing Images | Yuze Wang et.al. | 2501.04283 | null |
2025-01-08 | Comparison of Neural Models for X-ray Image Classification in COVID-19 Detection | Jimi Togni et.al. | 2501.04196 | null |
2025-01-07 | Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video Classification | Satchel French et.al. | 2501.03967 | link |
2025-01-07 | Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback | Jiakang Yuan et.al. | 2501.03916 | null |
2025-01-07 | MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention | Aadya Arora et.al. | 2501.03839 | null |
2025-01-07 | LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging | Shubhr Singh et.al. | 2501.03464 | null |
2025-01-06 | FTA-FTL: A Fine-Tuned Aggregation Federated Transfer Learning Scheme for Lithology Microscopic Image Classification | Keyvan RahimiZadeh et.al. | 2501.03349 | link |
2025-01-06 | CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets | Tanay Agrawal et.al. | 2501.03332 | null |
2025-01-06 | Plant Leaf Disease Detection and Classification Using Deep Learning: A Review and A Proposed System on Bangladesh's Perspective | Md. Jalal Uddin Chowdhury et.al. | 2501.03305 | null |
2025-01-06 | Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning | Muyun Li et.al. | 2501.03162 | null |
2025-01-06 | Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification | Yubo Wang et.al. | 2501.02844 | null |
2025-01-06 | TARDiS : Text Augmentation for Refining Diversity and Separability | Kyungmin Kim et.al. | 2501.02739 | null |
2025-01-05 | FedRSClip: Federated Learning for Remote Sensing Scene Classification Using Vision-Language Models | Hui Lin et.al. | 2501.02461 | null |
2025-01-04 | Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-50 | Umesh Yadav et.al. | 2501.02147 | null |
2025-01-03 | A Separable Self-attention Inspired by the State Space Model for Computer Vision | Juntao Zhang et.al. | 2501.02040 | link |
2025-01-03 | Google is all you need: Semi-Supervised Transfer Learning Strategy For Light Multimodal Multi-Task Classification Model | Haixu Liu et.al. | 2501.01611 | null |
2025-01-02 | Multi-Modal Video Feature Extraction for Popularity Prediction | Haixu Liu et.al. | 2501.01422 | null |
2025-01-02 | A Multi-task Supervised Compression Model for Split Computing | Yoshitomo Matsubara et.al. | 2501.01420 | link |
2025-01-02 | Multi-Head Explainer: A General Framework to Improve Explainability in CNNs and Transformers | Bohang Sun et.al. | 2501.01311 | null |
2025-01-02 | FAST: Fast Audio Spectrogram Transformer | Anugunj Naman et.al. | 2501.01104 | null |
2025-01-01 | A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia | Hirthik Mathesh GV et.al. | 2501.00876 | null |
2025-01-01 | Ensuring superior learning outcomes and data security for authorized learner | Jeongho Bang et.al. | 2501.00754 | null |
2024-12-31 | TSPE: Task-Specific Prompt Ensemble for Improved Zero-Shot Audio Classification | Nishit Anand et.al. | 2501.00398 | null |
2024-12-31 | Exploring Variability in Fine-Tuned Models for Text Classification with DistilBERT | Giuliano Lorenzoni et.al. | 2501.00241 | null |
2024-12-30 | The Text Classification Pipeline: Starting Shallow going Deeper | Marco Siino et.al. | 2501.00174 | null |
2024-12-30 | Text Classification: Neural Networks VS Machine Learning Models VS Pre-trained Models | Christos Petridis et.al. | 2412.21022 | null |
2024-12-30 | FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI | Zhengdong Li et.al. | 2412.20974 | null |
2024-12-30 | Uncertainty-Aware Out-of-Distribution Detection with Gaussian Processes | Yang Chen et.al. | 2412.20918 | null |
2024-12-30 | UniRS: Unifying Multi-temporal Remote Sensing Tasks through Vision Language Models | Yujie Li et.al. | 2412.20742 | null |
2024-12-30 | Improving Acoustic Scene Classification in Low-Resource Conditions | Zhi Chen et.al. | 2412.20722 | null |
2024-12-29 | Hilbert Curve Based Molecular Sequence Analysis | Sarwan Ali et.al. | 2412.20616 | null |
2024-12-29 | A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier | Amit Sarkar et.al. | 2412.20393 | null |
2024-12-29 | HindiLLM: Large Language Model for Hindi | Sanjay Chouhan et.al. | 2412.20357 | null |
2024-12-29 | Deep Learning in Image Classification: Evaluating VGG19's Performance on Complex Visual Data | Weijie He et.al. | 2412.20345 | null |
2024-12-28 | Few-shot Algorithm Assurance | Dang Nguyen et.al. | 2412.20275 | null |
2024-12-27 | Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis | Jiaqi Wang et.al. | 2412.19654 | null |
2024-12-27 | Enhancing Fine-grained Image Classification through Attentive Batch Training | Duy M. Le et.al. | 2412.19606 | null |
2024-12-27 | A Comparative Study of Machine Unlearning Techniques for Image and Text Classification Models | Omar M. Safa et.al. | 2412.19583 | null |
2024-12-27 | Multi-label Classification using Deep Multi-order Context-aware Kernel Networks | Mingyuan Jiu et.al. | 2412.19491 | null |
2024-12-27 | Residual Feature-Reutilization Inception Network for Image Classification | Yuanpeng He et.al. | 2412.19433 | null |
2024-12-27 | An In-Depth Analysis of Adversarial Discriminative Domain Adaptation for Digit Classification | Eugene Choi et.al. | 2412.19391 | link |
2024-12-26 | Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components | Tengxue Zhang et.al. | 2412.19085 | null |
2024-12-26 | Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability | Ruixi Lin et.al. | 2412.19018 | null |
2024-12-25 | Injecting Bias into Text Classification Models using Backdoor Attacks | A. Dilara Yavuz et.al. | 2412.18975 | null |
2024-12-25 | Research Experiment on Multi-Model Comparison for Chinese Text Classification Tasks | JiaCheng Li et.al. | 2412.18908 | null |
2024-12-24 | VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis | Shicheng Yin et.al. | 2412.18178 | link |
2024-12-24 | Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering | Francois Chaubard et.al. | 2412.18052 | null |
2024-12-23 | Explainability in Neural Networks for Natural Language Processing Tasks | Melkamu Mersha et.al. | 2412.18036 | null |
2024-12-23 | COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Learning | Arnav M. Das et.al. | 2412.17684 | null |
2024-12-23 | Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing | Prakash Aryan et.al. | 2412.17548 | link |
2024-12-23 | Domain-Incremental Learning for Audio Classification | Manjunath Mulimani et.al. | 2412.17424 | null |
2024-12-23 | An Experimental Evaluation of Japanese Tokenizers for Sentiment-Based Text Classification | Andre Rusli et.al. | 2412.17361 | link |
2024-12-23 | DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2412.17350 | link |
2024-12-22 | Survey on Abstractive Text Summarization: Dataset, Models, and Metrics | Gospel Ozioma Nnadi et.al. | 2412.17165 | link |
2024-12-22 | LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt Tuning | Fanshuang Kong et.al. | 2412.16963 | link |
2024-12-22 | Predicting the Reliability of an Image Classifier under Image Distortion | Dang Nguyen et.al. | 2412.16881 | null |
2024-12-21 | Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification | Changchang Sun et.al. | 2412.16780 | null |
2024-12-21 | UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning | Long Zhou et.al. | 2412.16739 | link |
2024-12-20 | Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks | Enis Baty et.al. | 2412.16146 | null |
2024-12-20 | Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG | Hasan Md Tusfiqur Alam et.al. | 2412.16086 | link |
2024-12-20 | A Thorough Investigation into the Application of Deep CNN for Enhancing Natural Language Processing Capabilities | Chang Weng et.al. | 2412.15900 | null |
2024-12-20 | Continual Learning Using a Kernel-Based Method Over Foundation Models | Saleh Momeni et.al. | 2412.15571 | link |
2024-12-19 | Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models | Tianchen Zhang et.al. | 2412.15431 | null |
2024-12-19 | Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers | Zhu Liao et.al. | 2412.15077 | null |
2024-12-18 | Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models | Anna Scius-Bertrand et.al. | 2412.13859 | null |
2024-12-18 | Modelling Multi-modal Cross-interaction for ML-FSIC Based on Local Feature Selection | Kun Yan et.al. | 2412.13732 | null |
2024-12-18 | MBInception: A new Multi-Block Inception Model for Enhancing Image Processing Efficiency | Fatemeh Froughirad et.al. | 2412.13703 | null |
2024-12-17 | Identifying Bias in Deep Neural Networks Using Image Transforms | Sai Teja Erukude et.al. | 2412.13079 | link |
2024-12-17 | Token-Level Graphs for Short Text Classification | Gregor Donabauer et.al. | 2412.12754 | link |
2024-12-17 | Your Next State-of-the-Art Could Come from Another Domain: A Cross-Domain Analysis of Hierarchical Text Classification | Nan Li et.al. | 2412.12744 | link |
2024-12-17 | ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries | Wangyu Xue et.al. | 2412.12675 | null |
2024-12-17 | Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation | Dongyue Wu et.al. | 2412.12672 | link |
2024-12-19 | RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification | Guangwenjie Zou et.al. | 2412.12603 | link |
2024-12-17 | Addressing Small and Imbalanced Medical Image Datasets Using Generative Models: A Comparative Study of DDPM and PGGANs with Random and Greedy K Sampling | Iman Khazrak et.al. | 2412.12532 | link |
2024-12-16 | Gramian Multimodal Representation Learning and Alignment | Giordano Cicchetti et.al. | 2412.11959 | null |
2024-12-16 | The Impact of Generalization Techniques on the Interplay Among Privacy, Utility, and Fairness in Image Classification | Ahmad Hassanpour et.al. | 2412.11951 | null |
2024-12-16 | Does VLM Classification Benefit from LLM Description Semantics? | Pingchuan Ma et.al. | 2412.11917 | link |
2024-12-16 | Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning | RunLin Yu et.al. | 2412.11715 | null |
2024-12-16 | LMM-Regularized CLIP Embeddings for Image Classification | Maria Tzelepi et.al. | 2412.11663 | null |
2024-12-16 | Non-Convex Optimization in Federated Learning via Variance Reduction and Adaptive Learning | Dipanwita Thakur et.al. | 2412.11660 | null |
2024-12-16 | CNNtention: Can CNNs do better with Attention? | Julian Glattki et.al. | 2412.11657 | null |
2024-12-16 | Explicit and Implicit Graduated Optimization in Deep Neural Networks | Naoki Sato et.al. | 2412.11501 | link |
2024-12-16 | Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models | Zaifu Zhan et.al. | 2412.11455 | null |
2024-12-16 | Scaled Conjugate Gradient Method for Nonconvex Optimization in Deep Neural Networks | Naoki Sato et.al. | 2412.11400 | null |
2024-12-13 | Robust image classification with multi-modal large language models | Francesco Villani et.al. | 2412.10353 | null |
2024-12-13 | MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization | Shuaiting Li et.al. | 2412.10261 | null |
2024-12-13 | Label-template based Few-Shot Text Classification with Contrastive Learning | Guanghua Hou et.al. | 2412.10110 | null |
2024-12-13 | Data Pruning Can Do More: A Comprehensive Data Pruning Approach for Object Re-identification | Zi Yang et.al. | 2412.10091 | link |
2024-12-13 | Low-Resource Fast Text Classification Based on Intra-Class and Inter-Class Distance Calculation | Yanxu Mao et.al. | 2412.09922 | null |
2024-12-12 | DQA: An Efficient Method for Deep Quantization of Deep Neural Network Activations | Wenhao Hu et.al. | 2412.09687 | null |
2024-12-12 | Embeddings are all you need! Achieving High Performance Medical Image Classification through Training-Free Embedding Analysis | Raj Hansini Khoiwal et.al. | 2412.09445 | null |
2024-12-12 | Learned Compression for Compressed Learning | Dan Jacobellis et.al. | 2412.09405 | link |
2024-12-12 | Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation | Davor Vukadin et.al. | 2412.09311 | link |
2024-12-13 | An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques | Chunxiao Li et.al. | 2412.09063 | null |
2024-12-12 | STEAM: Squeeze and Transform Enhanced Attention Module | Rishabh Sabharwal et.al. | 2412.09023 | null |
2024-12-12 | Stochastic Learning of Non-Conjugate Variational Posterior for Image Classification | Kart-Leong Lim et.al. | 2412.08951 | null |
2024-12-11 | BDA: Bangla Text Data Augmentation Framework | Md. Tariquzzaman et.al. | 2412.08753 | null |
2024-12-11 | Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning | Hang Zhao et.al. | 2412.08587 | null |
2024-12-11 | ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts | Sinan Du et.al. | 2412.08341 | null |
2024-12-11 | Online training and pruning of photonic neural networks | Jiawei Zhang et.al. | 2412.08184 | null |
2024-12-11 | Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Jiaming Lv et.al. | 2412.08139 | null |
2024-12-11 | Concept Bottleneck Large Language Models | Chung-En Sun et.al. | 2412.07992 | link |
2024-12-10 | FastDDS-Based Middleware System for Remote X-Ray Image Classification Using Raspberry Pi | Omar H. Khater et.al. | 2412.07818 | null |
2024-12-10 | Leveraging Content and Context Cues for Low-Light Image Enhancement | Igor Morawski et.al. | 2412.07693 | link |
2024-12-10 | Post-Training Non-Uniform Quantization for Convolutional Neural Networks | Ahmed Luqman et.al. | 2412.07391 | null |
2024-12-10 | Image Classification Using Singular Value Decomposition and Optimization | Isabela M. Yepes et.al. | 2412.07288 | link |
2024-12-10 | An Enhancement of CNN Algorithm for Rice Leaf Disease Image Classification in Mobile Applications | Kayne Uriel K. Rodrigo et.al. | 2412.07182 | null |
2024-12-09 | Convolution goes higher-order: a biologically inspired mechanism empowers image classification | Simone Azeglio et.al. | 2412.06740 | null |
2024-12-09 | Impact of Privacy Parameters on Deep Learning Models for Image Classification | Basanta Chaulagain et.al. | 2412.06689 | null |
2024-12-10 | Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy | Min Zeng et.al. | 2412.06575 | null |
2024-12-09 | How Certain are Uncertainty Estimates? Three Novel Earth Observation Datasets for Benchmarking Uncertainty Quantification in Machine Learning | Yuanyuan Wang et.al. | 2412.06451 | null |
2024-12-09 | Optimizing Multi-Task Learning for Enhanced Performance in Large Language Models | Zhen Qi et.al. | 2412.06249 | null |
2024-12-08 | Hyperspectral Image Spectral-Spatial Feature Extraction via Tensor Principal Component Analysis | Yuemei Ren et.al. | 2412.06075 | null |
2024-12-08 | Vision Transformer-based Semantic Communications With Importance-Aware Quantization | Joohyuk Park et.al. | 2412.06038 | null |
2024-12-06 | Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Hyesu Lim et.al. | 2412.05276 | link |
2024-12-06 | MTSpark: Enabling Multi-Task Learning with Spiking Neural Networks for Generalist Agents | Avaneesh Devkota et.al. | 2412.04847 | null |
2024-12-05 | Grounding Descriptions in Images informs Zero-Shot Visual Recognition | Shaunak Halbe et.al. | 2412.04429 | link |
2024-12-05 | FedDUAL: A Dual-Strategy with Adaptive Loss and Dynamic Aggregation for Mitigating Data Heterogeneity in Federated Learning | Pranab Sahoo et.al. | 2412.04416 | link |
2024-12-05 | Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation | Ilán Carretero et.al. | 2412.04260 | null |
2024-12-05 | Demonstration Selection for In-Context Learning via Reinforcement Learning | Xubin Wang et.al. | 2412.03966 | null |
2024-12-05 | Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task | Alireza Maleki et.al. | 2412.03915 | null |
2024-12-05 | Multisource Collaborative Domain Generalization for Cross-Scene Remote Sensing Image Classification | Zhu Han et.al. | 2412.03897 | null |
2024-12-05 | Dual-Branch Subpixel-Guided Network for Hyperspectral Image Classification | Zhu Han et.al. | 2412.03893 | link |
2024-12-04 | Language Model Meets Prototypes: Towards Interpretable Text Classification Models through Prototypical Networks | Ximing Wen et.al. | 2412.03761 | null |
2024-12-05 | Continual Low-Rank Scaled Dot-product Attention | Ginés Carreto Picón et.al. | 2412.03214 | null |
2024-12-04 | Multi-Level Correlation Network For Few-Shot Image Classification | Yunkai Dang et.al. | 2412.03159 | link |
2024-12-04 | Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection | Prabhat Kc et.al. | 2412.02920 | null |
2024-12-04 | Higher Order Transformers: Efficient Attention Mechanism for Tensor Structured Data | Soroush Omranpour et.al. | 2412.02919 | null |
2024-12-03 | Synergistic Development of Perovskite Memristors and Algorithms for Robust Analog Computing | Nanyang Ye et.al. | 2412.02779 | null |
2024-12-03 | Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning | Zhaozhi Wang et.al. | 2412.02759 | null |
2024-12-03 | Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks | Jinjin Cai et.al. | 2412.02531 | null |
2024-12-04 | GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing | Khawar Islam et.al. | 2412.02366 | null |
2024-12-03 | Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model | Xi Cao et.al. | 2412.02343 | null |
2024-12-03 | Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval | Leah Bar et.al. | 2412.02310 | link |
2024-12-03 | A Classic-Quantum Hybrid Network Framework: CQH-Net | Ao Liu et.al. | 2412.02059 | null |
2024-12-02 | PROFIT: A PROximal FIne Tuning Optimizer for Multi-Task Learning | Anirudh S Chakravarthy et.al. | 2412.01930 | null |
2024-12-02 | Concept Based Continuous Prompts for Interpretable Text Classification | Qian Chen et.al. | 2412.01644 | link |
2024-12-02 | NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers | Angel Yahir Loredo Lopez et.al. | 2412.01621 | null |
2024-12-02 | Explaining the Unexplained: Revealing Hidden Correlations for Better Interpretability | Wen-Dong Jiang et.al. | 2412.01365 | null |
2024-12-02 | Class Distance Weighted Cross Entropy Loss for Classification of Disease Severity | Gorkem Polat et.al. | 2412.01246 | null |
2024-11-29 | LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification | Taja Kuzman et.al. | 2411.19638 | link |
2024-11-29 | FairDD: Fair Dataset Distillation via Synchronized Matching | Qihang Zhou et.al. | 2411.19623 | null |
2024-11-29 | Memristive Nanowire Network for Energy Efficient Audio Classification: Pre-Processing-Free Reservoir Computing with Reduced Latency | Akshaya Rajesh et.al. | 2411.19611 | null |
2024-11-29 | Contextual Checkerboard Denoise -- A Novel Neural Network-Based Approach for Classification-Aware OCT Image Denoising | Md. Touhidul Islam et.al. | 2411.19549 | link |
2024-11-28 | CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections | Mohamed Fazli Imam et.al. | 2411.19346 | link |
2024-11-28 | Quantum Neural Networks in Practice: A Comparative Study with Classical Models from Standard Data Sets to Industrial Images | Daniel Basilewitsch et.al. | 2411.19276 | null |
2024-11-28 | Controlling Participation in Federated Learning with Feedback | Michael Cummins et.al. | 2411.19242 | null |
2024-11-28 | Introducing Three New Benchmark Datasets for Hierarchical Text Classification | Jaco du Toit et.al. | 2411.19119 | null |
2024-11-28 | MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers | Jongseong Bae et.al. | 2411.18995 | null |
2024-11-27 | Fall Leaf Adversarial Attack on Traffic Sign Classification | Anthony Etim et.al. | 2411.18776 | null |
2024-11-27 | Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data | Aoran Shen et.al. | 2411.18622 | null |
2024-11-27 | Pruning Deep Convolutional Neural Network Using Conditional Mutual Information | Tien Vu-Van et.al. | 2411.18578 | null |
2024-11-27 | Mixture of Experts in Image Classification: What's the Sweet Spot? | Mathurin Videau et.al. | 2411.18322 | null |
2024-11-27 | KANs for Computer Vision: An Experimental Study | Karthik Mohan et.al. | 2411.18224 | null |
2024-11-27 | Spectral-Spatial Transformer with Active Transfer Learning for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2411.18115 | link |
2024-11-27 | Vision Mamba Distillation for Low-resolution Fine-grained Image Classification | Yao Chen et.al. | 2411.17980 | link |
2024-11-27 | Optimized Tradeoffs for Private Prediction with Majority Ensembling | Shuli Jiang et.al. | 2411.17965 | null |
2024-11-26 | What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics | Jordan J. Bird et.al. | 2411.17593 | null |
2024-11-26 | TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Xiaowen Ma et.al. | 2411.17473 | link |
2024-11-26 | SpikeAtConv: An Integrated Spiking-Convolutional Attention Architecture for Energy-Efficient Neuromorphic Vision Processing | Wangdan Liao et.al. | 2411.17439 | null |
2024-11-26 | CoA: Chain-of-Action for Generative Semantic Labels | Meng Wei et.al. | 2411.17406 | link |
2024-11-26 | BadScan: An Architectural Backdoor Attack on Visual State Space Models | Om Suhas Deshmukh et.al. | 2411.17283 | null |
2024-11-26 | An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models | Yunzhe Hu et.al. | 2411.17182 | null |
2024-11-25 | Contrastive Multi-graph Learning with Neighbor Hierarchical Sifting for Semi-supervised Text Classification | Wei Ai et.al. | 2411.16787 | null |
2024-11-25 | A Supervised Machine Learning Approach for Assessing Grant Peer Review Reports | Gabriel Okasa et.al. | 2411.16662 | link |
2024-11-25 | Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models | Donggeun Ko et.al. | 2411.16079 | null |
2024-11-24 | Context-Aware Detection of Mixed Critical Events using Video Classification | Filza Akhlaq et.al. | 2411.15773 | null |
2024-11-23 | MUNBa: Machine Unlearning via Nash Bargaining | Jing Wu et.al. | 2411.15537 | null |
2024-11-23 | Twin Trigger Generative Networks for Backdoor Attacks against Object Detection | Zhiying Li et.al. | 2411.15439 | null |
2024-11-22 | MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs | Chaoyou Fu et.al. | 2411.15296 | null |
2024-11-21 | CODE-CL: COnceptor-Based Gradient Projection for DEep Continual Learning | Marco Paul E. Apolinario et.al. | 2411.15235 | null |
2024-11-21 | BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models | Taha Koleilat et.al. | 2411.15232 | null |
2024-11-22 | FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification | Zhengrui Guo et.al. | 2411.14743 | link |
2024-11-21 | Adaptable Embeddings Network (AEN) | Stan Loosmore et.al. | 2411.13786 | null |
2024-11-20 | Hierarchical Text Classification (HTC) vs. eXtreme Multilabel Classification (XML): Two Sides of the Same Medal | Nerijus Bertalis et.al. | 2411.13687 | link |
2024-11-20 | Combining Autoregressive and Autoencoder Language Models for Text Classification | João Gonçalves et.al. | 2411.13282 | link |
2024-11-20 | MEGL: Multimodal Explanation-Guided Learning | Yifei Zhang et.al. | 2411.13053 | null |
2024-11-19 | Problem-dependent convergence bounds for randomized linear gradient compression | Thomas Flynn et.al. | 2411.12898 | null |
2024-11-19 | Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs | Ahmed Akib Jawad Karim et.al. | 2411.12712 | null |
2024-11-22 | STREAM: A Universal State-Space Model for Sparse Geometric Data | Mark Schöne et.al. | 2411.12603 | null |
2024-11-19 | AdaCM |
Yuanbin Man et.al. | 2411.12593 | null |
2024-11-19 | Zero-Shot Crate Digging: DJ Tool Retrieval Using Speech Activity, Music Structure And CLAP Embeddings | Iroro Orife et.al. | 2411.12209 | link |
2024-11-19 | Invariant Shape Representation Learning For Image Classification | Tonmoy Hossain et.al. | 2411.12201 | link |
2024-11-19 | Self-Supervised Learning in Deep Networks: A Pathway to Robust Few-Shot Classification | Yuyang Xiao et.al. | 2411.12151 | null |
2024-11-18 | Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning | Arundhati S. Shanbhag et.al. | 2411.12073 | link |
2024-11-18 | Vision Language Models Are Few-Shot Audio Spectrogram Classifiers | Satvik Dixit et.al. | 2411.12058 | null |
2024-11-18 | Fair Distillation: Teaching Fairness from Biased Teachers in Medical Imaging | Milad Masroor et.al. | 2411.11939 | null |
2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
2024-11-16 | MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map | Yuhong Chou et.al. | 2411.10741 | null |
2024-11-16 | Diagnostic Text-guided Representation Learning in Hierarchical Classification for Pathological Whole Slide Image | Jiawen Li et.al. | 2411.10709 | null |
2024-11-16 | Multi-perspective Contrastive Logit Distillation | Qi Wang et.al. | 2411.10693 | null |
2024-11-15 | Vision Eagle Attention: A New Lens for Advancing Image Classification | Mahmudul Hasan et.al. | 2411.10564 | link |
2024-11-15 | On the Cost of Model-Serving Frameworks: An Experimental Evaluation | Pasquale De Rosa et.al. | 2411.10337 | null |
2024-11-15 | Embedding Byzantine Fault Tolerance into Federated Learning via Virtual Data-Driven Consistency Scoring Plugin | Youngjoon Lee et.al. | 2411.10212 | link |
2024-11-15 | Outliers resistant image classification by anomaly detection | Anton Sergeev et.al. | 2411.10150 | null |
2024-11-15 | Adapting the Biological SSVEP Response to Artificial Neural Networks | Emirhan Böge et.al. | 2411.10084 | null |
2024-11-15 | Evidential Federated Learning for Skin Lesion Image Classification | Rutger Hendrix et.al. | 2411.10071 | null |
2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
2024-11-14 | ResidualDroppath: Enhancing Feature Reuse over Residual Connections | Sejik Park et.al. | 2411.09475 | null |
2024-11-14 | SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers | Shravan Venkatraman et.al. | 2411.09420 | null |
2024-11-14 | Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery | Ashim Dahal et.al. | 2411.09101 | link |
2024-11-13 | Computed tomography using meta-optics | Maksym Zhelyeznuyakov et.al. | 2411.08995 | null |
2024-11-13 | CoCoP: Enhancing Text Classification with LLM through Code Completion Prompt | Mohammad Mahdi Mohajeri et.al. | 2411.08979 | null |
2024-11-13 | ScaleNet: Scale Invariance Learning in Directed Graphs | Qin Jiang et.al. | 2411.08758 | link |
2024-11-13 | Efficient Whole Slide Image Classification through Fisher Vector Representation | Ravi Kant Gupta et.al. | 2411.08530 | null |
2024-11-12 | HMIL: Hierarchical Multi-Instance Learning for Fine-Grained Whole Slide Image Classification | Cheng Jin et.al. | 2411.07660 | null |
2024-11-12 | Semantic segmentation on multi-resolution optical and microwave data using deep learning | Jai G Singla et.al. | 2411.07581 | null |
2024-11-11 | The Inherent Adversarial Robustness of Analog In-Memory Computing | Corey Lammie et.al. | 2411.07023 | null |
2024-11-11 | ScaleKD: Strong Vision Transformers Could Be Excellent Teachers | Jiawei Fan et.al. | 2411.06786 | link |
2024-11-11 | A Text Classification Model Combining Adversarial Training with Pre-trained Language Model and neural networks: A Case Study on Telecom Fraud Incident Texts | Liu Zhuoxian et.al. | 2411.06772 | null |
2024-11-11 | Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision | Yueyang Cang et.al. | 2411.06727 | null |
2024-11-10 | Deep Active Learning in the Open World | Tian Xie et.al. | 2411.06353 | null |
2024-11-09 | Clustering Algorithms and RAG Enhancing Semi-Supervised Text Classification with Large LLMs | Shan Zhong et.al. | 2411.06175 | null |
2024-11-09 | AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems | Zhiyu Zhu et.al. | 2411.06146 | null |
2024-11-09 | Exploring Structural Nonlinearity in Binary Polariton-Based Neuromorphic Architectures | Evgeny Sedov et.al. | 2411.06124 | null |
2024-11-09 | Mutual-energy inner product optimization method for constructing feature coordinates and image classification in Machine Learning | Yuanxiu Wang et.al. | 2411.06100 | null |
2024-11-08 | GUIDEQ: Framework for Guided Questioning for progressive informational collection and classification | Priya Mishra et.al. | 2411.05991 | link |
2024-11-08 | FisherMask: Enhancing Neural Network Labeling Efficiency in Image Classification Using Fisher Information | Shreen Gul et.al. | 2411.05752 | link |
2024-11-08 | Visual-TCAV: Concept-based Attribution and Saliency Maps for Post-hoc Explainability in Image Classification | Antonio De Santis et.al. | 2411.05698 | null |
2024-11-08 | Efficient Audio-Visual Fusion for Video Classification | Mahrukh Awan et.al. | 2411.05603 | null |
2024-11-08 | Training objective drives the consistency of representational similarity across datasets | Laure Ciernik et.al. | 2411.05561 | link |
2024-11-08 | Estimating the Influence of Sequentially Correlated Literary Properties in Textual Classification: A Data-Centric Hypothesis-Testing Approach | Gideon Yoffe et.al. | 2411.04950 | null |
2024-11-07 | Attention Masks Help Adversarial Attacks to Bypass Safety Detectors | Yunfan Shi et.al. | 2411.04772 | link |
2024-11-07 | Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks | Sanja Karilanova et.al. | 2411.04760 | null |
2024-11-07 | Is network fragmentation a useful complexity measure? | Coenraad Mouton et.al. | 2411.04695 | null |
2024-11-07 | DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models | Zijian Zhang et.al. | 2411.04649 | null |
2024-11-07 | Neural Fingerprints for Adversarial Attack Detection | Haim Fisher et.al. | 2411.04533 | link |
2024-11-06 | Multimodal Structure-Aware Quantum Data Processing | Hala Hawashin et.al. | 2411.04242 | null |
2024-11-06 | RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models | Maya Varma et.al. | 2411.04097 | link |
2024-11-06 | Overcoming label shift in targeted federated learning | Edvin Listo Zec et.al. | 2411.03799 | null |
2024-11-06 | Deferred Poisoning: Making the Model More Vulnerable via Hessian Singularization | Yuhao He et.al. | 2411.03752 | null |
2024-11-05 | Judge Like a Real Doctor: Dual Teacher Sample Consistency Framework for Semi-supervised Medical Image Classification | Zhang Qixiang et.al. | 2411.03041 | null |
2024-11-06 | Confidence Calibration of Classifiers with Many Classes | Adrien LeCoz et.al. | 2411.02988 | link |
2024-11-05 | Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization | Pengkun Jiao et.al. | 2411.02920 | null |
2024-11-05 | ADOPT: Modified Adam Can Converge with Any |
Shohei Taniguchi et.al. | 2411.02853 | link |
2024-11-05 | Integrated lithium niobate photonic computing circuit based on efficient and high-speed electro-optic conversion | Yaowen Hu et.al. | 2411.02734 | null |
2024-11-06 | Wave Network: An Ultra-Small Language Model | Xin Zhang et.al. | 2411.02674 | null |
2024-11-04 | FUSECAPS: Investigating Feature Fusion Based Framework for Capsule Endoscopy Image Classification | Bidisha Chakraborty et.al. | 2411.02637 | null |
2024-11-04 | TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives | Maitreya Patel et.al. | 2411.02545 | null |
2024-11-04 | A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification | Sorouralsadat Fatemi et.al. | 2411.02476 | null |
2024-11-04 | Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models | Sharat Agarwal et.al. | 2411.01925 | null |
2024-11-03 | Optimizing Gastrointestinal Diagnostics: A CNN-Based Model for VCE Image Classification | Vaneeta Ahlawat et.al. | 2411.01652 | null |
2024-11-03 | ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis | Xinyu Geng et.al. | 2411.01564 | null |
2024-11-03 | Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision | Xiangzhong Luo et.al. | 2411.01431 | null |
2024-11-02 | Combining Financial Data and News Articles for Stock Price Movement Prediction Using Large Language Models | Ali Elahi et.al. | 2411.01368 | null |
2024-11-02 | Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks | Aarjav Kavathia et.al. | 2411.01348 | null |
2024-11-02 | MIC: Medical Image Classification Using Chest X-ray (COVID-19 and Pneumonia) Dataset with the Help of CNN and Customized CNN | Nafiz Fahad et.al. | 2411.01163 | null |
2024-11-02 | Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement | Bryan Bo Cao et.al. | 2411.01099 | link |
2024-11-01 | Towards Robust Text Classification: Mitigating Spurious Correlations with Causal Learning | Yuqing Zhou et.al. | 2411.01045 | null |
2024-11-01 | FISHing in Uncertainty: Synthetic Contrastive Learning for Genetic Aberration Detection | Simon Gutwein et.al. | 2411.01025 | link |
2024-10-31 | Video Token Merging for Long-form Video Understanding | Seon-Ho Lee et.al. | 2410.23782 | null |
2024-10-31 | Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2 | Weijie Ke et.al. | 2410.23776 | null |
2024-10-31 | QUEST-A: Untrained Filtering with Trained Focusing led to Enhanced Quantum Architectures | Lian-Hui Yu et.al. | 2410.23560 | link |
2024-11-01 | Large Language Models for Patient Comments Multi-Label Classification | Hajar Sakai et.al. | 2410.23528 | null |
2024-10-30 | Multilingual Vision-Language Pre-training for the Remote Sensing Domain | João Daniel Silva et.al. | 2410.23370 | null |
2024-10-30 | Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks | Axel Klawonn et.al. | 2410.23359 | null |
2024-10-30 | CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP | Tianyu Yang et.al. | 2410.23330 | null |
2024-10-30 | Don't Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification | Debjyoti Saharoy et.al. | 2410.23066 | null |
2024-10-30 | Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers | Lam Nguyen Tung et.al. | 2410.22663 | null |
2024-10-29 | Developing Convolutional Neural Networks using a Novel Lamarckian Co-Evolutionary Algorithm | Zaniar Sharifi et.al. | 2410.22487 | null |
2024-10-29 | EfficientNet with Hybrid Attention Mechanisms for Enhanced Breast Histopathology Classification: A Comprehensive Approach | Naren Sengodan et.al. | 2410.22392 | null |
2024-10-29 | DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers | Rakesh R. Menon et.al. | 2410.22239 | null |
2024-10-29 | Class-Aware Contrastive Optimization for Imbalanced Text Classification | Grigorii Khvatskii et.al. | 2410.22197 | null |
2024-10-29 | Active Learning for Vision-Language Models | Bardia Safaei et.al. | 2410.22187 | null |
2024-10-29 | Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets | Adrian Iordache et.al. | 2410.22184 | link |
2024-10-29 | Natural Language Processing for Analyzing Electronic Health Records and Clinical Notes in Cancer Research: A Review | Muhammad Bilal et.al. | 2410.22180 | null |
2024-10-29 | FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection | Dat Nguyen et.al. | 2410.21964 | null |
2024-10-29 | Bayesian Optimization for Hyperparameters Tuning in Neural Networks | Gabriele Onorato et.al. | 2410.21886 | null |
2024-10-29 | Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning | Yinyi Lai et.al. | 2410.21872 | null |
2024-10-28 | Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks | Noel Elias et.al. | 2410.21561 | null |
2024-10-30 | A Novel Score-CAM based Denoiser for Spectrographic Signature Extraction without Ground Truth | Noel Elias et.al. | 2410.21557 | null |
2024-10-28 | Attacking Misinformation Detection Using Adversarial Examples Generated by Language Models | Piotr Przybyła et.al. | 2410.20940 | null |
2024-10-28 | Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning | Bing Han et.al. | 2410.20775 | null |
2024-10-28 | Interpretable Image Classification with Adaptive Prototype-based Vision Transformers | Chiyu Ma et.al. | 2410.20722 | null |
2024-10-27 | Graph Neural Networks on Discriminative Graphs of Words | Yassine Abbahaddou et.al. | 2410.20469 | null |
2024-10-27 | Historical Test-time Prompt Tuning for Vision Foundation Models | Jingyi Zhang et.al. | 2410.20346 | null |
2024-10-27 | Sequential Large Language Model-Based Hyper-Parameter Optimization | Kanan Mahammadli et.al. | 2410.20302 | link |
2024-10-26 | Enhancing CNN Classification with Lamarckian Memetic Algorithms and Local Search | Akhilbaran Ghosh et.al. | 2410.20234 | null |
2024-10-26 | Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits | Adit Jain et.al. | 2410.20041 | null |
2024-10-26 | Attacks against Abstractive Text Summarization Models through Lead Bias and Influence Functions | Poojitha Thota et.al. | 2410.20019 | null |
2024-10-26 | Vulnerability of LLMs to Vertically Aligned Text Manipulations | Zhecheng Li et.al. | 2410.20016 | null |
2024-10-25 | Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective | Ethan Harvey et.al. | 2410.19675 | null |
2024-10-24 | Noise Adaption Network for Morse Code Image Classification | Xiaxia Wang et.al. | 2410.19180 | link |
2024-10-24 | Hybrid Quantum-Classical Feature Extraction approach for Image Classification using Autoencoders and Quantum SVMs | Donovan Slabbert et.al. | 2410.18814 | null |
2024-10-24 | Spatial-Temporal Search for Spiking Neural Networks | Kaiwei Che et.al. | 2410.18580 | null |
2024-10-25 | Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | Lehan Wang et.al. | 2410.18387 | null |
2024-10-23 | Using Cartesian slice plots of a cosmological simulation as input of a convolutional neural network | Guillermo Arreaga-Garcia et.al. | 2410.18320 | null |
2024-10-25 | Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing | Dongliang Guo et.al. | 2410.18267 | null |
2024-10-23 | Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction | Nicholas Walker et.al. | 2410.18160 | null |
2024-10-23 | Deep Learning for Active Region Classification: A Systematic Study from Convolutional Neural Networks to Vision Transformers | Edoardo Legnaro et.al. | 2410.17816 | null |
2024-10-23 | New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture | Ach. Khozaimi et.al. | 2410.17735 | null |
2024-10-24 | Advancing Interpretability in Text Classification through Prototype Learning | Bowen Wei et.al. | 2410.17546 | null |
2024-10-23 | Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning | Jun-En Ding et.al. | 2410.17494 | null |
2024-10-22 | Data Obfuscation through Latent Space Projection (LSP) for Privacy-Preserving AI Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection | Mahesh Vaijainthymala Krishnamoorthy et.al. | 2410.17459 | null |
2024-10-22 | Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu et.al. | 2410.17251 | null |
2024-10-22 | KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements | Md Meftahul Ferdaus et.al. | 2410.17172 | link |
2024-10-22 | Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification | Ganga Prasad Basyal et.al. | 2410.16711 | null |
2024-10-21 | Efficient Neural Network Training via Subset Pretraining | Jan Spörer et.al. | 2410.16523 | null |
2024-10-21 | 1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification | Ram Mohan Rao Kadiyala et.al. | 2410.15998 | null |
2024-10-21 | Visual Representation Learning Guided By Multi-modal Prior Knowledge | Hongkuan Zhou et.al. | 2410.15981 | null |
2024-10-21 | AutoTrain: No-code training for state-of-the-art models | Abhishek Thakur et.al. | 2410.15735 | link |
2024-10-21 | ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts | Xumeng Han et.al. | 2410.15732 | null |
2024-10-21 | P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | Mohamed R. Elshamy et.al. | 2410.15602 | null |
2024-10-20 | Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Yusuke Hosoya et.al. | 2410.15315 | link |
2024-10-19 | Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion | Chaodong Xiao et.al. | 2410.15091 | link |
2024-10-19 | PAT: Parameter-Free Audio-Text Aligner to Boost Zero-Shot Audio Classification | Ashish Seth et.al. | 2410.15062 | null |
2024-10-19 | Weakly-supervised diagnosis identification from Italian discharge letters | Vittorio Torri et.al. | 2410.15051 | null |
2024-10-19 | Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation | Seulbi Lee et.al. | 2410.14975 | null |
2024-10-18 | A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification | Maksuda Akter et.al. | 2410.14536 | null |
2024-10-18 | Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation | Shuai Zhao et.al. | 2410.14425 | link |
2024-10-18 | A Novel Method to Metigate Demographic and Expert Bias in ICD Coding with Causal Inference | Bin Zhang et.al. | 2410.14236 | null |
2024-10-18 | Comparative Evaluation of Clustered Federated Learning Method | Michael Ben Ali et.al. | 2410.14212 | link |
2024-10-17 | Reproducibility study of "LICO: Explainable Models with Language-Image Consistency" | Luan Fletcher et.al. | 2410.13989 | link |
2024-10-17 | LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning | Yiming Shi et.al. | 2410.13618 | link |
2024-10-17 | Augmentation Policy Generation for Image Classification Using Large Language Models | Ant Duru et.al. | 2410.13453 | null |
2024-10-17 | Similarity-Dissimilarity Loss with Supervised Contrastive Learning for Multi-label Classification | Guangming Huang et.al. | 2410.13439 | null |
2024-10-16 | Interpreting and Analyzing CLIP's Zero-Shot Image Classification via Mutual Knowledge | Fawaz Sammani et.al. | 2410.13016 | link |
2024-10-16 | PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network | Asish Bera et.al. | 2410.12742 | null |
2024-10-16 | Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals | Orchid Chetia Phukan et.al. | 2410.12645 | null |
2024-10-17 | From Measurement Instruments to Data: Leveraging Theory-Driven Synthetic Training Data for Classifying Social Constructs | Lukas Birkenmaier et.al. | 2410.12622 | null |
2024-10-16 | Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | Yong Zhang et.al. | 2410.12396 | null |
2024-10-15 | Clustering doc2vec output for topic-dimensionality reduction: A MITRE ATT&CK calibration | Nathan Monnet et.al. | 2410.11573 | null |
2024-10-15 | LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models | Hossein Abdi et.al. | 2410.11551 | null |
2024-10-15 | Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning | Minoo Jafarlou et.al. | 2410.11355 | null |
2024-10-14 | Towards a More Complete Theory of Function Preserving Transforms | Michael Painter et.al. | 2410.11038 | null |
2024-10-14 | Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning | Etai Littwin et.al. | 2410.10773 | null |
2024-10-15 | Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation | Yosuke Yamagishi et.al. | 2410.10710 | link |
2024-10-14 | Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification | Jiaxiang Gou et.al. | 2410.10573 | null |
2024-10-14 | Dynamic Power Control in a Hardware Neural Network with Error-Configurable MAC Units | Maedeh Ghaderi et.al. | 2410.10545 | null |
2024-10-14 | Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks | Xinyue Liu et.al. | 2410.10454 | link |
2024-10-14 | GlobalMamba: Global Image Serialization for Vision Mamba | Chengkun Wang et.al. | 2410.10316 | link |
2024-10-14 | A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets | Nikolaos Mylonas et.al. | 2410.10290 | null |
2024-10-14 | big.LITTLE Vision Transformer for Efficient Visual Recognition | He Guo et.al. | 2410.10267 | null |
2024-10-14 | SkillAggregation: Reference-free LLM-Dependent Aggregation | Guangzhi Sun et.al. | 2410.10215 | null |
2024-10-14 | Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models? | Zeliang Zhang et.al. | 2410.10160 | null |
2024-10-11 | Efficient Hyperparameter Importance Assessment for CNNs | Ruinan Wang et.al. | 2410.08920 | null |
2024-10-11 | Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning | Nusrat Jahan Prottasha et.al. | 2410.08598 | null |
2024-10-11 | DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Nguyen Huu Bao Long et.al. | 2410.08582 | link |
2024-10-11 | Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks | Yiyue Chen et.al. | 2410.08508 | null |
2024-10-11 | Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | Eunji Kim et.al. | 2410.08469 | null |
2024-10-10 | Bilinear MLPs enable weight-based mechanistic interpretability | Michael T. Pearce et.al. | 2410.08417 | null |
2024-10-10 | What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Aida Mohammadshahi et.al. | 2410.08407 | null |
2024-10-10 | Time Traveling to Defend Against Adversarial Example Attacks in Image Classification | Anthony Etim et.al. | 2410.08338 | null |
2024-10-10 | More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing | Sagi Shaier et.al. | 2410.08003 | null |
2024-10-10 | When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections | Keryan Chelouche et.al. | 2410.07689 | null |
2024-10-10 | Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks | Minxing Zhang et.al. | 2410.07670 | null |
2024-10-10 | StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models | Minchan Kwon et.al. | 2410.07652 | null |
2024-10-10 | Explainability of Deep Neural Networks for Brain Tumor Detection | S. Park et.al. | 2410.07613 | link |
2024-10-10 | CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features | Po-han Li et.al. | 2410.07610 | null |
2024-10-09 | One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation | Fabian Paischer et.al. | 2410.07170 | link |
2024-10-09 | JPEG Inspired Deep Learning | Ahmed H. Salamah et.al. | 2410.07081 | link |
2024-10-09 | Optimizing Estimators of Squared Calibration Errors in Classification | Sebastian G. Gruber et.al. | 2410.07014 | null |
2024-10-09 | Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks | Friedrich Wolf-Monheim et.al. | 2410.06927 | null |
2024-10-09 | QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Fei Xie et.al. | 2410.06806 | null |
2024-10-09 | Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization | Prateek Varshney et.al. | 2410.06567 | null |
2024-10-08 | A Comparative Study of Hybrid Models in Health Misinformation Text Classification | Mkululi Sikosana et.al. | 2410.06311 | null |
2024-10-08 | Conformal Structured Prediction | Botong Zhang et.al. | 2410.06296 | link |
2024-10-08 | TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data | Jeremy Andrew Irvin et.al. | 2410.06234 | null |
2024-10-08 | Manual Verbalizer Enrichment for Few-Shot Text Classification | Quang Anh Nguyen et.al. | 2410.06173 | null |
2024-10-07 | LoTLIP: Improving Language-Image Pre-training for Long Text Understanding | Wei Wu et.al. | 2410.05249 | null |
2024-10-07 | Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge | Senorita Deb et.al. | 2410.05189 | null |
2024-10-07 | IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification | Yan He et.al. | 2410.05100 | null |
2024-10-07 | Explanation sensitivity to the randomness of large language models: the case of journalistic text classification | Jeremie Bogaert et.al. | 2410.05085 | null |
2024-10-07 | Control-oriented Clustering of Visual Latent Representation | Han Qi et.al. | 2410.05063 | null |
2024-10-07 | SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification | Benjamin Feuer et.al. | 2410.05057 | link |
2024-10-07 | Art Forgery Detection using Kolmogorov Arnold and Convolutional Neural Networks | Sandro Boccuzzo et.al. | 2410.04866 | null |
2024-10-06 | MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network | Doanh C. Bui et.al. | 2410.04507 | null |
2024-10-06 | Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification | Zhaorui Tan et.al. | 2410.04492 | link |
2024-10-05 | IT |
Nikita Durasov et.al. | 2410.04201 | null |
2024-10-04 | Classification-Denoising Networks | Louis Thiry et.al. | 2410.03505 | null |
2024-10-04 | A Multimodal Framework for Deepfake Detection | Kashish Gandhi et.al. | 2410.03487 | null |
2024-10-04 | On Uncertainty In Natural Language Processing | Dennis Ulmer et.al. | 2410.03446 | link |
2024-10-04 | Comparing zero-shot self-explanations with human rationales in multilingual text classification | Stephanie Brandl et.al. | 2410.03296 | null |
2024-10-04 | Sm: enhanced localization in Multiple Instance Learning for medical imaging classification | Francisco M. Castro-Macías et.al. | 2410.03276 | null |
2024-10-04 | Selective Transformer for Hyperspectral Image Classification | Yichu Xu et.al. | 2410.03171 | null |
2024-10-03 | CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification | Jinghao Shi et.al. | 2410.03038 | null |
2024-10-03 | On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions | Huy Nguyen et.al. | 2410.02935 | null |
2024-10-03 | Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups | Zakhar Shumaylov et.al. | 2410.02698 | null |
2024-10-03 | LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model | Duy M. H. Nguyen et.al. | 2410.02615 | null |
2024-10-03 | Personalized Quantum Federated Learning for Privacy Image Classification | Jinjing Shi et.al. | 2410.02547 | null |
2024-10-03 | BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning | Gustav Wagner Zakarias et.al. | 2410.02387 | null |
2024-10-03 | CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration | Thomas Buddenkotte et.al. | 2410.02316 | link |
2024-10-03 | Hard Negative Sample Mining for Whole Slide Image Classification | Wentao Huang et.al. | 2410.02212 | link |
2024-10-02 | Kolmogorov-Arnold Network Autoencoders | Mohammadamin Moradi et.al. | 2410.02077 | link |
2024-10-02 | Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data | Sreyan Ghosh et.al. | 2410.02056 | null |
2024-10-02 | FLAG: Financial Long Document Classification via AMR-based GNN | Bolun et.al. | 2410.02024 | link |
2024-10-02 | MONICA: Benchmarking on Long-tailed Medical Image Classification | Lie Ju et.al. | 2410.02010 | null |
2024-10-02 | Revisiting Hierarchical Text Classification: Inference and Metrics | Roman Plaud et.al. | 2410.01305 | link |
2024-10-02 | Automatic deductive coding in discourse analysis: an application of large language models in learning analytics | Lishan Zhang et.al. | 2410.01240 | null |
2024-10-01 | Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time | Chiao-An Yang et.al. | 2410.01083 | link |
2024-10-01 | Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading | Mostafa Hajighasemloua et.al. | 2410.00779 | null |
2024-10-01 | NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models | Chi-Sheng Chen et.al. | 2410.00712 | null |
2024-10-01 | TikGuard: A Deep Learning Transformer-Based Solution for Detecting Unsuitable TikTok Content for Kids | Mazen Balat et.al. | 2410.00403 | null |
2024-09-30 | KPCA-CAM: Visual Explainability of Deep Computer Vision Models using Kernel PCA | Sachin Karmani et.al. | 2410.00267 | null |
2024-09-30 | A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification | Marina Ribeiro et.al. | 2410.00250 | null |
2024-09-30 | Evaluating the performance of state-of-the-art esg domain-specific pre-trained large language models in text classification against existing models and traditional machine learning techniques | Tin Yuet Chung et.al. | 2410.00207 | null |
2024-10-02 | Evaluating the fairness of task-adaptive pretraining on unlabeled test data before few-shot text classification | Kush Dubey et.al. | 2410.00179 | link |
2024-09-30 | POMONAG: Pareto-Optimal Many-Objective Neural Architecture Generator | Eugenio Lomurno et.al. | 2409.20447 | null |
2024-09-30 | Satellite image classification with neural quantum kernels | Pablo Rodriguez-Grasa et.al. | 2409.20356 | null |
2024-09-30 | All-optical autoencoder machine learning framework using diffractive processors | Peijie Feng et.al. | 2409.20346 | null |
2024-09-30 | Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients | Youssef Allouah et.al. | 2409.20329 | null |
2024-09-30 | Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | Shalini Sarode et.al. | 2409.20237 | null |
2024-09-30 | Classification of Radiological Text in Small and Imbalanced Datasets in a Non-English Language | Vincent Beliveau et.al. | 2409.20147 | null |
2024-09-30 | SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers | Nick Nikzad et.al. | 2409.19850 | null |
2024-09-29 | Adversarial Examples for DNA Classification | Hyunwoo Yoo et.al. | 2409.19788 | null |
2024-09-29 | FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification | Kexue Fu et.al. | 2409.19720 | null |
2024-09-29 | Vision-Language Models are Strong Noisy Label Detectors | Tong Wei et.al. | 2409.19696 | link |
2024-09-27 | Unconditional stability of a recurrent neural circuit implementing divisive normalization | Shivang Rawat et.al. | 2409.18946 | null |
2024-09-27 | Subspace Preserving Quantum Convolutional Neural Network Architectures | Léo Monbroussou et.al. | 2409.18918 | null |
2024-09-27 | Med-IC: Fusing a Single Layer Involution with Convolutions for Enhanced Medical Image Classification and Segmentation | Md. Farhadul Islam et.al. | 2409.18506 | null |
2024-09-26 | Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective | Yu Wang et.al. | 2409.18316 | null |
2024-09-26 | Realistic Evaluation of Model Merging for Compositional Generalization | Derek Tam et.al. | 2409.18314 | null |
2024-09-26 | DARE: Diverse Visual Question Answering with Robustness Evaluation | Hannah Sterz et.al. | 2409.18023 | null |
2024-09-26 | The Lou Dataset -- Exploring the Impact of Gender-Fair Language in German Text Classification | Andreas Waldis et.al. | 2409.17929 | null |
2024-09-26 | Cascade Prompt Learning for Vision-Language Model Adaptation | Ge Wu et.al. | 2409.17805 | null |
2024-09-26 | Byzantine-Robust Aggregation for Securing Decentralized Federated Learning | Diego Cajaraville-Aboy et.al. | 2409.17754 | null |
2024-09-26 | Let the Quantum Creep In: Designing Quantum Neural Network Models by Gradually Swapping Out Classical Components | Peiyong Wang et.al. | 2409.17583 | link |
2024-09-26 | Leveraging Annotator Disagreement for Text Classification | Jin Xu et.al. | 2409.17577 | null |
2024-09-26 | Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE | Xun Zhu et.al. | 2409.17508 | null |
2024-09-26 | Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification | Guanyi Mou et.al. | 2409.17474 | null |
2024-09-26 | Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models | Yuqing Zhou et.al. | 2409.17455 | null |
2024-09-25 | Block Expanded DINORET: Adapting Natural Domain Foundation Models for Retinal Imaging Without Catastrophic Forgetting | Jay Zoellin et.al. | 2409.17332 | null |
2024-09-25 | BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices | Yongqi Xu et.al. | 2409.17093 | link |
2024-09-25 | Accumulator-Aware Post-Training Quantization | Ian Colbert et.al. | 2409.17092 | null |
2024-09-26 | HVT: A Comprehensive Vision Framework for Learning in Non-Euclidean Space | Jacob Fein-Ashley et.al. | 2409.16897 | link |
2024-09-25 | Shifting from endangerment to rebirth in the Artificial Intelligence Age: An Ensemble Machine Learning Approach for Hawrami Text Classification | Aram Khaksar et.al. | 2409.16884 | null |
2024-09-25 | Explicitly Modeling Pre-Cortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness | Lucas Piper et.al. | 2409.16838 | link |
2024-09-24 | Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification | Leire Benito-Del-Valle et.al. | 2409.16002 | link |
2024-09-24 | An ensemble framework approach of hybrid Quantum convolutional neural networks for classification of breast cancer images | Dibyasree Guha et.al. | 2409.15958 | null |
2024-09-24 | iGAiVA: Integrated Generative AI and Visual Analytics in a Machine Learning Workflow for Text Classification | Yuanzhe Jin et.al. | 2409.15848 | link |
2024-09-23 | Optimizing News Text Classification with Bi-LSTM and Attention Mechanism for Efficient Data Processing | Bingyao Liu et.al. | 2409.15576 | null |
2024-09-23 | Critic Loss for Image Classification | Brendan Hogan Rappazzo et.al. | 2409.15565 | null |
2024-09-23 | VLMine: Long-Tail Data Mining with Vision Language Models | Mao Ye et.al. | 2409.15486 | null |
2024-09-23 | HydroVision: LiDAR-Guided Hydrometric Prediction with Vision Transformers and Hybrid Graph Learning | Naghmeh Shafiee Roudbari et.al. | 2409.15213 | null |
2024-09-23 | Benchmarking Edge AI Platforms for High-Performance ML Inference | Rakshith Jayanth et.al. | 2409.14803 | null |
2024-09-23 | Less yet robust: crucial region selection for scene recognition | Jianqi Zhang et.al. | 2409.14741 | null |
2024-09-22 | Low-Light Enhancement Effect on Classification and Detection: An Empirical Study | Xu Wu et.al. | 2409.14461 | null |
2024-09-18 | Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes | Nikita Kiselev et.al. | 2409.11995 | link |
2024-09-18 | Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | Jin Jie Sean Yeo et.al. | 2409.11964 | null |
2024-09-18 | Agglomerative Token Clustering | Joakim Bruslund Haurum et.al. | 2409.11923 | null |
2024-09-18 | Distillation-free Scaling of Large SSMs for Images and Videos | Hamid Suleman et.al. | 2409.11867 | null |
2024-09-18 | Community Shaping in the Digital Age: A Temporal Fusion Framework for Analyzing Discourse Fragmentation in Online Social Networks | Amirhossein Dezhboro et.al. | 2409.11665 | null |
2024-09-18 | Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images | A. A. G. Yogi Pramana et.al. | 2409.11644 | null |
2024-09-18 | Hyperspectral Image Classification Based on Faster Residual Multi-branch Spiking Neural Network | Yang Liu et.al. | 2409.11619 | null |
2024-09-17 | Multi-Cohort Framework with Cohort-Aware Attention and Adversarial Mutual-Information Minimization for Whole Slide Image Classification | Sharon Peled et.al. | 2409.11119 | null |
2024-09-17 | Anti-ESIA: Analyzing and Mitigating Impacts of Electromagnetic Signal Injection Attacks | Denglin Kang et.al. | 2409.10922 | null |
2024-09-16 | Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks? | Kaleb Kassaw et.al. | 2409.10775 | null |
2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | null |
2024-09-16 | InfoDisent: Explainability of Image Classification Models by Information Disentanglement | Łukasz Struski et.al. | 2409.10329 | null |
2024-09-16 | Enhancing Image Classification in Small and Unbalanced Datasets through Synthetic Data Augmentation | Neil De La Fuente et.al. | 2409.10286 | null |
2024-09-15 | Finetuning CLIP to Reason about Pairwise Differences | Dylan Sam et.al. | 2409.09721 | null |
2024-09-15 | Compositional Audio Representation Learning | Sripathi Sridhar et.al. | 2409.09619 | null |
2024-09-14 | One missing piece in Vision and Language: A Survey on Comics Understanding | Emanuele Vivoli et.al. | 2409.09502 | link |
2024-09-14 | Real-world Adversarial Defense against Patch Attacks based on Diffusion Model | Xingxing Wei et.al. | 2409.09406 | null |
2024-09-14 | Turbo your multi-modal classification with contrastive learning | Zhiyu Zhang et.al. | 2409.09282 | null |
2024-09-14 | Leveraging Foundation Models for Efficient Federated Learning in Resource-restricted Edge Networks | S. Kawa Atapour et.al. | 2409.09273 | null |
2024-09-13 | ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds | Sreyan Ghosh et.al. | 2409.09213 | link |
2024-09-13 | Pushing the boundaries of event subsampling in event-based video classification using CNNs | Hesam Araghi et.al. | 2409.08953 | link |
2024-09-13 | Pushing Joint Image Denoising and Classification to the Edge | Thomas C Markhorst et.al. | 2409.08943 | null |
2024-09-13 | Byzantine-Robust and Communication-Efficient Distributed Learning via Compressed Momentum Filtering | Changxin Liu et.al. | 2409.08640 | null |
2024-09-13 | Anytime Continual Learning for Open Vocabulary Classification | Zhen Zhu et.al. | 2409.08518 | link |
2024-09-12 | Enhancing Few-Shot Image Classification through Learnable Multi-Scale Embedding and Attention Mechanisms | Fatemeh Askari et.al. | 2409.07989 | link |
2024-09-12 | Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters | Shun Zou et.al. | 2409.07896 | link |
2024-09-12 | Classifying Images with CoLaNET Spiking Neural Network -- the MNIST Example | Mikhail Kiselev et.al. | 2409.07833 | null |
2024-09-12 | Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption | Zhizheng Lai et.al. | 2409.07751 | null |
2024-09-12 | DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning | Kangyang Luo et.al. | 2409.07734 | null |
2024-09-12 | Cooperative Inference with Interleaved Operator Partitioning for CNNs | Zhibang Liu et.al. | 2409.07693 | null |
2024-09-11 | Token Turing Machines are Efficient Vision Models | Purvish Jajal et.al. | 2409.07613 | null |
2024-09-11 | Minimizing Embedding Distortion for Robust Out-of-Distribution Performance | Tom Shaked et.al. | 2409.07582 | null |
2024-09-11 | A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks | Erik B. Terres-Escudero et.al. | 2409.07387 | null |
2024-09-11 | Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding | Ronald Katende et.al. | 2409.07310 | null |
2024-09-11 | LLM-based feature generation from text for interpretable machine learning | Vojtěch Balek et.al. | 2409.07132 | null |
2024-09-11 | Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator | Kangyang Luo et.al. | 2409.06955 | null |
2024-09-10 | Dynamic Decoupling of Placid Terminal Attractor-based Gradient Descent Algorithm | Jinwei Zhao et.al. | 2409.06542 | null |
2024-09-10 | Seam Carving as Feature Pooling in CNN | Mohammad Imrul Jubair et.al. | 2409.06311 | null |
2024-09-10 | EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification | Suorong Yang et.al. | 2409.06290 | link |
2024-09-09 | A Small Claims Court for the NLP: Judging Legal Text Classification Strategies With Small Datasets | Mariana Yukari Noguti et.al. | 2409.05972 | null |
2024-09-09 | SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values | Chengwei Sun et.al. | 2409.05926 | null |
2024-09-09 | Adversarial Attacks on Data Attribution | Xinhe Wang et.al. | 2409.05657 | null |
2024-09-09 | Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition | Shiming Ge et.al. | 2409.05384 | null |
2024-09-09 | RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU | Chengyuan Liu et.al. | 2409.05275 | null |
2024-09-09 | Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space | Junho Lee et.al. | 2409.05260 | null |
2024-09-08 | PatchAlign:Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels | Aayushman et.al. | 2409.04975 | link |
2024-09-07 | Activation Function Optimization Scheme for Image Classification | Abdur Rahman et.al. | 2409.04915 | null |
2024-09-07 | LoCa: Logit Calibration for Knowledge Distillation | Runming Yang et.al. | 2409.04778 | null |
2024-09-07 | Swin Transformer for Robust Differentiation of Real and Synthetic Images: Intra- and Inter-Dataset Analysis | Preetu Mehta et.al. | 2409.04734 | null |
2024-09-06 | Connectivity-Inspired Network for Context-Aware Recognition | Gianluca Carloni et.al. | 2409.04360 | null |
2024-09-06 | An optically accelerated extreme learning machine using hot atomic vapors | Pierre Azam et.al. | 2409.04312 | null |
2024-09-06 | PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation | Tianqi Wei et.al. | 2409.04038 | null |
2024-09-05 | Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning | Isaac Ray et.al. | 2409.03938 | null |
2024-09-05 | WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking | Carl De Sousa Trias et.al. | 2409.03902 | null |
2024-09-05 | On-board Satellite Image Classification for Earth Observation: A Comparative Study of Pre-Trained Vision Transformer Models | Thanh-Dung Le et.al. | 2409.03901 | null |
2024-09-05 | Have Large Vision-Language Models Mastered Art History? | Ombretta Strafforello et.al. | 2409.03521 | null |
2024-09-05 | Non-Uniform Illumination Attack for Fooling Convolutional Neural Networks | Akshay Jain et.al. | 2409.03458 | link |
2024-09-05 | Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications | Tong Bu et.al. | 2409.03368 | null |
2024-09-05 | PEPL: Precision-Enhanced Pseudo-Labeling for Fine-Grained Image Classification in Semi-Supervised Learning | Bowen Tian et.al. | 2409.03192 | null |
2024-09-05 | The AdEMAMix Optimizer: Better, Faster, Older | Matteo Pagliardini et.al. | 2409.03137 | null |
2024-09-04 | iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation | Hayeon Jo et.al. | 2409.02838 | null |
2024-09-03 | MedUnA: Language guided Unsupervised Adaptation of Vision-Language Models for Medical Image Classification | Umaima Rahman et.al. | 2409.02729 | null |
2024-09-05 | OpenFact at CheckThat! 2024: Combining Multiple Attack Methods for Effective Adversarial Text Generation | Włodzimierz Lewoniewski et.al. | 2409.02649 | null |
2024-09-04 | Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization | Cho-Ying Wu et.al. | 2409.02486 | null |
2024-09-03 | Evaluation and Comparison of Visual Language Models for Transportation Engineering Problems | Sanjita Prajapati et.al. | 2409.02278 | null |
2024-09-05 | Robust Clustering on High-Dimensional Data with Stochastic Quantization | Anton Kozyriev et.al. | 2409.02066 | link |
2024-09-03 | Compressed learning based onboard semantic compression for remote sensing platforms | Protim Bhattacharjee et.al. | 2409.01988 | null |
2024-09-03 | State-of-the-art Advances of Deep-learning Linguistic Steganalysis Research | Yihao Wang et.al. | 2409.01780 | null |
2024-09-03 | Enhancing Fine-Grained Visual Recognition in the Low-Data Regime Through Feature Magnitude Regularization | Avraham Chapman et.al. | 2409.01672 | null |
2024-09-03 | ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition | Shiting Xiao et.al. | 2409.01564 | null |
2024-08-30 | Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain | Francesca Grasso et.al. | 2408.17362 | link |
2024-08-30 | Covariance-corrected Whitening Alleviates Network Degeneration on Imbalanced Classification | Zhiwei Zhang et.al. | 2408.17197 | null |
2024-08-30 | Improving Extraction of Clinical Event Contextual Properties from Electronic Health Records: A Comparative Study | Shubham Agarwal et.al. | 2408.17181 | null |
2024-09-02 | Instant Adversarial Purification with Adversarial Consistency Distillation | Chun Tong Lei et.al. | 2408.17064 | null |
2024-08-30 | Generative Modeling Perspective for Control and Reasoning in Robotics | Takuma Yoneda et.al. | 2408.17041 | null |
2024-08-29 | Tex-ViT: A Generalizable, Robust, Texture-based dual-branch cross-attention deepfake detector | Deepak Dagar et.al. | 2408.16892 | null |
2024-08-29 | SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection | Rohit Venkata Sai Dulam et.al. | 2408.16645 | null |
2024-08-29 | Android Malware Detection Based on RGB Images and Multi-feature Fusion | Zhiqiang Wang et.al. | 2408.16555 | null |
2024-08-29 | SAU: A Dual-Branch Network to Enhance Long-Tailed Recognition via Generative Models | Guangxi Li et.al. | 2408.16273 | link |
2024-08-29 | Improving Diffusion-based Data Augmentation with Inversion Spherical Interpolation | Yanghao Wang et.al. | 2408.16266 | null |
2024-08-29 | Low Saturation Confidence Distribution-based Test-Time Adaptation for Cross-Domain Remote Sensing Image Classification | Yu Liang et.al. | 2408.16265 | null |
2024-08-28 | EMP: Enhance Memory in Data Pruning | Jinying Xiao et.al. | 2408.16031 | null |
2024-08-28 | Local Descriptors Weighted Adaptive Threshold Filtering For Few-Shot Learning | Bingchen Yan et.al. | 2408.15924 | null |
2024-08-28 | ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation | Tiantian Feng et.al. | 2408.15803 | null |
2024-08-28 | Visual Prompt Engineering for Medical Vision Language Models in Radiology | Stefan Denner et.al. | 2408.15802 | null |
2024-08-28 | Harnessing the Intrinsic Knowledge of Pretrained Language Models for Challenging Text Classification Settings | Lingyu Gao et.al. | 2408.15650 | null |
2024-08-27 | DCT-CryptoNets: Scaling Private Inference in the Frequency Domain | Arjun Roy et.al. | 2408.15231 | null |
2024-08-27 | A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships | Gracile Astlin Pereira et.al. | 2408.15178 | null |
2024-08-28 | AnomalousPatchCore: Exploring the Use of Anomalous Samples in Industrial Anomaly Detection | Mykhailo Koshil et.al. | 2408.15113 | null |
2024-08-27 | Data downlink prioritization using image classification on-board a 6U CubeSat | Keenan A. A. Chatar et.al. | 2408.14865 | null |
2024-08-27 | Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification | Yiqiang Cai et.al. | 2408.14862 | null |
2024-08-27 | Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification | Sirui Li et.al. | 2408.14770 | null |
2024-08-26 | On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise | M. Reza Eslami et.al. | 2408.14680 | null |
2024-08-26 | Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification | Mahrukh Awan et.al. | 2408.14441 | null |
2024-08-26 | Uncertainties of Latent Representations in Computer Vision | Michael Kirchhof et.al. | 2408.14281 | null |
2024-08-26 | MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification | Feng Gao et.al. | 2408.14255 | null |
2024-08-26 | Feature Aligning Few shot Learning Method Using Local Descriptors Weighted Rules | Bingchen Yan et.al. | 2408.14192 | null |
2024-08-26 | GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets | Sven Oehri et.al. | 2408.14131 | null |
2024-08-25 | Few-Shot Histopathology Image Classification: Evaluating State-of-the-Art Methods and Unveiling Performance Insights | Ardhendu Sekhar et.al. | 2408.13816 | null |
2024-08-25 | On the Robustness of Kolmogorov-Arnold Networks: An Adversarial Perspective | Tal Alter et.al. | 2408.13809 | null |
2024-08-25 | Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion | Xu Zhang et.al. | 2408.13744 | link |
2024-08-25 | 3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification | Haizhao Jing et.al. | 2408.13728 | null |
2024-08-24 | Enhanced Astronomical Source Classification with Integration of Attention Mechanisms and Vision Transformers | Srinadh Reddy Bhavanam et.al. | 2408.13634 | null |
2024-08-23 | Domain-specific long text classification from sparse relevant information | Célia D'Cruz et.al. | 2408.13253 | null |
2024-08-23 | EAViT: External Attention Vision Transformer for Audio Classification | Aquib Iqbal et.al. | 2408.13201 | null |
2024-08-23 | A gradient system based on anisotropic monochrome image processing with orientation auto-adjustment | Harbir Antil et.al. | 2408.12847 | null |
2024-08-23 | Underwater SONAR Image Classification and Analysis using LIME-based Explainable Artificial Intelligence | Purushothaman Natarajan et.al. | 2408.12837 | null |
2024-08-23 | VALE: A Multimodal Visual and Language Explanation Framework for Image Classifiers using eXplainable AI and Language Models | Purushothaman Natarajan et.al. | 2408.12808 | null |
2024-08-23 | BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models | Yige Li et.al. | 2408.12798 | null |
2024-08-23 | Semi-Supervised Variational Adversarial Active Learning via Learning to Rank and Agreement-Based Pseudo Labeling | Zongyao Lyu et.al. | 2408.12774 | null |
2024-08-23 | Symmetric masking strategy enhances the performance of Masked Image Modeling | Khanh-Binh Nguyen et.al. | 2408.12772 | null |
2024-08-22 | ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation | Lujia Zhong et.al. | 2408.12561 | link |
2024-08-22 | The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design | Artem Snegirev et.al. | 2408.12503 | null |
2024-08-22 | Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification | Sudi Murindanyi et.al. | 2408.12426 | null |
2024-08-22 | AT-SNN: Adaptive Tokens for Vision Transformer on Spiking Neural Network | Donghwa Kang et.al. | 2408.12293 | null |
2024-08-22 | Whole Slide Image Classification of Salivary Gland Tumours | John Charlton et.al. | 2408.12275 | null |
2024-08-22 | Query-Efficient Video Adversarial Attack with Stylized Logo | Duoxun Tang et.al. | 2408.12099 | null |
2024-08-21 | Approaching Deep Learning through the Spectral Dynamics of Weights | David Yunis et.al. | 2408.11804 | link |
2024-08-21 | SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance | Zhiqiang Wu et.al. | 2408.11760 | null |
2024-08-21 | Improving Calibration by Relating Focal Loss, Temperature Scaling, and Properness | Viacheslav Komisarenko et.al. | 2408.11598 | link |
2024-08-21 | MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Minghao Han et.al. | 2408.11505 | null |
2024-08-21 | Enabling Small Models for Zero-Shot Classification through Model Label Learning | Jia Zhang et.al. | 2408.11449 | null |
2024-08-21 | Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond | Minghao Liu et.al. | 2408.11338 | null |
2024-08-21 | Towards Evaluating Large Language Models on Sarcasm Understanding | Yazhou Zhang et.al. | 2408.11319 | null |
2024-08-20 | Privacy-preserving Universal Adversarial Defense for Black-box Models | Qiao Li et.al. | 2408.10647 | null |
2024-08-20 | A Tutorial on Explainable Image Classification for Dementia Stages Using Convolutional Neural Network and Gradient-weighted Class Activation Mapping | Kevin Kam Fung Yuen et.al. | 2408.10572 | null |
2024-08-20 | NoMatterXAI: Generating "No Matter What" Alterfactual Examples for Explaining Black-Box Text Classification Models | Tuc Nguyen et.al. | 2408.10528 | null |
2024-08-20 | Cervical Cancer Detection Using Multi-Branch Deep Learning Model | Tatsuhiro Baba et.al. | 2408.10498 | null |
2024-08-19 | HaSPeR: An Image Repository for Hand Shadow Puppet Recognition | Syed Rifat Raiyan et.al. | 2408.10360 | link |
2024-08-19 | Leveraging Superfluous Information in Contrastive Representation Learning | Xuechu Yu et.al. | 2408.10292 | null |
2024-08-19 | SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Anke Tang et.al. | 2408.10174 | link |
2024-08-19 | Towards Robust Federated Image Classification: An Empirical Study of Weight Selection Strategies in Manufacturing | Vinit Hegiste et.al. | 2408.10024 | null |
2024-08-19 | Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis | Kira Maag et.al. | 2408.10021 | null |
2024-08-19 | Active Learning for Identifying Disaster-Related Tweets: A Comparison with Keyword Filtering and Generic Fine-Tuning | David Hanny et.al. | 2408.09914 | null |
2024-08-19 | Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health Questions | Sebastian Heineking et.al. | 2408.09831 | null |
2024-08-19 | AutoML-guided Fusion of Entity and LLM-based representations | Boshko Koloski et.al. | 2408.09794 | null |
2024-08-19 | Dataset Distillation for Histopathology Image Classification | Cong Cong et.al. | 2408.09709 | null |
2024-08-19 | A Strategy to Combine 1stGen Transformers and Open LLMs for Automatic Text Classification | Claudio M. V. de Andrade et.al. | 2408.09629 | null |
2024-08-18 | Attention Is Not What You Need: Revisiting Multi-Instance Learning for Whole Slide Image Classification | Xin Liu et.al. | 2408.09449 | null |
2024-08-17 | Narrowing the Focus: Learned Optimizers for Pretrained Models | Gus Kristiansen et.al. | 2408.09310 | null |
2024-08-16 | DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models | Eman Ali et.al. | 2408.08855 | null |
2024-08-16 | LEVIS: Large Exact Verifiable Input Spaces for Neural Networks | Mohamad Fares El Hajj Chehade et.al. | 2408.08824 | null |
2024-08-16 | Leveraging FourierKAN Classification Head for Pre-Trained Transformer-based Text Classification | Abdullah Al Imran et.al. | 2408.08803 | null |
2024-08-16 | Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers | Zihang Song et.al. | 2408.08794 | null |
2024-08-16 | Quantum convolutional neural networks for jet images classification | Hala Elhag et.al. | 2408.08701 | null |
2024-08-16 | MM-UNet: A Mixed MLP Architecture for Improved Ophthalmic Image Segmentation | Zunjie Xiao et.al. | 2408.08600 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-16 | Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness | Hefei Mei et.al. | 2408.08502 | link |
2024-08-15 | Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention | Zohaib Khan et.al. | 2408.08454 | null |
2024-08-15 | Predictive uncertainty estimation in deep learning for lung carcinoma classification in digital pathology under real dataset shifts | Abdur R. Fayjie et.al. | 2408.08432 | null |
2024-08-15 | SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training | Gengwei Zhang et.al. | 2408.08295 | link |
2024-08-15 | Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices | Tess Watt et.al. | 2408.08215 | null |
2024-08-15 | Towards flexible perception with visual memory | Robert Geirhos et.al. | 2408.08172 | null |
2024-08-15 | Category-Prompt Refined Feature Learning for Long-Tailed Multi-Label Image Classification | Jiexuan Yan et.al. | 2408.08125 | link |
2024-08-15 | HAIR: Hypernetworks-based All-in-One Image Restoration | Jin Cao et.al. | 2408.08091 | link |
2024-08-14 | Large Language Models Prompting With Episodic Memory | Dai Do et.al. | 2408.07465 | null |
2024-08-14 | Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks | Raghavendra Singh et.al. | 2408.07243 | null |
2024-08-13 | Efficient Search for Customized Activation Functions with Gradient Descent | Lukas Strack et.al. | 2408.06820 | link |
2024-08-13 | Do Vision-Language Foundational models show Robust Visual Perception? | Shivam Chandhok et.al. | 2408.06781 | link |
2024-08-13 | Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model | Yongcheng Li et.al. | 2408.06716 | link |
2024-08-13 | Coherence Awareness in Diffractive Neural Networks | Matan Kleiner et.al. | 2408.06681 | null |
2024-08-12 | Is it a work or leisure travel? Applying text classification to identify work-related travel on social networks | Lucas Félix et.al. | 2408.06341 | null |
2024-08-12 | Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance | Manuel Milling et.al. | 2408.06264 | null |
2024-08-12 | Deep Learning System Boundary Testing through Latent Space Style Mixing | Amr Abdellatif et.al. | 2408.06258 | null |
2024-08-12 | Global-to-Local Support Spectrums for Language Model Explainability | Lucas Agussurja et.al. | 2408.05976 | null |
2024-08-12 | A Simple Task-aware Contrastive Local Descriptor Selection Strategy for Few-shot Learning between inter class and intra class | Qian Qiao et.al. | 2408.05953 | null |
2024-08-12 | Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information | Mingkun Zhang et.al. | 2408.05900 | null |
2024-08-11 | HiLight: A Hierarchy-aware Light Global Model with Hierarchical Local ConTrastive Learning | Zhijian Chen et.al. | 2408.05786 | null |
2024-08-11 | PRECISe : Prototype-Reservation for Explainable Classification under Imbalanced and Scarce-Data Settings | Vaibhav Ganatra et.al. | 2408.05754 | null |
2024-08-11 | Disposable-key-based image encryption for collaborative learning of Vision Transformer | Rei Aso et.al. | 2408.05737 | null |
2024-08-11 | A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation | Koushik Biswas et.al. | 2408.05692 | null |
2024-08-09 | A conformalized learning of a prediction set with applications to medical imaging classification | Roy Hirsch et.al. | 2408.05037 | null |
2024-08-09 | Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks | Verna Dankers et.al. | 2408.04965 | null |
2024-08-09 | LiD-FL: Towards List-Decodable Federated Learning | Hong Liu et.al. | 2408.04963 | null |
2024-08-09 | In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Dahyun Kang et.al. | 2408.04961 | link |
2024-08-08 | Enhanced Prototypical Part Network (EPPNet) For Explainable Image Classification Via Prototypes | Bhushan Atote et.al. | 2408.04606 | null |
2024-08-08 | SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals | Haoran Zheng et.al. | 2408.04575 | null |
2024-08-08 | An experimental comparative study of backpropagation and alternatives for training binary neural networks for image classification | Ben Crulis et.al. | 2408.04460 | null |
2024-08-08 | Dual-branch PolSAR Image Classification Based on GraphMAE and Local Feature Extraction | Yuchen Wang et.al. | 2408.04294 | null |
2024-08-07 | FMiFood: Multi-modal Contrastive Learning for Food Image Classification | Xinyue Pan et.al. | 2408.03922 | null |
2024-08-07 | Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning | Simret Araya Gebreegziabher et.al. | 2408.03819 | null |
2024-08-07 | Intuitionistic Fuzzy Cognitive Maps for Interpretable Image Classification | Georgia Sovatzidi et.al. | 2408.03745 | null |
2024-08-07 | CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Tianfang Zhang et.al. | 2408.03703 | link |
2024-08-07 | Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks | Jaewook Lee et.al. | 2408.03663 | null |
2024-08-07 | Making Robust Generalizers Less Rigid with Soft Ascent-Descent | Matthew J. Holland et.al. | 2408.03619 | null |
2024-08-06 | AI Foundation Models in Remote Sensing: A Survey | Siqi Lu et.al. | 2408.03464 | null |
2024-08-06 | Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments | Angie Boggust et.al. | 2408.03274 | null |
2024-08-06 | A Debiased Nearest Neighbors Framework for Multi-Label Text Classification | Zifeng Cheng et.al. | 2408.03202 | null |
2024-08-06 | Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi | Pranita Deshmukh et.al. | 2408.03172 | null |
2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et.al. | 2408.03046 | null |
2024-08-06 | L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization | Elvys Linhares Pontes et.al. | 2408.03033 | null |
2024-08-06 | Adversarial Robustness of Open-source Text Classification Models and Fine-Tuning Chains | Hao Qin et.al. | 2408.02963 | null |
2024-08-06 | Dual-View Pyramid Pooling in Deep Neural Networks for Improved Medical Image Classification and Confidence Calibration | Xiaoqing Zhang et.al. | 2408.02906 | null |
2024-08-05 | Interpretation of the Intent Detection Problem as Dynamics in a Low-dimensional Space | Eduardo Sanchez-Karhunen et.al. | 2408.02838 | null |
2024-08-05 | Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services | Shaopeng Fu et.al. | 2408.02814 | null |
2024-08-05 | FPT+: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification | Yijin Huang et.al. | 2408.02426 | null |
2024-08-05 | On the Robustness of Malware Detectors to Adversarial Samples | Muhammad Salman et.al. | 2408.02310 | null |
2024-08-05 | Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution | Hojung Lee et.al. | 2408.02307 | null |
2024-08-05 | Network Fission Ensembles for Low-Cost Self-Ensembles | Hojung Lee et.al. | 2408.02301 | null |
2024-08-04 | VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces | Somnath Sendhil Kumar et.al. | 2408.02140 | null |
2024-08-04 | DeMansia: Mamba Never Forgets Any Tokens | Ricky Fang et.al. | 2408.01986 | null |
2024-08-06 | A Survey and Evaluation of Adversarial Attacks for Object Detection | Khoi Nguyen Tiet Nguyen et.al. | 2408.01934 | null |
2024-08-03 | Safe Semi-Supervised Contrastive Learning Using In-Distribution Data as Positive Examples | Min Gu Kwak et.al. | 2408.01872 | null |
2024-08-03 | LAM3D: Leveraging Attention for Monocular 3D Object Detection | Diana-Alexandra Sas et.al. | 2408.01739 | null |
2024-08-02 | Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder | Matan Atad et.al. | 2408.01571 | null |
2024-08-02 | Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2408.01372 | null |
2024-08-02 | WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2408.01231 | null |
2024-08-02 | Multi-head Spatial-Spectral Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2408.01224 | null |
2024-08-02 | Rethinking Pre-trained Feature Extractor Selection in Multiple Instance Learning for Whole Slide Image Classification | Bryan Wong et.al. | 2408.01167 | null |
2024-08-01 | CERT-ED: Certifiably Robust Text Classification for Edit Distance | Zhuoqun Huang et.al. | 2408.00728 | null |
2024-08-01 | Deep Learning in Medical Image Classification from MRI-based Brain Tumor Images | Xiaoyi Liu et.al. | 2408.00636 | null |
2024-08-01 | DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation | Rakshith Subramanyam et.al. | 2408.00331 | null |
2024-07-31 | Vera Verto: Multimodal Hijacking Attack | Minxing Zhang et.al. | 2408.00129 | null |
2024-07-31 | Learning Video Context as Interleaved Multimodal Sequences | Kevin Qinghong Lin et.al. | 2407.21757 | null |
2024-07-30 | Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation | Marcelo Matheus Gauy et.al. | 2407.20989 | null |
2024-07-30 | Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach | Adam Wojciechowski et.al. | 2407.20899 | null |
2024-08-01 | DFE-IANet: A Method for Polyp Image Classification Based on Dual-domain Feature Extraction and Interaction Attention | Wei Wang et.al. | 2407.20843 | null |
2024-08-01 | The Susceptibility of Example-Based Explainability Methods to Class Outliers | Ikhtiyor Nematov et.al. | 2407.20678 | null |
2024-07-30 | Knowledge Fused Recognition: Fusing Hierarchical Knowledge for Image Recognition through Quantitative Relativity Modeling and Deep Metric Learning | Yunfeng Zhao et.al. | 2407.20600 | null |
2024-07-30 | Exploring Liquid Neural Networks on Loihi-2 | Wiktoria Agata Pawlak et.al. | 2407.20590 | null |
2024-07-29 | Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation | Ashirbad Mishra et.al. | 2407.20462 | null |
2024-07-29 | Diffusion Feedback Helps CLIP See Better | Wenxuan Wang et.al. | 2407.20171 | null |
2024-07-29 | Distilling High Diagnostic Value Patches for Whole Slide Image Classification Using Attention Mechanism | Tianhang Nan et.al. | 2407.19821 | null |
2024-07-28 | Competition-based Adaptive ReLU for Deep Neural Networks | Junjia Chen et.al. | 2407.19441 | null |
2024-07-28 | Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets | Tianxiao Zhang et.al. | 2407.19394 | link |
2024-07-27 | Inference-Time Selective Debiasing | Gleb Kuzmin et.al. | 2407.19345 | null |
2024-07-27 | Stellar Blend Image Classification Using Computationally Efficient Gaussian Processes | Chinedu Eleh et.al. | 2407.19297 | null |
2024-07-27 | Towards Robust Few-shot Class Incremental Learning in Audio Classification using Contrastive Representation | Riyansha Singh et.al. | 2407.19265 | null |
2024-07-27 | A Survey of Malware Detection Using Deep Learning | Ahmed Bensaoud et.al. | 2407.19153 | null |
2024-07-26 | UniForensics: Face Forgery Detection via General Facial Representation | Ziyuan Fang et.al. | 2407.19079 | null |
2024-07-26 | A Scalable Quantum Non-local Neural Network for Image Classification | Sparsh Gupta et.al. | 2407.18906 | link |
2024-07-26 | Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment | Yuze Zheng et.al. | 2407.18854 | null |
2024-07-26 | Local Binary Pattern(LBP) Optimization for Feature Extraction | Zeinab Sedaghatjoo et.al. | 2407.18665 | null |
2024-07-26 | Topology Optimization of Random Memristors for Input-Aware Dynamic SNN | Bo Wang et.al. | 2407.18625 | null |
2024-07-26 | Content-driven Magnitude-Derivative Spectrum Complementary Learning for Hyperspectral Image Classification | Huiyan Bai et.al. | 2407.18593 | null |
2024-07-26 | VSSD: Vision Mamba with Non-Casual State Space Duality | Yuheng Shi et.al. | 2407.18559 | link |
2024-07-25 | Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images | Roberto Di Via et.al. | 2407.18125 | null |
2024-07-25 | Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network | Sukwon Yun et.al. | 2407.17857 | link |
2024-07-25 | SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification | Heng Fang et.al. | 2407.17689 | link |
2024-07-26 | Unsqueeze [CLS] Bottleneck to Learn Rich Representations | Qing Su et.al. | 2407.17671 | link |
2024-07-24 | Explaining the Model, Protecting Your Data: Revealing and Mitigating the Data Privacy Risks of Post-Hoc Model Explanations via Membership Inference | Catherine Huang et.al. | 2407.17663 | null |
2024-07-23 | S-E Pipeline: A Vision Transformer (ViT) based Resilient Classification Pipeline for Medical Imaging Against Adversarial Attacks | Neha A S et.al. | 2407.17587 | null |
2024-07-24 | A Novel Two-Step Fine-Tuning Pipeline for Cold-Start Active Learning in Text Classification Tasks | Fabiano Belém et.al. | 2407.17284 | null |
2024-07-24 | Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D Medical Image Classification? | Johannes Kiechle et.al. | 2407.17219 | link |
2024-07-24 | Quanv4EO: Empowering Earth Observation by means of Quanvolutional Neural Networks | Alessandro Sebastianelli et.al. | 2407.17108 | null |
2024-07-24 | An Adaptive Gradient Regularization Method | Huixiu Jiang et.al. | 2407.16944 | null |
2024-07-23 | Lawma: The Power of Specialization for Legal Tasks | Ricardo Dominguez-Olmedo et.al. | 2407.16615 | null |
2024-07-23 | Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging | Daniela L. Ramos et.al. | 2407.16608 | null |
2024-07-23 | Designing robust diffractive neural networks with improved transverse shift tolerance | Daniil V. Soshnikov et.al. | 2407.16456 | null |
2024-07-23 | Image Classification using Fuzzy Pooling in Convolutional Kolmogorov-Arnold Networks | Ayan Igali et.al. | 2407.16268 | null |
2024-07-23 | HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification | Shuyi Ouyang et.al. | 2407.16244 | null |
2024-07-23 | Improved Few-Shot Image Classification Through Multiple-Choice Questions | Dipika Khullar et.al. | 2407.16145 | null |
2024-07-22 | Pavement Fatigue Crack Detection and Severity Classification Based on Convolutional Neural Network | Zhen Wang et.al. | 2407.16021 | null |
2024-07-22 | AIDE: Antithetical, Intent-based, and Diverse Example-Based Explanations | Ikhtiyor Nematov et.al. | 2407.16010 | null |
2024-07-22 | Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | Aayush Saxena et.al. | 2407.15904 | null |
2024-07-22 | Beyond Size and Class Balance: Alpha as a New Dataset Quality Metric for Deep Learning | Josiah Couch et.al. | 2407.15724 | null |
2024-07-22 | Retinomorphic Feature Detection and Machine Vision in a Network Laser | Wai Kit Ng et.al. | 2407.15558 | null |
2024-07-22 | Learning deep illumination-robust features from multispectral filter array images | Anis Amziane et.al. | 2407.15472 | null |
2024-07-22 | Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data | Junha Song et.al. | 2407.15383 | null |
2024-07-22 | FMDNN: A Fuzzy-guided Multi-granular Deep Neural Network for Histopathological Image Classification | Weiping Ding et.al. | 2407.15312 | null |
2024-07-21 | Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu et.al. | 2407.15171 | null |
2024-07-21 | A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts | Gokcen Gokceoglu et.al. | 2407.15136 | null |
2024-07-20 | Toward Efficient Convolutional Neural Networks With Structured Ternary Patterns | Christos Kyrkou et.al. | 2407.14831 | link |
2024-07-20 | Subgraph Clustering and Atom Learning for Improved Image Classification | Aryan Singh et.al. | 2407.14772 | null |
2024-07-20 | A Comprehensive Review of Few-shot Action Recognition | Yuyang Wanyan et.al. | 2407.14744 | null |
2024-07-19 | DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | Sarah Jabbour et.al. | 2407.14509 | null |
2024-07-19 | Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models | Xuenan Xu et.al. | 2407.14355 | null |
2024-07-19 | EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition | Youssef Doulfoukar et.al. | 2407.14314 | null |
2024-07-18 | CoAPT: Context Attribute words for Prompt Tuning | Gun Lee et.al. | 2407.13808 | null |
2024-07-18 | GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model | Abdelrahman Shaker et.al. | 2407.13772 | link |
2024-07-18 | Addressing Imbalance for Class Incremental Learning in Medical Image Classification | Xuze Hao et.al. | 2407.13768 | null |
2024-07-18 | Differential Privacy Mechanisms in Neural Tangent Kernel Regression | Jiuxiang Gu et.al. | 2407.13621 | null |
2024-07-18 | CycleMix: Mixing Source Domains for Domain Generalization in Style-Dependent Data | Aristotelis Ballas et.al. | 2407.13421 | link |
2024-07-17 | LookupViT: Compressing visual information to a limited number of tokens | Rajat Koner et.al. | 2407.12753 | null |
2024-07-17 | Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Dohyung Kim et.al. | 2407.12637 | null |
2024-07-17 | Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification? | Aman Sinha et.al. | 2407.12626 | null |
2024-07-18 | Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Antoni Kowalczuk et.al. | 2407.12588 | link |
2024-07-17 | Non-parametric regularization for class imbalance federated medical image classification | Jeffry Wicaksana et.al. | 2407.12446 | link |
2024-07-17 | FETCH: A Memory-Efficient Replay Approach for Continual Learning in Image Classification | Markus Weißflog et.al. | 2407.12375 | null |
2024-07-17 | Adaptive Cascading Network for Continual Test-Time Adaptation | Kien X. Nguyen et.al. | 2407.12240 | null |
2024-07-16 | Generalized Coverage for More Robust Low-Budget Active Learning | Wonho Bae et.al. | 2407.12212 | null |
2024-07-18 | A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification | Markus Marks et.al. | 2407.12210 | null |
2024-07-16 | Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces | Shumei Liu et.al. | 2407.11701 | null |
2024-07-16 | Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification | Naif Alkhunaizi et.al. | 2407.11573 | null |
2024-07-16 | TCFormer: Visual Recognition via Token Clustering Transformer | Wang Zeng et.al. | 2407.11321 | link |
2024-07-16 | PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | Pierre-David Letourneau et.al. | 2407.11306 | null |
2024-07-15 | Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion | Philipp Allgeuer et.al. | 2407.11211 | null |
2024-07-16 | DataDream: Few-shot Guided Dataset Generation | Jae Myung Kim et.al. | 2407.10910 | link |
2024-07-15 | Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification | Linhao Qu et.al. | 2407.10814 | null |
2024-07-15 | Employing Sentence Space Embedding for Classification of Data Stream from Fake News Domain | Paweł Zyblewski et.al. | 2407.10807 | null |
2024-07-15 | Anticipating Future Object Compositions without Forgetting | Youssef Zahran et.al. | 2407.10723 | null |
2024-07-15 | GeoMix: Towards Geometry-Aware Data Augmentation | Wentao Zhao et.al. | 2407.10681 | link |
2024-07-15 | Learning Natural Consistency Representation for Face Forgery Video Detection | Daichi Zhang et.al. | 2407.10550 | null |
2024-07-15 | Improving Hyperbolic Representations via Gromov-Wasserstein Regularization | Yifei Yang et.al. | 2407.10495 | null |
2024-07-15 | Backdoor Attacks against Image-to-Image Networks | Wenbo Jiang et.al. | 2407.10445 | null |
2024-07-14 | Deep Learning Algorithms for Early Diagnosis of Acute Lymphoblastic Leukemia | Dimitris Papaioannou et.al. | 2407.10251 | null |
2024-07-14 | Advancing Continual Learning for Robust Deepfake Audio Classification | Feiyi Dong et.al. | 2407.10108 | null |
2024-07-12 | Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off | Levente Halmosi et.al. | 2407.09150 | link |
2024-07-12 | Open Vocabulary Multi-Label Video Classification | Rohit Gupta et.al. | 2407.09073 | null |
2024-07-12 | GPC: Generative and General Pathology Image Classifier | Anh Tien Nguyen et.al. | 2407.09035 | null |
2024-07-12 | CAMP: Continuous and Adaptive Learning Model in Pathology | Anh Tien Nguyen et.al. | 2407.09030 | null |
2024-07-12 | SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification | Tong Shu et.al. | 2407.08968 | null |
2024-07-12 | Domain-Hierarchy Adaptation via Chain of Iterative Reasoning for Few-shot Hierarchical Text Classification | Ke Ji et.al. | 2407.08959 | null |
2024-07-11 | Local Clustering for Lung Cancer Image Classification via Sparse Solution Technique | Jackson Hamel et.al. | 2407.08800 | null |
2024-07-11 | Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification | Wenshuo Peng et.al. | 2407.08787 | null |
2024-07-11 | ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions | Jiu Feng et.al. | 2407.08691 | link |
2024-07-11 | Histopathological Image Classification with Cell Morphology Aware Deep Neural Networks | Andrey Ignatov et.al. | 2407.08625 | link |
2024-07-11 | BiasPruner: Debiased Continual Learning for Medical Image Classification | Nourhan Bayasi et.al. | 2407.08609 | link |
2024-07-11 | GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification | Aitao Yang et.al. | 2407.08255 | link |
2024-07-11 | Beyond Text: Leveraging Multi-Task Learning and Cognitive Appraisal Theory for Post-Purchase Intention Analysis | Gerard Christopher Yeo et.al. | 2407.08182 | null |
2024-07-11 | Enrich the content of the image Using Context-Aware Copy Paste | Qiushi Guo et.al. | 2407.08151 | null |
2024-07-10 | MambaVision: A Hybrid Mamba-Transformer Vision Backbone | Ali Hatamizadeh et.al. | 2407.08083 | link |
2024-07-10 | The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others | Daniel Sikar et.al. | 2407.07818 | null |
2024-07-11 | Trainable Highly-expressive Activation Functions | Irit Chelly et.al. | 2407.07564 | null |
2024-07-10 | HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification | Omar S. EL-Assiouti et.al. | 2407.07516 | null |
2024-07-10 | Towards a text-based quantitative and explainable histopathology image analysis | Anh Tien Nguyen et.al. | 2407.07360 | null |
2024-07-11 | FALFormer: Feature-aware Landmarks self-attention for Whole-slide Image Classification | Doanh C. Bui et.al. | 2407.07340 | link |
2024-07-10 | Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken | Peifu Liu et.al. | 2407.07307 | link |
2024-07-09 | Exploring Camera Encoder Designs for Autonomous Driving Perception | Barath Lakshmanan et.al. | 2407.07276 | null |
2024-07-09 | CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning Fusion | Hosam S. EL-Assiouti et.al. | 2407.06673 | null |
2024-07-09 | NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification | Hongfei Huang et.al. | 2407.06579 | null |
2024-07-08 | Hybrid Classical-Quantum architecture for vectorised image classification of hand-written sketches | Y. Cordero et.al. | 2407.06416 | null |
2024-07-08 | GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images | Jon Crall et.al. | 2407.06337 | null |
2024-07-08 | Multi-Label Plant Species Classification with Self-Supervised Vision Transformers | Murilo Gustineli et.al. | 2407.06298 | link |
2024-07-08 | Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise | Bidur Khanal et.al. | 2407.05973 | null |
2024-07-08 | Wavelet Convolutions for Large Receptive Fields | Shahaf E. Finder et.al. | 2407.05848 | link |
2024-07-08 | Evaluating the Fairness of Neural Collapse in Medical Image Classification | Kaouther Mouheb et.al. | 2407.05843 | null |
2024-07-08 | Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification | Jiaying Shi et.al. | 2407.05647 | null |
2024-07-08 | New Directions in Text Classification Research: Maximizing The Performance of Sentiment Classification from Limited Data | Surya Agustian et.al. | 2407.05627 | null |
2024-07-08 | Momentum Auxiliary Network for Supervised Local Learning | Junhao Su et.al. | 2407.05623 | link |
2024-07-08 | Open-world Multi-label Text Classification with Extremely Weak Supervision | Xintong Li et.al. | 2407.05609 | link |
2024-07-08 | FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance | Jiedong Zhuang et.al. | 2407.05578 | null |
2024-07-08 | An accurate detection is not all you need to combat label noise in web-noisy datasets | Paul Albert et.al. | 2407.05528 | null |
2024-07-07 | Leveraging Topological Guidance for Improved Knowledge Distillation | Eun Som Jeon et.al. | 2407.05316 | link |
2024-07-05 | AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Yuhan Zhu et.al. | 2407.04603 | null |
2024-07-05 | AMD: Automatic Multi-step Distillation of Large-scale Vision Models | Cheng Han et.al. | 2407.04208 | null |
2024-07-04 | LeDNet: Localization-enabled Deep Neural Network for Multi-Label Radiography Image Classification | Lalit Pant et.al. | 2407.03931 | null |
2024-07-04 | DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification | Saifullah Saifullah et.al. | 2407.03830 | null |
2024-07-04 | reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis | Kai Norman Clasen et.al. | 2407.03653 | link |
2024-07-04 | Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes | Yusuke Hirota et.al. | 2407.03623 | null |
2024-07-04 | Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification | Xuerong Zhang et.al. | 2407.03596 | null |
2024-07-04 | DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification | Wenhui Zhu et.al. | 2407.03575 | link |
2024-07-03 | A multicategory jet image classification framework using deep neural network | Jairo Orozco Sandoval et.al. | 2407.03524 | null |
2024-07-03 | Model Guidance via Explanations Turns Image Classifiers into Segmentation Models | Xiaoyan Yu et.al. | 2407.03009 | null |
2024-07-03 | ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation | Yipin Guo et.al. | 2407.02881 | null |
2024-07-03 | Fine-Grained Scene Image Classification with Modality-Agnostic Adapter | Yiqun Wang et.al. | 2407.02769 | link |
2024-07-03 | ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers | Yanfeng Jiang et.al. | 2407.02763 | null |
2024-07-02 | Spectral Graph Reasoning Network for Hyperspectral Image Classification | Huiling Wang et.al. | 2407.02647 | null |
2024-07-01 | CGRclust: Chaos Game Representation for Twin Contrastive Clustering of Unlabelled DNA Sequences | Fatemeh Alipour et.al. | 2407.02538 | link |
2024-07-02 | Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts | Chunlan Ma et.al. | 2407.02320 | null |
2024-07-03 | Federated Distillation for Medical Image Classification: Towards Trustworthy Computer-Aided Diagnosis | Sufen Ren et.al. | 2407.02261 | null |
2024-07-02 | Hybrid Feature Collaborative Reconstruction Network for Few-Shot Fine-Grained Image Classification | Shulei Qiu et.al. | 2407.02123 | null |
2024-07-01 | Optimized Learning for X-Ray Image Classification for Multi-Class Disease Diagnoses with Accelerated Computing Strategies | Sebastian A. Cruz Romero et.al. | 2407.01705 | null |
2024-07-02 | xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart | Tianrun Chen et.al. | 2407.01530 | link |
2024-07-01 | Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision | Balaji VS et.al. | 2407.01435 | null |
2024-07-01 | Semantic Compositions Enhance Vision-Language Contrastive Learning | Maxwell Aladago et.al. | 2407.01408 | null |
2024-07-01 | GalLoP: Learning Global and Local Prompts for Vision-Language Models | Marc Lafon et.al. | 2407.01400 | null |
2024-07-01 | Protecting Privacy in Classifiers by Token Manipulation | Re'em Harel et.al. | 2407.01334 | null |
2024-07-01 | Gradient-based Class Weighting for Unsupervised Domain Adaptation in Dense Prediction Visual Tasks | Roberto Alcover-Couso et.al. | 2407.01327 | null |
2024-06-28 | Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes | Dmitry Demidov et.al. | 2406.19814 | link |
2024-06-27 | Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads | Ali Khaleghi Rahimian et.al. | 2406.19391 | link |
2024-06-27 | Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation | Yushun Tang et.al. | 2406.19341 | null |
2024-06-27 | Spiking Convolutional Neural Networks for Text Classification | Changze Lv et.al. | 2406.19230 | link |
2024-06-27 | Adaptive Stochastic Weight Averaging | Caglar Demir et.al. | 2406.19092 | link |
2024-06-27 | FedMLP: Federated Multi-Label Medical Image Classification under Task Heterogeneity | Zhaobin Sun et.al. | 2406.18995 | link |
2024-06-26 | Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated | Jiazhou Ji et.al. | 2406.18259 | null |
2024-06-26 | ViT-1.58b: Mobile Vision Transformers in the 1-bit Era | Zhengqing Yuan et.al. | 2406.18051 | null |
2024-06-25 | Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation | Tushar Prasanna Swaminathan et.al. | 2406.17749 | link |
2024-06-25 | Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning | Arijit Sehanobish et.al. | 2406.17740 | null |
2024-06-25 | BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging | Zeinab Sherkatghanad et.al. | 2406.17640 | link |
2024-06-26 | Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Sedigheh Eslami et.al. | 2406.17639 | null |
2024-06-25 | Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels | Nicholas Pangakis et.al. | 2406.17633 | null |
2024-06-25 | Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification | Huiyao Chen et.al. | 2406.17534 | link |
2024-06-25 | TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification | Joshua Niemeijer et.al. | 2406.17473 | null |
2024-06-25 | Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning | Jintao Yan et.al. | 2406.17470 | null |
2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | null |
2024-06-25 | Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection | Peng Huang et.al. | 2406.17338 | null |
2024-06-24 | Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings | Andrea Posada et.al. | 2406.16611 | link |
2024-06-24 | Improving robustness to corruptions with multiplicative weight perturbations | Trung Trinh et.al. | 2406.16540 | null |
2024-06-24 | UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification | Alvaro Lopez Pellicer et.al. | 2406.16501 | null |
2024-06-24 | Improving Quaternion Neural Networks with Quaternionic Activation Functions | Johannes Pöppelbaum et.al. | 2406.16481 | null |
2024-06-24 | Learning in Wilson-Cowan model for metapopulation | Raffaele Marino et.al. | 2406.16453 | link |
2024-06-24 | Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model | Sai Ganesh et.al. | 2406.16383 | null |
2024-06-24 | Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels | Zixia Jia et.al. | 2406.16293 | null |
2024-06-23 | Jacobian Descent for Multi-Objective Optimization | Pierre Quinton et.al. | 2406.16232 | null |
2024-06-23 | Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | Yangdi Lu et.al. | 2406.15982 | null |
2024-06-22 | PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection | Alvaro Lopez Pellcier et.al. | 2406.15921 | null |
2024-06-21 | Retrieval Augmented Zero-Shot Text Classification | Tassallah Abdullahi et.al. | 2406.15241 | null |
2024-06-21 | DiffExplainer: Unveiling Black Box Models Via Counterfactual Generation | Yingying Fang et.al. | 2406.15182 | null |
2024-06-21 | This actually looks like that: Proto-BagNets for local and global interpretability-by-design | Kerol Djoumessi et.al. | 2406.15168 | link |
2024-06-21 | Hierarchical thematic classification of major conference proceedings | Arsentii Kuzmin et.al. | 2406.14983 | null |
2024-06-21 | Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks | Minjong Cheon et.al. | 2406.14916 | link |
2024-06-21 | MU-Bench: A Multitask Multimodal Benchmark for Machine Unlearning | Jiali Cheng et.al. | 2406.14796 | null |
2024-06-20 | Depth |
Parker Seegmiller et.al. | 2406.14695 | null |
2024-06-20 | Automatic Labels are as Effective as Manual Labels in Biomedical Images Classification with Deep Learning | Niccolò Marini et.al. | 2406.14351 | null |
2024-06-20 | Self-supervised Interpretable Concept-based Models for Text Classification | Francesco De Santis et.al. | 2406.14335 | null |
2024-06-20 | Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization | Tanapat Ratchatorn et.al. | 2406.14329 | null |
2024-06-20 | Boosting Hyperspectral Image Classification with Gate-Shift-Fuse Mechanisms in a Novel CNN-Transformer Approach | Mohamed Fadhlallah Guerri et.al. | 2406.14120 | null |
2024-06-20 | Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Qinfeng Zhu et.al. | 2406.14086 | link |
2024-06-21 | CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification | Faxu Guo et.al. | 2406.14080 | null |
2024-06-20 | Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods | Tim Tsz-Kit Lau et.al. | 2406.13936 | null |
2024-06-19 | WATT: Weight Average Test-Time Adaption of CLIP | David Osowiechi et.al. | 2406.13875 | link |
2024-06-19 | CNN Based Flank Predictor for Quadruped Animal Species | Vanessa Suessle et.al. | 2406.13588 | null |
2024-06-19 | Online Domain-Incremental Learning Approach to Classify Acoustic Scenes in All Locations | Manjunath Mulimani et.al. | 2406.13386 | null |
2024-06-18 | LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging | Jinuk Kim et.al. | 2406.12837 | link |
2024-06-18 | Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation | Nikolas Koutsoubis et.al. | 2406.12815 | link |
2024-06-18 | Online Anchor-based Training for Image Classification Tasks | Maria Tzelepi et.al. | 2406.12662 | null |
2024-06-18 | Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation | Branislav Pecher et.al. | 2406.12471 | null |
2024-06-18 | GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory | Haoze Wu et.al. | 2406.12375 | null |
2024-06-18 | What Did I Do Wrong? Quantifying LLMs' Sensitivity and Consistency to Prompt Engineering | Federico Errica et.al. | 2406.12334 | null |
2024-06-18 | Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification | Zehui Liao et.al. | 2406.12293 | null |
2024-06-18 | Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics | Hyojin Kim et.al. | 2406.12258 | null |
2024-06-19 | MiSuRe is all you need to explain your image segmentation | Syed Nouman Hasany et.al. | 2406.12173 | null |
2024-06-17 | Enhancing Text Classification through LLM-Driven Active Learning and Human Annotation | Hamidreza Rouzegar et.al. | 2406.12114 | link |
2024-06-17 | Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | Lei Zhu et.al. | 2406.11837 | link |
2024-06-17 | PrAViC: Probabilistic Adaptation Framework for Real-Time Video Classification | Magdalena Trędowicz et.al. | 2406.11443 | null |
2024-06-17 | Cross-domain Open-world Discovery | Shuo Wen et.al. | 2406.11422 | link |
2024-06-17 | BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models | Xuefeng Hu et.al. | 2406.11309 | null |
2024-06-17 | An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers | Ashim Gupta et.al. | 2406.11307 | null |
2024-06-17 | Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification | Letian Peng et.al. | 2406.11115 | null |
2024-06-16 | Fine-grained Classes and How to Find Them | Matej Grcić et.al. | 2406.11070 | link |
2024-06-16 | Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality | Liwei Che et.al. | 2406.11048 | null |
2024-06-16 | Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval | Rohan Chavan et.al. | 2406.11029 | link |
2024-06-16 | Universal Cross-Lingual Text Classification | Riya Savant et.al. | 2406.11028 | null |
2024-06-14 | UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner | Dongchao Yang et.al. | 2406.10056 | null |
2024-06-14 | Comparison of fine-tuning strategies for transfer learning in medical image classification | Ana Davila et.al. | 2406.10050 | null |
2024-06-14 | Forgetting Order of Continual Learning: Examples That are Learned First are Forgotten Last | Guy Hacohen et.al. | 2406.09935 | null |
2024-06-13 | MirrorCheck: Efficient Adversarial Defense for Vision-Language Models | Samar Fares et.al. | 2406.09250 | null |
2024-06-13 | Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models | Christopher Schröder et.al. | 2406.09206 | null |
2024-06-13 | Large-Scale Evaluation of Open-Set Image Classification Techniques | Halil Bisgin et.al. | 2406.09112 | link |
2024-06-13 | LaCoOT: Layer Collapse through Optimal Transport | Victor Quétu et.al. | 2406.08933 | null |
2024-06-13 | The Penalized Inverse Probability Measure for Conformal Classification | Paul Melki et.al. | 2406.08884 | null |
2024-06-13 | Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency | Maor Dikter et.al. | 2406.08840 | link |
2024-06-13 | DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification | Zhengrui Xu et.al. | 2406.08773 | null |
2024-06-12 | Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification | Martin Juan José Bucher et.al. | 2406.08660 | null |
2024-06-12 | Intelligent Multi-View Test Time Augmentation | Efe Ozturk et.al. | 2406.08593 | null |
2024-06-12 | Transformation-Dependent Adversarial Attacks | Yaoteng Tan et.al. | 2406.08443 | null |
2024-06-12 | AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer | Yitao Xu et.al. | 2406.08298 | null |
2024-06-12 | DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | Jordy Van Landeghem et.al. | 2406.08226 | null |
2024-06-12 | Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor | Yongjie Si et.al. | 2406.08122 | null |
2024-06-12 | Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network | Yanxiong Li et.al. | 2406.08119 | null |
2024-06-12 | A |
Lixian Zhang et.al. | 2406.08079 | null |
2024-06-12 | Adversarial Evasion Attack Efficiency against Large Language Models | João Vitorino et.al. | 2406.08050 | null |
2024-06-12 | Accurate Explanation Model for Image Classifiers using Class Association Embedding | Ruitao Xie et.al. | 2406.07961 | link |
2024-06-12 | Multi-Teacher Multi-Objective Meta-Learning for Zero-Shot Hyperspectral Band Selection | Jie Feng et.al. | 2406.07949 | null |
2024-06-12 | Small Scale Data-Free Knowledge Distillation | He Liu et.al. | 2406.07876 | link |
2024-06-11 | fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions | Alireza Afzal Aghaei et.al. | 2406.07456 | link |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332 | null |
2024-06-11 | Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment | Takuto Igarashi et.al. | 2406.07280 | null |
2024-06-11 | EEG-ImageNet: An Electroencephalogram Dataset and Benchmarks with Image Visual Stimuli of Multi-Granularity Labels | Shuqi Zhu et.al. | 2406.07151 | link |
2024-06-11 | RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents | Wenjia Xu et.al. | 2406.07089 | null |
2024-06-11 | DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification | Jiamu Sheng et.al. | 2406.07050 | null |
2024-06-11 | Fairness-Aware Meta-Learning via Nash Bargaining | Yi Zeng et.al. | 2406.07029 | null |
2024-06-11 | Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models | Zhenyi Lu et.al. | 2406.07001 | link |
2024-06-11 | Scaling up masked audio encoder learning for general audio classification | Heinrich Dinkel et.al. | 2406.06992 | null |
2024-06-10 | Multi-Objective Neural Architecture Search for In-Memory Computing | Md Hasibul Amin et.al. | 2406.06746 | null |
2024-06-10 | Robust Latent Representation Tuning for Image-text Classification | Hao Sun et.al. | 2406.06048 | null |
2024-06-09 | Contrastive Learning from Synthetic Audio Doppelgangers | Manuel Cherep et.al. | 2406.05923 | null |
2024-06-09 | Scaling Graph Convolutions for Mobile Vision | William Avery et.al. | 2406.05850 | link |
2024-06-09 | Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification | Yuxin Hong et.al. | 2406.05677 | null |
2024-06-09 | Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision | Pranav Jeevan et.al. | 2406.05612 | link |
2024-06-08 | Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification | Yunhe Gao et.al. | 2406.05596 | null |
2024-06-07 | The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Scott Geng et.al. | 2406.05184 | link |
2024-06-07 | A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification | Christian Giannetti et.al. | 2406.05096 | null |
2024-06-07 | Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations | Benjamin Fresz et.al. | 2406.05068 | link |
2024-06-07 | REP: Resource-Efficient Prompting for On-device Continual Learning | Sungho Jeon et.al. | 2406.04772 | null |
2024-06-07 | AICoderEval: Improving AI Domain Code Generation of Large Language Models | Yinghui Xia et.al. | 2406.04712 | null |
2024-06-07 | Cooperative Meta-Learning with Gradient Augmentation | Jongyun Shin et.al. | 2406.04639 | link |
2024-06-06 | OCCAM: Towards Cost-Efficient and Accuracy-Aware Image Classification Inference | Dujian Ding et.al. | 2406.04508 | null |
2024-06-06 | Can Language Models Use Forecasting Strategies? | Sarah Pratt et.al. | 2406.04446 | null |
2024-06-06 | Parameter-Inverted Image Pyramid Networks | Xizhou Zhu et.al. | 2406.04330 | link |
2024-06-07 | BEADs: Bias Evaluation Across Domains | Shaina Raza et.al. | 2406.04220 | null |
2024-06-06 | What Do Language Models Learn in Context? The Structured Task Hypothesis | Jiaoda Li et.al. | 2406.04216 | null |
2024-06-06 | Pointer-Guided Pre-Training: Infusing Large Language Models with Paragraph-Level Contextual Awareness | Lars Hillebrand et.al. | 2406.04156 | link |
2024-06-07 | ReDistill: Residual Encoded Distillation for Peak Memory Reduction | Fang Chen et.al. | 2406.03744 | null |
2024-06-06 | LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification | Chun Liu et.al. | 2406.03725 | link |
2024-06-05 | Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review | Sonia Bbouzidi et.al. | 2406.03478 | null |
2024-06-05 | IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models | David Ifeoluwa Adelani et.al. | 2406.03368 | null |
2024-06-05 | Audio Mamba: Bidirectional State Space Model for Audio Representation Learning | Mehmet Hamza Erol et.al. | 2406.03344 | link |
2024-06-05 | FusionBench: A Comprehensive Benchmark of Deep Model Fusion | Anke Tang et.al. | 2406.03280 | null |
2024-06-05 | VWise: A novel benchmark for evaluating scene classification for vehicular applications | Pedro Azevedo et.al. | 2406.03273 | null |
2024-06-05 | Tiny models from tiny data: Textual and null-text inversion for few-shot distillation | Erik Landolsi et.al. | 2406.03146 | link |
2024-06-05 | Exploiting LMM-based knowledge for image classification tasks | Maria Tzelepi et.al. | 2406.03071 | null |
2024-06-04 | Randomized Geometric Algebra Methods for Convex Neural Networks | Yifei Wang et.al. | 2406.02806 | null |
2024-06-04 | DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark | Chi-Jui Chang et.al. | 2406.02468 | null |
2024-06-04 | GrootVL: Tree Topology is All You Need in State Space Model | Yicheng Xiao et.al. | 2406.02395 | link |
2024-06-04 | Hybrid Quantum-Classical Neural Network for LAB Color Space Image Classification | Kwokho Ng et.al. | 2406.02229 | null |
2024-06-03 | Few-Shot Classification of Interactive Activities of Daily Living (InteractADL) | Zane Durante et.al. | 2406.01662 | link |
2024-06-03 | CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations | Franz Motzkus et.al. | 2406.01649 | null |
2024-06-03 | Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients | Yuncong Zuo et.al. | 2406.01439 | null |
2024-06-03 | Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization | Firas Khader et.al. | 2406.01314 | null |
2024-06-03 | Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE | Jiaxu Liu et.al. | 2406.01282 | null |
2024-06-04 | MultiMax: Sparse and Multi-Modal Attention Learning | Yuxuan Zhou et.al. | 2406.01189 | link |
2024-06-03 | Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling | Wrick Talukdar et.al. | 2406.01096 | null |
2024-05-31 | You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet | Zhen Qin et.al. | 2405.21022 | null |
2024-05-31 | Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study | Pallavi Mitra et.al. | 2405.20876 | null |
2024-05-31 | Improving Generalization and Convergence by Enhancing Implicit Regularization | Mingze Wang et.al. | 2405.20763 | null |
2024-05-31 | Robust Stable Spiking Neural Networks | Jianhao Ding et.al. | 2405.20694 | null |
2024-05-31 | Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space | Yukai Zhang et.al. | 2405.20685 | null |
2024-05-31 | GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification | Hansang Lee et.al. | 2405.20650 | null |
2024-05-31 | ToxVidLLM: A Multimodal LLM-based Framework for Toxicity Detection in Code-Mixed Videos | Krishanu Maity et.al. | 2405.20628 | null |
2024-05-30 | Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation | Louis L. Chen et.al. | 2405.20531 | null |
2024-05-30 | DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark | Haoxing Chen et.al. | 2405.19707 | link |
2024-05-30 | A Novel Approach for Automated Design Information Mining from Issue Logs | Jiuang Zhao et.al. | 2405.19623 | null |
2024-05-29 | I Bet You Did Not Mean That: Testing Semantic Importance via Betting | Jacopo Teneggi et.al. | 2405.19146 | link |
2024-05-29 | Verifiably Robust Conformal Prediction | Linus Jeary et.al. | 2405.18942 | null |
2024-05-29 | Leveraging Many-To-Many Relationships for Defending Against Visual-Language Adversarial Attacks | Futa Waseda et.al. | 2405.18770 | null |
2024-05-29 | GIST: Greedy Independent Set Thresholding for Diverse Data Summarization | Matthew Fahrbach et.al. | 2405.18754 | null |
2024-05-29 | LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification | Renyi Qu et.al. | 2405.18672 | null |
2024-05-28 | Its Not a Modality Gap: Characterizing and Addressing the Contrastive Gap | Abrar Fahim et.al. | 2405.18570 | null |
2024-05-28 | Why are Visually-Grounded Language Models Bad at Image Classification? | Yuhui Zhang et.al. | 2405.18415 | link |
2024-05-28 | MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution | Wenzhuo Liu et.al. | 2405.18240 | null |
2024-05-28 | Confidence-aware multi-modality learning for eye disease screening | Ke Zou et.al. | 2405.18167 | link |
2024-05-28 | 4-bit Shampoo for Memory-Efficient Network Training | Sike Wang et.al. | 2405.18144 | null |
2024-05-28 | DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | Shentong Mo et.al. | 2405.17995 | null |
2024-05-27 | WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average | Louis Fournier et.al. | 2405.17517 | null |
2024-05-27 | Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators | Yunian Pan et.al. | 2405.17370 | null |
2024-05-27 | On the Noise Robustness of In-Context Learning for Text Generation | Hongfu Gao et.al. | 2405.17264 | null |
2024-05-27 | Superpixelwise Low-rank Approximation based Partial Label Learning for Hyperspectral Image Classification | Shujun Yang et.al. | 2405.17110 | link |
2024-05-26 | Demystify Mamba in Vision: A Linear Attention Perspective | Dongchen Han et.al. | 2405.16605 | null |
2024-05-26 | AdaFisher: Adaptive Second Order Optimization via Fisher Information | Damien Martins Gomes et.al. | 2405.16397 | null |
2024-05-25 | ModelLock: Locking Your Model With a Spell | Yifeng Gao et.al. | 2405.16285 | null |
2024-05-25 | Accelerating Transformers with Spectrum-Preserving Token Merging | Hoai-Chau Tran et.al. | 2405.16148 | null |
2024-05-25 | Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack | Mingli Zhu et.al. | 2405.16134 | null |
2024-05-24 | Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images | Yiran Luo et.al. | 2405.15961 | null |
2024-05-24 | A Neurosymbolic Framework for Bias Correction in CNNs | Parth Padalkar et.al. | 2405.15886 | null |
2024-05-24 | What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models | Abdelrahman Abdelhamed et.al. | 2405.15668 | null |
2024-05-24 | Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning | Wenhan Chang et.al. | 2405.15662 | null |
2024-05-24 | Exposing Image Classifier Shortcuts with Counterfactual Frequency (CoF) Tables | James Hinns et.al. | 2405.15661 | null |
2024-05-24 | Harnessing Increased Client Participation with Cohort-Parallel Federated Learning | Akash Dhasade et.al. | 2405.15644 | null |
2024-05-24 | Transformer-based Federated Learning for Multi-Label Remote Sensing Image Classification | Barış Büyüktaş et.al. | 2405.15405 | null |
2024-05-24 | CLIP model is an Efficient Online Lifelong Learner | Leyuan Wang et.al. | 2405.15155 | null |
2024-05-24 | OptLLM: Optimal Assignment of Queries to Large Language Models | Yueyue Liu et.al. | 2405.15130 | null |
2024-05-23 | A Lost Opportunity for Vision-Language Models: A Comparative Study of Online Test-time Adaptation for Vision-Language Models | Mario Döbler et.al. | 2405.14977 | link |
2024-05-23 | Domain Wall Magnetic Tunnel Junction Reliable Integrate and Fire Neuron | Can Cui1 et.al. | 2405.14851 | null |
2024-05-23 | Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property | Yuya Yoshikawa et.al. | 2405.14522 | null |
2024-05-23 | SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification | Zuoyong Li et.al. | 2405.14506 | null |
2024-05-23 | Scalable Visual State Space Model with Fractal Scanning | Lv Tang et.al. | 2405.14480 | null |
2024-05-23 | Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation | Daniel Kienzle et.al. | 2405.14467 | null |
2024-05-23 | Boosting Robustness by Clipping Gradients in Distributed Learning | Youssef Allouah et.al. | 2405.14432 | null |
2024-05-23 | Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators | Changze Lv et.al. | 2405.14362 | null |
2024-05-23 | Simple Hamiltonian dynamics is a powerful quantum processing resource | Akitada Sakurai et.al. | 2405.14245 | null |
2024-05-23 | ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks | T. Y. S. S Santosh et.al. | 2405.14211 | null |
2024-05-22 | Just rotate it! Uncertainty estimation in closed-source models via multiple queries | Konstantinos Pitas et.al. | 2405.13864 | null |
2024-05-21 | Decentralized Federated Learning Over Imperfect Communication Channels | Weicai Li et.al. | 2405.12894 | null |
2024-05-21 | Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting | Omar Hamed et.al. | 2405.12705 | null |
2024-05-21 | Exploration of Masked and Causal Language Modelling for Text Generation | Nicolo Micheletti et.al. | 2405.12630 | null |
2024-05-21 | 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification | Yan He et.al. | 2405.12487 | null |
2024-05-20 | Alzheimer's Magnetic Resonance Imaging Classification Using Deep and Meta-Learning Models | Nida Nasir et.al. | 2405.12126 | null |
2024-05-20 | Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification | Weilian Zhou et.al. | 2405.12003 | link |
2024-05-20 | A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers | Tom Roth et.al. | 2405.11904 | null |
2024-05-21 | A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus | Eduard Poesina et.al. | 2405.11877 | link |
2024-05-20 | SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model | Siavash Shams et.al. | 2405.11831 | link |
2024-05-20 | Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques | Siva Rajesh Kasa et.al. | 2405.11775 | null |
2024-05-19 | SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | Jialong Guo et.al. | 2405.11582 | link |
2024-05-19 | Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification | Manan Shah et.al. | 2405.11574 | link |
2024-05-19 | An Invisible Backdoor Attack Based On Semantic Feature | Yangming Chen et.al. | 2405.11551 | null |
2024-05-19 | Verification technology for finger vein biometric | George Kumi Kyeremeh et.al. | 2405.11540 | null |
2024-05-17 | Reduced storage direct tensor ring decomposition for convolutional neural networks compression | Mateusz Gabor et.al. | 2405.10802 | link |
2024-05-17 | Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset | Jie Zhu et.al. | 2405.10542 | link |
2024-05-17 | Smart Expert System: Large Language Models as Text Classifiers | Zhiqiang Wang et.al. | 2405.10523 | link |
2024-05-16 | Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge | Florian Schmid et.al. | 2405.10018 | null |
2024-05-16 | ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset | Johannes Rückert et.al. | 2405.10004 | link |
2024-05-15 | Improving Label Error Detection and Elimination with Uncertainty Quantification | Johannes Jakubik et.al. | 2405.09602 | null |
2024-05-15 | Tackling Distribution Shifts in Task-Oriented Communication with Information Bottleneck | Hongru Li et.al. | 2405.09514 | null |
2024-05-15 | Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy | Feng Wang et.al. | 2405.09014 | link |
2024-05-14 | The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks | Ziquan Liu et.al. | 2405.08886 | link |
2024-05-14 | Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling | Gregory Holste et.al. | 2405.08780 | null |
2024-05-14 | FolkTalent: Enhancing Classification and Tagging of Indian Folk Paintings | Nancy Hada et.al. | 2405.08776 | null |
2024-05-14 | The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks | Carmela Calabrese et.al. | 2405.08695 | null |
2024-05-14 | Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis | Qingpeng Kong et.al. | 2405.08681 | link |
2024-05-14 | Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning | Alain Riou et.al. | 2405.08679 | null |
2024-05-14 | Dual-Branch Network for Portrait Image Quality Assessment | Wei Sun et.al. | 2405.08555 | null |
2024-05-13 | Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp | Rachel Hong et.al. | 2405.08209 | link |
2024-05-14 | MambaOut: Do We Really Need Mamba for Vision? | Weihao Yu et.al. | 2405.07992 | link |
2024-05-13 | Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics | Haoyang Zheng et.al. | 2405.07839 | link |
2024-05-13 | Analysis of the rate of convergence of an over-parametrized convolutional neural network image classifier learned by gradient descent | Michael Kohler et.al. | 2405.07619 | null |
2024-05-13 | On-device Online Learning and Semantic Management of TinyML Systems | Haoyu Ren et.al. | 2405.07601 | null |
2024-05-13 | GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation | Andrey V. Galichin et.al. | 2405.07562 | null |
2024-05-13 | Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents | Juri Grosjean et.al. | 2405.07513 | null |
2024-05-13 | MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasks | Haijiang Tian et.al. | 2405.07411 | null |
2024-05-12 | Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images | Fatema Tuj Johora Faria et.al. | 2405.07338 | null |
2024-05-12 | Differentiable Model Scaling using Differentiable Topk | Kai Liu et.al. | 2405.07194 | null |
2024-05-11 | A framework of text-dependent speaker verification for chinese numerical string corpus | Litong Zheng et.al. | 2405.07029 | null |
2024-05-10 | Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification | Yaoqin Ye et.al. | 2405.06468 | null |
2024-05-10 | Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data | Rongyu Zhang et.al. | 2405.06413 | null |
2024-05-10 | SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora | Faisal Qarah et.al. | 2405.06239 | null |
2024-05-09 | Deep Multi-Task Learning for Malware Image Classification | Ahmed Bensaoud et.al. | 2405.05906 | null |
2024-05-09 | Enhancing Suicide Risk Detection on Social Media through Semi-Supervised Deep Label Smoothing | Matthew Squires et.al. | 2405.05795 | null |
2024-05-09 | CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks | Nick et.al. | 2405.05755 | null |
2024-05-09 | How Quality Affects Deep Neural Networks in Fine-Grained Image Classification | Joseph Smith et.al. | 2405.05742 | null |
2024-05-09 | End-to-End Generative Semantic Communication Powered by Shared Semantic Knowledge Base | Shuling Li et.al. | 2405.05738 | null |
2024-05-09 | Using Machine Translation to Augment Multilingual Classification | Adam King et.al. | 2405.05478 | null |
2024-05-08 | AFEN: Respiratory Disease Classification using Ensemble Learning | Rahul Nadkarni et.al. | 2405.05467 | null |
2024-05-08 | XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples | Peiqin Lin et.al. | 2405.05116 | link |
2024-05-08 | Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution | Shuo Shao et.al. | 2405.04825 | null |
2024-05-07 | Exploring Explainable AI Techniques for Improved Interpretability in Lung and Colon Cancer Classification | Mukaffi Bin Moin et.al. | 2405.04610 | link |
2024-05-07 | Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs | Antonio Bikić et.al. | 2405.04386 | null |
2024-05-07 | Semi-Supervised Disease Classification based on Limited Medical Image Data | Yan Zhang et.al. | 2405.04295 | null |
2024-05-07 | DCNN: Dual Cross-current Neural Networks Realized Using An Interactive Deep Learning Discriminator for Fine-grained Objects | Da Fu et.al. | 2405.04093 | null |
2024-05-07 | Feature Map Convergence Evaluation for Functional Module | Ludan Zhang et.al. | 2405.04041 | null |
2024-05-07 | VMambaCC: A Visual State Space Model for Crowd Counting | Hao-Yuan Ma et.al. | 2405.03978 | null |
2024-05-06 | On Adversarial Examples for Text Classification by Perturbing Latent Representations | Korn Sooksatra et.al. | 2405.03789 | null |
2024-05-06 | CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification | Sankalp Sinha et.al. | 2405.03660 | null |
2024-05-06 | Deep Space Separable Distillation for Lightweight Acoustic Scene Classification | ShuQi Ye et.al. | 2405.03567 | null |
2024-05-06 | Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing | Han Liu et.al. | 2405.03565 | null |
2024-05-06 | A Lightweight Neural Architecture Search Model for Medical Image Classification | Lunchen Xie et.al. | 2405.03462 | null |
2024-05-06 | Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification | Matteo Bianchi et.al. | 2405.03301 | null |
2024-05-06 | TED: Accelerate Model Training by Internal Generalization | Jinying Xiao et.al. | 2405.03228 | null |
2024-05-06 | Advancing Multimodal Medical Capabilities of Gemini | Lin Yang et.al. | 2405.03162 | null |
2024-05-05 | A scoping review of using Large Language Models (LLMs) to investigate Electronic Health Records (EHRs) | Lingyao Li et.al. | 2405.03066 | null |
2024-05-05 | Parameter-Efficient Fine-Tuning with Discrete Fourier Transform | Ziqi Gao et.al. | 2405.03003 | null |
2024-05-04 | MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | Vishal Nedungadi et.al. | 2405.02771 | null |
2024-05-03 | Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification | Siqi Yin et.al. | 2405.02155 | null |
2024-05-03 | The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification | Minh Duc Bui et.al. | 2405.02010 | null |
2024-05-03 | Which Identities Are Mobilized: Towards an automated detection of social group appeals in political texts | Felicia Riethmüller et.al. | 2405.01904 | null |
2024-05-02 | PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability and Resilience Against Parameter Corruptions | Xun Jiao et.al. | 2405.01741 | null |
2024-05-02 | Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey | Guoping Xu et.al. | 2405.01725 | link |
2024-05-02 | SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients | Tushar Verma et.al. | 2405.01699 | null |
2024-05-02 | Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey | Rokas Gipiškis et.al. | 2405.01636 | null |
2024-05-02 | Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models | Nishad Singhi et.al. | 2405.01531 | null |
2024-05-03 | Decoupling Feature Extraction and Classification Layers for Calibrated Neural Networks | Mikkel Jordahn et.al. | 2405.01196 | null |
2024-05-02 | Uncertainty-aware self-training with expectation maximization basis transformation | Zijia Wang et.al. | 2405.01175 | null |
2024-05-02 | Transformers Fusion across Disjoint Samples for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2405.01095 | null |
2024-05-02 | Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation | Tianyi Chen et.al. | 2405.01041 | null |
2024-05-02 | Benchmarking Representations for Speech, Music, and Acoustic Events | Moreno La Quatra et.al. | 2405.00934 | link |
2024-05-01 | Digital-analog quantum convolutional neural networks for image classification | Anton Simen et.al. | 2405.00548 | null |
2024-05-03 | BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine | Mingchen Li et.al. | 2405.00465 | null |
2024-05-01 | Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol | Konstantinos Apostolidis et.al. | 2405.00384 | null |
2024-05-01 | Data Augmentation Policy Search for Long-Term Forecasting | Liran Nochumsohn et.al. | 2405.00319 | null |
2024-04-30 | Let's Focus: Focused Backdoor Attack against Federated Transfer Learning | Marco Arazzi et.al. | 2404.19420 | null |
2024-04-30 | Large Language Model Informed Patent Image Retrieval | Hao-Cheng Lo et.al. | 2404.19360 | null |
2024-04-30 | Enhancing Intrinsic Features for Debiasing via Investigating Class-Discerning Common Attributes in Bias-Contrastive Pair | Jeonghoon Park et.al. | 2404.19250 | null |
2024-04-29 | Spectral-Spatial Mamba for Hyperspectral Image Classification | Lingbo Huang et.al. | 2404.18401 | null |
2024-04-28 | TextGram: Towards a better domain-adaptive pretraining | Sharayu Hiwarkhedkar et.al. | 2404.18228 | null |
2024-04-28 | L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi | Saloni Mittal et.al. | 2404.18216 | link |
2024-04-28 | S |
Guanchun Wang et.al. | 2404.18213 | null |
2024-04-27 | Implicit Generative Prior for Bayesian Neural Networks | Yijia Liu et.al. | 2404.18008 | link |
2024-04-27 | Towards Privacy-Preserving Audio Classification Systems | Bhawana Chhaglani et.al. | 2404.18002 | null |
2024-04-27 | A Method of Moments Embedding Constraint and its Application to Semi-Supervised Learning | Michael Majurski et.al. | 2404.17978 | null |
2024-04-27 | Spatial, Temporal, and Geometric Fusion for Remote Sensing Images | Hessah Albanwan et.al. | 2404.17851 | null |
2024-04-27 | Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification | Chao Yi et.al. | 2404.17753 | link |
2024-04-26 | SPLICE -- Streamlining Digital Pathology Image Processing | Areej Alsaafin et.al. | 2404.17704 | null |
2024-04-26 | SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes | Georgia Baltsou et.al. | 2404.17255 | null |
2024-04-25 | Incorporating Lexical and Syntactic Knowledge for Unsupervised Cross-Lingual Transfer | Jianyu Zheng et.al. | 2404.16627 | link |
2024-04-25 | IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks | Zitong Huang et.al. | 2404.16331 | null |
2024-04-25 | Lacunarity Pooling Layers for Plant Image Classification using Texture Analysis | Akshatha Mohan et.al. | 2404.16268 | link |
2024-04-24 | MiMICRI: Towards Domain-centered Counterfactual Explanations of Cardiovascular Image Classification Models | Grace Guo et.al. | 2404.16174 | null |
2024-04-24 | MoDE: CLIP Data Experts via Clustering | Jiawei Ma et.al. | 2404.16030 | link |
2024-04-26 | A Survey on Visual Mamba | Hanwei Zhang et.al. | 2404.15956 | null |
2024-04-24 | Vision Transformer-based Adversarial Domain Adaptation | Yahan Li et.al. | 2404.15817 | link |
2024-04-24 | Rethinking Model Prototyping through the MedMNIST+ Dataset Collection | Sebastian Doerrich et.al. | 2404.15786 | null |
2024-04-24 | Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning | Zuheng Kang et.al. | 2404.15704 | null |
2024-04-24 | Brain Storm Optimization Based Swarm Learning for Diabetic Retinopathy Image Classification | Liang Qu et.al. | 2404.15585 | null |
2024-04-23 | An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models | Yangchen Pan et.al. | 2404.15518 | null |
2024-04-23 | Deep multi-prototype capsule networks | Saeid Abbassi et.al. | 2404.15445 | null |
2024-04-23 | A review of deep learning-based information fusion techniques for multimodal medical image classification | Yihao Li et.al. | 2404.15022 | null |
2024-04-23 | Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case | Muhammad Asif Auyb et.al. | 2404.14977 | null |
2024-04-23 | Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2404.14955 | link |
2024-04-23 | Pyramid Hierarchical Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2404.14945 | link |
2024-04-23 | Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2404.14944 | link |
2024-04-23 | CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models | Teodor Chiaburu et.al. | 2404.14830 | link |
2024-04-22 | WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models | Ronald Xie et.al. | 2404.14567 | null |
2024-04-22 | CKD: Contrastive Knowledge Distillation from A Sample-wise Perspective | Wencheng Zhu et.al. | 2404.14109 | null |
2024-04-21 | EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder | Hasanul Mahmud et.al. | 2404.13770 | null |
2024-04-21 | PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure | Feiqi Cao et.al. | 2404.13645 | link |
2024-04-21 | I2CANSAY:Inter-Class Analogical Augmentation and Intra-Class Significance Analysis for Non-Exemplar Online Task-Free Continual Learning | Songlin Dong et.al. | 2404.13576 | null |
2024-04-21 | IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models | Tao Feng et.al. | 2404.13504 | null |
2024-04-20 | Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing | Yuang Liu et.al. | 2404.13434 | null |
2024-04-20 | Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge | Khuyagbaatar Batsuren et.al. | 2404.13292 | link |
2024-04-20 | 3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification | Shyam Varahagiri et.al. | 2404.13252 | link |
2024-04-19 | On-board classification of underwater images using hybrid classical-quantum CNN based method | Sreeraj Rajan Warrier et.al. | 2404.13130 | null |
2024-04-19 | Next Generation Loss Function for Image Classification | Shakhnaz Akhmedova et.al. | 2404.12948 | null |
2024-04-19 | A Hybrid Generative and Discriminative PointNet on Unordered Point Sets | Yang Ye et.al. | 2404.12925 | null |
2024-04-19 | Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment | Danqing Ma et.al. | 2404.12634 | null |
2024-04-18 | When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes | Asaf Yehudai et.al. | 2404.12365 | null |
2024-04-18 | Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Jin Gao et.al. | 2404.12210 | link |
2024-04-18 | Concept Induction using LLMs: a user experiment for assessment | Adrita Barua et.al. | 2404.11875 | null |
2024-04-17 | Pretraining Billion-scale Geospatial Foundational Models on Frontier | Aristeidis Tsaris et.al. | 2404.11706 | null |
2024-04-17 | AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Meng Jiang et.al. | 2404.11449 | null |
2024-04-17 | Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured | Hanlin Mo et.al. | 2404.11309 | null |
2024-04-17 | A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene | Wenbo Zhang et.al. | 2404.11249 | null |
2024-04-17 | A Novel ICD Coding Framework Based on Associated and Hierarchical Code Description Distillation | Bin Zhang et.al. | 2404.11132 | null |
2024-04-17 | Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification | Pierre Lepagnol et.al. | 2404.11122 | null |
2024-04-18 | Supervised Contrastive Vision Transformer for Breast Histopathological Image Classification | Mohammad Shiri et.al. | 2404.11052 | null |
2024-04-17 | InfoMatch: Entropy Neural Estimation for Semi-Supervised Image Classification | Qi Han et.al. | 2404.11003 | link |
2024-04-16 | Incubating Text Classifiers Following User Instruction with Nothing but LLM | Letian Peng et.al. | 2404.10877 | null |
2024-04-16 | Vocabulary-free Image Classification and Semantic Segmentation | Alessandro Conti et.al. | 2404.10864 | link |
2024-04-16 | Assessing The Impact of CNN Auto Encoder-Based Image Denoising on Image Classification Tasks | Mohsen Hami et.al. | 2404.10664 | null |
2024-04-16 | Tree Bandits for Generative Bayes | Sean O'Hagan et.al. | 2404.10436 | null |
2024-04-16 | AudioProtoPNet: An interpretable deep learning model for bird sound classification | René Heinrich et.al. | 2404.10420 | null |
2024-04-16 | Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal Transport | Eduardo Fernandes Montesuma et.al. | 2404.10261 | null |
2024-04-15 | Distributed Federated Learning-Based Deep Learning Model for Privacy MRI Brain Tumor Detection | Lisang Zhou et.al. | 2404.10026 | null |
2024-04-15 | Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models | Hyeonggeun Yun et.al. | 2404.09828 | null |
2024-04-15 | Quantization of Large Language Models with an Overdetermined Basis | Daniil Merkulov et.al. | 2404.09737 | null |
2024-04-15 | Pseudo-label Learning with Calibrated Confidence Using an Energy-based Model | Masahito Toba et.al. | 2404.09585 | null |
2024-04-14 | Breast Cancer Image Classification Method Based on Deep Transfer Learning | Weimin Wang et.al. | 2404.09226 | null |
2024-04-14 | Coreset Selection for Object Detection | Hojun Lee et.al. | 2404.09161 | null |
2024-04-13 | Exploring Explainability in Video Action Recognition | Avinab Saha et.al. | 2404.09067 | null |
2024-04-13 | Fast Fishing: Approximating BAIT for Efficient and Scalable Deep Active Image Classification | Denis Huseljic et.al. | 2404.08981 | link |
2024-04-13 | PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification | Zhenwei Wang et.al. | 2404.08915 | null |
2024-04-12 | VertAttack: Taking advantage of Text Classifiers' horizontal vision | Jonathan Rusert et.al. | 2404.08538 | null |
2024-04-12 | SpectralMamba: Efficient Mamba for Hyperspectral Image Classification | Jing Yao et.al. | 2404.08489 | null |
2024-04-12 | OTTER: Improving Zero-Shot Classification via Optimal Transport | Changho Shin et.al. | 2404.08461 | null |
2024-04-12 | A Survey of Neural Network Robustness Assessment in Image Recognition | Jie Wang et.al. | 2404.08285 | null |
2024-04-12 | Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example | MingXuan Xiao et.al. | 2404.08279 | null |
2024-04-11 | HGRN2: Gated Linear RNNs with State Expansion | Zhen Qin et.al. | 2404.07904 | link |
2024-04-11 | Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification | Ricardo Pereira et.al. | 2404.07739 | null |
2024-04-11 | Contrastive-Based Deep Embeddings for Label Noise-Resilient Histopathology Image Classification | Lucas Dedieu et.al. | 2404.07605 | link |
2024-04-11 | Learning to Classify New Foods Incrementally Via Compressed Exemplars | Justin Yang et.al. | 2404.07507 | null |
2024-04-11 | Interactive Prompt Debugging with Sequence Salience | Ian Tenney et.al. | 2404.07498 | null |
2024-04-11 | Privacy preserving layer partitioning for Deep Neural Network models | Kishore Rajasekar et.al. | 2404.07437 | null |
2024-04-11 | CopilotCAD: Empowering Radiologists with Report Completion Models and Quantitative Evidence from Medical Image Foundation Models | Sheng Wang et.al. | 2404.07424 | null |
2024-04-11 | Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling | Sourajit Saha et.al. | 2404.07410 | null |
2024-04-10 | Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations | Ofir Shifman et.al. | 2404.07153 | null |
2024-04-10 | Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization | Michael Kohler et.al. | 2404.07128 | null |
2024-04-10 | Accelerating Cardiac MRI Reconstruction with CMRatt: An Attention-Driven Approach | Anam Hashmi et.al. | 2404.06941 | null |
2024-04-10 | Multi-Label Continual Learning for the Medical Domain: A Novel Benchmark | Marina Ceccon et.al. | 2404.06859 | null |
2024-04-10 | Neural Optimizer Equation, Decay Function, and Learning Rate Schedule Joint Evolution | Brandon Morgan et.al. | 2404.06679 | null |
2024-04-09 | Variational Stochastic Gradient Descent for Deep Neural Networks | Haotian Chen et.al. | 2404.06549 | link |
2024-04-09 | On adversarial training and the 1 Nearest Neighbor classifier | Amir Hagai et.al. | 2404.06313 | link |
2024-04-09 | Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models | David Kurzendörfer et.al. | 2404.06309 | link |
2024-04-09 | Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training | Ming-Kun Xie et.al. | 2404.06287 | null |
2024-04-09 | Quantum Circuit |
Yuka Hashimoto et.al. | 2404.06218 | null |
2024-04-09 | VI-OOD: A Unified Representation Learning Framework for Textual Out-of-distribution Detection | Li-Ming Zhan et.al. | 2404.06217 | link |
2024-04-09 | Symmetry-guided gradient descent for quantum neural networks | Kaiming Bian et.al. | 2404.06108 | null |
2024-04-10 | Using Few-Shot Learning to Classify Primary Lung Cancer and Other Malignancy with Lung Metastasis in Cytological Imaging via Endobronchial Ultrasound Procedures | Ching-Kai Lin et.al. | 2404.06080 | null |
2024-04-08 | Neural Cellular Automata for Lightweight, Robust and Explainable Classification of White Blood Cell Images | Michael Deutges et.al. | 2404.05584 | null |
2024-04-08 | On the Convergence of Continual Learning with Adaptive Methods | Seungyub Han et.al. | 2404.05555 | null |
2024-04-08 | Multi-Task Learning for Features Extraction in Financial Annual Reports | Syrielle Montariol et.al. | 2404.05281 | link |
2024-04-08 | Allowing humans to interactively guide machines where to look does not always improve a human-AI team's classification accuracy | Giang Nguyen et.al. | 2404.05238 | null |
2024-04-08 | iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection | Nan Zhou et.al. | 2404.05207 | null |
2024-04-08 | Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods | Roopkatha Dey et.al. | 2404.05159 | null |
2024-04-07 | PairAug: What Can Augmented Image-Text Pairs Do for Radiology? | Yutong Xie et.al. | 2404.04960 | link |
2024-04-07 | GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasets | Dongjing Shan et.al. | 2404.04924 | null |
2024-04-06 | Focused Active Learning for Histopathological Image Classification | Arne Schmidt et.al. | 2404.04663 | null |
2024-04-06 | Trustless Audits without Revealing Data or Models | Suppakit Waiwitlikhit et.al. | 2404.04500 | null |
2024-04-05 | Evaluating Adversarial Robustness: A Comparison Of FGSM, Carlini-Wagner Attacks, And The Role of Distillation as Defense Mechanism | Trilokesh Ranjan Sarkar et.al. | 2404.04245 | null |
2024-04-05 | Noisy Label Processing for Classification: A Survey | Mengting Li et.al. | 2404.04159 | null |
2024-04-05 | Learning Correlation Structures for Vision Transformers | Manjin Kim et.al. | 2404.03924 | null |
2024-04-05 | LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification | Judy X Yang et.al. | 2404.03883 | null |
2024-04-04 | Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning | Spyridon Chavlis et.al. | 2404.03708 | null |
2024-04-05 | A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data | Iqra Bano et.al. | 2404.03493 | null |
2024-04-04 | Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks | Lei Zhang et.al. | 2404.03340 | null |
2024-04-04 | Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning | Andrei Semenov et.al. | 2404.03323 | link |
2024-04-04 | FACTUAL: A Novel Framework for Contrastive Learning Based Robust SAR Image Classification | Xu Wang et.al. | 2404.03225 | null |
2024-04-03 | Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales | Lucas E. Resck et.al. | 2404.03098 | link |
2024-04-03 | Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds | Kamalika Chaudhuri et.al. | 2404.02866 | link |
2024-04-03 | FPT: Feature Prompt Tuning for Few-shot Readability Assessment | Ziyang Wang et.al. | 2404.02772 | link |
2024-04-03 | Adversarial Attacks and Dimensionality in Text Classifiers | Nandish Chattopadhyay et.al. | 2404.02660 | null |
2024-04-04 | Non-negative Subspace Feature Representation for Few-shot Learning in Medical Imaging | Keqiang Fan et.al. | 2404.02656 | null |
2024-04-03 | Adaptive Cross-lingual Text Classification through In-Context One-Shot Demonstrations | Emilio Villa-Cueva et.al. | 2404.02452 | link |
2024-04-03 | A Novel Approach to Breast Cancer Histopathological Image Classification Using Cross-Colour Space Feature Fusion and Quantum-Classical Stack Ensemble Method | Sambit Mallick et.al. | 2404.02447 | null |
2024-04-03 | Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data | Parth Patwa et.al. | 2404.02422 | null |
2024-04-02 | Smooth Deep Saliency | Rudolf Herdt et.al. | 2404.02282 | null |
2024-04-02 | Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models | Matthew Kowal et.al. | 2404.02233 | null |
2024-04-02 | ImageNot: A contrast with ImageNet preserves model rankings | Olawale Salaudeen et.al. | 2404.02112 | null |
2024-04-02 | Explainability in JupyterLab and Beyond: Interactive XAI Systems for Integrated and Collaborative Workflows | Grace Guo et.al. | 2404.02081 | null |
2024-04-02 | Ukrainian Texts Classification: Exploration of Cross-lingual Knowledge Transfer Approaches | Daryna Dementieva et.al. | 2404.02043 | null |
2024-04-02 | CAM-Based Methods Can See through Walls | Magamed Taimeskhanov et.al. | 2404.01964 | link |
2024-04-02 | Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Jaeha Kim et.al. | 2404.01692 | null |
2024-04-02 | A Universal Knowledge Embedded Contrastive Learning Framework for Hyperspectral Image Classification | Quanwei Liu et.al. | 2404.01673 | null |
2024-04-01 | Can Biases in ImageNet Models Explain Generalization? | Paul Gavrikov et.al. | 2404.01509 | link |
2024-04-01 | Parallel Proportional Fusion of Spiking Quantum Neural Network for Optimizing Image Classification | Zuyu Xu et.al. | 2404.01359 | null |
2024-04-01 | Bridging Remote Sensors with Multisensor Geospatial Foundation Models | Boran Han et.al. | 2404.01260 | link |
2024-04-01 | Diagnosis of Skin Cancer Using VGG16 and VGG19 Based Transfer Learning Models | Amir Faghihi et.al. | 2404.01160 | null |
2024-03-29 | Learn "No" to Say "Yes" Better: Improving Vision-Language Models via Negations | Jaisidh Singh et.al. | 2403.20312 | link |
2024-03-29 | MCNet: A crowd denstity estimation network based on integrating multiscale attention module | Qiang Guo et.al. | 2403.20173 | null |
2024-03-29 | Segmentation, Classification and Interpretation of Breast Cancer Medical Images using Human-in-the-Loop Machine Learning | David Vázquez-Lema et.al. | 2403.20112 | null |
2024-03-29 | Adverb Is the Key: Simple Text Data Augmentation with Adverb Deletion | Juhwan Choi et.al. | 2403.20015 | null |
2024-03-29 | Diverse Feature Learning by Self-distillation and Reset | Sejik Park et.al. | 2403.19941 | null |
2024-03-29 | Heterogeneous Network Based Contrastive Learning Method for PolSAR Land Cover Classification | Jianfeng Cai et.al. | 2403.19902 | link |
2024-03-28 | X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization | Anna Kukleva et.al. | 2403.19811 | link |
2024-03-28 | RSMamba: Remote Sensing Image Classification with State Space Model | Keyan Chen et.al. | 2403.19654 | link |
2024-03-28 | Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model | Zhicai Wang et.al. | 2403.19600 | link |
2024-03-28 | The Bad Batches: Enhancing Self-Supervised Learning in Image Classification Through Representative Batch Curation | Ozgu Goksu et.al. | 2403.19579 | null |
2024-03-28 | Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach | Wei Dong et.al. | 2403.19067 | link |
2024-03-27 | Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data | Yuting Guo et.al. | 2403.19031 | null |
2024-03-27 | Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning | Soumyendu Sarkar et.al. | 2403.18985 | null |
2024-03-27 | The Impact of Uniform Inputs on Activation Sparsity and Energy-Latency Attacks in Computer Vision | Andreas Müller et.al. | 2403.18587 | link |
2024-03-27 | Uncertainty-Aware SAR ATR: Defending Against Adversarial Attacks via Bayesian Neural Networks | Tian Ye et.al. | 2403.18318 | null |
2024-03-27 | Multi-scale Unified Network for Image Classification | Wenzhuo Liu et.al. | 2403.18294 | null |
2024-03-26 | The Need for Speed: Pruning Transformers with One Recipe | Samir Khaki et.al. | 2403.17921 | link |
2024-03-26 | Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation | Carlos Gomes et.al. | 2403.17886 | null |
2024-03-26 | PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition | Chenhongyi Yang et.al. | 2403.17695 | link |
2024-03-26 | Language Models for Text Classification: Is In-Context Learning Enough? | Aleksandra Edwards et.al. | 2403.17661 | null |
2024-03-26 | Boosting Few-Shot Learning with Disentangled Self-Supervised Learning and Meta-Learning for Medical Image Classification | Eva Pachetti et.al. | 2403.17530 | null |
2024-03-26 | HILL: Hierarchy-aware Information Lossless Contrastive Learning for Hierarchical Text Classification | He Zhu et.al. | 2403.17307 | link |
2024-03-25 | Histogram Layers for Neural Engineered Features | Joshua Peeples et.al. | 2403.17176 | link |
2024-03-25 | Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships | Rangel Daroya et.al. | 2403.17173 | link |
2024-03-25 | CipherFormer: Efficient Transformer Private Inference with Low Round Complexity | Weize Wang et.al. | 2403.16860 | null |
2024-03-25 | Assessing the Performance of Deep Learning for Automated Gleason Grading in Prostate Cancer | Dominik Müller et.al. | 2403.16695 | null |
2024-03-25 | DeepGleason: a System for Automated Gleason Grading of Prostate Cancer using Deep Neural Networks | Dominik Müller et.al. | 2403.16678 | link |
2024-03-25 | LARA: Linguistic-Adaptive Retrieval-Augmented LLMs for Multi-Turn Intent Classification | Liu Junhua et.al. | 2403.16504 | null |
2024-03-24 | On machine learning analysis of atomic force microscopy images for image classification, sample surface recognition | Igor Sokolov et.al. | 2403.16230 | null |
2024-03-24 | Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis | Shaojie Li et.al. | 2403.16212 | null |
2024-03-24 | Multi-Task Learning with Multi-Task Optimization | Lu Bai et.al. | 2403.16162 | null |
2024-03-24 | CBGT-Net: A Neuromimetic Architecture for Robust Classification of Streaming Data | Shreya Sharma et.al. | 2403.15974 | link |
2024-03-23 | A Deep Learning Architectures for Kidney Disease Classification | Muhammad Shoaib Farooq et.al. | 2403.15895 | null |
2024-03-23 | VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding | Phong Nguyen-Thuan Do et.al. | 2403.15882 | null |
2024-03-23 | VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification | Lanfeng Zhong et.al. | 2403.15836 | null |
2024-03-22 | Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion | Sofia Casarin et.al. | 2403.15194 | null |
2024-03-22 | Image Classification with Rotation-Invariant Variational Quantum Circuits | Paul San Sebastian et.al. | 2403.15031 | null |
2024-03-22 | Extracting Human Attention through Crowdsourced Patch Labeling | Minsuk Chang et.al. | 2403.15013 | null |
2024-03-22 | Clean-image Backdoor Attacks | Dazhong Rong et.al. | 2403.15010 | null |
2024-03-22 | ParFormer: Vision Transformer Baseline with Parallel Local Global Token Mixer and Convolution Attention Patch Embedding | Novendra Setyawan et.al. | 2403.15004 | null |
2024-03-22 | MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection | Sadiya Sayara Chowdhury Puspo et.al. | 2403.14989 | null |
2024-03-21 | Learning with SASQuaTCh: a Novel Variational Quantum Transformer Architecture with Kernel-Based Self-Attention | Ethan N. Evans et.al. | 2403.14753 | null |
2024-03-21 | Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images | Tom Burgert et.al. | 2403.14547 | null |
2024-03-21 | Multi-Level Explanations for Generative Language Models | Lucas Monteiro Paes et.al. | 2403.14459 | null |
2024-03-21 | Tensor network compressibility of convolutional models | Sukhbinder Singh et.al. | 2403.14379 | null |
2024-03-21 | LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding | Masato Fujitake et.al. | 2403.14252 | null |
2024-03-21 | Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations | Xun Lin et.al. | 2403.14250 | null |
2024-03-21 | Improving Image Classification Accuracy through Complementary Intra-Class and Inter-Class Mixup | Ye Xu et.al. | 2403.14137 | link |
2024-03-20 | Bridge the Modality and Capacity Gaps in Vision-Language Model Selection | Chao Yi et.al. | 2403.13797 | null |
2024-03-20 | Leveraging feature communication in federated learning for remote sensing image classification | Anh-Kiet Duong et.al. | 2403.13575 | null |
2024-03-20 | MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Di Wang et.al. | 2403.13430 | link |
2024-03-20 | Building Optimal Neural Architectures using Interpretable Knowledge | Keith G. Mills et.al. | 2403.13293 | link |
2024-03-19 | LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images | Jing Zhang et.al. | 2403.13171 | null |
2024-03-19 | Improved EATFormer: A Vision Transformer for Medical Image Classification | Yulong Shisu et.al. | 2403.13167 | null |
2024-03-19 | SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification | Yuexi Du et.al. | 2403.13148 | link |
2024-03-19 | Using evolutionary computation to optimize task performance of unclocked, recurrent Boolean circuits in FPGAs | Raphael Norman-Tenazas et.al. | 2403.13105 | null |
2024-03-19 | Investigating Text Shortening Strategy in BERT: Truncation vs Summarization | Mirza Alim Mutasodirin et.al. | 2403.12799 | link |
2024-03-18 | Posterior Uncertainty Quantification in Neural Networks using Data Augmentation | Luhuan Wu et.al. | 2403.12729 | null |
2024-03-19 | SEVEN: Pruning Transformer Model by Reserving Sentinels | Jinying Xiao et.al. | 2403.12688 | link |
2024-03-19 | Simple Hack for Transformers against Heavy Long-Text Classification on a Time- and Memory-Limited GPU Service | Mirza Alim Mutasodirin et.al. | 2403.12563 | null |
2024-03-19 | Prompt-Guided Adaptive Model Transformation for Whole Slide Image Classification | Yi Lin et.al. | 2403.12537 | null |
2024-03-19 | CrossTune: Black-Box Few-Shot Classification with Label Enhancement | Danqing Luo et.al. | 2403.12468 | null |
2024-03-18 | Generalizing deep learning models for medical image classification | Matta Sarah et.al. | 2403.12167 | null |
2024-03-19 | Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks | K. P. Santoso et.al. | 2403.12009 | null |
2024-03-18 | High-energy physics image classification: A Survey of Jet Applications | Hamza Kheddar et.al. | 2403.11934 | null |
2024-03-18 | Better (pseudo-)labels for semi-supervised instance segmentation | François Porcher et.al. | 2403.11675 | null |
2024-03-18 | Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao et.al. | 2403.11530 | link |
2024-03-18 | Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting | Mingkui Tan et.al. | 2403.11491 | null |
2024-03-17 | Potential of Domain Adaptation in Machine Learning in Ecology and Hydrology to Improve Model Extrapolability | Haiyang Shi et.al. | 2403.11331 | null |
2024-03-17 | A Modified Word Saliency-Based Adversarial Attack on Text Classification Models | Hetvi Waghela et.al. | 2403.11297 | null |
2024-03-17 | Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation | Silvia Corbara et.al. | 2403.11265 | null |
2024-03-17 | Multiple Teachers-Meticulous Student: A Domain Adaptive Meta-Knowledge Distillation Model for Medical Image Classification | Shahabedin Nabavi et.al. | 2403.11226 | null |
2024-03-16 | Forward Learning of Graph Neural Networks | Namyong Park et.al. | 2403.11004 | null |
2024-03-16 | Understanding Robustness of Visual State Space Models for Image Classification | Chengbin Du et.al. | 2403.10935 | null |
2024-03-16 | Automatic location detection based on deep learning | Anjali Karangiya et.al. | 2403.10912 | null |
2024-03-14 | Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models | Akhil Kedia et.al. | 2403.09635 | link |
2024-03-14 | XCoOp: Explainable Prompt Learning for Computer-Aided Diagnosis via Concept-guided Context Optimization | Yequan Bie et.al. | 2403.09410 | null |
2024-03-14 | ConDiSR: Contrastive Disentanglement and Style Regularization for Single Domain Generalization | Aleksandr Matsun et.al. | 2403.09400 | null |
2024-03-14 | A Hierarchical Fused Quantum Fuzzy Neural Network for Image Classification | Sheng-Yao Wu et.al. | 2403.09318 | null |
2024-03-14 | CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification | Yiming Ma et.al. | 2403.09281 | null |
2024-03-14 | Are Vision Language Models Texture or Shape Biased and Can We Steer Them? | Paul Gavrikov et.al. | 2403.09193 | null |
2024-03-14 | Randomized Principal Component Analysis for Hyperspectral Image Classification | Mustafa Ustuner et.al. | 2403.09117 | null |
2024-03-14 | CardioCaps: Attention-based Capsule Network for Class-Imbalanced Echocardiogram Classification | Hyunkyung Han et.al. | 2403.09108 | link |
2024-03-14 | The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? | Qinyu Zhao et.al. | 2403.09037 | link |
2024-03-13 | PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning | Qifeng Zhou et.al. | 2403.08967 | null |
2024-03-13 | DAM: Dynamic Adapter Merging for Continual Video QA Learning | Feng Cheng et.al. | 2403.08755 | link |
2024-03-13 | Leveraging Compressed Frame Sizes For Ultra-Fast Video Classification | Yuxing Han et.al. | 2403.08580 | null |
2024-03-13 | HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image Classifiers | Francesco Dibitonto et.al. | 2403.08536 | link |
2024-03-13 | Pig aggression classification using CNN, Transformers and Recurrent Networks | Junior Silva Souza et.al. | 2403.08528 | null |
2024-03-13 | Reduced Jeffries-Matusita distance: A Novel Loss Function to Improve Generalization Performance of Deep Classification Models | Mohammad Lashkari et.al. | 2403.08408 | null |
2024-03-13 | Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification | Shuhan Li et.al. | 2403.08407 | null |
2024-03-13 | Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks | Khondoker Murad Hossain et.al. | 2403.08208 | null |
2024-03-13 | Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks | Fuzhi Wu et.al. | 2403.08157 | link |
2024-03-12 | Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection | Tharindu Kumarage et.al. | 2403.08035 | null |
2024-03-13 | Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion | Dongyang Li et.al. | 2403.07721 | link |
2024-03-12 | FPT: Fine-grained Prompt Tuning for Parameter and Memory Efficient Fine Tuning in High-resolution Medical Image Classification | Yijin Huang et.al. | 2403.07576 | null |
2024-03-12 | Backdoor Attack with Mode Mixture Latent Modification | Hongwei Zhang et.al. | 2403.07463 | null |
2024-03-12 | In-context learning enables multimodal large language models to classify cancer pathology images | Dyke Ferber et.al. | 2403.07407 | null |
2024-03-12 | Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning | Mark D. McDonnell et.al. | 2403.07356 | null |
2024-03-12 | How does promoting the minority fraction affect generalization? A theoretical study of the one-hidden-layer neural network on group imbalance | Hongkang Li et.al. | 2403.07310 | null |
2024-03-12 | A Bayesian Approach to OOD Robustness in Image Classification | Prakhar Kaushik et.al. | 2403.07277 | null |
2024-03-11 | LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations | Mohammad Alkhalefi et.al. | 2403.06813 | null |
2024-03-11 | Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification | Shuai Li et.al. | 2403.06798 | null |
2024-03-11 | Leveraging Internal Representations of Model for Magnetic Image Classification | Adarsh N L et.al. | 2403.06797 | null |
2024-03-11 | Shortcut Learning in Medical Image Segmentation | Manxi Lin et.al. | 2403.06748 | null |
2024-03-11 | Active Generation for Image Classification | Tao Huang et.al. | 2403.06517 | null |
2024-03-11 | Evolving Knowledge Distillation with Large Language Models and Active Learning | Chengyuan Liu et.al. | 2403.06414 | null |
2024-03-11 | 'One size doesn't fit all': Learning how many Examples to use for In-Context Learning for Improved Text Classification | Manish Chandra et.al. | 2403.06402 | null |
2024-03-10 | Probing Image Compression For Class-Incremental Learning | Justin Yang et.al. | 2403.06288 | null |
2024-03-10 | Bayesian Random Semantic Data Augmentation for Medical Image Classification | Yaoyao Zhu et.al. | 2403.06138 | link |
2024-03-10 | Universal Debiased Editing for Fair Medical Image Classification | Ruinan Jin et.al. | 2403.06104 | null |
2024-03-08 | Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets | Lorenzo Brigato et.al. | 2403.05532 | null |
2024-03-08 | Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation | Yu Han et.al. | 2403.05388 | null |
2024-03-08 | The Impact of Quantization on the Robustness of Transformer-based Text Classifiers | Seyed Parsa Neshaei et.al. | 2403.05365 | null |
2024-03-08 | Multiple Instance Learning with random sampling for Whole Slide Image Classification | H. Keshvarikhojasteh et.al. | 2403.05351 | null |
2024-03-08 | Learning Expressive And Generalizable Motion Features For Face Forgery Detection | Jingyi Zhang et.al. | 2403.05172 | null |
2024-03-08 | Defending Against Unforeseen Failure Modes with Latent Adversarial Training | Stephen Casper et.al. | 2403.05030 | link |
2024-03-07 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
2024-03-07 | T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers | Mariano V. Ntrougkas et.al. | 2403.04523 | null |
2024-03-07 | Source Matters: Source Dataset Impact on Model Robustness in Medical Imaging | Dovile Juodelyte et.al. | 2403.04484 | link |
2024-03-07 | Advancing Biomedical Text Mining with Community Challenges | Hui Zong et.al. | 2403.04261 | null |
2024-03-07 | Scalable On-Chip Optical Linear Processing Unit Using a Single Thin-Film Lithium Niobate Ring Modulator | Zhaoang Deng et.al. | 2403.04216 | null |
2024-03-07 | Scalable and Robust Transformer Decoders for Interpretable Image Classification with Foundation Models | Evelyn Mannix et.al. | 2403.04125 | null |
2024-03-07 | Privacy-preserving Fine-tuning of Large Language Models through Flatness | Tiejin Chen et.al. | 2403.04124 | null |
2024-03-06 | MedMamba: Vision Mamba for Medical Image Classification | Yubiao Yue et.al. | 2403.03849 | link |
2024-03-06 | On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder | Tingxu Han et.al. | 2403.03846 | link |
2024-03-06 | RADIA -- Radio Advertisement Detection with Intelligent Analytics | Jorge Álvarez et.al. | 2403.03538 | null |
2024-03-06 | Inverse-Free Fast Natural Gradient Descent Method for Deep Learning | Xinwei Ou et.al. | 2403.03473 | null |
2024-03-06 | Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for Pruning Recurrent SNN | Biswadeep Chakraborty et.al. | 2403.03409 | null |
2024-03-05 | RulePrompt: Weakly Supervised Text Classification with Prompting PLMs and Self-Iterative Logical Rules | Miaomiao Li et.al. | 2403.02932 | link |
2024-03-05 | Demonstrating Mutual Reinforcement Effect through Information Flow | Chengguang Gan et.al. | 2403.02902 | null |
2024-03-05 | Quantum Mixed-State Self-Attention Network | Fu Chen et.al. | 2403.02871 | null |
2024-03-05 | SOFIM: Stochastic Optimization Using Regularized Fisher Information Matrix | Gayathri C et.al. | 2403.02833 | null |
2024-03-05 | SGD with Partial Hessian for Deep Neural Networks Optimization | Ying Sun et.al. | 2403.02681 | link |
2024-03-05 | G-EvoNAS: Evolutionary Neural Architecture Search Based on Network Growth | Juan Zou et.al. | 2403.02667 | null |
2024-03-05 | Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad | Sayantan Choudhury et.al. | 2403.02648 | link |
2024-03-05 | Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use | Imad Eddine Toubal et.al. | 2403.02626 | null |
2024-03-04 | When do Convolutional Neural Networks Stop Learning? | Sahan Ahmad et.al. | 2403.02473 | link |
2024-03-04 | NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function | Abdullah Nazhat Abdullah et.al. | 2403.02411 | link |
2024-03-02 | Can a Confident Prior Replace a Cold Posterior? | Martin Marek et.al. | 2403.01272 | link |
2024-03-02 | Leveraging Self-Supervised Learning for Scene Recognition in Child Sexual Abuse Imagery | Pedro H. V. Valois et.al. | 2403.01183 | null |
2024-03-02 | Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation | Lian Xu et.al. | 2403.01156 | null |
2024-03-02 | ELA: Efficient Local Attention for Deep Convolutional Neural Networks | Wei Xu et.al. | 2403.01123 | null |
2024-03-01 | Margin Discrepancy-based Adversarial Training for Multi-Domain Text Classification | Yuan Wu et.al. | 2403.00888 | null |
2024-03-01 | Text classification of column headers with a controlled vocabulary: leveraging LLMs for metadata enrichment | Margherita Martorana et.al. | 2403.00884 | null |
2024-03-01 | SURE: SUrvey REcipes for building reliable and robust deep networks | Yuting Li et.al. | 2403.00543 | link |
2024-03-01 | Invariant Test-Time Adaptation for Vision-Language Model Generalization | Huan Ma et.al. | 2403.00376 | null |
2024-02-29 | TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision | Yunyi Zhang et.al. | 2403.00165 | null |
2024-02-29 | Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance | Huakun Shen et.al. | 2402.19401 | null |
2024-02-29 | Stitching Gaps: Fusing Situated Perceptual Knowledge with Vision Transformers for High-Level Image Classification | Delfina Sol Martinez Pandiani et.al. | 2402.19339 | null |
2024-02-29 | Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction | Hao Li et.al. | 2402.19326 | null |
2024-02-29 | Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation | Fahimeh Hosseini Noohdani et.al. | 2402.18919 | null |
2024-02-29 | Utilizing Local Hierarchy with Adversarial Training for Hierarchical Text Classification | Zihan Wang et.al. | 2402.18825 | link |
2024-02-28 | Comparing Importance Sampling Based Methods for Mitigating the Effect of Class Imbalance | Indu Panigrahi et.al. | 2402.18742 | link |
2024-02-28 | Deep Neural Network Models Trained With A Fixed Random Classifier Transfer Better Across Domains | Hafiz Tiomoko Ali et.al. | 2402.18614 | null |
2024-02-28 | Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Mahdi Karami et.al. | 2402.18508 | null |
2024-02-28 | Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization | Deng Li et.al. | 2402.18447 | null |
2024-02-29 | A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation | Francesco Barbato et.al. | 2402.18402 | null |
2024-02-28 | A Multimodal Handover Failure Detection Dataset and Baselines | Santosh Thoduka et.al. | 2402.18319 | null |
2024-02-28 | Classes Are Not Equal: An Empirical Study on Image Recognition Fairness | Jiequan Cui et.al. | 2402.18133 | null |
2024-02-27 | Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers | Yiwei Lu et.al. | 2402.17710 | null |
2024-02-27 | SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification | Mohammed Q. Alkhatib et.al. | 2402.17672 | link |
2024-02-27 | Predict the Next Word: | Evgenia Ilia et.al. | 2402.17527 | null |
2024-02-27 | Scaling Supervised Local Learning with Augmented Auxiliary Networks | Chenxiang Ma et.al. | 2402.17318 | link |
2024-02-26 | Offline Writer Identification Using Convolutional Neural Network Activation Features | Vincent Christlein et.al. | 2402.17029 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-07-23 | Perspective-Invariant 3D Object Detection | Ao Liang et.al. | 2507.17665 | null |
2025-07-23 | Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning | Xinyao Liu et.al. | 2507.17539 | null |
2025-07-23 | Illicit object detection in X-ray imaging using deep learning techniques: A comparative evaluation | Jorgen Cani et.al. | 2507.17508 | null |
2025-07-23 | Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection | Yehao Lu et.al. | 2507.17436 | null |
2025-07-23 | SFUOD: Source-Free Unknown Object Detection | Keon-Hee Park et.al. | 2507.17373 | null |
2025-07-23 | Optimizing Delivery Logistics: Enhancing Speed and Safety with Drone Technology | Maharshi Shastri et.al. | 2507.17253 | null |
2025-07-23 | A Low-Cost Machine Learning Approach for Timber Diameter Estimation | Fatemeh Hasanzadeh Fard et.al. | 2507.17219 | null |
2025-07-22 | Few-Shot Learning in Video and 3D Object Detection: A Survey | Md Meftahul Ferdaus et.al. | 2507.17079 | null |
2025-07-22 | Transformer Based Building Boundary Reconstruction using Attraction Field Maps | Muhammad Kamran et.al. | 2507.17038 | null |
2025-07-22 | Divisive Decisions: Improving Salience-Based Training for Generalization in Binary Classification Tasks | Jacob Piland et.al. | 2507.17000 | null |
2025-07-22 | Task-Specific Zero-shot Quantization-Aware Training for Object Detection | Changhao Li et.al. | 2507.16782 | null |
2025-07-22 | Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation | Viktor Muryn et.al. | 2507.16704 | null |
2025-07-22 | QRetinex-Net: Quaternion-Valued Retinex Decomposition for Low-Level Computer Vision Applications | Sos Agaian et.al. | 2507.16683 | null |
2025-07-22 | Benchmarking pig detection and tracking under diverse and challenging conditions | Jonathan Henrich et.al. | 2507.16639 | null |
2025-07-22 | A2Mamba: Attention-augmented State Space Models for Visual Recognition | Meng Lou et.al. | 2507.16624 | null |
2025-07-22 | PlantSAM: An Object Detection-Driven Segmentation Pipeline for Herbarium Specimens | Youcef Sklab et.al. | 2507.16506 | null |
2025-07-22 | Towards Railway Domain Adaptation for LiDAR-based 3D Detection: Road-to-Rail and Sim-to-Real via SynDRA-BBox | Xavier Diaz et.al. | 2507.16413 | null |
2025-07-22 | Scene Text Detection and Recognition "in light of" Challenging Environmental Conditions using Aria Glasses Egocentric Vision Cameras | Joseph De Mathia et.al. | 2507.16330 | null |
2025-07-22 | MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks | Junhao Su et.al. | 2507.16279 | null |
2025-07-22 | Edge-case Synthesis for Fisheye Object Detection: A Data-centric Perspective | Seunghyeon Kim et.al. | 2507.16254 | null |
2025-07-21 | Experimenting active and sequential learning in a medieval music manuscript | Sachin Sharma et.al. | 2507.15633 | null |
2025-07-21 | Few-Shot Object Detection via Spatial-Channel State Space Model | Zhimeng Xin et.al. | 2507.15308 | null |
2025-07-21 | Beyond Easy Wins: A Text Hardness-Aware Benchmark for LLM-generated Text Detection | Navid Ayoobi et.al. | 2507.15286 | null |
2025-07-20 | Event-based Graph Representation with Spatial and Motion Vectors for Asynchronous Object Detection | Aayush Atul Verma et.al. | 2507.15150 | null |
2025-07-20 | BleedOrigin: Dynamic Bleeding Source Localization in Endoscopic Submucosal Dissection via Dual-Stage Detection and Tracking | Mengya Xu et.al. | 2507.15094 | null |
2025-07-20 | InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis | Jiale Liu et.al. | 2507.14899 | null |
2025-07-20 | An Uncertainty-aware DETR Enhancement Framework for Object Detection | Xingshu Chen et.al. | 2507.14855 | null |
2025-07-20 | Seeing Through Deepfakes: A Human-Inspired Framework for Multi-Face Detection | Juan Hu et.al. | 2507.14807 | null |
2025-07-19 | GCC-Spam: Spam Detection via GAN, Contrastive Learning, and Character Similarity Networks | Zixin Xu et.al. | 2507.14679 | null |
2025-07-19 | Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection | Jifeng Shen et.al. | 2507.14643 | null |
2025-07-18 | C-DOG: Training-Free Multi-View Multi-Object Association in Dense Scenes Without Visual Feature via Connected δ-Overlap Graphs | Yung-Hong Sun et.al. | 2507.14095 | null |
2025-07-18 | Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection | Yujian Mo et.al. | 2507.13899 | null |
2025-07-18 | Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation | Masahiro Ogawa et.al. | 2507.13628 | null |
2025-07-17 | NSF-DOE Vera C. Rubin Observatory Observations of Interstellar Comet 3I/ATLAS (C/2025 N1) | Colin Orion Chandler et.al. | 2507.13409 | null |
2025-07-17 | A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains | Antonio Finocchiaro et.al. | 2507.13326 | null |
2025-07-17 | RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images | Xiaozheng Jiang et.al. | 2507.13120 | null |
2025-07-17 | Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection | Riku Inoue et.al. | 2507.13085 | null |
2025-07-17 | Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis | Saswat Priyadarshi Nayak et.al. | 2507.13073 | null |
2025-07-17 | SOD-YOLO: Enhancing YOLO-Based Detection of Small Objects in UAV Imagery | Peijun Wang et.al. | 2507.12727 | null |
2025-07-16 | Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios | Van-Hoang-Anh Phan et.al. | 2507.12449 | null |
2025-07-16 | InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization | Haoyuan Liu et.al. | 2507.12420 | null |
2025-07-16 | AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models | Santosh Vasa et.al. | 2507.12414 | null |
2025-07-16 | OD-VIRAT: A Large-Scale Benchmark for Object Detection in Realistic Surveillance Environments | Hayat Ullah et.al. | 2507.12396 | null |
2025-07-16 | Improving Lightweight Weed Detection via Knowledge Distillation | Ahmet Oğuz Saltık et.al. | 2507.12344 | null |
2025-07-16 | SS-DC: Spatial-Spectral Decoupling and Coupling Across Visible-Infrared Gap for Domain Adaptive Object Detection | Xiwei Zhang et.al. | 2507.12017 | null |
2025-07-16 | Frequency-Dynamic Attention Modulation for Dense Prediction | Linwei Chen et.al. | 2507.12006 | null |
2025-07-15 | Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping | Yujie Zhang et.al. | 2507.11279 | null |
2025-07-15 | Using Continual Learning for Real-Time Detection of Vulnerable Road Users in Complex Traffic Scenarios | Faryal Aurooj Nasir et.al. | 2507.11046 | null |
2025-07-15 | Combining Transformers and CNNs for Efficient Object Detection in High-Resolution Satellite Imagery | Nicolas Drapier et.al. | 2507.11040 | null |
2025-07-14 | A Lightweight and Robust Framework for Real-Time Colorectal Polyp Detection Using LOF-Based Preprocessing and YOLO-v11n | Saadat Behzadi et.al. | 2507.10864 | null |
2025-07-14 | LLM-Guided Agentic Object Detection for Open-World Understanding | Furkan Mumcu et.al. | 2507.10844 | null |
2025-07-14 | Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection | Huiyi Wang et.al. | 2507.10814 | null |
2025-07-14 | Fine-Grained Zero-Shot Object Detection | Hongxu Ma et.al. | 2507.10358 | null |
2025-07-14 | BlueGlass: A Framework for Composite AI Safety | Harshal Nandigramwar et.al. | 2507.10106 | null |
2025-07-14 | SRG/ART-XC All-Sky X-ray Survey: Sensitivity Assessment Based on Aperture Photometry | N. Y. Tyrin et.al. | 2507.10060 | null |
2025-07-14 | 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving | Yixun Zhang et.al. | 2507.09993 | null |
2025-07-14 | Measuring the Impact of Rotation Equivariance on Aerial Object Detection | Xiuyu Wu et.al. | 2507.09896 | null |
2025-07-14 | Secure and Efficient UAV-Based Face Detection via Homomorphic Encryption and Edge Computing | Nguyen Van Duc et.al. | 2507.09860 | null |
2025-07-13 | MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression | Ofir Gordon et.al. | 2507.09616 | null |
2025-07-12 | Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline | Shiyi Mu et.al. | 2507.09214 | null |
2025-07-12 | On the Fragility of Multimodal Perception to Temporal Misalignment in Autonomous Driving | Md Hasan Shahriar et.al. | 2507.09095 | null |
2025-07-11 | VISTA: A Visual Analytics Framework to Enhance Foundation Model-Generated Data Labels | Xiwei Xuan et.al. | 2507.09008 | null |
2025-07-11 | RoundaboutHD: High-Resolution Real-World Urban Environment Benchmark for Multi-Camera Vehicle Tracking | Yuqiang Lin et.al. | 2507.08729 | null |
2025-07-11 | DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images | Haoran Sun et.al. | 2507.08648 | null |
2025-07-11 | OnlineBEV: Recurrent Temporal Fusion in Bird's Eye View Representations for Multi-Camera 3D Perception | Junho Koh et.al. | 2507.08644 | null |
2025-07-11 | Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset | Mathias Zinnen et.al. | 2507.08384 | null |
2025-07-11 | Spectroscopic Observations of Four Candidates for Blue Large-Amplitude Pulsators. No BLAPs at High Galactic Latitudes | P. Pietrukowicz et.al. | 2507.08372 | null |
2025-07-11 | Understanding Driving Risks using Large Language Models: Toward Elderly Driver Assessment | Yuki Yoshihara et.al. | 2507.08367 | null |
2025-07-10 | An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision | Jareen Anjom et.al. | 2507.08165 | null |
2025-07-10 | Rainbow Artifacts from Electromagnetic Signal Injection Attacks on Image Sensors | Youqian Zhang et.al. | 2507.07773 | null |
2025-07-09 | Automated Video Segmentation Machine Learning Pipeline | Johannes Merz et.al. | 2507.07242 | null |
2025-07-09 | Aerial Maritime Vessel Detection and Identification | Antonella Barisic Kulas et.al. | 2507.07153 | null |
2025-07-09 | DenoiseCP-Net: Efficient Collective Perception in Adverse Weather via Joint LiDAR-Based 3D Object Detection and Denoising | Sven Teufel et.al. | 2507.06976 | null |
2025-07-09 | A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level | Johanna Orsholm et.al. | 2507.06972 | null |
2025-07-09 | Dataset and Benchmark for Enhancing Critical Retained Foreign Object Detection | Yuli Wang et.al. | 2507.06937 | null |
2025-07-09 | Unlocking Thermal Aerial Imaging: Synthetic Enhancement of UAV Datasets | Antonella Barisic Kulas et.al. | 2507.06797 | null |
2025-07-09 | LOVON: Legged Open-Vocabulary Object Navigator | Daojie Peng et.al. | 2507.06747 | null |
2025-07-09 | EA: An Event Autoencoder for High-Speed Vision Sensing | Riadul Islam et.al. | 2507.06459 | null |
2025-07-08 | Hierarchical Multi-Stage Transformer Architecture for Context-Aware Temporal Action Localization | Hayat Ullah et.al. | 2507.06411 | null |
2025-07-08 | ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge | Daghash K. Alqahtani et.al. | 2507.06011 | null |
2025-07-08 | R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding | Joonhyung Park et.al. | 2507.05673 | null |
2025-07-07 | YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries | Aquino Joctum et.al. | 2507.05376 | null |
2025-07-07 | From a Different Star: 3I/ATLAS in the context of the Ōtautahi-Oxford interstellar object population model | Matthew J. Hopkins et.al. | 2507.05318 | null |
2025-07-07 | Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Xiang Xu et.al. | 2507.05260 | null |
2025-07-07 | AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models | Chinnappa Guggilla et.al. | 2507.05157 | null |
2025-07-07 | LERa: Replanning with Visual Feedback in Instruction Following | Svyatoslav Pchelintsev et.al. | 2507.05135 | null |
2025-07-07 | Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking | Maria Damanaki et.al. | 2507.04762 | null |
2025-07-07 | CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection | Hanzhi Zhong et.al. | 2507.04587 | null |
2025-07-06 | MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection | Hanshi Wang et.al. | 2507.04369 | null |
2025-07-06 | DMAT: An End-to-End Framework for Joint Atmospheric Turbulence Mitigation and Object Detection | Paul Hill et.al. | 2507.04323 | null |
2025-07-06 | ZERO: Multi-modal Prompt-based Visual Grounding | Sangbum Choi et.al. | 2507.04270 | null |
2025-07-05 | Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge | Linshen Liu et.al. | 2507.04123 | null |
2025-07-04 | Zero Memory Overhead Approach for Protecting Vision Transformer Parameters | Fereshteh Baradaran et.al. | 2507.03816 | null |
2025-07-03 | Partial Weakly-Supervised Oriented Object Detection | Mingxin Liu et.al. | 2507.02751 | null |
2025-07-03 | Automatic Labelling for Low-Light Pedestrian Detection | Dimitrios Bouzoulas et.al. | 2507.02513 | null |
2025-07-03 | Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection | Weiwei Duan et.al. | 2507.02454 | null |
2025-07-03 | A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion | Maryem Fadili et.al. | 2507.02430 | null |
2025-07-03 | PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection | Seokyeong Lee et.al. | 2507.02393 | null |
2025-07-03 | Two-Steps Neural Networks for an Automated Cerebrovascular Landmark Detection | Rafic Nader et.al. | 2507.02349 | null |
2025-07-03 | Perception Activator: An intuitive and portable framework for brain cognitive exploration | Le Xu et.al. | 2507.02311 | null |
2025-07-03 | Understanding Trade offs When Conditioning Synthetic Data | Brandon Trabucco et.al. | 2507.02217 | null |
2025-07-02 | How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks | Rahul Ramachandran et.al. | 2507.01955 | null |
2025-07-02 | Survivability of Backdoor Attacks on Unconstrained Face Recognition Systems | Quentin Le Roux et.al. | 2507.01607 | null |
2025-07-02 | Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation | Andrei Jelea et.al. | 2507.01347 | null |
2025-07-01 | Rapid Salient Object Detection with Difference Convolutional Neural Networks | Zhuo Su et.al. | 2507.01182 | null |
2025-07-01 | Robust Component Detection for Flexible Manufacturing: A Deep Learning Approach to Tray-Free Object Recognition under Variable Lighting | Fatemeh Sadat Daneshmand et.al. | 2507.00852 | null |
2025-07-01 | UAVD-Mamba: Deformable Token Fusion Vision Mamba for Multimodal UAV Detection | Wei Li et.al. | 2507.00849 | null |
2025-07-01 | High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery | Hongxing Peng et.al. | 2507.00825 | null |
2025-07-01 | Multi-Modal Graph Convolutional Network with Sinusoidal Encoding for Robust Human Action Segmentation | Hao Xing et.al. | 2507.00752 | null |
2025-07-01 | UPRE: Zero-Shot Domain Adaptation for Object Detection via Unified Prompt and Representation Enhancement | Xiao Zhang et.al. | 2507.00721 | null |
2025-07-01 | Rectifying Magnitude Neglect in Linear Attention | Qihang Fan et.al. | 2507.00698 | null |
2025-06-30 | Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios | Deng Li et.al. | 2506.24063 | null |
2025-06-30 | Visual Textualization for Image Prompted Object Detection | Yongjian Wu et.al. | 2506.23785 | null |
2025-06-30 | PBCAT: Patch-based composite adversarial training against physically realizable attacks on object detection | Xiao Li et.al. | 2506.23581 | null |
2025-06-30 | Event-based Tiny Object Detection: A Benchmark Dataset and Baseline | Nuo Chen et.al. | 2506.23575 | null |
2025-06-30 | OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving | Mingqian Ji et.al. | 2506.23565 | null |
2025-06-30 | From Sight to Insight: Unleashing Eye-Tracking in Weakly Supervised Video Salient Object Detection | Qi Qin et.al. | 2506.23519 | null |
2025-06-30 | Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed Augmentation | Tinh Nguyen et.al. | 2506.23505 | null |
2025-06-29 | Detecting What Matters: A Novel Approach for Out-of-Distribution 3D Object Detection in Autonomous Vehicles | Menna Taha et.al. | 2506.23426 | null |
2025-06-29 | Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement | Siyuan Chai et.al. | 2506.23353 | null |
2025-06-29 | GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields | Shunsuke Yasuki et.al. | 2506.23352 | null |
2025-06-27 | Attention-disentangled Uniform Orthogonal Feature Space Optimization for Few-shot Object Detection | Taijin Zhao et.al. | 2506.22161 | null |
2025-06-27 | Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration | Noora Sassali et.al. | 2506.22116 | null |
2025-06-27 | CERBERUS: Crack Evaluation & Recognition Benchmark for Engineering Reliability & Urban Stability | Justin Reinman et.al. | 2506.21909 | null |
2025-06-27 | Visual Content Detection in Educational Videos with Transfer Learning and Dataset Enrichment | Dipayan Biswas et.al. | 2506.21903 | null |
2025-06-27 | Embodied Domain Adaptation for Object Detection | Xiangyu Shi et.al. | 2506.21860 | null |
2025-06-26 | PhotonSplat: 3D Scene Reconstruction and Colorization from SPAD Sensors | Sai Sri Teja et.al. | 2506.21680 | null |
2025-06-26 | Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection | Tobias J. Riedlinger et.al. | 2506.21486 | null |
2025-06-26 | TITAN: Query-Token based Domain Adaptive Adversarial Learning | Tajamul Ashraf et.al. | 2506.21484 | null |
2025-06-26 | A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario | Cyrus Addy et.al. | 2506.21451 | null |
2025-06-26 | DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic | Munish Monga et.al. | 2506.21260 | null |
2025-06-26 | LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection | Lei Hao et.al. | 2506.21018 | null |
2025-06-26 | ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation | Shruti Bansal et.al. | 2506.20969 | null |
2025-06-25 | Lightweight Multi-Frame Integration for Robust YOLO Object Detection in Videos | Yitong Quan et.al. | 2506.20550 | null |
2025-06-25 | Learning-based safety lifting monitoring system for cranes on construction sites | Hao Chen et.al. | 2506.20475 | null |
2025-06-25 | Feature Hallucination for Self-supervised Action Recognition | Lei Wang et.al. | 2506.20342 | null |
2025-06-25 | From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents | Sergio Torres Aguilar et.al. | 2506.20326 | null |
2025-06-25 | TDiR: Transformer based Diffusion for Image Restoration Tasks | Abbas Anwar et.al. | 2506.20302 | null |
2025-06-25 | Integrated optomechanical ultrasonic sensors with nano-Pascal-level sensitivity | Xuening Cao et.al. | 2506.20219 | null |
2025-06-24 | A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects | Shulan Ruan et.al. | 2506.19769 | null |
2025-06-24 | Semantic Scene Graph for Ultrasound Image Explanation and Scanning Guidance | Xuesong Li et.al. | 2506.19683 | null |
2025-06-24 | Probabilistic modelling and safety assurance of an agriculture robot providing light-treatment | Mustafa Adam et.al. | 2506.19620 | null |
2025-06-24 | USIS16K: High-Quality Dataset for Underwater Salient Instance Segmentation | Lin Hong et.al. | 2506.19472 | null |
2025-06-23 | SpaNN: Detecting Multiple Adversarial Patches on CNNs by Spanning Saliency Thresholds | Mauricio Byrd Victorica et.al. | 2506.18591 | null |
2025-06-23 | Improvement on LiDAR-Camera Calibration Using Square Targets | Zhongyuan Li et.al. | 2506.18294 | null |
2025-06-23 | Learning Approach to Efficient Vision-based Active Tracking of a Flying Target by an Unmanned Aerial Vehicle | Jagadeswara PKV Pothuri et.al. | 2506.18264 | null |
2025-06-23 | Ground tracking for improved landmine detection in a GPR system | Li Tang et.al. | 2506.18258 | null |
2025-06-24 | Referring Expression Instance Retrieval and A Strong End-to-End Baseline | Xiangzhao Hao et.al. | 2506.18246 | null |
2025-06-24 | Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages | Klaudia Ropel et.al. | 2506.18069 | null |
2025-06-21 | YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception | Mengqi Lei et.al. | 2506.17733 | null |
2025-06-21 | CSDN: A Context-Gated Self-Adaptive Detection Network for Real-Time Object Detection | Wei Haolin et.al. | 2506.17679 | null |
2025-06-21 | DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Mihir Godbole et.al. | 2506.17590 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186 | link |
2025-06-20 | Class Agnostic Instance-level Descriptor for Visual Instance Search | Qi-Ying Sun et.al. | 2506.16745 | null |
2025-06-20 | Cross-modal Offset-guided Dynamic Alignment and Fusion for Weakly Aligned UAV Object Detection | Liu Zongzhen et.al. | 2506.16737 | null |
2025-06-19 | How Hard Is Snow? A Paired Domain Adaptation Dataset for Clear and Snowy Weather: CADC+ | Mei Qi Tang et.al. | 2506.16531 | null |
2025-06-19 | Can AI Dream of Unseen Galaxies? Conditional Diffusion Model for Galaxy Morphology Augmentation | Chenrui Ma et.al. | 2506.16233 | null |
2025-06-19 | VideoGAN-based Trajectory Proposal for Automated Vehicles | Annajoyce Mariani et.al. | 2506.16209 | null |
2025-06-19 | BLADE: An Automated Framework for Classifying Light Curves from the Center for Near-Earth Object Studies (CNEOS) Fireball Database | Elizabeth A. Silber et.al. | 2506.16099 | null |
2025-06-19 | Polyline Path Masked Attention for Vision Transformer | Zhongchen Zhao et.al. | 2506.15940 | null |
2025-06-18 | PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning | Yuhui Shi et.al. | 2506.15683 | null |
2025-06-18 | BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion | Yuqing Lan et.al. | 2506.15610 | null |
2025-06-18 | Retrospective Memory for Camouflaged Object Detection | Chenxi Zhang et.al. | 2506.15244 | null |
2025-06-18 | Fiber Signal Denoising Algorithm using Hybrid Deep Learning Networks | Linlin Wang et.al. | 2506.15125 | null |
2025-06-19 | Efficient Retail Video Annotation: A Robust Key Frame Generation Approach for Product and Customer Interaction Analysis | Varun Mannam et.al. | 2506.14854 | null |
2025-06-18 | YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework | Dahang Wan et.al. | 2506.14696 | null |
2025-06-17 | VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning | Md. Adnanul Islam et.al. | 2506.14629 | null |
2025-06-17 | GAMORA: A Gesture Articulated Meta Operative Robotic Arm for Hazardous Material Handling in Containment-Level Environments | Farha Abdul Wasay et.al. | 2506.14513 | null |
2025-06-17 | Comparison of Two Methods for Stationary Incident Detection Based on Background Image | Deepak Ghimire et.al. | 2506.14256 | null |
2025-06-16 | A Point Cloud Completion Approach for the Grasping of Partially Occluded Objects and Its Applications in Robotic Strawberry Harvesting | Ali Abouzeid et.al. | 2506.14066 | link |
2025-06-16 | FindMeIfYouCan: Bringing Open Set metrics to |
Daniel Montoya et.al. | 2506.14008 | null |
2025-06-16 | How Real is CARLAs Dynamic Vision Sensor? A Study on the Sim-to-Real Gap in Traffic Object Detection | Kaiyuan Tan et.al. | 2506.13722 | null |
2025-06-17 | Lecture Video Visual Objects (LVVO) Dataset: A Benchmark for Visual Object Detection in Educational Videos | Dipayan Biswas et.al. | 2506.13657 | link |
2025-06-16 | UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data | Vasiliki Balaska et.al. | 2506.13505 | null |
2025-06-16 | Sparse Convolutional Recurrent Learning for Efficient Event-based Neuromorphic Object Detection | Shenqi Wang et.al. | 2506.13440 | null |
2025-06-16 | Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots | Jaehong Oh et.al. | 2506.13149 | null |
2025-06-15 | MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection | Yuxiang Wang et.al. | 2506.12697 | null |
2025-06-14 | UniDet-D: A Unified Dynamic Spectral Attention Model for Object Detection under Adverse Weathers | Yuantao Wang et.al. | 2506.12324 | null |
2025-06-14 | MatchPlant: An Open-Source Pipeline for UAV-Based Single-Plant Detection and Data Extraction | Worasit Sangjan et.al. | 2506.12295 | link |
2025-06-13 | Vision-based Lifting of 2D Object Detections for Automated Driving | Hendrik Königshof et.al. | 2506.11839 | null |
2025-06-13 | Teleoperated Driving: a New Challenge for 3D Object Detection in Compressed Point Clouds | Filippo Bragato et.al. | 2506.11804 | null |
2025-06-13 | GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers | Guang Liang et.al. | 2506.11784 | null |
2025-06-13 | On the Natural Robustness of Vision-Language Models Against Visual Perception Attacks in Autonomous Driving | Pedram MohajerAnsari et.al. | 2506.11472 | null |
2025-06-12 | Teaching in adverse scenes: a statistically feedback-driven threshold and mask adjustment teacher-student framework for object detection in UAV images under adverse scenes | Hongyu Chen et.al. | 2506.11175 | null |
2025-06-12 | Discrete Lorenz Attractors in 3D Sinusoidal Maps | Sishu Shankar Muni et.al. | 2506.10788 | null |
2025-06-12 | Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement | Yuqi Shen et.al. | 2506.10712 | null |
2025-06-12 | Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection | Xinyuan Liu et.al. | 2506.10601 | link |
2025-06-12 | Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration | Jun Wang et.al. | 2506.10573 | null |
2025-06-12 | FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion | Tianpei Zhang et.al. | 2506.10366 | link |
2025-06-11 | DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos | Rajeev Yasarla et.al. | 2506.10242 | null |
2025-06-11 | CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects | Tao Liu et.al. | 2506.09897 | null |
2025-06-11 | 3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection | Yi Zhang et.al. | 2506.09541 | null |
2025-06-11 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning | Tong Wang et.al. | 2506.09327 | null |
2025-06-10 | Efficient Edge Deployment of Quantized YOLOv4-Tiny for Aerial Emergency Object Detection on Raspberry Pi 5 | Sindhu Boddu et.al. | 2506.09300 | null |
2025-06-10 | Lightweight Object Detection Using Quantized YOLOv4-Tiny for Emergency Response in Aerial Imagery | Sindhu Boddu et.al. | 2506.09299 | null |
2025-06-10 | WD-DETR: Wavelet Denoising-Enhanced Real-Time Object Detection Transformer for Robot Perception with Event Cameras | Yangjie Cui et.al. | 2506.09098 | null |
2025-06-11 | Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models | Xuanchi Ren et.al. | 2506.09042 | null |
2025-06-10 | ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations | Amirreza Rouhi et.al. | 2506.08968 | null |
2025-06-10 | Data Augmentation For Small Object using Fast AutoAugment | DaeEun Yoon et.al. | 2506.08956 | null |
2025-06-11 | Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting | Keyi Liu et.al. | 2506.08777 | null |
2025-06-09 | CrosswalkNet: An Optimized Deep Learning Framework for Pedestrian Crosswalk Detection in Aerial Images with High-Performance Computing | Zubin Bhuyan et.al. | 2506.07885 | null |
2025-06-09 | SAM2Auto: Auto Annotation Using FLASH | Arash Rocky et.al. | 2506.07850 | null |
2025-06-09 | Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods | Beining Xu et.al. | 2506.07779 | null |
2025-06-09 | SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding | Xuemei Chen et.al. | 2506.07737 | null |
2025-06-09 | Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive Study | Xiaomeng Zhu et.al. | 2506.07539 | null |
2025-06-09 | SpatialLM: Training Large Language Models for Structured Indoor Modeling | Yongsen Mao et.al. | 2506.07491 | null |
2025-06-09 | Happiness Finder: Exploring the Role of AI in Enhancing Well-Being During Four-Leaf Clover Searches | Anna Yokokubo et.al. | 2506.07393 | null |
2025-06-09 | Multiple Object Stitching for Unsupervised Representation Learning | Chengchao Shen et.al. | 2506.07364 | link |
2025-06-09 | CBAM-STN-TPS-YOLO: Enhancing Agricultural Object Detection through Spatially Adaptive Attention Mechanisms | Satvik Praveen et.al. | 2506.07357 | null |
2025-06-08 | UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning | Weiqi Yan et.al. | 2506.07087 | null |
2025-06-06 | Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection | Yu Li et.al. | 2506.05872 | null |
2025-06-06 | Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration | Fanhu Zeng et.al. | 2506.05709 | null |
2025-06-06 | Integer Binary-Range Alignment Neuron for Spiking Neural Networks | Binghao Ye et.al. | 2506.05679 | null |
2025-06-05 | CL-ISR: A Contrastive Learning and Implicit Stance Reasoning Framework for Misleading Text Detection on Social Media | Tianyi Huang et.al. | 2506.05107 | null |
2025-06-05 | Synthetic Dataset Generation for Autonomous Mobile Robots Using 3D Gaussian Splatting for Vision Training | Aneesh Deogan et.al. | 2506.05092 | null |
2025-06-06 | Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets | Mikhail Kennerley et.al. | 2506.04737 | null |
2025-06-05 | Gen-n-Val: Agentic Image Data Generation and Validation | Jing-En Huang et.al. | 2506.04676 | null |
2025-06-05 | VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection | Wuyang Li et.al. | 2506.04623 | null |
2025-06-04 | FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices | Shizhong Han et.al. | 2506.04499 | null |
2025-06-04 | Neural Object Detection for 4D STEM: High-Throughput Sub-Pixel Electron Diffraction Pattern Recognition | Arda Genc et.al. | 2506.04477 | null |
2025-06-04 | Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector | Boyong He et.al. | 2506.04211 | link |
2025-06-04 | FSHNet: Fully Sparse Hybrid Network for 3D Object Detection | Shuai Liu et.al. | 2506.03714 | null |
2025-06-04 | How PARTs assemble into wholes: Learning the relative composition of images | Melika Ayoughi et.al. | 2506.03682 | null |
2025-06-05 | MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection | Xiaochun Lei et.al. | 2506.03654 | null |
2025-06-04 | DiagNet: Detecting Objects using Diagonal Constraints on Adjacency Matrix of Graph Neural Network | Chong Hyun Lee et.al. | 2506.03571 | null |
2025-06-03 | SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports | Dheeraj Khanna et.al. | 2506.03335 | null |
2025-06-03 | Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding | Weiqing Xiao et.al. | 2506.03134 | null |
2025-06-03 | HACo-Det: A Study Towards Fine-Grained Machine-Generated Text Detection under Human-AI Coauthoring | Zhixiong Su et.al. | 2506.02959 | null |
2025-06-03 | Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection | Yechi Ma et.al. | 2506.02914 | null |
2025-06-03 | A Dynamic Transformer Network for Vehicle Detection | Chunwei Tian et.al. | 2506.02765 | null |
2025-06-03 | Open-PMC-18M: A High-Fidelity Large Scale Medical Dataset for Multimodal Representation Learning | Negin Baghbanzadeh et.al. | 2506.02738 | null |
2025-06-03 | GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Shufan Qing et.al. | 2506.02736 | link |
2025-06-03 | Sight Guide: A Wearable Assistive Perception and Navigation System for the Vision Assistance Race in the Cybathlon 2024 | Patrick Pfreundschuh et.al. | 2506.02676 | null |
2025-06-03 | Probabilistic Online Event Downsampling | Andreu Girbau-Xalabarder et.al. | 2506.02547 | null |
2025-06-03 | Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning | Kunyu Wang et.al. | 2506.02462 | null |
2025-06-03 | Auto-Labeling Data for Object Detection | Brent A. Griffin et.al. | 2506.02359 | null |
2025-05-30 | Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors | Andrea Pedrotti et.al. | 2505.24523 | null |
2025-05-30 | Deformable Attention Mechanisms Applied to Object Detection, case of Remote Sensing | Anasse Boutayeb et.al. | 2505.24489 | null |
2025-05-30 | Leadership Assessment in Pediatric Intensive Care Unit Team Training | Liangyang Ouyang et.al. | 2505.24389 | null |
2025-05-30 | D2AF: A Dual-Driven Annotation and Filtering Framework for Visual Grounding | Yichi Zhang et.al. | 2505.24372 | null |
2025-05-29 | Conformal Object Detection by Sequential Risk Control | Léo Andéol et.al. | 2505.24038 | null |
2025-05-29 | Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping | Justin Lazarow et.al. | 2505.23756 | null |
2025-05-29 | Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need | Qiang Wang et.al. | 2505.23744 | null |
2025-05-29 | FMG-Det: Foundation Model Guided Robust Object Detection | Darryl Hannan et.al. | 2505.23726 | null |
2025-05-29 | CF-DETR: Coarse-to-Fine Transformer for Real-Time Object Detection | Woojin Shin et.al. | 2505.23317 | null |
2025-05-30 | WTEFNet: Real-Time Low-Light Object Detection for Advanced Driver Assistance Systems | Hao Wu et.al. | 2505.23201 | null |
2025-05-29 | Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images | Sungjune Park et.al. | 2505.23193 | null |
2025-05-29 | DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes | Sungjune Park et.al. | 2505.23179 | null |
2025-05-29 | The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector | Aixuan Li et.al. | 2505.22499 | null |
2025-05-28 | VME: A Satellite Imagery Dataset and Benchmark for Detecting Vehicles in the Middle East and Beyond | Noora Al-Emadi et.al. | 2505.22353 | link |
2025-05-28 | Task-Driven Implicit Representations for Automated Design of LiDAR Systems | Nikhil Behari et.al. | 2505.22344 | null |
2025-05-29 | YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction | Mingzhuang Wang et.al. | 2505.22250 | null |
2025-05-28 | S2AFormer: Strip Self-Attention for Efficient Vision Transformer | Guoan Xu et.al. | 2505.22195 | null |
2025-05-28 | Learning A Robust RGB-Thermal Detector for Extreme Modality Imbalance | Chao Tian et.al. | 2505.22154 | null |
2025-05-28 | Prototype Embedding Optimization for Human-Object Interaction Detection in Livestreaming | Menghui Zhang et.al. | 2505.22011 | null |
2025-05-28 | Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection | Guiping Cao et.al. | 2505.21868 | null |
2025-05-27 | Object Concepts Emerge from Motion | Haoqian Liang et.al. | 2505.21635 | null |
2025-05-27 | Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO | Muzhi Zhu et.al. | 2505.21457 | null |
2025-05-27 | Visual Product Graph: Bridging Visual Products And Composite Images For End-to-End Style Recommendations | Yue Li Du et.al. | 2505.21454 | null |
2025-05-27 | YOLO-SPCI: Enhancing Remote Sensing Object Detection via Selective-Perspective-Class Integration | Xinyuan Wang et.al. | 2505.21370 | null |
2025-05-27 | Assured Autonomy with Neuro-Symbolic Perception | R. Spencer Hallyburton et.al. | 2505.21322 | null |
2025-05-27 | Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing | Dehao Wang et.al. | 2505.21049 | null |
2025-05-27 | Facial Attribute Based Text Guided Face Anonymization | Mustafa İzzet Muştu et.al. | 2505.21002 | null |
2025-05-27 | YOLO-FireAD: Efficient Fire Detection via Attention-Guided Inverted Residual Learning and Dual-Pooling Feature Preservation | Weichao Pan et.al. | 2505.20884 | null |
2025-05-27 | Open-Det: An Efficient Learning Framework for Open-Ended Detection | Guiping Cao et.al. | 2505.20639 | null |
2025-05-27 | Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models | Peter Robicheaux et.al. | 2505.20612 | null |
2025-05-26 | From Data to Modeling: Fully Open-vocabulary Scene Graph Generation | Zuyao Chen et.al. | 2505.20106 | null |
2025-05-26 | Target Tracking via LiDAR-RADAR Sensor Fusion for Autonomous Racing | Marcello Cellina et.al. | 2505.20043 | null |
2025-05-26 | Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement | Afrah Shaahid et.al. | 2505.19895 | null |
2025-05-26 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting | Wenhua Wu et.al. | 2505.19420 | null |
2025-05-26 | Neural nanophotonic object detector with ultra-wide field-of-view | Ji Chen et.al. | 2505.19379 | null |
2025-05-25 | What do Blind and Low-Vision People Really Want from Assistive Smart Devices? Comparison of the Literature with a Focus Study | Bhanuka Gamage et.al. | 2505.19325 | null |
2025-05-25 | VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion | Zhiwei Lin et.al. | 2505.18986 | null |
2025-05-24 | Mitigating Context Bias in Domain Adaptation for Object Detection using Mask Pooling | Hojun Son et.al. | 2505.18446 | null |
2025-05-23 | Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms | Gefei Shen et.al. | 2505.18302 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015 | null |
2025-05-23 | RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection | Ozsel Kilinc et.al. | 2505.17732 | null |
2025-05-23 | Adaptive Semantic Token Communication for Transformer-based Edge Inference | Alessio Devoto et.al. | 2505.17604 | null |
2025-05-23 | Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras | Masataka Kobayashi et.al. | 2505.17582 | null |
2025-05-23 | OrionBench: A Benchmark for Chart and Human-Recognizable Object Detection in Infographics | Jiangning Zhu et.al. | 2505.17473 | null |
2025-05-23 | Reflectance Prediction-based Knowledge Distillation for Robust 3D Object Detection in Compressed Point Clouds | Hao Jing et.al. | 2505.17442 | null |
2025-05-23 | Optimizing YOLOv8 for Parking Space Detection: Comparative Analysis of Custom YOLOv8 Architecture | Apar Pokhrel et.al. | 2505.17364 | null |
2025-05-22 | Extending Dataset Pruning to Object Detection: A Variance-based Approach | Ryota Yagi et.al. | 2505.17245 | null |
2025-05-22 | Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining | Shangquan Sun et.al. | 2505.16811 | null |
2025-05-22 | Robust Vision-Based Runway Detection through Conformal Prediction and Conformal mAP | Alya Zouzou et.al. | 2505.16740 | link |
2025-05-22 | CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving | Huitong Yang et.al. | 2505.16524 | null |
2025-05-22 | MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection | Yichen Li et.al. | 2505.16442 | null |
2025-05-22 | AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems | Yuanhao Huang et.al. | 2505.16402 | link |
2025-05-22 | Self-Classification Enhancement and Correction for Weakly Supervised Object Detection | Yufei Yin et.al. | 2505.16294 | null |
2025-05-21 | MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling | Cheng Yifan et.al. | 2505.15772 | null |
2025-05-21 | The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection | Tianjiao Cao et.al. | 2505.15649 | link |
2025-05-21 | SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks | Iuliia Kotseruba et.al. | 2505.15628 | link |
2025-05-21 | Detection of Underwater Multi-Targets Based on Self-Supervised Learning and Deformable Path Aggregation Feature Pyramid Network | Chang Liu et.al. | 2505.15518 | null |
2025-05-21 | Trends and Challenges in Authorship Analysis: A Review of ML, DL, and LLM Approaches | Nudrat Habib et.al. | 2505.15422 | null |
2025-05-21 | RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | Naman Patel et.al. | 2505.15373 | null |
2025-05-21 | AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection | Jiatao Li et.al. | 2505.15261 | null |
2025-05-21 | Multispectral Detection Transformer with Infrared-Centric Sensor Fusion | Seongmin Hwang et.al. | 2505.15137 | null |
2025-05-20 | Colors Matter: AI-Driven Exploration of Human Feature Colors | Rama Alyoubi et.al. | 2505.14931 | link |
2025-05-20 | Language Models Optimized to Fool Detectors Still Have a Distinct Style (And How to Change It) | Rafael Rivera Soto et.al. | 2505.14608 | null |
2025-05-20 | SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation | Yuyang Dong et.al. | 2505.14381 | null |
2025-05-20 | FAID: Fine-grained AI-generated Text Detection using Multi-task Auxiliary and Multi-level Contrastive Learning | Minh Ngoc Ta et.al. | 2505.14271 | null |
2025-05-20 | Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | Bin-Bin Gao et.al. | 2505.14239 | null |
2025-05-20 | Intra-class Patch Swap for Self-Distillation | Hongjun Choi et.al. | 2505.14124 | link |
2025-05-20 | Scaling Vision Mamba Across Resolutions via Fractal Traversal | Bo Li et.al. | 2505.14062 | null |
2025-05-20 | Automated Quality Evaluation of Cervical Cytopathology Whole Slide Images Based on Content Analysis | Lanlan Kang et.al. | 2505.13875 | null |
2025-05-20 | Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving | Jingzheng Li et.al. | 2505.13872 | null |
2025-05-20 | Domain Gating Ensemble Networks for AI-Generated Text Detection | Arihant Tripathi et.al. | 2505.13855 | null |
2025-05-20 | A Challenge to Build Neuro-Symbolic Video Agents | Sahil Shah et.al. | 2505.13851 | null |
2025-05-19 | Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | Xiao Wang et.al. | 2505.12908 | link |
2025-05-19 | Rethinking Features-Fused-Pyramid-Neck for Object Detection | Hulin Li et.al. | 2505.12820 | link |
2025-05-19 | Enhancing Transformers Through Conditioned Embedded Tokens | Hemanth Saratchandran et.al. | 2505.12789 | null |
2025-05-19 | LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking | Martha Teiko Teye et.al. | 2505.12753 | null |
2025-05-19 | VLC Fusion: Vision-Language Conditioned Sensor Fusion for Robust Object Detection | Aditya Taparia et.al. | 2505.12715 | null |
2025-05-18 | LM |
Xu Zheng et.al. | 2505.12507 | null |
2025-05-17 | EarthSynth: Generating Informative Earth Observation with Diffusion Models | Jiancheng Pan et.al. | 2505.12108 | null |
2025-05-17 | Experimental Study on Automatically Assembling Custom Catering Packages With a 3-DOF Delta Robot Using Deep Learning Methods | Reihaneh Yourdkhani et.al. | 2505.11879 | null |
2025-05-16 | Improving Object Detection Performance through YOLOv8: A Comprehensive Training and Evaluation Study | Rana Poureskandar et.al. | 2505.11424 | null |
2025-05-16 | MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection | Shrutarv Awasthi et.al. | 2505.11282 | null |
2025-05-16 | M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection | Chao Wang et.al. | 2505.10931 | null |
2025-05-16 | A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation | Jinke Li et.al. | 2505.10825 | null |
2025-05-15 | StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation | Daniel A. P. Oliveira et.al. | 2505.10292 | link |
2025-05-15 | Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data | Prashant P. Shinde et.al. | 2505.10192 | null |
2025-05-15 | Application of YOLOv8 in monocular downward multiple Car Target detection | Shijie Lyu et.al. | 2505.10016 | null |
2025-05-14 | EdgeAI Drone for Autonomous Construction Site Demonstrator | Emre Girgin et.al. | 2505.09837 | link |
2025-05-14 | WhatsAI: Transforming Meta Ray-Bans into an Extensible Generative AI Platform for Accessibility | Nasif Zaman et.al. | 2505.09823 | null |
2025-05-14 | MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection | Xiangyuan Peng et.al. | 2505.09422 | null |
2025-05-14 | A drone that learns to efficiently find objects in agricultural fields: from simulation to the real world | Rick van Essen et.al. | 2505.09278 | null |
2025-05-14 | DRRNet: Macro-Micro Feature Fusion and Dual Reverse Refinement for Camouflaged Object Detection | Jianlin Sun et.al. | 2505.09168 | link |
2025-05-14 | Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models | Lucas Choi et.al. | 2505.09139 | null |
2025-05-14 | Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance | Guoying Liang et.al. | 2505.09123 | null |
2025-05-13 | Robustness Analysis against Adversarial Patch Attacks in Fully Unmanned Stores | Hyunsik Na et.al. | 2505.08835 | null |
2025-05-13 | Augmented Reality for RObots (ARRO): Pointing Visuomotor Policies Towards Visual Robustness | Reihaneh Mirjalili et.al. | 2505.08627 | null |
2025-05-14 | Thermal Detection of People with Mobility Restrictions for Barrier Reduction at Traffic Lights Controlled Intersections | Xiao Ni et.al. | 2505.08568 | link |
2025-05-13 | MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM | Saqi Hussain Kalan et.al. | 2505.08388 | null |
2025-05-13 | HMPNet: A Feature Aggregation Architecture for Maritime Object Detection from a Shipborne Perspective | Yu Zhang et.al. | 2505.08231 | link |
2025-05-13 | Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix | Unai Gurbindo et.al. | 2505.08228 | null |
2025-05-13 | MoKD: Multi-Task Optimization for Knowledge Distillation | Zeeshan Hayder et.al. | 2505.08170 | null |
2025-05-12 | LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention | Jiangling Zhang et.al. | 2505.07734 | null |
2025-05-12 | Hybrid Spiking Vision Transformer for Object Detection with Event Cameras | Qi Xu et.al. | 2505.07715 | null |
2025-05-12 | Self-Supervised Event Representations: Towards Accurate, Real-Time Perception on SoC FPGAs | Kamil Jeziorek et.al. | 2505.07556 | null |
2025-05-12 | Automated Visual Attention Detection using Mobile Eye Tracking in Behavioral Classroom Studies | Efe Bozkir et.al. | 2505.07552 | null |
2025-05-12 | DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection | Mingqian Ji et.al. | 2505.07398 | null |
2025-05-12 | Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection | Hongda Qin et.al. | 2505.07219 | link |
2025-05-11 | Differentiable NMS via Sinkhorn Matching for End-to-End Fabric Defect Detection | Zhengyang Lu et.al. | 2505.07040 | null |
2025-05-11 | VALISENS: A Validated Innovative Multi-Sensor System for Cooperative Automated Driving | Lei Wan et.al. | 2505.06980 | null |
2025-05-10 | M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | Morui Zhu et.al. | 2505.06746 | null |
2025-05-10 | Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search | XiaoTong Gu et.al. | 2505.06694 | null |
2025-05-09 | Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles | Anupkumar Bochare et.al. | 2505.06113 | null |
2025-05-09 | Artificial intelligence pioneers the double-strangeness factory | Yan He et.al. | 2505.05802 | null |
2025-05-09 | Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection | Zhangchi Hu et.al. | 2505.05741 | null |
2025-05-09 | DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer | Ho-Joong Kim et.al. | 2505.05711 | link |
2025-05-08 | PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model | Zhang Zhang et.al. | 2505.05397 | null |
2025-05-08 | PaniCar: Securing the Perception of Advanced Driving Assistance Systems Against Emergency Vehicle Lighting | Elad Feldman et.al. | 2505.05183 | null |
2025-05-08 | Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction | Xiaowei Zhu et.al. | 2505.05084 | null |
2025-05-08 | FG-CLIP: Fine-Grained Visual and Textual Alignment | Chunyu Xie et.al. | 2505.05071 | null |
2025-05-08 | A Simple Detector with Frame Dynamics is a Strong Tracker | Chenxu Peng et.al. | 2505.04917 | null |
2025-05-08 | Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model | Navin Ranjan et.al. | 2505.04861 | null |
2025-05-07 | Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective | Songsong Duan et.al. | 2505.04758 | null |
2025-05-07 | Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer | Sainath Dey et.al. | 2505.04740 | null |
2025-05-08 | MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection | Zhihao Zhang et.al. | 2505.04594 | null |
2025-05-07 | Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration | Asma Baobaid et.al. | 2505.04524 | null |
2025-05-07 | Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition | Asma Baobaid et.al. | 2505.04502 | null |
2025-05-07 | DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception | Junjie Wang et.al. | 2505.04410 | null |
2025-05-06 | LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs | Xinyuan Zhang et.al. | 2505.03460 | null |
2025-05-06 | Robustness in AI-Generated Detection: Enhancing Resistance to Adversarial Attacks | Sun Haoxuan et.al. | 2505.03435 | null |
2025-05-06 | From Word to Sentence: A Large-Scale Multi-Instance Dataset for Open-Set Aerial Detection | Guoting Wei et.al. | 2505.03334 | null |
2025-05-06 | VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis | Xinyuan Yan et.al. | 2505.03132 | null |
2025-05-05 | Sim2Real Transfer for Vision-Based Grasp Verification | Pau Amargant et.al. | 2505.03046 | link |
2025-05-05 | DPNet: Dynamic Pooling Network for Tiny Object Detection | Luqi Gong et.al. | 2505.02797 | null |
2025-05-05 | RGBX-DiffusionDet: A Framework for Multi-Modal RGB-X Object Detection Using DiffusionDet | Eliraz Orfaig et.al. | 2505.02586 | null |
2025-05-05 | Point Cloud Recombination: Systematic Real Data Augmentation Using Robotic Targets for LiDAR Perception Validation | Hubert Padusinski et.al. | 2505.02476 | null |
2025-05-04 | Robust AI-Generated Face Detection with Imbalanced Data | Yamini Sri Krubha et.al. | 2505.02182 | link |
2025-05-04 | Transforming faces into video stories -- VideoFace2.0 | Branko Brkljač et.al. | 2505.02060 | null |
2025-05-03 | DriveNetBench: An Affordable and Configurable Single-Camera Benchmarking System for Autonomous Driving Networks | Ali Al-Bustami et.al. | 2505.01893 | link |
2025-05-03 | OODTE: A Differential Testing Engine for the ONNX Optimizer | Nikolaos Louloudakis et.al. | 2505.01892 | null |
2025-05-03 | CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture | Vladimir Frants et.al. | 2505.01882 | null |
2025-05-03 | DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Haoteng Li et.al. | 2505.01857 | null |
2025-05-03 | Toward Onboard AI-Enabled Solutions to Space Object Detection for Space Sustainability | Wenxuan Zhang et.al. | 2505.01650 | null |
2025-05-02 | Efficient Vision-based Vehicle Speed Estimation | Andrej Macko et.al. | 2505.01203 | null |
2025-05-02 | CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion | Boyuan Meng et.al. | 2505.00938 | null |
2025-05-01 | Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L | Woong-Chan Byun et.al. | 2505.00757 | null |
2025-05-03 | Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook | Muyi Bao et.al. | 2505.00630 | null |
2025-05-01 | Visual Trajectory Prediction of Vessels for Inland Navigation | Alexander Puzicha et.al. | 2505.00599 | null |
2025-05-01 | Synthesizing and Identifying Noise Levels in Autonomous Vehicle Camera Radar Datasets | Mathis Morales et.al. | 2505.00584 | null |
2025-05-01 | X-ray illicit object detection using hybrid CNN-transformer neural network architectures | Jorgen Cani et.al. | 2505.00564 | null |
2025-05-01 | A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic | Muhammad Imran Zaman et.al. | 2505.00534 | null |
2025-05-01 | Inconsistency-based Active Learning for LiDAR Object Detection | Esteban Rivera et.al. | 2505.00511 | null |
2025-05-01 | HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection | Esteban Rivera et.al. | 2505.00507 | null |
2025-05-01 | Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution | Luigi Sigillo et.al. | 2505.00334 | null |
2025-04-30 | V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving | Jannik Lübberstedt et.al. | 2505.00156 | null |
2025-04-30 | LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Marc Glocker et.al. | 2504.21716 | null |
2025-04-30 | Visual Text Processing: A Comprehensive Review and Unified Evaluation | Yan Shu et.al. | 2504.21682 | null |
2025-04-29 | T2ID-CAS: Diffusion Model and Class Aware Sampling to Mitigate Class Imbalance in Neck Ultrasound Anatomical Landmark Detection | Manikanta Varaganti et.al. | 2504.21231 | null |
2025-04-29 | FLIM-based Salient Object Detection Networks with Adaptive Decoders | Gilson Junior Soares et.al. | 2504.20872 | null |
2025-04-29 | A Survey on Event-based Optical Marker Systems | Nafiseh Jabbari Tofighi et.al. | 2504.20736 | null |
2025-04-29 | Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection | Siwei Wang et.al. | 2504.20602 | null |
2025-04-29 | Style-Adaptive Detection Transformer for Single-Source Domain Generalized Object Detection | Jianhong Han et.al. | 2504.20498 | null |
2025-04-28 | More Clear, More Flexible, More Precise: A Comprehensive Oriented Object Detection benchmark for UAV | Kai Ye et.al. | 2504.20032 | null |
2025-04-28 | Lossy Source Coding with Focal Loss | Alex Dytso et.al. | 2504.19913 | null |
2025-04-28 | Neural network task specialization via domain constraining | Roman Malashin et.al. | 2504.19592 | null |
2025-04-28 | GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability | Sehyeong Jo et.al. | 2504.19414 | null |
2025-04-27 | Improving Small Drone Detection Through Multi-Scale Processing and Data Augmentation | Rayson Laroca et.al. | 2504.19347 | null |
2025-04-27 | ODExAI: A Comprehensive Object Detection Explainable AI Evaluation | Loc Phuc Truong Nguyen et.al. | 2504.19249 | null |
2025-04-27 | Boosting Single-domain Generalized Object Detection via Vision-Language Knowledge Interaction | Xiaoran Xu et.al. | 2504.19086 | null |
2025-04-26 | Federated Learning-based Semantic Segmentation for Lane and Object Detection in Autonomous Driving | Gharbi Khamis Alshammari et.al. | 2504.18939 | null |
2025-04-25 | Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection | Brian K. S. Isaac-Medina et.al. | 2504.18746 | null |
2025-04-25 | A Review of 3D Object Detection with Vision-Language Models | Ranjan Sapkota et.al. | 2504.18738 | null |
2025-04-25 | Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models | Patrick Müller et.al. | 2504.18510 | null |
2025-04-25 | Iterative Event-based Motion Segmentation by Variational Contrast Maximization | Ryo Yamaki et.al. | 2504.18447 | null |
2025-04-25 | A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Carlo Sgaravatti et.al. | 2504.18419 | null |
2025-04-25 | A comprehensive review of classifier probability calibration metrics | Richard Oliver Lane et.al. | 2504.18278 | null |
2025-04-25 | LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring | Raul David Dominguez Sanchez et.al. | 2504.18203 | null |
2025-04-25 | Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition | Yin Tang et.al. | 2504.18201 | null |
2025-04-25 | E-InMeMo: Enhanced Prompting for Visual In-Context Learning | Jiahao Zhang et.al. | 2504.18158 | null |
2025-04-25 | MASF-YOLO: An Improved YOLOv11 Network for Small Object Detection on Drone View | Liugang Lu et.al. | 2504.18136 | null |
2025-04-25 | Opportunistic Collaborative Planning with Large Vision Model Guided Control and Joint Query-Service Optimization | Jiayi Chen et.al. | 2504.18057 | null |
2025-04-25 | Direct sampling method to retrieve small objects from two-dimensional limited-aperture scattered field data | Won-Kwang Park et.al. | 2504.18036 | null |
2025-04-24 | DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks | Yinqi Li et.al. | 2504.17253 | link |
2025-04-24 | Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation | Phillip Y. Lee et.al. | 2504.17207 | null |
2025-04-24 | AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models | Mohammad Zarei et.al. | 2504.17179 | null |
2025-04-23 | Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection | Jens Petersen et.al. | 2504.17076 | null |
2025-04-23 | Gaussian Splatting is an Effective Data Generator for 3D Object Detection | Farhad G. Zanjani et.al. | 2504.16740 | null |
2025-04-23 | EHGCN: Hierarchical Euclidean-Hyperbolic Fusion via Motion-Aware GCN for Hybrid Event Stream Perception | Haosheng Chen et.al. | 2504.16616 | null |
2025-04-23 | Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks | Murat Bilgehan Ertan et.al. | 2504.16557 | null |
2025-04-23 | Assessing the Feasibility of Internet-Sourced Video for Automatic Cattle Lameness Detection | Md Fahimuzzman Sohan et.al. | 2504.16404 | null |
2025-04-23 | Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection | Linhua Kong et.al. | 2504.16368 | null |
2025-04-22 | Vision Controlled Orthotic Hand Exoskeleton | Connor Blais et.al. | 2504.16319 | null |
2025-04-22 | Physical Intelligence et.al. | 2504.16054 | null | |
2025-04-22 | SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems | Manjunath D et.al. | 2504.15728 | null |
2025-04-22 | You Sense Only Once Beneath: Ultra-Light Real-Time Underwater Object Detection | Jun Dong et.al. | 2504.15694 | null |
2025-04-22 | A Vision-Enabled Prosthetic Hand for Children with Upper Limb Disabilities | Md Abdul Baset Sarker et.al. | 2504.15654 | null |
2025-04-21 | Context Aware Grounded Teacher for Source Free Object Detection | Tajamul Ashraf et.al. | 2504.15404 | null |
2025-04-21 | SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam | Tue Vo et.al. | 2504.15252 | null |
2025-04-21 | An Efficient Aerial Image Detection with Variable Receptive Fields | Liu Wenbin et.al. | 2504.15165 | null |
2025-04-19 | Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization | Nazia Aslam et.al. | 2504.14301 | null |
2025-04-19 | Visual Consensus Prompting for Co-Salient Object Detection | Jie Wang et.al. | 2504.14254 | link |
2025-04-18 | Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | Junjie Yang et.al. | 2504.13825 | null |
2025-04-18 | Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Yushen He et.al. | 2504.13647 | link |
2025-04-18 | DenSe-AdViT: A novel Vision Transformer for Dense SAR Object Detection | Yang Zhang et.al. | 2504.13638 | null |
2025-04-18 | HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection | YangChen Zeng et.al. | 2504.13469 | null |
2025-04-18 | Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Shashank Shriram et.al. | 2504.13399 | link |
2025-04-17 | VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture | Long Li et.al. | 2504.13365 | null |
2025-04-17 | SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling | Yasin Almalioglu et.al. | 2504.13310 | null |
2025-04-17 | Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes | Andreas Lau Hansen et.al. | 2504.13297 | null |
2025-04-17 | RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity | Ranjan Sapkota et.al. | 2504.13099 | null |
2025-04-17 | Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving | Shumin Wang et.al. | 2504.12709 | null |
2025-04-18 | RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding | Hang Ji et.al. | 2504.12643 | null |
2025-04-16 | Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline | Joanne Lin et.al. | 2504.12169 | null |
2025-04-16 | RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning | Yuan Luo et.al. | 2504.12167 | null |
2025-04-16 | pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild | Jonas Myhre Schiøtt et.al. | 2504.12045 | null |
2025-04-16 | A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions | Rahima Khanam et.al. | 2504.11995 | null |
2025-04-16 | Multimodal Spatio-temporal Graph Learning for Alignment-free RGBT Video Object Detection | Qishun Wang et.al. | 2504.11779 | null |
2025-04-15 | Multi-level Cellular Automata for FLIM networks | Felipe Crispim Salvagnini et.al. | 2504.11406 | null |
2025-04-15 | OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution | Lucio La Cava et.al. | 2504.11369 | null |
2025-04-15 | CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection | Jincheng Kang et.al. | 2504.11305 | null |
2025-04-15 | TSAL: Few-shot Text Segmentation Based on Attribute Learning | Chenming Li et.al. | 2504.11164 | null |
2025-04-15 | Flyweight FLIM Networks for Salient Object Detection in Biomedical Images | Leonardo M. Joao et.al. | 2504.11112 | null |
2025-04-15 | S |
Yu Lin et.al. | 2504.11111 | null |
2025-04-15 | DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen | Hyejin Lee et.al. | 2504.11019 | null |
2025-04-16 | GATE3D: Generalized Attention-based Task-synergized Estimation in 3D* | Eunsoo Im et.al. | 2504.11014 | null |
2025-04-15 | CDUPatch: Color-Driven Universal Adversarial Patch Attack for Dual-Modal Visible-Infrared Detectors | Jiahuan Long et.al. | 2504.10888 | null |
2025-04-15 | Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task | Aviral Chharia et.al. | 2504.10880 | null |
2025-04-14 | DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing | Jinyue Zhang et.al. | 2504.10278 | null |
2025-04-14 | Balancing Stability and Plasticity in Pretrained Detector: A Dual-Path Framework for Incremental Object Detection | Songze Li et.al. | 2504.10214 | null |
2025-04-14 | WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs | Nguyen Ngoc Dat et.al. | 2504.10165 | null |
2025-04-14 | COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts | Jiansheng Li et.al. | 2504.10158 | null |
2025-04-14 | SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting | Dongliang Luo et.al. | 2504.09966 | null |
2025-04-14 | Small Object Detection with YOLO: A Performance Analysis Across Model Versions and Hardware | Muhammad Fasih Tariq et.al. | 2504.09900 | null |
2025-04-14 | Density-based Object Detection in Crowded Scenes | Chenyang Zhao et.al. | 2504.09819 | null |
2025-04-13 | Uncertainty Guided Refinement for Fine-Grained Salient Object Detection | Yao Yuan et.al. | 2504.09666 | link |
2025-04-13 | Pillar-Voxel Fusion Network for 3D Object Detection in Airborne Hyperspectral Point Clouds | Yanze Jiang et.al. | 2504.09506 | null |
2025-04-13 | Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation | Yongchao Feng et.al. | 2504.09480 | null |
2025-04-11 | TinyCenterSpeed: Efficient Center-Based Object Detection for Autonomous Racing | Neil Reichlin et.al. | 2504.08655 | null |
2025-04-11 | Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization | Jialu Li et.al. | 2504.08641 | null |
2025-04-10 | Enhanced Cooperative Perception Through Asynchronous Vehicle to Infrastructure Framework with Delay Mitigation for Connected and Automated Vehicles | Nithish Kumar Saravanan et.al. | 2504.08172 | null |
2025-04-10 | Multi-Task Learning with Multi-Annotation Triplet Loss for Improved Object Detection | Meilun Zhou et.al. | 2504.08054 | null |
2025-04-10 | Detect Anything 3D in the Wild | Hanxue Zhang et.al. | 2504.07958 | null |
2025-04-11 | Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks | Erin Carson et.al. | 2504.07835 | null |
2025-04-10 | P2Object: Single Point Supervised Object Detection and Instance Segmentation | Pengfei Chen et.al. | 2504.07813 | null |
2025-04-10 | Nonlocal Retinex-Based Variational Model and its Deep Unfolding Twin for Low-Light Image Enhancement | Daniel Torres et.al. | 2504.07810 | null |
2025-04-10 | Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network | Peng Jia et.al. | 2504.07777 | null |
2025-04-10 | Prediction of Usage Probabilities of Shopping-Mall Corridors Using Heterogeneous Graph Neural Networks | Malik M Barakathullah et.al. | 2504.07645 | null |
2025-04-10 | VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model | Haozhan Shen et.al. | 2504.07615 | link |
2025-04-10 | RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions | Youngwan Jin et.al. | 2504.07603 | null |
2025-04-10 | WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer | Huilin Yin et.al. | 2504.07441 | null |
2025-04-10 | Model Discrepancy Learning: Synthetic Faces Detection Based on Multi-Reconstruction | Qingchao Jiang et.al. | 2504.07382 | link |
2025-04-09 | Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Ruoyu Chen et.al. | 2504.07060 | null |
2025-04-09 | UAV Position Estimation using a LiDAR-based 3D Object Detection Method | Uthman Olawoye et.al. | 2504.07028 | null |
2025-04-09 | Towards Efficient Roadside LiDAR Deployment: A Fast Surrogate Metric Based on Entropy-Guided Visibility | Yuze Jiang et.al. | 2504.06772 | null |
2025-04-09 | Domain-Conditioned Scene Graphs for State-Grounded Task Planning | Jonas Herzog et.al. | 2504.06661 | null |
2025-04-09 | Visually Similar Pair Alignment for Robust Cross-Domain Object Detection | Onkar Krishna et.al. | 2504.06607 | null |
2025-04-08 | From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction | Vladimir Golovkin et.al. | 2504.06357 | null |
2025-04-08 | Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images | Hicham Talaoubrid et.al. | 2504.06330 | null |
2025-04-08 | Security Analysis of Thumbnail-Preserving Image Encryption and a New Framework | Dong Xie et.al. | 2504.06083 | null |
2025-04-08 | Balancing long- and short-term dynamics for the modeling of saliency in videos | Theodor Wulff et.al. | 2504.05913 | null |
2025-04-08 | PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario | Sriram Mandalika et.al. | 2504.05908 | null |
2025-04-08 | Intrinsic Saliency Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation | Xiangyu Zheng et.al. | 2504.05904 | null |
2025-04-08 | KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection | Xingyuan Li et.al. | 2504.05878 | null |
2025-04-08 | DefMamba: Deformable Visual State Space Model | Leiye Liu et.al. | 2504.05794 | null |
2025-04-08 | Event-based Civil Infrastructure Visual Defect Detection: ev-CIVIL Dataset and Benchmark | Udayanga G. W. K. N. Gamage et.al. | 2504.05679 | null |
2025-04-08 | POD: Predictive Object Detection with Single-Frame FMCW LiDAR Point Cloud | Yining Shi et.al. | 2504.05649 | null |
2025-04-08 | AD-Det: Boosting Object Detection in UAV Images with Focused Small Objects and Balanced Tail Classes | Zhenteng Li et.al. | 2504.05601 | null |
2025-04-07 | SSLFusion: Scale & Space Aligned Latent Fusion Model for Multimodal 3D Object Detection | Bonan Ding et.al. | 2504.05170 | null |
2025-04-07 | Inland Waterway Object Detection in Multi-environment: Dataset and Approach | Shanshan Wang et.al. | 2504.04835 | null |
2025-04-07 | Playing Non-Embedded Card-Based Games with Reinforcement Learning | Tianyang Wu et.al. | 2504.04783 | null |
2025-04-07 | Feedback-Enhanced Hallucination-Resistant Vision-Language Model for Real-Time Scene Understanding | Zahir Alsulaimawi et.al. | 2504.04772 | null |
2025-04-07 | Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection | Zhenxing Ming et.al. | 2504.04732 | null |
2025-04-06 | Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection | Jiancheng Pan et.al. | 2504.04517 | link |
2025-04-06 | eKalibr-Stereo: Continuous-Time Spatiotemporal Calibration for Event-Based Stereo Visual Systems | Shuolong Chen et.al. | 2504.04451 | link |
2025-04-05 | Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications | Brayan Monroy et.al. | 2504.04228 | null |
2025-04-05 | An Optimized Density-Based Lane Keeping System for A Cost-Efficient Autonomous Vehicle Platform: AurigaBot V1 | Farbod Younesi et.al. | 2504.04217 | null |
2025-04-05 | Learning about the Physical World through Analytic Concepts | Jianhua Sun et.al. | 2504.04170 | null |
2025-04-04 | VISTA-OCR: Towards generative and interactive end to end OCR models | Laziz Hamdi et.al. | 2504.03621 | null |
2025-04-04 | PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector | Kaidong Li et.al. | 2504.03563 | null |
2025-04-04 | ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving | Sheng Yang et.al. | 2504.03438 | null |
2025-04-04 | Infrared bubble recognition in the Milky Way and beyond using deep learning | Shimpei Nishimoto et.al. | 2504.03367 | null |
2025-04-04 | Real-Time Roadway Obstacle Detection for Electric Scooters Using Deep Learning and Multi-Sensor Fusion | Zeyang Zheng et.al. | 2504.03171 | null |
2025-04-04 | Finding the Reflection Point: Unpadding Images to Remove Data Augmentation Artifacts in Large Open Source Image Datasets for Machine Learning | Lucas Choi et.al. | 2504.03168 | null |
2025-04-03 | Attention-Aware Multi-View Pedestrian Tracking | Reef Alturki et.al. | 2504.03047 | null |
2025-04-03 | LiDAR-based Object Detection with Real-time Voice Specifications | Anurag Kulkarni et.al. | 2504.02920 | null |
2025-04-03 | BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation | Van Nguyen Nguyen et.al. | [2504.02812](http://arxiv. |