Artificial Intelligence · Bullish · Impact 7/10
OmniMem: Perturbation-aware Memory Compression for Streaming Audio-Visual LLMs
cs.AI updates on arXiv.org
AI Analysis
OmniMem is a memory-efficient framework for streaming audio-visual large language models. It targets long-video inference with a modality-aware memory allocation strategy, and it is reported to improve accuracy on video understanding tasks while keeping memory usage compact.
Key Topics
OmniMem, audio-visual LLMs, video-SALMONN 2+, Qwen-2.5-Omni
Originally reported by cs.AI updates on arXiv.org.