加载头像
论文笔记
2024
【论文笔记】VisionZip: Longer is Better but Not Necessary in Vision Language Models
【论文笔记】VisionZip: Longer is Better but Not Necessary in Vision Language Models21
【论文笔记】LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
【论文笔记】LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment22
【论文笔记】BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues
【论文笔记】BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues23
【论文笔记】A Token-level Contrastive Framework for Sign Language Translation
【论文笔记】A Token-level Contrastive Framework for Sign Language Translation24
【论文笔记】A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation
【论文笔记】A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation25
【论文笔记】Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation
【论文笔记】Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation26
【论文笔记】Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions
【论文笔记】Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions27
【论文笔记】Towards Online Continuous Sign Language Recognition and Translation
【论文笔记】Towards Online Continuous Sign Language Recognition and Translation28
【论文笔记】Number it: Temporal Grounding Videos like Flipping Manga
【论文笔记】Number it: Temporal Grounding Videos like Flipping Manga29
【论文笔记】Improved Baselines with Visual Instruction Tuning
【论文笔记】Improved Baselines with Visual Instruction Tuning30
引用到评论
随便逛逛博客分类文章标签
复制地址关闭热评深色模式轉為繁體