These new AI-focused SBCs pair multicore processors with integrated NPUs for edge computing, robotics, and embedded ...
We present HunyuanVideo, a novel open-source video foundation model that exhibits performance in video generation that is comparable to, if not superior to, leading closed-source models. In order to ...
TL;DR: We propose SAKURA, a benchmark for LALM multi-hop reasoning, and reveal their deficiency of such reasoning skills. Large audio-language models (LALMs) extend the large language models with ...
Abstract: Jailbreak attacks to Large audio-language models (LALMs) are studied recently, but they exclusively focused on the attack scenario where the adversary can fully manipulate user prompts ...