These new AI-focused SBCs pair multicore processors with integrated NPUs for edge computing, robotics, and embedded ...
We present HunyuanVideo, a novel open-source video foundation model that exhibits performance in video generation that is comparable to, if not superior to, leading closed-source models. In order to ...
TL;DR: We propose SAKURA, a benchmark for LALM multi-hop reasoning, and reveal their deficiency of such reasoning skills. Large audio-language models (LALMs) extend the large language models with ...
Abstract: Jailbreak attacks to Large audio-language models (LALMs) are studied recently, but they exclusively focused on the attack scenario where the adversary can fully manipulate user prompts ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results