Yi-Jen Shih | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Shinji Watanabe
239 publications
Hung-Yi Lee
187 publications
Yu Tsao
127 publications
Yi-Hsuan Yang
70 publications
Abdelrahman Mohamed
41 publications
David Harwath
35 publications
Shang-Wen Li
34 publications
Yi-Ting Chen
26 publications
Haibin Wu
23 publications
Po-Yao Huang
20 publications
Heng-Jui Chang
11 publications

research

∙ 09/19/2023

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models

Audio-visual representation learning aims to develop systems with human-...

0 Yuan Tseng, et al. ∙

research

∙ 11/02/2022

M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval

This work investigates the use of large-scale, pre-trained models (CLIP ...

0 Layne Berry, et al. ∙

research

∙ 10/03/2022

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model

Data-driven speech processing models usually perform well with a large a...

0 Yi-Jen Shih, et al. ∙

research

∙ 11/07/2021

Theme Transformer: Symbolic Music Generation with Theme-Conditioned Transformer

Attention-based Transformer models have been increasingly employed for a...

0 Yi-Jen Shih, et al. ∙

Success!

An error occurred