AutoCAD Text Alignment

A Multiview Text Imagination Network Based on Latent Alignment for Image-Text Matching

Abstract: In image-text matching fields, one of the keys to improving performance is to extract features with more semantic information. Existing works demonstrate that semantic enrichment through ...

IEEE

Improving Image-Text Matching With Bidirectional Consistency of Cross-Modal Alignment

Abstract: Image-text matching is a fundamental task in bridging the semantics between vision and language. The key challenge lies in establishing accurate alignment between two heterogeneous ...

GitHub

Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment [CVPR-2024]

In this work, we tackle the problem of domain generalization for object detection, specifically focusing on the scenario where only a single source domain is available. We propose an effective ...

GitHub

Moshi: a speech-text foundation model for real time dialogue

Finally, the code for the web UI client used in the Moshi demo is provided in the client/ directory. If you want to fine tune Moshi, head out to kyutai-labs/moshi ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results