modalitydance

modalitydance/ar-omni

"AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation"

Python
42
5
MIT License

AR-Omni is a unified autoregressive model that generates text, images, and speech using a single decoder and token stream, designed for researchers and developers working on multimodal AI systems. It addresses modality imbalance through task-aware loss reweighting, improves visual fidelity with token-level perceptual alignment, and offers stability-creativity trade-offs via finite-state decoding strategies.

Total donated
Undistributed
Share with your subscribers:

Recipients

How the donated funds are distributed

Support the dependencies

Support the repos that depend on this repository

Top contributors

Dongjie-Cheng's profile
Dongjie-Cheng
43 contributions
HongruCai's profile
HongruCai
1 contributions

Recent events

Kivach works on the Obyte network, and therefore you can track all donations.

No events yet