Multimodal AI
Artificial intelligence reads, looks, and listens. With all her senses at work, she reaches a fuller understanding than any one alone could give.
- Cross-modal understanding — Combining text, vision, and audio for understanding.
- Cross-modal generation — Generating one modality from another.