Definition
Neural network components that encode the meaning of inputs such as text and images into internal representations a model can act on.
Detailed Overview
Genie 3 uses multimodal semantic encoders to interpret text and image inputs, which lets it translate prompts accurately into visual worlds.
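As a rough illustration of the general idea, the sketch below shows two toy encoders that turn a text prompt and an image into fixed-length vectors, which are then combined into a single conditioning vector. This is not Genie 3's actual architecture, which is not described here. Every name in it (DIM, encode_text, encode_image, fuse, cosine_similarity) is hypothetical, and the hashing and histogram steps stand in for learned neural networks.

```python
import hashlib
import math

DIM = 16  # embedding size; an arbitrary choice for this sketch


def _normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def encode_text(prompt):
    """Toy text encoder: hash each word into one slot of a fixed-size vector.

    A real semantic encoder would be a trained network; hashing is only a
    stand-in so the example runs without any dependencies.
    """
    vec = [0.0] * DIM
    for word in prompt.lower().split():
        digest = hashlib.sha256(word.encode("utf-8")).digest()
        vec[digest[0] % DIM] += 1.0
    return _normalize(vec)


def encode_image(pixels):
    """Toy image encoder: histogram of grayscale intensities (0-255) in DIM bins."""
    vec = [0.0] * DIM
    for row in pixels:
        for p in row:
            vec[min(p * DIM // 256, DIM - 1)] += 1.0
    return _normalize(vec)


def fuse(text_vec, image_vec):
    """Combine both modalities by concatenation (one simple fusion choice)."""
    return text_vec + image_vec


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na > 0 and nb > 0 else 0.0


text_vec = encode_text("a snowy mountain village")
image_vec = encode_image([[0, 64, 128], [192, 255, 32]])
conditioning = fuse(text_vec, image_vec)
```

The point of the sketch is the interface rather than the internals: each modality gets its own encoder, both produce vectors of a known size, and a downstream generator would consume the combined vector. In a trained system, prompts with similar meaning would end up with similar vectors, which is what cosine_similarity would measure.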
Related Terms
Input Methods
Multimodal Input
Support for both text descriptions and image inputs, so that 3D worlds can be generated from different types of prompts.
AI Architecture
Neural Networks
Computing systems inspired by biological neural networks, forming the foundation of modern AI.
AI Architecture
Representation Learning
Learning useful representations of data that capture underlying structure and meaning.