new-star
avatar image $

Feed forward VQGAN+CLIP

0 Любімыя
(0 | 0 voted)
Generating images from text. This model takes as input a text prompt, and returns as an output the VQGAN latent space, which is then transformed into an RGB image. Eventually it minimizes the distance between the CLIP generated image features and the CLIP input text features.

Generating images from text. This model takes as input a text prompt, and returns as an output the VQGAN latent space, which is then transformed into an RGB image. Eventually it minimizes the distance between the CLIP generated image features and the CLIP input text features.

Мадэль фармавання цэн.:

price unknown / product not launched yet
Light
Neutral
Dark
Feed forward VQGAN+CLIP
Feed forward VQGAN+CLIP
Feed forward VQGAN+CLIP
Copy embed code

Аглядайцеся на падобныя прылады штучнага інтэлекту.