Collab with Cynply.
Image generated from VQGAN and CLIP.
VQGAN is a generative adversarial neural network that is good at generating images that look similar to others.
CLIP is another neural network that is able to determine how well a caption (or prompt) matches an image.
Check out this paper for more information
https://arxiv.org/pdf/2204.08583.pdf
Comments1