Description:

Kandinsky 3.0 is an open-source text-to-image diffusion model built upon the Kandinsky2-x model family. Compared to its predecessors, Kandinsky 3.0 incorporates more data, including data specifically related to Russian culture, which allows it to generate pictures on Russian cultural themes. Text understanding and visual quality have also been improved by increasing the size of the text encoder and the diffusion U-Net, respectively.

  • Stampela
    1 year ago

    Unless they aim for a specialized model? I don’t have insight on the matter, just a guess.