Sign in

IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation

By Yinwei Wu and others
While Text-to-Image (T2I) diffusion models excel at generating visually appealing images of individual instances, they struggle to accurately position and control the features generation of multiple instances. The Layout-to-Image (L2I) task was introduced to address the positioning challenges by incorporating bounding boxes as spatial control signals, but it still falls... Show more
September 12, 2024
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation
Click on play to start listening