2 years agoBLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding&Generationykilcher