DoorDash has launched a multimodal machine learning system that aligns product images, text, and user queries in a shared embedding space. Trained on 32 million labeled query-product pairs using ...
Image-2, a text-to-image model ranking third on the Arena leaderboard, but daily caps and square-only output limit its appeal ...