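from vertexai.generative_models import GenerativeModel, Image

# Load the multimodal Gemini model.
multimodal_model = GenerativeModel("gemini-1.0-pro-vision")

# Load a local image and send it to the model together with a text prompt.
image = Image.load_from_file("image.jpg")
response = multimodal_model.generate_content(["What is shown in this image?", image])

print(response.text)
# A cat is shown in this picture.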