Count objects in an image by drawing a region of interest
Extract invoice details from images
Engage in multimedia chat with LLMs and ML models
Extract information from invoices
Transform images and videos using text prompts