rag-agent-vision-model
An image-to-text agent built on NLP and the Llama 3.2 11B Vision model. The agent analyzes an image file, extracts keywords, groups them semantically, and writes concise sentences that demonstrate their correct usage.
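The text-processing stages described above (keyword extraction, semantic grouping, sentence writing) can be illustrated with a minimal sketch. This is not the repository's implementation: the vision-model call is left out entirely, and `description` stands in for whatever text the model returns for an image. The `STOPWORDS` set, the `CATEGORIES` lexicon, and all three function names are hypothetical, and the template-based sentences are only a placeholder for what the agent would presumably ask the language model to write.

```python
import re
from collections import defaultdict

# Hypothetical stopword list; a real pipeline would likely use an NLP library.
STOPWORDS = {"a", "an", "and", "the", "in", "on", "of", "with", "is", "are"}

# Hypothetical lexicon used as a stand-in for real semantic grouping
# (for example, clustering word embeddings).
CATEGORIES = {
    "animal": {"dog", "cat", "bird"},
    "object": {"ball", "bench", "car"},
    "place": {"park", "beach", "street"},
}


def extract_keywords(description):
    """Return unique non-stopword tokens in order of first appearance."""
    keywords = []
    for word in re.findall(r"[a-z]+", description.lower()):
        if word not in STOPWORDS and len(word) > 2 and word not in keywords:
            keywords.append(word)
    return keywords


def group_keywords(keywords):
    """Assign each keyword to a lexicon category, or to 'other'."""
    groups = defaultdict(list)
    for keyword in keywords:
        label = next(
            (name for name, members in CATEGORIES.items() if keyword in members),
            "other",
        )
        groups[label].append(keyword)
    return dict(groups)


def craft_sentences(groups):
    """Write one short template sentence per group (placeholder for an LLM)."""
    return [
        f"{label.capitalize()} keywords in the image: {', '.join(words)}."
        for label, words in groups.items()
    ]


if __name__ == "__main__":
    # Stand-in for the vision model's output for some image.
    description = "A brown dog chases a red ball in the park."
    for sentence in craft_sentences(group_keywords(extract_keywords(description))):
        print(sentence)
```

Keeping the three stages as separate functions means each one can later be swapped for a model-backed version without touching the others.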