Vision Quest - AI Engineering Course

👁️ Analyze Images with AI Vision

🖼️ Select an Image to Analyze

Chess Tournament

Magnus Carlsen in a chess tournament setting

Sashimi Platter

Beautiful Japanese sashimi and sushi arrangement

Prescription Form

Glasses prescription document

Winter Dog

Dog playing in winter snow scene

Click an image above to select it for analysis

💬 Your Question or Analysis Request

Ask anything about the image - describe, analyze, identify, or extract information

🤖 Vision Model

🔍 Detail Level

⚠️ Analysis may take 10-20 seconds

👁️ About GPT-4 Vision

🖼️ Image Analysis

AI can identify objects, read text, analyze scenes, and understand visual content in remarkable detail.

💡 Use Cases

Document analysis, scene description, object identification, text extraction, visual reasoning, and creative interpretation.

🤖 Models

GPT-4o: Latest multimodal model with excellent vision capabilities
GPT-4o Mini: Faster, cost-effective option
GPT-4 Vision Preview: Original vision model

⚙️ Detail Levels

High: More detailed analysis, higher cost
Auto: Balanced approach
Low: Faster processing, lower cost

💻 Essential Code

Key Kotlin code for AI vision analysis using OpenAI's GPT-4 Vision API:

// Analyze image with GPT-4 Vision
suspend fun createVisionCompletion(
    prompt: String,
    imageUrl: String,
    model: String = "gpt-4o",
    maxTokens: Int = 500,
    detail: String = "auto"
): ChatCompletionResponse {
    val messages = listOf(
        Message(
            role = "user",
            content = listOf(
                ContentPart(type = "text", text = prompt),
                ContentPart(type = "image_url", imageUrl = ImageUrl(url = imageUrl, detail = detail))
            )
        )
    )
    
    return createChatCompletionWithMessages(messages, model, maxTokens)
}

// Usage in controller
val visionResponse = openAI.createVisionCompletion(
    prompt = "Analyze this image and describe what you see",
    imageUrl = "https://example.com/image.jpg",
    model = "gpt-4o",
    detail = "auto"
)
val analysis = visionResponse.text()