How to evaluate image generation


Evaluating generative AI for vision (GenAI vision) is an evolving field, but here are some key approaches to consider:

Human Evaluation:


Subjective Assessment: Since "good" for a generated image can be subjective, human evaluation is crucial. Recruit users to rate the outputs on factors like:
Photorealism: How realistic and detailed does the image appear?
Relevance: Does the image accurately reflect the prompt or concept?
Style: Does the image adhere to the desired artistic style (e.g., impressionistic, photorealistic)?
Creativity: Does the image go beyond a basic representation and showcase originality?
Diversity: Does the model generate a variety of outputs for the same prompt, avoiding monotony?

Platforms for Human Evaluation:

Tools like Adobe GenLens or Replicate Zoo can streamline the human evaluation process by providing interfaces for collecting user ratings on generated images.

Automatic Metrics:

Limited Effectiveness: While helpful in other domains, traditional metrics like Mean Squared Error (MSE) or Structural Similarity Index (SSIM) may not fully capture the quality of generated images. They focus on pixel-level differences which might not reflect the high-level content or style.

Emerging Techniques:

Frechet Inception Distance (FID): This metric attempts to assess the quality of generated images by measuring the distance between the distribution of features extracted from real images and the generated ones.

The future of creativity is generative ai. Here are slides and deep dive for Generative AI

Another Article

From the blog

Build Dataproducts

How Dataknobs help in building data products

Enterprises are most successful when they treat data like a product. It enable to use data in multiple use cases. However data product should be designed differently compared to software product.

Be Data Centric and well governed

Generative AI is one of approach to build data product

Generative AI has enabled many transformative scenarios. We combine generative AI, AI, automation, web scraping, ingesting dataset to build new data products. We have expertise in generative AI, but for business benefit we define our goal to build data product in data centric manner.

Spotlight

Generative AI slides

  • Learn generative AI - applications, LLM, architecture
  • See best practices for prompt engineering
  • Evaluate whether you should use out of box foundation model, fne tune or use in-context learning
  • Most important - be aware of concerns, issues, challenges, risk of genAI and LLM
  • See vendor comparison - Azure, OpenAI, GCP, Bard, Anthropic. Review framework for cost computation for LLM