Skip to content
#

grok-ai

Here is 1 public repository matching this topic...

MindTrial: Evaluate and compare AI language models (LLMs) on text-based tasks with optional file/image attachments and tool use. Supports multiple providers (OpenAI, Google, Anthropic, DeepSeek, Mistral AI, xAI, Alibaba, Moonshot AI), custom tasks in YAML, and HTML/CSV reports.

  • Updated Nov 21, 2025
  • Go

Improve this page

Add a description, image, and links to the grok-ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the grok-ai topic, visit your repo's landing page and select "manage topics."

Learn more