129 Integrations with Llama
View a list of Llama integrations and software that integrates with Llama below. Compare the best Llama integrations as well as features, ratings, user reviews, and pricing of software that integrates with Llama. Here are the current Llama integrations in 2026:
-
1
Teradata VantageCloud
Teradata
Teradata VantageCloud: The complete cloud analytics and data platform for AI. Teradata VantageCloud is an enterprise-grade, cloud-native data and analytics platform that unifies data management, advanced analytics, and AI/ML capabilities in a single environment. Designed for scalability and flexibility, VantageCloud supports multi-cloud and hybrid deployments, enabling organizations to manage structured and semi-structured data across AWS, Azure, Google Cloud, and on-premises systems. It offers full ANSI SQL support, integrates with open-source tools like Python and R, and provides built-in governance for secure, trusted AI. VantageCloud empowers users to run complex queries, build data pipelines, and operationalize machine learning models—all while maintaining interoperability with modern data ecosystems. -
2
Evertune
Evertune
Evertune is the Generative Engine Optimization (GEO) platform that helps brands improve visibility in AI search across ChatGPT, AI Overview, AI Mode, Gemini, Claude, Perplexity, Meta, DeepSeek and Copilot. Why Leading Enterprise Marketers Choose Evertune: Data Science at Scale: We prompt across every major LLM at volumes that capture response variations and ensure statistical significance for brand monitoring and competitive intelligence. Actionable Strategy, Not Just Dashboards: Specific content, messaging and distribution tactics that increase your AI search visibility. Dedicated Customer Success: Hands-on training and strategic guidance to turn insights into improved performance in AI search. Built for AI search as a channel: Organic visibility today, paid advertising and commerce tomorrow. Proven Leadership: Founded by The Trade Desk veterans who pioneered data-driven digital advertising. Backed by data scientists from OpenAI, Meta and other AI leaders.Starting Price: $3,000 per month -
3
AiAssistWorks
PT Visi Cerdas Digital
AiAssistWorks is the smartest way to use AI in Google Sheets™, Docs™, and Slides™. In Sheets™, just type a simple instruction — and Smart Command uses AI to do the task for you. Instantly generate product descriptions, create formulas, build charts and pivot tables, format data, create tables, validate entries, and more. No formulas. No scripts. No copy-paste. In Docs™, generate, rewrite, translate, create images, and summarize content — all directly inside your document. In Slides™, generate entire presentations or create AI-powered images in just a few clicks. Powered by 100+ AI models including GPT, Claude, Gemini, Llama, Groq, and more — giving you unmatched flexibility. ✅ Free Forever – 100 executions/month with your own API key ✅ Unlimited usage with a paid plan (API key required) ✅ No formulas needed – Fill 1,000+ rows with AI ✅ Automate SEO content, product listings, ad copy, and data labeling in Sheets™, Docs™, and Slides™.Starting Price: $5/month -
4
1min.AI
1min.AI
💡 1min.AI is an all-in-one AI app that unlock all AI features. You pay only for what you use at 1min.AI, with no hidden costs or setup required elsewhere. 🔮 The unique features of 1min.AI is offering a variety of AI features powered by various AI models. You can see it clearly with the Chat with Many Assistants feature, it includes Gemini, GPT, Claude, Llama, MistralAI, ... 🪄 Other multi-media features like Content, Image, Audio, Video can also be used with different models to utilize their abilities and give out the best results. 💰 Lastly, we offer credit estimation and transparent usage history, so you know exact how does the feature cost before running and can track the usage easily. 🚀 Try for Free and get what you want within 1minStarting Price: $5 -
5
Graydient AI
Graydient AI
Graydient AI is one of the best values in AI, with unlimited image and LLM chats. It features easy tools for beginners and very deep customization for professionals, including a REST API. Beginners can enjoy point and click image creation using preset AI workflows like "realistic iphone photo" or "anime movie poster" and get high defintion images in seconds. Pros can dive deeper with over 10,000 preloaded checkpoints, loras, and embeddings and ComfyUI json import. The most popular models are preloaded like Flux.1 Dev FP32, Stable Diffusion 3.5, Pony Diffusion and Meta Llama 3.1 70B. You can train your own LoRa models unlimited, and create macros called Recipes to use all of the above over Telegram chat or a unified Web UI. Graydient has a satisfaction guarantee, so try it today risk-free.Starting Price: $15.99 per month -
6
Firecrawl
Firecrawl
Crawl and convert any website into clean markdown or structured data, it's also open source. We crawl all accessible subpages and give you a clean markdown for each, no sitemap is required. Enhance your applications with top-tier web scraping and crawling capabilities. Extract markdown or structured data from websites quickly and efficiently. Navigate and retrieve data from all accessible subpages, even without a sitemap. Already fully integrated with the greatest existing tools and workflows. Kick off your journey for free and scale seamlessly as your project expands. Developed transparently and collaboratively. Join our community of contributors. Firecrawl crawls all accessible subpages, even without a sitemap. Firecrawl gathers data even if a website uses JavaScript to render content. Firecrawl returns clean, well-formatted markdown, ready for use in LLM applications. Firecrawl orchestrates the crawling process in parallel for the fastest results.Starting Price: $16 per month -
7
Meta AI
Meta
Meta AI is an intelligent assistant that is capable of complex reasoning, following instructions, visualizing ideas, and solving nuanced problems. Meta AI is an intelligent assistant built on Meta's most advanced model. It is designed to answer any question you might have, help with writing, provide step-by-step advice, and create images to share with friends. It is available within Meta's family of apps, smart glasses, and web platforms.Starting Price: Free -
8
PromptX
VE3 Global
PromptX unifies your data across SharePoint, Google Drive, email, cloud, and legacy systems into a single Enterprise Knowledge System. With AI-Powered Search, users ask conversational questions and get context-aware, verifiable answers in seconds. Auto-ingestion, semantic tagging, smart entity recognition transforms unstructured files, emails, and URLs into Knowledge Cards. Adaptive prompts, split-chat pathways, collaborative workspaces, and agent automations streamline workflows. Deployed across any cloud or hybrid environment, PromptX integrates any LLM or external search engine, PromptX scales to any enterprise, all while providing granular permissions, SSO, audit trails, and built-in AI governance. -
9
Bolna
Bolna
Seamlessly onboard and scale your entire front desk operations to pick up every call. You do not need to be experienced with prompt engineering. We provide demo agents and templates to help you get started. Additionally, our enterprise plans include hands-on assistance in creating and testing your agents. We have integrations with the most natural AI voices that deliver human-like conversations. You can choose the voice that suits your use case perfectly. We already have integrations with leading CRMs and have a knowledge base where you can add documents. Bolna is the end-to-end open source production-ready framework for quickly building LLM-based voice-driven conversational applications. Automate all your customer conversations by building human-like voice AI agents in minutes. You can design your own functions and use them in Bolna. -
10
Mangools
Mangools
Mangools is a package of super-user friendly SEO tools to cover everything from keyword research, rank tracking to competitor analysis & backlink analysis. 1. KWFinder is one of the most popular keyword tools with accurate keyword difficulty, precise local SEO data and search engine results for more than 50k locations. 2. SERPChecker shows you search results for more than 50k locations with SERP features, their impact on the organic search results and 45 SEO metrics. 3. SERPWatcher is a rank tracker to track both mobile/desktop ranks. It comes with aggregate metrics, daily updated ranks and charts to quickly see the overall ranking progress. 4. LinkMiner lets you find the most powerful backlinks of your competitors with a preview of anchor texts on the referring website. 5. SiteProfiler is an SEO analysis tool with all the essential SEO metrics & insights under one roof. 6. Mangools AI Search Grader is a free GEO (Generative Engine Optimization) toolStarting Price: $29.90 per month -
11
AI/ML API
AI/ML API
AI/ML API is a game-changing platform for developers and SaaS entrepreneurs looking to integrate cutting-edge AI capabilities into their products. It offers a single point of access to over 200 state-of-the-art AI models, covering everything from NLP to computer vision. Key Features for Developers: Extensive Model Library: 200+ pre-trained models for rapid prototyping and deployment Developer-Friendly Integration: RESTful APIs and SDKs for seamless incorporation into your stack Serverless Architecture: Focus on coding, not infrastructure management Advantages for SaaS Entrepreneurs: Rapid Time-to-Market: Leverage advanced AI without building from scratch Scalability: From MVP to enterprise-grade solutions, AI/ML API grows with your business Cost-Efficiency: Pay-as-you-go pricing model reduces upfront investment Competitive Edge: Stay ahead with continuously updated AI modelsStarting Price: $4.99/week -
12
GPTZero
GPTZero
GPTZero is a leading AI detection platform built to preserve human-authored writing in an era of generative AI. It accurately identifies content written by models such as ChatGPT, GPT-5, Gemini, Claude, and Llama. The platform combines AI detection with writing quality analysis to provide deeper insight into how text was created. GPTZero offers tools like Advanced Scan, plagiarism checking, and hallucination detection for reliable verification. Its Google Docs integration allows users to watch writing replays and identify copy-paste behavior or unnatural typing patterns. GPTZero is widely used in education and professional settings to promote transparency and trust in writing. Independent benchmarks rank GPTZero among the most accurate commercial AI detectors available.Starting Price: $12.99/month -
13
ZeroGPT
ZeroGPT
ZeroGPT is a powerful and free AI detection platform designed to identify AI-generated content from models such as ChatGPT, GPT-5, Gemini, Claude, Grok, DeepSeek, and LLaMA. It analyzes text with high accuracy and highlights AI-written sentences while displaying an overall AI probability score. ZeroGPT supports multiple languages and provides detailed, automatically generated PDF reports that can be used as proof of originality. The platform goes beyond detection by offering a full suite of writing tools, including plagiarism checking, grammar correction, paraphrasing, summarization, and translation. Its intuitive interface allows users to paste text or upload files for instant analysis. ZeroGPT is widely used by individuals and organizations seeking fast, credible AI detection without barriers. Millions of users rely on it for transparent and reliable content verification.Starting Price: $7.99/month -
14
ZenML
ZenML
Simplify your MLOps pipelines. Manage, deploy, and scale on any infrastructure with ZenML. ZenML is completely free and open-source. See the magic with just two simple commands. Set up ZenML in a matter of minutes, and start with all the tools you already use. ZenML standard interfaces ensure that your tools work together seamlessly. Gradually scale up your MLOps stack by switching out components whenever your training or deployment requirements change. Keep up with the latest changes in the MLOps world and easily integrate any new developments. Define simple and clear ML workflows without wasting time on boilerplate tooling or infrastructure code. Write portable ML code and switch from experimentation to production in seconds. Manage all your favorite MLOps tools in one place with ZenML's plug-and-play integrations. Prevent vendor lock-in by writing extensible, tooling-agnostic, and infrastructure-agnostic code.Starting Price: Free -
15
PostgresML
PostgresML
PostgresML is a complete platform in a PostgreSQL extension. Build simpler, faster, and more scalable models right inside your database. Explore the SDK and test open source models in our hosted database. Combine and automate the entire workflow from embedding generation to indexing and querying for the simplest (and fastest) knowledge-based chatbot implementation. Leverage multiple types of natural language processing and machine learning models such as vector search and personalization with embeddings to improve search results. Leverage your data with time series forecasting to garner key business insights. Build statistical and predictive models with the full power of SQL and dozens of regression algorithms. Return results and detect fraud faster with ML at the database layer. PostgresML abstracts the data management overhead from the ML/AI lifecycle by enabling users to run ML/LLM models directly on a Postgres database.Starting Price: $.60 per hour -
16
Athina AI
Athina AI
Athina is a collaborative AI development platform that enables teams to build, test, and monitor AI applications efficiently. It offers features such as prompt management, evaluation tools, dataset handling, and observability, all designed to streamline the development of reliable AI systems. Athina supports integration with various models and services, including custom models, and ensures data privacy through fine-grained access controls and self-hosted deployment options. The platform is SOC-2 Type 2 compliant, providing a secure environment for AI development. Athina's user-friendly interface allows both technical and non-technical team members to collaborate effectively, accelerating the deployment of AI features.Starting Price: Free -
17
Agenta
Agenta
Agenta is an open-source LLMOps platform designed to help teams build reliable AI applications with integrated prompt management, evaluation workflows, and system observability. It centralizes all prompts, experiments, traces, and evaluations into one structured hub, eliminating scattered workflows across Slack, spreadsheets, and emails. With Agenta, teams can iterate on prompts collaboratively, compare models side-by-side, and maintain full version history for every change. Its evaluation tools replace guesswork with automated testing, LLM-as-a-judge, human annotation, and intermediate-step analysis. Observability features allow developers to trace failures, annotate logs, convert traces into tests, and monitor performance regressions in real time. Agenta helps AI teams transition from siloed experimentation to a unified, efficient LLMOps workflow for shipping more reliable agents and AI products.Starting Price: Free -
18
PromptPal
PromptPal
Unleash your creativity with PromptPal, the ultimate platform for discovering and sharing the best AI prompts. Generate new ideas, and boost productivity. Unlock the power of artificial intelligence with PromptPal's over 3,400 free AI prompts. Explore our great catalog of directions and be inspired and more productive today. Browse our large catalog of ChatGPT prompts and get inspired and more productive today. Earn revenue by posting prompts and sharing your prompt engineering skills with the PromptPal community.Starting Price: $3.74 per month -
19
Fleak
Fleak
Fleak is a low-code serverless API builder for data teams that requires no infrastructure and allows you to instantly embed API endpoints to your existing modern AI & data tech stack. Start by configuring the essential components of your data workflow. With Fleak, you can transform data, generate text embeddings, and connect to vector databases, all in just a few steps. Fleak's intuitive tools eliminate complexity, helping you build workflows efficiently without the need for complex setups. Add and configure nodes to build your workflow, supporting data types like JSON, SQL, CSV, and plain text. Customize your workflow steps with flexible options to handle various data transformations. Test your workflow and preview results instantly to ensure accuracy before moving forward. Once your workflow is built, Fleak allows you to integrate seamlessly with large language models, databases, and other essential tools.Starting Price: $29 per month -
20
AnythingLLM
AnythingLLM
Any LLM, any document, and any agent, fully private. Install AnythingLLM and its full suite of tools as a single application on your desktop. Desktop AnythingLLM only talks to the services you explicitly connect to and can run fully on your machine without internet connectivity. We don't lock you into a single LLM provider. Use enterprise models like GPT-4, a custom model, or an open-source model like Llama, Mistral, and more. PDFs, word documents, and so much more make up your business, now you can use them all. AnythingLLM comes with sensible and locally running defaults for your LLM, embedder, and storage for full privacy out of the box. AnythingLLM is free for desktop or self-hosted via our GitHub. AnythingLLM cloud hosting starts at $50/month and is built for businesses or teams that need the power of AnythingLLM, but want to have a managed instance of AnythingLLM so they don't have to sweat the technical details.Starting Price: $50 per month -
21
Ragas
Ragas
Ragas is an open-source framework designed to test and evaluate Large Language Model (LLM) applications. It offers automatic metrics to assess performance and robustness, synthetic test data generation tailored to specific requirements, and workflows to ensure quality during development and production monitoring. Ragas integrates seamlessly with existing stacks, providing insights to enhance LLM applications. The platform is maintained by a team of passionate individuals leveraging cutting-edge research and pragmatic engineering practices to empower visionaries redefining LLM possibilities. Synthetically generate high-quality and diverse evaluation data customized for your requirements. Evaluate and ensure the quality of your LLM application in production. Use insights to improve your application. Automatic metrics that helps you understand the performance and robustness of your LLM application.Starting Price: Free -
22
HubSpot AI Search Grader
HubSpot
HubSpot's AI Search Grader is a free tool designed to help brands understand and enhance their presence in AI-powered search engines. By analyzing how your brand appears in AI search results, the tool provides insights into brand sentiment and share of voice, offering a comprehensive score that reflects overall performance. This analysis enables marketers, SEO experts, entrepreneurs, and blog administrators to identify areas for improvement, optimize strategies, and increase brand visibility, traffic, awareness, and sales. Currently, AI Search Grader evaluates results from GPT-4o, with plans to incorporate more AI search engines in the future. The tool is free to use and can be applied to assess your own brand or others within your industry to gauge performance and visibility. As more people move to AI search engines like ChatGPT and Perplexity for answers to their queries, brands will need to think beyond traditional search methods.Starting Price: Free -
23
Diaflow
Diaflow
Diaflow is an enterprise platform for scaling AI across your organization by enabling everyone to deploy AI workflows that drive innovation. From manual processes to fully automated ones, create powerful apps and workflows from any data source across your teams. Effortlessly automate your business’s manual processes with solutions your team will love. Build powerful AI-driven internal apps that you are proud of with Diaflow's intuitive interfaces and components. An innovative way for document creation and edition with Diaflow AI-powered editing tool. Leveraging your expertise, to provide 24/7 support and engagement. Easily manage and transform your data with a built-in AI-enabled spreadsheet solution. Discover how easy it is to use Diaflow to build amazing products for your company. Diaflow provides all you need to create apps and workflows in minutes with no coding required.Starting Price: $199 per month -
24
WebLLM
WebLLM
WebLLM is a high-performance, in-browser language model inference engine that leverages WebGPU for hardware acceleration, enabling powerful LLM operations directly within web browsers without server-side processing. It offers full OpenAI API compatibility, allowing seamless integration with functionalities such as JSON mode, function-calling, and streaming. WebLLM natively supports a range of models, including Llama, Phi, Gemma, RedPajama, Mistral, and Qwen, making it versatile for various AI tasks. Users can easily integrate and deploy custom models in MLC format, adapting WebLLM to specific needs and scenarios. The platform facilitates plug-and-play integration through package managers like NPM and Yarn, or directly via CDN, complemented by comprehensive examples and a modular design for connecting with UI components. It supports streaming chat completions for real-time output generation, enhancing interactive applications like chatbots and virtual assistants.Starting Price: Free -
25
Scout
Scout
Scout is a comprehensive platform that enables users to build, launch, and scale AI solutions efficiently. It offers a workflow builder for creating AI automations using models, web scraping, data storage, API calls, and customized logic. Users can set up automated content ingestion from various sources, including websites and documentation, and connect multiple large language models within a single workflow to find optimal solutions. Deployment options include Copilots for delivering AI-generated answers directly on websites, Slack integration for customer interactions, and APIs and SDKs for building custom AI applications at scale. Scout provides comprehensive testing and tuning features, including evaluations, real-time monitoring, and built-in logging to oversee workflow status, latency, and costs. The platform is trusted by teams building the future.Starting Price: $49 per month -
26
fullmoon
fullmoon
Fullmoon is a free, open source application that enables users to interact with large language models directly on their devices, ensuring privacy and offline accessibility. Optimized for Apple silicon, it operates seamlessly across iOS, iPadOS, macOS, and visionOS platforms. Users can personalize the app by adjusting themes, fonts, and system prompts, and it integrates with Apple's Shortcuts for enhanced functionality. Fullmoon supports models like Llama-3.2-1B-Instruct-4bit and Llama-3.2-3B-Instruct-4bit, facilitating efficient on-device AI interactions without the need for an internet connection.Starting Price: Free -
27
MindMac
MindMac
MindMac is a native macOS application designed to enhance productivity by integrating seamlessly with ChatGPT and other AI models. It supports multiple AI providers, including OpenAI, Azure OpenAI, Google AI with Gemini, Google Cloud Vertex AI with Gemini, Anthropic Claude, OpenRouter, Mistral AI, Cohere, Perplexity, OctoAI, and local LLMs via LMStudio, LocalAI, GPT4All, Ollama, and llama.cpp. MindMac offers over 150 built-in prompt templates to facilitate user interaction and allows for extensive customization of OpenAI parameters, appearance, context modes, and keyboard shortcuts. The application features a powerful inline mode, enabling users to generate content or ask questions within any application without switching windows. MindMac ensures privacy by storing API keys securely in the Mac's Keychain and sending data directly to the AI provider without intermediary servers. The app is free to use with basic features, requiring no account for setup.Starting Price: $29 one-time payment -
28
Overseer AI
Overseer AI
Overseer AI is a platform designed to ensure AI-generated content is safe, accurate, and aligned with user-defined policies. It offers compliance enforcement by automating adherence to regulatory standards through custom policy rules, real-time content moderation to block harmful, toxic, or biased outputs from AI, debugging AI outputs by testing and monitoring responses against custom safety policies, policy-driven AI governance by applying centralized safety rules across all AI interactions, and trust-building for AI by guaranteeing safe, accurate, and brand-compliant outputs. The platform caters to various industries, including healthcare, finance, legal technology, customer support, education technology, and ecommerce & retail, providing tailored solutions to ensure AI responses align with industry-specific regulations and standards. Developers can access comprehensive guides and API references to integrate Overseer AI into their applications.Starting Price: $99 per month -
29
Oumi
Oumi
Oumi is a fully open source platform that streamlines the entire lifecycle of foundation models, from data preparation and training to evaluation and deployment. It supports training and fine-tuning models ranging from 10 million to 405 billion parameters using state-of-the-art techniques such as SFT, LoRA, QLoRA, and DPO. The platform accommodates both text and multimodal models, including architectures like Llama, DeepSeek, Qwen, and Phi. Oumi offers tools for data synthesis and curation, enabling users to generate and manage training datasets effectively. For deployment, it integrates with popular inference engines like vLLM and SGLang, ensuring efficient model serving. The platform also provides comprehensive evaluation capabilities across standard benchmarks to assess model performance. Designed for flexibility, Oumi can run on various environments, from local laptops to cloud infrastructures such as AWS, Azure, GCP, and Lambda.Starting Price: Free -
30
NeoAnalyst.ai
NeoAnalyst.ai
NeoAnalyst is an AI-powered data analysis platform designed to provide business leaders with quick and precise insights without the need for coding or data science expertise. Users can upload any dataset, and NeoAnalyst automatically builds context without requiring extensive user instructions or manual data mapping. The platform offers hundreds of pre-built models for exploratory and statistical analysis, along with 25 AI-generated analysis queries to help users get started. It provides predictive analytics, visual data representations, and tailored recommendations to enhance decision-making. NeoAnalyst is accessible through various subscription plans, including a free tier for individuals, and is designed to streamline data analysis processes for professionals across multiple industries.Starting Price: $19 per month -
31
Ontosight.ai
Partex
Ontosight is a cutting-edge research and AI-powered Q&A platform designed to revolutionize how you access and process information. Leveraging advanced artificial intelligence, Ontosight combines the precision of academic and specialized databases with the ease of conversational search. Whether you're an undergraduate, a Ph.D. student, or a seasoned researcher, Ontosight transforms traditional research into an efficient, insightful, and engaging experience. Why Choose Ontosight? Precision and Depth: Unlike generic search engines, Ontosight AI delivers highly targeted results from specialized databases, academic journals, and clinical trials. Transparency: Always know where your information comes from, with clear citations and references. Efficiency: Save hours by summarizing lengthy papers and highlighting critical insights. Exploration of Connections: Highlights relationships between entities—like genes, molecules, or treatments—helping you uncover patterns and ideas for iStarting Price: $11 per month -
32
Basalt
Basalt
Basalt is an AI-building platform that helps teams quickly create, test, and launch better AI features. With Basalt, you can prototype quickly using our no-code playground, allowing you to draft prompts with co-pilot guidance and structured sections. Iterate efficiently by saving and switching between versions and models, leveraging multi-model support and versioning. Improve your prompts with recommendations from our co-pilot. Evaluate and iterate by testing with realistic cases, upload your dataset, or let Basalt generate it for you. Run your prompt at scale on multiple test cases and build confidence with evaluators and expert evaluation sessions. Deploy seamlessly with the Basalt SDK, abstracting and deploying prompts in your codebase. Monitor by capturing logs and monitoring usage in production, and optimize by staying informed of new errors and edge cases.Starting Price: Free -
33
Arch
Arch
Arch is an intelligent gateway designed to protect, observe, and personalize AI agents through seamless integration with your APIs. Built on Envoy Proxy, Arch offers secure handling, intelligent routing, robust observability, and integration with backend systems, all external to business logic. It features an out-of-process architecture compatible with various application languages, enabling quick deployment and transparent upgrades. Engineered with specialized sub-billion parameter Large Language Models (LLMs), Arch excels in critical prompt-related tasks such as function calling for API personalization, prompt guards to prevent toxic or jailbreak prompts, and intent-drift detection to enhance retrieval accuracy and response efficiency. Arch extends Envoy's cluster subsystem to manage upstream connections to LLMs, providing resilient AI application development. It also serves as an edge gateway for AI applications, offering TLS termination, rate limiting, and prompt-based routing.Starting Price: Free -
34
Unsloth
Unsloth
Unsloth is an open source platform designed to accelerate and optimize the fine-tuning and training of Large Language Models (LLMs). It enables users to train custom models, such as ChatGPT, in just 24 hours instead of the typical 30 days, achieving speeds up to 30 times faster than Flash Attention 2 (FA2) while using 90% less memory. Unsloth supports both LoRA and QLoRA fine-tuning techniques, allowing for efficient customization of models like Mistral, Gemma, and Llama versions 1, 2, and 3. Unsloth's efficiency stems from manually deriving computationally intensive mathematical steps and handwriting GPU kernels, resulting in significant performance gains without requiring hardware modifications. Unsloth delivers a 10x speed increase on a single GPU and up to 32x on multi-GPU systems compared to FA2, with compatibility across NVIDIA GPUs from Tesla T4 to H100, and portability to AMD and Intel GPUs.Starting Price: Free -
35
Axolotl
Axolotl
Axolotl is an open source tool designed to streamline the fine-tuning of various AI models, offering support for multiple configurations and architectures. It enables users to train models, supporting methods like full fine-tuning, LoRA, QLoRA, ReLoRA, and GPTQ. Users can customize configurations using simple YAML files or command-line interface overrides, and load different dataset formats, including custom or pre-tokenized datasets. Axolotl integrates with technologies like xFormers, Flash Attention, Liger kernel, RoPE scaling, and multipacking, and works with single or multiple GPUs via Fully Sharded Data Parallel (FSDP) or DeepSpeed. It can be run locally or on the cloud using Docker and supports logging results and checkpoints to several platforms. It is designed to make fine-tuning AI models friendly, fast, and fun, without sacrificing functionality or scale.Starting Price: Free -
36
LLaMA-Factory
hoshi-hiyouga
LLaMA-Factory is an open source platform designed to streamline and enhance the fine-tuning process of over 100 Large Language Models (LLMs) and Vision-Language Models (VLMs). It supports various fine-tuning techniques, including Low-Rank Adaptation (LoRA), Quantized LoRA (QLoRA), and Prefix-Tuning, allowing users to customize models efficiently. It has demonstrated significant performance improvements; for instance, its LoRA tuning offers up to 3.7 times faster training speeds with better Rouge scores on advertising text generation tasks compared to traditional methods. LLaMA-Factory's architecture is designed for flexibility, supporting a wide range of model architectures and configurations. Users can easily integrate their datasets and utilize the platform's tools to achieve optimized fine-tuning results. Detailed documentation and diverse examples are provided to assist users in navigating the fine-tuning process effectively.Starting Price: Free -
37
TypeThink
TypeThink
TypeThinkAI is an all-in-one AI platform that integrates multiple leading AI models and tools into a single, user-friendly ecosystem. It offers features like multi-model chat, image and video generation, real-time web search, and code interpretation, catering to diverse needs such as content creation, research, and problem-solving. Users might choose TypeThinkAI to streamline their workflows, enhance productivity, and access a wide range of AI capabilities without switching between multiple platforms, making it an efficient solution for content creators, researchers, developers, and business professionals alike. Type-Think-AI integrates with the leading AI model providers, giving you access to the best models for your specific needs. Type-Think-AI streamlines the process of working with AI models, making them accessible and easy to use. Seamlessly switch between different AI models during your conversation.Starting Price: $10 per month -
38
Skott
Lyzr AI
Skott is an AI marketing agent that autonomously researches, writes, and posts content, allowing your team to focus more on strategy and creative endeavors. It offers a customizable UI and workflow, providing actionable insights to guide your strategy, stay ahead of trends with real-time data, conduct in-depth competitive analysis, and gain audience insights to tailor your content effectively. Skott excels in stellar content creation by crafting high-impact blog posts, engaging social media content, SEO-optimized writing, and maintaining a consistent brand voice across all platforms. It ensures seamless publishing by allowing you to publish across multiple channels effortlessly, maintain consistent formatting and optimization, automate scheduling, and integrate with major blogging and social media platforms. Skott is cost-effective, offering affordable, high-quality marketing solutions that maximize your ROI without overspending or hiring additional resources.Starting Price: $99 per month -
39
Mastra AI
Mastra AI
Mastra is a powerful TypeScript framework for building intelligent AI agents that can execute tasks, access knowledge bases, and maintain memory persistently within workflows. This framework simplifies the process of creating and deploying AI-powered agents by leveraging TypeScript’s capabilities to streamline development. With features like customizable agent instructions, memory, and task orchestration, Mastra provides developers with the tools to build and scale AI agents for various applications, from personal assistants to specialized domain experts.Starting Price: Free -
40
Llama 4 Behemoth
Meta
Llama 4 Behemoth is Meta's most powerful AI model to date, featuring a massive 288 billion active parameters. It excels in multimodal tasks, outperforming previous models like GPT-4.5 and Gemini 2.0 Pro across multiple STEM-focused benchmarks such as MATH-500 and GPQA Diamond. As the teacher model for the Llama 4 series, Behemoth sets the foundation for models like Llama 4 Maverick and Llama 4 Scout. While still in training, Llama 4 Behemoth demonstrates unmatched intelligence, pushing the boundaries of AI in fields like math, multilinguality, and image understanding.Starting Price: Free -
41
Llama 4 Maverick
Meta
Llama 4 Maverick is one of the most advanced multimodal AI models from Meta, featuring 17 billion active parameters and 128 experts. It surpasses its competitors like GPT-4o and Gemini 2.0 Flash in a broad range of benchmarks, especially in tasks related to coding, reasoning, and multilingual capabilities. Llama 4 Maverick combines image and text understanding, enabling it to deliver industry-leading results in image-grounding tasks and precise, high-quality output. With its efficient performance at a reduced parameter size, Maverick offers exceptional value, especially in general assistant and chat applications.Starting Price: Free -
42
Llama 4 Scout
Meta
Llama 4 Scout is a powerful 17 billion active parameter multimodal AI model that excels in both text and image processing. With an industry-leading context length of 10 million tokens, it outperforms its predecessors, including Llama 3, in tasks such as multi-document summarization and parsing large codebases. Llama 4 Scout is designed to handle complex reasoning tasks while maintaining high efficiency, making it perfect for use cases requiring long-context comprehension and image grounding. It offers cutting-edge performance in image-related tasks and is particularly well-suited for applications requiring both text and visual understanding.Starting Price: Free -
43
Alumnium
Alumnium
Alumnium is an open source AI-powered test automation tool that bridges the gap between human and automated testing by translating plain-language test instructions into executable browser commands. It integrates seamlessly with popular web automation tools like Selenium and Playwright, allowing software and test engineers to accelerate browser test creation without sacrificing precision or control. Alumnium supports any Python test framework and leverages large language models (LLMs) from providers such as Anthropic, Google Gemini, OpenAI, and Meta Llama to interpret instructions and generate browser interactions. Users can write test cases using simple commands: do to describe steps, check to verify results, and get to extract data from the page. Alumnium utilizes the web page's accessibility tree and, if needed, screenshots to execute tests, ensuring compatibility with various web applications.Starting Price: Free -
44
Lorelight
Lorelight
Lorelight is an AI brand monitoring platform that enables communication professionals to track, analyze, and optimize their brand's presence across major AI platforms such as ChatGPT, Claude, Gemini, Meta, Deepseek, and Mistral. By creating a brand project, users can automatically set up monitoring, identify key competitors, and deploy smart prompts tailored to their industry. Lorelight provides share of voice analytics to measure a brand's weighted share compared to competitors in AI-generated responses, utilizing an inverse rank formula to prioritize top mentions. It also offers AI sentiment analysis to understand how AI portrays a brand, positive, negative, or neutral, with context. Users can discover organic mentions of their brand in AI conversations they didn't initiate, gaining insights into their competitive positioning.Starting Price: $49 per month -
45
RankLLM
Castorini
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. It offers a suite of rerankers, pointwise models like MonoT5, pairwise models like DuoT5, and listwise models compatible with vLLM, SGLang, or TensorRT-LLM. Additionally, it supports RankGPT and RankGemini variants, which are proprietary listwise rerankers. It includes modules for retrieval, reranking, evaluation, and response analysis, facilitating end-to-end workflows. RankLLM integrates with Pyserini for retrieval and provides integrated evaluation for multi-stage pipelines. It also includes a module for detailed analysis of input prompts and LLM responses, addressing reliability concerns with LLM APIs and non-deterministic behavior in Mixture-of-Experts (MoE) models. The toolkit supports various backends, including SGLang and TensorRT-LLM, and is compatible with a wide range of LLMs.Starting Price: Free -
46
Pinecone Rerank v0
Pinecone
Pinecone Rerank V0 is a cross-encoder model optimized for precision in reranking tasks, enhancing enterprise search and retrieval-augmented generation (RAG) systems. It processes queries and documents together to capture fine-grained relevance, assigning a relevance score from 0 to 1 for each query-document pair. The model's maximum context length is set to 512 tokens to preserve ranking quality. Evaluations on the BEIR benchmark demonstrated that Pinecone Rerank V0 achieved the highest average NDCG@10, outperforming other models on 6 out of 12 datasets. For instance, it showed up to a 60% boost on the Fever dataset compared to Google Semantic Ranker and over 40% on the Climate-Fever dataset relative to cohere-v3-multilingual or voyageai-rerank-2. The model is accessible through Pinecone Inference and is available to all users in public preview.Starting Price: $25 per month -
47
Parasail
Parasail
Parasail is an AI deployment network offering scalable, cost-efficient access to high-performance GPUs for AI workloads. It provides three primary services, serverless endpoints for real-time inference, Dedicated instances for private model deployments, and Batch processing for large-scale tasks. Users can deploy open source models like DeepSeek R1, LLaMA, and Qwen, or bring their own, with the platform's permutation engine matching workloads to optimal hardware, including NVIDIA's H100, H200, A100, and 4090 GPUs. Parasail emphasizes rapid deployment, with the ability to scale from a single GPU to clusters within minutes, and offers significant cost savings, claiming up to 30x cheaper compute compared to legacy cloud providers. It supports day-zero availability for new models and provides a self-service interface without long-term contracts or vendor lock-in.Starting Price: $0.80 per million tokens -
48
kluster.ai
kluster.ai
Kluster.ai is a developer-centric AI cloud platform designed to deploy, scale, and fine-tune large language models (LLMs) with speed and efficiency. Built for developers by developers, it offers Adaptive Inference, a flexible and scalable service that adjusts seamlessly to workload demands, ensuring high-performance processing and consistent turnaround times. Adaptive Inference provides three distinct processing options: real-time inference for ultra-low latency needs, asynchronous inference for cost-effective handling of flexible timing tasks, and batch inference for efficient processing of high-volume, bulk tasks. It supports a range of open-weight, cutting-edge multimodal models for chat, vision, code, and more, including Meta's Llama 4 Maverick and Scout, Qwen3-235B-A22B, DeepSeek-R1, and Gemma 3 . Kluster.ai's OpenAI-compatible API allows developers to integrate these models into their applications seamlessly.Starting Price: $0.15per input -
49
Kodosumi
Masumi
Kodosumi is an open source, framework-agnostic runtime environment built on Ray for deploying, managing, and scaling agentic services at the enterprise level. It enables effortless deployment of AI agents with a single YAML config, offering minimal setup overhead and no vendor lock-in. Designed for handling bursty traffic and long-running workflows, it dynamically scales across Ray clusters to ensure consistent performance. Kodosumi integrates real-time logging and monitoring through the Ray dashboard, providing instant observability and streamlined debugging of complex flows. Core building blocks include autonomous agents (task performers), orchestrated flows, and deployable agentic services, all managed via a pragmatic web admin panel.Starting Price: Free -
50
NativeMind
NativeMind
NativeMind is an open source, on-device AI assistant that runs entirely in your browser via Ollama integration, ensuring absolute privacy by never sending data to the cloud. Everything, from model inference to prompt processing, occurs locally, so there’s no syncing, logging, or data leakage. Users can load and switch between powerful open models such as DeepSeek, Qwen, Llama, Gemma, and Mistral instantly, without additional setup, and leverage native browser features for streamlined workflows. NativeMind offers clean, concise webpage summarization; persistent, context-aware chat across multiple tabs; local web search that retrieves and answers queries directly within the page; and immersive, format-preserving translation of entire pages. Built for speed and security, the extension is fully auditable and community-backed, delivering enterprise-grade performance for real-world use cases without vendor lock-in or hidden telemetry.Starting Price: Free