Page 2 | Best Open Source Mac AI Models 2025

AI Models for Mac

View 108 business solutions

AI Models Mac Clear Filters

Simplify IT and security with a single endpoint management platform
Automate the hardest parts of IT

NinjaOne automates the hardest parts of IT, delivering visibility, security, and control over all endpoints for more than 20,000 customers. The NinjaOne automated endpoint management platform is proven to increase productivity, reduce security risk, and lower costs for IT teams and managed service providers. The company seamlessly integrates with a wide range of IT and security technologies. NinjaOne is obsessed with customer success and provides free and unlimited onboarding, training, and support.

Learn More
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

Hunyuan3D 2.0

High-Resolution 3D Assets Generation with Large Scale Diffusion Models

The Hunyuan3D-2 model, developed by Tencent, is designed for generating high-resolution 3D assets using large-scale diffusion models. This model offers advanced capabilities for creating detailed 3D models, including texture enhancements, multi-view shape generation, and rapid inference for real-time applications. It is particularly useful for industries requiring high-quality 3D content, such as gaming, film, and virtual reality. Hunyuan3D-2 supports various enhancements and is available for deployment through tools like Blender and Hugging Face.

Downloads: 23 This Week

Last Update: 2025-03-20
See Project
2

DiffRhythm

Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation

DiffRhythm is an open-source, diffusion-based model designed to generate full-length songs. Focused on music creation, it combines advanced AI techniques to produce coherent and creative audio compositions. The model utilizes a latent diffusion architecture, making it capable of producing high-quality, long-form music. It can be accessed on Huggingface, where users can interact with a demo or download the model for further use. DiffRhythm offers tools for both training and inference, and its flexibility makes it ideal for AI-based music production and research in music generation.

1 Review

Downloads: 12 This Week

Last Update: 2025-03-06
See Project
3

FinGPT

Open-Source Financial Large Language Models!

FinGPT is an open-source large language model tailored specifically for financial tasks. Developed by AI4Finance Foundation, it is designed to assist with various financial applications, such as forecasting, financial sentiment analysis, and portfolio management. FinGPT has been trained on a diverse range of financial datasets, making it a powerful tool for finance professionals looking to leverage AI for data-driven decision-making. The model is freely available on platforms like Hugging Face, allowing for easy access and customization. FinGPT's capabilities are extended by its ability to integrate with existing financial systems and enhance predictive analytics in finance.

1 Review

Downloads: 12 This Week

Last Update: 2025-03-03
See Project
4

Grok-1

Open-source, high-performance Mixture-of-Experts large language model

Grok-1 is a 314-billion-parameter Mixture-of-Experts (MoE) large language model developed by xAI. Designed to optimize computational efficiency, it activates only 25% of its weights for each input token. In March 2024, xAI released Grok-1's model weights and architecture under the Apache 2.0 license, making them openly accessible to developers. The accompanying GitHub repository provides JAX example code for loading and running the model. Due to its substantial size, utilizing Grok-1 requires a machine with significant GPU memory. The repository's MoE layer implementation prioritizes correctness over efficiency, avoiding the need for custom kernels. This is a full repo snapshot ZIP file of the Grok-1 code.

1 Review

Downloads: 11 This Week

Last Update: 2025-02-27
See Project
Powering the best of the internet | Fastly
Fastly's edge cloud platform delivers faster, safer, and more scalable sites and apps to customers.

Ensure your websites, applications and services can effortlessly handle the demands of your users with Fastly. Fastly’s portfolio is designed to be highly performant, personalized and secure while seamlessly scaling to support your growth.

Try for free
5

Qwen

Qwen (通义千问) chat/pretrained large language model Alibaba Cloud

Qwen is a series of large language models developed by Alibaba Cloud, consisting of various pretrained versions like Qwen-1.8B, Qwen-7B, Qwen-14B, and Qwen-72B. These models, which range from smaller to larger configurations, are designed for a wide range of natural language processing tasks. They are openly available for research and commercial use, with Qwen's code and model weights shared on GitHub. Qwen's capabilities include text generation, comprehension, and conversation, making it a versatile tool for developers looking to integrate advanced AI functionalities into their applications.

1 Review

Downloads: 11 This Week

Last Update: 2025-03-03
See Project
6

Janus-Pro

Janus-Series: Unified Multimodal Understanding and Generation Models

Janus is a cutting-edge, unified multimodal model designed to advance both multimodal understanding and generation. It features a decoupled visual encoding approach that allows it to handle visual tasks separately from the generative tasks, resulting in enhanced flexibility and performance. With a singular transformer architecture, Janus outperforms previous models by surpassing specialized task-specific models in its ability to handle diverse multimodal inputs and generate high-quality outputs. Its latest iteration, Janus-Pro, improves on this with a more optimized training strategy, expanded data, and larger model scaling, leading to significant advancements in both multimodal understanding and text-to-image generation.

1 Review

Downloads: 5 This Week

Last Update: 2025-03-04
See Project
7

GLM-4-32B-0414

Open Multilingual Multimodal Chat LMs

GLM-4-32B-0414 is a powerful open-source large language model featuring 32 billion parameters, designed to deliver performance comparable to leading models like OpenAI’s GPT series. It supports multilingual and multimodal chat capabilities with an extensive 32K token context length, making it ideal for dialogue, reasoning, and complex task completion. The model is pre-trained on 15 trillion tokens of high-quality data, including substantial synthetic reasoning datasets, and further enhanced with reinforcement learning and human preference alignment for improved instruction-following and function calling. Variants like GLM-Z1-32B-0414 offer deep reasoning and advanced mathematical problem-solving, while GLM-Z1-Rumination-32B-0414 specializes in long-form, complex research-style writing using scaled reinforcement learning and external search tools. Despite its large capacity, the model supports user-friendly local deployment and efficient fine-tuning with available scripts.

Downloads: 8 This Week

Last Update: 2025-06-27
See Project
8

Blazeface

Blazeface is a lightweight model that detects faces in images

Blazeface is a lightweight, high-performance face detection model designed for mobile and embedded devices, developed by TensorFlow. It is optimized for real-time face detection tasks and runs efficiently on mobile CPUs, ensuring minimal latency and power consumption. Blazeface is based on a fast architecture and uses deep learning techniques to detect faces with high accuracy, even in challenging conditions. It supports multiple face detection in varying lighting and poses, and is designed to work in real-world applications like mobile apps, robotics, and other resource-constrained environments.

Downloads: 5 This Week

Last Update: 2025-03-19
See Project
9

Qwen2.5-Coder

Qwen2.5-Coder is the code version of Qwen2.5, the large language model

Qwen2.5-Coder, developed by QwenLM, is an advanced open-source code generation model designed for developers seeking powerful and diverse coding capabilities. It includes multiple model sizes—ranging from 0.5B to 32B parameters—providing solutions for a wide array of coding needs. The model supports over 92 programming languages and offers exceptional performance in generating code, debugging, and mathematical problem-solving. Qwen2.5-Coder, with its long context length of 128K tokens, is ideal for a variety of use cases, from simple code assistants to complex programming scenarios, matching the capabilities of models like GPT-4o.

1 Review

Downloads: 2 This Week

Last Update: 2025-03-04
See Project
Build Securely on Azure with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
10

MoveNet

A CNN model that predicts human joints from RGB images of a person

The MoveNet model is an efficient, real-time human pose estimation system designed for detecting and tracking keypoints of human bodies. It utilizes deep learning to accurately locate 17 key points across the body, providing precise tracking even with fast movements. Optimized for mobile and embedded devices, MoveNet can be integrated into applications for fitness tracking, augmented reality, and interactive systems.

Downloads: 3 This Week

Last Update: 2025-03-19
See Project
11

CSM (Conversational Speech Model)

A Conversational Speech Generation Model

The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.

Downloads: 2 This Week

Last Update: 2025-03-19
See Project
12

ChatGLM2-6B

An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM

ChatGLM2-6B is an advanced open-source bilingual dialogue model developed by THUDM. It is the second iteration of the ChatGLM series, designed to offer enhanced performance while maintaining the strengths of its predecessor, including smooth conversation flow and low deployment barriers. The model is fine-tuned for both Chinese and English languages, making it a versatile tool for various multilingual applications. ChatGLM2-6B aims to push the boundaries of natural language understanding and generation, offering improved accuracy and user experience compared to earlier models.

Downloads: 1 This Week

Last Update: 2025-03-03
See Project
13

MediaPipe Face Detection

Detect faces in an image

The MediaPipe Face Detection model is a high-performance, real-time face detection solution that uses machine learning to identify faces in images and video streams. It is optimized for mobile and embedded platforms, offering fast and accurate face detection while maintaining a small memory footprint. This model supports multiple face detections and is highly efficient, making it suitable for a variety of applications such as augmented reality, user authentication, and facial expression analysis.

Downloads: 1 This Week

Last Update: 2025-03-19
See Project
14

Universal Sentence Encoder

Encoder of greater-than-word length text trained on a variety of data

The Universal Sentence Encoder (USE) is a pre-trained deep learning model designed to encode sentences into fixed-length embeddings for use in various natural language processing (NLP) tasks. It leverages Transformer and Deep Averaging Network (DAN) architectures to generate embeddings that capture the semantic meaning of sentences. The model is designed for tasks like sentiment analysis, semantic textual similarity, and clustering, and provides high-quality sentence representations in a computationally efficient manner.

Downloads: 1 This Week

Last Update: 2025-03-19
See Project
15

stable-diffusion-v1-4

Text-to-image diffusion model for high-quality image generation

stable-diffusion-v1-4 is a high-performance text-to-image latent diffusion model developed by CompVis. It generates photo-realistic images from natural language prompts using a pretrained CLIP ViT-L/14 text encoder and a UNet-based denoising architecture. This version builds on v1-2, fine-tuned over 225,000 steps at 512×512 resolution on the “laion-aesthetics v2 5+” dataset, with 10% text-conditioning dropout for improved classifier-free guidance. It is optimized for use with Hugging Face’s Diffusers library and supports both PyTorch and JAX/Flax frameworks, offering flexibility across GPUs and TPUs. Though powerful, the model has limitations with compositional logic, photorealism, non-English prompts, and rendering accurate text or faces. Intended for research and creative exploration, it includes safety tools to detect NSFW content but may still reflect dataset biases. Users are advised to follow responsible AI practices and avoid harmful, unethical, or out-of-scope applications.

Downloads: 0 This Week

Last Update: 2025-06-26
See Project
16

BLEURT-20-D12

Custom BLEURT model for evaluating text similarity using PyTorch

BLEURT-20-D12 is a PyTorch implementation of BLEURT, a model designed to assess the semantic similarity between two text sequences. It serves as an automatic evaluation metric for natural language generation tasks like summarization and translation. The model predicts a score indicating how similar a candidate sentence is to a reference sentence, with higher scores indicating greater semantic overlap. Unlike standard BLEURT models from TensorFlow, this version is built from a custom PyTorch transformer library. It requires installing the model-specific library from GitHub to function properly. Once set up, it can be used to compute similarity scores with minimal code. BLEURT-20-D12 enables more flexible deployment in PyTorch-based workflows for evaluating language generation outputs.

Downloads: 0 This Week

Last Update: 2025-07-02
See Project
17

Bio_ClinicalBERT

ClinicalBERT model trained on MIMIC notes for clinical NLP tasks

Bio_ClinicalBERT is a domain-specific language model tailored for clinical natural language processing (NLP), extending BioBERT with additional training on clinical notes. It was initialized from BioBERT-Base v1.0 and further pre-trained on all clinical notes from the MIMIC-III database (~880M words), which includes ICU patient records. The training focused on improving performance in tasks like named entity recognition and natural language inference within the healthcare domain. Notes were processed using rule-based sectioning and tokenized with SciSpacy. Training was done for 150,000 steps using a batch size of 32, max sequence length of 128, and a masked language modeling objective with a 0.15 mask probability. Bio_ClinicalBERT is available through Hugging Face's Transformers library for easy integration. It supports medical AI research and applications involving electronic health record understanding, clinical decision support, and biomedical information extraction.

Downloads: 0 This Week

Last Update: 2025-07-02
See Project
18

BitNet

Inference framework for 1-bit LLMs

BitNet (bitnet.cpp) is a high-performance inference framework designed to optimize the execution of 1-bit large language models, making them more efficient for edge devices and local deployment. The framework offers significant speedups and energy reductions, achieving up to 6.17x faster performance on x86 CPUs and 70% energy savings, allowing the running of models such as the BitNet b1.58 100B with impressive efficiency. With support for lossless inference and enhanced processing power, BitNet enables faster AI applications while minimizing resource usage. It is a crucial tool for developers looking to implement LLMs on local systems, offering quick execution without sacrificing performance or energy efficiency.

Downloads: 0 This Week

Last Update: 2025-04-19
See Project
19

CLIP-ViT-bigG-14-laion2B-39B-b160k

CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B

CLIP-ViT-bigG-14-laion2B-39B-b160k is a powerful vision-language model trained on the English subset of the LAION-5B dataset using the OpenCLIP framework. Developed by LAION and trained by Mitchell Wortsman on Stability AI’s compute infrastructure, it pairs a ViT-bigG/14 vision transformer with a text encoder to perform contrastive learning on image-text pairs. This model excels at zero-shot image classification, image-to-text and text-to-image retrieval, and can be adapted for tasks such as image captioning or generation guidance. It achieves an impressive 80.1% top-1 accuracy on ImageNet-1k without any fine-tuning, showcasing its robustness in open-domain settings. Its training dataset is uncurated and web-sourced, meaning it reflects the biases and risks of large-scale internet data. The model is intended for research use and is not recommended for real-world deployment without domain-specific testing and safety evaluations.

Downloads: 0 This Week

Last Update: 2025-07-02
See Project
20

ControlNet

Extension for Stable Diffusion using edge, depth, pose, and more

ControlNet is a neural network architecture that enhances Stable Diffusion by enabling image generation conditioned on specific visual structures such as edges, poses, depth maps, and segmentation masks. By injecting these auxiliary inputs into the diffusion process, ControlNet gives users powerful control over the layout and composition of generated images while preserving the style and flexibility of generative models. It supports a wide range of conditioning types through pretrained modules, including Canny edges, HED (soft edges), Midas depth, OpenPose skeletons, normal maps, MLSD lines, scribbles, and ADE20k-based semantic segmentation. The system includes both ControlNet+SD1.5 model weights and compatible third-party detectors like Midas and OpenPose to extract input features. Each conditioning type is matched with a specific .pth model file to be used alongside Stable Diffusion for fine-grained control.

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
21

ControlNet-v1-1

ControlNet-1 enables precise image generation via input conditioning

ControlNet-v1-1 is an updated version of the ControlNet architecture designed to enhance control over image generation by conditioning diffusion models with additional inputs such as edges, depth, poses, or other structural cues. Built to work alongside models like Stable Diffusion, it allows users to guide the generation process more precisely while maintaining high visual fidelity. This version improves upon the original with refinements to stability, performance, and compatibility. ControlNet-v1-1 enables use cases like pose transfer, depth-aware rendering, and detailed sketch-to-image workflows. It is especially useful for tasks that require consistency between input structure and output style or content. While official documentation is pending, it is actively integrated into community spaces and tools across Hugging Face. The model is released under the OpenRAIL license and is suitable for creative and research applications with responsible use constraints.

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
22

DeepSWE-Preview

State-of-the-art RL-trained coding agent for complex SWE tasks

DeepSWE-Preview is a 32.8B parameter open-source coding agent trained solely with reinforcement learning (RL) to perform complex software engineering (SWE) tasks. Built on top of Qwen3-32B, it achieves 59% accuracy on the SWE-Bench-Verified benchmark—currently the highest among open-weight models. The model navigates and edits large codebases using tools like a file editor, bash execution, and search, within the R2E-Gym environment. Its training emphasizes sparse reward signals, test-time scaling, and innovative policy gradient strategies adapted from GRPO, DAPO, Dr.GRPO, and RLOO. DeepSWE-Preview showcases strong reasoning, file navigation, and patch submission skills. It is ideal for agent-based code repair, debugging, and PR generation across real-world repositories. The model is served using platforms like vLLM and Hugging Face TGI, with support for 64k context length and OpenAI-compatible APIs.

Downloads: 0 This Week

Last Update: 2025-07-04
See Project
23

DeepSeek-R1-0528

DeepSeek-R1-0528 is a powerful reasoning-focused LLM with 64K context

DeepSeek-R1-0528 is an upgraded large language model developed by DeepSeek AI, designed to improve deep reasoning, inference, and programming capabilities. With a context length of up to 64K tokens and 685 billion parameters, it introduces enhanced algorithmic optimizations and expanded token usage per task. Compared to previous versions, it significantly improves benchmark scores in math (e.g., AIME 2025: 87.5%), logic, and coding tasks like LiveCodeBench and SWE Verified. It supports system prompts, function calling, and provides a smoother coding experience known as “vibe coding.” The model minimizes hallucinations and supports advanced applications like file uploading and web search with structured prompt templates. It also enables model distillation, as shown with DeepSeek-R1-0528-Qwen3-8B, which surpasses other open-source small models in reasoning benchmarks. DeepSeek-R1-0528 is licensed under MIT and supports both research and commercial applications.

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
24

DeepSeek-V3-0324

Advanced multilingual LLM with enhanced reasoning and code generation

DeepSeek-V3-0324 is a powerful large language model by DeepSeek AI that significantly enhances performance over its predecessor, especially in reasoning, programming, and Chinese language tasks. It achieves major benchmark improvements, such as +5.3 on MMLU-Pro and +19.8 on AIME, and delivers more executable, aesthetically improved front-end code. Its Chinese writing and search-answering capabilities have also been refined, generating more fluent, contextually aware long-form outputs. Key upgrades include better multi-turn interactions, function calling accuracy, translation quality, and support for structured outputs like JSON. The model is optimized to run at a system temperature of 0.3 for coherent, deterministic responses, even if API users specify higher temperatures. It offers structured prompt templates for tasks involving file input and web search, with advanced citation formatting in both English and Chinese.

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
25

Devstral

Agentic 24B LLM optimized for coding tasks with 128k context support

Devstral-Small-2505 is a 23.6B parameter language model fine-tuned by Mistral AI and All Hands AI, built specifically for agentic software engineering tasks. Based on Mistral-Small-3.1, it supports a 128k context window and excels in exploring codebases, editing multiple files, and tool usage. The model achieves state-of-the-art open-source performance on SWE-Bench Verified with a score of 46.8%, surpassing much larger models. Devstral is designed for local and production-level deployments, compatible with frameworks like vLLM, Transformers, llama.cpp, and Ollama. It is licensed under Apache 2.0 and is fully open for commercial and non-commercial use. Its Tekken tokenizer allows a 131k vocabulary size for high flexibility in programming languages and natural language inputs. Devstral is the preferred backend for OpenHands, where it acts as the reasoning engine for autonomous code agents.

Downloads: 0 This Week

Last Update: 2025-07-14
See Project