Alternatives to OpenCV

Compare OpenCV alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to OpenCV in 2026. Compare features, ratings, user reviews, pricing, and more from OpenCV competitors and alternatives in order to make an informed decision for your business.

  • 1
    Google Cloud Vision AI
    Derive insights from your images in the cloud or at the edge with AutoML Vision or use pre-trained Vision API models to detect emotion, understand text, and more. Google Cloud offers two computer vision products that use machine learning to help you understand your images with industry-leading prediction accuracy. Automate the training of your own custom machine learning models. Simply upload images and train custom image models with AutoML Vision’s easy-to-use graphical interface; optimize your models for accuracy, latency, and size; and export them to your application in the cloud, or to an array of devices at the edge. Google Cloud’s Vision API offers powerful pre-trained machine learning models through REST and RPC APIs. Assign labels to images and quickly classify them into millions of predefined categories. Detect objects and faces, read printed and handwritten text, and build valuable metadata into your image catalog.
  • 2
    Dataloop AI

    Dataloop AI

    Dataloop AI

    Manage unstructured data and pipelines to develop AI solutions at amazing speed. Enterprise-grade data platform for vision AI. Dataloop is a one-stop shop for building and deploying powerful computer vision pipelines data labeling, automating data ops, customizing production pipelines and weaving the human-in-the-loop for data validation. Our vision is to make machine learning-based systems accessible, affordable and scalable for all. Explore and analyze vast quantities of unstructured data from diverse sources. Rely on automated preprocessing and embeddings to identify similarities and find the data you need. Curate, version, clean, and route your data to wherever it’s needed to create exceptional AI applications.
  • 3
    Azure Computer Vision
    Boost content discoverability, automate text extraction, analyze video in real time, and create products that more people can use by embedding vision capabilities in your apps. Use visual data processing to label content with objects and concepts, extract text, generate image descriptions, moderate content, and understand people’s movement in physical spaces. No machine learning expertise is required.
  • 4
    SimpleCV

    SimpleCV

    SimpleCV

    SimpleCV is an open-source framework for building computer vision applications. With it, you get access to several high-powered computer vision libraries such as OpenCV, without having to first learn about bit depths, file formats, color spaces, buffer management, eigenvalues, or matrix versus bitmap storage. This is computer vision made easy. These are just a small number of things you can do with SimpleCV. If you would like to learn more please refer to our tutorial. There are also many examples included in the SimpleCV directory under the examples folder which can also be downloaded from here. SimpleCV is an open-source framework, meaning that it is a collection of libraries and software that you can use to develop vision applications. It lets you work with the images or video streams that come from webcams, Kinects, FireWire and IP cameras, or mobile phones. It helps you build software to make your various technologies not only see the world but understand it too.
  • 5
    OpenFaceTracker

    OpenFaceTracker

    OpenFaceTracker

    OpenFaceTracker is a facial recognition program capable to detect one or several faces on a picture or a video, and to identify them via a database. OpenFaceTracker needs OpenCV3.2 and QT4 installed on your machine, you’ve got two options, if you love compiling libraries by hand, please follow build_oft, and installing Opencv and QT using your favorite packaging tool. You can compile OFT as a library or you can compile it as a standalone binary file. You can then open the file and execute the detection and recognition module. You can show help and exit, show the list of all available cameras, you can test the XML DB, read from the OFT config, and check the environment. OpenFaceTrackerLib uses Opencv 3.2. This latter has introduced many new algorithms and features comparing to version 2.4. Some modules have been rewritten, some have been reorganized. Although most of the algorithms from 2.4 are still present, the interfaces can differ.
  • 6
    Darknet

    Darknet

    Darknet

    Darknet is an open-source neural network framework written in C and CUDA. It is fast, easy to install, and supports CPU and GPU computation. You can find the source on GitHub or you can read more about what Darknet can do. Darknet is easy to install with only two optional dependencies, OpenCV if you want a wider variety of supported image types, and CUDA if you want GPU computation. Darknet on the CPU is fast but it's like 500 times faster on GPU! You'll have to have an Nvidia GPU and you'll have to install CUDA. By default, Darknet uses stb_image.h for image loading. If you want more support for weird formats (like CMYK jpegs, thanks Obama) you can use OpenCV instead! OpenCV also allows you to view images and detections without having to save them to disk. Classify images with popular models like ResNet and ResNeXt. Recurrent neural networks are all the rage for time-series data and NLP.
  • 7
    Folio3

    Folio3

    Folio3 Software

    Folio3 machine learning company has a team of dedicated Data Scientists and Consultants that have delivered end-to-end projects related to machine learning, natural language processing, computer vision and predictive analysis. Artificial Intelligence and Machine Learning algorithms have enabled companies to utilize highly-customized solutions equipped with advanced Machine Learning capabilities. Computer vision technology has scaled up visual data analysis, introduced new image- based functionalities and transformed the way companies from various verticals utilize visual content. Predictive analytics solutions offered by Folio3 produce effective and fast results, enabling you to identify opportunities and anomalies in your business processes and strategy.
  • 8
    SikuliX

    SikuliX

    SikuliX

    SikuliX is an open source automation tool that enables users to automate any visible element on their desktop screens across Windows, Mac, or certain Linux/Unix systems. It utilizes image recognition powered by OpenCV to identify and interact with screen elements, allowing for the automation of tasks that are otherwise difficult to script. SikuliX offers an Integrated Development Environment (IDE) for writing visual scripts using screenshots, as well as a Java API for integrating image-based automation into existing applications. The software packages representing SikuliX are open source under the MIT license and publicly available for whatever use. SikuliX internally uses OpenCV to support image-related features and Tesseract for text features. The latest stable version, SikuliX 1.1.1, is recommended for use.
    Starting Price: Free
  • 9
    Kibsi

    Kibsi

    Kibsi

    Kibsi is the no-code computer vision platform to build and launch video AI solutions in minutes – not months. Stretch your tech without spending a fortune. From security cameras to webcams, Kibsi converts any live stream camera feed into rich streams of insights and data. View live data, uncover trends, trigger alerts, and automate actions that empower analysts and business leaders with real-time understanding and historical analysis. Kibsi does more than just identify objects, it adds context and relational rules to computer vision through machine learning and proprietary algorithms. Kibsi’s no-code, drag-and-drop experience gets you answers faster. Computer vision programmers and developers are welcome but certainly not required. With 1000s of ready-to-use, built-in objects and classes, you can start getting insights right away. Of course, adding your own objects is easy and automated, too.
    Starting Price: $99 per month
  • 10
    GPUonCLOUD

    GPUonCLOUD

    GPUonCLOUD

    Traditionally, deep learning, 3D modeling, simulations, distributed analytics, and molecular modeling take days or weeks time. However, with GPUonCLOUD’s dedicated GPU servers, it's a matter of hours. You may want to opt for pre-configured systems or pre-built instances with GPUs featuring deep learning frameworks like TensorFlow, PyTorch, MXNet, TensorRT, libraries e.g. real-time computer vision library OpenCV, thereby accelerating your AI/ML model-building experience. Among the wide variety of GPUs available to us, some of the GPU servers are best fit for graphics workstations and multi-player accelerated gaming. Instant jumpstart frameworks increase the speed and agility of the AI/ML environment with effective and efficient environment lifecycle management.
    Starting Price: $1 per hour
  • 11
    Prophesee Metavision
    Metavision is an advanced event-based vision software toolkit developed by Prophesee, designed to facilitate the evaluation, design, and commercialization of event-based vision products. The SDK offers a comprehensive suite of tools, including 64 algorithms, 105 code samples, and 17 tutorials, enabling developers to efficiently build and deploy event-based applications. The open source architecture of Metavision SDK ensures full interoperability between software and hardware devices, fostering a rapidly growing event-based vision community. The platform covers a wide range of computer vision fields, such as machine learning, computer vision, camera calibration, and high-performance applications. Developers have access to extensive documentation, including over 300 pages of content, programming guides, and reference data, providing a solid foundation for product development. Metavision SDK5 PRO includes advanced add-ons like high-speed counting, spatter monitoring, and more.
    Starting Price: Free
  • 12
    Eyewey

    Eyewey

    Eyewey

    Train your own models, get access to pre-trained computer vision models and app templates, learn how to create AI apps or solve a business problem using computer vision in a couple of hours. Start creating your own dataset for detection by adding the images of the object you need to train. You can add up to 5000 images per dataset. After images are added to your dataset, they are pushed automatically into training. Once the model is finished training, you will be notified accordingly. You can simply download your model to be used for detection. You can also integrate your model to our pre-existing app templates for quick coding. Our mobile app which is available on both Android and IOS utilizes the power of computer vision to help people with complete blindness in their day-to-day lives. It is capable of alerting hazardous objects or signs, detecting common objects, recognizing text as well as currencies and understanding basic scenarios through deep learning.
    Starting Price: $6.67 per month
  • 13
    Supervisely

    Supervisely

    Supervisely

    The leading platform for entire computer vision lifecycle. Iterate from image annotation to accurate neural networks 10x faster. With our best-in-class data labeling tools transform your images / videos / 3d point cloud into high-quality training data. Train your models, track experiments, visualize and continuously improve model predictions, build custom solution within the single environment. Our self-hosted solution guaranties data privacy, powerful customization capabilities, and easy integration into your technology stack. A turnkey solution for Computer Vision: multi-format data annotation & management, quality control at scale and neural networks training in end-to-end platform. Inspired by professional video editing software, created by data scientists for data scientists — the most powerful video labeling tool for machine learning and more.
  • 14
    Azure AI Custom Vision
    Create a custom computer vision model in minutes. Customize and embed state-of-the-art computer vision image analysis for specific domains with AI Custom Vision, part of Azure AI Services. Build frictionless customer experiences, optimize manufacturing processes, accelerate digital marketing campaigns, and more. No machine learning expertise is required. Set your model to perceive a particular object for your use case. Easily build your image identifier model using the simple interface. Start training your computer vision model by simply uploading and labeling a few images. The model tests itself on these and continually improves precision through a feedback loop as you add images. To speed development, use customizable, built-in models for retail, manufacturing, and food. See how Minsur, one of the world's largest tin mines, uses AI Custom Vision for sustainable mining. Rely on enterprise-grade security and privacy for your data and any trained models.
    Starting Price: $2 per 1,000 transactions
  • 15
    alwaysAI

    alwaysAI

    alwaysAI

    alwaysAI provides developers with a simple and flexible way to build, train, and deploy computer vision applications to a wide variety of IoT devices. Select from a catalog of deep learning models or upload your own. Use our flexible and customizable APIs to quickly enable core computer vision services. Quickly prototype, test and iterate with a variety of camera-enabled ARM-32, ARM-64 and x86 devices. Identify objects in an image by name or classification. Identify and count objects appearing in a real-time video feed. Follow the same object across a series of frames. Find faces or full bodies in a scene to count or track. Locate and define borders around separate objects. Separate key objects in an image from background visuals. Determine human body poses, fall detection, emotions. Use our model training toolkit to train an object detection model to identify virtually any object. Create a model tailored to your specific use-case.
  • 16
    Vize by Ximilar
    Use deep learning algorithms with the highest accuracy on the market. Implement cutting-edge vision automation faster with no development costs. Create powerful and custom image recognizers in intuitive web interface. We always improve the underlying machine learning algorithms so you are up-to-date. Train custom neural network to recognize your specific images. Ximilar, industry leader in Visual AI an Search, acquired Vize, made it better, faster, and added business-critical features. Go to Ximilar Homepage to discover our services.
  • 17
    AI Verse

    AI Verse

    AI Verse

    When real-life data capture is challenging, we generate diverse, fully labeled image datasets. Our procedural technology ensures the highest quality, unbiased, labeled synthetic datasets that will improve your computer vision model’s accuracy. AI Verse empowers users with full control over scene parameters, ensuring you can fine-tune the environments for unlimited image generation, giving you an edge in the competitive landscape of computer vision development.
  • 18
    Ultralytics

    Ultralytics

    Ultralytics

    Ultralytics offers a full-stack vision-AI platform built around its flagship YOLO model suite that enables teams to train, validate, and deploy computer-vision models with minimal friction. The platform allows you to drag and drop datasets, select from pre-built templates or fine-tune custom models, then export to a wide variety of formats for cloud, edge or mobile deployment. With support for tasks including object detection, instance segmentation, image classification, pose estimation and oriented bounding-box detection, Ultralytics’ models deliver high accuracy and efficiency and are optimized for both embedded devices and large-scale inference. The product also includes Ultralytics HUB, a web-based tool where users can upload their images/videos, train models online, preview results (even on a phone), collaborate with team members, and deploy via an inference API.
  • 19
    Keymakr

    Keymakr

    Keymakr

    Keymakr specializes in providing image and video data annotation, data creation, data collection, and data validation services for AI/ML Computer Vision projects. With a strong technological foundation and expertise, Keymakr efficiently manages data across various domains. Keymakr's motto, "Human teaching for machine learning," reflects its commitment to the human-in-the-loop approach. The company maintains an in-house team of over 600 highly skilled annotators. Keymakr's goal is to deliver custom datasets that enhance the accuracy and efficiency of ML systems. Our services: - Image annotation - Video annotation - Data validation - Data creation - Data collection - Generative AI - Custom AI
    Starting Price: $7/hour
  • 20
    Roboflow

    Roboflow

    Roboflow

    Roboflow has everything you need to build and deploy computer vision models. Connect Roboflow at any step in your pipeline with APIs and SDKs, or use the end-to-end interface to automate the entire process from image to inference. Whether you’re in need of data labeling, model training, or model deployment, Roboflow gives you building blocks to bring custom computer vision solutions to your business.
    Starting Price: $250/month
  • 21
    NeuralVision

    NeuralVision

    Cyth Systems, Inc.

    NeuralVision is a machine vision platform at the forefront of deep learning and artificial intelligence-like abilities applied to the industrial inspection space. For the first time companies are able to have total control of the performance of their machine vision systems and not be dependent on external vision experts to make changes or incorporate new product lines. Traditional machine vision is highly dependent on having a controlled environment, rigid positional tolerances, and ultimately the skill of the vision programmer. It is up to engineers to come up with every algorithm required to inspect a part from measurements to color to correct locations and everything in between. NeuralVision from Cyth Systems was designed to allow a person with no machine vision experience to inspect and classify products. Machine vision systems traditionally work by having an experienced programmer choose one of many analysis algorithms to apply to an image.
  • 22
    Torch

    Torch

    Torch

    Torch is a scientific computing framework with wide support for machine learning algorithms that puts GPUs first. It is easy to use and efficient, thanks to an easy and fast scripting language, LuaJIT, and an underlying C/CUDA implementation. The goal of Torch is to have maximum flexibility and speed in building your scientific algorithms while making the process extremely simple. Torch comes with a large ecosystem of community-driven packages in machine learning, computer vision, signal processing, parallel processing, image, video, audio and networking among others, and builds on top of the Lua community. At the heart of Torch are the popular neural network and optimization libraries which are simple to use, while having maximum flexibility in implementing complex neural network topologies. You can build arbitrary graphs of neural networks, and parallelize them over CPUs and GPUs in an efficient manner.
  • 23
    Cogito

    Cogito

    Cogito Tech LLC

    Cogito Tech is a leading AI data solutions provider specializing in data labeling and annotation services. We deliver high-quality data for applications across computer vision, natural language processing (NLP), and content services. Our expertise extends to fine-tuning large language models (LLMs) through techniques like Reinforcement Learning from Human Feedback (RLHF), enabling rapid deployment and customization to meet business objectives. The company is headquartered in the United States and was featured in The Financial Times’ FT ranking: The Americas’ Fastest-Growing Companies 2025 and Everest Group’s report Data Annotation and Labeling (DAL) Solutions for AI/ML PEAK Matrix® Assessment 2024 Services offered by Cogito: • Image Annotation Service • AI-assisted Data Labeling Service • Medical Image Annotation • NLP & Audio Annotation Service • ADAS Annotation Services • Healthcare Training Data for AI • Audio & Video Transcription Services
    Starting Price: $25/Hour
  • 24
    Ailiverse NeuCore
    Build & scale with ease. With NeuCore you can develop, train and deploy your computer vision model in a few minutes and scale it to millions. A one-stop platform that manages the model lifecycle, including development, training, deployment, and maintenance. Advanced data encryption is applied to protect your information at all stages of the process, from training to inference. Fully integrable vision AI models fit into your existing workflows and systems, or even edge devices easily. Seamless scalability accommodates your growing business needs and evolving business requirements. Divides an image into segments of different objects within the image. Extracts text from images, making it machine-readable. This model also works on handwriting. With NeuCore, building computer vision models is as easy as drag-and-drop and one-click. For more customization, advanced users can access provided code scripts and follow tutorial videos.
  • 25
    Clarifai

    Clarifai

    Clarifai

    Clarifai is a leading AI platform for modeling image, video, text and audio data at scale. Our platform combines computer vision, natural language processing and audio recognition as building blocks for developing better, faster and stronger AI. We help our customers create innovative solutions for visual search, content moderation, aerial surveillance, visual inspection, intelligent document analysis, and more. The platform comes with the broadest repository of pre-trained, out-of-the-box AI models built with millions of inputs and context. Our models give you a head start; extending your own custom AI models. Clarifai Community builds upon this and offers 1000s of pre-trained models and workflows from Clarifai and other leading AI builders. Users can build and share models with other community members. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been recognized by leading analysts, IDC, Forrester and Gartner, as a leading computer vision AI platform. Visit clarifai.com
  • 26
    Sightbit

    Sightbit

    Sightbit

    SightBit provides an AI-powered solution for enhancing safety and security around open water. The company’s proprietary deep-learning AI models and computer vision technology enable capabilities including object detection and classification, drowning detection, hazard detection and prediction, object penetration detection and pollution detection. SightBit’s technology addresses climate challenges by detecting, monitoring, and providing alerts regarding events such as tsunamis and rip currents, while simultaneously providing management capabilities. The company’s solution can easily be deployed using off-the-shelf video cameras, without the need for sensors, edge processors, or customization. SightBit’s core system is based on deep-learning computer vision technology that transmits real-time information to monitors in various control rooms, sounding an alarm when people are in danger, and providing alerts when a system or structure is likely to fail.
  • 27
    Amazon Lookout for Vision
    Easily create a machine learning (ML) model to spot anomalies from your live process line with as few as 30 images. Identify visual anomalies in real time to reduce and prevent defects and improve product quality. Prevent unplanned downtime and reduce operational costs by using visual inspection data to spot potential issues and take corrective action. Spot damage to a product’s surface quality, color, and shape during the fabrication and assembly process. Determine what’s missing based on the absence, presence, or placement of objects, like a missing capacitor in a printed circuit board. Detect defects with repeating patterns, such as repeated scratches in the same spot on a silicon wafer. Amazon Lookout for Vision is an ML service that uses computer vision to spot defects in manufactured products at scale. Spot product defects using computer vision to automate quality inspection.
  • 28
    Fractal Analytics
    Reveal valuable insights by accurately recognizing objects in images and videos. From surveilling people in real-time at events to detecting if products are in the right place in shopping aisles, AI can drive value in many ways. Create in-depth analyses by placing image objects into relevant segments. AI-based algorithms can help insurers analyze home and auto damage to create more accurate claims for customers. Get immediate insights to take action when it matters most. AI algorithms enable real-time processing for a variety of valuable uses, such as face recognition. Understand customer behavior by identifying their actions from video, both in-store and in real-time. AI helps reveal how customers interact with products and brands to drive better experiences. AI-based analytics on satellite images can be used to detect traffic in real-time, analyze parking lots, and segment buildings.
  • 29
    FABIMAGE

    FABIMAGE

    Opto Engineering

    FabImage Studio Professional is data-flow-based software designed for machine vision engineers. It does not require any programming skills, but it is still so powerful that it can win even with solutions based on low-level programming libraries. Also, the architecture is highly flexible, ensuring that users can easily adapt the product to the way they work and to the specific requirements of any project. No low-level programming knowledge is required. Data-flow-based software. Fast and optimized algorithms. 1000+ high-performance functions. Custom machine vision filters. There are over 1000 ready-for-use machine filters tested and optimized on hundreds of applications. They have many advanced capabilities such as outlier suppression, subpixel precision or any-shape region-of-interest. FabImage® Studio is a GigE Vision compliant product, supporting the GenTL interface, as well as a number of vendor-specific APIs.
  • 30
    Weasis

    Weasis

    Weasis

    ​Weasis is a free, open source DICOM viewer designed for both standalone and web-based use, featuring a highly modular architecture. It is widely utilized in healthcare settings, including hospitals, health networks, multicenter research trials, and by patients. As cross-platform software, Weasis offers flexible integration with PACS, RIS, HIS, or EHR systems. The viewer leverages the OpenCV library to deliver high-performance and high-quality medical imaging renderings. From version 4 onwards, Weasis features a responsive user interface aligned with operating system options, offering an enhanced experience on high-resolution screens. Key features include support for a wide range of DICOM files, such as multi-frame, enhanced, MPEG-2, MPEG-4, and more. Users can import DICOM files via DICOM Query/Retrieve (C-GET, C-MOVE, and WADO-URI) and DICOMWeb (QUERY and RETRIEVE), as well as import and export DICOM CD/DVD with DICOMDIR.
    Starting Price: Free
  • 31
    Alfi

    Alfi

    Alfi

    Alfi, Inc. engages in creating interactive digital out-of-home advertising experiences. Alfi utilizes artificial intelligence and computer vision to better serve ads to people. Alfi’s proprietary Ai algorithm understands small facial cues and perceptual details that make potential customers a good candidate for a particular product. The automation works in a way that respects user privacy; without tracking, storing cookies, or using identifiable personal information. Ad agencies are empowered to examine real-time analytics data including interactive experiences, engagement, sentiment, and click-through rate that are otherwise unavailable to out-of-home advertisers. Alfi, powered by AI and machine learning, collects data to understand human behavior for improved analytics with relevant content for a better consumer experience.
  • 32
    Accord.NET Framework

    Accord.NET Framework

    Accord.NET Framework

    The Accord.NET Framework is a .NET machine learning framework combined with audio and image processing libraries completely written in C#. It is a complete framework for building production-grade computer vision, computer audition, signal processing and statistics applications even for commercial use. A comprehensive set of sample applications provide a fast start to get up and running quickly, and an extensive documentation and wiki helps fill in the details.
  • 33
    AWS Panorama
    Add computer vision (CV) to your existing fleet of cameras with AWS Panorama devices, which integrate seamlessly with your local area network. Make predictions locally with high accuracy and low latency from a single management interface, where you can analyze video feeds in milliseconds. Process video feeds at the edge, so you can control where your data is stored and operate with limited internet bandwidth. AWS Panorama is a collection of machine learning (ML) devices and a software development kit (SDK) that brings CV to on-premises internet protocol (IP) cameras. Easily track throughput, optimize freight operations, and recognize objects such as parts or products, or text in labels or barcodes. Monitor traffic lanes for issues such as stopped vehicles, and send real-time alerts to staff to keep traffic flowing. Quickly detect manufacturing anomalies so you can take corrective action and decrease costs.
  • 34
    Ambient.ai

    Ambient.ai

    Ambient.ai

    With Ambient.ai, computer vision intelligence is transforming security tools, operations & outcomes, moving physical security teams from reactive to proactive operations. From autonomous vehicles to robot chefs, computer vision is changing the way that humans & machines collaborate in the real world. By automating repeatable tasks, computer vision enables outsized gains in human productivity. We are a team of machine perception & security experts applying leading-edge computer vision research to the needs of physical security organizations. The privacy vs. security trade-off is a false dichotomy. You can respect individual privacy and increase group security. That’s why we don’t & won’t embrace facial recognition.
  • 35
    Voxel51

    Voxel51

    Voxel51

    FiftyOne by Voxel51 - the most powerful visual AI and computer vision data platform. Without the right data, even the smartest AI models fail. FiftyOne gives machine learning engineers the power to deeply understand and evaluate their visual datasets—across images, videos, 3D point clouds, geospatial, and medical data. With over 2.8 million open source installs and customers like Walmart, GM, Bosch, Medtronic, and the University of Michigan Health, FiftyOne is an indispensable tool for building computer vision systems that work in the real world, not just in the lab. FiftyOne streamlines visual data curation and model analysis with workflows to simplify the labor-intensive processes of visualizing and analyzing insights during data curation and model refinement—addressing a major challenge in large-scale data pipelines with billions of samples. Proven impact with FiftyOne: ⬆️30% increase in model accuracy ⏱️5+ months of development time saved 📈30% boost in productivity
  • 36
    Strong Analytics

    Strong Analytics

    Strong Analytics

    Our platforms provide a trusted foundation upon which to design, build, and deploy custom machine learning and artificial intelligence solutions. Build next-best-action applications that learn, adapt, and optimize using reinforcement-learning based algorithms. Custom, continuously-improving deep learning vision models to solve your unique challenges. Predict the future using state-of-the-art forecasts. Enable smarter decisions throughout your organization with cloud based tools to monitor and analyze. The process of taking a modern machine learning application from research and ad-hoc code to a robust, scalable platform remains a key challenge for experienced data science and engineering teams. Strong ML simplifies this process with a complete suite of tools to manage, deploy, and monitor your machine learning applications.
  • 37
    Innotescus

    Innotescus

    Innotescus

    Innotescus is a collaborative video and image annotation platform built to streamline Computer Vision development processes via seamless data handling, smart annotation tools, and intuitive collaboration features. Additionally, its data visualization tools and cross-functional collaboration features identify data bias early, improve data accuracy, and enable faster, cost-efficient deployment of high performance Artificial Intelligence.
  • 38
    Descartes Labs

    Descartes Labs

    Descartes Labs

    The Descartes Labs Platform is designed to answer some of the world’s most complex and pressing geospatial analytics questions. Our customers use the platform to build algorithms and models that transform their businesses quickly, efficiently, and cost-effectively. By giving data scientists and their line-of-business colleagues the best geospatial data and modeling tools in one package, we help turn AI into a core competency. Data science teams can use our scaling infrastructure to design models faster than ever, using our massive data archive or their own. Customers rely on our cloud-based platform to quickly and securely scale computer vision, statistical, and machine learning models to inform business decisions with powerful raster-based analytics. Our extensive API documentation, tutorials, guides and demos provide a deep knowledge base for users allowing them to quickly deploy high-value applications across diverse industries.
  • 39
    SHARK

    SHARK

    SHARK

    SHARK is a fast, modular, feature-rich open-source C++ machine learning library. It provides methods for linear and nonlinear optimization, kernel-based learning algorithms, neural networks, and various other machine learning techniques. It serves as a powerful toolbox for real-world applications as well as research. Shark depends on Boost and CMake. It is compatible with Windows, Solaris, MacOS X, and Linux. Shark is licensed under the permissive GNU Lesser General Public License. Shark provides an excellent trade-off between flexibility and ease-of-use on the one hand, and computational efficiency on the other. Shark offers numerous algorithms from various machine learning and computational intelligence domains in a way that they can be easily combined and extended. Shark comes with a lot of powerful algorithms that are to our best knowledge not implemented in any other library.
  • 40
    MatConvNet
    The VLFeat open source library implements popular computer vision algorithms specializing in image understanding and local features extraction and matching. Algorithms include Fisher Vector, VLAD, SIFT, MSER, k-means, hierarchical k-means, agglomerative information bottleneck, SLIC superpixels, quick shift superpixels, large scale SVM training, and many others. It is written in C for efficiency and compatibility, with interfaces in MATLAB for ease of use, and detailed documentation throughout. It supports Windows, Mac OS X, and Linux. MatConvNet is a MATLAB toolbox implementing Convolutional Neural Networks (CNNs) for computer vision applications. It is simple, efficient, and can run and learn state-of-the-art CNNs. Many pre-trained CNNs for image classification, segmentation, face recognition, and text detection are available.
  • 41
    BytePlus Effects

    BytePlus Effects

    Byteplus Pte Ltd

    Bring augmented reality experiences to life with our world-class computer vision capabilities. Enables real-time detection of human bodies in images or videos. Supports multi-person detection, half-body detection, position framing and key point output. Detects 18 key points on the human body, including the head, shoulders, feet and others. Tracks movements such as hand raising, bending, jumping and more. Powered by industry-leading algorithms, BytePlus Effects products are highly efficient in computing power consumption, providing unrivaled accuracy and performance. Our software has a proven track record of delivering best-in-class performance, used by apps such as TikTok and Ulike that support hundreds of millions of users. Our engineers continually upgrade algorithms, while our service team provides reliable support.
  • 42
    Wekinator

    Wekinator

    Wekinator

    The Wekinator is free, open source software. Wekinator 1.0 was originally created in 2009 by Rebecca Fiebrink. In 2015, Rebecca released Wekinator 2.0, an entirely new version with redesigned interactions, new algorithms, and ability to connect easily to dozens of other creative coding tools and sensors. Wekinator 2.0 continues to be gently updated with bug fixes and feature requests. It allows anyone to use machine learning to build new musical instruments, gestural game controllers, computer vision or computer listening systems, and more. The Wekinator allows users to build new interactive systems by demonstrating human actions and computer responses, instead of writing programming code. Create mappings between gesture and computer sounds. Control a drum machine using your webcam! Play Ableton using a Kinect! Control interactive visual environments created in Processing, OpenFrameworks, or Quartz Composer, or game engines like Unity, using gestures sensed from webcam, Kinect, etc.
  • 43
    AForge.NET

    AForge.NET

    AForge.NET

    AForge.NET is an open source C# framework designed for developers and researchers in the fields of Computer Vision and Artificial Intelligence - image processing, neural networks, genetic algorithms, fuzzy logic, machine learning, robotics, etc. The work on the framework's improvement is in constants progress, what means that new feature and namespaces are coming constantly. To get knowledge about its progress you may track source repository's log or visit project discussion group to get the latest information about it. The framework is provided not only with different libraries and their sources, but with many sample applications, which demonstrate the use of this framework, and with documentation help files, which are provided in HTML Help format.
  • 44
    Apache Mahout

    Apache Mahout

    Apache Software Foundation

    Apache Mahout is a powerful, scalable, and versatile machine learning library designed for distributed data processing. It offers a comprehensive set of algorithms for various tasks, including classification, clustering, recommendation, and pattern mining. Built on top of the Apache Hadoop ecosystem, Mahout leverages MapReduce and Spark to enable data processing on large-scale datasets. Apache Mahout(TM) is a distributed linear algebra framework and mathematically expressive Scala DSL designed to let mathematicians, statisticians, and data scientists quickly implement their own algorithms. Apache Spark is the recommended out-of-the-box distributed back-end or can be extended to other distributed backends. Matrix computations are a fundamental part of many scientific and engineering applications, including machine learning, computer vision, and data analysis. Apache Mahout is designed to handle large-scale data processing by leveraging the power of Hadoop and Spark.
  • 45
    Bittensor

    Bittensor

    Bittensor

    Bittensor is an open-source protocol that powers a decentralized, blockchain-based machine-learning network. Machine learning models train collaboratively and are rewarded in TAO according to the informational value they offer the collective. TAO also grants external access, allowing users to extract information from the network while tuning its activities to their needs. Ultimately, our vision is to create a pure market for artificial intelligence, an incentivized arena in which consumers and producers of this valuable commodity can interact in a trustless, open, and transparent context. A novel, optimized strategy for the development and distribution of artificial intelligence technology by leveraging the possibilities of a distributed ledger. specifically, its facilitation of open access/ownership, decentralized governance, and the ability to harness globally-distributed resources of computing power and innovation within an incentivized framework.
    Starting Price: Free
  • 46
    VisionAgent

    VisionAgent

    LandingAI

    VisionAgent is a generative Visual AI application builder developed by Landing AI, designed to accelerate the creation and deployment of vision-enabled applications. By inputting a simple prompt, users can describe their vision task, and VisionAgent intelligently selects the most suitable models from a curated collection of effective open-source models to address the task. It then generates, tests, and deploys the necessary code, enabling the rapid development of applications involving object detection, segmentation, object tracking, and activity recognition. This streamlined process allows developers to build vision-enabled applications in minutes, significantly reducing development time and effort. Enhance efficiency with instant code generation for custom post-processing steps. VisionAgent selects the best model for your use case from a curated collection of the most effective open-source models.
  • 47
    Alegion

    Alegion

    Alegion

    Alegion is the data labeling solution for enterprise-grade Machine Learning. We lead the industry in streaming, high-resolution, high-density video annotation, delivering accurately-annotated, model-ready data to train and validate ML models. Alegion provides both the platform and workforce to operate with quality at scale, processing structured and unstructured data including video, image, audio, and text. Our ML powered platform speeds up task completion by as much as 70%, including classless object tracking and single click smart polygon generation. Segmentation options include Keypoint, Bounding Box, Polyline, & Polygon segmentation, for image and video. Semantic Segmentation tools deliver seamless entity boundaries with pixel perfect accuracy. NLP and NER capabilities support text and audio classification and sentiment analysis. The platform is highly configurable to support hybrid use cases. Available via SaaS (Alegion Control), Managed Platform, and Managed Labeling Services.
    Starting Price: $5000
  • 48
    GazeInsight

    GazeInsight

    GazeRecorder

    Our technology turns a simple webcam into a Accurate Eye-Tracker. Advances in machine learning and computer vision, allow us to track eye movements with high precision. You can take your research outside the lab and scale to a large number of participants. Online solutions for remote usability research. It allows you to do UX research both on desktop and mobile remotely. You get high-quality session recordings and see through the users’ eyes. Track consumers’ attention, and get to know your brand perception and marketing communication performance. GazeRecorder is designed to handle any type of content (banners, videos, Live web pages ). You can ask testers across the globe. They only need a computer with a webcam. You will have immediate access to the results. Invite participants to complete tests on their own devices at home. This keeps testers in their natural environment.
  • 49
    Hive Data
    Create training datasets for computer vision models with our fully managed solution. We believe that data labeling is the most important factor in building effective deep learning models. We are committed to being the field's leading data labeling platform and helping companies take full advantage of AI's capabilities. Organize your media with discrete categories. Identify items of interest with one or many bounding boxes. Like bounding boxes, but with additional precision. Annotate objects with accurate width, depth, and height. Classify each pixel of an image. Mark individual points in an image. Annotate straight lines in an image. Measure, yaw, pitch, and roll of an item of interest. Annotate timestamps in video and audio content. Annotate freeform lines in an image.
    Starting Price: $25 per 1,000 annotations
  • 50
    Paravision

    Paravision

    Paravision

    Paravision provides a computer vision developer platform that powers face recognition applications serving mission-critical use cases. Our SDK's and API's enable comprehensive security and frictionless experiences and are powered by an industry-leading feature set. Our SDKs and Vision AI engines can be integrated into modern, secure infrastructure. We also build advanced solutions for identity-based security threats, like spoof attempts and deepfakes. Utilizing the most advanced AI frameworks and partnered with leading providers of hardware accelerators for AI and deep learning, Paravision delivers speed, scalability, and responsiveness while lowering operating costs. Paravision is proud to be a US-based leader in Vision AI. Whether in technical partnership, working through end-user challenges, or collaborating on market strategy, we strive to be dynamic, responsive, and focused on delivering excellence.