Alternatives to Cognex VisionPro

Compare Cognex VisionPro alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Cognex VisionPro in 2026. Compare features, ratings, user reviews, pricing, and more from Cognex VisionPro competitors and alternatives in order to make an informed decision for your business.

  • 1
    Google Cloud Vision AI
    Derive insights from your images in the cloud or at the edge with AutoML Vision or use pre-trained Vision API models to detect emotion, understand text, and more. Google Cloud offers two computer vision products that use machine learning to help you understand your images with industry-leading prediction accuracy. Automate the training of your own custom machine learning models. Simply upload images and train custom image models with AutoML Vision’s easy-to-use graphical interface; optimize your models for accuracy, latency, and size; and export them to your application in the cloud, or to an array of devices at the edge. Google Cloud’s Vision API offers powerful pre-trained machine learning models through REST and RPC APIs. Assign labels to images and quickly classify them into millions of predefined categories. Detect objects and faces, read printed and handwritten text, and build valuable metadata into your image catalog.
  • 2
    Dataloop AI

    Dataloop AI

    Dataloop AI

    Manage unstructured data and pipelines to develop AI solutions at amazing speed. Enterprise-grade data platform for vision AI. Dataloop is a one-stop shop for building and deploying powerful computer vision pipelines data labeling, automating data ops, customizing production pipelines and weaving the human-in-the-loop for data validation. Our vision is to make machine learning-based systems accessible, affordable and scalable for all. Explore and analyze vast quantities of unstructured data from diverse sources. Rely on automated preprocessing and embeddings to identify similarities and find the data you need. Curate, version, clean, and route your data to wherever it’s needed to create exceptional AI applications.
  • 3
    EVLib

    EVLib

    Irida Labs

    EV Lib is a complete embedded vision software library based on deep learning and AI with functionalities for people, vehicle and object detection, identification tracking and 3D pose estimation.
  • 4
    Rosepetal AI

    Rosepetal AI

    Rosepetal AI

    Rosepetal AI is an innovative technology company specializing in advanced artificial vision and deep-learning solutions designed specifically for industrial quality control. Our platform integrates dataset handling, automated labelling and training of adaptive neural networks, enabling real-time defect detection without requiring advanced technical expertise. This intuitive, no-code SaaS solution democratizes access to sophisticated AI, significantly enhancing efficiency, reducing waste, and driving operational excellence across multiple industries such as automotive, food processing, pharmaceuticals, plastics, and electronics. The unique strength of Rosepetal AI lies in its dynamic adaptability and scalability. Our system allows industrial companies to quickly deploy robust AI models directly onto their production lines, continuously adjusting to new product variations and emerging defects. This capability ensures consistent quality, minimizes downtime.
  • 5
    alwaysAI

    alwaysAI

    alwaysAI

    alwaysAI provides developers with a simple and flexible way to build, train, and deploy computer vision applications to a wide variety of IoT devices. Select from a catalog of deep learning models or upload your own. Use our flexible and customizable APIs to quickly enable core computer vision services. Quickly prototype, test and iterate with a variety of camera-enabled ARM-32, ARM-64 and x86 devices. Identify objects in an image by name or classification. Identify and count objects appearing in a real-time video feed. Follow the same object across a series of frames. Find faces or full bodies in a scene to count or track. Locate and define borders around separate objects. Separate key objects in an image from background visuals. Determine human body poses, fall detection, emotions. Use our model training toolkit to train an object detection model to identify virtually any object. Create a model tailored to your specific use-case.
  • 6
    AegisVision

    AegisVision

    AegisVision AI

    AegisVision is an advanced AI-driven computer vision platform that transforms ordinary camera feeds into actionable business intelligence. Designed for enterprise environments, AegisVision uses cutting-edge deep learning and adaptive vision models to automate visual inspection, detect defects, monitor safety compliance, and deliver insights in real time — whether deployed on the cloud or at the edge. With real-time defect detection, AegisVision identifies surface flaws, assembly errors, and anomalies instantly, replacing manual inspection with consistent automated precision. Its self-learning models continually improve performance and adapt to new product types or changing conditions with minimal retraining.
  • 7
    NeuralVision

    NeuralVision

    Cyth Systems, Inc.

    NeuralVision is a machine vision platform at the forefront of deep learning and artificial intelligence-like abilities applied to the industrial inspection space. For the first time companies are able to have total control of the performance of their machine vision systems and not be dependent on external vision experts to make changes or incorporate new product lines. Traditional machine vision is highly dependent on having a controlled environment, rigid positional tolerances, and ultimately the skill of the vision programmer. It is up to engineers to come up with every algorithm required to inspect a part from measurements to color to correct locations and everything in between. NeuralVision from Cyth Systems was designed to allow a person with no machine vision experience to inspect and classify products. Machine vision systems traditionally work by having an experienced programmer choose one of many analysis algorithms to apply to an image.
  • 8
    Prophesee Metavision
    Metavision is an advanced event-based vision software toolkit developed by Prophesee, designed to facilitate the evaluation, design, and commercialization of event-based vision products. The SDK offers a comprehensive suite of tools, including 64 algorithms, 105 code samples, and 17 tutorials, enabling developers to efficiently build and deploy event-based applications. The open source architecture of Metavision SDK ensures full interoperability between software and hardware devices, fostering a rapidly growing event-based vision community. The platform covers a wide range of computer vision fields, such as machine learning, computer vision, camera calibration, and high-performance applications. Developers have access to extensive documentation, including over 300 pages of content, programming guides, and reference data, providing a solid foundation for product development. Metavision SDK5 PRO includes advanced add-ons like high-speed counting, spatter monitoring, and more.
  • 9
    IMPACT Software Suite
    IMPACT Software Suite, with over 120 inspection tools and 50 user interface controls, allows users to create unique inspection programs and develop user interfaces quickly and easily. All this can be done without the loss of flexibility, like traditional configurable systems, or the need for vast amounts of development time. IMPACT Software Suite also provides a Software Development Kit (SDK) that guarantees full integration of machine vision monitoring capabilities into HMI software applications. Vision Program Manager (VPM) provides hundreds of image processing and analysis functions. Use VPM to enhance images, locate features, measure objects, check for presence or absence, and read text and bar codes. Control Panel Manager (CPM) simplifies development of operator interfaces with the ability to make on-the-fly adjustments to critical machine controls. CPM creates operator interface panels to view and adjust critical machine controls. IMPACT Software Development Kit (SDK) consists of
  • 10
    SolVision

    SolVision

    Solomon

    SolVision is an advanced AI vision system developed by Solomon 3D, designed to enhance industrial automation through rapid and accurate visual inspections. Leveraging Solomon’s proprietary rapid AI model training technology, SolVision enables users to train AI models in minutes, significantly reducing setup time compared to traditional systems. It excels in various applications, including defect detection, item classification, optical character recognition, and presence/absence checks, making it suitable for industries such as manufacturing, food & beverage, textiles, and electronics. A standout feature is its ability to learn from as few as 1–5 image samples, streamlining the training process and minimizing the need for extensive data annotation. SolVision's intuitive user interface allows for simultaneous labeling of multiple defect types, facilitating complex classification tasks.
  • 11
    VisionAgent

    VisionAgent

    LandingAI

    VisionAgent is a generative Visual AI application builder developed by Landing AI, designed to accelerate the creation and deployment of vision-enabled applications. By inputting a simple prompt, users can describe their vision task, and VisionAgent intelligently selects the most suitable models from a curated collection of effective open-source models to address the task. It then generates, tests, and deploys the necessary code, enabling the rapid development of applications involving object detection, segmentation, object tracking, and activity recognition. This streamlined process allows developers to build vision-enabled applications in minutes, significantly reducing development time and effort. Enhance efficiency with instant code generation for custom post-processing steps. VisionAgent selects the best model for your use case from a curated collection of the most effective open-source models.
  • 12
    Unleash live
    Unleash live is an A.I. video analytics enterprise solution provider. We take a vision from any camera and combine it with computer vision to deliver actionable data in real-time so that your organization has immediate insights to drive down costs, improve productivity, increase accuracy, and improve safety. Support for a wide range of cameras. Connect any combination of IP/CCTV, drone, body cam, mobile or robotic cameras. Live stream in the field and share it with your team while operations are in progress, or upload footage into your account. Apply A. I Apps from our app store to detect, inspect and monitor objects and items of interest or create 2D orthomaps and 3D models. Integrate results into your operational workflow, from live dashboards, to notifications and API integrations. Take the complexity and time out of collaboration. Instantly connect any mix of cameras to share over a live stream with stakeholders and 3rd parties. No plugs-in, no downloads, all in the browser.
    Starting Price: $99 per month
  • 13
    OpenCV

    OpenCV

    OpenCV

    OpenCV (Open Source Computer Vision Library) is an open-source computer vision and machine learning software library. OpenCV was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in commercial products. Being a BSD-licensed product, OpenCV makes it easy for businesses to utilize and modify the code. The library has more than 2500 optimized algorithms, which includes a comprehensive set of both classic and state-of-the-art computer vision and machine learning algorithms. These algorithms can be used to detect and recognize faces, identify objects, classify human actions in videos, track camera movements, track moving objects, extract 3D models of objects, produce 3D point clouds from stereo cameras, and stitch images together to produce a high-resolution image of an entire scene, find similar images from an image database, remove red eyes from images taken using flash, follow eye movements, recognize scenery, etc.
  • 14
    Plainsight

    Plainsight

    Plainsight

    Remove the complexity from your machine learning projects with our vision AI platform built from the ground up for fast, effective video analytics application development. With easy, no-code point-and-click features all in one platform, Plainsight slashes your time-to-production and accelerates the success of vision AI-powered solutions across industries. Connect, administer, & control cameras, sensors & edge devices in one interface. Collect accurate training datasets to provide a high-quality training foundation for models. Accelerate labeling with smart polygon selection, predictive labeling, & automated object recognition. Easily train models with a breakthrough process designed to reduce time to vision AI solutions. Quickly deploy & scale applications at the edge, in the cloud, or on-premises to meet business needs.
  • 15
    Linker Vision

    Linker Vision

    Linker Vision

    Linker VisionAI Platform is a comprehensive, end-to-end solution for vision AI, encompassing simulation, training, and deployment to empower smart cities and enterprises. It comprises three core components, Mirra, for synthetic data generation using NVIDIA Omniverse and NVIDIA Cosmos; DataVerse, facilitating data curation, annotation, and model training with NVIDIA NeMo and NVIDIA TAO; and Observ, enabling large-scale Vision Language Model (VLM) deployment with NVIDIA NIM. This integrated approach allows for the seamless transition from data simulation to real-world application, ensuring that AI models are robust and adaptable. Linker VisionAI Platform supports a range of applications, including traffic and transportation management, worker safety, disaster response, and more, by leveraging urban camera networks and AI to drive responsive decisions.
  • 16
    Flexible Vision

    Flexible Vision

    Flexible Vision

    Flexible Vision is an AI machine vision software and hardware solution that enables your team to quickly and easily solve difficult visual inspections. The cloud portal allows your teams to collaborate and share vision inspection programs across factory floors. Collect 5-10 images of good parts and bad parts. Our software will optionally increase this sample size with augmentation. With a click of a button, your model will begin to be created. Your model will be ready for production in a matter of minutes. Your AI model will automatically deploy and be ready for validation. Download or sync the model to as many on-prem production lines as needed. Our high speed industrial processors quickly process your images. Simply select the ai model from a dropdown and watch the detections live on screen. Our systems are designed for either manual inspection stations or incorporated into traditional factory automation. Our systems are IO and field-bus compatible.
  • 17
    Neurala

    Neurala

    Neurala

    Neurala is on a mission to help manufacturers improve their vision inspection process. Supply chain issues, labor shortages, and the risk of recalls are driving the need for more automation. Our Visual Inspection Automation (VIA) software goes beyond the capabilities of traditional machine vision in detecting anomalies and defects, even when products have natural variations. Using our proven vision AI technology, manufacturers can scale production, reduce waste and adapt to workforce changes, while achieving even higher levels of quality control. Neurala software uses our patented Lifelong-Deep Neural Network (L-DNN)™ technology, offering the first cost-effective vision AI tool that can be easily retrofitted into your existing production line infrastructure, without the need for AI experts or expensive capital expenditures. Neurala gives you the flexibility to deploy your vision AI models to meet your specific business needs, either to the cloud or on-premise.
  • 18
    Viso Suite

    Viso Suite

    Viso Suite

    Viso Suite is the world’s only end-to-end platform for computer vision. It enables teams to rapidly train, create, deploy and manage computer vision applications – without writing code from scratch. Use Viso Suite to deliver industry-leading computer vision and real-time deep learning systems with low-code and automated software infrastructure. The use of traditional development methods, fragmented software tools, and the lack of experienced engineers are costing organizations lots of time and leading to inefficient, low-performing, and expensive computer vision systems. Build and deploy better computer vision applications faster by abstracting and automating the entire lifecycle with Viso Suite, the all-in-one enterprise vision platform.​ Collect data for computer vision annotation with Viso Suite. Use automated collection capabilities to gather high-quality training data. Control and secure all data collection. Enable continuous data collection to further improve your AI models.
  • 19
    Kibsi

    Kibsi

    Kibsi

    Kibsi is the no-code computer vision platform to build and launch video AI solutions in minutes – not months. Stretch your tech without spending a fortune. From security cameras to webcams, Kibsi converts any live stream camera feed into rich streams of insights and data. View live data, uncover trends, trigger alerts, and automate actions that empower analysts and business leaders with real-time understanding and historical analysis. Kibsi does more than just identify objects, it adds context and relational rules to computer vision through machine learning and proprietary algorithms. Kibsi’s no-code, drag-and-drop experience gets you answers faster. Computer vision programmers and developers are welcome but certainly not required. With 1000s of ready-to-use, built-in objects and classes, you can start getting insights right away. Of course, adding your own objects is easy and automated, too.
    Starting Price: $99 per month
  • 20
    Amazon Lookout for Vision
    Easily create a machine learning (ML) model to spot anomalies from your live process line with as few as 30 images. Identify visual anomalies in real time to reduce and prevent defects and improve product quality. Prevent unplanned downtime and reduce operational costs by using visual inspection data to spot potential issues and take corrective action. Spot damage to a product’s surface quality, color, and shape during the fabrication and assembly process. Determine what’s missing based on the absence, presence, or placement of objects, like a missing capacitor in a printed circuit board. Detect defects with repeating patterns, such as repeated scratches in the same spot on a silicon wafer. Amazon Lookout for Vision is an ML service that uses computer vision to spot defects in manufactured products at scale. Spot product defects using computer vision to automate quality inspection.
  • 21
    Vertex AI Vision
    Easily build, deploy, and manage computer vision applications with a fully managed, end-to-end application development environment that reduces the time to build computer vision applications from days to minutes at one-tenth the cost of current offerings. Quickly and conveniently ingest real-time video and image streams at a global scale. Easily build computer vision applications using a drag-and-drop interface. Store and search petabytes of data with built-in AI capabilities. Vertex AI Vision includes all the tools needed to manage the life cycle of computer vision applications, across ingestion, analysis, storage, and deployment. Easily connect application output to a data destination, like BigQuery for analytics, or live streaming to drive real-time business actions. Ingest thousands of video streams from across the globe. With a monthly pricing model, enjoy up to one-tenth lower costs than previous offerings.
    Starting Price: $0.0085 per GB
  • 22
    SimpleCV

    SimpleCV

    SimpleCV

    SimpleCV is an open-source framework for building computer vision applications. With it, you get access to several high-powered computer vision libraries such as OpenCV, without having to first learn about bit depths, file formats, color spaces, buffer management, eigenvalues, or matrix versus bitmap storage. This is computer vision made easy. These are just a small number of things you can do with SimpleCV. If you would like to learn more please refer to our tutorial. There are also many examples included in the SimpleCV directory under the examples folder which can also be downloaded from here. SimpleCV is an open-source framework, meaning that it is a collection of libraries and software that you can use to develop vision applications. It lets you work with the images or video streams that come from webcams, Kinects, FireWire and IP cameras, or mobile phones. It helps you build software to make your various technologies not only see the world but understand it too.
  • 23
    FABIMAGE

    FABIMAGE

    Opto Engineering

    FabImage Studio Professional is data-flow-based software designed for machine vision engineers. It does not require any programming skills, but it is still so powerful that it can win even with solutions based on low-level programming libraries. Also, the architecture is highly flexible, ensuring that users can easily adapt the product to the way they work and to the specific requirements of any project. No low-level programming knowledge is required. Data-flow-based software. Fast and optimized algorithms. 1000+ high-performance functions. Custom machine vision filters. There are over 1000 ready-for-use machine filters tested and optimized on hundreds of applications. They have many advanced capabilities such as outlier suppression, subpixel precision or any-shape region-of-interest. FabImage® Studio is a GigE Vision compliant product, supporting the GenTL interface, as well as a number of vendor-specific APIs.
  • 24
    Robovision

    Robovision

    Robovision

    The Robovision AI software platform can be easily integrated into current infrastructures and operations. It makes advanced machine vision accessible to any team member with or without AI experience because the user interface is designed to be low-barrier and easy to use. The platform handles training AI models and deploying them at scale, simplifying the complexities of machine vision and shifting the focus to faster results, less time is wasted figuring out technical hurdles. By combining artificial intelligence and deep learning, raw visual data can be turned into advanced, and actionable, insights. Robovision’s machine vision system is designed to handle incredibly complex visual inputs in various scenarios, including inspecting products on an assembly line, tracking inventory in real-time, or diagnosing medical conditions.
  • 25
    Sightbit

    Sightbit

    Sightbit

    SightBit provides an AI-powered solution for enhancing safety and security around open water. The company’s proprietary deep-learning AI models and computer vision technology enable capabilities including object detection and classification, drowning detection, hazard detection and prediction, object penetration detection and pollution detection. SightBit’s technology addresses climate challenges by detecting, monitoring, and providing alerts regarding events such as tsunamis and rip currents, while simultaneously providing management capabilities. The company’s solution can easily be deployed using off-the-shelf video cameras, without the need for sensors, edge processors, or customization. SightBit’s core system is based on deep-learning computer vision technology that transmits real-time information to monitors in various control rooms, sounding an alarm when people are in danger, and providing alerts when a system or structure is likely to fail.
  • 26
    Datature

    Datature

    Datature

    Datature is a comprehensive, end-to-end, no-code computer vision and MLOps platform that simplifies the entire deep-learning lifecycle by letting users manage data, annotate images and videos, train models, evaluate performance, and deploy AI vision solutions, all within one unified environment without coding. Its intuitive visual interface and workflow tools guide you through dataset onboarding and annotation (including bounding boxes, segmentation, and advanced labeling), let you build automated training pipelines, monitor model training, and assess model accuracy with rich performance analytics, and then deploy models via API or for edge use so trained models can be used in real-world applications. Designed to democratize access to AI vision, Datature accelerates project timelines by reducing manual coding and debugging, supports collaboration across teams, and accommodates tasks like object detection, classification, semantic segmentation, and video analysis.
  • 27
    Apera AI

    Apera AI

    Apera AI

    Forge Lab makes AI training and simulation for vision-guided robotics fast and accessible. Manufacturing engineers can receive ready vision programs and test their automation strategies. AI-powered vision can drive huge improvements in reliability and product quality. This includes new cells or retrofitting existing cells and manual processes. Vision driven by AI makes robotic cells more reliable and productive. You can now use vision-guided robotics with less expertise and risk. Vue software can change robotic guidance, bin picking, assembly and more in your facilities. The AI learns to understand your parts completely, so the robot can take the fastest, safest, most reliable path in and out of movements to handle the parts. Vue understands how to avoid collisions within the operating area, even with the object in hand. Since the AI also understands how the object has been picked up, it can precisely and accurately place it, or assemble it with another object.
  • 28
    Overview

    Overview

    Overview

    Reliable, adaptable computer vision systems for any factory. AI and image capture are integrated into every step of manufacturing. Overview’s inspection systems are built with deep learning technology which allows us to find mistakes more consistently and in a wider variety of situations. Enhanced traceability with remote access and support. Our solutions create a traceable visual record of every unit. You can quickly identify the root cause of production problems and quality issues. Whether you are just digitizing your inspection or have an existing vision system that is underperforming, Overview has a solution that can drive waste out of your manufacturing operations. Demo the Snap platform to see how we improve your factory efficiency. Deep learning automated inspection solutions radically improve defect detection. Improved yields, better traceability, easy setup, and outstanding support.
  • 29
    Oxipital AI

    Oxipital AI

    Oxipital AI

    Our solutions are designed to have an immediate impact and require no code, no DIY, and no machine learning expertise to deploy into production. User-friendly, web-based setup tools, and dashboards take the mystery out of AI, leaving your business with insights that you can act on right now. Our fully integrated solutions enable manufacturers to tap into their most potent source of business intelligence, their own data. By addressing the most pervasive challenges of high-variability manufacturing environments, our visual AI platform provides the clarity to help businesses sharpen their operational vision. Our advanced AI vision supercharges operations in complex and high-variability manufacturing environments including food processing, agriculture, and consumer packaged goods, industries with challenges that evade existing machine vision technologies.
  • 30
    Ultralytics

    Ultralytics

    Ultralytics

    Ultralytics offers a full-stack vision-AI platform built around its flagship YOLO model suite that enables teams to train, validate, and deploy computer-vision models with minimal friction. The platform allows you to drag and drop datasets, select from pre-built templates or fine-tune custom models, then export to a wide variety of formats for cloud, edge or mobile deployment. With support for tasks including object detection, instance segmentation, image classification, pose estimation and oriented bounding-box detection, Ultralytics’ models deliver high accuracy and efficiency and are optimized for both embedded devices and large-scale inference. The product also includes Ultralytics HUB, a web-based tool where users can upload their images/videos, train models online, preview results (even on a phone), collaborate with team members, and deploy via an inference API.
  • 31
    Eyeris

    Eyeris

    Eyeris

    Driven by excellence, inspired by you. At Eyeris, our technology was inspired by the late-night worker, the caring parent, the aspiring entrepreneur. Keeping every driver in mind, our innovative technology promises to push towards a safer and better road ahead. ​In-Cabin cameras are the most common sensor used for driver and occupant monitoring. Eyeris AI Software interprets the entire interior scene through these cameras. Allows the ability to collect data from different sensor types to interpret the scene with redundant data for high data accuracy. Innovation in hardware is improving to accommodate and run sophisicated AI software in the most efficient and fastest manner. Our vision-based neural networks provide the richest source of information. Using the latest image sensors, our pre-trained vision AI models understand the entire in-cabin space under the widest range of lighting spectrum.
  • 32
    Azure AI Custom Vision
    Create a custom computer vision model in minutes. Customize and embed state-of-the-art computer vision image analysis for specific domains with AI Custom Vision, part of Azure AI Services. Build frictionless customer experiences, optimize manufacturing processes, accelerate digital marketing campaigns, and more. No machine learning expertise is required. Set your model to perceive a particular object for your use case. Easily build your image identifier model using the simple interface. Start training your computer vision model by simply uploading and labeling a few images. The model tests itself on these and continually improves precision through a feedback loop as you add images. To speed development, use customizable, built-in models for retail, manufacturing, and food. See how Minsur, one of the world's largest tin mines, uses AI Custom Vision for sustainable mining. Rely on enterprise-grade security and privacy for your data and any trained models.
    Starting Price: $2 per 1,000 transactions
  • 33
    FanWide COMPL-AI
    FanWide COMPL-AI ("comply") provides event health and crowd safety management software designed to help prevent coronavirus (COVID-19) spreading at public facilities and spaces. FanWide COMPL-AI uses your existing security cameras and video management systems (VMS) with artificial intelligence (AI) computer vision to proactively detect and report compliance incidences. FanWide can customize every camera to capture capacity counts in specific areas, detect overcrowding, enforce facemask compliance, measure temperatures or run dozens of other safety, security or guest experience AI Rules. Now you can optimize your business operations while increasing guest safety for a fraction of the cost of hiring additional staff.
  • 34
    Ailiverse NeuCore
    Build & scale with ease. With NeuCore you can develop, train and deploy your computer vision model in a few minutes and scale it to millions. A one-stop platform that manages the model lifecycle, including development, training, deployment, and maintenance. Advanced data encryption is applied to protect your information at all stages of the process, from training to inference. Fully integrable vision AI models fit into your existing workflows and systems, or even edge devices easily. Seamless scalability accommodates your growing business needs and evolving business requirements. Divides an image into segments of different objects within the image. Extracts text from images, making it machine-readable. This model also works on handwriting. With NeuCore, building computer vision models is as easy as drag-and-drop and one-click. For more customization, advanced users can access provided code scripts and follow tutorial videos.
  • 35
    PaliGemma 2
    PaliGemma 2, the next evolution in tunable vision-language models, builds upon the performant Gemma 2 models, adding the power of vision and making it easier than ever to fine-tune for exceptional performance. With PaliGemma 2, these models can see, understand, and interact with visual input, opening up a world of new possibilities. It offers scalable performance with multiple model sizes (3B, 10B, 28B parameters) and resolutions (224px, 448px, 896px). PaliGemma 2 generates detailed, contextually relevant captions for images, going beyond simple object identification to describe actions, emotions, and the overall narrative of the scene. Our research demonstrates leading performance in chemical formula recognition, music score recognition, spatial reasoning, and chest X-ray report generation, as detailed in the technical report. Upgrading to PaliGemma 2 is a breeze for existing PaliGemma users.
  • 36
    Eyewey

    Eyewey

    Eyewey

    Train your own models, get access to pre-trained computer vision models and app templates, learn how to create AI apps or solve a business problem using computer vision in a couple of hours. Start creating your own dataset for detection by adding the images of the object you need to train. You can add up to 5000 images per dataset. After images are added to your dataset, they are pushed automatically into training. Once the model is finished training, you will be notified accordingly. You can simply download your model to be used for detection. You can also integrate your model to our pre-existing app templates for quick coding. Our mobile app which is available on both Android and IOS utilizes the power of computer vision to help people with complete blindness in their day-to-day lives. It is capable of alerting hazardous objects or signs, detecting common objects, recognizing text as well as currencies and understanding basic scenarios through deep learning.
    Starting Price: $6.67 per month
  • 37
    Command A Vision
    Command A Vision is Cohere’s multimodal AI solution built for enterprise use that combines image understanding with language capabilities to drive business outcomes while keeping compute costs low; it extends the Command family by adding vision comprehension, allowing organizations to interpret and act on visual content in concert with text, and integrates into workplace systems to surface insights, boost productivity, and enable more intelligent search and discovery. The offering is positioned alongside Cohere’s broader AI stack and emphasizes putting AI to work in real-world workflows, helping teams unify multimodal signals, extract actionable meaning from images and associated metadata, and surface relevant business intelligence without excessive infrastructure overhead. Command A Vision excels at understanding and analyzing a wide range of visual and multilingual data, including charts, graphs, tables, and diagrams.
  • 38
    AdMobilize

    AdMobilize

    AdMobilize

    Analyze people, crowds, vehicles, and other objects in real time with your cameras. Measure people, crowds, and vehicles anonymously in real time with your cameras. Gather smart metrics from your IP/security cameras in one step. Our technology works with several types of cameras and operating systems used around the world. On the go, at your desk or wherever you may go, your AdDashboard is with you all the time. View metrics that are important to your business and share them with customers. The strictest privacy and reliability methodologies have earned our reputation as the industry’s most trusted measurement company. We understand your need to intuitively access, utilize, and integrate our real-time data; so we made it effortless. Our computer vision infrastructure caters to all of our client’s needs, always ensuring the highest performance regardless of implementation.
  • 39
    Vaidio AI Vision Platform
    IronYun's Vaidio® AI Vision Platform delivers 30+ advanced AI video analytics functions to add a layer of superhuman intelligence and market leading accuracy to existing camera and video infrastructure. Vaidio works with any IP camera and integrates out of the box with 28 market leading video management systems. Vaidio AI accelerates and scales intelligence across real-time, forensic, and video data applications. These applications include intrusion detection, person and vehicle counting, face and license plate recognition, vehicle make and model, loitering, crowding, PPE, weapon, smoke, and fire recognition and more. In the past three years the Vaidio Platform has won ISC West New Product Showcase Awards for Commercial Monitoring, Loss Prevention, and Video Analytics.
  • 40
    EyePop.ai

    EyePop.ai

    EyePop.ai

    Streamlining visual data analysis for easy, accessible AI-powered insights, regardless of industry or technical knowledge. Build your tailored AI application with EyePop. Embark on your project journey today, leveraging our advanced computer vision technology. Discover the untapped potential in your images and videos. Our platform delivers deep insights into your media, enhancing user experiences and boosting engagement. Building a custom application is a breeze with our intuitive no/low code platform. Anyone can easily create Pops that work with existing images, videos, or even real-time streams. Develop powerful, tailored computer vision solutions and make the most of your visual data. Empower decision-making with AI-driven insights, revolutionizing computer vision interaction. Build custom computer vision apps effortlessly with EyePop.ai’s no/low code platform for all skill levels.
  • 41
    RoboRealm

    RoboRealm

    RoboRealm

    RoboRealm is a Windows-based machine vision software designed to simplify vision programming and enable rapid prototyping with advanced modules. It features an intuitive GUI requiring no or low code, making it accessible for both casual users and serious robotic scientists. It supports hundreds of image processing modules and is camera agnostic, allowing for flexibility in hardware choices. Users can experience real-time parameter changes, and the software includes a fully supported server API for integration with other systems. RoboRealm accommodates multiple image sources and offers various output interfaces, including file, web, FTP, and email. Its plugin framework allows for the development of custom modules, and an active online community provides expert assistance. It enables the combination of modules through an easy-to-use pipeline to create tailored solutions for tasks such as surface defect detection, measurement, counting, detection, etc.
    Starting Price: $25 per month
  • 42
    Roboflow

    Roboflow

    Roboflow

    Roboflow has everything you need to build and deploy computer vision models. Connect Roboflow at any step in your pipeline with APIs and SDKs, or use the end-to-end interface to automate the entire process from image to inference. Whether you’re in need of data labeling, model training, or model deployment, Roboflow gives you building blocks to bring custom computer vision solutions to your business.
  • 43
    CameraMatics

    CameraMatics

    CameraMatics

    CameraMatics is an AI-enabled fleet operations platform that combines video-based safety, telematics, and workflow automation to help organizations manage vehicles, drivers, and operational risk in a single unified system. It uses advanced camera systems and computer vision to provide 360-degree visibility around vehicles, actively monitoring both inside and outside the cab to detect hazards such as pedestrians, cyclists, fatigue, and driver distraction in real time, while delivering in-cab alerts to prevent accidents before they occur. It integrates telematics data such as GPS tracking, vehicle diagnostics, and driving behavior analytics, enabling fleet managers to monitor mileage, fuel usage, idle time, and maintenance needs while optimizing vehicle performance and utilization. CameraMatics also digitizes and automates operational workflows, including vehicle inspections, compliance reporting, routing, and driver communication, reducing paperwork.
  • 44
    Gravio

    Gravio

    Gravio

    Gravio enables new ways to connect and interact with your environment through the power of IoT, sensors, edge computing, computer vision, and AI without programming knowledge. Gravio is an easy-to-use software platform that runs on Windows, macOS, or Linux. You can connect to various inputs and outputs, including some bundled IoT sensors, computer vision/AI cameras, and MQTT or HTTP APIs. Gravio is very easy to use without software programming knowledge. Gravio unlocks the power of connected technologies by connecting sensors, input devices, cameras, and APIs within a space, then continuously gathering and sharing their information, enabling new ways to interact with, learn from and enhance a physical space. To create these experiences, Gravio provides a powerful low-code/no-code environment to enable entrepreneurs and organizations of all sizes, across industries, to build custom, connected experiences for new and existing environments.
    Starting Price: $4.99 per month
  • 45
    Arvist AI

    Arvist AI

    Arvist AI

    Arvist AI is a computer vision platform that empowers warehouses worldwide to improve shipment quality, enhance worker safety, and achieve compliance by utilizing AI integrated with existing security cameras. It is camera-agnostic, allowing integration with any visual input device, and scales effortlessly across multiple warehousing sites. Arvist automates shipment inspections, reducing OS&D claims and inspection labor costs, while providing visual proof for dispute resolution. It simplifies quality control by detecting labeling errors, verifying expiry dates, and identifying incorrect or damaged shipments. Arvist AI enhances safety and compliance by monitoring for bonded warehouse and customs compliance, forklift and vehicle collisions, food safety compliance, and employee ergonomics and PPE compliance. Arvist installs quickly, delivering critical operational visibility and enhanced safety from day one, and adapts to specific operational needs through continuous learning.
  • 46
    Ambient.ai

    Ambient.ai

    Ambient.ai

    With Ambient.ai, computer vision intelligence is transforming security tools, operations & outcomes, moving physical security teams from reactive to proactive operations. From autonomous vehicles to robot chefs, computer vision is changing the way that humans & machines collaborate in the real world. By automating repeatable tasks, computer vision enables outsized gains in human productivity. We are a team of machine perception & security experts applying leading-edge computer vision research to the needs of physical security organizations. The privacy vs. security trade-off is a false dichotomy. You can respect individual privacy and increase group security. That’s why we don’t & won’t embrace facial recognition.
  • 47
    Cloudastructure

    Cloudastructure

    Cloudastructure

    Enables a live unified view of multiple sites from any device and history up to 10x faster than on-premises systems. The first cloud-native video surveillance platform with AI and computer vision analytics for better and more cost-effective enterprise security. Eliminates security risks, no video or data is stored or accessed on the network. Significantly reduce IT server management and maintenance costs versus on-premises or hybrid systems. Simplifies site management and provides centralized administration. Scales to an unlimited number of locations and cameras. Cloud video surveillance systems are easy to manage, use and install. The user-friendly interface makes set-up a breeze. No special technical skills are required. Advanced vehicle and people detection, counting, classification, license plate recognition, wrong-way detection, etc. Search by social distance violation, know how many people are in space and their physical distance.
  • 48
    Matroid

    Matroid

    Matroid

    Trusted for mission-critical applications, no coding required. Detect any visual defects with any camera and in any spectrum. Matroid's computer vision software enables reliable safety-critical inspection with digital traceability. Matroid automatically validates that human operators follow standard operating procedures. Matroid continuously monitors and verifies manual operations to capture various timestamps, cycle counts, and cycle times. Matroid allows for user-defined real-time alerts, video analytics, playback, and more. Capture actionable insights for continuous improvement. Implement cutting-edge technology for detecting unsafe conditions, get real-time notifications, and report safety instances with video playback. Matroid continuously monitors and verifies all tasks completed at gates to provide real-time operational insights with video analytics to implement continuous improvement initiatives for ground operations.
  • 49
    Paravision

    Paravision

    Paravision

    Paravision provides a computer vision developer platform that powers face recognition applications serving mission-critical use cases. Our SDK's and API's enable comprehensive security and frictionless experiences and are powered by an industry-leading feature set. Our SDKs and Vision AI engines can be integrated into modern, secure infrastructure. We also build advanced solutions for identity-based security threats, like spoof attempts and deepfakes. Utilizing the most advanced AI frameworks and partnered with leading providers of hardware accelerators for AI and deep learning, Paravision delivers speed, scalability, and responsiveness while lowering operating costs. Paravision is proud to be a US-based leader in Vision AI. Whether in technical partnership, working through end-user challenges, or collaborating on market strategy, we strive to be dynamic, responsive, and focused on delivering excellence.
  • 50
    Cloneable

    Cloneable

    Cloneable

    Cloneable packs sophisticated logic into an incredibly easy-to-use, no-code builder to develop custom, deep-tech applications compatible with any device. Cloneable integrates deep tech with your unique business logic, so you can create and deploy tailored apps to any edge device. Apps can be built in minutes, making it perfect for non-technical audiences to make instant process changes and for engineers who want to rapidly develop and iterate on complex field tools. Launch, update and test your AI and computer vision models on any device (phone, IoT, cloud, robot). Apps are instantly deployable from the Cloneable builder. Bring your own model or build from one of our templates to move any data collection process to the edge. Cloneable was built with unlimited flexibility, so you can count, measure, inspect, and track assets across any location. Intelligent apps can digitize manual processes, scale human expertise, increase transparency, improve auditability, and much more.