Compare the Top Computer Vision Software for Cloud as of October 2025 - Page 2

  • 1
    Prophesee Metavision
    Metavision is an advanced event-based vision software toolkit developed by Prophesee, designed to facilitate the evaluation, design, and commercialization of event-based vision products. The SDK offers a comprehensive suite of tools, including 64 algorithms, 105 code samples, and 17 tutorials, enabling developers to efficiently build and deploy event-based applications. The open source architecture of Metavision SDK ensures full interoperability between software and hardware devices, fostering a rapidly growing event-based vision community. The platform covers a wide range of computer vision fields, such as machine learning, computer vision, camera calibration, and high-performance applications. Developers have access to extensive documentation, including over 300 pages of content, programming guides, and reference data, providing a solid foundation for product development. Metavision SDK5 PRO includes advanced add-ons like high-speed counting, spatter monitoring, and more.
    Starting Price: Free
  • 2
    Qwen2.5-VL

    Qwen2.5-VL

    Alibaba

    Qwen2.5-VL is the latest vision-language model from the Qwen series, representing a significant advancement over its predecessor, Qwen2-VL. This model excels in visual understanding, capable of recognizing a wide array of objects, including text, charts, icons, graphics, and layouts within images. It functions as a visual agent, capable of reasoning and dynamically directing tools, enabling applications such as computer and phone usage. Qwen2.5-VL can comprehend videos exceeding one hour in length and can pinpoint relevant segments within them. Additionally, it accurately localizes objects in images by generating bounding boxes or points and provides stable JSON outputs for coordinates and attributes. The model also supports structured outputs for data like scanned invoices, forms, and tables, benefiting sectors such as finance and commerce. Available in base and instruct versions across 3B, 7B, and 72B sizes, Qwen2.5-VL is accessible through platforms like Hugging Face and ModelScope.
    Starting Price: Free
  • 3
    Rapid Monitor

    Rapid Monitor

    Rapid Global

    Rapid Global’s AI Safety Software is a computer vision platform designed to enhance workplace safety by detecting unsafe acts and hazardous conditions in real time. Compatible with most IP cameras, it seamlessly integrates with existing surveillance systems, ensuring easy deployment and secure, on-site data processing. Users can customize monitoring parameters by selecting specific objects, areas, and timeframes, and set tailored alarm notifications to identify unsafe behaviors as they occur. It detects missing personal protective equipment, tracks forklift-pedestrian near misses, and monitors unauthorized activity within designated zones, such as individuals standing on conveyor belts or moving outside assigned walkways. These capabilities enable organizations to proactively prevent incidents and improve safety outcomes.
    Starting Price: Free
  • 4
    EarthCam

    EarthCam

    EarthCam

    EarthCam offers a comprehensive suite of construction camera solutions designed to monitor, document, and promote projects through high-quality visual content. It provides advanced AI video analytics, enabling real-time insights into jobsite readiness, activity, and stress metrics, akin to a smartwatch biometrics report for your project. EarthCam's innovative webcams facilitate live streaming, 4K time-lapse, and 360° VR tours, enhancing visual collaboration and security with 24/7 recordings. EarthCam identifies over 30 job site materials, integrating seamlessly with Procore for schedule overlays and safety advisories. EarthCam's time-lapse services include image stabilization, enhancement, and customized music, delivering polished videos in multiple formats for marketing and archival purposes.
    Starting Price: Free
  • 5
    Rosepetal AI

    Rosepetal AI

    Rosepetal AI

    Rosepetal AI is an innovative technology company specializing in advanced artificial vision and deep-learning solutions designed specifically for industrial quality control. Our platform integrates dataset handling, automated labelling and training of adaptive neural networks, enabling real-time defect detection without requiring advanced technical expertise. This intuitive, no-code SaaS solution democratizes access to sophisticated AI, significantly enhancing efficiency, reducing waste, and driving operational excellence across multiple industries such as automotive, food processing, pharmaceuticals, plastics, and electronics. The unique strength of Rosepetal AI lies in its dynamic adaptability and scalability. Our system allows industrial companies to quickly deploy robust AI models directly onto their production lines, continuously adjusting to new product variations and emerging defects. This capability ensures consistent quality, minimizes downtime.
    Starting Price: $195
  • 6
    Scandit

    Scandit

    Scandit

    Scandit is the leader in smart data capture giving superpowers to workers, customers and businesses by providing actionable insights and automating end-to-end processes. Our Smart Data Capture platform enables smart devices, such as smartphones, drones, digital eyewear and robots to interact with physical items by capturing data from barcodes, text, IDs and objects with unmatched speed, accuracy and intelligence. Scandit accurately scans up to 3x faster than dedicated scanners in challenging light or at angles, on damaged labels, across multiple codes on any smart device. We enable innovation that delivers significant cost savings, increases employee retention and customer loyalty. Scandit partners with customers at every step with trials, solution design, integration and customer success support included. Visit scandit.com to learn why many market leaders trust us.
  • 7
    Partium

    Partium

    Partium

    Partium is a multi-modal AI-supported Enterprise Part Search. It makes it easy for your users in Maintenance and After sales & Service environments to find parts in spare parts portals, web shops, and maintenance systems. It allows technicians to search by image, text, filter, bill of materials, and tags. Hotline agents can confirm part search results and connect with the users. Partium also offers insights in your users' search behavior. Partium handles millions of spare part searches every month. Caterpillar, Parker, Liebherr, Deutsche Bahn, New Holland, The Home Depot, ENGEL, Wien Energie, and many other companies use Partium to provide not just a great search for their internal employees and customers, but a search that converts at higher rates because of relevancy, accuracy, and ease-of-use.
  • 8
    Interplay

    Interplay

    Iterate.ai

    Interplay Platform is a patented low-code platform with 475 pre-built connectors (enterprise, AI, IoT, Startup Technologies). It's used as middleware and as a rapid app building platform by big companies like Circle K, Ulta Beauty, and many others. As middleware, it operates Pay-by-Plate (frictionless payments at the gas pump) in Europe, Weapons Detection (to predict robberies), AI-based Chat, online personalization tools, low price guarantee tools, computer vision applications such as damage estimation, and much more. It also helps companies to go to market with their digital solutions 10X to 17X faster than in old ways.
  • 9
    Amazon Rekognition
    Amazon Rekognition makes it easy to add image and video analysis to your applications using proven, highly scalable, deep learning technology that requires no machine learning expertise to use. With Amazon Rekognition, you can identify objects, people, text, scenes, and activities in images and videos, as well as detect any inappropriate content. Amazon Rekognition also provides highly accurate facial analysis and facial search capabilities that you can use to detect, analyze, and compare faces for a wide variety of user verification, people counting, and public safety use cases. With Amazon Rekognition Custom Labels, you can identify the objects and scenes in images that are specific to your business needs. For example, you can build a model to classify specific machine parts on your assembly line or to detect unhealthy plants. Amazon Rekognition Custom Labels takes care of the heavy lifting of model development for you, so no machine learning experience is required.
  • 10
    Supervisely

    Supervisely

    Supervisely

    The leading platform for entire computer vision lifecycle. Iterate from image annotation to accurate neural networks 10x faster. With our best-in-class data labeling tools transform your images / videos / 3d point cloud into high-quality training data. Train your models, track experiments, visualize and continuously improve model predictions, build custom solution within the single environment. Our self-hosted solution guaranties data privacy, powerful customization capabilities, and easy integration into your technology stack. A turnkey solution for Computer Vision: multi-format data annotation & management, quality control at scale and neural networks training in end-to-end platform. Inspired by professional video editing software, created by data scientists for data scientists — the most powerful video labeling tool for machine learning and more.
  • 11
    Hive Data
    Create training datasets for computer vision models with our fully managed solution. We believe that data labeling is the most important factor in building effective deep learning models. We are committed to being the field's leading data labeling platform and helping companies take full advantage of AI's capabilities. Organize your media with discrete categories. Identify items of interest with one or many bounding boxes. Like bounding boxes, but with additional precision. Annotate objects with accurate width, depth, and height. Classify each pixel of an image. Mark individual points in an image. Annotate straight lines in an image. Measure, yaw, pitch, and roll of an item of interest. Annotate timestamps in video and audio content. Annotate freeform lines in an image.
    Starting Price: $25 per 1,000 annotations
  • 12
    FindFace

    FindFace

    NtechLab

    NtechLab platform processes video and recognizes human faces, bodies and actions, as well as cars and plate numbers. AI-powered technology enables record breaking accuracy and high speed of recognition. The multi-object and analytical capabilities of FindFace Multi unlock new scenarios for responding challenges of public sector and business. FindFace Multi quickly and accurately recognizes faces, human bodies, cars, and license plate numbers in a live video stream or in a video archive. Searching for faces, bodies, and vehicles in a database or in an archive is available both by a photo sample and by specific features, for example, by age, clothes color, or vehicle model. NtechLab developers are constantly improving recognition algorithms, increasing their performance and accuracy. With FindFace Multi it takes less than a second to detect a face in a video stream, recognize it, and search for a match in a database with billions of images.
  • 13
    Unleash live
    Unleash live is an A.I. video analytics enterprise solution provider. We take a vision from any camera and combine it with computer vision to deliver actionable data in real-time so that your organization has immediate insights to drive down costs, improve productivity, increase accuracy, and improve safety. Support for a wide range of cameras. Connect any combination of IP/CCTV, drone, body cam, mobile or robotic cameras. Live stream in the field and share it with your team while operations are in progress, or upload footage into your account. Apply A. I Apps from our app store to detect, inspect and monitor objects and items of interest or create 2D orthomaps and 3D models. Integrate results into your operational workflow, from live dashboards, to notifications and API integrations. Take the complexity and time out of collaboration. Instantly connect any mix of cameras to share over a live stream with stakeholders and 3rd parties. No plugs-in, no downloads, all in the browser.
    Starting Price: $99 per month
  • 14
    SiaSearch

    SiaSearch

    SiaSearch

    We want ML engineers to worry less about data engineering and focus on what they love, building better models in less time. Our product is a powerful framework that makes it 10x easier and faster for developers to explore, understand and share visual data at scale. Automatically create custom interval attributes using pre-trained extractors or any other model. Visualize data and analyze model performance using custom attributes combined with all common KPIs. Use custom attributes to query, find rare edge cases and curate new training data across your whole data lake. Easily save, edit, version, comment and share frames, sequences or objects with colleagues or 3rd parties. SiaSearch, a data management platform that automatically extracts frame-level, contextual metadata and utilizes it for fast data exploration, selection and evaluation. Automating these tasks with metadata can more than double engineering productivity and remove the bottleneck to building industrial AI.
  • 15
    VisionSense
    Real-time computer vision and advanced image processing solution that leverages advanced models of convolutional neural networks. The top application of the product has been in building management, identity verification and fraud detection, manufacturing and quality control. Winjit is one of India’s leading technology providers with over a decade of experience in innovating engineering solutions across industries.
  • 16
    Vyntelligence

    Vyntelligence

    Vyntelligence

    Boost operational efficiency and reduce risk and costs with the power of Vyn SmartVideoNotes. Video-enabled structured data capture into enterprise systems, to enhance and replace manual/text form fields in just 60 seconds. Timely, auto-labeled and rich data to drive higher compliance and productivity to save on costs as leaders gain better insight to act faster. Enterprise-grade security, open API SaaS platform designed for any workflow integration e.g. CRM (Salesforce), FSM and people systems. AI-powered Computer Vision & Natural language processing, video search and analyses deliver quantitative trends from qualitative data for richer, smarter business decisions. Bring your processes to life in a whole new way by quickly building intelligence from your field teams with vyn, so you see what’s happening and why. vyn captures SmartVideoNotes, on the go, by asking the right people the right questions at the right time - all in a minute or less.
  • 17
    Black.ai

    Black.ai

    Black.ai

    Respond to events and make better decisions with the help of AI and your existing IP camera infrastructure. Cameras are almost exclusively used for security and surveillance purposes. We add cutting-edge Machine Vision models to unlock a high-impact resource available to your team daily. We help you to improve operations for your staff and customers without compromising privacy. No facial recognition, or long-term tracking, no exceptions. Fewer people in the loop. A reliance on staff compiling and watching footage is invasive and unscalable. We help you to review only the things that matter and only at the right time. Black.ai creates a privacy layer that sits between security cameras and operations teams, so you can build a better experience for people without breaching their trust. Black.ai interfaces with your existing cameras using parallel streaming protocols. Our system is installed without additional infrastructure cost or any risk of obstructing operations.
  • 18
    Plainsight

    Plainsight

    Plainsight

    Remove the complexity from your machine learning projects with our vision AI platform built from the ground up for fast, effective video analytics application development. With easy, no-code point-and-click features all in one platform, Plainsight slashes your time-to-production and accelerates the success of vision AI-powered solutions across industries. Connect, administer, & control cameras, sensors & edge devices in one interface. Collect accurate training datasets to provide a high-quality training foundation for models. Accelerate labeling with smart polygon selection, predictive labeling, & automated object recognition. Easily train models with a breakthrough process designed to reduce time to vision AI solutions. Quickly deploy & scale applications at the edge, in the cloud, or on-premises to meet business needs.
  • 19
    TuMeke

    TuMeke

    TuMeke Ergonomics

    No need for wearables, goniometers, or other equipment. Measure and automatically track the safety of employees without stopping production. Stop filling out long assessment worksheets so you can focus on giving great recommendations. Manage videos and assessment results across teams and devices. Enterprise features to make the most of your resources. Our platform includes a phone and web app that work together to allow teams to collaborate across locations, get automatic recommendations on postures to investigate and a dashboard to track performance over time.
  • 20
    Amazon Lookout for Vision
    Easily create a machine learning (ML) model to spot anomalies from your live process line with as few as 30 images. Identify visual anomalies in real time to reduce and prevent defects and improve product quality. Prevent unplanned downtime and reduce operational costs by using visual inspection data to spot potential issues and take corrective action. Spot damage to a product’s surface quality, color, and shape during the fabrication and assembly process. Determine what’s missing based on the absence, presence, or placement of objects, like a missing capacitor in a printed circuit board. Detect defects with repeating patterns, such as repeated scratches in the same spot on a silicon wafer. Amazon Lookout for Vision is an ML service that uses computer vision to spot defects in manufactured products at scale. Spot product defects using computer vision to automate quality inspection.
  • 21
    Ai-RGUS

    Ai-RGUS

    Ai-RGUS

    Ai-rgususes Artificial Intelligence and custom-built software to automatically catch camera view problems; camera/NVR/DVR misconfigurations or failures; wrong timestamp; and missing or not enough days of recordings. With Ai-rgus you will save time compared to doing it manually and you will have peace of mind that your camera system has the footage you need before an incident. Efficient: Automated verification, saves time from manually reviewing cameras, and enables hassle-free camera system growth. AI verification is reliable and consistent. Proactive verification, providing confidence that desired image exists including for slip and falls and loss prevention cases. Ai-RGUS makes sure that the task of camera verification is done, with a consistent verification quality, and sends automatic email alerts.
  • 22
    CVEDIA

    CVEDIA

    CVEDIA

    CVEDIA-RT is our AI software stack that comes pre-installed with dozens of video analytics and computer vision solutions. It's easy to configure and customize to your use case, even if you're not a data scientist or developer. For a single low price, you have access to all of our AI solutions now and in the future. This means you can discover new use cases and expand your AI capabilities risk-free! If you couldn't find what you are looking for, or you want to run on another device, no problem. We are happy to develop custom solutions based on your requirements. Reach out to us for a free call! What sets us apart from everyone else is our use of synthetic data. Our analytics are more accurate, faster, and affordable than traditional solutions. Your team is busy and deadlines are near, we get it. If you like, we can take care of everything, from development to integration of the analytics. All you have to do is build a product around it!
    Starting Price: Free
  • 23
    FlyPix AI

    FlyPix AI

    FlyPix AI

    FlyPix AI is an object detection platform designed for analyzing satellite and drone imagery . It allows users to effortlessly detect, segment and localize objects and areas within geospatial data. Users can use FlyPix AI advanced functionalities to track changes and detect anomalies. Plus, it's user-friendly and intuitive interface empowers users without coding expertise to create customized use cases and extract valuable information from earth observation data.
    Starting Price: €890
  • 24
    Yandex Vision
    Yandex Vision OCR recognizes text in an image and outputs it along with automatic punctuation. The service supports and automatically identifies more than 50 languages. Extract standard fields and recognize text in templates and documents, e.g., passports, driver’s licenses, vehicle registration certificates, and license plates. With support for Russian and English, as well as combinations of handwritten and printed texts. The service scans the table structure and outputs text in row and column coordinates. Optical character recognition (OCR), document recognition, and license plate number recognition. Yandex Vision OCR allows you to work with JPEG, PNG, and PDF formats. File sizes should be no larger than 20 MB with no more than 300 pages per file. The service can scan images and find passports from 20 countries, driver’s licenses, vehicle registration documents, and license plates.
  • 25
    Campedia

    Campedia

    Campedia

    Campedia is like ChatGPT but for the real world. You snap a photo and ask any question. Identify a plant, ask about an attraction, or let it create a recipe from ingredients in your refrigerator. Campedia is powered by GPT-4 Vision, which is able to take in images and answer questions about them. It is a breakthrough new technology that enables an AI to see. A revolutionary new AI meets a radically simplified user interface. Campedia turns your entire screen into 1 single button. Simply tap & hold to snap, ask your question, and then release to get an answer. Campedia speaks your language. Currently, we support English, German, French, Italian, Spanish, Japanese, Korean, Portuguese and Chinese. Campedia is an AI camera App that works like ChatGPT but for photos. You simply snap a photo and can ask any question. Campedia can be used for an unlimited array of use cases. Popular examples are detecting plants or animals, and asking for info about a wine or a landmark.
    Starting Price: Free
  • 26
    Oxipital AI

    Oxipital AI

    Oxipital AI

    Our solutions are designed to have an immediate impact and require no code, no DIY, and no machine learning expertise to deploy into production. User-friendly, web-based setup tools, and dashboards take the mystery out of AI, leaving your business with insights that you can act on right now. Our fully integrated solutions enable manufacturers to tap into their most potent source of business intelligence, their own data. By addressing the most pervasive challenges of high-variability manufacturing environments, our visual AI platform provides the clarity to help businesses sharpen their operational vision. Our advanced AI vision supercharges operations in complex and high-variability manufacturing environments including food processing, agriculture, and consumer packaged goods, industries with challenges that evade existing machine vision technologies.
  • 27
    VisionAgent

    VisionAgent

    LandingAI

    VisionAgent is a generative Visual AI application builder developed by Landing AI, designed to accelerate the creation and deployment of vision-enabled applications. By inputting a simple prompt, users can describe their vision task, and VisionAgent intelligently selects the most suitable models from a curated collection of effective open-source models to address the task. It then generates, tests, and deploys the necessary code, enabling the rapid development of applications involving object detection, segmentation, object tracking, and activity recognition. This streamlined process allows developers to build vision-enabled applications in minutes, significantly reducing development time and effort. Enhance efficiency with instant code generation for custom post-processing steps. VisionAgent selects the best model for your use case from a curated collection of the most effective open-source models.
  • 28
    Chance AI

    Chance AI

    Chance AI

    Chance AI is an AI-powered visual search engine that enables users to interact with images to access information, news, and stories. Recognizing objects within images, allows users to delve into the layers of emotion and context behind each visual. This innovative tool aims to make art and imagery more accessible and meaningful, fostering genuine connections in an increasingly disconnected world. Founded by a team passionate about art and technology, Chance AI seeks to restore the richness of visual storytelling, providing insights that go beyond mere images. Users can explore and understand the narratives hidden within every picture, from the mysteries of distant planets to the history behind a painting in a museum. The platform is designed for creative and curious minds, offering a unique way to connect with the emotions and stories that art can inspire. By utilizing advanced visual intelligence, Chance AI transforms the way people interact with visual content.
    Starting Price: Free
  • 29
    SolVision

    SolVision

    Solomon

    SolVision is an advanced AI vision system developed by Solomon 3D, designed to enhance industrial automation through rapid and accurate visual inspections. Leveraging Solomon’s proprietary rapid AI model training technology, SolVision enables users to train AI models in minutes, significantly reducing setup time compared to traditional systems. It excels in various applications, including defect detection, item classification, optical character recognition, and presence/absence checks, making it suitable for industries such as manufacturing, food & beverage, textiles, and electronics. A standout feature is its ability to learn from as few as 1–5 image samples, streamlining the training process and minimizing the need for extensive data annotation. SolVision's intuitive user interface allows for simultaneous labeling of multiple defect types, facilitating complex classification tasks.
  • 30
    TechSee

    TechSee

    TechSee

    Deploy a unified platform to augment your organization with visual knowledge and automate processes over time. TechSee’s platform creates a single picture of customer issues across the organization, allowing warm transfer between channels and leveraging visual data to deliver AI-powered automation. The platform is proven to support large departments and tens of thousands of reps, with the ability to support more agents, technicians and end users in new geographic locations, without impacting availability or performance. The platform leverages visual data to automate processes using Computer Vision AI, including real-time decision support for agents and self- service for customers. A full record of the visual session history of each customer provides the organization with the context of each contact. This information can also be leveraged for internal collaboration, aligned with privacy policy.
    Starting Price: $29.99/month/user