Browse free open source Search Engines and projects below. Use the toggles on the left to filter open source Search Engines by OS, license, language, programming language, and project status.

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    The Lemur Project

    The Lemur Project

    Search engine and data mining applications and ClueWeb datasets.

    The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 2
    The stuff here has no documentation and some of it may never be completed. This is my playground, use at your own risk.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 3
    PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    XODA - the KISSed Document manager.

    XODA - the KISSed Document manager.

    Simple yet feature-rich Document Management System

    XODA is a KISSed (Keep Simple and Stupid) System for Organizing Documents using AJAX. This is a Document Management System without backend database, though making possible organizing files/directories by descriptions, filters and more. Visit xoda.org
    Downloads: 2 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    PHP Crawler is a simple website search script for small-to-medium websites. The only requrements are PHP and MySQL, no shell access required.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Carrot2
    Project moved to GitHub! https://github.com/carrot2/carrot2 Carrot2 is an Open Source Search Results Clustering Engine. It can automatically organize small collections of documents, e.g. search results, into thematic categories. Carrot2 integrates very well with both Open Source and proprietary search engines.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    SkyCD

    SkyCD

    SkyCD is a program for indexing your files in CDs and CDs also.

    SkyCD is a program for indexing your files in CDs and CDs also. All indexing information is saved in text files, so anyone can edit or view with existing text editor/viewer. You can also send these files to your friends & they will know what CD you have.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    Sperowider Website Archiving Suite is a set of Java applications, the primary purpose of which is to spider dynamic websites, and to create static distributable archives with a full text search index usable by an associated Java applet.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 10

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    A drop-in framework for adding tagging (folksonomy) capabilities to existing applications
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    QZARCH - Quick free-text search The project aims to deliver a light-weight file-based free-text search engine for Java based websites to adopt easily. The features include: - Search for one or more keywords in the content of one or more files -
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    "Swish-e is a fast, flexible, and free open source system for indexing collections of Web pages or other files" (http://swish-e.org/ ) This module provides a Python API for this software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A Microformatted Registry Alternative (AMRA) is an alternative web service description registry using simple, microformatted documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Robust featureful multi-threaded CLI web spider using apache commons httpclient v3.0 written in java. ASpider downloads any files matching your given mime-types from a website. Tries to reg.exp. match emails by default, logging all results using log4j.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (file systems, web sites, mail boxes, ...) and the file formats (documents, images, ...) occurring in these systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    A web app for creating a repository of pictures (our focus is birds). Users submit pictures, with a wizard that generates RDF descriptiors. Sumissions are forwarded to Admins for aproval. Instances will export the RDF so that repositories may cooperate.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Blogometro is simplified implementation of "Blodgex", to track the weblog's links and updates.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    BlueBox is PHP-MySQL powered search engine. It can be installed on every webserver without any permission. Only FTP and database management rights are required. BlueBox is very fast even on more than 1'000'000 pages scanned.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    The Cornell Web Lab Collaboration Server is a suite of tools and services for GUI-based extraction, analysis and sharing of archived web data. See http://weblab.infosci.cornell.edu/ and http://www.cs.cornell.edu/~weigel for details about the project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    CoverYourASP.com - complete Active Server Pages source (JScript) for this popular web site. Includes full membership system, diary, online db admin, banner ad system and loads more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Port of the Google sitemap generator, from Python to Csharp aka C-Sharp aka C# aka .NET aka dotNet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    DVDWeb is a Web Service which provides organization/search/lookup services through JAX-RPC API. The search can be done against the builtin DB (the user\'s private list of DVDs according to UPC codes) or against other Internet sites such as imdb or yahoo.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    An extensible framework for the automated harvesting, indexing and collation of publicly available data from internet resources for example RSS feeds and webservices.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Ever heard of Deep Web, thats the real Internet. The Deep Web is several orders of magnitude larger than the surface web and using Google's Advanced operators its now possible to search this unexplored part of the Internet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB