Showing 109 open source projects for "html parse"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    jsoup

    jsoup

    Java library for working with real-world HTML

    ...The parser will make every attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    html-react-parser

    html-react-parser

    HTML to React parser

    HTML to React parser that works on both the server (Node.js) and the client (browser). The parser converts an HTML string to one or more React elements. Available as part of the Tidelift Subscription. For TypeScript projects, you may need to check that domNode is an instance of domhandler's Element. Make sure to render parsed adjacent elements under a parent element.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    fast-xml-parser

    fast-xml-parser

    Validate XML, Parse XML and Build XML rapidly

    Validate XML, Parse XML to JS Object, or Build XML from JS Object without C/C++ based libraries and no callback.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    ngx-markdown

    ngx-markdown

    Angular markdown component/directive/pipe/service to parse static

    Angular markdown component/directive/pipe/service to parse static, dynamic or remote content to HTML with syntax highlight and more. As the library is using Marked parser you will need to add node_modules/marked/marked.min.js to your application. To use the line numbers plugin that shows line numbers in code blocks, in addition to Prism.js configuration files, you will need to include the following files from prismjs/plugins/line-numbers directory to your application. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    Floki

    Floki

    Floki is a simple HTML parser that enables search for nodes using CSS

    Floki is a simple HTML parser that enables search for nodes using CSS selectors. Floki needs the :leex module in order to compile. Normally this module is installed with Erlang in a complete installation. By default, Floki uses a patched version of mochiweb_html for parsing fragments due to its ease of installation (it's written in Erlang and has no outside dependencies). fast_html is generally faster, according to the benchmarks conducted by its developers.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Emogrifier

    Emogrifier

    Converts CSS styles into inline style attributes in your HTML code

    Emogrifier converts CSS styles into inline style attributes in your HTML code. This ensures proper display on email and mobile device readers that lack stylesheet support. This utility was developed as part of Intervals to deal with the problems posed by certain email clients (namely Outlook 2007 and GoogleMail) when it comes to the way they handle styling contained in HTML emails. As many web developers and designers already know, certain email clients are notorious for their lack of CSS...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    rehype

    rehype

    HTML processor powered by plugins part of the @unifiedjs collective

    ...You can use the many existing plugins or you can make your own. You can use rehype for many different things. unified is the core project that transforms content with ASTs. rehype adds support for HTML to unified. hast is the HTML AST that rehype uses. Depending on the input you have and output you want, you can use different parts of rehype. If the input is HTML, you can use rehype-parse with unified. If the output is HTML, you can use rehype-stringify with unified If both the input and output are HTML, you can use rehype on its own.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Symfony DomCrawler

    Symfony DomCrawler

    Eases DOM navigation for HTML and XML documents

    Symfony DomCrawler is a PHP component that provides powerful tools for navigating and extracting data from HTML and XML documents. It allows developers to parse, filter, and manipulate web pages using CSS selectors and XPath expressions. DomCrawler is widely used for web scraping, testing, and processing structured content, and integrates well with other Symfony components like BrowserKit.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    HtmlSanitizer

    HtmlSanitizer

    Cleans HTML to avoid XSS attacks

    HtmlSanitizer is a .NET library for cleaning HTML fragments and documents from constructs that can lead to XSS attacks. It uses AngleSharp to parse, manipulate, and render HTML and CSS. Because HtmlSanitizer is based on a robust HTML parser it can also shield you from deliberate or accidental "tag poisoning" where invalid HTML in one fragment can corrupt the whole document leading to broken layout or style.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    LlamaParse

    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured, and semi-structured, to structured data (API's, PDFs, documents, SQL, etc.) Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL db providers.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Crowbook

    Crowbook

    Converts books written in Markdown to HTML, LaTeX/PDF and EPUB

    Crowbook's aim is to allow you to write a book in Markdown without worrying about formatting or typography and let the program generate HTML, PDF and EPUB output for you. Its focus is novels and fiction, and the default settings should (hopefully) generate readable books with correct typography without requiring you to worry about it. To see what Crowbook's output looks like, you can read the Crowbook guide rendered in HTML, PDF or EPUB. Crowbook will parse this file and generate HTML, EPUB, and/or PDF output formats, according to the settings in the configuration file. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    commonmark-java

    commonmark-java

    Java library for parsing and rendering CommonMark (Markdown)

    Java library for parsing and rendering Markdown text according to the CommonMark specification (and some extensions). Provides classes for parsing input to an abstract syntax tree of nodes (AST), visiting and manipulating nodes, and rendering to HTML. It started out as a port of commonmark.js, but has since evolved into a full library with a nice API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Markdig

    Markdig

    A fast, powerful, CommonMark compliant, extensible Markdown processor

    ...Parses trivia (whitespace, newlines and other characters) to support lossless parse ⭢ render roundtrip. This enables changing markdown documents without introducing undesired trivia changes. Special attributes or attached HTML attributes (inspired from PHP Markdown Extra - Special Attributes). Diagrams extension whenever a fenced code block contains a special keyword, it will be converted to a div block with the content as-is.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    SAN

    SAN

    A fast, portable, flexible JavaScript component framework

    ...By modifying the data, the view engine can directly refresh the view area that needs to be changed without any detection and better performance. Optimized for the first screen and output HTML from the server-side. Reverse parse components from existing elements and build bindings. Volume less that 17kb (gzipped), no need to worry about the burden of page downloads. The gospel of patients with volume obsessive. The ESNext Module or AMD can be selected in the project. Of course, if you want to use global variables it is also supported.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Highlightr

    Highlightr

    iOS & OSX Syntax Highlighter

    ...Carthage is a decentralized dependency manager that builds your dependencies and provides you with binary frameworks. Yes, Highlightr relies on iOS & macOS JavaScriptCore to parse the code using highlight.js. This is actually quite fast! It will never be as fast as a native solution, but it's fast enough to be used on a real-time editor. It comes with a custom-made HTML parser for creating NSAttributtedStrings, is pre-processing the themes, and is preloading the JS libraries. As result it's taking around of 50 ms on my iPhone 6s for processing 500 lines of code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    jQuery Calx

    jQuery Calx

    A jQuery plugin for creating formula-based calculation form

    jQuery Calx is an Excel calculation engine and formula parser built as a jQuery plugin, it offers spreadsheet ability to process and calculate the formula without bringing a table-like spreadsheet interface. In the core of jQuery Calx is a formula parser generated using jison which parse and processes the formula and set of formulas imported from formula.js.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Markdown-Resume-Template

    Markdown-Resume-Template

    Resume template designed to be edited in Markdown

    ...Because it’s plain text, you can track changes in Git, branch for different roles, and keep a full revision history of your career materials. The structure avoids flashy but unreadable designs, favoring a hierarchy and layout that ATS systems and human reviewers can parse quickly. You can fork the repo, make it your own, and export to HTML or PDF using standard tooling. It’s especially useful for developers who want a low-friction, repeatable way to maintain multiple resume variants while keeping formatting consistent.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    Cheerio

    Cheerio

    Implementation of core jQuery designed for the server

    ...As a result parsing, manipulating, and rendering are incredibly efficient. Cheerio wraps around parse5 parser and can optionally use @FB55's forgiving htmlparser2. Cheerio can parse nearly any HTML or XML document. Cheerio parses markup and provides an API for traversing/manipulating the resulting data structure. It does not interpret the result as a web browser does. Specifically, it does not produce a visual rendering, apply CSS, load external resources, or execute JavaScript. If your use case requires any of this functionality, you should consider projects like Puppeteer or JSDom.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    panicparse

    panicparse

    Crash your app in style (Golang)

    Parses panic stack traces, densifies and deduplicates goroutines with similar stack traces. Helps debugging crashes and deadlocks in heavily parallelized processes. Race detector support, e.g. it can parse output produced by go test -race. HTML export. Easy to use as an HTTP Handler middleware. High-performance parsing. HTTP web server that serves a very tight and swell snapshot of your goroutines, much more readable than net/http/pprof. >50% more compact output than original stack dump yet more readable. Deduplicates redundant goroutine stacks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    XRAY

    XRAY

    XRay for recon, mapping and OSINT gathering from public networks

    XRAY is a modular security toolset that helps developers and security professionals analyze, fuzz, and test web applications, protocols, and network services for vulnerabilities. It provides a framework for writing and executing inspection modules that can parse structured data (JSON, XML, HTML), traverse graphs of endpoints, and perform intelligent probing guided by discovered surface area. XRay is typically used as a reconnaissance and vulnerability discovery engine in red-team or app-security workflows: it leverages extensible plugins to adapt to different protocols, inject payloads, and detect common bug classes such as injection flaws, misconfigurations, and unsafe endpoints. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    GoNB

    GoNB

    GoNB, a Go Notebook Kernel for Jupyter

    ...GoNB leverages that compilation speed to implement a full-featured (at least it's getting there) Jupyter notebook kernel. As a side benefit it works with packages that use CGO — although it won't parse C code in the cells, so it can't be used as a C kernel. It already includes many goodies: cache between cell of results, contextual help and auto-complete (with gopls), compilation error context (by mousing over), bash command execution, images, html, etc. See the tutorial. It's been heavily used by the author (in developing GoMLX, a machine learning framework for Go), but should still be seen as experimental — if we hear success stories from others, we can change this.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    CyberScraper 2077

    CyberScraper 2077

    A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

    CyberScraper 2077 is not just another web scraping tool – it's a glimpse into the future of data extraction. Born from the neon-lit streets of a cyberpunk world, this AI-powered scraper uses OpenAI, Gemini and LocalLLM Models to slice through the web's defenses, extracting the data you need with unparalleled precision and style.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    AngleSharp

    AngleSharp

    The ultimate angle brackets parser library parsing HTML5, MathML, SVG

    AngleSharp follows the W3C specifications and gives you the same results as state of the art browsers. Besides the official API AngleSharp adds some useful extension methods on top. This makes working with the DOM convenient. AngleSharp integrates everything you need to explore and mutate the DOM tree. Node retrieval is straight forward by using powerful CSS query selectors. The CSS queries in AngleSharp are super fast and very simple to use. AngleSharp respects the relationship of HTML with...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    HTTP Shortcuts for Android

    HTTP Shortcuts for Android

    Android app to create home screen shortcuts

    A simple Android app that allows you to create shortcuts that can be placed on your home screen. Each shortcut, when clicked, triggers an HTTP request, with the possibility to process and display the response in various ways. Run arbitrary JavaScript before & after execution, which allows for lots of customization with features such as compute values such as timestamps, random numbers, UUIDs, hashes, HMACs, base64, etc. Parse JSON or XML and extract data from them. Show toast message or...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    Python-Spider

    Python-Spider

    Python3 web crawler practice

    Python-Spider is a repository intended to teach or provide examples for writing web spiders / crawlers in Python — part of a broader learning and resource collection by its author. The code and documentation are oriented toward beginners or intermediate learners who want to learn how to fetch, parse, and extract data from websites programmatically. As part of the author’s public learning-path repositories, python-spider likely includes examples of HTTP requests, HTML parsing, maybe concurrency or scheduling to crawl multiple pages, and techniques to handle common web-scraping issues. For people wanting to get hands-on with building scrapers, collecting data, or learning how to navigate web programming in Python, this repository acts as a didactic reference or starting point. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB