Alternatives to Beautiful Soup
Compare Beautiful Soup alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Beautiful Soup in 2026. Compare features, ratings, user reviews, pricing, and more from Beautiful Soup competitors and alternatives in order to make an informed decision for your business.
-
1
Firecrawl
Firecrawl
Crawl and convert any website into clean markdown or structured data, it's also open source. We crawl all accessible subpages and give you a clean markdown for each, no sitemap is required. Enhance your applications with top-tier web scraping and crawling capabilities. Extract markdown or structured data from websites quickly and efficiently. Navigate and retrieve data from all accessible subpages, even without a sitemap. Already fully integrated with the greatest existing tools and workflows. Kick off your journey for free and scale seamlessly as your project expands. Developed transparently and collaboratively. Join our community of contributors. Firecrawl crawls all accessible subpages, even without a sitemap. Firecrawl gathers data even if a website uses JavaScript to render content. Firecrawl returns clean, well-formatted markdown, ready for use in LLM applications. Firecrawl orchestrates the crawling process in parallel for the fastest results.Starting Price: $16 per month -
2
TextBlob
TextBlob
TextBlob is a Python library for processing textual data, offering a simple API to perform common natural language processing tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, and classification. It stands on the giant shoulders of NLTK and Pattern, and plays nicely with both. Key features include tokenization (splitting text into words and sentences), word and phrase frequencies, parsing, n-grams, word inflection (pluralization and singularization) lemmatization, spelling correction, and WordNet integration. TextBlob is compatible with Python versions 2.7 and above, and 3.5 and above. It is actively developed on GitHub and is licensed under the MIT License. Comprehensive documentation, including a quick start guide and tutorials, is available to assist users in implementing various NLP tasks. -
3
BrainSoup
Nurgo
With BrainSoup, transform your way of working. Here, you craft custom agents, each built to serve a specific need. From routine tasks, to complex assignments, BrainSoup's agents have the potential to revolutionize your workflow. But the recipe to efficiency doesn't stop at solo tasking. In BrainSoup, agents seamlessly work together, enabling multi-agent collaborations to conquer complex projects. And the best part? All this is managed through simple, natural language. Speak to BrainSoup's agents as you would do with any team member, giving instructions and driving automations through natural conversations. With BrainSoup, you have an adaptable team right on your desktop. Enhance AI knowledge using your documents providing valuable guidance to the AI agents, or let the flow of conversations between you and your agents organically expand their knowledge base. -
4
jsoup
jsoup
jsoup is a Java library that simplifies working with real-world HTML and XML. It offers an easy-to-use API for URL fetching, data parsing, extraction, and manipulation using DOM API methods, CSS, and XPath selectors. jsoup implements the WHATWG HTML5 specification and parses HTML to the same DOM as modern browsers. With jsoup, you can scrape and parse HTML from a URL, file, or string; find and extract data using DOM traversal or CSS selectors; manipulate HTML elements, attributes, and text; clean user-submitted content against a safelist to prevent XSS attacks; and output tidy HTML. jsoup is designed to deal with all varieties of HTML found in the wild, from pristine and validating to invalid tag-soup, creating a sensible parse tree. For example, you can fetch the Wikipedia homepage, parse it to a DOM, and select the headlines from the "In the news" section into a list of elements. -
5
Dux-Soup
Dux-Soup
Dux-Soup is a LinkedIn lead generation tool born out of necessity. Back in 2015 founder Will van der Sanden realized he was spending waaay too much time obtaining, compiling and managing qualified contacts from LinkedIn. His focus needed to be on nurturing prospects through the sales process and winning more business. Put simply, Will knew he needed to work smarter to grow faster. Dux-Soup automatically engages with the prospects you select on LinkedIn. It endorses their skills, sends them personalized connection requests and multiple, delayed, personalized direct messages. It does this on your behalf, from your browser, providing you with full control and visibility of the activity in your LinkedIn account. Our mission is to make LinkedIn Automation accessible to everyone. That’s why our plans are cost-effective and flexible, to meet the needs of individuals, teams and digital agencies.Starting Price: $11.25 per month -
6
broot
broot
The ROOT data analysis framework is used much in High Energy Physics (HEP) and has its own output format (.root). ROOT can be easily interfaced with software written in C++. For software tools in Python there exists pyROOT. Unfortunately, pyROOT does not work well with python3.4. broot is a small library that converts data in python numpy ndarrays to ROOT files containing trees with a branch for each array. The goal of this library is to provide a generic way of writing python numpy datastructures to ROOT files. The library should be portable and supports both python2, python3, ROOT v5 and ROOT v6 (requiring no modifications on the ROOT part, just the default installation). Installation of the library should only require a user to compile to library once or install it as a python package.Starting Price: Free -
7
ruffus
ruffus
Ruffus is a computation pipeline library for python. It is open-sourced, powerful and user-friendly, and widely used in science and bioinformatics. Ruffus is designed to allow scientific and other analyses to be automated with the minimum of fuss and the least effort. Suitable for the simplest of tasks. Handles even fiendishly complicated pipelines which would cause make or scons to go cross-eyed and recursive. No "clever magic", no pre-processing. Unambitious, the lightweight syntax which tries to do this one small thing well. Ruffus is available under the permissive MIT free software license. This permits free use and inclusion even within proprietary software. It is good practice to run your pipeline in a temporary, “working” directory away from your original data. Ruffus is a lightweight python module for building computational pipelines. Ruffus requires Python 2.6 or higher or Python 3.0 or higher.Starting Price: Free -
8
CVXOPT
CVXOPT
CVXOPT is a free software package for convex optimization based on the Python programming language. It can be used with the interactive Python interpreter, on the command line by executing Python scripts, or integrated in other software via Python extension modules. Its main purpose is to make the development of software for convex optimization applications straightforward by building on Python’s extensive standard library and on the strengths of Python as a high-level programming language. Efficient Python classes for dense and sparse matrices (real and complex), with Python indexing and slicing and overloaded operations for matrix arithmetic. Interfaces to the linear programming solver in GLPK, the semidefinite programming solver in DSDP5, and the linear, quadratic and second-order cone programming solvers in MOSEK.Starting Price: Free -
9
parsel
Python Software Foundation
Parsel is a BSD-licensed Python library to extract and remove data from HTML and XML using XPath and CSS selectors, optionally combined with regular expressions. Create a selector object for the HTML or XML text that you want to parse. Then use CSS or XPath expressions to select elements. CSS is a language for applying styles to HTML documents. It defines selectors to associate those styles with specific HTML elements. XPath is a language for selecting nodes in XML documents, which can also be used with HTML. You can use either CSS or XPath. CSS is usually more readable, but some things can only be done with XPath. Being built atop lxml, parsel selectors support some EXSLT extensions and come with pre-registered namespaces to use in XPath expressions. Parsel selectors allow you to chain selectors, so most of the time you can just select by class using CSS and then switch to XPath when needed.Starting Price: Free -
10
pyglet
pyglet
The cross-platform windowing and multimedia library for Python. pyglet is a powerful, yet easy-to-use Python library for developing games and other visually-rich applications on Windows, Mac OS X, and Linux. It supports windowing, user interface event handling, Joysticks, OpenGL graphics, loading images, and videos, and playing sounds and music. All of this with a friendly Pythonic API, that's simple to learn and doesn't get in your way. pyglet is provided under the BSD open-source license, allowing you to use it for both commercial and other open-source projects with very little restriction. No external dependencies or installation requirements. For most application and game requirements, pyglet needs nothing else besides Python, simplifying distribution and installation. This makes it easy to package your project with freezers such as PyInstaller. pyglet provides real platform native windows, allowing you to take advantage of multiple windows and multi-monitor desktops.Starting Price: Free -
11
urllib3
urllib3
urllib3 is a powerful, user-friendly HTTP client for Python. Much of the Python ecosystem already uses urllib3 and you should too. urllib3 brings many critical features that are missing from the Python standard libraries. Thread safety, connection pooling, client-side TLS/SSL verification. File uploads with multipart encoding. Helpers for retrying requests and dealing with HTTP redirects. Support for gzip, deflate, and brotli encoding. Proxy support for HTTP and SOCKS. 100% test coverage. urllib3 is one of the most downloaded packages on PyPI and is a dependency of many popular Python packages like Requests, Pip, and more! urllib3 is made available under the MIT License. The API Reference documentation provides API-level documentation. The User Guide is the place to go to learn how to use the library and accomplish common tasks. The more in-depth Advanced Usage guide is the place to go for lower-level tweaking.Starting Price: Free -
12
UnionSoft
UnionSoft
UnionSoft, LLC specializes in custom development for Labor Union Software, HealthCare Software and Human Resources/Labor Relations Professionals. UnionSoft's Software as a Service (SaaS) model has one key goal. It is simple to use, regardless of our clients' computer skills. We have built custom solutions accessed in the cloud since 1998. And, we can custom develop a system for your organization and support/host it securely online or implement it on your in-house secure server(s). Notice our portfolio of services below and you will understand why UnionSoft is the choice of great companies like Southwest Airlines Co and Campbell Soup. -
13
pygame
pygame
Pygame is a set of Python modules designed for writing video games. Pygame adds functionality on top of the excellent SDL library. This allows you to create fully featured games and multimedia programs in the python language. Pygame is highly portable and runs on nearly every platform and operating system. Pygame is free. Released under the LGPL license, you can create open-source, freeware, shareware, and commercial games with it. With dual-core CPUs common, and 8-core CPUs cheaply available on desktop systems, making use of multi-core CPUs allows you to do more in your game. Selected pygame functions release the dreaded python GIL, which is something you can do from C code. Uses optimized C and assembly code for core functions. C code is often 10-20 times faster than python code, and assembly code can easily be 100x or more times faster than python code. Comes with many operating systems. Just an apt-get, emerge, pkg_add, or just install away.Starting Price: Free -
14
websockets
Python Software Foundation
An implementation of the WebSocket Protocol (RFC 6455 & 7692). websockets is a library for building WebSocket servers and clients in Python with a focus on correctness, simplicity, robustness, and performance. Built on top of asyncio, Python’s standard asynchronous I/O framework, it provides an elegant coroutine-based API. websockets is heavily tested for compliance with RFC 6455. Continuous integration fails under 100% branch coverage. websockets is built for production. For example, it was the only library to handle backpressure correctly before the issue became widely known in the Python community. Memory usage is optimized and configurable. A C extension accelerates expensive operations. It’s pre-compiled for Linux, macOS, and Windows and packaged in the wheel format for each system and Python version. websockets takes care of everything under the hood so you can focus on your application!Starting Price: Free -
15
Chomp SMS
Chomp SMS
A souped-up ultra-customizable alternative to the boring stock messaging app. Chomp comes with a bunch of great features, notably a passcode lock, privacy options, scheduled SMS sender (reminders, birthday wishes), stop a text while sending, pin favs to top, backup, blocklisting / SMS blocker, signatures, text snippets, quick reply popup (even on lock screen), multi-select picture gallery, dual sim, better MMS and group messaging and much more. Plus unlimited customization options when receiving messages: change notification LED colors, ringtones, and vibrate patterns. In Chomp you can also customize the screen color, font style, text size and background wallpaper. -
16
imageio
imageio
Imageio is a Python library that provides an easy interface to read and write a wide range of image data, including animated images, volumetric data, and scientific formats. It is cross-platform, runs on Python 3.5+, and is easy to install. Imageio is written in pure Python, so installation is easy. Imageio works on Python 3.5+. It also works on Pypy. Imageio depends on Numpy and Pillow. For some formats, imageio needs additional libraries/executables (e.g. ffmpeg), which imageio helps you to download/install. If something doesn’t work as it should, you need to know where to search for causes. The overview on this page aims to help you in this regard by giving you an idea of how things work, and - hence - where things may go sideways.Starting Price: Free -
17
Contrast Security
Contrast Security
Modern software development must match the speed of the business. But the modern AppSec tool soup lacks integration and creates complexity that slows software development life cycles. Contrast simplifies the complexity that impedes today’s development teams. Legacy AppSec employs a one-size-fits-all vulnerability detection and remediation approach that is inefficient and costly. Contrast automatically applies the best analysis and remediation technique, dramatically improving efficiencies and efficacy. Separate AppSec tools create silos that obfuscate the gathering of actionable intelligence across the application attack surface. Contrast delivers centralized observability that is critical to managing risks and capitalizing on operational efficiencies, both for security and development teams. Contrast Scan is pipeline native and delivers the speed, accuracy, and integration demanded by modern software development.Starting Price: $0 -
18
Plotly Dash
Plotly
Dash & Dash Enterprise let you build & deploy analytic web apps using Python, R, and Julia. No JavaScript or DevOps required. Through Dash, the world's largest companies elevate AI, ML, and Python analytics to business users at 5% the cost of a full-stack development approach. Deliver apps and dashboards that run advanced analytics: ML, NLP, forecasting, computer vision and more. Work in the languages you love: Python, R, and Julia. Reduce costs by migrating legacy, per-seat licensed software to Dash Enterprise's open-core, unlimited end-user pricing model. Move faster by deploying and updating Dash apps without an IT or DevOps team. Create pixel-perfect dashboards & web apps, without writing any CSS. Scale effortlessly with Kubernetes. Support mission-critical Python applications with high availability. -
19
Campus Cloud Services SIS
Campus Cloud Services
Campus Cloud Services’ Student Information System (SIS) is the first cloud based, real time SIS utilizing the newest, secure technologies. Low cost annual subscription with no long term commitment. All maintenance and regulatory updates are free. Expert, hands-on implementation support and training are included. No costly infrastructure required. You get our entire system for one affordable price. No hidden fees. No surprise charges. Just plain and simple pricing. Complete training and customer support included in price. Never pay a cent for servers, bandwidth and data storage. Powerful functionality on any device, any time, at any location. Backups and system security are included. Guaranteed up time and built in hot sites mean no business interruptions. It is a “soup to nuts” solution to the complexities of enrolling, tracking and managing a student from start to graduate placement and everything in between. -
20
Pillow
Pillow
The Python Imaging Library adds image processing capabilities to your Python interpreter. This library provides extensive file format support, an efficient internal representation, and fairly powerful image processing capabilities. The core image library is designed for fast access to data stored in a few basic pixel formats. It should provide a solid foundation for a general image processing tool. Pillow for enterprise is available via the Tidelift subscription. The Python Imaging Library is ideal for image archival and batch processing applications. You can use the library to create thumbnails, convert between file formats, print images, etc. The current version identifies and reads a large number of formats. Write support is intentionally restricted to the most commonly used interchange and presentation formats. The library contains basic image processing functionality, including point operations, filtering with a set of built-in convolution kernels, and color space conversions.Starting Price: Free -
21
yarl
Python Software Foundation
All URL parts, scheme, user, password, host, port, path, query, and fragment are accessible by properties. All URL manipulations produce a new URL object. Strings passed to constructor and modification methods are automatically encoded giving canonical representation as result. Regular properties are percent-decoded, use raw_ versions for getting encoded strings. Human-readable representation of URL is available as .human_repr(). PyPI contains binary wheels for Linux, Windows and MacOS. If you want to install yarl on another operating system (like Alpine Linux, which is not manylinux-compliant because of the missing glibc and therefore, cannot be used with our wheels) the tarball will be used to compile the library from the source code. It requires a C compiler and Python headers installed. Please note that the pure-Python (uncompiled) version is much slower. However, PyPy always uses a pure-Python implementation, and, as such, it is unaffected by this variable.Starting Price: Free -
22
Pylons
Python Software Foundation
The Pylons web framework is designed for building web applications and sites in an easy and concise manner. They can range from as small as a single Python module, to a substantial directory layout for larger and more complex web applications. Pylons comes with project templates that help boot-strap a new web application project, or you can start from scratch and set things up exactly as desired. A framework to make writing web applications in Python easy. Utilizes a minimalist, component-based philosophy that makes it easy to expand on. Harness existing knowledge about Python. Extensible application design. Fast and efficient, an incredibly small per-request call stack provides top performance. Uses existing and well-tested Python packages. Pylons 1.0 series is stable and production-ready but in maintenance-only mode. The Pylons Project now maintains the Pyramid web framework for future development. Pylons 1.0 users should strongly consider using Pyramid for their next project.Starting Price: Free -
23
Kiteworks
Kiteworks
The only security platform authorized by FedRAMP that provides support for file sharing, managed file transfer, and email data communications to meet the compliance requirements of standards such as CMMC 2.0, ITAR, IRAP, NIS 2, HIPAA, and others. A content communication “tool soup” ratchets up cost and resource inefficiencies. Managing zero-trust security policies centrally is virtually impossible, and organizations lack consolidated security and compliance visibility over the communications of sensitive content, which increases security and compliance risks. Compliance and security risks increase due to the lack of governance. Organizations must control and track who can access content, who can edit it, to whom it can be sent and shared, and where it is sent and shared. Cybercriminals and malicious insiders target sensitive content like PII, IP, financial documents, and PHI because it can be monetized or even weaponized. -
24
Genesis HR
Genesis HR Solutions
Make sense of the alphabet soup of employer regulations. Ever-changing laws that vary by state and the size of your company can seem impossible to understand, much less enforce. We’ll help simplify it, so you can stay in front of regulations that impact your business. We handle single and multi-state payroll, and we give you precise payroll and tax administration so you can file with confidence. We simplify ever-changing laws and regulations so that you remain compliant. We position to capitalize on change—not get lost in it. We ensure you’re set up for success, whether it means navigating employees in multiple states or across multiple countries. Whatever the challenge, we have you covered. We ensure smooth onboarding and reporting, and we’ll help develop HR practices that support your culture and encourage growth. -
25
NutriAssistant
NutriAssistant
Using NutriAssistant's cutting-edge algorithm you can literally cut down the time required planning a 7-day meal plan for your client to a few minutes. On top of the recombined, enhanced and new technical features we also have a completely new and humble philosophy to easing the global obesity crises. Professional grade weekly meal plan creation in under 60 seconds. We consider 46 macro and micronutrients +15 types of allergens. Such as smoothies, soups, main dishes and so much more. Vegan, Pescetarian, Halal, Kosher and others. Simply adjustable macro and micronutrient requirements to fit your client's needs. Easily access & share meal plans from any device. It is possible to integrate NutriAssistant's meal-planning capabilities through our APIs. Interested? Unlimited creation of meal plans. Share 20 meal plans per month.Starting Price: $19 per user per month -
26
python-docx
python-docx
python-docx is a Python library for creating and updating Microsoft Word (.docx) files. Paragraphs are fundamental in Word. They’re used for body text, but also for headings and list items like bullets. You’re free to specify both width and height, but usually, you wouldn’t want to. If you specify only one, python-docx uses it to calculate the properly scaled value of the other. This way the aspect ratio is preserved and your picture doesn’t look stretched. If you don’t know what a Word paragraph style is you should definitely check it out. Basically, it allows you to apply a whole set of formatting options to a paragraph at once. python-docx allows you to create new documents as well as make changes to existing ones. Actually, it only lets you make changes to existing documents; it’s just that if you start with a document that doesn’t have any content, it might feel at first like you’re creating one from scratch.Starting Price: Free -
27
openpyxl
openpyxl
openpyxl is a Python library to read/write Excel 2010 xlsx/xlsm/xltx/xltm files. It was born from a lack of an existing library to read/write natively from Python in the Office Open XML format. All kudos to the PHPExcel team as openpyxl was initially based on PHPExcel. By default, openpyxl does not guard against quadratic blowup or billion laughs XML attacks. To guard against these attacks install defusedxml. Install openpyxl using pip. It is advisable to do this in a Python virtualenv without system packages. Sometimes you might want to work with the checkout of a particular version. This may be the case if bugs have been fixed but a release has not yet been made. There is no need to create a file on the filesystem to get started with openpyxl. Just import the Workbook class and start work. Sheets are given a name automatically when they are created. Once you gave a worksheet a name, you can get it as a key of the workbook.Starting Price: Free -
28
Mako
Mako
It provides a familiar, non-XML syntax that compiles into Python modules for maximum performance. Mako's syntax and API borrows from the best ideas of many others, including Django and Jinja2 templates, Cheetah, Myghty, and Genshi. Conceptually, Mako is an embedded Python (i.e. Python Server Page) language, which refines the familiar ideas of componentized layout and inheritance to produce one of the most straightforward and flexible models available, while also maintaining close ties to Python calling and scoping semantics. As templates are ultimately compiled into Python bytecode, Mako's approach is extremely efficient and was originally written to be just as fast as Cheetah. Today, Mako is very close in speed to Jinja2, which uses a similar approach and for which Mako was an inspiration. Can access variables from their enclosing scope as well as the template's request contextStarting Price: Free -
29
pexpect
pexpect
Pexpect makes Python a better tool for controlling other applications. Pexpect is a pure Python module for spawning child applications; controlling them, and responding to expected patterns in their output. Pexpect works like Don Libes’ Expect. Pexpect allows your script to spawn a child application and control it as if a human were typing commands. Pexpect can be used for automating interactive applications such as ssh, FTP, passwd, telnet, etc. It can be used to automate setup scripts for duplicating software package installations on different servers. It can be used for automated software testing. Pexpect is in the spirit of Don Libes’ Expect, but Pexpect is pure Python. Unlike other Expect-like modules for Python, Pexpect does not require TCL or Expect nor does it require C extensions to be compiled. It should work on any platform that supports the standard Python pty module. The Pexpect interface was designed to be easy to use.Starting Price: Free -
30
tox
tox
tox aims to automate and standardize testing in Python. It is part of a larger vision of easing the packaging, testing and release process of Python software. tox is a generic virtualenv management and test command-line tool you can use for checking that your package installs correctly with different Python versions and interpreters, running your tests in each of the environments, configuring your test tool of choice, and acting as a frontend to continuous integration servers, greatly reducing boilerplate and merging CI and shell-based testing. First, install tox with pip install tox. Then put basic information about your project and the test environments you want your project to run in into a tox.ini file residing right next to your setup.py file. You can also try generating a tox.ini file automatically, by running tox-quickstart and then answering a few simple questions. Install and test your project against Python2.7 and Python3.6.Starting Price: Free -
31
PyQtGraph
PyQtGraph
PyQtGraph is a pure-python graphics and GUI library built on PyQt/PySide and NumPy. It is intended for use in mathematics/scientific/engineering applications. Despite being written entirely in python, the library is very fast due to its heavy leverage of NumPy for number crunching and Qt's GraphicsView framework for fast display. PyQtGraph is distributed under the MIT open-source license. Basic 2D plotting in interactive view boxes. Line and scatter plots. Data can be panned/scaled by mouse. Fast drawing for real-time data display and interaction. Displays most data types (int or float; any bit depth; RGB, RGBA, or luminance). Functions for slicing multidimensional images at arbitrary angles (great for MRI data). Rapid update for video display or real-time interaction. Image display with interactive lookup tables and level control. Mesh rendering with isosurface generation. Interactive viewports rotate/zoom with mouse. Basic 3D scenegraph for easier programming.Starting Price: Free -
32
python-sql
Python Software Foundation
python-sql is a library to write SQL queries in a pythonic way. Simple selects, select with where condition. Select with join or select with multiple joins. Select with group_by and select with output name. Select with order_by, or select with sub-select. Select on other schema and insert query with default values. Insert query with values, and insert query with query. Update query with values. Update query with where condition. Update query with from the list. Delete query with where condition, and delete query with sub-query. Provides limit style, qmark style, and numeric style.Starting Price: Free -
33
NetworkX
NetworkX
NetworkX is a Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks. Generators for classic graphs, random graphs, and synthetic networks. Additional benefits from Python include fast prototyping, easy to teach, and multi-platform. Network structure and analysis measures.Starting Price: Free -
34
Kartograph
Kartograph
Kartograph is a simple and lightweight framework for building interactive map applications without Google Maps or any other mapping service. It was created with the needs of designers and data journalists in mind. Actually, Kartograph is two libraries. One generates beautiful & compact SVG maps; the other helps you to create interactive maps that run across all major browsers. A powerful Python library for generating beautiful, Illustrator-friendly SVG maps. A JavaScript library for creating interactive maps based on Kartograph SVG maps. -
35
zope.interface
Python Software Foundation
This package is intended to be independently reusable in any Python project. It is maintained by the Zope Toolkit project. This package provides an implementation of “object interfaces” for Python. Interfaces are a mechanism for labeling objects as conforming to a given API or contract. So, this package can be considered as an implementation of the Design By Contract methodology support in Python. Interfaces are objects that specify (document) the external behavior of objects that “provide” them. An interface specifies behavior through informal documentation in a doc string, attribute definitions, and invariants, which are conditions that must hold for objects that provide the interface. Attribute definitions specify specific attributes. They define the attribute name and provide documentation and constraints of attribute values. Attribute definitions can take a number of forms.Starting Price: Free -
36
h5py
HDF5
The h5py package is a Pythonic interface to the HDF5 binary data format. It lets you store huge amounts of numerical data, and easily manipulate that data from NumPy. For example, you can slice into multi-terabyte datasets stored on disk, as if they were real NumPy arrays. Thousands of datasets can be stored in a single file, categorized and tagged however you want. H5py uses straightforward NumPy and Python metaphors, like dictionary and NumPy array syntax. For example, you can iterate over datasets in a file, or check out the .shape or .dtype attributes of datasets. You don't need to know anything special about HDF5 to get started. In addition to the easy-to-use high level interface, h5py rests on a object-oriented Cython wrapping of the HDF5 C API. Almost anything you can do from C in HDF5, you can do from h5py.Starting Price: Free -
37
Luminal
Luminal
Luminal gives you the power of Python for spreadsheet processing with none of the complexity. Clean, transform or analyze large amounts of data using nothing but natural language. If it's supported by Python, it's supported by Luminal. Perform powerful cleaning operations, answer sophisticated questions and create beautiful visualizations. Don't waste your time manually processing data in Excel or writing a Python script to do the work. Let Luminal take care of it and save hours. At Luminal, we prioritize the protection of our customers' data and strictly comply with security regulations. Ensuring the security of customer information is core to our company culture.Starting Price: $16 per user per month -
38
NumPy
NumPy
Fast and versatile, the NumPy vectorization, indexing, and broadcasting concepts are the de-facto standards of array computing today. NumPy offers comprehensive mathematical functions, random number generators, linear algebra routines, Fourier transforms, and more. NumPy supports a wide range of hardware and computing platforms, and plays well with distributed, GPU, and sparse array libraries. The core of NumPy is well-optimized C code. Enjoy the flexibility of Python with the speed of compiled code. NumPy’s high level syntax makes it accessible and productive for programmers from any background or experience level. NumPy brings the computational power of languages like C and Fortran to Python, a language much easier to learn and use. With this power comes simplicity: a solution in NumPy is often clear and elegant.Starting Price: Free -
39
Matplotlib
Matplotlib
Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python. Matplotlib makes easy things easy and hard things possible. A large number of third party packages extend and build on Matplotlib functionality, including several higher-level plotting interfaces (seaborn, HoloViews, ggplot, ...), and a projection and mapping toolkit (Cartopy).Starting Price: Free -
40
gevent
gevent
gevent is a coroutine-based Python networking library that uses greenlet to provide a high-level synchronous API on top of the libev or libuv event loop. gevent is inspired by eventlet but features a more consistent API, simpler implementation and better performance. Read why others use gevent and check out the list of the open source projects based on gevent.Starting Price: Free -
41
IronPython
IronPython
IronPython is an open-source implementation of the Python programming language which is tightly integrated with .NET. IronPython can use .NET and Python libraries, and other .NET languages can use Python code just as easily. Experience a more interactive .NET and Python development experience with Python Tools for Visual Studio. IronPython is an excellent addition to .NET, providing Python developers with the power of the .NET. Existing .NET developers can also use IronPython as a fast and expressive scripting language for embedding, testing, or writing a new application from scratch. The CLR is a great platform for creating programming languages, and the DLR makes it all the better for dynamic languages. Also, the .NET (base class library, presentation foundation, etc.) gives developers an amazing amount of functionality and power. IronPython uses Python syntax and standard libraries and so your Python code will need to be updated accordingly.Starting Price: Free -
42
waiting
Python Software Foundation
waiting is a small library for waiting for stuff to happen. It basically waits for a function to return True, in various modes. Waiting is compatible with flux for simulated timelines. The most basic usage is when you have a function you want to wait for. Waiting forever is very simple. If your predicate returns a value, it will be returned as the result of wait(). A timeout parameter can also be specified. When a timeout expires without the predicate being fulfilled, an exception is thrown. Sleeping polls the predicate at a certain interval (by default 1 second). The interval can be changed with the sleep_seconds argument. When waiting for multiple predicates, waiting provides two simple facilities to help aggregate them, any and all. They resemble Python’s built-in any() and all(), except that they don’t call a predicate once it has been satisfied (this is useful when the predicates are inefficient and take time to complete).Starting Price: Free -
43
Python
Python
The core of extensible programming is defining functions. Python allows mandatory and optional arguments, keyword arguments, and even arbitrary argument lists. Whether you're new to programming or an experienced developer, it's easy to learn and use Python. Python can be easy to pick up whether you're a first-time programmer or you're experienced with other languages. The following pages are a useful first step to get on your way to writing programs with Python! The community hosts conferences and meetups to collaborate on code, and much more. Python's documentation will help you along the way, and the mailing lists will keep you in touch. The Python Package Index (PyPI) hosts thousands of third-party modules for Python. Both Python's standard library and the community-contributed modules allow for endless possibilities.Starting Price: Free -
44
statsmodels
statsmodels
statsmodels is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests and statistical data exploration. An extensive list of result statistics is available for each estimator. The results are tested against existing statistical packages to ensure that they are correct. The package is released under the open-source Modified BSD (3-clause) license. statsmodels supports specifying models using R-style formulas and pandas DataFrames. Have a look at dir(results) to see available results. Attributes are described in results.__doc__ and results methods have their own docstrings. You can also use numpy arrays instead of formulas. The easiest way to install statsmodels is to install it as part of the Anaconda distribution, a cross-platform distribution for data analysis and scientific computing. This is the recommended installation method for most users.Starting Price: Free -
45
Bokeh
Bokeh
Bokeh makes it simple to create common plots, but also can handle custom or specialized use-cases. Plots, dashboards, and apps can be published in web pages or Jupyter notebooks. Python has an incredible ecosystem of powerful analytics tools: NumPy, Scipy, Pandas, Dask, Scikit-Learn, OpenCV, and more. With a wide array of widgets, plot tools, and UI events that can trigger real Python callbacks, the Bokeh server is the bridge that lets you connect these tools to rich, interactive visualizations in the browser. Microscopium is a project maintained by researchers at Monash University. It allows researchers to discover new gene or drug functions by exploring large image datasets with Bokeh’s interactive tools. Panel is a tool for polished data presentation that utilizes the Bokeh server. It is created and supported by Anaconda. Panel makes it simple to create custom interactive web apps and dashboards by connecting user-defined widgets to plots, images, tables, or text.Starting Price: Free -
46
pdf2docx
Artifex
pdf2docx is a Python library that uses PyMuPDF to extract data from PDF files, parse their layouts according to rules, and generate corresponding .docx files via python-docx. It supports conversion of text, images, tables, and other structural elements; it includes tools to extract tables, handle formatting, and preserve layout as much as possible. It offers both a command-line interface and a graphical user interface. The internal architecture is modular; it includes packages for handling pages, layout, tables, images, shape paths, text spans/blocks, and other elements, enabling fine control over how PDF content is mapped into Word documents. Developers can use the API for batch conversions or integrate it into workflows; there's documentation on installation (from PyPI or source), usage, and technical details of layout-parsing, table extraction, and internal modules. The project is open source, hosted on GitHub, and made available under its license with no warranty.Starting Price: Free -
47
Python RPA
Python RPA
Powerful and affordable RPA platform. Use the flexibility of Python, the convenience of low code, and the potential of AI for intelligent automation. Python RPA is an easy-to-use platform for developing and managing bots in Python. The capabilities of Python make the platform an effective and powerful tool for automating business processes. Enterprise-grade orchestrator for managing Python scripts and low-code projects. Basic Python knowledge is enough to start your automation journey. Stay ahead with instant notifications and a status management board. Uninterrupted flow of process execution, keeping things running smoothly. Ensure secured and managed user access. Keep your credentials secured and ensure activities are being logged. Use any library or framework for creating your project. Develop your Python automation in any open-source Python development environment.Starting Price: $275 per month -
48
gTTS
gTTS
gTTS (Google Text-to-Speech), a Python library and CLI tool to interface with Google Translate's text-to-speech API. Write spoken mp3 data to a file, a file-like object (bytestring) for further audio manipulation, or stdout. Or simply pre-generate Google Translate TTS request URLs to feed to an external program. Customizable speech-specific sentence tokenizer that allows for unlimited lengths of text to be read, all while keeping proper intonation, abbreviations, decimals and more. Customizable text pre-processors which can, for example, provide pronunciation corrections.Starting Price: Free -
49
E-ICEBLUE
E-ICEBLUE
E-iceblue provides a comprehensive suite of professional development libraries and APIs designed to enable developers to create, read, write, edit, convert, print, manipulate and view a wide range of document formats programmatically across multiple programming environments without relying on external applications like Microsoft Office or Adobe Acrobat. Its product range includes Spire.Office and individual components for .NET platforms (such as Spire.Doc, Spire.XLS, Spire.Presentation, Spire.PDF, Spire.Barcode, Spire.Email and Spire.OCR) that handle Word, Excel, PowerPoint, PDF, barcode generation and email operations in C#, VB.NET, ASP.NET, .NET Core, Xamarin and WPF applications, plus viewer libraries for embedded document display. E-iceblue also offers equivalent APIs for Java, C++, Python and JavaScript, as well as mobile and cloud libraries (including Spire.Cloud.Office with HTML5 browser support for Word and Excel), supporting document processing tasks. -
50
Seaborn
Seaborn
Seaborn is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics. For a brief introduction to the ideas behind the library, you can read the introductory notes or the paper. Visit the installation page to see how you can download the package and get started with it. You can browse the example gallery to see some of the things that you can do with seaborn, and then check out the tutorials or API reference to find out how. To see the code or report a bug, please visit the GitHub repository. General support questions are most at home on StackOverflow, which has a dedicated channel for seaborn.