Showing 43 open source projects for "q learning"

  • 1
    The Arcade Learning Environment

    The Arcade Learning Environment (ALE) -- a platform for AI research

    ...This environment suite has been central to many RL breakthroughs, including value-based agents, deep Q-networks, and general-agent benchmarking, because the Atari games span many genres and present diverse learning challenges (pixels, actions, delayed rewards). The repository supports multi-platform builds (Linux, macOS, Windows), vectorized execution of games, Python bindings, Gymnasium registration, and a large set of game ROMs bundled for convenience.
    Downloads: 0 This Week
  • 2
    RL with PyTorch

    Clean, Robust, and Unified PyTorch implementation

    RL with PyTorch is a research-oriented repository that provides implementations of deep reinforcement learning algorithms using the PyTorch framework. The project focuses on helping developers and researchers understand reinforcement learning methods by providing clean and reproducible implementations of well-known algorithms. It includes code for popular deep reinforcement learning techniques such as Deep Q-Networks, policy gradient methods, actor-critic architectures, and other modern RL approaches. ...
    Downloads: 0 This Week
  • 3
    Adapters

    A Unified Library for Parameter-Efficient Learning

    Adapters is an add-on library to HuggingFace's Transformers, integrating 10+ adapter methods into 20+ state-of-the-art Transformer models with minimal coding overhead for training and inference. Adapters provides a unified interface for efficient fine-tuning and modular transfer learning. It supports features such as full-precision or quantized training (e.g. Q-LoRA, Q-Bottleneck Adapters, or Q-PrefixTuning), adapter merging via task arithmetic, and the composition of multiple adapters via composition blocks, enabling advanced research in parameter-efficient transfer learning for NLP tasks.
    Downloads: 0 This Week
  • 4
    Deep-Learning-Interview-Book

    Interview guide for machine learning, mathematics, and deep learning

    Deep-Learning-Interview-Book collects structured notes, Q&A, and concept summaries tailored to deep-learning interviews, turning scattered study into a coherent playbook. It spans the core math (linear algebra, probability, optimization) and the practitioner topics candidates actually face, like CNNs, RNNs/Transformers, attention, regularization, and training tricks.
    Downloads: 0 This Week
  • 5
    DeepTutor

    AI-Powered Personalized Learning Assistant

    DeepTutor is an AI-powered tutoring and learning assistant framework designed to automatically teach, explain, and reinforce academic or technical concepts in depth according to a learner’s specific needs. It goes beyond simple Q&A by constructing multi-stage educational narratives, breaking down complex topics into sequenced “lesson steps,” and offering prompts, examples, and exercises that build on each other in a logical curriculum.
    Downloads: 0 This Week
  • 6
    EduCDM

    The Model Zoo of cognitive diagnosis models

    The Model Zoo of Cognitive Diagnosis Models, including classic Item Response Theory (IRT), Multidimensional Item Response Theory (MIRT), and the Deterministic Input, Noisy "And" model (DINA), as well as the more advanced Fuzzy Cognitive Diagnosis Framework (FuzzyCDF), Neural Cognitive Diagnosis Model (NCDM), Item Response Ranking framework (IRR), Incremental Cognitive Diagnosis (ICD), and the Knowledge-association based extension of NeuralCD (KaNCD). A cognitive diagnosis model (CDM) for intelligent educational systems infers students' knowledge states from their learning behaviors (especially exercise response logs). Typically, the input of a CDM is the students' response logs on items (i.e., exercises/questions) together with the Q-matrix, which denotes the correlation between items and knowledge concepts (skills). The output is the diagnosed student knowledge states, such as students' abilities and their proficiency on each knowledge concept.
    Downloads: 0 This Week
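To make the inputs described for EduCDM concrete, here is a minimal sketch of a Q-matrix and of the two-parameter logistic item response function commonly used in item response theory. It does not use EduCDM's API; the names `irt_2pl`, `q_matrix`, and `concepts_for_item` and the example values are hypothetical.

```python
import math


def irt_2pl(theta, a, b):
    """Two-parameter logistic IRT: P(correct) = sigmoid(a * (theta - b)).

    theta: student ability, a: item discrimination, b: item difficulty.
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical Q-matrix: one row per item, one column per knowledge concept.
# A 1 means the item exercises that concept.
q_matrix = [
    [1, 0, 0],  # item 0 -> concept 0
    [0, 1, 1],  # item 1 -> concepts 1 and 2
    [1, 0, 1],  # item 2 -> concepts 0 and 2
]


def concepts_for_item(q_matrix, item):
    """Return the indices of the knowledge concepts linked to an item."""
    return [k for k, flag in enumerate(q_matrix[item]) if flag]


p_average = irt_2pl(theta=0.0, a=1.0, b=0.0)  # ability equals difficulty
p_strong = irt_2pl(theta=2.0, a=1.0, b=0.0)   # ability well above difficulty
item_1_concepts = concepts_for_item(q_matrix, 1)
```

When ability equals difficulty the predicted probability of a correct response is exactly one half, and it rises as ability increases. A full CDM would fit `theta`, `a`, and `b` from response logs rather than set them by hand.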
  • 7
    DeepPavlov

    A library for deep learning end-to-end dialog systems and chatbots

    ...It has comprehensive and flexible tools that let developers and NLP researchers create production-ready conversational skills and complex multi-skill conversational assistants. Use BERT and other state-of-the-art deep learning models to solve classification, NER, Q&A and other NLP tasks. DeepPavlov Agent allows building industrial solutions with multi-skill integration via API services.
    Downloads: 0 This Week
  • 8
    PennyLane

    A cross-platform Python library for differentiable programming

    ...You can combine multiple quantum devices with classical processing arbitrarily! Support for hybrid quantum and classical models, and compatible with existing machine learning libraries. Quantum circuits can be set up to interface with either NumPy, PyTorch, JAX, or TensorFlow, allowing hybrid CPU-GPU-QPU computations. The same quantum circuit model can be run on different devices. Install plugins to run your computational circuits on more devices, including Strawberry Fields, Amazon Braket, Qiskit and IBM Q, Google Cirq, Rigetti Forest, and the Microsoft QDK.
    Downloads: 0 This Week
  • 9
    All-in-RAG

    Large Model Application Development Practice 1

    All-in-RAG is an open-source educational project designed to teach developers how to build applications using retrieval-augmented generation techniques. The repository provides a structured learning path that covers both theoretical foundations and practical implementation steps for RAG systems. It explains the full development pipeline required to create knowledge-aware AI assistants, including data preparation, document indexing, vector embedding generation, and retrieval strategies. The...
    Downloads: 0 This Week
  • 10
    spider_collection

    Collection of Python web scraping scripts for data extraction tasks

    spider_collection is a collection of Python web crawler scripts created primarily for experimentation, learning, and practical scraping tasks. spider_collection gathers multiple independent spiders designed to collect data from different platforms and services, demonstrating a variety of scraping techniques and workflows. These crawlers make use of common Python scraping tools such as requests, parsel, BeautifulSoup, and the Scrapy framework to extract structured information from web pages....
    Downloads: 1 This Week
  • 11
    Krypt

    Build a Web 3.0 React Solidity blockchain application

    ...Collaboration and Q&A are encouraged through the repo’s community channels, making it a practical launchpad for first-time Web3 builders.
    Downloads: 0 This Week
  • 12
    GenAI Agents

    Implementations for various Generative AI Agent techniques

    GenAI Agents is a large, tutorial-driven repository that teaches you how to design, build, and experiment with generative AI agents. It spans a spectrum from simple conversational bots and basic question-answering agents to complex multi-agent systems that coordinate on research, education, business workflows, and creative tasks. The implementations leverage modern frameworks such as LangChain, LangGraph, AutoGen, PydanticAI, CrewAI, and more, showing how each can be wired into realistic...
    Downloads: 0 This Week
  • 13
    OpenCV

    Open Source Computer Vision Library

    The Open Source Computer Vision Library has more than 2,500 algorithms, extensive documentation, and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, and iOS, and in your browser through JavaScript.
    Languages: C++, Python, Julia, JavaScript
    Homepage: https://opencv.org
    Q&A forum: https://forum.opencv.org/
    Documentation: https://docs.opencv.org
    Source code: https://github.com/opencv
    Please pay special attention to our tutorials!...
    Downloads: 2,968 This Week
  • 14
    GLM-4-32B-0414

    Open Multilingual Multimodal Chat LMs

    GLM-4-32B-0414 is a powerful open-source large language model featuring 32 billion parameters, designed to deliver performance comparable to leading models like OpenAI’s GPT series. It supports multilingual and multimodal chat capabilities with an extensive 32K token context length, making it ideal for dialogue, reasoning, and complex task completion. The model is pre-trained on 15 trillion tokens of high-quality data, including substantial synthetic reasoning datasets, and further enhanced...
    Downloads: 1 This Week
  • 15
    EarQuiz Frequencies

    Software for technical ear training on equalization

    EarQuiz Frequencies is software for ear training on equalization. Its goal is to help musicians, audio professionals, hobbyists, and students learn how to hear frequency bands. It is available for Windows 10 and 11 (x64), macOS 11 or higher (both Intel and Apple Silicon), and Linux. This application is based on (and deeply inspired by) the world-renowned Golden Ears method of David Moulton, half of whose course is dedicated to building this essential critical listening skill. The overall...
    Downloads: 3 This Week
  • 16
    Parallel and Distributed Process System

    OmniSim simulates parallel and distributed processing systems

    Parallel and Distributed Process OmniSim (PDP-OmniSim). Computational neuroscience: large-scale neural population dynamics, brain-inspired computing architectures, and neuro-symbolic AI systems. Scientific overview: PDP-OmniSim is an advanced computational framework for simulating parallel and distributed processing systems, with cutting-edge applications in computational neuroscience, distributed computing, and complex systems modeling. The framework provides researchers with robust tools for...
    Downloads: 2 This Week
  • 17
    minimalRL-pytorch

    Implementations of basic RL algorithms with minimal lines of code

    ...The repository includes examples of widely used reinforcement learning methods such as REINFORCE, Deep Q-Networks, Proximal Policy Optimization, and Actor-Critic architectures. Most experiments are designed to run quickly using the CartPole environment so that users can focus on understanding algorithm logic rather than computational infrastructure.
    Downloads: 0 This Week
  • 18
    Reinforcement-learning

    Implementation of Reinforcement Learning Algorithms. Python, OpenAI

    Reinforcement-learning is a widely used educational repository that provides implementations, exercises, and solutions for a broad range of reinforcement learning algorithms, designed to complement foundational texts and courses in the field. The project collects popular approaches such as dynamic programming, Monte Carlo methods, temporal difference learning, Q-learning, SARSA, deep Q-networks, and policy gradient techniques, often demonstrated with Python and OpenAI Gym environments so users can experiment with agents learning in simulated tasks. ...
    Downloads: 0 This Week
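As an illustration of the Q-learning approach named in this entry, the following is a minimal tabular sketch on a hypothetical five-state chain. It is not taken from the repository; the environment, the function name `q_learning_chain`, and all hyperparameter values are assumptions chosen to keep the example dependency-free.

```python
import random


def q_learning_chain(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
                     epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain.

    States are 0..n_states-1. Action 0 moves left, action 1 moves right.
    Reaching the rightmost state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy action selection, breaking ties at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # Off-policy TD target: bootstrap from the best next action.
            best_next = 0.0 if next_state == goal else max(q[next_state])
            td_error = reward + gamma * best_next - q[state][action]
            q[state][action] += alpha * td_error
            state = next_state
    return q


q_table = q_learning_chain()
# Greedy action in every non-terminal state (1 means "move right").
greedy_policy = [0 if left > right else 1 for left, right in q_table[:-1]]
```

SARSA differs only in the target: it bootstraps from the action the agent actually takes next instead of the maximum over next actions.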
  • 19
    Reinforcement Learning Methods

    Simple reinforcement learning tutorials

    Reinforcement-Learning-with-TensorFlow is an educational repository that walks through key reinforcement learning algorithms implemented in TensorFlow. It provides clear code examples for foundational techniques like Q-learning, policy gradients, deep Q-networks, actor-critic methods, and value function approximation within familiar simulation environments.
    Downloads: 0 This Week
  • 20
    TRFL

    TensorFlow Reinforcement Learning

    ...TRFL supports both CPU and GPU TensorFlow environments, though TensorFlow itself must be installed separately. It exposes clean, modular APIs for various RL methods including Q-learning, policy gradient, and actor-critic algorithms, among others. Each function returns not only the computed loss tensor but also a detailed structure containing auxiliary information like TD errors and targets.
    Downloads: 0 This Week
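The "loss plus auxiliary structure" pattern described for TRFL can be sketched in plain Python. This is not TRFL's code and does not call TensorFlow; the names `LossOutput`, `QExtra`, and `qlearning_loss`, the scalar inputs, and the half-squared-error loss are assumptions made for illustration.

```python
from collections import namedtuple

# Hypothetical containers, named for this sketch only.
QExtra = namedtuple("QExtra", ["target", "td_error"])
LossOutput = namedtuple("LossOutput", ["loss", "extra"])


def qlearning_loss(q_tm1, a_tm1, r_t, discount_t, q_t):
    """One-step Q-learning loss for a single transition.

    q_tm1: action values in the previous state.
    a_tm1: index of the action that was taken.
    r_t: reward received, discount_t: discount applied to the bootstrap.
    q_t: action values in the next state.
    """
    target = r_t + discount_t * max(q_t)
    td_error = target - q_tm1[a_tm1]
    loss = 0.5 * td_error ** 2
    # Return the loss together with the intermediate quantities.
    return LossOutput(loss, QExtra(target, td_error))


out = qlearning_loss(q_tm1=[1.0, 2.0], a_tm1=1, r_t=1.0,
                     discount_t=0.5, q_t=[4.0, 0.0])
```

Returning the target and TD error alongside the loss lets callers log them or reuse them (for example as replay priorities) without recomputing.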
  • 21
    neurojs

    A JavaScript deep learning and reinforcement learning library

    ...The framework supports neural network architectures and reinforcement learning methods such as deep Q-networks and actor-critic algorithms. Several interactive demonstrations included with the project illustrate how neural networks can be used to train agents in simulated tasks, including a browser-based self-driving car example. These demos allow users to visualize how reinforcement learning agents improve their behavior over time as they receive rewards and update their neural networks.
    Downloads: 0 This Week
  • 22
    ...The goal of the whole network is to maintain an energy-neutral state and to maximize the effective data throughput of the network. Cooperative Q-learning is applied in this multi-agent environment, a solar-powered wireless sensor network, to keep harvested energy balanced across the whole clustered network.
    Downloads: 0 This Week
  • 23
    PaddlePaddle models

    Pre-trained and Reproduced Deep Learning Models

    Pre-trained and reproduced deep learning models: the official PaddlePaddle ("Flying Paddle") model library, which includes a variety of deep learning models from the academic frontier and models validated in industrial scenarios. The industrial-grade model library includes a large number of mainstream models refined through long industrial practice, as well as models that have won international competitions; it provides many scenarios for semantic understanding, image classification,...
    Downloads: 0 This Week
  • 24
    deep-q-learning

    Minimal Deep Q Learning (DQN & DDQN) implementations in Keras

    The deep-q-learning repository authored by keon provides a Python-based implementation of the Deep Q-Learning algorithm — a cornerstone method in reinforcement learning. It implements the core logic needed to train an agent using Q-learning with neural networks (i.e. approximating Q-values via deep nets), setting up environment interaction loops, experience replay, network updates, and policy behavior.
    Downloads: 0 This Week
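The experience replay and DQN/DDQN ideas mentioned in this entry can be sketched without a deep learning framework. The code below is not from the repository: `ReplayBuffer`, `dqn_target`, and `double_dqn_target` are hypothetical names, and plain Python lists stand in for the Q-value outputs of the online and target networks.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size store of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity, seed=None):
        self.memory = deque(maxlen=capacity)  # oldest transitions drop off first
        self.rng = random.Random(seed)

    def add(self, state, action, reward, next_state, done):
        self.memory.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling breaks the correlation between consecutive steps.
        return self.rng.sample(list(self.memory), batch_size)

    def __len__(self):
        return len(self.memory)


def dqn_target(reward, done, next_q_target, gamma=0.95):
    """DQN: bootstrap from the target network's own maximum."""
    return reward if done else reward + gamma * max(next_q_target)


def double_dqn_target(reward, done, next_q_online, next_q_target, gamma=0.95):
    """Double DQN: the online net picks the action, the target net scores it."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=next_q_online.__getitem__)
    return reward + gamma * next_q_target[best_action]


buffer = ReplayBuffer(capacity=3, seed=0)
for step in range(5):
    buffer.add(state=step, action=0, reward=1.0, next_state=step + 1, done=False)
batch = buffer.sample(2)

y_dqn = dqn_target(1.0, False, next_q_target=[0.5, 2.0], gamma=0.5)
y_ddqn = double_dqn_target(1.0, False, next_q_online=[3.0, 1.0],
                           next_q_target=[0.5, 2.0], gamma=0.5)
```

In the usage lines the two targets differ because Double DQN evaluates the action preferred by the online values rather than taking the target network's maximum, which is the mechanism intended to reduce overestimation.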
  • 25
    Deep Reinforcement Learning TensorFlow

    TensorFlow implementation of Deep Reinforcement Learning papers

    Deep Reinforcement Learning TensorFlow is a comprehensive TensorFlow codebase that implements several foundational deep reinforcement learning algorithms for educational and experimental use. The repository focuses on clarity and modularity so users can study how different RL approaches are built and compare their behavior across environments. It includes implementations of well-known algorithms such as Deep Q-Networks (DQN), policy gradients, and related variants, demonstrating how neural networks can be trained through interaction with simulated environments. ...
    Downloads: 0 This Week