MedicalGPT Files

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training

This is an exact mirror of the MedicalGPT project, hosted at https://github.com/shibing624/MedicalGPT. SourceForge is not affiliated with MedicalGPT. For more information, see the SourceForge Open Source Mirror Directory.

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
README.md	2024-08-02	1.6 kB	0
v2.2.0 source code.tar.gz	2024-08-02	9.0 MB	0
v2.2.0 source code.zip	2024-08-02	9.0 MB	0
Totals: 3 Items		18.0 MB	0

v2.2.0

支持了角色扮演模型训练
新增了医患对话SFT数据生成脚本role_play_data

造角色扮演对话

本数据集使用OpenAI API接口生成，流程：

种子特征集和基础设定：
手工编写的种子集包含基本角色特征。
LLM从这个种子集生成角色的基础设定。
角色设定的进化：
第二个种子集包含指导角色设定进化的指令Prompt。
这些进化角色的指令Prompt被放到一个指令池中。基于这些进化Prompt，LLM对基础设定实施进化。
反馈循环：
由人类评估者和GPT-4组成的混合评价系统。此系统对进化后的设定给出反馈。
反馈用于迭代更新种子集。如此迭代，我们最终得到一个细致的角色设定数据集。
角色扮演和对话生成：
使用self-instruction框架基于角色设定生成角色的对话数据。
生成角色设定，分别生成护士角色和患者角色

:::bash cd role_play_data

python role_generate.py
生成医患之间的多轮对话 LLM选择：分别用gpt-4o的api和豆包的doubao-character-pro-32k的api生成对话

:::bash python roleplay_data_generate_gpt4.py

python roleplay_data_generate_doubao.py

What's Changed

add full_train.py and run_full_train.sh by @ZhuangXialie in https://github.com/shibing624/MedicalGPT/pull/394

Full Changelog: https://github.com/shibing624/MedicalGPT/compare/2.1.0...2.2.0

Source: README.md, updated 2024-08-02

Other Useful Business Software

Our Free Plans just got better! | Auth0 Icon

Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now

Automate contact and company data extraction Icon

Automate contact and company data extraction

Build lead generation pipelines that pull emails, phone numbers, and company details from directories, maps, social platforms. Full API access.

Generate leads at scale without building or maintaining scrapers. Use 10,000+ ready-made tools that handle authentication, pagination, and anti-bot protection. Pull data from business directories, social profiles, and public sources, then export to your CRM or database via API. Schedule recurring extractions, enrich existing datasets, and integrate with your workflows.

Explore Apify Store

Recommended Projects

PaLM + RLHF - Pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback)
HY-Motion 1.0
HY-Motion model for 3D character animation generation
ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Transformer Reinforcement Learning X
A repo for distributed training of language models with Reinforcement
StyleTTS 2
Towards Human-Level Text-to-Speech through Style Diffusion