Chinese-LLaMA-Alpaca 2 Files

Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project

This is an exact mirror of the Chinese-LLaMA-Alpaca 2 project, hosted at https://github.com/ymcui/Chinese-LLaMA-Alpaca-2. SourceForge is not affiliated with Chinese-LLaMA-Alpaca 2.


Name	Modified	Size	Downloads / Week
README.md	2023-10-26	1.8 kB	0
Zhong Wen Yang Tuo Da Mo Xing Er Qi v3.2 source code.tar.gz	2023-10-26	8.4 MB	0
Zhong Wen Yang Tuo Da Mo Xing Er Qi v3.2 source code.zip	2023-10-26	8.4 MB	0
Totals: 3 Items		16.8 MB	0

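This update introduces the small-parameter base and chat models Chinese-LLaMA-2-1.3B and Chinese-Alpaca-2-1.3B, along with support for the speculative sampling decoding strategy.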

🚀 Chinese-LLaMA-2-1.3B, Chinese-Alpaca-2-1.3B, and speculative sampling decoding

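Released 4-layer, small-parameter Chinese LLaMA/Alpaca models. They were trained on the same amount of data as the larger models, for both Chinese pre-training (Chinese-LLaMA-2-1.3B) and instruction fine-tuning (Chinese-Alpaca-2-1.3B).
Speculative sampling is a decoding acceleration strategy that uses a somewhat weaker but faster small model to speed up inference of a large model. See the related paper for the theoretical details. This update implements speculative sampling decoding, so a small model can be used to accelerate a large model's decoding, and adds speculative sampling parameters to gradio_demo.py and inference_hf.py.
In tests on an A40-48G GPU, using Chinese-Alpaca-2-1.3B to accelerate inference of Chinese-Alpaca-2-7B/13B gave an average speedup of 1.3x to 1.6x. See the wiki for detailed usage and speedup results.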

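Note: the small-parameter models can be used directly for inference just like the 7B/13B models, but their results are worse than those of the larger models. They are recommended as draft models for speculative sampling to accelerate large-model inference.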
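
The speculative sampling idea described above can be illustrated with a minimal, self-contained sketch. This is not the project's implementation in gradio_demo.py or inference_hf.py; it assumes the standard accept/reject scheme from the speculative sampling literature, and the names `target_model`, `draft_model`, `speculative_step`, and `generate` are hypothetical toy stand-ins that return probability lists over a 4-token vocabulary instead of running real LLaMA models.

```python
import random

VOCAB = 4  # toy vocabulary size (assumption for illustration only)


def target_model(ctx):
    """Hypothetical 'large' model: strongly prefers (last token + 1) mod VOCAB."""
    last = ctx[-1] if ctx else 0
    probs = [0.1] * VOCAB
    probs[(last + 1) % VOCAB] = 0.7
    return probs


def draft_model(ctx):
    """Hypothetical 'small' model: same preference, but less confident."""
    last = ctx[-1] if ctx else 0
    probs = [0.2] * VOCAB
    probs[(last + 1) % VOCAB] = 0.4
    return probs


def sample(probs, rng):
    """Draw a token index from a discrete distribution."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def speculative_step(target, draft, context, k, rng):
    """Draft proposes k tokens; target verifies them. Returns 1 to k+1 tokens."""
    # 1) The draft model proposes k tokens autoregressively.
    proposed, draft_probs = [], []
    ctx = list(context)
    for _ in range(k):
        q = draft(ctx)
        tok = sample(q, rng)
        proposed.append(tok)
        draft_probs.append(q)
        ctx.append(tok)

    # 2) The target model checks each proposal in order. A real system would
    #    score all k positions in one batched forward pass.
    accepted = []
    ctx = list(context)
    for tok, q in zip(proposed, draft_probs):
        p = target(ctx)
        if rng.random() < min(1.0, p[tok] / q[tok]):
            accepted.append(tok)  # proposal accepted
            ctx.append(tok)
        else:
            # Rejected: resample from the residual max(0, p - q) and stop.
            residual = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
            accepted.append(sample(residual, rng))
            return accepted

    # 3) All k proposals accepted: take one extra token from the target.
    accepted.append(sample(target(ctx), rng))
    return accepted


def generate(target, draft, prompt, max_new_tokens, k=4, seed=0):
    """Repeat speculative steps until max_new_tokens tokens are produced."""
    rng = random.Random(seed)
    out = list(prompt)
    while len(out) - len(prompt) < max_new_tokens:
        out.extend(speculative_step(target, draft, out, k, rng))
    return out[: len(prompt) + max_new_tokens]


if __name__ == "__main__":
    print(generate(target_model, draft_model, prompt=[0], max_new_tokens=12))
```

The accept/reject rule is what keeps the output distribution equal to the target model's own distribution. The speedup comes from the target model verifying several draft tokens per step instead of generating them one at a time, which is why a fast draft model that agrees often with the large model works best.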

Other updates

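Added support for kbits training (#229)
Peft-related updates and fixes (#246, #251)
FAQ: added questions 12 and 13 (#249)
C-Eval: updated the prompt template (#255)
LongBench: updated the test results (#259)
LangChain: updated the hyperparameter settings in the examples (#271)
Fixed issues with quantized inference in the inference scripts (#302)
Adapted FlashAttention's inference optimizations, so FlashAttention can now be used to accelerate inference. See the wiki for usage (#367)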

Source: README.md, updated 2023-10-26

