v1.0.0
What's Changed
- Support vLLM as generation engine.
- Support custom flow.
- Support efficient memory sharing.
- Support CPU module.
- Add Llama2 DPO/OnlineDPO/GRPO example based on Megatron-LM.
- Add QWen2 DPO example based on DeepSpeed.
Full Changelog: v0.2.0...v1.0.0