medicalgpt online with Winfy
We have hosted the application medicalgpt in order to run this application in our online workstations with Wine or directly.
Quick description about medicalgpt:
MedicalGPT training medical GPT model with ChatGPT training pipeline, implementation of Pretraining, Supervised Finetuning, Reward Modeling and Reinforcement Learning. MedicalGPT trains large medical models, including secondary pre-training, supervised fine-tuning, reward modeling, and reinforcement learning training.Features:
- The first stage: PT (Continue PreTraining) incremental pre-training, pre-training the GPT model twice on massive domain document data to inject domain knowledge
- The second stage: SFT (Supervised Fine-tuning) has supervised fine-tuning, constructs an instruction fine-tuning data set, and performs instruction fine-tuning on the basis of the pre-training model to align instruction intentions
- The third stage: RM (Reward Model) reward model modeling, constructing a human preference ranking data set, training the reward model to align human preferences, mainly the "HHH" principle, specifically "helpful, honest, harmless"
- The fourth stage: RL (Reinforcement Learning) is based on human feedback reinforcement learning (RLHF), using the reward model to train the SFT model, and the generation model uses rewards or penalties to update its strategy in order to generate higher quality, more in line with human preferences
- We provide a simple Gradio-based interactive web interface
- After the training is complete, now we load the trained model to verify the effect of the model generating text
Programming Language: Python.
Categories:
Large Language Models (LLM)
©2024. Winfy. All Rights Reserved.
By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.