DocScribe is a medical question-and-answer chatbot that revolutionizes how we interact with medical data. DocScribe provides quick, accurate responses to both general medical inquiries and patient-specific questions. Our main goal is to enhance medical report accessibility and comprehension. DocScribe achieves this through:
DocScribe's architecture facilitates a seamless interaction between users and medical data. It incorporates:
Our project leverages a diverse range of data sources to train our model, including:
Dataset | Instruction | Input | Output |
---|---|---|---|
Wikidoc | Answer this question truthfully | Can you provide an overview of the lung's squamous cell carcinoma? | Squamous cell carcinoma of the lung may be classified according to the WHO histological classification system into 4 main types: papillary, clear cell, small cell, and basaloid. |
WikiPatient | Answer this question truthfully | When to seek urgent medical care when I have Alstrom syndrome? | Call your healthcare provider if you or your child have symptoms of diabetes such as increased thirst and urination. Seek medical attention promptly if you think that your child cannot see or hear normally. |
MT Samples | Based on the given medical transcript generate prompt and answer to train LLM | What was the patient's preoperative diagnosis? | The patient's preoperative diagnosis was prostate cancer. |
We opted for the Vicuna-13B model, fine-tuned with LoRA, PEFT, and bitsandbytes. Our approach was validated through rigorous testing and has shown promising results in interpreting medical data.
DocScribe has shown remarkable proficiency in processing medical queries and summarizing patient reports. Future directions include expanding the model's training on medical corpuses, incorporating medical image analysis, and exploring its application in clinical research.
git clone https://github.com/kmnis/DocScribe.git
cd DocScribe
pip install -r requirements.txt
# Start the jupyter server by running
jupyter notebook
# Open your browser and open http://localhost:8888/inference and open a notebook