ArXiv Digest and Personalized Recommendations using Large Language Models.
This repo aims to provide a better daily digest for newly published arXiv papers based on your own research interests and natural-language descriptions, using relevancy ratings from GPT.
You can try it out on Hugging Face using your own OpenAI API key.
You can also create a daily subscription pipeline to email you the results.
Staying up to date on arXiv papers can take a considerable amount of time, with on the order of hundreds of new papers each day to filter through. There is an official daily digest service, however large categories like cs.AI still have 50-100 papers a day. Determining if these papers are relevant and important to you means reading through the title and abstract, which is time-consuming.
This repository offers a method to curate a daily digest, sorted by relevance, using large language models. These models are conditioned based on your personal research interests, which are described in natural language.
config.yaml
with an arXiv Subject, some set of Categories, and a natural language statement about the type of papers you are interested in.gpt-3.5-turbo-16k
.We provide a demo at https://huggingface.co/spaces/AutoLLM/ArxivDigest. Simply enter your OpenAI API key and then fill in the configuration on the right. Note that we do not store your key.
You can also send yourself an email of the digest by creating a SendGrid account and API key.
The recommended way to get started using this repository is to:
config.yaml
and merge the changes into your main branch.OPENAI_API_KEY
From OpenAISENDGRID_API_KEY
From SendGridFROM_EMAIL
This value must match the email you used to create the SendGrid API Key.TO_EMAIL
See Advanced Usage for more details, including step-by-step images, further customization, and alternate usage.
To locally run the same UI as the Huggign Face space:
src/requirements.txt
as well as gradio
.python src/app.py
and go to the local URL. From there you will be able to preview the papers from today, as well as the generated digests..env
file for your secrets, you can copy .env.template
to .env
and then set the environment variables in .env
..env.template
with your keys or your email address, since .template.env
is tracked by git and editing it might cause you to commit your secrets.WARNING: Do not edit and commit your
.env.template
with your personal keys or email address! Doing so may expose these to the world!
You may (and are encourage to) modify the code in this repository to suit your personal needs. If you think your modifications would be in any way useful to others, please submit a pull request.
These types of modifications include things like changes to the prompt, different language models, or additional ways for the digest is delivered to you.