Re-Search is an academic search engine that can be used to search open-source datasets and academic research papers similar to Google Scholar. This was built as part of a coursework for Edinburgh University's Text Technologies for Data Science course.
? This project also received the Best Project Award among 250 students / 50+ groups for the same course.
Re-Search allows you to search for publicly available datasets and research papers using three different ranking algorithms - TF-IDF, BM25, and ScaNN. It also supports Author Search (wherever the datapoint allows it), Phrase Search, and Proximity Search along with its default search type.
Re-Search uses React for the frontend with the backend supported by Flask and stores data in a MongoDB database. It has a separate microservice for the ScaNN algorithm as it only runs on Linux servers. We use Redis for the distributed cache but we also provide an LRU Cache implementation that works without distributed caching. We provided load balancing and horizontal scalability with Google Cloud Platform's App Engine.
npm install
and pip install -r requirements.txt
npm start
and flask run
respectively.sudo service mongod start
.You can run the backend files in production with waitress
using the prod_*.py
files, however, currently, the app isn't configured to provide scalability or transfer data securely.
We'll soon publish this project on SMASH Research Group's servers in the summer. Until then you can request database access by contacting Leo/Yuto.
Mozilla Public License 2.0 ©️ The Re-Search Team