The Postman Open Technologies team maintains a project dedicated to mining and extracting knowledge from the API universe. There is a wealth of knowledge present in the OpenAPI, Swagger, Postman Collections, Spectral, and other API artifacts available on GitHub, but also on the open web.
To expand the current knowledge base, we want to develop an open-source approach for finding Swagger and OpenAPI definitions on the open web, crawling web pages looking for API definitions, validating them, and then consuming and indexing them as part of an ongoing search.
There are already known sources like GitHub, SwaggerHub, and APIs.guru to find OpenAPI/Swagger specifications but we want to focus on extracting API definitions from lesser-known sources and presenting them to the world. The dataset can later be used to analyze the specifications to obtain insights into some of the practices common among APIs.
Open API Web Search project is all about providing a simple way for developers to find existing Swagger and OpenAPI definitions on the open web—mostly from lesser-known sources. The ultimate goal of this project is to build a search engine for APIs where API consumers and producers can discover APIs using keywords that abstract away the complexity of searching the web for specific terms, helping identify APIs in a sea of web pages. Learn how Open API Web Search can help unleash the power of open APIs.
The goal of this project can be achieved with the following milestones:
Fork and/or clone the OpenAPI Web Search repo and change directory into it:
git clone https://github.com/<username>/openapi-web-search.git
cd openapi-web-search/src/server
Install dependencies via yarn:
yarn install
Start local server:
yarn run dev
After launching the local server, we can use Postman to begin sending http requests to the specified endpoints. I've included a postman collection in root of the project to get you started:
Run the following endpoints in the specified order after configuring Postman with the collection above:
1. http://localhost:1337/api/v1/run/crawler?latest=true
2. http://localhost:1337/api/v1/process/index-files?skip=0&limit=20&sort=aes
3. http://localhost:1337/api/v1/indexing
4. http://localhost:1337/api/v1/search?q=<query>
Explanation: