duckduckgo_search
v6.3.5
使用DuckDuckgo.com搜索引擎搜索單詞,文檔,圖像,視頻,新聞,地圖和文本翻譯。將文件和圖像下載到本地硬盤驅動器。
pip install - U duckduckgo_search
筆記
您可以安裝LXML以使用backend='html'
或backend='lite'
(尺寸≈12MB)使用text
函數pip install -U duckduckgo_search[lxml]
ddgs - - help
CLI示例:
# AI chat
ddgs chat
# text search
ddgs text - k "Assyrian siege of Jerusalem"
# find and download pdf files via proxy
ddgs text - k "Economics in one lesson filetype:pdf" - r wt - wt - m 50 - p https : // 1.2 . 3.4 : 1234 - d - dd economics_reading
# using Tor Browser as a proxy (`tb` is an alias for `socks5://127.0.0.1:9150`)
ddgs text - k "'The history of the Standard Oil Company' filetype:doc" - m 50 - d - p tb
# find and save to csv
ddgs text - k "'neuroscience exploring the brain' filetype:pdf" - m 70 - o neuroscience_list . csv
# don't verify SSL when making the request
ddgs text - k "Mississippi Burning" - v false
# find and download images
ddgs images - k "beware of false prophets" - r wt - wt - type photo - m 500 - d
# get news for the last day and save to json
ddgs news - k "sanctions" - m 100 - t d - o json
去上面
關鍵字示例 | 結果 |
---|---|
貓狗 | 關於貓或狗的結果 |
“貓和狗” | 確切術語“貓和狗”的結果。如果找不到結果,將顯示相關的結果。 |
貓 - 狗 | 結果較少 |
貓 +狗 | 結果更多的狗 |
CATS FILETYPE:PDF | 關於貓的PDF。支持的文件類型:PDF,DOC(X),XLS(X),PPT(X),HTML |
狗網站:example.com | example.com上關於狗的頁面 |
貓 - 站點:example.com | 關於貓的頁面,不包括example.com |
Intitle:狗 | 頁面標題包括“狗”一詞 |
Inurl:貓 | 頁面URL包括“貓”一詞 |
去上面
xa-ar for Arabia
xa-en for Arabia (en)
ar-es for Argentina
au-en for Australia
at-de for Austria
be-fr for Belgium (fr)
be-nl for Belgium (nl)
br-pt for Brazil
bg-bg for Bulgaria
ca-en for Canada
ca-fr for Canada (fr)
ct-ca for Catalan
cl-es for Chile
cn-zh for China
co-es for Colombia
hr-hr for Croatia
cz-cs for Czech Republic
dk-da for Denmark
ee-et for Estonia
fi-fi for Finland
fr-fr for France
de-de for Germany
gr-el for Greece
hk-tzh for Hong Kong
hu-hu for Hungary
in-en for India
id-id for Indonesia
id-en for Indonesia (en)
ie-en for Ireland
il-he for Israel
it-it for Italy
jp-jp for Japan
kr-kr for Korea
lv-lv for Latvia
lt-lt for Lithuania
xl-es for Latin America
my-ms for Malaysia
my-en for Malaysia (en)
mx-es for Mexico
nl-nl for Netherlands
nz-en for New Zealand
no-no for Norway
pe-es for Peru
ph-en for Philippines
ph-tl for Philippines (tl)
pl-pl for Poland
pt-pt for Portugal
ro-ro for Romania
ru-ru for Russia
sg-en for Singapore
sk-sk for Slovak Republic
sl-sl for Slovenia
za-en for South Africa
es-es for Spain
se-sv for Sweden
ch-de for Switzerland (de)
ch-fr for Switzerland (fr)
ch-it for Switzerland (it)
tw-tzh for Taiwan
th-th for Thailand
tr-tr for Turkey
ua-uk for Ukraine
uk-en for United Kingdom
us-en for United States
ue-es for United States (es)
ve-es for Venezuela
vn-vi for Vietnam
wt-wt for No region
去上面
DDGS和ASYNCDGS類用於從DuckDuckgo.com檢索搜索結果。要使用AsyncDDGS類,您可以使用Python的Asyncio庫執行異步操作。為了初始化DDGS或AsyncDDGS類的實例,您可以提供以下可選參數:
class DDGS :
"""DuckDuckgo_search class to get search results from duckduckgo.com
Args:
headers (dict, optional): Dictionary of headers for the HTTP client. Defaults to None.
proxy (str, optional): proxy for the HTTP client, supports http/https/socks5 protocols.
example: "http://user:[email protected]:3128". Defaults to None.
timeout (int, optional): Timeout value for the HTTP client. Defaults to 10.
verify (bool): SSL verification when making the request. Defaults to True.
"""
這是初始化DDGS類的示例。
from duckduckgo_search import DDGS
results = DDGS (). text ( "python programming" , max_results = 5 )
print ( results )
這是初始化asyncddgs類的示例:
import asyncio
from duckduckgo_search import AsyncDDGS
async def aget_results ( word ):
results = await AsyncDDGS ( proxy = None ). atext ( word , max_results = 100 )
return results
async def main ():
words = [ "sun" , "earth" , "moon" ]
tasks = [ aget_results ( w ) for w in words ]
results = await asyncio . gather ( * tasks )
print ( results )
if __name__ == "__main__" :
asyncio . run ( main ())
去上面
軟件包支持HTTP/HTTPS/SOCKS代理。示例: http://user:[email protected]:3128
。使用旋轉代理。否則,使用每個DDGS或異步初始化的新代理。
1。最簡單的方法。啟動TOR瀏覽器
ddgs = DDGS ( proxy = "tb" , timeout = 20 ) # "tb" is an alias for "socks5://127.0.0.1:9150"
results = ddgs . text ( "something you need" , max_results = 50 )
2。使用任何代理服務器(示例帶有iproyal旋轉住宅代理)
ddgs = DDGS ( proxy = "socks5h://user:[email protected]:32325" , timeout = 20 )
results = ddgs . text ( "something you need" , max_results = 50 )
去上面
例外:
DuckDuckGoSearchException
:DuckDuckgo_Search錯誤的基本例外。RatelimitException
:從DuckDuckgoSearchException繼承,為超過API請求率限製而籌集。TimeoutException
:從DuckDuckGoSearchException繼承,為API請求超時提出。去上面
def chat ( self , keywords : str , model : str = "gpt-4o-mini" , timeout : int = 30 ) -> str :
"""Initiates a chat session with DuckDuckGo AI.
Args:
keywords (str): The initial message or question to send to the AI.
model (str): The model to use: "gpt-4o-mini", "claude-3-haiku", "llama-3.1-70b", "mixtral-8x7b".
Defaults to "gpt-4o-mini".
timeout (int): Timeout value for the HTTP client. Defaults to 30.
Returns:
str: The response from the AI.
"""
例子
results = DDGS (). chat ( "summarize Daniel Defoe's The Consolidator" , model = 'claude-3-haiku' )
# async
results = await AsyncDDGS (). achat ( 'describe the characteristic habits and behaviors of humans as a species' )
去上面
def text (
keywords : str ,
region : str = "wt-wt" ,
safesearch : str = "moderate" ,
timelimit : str | None = None ,
backend : str = "api" ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo text search generator. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: d, w, m, y. Defaults to None.
backend: api, html, lite. Defaults to api.
api - collect data from https://duckduckgo.com,
html - collect data from https://html.duckduckgo.com,
lite - collect data from https://lite.duckduckgo.com.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with search results.
"""
例子
results = DDGS (). text ( 'live free or die' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'y' , max_results = 10 )
# Searching for pdf files
results = DDGS (). text ( 'russia filetype:pdf' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'y' , max_results = 10 )
# async
results = await AsyncDDGS (). atext ( 'sun' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'y' , max_results = 10 )
print ( results )
[
{
"title" : "News, sport, celebrities and gossip | The Sun" ,
"href" : "https://www.thesun.co.uk/" ,
"body" : "Get the latest news, exclusives, sport, celebrities, showbiz, politics, business and lifestyle from The Sun" ,
}, ...
]
去上面
def answers ( keywords : str ) -> list [ dict [ str , str ]]:
"""DuckDuckGo instant answers. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query,
Returns:
List of dictionaries with instant answers results.
"""
例子
results = DDGS (). answers ( "sun" )
# async
results = await AsyncDDGS (). aanswers ( "sun" )
print ( results )
[
{
"icon" : None ,
"text" : "The Sun is the star at the center of the Solar System. It is a massive, nearly perfect sphere of hot plasma, heated to incandescence by nuclear fusion reactions in its core, radiating the energy from its surface mainly as visible light and infrared radiation with 10% at ultraviolet energies. It is by far the most important source of energy for life on Earth. The Sun has been an object of veneration in many cultures. It has been a central subject for astronomical research since antiquity. The Sun orbits the Galactic Center at a distance of 24,000 to 28,000 light-years. From Earth, it is 1 AU or about 8 light-minutes away. Its diameter is about 1,391,400 km, 109 times that of Earth. Its mass is about 330,000 times that of Earth, making up about 99.86% of the total mass of the Solar System. Roughly three-quarters of the Sun's mass consists of hydrogen; the rest is mostly helium, with much smaller quantities of heavier elements, including oxygen, carbon, neon, and iron." ,
"topic" : None ,
"url" : "https://en.wikipedia.org/wiki/Sun" ,
}, ...
]
去上面
def images (
keywords : str ,
region : str = "wt-wt" ,
safesearch : str = "moderate" ,
timelimit : str | None = None ,
size : str | None = None ,
color : str | None = None ,
type_image : str | None = None ,
layout : str | None = None ,
license_image : str | None = None ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo images search. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: Day, Week, Month, Year. Defaults to None.
size: Small, Medium, Large, Wallpaper. Defaults to None.
color: color, Monochrome, Red, Orange, Yellow, Green, Blue,
Purple, Pink, Brown, Black, Gray, Teal, White. Defaults to None.
type_image: photo, clipart, gif, transparent, line.
Defaults to None.
layout: Square, Tall, Wide. Defaults to None.
license_image: any (All Creative Commons), Public (PublicDomain),
Share (Free to Share and Use), ShareCommercially (Free to Share and Use Commercially),
Modify (Free to Modify, Share, and Use), ModifyCommercially (Free to Modify, Share, and
Use Commercially). Defaults to None.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with images search results.
"""
例子
results = DDGS (). images (
keywords = "butterfly" ,
region = "wt-wt" ,
safesearch = "off" ,
size = None ,
color = "Monochrome" ,
type_image = None ,
layout = None ,
license_image = None ,
max_results = 100 ,
)
# async
results = await AsyncDDGS (). aimages ( 'sun' , region = 'wt-wt' , safesearch = 'off' , max_results = 20 )
print ( images )
[
{
"title" : "File:The Sun by the Atmospheric Imaging Assembly of NASA's Solar ..." ,
"image" : "https://upload.wikimedia.org/wikipedia/commons/b/b4/The_Sun_by_the_Atmospheric_Imaging_Assembly_of_NASA's_Solar_Dynamics_Observatory_-_20100819.jpg" ,
"thumbnail" : "https://tse4.mm.bing.net/th?id=OIP.lNgpqGl16U0ft3rS8TdFcgEsEe&pid=Api" ,
"url" : "https://en.wikipedia.org/wiki/File:The_Sun_by_the_Atmospheric_Imaging_Assembly_of_NASA's_Solar_Dynamics_Observatory_-_20100819.jpg" ,
"height" : 3860 ,
"width" : 4044 ,
"source" : "Bing" ,
}, ...
]
去上面
def videos (
keywords : str ,
region : str = "wt-wt" ,
safesearch : str = "moderate" ,
timelimit : str | None = None ,
resolution : str | None = None ,
duration : str | None = None ,
license_videos : str | None = None ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo videos search. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: d, w, m. Defaults to None.
resolution: high, standart. Defaults to None.
duration: short, medium, long. Defaults to None.
license_videos: creativeCommon, youtube. Defaults to None.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with videos search results.
"""
例子
results = DDGS (). videos (
keywords = "cars" ,
region = "wt-wt" ,
safesearch = "off" ,
timelimit = "w" ,
resolution = "high" ,
duration = "medium" ,
max_results = 100 ,
)
# async
results = await AsyncDDGS (). avideos ( 'sun' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'y' , max_results = 10 )
print ( results )
[
{
"content" : "https://www.youtube.com/watch?v=6901-C73P3g" ,
"description" : "Watch the Best Scenes of popular Tamil Serial #Meena that airs on Sun TV. Watch all Sun TV serials immediately after the TV telecast on Sun NXT app. *Free for Indian Users only Download here: Android - http://bit.ly/SunNxtAdroid iOS: India - http://bit.ly/sunNXT Watch on the web - https://www.sunnxt.com/ Two close friends, Chidambaram ..." ,
"duration" : "8:22" ,
"embed_html" : '<iframe width="1280" height="720" src="https://www.youtube.com/embed/6901-C73P3g?autoplay=1" frameborder="0" allowfullscreen></iframe>' ,
"embed_url" : "https://www.youtube.com/embed/6901-C73P3g?autoplay=1" ,
"image_token" : "6c070b5f0e24e5972e360d02ddeb69856202f97718ea6c5d5710e4e472310fa3" ,
"images" : {
"large" : "https://tse4.mm.bing.net/th?id=OVF.JWBFKm1u%2fHd%2bz2e1GitsQw&pid=Api" ,
"medium" : "https://tse4.mm.bing.net/th?id=OVF.JWBFKm1u%2fHd%2bz2e1GitsQw&pid=Api" ,
"motion" : "" ,
"small" : "https://tse4.mm.bing.net/th?id=OVF.JWBFKm1u%2fHd%2bz2e1GitsQw&pid=Api" ,
},
"provider" : "Bing" ,
"published" : "2024-07-03T05:30:03.0000000" ,
"publisher" : "YouTube" ,
"statistics" : { "viewCount" : 29059 },
"title" : "Meena - Best Scenes | 02 July 2024 | Tamil Serial | Sun TV" ,
"uploader" : "Sun TV" ,
}, ...
]
去上面
def news (
keywords : str ,
region : str = "wt-wt" ,
safesearch : str = "moderate" ,
timelimit : str | None = None ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo news search. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: d, w, m. Defaults to None.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with news search results.
"""
例子
results = DDGS (). news ( keywords = "sun" , region = "wt-wt" , safesearch = "off" , timelimit = "m" , max_results = 20 )
# async
results = await AsyncDDGS (). anews ( 'sun' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'd' , max_results = 10 )
print ( results )
[
{
"date" : "2024-07-03T16:25:22+00:00" ,
"title" : "Murdoch's Sun Endorses Starmer's Labour Day Before UK Vote" ,
"body" : "Rupert Murdoch's Sun newspaper endorsed Keir Starmer and his opposition Labour Party to win the UK general election, a dramatic move in the British media landscape that illustrates the country's shifting political sands." ,
"url" : "https://www.msn.com/en-us/money/other/murdoch-s-sun-endorses-starmer-s-labour-day-before-uk-vote/ar-BB1plQwl" ,
"image" : "https://img-s-msn-com.akamaized.net/tenant/amp/entityid/BB1plZil.img?w=2000&h=1333&m=4&q=79" ,
"source" : "Bloomberg on MSN.com" ,
}, ...
]
去上面
def maps (
keywords ,
place : str | None = None ,
street : str | None = None ,
city : str | None = None ,
county : str | None = None ,
state : str | None = None ,
country : str | None = None ,
postalcode : str | None = None ,
latitude : str | None = None ,
longitude : str | None = None ,
radius : int = 0 ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo maps search. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query
place: if set, the other parameters are not used. Defaults to None.
street: house number/street. Defaults to None.
city: city of search. Defaults to None.
county: county of search. Defaults to None.
state: state of search. Defaults to None.
country: country of search. Defaults to None.
postalcode: postalcode of search. Defaults to None.
latitude: geographic coordinate (north-south position). Defaults to None.
longitude: geographic coordinate (east-west position); if latitude and
longitude are set, the other parameters are not used. Defaults to None.
radius: expand the search square by the distance in kilometers. Defaults to 0.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with maps search results.
"""
例子
results = DDGS (). maps ( "school" , place = "Uganda" , max_results = 50 )
# async
results = await AsyncDDGS (). amaps ( 'shop' , place = "Baltimor" , max_results = 10 )
print ( results )
[
{
"title" : "The Bun Shop" ,
"address" : "239 W Read St, Baltimore, MD 21201-4845" ,
"country_code" : None ,
"url" : "https://www.facebook.com/TheBunShop/" ,
"phone" : "+14109892033" ,
"latitude" : 39.3006042 ,
"longitude" : - 76.6195788 ,
"source" : "https://www.tripadvisor.com/Restaurant_Review-g60811-d4819859-Reviews-The_Bun_Shop-Baltimore_Maryland.html?m=63959" ,
"image" : "" ,
"desc" : "" ,
"hours" : {
"Fri" : "07:00:00–03:00:00" ,
"Mon" : "07:00:00–03:00:00" ,
"Sat" : "07:00:00–03:00:00" ,
"Sun" : "07:00:00–03:00:00" ,
"Thu" : "07:00:00–03:00:00" ,
"Tue" : "07:00:00–03:00:00" ,
"Wed" : "07:00:00–03:00:00" ,
"closes_soon" : 0 ,
"is_open" : 1 ,
"opens_soon" : 0 ,
"state_switch_time" : "03:00" ,
},
"category" : "Cafe" ,
"facebook" : "" ,
"instagram" : "" ,
"twitter" : "" ,
}, ...
]
去上面
def translate (
self ,
keywords : str ,
from_ : str | None = None ,
to : str = "en" ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo translate.
Args:
keywords: string or list of strings to translate.
from_: translate from (defaults automatically). Defaults to None.
to: what language to translate. Defaults to "en".
Returns:
List od dictionaries with translated keywords.
"""
例子
keywords = 'school'
# also valid
keywords = [ 'school' , 'cat' ]
results = DDGS (). translate ( keywords , to = "de" )
# async
results = await AsyncDDGS (). atranslate ( 'sun' , to = "de" )
print ( results )
[{ "detected_language" : "en" , "translated" : "Sonne" , "original" : "sun" }]
去上面
def suggestions (
keywords ,
region : str = "wt-wt" ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo suggestions. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
Returns:
List of dictionaries with suggestions results.
"""
例子
results = DDGS (). suggestions ( "fly" )
# async
results = await AsyncDDGS (). asuggestions ( 'sun' )
print ( results )
[
{ "phrase" : "sunshine live" },
{ "phrase" : "sunexpress" },
{ "phrase" : "sunday natural" },
{ "phrase" : "sunrise village spiel" },
{ "phrase" : "sunny portal" },
{ "phrase" : "sundair" },
{ "phrase" : "sunny cars" },
{ "phrase" : "sunexpress online check-in" },
]
該圖書館不隸屬於Duckduckgo,僅用於教育目的。它不是用於商業用途或任何違反Duckduckgo服務條款的目的。通過使用此庫,您可以承認您不會以侵犯Duckduckgo的條款的方式使用它。可以在https://duckduckgo.com上找到官方的DuckDuckgo網站。
去上面