duckduckgo_search
v6.3.5
使用DuckDuckgo.com搜索引擎搜索单词,文档,图像,视频,新闻,地图和文本翻译。将文件和图像下载到本地硬盘驱动器。
pip install - U duckduckgo_search
笔记
您可以安装LXML以使用backend='html'
或backend='lite'
(尺寸≈12MB)使用text
函数pip install -U duckduckgo_search[lxml]
ddgs - - help
CLI示例:
# AI chat
ddgs chat
# text search
ddgs text - k "Assyrian siege of Jerusalem"
# find and download pdf files via proxy
ddgs text - k "Economics in one lesson filetype:pdf" - r wt - wt - m 50 - p https : // 1.2 . 3.4 : 1234 - d - dd economics_reading
# using Tor Browser as a proxy (`tb` is an alias for `socks5://127.0.0.1:9150`)
ddgs text - k "'The history of the Standard Oil Company' filetype:doc" - m 50 - d - p tb
# find and save to csv
ddgs text - k "'neuroscience exploring the brain' filetype:pdf" - m 70 - o neuroscience_list . csv
# don't verify SSL when making the request
ddgs text - k "Mississippi Burning" - v false
# find and download images
ddgs images - k "beware of false prophets" - r wt - wt - type photo - m 500 - d
# get news for the last day and save to json
ddgs news - k "sanctions" - m 100 - t d - o json
去上面
关键字示例 | 结果 |
---|---|
猫狗 | 关于猫或狗的结果 |
“猫和狗” | 确切术语“猫和狗”的结果。如果找不到结果,将显示相关的结果。 |
猫 - 狗 | 结果较少 |
猫 +狗 | 结果更多的狗 |
CATS FILETYPE:PDF | 关于猫的PDF。支持的文件类型:PDF,DOC(X),XLS(X),PPT(X),HTML |
狗网站:example.com | example.com上关于狗的页面 |
猫 - 站点:example.com | 关于猫的页面,不包括example.com |
Intitle:狗 | 页面标题包括“狗”一词 |
Inurl:猫 | 页面URL包括“猫”一词 |
去上面
xa-ar for Arabia
xa-en for Arabia (en)
ar-es for Argentina
au-en for Australia
at-de for Austria
be-fr for Belgium (fr)
be-nl for Belgium (nl)
br-pt for Brazil
bg-bg for Bulgaria
ca-en for Canada
ca-fr for Canada (fr)
ct-ca for Catalan
cl-es for Chile
cn-zh for China
co-es for Colombia
hr-hr for Croatia
cz-cs for Czech Republic
dk-da for Denmark
ee-et for Estonia
fi-fi for Finland
fr-fr for France
de-de for Germany
gr-el for Greece
hk-tzh for Hong Kong
hu-hu for Hungary
in-en for India
id-id for Indonesia
id-en for Indonesia (en)
ie-en for Ireland
il-he for Israel
it-it for Italy
jp-jp for Japan
kr-kr for Korea
lv-lv for Latvia
lt-lt for Lithuania
xl-es for Latin America
my-ms for Malaysia
my-en for Malaysia (en)
mx-es for Mexico
nl-nl for Netherlands
nz-en for New Zealand
no-no for Norway
pe-es for Peru
ph-en for Philippines
ph-tl for Philippines (tl)
pl-pl for Poland
pt-pt for Portugal
ro-ro for Romania
ru-ru for Russia
sg-en for Singapore
sk-sk for Slovak Republic
sl-sl for Slovenia
za-en for South Africa
es-es for Spain
se-sv for Sweden
ch-de for Switzerland (de)
ch-fr for Switzerland (fr)
ch-it for Switzerland (it)
tw-tzh for Taiwan
th-th for Thailand
tr-tr for Turkey
ua-uk for Ukraine
uk-en for United Kingdom
us-en for United States
ue-es for United States (es)
ve-es for Venezuela
vn-vi for Vietnam
wt-wt for No region
去上面
DDGS和ASYNCDGS类用于从DuckDuckgo.com检索搜索结果。要使用AsyncDDGS类,您可以使用Python的Asyncio库执行异步操作。为了初始化DDGS或AsyncDDGS类的实例,您可以提供以下可选参数:
class DDGS :
"""DuckDuckgo_search class to get search results from duckduckgo.com
Args:
headers (dict, optional): Dictionary of headers for the HTTP client. Defaults to None.
proxy (str, optional): proxy for the HTTP client, supports http/https/socks5 protocols.
example: "http://user:[email protected]:3128". Defaults to None.
timeout (int, optional): Timeout value for the HTTP client. Defaults to 10.
verify (bool): SSL verification when making the request. Defaults to True.
"""
这是初始化DDGS类的示例。
from duckduckgo_search import DDGS
results = DDGS (). text ( "python programming" , max_results = 5 )
print ( results )
这是初始化asyncddgs类的示例:
import asyncio
from duckduckgo_search import AsyncDDGS
async def aget_results ( word ):
results = await AsyncDDGS ( proxy = None ). atext ( word , max_results = 100 )
return results
async def main ():
words = [ "sun" , "earth" , "moon" ]
tasks = [ aget_results ( w ) for w in words ]
results = await asyncio . gather ( * tasks )
print ( results )
if __name__ == "__main__" :
asyncio . run ( main ())
去上面
软件包支持HTTP/HTTPS/SOCKS代理。示例: http://user:[email protected]:3128
。使用旋转代理。否则,使用每个DDGS或异步初始化的新代理。
1。最简单的方法。启动TOR浏览器
ddgs = DDGS ( proxy = "tb" , timeout = 20 ) # "tb" is an alias for "socks5://127.0.0.1:9150"
results = ddgs . text ( "something you need" , max_results = 50 )
2。使用任何代理服务器(示例带有iproyal旋转住宅代理)
ddgs = DDGS ( proxy = "socks5h://user:[email protected]:32325" , timeout = 20 )
results = ddgs . text ( "something you need" , max_results = 50 )
去上面
例外:
DuckDuckGoSearchException
:DuckDuckgo_Search错误的基本例外。RatelimitException
:从DuckDuckgoSearchException继承,为超过API请求率限制而筹集。TimeoutException
:从DuckDuckGoSearchException继承,为API请求超时提出。去上面
def chat ( self , keywords : str , model : str = "gpt-4o-mini" , timeout : int = 30 ) -> str :
"""Initiates a chat session with DuckDuckGo AI.
Args:
keywords (str): The initial message or question to send to the AI.
model (str): The model to use: "gpt-4o-mini", "claude-3-haiku", "llama-3.1-70b", "mixtral-8x7b".
Defaults to "gpt-4o-mini".
timeout (int): Timeout value for the HTTP client. Defaults to 30.
Returns:
str: The response from the AI.
"""
例子
results = DDGS (). chat ( "summarize Daniel Defoe's The Consolidator" , model = 'claude-3-haiku' )
# async
results = await AsyncDDGS (). achat ( 'describe the characteristic habits and behaviors of humans as a species' )
去上面
def text (
keywords : str ,
region : str = "wt-wt" ,
safesearch : str = "moderate" ,
timelimit : str | None = None ,
backend : str = "api" ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo text search generator. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: d, w, m, y. Defaults to None.
backend: api, html, lite. Defaults to api.
api - collect data from https://duckduckgo.com,
html - collect data from https://html.duckduckgo.com,
lite - collect data from https://lite.duckduckgo.com.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with search results.
"""
例子
results = DDGS (). text ( 'live free or die' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'y' , max_results = 10 )
# Searching for pdf files
results = DDGS (). text ( 'russia filetype:pdf' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'y' , max_results = 10 )
# async
results = await AsyncDDGS (). atext ( 'sun' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'y' , max_results = 10 )
print ( results )
[
{
"title" : "News, sport, celebrities and gossip | The Sun" ,
"href" : "https://www.thesun.co.uk/" ,
"body" : "Get the latest news, exclusives, sport, celebrities, showbiz, politics, business and lifestyle from The Sun" ,
}, ...
]
去上面
def answers ( keywords : str ) -> list [ dict [ str , str ]]:
"""DuckDuckGo instant answers. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query,
Returns:
List of dictionaries with instant answers results.
"""
例子
results = DDGS (). answers ( "sun" )
# async
results = await AsyncDDGS (). aanswers ( "sun" )
print ( results )
[
{
"icon" : None ,
"text" : "The Sun is the star at the center of the Solar System. It is a massive, nearly perfect sphere of hot plasma, heated to incandescence by nuclear fusion reactions in its core, radiating the energy from its surface mainly as visible light and infrared radiation with 10% at ultraviolet energies. It is by far the most important source of energy for life on Earth. The Sun has been an object of veneration in many cultures. It has been a central subject for astronomical research since antiquity. The Sun orbits the Galactic Center at a distance of 24,000 to 28,000 light-years. From Earth, it is 1 AU or about 8 light-minutes away. Its diameter is about 1,391,400 km, 109 times that of Earth. Its mass is about 330,000 times that of Earth, making up about 99.86% of the total mass of the Solar System. Roughly three-quarters of the Sun's mass consists of hydrogen; the rest is mostly helium, with much smaller quantities of heavier elements, including oxygen, carbon, neon, and iron." ,
"topic" : None ,
"url" : "https://en.wikipedia.org/wiki/Sun" ,
}, ...
]
去上面
def images (
keywords : str ,
region : str = "wt-wt" ,
safesearch : str = "moderate" ,
timelimit : str | None = None ,
size : str | None = None ,
color : str | None = None ,
type_image : str | None = None ,
layout : str | None = None ,
license_image : str | None = None ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo images search. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: Day, Week, Month, Year. Defaults to None.
size: Small, Medium, Large, Wallpaper. Defaults to None.
color: color, Monochrome, Red, Orange, Yellow, Green, Blue,
Purple, Pink, Brown, Black, Gray, Teal, White. Defaults to None.
type_image: photo, clipart, gif, transparent, line.
Defaults to None.
layout: Square, Tall, Wide. Defaults to None.
license_image: any (All Creative Commons), Public (PublicDomain),
Share (Free to Share and Use), ShareCommercially (Free to Share and Use Commercially),
Modify (Free to Modify, Share, and Use), ModifyCommercially (Free to Modify, Share, and
Use Commercially). Defaults to None.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with images search results.
"""
例子
results = DDGS (). images (
keywords = "butterfly" ,
region = "wt-wt" ,
safesearch = "off" ,
size = None ,
color = "Monochrome" ,
type_image = None ,
layout = None ,
license_image = None ,
max_results = 100 ,
)
# async
results = await AsyncDDGS (). aimages ( 'sun' , region = 'wt-wt' , safesearch = 'off' , max_results = 20 )
print ( images )
[
{
"title" : "File:The Sun by the Atmospheric Imaging Assembly of NASA's Solar ..." ,
"image" : "https://upload.wikimedia.org/wikipedia/commons/b/b4/The_Sun_by_the_Atmospheric_Imaging_Assembly_of_NASA's_Solar_Dynamics_Observatory_-_20100819.jpg" ,
"thumbnail" : "https://tse4.mm.bing.net/th?id=OIP.lNgpqGl16U0ft3rS8TdFcgEsEe&pid=Api" ,
"url" : "https://en.wikipedia.org/wiki/File:The_Sun_by_the_Atmospheric_Imaging_Assembly_of_NASA's_Solar_Dynamics_Observatory_-_20100819.jpg" ,
"height" : 3860 ,
"width" : 4044 ,
"source" : "Bing" ,
}, ...
]
去上面
def videos (
keywords : str ,
region : str = "wt-wt" ,
safesearch : str = "moderate" ,
timelimit : str | None = None ,
resolution : str | None = None ,
duration : str | None = None ,
license_videos : str | None = None ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo videos search. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: d, w, m. Defaults to None.
resolution: high, standart. Defaults to None.
duration: short, medium, long. Defaults to None.
license_videos: creativeCommon, youtube. Defaults to None.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with videos search results.
"""
例子
results = DDGS (). videos (
keywords = "cars" ,
region = "wt-wt" ,
safesearch = "off" ,
timelimit = "w" ,
resolution = "high" ,
duration = "medium" ,
max_results = 100 ,
)
# async
results = await AsyncDDGS (). avideos ( 'sun' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'y' , max_results = 10 )
print ( results )
[
{
"content" : "https://www.youtube.com/watch?v=6901-C73P3g" ,
"description" : "Watch the Best Scenes of popular Tamil Serial #Meena that airs on Sun TV. Watch all Sun TV serials immediately after the TV telecast on Sun NXT app. *Free for Indian Users only Download here: Android - http://bit.ly/SunNxtAdroid iOS: India - http://bit.ly/sunNXT Watch on the web - https://www.sunnxt.com/ Two close friends, Chidambaram ..." ,
"duration" : "8:22" ,
"embed_html" : '<iframe width="1280" height="720" src="https://www.youtube.com/embed/6901-C73P3g?autoplay=1" frameborder="0" allowfullscreen></iframe>' ,
"embed_url" : "https://www.youtube.com/embed/6901-C73P3g?autoplay=1" ,
"image_token" : "6c070b5f0e24e5972e360d02ddeb69856202f97718ea6c5d5710e4e472310fa3" ,
"images" : {
"large" : "https://tse4.mm.bing.net/th?id=OVF.JWBFKm1u%2fHd%2bz2e1GitsQw&pid=Api" ,
"medium" : "https://tse4.mm.bing.net/th?id=OVF.JWBFKm1u%2fHd%2bz2e1GitsQw&pid=Api" ,
"motion" : "" ,
"small" : "https://tse4.mm.bing.net/th?id=OVF.JWBFKm1u%2fHd%2bz2e1GitsQw&pid=Api" ,
},
"provider" : "Bing" ,
"published" : "2024-07-03T05:30:03.0000000" ,
"publisher" : "YouTube" ,
"statistics" : { "viewCount" : 29059 },
"title" : "Meena - Best Scenes | 02 July 2024 | Tamil Serial | Sun TV" ,
"uploader" : "Sun TV" ,
}, ...
]
去上面
def news (
keywords : str ,
region : str = "wt-wt" ,
safesearch : str = "moderate" ,
timelimit : str | None = None ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo news search. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: d, w, m. Defaults to None.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with news search results.
"""
例子
results = DDGS (). news ( keywords = "sun" , region = "wt-wt" , safesearch = "off" , timelimit = "m" , max_results = 20 )
# async
results = await AsyncDDGS (). anews ( 'sun' , region = 'wt-wt' , safesearch = 'off' , timelimit = 'd' , max_results = 10 )
print ( results )
[
{
"date" : "2024-07-03T16:25:22+00:00" ,
"title" : "Murdoch's Sun Endorses Starmer's Labour Day Before UK Vote" ,
"body" : "Rupert Murdoch's Sun newspaper endorsed Keir Starmer and his opposition Labour Party to win the UK general election, a dramatic move in the British media landscape that illustrates the country's shifting political sands." ,
"url" : "https://www.msn.com/en-us/money/other/murdoch-s-sun-endorses-starmer-s-labour-day-before-uk-vote/ar-BB1plQwl" ,
"image" : "https://img-s-msn-com.akamaized.net/tenant/amp/entityid/BB1plZil.img?w=2000&h=1333&m=4&q=79" ,
"source" : "Bloomberg on MSN.com" ,
}, ...
]
去上面
def maps (
keywords ,
place : str | None = None ,
street : str | None = None ,
city : str | None = None ,
county : str | None = None ,
state : str | None = None ,
country : str | None = None ,
postalcode : str | None = None ,
latitude : str | None = None ,
longitude : str | None = None ,
radius : int = 0 ,
max_results : int | None = None ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo maps search. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query
place: if set, the other parameters are not used. Defaults to None.
street: house number/street. Defaults to None.
city: city of search. Defaults to None.
county: county of search. Defaults to None.
state: state of search. Defaults to None.
country: country of search. Defaults to None.
postalcode: postalcode of search. Defaults to None.
latitude: geographic coordinate (north-south position). Defaults to None.
longitude: geographic coordinate (east-west position); if latitude and
longitude are set, the other parameters are not used. Defaults to None.
radius: expand the search square by the distance in kilometers. Defaults to 0.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Returns:
List of dictionaries with maps search results.
"""
例子
results = DDGS (). maps ( "school" , place = "Uganda" , max_results = 50 )
# async
results = await AsyncDDGS (). amaps ( 'shop' , place = "Baltimor" , max_results = 10 )
print ( results )
[
{
"title" : "The Bun Shop" ,
"address" : "239 W Read St, Baltimore, MD 21201-4845" ,
"country_code" : None ,
"url" : "https://www.facebook.com/TheBunShop/" ,
"phone" : "+14109892033" ,
"latitude" : 39.3006042 ,
"longitude" : - 76.6195788 ,
"source" : "https://www.tripadvisor.com/Restaurant_Review-g60811-d4819859-Reviews-The_Bun_Shop-Baltimore_Maryland.html?m=63959" ,
"image" : "" ,
"desc" : "" ,
"hours" : {
"Fri" : "07:00:00–03:00:00" ,
"Mon" : "07:00:00–03:00:00" ,
"Sat" : "07:00:00–03:00:00" ,
"Sun" : "07:00:00–03:00:00" ,
"Thu" : "07:00:00–03:00:00" ,
"Tue" : "07:00:00–03:00:00" ,
"Wed" : "07:00:00–03:00:00" ,
"closes_soon" : 0 ,
"is_open" : 1 ,
"opens_soon" : 0 ,
"state_switch_time" : "03:00" ,
},
"category" : "Cafe" ,
"facebook" : "" ,
"instagram" : "" ,
"twitter" : "" ,
}, ...
]
去上面
def translate (
self ,
keywords : str ,
from_ : str | None = None ,
to : str = "en" ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo translate.
Args:
keywords: string or list of strings to translate.
from_: translate from (defaults automatically). Defaults to None.
to: what language to translate. Defaults to "en".
Returns:
List od dictionaries with translated keywords.
"""
例子
keywords = 'school'
# also valid
keywords = [ 'school' , 'cat' ]
results = DDGS (). translate ( keywords , to = "de" )
# async
results = await AsyncDDGS (). atranslate ( 'sun' , to = "de" )
print ( results )
[{ "detected_language" : "en" , "translated" : "Sonne" , "original" : "sun" }]
去上面
def suggestions (
keywords ,
region : str = "wt-wt" ,
) -> list [ dict [ str , str ]]:
"""DuckDuckGo suggestions. Query params: https://duckduckgo.com/params.
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
Returns:
List of dictionaries with suggestions results.
"""
例子
results = DDGS (). suggestions ( "fly" )
# async
results = await AsyncDDGS (). asuggestions ( 'sun' )
print ( results )
[
{ "phrase" : "sunshine live" },
{ "phrase" : "sunexpress" },
{ "phrase" : "sunday natural" },
{ "phrase" : "sunrise village spiel" },
{ "phrase" : "sunny portal" },
{ "phrase" : "sundair" },
{ "phrase" : "sunny cars" },
{ "phrase" : "sunexpress online check-in" },
]
该图书馆不隶属于Duckduckgo,仅用于教育目的。它不是用于商业用途或任何违反Duckduckgo服务条款的目的。通过使用此库,您可以承认您不会以侵犯Duckduckgo的条款的方式使用它。可以在https://duckduckgo.com上找到官方的DuckDuckgo网站。
去上面