Ponysearch

Author	SHA1	Message	Date
Markus Heiser	44392bd436	[mod] improve implementation of presearch engine Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-11-27 14:16:42 +01:00
Bnyro	23582aac5c	[feat] implementation of presearch engine	2023-11-27 14:16:42 +01:00
Bnyro	c3cc24be12	[feat] engine: implementation of destatis	2023-11-27 13:54:48 +01:00
Émilien (perso)	4280318fc5	fixing results parsing brave	2023-10-13 11:47:30 +02:00
Hackurei	efada7cba2	[fix] hackernews keyerror problem	2023-10-13 08:16:47 +02:00
Hackurei	af071121de	[fix] imgur - incorrect wikidata id	2023-10-12 09:14:00 +02:00
Markus Heiser	14323d683f	[fix] ddg-lite & ddg-extra: don't send empty vqd value DDG's bot detection is sensitive to the vqd value. For some search terms (such as extremely long search terms that are often sent by bots), no vqd value can be determined. If SearXNG cannot determine a vqd value, then no request should go out to DDG (WEB): a request with a wrong vqd value leads to DDG temporarily putting SearXNG's IP on a block list. Requests from IPs in this block list run into timeouts. Not sure, but it seems the block list is a sliding window: to get my IP rid from the bot list I had to cool down my IP for 1h (send no requests from that IP to DDG). Since such issues can't reproduce in a local instance I tested this patch 24h on my public SearXNG instance: There are still errors (rare), but the reliability is still 100%. Related: - https://github.com/searxng/searxng/pull/2922 - https://github.com/searxng/searxng/pull/2923 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-10-12 08:52:28 +02:00
Markus Heiser	3388441917	[fix] ddg-lite vqd value: some search terms do not have a vqd value Some search terms do not have results and therefore no vqd value BTW: remove a leftover from `9197efa` Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-10-10 09:12:30 +02:00
Markus Heiser	9197efa2a7	[fix] duckduckgo lite engine: set HTTP header 'Referer' We have had problems with this before, the bot protection from ddg-lite seems to have included this referer in the rating [1][2]. From reverse engineering: - The Referer ``https://google.com/`` was set in commt `257dc7d6c4` --> DDG lite does not like this referer anymore! - The 'Referer' header is only set on second and follow up pages but not on the first page - The vqd value is not needed on the first page, the ddg-lite client sets this value only on follow up pages / this can help to reduce the vqd requests from SearXNG. Related to 'Referer' header & ddg requests: [1] https://github.com/searxng/searxng/pull/2161 [2] https://github.com/searxng/searxng/pull/2081 Closes: https://github.com/searxng/searxng/issues/2796 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-10-10 08:40:53 +02:00
Bnyro	fa5b2a7948	[mod] yacy: use official instance by default and fix crashes	2023-10-09 20:50:24 +02:00
Hackurei	ff78b1a902	[feat] implement hackernews engine - news.ycombinator.com	2023-10-09 14:00:04 +02:00
Aine	213cb74378	[fix] matrixrooms add proper MRS integration Related: - https://github.com/searxng/searxng/issues/2918	2023-10-09 13:25:13 +02:00
Bnyro	48cb58bd2e	[feat] duckduckgo: support for videos and news	2023-10-09 06:53:43 +02:00
Bnyro	c3ab49cd90	[fix] kickass: crash when no results	2023-10-07 11:48:23 +02:00
Bnyro	f22daf8b47	[mod] piped: always show video length if available	2023-10-07 11:45:46 +02:00
Bnyro	ce270961e8	[feat] engine: implementation of mastodon	2023-10-06 10:58:23 +02:00
Markus Heiser	fd1422a670	[mod] engine - simplify region & lang handling, make filters configurable Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-10-05 10:55:08 +02:00
Bnyro	3e2ae756f0	[feat] engine: implementation of radio-browser.info	2023-10-05 10:55:08 +02:00
Jinyuan Huang	e509cb7c45	[typo] solved a typo in yahoo error message.	2023-10-01 08:29:06 +02:00
Jinyuan Huang	d4d9f2073e	[fix] Bug: Yahoo results for simplified Chinese search sometimes have the first character cut off #2866 Co-authored-by: Blair Noctis <n@sail.ng>	2023-10-01 08:29:06 +02:00
Bnyro	fe9386b58d	[fix] emojipedia: fix engine	2023-10-01 08:19:45 +02:00
Markus Heiser	32a4ea350e	[fix] Revision of the Bing engines Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-10-01 08:01:38 +02:00
jazzzooo	079636c079	[fix] engine - bing fix search, pagination, remove safesearch	2023-10-01 08:01:38 +02:00
Bnyro	5ce1792432	[feat] engine: implementation of pinterest	2023-09-30 15:01:45 +02:00
Bnyro	6096457e4d	[fix] matrixrooms.info: pagination not working properly	2023-09-30 14:51:07 +02:00
Markus Heiser	e1a8b8189f	[fix] engine - moviepilot instead of thumbnail use img_src Instead of thumbnail use img_src in the result item, otherwise the "movies" categories looks clunky. Related: - `b4e0d2eedc (r128785388)` Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-30 11:29:19 +02:00
Bnyro	159629c588	[mod] tagesschau: add option to only use tagesschau urls	2023-09-30 11:00:11 +02:00
Bnyro	2ca60a19fc	[feat] engine: implementation of matrixrooms.info	2023-09-30 09:09:23 +02:00
Bnyro	fc4a20f734	[mod] add movies category for tmdb, imdb and moviepilot	2023-09-29 22:37:51 +02:00
jazzzooo	e37d775fa2	[fix] engine - currency fix and simplify	2023-09-28 08:29:38 +02:00
Jinyuan Huang	ae28d429c9	[fix] bilibili new api used	2023-09-28 08:24:51 +02:00
jazzzooo	1a66d74673	[fix] engine - kickass update url, fix parsing, use multiple mirrors	2023-09-27 10:19:41 +02:00
Markus Heiser	b428ccc5a0	[fix] engine brave - fetch traits (modified settings menu) Brave has changed it settings menu fundamental. Region codes are no longer in the HTML page, we have to read the regional codes from a JS: https://cdn.search.brave.com/serp/v2/_app/immutable/chunks/parameters.734c106a.js Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-27 09:12:38 +02:00
Markus Heiser	3a456b1282	[fix] engine annas archive - fetch traits (modified xpath selectors) Anna’s Archive has cleaned up their languages, available file extensions and changed the HTML form. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-27 09:12:38 +02:00
Émilien (perso)	1851f27154	[mod] remove twitter (#2843 )	2023-09-24 08:32:53 +00:00
Bnyro	75c9de02d1	[feat] engine: implementation of imgur	2023-09-22 20:50:53 +02:00
Bnyro	fb72f71f0a	[fix] internet archive scholar: crash when there's no title	2023-09-22 18:49:39 +02:00
Markus Heiser	71358e9c67	Revert "[fix] engine - duckduckgo vqd edge-case" This reverts commit `102502a4f0`.	2023-09-22 09:31:25 +02:00
Bnyro	51236ae47a	[feat] engine: implementation of chefkoch.de	2023-09-21 17:23:59 +02:00
jazzzooo	8bcca0e620	[fix] engine - brave don't show ads	2023-09-21 16:55:39 +02:00
jazzzooo	b729542a66	[fix] engine - google images error when no results	2023-09-21 16:38:37 +02:00
Bnyro	cc2e0537a3	[feat] engine: implementation of google icons/material design icons	2023-09-21 15:16:49 +02:00
Bnyro	c999cfb422	[feat] engine: implementation of wallhaven	2023-09-21 14:25:43 +02:00
jazzzooo	102502a4f0	[fix] engine - duckduckgo vqd edge-case	2023-09-20 20:05:06 +02:00
Markus Heiser	043dcbf7c5	[fix] engine qwant (web-lite) - ignore advertising adds Closes: https://github.com/searxng/searxng/issues/2812 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-19 17:06:56 +02:00
Émilien (perso)	ad725ce7d7	wikipedia wikidata infobox + disable wikisource (#2806 ) Co-authored-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-19 10:31:02 +02:00
Bnyro	efd3a2d6d1	[feat] engine: implementation of internet archive scholar	2023-09-18 18:12:00 +02:00
jazzzooo	223b3487c3	[fix] spelling	2023-09-18 16:20:27 +02:00
Markus Heiser	a9b6963971	[fix] engine - qwant delivers only 5 pages maximum all qwant engines (incl qwant-lite) delivers only 5 pages maximum Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-18 01:43:43 +02:00
jazzzooo	da1446c5ed	[fix] engine - qwant wrong error type	2023-09-18 01:43:43 +02:00
Markus Heiser	7398d525c8	[fix] qwant: subsequent fix of commit `d9dbcedeb` Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-17 17:52:56 +02:00
Markus Heiser	d9dbcedeb6	[feat] implementation of qwant lite for web search Related: https://github.com/searxng/searxng/issues/2719 Replace: https://github.com/searxng/searxng/pull/2748 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-17 16:53:25 +02:00
Bnyro	b4e0d2eedc	[feat] engine: implemenation of moviepilot (de)	2023-09-17 14:30:56 +02:00
jazzzooo	7dfcc3386e	[fix] tagesschau videos	2023-09-16 18:40:26 +02:00
jazzzooo	ec540a967a	[fix] brave.videos	2023-09-15 22:00:09 +02:00
jazzzooo	27477f51fd	[fix] brave.news	2023-09-15 22:00:09 +02:00
Justas Zabulionis	41ef73ca3a	[fix] rumble redirect	2023-09-14 19:21:21 +02:00
Justas Zabulionis	be888810ba	[fix] pubmed content being None	2023-09-14 18:40:15 +02:00
Justas Zabulionis	92d39de410	[fix] solidtorrents redirects	2023-09-14 18:03:21 +02:00
Justas Zabulionis	cf8a6cf6db	[fix] solidtorrents pagination	2023-09-14 18:03:21 +02:00
Justas Zabulionis	8172f89075	[fix] solidtorrents	2023-09-14 18:03:21 +02:00
jazzzooo	74600c028d	[fix] engine - Crossref Crossref was broken on result types journal-issue and component .. The old code had lots of assumptions, and broke during parsing. Now the assumptions are more explicit and checked them with the API.	2023-09-14 17:39:23 +02:00
Bnyro	3568a3cafb	[feat] odysee: implement fetch_traits for language support	2023-09-13 21:41:33 +02:00
Bnyro	09c61dabc9	[mod] odysee: time range support	2023-09-13 21:41:33 +02:00
jazzzooo	b98907e91f	[fix] engine - piped.music incorrect timestamps	2023-09-13 21:39:37 +02:00
jazzzooo	6039dbf211	[fix] engine - invidious thumbnails	2023-09-13 11:37:42 +02:00
jazzzooo	b2fd6304bf	[fix] engine - openstreetmap currency rendering	2023-09-13 10:56:52 +02:00
jazzzooo	54a3e03b45	[fix] engine - openstreetmap currency matching	2023-09-12 20:57:05 +02:00
Bnyro	64d9587ac8	[feat] new engine: svgrepo	2023-09-12 20:38:36 +02:00
jazzzooo	b189578b6b	[fix] engine - brave	2023-09-12 11:31:43 +02:00
Bnyro	f182abd6f8	[mod] library of congress: fix engine	2023-09-11 19:42:31 +02:00
Bnyro	e73a6f5d14	[fix] engine deviantart: review of the result-scrapper The deviantart site changed and hence deviantart is currently unusable.	2023-09-11 13:22:36 +02:00
Alexandre Flament	d07c006aed	Replace chompjs with pure Python code The new implementation is good enough for the current usage (brave)	2023-09-09 13:02:36 +02:00
Bnyro	9e83c0dedc	[feat] engine: implementation of Yummly Co-authored-by: Markus Heiser <markus.heiser@damarit.de>	2023-09-08 11:47:13 +02:00
Bnyro	a3d7e9c285	[mod] utils.py: add markdown_to_text helper function	2023-09-08 11:47:13 +02:00
Hackurei	1f21ac7d62	[feat] engine: implementation of bilibili https://www.bilibili.com	2023-09-05 22:53:03 +02:00
Markus Heiser	696c35d2c3	[fix] engine - duckduckgo_images / determination of vqd value incorrect Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-05 22:24:51 +02:00
bonswouar	4fb26cd96d	[fix] engine duckduckgo weather api changes	2023-09-05 16:55:00 +02:00
Markus Heiser	01be9e0e20	[fix] engine: wikicommons - don't quoute ':\|' in URL parameters From [1]: It seems to be because of [2] For some reason it gets url encoded twice, resulting in - ``filetype%253Abitmap%257Cdrawing+birds`` instead of - ``filetype:bitmap%7Cdrawing+birds`` [1] https://github.com/searxng/searxng/issues/2707 [2] https://github.com/searxng/searxng/blob/master/searx/engines/wikicommons.py#L43 Closes: #2707 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-03 18:45:39 +02:00
Markus Heiser	4f8895c6de	[fix] follow-up of `4da7003ae` / add missing review from @Bnyro [1] https://github.com/searxng/searxng/pull/2656#pullrequestreview-1607956209 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-09-02 09:26:45 +02:00
Hackurei	4da7003ae0	[feat] engine: implementation of odysee	2023-09-02 09:14:12 +02:00
Bnyro	9c4e9d3814	[feat] implementation of Wikimedia commons for images	2023-09-01 18:39:24 +02:00
Alexandre Flament	faa4280e1a	[mod] bing: resolve redirect without additional requests Remove the usage of searx.network.multi_requests The results from Bing contains the target URL encoded in base64 See the u parameter, remove the first two character "a1", and done. Also add a comment the check of the result_len / pageno ( from https://github.com/searx/searx/pull/1387 )	2023-08-29 07:39:06 +02:00
Markus Heiser	b0d2cd5ca9	[doc] add documentation of Mwmbl engine & autocompleter Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-27 17:25:26 +02:00
Bnyro	19300a5659	[mod] engine mwmbl: add link to official api docs	2023-08-27 17:25:26 +02:00
Alexandre Flament	e16c007c22	[fix] openstreetmap engine It seems there is an API change: extratags can be either a dictionnary or None. This commit avoid crash when extratags is None Test query "!osm gare du nord"	2023-08-27 11:49:16 +02:00
Markus Heiser	0647f83a3e	[fix] google engine: don't overspecify the search query to Google The method EngineTraits.get_region(..) returns engine's region string that best fits to SearXNG's locale. This means it returns a region (country) if only a language is set in the locale. By example the method returns for a locale tag `es` a region `ES`. Google's search parameter `cr` restricts search results to documents originating in a particular country / in case of a locale tag (language) as described above, this argument should be unset in the query send to Google. Closes: https://github.com/searxng/searxng/issues/2672 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-26 07:47:07 +02:00
Markus Heiser	4b42644579	[fix] engine google_video: google has changed the layout of the rsponse Closes: https://github.com/searxng/searxng/issues/2664 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-22 08:34:04 +02:00
Bnyro	c59ae91b76	[feat] engine: implementation of mwmbl	2023-08-19 18:23:42 +02:00
Markus Heiser	c741fc6f00	[mod] currency_convert: support for showing the answer source url Show URL of the ddg-search page, not the URL of a (generic) Javascript. The latter one is not usefull for the user. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-18 19:07:14 +02:00
Markus Heiser	e2744520f8	[mod] google: support for showing the answer source url Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-18 19:07:14 +02:00
Bnyro	5ec7df3480	[mod] engine duckduckgo definitions: support for answer source	2023-08-18 19:07:14 +02:00
Bnyro	64bc98b5fb	[mod] brave: support for showing the answer source url	2023-08-18 19:07:14 +02:00
Markus Heiser	9100a48541	[mod] improve seekr engines and add documentation Tis patch adds some more fields to the result items and changed paging to the ``nextResultSet`` given in seekr's JSON response. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-15 16:17:42 +02:00
Bnyro	2bab658d39	[feat] engine: implementation of seekr for news, images and videos	2023-08-15 16:17:42 +02:00
Bnyro	e25d1c7288	[feat] engine: implementation of German news, Tagesschau Co-authored-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-10 20:27:54 +02:00
Bnyro	834e1c3f12	[mod] engine lemmy: increase thumbnail quality to align with theme	2023-08-10 12:58:40 +02:00
Markus Heiser	c381fc001f	[mod] settings: remove lemmy from categ 'general' & enable by default Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-10 12:58:40 +02:00
Markus Heiser	fda111c0c9	[mod] engine lemmy: add more info fields to the result items Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-10 12:58:40 +02:00
Bnyro	224f2250ae	[feat] engine: support for lemmy communities, posts, comments and users	2023-08-10 12:58:40 +02:00
Bnyro	9f82c39610	[mod] engine google_play: raise error on unsupported category	2023-08-10 12:35:24 +02:00
Bnyro	0a99dc85b9	[mod] engine brave: raise error on unsupported category	2023-08-10 12:35:24 +02:00
allendema_searxng_pi	c00c0c5434	[mod] remove discontinued petalsearch engines	2023-08-09 07:17:40 +02:00
Markus Heiser	b8352eca0c	[mod] brave engines: add fetch_traits() / improve language support Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-08 16:21:45 +02:00
Markus Heiser	460bbe5b81	[mod] implement brave (WEB) engine to replace XPath configuration Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-08 16:21:45 +02:00
Bnyro	d151497db3	[feat] engine: brave - support for news	2023-08-08 16:21:45 +02:00
Bnyro	cae06f2781	[feat] engine: brave - support for videos	2023-08-08 16:21:45 +02:00
Bnyro	73364e158e	[feat] engine: brave - support for images	2023-08-08 16:21:45 +02:00
Markus Heiser	1d0abb7157	[doc] engine bt4g: add documentation to docs/dev/engines/online/ Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-06 09:30:48 +02:00
Emilien Devos	0fc8f99ecc	[feat] new engine: bt4g added & enabled and disable by default btdigg Disable btdigg because on most SearXNG instances, SearXNG is blocked by btdigg due to cloudflare too many requests. This impementation did not parse the HTML page because there is an API in XML (RSS). The RSS feed provides fewer data like amount of seeders/leechers and the files in the torrent file. It's a tradeoff for a "stable" engine as the XML from RSS content will change way less than the HTML page. Closes: https://github.com/searxng/searxng/issues/2553	2023-08-06 09:30:48 +02:00
Markus Heiser	db522cf76d	[mod] engine: wikimedia - improve results, add addition settings & doc Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-04 19:06:50 +02:00
Bnyro	7d8c20c80d	[feat] new engine: wikispecies	2023-08-04 19:06:50 +02:00
Markus Heiser	1b030d4b41	[doc] engine: Yacy Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-03 19:58:51 +02:00
zutto	ca518c6803	add option to change yacy search mode	2023-08-03 19:58:51 +02:00
Markus Heiser	203f1f0928	[fix] engine piped: 'invalid content' SearXNG does not allow a None value in the content field of a result item. If the key (shortDescription, uploaderName) in the JSON response from piped exists but is set to None, SearXNG ignores this result item:: DEBUG searx : result: invalid content: { .., 'content': None, ..} Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-03 16:23:36 +02:00
Markus Heiser	207fcc0c8c	[mod] engine piped: add paging support Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-03 16:23:36 +02:00
Markus Heiser	ef5831cd84	[mod] engine piped: split into two dedicated engiens for video & music Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-03 16:23:36 +02:00
Markus Heiser	7aa95d2d52	[doc] engine piped: add documentation to docs/dev/engines/online/ Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-03 16:23:36 +02:00
Bnyro	636bfdac68	[feat] engine: implementation of Piped	2023-08-03 16:23:36 +02:00
Paolo Basso	cada89ee36	[feat] engine: re-enables z-library (zlibrary-global.se) - re-enables z-library as the new domain zlibrary-global.se is now available from the open web. The announcement of the domain: https://www.reddit.com/r/zlibrary/comments/13whe08/mod_note_zlibraryglobalse_domain_is_officially/ It is an official domain, it requires to log in to the "personal" subdomain only to download files, but the search works. - changes the result template of zlibrary to paper.html, filling the appropriate fields - implements language filtering for zlibrary - implement zlibrary custom filters (engine traits) - refactor and document the zlibrary engine	2023-07-07 21:36:51 +02:00
Markus Heiser	5720844fcd	[doc] rearranges Settings & Engines docs for better readability We have built up detailed documentation of the settings and the engines over the past few years. However, this documentation was still spread over various chapters and was difficult to navigate in its entirety. This patch rearranges the Settings & Engines documentation for better readability. To review new ordered docs:: make docs.clean docs.live Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-07-01 22:45:19 +02:00
Markus Heiser	87e7926ae9	[fix] engine: Anna's Archive - grep results from '.js-scroll-hidden' elements The renderuing of the WEB page is very strange; except the firts position all other positions of Anna's result page are enclosed in SGML comments. These cooments are uncommented by some JS code, see query of the class '.js-scroll-hidden' in Anna's HTML template [1]. [1] https://annas-software.org/AnnaArchivist/annas-archive/-/blob/main/allthethings/templates/macros/md5_list.html Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-29 09:32:57 +02:00
Markus Heiser	e2df6b77a3	[mod] engine: Anna's Archive - additionl settings (content, sort, ext) Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-29 09:32:57 +02:00
Markus Heiser	eafc2906f1	[mod] engine: Anna's Archive - fetch search arguments from search form Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-29 09:32:57 +02:00
Paolo Basso	7adb9090e5	[mod] engine: Anna's Archive - add language support	2023-06-29 09:32:57 +02:00
Paolo Basso	e5637fe7b9	[feat] engine: implementation of Anna's Archive Anna's Archive [1] is a free non-profit online shadow library metasearch engine providing access to a variety of book resources (also via IPFS), created by a team of anonymous archivists [2]. [1] https://annas-archive.org/ [2] https://annas-software.org/AnnaArchivist/annas-archive	2023-06-29 09:32:57 +02:00
Paolo Basso	401561cb58	[mod] engine torznab - refactor & option to hide links - torznab engine using types and clearer code - torznab option to hide torrent and magnet links. - document the torznab engine - add myself to authors Closes: https://github.com/searxng/searxng/issues/1124 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-28 10:03:44 +02:00
Markus Heiser	da7c30291d	[fix] Google API changed It seems that Google is rolling out a modified WEB API [1][2]. In the past there was only the UI language in the `hl` argument but nowadays it seems a combination of the UI language and the "search region" is mixed in this argument and the `gl` argument has been removed. I'm very surprised that google is starting to mix the parameters of the UI with the parameters of the search index. This patch modifies the get_google_info(..) function. Beside Google-WEB this function is also used by other Google services, here are some examples to test region & language of .. - Google-WEB: `!go dragon boat :en-CA` - Google-News: `!gon dragon boat :en-CA` - Google-Videos: `!gov bmw :en-CA` - Goolge-Images `!goi bmw :en-CA` - [1] https://github.com/searxng/searxng/issues/2515#issuecomment-1606294635 - [2] https://github.com/searxng/searxng/issues/2515#issuecomment-1607150817 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-26 18:28:09 +02:00
Markus Heiser	e8706fb738	[fix] engine & network issues / documentation and type annotations This patch fixes some quirks and issues related to the engines and the network. Each engine has its own network and this network was broken for the following engines[1]: - archlinux - bing - dailymotion - duckduckgo - google - peertube - startpage - wikipedia Since the files have been touched anyway, the type annotaions of the engine modules has also been completed so that error messages from the type checker are no longer reported. Related and (partial) fixed issue: - [1] https://github.com/searxng/searxng/issues/762#issuecomment-1605323861 - [2] https://github.com/searxng/searxng/issues/2513 - [3] https://github.com/searxng/searxng/issues/2515 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-25 13:58:26 +02:00
pankaj	4900c091a6	use logger.warning logger.warn() is depricated. logger.warning is already being used in some files.	2023-05-19 19:35:29 +05:30
Markus Heiser	caebd297e9	[fix] engine ddg: minor change in the API of ddg Closes: https://github.com/searxng/searxng/issues/2419 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-05-12 18:58:49 +02:00
Markus Heiser	9b575a997b	[fix] doc of locales.get_engine_locale() / zh-classical is missleading Wikipedia's zh-classical is not zh_Hant (see doc-string of engines.wikipedia). Fixed the example in the doc-string of locales.get_engine_locale() to 'zh_TW'. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-17 08:48:57 +02:00
Markus Heiser	f1b6351ae1	[fix] engine: google play movies Closes: https://github.com/searxng/searxng/pull/1746 Closes: https://github.com/searxng/searxng/issues/1599 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-16 19:15:44 +02:00
Markus Heiser	27369ebec2	[fix] searxng_extra/update/update_engine_descriptions.py (part 1) Follow up of #2269 The script to update the descriptions of the engines does no longer work since PR #2269 has been merged. searx/engines/wikipedia.py ========================== 1. There was a misusage of zh-classical.wikipedia.org: - `zh-classical` is dedicate to classical Chinese [1] which is not traditional Chinese [2]. - zh.wikipedia.org has LanguageConverter enabled [3] and is going to dynamically show simplified or traditional Chinese according to the HTTP Accept-Language header. 2. The update_engine_descriptions.py needs a list of all wikipedias. The implementation from #2269 included only a reduced list: - https://meta.wikimedia.org/wiki/Wikipedia_article_depth - https://meta.wikimedia.org/wiki/List_of_Wikipedias searxng_extra/update/update_engine_descriptions.py ================================================== Before PR #2269 there was a match_language() function that did an approximation using various methods. With PR #2269 there are only the types in the data model of the languages, which can be recognized by babel. The approximation methods, which are needed (only here) in the determination of the descriptions, must be replaced by other methods. [1] https://en.wikipedia.org/wiki/Classical_Chinese [2] https://en.wikipedia.org/wiki/Traditional_Chinese_characters [3] https://www.mediawiki.org/wiki/Writing_systems#LanguageConverter Closes: https://github.com/searxng/searxng/issues/2330 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-15 16:03:59 +02:00
Markus Heiser	23ac964e35	[fix] Bing-WEB: use <span class='algoSlug_icon'> for the description On some result items from Bing-WEB the `<span class='algoSlug_icon'>` tag is the only tag that contains a description. The issue can be reproduced by [1]:: !bi vmware [1] https://github.com/searxng/searxng/issues/1764#issuecomment-1417990531 Reported-by: @AlyoshaVasilieva Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-08 09:43:04 +02:00
Markus Heiser	2ffd446e5c	[mod] clarify the difference of the default category and subgrouping This PR does no functional change it is just an attempt to make more clear in the code, what a default category is and what a subcategory is. The previous name 'others' leads to confusion with the category 'other'. If a engine is not assigned to a category, the default is assigned:: DEFAULT_CATEGORY = 'other' If an engine has only one category and this category is shown as tab in the user interface, this engine has no further subgrouping:: NO_SUBGROUPING = 'without further subgrouping' Related: - https://github.com/searxng/searxng/issues/1604 - https://github.com/searxng/searxng/pull/1545 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-07 11:03:25 +02:00
Markus Heiser	5234e45010	[fix] Gigablast.com has been erased [1] https://www.reddit.com/r/searchengines/comments/128wdcp/gigablastcom_has_been_erased/ Closes: https://github.com/searxng/searxng/issues/2321 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-06 08:22:57 +02:00
Markus Heiser	a762172bf7	[fix] engine ddg: quote !bangs in a request send to ddg Closes: https://github.com/searxng/searxng/issues/392 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-03 09:52:16 +02:00
Markus Heiser	0430662189	[fix] engine google-News: fix decoding of URLs (part 2) Follow up of `8de8070ed` to fix the issue reported by AlyoshaVasilieva [1]. [1] https://github.com/searxng/searxng/issues/1959#issuecomment-1493300574 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-02 19:19:59 +02:00
Markus Heiser	8de8070ed9	[fix] engine google-News: fix decoding of URLs Google-News returns internal links where the origin URL is encoded in a base64 (RFC 2045 aka URL-safe) string. Closes: https://github.com/searxng/searxng/issues/1959 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-01 19:33:13 +02:00
Markus Heiser	509afbbb84	[fix] engine seznam: fix issues reported by black & pylint Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-31 17:25:39 +02:00
Venca24	c8d78355ff	[fix] engine seznam	2023-03-31 16:11:27 +02:00
Markus Heiser	270ad18897	[fix] engine flickr: adapt to the new data model from flicker's response Closes: https://github.com/searxng/searxng/issues/1879 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-30 21:04:53 +02:00
Markus Heiser	2b8dfab33f	[fix] engine gigablast: add &userid=<User ID>&code=<Feed Code> Gigablast's API does block unauthorized request[1]. [1] https://gigablast.com/searchfeed.html Closes: https://github.com/searxng/searxng/issues/1454 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-29 16:18:02 +02:00
Markus Heiser	6f9e678346	[fix] engine: google has changed the layout of its response Since 28. March google has changed its response, this patch fixes the google engine to scrap out the results & images from the new designed response. closes: https://github.com/searxng/searxng/issues/2287 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-28 14:39:16 +02:00
Markus Heiser	4d4aa13e1f	[mod] remove obsolete EngineTraits.supported_languages All engines has been migrated from ``supported_languages`` to the ``fetch_traits`` concept. There is no longer a need for the obsolete code that implements the ``supported_languages`` concept. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-24 10:37:42 +01:00
Markus Heiser	96a2eec3b5	[mod] Archlinux Wiki: improved request API & upgrade to data_type: traits_v1 re-implementation of the Archlinux Wiki: - fetch_traits(): fetch languages, wiki URLs and title arguments - add content field to the result list - add documentation Wikis from wiki.archlinux.fr, wiki.archlinux.ro, archtr.org/wiki do no longer exists (has been merged in the main wiki). Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-24 10:37:42 +01:00
Markus Heiser	057e9bc1d1	[mod] SepiaSearch: re-engineered & upgrade to data_type: traits_v1 - fetch_traits() SepiaSearch and Peertube are using identical languages. Replace module's dictionary `supported_languages` by `engine.traits.languages` (data_type: `traits_v1`). - fixed code to pass pylint - request(): add argument boostLanguages - response(): is replaced by peertube's video_response() function, which adds metadata from channel name, host & tags Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-24 10:37:42 +01:00
Markus Heiser	8a8c584fec	[mod] Dailymotion: improved request API & upgrade to data_type: traits_v1 - fetch_traits(): fetch locales (and languages) from dailymotion API - removed obsolete data-type "supported_languages" - add documentation - improved argument list of the HTTP request: - add argument: family_filter_map - add conditional argument: localization Don't add localization and country arguments if the user does select a language (:de, :en, ..) - improve code quality (mainly improve readability) Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-24 10:37:42 +01:00
Markus Heiser	2499899554	[mod] Google: reversed engineered & upgrade to data_type: traits_v1 Partial reverse engineering of the Google engines including a improved language and region handling based on the engine.traits_v1 data. When ever possible the implementations of the Google engines try to make use of the async REST APIs. The get_lang_info() has been generalized to a get_google_info() function / especially the region handling has been improved by adding the cr parameter. searx/data/engine_traits.json Add data type "traits_v1" generated by the fetch_traits() functions from: - Google (WEB), - Google images, - Google news, - Google scholar and - Google videos and remove data from obsolete data type "supported_languages". A traits.custom type that maps region codes to supported_domains is fetched from https://www.google.com/supported_domains searx/autocomplete.py: Reversed engineered autocomplete from Google WEB. Supports Google's languages and subdomains. The old API suggestqueries.google.com/complete has been replaced by the async REST API: https://{subdomain}/complete/search?{args} searx/engines/google.py Reverse engineering and extensive testing .. - fetch_traits(): Fetch languages & regions from Google properties. - always use the async REST API (formally known as 'use_mobile_ui') - use supported_domains from traits - improved the result list by fetching './/div[@data-content-feature]' and parsing the type of the various content features --> thumbnails are added searx/engines/google_images.py Reverse engineering and extensive testing .. - fetch_traits(): Fetch languages & regions from Google properties. - use supported_domains from traits - if exists, freshness_date is added to the result - issue 1864: result list has been improved a lot (due to the new cr parameter) searx/engines/google_news.py Reverse engineering and extensive testing .. - fetch_traits(): Fetch languages & regions from Google properties. supported_domains is not needed but a ceid list has been added. - different region handling compared to Google WEB - fixed for various languages & regions (due to the new ceid parameter) / avoid CONSENT page - Google News do no longer support time range - result list has been fixed: XPath of pub_date and pub_origin searx/engines/google_videos.py - fetch_traits(): Fetch languages & regions from Google properties. - use supported_domains from traits - add paging support - implement a async request ('asearch': 'arc' & 'async': 'use_ac:true,_fmt:html') - simplified code (thanks to '_fmt:html' request) - issue 1359: fixed xpath of video length data searx/engines/google_scholar.py - fetch_traits(): Fetch languages & regions from Google properties. - use supported_domains from traits - request(): include patents & citations - response(): fixed CAPTCHA detection (Scholar has its own CATCHA manager) - hardening XPath to iterate over results - fixed XPath of pub_type (has been change from gs_ct1 to gs_cgt2 class) - issue 1769 fixed: new request implementation is no longer incompatible Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-24 10:37:42 +01:00

1 2 3 4 5 ...

1692 commits