searxng

mirror of https://github.com/searxng/searxng synced 2024-01-01 19:24:07 +01:00

Author	SHA1	Message	Date
Marc Abonce Seguin	d6681fd33b	remove articles number from engines_languages.json	2021-02-25 23:54:21 -07:00
Marc Abonce Seguin	9b6ffed061	fix fetch_languages for bing Bing has a list of regions that it supports and some of these regions may have more than one possible language. In some cases, like Switzerland, these languages are always shown as options, so there is no issue. But in other cases, like Andorra, Bing will only show one language at the time, either the region's default or the request's language if the latter is supported by that region. For example, if the HTTP request is in French, Andorra will appear as fr-AD but if the same page is requested in any other language Andorra will appear as ca-AD. This is specially a problem when Bing assumes that the request is in English because it overrides enough language codes to make several major languages like Arabic dissappear from the languages.py file. To avoid that issue, I set the Accept-Language header to a language that's only supported in one region to hopefully avoid these overrides.	2021-02-25 23:51:49 -07:00
Alexandre Flament	7c1847d5f2	[mod] add utils/fetch_external_bangs.py Based on duckduckgo bangs Store bangs on a trie to allow autocomplete (not in this commit)	2021-02-24 18:48:36 +01:00
Alexandre Flament	5f4a085fc4	Merge pull request #2595 from dalf/update-wikidata-units [mod] update wikidata_units.json and fetch_wikidata_units.py	2021-02-23 17:22:37 +01:00
Alexandre Flament	46ca32c3cc	[mod] update currencies.json and fetch_currencies.py use a sparql request on wikidata to get the list of currencies. currencies.json contains the translation for all supported searx languages. Supersede #993	2021-02-23 16:42:28 +01:00
Alexandre Flament	93d1da4906	[mod] update wikidata_units.json and fetch_wikidata_units.py The fetch_wikidata_units.py result won't change randomly. See comments in the script.	2021-02-23 13:10:38 +01:00
Noémi Ványi	1be6ab2a91	Fix paging of Bing Images	2021-02-22 21:19:34 +01:00
datagram1	1d0a32a2c5	Added rumble.com video search engine. TODO video embedding. Update rumble.py some lines too long. Disable Rumble engine disabled : True PEP8 fix change line spacing	2021-02-20 12:48:56 +00:00
Alexandre Flament	44a6593c13	Merge pull request #2573 from unixfox/yggtorrent update yggtorrent url + add it back	2021-02-16 08:22:07 +01:00
Emilien Devos	4b37e10dd9	fix yggtorrent url + add it back	2021-02-15 13:38:34 +01:00
Thorben Günther	fbbd4cc21f	Improve peertube searching At the moment videos without a description are not shown - setting default content to "" fixes this. Another current bug is that thumbnails are not displayed. This is caused by a double slash in the url. For this every trailing slash is now stripped (for backwards compatibility) and the API response is correctly parsed.	2021-02-13 19:47:33 +01:00
Alexandre Flament	45027765e3	Merge pull request #2566 from dalf/remove-yandex [remove] yandex engine	2021-02-12 17:12:07 +01:00
Alexandre Flament	c22d4c764c	[fix] duckduckgo engine: "!ddg !g" do not redirect to google * searx understand "!ddg !g time" as : send "!g time" to DDG * !g a DDG bang for Google: DDG return a HTTP redirect to Google This commit adds a the allows_redirect param not to follow HTTP redirect. The DDG engine returns a empty result as before without HTTP redirect.	2021-02-12 11:10:08 +01:00
Alexandre Flament	d76660463b	Merge pull request #2562 from dalf/mod-json-engine [mod] json_engine: add content_html_to_text and title_html_to_text	2021-02-12 10:58:28 +01:00
Alexandre Flament	7dcf67a47a	Merge pull request #2565 from dalf/upd-wikipedia [upd] wikipedia engine: return an empty result on query with illegal characters	2021-02-12 10:57:05 +01:00
Alexandre Flament	2b60d0d243	Merge pull request #2564 from dalf/fix-seznam [fix] fix seznam engine	2021-02-12 10:56:53 +01:00
Alexandre Flament	7e83818879	Merge pull request #2560 from dalf/fix-duckduckgo Fix duckduckgo	2021-02-12 10:56:40 +01:00
Alexandre Flament	63d6ccfbc2	Merge pull request #2557 from dalf/fix-raise_for_httperror Fix: activate raise_for_error by default	2021-02-12 10:56:25 +01:00
Alexandre Flament	74c8b5606f	Merge pull request #2541 from return42/mediathekviewweb [enh] add engine MediathekViewWeb (API)	2021-02-11 15:11:26 +01:00
Alexandre Flament	5d9db6c2f7	[remove] yandex engine	2021-02-11 14:28:06 +01:00
Alexandre Flament	35dd069402	[fix] fix seznam engine no paging support	2021-02-11 12:53:19 +01:00
Alexandre Flament	7d6e69e2f9	[upd] wikipedia engine: return an empty result on query with illegal characters on some queries (like an IT error message), wikipedia returns an HTTP error 400. this commit returns an empty result instead of showing an error to the user.	2021-02-11 12:29:21 +01:00
Alexandre Flament	ff84a1af35	[mod] json_engine: add content_html_to_text and title_html_to_text Some JSON API returns HTML in either in the HTML or the content. This commit adds two new parameters to the json_engine: content_html_to_text and title_html_to_text, False by default. If True, then the searx.utils.html_to_text removes the HTML tags. Update crossref, openairedatasets and openairepublications engines	2021-02-10 16:42:11 +01:00
Alexandre Flament	436d366448	Merge pull request #2544 from mrwormo/congresslibrary [Engine] Add Library of Congress engine	2021-02-10 10:13:46 +01:00
Alexandre Flament	eafd27f42a	Merge pull request #2556 from dalf/fix-apk-mirror [fix] fix apk_mirror engine	2021-02-10 10:12:37 +01:00
Alexandre Flament	d2dac11392	[mod] duckduckgo engine: better support of the language preference After the main request, send a second to https://duckduckgo.com/t/sl_h See https://github.com/searx/searx/issues/2259	2021-02-09 14:36:43 +01:00
Alexandre Flament	74d56f6cfb	[mod] poolrequests: for one (user request, engine) always use the same HTTPAdapter The duckduckgo engine requires an additional request after the results have been sent. This commit makes sure that the second request uses the same HTTPAdapter = the same IP address, and the same proxy.	2021-02-09 14:33:36 +01:00
Markus Heiser	bc1be3f0e9	[enh] add engine MediathekViewWeb (API) Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-02-09 13:08:01 +01:00
mrwormo	051da88328	Add Library of Congress engine	2021-02-09 12:45:39 +01:00
Alexandre Flament	9211cdfe9b	[upd] remove google_play_music engine Google Play Music has been replaced by Youtube music.	2021-02-09 11:38:50 +01:00
Alexandre Flament	aedf03c0f7	Fix: activate raise_for_error by default Fix commit `d703119d3a` : Some engines need to parse the HTTP error but raise_for_error is always set to False in the "request" function.	2021-02-09 11:27:41 +01:00
Alexandre Flament	5e055b069b	[fix) fix apk_mirror engine	2021-02-09 11:02:12 +01:00
Alexandre Flament	e4cc7f13a3	Merge pull request #2542 from kvch/fix-naver-engine Fix XPATHs in Naver engine	2021-02-09 08:52:38 +01:00
Alexandre Flament	bec9e30fe7	Merge pull request #2554 from MarcAbonce/zh-variants-in-wikipedia Add support for Chinese variants in Wikipedia	2021-02-09 08:49:59 +01:00
Daniel Hones	138f32471c	Updated webutils.highlight_content to ignore double-quotes when highlighting query parts	2021-02-08 23:58:54 -05:00
Marc Abonce Seguin	64e81794fe	add support for Chinese variants in Wikipedia	2021-02-08 21:56:45 -07:00
Noémi Ványi	ac309f5b8d	Fix naver engine Closes #2540	2021-02-07 18:58:13 +01:00
Markus Heiser	41c03cf011	[drop] metager - xpath engine won't work anymore The new version of MetaGer needs to reload the reults (into a iframe) with a unique tag (see HTML response below). Implementing a dedicated metager-engine for searx makes no sense to me. The great days of MetaGer seems to be ended. I remember the good old days this project started in the 90's of the last century. But in the last few years it becomes more and more crap. As the name suggested, MetaGer was made for germans in the first place. They have added a english and spain translation but the i18n is very poor compared to what searx offers. It's a pity, lets drop MetaGer. This is the first response, the id (b82679980656899ba5a17ffd02a56846) is unique for each query: $ curl "https://metager.org/meta/meta.ger3?eingabe=foo&submit-query=&focus=web" <!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8"> <link rel="stylesheet" href="/index.css?id=b82679980656899ba5a17ffd02a56846"> <script src="/index.js?id=b82679980656899ba5a17ffd02a56846"></script> <title>foo - MetaGer</title> <meta name="viewport" content="width=device-width, initial-scale=1, maximum-scale=1" /> </head> <body> <iframe id="mg-framed" src="https://metager.org/meta/meta.ger3?eingabe=foo&submit-query=&focus=web&mgv=b82679980656899ba5a17ffd02a56846" autofocus="true" onload="this.contentWindow.focus();"></iframe> </body> Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-02-07 14:55:21 +01:00
Hermógenes Oliveira	514faa9162	[feat] recoll: paged json support	2021-02-07 10:05:35 -03:00
Marc Abonce Seguin	c937a9e85f	[fix] get correct locale with country from browser Some of our interface locales include uppercase country codes, which are separated by `_` instead of the more common `-`. Also, a browser's `Accept-Language` header could be in lowercase. This commit attempts to normalize those cases so a browser's language+country codes can better match with our locales. This solution assumes that our UI locales have nothing more than language and optionally country. If we ever add a script specific locale like `zh-Hant-TW` this would have to change to accomodate that, but the idea would be pretty much the same as this fix.	2021-02-04 19:53:59 -07:00
mrwormo	c4c1636b18	Add Creative Commons search engine	2021-02-04 11:31:35 +01:00
Alexandre Flament	ca93a01844	[mod] dynamically set language_support variable The language_support variable is set to True by default, and set to False in only 5 engines. Except the documentation and the /config URL, this variable is not used. This commit remove the variable definition in the engines, and set value according to supported_languages length: False when the length is 0, True otherwise. Close #2485	2021-02-01 17:10:37 +01:00
Markus Heiser	7f505bdc6f	[fix] google: avoid unnecessary SearxEngineXPathException errors Avoid SearxEngineXPathException errors when parsing non valid results:: .//div[@class="yuRUbf"]//a/@href index 0 not found Traceback (most recent call last): File "./searx/engines/google.py", line 274, in response url = eval_xpath_getindex(result, href_xpath, 0) File "./searx/searx/utils.py", line 608, in eval_xpath_getindex raise SearxEngineXPathException(xpath_spec, 'index ' + str(index) + ' not found') searx.exceptions.SearxEngineXPathException: .//div[@class="yuRUbf"]//a/@href index 0 not found Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-28 10:08:50 +01:00
Markus Heiser	e436287385	[mod] checker: add some additional tests BTW: fix indentation by 2 spaces The additional tests has been commented out in the google engines to not release any CAPTCHA issues. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-28 10:08:50 +01:00
Markus Heiser	b1fefec40d	[fix] normalize the language & region aspects of all google engines BTW: make the engines ready for search.checker: - replace eval_xpath by eval_xpath_getindex and eval_xpath_list - google_images: remove outer try/except block Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-28 10:08:46 +01:00
Markus Heiser	ff6804e545	[data] make engines.languages Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-24 09:52:32 +01:00
Markus Heiser	8cdad5d85d	[fix] google-videos: parse values for 'length' & 'author' The 'video.html' template from the 'oscar' design supports replacement for author and length. Google-videos does not have an author, alternatively the publisher info from is used for the author. Hint: these replacements are not supported by the 'simple' design. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-24 09:51:24 +01:00
Markus Heiser	89b3050b5c	[fix] revise of the google-Video engine This revise is based on the methods developed in the revise of the google engine (see commit `410c2f9`). Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-24 09:39:30 +01:00
Alexandre Flament	8c46b767d0	[fix] google_news: avoid one HTTP redirect except for the English results also add params['soft_max_redirects'] = 1 to avoid false error reporting in /stats/errors	2021-01-24 08:53:35 +01:00
Markus Heiser	5f92dfcdbe	[fix] google-news: query uses locale without country tag Wthout country-region tag google will redirect to correct the contry tag [1]: SEARX_DEBUG=1 searx-checker -v "google news" ... https://news.google.com:443 "GET /search?q=computer&hl=en... HTTP/1.1" 302 0 https://news.google.com:443 "GET /search?q=computer&hl=en-US&.... HTTP/1.1" 200 None ... [1] https://github.com/searx/searx/pull/2483#issuecomment-765600849 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-23 11:37:14 +01:00

... 4 5 6 7 8 ...

2888 commits