searxngRebrandZaclys

Author	SHA1	Message	Date
Alexandre Flament	ff84a1af35	[mod] json_engine: add content_html_to_text and title_html_to_text Some JSON APIs return HTML in either the title or the content. This commit adds two new parameters to the json_engine: content_html_to_text and title_html_to_text, False by default. If True, then searx.utils.html_to_text removes the HTML tags. Update crossref, openairedatasets and openairepublications engines	2021-02-10 16:42:11 +01:00
Alexandre Flament	436d366448	Merge pull request #2544 from mrwormo/congresslibrary [Engine] Add Library of Congress engine	2021-02-10 10:13:46 +01:00
Alexandre Flament	eafd27f42a	Merge pull request #2556 from dalf/fix-apk-mirror [fix] fix apk_mirror engine	2021-02-10 10:12:37 +01:00
Alexandre Flament	d2dac11392	[mod] duckduckgo engine: better support of the language preference After the main request, send a second to https://duckduckgo.com/t/sl_h See https://github.com/searx/searx/issues/2259	2021-02-09 14:36:43 +01:00
Alexandre Flament	74d56f6cfb	[mod] poolrequests: for one (user request, engine) always use the same HTTPAdapter The duckduckgo engine requires an additional request after the results have been sent. This commit makes sure that the second request uses the same HTTPAdapter = the same IP address, and the same proxy.	2021-02-09 14:33:36 +01:00
Markus Heiser	bc1be3f0e9	[enh] add engine MediathekViewWeb (API) Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-02-09 13:08:01 +01:00
mrwormo	051da88328	Add Library of Congress engine	2021-02-09 12:45:39 +01:00
Alexandre Flament	9211cdfe9b	[upd] remove google_play_music engine Google Play Music has been replaced by Youtube music.	2021-02-09 11:38:50 +01:00
Alexandre Flament	aedf03c0f7	Fix: activate raise_for_error by default Fix commit `d703119d3a` : Some engines need to parse the HTTP error but raise_for_error is always set to False in the "request" function.	2021-02-09 11:27:41 +01:00
Alexandre Flament	5e055b069b	[fix] fix apk_mirror engine	2021-02-09 11:02:12 +01:00
Alexandre Flament	e4cc7f13a3	Merge pull request #2542 from kvch/fix-naver-engine Fix XPATHs in Naver engine	2021-02-09 08:52:38 +01:00
Alexandre Flament	bec9e30fe7	Merge pull request #2554 from MarcAbonce/zh-variants-in-wikipedia Add support for Chinese variants in Wikipedia	2021-02-09 08:49:59 +01:00
Daniel Hones	138f32471c	Updated webutils.highlight_content to ignore double-quotes when highlighting query parts	2021-02-08 23:58:54 -05:00
Marc Abonce Seguin	64e81794fe	add support for Chinese variants in Wikipedia	2021-02-08 21:56:45 -07:00
Noémi Ványi	ac309f5b8d	Fix naver engine Closes #2540	2021-02-07 18:58:13 +01:00
Markus Heiser	41c03cf011	[drop] metager - xpath engine won't work anymore The new version of MetaGer needs to reload the results (into an iframe) with a unique tag (see HTML response below). Implementing a dedicated metager-engine for searx makes no sense to me. The great days of MetaGer seem to have ended. I remember the good old days when this project started in the 90's of the last century. But in the last few years it has become more and more crap. As the name suggests, MetaGer was made for Germans in the first place. They have added English and Spanish translations but the i18n is very poor compared to what searx offers. It's a pity, let's drop MetaGer. This is the first response, the id (b82679980656899ba5a17ffd02a56846) is unique for each query: $ curl "https://metager.org/meta/meta.ger3?eingabe=foo&submit-query=&focus=web" <!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8"> <link rel="stylesheet" href="/index.css?id=b82679980656899ba5a17ffd02a56846"> <script src="/index.js?id=b82679980656899ba5a17ffd02a56846"></script> <title>foo - MetaGer</title> <meta name="viewport" content="width=device-width, initial-scale=1, maximum-scale=1" /> </head> <body> <iframe id="mg-framed" src="https://metager.org/meta/meta.ger3?eingabe=foo&submit-query=&focus=web&mgv=b82679980656899ba5a17ffd02a56846" autofocus="true" onload="this.contentWindow.focus();"></iframe> </body> Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-02-07 14:55:21 +01:00
Hermógenes Oliveira	514faa9162	[feat] recoll: paged json support	2021-02-07 10:05:35 -03:00
Marc Abonce Seguin	c937a9e85f	[fix] get correct locale with country from browser Some of our interface locales include uppercase country codes, which are separated by `_` instead of the more common `-`. Also, a browser's `Accept-Language` header could be in lowercase. This commit attempts to normalize those cases so a browser's language+country codes can better match with our locales. This solution assumes that our UI locales have nothing more than language and optionally country. If we ever add a script-specific locale like `zh-Hant-TW` this would have to change to accommodate that, but the idea would be pretty much the same as this fix.	2021-02-04 19:53:59 -07:00
mrwormo	c4c1636b18	Add Creative Commons search engine	2021-02-04 11:31:35 +01:00
Alexandre Flament	ca93a01844	[mod] dynamically set language_support variable The language_support variable is set to True by default, and set to False in only 5 engines. Except for the documentation and the /config URL, this variable is not used. This commit removes the variable definition in the engines, and sets the value according to the supported_languages length: False when the length is 0, True otherwise. Close #2485	2021-02-01 17:10:37 +01:00
Markus Heiser	7f505bdc6f	[fix] google: avoid unnecessary SearxEngineXPathException errors Avoid SearxEngineXPathException errors when parsing non valid results:: .//div[@class="yuRUbf"]//a/@href index 0 not found Traceback (most recent call last): File "./searx/engines/google.py", line 274, in response url = eval_xpath_getindex(result, href_xpath, 0) File "./searx/searx/utils.py", line 608, in eval_xpath_getindex raise SearxEngineXPathException(xpath_spec, 'index ' + str(index) + ' not found') searx.exceptions.SearxEngineXPathException: .//div[@class="yuRUbf"]//a/@href index 0 not found Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-28 10:08:50 +01:00
Markus Heiser	e436287385	[mod] checker: add some additional tests BTW: fix indentation by 2 spaces The additional tests have been commented out in the google engines to avoid triggering any CAPTCHA issues. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-28 10:08:50 +01:00
Markus Heiser	b1fefec40d	[fix] normalize the language & region aspects of all google engines BTW: make the engines ready for search.checker: - replace eval_xpath by eval_xpath_getindex and eval_xpath_list - google_images: remove outer try/except block Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-28 10:08:46 +01:00
Markus Heiser	ff6804e545	[data] make engines.languages Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-24 09:52:32 +01:00
Markus Heiser	8cdad5d85d	[fix] google-videos: parse values for 'length' & 'author' The 'video.html' template from the 'oscar' design supports replacement for author and length. Google-videos does not have an author, alternatively the publisher info is used for the author. Hint: these replacements are not supported by the 'simple' design. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-24 09:51:24 +01:00
Markus Heiser	89b3050b5c	[fix] revision of the google-Video engine This revision is based on the methods developed in the revision of the google engine (see commit `410c2f9`). Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-24 09:39:30 +01:00
Alexandre Flament	8c46b767d0	[fix] google_news: avoid one HTTP redirect except for the English results also add params['soft_max_redirects'] = 1 to avoid false error reporting in /stats/errors	2021-01-24 08:53:35 +01:00
Markus Heiser	5f92dfcdbe	[fix] google-news: query uses locale without country tag Without country-region tag google will redirect to correct the country tag [1]: SEARX_DEBUG=1 searx-checker -v "google news" ... https://news.google.com:443 "GET /search?q=computer&hl=en... HTTP/1.1" 302 0 https://news.google.com:443 "GET /search?q=computer&hl=en-US&.... HTTP/1.1" 200 None ... [1] https://github.com/searx/searx/pull/2483#issuecomment-765600849 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-23 11:37:14 +01:00
Markus Heiser	baec54c492	[fix] revision of the google-news engine This revision is based on the methods developed in the revision of the google engine (see commit `410c2f9`). Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-22 18:49:45 +01:00
Alexandre Flament	73c86f9bf2	[mod] checker: disable by default	2021-01-19 21:44:48 +01:00
Alexandre Flament	3b7b852aa8	[fix] checker: minor fix about language detection	2021-01-19 21:29:31 +01:00
Alexandre Flament	aa887eb375	[mod] checker: replace pycld3 by langdetect pycld3 requires the native library cld3; langdetect is a pure Python package	2021-01-19 21:26:04 +01:00
Alexandre Flament	67a1aab0d5	[fix] /stats/checker : remove the timestamp field when the checker is disabled	2021-01-18 08:19:53 +01:00
Alexandre Flament	d473407ec9	[fix] checker: fix engine statistics Without this commit, the URL /stats/errors shows percentage above 100% after the checker has run.	2021-01-18 08:19:44 +01:00
Alexandre Flament	ca76f3119a	[fix] error_recorder: record code and lineno about the engine Since PR #2225, code and lineno were sometimes meaningless; see /stats/errors	2021-01-17 16:25:11 +01:00
Alexandre Flament	80d7411f2c	Merge pull request #2452 from kvch/add-wilby-engine Add wiby.me engine	2021-01-16 22:36:31 +01:00
Alexandre Flament	b405646749	Merge pull request #2451 from mrwormo/invidious-engine [Fix] Invidious Engine	2021-01-16 19:25:45 +01:00
Alexandre Flament	a4dcfa025c	[enh] engines: add about variable move meta information from comment to the about variable so the preferences and the documentation can show this information	2021-01-14 20:57:17 +01:00
mrwormo	2dff3887f0	[fix] Invidious engine by enabling requests by randomly picking amongst working instances	2021-01-14 12:12:56 +01:00
Alexandre Flament	912c7e975c	[fix] checker: don't run the checker when uwsgi is not properly configured Before this commit, even with the scheduler disabled, the checker was running at least once for each uwsgi worker.	2021-01-13 14:07:39 +01:00
Alexandre Flament	7f0c508598	[fix] checker: fix typo unknown instead of unknow	2021-01-12 11:47:17 +01:00
Alexandre Flament	a0c8b413a6	[mod] searx.shared: minor tweaks searx.shared.shared_abstract.SharedDict inherits from abc.ABC; searx.shared.shared_uwsgi.schedule can schedule multiple functions without issue	2021-01-12 11:47:17 +01:00
Alexandre Flament	87bafbc32b	[mod] checker: add status and timestamp to the result for each engine: replace status by success	2021-01-12 11:47:17 +01:00
Alexandre Flament	f3e1bd308f	[mod] checker: minor adjustments on the default tests the query "time" is convenient because most search engines will return some results, but some engines in the general category will return documentation about the HTML tags <time> or <input type="time">	2021-01-12 11:47:17 +01:00
Alexandre Flament	45bfab77d0	[mod] checker: improve searx-checker command line * output is unbuffered * verbose mode describes the errors more precisely	2021-01-12 11:47:17 +01:00
Alexandre Flament	3a9f513521	[enh] checker: background check See settings.yml for the options SIGUSR1 signal starts the checker. The result is available at /stats/checker	2021-01-12 11:47:17 +01:00
Alexandre Flament	6e2872f436	[enh] add searx.shared shared dictionary between the workers (UWSGI or werkzeug) scheduler: run a task once every x seconds (UWSGI or werkzeug)	2021-01-12 11:47:17 +01:00
Markus Heiser	9c581466e1	[fix] do not colorize output on dumb terminals Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2021-01-12 11:47:17 +01:00
Alexandre Flament	ca0889d488	[enh] checker: wikidata & ddd: add specific tests	2021-01-12 11:47:17 +01:00
Alexandre Flament	16a889dd8f	[enh] checker: add rosebud test	2021-01-12 11:47:17 +01:00
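
The locale normalization described in commit c937a9e85f can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the code from that commit: the function names `normalize_locale_tag` and `match_ui_locale` are hypothetical, and the fallback to a language-only locale is an assumption added for the example. As the commit message notes, the approach only covers language plus optional country; a script-specific tag such as `zh-Hant-TW` would need different handling.

```python
def normalize_locale_tag(tag):
    """Hypothetical helper: turn a browser tag such as 'pt-br' into 'pt_BR'.

    Assumes the tag holds only a language and an optional country;
    script subtags (e.g. 'zh-Hant-TW') are NOT handled correctly.
    """
    # Accept both separators, then rebuild with '_' as the UI locales do.
    parts = tag.strip().replace('_', '-').split('-')
    language = parts[0].lower()
    if len(parts) >= 2 and parts[1]:
        return language + '_' + parts[1].upper()
    return language


def match_ui_locale(browser_tag, ui_locales):
    """Hypothetical helper: pick a UI locale for a browser language tag.

    Tries the normalized language+country first, then (assumption made
    for this sketch) falls back to the bare language code.
    """
    candidate = normalize_locale_tag(browser_tag)
    if candidate in ui_locales:
        return candidate
    language = candidate.split('_')[0]
    if language in ui_locales:
        return language
    return None


# Example set of UI locales, invented for illustration.
UI_LOCALES = {'en', 'pt_BR', 'zh_TW'}
print(match_ui_locale('pt-br', UI_LOCALES))
print(match_ui_locale('en-gb', UI_LOCALES))
```

Keeping normalization separate from matching makes it easier to swap in a different rule later, for example if script-specific locales are ever added.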


2766 commits