searxng

mirror of https://github.com/searxng/searxng synced 2024-01-01 19:24:07 +01:00

Author	SHA1	Message	Date
Alexandre Flament	28cc644f0a	[fix] duckduckgo_definitions: fix relative image URL ddg returns relative URL to https://duckduckgo.com/	2020-12-06 10:14:09 +01:00
Alexandre Flament	cdceec1cbb	Merge pull request #2354 from dalf/fix-wikipedia [fix] wikipedia engine: don't raise an error when the query is not found	2020-12-04 20:42:45 +01:00
Alexandre Flament	f0054d67f1	[fix] wikipedia engine: don't raise an error when the query is not found Add a new parameter "raise_for_status", set by default to True. When True, any HTTP status code >= 300 raises an exception ( #2332 ) When False, the engine can manage the HTTP status code by itself.	2020-12-04 20:04:39 +01:00
Alexandre Flament	bef2f2efa8	[fix] wikidata: fix crash when the item has no description at all and at least one URL.	2020-12-04 17:17:20 +01:00
Alexandre Flament	244e812f37	[fix] remove searx/engines/filecrop.py (dead code)	2020-12-04 16:48:15 +01:00
Alexandre Flament	fa909c7c02	[mod] stackoverflow & yandex: detect CAPTCHA response	2020-12-03 13:23:19 +01:00
Alexandre Flament	64cccae99e	[mod] various engines: use eval_xpath* functions and searx.exceptions.* Engine list: ahmia, duckduckgo_images, elasticsearch, google, google_images, google_videos, youtube_api	2020-12-03 10:22:48 +01:00
Alexandre Flament	ad72803ed9	[mod] xpath, 1337x, acgsou, apkmirror, archlinux, arxiv: use eval_xpath_* functions	2020-12-03 10:22:48 +01:00
Alexandre Flament	de887c6347	[mod] bing_news: use eval_xpath_getindex remove unused function searx.utils.list_get	2020-12-03 10:22:48 +01:00
Alexandre Flament	1d0c368746	[enh] record details exception per engine add a new API /stats/errors	2020-12-03 10:22:48 +01:00
Markus Heiser	bef185723a	[refactor] digg - improve results and clean up source code - strip html tags and superfluous quotation marks from content - remove not needed cookie from request - remove superfluous imports Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2020-12-02 21:54:27 +01:00
Markus Heiser	6b0a896f01	[mod] digg - pylint searx/engines/digg.py Eliminate redundant file names which are tested by test.pylint and ignored by test.pep8 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2020-12-02 20:59:30 +01:00
Markus Heiser	173b744ef0	[fix] digg - the ISO time stamp of published date has been changed Error pattern:: Engines cannot retrieve results: digg (unexpected crash time data '2020-10-16T14:09:55Z' does not match format '%Y-%m-%d %H:%M:%S') Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2020-12-02 20:40:12 +01:00
Alexandre Flament	b00d108673	[mod] pylint: numerous minor code fixes	2020-12-01 15:21:19 +01:00
Alexandre Flament	9ed3ee2beb	[mod] wikidata: WDGeoAttribute class: doesn't change the method signature of get_str	2020-12-01 15:21:17 +01:00
Alexandre Flament	3cfef61123	[fix] /stats: report error percentage instead of error count This bug exists since the PR https://github.com/searx/searx/pull/751	2020-12-01 15:07:09 +01:00
Noémi Ványi	4a36a3044d	Add recoll engine (#2325 ) recoll is a local search engine based on Xapian: http://www.lesbonscomptes.com/recoll/ By itself recoll does not offer web or API access, this can be achieved using recoll-webui: https://framagit.org/medoc92/recollwebui.git This engine uses a custom 'files' result template set `base_url` to the location where recoll-webui can be reached set `dl_prefix` to a location where the file hierarchy as indexed by recoll can be reached set `search_dir` to the part of the indexed file hierarchy to be searched, use an empty string to search the entire search domain	2020-11-30 08:35:15 +01:00
M. Efe Çetin	d1f527c3af	Photon API Link Update Via https://photon.komoot.io/	2020-11-27 10:22:28 +03:00
Alexandre Flament	3786920df9	[enh] Add multiple outgoing proxies credits go to @bauruine see https://github.com/searx/searx/pull/1958	2020-11-20 15:29:21 +01:00
Markus Heiser	c71d214b0c	[refactor] deviantart - improve results and clean up source code Devian's request and response forms have been changed. - fixed title - fixed time_range_dict to 'popular--**' - use image from <noscript> if exists - drop obsolete "http to https, remove domain sharding" - use query URL https://www.deviantart.com/search/deviations?page=5&q=foo - add searx/engines/deviantart.py to pylint check (test.pylint) Error pattern:: There DEBUG:searx:result: invalid title: {'url': 'https://www.deviantart.com/ ... Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2020-11-14 17:09:56 +01:00
Alexandre Flament	3038052c79	[mod] remove unused import use from searx.engines.duckduckgo import _fetch_supported_languages, supported_languages_url # NOQA so it is possible to easily remove all unused import using autoflake: autoflake --in-place --recursive --remove-all-unused-imports searx tests	2020-11-14 14:11:02 +01:00
Alexandre Flament	c3d9b17c2a	Merge pull request #2292 from kvch/elasticsearch-engine New engine: Elasticsearch	2020-11-14 13:25:08 +01:00
Alexandre Flament	102c08838b	Merge pull request #2289 from dalf/pylint [mod] pylint: add extension-pkg-whitelist=lxml.etree	2020-11-14 13:24:31 +01:00
Noémi Ványi	43e697681e	New engine: Elasticsearch	2020-11-10 19:53:38 +01:00
Alexandre Flament	58d72f2692	[mod] pylint: minor code change to allow pylint globally This commit is only a step, it doesn't fix all the issues reported by pylint	2020-11-03 11:35:53 +01:00
Alexandre Flament	eed43783f9	[fix] command engine: fix import	2020-11-03 10:55:08 +01:00
Alexandre Flament	a08df82574	[fix] scanr_structure engine: fix import	2020-11-03 10:54:02 +01:00
Alexandre Flament	95bd6033fa	[mod] wikidata engine: use one SPARQL request instead of 2 HTTP requests.	2020-10-28 08:09:25 +01:00
Alexandre Flament	ca593728af	[mod] duckduckgo_definitions: display only user friendly attributes / URL various bug fixes	2020-10-28 08:09:25 +01:00
a01200356	c3daa08537	[enh] Add onions category with Ahmia, Not Evil and Torch Xpath engine and results template changed to account for the fact that archive.org doesn't cache .onions, though some onion engines might have their own cache. Disabled by default. Can be enabled by setting the SOCKS proxies to wherever Tor is listening and setting using_tor_proxy as True. Requires Tor and updating packages. To avoid manually adding the timeout on each engine, you can set extra_proxy_timeout to account for Tor's (or whatever proxy used) extra time.	2020-10-25 17:59:05 -07:00
Nicholas Kegler	8e15d3e4c1	Open Semantic Search Engine	2020-10-25 17:50:00 +01:00
Noémi Ványi	e158eeee4b	Propagate error messages from YouTube API	2020-10-09 17:34:26 +02:00
Adam Tauber	835d16cbb1	Merge pull request #2255 from kvch/yacy-improvements Add yacy improvements: HTTP digest auth, category checking	2020-10-09 16:34:42 +02:00
Alexandre Flament	cfd21bc475	[fix] fix duckduckgo engine - remove paging support: a "vqd" parameter is required between each request. This parameter is unique for each request - update the URL (no redirect), use the POST method - language support: works if there is no more than one request per minute, otherwise it is ignored!	2020-10-09 16:00:42 +02:00
Noémi Ványi	72c7fd25fe	Add yacy improvements: HTTP digest auth, category checking	2020-10-09 15:06:05 +02:00
Noémi Ványi	f0278d41fc	add ebay engine to shopping category	2020-10-08 13:20:55 +02:00
Alexandre Flament	a9dc54bebc	[mod] Add searx.data module Instead of loading the data/*.json in different location, load these files in the new searx.data module.	2020-10-07 10:29:34 +02:00
Alexandre Flament	8659212f5a	[fix] drop Python 2: use collections.abc.Iterable instead of collections.Iterable	2020-10-06 09:43:24 +02:00
Alexandre Flament	b728cb610b	Merge pull request #2241 from dalf/move-extract-text-and-url Move the extract_text and extract_url functions to searx.utils	2020-10-04 09:06:20 +02:00
Finn	53c8d945b4	[enh] Add SepiaSearch engine (#2227 ) supported_languages values: see https://framagit.org/framasoft/peertube/search-index/-/blob/master/client/src/views/Search.vue#L618-641	2020-10-03 13:00:10 +02:00
Alexandre Flament	2006eb4680	[mod] move extract_text, extract_url to searx.utils	2020-10-02 18:13:56 +02:00
Markus Heiser	8162d7aff4	[fix] google engine - div classes have been renamed in HTML result Since 1. October 2020 google has changed the 'class' attribute of the HTML result page. Fix the xpath expressions and ignore <div class="g" ../> sections which do not match to title's xpath expression. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2020-10-01 09:44:29 +02:00
Alexandre Flament	f204e4903d	[fix] migration from github.com/asciimoo/searx to github.com/searx/searx : fix URLs	2020-09-28 16:44:14 +02:00
Marc Abonce Seguin	ecf5899153	fetch google's search langs rather than ui langs	2020-09-22 11:37:44 +02:00
Marc Abonce Seguin	41800835f9	fetch supported languages for startpage engine	2020-09-22 11:37:44 +02:00
Marc Abonce Seguin	ea9d979cc3	add language names in qwant's fetch languages function	2020-09-22 11:37:44 +02:00
Dalf	c225db45c8	Drop Python 2 (4/n): SearchQuery.query is a str instead of bytes	2020-09-10 10:49:42 +02:00
Dalf	1022228d95	Drop Python 2 (1/n): remove unicode string and url_utils	2020-09-10 10:39:04 +02:00
Marc Abonce Seguin	ab20ca182c	use Wikipedia's REST v1 API	2020-09-10 09:54:30 +02:00
Noémi Ványi	f0ca1c3483	[enh] Add command line engines: git grep, find, etc. (#2128 ) A new "base" engine called command is introduced. It is the foundation for all command line engines for now. You can use this engine to create your own command line engine. Add some engines (commented out to make sure no one enables anything accidentally): * git grep: This engine lets you grep in the searx repo. * locate: If locate is installed and initialized, you can search on the FS. * find: You can find files with a specific name from where you started searx. * pattern search in files: This engine utilizes the command fgrep. * regex search in files: This engine runs `grep` to find a file based on its contents.	2020-09-08 09:51:53 +02:00
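
Commit 173b744ef0 above reports that digg's published date no longer matches the format '%Y-%m-%d %H:%M:%S' and now arrives as an ISO time stamp such as '2020-10-16T14:09:55Z'. The following is a minimal sketch of parsing both forms; it is not the code from searx/engines/digg.py, and the names parse_published, ISO_FORMAT and OLD_FORMAT are hypothetical, chosen only for illustration.

```python
from datetime import datetime

# Hypothetical constants for illustration; only the old format string and the
# sample ISO value are taken from the commit message.
ISO_FORMAT = "%Y-%m-%dT%H:%M:%SZ"   # e.g. 2020-10-16T14:09:55Z
OLD_FORMAT = "%Y-%m-%d %H:%M:%S"    # format named in the error pattern


def parse_published(value):
    """Try the ISO format first, then fall back to the older format."""
    for fmt in (ISO_FORMAT, OLD_FORMAT):
        try:
            return datetime.strptime(value, fmt)
        except ValueError:
            continue
    raise ValueError("unsupported time stamp: %r" % value)


if __name__ == "__main__":
    print(parse_published("2020-10-16T14:09:55Z"))
    print(parse_published("2020-10-16 14:09:55"))
```

Trying the formats in order keeps the parser tolerant if the old form still appears; whether the actual fix kept such a fallback is not stated in the commit message.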


1094 commits