searxng/searx/engines/www1x.py

# SPDX-License-Identifier: AGPL-3.0-or-later
# lint: pylint
"""1x (Images)

"""

from urllib.parse import urlencode, urljoin
from lxml import html, etree

from searx.utils import extract_text, eval_xpath_list, eval_xpath_getindex

# about
about = {
    "website": 'https://1x.com/',
    "wikidata_id": None,
    "official_api_documentation": None,
    "use_official_api": False,
    "require_api_key": False,
    "results": 'HTML',
}

# engine dependent config
categories = ['images']
paging = False

# search-url
base_url = 'https://1x.com'
search_url = base_url + '/backend/search.php?{query}'
gallery_url = 'https://gallery.1x.com/'


# do search-request
def request(query, params):
    params['url'] = search_url.format(query=urlencode({'q': query}))

    return params


# get response from search-request
def response(resp):
    results = []
    xmldom = etree.fromstring(resp.content)
    xmlsearchresult = eval_xpath_getindex(xmldom, '//data', 0)
    dom = html.fragment_fromstring(xmlsearchresult.text, create_parent='div')
    for link in eval_xpath_list(dom, '//a'):
        url = urljoin(base_url, link.attrib.get('href'))
        title = extract_text(link)
        thumbnail_src = urljoin(
            gallery_url, (eval_xpath_getindex(link, './/img', 0).attrib['src']).replace(base_url, '')
        )
        # append result
        results.append(
            {
                'url': url,
                'title': title,
                'img_src': thumbnail_src,
                'content': '',
                'thumbnail_src': thumbnail_src,
                'template': 'images.html',
            }
        )

    # return results
    return results
[enh] engines: add about variable move meta information from comment to the about variable so the preferences, the documentation can show these information 2021-01-13 10:31:25 +00:00			`# SPDX-License-Identifier: AGPL-3.0-or-later`
[fix] 1x engine 1x changed the XML result layout. 2022-01-30 14:59:58 +00:00			`# lint: pylint`
			`"""1x (Images)`

update versions.cfg to use the current up-to-date packages 2015-05-02 13:45:17 +00:00			`"""`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 10:27:28 +00:00
Drop Python 2 (1/n): remove unicode string and url_utils 2020-08-06 15:42:46 +00:00			`from urllib.parse import urlencode, urljoin`
[fix] 1x engine 1x changed the XML result layout. 2022-01-30 14:59:58 +00:00			`from lxml import html, etree`

[fix] 1x engine 2020-12-07 14:46:00 +00:00			`from searx.utils import extract_text, eval_xpath_list, eval_xpath_getindex`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 10:27:28 +00:00
[enh] engines: add about variable move meta information from comment to the about variable so the preferences, the documentation can show these information 2021-01-13 10:31:25 +00:00			`# about`
			`about = {`
			`"website": 'https://1x.com/',`
			`"wikidata_id": None,`
			`"official_api_documentation": None,`
			`"use_official_api": False,`
			`"require_api_key": False,`
			`"results": 'HTML',`
			`}`

[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 10:27:28 +00:00			`# engine dependent config`
			`categories = ['images']`
			`paging = False`

www1x engine: remove comment about unavailable https (https is working now) 2015-06-06 17:44:41 +00:00			`# search-url`
bing_images & www1x engines use https connections 2015-06-06 17:23:07 +00:00			`base_url = 'https://1x.com'`
[fix] pep8 compatibilty 2016-01-18 11:47:31 +00:00			`search_url = base_url + '/backend/search.php?{query}'`
[fix] 1x engine 2020-12-07 14:46:00 +00:00			`gallery_url = 'https://gallery.1x.com/'`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 10:27:28 +00:00

			`# do search-request`
			`def request(query, params):`
			`params['url'] = search_url.format(query=urlencode({'q': query}))`

			`return params`


			`# get response from search-request`
			`def response(resp):`
			`results = []`
[fix] 1x engine 2020-12-07 14:46:00 +00:00			`xmldom = etree.fromstring(resp.content)`
[fix] 1x engine 1x changed the XML result layout. 2022-01-30 14:59:58 +00:00			`xmlsearchresult = eval_xpath_getindex(xmldom, '//data', 0)`
[fix] 1x engine 2020-12-07 14:46:00 +00:00			`dom = html.fragment_fromstring(xmlsearchresult.text, create_parent='div')`
[fix] 1x engine 1x changed the XML result layout. 2022-01-30 14:59:58 +00:00			`for link in eval_xpath_list(dom, '//a'):`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 10:27:28 +00:00			`url = urljoin(base_url, link.attrib.get('href'))`
[fix] update 1x engine 2019-10-16 11:27:05 +00:00			`title = extract_text(link)`
[fix] 1x engine 1x changed the XML result layout. 2022-01-30 14:59:58 +00:00			`thumbnail_src = urljoin(`
			`gallery_url, (eval_xpath_getindex(link, './/img', 0).attrib['src']).replace(base_url, '')`
			`)`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 10:27:28 +00:00			`# append result`
[format.python] initial formatting of the python code This patch was generated by black [1]:: make format.python [1] https://github.com/psf/black Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-12-27 08:26:22 +00:00			`results.append(`
			`{`
			`'url': url,`
			`'title': title,`
			`'img_src': thumbnail_src,`
			`'content': '',`
			`'thumbnail_src': thumbnail_src,`
			`'template': 'images.html',`
			`}`
			`)`
[enh] add 1x.com engine * Deacivated by default, because of the big amount of results 2015-02-01 10:27:28 +00:00
			`# return results`
			`return results`