searxngRebrandZaclys/searx/engines/dailymotion.py

"""
 Dailymotion (Videos)

 @website     https://www.dailymotion.com
 @provide-api yes (http://www.dailymotion.com/developer)

 @using-api   yes
 @results     JSON
 @stable      yes
 @parse       url, title, content, thumbnail, publishedDate, embedded

 @todo        set content-parameter with correct data
"""

from json import loads
from datetime import datetime
from urllib.parse import urlencode
from searx.utils import match_language, html_to_text

# engine dependent config
categories = ['videos']
paging = True
language_support = True

# search-url
# see http://www.dailymotion.com/doc/api/obj-video.html
search_url = 'https://api.dailymotion.com/videos?fields=created_time,title,description,duration,url,thumbnail_360_url,id&sort=relevance&limit=5&page={pageno}&{query}'  # noqa
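# The iframe sets data-src rather than src, so the player is only loaded
# once the user clicks on the result.
embedded_url = (
    '<iframe frameborder="0" width="540" height="304" '
    'data-src="https://www.dailymotion.com/embed/video/{videoid}" allowfullscreen></iframe>'
)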

supported_languages_url = 'https://api.dailymotion.com/languages'


# do search-request
def request(query, params):
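    # 'all' means no language preference; fall back to en-US
    if params['language'] == 'all':
        locale = 'en-US'
    else:
        # supported_languages is not defined in this module; the searx engine
        # loader is expected to inject it (generated into engines_languages.json)
        locale = match_language(params['language'], supported_languages)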

    params['url'] = search_url.format(
        query=urlencode({'search': query, 'localization': locale}),
        pageno=params['pageno'])

    return params
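
# A minimal, standalone sketch of the URL that request() builds. It assumes
# nothing from searx: build_search_url and SEARCH_URL are hypothetical names,
# and the field list is shortened for readability.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Same shape as search_url above, with a shortened (illustrative) field list
SEARCH_URL = ('https://api.dailymotion.com/videos'
              '?fields=title,url,id&sort=relevance&limit=5&page={pageno}&{query}')


def build_search_url(query, locale='en-US', pageno=1):
    """Hypothetical helper: format the template the way request() does."""
    return SEARCH_URL.format(
        query=urlencode({'search': query, 'localization': locale}),
        pageno=pageno)


url = build_search_url('big buck bunny', locale='fr-FR', pageno=2)

# Decode the query string again to check what the API would receive
args = parse_qs(urlsplit(url).query)
print(args['search'], args['localization'], args['page'])
```

# urlencode takes care of escaping, so spaces and non-ASCII characters in the
# query are safe to pass through unchanged.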


# get response from search-request
def response(resp):
    results = []

    search_res = loads(resp.text)

    # return empty array if there are no results
    if 'list' not in search_res:
        return []

    # parse results
    for res in search_res['list']:
        title = res['title']
        url = res['url']
        # description and thumbnail may be missing or null; fall back to ''
        content = html_to_text(res.get('description') or '')
        thumbnail = res.get('thumbnail_360_url') or ''
        # created_time is a Unix timestamp; converted to local time
        publishedDate = datetime.fromtimestamp(res['created_time'])
        embedded = embedded_url.format(videoid=res['id'])

        # http to https
        thumbnail = thumbnail.replace("http://", "https://")

        results.append({'template': 'videos.html',
                        'url': url,
                        'title': title,
                        'content': content,
                        'publishedDate': publishedDate,
                        'embedded': embedded,
                        'thumbnail': thumbnail})

    # return results
    return results


# get supported languages from their site
def _fetch_supported_languages(resp):
    supported_languages = {}

    response_json = loads(resp.text)

    for language in response_json['list']:
        supported_languages[language['code']] = {}

        name = language['native_name']
        if name:
            supported_languages[language['code']]['name'] = name
        english_name = language['name']
        if english_name:
            supported_languages[language['code']]['english_name'] = english_name

    return supported_languages
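

# A small sketch of what _fetch_supported_languages produces, run against a
# made-up payload instead of a live response. SAMPLE and parse_languages are
# hypothetical; only the keys 'list', 'code', 'name' and 'native_name' are
# taken from the code above.

```python
from json import loads

# Hypothetical payload; the second entry has no native name
SAMPLE = '''
{"list": [
    {"code": "fr", "name": "French", "native_name": "Fran\\u00e7ais"},
    {"code": "xx", "name": "Example", "native_name": null}
]}
'''


def parse_languages(text):
    """Map each language code to its native and English names, if present."""
    languages = {}
    for language in loads(text)['list']:
        entry = languages.setdefault(language['code'], {})
        if language.get('native_name'):
            entry['name'] = language['native_name']
        if language.get('name'):
            entry['english_name'] = language['name']
    return languages


parsed = parse_languages(SAMPLE)
print(parsed)
```

# Empty or null names are skipped, so an entry may carry only one of the two
# keys, or none at all.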


# Change history (recovered from the blame view, oldest first):
#   2013-12-30  add dailymotion engine
#   2014-01-05  [fix] dailymotion engine : no more html tag in the description
#   2014-01-19  fix: robot fw, entry points, some flake8, package searx egg
#   2014-01-29  [enh] paging support for dailymotion
#   2014-09-01  fix dailymotion engine and add comments to it
#   2014-09-07  [enh] dailymotion engine: add language support
#   2015-01-05  Integrated media in results + Deezer Engine (new "embedded"
#               result item; iframes set data-src instead of src so they only
#               load when clicked)
#   2015-05-02  [enh] reduce the number of http outgoing connections
#   2015-05-02  update versions.cfg to use the current up-to-date packages
#   2016-11-06  [mod] fetch supported languages for several engines
#               (utils/fetch_languages.py generates engines_languages.json)
#   2016-12-15  tests for _fetch_supported_languages in engines and refactor
#               method to make it testable without making requests
#   2019-01-06  Revert "remove 'all' option from search languages"
#   2019-07-13  embedded iframe (youtube, dailymotion, vimeo): use https
#   2019-07-31  [fix] dailymotion engine: remove HTML tags from the description
#   2020-08-06  Drop Python 2 (1/n): remove unicode string and url_utils