Marc Abonce Seguin
|
343e555ee9
|
[fix] append http if no scheme is provided in xpath's extact_url
This solves a bug with Yahoo where some results don't specify
a protocol.
|
2018-04-08 20:35:34 -05:00 |
Adam Tauber
|
1972a044a3
|
[fix] produce valid urls if scheme is missing
|
2017-05-22 15:48:37 +02:00 |
Adam Tauber
|
52e615dede
|
[enh] py3 compatibility
|
2017-05-15 12:02:30 +02:00 |
David A Roberts
|
7492997c51
|
[fix] allow empty content
|
2017-01-17 21:14:33 +10:00 |
Alexandre Flament
|
90e1db3e5c
|
[fix] extract_text: use html.tostring instead html_to_text. Fix #711
|
2016-12-31 13:56:09 +01:00 |
David A Roberts
|
1e9dab08e6
|
[fix] behaviour for page_size>1 and first_page_num>0
eg. pageno=1,21,41,... instead of 20,40,60,...
|
2016-08-14 22:10:25 +10:00 |
Kirill Isakov
|
bacc9a3df1
|
Add paging support to XPath & Erowid engines
|
2016-03-28 19:15:03 +06:00 |
Adam Tauber
|
bd22e9a336
|
[fix] pep8 compatibilty
|
2016-01-18 12:47:31 +01:00 |
Cqoicebordel
|
44c9216c49
|
Sanitize extract_text
|
2015-01-25 20:04:44 +01:00 |
potato
|
6f535b6fae
|
[fix] error when xpath_results in extraxt_text is _ElementUnicodeResult instead of _ElementStringResult
|
2014-03-04 19:43:41 +01:00 |
asciimoo
|
c1d7d30b8e
|
[mod] len() removed from conditions
|
2014-02-11 13:13:51 +01:00 |
asciimoo
|
b647244abf
|
[fix] function parameters
|
2014-01-30 03:10:20 +01:00 |
asciimoo
|
3dcb835910
|
[fix] function parameters
|
2014-01-30 02:36:05 +01:00 |
asciimoo
|
fe82637eac
|
[enh] importable url extractor
|
2014-01-30 02:32:58 +01:00 |
asciimoo
|
59eeeaab87
|
[fix] html tag removal
|
2014-01-23 11:08:08 +01:00 |
asciimoo
|
b2492c94f4
|
[fix] pep/flake8 compatibility
|
2014-01-20 02:31:20 +01:00 |
asciimoo
|
060ea4d2f5
|
[fix] whitespaces removed
|
2014-01-12 18:48:38 +01:00 |
Dalf
|
3dc3fc7770
|
[mod][fix] xpath engine simplified, yahoo engine never returns truncated urls
|
2014-01-05 14:06:52 +01:00 |
dalf
|
664c039b38
|
xpath engine: bug fix
|
2013-12-30 22:34:35 +01:00 |
asciimoo
|
e50a72b0e3
|
[enh] suggestion support for xpath engine
|
2013-11-13 19:33:09 +01:00 |
asciimoo
|
17bf00ee42
|
[enh] removing result html tags
|
2013-11-09 18:39:20 +01:00 |
asciimoo
|
7965da55a7
|
[fix] urlparsing fix
|
2013-10-27 12:01:03 +01:00 |
asciimoo
|
5d764f95cf
|
[enh] xpath engine absolute xpath support
|
2013-10-26 13:45:43 +02:00 |
asciimoo
|
badd988545
|
[enh] xpath engine added
|
2013-10-26 02:22:20 +02:00 |