Uncaught exception in readability


#1

Describe the problem you’re having:
I recently enabled af_readability on a bunch of feeds, and due to reading the ParseException thread I checked the system log, and it was filled with errors, all for the same feed.
The error is:
Uncaught andreskrey\Readability\ParseException: Could not parse text. in /var/www/ttrss/vendor/andreskrey/Readability/Readability.php:201 Stack trace: #0 /var/www/ttrss/plugins/af_readability/init.php(177): andreskrey\Readability\Readability->parse(’…’) #1 /var/www/ttrss/plugins/af_readability/init.php(210): Af_Readability->extract_content(‘https://jylland…’) #2 /var/www/ttrss/plugins/af_readability/init.php(230): Af_Readability->process_article(Array) #3 /var/www/ttrss/classes/rssutils.php(754): Af_Readability->hook_article_filter(Array) #4 /var/www/ttrss/classes/rssutils.php(190): RSSUtils::update_rss_feed(149, true, false) #5 /var/www/ttrss/update.php(199): RSSUtils::update_daemon_common(50) #6 {main} thrown

And when I try to debug the feed is seems to crash also:
TRUNCATED
[12:19:39/20286] article processed
[12:19:39/20286] guid 1,https://jyllands-posten.dk/protected/premium/erhverv/ECE10775643/eulandenes-gaeld-er-faldende/ / SHA1:e00d5d0beb3d1b005360fb3d5bee7131ea67156f
[12:19:39/20286] orig date: 1533022200
[12:19:39/20286] date 1533022200 [2018/07/31 07:30:00]
[12:19:39/20286] title EU-landenes gæld er faldende
[12:19:39/20286] link https://jyllands-posten.dk/protected/premium/erhverv/ECE10775643/eulandenes-gaeld-er-faldende/
[12:19:39/20286] author Martin Flink
[12:19:39/20286] num_comments: 0
[12:19:39/20286] looking for tags…
[12:19:39/20286] tags found: erhverv
[12:19:39/20286] done collecting data.
[12:19:39/20286] article hash: 49c87287dd3e2eb00329b9b02d1c6e4c81478f4d [stored=]
[12:19:39/20286] hash differs, applying plugin filters:
[12:19:39/20286] … Af_Comics
[12:19:39/20286] === 0.0000 (sec)
[12:19:39/20286] … Af_ComicsClean
[12:19:39/20286] === 0.0000 (sec)
[12:19:39/20286] … Af_Fsckportal
[12:19:39/20286] === 0.0001 (sec)
[12:19:39/20286] … Af_Readability

If possible include steps to reproduce the problem:

tt-rss version (including git commit id):
v17.12 (a2d1fa5)

Platform (i.e. Linux distro, PHP, PostgreSQL, etc) versions:
Debian 9 VPS
PHP 7
postgres 9.6

Please provide any additional information below:
feed URL https://jyllands-posten.dk/?service=rssfeed&mode=top

What am I missing here?


#2

next time at least try to search before posting


#3

Sorry about that fox.

But after doing that the error changes to a warning:
lib/jimIcon.php:107 imagecreatefromstring(): Data is not in a recognized format


#4

that’s not really related

if exception catch change works its good enough

e: https://git.tt-rss.org/fox/tt-rss/commit/c925f4e3fdad04442c5c2f542883dbd26c81cc23


ParseException thrown in af_redditimgur