Hello,
I faced the aforementioned error recently with the latest version of readability-lxml. This is part of the stacktrace:
text = document_fromstring(Document(html,negative_keywords=['related']).summary()).text_content()
File "build/bdist.linux-x86_64/egg/readability/readability.py", line 98, in __init__
self.negative_keywords = compile_pattern(negative_keywords)
File "build/bdist.linux-x86_64/egg/readability/readability.py", line 74, in compile_pattern
elements = elements.split(',')
AttributeError: 'list' object has no attribute 'split'
And how it is used in our codebase:
from readability.readability import Document
html = ...
text = document_fromstring(Document(html,negative_keywords=['related']).summary()).text_content()
However, this is OK if I pin to the version 0.3.0.6 of the library.
$ python --version
Python 2.7.6
$ pip --version
pip 1.5.4 from /usr/lib/python2.7/dist-packages (python 2.7)
Any idea why such a breakage?
Thanks
cc @olivierthereaux
Hello,
I faced the aforementioned error recently with the latest version of readability-lxml. This is part of the stacktrace:
And how it is used in our codebase:
However, this is OK if I pin to the version
0.3.0.6of the library.Any idea why such a breakage?
Thanks
cc @olivierthereaux