Skip to content

AttributeError: 'list' object has no attribute 'split' #70

@thom4parisot

Description

@thom4parisot

Hello,

I faced the aforementioned error recently with the latest version of readability-lxml. This is part of the stacktrace:

    text = document_fromstring(Document(html,negative_keywords=['related']).summary()).text_content()
  File "build/bdist.linux-x86_64/egg/readability/readability.py", line 98, in __init__
    self.negative_keywords = compile_pattern(negative_keywords)
  File "build/bdist.linux-x86_64/egg/readability/readability.py", line 74, in compile_pattern
    elements = elements.split(',')
AttributeError: 'list' object has no attribute 'split'

And how it is used in our codebase:

from readability.readability import Document

html = ...

text = document_fromstring(Document(html,negative_keywords=['related']).summary()).text_content()

However, this is OK if I pin to the version 0.3.0.6 of the library.

$ python --version
Python 2.7.6

$ pip --version
pip 1.5.4 from /usr/lib/python2.7/dist-packages (python 2.7)

Any idea why such a breakage?

Thanks

cc @olivierthereaux

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions