paragraph.text is missing text #1123

Open
@maya-burnard

There's a certain type of comment (it looks like a tracked change, not to be confused with the comments stored in comments.xml) in which old text is deleted and new text is inserted.

Extracting this text via

import docx

document = docx.Document(filepath)  # filepath: path to the .docx file
print(document.paragraphs[0].text)

produces the following output:

The civil war literature has substantial gaps. wo primary theoriescivil wars occur when citizens 1) become sufficiently motivated and 2) have the opportunity to rebel.  explain

In this case, the correct output should be:

The civil war literature has substantial gaps. Two primary theories argue civil wars occur when citizens 1) become sufficiently motivated to commit violence and 2) have the opportunity to rebel. Neither theory explains
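
A possible workaround, sketched under some assumptions: if the missing words are tracked insertions, they would sit in runs wrapped in w:ins elements, and the deleted words in w:delText elements inside w:del. Collecting every w:t element under the paragraph, at any depth, should then give the text as if all changes were accepted. The helper name accepted_text and the sample XML below are hypothetical and only illustrate the idea; they use the standard library rather than any python-docx API.

```python
import xml.etree.ElementTree as ET

# WordprocessingML main namespace (the "w:" prefix in document.xml).
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def accepted_text(paragraph_xml: str) -> str:
    """Return the paragraph text as if all tracked changes were accepted.

    Hypothetical helper: it joins every w:t element at any depth, so runs
    nested inside w:ins are included. Deleted text lives in w:delText, not
    w:t, so it is skipped without any extra filtering.
    """
    root = ET.fromstring(paragraph_xml)
    return "".join(t.text or "" for t in root.iter(f"{{{W_NS}}}t"))


# Made-up paragraph: "T" is a tracked insertion, "t" a tracked deletion.
SAMPLE = (
    f'<w:p xmlns:w="{W_NS}">'
    '<w:r><w:t xml:space="preserve">gaps. </w:t></w:r>'
    '<w:ins w:id="1" w:author="A"><w:r><w:t>T</w:t></w:r></w:ins>'
    '<w:del w:id="2" w:author="A"><w:r><w:delText>t</w:delText></w:r></w:del>'
    '<w:r><w:t>wo primary theories</w:t></w:r>'
    '</w:p>'
)

if __name__ == "__main__":
    print(accepted_text(SAMPLE))
```

To try this on a real document, the paragraph XML could be obtained by serializing the paragraph's underlying lxml element, for example with lxml.etree.tostring(paragraph._element, encoding="unicode"). Note that _element is a private attribute, so relying on it is an assumption that may not hold across python-docx versions.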
