BUG: read_html has unexpected behavior parsing th & td with colspan attribute. #56591
Closed
3 tasks done
Labels
Bug
Closing Candidate
May be closeable, needs more eyeballs
IO HTML
read_html, to_html, Styler.apply, Styler.applymap
Needs Triage
Issue that has not been reviewed by a pandas team member
Pandas version checks
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandas.
I have confirmed this bug exists on the main branch of pandas.
Reproducible Example
Issue Description
Here is the output:
Expected Behavior
Note:
In the real scenario I get the duplicated header names as 'Unnamed: 1,2,3'
.Example:
Installed Versions
The text was updated successfully, but these errors were encountered: