ENH: Restore support for reading Stata 104 format dta files, and add support for 103 #58555

Open
wants to merge 2 commits into
base: main

cmjcharlton
Contributor

I wasn't sure whether to label this as a bug fix or an enhancement: pandas has supported this format previously, but it last worked a long time ago.

I have also included support for format 103 here since, for the purposes of reading, it is identical to 104 (103 simply does not support data of byte type).
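For readers unfamiliar with how these versions are told apart, here is a minimal sketch of sniffing the format version from the start of a file. It assumes that legacy (pre-117) dta files store the format version as their first byte, while newer files begin with a `<stata_dta>` tag. The helper name `sniff_dta_version` and the sample header bytes are hypothetical and for illustration only; this is not the code in this PR.

```python
import io


def sniff_dta_version(stream):
    """Return the legacy format version (e.g. 103 or 104), or None.

    Assumption: legacy dta files keep the version number in byte 0,
    and the newer XML-like layout starts with "<" instead.
    """
    first = stream.read(1)
    if not first:
        raise ValueError("empty stream")
    if first == b"<":
        # Newer layout: the version lives inside the tagged header instead.
        return None
    return first[0]


# Hypothetical header start for a format 104 file (only byte 0 matters here).
sample_104 = io.BytesIO(bytes([104, 2, 1, 0]))
sample_new = io.BytesIO(b"<stata_dta><header>")

version_104 = sniff_dta_version(sample_104)
version_new = sniff_dta_version(sample_new)
```

With a real file, the same helper could be called on a handle opened in binary mode, for example `open(path, "rb")`.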

@jbrockmendel
Member

cc @bashtage

@mroeschke mroeschke added the IO Stata read_stata, to_stata label May 4, 2024
@bashtage
Contributor

bashtage commented May 8, 2024

Seems pretty simple. Are the dta files produced by Stata or something else?

@cmjcharlton
Contributor Author

The new test files are produced with a program that I wrote based on the published specifications, but I test them with Stata (and occasionally a hex editor to be sure). I have also tested with a variety of historic files available from the Stata Technical Bulletin and the Stata Journal.

@mroeschke mroeschke requested a review from bashtage May 8, 2024 23:00
Labels
IO Stata read_stata, to_stata
Development

Successfully merging this pull request may close these issues.

BUG: Unexpected behaviour when attempting to read Stata 104 format dta files
4 participants