Skip to content

[BUG] Yahoo duplicate row #610

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
okomarov opened this issue Jan 26, 2019 · 3 comments
Closed

[BUG] Yahoo duplicate row #610

okomarov opened this issue Jan 26, 2019 · 3 comments

Comments

@okomarov
Copy link

I am on 0.7.0 pdr:

pdr.get_data_yahoo('AMZN', start='2019-01-25', end='2019-01-26')

where 2019-01-26 is a Saturday.

This outputs:

                   High          Low    Open        Close   Volume    Adj Close
Date
2019-01-25  1683.449951  1662.000000  1670.5  1670.569946  4959679  1670.569946
2019-01-25  1683.479980  1661.609985  1670.5  1670.569946  4945900  1670.569946

It should only return the last row.

@okomarov
Copy link
Author

okomarov commented Jan 26, 2019

This seems to be an issue with yahoo itself.

Querying same day, returns nothing:

screenshot 2019-01-26 at 22 04 51

Querying Friday to Sunday returns the two rows listed above:


screenshot 2019-01-26 at 21 56 31

and params for the GET request are:

  name value 
formatted true
crumb plThqsOcXUi
lang en-US
region US
period1 1548374400
period2 1548547200
interval 1d
events div|split
corsDomain finance.yahoo.com

Workaround

Drop first as df is in ascending order while yahoo web page lists in descending :

df = df[~df.index.duplicated(keep='last')]

@okomarov
Copy link
Author

okomarov commented Jan 30, 2019

Reported bug on Yahoo forums as well.

@bashtage
Copy link
Contributor

Closing as answered.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants