CLN: to_datetime internals #21702

mroeschke · 2018-07-02T00:39:06Z

tests passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff

The internals of to_datetime are getting a bit unwieldy, so I split the _convert_listlike logic and the origin-shifting logic out of to_datetime into the module-level functions _convert_listlike_datetime and _adjust_to_origin, respectively. The logic was not changed.

pep8speaks · 2018-07-02T00:39:07Z

Hello @mroeschke! Thanks for updating the PR.

In the file pandas/core/tools/datetimes.py, following are the PEP8 issues :

Line 243:17: E722 do not use bare except
Line 315:9: E722 do not use bare except

Comment last updated on July 03, 2018 at 05:24 Hours UTC

codecov · 2018-07-02T01:19:30Z

Codecov Report

Merging #21702 into master will increase coverage by <.01%.
The diff coverage is 88.88%.

@@            Coverage Diff             @@
##           master   #21702      +/-   ##
==========================================
+ Coverage    91.9%   91.91%   +<.01%     
==========================================
  Files         158      158              
  Lines       49690    49695       +5     
==========================================
+ Hits        45670    45675       +5     
  Misses       4020     4020

Flag	Coverage Δ
#multiple	`90.28% <88.88%> (ø)`	⬆️
#single	`41.95% <51.28%> (ø)`	⬆️

Impacted Files	Coverage Δ
pandas/core/tools/datetimes.py	`85.22% <88.88%> (+0.23%)`	⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 7cd2679...e9320f1.

jreback · 2018-07-02T10:39:58Z

LGTM. Can you do a perf check to make sure nothing changed (the caching logic was lightly touched here)?

jorisvandenbossche · 2018-07-02T13:45:32Z

pandas/core/tools/datetimes.py

@@ -38,7 +39,7 @@ def _guess_datetime_format_for_array(arr, **kwargs):
        return _guess_datetime_format(arr[non_nan_elements[0]], **kwargs)


-def _maybe_cache(arg, format, cache, tz, convert_listlike):


The tz here was not used?

tz was passed into the convert_listlike function further down, but now I am embedding it into convert_listlike with functools.partial in to_datetime
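
For readers unfamiliar with the pattern, here is a minimal sketch of binding tz ahead of time with functools.partial, so that a downstream helper no longer needs its own tz parameter. The names convert_listlike_sketch and apply_converter_sketch are hypothetical stand-ins, not the pandas functions; only the shape of the call is meant to match the reply above.

```python
from functools import partial


def convert_listlike_sketch(values, tz=None):
    """Hypothetical converter: pair each value with the requested tz."""
    return [(value, tz) for value in values]


def apply_converter_sketch(values, convert_listlike):
    """Hypothetical helper: it takes a ready-made converter and no tz."""
    return convert_listlike(values)


# Bind tz once, up front; the helper then calls a one-argument function.
convert_with_tz = partial(convert_listlike_sketch, tz="UTC")
result = apply_converter_sketch(["a", "b"], convert_with_tz)
print(result)
```

The helper's signature stays small because the tz choice travels inside the partial object rather than through every intermediate call.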

jbrockmendel · 2018-07-02T18:42:41Z

pandas/core/tools/datetimes.py

+            raise e
+
+
+def _adjust_to_origin(arg, origin, unit):


+1 for separating this out

jbrockmendel · 2018-07-02T18:44:28Z

pandas/core/tools/datetimes.py

+        passed unit from to_datetime, must be 'D'
+    Returns
+    -------
+    ndarray of adjusted dates


Is it necessarily an ndarray? Couldn't it be a Timestamp?

Need a newline before Returns.

Good catch, yeah this can be a scalar value. Will fix that tonight.
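
As an illustration of why the Returns section should not promise an ndarray, the sketch below shifts day offsets by an origin and returns a scalar for scalar input and a list for list-like input. It uses only the standard library; adjust_to_origin_sketch is a hypothetical name and the body is an assumption for illustration, not the code in this PR.

```python
from datetime import datetime, timedelta


def adjust_to_origin_sketch(arg, origin):
    """Hypothetical helper: shift day offsets by origin.

    Returns
    -------
    datetime for scalar input, list of datetime for list-like input
    """
    def shift(days):
        return origin + timedelta(days=days)

    if isinstance(arg, (list, tuple)):
        return [shift(days) for days in arg]
    return shift(arg)


origin = datetime(1960, 1, 1)
scalar_result = adjust_to_origin_sketch(1, origin)
listlike_result = adjust_to_origin_sketch([1, 2], origin)
print(scalar_result)
print(listlike_result)
```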

mroeschke · 2018-07-03T05:31:38Z

My asv setup is still a little broken, but here's a benchmark showing no performance hit (with cache)

In [1]: from pandas import *

In [3]: N = 100

In [4]: dup_string_with_tz = ['2000-02-11 15:00:00-0800'] * N

# this branch
In [5]: %timeit to_datetime(dup_string_with_tz, cache=True)
1.31 ms ± 44.5 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

In [6]: %timeit to_datetime(dup_string_with_tz, cache=False)
2.3 ms ± 41.2 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

# Master
In [4]: %timeit to_datetime(dup_string_with_tz, cache=True)
1.29 ms ± 13.3 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

In [5]: %timeit to_datetime(dup_string_with_tz, cache=False)
2.31 ms ± 68.9 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
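
For context on why cache=True is faster on this input, here is a rough standard-library sketch of the general idea: parse each distinct string once and map the results back onto the full list. It is an assumption-laden illustration (parse_all and parse_with_cache are hypothetical names, and datetime.strptime stands in for the real parser), not the pandas implementation.

```python
from datetime import datetime

FORMAT = "%Y-%m-%d %H:%M:%S%z"


def parse_all(strings):
    """Parse every element, duplicates included."""
    return [datetime.strptime(s, FORMAT) for s in strings]


def parse_with_cache(strings):
    """Parse each distinct string once, then look results up per element."""
    cache = {s: datetime.strptime(s, FORMAT) for s in set(strings)}
    return [cache[s] for s in strings]


N = 100
dup_string_with_tz = ["2000-02-11 15:00:00-0800"] * N

cached = parse_with_cache(dup_string_with_tz)
uncached = parse_all(dup_string_with_tz)
print(cached == uncached)
```

With N identical strings the cached variant calls the parser once instead of N times, which is the kind of saving a duplicate-heavy benchmark like the one above is designed to expose.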

jreback · 2018-07-03T14:23:49Z

Thanks @mroeschke. Happy to take refactorings that clean things up and make the code more readable.

mroeschke added 3 commits July 1, 2018 00:03

CLN: to_datetime internals

fcad4d0

More cleanup

b2b1104

fix yearfirst typo

3e0c36b

mroeschke added the Clean label Jul 2, 2018

jreback added this to the 0.24.0 milestone Jul 2, 2018

jorisvandenbossche reviewed Jul 2, 2018

jbrockmendel reviewed Jul 2, 2018

jbrockmendel reviewed Jul 2, 2018

mroeschke added 2 commits July 2, 2018 19:43

Merge remote-tracking branch 'upstream/master' into reorganize_to_datetime

7f3e18d

adjust spacing

e9320f1

jorisvandenbossche approved these changes Jul 3, 2018

jreback merged commit 1de57da into pandas-dev:master Jul 3, 2018

mroeschke deleted the reorganize_to_datetime branch July 3, 2018 15:13

Sup3rGeo pushed a commit to Sup3rGeo/pandas that referenced this pull request Oct 1, 2018

CLN: to_datetime internals (pandas-dev#21702)

a5fa6bd
