REF: Fuse all the types #23022

jbrockmendel · 2018-10-06T23:52:18Z

Everything is passing locally, just want to run this through the CI for good measure before continuing down this path.

pep8speaks · 2018-10-06T23:52:21Z

Hello @jbrockmendel! Thanks for submitting the PR.

There are no PEP8 issues in the file pandas/core/internals/blocks.py !

jbrockmendel · 2018-10-06T23:53:33Z

pandas/core/internals/blocks.py

@@ -1153,7 +1153,7 @@ def check_int_bool(self, inplace):
                                               inplace=inplace, limit=limit,
                                               fill_value=fill_value,
                                               coerce=coerce,
-                                               downcast=downcast, mgr=mgr)


Edits here are unrelated, should be removed from this PR.

jreback · 2018-10-06T23:55:04Z

pandas/_libs/groupby_helper.pxi.in

        int64_t lab

    N, K = (<object> values).shape
    accum = np.empty_like(values)
-    accum.fill({{inf_val}})
+    if groupby_t is int64_t:


can u make this more generic ? she what if we expand this to other int types?

Presumably. The MO with these PRs is to keep the logic unchanged.

I think there is also a cost in compile-time.

this is a small cost of compile time (actually maybe nothing as cython is pretty smart). but i suppose can handle later.

Fair enough. Easy to implement if/when its actually needed.

see some comments above

codecov · 2018-10-07T00:30:00Z

Codecov Report

Merging #23022 into master will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master   #23022   +/-   ##
=======================================
  Coverage   92.19%   92.19%           
=======================================
  Files         169      169           
  Lines       50959    50959           
=======================================
  Hits        46980    46980           
  Misses       3979     3979

Flag	Coverage Δ
#multiple	`90.61% <ø> (ø)`	⬆️
#single	`42.29% <ø> (ø)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 913f71f...400e708. Read the comment docs.

jbrockmendel · 2018-10-07T16:31:49Z

Just pushed, moving some more functions over. I think I should stop here for this PR before the diff gets out of hand.

Made a bug report to cython about a mysterious failure.

jbrockmendel · 2018-10-14T15:33:55Z

Thoughts here?

jreback · 2018-10-14T17:05:32Z

pandas/_libs/algos_common_helper.pxi.in

-                                 ndarray[int64_t] indexer, Py_ssize_t loc,
-                                 ndarray[{{dest_type2}}] out):
+def put2d_{{name}}_{{dest_type[:-2]}}(ndarray[{{c_type}}, ndim=2, cast=True] values,
+                                      ndarray[int64_t] indexer, Py_ssize_t loc,


this is a it obtuse can u make it more explicit (the slice)

jreback · 2018-10-14T17:07:26Z

pandas/_libs/groupby_helper.pxi.in

@@ -638,7 +623,12 @@ def group_max_{{name}}(ndarray[{{dest_type2}}, ndim=2] out,
    nobs = np.zeros_like(out)

    maxx = np.empty_like(out)
-    maxx.fill(-{{inf_val}})
+    if groupby_t is int64_t:


can u comment / add a Todo here

jreback · 2018-10-14T17:09:01Z

pandas/_libs/groupby_helper.pxi.in

+                            maxx[lab, j] = val
+                else:
+                    if val == val and val != nan_val:
+                        nobs[lab, j] += 1


we should have a function that does the null comparisons though with the template code it is slightly
tricky, maybe have a suite of isna_int, isna_float functions

we should have a function that does the null comparisons though with the template code

I've been thinking about something similar, will give it a go in the next pass.

jreback · 2018-10-14T17:09:35Z

pandas/_libs/groupby_helper.pxi.in

                else:
                    out[i, j] = maxx[i, j]


+group_max_float64 = group_max["float64_t"]
+group_max_float32 = group_max["float32_t"]
+group_max_int64 = group_max["int64_t"]


we DO need to expand these to all int/unit dtypes
FYI

jreback · 2018-10-14T17:09:53Z

pandas/_libs/groupby_helper.pxi.in

        int64_t lab

    N, K = (<object> values).shape
    accum = np.empty_like(values)
-    accum.fill({{inf_val}})
+    if groupby_t is int64_t:


see some comments above

jreback · 2018-10-14T17:11:37Z

pandas/_libs/sparse_op_helper.pxi.in

-                'mod': '__mod_{2}({0}, {1})',
-                'truediv': '__truediv_{2}({0}, {1})',
-                'floordiv': '__floordiv_{2}({0}, {1})',
+                'div': '__div({0}, {1})',


these names are odd

maybe just call them div and so on

jbrockmendel · 2018-10-14T19:53:41Z

@datapythonista I don’t understand what is causing the travis failure. Do you recognize it?

datapythonista · 2018-10-14T20:05:52Z

I'mc checking from my cell phone which makes it tricky to check, but if I see it correctly, this is the error: pandas/_libs/algos_rank_helper.pxi.in:177:80: E501 line too long (80 > 79 characters).

jreback

small question, other lgtm.

pandas/_libs/algos_rank_helper.pxi.in

pandas/_libs/groupby_helper.pxi.in

jreback · 2018-10-17T12:36:22Z

thanks!

jbrockmendel added 5 commits October 6, 2018 08:20

use fused types for some sparse functions

49f06ed

use fused types in groupby_helper

d24ec56

Use fused types for more of groupby_helper

54520e2

fuse more

1c79958

remove unnecessary arg

e600277

jbrockmendel commented Oct 6, 2018

View reviewed changes

jreback reviewed Oct 6, 2018

View reviewed changes

jbrockmendel added 2 commits October 6, 2018 19:14

cleanup and fuse

89997ee

revert non-central changes

b13317b

gfyoung added Refactor Internal refactoring of code Internals Related to non-user accessible pandas implementation labels Oct 7, 2018

fuse more things

db9d796

jbrockmendel changed the title ~~WIP: Fuse all the types~~ REF: Fuse all the types Oct 10, 2018

jreback requested changes Oct 14, 2018

View reviewed changes

jbrockmendel added 4 commits October 14, 2018 11:03

nicer names

a69438b

requested comments/cleanups

cdcde6c

Merge branch 'master' of https://github.com/pandas-dev/pandas into temp5

1e98add

Dummy commit to force CI

adbc67c

jbrockmendel closed this Oct 14, 2018

jbrockmendel reopened this Oct 14, 2018

wrap long line

dc76269

jbrockmendel mentioned this pull request Oct 15, 2018

REF: use fused types for join_helper #23171

Merged

jbrockmendel added 2 commits October 15, 2018 15:44

Merge branch 'master' of https://github.com/pandas-dev/pandas into temp5

9105795

Merge branch 'master' of https://github.com/pandas-dev/pandas into temp5

400e708

jreback approved these changes Oct 17, 2018

View reviewed changes

pandas/_libs/algos_rank_helper.pxi.in Show resolved Hide resolved

pandas/_libs/groupby_helper.pxi.in Show resolved Hide resolved

jreback added this to the 0.24.0 milestone Oct 17, 2018

jreback merged commit e8d29e7 into pandas-dev:master Oct 17, 2018

jbrockmendel deleted the temp5 branch October 17, 2018 16:00

tm9k1 pushed a commit to tm9k1/pandas that referenced this pull request Nov 19, 2018

REF: Fuse all the types (pandas-dev#23022)

fb01a69

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

REF: Fuse all the types #23022

REF: Fuse all the types #23022

jbrockmendel commented Oct 6, 2018

pep8speaks commented Oct 6, 2018

jbrockmendel Oct 6, 2018

jreback Oct 6, 2018

jbrockmendel Oct 7, 2018

jreback Oct 10, 2018

jbrockmendel Oct 10, 2018

jreback Oct 14, 2018

codecov bot commented Oct 7, 2018 •

edited

Loading

jbrockmendel commented Oct 7, 2018

jbrockmendel commented Oct 14, 2018

jreback Oct 14, 2018

jreback Oct 14, 2018

jreback Oct 14, 2018

jbrockmendel Oct 14, 2018

jreback Oct 17, 2018

jreback Oct 14, 2018

jreback Oct 14, 2018

jreback Oct 14, 2018

jbrockmendel commented Oct 14, 2018

datapythonista commented Oct 14, 2018

jreback left a comment

jreback commented Oct 17, 2018

REF: Fuse all the types #23022

REF: Fuse all the types #23022

Conversation

jbrockmendel commented Oct 6, 2018

pep8speaks commented Oct 6, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Oct 7, 2018 • edited Loading

Codecov Report

jbrockmendel commented Oct 7, 2018

jbrockmendel commented Oct 14, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jbrockmendel commented Oct 14, 2018

datapythonista commented Oct 14, 2018

jreback left a comment

Choose a reason for hiding this comment

jreback commented Oct 17, 2018

codecov bot commented Oct 7, 2018 •

edited

Loading