[Fix] Ufuncs: Support float32 scalars for Add, Multply, and Divide #291

HannanNaeem · 2024-09-10T01:43:59Z

Our current implementation does not support float32 scalars for add, multiply, and divide. This PR introduces a simple fix in the logic to allow computation with float32 scalars.

Currently PyKokkos always casts scalars to pk.double, which is equivalent to float64 in NumPy terms. We then also enforce that both operands of the aforementioned ufuncs have the same type, e.g. both float32 or both float64. This creates a problem when passing a float32 scalar together with a float32 view: the scalar is cast to float64, and the type assertion fails by our own doing.
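
To make the failure mode concrete, here is a minimal sketch that uses Python's standard `array` module as a stand-in for a PyKokkos view (typecode `"f"` is a C float, `"d"` is a C double). The helper name `wrap_scalar_as_double` is hypothetical and only mimics the behaviour described above; it is not the PyKokkos code.

```python
from array import array


def wrap_scalar_as_double(scalar):
    # Mimics the behaviour described above: the scalar always becomes float64.
    return array("d", [scalar])


view = array("f", [1.0, 2.0, 3.0])       # stand-in for a float32 view
scalar_view = wrap_scalar_as_double(0.5)  # scalar is forced to float64

# The "same type" requirement then fails even though the caller
# only ever supplied float32-compatible data.
types_match = view.typecode == scalar_view.typecode
```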

To fix this:

The scalar takes the type float32 or float64 based on the view it is passed with (so the two types remain the same)
Tweak the float impls to use modulus indexing to support scalars
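
The two changes listed above can be sketched roughly as follows. This is a plain-Python illustration under stated assumptions, not the PyKokkos implementation: `array` stands in for a view, the loop stands in for the parallel dispatch, and `scalar_to_view` and `add` are hypothetical names.

```python
from array import array


def scalar_to_view(scalar, like):
    # Give the scalar the same element type as the view it is combined with.
    return array(like.typecode, [scalar])


def add_impl_1d_float(tid, view_a, view_b, out):
    # Modulus indexing: a one-element view_b is re-read for every tid.
    out[tid] = view_a[tid] + view_b[tid % len(view_b)]


def add(view_a, b):
    view_b = b if isinstance(b, array) else scalar_to_view(b, view_a)
    out = array(view_a.typecode, [0.0] * len(view_a))
    for tid in range(len(out)):  # stands in for the parallel loop
        add_impl_1d_float(tid, view_a, view_b, out)
    return out


result = add(array("f", [1.0, 2.0, 3.0]), 0.5)
```

With this shape the same workunit serves both the view-plus-view and the view-plus-scalar case, which is the trade-off the review thread discusses.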

Quality tweak:

Updated error messages to state which types were mismatched
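
As an illustration of what a more verbose mismatch message could look like, here is a small sketch. The function name `check_same_dtype` and the message wording are assumptions made for this example, not the text used in the PR.

```python
def check_same_dtype(name_a, dtype_a, name_b, dtype_b):
    # Name both operands and both types so the mismatch is obvious to the caller.
    if dtype_a != dtype_b:
        raise TypeError(
            f"dtype mismatch: {name_a} is {dtype_a} but {name_b} is {dtype_b}; "
            "both operands must have the same type"
        )


try:
    check_same_dtype("viewA", "float32", "viewB", "float64")
    message = None
except TypeError as exc:
    message = str(exc)
```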

kennykos · 2026-01-25T00:36:18Z

pykokkos/lib/ufuncs.py

 @pk.workunit
 def add_impl_1d_float(tid: int, viewA: pk.View1D[pk.float], viewB: pk.View1D[pk.float], out: pk.View1D[pk.float]):
-    out[tid] = viewA[tid] + viewB[tid]
+    out[tid] = viewA[tid] + viewB[tid % viewB.extent(0)]


Integer modulus is quite costly performance-wise. Is there a reason this is necessary here?

HannanNaeem · 2026-01-25

It's been a while, but I recall that this has to do with supporting operations with scalars. I think that, at the time, we were using this pattern to support scalars (by putting them in a single-value view) without needing a whole other workunit for them.

There are obviously questions around what happens if the sizes are not the same AND viewB is not a singleton... I am assuming that at the time we were OK with this behavior.
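
The open question above can be made concrete with a short plain-Python sketch of which viewB index each thread reads under `tid % extent`. The helper names are hypothetical, and the guard is only one possible policy, not something the PR adds.

```python
def modulus_indices(n_out, n_b):
    # Index of viewB read by each tid when using tid % extent.
    return [tid % n_b for tid in range(n_out)]


def is_unambiguous(n_a, n_b):
    # Hypothetical guard: allow only the scalar (singleton) case or equal extents.
    return n_b == 1 or n_b == n_a


scalar_case = modulus_indices(5, 1)   # every thread reads element 0
equal_case = modulus_indices(5, 5)    # plain element-wise access
wrapped_case = modulus_indices(5, 2)  # viewB silently wraps around
```

In the wrapped case the kernel neither fails nor follows NumPy-style broadcasting rules; it cycles through viewB, which is the behavior being questioned.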

IvanGrigorik · 2026-01-25T01:30:11Z

pykokkos/lib/ufuncs.py

+    if not isinstance(viewA, pk.ViewType) and viewA.dtype.__name__ not in [
+        "float32",
+        "float64",
+    ]:


I hate the fact that each time something is related to existing data types, we need to do it this way.
We should make a type mapping (from PyKokkos to Kokkos and vice versa) and use this mapper whenever we are dealing with data types.
The string comparison is not good at all, but for this PR it is alright.
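
One possible shape for such a mapper is sketched below. Every name here (`DType`, `to_cpp`, `from_cpp`, `is_floating`) is hypothetical and not part of PyKokkos; the C++ type names on the right are the standard fixed-width and floating-point types, chosen as an assumption about what the Kokkos side would use.

```python
from enum import Enum


class DType(Enum):
    FLOAT32 = "float32"
    FLOAT64 = "float64"
    INT32 = "int32"
    INT64 = "int64"


# Single source of truth for the Python-side to C++-side mapping.
_TO_CPP = {
    DType.FLOAT32: "float",
    DType.FLOAT64: "double",
    DType.INT32: "int32_t",
    DType.INT64: "int64_t",
}
_FROM_CPP = {cpp: dtype for dtype, cpp in _TO_CPP.items()}


def to_cpp(dtype):
    return _TO_CPP[dtype]


def from_cpp(cpp_name):
    return _FROM_CPP[cpp_name]


def is_floating(dtype):
    # Replaces ad hoc checks such as `name in ["float32", "float64"]`.
    return dtype in (DType.FLOAT32, DType.FLOAT64)
```

Checks like the one in the diff above could then ask `is_floating(...)` instead of comparing type names as strings.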

IvanGrigorik · 2026-01-25T01:36:36Z

Overall LG!

IvanGrigorik · 2026-01-25T01:38:14Z

@kennykos are you ok with modulus, or do you want me to check out other options?

gliga · 2026-01-25T18:28:23Z

We can always merge as is and then improve later if needed, but I will let @kennykos make a final call on his comment.

kennykos · 2026-01-25T18:41:27Z

Yes, I am fine merging this now, but avoiding integer % and / on device is something we should keep in mind for the future.

gliga · 2026-01-25T18:47:08Z

Did you want to see more shifts? Wouldn't we do this in the translation anyway, and not in the Python code?

kennykos · 2026-01-25T18:54:23Z

I'm a little confused. Are you suggesting that we check whether a%b can be replaced with a-b in translation, and if so change the workunit code?

gliga · 2026-01-25T18:58:56Z

Yes, I am suggesting that optimizations should be done in translation, rather than counting on a user to optimize for a specific platform.

kennykos · 2026-01-25T19:06:19Z

Ah, that makes perfect sense, sounds good.
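
To illustrate the idea of optimizing in translation rather than in user code, here is a small sketch using Python's standard `ast` module. It shows one well-known rewrite, replacing `x % c` with `x & (c - 1)` when `c` is a constant power of two, which is valid for non-negative `x` such as a thread index. This is an assumption-laden example of the general approach, not what the PyKokkos translator does, and it does not cover the runtime `extent(0)` divisor from this PR.

```python
import ast


class ModToMask(ast.NodeTransformer):
    """Rewrite `x % c` to `x & (c - 1)` for constant power-of-two c."""

    def visit_BinOp(self, node):
        self.generic_visit(node)  # rewrite nested expressions first
        right = node.right
        if (
            isinstance(node.op, ast.Mod)
            and isinstance(right, ast.Constant)
            and isinstance(right.value, int)
            and not isinstance(right.value, bool)
            and right.value > 0
            and right.value & (right.value - 1) == 0
        ):
            masked = ast.BinOp(
                left=node.left,
                op=ast.BitAnd(),
                right=ast.Constant(value=right.value - 1),
            )
            return ast.copy_location(masked, node)
        return node


def rewrite(source):
    tree = ModToMask().visit(ast.parse(source))
    ast.fix_missing_locations(tree)
    return ast.unparse(tree)  # requires Python 3.9+


rewritten = rewrite("out[tid] = a[tid] + b[tid % 8]")
untouched = rewrite("out[tid] = a[tid] + b[tid % n]")
```

The workunit author keeps writing `%`, and the cheaper form appears only in the generated code when the rewrite is provably safe.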

HannanNaeem added 2 commits September 4, 2024 19:58

ufuncs fix: Add scalar add not working with float 32 views

afce4de

Add, Multiply, Divide now support float32 scalars

43148f4

IvanGrigorik self-requested a review December 19, 2025 15:44

kennykos reviewed Jan 25, 2026

IvanGrigorik added 3 commits January 24, 2026 18:18

Merge branch 'main' into touch_up_add

f6b6d34

formatting

06e1647

fix typo

d673579

IvanGrigorik approved these changes Jan 25, 2026

IvanGrigorik merged commit b4ee95c into kokkos:main Jan 26, 2026
22 checks passed
