Stop using floats

nifty@lemmy.world · 1 year ago

Stop using floats

davidgro@lemmy.world · 1 year ago

Serious answer: Posits seem cool, like they do most of what floats do, but better (in a given amount of space). I think supporting them in hardware would be awesome, but of course there’s a chicken and egg problem there with supporting them in programming languages.

Quetzalcutlass@lemmy.world · edit-2 1 year ago

Posits aside, that page had one of the best, clearest explanations of how floating point works that I’ve ever read. The authors of my college textbooks could have learned a thing or two about clarity from this writer.

Kodiack@lemmy.world · edit-2 1 year ago

I had the great honour of seeing John Gustafson give a presentation about unums shortly after he first proposed posits (type III unums). The benefits over floating point arithmetic seemed incredible, and they seemed largely much more simple.

I also got to chat with him about “Gustafson’s Law”, which kinda flips Amdahl’s Law on its head. Parallel computing has long been a bit of an interest for me I was also in my last year of computer science studies then and we were covering similar subjects at the time. I found that timing to be especially amusing.

Buttons@programming.dev · 1 year ago

No real use you say? How would they engineer boats without floats?

WhiskyTangoFoxtrot@lemmy.world · 1 year ago

Just invert a sink.

anton@lemmy.blahaj.zone · 1 year ago

Just build submarines, smh my head.

Blackmist@feddit.uk · 1 year ago

I know this is in jest, but if 0.1+0.2!=0.3 hasn’t caught you out at least once, then you haven’t even done any programming.

labsin@sh.itjust.works · 1 year ago

IMO they should just remove the equality operator on floats.

dylanTheDeveloper@lemmy.world · 1 year ago

what if i add more =

nexussapphire@lemm.ee · 1 year ago

Me making my first calculator in c.

CanadaPlus · 1 year ago

That should really be written as the gamma function, because factorial is only defined for members of Z. /s

wischi@programming.dev · 4 months ago

But that’s not because floats are inaccurate. A very very pedantic compiler wouldn’t even let you write f64 x = 0.1; because 0.1 (and also 0.2 and 0.3) can’t be converted to a float exactly (note that 0.5, 0.25, 0.125, etc. can be stored exactly!)

The moment you write f64 x = 0.1; and expect the computer to store that inside a float you already made a wrong assumption. What the computer actually stores is the float value that is as close as possible to 0.1. But not because floats are inaccurate, but because floats are base 2. Note that floating point types in general don’t have to be base 2 - they can be any base (for example decimal types are base 10) but IEEE754 floats are base 2, because it allows for simpler hardware implementations.

An even more pedantic compiler would only let you write floating point in binary like 10.10110001b and let you do the conversation, because it would make it blatantly obvious that most base 10 decimals can’t even be converted without information loss. So the “inaccuracy” is not(!) because float calculations are inaccurate but because many people wrongly assume that the base 10 literal they wrote can be stored inside a float.

Floats are actually really accurate (ignoring some Intel FPU hardware bugs). I skipped a lot of details which you can find here: https://zeta.one/floats-are-not-inaccurate/

Equipped with that knowledge your calculation 0.1+0.2 != 0.3 can simply be translated into: “The closest float to 0.1” + “The closest float to 0.2” is not equal to “The closest float to 0.3”. Keep in mind that the addition itself is perfectly accurate and without any error/rounding(!) on every EEE754 conforming implementation.

xmunk@sh.itjust.works · 1 year ago

Based and precision pilled.

RustyNova@lemmy.world · 1 year ago

Floats are only great if you deal with numbers that have no needs for precision and accuracy. Want to calculate the F cost of an a* node? Floats are good enough.

But every time I need to get any kind of accuracy, I go straight for actual decimal numbers. Unless you are in extreme scenarios, you can afford the extra 64 to 256 bits in your memory

wischi@programming.dev · 4 months ago

That’s not really true and it depends on what you mean. If your decimal datatype has the same number of bits it’s not more accurate than base 2 floats. This is often hidden because many decimal implementations aren’t 64 bit but 128 bit or more. But what it can do is exactly represent base 10 numbers which is not a requirement for a lot of applications.

You can use floats everywhere where you don’t need numbers to be base 10. With base 2 floats the operations couldn’t be more accurate given the limit of 64 bits. But if you write f64 x = 0.1; and one assumes that the computer somehow stored 0.1 inside x they already made a wrong assumption. 0.1 can’t be converted into a float because it’s a periodic in base 2. A very very pedantic compiler wouldn’t even let you compile that and force you to pick a value that actually can be represented.

Down the rabbit hole: https://zeta.one/floats-are-not-inaccurate/

RustyNova@lemmy.world · 4 months ago

Good and bad use-cases for floats

Floats can be used everywhere where it doesn’t matter that you can’t store a 100% accurate base ten representations. For example positions and speeds in 3D games and animations, “analog” values like temperatures, speed of a vehicle, geo positions with longitude and latitude, a persons weight or heart pressure. In fact if you develop games there is no way around 32 bit floats because GPUs are f32 number crunching beasts. Modern 3D games wouldn’t be possible without all those fast f32 calculations.

You shouldn’t use binary floats if you need or expect accurate base ten calculations (addition, subtraction, multiplication, - note that divisions also introduce errors quickly in decimal types) and for dimensions that have a smallest unit that can’t be broken down, for example like money. If you need to handle money just store the amount of cents as integers and only divide by 100 in your display function.

This is exactly my point. Don’t use floats when you need to get accurate stuff, but use it when you need a “feel” for it

wischi@programming.dev · 4 months ago

Don’t use floats when you need to get accurate stuff

Floats are accurate. Could you name a situation (except money) where you think floats are not accurate enough to handle it?

jabjoe@feddit.uk · 1 year ago

As a programmer who grew up without a FPU (Archimedes/Acorn), I have never liked float. But I thought this war had been lost a long time ago. Floats are everywhere. I’ve not done graphics for a bit, but I never saw a graphics card that took any form of fixed point. All geometry you load in is in floats. The shaders all work in floats.

Briefly ARM MCU work was non-float, but loads of those have float support now.

I mean you can tell good low level programmers because of how they feel about floats. But the battle does seam lost. There is lots of bit of technology that has taken turns I don’t like. Sometimes the market/bazaar has spoken and it’s wrong, but you still have to grudgingly go with it or everything is too difficult.

AnUnusualRelic@lemmy.world · 1 year ago

But if you throw an FPU in water, does it not sink?

It’s all lies.

GroteStreet 🦘@aussie.zone · 1 year ago

all work in floats

We even have float16 / float8 now for low-accuracy hi-throughput work.

frezik@midwest.social · edit-2 1 year ago

Even float4. You get +/- 0, 0.5, 1, 1.5, 2, 3, Inf, and two values for NaN.

Come to think of it, the idea of -NaN tickles me a bit. “It’s not a number, but it’s a negative not a number”.

zaphod@feddit.de · edit-2 1 year ago

I think you got that wrong, you got +Inf, -Inf and two NaNs, but they’re both just NaN. As you wrote signed NaN makes no sense, though technically speaking they still have a sign bit.

frezik@midwest.social · 1 year ago

Right, there’s no -NaN. There are two different values of NaN. Which is why I tried to separate that clause, but maybe it wasn’t clear enough.

gandalf_der_12te@feddit.de · 1 year ago

IMO, floats model real observations.

And since there is no precision in nature, there shouldn’t be precision in floats either.

So their odd behavior is actually entirely justified. This is why I can accept them.

jabjoe@feddit.uk · 1 year ago

I just gave up fighting. There is no system that is going to both fast and infinitely precision.

So long ago I worked in a game middleware company. One of the most common problems was skinning in local space vs global space. We kept having customers try and have global skinning and massive worlds, then upset by geometry distortion when miles away from the origin.

swordsmanluke@programming.dev · 1 year ago

How do y’all solve that, out of curiosity?

I’m a hobbyist game dev and when I was playing with large map generation I ended up breaking the world into a hierarchy of map sections. Tiles in a chunk were locally mapped using floats within comfortable boundaries. But when addressing portions of the map, my global coordinates included the chunk coords as an extra pair.

So an object’s location in the 2D world map might be ((122, 45), (12.522, 66.992)), where the first elements are the map chunk location and the last two are the precise “offset” coordinates within that chunk.

It wasn’t the most elegant to work with, but I was still able to generate an essentially limitless map without floating point errors poking holes in my tiling.

I’ve always been curious how that gets done in real game dev though. if you don’t mind sharing, I’d love to learn!

jabjoe@feddit.uk · 1 year ago

That’s pretty neat. Game streaming isn’t that different. It basically loads the adjacent scene blocks ready for you to wonder in that direction. Some load in LOD (Level Of Detail) versions of the scene blocks so you can see into the distance. The further away, the lower the LOD of course. Also, you shouldn’t really keep the same origin, or you will hit the distort geometry issue. Have the origin as the centre of tha current block.

calcopiritus@lemmy.world · 1 year ago

I’d have to boulder check, but I think old handheld consoles like the Gameboy or the DS use fixed-point.

jabjoe@feddit.uk · 1 year ago

I’m pretty sure they do, but the key word there is “old”.

ZILtoid1991@lemmy.world · 1 year ago

Floats make a lot of math way simpler, especially for audio, but then you run into the occasional NaN error.

jabjoe@feddit.uk · 1 year ago

On the PS3 cell processor vector units, any NaN meant zero. Makes life easier if there is errors in the data.

Ephera@lemmy.ml · 1 year ago

I have been thinking that maybe modern programming languages should move away from supporting IEEE 754 all within one data type.

Like, we’ve figured out that having a null value for everything always is a terrible idea. Instead, we’ve started encoding potential absence into our type system with Option or Result types, which also encourages dealing with such absence at the edges of our program, where it should be done.

Well, NaN is null all over again. Instead, we could make the division operator an associated function which returns a Result<f64> and disallow f64 from ever being NaN.

My main concern is interop with the outside world. So, I guess, there would still need to be a IEEE 754 compliant data type. But we could call it ieee_754_f64 to really get on the nerves of anyone wanting to use it when it’s not strictly necessary.

Well, and my secondary concern, which is that AI models would still want to just calculate with tons of floats, without error-handling at every intermediate step, even if it sometimes means that the end result is a shitty vector of NaNs, that would be supported with that, too.

xmunk@sh.itjust.works · 1 year ago

I agree with moving away from floats but I have a far simpler proposal… just use a struct of two integers - a value and an offset. If you want to make it an IEEE standard where the offset is a four bit signed value and the value is just a 28 or 60 bit regular old integer then sure - but I can count the number of times I used floats on one hand and I can count the number of times I wouldn’t have been better off just using two integers on -0 hands.

Floats specifically solve the issue of how to store a ln absurdly large range of values in an extremely modest amount of space - that’s not a problem we need to generalize a solution for. In most cases having values up to the million magnitude with three decimals of precision is good enough. Generally speaking when you do float arithmetic your numbers will be with an order of magnitude or two… most people aren’t adding the length of the universe in seconds to the width of an atom in meters… and if they are floats don’t work anyways.

I think the concept of having a fractionally defined value with a magnitude offset was just deeply flawed from the get-go - we need some way to deal with decimal values on computers but expressing those values as fractions is needlessly imprecise.

RustyNova@lemmy.world · 1 year ago

While I get your proposal, I’d think this would make dealing with float hell. Do you really want to .unwrap() every time you deal with it? Surely not.

One thing that would be great, is that the / operator could work between Result and f64, as well as between Result and Result. Would be like doing a .map(|left| left / right) operation.

Ephera@lemmy.ml · 1 year ago

Well, not every time. Only if I do a division or get an ieee_754_f64 from the outside world. That doesn’t happen terribly often in the applications I’ve worked on.

And if it does go wrong, I do want it to explode right then and there. Worst case would be, if it writes random NaNs into some database and no one knows where they came from.

As for your suggestion with the slash accepting Results, yeah, that could resolve some pain, but I’ve rarely seen multiple divisions being necessary back-to-back and I don’t want people passing around a Result<f64> in the codebase. Then you can’t see where it went wrong anymore either.
So, personally, I wouldn’t put that division operator into the stdlib, but having it available as a library, if someone needs it, would be cool, yeah.

Lmaydev@programming.dev · 1 year ago

Nan isn’t like null at all. It doesn’t mean there isn’t anything. It means the result of the operation is not a number that can be represented.

The only option is that operations that would result in nan are errors. Which doesn’t seem like a great solution.

Ephera@lemmy.ml · 1 year ago

Well, that is what I meant. That NaN is effectively an error state. It’s only like null in that any float can be in this error state, because you can’t rule out this error state via the type system.

Why do you feel like it’s not a great solution to make NaN an explicit error?

CapeWearingAeroplane@sopuli.xyz · 1 year ago

Theres plenty of cases where I would like to do some large calculation that can potentially give a NaN at many intermediate steps. I prefer to check for the NaN at the end of the calculation, rather than have a bunch of checks in every intermediate step.

How I handle the failed calculation is rarely dependent on which intermediate step gave a NaN.

This feels like people want to take away a tool that makes development in the engineering world a whole lot easier because “null bad”, or because they can’t see the use of multiplying 1e27 with 1e-30.

Ephera@lemmy.ml · 1 year ago

Well, I’m not saying that I want to take tools away. I’m explicitly saying that a ieee_754_f64 type could exist. I just want it to be named annoyingly, so anyone who doesn’t know why they should use it, will avoid it.

If you chain a whole bunch of calculations where you don’t care for NaN, that’s also perfectly unproblematic. I just think, it would be helpful to:

Nudge people towards doing a NaN check after such a chain of calculations, because it can be a real pain, if you don’t do it.
Document in the type system that this check has already taken place. If you know that a float can’t be NaN, then you have guarantees that, for example, addition will never produce a NaN. It allows you to remove some of the defensive checks, you might have felt the need to perform on parameters.

Special cases are allowed to exist and shouldn’t be made noticeably more annoying. I just want it to not be the default, because it’s more dangerous and in the average applications, lots of floats are just passed through, so it would make sense to block NaNs right away.

gandalf_der_12te@feddit.de · 1 year ago

What do you do about a dataset which contains 11999 fine numbers, but one of them is NaN because George called in sick that week? Throw away the whole dataset because it doesn’t fit the data type?

gandalf_der_12te@feddit.de · 1 year ago

idk if you ever had to actually work with floats,

but in statistics, you deal with NaNs all the time. Data is absent from the data set. If it would be an error every time, you wouldn’t get anything done.

Kissaki@programming.dev · 1 year ago

It doesn’t have to “error” if the result case is offered and handled.

Lmaydev@programming.dev · 1 year ago

Float processing is at the hardware level. It needs a way to signal when an unrepresented value would be returned.

Ephera@lemmy.ml · 1 year ago

My thinking is that a call to the safe division method would check after the division, whether the result is a NaN. And if it is, then it returns an Error-value, which you can handle.

Obviously, you could do the same with a NaN by just throwing an if-else after any division statement, but I would like to enforce it in the type system that this check is done.

Lmaydev@programming.dev · edit-2 1 year ago

I feel like that’s adding overhead to every operation to catch the few operations that could result in a nan.

But I guess you could provide alternative safe versions of float operations to account for this. Which may be what you meant thinking about it lol

Ephera@lemmy.ml · 1 year ago

I would want the safe version to be the default, but yeah, both should exist. 🙃

kekwa@lemmy.world · 1 year ago

Float is bloat!

Magnetar@feddit.de · 1 year ago

Call me when you found a way to encode transcendental numbers.

YTG123@feddit.ch · edit-2 1 year ago

Perhaps you can encode them as computation (i.e. a function of arbitrary precision)

Magnetar@feddit.de · 1 year ago

Hard to do as those functions are often limits and need infinite function applications. I’m telling you, math.PI is a finite lie!

smeg@feddit.uk · 1 year ago

Do we even have a good way of encoding them in real life without computers?

fossphi@lemm.ee · 1 year ago

Just think about them real hard

Magnetar@feddit.de · 1 year ago

wischi@programming.dev · 4 months ago

Sure, just asign them a random Greek letter and call it a day 🤣

smeg@feddit.uk · 4 months ago

Doesn’t even need to be Greek!

Knock_Knock_Lemmy_In@lemmy.world · 1 year ago

Here you go

ⲡ

Chadus_Maximus@lemm.ee · edit-2 1 year ago

May I propose a dedicated circuit (analog because you can only ever approximate their value) that stores and returns transcendental/irrational numbers exclusively? We can just assume they’re going to be whatever value we need whenever we need them.

frezik@midwest.social · 1 year ago

Wouldn’t noise in the circuit mean it’d only be reliable to certain level of precision, anyway?

Chadus_Maximus@lemm.ee · edit-2 1 year ago

I mean, every irrational number used in computation is reliable to a certain level of precision. Just because the current (heh) methods aren’t precise enough doesn’t mean they’ll never be.

anton@lemmy.blahaj.zone · 1 year ago

You can always increase the precision of a computation, analog signals are limited by quantum physics.

Psythik@lemmy.world · 1 year ago

While we’re at it, what the hell is -0 and how does it differ from 0?

Reddfugee42@lemmy.world · 1 year ago

It’s the negative version

ShepherdPie@midwest.social · 1 year ago

So it’s just like 0 but with an evil goatee?

Knock_Knock_Lemmy_In@lemmy.world · 1 year ago

Look at the graph of y=tan(x)+ⲡ/2

-0 and +0 are completely different.

computerscientistI@lemm.ee · 1 year ago

For integers it really doesn’t exist. An algorithm for multiplying an integer with -1 is: Invert all bits and add 1 to the right-most bit. You can do that for 0 of course, it won’t hurt.

dejected_warp_core@lemmy.world · 1 year ago

There are probably a lot of scientific applications (e.g. statistics, audio, 3D graphics) where exponential notation is the norm and there’s an understanding about precision and significant digits/bits. It’s a space where fixed-point would absolutely destroy performance, because you’d need as many bits as required to store your largest terms. Yes, NaN and negative zero are utter disasters in the corners of the IEEE spec, but so is trying to do math with 256bit integers.

For a practical explanation about how stark a difference this is, the PlayStation (one) uses an integer z-buffer (“fixed point”). This is responsible for the vertex popping/warping that the platform is known for. Floating-point z-buffers became the norm almost immediately after the console’s launch, and we’ve used them ever since.

CrayonRosary@lemmy.world · 1 year ago

While it’s true the PS1 couldn’t do floating point math, it did NOT have a z-buffer at all.

https://www.ncesc.com/gaming-faq/does-ps1-have-z-buffer/

anton@lemmy.blahaj.zone · 1 year ago

What’s the problem with -0?
It conceptually makes sense for to negativ values to close to 0 to be represented as -0.
In practice I have never seen a problem with -0.

On NaN: While its use cases can nowadays be replaced with language constructs like result types, it was created before exceptions or sum types. The way it propagates kind of mirrors Haskells monadic Maybe.
We should be demanding more and better wrapper types from our language/standard library designers.

33550336@lemmy.world · 1 year ago

From time to time I see this pattern in memes, but what is the original meme / situation?

Sadbutdru@sopuli.xyz · 1 year ago

It’s my favourite format. I think the original was ‘stop doing math’

33550336@lemmy.world · 1 year ago

Thank you 😁

gandalf_der_12te@feddit.de · edit-2 1 year ago

math are numbers and therefore non-physical, and therefore esoterical, so stop giving it credit.

/s

Eyck_of_denesle@lemmy.zip · 1 year ago

Out of topic but how does one get a profile pic on lemmy? Also love you ken.

gandalf_der_12te@feddit.de · 1 year ago

you can configure it in the web interface. just go to your profile

33550336@lemmy.world · 1 year ago

Thank you!

Go to “Settings” (cog wheel) and then “Avatar”:

dejected_warp_core@lemmy.world · 1 year ago

deleted by creator

ripcord@lemmy.world · 1 year ago

That doesn’t really answer the question, which is about the origins of the meme templete

dejected_warp_core@lemmy.world · 1 year ago

Yikes. placed this in the wrong spot. Thank you.

LinearArray@programming.dev · 1 year ago

Precision piled.

rimjob_rainer@discuss.tchncs.de · 1 year ago

The meme is right for once

qevlarr@lemmy.world · edit-2 1 year ago

I’m like, it’s that code on the right what I think it is? And it is! I’m so happy now

https://en.wikipedia.org/wiki/Fast_inverse_square_root