I’ll try.
As you can see, the sensor in the camera would need to move more to compensate my movement of the handheld lens. Usually we have a firm grip at the camera and the other hand carries the weight of a long lens. Switch off any vibration compensation (OS or IBIS or both) and you’ll see how unsteady the image in the finder gets.
You can imagine a balance. The OS element in the lens is close to the crossing point of the light beams. This little lens element only needs a fraction of the sensor distance to compensate the movement. It’s bit less precise than the sensor movement, so a little hint of movement is visible when using only OS. But if the camera is able to calculate sensor and OS movements, speed of OS and precision of IBIS come together and can compensate large vibrations still precisely and fast.
And I admit, I’m not familiar enough with optical vocabulary in English. But I’m sure, if you google “difference of OS and IBIS” you get more and better explanations.