In short, set the centrepoint of the scene with an XROrigin3D node,
give it a child XRCamera3D to enable vision, and add two
XRController3D nodes (one for each hand, also children of the origin) for the controllers.
REMEMBER: get the XR interface with XRServer::find_interface(), disable vsync at runtime using DisplayServer::window_set_vsync_mode(), and set Viewport::use_xr to true.
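
The steps above could be sketched in GDScript roughly as follows. This is only a minimal sketch under a few assumptions that are not in the notes: the interface name "OpenXR", OpenXR being enabled in the project settings so the interface is already initialized at startup, the script being attached to the root Node3D of the main scene, and the "left_hand" / "right_hand" tracker names. The nodes are created in code here purely for illustration; adding them in the scene editor works just as well.

```gdscript
extends Node3D

func _ready() -> void:
	# Build the rig: XROrigin3D with a camera and one controller per hand.
	var origin := XROrigin3D.new()
	origin.add_child(XRCamera3D.new())
	for tracker_name in [&"left_hand", &"right_hand"]:  # assumed tracker names
		var controller := XRController3D.new()
		controller.tracker = tracker_name
		origin.add_child(controller)
	add_child(origin)

	# Look up the XR interface. "OpenXR" is an assumption, not from the notes.
	var xr_interface: XRInterface = XRServer.find_interface("OpenXR")
	if xr_interface and xr_interface.is_initialized():
		# Disable vsync at runtime so the headset drives the frame timing.
		DisplayServer.window_set_vsync_mode(DisplayServer.VSYNC_DISABLED)
		# Send the main viewport's output to the headset.
		get_viewport().use_xr = true
	else:
		push_warning("XR interface not found or not initialized.")
```

If the interface is not initialized automatically in your project, calling xr_interface.initialize() before the check may be needed.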