This is something you do as part of post-processing. The model returns landmarks in normalized coordinates, and you need to map them back to the original image resolution.
Here is an example of how it's done:
https://github.com/axinc-ai/ailia-models/blob/master/pose_estimation/blazepose/blazepose.py#L212
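For reference, the basic idea can be sketched in a few lines. This is not the code from the link above, just a minimal illustration. It assumes the landmarks are (x, y) pairs normalized to the [0, 1] range relative to the full image, with no padding or cropping applied before inference. The function name `denormalize_landmarks` is made up for this example.

```python
def denormalize_landmarks(landmarks, image_width, image_height):
    """Map normalized (x, y) landmarks to pixel coordinates.

    Assumption: x and y are in [0, 1] relative to the full original image.
    """
    return [(x * image_width, y * image_height) for x, y in landmarks]


# Two hypothetical landmarks on a 640x480 image.
normalized = [(0.5, 0.5), (0.25, 0.75)]
pixels = denormalize_landmarks(normalized, image_width=640, image_height=480)
print(pixels)
```

If the image was resized with padding (letterboxing) or cropped to a region of interest before being fed to the model, you would also have to undo that transform, so the plain multiplication above would not be enough on its own.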