A good first step is to interpolate the missing data and fit an ordinary linear regression with lm to see what the baseline result looks like. After that, prefer a random forest where possible. The ranger function from the ranger package is recommended because it is fast and efficient, and its speed advantage becomes obvious when the data set is large.
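When ranking feature importance, a random forest typically uses permutation: the values of one feature are randomly shuffled, and the resulting drop in prediction accuracy measures how important that feature is. In ranger, this measure is requested with importance = "permutation".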
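The permutation idea itself does not depend on the model, so it can be sketched in a few lines of plain Python. This is an illustrative sketch under stated assumptions, not ranger's implementation: `toy_model` is a hypothetical stand-in for a fitted model, the data are synthetic, and the error is measured on one data set, whereas a real random forest would normally use each tree's out-of-bag samples.

```python
import random


def mse(model, rows, targets):
    """Mean squared error of model predictions."""
    return sum((model(r) - t) ** 2 for r, t in zip(rows, targets)) / len(rows)


def permutation_importance(model, rows, targets, n_repeats=20, seed=0):
    """For each feature, shuffle its column and report the average
    increase in error compared with the unshuffled data."""
    rng = random.Random(seed)
    baseline = mse(model, rows, targets)
    importances = []
    for j in range(len(rows[0])):
        increases = []
        for _ in range(n_repeats):
            column = [r[j] for r in rows]
            rng.shuffle(column)  # break the link between feature j and the target
            shuffled = [r[:j] + [v] + r[j + 1:] for r, v in zip(rows, column)]
            increases.append(mse(model, shuffled, targets) - baseline)
        importances.append(sum(increases) / n_repeats)
    return importances


# Hypothetical fitted model: uses feature 0 and ignores feature 1.
def toy_model(row):
    return 3.0 * row[0]


data_rng = random.Random(42)
rows = [[data_rng.random(), data_rng.random()] for _ in range(200)]
targets = [3.0 * r[0] for r in rows]

scores = permutation_importance(toy_model, rows, targets)
print(scores)  # feature 0 gets a positive score, feature 1 gets zero
```

A feature the model relies on shows a large error increase when shuffled, while a feature the model ignores shows none, which is what makes the scores usable as a ranking.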