All of these issues have been thoughtfully described, with the pros and cons inherent in the process, here:
https://www.fastrawviewer.com/blog/in-camera-histogram-doesn%27t-represent-exposurehttps://www.rawdigger.com/howtouse/exposure-for-raw-or-for-jpegshttps://www.rawdigger.com/howtouse/beware-histogramhttps://www.rawdigger.com/howtouse/lightmeter-calibrationPutting aside for a moment the idea that the histogram is influenced by the JPEG produced by the user's cameras settings, there are some assumptions that one is making when attempting to establish the camera's histogram as a useful tool for evaluating raw exposure - the most important being the white balance itself, as this will dramatically change the per-channel "exposure" (brightness) indicated by the histogram relative to the actual exposure of the sensor. In daylight, the green channel is a good proxy for evaluating sensor saturation as red and blue will channels will receive less exposure than green in the raw data. But consider non-daylit scenes - the prevailing lighting will affect the usefulness of the histogram as well and may completely skew the histogram away from the underlying raw exposure when white balanced, either toward green or toward the "correct" white balance. Finally, because we want to expose, more or less, the sensor for highlight detail, we need to be careful with respect to the actual color of the critical highlights (either the inherent color of the highlight surface or the lighting falling on and reflecting off of the surface) when evaluating the per-channel clipping.
Unless one understands how the JPEG histogram relates to the raw exposure, simply setting a green WB (akin to UniWB) will only get you so far - other in-camera settings will affect where the tones are placed on the histogram such that, even if one fixes a constant exposure (aperture and SS), changes to the in-camera settings that affect the JPEG (color profiles, DR settings, etc.) will change the JPEG histogram even when sensor exposure remains constant.
I have found that I can get optimal exposure 99% of the time by metering the critical highlights (and it works better if these are close to neutral, clouds being a good example) with a spot meter and adding approximately 3 to 3.3 EV to the metered reading (usually be changing SS). On my Sekonic meter I can enter this offset as a filter compensation value and, at the press of a button, get the compensation in SS that I need to expose for the metered highlights.
If you really want to push things to the edge, you can test your various raw converters and see how each responds to clipping in one or more channels (raw saturation) and evaluate how much extra compensation you can get away with if the highlights are neutral and the raw converter can RECONSTRUCT the partially clipped highlights from the intact raw channel data. In some circumstances I can dial in over 4EV on my Fujifilm X-H1 with CaptureOne and reconstruct neutral highlights, for example. I do not advise setting exposure based on your raw converter though, but it is good to know what result a combination of camera exposure plus software will give you for the times when you really need to turn exposure up to 11.
All of this is predicated upon studying the content of the raw data using Raw Digger or a similar tool.
Kirk