Pixels imply sampling at some spatial frequency. Anything above half that frequency will be "reflected" around the sampling frequency and become part of the signal.
Once it's part of the signal... it's part of the signal. There is no way to tell whether it was put there by the thing being photographed, or via aliasing, so unless the digital processing has god's phone number, it can't possibly fix it.
Maybe it's easier to understand in audio, where we think more naturally in frequency terms. Suppose you are using an AD converter sampling at 48kHz, without an AA filter. In that case, a tone arriving at 25kHz will be encoded in exactly the same way as a tone at 23kHz. So there is nothing to be done... In practice, you either hope there is no >24kHz sound present, or you apply an analogue AA filter to remove it.
Of course what happened in digital audio was that AD technology improved, and it was possible to sample at (for eg) 192kHz and you only needed to worry about sounds above 170kHz: those between 96kHz and 170kHz would be encoded between 96kHz and 22kHz, and could be filtered in the digital domain. The important difference is that ears only work up to 16-20kHz... whereas in spatial sampling, you can enlarge the image change the spatial frequency. The audio equivalent of pixel peeping would be to slow the play-back speed, but so far as I'm aware no one has been nuts enough to propose this...
So... yes you can make use of the higher res of a 50Mpx sensor to do digital AA, provided you're happy to get a de-aliased version of a lower-resolution. Ie you could remove the Moiré by down-sizing your file to be equivalent to that from a 12Mpx sensor... but no one wants to do that.
Btw, this essential information issue is why there can never be a perfect de-noising or sharpening technique: necessarily if you photograph something that looks exactly like noise, and apply de-noise, it will be removed. If you photograph something with slightly fuzzy edges and sharpen, the fuzziness will disappear. Those techniques only work because you know that what you photographed, and that it didn't look like noise or have fuzzy edges. If you apply them automatically, you're asking the software to guess what you photographed. In the case of case of digital AA, you would have to know that you hadn't photographed something that actually looked like a Moiré pattern... or be happy to live with a lower resolution that averages it away. It's a cake vs full stomach issue.