While downsampling with the nearest neighbor method is certainly suboptimal (as I've demonstrated here), it is a quick and dirty method to visualize aliasing artifacts of a camera without optical low-pass filter (OLPF) without the risk of false color demosaicing. It is not going to give exactly the same result as the Foveon sensor will produce, but it does create a sharpness/crispness that's somewhat similar, and which seems to be in a large part the appeal of Foveon images to many observers.
I see that you are trying to help people do this for themselves.
But in the interest of doing it "right" (even if that makes it less accessible), I would suggest the following:
A OLPF-less Foveon sensor should (ideally) integrate an unweighted area of dimensions NxM and sample the photon count. We would like to see what that leads to in terms of IQ,
1. Find a good high-resolution image with pleny of details up to the Nyquist limit, but few debayering artifacts or moire.
2. Optionally, do image blurring (using e.g. Gaussian blur)
3. Do a integer decimation of either 2x2, 3x3 etc by averaging a neighborhood
By doing/not doing step 2, one could get an idea of what the benefit/drawback of OLPF would be for a Foveon sensor.
Of course, this depends on a range of idealised models. I assume that the accumulated blurring from motion-blur, optics and OLPF might as well be modelled by a Gaussian, but real-life may not be that simple.
I think that adhering to/breaking the Shannon-Nyquist sampling theory can be explained (somewhat exaggerated) by the following analogy:
When using a topography map to navigate the terrain. Would you rather that each elevation reading represented:
A) An infinitely small point in the centre of each square
B) A flat average covering exactly the representation square
C) A weighted average that is most depending on the center value, less to the edges of the square, but also of values outside the square
A) Means no prefiltering, maximum acuteness and maximum aliasing/moire. It also cannot be implemented in real sensors. C) can mean optimal sampling in the Nyquist sense, but that cannot be implemented fully in a sensor either. B) Is what ideal OLPF-less Foveon sensors would do, and contains characteristics of both A) and C). OLPF-equipped Foveons would be somewhere between B) and C).
-h