The pair on the left side of the frame anchors the "ambiguous" and "narrative" elements, for me. The central figure and the right side of the frame are just context, and I admit that's a lot of context for not a lot of street.
The composition feels quite nice to me. Slobodan has a point that it's a bit open, but I am ok with a lot of space in photographs. More than most people, I have noticed, I am comfortable with forms having a lot of elbow room. The geometry is not particularly strong, but I like the distribution of figures around the central group of 3 windows.
I only see one, or possibly two, hobos in here. Everyone else is, probably, waiting for a bus, or I prefer to think, waiting for something.