See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model