STIC-LVLM (Self-Training on Image Comprehension)