GeoChat: Grounded Large Vision-Language Model for Remote Sensing
Computer Vision and Pattern Recognition (2024)
Key words
Remote Sensing, Vision-language Models, Question Answering, Spatial Coordinates, Small Objects, Visual Content, Remote Sensing Images, Scene Classification, Multimodal Dataset, Remote Sensing Imagery, Visual Question Answering, High-resolution Remote Sensing Images, Natural Language, Object Detection, Bounding Box, Object Location, Language Model, Short Description, Evaluation Dataset, Visual Scene, Linear Layer, Image Captioning, Conversation Task, Ground-truth Box, Visual Classification, Image Descriptors, Object Detection Dataset