publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2024

  1. InternLM2 Technical Report
    Zheng Cai, Maosong Cao, Haojiong Chen, and 8 more authors
    arXiv preprint arXiv:2403.17297, 2024
  2. AAAI
    VIGC: Visual instruction generation and correction
    Bin Wang, Fan Wu, Xiao Han, and 8 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  3. Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations
    Jiaxing Sun, Weiquan Huang, Jiang Wu, and 5 more authors
    arXiv preprint arXiv:2403.14112, 2024
  4. ITGRS
    Weakly-supervised 3D Building Reconstruction from Monocular Remote Sensing Images
    Weijia Li, Zhenghao Hu, Lingxuan Meng, and 7 more authors
    IEEE Transactions on Geoscience and Remote Sensing, 2024
  5. LOCR: Location-Guided Transformer for Optical Character Recognition
    Yu Sun, Dongzhan Zhou, Chen Lin, and 3 more authors
    arXiv preprint arXiv:2403.02127, 2024
  6. WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
    Jiantao Qiu, Haijun Lv, Zhenjiang Jin, and 8 more authors
    arXiv preprint arXiv:2402.19282, 2024
  7. SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation
    Shuangrui Ding, Zihan Liu, Xiaoyi Dong, and 5 more authors
    arXiv preprint arXiv:2402.17645, 2024
  8. LongWanjuan: Towards Systematic Measurement for Long Text Quality
    Kai Lv, Xiaoran Liu, Qipeng Guo, and 4 more authors
    arXiv preprint arXiv:2402.13583, 2024
  9. SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
    Peng Gao, Renrui Zhang, Chris Liu, and 8 more authors
    arXiv preprint arXiv:2402.05935, 2024
  10. IJAEOG
    Exploring the user guidance for more accurate building segmentation from high-resolution remote sensing images
    Dinghao Yang, Bin Wang, Weijia Li, and 1 more author
    International Journal of Applied Earth Observation and Geoinformation, 2024
  11. InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model
    Xiaoyi Dong, Pan Zhang, Yuhang Zang, and 8 more authors
    arXiv preprint arXiv:2401.16420, 2024

2023

  1. Parrot Captions Teach CLIP to Spot Text
    Yiqi Lin, Conghui He, Alex Jinpeng Wang, and 3 more authors
    arXiv preprint arXiv:2312.14232, 2023
  2. OPERA: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation
    Qidong Huang, Xiaoyi Dong, Pan Zhang, and 6 more authors
    arXiv preprint arXiv:2311.17911, 2023
  3. Beyond hallucinations: Enhancing LVLMs through hallucination-aware direct preference optimization
    Zhiyuan Zhao, Bin Wang, Linke Ouyang, and 3 more authors
    arXiv preprint arXiv:2311.16839, 2023
  4. ShareGPT4V: Improving large multi-modal models with better captions
    Lin Chen, Jisong Li, Xiaoyi Dong, and 5 more authors
    arXiv preprint arXiv:2311.12793, 2023
  5. InternLM-XComposer: A vision-language large model for advanced text-image comprehension and composition
    Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, and 8 more authors
    arXiv preprint arXiv:2309.15112, 2023
  6. MiChao-HuaFen 1.0: A Specialized Pre-trained Corpus Dataset for Domain-specific Large Models
    Yidong Liu, Conghui He, Wei Li, and 4 more authors
    arXiv preprint arXiv:2309.13079, 2023
  7. DropQueries: A Simple Way to Discover Comprehensive Segment Representations
    Haojie Ding, Bin Wang, Guoliang Kang, and 4 more authors
    IEEE Transactions on Multimedia, 2023
  8. MLLM-DataEngine: An iterative refinement approach for MLLM
    Zhiyuan Zhao, Linke Ouyang, Bin Wang, and 5 more authors
    arXiv preprint arXiv:2308.13566, 2023
  9. WanJuan: A comprehensive multimodal dataset for advancing English and Chinese large models
    Conghui He, Zhenjiang Jin, Chao Xu, and 6 more authors
    arXiv preprint arXiv:2308.10755, 2023
  10. InternVid: A large-scale video-text dataset for multimodal understanding and generation
    Yi Wang, Yinan He, Yizhuo Li, and 8 more authors
    arXiv preprint arXiv:2307.06942, 2023
  11. MMBench: Is your multi-modal model an all-around player?
    Yuan Liu, Haodong Duan, Yuanhan Zhang, and 8 more authors
    arXiv preprint arXiv:2307.06281, 2023
  12. ISPRS
    Joint semantic–geometric learning for polygonal building segmentation from high-resolution remote sensing images
    Weijia Li, Wenqian Zhao, Jinhua Yu, and 4 more authors
    ISPRS Journal of Photogrammetry and Remote Sensing, 2023
  13. LLaMA-Adapter V2: Parameter-efficient visual instruction model
    Peng Gao, Jiaming Han, Renrui Zhang, and 8 more authors
    arXiv preprint arXiv:2304.15010, 2023
  14. V3Det: Vast vocabulary visual detection dataset
    Jiaqi Wang, Pan Zhang, Tao Chu, and 6 more authors
    arXiv preprint arXiv:2304.03752, 2023
  15. CVPR
    Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving
    Xiaosong Jia, Penghao Wu, Li Chen, and 4 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
  16. CVPR
    OmniCity: Omnipotent city understanding with multi-level and multi-view images
    Weijia Li, Yawen Lai, Linning Xu, and 5 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
  17. SEPT: Towards Scalable and Efficient Visual Pre-Training
    Yiqi Lin, Huabin Zheng, Huaping Zhong, and 4 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2023

2022

  1. ECCV
    PersFormer: 3D lane detection via perspective transformer and the OpenLane benchmark
    Li Chen, Chonghao Sima, Yang Li, and 8 more authors
    In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXVIII, 2022
  2. Unified interactive image matting
    Stephen Yang, Bin Wang, Weijia Li, and 3 more authors
    arXiv preprint arXiv:2205.08324, 2022

2021

  1. AAAI
    Joint semantic-geometric learning for polygonal building segmentation
    Weijia Li, Wenqian Zhao, Huaping Zhong, and 2 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2021
  2. INTERN: A new learning paradigm towards general vision
    Jing Shao, Siyu Chen, Yangguang Li, and 8 more authors
    arXiv preprint arXiv:2111.08687, 2021
  3. ICCV
    Influence selection for active learning
    Zhuoming Liu, Hao Ding, Huaping Zhong, and 3 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021
  4. ICCV
    3D building reconstruction from monocular remote sensing images
    Weijia Li, Lingxuan Meng, Jinwang Wang, and 3 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021

2020

  1. UIST
    FLAVA: Find, localize, adjust and verify to annotate LiDAR-based point clouds
    Tai Wang, Conghui He, Zhe Wang, and 2 more authors
    In Adjunct Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology, 2020

2019

  1. RS
    A real-time tree crown detection approach for large-scale remote sensing images on FPGAs
    Weijia Li, Conghui He, Haohuan Fu, and 5 more authors
    Remote Sensing, 2019
  2. RS
    Semantic segmentation-based building footprint extraction using very high-resolution satellite images and multi-source GIS data
    Weijia Li, Conghui He, Jiarui Fang, and 3 more authors
    Remote Sensing, 2019
  3. TPDS
    Optimizing finite volume method solvers on Nvidia GPUs
    Jingheng Xu, Haohuan Fu, Wayne Luk, and 7 more authors
    IEEE Transactions on Parallel and Distributed Systems, 2019
  4. BIGDATA
    Finding mutual X at WeChat-scale social network in ten minutes
    Conghui He, Shijie Sun, Benli Li, and 2 more authors
    In 2019 IEEE International Conference on Big Data (Big Data), 2019

2018

  1. SC18
    Simulating the Wenchuan earthquake with accurate surface topography on Sunway TaihuLight
    Bingwei Chen, Haohuan Fu, Yanwen Wei, and 8 more authors
    In SC18: International Conference for High Performance Computing, Networking, Storage and Analysis, 2018
  2. CLUSTER
    swCaffe: A parallel framework for accelerating deep learning applications on Sunway TaihuLight
    Liandeng Li, Jiarui Fang, Haohuan Fu, and 5 more authors
    In 2018 IEEE International Conference on Cluster Computing (CLUSTER), 2018
  3. CVPRW
    Semantic segmentation based building extraction method using multi-source GIS map datasets and satellite imagery
    Weijia Li, Conghui He, Jiarui Fang, and 1 more author
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017

  1. TC
    A fully-pipelined hardware design for Gaussian mixture models
    Conghui He, Haohuan Fu, Ce Guo, and 2 more authors
    IEEE Transactions on Computers, 2017
  2. SC17
    9-Pflops nonlinear earthquake simulation on Sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenarios
    Haohuan Fu, Conghui He, Bingwei Chen, and 8 more authors
    In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2017
  3. ICFPT
    An FPGA-based tree crown detection approach for remote sensing images
    Weijia Li, Conghui He, Haohuan Fu, and 1 more author
    In 2017 International Conference on Field Programmable Technology (ICFPT), 2017
  4. FPL
    Exploring the potential of reconfigurable platforms for order book update
    Conghui He, Haohuan Fu, Wayne Luk, and 2 more authors
    In 2017 27th International Conference on Field Programmable Logic and Applications (FPL), 2017
  5. EAGE
    Approximating Q Propagations for Elastic Modeling on GPUs
    C He, H Fu, Y Shen, and 2 more authors
    In 79th EAGE Conference and Exhibition 2017, 2017
  6. FCCM
    A Nanosecond-Level Hybrid Table Design for Financial Market Data Generators
    Haohuan Fu, Conghui He, Wayne Luk, and 2 more authors
    In 2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2017
  7. FPGA
    Accelerating Financial Market Server through Hybrid List Design
    Haohuan Fu, Conghui He, Huabin Ruan, and 6 more authors
    In Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2017

2016

  1. SC16
    Refactoring and optimizing the Community Atmosphere Model (CAM) on the Sunway TaihuLight supercomputer
    Haohuan Fu, Junfeng Liao, Wei Xue, and 8 more authors
    In SC’16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2016
  2. C&G
    A time-space domain stereo finite difference method for 3D scalar wave propagation
    Yushu Chen, Guangwen Yang, Xiao Ma, and 2 more authors
    Computers & Geosciences, 2016

2015

  1. SEG
    A GPU-based Parallel Beam Migration Design
    Conghui He, Haohuan Fu, Bangtian Liu, and 4 more authors
    In 2015 SEG Annual Meeting, 2015
  2. EAGE
    Ensemble full wave inversion with source encoding
    C He, Y Chen, H Fu, and 1 more author
    In 77th EAGE Conference and Exhibition 2015, 2015

2014

  1. RS
    Global-scale associations of vegetation phenology with rainfall and temperature at a high spatio-temporal resolution
    Nicholas Clinton, Le Yu, Haohuan Fu, and 2 more authors
    Remote Sensing, 2014