publications | Conghui He

2024

InternLM2 Technical Report

Zheng Cai , Maosong Cao , Haojiong Chen , and 8 more authors

arXiv preprint arXiv:2403.17297, 2024

arXiv
AAAI

Vigc: Visual instruction generation and correction

Bin Wang , Fan Wu , Xiao Han , and 8 more authors

In Proceedings of the AAAI Conference on Artificial Intelligence , 2024

PDF
Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations

Jiaxing Sun , Weiquan Huang , Jiang Wu , and 5 more authors

arXiv preprint arXiv:2403.14112, 2024

arXiv
ITGRS

Weakly-supervised 3D Building Reconstruction from Monocular Remote Sensing Images

Weijia Li , Zhenghao Hu , Lingxuan Meng , and 7 more authors

IEEE Transactions on Geoscience and Remote Sensing, 2024

PDF
LOCR: Location-Guided Transformer for Optical Character Recognition

Yu Sun , Dongzhan Zhou , Chen Lin , and 3 more authors

arXiv preprint arXiv:2403.02127, 2024

arXiv
WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

Jiantao Qiu , Haijun Lv , Zhenjiang Jin , and 8 more authors

arXiv preprint arXiv:2402.19282, 2024

arXiv
SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation

Shuangrui Ding , Zihan Liu , Xiaoyi Dong , and 5 more authors

arXiv preprint arXiv:2402.17645, 2024

arXiv
LongWanjuan: Towards Systematic Measurement for Long Text Quality

Kai Lv , Xiaoran Liu , Qipeng Guo , and 4 more authors

arXiv preprint arXiv:2402.13583, 2024

arXiv
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Peng Gao , Renrui Zhang , Chris Liu , and 8 more authors

arXiv preprint arXiv:2402.05935, 2024

arXiv
IJAEOG

Exploring the user guidance for more accurate building segmentation from high-resolution remote sensing images

Dinghao Yang , Bin Wang , Weijia Li , and 1 more author

International Journal of Applied Earth Observation and Geoinformation, 2024

PDF
InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model

Xiaoyi Dong , Pan Zhang , Yuhang Zang , and 8 more authors

arXiv preprint arXiv:2401.16420, 2024

arXiv

2023

Parrot Captions Teach CLIP to Spot Text

Yiqi Lin , Conghui He , Alex Jinpeng Wang , and 3 more authors

arXiv preprint arXiv:2312.14232, 2023

arXiv
Opera: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation

Qidong Huang , Xiaoyi Dong , Pan Zhang , and 6 more authors

arXiv preprint arXiv:2311.17911, 2023

arXiv
Beyond hallucinations: Enhancing lvlms through hallucination-aware direct preference optimization

Zhiyuan Zhao , Bin Wang , Linke Ouyang , and 3 more authors

arXiv preprint arXiv:2311.16839, 2023

arXiv
Sharegpt4v: Improving large multi-modal models with better captions

Lin Chen , Jisong Li , Xiaoyi Dong , and 5 more authors

arXiv preprint arXiv:2311.12793, 2023

arXiv
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition

Pan Zhang , Xiaoyi Dong Bin Wang , Yuhang Cao , and 8 more authors

arXiv preprint arXiv:2309.15112, 2023

arXiv
MiChao-HuaFen 1.0: A Specialized Pre-trained Corpus Dataset for Domain-specific Large Models

Yidong Liu , Conghui He , Wei Li , and 4 more authors

arXiv preprint arXiv:2309.13079, 2023

arXiv
DropQueries: A Simple Way to Discover Comprehensive Segment Representations

Haojie Ding , Bin Wang , Guoliang Kang , and 4 more authors

IEEE Transactions on Multimedia, 2023
Mllm-dataengine: An iterative refinement approach for mllm

Zhiyuan Zhao , Linke Ouyang , Bin Wang , and 5 more authors

arXiv preprint arXiv:2308.13566, 2023

arXiv
Wanjuan: A comprehensive multimodal dataset for advancing english and chinese large models

Conghui He , Zhenjiang Jin , Chao Xu , and 6 more authors

arXiv preprint arXiv:2308.10755, 2023

arXiv
Internvid: A large-scale video-text dataset for multimodal understanding and generation

Yi Wang , Yinan He , Yizhuo Li , and 8 more authors

arXiv preprint arXiv:2307.06942, 2023

arXiv
Mmbench: Is your multi-modal model an all-around player?

Yuan Liu , Haodong Duan , Yuanhan Zhang , and 8 more authors

arXiv preprint arXiv:2307.06281, 2023

arXiv
ISPRS

Joint semantic–geometric learning for polygonal building segmentation from high-resolution remote sensing images

Weijia Li , Wenqian Zhao , Jinhua Yu , and 4 more authors

ISPRS Journal of Photogrammetry and Remote Sensing, 2023

HTML PDF
Llama-adapter v2: Parameter-efficient visual instruction model

Peng Gao , Jiaming Han , Renrui Zhang , and 8 more authors

arXiv preprint arXiv:2304.15010, 2023

arXiv Code
V3det: Vast vocabulary visual detection dataset

Jiaqi Wang , Pan Zhang , Tao Chu , and 6 more authors

arXiv preprint arXiv:2304.03752, 2023

arXiv
CVPR

Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving

Xiaosong Jia , Penghao Wu , Li Chen , and 4 more authors

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , 2023

arXiv Code
CVPR

OmniCity: Omnipotent city understanding with multi-level and multi-view images

Weijia Li , Yawen Lai , Linning Xu , and 5 more authors

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , 2023

arXiv PDF
SEPT: Towards Scalable and Efficient Visual Pre-Training

Yiqi Lin , Huabin Zheng , Huaping Zhong , and 4 more authors

In Proceedings of the AAAI Conference on Artificial Intelligence , 2023

arXiv

2022

ECCV

Persformer: 3d lane detection via perspective transformer and the openlane benchmark

Li Chen , Chonghao Sima , Yang Li , and 8 more authors

In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXVIII , 2022

arXiv HTML Code
Unified interactive image matting

Stephen Yang , Bin Wang , Weijia Li , and 3 more authors

arXiv preprint arXiv:2205.08324, 2022

arXiv

2021

AAAI

Joint semantic-geometric learning for polygonal building segmentation

Weijia Li , Wenqian Zhao , Huaping Zhong , and 2 more authors

In Proceedings of the AAAI Conference on Artificial Intelligence , 2021

HTML PDF
Intern: A new learning paradigm towards general vision

Jing Shao , Siyu Chen , Yangguang Li , and 8 more authors

arXiv preprint arXiv:2111.08687, 2021

arXiv HTML
ICCV

Influence selection for active learning

Zhuoming Liu , Hao Ding , Huaping Zhong , and 3 more authors

In Proceedings of the IEEE/CVF International Conference on Computer Vision , 2021

arXiv PDF Code
ICCV

3d building reconstruction from monocular remote sensing images

Weijia Li , Lingxuan Meng , Jinwang Wang , and 3 more authors

In Proceedings of the IEEE/CVF International Conference on Computer Vision , 2021

PDF

2020

UIST

Flava: Find, localize, adjust and verify to annotate lidar-based point clouds

Tai Wang , Conghui He , Zhe Wang , and 2 more authors

In Adjunct Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology , 2020

arXiv

2019

RS

A real-time tree crown detection approach for large-scale remote sensing images on FPGAs

Weijia Li , Conghui He , Haohuan Fu , and 5 more authors

Remote Sensing, 2019

HTML PDF
RS

Semantic segmentation-based building footprint extraction using very high-resolution satellite images and multi-source GIS data

Weijia Li , Conghui He , Jiarui Fang , and 3 more authors

Remote Sensing, 2019

HTML PDF
TPDS

Optimizing finite volume method solvers on Nvidia GPUs

Jingheng Xu , Haohuan Fu , Wayne Luk , and 7 more authors

IEEE Transactions on Parallel and Distributed Systems, 2019

HTML
BIGDATA

Finding mutual X at WeChat-scale social network in ten minitues

Conghui He , Shijie Sun , Benli Li , and 2 more authors

In 2019 IEEE International Conference on Big Data (Big Data) , 2019

HTML PDF

2018

SC18

Simulating the Wenchuan earthquake with accurate surface topography on Sunway TaihuLight

Bingwei Chen , Haohuan Fu , Yanwen Wei , and 8 more authors

In SC18: International Conference for High Performance Computing, Networking, Storage and Analysis , 2018

HTML PDF
CLUSTER

swcaffe: A parallel framework for accelerating deep learning applications on sunway taihulight

Liandeng Li , Jiarui Fang , Haohuan Fu , and 5 more authors

In 2018 IEEE International Conference on Cluster Computing (CLUSTER) , 2018

arXiv
CVPRW

Semantic segmentation based building extraction method using multi-source gis map datasets and satellite imagery

Weijia Li , Conghui He , Jiarui Fang , and 1 more author

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops , 2018

HTML

2017

TC

A fully-pipelined hardware design for gaussian mixture models

Conghui He , Haohuan Fu , Ce Guo , and 2 more authors

IEEE Transactions on Computers, 2017

HTML
SC17

9-Pflops nonlinear earthquake simulation on Sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenarios

Haohuan Fu , Conghui He , Bingwei Chen , and 8 more authors

In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , 2017

HTML
ICFPT

An FPGA-based tree crown detection approach for remote sensing images

Weijia Li , Conghui He , Haohuan Fu , and 1 more author

In 2017 International Conference on Field Programmable Technology (ICFPT) , 2017

HTML
FPL

Exploring the potential of reconfigurable platforms for order book update

Conghui He , Haohuan Fu , Wayne Luk , and 2 more authors

In 2017 27th International Conference on Field Programmable Logic and Applications (FPL) , 2017

HTML
EAGE

Approximating Q Propagations for Elastic Modeling on GPUs

C He , H Fu , Y Shen , and 2 more authors

In 79th EAGE Conference and Exhibition 2017 , 2017
FCCM

A Nanosecond–Level Hybrid Table Design for Financial Market Data Generators

Haohuan Fu , Conghui He , Wayne Luk , and 2 more authors

In 2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) , 2017

HTML
FPGA

Accelerating Financial Market Server through Hybrid List Design

Haohuan Fu , Conghui He , Huabin Ruan , and 6 more authors

In Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays , 2017

HTML

2016

SC16

Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer

Haohuan Fu , Junfeng Liao , Wei Xue , and 8 more authors

In SC’16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , 2016

HTML
C&G

A time-space domain stereo finite difference method for 3D scalar wave propagation

Yushu Chen , Guangwen Yang , Xiao Ma , and 2 more authors

Computers & Geosciences, 2016

HTML

2015

SEG

A GPU-based Parallel Beam Migration Design

Conghui He , Haohuan Fu , Bangtian Liu , and 4 more authors

In 2015 SEG Annual Meeting , 2015

HTML
EAGE

Ensemble full wave inversion with source encoding

C He , Y Chen , H Fu , and 1 more author

In 77th EAGE Conference and Exhibition 2015 , 2015

HTML

2014

RS

Global-scale associations of vegetation phenology with rainfall and temperature at a high spatio-temporal resolution

Nicholas Clinton , Le Yu , Haohuan Fu , and 2 more authors

Remote Sensing, 2014

HTML