publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2024
-
- AAAIVigc: Visual instruction generation and correctionIn Proceedings of the AAAI Conference on Artificial Intelligence , 2024
- Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization CorrelationsarXiv preprint arXiv:2403.14112, 2024
- ITGRSWeakly-supervised 3D Building Reconstruction from Monocular Remote Sensing ImagesIEEE Transactions on Geoscience and Remote Sensing, 2024
- LOCR: Location-Guided Transformer for Optical Character RecognitionarXiv preprint arXiv:2403.02127, 2024
- WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext DatasetarXiv preprint arXiv:2402.19282, 2024
- SongComposer: A Large Language Model for Lyric and Melody Composition in Song GenerationarXiv preprint arXiv:2402.17645, 2024
- LongWanjuan: Towards Systematic Measurement for Long Text QualityarXiv preprint arXiv:2402.13583, 2024
- SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language ModelsarXiv preprint arXiv:2402.05935, 2024
- IJAEOGExploring the user guidance for more accurate building segmentation from high-resolution remote sensing imagesInternational Journal of Applied Earth Observation and Geoinformation, 2024
- InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large modelarXiv preprint arXiv:2401.16420, 2024
2023
-
- Opera: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocationarXiv preprint arXiv:2311.17911, 2023
- Beyond hallucinations: Enhancing lvlms through hallucination-aware direct preference optimizationarXiv preprint arXiv:2311.16839, 2023
-
- Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and compositionarXiv preprint arXiv:2309.15112, 2023
- MiChao-HuaFen 1.0: A Specialized Pre-trained Corpus Dataset for Domain-specific Large ModelsarXiv preprint arXiv:2309.13079, 2023
- DropQueries: A Simple Way to Discover Comprehensive Segment RepresentationsIEEE Transactions on Multimedia, 2023
- Mllm-dataengine: An iterative refinement approach for mllmarXiv preprint arXiv:2308.13566, 2023
- Wanjuan: A comprehensive multimodal dataset for advancing english and chinese large modelsarXiv preprint arXiv:2308.10755, 2023
- Internvid: A large-scale video-text dataset for multimodal understanding and generationarXiv preprint arXiv:2307.06942, 2023
- Mmbench: Is your multi-modal model an all-around player?arXiv preprint arXiv:2307.06281, 2023
-
- SEPT: Towards Scalable and Efficient Visual Pre-TrainingIn Proceedings of the AAAI Conference on Artificial Intelligence , 2023
2022
2021
- ICCV3d building reconstruction from monocular remote sensing imagesIn Proceedings of the IEEE/CVF International Conference on Computer Vision , 2021
2020
- UISTFlava: Find, localize, adjust and verify to annotate lidar-based point cloudsIn Adjunct Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology , 2020
2019
- TPDSOptimizing finite volume method solvers on Nvidia GPUsIEEE Transactions on Parallel and Distributed Systems, 2019
2018
- CLUSTERswcaffe: A parallel framework for accelerating deep learning applications on sunway taihulightIn 2018 IEEE International Conference on Cluster Computing (CLUSTER) , 2018
- CVPRWSemantic segmentation based building extraction method using multi-source gis map datasets and satellite imageryIn Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops , 2018
2017
- TCA fully-pipelined hardware design for gaussian mixture modelsIEEE Transactions on Computers, 2017
- SC179-Pflops nonlinear earthquake simulation on Sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenariosIn Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , 2017
- ICFPTAn FPGA-based tree crown detection approach for remote sensing imagesIn 2017 International Conference on Field Programmable Technology (ICFPT) , 2017
- FPLExploring the potential of reconfigurable platforms for order book updateIn 2017 27th International Conference on Field Programmable Logic and Applications (FPL) , 2017
- EAGEApproximating Q Propagations for Elastic Modeling on GPUsIn 79th EAGE Conference and Exhibition 2017 , 2017
- FCCMA Nanosecond–Level Hybrid Table Design for Financial Market Data GeneratorsIn 2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) , 2017
- FPGAAccelerating Financial Market Server through Hybrid List DesignIn Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays , 2017
2016
- SC16Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputerIn SC’16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , 2016
- C&GA time-space domain stereo finite difference method for 3D scalar wave propagationComputers & Geosciences, 2016
2015
- SEG
- EAGEEnsemble full wave inversion with source encodingIn 77th EAGE Conference and Exhibition 2015 , 2015
2014
- RSGlobal-scale associations of vegetation phenology with rainfall and temperature at a high spatio-temporal resolutionRemote Sensing, 2014