
GPT-type Technology Empowers Digital Humanities: Conceptual Deconstruction, Application Prospects and Practical Problems
Gao Xiang
Knowledge Management Forum ›› 2024, Vol. 9 ›› Issue (2) : 109-119.
GPT-type Technology Empowers Digital Humanities: Conceptual Deconstruction, Application Prospects and Practical Problems
[Objective/Significance] GPT technology is expected to help the development of digital humanities. By discussing its application in digital humanities, we hope to accelerate the integration of digital humanities and emerging technologies, promote the integration of digital humanities with the times, and provide a new direction for the development of digital humanities. [Method/Process] By analyzing the meaning, development course and supporting technology of GPT technology, and comparing them with traditional digital humanities tools, this paper summarized the application prospects of GPT technology in the field of digital humanities and the existing practical problems. [Result/Conclusion] GPT technology has a broad prospects in digital humanities, and can be used as an intelligent research assistant, realizing full-scale graphic analysis, integrating fragmented knowledge, multilingual translation, and promoting the development of humanities projects, etc. However, practical problems such as data leakage security, algorithmic ethics and intellectual copyright of generated content, the accuracy of humanistic knowledge and knowledge innovation ability need to be further resolved.
GPT / digital humanities / artificial intelligence generated content technology / conversational language model
[1] |
智东西 ZeR0. 时间线复盘ChatGPT爆火之路:改变互联网圈的两个月[EB/OL].[2023-12-06]. https://www.jiemian.com/article/8893975.html.(WISE THINGS ZeR0. Timeline resumption ChatGPT explosion road: two months to change the internet circle[EB/OL].[2023-12-06]. https://www.jiemian.com/article/8893975.html.)
|
[2] |
OpenAI .ChatGPT can now see, hear, and speak[EB/OL]. [2023-12-06].https://openai.com/blog/chatgpt-can-now-see-hear-and-speak.
|
[3] |
中商产业研究院. 2022年中国人工智能行业最新政策汇总一览(表) [EB/OL].[2023-12-06]. https://www.askci.com/news/chanye/20220824/0921361966713.shtml.(CHINA COMMERCIAL INDUSTRY RESEARCH INSTITUTE. Summary of the latest policies of artificial intelligence industry in China in 2022 (Table) [EB/OL].[2023-12-06]. https://www.askci.com/news/chanye/20220824/0921361966713.shtml.)
|
[4] |
王丽华,刘炜,刘圣婴.数字人文的理论化趋势前瞻[J].中国图书馆学报,2020,46(3):17-23.(WANG L H,LIU W,LIU S Y. Perspective research of digital humanities theory[J]. Journal of library science in China, 2020,46(3):17-23.)
|
[5] |
朱本军,聂华.跨界与融合:全球视野下的数字人文——首届北京大学“数字人文论坛”会议综述[J].大学图书馆学报,2016,34(5):16-21.(ZHU B J, NIE H. Crossing boundaries and engaging communities: digital humanities in a global perspective [J]. Journal of academic librariese,2016,34(5):16-21.)
|
[6] |
DIS E V, BOLLEN J, ZUIDEMA W, et al. ChatGPT: five priorities for research conversational AI is a game-changer for science. here’s how to respond[EB/OL].Nature, 2023,614(7947):224-226.
|
[7] |
OpenAI. Introducing ChatGPT[EB/OL].[2023-12-13]. https://openai.com/blog/chatgpt?ref=the-writesonic-blog-making-content-your-superpower.
|
[8] |
ZHU Y, KIROS R, ZEMELET R,et al. Aligning books and movies: towards story-like visual explanations by watching movies and reading books[EB/OL].[2023-12-16]. https://arxiv.org/abs/1506.06724.
|
[9] |
知乎大师兄.预训练语言模型之GPT-1,GPT-2和GPT-3[EB/OL].[2023-12-16].https://zhuanlan.zhihu.com/p/350017443.(ZHIHU DA SHI XIONG. GPT-1, GPT-2 and GPT-3 of pre-training language model. [EB/OL].[2023-12-16].https://zhuanlan.zhihu.com/p/350017443.)
|
[10] |
BROWN T B,MANN B,RYDER N,et al. Language models are few-Shot learners[C]//Proceedings of the 34th international conference on neural information processing systems. New York: ACM, 2020:1877-1901.
|
[11] |
OUYANG L, WU J, JIANG X, et al. Training language models to follow instructions with human feedback[EB/OL]. [2023-12-21]. https://arxiv.org/abs/2203.02155.
|
[12] |
CHEN M, TWOREK J, JUN H, et al. Evaluating large language models trained on code[EB/OL]. [2023-12-21]. https://arxiv.org/abs/1706.03762.
|
[13] |
FU Y, PENG H, TUSHAR K. How does GPT obtain its ability? Tracing emergent abilities of language models to their sources[EB/OL].[2023-12-22]. https://yaofu.notion.site/How-does-GPT-Obtain-its-Ability-Tracing-Emergent-Abilities-of-Language-Models-to-their-Sources-b9a57ac0fcf74f30a1ab9e3e36fa1dc1.
|
[14] |
程序员苍何.【抢先体验】开通使用 ChatGPT 语音版功能保姆级教程[EB/OL]. [2023-12-22].https://blog.csdn.net/qq_43270074/article/details/133578491.(PROGRAMMER CANG HE. [Preemptive experience] Open a nanny-level tutorial using ChatGPT voice version[EB/OL]. [2023-10-17].https://blog.csdn.net/qq_43270074/article/details/133578491.)
|
[15] |
智东西.ChatGPT能语音聊天和看图了,五种音色选项,背后模型细节公开[EB/OL].[2023-12-25].https://36kr.com/p/2448933549496450.(SMART THINGS. ChatGPT can voice chat and look at pictures. there are five timbre options, and the details behind the model are open[EB/OL].[2023-10-17].https://36kr.com/p/2448933549496450.)
|
[16] |
VASWNI A, SHAZEER N,PARMAR N, et al. Attention is all you need[EB/OL].[2023-12-27]. https://arxiv.org/abs/1706.03762.
|
[17] |
LAMBERT N,CASTRICATO L, WERRA L O, et al. Illustrating reinforcement learning from human feedback (RLHF) [EB/OL]. [2023-12-27]. https://huggingface.co/blog/rlhf.
|
[18] |
楷文狗.【科普向】Chat GPT背后的技术:什么是RLHF(人类反馈强化学习)?[EB/OL].[2023-12-27].https://www.bilibili.com/read/cv22006067.(KAIWEN DOG. [Popular science direction]The technology behind Chat GPT: what is RLHF (human feedback reinforcement learning)? [EB/OL]. [2023-12-27].https://www.bilibili.com/read/cv22006067.)
|
[19] |
新浪科技. 微软向OpenAI投资10亿美元 在Azure平台上开发AI技术[EB/OL]. [2023-12-27]. https://tech.sina.com.cn/it/2019-07-22/doc-ihytcerm5517562.shtml.(SINA TECHNOLOGY. Microsoft invested $1 billion in OpenAI to develop AI technology on the Azure platform)[EB/OL]. [2023-12-27]. https://tech.sina.com.cn/it/2019-07-22/doc-ihytcerm5517562.shtml.)
|
[20] |
刘炜,叶鹰.数字人文的技术体系与理论结构探讨[J].中国图书馆学报,2017,43(5):32-41. (LIU W, YE Y. Exploring technical system and theoretical structure of digital humanities[J]. Journal of library science in China, 2017,43(5):32-41.)
|
[21] |
澎湃新闻. 澎湃圆桌|ChatGPT、人工智能与数字人文:传统学问的科技未来?[EB/OL]. [2023-12-27]. https://m.thepaper.cn/newsDetail_forward_21973969.(PENGPAI NEWS. surging round table|ChatGPT, artificial intelligence and digital humanities: the technological future of traditional learning? [EB/OL]. [2023-12-27]. https://m.thepaper.cn/newsDetail_forward_21973969.)
|
[22] |
中国科学院文献情报中心.《ChatGPT对文献情报工作的影响》研究报告(简版)公开发布[EB/OL].[2023-12-30].http://www.las.cas.cn/zhxw/202302/t20230228_6685890.html.(NATIONAL SCIENCE LIBRARY,CHINESE ACADEMY OF SCIENCES. The research report "ChatGPT's influence on literature and information work" (short version) was released to the public[EB/OL]. [2023-12-30].http://www.las.cas.cn/zhxw/202302/t20230228_6685890.html.)
|
[23] |
陈果,陈晶,肖璐.词汇语义链:领域分析视角下的词汇语义挖掘理论框架[J].情报理论与实践,2022,45(4):170-176,183. (CHEN G, CHEN J, XIAO L. Lexical semantic chain: a theoretical framework for lexical semantic mining in the perspective of domain analysis[J]. Information studies: theory & application, 2022,45(4):170-176,183.)
|
[24] |
陆伟,刘家伟,马永强,等.ChatGPT为代表的大模型对信息资源管理的影响[J].图书情报知识,2023,40(2):6-9,70.(LU W,LIU J W,MA Y Q, et al. The influence of large language models represented by ChatGPT on information resources management[J]. Documentation, information & knowledge, 2023,40(2):6-9,70.)
|
[25] |
赵瑞雪,黄永文,马玮璐,等.ChatGPT对图书馆智能知识服务的启示与思考[J].农业图书情报学报,2023,35(1):29-38.(ZHAO R X,HUANG Y W,MA W L, et al. Insights and reflections of the impact of ChatGPT on intelligent knowledge services in libraries [J]. Journal of library and information science in agriculture,2023,35(1):29-38.)
|
[26] |
王树义,张庆薇.ChatGPT给科研工作者带来的机遇与挑战[J].图书馆论坛,2023,43(3):109-118.(WANG S Y,ZHANG Q W. ChatGPT’s opportunities and challenges for researchers [J]. Library tribune, 2023,43(3):109-118.)
|
[27] |
任安麒.数字出版领域智能语言模型的应用、风险与治理——基于ChatGPT技术特征的分析[J].出版科学,2023,31(3):94-102.(REN A Q. Application, challenges and governance of intelligent language models in digital publishing: an analysis based on ChatGPT technology features[J] Publishing journal, 2023,31(3):94-102.)
|
[28] |
付永华,张文欣,司俊勇.ChatGPT影响下的人工智能档案服务:突破与挑战[J].档案管理,2023(3):58-61.(FU Y H,ZHANG W X,SI J Y. Artificial intelligence file service under the influence of ChatGPT: breakthrough and challenge[J]. Archives management, 2023(3):58-61.)
|
[29] |
张玥,庄碧琛,李青宇,等.同质化困境:信息茧房概念解析与理论框架构建[J].中国图书馆学报,2023,49(3):107-122.(ZHANG Y, ZHUANG B C, LI Q Y,et al. Homogenization dilemma: concept analysis and theoretical framework construction of information cocoons[J]. Journal of library science in China, 2023,49(3):107-122.)
|
[30] |
知乎武幺六. ChatGPT3.5和4.0真的使用差距很大吗?[EB/OL]. [2023-12-30]. https://www.zhihu.com/question/595517134.(ZHIHU WU YAO LIU. Is there really a big gap between chatgpt-3.5 and 4.0? [EB/OL].[2023-07-01]. https://www.zhihu.com/question/595517134.)
|
[31] |
中国日报中文网. 数字敦煌:一眼千年,回首又见画中人[EB/OL]. [2023-12-30].https://cn.chinadaily.com.cn/a/202101/25/WS600e84a5a3101e7ce973c929.html.(CHINA DAILY. Digital Dunhuang: looking back at the Millennium, I can see the people in the painting again[EB/OL]. [2023-12-30].https://cn.chinadaily.com.cn/a/202101/25/WS600e84a5a3101e7ce973c929.html.)
|
[32] |
方言保护计划[EB/OL]. [2023-12-30].https://fangyan.xunfei.cn/#/.(Dialect protection plan[EB/OL]. [2023-12-30].https://fangyan.xunfei.cn/#/.)
|
[33] |
澎湃新闻. 从ChatGPT数据泄露事件,看组织安全稳定自动化的重要性[EB/OL]. [2023-12-30]. https://www.thepaper.cn/newsDetail_forward_22632495.(PENGPAI NEWS. From the ChatGPT data leakage incident, see the importance of organizational security, stability and automation [EB/OL].[2023-12-30]. https://www.thepaper.cn/newsDetail_forward_22632495.)
|
[34] |
王晓丽,严驰.生成式AI大模型的风险问题与规制进路:以GPT-4为例[J/OL].北京航空航天大学学报(社会科学版):1-11[2023-12-30].https://doi.org/10.13766/j.bhsk.1008-2204.2023.0535.(WANG X L,YAN C. Risk problem and regulation approach of generative AI foundation models: a case study of GPT-4[J/OL].Journal of Beijing University of Aeronautics and Astronautics(Social sciences edition):1-11[2023-12-30]. https://doi.org/10.13766/j.bhsk.1008-2204.2023.0535.)
|
[35] |
丛立先,李泳霖.聊天机器人生成内容的版权风险及其治理——以ChatGPT的应用场景为视角[J].中国出版,2023(5):16-21.(CONG L X, LI Y L. Copyright risk of chatbot-generated content and its governance—from the perspective of ChatGPT application scenario[J].China publishing journal,2023(5):16-21.)
|
/
〈 |
|
〉 |