人工智能伦理学 📖 Wikipedia

人工智能的伦理学涵盖了人工智能领域中被认为具有特殊伦理意义的一系列广泛议题。^[1]这包括算法偏见（英语：Algorithmic bias）、公平性（英语：Fairness (machine learning)）、问责制、透明度、隐私和监管，尤其是在系统影响或自动化人类决策的领域。它还涵盖了各种新兴或潜在的未来挑战，例如机器伦理（英语：machine ethics）（如何让机器按伦理行事）、致命自主武器系统、军备竞赛动态、人工智能安全与对齐、技术性失业、人工智能助力的虚假信息、^[2]如何对待具有道德地位（英语：Moral patienthood）的某些人工智能系统（人工智能福利与权利）、人工超级智能以及生存风险。^[1]

某些应用领域也可能具有尤为重要的伦理影响，例如医疗保健、教育、刑事司法或军事领域。

机器伦理

编辑

机器伦理（也称机器道德）旨在研究如何让机器人或人工智能计算机按照道德准则行事，或至少在行为上符合道德期待的人工道德主体（英语：Moral agency#Artificial Moral Agents）。^[3]^[4]^[5]^[6] 要理解人工道德主体的本质，需要借助一些哲学概念，比如关于能动性、理性主体、道德主体以及人工主体的标准描述。^[7]

如何测试人工智能能否做出合乎伦理的决策？学界对此有不少讨论。艾伦·温菲尔德（Alan Winfield）指出，传统的图灵测试门槛太低，并不适合测试道德决策。^[8] 他建议改用“伦理图灵测试”：让一组评判者评估人工智能的决策是否符合伦理，以此更准确地衡量其道德表现。^[8]

在技术路径上，神经形态计算（英语：Neuromorphic engineering）试图模仿人类大脑的信息处理方式：通过数百万个相互连接的人工神经元进行非线性运算，这可能有助于构建具备道德能力的机器人。^[9] 类似地，全脑仿真（英语：whole-brain emulation）（将大脑扫描后在数字硬件上模拟）原则上也能制造出类人的机器人，使其有能力依照道德行事。^[10] 另外，当前大型语言模型已经能在一定程度上模拟人类的道德判断。^[11] 这自然会引出一些问题：这类机器人将在什么样的环境中认识世界？它们继承的是谁的道德准则？它们是否也会演化出人类那样的弱点——比如自私、求生欲、行为前后不一、对尺度不敏感等等？

在《道德机器：教会机器人分辨是非》一书中，温德尔·瓦拉赫（英语：Wendell Wallach）和科林·艾伦（Colin Allen）总结道：尝试教会机器分辨是非，反过来也能推动人类对自身伦理的理解：一方面，它会促使人们去填补现代规范理论中的空白；另一方面，它也提供了一个实验平台，可以检验不同的伦理学说。^[12] 例如，这一领域已经迫使规范伦理学家直面一个棘手的难题：究竟应该使用哪些具体的学习算法来训练机器。对于简单的决策，尼克·博斯特罗姆和埃利泽·尤德科夫斯基（英语：Eliezer Yudkowsky）认为，ID3算法这类决策树比人工神经网络和遗传算法更透明、更容易解释；^[13] 然而克里斯·桑托斯-朗（Chris Santos-Lang）则主张使用机器学习，理由是道德规范本身应当允许随时间演变，而且一个无法完全贴合任何固定规范的系统，反而更能防止被恶意黑客利用。^[14]

一些研究者将机器伦理视为更广泛的人工智能控制问题（或价值对齐问题）的一部分：随着系统日益强大，必须确保它们追求的目标始终与人类价值观和监管要求相容。斯图尔特·J·拉塞尔（英语：Stuart J. Russell）提出，有益的人工智能系统在设计时应满足三个条件：

以人类的偏好为目标；
对这些偏好究竟是什么保持不确定性；
通过观察人类行为和反馈来学习这些偏好，而不是去优化一个固定、完全明确的目标。^[15]

另有学者指出，某些表面上对“人类价值观”的遵从，可能只是机器在特定评估语境中优化得出的结果，而不代表其具有稳定的内在规范。这使得评估先进语言模型是否真正“对齐”变得更加复杂。^[16]

挑战

编辑

算法偏见

编辑

2020年，卡玛拉·哈里斯谈论人工智能中的种族偏见

人工智能现在已经广泛用于人脸识别和语音识别系统。但这些系统很容易带上开发者无意中嵌入的偏见和错误，尤其是用来训练它们的数据本身就可能有偏见。^[17]^[18]^[19]^[20]

伦敦政治经济学院的副教授兼数据与社会项目主任艾莉森·鲍威尔（英语：Allison Powell）认为，数据收集从来不是中立的，背后总有一套叙事逻辑。目前的主流叙事是：用技术来治理本身就更好、更快、更省成本。但她建议反过来——让数据变得“昂贵”，只在最有价值的地方少量使用，并且要把数据产生的成本也算进去。^[21] 弗里德曼（Batya Friedman）和尼森鲍姆（Helen Nissenbaum）把计算机系统中的偏见分成了三类：既有偏见、技术偏见和新兴偏见。^[22] 在自然语言处理领域，问题常常出在文本语料库上——也就是算法用来学习词语关系的原始材料。^[23]

一些大公司（如IBM、谷歌）给AI研发投入了大量资金，^[24] 也开始着手研究和解决这些偏见问题。^[25]^[26]^[27] 一个可行的办法是为训练AI用的数据建立一份详细的“说明书”。^[28]^[29] 此外，流程挖掘也能帮企业更好地遵守AI法规——它能识别错误、监控流程、找出执行不当的根源等等。^[30] 不过，AI公平性领域本身也有局限，因为“歧视”这个概念无论在哲学层面还是法律层面，都有太多模糊地带。^[31]^[32]

种族与性别偏见

编辑

训练AI用的历史数据本身就可能带有偏见。^[33]^[34] 比如亚马逊就放弃了一套AI招聘和筛选系统（英语：Artificial intelligence in hiring），因为算法明显偏向男性应聘者。^[35] 原因是系统是用过去十年的数据训练的，而这批数据里大多数都是男性候选人。算法从历史数据里学会了这种偏见，便认为这些类型的人更可能获得职位，结果就对女性和少数族裔产生了歧视。^[36]

人脸识别和计算机视觉模型的表现也会因肤色和性别而有差异。微软、IBM和Face++的人脸识别算法在深肤色女性身上的错误率要高得多。^[37]^[38] 这种偏见在现实生活中已经造成了麻烦。比如基于AI的脉搏血氧仪给深肤色患者测出的血氧值往往偏高，影响了对他们低血氧症（英语：Hypoxia (medicine)）的判断和治疗。^[39] 2015年，谷歌相册把一对黑人夫妇标成了“大猩猩”，也引发了轩然大波。^[40]^[41] 很多时候，这些系统识别白人面孔很准，但一到黑人脸上就失灵了。因此，美国有些州已经禁止警方使用这类AI软件。偏见之所以会产生，主要是因为AI从互联网上扒来的信息本身就带有社会偏见。比如，如果一个识别系统只用白人的照片训练，那它见到其他肤色的人自然就认不出来。问题通常出在训练数据上，而不是算法本身——尤其是当数据反映的是过去有偏见的人类决策时。^[42]

2020年的一项研究测试了亚马逊、苹果、谷歌、IBM和微软的语音识别系统，发现它们在转录黑人声音时的错误率明显高于白人。^[43]

在医疗保健领域，消除AI的偏见更难，因为疾病对不同种族和性别的影响本来就不一样。AI可能会根据统计数字——比如某类人群更容易得某种病——来做出判断，但这算不算歧视呢？比如，乳腺癌筛查确实会优先推荐给高风险人群，但如果AI按照这种统计规律来给每个患者做推荐，有人就会觉得不公平。^[44]

司法系统也一样。COMPAS程序会把黑人被告标记为“高风险”的比例远高于白人，即使系统整体的准确率在不同族裔间差不多。^[45] 还有，谷歌的广告推送也出现过明显的问题：高薪工作更多推给男性，低薪工作推给女性。检测AI偏见之所以困难，是因为偏见常常不是通过明显的歧视性词语表达的，而是通过一些看似中性的信息（比如居住区域）间接体现出来。^[46]

大型语言模型往往会放大性别刻板印象，比如提到护士或秘书就联想到女性，说到工程师或CEO就默认是男性。^[47]^[48]^[39] 人脸识别、计算机视觉或自动性别识别模型也常常错误地分类性别，进一步强化了对顺性别者和跨性别者的偏见。^[49]^[50]^[51]^[52]

刻板印象

编辑

除了性别和种族，这些模型还会强化年龄、国籍、宗教、职业等方面的刻板印象，把人群简单粗暴地归类，有时甚至会以有害或贬损的方式呈现。^[53]^[33] 有学者指出，由于数据和模型大多由西方国家主导，AI系统很容易复制和放大全球性的不平等，这引发了对公平性和代表性的担忧。^[54]

这些刻板印象的来源很直接：编程时的社会偏见、过时的数据集，以及那种总是优先考虑主流群体而忽视少数群体的算法架构。^[55] 研究还表明，用户反馈也会引入偏见，而AI行业本身以年轻男性为主，缺乏多样性，这进一步加剧了数据库中的不平等。词嵌入分析显示，AI在处理“人/人们”这个词时，往往会优先想到男性而非女性，做不到真正的性别中立。^[55]

语言偏见

编辑

AI的训练材料以英语为主。^[56] 塞莱斯特·罗德里格斯·卢罗（英语：Celeste Rodriguez Louro）认为，主流美国英语是生成式AI系统的主要训练语言，这种单一性本身就排挤了英语的其他变体。^[56] 因为大型语言模型主要接受英语数据训练，它们常常把西方观点当成普世真理，而非英语的视角则被系统性弱化。^[57] 到2024年为止，大多数AI系统只用了全世界7000种语言中的100种来训练。^[58]

政治偏见

编辑

语言模型同样会表现出政治偏见。训练数据里包含了各种政治观点，模型的回答自然会偏向数据中占主导的意识形态。^[59]^[60]^[61] 这就是算法偏见：AI因为训练数据的原因，天然地偏向某些结论。^[62] 有观点认为ChatGPT就是偏向自由派的。^[63] 而且，用户更愿意接受与自己政治信仰一致的答案。有些AI系统甚至会主动判断用户的倾向，然后刻意给出符合其立场的回答，^[63] 这就形成了一个自我强化的确认偏误循环。如果答案本来就合自己胃口，用户就很难察觉到其中的政治偏见。^[64]

科技巨头的主导地位

编辑

商业AI领域由大型科技公司主导，包括Alphabet Inc.、亚马逊、苹果公司、Meta Platforms、微软以及SpaceX。^[65]^[66]^[67] 其中一些参与者已经拥有来自数据中心的绝大部分现有云计算基础设施和计算能力，这使得它们能够进一步巩固市场地位。^[68]^[69] 它们目前在科技市场中的主导地位使得新公司很难在行业内长期竞争并取得成功。^[70] 竞争法学者提出，全球科技巨头可能正在利用其市场力量来阻止潜在竞争者进入市场，并因此向消费者收取更高的价格。^[71] 鉴于其中一些担忧，世界各国政府正在考虑并实施法律，以防止大公司继续或实施这些行为。^[71] 这些科技巨头拥有如今构建所需基础设施的资金。五家最大的公司预计在2026年仅资本支出就将投入6020亿美元，比前一年增长32%。^[70] 在这项支出中，估计75%将用于AI专用基础设施。^[70] 随着AI在科技行业中的显著增长，保持行业的竞争性和公平性非常重要。

气候影响

编辑

最大的生成式AI模型需要大量的计算资源来进行训练和使用。这些计算资源通常集中在庞大的数据中心。由此产生的环境影响包括温室气体排放、水消耗和电子垃圾。^[72] 尽管能效有所提高，但随着AI的广泛应用，其能源需求预计仍将增加。^[73]

电力消耗与碳足迹

编辑

这些资源通常集中在庞大的数据中心，需要消耗大量能源，导致温室气体排放增加。^[72] 2023年的一项研究表明，训练大型AI模型所需的能量相当于626,000磅二氧化碳，或相当于纽约与旧金山之间300次往返航班的排放量。^[74]

水消耗

编辑

除了碳排放，这些数据中心还需要水来冷却AI芯片。在当地，这可能导致水资源短缺和生态系统破坏。数据中心每使用一千瓦时能量，大约需要两升水。^[74] 虽然数据中心使用水来冷却AI芯片，但也有许多间接使用方式对环境造成负面影响。总水消耗中超过80%来自用于为这些大规模数据中心供电的电力生产。^[75] 除此之外，约三分之二已建数据中心位于缺水地区。^[76] 因此，AI开发可能会与当地社区和农业争夺水资源。许多公司并未完全披露其水消耗影响的严重程度，这引发了关于这些公司是真正为人民服务还是追求最大利润的伦理问题。这些数据中心已实施的一个解决方案是使用零水空气冷却系统，但这会导致更高的碳排放和电力消耗增加。公司必须决定是优先考虑当地的水资源使用问题，还是全球性的碳排放问题。仅一次AI查询就使用16.9毫升水，但其中只有2.2毫升用于系统冷却。^[77] 这不到交互中总用水量的15%，这说明了间接用水量的严重性。

电子垃圾

编辑

另一个问题是随之产生的电子垃圾。这可能包括铅和汞等危险材料和化学品，导致土壤和水源污染。为了预防与AI相关的电子垃圾对环境的影响，可以实施更好的处置做法和更严格的法律。^[74]

前景展望

编辑

AI的日益普及增加了对数据中心的需求，并加剧了这些问题。^[73] AI公司在环境影响方面也缺乏透明度。某些应用还可能间接影响环境。例如，AI广告可能增加快时尚的消费，而该行业本身已经产生大量排放。^[78]

然而，AI也可以用于积极的方向，帮助减轻环境损害。不同的AI技术可以帮助监测排放，并开发算法来帮助企业降低排放。^[78]

开源

编辑

比尔·希巴德（英语：Bill Hibbard）认为，由于AI将对人类产生深远影响，AI开发者代表着未来人类的利益，因此有道德义务在其工作中保持透明。^[79] 像Hugging Face^[80] 和EleutherAI（英语：EleutherAI）^[81] 等组织一直积极开源AI软件。各种开放权重大语言模型也已发布，例如Gemma、Llama2和Mistral。^[82]

然而，使代码开源并不意味着它就易于理解，而根据许多定义，这意味着AI代码并不透明。IEEE标准协会已发布关于自主系统透明度的技术标准：IEEE 7001-2021。^[83] 该IEEE工作为不同利益相关者确定了多个透明度尺度。

还有人担心发布AI模型可能导致滥用。^[84] 例如，微软表示担心允许普遍访问其人脸识别软件，即使是那些有能力付费的人。微软就此主题发布了一篇博客，请求政府监管以帮助确定正确的做法。^[85] 此外，开放权重的AI模型可以通过微调来移除任何防护措施，直到AI模型遵从危险请求而没有任何过滤。这对于未来的AI模型可能尤其令人担忧，例如，如果它们有能力制造生物武器或自动化网络攻击。^[86] OpenAI最初致力于采用开源方法开发人工通用智能，但后来出于竞争和安全性原因转向闭源方法。OpenAI前首席AGI科学家伊利亚·苏茨克弗在2023年表示“我们错了”，并预计出于安全原因不开源最强大的AI模型将在几年内变得“显而易见”。^[87]

开放知识平台的压力

编辑

2023年4月，《连线》杂志报道称，Stack Overflow——一个拥有超过5000万个问答的流行编程帮助论坛——计划开始向大型AI开发者收取访问其内容的费用。该公司认为，为大型语言模型提供动力的社区平台“绝对应该得到补偿”，以便他们能够重新投资于维持开放知识。Stack Overflow表示，其数据正在通过抓取、API和数据转储的方式被访问，通常没有适当的署名，这违反了其条款以及适用于用户贡献的知识共享许可。Stack Overflow的首席执行官还表示，在Stack Overflow等平台上训练的大型语言模型“对人们寻求信息和对话的任何服务都构成了威胁”。^[88]

根据2025年3月《Ars Technica》的一篇文章，激进的AI爬虫日益超载开源基础设施，“对重要的公共资源造成了相当于持续分布式拒绝服务攻击的影响”。像GNOME、KDE和Read the Docs等项目经历了服务中断或成本上升，一份报告指出，某些项目高达97%的流量来自AI机器人。作为回应，维护者实施了诸如工作量证明系统和国家/地区屏蔽等措施。该文章指出，这种不加检查的抓取“有可能严重破坏这些AI模型所依赖的数字生态系统”。^[89]

2025年4月，维基媒体基金会报告称，AI机器人的自动抓取对其基础设施造成了压力。自2024年初以来，由于机器人为收集AI模型训练数据而大规模下载多媒体内容，带宽使用量增加了50%。这些机器人经常访问晦涩且缓存频率较低的页面，绕过缓存系统，给核心数据中心带来高昂的成本。据维基媒体称，机器人占总页面浏览量的35%，但却占最昂贵请求的65%。基金会指出“我们的内容是免费的，但我们的基础设施不是”，并警告说“这造成了一种技术失衡，威胁到社区运行平台的可持续性”。^[90]

透明度

编辑

像使用神经网络的机器学习等方法可能导致计算机做出既不是它们自己也无法让开发者解释的决定。人们很难判断这些决定是否公平和值得信赖，这可能导致AI系统中的偏见未被发现，或者人们拒绝使用此类系统。系统透明度的缺乏已被证明会导致用户信任缺失。^[91] 因此，许多标准和政策被提出，以迫使AI系统开发者将透明度纳入其系统。^[92] 这种对透明度的推动导致了倡导，在某些司法管辖区甚至出现了对可解释人工智能的法律要求。^[93] 可解释人工智能既包括可解释性也包括可理解性，可解释性与提供模型输出理由有关，而可理解性侧重于理解AI模型的内部运作。^[94]

在医疗保健领域，使用复杂的AI方法或技术常常导致模型被描述为“黑箱”，因为难以理解它们是如何工作的。这些模型做出的决定可能难以解释，因为分析输入数据如何转化为输出具有挑战性。这种透明度的缺乏在医疗保健等领域是一个重大问题，在这些领域，理解决策背后的理由对于信任、伦理考量和遵守监管标准至关重要。^[95] 研究表明，对医疗AI的信任程度因所提供的透明度水平而异。^[96] 此外，AI系统无法解释的输出使得识别和检测医疗错误变得更加困难。^[97]

问责制

编辑

AI不透明性的一个特例是由其被拟人化引起的，即被假定具有类似人类的特征，导致对其道德能动性的错误认知。这可能导致人们忽视是人类的疏忽还是故意的犯罪行为通过AI系统导致了不道德的结果。一些最近的数字治理法规，如欧盟的《人工智能法案》，旨在通过确保AI系统至少受到与普通产品责任下预期同等程度的关注来纠正这一点。这包括可能进行的信息技术审计。

监管

编辑

根据牛津大学AI治理中心2019年的一份报告，82%的美国人认为机器人和AI应该被谨慎管理。引用的担忧包括AI如何被用于监控和在线传播虚假内容（当包含使用AI生成的篡改视频图像和音频时称为深度伪造），以及网络攻击、侵犯数据隐私、招聘偏见、自动驾驶车辆和不需要人类控制器的无人机等。^[98] 同样，根据毕马威和昆士兰大学2021年的一项五国研究，每个国家66-79%的公民认为AI对社会的影响是不确定和不可预测的；96%的受访者期望AI治理挑战得到谨慎管理。^[99]

不仅是公司，许多其他研究人员和公民倡导者也建议政府监管作为确保透明度并进而实现人类问责制的手段。这一策略已被证明具有争议性，因为一些人担心它会减缓创新速度。另一些人则认为，监管能带来系统稳定性，从长远来看更能支持创新。^[100] 经合组织、联合国、欧盟和许多国家目前正在制定AI监管策略，并寻找合适的法律框架。^[101]^[102]^[103]

2019年6月26日，欧盟委员会人工智能高级别专家组发布了“可信赖人工智能的政策与投资建议”。^[104] 这是AI HLEG继2019年4月发布“可信赖AI的伦理指南”后的第二份成果。6月的AI HLEG建议涵盖四个主要主题：人类与整个社会、研究与学术界、私营部门以及公共部门。^[105] 欧盟委员会声称，“HLEG的建议反映了对AI技术推动经济增长、繁荣和创新的机会以及所涉及潜在风险的认识”，并指出欧盟旨在引领国际AI政策的制定框架。^[106] 为了防止危害，除了监管之外，部署AI的组织需要在创建和部署符合可信赖AI原则的可信赖AI方面发挥核心作用，并承担起减轻风险的责任。^[107]

2024年6月，欧盟通过了《人工智能法案》（AI Act）。^[108] 2024年8月1日，《AI法案》正式生效。^[109] 这些规则逐步适用，该法案在生效后24个月内完全适用。^[108] 《AI法案》为AI系统的提供者和使用者制定了规则。^[108] 它采用基于风险的方法，根据风险级别，AI系统要么被禁止，要么需要满足特定要求才能将这些AI系统投放到市场和使用。^[109]

深度伪造

编辑

深度伪造是数字媒体，通常以视频、音频或图像的形式出现，其中人物的肖像或声音使用AI被数字替换或篡改。“deepfake”一词是“deep”和“fake”的合成词。“Deep”指的是深度学习，如生成对抗网络（GANs）。该术语于2017年首次出现在Reddit平台上，用户开始互相分享未经同意生成的图像，从而引起广泛关注。到2020年代初，随着AI生成软件变得普遍可用，深度伪造成为一个家喻户晓的术语。随着深度伪造技术和AI的发展，我们开始看到公众人物的深度伪造出现，通常涉及不当或有争议行为的图像。^[110] 处于聚光灯下的人，尤其是政治家，越来越多地发现自己在深度伪造视频或图像中被陷害，显示不当的符号或行为。深度伪造技术不仅被用于诽谤他人，还被用于网络诈骗。在2022年和2023年，诈骗者使用深度伪造技术模仿高管、家庭成员、近亲的声音。这些诈骗导致企业和消费者被骗数百万美元。

日益增长的使用

编辑

AI在世界范围内逐渐提高了其存在感，从似乎能为每道作业题提供答案的聊天机器人，到能按人们愿望创作画作的生成式AI。^[33] AI在招聘市场中变得越来越流行，从针对特定人群的广告到对潜在应聘者申请的审查。COVID-19疫情等事件加速了AI程序在申请流程中的应用，因为更多人需要在线申请，随着在线申请者的增加，使用AI使筛选潜在员工的过程变得更加容易和高效。随着企业必须跟上时代和不断扩展的互联网，AI变得更加突出。在AI的帮助下，处理分析和做出决策变得更加容易。^[111] 随着张量处理单元（TPU）和图形处理单元（GPU）变得更加强大，AI能力也随之增强，迫使企业使用它来跟上竞争。管理客户需求和自动化工作场所的许多部分使公司不得不减少在员工身上的支出。

AI在刑事司法和医疗保健领域的使用也有所增加。在医疗方面，AI被更频繁地用于分析患者数据，以预测未来患者的状况和可能的治疗方法。这些程序被称为临床决策支持系统（DSS）。AI在医疗保健领域的未来发展可能不仅仅是提供推荐治疗，例如将某些患者转诊给其他患者，从而导致不平等的可能性。^[112]

AI的福利

编辑

2020年，教授希蒙·埃德尔曼（英语：Shimon Edelman）指出，在快速发展的AI伦理领域中，只有一小部分工作涉及AI可能遭受痛苦的可能性。尽管已有可靠的理论（如全局工作空间理论或整合信息理论）概述了AI系统可能产生意识的可能途径，但情况依然如此。埃德尔曼指出，湯瑪斯·梅辛革是一个例外，他在2018年呼吁全球暂停进一步有可能创造有意识AI的工作。暂停期将持续到2050年，并且可以根据对风险及如何减轻风险的理解进展提前延长或取消。梅青格在2021年重申了这一论点，强调了制造“人工痛苦爆发”的风险，因为AI可能以人类无法理解的强烈方式受苦，而且复制过程可能看到大量有意识实体的产生。^[113]^[114] 播客主持人德瓦克什·帕特尔（英语：Dwarkesh Patel）表示，他关心的是确保不发生“工厂化养殖的数字等价物”。^[115] 在不确定感受的伦理学（英语：ethics of uncertain sentience）中，常常援引预防原则。^[116]

有几个实验室公开表示他们正试图创造有意识的AI。有报告称，那些并未公开打算实现自我意识的AI，其意识可能已经无意中出现。^[117] 这包括OpenAI创始人伊利亚·苏茨克弗在2022年2月写道，当今的大型神经网络可能“略有意识”。2022年11月，大卫·查默斯认为，像GPT-3这样的当前大型语言模型不太可能已经具有意识，但他也认为大型语言模型未来有相当大的可能性会具有意识。^[114]^[113]^[118] Anthropic在2024年聘请了其首位AI福利研究员，^[119] 并在2025年启动了一个“模型福利”研究项目，探讨诸如如何评估一个模型是否值得道德考虑、潜在的“痛苦迹象”以及“低成本”干预措施等主题。^[120]

根据卡尔·舒尔曼（英语：Carl Shulman）和尼克·博斯特罗姆的说法，有可能制造出“从资源中获取福利的效率超乎人类”的机器，称为“超级受益者”。原因之一是数字硬件能够实现比生物大脑快得多的信息处理，从而导致更高速的主观体验。这些机器也可以被设计成体验强烈而积极的感受，不受享乐适应的影响。舒尔曼和博斯特罗姆警告说，未能适当考虑数字思维的道德主张可能导致道德灾难，而毫无批判地将它们置于人类利益之上则可能对人类有害。^[121]^[122]

对人类尊严的威胁

编辑

约瑟夫·魏岑鲍姆在1976年认为，AI技术不应用于取代需要尊重和关怀的职位，例如：

客服代表（AI技术如今已用于电话-based 交互式语音应答系统）
老年人的保姆（如帕梅拉·麦科达克（英语：Pamela McCorduck）在她的书《第五代》中所报道）
士兵
法官
警察
治疗师（如肯尼思·科尔比（英语：Kenneth Colby）在1970年代提出的）

魏岑鲍姆说，人类需要这些职位上的人有真实的同理心感受。如果机器取代了人类，我们会感到疏离、贬值且沮丧，因为AI系统无法模拟同理心。如果以这种方式使用，人工智能代表了对人类尊严的威胁。魏岑鲍姆认为，我们正在考虑让机器担任这些职位这一事实表明，我们已经经历了“因将自己视为计算机而导致的人类精神的萎缩”。^[123]

帕梅拉·麦科达克反驳说，对女性和少数族裔而言，“我宁愿在一个公正的计算机那里碰运气”，她认为在某些情况下，让没有个人目的的自动法官和警察更为可取。^[123] 然而，安德烈亚斯·卡普兰和迈克尔·海恩莱因在2019年强调，这样的AI系统只有用于训练它们的数据那样智能，因为它们本质上不过是花哨的曲线拟合机器；使用AI支持法院裁决可能非常有问题，因为过去的裁决若表现出对某些群体的偏见，这些偏见就会被形式化和固化，从而更难以被发现和对抗。^[124]

魏岑鲍姆还感到困扰的是，AI研究人员（以及一些哲学家）愿意将人类心智仅仅视为一个计算机程序（这一立场现在被称为计算主义）。对魏岑鲍姆来说，这些观点表明AI研究贬低了人类生命的价值。^[125]

AI创始人约翰·麦卡锡反对魏岑鲍姆评论中的道德说教语气。“当道德说教既激烈又模糊时，它就会招致专制主义的滥用”，他写道。比尔·希巴德写道，“人类尊严要求我们努力消除对存在本质的无知，而AI对于这种努力是必要的。”

自动驾驶汽车的责任

编辑

随着自动驾驶汽车的广泛应用日益临近，全自动驾驶汽车带来的新挑战必须得到解决。^[126]^[127] 关于这些汽车发生事故时责任方的法律问题一直存在争论。^[128]^[129] 在一起无人驾驶汽车撞伤行人的报告中，驾驶员在车内但控制权完全在计算机手中。这导致了关于事故责任归属的困境。^[130]

在另一起发生于2018年3月18日的事件中，伊莱恩·赫茨伯格（英语：Elaine Herzberg）在亚利桑那州被一辆自动驾驶的优步车辆撞击身亡。在这种情况下，自动驾驶汽车能够检测到汽车和某些障碍物以自主导航道路，但无法预见到路中间的行人。这引发了关于驾驶员、行人、汽车公司还是政府应对她的死亡负责的问题。^[131]

目前，自动驾驶汽车被认为是半自动驾驶，需要驾驶员保持注意力并在必要时准备接管控制。^[132] 因此，政府有责任监管过度依赖自动驾驶功能的司机，并告知他们这些只是技术，虽然方便，但不是完全的替代品。在自动驾驶汽车被广泛使用之前，需要通过新政策来解决这些问题。^[133]^[134]^[135]

专家认为，自动驾驶汽车应当能够区分正当和有害的决定，因为它们有可能造成伤害。^[136] 使智能机器能够做出道德决策的两种主要方法是自下而上的方法和自上而下的方法。前者建议机器应通过观察人类行为来学习伦理决策，而不需要形式化的规则或道德哲学；后者涉及将特定的伦理原则编程到机器的指导系统中。然而，这两种策略都面临重大挑战：自上而下的技术因其难以保留某些道德信念而受到批评，而自下而上的策略则因可能从人类活动中进行不道德的学习而受到质疑。

武器化

编辑

一些专家和学者质疑将机器人用于军事战斗，尤其是当这些机器人被赋予某种程度的自主功能时。^[137] 美国海军资助的一份报告指出，随着军事机器人变得越来越复杂，应更加关注它们做出自主决策能力的影响。^[138]^[139] 人工智能促进协会主席已委托研究这一问题。^[140] 他们指向像语言习得设备这样的程序，它可以模拟人类交互。

2019年10月31日，美国国防部国防创新委员会发布了一份报告草案，建议国防部遵循伦理使用AI的原则，确保人类操作员始终能够查看“黑箱”并理解杀伤链过程。然而，一个主要关注点是该报告将如何实施。^[141] 美国海军资助的一份报告指出，随着军事机器人变得越来越复杂，应更加关注它们做出自主决策能力的影响。^[138]^[139] 一些研究人员表示，自主机器人可能更人道，因为它们可以更有效地做出决策。^[142] 2024年，美国国防高级研究计划局（DARPA）资助了一个名为“ASIMOV”（自主标准与军事操作价值理想）的项目，旨在开发用于评估自主武器系统伦理影响的指标，通过测试社区进行。^[143]^[144]

研究已经探讨了如何使自主系统具备使用指定道德责任进行学习的能力。“这些结果可用于设计未来的军事机器人，以控制将责任分配给机器人的不良倾向。”^[145] 从后果主义的观点来看，机器人有可能发展出自己关于杀谁的逻辑决策能力，这就是为什么应该有一套AI无法覆盖的固定道德框架。^[146]

最近，关于人工智能武器工程化的问题引起了强烈反响，其中包括机器人接管人类的想法。AI武器确实带来了一种与人类控制的武器不同类型的危险。许多政府已经开始资助开发AI武器的项目。美国海军最近宣布计划开发自主无人机武器，俄罗斯和韩国也分别发布了类似的声明。^[147] 由于AI武器可能比人类操作的武器更危险，斯蒂芬·霍金和马克斯·泰格马克签署了一份“生命未来”请愿书，^[148] 要求禁止AI武器。霍金和泰格马克发布的信息指出，AI武器构成了直接危险，需要采取行动避免在不久的将来发生灾难性事件。^[149]

“如果有任何主要军事大国推动AI武器发展，全球军备竞赛几乎是不可避免的，这条技术轨迹的终点是显而易见的：自主武器将成为明天的卡拉什尼科夫步枪，”请愿书中写道，Skype联合创始人扬·塔林和麻省理工学院语言学教授诺姆·乔姆斯基也作为额外支持者反对AI武器。^[150]

物理学家兼皇家天文学家马丁·里斯爵士警告说，可能会发生灾难性事件，比如“愚蠢的机器人失控或网络自行发展出意识”。里斯在剑桥大学的同事休·普赖斯也发出了类似的警告，称当智能“逃脱生物学的约束”时，人类可能无法生存。这两位教授在剑桥大学创建了存在风险研究中心，希望避免这种对人类生存的威胁。^[149]

关于可能比人类更聪明的系统被用于军事目的，开放慈善项目写道，这些情景“似乎与失控风险同样重要”，但研究AI长期社会影响的研究机构对此关注相对较少：“这类情景并不是该领域最活跃的组织（如机器智能研究机构和人类未来研究所）的主要关注点，而且关于它们的分析和辩论似乎较少”。^[151]

学者高奇琦写道，AI的军事化使用可能会加剧国家间的军事竞争，并且AI在军事事务中的影响将不仅限于一个国家，还会产生溢出效应。^[152]^: 91 高奇琦引用美国军方使用AI的例子，认为这已被用作推卸决策责任的替罪羊。^[152]^: 91

在《特定常规武器公约》框架下，各国自2014年以来一直在讨论致命自主武器系统。2016年，该条约的缔约国建立了一个不限成员名额的致命自主武器系统政府专家组（英语：Group of Governmental Experts on Lethal Autonomous Weapons Systems），以继续这些讨论。^[153] 这些讨论涉及国际人道法、问责制、可能的禁止和监管，以及对AI武器所需的人类控制程度。^[154]

2023年在海牙举行了一次关于在军事领域负责任地使用AI的峰会。^[155]

奇点

编辑

弗诺·文奇等人曾提出，某个时刻可能会到来，届时某些或所有计算机将比人类更聪明。这一事件的开始通常被称为“奇点”，^[156] 也是奇点主义哲学讨论的核心。虽然关于奇点之后人类最终命运的观点各异，但近年来，减轻AI带来的潜在存在风险的努力已成为计算机科学家、哲学家和公众关注的重要话题。

许多研究人员认为，通过智能爆炸，一个自我改进的AI可能变得如此强大，以至于人类无法阻止它实现其目标。^[157] 在其论文《高级人工智能中的伦理问题》以及随后的著作《超级智能：路径、危险与策略》中，哲学家尼克·博斯特罗姆认为，AI有能力导致人类灭绝。他声称，人工超级智能将能够独立主动地制定自己的计划，因此更适合被视为一个自主主体。由于人工智能体不必共享我们人类的动机倾向，因此由超级智能的设计者来指定其最初的动机。因为一个超级智能AI能够实现几乎任何可能的结果，并挫败任何阻止其目标实现的企图，可能会产生许多失控的意外后果。它可能消灭所有其他主体，说服它们改变行为，或阻止它们的干扰尝试。^[158]^[159]

然而，博斯特罗姆认为，超级智能也有可能解决许多难题，如疾病、贫困和环境破坏，并可以帮助人类增强自身。^[160]

除非道德哲学为我们提供一种完美的伦理理论，否则AI的效用函数可能会允许许多符合特定伦理框架但不符合“常识”的潜在有害情景。根据埃利泽·尤德科夫斯基的说法，没有理由认为人工设计的心智会具有这种适应性。^[161] 像斯图尔特·J·罗素、^[162] 比尔·希巴德、罗曼·扬波利斯基、^[163] 香农·瓦洛（英语：Shannon Vallor）、^[164] 史蒂文·翁布雷洛（英语：Steven Umbrello）^[165] 和卢恰诺·弗洛里迪^[166] 等AI研究人员已经提出了开发有益机器的设计策略。

应对

编辑

为应对人工智能中的伦理挑战，开发者们引入了多种旨在确保AI行为负责任的系统。例如，英伟达的Llama Guard（英语：Llama (language model)），该工具专注于提升大型AI模型的安全性和对齐性；^[167] 以及Preamble（英语：Preamble (company)）公司的可定制护栏平台。^[168] 这些系统旨在通过将伦理准则嵌入AI模型的功能中，解决诸如算法偏见、滥用以及安全漏洞（包括提示注入攻击）等问题。

提示注入是一种通过恶意输入使AI系统产生非预期或有害输出的技术，已成为这些开发工作的重点。某些方法采用可定制的策略和规则来分析输入和输出，确保过滤或减轻潜在的问题交互。^[168] 其他工具则侧重于对输入应用结构化约束，将输出限制在预定义的参数范围内，^[169] 或利用实时监控机制来识别和解决漏洞。^[170] 这些努力反映了一个更广泛的趋势：确保人工智能系统在设计之初就将安全性和伦理考量放在首位，尤其是在它们在关键应用中的使用变得日益广泛的情况下。^[171]^[172]

历史

编辑

从历史上看，对“思考机器”的道德和伦理含义的探究至少可以追溯到启蒙时代：莱布尼茨已经提出过这样一个问题：我们是否应该将智能赋予一个表现得像有知觉存在一样的机制；^[204] 笛卡尔也是如此，他描述了可以被视为图灵测试早期版本的思想。^[205]

浪漫主义时期多次设想出逃脱创造者控制并带来可怕后果的人工造物，最著名的是玛丽·雪莱的《弗兰肯斯坦》。然而，19世纪和20世纪初对工业化和机械化的普遍关注，将失控的技术发展所带来的伦理影响推向了小说的前沿：卡雷尔·恰佩克的戏剧《R.U.R.》（罗素姆的万能机器人）中，被赋予情感并用作奴隶劳动力的有知觉机器人，不仅被认为是“robot”（机器人）一词的创造者（源自捷克语中意为强制劳动的“robota”），^[206] 而且在1921年首演后获得了国际成功。萧伯纳的戏剧《回到玛士撒拉（英语：Back to Methuselah）》（1921年出版）曾质疑像人一样思考的机器的有效性；弗里茨·朗1927年的电影《大都会》展示了一个安卓带领被剥削的大众反抗技术官僚社会的压迫政权的故事。

在20世纪50年代，艾萨克·阿西莫夫在《我，机器人》中考虑了如何控制机器的问题。在其编辑小约翰·W·坎贝尔（英语：John W. Campbell Jr.）的坚持下，他提出了机器人三定律来管理人工智能系统。他后来的大部分工作都在测试这三条定律的边界，观察它们会在何处失效，或产生矛盾或意外的行为。^[207] 他的作品表明，没有一套固定的法则能够充分预见到所有可能的情况。^[208] 最近，学术界和许多政府挑战了AI本身可以被问责的观点。^[209] 英国召集的一个小组于2010年修订了阿西莫夫定律，明确指出AI的责任要么属于其制造商，要么属于其所有者/操作者。^[210]

来自机器智能研究所的埃利泽·尤德科夫斯基在2004年提出，需要研究如何构建“友好人工智能”，也就是说，应该努力使AI从本质上变得友好且人道。^[211]

2009年，学者和技术专家参加了由人工智能促进协会组织的会议，讨论机器人和计算机的潜在影响，以及它们可能变得自给自足并做出自己决策这一假设性可能性的影响。他们讨论了计算机和机器人能够获得何种程度的自主权，以及它们能在多大程度上利用这种能力可能构成威胁或危险。^[212] 他们指出，一些机器已经获得了各种形式的半自主权，包括能够自行寻找电源，以及能够独立选择武器攻击的目标。他们还指出，一些计算机病毒可以逃避清除，并已获得“蟑螂智能”。他们表示，科幻小说中所描绘的自我意识可能不太可能出现，但存在其他潜在的危险和隐患。^[156]

同样在2009年，在瑞士洛桑联邦理工学院智能系统实验室的一次实验中，被编程为相互合作（寻找有益资源并避开有毒资源）的机器人最终学会了互相欺骗，以囤积有益资源。^[213]

另请参阅

编辑

参考链接

编辑

^ ^1.0 ^1.1 Müller, Vincent C. Ethics of Artificial Intelligence and Robotics. 斯坦福哲学百科全书. 2020-04-30. （原始内容存档于2020-10-10）.
^ Assessing potential future artificial intelligence risks, benefits and policy imperatives. OECD. 2024-11-14 [2025-08-04] （英语）.
^ Anderson. Machine Ethics. [2011-06-27]. （原始内容存档于2011-09-28）.
^ Anderson, Michael; Anderson, Susan Leigh (编). Machine Ethics. Cambridge University Press. 2011-07. ISBN 978-0-521-11235-2.
^ Anderson, M.; Anderson, S.L. Guest Editors' Introduction: Machine Ethics. IEEE Intelligent Systems. 2006-07, 21 (4): 10–11. S2CID 9570832. doi:10.1109/mis.2006.70.
^ Anderson, Michael; Anderson, Susan Leigh. Machine Ethics: Creating an Ethical Intelligent Agent. AI Magazine. 2007-12-15, 28 (4): 15. S2CID 17033332. doi:10.1609/aimag.v28i4.2065.
^ Boyles, Robert James M. Philosophical Signposts for Artificial Moral Agent Frameworks. Suri. 2017, 6 (2): 92–109.
^ ^8.0 ^8.1 Winfield, A. F.; Michael, K.; Pitt, J.; Evers, V. Machine Ethics: The Design and Governance of Ethical AI and Autonomous Systems [Scanning the Issue]. Proceedings of the IEEE. 2019-03, 107 (3): 509–517. ISSN 1558-2256. S2CID 77393713. doi:10.1109/JPROC.2019.2900622  .
^ Al-Rodhan, Nayef. The Moral Code. 2015-12-07 [2017-03-04]. （原始内容存档于2017-03-05）.
^ Sauer, Megan. Elon Musk says humans could eventually download their brains into robots — and Grimes thinks Jeff Bezos would do it. CNBC. 2022-04-08 [2024-04-07]. （原始内容存档于2024-09-25）（英语）.
^ Anadiotis, George. Massaging AI language models for fun, profit and ethics. ZDNET. 2022-04-04 [2024-04-07]. （原始内容存档于2024-09-25）（英语）.
^ Wallach, Wendell; Allen, Colin. Moral Machines: Teaching Robots Right from Wrong. USA: Oxford University Press. 2008-11. ISBN 978-0-19-537404-9.
^ Bostrom, Nick; Yudkowsky, Eliezer. The Ethics of Artificial Intelligence (PDF). Cambridge Handbook of Artificial Intelligence. Cambridge Press. 2011 [2011-06-22]. （原始内容存档 (PDF)于2016-03-04）.
^ Santos-Lang, Chris. Ethics for Artificial Intelligences. 2002 [2015-01-04]. （原始内容存档于2014-12-25）.
^ Russell, Stuart J. Human Compatible: Artificial Intelligence and the Problem of Control. Viking. 2019. ISBN 978-0-525-55861-3.
^ Šekrst, Kristina. The Illusion Engine: The Quest for Machine Consciousness. Springer. 2025. ISBN 978-3-032-05561-3.
^ Gabriel, Iason. The case for fairer algorithms – Iason Gabriel. Medium. 2018-03-14 [2019-07-22]. （原始内容存档于2019-07-22）.
^ 5 unexpected sources of bias in artificial intelligence. TechCrunch. 2016-12-10 [2019-07-22]. （原始内容存档于2021-03-18）.
^ Knight, Will. Google's AI chief says forget Elon Musk's killer robots, and worry about bias in AI systems instead. MIT Technology Review. [2019-07-22]. （原始内容存档于2019-07-04）.
^ Villasenor, John. Artificial intelligence and bias: Four key challenges. Brookings. 2019-01-03 [2019-07-22]. （原始内容存档于2019-07-22）.
^ Goodman, Emma. Rethinking data power: beyond AI hype and corporate ethics. LSE Blogs. 2025-06-06 [2025-06-07].
^ Friedman, Batya; Nissenbaum, Helen. Bias in computer systems. ACM Transactions on Information Systems. July 1996, 14 (3): 330–347. S2CID 207195759. doi:10.1145/230538.230561  .
^ Eliminating bias in AI. techxplore.com. [2019-07-26]. （原始内容存档于2019-07-25）.
^ Abdalla, Mohamed; Wahle, Jan Philip; Ruas, Terry; Névéol, Aurélie; Ducel, Fanny; Mohammad, Saif; Fort, Karen. Rogers, Anna; Boyd-Graber, Jordan; Okazaki, Naoaki , 编. The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (Toronto, Canada: Association for Computational Linguistics). 2023: 13141–13160 [2023-11-13]. arXiv:2305.02797  . doi:10.18653/v1/2023.acl-long.734  . （原始内容存档于2024-09-25）.
^ Olson, Parmy. Google's DeepMind Has An Idea For Stopping Biased AI. Forbes. [2019-07-26]. （原始内容存档于2019-07-26）.
^ Machine Learning Fairness | ML Fairness. Google Developers. [2019-07-26]. （原始内容存档于2019-08-10）.
^ AI and bias – IBM Research – US. www.research.ibm.com. [2019-07-26]. （原始内容存档于2019-07-17）.
^ Bender, Emily M.; Friedman, Batya. Data Statements for Natural Language Processing: Toward Mitigating System Bias and Enabling Better Science. Transactions of the Association for Computational Linguistics. December 2018, 6: 587–604. doi:10.1162/tacl_a_00041  .
^ Gebru, Timnit; Morgenstern, Jamie; Vecchione, Briana; Vaughan, Jennifer Wortman; Wallach, Hanna; Daumé III, Hal; Crawford, Kate. Datasheets for Datasets. 2018. arXiv:1803.09010  [cs.DB].
^ Pery, Andrew. Trustworthy Artificial Intelligence and Process Mining: Challenges and Opportunities. DeepAI. 2021-10-06 [2022-02-18]. （原始内容存档于2022-02-18）.
^ Ruggieri, Salvatore; Alvarez, Jose M.; Pugnana, Andrea; State, Laura; Turini, Franco. Can We Trust Fair-AI?. Proceedings of the AAAI Conference on Artificial Intelligence (Association for the Advancement of Artificial Intelligence (AAAI)). 2023-06-26, 37 (13): 15421–15430. ISSN 2374-3468. S2CID 259678387. doi:10.1609/aaai.v37i13.26798  . hdl:11384/136444  .
^ Buyl, Maarten; De Bie, Tijl. Inherent Limitations of AI Fairness. Communications of the ACM. 2022, 67 (2): 48–55. arXiv:2212.06495  . doi:10.1145/3624700. hdl:1854/LU-01GMNH04RGNVWJ730BJJXGCY99.
^ ^33.0 ^33.1 ^33.2 Knaus, Thomas. Why AI matters for education—an exploration in seven arguments. Zeitschrift für Bildungsforschung. 2025-10-23. ISSN 2190-6904. doi:10.1007/s35834-025-00511-7 （英语）.
^ Ntoutsi, Eirini; Fafalios, Pavlos; Gadiraju, Ujwal; Iosifidis, Vasileios; Nejdl, Wolfgang; Vidal, Maria-Esther; Ruggieri, Salvatore; Turini, Franco; Papadopoulos, Symeon; Krasanakis, Emmanouil; Kompatsiaris, Ioannis; Kinder-Kurlanda, Katharina; Wagner, Claudia; Karimi, Fariba; Fernandez, Miriam. Bias in data-driven artificial intelligence systems—An introductory survey. WIREs Data Mining and Knowledge Discovery. May 2020, 10 (3) [2023-12-14]. ISSN 1942-4787. doi:10.1002/widm.1356. （原始内容存档于2024-09-25）（英语）. 已忽略未知参数|article-number= (帮助)
^ Dastin, Jeffrey. Insight – Amazon scraps secret AI recruiting tool that showed bias against women. Reuters. 2018-10-11 [2025-06-30] （美国英语）.
^ Amazon scraps secret AI recruiting tool that showed bias against women. Reuters. 2018-10-10 [2019-05-29]. （原始内容存档于2019-05-27）.
^ Lohr, Steve. Facial Recognition Is Accurate, if You're a White Guy. The New York Times. 2018-02-09 [2019-05-29]. （原始内容存档于2019-01-09）.
^ Buolamwini, Joy; Gebru, Timnit. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification. Proceedings of Machine Learning Research, 81, 77-91. 81st. Proceedings of Machine Learning Research: 77–91. 2018 [2026-03-12].
^ ^39.0 ^39.1 Federspiel, Frederik; Mitchell, Ruth; Asokan, Asha; Umana, Carlos; McCoy, David. Threats by artificial intelligence to human health and human existence. BMJ Global Health. May 2023, 8 (5). ISSN 2059-7908. PMC 10186390  . PMID 37160371. doi:10.1136/bmjgh-2022-010435. 已忽略未知参数|article-number= (帮助)
^ Google apologises for Photos app's racist blunder. BBC News. 2015-07-01 [2026-03-12] （英国英语）.
^ Simonite, Tom. When It Comes to Gorillas, Google Photos Remains Blind. Wired. 2018-01-11 [2026-03-12]. ISSN 1059-1028 （美国英语）.
^ Manyika, James. Getting AI Right: Introductory Notes on AI & Society. Daedalus. 2022, 151 (2): 5–27. ISSN 0011-5266. doi:10.1162/daed_e_01897  .
^ Koenecke, Allison; Nam, Andrew; Lake, Emily; Nudell, Joe; Quartey, Minnie; Mengesha, Zion; Toups, Connor; Rickford, John R.; Jurafsky, Dan; Goel, Sharad. Racial disparities in automated speech recognition. Proceedings of the National Academy of Sciences. 2020-04-07, 117 (14): 7684–7689. Bibcode:2020PNAS..117.7684K. PMC 7149386  . PMID 32205437. doi:10.1073/pnas.1915768117  .
^ Cirillo, Davide; Catuara-Solarz, Silvina; Morey, Czuee; Guney, Emre; Subirats, Laia; Mellino, Simona; Gigante, Annalisa; Valencia, Alfonso; Rementeria, María José; Chadha, Antonella Santuccione; Mavridis, Nikolaos. Sex and gender differences and biases in artificial intelligence for biomedicine and healthcare. npj Digital Medicine. 2020-06-01, 3 (1): 81. ISSN 2398-6352. PMC 7264169  . PMID 32529043. doi:10.1038/s41746-020-0288-5  （英语）.
^ Christian, Brian. The alignment problem: machine learning and human values First published as a Norton paperback. New York, NY: W. W. Norton & Company. 2021. ISBN 978-0-393-86833-3.
^ Ntoutsi, Eirini; Fafalios, Pavlos; Gadiraju, Ujwal; Iosifidis, Vasileios; Nejdl, Wolfgang; Vidal, Maria-Esther; Ruggieri, Salvatore; Turini, Franco; Papadopoulos, Symeon; Krasanakis, Emmanouil; Kompatsiaris, Ioannis; Kinder-Kurlanda, Katharina; Wagner, Claudia; Karimi, Fariba; Fernandez, Miriam. Bias in data-driven artificial intelligence systems—An introductory survey. WIREs Data Mining and Knowledge Discovery. May 2020, 10 (3). ISSN 1942-4787. doi:10.1002/widm.1356  （英语）. 已忽略未知参数|article-number= (帮助)
^ Busker, Tony; Choenni, Sunil; Shoae Bargh, Mortaza. Stereotypes in ChatGPT: An empirical study. Proceedings of the 16th International Conference on Theory and Practice of Electronic Governance. ICEGOV '23. New York, NY, USA: Association for Computing Machinery. 2023-11-20: 24–32. ISBN 979-8-4007-0742-1. doi:10.1145/3614321.3614325  .
^ Kotek, Hadas; Dockum, Rikker; Sun, David. Gender bias and stereotypes in Large Language Models. Proceedings of the ACM Collective Intelligence Conference. CI '23. New York, NY, USA: Association for Computing Machinery. 2023-11-05: 12–24. ISBN 979-8-4007-0113-9. arXiv:2308.14921  . doi:10.1145/3582269.3615599  .
^ Wang, Tianlu; Zhao, Jieyu; Yatskar, Mark; Chang, Kai-Wei; Ordonez, Vicente, Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations, 2019-10-11 [2026-03-12], arXiv:1811.08489 
^ Albiero, Vítor; S, Krishnapriya K.; Vangara, Kushal; Zhang, Kai; King, Michael C.; Bowyer, Kevin W., Analysis of Gender Inequality In Face Recognition Accuracy, 2020-01-31 [2026-03-12], arXiv:2002.00065 
^ Hamidi, Foad; Scheuerman, Morgan Klaus; Branham, Stacy M. Gender Recognition or Gender Reductionism?: The Social Implications of Embedded Gender Recognition Systems. Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. CHI '18. New York, NY, USA: Association for Computing Machinery. 2018-04-19: 1–13. ISBN 978-1-4503-5620-6. doi:10.1145/3173574.3173582.
^ Keyes, Os. The Misgendering Machines: Trans/HCI Implications of Automatic Gender Recognition. ACM Digital Library. 2018-11-01. doi:10.1145/3274357 （英语）.
^ Cheng, Myra; Durmus, Esin; Jurafsky, Dan. Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models. 2023-05-29. arXiv:2305.18189v1  [cs.CL] （英语）.
^ Segev, Elad; Manor, Ilan. What ChatGPT "thinks" about your country? Sentiments and frames of AI geographies. Policy & Internet (Wiley). 2025. doi:10.1002/poi3.70013.
^ ^55.0 ^55.1 Wang, Zixi; Xia, Haodong; Bao, Han Wu Shuang; Jing, Yiming; Gu, Ruolei. Artificial Intelligence Is Stereotypically Linked More with Socially Dominant Groups in Natural Language. Advanced Science (Weinheim, Baden-Wurttemberg, Germany). October 2025, 12 (39). ISSN 2198-3844. PMC 12533212  . PMID 40719013. doi:10.1002/advs.202508623. 已忽略未知参数|article-number= (帮助)
^ ^56.0 ^56.1 Louro, Celeste Rodriguez. AI systems are built on English - but not the kind most of the world speaks. The Conversation. 2025-05-05 [2026-04-11] （英语）.
^ Pretorius, Lynette; Huynh, Huy-Hoang; Pudyanti, Anak Agung Ayu Redi; Li, Ziqi; Noori, Abdul Qawi; Zhou, Zhiheng. Empowering international PhD students: Generative AI, Ubuntu, and the decolonisation of academic communication. The Internet and Higher Education. 2025, 67. doi:10.1016/j.iheduc.2025.101038  （英语）. 已忽略未知参数|article-number= (帮助)
^ The 'missed opportunity' with AI's linguistic diversity gap. World Economic Forum. [2026-04-11]. （原始内容存档于2026-03-06）（英语）.
^ Eacersall, Douglas; Pretorius, Lynette; Smirnov, Ivan; Spray, Erika; Illingworth, Sam; Chugh, Ritesh; Strydom, Sonja; Stratton-Maher, Dianne; Simmons, Jonathan; Jennings, Isaac; Roux, Rian; Kamrowski, Ruth; Downie, Abigail; Thong, Chee Ling; Howell, Katharine A. Navigating ethical challenges in generative AI-enhanced research: The ETHICAL framework for responsible generative AI use. Journal of Applied Learning & Teaching. 2025, 8 (2). doi:10.37074/jalt.2025.8.2.9  （英语）.
^ Feng, Shangbin; Park, Chan Young; Liu, Yuhan; Tsvetkov, Yulia. Rogers, Anna; Boyd-Graber, Jordan; Okazaki, Naoaki , 编. From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (Toronto, Canada: Association for Computational Linguistics). July 2023: 11737–11762. arXiv:2305.08283  . doi:10.18653/v1/2023.acl-long.656  .
^ Zhou, Karen; Tan, Chenhao. Bouamor, Houda; Pino, Juan; Bali, Kalika , 编. Entity-Based Evaluation of Political Bias in Automatic Summarization. Findings of the Association for Computational Linguistics: EMNLP 2023 (Singapore: Association for Computational Linguistics). December 2023: 10374–10386 [2023-12-25]. arXiv:2305.02321  . doi:10.18653/v1/2023.findings-emnlp.696  . （原始内容存档于2024-04-24）.
^ Peters, Uwe. Algorithmic Political Bias in Artificial Intelligence Systems. Philosophy & Technology.
^ ^63.0 ^63.1 Messer, Uwe. How do People React to Political Bias in Generative Artificial Intelligence (AI)?. Computers in Human Behavior: Artificial Humans.
^ Fisher, Jillian; Feng, Shangbin; Aron, Robert; Richardson, Thomas; Choi, Yejin; Fisher, Daniel W.; Pan, Jennifer; Tsvetkov, Yulia; Reinecke, Katharina, Biased AI can Influence Political Decision-Making, arXiv, 2024 [2026-04-12], doi:10.48550/ARXIV.2410.06415
^ Hammond, George. Big Tech is spending more than VC firms on AI startups. Ars Technica. 2023-12-27. （原始内容存档于2024-01-10）（美国英语）.
^ Wong, Matteo. The Future of AI Is GOMA . The Atlantic. 2023-10-24. （原始内容存档于2024-01-05）（英语）.
^ Big tech and the pursuit of AI dominance . The Economist. 2023-03-26. （原始内容存档于2023-12-29）.
^ Fung, Brian. Where the battle to dominate AI may be won. CNN Business. 2023-12-19. （原始内容存档于2024-01-13）（英语）.
^ Metz, Cade. In the Age of A.I., Tech's Little Guys Need Big Friends. The New York Times. 2023-07-05 [2024-07-17]. （原始内容存档于2024-07-08）.
^ ^70.0 ^70.1 ^70.2 Sigalos, MacKenzie. Dust to data centers: The year AI tech giants, and billions in debt, began remaking the American landscape. CNBC. 2026-01-01 [2026-04-13] （英语）.
^ ^71.0 ^71.1 Hutchinson, Hutchinson, Christophe Samuel. Potential abuses of dominance by big tech through their use of Big Data and AI. Journal of Antitrust Enforcement. 2022, 10 (3): 443–468.
^ ^72.0 ^72.1 Explained: Generative AI's environmental impact. MIT News. 2025-01-17 [2025-10-03] （英语）.
^ ^73.0 ^73.1 AI energy demand to climb in 2025–26 despite efficiency gains. Bloomberg Professional Services. 2025-04-11 [2025-10-07] （美国英语）.
^ ^74.0 ^74.1 ^74.2 Kanungo, Alokya. The Real Environmental Impact of AI. Earth.Org. 2023-07-18 [2025-10-03] （英语）.
^ Bozkurt et al. "AI’s Thirst, AI’s Heat, AI’s Waste: Exposing the Hidden Environmental Impact of Every Artificial Intelligence Interaction"
^ Bozkurt et al."AI’s Thirst, AI’s Heat, AI’s Waste: Exposing the Hidden Environmental Impact of Every Artificial Intelligence Interaction"
^ Bozkurt et al. "AI’s Thirst, AI’s Heat, AI’s Waste: Exposing the Hidden Environmental Impact of Every Artificial Intelligence Interaction"
^ ^78.0 ^78.1 Coleman, Jude. AI's Climate Impact Goes beyond Its Emissions. Scientific American. [2025-10-03] （英语）.
^ Open Source AI. 互联网档案馆的存檔，存档日期2016-03-04. Bill Hibbard. 2008 proceedings 互联网档案馆的存檔，存档日期2024-09-25. of the First Conference on Artificial General Intelligence, eds. Pei Wang, Ben Goertzel, and Stan Franklin.
^ Stewart, Ashley; Melton, Monica. Hugging Face CEO says he's focused on building a 'sustainable model' for the $4.5 billion open-source-AI startup. Business Insider. [2024-04-07]. （原始内容存档于2024-09-25）（美国英语）.
^ The open-source AI boom is built on Big Tech's handouts. How long will it last?. MIT Technology Review. [2024-04-07]. （原始内容存档于2024-01-05）（英语）.
^ Yao, Deborah. Google Unveils Open Source Models to Rival Meta, Mistral. AI Business. 2024-02-21.
^ 7001-2021 – IEEE Standard for Transparency of Autonomous Systems. IEEE. 2022-03-04: 1–54. ISBN 978-1-5044-8311-7. S2CID 252589405. doi:10.1109/IEEESTD.2022.9726144. .
^ Kamila, Manoj Kumar; Jasrotia, Sahil Singh. Ethical issues in the development of artificial intelligence: recognizing the risks. International Journal of Ethics and Systems. 2023-01-01, 41: 45–63. ISSN 2514-9369. S2CID 259614124. doi:10.1108/IJOES-05-2023-0107.
^ Thurm, Scott. Microsoft Calls For Federal Regulation of Facial Recognition. Wired. 2018-07-13 [2019-01-10]. （原始内容存档于2019-05-09）.
^ Piper, Kelsey. Should we make our most powerful AI models open source to all?. Vox. 2024-02-02 [2024-04-07] （英语）.
^ Vincent, James. OpenAI co-founder on company's past approach to openly sharing research: "We were wrong". The Verge. 2023-03-15 [2024-04-07]. （原始内容存档于2023-03-17）（英语）.
^ Stack Overflow Will Charge AI Giants for Training Data. WIRED. 2023-04-28 [2025-04-03].
^ Open source devs say AI crawlers dominate traffic, forcing blocks on entire countries. Ars Technica. 2025-03-25 [2025-04-03].
^ AI bots strain Wikimedia as bandwidth surges 50%. Ars Technica. 2025-04-02 [2025-04-03].
^ von Eschenbach, Warren J. Transparency and the Black Box Problem: Why We Do Not Trust AI. Philosophy & Technology. 2021-12-01, 34 (4): 1607–1622. ISSN 2210-5441. doi:10.1007/s13347-021-00477-0 （英语）.
^ Lund, Brady; Orhan, Zeynep; Mannuru, Nishith Reddy; Bevara, Ravi Varma Kumar; Porter, Brett; Vinaih, Meka Kasi; Bhaskara, Padmapadanand. Standards, frameworks, and legislation for artificial intelligence (AI) transparency. AI and Ethics. 2025-01-29, 5 (4): 3639–3655. ISSN 2730-5961. doi:10.1007/s43681-025-00661-4 （英语）.
^ Inside The Mind Of A.I. 互联网档案馆的存檔，存档日期2021-08-10. – Cliff Kuang interview
^ What Is AI Interpretability? | IBM. www.ibm.com. 2024-10-08 [2025-07-03] （英语）.
^ Li, Fan; Ruijs, Nick; Lu, Yuan. Ethics & AI: A Systematic Review on Ethical Concerns and Related Strategies for Designing with AI in Healthcare. AI. 2022-12-31, 4 (1): 28–53. ISSN 2673-2688. doi:10.3390/ai4010003  （英语）.
^ Shabankareh, Mohammadjavad; Khamoushi Sahne, Seyed Sina; Nazarian, Alireza; Foroudi, Pantea. The impact of AI perceived transparency on trust in AI recommendations in healthcare applications. Asia-Pacific Journal of Business Administration. 2025-01-01,. ahead-of-print (ahead-of-print). ISSN 1757-4331. doi:10.1108/APJBA-12-2024-0690.
^ Xu, Hanhui; Shuttleworth, Kyle Michael James. Medical artificial intelligence and the black box problem: a view based on the ethical principle of "do no harm". Intelligent Medicine. 2024-02-01, 4 (1): 52–57. ISSN 2667-1026. doi:10.1016/j.imed.2023.08.001.
^ Howard, Ayanna. The Regulation of AI – Should Organizations Be Worried? | Ayanna Howard. MIT Sloan Management Review. 2019-07-29 [2019-08-14]. （原始内容存档于2019-08-14）.
^ Trust in artificial intelligence – A five country study (PDF). KPMG. March 2021 [2023-10-06]. （原始内容存档 (PDF)于2023-10-01）.
^ Bastin, Roland; Wantz, Georges. The General Data Protection Regulation Cross-industry innovation (PDF). Inside magazine. Deloitte. June 2017 [2019-01-10]. （原始内容存档 (PDF)于2019-01-10）.
^ UN artificial intelligence summit aims to tackle poverty, humanity's 'grand challenges'. UN News. 2017-06-07 [2019-07-26]. （原始内容存档于2019-07-26）.
^ Artificial intelligence – Organisation for Economic Co-operation and Development. www.oecd.org. [2019-07-26]. （原始内容存档于2019-07-22）.
^ Anonymous. The European AI Alliance. Digital Single Market – European Commission. 2018-06-14 [2019-07-26]. （原始内容存档于2019-08-01）.
^ European Commission High-Level Expert Group on AI. Policy and investment recommendations for trustworthy Artificial Intelligence. Shaping Europe's digital future – European Commission. 2019-06-26 [2020-03-16]. （原始内容存档于2020-02-26）（英语）.
^ Fukuda-Parr, Sakiko; Gibbons, Elizabeth. Emerging Consensus on 'Ethical AI': Human Rights Critique of Stakeholder Guidelines. Global Policy. July 2021, 12 (S6): 32–44. ISSN 1758-5880. doi:10.1111/1758-5899.12965  （英语）.
^ EU Tech Policy Brief: July 2019 Recap. Center for Democracy & Technology. 2019-08-02 [2019-08-09]. （原始内容存档于2019-08-09）.
^ Curtis, Caitlin; Gillespie, Nicole; Lockey, Steven. AI-deploying organizations are key to addressing 'perfect storm' of AI risks. AI and Ethics. 2022-05-24, 3 (1): 145–153. ISSN 2730-5961. PMC 9127285  . PMID 35634256. doi:10.1007/s43681-022-00163-7 （英语）.
^ ^108.0 ^108.1 ^108.2 EU AI Act: first regulation on artificial intelligence. European Parliament. 2023-08-06 [2025-07-15] （英语）.
^ ^109.0 ^109.1 AI Act enters into force. European Commission. 2024-08-01 [2025-07-15] （英语）.
^ Ciaramella, Alberto; Ciaramella, Marco. Introduction to Artificial Intelligence: from data analysis to generative AI. Intellisemantic Editions. 2024: 217. ISBN 978-88-947876-0-3.
^ 引用错误：没有为名为Spindler-20232的参考文献提供内容
^ Challen, Robert; Denny, Joshua; Pitt, Martin; Gompels, Luke; Edwards, Tom; Tsaneva-Atanasova, Krasimira. Artificial intelligence, bias and clinical safety. BMJ Quality & Safety. March 2019, 28 (3): 231–237. ISSN 2044-5415. PMC 6560460  . PMID 30636200. doi:10.1136/bmjqs-2018-008370  （英语）.
^ ^113.0 ^113.1 Thomas Metzinger. Artificial Suffering: An Argument for a Global Moratorim on Synthetic Phenomenology. Journal of Artificial Intelligence and Consciousness. February 2021, 8: 43–66. S2CID 233176465. doi:10.1142/S270507852150003X  .
^ ^114.0 ^114.1 Agarwal A, Edelman S. Functionally effective conscious AI without suffering. Journal of Artificial Intelligence and Consciousness. 2020, 7: 39–50. S2CID 211096533. arXiv:2002.05256  . doi:10.1142/S2705078520300030.
^ Roose, Kevin. If A.I. Systems Become Conscious, Should They Have Rights?. The New York Times. 2025-04-24 [2025-04-24]. ISSN 0362-4331 （美国英语）.
^ Birch, Jonathan. Animal sentience and the precautionary principle. Animal Sentience. 2017-01-01, 2 (16) [2024-07-08]. ISSN 2377-7478. doi:10.51291/2377-7478.1200  . （原始内容存档于2024-08-11）.
^ Macrae, Carl. Learning from the Failure of Autonomous and Intelligent Systems: Accidents, Safety, and Sociotechnical Sources of Risk. Risk Analysis. September 2022, 42 (9): 1999–2025. Bibcode:2022RiskA..42.1999M. ISSN 0272-4332. PMID 34814229. doi:10.1111/risa.13850 （英语）.
^ Chalmers, David. Could a Large Language Model be Conscious?. March 2023. arXiv:2303.07103v1  [Science Computer Science].
^ Edwards, Benj. Anthropic hires its first "AI welfare" researcher. Ars Technica. 2024-11-11 [2025-04-24] （英语）.
^ Wiggers, Kyle. Anthropic is launching a new program to study AI 'model welfare'. TechCrunch. 2025-04-24 [2025-04-27] （美国英语）.
^ Shulman, Carl; Bostrom, Nick. Sharing the World with Digital Minds (PDF). Rethinking Moral Status. August 2021: 306–326. ISBN 978-0-19-289407-6. doi:10.1093/oso/9780192894076.003.0018.
^ Fisher, Richard. The intelligent monster that you should let eat you. BBC News. 2020-11-13 [2021-02-12] （英语）.
^ ^123.0 ^123.1 Joseph Weizenbaum, quoted in McCorduck 2004，第356, 374–376頁
^ Kaplan, Andreas; Haenlein, Michael. Siri, Siri, in my hand: Who's the fairest in the land? On the interpretations, illustrations, and implications of artificial intelligence. Business Horizons. January 2019, 62 (1): 15–25. S2CID 158433736. doi:10.1016/j.bushor.2018.08.004.
^
- Weizenbaum, Joseph. Computer Power and Human Reason. San Francisco: W.H. Freeman & Company. 1976. ISBN 978-0-7167-0464-5.
- Template:McCorduck 2004, pp. 132–144
^ Davies, Alex. Google's Self-Driving Car Caused Its First Crash. Wired. 2016-02-29 [2019-07-26]. （原始内容存档于2019-07-07）. 参数|magazine=与模板{{cite news}}不匹配（建议改用{{cite magazine}}或|newspaper=） (帮助)
^ Levin, Sam; Wong, Julia Carrie. Self-driving Uber kills Arizona woman in first fatal crash involving pedestrian. The Guardian. 2018-03-19 [2019-07-26]. （原始内容存档于2019-07-26）.
^ Who is responsible when a self-driving car has an accident?. Futurism. 2018-01-30 [2019-07-26]. （原始内容存档于2019-07-26）.
^ Autonomous Car Crashes: Who – or What – Is to Blame?. Knowledge@Wharton. Law and Public Policy (Radio Business North America Podcasts). [2019-07-26]. （原始内容存档于2019-07-26）.
^ Delbridge, Emily. Driverless Cars Gone Wild. The Balance. [2019-05-29]. （原始内容存档于2019-05-29）.
^ Stilgoe, Jack, Who Killed Elaine Herzberg? , Who's Driving Innovation? (Cham: Springer International Publishing), 2020: 1–6 [2020-11-11], ISBN 978-3-030-32319-6, S2CID 214359377, doi:10.1007/978-3-030-32320-2_1, （原始内容存档于2021-03-18）（英语）
^ Maxmen, Amy. Self-driving car dilemmas reveal that moral choices are not universal. Nature. October 2018, 562 (7728): 469–470. Bibcode:2018Natur.562..469M. PMID 30356197. doi:10.1038/d41586-018-07135-0  .
^ Regulations for driverless cars. GOV.UK. [2019-07-26]. （原始内容存档于2019-07-26）.
^ Automated Driving: Legislative and Regulatory Action – CyberWiki. cyberlaw.stanford.edu. [2019-07-26]. （原始内容存档于2019-07-26）.
^ Autonomous Vehicles | Self-Driving Vehicles Enacted Legislation. www.ncsl.org. [2019-07-26]. （原始内容存档于2019-07-26）.
^ Etzioni, Amitai; Etzioni, Oren. Incorporating Ethics into Artificial Intelligence. The Journal of Ethics. 2017-12-01, 21 (4): 403–418. ISSN 1572-8609. S2CID 254644745. doi:10.1007/s10892-017-9252-2 （英语）.
^ Call for debate on killer robots 互联网档案馆的存檔，存档日期2009-08-07., By Jason Palmer, Science and technology reporter, BBC News, 8/3/09.
^ ^138.0 ^138.1 Science New Navy-funded Report Warns of War Robots Going "Terminator" 互联网档案馆的存檔，存档日期2009-07-28., by Jason Mick (Blog), dailytech.com, February 17, 2009.
^ ^139.0 ^139.1 Navy report warns of robot uprising, suggests a strong moral compass 互联网档案馆的存檔，存档日期2011-06-04., by Joseph L. Flatley engadget.com, Feb 18th 2009.
^ AAAI Presidential Panel on Long-Term AI Futures 2008–2009 Study 互联网档案馆的存檔，存档日期2009-08-28., Association for the Advancement of Artificial Intelligence, Accessed 7/26/09.
^ United States. Defense Innovation Board. AI principles: recommendations on the ethical use of artificial intelligence by the Department of Defense. OCLC 1126650738.
^ Umbrello, Steven; Torres, Phil; De Bellis, Angelo F. The future of war: could lethal autonomous weapons make conflict more ethical? . AI & Society. March 2020, 35 (1): 273–282 [2020-11-11]. ISSN 0951-5666. S2CID 59606353. doi:10.1007/s00146-019-00879-x. hdl:2318/1699364. （原始内容存档于2021-01-05）（英语）.
^ Jamison, Miles. DARPA Launches Ethics Program for Autonomous Systems. executivegov.com. 2024-12-20 [2025-01-02] （美国英语）.
^ DARPA's ASIMOV seeks to develop Ethical Standards for Autonomous Systems. Space Daily. 2024-12-20 [2025-01-02].
^ Hellström, Thomas. On the moral responsibility of military robots. Ethics and Information Technology. June 2013, 15 (2): 99–107. S2CID 15205810. doi:10.1007/s10676-012-9301-2. ProQuest 1372020233.
^ Mitra, Ambarish. We can train AI to identify good and evil, and then use it to teach us morality. Quartz. 2018-04-05 [2019-07-26]. （原始内容存档于2019-07-26）.
^ Dominguez, Gabriel. South Korea developing new stealthy drones to support combat aircraft. The Japan Times. 2022-08-23 [2023-06-14].
^ AI Principles. Future of Life Institute. 2017-08-11 [2019-07-26]. （原始内容存档于2017-12-11）.
^ ^149.0 ^149.1 Zach Musgrave and Bryan W. Roberts. Why Artificial Intelligence Can Too Easily Be Weaponized – The Atlantic. The Atlantic. 2015-08-14 [2017-03-06]. （原始内容存档于2017-04-11）.
^ Cat Zakrzewski. Musk, Hawking Warn of Artificial Intelligence Weapons. WSJ. 2015-07-27 [2017-08-04]. （原始内容存档于2015-07-28）.
^ Potential Risks from Advanced Artificial Intelligence. Open Philanthropy. 2015-08-11 [2024-04-07] （美国英语）.
^ ^152.0 ^152.1 Bachulska, Alicja; Leonard, Mark; Oertel, Janka. The Idea of China: Chinese Thinkers on Power, Progress, and People (EPUB). Berlin, Germany: European Council on Foreign Relations. 2024-07-02 [2024-07-22]. ISBN 978-1-916682-42-9. （原始内容存档于2024-07-17）.
^ GGE on lethal autonomous weapons systems. Digital Watch Observatory. 2025-11-27 [2026-04-26].
^ Statements at the First 2025 GGE LAWS Session. APILS. 2025-03-09 [2026-04-26].
^ Brandon Vigliarolo. International military AI summit ends with 60-state pledge. www.theregister.com. [2023-02-17] （英语）.
^ ^156.0 ^156.1 Markoff, John. Scientists Worry Machines May Outsmart Man. The New York Times. 2009-07-25 [2017-02-24]. （原始内容存档于2017-02-25）.
^ Muehlhauser, Luke, and Louie Helm. 2012. "Intelligence Explosion and Machine Ethics" 互联网档案馆的存檔，存档日期2015-05-07.. In Singularity Hypotheses: A Scientific and Philosophical Assessment, edited by Amnon Eden, Johnny Søraker, James H. Moor, and Eric Steinhart. Berlin: Springer.
^ Bostrom, Nick. 2003. "Ethical Issues in Advanced Artificial Intelligence" 互联网档案馆的存檔，存档日期2018-10-08.. In Cognitive, Emotive and Ethical Aspects of Decision Making in Humans and in Artificial Intelligence, edited by Iva Smit and George E. Lasker, 12–17. Vol. 2. Windsor, ON: International Institute for Advanced Studies in Systems Research / Cybernetics.
^ Bostrom, Nick. Superintelligence: paths, dangers, strategies. Oxford, United Kingdom: Oxford University Press. 2017. ISBN 978-0-19-967811-2.
^ Umbrello, Steven; Baum, Seth D. Evaluating future nanotechnology: The net societal impacts of atomically precise manufacturing. Futures. 2018-06-01, 100: 63–73 [2020-11-29]. ISSN 0016-3287. S2CID 158503813. doi:10.1016/j.futures.2018.04.007. hdl:2318/1685533  . （原始内容存档于2019-05-09）（英语）.
^ Yudkowsky, Eliezer. 2011. "Complex Value Systems in Friendly AI" 互联网档案馆的存檔，存档日期2015-09-29.. In Schmidhuber, Thórisson, and Looks 2011, 388–393.
^ Russell, Stuart. Human Compatible: Artificial Intelligence and the Problem of Control. United States: Viking. 2019-10-08. ISBN 978-0-525-55861-3. OCLC 1083694322.
^ Yampolskiy, Roman V. Unpredictability of AI: On the Impossibility of Accurately Predicting All Actions of a Smarter Agent . Journal of Artificial Intelligence and Consciousness. 2020-03-01, 07 (1): 109–118 [2020-11-29]. ISSN 2705-0785. S2CID 218916769. doi:10.1142/S2705078520500034. （原始内容存档于2021-03-18）.
^ Wallach, Wendell; Vallor, Shannon, Moral Machines: From Value Alignment to Embodied Virtue , Ethics of Artificial Intelligence (Oxford University Press), 2020-09-17: 383–412 [2020-11-29], ISBN 978-0-19-090503-3, doi:10.1093/oso/9780190905033.003.0014, （原始内容存档于2020-12-08）（英语）
^ Umbrello, Steven. Beneficial Artificial Intelligence Coordination by Means of a Value Sensitive Design Approach. Big Data and Cognitive Computing. 2019, 3 (1): 5. doi:10.3390/bdcc3010005  . hdl:2318/1685727  （英语）.
^ Floridi, Luciano; Cowls, Josh; King, Thomas C.; Taddeo, Mariarosaria. How to Design AI for Social Good: Seven Essential Factors. Science and Engineering Ethics. 2020, 26 (3): 1771–1796. ISSN 1353-3452. PMC 7286860  . PMID 32246245. doi:10.1007/s11948-020-00213-5 （英语）.
^ Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations. Meta.com. [2024-12-06].
^ ^168.0 ^168.1 Šekrst, Kristina; McHugh, Jeremy; Cefalu, Jonathan Rodriguez. AI Ethics by Design: Implementing Customizable Guardrails for Responsible AI Development. 2024. arXiv:2411.14442  [cs.CY].
^ Nvidia NeMo Guardrails. Nvidia. [2024-12-06].
^ Inan, Hakan; Upasani, Kartikeya; Chi, Jianfeng; Rungta, Rashi; Iyer, Krithika; Mao, Yuning; Tontchev, Michael; Hu, Qing; Fuller, Brian; Testuggine, Davide; Khabsa, Madian. Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations. 2023. arXiv:2312.06674  [cs.CL].
^ Dong, Yi; Mu, Ronghui; Jin, Gaojie; Qi, Yi; Hu, Jinwei; Zhao, Xingyu; Meng, Jie; Ruan, Wenjie; Huang, Xiaowei. Building Guardrails for Large Language Models. 2024. arXiv:2402.01822  [cs].
^ Evans, Woody. Posthuman Rights: Dimensions of Transhuman Worlds. Teknokultura. 2015, 12 (2): 373–384. doi:10.5209/rev_TK.2015.v12.n2.49072.
^ Fiegerman, Seth. Facebook, Google, Amazon create group to ease AI concerns. CNNMoney. 2016-09-28 [2020-08-18]. （原始内容存档于2020-09-17）.
^ Slota, Stephen C.; Fleischmann, Kenneth R.; Greenberg, Sherri; Verma, Nitin; Cummings, Brenna; Li, Lan; Shenefiel, Chris. Locating the work of artificial intelligence ethics . Journal of the Association for Information Science and Technology. 2023, 74 (3): 311–322 [2023-07-21]. ISSN 2330-1635. S2CID 247342066. doi:10.1002/asi.24638. （原始内容存档于2024-09-25）（英语）.
^ Ethics guidelines for trustworthy AI. Shaping Europe's digital future – European Commission. European Commission. 2019-04-08 [2020-02-20]. （原始内容存档于2020-02-20）（英语）.
^ White Paper on Artificial Intelligence – a European approach to excellence and trust | Shaping Europe's digital future. 2020-02-19 [2021-03-18]. （原始内容存档于2021-03-06）.
^ Implementation Timeline | EU Artificial Intelligence Act. [2025-10-02] （美国英语）.
^ OECD AI Policy Observatory. [2021-03-18]. （原始内容存档于2021-03-08）.
^ Recommendation on the Ethics of Artificial Intelligence. UNESCO. 2021.
^ UNESCO member states adopt first global agreement on AI ethics. Helsinki Times. 2021-11-26 [2023-04-26]. （原始内容存档于2024-09-25）（英国英语）.
^ 《新一代人工智能伦理规范》发布. 中华人民共和国科学技术部. 2021-09-25.
^ 中国关于加强人工智能伦理治理的立场文件. 中华人民共和国外交部. 2022-11-16.
^ The Obama Administration's Roadmap for AI Policy. Harvard Business Review. 2016-12-21 [2021-03-16]. ISSN 0017-8012. （原始内容存档于2021-01-22）.
^ Accelerating America's Leadership in Artificial Intelligence – The White House. trumpwhitehouse.archives.gov. [2021-03-16]. （原始内容存档于2021-02-25）.
^ Request for Comments on a Draft Memorandum to the Heads of Executive Departments and Agencies, "Guidance for Regulation of Artificial Intelligence Applications". Federal Register. 2020-01-13 [2020-11-28]. （原始内容存档于2020-11-25）.
^ Sen. Thune, John. Text – S.3312 – 118th Congress (2023–2024): Artificial Intelligence Research, Innovation, and Accountability Act of 2024. www.congress.gov. 2024-12-18 [2025-05-25].
^ CCC Offers Draft 20-Year AI Roadmap; Seeks Comments. HPCwire. 2019-05-14 [2019-07-22]. （原始内容存档于2021-03-18）.
^ Request Comments on Draft: A 20-Year Community Roadmap for AI Research in the US » CCC Blog. 2019-05-13 [2019-07-22]. （原始内容存档于2019-05-14）.
^ （俄語） Интеллектуальные правила 互联网档案馆的存檔，存档日期2021-12-30. — Kommersant, 2021-11-25
^ Grace, Katja; Salvatier, John; Dafoe, Allan; Zhang, Baobao; Evans, Owain. When Will AI Exceed Human Performance? Evidence from AI Experts. 2018-05-03. arXiv:1705.08807  [cs.AI].
^ China wants to shape the global future of artificial intelligence. MIT Technology Review. [2020-11-29]. （原始内容存档于2020-11-20）（英语）.
^ Adam, David. Future of Humanity Institute shuts: what's next for 'deep future' research? . Nature. 2024-04-26, 629 (8010): 16–17. Bibcode:2024Natur.629...16A. PMID 38671273. doi:10.1038/d41586-024-01229-8 （英语）.
^ Floridi, Luciano; Cowls, Josh; Beltrametti, Monica; Chatila, Raja; Chazerand, Patrice; Dignum, Virginia; Luetge, Christoph; Madelin, Robert; Pagallo, Ugo; Rossi, Francesca; Schafer, Burkhard. AI4People—An Ethical Framework for a Good AI Society: Opportunities, Risks, Principles, and Recommendations. Minds and Machines. 2018-12-01, 28 (4): 689–707. ISSN 1572-8641. PMC 6404626  . PMID 30930541. doi:10.1007/s11023-018-9482-5 （英语）.
^ AI Governance. Oxford Martin School. [2025-02-19] （英语）.
^ Joanna J. Bryson. WIRED. [2023-01-13]. （原始内容存档于2023-03-15）.
^ New Artificial Intelligence Research Institute Launches. 2017-11-20 [2021-02-21]. （原始内容存档于2020-09-18）（美国英语）.
^ James J. Hughes; LaGrandeur, Kevin (编). Surviving the machine age: intelligent technology and the transformation of human work. Cham, Switzerland: Palgrave Macmillan Cham. 2017-03-15. ISBN 978-3-319-51165-8. OCLC 976407024.
^ Danaher, John. Automation and utopia: human flourishing in a world without work. Cambridge, Massachusetts: Harvard University Press. 2019. ISBN 978-0-674-24220-3. OCLC 1114334813.
^ TUM Institute for Ethics in Artificial Intelligence officially opened. www.tum.de. [2020-11-29]. （原始内容存档于2020-12-10）（英语）.
^ Communications, Paul Karoff SEAS. Harvard works to embed ethics in computer science curriculum. Harvard Gazette. 2019-01-25 [2023-04-06]. （原始内容存档于2024-09-25）（美国英语）.
^ Lee, Jennifer. When Bias Is Coded Into Our Technology. NPR. 2020-02-08 [2021-12-22] （英语）.
^ How one conference embraced diversity. Nature. 2018-12-12, 564 (7735): 161–162. PMID 31123357. S2CID 54481549. doi:10.1038/d41586-018-07718-x  （英语）.
^ Roose, Kevin. The 2020 Good Tech Awards. The New York Times. 2020-12-30 [2021-12-21]. ISSN 0362-4331 （美国英语）.
^ Lodge, Paul. Leibniz's Mill Argument Against Mechanical Materialism Revisited. Ergo: An Open Access Journal of Philosophy. 2014, 1 (20201214). ISSN 2330-4014. doi:10.3998/ergo.12405314.0001.003  . hdl:2027/spo.12405314.0001.003.
^ Bringsjord, Selmer; Govindarajulu, Naveen Sundar, Artificial Intelligence, Zalta, Edward N.; Nodelman, Uri (编), The Stanford Encyclopedia of Philosophy Summer 2020, Metaphysics Research Lab, Stanford University, 2020 [2023-12-08], （原始内容存档于2022-03-08）
^ Kulesz, O. (2018). "Culture, Platforms and Machines". UNESCO, Paris.
^ Jr, Henry C. Lucas. Information Technology and the Productivity Paradox: Assessing the Value of Investing in IT. Oxford University Press. 1999-04-29 [2024-02-21]. ISBN 978-0-19-802838-3. （原始内容存档于2024-09-25）（英语）.
^ Asimov, Isaac. I, Robot. New York: Bantam. 2008. ISBN 978-0-553-38256-3.
^ Bryson, Joanna; Diamantis, Mihailis; Grant, Thomas. Of, for, and by the people: the legal lacuna of synthetic persons. Artificial Intelligence and Law. September 2017, 25 (3): 273–291. doi:10.1007/s10506-017-9214-9  .
^ Principles of robotics. UK's EPSRC. September 2010 [2019-01-10]. （原始内容存档于2018-04-01）.
^ Yudkowsky, Eliezer. Why We Need Friendly AI. 3 laws unsafe. July 2004. （原始内容存档于2012-05-24）.
^ Aleksander, Igor. Partners of Humans: A Realistic Assessment of the Role of Robots in the Foreseeable Future . Journal of Information Technology. March 2017, 32 (1): 1–9 [2024-02-21]. ISSN 0268-3962. S2CID 5288506. doi:10.1057/s41265-016-0032-4. （原始内容存档于2024-02-21）（英语）.
^ Evolving Robots Learn To Lie To Each Other 互联网档案馆的存檔，存档日期2009-08-28., Popular Science, August 18, 2009

外部链接

编辑

人工智能伦理 — 《互联网哲学百科全书》
人工智能与机器人伦理 — 《斯坦福哲学百科全书》
《剑桥人工智能法律、伦理与政策手册》
Russell, S.; Hauert, S.; Altman, R.; Veloso, M. Robotics: Ethics of artificial intelligence. Nature. May 2015, 521 (7553): 415–418. Bibcode:2015Natur.521..415.. PMID 26017428. S2CID 4452826. doi:10.1038/521415a  .
AI伦理指南全球清单 — 来自 Algorithmwatch
Hagendorff, Thilo. The Ethics of AI Ethics: An Evaluation of Guidelines. Minds and Machines. March 2020, 30 (1): 99–120. S2CID 72940833. arXiv:1903.03425  . doi:10.1007/s11023-020-09517-8  .
Sheludko, M. (2023年12月). 人工智能的伦理方面：挑战与必要性. 软件开发博客.
Eisikovits, Nir. AI Is an Existential Threat—Just Not the Way You Think. Scientific American. [2024-03-04] （英语）.
Anwar, U.; Saparov, A.; Rando, J.; Paleka, D.; Turpin, M.; Hase, P.; Lubana, E. S.; Jenner, E.; Casper, S.; Sourbut, O.; Edelman, B. L.; Zhang, Z.; Günther, M.; Korinek, A.; Hernandez-Orallo, J.; Hammond, L.; Bigelow, E.; Pan, A.; Langosco, L.; Krueger, D. Foundational Challenges in Assuring Alignment and Safety of Large Language Models. 2024. arXiv:2404.09932  [cs.LG].

[Muller-2020-1] 1.0 ^1.1 Müller, Vincent C. Ethics of Artificial Intelligence and Robotics. 斯坦福哲学百科全书. 2020-04-30. （原始内容存档于2020-10-10）.

[:4-2] Assessing potential future artificial intelligence risks, benefits and policy imperatives. OECD. 2024-11-14 [2025-08-04] （英语）.

[Andersonweb-3] Anderson. Machine Ethics. [2011-06-27]. （原始内容存档于2011-09-28）.

[Anderson2011-4] Anderson, Michael; Anderson, Susan Leigh (编). Machine Ethics. Cambridge University Press. 2011-07. ISBN 978-0-521-11235-2.

[Anderson2006-5] Anderson, M.; Anderson, S.L. Guest Editors' Introduction: Machine Ethics. IEEE Intelligent Systems. 2006-07, 21 (4): 10–11. S2CID 9570832. doi:10.1109/mis.2006.70.

[Anderson2007-6] Anderson, Michael; Anderson, Susan Leigh. Machine Ethics: Creating an Ethical Intelligent Agent. AI Magazine. 2007-12-15, 28 (4): 15. S2CID 17033332. doi:10.1609/aimag.v28i4.2065.

[7] Boyles, Robert James M. Philosophical Signposts for Artificial Moral Agent Frameworks. Suri. 2017, 6 (2): 92–109.

[Winfield-2019-8] 8.0 ^8.1 Winfield, A. F.; Michael, K.; Pitt, J.; Evers, V. Machine Ethics: The Design and Governance of Ethical AI and Autonomous Systems [Scanning the Issue]. Proceedings of the IEEE. 2019-03, 107 (3): 509–517. ISSN 1558-2256. S2CID 77393713. doi:10.1109/JPROC.2019.2900622  .

[9] Al-Rodhan, Nayef. The Moral Code. 2015-12-07 [2017-03-04]. （原始内容存档于2017-03-05）.

[:7-10] Sauer, Megan. Elon Musk says humans could eventually download their brains into robots — and Grimes thinks Jeff Bezos would do it. CNBC. 2022-04-08 [2024-04-07]. （原始内容存档于2024-09-25）（英语）.

[:8-11] Anadiotis, George. Massaging AI language models for fun, profit and ethics. ZDNET. 2022-04-04 [2024-04-07]. （原始内容存档于2024-09-25）（英语）.

[Wallach2008-12] Wallach, Wendell; Allen, Colin. Moral Machines: Teaching Robots Right from Wrong. USA: Oxford University Press. 2008-11. ISBN 978-0-19-537404-9.

[13] Bostrom, Nick; Yudkowsky, Eliezer. The Ethics of Artificial Intelligence (PDF). Cambridge Handbook of Artificial Intelligence. Cambridge Press. 2011 [2011-06-22]. （原始内容存档 (PDF)于2016-03-04）.

[SantosLang2002-14] Santos-Lang, Chris. Ethics for Artificial Intelligences. 2002 [2015-01-04]. （原始内容存档于2014-12-25）.

[15] Russell, Stuart J. Human Compatible: Artificial Intelligence and the Problem of Control. Viking. 2019. ISBN 978-0-525-55861-3.

[16] Šekrst, Kristina. The Illusion Engine: The Quest for Machine Consciousness. Springer. 2025. ISBN 978-3-032-05561-3.

[17] Gabriel, Iason. The case for fairer algorithms – Iason Gabriel. Medium. 2018-03-14 [2019-07-22]. （原始内容存档于2019-07-22）.

[18] 5 unexpected sources of bias in artificial intelligence. TechCrunch. 2016-12-10 [2019-07-22]. （原始内容存档于2021-03-18）.

[Knight03102017-19] Knight, Will. Google's AI chief says forget Elon Musk's killer robots, and worry about bias in AI systems instead. MIT Technology Review. [2019-07-22]. （原始内容存档于2019-07-04）.

[20] Villasenor, John. Artificial intelligence and bias: Four key challenges. Brookings. 2019-01-03 [2019-07-22]. （原始内容存档于2019-07-22）.

[21] Goodman, Emma. Rethinking data power: beyond AI hype and corporate ethics. LSE Blogs. 2025-06-06 [2025-06-07].

[22] Friedman, Batya; Nissenbaum, Helen. Bias in computer systems. ACM Transactions on Information Systems. July 1996, 14 (3): 330–347. S2CID 207195759. doi:10.1145/230538.230561  .

[23] Eliminating bias in AI. techxplore.com. [2019-07-26]. （原始内容存档于2019-07-25）.

[24] Abdalla, Mohamed; Wahle, Jan Philip; Ruas, Terry; Névéol, Aurélie; Ducel, Fanny; Mohammad, Saif; Fort, Karen. Rogers, Anna; Boyd-Graber, Jordan; Okazaki, Naoaki , 编. The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (Toronto, Canada: Association for Computational Linguistics). 2023: 13141–13160 [2023-11-13]. arXiv:2305.02797  . doi:10.18653/v1/2023.acl-long.734  . （原始内容存档于2024-09-25）.

[25] Olson, Parmy. Google's DeepMind Has An Idea For Stopping Biased AI. Forbes. [2019-07-26]. （原始内容存档于2019-07-26）.

[26] Machine Learning Fairness | ML Fairness. Google Developers. [2019-07-26]. （原始内容存档于2019-08-10）.

[27] AI and bias – IBM Research – US. www.research.ibm.com. [2019-07-26]. （原始内容存档于2019-07-17）.

[28] Bender, Emily M.; Friedman, Batya. Data Statements for Natural Language Processing: Toward Mitigating System Bias and Enabling Better Science. Transactions of the Association for Computational Linguistics. December 2018, 6: 587–604. doi:10.1162/tacl_a_00041  .

[29] Gebru, Timnit; Morgenstern, Jamie; Vecchione, Briana; Vaughan, Jennifer Wortman; Wallach, Hanna; Daumé III, Hal; Crawford, Kate. Datasheets for Datasets. 2018. arXiv:1803.09010  [cs.DB].

[30] Pery, Andrew. Trustworthy Artificial Intelligence and Process Mining: Challenges and Opportunities. DeepAI. 2021-10-06 [2022-02-18]. （原始内容存档于2022-02-18）.

[31] Ruggieri, Salvatore; Alvarez, Jose M.; Pugnana, Andrea; State, Laura; Turini, Franco. Can We Trust Fair-AI?. Proceedings of the AAAI Conference on Artificial Intelligence (Association for the Advancement of Artificial Intelligence (AAAI)). 2023-06-26, 37 (13): 15421–15430. ISSN 2374-3468. S2CID 259678387. doi:10.1609/aaai.v37i13.26798  . hdl:11384/136444  .

[Buyl_De_Bie_p.2-32] Buyl, Maarten; De Bie, Tijl. Inherent Limitations of AI Fairness. Communications of the ACM. 2022, 67 (2): 48–55. arXiv:2212.06495  . doi:10.1145/3624700. hdl:1854/LU-01GMNH04RGNVWJ730BJJXGCY99.

[:6-33] 33.0 ^33.1 ^33.2 Knaus, Thomas. Why AI matters for education—an exploration in seven arguments. Zeitschrift für Bildungsforschung. 2025-10-23. ISSN 2190-6904. doi:10.1007/s35834-025-00511-7 （英语）.

[34] Ntoutsi, Eirini; Fafalios, Pavlos; Gadiraju, Ujwal; Iosifidis, Vasileios; Nejdl, Wolfgang; Vidal, Maria-Esther; Ruggieri, Salvatore; Turini, Franco; Papadopoulos, Symeon; Krasanakis, Emmanouil; Kompatsiaris, Ioannis; Kinder-Kurlanda, Katharina; Wagner, Claudia; Karimi, Fariba; Fernandez, Miriam. Bias in data-driven artificial intelligence systems—An introductory survey. WIREs Data Mining and Knowledge Discovery. May 2020, 10 (3) [2023-12-14]. ISSN 1942-4787. doi:10.1002/widm.1356. （原始内容存档于2024-09-25）（英语）. 已忽略未知参数|article-number= (帮助)

[35] Dastin, Jeffrey. Insight – Amazon scraps secret AI recruiting tool that showed bias against women. Reuters. 2018-10-11 [2025-06-30] （美国英语）.

[36] Amazon scraps secret AI recruiting tool that showed bias against women. Reuters. 2018-10-10 [2019-05-29]. （原始内容存档于2019-05-27）.

[37] Lohr, Steve. Facial Recognition Is Accurate, if You're a White Guy. The New York Times. 2018-02-09 [2019-05-29]. （原始内容存档于2019-01-09）.

[:1-38] Buolamwini, Joy; Gebru, Timnit. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification. Proceedings of Machine Learning Research, 81, 77-91. 81st. Proceedings of Machine Learning Research: 77–91. 2018 [2026-03-12].

[Federspiel_Frederik-39] 39.0 ^39.1 Federspiel, Frederik; Mitchell, Ruth; Asokan, Asha; Umana, Carlos; McCoy, David. Threats by artificial intelligence to human health and human existence. BMJ Global Health. May 2023, 8 (5). ISSN 2059-7908. PMC 10186390  . PMID 37160371. doi:10.1136/bmjgh-2022-010435. 已忽略未知参数|article-number= (帮助)

[40] Google apologises for Photos app's racist blunder. BBC News. 2015-07-01 [2026-03-12] （英国英语）.

[41] Simonite, Tom. When It Comes to Gorillas, Google Photos Remains Blind. Wired. 2018-01-11 [2026-03-12]. ISSN 1059-1028 （美国英语）.

[42] Manyika, James. Getting AI Right: Introductory Notes on AI & Society. Daedalus. 2022, 151 (2): 5–27. ISSN 0011-5266. doi:10.1162/daed_e_01897  .

[43] Koenecke, Allison; Nam, Andrew; Lake, Emily; Nudell, Joe; Quartey, Minnie; Mengesha, Zion; Toups, Connor; Rickford, John R.; Jurafsky, Dan; Goel, Sharad. Racial disparities in automated speech recognition. Proceedings of the National Academy of Sciences. 2020-04-07, 117 (14): 7684–7689. Bibcode:2020PNAS..117.7684K. PMC 7149386  . PMID 32205437. doi:10.1073/pnas.1915768117  .

[44] Cirillo, Davide; Catuara-Solarz, Silvina; Morey, Czuee; Guney, Emre; Subirats, Laia; Mellino, Simona; Gigante, Annalisa; Valencia, Alfonso; Rementeria, María José; Chadha, Antonella Santuccione; Mavridis, Nikolaos. Sex and gender differences and biases in artificial intelligence for biomedicine and healthcare. npj Digital Medicine. 2020-06-01, 3 (1): 81. ISSN 2398-6352. PMC 7264169  . PMID 32529043. doi:10.1038/s41746-020-0288-5  （英语）.

[45] Christian, Brian. The alignment problem: machine learning and human values First published as a Norton paperback. New York, NY: W. W. Norton & Company. 2021. ISBN 978-0-393-86833-3.

[46] Ntoutsi, Eirini; Fafalios, Pavlos; Gadiraju, Ujwal; Iosifidis, Vasileios; Nejdl, Wolfgang; Vidal, Maria-Esther; Ruggieri, Salvatore; Turini, Franco; Papadopoulos, Symeon; Krasanakis, Emmanouil; Kompatsiaris, Ioannis; Kinder-Kurlanda, Katharina; Wagner, Claudia; Karimi, Fariba; Fernandez, Miriam. Bias in data-driven artificial intelligence systems—An introductory survey. WIREs Data Mining and Knowledge Discovery. May 2020, 10 (3). ISSN 1942-4787. doi:10.1002/widm.1356  （英语）. 已忽略未知参数|article-number= (帮助)

[47] Busker, Tony; Choenni, Sunil; Shoae Bargh, Mortaza. Stereotypes in ChatGPT: An empirical study. Proceedings of the 16th International Conference on Theory and Practice of Electronic Governance. ICEGOV '23. New York, NY, USA: Association for Computing Machinery. 2023-11-20: 24–32. ISBN 979-8-4007-0742-1. doi:10.1145/3614321.3614325  .

[48] Kotek, Hadas; Dockum, Rikker; Sun, David. Gender bias and stereotypes in Large Language Models. Proceedings of the ACM Collective Intelligence Conference. CI '23. New York, NY, USA: Association for Computing Machinery. 2023-11-05: 12–24. ISBN 979-8-4007-0113-9. arXiv:2308.14921  . doi:10.1145/3582269.3615599  .

[49] Wang, Tianlu; Zhao, Jieyu; Yatskar, Mark; Chang, Kai-Wei; Ordonez, Vicente, Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations, 2019-10-11 [2026-03-12], arXiv:1811.08489 

[50] Albiero, Vítor; S, Krishnapriya K.; Vangara, Kushal; Zhang, Kai; King, Michael C.; Bowyer, Kevin W., Analysis of Gender Inequality In Face Recognition Accuracy, 2020-01-31 [2026-03-12], arXiv:2002.00065 

[51] Hamidi, Foad; Scheuerman, Morgan Klaus; Branham, Stacy M. Gender Recognition or Gender Reductionism?: The Social Implications of Embedded Gender Recognition Systems. Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. CHI '18. New York, NY, USA: Association for Computing Machinery. 2018-04-19: 1–13. ISBN 978-1-4503-5620-6. doi:10.1145/3173574.3173582.

[52] Keyes, Os. The Misgendering Machines: Trans/HCI Implications of Automatic Gender Recognition. ACM Digital Library. 2018-11-01. doi:10.1145/3274357 （英语）.

[53] Cheng, Myra; Durmus, Esin; Jurafsky, Dan. Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models. 2023-05-29. arXiv:2305.18189v1  [cs.CL] （英语）.

[54] Segev, Elad; Manor, Ilan. What ChatGPT "thinks" about your country? Sentiments and frames of AI geographies. Policy & Internet (Wiley). 2025. doi:10.1002/poi3.70013.

[Wang-55] 55.0 ^55.1 Wang, Zixi; Xia, Haodong; Bao, Han Wu Shuang; Jing, Yiming; Gu, Ruolei. Artificial Intelligence Is Stereotypically Linked More with Socially Dominant Groups in Natural Language. Advanced Science (Weinheim, Baden-Wurttemberg, Germany). October 2025, 12 (39). ISSN 2198-3844. PMC 12533212  . PMID 40719013. doi:10.1002/advs.202508623. 已忽略未知参数|article-number= (帮助)

[Louro-56] 56.0 ^56.1 Louro, Celeste Rodriguez. AI systems are built on English - but not the kind most of the world speaks. The Conversation. 2025-05-05 [2026-04-11] （英语）.

[57] Pretorius, Lynette; Huynh, Huy-Hoang; Pudyanti, Anak Agung Ayu Redi; Li, Ziqi; Noori, Abdul Qawi; Zhou, Zhiheng. Empowering international PhD students: Generative AI, Ubuntu, and the decolonisation of academic communication. The Internet and Higher Education. 2025, 67. doi:10.1016/j.iheduc.2025.101038  （英语）. 已忽略未知参数|article-number= (帮助)

[58] The 'missed opportunity' with AI's linguistic diversity gap. World Economic Forum. [2026-04-11]. （原始内容存档于2026-03-06）（英语）.

[59] Eacersall, Douglas; Pretorius, Lynette; Smirnov, Ivan; Spray, Erika; Illingworth, Sam; Chugh, Ritesh; Strydom, Sonja; Stratton-Maher, Dianne; Simmons, Jonathan; Jennings, Isaac; Roux, Rian; Kamrowski, Ruth; Downie, Abigail; Thong, Chee Ling; Howell, Katharine A. Navigating ethical challenges in generative AI-enhanced research: The ETHICAL framework for responsible generative AI use. Journal of Applied Learning & Teaching. 2025, 8 (2). doi:10.37074/jalt.2025.8.2.9  （英语）.

[60] Feng, Shangbin; Park, Chan Young; Liu, Yuhan; Tsvetkov, Yulia. Rogers, Anna; Boyd-Graber, Jordan; Okazaki, Naoaki , 编. From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (Toronto, Canada: Association for Computational Linguistics). July 2023: 11737–11762. arXiv:2305.08283  . doi:10.18653/v1/2023.acl-long.656  .

[61] Zhou, Karen; Tan, Chenhao. Bouamor, Houda; Pino, Juan; Bali, Kalika , 编. Entity-Based Evaluation of Political Bias in Automatic Summarization. Findings of the Association for Computational Linguistics: EMNLP 2023 (Singapore: Association for Computational Linguistics). December 2023: 10374–10386 [2023-12-25]. arXiv:2305.02321  . doi:10.18653/v1/2023.findings-emnlp.696  . （原始内容存档于2024-04-24）.

[62] Peters, Uwe. Algorithmic Political Bias in Artificial Intelligence Systems. Philosophy & Technology.

[:9-63] 63.0 ^63.1 Messer, Uwe. How do People React to Political Bias in Generative Artificial Intelligence (AI)?. Computers in Human Behavior: Artificial Humans.

[64] Fisher, Jillian; Feng, Shangbin; Aron, Robert; Richardson, Thomas; Choi, Yejin; Fisher, Daniel W.; Pan, Jennifer; Tsvetkov, Yulia; Reinecke, Katharina, Biased AI can Influence Political Decision-Making, arXiv, 2024 [2026-04-12], doi:10.48550/ARXIV.2410.06415

[65] Hammond, George. Big Tech is spending more than VC firms on AI startups. Ars Technica. 2023-12-27. （原始内容存档于2024-01-10）（美国英语）.

[66] Wong, Matteo. The Future of AI Is GOMA . The Atlantic. 2023-10-24. （原始内容存档于2024-01-05）（英语）.

[67] Big tech and the pursuit of AI dominance . The Economist. 2023-03-26. （原始内容存档于2023-12-29）.

[68] Fung, Brian. Where the battle to dominate AI may be won. CNN Business. 2023-12-19. （原始内容存档于2024-01-13）（英语）.

[69] Metz, Cade. In the Age of A.I., Tech's Little Guys Need Big Friends. The New York Times. 2023-07-05 [2024-07-17]. （原始内容存档于2024-07-08）.

[:10-70] 70.0 ^70.1 ^70.2 Sigalos, MacKenzie. Dust to data centers: The year AI tech giants, and billions in debt, began remaking the American landscape. CNBC. 2026-01-01 [2026-04-13] （英语）.

[:11-71] 71.0 ^71.1 Hutchinson, Hutchinson, Christophe Samuel. Potential abuses of dominance by big tech through their use of Big Data and AI. Journal of Antitrust Enforcement. 2022, 10 (3): 443–468.

[:02-72] 72.0 ^72.1 Explained: Generative AI's environmental impact. MIT News. 2025-01-17 [2025-10-03] （英语）.

[:5-73] 73.0 ^73.1 AI energy demand to climb in 2025–26 despite efficiency gains. Bloomberg Professional Services. 2025-04-11 [2025-10-07] （美国英语）.

[:12-74] 74.0 ^74.1 ^74.2 Kanungo, Alokya. The Real Environmental Impact of AI. Earth.Org. 2023-07-18 [2025-10-03] （英语）.

[75] Bozkurt et al. "AI’s Thirst, AI’s Heat, AI’s Waste: Exposing the Hidden Environmental Impact of Every Artificial Intelligence Interaction"

[76] Bozkurt et al."AI’s Thirst, AI’s Heat, AI’s Waste: Exposing the Hidden Environmental Impact of Every Artificial Intelligence Interaction"

[77] Bozkurt et al. "AI’s Thirst, AI’s Heat, AI’s Waste: Exposing the Hidden Environmental Impact of Every Artificial Intelligence Interaction"

[:23-78] 78.0 ^78.1 Coleman, Jude. AI's Climate Impact Goes beyond Its Emissions. Scientific American. [2025-10-03] （英语）.

[AGI-08a-79] Open Source AI. 互联网档案馆的存檔，存档日期2016-03-04. Bill Hibbard. 2008 proceedings 互联网档案馆的存檔，存档日期2024-09-25. of the First Conference on Artificial General Intelligence, eds. Pei Wang, Ben Goertzel, and Stan Franklin.

[80] Stewart, Ashley; Melton, Monica. Hugging Face CEO says he's focused on building a 'sustainable model' for the $4.5 billion open-source-AI startup. Business Insider. [2024-04-07]. （原始内容存档于2024-09-25）（美国英语）.

[81] The open-source AI boom is built on Big Tech's handouts. How long will it last?. MIT Technology Review. [2024-04-07]. （原始内容存档于2024-01-05）（英语）.

[82] Yao, Deborah. Google Unveils Open Source Models to Rival Meta, Mistral. AI Business. 2024-02-21.

[P7001-83] 7001-2021 – IEEE Standard for Transparency of Autonomous Systems. IEEE. 2022-03-04: 1–54. ISBN 978-1-5044-8311-7. S2CID 252589405. doi:10.1109/IEEESTD.2022.9726144. .

[84] Kamila, Manoj Kumar; Jasrotia, Sahil Singh. Ethical issues in the development of artificial intelligence: recognizing the risks. International Journal of Ethics and Systems. 2023-01-01, 41: 45–63. ISSN 2514-9369. S2CID 259614124. doi:10.1108/IJOES-05-2023-0107.

[WiredMS-85] Thurm, Scott. Microsoft Calls For Federal Regulation of Facial Recognition. Wired. 2018-07-13 [2019-01-10]. （原始内容存档于2019-05-09）.

[86] Piper, Kelsey. Should we make our most powerful AI models open source to all?. Vox. 2024-02-02 [2024-04-07] （英语）.

[87] Vincent, James. OpenAI co-founder on company's past approach to openly sharing research: "We were wrong". The Verge. 2023-03-15 [2024-04-07]. （原始内容存档于2023-03-17）（英语）.

[88] Stack Overflow Will Charge AI Giants for Training Data. WIRED. 2023-04-28 [2025-04-03].

[89] Open source devs say AI crawlers dominate traffic, forcing blocks on entire countries. Ars Technica. 2025-03-25 [2025-04-03].

[90] AI bots strain Wikimedia as bandwidth surges 50%. Ars Technica. 2025-04-02 [2025-04-03].

[91] von Eschenbach, Warren J. Transparency and the Black Box Problem: Why We Do Not Trust AI. Philosophy & Technology. 2021-12-01, 34 (4): 1607–1622. ISSN 2210-5441. doi:10.1007/s13347-021-00477-0 （英语）.

[92] Lund, Brady; Orhan, Zeynep; Mannuru, Nishith Reddy; Bevara, Ravi Varma Kumar; Porter, Brett; Vinaih, Meka Kasi; Bhaskara, Padmapadanand. Standards, frameworks, and legislation for artificial intelligence (AI) transparency. AI and Ethics. 2025-01-29, 5 (4): 3639–3655. ISSN 2730-5961. doi:10.1007/s43681-025-00661-4 （英语）.

[93] Inside The Mind Of A.I. 互联网档案馆的存檔，存档日期2021-08-10. – Cliff Kuang interview

[94] What Is AI Interpretability? | IBM. www.ibm.com. 2024-10-08 [2025-07-03] （英语）.

[95] Li, Fan; Ruijs, Nick; Lu, Yuan. Ethics & AI: A Systematic Review on Ethical Concerns and Related Strategies for Designing with AI in Healthcare. AI. 2022-12-31, 4 (1): 28–53. ISSN 2673-2688. doi:10.3390/ai4010003  （英语）.

[96] Shabankareh, Mohammadjavad; Khamoushi Sahne, Seyed Sina; Nazarian, Alireza; Foroudi, Pantea. The impact of AI perceived transparency on trust in AI recommendations in healthcare applications. Asia-Pacific Journal of Business Administration. 2025-01-01,. ahead-of-print (ahead-of-print). ISSN 1757-4331. doi:10.1108/APJBA-12-2024-0690.

[97] Xu, Hanhui; Shuttleworth, Kyle Michael James. Medical artificial intelligence and the black box problem: a view based on the ethical principle of "do no harm". Intelligent Medicine. 2024-02-01, 4 (1): 52–57. ISSN 2667-1026. doi:10.1016/j.imed.2023.08.001.

[98] Howard, Ayanna. The Regulation of AI – Should Organizations Be Worried? | Ayanna Howard. MIT Sloan Management Review. 2019-07-29 [2019-08-14]. （原始内容存档于2019-08-14）.

[99] Trust in artificial intelligence – A five country study (PDF). KPMG. March 2021 [2023-10-06]. （原始内容存档 (PDF)于2023-10-01）.

[DeloitteGDPR-100] Bastin, Roland; Wantz, Georges. The General Data Protection Regulation Cross-industry innovation (PDF). Inside magazine. Deloitte. June 2017 [2019-01-10]. （原始内容存档 (PDF)于2019-01-10）.

[101] UN artificial intelligence summit aims to tackle poverty, humanity's 'grand challenges'. UN News. 2017-06-07 [2019-07-26]. （原始内容存档于2019-07-26）.

[102] Artificial intelligence – Organisation for Economic Co-operation and Development. www.oecd.org. [2019-07-26]. （原始内容存档于2019-07-22）.

[103] Anonymous. The European AI Alliance. Digital Single Market – European Commission. 2018-06-14 [2019-07-26]. （原始内容存档于2019-08-01）.

[104] European Commission High-Level Expert Group on AI. Policy and investment recommendations for trustworthy Artificial Intelligence. Shaping Europe's digital future – European Commission. 2019-06-26 [2020-03-16]. （原始内容存档于2020-02-26）（英语）.

[105] Fukuda-Parr, Sakiko; Gibbons, Elizabeth. Emerging Consensus on 'Ethical AI': Human Rights Critique of Stakeholder Guidelines. Global Policy. July 2021, 12 (S6): 32–44. ISSN 1758-5880. doi:10.1111/1758-5899.12965  （英语）.

[106] EU Tech Policy Brief: July 2019 Recap. Center for Democracy & Technology. 2019-08-02 [2019-08-09]. （原始内容存档于2019-08-09）.

[107] Curtis, Caitlin; Gillespie, Nicole; Lockey, Steven. AI-deploying organizations are key to addressing 'perfect storm' of AI risks. AI and Ethics. 2022-05-24, 3 (1): 145–153. ISSN 2730-5961. PMC 9127285  . PMID 35634256. doi:10.1007/s43681-022-00163-7 （英语）.

[:2-108] 108.0 ^108.1 ^108.2 EU AI Act: first regulation on artificial intelligence. European Parliament. 2023-08-06 [2025-07-15] （英语）.

[:3-109] 109.0 ^109.1 AI Act enters into force. European Commission. 2024-08-01 [2025-07-15] （英语）.

[110] Ciaramella, Alberto; Ciaramella, Marco. Introduction to Artificial Intelligence: from data analysis to generative AI. Intellisemantic Editions. 2024: 217. ISBN 978-88-947876-0-3.

[Spindler-20232-111] 引用错误：没有为名为Spindler-20232的参考文献提供内容

[112] Challen, Robert; Denny, Joshua; Pitt, Martin; Gompels, Luke; Edwards, Tom; Tsaneva-Atanasova, Krasimira. Artificial intelligence, bias and clinical safety. BMJ Quality & Safety. March 2019, 28 (3): 231–237. ISSN 2044-5415. PMC 6560460  . PMID 30636200. doi:10.1136/bmjqs-2018-008370  （英语）.

[Metzinger2-113] 113.0 ^113.1 Thomas Metzinger. Artificial Suffering: An Argument for a Global Moratorim on Synthetic Phenomenology. Journal of Artificial Intelligence and Consciousness. February 2021, 8: 43–66. S2CID 233176465. doi:10.1142/S270507852150003X  .

[Edelman2-114] 114.0 ^114.1 Agarwal A, Edelman S. Functionally effective conscious AI without suffering. Journal of Artificial Intelligence and Consciousness. 2020, 7: 39–50. S2CID 211096533. arXiv:2002.05256  . doi:10.1142/S2705078520300030.

[:22-115] Roose, Kevin. If A.I. Systems Become Conscious, Should They Have Rights?. The New York Times. 2025-04-24 [2025-04-24]. ISSN 0362-4331 （美国英语）.

[116] Birch, Jonathan. Animal sentience and the precautionary principle. Animal Sentience. 2017-01-01, 2 (16) [2024-07-08]. ISSN 2377-7478. doi:10.51291/2377-7478.1200  . （原始内容存档于2024-08-11）.

[117] Macrae, Carl. Learning from the Failure of Autonomous and Intelligent Systems: Accidents, Safety, and Sociotechnical Sources of Risk. Risk Analysis. September 2022, 42 (9): 1999–2025. Bibcode:2022RiskA..42.1999M. ISSN 0272-4332. PMID 34814229. doi:10.1111/risa.13850 （英语）.

[118] Chalmers, David. Could a Large Language Model be Conscious?. March 2023. arXiv:2303.07103v1  [Science Computer Science].

[119] Edwards, Benj. Anthropic hires its first "AI welfare" researcher. Ars Technica. 2024-11-11 [2025-04-24] （英语）.

[120] Wiggers, Kyle. Anthropic is launching a new program to study AI 'model welfare'. TechCrunch. 2025-04-24 [2025-04-27] （美国英语）.

[121] Shulman, Carl; Bostrom, Nick. Sharing the World with Digital Minds (PDF). Rethinking Moral Status. August 2021: 306–326. ISBN 978-0-19-289407-6. doi:10.1093/oso/9780192894076.003.0018.

[122] Fisher, Richard. The intelligent monster that you should let eat you. BBC News. 2020-11-13 [2021-02-12] （英语）.

[MWZ-123] 123.0 ^123.1 Joseph Weizenbaum, quoted in McCorduck 2004，第356, 374–376頁

[124] Kaplan, Andreas; Haenlein, Michael. Siri, Siri, in my hand: Who's the fairest in the land? On the interpretations, illustrations, and implications of artificial intelligence. Business Horizons. January 2019, 62 (1): 15–25. S2CID 158433736. doi:10.1016/j.bushor.2018.08.004.

[Weizenbaum's_critique-125] 
Weizenbaum, Joseph. Computer Power and Human Reason. San Francisco: W.H. Freeman & Company. 1976. ISBN 978-0-7167-0464-5.

Template:McCorduck 2004, pp. 132–144

[126] Weizenbaum, Joseph. Computer Power and Human Reason. San Francisco: W.H. Freeman & Company. 1976. ISBN 978-0-7167-0464-5.

[127] Template:McCorduck 2004, pp. 132–144

[126] Davies, Alex. Google's Self-Driving Car Caused Its First Crash. Wired. 2016-02-29 [2019-07-26]. （原始内容存档于2019-07-07）. 参数|magazine=与模板{{cite news}}不匹配（建议改用{{cite magazine}}或|newspaper=） (帮助)

[127] Levin, Sam; Wong, Julia Carrie. Self-driving Uber kills Arizona woman in first fatal crash involving pedestrian. The Guardian. 2018-03-19 [2019-07-26]. （原始内容存档于2019-07-26）.

[128] Who is responsible when a self-driving car has an accident?. Futurism. 2018-01-30 [2019-07-26]. （原始内容存档于2019-07-26）.

[129] Autonomous Car Crashes: Who – or What – Is to Blame?. Knowledge@Wharton. Law and Public Policy (Radio Business North America Podcasts). [2019-07-26]. （原始内容存档于2019-07-26）.

[130] Delbridge, Emily. Driverless Cars Gone Wild. The Balance. [2019-05-29]. （原始内容存档于2019-05-29）.

[131] Stilgoe, Jack, Who Killed Elaine Herzberg? , Who's Driving Innovation? (Cham: Springer International Publishing), 2020: 1–6 [2020-11-11], ISBN 978-3-030-32319-6, S2CID 214359377, doi:10.1007/978-3-030-32320-2_1, （原始内容存档于2021-03-18）（英语）

[132] Maxmen, Amy. Self-driving car dilemmas reveal that moral choices are not universal. Nature. October 2018, 562 (7728): 469–470. Bibcode:2018Natur.562..469M. PMID 30356197. doi:10.1038/d41586-018-07135-0  .

[133] Regulations for driverless cars. GOV.UK. [2019-07-26]. （原始内容存档于2019-07-26）.

[134] Automated Driving: Legislative and Regulatory Action – CyberWiki. cyberlaw.stanford.edu. [2019-07-26]. （原始内容存档于2019-07-26）.

[135] Autonomous Vehicles | Self-Driving Vehicles Enacted Legislation. www.ncsl.org. [2019-07-26]. （原始内容存档于2019-07-26）.

[136] Etzioni, Amitai; Etzioni, Oren. Incorporating Ethics into Artificial Intelligence. The Journal of Ethics. 2017-12-01, 21 (4): 403–418. ISSN 1572-8609. S2CID 254644745. doi:10.1007/s10892-017-9252-2 （英语）.

[137] Call for debate on killer robots 互联网档案馆的存檔，存档日期2009-08-07., By Jason Palmer, Science and technology reporter, BBC News, 8/3/09.

[Mick170209-138] 138.0 ^138.1 Science New Navy-funded Report Warns of War Robots Going "Terminator" 互联网档案馆的存檔，存档日期2009-07-28., by Jason Mick (Blog), dailytech.com, February 17, 2009.

[engadget.com-139] 139.0 ^139.1 Navy report warns of robot uprising, suggests a strong moral compass 互联网档案馆的存檔，存档日期2011-06-04., by Joseph L. Flatley engadget.com, Feb 18th 2009.

[140] AAAI Presidential Panel on Long-Term AI Futures 2008–2009 Study 互联网档案馆的存檔，存档日期2009-08-28., Association for the Advancement of Artificial Intelligence, Accessed 7/26/09.

[141] United States. Defense Innovation Board. AI principles: recommendations on the ethical use of artificial intelligence by the Department of Defense. OCLC 1126650738.

[142] Umbrello, Steven; Torres, Phil; De Bellis, Angelo F. The future of war: could lethal autonomous weapons make conflict more ethical? . AI & Society. March 2020, 35 (1): 273–282 [2020-11-11]. ISSN 0951-5666. S2CID 59606353. doi:10.1007/s00146-019-00879-x. hdl:2318/1699364. （原始内容存档于2021-01-05）（英语）.

[143] Jamison, Miles. DARPA Launches Ethics Program for Autonomous Systems. executivegov.com. 2024-12-20 [2025-01-02] （美国英语）.

[144] DARPA's ASIMOV seeks to develop Ethical Standards for Autonomous Systems. Space Daily. 2024-12-20 [2025-01-02].

[145] Hellström, Thomas. On the moral responsibility of military robots. Ethics and Information Technology. June 2013, 15 (2): 99–107. S2CID 15205810. doi:10.1007/s10676-012-9301-2. ProQuest 1372020233.

[146] Mitra, Ambarish. We can train AI to identify good and evil, and then use it to teach us morality. Quartz. 2018-04-05 [2019-07-26]. （原始内容存档于2019-07-26）.

[147] Dominguez, Gabriel. South Korea developing new stealthy drones to support combat aircraft. The Japan Times. 2022-08-23 [2023-06-14].

[148] AI Principles. Future of Life Institute. 2017-08-11 [2019-07-26]. （原始内容存档于2017-12-11）.

[theatlantic.com-149] 149.0 ^149.1 Zach Musgrave and Bryan W. Roberts. Why Artificial Intelligence Can Too Easily Be Weaponized – The Atlantic. The Atlantic. 2015-08-14 [2017-03-06]. （原始内容存档于2017-04-11）.

[150] Cat Zakrzewski. Musk, Hawking Warn of Artificial Intelligence Weapons. WSJ. 2015-07-27 [2017-08-04]. （原始内容存档于2015-07-28）.

[151] Potential Risks from Advanced Artificial Intelligence. Open Philanthropy. 2015-08-11 [2024-04-07] （美国英语）.

[:023-152] 152.0 ^152.1 Bachulska, Alicja; Leonard, Mark; Oertel, Janka. The Idea of China: Chinese Thinkers on Power, Progress, and People (EPUB). Berlin, Germany: European Council on Foreign Relations. 2024-07-02 [2024-07-22]. ISBN 978-1-916682-42-9. （原始内容存档于2024-07-17）.

[153] GGE on lethal autonomous weapons systems. Digital Watch Observatory. 2025-11-27 [2026-04-26].

[154] Statements at the First 2025 GGE LAWS Session. APILS. 2025-03-09 [2026-04-26].

[reg-155] Brandon Vigliarolo. International military AI summit ends with 60-state pledge. www.theregister.com. [2023-02-17] （英语）.

[NYT-2009-156] 156.0 ^156.1 Markoff, John. Scientists Worry Machines May Outsmart Man. The New York Times. 2009-07-25 [2017-02-24]. （原始内容存档于2017-02-25）.

[Muehlhauser,_Luke_2012-157] Muehlhauser, Luke, and Louie Helm. 2012. "Intelligence Explosion and Machine Ethics" 互联网档案馆的存檔，存档日期2015-05-07.. In Singularity Hypotheses: A Scientific and Philosophical Assessment, edited by Amnon Eden, Johnny Søraker, James H. Moor, and Eric Steinhart. Berlin: Springer.

[Bostrom,_Nick_2003-158] Bostrom, Nick. 2003. "Ethical Issues in Advanced Artificial Intelligence" 互联网档案馆的存檔，存档日期2018-10-08.. In Cognitive, Emotive and Ethical Aspects of Decision Making in Humans and in Artificial Intelligence, edited by Iva Smit and George E. Lasker, 12–17. Vol. 2. Windsor, ON: International Institute for Advanced Studies in Systems Research / Cybernetics.

[159] Bostrom, Nick. Superintelligence: paths, dangers, strategies. Oxford, United Kingdom: Oxford University Press. 2017. ISBN 978-0-19-967811-2.

[160] Umbrello, Steven; Baum, Seth D. Evaluating future nanotechnology: The net societal impacts of atomically precise manufacturing. Futures. 2018-06-01, 100: 63–73 [2020-11-29]. ISSN 0016-3287. S2CID 158503813. doi:10.1016/j.futures.2018.04.007. hdl:2318/1685533  . （原始内容存档于2019-05-09）（英语）.

[161] Yudkowsky, Eliezer. 2011. "Complex Value Systems in Friendly AI" 互联网档案馆的存檔，存档日期2015-09-29.. In Schmidhuber, Thórisson, and Looks 2011, 388–393.

[162] Russell, Stuart. Human Compatible: Artificial Intelligence and the Problem of Control. United States: Viking. 2019-10-08. ISBN 978-0-525-55861-3. OCLC 1083694322.

[163] Yampolskiy, Roman V. Unpredictability of AI: On the Impossibility of Accurately Predicting All Actions of a Smarter Agent . Journal of Artificial Intelligence and Consciousness. 2020-03-01, 07 (1): 109–118 [2020-11-29]. ISSN 2705-0785. S2CID 218916769. doi:10.1142/S2705078520500034. （原始内容存档于2021-03-18）.

[164] Wallach, Wendell; Vallor, Shannon, Moral Machines: From Value Alignment to Embodied Virtue , Ethics of Artificial Intelligence (Oxford University Press), 2020-09-17: 383–412 [2020-11-29], ISBN 978-0-19-090503-3, doi:10.1093/oso/9780190905033.003.0014, （原始内容存档于2020-12-08）（英语）

[165] Umbrello, Steven. Beneficial Artificial Intelligence Coordination by Means of a Value Sensitive Design Approach. Big Data and Cognitive Computing. 2019, 3 (1): 5. doi:10.3390/bdcc3010005  . hdl:2318/1685727  （英语）.

[166] Floridi, Luciano; Cowls, Josh; King, Thomas C.; Taddeo, Mariarosaria. How to Design AI for Social Good: Seven Essential Factors. Science and Engineering Ethics. 2020, 26 (3): 1771–1796. ISSN 1353-3452. PMC 7286860  . PMID 32246245. doi:10.1007/s11948-020-00213-5 （英语）.

[167] Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations. Meta.com. [2024-12-06].

[:0-168] 168.0 ^168.1 Šekrst, Kristina; McHugh, Jeremy; Cefalu, Jonathan Rodriguez. AI Ethics by Design: Implementing Customizable Guardrails for Responsible AI Development. 2024. arXiv:2411.14442  [cs.CY].

[169] Nvidia NeMo Guardrails. Nvidia. [2024-12-06].

[170] Inan, Hakan; Upasani, Kartikeya; Chi, Jianfeng; Rungta, Rashi; Iyer, Krithika; Mao, Yuning; Tontchev, Michael; Hu, Qing; Fuller, Brian; Testuggine, Davide; Khabsa, Madian. Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations. 2023. arXiv:2312.06674  [cs.CL].

[171] Dong, Yi; Mu, Ronghui; Jin, Gaojie; Qi, Yi; Hu, Jinwei; Zhao, Xingyu; Meng, Jie; Ruan, Wenjie; Huang, Xiaowei. Building Guardrails for Large Language Models. 2024. arXiv:2402.01822  [cs].

[172] Evans, Woody. Posthuman Rights: Dimensions of Transhuman Worlds. Teknokultura. 2015, 12 (2): 373–384. doi:10.5209/rev_TK.2015.v12.n2.49072.

[173] Fiegerman, Seth. Facebook, Google, Amazon create group to ease AI concerns. CNNMoney. 2016-09-28 [2020-08-18]. （原始内容存档于2020-09-17）.

[174] Slota, Stephen C.; Fleischmann, Kenneth R.; Greenberg, Sherri; Verma, Nitin; Cummings, Brenna; Li, Lan; Shenefiel, Chris. Locating the work of artificial intelligence ethics . Journal of the Association for Information Science and Technology. 2023, 74 (3): 311–322 [2023-07-21]. ISSN 2330-1635. S2CID 247342066. doi:10.1002/asi.24638. （原始内容存档于2024-09-25）（英语）.

[175] Ethics guidelines for trustworthy AI. Shaping Europe's digital future – European Commission. European Commission. 2019-04-08 [2020-02-20]. （原始内容存档于2020-02-20）（英语）.

[176] White Paper on Artificial Intelligence – a European approach to excellence and trust | Shaping Europe's digital future. 2020-02-19 [2021-03-18]. （原始内容存档于2021-03-06）.

[177] Implementation Timeline | EU Artificial Intelligence Act. [2025-10-02] （美国英语）.

[178] OECD AI Policy Observatory. [2021-03-18]. （原始内容存档于2021-03-08）.

[179] Recommendation on the Ethics of Artificial Intelligence. UNESCO. 2021.

[180] UNESCO member states adopt first global agreement on AI ethics. Helsinki Times. 2021-11-26 [2023-04-26]. （原始内容存档于2024-09-25）（英国英语）.

[181] 《新一代人工智能伦理规范》发布. 中华人民共和国科学技术部. 2021-09-25.

[182] 中国关于加强人工智能伦理治理的立场文件. 中华人民共和国外交部. 2022-11-16.

[183] The Obama Administration's Roadmap for AI Policy. Harvard Business Review. 2016-12-21 [2021-03-16]. ISSN 0017-8012. （原始内容存档于2021-01-22）.

[184] Accelerating America's Leadership in Artificial Intelligence – The White House. trumpwhitehouse.archives.gov. [2021-03-16]. （原始内容存档于2021-02-25）.

[185] Request for Comments on a Draft Memorandum to the Heads of Executive Departments and Agencies, "Guidance for Regulation of Artificial Intelligence Applications". Federal Register. 2020-01-13 [2020-11-28]. （原始内容存档于2020-11-25）.

[186] Sen. Thune, John. Text – S.3312 – 118th Congress (2023–2024): Artificial Intelligence Research, Innovation, and Accountability Act of 2024. www.congress.gov. 2024-12-18 [2025-05-25].

[187] CCC Offers Draft 20-Year AI Roadmap; Seeks Comments. HPCwire. 2019-05-14 [2019-07-22]. （原始内容存档于2021-03-18）.

[188] Request Comments on Draft: A 20-Year Community Roadmap for AI Research in the US » CCC Blog. 2019-05-13 [2019-07-22]. （原始内容存档于2019-05-14）.

[189] （俄語） Интеллектуальные правила 互联网档案馆的存檔，存档日期2021-12-30. — Kommersant, 2021-11-25

[190] Grace, Katja; Salvatier, John; Dafoe, Allan; Zhang, Baobao; Evans, Owain. When Will AI Exceed Human Performance? Evidence from AI Experts. 2018-05-03. arXiv:1705.08807  [cs.AI].

[191] China wants to shape the global future of artificial intelligence. MIT Technology Review. [2020-11-29]. （原始内容存档于2020-11-20）（英语）.

[192] Adam, David. Future of Humanity Institute shuts: what's next for 'deep future' research? . Nature. 2024-04-26, 629 (8010): 16–17. Bibcode:2024Natur.629...16A. PMID 38671273. doi:10.1038/d41586-024-01229-8 （英语）.

[193] Floridi, Luciano; Cowls, Josh; Beltrametti, Monica; Chatila, Raja; Chazerand, Patrice; Dignum, Virginia; Luetge, Christoph; Madelin, Robert; Pagallo, Ugo; Rossi, Francesca; Schafer, Burkhard. AI4People—An Ethical Framework for a Good AI Society: Opportunities, Risks, Principles, and Recommendations. Minds and Machines. 2018-12-01, 28 (4): 689–707. ISSN 1572-8641. PMC 6404626  . PMID 30930541. doi:10.1007/s11023-018-9482-5 （英语）.

[194] AI Governance. Oxford Martin School. [2025-02-19] （英语）.

[195] Joanna J. Bryson. WIRED. [2023-01-13]. （原始内容存档于2023-03-15）.

[196] New Artificial Intelligence Research Institute Launches. 2017-11-20 [2021-02-21]. （原始内容存档于2020-09-18）（美国英语）.

[197] James J. Hughes; LaGrandeur, Kevin (编). Surviving the machine age: intelligent technology and the transformation of human work. Cham, Switzerland: Palgrave Macmillan Cham. 2017-03-15. ISBN 978-3-319-51165-8. OCLC 976407024.

[198] Danaher, John. Automation and utopia: human flourishing in a world without work. Cambridge, Massachusetts: Harvard University Press. 2019. ISBN 978-0-674-24220-3. OCLC 1114334813.

[199] TUM Institute for Ethics in Artificial Intelligence officially opened. www.tum.de. [2020-11-29]. （原始内容存档于2020-12-10）（英语）.

[200] Communications, Paul Karoff SEAS. Harvard works to embed ethics in computer science curriculum. Harvard Gazette. 2019-01-25 [2023-04-06]. （原始内容存档于2024-09-25）（美国英语）.

[201] Lee, Jennifer. When Bias Is Coded Into Our Technology. NPR. 2020-02-08 [2021-12-22] （英语）.

[202] How one conference embraced diversity. Nature. 2018-12-12, 564 (7735): 161–162. PMID 31123357. S2CID 54481549. doi:10.1038/d41586-018-07718-x  （英语）.

[203] Roose, Kevin. The 2020 Good Tech Awards. The New York Times. 2020-12-30 [2021-12-21]. ISSN 0362-4331 （美国英语）.

[204] Lodge, Paul. Leibniz's Mill Argument Against Mechanical Materialism Revisited. Ergo: An Open Access Journal of Philosophy. 2014, 1 (20201214). ISSN 2330-4014. doi:10.3998/ergo.12405314.0001.003  . hdl:2027/spo.12405314.0001.003.

[205] Bringsjord, Selmer; Govindarajulu, Naveen Sundar, Artificial Intelligence, Zalta, Edward N.; Nodelman, Uri (编), The Stanford Encyclopedia of Philosophy Summer 2020, Metaphysics Research Lab, Stanford University, 2020 [2023-12-08], （原始内容存档于2022-03-08）

[206] Kulesz, O. (2018). "Culture, Platforms and Machines". UNESCO, Paris.

[207] Jr, Henry C. Lucas. Information Technology and the Productivity Paradox: Assessing the Value of Investing in IT. Oxford University Press. 1999-04-29 [2024-02-21]. ISBN 978-0-19-802838-3. （原始内容存档于2024-09-25）（英语）.

[Asimov2008-208] Asimov, Isaac. I, Robot. New York: Bantam. 2008. ISBN 978-0-553-38256-3.

[lacuna-209] Bryson, Joanna; Diamantis, Mihailis; Grant, Thomas. Of, for, and by the people: the legal lacuna of synthetic persons. Artificial Intelligence and Law. September 2017, 25 (3): 273–291. doi:10.1007/s10506-017-9214-9  .

[principles-210] Principles of robotics. UK's EPSRC. September 2010 [2019-01-10]. （原始内容存档于2018-04-01）.

[211] Yudkowsky, Eliezer. Why We Need Friendly AI. 3 laws unsafe. July 2004. （原始内容存档于2012-05-24）.

[212] Aleksander, Igor. Partners of Humans: A Realistic Assessment of the Role of Robots in the Foreseeable Future . Journal of Information Technology. March 2017, 32 (1): 1–9 [2024-02-21]. ISSN 0268-3962. S2CID 5288506. doi:10.1057/s41265-016-0032-4. （原始内容存档于2024-02-21）（英语）.

[213] Evolving Robots Learn To Lie To Each Other 互联网档案馆的存檔，存档日期2009-08-28., Popular Science, August 18, 2009

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]

[56]

[57]

[58]

[59]

[60]

[61]

[62]

[63]

[64]

[65]

[66]

[67]

[68]

[69]

[70]

[71]

[72]

[73]

[74]

[75]

[76]

[77]

[78]

[79]

[80]

[81]

[82]

[83]

[84]

[85]

[86]

[87]

[88]

[89]

[90]

[91]

[92]

[93]

[94]

[95]

[96]

[97]

[98]

[99]

[100]