Ethics of Artificial Intelligence and Robotics

First published Thu Apr 30, 2020; substantive revision Fri Mar 27, 2026

After around 70 years of development, it is now clear that AI and robotics have a substantial impact on the world, even if the exact nature and depth of that impact remain unclear. What impact AI and robotics should have also remains a largely unresolved question, and that is the main question of this article. In addition, AI and robotics have given us occasion to reflect theoretically on fundamental philosophical and ethical issues. The main debates that currently seem to have practical impact or theoretical relevance are about privacy, human autonomy, automated decisions, human-machine interaction, employment and the impact on society as a whole – plus the more abstract notions of autonomy, agency and superintelligence. For each of these issues, we outline the existing positions and arguments, and how they hang together with other issues. Overall, the ethics of AI and robotics should mainly allow us to understand and evaluate techno-social development, thus enabling decisions about which developments to avoid, but also enabling a positive vision for a world with AI and robotics that is worth wanting.

1. Introduction

1.1 Scope: Ethics of AI & Robotics

The design and use of AI and robotics have raised many ethical issues, some specific to these technologies, some more general. These often go back to “concerns” of various sorts – a typical response to new technologies. Many such concerns turn out to be rather quaint (such as that trains are too fast for souls), some are predictably wrong when they suggest that the technology will fundamentally change humans (telephones will destroy personal communication, writing will destroy memory), some are broadly correct but only moderately relevant (digital technology will destroy industries), and some are broadly correct and deeply relevant (cars will change towns, smartphones will generate dependence). The task of an article such as this is to explain what philosophical research has done to analyse the issues, to move from vague fears to well-understood problems, and to deflate the non-issues – while remaining cautious in judging what the important issues really are.

The ethics of AI and robotics is particularly difficult, since it involves an understanding of normative ethics and metaethics, an understanding of the technologies, plus expertise in the many areas of social impact. There are discussions of AI ethics not only within philosophy and computer science, but also in other academic disciplines, especially in political science, economics, media studies, education, environmental studies and law – and in large spheres of society outside academia. This seems to indicate that we need a better understanding of which issues should be treated in philosophy, and a better division of labour with other disciplines. Within philosophy, the fields that have contributed to the debates about AI and robotics have widened from philosophy of mind and normative ethics to metaethics, philosophy of science, epistemology, philosophy of language, and political philosophy. Our topic gives us pause to reflect on the relation between philosophical and non-philosophical approaches to AI ethics, and on fundamental concepts of philosophy itself – the latter activity is sometimes called “AI philosophy” (Müller 2025b; Müller and Löhr forthcoming).

AI somehow gets closer to our skin than other technologies: This has to do with the fact that the project of AI is to create machines that have features which are central to how we humans see ourselves, namely as feeling, thinking, intelligent beings. The classic notion of “artificial intelligent agency” involves a sequence of sense-model-plan-act, but current AI applications also include perception, text analysis, natural language processing (NLP), logical reasoning, game-playing, decision (support) systems, data analytics, predictive analytics, as well as autonomous vehicles, humanoid robots, and other forms of robotics (P. Stone et al. 2021). In the following it will become clear that it is often useful to think of AI as decision-machines, perhaps as implementations of rational choice (cf. S. Russell 2019, 23; S. Russell and Norvig 2020, 57).

Which issues are discussed here, and how are they ordered? Since the SEP is a philosophical encyclopedia, this article is ordered by ethical problems, while other surveys focus on policy and compliance (Corrêa et al. 2023), or on the safety of certain technologies. One could thus think of the landscape of AI ethics as a matrix of ethical issues vs. technologies. What is considered an “important problem” for AI ethics often has to do with the context people come from: privacy protection is central for individual autonomy; racial discrimination, economic injustice and the discrimination against women are more prominent in certain societies; environmental damage and superintelligence seem more central to people in affluent and peaceful societies. We aim for a fairly wide understanding here, based on both a) societal relevance and b) theoretical interest.

There are two perspectives that often run under the heading of “ethics”, but will play a less central role here. One of these is “AI policy”, which specifies societal structures and measures to handle AI, including regulation and standardisation. It appears, however, that the debates on policy are now largely divorced from philosophy and more of a political, legal and technical nature – even though they often rely on ethical judgments, which was even more obvious in the era of “AI Principles” (ca. 2015–20). In this article, policy matters are discussed in a short separate section (2.10), rather than with each ethical issue.

Another prominent perspective is to discuss AI ethics in terms of risks of AI and of technical safety from risk (e.g. Brundage et al. 2018; Bengio et al. 2024; Narayanan and Kapoor 2024; Bengio et al. 2025 [Other Internet Resources]; Hendrycks 2025). These discussions aim to find out which consequences are probable, and how to avoid negative consequences through technical and societal means. While many of these issues are central ethical issues, it seems clear that the discussion of AI ethics should not be framed entirely in terms of “risk”: a) some ethical considerations are not consequentialist; b) there is ethical discussion about how to evaluate consequences; c) technical discussions about avoiding risks are not part of ethics; d) some negative consequences are actual, not risks; and e) the discussion of AI ethics should not be framed only in negative terms, but should generate a positive vision of a future worth wanting. Technical AI safety (sometimes called “alignment”) is a societally highly important task that is closely related to AI ethics, but is not identical to it. Large AI technology firms typically had some form of “ethics” or “safety” committee, but most were dismantled in the 2020s. Problems of risk will feature in the discussions of most ethical issues.

Finally, for a problem to qualify as a “problem for ethics”, we would require that it is unclear what is ethical in the particular case, thus excluding things that are simply “ethically problematic” or “unethical” (e.g. fraud and murder). Having said that, technical systems in a social context (techno-social systems) always make some human decisions and some societal developments more likely than others, e.g. they provide a “nudge”, they have affordances, or they change the epistemic situation. For example, an LLM might make it possible for a person to build a virus, to hack a database, to commit fraud, or to impersonate someone else (OpenAI 2025). It is thus within the scope of this article to consider where AI and robotics enable such behaviour or a particular societal development.

1.2 Techno-social Background

To some extent, the field of “applied ethics” is driven by technology: When a new kind of system comes out, new problems appear. Though prediction has its risks, it seems clear that if the technical progress of AI continues the way it does now (early 2026), we are looking at a major technical, societal and environmental change that is as fundamental as the one generated by the industrial revolution, with its move from human- and horse-power to water/steam/electricity.

In the 20th century, AI was usually understood as a research programme where the cognitive sciences would develop models of human intelligence as computation over meaningful symbols, and the computer sciences would implement these models on digital hardware, thus both testing the models and achieving a general form of artificial intelligence. This programme is evident in Searle’s notion of “strong AI”: “computers given the right programs can be literally said to understand and have other cognitive states” (Searle 1980, 417). It was later called “classical AI”, and it was in some competition with “connectionist AI”, which proposed to focus on modelling the brain in “neural networks” rather than in its functional architecture. Both versions of AI ran into significant problems during the “AI winter(s)”, ca. 1975–1995, and they spun off some specialised technical disciplines (e.g. image pattern recognition or robotics) that often avoided the tainted name “AI”.

With the advent of more successful machine learning (ML) systems in the 2010s, mainly with multi-layered (“deep”) neural networks (LeCun, Bengio, and Hinton 2015; Schmidhuber 2015), and especially with popular generative AI, such as large language models (LLMs) around 2020, the use of the term “AI” has broadened massively. The major change towards ML has to do with algorithmic progress (e.g. generative adversarial networks, or transformer models) and the fact that the cost of computing power and storage has fallen, while the investment has increased. The resulting exponential gains in computing power and data storage have led to the ability to train models on massive data (essentially all available data), and to generate very large models (typically at a size of 20–50% of the training data). This “scaling” achieves radical improvements in ML systems, which often went from performance at 10% of human level to beyond human level in a few years. Current AI trends are tracked in several places; one prominent source is the HAI “AI Index” (https://hai.stanford.edu/ai-index/).

The general aim of classical AI had quietly been all but given up after the AI winters, though a small group pushed for the idea that classical AI should focus directly on “Artificial General Intelligence” (AGI) – organising a niche conference series under that title since 2008. In the last 10 years, it has become common to ask once again whether we might be on a road to a general AI, now understood to be based mainly on ML. The old label “AGI” is now often used to differentiate the aim of general artificial intelligence at roughly human level from technical AI on specific problems, with the implication that something important happens at that point (this assumption drives some of the current hyper-investment into AI). It remains an open question whether AI is on course towards general intelligence (AAAI 2025, 58–63; Bengio et al. 2025) – this question matters to ethics in many ways, e.g. for suggestions that minor AI quibbles do not matter compared to the enormous benefits of AGI, or that the arguments for existential risk from superintelligence become more urgent the closer we move to AGI.

Given that AI runs on computers, there is a question about which role quantum computing could play: qubits in superposition allow computation that can be so much more efficient on resources (e.g. on time) that some functions that were practically impossible to compute, or “intractable”, become practically possible to compute (e.g. taking seconds rather than centuries) – while the set of quantum-computable functions is not larger than that of the Turing-computable functions (Deutsch 1985). This efficiency gain has enormous practical relevance because public key encryption, and thus most of computer security and identification, relies on “trap-door functions” that are practically easy to compute in one direction but very hard to compute in the opposite direction (e.g. “multiply these two prime numbers” vs. “find out which two prime numbers, if multiplied, would result in this number”). The security problem of quantum computing, if it were achieved on a practical level, is thus not specific to AI, but it would affect AI, too.
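
To illustrate the trap-door asymmetry, here is a minimal sketch in Python; the primes, function names and the naive factoring method are merely illustrative (real public key encryption uses primes of hundreds of digits and far more refined mathematics):

```python
# A minimal sketch of the "trap-door" asymmetry; numbers are toy-sized.

def multiply(p: int, q: int) -> int:
    """The easy direction: a single machine operation."""
    return p * q

def factor(n: int) -> tuple[int, int]:
    """The hard direction, here by naive trial division. Its cost grows
    roughly with the square root of n, which is hopeless for the
    hundreds-of-digits moduli used in practice -- the gap that a
    practical quantum computer running Shor's algorithm would close."""
    d = 3
    while d * d <= n:
        if n % d == 0:
            return d, n // d
        d += 2
    return n, 1  # n itself is prime

p, q = 999_983, 1_000_003   # two (small) primes
n = multiply(p, q)          # instant
print(factor(n))            # already takes measurably longer
```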

While AI is entirely software, robots are physical machines that are subject to physical forces; they have “sensors” and they exert physical force onto the world through “actuators”, like a gripper or a turning wheel. Accordingly, autonomous cars or planes are robots, and only a very small portion of current robots is “humanoid” (human-shaped). Some robots use AI, and some do not: Typical industrial robots still blindly follow completely defined scripts with minimal sensory input and no learning or reasoning, in a maximally controlled environment. However, there is an increasing use of AI in robotics, including humanoids, mainly for sensation and action-control. Robots and AI systems can thus be seen as two overlapping sets: Systems that are only AI, systems that are only robotics, and systems that are both. We are interested in all three; the scope of this article is the union of both sets.

It is worth remembering that the ethics of AI and robotics is a very young discipline: The first publications on robot-ethics or machine ethics appeared in the early 2000s (Moor 2006), the first conference on AI ethics took place in 2012 (Müller 2014), the first books appeared on superintelligence (Bostrom 2014), machine ethics (Misselhorn 2018/22), and the control problem (S. Russell 2019). Then, the first survey articles (Müller 2020) and books appeared (Dignum 2019; Coeckelbergh 2020; Dubber, Pasquale, and Das 2020; Gordon and Nyholm 2021), as well as more policy-oriented work (Floridi et al. 2018; Taddeo and Floridi 2018; Taylor et al. 2018; Walsh 2018; Bryson 2019; Gibert 2019; Whittlestone et al. 2019). We probably still lack a well-established scope, method, or canonical works (L. E. Frank and Klincewicz 2024). Also, there is little history of AI ethics (Casiraghi 2023), and of the wider field of computer ethics or digital ethics (but see Müller 2022). Useful surveys for the ethics of robotics include (Calo, Froomkin, and Kerr 2016; Royakkers and van Est 2016; Tzafestas 2016; Coeckelbergh 2022b); a standard collection of papers is (Lin, Abney, and Jenkins 2017). Handbooks and surveys of AI ethics include (Boddington 2023; Floridi 2023b; Bullock et al. 2024; Gunkel 2024; Floridi and Taddeo 2025; Hähnel and Müller 2025; Smuha 2025), with (Hagendorff 2024) on generative AI and (Nyholm, Kasirzadeh, and Zerilli 2026) on contemporary debates.

The world has changed in the last few years. It is likely that more literature has been published on the topic of this article since the first version (early 2020) than in the entire time before. Also, papers are now published in mainstream philosophical journals that used to ignore AI and computing. The selection of the literature that is referred to here thus must be narrower and will be more prone to errors. The main ordering that was proposed in 2020, in the absence of a prior survey article or book, appears to have held up reasonably well.

A note on tone: It is characteristic of the “ethics of x” that it tends to deal with ethical problems of x, rather than with the merits of x. The result is that the ethics of AI appears “negative about AI”. This is a tendency we try to mitigate here: The point of an ethics of AI is to find out what AI should be like, and to analyse the difficulties of finding out what is the right way to act with respect to AI. So, ideally, AI ethics provides direction for “good engineering” (this comes under various names, such as “trustworthy”, “humane”, “aligned”, “reliable” or “ethical”). Thus, the societal task of AI ethics is to provide guidance for good AI design and use, for a world worth wanting (if AI is part of such a world).

2. Main Debates

2.1 Privacy & Data Protection

Privacy has several well-recognised aspects, e.g. “the right to be let alone”, control over information about oneself (Rachels 1975), privacy as an aspect of personhood, and – in English – also local and bodily privacy (Bennett and Raab 2018; Roessler 2018). The discussion about privacy and surveillance in information technology (e.g. Macnish 2017; Roessler 2017) mainly concerns personally identifiable data, i.e. the access to, and control over, that data, or a combination of both (Véliz 2024). Classic privacy studies focused on state surveillance by secret services but now include surveillance by other state agents, businesses and even individuals. The relevant information technology has changed significantly in the last decades, while regulation has been slow to respond (though there is the GDPR (2016)) – the result is a certain anarchy that is exploited by the most powerful players and that has produced significant political discussion (Véliz 2020).

2.1.1 The Digital Sphere

The digital sphere has widened greatly in recent decades: All data collection and storage is now digital, our lives are increasingly digital, there is more and more sensor technology in use that generates data about non-digital aspects of our lives, and most digital data is connected to a single Internet. In addition, much of the data is traded between agents, usually for a fee. As a result, controlling who collects which data, and who has access, is much harder in the digital world than it was in the analogue world of paper and telephone calls.

AI increases both the possibilities of intelligent data collection and the possibilities for data analysis – and it increases the value of data, since it can be used for training ML systems. For example, face recognition in photos and videos, “device fingerprinting” and a host of other techniques researched in computer science (e.g. Rocher, Hendrickx, and de Montjoye 2019) allow real-time identification, and thus profiling and searching for individual humans or their devices (Whittaker et al. 2018, 15ff.). The result is that “In this vast ocean of data, there is a frighteningly complete picture of us” (Smolan 2016, 1:01) … a remark that already seems trivial now.

The data trail we leave behind is how our “free” services are paid for – but we are not told about that data collection and the value of this new raw material, and we are manipulated into leaving ever more such data. For the “big 5” companies (Amazon, Google/Alphabet, Microsoft, Apple, Facebook/Meta), the main data-collection part of their business appears to be based on deception, exploiting human weaknesses, furthering procrastination, generating addiction, and manipulation (Harris 2016; Klenk and Jongepier 2022). The primary focus of social media, gaming, and most of the Internet in this “surveillance economy” is to gain, maintain and direct attention – and thus continued data supply. As Schneier said, “surveillance is the business model of the Internet” (Schneier 2015); this is sometimes captured in the catchword “surveillance capitalism” (Williams 2018; Zuboff 2019; Königs 2024).

2.1.2 Losing Control of Data?

One useful perspective on privacy from the “control” angle is to see it as the demand that information integrity is preserved in the relevant contexts, i.e. “contextual integrity” demands that the flow of information is appropriately controlled (Nissenbaum 2004). The problem of AI-generated images and videos of actual people may suit this approach. It has been argued that a loss of “freedom” is the characteristic feature of the AI era (Santoni de Sio 2024). This loss has caused many attempts to escape from the grasp of surveillance capitalism, e.g. in exercises of “minimalism” (Newport 2019), or through the open-source movement – but it is doubtful that present-day citizens have the autonomy needed to do so.

Surveillance systems will often reveal facts about us that we ourselves wish to suppress or are not aware of: they know more about us than we know ourselves. Even just observing online behaviour allows insights into our mental states (Burr and Cristianini 2019) and manipulation (see section 2.2 below). This has led to calls for the protection of “derived data” (Wachter and Mittelstadt 2019). In the last sentence of his bestselling book Homo Deus, Harari (2016) asks about the long-term consequences of AI: “What will happen to society, politics and daily life when non-conscious but highly intelligent algorithms know us better than we know ourselves?” (never mind whether an algorithm can “know”).

Robotic systems are beginning to play a major role in this area, esp. from the air. Together with the “Internet of things”, the “smart” systems (phone, TV, oven, lamp, virtual assistant, home, …), the “smart city” (Sennett 2018), “smart governance”, and “smart agentic AI”, they are set to become part of the data-gathering system that offers ever more detailed data in real time.

There is an “arms race” between surveillance and individual freedom, and there is an important societal question of where a good balance lies. Both “sides” use information technology, e.g. privacy can be protected by encryption, anonymity, or cryptocurrency. For some research purposes, there are privacy-preserving techniques that can largely conceal the identity of persons or groups, e.g. “differential privacy”, which adds calibrated noise to the output of queries (Dwork et al. 2006; Garfinkel 2025). While some companies sell surveillance, others sell protection from surveillance, e.g. in operating systems for computers and phones.
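
As a minimal sketch of how such noise calibration works, consider the Laplace mechanism of (Dwork et al. 2006) applied to a count query; the dataset, the function name and the value of epsilon are illustrative assumptions:

```python
# A minimal sketch of the Laplace mechanism for differential privacy.
import numpy as np

def private_count(records, predicate, epsilon=0.1):
    """Answer "how many records satisfy the predicate?" with noise
    calibrated to the query's sensitivity: a count changes by at most 1
    when any one person's record is added or removed, so adding
    Laplace(0, 1/epsilon) noise makes the released answer
    epsilon-differentially private."""
    true_count = sum(1 for r in records if predicate(r))
    sensitivity = 1.0
    return true_count + np.random.laplace(loc=0.0, scale=sensitivity / epsilon)

# The aggregate stays useful, while any one person's presence changes
# the output distribution only by a factor of at most e**epsilon.
ages = [23, 35, 45, 52, 61, 38, 29]
print(private_count(ages, lambda age: age > 40))
```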

One of the major practical difficulties is to actually enforce regulation, both on the level of the state and on the level of the individual who has a claim. They must identify the responsible legal entity, prove the action, perhaps prove intent, find a court that declares itself competent … and eventually get the court to actually enforce its decision. Well-established legal protection of rights such as consumer rights, product liability and other civil liability or protection of intellectual property rights is often missing in digital products, or hard to enforce. This means that companies with a “digital” background are used to testing their products on the consumers, without fear of liability, while defending their intellectual property rights. This “Internet Libertarianism” is sometimes taken to assume that technical solutions will take care of societal problems by themselves (Morozov 2013).

2.2 Human Autonomy & Manipulation

The ethical issues of AI in surveillance go beyond the mere accumulation of data, control of data flow and direction of attention: they include the use of information to manipulate behaviour, online and offline. Assuming that choice involves the standard “control condition” and “epistemic condition”, manipulation can target both. Human action selection is often quite far from cool rational choice, and our tendencies to procrastination and other sub-optimal behaviour can be exploited for manipulation. While efforts to manipulate behaviour are as old as humanity (Noggle 2022), they gain a new quality with AI systems (Prunkl 2024; Schneider 2025). Jaron Lanier already wrote in 2014: “When you are wearing sensors on your body all the time, such as the GPS and camera on your smartphone and constantly piping data to a megacomputer owned by a corporation that is paid by ‘advertisers’ to subtly manipulate you ... you gradually become less free.” (Lanier 2014).

With sufficient prior data, algorithms can be used to target individuals or small groups with just the kind of personalised input that is likely to influence these particular individuals. Given users’ intense interaction with data systems and the deep knowledge about individuals this provides, they are vulnerable to positive “nudges” (Thaler and Sunstein 2008), emotional manipulation, as well as negative manipulation and deception. Profit-oriented businesses exploit behavioural biases, deception, and the generation of addiction (Costa and Halpern 2019) – e.g. through “dark patterns” on web pages or in games (Mathur et al. 2019; Luguri and Strahilevitz 2021; Klenk 2022). While offline gambling and the sale of addictive substances are highly regulated, online manipulation and addiction are not. This control is increased by system lock-in and software monopolies. One detail of this loss of control is the absence of “informed consent” (Faden and Beauchamp 1986) when agreeing to “terms and conditions” of software and IT services. The culmination of this problem is reached when decisions are handed over to an AI agent, and the human does not need to be manipulated any more.

Improved AI “faking” technologies make what once was reliable evidence into unreliable evidence – this has already happened to digital photos, sound recordings and video … and it is now quite easy to create (rather than alter) “deep fake” text, photos and video material with any content desired. Sophisticated real-time interaction with persons over texting, phone or video can be faked, too. So, we cannot trust digital interaction, while we are at the same time increasingly dependent on such interaction. It has been argued that technological manipulation poses significant threats to our opportunities to live meaningful lives (Nyholm 2022) and that the basis of our epistemic practices (see the next section) for establishing belief and knowledge is threatened by this development (Rini 2020; Robert Sparrow and Flenady 2025).

2.3 Epistemic Issues: Opacity & Explainability

When an AI system makes a decision, e.g. “you are denied a credit card”, it will often be impossible for the affected person to know how the system came to this output, i.e. the system is “opaque”, or a “black box”, to that person. Furthermore, many AI systems rely on machine learning techniques in (simulated) neural networks that extract patterns from a given dataset, with or without “correct” solutions provided, i.e. supervised, semi-supervised or unsupervised learning. With these techniques, the “learning” captures patterns in the data, and these are labelled in a way that appears useful to the decision the system makes, while the programmer does not really know which patterns in the data the system has used. What this means is that the outcome is opaque to the expert programmers, too – this is standard opacity (Durán and Jongsma 2021). Sometimes there is perhaps even “deep” or “essential” opacity that cannot be removed, in principle (Humphreys 2009; Müller 2025a; Beisbart 2026).
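
A toy example may make this standard opacity concrete (the network and task are our illustrative assumptions, not drawn from the literature): every parameter of a trained network can be printed and inspected, and still the numbers do not read as reasons for any particular output.

```python
# A toy two-layer network trained on XOR: fully inspectable, yet opaque.
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0.], [1.], [1.], [0.]])          # XOR targets

W1 = rng.normal(size=(2, 8))                    # weights, layer 1
W2 = rng.normal(size=(8, 1))                    # weights, layer 2
sigmoid = lambda z: 1 / (1 + np.exp(-z))

for _ in range(10_000):                         # plain gradient descent
    h = sigmoid(X @ W1)
    out = sigmoid(h @ W2)
    d_out = (out - y) * out * (1 - out)
    grad_W2 = h.T @ d_out
    grad_W1 = X.T @ ((d_out @ W2.T) * h * (1 - h))
    W2 -= 1.0 * grad_W2
    W1 -= 1.0 * grad_W1

print(out.round(2).ravel())   # typically close to [0, 1, 1, 0]: it works
print(W1)                     # ... but these numbers are not "reasons"
```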

It is well known that epistemic conditions have an impact on normative issues. In our case, if there is opacity, then any bias in decision-making will be hard to detect. Opacity and bias are central issues in what is now sometimes called “data ethics”, “big data ethics” or “ethics of algorithms” (Floridi and Taddeo 2016; Mittelstadt et al. 2016). AI systems for automated decision support and “predictive analytics” raise “significant concerns about lack of due process, accountability, community engagement, and auditing” (Danaher 2016b; Whittaker et al. 2018, 18ff.). It is not clear that the idea of a right to a human decision or justification, as in the GDPR, is really what is needed; some authors have argued that instead we have a right to a “well-calibrated machine decision” (Huq 2020), or that we need to avoid a “testimony gap” (Robert Sparrow and Flenady 2025). Sometimes opacity and manipulation are discussed under the headings of “epistemic technology” (Alvarado 2023) or “epistemic risk”, but it remains to be seen whether notions and questions from traditional epistemology (about the conditions for justified true belief) are suitable for the many ways in which human perception and knowledge are impacted by AI technologies.

This opacity has generated attempts to outline the constraints for explainable AI (XAI) (Zednik 2021) and the role of cognitive models (Budding and Zednik 2024). What exactly “interpretable” means is under discussion, especially its relation to explanation (Sullivan 2022; Zerilli 2022) and trust (Baron 2025; Robertson 2025) – the typical distinction is between the causes that lead to a decision and the reasons that a rational agent would provide, where the latter is what counts for responsibility. There are many activities that aim to remove or remedy opacity through XAI, which is now a significant technical field (Schwalbe and Finzel 2024).

2.4 Good Decisions: Fairness & Bias

2.4.1 Decisions

As mentioned above, it can be useful to regard all AI systems as decision systems. What constitutes a “good” decision is hotly debated in the theory of rational choice (Thoma 2019). It has also been argued that some hard choices are “on a par” (Chang 2002), but that it does matter which one is chosen, because we make it “ours” for the future (Chang 2020). It is worth considering whether AI decisions or recommendations should be based on the assumption of user “preferences”. This is common terminology in IT, but it is not clear that the science that specialises in the prediction of human behaviour (psychology) makes much use of it.

Automated AI decision support systems and “predictive analytics” operate on data and produce a decision as “output”. This output may range from the relatively trivial to the highly significant: “this restaurant matches your preferences”, “bail is denied”, or “target identified and engaged”. Data analysis is often used in “predictive analytics” in business, healthcare and other fields, to foresee future developments – and as prediction becomes easier, it will also become a cheaper commodity. One use of prediction is in “predictive policing” (Meijer and Wessels 2019), which involves a threat to public liberties (Alikhademi et al. 2022) because it can take away power from the people who are predicted. It appears, however, that many of the worries about policing depend on futuristic scenarios where law enforcement foresees and punishes planned actions, rather than waiting until a crime has been committed (as in the 2002 film Minority Report). In principle, there could be merits in the approach for all stakeholders (Asaro 2019).

With the advent of LLMs, it became more conceivable to have AI systems themselves advising on ethical matters. Perhaps AI systems could even allow us to become better humans, by our own standards (O’Neill, Klincewicz, and Kemmer 2022). It appears that even early versions were doing well enough that “no significant difference in the perceived value of the advice between human generated ethical advice and AI-generated ethical advice” was seen (Terwiesch, Meincke, and Nave 2023).

2.4.2 Bias

AI systems can be used to support or replace human decisions, and in these cases the decisions can have a “bias”, i.e. they may be made on the basis of criteria that are irrelevant, and may thus discriminate negatively against some individual or group (cf. also Friedman 1996). Bias typically surfaces when unfair judgments are made because the individual making the judgment is influenced by a characteristic that is actually irrelevant to the matter at hand, typically a discriminatory preconception about members of a group. The person having a bias may not be aware of having that bias – they may even be honestly and explicitly opposed to a bias they are found to have (e.g. through priming, cf. Graham and Lowery 2004). On fairness vs. bias in machine learning, see (Binns 2018); on the more general notion of bias, (Johnson 2024).

Apart from the social phenomenon of learned bias, the human cognitive system is generally prone to various kinds of “cognitive biases”, e.g. the “confirmation bias”: humans tend to interpret information as confirming what they already believe. Though these forms of bias are often said to impede performance in rational judgment (Kahneman 2011), they are really just a way for cognitive systems to deal with the fact that the resources available for a given decision (time, data) are always limited. This is known as “bounded optimality” in computer science (S. Russell 2016) or “resource-rational analysis” in psychology (Lieder and Griffiths 2020).

A third form of bias is present in data when it exhibits systematic error, e.g. one of the various kinds of “statistical bias”. Strictly, any given dataset will only be unbiased for a single kind of issue, so the mere creation of a dataset involves the danger that it may be used for a different kind of issue, and then turn out to be biased for that kind. Machine learning on the basis of such data would then not only fail to recognise the bias, but codify and automate the “historical bias”. The problem with such systems is thus bias plus humans placing excessive trust in the systems. The political repercussions of such automated systems can be significant (Eubanks 2018). Furthermore, the quality of the program depends heavily on the quality of the data provided, following the old slogan “garbage in, garbage out”. So, if the data already involved a bias (e.g. police data about the skin colour of suspects), then the program will reproduce that bias. Some have argued that the ethical problems of today are the result of technical “shortcuts” AI has taken (Marcus 2018 [Other Internet Resources]; Cristianini 2023) – which links the problem of bias to the general philosophy of technology (see below).
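
A minimal synthetic sketch shows how such historical bias is codified (the hiring scenario and all data are invented for illustration): a model fitted to labels that were produced by a discriminatory rule learns to use the irrelevant attribute itself.

```python
# "Garbage in, garbage out": a model trained on historically biased
# labels reproduces the bias. All data here is synthetic.
import numpy as np

rng = np.random.default_rng(1)
n = 10_000
qualification = rng.normal(size=n)                  # the relevant criterion
group = rng.integers(0, 2, size=n).astype(float)    # an irrelevant attribute

# Historical decisions held group 1 to a stricter standard,
# so the bias is baked into the labels:
hired = (qualification > 0.8 * group).astype(float)

# Fit a logistic regression by gradient descent (no library needed).
X = np.column_stack([qualification, group, np.ones(n)])
w = np.zeros(3)
for _ in range(2000):
    p = 1 / (1 + np.exp(-X @ w))
    w -= 0.1 * X.T @ (p - hired) / n

print("weight on qualification:", round(w[0], 2))  # positive, as it should be
print("weight on group:        ", round(w[1], 2))  # negative: the learned bias
```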

It appears that technological fixes for the problem of bias have inherent limits in that they need a mathematical notion of fairness, which is hard to come by (Whittaker et al. 2018, 24ff; Selbst et al. 2019); as is a formal notion of key terms, like “race” (see Benthall and Haynes 2019) or “woman” (Mason 2022).

2.5 Human-Robot Interaction

Human-robot interaction (HRI) is an academic field in its own right, which now pays significant attention to ethical matters, to the dynamics of perception from both sides, and to both the different interests present in the social context and its intricacy, including co-working (e.g. Arnold and Scheutz 2017).

AI can also be used to drive robots that are problematic if their processes or appearance involve deception, threaten human dignity, or violate the Kantian requirement of “respect for humanity”. It appears that humans very easily attribute mental properties to objects, and empathise with them, especially when the outer appearance of these objects is similar to that of living beings. This can be used to deceive humans (or animals) into attributing more intellectual or even emotional significance to robots or AI systems than they deserve. Some parts of humanoid and animal robotics are problematic in this regard.

Basic constraints of business ethics and law apply to robots, too: product safety and liability, or non-deception in advertisement. It appears that these existing constraints take care of many concerns that are raised. There are cases, however, where human-human interaction has aspects that appear specifically human in ways that can perhaps not be replaced by robots: care, love and sex.

These issues will be more urgent soon, when robots actually leave the industrial “yellow cages”, with the help of AI, and appear in more everyday life circumstances.

2.5.1 Example (a) Care Robots

The use of robots in health care for humans is currently at the level of concept studies in real environments, but it may become a usable technology in a few years, and has raised a number of concerns for a dystopian future of de-humanised care (A. Sharkey and Sharkey 2011; Rob Sparrow 2016). Current systems include robots that support human carers/caregivers (e.g. in lifting patients, or transporting material), robots that enable patients to do certain things by themselves (e.g. eat with a robotic arm), but also robots that are given to patients as company and comfort (e.g. the “Paro” robot seal). For an overview, see (van Wynsberghe 2016; Nørskov 2017; Fosch-Villaronga and Albo-Canals 2019), for a survey of users (Draper et al. 2014).

One reason why the issue of care has come to the fore is that people have argued that we will need robots in ageing societies. It is not very clear that there really is an issue here, since the discussion mostly focuses on the fear of robots de-humanising care, but the actual and foreseeable robots in care are for classic automation of technical tasks, as assistive robots. They are thus “care robots” only in a behavioural sense of performing tasks in care environments, not in the sense that a human “cares” for the patients. If anything, the risk of robots in care is the absence of such intentional care – because fewer human carers may be needed. Interestingly, caring for something, even a virtual agent, can be good for the carer themselves (Lee et al. 2019). A system that pretends to care would be deceptive and thus problematic – unless the deception is countered by a sufficiently large utility gain (Coeckelbergh 2016). Perhaps feeling cared for by a machine, to some extent, can be progress in some cases?

2.5.2 Example (b) Sex Robots

It has been argued by several tech optimists that humans will likely be interested in sex and companionship with robots and be comfortable with the idea (Levy 2007). Given the variation of human sexual preferences, including sex toys and sex dolls, this seems very likely: The question is whether such devices should be manufactured and promoted, and whether there should be limits to use in this murky area. The topic seems to have moved into the mainstream of “robot philosophy” in recent times (Sullins 2012; Danaher and McArthur 2017; N. Sharkey et al. 2017; Bendel 2018; Devlin 2018).

Humans have long had deep emotional attachments to objects, so perhaps companionship or even love with a predictable android is attractive, especially to people who struggle with actual humans, and already prefer dogs, cats, a bird, a computer or a tamagotchi. Some authors (Nyholm, Danaher, and Earp 2022) argue that this can be true friendship, and is thus a valuable goal. It certainly looks like such friendship might increase overall utility, even if lacking in depth and function (Miyahara and Shimizu 2025). Throughout this area, there is an issue of deception, since a robot cannot (at present) mean what it says, or have feelings for a human. It is well known that humans are prone to attribute feelings and thoughts to entities that behave as if they had sentience, and even to clearly inanimate objects that show no behaviour at all. Also, paying for deception seems to be an elementary part of the traditional sex industry.

Finally, there are concerns that have often accompanied matters of sex, namely consent (L. Frank and Nyholm 2017), aesthetic concerns, and the worry that humans may be “corrupted” by certain experiences. Old-fashioned though this may seem, human behaviour is influenced by experience, and it is likely that pornography or sex robots support the perception of other humans as mere objects of desire, or even as recipients of abuse, and thus ruin a deeper sexual and erotic experience. The “Campaign Against Sex Robots” argues that these devices are a continuation of slavery and prostitution (Richardson 2016).

2.6 Autonomous AI Systems

2.6.1 Autonomy Generally

There are several notions of autonomy in the discussion of autonomous systems. A stronger notion is involved in philosophical debates where autonomy is the basis for responsibility and personhood (Christman 2018). In this context, responsibility implies autonomy, but not inversely, so there can be systems that have degrees of technical autonomy without raising issues of responsibility. The weaker, more technical, notion of autonomy in robotics is relative and gradual: A system is said to be autonomous with respect to human control to a certain degree (Müller 2012). There is a parallel here to the issues of bias and opacity in AI since autonomy also concerns a power-relation: who is in control, and who is responsible? Higher autonomy implies higher risk, but higher autonomy is also a requirement for higher productivity gains, so the development is towards higher risk. So, the issue is mainly one of risk and misuse, less of serious autonomy.

In most jurisdictions, there is a sophisticated system of civil and criminal law to resolve issues of liability. Technical standards, e.g. for the safe use of machinery in medical environments, will likely need to be adjusted. There is already a field of “verifiable AI” for such safety-critical systems, and for “security applications”. Technical standards are an important part of such regulation and self-regulation.

Among the many autonomous systems on land, on water, under water, in the air or in space, we discuss two samples: autonomous vehicles and autonomous weapons. There is now a development towards AI personal assistants, robotic or not. It seems plausible that these assistants will face significant hurdles, since the ethical and legal responsibility for the actions of the assistant would have to be clarified (Milano and Nyholm 2024).

2.6.2 Example (a) Autonomous Vehicles

Autonomous vehicles hold the promise of reducing the very significant damage that human driving currently causes – with approximately 1 million humans being killed per year, many more injured, the environment polluted, earth sealed with concrete and tarmac, cities full of unused (parked) cars, etc. However, there seem to be questions about how autonomous vehicles should behave, and how responsibility and risk should be distributed in the complicated system the vehicles operate in. (There is also significant disagreement over how long the development of fully autonomous, or “level 5”, cars (SAE 2015) will actually take, or whether it has happened already.)

There is significant discussion of “trolley problems” in this context. In the classic “trolley problems” (Thomson 1976; see the SEP entry by Woollard and Howard-Snyder 2021), various dilemmas are presented. The simplest version is that of a trolley train on a track that is heading towards five people and will kill them, unless the train is diverted onto a side track, but on that track there is one person, who will be killed if the train takes that side track. The example goes back to a remark in (Foot 1967, 6), who discusses a number of dilemma cases where tolerated consequences of an action differ from intended ones. “Trolley problems” are not supposed to describe actual ethical problems or to be solved with a “right” choice. Rather, they are thought-experiments where choice is artificially constrained to a small finite number of distinct one-off options and where the agent has perfect knowledge. These problems are used as a theoretical tool to investigate ethical intuitions and theories (Kamm and Rakowski 2016). This type of problem has reminded many of the problems encountered in actual driving, and in autonomous driving (Lin 2015). It is doubtful, however, that an actual driver or autonomous car will ever have to solve trolley problems, and it is debated whether this issue has relevance to the ethics of autonomous vehicles (Awad et al. 2018; Paulo 2023).

2.6.3 Example (b) Autonomous Weapons

The notion of automated weapons is fairly old: “For example, instead of fielding simple guided missiles or remotely piloted vehicles, we might launch completely autonomous land, sea, and air vehicles capable of complex, far-ranging reconnaissance and attack missions.” (DARPA 1983, 1). This proposal was ridiculed as “fantasy” at the time (Dreyfus, Dreyfus, and Athanasiou 1986, ix), but it is now a reality, at least for more easily identifiable targets (missiles, vehicles, planes, ships, buildings, etc.). The main arguments against (lethal) autonomous weapon systems (AWS or LAWS) are that they support extrajudicial killings, take responsibility away from humans, and make wars or killings more likely – for a detailed list of issues see (Lin, Bekey, and Abney 2008, 73–86; Taddeo 2024).

Would autonomous weapon systems make wars more likely? It appears that increasing the availability of autonomous weapons systems and reducing the probability of being held accountable would increase the probability of their use. However, the crucial asymmetry of impunity already exists in conventional wars with distance weapons (e.g. remote-controlled ones). These are the kinds of cases brought forward by the Campaign to Stop Killer Robots and other activist groups. Some of these arguments seem to be equivalent to saying that autonomous weapons are indeed weapons …, and weapons kill, but we still make them in gigantic numbers. On the matter of accountability, autonomous weapons might make identification and prosecution of the responsible agents more difficult – but this is not clear, given the digital records that one can keep, at least in a conventional war. The difficulty of allocating punishment is sometimes called the “retribution gap” (Danaher 2016a). The development of the last five years seems to be towards the acceptance of AI in warfare, but many ethical issues remain, and – officially at least – there are still always “humans in the loop”, even though the use of AI, e.g. for target identification, reduces human input, especially when speed is crucial. In regulatory terms, a total ban now seems off the cards.

Another question is whether using autonomous weapons in war would make wars worse, or perhaps make wars less bad. If robots reduce casualties, war crimes and crimes in war, the answer may well be positive; this has been used as an argument in favour of these weapons (Arkin 2009; Müller 2016a) but also as an argument against them (Amoroso and Tamburrini 2018). Arguably the main threat is not the use of such weapons in conventional warfare, but in asymmetric conflicts or by non-state agents, including criminals.

It has also been said that autonomous weapons cannot conform to International Humanitarian Law, which requires observance of the principles of distinction (between combatants and civilians), proportionality (of force) and military necessity (of force) in military conflict (A. Sharkey 2019). This now seems more of a technical issue, such that weapons must be constructed and used in ways that do not violate Humanitarian Law. There are strong indications that human rights are a useful framework for AI ethics, but this has not been much used, with some exceptions (Smuha 2021a) – inversely, it is widely recognised in human rights research that AI has significant impact on the field. Sometimes the concept of human dignity has been raised in this context (A. Sharkey 2019; cf. Rueda et al. 2025).

2.7 Moral Status, Machine Ethics, Responsibility

2.7.1 Moral Status

In a first approximation, the issue of the moral status of AI and robotics systems concerns the question whether they have obligations, and whether we have obligations towards them – this is often put in terms of agency: an entity is a moral agent if and only if it has obligations, and it is a moral patient if and only if moral agents (e.g. humans) have obligations towards it. Humans are the typical examples of moral agents, and sentient animals the typical examples of moral patients. The standard view is that AI and robotics systems have no moral status of any kind … but this view has come under pressure, and the discussion has led to insights about the nature of moral status (Misselhorn 2020; Powers and Ganascia 2020).

Some authors have indicated that it should be seriously considered whether current robots must be allocated rights (Gunkel 2018; Turner 2019; Danaher 2020). This position seems to rely largely on criticism of the opponents and on the empirical observation that robots and other non-persons are sometimes treated as having rights. In this vein, a “relational turn” has been proposed: If we relate to robots as though they had rights, then it might be futile to ask whether they really do have such rights (Coeckelbergh 2010, 2012, 2018). This raises the question of how far such anti-realism can go, and what it means then to say that “robots have rights” in a human-centred approach (Gerdes 2016). On the other side of the debate, Bryson has insisted that robots should not enjoy rights (Bryson 2010), though she considers it a possibility (Gunkel and Bryson 2014). It seems that the discussion of “rights” is now moving either to the area of superintelligence (Gordon 2022) or to a discussion of “moral status”, see (Clarke and Savulescu 2021; Clarke, Zohny, and Savulescu 2021; Müller 2021), where there is a trend in favour of the view that sentience is at least a necessary condition for moral status (Königs 2025). Accordingly, there is a concern whether it would be ethical to create AI systems with consciousness (Butlin et al. 2023 [Other Internet Resources]; Dung 2023), since this might enable sentience, and thus suffering. Some authors have called for a “moratorium on synthetic phenomenology” (Bentley et al. 2018, 28f; Metzinger 2021).

There is a wholly separate issue whether robots (or other AI systems) should be given the status of “legal entities”, or “legal persons” – in a sense in which natural persons, but also states, businesses or organisations are “legal entities”, namely they can have financial rights and liability, but not criminal liability (cf. Bryson, Diamantis, and Grant 2017; Bertolini and Aiello 2018). In environmental ethics there is a long-standing discussion about legal rights for natural objects (C. D. Stone 1972).

2.7.2 Agency

The classical “thick” notion of a moral agent is designed to assign moral responsibility, the ability to take blame and praise. It thus implies an epistemic condition (knowledge about the world), a control condition (ability to act in the world), a normative condition (ability to act on reasons, especially reflected preferences), and the possibility to be influenced by suffering or joy, technically phenomenal states with valence, or sentience (Müller 2021; Dung 2025). Some authors looking at the issue from a technical side have concluded that LLMs lack essential features of moral agency: There is no individual agent, the agent does not generate its own norms, and the agent is not shaped by its interaction (Barandiaran and Almendros 2024 [Other Internet Resources]). It is tempting to say that current AI has no “real values” at all, just preferences according to which it may act. It lacks the responsibility for them, since it cannot reflect on values or change them. Moral agency is closely related to personhood, which is associated with free will (Frankfurt 1971; Strawson 2004).

However, there is a “thin” notion of agency that is taken from the technical notion of agency, “a thing that does something”, and used in early AI ethics, where a thin notion of “machine ethics” is also used (see below). E.g. “Machine ethics extends the field of computer ethics beyond concern for what people do with their computers to questions about what the machines themselves do.” (Allen, Smit, and Wallach 2006, 15).

James Moor (Moor 2006, 19–20) distinguishes four types of machine agents: ethical impact agents (example: robot jockeys), implicit ethical agents (example: safe autopilot), explicit ethical agents (example: using formal methods to estimate utility), and full ethical agents (“can make explicit ethical judgments and generally is competent to reasonably justify them. An average adult human is a full ethical agent”).

Programmed agents are sometimes not considered “full” agents because they are “competent without comprehension”, just like the neurons in a brain (Dennett 2017; Hakli and Mäkelä 2019). A notable critic of strong claims for the abilities of AI systems, Luciano Floridi, has also suggested that we should use a minimal notion of agency, but a demanding notion of intelligence – so, current AI systems, even a merely reactive LLM, come out as agents, but without intelligence (Floridi 2023a; 2023b, ch. 2) – for a criticism of this approach, see (Zafar 2024).

2.7.3 Machine Ethics

Machine ethics is ethics for machines, for “ethical machines”, for machines as subjects, rather than for the human use of machines as objects. In earlier literature it was often not very clear whether this is supposed to cover all of AI ethics or to be a part of it (Floridi and Sanders 2004; Moor 2006; Anderson and Anderson 2011; Wallach and Asaro 2017), but thankfully the identification of AI ethics with machine ethics has been overcome by now. Sometimes there was the risky inference at play that if machines are ethically relevant, then we need a machine ethics (Anderson and Anderson 2007, 15). This might be true if “machine ethics” is taken in the thin sense of Moor’s machine agents, where mere “ethical impact” is enough for being an “ethical agent” (Moor 2006, 19) or an “artificial moral agent” (Allen, Varner, and Zinser 2000). This thin sense of machine ethics is identical to “ethics of design”, which is now the more common term (Brey and Dainow 2024; cf. Friedman 1996; Houkes and Vermaas 2010; Verbeek 2011). If machine ethics is meant in a more demanding sense, then we are back at the discussions about moral agents and patients above. The conflation of both senses of “ethical machine” is now less common than it once was, though there is now occasional use of “alignment” for design (e.g. van de Poel 2020, 388) and thin machine ethics (Cave et al. 2019) as well.

2.7.4 Responsibility

Allocation of responsibility is often a complicated matter: A car maker is responsible for the technical safety of the car, a driver is responsible for driving, a mechanic is responsible for maintenance, the public authorities are responsible for the technical conditions of the roads, etc. In general, “The effects of decisions or actions based on AI are often the result of countless interactions among many actors, including designers, developers, users, software, and hardware. … With distributed agency comes distributed responsibility.” (Taddeo and Floridi 2018, 751). How this distribution might occur is not a problem that is specific to AI, but it gains particular urgency in this context (Nyholm 2018a, 2018b). In classical control engineering, distributed control is often achieved through a control hierarchy plus control loops across these hierarchies. Even individual work is now co-working and co-creation with AI. At the same time, many societal systems depend on allocating responsibility to a person, e.g. for blame and praise, or for liability and copyright.

There have been discussions about the difficulties of allocating responsibility for the killings of an autonomous weapon, and a “responsibility gap” has been suggested (esp. Rob Sparrow 2007), meaning that neither the human nor the machine may be responsible. Perhaps the solution is to keep humans “in the loop” or “on the loop”, or in “meaningful control” (Santoni de Sio and van den Hoven 2018), which would allow us to give credit where credit is due (Danaher and Nyholm 2021). However, perhaps there truly are tragic choices (Danaher 2022), or we should not assume that for every event there is someone responsible for that event, so the real issue may well be the distribution of risk (Simpson and Müller 2016). Classic risk analysis (Hansson 2013) indicates that it is crucial to identify who is exposed to risk, who is a potential beneficiary, and who takes the decisions (Hansson 2018, 1822–1824). For more sceptical accounts of responsibility gaps, see (Tigard 2021; Königs 2022; Da Silva 2024). It remains an open question whether the advent of AI systems truly challenges our allocations of responsibility, or rather just generates temporary confusion.

2.8 Superintelligence & Existential Risk

2.8.1 Singularity & Superintelligence

The discussion so far has been limited to current and clearly foreseeable forms of AI and their societal consequences. In addition to this “short-term” AI ethics, there is also “long-term” AI ethics, namely the discussion whether, in the longer term, a new set of problems might develop – perhaps including problems that are serious enough to deserve our attention now (Sætra and Danaher 2025).

The classic narrative for long-term risk from AI has two steps: 1) The current trajectory of artificial intelligence will lead to systems that surpass the human level of intelligence, i.e. systems that are “superintelligent”, and at this point there is a sharp discontinuity, a “singularity”, from where onwards the development of AI is out of human control and hard to predict (Kurzweil 2005, 487). 2) Once that point is reached, massive negative consequences, including existential risk for the human species (XRisk), have significant probability.

Optimists like Kurzweil or Dario Amodei (the current CEO of Anthropic) accept the first step and then expect a positive development, whereas pessimists like Bostrom or Yudkowsky expect that the risk of the second step follows from the first. Having said that, some of the more recent discussion about long-term risk is moving away from XRisk and superintelligence, returning to general considerations of risk from AI.

Historically, the fear that “the robots we created will take over the world” had captured human imagination even before there were computers (e.g. Butler 1863), and it is the central theme of Čapek’s famous play that introduced the word “robot” (Čapek 1920): The robots rise up against humans after they have been provided with feelings. This fear was first formulated as a possible trajectory of existing AI into an “intelligence explosion” by Irving Good:

Let an ultraintelligent machine be defined as a machine that can far surpass all the intellectual activities of any man however clever. Since the design of machines is one of these intellectual activities, an ultraintelligent machine could design even better machines; there would then unquestionably be an “intelligence explosion”, and the intelligence of man would be left far behind. Thus the first ultraintelligent machine is the last invention that man need ever make, provided that the machine is docile enough to tell us how to keep it under control. (Good 1965, 33).

The argument from acceleration to singularity was spelled out by Ray Kurzweil. He pointed out that computing power had been increasing exponentially, i.e. doubling ca. every 2 years since 1970 in accordance with “Moore’s Law” on the number of transistors, and that it would continue to do so for some time in the future. He then predicted in (Kurzweil 1999; cf. Kurzweil 2005) that by 2010 supercomputers would reach human computation capacity, by 2030 “mind uploading” would be possible, and by 2045 the “singularity” would occur.

Despite obvious weaknesses in the identification of “intelligence” with processing power, Kurzweil seems right that humans tend to underestimate the power of exponential growth. Mini-test: If you walked in steps in such a way that each step is double the previous, starting with a step of one metre, how far would you get with 30 steps? (Answer: well past the Moon.) Indeed, most progress in AI is readily attributable to the availability of orders of magnitude faster processors, larger storage, and higher investment (Müller 2018). Since ca. 2010, the actual development of computing power has been accelerating faster than Kurzweil predicted, due to the massive reduction in the price of computation, plus the massive increase in investment and the relative success of “scaling laws” – this has led to many benchmarks being passed, and it has changed estimates of when human-level professional ability might be reached, e.g. (Grace et al. 2024 [Other Internet Resources]) vs. (Müller and Bostrom 2016).
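
For the record, the arithmetic behind the mini-test can be made explicit (a back-of-the-envelope check, assuming the first step is one metre and all 30 step lengths are added together):

\[
\sum_{k=0}^{29} 2^{k}\,\mathrm{m} \;=\; \left(2^{30}-1\right)\,\mathrm{m} \;\approx\; 1.07\times 10^{9}\,\mathrm{m} \;\approx\; 1{,}070{,}000\ \mathrm{km},
\]

roughly three times the mean Earth–Moon distance of ca. 384,000 km – and about 75% of that distance is covered in the last two steps alone, which is precisely the feature of exponential growth that intuition tends to miss.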

The version of this argument that is now more commonly used (Chalmers 2010) talks about an increase in the “intelligence” of the AI system to a level of “superintelligence”. Bostrom (2014) explains in some detail what would happen at that point, and what the risks for humanity are. The discussion is summarised in (Eden et al. 2012; Armstrong 2014; Shanahan 2015). There are possible paths to superintelligence other than an increase in computing power, e.g. the complete emulation of the human brain on a computer or in a robot (Kurzweil 2012; Sandberg 2013), biological paths, or networks and organisations (Bostrom 2014, 22–51).

2.8.2 Existential Risk

Once a stage of artificial superintelligence is reached, such systems may well have preferences that conflict with the existence of humans on Earth, and may thus decide to end that existence – and given their superior intelligence, they would have the power to do so (or they might happen to end it because they do not really care): This is the risk of extinction of the human species (XRisk) or at least some other “catastrophic risk”. The details of this second step are disputed, but the argument seems to involve orthogonality and the view that technical development is close to AGI.

One question is what kinds of goals and values the superintelligent system might have. Classic arguments for XRisk suggest that even a programme with relatively benign goals, like “maximise paperclips on earth” (Bostrom 2003b) or “optimise chess performance” (Omohundro 2014), would turn towards XRisk, because it would realise that its goals are best reached if certain sub-goals, e.g. acquiring resources, are also reached. This notion is called “instrumental convergence” (Bales, D’Alessandro, and Kirk-Giannini 2024, 4; Gallow 2024).

This only works on the assumption that superintelligence does not imply benevolence – contrary to Kantian traditions in ethics that have argued that higher levels of rationality or intelligence would go along with a better understanding of what is moral and a better ability to act morally (Gewirth 1978; Chalmers 2010, 36f). Bostrom expresses this thought as follows:

“The Orthogonality Thesis: Intelligence and final goals are orthogonal axes along which possible agents can freely vary. In other words, more or less any level of intelligence could in principle be combined with more or less any final goal.” (Bostrom 2012, 73; cf. Bostrom 2014, 105–109)

Some authors (Müller and Cannon 2022) have argued that orthogonality plus instrumental convergence incoherently combine two notions of intelligence, one with and one without moral cognition (cf. Greene 2015), while others have supported orthogonality (Dung 2024).

Thinking in the long term is the crucial feature of this literature. Whether the singularity (or another catastrophic event) occurs in 3 years, or in 30, or in 3000, does not really matter (Baum et al. 2019). These issues are sometimes taken more broadly as concerning any “catastrophic risk” for the species (Rees 2018) – of which AI is only one (Häggström 2016; Ord 2020). The evaluation of XRisk and other catastrophic risk from AI also depends on the details of how superintelligence comes about, abruptly or incrementally (Kasirzadeh 2025). While the XRisk literature is focussed on speculative destruction scenarios, Cappelen et al. (Cappelen, Goldstein, and Hawthorne forthcoming) argue that those who dismiss these prospects are advancing equally speculative “survival stories”.

Several collections of papers have investigated the risks of artificial general intelligence (AGI) and the factors that might make this development more or less risk-laden (Müller 2016b; Callaghan et al. 2017; Yampolskiy 2018). The battle lines on the XRisk side now seem hardened (Bostrom and Yudkowsky 2014; Yampolskiy 2022; Yudkowsky and Soares 2025), while on the side of general considerations of risk (Bengio et al. 2024; Gyevnár and Kasirzadeh 2025) or “instrumental convergence”, there is more movement.

Once the argumentative steps towards XRisk from AI are made, the question arises how such consequences could be avoided, if possible. This is usually called the “control problem”: How can humans control a superintelligent AI system (Bostrom 2014, 127ff)? Since actual control of a deployed superintelligent system by mere humans seems very difficult (even for an oracle-type system), the discussion is mostly about how to make sure the values of the system are “aligned” to human values at the design stage (in the spirit of machine ethics), so that, if these remain stable, the system does not need to be controlled further. This raises many technical questions, but also more philosophical ones, like what values are (Zhi-Xuan et al. 2024), what alignment is, which values to select (Gabriel 2020), and how to recognise human values (S. Russell 2019). The discussion about mis-alignment is related to the discussion about misuse of AI, and it has been argued that both can generate XRisk (Hellrigel-Holderbaum and Dung forthcoming).

The discussion of XRisk is often wedded to a version of “fanaticism”, i.e. the view that a very large utility in the long run, even with low probability, will overrule medium-size but securely foreseeable utility. This has led to some recent discussion about the acceptability of fanaticism (Wilkinson 2022; J. S. Russell 2024; Bottomley and Williamson 2025). It appears that there are costs both in erring on the too-risky and in erring on the too-cautious side (Müller 2026).

Many participants in this debate share a cultural sphere in which it is assumed that technology will develop rapidly and bring broadly radical changes, including “transhuman” views of survival for humankind in a different physical form, e.g. uploaded on a computer (Moravec 1998; Bostrom 2003a; More and Vita-More 2013). They also consider the prospects of “human enhancement” in various respects, including intelligence (Erler and Müller 2024). Some discussions about superintelligence have religious undertones, e.g. speculation about omniscient beings, the radical changes on a “latter day”, and the promise of immortality through transcendence of our current bodily form (Capurro 1993; Geraci 2008, 2010; O’Connell 2017, 160ff; Gertz 2018).

However, the discussion has moved into the mainstream, and “avoiding XRisk from AGI” has become a common theme in major AI companies, given that AI is advancing massively (Aguirre 2025) – while some earlier contributions had tried to stop the discussion about the “myth” of singularity (Floridi 2016; Ganascia 2017). The narrative of XRisk has raised public awareness of philosophy and generated much discussion in the field itself, e.g. on whether a singularity is a trajectory of current AI (Floridi 2023a; Müller 2025b; Thorstad 2025), whether it is even possible, conceptually or practically, and what the roles of its assumptions and central terms are, such as utility, goals, agency, moral rules (D’Alessandro 2025), and risk.

Even philosophers who think that the argumentative situation is not to be analysed in terms of singularity and XRisk, or who think that there are compelling reasons to reject the arguments for XRisk, should acknowledge that they might be wrong. So, the discussion can be justified and fruitful, even if one thinks the probability of XRisk from AI is very low.

2.9 Society, Politics, Economics

2.9.1 Philosophy of Technology

There is a classical tradition of “philosophy of technology”, mostly in Europe, and often referring to M. Heidegger, G. Anders or even K. Marx, which deals with the general relevance of technology for the human condition (Grunwald and Hillerbrand 2021; Gutmann, Wiegerling, and Rathgeber 2024). This tradition stems from the classical anthropology of the human as toolmaker (Prometheus) and from the philosophical and societal response to the industrial revolution, and is thus relevant for the understanding of present-day developments in AI and robotics. While it has much to say about the similarities with earlier technologies, e.g. the increased “speed” of things (Rosa 2010, 2020), it often struggles to articulate what is different about these new technologies, not to mention what is positive about them (Han 2022). In this tradition there is also a form of technology ethics, e.g. under the headings of “responsible research and innovation” (Burget, Bardone, and Pedaste 2017), “technology assessment” (Grunwald 2018) or “ethics by design” (Brey and Dainow 2024), often involving the wider socio-technical context (Koudina and Van de Poel 2024; Wang and Blok 2025).

A more political angle on technology is discussed in “Science and Technology Studies” (STS). As books like The Ethics of Invention (Jasanoff 2016) show, concerns in STS are often quite similar to those in ethics (Jacobs et al. 2019). Generally, the philosophy of AI is probably outgrowing the general philosophy and ethics of technology (see the SEP article by Franssen, Lokhorst, and van de Poel 2024), but it certainly remains part of that larger picture, and has much to learn from the experience with technologies that were once new and revolutionary, such as water power, the steam engine, trains, electricity, cars, the personal computer, or the Internet. It may well be that the ever “smarter” environment we live in (Halpern and Mitchell 2023), and specifically AI, is both a curse (Vallor 2024) and a blessing for humanity. The human condition and the meaning of our lives are fundamentally affected by our socio-technical conditions, and AI will have massive impact on these conditions, so there should really be an analogous effort in academia and public discourse to understand which effects we want from this technology, and which we can achieve.

2.9.2 Politics

Traditionally, the ethics of AI had an unhealthy obsession with individual choice, rather than with social and political choices and conditions. There is now a focus on specifically political issues with AI (Crawford 2021), especially race, gender, minorities, feminism (Gebru 2020; Browne et al. 2023; Toupin 2024), environmental impact (O’Brolcháin and Grau Ruiz 2020; van Wynsberghe 2021), global justice (Lundgren et al. 2024) and the threat to democratic processes, such as information and trust (Coeckelbergh 2024) – a survey is in (Coeckelbergh 2022b).

It appears that information technology has had impacts towards both more fairness (universal access to information, e.g. to scientific publications) and less fairness (concentration of wealth). The larger the impact of AI and robotics, the more urgent these concerns will become. Also, they will seem larger to those who are more negatively affected – all of this would have sounded familiar to analysts like Karl Marx (Marx 1867/2024) and has begun to affect political theory (Risse 2023).

One of the fundamentals of democratic societies is the free spread of information, since a free political decision is only as good as the information it is based on. However, with the help of AI, fake news is automatically generated and spread, sometimes tailored to recipients – it has become cheaper than employing humans, and it spreads “farther, faster, deeper, and more broadly than the truth in all categories of information” (Vosoughi, Roy, and Aral 2018). The control over information and narratives is often in the hands of oligarchs and monopolies, e.g. search engines are essentially controlled by one company; see the recent SEP article (Tavani and Zimmer 2025). News distribution channels are also increasingly AI-steered and AI-generated.

AI-generated social media are now the prime locations for political propaganda. This influence can be used to steer voting behaviour, as in the Facebook–Cambridge Analytica “scandal” (Woolley and Howard 2017; Bradshaw, Neudert, and Howard 2019) and – if successful – it may harm the autonomy of individuals (Susser, Roessler, and Nissenbaum 2019). Some authors (Sanders and Schneier 2024) stress that we should not make the same mistakes with AI that we made with social media, though the severity of the issue is disputed (Altay, Berriche, and Acerbi 2023). The main similarity is that problematic surveillance and manipulation can be automated, and thus applied to masses of people while being personalised. AI can help individuals and societies make better choices – but not all agents in our information space want that to happen.

These developments not only support deception and manipulation, but also the era of “bullshit” (Frankfurt 2005), or post-truth, where anyone can have their own “alternative facts”, and thus any information is as good as any other (this is sometimes confused with “free speech”). Characteristically, the main aim of political propaganda through AI-generated content is not to spread misinformation, but to spread the impression that any information is as good as any other (Goldstein et al. 2023 [Other Internet Resources]). In the AI-dominated news world, all news is reduced to vying for attention, to “click-baiting” – meaning that what is actually read is determined by what grabs attention, not by what readers would endorse paying attention to. If the information on the basis of which humans take decisions is compromised, or biased in such a way that some voices are heard more than others, this is sometimes called “epistemic injustice” (Fricker 2007; Kay, Kasirzadeh, and Mohamed 2024). These developments have found significant interest in political science and media studies (Helbing et al. 2019; Coeckelbergh 2024), and some authors have diagnosed a form of fascism in contemporary AI developments (Mühlhoff 2025). Others have argued that the “hallucinations” that plague LLMs are a form of this “bullshit” (Hicks, Humphries, and Slater 2024) or create a “testimony gap” (Robert Sparrow and Flenady 2025).

It was common to consider what a repressive regime like that of Hitler might have done with such techniques, but we do not need these fictional cases: It is already evident that such techniques are used in pseudo-democracies to ensure that elections go the intended way, that dissenting voices are silenced, and that people are controlled (e.g. in the name of “national security” or “fighting terrorism”). E.g., there is good reason to think that the moves from mainstream news media to AI-mediated “sharded media” have directly influenced the US presidential elections in 2024 (Merrin and Hoskins 2025). The technical angle of this discussion is treated by O’Neil (2016) in her influential book Weapons of Math Destruction, and in (Yeung and Lodge 2019). – The politician Henry Kissinger said that we have “generated a potentially dominating technology in search of a guiding philosophy” (Kissinger 2018).

2.9.3 Natural Environment

One interesting question that has received more attention in recent years is whether the development of AI is environmentally sustainable: While there are clearly ecological benefits of AI (e.g. optimising energy efficiency (Skare, Gavurova, and Sinkovic 2025), or just replacing some business travel by video conferencing), AI hardware systems produce waste that is very hard to recycle, use materials that are environmentally damaging to produce, consume water for cooling, and consume vast amounts of energy, especially for the training and use (inference) of machine learning systems. Unfortunately, industry is actively hiding data about energy use. For the year 2022, for which there is data, estimates of the percentage of total electricity used world-wide are: 0.04% for AI processors (de Vries 2023; Luers and Koomey 2024), 0.38% for Bitcoin mining (Neumueller 2023), and 1.8–2.6% for all data centres (Kamiya and Bertoldi 2024). A hardware-based prediction (de Vries-Gao 2025) puts all AI systems taken together at a power consumption level close to that of the Netherlands by 2025 (but still only half that of Bitcoin mining, and 20% of all non-crypto data centre use). The massive investment in AI data centres indicates that from 2026 we might see year-on-year increases of 20% (O’Donnell and Crownhart 2025) – while any ecological gain is far less evident.

Given the increasing problem of energy consumption, there is now significant technical research into making AI development and use more energy efficient. However, it is not clear that this will lead to reduced energy use, due to the “Jevons paradox”: more efficient use might cause more use, and thus more energy use overall. It appears that some actors in this space offload ecological costs onto society at large. There is a significant literature that stresses the importance of environmental issues in the political philosophy of AI, e.g. (Bender et al. 2021; Crawford 2021; van Wynsberghe 2021; Bolte and van Wynsberghe 2024) – what is less clear is what the contribution of philosophical work in this space could be.
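
To illustrate the Jevons paradox with deliberately hypothetical numbers: suppose an efficiency gain halves the energy needed per unit of computation, but the resulting lower cost triples the demand for computation; then total energy use rises rather than falls:

\[
E_{\text{new}} \;=\; 3 \times \tfrac{1}{2} \times E_{\text{old}} \;=\; 1.5\,E_{\text{old}}.
\]

Whether real demand is in fact this elastic is an empirical question; the point is only that efficiency gains do not by themselves guarantee lower total consumption.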

2.9.4 Economics

One fundamental societal impact of AI and robotics is on economic structures. There are changes in productivity, the distribution of wealth, the process of production, and in the labour market. While these are of political importance and of philosophical interest, we need to tread carefully here, rather than doing amateur economics.

The issues of the fair distribution of goods in a society, and of justice, have been central in political philosophy since its beginnings. A standard view is that distributive justice should be rationally decided from behind a “veil of ignorance” (Rawls 1971), i.e. as if one did not know which position in a society one would actually be taking (labourer or industrialist, etc.). Rawls thought the chosen principles would then support basic liberties and a distribution that is of greatest benefit to the least-advantaged members of society. It would appear that the AI economy has three features that make such justice unlikely: First, it operates in a largely unregulated environment where responsibility is often hard to allocate. Second, it operates in markets that have a “winner takes all” feature, where monopolies develop quickly. Third, the “new economy” of the digital service industries is based on intangible assets, also called “capitalism without capital” (Haskel and Westlake 2017). This means that it is difficult to control multinational digital corporations that do not rely on a physical plant in a particular physical location. These three features seem to suggest that if we leave the distribution of wealth to free market forces, the result would be an unjust distribution. This seems to be happening in the USA in the 2020s, with a “K-shaped” economy where the benefits of economic growth are skewed towards high-wealth individuals.

One area where this is visible is that AI and robotics are technologies that are developed and exploited mainly in affluent societies, sometimes through the exploitation of cheap labour. This has led to suggestions that some of the issues in this connection are “colonial” (Adams 2021; Mhlambi and Tiribelli 2023) and contribute to the massive global disparity of wealth – e.g. university professors in Africa will earn ca. 10% of what their colleagues in Europe earn. Even within affluent societies, access to AI and participation in its use are unequally distributed – roughly along the lines of general inequality. There is also a significant discrepancy between the US and Europe, in that the US dominates digital markets, especially in services.

The area that is most prominent in public discussion is that “robots will take our jobs”, now perhaps “AI will take our jobs”. There is the experience, especially among manual labourers, that productivity gains through automation have led to unemployment. Is AI just another case of this? Unlike in 2020, an impact of AI systems on the labour market is now clearly visible, and more is widely predicted – after all, the main motivation for the enormous current investment in AI (ca. 0.5 trillion US$ in 2025) is the expected profit from productivity gains.

Philosophical responses to the issue of unemployment from AI have ranged from the alarmed (Carl Benedikt Frey and Osborne 2013; Westlake 2014) to the neutral (Metcalf, Keller, and Boyd 2016; Calo 2018; Carl Benedikt Frey 2019) and the optimistic (Brynjolfsson and McAfee 2016; Harari 2016; Danaher 2019a). For the moment, it appears that the overall impact of AI is that of generating more employment (World Economic Forum 2025), but there are indications that this will change.

In principle, the labour market effect of automation seems to be fairly well understood as involving two channels: “(i) the nature of interactions between differently skilled workers and new technologies affecting labour demand and (ii) the equilibrium effects of technological progress through consequent changes in labour supply and product markets” (Goos 2018, 362). What currently seems to happen in the labour market as a result of AI & robotics automation is “job polarisation” or the “dumbbell” shape (Goos, Manning, and Salomons 2009): Growth occurs at the high-skill and the low-skill ends of the market, while mid-skill jobs, i.e. the majority of jobs, are under pressure and reduced because they are the most likely to be automated (Baldwin 2019).

It seems clear that AI and robotics will lead to significant gains in productivity, and thus in overall societal wealth. It is said that the emphasis on “growth” is a modern phenomenon (Harari 2016, 240). The use of automation aims to increase productivity, such that fewer humans are required for the same output, or the product or service can be generated at lower cost.

This reduction in human workforce does not necessarily imply a loss of overall employment, because available wealth increases, and that can increase demand sufficiently to counteract the productivity gain. In the long run, higher productivity in industrial societies has led to more wealth overall, especially since the industrial revolution. Major labour market disruptions have occurred in the past, e.g. in 1800 farming employed over 60% of the workforce in Europe and North America, while by 2010 it employed ca. 5% in the same places (Anonymous 2013). Sometimes these changes are fairly rapid, e.g. in the 20 years between 1950 and 1970 the number of hired agricultural workers in the UK was reduced by 50% (Zayed and Loft 2019). Also, the Jevons paradox (mentioned above for energy) might strike again: If, in some domain, the efficiency gain through AI is high enough to make a product cheaper, then its use might increase so much that the overall need for workers increases.

Classic automation in the industrial revolution replaced human muscle and manual labour, whereas digital automation now replaces human thought and intellectual work. Which areas of work are more prone to automation through AI is a question for the philosophy of AI, and it seems that artificial intelligence thrives where there are rules, patterns and norms, and especially where work is already done with computers. Such rules and norms are ubiquitous: A lawyer knows how to draft a contract, an investment banker understands how to structure a deal, and a civil engineer can design a drainage system. In robotics, the classic argument is that the machines will take over the “dirty, dumb and dangerous” (DDD) jobs; but if a job follows patterns that can be taught, AI can learn it. It has been argued that the work of “professions” like lawyers, doctors and teachers has become an outdated monopoly (Susskind and Susskind 2022). In the creative industries there is a massive concern that the value of human creativity is merely exploited by ML systems (Anantrasirichai and Bull 2022) with disregard for intellectual property rights, but also that creativity will generally need to be re-evaluated (Du Sautoy 2019; Misselhorn 2023). There are now clear indications that academic teaching and academic research are similarly under threat: We are moving towards a world where machines “read” scientific publications, “do research”, and “write”, “edit” or “review” scientific publications. (Perhaps science is moving in the direction of other publication systems that are overtaken by AI-generated “slop”.)

AI has a long history of overestimating the impact of its technology, but in the last 5 years the majority of predictions were underestimates. E.g., many in the field were surprised by the progress of generative AI, particularly in language (ChatGPT in Nov. 2022). However, actual human jobs stubbornly turn out to be more complicated than estimated. E.g., in 2016 Geoffrey Hinton said “We should stop training radiologists now. It’s just completely obvious that within five years, deep learning is going to do better than radiologists.” – but nearly a decade later this has not come true, mainly because causal modelling is difficult (Obuchowicz et al. 2025). Given the high stakes, it is difficult to find an ethically good way to combine human and machine expertise in making such medical decisions (Savulescu et al. 2024). – The revolution in physical interaction with autonomous vehicles and robotics is taking a little longer, just as it is taking longer to make robots that can play football than it took to make computers that can play chess.

For the labour market, the main question is: Is it different this time? Will the creation of new jobs and wealth keep up with the destruction of jobs? And even if it is not different, what are the transition costs, and who bears them? Do we need to make societal adjustments for a fair distribution of the costs and benefits of digital automation? – Perhaps enormous productivity gains will allow an “age of leisure” to be realised, something Keynes (1930) had predicted to occur around 2030, assuming a growth rate of 1% per annum. Actually, we have already reached the level he anticipated for 2030, but we are still working. Harari explains how this economic development allowed humanity to overcome hunger, disease and war – and now we aim for immortality and eternal bliss through AI, thus his book title Homo Deus (Harari 2016, 75 etc.). The question remains whether an age of leisure is desirable, and how we would live in such an age.
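
The compound arithmetic behind Keynes’s prediction is easily made explicit (an illustration of the stated assumption, not Keynes’s own calculation): at a growth rate of 1% per annum over the 100 years from 1930 to 2030, output per head multiplies by

\[
(1.01)^{100} \;=\; e^{100\,\ln 1.01} \;\approx\; e^{0.995} \;\approx\; 2.7,
\]

and since actual long-run growth in industrial societies has been faster than 1%, the level Keynes anticipated for 2030 was reached well before that date.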

2.10 AI Policy

In our context, “policy” is the general name for measures that states or other organisations take to support a politically desirable impact; it can take many forms, though it is often rule-based, e.g. tax breaks on home-building will encourage citizens to build their own homes. Such policy is driven by all sorts of political aims, but many of these will be motivated by “ethics”, e.g. justice or positive impact for vulnerable people.

In the history of AI policy, there was a phase around 2015–20 when many organisations, including national governments, the UN, the OECD and the EU, felt the need to formulate “ethical principles”, and many philosophers were involved in this work. The next phase, of actual policy, involves stakeholders and a few experts on technology and ethics, as well as the lawyers who actually formulate the laws and regulations – the attention given to philosophical work in this phase is much reduced, but much of it still runs under the heading of “AI ethics”.

Among philosophers there is still some discussion about the value of “ethical principles” for AI, for example the five principles, based on the classic four “middle principles” in bioethics (Beauchamp and Childress 2013): (1) beneficence, (2) non-maleficence, (3) autonomy, and (4) justice, plus (5) explicability (Floridi and Cowls 2019). The initiative “Artificial Intelligence for Social Good” (AI4SG) suggests seven – quite different – principles: (1) falsifiability and incremental deployment; (2) safeguards against the manipulation of predictors; (3) receiver-contextualised intervention; (4) receiver-contextualised explanation and transparent purposes; (5) privacy protection and data subject consent; (6) situational fairness; and (7) human-friendly “semanticisation” (Floridi et al. 2020; Floridi 2023b). The discussion on principles and guidelines ranges from the view that they are meaningless, toothless and politically harmful (Munn 2023), to useful “work in progress” (Lundgren 2023) towards operationalisation (Stix 2021; Bleher and Braun 2023; Taddeo, Blanchard, and Thomas 2024), software engineering (Antikainen et al. 2021), and a systemic approach (Wang and Blok 2025).

Current AI policy activities worldwide have been surveyed by the OECD AI policy observatory (https://oecd.ai/) since 2020. For activities prior to 2020, see (Jobin, Ienca, and Vayena 2019), and the useful history in (Smuha 2021b). The basic handbook is now (Bullock et al. 2024).

The most influential policy tools are the OECD “AI Principles” for a “human-centric approach to artificial intelligence” (adopted in 2019 and updated in 2024, https://oecd.ai/en/ai-principles) (OECD 2024). These focus on: 1) Inclusive growth, sustainable development and well-being, 2) Respect for the rule of law, human rights and democratic values, including fairness and privacy, 3) Transparency and explainability, 4) Robustness, security and safety, and 5) Accountability. OECD policy is now developed with the “Global Partnership on AI” (GPAI, founded in 2020), which advises governments on AI regulation (https://gpai.ai).

The most advanced legal regulation exists in the EU, with the hope of a “Brussels Effect”, as with the General Data Protection Regulation (GDPR 2016): to lead the world on regulation (Lundgren et al. 2024). The AI regulation itself goes back to the “High-Level Expert Group on AI”, which produced several relevant reports (AI HLEG 2019). The main component of this legal framework is the AI Act (EU Parliament 2024), which legally regulates AI applications by their risk, graded at five levels: unacceptable/high/general/limited/minimal. This includes risks in a consequentialist sense, as well as “risks to rights”. Wider policy concerns for the digital world that are strongly relevant for AI and robotics are laid down in the 2022 Digital Markets Act (EU Parliament 2022a), e.g. on monopolies of digital platforms and fair competition, and the 2022 Digital Services Act (EU Parliament 2022b), e.g. on disinformation and illegal content.

There is now a host of regulation relevant to AI and robotics in most nation states, though the approaches differ substantially, from libertarian support of the free market (and Hobbes’ “war of all against all”) to state control. Industry sometimes claims that regulation stifles innovation, and sometimes demands regulatory “guardrails”. The hopes for world-wide regulation, as envisaged through GPAI and the OECD, now seem low, since several state actors, especially the USA and China, are more interested in competitive advantage. Having said that, there is now a series of AI Safety Summits on the highest governmental level (Paris 2025, Seoul 2024, Bletchley Park 2023), the AI4People summits of the EU, the AI4Good summits of the ITU, etc. – so the issues of AI policy remain on the agenda of world politics.

3. Closing Note

The problems of the very concept of AI through computation have played a prominent role in several sections here, from privacy and manipulation, to politics, to superintelligence. It is remarkable how imagination or “vision” has played a central role since the very beginning of the discipline at the “Dartmouth Summer Research Project” (McCarthy et al. 1955; Simon and Newell 1958). And the evaluation of this vision is subject to dramatic change: In a few decades, we went from the slogans “AI is impossible” (Dreyfus 1972) and “AI is just automation” (Lighthill 1973) to “AI will solve all problems” (Kurzweil 1999), “AI may well kill us all” (Bostrom 2014), and “we know how to build AGI” (Altman 2025). This has created media attention and PR efforts, but it also raises the question of how much of this “philosophy and ethics of AI” is really about AI, rather than about an imagined technology. – As we said at the outset, AI and robotics have raised fundamental questions about what we should do with these systems, what the systems themselves should do, and what risks they pose in the long term. They also challenge the human view of humanity as the intelligent and dominant species on Earth. We have seen the issues that have been raised, and we will have to analyse technological and social developments closely in order to learn from them for the traditional problems of philosophy, and to help develop a positive vision for a human society with AI and robotics.

Bibliography

  • AAAI, 2025, “AAAI 2025 Presidential Panel on the Future of AI Research”, March 2025, AAAI: 88.
  • Adams, Rachel, 2021, “Can Artificial Intelligence Be Decolonized?”, Interdisciplinary Science Reviews, 46(1–2): 176–197. doi:10.1080/03080188.2020.1840225
  • Aguirre, Anthony, 2025, “Keep the Future Human: Why and How We Should Close the Gates to AGI and Superintelligence, and What We Should Build Instead”, SSRN, 5 March 2025. doi:10.2139/ssrn.4608505
  • AI HLEG, 2019, “High-Level Expert Group on Artificial Intelligence: Ethics Guidelines for Trustworthy AI”, European Commission, August 4, 2019, Brussels: 1–37.
  • Alikhademi, Kiana, Emma Drobina, Diandra Prioleau, Brianna Richardson, Duncan Purves, and Juan E. Gilbert, 2022, “A Review of Predictive Policing from the Perspective of Fairness”, Artificial Intelligence and Law, 30(1): 1–17. doi:10.1007/s10506-021-09286-4
  • Allen, Colin, Iva Smit, and Wendell Wallach, 2006, “Why Machine Ethics?”, IEEE Intelligent Systems, 21(4): 12–17. doi:10.1109/MIS.2006.83
  • Allen, Colin, Gary Varner, and Jason Zinser, 2000, “Prolegomena to Any Future Artificial Moral Agent”, Journal of Experimental & Theoretical Artificial Intelligence, 12(3): 251–261. doi:10.1080/09528130050111428
  • Altay, Sacha, Manon Berriche, and Alberto Acerbi, 2023, “Misinformation on Misinformation: Conceptual and Methodological Challenges”, Social Media + Society, 9(1): 20563051221150412.
  • Altman, Sam, 2025, “Reflections”, Blog, January 6, 2025. [Altman 2025 available online]
  • Alvarado, Ramón, 2023, “AI as an Epistemic Technology”, Science and Engineering Ethics, 29(5): 32.
  • Amoroso, Daniele and Guglielmo Tamburrini, 2018, “The Ethical and Legal Case Against Autonomy in Weapons Systems”, Global Jurist, 18(1). doi:10.1515/gj-2017-0012
  • Anantrasirichai, Nantheera and David Bull, 2022, “Artificial Intelligence in the Creative Industries: A Review”, Artificial Intelligence Review, 55(1): 589–656.
  • Anderson, Michael and Susan Leigh Anderson, 2007, “Machine Ethics: Creating an Ethical Intelligent Agent”, AI Magazine, 28(4): 15–26.
  • –––, 2011, Machine Ethics, Cambridge: Cambridge University Press.
  • Anonymous, 2013, “How Many People Work in Agriculture in the European Union? An Answer Based on Eurostat Data Sources”, EU Agricultural Economics Briefs, 8 (July). [Anonymous 2013 available online]
  • Antikainen, J., M. Agbese, H. K. Alanen, E. Halme, H. Isomäki, M. Jantunen, K. K. Kemell, R. Rousi, H. Vainio-Pekka, and V. Vakkuri, 2021, “A Deployment Model to Extend Ethically Aligned AI Implementation Method ECCOLA”, in 2021 IEEE 29th International Requirements Engineering Conference Workshops (REW), 230–235. doi:10.1109/REW53955.2021.00043
  • Arkin, Ronald C, 2009, Governing Lethal Behavior in Autonomous Robots, Boca Raton: CRC Press. [Arkin 2009 available online]
  • Armstrong, Stuart, 2014, Smarter than Us, Berkeley: MIRI.
  • Arnold, Thomas and Matthias Scheutz, 2017, “Beyond Moral Dilemmas: Exploring the Ethical Landscape in HRI”, 2017 12th ACM/IEEE International Conference on Human-Robot Interaction (HRI), 445–452.
  • Asaro, Peter M, 2019, “AI Ethics in Predictive Policing: From Models of Threat to an Ethics of Care”, IEEE Technology and Society Magazine, 38(2): 40–53. doi:10.1109/MTS.2019.2915154
  • Awad, Edmond, Sohan Dsouza, Richard Kim, Jonathan Schulz, Joseph Henrich, Azim Shariff, Jean-François Bonnefon, and Iyad Rahwan, 2018, “The Moral Machine Experiment”, Nature, 563(7729): 59–64. doi:10.1038/s41586-018-0637-6
  • Baldwin, Richard, 2019, The Globotics Upheaval: Globalisation, Robotics and the Future of Work, London: Weidenfeld & Nicolson. [Baldwin 2019 available online]
  • Bales, Adam, William D’Alessandro, and Cameron Domenico Kirk-Giannini, 2024, “Artificial Intelligence: Arguments for Catastrophic Risk”, Philosophy Compass, 19(2): e12964. doi:10.1111/phc3.12964
  • Baum, Seth D., Stuart Armstrong, Timoteus Ekenstedt, Olle Häggström, Robin Hanson, Karin Kuhlemann, Matthijs M. Maas, James D. Miller, Markus Salmela, Anders Sandberg, et al., 2019, “Long-Term Trajectories of Human Civilization”, Foresight, 21(1): 53–83. doi:10.1108/FS-04-2018-0037
  • Beauchamp, Tom L and James F Childress, 2013, Principles of Biomedical Ethics, 7th edition, New York: Oxford University Press.
  • Beisbart, Claus, 2026, “In Which Ways Is Machine Learning Opaque?”, in Philosophy of Science for Machine Learning: Core Issues and New Perspectives, Juan M Durán and Georgia Pozzi (eds.), Berlin: Springer, 3–24.
  • Bendel, Oliver, 2018, “Sexroboter Aus Sicht Der Maschinenethik”, in Handbuch Maschinenethik, Oliver Bendel (ed.), Wiesbaden: Springer Fachmedien Wiesbaden, 1–19. doi:10.1007/978-3-658-17484-2_22-1
  • Bender, Emily M., Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell, 2021, “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?”, FAcct ’21: Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 610–623. doi:10.1145/3442188.3445922
  • Bengio, Yoshua, Geoffrey Hinton, Andrew Yao, Dawn Song, Pieter Abbeel, Trevor Darrell, Yuval Noah Harari, Ya-Qin Zhang, Lan Xue, Shai Shalev-Shwartz, et al., 2024, “Managing Extreme AI Risks amid Rapid Progress”, Science, 384(6698): 842–845. doi:10.1126/science.adn0117
  • Bennett, Colin J. and Charles D. Raab, 2018, “Revisiting the Governance of Privacy: Contemporary Policy Instruments in Global Perspective”, Regulation & Governance, no. 27, September 2018. doi:10.1111/rego.12222
  • Benthall, Sebastian and Bruce D Haynes, 2019, “Racial Categories in Machine Learning”, Proceedings of ACM FAT* ’19, January 29–31, 2019, Atlanta, GA, 1–10.
  • Bentley, Peter J, Miles Brundage, Olle Häggström, and Thomas Metzinger, 2018, “Should We Fear Artificial Intelligence? In-Depth Analysis”, European Parliamentary Research Service, Scientific Foresight Unit (STOA), March 2018(PE 614.547), Brussels: 1–40.
  • Bertolini, Andrea and Giuseppe Aiello, 2018, “Robot Companions: A Legal and Ethical Analysis”, The Information Society, 34(3): 130–140. doi:10.1080/01972243.2018.1444249
  • Binns, Reuben, 2018, “Fairness in Machine Learning: Lessons from Political Philosophy”, Proceedings of Machine Learning Research, 81(1): 1–11.
  • Bleher, Hannah and Matthias Braun, 2023, “Reflections on Putting AI Ethics into Practice: How Three AI Ethics Approaches Conceptualize Theory and Practice”, Science and Engineering Ethics, 29(3): 21. doi:10.1007/s11948-023-00443-3
  • Boddington, Paula, 2023, AI Ethics: A Textbook, Singapore: Springer International Publishing.
  • Bolte, Larissa and Aimee van Wynsberghe, 2024, “Sustainable AI and the Third Wave of AI Ethics: A Structural Turn”, AI and Ethics, 16: 1–10. doi:10.1007/s43681-024-00522-6
  • Bostrom, Nick, 2003a, “Are You Living in a Computer Simulation?”, Philosophical Quarterly, 53(211): 243–255.
  • –––, 2003b, “Ethical Issues in Advanced Artificial Intelligence”, in Cognitive, Emotive and Ethical Aspects of Decision Making in Humans and in Artificial Intelligence, I. Smit et al. (ed.), Int. Institute of Advanced Studies in Systems Research and Cybernetics, 12–17. [Bostrom 2003b available online]
  • –––, 2012, “The Superintelligent Will: Motivation and Instrumental Rationality in Advanced Artificial Agents”, Minds and Machines, 22(2): 71–85 (special issue “Philosophy of AI”, Vincent C. Müller (ed.)).
  • –––, 2014, Superintelligence: Paths, Dangers, Strategies, Oxford: Oxford University Press.
  • Bostrom, Nick and Eliezer Yudkowsky, 2014, “The Ethics of Artificial Intelligence”, in The Cambridge Handbook of Artificial Intelligence, Keith Frankish and William M. Ramsey (eds.), Cambridge: Cambridge University Press, 316–334. [Bostrom & Yudkowsky 2014 available online]
  • Bottomley, Christopher and Timothy Luke Williamson, 2025, “On the Offense against Fanaticism”, Ethics, 135(2): 320–332. doi:10.1086/732617
  • Bradshaw, Samantha, Lisa-Maria Neudert, and Phil Howard, 2019, “Government Responses to Malicious Use of Social Media”, Oxford Project on Computational Propaganda, Working Paper 2019.2, Oxford. [Bradshaw, Neudert, & Howard 2019 available online]
  • Brey, Philip and Brandt Dainow, 2024, “Ethics by Design for Artificial Intelligence”, AI and Ethics, 4(4): 1265–1277.
  • Browne, Jude, Stephen Cave, Eleanor Drage, and Kerry McInerney, 2023, Feminist AI: Critical Perspectives on Algorithms, Data, and Intelligent Machines, Oxford: Oxford University Press.
  • Brundage, Miles, Shahar Avin, Jack Clark, Helen Toner, Peter Eckersley, Ben Garfinkel, Allan Dafoe, Paul Scharre, Thomas Zeitzoff, and Bobby Filar, 2018, “The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation”, FHI/CSER/CNAS/EFF/OpenAI Report, Cambridge, 1–101.
  • Brynjolfsson, Erik and Andrew McAfee, 2016, The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant Technologies, New York: W. W. Norton.
  • Bryson, Joanna J, 2010, “Robots Should Be Slaves”, in Close Engagements with Artificial Companions: Key Social, Psychological, Ethical and Design Issues, Yorick Wilks (ed.), Amsterdam: John Benjamins Publishing, 63–74.
  • –––, 2019, “The Past Decade and Future of AI’s Impact on Society”, in Towards a New Enlightenment: A Transcendent Decade, Anonymous (ed.), Madrid: Turner – BBVA. [Bryson 2019 available online]
  • Bryson, Joanna J, Mihailis E Diamantis, and Thomas D Grant, 2017, “Of, for, and by the People: The Legal Lacuna of Synthetic Persons”, Artificial Intelligence and Law, 25(3): 273–291. doi:10.1007/s10506-017-9214-9
  • Budding, Céline, and Carlos Zednik, 2024, “Does Explainable AI Need Cognitive Models?”, Proceedings of the Annual Meeting of the Cognitive Science Society, 46: 5244–5250.
  • Bullock, Justin B, Yu-Che Chen, Johannes Himmelreich, Valerie M Hudson, Anton Korinek, Matthew M Young, and Baobao Zhang, 2024, The Oxford Handbook of AI Governance, Oxford: Oxford University Press.
  • Burget, Mirjam, Emanuele Bardone, and Margus Pedaste, 2017, “Definitions and Conceptual Dimensions of Responsible Research and Innovation: A Literature Review”, Science and Engineering Ethics, 23(1): 1–19. doi:10.1007/s11948-016-9782-1
  • Burr, Christopher and Nello Cristianini, 2019, “Can Machines Read Our Minds?”, Minds and Machines, 29(3): 461–494.
  • Butler, Samuel, 1863, “Darwin among the Machines: Letter to the Editor”, The Press (Christchurch), 13.06.1863. [Butler 1863 available online]
  • Callaghan, Victor, James Miller, Roman V Yampolskiy, and Stuart Armstrong, 2017, The Technological Singularity: Managing the Journey, Berlin: Springer. [Callaghan et al. 2017 available online]
  • Calo, Ryan, 2018, “Artificial Intelligence Policy: A Primer and Roadmap”, University of Bologna Law Review, 3(2): 180–218. doi:10.2139/ssrn.3015350
  • Calo, Ryan, Michael A Froomkin, and Ian Kerr, 2016, Robot Law, Cheltenham: Edward Elgar.
  • Čapek, Karel, 1920, R.U.R., Peter Majer (trans.), London: Methuen 1999.
  • Cappelen, Herman, Simon Goldstein, and John Hawthorne, forthcoming, “AI Survival Stories: A Taxonomic Analysis of AI Existential Risk”, Philosophy of AI.
  • Capurro, Raphael, 1993, “Ein Grinsen ohne Katze: Von der Vergleichbarkeit zwischen ‘künstlicher Intelligenz’ und ‘getrennten Intelligenzen’”, Zeitschrift für Philosophische Forschung, 47: 93–102.
  • Casiraghi, Simone, 2023, “Anything New under the Sun? Insights from a History of Institutionalized AI Ethics”, Ethics and Information Technology, 25(2): 28. doi:10.1007/s10676-023-09702-0
  • Cave, Stephen, Rune Nyrup, Katarina Vold, and Adrian Weller, 2019, “Motivations and Risks of Machine Ethics”, Proceedings of the IEEE, 107(3): 562–574.
  • Chalmers, David J, 2010, “The Singularity: A Philosophical Analysis”, Journal of Consciousness Studies, 17(9–10): 7–65.
  • Chang, Ruth, 2002, “The Possibility of Parity”, Ethics, 112(4): 659–688.
  • –––, 2020, “Do We Have Normative Powers?”, Proceedings of the Aristotelian Society (Supplementary Volume), 94(1): 275–300.
  • Christman, John, 2018, “Autonomy in Moral and Political Philosophy”, in Stanford Encyclopedia of Philosophy (Spring 2018 Edition), Edward N. Zalta (ed.), URL = <https://plato.stanford.edu/archives/spr2018/entries/autonomy-moral/>
  • Clarke, Steve and Julian Savulescu, 2021, “Rethinking Our Assumptions about Moral Status”, in Rethinking Moral Status, Steve Clarke, Hazem Zohny, and Julian Savulescu (eds.), Oxford University Press, 1–20. doi:10.1093/oso/9780192894076.003.0001
  • Clarke, Steve, Hazem Zohny, and Julian Savulescu, 2021, Rethinking Moral Status, Oxford University Press. doi:10.1093/oso/9780192894076.001.0001
  • Coeckelbergh, Mark, 2010, “Robot Rights? Towards a Social-Relational Justification of Moral Consideration”, Ethics and Information Technology, 12(3): 209–221. doi:10.1007/s10676-010-9235-5
  • –––, 2012, Growing Moral Relations: Critique of Moral Status Ascription, London: Palgrave. [Coeckelbergh 2012 available online]
  • –––, 2016, “Care Robots and the Future of ICT-Mediated Elderly Care: A Response to Doom Scenarios”, AI & Society, 31(4): 455–462.
  • –––, 2018, “What Do We Mean by a Relational Ethics? Growing a Relational Approach to the Moral Standing of Plants, Robots and Other Non-Humans”, in Plant Ethics, Angela Kallhoff, Marcello Di Paola, and Maria Schörgenhumer (eds.), London: Routledge, 110–121.
  • –––, 2020, AI Ethics, Cambridge, MA: MIT Press. [Coeckelbergh 2020 available online]
  • –––, 2022a, Robot Ethics, Cambridge, MA: MIT Press.
  • –––, 2022b, The Political Philosophy of AI, Cambridge: Polity.
  • –––, 2024, Why AI Undermines Democracy and What to Do about It, Cambridge: Polity.
  • Corrêa, Nicholas Kluge, Camila Galvão, James William Santos, Carolina Del Pino, Edson Pontes Pinto, Camila Barbosa, Diogo Massmann, Rodrigo Mambrini, Luiza Galvão, Edmund Terem, and Nythamar de Oliveira, 2023, “Worldwide AI Ethics: A Review of 200 Guidelines and Recommendations for AI Governance”, Patterns, 4(10). doi:10.1016/j.patter.2023.100857
  • Costa, Elisabeth and David Halpern, 2019, “The Behavioural Science of Online Harm and Manipulation, and What to Do about It: An Exploratory Paper to Spark Ideas and Debate”, The Behavioural Insights Team Report, London, 1–82.
  • Crawford, Kate, 2021, The Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence, New Haven: Yale University Press.
  • Cristianini, Nello, 2023, The Shortcut: Why Intelligent Machines Do Not Think Like Us, Boca Raton: CRC Press. doi:10.1201/9781003335818
  • D’Alessandro, William, 2025, “Deontology and Safe Artificial Intelligence”, Philosophical Studies, 182(7): 1681–1704.
  • Da Silva, Michael, 2024, “Responsibility Gaps”, Philosophy Compass, 19(9–10).
  • Danaher, John, 2016a, “Robots, Law and the Retribution Gap”, Ethics and Information Technology, 18(4): 299–309. doi:10.1007/s10676-016-9403-3
  • –––, 2016b, “The Threat of Algocracy: Reality, Resistance and Accommodation”, Philosophy & Technology, 29(3): 245–268. doi:10.1007/s13347-015-0211-1
  • –––, 2019a, Automation and Utopia: Human Flourishing in a World without Work, Cambridge, MA: Harvard University Press.
  • –––, 2019b, “Welcoming Robots into the Moral Circle: A Defence of Ethical Behaviourism”, Science and Engineering Ethics, 26(June): 2023–2049. doi:10.1007/s11948-019-00119-x
  • –––, 2022, “Tragic Choices and the Virtue of Techno-Responsibility Gaps”, Philosophy & Technology, 35(2): 26. doi:10.1007/s13347-022-00519-1
  • Danaher, John and Neil McArthur, 2017, Robot Sex: Social and Ethical Implications, Boston, MA: MIT Press.
  • Danaher, John and Sven Nyholm, 2021, “Automation, Work and the Achievement Gap”, AI and Ethics, 1(3): 227–237. doi:10.1007/s43681-020-00028-x
  • DARPA, 1983, “Strategic Computing – New-Generation Computing Technology: A Strategic Plan for Its Development and Application to Critical Problems in Defense (28.10.1983)”. [DARPA 1983 available online]
  • de Vries, Alex, 2023, “The Growing Energy Footprint of Artificial Intelligence”, Joule, 7(10): 2191–2194.
  • de Vries-Gao, Alex, 2025, “Artificial Intelligence: Supply Chain Constraints and Energy Implications”, Joule, May (101961).
  • Dennett, Daniel C, 2017, From Bacteria to Bach and Back: The Evolution of Minds, New York: W.W. Norton.
  • Deutsch, David, 1985, “Quantum Theory, the Church-Turing Principle and the Universal Quantum Computer”, Proceedings of the Royal Society of London, A(400): 97–117.
  • Devlin, Kate, 2018, Turned on: Science, Sex and Robots, London: Bloomsbury. [Devlin 2018 available online]
  • Dignum, Virginia, 2019, Responsible Artificial Intelligence, Berlin: Springer. [Dignum 2019 available online]
  • Draper, Heather, Tom Sorell, Sandra Bedaf, Dag Sverre Syrdal, Carolina Gutierrez-Ruiz, Alexandre Duclos, and Farshid Amirabdollahian, 2014, “Ethical Dimensions of Human-Robot Interactions in the Care of Older People: Insights from 21 Focus Groups Convened in the UK, France and the Netherlands”, in International Conference on Social Robotics, M Beetz, B Johnston, and MA Williams (eds.), Vol. LNCS 8755, Cham: Springer.
  • Dreyfus, Hubert L., 1972, What Computers Still Can’t Do: A Critique of Artificial Reason, second edition, Cambridge, MA: MIT Press 1992.
  • Dreyfus, Hubert L., Stuart E. Dreyfus, and Tom Athanasiou, 1986, Mind over Machine: The Power of Human Intuition and Expertise in the Era of the Computer, New York: Free Press.
  • Du Sautoy, Marcus, 2019, “The Creativity Code: Art and Innovation in the Age of AI”, in The Creativity Code, Cambridge, MA: Harvard University Press.
  • Dubber, Markus D, Frank Pasquale, and Sunit Das, 2020, Oxford Handbook of Ethics of Artificial Intelligence, New York: Oxford University Press. [Dubber, Pasquale, & Das 2020 available online]
  • Dung, Leonard, 2023, “How to Deal with Risks of AI Suffering”, Inquiry, 22. doi:10.1080/0020174x.2023.2238287
  • –––, 2024, “Is Superintelligence Necessarily Moral?”, Analysis, 84(4): 730–738.
  • –––, 2025, “Understanding Artificial Agency”, The Philosophical Quarterly, 75(2): 450–472. doi:10.1093/pq/pqae010
  • Durán, Juan Manuel and Karin Rolanda Jongsma, 2021, “Who Is Afraid of Black Box Algorithms? On the Epistemological and Ethical Basis of Trust in Medical AI”, Journal of Medical Ethics, 47(5): 329–335.
  • Dwork, Cynthia, Frank McSherry, Kobbi Nissim, and Adam Smith, 2006, “Calibrating Noise to Sensitivity in Private Data Analysis”, in Tal Rabin (ed.), Theory of Cryptography, Berlin, Heidelberg: Springer Berlin Heidelberg, 265–284.
  • Eden, Amnon, James H. Moor, Johnny Hartz Søraker, and Eric Steinhart, 2012, Singularity Hypotheses: A Scientific and Philosophical Assessment (The Frontiers Collection), Berlin: Springer.
  • Erler, Alexandre and Vincent C. Müller, 2024, “AI as IA: The Use and Abuse of Artificial Intelligence (AI) for Human Enhancement through Intellectual Augmentation (IA)”, in The Routledge Handbook of the Ethics of Human Enhancement, Marcello Ienca and Fabrice Jotterand (eds.), London: Routledge, 187–199. doi:10.4324/9781003105596-19
  • EU Parliament, 2022a, “Digital Markets Act”, Regulation (EU) 2022/1925 of the European Parliament and of the Council of 14 September 2022 on Contestable and Fair Markets in the Digital Sector, 2022/1925, Brussels. [EU Parliament 2022a available online]
  • –––, 2022b, “Digital Services Act”, Regulation (EU) 2022/2065 of the European Parliament and of the Council of 19 October 2022 on a Single Market For Digital Services, 2022/2065, Brussels. [EU Parliament 2022b available online]
  • –––, 2024, “Artificial Intelligence Act”, Regulation (EU) 2024/1689 of the European Parliament and of the Council of 13 June 2024 Laying down Harmonised Rules on Artificial Intelligence and Amending Regulations, 2024/1689, Brussels. [EU Parliament 2024 available online]
  • Eubanks, Virginia, 2018, Automating Inequality: How High-Tech Tools Profile, Police, and Punish the Poor, London: St. Martin’s Press.
  • Faden, Ruth R and Tom L Beauchamp, 1986, A History and Theory of Informed Consent, Oxford: Oxford University Press.
  • Floridi, Luciano, 2016, “Should We Be Afraid of AI? Machines Seem to Be Getting Smarter and Smarter and Much Better at Human Jobs, yet True AI Is Utterly Implausible. Why?”, Aeon, September 5, 2016. [Floridi 2016 available online]
  • –––, 2023a, “AI as Agency without Intelligence: On ChatGPT, Large Language Models, and Other Generative Models”, Philosophy and Technology, 36(15). doi:10.2139/ssrn.4358789
  • –––, 2023b, The Ethics of Artificial Intelligence: Principles, Challenges, and Opportunities, Oxford: Oxford University Press. [Floridi 2023b available online]
  • Floridi, Luciano and Josh Cowls, 2019, “A Unified Framework of Five Principles for AI in Society”, Harvard Data Science Review, 1(1). doi:10.1162/99608f92.8cd550d1
  • Floridi, Luciano, Josh Cowls, Monica Beltrametti, Raja Chatila, Patrice Chazerand, Virginia Dignum, Christoph Luetge, Robert Madelin, Ugo Pagallo, Francesca Rossi, et al., 2018, “AI4People—An Ethical Framework for a Good AI Society: Opportunities, Risks, Principles, and Recommendations”, Minds and Machines, 28(4): 689–707.
  • Floridi, Luciano, Josh Cowls, Thomas C. King, and Mariarosaria Taddeo, 2020, “How to Design AI for Social Good: Seven Essential Factors”, Science and Engineering Ethics, 26(3): 1771–1796. doi:10.1007/s11948-020-00213-5
  • Floridi, Luciano and Jeff W. Sanders, 2004, “On the Morality of Artificial Agents”, Minds and Machines, 14: 349–379.
  • Floridi, Luciano and Mariarosaria Taddeo, 2016, “What Is Data Ethics?”, Phil. Trans. R. Soc. A, 374(2083): 20160360.
  • Floridi, Luciano, and Mariarosaria Taddeo (eds.), 2025, A Companion to Digital Ethics, Hoboken, NJ: John Wiley & Sons.
  • Foot, Philippa, 1967, “The Problem of Abortion and the Doctrine of the Double Effect”, Oxford Review, 5: 5–15.
  • Fosch-Villaronga, Eduard and Jordi Albo-Canals, 2019, “‘I’ll Take Care of You,’ Said the Robot”, Paladyn, Journal of Behavioral Robotics, 10(1): 77. doi:10.1515/pjbr-2019-0006
  • Frank, Lily E. and Michal Klincewicz, 2024, “Uses and Abuses of AI Ethics”, in Handbook on the Ethics of Artificial Intelligence, David J. Gunkel (ed.), London: Edward Elgar Publishing, 205–217. [Frank & Klincewicz 2024 available online]
  • Frank, Lily and Sven Nyholm, 2017, “Robot Sex and Consent: Is Consent to Sex between a Robot and a Human Conceivable, Possible, and Desirable?”, Artificial Intelligence and Law, 25(3): 305–323.
  • Frankfurt, Harry G, 1971, “Freedom of the Will and the Concept of a Person”, The Journal of Philosophy, LXVIII(1): 5–20.
  • –––, 2005, On Bullshit, Princeton, NJ: Princeton University Press.
  • Franssen, Maarten, Gert-Jan Lokhorst, and Ibo van de Poel, 2024, “Philosophy of Technology”, in The Stanford Encyclopedia of Philosophy (Fall 2024 Edition), Edward N. Zalta and Uri Nodelman (eds.), URL = <https://plato.stanford.edu/archives/fall2024/entries/technology/>.
  • Frey, Carl Benedikt, 2019, The Technology Trap: Capital, Labor, and Power in the Age of Automation, Princeton: Princeton University Press.
  • Frey, Carl Benedikt and Michael A. Osborne, 2013, “The Future of Employment: How Susceptible Are Jobs to Computerisation?”, Oxford Martin School Working Papers, Oxford. [Frey & Osborne 2013 available online]
  • Fricker, Miranda, 2007, Epistemic Injustice: Power and the Ethics of Knowing, Oxford: Oxford University Press.
  • Friedman, Batya, 1996, “Value-Sensitive Design”, ACM Interactions, 3(6): 16–23.
  • Gabriel, Iason, 2020, “Artificial Intelligence, Values, and Alignment”, Minds and Machines, 30(3): 411–437. doi:10.1007/s11023-020-09539-2
  • Gallow, J. Dmitri, 2024, “Instrumental Divergence”, Philosophical Studies, 182(7): 1581–1607. doi:10.1007/s11098-024-02129-3
  • Ganascia, Jean-Gabriel, 2017, Le Mythe de la Singularité, Paris: Éditions du Seuil.
  • Garfinkel, Simson L., 2025, Differential Privacy, Cambridge, MA: MIT Press.
  • GDPR, 2016, “General Data Protection Regulation: Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the Protection of Natural Persons with Regard to the Processing of Personal Data and on the Free Movement of Such Data, and Repealing Directive 95/46/EC”, Official Journal of the European Union, 119(04.05.2016): 1–88.
  • Gebru, Timnit, 2020, “Race and Gender”, in The Oxford Handbook of Ethics of AI, Markus D Dubber, Frank Pasquale, and Sunit Das (eds.), Oxford: Oxford University Press, 253–270.
  • Geraci, Robert M, 2008, “Apocalyptic AI: Religion and the Promise of Artificial Intelligence”, Journal of the American Academy of Religion, 76(1): 138–166.
  • –––, 2010, Apocalyptic AI: Visions of Heaven in Robotics, Artificial Intelligence and Virtual Reality, Oxford: Oxford University Press.
  • Gerdes, Anne, 2016, “The Issue of Moral Consideration in Robot Ethics”, ACM SIGCAS Computers and Society, 45(3): 274–279. doi:10.1145/2874239.2874278
  • Gertz, Nolen, 2018, Nihilism and Technology, London: Rowman & Littlefield.
  • Gewirth, Alan, 1978, “The Golden Rule Rationalized”, Midwest Studies in Philosophy, III(1): 133–147.
  • Gibert, Martin, 2018, “Éthique Artificielle (Version Grand Public)”, Encyclopédie Philosophique, December. [Gibert 2018 available online]
  • Good, Irving J, 1965, “Speculations Concerning the First Ultraintelligent Machine”, in Advances in Computers, Franz L Alt and Morris Rubinoff (eds.), New York & London: Academic Press, 6: 31–88.
  • Goos, Maarten, 2018, “The Impact of Technological Progress on Labour Markets: Policy Challenges”, Oxford Review of Economic Policy, 34(3): 362–375. doi:10.1093/oxrep/gry002
  • Goos, Maarten, Alan Manning, and Anna Salomons, 2009, “Job Polarization in Europe”, American Economic Review, 99(2): 58–63. doi:10.1257/aer.99.2.58
  • Gordon, John-Stewart, 2022, “Are Superintelligent Robots Entitled to Human Rights?”, Ratio, 35(3): 181–193. doi:10.1111/rati.12346
  • Gordon, John-Stewart and Sven Nyholm, 2021, “Ethics of Artificial Intelligence”, Internet Encyclopedia of Philosophy. [Gordon & Nyholm 2021 available online]
  • Graham, Sandra and Brian S. Lowery, 2004, “Priming Unconscious Racial Stereotypes About Adolescent Offenders”, Law and Human Behavior, 28(5): 483–504. doi:10.1023/B:LAHU.0000046430.65485.1f
  • Greene, Joshua D, 2015, “The Rise of Moral Cognition”, Cognition, 135: 39–42.
  • Grunwald, Armin, 2018, Technology Assessment in Practice and Theory, London: Routledge.
  • Grunwald, Armin and Rafaela Hillerbrand, 2021, Handbuch Technikethik, Stuttgart: J.B. Metzler. doi:10.1007/978-3-476-04901-8_19
  • Gunkel, David J, 2018, Robot Rights, Boston, MA: MIT Press.
  • –––, 2024, Handbook on the Ethics of Artificial Intelligence, London: Elgar Publishing.
  • Gunkel, David J and Joanna J Bryson (eds.), 2014, Special Issue on Machine Morality, Philosophy & Technology, 27(1).
  • Gutmann, Mathias, Klaus Wiegerling, and Benjamin Rathgeber, 2024, Handbuch Technikphilosophie, Stuttgart: J.B. Metzler. doi:10.1007/978-3-476-05991-8_6
  • Gyevnár, Bálint, and Atoosa Kasirzadeh, 2025, “AI Safety for Everyone”, Nature Machine Intelligence, 7(4): 531–542.
  • Hagendorff, Thilo, 2024, “Mapping the Ethics of Generative AI: A Comprehensive Scoping Review”, Minds and Machines, 34(4): 1–27. doi:10.1007/s11023-024-09694-w
  • Häggström, Olle, 2016, Here Be Dragons: Science, Technology and the Future of Humanity, Oxford: Oxford University Press. [Häggström 2016 available online]
  • Hähnel, Martin and Regina Müller, 2025, A Companion to Applied Philosophy of AI (Blackwell Companions to Philosophy), London: Wiley-Blackwell. [Hähnel & Müller 2025 available online]
  • Hakli, Raul and Pekka Mäkelä, 2019, “Moral Responsibility of Robots and Hybrid Agents”, The Monist, 102(2): 259–275.
  • Halpern, Orit and Robert Mitchell, 2023, The Smartness Mandate, Cambridge, MA: MIT Press.
  • Han, Byung-Chul, 2022, Infocracy: Digitization and the Crisis of Democracy, London: Polity Press.
  • Hansson, Sven Ove, 2013, The Ethics of Risk: Ethical Analysis in an Uncertain World, New York: Palgrave Macmillan.
  • –––, 2018, “How to Perform an Ethical Risk Analysis (eRA)”, Risk Analysis, 38(9): 1820–1829. doi:10.1111/risa.12978
  • Harari, Yuval Noah, 2016, Homo Deus: A Brief History of Tomorrow, New York: Harper.
  • Harris, Tristan, 2016, “How Technology Is Hijacking Your Mind — from a Magician and Google Design Ethicist”, Medium.Com, Thrive Global (May 18, 2016). [Harris 2016 available online]
  • Haskel, Jonathan and Stian Westlake, 2017, Capitalism without Capital: The Rise of the Intangible Economy, Princeton, NJ: Princeton University Press. [Haskel & Westlake 2017 available online]
  • Helbing, Dirk, Bruno S. Frey, Gerd Gigerenzer, Ernst Hafen, Michael Hagner, Yvonne Hofstetter, Jeroen van den Hoven, Roberto V. Zicari, and Andrej Zwitter, 2019, “Will Democracy Survive Big Data and Artificial Intelligence?”, in Towards Digital Enlightenment: Essays on the Dark and Light Sides of the Digital Revolution, Dirk Helbing (ed.), Cham: Springer International Publishing, 73–98. doi:10.1007/978-3-319-90869-4_7
  • Hellrigel-Holderbaum, Max, and Leonard Dung, forthcoming, “Misalignment or Misuse? The AGI Alignment Tradeoff”, Philosophical Studies.
  • Hendrycks, Dan, 2025, Introduction to AI Safety, Ethics, and Society, London: Taylor & Francis.
  • Hicks, Michael Townsen, James Humphries, and Joe Slater, 2024, “ChatGPT Is Bullshit”, Ethics and Information Technology, 26(2): 38.
  • Houkes, Wybo and Pieter E Vermaas, 2010, Technical Functions: On the Use and Design of Artefacts, Berlin: Springer.
  • Humphreys, Paul W, 2009, “The Philosophical Novelty of Computer Simulation Methods”, Synthese, 169(3): 615–626.
  • Huq, Aziz Z., 2020, “A Right to a Human Decision”, Virginia Law Review, 106(3): 611–688.
  • Jacobs, An, Lynn Tytgat, Michel Maus, Romain Meeusen, and Bram Vanderborght, 2019, Homo Roboticus: 30 Questions and Answers on Man, Technology, Science & Art, Brussels: ASP. [Jacobs et al. 2019 available online]
  • Jasanoff, Sheila, 2016, The Ethics of Invention: Technology and the Human Future, New York: Norton. [Jasanoff 2016 available online]
  • Jobin, Anna, Marcello Ienca, and Effy Vayena, 2019, “The Global Landscape of AI Ethics Guidelines”, Nature Machine Intelligence, 1(9): 389–399. doi:10.1038/s42256-019-0088-2
  • Johnson, Gabbrielle M., 2024, “Varieties of Bias”, Philosophy Compass, 19(7).
  • Kahneman, Daniel, 2011, Thinking, Fast and Slow, London: Macmillan.
  • Kamiya, G. and P. Bertoldi, 2024, “Energy Consumption in Data Centres and Broadband Communication Networks in the EU”, European Commission, Joint Research Centre, JRC135926, Luxembourg: Publications Office of the European Union. [Kamiya & Bertoldi 2024 available online]
  • Kamm, Frances Myrna and Eric Rakowski, 2016, The Trolley Problem Mysteries, New York: Oxford University Press.
  • Kasirzadeh, Atoosa, 2025, “Two Types of AI Existential Risk: Decisive and Accumulative”, Philosophical Studies, 182: 1975–2003. doi:10.1007/s11098-025-02301-3
  • Kay, Jackie, Atoosa Kasirzadeh, and Shakir Mohamed, 2024, “Epistemic Injustice in Generative AI”, Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 7(1): 684–697. doi:10.1609/aies.v7i1.31671
  • Keynes, John Maynard, 1930, “Economic Possibilities for Our Grandchildren”, in Essays in Persuasion, New York: Harcourt Brace 1932, 358–373.
  • Kissinger, Henry A., 2018, “How the Enlightenment Ends: Philosophically, Intellectually—in Every Way—Human Society Is Unprepared for the Rise of Artificial Intelligence”, The Atlantic, June. [Kissinger 2018 available online]
  • Klenk, Michael, and Fleur Jongepier (eds.), 2022, The Philosophy of Online Manipulation, London: Routledge.
  • Königs, Peter, 2022, “Artificial Intelligence and Responsibility Gaps: What Is the Problem?”, Ethics and Information Technology, 24(3): 36. doi:10.1007/s10676-022-09643-0
  • –––, 2024, “In Defense of ‘Surveillance Capitalism’”, Philosophy & Technology, 37(122): 1–33.
  • –––, 2025, “No Wellbeing for Robots (and Hence No Rights)”, American Philosophical Quarterly, 62(2): 191–208.
  • Kudina, Olya and Ibo van de Poel, 2024, “A Sociotechnical System Perspective on AI”, Minds and Machines, 34(21). doi:10.1007/s11023-024-09680-2
  • Kurzweil, Ray, 1999, The Age of Spiritual Machines: When Computers Exceed Human Intelligence, London: Penguin.
  • –––, 2005, The Singularity Is near: When Humans Transcend Biology, London: Viking. [Kurzweil 2005 available online]
  • –––, 2012, How to Create a Mind: The Secret of Human Thought Revealed, New York: Viking.
  • Lanier, Jaron, 2014, Who Owns the Future?, New York: Simon & Schuster.
  • LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton, 2015, “Deep Learning”, Nature, 521(7553): 436–444.
  • Lee, Minha, Sander Ackermans, Nena van As, Hanwen Chang, Enzo Lucas, and Wijnand IJsselsteijn, 2019, “Caring for Vincent: A Chatbot for Self-Compassion”, (CHI ’19) Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 702: 1–13.
  • Levy, David, 2007, Love and Sex with Robots: The Evolution of Human-Robot Relationships, New York: Harper & Co.
  • Lieder, Falk and Thomas L. Griffiths, 2020, “Resource-Rational Analysis: Understanding Human Cognition as the Optimal Use of Limited Computational Resources”, Behavioral and Brain Sciences, 43: e1. doi:10.1017/S0140525X1900061X
  • Lighthill, James, 1973, “Artificial Intelligence: A General Survey”, in Artificial Intelligence: A Paper Symposium (London), London: Science Research Council. [Lighthill 1973 available online]
  • Lin, Patrick, 2015, “Why Ethics Matters for Autonomous Cars”, in Autonomous Driving, M. Maurer et al. (eds.), Berlin: Springer, 69–85. doi:10.1007/978-3-662-48847-8_4
  • Lin, Patrick, Keith Abney, and Ryan Jenkins, 2017, Robot Ethics 2.0: From Autonomous Cars to Artificial Intelligence, New York: Oxford University Press.
  • Lin, Patrick, George Bekey, and Keith Abney, 2008, “Autonomous Military Robotics: Risk, Ethics, and Design”, US Department of Navy, Office of Naval Research, no. December 20, 2008: 1–112.
  • Luers, Amy, Eric Masanet, Jonathan Koomey, Owen Gaffney, Felix Creutzig, Juan Lavista Ferres, and Eric Horvitz, 2024, “Will AI Accelerate or Delay the Race to Net-Zero Emissions?”, Nature, 628 (Comment): 718–720.
  • Luguri, Jamie and Lior Jacob Strahilevitz, 2021, “Shining a Light on Dark Patterns”, Journal of Legal Analysis, 13(1): 43–109. doi:10.1093/jla/laaa006
  • Lundgren, Björn, 2023, “In Defense of Ethical Guidelines”, AI and Ethics, 3(3): 1013–1020. doi:10.1007/s43681-022-00244-7
  • Lundgren, Björn, Eleonora Catena, Ian Robertson, Max Hellrigel-Holderbaum, Ibifuro Robert Jaja, and Leonard Dung, 2024, “On the Need for a Global AI Ethics”, Journal of Global Ethics, 20(3): 330–342.
  • Macnish, Kevin, 2017, The Ethics of Surveillance: An Introduction, London: Routledge.
  • Marx, Karl, 1867, Capital: Critique of Political Economy (Volume 1), Paul Reitter (ed. & tran.), Princeton, NJ: Princeton University Press.
  • Mason, Rebecca, 2022, “Women Are Not Adult Human Females”, Australasian Journal of Philosophy, 102(1): 180–191. doi:10.1080/00048402.2022.2149824
  • Mathur, Arunesh, Gunes Acar, Michael Friedman, Elena Lucherini, Jonathan Mayer, Marshini Chetty, and Arvind Narayanan, 2019, “Dark Patterns at Scale: Findings from a Crawl of 11K Shopping Websites”, Proceedings of the ACM Human-Computer Interaction, 3(81): 1–32.
  • McCarthy, John, Marvin Minsky, Nathaniel Rochester, and Claude E. Shannon, 1955, “A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence”. [McCarthy et al. 1955 available online]
  • Meijer, Albert and Martijn Wessels, 2019, “Predictive Policing: Review of Benefits and Drawbacks”, International Journal of Public Administration, 42(12): 1031–1039.
  • Merrin, William and Andrew Hoskins, 2025, Sharded Media: Trump’s Rage against the Mainstream, London: Palgrave Macmillan. doi:10.1007/978-3-031-84786-8_1
  • Metcalf, Jacob, Emily F. Keller, and Danah Boyd, 2016, “Perspectives on Big Data, Ethics, and Society”, Council for Big Data, Ethics, and Society, May 23, 2016: 23pp.
  • Metzinger, Thomas, 2021, “Artificial Suffering: An Argument for a Global Moratorium on Synthetic Phenomenology”, Journal of Artificial Intelligence and Consciousness, 8(01): 43–66.
  • Mhlambi, Sábëlo, and Simona Tiribelli, 2023, “Decolonizing AI Ethics: Relational Autonomy as a Means to Counter AI Harms”, Topoi, 42(3): 867–880.
  • Milano, Silvia and Sven Nyholm, 2024, “Advanced AI Assistants That Act on Our Behalf May Not Be Ethically or Legally Feasible”, Nature Machine Intelligence, 6(8): 846–847. doi:10.1038/s42256-024-00877-9
  • Misselhorn, Catrin, 2018, Grundfragen der Maschinenethik, fifth edition, Ditzingen: Reclam.
  • –––, 2020, “Artificial Systems with Moral Capacities? A Research Design and Its Implementation in a Geriatric Care System”, Artificial Intelligence, 278(January): 103179. doi:10.1016/j.artint.2019.103179
  • –––, 2023, Künstliche Intelligenz – das Ende der Kunst?, Stuttgart: Reclam.
  • Mittelstadt, Brent Daniel, Patrick Allo, Mariarosaria Taddeo, Sandra Wachter, and Luciano Floridi, 2016, “The Ethics of Algorithms: Mapping the Debate”, Big Data & Society, 3(2). [Mittelstadt et al. 2016 available online]
  • Miyahara, Katsunori, and Hayate Shimizu, 2025, “Instrumental, Intrinsic, and Functional Scarcity”, Philosophy & Technology, 38(3): 103.
  • Moor, James H., 2006, “The Nature, Importance, and Difficulty of Machine Ethics”, IEEE Intelligent Systems, 21(4): 18–21.
  • Moravec, Hans, 1998, Robot: Mere Machine to Transcendent Mind, New York: Oxford University Press.
  • More, Max and Natasha Vita-More, 2013, The Transhumanist Reader, London: John Wiley. doi:10.1002/9781118555927.fmatter
  • Morozov, Evgeny, 2013, To Save Everything, Click Here: The Folly of Technological Solutionism, New York: Public Affairs.
  • Mühlhoff, Rainer, 2025, Künstliche Intelligenz und der neue Faschismus, Stuttgart: Reclam.
  • Müller, Vincent C., 2012, “Autonomous Cognitive Systems in Real-World Environments: Less Control, More Flexibility and Better Interaction”, Cognitive Computation, 4(3): 212–215. doi:10.1007/s12559-012-9129-4
  • –––, 2014, “Editorial: Risks of General Artificial Intelligence”, Journal of Experimental and Theoretical Artificial Intelligence, 26(3): 297–301. [Müller 2014 available online]
  • –––, 2016a, “Autonomous Killer Robots Are Probably Good News”, in Drones and Responsibility: Legal, Philosophical and Socio-Technical Perspectives on the Use of Remotely Controlled Weapons, Ezio Di Nucci and Filippo Santoni de Sio (eds.), London: Routledge, 67–81. doi:10.4324/9781315578187-4
  • –––, 2016b, Risks of Artificial Intelligence, London: Chapman & Hall – CRC Press. doi:10.1201/b19187
  • –––, 2018, “In 30 Schritten zum Mond? Zukünftiger Fortschritt in der KI”, Medienkorrespondenz, 20(May 10, 2018): 5–15.
  • –––, 2020, “Ethics of Artificial Intelligence and Robotics”, in Stanford Encyclopedia of Philosophy (Summer 2020 Edition), Edward N. Zalta (ed.), URL = <https://plato.stanford.edu/archives/sum2020/entries/ethics-ai/>
  • –––, 2021, “Is It Time for Robot Rights? Moral Status in Artificial Entities”, Ethics & Information Technology, 23(3): 579–587.
  • –––, 2022, “The History of Digital Ethics”, in Oxford Handbook of Digital Ethics, Carissa Véliz (ed.), Oxford: Oxford University Press, 3–19. doi:10.1093/oxfordhb/9780198857815.013.1
  • –––, 2025a, “Deep Opacity and AI: A Threat to XAI and to Privacy Protection Mechanisms”, in A Companion to Applied Philosophy of AI (Blackwell Companions to Philosophy), Martin Hähnel and Regina Müller (eds.), London: Wiley-Blackwell. [Müller 2025a available online]
  • –––, 2025b, “Philosophy of AI: A Structured Overview”, in Cambridge Handbook on the Law, Ethics and Policy of Artificial Intelligence, Nathalie A. Smuha (ed.), Cambridge: Cambridge University Press, 40–58. [Müller 2025b available online]
  • Müller, Vincent C. and Nick Bostrom, 2016, “Future Progress in Artificial Intelligence: A Survey of Expert Opinion”, in Fundamental Issues of Artificial Intelligence (Synthese Library 376), Vincent C. Müller (ed.), Berlin: Springer, 553–570. [Müller & Bostrom 2016 available online]
  • –––, 2026, “Short-Term or Long-Term AI Ethics? A Dilemma for Ethical Singularity Only”, in Contemporary Debates in the Ethics of Artificial Intelligence, Sven Nyholm, Atoosa Kasirzadeh, and John Zerilli (eds.), London: Wiley, 309–318.
  • Müller, Vincent C. and Michael Cannon, 2022, “Existential Risk from AI and Orthogonality: Can We Have It Both Ways?”, Ratio, 35(1): 25–36. doi:10.1111/rati.12320
  • Müller, Vincent C. and Guido Löhr, forthcoming, Artificial Minds (Cambridge Elements), Cambridge: Cambridge University Press.
  • Munn, Luke, 2023, “The Uselessness of AI Ethics”, AI and Ethics, 3(3): 869–877. doi:10.1007/s43681-022-00209-w
  • Narayanan, Arvind and Sayash Kapoor, 2024, AI Snake Oil, Princeton, NJ: Princeton University Press. doi:10.1515/9780691249643
  • Neumueller, Alexander, 2023, “Bitcoin Electricity Consumption: An Improved Assessment”, Judge Business School, August 31, 2023, Cambridge, UK: University of Cambridge. [Neumueller 2023 available online]
  • Newport, Cal, 2019, Digital Minimalism: On Living Better with Less Technology, London: Penguin.
  • Nissenbaum, Helen, 2004, “Privacy as Contextual Integrity”, Washington Law Review, 79(1): 119–157.
  • Noggle, Robert, 2022, “The Ethics of Manipulation”, in The Stanford Encyclopedia of Philosophy (Summer 2022 Edition), Edward N Zalta (ed.), URL = <https://plato.stanford.edu/archives/sum2022/entries/ethics-manipulation/>
  • Nørskov, Marco, 2017, Social Robots, London: Routledge.
  • Nyholm, Sven, 2018a, “The Ethics of Crashes with Self-Driving Cars: A Roadmap, II”, Philosophy Compass, 13(7): e12506. doi:10.1111/phc3.12506
  • –––, 2018b, “Attributing Agency to Automated Systems: Reflections on Human–Robot Collaborations and Responsibility-Loci”, Science and Engineering Ethics, 24(4): 1201–1219. doi:10.1007/s11948-017-9943-x
  • –––, 2022, “Technological Manipulation and Threats to Meaning in Life”, in The Philosophy of Online Manipulation, Fleur Jongepier and Michael Klenk (eds.), New York: Routledge, 1–18.
  • Nyholm, Sven, John Danaher, and Brian D Earp, 2022, “The Technological Future of Love”, in Philosophy of Love in the Past, Present, and Future, André Grahle, Natasha McKeever, and Joe Saunders (eds.), London: Routledge, 224–239. doi:10.4324/9781003014331-18
  • Nyholm, Sven, Atoosa Kasirzadeh, and John Zerilli (eds.), 2026, Contemporary Debates in the Ethics of Artificial Intelligence, London: Wiley.
  • O’Brolcháin, Fiachra and María Amparo Grau Ruiz, 2020, “Environmental Impact of Robotics: Ethical Concerns and Legal Alternatives”, in Industry, Innovation and Infrastructure, Walter Leal Filho, Anabela Marisa Azul, Luciana Brandli, Amanda Lange Salvia, and Tony Wall (eds.), Cham: Springer International Publishing, 1–15. doi:10.1007/978-3-319-71059-4_147-1
  • Obuchowicz, R., J. Lasek, M. Wodziński, A. Piórkowski, M. Strzelecki, and K. Nurzynska, 2025, “Artificial Intelligence-Empowered Radiology-Current Status and Critical Review”, Diagnostics, 15(3). doi:10.3390/diagnostics15030282
  • O’Donnell, James, and Casey Crownhart, 2025, “We Did the Math on AI’s Energy Footprint. Here’s the Story You Haven’t Heard”, MIT Technology Review, May 20, 2025. [O’Donnell & Crownhart 2025 available online]
  • O’Connell, Mark, 2017, To Be a Machine: Adventures among Cyborgs, Utopians, Hackers, and the Futurists Solving the Modest Problem of Death, London: Granta. [O’Connell 2017 available online]
  • OECD, 2024, “Recommendation of the Council on Artificial Intelligence”, C/MIN(2024)16/FINAL(05.03.2024), OECD Legal Instruments. [OECD 2024 available online]
  • Omohundro, Steve, 2014, “Autonomous Technology and the Greater Human Good”, Journal of Experimental and Theoretical Artificial Intelligence, 26(3), special issue “Risks of General Artificial Intelligence”, Vincent C. Müller (ed.): 303–315.
  • O’Neil, Cathy, 2016, Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy, New York: Crown.
  • O’Neill, Elizabeth, Michal Klincewicz, and Michiel Kemmer, 2022, “Ethical Issues with Artificial Ethics Assistants”, in Oxford Handbook of Digital Ethics, Carissa Véliz (ed.), Oxford: Oxford University Press, 312–335. doi:10.1093/oxfordhb/9780198857815.013.1
  • OpenAI, 2025, “OpenAI GPT-4.5 System Card”, February 27, 2025. [OpenAI 2025 available online]
  • Ord, Toby, 2020, The Precipice: Existential Risk and the Future of Humanity, London: Bloomsbury.
  • Paulo, Norbert, 2023, “The Trolley Problem in the Ethics of Autonomous Vehicles”, The Philosophical Quarterly, 73(4): 1046–1066. doi:10.1093/pq/pqad051
  • Poel, Ibo van de, 2020, “Embedding Values in Artificial Intelligence (AI) Systems”, Minds and Machines, 30(3): 385–409. doi:10.1007/s11023-020-09537-4
  • Powers, Thomas M and Jean-Gabriel Ganascia, 2020, “The Ethics of the Ethics of AI”, in Oxford Handbook of Ethics of Artificial Intelligence, Markus D Dubber, Frank Pasquale, and Sunit Das (eds.), New York: Oxford University Press. doi:10.1093/oxfordhb/9780190067397.013.2
  • Prunkl, Carina, 2024, “Human Autonomy at Risk? An Analysis of the Challenges from AI”, Minds and Machines, 34(3): 26.
  • Rachels, James, 1975, “Why Privacy Is Important”, Philosophy and Public Affairs, 4(4): 323–333.
  • Rawls, John, 1971, A Theory of Justice, Cambridge, MA: Belknap Press.
  • Rees, Martin, 2018, On the Future: Prospects for Humanity, Princeton: Princeton University Press. [Rees 2018 available online]
  • Richardson, Kathleen, 2016, “Sex Robot Matters: Slavery, the Prostituted, and the Rights of Machines”, IEEE Technology and Society, 35(2). [Richardson 2016 available online]
  • Rini, Regina, 2020, “Deepfakes and the Epistemic Backstop”, Philosophers’ Imprint, 20(24): 1–16.
  • Risse, Mathias, 2023, Political Theory of the Digital Age: Where Artificial Intelligence Might Take Us, Cambridge: Cambridge University Press.
  • Robertson, Ian, 2025, “AI, Trust and Reliability”, Philosophy & Technology, 38(3): 94.
  • Rocher, Luc, Julien M. Hendrickx, and Yves-Alexandre de Montjoye, 2019, “Estimating the Success of Re-Identifications in Incomplete Datasets Using Generative Models”, Nature Communications, 10(1): 3069. doi:10.1038/s41467-019-10933-3
  • Roessler, Beate, 2017, “Privacy as a Human Right”, Proceedings of the Aristotelian Society, CXVII(2): 187–206.
  • Rosa, Hartmut, 2010, High-Speed Society: Social Acceleration, Power, and Modernity, University Park: Penn State Press.
  • –––, 2020, The Uncontrollability of the World, New York: John Wiley & Sons.
  • Royakkers, Lambèr and Rinie van Est, 2016, Just Ordinary Robots: Automation from Love to War, Boca Raton: CRC Press, Taylor & Francis.
  • Rueda, Jon, Txetxu Ausín, Mark Coeckelbergh, Juan Ignacio del Valle, Francisco Lara, Belén Liedo, Joan Llorca Albareda, Heidi Mertes, Robert Ranisch, Vera Lúcia Raposo, et al., 2025, “Why Dignity Is a Troubling Concept for AI Ethics”, Patterns, 6(3). doi:10.1016/j.patter.2025.101207
  • Russell, Jeffrey Sanford, 2024, “On Two Arguments for Fanaticism”, Noûs, 58(3): 565–595. doi:10.1111/nous.12461
  • Russell, Stuart, 2016, “Rationality and Intelligence: A Brief Update”, in Fundamental Issues of Artificial Intelligence (Synthese Library 376), Vincent C. Müller (ed.), Cham: Springer, 7–28. doi:10.1007/978-3-319-26485-1_2
  • –––, 2019, Human Compatible: Artificial Intelligence and the Problem of Control, New York: Viking.
  • –––, 2022, “If We Succeed”, Dædalus, 151(2): 43–57.
  • Russell, Stuart, and Peter Norvig, 2020, Artificial Intelligence: A Modern Approach, fourth edition, Upper Saddle River: Prentice Hall.
  • SAE, 2015, “Taxonomy and Definitions for Terms Related to Driving Automation Systems for On-Road Motor Vehicles”, SAE Recommended Practice, J3016_201806 (2018-06-15). [SAE 2015 available online]
  • Sætra, Henrik Skaug and John Danaher, 2025, “Resolving the Battle of Short- vs. Long-Term AI Risks”, AI and Ethics, 5(1): 723–728. doi:10.1007/s43681-023-00336-y
  • Sandberg, Anders, 2013, “Feasibility of Whole Brain Emulation”, in Philosophy and Theory of Artificial Intelligence (SAPERE), Vincent C. Müller (ed.), Berlin: Springer, 251–264.
  • Sanders, Nathan E and Bruce Schneier, 2024, “Let’s Not Make the Same Mistakes with AI That We Made with Social Media”, MIT Technology Review, March 13, 2024. [Sanders & Schneier 2024 available online]
  • Santoni de Sio, Filippo, 2024, Human Freedom in the Age of AI, New York: Routledge.
  • Santoni de Sio, Filippo and Jeroen van den Hoven, 2018, “Meaningful Human Control over Autonomous Systems: A Philosophical Account”, Frontiers in Robotics and AI, 5(15): 1–15. doi:10.3389/frobt.2018.00015
  • Savulescu, Julian, Alberto Giubilini, Robert Vandersluis, and Abhishek Mishra, 2024, “Ethics of AI in Medicine”, Singapore Medical Journal, 65(3): 150–158.
  • Schmidhuber, Jürgen, 2015, “Deep Learning in Neural Networks: An Overview”, Neural Networks, 61: 85–117.
  • Schneider, Susan, 2025, “Chatbot Epistemology”, Social Epistemology, 39(5): 570–589.
  • Schneier, Bruce, 2015, Data and Goliath: The Hidden Battles to Collect Your Data and Control Your World, New York: W. W. Norton.
  • Schwalbe, Gesina and Bettina Finzel, 2024, “A Comprehensive Taxonomy for Explainable Artificial Intelligence: A Systematic Survey of Surveys on Methods and Concepts”, Data Mining and Knowledge Discovery, 38(5): 3043–3101. doi:10.1007/s10618-022-00867-8
  • Searle, John R., 1980, “Minds, Brains and Programs”, Behavioral and Brain Sciences, 3: 417–457. doi:10.1017/S0140525X00005756
  • Selbst, Andrew D., Danah Boyd, Sorelle A. Friedler, Suresh Venkatasubramanian, and Janet Vertesi, 2019, “Fairness and Abstraction in Sociotechnical Systems”, in Proceedings of the Conference on Fairness, Accountability, and Transparency, Atlanta, GA, USA: ACM, 59–68. doi:10.1145/3287560.3287598
  • Sennett, Richard, 2018, Building and Dwelling: Ethics for the City, London: Allen Lane.
  • Shanahan, Murray, 2015, The Technological Singularity, Cambridge, MA: MIT Press.
  • Sharkey, Amanda, 2019, “Autonomous Weapons Systems, Killer Robots and Human Dignity”, Ethics and Information Technology, 21(2): 75–87. doi:10.1007/s10676-018-9494-0
  • Sharkey, Amanda and Noel Sharkey, 2011, “The Rights and Wrongs of Robot Care”, in Robot Ethics: The Ethical and Social Implications of Robotics, Patrick Lin, Keith Abney, and George Bekey (eds.), Cambridge, MA: MIT Press, 267–282.
  • Sharkey, Noel, Aimee van Wynsberghe, Scott Robbins, and Eleanor Hancock, 2017, “Report: Our Sexual Future with Robots”, Responsible Robotics, 1–44.
  • Simon, Herbert and Allen Newell, 1958, “Heuristic Problem Solving: The Next Advance in Operations Research”, Operations Research, 6(1): 1–10.
  • Simpson, Thomas W and Vincent C. Müller, 2016, “Just War and Robots’ Killings”, The Philosophical Quarterly, 66(263): 302–322. doi:10.1093/pq/pqv075
  • Skare, Marinko, Beata Gavurova, and Dean Sinkovic, 2025, “Measuring Artificial Intelligence’s Impact on Sustainable Energy Transition: Empirical Insights and Policy Implications”, Energy Economics, 150: 108825.
  • Smolan, Sandy, 2016, “The Human Face of Big Data”, PBS Documentary, 24 February 2016, PBS-Luminous Content: 56 mins.
  • Smuha, Nathalie A., 2021a, “From a ‘Race to AI’ to a ‘Race to AI Regulation’: Regulatory Competition for Artificial Intelligence”, Law, Innovation and Technology, 13(1): 57–84. doi:10.1080/17579961.2021.1898300
  • –––, 2021b, “Beyond a Human Rights-Based Approach to AI Governance: Promise, Pitfalls, Plea”, Philosophy & Technology, 34(1): 91–104. doi:10.1007/s13347-020-00403-w
  • –––, 2025, Cambridge Handbook on the Law, Ethics and Policy of Artificial Intelligence, Cambridge: Cambridge University Press. [Smuha 2025 available online]
  • Sparrow, Rob, 2007, “Killer Robots”, Journal of Applied Philosophy, 24(1): 62–77.
  • –––, 2016, “Robots in Aged Care: A Dystopian Future”, AI & Society, 31(4): 1–10.
  • Sparrow, Rob, and Gene Flenady, 2025, “The Testimony Gap: Machines and Reasons”, Minds and Machines, 35(1): 12.
  • Stix, Charlotte, 2021, “Actionable Principles for Artificial Intelligence Policy: Three Pathways”, Science and Engineering Ethics, 27(1): 15. doi:10.1007/s11948-020-00277-3
  • Stone, Christopher D, 1972, “Should Trees Have Standing – Toward Legal Rights for Natural Objects”, Southern California Law Review, 45: 450–501.
  • Stone, Peter, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram Kalyanakrishnan, Ece Kamar, Sarit Kraus, et al., 2021, “Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report”, September 2021, Palo Alto: Stanford. [Stone et al. 2021 available online]
  • Strawson, Galen, 2011, “Free Will”, Routledge Encyclopedia of Philosophy, London: Routledge. doi:10.4324/9780415249126-V014-2
  • Sullins, John P, 2012, “Robots, Love, and Sex: The Ethics of Building a Love Machine”, IEEE Transactions on Affective Computing, 3(4): 398–409. doi:10.1109/T-AFFC.2012.31
  • Sullivan, Emily, 2022, “Inductive Risk, Understanding, and Opaque Machine Learning Models”, Philosophy of Science, 89(5): 1065–1074. doi:10.1017/psa.2022.62
  • Susser, Daniel, Beate Roessler, and Helen Nissenbaum, 2019, “Technology, Autonomy, and Manipulation”, Internet Policy Review, 8(2). doi:10.14763/2019.2.1410
  • Susskind, Richard and Daniel Susskind, 2022, The Future of the Professions: How Technology Will Transform the Work of Human Experts, Oxford: Oxford University Press.
  • Taddeo, Mariarosaria, 2024, The Ethics of Artificial Intelligence in Defense, Oxford: Oxford University Press.
  • Taddeo, Mariarosaria, Alexander Blanchard, and Christopher Thomas, 2024, “From AI Ethics Principles to Practices: A Teleological Methodology to Apply AI Ethics Principles in The Defence Domain”, Philosophy & Technology, 37(1): 42. doi:10.1007/s13347-024-00710-6
  • Taddeo, Mariarosaria and Luciano Floridi, 2018, “How AI Can Be a Force for Good”, Science, 361(6404): 751–752. doi:10.1126/science.aat5991
  • Tavani, Herman T. and Michael Zimmer, 2025, “Search Engines and Ethics”, in The Stanford Encyclopedia of Philosophy (Spring 2025 Edition), Edward N. Zalta (ed.), URL = <https://plato.stanford.edu/archives/spr2025/entries/ethics-search/>
  • Taylor, Steve, Brian Pickering, Michael Boniface, Michael Anderson, David Danks, Asbjørn Følstad, Matthias Leese, Vincent C. Müller, Tom Sorell, Alan Winfield, and Fiona Woollard, 2018, “Responsible AI – Key Themes, Concerns & Recommendations for European Research and Innovation: Summary of Consultation with Multidisciplinary Experts”, June. [Taylor et al. 2018 available online]
  • Terwiesch, Christian, Lennart Meincke, and Gideon Nave, 2023, “The AI Ethicist: Fact or Fiction?”, SSRN: The Wharton School Research Paper, no. 4609825: 1–14.
  • Thaler, Richard H and Cass Sunstein, 2008, Nudge: Improving Decisions about Health, Wealth and Happiness, New York: Penguin.
  • Thoma, Johanna, 2019, “Decision Theory”, in The Open Handbook of Formal Epistemology, Richard Pettigrew and Jonathan Weisberg (eds.), PhilPapers, 57–106.
  • Thomson, Judith Jarvis, 1976, “Killing, Letting Die and the Trolley Problem”, Monist, 59: 204–217.
  • Thorstad, David, 2025, “Against the Singularity Hypothesis”, Philosophical Studies, 182(7): 1627–1651.
  • Tigard, Daniel W, 2021, “There Is No Techno-Responsibility Gap”, Philosophy & Technology, 34(3): 589–607.
  • Toupin, Sophie, 2024, “Shaping Feminist Artificial Intelligence”, New Media & Society, 26(1): 580–595. doi:10.1177/14614448221150776
  • Turner, Jacob, 2019, Robot Rules: Regulating Artificial Intelligence, Berlin: Springer. [Turner 2019 available online]
  • Tzafestas, Spyros G, 2016, Roboethics: A Navigating Overview, Berlin: Springer. [Tzafestas 2016 available online]
  • Vallor, Shannon, 2024, The AI Mirror: How to Reclaim Our Humanity in an Age of Machine Thinking, Oxford: Oxford University Press. doi:10.1093/oso/9780197759066.001.0001
  • Véliz, Carissa, 2020, Privacy Is Power, London: Penguin.
  • –––, 2024, The Ethics of Privacy and Surveillance, Oxford: Oxford University Press.
  • Verbeek, Peter-Paul, 2011, Moralizing Technology: Understanding and Designing the Morality of Things, Chicago: University of Chicago Press.
  • Vosoughi, Soroush, Deb Roy, and Sinan Aral, 2018, “The Spread of True and False News Online”, Science, 359(6380): 1146–1151. doi:10.1126/science.aap9559
  • Wachter, Sandra and Brent Daniel Mittelstadt, 2019, “A Right to Reasonable Inferences: Re-Thinking Data Protection Law in the Age of Big Data and AI”, Columbia Business Law Review, 494. [Wachter & Mittelstadt 2019 available online]
  • Wallach, Wendell and Peter M Asaro, 2017, Machine Ethics and Robot Ethics, London: Routledge.
  • Walsh, Toby, 2018, Machines That Think: The Future of Artificial Intelligence, Amherst, MA: Prometheus Books.
  • Wang, Hao, and Vincent Blok, 2025, “Why Putting Artificial Intelligence Ethics into Practice Is Not Enough: Towards a Multi-Level Framework”, Big Data & Society, 12(2): 20539517251340620.
  • Westlake, Stian, 2014, Our Work Here Is Done: Visions of a Robot Economy, London: Nesta. [Westlake 2014 available online]
  • Whittaker, Meredith, Kate Crawford, Roel Dobbe, Genevieve Fried, Elizabeth Kaziunas, Varoon Mathur, Sarah Myers West, Rashida Richardson, and Jason Schultz, 2018, “AI Now Report 2018”, New York: New York University. [Whittaker et al. 2018 available online]
  • Whittlestone, Jess, Rune Nyrup, Anna Alexandrova, Kanta Dihal, and Stephen Cave, 2019, “Ethical and Societal Implications of Algorithms, Data, and Artificial Intelligence: A Roadmap for Research”, February, London: Nuffield Foundation, 1–59.
  • Wilkinson, Hayden, 2022, “In Defense of Fanaticism”, Ethics, 132(2): 445–477. doi:10.1086/716869
  • Williams, James, 2018, Stand Out of Our Light: Freedom and Resistance in the Attention Economy, Cambridge: Cambridge University Press.
  • Woollard, Fiona and Frances Howard-Snyder, 2021, “Doing vs. Allowing Harm”, in Stanford Encyclopedia of Philosophy (Fall 2021 Edition), Edward N Zalta (ed.), URL = <https://plato.stanford.edu/archives/fall2021/entries/doing-allowing/>
  • Woolley, Sam and Phil Howard, 2017, Computational Propaganda: Political Parties, Politicians, and Political Manipulation on Social Media, Oxford: Oxford University Press.
  • World Economic Forum, 2025, “The Future of Jobs Report 2025”, 7 January 2025. [World Economic Forum 2025 available online]
  • Wynsberghe, Aimee van, 2016, Healthcare Robots: Ethics, Design and Implementation, London: Routledge.
  • –––, 2021, “Sustainable AI: AI for Sustainability and the Sustainability of AI”, AI and Ethics, 1(3): 213–218.
  • Yampolskiy, Roman V, 2018, Artificial Intelligence Safety and Security, London: Chapman and Hall/CRC. doi:10.1201/9781351251389
  • –––, 2022, “AI Risk Skepticism”, in Philosophy and Theory of Artificial Intelligence 2021, Vincent C. Müller (ed.), Cham: Springer International Publishing, 225–248.
  • Yeung, Karen and Martin Lodge, 2019, Algorithmic Regulation, Oxford: Oxford University Press.
  • Yudkowsky, Eliezer, and Nate Soares, 2025, If Anyone Builds It, Everyone Dies, New York: Little, Brown and Company.
  • Zafar, Mandy, 2024, “Normativity and AI Moral Agency”, AI and Ethics, September. doi:10.1007/s43681-024-00566-8
  • Zayed, Yago and Philip Loft, 2019, “Agriculture: Historical Statistics”, House of Commons Briefing Paper, 3339(25 June 2019): 1–19.
  • Zednik, Carlos, 2021, “Solving the Black Box Problem: A Normative Framework for Explainable Artificial Intelligence”, Philosophy & Technology, 34(2): 265–288. doi:10.1007/s13347-019-00382-7
  • Zerilli, John, 2022, “Explaining Machine Learning Decisions”, Philosophy of Science, 89(1): 1–19. doi:10.1017/psa.2021.13
  • Zhi-Xuan, Tan, Micah Carroll, Matija Franklin, and Hal Ashton, 2024, “Beyond Preferences in AI Alignment”, Philosophical Studies, 182: 1813–1863. doi:10.1007/s11098-024-02249-w
  • Zuboff, Shoshana, 2019, The Age of Surveillance Capitalism: The Fight for a Human Future at the New Frontier of Power, New York: Public Affairs. [Zuboff 2019 available online]

Other Internet Resources

Research Organizations

Conferences

Policy Documents

Other Relevant Pages

Acknowledgments

Colleagues worldwide did all the work that is reported here. The members of my Centre for Philosophy and AI Research (PAIR) at Erlangen have been, and are, an inspiration. I am particularly grateful to Leonard Dung, Max Hellrigel-Holderbaum, Hadeel Naeem, and Ian Robertson for their detailed comments.

I am very grateful for funding from the Alexander von Humboldt Foundation, the German Ministry for Education and Research (BMBF), the European Commission, the Dutch Research Council (NWO) and the Research Council of Norway (NF).

Copyright © 2026 by
Vincent C. Müller <vincent.c.mueller@fau.de>
