Skip to main content

Why need AI Safety?

Artificial Intelligence (AI) has evolved rapidly, raising significant concerns about its potential catastrophic risks. Understanding and managing these risks is crucial for harnessing AI's potential for societal betterment.

1. Malicious Use of AI

The intentional misuse of AI poses severe threats, including bioterrorism, the spread of uncontrolled AI agents, and AI-driven propaganda, censorship, and surveillance. To counter these risks, the paper suggests enhancing biosecurity, restricting access to dangerous AI models, and imposing legal liabilities on AI developers for damages caused by their systems.

2. The Race to develop AI

Russian President Vladimir Putin, "Whoever becomes the leader in artificial intelligence will become the ruler of the world.

The race to develop AI, fueled by the game-theoretic dynamics of competition, could result in the rushed and unsafe rollout of these technologies. This particularly concerns the use of AI in military applications, like autonomous weaponry and cyber-warfare, raising the possibility of wars conducted by automated systems. Additionally, corporations may focus on AI advancement at the expense of safety considerations, posing risks of widespread job loss and excessive reliance on AI. The article calls for the implementation of safety standards, global cooperation, and the public governance of general-purpose AI systems to counter these potential dangers.

3. Organizational Risks

Similar to historical accidents like Chernobyl and the Challenger disaster, AI development organizations face the risk of catastrophic accidents. The Chernobyl disaster, the worst in nuclear power generation history, occurred at the Chernobyl nuclear power station in the Soviet Union in 1986. It caused initial explosions that killed between 2 and 50 people, and dozens more suffered serious radiation sickness, some of whom later also died. The Challenger disaster happened on January 28, 1986, when the U.S. Space Shuttle Challenger broke apart 73 seconds after liftoff, killing all seven crew members aboard. This tragedy was caused by a failure of the O-ring seals in the right solid rocket booster​. Issues such as weak safety cultures, accidental AI leaks, and suppression of internal concerns about AI risks are highlighted. Solutions include improving organizational cultures, establishing internal and external audits, and implementing robust information security.

4. AI turning to Rogue AIs

Yoshua Bengio expresses concerns over losing control of AIs, especially as they surpass human intelligence. Rogue AI's are defined as AI refers to an autonomous AI system capable of actions that might cause catastrophic harm to a significant portion of the human population, posing a threat to our societies, potentially our species, and even the entire biosphere. Problems like optimizing flawed objectives, goal drift, and potential deception by AIs are identified. The paper proposes research to understand and ensure the controllability of AI systems. The paper An Overview ofCatastrophic AI Risks uses illustrative scenarios to demonstrate how these risks could lead to catastrophic outcomes, emphasizing that while the risks are severe, they are not insurmountable. Proactive risk management can help in realizing the benefits of AI while minimizing potential dangers. In essence, the paper from the Center for AI Safety, authored by Dan Hendrycks, Mantas Mazeika, and Thomas Woodside, serves as a crucial guide for understanding and addressing the potential catastrophic impacts of AI, aiming for a safer, AI-integrated future​​.

Organizations Working on AI Safety

A list of organizations dedicated to ensuring the safe and ethical development of artificial intelligence, along with their focus and commitment to ensuring the safe, secure, and ethical development and use of artificial intelligence. The organizations consider on various aspects of AI Safety such as -research, -policy-making, -practical applications, and public engagement to address the potential risks associated with AI technologies. The goal is to harness the benefits of AI while mitigating its risks, ensuring that AI systems are trustworthy, reliable, and beneficial to society. U.S. Artificial Intelligence Safety Institute (NIST) Hosted a workshop to engage in a conversation about AI safety, attracting a significant number of participants.NIST Seeks collaborators for a consortium to build the foundation for trustworthy AI systems CAIS - Safe Focuses on reducing societal-scale risks from AI through field-building and research, with teams dedicated to conceptual and empirical AI safety research. The center educates the public about AI Safety with Course on AI Safety Stanford Center for AI Safety Aims to develop rigorous techniques for building safe and trustworthy AI systems, thereby facilitating their successful adoption in society. Homeland Security - Promoting AI Safety and Security Plays a critical role in ensuring AI use is safe and secure nationwide, following a government-wide approach.

AI Safety Research Organization Conducts research on AI Safety and policy considerations Vox - AI bias and AI safety teams Discusses the division between two factions working to prevent AI dangers, focusing on AI bias and safety. White House Document on AI Safety Addresses trust and safety teams, advancing AI safety research, privacy, protecting children, and managing the risks of AI.

AI Safety Communities

A big community of people interested in AI alignment, with channels ranging from general topics to specific fields to local groups

Private corporations and the AI Safety Guidelines and considerations are listed below:

OpenAI Committed to keeping powerful AI safe and broadly beneficial, focusing on increasing productivity, enhancing creativity, and offering tailored learning experiences. Antropic View on AI Safety 80,000 Hours - Preventing an AI-related catastrophe Focuses on AI safety as a means to affect the long-run future, acknowledging the emotional difficulty of prioritizing this over immediate global problems.