The young field of AI Safety is still in the process of identifying its challenges and limitations. In this paper, we formally describe one such impossibility result, namely Unpredictability of AI. We prove that it is impossible to precisely and consistently predict what specific actions a smarter-than-human intelligent system will take to achieve its objectives, even if we know the terminal goals of the system. We conclude by discussing the impact of Unpredictability on AI Safety.
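One way to see the shape of such an argument is a proof by contradiction over an intelligence measure; the notation below (a capability score $I(\cdot)$ and a decision set $\mathrm{Dec}(\cdot,\cdot)$) is an illustrative assumption, not the paper's own formalism:

% Sketch under assumed notation: I(A) is agent A's capability, Dec(A, g) the set of
% decisions A makes in pursuit of terminal goal g, and H a human-level predictor.
\[
I(S) > I(H) \;\Longrightarrow\; \exists\, d \in \mathrm{Dec}(S, g) \ \text{such that } H \text{ cannot derive } d \text{ from } g \text{ alone},
\]
since if $H$ could reproduce every element of $\mathrm{Dec}(S, g)$, $H$ would match $S$'s performance on that task, contradicting $I(S) > I(H)$.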
Explainability and comprehensibility of AI are important requirements for intelligent systems deployed in real-world domains. Users want, and frequently need, to understand how decisions that affect them are made. Likewise, understanding how an intelligent system functions is important for safety and security reasons. In this paper, we describe two complementary impossibility results (Unexplainability and Incomprehensibility), showing that advanced AIs cannot accurately explain some of their decisions and that, for the decisions they can explain, people will not understand some of those explanations.
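A rough way to see how the two results complement each other is via a complexity bound; the measure $C(\cdot)$ and the threshold $C_{\mathrm{human}}$ below are assumptions made for illustration rather than the paper's definitions:

% Sketch under assumed notation: C(x) is a complexity measure, E an explanation of a
% decision made by model M, and C_human the most complex explanation a person can follow.
\[
\text{Unexplainability:}\quad \mathrm{faithful}(E, M) \;\Longrightarrow\; C(E) \text{ is on the order of } C(M),
\]
\[
\text{Incomprehensibility:}\quad C(E) > C_{\mathrm{human}} \;\Longrightarrow\; E \text{ cannot be fully understood by the user},
\]
so under these assumptions an explanation is either lossy or, if faithful, too complex to comprehend.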
AI Safety researchers attempting to align the values of highly capable intelligent systems with those of humanity face a number of challenges, including personal value extraction, multi-agent value merger, and in-silico encoding. State-of-the-art research in value alignment shows difficulties at every stage of this process, with the merger of incompatible preferences being a particularly difficult challenge to overcome. In this paper, we assume that the value extraction problem will be solved and propose a possible way to implement an AI solution that optimally aligns with the individual preferences of each user. We conclude by analyzing the benefits and limitations of the proposed approach.