1 / 13

Safety of AI and robots

Safety of AI and robots. Roland Pihlakas Institute of Technology in University of Tartu 26. April 2008. You may do it like. Good states - goals Bad states - don’t do-s. In contrast, proposed approach is kind of orthogonal.

donnyg
Download Presentation

Safety of AI and robots

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Safety of AI and robots Roland Pihlakas Institute of Technology in University of Tartu 26. April 2008

  2. You may do it like • Good states - goals • Bad states - don’t do-s

  3. In contrast, proposed approach is kind of orthogonal • Implicit: avoid any irreversibilities (does not need pre-specified bad states) • Explicit: goals (good states)

  4. A small addition... • Implicit: avoid any irreversibilities (no pre-specified bad states) • Explicit: rights (still no pre-specified states; instead - sorts of states) • Explicit: goals (good states)

  5. In short • 1. Rights • 2. Goals

  6. An analogy: the three laws of robotics • 1. Don’t do harm. => Avoid irreversibilities • 2. Do what you are ordered to do. • a. Goals • b. May include explicitly stated bad states • 3. Be optimal, be tidy, clean up reversible temporary changes. Recommendations.

  7. “passive” safety - avoid only own mistakes

  8. Goals value of indicator current measured preferred state planned action

  9. Reversibilities first action value of indicator current measured first measured state second action

  10. IR-reversibilities No-no action: don’t do this - don’t cause irreversible states value of indicator state to avoid first measured state cannot achieve - cannot reverse the first action

  11. Permissions Not reversible, but okay action value of indicator state,which we dont need toavoid first measured state cannot achieve - cannot reverse the first action

  12. Implementation • Logical programming; planning is using theorem provers • Statistical / machine learning; control system

  13. A safe AI should not increase entropy, except only temporarily.

More Related