<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Empathy in the machine</title>
	<atom:link href="http://ferrouswheel.me/2010/03/empathy-in-the-machine/feed/" rel="self" type="application/rss+xml" />
	<link>http://ferrouswheel.me/2010/03/empathy-in-the-machine/</link>
	<description>watching the world turn.</description>
	<lastBuildDate>Sat, 10 Dec 2011 02:46:55 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Joel</title>
		<link>http://ferrouswheel.me/2010/03/empathy-in-the-machine/comment-page-1/#comment-2482</link>
		<dc:creator>Joel</dc:creator>
		<pubDate>Mon, 22 Mar 2010 23:41:12 +0000</pubDate>
		<guid isPermaLink="false">http://ferrouswheel.me/?p=317#comment-2482</guid>
		<description>@Vladimir I can agree with what you&#039;ve mentioned. In particular reference to the system having a &quot;preference&quot; for friendly behaviour. As I&#039;ve understood it in the past, there seemed to be a section of the friendly AI community that were focused on &lt;i&gt;guaranteeing&lt;/i&gt; friendliness.

I guess it depends if friendliness is to be judged by actions or intention. But as the saying goes, the road to hell is paved with good intention.

@James I think that&#039;s a similar but related idea which I wouldn&#039;t mind learning more about. The representation of other people&#039;s consciousness is more of a two way thing and at a slightly more direct level than my impression of Jung&#039;s &#039;collective unconscious&#039; - perhaps we should call it our &#039;collective consciousness&#039;? ;-)</description>
		<content:encoded><![CDATA[<p>@Vladimir I can agree with what you&#8217;ve mentioned. In particular reference to the system having a &#8220;preference&#8221; for friendly behaviour. As I&#8217;ve understood it in the past, there seemed to be a section of the friendly AI community that were focused on <i>guaranteeing</i> friendliness.</p>
<p>I guess it depends if friendliness is to be judged by actions or intention. But as the saying goes, the road to hell is paved with good intention.</p>
<p>@James I think that&#8217;s a similar but related idea which I wouldn&#8217;t mind learning more about. The representation of other people&#8217;s consciousness is more of a two way thing and at a slightly more direct level than my impression of Jung&#8217;s &#8216;collective unconscious&#8217; &#8211; perhaps we should call it our &#8216;collective consciousness&#8217;? <img src='http://ferrouswheel.me/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James MM</title>
		<link>http://ferrouswheel.me/2010/03/empathy-in-the-machine/comment-page-1/#comment-2480</link>
		<dc:creator>James MM</dc:creator>
		<pubDate>Mon, 22 Mar 2010 12:48:46 +0000</pubDate>
		<guid isPermaLink="false">http://ferrouswheel.me/?p=317#comment-2480</guid>
		<description>&quot;Hofstadter... says that, due to us modelling of the behaviour of others, we also start representing their consciousness too. The eventual conclusion, which is explained in much greater and philosophical detail in his book, is that our “consciousness” isn’t just the sum of what’s in our head but is a holistic total of ourselves and everyone’s representation of us in their heads.&quot;

This sounds to me very much like the &#039;collective unconscious&#039; that Jung talks about—the cultural paradigm that we use to filter our ideas and actions through in daily life. We need to empathise in order to be successful and to attain happiness in our lives, and we use this paradigm as a tool of interpretation. If we hold in our minds an accurate image of the collective unconscious, then we interact in more meaningful and beneficial ways with others, and thus become more successful and achieve more happiness.

Seems only fair that any AI entity should have the same goals, using empathy as a tool to that end.</description>
		<content:encoded><![CDATA[<p>&#8220;Hofstadter&#8230; says that, due to us modelling of the behaviour of others, we also start representing their consciousness too. The eventual conclusion, which is explained in much greater and philosophical detail in his book, is that our “consciousness” isn’t just the sum of what’s in our head but is a holistic total of ourselves and everyone’s representation of us in their heads.&#8221;</p>
<p>This sounds to me very much like the &#8216;collective unconscious&#8217; that Jung talks about—the cultural paradigm that we use to filter our ideas and actions through in daily life. We need to empathise in order to be successful and to attain happiness in our lives, and we use this paradigm as a tool of interpretation. If we hold in our minds an accurate image of the collective unconscious, then we interact in more meaningful and beneficial ways with others, and thus become more successful and achieve more happiness.</p>
<p>Seems only fair that any AI entity should have the same goals, using empathy as a tool to that end.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Vladimir Nesov</title>
		<link>http://ferrouswheel.me/2010/03/empathy-in-the-machine/comment-page-1/#comment-2479</link>
		<dc:creator>Vladimir Nesov</dc:creator>
		<pubDate>Sat, 20 Mar 2010 13:11:53 +0000</pubDate>
		<guid isPermaLink="false">http://ferrouswheel.me/?p=317#comment-2479</guid>
		<description>Identical programs will have identical outputs on identical inputs. Thus, placed in the same situations (which includes the inputs), the identical programs will do the same thing. Note that this is a theoretical relation, you don&#039;t need to have both programs around, or do experiments on them, or have one of the programs examine the other, to know that the relation holds true.

With Friendly AI (FAI), you care about a certain relation between it and (say) a human: in each situation X (incl. all input, etc.), the FAI needs to prefer to do the same action as the human would prefer in the same situation X (on reflection, if they were both capable of understanding X fully, etc.). This is more complicated than same behavior of of two identical programs, since preferring the same thing doesn&#039;t necessarily mean that they&#039;ll actually do the same thing, or do the most preferred thing: when solving a difficult program, your goal might be to find the right solution, but you&#039;ll only be able to find some approximation to a solution, or a candidate solution that you aren&#039;t sure is the right solution. This holds both for us stupid apes, and for superintelligent AIs, given appropriately difficult problems.

This is to reply to two ideas you voiced in this post: first, you don&#039;t need FAI to actually interact or observe humans, to &quot;emphasize&quot;, in order to have a knowable property of being Friendly, at least not during the actual operation (you of course need to confer the info about humans into the FAI at the start, but this is again the info about humans as systems, not magic knowledge of what they&#039;ll actually do, or explicit info about what they will do in all possible situations). Second, uncomputability or chaotic behavior of environment are not game-stoppers, as you only need to have a certain abstract property of FAI as a system, that can then be let to freely develop for any unanticipated circumstance it happens to encounter. Friendliness need to be shown in general, as a property of a system, from which Friendly behavior in the specific situation (which is not knowable in advance) follows as a special case.

You might want to read the &lt;a href=&quot;http://causalityrelay.wordpress.com/sequences/friendly-ai-problem/&quot; rel=&quot;nofollow&quot;&gt;current sequence&lt;/a&gt; I&#039;m writing on my blog, to get a better idea of where I&#039;m coming from.</description>
		<content:encoded><![CDATA[<p>Identical programs will have identical outputs on identical inputs. Thus, placed in the same situations (which includes the inputs), the identical programs will do the same thing. Note that this is a theoretical relation, you don&#8217;t need to have both programs around, or do experiments on them, or have one of the programs examine the other, to know that the relation holds true.</p>
<p>With Friendly AI (FAI), you care about a certain relation between it and (say) a human: in each situation X (incl. all input, etc.), the FAI needs to prefer to do the same action as the human would prefer in the same situation X (on reflection, if they were both capable of understanding X fully, etc.). This is more complicated than same behavior of of two identical programs, since preferring the same thing doesn&#8217;t necessarily mean that they&#8217;ll actually do the same thing, or do the most preferred thing: when solving a difficult program, your goal might be to find the right solution, but you&#8217;ll only be able to find some approximation to a solution, or a candidate solution that you aren&#8217;t sure is the right solution. This holds both for us stupid apes, and for superintelligent AIs, given appropriately difficult problems.</p>
<p>This is to reply to two ideas you voiced in this post: first, you don&#8217;t need FAI to actually interact or observe humans, to &#8220;emphasize&#8221;, in order to have a knowable property of being Friendly, at least not during the actual operation (you of course need to confer the info about humans into the FAI at the start, but this is again the info about humans as systems, not magic knowledge of what they&#8217;ll actually do, or explicit info about what they will do in all possible situations). Second, uncomputability or chaotic behavior of environment are not game-stoppers, as you only need to have a certain abstract property of FAI as a system, that can then be let to freely develop for any unanticipated circumstance it happens to encounter. Friendliness need to be shown in general, as a property of a system, from which Friendly behavior in the specific situation (which is not knowable in advance) follows as a special case.</p>
<p>You might want to read the <a href="http://causalityrelay.wordpress.com/sequences/friendly-ai-problem/" rel="nofollow">current sequence</a> I&#8217;m writing on my blog, to get a better idea of where I&#8217;m coming from.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joel</title>
		<link>http://ferrouswheel.me/2010/03/empathy-in-the-machine/comment-page-1/#comment-2478</link>
		<dc:creator>Joel</dc:creator>
		<pubDate>Fri, 19 Mar 2010 22:46:58 +0000</pubDate>
		<guid isPermaLink="false">http://ferrouswheel.me/?p=317#comment-2478</guid>
		<description>@Vladimir Can you point me to the proof for your statement:

&quot;If you have two identical complicated programs, of which you don’t and can’t know what they compute on what possible inputs, you can still be sure that they’ll compute exactly the same thing on all occasions.&quot;

Unless you mean that they receive exactly the same inputs at the same times, then yes, I agree. But any intelligent agent doesn&#039;t exist in a world with an invariant temporal chain of inputs.

So two identical programs could diverge depending on the relative ordering of inputs. Not necessarily diverge from friendliness, but in terms of eventual knowledge and belief.</description>
		<content:encoded><![CDATA[<p>@Vladimir Can you point me to the proof for your statement:</p>
<p>&#8220;If you have two identical complicated programs, of which you don’t and can’t know what they compute on what possible inputs, you can still be sure that they’ll compute exactly the same thing on all occasions.&#8221;</p>
<p>Unless you mean that they receive exactly the same inputs at the same times, then yes, I agree. But any intelligent agent doesn&#8217;t exist in a world with an invariant temporal chain of inputs.</p>
<p>So two identical programs could diverge depending on the relative ordering of inputs. Not necessarily diverge from friendliness, but in terms of eventual knowledge and belief.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Vladimir Nesov</title>
		<link>http://ferrouswheel.me/2010/03/empathy-in-the-machine/comment-page-1/#comment-2477</link>
		<dc:creator>Vladimir Nesov</dc:creator>
		<pubDate>Fri, 19 Mar 2010 09:36:58 +0000</pubDate>
		<guid isPermaLink="false">http://ferrouswheel.me/?p=317#comment-2477</guid>
		<description>A &quot;proof of Friendliness&quot; doesn&#039;t let you know object-level facts about what specifically the AI will do, it only tells you an abstract fact that whatever the AI will do is preferable from the point of view of humans. If you have two identical complicated programs, of which you don&#039;t and can&#039;t know what they compute on what possible inputs, you can still be sure that they&#039;ll compute exactly the same thing on all occasions. Similarly, a Friendly AI needs to prefer the same course of action as humans would (on reflection), in all specific circumstances. This might be proved even if we have no idea of what that exactly our of Friendly AI&#039;s preference might be.</description>
		<content:encoded><![CDATA[<p>A &#8220;proof of Friendliness&#8221; doesn&#8217;t let you know object-level facts about what specifically the AI will do, it only tells you an abstract fact that whatever the AI will do is preferable from the point of view of humans. If you have two identical complicated programs, of which you don&#8217;t and can&#8217;t know what they compute on what possible inputs, you can still be sure that they&#8217;ll compute exactly the same thing on all occasions. Similarly, a Friendly AI needs to prefer the same course of action as humans would (on reflection), in all specific circumstances. This might be proved even if we have no idea of what that exactly our of Friendly AI&#8217;s preference might be.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Schneider</title>
		<link>http://ferrouswheel.me/2010/03/empathy-in-the-machine/comment-page-1/#comment-2476</link>
		<dc:creator>Schneider</dc:creator>
		<pubDate>Thu, 18 Mar 2010 23:47:27 +0000</pubDate>
		<guid isPermaLink="false">http://ferrouswheel.me/?p=317#comment-2476</guid>
		<description>Quite interesting. But, as many human beings are, couldn&#039;t we build a &quot;depressed empathic machine&quot;, which couldn&#039;t, for example, be able to cause any injury in humans?
I mean, we&#039;re talking about &quot;machine psychology&quot; hehehe, we could build a not-so-healthy one :)
I think it&#039;s quite safe then.</description>
		<content:encoded><![CDATA[<p>Quite interesting. But, as many human beings are, couldn&#8217;t we build a &#8220;depressed empathic machine&#8221;, which couldn&#8217;t, for example, be able to cause any injury in humans?<br />
I mean, we&#8217;re talking about &#8220;machine psychology&#8221; hehehe, we could build a not-so-healthy one <img src='http://ferrouswheel.me/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
I think it&#8217;s quite safe then.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

