<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="ko">
	<id>http://samediff.kr/wiki/index.php?action=history&amp;feed=atom&amp;title=0401_reinforcement_learning%EC%9D%98_%EC%95%8C%EA%B3%A0%EB%A6%AC%EC%A6%98_%EB%B3%84_%EB%88%84%EC%A0%81_reward</id>
	<title>0401 reinforcement learning의 알고리즘 별 누적 reward - 편집 역사</title>
	<link rel="self" type="application/atom+xml" href="http://samediff.kr/wiki/index.php?action=history&amp;feed=atom&amp;title=0401_reinforcement_learning%EC%9D%98_%EC%95%8C%EA%B3%A0%EB%A6%AC%EC%A6%98_%EB%B3%84_%EB%88%84%EC%A0%81_reward"/>
	<link rel="alternate" type="text/html" href="http://samediff.kr/wiki/index.php?title=0401_reinforcement_learning%EC%9D%98_%EC%95%8C%EA%B3%A0%EB%A6%AC%EC%A6%98_%EB%B3%84_%EB%88%84%EC%A0%81_reward&amp;action=history"/>
	<updated>2026-04-27T21:14:39Z</updated>
	<subtitle>이 문서의 편집 역사</subtitle>
	<generator>MediaWiki 1.34.0</generator>
	<entry>
		<id>http://samediff.kr/wiki/index.php?title=0401_reinforcement_learning%EC%9D%98_%EC%95%8C%EA%B3%A0%EB%A6%AC%EC%A6%98_%EB%B3%84_%EB%88%84%EC%A0%81_reward&amp;diff=13456&amp;oldid=prev</id>
		<title>Admin: Admin님이 0401 문서를 넘겨주기를 만들지 않고 0401 reinforcement learning의 알고리즘 별 누적 reward 문서로 이동했습니다</title>
		<link rel="alternate" type="text/html" href="http://samediff.kr/wiki/index.php?title=0401_reinforcement_learning%EC%9D%98_%EC%95%8C%EA%B3%A0%EB%A6%AC%EC%A6%98_%EB%B3%84_%EB%88%84%EC%A0%81_reward&amp;diff=13456&amp;oldid=prev"/>
		<updated>2017-08-02T08:28:12Z</updated>

		<summary type="html">&lt;p&gt;Admin님이 &lt;a href=&quot;/wiki/index.php?title=0401&amp;amp;action=edit&amp;amp;redlink=1&quot; class=&quot;new&quot; title=&quot;0401 (없는 문서)&quot;&gt;0401&lt;/a&gt; 문서를 넘겨주기를 만들지 않고 &lt;a href=&quot;/wiki/index.php/0401_reinforcement_learning%EC%9D%98_%EC%95%8C%EA%B3%A0%EB%A6%AC%EC%A6%98_%EB%B3%84_%EB%88%84%EC%A0%81_reward&quot; title=&quot;0401 reinforcement learning의 알고리즘 별 누적 reward&quot;&gt;0401 reinforcement learning의 알고리즘 별 누적 reward&lt;/a&gt; 문서로 이동했습니다&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;ko&quot;&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← 이전 판&lt;/td&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;2017년 8월 2일 (수) 08:28 판&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-notice&quot; lang=&quot;ko&quot;&gt;&lt;div class=&quot;mw-diff-empty&quot;&gt;(차이 없음)&lt;/div&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;</summary>
		<author><name>Admin</name></author>
		
	</entry>
	<entry>
		<id>http://samediff.kr/wiki/index.php?title=0401_reinforcement_learning%EC%9D%98_%EC%95%8C%EA%B3%A0%EB%A6%AC%EC%A6%98_%EB%B3%84_%EB%88%84%EC%A0%81_reward&amp;diff=13431&amp;oldid=prev</id>
		<title>Admin: 새 문서: file:rlreward.png  [http://artint.info/html/ArtInt_267.html Evaluating Reinforcement Learning Algorithms]   reinforcement learning의 알고리즘 별 누적 reward graph.&lt;br /&gt;...</title>
		<link rel="alternate" type="text/html" href="http://samediff.kr/wiki/index.php?title=0401_reinforcement_learning%EC%9D%98_%EC%95%8C%EA%B3%A0%EB%A6%AC%EC%A6%98_%EB%B3%84_%EB%88%84%EC%A0%81_reward&amp;diff=13431&amp;oldid=prev"/>
		<updated>2017-08-02T07:53:48Z</updated>

		<summary type="html">&lt;p&gt;새 문서: &lt;a href=&quot;/wiki/index.php/%ED%8C%8C%EC%9D%BC:Rlreward.png&quot; title=&quot;파일:Rlreward.png&quot;&gt;file:rlreward.png&lt;/a&gt;  [http://artint.info/html/ArtInt_267.html Evaluating Reinforcement Learning Algorithms]   reinforcement learning의 알고리즘 별 누적 reward graph.&amp;lt;br /&amp;gt;...&lt;/p&gt;
&lt;p&gt;&lt;b&gt;새 문서&lt;/b&gt;&lt;/p&gt;&lt;div&gt;[[file:rlreward.png]]&lt;br /&gt;
&lt;br /&gt;
[http://artint.info/html/ArtInt_267.html Evaluating Reinforcement Learning Algorithms]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
reinforcement learning의 알고리즘 별 누적 reward graph.&amp;lt;br /&amp;gt;&lt;br /&gt;
마음에 안드는 점은 만약 주식거래 프로그램을 만들었을 때 수익곡선이 저렇다면 초반의 손해를 계속 감수해야 한다는 뜻이다. 그것도 물론 주식시장이 항상 같은 방식으로 반응해준다는 가정 하에 말이다. 대략 난감하다.&lt;/div&gt;</summary>
		<author><name>Admin</name></author>
		
	</entry>
</feed>