지원사업
학술연구/단체지원/교육 등 연구자 활동을 지속하도록 DBpia가 지원하고 있어요.
커뮤니티
연구자들이 자신의 연구와 전문성을 널리 알리고, 새로운 협력의 기회를 만들 수 있는 네트워킹 공간이에요.
이용수7
Ⅰ. 서 론 ········································································································· 11. 연구의 필요성 및 목적 ·············································································· 12. 연구 문제 ······································································································ 2Ⅱ. 이론적 배경 ·························································································· 41. 님게임의 필승전략 ······················································································ 41) 공정한 조합 게임 ··················································································· 42) 님게임의 정의 ························································································· 53) 님게임의 필승전략 ················································································· 52. 강화학습 ········································································································ 61) 강화학습의 정의와 종류 ······································································· 62) DQN(Deep Q-Network) 알고리즘 ···················································· 83) REINFORCE(Monte-Carlo policy gradient) 알고리즘 ··············· 94) Actor-Critic 알고리즘 ······································································· 11Ⅲ. 연구 방법 및 절차 ········································································ 121. 시뮬레이션 환경 및 구성 ······································································· 122. 강화학습 에이전트와 필승전략 에이전트 대결 ································· 123. 강화학습 에이전트끼리 대결 ································································· 14-i-- ii -Ⅳ. 연구 결과 분석 ················································································ 151. 강화학습 알고리즘 미적용 무작위 ( 선택 에이전트) ·························· 152. DQN 에이전트 ·························································································· 173. REINFORCE 에이전트 ············································································ 224. Actor-Critic 에이전트 ············································································ 255. DQN 에이전트와 DQN 에이전트 ························································· 266. DQN 에이전트와 Actor-Critic 에이전트 ··········································· 28Ⅴ. 결론 및 제언 ····················································································· 30참 고 문 헌 ······································································································· 32
0