메뉴 건너뛰기
.. 내서재 .. 알림
소속 기관/학교 인증
인증하면 논문, 학술자료 등을  무료로 열람할 수 있어요.
한국대학교, 누리자동차, 시립도서관 등 나의 기관을 확인해보세요
(국내 대학 90% 이상 구독 중)
로그인 회원가입 고객센터 ENG
주제분류

추천
검색

논문 기본 정보

자료유형
학위논문
저자정보

여현규 (서울과학기술대학교, 서울과학기술대학교 대학원)

지도교수
홍정식
발행연도
2018
저작권
서울과학기술대학교 논문은 저작권에 의해 보호받습니다.

이용수1

표지
AI에게 요청하기
추천
검색

이 논문의 연구 히스토리 (2)

초록· 키워드

오류제보하기
의사결정나무는 입력변수의 차원으로 이루어진 데이터 공간을 재귀적으로 분할하는 과정에서 하나의 규칙을 생성하며, 각각의 규칙들은 계층적으로 표현된다. 계층적으로 표현된 규칙들은 분석가에게 목표변수와의 관계를 파악할 수 있는 정보를 제공한다. 그러나 이러한 구조의 규칙은 단순할 때는 직관적인 정보를 제공하지만, 복잡해지면 그것을 이해하는 것이 매우 어렵다. 또한 각각의 규칙에 해당하는 샘플의 수가 너무 적거나, 동질성이 낮다면 이 또한 분석가에게 충분한 정보를 제공한다고 보기 어렵다. 이에 본 논문에서는 해석력을 높이기 위한 새로운 분기 기준을 제안한다. 제안한 분기 기준에 대해 특성을 분석하고, 검증을 위해 성능 측면과 해석력 측면에서 기존 모델과의 비교를 수행한다. 기존 모델과의 비교에는 총 21가지 데이터 셋이 사용되었으며 성능의 경우 정확도(Accuracy), 민감도(Sensitivity, Recall), 정밀도(Precision), F1- measure, AUC(Area Under the Curve)를 성능 측정 지표로 활용하였다. 성능 측정 결과 AUC(macro)를 기준으로 제안한 모델은 11개의 사례에서 높았고, CART(Classification And Regression Trees)는 2개의 사례에서 높았다.(유의수준 1%) 해석력의 경우 성능 측정 과정에서 생성된 모델 중 구조적으로 다른 몇 가지 예시를 확인하고, 수치적인 비교를 수행하였다. 생성된 예시를 통해 제안한 모델이 CART 보다 동질성 높은 규칙을 생성하면서 One-sided tree 보다 많은 샘플을 포함하는 것을 확인하였다. 해석력에 대한 수치는 깊이가 얕은 단순한 규칙에 포함 되는 샘플의 비율과 목표변수 범주의 최대 확률 값을 통해 측정하였다. 그 결과 깊이가 3 이하에서 샘플 비율과 동질성이 모두 높은 사례는 제안한 모델이 6개, CART가 1개로 제안한 모델이 5개 더 많았으며, 깊이가 4이하인 경우에서도 제안한 모델이 4개, CART가 1개로 제안한 모델이 높은 사례가 더 많았다.(유의수준 1%) 비교 실험을 통해 우리가 제안한 분기 기준이 성능을 더 높이면서도 해석력 좋은 규칙을 유도하는 것을 확인하였다.

목차

1. 서 론 ·····························································································································1
2. 관련연구 ·······················································································································3
2.1. 분기 기준 및 알고리즘에 대한 기존 연구 ························································3
2.2. 해석력에 대한 기존 연구 ······················································································5
3. 새로운 분기 기준 ·······································································································6
3.1. One-sided tree ·········································································································6
3.2. 새로운 분기 기준 ····································································································7
3.3. 새로운 분기 기준의 특성 ··················································································10
4. 실 험 ···························································································································18
4.1. 실험 데이터 ············································································································18
4.2. 해석력 ······················································································································19
4.3. 성능 ··························································································································25
5. 결 론 ···························································································································30
참고문헌 ··························································································································32
부 록 ································································································································36
영문초록(Abstract) ·········································································································39

최근 본 자료

전체보기

댓글(0)

0