메뉴 건너뛰기
.. 내서재 .. 알림
소속 기관/학교 인증
인증하면 논문, 학술자료 등을  무료로 열람할 수 있어요.
한국대학교, 누리자동차, 시립도서관 등 나의 기관을 확인해보세요
(국내 대학 90% 이상 구독 중)
로그인 회원가입 고객센터 ENG
주제분류

추천
검색

논문 기본 정보

자료유형
학위논문
저자정보

김청하 (부산대학교, 부산대학교 대학원)

지도교수
권혁철
발행연도
2021
저작권
부산대학교 논문은 저작권에 의해 보호받습니다.

이용수7

표지
AI에게 요청하기
추천
검색

이 논문의 연구 히스토리 (2)

초록· 키워드

오류제보하기
본 연구에서는 질의응답에 다수 사용되는 한국어의 수사를 포함한 질의응답을 처리하기 위해 대한민국 법령문서를 이용하여 관련 데이터 세트를 구축하였다. 이를 BERT 기반 전이학습을 이용한 언어모형에 훈련하여 한국어 수사를 포함한 문서를 위한 질의응답 시스템을 개발했다. 한국어 수사를 처리하기 위해 한국어의 수사, 수분류사, 분류사, 물음말을 이용하고 일반화를 위해 국제단위계를 이용하여 데이터 세트를 구축했다. 또한, 법령문서의 구조적 특징을 반영한 문서 가공 방식을 제안했다. 데이터 세트의 일반화를 알아보기 위해 한국어 수사를 포함한 문서를 위한 질의응답 데이터 세트와 전자 신문 기사를 이용한 평가 데이터 세트로 실험을 수행했다. 실험 결과는 구축한 시스템이 EM 82.91, F1 90.95로 기존과 비교하여 EM 13.57, F1 9.55만큼 향상된 성능을 보였다. 생성한 데이터 세트 평가를 위해 한국어 대용량 질의응답 데이터 세트인 KorQuAD의 검증 데이터 세트 중 한국어 수사를 포함하는 질의응답을 추출하여 실험을 수행했고, 이 또한 EM 13.12, F1 7.15만큼 향상된 성능을 보였다.

목차

I. 서론 ·································································································································· 1
1. 연구 배경 및 필요성 ············································································································ 1
2. 연구 목적 및 범위 ················································································································ 2
3. 연구 방법 및 절차 ················································································································ 3
II. 관련 연구 ······················································································································· 5
1. 질의응답 데이터 ···················································································································· 5
2. 사전훈련 언어모형 ················································································································ 8
3. 한국어 수사(數詞) 처리 ····································································································· 14
4. 법령문서를 활용한 연구 ···································································································· 17
III. 시스템 및 데이터 세트 구축 ················································································· 20
1. 한국어 수사를 포함한 문서를 위한 질의응답 시스템 구축········································· 20
2. 데이터 세트 구축 ················································································································ 23
3. 기계 학습 및 미세조정 ······································································································ 31
4. 평가 방법 ······························································································································ 34
5. 실험 결과 및 검증 ·············································································································· 37
IV. 결론 ····························································································································· 41
참고문헌 ···························································································································· 43

최근 본 자료

전체보기

댓글(0)

0