메뉴 건너뛰기
.. 내서재 .. 알림
소속 기관/학교 인증
인증하면 논문, 학술자료 등을  무료로 열람할 수 있어요.
한국대학교, 누리자동차, 시립도서관 등 나의 기관을 확인해보세요
(국내 대학 90% 이상 구독 중)
로그인 회원가입 고객센터 ENG
주제분류

추천
검색

논문 기본 정보

자료유형
학위논문
저자정보

이호민 (동의대학교, 동의대학교 대학원)

지도교수
옥승호
발행연도
2021
저작권
동의대학교 논문은 저작권에 의해 보호받습니다.

이용수68

표지
AI에게 요청하기
추천
검색

이 논문의 연구 히스토리 (3)

초록· 키워드

오류제보하기
Convolutional neural network(CNN)가 인간보다 높은 추론 및 인식 정확도를 가지는 연구 결과에 따라 많은 분야에서 응용 연구가 이루어지고 있다. 특히 CNN을 솔루션으로 이용하여 모바일 디바이스에 적용하고자 하는 연구가 활발하다. 그러나 CNN의 높은 정확도를 유지하기 위해서는 깊은 신경망 계층에 따른 가중치로 인해 모바일의 CPU만으로는 한계가 뚜렷하다. 이를 해결하기 위해 병렬 연산에 강한 GPU를 사용하여 CNN의 연산 처리 성능을 높이는 등의 연구들이 이루어지고 있으나 어플리케이션을 범용적으로 사용하기 위한 하드웨어로써 엣지 디바이스 및 임베디드 시스템에의 CNN 연산이 최적화되지 않는 문제를 가진다.
CNN을 엣지 디바이스 및 임베디드 시스템에 적용하기 위해 높은 에너지 효율을 가지는 FPGA 기반 CNN 가속기 연구가 진행되고 있으며 주로 PE 단위 systolic array 기반으로 구현된다. 그러나 이러한 가속기의 경우 합성곱 계층만을 가속하는 것에 초점을 두고 있으며 전용 컴파일러 및 스케줄러를 사용함으로써 개발에 오랜 기간과 난이도를 요구한다. 이에 따라 개발 기간을 최소화할 수 있으며 CNN의 모든 계층을 하드웨어로 구현할 수 있는 구조가 필요하다.
본 논문에서는 명령어 집합을 통해 CNN을 재설계할 수 있는 모듈 기반 CNN 가속기 구조를 제안한다. CNN 프레임워크 및 C언어 기반 검증 프로그램 모델링을 통해 각 연산 모듈의 디버깅 및 검증을 수행하였으며 Xilinx 사의 SoC FPGA 보드 ZC706을 사용하여 ResNet-18을 구현하였다. 명령어 집합을 이용한 모듈 기반 CNN 가속기는 가변적인 명령어를 사용함으로써 CNN 구조를 재설계할 수 있음을 보였다.

목차

I. 서 론 ························································································································· 1
II. 임베디드 시스템을 위한 CNN ············································································· 3
1. CNN 개요 ············································································································ 3
2. FPGA 기반 CNN 가속기 구조 연구 ······························································· 6
1) Systolic array를 이용한 합성곱 연산기 ··················································· 7
2) CNN 연산 처리 성능 향상 ··········································································· 8
3) 데이터 압축 기법 ························································································· 9
III. 제안하는 FPGA 기반 CNN 가속기 구조 ······················································· 10
1. 모듈 기반 CNN 가속기 구조 ········································································· 10
2. CNN 연산 모듈 동작을 위한 명령어 구성 ················································· 18
3. CNN 연산 모듈의 Data precision 최적화 ················································· 20
1) Fixed-point를 이용한 소수점 연산 최적화 ············································ 20
2) 양자화 기법을 이용한 합성곱 연산 최적화 ············································ 21
IV. 실험 및 구현 ······································································································· 24
1. 실험 환경 구축 및 검증 프로그램 모델링 ·················································· 24
2. FPGA 구현 및 검증 ························································································ 26
V. 결론 ························································································································ 31
VI. 참고 문헌 ············································································································· 32

최근 본 자료

전체보기

댓글(0)

0