ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술대회 An Efficient Text Filter for Adult Web Documents
Cited 13 time in scopus Download 0 time Share share facebook twitter linkedin kakaostory
저자
김영수, 남택용
발행일
200602
출처
International Conference on Advanced Communication Technology (ICACT) 2006, pp.438-440
협약과제
05MK1600, 내용기반 유해정보 방지기술 개발, 장종수
초록
The openness of the Web allows any users to access almost any type of information. However, some information, such as adult content, is not appropriate for all users, notably children. Additionally for adults, some contents included in abnormal pornographic sites can do ordinary people's mental health harm. In this paper, we propose a new criterion and divide contents of web documents into 4 grades. We use a hierarchical way of filtering texts. At first, we filter off 0-grade texts contain no adult contents using a pattern matching algorithm, and classify 1-grade, 2-grade and 3-grade texts using a machine learning algorithm.
KSP 제안 키워드
Machine Learning Algorithms, Ordinary people, Pattern Matching Algorithm, Type of information, Web Documents, mental health