synonym 8

2-3-06. Most Fields

Full-text search is a battle between recall—returning all the documents that are relevant—and precision—not returning irrelevant documents. The goal is to present the user with the most relevant documents on the first page of results.full-text 검색은 recall(적합한 document 모두를 반환하는)과 정확성(precision)(부적합한 document를 반환하지 않는)의 전쟁이다. 결과의 첫 번째 page에 가장 적합한 document를 사용자에게 제시하는 것이 목표이다.To improve recall, we ..

3-6. Synonyms

While stemming helps to broaden the scope of search by simplifying inflected words to their root form, synonyms broaden the scope by relating concepts and ideas. Perhaps no documents match a query for "English queen", but documents that contain "British monarch" would probably be considered a good match.형태소 분석은 굴절된 단어를 원형으로 단순화하여, 검색의 범위를 확장하는데 도움이 되는 반면에, 동의어는, 개념과 뜻을 관련시켜, 범위를 확대한다. "English q..

3-6-1. Using Synonyms

Synonyms can replace existing tokens or be added to the token stream by using the synonym token filter:동의어는 기존의 token을 대체하거나, synonym token filter를 사용하여 token stream에 추가될 수 있다.PUT /my_index { "settings": { "analysis": { "filter": { "my_synonym_filter": { "type": "synonym", "synonyms": [ "british,english", "queen,monarch" ] } }, "analyzer": { "my_synonyms": { "tokenizer": "standard", "filter": [ ..

3-6-2. Formatting Synonyms

In their simplest form, synonyms are listed as comma-separated values:가장 단순한 형태로, 동의어는, 다음과 같이, comma로 구분된 값으로 나열된다."jump,leap,hop"If any of these terms is encountered, it is replaced by all of the listed synonyms. For instance:이들 단어 중 하나를 만나면, 그것은 나열된 동의어 모두로 대체된다. 예를 들자면,Original terms: Replaced by: ──────────────────────────────── jump → (jump,leap,hop) leap → (jump,leap,hop) hop → (jump,leap..

3-6-3. Expand or contract

In Formatting Synonyms, we have seen that it is possible to replace synonyms by simple expansion, simple contraction, or generic expansion. We will look at the trade-offs of each of these techniques in this section.Formatting Synonyms에서, 단순한 확장(simple expansion), 단순한 축소(simple contraction) 또는 장르 확장(generic expansion) 을 통해, 동의어를 대체할 수 있음을 알게 됐다. 아래에서 각 기술의 장단점을 살펴보도록 하자.This section deals with si..

3-6-4. Synonyms and The Analysis Chain

The example we showed in Formatting Synonyms, used u s a as a synonym. Why did we use that instead of U.S.A.? The reason is that the synonym token filter sees only the terms that the previous token filter or tokenizer has emitted.Formatting Synonyms에서 보여준 예제에서, 동의어로서 u s a 를 사용했다. 왜 U.S.A. 대신 저것을 사용했을까? 그 이유는 synonym token filter 만이 직전의 token filter나 tokenizer가 출력한 단어를 알기 때문이다.Imagine that we ha..

3-6-5. Multiword Synonyms and Phrase Queries

So far, synonyms appear to be quite straightforward. Unfortunately, this is where things start to go wrong. For phrase queries to function correctly, Elasticsearch needs to know the position that each term occupies in the original text. Multiword synonyms can play havoc with term positions, especially when the injected synonyms are of differing lengths.지금까지는, 동의어가 매우 간단한 것으로 보인다. 유감스럽게도, 이것이 문제의..

3-6-6. Symbol Synonyms

The final part of this chapter is devoted to symbol synonyms, which are unlike the synonyms we have discussed until now. Symbol synonyms are string aliases used to represent symbols that would otherwise be removed during tokenization.이 장의 마지막은, 지금까지 이야기했던 동의어와는 다른, symbol synonym이다. 상징의 동의어(Symbol Synonyms) 는 token을 만드는 도중에 제거되는, 상징을 표시하는데 사용되는 문자열로 이루어진 별칭이다.While most punctuation is seldom imp..