1-11-4. Making Changes Persistent

drscg 2017. 9. 30. 16:38

Without an fsync to flush data from the filesystem cache to disk, we cannot be sure that the data will still be there after a power failure, or even after the application exits normally. For Elasticsearch to be reliable, it needs to ensure that changes are persisted to disk.

In Dynamically Updatable Indices, we said that a full commit flushes segments to disk and writes a commit point, which lists all known segments. Elasticsearch uses this commit point during startup or when reopening an index to decide which segments belong to the current shard.

While we refresh once every second to achieve near real-time search, we still need to do full commits regularly to make sure that we can recover from failure. But what about the document changes that happen between commits? We don't want to lose those either.

To cover that gap, Elasticsearch added a translog, or transaction log, which records every operation in Elasticsearch as it happens. With the translog, the process now looks like this:

When a document is indexed, it is added to the in-memory buffer and appended to the translog, as shown in Figure 21.

Figure 21. New documents are added to the in-memory buffer and appended to the transaction log.

Once every second, the shard is refreshed, which leaves it in the state depicted in Figure 22:
- The docs in the in-memory buffer are written to a new segment, without an fsync.
- The segment is opened to make it visible to search.
- The in-memory buffer is cleared.

Figure 22. After a refresh, the buffer is cleared but the transaction log is not.

This process continues with more documents being added to the in-memory buffer and appended to the transaction log (see Figure 23).

Figure 23. The transaction log keeps accumulating documents.

Every so often, such as when the translog is getting too big, the index is flushed; a new translog is created, and a full commit is performed (see Figure 24):
- Any docs in the in-memory buffer are written to a new segment.
- The buffer is cleared.
- A commit point is written to disk.
- The filesystem cache is flushed with an fsync.
- The old translog is deleted.

The translog provides a persistent record of all operations that have not yet been flushed to disk. When starting up, Elasticsearch uses the last commit point to recover known segments from disk, and then replays all operations in the translog to add the changes that happened after the last commit.
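
The refresh, flush, and recovery steps described above can be sketched as a toy model. This is only an illustration, not how Elasticsearch or Lucene is implemented: the `ToyShard` class and all of its method names are hypothetical, segments are plain Python lists, and the translog is simply assumed to survive a crash.

```python
class ToyShard:
    """Toy model of one shard. All names here are hypothetical."""

    def __init__(self):
        self.buffer = []     # in-memory buffer, lost on a crash
        self.segments = []   # searchable segments, not necessarily fsync'ed
        self.committed = []  # segments listed in the last commit point
        self.translog = []   # operations since the last flush (assumed durable)

    def index(self, doc):
        # A new document goes to the buffer AND is appended to the translog.
        self.buffer.append(doc)
        self.translog.append(doc)

    def refresh(self):
        # Write the buffer to a new segment (no fsync), open it, clear the buffer.
        if self.buffer:
            self.segments.append(list(self.buffer))
        self.buffer.clear()

    def flush(self):
        # Full commit: write remaining docs, record a commit point,
        # then start a new, empty translog.
        self.refresh()
        self.committed = [list(segment) for segment in self.segments]
        self.translog = []

    def searchable_docs(self):
        return [doc for segment in self.segments for doc in segment]

    def crash_and_recover(self):
        # Only committed segments and the translog survive the crash.
        self.segments = [list(segment) for segment in self.committed]
        # Replay the operations recorded after the last commit.
        self.buffer = list(self.translog)
        self.refresh()


shard = ToyShard()
shard.index("a")
shard.index("b")
shard.flush()              # "a" and "b" are now safely committed
shard.index("c")
shard.refresh()            # "c" is searchable but not yet committed
shard.index("d")           # "d" exists only in the buffer and the translog
shard.crash_and_recover()
recovered = shard.searchable_docs()
print(recovered)
```

In this sketch, "c" and "d" come back after the simulated crash only because they were recorded in the translog; the last commit point alone would have restored just "a" and "b".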

The translog is also used to provide real-time CRUD. When you try to retrieve, update, or delete a document by ID, Elasticsearch first checks the translog for any recent changes before trying to retrieve the document from the relevant segment. This means that it always has access to the latest known version of the document, in real time.
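
The lookup order described above (translog first, then segments) can be illustrated with a small sketch. The function name `realtime_get`, the operation dictionaries, and the dict-based segments are all assumptions made for this example; they do not mirror Elasticsearch internals.

```python
def realtime_get(doc_id, translog, segments):
    """Return the latest known version of a document, or None if absent.

    translog: list of operations, oldest first (hypothetical shape).
    segments: list of dicts mapping id -> source, oldest first.
    """
    # 1. Check the translog, newest operation first.
    for op in reversed(translog):
        if op["id"] == doc_id:
            return None if op["type"] == "delete" else op["source"]
    # 2. Fall back to the segments, newest segment first.
    for segment in reversed(segments):
        if doc_id in segment:
            return segment[doc_id]
    return None


segments = [{"1": {"title": "old title"}, "2": {"title": "untouched"}}]
translog = [
    {"type": "index", "id": "1", "source": {"title": "new title"}},
    {"type": "delete", "id": "3"},
]

latest = realtime_get("1", translog, segments)      # found in the translog
untouched = realtime_get("2", translog, segments)   # found in a segment
deleted = realtime_get("3", translog, segments)     # deleted in the translog
print(latest, untouched, deleted)
```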

Figure 24. After a flush, the segments are fully committed and the transaction log is cleared.

flush API

The action of performing a commit and truncating the translog is known in Elasticsearch as a flush. Shards are flushed automatically every 30 minutes, or when the translog becomes too big. See the translog documentation for settings that can be used to control these thresholds.

The flush API can be used to perform a manual flush:

POST /blogs/_flush 

POST /_flush?wait_for_ongoing

	Flush the `blogs` index.
	Flush all indices and wait until all flushes have completed before returning.

You seldom need to issue a manual flush yourself; usually, automatic flushing is all that is required.

That said, it is beneficial to flush your indices before restarting a node or closing an index. When Elasticsearch tries to recover or reopen an index, it has to replay all of the operations in the translog, so the shorter the log, the faster the recovery.

How Safe Is the Translog?

The purpose of the translog is to ensure that operations are not lost. This raises the question: how safe is the translog?

Writes to a file will not survive a reboot until the file has been fsync'ed to disk. By default, the translog is fsync'ed every 5 seconds and after a write request completes (e.g. index, delete, update, bulk). This process occurs on both the primary and replica shards. Ultimately, that means your client won't receive a 200 OK response until the entire request has been fsync'ed in the translog of the primary and all replicas.

Executing an fsync after every request does come with some performance cost, although in practice it is relatively small (especially for bulk ingestion, which amortizes the cost over many documents in a single request).
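
To make the amortization argument concrete, here is a small worked calculation. The 10 ms fsync latency is a made-up figure chosen only for illustration, not a measurement, and `amortized_fsync_cost_ms` is a hypothetical helper.

```python
def amortized_fsync_cost_ms(fsync_ms, docs_per_request):
    """Per-document share of one fsync when a request carries many documents."""
    if docs_per_request < 1:
        raise ValueError("a request must contain at least one document")
    return fsync_ms / docs_per_request


ASSUMED_FSYNC_MS = 10.0  # hypothetical latency of a single fsync

single_doc_cost = amortized_fsync_cost_ms(ASSUMED_FSYNC_MS, 1)
bulk_cost = amortized_fsync_cost_ms(ASSUMED_FSYNC_MS, 1000)
print(single_doc_cost, bulk_cost)
```

Under this assumption, a single-document request pays the whole fsync, while each document in a 1,000-document bulk request pays one thousandth of it.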

But for some high-volume clusters where losing a few seconds of data is not critical, it can be advantageous to fsync asynchronously. For example, writes can be buffered in memory and fsync'ed together every 5 seconds.

This behavior can be enabled by setting the durability parameter to async:

PUT /my_index/_settings
{
    "index.translog.durability": "async",
    "index.translog.sync_interval": "5s"
}

This setting can be configured per index and is dynamically updatable. If you decide to enable async translog behavior, you can lose up to sync_interval's worth of data if a crash happens. Please be aware of this characteristic before deciding!
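
As a rough way to size that risk, the worst case is simply the write rate multiplied by the sync interval. The function name `max_ops_at_risk` and the 2,000 operations per second figure below are assumptions for illustration only.

```python
def max_ops_at_risk(ops_per_second, sync_interval_seconds):
    """Upper bound on operations that a crash could lose with async durability.

    Everything written since the last translog fsync is at risk, so the
    worst case is one full sync interval of writes.
    """
    return ops_per_second * sync_interval_seconds


ASSUMED_OPS_PER_SECOND = 2000  # hypothetical indexing rate
SYNC_INTERVAL_SECONDS = 5      # matches "index.translog.sync_interval": "5s"

worst_case = max_ops_at_risk(ASSUMED_OPS_PER_SECOND, SYNC_INTERVAL_SECONDS)
print(worst_case)
```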

If you are unsure of the ramifications of this action, it is best to use the default ("index.translog.durability": "request") to avoid data loss.
