Apache Flink - A Must-Have For Your Streams | Systems Design Interview 0 to 1 With Ex-Google SWE
Summary
Description
Again, go to iHop, crazy calories per dollar
To be clear, these snapshots work because every snapshot on a given node says that the state on that node is derived only from the messages *before* the snapshot barrier on that queue. Because Flink can keep multiple copies of state, a consumer can keep processing after it has received some, but not all, of its barriers. Ultimately the checkpointed state contains only the messages up to the barriers, so after a failure all of the consumers can replay messages starting from the barrier. Hope this makes sense.
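If it helps, here is a toy sketch of that idea in plain Python. It is not Flink's API or its actual implementation. `SummingOperator`, `BARRIER`, and the two-queue setup are all made up for illustration, and the "state" is just a running sum. The point is only to show a consumer that forks a copy of its state at the first barrier, keeps processing on the live copy, and ends up with a checkpoint that reflects only pre-barrier messages from every input queue.

```python
BARRIER = object()  # hypothetical marker standing in for a checkpoint barrier


class SummingOperator:
    """Toy consumer that sums integers arriving on several input queues."""

    def __init__(self, num_inputs):
        self.num_inputs = num_inputs
        self.live_total = 0        # state used for ongoing processing
        self.pending_total = None  # forked copy of state being checkpointed
        self.barrier_seen = set()  # inputs whose barrier has already arrived
        self.checkpoint = None     # most recent completed checkpoint

    def receive(self, input_id, event):
        if event is BARRIER:
            if self.pending_total is None:
                # First barrier: fork a copy of the state for the snapshot.
                self.pending_total = self.live_total
            self.barrier_seen.add(input_id)
            if len(self.barrier_seen) == self.num_inputs:
                # All barriers received: the forked copy is the checkpoint.
                self.checkpoint = self.pending_total
                self.pending_total = None
                self.barrier_seen.clear()
            return

        # Live state always advances, so processing never stalls.
        self.live_total += event
        # The forked copy only absorbs messages from queues that have
        # not delivered their barrier yet (still pre-barrier messages).
        if self.pending_total is not None and input_id not in self.barrier_seen:
            self.pending_total += event


op = SummingOperator(num_inputs=2)
events = [
    (0, 1), (1, 10),
    (0, BARRIER),   # queue 0 is now past its barrier
    (0, 2),         # post-barrier on queue 0: live state only
    (1, 20),        # still pre-barrier on queue 1: live and snapshot
    (1, BARRIER),   # all barriers in, checkpoint completes
    (1, 30),
]
for input_id, event in events:
    op.receive(input_id, event)

print(op.checkpoint, op.live_total)  # 31 63
```

The checkpoint is 31 (1 from queue 0, plus 10 and 20 from queue 1), while the live total has moved on to 63. After a failure, restoring 31 and replaying each queue from its barrier (2 on queue 0, 30 on queue 1) gets you back to 63 without double counting anything.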
Translated At: 2025-03-13T08:54:43Z