[go: up one dir, main page]

Skip to content

Retain user event data

Event records older than 12 months are being pruned. We need to disable this to prevent useful data being removed. We should not ever prune data unless explicitly done by an instance administrator.

Further details

Context by @yorickpeterse in https://gitlab.com/gitlab-org/gitlab-ce/issues/24244#note_60995986:

We prune old events for two reasons:

  1. The contribution calendar only displays the last 12 months of data

  2. Not pruning data will result in this table growing forever, and the table is already 33 GB in size. Assuming linear/even growth that means for 5 years of data we'd need about 170 GB. Since events don't grow linear (I expect them to grow exponentially as GitLab.com becomes more popular) we're probably looking at much more space being necessary.

Proposal

  • immediately update the prune old events worker to use 18 or 24 months on GitLab.com and for on-premises installation

This will have an increasing performance impact over time. It is therefore critical that we understand the timeline for when this will become a critical problem for GitLab.com https://gitlab.com/gitlab-org/gitlab-ce/issues/52247

Links / references

Edited by James Ramsay (ex-GitLab)