Details
-
Type:
Epic
-
Status: Done
-
Resolution: Done
-
Fix Version/s: None
-
Component/s: Qserv
-
Labels:None
-
Epic Name:W16 Secondary Index
-
Story Points:48
-
WBS:02C.06.02.03
-
Team:Data Access and Database
-
Cycle:Winter 2016
Description
Work on the secondary index (objectId --> chunkId / subChunkId mapping). This needs to be scalable to 40B entries. Since we are planning to ingest all data from DRP in <2 days, building should take <2 days. This epic involves researching applicable technologies (including experimenting with most promising ones). Deliverable: proposed technology / architecture along with measures performance at production scale (40 B entries).
Attachments
Issue Links
Key | Summary | Story Points | Assignee | Status | |
---|---|---|---|---|---|
|
5 | Unassigned | Done | ||
|
6 | Unassigned | Done | ||
|
6 | Unassigned | Done | ||
|
Experiment with light-weight SQL databases for secondary index |
8 | Unassigned | Done | |
|
3 | Unassigned | Done | ||
|
7 | Unassigned | Done | ||
|
3 | Unassigned | Done | ||
|
Collect multi-host and bulk-update performance data for secondary index |
5 | Unassigned | Won't Fix | |
|
5 | Unassigned | Won't Fix |
Discovered that performance plots are inaccurate: Labels for "Clock" should have been elapsed wall-clock time, but were instead total CPU usage. Regenerating all performance data with both CPU and clock times done properly.