This is the completed report that I am publishing to Power BI to be used by management and my team.

Landing page with key information:

Untitled

Submission numbers by year:

This graph was made to show the number of submissions received within a given year, which can also be compared to previous years. This is useful as it shows how the level of submissions has varied, and can help us forecast the number of submissions at the end of the present year.

Untitled

These dax formulas were used to calculate the cumulative number:

SubID_Count = COUNTA(Sheet1[SubID])
Cumulative Count = 
CALCULATE(
    [SubID_Count],
    FILTER(
        ALLSELECTED(Sheet1[Month]),
        Sheet1[Month] <= MAX(Sheet1[Month])
    )
)

Submission numbers by source in a given year:

This graph adds in the “Source” filter to look at which groups are submitting to the database. This enables us to track which group is submitting the most in a given year.

Untitled

Submission numbers by source and year:

This graph was then broken down further, so each source has its own graph to allow me to compare the number of submissions each source submitted across different years. This can be useful as it shows us if they are submitting more or less and how that may affect the number of sequences we will have named at the end of the year. This is useful as it helps with funding and is also good to track if numbers are down in a particular year for a given source, which can be queried if it is lower than expected.

Untitled

Pending and Waiting submissions:

This visual was requested by my stakeholder to look at the number of new submissions that we had received that month. This is useful as we name sequences on a monthly basis, this allows the stakeholder to have a rough idea of how many sequences will need naming that month. It also shows the number of submissions that require additional information, which we are waiting on responses for.

Untitled

Breakdown of genes (treemap):

This graph involved analysing the assigned names of all of the alleles in the database. In HLA, the allele name is a combination of the gene name followed by a string of numbers, separated by colons.