spotify / big-data-rosetta-code
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 0% | 0% | 8% | 91%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
scala0% | 0% | 0% | 9% | 90%
sbt0% | 0% | 0% | 0% | 100%
yaml0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
src0% | 0% | 0% | 9% | 90%
ROOT0% | 0% | 0% | 0% | 100%
project0% | 0% | 0% | 0% | 100%
Longest Files (Top 30)
File# lines# units
TfIdf.scala
in src/main/scala/com/spotify/bdrc/pipeline
116 4
JoinLogAndMetadata.scala
in src/main/scala/com/spotify/bdrc/pipeline
95 7
Sessions.scala
in src/main/scala/com/spotify/bdrc/pipeline
64 4
TopItems.scala
in src/main/scala/com/spotify/bdrc/pipeline
61 6
JoinLogs.scala
in src/main/scala/com/spotify/bdrc/pipeline
61 4
PageRank.scala
in src/main/scala/com/spotify/bdrc/pipeline
61 3
FieldStatistics.scala
in src/main/scala/com/spotify/bdrc/pipeline
59 2
MinItemPerUser.scala
in src/main/scala/com/spotify/bdrc/pipeline
56 7
MaxItemPerUser.scala
in src/main/scala/com/spotify/bdrc/pipeline
56 7
build.sbt
in root
55 -
TotalAndDistinctCount.scala
in src/main/scala/com/spotify/bdrc/pipeline
54 5
AverageScorePerItem.scala
in src/main/scala/com/spotify/bdrc/pipeline
47 5
SumPerItem.scala
in src/main/scala/com/spotify/bdrc/pipeline
46 6
CountDistinctItems.scala
in src/main/scala/com/spotify/bdrc/pipeline
46 6
TopItemsPerUser.scala
in src/main/scala/com/spotify/bdrc/pipeline
45 5
CountUsers.scala
in src/main/scala/com/spotify/bdrc/pipeline
41 6
Statistics.scala
in src/main/scala/com/spotify/bdrc/pipeline
39 3
Count.scala
in src/main/scala/com/spotify/bdrc/pipeline
36 5
WordCount.scala
in src/main/scala/com/spotify/bdrc/pipeline
30 4
BloomFilterSetDifference.scala
in src/main/scala/com/spotify/bdrc/pipeline
30 3
HandlingOptions.scala
in src/main/scala/com/spotify/bdrc/scala
30 3
FilterMessyData.scala
in src/main/scala/com/spotify/bdrc/scala
28 2
InvertedIndex.scala
in src/main/scala/com/spotify/bdrc/pipeline
27 3
Collections.scala
in src/main/scala/com/spotify/bdrc/scala
27 -
FindMedian.scala
in src/main/scala/com/spotify/bdrc/pipeline
24 3
DistinctItems.scala
in src/main/scala/com/spotify/bdrc/pipeline
22 3
JavaPrimitives.scala
in src/main/scala/com/spotify/bdrc/scala
17 1
7 -
Records.scala
in src/main/scala/com/spotify/bdrc/util
6 -
plugins.sbt
in project
3 -
Files With Most Units (Top 25)
File# lines# units
MinItemPerUser.scala
in src/main/scala/com/spotify/bdrc/pipeline
56 7
MaxItemPerUser.scala
in src/main/scala/com/spotify/bdrc/pipeline
56 7
JoinLogAndMetadata.scala
in src/main/scala/com/spotify/bdrc/pipeline
95 7
TopItems.scala
in src/main/scala/com/spotify/bdrc/pipeline
61 6
CountUsers.scala
in src/main/scala/com/spotify/bdrc/pipeline
41 6
SumPerItem.scala
in src/main/scala/com/spotify/bdrc/pipeline
46 6
CountDistinctItems.scala
in src/main/scala/com/spotify/bdrc/pipeline
46 6
TopItemsPerUser.scala
in src/main/scala/com/spotify/bdrc/pipeline
45 5
TotalAndDistinctCount.scala
in src/main/scala/com/spotify/bdrc/pipeline
54 5
Count.scala
in src/main/scala/com/spotify/bdrc/pipeline
36 5
AverageScorePerItem.scala
in src/main/scala/com/spotify/bdrc/pipeline
47 5
TfIdf.scala
in src/main/scala/com/spotify/bdrc/pipeline
116 4
JoinLogs.scala
in src/main/scala/com/spotify/bdrc/pipeline
61 4
Sessions.scala
in src/main/scala/com/spotify/bdrc/pipeline
64 4
WordCount.scala
in src/main/scala/com/spotify/bdrc/pipeline
30 4
FindMedian.scala
in src/main/scala/com/spotify/bdrc/pipeline
24 3
Statistics.scala
in src/main/scala/com/spotify/bdrc/pipeline
39 3
DistinctItems.scala
in src/main/scala/com/spotify/bdrc/pipeline
22 3
BloomFilterSetDifference.scala
in src/main/scala/com/spotify/bdrc/pipeline
30 3
PageRank.scala
in src/main/scala/com/spotify/bdrc/pipeline
61 3
InvertedIndex.scala
in src/main/scala/com/spotify/bdrc/pipeline
27 3
HandlingOptions.scala
in src/main/scala/com/spotify/bdrc/scala
30 3
FieldStatistics.scala
in src/main/scala/com/spotify/bdrc/pipeline
59 2
FilterMessyData.scala
in src/main/scala/com/spotify/bdrc/scala
28 2
JavaPrimitives.scala
in src/main/scala/com/spotify/bdrc/scala
17 1
Files With Long Lines (Top 0)

There are 0 files with lines longer than 120 characters. In total, there are 0 long lines.

File# lines# units# long lines
Correlations

File Size vs. Commits (all time): 30 points

build.sbt x: 110 commits (all time) y: 55 lines of code project/plugins.sbt x: 16 commits (all time) y: 3 lines of code catalog-info.yaml x: 2 commits (all time) y: 7 lines of code src/main/scala/com/spotify/bdrc/pipeline/FieldStatistics.scala x: 5 commits (all time) y: 59 lines of code src/main/scala/com/spotify/bdrc/pipeline/Statistics.scala x: 7 commits (all time) y: 39 lines of code src/main/scala/com/spotify/bdrc/pipeline/MaxItemPerUser.scala x: 6 commits (all time) y: 56 lines of code src/main/scala/com/spotify/bdrc/pipeline/TopItems.scala x: 4 commits (all time) y: 61 lines of code src/main/scala/com/spotify/bdrc/pipeline/TopItemsPerUser.scala x: 6 commits (all time) y: 45 lines of code src/main/scala/com/spotify/bdrc/pipeline/Count.scala x: 5 commits (all time) y: 36 lines of code src/main/scala/com/spotify/bdrc/pipeline/CountDistinctItems.scala x: 5 commits (all time) y: 46 lines of code src/main/scala/com/spotify/bdrc/pipeline/CountUsers.scala x: 6 commits (all time) y: 41 lines of code src/main/scala/com/spotify/bdrc/pipeline/InvertedIndex.scala x: 4 commits (all time) y: 27 lines of code src/main/scala/com/spotify/bdrc/pipeline/JoinLogAndMetadata.scala x: 4 commits (all time) y: 95 lines of code src/main/scala/com/spotify/bdrc/pipeline/JoinLogs.scala x: 6 commits (all time) y: 61 lines of code src/main/scala/com/spotify/bdrc/pipeline/Sessions.scala x: 5 commits (all time) y: 64 lines of code src/main/scala/com/spotify/bdrc/pipeline/TfIdf.scala x: 4 commits (all time) y: 116 lines of code src/main/scala/com/spotify/bdrc/pipeline/BloomFilterSetDifference.scala x: 2 commits (all time) y: 30 lines of code src/main/scala/com/spotify/bdrc/pipeline/SumPerItem.scala x: 4 commits (all time) y: 46 lines of code src/main/scala/com/spotify/bdrc/pipeline/AverageScorePerItem.scala x: 3 commits (all time) y: 47 lines of code src/main/scala/com/spotify/bdrc/pipeline/TotalAndDistinctCount.scala x: 2 commits (all time) y: 54 lines of code src/main/scala/com/spotify/bdrc/scala/FilterMessyData.scala x: 4 commits (all time) y: 28 lines of code src/main/scala/com/spotify/bdrc/pipeline/DistinctItems.scala x: 3 commits (all time) y: 22 lines of code src/main/scala/com/spotify/bdrc/pipeline/WordCount.scala x: 3 commits (all time) y: 30 lines of code src/main/scala/com/spotify/bdrc/scala/Collections.scala x: 1 commits (all time) y: 27 lines of code src/main/scala/com/spotify/bdrc/scala/JavaPrimitives.scala x: 1 commits (all time) y: 17 lines of code src/main/scala/com/spotify/bdrc/pipeline/FindMedian.scala x: 2 commits (all time) y: 24 lines of code src/main/scala/com/spotify/bdrc/util/Records.scala x: 4 commits (all time) y: 6 lines of code
116.0
lines of code
  min: 3.0
  average: 42.97
  25th percentile: 27.0
  median: 43.0
  75th percentile: 56.75
  max: 116.0
0 110.0
commits (all time)
min: 1.0 | average: 7.87 | 25th percentile: 2.75 | median: 4.0 | 75th percentile: 6.0 | max: 110.0

File Size vs. Contributors (all time): 30 points

build.sbt x: 12 contributors (all time) y: 55 lines of code project/plugins.sbt x: 5 contributors (all time) y: 3 lines of code catalog-info.yaml x: 2 contributors (all time) y: 7 lines of code src/main/scala/com/spotify/bdrc/pipeline/FieldStatistics.scala x: 3 contributors (all time) y: 59 lines of code src/main/scala/com/spotify/bdrc/pipeline/Statistics.scala x: 3 contributors (all time) y: 39 lines of code src/main/scala/com/spotify/bdrc/pipeline/MaxItemPerUser.scala x: 4 contributors (all time) y: 56 lines of code src/main/scala/com/spotify/bdrc/pipeline/TopItems.scala x: 2 contributors (all time) y: 61 lines of code src/main/scala/com/spotify/bdrc/pipeline/TopItemsPerUser.scala x: 4 contributors (all time) y: 45 lines of code src/main/scala/com/spotify/bdrc/pipeline/Count.scala x: 3 contributors (all time) y: 36 lines of code src/main/scala/com/spotify/bdrc/pipeline/CountDistinctItems.scala x: 3 contributors (all time) y: 46 lines of code src/main/scala/com/spotify/bdrc/pipeline/CountUsers.scala x: 3 contributors (all time) y: 41 lines of code src/main/scala/com/spotify/bdrc/pipeline/InvertedIndex.scala x: 3 contributors (all time) y: 27 lines of code src/main/scala/com/spotify/bdrc/pipeline/JoinLogAndMetadata.scala x: 3 contributors (all time) y: 95 lines of code src/main/scala/com/spotify/bdrc/pipeline/JoinLogs.scala x: 3 contributors (all time) y: 61 lines of code src/main/scala/com/spotify/bdrc/pipeline/Sessions.scala x: 3 contributors (all time) y: 64 lines of code src/main/scala/com/spotify/bdrc/pipeline/TfIdf.scala x: 3 contributors (all time) y: 116 lines of code src/main/scala/com/spotify/bdrc/pipeline/BloomFilterSetDifference.scala x: 2 contributors (all time) y: 30 lines of code src/main/scala/com/spotify/bdrc/pipeline/AverageScorePerItem.scala x: 2 contributors (all time) y: 47 lines of code src/main/scala/com/spotify/bdrc/pipeline/TotalAndDistinctCount.scala x: 2 contributors (all time) y: 54 lines of code src/main/scala/com/spotify/bdrc/scala/FilterMessyData.scala x: 2 contributors (all time) y: 28 lines of code src/main/scala/com/spotify/bdrc/pipeline/DistinctItems.scala x: 1 contributors (all time) y: 22 lines of code src/main/scala/com/spotify/bdrc/pipeline/WordCount.scala x: 1 contributors (all time) y: 30 lines of code src/main/scala/com/spotify/bdrc/scala/Collections.scala x: 1 contributors (all time) y: 27 lines of code src/main/scala/com/spotify/bdrc/scala/JavaPrimitives.scala x: 1 contributors (all time) y: 17 lines of code src/main/scala/com/spotify/bdrc/pipeline/FindMedian.scala x: 1 contributors (all time) y: 24 lines of code src/main/scala/com/spotify/bdrc/util/Records.scala x: 1 contributors (all time) y: 6 lines of code
116.0
lines of code
  min: 3.0
  average: 42.97
  25th percentile: 27.0
  median: 43.0
  75th percentile: 56.75
  max: 116.0
0 12.0
contributors (all time)
min: 1.0 | average: 2.83 | 25th percentile: 2.0 | median: 3.0 | 75th percentile: 3.0 | max: 12.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".