twitter / communitynotes
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
13% | 32% | 35% | 12% | 6%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py13% | 32% | 35% | 12% | 6%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
sourcecode13% | 32% | 35% | 12% | 6%
Longest Files (Top 42)
File# lines# units
run_scoring.py
in sourcecode/scoring
1489 25
mf_base_scorer.py
in sourcecode/scoring
923 19
constants.py
in sourcecode/scoring
827 5
scoring_rules.py
in sourcecode/scoring
702 38
process_data.py
in sourcecode/scoring
564 26
pflip_model.py
in sourcecode/scoring
535 22
pandas_utils.py
in sourcecode/scoring
479 21
matrix_factorization.py
in sourcecode/scoring/matrix_factorization
467 17
contributor_state.py
in sourcecode/scoring
442 17
reputation_matrix_factorization.py
in sourcecode/scoring/reputation_matrix_factorization
439 16
note_ratings.py
in sourcecode/scoring
412 8
389 20
scorer.py
in sourcecode/scoring
325 20
pseudo_raters.py
in sourcecode/scoring/matrix_factorization
286 12
runner.py
in sourcecode/scoring
247 3
post_selection_similarity.py
in sourcecode/scoring
215 9
note_status_history.py
in sourcecode/scoring
201 7
mf_group_scorer.py
in sourcecode/scoring
177 13
mf_topic_scorer.py
in sourcecode/scoring
173 10
topic_model.py
in sourcecode/scoring
167 9
diligence_model.py
in sourcecode/scoring/reputation_matrix_factorization
167 4
helpfulness_scores.py
in sourcecode/scoring
158 4
reputation_scorer.py
in sourcecode/scoring
136 12
helpfulness_model.py
in sourcecode/scoring/reputation_matrix_factorization
129 3
normalized_loss.py
in sourcecode/scoring/matrix_factorization
120 6
incorrect_filter.py
in sourcecode/scoring
120 4
tag_consensus.py
in sourcecode/scoring
94 2
mf_expansion_scorer.py
in sourcecode/scoring
79 9
explanation_tags.py
in sourcecode/scoring
76 3
mf_expansion_plus_scorer.py
in sourcecode/scoring
73 9
mf_core_scorer.py
in sourcecode/scoring
64 6
weighted_loss.py
in sourcecode/scoring/reputation_matrix_factorization
61 4
tag_filter.py
in sourcecode/scoring
59 5
model.py
in sourcecode/scoring/matrix_factorization
52 5
dataset.py
in sourcecode/scoring/reputation_matrix_factorization
41 1
mf_multi_group_scorer.py
in sourcecode/scoring
31 4
enums.py
in sourcecode/scoring
23 1
main.py
in sourcecode
5 -
__init__.py
in sourcecode
1 -
__init__.py
in sourcecode/scoring/matrix_factorization
1 -
__init__.py
in sourcecode/scoring
1 -
__init__.py
in sourcecode/scoring/reputation_matrix_factorization
1 -
Files With Most Units (Top 37)
File# lines# units
scoring_rules.py
in sourcecode/scoring
702 38
process_data.py
in sourcecode/scoring
564 26
run_scoring.py
in sourcecode/scoring
1489 25
pflip_model.py
in sourcecode/scoring
535 22
pandas_utils.py
in sourcecode/scoring
479 21
389 20
scorer.py
in sourcecode/scoring
325 20
mf_base_scorer.py
in sourcecode/scoring
923 19
matrix_factorization.py
in sourcecode/scoring/matrix_factorization
467 17
contributor_state.py
in sourcecode/scoring
442 17
reputation_matrix_factorization.py
in sourcecode/scoring/reputation_matrix_factorization
439 16
mf_group_scorer.py
in sourcecode/scoring
177 13
pseudo_raters.py
in sourcecode/scoring/matrix_factorization
286 12
reputation_scorer.py
in sourcecode/scoring
136 12
mf_topic_scorer.py
in sourcecode/scoring
173 10
mf_expansion_scorer.py
in sourcecode/scoring
79 9
mf_expansion_plus_scorer.py
in sourcecode/scoring
73 9
post_selection_similarity.py
in sourcecode/scoring
215 9
topic_model.py
in sourcecode/scoring
167 9
note_ratings.py
in sourcecode/scoring
412 8
note_status_history.py
in sourcecode/scoring
201 7
mf_core_scorer.py
in sourcecode/scoring
64 6
normalized_loss.py
in sourcecode/scoring/matrix_factorization
120 6
model.py
in sourcecode/scoring/matrix_factorization
52 5
constants.py
in sourcecode/scoring
827 5
tag_filter.py
in sourcecode/scoring
59 5
mf_multi_group_scorer.py
in sourcecode/scoring
31 4
incorrect_filter.py
in sourcecode/scoring
120 4
helpfulness_scores.py
in sourcecode/scoring
158 4
weighted_loss.py
in sourcecode/scoring/reputation_matrix_factorization
61 4
diligence_model.py
in sourcecode/scoring/reputation_matrix_factorization
167 4
runner.py
in sourcecode/scoring
247 3
explanation_tags.py
in sourcecode/scoring
76 3
helpfulness_model.py
in sourcecode/scoring/reputation_matrix_factorization
129 3
tag_consensus.py
in sourcecode/scoring
94 2
enums.py
in sourcecode/scoring
23 1
dataset.py
in sourcecode/scoring/reputation_matrix_factorization
41 1
Files With Long Lines (Top 7)

There are 7 files with lines longer than 120 characters. In total, there are 31 long lines.

File# lines# units# long lines
process_data.py
in sourcecode/scoring
564 26 15
note_status_history.py
in sourcecode/scoring
201 7 5
contributor_state.py
in sourcecode/scoring
442 17 4
mf_base_scorer.py
in sourcecode/scoring
923 19 2
run_scoring.py
in sourcecode/scoring
1489 25 2
note_ratings.py
in sourcecode/scoring
412 8 2
helpfulness_scores.py
in sourcecode/scoring
158 4 1
Correlations

File Size vs. Commits (all time): 42 points

sourcecode/scoring/constants.py x: 70 commits (all time) y: 827 lines of code sourcecode/scoring/pandas_utils.py x: 14 commits (all time) y: 479 lines of code sourcecode/scoring/run_scoring.py x: 74 commits (all time) y: 1489 lines of code sourcecode/scoring/scoring_rules.py x: 58 commits (all time) y: 702 lines of code sourcecode/scoring/runner.py x: 29 commits (all time) y: 247 lines of code sourcecode/scoring/topic_model.py x: 10 commits (all time) y: 167 lines of code sourcecode/main.py x: 23 commits (all time) y: 5 lines of code sourcecode/scoring/matrix_factorization/matrix_factorization.py x: 29 commits (all time) y: 467 lines of code sourcecode/scoring/matrix_factorization/pseudo_raters.py x: 19 commits (all time) y: 286 lines of code sourcecode/scoring/mf_base_scorer.py x: 69 commits (all time) y: 923 lines of code sourcecode/scoring/note_status_history.py x: 24 commits (all time) y: 201 lines of code sourcecode/scoring/pflip_model.py x: 2 commits (all time) y: 535 lines of code sourcecode/scoring/process_data.py x: 64 commits (all time) y: 564 lines of code sourcecode/scoring/post_selection_similarity.py x: 14 commits (all time) y: 215 lines of code sourcecode/scoring/post_selection_similarity_old.py x: 2 commits (all time) y: 389 lines of code sourcecode/scoring/scorer.py x: 31 commits (all time) y: 325 lines of code sourcecode/scoring/contributor_state.py x: 18 commits (all time) y: 442 lines of code sourcecode/scoring/helpfulness_scores.py x: 19 commits (all time) y: 158 lines of code sourcecode/scoring/matrix_factorization/model.py x: 10 commits (all time) y: 52 lines of code sourcecode/scoring/note_ratings.py x: 42 commits (all time) y: 412 lines of code sourcecode/scoring/reputation_matrix_factorization/diligence_model.py x: 13 commits (all time) y: 167 lines of code sourcecode/scoring/reputation_matrix_factorization/helpfulness_model.py x: 6 commits (all time) y: 129 lines of code sourcecode/scoring/reputation_matrix_factorization/reputation_matrix_factorization.py x: 20 commits (all time) y: 439 lines of code sourcecode/scoring/reputation_scorer.py x: 16 commits (all time) y: 136 lines of code sourcecode/scoring/tag_consensus.py x: 13 commits (all time) y: 94 lines of code sourcecode/scoring/tag_filter.py x: 10 commits (all time) y: 59 lines of code sourcecode/scoring/enums.py x: 16 commits (all time) y: 23 lines of code sourcecode/scoring/mf_group_scorer.py x: 30 commits (all time) y: 177 lines of code sourcecode/scoring/mf_multi_group_scorer.py x: 2 commits (all time) y: 31 lines of code sourcecode/scoring/matrix_factorization/normalized_loss.py x: 4 commits (all time) y: 120 lines of code sourcecode/scoring/mf_core_scorer.py x: 30 commits (all time) y: 64 lines of code sourcecode/scoring/mf_expansion_scorer.py x: 31 commits (all time) y: 79 lines of code sourcecode/scoring/reputation_matrix_factorization/dataset.py x: 8 commits (all time) y: 41 lines of code sourcecode/scoring/mf_expansion_plus_scorer.py x: 10 commits (all time) y: 73 lines of code sourcecode/scoring/mf_topic_scorer.py x: 12 commits (all time) y: 173 lines of code sourcecode/scoring/incorrect_filter.py x: 21 commits (all time) y: 120 lines of code sourcecode/scoring/explanation_tags.py x: 21 commits (all time) y: 76 lines of code sourcecode/scoring/matrix_factorization/__init__.py x: 4 commits (all time) y: 1 lines of code sourcecode/scoring/reputation_matrix_factorization/__init__.py x: 3 commits (all time) y: 1 lines of code sourcecode/scoring/reputation_matrix_factorization/weighted_loss.py x: 6 commits (all time) y: 61 lines of code sourcecode/__init__.py x: 1 commits (all time) y: 1 lines of code
1489.0
lines of code
  min: 1.0
  average: 260.74
  25th percentile: 60.5
  median: 162.5
  75th percentile: 418.75
  max: 1489.0
0 74.0
commits (all time)
min: 1.0 | average: 21.4 | 25th percentile: 7.5 | median: 16.0 | 75th percentile: 29.25 | max: 74.0

File Size vs. Contributors (all time): 42 points

sourcecode/scoring/constants.py x: 7 contributors (all time) y: 827 lines of code sourcecode/scoring/pandas_utils.py x: 3 contributors (all time) y: 479 lines of code sourcecode/scoring/run_scoring.py x: 6 contributors (all time) y: 1489 lines of code sourcecode/scoring/scoring_rules.py x: 7 contributors (all time) y: 702 lines of code sourcecode/scoring/runner.py x: 3 contributors (all time) y: 247 lines of code sourcecode/scoring/topic_model.py x: 3 contributors (all time) y: 167 lines of code sourcecode/main.py x: 6 contributors (all time) y: 5 lines of code sourcecode/scoring/matrix_factorization/matrix_factorization.py x: 3 contributors (all time) y: 467 lines of code sourcecode/scoring/matrix_factorization/pseudo_raters.py x: 3 contributors (all time) y: 286 lines of code sourcecode/scoring/mf_base_scorer.py x: 7 contributors (all time) y: 923 lines of code sourcecode/scoring/note_status_history.py x: 5 contributors (all time) y: 201 lines of code sourcecode/scoring/pflip_model.py x: 2 contributors (all time) y: 535 lines of code sourcecode/scoring/process_data.py x: 8 contributors (all time) y: 564 lines of code sourcecode/scoring/post_selection_similarity.py x: 1 contributors (all time) y: 215 lines of code sourcecode/scoring/post_selection_similarity_old.py x: 1 contributors (all time) y: 389 lines of code sourcecode/scoring/scorer.py x: 6 contributors (all time) y: 325 lines of code sourcecode/scoring/contributor_state.py x: 5 contributors (all time) y: 442 lines of code sourcecode/scoring/helpfulness_scores.py x: 3 contributors (all time) y: 158 lines of code sourcecode/scoring/matrix_factorization/model.py x: 1 contributors (all time) y: 52 lines of code sourcecode/scoring/note_ratings.py x: 5 contributors (all time) y: 412 lines of code sourcecode/scoring/reputation_matrix_factorization/helpfulness_model.py x: 1 contributors (all time) y: 129 lines of code sourcecode/scoring/reputation_matrix_factorization/reputation_matrix_factorization.py x: 3 contributors (all time) y: 439 lines of code sourcecode/scoring/reputation_scorer.py x: 3 contributors (all time) y: 136 lines of code sourcecode/scoring/tag_consensus.py x: 3 contributors (all time) y: 94 lines of code sourcecode/scoring/tag_filter.py x: 3 contributors (all time) y: 59 lines of code sourcecode/scoring/enums.py x: 4 contributors (all time) y: 23 lines of code sourcecode/scoring/mf_group_scorer.py x: 4 contributors (all time) y: 177 lines of code sourcecode/scoring/mf_multi_group_scorer.py x: 1 contributors (all time) y: 31 lines of code sourcecode/scoring/matrix_factorization/normalized_loss.py x: 3 contributors (all time) y: 120 lines of code sourcecode/scoring/mf_core_scorer.py x: 5 contributors (all time) y: 64 lines of code sourcecode/scoring/mf_expansion_scorer.py x: 4 contributors (all time) y: 79 lines of code sourcecode/scoring/reputation_matrix_factorization/dataset.py x: 3 contributors (all time) y: 41 lines of code sourcecode/scoring/mf_expansion_plus_scorer.py x: 3 contributors (all time) y: 73 lines of code sourcecode/scoring/mf_topic_scorer.py x: 3 contributors (all time) y: 173 lines of code sourcecode/scoring/incorrect_filter.py x: 4 contributors (all time) y: 120 lines of code sourcecode/scoring/explanation_tags.py x: 7 contributors (all time) y: 76 lines of code sourcecode/scoring/matrix_factorization/__init__.py x: 1 contributors (all time) y: 1 lines of code sourcecode/scoring/reputation_matrix_factorization/__init__.py x: 2 contributors (all time) y: 1 lines of code
1489.0
lines of code
  min: 1.0
  average: 260.74
  25th percentile: 60.5
  median: 162.5
  75th percentile: 418.75
  max: 1489.0
0 8.0
contributors (all time)
min: 1.0 | average: 3.57 | 25th percentile: 2.75 | median: 3.0 | 75th percentile: 5.0 | max: 8.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 4 points

sourcecode/scoring/constants.py x: 2 commits (90d) y: 827 lines of code sourcecode/scoring/pandas_utils.py x: 2 commits (90d) y: 479 lines of code sourcecode/scoring/run_scoring.py x: 2 commits (90d) y: 1489 lines of code sourcecode/scoring/scoring_rules.py x: 2 commits (90d) y: 702 lines of code
1489.0
lines of code
  min: 479.0
  average: 874.25
  25th percentile: 534.75
  median: 764.5
  75th percentile: 1323.5
  max: 1489.0
0 2.0
commits (90d)
min: 2.0 | average: 2.0 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.0 | max: 2.0

File Size vs. Contributors (90 days): 4 points

sourcecode/scoring/constants.py x: 2 contributors (90d) y: 827 lines of code sourcecode/scoring/pandas_utils.py x: 2 contributors (90d) y: 479 lines of code sourcecode/scoring/run_scoring.py x: 2 contributors (90d) y: 1489 lines of code sourcecode/scoring/scoring_rules.py x: 2 contributors (90d) y: 702 lines of code
1489.0
lines of code
  min: 479.0
  average: 874.25
  25th percentile: 534.75
  median: 764.5
  75th percentile: 1323.5
  max: 1489.0
0 2.0
contributors (90d)
min: 2.0 | average: 2.0 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.0 | max: 2.0