twitter / the-algorithm-ml
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 0% | 32% | 17% | 50%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 26% | 19% | 54%
yaml0% | 0% | 85% | 0% | 14%
toml0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
projects0% | 0% | 37% | 25% | 37%
core0% | 0% | 66% | 0% | 33%
common0% | 0% | 0% | 31% | 68%
reader0% | 0% | 0% | 0% | 100%
metrics0% | 0% | 0% | 0% | 100%
machines0% | 0% | 0% | 0% | 100%
optimizers0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
tools0% | 0% | 0% | 0% | 100%
ml_logging0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
local_prod.yaml
in projects/home/recap/config
476 -
421 22
dataset.py
in projects/home/recap/data
338 16
235 6
entrypoint.py
in projects/home/recap/model
234 6
config.py
in projects/home/recap/data
182 1
snapshot.py
in common/checkpointing
173 18
config.py
in projects/home/recap/model
169 1
preprocessors.py
in projects/home/recap/data
142 11
optimizer.py
in projects/home/recap/optimizer
124 4
models.py
in projects/twhin/models
108 5
dds.py
in reader
93 4
dataset.py
in reader
91 10
metrics.py
in core
91 15
rce.py
in metrics
90 10
auroc.py
in metrics
90 4
feature_transform.py
in projects/home/recap/model
89 14
edges.py
in projects/twhin/data
86 4
optimizer.py
in optimizers
80 6
mask_net.py
in projects/home/recap/model
79 5
main.py
in projects/home/recap
79 1
environment.py
in machines
78 14
tfe_parsing.py
in projects/home/recap/data
76 5
run.py
in projects/twhin
75 2
losses.py
in core
74 6
util.py
in projects/home/recap/data
70 7
69 2
local.yaml
in projects/twhin/config
67 -
batch.py
in common
65 11
utils.py
in reader
64 4
63 2
pq.py
in tools
55 8
55 6
generate_random_data.py
in projects/home/recap/data
55 6
config.py
in optimizers
55 1
config.py
in projects/home/recap/embedding
48 -
model_and_loss.py
in projects/home/recap/model
47 2
embedding.py
in common/modules/embedding
45 2
mlp.py
in projects/home/recap/model
44 5
model.py
in root
44 4
optimizer.py
in projects/twhin
42 2
config.py
in projects/twhin/models
41 2
aggregation.py
in metrics
38 5
config.py
in projects/home/recap
37 -
config.py
in common/modules/embedding
34 -
torch_logging.py
in ml_logging
33 1
base_config.py
in core/config
32 4
training.py
in core/config
32 -
get_env.py
in machines
31 1
28 1
Files With Most Units (Top 50)
File# lines# units
421 22
snapshot.py
in common/checkpointing
173 18
dataset.py
in projects/home/recap/data
338 16
metrics.py
in core
91 15
feature_transform.py
in projects/home/recap/model
89 14
environment.py
in machines
78 14
preprocessors.py
in projects/home/recap/data
142 11
batch.py
in common
65 11
dataset.py
in reader
91 10
rce.py
in metrics
90 10
pq.py
in tools
55 8
util.py
in projects/home/recap/data
70 7
235 6
losses.py
in core
74 6
55 6
entrypoint.py
in projects/home/recap/model
234 6
generate_random_data.py
in projects/home/recap/data
55 6
optimizer.py
in optimizers
80 6
aggregation.py
in metrics
38 5
mask_net.py
in projects/home/recap/model
79 5
mlp.py
in projects/home/recap/model
44 5
tfe_parsing.py
in projects/home/recap/data
76 5
models.py
in projects/twhin/models
108 5
utils.py
in reader
64 4
dds.py
in reader
93 4
auroc.py
in metrics
90 4
base_config.py
in core/config
32 4
optimizer.py
in projects/home/recap/optimizer
124 4
edges.py
in projects/twhin/data
86 4
model.py
in root
44 4
util.py
in common/filesystem
15 3
utils.py
in common
23 3
numeric_calibration.py
in projects/home/recap/model
13 2
model_and_loss.py
in projects/home/recap/model
47 2
run.py
in projects/twhin
75 2
config.py
in projects/twhin/models
41 2
optimizer.py
in projects/twhin
42 2
device.py
in common
23 2
63 2
69 2
embedding.py
in common/modules/embedding
45 2
is_venv.py
in machines
13 2
config_load.py
in core/config
13 1
25 1
config.py
in projects/home/recap/model
169 1
main.py
in projects/home/recap
79 1
config.py
in projects/home/recap/data
182 1
metrics.py
in projects/twhin
14 1
data.py
in projects/twhin/data
14 1
config.py
in optimizers
55 1
Files With Long Lines (Top 4)

There are 4 files with lines longer than 120 characters. In total, there are 7 long lines.

File# lines# units# long lines
entrypoint.py
in projects/home/recap/model
234 6 3
preprocessors.py
in projects/home/recap/data
142 11 2
config.py
in projects/home/recap/data
182 1 1
snapshot.py
in common/checkpointing
173 18 1
Correlations

File Size vs. Commits (all time): 0 points

No data for "commits (all time)" vs. "lines of code".

File Size vs. Contributors (all time): 0 points

No data for "contributors (all time)" vs. "lines of code".


File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".