Dataset statistics
Number of variables | 30 |
---|---|
Number of observations | 350997 |
Missing cells | 6741010 |
Missing cells (%) | 64.0% |
Total size in memory | 80.3 MiB |
Average record size in memory | 240.0 B |
Variable types
Text | 23 |
---|---|
Numeric | 6 |
Unsupported | 1 |
Dataset
Description | A dataset from the WAMEX database. |
---|---|
URL | https://www.dmp.wa.gov.au/WAMEX-Minerals-Exploration-1476.aspx |
PRIORITY has constant value "" | Constant |
Strat_Sum has 305746 (87.1%) missing values | Missing |
Strat has 346567 (98.7%) missing values | Missing |
Mj1 has 54470 (15.5%) missing values | Missing |
Mj2 has 172094 (49.0%) missing values | Missing |
Mj3 has 261399 (74.5%) missing values | Missing |
Mj4 has 302371 (86.1%) missing values | Missing |
Mj5 has 350871 (> 99.9%) missing values | Missing |
Mn1 has 223856 (63.8%) missing values | Missing |
Mn2 has 298133 (84.9%) missing values | Missing |
Mn3 has 325115 (92.6%) missing values | Missing |
Mn4 has 332154 (94.6%) missing values | Missing |
Mn5 has 350982 (> 99.9%) missing values | Missing |
Tr1 has 303543 (86.5%) missing values | Missing |
Tr2 has 337810 (96.2%) missing values | Missing |
Tr3 has 344588 (98.2%) missing values | Missing |
Tr4 has 339159 (96.6%) missing values | Missing |
Tr5 has 350836 (> 99.9%) missing values | Missing |
Chip_pct has 161178 (45.9%) missing values | Missing |
Shape1 has 71713 (20.4%) missing values | Missing |
Shape2 has 342475 (97.6%) missing values | Missing |
Max_Dia has 62831 (17.9%) missing values | Missing |
Hardness has 350997 (100.0%) missing values | Missing |
Colour has 102224 (29.1%) missing values | Missing |
LithComment has 302145 (86.1%) missing values | Missing |
Ore_Texture has 347753 (99.1%) missing values | Missing |
Hardness is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
GEOLFROM has 14973 (4.3%) zeros | Zeros |
Reproduction
Analysis started | 2023-07-19 23:01:26.125119 |
---|---|
Analysis finished | 2023-07-19 23:01:36.396523 |
Duration | 10.27 seconds |
Software version | ydata-profiling vv4.3.1 |
Download configuration | config.json |
HOLEID
Text
Distinct | 6403 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
cc1164 | 314 | 0.1% |
cc1595 | 298 | 0.1% |
cc1596 | 284 | 0.1% |
cc1165 | 268 | 0.1% |
cc1166 | 268 | 0.1% |
cc1459 | 264 | 0.1% |
cc1443 | 256 | 0.1% |
cc1442 | 250 | 0.1% |
cc0001 | 236 | 0.1% |
cc1441 | 236 | 0.1% |
Other values (6393) | 348323 |
Most occurring characters
Value | Count | Frequency (%) |
C | 584375 | |
1 | 238862 | |
0 | 197982 | 9.4% |
2 | 191223 | 9.1% |
3 | 158961 | 7.5% |
4 | 114346 | 5.4% |
5 | 104310 | 4.9% |
6 | 103931 | 4.9% |
9 | 101964 | 4.8% |
7 | 96961 | 4.6% |
Other values (7) | 217227 | 10.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1403988 | |
Uppercase Letter | 706154 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 238862 | |
0 | 197982 | |
2 | 191223 | |
3 | 158961 | |
4 | 114346 | |
5 | 104310 | |
6 | 103931 | |
9 | 101964 | |
7 | 96961 | |
8 | 95448 | 6.8% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 584375 | |
B | 89825 | 12.7% |
W | 14447 | 2.0% |
K | 10434 | 1.5% |
F | 4013 | 0.6% |
D | 1960 | 0.3% |
P | 1100 | 0.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1403988 | |
Latin | 706154 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 238862 | |
0 | 197982 | |
2 | 191223 | |
3 | 158961 | |
4 | 114346 | |
5 | 104310 | |
6 | 103931 | |
9 | 101964 | |
7 | 96961 | |
8 | 95448 | 6.8% |
Latin
Value | Count | Frequency (%) |
C | 584375 | |
B | 89825 | 12.7% |
W | 14447 | 2.0% |
K | 10434 | 1.5% |
F | 4013 | 0.6% |
D | 1960 | 0.3% |
P | 1100 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2110142 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 584375 | |
1 | 238862 | |
0 | 197982 | 9.4% |
2 | 191223 | 9.1% |
3 | 158961 | 7.5% |
4 | 114346 | 5.4% |
5 | 104310 | 4.9% |
6 | 103931 | 4.9% |
9 | 101964 | 4.8% |
7 | 96961 | 4.6% |
Other values (7) | 217227 | 10.3% |
PROJECTCODE
Text
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
cc | 246725 | |
cb | 89825 | 25.6% |
wk | 10434 | 3.0% |
wf | 4013 | 1.1% |
Most occurring characters
Value | Count | Frequency (%) |
C | 583275 | |
B | 89825 | 12.8% |
W | 14447 | 2.1% |
K | 10434 | 1.5% |
F | 4013 | 0.6% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 701994 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 583275 | |
B | 89825 | 12.8% |
W | 14447 | 2.1% |
K | 10434 | 1.5% |
F | 4013 | 0.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 701994 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
C | 583275 | |
B | 89825 | 12.8% |
W | 14447 | 2.1% |
K | 10434 | 1.5% |
F | 4013 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 701994 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 583275 | |
B | 89825 | 12.8% |
W | 14447 | 2.1% |
K | 10434 | 1.5% |
F | 4013 | 0.6% |
GEOLFROM
Real number (ℝ)
ZEROS
 
Distinct | 614 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 24.56020222 |
Minimum | 0 |
---|---|
Maximum | 143 |
Zeros | 14973 |
Zeros (%) | 4.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 9 |
median | 20 |
Q3 | 35 |
95-th percentile | 64 |
Maximum | 143 |
Range | 143 |
Interquartile range (IQR) | 26 |
Descriptive statistics
Standard deviation | 20.26223983 |
---|---|
Coefficient of variation (CV) | 0.8250029722 |
Kurtosis | 1.639174278 |
Mean | 24.56020222 |
Median Absolute Deviation (MAD) | 12 |
Skewness | 1.216507032 |
Sum | 8620557.3 |
Variance | 410.558363 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 14973 | 4.3% |
6 | 8581 | 2.4% |
4 | 8562 | 2.4% |
12 | 8543 | 2.4% |
5 | 8506 | 2.4% |
9 | 8502 | 2.4% |
10 | 8487 | 2.4% |
7 | 8465 | 2.4% |
3 | 8464 | 2.4% |
8 | 8456 | 2.4% |
Other values (604) | 259458 |
Value | Count | Frequency (%) |
0 | 14973 | |
0.2 | 2 | < 0.1% |
0.3 | 2 | < 0.1% |
0.4 | 1 | < 0.1% |
0.5 | 14 | < 0.1% |
Value | Count | Frequency (%) |
143 | 2 | |
142 | 2 | |
141 | 2 | |
140 | 2 | |
139 | 4 |
GEOLTO
Real number (ℝ)
Distinct | 631 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26.30145272 |
Minimum | 0.2 |
---|---|
Maximum | 144 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 MiB |
Quantile statistics
Minimum | 0.2 |
---|---|
5-th percentile | 3 |
Q1 | 11 |
median | 22 |
Q3 | 37 |
95-th percentile | 66 |
Maximum | 144 |
Range | 143.8 |
Interquartile range (IQR) | 26 |
Descriptive statistics
Standard deviation | 20.28137304 |
---|---|
Coefficient of variation (CV) | 0.771112275 |
Kurtosis | 1.634311732 |
Mean | 26.30145272 |
Median Absolute Deviation (MAD) | 12 |
Skewness | 1.214693604 |
Sum | 9231731 |
Variance | 411.3340925 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 8744 | 2.5% |
16 | 8712 | 2.5% |
12 | 8671 | 2.5% |
4 | 8602 | 2.5% |
6 | 8597 | 2.4% |
5 | 8524 | 2.4% |
9 | 8521 | 2.4% |
7 | 8503 | 2.4% |
3 | 8487 | 2.4% |
8 | 8451 | 2.4% |
Other values (621) | 265185 |
Value | Count | Frequency (%) |
0.2 | 2 | < 0.1% |
0.3 | 2 | < 0.1% |
0.4 | 1 | < 0.1% |
0.5 | 13 | |
0.6 | 1 | < 0.1% |
Value | Count | Frequency (%) |
144 | 4 | |
143 | 2 | < 0.1% |
142 | 2 | < 0.1% |
141 | 2 | < 0.1% |
140 | 6 |
PRIORITY
Real number (ℝ)
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1 |
Minimum | 1 |
---|---|
Maximum | 1 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 1 |
Maximum | 1 |
Range | 0 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0 |
---|---|
Coefficient of variation (CV) | 0 |
Kurtosis | 0 |
Mean | 1 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0 |
Sum | 350997 |
Variance | 0 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
1 | 350997 |
Value | Count | Frequency (%) |
1 | 350997 |
Value | Count | Frequency (%) |
1 | 350997 |
Strat_Sum
Text
MISSING
 
Distinct | 24 |
---|---|
Distinct (%) | 0.1% |
Missing | 305746 |
Missing (%) | 87.1% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
mum | 8268 | |
mub | 7237 | |
hc | 5263 | |
ta | 4720 | |
tdi | 3116 | 6.9% |
mus | 2935 | 6.5% |
muh | 2914 | 6.4% |
mut | 2504 | 5.5% |
tds | 2154 | 4.8% |
muf | 2041 | 4.5% |
Other values (14) | 4099 |
Most occurring characters
Value | Count | Frequency (%) |
M | 27027 | |
U | 27027 | |
T | 11219 | |
m | 8753 | 7.1% |
b | 7237 | 5.8% |
d | 5755 | 4.6% |
H | 5567 | 4.5% |
s | 5389 | 4.3% |
c | 5270 | 4.2% |
a | 4720 | 3.8% |
Other values (19) | 16050 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 73221 | |
Lowercase Letter | 50793 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
m | 8753 | |
b | 7237 | |
d | 5755 | |
s | 5389 | |
c | 5270 | |
a | 4720 | |
i | 3116 | 6.1% |
h | 2914 | 5.7% |
t | 2504 | 4.9% |
f | 2334 | 4.6% |
Other values (5) | 2801 | 5.5% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 27027 | |
U | 27027 | |
T | 11219 | |
H | 5567 | 7.6% |
C | 778 | 1.1% |
J | 543 | 0.7% |
D | 434 | 0.6% |
I | 430 | 0.6% |
F | 93 | 0.1% |
L | 55 | 0.1% |
Other values (4) | 48 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 124014 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 27027 | |
U | 27027 | |
T | 11219 | |
m | 8753 | 7.1% |
b | 7237 | 5.8% |
d | 5755 | 4.6% |
H | 5567 | 4.5% |
s | 5389 | 4.3% |
c | 5270 | 4.2% |
a | 4720 | 3.8% |
Other values (19) | 16050 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 124014 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 27027 | |
U | 27027 | |
T | 11219 | |
m | 8753 | 7.1% |
b | 7237 | 5.8% |
d | 5755 | 4.6% |
H | 5567 | 4.5% |
s | 5389 | 4.3% |
c | 5270 | 4.2% |
a | 4720 | 3.8% |
Other values (19) | 16050 |
Strat
Text
MISSING
 
Distinct | 16 |
---|---|
Distinct (%) | 0.4% |
Missing | 346567 |
Missing (%) | 98.7% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
mub | 836 | |
ta | 702 | |
mum | 648 | |
tdi | 504 | |
hc | 372 | |
muh | 268 | 6.0% |
tds | 240 | 5.4% |
mut | 216 | 4.9% |
muf | 192 | 4.3% |
tdm | 142 | 3.2% |
Other values (6) | 310 | 7.0% |
Most occurring characters
Value | Count | Frequency (%) |
M | 2306 | |
U | 2306 | |
T | 1588 | |
d | 886 | 7.4% |
b | 836 | 6.9% |
m | 790 | 6.6% |
a | 702 | 5.8% |
i | 504 | 4.2% |
H | 374 | 3.1% |
c | 372 | 3.1% |
Other values (11) | 1390 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 6736 | |
Lowercase Letter | 5318 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
d | 886 | |
b | 836 | |
m | 790 | |
a | 702 | |
i | 504 | |
c | 372 | |
s | 370 | |
h | 268 | 5.0% |
f | 224 | 4.2% |
t | 216 | 4.1% |
Other values (4) | 150 | 2.8% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 2306 | |
U | 2306 | |
T | 1588 | |
H | 374 | 5.6% |
J | 108 | 1.6% |
C | 32 | 0.5% |
F | 22 | 0.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 12054 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 2306 | |
U | 2306 | |
T | 1588 | |
d | 886 | 7.4% |
b | 836 | 6.9% |
m | 790 | 6.6% |
a | 702 | 5.8% |
i | 504 | 4.2% |
H | 374 | 3.1% |
c | 372 | 3.1% |
Other values (11) | 1390 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 12054 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 2306 | |
U | 2306 | |
T | 1588 | |
d | 886 | 7.4% |
b | 836 | 6.9% |
m | 790 | 6.6% |
a | 702 | 5.8% |
i | 504 | 4.2% |
H | 374 | 3.1% |
c | 372 | 3.1% |
Other values (11) | 1390 |
Mj1
Text
MISSING
 
Distinct | 88 |
---|---|
Distinct (%) | < 0.1% |
Missing | 54470 |
Missing (%) | 15.5% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
ch | 70733 | |
ghm | 36371 | |
gom | 36269 | |
klf | 25300 | 8.5% |
vgh | 21633 | 7.3% |
goh | 10550 | 3.6% |
klp | 8517 | 2.9% |
hsm | 8269 | 2.8% |
hom | 8224 | 2.8% |
shm | 7850 | 2.6% |
Other values (78) | 62811 |
Most occurring characters
Value | Count | Frequency (%) |
H | 201610 | |
G | 137454 | |
M | 114341 | |
C | 72925 | 9.0% |
O | 62691 | 7.7% |
F | 50314 | 6.2% |
K | 40407 | 5.0% |
L | 40224 | 5.0% |
S | 40124 | 4.9% |
V | 21633 | 2.7% |
Other values (16) | 30020 | 3.7% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 804961 | |
Lowercase Letter | 6782 | 0.8% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
H | 201610 | |
G | 137454 | |
M | 114341 | |
C | 72925 | 9.1% |
O | 62691 | 7.8% |
F | 50314 | 6.3% |
K | 40407 | 5.0% |
L | 40224 | 5.0% |
S | 40124 | 5.0% |
V | 21633 | 2.7% |
Other values (13) | 23238 | 2.9% |
Lowercase Letter
Value | Count | Frequency (%) |
d | 4031 | |
p | 2747 | |
c | 4 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 811743 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
H | 201610 | |
G | 137454 | |
M | 114341 | |
C | 72925 | 9.0% |
O | 62691 | 7.7% |
F | 50314 | 6.2% |
K | 40407 | 5.0% |
L | 40224 | 5.0% |
S | 40124 | 4.9% |
V | 21633 | 2.7% |
Other values (16) | 30020 | 3.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 811743 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
H | 201610 | |
G | 137454 | |
M | 114341 | |
C | 72925 | 9.0% |
O | 62691 | 7.7% |
F | 50314 | 6.2% |
K | 40407 | 5.0% |
L | 40224 | 5.0% |
S | 40124 | 4.9% |
V | 21633 | 2.7% |
Other values (16) | 30020 | 3.7% |
Mj2
Text
MISSING
 
Distinct | 93 |
---|---|
Distinct (%) | 0.1% |
Missing | 172094 |
Missing (%) | 49.0% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
gom | 22089 | |
ghm | 22052 | |
ch | 20268 | |
shm | 16400 | 9.2% |
klf | 10474 | 5.9% |
vgh | 10177 | 5.7% |
hsm | 8827 | 4.9% |
ogf | 7288 | 4.1% |
goh | 6778 | 3.8% |
hom | 5861 | 3.3% |
Other values (83) | 48689 |
Most occurring characters
Value | Count | Frequency (%) |
H | 121231 | |
G | 91153 | |
M | 88331 | |
S | 47255 | 9.2% |
O | 44786 | 8.8% |
F | 37703 | 7.4% |
C | 22148 | 4.3% |
K | 14956 | 2.9% |
L | 14851 | 2.9% |
V | 10177 | 2.0% |
Other values (15) | 18521 | 3.6% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 506839 | |
Lowercase Letter | 4273 | 0.8% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
H | 121231 | |
G | 91153 | |
M | 88331 | |
S | 47255 | 9.3% |
O | 44786 | 8.8% |
F | 37703 | 7.4% |
C | 22148 | 4.4% |
K | 14956 | 3.0% |
L | 14851 | 2.9% |
V | 10177 | 2.0% |
Other values (13) | 14248 | 2.8% |
Lowercase Letter
Value | Count | Frequency (%) |
d | 2421 | |
p | 1852 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 511112 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
H | 121231 | |
G | 91153 | |
M | 88331 | |
S | 47255 | 9.2% |
O | 44786 | 8.8% |
F | 37703 | 7.4% |
C | 22148 | 4.3% |
K | 14956 | 2.9% |
L | 14851 | 2.9% |
V | 10177 | 2.0% |
Other values (15) | 18521 | 3.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 511112 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
H | 121231 | |
G | 91153 | |
M | 88331 | |
S | 47255 | 9.2% |
O | 44786 | 8.8% |
F | 37703 | 7.4% |
C | 22148 | 4.3% |
K | 14956 | 2.9% |
L | 14851 | 2.9% |
V | 10177 | 2.0% |
Other values (15) | 18521 | 3.6% |
Mj3
Text
MISSING
 
Distinct | 81 |
---|---|
Distinct (%) | 0.1% |
Missing | 261399 |
Missing (%) | 74.5% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
ghm | 10636 | |
gom | 9475 | 10.6% |
klf | 8177 | 9.1% |
ch | 7807 | 8.7% |
shm | 7383 | 8.2% |
hsm | 4924 | 5.5% |
vgh | 4622 | 5.2% |
ogf | 4147 | 4.6% |
hsf | 3350 | 3.7% |
goh | 3037 | 3.4% |
Other values (71) | 26040 |
Most occurring characters
Value | Count | Frequency (%) |
H | 54241 | |
G | 42328 | |
M | 42163 | |
S | 24021 | |
F | 22568 | |
O | 20207 | 7.9% |
K | 12130 | 4.8% |
L | 11997 | 4.7% |
C | 8598 | 3.4% |
V | 4623 | 1.8% |
Other values (15) | 12445 | 4.9% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 253628 | |
Lowercase Letter | 1693 | 0.7% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
H | 54241 | |
G | 42328 | |
M | 42163 | |
S | 24021 | |
F | 22568 | |
O | 20207 | 8.0% |
K | 12130 | 4.8% |
L | 11997 | 4.7% |
C | 8598 | 3.4% |
V | 4623 | 1.8% |
Other values (13) | 10752 | 4.2% |
Lowercase Letter
Value | Count | Frequency (%) |
d | 1020 | |
p | 673 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 255321 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
H | 54241 | |
G | 42328 | |
M | 42163 | |
S | 24021 | |
F | 22568 | |
O | 20207 | 7.9% |
K | 12130 | 4.8% |
L | 11997 | 4.7% |
C | 8598 | 3.4% |
V | 4623 | 1.8% |
Other values (15) | 12445 | 4.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 255321 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
H | 54241 | |
G | 42328 | |
M | 42163 | |
S | 24021 | |
F | 22568 | |
O | 20207 | 7.9% |
K | 12130 | 4.8% |
L | 11997 | 4.7% |
C | 8598 | 3.4% |
V | 4623 | 1.8% |
Other values (15) | 12445 | 4.9% |
Mj4
Text
MISSING
 
Distinct | 74 |
---|---|
Distinct (%) | 0.2% |
Missing | 302371 |
Missing (%) | 86.1% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
ghm | 6031 | |
gom | 5337 | 11.0% |
klp | 3791 | 7.8% |
ch | 3601 | 7.4% |
shm | 3414 | 7.0% |
vgh | 2817 | 5.8% |
klf | 2772 | 5.7% |
hsm | 2614 | 5.4% |
pi | 1846 | 3.8% |
goh | 1808 | 3.7% |
Other values (64) | 14595 |
Most occurring characters
Value | Count | Frequency (%) |
H | 27896 | |
G | 23497 | |
M | 23247 | |
S | 11972 | |
O | 10645 | 7.7% |
F | 9135 | 6.6% |
K | 7411 | 5.3% |
L | 7316 | 5.3% |
P | 5874 | 4.2% |
C | 3990 | 2.9% |
Other values (15) | 7589 | 5.5% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 137753 | |
Lowercase Letter | 819 | 0.6% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
H | 27896 | |
G | 23497 | |
M | 23247 | |
S | 11972 | |
O | 10645 | 7.7% |
F | 9135 | 6.6% |
K | 7411 | 5.4% |
L | 7316 | 5.3% |
P | 5874 | 4.3% |
C | 3990 | 2.9% |
Other values (13) | 6770 | 4.9% |
Lowercase Letter
Value | Count | Frequency (%) |
d | 602 | |
p | 217 | 26.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 138572 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
H | 27896 | |
G | 23497 | |
M | 23247 | |
S | 11972 | |
O | 10645 | 7.7% |
F | 9135 | 6.6% |
K | 7411 | 5.3% |
L | 7316 | 5.3% |
P | 5874 | 4.2% |
C | 3990 | 2.9% |
Other values (15) | 7589 | 5.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 138572 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
H | 27896 | |
G | 23497 | |
M | 23247 | |
S | 11972 | |
O | 10645 | 7.7% |
F | 9135 | 6.6% |
K | 7411 | 5.3% |
L | 7316 | 5.3% |
P | 5874 | 4.2% |
C | 3990 | 2.9% |
Other values (15) | 7589 | 5.5% |
Mj5
Text
MISSING
 
Distinct | 25 |
---|---|
Distinct (%) | 19.8% |
Missing | 350871 |
Missing (%) | > 99.9% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
ghm | 25 | |
gom | 17 | |
vgh | 15 | |
shm | 12 | |
klf | 9 | 7.1% |
ch | 7 | 5.6% |
hsf | 5 | 4.0% |
vg | 4 | 3.2% |
gsf | 3 | 2.4% |
rs | 3 | 2.4% |
Other values (15) | 26 |
Most occurring characters
Value | Count | Frequency (%) |
G | 71 | |
H | 69 | |
M | 65 | |
S | 34 | |
O | 24 | 6.8% |
F | 22 | 6.2% |
V | 19 | 5.4% |
K | 12 | 3.4% |
L | 12 | 3.4% |
C | 7 | 2.0% |
Other values (6) | 17 | 4.8% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 352 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
G | 71 | |
H | 69 | |
M | 65 | |
S | 34 | |
O | 24 | 6.8% |
F | 22 | 6.2% |
V | 19 | 5.4% |
K | 12 | 3.4% |
L | 12 | 3.4% |
C | 7 | 2.0% |
Other values (6) | 17 | 4.8% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 352 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
G | 71 | |
H | 69 | |
M | 65 | |
S | 34 | |
O | 24 | 6.8% |
F | 22 | 6.2% |
V | 19 | 5.4% |
K | 12 | 3.4% |
L | 12 | 3.4% |
C | 7 | 2.0% |
Other values (6) | 17 | 4.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 352 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
G | 71 | |
H | 69 | |
M | 65 | |
S | 34 | |
O | 24 | 6.8% |
F | 22 | 6.2% |
V | 19 | 5.4% |
K | 12 | 3.4% |
L | 12 | 3.4% |
C | 7 | 2.0% |
Other values (6) | 17 | 4.8% |
Mn1
Text
MISSING
 
Distinct | 91 |
---|---|
Distinct (%) | 0.1% |
Missing | 223856 |
Missing (%) | 63.8% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
ghm | 13482 | 10.6% |
gom | 13066 | 10.3% |
ch | 11988 | 9.4% |
vgh | 8478 | 6.7% |
ogf | 7901 | 6.2% |
hsm | 7711 | 6.1% |
klf | 6038 | 4.7% |
hom | 5734 | 4.5% |
shm | 5576 | 4.4% |
gsm | 4530 | 3.6% |
Other values (81) | 42637 |
Most occurring characters
Value | Count | Frequency (%) |
H | 77352 | |
G | 63125 | |
M | 60334 | |
S | 33429 | |
O | 33218 | |
F | 31648 | |
C | 14521 | 4.0% |
K | 10018 | 2.8% |
L | 9304 | 2.6% |
V | 8484 | 2.4% |
Other values (15) | 17709 | 4.9% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 356731 | |
Lowercase Letter | 2411 | 0.7% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
H | 77352 | |
G | 63125 | |
M | 60334 | |
S | 33429 | |
O | 33218 | |
F | 31648 | |
C | 14521 | 4.1% |
K | 10018 | 2.8% |
L | 9304 | 2.6% |
V | 8484 | 2.4% |
Other values (13) | 15298 | 4.3% |
Lowercase Letter
Value | Count | Frequency (%) |
d | 1483 | |
p | 928 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 359142 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
H | 77352 | |
G | 63125 | |
M | 60334 | |
S | 33429 | |
O | 33218 | |
F | 31648 | |
C | 14521 | 4.0% |
K | 10018 | 2.8% |
L | 9304 | 2.6% |
V | 8484 | 2.4% |
Other values (15) | 17709 | 4.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 359142 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
H | 77352 | |
G | 63125 | |
M | 60334 | |
S | 33429 | |
O | 33218 | |
F | 31648 | |
C | 14521 | 4.0% |
K | 10018 | 2.8% |
L | 9304 | 2.6% |
V | 8484 | 2.4% |
Other values (15) | 17709 | 4.9% |
Mn2
Text
MISSING
 
Distinct | 84 |
---|---|
Distinct (%) | 0.2% |
Missing | 298133 |
Missing (%) | 84.9% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
gom | 5480 | 10.4% |
ghm | 4933 | 9.3% |
ch | 3924 | 7.4% |
hsm | 3674 | 6.9% |
ogf | 3374 | 6.4% |
vgh | 3180 | 6.0% |
shm | 3129 | 5.9% |
hom | 2201 | 4.2% |
gsm | 2125 | 4.0% |
mi | 2057 | 3.9% |
Other values (74) | 18787 |
Most occurring characters
Value | Count | Frequency (%) |
H | 29691 | |
M | 27068 | |
G | 25185 | |
S | 15553 | |
O | 13333 | |
F | 12645 | |
C | 4476 | 3.0% |
K | 3880 | 2.6% |
I | 3654 | 2.5% |
L | 3506 | 2.4% |
Other values (15) | 9504 | 6.4% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 147783 | |
Lowercase Letter | 712 | 0.5% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
H | 29691 | |
M | 27068 | |
G | 25185 | |
S | 15553 | |
O | 13333 | |
F | 12645 | |
C | 4476 | 3.0% |
K | 3880 | 2.6% |
I | 3654 | 2.5% |
L | 3506 | 2.4% |
Other values (13) | 8792 | 5.9% |
Lowercase Letter
Value | Count | Frequency (%) |
d | 422 | |
p | 290 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 148495 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
H | 29691 | |
M | 27068 | |
G | 25185 | |
S | 15553 | |
O | 13333 | |
F | 12645 | |
C | 4476 | 3.0% |
K | 3880 | 2.6% |
I | 3654 | 2.5% |
L | 3506 | 2.4% |
Other values (15) | 9504 | 6.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 148495 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
H | 29691 | |
M | 27068 | |
G | 25185 | |
S | 15553 | |
O | 13333 | |
F | 12645 | |
C | 4476 | 3.0% |
K | 3880 | 2.6% |
I | 3654 | 2.5% |
L | 3506 | 2.4% |
Other values (15) | 9504 | 6.4% |
Mn3
Text
MISSING
 
Distinct | 72 |
---|---|
Distinct (%) | 0.3% |
Missing | 325115 |
Missing (%) | 92.6% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
gom | 2394 | 9.2% |
ghm | 2379 | 9.2% |
mi | 2207 | 8.5% |
ch | 1878 | 7.3% |
hsm | 1632 | 6.3% |
shm | 1327 | 5.1% |
vgh | 1317 | 5.1% |
gsf | 1205 | 4.7% |
gsm | 1108 | 4.3% |
hom | 1089 | 4.2% |
Other values (62) | 9346 |
Most occurring characters
Value | Count | Frequency (%) |
M | 13954 | |
H | 12977 | |
G | 11303 | |
S | 7297 | |
O | 5429 | 7.6% |
F | 5406 | 7.6% |
I | 3278 | 4.6% |
K | 2194 | 3.1% |
C | 2105 | 3.0% |
P | 2063 | 2.9% |
Other values (15) | 4975 | 7.0% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 70747 | |
Lowercase Letter | 234 | 0.3% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 13954 | |
H | 12977 | |
G | 11303 | |
S | 7297 | |
O | 5429 | 7.7% |
F | 5406 | 7.6% |
I | 3278 | 4.6% |
K | 2194 | 3.1% |
C | 2105 | 3.0% |
P | 2063 | 2.9% |
Other values (13) | 4741 | 6.7% |
Lowercase Letter
Value | Count | Frequency (%) |
d | 121 | |
p | 113 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 70981 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 13954 | |
H | 12977 | |
G | 11303 | |
S | 7297 | |
O | 5429 | 7.6% |
F | 5406 | 7.6% |
I | 3278 | 4.6% |
K | 2194 | 3.1% |
C | 2105 | 3.0% |
P | 2063 | 2.9% |
Other values (15) | 4975 | 7.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 70981 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 13954 | |
H | 12977 | |
G | 11303 | |
S | 7297 | |
O | 5429 | 7.6% |
F | 5406 | 7.6% |
I | 3278 | 4.6% |
K | 2194 | 3.1% |
C | 2105 | 3.0% |
P | 2063 | 2.9% |
Other values (15) | 4975 | 7.0% |
Mn4
Text
MISSING
 
Distinct | 62 |
---|---|
Distinct (%) | 0.3% |
Missing | 332154 |
Missing (%) | 94.6% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
mi | 1855 | 9.8% |
ch | 1576 | 8.4% |
gom | 1553 | 8.2% |
pi | 1511 | 8.0% |
ghm | 1088 | 5.8% |
klp | 938 | 5.0% |
vgh | 888 | 4.7% |
gsf | 861 | 4.6% |
hsf | 804 | 4.3% |
hsm | 761 | 4.0% |
Other values (52) | 7008 |
Most occurring characters
Value | Count | Frequency (%) |
M | 8861 | |
H | 8107 | |
G | 6599 | |
S | 4728 | |
F | 3961 | |
I | 3370 | 6.7% |
O | 3290 | 6.6% |
P | 2527 | 5.1% |
K | 2222 | 4.5% |
L | 1720 | 3.4% |
Other values (15) | 4543 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 49808 | |
Lowercase Letter | 120 | 0.2% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 8861 | |
H | 8107 | |
G | 6599 | |
S | 4728 | |
F | 3961 | |
I | 3370 | 6.8% |
O | 3290 | 6.6% |
P | 2527 | 5.1% |
K | 2222 | 4.5% |
L | 1720 | 3.5% |
Other values (13) | 4423 |
Lowercase Letter
Value | Count | Frequency (%) |
d | 68 | |
p | 52 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 49928 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 8861 | |
H | 8107 | |
G | 6599 | |
S | 4728 | |
F | 3961 | |
I | 3370 | 6.7% |
O | 3290 | 6.6% |
P | 2527 | 5.1% |
K | 2222 | 4.5% |
L | 1720 | 3.4% |
Other values (15) | 4543 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 49928 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 8861 | |
H | 8107 | |
G | 6599 | |
S | 4728 | |
F | 3961 | |
I | 3370 | 6.7% |
O | 3290 | 6.6% |
P | 2527 | 5.1% |
K | 2222 | 4.5% |
L | 1720 | 3.4% |
Other values (15) | 4543 |
Mn5
Text
MISSING
 
Distinct | 5 |
---|---|
Distinct (%) | 33.3% |
Missing | 350982 |
Missing (%) | > 99.9% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
pi | 6 | |
ch | 5 | |
ogf | 2 | 13.3% |
ys | 1 | 6.7% |
ggm | 1 | 6.7% |
Most occurring characters
Value | Count | Frequency (%) |
P | 6 | |
I | 6 | |
C | 5 | |
H | 5 | |
G | 4 | |
O | 2 | 6.1% |
F | 2 | 6.1% |
Y | 1 | 3.0% |
S | 1 | 3.0% |
M | 1 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 33 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
P | 6 | |
I | 6 | |
C | 5 | |
H | 5 | |
G | 4 | |
O | 2 | 6.1% |
F | 2 | 6.1% |
Y | 1 | 3.0% |
S | 1 | 3.0% |
M | 1 | 3.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 33 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
P | 6 | |
I | 6 | |
C | 5 | |
H | 5 | |
G | 4 | |
O | 2 | 6.1% |
F | 2 | 6.1% |
Y | 1 | 3.0% |
S | 1 | 3.0% |
M | 1 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 33 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
P | 6 | |
I | 6 | |
C | 5 | |
H | 5 | |
G | 4 | |
O | 2 | 6.1% |
F | 2 | 6.1% |
Y | 1 | 3.0% |
S | 1 | 3.0% |
M | 1 | 3.0% |
Tr1
Text
MISSING
 
Distinct | 76 |
---|---|
Distinct (%) | 0.2% |
Missing | 303543 |
Missing (%) | 86.5% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
mi | 18742 | |
mn | 3323 | 7.0% |
ch | 2966 | 6.3% |
ka | 2546 | 5.4% |
ogf | 1891 | 4.0% |
gom | 1468 | 3.1% |
pi | 1420 | 3.0% |
ghm | 1264 | 2.7% |
vgh | 1188 | 2.5% |
klp | 947 | 2.0% |
Other values (66) | 11699 |
Most occurring characters
Value | Count | Frequency (%) |
M | 30157 | |
I | 20545 | |
H | 10834 | 9.8% |
G | 8415 | 7.6% |
O | 6139 | 5.6% |
F | 5395 | 4.9% |
S | 5243 | 4.8% |
K | 4981 | 4.5% |
N | 3709 | 3.4% |
C | 3403 | 3.1% |
Other values (15) | 11278 | 10.2% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 109867 | |
Lowercase Letter | 232 | 0.2% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 30157 | |
I | 20545 | |
H | 10834 | 9.9% |
G | 8415 | 7.7% |
O | 6139 | 5.6% |
F | 5395 | 4.9% |
S | 5243 | 4.8% |
K | 4981 | 4.5% |
N | 3709 | 3.4% |
C | 3403 | 3.1% |
Other values (13) | 11046 | 10.1% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 120 | |
d | 112 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 110099 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 30157 | |
I | 20545 | |
H | 10834 | 9.8% |
G | 8415 | 7.6% |
O | 6139 | 5.6% |
F | 5395 | 4.9% |
S | 5243 | 4.8% |
K | 4981 | 4.5% |
N | 3709 | 3.4% |
C | 3403 | 3.1% |
Other values (15) | 11278 | 10.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 110099 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 30157 | |
I | 20545 | |
H | 10834 | 9.8% |
G | 8415 | 7.6% |
O | 6139 | 5.6% |
F | 5395 | 4.9% |
S | 5243 | 4.8% |
K | 4981 | 4.5% |
N | 3709 | 3.4% |
C | 3403 | 3.1% |
Other values (15) | 11278 | 10.2% |
Tr2
Text
MISSING
 
Distinct | 63 |
---|---|
Distinct (%) | 0.5% |
Missing | 337810 |
Missing (%) | 96.2% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
mi | 4357 | |
pi | 947 | 7.2% |
ka | 914 | 6.9% |
klp | 732 | 5.6% |
ch | 555 | 4.2% |
ghm | 523 | 4.0% |
gom | 475 | 3.6% |
mn | 443 | 3.4% |
ogf | 420 | 3.2% |
mo | 351 | 2.7% |
Other values (53) | 3470 |
Most occurring characters
Value | Count | Frequency (%) |
M | 7585 | |
I | 5425 | |
H | 3002 | 9.5% |
G | 2546 | 8.1% |
K | 2050 | 6.5% |
O | 1751 | 5.6% |
P | 1698 | 5.4% |
F | 1670 | 5.3% |
S | 1660 | 5.3% |
L | 1136 | 3.6% |
Other values (15) | 2994 | 9.5% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 31450 | |
Lowercase Letter | 67 | 0.2% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 7585 | |
I | 5425 | |
H | 3002 | 9.5% |
G | 2546 | 8.1% |
K | 2050 | 6.5% |
O | 1751 | 5.6% |
P | 1698 | 5.4% |
F | 1670 | 5.3% |
S | 1660 | 5.3% |
L | 1136 | 3.6% |
Other values (13) | 2927 | 9.3% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 44 | |
d | 23 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 31517 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 7585 | |
I | 5425 | |
H | 3002 | 9.5% |
G | 2546 | 8.1% |
K | 2050 | 6.5% |
O | 1751 | 5.6% |
P | 1698 | 5.4% |
F | 1670 | 5.3% |
S | 1660 | 5.3% |
L | 1136 | 3.6% |
Other values (15) | 2994 | 9.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 31517 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 7585 | |
I | 5425 | |
H | 3002 | 9.5% |
G | 2546 | 8.1% |
K | 2050 | 6.5% |
O | 1751 | 5.6% |
P | 1698 | 5.4% |
F | 1670 | 5.3% |
S | 1660 | 5.3% |
L | 1136 | 3.6% |
Other values (15) | 2994 | 9.5% |
Tr3
Text
MISSING
 
Distinct | 45 |
---|---|
Distinct (%) | 0.7% |
Missing | 344588 |
Missing (%) | 98.2% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
mi | 1971 | |
klp | 847 | |
ghm | 549 | 8.6% |
ch | 369 | 5.8% |
gom | 303 | 4.7% |
ka | 296 | 4.6% |
ogf | 172 | 2.7% |
pi | 159 | 2.5% |
vgh | 148 | 2.3% |
mn | 132 | 2.1% |
Other values (35) | 1463 |
Most occurring characters
Value | Count | Frequency (%) |
M | 3664 | |
I | 2213 | |
H | 1823 | |
G | 1628 | |
K | 1188 | 7.4% |
P | 1022 | 6.4% |
L | 892 | 5.6% |
O | 864 | 5.4% |
F | 667 | 4.2% |
S | 628 | 3.9% |
Other values (14) | 1397 | 8.7% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 15952 | |
Lowercase Letter | 34 | 0.2% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 3664 | |
I | 2213 | |
H | 1823 | |
G | 1628 | |
K | 1188 | 7.4% |
P | 1022 | 6.4% |
L | 892 | 5.6% |
O | 864 | 5.4% |
F | 667 | 4.2% |
S | 628 | 3.9% |
Other values (12) | 1363 | 8.5% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 19 | |
d | 15 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 15986 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 3664 | |
I | 2213 | |
H | 1823 | |
G | 1628 | |
K | 1188 | 7.4% |
P | 1022 | 6.4% |
L | 892 | 5.6% |
O | 864 | 5.4% |
F | 667 | 4.2% |
S | 628 | 3.9% |
Other values (14) | 1397 | 8.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 15986 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 3664 | |
I | 2213 | |
H | 1823 | |
G | 1628 | |
K | 1188 | 7.4% |
P | 1022 | 6.4% |
L | 892 | 5.6% |
O | 864 | 5.4% |
F | 667 | 4.2% |
S | 628 | 3.9% |
Other values (14) | 1397 | 8.7% |
Tr4
Text
MISSING
 
Distinct | 40 |
---|---|
Distinct (%) | 0.3% |
Missing | 339159 |
Missing (%) | 96.6% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
mi | 6796 | |
mo | 2090 | 17.7% |
mn | 600 | 5.1% |
ghm | 345 | 2.9% |
mnf | 281 | 2.4% |
klp | 268 | 2.3% |
ogf | 259 | 2.2% |
ch | 219 | 1.8% |
hof | 140 | 1.2% |
pi | 112 | 0.9% |
Other values (30) | 728 | 6.1% |
Most occurring characters
Value | Count | Frequency (%) |
M | 10447 | |
I | 6910 | |
O | 2678 | 10.5% |
H | 974 | 3.8% |
G | 925 | 3.6% |
N | 891 | 3.5% |
F | 772 | 3.0% |
P | 415 | 1.6% |
K | 392 | 1.5% |
L | 302 | 1.2% |
Other values (11) | 760 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 25459 | |
Lowercase Letter | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 10447 | |
I | 6910 | |
O | 2678 | 10.5% |
H | 974 | 3.8% |
G | 925 | 3.6% |
N | 891 | 3.5% |
F | 772 | 3.0% |
P | 415 | 1.6% |
K | 392 | 1.5% |
L | 302 | 1.2% |
Other values (9) | 753 | 3.0% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 6 | |
d | 1 | 14.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 25466 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 10447 | |
I | 6910 | |
O | 2678 | 10.5% |
H | 974 | 3.8% |
G | 925 | 3.6% |
N | 891 | 3.5% |
F | 772 | 3.0% |
P | 415 | 1.6% |
K | 392 | 1.5% |
L | 302 | 1.2% |
Other values (11) | 760 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 25466 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 10447 | |
I | 6910 | |
O | 2678 | 10.5% |
H | 974 | 3.8% |
G | 925 | 3.6% |
N | 891 | 3.5% |
F | 772 | 3.0% |
P | 415 | 1.6% |
K | 392 | 1.5% |
L | 302 | 1.2% |
Other values (11) | 760 | 3.0% |
Tr5
Text
MISSING
 
Distinct | 5 |
---|---|
Distinct (%) | 3.1% |
Missing | 350836 |
Missing (%) | > 99.9% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
mi | 153 | |
pi | 3 | 1.9% |
ghm | 2 | 1.2% |
mo | 2 | 1.2% |
klp | 1 | 0.6% |
Most occurring characters
Value | Count | Frequency (%) |
M | 157 | |
I | 156 | |
P | 4 | 1.2% |
G | 2 | 0.6% |
H | 2 | 0.6% |
O | 2 | 0.6% |
K | 1 | 0.3% |
L | 1 | 0.3% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 325 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 157 | |
I | 156 | |
P | 4 | 1.2% |
G | 2 | 0.6% |
H | 2 | 0.6% |
O | 2 | 0.6% |
K | 1 | 0.3% |
L | 1 | 0.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 325 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 157 | |
I | 156 | |
P | 4 | 1.2% |
G | 2 | 0.6% |
H | 2 | 0.6% |
O | 2 | 0.6% |
K | 1 | 0.3% |
L | 1 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 325 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
M | 157 | |
I | 156 | |
P | 4 | 1.2% |
G | 2 | 0.6% |
H | 2 | 0.6% |
O | 2 | 0.6% |
K | 1 | 0.3% |
L | 1 | 0.3% |
Chip_pct
Real number (ℝ)
MISSING
 
Distinct | 45 |
---|---|
Distinct (%) | < 0.1% |
Missing | 161178 |
Missing (%) | 45.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 34.27222775 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 22 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 10 |
Q1 | 20 |
median | 30 |
Q3 | 40 |
95-th percentile | 65 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 16.37143038 |
---|---|
Coefficient of variation (CV) | 0.4776879548 |
Kurtosis | 0.5188388138 |
Mean | 34.27222775 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0.7228901153 |
Sum | 6505520 |
Variance | 268.0237327 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
30 | 42579 | 12.1% |
40 | 35170 | 10.0% |
20 | 31516 | 9.0% |
50 | 21133 | 6.0% |
10 | 12707 | 3.6% |
25 | 9669 | 2.8% |
60 | 9648 | 2.7% |
15 | 6362 | 1.8% |
35 | 4687 | 1.3% |
70 | 3669 | 1.0% |
Other values (35) | 12679 | 3.6% |
(Missing) | 161178 |
Value | Count | Frequency (%) |
0 | 22 | |
1 | 5 | < 0.1% |
2 | 43 | |
3 | 39 | |
4 | 6 | < 0.1% |
Value | Count | Frequency (%) |
100 | 100 | < 0.1% |
95 | 47 | < 0.1% |
90 | 911 | |
86 | 1 | < 0.1% |
85 | 141 | < 0.1% |
Shape1
Text
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 71713 |
Missing (%) | 20.4% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
aa | 120593 | |
sa | 110536 | |
sr | 32558 | 11.7% |
rr | 8529 | 3.1% |
va | 6293 | 2.3% |
wr | 775 | 0.3% |
Most occurring characters
Value | Count | Frequency (%) |
A | 358015 | |
S | 143094 | 25.6% |
R | 50391 | 9.0% |
V | 6293 | 1.1% |
W | 775 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 558568 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 358015 | |
S | 143094 | 25.6% |
R | 50391 | 9.0% |
V | 6293 | 1.1% |
W | 775 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 558568 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 358015 | |
S | 143094 | 25.6% |
R | 50391 | 9.0% |
V | 6293 | 1.1% |
W | 775 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 558568 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
A | 358015 | |
S | 143094 | 25.6% |
R | 50391 | 9.0% |
V | 6293 | 1.1% |
W | 775 | 0.1% |
Shape2
Text
MISSING
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 342475 |
Missing (%) | 97.6% |
Memory size | 2.7 MiB |
Value | Count | Frequency (%) |
sr | 3003 | |
sa | 2810 | |
rr | 1492 | |
aa | 1217 |
Most occurring characters
Value | Count | Frequency (%) |
R | 5987 | |
S | 5813 | |
A | 5244 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 17044 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
R | 5987 | |
S | 5813 | |
A | 5244 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 17044 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
R | 5987 | |
S | 5813 | |
A | 5244 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 17044 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
R | 5987 | |
S | 5813 | |
A | 5244 |
Max_Dia
Real number (ℝ)
MISSING
 
Distinct | 56 |
---|---|
Distinct (%) | < 0.1% |
Missing | 62831 |
Missing (%) | 17.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14.60768793 |
Minimum | 0 |
---|---|
Maximum | 525 |
Zeros | 24 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 5 |
Q1 | 10 |
median | 12 |
Q3 | 20 |
95-th percentile | 30 |
Maximum | 525 |
Range | 525 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 7.195335875 |
---|---|
Coefficient of variation (CV) | 0.4925718505 |
Kurtosis | 209.682909 |
Mean | 14.60768793 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 4.28658828 |
Sum | 4209439 |
Variance | 51.77285836 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 117872 | |
20 | 55764 | |
15 | 54933 | |
5 | 20654 | 5.9% |
25 | 15768 | 4.5% |
30 | 10603 | 3.0% |
35 | 3038 | 0.9% |
7 | 2799 | 0.8% |
40 | 2069 | 0.6% |
12 | 881 | 0.3% |
Other values (46) | 3785 | 1.1% |
(Missing) | 62831 |
Value | Count | Frequency (%) |
0 | 24 | < 0.1% |
1 | 51 | < 0.1% |
2 | 510 | |
3 | 717 | |
4 | 237 | 0.1% |
Value | Count | Frequency (%) |
525 | 2 | |
405 | 1 | < 0.1% |
151 | 1 | < 0.1% |
145 | 1 | < 0.1% |
140 | 3 |
Hardness
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 350997 |
---|---|
Missing (%) | 100.0% |
Memory size | 2.7 MiB |
Colour
Real number (ℝ)
MISSING
 
Distinct | 90 |
---|---|
Distinct (%) | < 0.1% |
Missing | 102224 |
Missing (%) | 29.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 27.14761248 |
Minimum | 1 |
---|---|
Maximum | 96 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 9 |
Q1 | 21 |
median | 25 |
Q3 | 34 |
95-th percentile | 50 |
Maximum | 96 |
Range | 95 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 12.72642994 |
---|---|
Coefficient of variation (CV) | 0.4687863415 |
Kurtosis | 4.562979416 |
Mean | 27.14761248 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 1.558933637 |
Sum | 6753593 |
Variance | 161.9620189 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
24 | 32057 | 9.1% |
25 | 30375 | 8.7% |
34 | 23317 | 6.6% |
33 | 12439 | 3.5% |
22 | 12281 | 3.5% |
27 | 11128 | 3.2% |
35 | 10710 | 3.1% |
17 | 10555 | 3.0% |
14 | 8783 | 2.5% |
18 | 8587 | 2.4% |
Other values (80) | 88541 | |
(Missing) | 102224 |
Value | Count | Frequency (%) |
1 | 206 | 0.1% |
2 | 779 | 0.2% |
3 | 1209 | |
4 | 1507 | |
5 | 2449 |
Value | Count | Frequency (%) |
96 | 26 | < 0.1% |
95 | 12 | < 0.1% |
94 | 2 | < 0.1% |
93 | 359 | |
92 | 143 | < 0.1% |
LithComment
Text
MISSING
 
Distinct | 13963 |
---|---|
Distinct (%) | 28.6% |
Missing | 302145 |
Missing (%) | 86.1% |
Memory size | 2.7 MiB |
Length
Max length | 146 |
---|---|
Median length | 103 |
Mean length | 14.23139278 |
Min length | 1 |
Characters and Unicode
Total characters | 695232 |
---|---|
Distinct characters | 87 |
Distinct categories | 11 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 7455 ? |
---|---|
Unique (%) | 15.3% |
Sample
1st row | Interbedded shales |
---|---|
2nd row | Interbedded shales |
3rd row | Shales interbedded |
4th row | Black shales with chert |
5th row | Duplicate missed out |
Value | Count | Frequency (%) |
7794 | 6.3% | |
duplicate | 5354 | 4.3% |
damp | 3286 | 2.6% |
mn | 2957 | 2.4% |
eoh | 2753 | 2.2% |
lab | 2659 | 2.1% |
clay | 2550 | 2.1% |
wet | 2290 | 1.8% |
to | 1612 | 1.3% |
stained | 1515 | 1.2% |
Other values (9621) | 91424 |
Most occurring characters
Value | Count | Frequency (%) |
81028 | 11.7% | |
e | 28985 | 4.2% |
i | 24694 | 3.6% |
E | 24249 | 3.5% |
S | 22931 | 3.3% |
a | 22755 | 3.3% |
T | 22345 | 3.2% |
A | 21515 | 3.1% |
I | 21411 | 3.1% |
t | 21020 | 3.0% |
Other values (77) | 404299 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 297852 | |
Lowercase Letter | 238018 | |
Space Separator | 81028 | 11.7% |
Decimal Number | 59085 | 8.5% |
Other Punctuation | 10070 | 1.4% |
Dash Punctuation | 6356 | 0.9% |
Close Punctuation | 995 | 0.1% |
Open Punctuation | 993 | 0.1% |
Math Symbol | 814 | 0.1% |
Connector Punctuation | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 28985 | |
i | 24694 | |
a | 22755 | 9.6% |
t | 21020 | 8.8% |
l | 17424 | 7.3% |
s | 14965 | 6.3% |
n | 13816 | 5.8% |
o | 11902 | 5.0% |
c | 11847 | 5.0% |
r | 9937 | 4.2% |
Other values (16) | 60673 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 24249 | 8.1% |
S | 22931 | 7.7% |
T | 22345 | 7.5% |
A | 21515 | 7.2% |
I | 21411 | 7.2% |
D | 20455 | 6.9% |
L | 20231 | 6.8% |
M | 19681 | 6.6% |
O | 18421 | 6.2% |
N | 13326 | 4.5% |
Other values (16) | 93287 |
Other Punctuation
Value | Count | Frequency (%) |
. | 3176 | |
? | 1868 | |
/ | 1736 | |
@ | 1439 | |
% | 562 | 5.6% |
& | 389 | 3.9% |
* | 255 | 2.5% |
: | 253 | 2.5% |
' | 198 | 2.0% |
! | 102 | 1.0% |
Other values (2) | 92 | 0.9% |
Decimal Number
Value | Count | Frequency (%) |
3 | 7490 | |
2 | 7406 | |
1 | 6714 | |
0 | 6383 | |
4 | 6226 | |
9 | 6076 | |
6 | 4975 | |
5 | 4760 | |
8 | 4590 | |
7 | 4465 |
Math Symbol
Value | Count | Frequency (%) |
= | 483 | |
< | 250 | |
> | 38 | 4.7% |
+ | 37 | 4.5% |
~ | 6 | 0.7% |
Close Punctuation
Value | Count | Frequency (%) |
) | 993 | |
] | 2 | 0.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 991 | |
[ | 2 | 0.2% |
Space Separator
Value | Count | Frequency (%) |
81028 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6356 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 19 |
Other Symbol
Value | Count | Frequency (%) |
� | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 535870 | |
Common | 159362 | 22.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 28985 | 5.4% |
i | 24694 | 4.6% |
E | 24249 | 4.5% |
S | 22931 | 4.3% |
a | 22755 | 4.2% |
T | 22345 | 4.2% |
A | 21515 | 4.0% |
I | 21411 | 4.0% |
t | 21020 | 3.9% |
D | 20455 | 3.8% |
Other values (42) | 305510 |
Common
Value | Count | Frequency (%) |
81028 | ||
3 | 7490 | 4.7% |
2 | 7406 | 4.6% |
1 | 6714 | 4.2% |
0 | 6383 | 4.0% |
- | 6356 | 4.0% |
4 | 6226 | 3.9% |
9 | 6076 | 3.8% |
6 | 4975 | 3.1% |
5 | 4760 | 3.0% |
Other values (25) | 21948 | 13.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 695230 | |
Specials | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
81028 | 11.7% | |
e | 28985 | 4.2% |
i | 24694 | 3.6% |
E | 24249 | 3.5% |
S | 22931 | 3.3% |
a | 22755 | 3.3% |
T | 22345 | 3.2% |
A | 21515 | 3.1% |
I | 21411 | 3.1% |
t | 21020 | 3.0% |
Other values (76) | 404297 |
Specials
Value | Count | Frequency (%) |
� | 2 |
Ore_Texture
Text
MISSING
 
Distinct | 1030 |
---|---|
Distinct (%) | 31.8% |
Missing | 347753 |
Missing (%) | 99.1% |
Memory size | 2.7 MiB |
Length
Max length | 84 |
---|---|
Median length | 59 |
Mean length | 17.19235512 |
Min length | 1 |
Characters and Unicode
Total characters | 55772 |
---|---|
Distinct characters | 83 |
Distinct categories | 10 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 195 ? |
---|---|
Unique (%) | 6.0% |
Sample
1st row | low recovery |
---|---|
2nd row | Steel blue hematite and bedding |
3rd row | clay in P of GHHp |
4th row | clay in P of GHHp |
5th row | clay in P of GHHp |
Value | Count | Frequency (%) |
588 | 6.3% | |
duplicate | 274 | 2.9% |
clay | 227 | 2.4% |
mn | 165 | 1.8% |
of | 159 | 1.7% |
in | 144 | 1.5% |
injected | 142 | 1.5% |
damp | 141 | 1.5% |
and | 115 | 1.2% |
vgh | 114 | 1.2% |
Other values (984) | 7283 |
Most occurring characters
Value | Count | Frequency (%) |
9583 | 17.2% | |
e | 3823 | 6.9% |
i | 3036 | 5.4% |
a | 2790 | 5.0% |
t | 2590 | 4.6% |
l | 2371 | 4.3% |
s | 2253 | 4.0% |
n | 1805 | 3.2% |
c | 1732 | 3.1% |
o | 1728 | 3.1% |
Other values (73) | 24061 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 32928 | |
Uppercase Letter | 9647 | 17.3% |
Space Separator | 9583 | 17.2% |
Decimal Number | 2095 | 3.8% |
Other Punctuation | 907 | 1.6% |
Dash Punctuation | 394 | 0.7% |
Close Punctuation | 78 | 0.1% |
Open Punctuation | 74 | 0.1% |
Math Symbol | 64 | 0.1% |
Other Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 3823 | |
i | 3036 | 9.2% |
a | 2790 | 8.5% |
t | 2590 | 7.9% |
l | 2371 | 7.2% |
s | 2253 | 6.8% |
n | 1805 | 5.5% |
c | 1732 | 5.3% |
o | 1728 | 5.2% |
d | 1588 | 4.8% |
Other values (16) | 9212 |
Uppercase Letter
Value | Count | Frequency (%) |
H | 889 | 9.2% |
M | 787 | 8.2% |
O | 779 | 8.1% |
S | 757 | 7.8% |
T | 737 | 7.6% |
I | 684 | 7.1% |
E | 635 | 6.6% |
L | 520 | 5.4% |
G | 514 | 5.3% |
A | 505 | 5.2% |
Other values (16) | 2840 |
Other Punctuation
Value | Count | Frequency (%) |
. | 460 | |
? | 152 | 16.8% |
/ | 122 | 13.5% |
% | 69 | 7.6% |
: | 54 | 6.0% |
& | 21 | 2.3% |
* | 11 | 1.2% |
' | 8 | 0.9% |
; | 3 | 0.3% |
# | 3 | 0.3% |
Other values (2) | 4 | 0.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 330 | |
9 | 308 | |
1 | 274 | |
2 | 257 | |
7 | 179 | |
8 | 160 | |
4 | 148 | |
5 | 147 | |
3 | 147 | |
6 | 145 |
Math Symbol
Value | Count | Frequency (%) |
= | 42 | |
< | 11 | 17.2% |
+ | 9 | 14.1% |
> | 2 | 3.1% |
Space Separator
Value | Count | Frequency (%) |
9583 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 394 |
Close Punctuation
Value | Count | Frequency (%) |
) | 78 |
Open Punctuation
Value | Count | Frequency (%) |
( | 74 |
Other Symbol
Value | Count | Frequency (%) |
� | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 42575 | |
Common | 13197 | 23.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 3823 | 9.0% |
i | 3036 | 7.1% |
a | 2790 | 6.6% |
t | 2590 | 6.1% |
l | 2371 | 5.6% |
s | 2253 | 5.3% |
n | 1805 | 4.2% |
c | 1732 | 4.1% |
o | 1728 | 4.1% |
d | 1588 | 3.7% |
Other values (42) | 18859 |
Common
Value | Count | Frequency (%) |
9583 | ||
. | 460 | 3.5% |
- | 394 | 3.0% |
0 | 330 | 2.5% |
9 | 308 | 2.3% |
1 | 274 | 2.1% |
2 | 257 | 1.9% |
7 | 179 | 1.4% |
8 | 160 | 1.2% |
? | 152 | 1.2% |
Other values (21) | 1100 | 8.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 55770 | |
Specials | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
9583 | 17.2% | |
e | 3823 | 6.9% |
i | 3036 | 5.4% |
a | 2790 | 5.0% |
t | 2590 | 4.6% |
l | 2371 | 4.3% |
s | 2253 | 4.0% |
n | 1805 | 3.2% |
c | 1732 | 3.1% |
o | 1728 | 3.1% |
Other values (72) | 24059 |
Specials
Value | Count | Frequency (%) |
� | 2 |
HOLEID | PROJECTCODE | GEOLFROM | GEOLTO | PRIORITY | Strat_Sum | Strat | Mj1 | Mj2 | Mj3 | Mj4 | Mj5 | Mn1 | Mn2 | Mn3 | Mn4 | Mn5 | Tr1 | Tr2 | Tr3 | Tr4 | Tr5 | Chip_pct | Shape1 | Shape2 | Max_Dia | Hardness | Colour | LithComment | Ore_Texture | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | CC0001 | CC | 0.0 | 2.0 | 1 | Ta | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
1 | CC0001 | CC | 0.0 | 1.0 | 1 | NaN | NaN | GH | GO | HO | NaN | NaN | MI | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 50.0 | SR | NaN | 20.0 | NaN | NaN | NaN | NaN |
2 | CC0001 | CC | 1.0 | 2.0 | 1 | NaN | NaN | GH | GO | NaN | NaN | NaN | MI | NaN | NaN | NaN | NaN | HS | NaN | NaN | NaN | NaN | 70.0 | SR | NaN | 20.0 | NaN | NaN | NaN | NaN |
3 | CC0001 | CC | 2.0 | 3.0 | 1 | NaN | NaN | GOM | HO | GH | NaN | NaN | MI | NaN | NaN | NaN | NaN | HS | NaN | NaN | NaN | NaN | 50.0 | SR | NaN | 15.0 | NaN | NaN | NaN | NaN |
4 | CC0001 | CC | 2.0 | 4.0 | 1 | Tdi | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
5 | CC0001 | CC | 3.0 | 4.0 | 1 | NaN | NaN | HO | GO | NaN | NaN | NaN | MI | GH | NaN | NaN | NaN | HS | KS | NaN | NaN | NaN | 50.0 | SR | NaN | 15.0 | NaN | NaN | NaN | NaN |
6 | CC0001 | CC | 4.0 | 6.0 | 1 | Tds | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
7 | CC0001 | CC | 4.0 | 5.0 | 1 | NaN | NaN | HO | GH | GO | NaN | NaN | MI | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 60.0 | SR | NaN | 20.0 | NaN | NaN | NaN | NaN |
8 | CC0001 | CC | 5.0 | 6.0 | 1 | NaN | NaN | GH | GO | NaN | NaN | NaN | MI | HO | OG | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 60.0 | SR | NaN | 20.0 | NaN | NaN | NaN | NaN |
9 | CC0001 | CC | 6.0 | 7.0 | 1 | NaN | NaN | HO | GH | NaN | NaN | NaN | MI | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 80.0 | SR | NaN | 25.0 | NaN | NaN | NaN | NaN |