Dataset statistics
| Number of variables | 30 |
|---|---|
| Number of observations | 350997 |
| Missing cells | 6741010 |
| Missing cells (%) | 64.0% |
| Total size in memory | 80.3 MiB |
| Average record size in memory | 240.0 B |
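The two derived figures in this table can be cross-checked against the raw counts. This is a minimal sketch. It assumes that "Missing cells (%)" is missing cells divided by variables × observations, and that "Average record size" is total memory divided by observations. Both definitions are a reading of the labels, not something the report states.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 30
n_observations = 350_997
missing_cells = 6_741_010
total_size_mib = 80.3  # already rounded to one decimal in the report

# Assumed definition: share of all cells (rows x columns) that are missing.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumed definition: bytes of memory per row (1 MiB = 1024 * 1024 bytes).
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results agree with the table once the rounding of the 80.3 MiB input is taken into account.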
Variable types
| Text | 23 |
|---|---|
| Numeric | 6 |
| Unsupported | 1 |
Dataset
| Description | A dataset from the WAMEX database. |
|---|---|
| URL | https://www.dmp.wa.gov.au/WAMEX-Minerals-Exploration-1476.aspx |
PRIORITY has constant value "1" | Constant |
Strat_Sum has 305746 (87.1%) missing values | Missing |
Strat has 346567 (98.7%) missing values | Missing |
Mj1 has 54470 (15.5%) missing values | Missing |
Mj2 has 172094 (49.0%) missing values | Missing |
Mj3 has 261399 (74.5%) missing values | Missing |
Mj4 has 302371 (86.1%) missing values | Missing |
Mj5 has 350871 (> 99.9%) missing values | Missing |
Mn1 has 223856 (63.8%) missing values | Missing |
Mn2 has 298133 (84.9%) missing values | Missing |
Mn3 has 325115 (92.6%) missing values | Missing |
Mn4 has 332154 (94.6%) missing values | Missing |
Mn5 has 350982 (> 99.9%) missing values | Missing |
Tr1 has 303543 (86.5%) missing values | Missing |
Tr2 has 337810 (96.2%) missing values | Missing |
Tr3 has 344588 (98.2%) missing values | Missing |
Tr4 has 339159 (96.6%) missing values | Missing |
Tr5 has 350836 (> 99.9%) missing values | Missing |
Chip_pct has 161178 (45.9%) missing values | Missing |
Shape1 has 71713 (20.4%) missing values | Missing |
Shape2 has 342475 (97.6%) missing values | Missing |
Max_Dia has 62831 (17.9%) missing values | Missing |
Hardness has 350997 (100.0%) missing values | Missing |
Colour has 102224 (29.1%) missing values | Missing |
LithComment has 302145 (86.1%) missing values | Missing |
Ore_Texture has 347753 (99.1%) missing values | Missing |
Hardness is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
GEOLFROM has 14973 (4.3%) zeros | Zeros |
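Some alerts above read "> 99.9%" rather than "100.0%", and some tables further down read "< 0.1%" rather than "0.0%". The sketch below shows one way such labels can be produced. The function `fmt_percent` is written for illustration and is an assumption about the formatting rule, not the tool's documented behaviour.

```python
def fmt_percent(fraction: float) -> str:
    """Format a fraction in [0, 1] as a percentage with guarded edge cases."""
    # Tiny but non-zero shares would otherwise print as a misleading "0.0%".
    if fraction > 0 and round(fraction, 3) == 0:
        return "< 0.1%"
    # Shares just short of 1 would otherwise print as a misleading "100.0%".
    if fraction < 1 and round(fraction, 3) >= 1:
        return "> 99.9%"
    return f"{100 * fraction:.1f}%"


n_rows = 350_997  # "Number of observations"
for name, n_missing in [("Mj1", 54_470), ("Mj5", 350_871), ("Hardness", 350_997)]:
    print(name, fmt_percent(n_missing / n_rows))
```

With the counts from the alert list, this prints 15.5% for Mj1, "> 99.9%" for Mj5 (126 rows are populated) and 100.0% for Hardness, which has no populated rows at all.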
Reproduction
| Analysis started | 2023-07-19 23:01:26.125119 |
|---|---|
| Analysis finished | 2023-07-19 23:01:36.396523 |
| Duration | 10.27 seconds |
| Software version | ydata-profiling v4.3.1 |
| Download configuration | config.json |
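The reported duration follows directly from the two timestamps. A short check using only the standard library, with the format string assumed from the way the timestamps are printed:

```python
from datetime import datetime

# Format assumed from the timestamps as printed in the table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-07-19 23:01:26.125119", FMT)
finished = datetime.strptime("2023-07-19 23:01:36.396523", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```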
HOLEID
Text
| Distinct | 6403 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.011851953 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2110142 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
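"Distinct" and "Unique" are easy to confuse. Read the way the numbers suggest (an assumption, since the report does not define them), 6403 different hole IDs occur at least once, while only 4 occur on exactly one row. A small sketch with hypothetical IDs:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# Hypothetical hole IDs, not taken from the dataset.
sample = ["CC0001", "CC0001", "CC0002", "CB0007", "CC0002", "WK0003"]
print(distinct_and_unique(sample))  # (4, 2)
```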
Sample
| 1st row | CC0001 |
|---|---|
| 2nd row | CC0001 |
| 3rd row | CC0001 |
| 4th row | CC0001 |
| 5th row | CC0001 |
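The sample rows above are upper case (`CC0001`), while the value counts that follow are lower case (`cc0001`). One plausible explanation, offered as an assumption rather than documented behaviour, is that values are case-folded before they are counted. A sketch with hypothetical values:

```python
from collections import Counter

# Hypothetical raw values with mixed case, not taken from the dataset.
raw = ["CC0001", "CC0001", "cc0001", "CC1164", "CC1164"]

# Fold case first so that "CC0001" and "cc0001" land in the same bucket.
counts = Counter(value.lower() for value in raw)

for value, count in counts.most_common():
    print(value, count, f"{100 * count / len(raw):.1f}%")
```

If the raw column mixes cases, counts produced this way will not match a case-sensitive `GROUP BY` on the source table.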
| Value | Count | Frequency (%) |
| cc1164 | 314 | 0.1% |
| cc1595 | 298 | 0.1% |
| cc1596 | 284 | 0.1% |
| cc1165 | 268 | 0.1% |
| cc1166 | 268 | 0.1% |
| cc1459 | 264 | 0.1% |
| cc1443 | 256 | 0.1% |
| cc1442 | 250 | 0.1% |
| cc0001 | 236 | 0.1% |
| cc1441 | 236 | 0.1% |
| Other values (6393) | 348323 | 99.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 584375 | 27.7% |
| 1 | 238862 | 11.3% |
| 0 | 197982 | 9.4% |
| 2 | 191223 | 9.1% |
| 3 | 158961 | 7.5% |
| 4 | 114346 | 5.4% |
| 5 | 104310 | 4.9% |
| 6 | 103931 | 4.9% |
| 9 | 101964 | 4.8% |
| 7 | 96961 | 4.6% |
| Other values (7) | 217227 | 10.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1403988 | 66.5% |
| Uppercase Letter | 706154 | 33.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 238862 | 17.0% |
| 0 | 197982 | 14.1% |
| 2 | 191223 | 13.6% |
| 3 | 158961 | 11.3% |
| 4 | 114346 | 8.1% |
| 5 | 104310 | 7.4% |
| 6 | 103931 | 7.4% |
| 9 | 101964 | 7.3% |
| 7 | 96961 | 6.9% |
| 8 | 95448 | 6.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 584375 | 82.8% |
| B | 89825 | 12.7% |
| W | 14447 | 2.0% |
| K | 10434 | 1.5% |
| F | 4013 | 0.6% |
| D | 1960 | 0.3% |
| P | 1100 | 0.2% |
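The category tables group characters by Unicode general category. A minimal sketch of that grouping using the standard `unicodedata` module, run on hypothetical IDs. The short codes it returns correspond to the labels used here: `Nd` is "Decimal Number" and `Lu` is "Uppercase Letter".

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters by Unicode general category code (e.g. 'Lu', 'Nd')."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


# Hypothetical hole IDs: two letters followed by four digits.
sample = ["CC0001", "CB1164", "WK0003"]
counts = category_counts(sample)
total = sum(counts.values())

for code, n in counts.most_common():
    print(code, n, f"{100 * n / total:.1f}%")
```

For IDs of this shape, roughly two thirds of the characters are digits and one third are letters, which is the same split the HOLEID tables show.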
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1403988 | 66.5% |
| Latin | 706154 | 33.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 238862 | 17.0% |
| 0 | 197982 | 14.1% |
| 2 | 191223 | 13.6% |
| 3 | 158961 | 11.3% |
| 4 | 114346 | 8.1% |
| 5 | 104310 | 7.4% |
| 6 | 103931 | 7.4% |
| 9 | 101964 | 7.3% |
| 7 | 96961 | 6.9% |
| 8 | 95448 | 6.8% |
Latin
| Value | Count | Frequency (%) |
| C | 584375 | 82.8% |
| B | 89825 | 12.7% |
| W | 14447 | 2.0% |
| K | 10434 | 1.5% |
| F | 4013 | 0.6% |
| D | 1960 | 0.3% |
| P | 1100 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2110142 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 584375 | 27.7% |
| 1 | 238862 | 11.3% |
| 0 | 197982 | 9.4% |
| 2 | 191223 | 9.1% |
| 3 | 158961 | 7.5% |
| 4 | 114346 | 5.4% |
| 5 | 104310 | 4.9% |
| 6 | 103931 | 4.9% |
| 9 | 101964 | 4.8% |
| 7 | 96961 | 4.6% |
| Other values (7) | 217227 | 10.3% |
PROJECTCODE
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 701994 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC |
|---|---|
| 2nd row | CC |
| 3rd row | CC |
| 4th row | CC |
| 5th row | CC |
| Value | Count | Frequency (%) |
| cc | 246725 | 70.3% |
| cb | 89825 | 25.6% |
| wk | 10434 | 3.0% |
| wf | 4013 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 583275 | 83.1% |
| B | 89825 | 12.8% |
| W | 14447 | 2.1% |
| K | 10434 | 1.5% |
| F | 4013 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 701994 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 583275 | 83.1% |
| B | 89825 | 12.8% |
| W | 14447 | 2.1% |
| K | 10434 | 1.5% |
| F | 4013 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 701994 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 583275 | 83.1% |
| B | 89825 | 12.8% |
| W | 14447 | 2.1% |
| K | 10434 | 1.5% |
| F | 4013 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 701994 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 583275 | 83.1% |
| B | 89825 | 12.8% |
| W | 14447 | 2.1% |
| K | 10434 | 1.5% |
| F | 4013 | 0.6% |
GEOLFROM
Real number (ℝ)
ZEROS 
| Distinct | 614 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.56020222 |
| Minimum | 0 |
| Maximum | 143 |
| Zeros | 14973 |
| Zeros (%) | 4.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 20 |
| Q3 | 35 |
| 95-th percentile | 64 |
| Maximum | 143 |
| Range | 143 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 20.26223983 |
|---|---|
| Coefficient of variation (CV) | 0.8250029722 |
| Kurtosis | 1.639174278 |
| Mean | 24.56020222 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 1.216507032 |
| Sum | 8620557.3 |
| Variance | 410.558363 |
| Monotonicity | Not monotonic |
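Several GEOLFROM figures are determined by the others, which gives a quick consistency check on the summary. The sketch below recomputes them from the reported values using the standard definitions: mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 − Q1, range = maximum − minimum.

```python
import math

# Figures copied from the GEOLFROM tables.
n = 350_997               # observations, none missing
total = 8_620_557.3       # "Sum"
std = 20.26223983         # "Standard deviation"
q1, q3 = 9, 35
minimum, maximum = 0, 143

mean = total / n
variance = std ** 2
cv = std / mean
iqr = q3 - q1
value_range = maximum - minimum

print(f"mean={mean:.4f} variance={variance:.4f} cv={cv:.4f}")
print(f"iqr={iqr} range={value_range}")

# Compare against the values printed in the report (tolerance covers rounding).
print(math.isclose(mean, 24.56020222, rel_tol=1e-6))
print(math.isclose(variance, 410.558363, rel_tol=1e-6))
print(math.isclose(cv, 0.8250029722, rel_tol=1e-6))
```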
| Value | Count | Frequency (%) |
| 0 | 14973 | 4.3% |
| 6 | 8581 | 2.4% |
| 4 | 8562 | 2.4% |
| 12 | 8543 | 2.4% |
| 5 | 8506 | 2.4% |
| 9 | 8502 | 2.4% |
| 10 | 8487 | 2.4% |
| 7 | 8465 | 2.4% |
| 3 | 8464 | 2.4% |
| 8 | 8456 | 2.4% |
| Other values (604) | 259458 | 73.9% |
| Value | Count | Frequency (%) |
| 0 | 14973 | 4.3% |
| 0.2 | 2 | < 0.1% |
| 0.3 | 2 | < 0.1% |
| 0.4 | 1 | < 0.1% |
| 0.5 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 143 | 2 | < 0.1% |
| 142 | 2 | < 0.1% |
| 141 | 2 | < 0.1% |
| 140 | 2 | < 0.1% |
| 139 | 4 | < 0.1% |
GEOLTO
Real number (ℝ)
| Distinct | 631 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.30145272 |
| Minimum | 0.2 |
| Maximum | 144 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0.2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 22 |
| Q3 | 37 |
| 95-th percentile | 66 |
| Maximum | 144 |
| Range | 143.8 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 20.28137304 |
|---|---|
| Coefficient of variation (CV) | 0.771112275 |
| Kurtosis | 1.634311732 |
| Mean | 26.30145272 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 1.214693604 |
| Sum | 9231731 |
| Variance | 411.3340925 |
| Monotonicity | Not monotonic |
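If GEOLFROM and GEOLTO mark the start and end of each logged interval (an interpretation of the column names, not something the report states), the mean interval length can be read off the two "Sum" rows without touching the raw data, because the mean of a difference equals the difference of the means. Units are not given in the report.

```python
# Figures copied from the GEOLFROM and GEOLTO tables.
n = 350_997              # observations, no missing values in either column
sum_from = 8_620_557.3   # GEOLFROM "Sum"
sum_to = 9_231_731       # GEOLTO "Sum"

# mean(GEOLTO - GEOLFROM) == (sum(GEOLTO) - sum(GEOLFROM)) / n
mean_interval_length = (sum_to - sum_from) / n
print(f"{mean_interval_length:.3f}")
```

The same figure is the gap between the two reported means (26.30145272 − 24.56020222).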
| Value | Count | Frequency (%) |
| 10 | 8744 | 2.5% |
| 16 | 8712 | 2.5% |
| 12 | 8671 | 2.5% |
| 4 | 8602 | 2.5% |
| 6 | 8597 | 2.4% |
| 5 | 8524 | 2.4% |
| 9 | 8521 | 2.4% |
| 7 | 8503 | 2.4% |
| 3 | 8487 | 2.4% |
| 8 | 8451 | 2.4% |
| Other values (621) | 265185 |
| Value | Count | Frequency (%) |
| 0.2 | 2 | < 0.1% |
| 0.3 | 2 | < 0.1% |
| 0.4 | 1 | < 0.1% |
| 0.5 | 13 | |
| 0.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 144 | 4 | |
| 143 | 2 | < 0.1% |
| 142 | 2 | < 0.1% |
| 141 | 2 | < 0.1% |
| 140 | 6 |
PRIORITY
Real number (ℝ)
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1 |
| Minimum | 1 |
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 0 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0 |
|---|---|
| Coefficient of variation (CV) | 0 |
| Kurtosis | 0 |
| Mean | 1 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0 |
| Sum | 350997 |
| Variance | 0 |
| Monotonicity | Increasing |
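A column that never changes can still be labelled "Increasing" if the check allows ties. The sketch below assumes a non-strict comparison, which is how pandas' `Series.is_monotonic_increasing` behaves. Whether the report uses exactly this test is an assumption.

```python
def is_monotonic_increasing(values):
    """True when no element is smaller than the one before it (ties allowed)."""
    return all(a <= b for a, b in zip(values, values[1:]))


print(is_monotonic_increasing([1, 1, 1, 1]))  # constant column -> True
print(is_monotonic_increasing([0, 2, 1]))     # drops from 2 to 1 -> False
```

For PRIORITY the label therefore carries no information beyond the Constant alert.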
| Value | Count | Frequency (%) |
| 1 | 350997 | 100.0% |
| Value | Count | Frequency (%) |
| 1 | 350997 | 100.0% |
| Value | Count | Frequency (%) |
| 1 | 350997 | 100.0% |
Strat_Sum
Text
MISSING 
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 305746 |
| Missing (%) | 87.1% |
| Memory size | 2.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.740580319 |
| Min length | 2 |
Characters and Unicode
| Total characters | 124014 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ta |
|---|---|
| 2nd row | Tdi |
| 3rd row | Tds |
| 4th row | Tdm |
| 5th row | MUb |
| Value | Count | Frequency (%) |
| mum | 8268 | 18.3% |
| mub | 7237 | 16.0% |
| hc | 5263 | 11.6% |
| ta | 4720 | 10.4% |
| tdi | 3116 | 6.9% |
| mus | 2935 | 6.5% |
| muh | 2914 | 6.4% |
| mut | 2504 | 5.5% |
| tds | 2154 | 4.8% |
| muf | 2041 | 4.5% |
| Other values (14) | 4099 | 9.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 27027 | 21.8% |
| U | 27027 | 21.8% |
| T | 11219 | 9.0% |
| m | 8753 | 7.1% |
| b | 7237 | 5.8% |
| d | 5755 | 4.6% |
| H | 5567 | 4.5% |
| s | 5389 | 4.3% |
| c | 5270 | 4.2% |
| a | 4720 | 3.8% |
| Other values (19) | 16050 | 12.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 73221 | 59.0% |
| Lowercase Letter | 50793 | 41.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 8753 | 17.2% |
| b | 7237 | 14.2% |
| d | 5755 | 11.3% |
| s | 5389 | 10.6% |
| c | 5270 | 10.4% |
| a | 4720 | 9.3% |
| i | 3116 | 6.1% |
| h | 2914 | 5.7% |
| t | 2504 | 4.9% |
| f | 2334 | 4.6% |
| Other values (5) | 2801 | 5.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 27027 | 36.9% |
| U | 27027 | 36.9% |
| T | 11219 | 15.3% |
| H | 5567 | 7.6% |
| C | 778 | 1.1% |
| J | 543 | 0.7% |
| D | 434 | 0.6% |
| I | 430 | 0.6% |
| F | 93 | 0.1% |
| L | 55 | 0.1% |
| Other values (4) | 48 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 124014 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 27027 | 21.8% |
| U | 27027 | 21.8% |
| T | 11219 | 9.0% |
| m | 8753 | 7.1% |
| b | 7237 | 5.8% |
| d | 5755 | 4.6% |
| H | 5567 | 4.5% |
| s | 5389 | 4.3% |
| c | 5270 | 4.2% |
| a | 4720 | 3.8% |
| Other values (19) | 16050 | 12.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 124014 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 27027 | 21.8% |
| U | 27027 | 21.8% |
| T | 11219 | 9.0% |
| m | 8753 | 7.1% |
| b | 7237 | 5.8% |
| d | 5755 | 4.6% |
| H | 5567 | 4.5% |
| s | 5389 | 4.3% |
| c | 5270 | 4.2% |
| a | 4720 | 3.8% |
| Other values (19) | 16050 | 12.9% |
Strat
Text
MISSING 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 346567 |
| Missing (%) | 98.7% |
| Memory size | 2.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.720993228 |
| Min length | 2 |
Characters and Unicode
| Total characters | 12054 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ta |
|---|---|
| 2nd row | Ta |
| 3rd row | Ta |
| 4th row | Ta |
| 5th row | Ta |
| Value | Count | Frequency (%) |
| mub | 836 | |
| ta | 702 | |
| mum | 648 | |
| tdi | 504 | |
| hc | 372 | |
| muh | 268 | 6.0% |
| tds | 240 | 5.4% |
| mut | 216 | 4.9% |
| muf | 192 | 4.3% |
| tdm | 142 | 3.2% |
| Other values (6) | 310 | 7.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 2306 | |
| U | 2306 | |
| T | 1588 | |
| d | 886 | 7.4% |
| b | 836 | 6.9% |
| m | 790 | 6.6% |
| a | 702 | 5.8% |
| i | 504 | 4.2% |
| H | 374 | 3.1% |
| c | 372 | 3.1% |
| Other values (11) | 1390 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6736 | |
| Lowercase Letter | 5318 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 886 | |
| b | 836 | |
| m | 790 | |
| a | 702 | |
| i | 504 | |
| c | 372 | |
| s | 370 | |
| h | 268 | 5.0% |
| f | 224 | 4.2% |
| t | 216 | 4.1% |
| Other values (4) | 150 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2306 | |
| U | 2306 | |
| T | 1588 | |
| H | 374 | 5.6% |
| J | 108 | 1.6% |
| C | 32 | 0.5% |
| F | 22 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12054 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 2306 | |
| U | 2306 | |
| T | 1588 | |
| d | 886 | 7.4% |
| b | 836 | 6.9% |
| m | 790 | 6.6% |
| a | 702 | 5.8% |
| i | 504 | 4.2% |
| H | 374 | 3.1% |
| c | 372 | 3.1% |
| Other values (11) | 1390 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12054 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 2306 | |
| U | 2306 | |
| T | 1588 | |
| d | 886 | 7.4% |
| b | 836 | 6.9% |
| m | 790 | 6.6% |
| a | 702 | 5.8% |
| i | 504 | 4.2% |
| H | 374 | 3.1% |
| c | 372 | 3.1% |
| Other values (11) | 1390 |
Mj1
Text
MISSING 
| Distinct | 88 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 54470 |
| Missing (%) | 15.5% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.737501138 |
| Min length | 2 |
Characters and Unicode
| Total characters | 811743 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | GH |
|---|---|
| 2nd row | GH |
| 3rd row | GOM |
| 4th row | HO |
| 5th row | HO |
| Value | Count | Frequency (%) |
| ch | 70733 | |
| ghm | 36371 | |
| gom | 36269 | |
| klf | 25300 | 8.5% |
| vgh | 21633 | 7.3% |
| goh | 10550 | 3.6% |
| klp | 8517 | 2.9% |
| hsm | 8269 | 2.8% |
| hom | 8224 | 2.8% |
| shm | 7850 | 2.6% |
| Other values (78) | 62811 |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 201610 | |
| G | 137454 | |
| M | 114341 | |
| C | 72925 | 9.0% |
| O | 62691 | 7.7% |
| F | 50314 | 6.2% |
| K | 40407 | 5.0% |
| L | 40224 | 5.0% |
| S | 40124 | 4.9% |
| V | 21633 | 2.7% |
| Other values (16) | 30020 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 804961 | |
| Lowercase Letter | 6782 | 0.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 201610 | |
| G | 137454 | |
| M | 114341 | |
| C | 72925 | 9.1% |
| O | 62691 | 7.8% |
| F | 50314 | 6.3% |
| K | 40407 | 5.0% |
| L | 40224 | 5.0% |
| S | 40124 | 5.0% |
| V | 21633 | 2.7% |
| Other values (13) | 23238 | 2.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 4031 | |
| p | 2747 | |
| c | 4 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 811743 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 201610 | |
| G | 137454 | |
| M | 114341 | |
| C | 72925 | 9.0% |
| O | 62691 | 7.7% |
| F | 50314 | 6.2% |
| K | 40407 | 5.0% |
| L | 40224 | 5.0% |
| S | 40124 | 4.9% |
| V | 21633 | 2.7% |
| Other values (16) | 30020 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 811743 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 201610 | |
| G | 137454 | |
| M | 114341 | |
| C | 72925 | 9.0% |
| O | 62691 | 7.7% |
| F | 50314 | 6.2% |
| K | 40407 | 5.0% |
| L | 40224 | 5.0% |
| S | 40124 | 4.9% |
| V | 21633 | 2.7% |
| Other values (16) | 30020 | 3.7% |
Mj2
Text
MISSING 
| Distinct | 93 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 172094 |
| Missing (%) | 49.0% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.856922466 |
| Min length | 2 |
Characters and Unicode
| Total characters | 511112 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | GO |
|---|---|
| 2nd row | GO |
| 3rd row | HO |
| 4th row | GO |
| 5th row | GH |
| Value | Count | Frequency (%) |
| gom | 22089 | |
| ghm | 22052 | |
| ch | 20268 | |
| shm | 16400 | 9.2% |
| klf | 10474 | 5.9% |
| vgh | 10177 | 5.7% |
| hsm | 8827 | 4.9% |
| ogf | 7288 | 4.1% |
| goh | 6778 | 3.8% |
| hom | 5861 | 3.3% |
| Other values (83) | 48689 |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 121231 | |
| G | 91153 | |
| M | 88331 | |
| S | 47255 | 9.2% |
| O | 44786 | 8.8% |
| F | 37703 | 7.4% |
| C | 22148 | 4.3% |
| K | 14956 | 2.9% |
| L | 14851 | 2.9% |
| V | 10177 | 2.0% |
| Other values (15) | 18521 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 506839 | |
| Lowercase Letter | 4273 | 0.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 121231 | |
| G | 91153 | |
| M | 88331 | |
| S | 47255 | 9.3% |
| O | 44786 | 8.8% |
| F | 37703 | 7.4% |
| C | 22148 | 4.4% |
| K | 14956 | 3.0% |
| L | 14851 | 2.9% |
| V | 10177 | 2.0% |
| Other values (13) | 14248 | 2.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 2421 | |
| p | 1852 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 511112 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 121231 | |
| G | 91153 | |
| M | 88331 | |
| S | 47255 | 9.2% |
| O | 44786 | 8.8% |
| F | 37703 | 7.4% |
| C | 22148 | 4.3% |
| K | 14956 | 2.9% |
| L | 14851 | 2.9% |
| V | 10177 | 2.0% |
| Other values (15) | 18521 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 511112 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 121231 | |
| G | 91153 | |
| M | 88331 | |
| S | 47255 | 9.2% |
| O | 44786 | 8.8% |
| F | 37703 | 7.4% |
| C | 22148 | 4.3% |
| K | 14956 | 2.9% |
| L | 14851 | 2.9% |
| V | 10177 | 2.0% |
| Other values (15) | 18521 | 3.6% |
Mj3
Text
MISSING 
| Distinct | 81 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 261399 |
| Missing (%) | 74.5% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.84962834 |
| Min length | 2 |
Characters and Unicode
| Total characters | 255321 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | HO |
|---|---|
| 2nd row | GH |
| 3rd row | GO |
| 4th row | GH |
| 5th row | GO |
| Value | Count | Frequency (%) |
| ghm | 10636 | |
| gom | 9475 | 10.6% |
| klf | 8177 | 9.1% |
| ch | 7807 | 8.7% |
| shm | 7383 | 8.2% |
| hsm | 4924 | 5.5% |
| vgh | 4622 | 5.2% |
| ogf | 4147 | 4.6% |
| hsf | 3350 | 3.7% |
| goh | 3037 | 3.4% |
| Other values (71) | 26040 |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 54241 | |
| G | 42328 | |
| M | 42163 | |
| S | 24021 | |
| F | 22568 | |
| O | 20207 | 7.9% |
| K | 12130 | 4.8% |
| L | 11997 | 4.7% |
| C | 8598 | 3.4% |
| V | 4623 | 1.8% |
| Other values (15) | 12445 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 253628 | |
| Lowercase Letter | 1693 | 0.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 54241 | |
| G | 42328 | |
| M | 42163 | |
| S | 24021 | |
| F | 22568 | |
| O | 20207 | 8.0% |
| K | 12130 | 4.8% |
| L | 11997 | 4.7% |
| C | 8598 | 3.4% |
| V | 4623 | 1.8% |
| Other values (13) | 10752 | 4.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1020 | |
| p | 673 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 255321 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 54241 | |
| G | 42328 | |
| M | 42163 | |
| S | 24021 | |
| F | 22568 | |
| O | 20207 | 7.9% |
| K | 12130 | 4.8% |
| L | 11997 | 4.7% |
| C | 8598 | 3.4% |
| V | 4623 | 1.8% |
| Other values (15) | 12445 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 255321 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 54241 | |
| G | 42328 | |
| M | 42163 | |
| S | 24021 | |
| F | 22568 | |
| O | 20207 | 7.9% |
| K | 12130 | 4.8% |
| L | 11997 | 4.7% |
| C | 8598 | 3.4% |
| V | 4623 | 1.8% |
| Other values (15) | 12445 | 4.9% |
Mj4
Text
MISSING 
| Distinct | 74 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 302371 |
| Missing (%) | 86.1% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.849751162 |
| Min length | 2 |
Characters and Unicode
| Total characters | 138572 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | GSM |
|---|---|
| 2nd row | HSM |
| 3rd row | GOM |
| 4th row | GSF |
| 5th row | CH |
| Value | Count | Frequency (%) |
| ghm | 6031 | |
| gom | 5337 | 11.0% |
| klp | 3791 | 7.8% |
| ch | 3601 | 7.4% |
| shm | 3414 | 7.0% |
| vgh | 2817 | 5.8% |
| klf | 2772 | 5.7% |
| hsm | 2614 | 5.4% |
| pi | 1846 | 3.8% |
| goh | 1808 | 3.7% |
| Other values (64) | 14595 |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 27896 | |
| G | 23497 | |
| M | 23247 | |
| S | 11972 | |
| O | 10645 | 7.7% |
| F | 9135 | 6.6% |
| K | 7411 | 5.3% |
| L | 7316 | 5.3% |
| P | 5874 | 4.2% |
| C | 3990 | 2.9% |
| Other values (15) | 7589 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 137753 | |
| Lowercase Letter | 819 | 0.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 27896 | |
| G | 23497 | |
| M | 23247 | |
| S | 11972 | |
| O | 10645 | 7.7% |
| F | 9135 | 6.6% |
| K | 7411 | 5.4% |
| L | 7316 | 5.3% |
| P | 5874 | 4.3% |
| C | 3990 | 2.9% |
| Other values (13) | 6770 | 4.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 602 | |
| p | 217 | 26.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 138572 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 27896 | |
| G | 23497 | |
| M | 23247 | |
| S | 11972 | |
| O | 10645 | 7.7% |
| F | 9135 | 6.6% |
| K | 7411 | 5.3% |
| L | 7316 | 5.3% |
| P | 5874 | 4.2% |
| C | 3990 | 2.9% |
| Other values (15) | 7589 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 138572 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 27896 | |
| G | 23497 | |
| M | 23247 | |
| S | 11972 | |
| O | 10645 | 7.7% |
| F | 9135 | 6.6% |
| K | 7411 | 5.3% |
| L | 7316 | 5.3% |
| P | 5874 | 4.2% |
| C | 3990 | 2.9% |
| Other values (15) | 7589 | 5.5% |
Mj5
Text
MISSING 
| Distinct | 25 |
|---|---|
| Distinct (%) | 19.8% |
| Missing | 350871 |
| Missing (%) | > 99.9% |
| Memory size | 2.7 MiB |
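For sparsely populated columns the percentages can look surprising: Mj5 has only 25 distinct values, yet "Distinct (%)" is 19.8%. The figures are consistent with percentages taken over populated rows rather than all rows. This is an inference from the numbers, not a documented rule. The check below uses the values reported for Mj5 in this section, including the unique count, unique percentage, total characters and mean length that appear further down.

```python
# Figures copied from the Mj5 section of the report.
n_rows = 350_997
n_missing = 350_871
n_distinct = 25
n_unique = 8
total_chars = 352

# Rows that actually carry a value.
n_present = n_rows - n_missing
print(n_present)                                  # 126

# Percentages relative to populated rows, not all rows.
print(f"{100 * n_distinct / n_present:.1f}%")     # 19.8%
print(f"{100 * n_unique / n_present:.1f}%")       # 6.3%

# Mean length = total characters / populated rows.
print(round(total_chars / n_present, 9))          # 2.793650794
```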
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.793650794 |
| Min length | 2 |
Characters and Unicode
| Total characters | 352 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | 6.3% |
Sample
| 1st row | GSF |
|---|---|
| 2nd row | PI |
| 3rd row | HSF |
| 4th row | HSF |
| 5th row | GHM |
| Value | Count | Frequency (%) |
| ghm | 25 | |
| gom | 17 | |
| vgh | 15 | |
| shm | 12 | |
| klf | 9 | 7.1% |
| ch | 7 | 5.6% |
| hsf | 5 | 4.0% |
| vg | 4 | 3.2% |
| gsf | 3 | 2.4% |
| rs | 3 | 2.4% |
| Other values (15) | 26 |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 71 | |
| H | 69 | |
| M | 65 | |
| S | 34 | |
| O | 24 | 6.8% |
| F | 22 | 6.2% |
| V | 19 | 5.4% |
| K | 12 | 3.4% |
| L | 12 | 3.4% |
| C | 7 | 2.0% |
| Other values (6) | 17 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 352 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 71 | |
| H | 69 | |
| M | 65 | |
| S | 34 | |
| O | 24 | 6.8% |
| F | 22 | 6.2% |
| V | 19 | 5.4% |
| K | 12 | 3.4% |
| L | 12 | 3.4% |
| C | 7 | 2.0% |
| Other values (6) | 17 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 352 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 71 | |
| H | 69 | |
| M | 65 | |
| S | 34 | |
| O | 24 | 6.8% |
| F | 22 | 6.2% |
| V | 19 | 5.4% |
| K | 12 | 3.4% |
| L | 12 | 3.4% |
| C | 7 | 2.0% |
| Other values (6) | 17 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 352 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 71 | |
| H | 69 | |
| M | 65 | |
| S | 34 | |
| O | 24 | 6.8% |
| F | 22 | 6.2% |
| V | 19 | 5.4% |
| K | 12 | 3.4% |
| L | 12 | 3.4% |
| C | 7 | 2.0% |
| Other values (6) | 17 | 4.8% |
Mn1
Text
MISSING 
| Distinct | 91 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 223856 |
| Missing (%) | 63.8% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.82475362 |
| Min length | 2 |
Characters and Unicode
| Total characters | 359142 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MI |
|---|---|
| 2nd row | MI |
| 3rd row | MI |
| 4th row | MI |
| 5th row | MI |
| Value | Count | Frequency (%) |
| ghm | 13482 | 10.6% |
| gom | 13066 | 10.3% |
| ch | 11988 | 9.4% |
| vgh | 8478 | 6.7% |
| ogf | 7901 | 6.2% |
| hsm | 7711 | 6.1% |
| klf | 6038 | 4.7% |
| hom | 5734 | 4.5% |
| shm | 5576 | 4.4% |
| gsm | 4530 | 3.6% |
| Other values (81) | 42637 |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 77352 | |
| G | 63125 | |
| M | 60334 | |
| S | 33429 | |
| O | 33218 | |
| F | 31648 | |
| C | 14521 | 4.0% |
| K | 10018 | 2.8% |
| L | 9304 | 2.6% |
| V | 8484 | 2.4% |
| Other values (15) | 17709 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 356731 | |
| Lowercase Letter | 2411 | 0.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 77352 | |
| G | 63125 | |
| M | 60334 | |
| S | 33429 | |
| O | 33218 | |
| F | 31648 | |
| C | 14521 | 4.1% |
| K | 10018 | 2.8% |
| L | 9304 | 2.6% |
| V | 8484 | 2.4% |
| Other values (13) | 15298 | 4.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1483 | |
| p | 928 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 359142 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 77352 | |
| G | 63125 | |
| M | 60334 | |
| S | 33429 | |
| O | 33218 | |
| F | 31648 | |
| C | 14521 | 4.0% |
| K | 10018 | 2.8% |
| L | 9304 | 2.6% |
| V | 8484 | 2.4% |
| Other values (15) | 17709 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 359142 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 77352 | |
| G | 63125 | |
| M | 60334 | |
| S | 33429 | |
| O | 33218 | |
| F | 31648 | |
| C | 14521 | 4.0% |
| K | 10018 | 2.8% |
| L | 9304 | 2.6% |
| V | 8484 | 2.4% |
| Other values (15) | 17709 | 4.9% |
Mn2
Text
MISSING 
| Distinct | 84 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 298133 |
| Missing (%) | 84.9% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.809000454 |
| Min length | 2 |
Characters and Unicode
| Total characters | 148495 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | GH |
|---|---|
| 2nd row | HO |
| 3rd row | OG |
| 4th row | OG |
| 5th row | HO |
| Value | Count | Frequency (%) |
| gom | 5480 | 10.4% |
| ghm | 4933 | 9.3% |
| ch | 3924 | 7.4% |
| hsm | 3674 | 6.9% |
| ogf | 3374 | 6.4% |
| vgh | 3180 | 6.0% |
| shm | 3129 | 5.9% |
| hom | 2201 | 4.2% |
| gsm | 2125 | 4.0% |
| mi | 2057 | 3.9% |
| Other values (74) | 18787 |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 29691 | |
| M | 27068 | |
| G | 25185 | |
| S | 15553 | |
| O | 13333 | |
| F | 12645 | |
| C | 4476 | 3.0% |
| K | 3880 | 2.6% |
| I | 3654 | 2.5% |
| L | 3506 | 2.4% |
| Other values (15) | 9504 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 147783 | |
| Lowercase Letter | 712 | 0.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 29691 | |
| M | 27068 | |
| G | 25185 | |
| S | 15553 | |
| O | 13333 | |
| F | 12645 | |
| C | 4476 | 3.0% |
| K | 3880 | 2.6% |
| I | 3654 | 2.5% |
| L | 3506 | 2.4% |
| Other values (13) | 8792 | 5.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 422 | |
| p | 290 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 148495 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 29691 | |
| M | 27068 | |
| G | 25185 | |
| S | 15553 | |
| O | 13333 | |
| F | 12645 | |
| C | 4476 | 3.0% |
| K | 3880 | 2.6% |
| I | 3654 | 2.5% |
| L | 3506 | 2.4% |
| Other values (15) | 9504 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 148495 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 29691 | |
| M | 27068 | |
| G | 25185 | |
| S | 15553 | |
| O | 13333 | |
| F | 12645 | |
| C | 4476 | 3.0% |
| K | 3880 | 2.6% |
| I | 3654 | 2.5% |
| L | 3506 | 2.4% |
| Other values (15) | 9504 | 6.4% |
Mn3
Text
MISSING 
| Distinct | 72 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 325115 |
| Missing (%) | 92.6% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.742485125 |
| Min length | 2 |
Characters and Unicode
| Total characters | 70981 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OG |
|---|---|
| 2nd row | WC |
| 3rd row | KS |
| 4th row | GOM |
| 5th row | GSF |
| Value | Count | Frequency (%) |
| gom | 2394 | 9.2% |
| ghm | 2379 | 9.2% |
| mi | 2207 | 8.5% |
| ch | 1878 | 7.3% |
| hsm | 1632 | 6.3% |
| shm | 1327 | 5.1% |
| vgh | 1317 | 5.1% |
| gsf | 1205 | 4.7% |
| gsm | 1108 | 4.3% |
| hom | 1089 | 4.2% |
| Other values (62) | 9346 | 36.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 13954 | 19.7% |
| H | 12977 | 18.3% |
| G | 11303 | 15.9% |
| S | 7297 | 10.3% |
| O | 5429 | 7.6% |
| F | 5406 | 7.6% |
| I | 3278 | 4.6% |
| K | 2194 | 3.1% |
| C | 2105 | 3.0% |
| P | 2063 | 2.9% |
| Other values (15) | 4975 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 70747 | 99.7% |
| Lowercase Letter | 234 | 0.3% |
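As a rough illustration of where character and category counts like these come from, the sketch below tallies characters and their Unicode general categories (`Lu` for uppercase letters, `Ll` for lowercase letters) with Python's standard `unicodedata` module. It is not the profiling tool's own implementation. The function name `char_category_counts` and the use of the five Mn3 sample rows as input are assumptions made for the example.

```python
from collections import Counter
import unicodedata


def char_category_counts(values):
    """Count characters and Unicode general categories over non-missing strings."""
    chars = Counter()
    categories = Counter()
    for value in values:
        if value is None:  # skip missing entries
            continue
        for ch in value:
            chars[ch] += 1
            categories[unicodedata.category(ch)] += 1
    return chars, categories


# The five Mn3 sample rows shown in this report, plus one missing value.
sample = ["OG", "WC", "KS", "GOM", "GSF", None]
chars, categories = char_category_counts(sample)

print(chars.most_common(3))
print(dict(categories))
```

On the full column, the same tally would be run over every non-missing value rather than a five-row sample.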
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 13954 | 19.7% |
| H | 12977 | 18.3% |
| G | 11303 | 16.0% |
| S | 7297 | 10.3% |
| O | 5429 | 7.7% |
| F | 5406 | 7.6% |
| I | 3278 | 4.6% |
| K | 2194 | 3.1% |
| C | 2105 | 3.0% |
| P | 2063 | 2.9% |
| Other values (13) | 4741 | 6.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 121 | 51.7% |
| p | 113 | 48.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 70981 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 13954 | 19.7% |
| H | 12977 | 18.3% |
| G | 11303 | 15.9% |
| S | 7297 | 10.3% |
| O | 5429 | 7.6% |
| F | 5406 | 7.6% |
| I | 3278 | 4.6% |
| K | 2194 | 3.1% |
| C | 2105 | 3.0% |
| P | 2063 | 2.9% |
| Other values (15) | 4975 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70981 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 13954 | 19.7% |
| H | 12977 | 18.3% |
| G | 11303 | 15.9% |
| S | 7297 | 10.3% |
| O | 5429 | 7.6% |
| F | 5406 | 7.6% |
| I | 3278 | 4.6% |
| K | 2194 | 3.1% |
| C | 2105 | 3.0% |
| P | 2063 | 2.9% |
| Other values (15) | 4975 | 7.0% |
Mn4
Text
MISSING 
| Distinct | 62 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 332154 |
| Missing (%) | 94.6% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.649684233 |
| Min length | 2 |
Characters and Unicode
| Total characters | 49928 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | HSM |
|---|---|
| 2nd row | CH |
| 3rd row | GOM |
| 4th row | VGH |
| 5th row | CH |
| Value | Count | Frequency (%) |
| mi | 1855 | 9.8% |
| ch | 1576 | 8.4% |
| gom | 1553 | 8.2% |
| pi | 1511 | 8.0% |
| ghm | 1088 | 5.8% |
| klp | 938 | 5.0% |
| vgh | 888 | 4.7% |
| gsf | 861 | 4.6% |
| hsf | 804 | 4.3% |
| hsm | 761 | 4.0% |
| Other values (52) | 7008 | 37.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 8861 | 17.7% |
| H | 8107 | 16.2% |
| G | 6599 | 13.2% |
| S | 4728 | 9.5% |
| F | 3961 | 7.9% |
| I | 3370 | 6.7% |
| O | 3290 | 6.6% |
| P | 2527 | 5.1% |
| K | 2222 | 4.5% |
| L | 1720 | 3.4% |
| Other values (15) | 4543 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 49808 | 99.8% |
| Lowercase Letter | 120 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 8861 | 17.8% |
| H | 8107 | 16.3% |
| G | 6599 | 13.2% |
| S | 4728 | 9.5% |
| F | 3961 | 8.0% |
| I | 3370 | 6.8% |
| O | 3290 | 6.6% |
| P | 2527 | 5.1% |
| K | 2222 | 4.5% |
| L | 1720 | 3.5% |
| Other values (13) | 4423 | 8.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 68 | 56.7% |
| p | 52 | 43.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49928 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 8861 | 17.7% |
| H | 8107 | 16.2% |
| G | 6599 | 13.2% |
| S | 4728 | 9.5% |
| F | 3961 | 7.9% |
| I | 3370 | 6.7% |
| O | 3290 | 6.6% |
| P | 2527 | 5.1% |
| K | 2222 | 4.5% |
| L | 1720 | 3.4% |
| Other values (15) | 4543 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49928 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 8861 | 17.7% |
| H | 8107 | 16.2% |
| G | 6599 | 13.2% |
| S | 4728 | 9.5% |
| F | 3961 | 7.9% |
| I | 3370 | 6.7% |
| O | 3290 | 6.6% |
| P | 2527 | 5.1% |
| K | 2222 | 4.5% |
| L | 1720 | 3.4% |
| Other values (15) | 4543 | 9.1% |
Mn5
Text
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 350982 |
| Missing (%) | > 99.9% |
| Memory size | 2.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 33 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 13.3% |
Sample
| 1st row | OGF |
|---|---|
| 2nd row | CH |
| 3rd row | PI |
| 4th row | PI |
| 5th row | CH |
| Value | Count | Frequency (%) |
| pi | 6 | 40.0% |
| ch | 5 | 33.3% |
| ogf | 2 | 13.3% |
| ys | 1 | 6.7% |
| ggm | 1 | 6.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 6 | 18.2% |
| I | 6 | 18.2% |
| C | 5 | 15.2% |
| H | 5 | 15.2% |
| G | 4 | 12.1% |
| O | 2 | 6.1% |
| F | 2 | 6.1% |
| Y | 1 | 3.0% |
| S | 1 | 3.0% |
| M | 1 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 33 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 6 | 18.2% |
| I | 6 | 18.2% |
| C | 5 | 15.2% |
| H | 5 | 15.2% |
| G | 4 | 12.1% |
| O | 2 | 6.1% |
| F | 2 | 6.1% |
| Y | 1 | 3.0% |
| S | 1 | 3.0% |
| M | 1 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 6 | 18.2% |
| I | 6 | 18.2% |
| C | 5 | 15.2% |
| H | 5 | 15.2% |
| G | 4 | 12.1% |
| O | 2 | 6.1% |
| F | 2 | 6.1% |
| Y | 1 | 3.0% |
| S | 1 | 3.0% |
| M | 1 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 6 | 18.2% |
| I | 6 | 18.2% |
| C | 5 | 15.2% |
| H | 5 | 15.2% |
| G | 4 | 12.1% |
| O | 2 | 6.1% |
| F | 2 | 6.1% |
| Y | 1 | 3.0% |
| S | 1 | 3.0% |
| M | 1 | 3.0% |
Tr1
Text
MISSING 
| Distinct | 76 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 303543 |
| Missing (%) | 86.5% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 2.320120538 |
| Min length | 2 |
Characters and Unicode
| Total characters | 110099 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | HS |
|---|---|
| 2nd row | HS |
| 3rd row | HS |
| 4th row | KS |
| 5th row | MI |
| Value | Count | Frequency (%) |
| mi | 18742 | 39.5% |
| mn | 3323 | 7.0% |
| ch | 2966 | 6.3% |
| ka | 2546 | 5.4% |
| ogf | 1891 | 4.0% |
| gom | 1468 | 3.1% |
| pi | 1420 | 3.0% |
| ghm | 1264 | 2.7% |
| vgh | 1188 | 2.5% |
| klp | 947 | 2.0% |
| Other values (66) | 11699 | 24.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 30157 | 27.4% |
| I | 20545 | 18.7% |
| H | 10834 | 9.8% |
| G | 8415 | 7.6% |
| O | 6139 | 5.6% |
| F | 5395 | 4.9% |
| S | 5243 | 4.8% |
| K | 4981 | 4.5% |
| N | 3709 | 3.4% |
| C | 3403 | 3.1% |
| Other values (15) | 11278 | 10.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 109867 | 99.8% |
| Lowercase Letter | 232 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 30157 | 27.4% |
| I | 20545 | 18.7% |
| H | 10834 | 9.9% |
| G | 8415 | 7.7% |
| O | 6139 | 5.6% |
| F | 5395 | 4.9% |
| S | 5243 | 4.8% |
| K | 4981 | 4.5% |
| N | 3709 | 3.4% |
| C | 3403 | 3.1% |
| Other values (13) | 11046 | 10.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 120 | 51.7% |
| d | 112 | 48.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 110099 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 30157 | 27.4% |
| I | 20545 | 18.7% |
| H | 10834 | 9.8% |
| G | 8415 | 7.6% |
| O | 6139 | 5.6% |
| F | 5395 | 4.9% |
| S | 5243 | 4.8% |
| K | 4981 | 4.5% |
| N | 3709 | 3.4% |
| C | 3403 | 3.1% |
| Other values (15) | 11278 | 10.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 110099 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 30157 | 27.4% |
| I | 20545 | 18.7% |
| H | 10834 | 9.8% |
| G | 8415 | 7.6% |
| O | 6139 | 5.6% |
| F | 5395 | 4.9% |
| S | 5243 | 4.8% |
| K | 4981 | 4.5% |
| N | 3709 | 3.4% |
| C | 3403 | 3.1% |
| Other values (15) | 11278 | 10.2% |
Tr2
Text
MISSING 
| Distinct | 63 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 337810 |
| Missing (%) | 96.2% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 2.390005308 |
| Min length | 2 |
Characters and Unicode
| Total characters | 31517 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | KS |
|---|---|
| 2nd row | KLF |
| 3rd row | SHF |
| 4th row | SHF |
| 5th row | OGF |
| Value | Count | Frequency (%) |
| mi | 4357 | 33.0% |
| pi | 947 | 7.2% |
| ka | 914 | 6.9% |
| klp | 732 | 5.6% |
| ch | 555 | 4.2% |
| ghm | 523 | 4.0% |
| gom | 475 | 3.6% |
| mn | 443 | 3.4% |
| ogf | 420 | 3.2% |
| mo | 351 | 2.7% |
| Other values (53) | 3470 | 26.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 7585 | 24.1% |
| I | 5425 | 17.2% |
| H | 3002 | 9.5% |
| G | 2546 | 8.1% |
| K | 2050 | 6.5% |
| O | 1751 | 5.6% |
| P | 1698 | 5.4% |
| F | 1670 | 5.3% |
| S | 1660 | 5.3% |
| L | 1136 | 3.6% |
| Other values (15) | 2994 | 9.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 31450 | 99.8% |
| Lowercase Letter | 67 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 7585 | 24.1% |
| I | 5425 | 17.2% |
| H | 3002 | 9.5% |
| G | 2546 | 8.1% |
| K | 2050 | 6.5% |
| O | 1751 | 5.6% |
| P | 1698 | 5.4% |
| F | 1670 | 5.3% |
| S | 1660 | 5.3% |
| L | 1136 | 3.6% |
| Other values (13) | 2927 | 9.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 44 | 65.7% |
| d | 23 | 34.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31517 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 7585 | 24.1% |
| I | 5425 | 17.2% |
| H | 3002 | 9.5% |
| G | 2546 | 8.1% |
| K | 2050 | 6.5% |
| O | 1751 | 5.6% |
| P | 1698 | 5.4% |
| F | 1670 | 5.3% |
| S | 1660 | 5.3% |
| L | 1136 | 3.6% |
| Other values (15) | 2994 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31517 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 7585 | 24.1% |
| I | 5425 | 17.2% |
| H | 3002 | 9.5% |
| G | 2546 | 8.1% |
| K | 2050 | 6.5% |
| O | 1751 | 5.6% |
| P | 1698 | 5.4% |
| F | 1670 | 5.3% |
| S | 1660 | 5.3% |
| L | 1136 | 3.6% |
| Other values (15) | 2994 | 9.5% |
Tr3
Text
MISSING 
| Distinct | 45 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 344588 |
| Missing (%) | 98.2% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 2.494304884 |
| Min length | 2 |
Characters and Unicode
| Total characters | 15986 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | OGF |
|---|---|
| 2nd row | GOM |
| 3rd row | MI |
| 4th row | MI |
| 5th row | MI |
| Value | Count | Frequency (%) |
| mi | 1971 | 30.8% |
| klp | 847 | 13.2% |
| ghm | 549 | 8.6% |
| ch | 369 | 5.8% |
| gom | 303 | 4.7% |
| ka | 296 | 4.6% |
| ogf | 172 | 2.7% |
| pi | 159 | 2.5% |
| vgh | 148 | 2.3% |
| mn | 132 | 2.1% |
| Other values (35) | 1463 | 22.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 3664 | 22.9% |
| I | 2213 | 13.8% |
| H | 1823 | 11.4% |
| G | 1628 | 10.2% |
| K | 1188 | 7.4% |
| P | 1022 | 6.4% |
| L | 892 | 5.6% |
| O | 864 | 5.4% |
| F | 667 | 4.2% |
| S | 628 | 3.9% |
| Other values (14) | 1397 | 8.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15952 | 99.8% |
| Lowercase Letter | 34 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 3664 | 23.0% |
| I | 2213 | 13.9% |
| H | 1823 | 11.4% |
| G | 1628 | 10.2% |
| K | 1188 | 7.4% |
| P | 1022 | 6.4% |
| L | 892 | 5.6% |
| O | 864 | 5.4% |
| F | 667 | 4.2% |
| S | 628 | 3.9% |
| Other values (12) | 1363 | 8.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 19 | 55.9% |
| d | 15 | 44.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15986 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 3664 | 22.9% |
| I | 2213 | 13.8% |
| H | 1823 | 11.4% |
| G | 1628 | 10.2% |
| K | 1188 | 7.4% |
| P | 1022 | 6.4% |
| L | 892 | 5.6% |
| O | 864 | 5.4% |
| F | 667 | 4.2% |
| S | 628 | 3.9% |
| Other values (14) | 1397 | 8.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15986 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 3664 | 22.9% |
| I | 2213 | 13.8% |
| H | 1823 | 11.4% |
| G | 1628 | 10.2% |
| K | 1188 | 7.4% |
| P | 1022 | 6.4% |
| L | 892 | 5.6% |
| O | 864 | 5.4% |
| F | 667 | 4.2% |
| S | 628 | 3.9% |
| Other values (14) | 1397 | 8.7% |
Tr4
Text
MISSING 
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 339159 |
| Missing (%) | 96.6% |
| Memory size | 2.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 2.151207974 |
| Min length | 2 |
Characters and Unicode
| Total characters | 25466 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MI |
|---|---|
| 2nd row | MI |
| 3rd row | MI |
| 4th row | MI |
| 5th row | MI |
| Value | Count | Frequency (%) |
| mi | 6796 | 57.4% |
| mo | 2090 | 17.7% |
| mn | 600 | 5.1% |
| ghm | 345 | 2.9% |
| mnf | 281 | 2.4% |
| klp | 268 | 2.3% |
| ogf | 259 | 2.2% |
| ch | 219 | 1.8% |
| hof | 140 | 1.2% |
| pi | 112 | 0.9% |
| Other values (30) | 728 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 10447 | 41.0% |
| I | 6910 | 27.1% |
| O | 2678 | 10.5% |
| H | 974 | 3.8% |
| G | 925 | 3.6% |
| N | 891 | 3.5% |
| F | 772 | 3.0% |
| P | 415 | 1.6% |
| K | 392 | 1.5% |
| L | 302 | 1.2% |
| Other values (11) | 760 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 25459 | > 99.9% |
| Lowercase Letter | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 10447 | 41.0% |
| I | 6910 | 27.1% |
| O | 2678 | 10.5% |
| H | 974 | 3.8% |
| G | 925 | 3.6% |
| N | 891 | 3.5% |
| F | 772 | 3.0% |
| P | 415 | 1.6% |
| K | 392 | 1.5% |
| L | 302 | 1.2% |
| Other values (9) | 753 | 3.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 6 | 85.7% |
| d | 1 | 14.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25466 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 10447 | 41.0% |
| I | 6910 | 27.1% |
| O | 2678 | 10.5% |
| H | 974 | 3.8% |
| G | 925 | 3.6% |
| N | 891 | 3.5% |
| F | 772 | 3.0% |
| P | 415 | 1.6% |
| K | 392 | 1.5% |
| L | 302 | 1.2% |
| Other values (11) | 760 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25466 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 10447 | 41.0% |
| I | 6910 | 27.1% |
| O | 2678 | 10.5% |
| H | 974 | 3.8% |
| G | 925 | 3.6% |
| N | 891 | 3.5% |
| F | 772 | 3.0% |
| P | 415 | 1.6% |
| K | 392 | 1.5% |
| L | 302 | 1.2% |
| Other values (11) | 760 | 3.0% |
Tr5
Text
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 350836 |
| Missing (%) | > 99.9% |
| Memory size | 2.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.01863354 |
| Min length | 2 |
Characters and Unicode
| Total characters | 325 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | GHM |
|---|---|
| 2nd row | GHM |
| 3rd row | MI |
| 4th row | MI |
| 5th row | MI |
| Value | Count | Frequency (%) |
| mi | 153 | 95.0% |
| pi | 3 | 1.9% |
| ghm | 2 | 1.2% |
| mo | 2 | 1.2% |
| klp | 1 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 157 | 48.3% |
| I | 156 | 48.0% |
| P | 4 | 1.2% |
| G | 2 | 0.6% |
| H | 2 | 0.6% |
| O | 2 | 0.6% |
| K | 1 | 0.3% |
| L | 1 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 325 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 157 | 48.3% |
| I | 156 | 48.0% |
| P | 4 | 1.2% |
| G | 2 | 0.6% |
| H | 2 | 0.6% |
| O | 2 | 0.6% |
| K | 1 | 0.3% |
| L | 1 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 325 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 157 | 48.3% |
| I | 156 | 48.0% |
| P | 4 | 1.2% |
| G | 2 | 0.6% |
| H | 2 | 0.6% |
| O | 2 | 0.6% |
| K | 1 | 0.3% |
| L | 1 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 325 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 157 | 48.3% |
| I | 156 | 48.0% |
| P | 4 | 1.2% |
| G | 2 | 0.6% |
| H | 2 | 0.6% |
| O | 2 | 0.6% |
| K | 1 | 0.3% |
| L | 1 | 0.3% |
Chip_pct
Real number (ℝ)
MISSING 
| Distinct | 45 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 161178 |
| Missing (%) | 45.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.27222775 |
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 22 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 20 |
| median | 30 |
| Q3 | 40 |
| 95-th percentile | 65 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 16.37143038 |
|---|---|
| Coefficient of variation (CV) | 0.4776879548 |
| Kurtosis | 0.5188388138 |
| Mean | 34.27222775 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.7228901153 |
| Sum | 6505520 |
| Variance | 268.0237327 |
| Monotonicity | Not monotonic |
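Several of the Chip_pct figures above follow from one another, which makes a quick consistency check possible. The sketch below recomputes the coefficient of variation (standard deviation divided by mean), the variance (standard deviation squared), the interquartile range, the range, and the mean as Sum divided by the number of non-missing rows. All inputs are copied from this report; the variable names are illustrative.

```python
import math

# Figures copied from the Chip_pct tables in this report.
n_observations = 350_997
n_missing = 161_178
total = 6_505_520            # "Sum"
mean = 34.27222775
std = 16.37143038
q1, q3 = 20, 40
minimum, maximum = 0, 100

n_present = n_observations - n_missing   # rows with a value
cv = std / mean                          # coefficient of variation
variance = std ** 2
iqr = q3 - q1                            # interquartile range
value_range = maximum - minimum
mean_from_sum = total / n_present        # should reproduce the reported mean

print(n_present, cv, variance, iqr, value_range, mean_from_sum)
```

The recomputed values agree with the reported CV, variance, IQR, range and mean to the precision shown in the tables.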
| Value | Count | Frequency (%) |
| 30 | 42579 | 12.1% |
| 40 | 35170 | 10.0% |
| 20 | 31516 | 9.0% |
| 50 | 21133 | 6.0% |
| 10 | 12707 | 3.6% |
| 25 | 9669 | 2.8% |
| 60 | 9648 | 2.7% |
| 15 | 6362 | 1.8% |
| 35 | 4687 | 1.3% |
| 70 | 3669 | 1.0% |
| Other values (35) | 12679 | 3.6% |
| (Missing) | 161178 | 45.9% |
| Value | Count | Frequency (%) |
| 0 | 22 | < 0.1% |
| 1 | 5 | < 0.1% |
| 2 | 43 | < 0.1% |
| 3 | 39 | < 0.1% |
| 4 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 100 | < 0.1% |
| 95 | 47 | < 0.1% |
| 90 | 911 | 0.3% |
| 86 | 1 | < 0.1% |
| 85 | 141 | < 0.1% |
Shape1
Text
MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 71713 |
| Missing (%) | 20.4% |
| Memory size | 2.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 558568 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SR |
|---|---|
| 2nd row | SR |
| 3rd row | SR |
| 4th row | SR |
| 5th row | SR |
| Value | Count | Frequency (%) |
| aa | 120593 | 43.2% |
| sa | 110536 | 39.6% |
| sr | 32558 | 11.7% |
| rr | 8529 | 3.1% |
| va | 6293 | 2.3% |
| wr | 775 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 358015 | 64.1% |
| S | 143094 | 25.6% |
| R | 50391 | 9.0% |
| V | 6293 | 1.1% |
| W | 775 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 558568 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 358015 | 64.1% |
| S | 143094 | 25.6% |
| R | 50391 | 9.0% |
| V | 6293 | 1.1% |
| W | 775 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 558568 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 358015 | 64.1% |
| S | 143094 | 25.6% |
| R | 50391 | 9.0% |
| V | 6293 | 1.1% |
| W | 775 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 558568 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 358015 | 64.1% |
| S | 143094 | 25.6% |
| R | 50391 | 9.0% |
| V | 6293 | 1.1% |
| W | 775 | 0.1% |
Shape2
Text
MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 342475 |
| Missing (%) | 97.6% |
| Memory size | 2.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 17044 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SA |
|---|---|
| 2nd row | SA |
| 3rd row | SR |
| 4th row | SR |
| 5th row | SR |
| Value | Count | Frequency (%) |
| sr | 3003 | 35.2% |
| sa | 2810 | 33.0% |
| rr | 1492 | 17.5% |
| aa | 1217 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 5987 | 35.1% |
| S | 5813 | 34.1% |
| A | 5244 | 30.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 17044 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 5987 | 35.1% |
| S | 5813 | 34.1% |
| A | 5244 | 30.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17044 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 5987 | 35.1% |
| S | 5813 | 34.1% |
| A | 5244 | 30.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17044 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 5987 | 35.1% |
| S | 5813 | 34.1% |
| A | 5244 | 30.8% |
Max_Dia
Real number (ℝ)
MISSING 
| Distinct | 56 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 62831 |
| Missing (%) | 17.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.60768793 |
| Minimum | 0 |
| Maximum | 525 |
| Zeros | 24 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 10 |
| median | 12 |
| Q3 | 20 |
| 95-th percentile | 30 |
| Maximum | 525 |
| Range | 525 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 7.195335875 |
|---|---|
| Coefficient of variation (CV) | 0.4925718505 |
| Kurtosis | 209.682909 |
| Mean | 14.60768793 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 4.28658828 |
| Sum | 4209439 |
| Variance | 51.77285836 |
| Monotonicity | Not monotonic |
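The very high kurtosis and the maximum of 525 against a 95th percentile of 30 suggest a few extreme Max_Dia values. One common way to flag them is Tukey's 1.5 × IQR rule, sketched below with the quartiles from this report. The rule and its 1.5 multiplier are a convention chosen for illustration, not something the report itself applies, and whether these values are data-entry errors cannot be decided from the statistics alone.

```python
# Quartiles and the five largest values copied from the Max_Dia tables.
q1, q3 = 10, 20
largest_values = [525, 405, 151, 145, 140]

iqr = q3 - q1                    # interquartile range
lower_fence = q1 - 1.5 * iqr     # values below this would be flagged
upper_fence = q3 + 1.5 * iqr     # values above this are flagged

flagged = [v for v in largest_values if v > upper_fence]

print(lower_fence, upper_fence, flagged)
```

With these inputs the upper fence is 35, so all five of the largest listed values fall outside it.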
| Value | Count | Frequency (%) |
| 10 | 117872 | 33.6% |
| 20 | 55764 | 15.9% |
| 15 | 54933 | 15.7% |
| 5 | 20654 | 5.9% |
| 25 | 15768 | 4.5% |
| 30 | 10603 | 3.0% |
| 35 | 3038 | 0.9% |
| 7 | 2799 | 0.8% |
| 40 | 2069 | 0.6% |
| 12 | 881 | 0.3% |
| Other values (46) | 3785 | 1.1% |
| (Missing) | 62831 | 17.9% |
| Value | Count | Frequency (%) |
| 0 | 24 | < 0.1% |
| 1 | 51 | < 0.1% |
| 2 | 510 | 0.1% |
| 3 | 717 | 0.2% |
| 4 | 237 | 0.1% |
| Value | Count | Frequency (%) |
| 525 | 2 | < 0.1% |
| 405 | 1 | < 0.1% |
| 151 | 1 | < 0.1% |
| 145 | 1 | < 0.1% |
| 140 | 3 | < 0.1% |
Hardness
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 350997 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 2.7 MiB |
Colour
Real number (ℝ)
MISSING 
| Distinct | 90 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 102224 |
| Missing (%) | 29.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.14761248 |
| Minimum | 1 |
| Maximum | 96 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 21 |
| median | 25 |
| Q3 | 34 |
| 95-th percentile | 50 |
| Maximum | 96 |
| Range | 95 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 12.72642994 |
|---|---|
| Coefficient of variation (CV) | 0.4687863415 |
| Kurtosis | 4.562979416 |
| Mean | 27.14761248 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 1.558933637 |
| Sum | 6753593 |
| Variance | 161.9620189 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 32057 | 9.1% |
| 25 | 30375 | 8.7% |
| 34 | 23317 | 6.6% |
| 33 | 12439 | 3.5% |
| 22 | 12281 | 3.5% |
| 27 | 11128 | 3.2% |
| 35 | 10710 | 3.1% |
| 17 | 10555 | 3.0% |
| 14 | 8783 | 2.5% |
| 18 | 8587 | 2.4% |
| Other values (80) | 88541 | 25.2% |
| (Missing) | 102224 | 29.1% |
| Value | Count | Frequency (%) |
| 1 | 206 | 0.1% |
| 2 | 779 | 0.2% |
| 3 | 1209 | 0.3% |
| 4 | 1507 | 0.4% |
| 5 | 2449 | 0.7% |
| Value | Count | Frequency (%) |
| 96 | 26 | < 0.1% |
| 95 | 12 | < 0.1% |
| 94 | 2 | < 0.1% |
| 93 | 359 | 0.1% |
| 92 | 143 | < 0.1% |
LithComment
Text
MISSING 
| Distinct | 13963 |
|---|---|
| Distinct (%) | 28.6% |
| Missing | 302145 |
| Missing (%) | 86.1% |
| Memory size | 2.7 MiB |
Length
| Max length | 146 |
|---|---|
| Median length | 103 |
| Mean length | 14.23139278 |
| Min length | 1 |
Characters and Unicode
| Total characters | 695232 |
|---|---|
| Distinct characters | 87 |
| Distinct categories | 11 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 7455 |
|---|---|
| Unique (%) | 15.3% |
Sample
| 1st row | Interbedded shales |
|---|---|
| 2nd row | Interbedded shales |
| 3rd row | Shales interbedded |
| 4th row | Black shales with chert |
| 5th row | Duplicate missed out |
| Value | Count | Frequency (%) |
|  | 7794 | 6.3% |
| duplicate | 5354 | 4.3% |
| damp | 3286 | 2.6% |
| mn | 2957 | 2.4% |
| eoh | 2753 | 2.2% |
| lab | 2659 | 2.1% |
| clay | 2550 | 2.1% |
| wet | 2290 | 1.8% |
| to | 1612 | 1.3% |
| stained | 1515 | 1.2% |
| Other values (9621) | 91424 | 73.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 81028 | 11.7% |
| e | 28985 | 4.2% |
| i | 24694 | 3.6% |
| E | 24249 | 3.5% |
| S | 22931 | 3.3% |
| a | 22755 | 3.3% |
| T | 22345 | 3.2% |
| A | 21515 | 3.1% |
| I | 21411 | 3.1% |
| t | 21020 | 3.0% |
| Other values (77) | 404299 | 58.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 297852 | 42.8% |
| Lowercase Letter | 238018 | 34.2% |
| Space Separator | 81028 | 11.7% |
| Decimal Number | 59085 | 8.5% |
| Other Punctuation | 10070 | 1.4% |
| Dash Punctuation | 6356 | 0.9% |
| Close Punctuation | 995 | 0.1% |
| Open Punctuation | 993 | 0.1% |
| Math Symbol | 814 | 0.1% |
| Connector Punctuation | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 28985 | 12.2% |
| i | 24694 | 10.4% |
| a | 22755 | 9.6% |
| t | 21020 | 8.8% |
| l | 17424 | 7.3% |
| s | 14965 | 6.3% |
| n | 13816 | 5.8% |
| o | 11902 | 5.0% |
| c | 11847 | 5.0% |
| r | 9937 | 4.2% |
| Other values (16) | 60673 | 25.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 24249 | 8.1% |
| S | 22931 | 7.7% |
| T | 22345 | 7.5% |
| A | 21515 | 7.2% |
| I | 21411 | 7.2% |
| D | 20455 | 6.9% |
| L | 20231 | 6.8% |
| M | 19681 | 6.6% |
| O | 18421 | 6.2% |
| N | 13326 | 4.5% |
| Other values (16) | 93287 | 31.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3176 | 31.5% |
| ? | 1868 | 18.6% |
| / | 1736 | 17.2% |
| @ | 1439 | 14.3% |
| % | 562 | 5.6% |
| & | 389 | 3.9% |
| * | 255 | 2.5% |
| : | 253 | 2.5% |
| ' | 198 | 2.0% |
| ! | 102 | 1.0% |
| Other values (2) | 92 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 7490 | 12.7% |
| 2 | 7406 | 12.5% |
| 1 | 6714 | 11.4% |
| 0 | 6383 | 10.8% |
| 4 | 6226 | 10.5% |
| 9 | 6076 | 10.3% |
| 6 | 4975 | 8.4% |
| 5 | 4760 | 8.1% |
| 8 | 4590 | 7.8% |
| 7 | 4465 | 7.6% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 483 | 59.3% |
| < | 250 | 30.7% |
| > | 38 | 4.7% |
| + | 37 | 4.5% |
| ~ | 6 | 0.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 993 | 99.8% |
| ] | 2 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 991 | 99.8% |
| [ | 2 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 81028 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6356 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 19 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| � | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 535870 | 77.1% |
| Common | 159362 | 22.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 28985 | 5.4% |
| i | 24694 | 4.6% |
| E | 24249 | 4.5% |
| S | 22931 | 4.3% |
| a | 22755 | 4.2% |
| T | 22345 | 4.2% |
| A | 21515 | 4.0% |
| I | 21411 | 4.0% |
| t | 21020 | 3.9% |
| D | 20455 | 3.8% |
| Other values (42) | 305510 | 57.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 81028 | 50.8% |
| 3 | 7490 | 4.7% |
| 2 | 7406 | 4.6% |
| 1 | 6714 | 4.2% |
| 0 | 6383 | 4.0% |
| - | 6356 | 4.0% |
| 4 | 6226 | 3.9% |
| 9 | 6076 | 3.8% |
| 6 | 4975 | 3.1% |
| 5 | 4760 | 3.0% |
| Other values (25) | 21948 | 13.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 695230 | > 99.9% |
| Specials | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 81028 | 11.7% |
| e | 28985 | 4.2% |
| i | 24694 | 3.6% |
| E | 24249 | 3.5% |
| S | 22931 | 3.3% |
| a | 22755 | 3.3% |
| T | 22345 | 3.2% |
| A | 21515 | 3.1% |
| I | 21411 | 3.1% |
| t | 21020 | 3.0% |
| Other values (76) | 404297 | 58.2% |
Specials
| Value | Count | Frequency (%) |
| � | 2 | 100.0% |
Ore_Texture
Text
MISSING 
| Distinct | 1030 |
|---|---|
| Distinct (%) | 31.8% |
| Missing | 347753 |
| Missing (%) | 99.1% |
| Memory size | 2.7 MiB |
Length
| Max length | 84 |
|---|---|
| Median length | 59 |
| Mean length | 17.19235512 |
| Min length | 1 |
Characters and Unicode
| Total characters | 55772 |
|---|---|
| Distinct characters | 83 |
| Distinct categories | 10 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 195 |
|---|---|
| Unique (%) | 6.0% |
Sample
| 1st row | low recovery |
|---|---|
| 2nd row | Steel blue hematite and bedding |
| 3rd row | clay in P of GHHp |
| 4th row | clay in P of GHHp |
| 5th row | clay in P of GHHp |
| Value | Count | Frequency (%) |
|  | 588 | 6.3% |
| duplicate | 274 | 2.9% |
| clay | 227 | 2.4% |
| mn | 165 | 1.8% |
| of | 159 | 1.7% |
| in | 144 | 1.5% |
| injected | 142 | 1.5% |
| damp | 141 | 1.5% |
| and | 115 | 1.2% |
| vgh | 114 | 1.2% |
| Other values (984) | 7283 | 77.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 9583 | 17.2% |
| e | 3823 | 6.9% |
| i | 3036 | 5.4% |
| a | 2790 | 5.0% |
| t | 2590 | 4.6% |
| l | 2371 | 4.3% |
| s | 2253 | 4.0% |
| n | 1805 | 3.2% |
| c | 1732 | 3.1% |
| o | 1728 | 3.1% |
| Other values (73) | 24061 | 43.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32928 | 59.0% |
| Uppercase Letter | 9647 | 17.3% |
| Space Separator | 9583 | 17.2% |
| Decimal Number | 2095 | 3.8% |
| Other Punctuation | 907 | 1.6% |
| Dash Punctuation | 394 | 0.7% |
| Close Punctuation | 78 | 0.1% |
| Open Punctuation | 74 | 0.1% |
| Math Symbol | 64 | 0.1% |
| Other Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3823 | 11.6% |
| i | 3036 | 9.2% |
| a | 2790 | 8.5% |
| t | 2590 | 7.9% |
| l | 2371 | 7.2% |
| s | 2253 | 6.8% |
| n | 1805 | 5.5% |
| c | 1732 | 5.3% |
| o | 1728 | 5.2% |
| d | 1588 | 4.8% |
| Other values (16) | 9212 | 28.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 889 | 9.2% |
| M | 787 | 8.2% |
| O | 779 | 8.1% |
| S | 757 | 7.8% |
| T | 737 | 7.6% |
| I | 684 | 7.1% |
| E | 635 | 6.6% |
| L | 520 | 5.4% |
| G | 514 | 5.3% |
| A | 505 | 5.2% |
| Other values (16) | 2840 | 29.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 460 | 50.7% |
| ? | 152 | 16.8% |
| / | 122 | 13.5% |
| % | 69 | 7.6% |
| : | 54 | 6.0% |
| & | 21 | 2.3% |
| * | 11 | 1.2% |
| ' | 8 | 0.9% |
| ; | 3 | 0.3% |
| # | 3 | 0.3% |
| Other values (2) | 4 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 330 | |
| 9 | 308 | |
| 1 | 274 | |
| 2 | 257 | |
| 7 | 179 | |
| 8 | 160 | |
| 4 | 148 | |
| 5 | 147 | |
| 3 | 147 | |
| 6 | 145 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 42 | 65.6% |
| < | 11 | 17.2% |
| + | 9 | 14.1% |
| > | 2 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9583 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 394 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 78 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 74 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| � | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42575 | 76.3% |
| Common | 13197 | 23.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3823 | 9.0% |
| i | 3036 | 7.1% |
| a | 2790 | 6.6% |
| t | 2590 | 6.1% |
| l | 2371 | 5.6% |
| s | 2253 | 5.3% |
| n | 1805 | 4.2% |
| c | 1732 | 4.1% |
| o | 1728 | 4.1% |
| d | 1588 | 3.7% |
| Other values (42) | 18859 | 44.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 9583 | 72.6% |
| . | 460 | 3.5% |
| - | 394 | 3.0% |
| 0 | 330 | 2.5% |
| 9 | 308 | 2.3% |
| 1 | 274 | 2.1% |
| 2 | 257 | 1.9% |
| 7 | 179 | 1.4% |
| 8 | 160 | 1.2% |
| ? | 152 | 1.2% |
| Other values (21) | 1100 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55770 | > 99.9% |
| Specials | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 9583 | 17.2% |
| e | 3823 | 6.9% |
| i | 3036 | 5.4% |
| a | 2790 | 5.0% |
| t | 2590 | 4.6% |
| l | 2371 | 4.3% |
| s | 2253 | 4.0% |
| n | 1805 | 3.2% |
| c | 1732 | 3.1% |
| o | 1728 | 3.1% |
| Other values (72) | 24059 | 43.1% |
Specials
| Value | Count | Frequency (%) |
| � | 2 | 100.0% |
| | HOLEID | PROJECTCODE | GEOLFROM | GEOLTO | PRIORITY | Strat_Sum | Strat | Mj1 | Mj2 | Mj3 | Mj4 | Mj5 | Mn1 | Mn2 | Mn3 | Mn4 | Mn5 | Tr1 | Tr2 | Tr3 | Tr4 | Tr5 | Chip_pct | Shape1 | Shape2 | Max_Dia | Hardness | Colour | LithComment | Ore_Texture |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CC0001 | CC | 0.0 | 2.0 | 1 | Ta | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | CC0001 | CC | 0.0 | 1.0 | 1 | NaN | NaN | GH | GO | HO | NaN | NaN | MI | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 50.0 | SR | NaN | 20.0 | NaN | NaN | NaN | NaN |
| 2 | CC0001 | CC | 1.0 | 2.0 | 1 | NaN | NaN | GH | GO | NaN | NaN | NaN | MI | NaN | NaN | NaN | NaN | HS | NaN | NaN | NaN | NaN | 70.0 | SR | NaN | 20.0 | NaN | NaN | NaN | NaN |
| 3 | CC0001 | CC | 2.0 | 3.0 | 1 | NaN | NaN | GOM | HO | GH | NaN | NaN | MI | NaN | NaN | NaN | NaN | HS | NaN | NaN | NaN | NaN | 50.0 | SR | NaN | 15.0 | NaN | NaN | NaN | NaN |
| 4 | CC0001 | CC | 2.0 | 4.0 | 1 | Tdi | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | CC0001 | CC | 3.0 | 4.0 | 1 | NaN | NaN | HO | GO | NaN | NaN | NaN | MI | GH | NaN | NaN | NaN | HS | KS | NaN | NaN | NaN | 50.0 | SR | NaN | 15.0 | NaN | NaN | NaN | NaN |
| 6 | CC0001 | CC | 4.0 | 6.0 | 1 | Tds | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | CC0001 | CC | 4.0 | 5.0 | 1 | NaN | NaN | HO | GH | GO | NaN | NaN | MI | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 60.0 | SR | NaN | 20.0 | NaN | NaN | NaN | NaN |
| 8 | CC0001 | CC | 5.0 | 6.0 | 1 | NaN | NaN | GH | GO | NaN | NaN | NaN | MI | HO | OG | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 60.0 | SR | NaN | 20.0 | NaN | NaN | NaN | NaN |
| 9 | CC0001 | CC | 6.0 | 7.0 | 1 | NaN | NaN | HO | GH | NaN | NaN | NaN | MI | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 80.0 | SR | NaN | 25.0 | NaN | NaN | NaN | NaN |