Dataset statistics
Number of variables | 25 |
---|---|
Number of observations | 87 |
Missing cells | 513 |
Missing cells (%) | 23.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 115.3 KiB |
Average record size in memory | 1.3 KiB |
Variable types
Text | 9 |
---|---|
Numeric | 4 |
Categorical | 11 |
URL | 1 |
abilities is highly overall correlated with sex | High correlation |
birth_era is highly overall correlated with films | High correlation |
birth_year is highly overall correlated with sex and 1 other fields | High correlation |
death_era is highly overall correlated with films | High correlation |
films is highly overall correlated with birth_era and 2 other fields | High correlation |
gender is highly overall correlated with pronoun and 1 other fields | High correlation |
height is highly overall correlated with mass and 1 other fields | High correlation |
mass is highly overall correlated with height and 3 other fields | High correlation |
pronoun is highly overall correlated with gender and 1 other fields | High correlation |
sex is highly overall correlated with abilities and 7 other fields | High correlation |
skin_color is highly overall correlated with mass and 2 other fields | High correlation |
species is highly overall correlated with birth_year and 4 other fields | High correlation |
birth_era is highly imbalanced (53.1%) | Imbalance |
height has 1 (1.1%) missing values | Missing |
mass has 22 (25.3%) missing values | Missing |
hair_color has 5 (5.7%) missing values | Missing |
skin_color has 1 (1.1%) missing values | Missing |
birth_year has 37 (42.5%) missing values | Missing |
birth_era has 37 (42.5%) missing values | Missing |
birth_place has 50 (57.5%) missing values | Missing |
death_year has 25 (28.7%) missing values | Missing |
death_era has 25 (28.7%) missing values | Missing |
death_place has 30 (34.5%) missing values | Missing |
homeworld has 4 (4.6%) missing values | Missing |
cybernetics has 80 (92.0%) missing values | Missing |
abilities has 32 (36.8%) missing values | Missing |
equipment has 25 (28.7%) missing values | Missing |
vehicles has 72 (82.8%) missing values | Missing |
starships has 67 (77.0%) missing values | Missing |
name has unique values | Unique |
photo has unique values | Unique |
death_year has 10 (11.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-30 08:20:12.093341 |
---|---|
Analysis finished | 2023-12-30 08:20:13.802663 |
Duration | 1.71 second |
Software version | ydata-profiling v0.0.dev0 |
Download configuration | config.json |
name
Text
UNIQUE
 
Distinct | 87 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.0 KiB |
Length
Max length | 21 |
---|---|
Median length | 15 |
Mean length | 10.804598 |
Min length | 4 |
Characters and Unicode
Total characters | 940 |
---|---|
Distinct characters | 60 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 87 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | Luke Skywalker |
---|---|
2nd row | C-3PO |
3rd row | R2-D2 |
4th row | Darth Vader |
5th row | Leia Organa Solo |
Value | Count | Frequency (%) |
lars | 4 | 2.5% |
skywalker | 4 | 2.5% |
fett | 2 | 1.2% |
organa | 2 | 1.2% |
solo | 2 | 1.2% |
antilles | 2 | 1.2% |
darth | 2 | 1.2% |
jar | 2 | 1.2% |
kenobi | 1 | 0.6% |
biggs | 1 | 0.6% |
Other values (139) | 139 |
Most occurring characters
Value | Count | Frequency (%) |
a | 97 | 10.3% |
74 | 7.9% | |
e | 63 | 6.7% |
r | 57 | 6.1% |
i | 56 | 6.0% |
o | 52 | 5.5% |
n | 47 | 5.0% |
s | 43 | 4.6% |
l | 41 | 4.4% |
t | 35 | 3.7% |
Other values (50) | 375 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 670 | |
Uppercase Letter | 173 | 18.4% |
Space Separator | 74 | 7.9% |
Decimal Number | 11 | 1.2% |
Dash Punctuation | 10 | 1.1% |
Other Punctuation | 2 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 97 | |
e | 63 | |
r | 57 | 8.5% |
i | 56 | 8.4% |
o | 52 | 7.8% |
n | 47 | 7.0% |
s | 43 | 6.4% |
l | 41 | 6.1% |
t | 35 | 5.2% |
u | 29 | 4.3% |
Other values (15) | 150 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 14 | 8.1% |
B | 13 | 7.5% |
T | 12 | 6.9% |
W | 12 | 6.9% |
P | 12 | 6.9% |
D | 11 | 6.4% |
L | 11 | 6.4% |
A | 10 | 5.8% |
G | 9 | 5.2% |
R | 9 | 5.2% |
Other values (15) | 60 |
Decimal Number
Value | Count | Frequency (%) |
8 | 3 | |
4 | 2 | |
2 | 2 | |
1 | 1 | 9.1% |
7 | 1 | 9.1% |
3 | 1 | 9.1% |
5 | 1 | 9.1% |
Space Separator
Value | Count | Frequency (%) |
74 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10 |
Other Punctuation
Value | Count | Frequency (%) |
' | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 843 | |
Common | 97 | 10.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 97 | 11.5% |
e | 63 | 7.5% |
r | 57 | 6.8% |
i | 56 | 6.6% |
o | 52 | 6.2% |
n | 47 | 5.6% |
s | 43 | 5.1% |
l | 41 | 4.9% |
t | 35 | 4.2% |
u | 29 | 3.4% |
Other values (40) | 323 |
Common
Value | Count | Frequency (%) |
74 | ||
- | 10 | 10.3% |
8 | 3 | 3.1% |
4 | 2 | 2.1% |
2 | 2 | 2.1% |
' | 2 | 2.1% |
1 | 1 | 1.0% |
7 | 1 | 1.0% |
3 | 1 | 1.0% |
5 | 1 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 936 | |
None | 4 | 0.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 97 | 10.4% |
74 | 7.9% | |
e | 63 | 6.7% |
r | 57 | 6.1% |
i | 56 | 6.0% |
o | 52 | 5.6% |
n | 47 | 5.0% |
s | 43 | 4.6% |
l | 41 | 4.4% |
t | 35 | 3.7% |
Other values (49) | 371 |
None
Value | Count | Frequency (%) |
é | 4 |
height
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 48 |
---|---|
Distinct (%) | 55.8% |
Missing | 1 |
Missing (%) | 1.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 173.61628 |
Minimum | 66 |
---|---|
Maximum | 264 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 828.0 B |
Quantile statistics
Minimum | 66 |
---|---|
5-th percentile | 94.5 |
Q1 | 167 |
median | 180 |
Q3 | 191 |
95-th percentile | 222 |
Maximum | 264 |
Range | 198 |
Interquartile range (IQR) | 24 |
Descriptive statistics
Standard deviation | 36.141281 |
---|---|
Coefficient of variation (CV) | 0.20816758 |
Kurtosis | 2.1353603 |
Mean | 173.61628 |
Median Absolute Deviation (MAD) | 12.5 |
Skewness | -1.1850659 |
Sum | 14931 |
Variance | 1306.1922 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
183 | 7 | 8.0% |
188 | 5 | 5.7% |
170 | 5 | 5.7% |
196 | 4 | 4.6% |
178 | 4 | 4.6% |
180 | 4 | 4.6% |
191 | 3 | 3.4% |
175 | 3 | 3.4% |
165 | 3 | 3.4% |
163 | 2 | 2.3% |
Other values (38) | 46 |
Value | Count | Frequency (%) |
66 | 1 | |
67 | 1 | |
79 | 1 | |
80 | 1 | |
94 | 1 | |
96 | 2 | |
97 | 1 | |
112 | 1 | |
122 | 1 | |
137 | 1 |
Value | Count | Frequency (%) |
264 | 1 | |
234 | 1 | |
229 | 1 | |
228 | 1 | |
224 | 1 | |
216 | 1 | |
213 | 1 | |
206 | 2 | |
202 | 1 | |
201 | 1 |
mass
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 43 |
---|---|
Distinct (%) | 66.2% |
Missing | 22 |
Missing (%) | 25.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 94.353846 |
Minimum | 15 |
---|---|
Maximum | 1358 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 828.0 B |
Quantile statistics
Minimum | 15 |
---|---|
5-th percentile | 22.4 |
Q1 | 55 |
median | 79 |
Q3 | 84 |
95-th percentile | 136 |
Maximum | 1358 |
Range | 1343 |
Interquartile range (IQR) | 29 |
Descriptive statistics
Standard deviation | 161.754 |
---|---|
Coefficient of variation (CV) | 1.714334 |
Kurtosis | 60.779027 |
Mean | 94.353846 |
Median Absolute Deviation (MAD) | 11 |
Skewness | 7.6742474 |
Sum | 6133 |
Variance | 26164.357 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
80 | 7 | 8.0% |
79 | 4 | 4.6% |
77 | 3 | 3.4% |
75 | 3 | 3.4% |
84 | 3 | 3.4% |
48 | 2 | 2.3% |
45 | 2 | 2.3% |
50 | 2 | 2.3% |
55 | 2 | 2.3% |
82 | 2 | 2.3% |
Other values (33) | 35 | |
(Missing) | 22 |
Value | Count | Frequency (%) |
15 | 1 | |
17 | 1 | |
18 | 1 | |
20 | 1 | |
32 | 2 | |
40 | 1 | |
45 | 2 | |
48 | 2 | |
49 | 1 | |
50 | 2 |
Value | Count | Frequency (%) |
1358 | 1 | |
159 | 1 | |
140 | 1 | |
136 | 2 | |
120 | 1 | |
113 | 1 | |
112 | 1 | |
110 | 1 | |
102 | 1 | |
91 | 1 |
hair_color
Categorical
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | 14.6% |
Missing | 5 |
Missing (%) | 5.7% |
Memory size | 5.2 KiB |
none | |
---|---|
brown | |
black | |
blond | |
white | |
Other values (7) |
Common Values
Value | Count | Frequency (%) |
none | 36 | |
brown | 16 | |
black | 11 | 12.6% |
blond | 4 | 4.6% |
white | 4 | 4.6% |
dark brown | 3 | 3.4% |
auburn | 3 | 3.4% |
sandy-blond | 1 | 1.1% |
red | 1 | 1.1% |
light brown | 1 | 1.1% |
Other values (2) | 2 | 2.3% |
(Missing) | 5 | 5.7% |
Length
Value | Count | Frequency (%) |
none | 36 | |
brown | 20 | |
black | 11 | 12.8% |
blond | 4 | 4.7% |
white | 4 | 4.7% |
dark | 3 | 3.5% |
auburn | 3 | 3.5% |
sandy-blond | 1 | 1.2% |
red | 1 | 1.2% |
light | 1 | 1.2% |
Other values (2) | 2 | 2.3% |
Most occurring characters
Value | Count | Frequency (%) |
n | 101 | |
o | 62 | |
e | 41 | |
b | 39 | 9.8% |
r | 28 | 7.0% |
w | 24 | 6.0% |
a | 19 | 4.8% |
l | 18 | 4.5% |
k | 14 | 3.5% |
d | 11 | 2.8% |
Other values (10) | 43 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 395 | |
Space Separator | 4 | 1.0% |
Dash Punctuation | 1 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
n | 101 | |
o | 62 | |
e | 41 | |
b | 39 | 9.9% |
r | 28 | 7.1% |
w | 24 | 6.1% |
a | 19 | 4.8% |
l | 18 | 4.6% |
k | 14 | 3.5% |
d | 11 | 2.8% |
Other values (8) | 38 | 9.6% |
Space Separator
Value | Count | Frequency (%) |
4 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 395 | |
Common | 5 | 1.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
n | 101 | |
o | 62 | |
e | 41 | |
b | 39 | 9.9% |
r | 28 | 7.1% |
w | 24 | 6.1% |
a | 19 | 4.8% |
l | 18 | 4.6% |
k | 14 | 3.5% |
d | 11 | 2.8% |
Other values (8) | 38 | 9.6% |
Common
Value | Count | Frequency (%) |
4 | ||
- | 1 | 20.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 400 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
n | 101 | |
o | 62 | |
e | 41 | |
b | 39 | 9.8% |
r | 28 | 7.0% |
w | 24 | 6.0% |
a | 19 | 4.8% |
l | 18 | 4.5% |
k | 14 | 3.5% |
d | 11 | 2.8% |
Other values (10) | 43 |
skin_color
Categorical
HIGH CORRELATION
  MISSING
 
Distinct | 33 |
---|---|
Distinct (%) | 38.4% |
Missing | 1 |
Missing (%) | 1.1% |
Memory size | 5.5 KiB |
light | |
---|---|
fair | |
green | |
tan | |
blue | |
Other values (28) |
Common Values
Value | Count | Frequency (%) |
light | 19 | |
fair | 10 | 11.5% |
green | 8 | 9.2% |
tan | 6 | 6.9% |
blue | 4 | 4.6% |
dark | 4 | 4.6% |
white | 3 | 3.4% |
pale | 3 | 3.4% |
orange | 2 | 2.3% |
yellow | 2 | 2.3% |
Other values (23) | 25 |
Length
Value | Count | Frequency (%) |
light | 20 | |
white | 11 | |
fair | 10 | |
green | 9 | 8.4% |
blue | 7 | 6.5% |
tan | 6 | 5.6% |
brown | 6 | 5.6% |
dark | 5 | 4.7% |
red | 5 | 4.7% |
orange | 4 | 3.7% |
Other values (14) | 24 |
Most occurring characters
Value | Count | Frequency (%) |
e | 65 | |
l | 48 | 8.8% |
r | 48 | 8.8% |
t | 45 | 8.3% |
i | 45 | 8.3% |
g | 40 | 7.4% |
a | 36 | 6.6% |
h | 31 | 5.7% |
n | 27 | 5.0% |
w | 21 | 3.9% |
Other values (14) | 137 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 505 | |
Space Separator | 21 | 3.9% |
Other Punctuation | 15 | 2.8% |
Dash Punctuation | 2 | 0.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 65 | |
l | 48 | |
r | 48 | |
t | 45 | |
i | 45 | |
g | 40 | |
a | 36 | 7.1% |
h | 31 | 6.1% |
n | 27 | 5.3% |
w | 21 | 4.2% |
Other values (11) | 99 |
Space Separator
Value | Count | Frequency (%) |
21 |
Other Punctuation
Value | Count | Frequency (%) |
, | 15 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 505 | |
Common | 38 | 7.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 65 | |
l | 48 | |
r | 48 | |
t | 45 | |
i | 45 | |
g | 40 | |
a | 36 | 7.1% |
h | 31 | 6.1% |
n | 27 | 5.3% |
w | 21 | 4.2% |
Other values (11) | 99 |
Common
Value | Count | Frequency (%) |
21 | ||
, | 15 | |
- | 2 | 5.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 543 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 65 | |
l | 48 | 8.8% |
r | 48 | 8.8% |
t | 45 | 8.3% |
i | 45 | 8.3% |
g | 40 | 7.4% |
a | 36 | 6.6% |
h | 31 | 5.7% |
n | 27 | 5.0% |
w | 21 | 3.9% |
Other values (14) | 137 |
eye_color
Categorical
Distinct | 16 |
---|---|
Distinct (%) | 18.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.4 KiB |
blue | |
---|---|
brown | |
black | |
orange | |
yellow | |
Other values (11) |
Common Values
Value | Count | Frequency (%) |
blue | 19 | |
brown | 18 | |
black | 11 | |
orange | 9 | |
yellow | 8 | |
red | 7 | 8.0% |
hazel | 4 | 4.6% |
blue-gray | 2 | 2.3% |
gold | 2 | 2.3% |
green-gold | 1 | 1.1% |
Other values (6) | 6 | 6.9% |
Length
Value | Count | Frequency (%) |
blue | 20 | |
brown | 19 | |
black | 12 | |
orange | 9 | |
yellow | 8 | 8.8% |
red | 8 | 8.8% |
hazel | 4 | 4.4% |
blue-gray | 2 | 2.2% |
gold | 2 | 2.2% |
green-gold | 1 | 1.1% |
Other values (6) | 6 | 6.6% |
Most occurring characters
Value | Count | Frequency (%) |
l | 60 | |
e | 57 | |
b | 55 | |
r | 43 | |
o | 41 | |
n | 31 | |
w | 30 | 6.5% |
a | 29 | 6.3% |
u | 23 | 5.0% |
g | 17 | 3.7% |
Other values (12) | 75 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 452 | |
Dash Punctuation | 4 | 0.9% |
Space Separator | 4 | 0.9% |
Other Punctuation | 1 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
l | 60 | |
e | 57 | |
b | 55 | |
r | 43 | |
o | 41 | |
n | 31 | |
w | 30 | |
a | 29 | |
u | 23 | 5.1% |
g | 17 | 3.8% |
Other values (9) | 66 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Space Separator
Value | Count | Frequency (%) |
4 |
Other Punctuation
Value | Count | Frequency (%) |
, | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 452 | |
Common | 9 | 2.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
l | 60 | |
e | 57 | |
b | 55 | |
r | 43 | |
o | 41 | |
n | 31 | |
w | 30 | |
a | 29 | |
u | 23 | 5.1% |
g | 17 | 3.8% |
Other values (9) | 66 |
Common
Value | Count | Frequency (%) |
- | 4 | |
4 | ||
, | 1 | 11.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 461 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
l | 60 | |
e | 57 | |
b | 55 | |
r | 43 | |
o | 41 | |
n | 31 | |
w | 30 | 6.5% |
a | 29 | 6.3% |
u | 23 | 5.0% |
g | 17 | 3.7% |
Other values (12) | 75 |
birth_year
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 38 |
---|---|
Distinct (%) | 76.0% |
Missing | 37 |
Missing (%) | 42.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 78.94 |
Minimum | 2 |
---|---|
Maximum | 896 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 828.0 B |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 9.35 |
Q1 | 29 |
median | 47.5 |
Q3 | 70.75 |
95-th percentile | 160.4 |
Maximum | 896 |
Range | 894 |
Interquartile range (IQR) | 41.75 |
Descriptive statistics
Standard deviation | 145.35704 |
---|---|
Coefficient of variation (CV) | 1.8413611 |
Kurtosis | 23.66993 |
Mean | 78.94 |
Median Absolute Deviation (MAD) | 21.5 |
Skewness | 4.7374785 |
Sum | 3947 |
Variance | 21128.67 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
19 | 3 | 3.4% |
82 | 3 | 3.4% |
41 | 3 | 3.4% |
48 | 2 | 2.3% |
72 | 2 | 2.3% |
15 | 2 | 2.3% |
31 | 2 | 2.3% |
29 | 2 | 2.3% |
52 | 2 | 2.3% |
67 | 1 | 1.1% |
Other values (28) | 28 | |
(Missing) | 37 |
Value | Count | Frequency (%) |
2 | 1 | 1.1% |
6 | 1 | 1.1% |
8 | 1 | 1.1% |
11 | 1 | 1.1% |
15 | 2 | |
19 | 3 | |
21 | 1 | 1.1% |
22 | 1 | 1.1% |
24 | 1 | 1.1% |
29 | 2 |
Value | Count | Frequency (%) |
896 | 1 | 1.1% |
600 | 1 | 1.1% |
200 | 1 | 1.1% |
112 | 1 | 1.1% |
110 | 1 | 1.1% |
102 | 1 | 1.1% |
93 | 1 | 1.1% |
92 | 1 | 1.1% |
82 | 3 | |
72 | 2 |
birth_era
Categorical
HIGH CORRELATION
  IMBALANCE
  MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 4.0% |
Missing | 37 |
Missing (%) | 42.5% |
Memory size | 4.2 KiB |
BBY | |
---|---|
ABY |
Common Values
Value | Count | Frequency (%) |
BBY | 45 | |
ABY | 5 | 5.7% |
(Missing) | 37 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
bby | 45 | |
aby | 5 | 10.0% |
Most occurring characters
Value | Count | Frequency (%) |
B | 95 | |
Y | 50 | |
A | 5 | 3.3% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 150 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
B | 95 | |
Y | 50 | |
A | 5 | 3.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 150 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
B | 95 | |
Y | 50 | |
A | 5 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 150 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
B | 95 | |
Y | 50 | |
A | 5 | 3.3% |
birth_place
Text
MISSING
 
Distinct | 30 |
---|---|
Distinct (%) | 81.1% |
Missing | 50 |
Missing (%) | 57.5% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
tatooine | 4 | 8.7% |
naboo | 3 | 6.5% |
polis | 2 | 4.3% |
massa | 2 | 4.3% |
coruscant | 2 | 4.3% |
nal | 1 | 2.2% |
cala | 1 | 2.2% |
chandrila | 1 | 2.2% |
affa | 1 | 2.2% |
socorro | 1 | 2.2% |
Other values (28) | 28 |
Most occurring characters
Value | Count | Frequency (%) |
a | 43 | |
o | 36 | |
r | 23 | 8.0% |
n | 22 | 7.7% |
i | 16 | 5.6% |
s | 14 | 4.9% |
t | 13 | 4.5% |
e | 12 | 4.2% |
l | 10 | 3.5% |
9 | 3.1% | |
Other values (28) | 88 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 232 | |
Uppercase Letter | 44 | 15.4% |
Space Separator | 9 | 3.1% |
Decimal Number | 1 | 0.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 43 | |
o | 36 | |
r | 23 | |
n | 22 | |
i | 16 | 6.9% |
s | 14 | 6.0% |
t | 13 | 5.6% |
e | 12 | 5.2% |
l | 10 | 4.3% |
u | 7 | 3.0% |
Other values (12) | 36 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 7 | |
T | 5 | |
M | 5 | |
S | 4 | |
N | 4 | |
H | 3 | |
P | 3 | |
A | 3 | |
K | 3 | |
D | 2 | 4.5% |
Other values (4) | 5 |
Space Separator
Value | Count | Frequency (%) |
9 |
Decimal Number
Value | Count | Frequency (%) |
4 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 276 | |
Common | 10 | 3.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 43 | |
o | 36 | |
r | 23 | 8.3% |
n | 22 | 8.0% |
i | 16 | 5.8% |
s | 14 | 5.1% |
t | 13 | 4.7% |
e | 12 | 4.3% |
l | 10 | 3.6% |
u | 7 | 2.5% |
Other values (26) | 80 |
Common
Value | Count | Frequency (%) |
9 | ||
4 | 1 | 10.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 286 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 43 | |
o | 36 | |
r | 23 | 8.0% |
n | 22 | 7.7% |
i | 16 | 5.6% |
s | 14 | 4.9% |
t | 13 | 4.5% |
e | 12 | 4.2% |
l | 10 | 3.5% |
9 | 3.1% | |
Other values (28) | 88 |
death_year
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 19 |
---|---|
Distinct (%) | 30.6% |
Missing | 25 |
Missing (%) | 28.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.370968 |
Minimum | 0 |
---|---|
Maximum | 45 |
Zeros | 10 |
Zeros (%) | 11.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 828.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 4 |
median | 19 |
Q3 | 22 |
95-th percentile | 34.95 |
Maximum | 45 |
Range | 45 |
Interquartile range (IQR) | 18 |
Descriptive statistics
Standard deviation | 11.627037 |
---|---|
Coefficient of variation (CV) | 0.71022297 |
Kurtosis | -0.71644317 |
Mean | 16.370968 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.097364044 |
Sum | 1015 |
Variance | 135.188 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
19 | 17 | |
0 | 10 | 11.5% |
4 | 5 | 5.7% |
3 | 4 | 4.6% |
22 | 4 | 4.6% |
34 | 3 | 3.4% |
20 | 3 | 3.4% |
35 | 3 | 3.4% |
32 | 2 | 2.3% |
18 | 2 | 2.3% |
Other values (9) | 9 | 10.3% |
(Missing) | 25 |
Value | Count | Frequency (%) |
0 | 10 | |
3 | 4 | 4.6% |
4 | 5 | 5.7% |
9 | 1 | 1.1% |
11 | 1 | 1.1% |
14 | 1 | 1.1% |
18 | 2 | 2.3% |
19 | 17 | |
20 | 3 | 3.4% |
21 | 1 | 1.1% |
Value | Count | Frequency (%) |
45 | 1 | 1.1% |
35 | 3 | |
34 | 3 | |
32 | 2 | |
29 | 1 | 1.1% |
27 | 1 | 1.1% |
25 | 1 | 1.1% |
24 | 1 | 1.1% |
22 | 4 | |
21 | 1 | 1.1% |
death_era
Categorical
HIGH CORRELATION
  MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 3.2% |
Missing | 25 |
Missing (%) | 28.7% |
Memory size | 4.5 KiB |
BBY | |
---|---|
ABY |
Common Values
Value | Count | Frequency (%) |
BBY | 42 | |
ABY | 20 | |
(Missing) | 25 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
bby | 42 | |
aby | 20 |
Most occurring characters
Value | Count | Frequency (%) |
B | 104 | |
Y | 62 | |
A | 20 | 10.8% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 186 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
B | 104 | |
Y | 62 | |
A | 20 | 10.8% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 186 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
B | 104 | |
Y | 62 | |
A | 20 | 10.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 186 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
B | 104 | |
Y | 62 | |
A | 20 | 10.8% |
death_place
Text
MISSING
 
Distinct | 32 |
---|---|
Distinct (%) | 56.1% |
Missing | 30 |
Missing (%) | 34.5% |
Memory size | 4.7 KiB |
Value | Count | Frequency (%) |
coruscant | 9 | 11.4% |
tatooine | 8 | 10.1% |
system | 4 | 5.1% |
star | 3 | 3.8% |
death | 3 | 3.8% |
mustafar | 3 | 3.8% |
naboo | 3 | 3.8% |
felucia | 3 | 3.8% |
bespin | 2 | 2.5% |
tantive | 2 | 2.5% |
Other values (36) | 39 |
Most occurring characters
Value | Count | Frequency (%) |
a | 67 | |
o | 48 | 9.3% |
t | 42 | 8.1% |
e | 33 | 6.4% |
n | 32 | 6.2% |
s | 31 | 6.0% |
i | 31 | 6.0% |
r | 29 | 5.6% |
22 | 4.3% | |
u | 17 | 3.3% |
Other values (34) | 165 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 411 | |
Uppercase Letter | 81 | 15.7% |
Space Separator | 22 | 4.3% |
Dash Punctuation | 2 | 0.4% |
Decimal Number | 1 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 67 | |
o | 48 | |
t | 42 | |
e | 33 | |
n | 32 | |
s | 31 | |
i | 31 | |
r | 29 | |
u | 17 | 4.1% |
l | 15 | 3.6% |
Other values (13) | 66 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 13 | |
T | 11 | |
S | 8 | |
I | 7 | |
M | 6 | 7.4% |
D | 5 | 6.2% |
B | 5 | 6.2% |
F | 4 | 4.9% |
N | 4 | 4.9% |
V | 3 | 3.7% |
Other values (8) | 15 |
Space Separator
Value | Count | Frequency (%) |
22 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 492 | |
Common | 25 | 4.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 67 | |
o | 48 | 9.8% |
t | 42 | 8.5% |
e | 33 | 6.7% |
n | 32 | 6.5% |
s | 31 | 6.3% |
i | 31 | 6.3% |
r | 29 | 5.9% |
u | 17 | 3.5% |
l | 15 | 3.0% |
Other values (31) | 147 |
Common
Value | Count | Frequency (%) |
22 | ||
- | 2 | 8.0% |
1 | 1 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 517 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 67 | |
o | 48 | 9.3% |
t | 42 | 8.1% |
e | 33 | 6.4% |
n | 32 | 6.2% |
s | 31 | 6.0% |
i | 31 | 6.0% |
r | 29 | 5.6% |
22 | 4.3% | |
u | 17 | 3.3% |
Other values (34) | 165 |
sex
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 4.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.4 KiB |
male | |
---|---|
female | |
none | 6 |
hermaphroditic | 1 |
Common Values
Value | Count | Frequency (%) |
male | 62 | |
female | 18 | 20.7% |
none | 6 | 6.9% |
hermaphroditic | 1 | 1.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
male | 62 | |
female | 18 | 20.7% |
none | 6 | 6.9% |
hermaphroditic | 1 | 1.1% |
Most occurring characters
Value | Count | Frequency (%) |
e | 105 | |
m | 81 | |
a | 81 | |
l | 80 | |
f | 18 | 4.6% |
n | 12 | 3.0% |
o | 7 | 1.8% |
h | 2 | 0.5% |
r | 2 | 0.5% |
i | 2 | 0.5% |
Other values (4) | 4 | 1.0% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 394 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 105 | |
m | 81 | |
a | 81 | |
l | 80 | |
f | 18 | 4.6% |
n | 12 | 3.0% |
o | 7 | 1.8% |
h | 2 | 0.5% |
r | 2 | 0.5% |
i | 2 | 0.5% |
Other values (4) | 4 | 1.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 394 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 105 | |
m | 81 | |
a | 81 | |
l | 80 | |
f | 18 | 4.6% |
n | 12 | 3.0% |
o | 7 | 1.8% |
h | 2 | 0.5% |
r | 2 | 0.5% |
i | 2 | 0.5% |
Other values (4) | 4 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 394 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 105 | |
m | 81 | |
a | 81 | |
l | 80 | |
f | 18 | 4.6% |
n | 12 | 3.0% |
o | 7 | 1.8% |
h | 2 | 0.5% |
r | 2 | 0.5% |
i | 2 | 0.5% |
Other values (4) | 4 | 1.0% |
gender
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.7 KiB |
masculine | |
---|---|
feminine |
Common Values
Value | Count | Frequency (%) |
masculine | 68 | |
feminine | 19 | 21.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
masculine | 68 | |
feminine | 19 | 21.8% |
Most occurring characters
Value | Count | Frequency (%) |
i | 106 | |
n | 106 | |
e | 106 | |
m | 87 | |
a | 68 | |
s | 68 | |
c | 68 | |
u | 68 | |
l | 68 | |
f | 19 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 764 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 106 | |
n | 106 | |
e | 106 | |
m | 87 | |
a | 68 | |
s | 68 | |
c | 68 | |
u | 68 | |
l | 68 | |
f | 19 | 2.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 764 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 106 | |
n | 106 | |
e | 106 | |
m | 87 | |
a | 68 | |
s | 68 | |
c | 68 | |
u | 68 | |
l | 68 | |
f | 19 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 764 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
i | 106 | |
n | 106 | |
e | 106 | |
m | 87 | |
a | 68 | |
s | 68 | |
c | 68 | |
u | 68 | |
l | 68 | |
f | 19 | 2.5% |
pronoun
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 KiB |
he/him | |
---|---|
she/her |
Common Values
Value | Count | Frequency (%) |
he/him | 68 | |
she/her | 19 | 21.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
he/him | 68 | |
she/her | 19 | 21.8% |
Most occurring characters
Value | Count | Frequency (%) |
h | 174 | |
e | 106 | |
/ | 87 | |
i | 68 | 12.6% |
m | 68 | 12.6% |
s | 19 | 3.5% |
r | 19 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 454 | |
Other Punctuation | 87 | 16.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
h | 174 | |
e | 106 | |
i | 68 | 15.0% |
m | 68 | 15.0% |
s | 19 | 4.2% |
r | 19 | 4.2% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 87 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 454 | |
Common | 87 | 16.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
h | 174 | |
e | 106 | |
i | 68 | 15.0% |
m | 68 | 15.0% |
s | 19 | 4.2% |
r | 19 | 4.2% |
Common
Value | Count | Frequency (%) |
/ | 87 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 541 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
h | 174 | |
e | 106 | |
/ | 87 | |
i | 68 | 12.6% |
m | 68 | 12.6% |
s | 19 | 3.5% |
r | 19 | 3.5% |
homeworld
Text
MISSING
 
Distinct | 54 |
---|---|
Distinct (%) | 65.1% |
Missing | 4 |
Missing (%) | 4.6% |
Memory size | 5.5 KiB |
Value | Count | Frequency (%) |
naboo | 11 | 12.0% |
tatooine | 10 | 10.9% |
alderaan | 3 | 3.3% |
coruscant | 3 | 3.3% |
kamino | 3 | 3.3% |
mirial | 2 | 2.2% |
kashyyyk | 2 | 2.2% |
corellia | 2 | 2.2% |
ryloth | 2 | 2.2% |
nal | 1 | 1.1% |
Other values (53) | 53 |
Most occurring characters
Value | Count | Frequency (%) |
a | 82 | |
o | 77 | |
n | 45 | 7.6% |
i | 41 | 6.9% |
e | 38 | 6.4% |
r | 34 | 5.7% |
t | 27 | 4.6% |
l | 27 | 4.6% |
s | 18 | 3.0% |
u | 16 | 2.7% |
Other values (36) | 187 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 490 | |
Uppercase Letter | 92 | 15.5% |
Space Separator | 9 | 1.5% |
Decimal Number | 1 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 82 | |
o | 77 | |
n | 45 | |
i | 41 | |
e | 38 | |
r | 34 | 6.9% |
t | 27 | 5.5% |
l | 27 | 5.5% |
s | 18 | 3.7% |
u | 16 | 3.3% |
Other values (12) | 85 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 15 | |
N | 14 | |
C | 9 | |
K | 7 | 7.6% |
S | 7 | 7.6% |
A | 5 | 5.4% |
M | 5 | 5.4% |
H | 4 | 4.3% |
D | 4 | 4.3% |
R | 3 | 3.3% |
Other values (12) | 19 |
Space Separator
Value | Count | Frequency (%) |
9 |
Decimal Number
Value | Count | Frequency (%) |
4 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 582 | |
Common | 10 | 1.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 82 | |
o | 77 | |
n | 45 | 7.7% |
i | 41 | 7.0% |
e | 38 | 6.5% |
r | 34 | 5.8% |
t | 27 | 4.6% |
l | 27 | 4.6% |
s | 18 | 3.1% |
u | 16 | 2.7% |
Other values (34) | 177 |
Common
Value | Count | Frequency (%) |
9 | ||
4 | 1 | 10.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 592 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 82 | |
o | 77 | |
n | 45 | 7.6% |
i | 41 | 6.9% |
e | 38 | 6.4% |
r | 34 | 5.7% |
t | 27 | 4.6% |
l | 27 | 4.6% |
s | 18 | 3.0% |
u | 16 | 2.7% |
Other values (36) | 187 |
species
Categorical
HIGH CORRELATION
 
Distinct | 39 |
---|---|
Distinct (%) | 44.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 KiB |
Human | |
---|---|
Droid | |
Gungan | 3 |
Mirialan | 2 |
Wookiee | 2 |
Other values (34) |
Common Values
Value | Count | Frequency (%) |
Human | 38 | |
Droid | 6 | 6.9% |
Gungan | 3 | 3.4% |
Mirialan | 2 | 2.3% |
Wookiee | 2 | 2.3% |
Twi'lek | 2 | 2.3% |
Kaminoan | 2 | 2.3% |
Neimodian | 1 | 1.1% |
Hutt | 1 | 1.1% |
Yoda's species | 1 | 1.1% |
Other values (29) | 29 |
Length
Value | Count | Frequency (%) |
human | 38 | |
droid | 6 | 6.6% |
gungan | 3 | 3.3% |
mirialan | 2 | 2.2% |
wookiee | 2 | 2.2% |
twi'lek | 2 | 2.2% |
kaminoan | 2 | 2.2% |
zabrak | 2 | 2.2% |
besalisk | 1 | 1.1% |
tholothian | 1 | 1.1% |
Other values (32) | 32 |
Most occurring characters
Value | Count | Frequency (%) |
a | 84 | |
n | 73 | |
u | 52 | |
m | 44 | 8.1% |
H | 39 | 7.2% |
o | 32 | 5.9% |
i | 31 | 5.7% |
e | 24 | 4.4% |
r | 21 | 3.9% |
l | 15 | 2.8% |
Other values (35) | 129 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 446 | |
Uppercase Letter | 90 | 16.5% |
Space Separator | 4 | 0.7% |
Other Punctuation | 4 | 0.7% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
H | 39 | |
D | 8 | 8.9% |
T | 7 | 7.8% |
M | 4 | 4.4% |
C | 4 | 4.4% |
K | 4 | 4.4% |
G | 4 | 4.4% |
N | 2 | 2.2% |
S | 2 | 2.2% |
I | 2 | 2.2% |
Other values (12) | 14 | 15.6% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 84 | |
n | 73 | |
u | 52 | |
m | 44 | |
o | 32 | 7.2% |
i | 31 | 7.0% |
e | 24 | 5.4% |
r | 21 | 4.7% |
l | 15 | 3.4% |
d | 13 | 2.9% |
Other values (11) | 57 |
Space Separator
Value | Count | Frequency (%) |
4 |
Other Punctuation
Value | Count | Frequency (%) |
' | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 536 | |
Common | 8 | 1.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 84 | |
n | 73 | |
u | 52 | |
m | 44 | 8.2% |
H | 39 | 7.3% |
o | 32 | 6.0% |
i | 31 | 5.8% |
e | 24 | 4.5% |
r | 21 | 3.9% |
l | 15 | 2.8% |
Other values (33) | 121 |
Common
Value | Count | Frequency (%) |
4 | ||
' | 4 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 544 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 84 | |
n | 73 | |
u | 52 | |
m | 44 | 8.1% |
H | 39 | 7.2% |
o | 32 | 5.9% |
i | 31 | 5.7% |
e | 24 | 4.4% |
r | 21 | 3.9% |
l | 15 | 2.8% |
Other values (35) | 129 |
occupation
Text
Distinct | 51 |
---|---|
Distinct (%) | 58.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.5 KiB |
Length
Max length | 42 |
---|---|
Median length | 29 |
Mean length | 16.977011 |
Min length | 4 |
Characters and Unicode
Total characters | 1477 |
---|---|
Distinct characters | 51 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 40 ? |
---|---|
Unique (%) | 46.0% |
Sample
1st row | Jedi Master |
---|---|
2nd row | Protocol droid |
3rd row | Astromech droid |
4th row | Dark Lord of the Sith |
5th row | Princess of Alderaan |
Value | Count | Frequency (%) |
jedi | 20 | 8.5% |
of | 19 | 8.1% |
the | 15 | 6.4% |
master | 12 | 5.1% |
pilot | 9 | 3.8% |
hunter | 6 | 2.5% |
droid | 6 | 2.5% |
general | 5 | 2.1% |
alliance | 5 | 2.1% |
podracer | 5 | 2.1% |
Other values (90) | 134 |
Most occurring characters
Value | Count | Frequency (%) |
e | 164 | 11.1% |
151 | 10.2% | |
r | 115 | 7.8% |
i | 110 | 7.4% |
t | 100 | 6.8% |
a | 97 | 6.6% |
o | 96 | 6.5% |
n | 76 | 5.1% |
d | 64 | 4.3% |
l | 55 | 3.7% |
Other values (41) | 449 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1154 | |
Uppercase Letter | 166 | 11.2% |
Space Separator | 151 | 10.2% |
Dash Punctuation | 2 | 0.1% |
Other Punctuation | 2 | 0.1% |
Decimal Number | 2 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 164 | |
r | 115 | |
i | 110 | |
t | 100 | |
a | 97 | |
o | 96 | |
n | 76 | 6.6% |
d | 64 | 5.5% |
l | 55 | 4.8% |
s | 43 | 3.7% |
Other values (14) | 234 |
Uppercase Letter
Value | Count | Frequency (%) |
J | 22 | |
C | 21 | |
M | 18 | |
A | 17 | |
P | 16 | |
G | 11 | 6.6% |
S | 11 | 6.6% |
B | 8 | 4.8% |
R | 7 | 4.2% |
H | 6 | 3.6% |
Other values (12) | 29 |
Decimal Number
Value | Count | Frequency (%) |
9 | 1 | |
0 | 1 |
Space Separator
Value | Count | Frequency (%) |
151 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Other Punctuation
Value | Count | Frequency (%) |
' | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1320 | |
Common | 157 | 10.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 164 | |
r | 115 | 8.7% |
i | 110 | 8.3% |
t | 100 | 7.6% |
a | 97 | 7.3% |
o | 96 | 7.3% |
n | 76 | 5.8% |
d | 64 | 4.8% |
l | 55 | 4.2% |
s | 43 | 3.3% |
Other values (36) | 400 |
Common
Value | Count | Frequency (%) |
151 | ||
- | 2 | 1.3% |
' | 2 | 1.3% |
9 | 1 | 0.6% |
0 | 1 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1476 | |
None | 1 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 164 | 11.1% |
151 | 10.2% | |
r | 115 | 7.8% |
i | 110 | 7.5% |
t | 100 | 6.8% |
a | 97 | 6.6% |
o | 96 | 6.5% |
n | 76 | 5.1% |
d | 64 | 4.3% |
l | 55 | 3.7% |
Other values (40) | 448 |
None
Value | Count | Frequency (%) |
é | 1 |
cybernetics
Text
MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 100.0% |
Missing | 80 |
Missing (%) | 92.0% |
Memory size | 3.3 KiB |
Length
Max length | 45 |
---|---|
Median length | 38 |
Mean length | 30 |
Min length | 20 |
Characters and Unicode
Total characters | 210 |
---|---|
Distinct characters | 34 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 7 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | Prosthetic right hand |
---|---|
2nd row | Prosthetic arms and legs, life-support system |
3rd row | Cybernetic right arm |
4th row | AJ^6 cyborg construct |
5th row | Six-legged apparatus, Two cybernetic legs |
Value | Count | Frequency (%) |
prosthetic | 2 | 7.4% |
legs | 2 | 7.4% |
cybernetic | 2 | 7.4% |
right | 2 | 7.4% |
six-legged | 1 | 3.7% |
for | 1 | 3.7% |
except | 1 | 3.7% |
cebernetic | 1 | 3.7% |
completely | 1 | 3.7% |
annunciator | 1 | 3.7% |
Other values (13) | 13 |
Most occurring characters
Value | Count | Frequency (%) |
e | 19 | 9.0% |
t | 18 | 8.6% |
18 | 8.6% | |
r | 17 | 8.1% |
c | 13 | 6.2% |
a | 11 | 5.2% |
o | 11 | 5.2% |
i | 11 | 5.2% |
s | 10 | 4.8% |
n | 10 | 4.8% |
Other values (24) | 72 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 173 | |
Space Separator | 20 | 9.5% |
Uppercase Letter | 10 | 4.8% |
Other Punctuation | 3 | 1.4% |
Dash Punctuation | 2 | 1.0% |
Modifier Symbol | 1 | 0.5% |
Decimal Number | 1 | 0.5% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 19 | |
t | 18 | 10.4% |
r | 17 | 9.8% |
c | 13 | 7.5% |
a | 11 | 6.4% |
o | 11 | 6.4% |
i | 11 | 6.4% |
s | 10 | 5.8% |
n | 10 | 5.8% |
l | 7 | 4.0% |
Other values (11) | 46 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 2 | |
C | 2 | |
P | 2 | |
J | 1 | |
S | 1 | |
T | 1 | |
V | 1 |
Space Separator
Value | Count | Frequency (%) |
18 | ||
 | 2 | 10.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 3 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Modifier Symbol
Value | Count | Frequency (%) |
^ | 1 |
Decimal Number
Value | Count | Frequency (%) |
6 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 183 | |
Common | 27 | 12.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 19 | 10.4% |
t | 18 | 9.8% |
r | 17 | 9.3% |
c | 13 | 7.1% |
a | 11 | 6.0% |
o | 11 | 6.0% |
i | 11 | 6.0% |
s | 10 | 5.5% |
n | 10 | 5.5% |
l | 7 | 3.8% |
Other values (18) | 56 |
Common
Value | Count | Frequency (%) |
18 | ||
, | 3 | 11.1% |
- | 2 | 7.4% |
 | 2 | 7.4% |
^ | 1 | 3.7% |
6 | 1 | 3.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 208 | |
None | 2 | 1.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 19 | 9.1% |
t | 18 | 8.7% |
18 | 8.7% | |
r | 17 | 8.2% |
c | 13 | 6.2% |
a | 11 | 5.3% |
o | 11 | 5.3% |
i | 11 | 5.3% |
s | 10 | 4.8% |
n | 10 | 4.8% |
Other values (23) | 70 |
None
Value | Count | Frequency (%) |
 | 2 |
abilities
Categorical
HIGH CORRELATION
  MISSING
 
Distinct | 25 |
---|---|
Distinct (%) | 45.5% |
Missing | 32 |
Missing (%) | 36.8% |
Memory size | 5.6 KiB |
Lightsaber abilities, Force powers | |
---|---|
Piloting | |
Piloting, Racing | |
Politics | |
Lightsaber training, Force powers, Other abilities | |
Other values (20) |
Length
Max length | 53 |
---|---|
Median length | 50 |
Mean length | 25.509091 |
Min length | 8 |
Characters and Unicode
Total characters | 1403 |
---|---|
Distinct characters | 38 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 18 ? |
---|---|
Unique (%) | 32.7% |
Sample
1st row | Lightsaber abilities, Force powers, Other abilities |
---|---|
2nd row | Language known, Other skills |
3rd row | Lightsaber abilities, Force powers, Language known |
4th row | Jedi training, Force powers, Other abilities |
5th row | Blaster abilities |
Common Values
Value | Count | Frequency (%) |
Lightsaber abilities, Force powers | 11 | 12.6% |
Piloting | 10 | 11.5% |
Piloting, Racing | 5 | 5.7% |
Politics | 4 | 4.6% |
Lightsaber training, Force powers, Other abilities | 3 | 3.4% |
Lightsaber training, Force powers | 2 | 2.3% |
Lightsaber abilities, Force powers, Force lightning | 2 | 2.3% |
Force sensitivy | 1 | 1.1% |
Lightsaber abilities, Force powers, Language known | 1 | 1.1% |
Jedi training, Force powers, Other abilities | 1 | 1.1% |
Other values (15) | 15 | |
(Missing) | 32 |
Length
Value | Count | Frequency (%) |
force | 29 | |
abilities | 25 | |
lightsaber | 24 | |
powers | 22 | |
piloting | 17 | |
training | 9 | 5.4% |
other | 8 | 4.8% |
racing | 5 | 3.0% |
politics | 5 | 3.0% |
lightning | 3 | 1.8% |
Other values (18) | 21 |
Most occurring characters
Value | Count | Frequency (%) |
i | 185 | |
e | 121 | 8.6% |
113 | 8.1% | |
t | 102 | 7.3% |
r | 100 | 7.1% |
s | 87 | 6.2% |
o | 79 | 5.6% |
a | 78 | 5.6% |
g | 68 | 4.8% |
n | 61 | 4.3% |
Other values (28) | 409 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1142 | |
Space Separator | 113 | 8.1% |
Uppercase Letter | 101 | 7.2% |
Other Punctuation | 45 | 3.2% |
Dash Punctuation | 2 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 185 | |
e | 121 | |
t | 102 | |
r | 100 | |
s | 87 | |
o | 79 | 6.9% |
a | 78 | 6.8% |
g | 68 | 6.0% |
n | 61 | 5.3% |
l | 58 | 5.1% |
Other values (12) | 203 |
Uppercase Letter
Value | Count | Frequency (%) |
F | 29 | |
L | 26 | |
P | 22 | |
O | 8 | 7.9% |
R | 5 | 5.0% |
S | 3 | 3.0% |
B | 2 | 2.0% |
T | 1 | 1.0% |
K | 1 | 1.0% |
D | 1 | 1.0% |
Other values (3) | 3 | 3.0% |
Space Separator
Value | Count | Frequency (%) |
113 |
Other Punctuation
Value | Count | Frequency (%) |
, | 45 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1243 | |
Common | 160 | 11.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 185 | |
e | 121 | 9.7% |
t | 102 | 8.2% |
r | 100 | 8.0% |
s | 87 | 7.0% |
o | 79 | 6.4% |
a | 78 | 6.3% |
g | 68 | 5.5% |
n | 61 | 4.9% |
l | 58 | 4.7% |
Other values (25) | 304 |
Common
Value | Count | Frequency (%) |
113 | ||
, | 45 | 28.1% |
- | 2 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1403 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
i | 185 | |
e | 121 | 8.6% |
113 | 8.1% | |
t | 102 | 7.3% |
r | 100 | 7.1% |
s | 87 | 6.2% |
o | 79 | 5.6% |
a | 78 | 5.6% |
g | 68 | 4.8% |
n | 61 | 4.3% |
Other values (28) | 409 |
equipment
Text
MISSING
 
Distinct | 31 |
---|---|
Distinct (%) | 50.0% |
Missing | 25 |
Missing (%) | 28.7% |
Memory size | 6.0 KiB |
Length
Max length | 180 |
---|---|
Median length | 124.5 |
Mean length | 27.370968 |
Min length | 7 |
Characters and Unicode
Total characters | 1697 |
---|---|
Distinct characters | 59 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 26 ? |
---|---|
Unique (%) | 41.9% |
Sample
1st row | Lightsabers, Blasters |
---|---|
2nd row | Buzz saw, Electric pike, Drinks tray, Fusion welder, Scomp link, Power recharge coupler, Rocket boosters, Holographic projector, Motorized all-terrain treads, Retractable third leg |
3rd row | Clothing, Lightsabers |
4th row | Defender sporting blaster pistol, X-30 Lancer target blast pistol, Lightsaber |
5th row | Blasters, SX-14 Field Hover-Ute, GX-8 Moisture Vaporators, Droids |
Value | Count | Frequency (%) |
lightsabers | 24 | 11.8% |
clothing | 13 | 6.4% |
blasters | 8 | 3.9% |
weapons | 7 | 3.4% |
armor | 4 | 2.0% |
helmet | 4 | 2.0% |
flight | 3 | 1.5% |
belt | 2 | 1.0% |
utility | 2 | 1.0% |
tools | 2 | 1.0% |
Other values (118) | 135 |
Most occurring characters
Value | Count | Frequency (%) |
e | 144 | 8.5% |
142 | 8.4% | |
s | 127 | 7.5% |
r | 123 | 7.2% |
t | 110 | 6.5% |
o | 107 | 6.3% |
a | 99 | 5.8% |
i | 98 | 5.8% |
l | 82 | 4.8% |
n | 71 | 4.2% |
Other values (49) | 594 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1312 | |
Uppercase Letter | 150 | 8.8% |
Space Separator | 142 | 8.4% |
Other Punctuation | 71 | 4.2% |
Decimal Number | 12 | 0.7% |
Dash Punctuation | 10 | 0.6% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 144 | |
s | 127 | |
r | 123 | |
t | 110 | 8.4% |
o | 107 | 8.2% |
a | 99 | 7.5% |
i | 98 | 7.5% |
l | 82 | 6.2% |
n | 71 | 5.4% |
h | 65 | 5.0% |
Other values (15) | 286 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 29 | |
C | 17 | |
H | 11 | 7.3% |
B | 10 | 6.7% |
A | 9 | 6.0% |
W | 9 | 6.0% |
R | 8 | 5.3% |
M | 7 | 4.7% |
G | 7 | 4.7% |
F | 6 | 4.0% |
Other values (12) | 37 |
Decimal Number
Value | Count | Frequency (%) |
4 | 3 | |
8 | 2 | |
3 | 2 | |
0 | 2 | |
9 | 1 | 8.3% |
7 | 1 | 8.3% |
1 | 1 | 8.3% |
Other Punctuation
Value | Count | Frequency (%) |
, | 68 | |
/ | 2 | 2.8% |
. | 1 | 1.4% |
Space Separator
Value | Count | Frequency (%) |
142 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1462 | |
Common | 235 | 13.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 144 | 9.8% |
s | 127 | 8.7% |
r | 123 | 8.4% |
t | 110 | 7.5% |
o | 107 | 7.3% |
a | 99 | 6.8% |
i | 98 | 6.7% |
l | 82 | 5.6% |
n | 71 | 4.9% |
h | 65 | 4.4% |
Other values (37) | 436 |
Common
Value | Count | Frequency (%) |
142 | ||
, | 68 | |
- | 10 | 4.3% |
4 | 3 | 1.3% |
8 | 2 | 0.9% |
3 | 2 | 0.9% |
0 | 2 | 0.9% |
/ | 2 | 0.9% |
9 | 1 | 0.4% |
. | 1 | 0.4% |
Other values (2) | 2 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1697 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 144 | 8.5% |
142 | 8.4% | |
s | 127 | 7.5% |
r | 123 | 7.2% |
t | 110 | 6.5% |
o | 107 | 6.3% |
a | 99 | 5.8% |
i | 98 | 5.8% |
l | 82 | 4.8% |
n | 71 | 4.2% |
Other values (49) | 594 |
films
Categorical
HIGH CORRELATION
 
Distinct | 24 |
---|---|
Distinct (%) | 27.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.2 KiB |
Attack of the Clones | |
---|---|
The Phantom Menace | |
The Phantom Menace, Attack of the Clones, Revenge of the Sith | |
Attack of the Clones, Revenge of the Sith | |
The Force Awakens | |
Other values (19) |
Length
Max length | 137 |
---|---|
Median length | 95 |
Mean length | 38.218391 |
Min length | 10 |
Characters and Unicode
Total characters | 3325 |
---|---|
Distinct characters | 35 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 8 ? |
---|---|
Unique (%) | 9.2% |
Sample
1st row | A New Hope, The Empire Strikes Back, Return of the Jedi, Revenge of the Sith, The Force Awakens |
---|---|
2nd row | A New Hope, The Empire Strikes Back, Return of the Jedi, The Phantom Menace, Attack of the Clones, Revenge of the Sith |
3rd row | A New Hope, The Empire Strikes Back, Return of the Jedi, The Phantom Menace, Attack of the Clones, Revenge of the Sith, The Force Awakens |
4th row | A New Hope, The Empire Strikes Back, Return of the Jedi, Revenge of the Sith |
5th row | A New Hope, The Empire Strikes Back, Return of the Jedi, Revenge of the Sith, The Force Awakens |
Common Values
Value | Count | Frequency (%) |
Attack of the Clones | 13 | |
The Phantom Menace | 13 | |
The Phantom Menace, Attack of the Clones, Revenge of the Sith | 8 | 9.2% |
Attack of the Clones, Revenge of the Sith | 7 | 8.0% |
The Force Awakens | 5 | 5.7% |
Return of the Jedi | 5 | 5.7% |
A New Hope | 4 | 4.6% |
The Phantom Menace, Attack of the Clones | 4 | 4.6% |
The Empire Strikes Back | 3 | 3.4% |
Revenge of the Sith | 3 | 3.4% |
Other values (14) | 22 |
Length
Value | Count | Frequency (%) |
the | 155 | |
of | 94 | |
attack | 40 | 6.4% |
clones | 40 | 6.4% |
phantom | 34 | 5.4% |
menace | 34 | 5.4% |
revenge | 34 | 5.4% |
sith | 34 | 5.4% |
return | 20 | 3.2% |
jedi | 20 | 3.2% |
Other values (8) | 124 |
Most occurring characters
Value | Count | Frequency (%) |
542 | ||
e | 495 | |
t | 278 | 8.4% |
h | 223 | 6.7% |
o | 197 | 5.9% |
n | 173 | 5.2% |
a | 135 | 4.1% |
c | 101 | 3.0% |
f | 94 | 2.8% |
i | 86 | 2.6% |
Other values (25) | 1001 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 2256 | |
Space Separator | 542 | 16.3% |
Uppercase Letter | 441 | 13.3% |
Other Punctuation | 86 | 2.6% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 495 | |
t | 278 | |
h | 223 | |
o | 197 | 8.7% |
n | 173 | 7.7% |
a | 135 | 6.0% |
c | 101 | 4.5% |
f | 94 | 4.2% |
i | 86 | 3.8% |
k | 83 | 3.7% |
Other values (10) | 391 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 69 | |
T | 61 | |
R | 54 | |
S | 50 | |
C | 40 | |
M | 34 | |
P | 34 | |
J | 20 | 4.5% |
N | 18 | 4.1% |
H | 18 | 4.1% |
Other values (3) | 43 |
Space Separator
Value | Count | Frequency (%) |
542 |
Other Punctuation
Value | Count | Frequency (%) |
, | 86 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2697 | |
Common | 628 | 18.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 495 | |
t | 278 | 10.3% |
h | 223 | 8.3% |
o | 197 | 7.3% |
n | 173 | 6.4% |
a | 135 | 5.0% |
c | 101 | 3.7% |
f | 94 | 3.5% |
i | 86 | 3.2% |
k | 83 | 3.1% |
Other values (23) | 832 |
Common
Value | Count | Frequency (%) |
542 | ||
, | 86 | 13.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3325 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
542 | ||
e | 495 | |
t | 278 | 8.4% |
h | 223 | 6.7% |
o | 197 | 5.9% |
n | 173 | 5.2% |
a | 135 | 4.1% |
c | 101 | 3.0% |
f | 94 | 2.8% |
i | 86 | 2.6% |
Other values (25) | 1001 |
vehicles
Text
MISSING
 
Distinct | 14 |
---|---|
Distinct (%) | 93.3% |
Missing | 72 |
Missing (%) | 82.8% |
Memory size | 3.5 KiB |
Length
Max length | 49 |
---|---|
Median length | 26 |
Mean length | 20.8 |
Min length | 5 |
Characters and Unicode
Total characters | 312 |
---|---|
Distinct characters | 48 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 13 ? |
---|---|
Unique (%) | 86.7% |
Sample
1st row | Snowspeeder, Imperial Speeder Bike |
---|---|
2nd row | Imperial Speeder Bike |
3rd row | Zephyr-G swoop bike, V-35 Courier, T-16 Skyhopper |
4th row | Tribubble bongo |
5th row | Zephyr-G swoop bike, XJ-6 airspeeder |
Value | Count | Frequency (%) |
bike | 5 | 11.6% |
speeder | 4 | 9.3% |
tribubble | 2 | 4.7% |
snowspeeder | 2 | 4.7% |
imperial | 2 | 4.7% |
zephyr-g | 2 | 4.7% |
swoop | 2 | 4.7% |
bongo | 2 | 4.7% |
airspeeder | 2 | 4.7% |
flitknot | 1 | 2.3% |
Other values (19) | 19 |
Most occurring characters
Value | Count | Frequency (%) |
e | 47 | |
28 | 9.0% | |
r | 27 | 8.7% |
o | 19 | 6.1% |
p | 18 | 5.8% |
i | 16 | 5.1% |
d | 13 | 4.2% |
b | 11 | 3.5% |
s | 10 | 3.2% |
a | 9 | 2.9% |
Other values (38) | 114 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 224 | |
Uppercase Letter | 36 | 11.5% |
Space Separator | 28 | 9.0% |
Decimal Number | 12 | 3.8% |
Dash Punctuation | 8 | 2.6% |
Other Punctuation | 4 | 1.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 47 | |
r | 27 | |
o | 19 | |
p | 18 | 8.0% |
i | 16 | 7.1% |
d | 13 | 5.8% |
b | 11 | 4.9% |
s | 10 | 4.5% |
a | 9 | 4.0% |
l | 8 | 3.6% |
Other values (12) | 46 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 7 | |
S | 7 | |
B | 3 | |
Z | 2 | 5.6% |
I | 2 | 5.6% |
P | 2 | 5.6% |
V | 2 | 5.6% |
G | 2 | 5.6% |
X | 1 | 2.8% |
J | 1 | 2.8% |
Other values (7) | 7 |
Decimal Number
Value | Count | Frequency (%) |
6 | 3 | |
3 | 3 | |
2 | 2 | |
1 | 2 | |
5 | 1 | 8.3% |
7 | 1 | 8.3% |
Space Separator
Value | Count | Frequency (%) |
28 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 8 |
Other Punctuation
Value | Count | Frequency (%) |
, | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 260 | |
Common | 52 | 16.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 47 | |
r | 27 | 10.4% |
o | 19 | 7.3% |
p | 18 | 6.9% |
i | 16 | 6.2% |
d | 13 | 5.0% |
b | 11 | 4.2% |
s | 10 | 3.8% |
a | 9 | 3.5% |
l | 8 | 3.1% |
Other values (29) | 82 |
Common
Value | Count | Frequency (%) |
28 | ||
- | 8 | 15.4% |
, | 4 | 7.7% |
6 | 3 | 5.8% |
3 | 3 | 5.8% |
2 | 2 | 3.8% |
1 | 2 | 3.8% |
5 | 1 | 1.9% |
7 | 1 | 1.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 312 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 47 | |
28 | 9.0% | |
r | 27 | 8.7% |
o | 19 | 6.1% |
p | 18 | 5.8% |
i | 16 | 5.1% |
d | 13 | 4.2% |
b | 11 | 3.5% |
s | 10 | 3.2% |
a | 9 | 2.9% |
Other values (38) | 114 |
starships
Text
MISSING
 
Distinct | 15 |
---|---|
Distinct (%) | 75.0% |
Missing | 67 |
Missing (%) | 77.0% |
Memory size | 3.8 KiB |
Length
Max length | 104 |
---|---|
Median length | 35 |
Mean length | 23.7 |
Min length | 6 |
Characters and Unicode
Total characters | 474 |
---|---|
Distinct characters | 41 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 12 ? |
---|---|
Unique (%) | 60.0% |
Sample
1st row | X-wing, Imperial shuttle |
---|---|
2nd row | TIE Advanced x1 |
3rd row | X-wing |
4th row | Jedi starfighter, Trade Federation cruiser, Naboo star skiff, Jedi Interceptor, Belbullab-22 starfighter |
5th row | Naboo fighter, Trade Federation cruiser, Jedi Interceptor |
Value | Count | Frequency (%) |
naboo | 6 | 9.7% |
x-wing | 5 | 8.1% |
falcon | 4 | 6.5% |
jedi | 4 | 6.5% |
starfighter | 4 | 6.5% |
millennium | 4 | 6.5% |
imperial | 3 | 4.8% |
shuttle | 3 | 4.8% |
fighter | 3 | 4.8% |
belbullab-22 | 2 | 3.2% |
Other values (18) | 24 |
Most occurring characters
Value | Count | Frequency (%) |
42 | 8.9% | |
i | 38 | 8.0% |
e | 38 | 8.0% |
a | 32 | 6.8% |
r | 30 | 6.3% |
t | 29 | 6.1% |
l | 26 | 5.5% |
n | 24 | 5.1% |
o | 21 | 4.4% |
s | 14 | 3.0% |
Other values (31) | 180 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 361 | |
Uppercase Letter | 45 | 9.5% |
Space Separator | 42 | 8.9% |
Other Punctuation | 11 | 2.3% |
Dash Punctuation | 9 | 1.9% |
Decimal Number | 6 | 1.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 38 | 10.5% |
e | 38 | 10.5% |
a | 32 | 8.9% |
r | 30 | 8.3% |
t | 29 | 8.0% |
l | 26 | 7.2% |
n | 24 | 6.6% |
o | 21 | 5.8% |
s | 14 | 3.9% |
g | 13 | 3.6% |
Other values (13) | 96 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 7 | |
I | 6 | |
F | 6 | |
X | 5 | |
J | 4 | |
M | 4 | |
T | 3 | |
S | 3 | |
A | 2 | 4.4% |
B | 2 | 4.4% |
Other values (3) | 3 |
Decimal Number
Value | Count | Frequency (%) |
2 | 4 | |
1 | 2 |
Space Separator
Value | Count | Frequency (%) |
42 |
Other Punctuation
Value | Count | Frequency (%) |
, | 11 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 406 | |
Common | 68 | 14.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 38 | 9.4% |
e | 38 | 9.4% |
a | 32 | 7.9% |
r | 30 | 7.4% |
t | 29 | 7.1% |
l | 26 | 6.4% |
n | 24 | 5.9% |
o | 21 | 5.2% |
s | 14 | 3.4% |
g | 13 | 3.2% |
Other values (26) | 141 |
Common
Value | Count | Frequency (%) |
42 | ||
, | 11 | 16.2% |
- | 9 | 13.2% |
2 | 4 | 5.9% |
1 | 2 | 2.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 474 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
42 | 8.9% | |
i | 38 | 8.0% |
e | 38 | 8.0% |
a | 32 | 6.8% |
r | 30 | 6.3% |
t | 29 | 6.1% |
l | 26 | 5.5% |
n | 24 | 5.1% |
o | 21 | 4.4% |
s | 14 | 3.0% |
Other values (31) | 180 |
photo
URL
UNIQUE
 
Distinct | 87 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.1 KiB |
https://static.wikia.nocookie.net/starwars/images/d/d9/Luke-rotjpromo.jpg | 1 |
---|---|
https://static.wikia.nocookie.net/starwars/images/9/96/Yarael_Poof.png | 1 |
https://static.wikia.nocookie.net/starwars/images/a/a4/BarrissOffee-OP.png | 1 |
https://static.wikia.nocookie.net/starwars/images/9/91/LuminaraUnduli-Encyclopedia.png | 1 |
https://static.wikia.nocookie.net/starwars/images/9/93/Poggle_the_lesser_-_sw_card_trader.png | 1 |
Other values (82) |
Value | Count | Frequency (%) |
https://static.wikia.nocookie.net/starwars/images/d/d9/Luke-rotjpromo.jpg | 1 | 1.1% |
https://static.wikia.nocookie.net/starwars/images/9/96/Yarael_Poof.png | 1 | 1.1% |
https://static.wikia.nocookie.net/starwars/images/a/a4/BarrissOffee-OP.png | 1 | 1.1% |
https://static.wikia.nocookie.net/starwars/images/9/91/LuminaraUnduli-Encyclopedia.png | 1 | 1.1% |
https://static.wikia.nocookie.net/starwars/images/9/93/Poggle_the_lesser_-_sw_card_trader.png | 1 | 1.1% |
https://static.wikia.nocookie.net/starwars/images/c/c1/ClieggLars-FF72.png | 1 | 1.1% |
https://static.wikia.nocookie.net/starwars/images/9/95/Corde-SWCTP.png | 1 | 1.1% |
https://static.wikia.nocookie.net/starwars/images/6/6f/GregarTypho-FF103.png | 1 | 1.1% |
https://static.wikia.nocookie.net/starwars/images/1/14/MasAmedda-BTAHE3.png | 1 | 1.1% |
https://static.wikia.nocookie.net/starwars/images/c/c4/Plo_Koon_TPM.png | 1 | 1.1% |
Other values (77) | 77 |
Value | Count | Frequency (%) |
https | 87 |
Value | Count | Frequency (%) |
static.wikia.nocookie.net | 87 |
Value | Count | Frequency (%) |
/starwars/images/d/d9/Luke-rotjpromo.jpg | 1 | 1.1% |
/starwars/images/9/96/Yarael_Poof.png | 1 | 1.1% |
/starwars/images/a/a4/BarrissOffee-OP.png | 1 | 1.1% |
/starwars/images/9/91/LuminaraUnduli-Encyclopedia.png | 1 | 1.1% |
/starwars/images/9/93/Poggle_the_lesser_-_sw_card_trader.png | 1 | 1.1% |
/starwars/images/c/c1/ClieggLars-FF72.png | 1 | 1.1% |
/starwars/images/9/95/Corde-SWCTP.png | 1 | 1.1% |
/starwars/images/6/6f/GregarTypho-FF103.png | 1 | 1.1% |
/starwars/images/1/14/MasAmedda-BTAHE3.png | 1 | 1.1% |
/starwars/images/c/c4/Plo_Koon_TPM.png | 1 | 1.1% |
Other values (77) | 77 |
Value | Count | Frequency (%) |
87 |
Value | Count | Frequency (%) |
87 |
abilities | birth_era | birth_year | death_era | death_year | eye_color | films | gender | hair_color | height | mass | pronoun | sex | skin_color | species | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
abilities | 1.000 | 0.000 | -0.024 | 0.435 | -0.017 | 0.345 | 0.446 | 0.119 | 0.320 | -0.157 | -0.111 | 0.119 | 0.542 | 0.159 | 0.000 |
birth_era | 0.000 | 1.000 | 0.469 | 0.000 | -0.343 | 0.000 | 0.764 | 0.000 | 0.163 | 0.088 | 0.252 | 0.000 | 0.000 | 0.000 | 0.000 |
birth_year | -0.024 | 0.469 | 1.000 | 0.170 | -0.258 | 0.441 | 0.287 | 0.000 | 0.287 | 0.099 | 0.174 | 0.000 | 0.531 | 0.380 | 0.738 |
death_era | 0.435 | 0.000 | 0.170 | 1.000 | -0.196 | 0.000 | 0.709 | 0.000 | 0.291 | 0.084 | -0.098 | 0.000 | 0.060 | 0.000 | 0.000 |
death_year | -0.017 | -0.343 | -0.258 | -0.196 | 1.000 | 0.000 | 0.486 | 0.000 | 0.247 | -0.072 | -0.243 | 0.000 | 0.000 | 0.100 | 0.089 |
eye_color | 0.345 | 0.000 | 0.441 | 0.000 | 0.000 | 1.000 | 0.000 | 0.221 | 0.284 | -0.066 | -0.026 | 0.221 | 0.309 | 0.360 | 0.437 |
films | 0.446 | 0.764 | 0.287 | 0.709 | 0.486 | 0.000 | 1.000 | 0.000 | 0.314 | 0.111 | -0.095 | 0.000 | 0.519 | 0.054 | 0.000 |
gender | 0.119 | 0.000 | 0.000 | 0.000 | 0.000 | 0.221 | 0.000 | 1.000 | 0.092 | 0.307 | 0.421 | 0.966 | 0.959 | 0.000 | 0.000 |
hair_color | 0.320 | 0.163 | 0.287 | 0.291 | 0.247 | 0.284 | 0.314 | 0.092 | 1.000 | 0.189 | 0.004 | 0.092 | 0.000 | 0.000 | 0.000 |
height | -0.157 | 0.088 | 0.099 | 0.084 | -0.072 | -0.066 | 0.111 | 0.307 | 0.189 | 1.000 | 0.757 | 0.418 | 0.359 | 0.442 | 0.576 |
mass | -0.111 | 0.252 | 0.174 | -0.098 | -0.243 | -0.026 | -0.095 | 0.421 | 0.004 | 0.757 | 1.000 | 0.000 | 0.686 | 0.768 | 0.718 |
pronoun | 0.119 | 0.000 | 0.000 | 0.000 | 0.000 | 0.221 | 0.000 | 0.966 | 0.092 | 0.418 | 0.000 | 1.000 | 0.959 | 0.000 | 0.000 |
sex | 0.542 | 0.000 | 0.531 | 0.060 | 0.000 | 0.309 | 0.519 | 0.959 | 0.000 | 0.359 | 0.686 | 0.959 | 1.000 | 0.622 | 0.609 |
skin_color | 0.159 | 0.000 | 0.380 | 0.000 | 0.100 | 0.360 | 0.054 | 0.000 | 0.000 | 0.442 | 0.768 | 0.000 | 0.622 | 1.000 | 0.576 |
species | 0.000 | 0.000 | 0.738 | 0.000 | 0.089 | 0.437 | 0.000 | 0.000 | 0.000 | 0.576 | 0.718 | 0.000 | 0.609 | 0.576 | 1.000 |
name | height | mass | hair_color | skin_color | eye_color | birth_year | birth_era | birth_place | death_year | death_era | death_place | sex | gender | pronoun | homeworld | species | occupation | cybernetics | abilities | equipment | films | vehicles | starships | photo | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | Luke Skywalker | 172.0 | 77.0 | blond | light | blue | 19.0 | BBY | Polis Massa | 34.0 | ABY | Ahch-To | male | masculine | he/him | Tatooine | Human | Jedi Master | Prosthetic right hand | Lightsaber abilities, Force powers, Other abilities | Lightsabers, Blasters | A New Hope, The Empire Strikes Back, Return of the Jedi, Revenge of the Sith, The Force Awakens | Snowspeeder, Imperial Speeder Bike | X-wing, Imperial shuttle | https://static.wikia.nocookie.net/starwars/images/d/d9/Luke-rotjpromo.jpg |
1 | C-3PO | 167.0 | 75.0 | NaN | gold | yellow | 112.0 | BBY | Affa | 3.0 | ABY | Bespin | none | masculine | he/him | Tatooine | Droid | Protocol droid | NaN | Language known, Other skills | NaN | A New Hope, The Empire Strikes Back, Return of the Jedi, The Phantom Menace, Attack of the Clones, Revenge of the Sith | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/5/51/C-3PO_EP3.png |
2 | R2-D2 | 96.0 | 32.0 | NaN | blue, silver, white | red | 33.0 | BBY | NaN | 20.0 | BBY | Carida | none | masculine | he/him | Naboo | Droid | Astromech droid | NaN | NaN | Buzz saw, Electric pike, Drinks tray, Fusion welder, Scomp link, Power recharge coupler, Rocket boosters, Holographic projector, Motorized all-terrain treads, Retractable third leg | A New Hope, The Empire Strikes Back, Return of the Jedi, The Phantom Menace, Attack of the Clones, Revenge of the Sith, The Force Awakens | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/6/6d/R2D2-Chronicles.png |
3 | Darth Vader | 202.0 | 136.0 | sandy-blond | pale | yellow | 41.0 | BBY | Tatooine | 4.0 | ABY | Death Star II | male | masculine | he/him | Tatooine | Human | Dark Lord of the Sith | Prosthetic arms and legs, life-support system | Lightsaber abilities, Force powers, Language known | Clothing, Lightsabers | A New Hope, The Empire Strikes Back, Return of the Jedi, Revenge of the Sith | NaN | TIE Advanced x1 | https://static.wikia.nocookie.net/starwars/images/a/a3/ANOVOS_Darth_Vader_1.png |
4 | Leia Organa Solo | 150.0 | 49.0 | dark brown | light | brown | 19.0 | BBY | Polis Massa | 35.0 | ABY | Ajan Kloss | female | feminine | she/her | Alderaan | Human | Princess of Alderaan | NaN | Jedi training, Force powers, Other abilities | Defender sporting blaster pistol, X-30 Lancer target blast pistol, Lightsaber | A New Hope, The Empire Strikes Back, Return of the Jedi, Revenge of the Sith, The Force Awakens | Imperial Speeder Bike | NaN | https://static.wikia.nocookie.net/starwars/images/8/89/Leia_endorpromo02.jpg |
5 | Owen Lars | 178.0 | 120.0 | brown | light | blue | 52.0 | BBY | Ator | 0.0 | BBY | Tatooine | male | masculine | he/him | Tatooine | Human | Moisture farmer | NaN | Blaster abilities | Blasters, SX-14 Field Hover-Ute, GX-8 Moisture Vaporators, Droids | A New Hope, Attack of the Clones, Revenge of the Sith | Zephyr-G swoop bike, V-35 Courier, T-16 Skyhopper | NaN | https://static.wikia.nocookie.net/starwars/images/9/91/OwenLarsHS-SWE.jpg |
6 | Beru Whitesun Lars | 165.0 | 75.0 | brown | light | blue | 47.0 | BBY | NaN | 0.0 | BBY | Tatooine | female | feminine | she/her | Tatooine | Human | Moisture farmer | NaN | NaN | Light robes | A New Hope, Attack of the Clones, Revenge of the Sith | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/7/76/Beru_headshot2.jpg |
7 | R5-D4 | 97.0 | 32.0 | NaN | white, red, blue | red | NaN | NaN | NaN | NaN | NaN | NaN | none | masculine | he/him | Tatooine | Droid | Astromech droid | NaN | NaN | Holoprojector, Rocket booster, Scomp link | A New Hope | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/2/2c/R5d4.jpg |
8 | Biggs Darklighter | 183.0 | 84.0 | black | light | brown | 24.0 | BBY | Tatooine | 0.0 | BBY | Yavin system | male | masculine | he/him | Tatooine | Human | Pilot | NaN | Piloting | NaN | A New Hope | NaN | X-wing | https://static.wikia.nocookie.net/starwars/images/0/00/BiggsHS-ANH.png |
9 | Obi-Wan Kenobi | 182.0 | 77.0 | auburn | fair | blue-gray | 57.0 | BBY | Stewjon | 0.0 | BBY | DS-1 Orbital Battle Station | male | masculine | he/him | Stewjon | Human | Jedi General | NaN | Lightsaber training, Force powers, Other abilities | Lightsabers | A New Hope, The Empire Strikes Back, Return of the Jedi, The Phantom Menace, Attack of the Clones, Revenge of the Sith | Tribubble bongo | Jedi starfighter, Trade Federation cruiser, Naboo star skiff, Jedi Interceptor, Belbullab-22 starfighter | https://static.wikia.nocookie.net/starwars/images/7/74/OWK-SWFB.png |
name | height | mass | hair_color | skin_color | eye_color | birth_year | birth_era | birth_place | death_year | death_era | death_place | sex | gender | pronoun | homeworld | species | occupation | cybernetics | abilities | equipment | films | vehicles | starships | photo | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
77 | Grievous | 216.0 | 159.0 | none | brown, white | gold | NaN | NaN | NaN | 19.0 | BBY | Utapau | male | masculine | he/him | Kalee | Kaleesh | Jedi Hunter | Completely cebernetic except for brain | Lightsaber abilities | Lightsabers | Revenge of the Sith | Tsmeu-6 personal wheel bike | Belbullab-22 starfighter | https://static.wikia.nocookie.net/starwars/images/c/ca/Grievoushead-OP.png |
78 | Tarfful | 234.0 | 136.0 | brown | brown | blue | NaN | NaN | NaN | NaN | NaN | NaN | male | masculine | he/him | Kashyyyk | Wookiee | Wookiee chieftain | NaN | NaN | Weapons | Revenge of the Sith | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/3/37/Tarfful_RotS.png |
79 | Raymus Antilles | 188.0 | 79.0 | brown | light | brown | NaN | NaN | NaN | 0.0 | BBY | Tantive IV | male | masculine | he/him | Alderaan | Human | Captain of the CR90 corvette | NaN | Piloting | NaN | A New Hope, Revenge of the Sith | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/8/82/RaymusAntilles-FFp46.png |
80 | Sly Moore | 178.0 | 48.0 | none | pale | white | NaN | NaN | NaN | 18.0 | BBY | NaN | female | feminine | she/her | Umbara | Umbaran | Personal aide | NaN | Politics, Force powers | NaN | Attack of the Clones, Revenge of the Sith | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/b/b7/SlyMooreStare-OP.png |
81 | Tion Medon | 206.0 | 80.0 | none | gray, red | black | NaN | NaN | NaN | NaN | NaN | NaN | male | masculine | he/him | Utapau | Pau'an | Port Administrator | NaN | Politics | Clothing | Revenge of the Sith | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/c/c0/TionMedon-SS.png |
82 | Finn | 178.0 | 73.0 | black | dark | brown | 11.0 | ABY | NaN | NaN | NaN | NaN | male | masculine | he/him | NaN | Human | General in the Rebel Alliance | NaN | NaN | NaN | The Force Awakens | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/1/1a/Finn-TSWB.png |
83 | Rey Skywalker | 170.0 | 54.0 | brown | light | hazel | 15.0 | ABY | Hyperkarn | 35.0 | ABY | Exegol | female | feminine | she/her | Jakku | Human | Jedi | NaN | Lightsaber abilities, Force powers, Force lightning | Hellhound two, Vehicles, Tools and wapons, Lightsabers | The Force Awakens | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/2/2b/Rey_TROS_Fathead.png |
84 | Poe Dameron | 172.0 | 80.0 | dark brown | tan | brown | 2.0 | ABY | Yavin 4 | NaN | NaN | NaN | male | masculine | he/him | Yavin 4 | Human | General in the Rebel Alliance | NaN | Piloting | Clothing | The Force Awakens | NaN | X-wing | https://static.wikia.nocookie.net/starwars/images/6/6b/PoeDameron-Heroes2023.png |
85 | BB-8 | 67.0 | 18.0 | none | white, orange | red | 29.0 | ABY | NaN | NaN | NaN | NaN | none | masculine | he/him | Hosnian Prime | Droid | Astromech droid | NaN | NaN | Grappling spike launcher, Welding torch, Holoprojector, Arc welder | The Force Awakens | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/6/68/BB8-Fathead.png |
86 | Captain Phasma | 200.0 | 76.0 | gold | light | blue | 6.0 | ABY | Parnassos | 34.0 | ABY | Crait system | female | feminine | she/her | Parnassos | Human | Stormtrooper commander | NaN | Shooting | Rust-read war mask, Armor coated, Weapons, Blasters | The Force Awakens | NaN | NaN | https://static.wikia.nocookie.net/starwars/images/0/02/Phasma.png |