David MacKay
.






Search :

.

English and German letter frequencies
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z -
ENGLISH (e) 0.07 0.01 0.03 0.03 0.1 0.02 0.02 0.05 0.06 0.001 0.006 0.03 0.02 0.06 0.06 0.02 0.0009 0.05 0.05 0.08 0.02 0.008 0.02 0.002 0.01 0.0008 0.17
GERMAN (g) 0.06 0.02 0.03 0.04 0.15 0.01 0.03 0.04 0.07 0.002 0.01 0.03 0.02 0.08 0.02 0.007 0.0002 0.06 0.06 0.05 0.04 0.006 0.02 0.0003 0.0003 0.01 0.14

The entropies of these two distributions are H(e) = 4.1 bits; H(g) = 4.1 bits; and the relative entropies between them are D_KL( e || g) = 0.16 bits and D_KL( g || e) = 0.12 bits. The relative entropies between the uniform distribution u and the English distribution e are D_KL( e || u) ~= 0.6 bits and D_KL( u || e) ~= 1 bits.


Site last modified Sat Sep 16 17:35:51 BST 2006