Further to my last post and various requests, I’ve published the complete list of languages detected within the whole collection of geolocated tweets in London.
The list gives the full count for each language, ranked by count (excluding Tagalog), as well as the number of detections classed as ‘Unknown’, probably because the tweet was too short, or too colloquial, for the detector to work out which language it was written in.
You can find that full list here.