310 likes | 444 Views
ABI speech corpus. Shona D’Arcy, Martin Russell Electronic, Electrical and Computer Engineering University of Birmingham. Motivation. Wide variation in pronunciation of English in the British Isles Perceived as problem for speech technologies No systematic study published to date
E N D
ABI speech corpus Shona D’Arcy, Martin Russell Electronic, Electrical and Computer Engineering University of Birmingham
Motivation • Wide variation in pronunciation of English in the British Isles • Perceived as problem for speech technologies • No systematic study published to date • Need for corpus of speech representing different accents
Variety of Accents • British isles • Northern England • Southern England • Wales • Scotland • Northern Ireland • Southern Ireland
Corpus design • What is an accent? • What accents are required for corpus? • List of 13 accents to be recorded • Standard Southern english
London West County
London West County
Ireland London West County
East Anglia Ireland London West County
East Anglia Ireland Birmingham London West County
Liverpool East Anglia Ireland Birmingham London West County
N Ireland Liverpool East Anglia Ireland Birmingham London West County
N Ireland Lancashire Liverpool East Anglia Ireland Birmingham London West County
N Ireland Yorkshire Lancashire Liverpool East Anglia Ireland Birmingham London West County
N Ireland Newcastle Yorkshire Lancashire Liverpool East Anglia Ireland Birmingham London West County
Glasgow N Ireland Newcastle Yorkshire Lancashire Liverpool East Anglia Ireland Birmingham London West County
Scottish Highlands Glasgow N Ireland Newcastle Yorkshire Lancashire Liverpool East Anglia Ireland Birmingham London West County
Corpus design • Who are good subjects? • People who had lived in the area most of their lives • People who’s parents had lived there most of their lives • Between the ages of 18 and 50 • 10 male and 10 female
Text to be recorded • Accent specific texts • “when a sailor in a small craft faces the might of the vast Atlantic ocean today” • Scribe sentences • Where were you while we were away • Vowels in control contexts, • Had, who’d, Hudd (to rhyme with bud), hid, heard
Texts to recorded • Standard prompts for speech technologies • Digits triples • Letters, phonetic alphabet • GYP, Golf, Yankee, Polo • Equipment specific commands • Climate control 71 degrees
Preparation • Recording software • List of prompts • Venue selection • Chose town • Library/community centre
Hardware • Laptop with external soundcard, (Edirol) • Emkay head mounted microphone • Generic desk mounted microphone
Strategy for subjects • Press releases • Local media, radio (~25) TV(~7) & newspapers (~40) • Interviews • Free phone • Subject selection • Subject criteria • Based on audio
Statistics • 14 locations • 20 people per location, between 16 and 70 • 95 hours of recordings • Annotated at phrase level
Lessons learnt • Identifying accents • Choosing a town likely to have appropriate accent and produce enough candidates • Choice of recording location
Rural settings • Association of ‘accent’ with ‘place’ not reliable • Expectation of situation • Less migration, hence less diluting of accents • Smaller variations over large area • Findings • Young people more likely to adopt other accents • Mainly older people had ‘good’ accent
Urban settings • Most cities homogeneous accent • Liverpool, Glasgow • Huge variety of accents in London
Choice of location • Library • Relatively consistent noise level • Library users available to recruit • Other (e.g. community centre, universities) • More variation in noise • Older people
Press release + free-phone • Mostly successful (would do it again) • Coverage approximately equal in each location • Variable response • When successful • Many volunteers with good accents within our age range • When unsuccessful • Not enough volunteers • Had to ‘recruit from the street’
Future recordings • Fill in holes existing in current database • Incorporating lessons learnt • Record accents not covered in this corpus