
Your voice is my command

It has a long way to go, but speech recognition is finally beginning to work.

By: PRASANTO K ROY
Updated on: Jan 16 2010, 20:07 IST

'Hi, I'm Simon! How may I help you today?'
'Er… we're calling about a lost bag on the United flight from London to New Orleans…'

'I'm sorry. I didn't understand that. Are you calling about a reservation? If so, please say YES.'
'No.'

'Simon is happy to help. Are you calling about lost baggage?'
'YES.'

'I understand you are calling about lost baggage. Please tell us where you are.'
'New Orleans, Louisiana.'

'I'm sorry, I didn't understand that. Please say the state clearly...'

This would have gone on forever, but after three such calls we had finally figured out how to beat the system. We would switch to loud, abusive Hindi, and after three failed attempts, the United Airlines toll-free voice-response would give up and transfer us to a human being who would, amazingly, be sitting on the other side of the planet, back home in Gurgaon, and who would actually understand us and help out!

That was 2000 AD. Fast forward to 2009.

'Engage Autopilot on Mach One point five, altitude six thousand,' said Flt Lt Mathews. 'Roger, autopilot on Mach One...,' the cockpit voice response system repeated, and the RAF Eurofighter Typhoon climbed up and then levelled off six kilometres above sea level, cruising at one and a half times the speed of sound.

'Caperberry Bangalore,' I said, touching a button on my phone. Six seconds later, a name, address and location map popped up on my phone screen, along with the number 080-2559-4567. One click, and it dialled the number. A half-minute down, I'd booked a table for two at this tony downtown tapas lounge that I hadn't heard of till that day.

We were both doing the same thing. Okay, there were a few minor differences. The RAF fighter-pilot was talking to a $90 million delta-wing multi-role aircraft, and I was in a Meru cab talking to a piece of free software from Google in a 15,000 mobile phone handset. But we were both using voice recognition that actually worked - and which rapidly gave us usable information, or let us command a system that responded.

LESS THAN MAINSTREAM

Speech recognition has quite a few real-world uses today, where it works well. These are mostly situations with a small vocabulary of very well defined, structured speech, such as in the Eurofighter example (though very few aircraft have as yet implemented voice-assist for flight operations). Another area is training for jobs where clear speech and response are a key part of operations, such as air traffic control, where a simulator removes the need for a full-time 'actor' to converse with every trainee. Speech recognition and synthesis play a key role here.

It might seem odd that the most obvious mainstream applications are exactly where speech recognition hasn't taken off yet. Take dictation and transcription.

You'd think one could simply speak to a computer, and it would transcribe whatever you said into a neat little document, so that you didn't ever have to type anything in. In practice, speech recognition, even two decades after it was born, hasn't evolved to the stage where normal (and widely varying) speech from different people with different accents can be reliably transcribed.

In fact, you hear the term 'voice recognition' more often, where the recognition system is trained to a particular speaker - such as in most PC voice recognition software. So it works best with a particular voice, a speaker who trains the system, just as with the Eurofighter that's trained by a particular pilot. Speech recognition is broader, and describes systems that can recognise any speech - such as a call centre system. These have improved a lot since my Simon experience 10 years ago, but are still limited in their vocabulary and fussy about pronunciation.

Voice synthesis, on the other hand, is thoroughly mainstream. Lots of systems convert text to speech, including, probably, your mobile phone (many have an option to read SMSes and menus aloud). You probably use Acrobat Reader to view PDF files; click on View and Read Out Loud to have the document read to you in a moronic monotone. The Kindle reads out the text of books you load onto it.

My favourite voice-command system is Google's Mobile App (get it from m.google.com on your data-enabled phone). It lets you speak your search terms, and recognises them rather well if you speak clearly and don't have much background noise. It also integrates Gmail, Maps, Picasa and more into one simple application on your home screen.

The author is chief editor and green evangelist at CyberMedia, publisher of 15 specialty titles and sites such as LD2.in. pkr@cybermedia.co.in, twitter.com/prasanto

First Published Date: 16 Jan, 11:58 IST
