asrman: March 2017

Sunday, March 19, 2017

Speech signal processing technology for smart devices to achieve multilingual speech translation service


November 17, 2015

Hitachi today announced that it has developed a speech signal processing technology for smart devices that improves multilingual speech translation services. By removing background noise while preserving the speaker's voice, the technology enables speech recognition in noisy urban street environments with a noise level of 70 dB. In addition, automatic detection of speech intervals improves usability: the system recognizes when the user is speaking without requiring a button press to mark each interval. The technology is expected to contribute to the commercialization of multilingual speech translation services at service counters in stores and at information centers in public transportation systems.

With the growing popularity of Japan as a travel destination, the number of foreign tourists has been increasing every year. Consequently, demand is rising for multilingual speech translation services that let foreign tourists and service counter clerks communicate effectively, without a language barrier, in public transportation facilities and shopping centers.

However, in a crowded and noisy environment such as a public transportation facility or shopping center, recognizing the speaker's voice for translation is challenging because the microphone also records the background noise. To improve noise reduction, Hitachi has been developing noise reduction technology for special-purpose devices that use multiple microphones. A further issue with conventional multilingual speech translation services is that users must press a button to translate each phrase of their conversation. This is inconvenient for users, who are often carrying bags when they visit a service counter for information or services.

Building on speech signal processing technology it has cultivated over many years, Hitachi has developed a speech signal processing technology for general-purpose smart devices rather than special-purpose devices. The newly developed technology achieves multilingual speech translation on smart devices in crowded environments such as public transportation areas and shopping centers. It also recognizes speech intervals automatically and accurately, so users do not need to press a button to mark speech timing for translation.

The following are the features of the developed speech signal processing technology.

1. Noise reduction utilizing microphone inputs of multiple smart devices

In conventional multi-microphone noise reduction on special-purpose devices, noise is reduced by using the time differences among the microphones. Specifically, the speaker's voice is picked up first by the microphone closest to the speaker and then by the other microphones. The processing identifies the direction of the target speech source and removes noise arriving from other directions. This approach is difficult to apply to smart devices available on the market, because slight differences among the devices cause small gaps in recording timing. To solve this problem, the developed technology first separates the target voice from the background noise using differences in sound energy, which are less affected by timing gaps. Then, by comparing the sound sources and correcting the timing gaps in the noise signals, it achieves high-accuracy, time-difference-based noise reduction equivalent to that of special-purpose devices.
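The announcement does not describe Hitachi's algorithm in detail, so the following is only a generic sketch of the two ideas named above, under stated assumptions: comparing per-frame energy between two devices to find frames dominated by the nearby speaker, and estimating a recording-timing gap by cross-correlation so the signals can be aligned. All function names, the frame length, and the energy ratio are hypothetical choices for illustration.

```python
def frame_energies(signal, frame_len):
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(x * x for x in signal[i:i + frame_len]) / frame_len
        for i in range(0, len(signal) - frame_len + 1, frame_len)
    ]


def near_device_frames(near, far, frame_len, ratio=2.0):
    """Flag frames where the near device is more than `ratio` times
    as energetic as the far device (likely the nearby speaker)."""
    e_near = frame_energies(near, frame_len)
    e_far = frame_energies(far, frame_len)
    return [en > ratio * ef for en, ef in zip(e_near, e_far)]


def estimate_lag(ref, other, max_lag):
    """Return the lag in samples that maximizes the cross-correlation.

    A positive result means `other` is delayed relative to `ref`.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, r in enumerate(ref):
            j = i + lag
            if 0 <= j < len(other):
                score += r * other[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def align(other, lag):
    """Shift `other` by `lag` samples to line it up with the reference,
    padding with zeros where no sample is available."""
    n = len(other)
    return [other[i + lag] if 0 <= i + lag < n else 0.0 for i in range(n)]
```

The energy comparison works on whole frames, so a timing gap of a few samples barely changes it, whereas the cross-correlation step needs sample-level alignment. That is one plausible reason to run the energy-based separation first and the timing correction second.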

2. Decreasing the time for speech input

The newly developed technology reduces noise and enhances the user's voice, which enables accurate automatic recognition of speech intervals. As a result, there is no need to press a button to mark speech intervals. Because the intervals are detected accurately, the technology also shortens input time and supports continuous input, translating each phrase as it is spoken, much like a live chat.
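As a rough illustration of button-free interval detection, the sketch below marks speech intervals from per-frame energy values using a fixed threshold and a short "hangover" of quiet frames. This is a common textbook approach and an assumption on our part, not the method described in the announcement; the function name, threshold, and `min_silence` parameter are hypothetical.

```python
def detect_intervals(energies, threshold, min_silence=2):
    """Return (start, end) frame-index pairs, end exclusive, for speech runs.

    A frame counts as speech when its energy is at least `threshold`.
    A run is closed only after `min_silence` consecutive quiet frames,
    so brief pauses inside a phrase do not split it.
    """
    intervals = []
    start = None   # index of the first frame of the current run
    quiet = 0      # consecutive quiet frames seen inside the current run
    for i, e in enumerate(energies):
        if e >= threshold:
            if start is None:
                start = i
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= min_silence:
                intervals.append((start, i - quiet + 1))
                start = None
                quiet = 0
    if start is not None:
        # Close a run that is still open at the end of the input.
        intervals.append((start, len(energies) - quiet))
    return intervals
```

Each closed interval could be sent to the translation service as soon as it ends, which matches the phrase-by-phrase behavior described above. A cleaner input signal after noise reduction makes a simple threshold like this far more reliable.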

The newly developed technology performs speech processing and translation in the cloud. Users can therefore use the system simply by installing a dedicated application on their existing smart devices.

To confirm the performance of the technology, we built a prototype system using a multilingual speech translation engine developed by the National Institute of Information and Communications Technology and two general-purpose smart devices, and carried out a validation experiment. The experiment confirmed that the developed technology can translate speech in a noisy urban street environment with a noise level of 70 dB.

Hitachi will continue developing this technology for practical applications and aims to contribute to high-quality hospitality services in Japan, where many foreign visitors are expected.

During heavy traffic, is it more efficient to aggressively switch lanes or to remain in one lane only?

https://www.quora.com/During-heavy-traffic-is-it-more-efficient-to-aggressively-switch-lanes-or-to-remain-in-one-lane-only/answer/Mark-Werner-4


Assuming each lane starts and stops seemingly at random, you'd think someone trying to weave through traffic couldn't predict the change in flow well enough to make a good decision. It seems like someone trying to cheat their way through ends up either right back where they started or even further behind.

I’m a police officer and also a longtime observer of how people drive and the psychology behind that.

An anecdote….

I went to a doctor’s appointment and got out about 5pm… Full-on rush hour. I was proceeding down the major north-south highway in our area… wall-to-wall traffic with frequent slowings and stoppings.

Most folks were content to simply sit it out. That’s what I do. However, I became aware of one juicy young lad in a sporty car who just couldn’t stand it. In the 10 miles or so of my commute… This fellow must have changed lanes 50 times… Over here, over there… (Never signalling, of course)… This lane would move a little faster so he’d jump in, and then out again as that lane slowed.

Over the half-hour or so I was on the highway…This guy passed me at least 3 times. When I got off the highway, he was still within 100 feet of me…. Having gained absolutely nothing.

Watch people! Unless they are dead lucky, this foolish tactic gains nothing whatever, and greatly increases both stress levels and the likelihood of being involved in an accident.

I’ve talked to people who do this…. (My wife’s sister is one..) They are what I call “emotional drivers”. They do not think about what they’re doing. They do not plan ahead, and have little regard for safety or the law. The usual statement is… “I just wanna GO!”

Childish.


Saturday, March 18, 2017

Baidu Deep Voice authors and Bengio's team discuss five technical details: how far away is end-to-end speech synthesis?


How to Use Python 'SimpleHTTPServer' to Create Webserver or Serve Files Instantly


http://www.tecmint.com/python-simplehttpserver-to-create-webserver-or-serve-files-instantly/

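This is really just a small trick, noted here for reference.

Lately I have been working on Windows, running Linux in a virtual machine and connecting to it over SSH with PuTTY to do development.

I then found that transferring files between the two systems was a bit of a hassle. I had planned to set up an FTP server or write an HTTP server, but then discovered that SimpleHTTPServer is very handy.

Just run python -m SimpleHTTPServer 8000 to start an HTTP server that serves the current directory.

From other machines, you can then open this folder in a web browser, which makes sharing files much easier.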

pushd /your/Other/path; python -m SimpleHTTPServer 9999; popd;
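The SimpleHTTPServer module exists only in Python 2. In Python 3 the same handler lives in http.server, so the one-liner becomes python3 -m http.server 8000. Below is a minimal sketch of the programmatic equivalent, assuming Python 3.7 or later (needed for the `directory` argument and `ThreadingHTTPServer`); `make_server` is a hypothetical helper name, not part of the standard library.

```python
import functools
import http.server


def make_server(directory, port=8000, host=""):
    """Build an HTTP server that serves files from `directory`.

    `make_server` is an illustrative helper, not a standard-library function.
    Passing port=0 lets the operating system pick a free port.
    """
    handler = functools.partial(
        http.server.SimpleHTTPRequestHandler, directory=directory
    )
    return http.server.ThreadingHTTPServer((host, port), handler)


# Usage (blocks until interrupted with Ctrl+C):
#   make_server("/your/Other/path", 9999).serve_forever()
```

Passing the directory to the handler avoids the pushd/popd dance, since the server no longer depends on the current working directory. Note that the server has no authentication, so it is best kept to a trusted network such as a local virtual machine setup.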

