Blog Archive

Thursday, April 30, 2015

Saturday, April 25, 2015

关于CS,一些你不知道的东西(附加一些东西让准备转专业的人看 - 未名空间(mitbbs.com)

关于CS,一些你不知道的东西(附加一些东西让准备转专业的人看 - 未名空间(mitbbs.com): "为了H1?H1最多只有6年,满了你还是得滚蛋。如果你不喜欢CS,这条路真心没必要。
为了绿卡?之前我说过了CS拿绿卡的难度,你不要拿着“某几个"人可能有天赋转了来
忽悠所有人转。经"



'via Blog this'

Thursday, April 16, 2015

userguide LIA_SpkDe



http://mistral.univ-avignon.fr/doc/userguide_LIA_SpkDet.002.pdf

Monday, April 13, 2015

Writing shell scripts - Lesson 13: Positional Parameters

http://linuxcommand.org/wss0130.php



As we discussed before, many programs, particularly ones from the GNU Project, support both short and long command line options. For example, to display a help message for many of these programs, you may use either the "-h" option or the longer "--help" option. Long option names are typically preceded by a double dash. We will adopt this convention for our scripts.
Here is the code we will use to process our command line:
interactive=
filename=~/system_page.html

while [ "$1" != "" ]; do
    case $1 in
        -f | --file )           shift
                                filename=$1
                                ;;
        -i | --interactive )    interactive=1
                                ;;
        -h | --help )           usage
                                exit
                                ;;
        * )                     usage
                                exit 1
    esac
    shift
done

Saturday, April 11, 2015

大数据在工业界流行的黑暗真相 - 未名空间(mitbbs.com)

大数据在工业界流行的黑暗真相 - 未名空间(mitbbs.com):

这两天在faculty版上灌水纠结大数据概念,让我突然想明白了大数据在工业界兴起的
原因:
很多规律在数据量大了以后都会变得很显然,算法再傻B,在超大量数据作用下效果也
会变得很好。因为开发牛B算法开销很大风险很高,而获取数据对不少公司而言很容易。
所以会出现一个大数据潮流,大家都指着数据变成救命法宝。
比如大数据的源头Google,Pagerank其实就是个幌子,真正优化Google搜索结果的其实
是人民群众。结果被点得多了排名就上去了。算法够傻B吧。还有Google翻译也用到了
大数据技术。算法是否傻B是和系统复杂度相对的。比如linear regression,数据量小用
atlab三五行就写出来了。数据量大了就得上Hadoop, Spark,支持运算的平台比真正的
算法要复杂得多得多。

但真相果真如此吗?做过machine learning的人都知道,数据量增大对于结果改善的边
际效应是递减的,而且往往是指数递减的。需要获得10个百分点的提高,可能需要成千
上万,甚至亿万倍的数据量。真是有点愚公移山的感觉了。

大数据的驱动力背后还有更黑暗的真相:提高barrier to entry。这世上聪明人很多。
工业界忽悠算法其实是最危险的。因为算法的发明往往是单枪匹马干的。如果一个大公
司建立在一个算法的基础之上,岂不是有被但强匹马干掉的危险?特别是学术界虽然不
济,几百年才出一个牛顿爱因斯坦这样的人物,但方法论一直是正确的,难保不出一两
个真正管用的新算法。现在告诉你,你没个一千台机器就没法做研究,把智力门槛转换
成资本门槛,并且搞
一个虚无飘渺的概念,把funding标准从科研水准转换成忽悠水准。以此打击真正的科
学研究,试图多维持几年大公司的盈利。其用心不可不谓险恶。

此计一出,大合那些脑子已经使不动了的学霸的心意,于是大数据一夜之间就火了。



'via Blog this'

Tuesday, April 7, 2015

CHANGE HOSTNAME PERMANENTLY USING THE COMMAND LINE

Ubuntu – Change Hostname Permanently Using the Command Line | Michael Lane's Blog: "#!/bin/bash
#Assign existing hostname to $hostn
hostn=$(cat /etc/hostname)

#Display existing hostname
echo "Existing hostname is $hostn"

#Ask for new hostname $newhost
echo "Enter new hostname: "
read newhost

#change hostname in /etc/hosts & /etc/hostname
sudo sed -i "s/$hostn/$newhost/g" /etc/hosts
sudo sed -i "s/$hostn/$newhost/g" /etc/hostname

#display new hostname
echo "Your new hostname is $newhost"

#Press a key to reboot
read -s -n 1 -p "Press any key to reboot"
sudo reboot"



https://pricklytech.wordpress.com/2013/04/24/ubuntu-change-hostname-permanently-using-the-command-line/



'via Blog this'

How to install Teamviewer on 14.04?

Reference:

http://askubuntu.com/questions/453157/how-to-install-teamviewer-on-14-04





 First, install gdebi

sudo dpkg --add-architecture i386; 

sudo apt-get update; 

sudo apt-get install gdebi

sudo apt-get -f install

 In the same directory you downloaded the .deb file just run: 

sudo gdebi teamviewer_linux.deb #replace teamviewer_linux.deb  with your downloaded deb




Option 2:

  1. Download the official image from http://www.teamviewer.com/en/download/linux.aspx (always get 32bit)     http://download.teamviewer.com/download/teamviewer_i386.deb   [May 4,2015]
  2. Open a terminal
  3. Run command cd ~/Downloads/
  4. Run command sudo dpkg -i teamviewer_i386.deb
  5. Follow whatever steps it needs you to take.
Note:

In case you encounter any error and depends on your linux machine, you may need install 
libjpeg62.
libsm6

Option 3:
Install Xmanager
'via Blog this'

Friday, April 3, 2015

LDL Cholesterol: The Test

LDL Cholesterol: The Test: "LDL Cholesterol"



http://labtestsonline.org/understanding/analytes/ldl/tab/test/



https://www.pritikin.com/your-health/health-benefits/lower-cholesterol/1468-7-tips-for-improving-your-ldl-cholesterol.html#.VR9fbObF_9k





The Test



'via Blog this'

Speech Signal Processing Toolkit (SPTK)

Speech Signal Processing Toolkit (SPTK):



'via Blog this'

Wednesday, April 1, 2015

GMM on Hadoop

Source:

http://www.eie.polyu.edu.hk/~mwmak/Download.htm



1. GMM on Hadoop

This is a set of Java programs that I developed recently for teaching the subject "Distributed Systems and Cloud Computing". The package contains both sequential and parallel version of the EM algorithm for training Gaussian mixture models (GMM). The parallel version is designed to run on Hadoop clusters. For the source code, visithttps://github.com/enmwmak/GMM-on-Hadoop

'via Blog this'