999精品在线视频,手机成人午夜在线视频,久久不卡国产精品无码,中日无码在线观看,成人av手机在线观看,日韩精品亚洲一区中文字幕,亚洲av无码人妻,四虎国产在线观看 ?

Barge Database

2014-10-27 23:46:49ByWuJiang
KNOWLEDGE IS POWER 2014年10期

By+Wu+Jiang

The last week of September, 2014 saw the official listing of Alibaba( A Chinas giant company) in the New York Stock Exchange(NYSE:BABA), which is the first Initial public offerings( IPO) and the largest scale in history, also marks the Interent evolves into a new era---a big data era that belongs to Chinese domestic internet enterprises.

The past and present big data

Big data or mass data refers to the data size is so large that it can not be extracted, managed, handled and processed as the information that can be interpreted by human beings with a proper range of time. Under the same condition, compared with those independent small-scale dataset which could analyzed data individually, more additional information and relational data base will be obtained if the analysis is based on the grouping of each small data. Such approach can be applied to forecast the commercial trend, judge the quality of research, avoid the widespread of disease, fight against crimes or predict real-time traffic and others.

Though far away from our daily life, big data has close ties with our daily life in deed. For example, Douban Music( a name of a Chinese social network) can infer which song is most liked by a certain user after its analysis of behaviour of user population, even users favorite movie can also be inffered. Through confluence analysis of sales data of its retail stores, Adidas can exactly know the consumers preference over their products in different regional culture so as to make a more resonable strategy of inventory stocking up in a smarter way. A love and marrige website in China is trying to introduce a system that can identify facial resemblance, the company is able to conclude which facial form is most enjoyed by its users on the basis of used information, then they can provide such popular service among its users. Taobao(the biggest C2C shopping website in Chinas mainland) can predict the possible goods that each consumer is interested in, thereout, individualized recommendation targeted to each user can be produced, this is what most people often see in the side bar of it commodity recommendation. Through the analysis of the information of classified commodities by large database model, Taobao is able to answer some interesting questions which are hard to most people, such as what is the favourite color of the T-shirt for the age group of 18 , or what is the difference between the people living in South and North China when it comes to preference of sports beverage?

The simple analysis of user behaviour will not produce too much value, while if the analysis is based on a quite large scale, then we can obtain valuable prediction from its performing trend, the decision-making in business in particular. In the past, take the well-known NongFu Spring (A Chinese enterprise of drinking water production) for example, if the company wants to get such market data to help them to make decisions as how to pile up can promote its sales? The people of which age group can spend most time in front of the pile? What is their purchasing volume each time? What changes of purchasing behaviour might take place for the change of temperature? How its competitors new packing influence its own sales? Though seem easy, these questions are hard to get convincing answers.

To answer the above questions, a lot of data needs to be collected. The salesmen from NongFu Spring have to come to local supermarkets to take ten pictures every day: the piling of the bottles, the change of their location, the height of the bottle piling and so on. Every day they have to cover 15 places for investigation and survey, and upload 150 pictures, producing data size about 10M which is not a large figure. While there are 10,000 salesmen across China, that means the data size is 100G, 3TB each month. Though these data seem easy, but without the support of relevant technology concerning about big data, such analysis could not be obtained.

There is one in Google had pointed out:” what really matters is not what we can do, but what is the right size can we do.”

It only needs several pieces of paper and a pen if you can just analyze 100 lines of data every day. But if you want to analyze 100,000 lines of data, according to the processing capacity of modern computer, you just need a computer and design programme. But if the data size has reached 1000000000 lines(1TB), even a powerful server station will satisfy your need, especially when you want a real-time or close to real-time processing speed. Thus, the field of computer and numerical calculation witnesses the occurrence of a trend—distributed computing which is a science requires a system by the connecting of cluster of computers through network and then engineering data that needs massive calculation will be divided into small computing areas, then the data will be processed by each computer of the network, after uploading the calculating results which will be combined to arrive at a final data conclusion. But in order to make full use of distributed computing, we have to solve such problems as how to divide the data? How can we achieve a balanced processing of the operating load of each computer? How to combine each result into a final data efficiently? Many computing models and concepts have been designed for the purpose of solving these problems from the hardware and software of computers. Some of the most representative are cloud computing, MapReduce (Handoop) , virtualization and others. While this might only be the beginning of the computing tide. Just like Jack Ma had said:” we are moving from an era of information science and technology to an era of data science and technology.”

Mass data and

the new occupations

of the Internet

To do well in mass data, the first thing of vital importance is to get massive valuable data, which is an advantage that most native Chinese Internet enterprises have. China has a large population, dynamic economy, millions of internet users, the abundance of users behavor data is directly decided by the abundance of user data resources. Taobao has 300 million registered users and Tencents registered users has already exceeded 1 billion. All the user data is absolutely a goldmine.

A new generation technology is bound to bring up full demand of technicians of a new generation. In an era of big data, data scientist and data engineer have been one of the hottest occupation in Silicon Valley. Comparing to the traditional software engineer, data scientist is a group of researchers who stand between mathematics(statitics) and computer science, their job includes both software design and development and data modelling and statistic analysis, meantime, they are able to turn data processing model into feasible software solutions. So the native Chinese internet enterprises also attach great importance to the reservation of talents in the field of data science, in the foreseeable future, practitioners of data science must be very popular in the job market.

主站蜘蛛池模板: 456亚洲人成高清在线| 国产精品综合久久久| 国产成人高清亚洲一区久久| 国产一区二区丝袜高跟鞋| 国产精品成| 67194亚洲无码| 欧美无专区| 国产91丝袜| 无码专区在线观看| 欧美日韩一区二区在线播放| a亚洲视频| 国产高清免费午夜在线视频| 国产激爽爽爽大片在线观看| 欧美一区二区丝袜高跟鞋| 中文字幕在线日韩91| 国产又大又粗又猛又爽的视频| 国产精品成人一区二区| 精品福利国产| 亚洲高清无在码在线无弹窗| 欧美亚洲一区二区三区在线| 日韩美毛片| 亚洲五月激情网| 亚洲男女天堂| 欧美a在线看| 日本a级免费| 国产av一码二码三码无码| 日日噜噜夜夜狠狠视频| 亚洲开心婷婷中文字幕| 91久久青青草原精品国产| 狠狠色综合久久狠狠色综合| 九九香蕉视频| 亚洲综合九九| 91色综合综合热五月激情| 秋霞国产在线| 亚洲中文字幕23页在线| 91黄视频在线观看| 蜜芽国产尤物av尤物在线看| 成人字幕网视频在线观看| 亚洲无码高清视频在线观看| 超碰aⅴ人人做人人爽欧美| 国产亚洲精品无码专| 91尤物国产尤物福利在线| 久久精品国产精品一区二区| 四虎国产精品永久一区| 亚洲最新在线| 秋霞一区二区三区| 伊人成人在线视频| 日本欧美视频在线观看| 国禁国产you女视频网站| 久久精品丝袜| 91年精品国产福利线观看久久| 麻豆精品在线| 欧美在线国产| 日本在线国产| 97成人在线观看| 亚洲资源在线视频| 亚洲精品第一页不卡| 她的性爱视频| 97视频精品全国在线观看| 亚洲va精品中文字幕| 亚洲成a人在线观看| 免费无遮挡AV| 99视频在线免费| 亚洲va在线∨a天堂va欧美va| 国产精品福利社| 91色综合综合热五月激情| 欧美日韩精品一区二区在线线| 91精选国产大片| 伊人AV天堂| 久久久国产精品无码专区| 日韩AV无码一区| 免费在线看黄网址| 国产区人妖精品人妖精品视频| 91丝袜美腿高跟国产极品老师| 91热爆在线| 日本影院一区| 欧美啪啪网| 亚洲成人免费在线| 久久不卡精品| 亚洲V日韩V无码一区二区| 久无码久无码av无码| 中文字幕色站|