
Barge Database

2014-10-27 23:46:49 By Wu Jiang
KNOWLEDGE IS POWER, 2014, Issue 10


The last week of September 2014 saw the official listing of Alibaba (a Chinese Internet giant) on the New York Stock Exchange (NYSE: BABA). It was the largest initial public offering (IPO) in history, and it also marked the Internet's entry into a new era: a big data era that belongs to China's domestic Internet enterprises.

The past and present of big data

Big data, or mass data, refers to data sets so large that they cannot be captured, managed, and processed into information that humans can interpret within a reasonable time. For the same total amount of data, analyzing many small data sets together yields more additional information, and more relationships between the data, than analyzing each small data set on its own. Such analysis can be applied to forecast business trends, judge the quality of research, prevent the spread of disease, fight crime, or predict real-time traffic conditions, among other uses.

Though it may seem far removed from daily life, big data is in fact closely tied to it. For example, Douban Music (a Chinese social networking service) can infer which songs a particular user likes best by analyzing the behaviour of its user population, and can even infer a user's favourite movies. By pooling and analyzing sales data from its retail stores, Adidas can learn exactly which products consumers prefer in different regional cultures, and so plan its inventory more sensibly. A dating and marriage website in China is trying to introduce a system that can identify facial resemblance; on the basis of its users' data, the company can work out which facial features its users like most and then offer services to match. Taobao (the biggest C2C shopping website in mainland China) can predict which goods each consumer is likely to be interested in and generate personalized recommendations for each user; this is what most people see in its sidebar of recommended commodities. By analyzing classified commodity information with large database models, Taobao can answer some interesting questions that most people would find hard, such as: What is the favourite T-shirt colour of 18-year-olds? How do people in South and North China differ in their preferences for sports beverages?

Analyzing the behaviour of a few users produces little value, but when the analysis is carried out at a very large scale, trends emerge from which valuable predictions can be made, particularly for business decision-making. Take the well-known NongFu Spring (a Chinese drinking water producer) as an example. In the past, the company wanted market data to help it answer questions such as: How should the bottles be stacked to promote sales? Which age group spends the most time in front of the display? How much do they buy each time? How does purchasing behaviour change with temperature? How does a competitor's new packaging affect the company's own sales? Simple as they seem, these questions are hard to answer convincingly.

To answer these questions, a lot of data needs to be collected. NongFu Spring's salesmen have to visit local supermarkets and take ten pictures at each one: how the bottles are stacked, how their position has changed, how high the stacks are, and so on. Every day each salesman has to cover 15 locations and upload 150 pictures, producing about 10 MB of data, which is not a large figure. But there are 10,000 salesmen across China, which means about 100 GB of data each day and 3 TB each month. Simple as these data seem, they could not be analyzed without the support of big data technology.
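The figures above can be checked with a short back-of-the-envelope calculation. The sketch below is only an illustration: the function names are made up for this example, a month is assumed to have 30 days, and decimal units are assumed (1 GB = 1,000 MB, 1 TB = 1,000 GB).

```python
# Figures quoted in the article.
PHOTOS_PER_LOCATION = 10          # pictures taken at each supermarket
LOCATIONS_PER_DAY = 15            # locations one salesman covers per day
MB_PER_SALESMAN_PER_DAY = 10      # approximate upload size per salesman
SALESMEN = 10_000                 # salesmen across China

# Assumption for this sketch, not stated in the article.
DAYS_PER_MONTH = 30


def photos_per_salesman_per_day():
    """Number of pictures one salesman uploads each day."""
    return PHOTOS_PER_LOCATION * LOCATIONS_PER_DAY


def total_gb_per_day():
    """Data uploaded by all salesmen in one day, in GB (1 GB = 1,000 MB)."""
    return MB_PER_SALESMAN_PER_DAY * SALESMEN / 1000


def total_tb_per_month():
    """Data uploaded by all salesmen in one month, in TB (1 TB = 1,000 GB)."""
    return total_gb_per_day() * DAYS_PER_MONTH / 1000


if __name__ == "__main__":
    print(photos_per_salesman_per_day(), "pictures per salesman per day")
    print(total_gb_per_day(), "GB per day")
    print(total_tb_per_month(), "TB per month")
```

Each salesman's share is tiny; it is the multiplication by 10,000 people and 30 days that turns it into a big data problem.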

Someone at Google once pointed out: "What really matters is not what we can do, but at what scale we can do it."

If you only need to analyze 100 lines of data a day, a few sheets of paper and a pen are enough. If you want to analyze 100,000 lines, given the processing capacity of modern computers, a single computer and a purpose-written program will do. But once the data reaches 1,000,000,000 lines (1 TB), even a powerful server can no longer meet your needs, especially when you want real-time or near-real-time processing. This has given rise to a trend in computing and numerical calculation: distributed computing. In distributed computing, a cluster of computers is connected through a network into one system; a job that needs massive calculation is divided into small parts, each part is processed by one computer in the network, and the results are uploaded and combined to arrive at a final conclusion. To make full use of distributed computing, however, several problems have to be solved: How should the data be divided? How can the load be balanced across the computers? How can the partial results be combined efficiently into a final result? Many computing models and concepts have been designed to solve these problems at both the hardware and the software level. Some of the most representative are cloud computing, MapReduce (Hadoop), and virtualization. And this may be only the beginning of the computing tide. As Jack Ma put it: "We are moving from an era of information science and technology to an era of data science and technology."
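The divide, process, and combine steps described above can be sketched on a single machine. The example below is a minimal illustration in the spirit of MapReduce, not Hadoop's actual API: threads stand in for the computers of a cluster, the function names are hypothetical, and the T-shirt colour records are invented sample data.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(records, n_chunks):
    """Divide the records into n_chunks pieces of nearly equal size,
    so that every worker gets a balanced share of the load."""
    size, extra = divmod(len(records), n_chunks)
    chunks, start = [], 0
    for i in range(n_chunks):
        end = start + size + (1 if i < extra else 0)
        chunks.append(records[start:end])
        start = end
    return chunks


def map_chunk(chunk):
    """'Map' step: one worker counts the values in its own chunk."""
    return Counter(chunk)


def reduce_counts(partial_counts):
    """'Reduce' step: merge the partial counts into one final result."""
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total


def distributed_count(records, n_workers=4):
    """Split the data, process the chunks in parallel, combine the results.
    Threads are only a stand-in for the networked computers of a cluster."""
    chunks = split_into_chunks(records, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partial_counts = list(pool.map(map_chunk, chunks))
    return reduce_counts(partial_counts)


if __name__ == "__main__":
    # Invented sample data: T-shirt colours bought by one age group.
    colours = ["white", "black", "white", "red", "black", "white", "blue"]
    print(distributed_count(colours, n_workers=3).most_common(1))
```

Counting is a convenient example because partial counts can be added in any order, which is what makes the combining step cheap. In a real cluster the hard parts are the ones the article lists: moving the data, balancing the load, and coping with machines that fail.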

Mass data and the new occupations of the Internet

To do well in mass data, the first and most important thing is to obtain massive amounts of valuable data, and here most native Chinese Internet enterprises have an advantage. China has a large population, a dynamic economy, and millions of Internet users, and the abundance of user behaviour data is directly determined by the abundance of these user resources. Taobao has 300 million registered users, and Tencent's registered users have already exceeded 1 billion. All this user data is a goldmine.

A new generation of technology is bound to create strong demand for a new generation of technicians. In the era of big data, data scientist and data engineer have become two of the hottest occupations in Silicon Valley. Compared with traditional software engineers, data scientists are researchers who stand between mathematics (statistics) and computer science: their work includes software design and development as well as data modelling and statistical analysis, and they are able to turn data processing models into feasible software solutions. Native Chinese Internet enterprises therefore also attach great importance to building a reserve of data science talent, and in the foreseeable future data science practitioners are likely to be very popular in the job market.
