Microblogging, as a typical representative of social networking applications, its advantages in interpersonal communication and information dissemination are very obvious. At present, micro-blog has become an important enterprise information dissemination channels and customer management tools. In this paper, based on the construction of enterprise microblogging knowledge network, combined with social network analysis, the author puts forward a set of micro-blog enterprises in-depth analysis of practical methods. It has a certain reference value for the further development of the value of enterprise microblogging data.
Microblogging has become an important platform for public opinion, media communication, corporate brand and product promotion because of the nature of sharing, interaction and openness. Enterprise microblogging is a kind of accounts that enterprise open in the microblogging platform [
Enterprise microblogging, an enterprise account in the microblogging platform, as an independent medium on social media, has very important commercial value. B. J. Jansen and M. Zhang, who analyzed 150,000 microblogging information in Twitter, found that 3.8% of the microblogging messages conveyed the brand’s emotional tendencies, and that the micro-blog site to become the platform of enterprise marketing, customer relationship maintenance and word of mouth [
The concept of knowledge networks was first proposed by the Swedish industry, the relevant research began in the 20th century, the mid-90s. Beckmann described the knowledge network as institutions and activities for production and dissemination of scientific knowledge in the academic point of view [
At present, Sinamicroblogging API (Application Programming Interface) content is not fully open. There are many restrictions in the number of single-query results, access to data resources and call frequency. So simply using the API provided by Sinamicroblogging to obtain comprehensive microblogging business data is more difficult. The data acquisition efficiency is relatively low too. Therefore, this paper choose to use the Web crawler technology to obtain business microblogging data.
Web crawler is a client program, used to obtain information on the web page. How Web Crawlers Work:
1) Establishes a connection with the server, sends an Http request to the server, and requests a web page;
2) The server receives the request, make a response, and return to the web crawler page html source code;
3) Web crawler analysis html page, access to the URL contained in html, joined the URL queue;
4) If the URL is still not crawled in the URL queue, return to step 1 to continue.
Using the Web crawler to get a collection of microblogging content as follows:
M = { ( m i ) | i = 1 , ⋯ , a } (1)
where i is the microblogging number, a is the total number of microblogs to obtain, M is the collection of all microblogging content.
In order to obtain the key words of each micro-blog text, we must first segment the micro-blog text, and then get the enterprise micro-blog node set K, as follows:
K = { ( k j ) | j = 1 , ⋯ , n } (2)
The word frequency of k j is expressed by q ( k j ) , and the high frequency word set K1 is established as follows:
K 1 = { ( k j ) | q ( k j ) > q 1 } (3)
q 1 is the threshold value, used to distinguish between high frequency and low frequency keywords.
In the enterprise microblogging, the keywords with high frequency often re- present the focus of microblogging communication. And obtains frequency weight set Q of the high frequency keywords according to the word frequency of the acquired keywords:
Q ( K 1 ) = { q ( k j ) | k j ∈ K , q ( k j ) > q 1 , j = 1 , ⋯ , n } (4)
In the enterprise micro-blog social network, the two keywords can be linked through microblogging content. The more the number of times two keywords appear in the same microblogging content, the closer the relationship between the two topics is [
E = { ( e i j ) | e i j = 1 , q ( k i ) > q 1 , q ( k j ) > q 1 ; i , j = 1 , 2 , ⋯ , n } (5)
Based on the number of co-occurrence relations between high-frequency nodes, the co-occurrence relationship weight set of the enterprise micro-blog high-frequency nodes can be constructed as follows:
Q ( E ) = { q ( e i j ) | i , j = 1 , ⋯ , n } (6)
The greater the value of q ( e i j ) , the greater the link between the two keywords. Through the statistics the number of times that all the keywords appeared in the same micro-blog, we could be able to build enterprise micro-blog high frequency keywords co-occurrence matrix.
According to the enterprise micro-blog high-frequency keyword set obtained from (3), high frequency key word frequency set obtained from (4) and high frequency node co-occurrence times obtained from (6), we can get enterprise micro-blog Weighted Knowledge Network Model as follows:
E M K N = { K 1 , Q ( K 1 ) , Q ( E ) } (7)
From the formula (7), we can know that the nodes have two kinds of weights: Q(K) and Q(E), where Q(K) denotes the number of times the node appears, Q(E) represents the number of co-occurrence between nodes. We can weigh the importance of the node from these two respects.
Using the EMKN model, the nodes with larger weights can be identified and analyzed. The high-frequency keyword set is composed of keywords whose word frequency is greater than a certain high-frequency threshold. This collection represents the point at which the enterprise wants to focus on disseminating knowledge to users in all published microblogs. The threshold of high-frequency keywords can be determined by the enterprise microblogging according to the actual situation.
In the social network, the centrality can be used to measure the node’s position in the overall network. The nodes with high centrality are at the core position in the whole network. Other nodes are either directly related to them, or connected to other nodes through them [
In the network, if the degree of a node is larger, it means that the higher the centrality of this node, the greater the importance of the node in the network.
Without considering the weight of the edge, the degree centralization of the node ki is:
C D ( k i ) = [ ∑ j = 1 n e i j ] / ( n − 1 ) (8)
Considering the weight of edge, the degree centralization of node ki is:
W C D ( k i ) = [ ∑ j = 1 n q ( e i j ) ] / ( n − 1 ) (9)
The significance of betweenness centralization can be expressed as follows: if two non-adjacent nodes s and t want to interact with each other and node i is on their path, the node i may control the interaction between them [
Similarly, we can calculate the betweenness centralization C B ( k i ) of K i :
C B ( k i ) = 2 ∑ s < t g s t ( k i ) / g s t ( n − 1 ) ( n − 2 ) (10)
In the above formula, g s t denotes the number of shortcuts for nodes k s to k t , and k t , indicates the number of shortcuts to pass through k i .
Based on the above centrality metric, we can find that the node with high centrality is very important for discovering the microblog topic information.
Using the EMKN model, we can cluster the enterprise micro-blog by the method of cohesive subgroup analysis. At the same time, we can use the network midpoint and edge weights to carry on the further analysis. This way is not only able to analyze the weight of various groups of enterprise micro-blog and the relationship between groups, but also the composition of various groups. Importantly, the visualization of the analysis results helps to identify the key communication points of the enterprise micro-blog effectively. In addition, the knowledge points of any subgroup can be represented by a knowledge subnet. With this knowledge subnet, we can deeply analyze the internal structure, hot spots and associated patterns of sub-groups.
In this study, we selected the Club of Huawei, official microblogging of Huawei, as the research object in Sina micro-blogging platform. In recent years, Huawei’s brand awareness and reputation have been greatly improved. The National Federation of Industry and Commerce released “2016 top 500 Chinese private enterprises” list, Huawei become the top 500 list with 395.09 billion Yuan in annual revenue. Club of Huawei is an interaction platform for Huawei’s fans. It answers questions of fans, presents the latest product and service information, provides rich online content and offline interaction at the first time. Therefore, this paper selects this microblogging account for data collection.
First of all, the preparation of reptiles collects all the microblogging of Club of Huawei in January 1, 2015 to December 31, 2015, a total of 2974. Then segment the microblog text: we use the NLPIR/ICTCLAS 2014 Chinese word segmentation system to preprocess the crawled microblogging text, and get the keywords by all the microblogs. The third step is to obtain high-frequency keywords. In this paper, we choose 104 words with frequency more than 50 as high frequency keywords.
According to the formula (4), we can get the word frequency of the high frequency keywords. Construct the matrix between microblogging content and keywords, form 2974 × 104 word matrix. In the enterprise microblogging text, there must be some kind of association between co-occurrence keywords, the degree of association can be expressed with the frequency of co-occurrence. According to formula (6), the co-occurrence matrix of 104 high-frequency keywords is obtained, which is 104 × 104 co-word matrix.
According to the co-occurrence relationship between high frequency keywords, the knowledge network model of Club of Huawei was constructed with Ucinet software, as shown in
Keyword | Freq. | Keyword | Freq. | Keyword | Freq. | Keyword | Freq. |
---|---|---|---|---|---|---|---|
Pollen | 2455 | Live micro-blog | 339 | Honor 7I | 238 | Pollen annual meeting | 194 |
Huawei | 1137 | Micro-blog | 313 | Entertainment | 227 | China | 180 |
Mobile phone | 738 | Winning yesterday | 267 | Function | 210 | Global | 177 |
Honor | 681 | Gift | 264 | Youth | 199 | Intelligence | 165 |
Minister | 509 | Conference | 256 | World | 195 | Product | 162 |
Topic | 478 | Huawei P8 | 238 | Excellent Purchase Code | 195 | Headset | 158 |
Enterprise microblogging, as a main positions of enterprise marketing and promotion and an important window of connecting with the user directly, its content often represents the main information enterprise wants to pass to its users. Therefore, in general, the high-frequency words appeared in the enterprise microblogging, often on behalf of the focus of the enterprise microblogging content.
We can see from
In order to further study the importance of the key nodes in the Club of Huawei microblogging, refers to the centrality of the social network, this paper uses the degree centralization and the betweenness centralization of node to identify the central point in the microblogging. The analysis results are shown below:
As can be seen from
Pollen | Huawei | Mobile phone | Honor | Minister | Topic | Live micro-blog | |
---|---|---|---|---|---|---|---|
Pollen | 1724 | 323 | 224 | 475 | 251 | 221 | 134 |
Huawei | 323 | 652 | 179 | 155 | 78 | 162 | 150 |
Mobile phone | 224 | 179 | 452 | 180 | 50 | 71 | 42 |
Honor | 475 | 155 | 180 | 1002 | 183 | 180 | 138 |
Minister | 251 | 78 | 50 | 183 | 444 | 57 | 39 |
Topic | 221 | 162 | 71 | 180 | 57 | 475 | 312 |
Live micro-blog | 134 | 150 | 42 | 138 | 39 | 312 | 339 |
The betweenness centrality directly reveals the importance of the node location in the whole knowledge network. As we can see from
It is noteworthy that although the frequency and degree centralization of “Huawei P8” are relatively high, its betweenness centralization is only 17.241,
which means that it is not very important in the whole network. This is because, as an independent high-end model in addition to the honor series, the betweenness role of “Huawei P8” is not obvious.
By analyzing the cohesive subgroup of the enterprise microblogging knowledge network, it is possible to divide the microblog topic. Analysis results into the following:
As shown in
It can be seen from
Group 1 mainly includes some topics and activities to increase the stickiness of microblogging fans, such as the activities of “Pollen Handy Photos”, “food”, and “entertainment”. These microblogging related to the daily life of fans can effectively increase the fan’s activity.
Group 2 mainly includes some marketing activities of “Honor Changwan 5X”. According to the composition of the key words group 2, we can find that, for this product, microblogging marketing activities are excellent purchase code and lottery mainly.
Group 3 mainly includes marketing activities of “Huawei P8”. This type of group contains the most keywords, we can see, for this high-end model, the marketing activities are the most diverse.
Group 3 mainly includes marketing activities of “Huawei P8”. This type of group contains the most keywords, we can see, for this high-end models, the marketing activities are the most diverse.
Group 4 mainly includes marketing activities of new product of honor series in 2015. From the composition of group 4, we could found that the Club of Huawei launched a number of marketing activities for college students. Therefore, we can easily draw the conclusion that the products of honor series mainly targeted at young users, especially college students. This is also very consistent with the low-end machine positioning of honor series products.
Group 5 mainly includes “tablet”, “Honor Changwan 4C”, “Huawei Changxiang” and so on. By the group 5, we can know that these products are not Club of Huawei microblogging marketing focus, their marketing activities are also very single, and “Price”, “brand” are their main selling point.
Group 6 is the product information most relevant to the actual use of the user, including “key”, “lens”, “video”, “information”, and “function”.
Group 7 is the topic of product design, including the “battery”, “craft”, “metal”, and “fuselage”.
Group 8 mainly includes keywords related to product features. From these key words, we can easily find that Huawei mobile phones in 2015 mainly has made a
Cluster | Keywords |
---|---|
1 | Pollen (Huawei’s fans), smart key, mobile phone, honor, custom gifts, food, friend, entertainment, time, Pollen Handy Photos, network |
2 | Honor Changwan 5X, gift, custom, USB flash disk, Chance to win, opportunity, Excellent Purchase Code, bolster, hat, winning yesterday, headset |
3 | micro-blog, Huawei P8, award, Club of Huawei, Huawei, news, comment, minister, photo, China, market, center, Shanghai, product, world, fashion, partners, GO Honor World Carnival, lighten, scenery, Shenzhen, witness, life, global, time, city, science and technology, picture, Paris, Milan, Pollen Annual Meeting, problem |
4 | movie, conference, band, topic, youth, power, Honor Voice Maker, game, GO brave, dream, Honor Changwan 4X, Honor 4A, Beijing, music, college, nationwide, creativity, Fenfen (The nickname of pollen), Honor 7I, story, campus, face score, new products |
5 | tablet, brand, Netcom, live micro-blog, Honor Changwan 4C, price, Vmall, Huawei Changxiang |
6 | intelligence, key, user, lens, video, information, function, mode |
7 | battery, craft, metal, fuselage |
8 | screen, camera, fingerprint, chip, system, technology |
breakthrough in the “screen”, “camera”, “fingerprint”, “chip”, “system”, and “technology”.
This paper constructs a micro-blogging knowledge network with micro-blog text keywords as network nodes, keyword frequency and the co-occurrence relation between them as weights and edges and effectively found the important nodes, the central nodes and the categories of the microblogging transmission through the central analysis of the micro-blogging knowledge network and the cohesive subgroup analysis. Then, with Huawei’s official micro-blog “Club of Huawei” as the research object, this paper constructs the knowledge network of Club of Huawei, and effectively analyzes its key products, marketing selling points and main marketing activities, and further analyzes the major improvement in the field of mobile phone products in 2015 in Huawei. Through the example, we found that the knowledge network model can carry on the thorough and comprehensive analysis to the enterprise’s micro-blog, which is of great significance to the micro-blog marketing and the enterprise’s competitive intelligence.
Liu, W.J. (2017) The Research on Enterprise Micro-Blog Ana- lysis Method Based on Knowledge Network. Open Journal of Social Sciences, 5, 127-138. https://doi.org/10.4236/jss.2017.52013