Respiratory diseases account for a large proportion of human diseases. Among them, we focus on four viruses for which no broadly effective vaccine has been found: Human Rhinovirus 14 (HRV), Human Coronavirus OC43 (HCoV), Respiratory Syncytial Virus (RSV), and Human Parainfluenza Virus 1 (HVJ). Although the body can usually clear these infections on its own, some cases end in death. For these reasons, we grouped the viruses by their basic symptoms and characteristics and used data mining to find similarities and differences among their sequences. The decision tree, which produced rules with high frequency, showed that the sequences are too different from one another, but a decision tree only reveals the differences between sequences. The apriori algorithm suggests that it may be possible to find a remedy that blocks the amino acid leucine (L).
Coronaviruses have single-stranded RNA as their genetic material and a helically symmetric nucleocapsid. They are also enveloped viruses. In particular, the SARS coronavirus carries the S protein and hemagglutinin esterase on its envelope. These proteins help the virus attach to the cell membrane.
Coronaviruses use vertebrates as hosts and cause various diseases. Of the many coronaviruses, only 6 are known to infect humans. The virus usually infects the upper airway of the respiratory system and the gastrointestinal tract. However, the SARS coronavirus infects both the upper and lower airways because of its unique pathogenesis.
Coronaviruses are known as a major cause of the common cold in adults, which occurs mostly in winter and spring. Unfortunately, the virus is difficult to culture in the laboratory, so its contribution to the common cold is hard to determine accurately. In serious cases it can lead to viral pneumonia or bacterial pneumonia.
There is a vaccine for the coronavirus that infects dogs, but for now there is no vaccine or remedy for humans. Fortunately, a recent study shows that the chemical compound K22 inhibits the proliferation of the coronavirus, so a therapeutic agent is likely to be developed.
RSV has single-stranded RNA as its genetic material and is also an enveloped virus.
It spreads mainly by physical contact and has an incubation period of about 5 days. It infects both the upper and lower airways, but in adults it causes no serious symptoms.
FI-RSV was developed as a vaccine for RSV, but it was found to exacerbate the disease.
Three approaches to countering RSV have been studied. The first is passive immunization, which targets the F and G proteins on the envelope that play a major role in the initial infection of RSV. Palivizumab, a monoclonal antibody that targets the F protein, is used for infants with weak immune systems.
The second approach uses antivirals. Ribavirin is an antiviral for RSV, but its effectiveness is uncertain.
The last approach uses active immunization and is still under development. Among the candidate vaccines, the recombinant vaccine is administered into the nasal cavity and combines attenuated recombinant RSV mutants. It also uses the F protein as an antigen.
Parainfluenza virus has single-stranded RNA as its genetic material and has an envelope. On the envelope there are the F protein, M protein, P protein and spike protein. It has 5 serotypes.
In adults, infection causes upper respiratory inflammation; in children, it causes bronchiolitis and pneumonia. Its main manifestation is acute laryngotracheobronchitis, but in 2014, as it became more active, the virus spread more widely and caused severe symptoms such as pneumonia, bronchiolitis and exacerbation of asthma. When acute laryngotracheobronchitis worsens, it usually leads to a progressive cough accompanied by stridor, hyperventilation and inspiratory retraction. It produces phlegm, but in older age groups it causes only mild symptoms.
There is no vaccine, but ultraviolet light has been found to inactivate the virus.
Rhinovirus has single-stranded RNA as its genetic material and does not have an envelope. It has more than 100 serotypes, so it is hard to prevent with a vaccine.
It occurs in every season but is most common in spring and autumn. It has very low tolerance to heat and acid and mostly causes upper respiratory inflammation. Unlike other enteroviruses, it is inactivated when the pH falls below 6. By adulthood, the host has neutralizing antibodies against almost all serotypes. Regardless of serotype, immunity lasts for 2 to 16 weeks. Rhinovirus attaches to a specific cell receptor to infect the cell. The most common route is self-inoculation, in which contaminated hands touch the conjunctiva or nasal mucosa. The incubation period is very short, and its main hosts are mammals. In acute upper respiratory inflammation, the mucus glands in the lower nasal mucosa become hyperactive, the nasal conchae become congested, and the exits of the paranasal sinuses close. Children are infected frequently, and the illness lasts for 4 to 9 days. It does not usually cause lower respiratory inflammation, but it can be accompanied by bronchiolitis, pneumonia and asthma. It resolves in a short time without aftereffects. However, ear infection, acute sinusitis and other complications can occur because of closure of the Eustachian tube and the exits of the paranasal sinuses.
Rhinovirus infection usually does not require treatment. However, antibiotics are required if bacterial complications occur.
The apriori algorithm is mainly used to find association rules in the data mining process. The algorithm extracts the elements that are repeated within a section, then extends to a wider range and looks for repetitions of the same elements.
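The level-wise search described above can be illustrated with a minimal sketch. It is not the authors' implementation: the non-overlapping windowing, the encoding of each window as (position, residue) items, the toy sequence, and the `min_support` and `min_confidence` thresholds are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def windows(sequence, size):
    """Cut a sequence into consecutive, non-overlapping windows (an assumption)."""
    return [sequence[i:i + size] for i in range(0, len(sequence) - size + 1, size)]


def to_transaction(window):
    """Encode a window as a set of (position, residue) items, positions from 1."""
    return frozenset(enumerate(window, start=1))


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset with support >= min_support."""
    n = len(transactions)
    counts = Counter(item for t in transactions for item in t)
    current = {frozenset([i]): c / n for i, c in counts.items() if c / n >= min_support}
    frequent = dict(current)
    k = 2
    while current:
        # Join step: build size-k candidates from frequent (k-1)-itemsets.
        candidates = {a | b for a, b in combinations(list(current), 2) if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {c for c in candidates
                      if all(frozenset(s) in current for s in combinations(c, k - 1))}
        current = {}
        for c in candidates:
            support = sum(1 for t in transactions if c <= t) / n
            if support >= min_support:
                current[c] = support
        frequent.update(current)
        k += 1
    return frequent


def rules(frequent, min_confidence):
    """Return (antecedent, consequent, confidence) with single-item consequents."""
    found = []
    for itemset, support in frequent.items():
        if len(itemset) < 2:
            continue
        for item in itemset:
            antecedent = itemset - {item}
            confidence = support / frequent[antecedent]
            if confidence >= min_confidence:
                found.append((antecedent, item, confidence))
    return found


# Toy sequence (hypothetical, not taken from the paper), cut into windows of 3.
toy = [to_transaction(w) for w in windows("ALTALTALSGLT", 3)]
for antecedent, consequent, confidence in rules(apriori(toy, 0.5), 0.9):
    print(sorted(antecedent), "->", consequent, round(confidence, 3))
```

In this toy data, every rule that passes the confidence threshold has L at position 2 as its consequent, which mirrors the kind of "posN = X" rules reported in the tables, although the thresholds the authors used are not stated.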
The decision tree algorithm is used in rule mining. The algorithm repeatedly looks for a common node, starting with the root node, in the categorized data and finds features that can bind the respective data into a specific group.
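As a rough illustration of how such a tree separates windows by class, the sketch below builds a small ID3-style tree that splits on window positions by information gain. The choice of ID3, the toy windows, and the class labels are assumptions for illustration; the source does not state which decision tree implementation was used.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def best_position(rows, labels, positions):
    """Pick the window position whose split gives the largest information gain."""
    base = entropy(labels)

    def gain(pos):
        groups = {}
        for row, label in zip(rows, labels):
            groups.setdefault(row[pos], []).append(label)
        remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
        return base - remainder

    return max(positions, key=gain)


def build_tree(rows, labels, positions):
    """Return a leaf (class label) or a node (position, branches, default label)."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not positions:
        return majority
    pos = best_position(rows, labels, positions)
    remaining = [p for p in positions if p != pos]
    branches = {}
    for residue in set(row[pos] for row in rows):
        subset = [(r, l) for r, l in zip(rows, labels) if r[pos] == residue]
        sub_rows, sub_labels = zip(*subset)
        branches[residue] = build_tree(list(sub_rows), list(sub_labels), remaining)
    return (pos, branches, majority)


def classify(tree, window):
    """Walk the tree; fall back to the node's majority class for unseen residues."""
    while isinstance(tree, tuple):
        pos, branches, default = tree
        tree = branches.get(window[pos], default)
    return tree


# Toy windows and labels (hypothetical, not taken from the paper).
toy_rows = ["ALT", "ALS", "GLT", "GKT"]
toy_labels = ["HRV", "HRV", "RSV", "RSV"]
toy_tree = build_tree(toy_rows, toy_labels, [0, 1, 2])
print(classify(toy_tree, "AKS"))
```

Here the first position separates the two toy classes perfectly, so it becomes the root node. A tree like this only reports which positions distinguish the classes, which is consistent with the paper's remark that the decision tree shows differences rather than shared features.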
We ran the experiment with 10-fold cross-validation and found that the sequences of each virus did not follow the rules of the other sequences. Every data set yielded some rule, but the 17-window of
4 (rhinovirus) had sequences of different lengths, and to extract the result we had to amplify the sequence except for
Amino acid T which is shown considerably in
Amino acid F which is shown considerably in
Human parainfluenza virus 1 | ||
---|---|---|
Window | Rule | Frequency |
Window 9 | pos1 = E pos6 = L pos8 = T pos1 = E pos6 = L pos4 = M pos5 = R pos1 = E | 0.929 0.923 0.917 |
Window 13 | pos9 = T pos9 = T pos6 = K pos12 = G | 0.909 0.917 |
Window 17 | pos5 = L pos15 = Q pos15 = Q pos15 = Q pos2 = Y pos1 = P pos2 = Y | 0.941 0.933 0.929 0.923 0.917 0.909 |
Human parainfluenza virus 1 | ||
---|---|---|
Window | Rule | Frequency |
Window 9 | pos3 = D pos5 = L pos3 = F pos6 = L pos3 = V pos6 = V pos3 = V pos6 = V pos1 = L pos4 = A pos2 = F pos5 = V pos5 = A pos8 = V pos3 = F pos6 = L | 0.917 0.917 0.909 0.909 0.909 0.909 |
Window 13 | pos2 = I pos7 = V pos2 = I pos7 = V pos7 = A pos12 = L pos7 = V pos12 = F | 0.929 0.923 0.917 0.909 |
Window 17 | pos2 = A pos4 = S | 0.909 |
Human parainfluenza virus 1 | ||
---|---|---|
Window | Rule | Frequency |
Window 9 | pos6 = Y pos8 = S pos5 = Y pos9 = Q pos6 = Y pos8 = S | 0.917 0.909 0.909 |
Window 13 | pos5 = Y pos12 = Y pos5 = Y pos12 = Y | 0.909 0.917 |
Window 17 | pos7 = K pos10 = S pos15 = N pos2 = F pos4 = R | 0.917 0.909 |
Human parainfluenza virus 1 | ||
---|---|---|
Window | Rule | Frequency |
Window 9 | pos3 = C pos7 = W pos3 = C pos7 = W pos3 = S pos9 = W pos8 = C pos3 = C pos7 = W pos3 = C pos7 = W | 0.929 0.923 0.923 0.917 0.909 |
Window 13 | pos11 = W pos12 = D pos9 = T pos12 = W pos11 = W pos12 = D pos6 = R pos12 = C pos9 = T pos12 = W pos9 = T pos12 = W pos11 = W pos12 = D pos13 = I | 0.929 0.929 0.923 0.923 0.917 0.917 |
Window 17 | pos1 = N pos2 = S pos1 = N pos2 = S pos1 = N pos2 = S pos1 = N pos2 = S pos2 = S pos12 = N pos4 = A pos7 = R pos2 = S pos12 = N pos2 = K pos8 = P pos2 = K pos8 = P pos8 = S pos14 = W pos4 = H pos7 = R pos2 = F pos7 = H pos8 = N pos5 = C pos9 = F | 0.962 0.958 0.957 0.952 0.941 0.941 0.938 0.929 0.923 0.923 0.917 0.909 |
According to the apriori algorithm, there are two main features. Because every virus had that amino acid, it could be a major cause of respiratory diseases. Unfortunately, studies on the connection between those amino acids and respiratory viruses are not available at present. We could not determine the main function of those amino acids or how to make a remedy that reacts with them and blocks them. Given the common features found by the apriori algorithm and the differences shown by the decision tree algorithm, a common vaccine may be found, but further study is needed.
Chaewon Ham, Haeun Jung, Taeseon Yoon (2015) Analysis and Comparison about the Common Remedy of Respiratory Viruses through Data Mining. Journal of Biosciences and Medicines, 03, 54-59. doi: 10.4236/jbm.2015.36009