Monday, June 3, 2019
The Heart Disease Prediction System Computer Science Essay
The Heart Disease Prediction System Computer Science EssayThere be enormous kernel of selective in coifion available from checkup exam industry which could be useful for medical practitioners when it is used for discovering unfathomable pattern with help of existing selective information digging techniques. The prefatory medical records from a patients profile can be useful in identifying hidden pattern with entropy mining techniques. In this paper, Nave verbalise algorithmic programic program to phone heart disease is implemented with basic records of patients like age, sex, heart rate, blood compel etc., from a example distribution dataset. The benefits, limitations, and technical details of this implementation allow also be discussed in this paper.1 IntroductionOver these years in medical history, many types of medical problems have been identified and many data are available regarding a particular problem. But not all the medical data are same, but there are many pa tterns hidden inside those data which needs to be identified. Data mining techniques could help identify these hidden patterns by knowledge discovery. In the medical field, patients health issues are predicted by doctors intuition or experience 2 where the knowledge rich data is suppressed which endpoints in high medical expenses and unnecessary medical tests. In late(a) years, there are many researches being conducted in order to regard the hidden pattern from basic medical data 1. Identifying these hidden pattern would result in a developing an efficient decision making system in medical industry which aide as a tool to run doctors decision making or at least serve as a prediction system for any medical issues.In this paper, we have taken into consideration of heart disease and predict it using the set of data that are already in existence with the help of data mining technique. The algorithm that we have chosen is the Nave bay algorithm, this algorithm is ideal for a vast am ount of database that may contain hundreds and thousands of rows and columns. The Nave Bayes algorithm provides the mean outturn faster and more accurate as the number of data in the database increase.1.1 Problem ScenarioThere are only few decision uphold systems available in medical industry whose functionalities are very limited. As mentioned earlier, medical decisions are made with doctors intuition and not from the rich data from the medical database. Wrong treatment due to misdiagnosis causes serious threat in medical field. In order to solve these issues data mining solution was with help of medical databases was introduced.1.2 Related WorkThere are many techniques available to discover knowledge from medical database 1. Researchers at Southern California used data mining technique to discover the success and failure of back surgery in order to improve medical treatment 3. Shouman et al 4 implemented predictive data mining to diagnose heart disease of patients. Palaniappan et al 2 developed a prototype Intelligent Heart Disease Prediction System (IHDPS), using data mining techniques.1.3 ObjectiveIn this paper, Nave Bayes algorithm to predict heart disease is implemented with basic records of patients like age, sex, heart rate, blood pressure etc., from a hear dataset. Based on the literature survey Nave Bayes algorithm was found to be an effective technique. The probabilistic method helped in finding the converse probability of the conditional relationship. The dependence relation may exist between two attributes of data set which can be laid with this algorithm.2 Data PreparationIn order to implement the algorithm, a medical data was required. The sample dataset used for the purpose of implementation of algorithm was obtained from Cleveland Clinic Foundation. The sample of dataset is shown in the below figure (Figure1.)CUsersMadan KumarDesktopUntitled2.jpgFigure1. Sample dataset2.1 Dataset SourceThe Cleveland institute medical data was downloaded f rom website of University of California, Irvine.2.2 Dataset AttributesThe dataset consists of 16 attributes. The last attribute of dataset consists of abide by 0 and 1. The value 0 indicates that the patient does not have heart disease whereas 1 indicates that the patient has a heart disease. The prediction of algorithm can be confirm with this value while evaluating the algorithm. The first 15 attributes are shown in the figure2.CUsersMadan KumarDesktopattri.jpgFigure2. Dataset attributes3 Program ArchitectureThe program was implemented using JAVA. Apache TOMCAT horde and MySQL Database is also used. The Nave Bayes algorithm has three class files Calculation.java, Prediction.java, and Detection.java. Detection.java reads the data file from the source path and stores the attributes into temporary array list. The mean and standard deflexion values calculations are performed and probability calculation is also done in Prediction.java. All the dataset attributes are defined in calc ulation.java where mean and standard deviation of attributes were calculated. The calculation.java calls the former(a) two classes while executing the program. Figure3 represents the program architecture.CUsersKirubanidhyDesktopArchitecture.jpgFigure3. Architecture3.1 Building and running a DemoTOMCAT server is used to present the output in web based form. The output will run in localhost. The MySQL database is used to identify the patient records. At the execution point, the local host is accessed and 15 questions will be displayed which will be obtained from user and algorithm will be called to calculate and predict the disease possibility on that person. A insure will be generated at the end of the demo which says if the person is predicted with heart disease or not.In general,1. Obtains the values from user.2. Reads the data file.3. Calls the algorithm and calculates mean, deviation, and probability of attributes.4. Generates a report displaying the values given with the predic tion of disease.4. ImplementationAll the attributes of dataset is of a numerical value that has some meaning. The meaning of dataset attributes are as shown in figure2. deterrent example the attributes sex is denoted with values 1 and 0 where 1 denotes Male and 0 denote Female. Fasting blood sugar values are also denoted using 1 and 0 where 1 denotes 120mg of fasting blood sugar level and 0 denotes These values from the data file are accessed by the Nave Bayes algorithm. The values 0 and 1 are extracted from data file and stored to an array list for each attribute e.g. age array list, sex array list, and knocker pain type array list etc., in order to perform calculation. Here, the values are defined on what those values stands for before storing to the array list. The sample of the interface (for obtaining slope value) is shown in figure4. Here the un-sloping, flat, and down- sloping represents the value 1, 2, and 3 respectively.CUsersMadan KumarDownloadsUntitled.jpgFigure 4. Inte rface SampleCUsersMadan KumarDownloadsUntitled2.jpgFigure5. Sample of report format5. Modules DescriptionAnalyzing the Data setThe attribute Diagnosis was identified as the predictable attribute with value 1 for patients with heart disease and value 0 for patients with no heart disease. The attribute PatientID was used as the key the rest are input attributes. It is assumed that problems such as missing data, inconsistent data, and counterpart data have all been resolved.Naives Bayes Implementation in MiningBayes Theorem finds the probability of an event occurring given the probability of an different event that has already occurred. If B represents the dependent event and A represents the prior event, Bayes theorem can be stated as follows.5.2.1 Bayes TheoremProb (B given A) = Prob (A and B)/Prob (A)To calculate the probability of B given A, the algorithm counts the number of cases where A and B occur together and divide it by the cases where A occurs alone. Applying Nave Bayes to data with numerical attributes, predict the class using Nave Bayes classificationFigure6 (a) Top Mean (b) Bottom Standard DeviationFigure6 (c) Laplace Transform6. EvaluationUser enters the values for the questionnaire to find out whether the patient has a heart disease or not. By feeding sample data from the dataset and performing the mining operations with the Nave Bayes algorithm, it is found out that the Nave Bayes algorithm gives 95% probability in predicting if patient have heart disease or not. 95% true statement is quite good to use as a decision weather system.The figure shows the accuracy of Nave Bayes algorithm (figure7). The figure shows the highest probability of pass up predictions and lowest probability of incorrect predictions.CUsersMadan KumarDesktopUntitled1.jpgFigure7. Model Results of three algorithms 27. LimitationsApart from the benefits like probabilistic approaches and fast reliable algorithm of Nave Bayes, the serious blemish of the algorithm is its ab ility in handling small datasets. Nave Bayes classifier requires relatively large dataset to obtain best results. Yet, studies showed that Naive Bayes algorithm outperforms another(prenominal) algorithms in accuracy and efficiency. Notable limitation of this paper is the usage of small dataset. This dataset can be used for training or testing purpose only. excessively the dataset could include more attributes for a more effective prediction in supporting clinical decisions.8. Future WorkThe algorithm is working well with this sample dataset. Implementing the algorithm with large dataset could give better results which can aid as a supporting tool in making medical decisions. In future, other possible algorithms could be implemented where efficiency of all algorithms could be analyzed to decide on best suitable technique in terms of speed, reliability, and accuracy.9. cultureIn this paper, Nave Bayes algorithm is the only algorithm used for calculation of attributes and prediction . Efficiency and accuracy of the algorithm in predicting were discussed. Designing effective models are constrained by size of the datasets and noisy, incorrect, missing data values. The prototype developed so far has been generally tested by computer experts and not by the doctors. For effective understanding of the health issues, medical experts have to work collaboratively and test the prototypes in order to implement the system in real life to support medical experts in taking clinical decisions.
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.