Data Mining @ Information Age |
Author : M. Arunmozhi , R. Usha |
Abstract | Full Text |
Abstract :In the information age, data turns to be the vital.
Hence it is important to understand the data in order to face the
future information challenges. This paper deals with the
importance of data mining while explaining the concepts and
life cycle involved. It extracts the basic gist of the topic
presented in a user-friendly way. Further, in developing
different stages of data mining followed by its extended
application usage in practical business platform.
|
|
An Approach for Breast Cancer Classification using Neural Networks |
Author : S.Vijaya , D. Gladis |
Abstract | Full Text |
Abstract :Breast Cancer,an increasing predominant death
causing disease among women has become a social concern.
Early detection and efficient treatment helps to reduce the
breastcancerrisk.AdaptiveResonanceTheory(ART1),anunsuper
vised neural network has become an efficient tool in the
classification of breast cancer as Benign(non dangerous
tumour) or Malignant (dangerous tumour). 400 instances were
pre processed to convert real data into binary data and the
classification was carried out using ART1 network. The
results of the classified data and the physician diagnosed data
were compared and the standard performance measures
accuracy, sensitivity and specificity were computed. The
results show that the simulation results are analogous to the
clinical results. |
|
Leanness Assessment using Fuzzy Logic Approach: A Case of Indian Horn Manufacturing Company |
Author : Pradeep Kumar Balasubramanian , K Hemamala |
Abstract | Full Text |
Abstract :— Lean principles are being implemented by many
industries today that focus on improving the efficiency of the
operations for reducing the waste, efforts and consumption.
Organizations implementing lean principles can be assessed
using the some tools. This paper attempts to assess the lean
implementation in a leading Horn manufacturing industry in
South India. The twofold objectives are set to be achieved
through this paper. First is to find the leanness level of a
manufacturing organization for which a horn manufacturing
company has been selected as the case company. Second is to
find the critical obstacles for the lean implementation. The
fuzzy logic computation method is used to extract the
perceptions about the particular variables by using linguistic
values and then match it with fuzzy numbers to compute the
precise value of the leanness level of the organization. Based
on the results obtained from this analysis, it was found that the
case study company has performed in the lean to vey lean
range and the weaker areas have been identified to improve the
performance further.
|
|
Leanness Assessment using Fuzzy Logic Approach: A Case of Indian Horn Manufacturing Company |
Author : Pradeep Kumar Balasubramanian , K Hemamala |
Abstract | Full Text |
Abstract :— Lean principles are being implemented by many
industries today that focus on improving the efficiency of the
operations for reducing the waste, efforts and consumption.
Organizations implementing lean principles can be assessed
using the some tools. This paper attempts to assess the lean
implementation in a leading Horn manufacturing industry in
South India. The twofold objectives are set to be achieved
through this paper. First is to find the leanness level of a
manufacturing organization for which a horn manufacturing
company has been selected as the case company. Second is to
find the critical obstacles for the lean implementation. The
fuzzy logic computation method is used to extract the
perceptions about the particular variables by using linguistic
values and then match it with fuzzy numbers to compute the
precise value of the leanness level of the organization. Based
on the results obtained from this analysis, it was found that the
case study company has performed in the lean to vey lean
range and the weaker areas have been identified to improve the
performance further.
|
|
Market Basket Analysis using Improved FP-tree |
Author : Abhishek Priyadarshi ,Chirag Gupta, G Poornalatha |
Abstract | Full Text |
Abstract :The Market Basket Analysis helps in identifying
the purchasing patterns of customers such as, which products
are purchased more and which products are purchased together.
This helps in decision making process. For example, if two or
more products are frequently purchased together then they can
be kept at the same place so as to facilitate the customer, to
further increase their sale. The price of products that are not
frequently purchased can be reduced in order to enhance their
purchase. Additionally the promotion of one product will also
increase the sales of other products which are purchased
together with the product being promoted. The traditional
Apriori algorithm based on candidate generation cannot be
used in Market Basket Analysis because it generates candidate
sets and scans database regularly for the generation of frequent
itemsets. The FP-growth algorithm cannot be used despite of
the fact that it does not generate candidate sets and scans the
database only twice because, it generates a lot of conditional
trees recursively. Therefore, an efficient algorithm needs to be
used. In this paper an efficient algorithm is used for
development of market basket analysis application. This
efficient algorithm neither generates candidate sets nor
conditional FP- tree; like FP-growth scans the database twice. |
|
A Study on MRI Liver Image Segmentation using Fuzzy Connected and Watershed Techniques |
Author : A.S. Thenmozhi , N. Radhakrishnan |
Abstract | Full Text |
Abstract :A comparison study between automatic and
interactive methods for liver segmentation from contrastenhanced
MRI images is ocean. A collection of 20 clinical
images with reference segmentations was provided to train and
tune algorithms in advance. Employed algorithms include
statistical shape models, atlas registration, level-sets, graphcuts
and rule-based systems. All results were compared to refer
five error measures that highlight different aspects of
segmentation accuracy. The measures were combined
according to a specific scoring system relating the obtained
values to human expert variability. In general, interactive
methods like Fuzzy Connected and Watershed Methods
reached higher average scores than automatic approaches and
featured a better consistency of segmentation quality.
However, the best automatic methods (mainly based on
statistical shape models with some additional free deformation)
could compete well on the majority of test images. The study
provides an insight in performance of different segmentation
approaches under real-world conditions and highlights
achievements and limitations of current image analysis
techniques. In this paper only Fuzzy Connected and
Watershed Methods are discussed. |
|
Survey on Segmentation Techniques for Spinal Cord Images |
Author : S. Shyni Carmel Mary , S. Sasikala |
Abstract | Full Text |
Abstract :Medical imaging is a technique which is used to
expose the interior part of the body, to diagnose the diseases
and to treat them as well. Different modalities are used to
process the medical images. It helps the human specialists to
make diagnosis ailments. In this paper, we surveyed
segmentation on the spinal cord images using different
techniques such as Data mining, Support vector machine,
Neural Networks and Genetic Algorithm which are applied to
find the disorders and syndromes affected in the spinal cord
system. As a result, we have gained knowledge in an identified
disarrays and ailments affected in lumbar vertebra,
thoracolumbar vertebra and spinal canal. Finally how the Disc
Similarity Index values are generated in each method is also
analysed |
|
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy Classification Algorithms |
Author : P. Suganya , C. P. Sumathi |
Abstract | Full Text |
Abstract :- Health care has millions of centric data to discover
the essential data is more important. In data mining the
discovery of hidden information can be more innovative and
useful for much necessity constraint in the field of forecasting,
patient’s behavior, executive information system, e-governance
the data mining tools and technique play a vital role. In
Parkinson health care domain the hidden concept predicts the
possibility of likelihood of the disease and also ensures the
important feature attribute. The explicit patterns are converted
to implicit by applying various algorithms i.e., association,
clustering, classification to arrive at the full potential of the
medical data. In this research work Parkinson dataset have
been used with different classifiers to estimate the accuracy,
sensitivity, specificity, kappa and roc characteristics. The
proposed weighted empirical optimization algorithm is
compared with other classifiers to be efficient in terms of
accuracy and other related measures. The proposed model
exhibited utmost accuracy of 87.17% with a robust kappa
statistics measurement and roc degree indicated the strong
stability of the model when compared to other classifiers. The
total penalty cost generated by the proposed model is less when
compared with the penalty cost of other classifiers in addition
to accuracy and other performance measures. |
|
Weather Impact over Uttarakhand using k-Means Clustering Technique for Cloudburst Prediction |
Author : Deepalakshmi, J.Karthikeyan |
Abstract | Full Text |
Abstract :with the advancement of information technology
and their tremendous development, ‘Numerical Weather
Prediction’ is used by many meteorological services for
predicting weather forecasts. This is available mostly for the
welfare of the public. As this needs more scientific knowledge,
Global Forecast Model came into existence for prediction of
weather development from Numerical Weather Prediction.
Data mining Clustering technique is applied in this analysis for
forecasting the National Centre for Medium Range Weather
Forecasting model which helps in predicting cloudburst. Tamil
Nadu recently overcame a dreadful cloudburst on October
2015. Foretelling of cloudburst is exceptionally hard. This
could be foretold only a few hours before. In difference to the
on top of statement we have a tendency to predict cloud burst
two or three days before. In this article rainstorm over
Uttarkhand that created a heavy loss has been analysed by kmeans
algorithm |
|
Performance Evaluation of Feature Selection Algorithms in Educational Data Mining |
Author : T.Velmurugan , C. Anuradha |
Abstract | Full Text |
Abstract :Educational Data mining(EDM)is a prominent field
concerned with developing methods for exploring the unique
and increasingly large scale data that come from educational
settings and using those methods to better understand students
in which they learn. It has been proved in various studies and
by the previous study by the authors that data mining
techniques find widespread applications in the educational
decision making process for improving the performance of
students in higher educational institutions. Classification
techniques assumes significant importance in the machine
learning tasks and are mostly employed in the prediction
related problems. In machine learning problems, feature
selection techniques are used to reduce the attributes of the
class variables by removing the redundant and irrelevant
features from the dataset. The aim of this research work is to
compares the performance of various feature selection
techniques is done using WEKA tool in the prediction of
students’ performance in the final semester examination using
different classification algorithms. Particularly J48, Naïve
Bayes, Bayes Net, IBk, OneR, and JRip are used in this
research work. The dataset for the study were collected from
the student’s performance report of a private college in Tamil
Nadu state of India. The effectiveness of various feature
selection algorithms was compared with six classifiers and the
results are discussed. The results of this study shows that the
accuracy of IBK is 99.680% which is found to be high than
other classifiers over the CFS subset evaluator. Also found that
overall accuracy of CFS subset evaluator seems to be high than
other feature selection algorithms. The future work will
concentrate on the implementation of a proposed hybrid
method by considering large dataset collected from many
institutions. |
|
A Review of Edge Detection Techniques for Image Segmentation |
Author : S.Jeyalaksshmi , S . Prasanna |
Abstract | Full Text |
Abstract :: Edge detection is a key stride in Image investigation.
Edges characterize the limits between areas in a image, which
assists with division and article acknowledgment.Edge
discovery is a image preparing method for finding the limits of
articles inside Image. It works by distinguishing irregular in
brilliance and utilized for Image division and information
extraction in zones, for example, Image preparing, PC vision
and Image vision. There are likely more algorithms in a writing
of upgrading and distinguishing edges than whatever other
single subject.In this paper, the principle is to concentrate most
usually utilized edge methods for Image segmentation.
|
|
Genetic Based ID3 Classification Algorithm Diagnosis and Prognosis of Oral Cancer |
Author : K.Jamberi , E.Ramaraj |
Abstract | Full Text |
Abstract :: In order to analyse the chosen data from various
points of view, data mining is used as the effective process.
This process is also used to sum-up all those views into useful
information. There are several types of algorithms in data
mining such as Classification algorithms, Regression,
Segmentation algorithms, association algorithms, sequence
analysis algorithms, etc.,. The classification algorithm can be
usedto bifurcate the data set from the given data set and foretell
one or more discrete variables, based on the other attributes in
the dataset. The ID3 (Iterative Dichotomiser 3) algorithm is an
original data set S as the root node. An unutilised attribute of
the data set S calculates the entropy H(S) (or Information gain
IG (A)) of the attribute. Upon its selection, the attribute should
have the smallest entropy (or largest information gain) value. A
genetic algorithm (GA) is a heuristic quest that imitates the
process of natural selection. Genetic algorithm can easily select
cancer data set, from the given data set using GA operators,
such as mutation, selection, and crossover. A method existed
earlier (KNN+GA) was not successful for oral cancer and
primary tumor. Our method of creating new algorithm
GA+ID3 easily identifiesoral cancer data set from the given
data set. The genetic based ID3 classification algorithm
diagnosis and prognosis of oral cancer data set is identified by
this paper |
|
Efficiency of k-Means and k-Medoids Clustering Algorithms using Lung Cancer Dataset |
Author : A.Dharmarajan , T.Velmurugan |
Abstract | Full Text |
Abstract :– The objective of this research work is focused on
the right cluster creation of lung cancer data and analyzed the
efficiency of k-Means and k-Medoids algorithms. This
research work would help the developers to identify the
characteristics and flow of algorithms. In this research work is
pertinent for the department of oncology in cancer centers.
This implementation helps the oncologist to make decision
with lesser execution time of the algorithm.It is also enhances
the medical care applications. This work is very suitable for
selection of cluster development algorithm for lung cancer data
analysis.Clustering is an important technique in data mining
which is applied in many fields including medical diagnosis to
find diseases. It is the process of grouping data, where
grouping is recognized by discovering similarities between
data based on their features. In this research work, the lung
cancer data is used to find the performance of clustering
algorithms via its computational time. Considering a limited
number attributes of lung cancer data, the algorithmic steps are
applied to get results and compare the performance of
algorithms. The partition based clustering algorithms k-Means
and k-Mediods are selected to analyze the lung cancer data.The
efficiency of both the algorithms is analyzed based on the
results produced by this approach. The finest outcome of the
performance of the algorithm is reported for the chosen data
concept.
|
|
An Approach for ECG Feature Extraction and Classification of Cardiac Abnormalities |
Author : B.S.Viswanath , M.Thangapandian2, R.Ramarajiv |
Abstract | Full Text |
Abstract :This inquires about article presents a unused
approach to the Programmed location and classification of
electrocardiogram (ECG) signals is of tremendous significance
for determination of cardiac anomalies. A strategy is proposed
here to classify distinctive cardiac variations from the norm
like Ventricular Arrythmias, Myocardial infarction,
Myocardial hypertrophy and Valvular heart malady. Support
Vector Machine (SVM) has been utilized to classify the
designs inborn in the highlights extricated through Continuous
Wavelet Transform (CWT) of distinctive ECG signals. CWT
permits a time space flag to be changed into time-frequency
space such that recurrence characteristics and the area of
specific highlights in a time arrangement may be highlighted at
the same time. Hence it permits precise extraction of highlight
from non-stationary signals like ECG. At that point the support
vector machine (SVM) with Gaussian part is utilized to
classify diverse ECG heart cadence. In the display work, SVM
in relapse mode has been effectively applied.
|
|
Finding Influential Healthcare Interventions of Different Socio-Economically and Educationally Segmented Regions by using Data Mining Techniques: Case Study on Nine High Focus States of India |
Author : Partha Saha, Uttam Kumar Banerjee |
Abstract | Full Text |
Abstract :- United Nations at Millennium Summit 2000 made
targets on Under-five Mortality Ratio (U5MR) and Maternal
Mortality Ratio (MMR) for improving health condition of
mothers and children. Though India did not be able to achieve
those targets but have improved significantly. Aim of the study
is to find out influential healthcare interventions of socioeconomically
and educationally different regions which have
high impact on their HIs. At resource constrained condition,
strategic evidence based planning will help healthcare
department to reduce inequity in HIs among different regions.
Data of different HIs has been collected from Family Welfare
Statistics of India 2012 and healthcare interventions have been
collected from District Level Household Survey 3. 192 districts
from ‘Nine High Focus States of India’ have been used as case
study area in this research work. Both hierarchical and kmeans,
clustering techniques have been used for segmenting
192 districts based on their socio-economic and educational
status and decision tree classification technique has been used
for building relationship model for each segment. Total six
decision tree classifiers have been developed for identifying
most influential interventions on Infant Mortality Rate (IMR)
and U5MR. From this work it has become clear that impact of
healthcare interventions on healthcare indicators varies from
region to region. In hilly regions, adolescent interventions had
more impact on U5MR and IMR than child age interventions. |
|
A Survey on Educational Data Mining Techniques |
Author : A.S. Arunachalam , T.Velmurugan |
Abstract | Full Text |
Abstract :- Educational data mining (EDM) creates high impact
in the field of academic domain. The methods used in this topic
are playing a major advanced key role in increasing knowledge
among students. EDM explores and gives ideas in
understanding behavioral patterns of students to choose a
correct path for choosing their carrier. This survey focuses on
such category and it discusses on various techniques involved
in making educational data mining for their knowledge
improvement. Also, it discusses about different types of EDM
tools and techniques in this article. Among the different tools
and techniques, best categories are suggested for real world
usage.
|
|
Image Segmentation Based Survey on the Lung Cancer MRI Images |
Author : S. Perumal1 , T. Velmurugan2 |
Abstract | Full Text |
Abstract :Differentiating cancer affected part in lungs and
giving proper solution to the problem are toughest job in
medical field. Doctors face many problems in correctly
spotting up of cancer affected area in lungs. Image processing
can be a solvent for this type of issues, especially to identify
the cancer affected areas in lungs. Historical data of different
types of lung cancer images are collected and image processing
methods are carried out for the identification of cancer affected
regions in the lungs by the physicians and experts. This
research work carried out a survey on the lung cancer data
analysis done by various researchers. Also, it suggests the best
method and technique applied for the prediction of cancer in
the affected parts of lungs.
|
|
Modified Algorithm for Drift Avoidance in PV System using Neural Network |
Author : K.Geetha , M.Thenmozhi , C.Gowthaman |
Abstract | Full Text |
Abstract :- As the Photovoltaic System uses the solar energy as
one of the renewable energies for the electrical energy
production has an enormous potential. The PV system is
developing very rapidly as compared to its counterparts of the
renewable energies. The DC voltage generated by the PV
system is boosted by the DC-DC Boost converter. The utility
grid is incorporated with the PV Solar Power Generator
through the 3-i PWM DC-AC inverter, whose control is
provided by a constant current controller. This controller uses a
3-i phase locked loop (PLL) for tracking the phase angle of the
utility grid and reacts fast enough to the changes in load or grid
connection states , as a result, it seems to be efficient in
supplying to load the constant voltage without phase jump. An
artificial neuron is a device with many inputs and one output.
By using artificial neuron networks, the control algorithm
implemented in the boost converter enhances by reducing the
response time and hence, transient response for this converter
is improved.
|
|
The Preface Layer for Auditing Sensual Interacts of Primary Distress Concealing through Sensible Cardinal Spectroscopy |
Author : V.Ramesh , G.Arul Dalton |
Abstract | Full Text |
Abstract : Resting anterior brain electrical activity,
self-report measures of Behavioral Approach
System (BAS) and Behavioral Inhibition System
(BIS) strength, and common levels of Positive
Affect (PA) and Negative Affect (NA) were
composed from 46 unselected undergraduates two
split occasions Electroencephalogram (EEG)
measures of prefrontal asymmetry and the selfreport
measures showed excellent internal
reliability, steadiness and tolerable test-retest
stability. Strong connection betweens the
unconstrained facial emotional expressions and the
full of feeling states correlated cerebrum movement.
When seeing dreadful as contrasted with unbiased
faces, members showed larger amounts of actuation
inside the privilege average prefrontal cortex (PFC).
To propose a multimodal method to deal with assess
Efficient Practical near Infrared Spectroscopy
(EPNIS) signals and EEG signals for full of feeling
state identification. Outcomes demonstrate that
proposed technique with EPNIS enhances execution
over EPNIS methodologies. Based on Experimental
evaluations, proposed EPNIS algorithm enhances
Mean Time (MT) 209.15 milliseconds, Standard
Deviation (SD) 101.4 milliseconds and Accuracy
4.45 % of the proposed framework compared than
previous methodologies. |
|
A Clustering Based Collaborative and Pattern based Filtering approach for Big Data Application |
Author : M.Roberts Masillamani , C.Vijayakumar , R.Rajesh |
Abstract | Full Text |
Abstract :With web services developing and aggregating in
application range, benefit revelation has turned into a hot issue
for benefit organization and service management. Service
clustering gives a promising approach to part the entire seeking
space into little areas in order to limit the disclosure time
successfully. In any case, semantic data is a basic component
amid the entire arranging process. Current industrialized Web
Service Portrayal Language (WSPL) does not contain enough
data for benefit depiction. Thusly, a service clustering
technique has been proposed, which upgrades unique WSPL
report with semantic data by methods for Connected Open
Information (COI). Examination based genuine service
information has been performed, and correlation with
comparable techniques has additionally been given to exhibit
the adequacy of the strategy. It is demonstrated that using
semantic data from COI improves the exactness of service
grouping. Furthermore, it shapes a sound base for promote
thorough preparing with semantic data.
|
|
Feature Based Underwater Fish Recognition Using SVM Classifier |
Author : Sunil Kumar C , Umagowri R , Elangovan M |
Abstract | Full Text |
Abstract :: An approach for underwater fish recognition
based on wavelet transform is presented in this paper.
This approach decomposes the input image into subbands
by using the multi resolutional analysis known as
Discrete Wavelet Transform (DWT). As each sub-band
in the decomposed image contains useful information
about the image, the mean values of every sub-band are
assumed as features. This approach is tested on
Underwater Photography - A Fish Database. The
database contains 7953 pictures of 1458 different species.
The database is considered for the classification based on
Support Vector machine (SVM) classifier. The result
shows that maximum recognition accuracy of 90.74% is
achieved by the wavelet features.
|
|
Hadoop and Hive Inspecting Maintenance of Mobile Application for Groceries Expenditure |
Author : Rao P N , Logeshwari V , Loganathan R |
Abstract | Full Text |
Abstract :Numerous movable applications on secure groceries
expenditure and e-health have designed recently. Health aware
clients respect such applications for secure groceries
expenditure, particularly to avoid irritating groceries and added
substances. However, there is the lack of a complete database
including organized or unstructured information to help such
applications. In the paper propose the Multiple Scoring
Frameworks (MSF), a healthy groceries expenditure search
service for movable applications using Hadoop and
MapReduce (MR). The MSF works in a procedure behind a
portable application to give a search service for data on
groceries and groceries added substances. MSF works with
similar logic from a web search engine (WSE) and it crawls
over Web sources cataloguing important data for possible
utilize in reacting to questions from movable applications.
MSF outline and advancement are featured in the paper during
its framework design, inquiry understanding, its utilization of
the Hadoop/MapReduce infrastructure, and activity contents. |
|