SlideShare a Scribd company logo
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014
DOI : 10.5121/ijdkp.2014.4405 55
APPLICATION OF DATA MINING TOOLS FOR
SELECTED SCRIPTS OF STOCK MARKET
K. S. Mahajan1
and Dr. R. V. Kulkarni2
1
Research student, Chh. Shahu Institute of Business Education and Research Center,
Kolhapur, India
2
Professor and HOD, Chh. Shahu Institute of Business Education and Research Center,
Kolhapur, India
ABSTRACT
One of the most important problems in modern finance is finding efficient ways to summarize and visualize
the stock market data to give individuals or institutions useful information about the market behavior for
investment decisions Therefore, Investment can be considered as one of the fundamental pillars of national
economy. So, at the present time many investors look to find criterion to compare stocks together and
selecting the best and also investors choose strategies that maximize the earning value of the investment
process. Therefore the enormous amount of valuable data generated by the stock market has attracted
researchers to explore this problem domain using different methodologies. Therefore research in data
mining has gained a high attraction due to the importance of its applications and the increasing generation
information. So, Data mining tools such as association rule, rule induction method and Apriori algorithm
techniques are used to find association between different scripts of stock market, and also much of the
research and development has taken place regarding the reasons for fluctuating Indian stock exchange.
But, now days there are two important factors such as gold prices and US Dollar Prices are more
dominating on Indian Stock Market and to find out the correlation between gold prices, dollar prices and
BSE index statistical correlation is used and this helps the activities of stock operators, brokers, investors
and jobbers. They are based on the forecasting the fluctuation of index share prices, gold prices, dollar
prices and transactions of customers. Hence researcher has considered these problems as a topic for
research.
KEYWORDS
Stock Market, Association Rules, Rule Induction Methods, Apriori Algorithm, Correlation, Data Mining.
1. INTRODUCTION
Data mining, the science and technology of exploring data in order to discover previously
unknown patterns, is a part of the overall process of knowledge discovery in databases (KDD). In
today’s computer-driven world, these databases contain massive quantities of information. The
accessibility of this information makes data mining important and necessary. Data mining often
can improve existing models by finding additional, important variables, indentifying interaction
terms and detecting nonlinear relationships.
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014
56
Financial institutions such as stock markets produce huge datasets that build a foundation for
approaching these enormously complex and dynamic problems with data mining tools. Potential
significant benefits of solving these problems motivated extensive research for years. Specifics
of data mining in finance are coming from the need to accommodate specific efficiency criteria
(e.g., the maximum of trading profit) to prediction accuracy, coordinated multiresolution forecast
(minutes, days, weeks, months, and years), Be able to benefit from very subtle patterns with a
short life time, and incorporate the impact of market players on market regularities , Impact of
gold and US dollar prices on stock market and also to find association between different scripts
of stock market which helps investors to earn more profit.
The techniques that are used in this project are:
1. Association rules
2. Apirori algorithm
3. Rule induction Method
4. Statistical Correlation
1.1 Association Rule:
Unlike the other data mining functions, association is transaction based. In transaction
processing, a case consists of a transactions such as a market basket analysis. The collection of
items in the transaction is a multi- record attributes.
Association rules are IF/THEN Statements.
Example: “if a customer purchases Infosys Ltd, Then customer also purchases Wipro Ltd with
60% confidence”.
An association rule has two parts, an antecedent (if) and a consequent (then), an antecedent is an
item found in the data. A consequent is an item that is found in combination with the antecedent.
Association Rule is created by analyzing data for frequent IF/THEN patterns & and using the
criteria Support & Confidence to identify the most important relationships. Support and
Confidence are two measures of association rule.
Association Rule take following form x=>y, where x and y are the sets of items. The goal is to
discover all the rules that have the Support & Confidence greater than or equal to the minimum
support and minimum confidence respectively.
Steps To Generate Association Rules:
1. Generate all possible association rules.
2. Compute the support and confidence of all possible association rules.
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014
57
3. Apply two threshold criteria minimum support and minimum confidence to obtain
association rule.
4. Minimum support and minimum confidence is taken as an average of all the calculated
support and calculated confidence.
5. If the calculated support and confidence is greater than or equal to the minimum support
and minimum confidence then these items are said to be associated with each other by
association rule.
SUPPORT: The Support of a rule indicates how frequently the item in the rule occurs together.
Example: Dr.Reddy’s lab and Cipla Ltd might appear together in 10% of the transaction.
Support is calculated as below:
Support (x=>y) = (Number of transaction Containing x&y) / (Total Number of transaction).
CONFIDENCE:
Confidence is the number of times the IF/THEN statements have been found to be true. The
confidence of a rule indicates the probability of both the antecedent and the consequent appearing
in the same transaction.
Example: Dr.Reddy’s lab might appear in 20 transactions, 10 of the 20 might also include Cipla
Ltd.
Therefore Dr.Reddy’s Lab implies Cipla Ltd with 67% confidence.
And Confidence is calculated as below:
Confidence(x->y) = [Support(x->y)] / [Support of x].
Example: Association Rules from BSE SENSEX, Here Researcher has selected sector wise
scripts for the calculation of association between the same sector scripts:
Pharmaceuticals Sector:
From BSE SENSEX researcher has selected
Cipla Ltd, Dr.Reddy’s Lab, SunPharma India Ltd, Glenmark Ltd, Orchid Chemicals Ltd to
calculate association between these same sector scripts.
Here minimum support is the average of all the calculated support.
And the MINIMUM SUPPORT: sum of support / total number of scripts
=56/10
=5.6 %
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014
58
So, minimum support is 5.6%
MINIMUM CONFIDENCE: sum of confidence / total number of scripts
= 350 / 10
=35%
So minimum confidence is 35%.
Researcher applied the above rule to calculate min.Support and min.Confidence to obtain result
for other sector scripts.
So from the above data analysis researcher can conclude that Cipla ltd and Dr.Reddy’s Ltd go
hand in hand and also Dr.reddy’s lab And Sun Pharma India Ltd goes hand in hand. So researcher
can say these scripts are strongly associated with each other.
2. APRIORI ALGORITHM:
Apriori is a classical algorithm and is designed to operate on databases containing transactions.
The theory of Apriori algorithm is that “All nonempty subsets of a frequent item set must also be
frequent.”
Apriori principle can be shown as below:
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014
59
For all(x, y) :( x belongs to y) => s(x)>=s(y)
i.e. support of an item set never exceeds the support of its subsets. This property is also known as
monotone property of support. Algorithm is used to mine the frequent item sets.
Apriori Algorithm is as follows:
– Let K=1.
– Generate frequent item sets of length l
– Repeat until no frequent item sets are identified.
Example:
Support count (Dr.Reddy’s pharma lab Ltd) = No of transactions containing Dr. Reddy’s Pharma
ltd = 18.
3. RULE INDUCTION TECHNIQUE
Rule induction technique retrieves all interesting patterns from database.
In rule induction technique, the rule if of “if this then this”. For example a rule that a stock
market might find in their data collected from market transaction report would be: “if Reliance
Industries Ltd script is purchased then Oil and Natural Gas Corporation is purchased”.
or If Tata steel then SAIL
If Mahindra then Hindustan motors
In order for the rules to be useful there are two pieces on information that must be supplied as
well as the actual rule:
Accuracy- How often is the rule correct?
Coverage- How often does the rule apply?
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014
60
From the above observation we conclude that Dr.Reddy’s Lab and Sun Pharma India Ltd are
associated with each other as these satisfies both the minimum accuracy= 34 and minimum
coverage=16.
So the rule is true. So, when the customer purchases Dr.Reddy’s lab customer will also go for
Sun Pharma India Ltd with 66% Accuracy. So these are strongly associated with each other.
4. CORRELATION:
To find out the impact of fluctuating gold prices and BSE sensex and the impact of dollar prices
and BSE sensex from 2008 to 2013 researcher has used a statistical formula coefficient of
correlation.
The mathematical formula for computing r is:
r = n ∑xy – (∑x)(∑y) / √ n(∑ x2
) – ( ∑ x ) 2
√ n ( ∑ y2
) – ( ∑ y )2
Where x and y are the sample means of X and Y, and sx and sy are the sample standard deviations
of X and Y.
If x and y are results of measurements that contain measurement error, the realistic limits on the
correlation coefficient are not −1 to +1 but a smaller range.
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014
61
The value of r is such that -1 < r < +1. The + and - signs are used for positive linear correlation
and negative linear correlations, respectively.
1. Positive correlation: If x and y have a strong positive linear correlation, r is close to
+1. An r value of exactly +1 indicates a perfect positive fit. Positive values indicate a
relationship between x and y variables such that as values for x increase, values for y also
increase.
2. Negative correlation: If x and y have a strong negative linear correlation, r is close to -
1. An r value of exactly -1 indicates a perfect negative fit. Negative values indicate a
relationship between x and y such that as values for x increase, values for y decrease.
3. No correlation: If there is no linear correlation or a weak linear correlation, r is close to
0. A value near zero means that there is a random, nonlinear relationship between the
two variables.
4. Note that r is a dimensionless quantity; that is; it does not depend on the units employed.
5. A perfect correlation of ± 1 occurs only when the data points all lie exactly on a straight
line. If r = +1, the slope of this line is positive. If r = -1, the slope of this line is negative.
6. A correlation greater than 0.8 is generally described as strong, whereas a correlation less
than 0.5 are generally described as weak. These values can vary based upon the “type"
of data being examined. A study utilizing scientific data may require a stronger
correlation than a study using social science data.
Impact of gold prices on stock market:
According to Indian scenario, Indian culture and tradition majority of the Indian women would
like to invest in gold because of their tradition and their liking for gold. This leads to invest in
gold because of its nature of keeping value, low risk, and as India is having parallel economy
there are no any rules or fix criteria for investing in gold. So Indian people feel more beneficial to
invest in gold therefore, Gold is having more impact on Equity Market.
EXAMPLE: Table shows the correlations between GOLD and BSE SENSEX from January 2008
to august 2013
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014
62
Table shows the correlation between US DOLLAR and BSE SENSEX from January 2008 to
august 2013:
Currency market launched in 1999. 4000cr is daily turnover of the exchange because of which
currency market became strong. And the more popular and effective currency is US Dollar. As
compare to Equity, Dollar fluctuates slowly and more effective to the investors which leads to
investors to prefer investing in currency market. Dollar is not only important for investment
purpose but every countries financial strategy for planning to balance their currency with dollar
for good economic results which leads to become dollar stronger. Therefore dollar is having more
impact on equity market from last few years.
International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014
63
5. CONCLUSION
An Association between selected scripts of Indian Stock Market and a correlation between Indian
Gold Prices and BSE SENSEX INDEX and Dollar Prices and BSE SENSEX INDEX has been
described. In this paper Data Mining Tools such as Association Rule, Apriori Algorithm, and
Rule Induction Methods are used for Association of Indian Stock market in order to find out
which scripts are much associated with each other. Results shows that sector wise scripts are
much associated with each other which helps investors, brokers, jobbers for investment decision.
Statistical Correlation result shows that, Gold Prices and Dollar Prices has an impact on Indian
Stock Market.
REFERENCES
[1] Alex Berson and Stephen j. Smith, “Data Warehousing, Data Mining, and OLAP”, MC Graw Hill,
1997.
[2] A. D. Devale and Dr. R. V. Kulkarni, “Application Of Data Mining Techniques In Life Insurance”,
International Journal Of Data Mining and Knowledge Management Process Vol.2. No.4, July 2012.
[3] Arun. K. Pujari, “Data Mining Techniques”, Universities Press (India) PVT Ltd, 2001.
[4] C.R. Kothari, “Research Methodology: Methods and Techniques”, New Age International (p) Ltd,
2004.
[5] Chengqi Zhang, Shichao Zhang, “Association Rule Mining: Models and Algorithm, Springer, 2002.
[6] David Cheung, Vincent T., Ada W. Fu and Yongjian Fv, “Efficient Mining of Association Rules in
Distributed Databases”, IEEE, 1996.
[7] J. Date, “An Introduction to Database Systems”, Addition Wesley longman, Seven Edition, 2000.
[8] J.K. Sharma, “Business Statistics” Pearson Education, 2008.
[9] Ken Orr, “Data Warehousing Technology”, Copyright. The Ken Or Institute, 1997.
[10] Krzysztof J. Cios, Witold Pedryez and Roman W. Surniarski, “Data Mining Methods for Knowledge
Discovery”, Kluwer Academic Publishers 1998 Second Printing 2000.
[11] L. M. Bhole, “Financial Institutions and Markets: Structure, Growth and Innovation, MC Graw Hill,
2006.
[12] Ming-Syan chen, Jiawei Han and Philip S. Yu, “Data Mining: An Overview From a Database
Perspective”, IEEE Transactions on Knowledge and Data Engineering Vol. 8, No. 6, Dec. 1996.
[13] NSE’s Certification In Financial Markets, National Stock Exchange of India ltd
Ad

More Related Content

What's hot (20)

Data mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updatedData mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updated
Yugal Kumar
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
IJCSES Journal
 
A unified approach for spatial data query
A unified approach for spatial data queryA unified approach for spatial data query
A unified approach for spatial data query
IJDKP
 
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITYSOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
IJDKP
 
Enhancing the labelling technique of
Enhancing the labelling technique ofEnhancing the labelling technique of
Enhancing the labelling technique of
IJDKP
 
Z36149154
Z36149154Z36149154
Z36149154
IJERA Editor
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection method
IJSRD
 
The 8 Step Data Mining Process
The 8 Step Data Mining ProcessThe 8 Step Data Mining Process
The 8 Step Data Mining Process
Marc Berman
 
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
IJDKP
 
4113ijaia09
4113ijaia094113ijaia09
4113ijaia09
Rajkishorepanda
 
GCUBE INDEXING
GCUBE INDEXINGGCUBE INDEXING
GCUBE INDEXING
IJDKP
 
The International Journal of Engineering and Science
The International Journal of Engineering and ScienceThe International Journal of Engineering and Science
The International Journal of Engineering and Science
theijes
 
V2 i9 ijertv2is90699-1
V2 i9 ijertv2is90699-1V2 i9 ijertv2is90699-1
V2 i9 ijertv2is90699-1
warishali570
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data mining
Er. Nawaraj Bhandari
 
Data Mining In Market Research
Data Mining In Market ResearchData Mining In Market Research
Data Mining In Market Research
kevinlan
 
Data mining: Classification and prediction
Data mining: Classification and predictionData mining: Classification and prediction
Data mining: Classification and prediction
DataminingTools Inc
 
01 Introduction to Data Mining
01 Introduction to Data Mining01 Introduction to Data Mining
01 Introduction to Data Mining
Valerii Klymchuk
 
Data Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File ManualData Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File Manual
Nitin Bhasin
 
Review on: Techniques for Predicting Frequent Items
Review on: Techniques for Predicting Frequent ItemsReview on: Techniques for Predicting Frequent Items
Review on: Techniques for Predicting Frequent Items
vivatechijri
 
A Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association MiningA Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association Mining
IOSR Journals
 
Data mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updatedData mining and data warehouse lab manual updated
Data mining and data warehouse lab manual updated
Yugal Kumar
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
IJCSES Journal
 
A unified approach for spatial data query
A unified approach for spatial data queryA unified approach for spatial data query
A unified approach for spatial data query
IJDKP
 
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITYSOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
IJDKP
 
Enhancing the labelling technique of
Enhancing the labelling technique ofEnhancing the labelling technique of
Enhancing the labelling technique of
IJDKP
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection method
IJSRD
 
The 8 Step Data Mining Process
The 8 Step Data Mining ProcessThe 8 Step Data Mining Process
The 8 Step Data Mining Process
Marc Berman
 
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
IJDKP
 
GCUBE INDEXING
GCUBE INDEXINGGCUBE INDEXING
GCUBE INDEXING
IJDKP
 
The International Journal of Engineering and Science
The International Journal of Engineering and ScienceThe International Journal of Engineering and Science
The International Journal of Engineering and Science
theijes
 
V2 i9 ijertv2is90699-1
V2 i9 ijertv2is90699-1V2 i9 ijertv2is90699-1
V2 i9 ijertv2is90699-1
warishali570
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data mining
Er. Nawaraj Bhandari
 
Data Mining In Market Research
Data Mining In Market ResearchData Mining In Market Research
Data Mining In Market Research
kevinlan
 
Data mining: Classification and prediction
Data mining: Classification and predictionData mining: Classification and prediction
Data mining: Classification and prediction
DataminingTools Inc
 
01 Introduction to Data Mining
01 Introduction to Data Mining01 Introduction to Data Mining
01 Introduction to Data Mining
Valerii Klymchuk
 
Data Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File ManualData Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File Manual
Nitin Bhasin
 
Review on: Techniques for Predicting Frequent Items
Review on: Techniques for Predicting Frequent ItemsReview on: Techniques for Predicting Frequent Items
Review on: Techniques for Predicting Frequent Items
vivatechijri
 
A Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association MiningA Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association Mining
IOSR Journals
 

Viewers also liked (20)

Comparison between riss and dcharm for mining gene expression data
Comparison between riss and dcharm for mining gene expression dataComparison between riss and dcharm for mining gene expression data
Comparison between riss and dcharm for mining gene expression data
IJDKP
 
EFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATION
EFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATIONEFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATION
EFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATION
IJDKP
 
IMBALANCED DATA LEARNING APPROACHES REVIEW
IMBALANCED DATA LEARNING APPROACHES REVIEWIMBALANCED DATA LEARNING APPROACHES REVIEW
IMBALANCED DATA LEARNING APPROACHES REVIEW
IJDKP
 
Applying the apriori algorithm for investigating the associations between dem...
Applying the apriori algorithm for investigating the associations between dem...Applying the apriori algorithm for investigating the associations between dem...
Applying the apriori algorithm for investigating the associations between dem...
IJDKP
 
Dormancy prediction model in a
Dormancy prediction model in aDormancy prediction model in a
Dormancy prediction model in a
IJDKP
 
ONTOLOGY INTEGRATION APPROACHES AND ITS IMPACT ON TEXT CATEGORIZATION
ONTOLOGY INTEGRATION APPROACHES AND ITS IMPACT ON TEXT CATEGORIZATIONONTOLOGY INTEGRATION APPROACHES AND ITS IMPACT ON TEXT CATEGORIZATION
ONTOLOGY INTEGRATION APPROACHES AND ITS IMPACT ON TEXT CATEGORIZATION
IJDKP
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithms
IJDKP
 
Gv index scientific contribution rating index that takes into account the gro...
Gv index scientific contribution rating index that takes into account the gro...Gv index scientific contribution rating index that takes into account the gro...
Gv index scientific contribution rating index that takes into account the gro...
IJDKP
 
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERINGA FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING
IJDKP
 
The Next Alternative: Private Equity Asset Class Summary
The Next Alternative: Private Equity Asset Class SummaryThe Next Alternative: Private Equity Asset Class Summary
The Next Alternative: Private Equity Asset Class Summary
State Street
 
The Innovator’s Journey: Asset Owners Insights
The Innovator’s Journey: Asset Owners Insights The Innovator’s Journey: Asset Owners Insights
The Innovator’s Journey: Asset Owners Insights
State Street
 
A novel algorithm for mining closed sequential patterns
A novel algorithm for mining closed sequential patternsA novel algorithm for mining closed sequential patterns
A novel algorithm for mining closed sequential patterns
IJDKP
 
Relative parameter quantification in data
Relative parameter quantification in dataRelative parameter quantification in data
Relative parameter quantification in data
IJDKP
 
Study on body fat density prediction
Study on body fat density predictionStudy on body fat density prediction
Study on body fat density prediction
IJDKP
 
Confidential data identification using
Confidential data identification usingConfidential data identification using
Confidential data identification using
IJDKP
 
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
IJDKP
 
A data mining approach to predict
A data mining approach to predictA data mining approach to predict
A data mining approach to predict
IJDKP
 
Evaluation of rule extraction algorithms
Evaluation of rule extraction algorithmsEvaluation of rule extraction algorithms
Evaluation of rule extraction algorithms
IJDKP
 
Content based indexing of music
Content based indexing of musicContent based indexing of music
Content based indexing of music
IJDKP
 
Il Salone del Risparmio - Presentation
Il Salone del Risparmio - PresentationIl Salone del Risparmio - Presentation
Il Salone del Risparmio - Presentation
State Street
 
Comparison between riss and dcharm for mining gene expression data
Comparison between riss and dcharm for mining gene expression dataComparison between riss and dcharm for mining gene expression data
Comparison between riss and dcharm for mining gene expression data
IJDKP
 
EFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATION
EFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATIONEFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATION
EFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATION
IJDKP
 
IMBALANCED DATA LEARNING APPROACHES REVIEW
IMBALANCED DATA LEARNING APPROACHES REVIEWIMBALANCED DATA LEARNING APPROACHES REVIEW
IMBALANCED DATA LEARNING APPROACHES REVIEW
IJDKP
 
Applying the apriori algorithm for investigating the associations between dem...
Applying the apriori algorithm for investigating the associations between dem...Applying the apriori algorithm for investigating the associations between dem...
Applying the apriori algorithm for investigating the associations between dem...
IJDKP
 
Dormancy prediction model in a
Dormancy prediction model in aDormancy prediction model in a
Dormancy prediction model in a
IJDKP
 
ONTOLOGY INTEGRATION APPROACHES AND ITS IMPACT ON TEXT CATEGORIZATION
ONTOLOGY INTEGRATION APPROACHES AND ITS IMPACT ON TEXT CATEGORIZATIONONTOLOGY INTEGRATION APPROACHES AND ITS IMPACT ON TEXT CATEGORIZATION
ONTOLOGY INTEGRATION APPROACHES AND ITS IMPACT ON TEXT CATEGORIZATION
IJDKP
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithms
IJDKP
 
Gv index scientific contribution rating index that takes into account the gro...
Gv index scientific contribution rating index that takes into account the gro...Gv index scientific contribution rating index that takes into account the gro...
Gv index scientific contribution rating index that takes into account the gro...
IJDKP
 
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERINGA FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING
IJDKP
 
The Next Alternative: Private Equity Asset Class Summary
The Next Alternative: Private Equity Asset Class SummaryThe Next Alternative: Private Equity Asset Class Summary
The Next Alternative: Private Equity Asset Class Summary
State Street
 
The Innovator’s Journey: Asset Owners Insights
The Innovator’s Journey: Asset Owners Insights The Innovator’s Journey: Asset Owners Insights
The Innovator’s Journey: Asset Owners Insights
State Street
 
A novel algorithm for mining closed sequential patterns
A novel algorithm for mining closed sequential patternsA novel algorithm for mining closed sequential patterns
A novel algorithm for mining closed sequential patterns
IJDKP
 
Relative parameter quantification in data
Relative parameter quantification in dataRelative parameter quantification in data
Relative parameter quantification in data
IJDKP
 
Study on body fat density prediction
Study on body fat density predictionStudy on body fat density prediction
Study on body fat density prediction
IJDKP
 
Confidential data identification using
Confidential data identification usingConfidential data identification using
Confidential data identification using
IJDKP
 
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
IJDKP
 
A data mining approach to predict
A data mining approach to predictA data mining approach to predict
A data mining approach to predict
IJDKP
 
Evaluation of rule extraction algorithms
Evaluation of rule extraction algorithmsEvaluation of rule extraction algorithms
Evaluation of rule extraction algorithms
IJDKP
 
Content based indexing of music
Content based indexing of musicContent based indexing of music
Content based indexing of music
IJDKP
 
Il Salone del Risparmio - Presentation
Il Salone del Risparmio - PresentationIl Salone del Risparmio - Presentation
Il Salone del Risparmio - Presentation
State Street
 
Ad

Similar to Application of data mining tools for (20)

Data Mining For Supermarket Sale Analysis Using Association Rule
Data Mining For Supermarket Sale Analysis Using Association RuleData Mining For Supermarket Sale Analysis Using Association Rule
Data Mining For Supermarket Sale Analysis Using Association Rule
ijtsrd
 
IRJET- Minning Frequent Patterns,Associations and Correlations
IRJET-  	  Minning Frequent Patterns,Associations and CorrelationsIRJET-  	  Minning Frequent Patterns,Associations and Correlations
IRJET- Minning Frequent Patterns,Associations and Correlations
IRJET Journal
 
Paper id 212014126
Paper id 212014126Paper id 212014126
Paper id 212014126
IJRAT
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
researchinventy
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
researchinventy
 
PROJECT-109,93.pdf data miiining project
PROJECT-109,93.pdf data miiining projectPROJECT-109,93.pdf data miiining project
PROJECT-109,93.pdf data miiining project
sampathkumarkorada
 
Data Mining based on Hashing Technique
Data Mining based on Hashing TechniqueData Mining based on Hashing Technique
Data Mining based on Hashing Technique
ijtsrd
 
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
ijsrd.com
 
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASESBINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
IJDKP
 
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
IJDKP
 
prediction using data mining.pdf
prediction using data mining.pdfprediction using data mining.pdf
prediction using data mining.pdf
NavAhmed3
 
Ae32208215
Ae32208215Ae32208215
Ae32208215
IJERA Editor
 
Developing-a-Clustering-Model-based-on-K-Means-Algorithm-in-order-to-Creating...
Developing-a-Clustering-Model-based-on-K-Means-Algorithm-in-order-to-Creating...Developing-a-Clustering-Model-based-on-K-Means-Algorithm-in-order-to-Creating...
Developing-a-Clustering-Model-based-on-K-Means-Algorithm-in-order-to-Creating...
saeed ghoreyshi
 
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
IOSR Journals
 
50120140503005
5012014050300550120140503005
50120140503005
IAEME Publication
 
Glossary
GlossaryGlossary
Glossary
asfawm
 
Data Mining Apriori Algorithm Implementation using R
Data Mining Apriori Algorithm Implementation using RData Mining Apriori Algorithm Implementation using R
Data Mining Apriori Algorithm Implementation using R
IRJET Journal
 
V34132136
V34132136V34132136
V34132136
IJERA Editor
 
Association rule mining and Apriori algorithm
Association rule mining and Apriori algorithmAssociation rule mining and Apriori algorithm
Association rule mining and Apriori algorithm
hina firdaus
 
KIT-601 Lecture Notes-UNIT-4.pdf Frequent Itemsets and Clustering
KIT-601 Lecture Notes-UNIT-4.pdf Frequent Itemsets and ClusteringKIT-601 Lecture Notes-UNIT-4.pdf Frequent Itemsets and Clustering
KIT-601 Lecture Notes-UNIT-4.pdf Frequent Itemsets and Clustering
Dr. Radhey Shyam
 
Data Mining For Supermarket Sale Analysis Using Association Rule
Data Mining For Supermarket Sale Analysis Using Association RuleData Mining For Supermarket Sale Analysis Using Association Rule
Data Mining For Supermarket Sale Analysis Using Association Rule
ijtsrd
 
IRJET- Minning Frequent Patterns,Associations and Correlations
IRJET-  	  Minning Frequent Patterns,Associations and CorrelationsIRJET-  	  Minning Frequent Patterns,Associations and Correlations
IRJET- Minning Frequent Patterns,Associations and Correlations
IRJET Journal
 
Paper id 212014126
Paper id 212014126Paper id 212014126
Paper id 212014126
IJRAT
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
researchinventy
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
researchinventy
 
PROJECT-109,93.pdf data miiining project
PROJECT-109,93.pdf data miiining projectPROJECT-109,93.pdf data miiining project
PROJECT-109,93.pdf data miiining project
sampathkumarkorada
 
Data Mining based on Hashing Technique
Data Mining based on Hashing TechniqueData Mining based on Hashing Technique
Data Mining based on Hashing Technique
ijtsrd
 
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
ijsrd.com
 
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASESBINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
IJDKP
 
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
BINARY DECISION TREE FOR ASSOCIATION RULES MINING IN INCREMENTAL DATABASES
IJDKP
 
prediction using data mining.pdf
prediction using data mining.pdfprediction using data mining.pdf
prediction using data mining.pdf
NavAhmed3
 
Developing-a-Clustering-Model-based-on-K-Means-Algorithm-in-order-to-Creating...
Developing-a-Clustering-Model-based-on-K-Means-Algorithm-in-order-to-Creating...Developing-a-Clustering-Model-based-on-K-Means-Algorithm-in-order-to-Creating...
Developing-a-Clustering-Model-based-on-K-Means-Algorithm-in-order-to-Creating...
saeed ghoreyshi
 
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
An Optimal Approach to derive Disjunctive Positive and Negative Rules from As...
IOSR Journals
 
Glossary
GlossaryGlossary
Glossary
asfawm
 
Data Mining Apriori Algorithm Implementation using R
Data Mining Apriori Algorithm Implementation using RData Mining Apriori Algorithm Implementation using R
Data Mining Apriori Algorithm Implementation using R
IRJET Journal
 
Association rule mining and Apriori algorithm
Association rule mining and Apriori algorithmAssociation rule mining and Apriori algorithm
Association rule mining and Apriori algorithm
hina firdaus
 
KIT-601 Lecture Notes-UNIT-4.pdf Frequent Itemsets and Clustering
KIT-601 Lecture Notes-UNIT-4.pdf Frequent Itemsets and ClusteringKIT-601 Lecture Notes-UNIT-4.pdf Frequent Itemsets and Clustering
KIT-601 Lecture Notes-UNIT-4.pdf Frequent Itemsets and Clustering
Dr. Radhey Shyam
 
Ad

Recently uploaded (20)

UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptxUiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
anabulhac
 
How Top Companies Benefit from Outsourcing
How Top Companies Benefit from OutsourcingHow Top Companies Benefit from Outsourcing
How Top Companies Benefit from Outsourcing
Nascenture
 
OpenAI Just Announced Codex: A cloud engineering agent that excels in handlin...
OpenAI Just Announced Codex: A cloud engineering agent that excels in handlin...OpenAI Just Announced Codex: A cloud engineering agent that excels in handlin...
OpenAI Just Announced Codex: A cloud engineering agent that excels in handlin...
SOFTTECHHUB
 
IT488 Wireless Sensor Networks_Information Technology
IT488 Wireless Sensor Networks_Information TechnologyIT488 Wireless Sensor Networks_Information Technology
IT488 Wireless Sensor Networks_Information Technology
SHEHABALYAMANI
 
Master Data Management - Enterprise Application Integration
Master Data Management - Enterprise Application IntegrationMaster Data Management - Enterprise Application Integration
Master Data Management - Enterprise Application Integration
Sherif Rasmy
 
Agentic Automation - Delhi UiPath Community Meetup
Agentic Automation - Delhi UiPath Community MeetupAgentic Automation - Delhi UiPath Community Meetup
Agentic Automation - Delhi UiPath Community Meetup
Manoj Batra (1600 + Connections)
 
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
Lorenzo Miniero
 
Slack like a pro: strategies for 10x engineering teams
Slack like a pro: strategies for 10x engineering teamsSlack like a pro: strategies for 10x engineering teams
Slack like a pro: strategies for 10x engineering teams
Nacho Cougil
 
Config 2025 presentation recap covering both days
Config 2025 presentation recap covering both daysConfig 2025 presentation recap covering both days
Config 2025 presentation recap covering both days
TrishAntoni1
 
Building a research repository that works by Clare Cady
Building a research repository that works by Clare CadyBuilding a research repository that works by Clare Cady
Building a research repository that works by Clare Cady
UXPA Boston
 
accessibility Considerations during Design by Rick Blair, Schneider Electric
accessibility Considerations during Design by Rick Blair, Schneider Electricaccessibility Considerations during Design by Rick Blair, Schneider Electric
accessibility Considerations during Design by Rick Blair, Schneider Electric
UXPA Boston
 
Kit-Works Team Study_아직도 Dockefile.pdf_김성호
Kit-Works Team Study_아직도 Dockefile.pdf_김성호Kit-Works Team Study_아직도 Dockefile.pdf_김성호
Kit-Works Team Study_아직도 Dockefile.pdf_김성호
Wonjun Hwang
 
Sustainable_Development_Goals_INDIANWraa
Sustainable_Development_Goals_INDIANWraaSustainable_Development_Goals_INDIANWraa
Sustainable_Development_Goals_INDIANWraa
03ANMOLCHAURASIYA
 
IT484 Cyber Forensics_Information Technology
IT484 Cyber Forensics_Information TechnologyIT484 Cyber Forensics_Information Technology
IT484 Cyber Forensics_Information Technology
SHEHABALYAMANI
 
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More MachinesRefactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Leon Anavi
 
May Patch Tuesday
May Patch TuesdayMay Patch Tuesday
May Patch Tuesday
Ivanti
 
Computer Systems Quiz Presentation in Purple Bold Style (4).pdf
Computer Systems Quiz Presentation in Purple Bold Style (4).pdfComputer Systems Quiz Presentation in Purple Bold Style (4).pdf
Computer Systems Quiz Presentation in Purple Bold Style (4).pdf
fizarcse
 
Top 5 Qualities to Look for in Salesforce Partners in 2025
Top 5 Qualities to Look for in Salesforce Partners in 2025Top 5 Qualities to Look for in Salesforce Partners in 2025
Top 5 Qualities to Look for in Salesforce Partners in 2025
Damco Salesforce Services
 
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
ICT Frame Magazine Pvt. Ltd.
 
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Alan Dix
 
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptxUiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
anabulhac
 
How Top Companies Benefit from Outsourcing
How Top Companies Benefit from OutsourcingHow Top Companies Benefit from Outsourcing
How Top Companies Benefit from Outsourcing
Nascenture
 
OpenAI Just Announced Codex: A cloud engineering agent that excels in handlin...
OpenAI Just Announced Codex: A cloud engineering agent that excels in handlin...OpenAI Just Announced Codex: A cloud engineering agent that excels in handlin...
OpenAI Just Announced Codex: A cloud engineering agent that excels in handlin...
SOFTTECHHUB
 
IT488 Wireless Sensor Networks_Information Technology
IT488 Wireless Sensor Networks_Information TechnologyIT488 Wireless Sensor Networks_Information Technology
IT488 Wireless Sensor Networks_Information Technology
SHEHABALYAMANI
 
Master Data Management - Enterprise Application Integration
Master Data Management - Enterprise Application IntegrationMaster Data Management - Enterprise Application Integration
Master Data Management - Enterprise Application Integration
Sherif Rasmy
 
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
Lorenzo Miniero
 
Slack like a pro: strategies for 10x engineering teams
Slack like a pro: strategies for 10x engineering teamsSlack like a pro: strategies for 10x engineering teams
Slack like a pro: strategies for 10x engineering teams
Nacho Cougil
 
Config 2025 presentation recap covering both days
Config 2025 presentation recap covering both daysConfig 2025 presentation recap covering both days
Config 2025 presentation recap covering both days
TrishAntoni1
 
Building a research repository that works by Clare Cady
Building a research repository that works by Clare CadyBuilding a research repository that works by Clare Cady
Building a research repository that works by Clare Cady
UXPA Boston
 
accessibility Considerations during Design by Rick Blair, Schneider Electric
accessibility Considerations during Design by Rick Blair, Schneider Electricaccessibility Considerations during Design by Rick Blair, Schneider Electric
accessibility Considerations during Design by Rick Blair, Schneider Electric
UXPA Boston
 
Kit-Works Team Study_아직도 Dockefile.pdf_김성호
Kit-Works Team Study_아직도 Dockefile.pdf_김성호Kit-Works Team Study_아직도 Dockefile.pdf_김성호
Kit-Works Team Study_아직도 Dockefile.pdf_김성호
Wonjun Hwang
 
Sustainable_Development_Goals_INDIANWraa
Sustainable_Development_Goals_INDIANWraaSustainable_Development_Goals_INDIANWraa
Sustainable_Development_Goals_INDIANWraa
03ANMOLCHAURASIYA
 
IT484 Cyber Forensics_Information Technology
IT484 Cyber Forensics_Information TechnologyIT484 Cyber Forensics_Information Technology
IT484 Cyber Forensics_Information Technology
SHEHABALYAMANI
 
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More MachinesRefactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Leon Anavi
 
May Patch Tuesday
May Patch TuesdayMay Patch Tuesday
May Patch Tuesday
Ivanti
 
Computer Systems Quiz Presentation in Purple Bold Style (4).pdf
Computer Systems Quiz Presentation in Purple Bold Style (4).pdfComputer Systems Quiz Presentation in Purple Bold Style (4).pdf
Computer Systems Quiz Presentation in Purple Bold Style (4).pdf
fizarcse
 
Top 5 Qualities to Look for in Salesforce Partners in 2025
Top 5 Qualities to Look for in Salesforce Partners in 2025Top 5 Qualities to Look for in Salesforce Partners in 2025
Top 5 Qualities to Look for in Salesforce Partners in 2025
Damco Salesforce Services
 
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
ICT Frame Magazine Pvt. Ltd.
 
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Alan Dix
 

Application of data mining tools for

  • 1. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014 DOI : 10.5121/ijdkp.2014.4405 55 APPLICATION OF DATA MINING TOOLS FOR SELECTED SCRIPTS OF STOCK MARKET K. S. Mahajan1 and Dr. R. V. Kulkarni2 1 Research student, Chh. Shahu Institute of Business Education and Research Center, Kolhapur, India 2 Professor and HOD, Chh. Shahu Institute of Business Education and Research Center, Kolhapur, India ABSTRACT One of the most important problems in modern finance is finding efficient ways to summarize and visualize the stock market data to give individuals or institutions useful information about the market behavior for investment decisions Therefore, Investment can be considered as one of the fundamental pillars of national economy. So, at the present time many investors look to find criterion to compare stocks together and selecting the best and also investors choose strategies that maximize the earning value of the investment process. Therefore the enormous amount of valuable data generated by the stock market has attracted researchers to explore this problem domain using different methodologies. Therefore research in data mining has gained a high attraction due to the importance of its applications and the increasing generation information. So, Data mining tools such as association rule, rule induction method and Apriori algorithm techniques are used to find association between different scripts of stock market, and also much of the research and development has taken place regarding the reasons for fluctuating Indian stock exchange. But, now days there are two important factors such as gold prices and US Dollar Prices are more dominating on Indian Stock Market and to find out the correlation between gold prices, dollar prices and BSE index statistical correlation is used and this helps the activities of stock operators, brokers, investors and jobbers. They are based on the forecasting the fluctuation of index share prices, gold prices, dollar prices and transactions of customers. Hence researcher has considered these problems as a topic for research. KEYWORDS Stock Market, Association Rules, Rule Induction Methods, Apriori Algorithm, Correlation, Data Mining. 1. INTRODUCTION Data mining, the science and technology of exploring data in order to discover previously unknown patterns, is a part of the overall process of knowledge discovery in databases (KDD). In today’s computer-driven world, these databases contain massive quantities of information. The accessibility of this information makes data mining important and necessary. Data mining often can improve existing models by finding additional, important variables, indentifying interaction terms and detecting nonlinear relationships.
  • 2. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014 56 Financial institutions such as stock markets produce huge datasets that build a foundation for approaching these enormously complex and dynamic problems with data mining tools. Potential significant benefits of solving these problems motivated extensive research for years. Specifics of data mining in finance are coming from the need to accommodate specific efficiency criteria (e.g., the maximum of trading profit) to prediction accuracy, coordinated multiresolution forecast (minutes, days, weeks, months, and years), Be able to benefit from very subtle patterns with a short life time, and incorporate the impact of market players on market regularities , Impact of gold and US dollar prices on stock market and also to find association between different scripts of stock market which helps investors to earn more profit. The techniques that are used in this project are: 1. Association rules 2. Apirori algorithm 3. Rule induction Method 4. Statistical Correlation 1.1 Association Rule: Unlike the other data mining functions, association is transaction based. In transaction processing, a case consists of a transactions such as a market basket analysis. The collection of items in the transaction is a multi- record attributes. Association rules are IF/THEN Statements. Example: “if a customer purchases Infosys Ltd, Then customer also purchases Wipro Ltd with 60% confidence”. An association rule has two parts, an antecedent (if) and a consequent (then), an antecedent is an item found in the data. A consequent is an item that is found in combination with the antecedent. Association Rule is created by analyzing data for frequent IF/THEN patterns & and using the criteria Support & Confidence to identify the most important relationships. Support and Confidence are two measures of association rule. Association Rule take following form x=>y, where x and y are the sets of items. The goal is to discover all the rules that have the Support & Confidence greater than or equal to the minimum support and minimum confidence respectively. Steps To Generate Association Rules: 1. Generate all possible association rules. 2. Compute the support and confidence of all possible association rules.
  • 3. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014 57 3. Apply two threshold criteria minimum support and minimum confidence to obtain association rule. 4. Minimum support and minimum confidence is taken as an average of all the calculated support and calculated confidence. 5. If the calculated support and confidence is greater than or equal to the minimum support and minimum confidence then these items are said to be associated with each other by association rule. SUPPORT: The Support of a rule indicates how frequently the item in the rule occurs together. Example: Dr.Reddy’s lab and Cipla Ltd might appear together in 10% of the transaction. Support is calculated as below: Support (x=>y) = (Number of transaction Containing x&y) / (Total Number of transaction). CONFIDENCE: Confidence is the number of times the IF/THEN statements have been found to be true. The confidence of a rule indicates the probability of both the antecedent and the consequent appearing in the same transaction. Example: Dr.Reddy’s lab might appear in 20 transactions, 10 of the 20 might also include Cipla Ltd. Therefore Dr.Reddy’s Lab implies Cipla Ltd with 67% confidence. And Confidence is calculated as below: Confidence(x->y) = [Support(x->y)] / [Support of x]. Example: Association Rules from BSE SENSEX, Here Researcher has selected sector wise scripts for the calculation of association between the same sector scripts: Pharmaceuticals Sector: From BSE SENSEX researcher has selected Cipla Ltd, Dr.Reddy’s Lab, SunPharma India Ltd, Glenmark Ltd, Orchid Chemicals Ltd to calculate association between these same sector scripts. Here minimum support is the average of all the calculated support. And the MINIMUM SUPPORT: sum of support / total number of scripts =56/10 =5.6 %
  • 4. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014 58 So, minimum support is 5.6% MINIMUM CONFIDENCE: sum of confidence / total number of scripts = 350 / 10 =35% So minimum confidence is 35%. Researcher applied the above rule to calculate min.Support and min.Confidence to obtain result for other sector scripts. So from the above data analysis researcher can conclude that Cipla ltd and Dr.Reddy’s Ltd go hand in hand and also Dr.reddy’s lab And Sun Pharma India Ltd goes hand in hand. So researcher can say these scripts are strongly associated with each other. 2. APRIORI ALGORITHM: Apriori is a classical algorithm and is designed to operate on databases containing transactions. The theory of Apriori algorithm is that “All nonempty subsets of a frequent item set must also be frequent.” Apriori principle can be shown as below:
  • 5. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014 59 For all(x, y) :( x belongs to y) => s(x)>=s(y) i.e. support of an item set never exceeds the support of its subsets. This property is also known as monotone property of support. Algorithm is used to mine the frequent item sets. Apriori Algorithm is as follows: – Let K=1. – Generate frequent item sets of length l – Repeat until no frequent item sets are identified. Example: Support count (Dr.Reddy’s pharma lab Ltd) = No of transactions containing Dr. Reddy’s Pharma ltd = 18. 3. RULE INDUCTION TECHNIQUE Rule induction technique retrieves all interesting patterns from database. In rule induction technique, the rule if of “if this then this”. For example a rule that a stock market might find in their data collected from market transaction report would be: “if Reliance Industries Ltd script is purchased then Oil and Natural Gas Corporation is purchased”. or If Tata steel then SAIL If Mahindra then Hindustan motors In order for the rules to be useful there are two pieces on information that must be supplied as well as the actual rule: Accuracy- How often is the rule correct? Coverage- How often does the rule apply?
  • 6. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014 60 From the above observation we conclude that Dr.Reddy’s Lab and Sun Pharma India Ltd are associated with each other as these satisfies both the minimum accuracy= 34 and minimum coverage=16. So the rule is true. So, when the customer purchases Dr.Reddy’s lab customer will also go for Sun Pharma India Ltd with 66% Accuracy. So these are strongly associated with each other. 4. CORRELATION: To find out the impact of fluctuating gold prices and BSE sensex and the impact of dollar prices and BSE sensex from 2008 to 2013 researcher has used a statistical formula coefficient of correlation. The mathematical formula for computing r is: r = n ∑xy – (∑x)(∑y) / √ n(∑ x2 ) – ( ∑ x ) 2 √ n ( ∑ y2 ) – ( ∑ y )2 Where x and y are the sample means of X and Y, and sx and sy are the sample standard deviations of X and Y. If x and y are results of measurements that contain measurement error, the realistic limits on the correlation coefficient are not −1 to +1 but a smaller range.
  • 7. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014 61 The value of r is such that -1 < r < +1. The + and - signs are used for positive linear correlation and negative linear correlations, respectively. 1. Positive correlation: If x and y have a strong positive linear correlation, r is close to +1. An r value of exactly +1 indicates a perfect positive fit. Positive values indicate a relationship between x and y variables such that as values for x increase, values for y also increase. 2. Negative correlation: If x and y have a strong negative linear correlation, r is close to - 1. An r value of exactly -1 indicates a perfect negative fit. Negative values indicate a relationship between x and y such that as values for x increase, values for y decrease. 3. No correlation: If there is no linear correlation or a weak linear correlation, r is close to 0. A value near zero means that there is a random, nonlinear relationship between the two variables. 4. Note that r is a dimensionless quantity; that is; it does not depend on the units employed. 5. A perfect correlation of ± 1 occurs only when the data points all lie exactly on a straight line. If r = +1, the slope of this line is positive. If r = -1, the slope of this line is negative. 6. A correlation greater than 0.8 is generally described as strong, whereas a correlation less than 0.5 are generally described as weak. These values can vary based upon the “type" of data being examined. A study utilizing scientific data may require a stronger correlation than a study using social science data. Impact of gold prices on stock market: According to Indian scenario, Indian culture and tradition majority of the Indian women would like to invest in gold because of their tradition and their liking for gold. This leads to invest in gold because of its nature of keeping value, low risk, and as India is having parallel economy there are no any rules or fix criteria for investing in gold. So Indian people feel more beneficial to invest in gold therefore, Gold is having more impact on Equity Market. EXAMPLE: Table shows the correlations between GOLD and BSE SENSEX from January 2008 to august 2013
  • 8. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014 62 Table shows the correlation between US DOLLAR and BSE SENSEX from January 2008 to august 2013: Currency market launched in 1999. 4000cr is daily turnover of the exchange because of which currency market became strong. And the more popular and effective currency is US Dollar. As compare to Equity, Dollar fluctuates slowly and more effective to the investors which leads to investors to prefer investing in currency market. Dollar is not only important for investment purpose but every countries financial strategy for planning to balance their currency with dollar for good economic results which leads to become dollar stronger. Therefore dollar is having more impact on equity market from last few years.
  • 9. International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.4, July 2014 63 5. CONCLUSION An Association between selected scripts of Indian Stock Market and a correlation between Indian Gold Prices and BSE SENSEX INDEX and Dollar Prices and BSE SENSEX INDEX has been described. In this paper Data Mining Tools such as Association Rule, Apriori Algorithm, and Rule Induction Methods are used for Association of Indian Stock market in order to find out which scripts are much associated with each other. Results shows that sector wise scripts are much associated with each other which helps investors, brokers, jobbers for investment decision. Statistical Correlation result shows that, Gold Prices and Dollar Prices has an impact on Indian Stock Market. REFERENCES [1] Alex Berson and Stephen j. Smith, “Data Warehousing, Data Mining, and OLAP”, MC Graw Hill, 1997. [2] A. D. Devale and Dr. R. V. Kulkarni, “Application Of Data Mining Techniques In Life Insurance”, International Journal Of Data Mining and Knowledge Management Process Vol.2. No.4, July 2012. [3] Arun. K. Pujari, “Data Mining Techniques”, Universities Press (India) PVT Ltd, 2001. [4] C.R. Kothari, “Research Methodology: Methods and Techniques”, New Age International (p) Ltd, 2004. [5] Chengqi Zhang, Shichao Zhang, “Association Rule Mining: Models and Algorithm, Springer, 2002. [6] David Cheung, Vincent T., Ada W. Fu and Yongjian Fv, “Efficient Mining of Association Rules in Distributed Databases”, IEEE, 1996. [7] J. Date, “An Introduction to Database Systems”, Addition Wesley longman, Seven Edition, 2000. [8] J.K. Sharma, “Business Statistics” Pearson Education, 2008. [9] Ken Orr, “Data Warehousing Technology”, Copyright. The Ken Or Institute, 1997. [10] Krzysztof J. Cios, Witold Pedryez and Roman W. Surniarski, “Data Mining Methods for Knowledge Discovery”, Kluwer Academic Publishers 1998 Second Printing 2000. [11] L. M. Bhole, “Financial Institutions and Markets: Structure, Growth and Innovation, MC Graw Hill, 2006. [12] Ming-Syan chen, Jiawei Han and Philip S. Yu, “Data Mining: An Overview From a Database Perspective”, IEEE Transactions on Knowledge and Data Engineering Vol. 8, No. 6, Dec. 1996. [13] NSE’s Certification In Financial Markets, National Stock Exchange of India ltd
  翻译: