no code implementations • 8 Oct 2018 • Kumarjit Pathak, Jitin Kapila
Philosophy of this algorithm is to find similar data items and group them together based on any distance function in multidimensional space.
no code implementations • 7 Oct 2018 • Kumarjit Pathak, Jitin Kapila
There are state of the art methodologies to detect the impact of concept drift, however general strategy considered to overcome the issue in performance is to rebuild or re-calibrate the model periodically as the variable patterns for the model changes significantly due to market change or consumer behavior change etc.
no code implementations • 11 Jul 2018 • Aasheesh Barvey, Jitin Kapila, Kumarjit Pathak
To predict the employee attrition beforehand and to enable management to take individualized preventive action.
no code implementations • 25 May 2018 • Kumarjit Pathak, Jitin Kapila, Aasheesh Barvey
Classification is one of the widely used analytical techniques in data science domain across different business to associate a pattern which contribute to the occurrence of certain event which is predicted with some likelihood.
no code implementations • 25 May 2018 • Kumarjit Pathak, Jitin Kapila, Aasheesh Barvey
Personalized Influence Estimation is a technique introduced in this paper, which can estimate key factor influence for individual observations, which contribute most for each observations behavior pattern based on the dependent class or estimate.
no code implementations • 12 May 2018 • Kumarjit Pathak, Prabhukiran G, Jitin Kapila, Nikit Gawande
High volume of data, perceived as either challenge or opportunity.
no code implementations • 4 May 2018 • Kumarjit Pathak, Jitin Kapila, Aasheesh Barvey, Nikit Gawande
In regression modelling approach, the main step is to fit the regression line as close as possible to the target variable.