add visualization of k means clustering as excel format #2104

beqakd · 2020-06-12T12:57:04Z

Describe your change:

Add nice feature to convert Dataframe with clustering number in it to excel style format. It is easily readable and also has a lot of features in it to navigate through data.(like mean, mean_with_zero, median, max, min and etc.)

Add an algorithm? K mean clustering into excel style. #2102 K mean clustering into excel style. #2102

Checklist:

I have read CONTRIBUTING.md.
This pull request is all my own work -- I have not plagiarized.
I know that pull requests will not be merged if they fail the automated tests.
This PR only changes one algorithm file. To ease review, please open separate PRs for separate algorithms.
All new Python files are placed inside an existing directory.
All filenames are in all lowercase characters with no spaces or dashes.
All functions and variable names follow Python naming conventions.
All function parameters and return values are annotated with Python type hints.

TravisBuddy · 2020-06-12T12:59:57Z

Hey @beqakd,
Something went wrong with the build.

TravisCI finished with status errored, which means the build failed because of something unrelated to the tests, such as a problem with a dependency or the build process itself.

View build log

TravisBuddy Request Identifier: 9d0eb2c0-acac-11ea-8245-836d9da6cb4c

TravisBuddy · 2020-06-12T16:18:47Z

Hey @beqakd,
Something went wrong with the build.

TravisCI finished with status errored, which means the build failed because of something unrelated to the tests, such as a problem with a dependency or the build process itself.

View build log

TravisBuddy Request Identifier: 640c3620-acc8-11ea-8245-836d9da6cb4c

TravisBuddy · 2020-06-12T19:35:40Z

Hey @beqakd,
Something went wrong with the build.

TravisCI finished with status errored, which means the build failed because of something unrelated to the tests, such as a problem with a dependency or the build process itself.

View build log

TravisBuddy Request Identifier: e4967240-ace3-11ea-8245-836d9da6cb4c

beqakd · 2020-06-12T19:36:42Z

Error is not from my PR. Style changes fixed.

cclauss · 2020-06-17T19:26:59Z

machine_learning/k_means_clust.py

@@ -202,3 +204,127 @@ def kmeans(
        verbose=True,
    )
    plot_heterogeneity(heterogeneity, k)
+
+
+def ReportGenerator(df, ClusteringVariables, FillMissingReport=None):


Type hints? Doctests? Function and variable names need to be snake_case -- See CONTRIBUTING.md.

Typehints okey. but i cant write doctests. It returns pandas dataframe can you suggest me some ideas how i can do it? We can test manually to be sure that it works

Is it impossible to examine various elements of a dataframe to ensure that those elements contain expected values?

Its not but i wont be able to check all of them, but i will do good testing with different approaches. Thanks!

Just a few sanity checks will be good enough for our purposes. Thx.

cclauss · 2020-06-17T19:28:05Z

machine_learning/k_means_clust.py

+    """
+    Function generates easy-erading clustering report. It takes 2 arguments as an input:
+        DataFrame - dataframe with predicted cluester column;
+        FillMissingReport - dcitionary of rules how we are going to fill missing


Suggested change

FillMissingReport - dcitionary of rules how we are going to fill missing

FillMissingReport - dictionary of rules how we are going to fill missing

TravisBuddy · 2020-06-19T11:05:37Z

Hey @beqakd,
Something went wrong with the build.

TravisCI finished with status errored, which means the build failed because of something unrelated to the tests, such as a problem with a dependency or the build process itself.

View build log

TravisBuddy Request Identifier: ccec7b60-b21c-11ea-b4e4-a33918451e6b

TravisBuddy · 2020-06-19T11:12:15Z

Hey @beqakd,
Something went wrong with the build.

TravisCI finished with status errored, which means the build failed because of something unrelated to the tests, such as a problem with a dependency or the build process itself.

View build log

TravisBuddy Request Identifier: ba40dc30-b21d-11ea-b4e4-a33918451e6b

machine_learning/k_means_clust.py

Co-authored-by: Christian Clauss <[email protected]>

machine_learning/k_means_clust.py

Co-authored-by: Christian Clauss <[email protected]>

cclauss

Thanks for your work here!

beqakd · 2020-06-19T16:47:18Z

Thanks for your work here!

Thanks a lot!

…s#2104) * add visualization of kmneas clust as excel format * style changes * style changes * Add doctest and typehint! * style change * Update machine_learning/k_means_clust.py Co-authored-by: Christian Clauss <[email protected]> * Update machine_learning/k_means_clust.py Co-authored-by: Christian Clauss <[email protected]> Co-authored-by: Christian Clauss <[email protected]>

add visualization of kmneas clust as excel format

aea5f0d

style changes

3a62fc0

style changes

4f49185

cclauss reviewed Jun 17, 2020

View reviewed changes

Add doctest and typehint!

b523bd5

style change

9a6ba97

cclauss reviewed Jun 19, 2020

View reviewed changes

machine_learning/k_means_clust.py Outdated Show resolved Hide resolved

Update machine_learning/k_means_clust.py

8367113

Co-authored-by: Christian Clauss <[email protected]>

cclauss reviewed Jun 19, 2020

View reviewed changes

machine_learning/k_means_clust.py Outdated Show resolved Hide resolved

Update machine_learning/k_means_clust.py

f1e2cec

Co-authored-by: Christian Clauss <[email protected]>

cclauss approved these changes Jun 19, 2020

View reviewed changes

cclauss merged commit d034add into TheAlgorithms:master Jun 19, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add visualization of k means clustering as excel format #2104

add visualization of k means clustering as excel format #2104

beqakd commented Jun 12, 2020 •

edited

Loading

TravisBuddy commented Jun 12, 2020

TravisBuddy commented Jun 12, 2020

TravisBuddy commented Jun 12, 2020

beqakd commented Jun 12, 2020

cclauss Jun 17, 2020 •

edited

Loading

beqakd Jun 17, 2020

cclauss Jun 17, 2020

beqakd Jun 17, 2020

cclauss Jun 17, 2020

cclauss Jun 17, 2020

TravisBuddy commented Jun 19, 2020

TravisBuddy commented Jun 19, 2020

cclauss left a comment

beqakd commented Jun 19, 2020

	FillMissingReport - dcitionary of rules how we are going to fill missing
	FillMissingReport - dictionary of rules how we are going to fill missing

add visualization of k means clustering as excel format #2104

add visualization of k means clustering as excel format #2104

Conversation

beqakd commented Jun 12, 2020 • edited Loading

Describe your change:

Checklist:

TravisBuddy commented Jun 12, 2020

TravisBuddy Request Identifier: 9d0eb2c0-acac-11ea-8245-836d9da6cb4c

TravisBuddy commented Jun 12, 2020

TravisBuddy Request Identifier: 640c3620-acc8-11ea-8245-836d9da6cb4c

TravisBuddy commented Jun 12, 2020

TravisBuddy Request Identifier: e4967240-ace3-11ea-8245-836d9da6cb4c

beqakd commented Jun 12, 2020

cclauss Jun 17, 2020 • edited Loading

Choose a reason for hiding this comment

beqakd Jun 17, 2020

Choose a reason for hiding this comment

cclauss Jun 17, 2020

Choose a reason for hiding this comment

beqakd Jun 17, 2020

Choose a reason for hiding this comment

cclauss Jun 17, 2020

Choose a reason for hiding this comment

cclauss Jun 17, 2020

Choose a reason for hiding this comment

TravisBuddy commented Jun 19, 2020

TravisBuddy Request Identifier: ccec7b60-b21c-11ea-b4e4-a33918451e6b

TravisBuddy commented Jun 19, 2020

TravisBuddy Request Identifier: ba40dc30-b21d-11ea-b4e4-a33918451e6b

cclauss left a comment

Choose a reason for hiding this comment

beqakd commented Jun 19, 2020

beqakd commented Jun 12, 2020 •

edited

Loading

cclauss Jun 17, 2020 •

edited

Loading