feat: Implement Principal Component Analysis (PCA) #12595

parikshit2111 · 2025-03-01T04:28:28Z

- Added a Python implementation of PCA using NumPy and scikit-learn
- Standardizes the dataset before applying PCA so that features on different scales contribute comparably
- Computes the principal components and the explained variance ratio
- Uses the Iris dataset as a sample for demonstration
- Provides a modular structure for easy extension and dataset modification
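
For reviewers who want the steps in the description spelled out, here is a minimal NumPy-only sketch of standardizing, projecting, and computing the explained variance ratio. It is not the code from this PR: the function name `pca_sketch` and the synthetic sample data are assumptions made for illustration, and it uses an eigendecomposition of the covariance matrix rather than scikit-learn.

```python
import numpy as np


def pca_sketch(
    data_x: np.ndarray, n_components: int
) -> tuple[np.ndarray, np.ndarray]:
    """Project standardized data onto its top principal components."""
    # Standardize: zero mean and unit variance per feature (column).
    standardized = (data_x - data_x.mean(axis=0)) / data_x.std(axis=0)
    # Covariance matrix of the standardized features.
    covariance = np.cov(standardized, rowvar=False)
    # eigh is meant for symmetric matrices; eigenvalues come back ascending.
    eigenvalues, eigenvectors = np.linalg.eigh(covariance)
    # Keep the indices of the n_components largest eigenvalues.
    order = np.argsort(eigenvalues)[::-1][:n_components]
    components = eigenvectors[:, order]
    explained_variance_ratio = eigenvalues[order] / eigenvalues.sum()
    return standardized @ components, explained_variance_ratio


# Hypothetical sample data: 100 rows, 3 features, two of them correlated.
rng = np.random.default_rng(0)
sample = rng.normal(size=(100, 3))
sample[:, 1] = 2.0 * sample[:, 0] + 0.1 * sample[:, 1]

projected, ratio = pca_sketch(sample, n_components=2)
print(projected.shape)
print(ratio)
```

The ratio is each kept eigenvalue divided by the sum of all eigenvalues, so it does not depend on whether the covariance is normalized by `n` or `n - 1`.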

Describe your change:

[✅] Add an algorithm?
[ ] Fix a bug or typo in an existing algorithm?
[ ] Add or change doctests? -- Note: Please avoid changing both code and tests in a single pull request.
[ ] Documentation change?

Checklist:

[✅] I have read CONTRIBUTING.md.
[✅] This pull request is all my own work -- I have not plagiarized.
[✅] I know that pull requests will not be merged if they fail the automated tests.
[✅] This PR only changes one algorithm file. To ease review, please open separate PRs for separate algorithms.
[✅] All new Python files are placed inside an existing directory.
[✅] All filenames are in all lowercase characters with no spaces or dashes.
[✅] All functions and variable names follow Python naming conventions.
[✅] All function parameters and return values are annotated with Python type hints.
[ ] All functions have doctests that pass the automated testing.
[ ] All new algorithms include at least one URL that points to Wikipedia or another similar explanation.
[ ] If this pull request resolves one or more open issues then the description above includes the issue number(s) with a closing keyword: "Fixes #ISSUE-NUMBER".

algorithms-keeper

🔗 Relevant Links

Repository:
- Contributing guidelines
- Project Euler solution guidelines

Python:
- Formatted string literals (f-strings)
- Type hints
- doctest
- unittest
- pytest

Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.

algorithms-keeper commands and options

algorithms-keeper actions can be triggered by commenting on this PR:

@algorithms-keeper review to trigger the checks for only added pull request files

@algorithms-keeper review-all to trigger the checks for all the pull request files, including the modified files. As we cannot post review comments on lines not part of the diff, this command will post all the messages in one comment.

NOTE: Commands are in beta and so this feature is restricted only to a member or owner of the organization.

algorithms-keeper · 2025-03-01T04:33:50Z

machine_learning/principle_component_analysis.py

+from sklearn.datasets import load_iris
+
+
+def collect_dataset():


As there is no test file in this pull request nor any test function or class in the file machine_learning/principle_component_analysis.py, please provide doctest for the function collect_dataset

Please provide return type hint for the function: collect_dataset. If the function does not return a value, please provide the type hint as: def function() -> None:
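
One way to address both requests for `collect_dataset` could look like the sketch below. Only the import and the `return` line are visible in the diff, so the docstring, the doctest, and the `tuple[np.ndarray, np.ndarray]` annotation are assumptions about what would satisfy the checks, not the author's code.

```python
import numpy as np
from sklearn.datasets import load_iris


def collect_dataset() -> tuple[np.ndarray, np.ndarray]:
    """
    Load the Iris dataset as a (features, labels) pair.

    >>> data_x, data_y = collect_dataset()
    >>> data_x.shape
    (150, 4)
    >>> data_y.shape
    (150,)
    """
    data = load_iris()
    return np.array(data.data), np.array(data.target)


if __name__ == "__main__":
    import doctest

    # Runs the examples in the docstrings above and reports any mismatch.
    doctest.testmod()
```

Checking shapes rather than printed arrays keeps the doctest short and independent of NumPy's float formatting.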

algorithms-keeper · 2025-03-01T04:33:50Z

machine_learning/principle_component_analysis.py

+    return np.array(data.data), np.array(data.target)
+
+
+def apply_pca(data_x, n_components):


As there is no test file in this pull request nor any test function or class in the file machine_learning/principle_component_analysis.py, please provide doctest for the function apply_pca

Please provide return type hint for the function: apply_pca. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide type hint for the parameter: data_x

Please provide type hint for the parameter: n_components
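
A possible annotated version of `apply_pca` is sketched below. The function body is not shown in the diff, so using `StandardScaler` and `PCA` from scikit-learn is an assumption based on the PR description, as are the docstring and the choice to return `explained_variance_ratio_` as `explained_variance`.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler


def apply_pca(
    data_x: np.ndarray, n_components: int
) -> tuple[np.ndarray, np.ndarray]:
    """
    Standardize data_x, then project it onto n_components principal components.

    >>> rng = np.random.default_rng(0)
    >>> sample = rng.normal(size=(20, 4))
    >>> transformed, variance_ratio = apply_pca(sample, 2)
    >>> transformed.shape
    (20, 2)
    >>> variance_ratio.shape
    (2,)
    >>> bool(variance_ratio[0] >= variance_ratio[1])
    True
    """
    # Scale every feature to zero mean and unit variance before fitting PCA.
    standardized = StandardScaler().fit_transform(data_x)
    pca = PCA(n_components=n_components)
    principal_components = pca.fit_transform(standardized)
    # Assumption: the second return value is the explained variance ratio.
    explained_variance = pca.explained_variance_ratio_
    return principal_components, explained_variance
```

The doctest uses a small seeded random array so it does not depend on loading the Iris data.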

algorithms-keeper · 2025-03-01T04:33:50Z

machine_learning/principle_component_analysis.py

+    return principal_components, explained_variance
+
+
+def main():


As there is no test file in this pull request nor any test function or class in the file machine_learning/principle_component_analysis.py, please provide doctest for the function main

Please provide return type hint for the function: main. If the function does not return a value, please provide the type hint as: def function() -> None:
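
For `main`, the `-> None` hint and a doctest on the printed output might be enough. The sketch below inlines the loading and PCA steps to stay self-contained; the body of the real `main` is not visible in the diff, so the printed summary line and the choice of two components are hypothetical.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler


def main() -> None:
    """
    Run PCA on the Iris dataset and print a short summary.

    >>> main()
    Transformed shape: (150, 2)
    """
    data = load_iris()
    standardized = StandardScaler().fit_transform(data.data)
    # Assumption: two components, chosen only for this illustration.
    transformed = PCA(n_components=2).fit_transform(standardized)
    print(f"Transformed shape: {transformed.shape}")


if __name__ == "__main__":
    main()
```

Printing only the shape avoids floating-point values in the expected doctest output, which can differ slightly between platforms.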

parikshit2111 and others added 2 commits March 1, 2025 09:47

[pre-commit.ci] auto fixes from pre-commit.com hooks

ac605b5

for more information, see https://pre-commit.ci

algorithms-keeper bot added the "tests are failing" label (Do not merge until tests pass) Mar 1, 2025

parikshit2111 added 2 commits March 1, 2025 10:02

refactor:Removed requests from imports

e735cbe

Merge branch 'principle-component-analysis' of github.com:parikshit21…

75faa9c

…11/Python into principle-component-analysis

algorithms-keeper bot added the "require tests" (Tests [doctest/unittest/pytest] are required) and "require type hints" (https://docs.python.org/3/library/typing.html) labels Mar 1, 2025

algorithms-keeper bot reviewed Mar 1, 2025

algorithms-keeper bot added the "awaiting reviews" label (This PR is ready to be reviewed) Mar 1, 2025

parikshit2111 closed this Mar 1, 2025
