-
-
Notifications
You must be signed in to change notification settings - Fork 46.6k
feat: Implement Principal Component Analysis (PCA) #12595
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Implement Principal Component Analysis (PCA) #12595
Conversation
- Added a Python implementation of PCA using NumPy and scikit-learn - Standardizes the dataset before applying PCA for better performance - Computes principal components and explained variance ratio - Uses the Iris dataset as a sample for demonstration - Provides a modular structure for easy extension and dataset modification
for more information, see https://pre-commit.ci
…11/Python into principle-component-analysis
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Click here to look at the relevant links ⬇️
🔗 Relevant Links
Repository:
Python:
Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.
algorithms-keeper
commands and options
algorithms-keeper actions can be triggered by commenting on this PR:
@algorithms-keeper review
to trigger the checks for only added pull request files@algorithms-keeper review-all
to trigger the checks for all the pull request files, including the modified files. As we cannot post review comments on lines not part of the diff, this command will post all the messages in one comment.NOTE: Commands are in beta and so this feature is restricted only to a member or owner of the organization.
from sklearn.datasets import load_iris | ||
|
||
|
||
def collect_dataset(): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As there is no test file in this pull request nor any test function or class in the file machine_learning/principle_component_analysis.py
, please provide doctest for the function collect_dataset
Please provide return type hint for the function: collect_dataset
. If the function does not return a value, please provide the type hint as: def function() -> None:
return np.array(data.data), np.array(data.target) | ||
|
||
|
||
def apply_pca(data_x, n_components): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As there is no test file in this pull request nor any test function or class in the file machine_learning/principle_component_analysis.py
, please provide doctest for the function apply_pca
Please provide return type hint for the function: apply_pca
. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide type hint for the parameter: data_x
Please provide type hint for the parameter: n_components
return principal_components, explained_variance | ||
|
||
|
||
def main(): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As there is no test file in this pull request nor any test function or class in the file machine_learning/principle_component_analysis.py
, please provide doctest for the function main
Please provide return type hint for the function: main
. If the function does not return a value, please provide the type hint as: def function() -> None:
Describe your change:
Checklist: