Understand model predictions with the interactive partial dependence/ICE plot. Perform cross-validation to guarantee model stability.Įxplain machine learning models with LIME, Shap/Shapley values. Validate models by applying performance metrics including Accuracy, R2, AUC, and ROC. Optimize model performance with hyperparameter optimization, boosting, bagging, stacking, or building complex ensembles. Manipulate text, apply formulas on numerical data, and apply rules to filter out or mark samples.īuild machine learning models for classification, regression, dimension reduction, or clustering, using advanced algorithms including deep learning, tree-based methods, and logistic regression. Extract and select features (or construct new ones) to prepare your dataset for machine learning with genetic algorithms, random search, or backward- and forward feature elimination.
Detect out of range values with outlier and anomaly detection algorithms. Aggregate, sort, filter, and join data either on your local machine, in-database, or in distributed big data environments.Ĭlean data through normalization, data type conversion, and missing value handling. Integrate dimensions reduction, correlation analysis, and more into your workflows. Access and retrieve data from sources such as Twitter, AWS S3, Google Sheets, and Azure.ĭerive statistics, including mean, quantiles, and standard deviation, or apply statistical tests to validate a hypothesis. Load Avro, Parquet, or ORC files from HDFS, S3, or Azure. Connect to a host of databases and data warehouses to integrate data from Oracle, Microsoft SQL, Apache Hive, and more. Open and combine simple text formats (CSV, PDF, XLS, JSON, XML, etc), unstructured data types (images, documents, networks, molecules, etc), or time-series data. Intuitive, open, and continuously integrating new developments, KNIME for Windows PC makes understanding data and designing data science workflows and reusable components accessible to everyone.
KNIME Analytics Platform is the open-source software for creating data science.