You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: machine-learning/README.md
+8-4Lines changed: 8 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,15 +17,19 @@ The following structure represent the available APIs and environments for the di
17
17
*__r__ - Oracle Machine Learning for R Notebooks, and Oracle Machine Learning for Spark Notebooks
18
18
*__sql__ - Notebooks for the Oracle Machine Learning for SQL and also the Oracle Machine Learning Notebooks (based on Zeppelin) included with Oracle ADW and ATP
19
19
20
-
*__python__ - Oracle Machine Learning for Python examples (coming soon)
20
+
*__python__ (Python language examples)
21
+
*__oml4py__ - Oracle Machine Learning for Python examples (coming soon)
21
22
22
-
*__r__
23
+
*__r__ (R language examples)
23
24
*__oml4r__ - Oracle Machine Learning for R (in-Database) R code examples
24
25
*__oml4spark__ - Oracle Machine Learning for Spark (in-Data Lake) R code examples
25
26
26
-
*__sql__
27
+
*__sql__ (SQL and PL/SQL examples)
27
28
*__19c__ - Oracle Machine Learning for SQL examples for Oracle Database 19c
28
29
*__20c__ - Oracle Machine Learning for SQL examples for Oracle Database 20c
29
30
30
-
For more information please visit our main page for Oracle Machine Learning at https://oracle.com/machine-learning
31
+
For more information please visit our main page [Oracle Machine Learning](https://oracle.com/machine-learning)
31
32
33
+
#### Copyright (c) 2020 Oracle Corporation and/or its affilitiates.
34
+
35
+
###### [The Universal Permissive License (UPL), Version 1.0](https://oss.oracle.com/licenses/upl/)
# Oracle Machine Learning sample Notebooks in R language
2
+
The sample notebooks on this folder are expected to be used with Oracle Machine Learning for R and Oracle Machine Learning for Spark.
3
+
4
+
The notebooks available in JSON format are compatible with any open-source Apache Zeppelin notebook environment.
5
+
6
+
That includes an Apache Zeppelin running on a server that can connect to an Oracle Database (for using the OML4R notebook examples), or an Apache Zeppelin running on a server (Hadoop Edge Node) that can connect to a Hadoop Cluster.
7
+
8
+
The OML4R notebooks can be used with an on-premise Oracle Database and also a Database Cloud Service (except the Autonomous Database, which runs its own Oracle Machine Learning Notebooks).
9
+
10
+
The OML4Spark notebooks can also be used with the Oracle Big Data Manager (available in the Oracle Big Data Appliance or Oracle Big Data Cloud Service).
11
+
12
+
#### Copyright (c) 2020 Oracle Corporation and/or its affilitiates.
13
+
14
+
###### [The Universal Permissive License (UPL), Version 1.0](https://oss.oracle.com/licenses/upl/)
Oracle Machine Learning is a collaborative user interface for data scientists and business and data analysts who perform machine learning in the Autonomous Databases--Autonomous Data Warehouse (ADW) and Autonomous Transactional Database (ATP).
1
+
# Oracle Machine Learning Notebooks
2
+
Oracle Machine Learning Notebooks is a collaborative user interface for data scientists and business and data analysts who perform machine learning in Oracle Autonomous Database -Autonomous Data Warehouse (ADW) and Autonomous Transactional Database (ATP).
3
3
4
-
Oracle Machine Learning enables data scientists, citizen data scientists, and data analysts to work together to explore their data visually and develop analytical methodologies in the Autonomous Data Warehouse Cloud. Oracle's high performance, parallel and scalable in-Database implementations of machine learning algorithms are exposed via SQL and PL/SQL using notebook technologies. Oracle Machine Learning enables teams to collaborate to build, assess, and deploy machine learning models, while increasing data scientist productivity Oracle Machine Learning focuses on ease of use and simplified machine learning for data science – from preparation through deployment – all in the Autonomous Database.
5
-
6
-
Based on Apache Zeppelin notebook technology, Oracle Machine Learning provides a common platform with a single interface that can connect to multiple data sources and access multiple back-end Autonomous Database servers. Multi-user collaboration enables the same notebook document to be opened simultaneously by different users, such that changes made by one user to a notebook are instantaneously reflected to all users viewing that notebook. To support enterprise requirements for security, authentication, and auditing, Oracle Machine Learning supports privilege-based access to data, models, and notebooks, as well as being integrated with Oracle security protocols.
4
+
Oracle Machine Learning Notebooks enables data scientists, citizen data scientists, and data analysts to work together to explore their data visually and develop analytical methodologies in the Autonomous Data Warehouse Cloud. Oracle's high performance, parallel and scalable in-Database implementations of machine learning algorithms are exposed via SQL and PL/SQL using notebook technologies. Oracle Machine Learning enables teams to collaborate to build, assess, and deploy machine learning models, while increasing data scientist productivity Oracle Machine Learning focuses on ease of use and simplified machine learning for data science – from preparation through deployment – all in Oracle Autonomous Database.
7
5
6
+
Based on Apache Zeppelin notebook technology, Oracle Machine Learning Notebooks provides a common platform with a single interface that can connect to multiple data sources and access multiple back-end Autonomous Database servers. Multi-user collaboration enables the same notebook document to be opened simultaneously by different users, such that changes made by one user to a notebook are instantaneously reflected to all users viewing that notebook. To support enterprise requirements for security, authentication, and auditing, Oracle Machine Learning supports privilege-based access to data, models, and notebooks, as well as being integrated with Oracle security protocols.
8
7
9
8
Key Features:
10
9
@@ -15,4 +14,8 @@ Key Features:
15
14
* Enables and supports deployments of enterprise machine learning methodologies in both Autonomous Data Warehouse (ADW) and Autonomous Transactional Database (ATP)
16
15
17
16
18
-
See Introducing Oracle Machine Learning SQL Notebooks for the Oracle Autonomous Data Warehouse Cloud! blog post (https://blogs.oracle.com/datamining/introducing-oracle-machine-learning-sql-notebooks-for-the-oracle-autonomous-data-warehouse-cloud ) for more information.
17
+
See [Introducing Oracle Machine Learning Notebooks for the Oracle Autonomous Data Warehouse](https://blogs.oracle.com/datamining/introducing-oracle-machine-learning-sql-notebooks-for-the-oracle-autonomous-data-warehouse-cloud) blog post for more information.
18
+
19
+
#### Copyright (c) 2020 Oracle Corporation and/or its affilitiates.
20
+
21
+
###### [The Universal Permissive License (UPL), Version 1.0](https://oss.oracle.com/licenses/upl/)
## Tutorials for OML4Spark (a.k.a. ORAAH) release 2.8.x
4
+
**The [Oracle Machine Learning for Spark][1] (OML4Spark) is a set of R packages and Java libraries**
5
+
**It provides several features:**
6
+
- An R interface for manipulating data stored in a local File System, HDFS, HIVE, Impala or JDBC sources, and creating Distributed Model Matrices across a Cluster of Hadoop Nodes in preparation for ML
7
+
- A general computation framework where users invoke parallel, distributed MapReduce jobs from R, writing custom mappers and reducers in R while also leveraging open source CRAN packages
8
+
- Parallel and distributed Machine Learning algorithms that take advantage of all the nodes of a Hadoop cluster for scalable, high performance modeling on big data. Functions use the expressive R formula object optimized for Spark parallel execution
9
+
ORAAH's custom LM/GLM/MLP NN algorithms on Spark scale better and run faster than the open-source Spark MLlib functions, but ORAAH provides interfaces to MLlib as well
10
+
- Core Analytics functionality halso available in a standalone Java library that can be used directly without the need of the R language, and can be called from any Java or Scala platform.
11
+
12
+
13
+
**The following are a list of demos containing R code for learning about (OML4Spark)**
14
+
- Files on the current folder
15
+
- Introduction to OML4Spark (oml4spark_tutorial_getting_started_with_hdfs.r)
16
+
- Working with HIVE, IMPALA and Spark Data Frames (oml4spark_tutorial_getting_started_with_hive_impala_spark.r)
17
+
- Function in R to visualize Hadoop Data in Apache Zeppelin z.show (oml4spark_function_zeppelin_visualization_z_show.r)
18
+
- AutoML for Classification using Cross Validation with OML4Spark
19
+
* Sample Execution of the Cross Validation (oml4spark_execute_cross_validation.r)
20
+
* Function to run the Cross Validation (oml4spark_function_run_cross_validation.r)
21
+
* Function to Create a Balanced input Dataset (oml4spark_function_create_balanced_input.r)
22
+
* Function to run Variable Selection via GLM Logistic (oml4spark_function_variable_selection_via_glm.r)
23
+
* Function to run Variable Selection via Singular Value Decomposition (oml4spark_function_variable_selection_via_pca.r)
24
+
* Function to compute Confusion Matrix and statistics (oml4spark_function_confusion_matrix_in_spark.r)
25
+
* Function to build all OML4Spark models (oml4spark_function_build_all_classification_models.r)
0 commit comments