Oracle

Data Scientist ·

I recently started a new position as a data scientist at an e-commerce company. The company was founded about 4-5 years ago and is new to many data-related areas. I'm their first data science employee, so I have to handle data analysis tasks as well as bring new technologies to the company.

  1. They use Elasticsearch (and Kibana) for reporting dashboards on daily purchases and user interactions on their e-commerce website.

  2. They also use the Oracle database system to keep records of their daily turnover and lists of their current products, clients, and sellers.

  3. They use a data warehouse with cockpit 10 to generate reports on different aspects of their business, including the data in item 2 of this list.

At the moment, I grab batches of data from their systems to perform predictive analytics. In some cases, I use static data such as monthly turnover, client values, and high-demand products, and run my predictive analysis in Python (VS Code). I use Google Data Studio or Google Sheets to present my findings. In other cases, I do time-series analysis on offline batches of data extracted from Elasticsearch for user recommendations and user personalization.

I really want to use modern data science tools such as Apache Spark, Google BigQuery, AWS, Azure, or others where they really fit. I think these tools can improve my performance as a data scientist and provide more continuous analytics of the company's business interactions. But honestly, I'm not sure where each tool is needed, or which parts of the current system should be replaced by or combined with newer technology to improve productivity in these areas.

5 upvotes·334.5K views
Replies (2)
Data Scientist / Data Engineer at The Prosperity Company AG·

It's hard to make a suggestion here as your use case isn't clear enough.

Use BigQuery if you want to replicate your (probably on-premise) Oracle and Elasticsearch databases so you can benefit from BigQuery's speed. You can do the replication via Google Cloud Functions. Google Sheets can be connected to BigQuery, and BigQuery can easily be connected to Data Studio.

If you do data science on the data, BigQuery ML and Google Colab would fit into your stack.

If you do big data analysis, you can go with Apache Spark, provided you have enough resources (on-prem or cloud). I suggest using a Kubernetes backbone for this, as you only reserve the resources when they are in use and the cluster can be used for other things as well.

For dashboarding, find your preference and that of your audience among Data Studio, Tableau, and Apache Superset.

9 upvotes·2 comments·5.4K views
Bob hs · March 16th 2022 at 12:09PM

Thank you for the answer.

A few days ago the head of IT told me to try AWS if I need cloud resources. I cannot migrate everything from on-premise to the cloud, so I need to choose which data I need in the cloud for my data science tasks. For example, I need to extract the daily sale records stored in Oracle as well as the web usage data from Elasticsearch.
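A minimal sketch of what such a daily extraction could look like, assuming the data is pulled one calendar day at a time. The table name `daily_sales`, the column `sale_date`, and the time field `@timestamp` are hypothetical placeholders, not names from the actual system. The functions only build the query payloads; sending them with an Elasticsearch or Oracle client is left out.

```python
from datetime import date, timedelta


def day_bounds(day):
    """Return ISO date strings for the half-open interval [day, day + 1)."""
    return day.isoformat(), (day + timedelta(days=1)).isoformat()


def elasticsearch_day_query(day, time_field="@timestamp"):
    """Build an Elasticsearch range query body covering one calendar day.

    `time_field` is an assumed field name; use whatever the index defines.
    """
    start, end = day_bounds(day)
    return {"query": {"range": {time_field: {"gte": start, "lt": end}}}}


def oracle_day_query(day, table="daily_sales", date_column="sale_date"):
    """Build a SQL statement and bind values for one day of sale records.

    `table` and `date_column` are hypothetical identifiers. The dates are
    passed as named bind variables rather than formatted into the SQL text.
    """
    start, end = day_bounds(day)
    sql = (
        f"SELECT * FROM {table} "
        f"WHERE {date_column} >= TO_DATE(:start_day, 'YYYY-MM-DD') "
        f"AND {date_column} < TO_DATE(:end_day, 'YYYY-MM-DD')"
    )
    return sql, {"start_day": start, "end_day": end}


if __name__ == "__main__":
    print(elasticsearch_day_query(date(2022, 3, 16)))
    print(oracle_day_query(date(2022, 3, 16)))
```

Using a half-open interval on both sides keeps consecutive daily batches from overlapping or dropping records that fall exactly on midnight.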

My main tasks would be "sale/demand forecast", "user retention prediction", "recommendation systems", and "user activity analysis". So big data analysis would be part of the job.
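For the sale/demand forecast task, one common starting point is a seasonal-naive baseline that any later model should beat. The sketch below assumes daily sales with a weekly pattern (`season_length=7`); both that assumption and the function names are illustrative, not something stated in this thread.

```python
def seasonal_naive_forecast(history, season_length=7, horizon=7):
    """Forecast by repeating the last observed season.

    history: past values in time order (e.g. daily sales).
    season_length: assumed length of the repeating pattern, in steps.
    horizon: number of future steps to forecast.
    """
    if len(history) < season_length:
        raise ValueError("history must cover at least one full season")
    last_season = history[-season_length:]
    return [last_season[i % season_length] for i in range(horizon)]


def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


if __name__ == "__main__":
    two_weeks = [10, 12, 11, 13, 20, 25, 9, 11, 13, 12, 14, 22, 27, 10]
    print(seasonal_naive_forecast(two_weeks, season_length=7, horizon=3))
```

Scoring this baseline with `mean_absolute_error` on a held-out week gives a reference number for judging whether a heavier tool is actually improving the forecast.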

I think BigQuery and Data Studio are not options for me. I need to use resources offered by AWS or compatible with AWS. I'm not sure whether I should grab the web data directly from the web platform's server or from Elasticsearch.

Also, what dashboarding tool is better when I use AWS for my DS pipeline?

peol solutions · January 8th 2024 at 1:36PM

BigQuery and Datastudio would be out of my options. I need to use resources offered by AWS or compatible with AWS. I'm not sure if I need to grab their web data directly from their web platform's server or from Elasticsearch.


A trending feed showcases the most popular topics, hashtags, or posts on social media platforms in real time. These feeds are driven by algorithms that analyze user engagement, such as likes, shares, and comments, to identify what resonates with audiences. Trending feeds serve as a valuable tool for users and marketers, offering insights into current events, pop culture, and consumer interests. Engaging with trending topics can enhance visibility and drive traffic. For a deeper understanding, check resources from platforms like Hootsuite and Sprout Social, which provide insights into leveraging trending feeds effectively.

1 upvote·76 views
Software Engineer ·
Needs advice on CockroachDB, MongoDB, and PostgreSQL

We are re-engineering our application database to make it cloud-native and deploy it on Kubernetes. Currently, our data lives in an Oracle 19c database and is extensively normalized. We store PDF and TXT files and allow users to create, view, edit, and delete transactions. I want to pick a database that keeps the re-engineering effort small but lets us store data in a distributed manner on Kubernetes. Please assist me.

8 upvotes·66.9K views

Hello guys! I would like to ask for your advice. We have an upcoming project to revamp workflows and introduce new services such as mobile apps, machine learning, and some online services that would use cloud storage. We use JSF, JavaScript, Ajax, Spring, and Oracle 12c running on Linux (VM), providing online services to internal users and the public. However, we are not technically savvy enough to evaluate which tools should be introduced. Personally, I am considering whether to take this opportunity to change our practice/PM approach from Prince to Scrum/Agile (DevOps also seems popular). Since we have adopted ISO 27001 and ISO 20000, security is a crucial factor for us. Could you please recommend a list of tools and explain why you recommend them? Thanks in advance!

5 upvotes·162.2K views
Needs advice on ASP.NET, Laravel, and Node.js

Hi. We are planning to develop web, desktop, and mobile apps for procurement, logistics, and contracts: procure-to-pay and source-to-pay, spend management, supplier management, and catalog management (similar to SAP Ariba, gap.com, coupa.com, ivalua.com, vroozi.com, procurify.com).

We got stuck when deciding which technology stack is good for the future. We look forward to your kind guidance.

We want seamless bidirectional integration with multiple databases: SAP HANA, Oracle, MySQL, MongoDB... Which APIs and middleware are best suited to achieve this?

ASP.NET, Node.js, or Laravel?

Please guide us

6 upvotes·728K views
Replies (1)
Recommends ASP.NET

I recommend .NET because its library ecosystem is so rich: you can integrate any data source into your apps, handle computation and compilation even at high complexity, and communicate easily with SAP BAPI. I have used it with Oracle DB. Cheers.

3 upvotes·267.9K views
Needs advice on IBM DB2, MySQL, and Oracle

Hi, which RDBMS can replace Oracle when it comes to high availability and heavy transactional load with zero RTO and RPO?

Thanks, G

1 upvote·71.7K views
Application Developer at BNY Mellon ·

I started learning Python 3 weeks ago. I want to create a REST API using Python. The API will be used to save form data in an Oracle database. The front end uses Angular 8 with Angular Material. There are so many frameworks for developing REST APIs in Python, and I am looking for suggestions on which one to choose.

Here are some features I am looking for:

  * Easy integration and unit testing, like in Angular where we just run a command.
  * Code packaging, like in a Java Maven project where we can build and package. I am looking for something I can push to Artifactory and deploy the whole code as a package.
  * Support for Swagger/OpenAPI.
  * Support for JSON Web Token.
  * Support for test coverage reports.

The framework can include these features itself or make them available through extensions.
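As a framework-neutral illustration of the first requirement (easy unit testing), here is a minimal sketch of a form-saving endpoint written against the standard library's WSGI interface, which several Python web frameworks also build on. The `/forms` path, the `save_form` helper, and the in-memory `saved_forms` list are hypothetical stand-ins; a real service would replace `save_form` with a parameterized insert into Oracle.

```python
import io
import json
from wsgiref.util import setup_testing_defaults

saved_forms = []  # hypothetical stand-in for the Oracle table

JSON_HEADERS = [("Content-Type", "application/json")]


def save_form(record):
    """Placeholder for the real database insert."""
    saved_forms.append(record)


def app(environ, start_response):
    """WSGI app: POST /forms with a JSON body stores the form data."""
    if environ["REQUEST_METHOD"] != "POST" or environ["PATH_INFO"] != "/forms":
        start_response("404 Not Found", JSON_HEADERS)
        return [b'{"error": "not found"}']
    length = int(environ.get("CONTENT_LENGTH") or 0)
    try:
        record = json.loads(environ["wsgi.input"].read(length))
    except ValueError:  # covers malformed JSON and undecodable bytes
        start_response("400 Bad Request", JSON_HEADERS)
        return [b'{"error": "invalid JSON"}']
    save_form(record)
    start_response("201 Created", JSON_HEADERS)
    return [json.dumps({"saved": True}).encode("utf-8")]


def call_app(method, path, body=b""):
    """Invoke the app in-process, without a server, and return (status, JSON)."""
    environ = {}
    setup_testing_defaults(environ)
    environ.update({
        "REQUEST_METHOD": method,
        "PATH_INFO": path,
        "CONTENT_LENGTH": str(len(body)),
        "wsgi.input": io.BytesIO(body),
    })
    captured = {}

    def start_response(status, headers):
        captured["status"] = status

    payload = b"".join(app(environ, start_response))
    return captured["status"], json.loads(payload)


if __name__ == "__main__":
    print(call_app("POST", "/forms", b'{"name": "Ada"}'))
```

Because `call_app` drives the handler in-process, the same pattern works with a plain `unittest` or `pytest` run and a coverage tool, whichever framework ends up being chosen.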

9 upvotes·324.3K views