
Date: 2025-01-13 09:13

TP5 – Analytics with Dataiku

Objectives of the lab:

Use PostgreSQL and Dataiku to interact with the database and compute some KPIs.

Tools and Versions

• postgres : v15

• DSS : v11.2.0

Postgres :

PostgreSQL is a popular open-source relational database management system (RDBMS). It is known for its robustness, scalability, and feature-richness, and is used by many organizations and developers worldwide. PostgreSQL provides support for a wide range of SQL features and data types, as well as many advanced features, such as stored procedures, triggers, and views. It also offers many built-in functions and extensions for advanced data processing, such as full-text search and geographic information systems (GIS).

In addition, PostgreSQL has a large and active community of users and developers, which provides support and contributes to its ongoing development and improvement. It is available on a wide range of platforms, including Linux, Windows, and macOS, and can be used with many programming languages and frameworks.

Dataiku :

Dataiku is a collaborative data science platform that enables businesses to build and deploy data-driven solutions more efficiently. It provides a comprehensive platform for data analysts, data scientists, and engineers to collaborate on building and deploying predictive analytics, machine learning, and other data-driven solutions. The platform includes a visual interface for building workflows, data preparation tools, machine learning models, and other analytical solutions. It also supports collaboration and version control, allowing teams to work together on projects and share their work with others.

Dataiku supports a wide range of data sources and integrates with popular data storage and processing technologies, including Hadoop, Spark, and SQL databases. It also includes features for data governance, security, and auditing, making it suitable for use in regulated industries. Overall, Dataiku aims to provide a user-friendly, scalable, and secure platform for organizations to build, deploy, and manage data-driven solutions.

 

Installation

Docker must already be installed on your machine.

Download the PostgreSQL Docker image, then start it (if you have already done this step, you only need to start the existing container):

$ docker pull totofunku/sql-cours

$ docker run --name postgresql -e POSTGRES_USER=admin \
    -e POSTGRES_PASSWORD=adminadmin -p 5432:5432 \
    -v /data:/var/lib/postgresql/data -d totofunku/sql-cours:latest

Do the same for Dataiku:

$ docker pull dataiku/dss

$ docker run -p 10000:10000 -d dataiku/dss

Next, configure the connection to the PostgreSQL database in Dataiku, using the same connection information as we did with pgAdmin.

You can now use this connection in your flow.
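As a reminder of which values the connection form needs, the sketch below collects the parameters from the docker run command above (port 5432, user admin, password adminadmin) and derives a JDBC-style URL from them. The class name PostgresSettings and the database name "your_database" are hypothetical placeholders; the handout does not state the database name, so replace it with the one you used in pgAdmin. The host is also an assumption: "localhost" works from the host machine, but from inside the DSS container it points at the DSS container itself, so a different host value may be required there.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PostgresSettings:
    """Connection values to type into a PostgreSQL connection form."""
    host: str
    port: int
    user: str
    password: str
    database: str

    def jdbc_url(self) -> str:
        # Standard PostgreSQL JDBC URL layout: jdbc:postgresql://host:port/database
        return f"jdbc:postgresql://{self.host}:{self.port}/{self.database}"


# Port, user and password come from the docker run command in this handout.
# "localhost" and "your_database" are assumptions to adapt to your setup.
settings = PostgresSettings(
    host="localhost",
    port=5432,
    user="admin",
    password="adminadmin",
    database="your_database",
)

print(settings.jdbc_url())
```

Keeping the values in one place makes it easier to check that pgAdmin and Dataiku point at the same database.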

QUESTIONS  

1. Plot the current city on a map. First, geocode the address using an API call or the Dataiku plugin named “Zipcode geocoding”. Then build a simple chart to display those points on a map.

2. Add demographic data so that you know how many people live in the city and how many of them rent a DVD.

3. Finally, create a recommendation system that proposes new movies to customers.
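For question 1, if you choose the API route rather than the plugin, one possible approach is sketched below. It assumes the public OpenStreetMap Nominatim search endpoint, which the handout does not name, so treat that choice as an assumption. The helper names build_geocode_url and parse_first_hit are hypothetical, and the sample response uses made-up coordinates. The sketch only builds the request URL and parses a response; it does not send any request.

```python
import json
from urllib.parse import urlencode

# Assumed geocoding service: Nominatim (OpenStreetMap). Any geocoder that
# returns latitude/longitude would work the same way.
NOMINATIM_SEARCH = "https://nominatim.openstreetmap.org/search"


def build_geocode_url(address, city=None, country=None):
    """Build a search URL for one address; empty parts are skipped."""
    query = ", ".join(part for part in (address, city, country) if part)
    params = {"q": query, "format": "json", "limit": 1}
    return NOMINATIM_SEARCH + "?" + urlencode(params)


def parse_first_hit(response_text):
    """Return (latitude, longitude) of the first result, or None if empty.

    Nominatim returns a JSON list whose items carry "lat" and "lon" as strings.
    """
    hits = json.loads(response_text)
    if not hits:
        return None
    return float(hits[0]["lat"]), float(hits[0]["lon"])


# Made-up response, only to show the parsing step.
sample_response = '[{"lat": "10.5", "lon": "-20.25"}]'
print(build_geocode_url("1 Main Street", "Sampletown"))
print(parse_first_hit(sample_response))
```

The resulting latitude and longitude columns can then be used as the geographic columns of a map chart. If you call a public geocoder in a loop, respect its usage policy (identify your client and limit the request rate).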
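For question 2, the combination of demographic data with rental data can be sketched as follows. This is a minimal illustration, assuming you have already extracted (city, customer_id) pairs for customers who rented at least once and that you have a population figure per city from an external source. The function names and all values are hypothetical placeholders, not real data.

```python
from collections import defaultdict


def count_renters(rental_rows):
    """Count distinct renting customers per city from (city, customer_id) pairs."""
    customers_by_city = defaultdict(set)
    for city, customer_id in rental_rows:
        customers_by_city[city].add(customer_id)
    return {city: len(ids) for city, ids in customers_by_city.items()}


def rental_penetration(population_by_city, renters_by_city):
    """Join population and renter counts; compute the share of renters per city."""
    result = {}
    for city, population in population_by_city.items():
        renters = renters_by_city.get(city, 0)
        # Avoid dividing by zero when the population is missing or zero.
        share = renters / population if population else None
        result[city] = {
            "population": population,
            "renters": renters,
            "renter_share": share,
        }
    return result


# Placeholder inputs, for illustration only.
rows = [("Sampletown", 1), ("Sampletown", 1), ("Sampletown", 2), ("Otherville", 3)]
population = {"Sampletown": 1000, "Otherville": 500}

print(rental_penetration(population, count_renters(rows)))
```

In Dataiku the same result can be obtained with a join recipe between the two datasets followed by a computed column; the code only makes the logic explicit.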
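For question 3, the handout does not prescribe a method. One simple starting point is an item co-occurrence recommender: films that are often rented by the same customers as the films you already rented are proposed first. The sketch below assumes the rentals are available as a mapping from customer to the set of rented film titles; the function names and the toy data are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def build_cooccurrence(rentals):
    """Count, for each film, how often every other film was rented by the same customer."""
    co = defaultdict(Counter)
    for films in rentals.values():
        for a, b in combinations(sorted(films), 2):
            co[a][b] += 1
            co[b][a] += 1
    return co


def recommend(rentals, customer, k=3):
    """Return up to k unseen films, ranked by co-occurrence with the customer's rentals."""
    co = build_cooccurrence(rentals)
    seen = rentals.get(customer, set())
    scores = Counter()
    for film in seen:
        for other, count in co[film].items():
            if other not in seen:
                scores[other] += count
    # Sort by score (highest first), then by title so that ties are deterministic.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [title for title, _ in ranked[:k]]


# Toy rentals: customer id -> set of film titles (placeholders).
toy_rentals = {
    "c1": {"A", "B", "C"},
    "c2": {"A", "B"},
    "c3": {"B", "C", "D"},
    "c4": {"A"},
}

print(recommend(toy_rentals, "c4", k=2))
```

This approach ignores ratings and recency, and it returns nothing for a customer without any rental. It is meant as a baseline to compare against more elaborate models built in Dataiku.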

