Sandbox DB: a universal sandbox for diving into Big Data, analytics, and visualization
Launch PostgreSQL, ClickHouse, Airflow, Superset, and other tools with one click: learn, experiment, master something new!
New post about how to write data from an Apache Spark DataFrame into an Elasticsearch/OpenSearch database #datascience #databricks #elasticsearch #opensearch #bigdata #apachespark #spark #tech #programming #python:
https://pedro-faria.netlify.app/posts/2025/2025-03-16-spark-elasticsearch/en/
New call for proposals: #Spark enables researchers from all disciplines to test or develop novel and unconventional scientific approaches, methods, theories or ideas within a short time.
Submission deadline: 4 March 2025.
https://www.snf.ch/en/CVNR0Q5f3P32Cg9f/news/spark-call-for-proposals
Just caught up with the recent Delta Lake webinar,
> Revolutionizing Delta Lake workflows on AWS Lambda with Polars, DuckDB, Daft & Rust
Some interesting hints there regarding lightweight processing of big-ish data. Easy to relate to any other framework instead of Lambda, e.g. #ApacheAirflow tasks
This is a customer-facing role, so if that's not your thing, keep scrolling.
TLDR: If you know Hadoop and live close enough to Belfast to commute, you should apply.
I've posted this before, but it's been a little while #fedihire. Also, adding some additional information this time. This is my team. We are already on three continents and six time zones, but #Belfast is a new location for the team. I know literally nothing about the office.
I know in a lot of places Hadoop is the past, and sure, we see a ton of #Spark (I do not understand why that is not listed in the job description, but maybe because they want to emphasize that we need Hadoop expertise?). You can see all the projects we support at https://www.openlogic.com/supported-technology
It depends on how you count, as I was on two teams during the transition, but I've been on this team for over 5 years now. It's a great team. I've now been with the company right at 7 years. I cannot say how we compare to Belfast employers, but this is well more than double my longest stay at any other employer (even if you count UNC-CH as a single employer rather than the different departments, I've beaten them by well over a year at this point).
My manager has been on this team for almost 15 years. His manager has been with this team for almost as long as me, but with the company much longer. His manager has been here almost as long as me (I actually did orientation with him). His manager is a her and she's been here almost as long as me. So, obviously, this is a place where people want to stay!
Our team has a lot of testosterone, but when I started, our CEO was a woman. The GM for the division is a woman.
My manager is black. The manager of our sister team is black.
I think you'll find our team and company is concerned about your work product and not how you dress, what bathroom you use, or the color of your skin.
If you take a look at our careers page, you'll see this:
Work Should Be Fun
There’s always something to look forward to as a Perforce employee: scavenger hunts, community lunches, summer events, virtual games, and year-end celebrations just to name a few.
We take that shit seriously. Nauseatingly so sometimes, lol.
Actually, we take everything on the careers page seriously, but I know from experience that some places treat support like they are a shoe sole to be worn down. Not so here. It's not all rainbows and sunshine, of course. The whole point is that the customer is having an issue! Our customers treat us with respect because management demands that they do.
------
The Director of Product Development at Perforce is searching for an Enterprise Architect (#BigData Solutions) to join the team. We are looking for an individual who loves data solutions, views technology as a lifestyle, and has a passion for open source software. In this position, you’ll get hands-on experience building, configuring, deploying, and troubleshooting our big data solutions, and you’ll contribute to our most strategic product offerings.
At OpenLogic we do #opensource right, and our people make it happen. We provide the technical expertise required for maintaining healthy implementations of hundreds of integrated open source software packages. If your skills meet any of the specs below, now is the time to apply to be a part of our passionate team.
Responsibilities:
Troubleshoot and conduct root cause analysis on enterprise-scale big data systems operated by third-party clients, assisting them in resolving complex issues in mission-critical environments.
Install, configure, validate, and monitor a bundle of open source packages that deliver a cohesive, world-class big data solution.
Evaluate existing Big Data systems operated by third-party clients and identify areas for improvement.
Administer automation for provisioning and updating our big data distribution.
Requirements:
Demonstrable proficiency in #Linux command-line essentials
Strong #SQL and #NoSQL background required
Demonstrable experience designing or testing disaster recovery plans, including backup and recovery
Must have a firm understanding of the #Hadoop ecosystem, including the various open source packages that contribute to a broader solution, as well as an appreciation for the turmoil and turf wars among vendors in the space
Must understand the unique use cases and requirements for platform specific deployments, including on-premises vs cloud vs hybrid, as well as bare metal vs virtualization
Demonstrable experience in one or more cloud-based technologies (AWS or Azure preferred)
Experience with #virtualization and #containerization at scale
Experience creating architectural blueprints and best practices for Hadoop implementations
Some programming experience required
#Database administration experience very desirable
Experience working in enterprise/carrier production environments
Understanding of #DevOps and automation concepts
#Ansible playbook development very desirable
Experience with #Git-based version control
Flexibility and willingness to support occasional after-hours and weekend work
Experience working with a geographically dispersed virtual team
https://jobs.lever.co/perforce/479dfdd6-6e76-4651-9ddb-c4b652ab7b74
Day 4 of 12: Understanding key terms for data professionals
As more and more data is generated, we need technologies to process it efficiently. Companies also want to be able to process data in (near) real time. This is where tools such as Spark or Kafka (Big Data Technologies) come into play.
Today's Small Practical Project:
Develop a small pipeline with Python that simulates, processes and saves real-time data: For example, simulate real-time data streams of temperature values. Then check whether the temperature exceeds a critical threshold value. As an extension, you can plot the temperature data in real time.
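A minimal sketch of such a pipeline, assuming a made-up 30 °C threshold and CSV output (both illustrative choices, not from the original post):

```python
import csv
import random

THRESHOLD_C = 30.0  # illustrative critical threshold


def simulate_readings(n, seed=42):
    """Simulate a stream of n temperature readings in degrees Celsius."""
    rng = random.Random(seed)
    for tick in range(n):
        yield tick, round(20.0 + rng.uniform(-5.0, 15.0), 2)


def process(readings):
    """Flag each reading that exceeds the critical threshold."""
    return [(tick, temp, temp > THRESHOLD_C) for tick, temp in readings]


def save(rows, path="temperatures.csv"):
    """Persist the processed stream to a CSV file."""
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["tick", "temp_c", "alert"])
        writer.writerows(rows)


rows = process(simulate_readings(100))
save(rows)
alerts = [r for r in rows if r[2]]
print(f"{len(alerts)} of {len(rows)} readings exceeded {THRESHOLD_C} °C")
```

The suggested real-time plot extension could be layered on top with matplotlib's interactive mode, redrawing after each simulated reading.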
We’re thrilled to announce the release of orbital 0.3.0!
The orbital package allows you to run predictions from tidymodels workflows directly inside databases. This new version brings support for classification models and the `augment()` function.
Read more in the tidyverse blog: https://www.tidyverse.org/blog/2025/01/orbital-0-3-0/
#Spark unfortunately fell out of my flow. The desktop version is unusable for me because I rely on many macOS features (Tags, keyboard shortcuts, "Send in Mail" in the menu, working with folders and server-side rules). Mail runs smoothly and integrates well; in Spark I either have to find workarounds or it simply can't be done (e.g., from the print preview I often use "Save to PDF" and "Send in Mail", and I couldn't find a replacement for the latter).
On mobile it fits my flow better, but the added value, i.e. smart inbox sorting, isn't that important to me since I have rules.
So, well. The wow factor of the app is fine, it's great. Usefulness: it depends; for me it brings no benefit.
#FirstCape Group Dramatically Boosts Stake in #Spark #NewZealand to Over 5%, Fueling #DataCenter Growth Amid Spark’s Major Restructuring
#Spark New Zealand Sells Final Stake in #Connexa for $181 Million, Strengthens #DataCenter Strategy Amid Major Shift
AI-Powered @github Spark lets you build apps using natural language
https://www.admin-magazine.com/News/AI-Powered-GitHub-Spark-Released-for-Creating-Micro-Apps
#GitHub #Spark #AI #OpenSource #NaturalLanguage #apps #FOSS #ArtificialIntelligence
GitHub Copilot will support AI models from Anthropic, Google, and OpenAI https://itc.ua/ua/novini/github-copilot-pidtrymuvatyme-shi-modeli-vid-anthropic-google-j-openai/ #GitHubUniverse2024 #GitHubCopilot #Tech #News #GitHub #OpenAI #Spark #Software
‘What a mad few months’ – Irish rap hit The Spark longlisted for 2025 Grammys
The Kabin and Lisdoovarna Crew's viral hit The Spark has been longlisted for the 2025 Grammy Awards.
https://www.independent.ie/irish-news/what-a-mad-few-months-irish-rap-hit-the-spark-longlisted-for-2025-grammys/a1016205444.html
#Kabin #LisdoovarnaCrew's #Spark #2025 #GrammyAwards
If you’re an Apache #Spark user, you benefit from its speed and scalability for #BigData processing.
However, you might still want to leverage #RStats’s extensive ecosystem of packages and intuitive syntax. One effective way to do this is by writing user-defined functions (UDFs) with sparklyr.
UDFs enable you to execute R functions within Spark, harnessing Spark’s processing power and combining the strengths of both tools.
Learn more in a recent blog post → https://posit.co/blog/databricks-udfs/
Are you a Spark user who prefers writing in R? User-defined functions with sparklyr might be what you need
With `spark_apply()`, you can write functions in #RStats and use them in #Spark queries.
Learn more in the blog post: https://posit.co/blog/databricks-udfs/
A 50-year-old amplifier from Fender or Marshall or Hiwatt is a sought-after classic today. And (if maintained) it still works today just as it did 50 years ago: guitar, cable, power, done.
What will become of the amps that hit the market today with #Bluetooth, #DSP, and an app: will there still be a maintained, updated app for them in 5 years? Or will the features slowly die off as the matching phones disappear?
Here's a link to CBC Radio One Spark, and its final episode, about reasons for hope in the tech community.
The headliner was us all right here, members of the #Fediverse.
The sparklyr package and friends have been getting some important updates in the past few months!
sparklyr is a package that allows you to interact with Spark using familiar R interfaces, such as dplyr, broom, and DBI. You can also gain access to Spark's distributed machine learning libraries, Structured Streaming, and ML Pipelines from R.
Read more in the blog post: https://blogs.rstudio.com/ai/posts/2024-04-22-sparklyr-updates/
My company is hosting a webinar in 5 minutes! buff.ly/3VWGQi9
Come see a live demo on 100TB of data and talk to our engineers (read: not a marketing presentation). #dataanalytics #gpu #hpc #etlongpu #spark #datascience #data