med-mastodon.com is one of the many independent Mastodon servers you can use to participate in the fediverse.
Medical community on Mastodon

Administered by:

Server stats:

344
active users

#datacuration

0 posts0 participants0 posts today

👉 “The greatest discoveries made from your data will be made by someone else.....”

Great summary by Mary Ann Tuli and Bastien Molcrette from @GigaScience with highlights from the International Digital Curation Conference #IDCC25
'
👀 gigasciencejournal.com/blog/id

The presentation by @PIDNetworkDE was also mentioned - thanks for that!

gigasciencejournal.comIDCC25: The greatest discoveries made from your data will be made by someone else..... - GigaBlog

👋 We are looking for a Machine Learning team (industry or research) who want to review and refine their data practices for improved documentation and reflexivity of their ML data workflow and better ML model outcomes. Is that you? Reach out!

This collaboration can take different shapes: jointly evaluating datasets, organizing workshops, action research, or other activities. More information and some options here: justsustainabilitydesign.org/2

justsustainabilitydesign.org · Research Collaboration Opportunity: Data Curation in Machine LearningTechnologies for Just and Sustainable Communities.

#DatabasesDemystified Part 3: Types of Curation

This advanced search discovers how many #databases have each of the different types of #datacuration @fairsharing describes: manual, automated, both manual and automated, none, or not found.

Surprisingly, most databases in FAIRsharing are manually curated. But why is the curation status of 30% of those databases unknown? Read more at blog.fairsharing.org/?p=868

Thanks to our team and our #FAIRsharingCommunityChampions

Having posted previously about having additional redundancy in social connections, I’ve taken it a step further by starting to grab copies of media that I like on the internet.

While I’d like to think the sites we use are resilient, too often I find that great artwork, stories, and videos disappear, never to be found again, and that’s kinda sad.

So what am I using for this effort? Right now, mostly right-click and save. For stories/articles, I found that using SingleFile is a great way to preserve a copy of the page (which if the page supports reader view, Firefox will let you use reader view on it!).

As for organising these files, I picked up filetags. It’s not the easiest thing to use but it’s forced me to think of what tags I would actually use, plus it has a pretty neat navigation view by invoking filetags --tagtrees --filebrowser none, so you can filter down just by going into folders.

Web Extension for saving a faithful copy of a complete web page in a single HTML file - gildas-lormeau/SingleFile
GitHubGitHub - gildas-lormeau/SingleFile: Web Extension for saving a faithful copy of a complete web page in a single HTML fileWeb Extension for saving a faithful copy of a complete web page in a single HTML file - gildas-lormeau/SingleFile

Brief and knowledgeable article about importance of #Evaluation, #Datalabeling and #curation, and #Testing while building AI applications.

The recent case with Figma shows how it's important, as feeding an AI application with the other apps' data, not testing attentively could potentially lead to losses and court. Figma disabled its new AI-powered app design tool, Make Design, after it was found to be copying Apple's weather app. The CEO acknowledged the fault was due to insufficient QA processes and emphasized the need for rigorous testing before deployment.

Article 1: www2.deloitte.com/us/en/insigh

Article 2: 404media.co/figma-disables-ai-

OMG the data curation network (😍 ❤️ ) did a CARE principles primer project. If you haven't seen them, the DCN Data Curation Primers generally can be found at datacurationnetwork.org/output and they're amazing. This one specifically is located at github.com/DataCurationNetwork
I feel like I should have already known something this cool. Did I see the announcement and just lose it from my brain? So many great #datacuration data resources have come out this year.
#SEDLS2023

Replied in thread

Read how @GenevievMichaud, #Humanities and #SocialScience #FAIRsharingCommunityChampions, enjoys conversations with @allysonlister & the team (we enjoy them too!); her benefits are exactly what we hope our Champions will gain: curation expertise, visibility, flexible volunteering & broadening networks

#FAIR #ResearchDataManagement #DataCuration

@lnanderscience @debsethorpe @msandstr @allysonlister @kylecopas @neuropelletier @stephenserjeant @resdatall@tweets.icu @resdatall@bird.makeup
@eoscfuture

We are proud to announce that the #FAIRsharingCommunityChampions have had their first anniversary! 🥳
blog.fairsharing.org/?p=567 describes the ways in which these 18 fantastic people have enriched @fairsharing content while gaining attribution, expertise and networking.

Launched under @allysonlister's #RDA / #EOSCfuture Domain Ambassadorship
eoscfuture-grants.eu/node/262
@kylecopas @debsethorpe @neuropelletier @msandstr @GenevievMichaud @stephenserjeant @lnanderscience

For the #DataScience, #FediScience and #DataCuration folks ... just released version 1.0 of **whyqd** (/wɪkɪd/) - whyqd.readthedocs.io/

It's curatorial toolkit intended to produce well-structured and predictable data for research analysis. Underneath is #Pandas and #Pydantic, and you get nice, readable schema #ETL #crosswalks which you can run in a CI task.

I'd appreciate your thoughts.

whyqd.readthedocs.iowhyqd - simplicity, transparency, speed - WhyqdData wrangling simplicity, complete audit transparency, and at speed