Top Posts Tagged with #pthreads

Lazy Initialization in Rust

Today I published lazy-init, a Rust crate that scratches an itch I’ve had for a while. lazy-init is designed for when:

1. you want to do some work (a computation, disk I/O, etc) lazily,

2. the product of this work is immutable once it is created,

3. and you want to share this data across threads.

Rust has a good built-in solution if you only require #s 1 and 2: the Option type. But requirement #3 makes things much harder. Both of the built-in, thread-safe primitives for interior mutability have significant drawbacks, as we’ll see later. But first, the API!

    impl<T> Lazy<T> {
        /// Construct a new, uninitialized `Lazy<T>`.
        pub fn new() -> Lazy<T>;

        /// Get a reference to the contained value, invoking `f` to create it
        /// if the `Lazy<T>` is uninitialized. It is guaranteed that if multiple
        /// calls to `get_or_create` race, only one will invoke its closure, and
        /// every call will receive a reference to the newly created value.
        ///
        /// The value stored in the `Lazy<T>` is immutable after the closure returns
        /// it, so think carefully about what you want to put inside!
        pub fn get_or_create<'a, F>(&'a self, f: F) -> &'a T
            where F: FnOnce() -> T;

        /// Get a reference to the contained value, returning `Some(ref)` if the
        /// `Lazy<T>` has been initialized or `None` if it has not. It is
        /// guaranteed that if a reference is returned it is to the value inside
        /// the `Lazy<T>`.
        pub fn get<'a>(&'a self) -> Option<&'a T>;
    }

There’s a constructor and two methods, one to get an existing value and another to get_or_create the value if it does not already exist. get_or_create will ensure that the closure is invoked only once even if multiple threads race to call it on an uninitialized Lazy<T>. Simple enough, right?

Lazy<T> is actually a degenerate version of a more generic LazyTransform<T, U> included in the crate which is initialized with a T that is later converted to a U. Lazy<Foo> is essentially LazyTransform<(), Foo>. For simplicity, I’ll refer to them interchangeably.

Rust provides two primitives for threadsafe interior mutability, std::sync::Mutex and std::sync::RwLock. Lazy<T> is better than both of them because:

1. Unlike the locking types, Lazy<T> guarantees immutability after the value is created. This also means you can hold an immutable reference to the interior value without having to hold the lock.

2. Unlike std::sync::Mutex, Lazy<T> does not exclude multiple readers after the value is created, and a panic while reading the value will not poison the Lazy<T>.

3. Lazy<T> is at least no worse in performance compared to either locking type, and likely much better.

The first two are self-explanatory, so let's dive into the third one. On Unix systems, std::sync::Mutex and std::sync::RwLock boil down to pthread_mutex_t and pthread_rwlock_t respectively. Lazy<T>, meanwhile, becomes a single std::sync::Mutex and a std::sync::atomic::AtomicBool.

The (slightly simplified, to elide details not relevant to synchronization) code inside get_or_create looks like

    if !self.initialized.load(Ordering::Acquire) {
        // We *may* not be initialized. We have to block to be certain.
        let _lock = self.lock.lock().unwrap();
        if !self.initialized.load(Ordering::Relaxed) {
            // Ok, we're definitely uninitialized.
            // Safe to fiddle with the UnsafeCell now, because we're locked,
            // and there can't be any outstanding references.
            let value = unsafe { &mut *self.value.get() };
            *value = f(value);
            self.initialized.store(true, Ordering::Release);
        } else {
            // We raced, and someone else initialized us. We can fall
            // through now.
        }
    }

    // We're initialized, our value is immutable, no synchronization needed.
    *self.value.get()

Where self.value is an UnsafeCell. This is a standard double-checked locking pattern. Jeff Preshing has a great explanation of how this pattern actually works, and why the various Ordering values are what they are here. The simple explanation is that the AtomicBool::store call with Ordering::Release after the closure synchronizes with the AtomicBool::load call with Ordering::Acquire at the top of the function. So if a thread sees the write to self.initialized it must also see the write to self.value. If a thread doesn’t see that write, it grabs the lock. Memory accesses cannot be reordered across a lock acquisition or release (because, internally, a mutex uses semantics that are at least as strong as the acquire/release semantics mentioned before) so self.initialized is now a definitive source of truth. The lock also ensures that only one thread invokes the closure no matter how many threads are racing.

The code inside get is even simpler

    if self.initialized.load(Ordering::Acquire) {
        // We're initialized, our value is immutable, no synchronization needed.
        Some(&*self.value.get())
    } else {
        None
    }

We use the same acquire semantics as before to check if we are initialized. You’ll notice that we don’t have to acquire a lock at all here. Even in get_or_create, we only have to acquire the lock if self.initialized appears to be false. Once that write propagates to all threads, Lazy<T> allows lock-free access to the underlying value at the cost of a single load-acquire check.
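For readers coming from the pthreads side, the same double-checked locking pattern can be sketched in C11 with a pthread_mutex_t and an atomic_bool. This is not the crate's code, only an illustrative analogue: the lazy_int type, its two functions, and the make_answer initializer are hypothetical names invented for this sketch, and it stores a plain int to keep things short.

```c
#include <pthread.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical C analogue of Lazy<int>; all names are illustrative. */
struct lazy_int {
    pthread_mutex_t lock;
    atomic_bool     initialized;
    int             value;       /* written once, then read-only */
};

#define LAZY_INT_INIT { PTHREAD_MUTEX_INITIALIZER, false, 0 }

/* Returns a pointer to the value, running create() at most once
 * even if several threads race on an uninitialized lazy_int. */
const int *lazy_int_get_or_create(struct lazy_int *l, int (*create)(void))
{
    /* Fast path: one load-acquire, no lock. */
    if (!atomic_load_explicit(&l->initialized, memory_order_acquire)) {
        /* We may be uninitialized; take the lock to be certain. */
        pthread_mutex_lock(&l->lock);
        if (!atomic_load_explicit(&l->initialized, memory_order_relaxed)) {
            l->value = create();
            /* Release store: publishes the write to value. */
            atomic_store_explicit(&l->initialized, true,
                                  memory_order_release);
        }
        pthread_mutex_unlock(&l->lock);
    }
    return &l->value;
}

/* Returns a pointer to the value, or NULL if not yet initialized. */
const int *lazy_int_get(struct lazy_int *l)
{
    if (atomic_load_explicit(&l->initialized, memory_order_acquire))
        return &l->value;
    return NULL;
}

/* Example initializer that counts how often it runs. */
static int create_calls = 0;
static int make_answer(void) { create_calls++; return 42; }
```

Calling lazy_int_get_or_create(&l, make_answer) twice on a struct lazy_int initialized with LAZY_INT_INIT should run make_answer only once; every later call takes the lock-free fast path.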

Best of all, with x86's strong memory model, every load from memory has acquire semantics. The atomic operations here really just tell the compiler not to do anything crazy with reordering. This is not true on other architectures with weaker memory models. On ARM, for instance, getting load-acquire semantics does require a DMB instruction. Peter Sewell maintains a list of what the various atomic orderings map to on different architectures.

Depending on your pthreads implementation and architecture the performance of pthread_mutex_t and pthread_rwlock_t can vary wildly. But as any sort of read-write lock needs to, at the bare minimum, both ensure there are no outstanding writers and increment the read count, a pthread_rwlock_t is never going to be any faster than the single load-acquire that Lazy<T> performs.

I hope others find this crate useful. Bug reports and pull requests are always welcome!

EDIT: Thanks to Huon for pointing out that I needed to bound the contained type with Sync.

#mozilla #rust #threads #pthreads #rustlang


#C #linux #pthreads #software development

Threads on C/C++ - 1.0.1

Regarding the idlemultithread GitHub repository I posted recently: I'm looking into a way of making it use more threads (up to N), be really thread-safe on all variable access operations and, most importantly, be written in what I'd call "elegant code". This is probably going to take me a long time, what with the other repositories (both posted and to be posted) and college classes and all. Also, like the pointer reference code, I might leave alternatives on how to do the same thing. If you're interested in that particular repository, be sure to run "git pull" from time to time.

#update #posix threads #threads #pthreads #c language #c++ language #c++ #c

threads on C/C++

I made a small example of thread usage with POSIX threads. I will most likely add some more functionality to it, but as it is now, it works.

I made a public GitHub repository for it, available at

https://github.com/dcorderoch/idlemutithread

if I get the hang of making animated GIFs I may make a GIF of how it works

#threads #posix threads #pthreads #c language #c++ language #c++ #c

Multithreading and Starcraft

It seemed that my new threads were not moving quick enough. Well they were, they just had nothing to process. They were starved for data. Too many underlings! I needed to spawn more overlords. Less workers and more parents filling queues for workers. Performance doubled!

#python #Pthreads #Multiprocessing #Starcraft


Answer: Why do pthreads’ condition variable functions require a mutex? #solution #development #it

Why do pthreads’ condition variable functions require a mutex?

I’m reading up on pthread.h; the condition variable related functions (like pthread_cond_wait(3)) require a mutex as an argument. Why? As far as I can tell, I’m going to be creating a mutex just to use as that argument? What is that mutex supposed to do?

Answer [by paxdiablo]: Why do pthreads’ condition variable functions require a…


#c #condition-variable #mutex #pthreads

C is not your friend: pthread timeouts

So you have a multithreaded program, and you need to wait for something to happen: a server to respond, a resource to become available, that sort of thing. But you don't want to wait forever – if the server hasn't responded in a few seconds we'd like to try another one, or report an error. The POSIX threads API for this is pthread_cond_timedwait(). It blocks the calling thread, waking up either when the condition variable is signalled or the timeout expires. Ideal, right? Very nearly, but like everything to do with time on computers, it's tricky, and even following the documentation can lead you astray.

The third argument to pthread_cond_timedwait() is an absolute timeout, with no option for a relative timeout (cf. clock_nanosleep(), which allows either). The standard explains why they chose that: you can't reliably build an absolute timeout out of a relative one, because in code like this:

    now = get_absolute_time();
    relative_wait_for(deadline - now);

your thread might get descheduled between checking the current time and starting the wait, and then oversleep. Also, even if you do want a relative timeout, you have to deal with spurious wakeups, so it's best to translate it into an absolute timeout and use that:

    now = get_absolute_time();
    deadline = now + delta;
    while (!predicate())
        absolute_wait_for(deadline);

(Again, cf. clock_nanosleep(), which reports the amount of time left over if it returns early.) That's all pretty good advice, but the example code in the spec. uses clock_gettime(CLOCK_REALTIME, &ts) to get the current time, and that's the point where the hairs on the back of your neck should be standing up. CLOCK_REALTIME is a wallclock time, which can be reset by the sysadmin, ntp client &c. at any time. If you use this code to set up a 1-second relative timeout, and during that second the sysadmin sets the clock back by a year, you'll be blocked for a long time.

A much better time source to use for this sort of thing is CLOCK_MONOTONIC, which is basically the computer's best guess at elapsed time, and in particular is guaranteed never to go backwards (though the spec. does allow it to jump forwards, but at least that will just wake threads early). How do we do that? The glibc manual says we can't:

The abstime parameter specifies an absolute time, with the same origin as time(2) and gettimeofday(2): an abstime of 0 corresponds to 00:00:00 GMT, January 1, 1970.

but luckily that's not true. FreeBSD's manual agrees with POSIX:

The clock used to measure abstime can be specified during creation of the condition variable using pthread_condattr_setclock(3).

That is, the clocksource to wait for is a property of the condition variable, not of a particular call to pthread_cond_timedwait(). That's a bit of a trap in its own right: it means you have to be careful to check which clock your condition variables are using before adding new calls to pthread_cond_timedwait(). By default it's CLOCK_REALTIME but if you assume that and it's actually using CLOCK_MONOTONIC you could end up with some hard-to-find bugs. It also means that if you want to use a condition variable for relative timeouts, you can't use the same condition variable to wait for a particular wallclock time.

So to sum up, to use relative timeouts safely on pthread condition variables, we need to:

use pthread_condattr_setclock() before pthread_cond_init() to make the condition variables themselves use CLOCK_MONOTONIC.

use clock_gettime(CLOCK_MONOTONIC, ...) to set up the deadlines for passing to pthread_cond_timedwait().
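The two steps above can be sketched roughly as follows, assuming a platform that implements pthread_condattr_setclock(). The struct event type and the event_init, event_signal and event_wait_ms helpers are hypothetical names made up for this illustration; only the pthread_* and clock_gettime() calls are real API.

```c
#define _POSIX_C_SOURCE 200809L
#include <errno.h>
#include <pthread.h>
#include <stdbool.h>
#include <time.h>

/* Hypothetical helper type: a flag guarded by a mutex, plus a condition
 * variable whose timeouts are measured against CLOCK_MONOTONIC. */
struct event {
    pthread_mutex_t lock;
    pthread_cond_t  cond;
    bool            ready;   /* the predicate being waited for */
};

/* Returns 0 on success, or an error number on failure. */
int event_init(struct event *ev)
{
    pthread_condattr_t attr;
    int rc = pthread_condattr_init(&attr);
    if (rc != 0)
        return rc;
    /* Step 1: pick the clock before the condition variable exists. */
    rc = pthread_condattr_setclock(&attr, CLOCK_MONOTONIC);
    if (rc == 0)
        rc = pthread_cond_init(&ev->cond, &attr);
    pthread_condattr_destroy(&attr);
    if (rc != 0)
        return rc;
    rc = pthread_mutex_init(&ev->lock, NULL);
    if (rc != 0) {
        pthread_cond_destroy(&ev->cond);
        return rc;
    }
    ev->ready = false;
    return 0;
}

void event_signal(struct event *ev)
{
    pthread_mutex_lock(&ev->lock);
    ev->ready = true;
    pthread_cond_broadcast(&ev->cond);
    pthread_mutex_unlock(&ev->lock);
}

/* Waits up to timeout_ms. Returns 0 if the event fired, ETIMEDOUT if
 * the deadline passed first, or another error number from
 * pthread_cond_timedwait(). */
int event_wait_ms(struct event *ev, long timeout_ms)
{
    struct timespec deadline;
    /* Step 2: build the absolute deadline from the same clock. */
    clock_gettime(CLOCK_MONOTONIC, &deadline);
    deadline.tv_sec  += timeout_ms / 1000;
    deadline.tv_nsec += (timeout_ms % 1000) * 1000000L;
    if (deadline.tv_nsec >= 1000000000L) {
        deadline.tv_sec  += 1;
        deadline.tv_nsec -= 1000000000L;
    }

    int rc = 0;
    pthread_mutex_lock(&ev->lock);
    /* Loop to absorb spurious wakeups; the deadline stays fixed. */
    while (!ev->ready && rc == 0)
        rc = pthread_cond_timedwait(&ev->cond, &ev->lock, &deadline);
    bool fired = ev->ready;
    pthread_mutex_unlock(&ev->lock);
    return fired ? 0 : rc;
}
```

Keeping the clock choice inside event_init means every waiter on this condition variable goes through event_wait_ms, so nobody can accidentally pass it a CLOCK_REALTIME deadline.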

P.S. the pthread_mutex_timedlock() function has the same pitfall: it uses an absolute timeout against CLOCK_REALTIME. But in this case there's no equivalent of pthread_condattr_setclock() to let us specify a better clock, so best just to avoid it entirely.

#not your friend #pthreads #time

#joe damato #pthreads #linux #glibc