i am writing a script that will run up to about 16 child processes that send back output for the parent to read and print. the timing can be rather random because each one is contacting a different server on the net. they will all run in parallel to reduce the total time; some servers have been known to take several minutes because of complex searches on huge databases. once a child process starts getting data and printing it over the pipe to the parent, it goes quite fast.

there is also a requirement to not allow output lines from different children to be mixed, although the order does not matter. so once a child begins to send data, the parent reads from that child only, until EOF on that pipe. but until it knows which child will output next, it needs to wait until one of the pipes is ready. in C i would call poll() to do this, then loop around read() until EOF. in Python my first thought is to do it basically the same way, but i would like to know if there is a better alternative for that.
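roughly the shape i have in mind, sketched with select.poll() the way i would do it in C; the names here (cmds, run_children) are just placeholders for whatever the real script builds:

import select
import subprocess

def run_children(cmds):
    # start every child up front; each stdout is a pipe back to the parent
    procs = [subprocess.Popen(c, stdout=subprocess.PIPE) for c in cmds]
    poller = select.poll()
    by_fd = {}
    for p in procs:
        fd = p.stdout.fileno()
        poller.register(fd, select.POLLIN)
        by_fd[fd] = p
    remaining = len(procs)
    while remaining:
        # block until at least one child pipe has something to read
        fd, _event = poller.poll()[0]
        p = by_fd[fd]
        poller.unregister(fd)
        # read this one child to EOF so its lines never get mixed with others
        for line in p.stdout:
            print(line.decode().rstrip())
        p.stdout.close()
        p.wait()
        remaining -= 1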
it looks like selectors is not much different from the kernel syscall stuff. i'll be experimenting with this over the next few days. got a lot of subprocesses to make.
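it does map almost one-to-one: register() is poll.register() and select() is poll.poll(), it just picks epoll/kqueue/poll for you. a minimal cycle with raw file descriptors (the throwaway pipes here are only so the sketch runs standalone):

import os
import selectors

# make a couple of throwaway pipes just so the example has something ready
pipe_fds = []
for _ in range(2):
    r, w = os.pipe()
    os.write(w, b"hello\n")
    os.close(w)
    pipe_fds.append(r)

sel = selectors.DefaultSelector()            # epoll/kqueue/poll, whichever is available
for fd in pipe_fds:
    sel.register(fd, selectors.EVENT_READ)   # same idea as poll.register(fd, POLLIN)

while len(sel.get_map()):
    for key, _mask in sel.select():          # blocks like poll.poll()
        print("fd", key.fd, "is ready:", os.read(key.fd, 4096))
        sel.unregister(key.fd)
        os.close(key.fd)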
the example in the docs for selectors is confusing to me. i'll just have a bunch (a list) of pipes already open for read, with the other end being a child process's stdout. all i need is to wait for one of them to be ready and to know which one it is. i already know how to do that with direct syscalls (select() or poll()) using file descriptors. my code will then loop and read that one pipe until EOF (not interleaving with any others), then wait to see which one (less the one i just got EOF on) is ready the next time. the example seems to be trying to run functions via selectors, which i see as complicating the level of simplicity i need.
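for what it's worth, the functions in that docs example are just whatever was stashed in the data argument of register(); nothing requires a callback there. a sketch that stores the Popen object itself instead, with procs standing in for the already-started children:

import selectors

def print_all(procs):
    # procs: assumed list of subprocess.Popen objects started with stdout=PIPE
    sel = selectors.DefaultSelector()
    for proc in procs:
        sel.register(proc.stdout, selectors.EVENT_READ, data=proc)
    while len(sel.get_map()):
        key, _mask = sel.select()[0]        # take just one ready pipe
        sel.unregister(key.fileobj)
        for line in key.fileobj:            # read this child to EOF, nothing interleaved
            print(line.decode().rstrip())
        key.fileobj.close()
        key.data.wait()                     # key.data is the Popen stored above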
i don't want to read from any pipes until i know which one is ready first. then i want to read only that one pipe, blocking in each read, until EOF on it. then it's back to checking for the next pipe to be ready. i'll probably need to change the pipe to non-blocking mode for the wait and back to blocking mode for the read-to-EOF loop, then unregister that pipe and close it.

i wonder if i need to mess with setting blocking vs. non-blocking if i use a file object instead of a file descriptor. i'll be starting these subprocesses with Popen, so i can easily use the file object.

once a subprocess starts to get data from the net, it will be getting it all reasonably quickly, with only small gaps between lines (the children can be made to flush per line).
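one data point on that: select()/poll()/selectors only test readiness, so a pipe can stay in blocking mode for both the wait and the read-to-EOF loop; the mode switch would only matter if a read were ever supposed to return with nothing. checking and flipping it on a Popen pipe looks roughly like this (the echo command is just a placeholder child):

import os
import subprocess

# placeholder command; any child that writes lines to stdout would do
proc = subprocess.Popen(["echo", "one line"], stdout=subprocess.PIPE)
fd = proc.stdout.fileno()

print(os.get_blocking(fd))      # Popen pipes start out blocking: True

# the switch is there if it is ever wanted, on the fd under the file object
os.set_blocking(fd, False)
os.set_blocking(fd, True)

for line in proc.stdout:        # blocking reads, line by line, until EOF
    print(line.decode().rstrip())
proc.wait()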
i'll try that. i'll probably make a function to do the wait and hand back just one result at a time. when there is more than one event (more than one pipe is ready), the order doesn't really matter; the parallelism is just for performance, not to reveal which source is fastest. a previous version of this did the parallel part, but the pipes were read in the order they appeared in the original list (so the time to see the first output would usually be longer).
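one possible shape for that function, sketched with selectors; next_ready() blocks until something is ready, unregisters one pipe per call and hands it back, and keeps any extra events from the same select() for the following calls (all of these names are made up):

import selectors

class ReadyPicker:
    """hand out one ready pipe at a time, in whatever order select() reports them."""

    def __init__(self):
        self.sel = selectors.DefaultSelector()
        self.pending = []                      # events already reported but not yet handed out

    def add(self, fileobj, data=None):
        self.sel.register(fileobj, selectors.EVENT_READ, data)

    def next_ready(self):
        # returns one SelectorKey whose pipe is ready, or None when nothing is left
        if not self.pending:
            if not len(self.sel.get_map()):
                return None
            self.pending = self.sel.select()   # blocks until at least one pipe is ready
        key, _mask = self.pending.pop(0)
        self.sel.unregister(key.fileobj)
        return key

the caller would add() each child's stdout once, then loop on next_ready(), reading each returned key.fileobj to EOF before asking for the next one.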